NZ722231B2 - Interrogatory cell-based assays for identifying drug-induced toxicity markers - Google Patents

Interrogatory cell-based assays for identifying drug-induced toxicity markers Download PDF

Info

Publication number
NZ722231B2
NZ722231B2 NZ722231A NZ72223112A NZ722231B2 NZ 722231 B2 NZ722231 B2 NZ 722231B2 NZ 722231 A NZ722231 A NZ 722231A NZ 72223112 A NZ72223112 A NZ 72223112A NZ 722231 B2 NZ722231 B2 NZ 722231B2
Authority
NZ
New Zealand
Prior art keywords
drug
cell
cells
protein
induced
Prior art date
Application number
NZ722231A
Other versions
NZ722231A (en
Inventor
Niven Rajin Narain
Rangaprasad Sarangarajan
Vivek K Vishnudas
Original Assignee
Berg Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Berg Llc filed Critical Berg Llc
Publication of NZ722231A publication Critical patent/NZ722231A/en
Publication of NZ722231B2 publication Critical patent/NZ722231B2/en

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P39/00General protective or antinoxious agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P39/00General protective or antinoxious agents
    • A61P39/02Antidotes
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • A61P9/04Inotropic agents, i.e. stimulants of cardiac contraction; Drugs for heart failure
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • A61P9/06Antiarrhythmics
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/142Toxicological screening, e.g. expression profiles which identify toxicity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6842Proteomic analysis of subsets of protein mixtures with reduced complexity, e.g. membrane proteins, phosphoproteins, organelle proteins
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks

Abstract

The instant application provides methods for identifying modulators of drug-induced toxicity by identifying, from a consensus causal relationship network generated by an artificial intelligence (AI) -based informatics platform, a causal relationship unique in drug-induced toxicity, wherein a gene, lipid, protein, metabolite, transcript, or SNP associated with the unique causal relationship is identified as a modulator of drug-induced toxicity. ipid, protein, metabolite, transcript, or SNP associated with the unique causal relationship is identified as a modulator of drug-induced toxicity.

Description

INTERROGATORY CELL-BASED ASSAYS FOR IDENTIFYING DRUGINDUCED TOXICITY MARKERS Cross-Reference to Related Applications This ation claims priority to U.S. Provisional Application Serial No., 61/650462 filed May 22, 2012, the entire content of which is incorporated herein.
The present ation is a divisional ation of New Zealand Patent Application No. 701908 (), the entire contents of which is hereby incorporated by reference.
Background of the Invention 11540992 Cardiotoxicity refers to a broad range of adverse s on heart function induced by therapeutic molecules. Cardiotoxicity may emerge early in pre-clinical studies or become apparent later in the clinical setting. It is a leading cause of drug withdrawal, accounting for over 45% of all drugs withdrawn since 1994, which results in significant financial burden for drug development. Cardiovascular toxicity includes sed QT duration, arrhythmias, myocardial ischemia, hypertension and thromboembolic complications, and myocardial dysfunction.
Cardiac safety biomarkers currently used by the FDA are QTc prolongation — lectrophysiological arrhythmias, circulating troponin c, heart rate, blood re, lipids, troponin, C—reactive protein (CRP), brain ot B-type natriuretic peptide (BNP), ex vivo platelet aggregation, and imaging kers (cardiac magnetic resonance imaging).
The QTc prolongation is a very robust but complex marker. However, a decision on whether to kill or sustain a drug in early development is hard to make based on QTc alone. In on, QTc is subjective and is ent upon underlying pathologies that can lead to tachyarrythmias.
In view of the foregoing, it is t that new cardiac safety biomarkers, such as lar cardiac safety biomarkers, are needed in the art.
Summary ofthe Invention The platform logy described herein is useful for identifying markers associated with drug-induced toxicity. This rm technology integrates molecular interactions within and across a chy of models starting from primary human cell based model to human clinical samples. This approach leads to the identification of biomarkers that reflect an underlying toxicity caused by a compound or NME that is a potential drug, such as a drug candidate ready to enter phase I clinical trials. Drug induced toxicities can include cardiac, renal, hepatic and other tissue toxicity. The t application provides several novel biomarkers associated with drug-induced toxicity, and which are useful in methods for predicting potential toxicity of a molecule or drug candidate, and as potential therapeutic targets for treating, preventing or counteracting drug-induced toxicity.
The invention described herein is based, at least in part, on a novel, collaborative utilization of network biology, genomic, proteomic, metabolomic, transcriptomic, and bioinformatics tools and methodologies, which, when combined, may be used to study any biological system of interest, such as obtaining insight into the molecular mechanisms associated with or causal for drug-induced toxicity. The platform technology is further described in international PCT Application PCT/U82012/027615, the entire contents of which are hereby expressly incorporated herein. Additional embodiments of the platform technology, including a description of how to carry out platform technology methods ing incorporation of enzyme (e.g., kinase) activity data, are described in US. ation Serial No. 13/607,587, filed on September 7, 2012, the entire contents of which are expressly orated herein by reference. In a first step, cellular modeling systems are ped to probe a drug-induced toxicities, such as cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity or myotoxicity .
A cellular system ng drug-induced ty can se ty- related cells subjected to various -relevant environment stimuli (e.g., lycemia, hypoxia, immuno-stress, and lipid peroxidation, or re to a test molecule or drug candidate). In some embodiments, the cellular modeling system involves cellular cross- talk mechanisms between various interacting cell types related to ic drug-induced toxicity, such as cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, al cells, renal cells, or myoblasts. High throughput biological readouts from the cell model system are obtained by using a combination of techniques, including, for example, cutting edge mass spectrometry (LC/MSMS), flow cytometry, ased assays, and functional assays. The high throughput biological readouts are then ted to a bioinformatic analysis to study congruent data trends by in vitro, in vivo, and in silico modeling. The resulting matrices allow for cross-related data mining where linear and non-linear regression analysis were developed to reach conclusive pressure points (or “hubs”). These “hubs”, as presented herein, are candidates for drug discovery.
In ular, these hubs represent potential drug targets for reducing or alleviating drug- induced toxicity and/or drug-induced toxicity markers.
The molecular signatures of the differentials allow for insight into the isms that dictate the alterations in the tissue microenvironment that lead to drug- induced toxicity. Taken together, the combination of the aforementioned technology platform with strategic cellular modeling allows for robust intelligence that can be employed to further establish an understanding of the underlying mechanisms and molecular drivers contributing to drug-induced toxicity, e. g., cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renal toxicity or myotoxicity while creating biomarker libraries that may allow early identification of drug ates at risk for causing drug-induced toxic effects, as well as drug targets that may reduce or alleviate nduced toxicity.
A significant feature of the platform of the ion is that the AI-based system is based on the data sets obtained from the drug-induced toxicity cell model system, without resorting to or taking into consideration any existing knowledge in the art, such as known biological relationships (i.e., no data points are artificial), concerning the drug- induced toxicity. ingly, the resulting statistical models generated from the platform are unbiased. Another significant feature of the platform of the ion and its components, e. g., the cell model systems and data sets obtained therefrom, is that it allows for continual building on the nduced toxicity cell models over time (e. g., by the introduction of new cells and/or conditions), such that an initial, “first generation” consensus causal relationship network generated from a cell model for a drug-induced toxicity can evolve along with the evolution of the cell model itself to a multiple generation causal relationship network (and delta or delta-delta networks obtained therefrom). In this way, both the drug-induced toxicity cell , the data sets from the drug-induced ty cell models, and the causal relationship networks generated from the drug-induced toxicity cell models by using the Platform Technology methods can constantly evolve and build upon previous knowledge obtained from the Platform Technology.
The present invention is based, at least in part, on the identification of novel biomarkers that are associated with drug-induced cardiotoxicity. The invention is further based, at least in part, on the discovery that Coenzyme Q10 is capable of ng or preventing drug-induced cardiotoxicity.
Accordingly, the invention provides methods for fying an agent that causes or is at risk for causing cardiotoxicity. In one ment, the agent is a drug or drug candidate. In one ment, the toxicity is drug-induced toxicity, e. g., cardiotoxicity.
In one embodiment, the agent is a drug or drug candidate for treating diabetes, obesity, a cardiovascular disorder, cancer, a neurological er, or an inflammatory disorder.
In these methods, the amount of one or more kers/proteins in a pair of samples (a first sample not subject to the drug treatment, and a second sample subjected to the drug treatment) is assessed. A tion in the level, expression level, or ty of the one or more biomarkers in the second sample as compared to the level of expression of the one or more biomarkers in the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity. In one embodiment, the one or more biomarkers is selected from the markers listed in table 2. The methods of the present invention can be practiced in conjunction with any other method used by the skilled practitioner to fy a drug at risk for g drug-induced cardiotoxocity.
In one embodiment, a drug that may be used in the methods of the invention includes, but is not limited to, Anthracyclines, 5-Fluorouracil, Cisplatin, Trastuzumab, Gemcitabine, Rosiglitazone, Pioglitazone, Troglitazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, and TNF antagonists.
In one aspect, the present invention provides a method for identifying a modulator of drug induced toxicity, said method comprising: (1) establishing a model for drug-induced toxicity using cells associated with duced toxicity to represent a characteristic aspect of drug-induced toxicity; (2) ing a first data set from the model for drug-induced toxicity, wherein the first data set represents one or more of genomics, mics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data characterizing the cells associated with drug-induced toxicity, wherein the drug is Anthracyclines, 5-Fluorouracil, Cisplatin, Trastuzumab, Gemcitabine, itazone, Pioglitazone, Troglitazone, Cabergoline, Pergolide, iptan, Bisphosphonates, or a TNF antagonist; (3) ing a second data set from the model for nduced toxicity, wherein the second data set represents a functional activity or a cellular response of the cells ated with drug-induced toxicity; (4) generating a consensus causal relationship network among the expression levels of the one or more of genomics, lipidomics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data and the functional ty or cellular response based solely on the first data set and the second data set using a programmed computing device, wherein the generation of the consensus causal relationship (14289890_1):GGG network is carried out by an artificial intelligence (AI) -based informatics platform, and wherein the generation of the consensus causal relationship network is not based on any known biological onships other than the first data set and the second data set; and (5) identifying, from the consensus causal relationship network, a causal relationship unique in drug-induced toxicity, wherein a gene, lipid, protein, metabolite, transcript, or SNP associated with the unique causal relationship is identified as a tor of drug-induced toxicity.
Accordingly, in one aspect, the invention provides a method for identifying a drug that causes or is at risk for causing drug-induced cardiotoxicity, comprising: comparing (i) the level of expression of one or more biomarkers present in a first cell sample obtained prior to the treatment with the drug; with (ii) the level of sion of the one or more biomarkers present in a second cell sample obtained following the treatment with the drug; wherein the one or more biomarkers is selected from the markers listed in table 2; n a modulation in the level of expression of the one or more biomarkers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
In one embodiment, the cells are cells of the cardiovascular , e.g., cardiomyocytes. In one embodiment, the cells are diabetic cardiomyocytes. In one embodiment, the drug is a drug or candidate drug for treating diabetes, obesity, vascular disease, cancer, ogical disorder, or inflammatory disorder. In one embodiment, the drug is any one of Anthracyclines, 5-Fluorouracil, tin, Trastuzumab, Gemcitabine, Rosiglitazone, Pioglitazone, tazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, and TNF antagonists.
In one embodiment, a modulation (e.g., an se or a decrease) in the level of expression of one, two, three, four, five, six, seven, eight, nine, ten, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2 in the second (14289890_1):GGG sample as ed to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
In one embodiment, a modulation (e.g., an increase or a decrease) in the level of expression of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, , or en markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), lCl7, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4 in the second sample as compared to the first sample is an tion that the drug causes or is at risk for causing drug-induced cardiotoxicity.
Methods for identifying a rescue agent that can reduce or prevent drug-induced cardiotoxicity are also provided by the invention. In these methods, the amount of one or more biomarkers in three samples (a first sample not subjected to the drug treatment, a second sample subjected to the drug treatment, and a third sample ted both to the drug ent and the agent) is assessed. A normalized level of sion of the one or more biomarkers in the third sample as compared to the first sample, with a change of expression in the second example treated with the drug, is an indication that the rescue agent can reduce or prevent drug-induced toxicity. In one embodiment, the one or more biomarkers is selected from the s listed in table 2.
Using the methods described herein, a variety of molecules, particularly including molecules sufficiently small to be able to cross the cell ne, may be screened in order to identify molecules which te, e.g., increase or decrease the expression and/or activity of a marker of the invention. Compounds so identified can be provided to a subject in order to reduce, alleviate or prevent drug-induced ty in the subject.
Accordingly, in another aspect, the invention provides a method for identifying a rescue agent that can reduce or prevent drug-induced cardiotoxicity comprising: (i) determining a normal level of expression of one or more kers present in a first cell sample obtained prior to the treatment with a cardiotoxicity inducing drug; (ii) determining a treated level of expression of the one or more biomarkers present in a second cell sample obtained following the treatment with the cardiotoxicity inducing drug to identify one or more biomarkers with a change of expression in the treated cell sample; (iii) determining the level of expression of the one or more biomarkers with a changed level of expression in the cardiotoxicity inducing drug treated sample present in a third cell sample obtained following the treatment with the cardiotoxicity inducing drug and the rescue agent; and (iv) comparing the level of expression of the one or more biomarkers determined in the third sample with the level of expression of the one or more biomarkers present in the first sample; wherein the one or more biomarkers is selected from the markers listed in table 2; and wherein a normalized level of expression of the one or more biomarkers in the third sample as compared to the first sample is an tion that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
In one embodiment, the cells are cells of the cardiovascular system, e.g., cardiomyocytes. In one embodiment, the cells are diabetic cardiomyocytes. In one embodiment, the drug is a drug or candidate drug for treating diabetes, obesity,cardiovascular disease, cancer, neurological disorder, or inflammatory disorder.
In one embodiment, the drug is is any one of Anthracyclines, 5-Fluorouracil, Cisplatin, Trastuzumab, Gemcitabine, Rosiglitazone, Pioglitazone, tazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, and TNF antagonists. In one embodiment, about the same level of expression of one, two, three, four, five, six, seven, eight, nine, ,11, 12, 13,14, 15,16,17, l8, 19, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2 in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
In one embodiment, a normalized level of sion of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen,markers ed from a group consisting TIMPl, PTX3, HSP76, PINC, CYB5, PAIl, IBP7 (IGFBP7), lCl7, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4, in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
The invention r provides methods for ating, ng or preventing drug-induced cardiotoxicity in a subject in need thereof, sing administering to a subject (e. g., a mammal, a human, or a non-human ) an agent identified by the screening methods provided herein, thereby ng or preventing drug-induced cardiotoxicity in the subject. In one embodiment, the agent is administered to a subject that has already been treated with a cardiotoxicity-inducing drug. In one embodiment, the agent is administered to a subject at the same time as treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the agent is administered to a subject prior to treatment of the subject with a toxicity-inducing drug.
The invention further es methods for alleviating, reducing or preventing nduced cardiotoxicity in a subject in need thereof, comprising administering Coenzyme Q10 to the subject (e.g., a mammal, a human, or a non-human animal), thereby ng or preventing nduced toxicity in the subject. In one embodiment, the Coenzyme Q10 is administered to a t that has already been treated with a cardiotoxicity-inducing drug. In one embodiment, the Coenzyme Q10 is administered to a subject at the same time as treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the Coenzyme Q10 is administered to a subject prior to treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the drug-induced cardiotoxicity is associated with modulation of expression of one, two, three, four, five, six, seven, eight, nine, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, 1 and 10, 2 and 5, 2 and 10, or 5 and 10 of the foregoing genes (or ns).
In one embodiment, the drug-induced cardiotoxicity is cardiomyopathy, heart failure, atrial fibrillation, cardiomyopathy and heart failure, heart e and LV dysfunction, atrial flutter and lation, or heart valve damage and heart failure.
The invention further provides biomarkers (e.g, genes and/or proteins) that are useful as predictive markers for drug-induced cardiotoxicity. These biomarkers include the markers listed in table 2.
In one embodiment, the drug-induced cardiotoxicity is associated with modulation of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen, markers ed from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), 1C17, EDIL3, HMOXl, NUCBl, C8010, HSPA4.
In one embodiment, the predictive markers for drug-induced cardiotoxicity is a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), 1C17, EDIL3, HMOXl, NUCBl, C8010, HSPA4.
The ordinary skilled artisan would, however, be able to identify additional biomarkers predictive of drug-induced cardiotoxicity by employing the methods described herein, e. g., by carrying out the methods described in Example 3 but by using a different drug known to induce cardiotoxicity. Exemplary drug-induced cardiotoxicity biomarkers of the invention are further described below.
In one aspect, the invention relates to a method for identifying a modulator of adrug-induced toxicity, said method comprising: (1) establishing a model for drug- induced toxicity, using cells associated with drug-induced toxicity, to represents a characteristic aspect of drug-induced toxicity; (2) obtaining a first data set from the model for nduced toxicity, n the first data set represents one or more of genomics, lipidomics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data characterizing the cells associated with drug-induced toxicity; (3) obtaining a second data set from the model for drug-induced toxicity, wherein the second data set represents a onal activity or a cellular response of the cells associated with drug-induced toxicity; (4) generating a consensus causal relationship network among the expression levels of the one or more of cs, lipidomics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data and the functional activity or cellular response based solely on the first data set and the second data set using a programmed computing device, wherein the generation of the sus causal relationship network is not based on any known biological relationships other than the first data set and the second data set; (5) identifying, from the consensus causal relationship k, a causal relationship unique in drug-induced toxicity, wherein a gene, lipid, n, metabolite, transcript, or SNP associated with the unique causal relationship is identified as a modulator of drug-induced toxicity.
In certain ments, the tor stimulates or promotes the drug-induced toxicity.
In certain embodiments, the tor inhibits the drug-induced toxicity.
In certain embodiments, the model of the drug-induced toxicity comprises an in vitro culture of cells associated with the drug-induced toxicity, optionally further comprising a matching in vitro culture of control cells.
In certain embodiments, the in vitro culture of the cells is t to an environmental perturbation, and the in vitro e of the matching control cells is identical cells not subject to the nmental perturbation. 101O In certain embodiments, the environmental perturbation comprises one or more of a contact with an agent, a change in culture condition, an introduced genetic modification / mutation, and a vehicle (e.g., vector) that causes a genetic modification / mutation.
In certain embodiments, the first data set comprises n and/or mRNA expression levels of the plurality of genes.
In certain embodiments, the first data set further comprises two or more of genomics, lipidomics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data. In certain embodiments, the first data set further comprises three or more of genomics, mics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data.
In certain embodiments, the second data set representing the functional activity or cellular response of the cells comprises one or more of bioenergetics, cell proliferation, apoptosis, organellar function, a pe-phenotype ation actualized by functional models selected from ATP, ROS, OXPHOS, and Seahorse assays, global enzyme activity, and an effect of global enzyme activity on the enzyme lic substrates of cells associated with drug-induced toxicity. In one embodiment, the global enzyme activity is global kinase activity. In one embodiment, the effect of global enzyme activity on the enzyme metabolic ates is the phospho proteome of the cell.
In certain embodiments, step (4) is carried out by an artificial intelligence (AI) - based informatics platform.
In certain embodiments, the AI—based informatics platform comprises REFS(TM).
In certain embodiments, the AI—based informatics platform es all data input from the first data set and the second data set without applying a statistical cut-off point.
In certain embodiments, the sus causal relationship network established in step (4) is further refined to a simulation causal relationship network, before step (5), by in silico simulation based on input data, to provide a confidence level of tion for one or more causal relationships within the consensus causal relationship network.
In certain embodiments, the unique causal onship is identified as part of a differential causal relationship network that is uniquely present in cells, and absent in the ng control cells. 1111 In one embodiment, the unique causal relationship identified is a relationship between at least one pair ed from the group consisting of sion of a gene and level of a lipid; expression of a gene and level of a transcript; expression of a gene and level of a metabolite; expression of a first gene and a second gene; expression of a gene and presence of a SNP; expression of a gene and a functional activity; level of a lipid and level of a transcript; level of a lipid and level of a metabolite; level of a first lipid and a second lipid; level of a lipid and presence of a SNP; level of a lipid and a functional activity; level of a first transcript and level of a second transcript; level of a transcript and level of a metabolite; level of a transcript and presence of a SNP; level of a first transcript and a functional activity; level of a first metabolite and level of a second metabolite; level of a lite and presence of a SNP; level of a metabolite and a functional activity; level of a first SNP and ce of a second SNP; and presence of a SNP and a onal activity.
In one embodiment, the functional activity is selected from the group consisting of bioenergetics, cell proliferation, apoptosis, organellar function, kinase activity, protease activity, and a genotype-phenotype association actualized by functional models selected from ATP, ROS, OXPHOS, and Seahorse . In certain embodiments, the method further comprising validating the identified unique causal relationship in a drug- indiced toxicity model.
In one embodiment, the drug-induced ty is drug-induced cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity or myotoxicity.
In one embodiment, the drug-induced cardiotoxicity is cardiomyopathy, heart failure, atrial lation, cardiomyopathy and heart failure, heart failure and LV dysfunction, atrial flutter and fibrillation, or, heart valve damage and heart failure.
In one embodiment, the model for drug-induced toxicity comprises cardiomyocytes, diabetic myocytes, cytes, kidney cells, neuronal cells, renal cells, or myoblasts.
In one embodiment, the model for nduced toxicity comprises a toxicity inducing drug, cancer drug, diabetic drug, neurological drug, or anti-inflammatory drug.
In one embodiment, the drug is Anthracyclines, rouracil, Cisplatin, Trastuzumab, 1212 Gemcitabine, Rosiglitazone, Pioglitazone, Troglitazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, or TNF antagonists.
In one aspect, the invention provides a method for fying a drug that causes or is at risk for causing drug-induced toxicity, comprising: comparing (i) a level of one or more biomarkers present in a first cell sample obtained prior to the treatment with the drug; with (ii) a level of the one or more biomarkers present in a second cell sample obtained following the treatment with the drug; wherein the one or more biomarkers is selected from the modulators identified by the methods bed above; wherein a tion in the level of the one or more biomarkers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug- induced toxicity.
In one aspect, the invention provides a method for identifying a rescue agent that can reduce or prevent drug-induced toxicity comprising: (i) determining a normal level of one or more biomarkers present in a first cell sample obtained prior to the treatment with a toxicity inducing drug; (ii) determining a d level of the one or more biomarkers present in a second cell sample obtained ing the treatment with the toxicity ng drug to identify one or more biomarkers with a change of level in the treated cell sample; (iii) determining the level of the one or more biomarkers with a d level in the toxicity inducing drug treated sample present in a third cell sample obtained following the treatment with the toxicity inducing drug and the rescue agent; and (iv) comparing the level of the one or more biomarkers determined in the third sample with the level of the one or more biomarkers present in the first sample; wherein the one or more kers is selected from the modulators identified by the methods bed above and wherein a normalized level of the one or more biomarkers in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug-induced toxicity.
In another aspect, the invention relates to a method for alleviating, ng or ting drug-induced toxicity, comprising stering to a subject the rescue agent identified by the methods described above, thereby reducing or preventing drug-induced toxicity in the subject. 1313 In another aspect, the invention relates to a method for providing a model for drug-induced toxicity for use in a platform method, comprising: establishing a drug- induced toxicity model, using cells associated with the drug-induced ty, to represent a characteristic aspect of the drug-induced toxicity, wherein the model for the drug-induced toxicity is useful for ting data sets used in the platform method; thereby providing a model for drug-induced toxicity for use in a platform method.
In one embodiment, the model for drug-induced toxicity comprises cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neuronal cells, renal cells, or sts.
In another aspect, the invention relates to a method for obtaining a first data set and second data set from a model for drug-induced toxicity for use in a platform method, comprising: (1) obtaining a first data set from the model for drug-induced toxicity for use in a platform method, wherein the model for the drug-induced toxicity comprises cells associated with the drug-induced toxicity, and n the first data set represents expression levels of a plurality of genes in the cells associated with the drug-induced toxicity; (2) ing a second data set from the model for drug-induced ty for use in the platform method, wherein the second data set represents a functional activity or a cellular response of the cells associated with the drug-induced toxicity; thereby obtaining a first data set and second data set from the model for the drug-induced toxicity for use in a platform method.
In another aspect, the invention s to a method for identifying a modulator of drug-induced toxicity, said method sing: (1) generating a sus causal relationship network among a first data set and second data set obtained from a model for the drug-induced ty, wherein the model comprises cells associated with the drug-induced toxicity, and n the first data set represents expression levels of a plurality of genes in the cells and the second data set represents a functional activity or a cellular response of the cells, using a programmed ing device, wherein the generation of the consensus causal relationship network is not based on any known biological relationships other than the first data set and the second data set; (2) identifying, from the sus causal relationship network, a causal onship unique in the drug-induced toxicity, n a gene associated with the unique causal relationship is identified as a modulator of the drug-induced toxicity; thereby identifying a modulator of drug-induced toxicity. 1414 In r aspect, the invention relates to a method for identifying a modulator of a drug-induced toxicity, said method comprising: 1) providing a consensus causal relationship network generated from a model for the drug-induced ty; 2) identifying, from the consensus causal relationship network, a causal relationship unique in the drug-induced toxicity, wherein a gene associated with the unique causal relationship is identified as a modulator of the drug-induced toxicity; thereby identifying a modulator of a drug-induced toxicity.
In n embodiments of the various s, the consensus causal relationship network is ted among a first data set and second data set obtained from the model for the drug-induced ty, wherein the model comprises cells associated with the drug-induced toxicity, and wherein the first data set represents expression levels of a plurality of genes in the cells and the second data set represents a functional activity or a cellular response of the cells, using a programmed computing device, wherein the generation of the sus causal relationship network is not based on any known biological relationships other than the first data set and the second data set.
In certain embodiments, the “environmental perturbation”, also referred to herein as “external stimulus ent”, is a therapeutic agent. In certain embodiments, the external stimulus component is a small molecule (e. g., a small molecule of no more than kDa, 4 kDa, 3 kDa, 2 kDa, 1 kDa, 500 Dalton, or 250 Dalton). In certain ments, the external stimulus component is a biologic. In certain embodiments, the external stimulus component is a chemical. In certain embodiments, the external stimulus ent is endogenous or exogenous to cells. In certain embodiments, the external stimulus component is a MIM or fter. In certain embodiments, the external stimulus component is a stress factor for the cell system, such as hypoxia, lycemia, hyperlipidemia, hyperinsulinemia, and/or lactic acid rich conditions.
In certain embodiments, the external stimulus component may e a therapeutic agent or a candidate eutic agent for treating a drug-induced toxicity, including chemotherapeutic agent, protein-based biological drugs, antibodies, fusion proteins, small molecule drugs, lipids, polysaccharides, nucleic acids, etc.
In certain embodiments, the external stimulus component may be one or more stress factors, such as those typically encountered in vivo under the s drug-induced 1515 toxicities, including hypoxia, hyperglycemic conditions, acidic environment (that may be ed by lactic acid treatment), etc.
In other embodiments, the external stimulus component may include one or more MIMs and/or fters, as defined herein below. Exemplary MIMs include Coenzyme Q10 (also referred to herein as CoQ10) and nds in the Vitamin B family, or nucleosides, mononucleotides or dinucleotides that comprise a compound in the Vitamin B family.
In making cellular output measurements (such as protein expression), either absolute amount (e. g., expression amount) or relative level (e.g., relative expression level) may be used. In one embodiment, absolute amounts (e.g., sion amounts) are used. In one embodiment, relative levels or amounts (e. g., relative expression levels) are used. For example, to determine the relative protein expression level of a cell system, the amount of any given n in the cell system, with or without the external stimulus to the cell system, may be compared to a suitable control cell line or mixture of cell lines (such as all cells used in the same ment) and given a fold-increase or fold-decrease value. The skilled person will appreciate that absolute amounts or relative amounts can be employed in any cellular output measurement, such as gene and/or RNA transcription level, level of lipid, level of metabolite, or any functional output, e. g., level of apoptosis, level of toxicity, level of enzyme (e.g., kinase) activity, or ECAR or OCR as described . A termined threshold level for a fold-increase (e.g., at least 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.5, 3, 3.5, 4, 4.5, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, , 40, 45, 50, 75 or 100 or more fold increase) or fold-decrease (e.g., at least a decrease to 0.9, 0.8, 0.75, 0.7, 0.6, 0.5, 0.45, 0.4, 0.35, 0.3, 0.25, 0.2, 0.15, 0.1 or 0.05 fold, or a decrease to 90%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, %, 15%, 10% or 5% or less) may be used to select significant differentials, and the cellular output data for the significant differentials may then be included in the data sets (e. g., first and second data sets) utilized in the rm technology methods of the invention. All values presented in the foregoing list can also be the upper or lower limit of , e.g., between 1.5 and 5 fold, 5 and 10 fold, 2 and 5 fold, or between 0.9 and 0.7, 0.9 and 0.5, or 0.7 and 0.3 fold, are intended to be a part of this invention.
Throughout the present application, all values presented in a list, e.g., such as those above, can also be the upper or lower limit of ranges that are intended to be a part of this invention. 1616 In one embodiment of the methods of the invention, not every observed causal relationship in a causal relationship network may be of biological significance. With respect to any given drug-induced toxicity for which the subject interrogative biological assessment is applied, some (or maybe all) of the causal onships (and the genes associated therewith) may be minative” with respect to the specific biological problem at issue, e.g., either responsible for causing a nduced toxicity (a potential target for therapeutic intervention) or is a biomarker for the drug-induced toxicity (a potential diagnostic or prognostic factor). In one embodiment, an observed causal relationship unique in the drug-induced ty is determinative with respect to the specific biological problem at issue. In one embodiment, not every observed causal relationship unique in the drug-induced toxicity is determinative with respect to the specific problem at issue.
Such determinative causal relationships may be selected by an end user of the subject method, or it may be selected by a bioinformatics software program, such as REFS, DAVID-enabled comparative pathway analysis program, or the KEGG pathway is program. In certain embodiments, more than one bioinformatics software program is used, and consensus s from two or more bioinformatics software programs are red.
As used herein, “differentials” of cellular outputs include differences (e. g., increased or decreased levels) in any one or more parameters of the cellular outputs. In certain embodiments, the differentials are each independently selected from the group consisting of differentials in mRNA transcription, protein expression, lipid expression, protein activity, kinase activity, metabolite / intermediate level, and/or ligand-target interaction. For example, in terms of n expression level, differentials between two ar outputs, such as the outputs associated with a cell system before and after the ent by an external stimulus ent, can be measured and quantitated by using cognized technologies, such as pectrometry based assays (e. g., iTRAQ, 2D- LC-MSMS, etc.) In one aspect, the cell model for a drug-induced toxicity ses a cellular cross-talking system, wherein a first cell system having a first cellular environment with an external us component generates a first modified cellular environment; such that a cross-talking cell system is established by exposing a second cell system having a second ar environment to the first modified cellular environment. 1717 In one embodiment, at least one significant cellular cross-talking differential from the cross-talking cell system is generated; and at least one determinative cellular cross-talking differential is identified such that an ogative biological assessment occurs. In certain embodiments, the at least one significant cellular cross-talking differential is a plurality of differentials.
In n embodiments, the at least one determinative cellular cross-talking differential is selected by the end user. Alternatively, in another embodiment, the at least one determinative cellular cross-talking differential is selected by a bioinformatics software program (such as, e.g., REFS, KEGG pathway analysis or DAVID-enabled comparative pathway analysis) based on the quantitative proteomics data.
In certain embodiments, the method further comprises generating a icant cellular output ential for the first cell .
In certain ments, the differentials are each independently selected from the group consisting of entials in mRNA transcription, protein expression, lipid expression, protein activity, metabolite / intermediate level, and/or ligand-target interaction.
In certain embodiments, the first cell system and the second cell system are independently selected from: a homogeneous population of primary cells, a drug- induced toxicity related cell line, or a normal cell line.
In certain embodiments, the first modified cellular environment comprises factors ed by the first cell system into the first cellular environment, as a result of contacting the first cell system with the external stimulus component. The factors may comprise secreted proteins or other signaling molecules. In certain embodiments, the first modified cellular environment is substantially free of the original external us component.
In certain embodiments, the cross-talking cell system comprises a ell having an insert compartment and a well compartment separated by a membrane. For example, the first cell system may grow in the insert compartment (or the well compartment), and the second cell system may grow in the well compartment (or the insert compartment).
In certain embodiments, the talking cell system comprises a first culture for growing the first cell , and a second culture for g the second cell system. 1818 In this case, the first modified cellular environment may be a conditioned medium from the first cell system.
In certain embodiments, the first cellular environment and the second cellular environment can be identical. In certain embodiments, the first cellular environment and the second cellular environment can be different.
In certain embodiments, the cross-talking cell system comprises a co-culture of the first cell system and the second cell system.
The methods of the invention may be used for, or applied to, any number of “interrogative biological assessments.” Application of the methods of the ion to an interrogative biological ment allows for the identification of one or more modulators of a drug-induced toxicity or determinative cellular process “drivers” of a drug-induced toxicity.
In one embodiment, the interrogative biological assessment is the assessment of the toxicological profile of an agent, e.g., a drug, on a cell, tissue, organ or organism, wherein the identified modulators of drug-induced ty, e. g., determinative cellular process driver (e. g., cellular cross-talk differentials or causal relationships unique in drug-induced toxicity) may be indicators of ty, e.g., cytotoxicity, cardiotoxicity, hepatotoxicity, nephrotoxicity, oxicity, renaltoxicity or myotoxicity, and may in turn be used to predict or identify the toxicological e of the agent. In one embodiment, the identified modulators of a nduced toxicity, e. g., determinative cellular process driver (e.g., ar cross-talk differentials or causal relationships unique in a drug-induced toxicity) is an indicator of cardiotoxicity of a drug or drug candidate, and may in turn be used to predict or identify the cardiotoxicological profile of the drug or drug candidate.
In another , the invention provides a kit for conducting an ogative biological ment using a discovery rm Technology, comprising one or more ts for detecting the presence of, and/or for quantitating the amount of, an analyte that is the subject of a causal relationship network generated from the methods of the invention. In one embodiment, said analyte is the subject of a unique causal relationship in the drug-induced toxicity, e. g., a gene associated with a unique causal relationhip in the drug-induced toxicity. In certain embodiments, the analyte is a n, and the reagents comprise an antibody against the protein, a label for the protein, and/or one or 1919 more agents for ing the protein for high throughput analysis (e.g., mass ometry based sequencing).
It should be understood that all ments described herein, including those described only in examples, are parts of the general description of the invention, and can be combined with any other embodiments of the invention unless explicitly disclaimed or inapplicable.
Brief Description of the Drawings Various embodiments of the present disclosure will be described herein below with reference to the figures wherein: Figure 1: ration of approach to identify therapeutics.
Figure 2: ration of systems biology of cancer and consequence of integrated multi- physiological interactive output regulation.
Figure 3: Illustration of systematic interrogation of biological relevance using MIMS.
Figure 4: Illustration of modeling cancer network to enable interrogative ical query.
Figure 5: Illustration of the interrogative biology platform technology.
Figure 6: Illustration of technologies employed in the platform technology.
Figure 7: Schematic representation of the components of the platform including data tion, data integration, and data mining.
Figure 8: Schematic entation of the systematic interrogation using MIMS and collection of response data from the "omics" cascade.
Figure 9: Sketch of the ents employed to build the in vitro models representing normal and diabetic states.
Figure 10: Schematic representation of the informatics platform REFS™ used to generate causal networks of the protein as they relate to disease pathophysiology.
Figure 11: Schematic entation of the ch towards generation of differential network in diabetic versus normal states and diabetic nodes that are restored to normal states by treatment with MIMS.
Figure 12: A representative differential network in diabetic versus normal states. (14307141_1):GGG Figure 13: A schematic representation of a node and associated edges of interest (Nodelin the center). The cellular functionality associated with each edge is represented.
Figure 14: High level flow chart of an exemplary method, in accordance with some ments.
Figure ISA-15D: High level schematic illustration of the components and process for an AI-based informatics system that may be used with exemplary embodiments.
Figure 16: Flow chart of s in AI-based informatics system that may be used with some exemplary embodiments.
Figure 17: Schematically depicts an exemplary computing environment suitable for practicing exemplary embodiments taught herein.
Figure 18: Illustration of the mathematical approach towards generation of delta- delta networks.
Figure 19: A schematic representing mental design and modeling parameters used to study drug induced toxicity in diabetic cardiomyocytes.
Figure 20: Dysregulation of transcriptional network and expression of human mitochondrial energy metabolism genes in diabetic cardiomyocytes by drug treatment (T): rescue molecule (R) normalizes gene sion.
Figure 21: A. Drug treatment (T) induced expression of GPATl and TAZ in mitochondria from cardiomyocytes conditioned in hyerglycemia. In combination with the rescue molecule (T+R) the levels of GPATl and TAZ were normalized. B.
Synthesis of TAG from G3P.
Figure 22: A. Drug treatment (T) decreases mitochondrial OCR (oxygen ption rate) in myocytes conditioned in hyperglycemia. The rescue molecule (T+R) normalizes OCR. B. Drug treatment (T) ses mitochondrial ATP synthesis in cardiomyocytes conditioned in hyperglycemia.
Figure 23: GO tion of proteins down regulated by drug treatment.
Proteins involved in mitochondrial energy metabolism were down regulated with drug treatment.
Figure 24: Illustration of the mathematical approach towards generation of delta ks. Compare unique edges from T versus UT both the models being in diabetic environment. 2121 Figure 25: A schematic representing potential protein hubs and networks that drive pathophysiology of drug induced toxicity.
Figure 26: Schematic representation of the Interrogative biology platform.
Figure 27: Illustration of cellular functional , data ation and mathematical model Building.
Figure 28: Causal molecular interaction network that drives hysiology of drug-induced toxicity.
Figure 29: Causal molecular interaction sub-network of PTX3 as the central hub that drives pathophysiology of drug-induced toxicity.
Figure 30: Mitochondria ATP synthesis capacity of cardiomyocutes in normal glucose and high glucose conditions.
Figure 31: Causal molecular interaction network of ATP drivers.
Figure 32: Causal molecular interaction sub-network of ATP drivers with P4HB as the central hub.
Figure 33: Unique edges of causal lar ction twork of ATP drivers with P4HB as the central hub.
Figure 34: Illustration of functional toxicomics: multi-omics integration.
Attached herewith, as in Appendix A, are the sequences of all biomarkers referenced herein. All of the information associated with the Gene Bank ion numbers listed in Appendix A and through this application are incorporated herein by reference in the verions available on the filing date of this application.
Detailed ption ofthe Invention I. Overview Exemplary embodiments of the present invention incorporate methods that may be performed using an interrogative biology platform (“the Platform”) that is a tool for understanding a wide variety of drug-induced toxicities, such as cardiotoxicity, 2222 hepatotoxicity, toxicity, neurotoxicity, oxicity or myotoxicity, and the key molecular drivers underlying such nduced toxicities, including factors that enable a nduced toxicity. Some exemplary ments include systems that may incorporate at least a portion of, or all of, the Platform. Some exemplary methods may employ at least some of, or all of the Platform. Goals and objectives of some exemplary embodiments involving the platform are generally outlined below for illustrative purposes: i) to create specific molecular signatures as drivers of critical components of the drug-induced toxicity as they relate to overall pathophysiology of the nt cells, tissues, and/or organs; ii) to generate molecular signatures or differential maps pertaining to the drug-induced toxicity, which may help to identify differential molecular signatures that distinguishes one biological state (e.g., a drug-induced toxicity state) versus a different biological stage (e. g., a normal state), and develop understanding of signatures or molecular entities as they arbitrate mechanisms of change between the two biological states (e. g., from normal to drug-induced toxicity state); and, iii) to investigate the role of “hubs” of molecular activity as potential intervention targets for external control of the drug-induced ty (e. g., to use the hub as a potential therapeutic ), or as potential bio-markers for the drug-induced toxicity in question (e. g., drug-induced toxicity specific biomarkers, in prognostic and/or theranostics uses).
Some exemplary methods involving the Drug-induced Toxicity Platform may include one or more of the following features: 1) modeling the drug-induced toxicities (e.g., cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity) and/or ents of the drug-induced ty (e.g., physiology & pathophysiology ated with toxicities) in one or more models, preferably in vitro models, using cells ated with the drug- induced toxicity. For example, the cells may be human derived cells which ly participate in the drug-induced toxicity in question (e.g., heat muscle cells involved in cardiotoxicity). The model may include various cellular cues / conditions / perturbations that are specific to the drug-induced toxicity. Ideally, the model represents various drug- induced toxicity states and flux components, instead of a static assessment of the drug- d toxicity condition. 2323 2) profiling mRNA and/or protein signatures using any art-recognized means. For example, quantitative polymerase chain on (qPCR) & proteomics analysis tools such as Mass Spectrometry (MS). Such mRNA and protein data sets represent biological reaction to nment / bation. Where applicable and possible, lipidomics, metabolomics, and transcriptomics data may also be integrated as supplemental or alternative measures for the drug-induced toxicity in question. SNP analysis is another component that may be used at times in the process. It may be helpful for igating, for example, whether the SNP or a specific mutation has any effect on the drug-induced toxicity. These variables may be used to describe the drug- induced toxicity, either as a static “snapshot,” or as a representation of a dynamic process. 3) assaying for one or more functional activities or cellular responses to cues and bations, including but not limited to bioenergetics, cell proliferation, apoptosis, and organellar function. True pe-phenotype association is actualized by employment of functional models, such as ATP, ROS, OXPHOS, Seahorse , etc. Such functional ties can involve global enzyme activity, such as kinase activity, and/or s of global enzyme activity or the enzyme metabolites or substrates in the cells, e. g., the phosphor proteome of the cells. Such cellular responses represent the reaction of the cells in the drug-induced ty process (or models thereof) in response to the corresponding nduced toxicity state(s) of the mRNA / protein expression, and any other related states in 2) above. 4) integrating functional assay data thus obtained in 3) with proteomics and other data obtained in 2), and determining protein, gene, lipid, enzyme ty and other functional acitivity associations as driven by causality, by employing artificial intelligence based (AI-based) informatics system or platform. Such an AI-based system is based on, and preferably based only on, the data sets obtained in 2) and/or 3), without resorting to existing knowledge concerning the drug-induced toxicity process.
Preferably, no data points are statistically or artificially cut-off. Instead, all obtained data is fed into the AI-system for determining protein, gene, lipid, enzyme activity and other functional acitivity associations. One goal or output of the integration s is one or more differential networks (otherwise may be referred to herein as “delta networks,” or, in some cases, “delta-delta networks” as the case may be) n the different biological states (e.g., drug-induced toxicity vs. normal states). 2424 ) profiling the outputs from the AI-based informatics platform to explore each hub of activity as a potential therapeutic target and/or biomarker. Such profiling can be done ly in silico based on the obtained data sets, Without resorting to any actual wet-lab experiments. 6) validating hub of activity by employing molecular and ar ques. Such nformatic validation of output with wet-lab cell-based experiments may be optional, but they help to create a full-circle of interrogation.
Any or all of the approaches outlined above may be used in any specific application ning any drug-induced toxicity, depending, at least in part, on the nature of the specific application. That is, one or more approaches outlined above may be omitted or modified, and one or more additional approaches may be employed, depending on specific application.
Various schematics illustrating the platform are provided. In particular, an illustration of an exemplary approach to identify therapeutics using the rm is depicted in Figure 1. An illustration of systems biology of cancer and the consequence of integrated multi-physiological interactive output regulation is depicted in Figure 2.
An illustration of a systematic interrogation of ical relevance using MIMS is depicted in Figure 3. An illustration of modeling a cancer network to enable an interrogative biological query is depicted in Figure 4.
Illustrations of the interrogative biology platform and logies employed in the platform are depicted in Figures 5 and 6. A schematic entation of the components of the platform including data collection, data integration, and data mining is depicted in Figure 7. A schematic entation of a systematic interrogation using MIMS and collection of response data from the “omics” cascade is depicted in Figure 8.
Figure 14 is a high level flow chart of an exemplary method 10, in which components of an ary system that may be used to perform the ary method are indicated. Initially, a model (e. g., an in vitro model) is ished for a biological process (e. g., a drug-induced toxicityprocess) and/or components of the biological s (e. g., drug-induced toxicity physiology and pathophysiology) using cells normally associated with the process (step 12). For example, the cells may be human- derived cells that normally participate in the biological process (e. g., drug-induced toxicity). The cell model may include various cellular cues, conditions, and/or perturbations that are specific to the biological s (e. g., drug-induced toxicity). 2525 Ideally, the cell model represents various (drug-induced toxicity) states and flux components of the biological process (e. g., nduced toxicity), instead of a static assessment of the biological process. The comparison cell model may include control cells or normal cells, e. g., cells not exposed to a drug which induces toxicity. Additional description of the cell models appears below in sections III.A and IV.
A first data set is obtained from the cell model for the biological process (e. g. drug-induced toxicity), which includes information enting, by way of example, expression levels of a plurality of genes (e. g., mRNA and/or protein signatures) (step 16) using any known process or system (e.g., quantitative polymerase chain reaction (qPCR) & proteomics analysis tools such as Mass Spectrometry (MS)).
A third data set is obtained from the comparison cell model for the biological process (e. g. drug-induced toxicity) (step 18). The third data set includes information representing, e. g., expression levels of a plurality of genes in the comparison cells from the comparison cell model.
In certain embodiments of the methods of the invention, these first and third data sets are collectively ed to herein as a “first data set” that ents, e.g., expression levels of a plurality of genes in the cells (all cells including ison cells) associated with the biological system (e. g. drug-induced toxicity model).
The first data set and third data set may be obtained from one or more mRNA and/or Protein Signature is System(s). The mRNA and protein data in the first and third data sets may represent biological ons to environment and/or perturbation. Where applicable and possible, lipidomics, metabolomics, and transcriptomics data may also be integrated into the first data set as supplemental or alternative measures for the biological process (e.g. drug-induced toxicity). The SNP analysis is r component that may be used at times in the process. It may be helpful for investigating, for example, whether a single-nucleotide polymorphism (SNP) or a specific on has any effect on the biological process (e.g. drug-induced toxicity). The data variables may be used to be the biological process (e.g. druginduced toxicity) either as a static “snapshot,” or as a representation of a dynamic process. Additional description regarding obtaining information representing expression levels of a plurality of genes in cells appears below in section III.B.
A second data set is ed from the cell model for the biological process (e. g. nduced ty), which includes information representing a functional activity or 2626 se of cells (step 20). rly, a fourth data set is ed from the comparison cell model for the biological process (e.g. drug-induced toxicity), which includes information representing a functional ty or response of the comparison cells (step 22).
In certain embodiments of the methods of the ion, these second and fourth data sets are collectively referred to herein as a “second data set” that represents a functional activity or a ar response of the cells (all cells including comparison cells) associated with the biological system (e. g. drug-induced toxicity).
One or more functional assay systems may be used to obtain information regarding the functional activity or response of cells or of comparison cells. The information regarding functional cellular responses to cues and perturbations may include, but is not limited to, bioenergetics profiling, cell eration, sis, and organellar function. Functional models for processes and ys (e.g., adenosine triphosphate (ATP), reactive oxygen s (ROS), oxidative phosphorylation (OXPHOS), Seahorse assays, etc.,) may be employed to obtain true genotype-phenotype association. Such functional activities can involve global enzyme activity, such as kinase activity, and/or effects of global enzyme activity, or the enzyme metabolites or substrates in the cells, e. g., the phosphor proteome of the cells. The functional activity or cellular responses represent the reaction of the cells in the biological process (or models thereof) in response to the corresponding state(s) of the mRNA / n expression, and any other related applied conditions or perturbations. Additional information regarding obtaining information representing functional activity or se of cells is provided below in section III.B.
The method also includes generating computer-implemented models of the biological processes (e. g. drug-induced toxicity) in the cells and in the control cells.
For example, one or more (e. g., an ensemble of) Bayesian networks of causal relationships between the expression level of the plurality of genes and the functional activity or cellular response may be generated for the cell model (the “generated cell model networks”) from the first data set and the second data set (step 24). The ted cell model networks, individually or collectively, include quantitative probabilistic directional information regarding onships. The generated cell model networks are not based on known biological relationships between gene expression and/or functional activity or cellular response, other than information from the first data 2727 set and second data set. The one or more generated cell model networks may collectively be referred to as a consensus cell model k.
One or more (e.g., an ensemble of) Bayesian networks of causal relationships between the expression level of the plurality of genes and the onal ty or cellular response may be generated for the comparison cell model (the “generated comparison cell model networks”) from the first data set and the second data set (step 26). The generated comparison cell model networks, dually or collectively, include quantitative probabilistic ional information regarding relationships. The generated cell networks are not based on known biological relationships between gene expression and/or functional activity or cellular response, other than the information in the first data set and the second data set. The one or more generated comparison model networks may collectively be refered to as a consensus cell model network.
The generated cell model networks and the generated comparison cell model networks may be created using an cial igence based (AI-based) informatics platform. Further details regarding the creation of the generated cell model networks, the creation of the generated comparison cell model ks and the AI-based informatics system appear below in section III.C and in the description of Figures 2A-3.
It should be noted that many different AI-based platforms or systems may be employed to generate the Bayesian networks of causal relationships including quantitative probabilistic directional information. Although certain es described herein employ one specific commercially available system, i.e., REFSTM (Reverse Engineering/Forward Simulation) from GNS (Cambridge, MA), embodiments are not limited. AI—Based s or Platforms suitable to implement some embodiments employ mathematical algorithms to establish causal relationships among the input variables (e.g., the first and second data sets), based only on the input data without taking into consideration prior existing knowledge about any potential, established, and/or ed biological relationships.
For example, the REFSTM AI-based informatics platform utilizes experimentally d raw (original) or minimally processed input biological data (e. g., genetic, genomic, epigenetic, proteomic, metabolomic, and al data), and rapidly performs trillions of calculations to determine how molecules interact with one another in a complete system. The REFSTM AI-based informatics rm performs a reverse engineering s aimed at creating an in silico computer-implemented cell model 2828 (e. g., generated cell model networks), based on the input data, that quantitatively ents the underlying ical system (e. g. drug-induced toxicity). Further, hypotheses about the underlying biological system can be developed and rapidly simulated based on the computer-implemented cell model, in order to obtain predictions, accompanied by associated confidence levels, regarding the hypotheses.
With this approach, biological systems are represented by quantitative computer- implemented cell models in which “interventions” are simulated to learn ed mechanisms of the biological system (e. g., nduced ty), effective intervention strategies, and/or clinical kers that determine which patients will respond to a given treatment regimen. tional bioinformatics and statistical approaches, as well as approaches based on the ng of known biology, are typically unable to provide these types of insights.
After the generated cell model networks and the generated comparison cell model networks are created, they are compared. One or more causal relationships present in at least some of the generated cell model networks, and absent from, or having at least one significantly different parameter in, the generated comparison cell model networks are identified (step 28). Such a comparison may result in the creation of a differential network. The comparison, identification, and/or differential (delta) k creation may be conducted using a differential network creation module, which is bed in further detail below in section 111D and with respect to the description of Figure 18.
In some embodiments, input data sets are from one cell type and one comparison cell type, which creates an ensemble of cell model networks based on the one cell type and another ensemble of comparison cell model ks based on the one comparison control cell type. A differential may be performed n the ensemble of networks of the one cell type and the le of networks of the comparison cell type(s).
In other embodiments, input data sets are from multiple cell types (e. g., two or more cell types that are normally associated with the particular type of drug-induced toxicity and multiple comparison cell types (e. g., two or more normal cell types, e. g., same cells which are not exposed to the drug). An ensemble of cell model networks may be generated for each cell types and each comparison cell type individually, and/or data from the multiple cell types and the multiple comparison cell types may be combined into respective composite data sets. The composite data sets produce an 2929 ensemble of networks corresponding to the multiple cell types (composite data) and another ensemble of networks corresponding to the multiple comparison cell types (comparison composite data). A ential may be performed on the ensemble of networks for the composite data as compared to the ensemble of networks for the comparison composite data.
In some embodiments, a differential may be performed between two different differential networks. This output may be referred to as a delta-delta network, and is described below with respect to Figure 18.
Quantitative relationship ation may be fied for each relationship in the ted cell model networks (step 30). Similarly, quantitative onship information for each relationship in the generated ison cell model networks may be identified (step 32). The quantitative information regarding the relationship may include a direction indicating causality, a measure of the statistical uncertainty regarding the relationship (e. g., an Area Under the Curve (AUC) statistical measurement), and/or an sion of the quantitative magnitude of the strength of the onship (e.g., a fold). The s relationships in the generated cell model networks may be profiled using the quantitative relationship ation to explore each hub of activity in the networks as a potential therapeutic target and/or ker. Such profiling can be done entirely in silico based on the results from the generated cell model networks, without resorting to any actual wet-lab experiments.
In some embodiments, a hub of activity in the networks may be validated by employing molecular and cellular techniques. Such post-informatic validation of output with wet-lab cell based experiments need not be performed, but it may help to create a full-circle of interrogation.Figure 15 schematically depicts a fied high level representation of the functionality of an exemplary AI-based informatics system (e. g., REFSTM AI-based informatics system) and interactions between the AI-based system and other elements or portions of an interrogative biology platform (“the Platform”). In Figure 15A, various data sets obtained from a model for a ical process (e. g., a drug-induced toxicity model), such as drug dosage, treatment , protein expression, mRNA expression, lipid levels, metabolite levels, kinase activity and any of many other associated functional measures (such as OCR, ECAR) are fed into an AI- based system. As shown in Figure 15B, from the input data sets, the AI-system creates a library of “network fragments” that includes variables (e.g., proteins, , kinases and 3030 metabolites) that drive molecular mechanisms in the biological process (e. g., drug- induced toxicity), in a process referred to as Bayesian nt ation (Figure 1513).
In Figure 15C, the AI-based system selects a subset of the network fragments in the library and constructs an initial trial network from the fragments. The AI-based system also selects a different subset of the network fragments in the library to construct another initial trial network. Eventually an ensemble of initial trial networks are created (e.g., 1000 ks) from different subsets of network fragments in the library. This process may be termed parallel ensemble sampling. Each trial network in the ensemble is evolved or zed by adding, cting and/or substitution additional network fragments from the library. If additional data is obtained, the additional data may be incorporated into the network nts in the library and may be incorporated into the ensemble of trial networks through the evolution of each trial network. After completion of the optimization/evolution process, the ensemble of trial networks may be described as the generated cell model networks.
As shown in Figure 15D, the ensemble of generated cell model ks may be used to simulate the behavior of the biological system (e. g. nduced toxicity). The simulation may be used to predict or of the biological system (e.g. drug-induced toxicity) to changes in conditions, which may be experimentally verified using wet-lab cell-based, or animal-based, experiments. Also, quantitative parameters of relationships in the generated cell model networks may be extracted using the simulation functionality by applying simulated perturbations to each node individually while observing the s on the other nodes in the ted cell model neworks. Further detail is provided below in section III.C.
The automated reverse engineering process of the AI-based informatics system, which is ed in Figures 2A-2D, creates an ensemble of generated cell model networks networks that is an ed and systematic computer-based model of the cells.
The reverse ering determines the probabilistic ional network connections between the molecular measurements in the data, and the phenotypic outcomes of interest. The variation in the molecular measurements enables learning of the probabilistic cause and effect onships between these entities and changes in 3131 endpoints. The machine learning nature of the platform also enables cross training and predictions based on a data set that is constantly evolving.
The network connections between the molecular ements in the data are “probabilistic,” partly because the tion may be based on ations between the ed data sets “learned” by the computer thm. For example, if the sion level of protein X and that of protein Y are positively or negatively correlated, based on statistical analysis of the data set, a causal relationship may be assigned to establish a network connection n proteins X and Y. The reliability of such a putative causal relationship may be further defined by a likelihood of the connection, which can be measured by p-value (e.g., p < 0.1, 0.05, 0.01, etc).
The k connections between the molecular measurements in the data are “directional,” partly because the network connections between the molecular measurements, as determined by the reverse-engineering s, reflects the cause and effect of the relationship between the connected gene / protein, such that raising the expression level of one protein may cause the expression level of the other to rise or fall, depending on whether the connection is stimulatory or inhibitory.
The network connections between the molecular measurements in the data are “quantitative,” partly because the network connections between the molecular measurements, as determined by the process, may be simulated in silico, based on the existing data set and the probabilistic measures associated therewith. For example, in the established network connections n the molecular measurements, it may be possible to theoretically increase or decrease (e.g., by l, 2, 3, 5, 10, 20, 30, 50,100-fold or more) the expression level of a given protein (or a “node” in the network), and quantitatively simulate its s on other connected proteins in the k.
The network connections between the lar measurements in the data are sed,” at least partly because no data points are statistically or artificially cut-off, and partly because the network connections are based on input data alone, without referring to pre-existing knowledge about the biological process in question.
The network connections between the molecular measurements in the data are “systemic” and (unbiased), partly because all potential connections among all input les have been systemically explored, for example, in a pair-wise fashion. The reliance on computing power to execute such systemic probing exponentially ses as the number of input variables increases. 3232 In general, an ensemble of ~l,000 networks is usually sufficient to predict probabilistic causal quantitative relationships among all of the measured entities. The ensemble of ks captures uncertainty in the data and enables the calculation of confidence metrics for each model prediction. Predictions generated using the ensemble of networks together, where differences in the predictions from individual ks in the ensemble represent the degree of uncertainty in the prediction. This feature enables the assignment of confidence metrics for predictions of clinical response generated from the model.
Once the models are reverse-engineered, further simulation queries may be conducted on the ensemble of models to determine key molecular drivers for the biological s in question, such as a drug-induced ty condition.
Sketch of components employed to build examplary In vitro models enting normal and ic statesis is depicted in Figure 9. Schematic entation of an examplary informatics platform REFSTM used to generate causal networks of the protein as they relate to disease pathophysiology is depicted in Figure 10. Schematic representation of examplary ch towards generation of differential k in diabetic versus normal states and diabetic nodes that are restored to normal states by treatment with MIMS is depicted in Figure 11. A entative differential network in diabetic versus normal states is depicted in Figure 12. A schematic representation of a node and associated edges of interest (Nodel in the center) and the cellular functionality associated with each edge is depicted in Figure 13.
The invention having been generally described above, the sections below provide more detailed description for various aspects or elements of the general invention, in conjunction with one or more specific biological systems (e.g. drug-induced toxicity) that can be analyzed using the methods . It should be noted, r, the specific drug-induced toxicity used for illustration purpose below are not limiting. To the contrary, it is intended that other distinct drug-induced toxicities, including any alternatives, modifications, and equivalents thereof, may be analyzed similarly using the subject Platform technology. 11. Definitions 3333 As used herein, certain terms intended to be ically d, but are not already defined in other sections of the specification, are d herein.
The articles “a” and “an” are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of e, “an element” means one element or more than one element.
The term “including” is used herein to mean, and is used interchangeably with, the phrase “including but not limited to.” The term “or” is used herein to mean, and is used interchangeably with, the term “and/or,” unless context y indicates otherwise.
The term “such as” is used herein to mean, and is used interchangeably, with the phrase “such as but not limited to.” “Metabolic pathway” refers to a sequence of enzyme-mediated reactions that transform one compound to another and provide intermediates and energy for cellular functions. The metabolic pathway can be linear or cyclic or branched.
“Metabolic state” refers to the lar content of a particular cellular, multicellular or tissue environment at a given point in time as measured by various chemical and biological indicators as they relate to a state of health or disease.
The term “microarray” refers to an array of distinct polynucleotides, oligonucleotides, polypeptides (e. g., antibodies) or peptides synthesized on a substrate, such as paper, nylon or other type of membrane, filter, chip, glass slide, or any other suitable solid support.
The terms “disorders” and “diseases” are used ively and refer to any deviation from the normal structure or function of any part, organ or system of the body (or any combination f). A specific disease is manifested by characteristic symptoms and signs, including biological, chemical and physical s, and is often associated with a variety of other factors including, but not limited to, demographic, environmental, ment, genetic and medically historical factors. n characteristic signs, symptoms, and related s can be quantitated through a variety of methods to yield important diagnostic information.
The term “drug-induced toxicity” includes but is not limited to cardiotoxicity, hepatotoxicity, hephrotoxicity, neurotoxicity, renaltoxicity or myotoxicity. 3434 The term “cardiotoxicity” refers to a broad range of adverse effects on heart function induced by therapeutic molecules. It may emerge early in inical studies or become apparent later in the clinical g. Cardiovascular toxicity described herein es, but is not limited to, any one or more of increased QT duration, arrhythmias, myocardial ischemia, hypertension and thromboembolic complications, myocardial dysfunction, cardiomyopathy, heart failure, atrial fibrillation, cardiomyopathy and heart failure, heart failure and LV dysfunction, atrial flutter and fibrillation, and, heart valve damage and heart failure.
The term “expression” includes the process by which a polypeptide is ed from polynucleotides, such as DNA. The process may involves the transcription of a gene into mRNA and the translation of this mRNA into a polypeptide. Depending on the context in which it is used, “expression” may refer to the tion of RNA, protein or both.
The terms “level of expression of a gene” or “gene expression level” refer to the level of mRNA, as well as pre-mRNA nascent transcript(s), transcript processing intermediates, mature mRNA(s) and degradation products, or the level of protein, encoded by the gene in the cell.
The term “modulation” refers to upregulation (i.e., activation or ation), downregulation (i.e., inhibition or suppression) of a response, or the two in combination or apart. A “modulator” is a compound or molecule that modulates, and may be, e. g., an agonist, antagonist, tor, stimulator, suppressor, or inhibitor.
“Normal level” of a protein, a lipid, a ript, a metabolite, or gene expression refers to the level of the protein, lipid, transcript, metabolite, or gene expression prior to ting the cells with the drug with the potentially toxic drug. A “normal level” can be determined in cells grown under various conditions, e. g., hyperglycemia, hypoxia, if the ty of the drug is to be tested under the same conditions. ated level” refers to a changed value relative to the normal level which is based on ical normal control samples or preferably normal control samples tested in the same experiment. The specific “normal” value will depend, for example, on the type of assay (e. g., ELISA, enzyme activity, immunohistochemistry, PCR), the sample to be tested (e. g., cell type and culture conditions), and other considerations known to 3535 those of skill in the art. Control samples can be used to define cut-offs between normal and abnormal.
A drug is considered to be toxic if treatment of cells with the drug results in a statistically significant change in the level of at least one marker relative to a “normal” or riate l level. It is understood that not all trations of a drug must result in a statistically significant change in the level of the at least one marker. In a preferred ment, a drug is considered to ially have toxicities if a therapeutically relevant concentration of the drug results in a statistically icant change in the level of at least on marker.
A “rescue agent” is considered to be effective in reducing toxicity if the level of the marker is modulated in a statistically significant manner towards the marker level in the “normal cells” when the rescue agent is present at a therapeutically nt concentration. In a preferred embodiment, the rescue agent returns the marker to a level that is not tically different from the level of the marker in the control cells.
The term “control level” refers to an accepted or pre-determined level of a , or preferably the marker level determined in a control sample tested in parallel with the test sample, which is used to compare with the level of a marker in a sample derived from cells not treated with the potentially toxic drug or rescue agent. A “control level” is obtained from cells that are cultured under the same ions, e.g., hypoxia, hyperglycemia, lactic acid, etc.
The term “Trolamine,” as used herein, refers to Trolamine NF, Triethanolamine, TEALAN®, TEAlan 99%, Triethanolamine, 99%, Triethanolamine, NF or Triethanolamine, 99%, NF. These terms may be used interchangeably herein.
The term “genome” refers to the entirety of a biological entity’s (cell, tissue, organ, system, organism) genetic information. It is encoded either in DNA or RNA (in certain viruses, for example). The genome includes both the genes and the non-coding sequences of the DNA.
The term “proteome” refers to the entire set of proteins sed by a genome, a cell, a tissue, or an organism at a given time. More specifically, it may refer to the entire set of expressed proteins in a given type of cells or an organism at a given time under defined conditions. Proteome may include protein variants due to, for example, 3636 alternative splicing of genes and/or post-translational modifications (such as glycosylation or phosphorylation).
The term “transcriptome” refers to the entire set of transcribed RNA molecules, including mRNA, rRNA, tRNA, microRNA and other non-coding RNA produced in one or a population of cells at a given time. The term can be applied to the total set of ripts in a given organism, or to the ic subset of transcripts present in a particular cell type. Unlike the genome, which is y fixed for a given cell line (excluding mutations), the transcriptome can vary with external nmental conditions. Because it includes all mRNA transcripts in the cell, the transcriptome reflects the genes that are being actively expressed at any given time, with the exception of mRNA degradation phenomena such as transcriptional attenuation.
The study of transcriptomics, also referred to as expression profiling, examines the expression level of mRNAs in a given cell population, often using high-throughput techniques based on DNA microarray technology.
The term “metabolome” refers to the complete set of small-molecule metabolites (such as metabolic intermediates, hormones and other signalling les, and secondary metabolites) to be found within a biological sample, such as a single sm, at a given time under a given condition. The metabolome is dynamic, and may change from second to second.
The term “lipidome” refers to the complete set of lipids to be found within a biological sample, such as a single organism, at a given time under a given condition.
The lipidome is dynamic, and may change from second to second.
The term “interactome” refers to the whole set of molecular ctions in a biological system under study (e.g., cells). It can be displayed as a directed graph. lar ctions can occur between molecules belonging to different biochemical families (proteins, nucleic acids, lipids, carbohydrates, etc.) and also within a given family. When spoken in terms of mics, interactome refers to protein-protein interaction network (PPI), or n interaction network (PIN). Another extensively studied type of interactome is the protein-DNA interactome (network formed by transcription factors (and DNA or chromatin regulatory proteins) and their target genes.
The term “cellular output” includes a collection of parameters, preferably measurable parameters, ng to cellullar status, including ut limiting): level of transcription for one or more genes (e.g., measurable by RT-PCR, qPCR, microarray, 3737 etc), level of expression for one or more proteins (e.g., measurable by mass spectrometry or Western blot), absolute activity (e.g., measurable as substrate conversion rates) or relative ty (e.g., measurable as a % value compared to maximum activity) of one or more enzymes or proteins, level of one or more metabolites or intermediates, level of oxidative phosphorylation (e. g., measurable by Oxygen Consumption Rate or OCR), level of glycolysis (e.g., measurable by Extra Cellular Acidification Rate or ECAR), extent of ligand-target binding or interaction, activity of extracellular secreted molecules, etc. The cellular output may include data for a pre- determined number of target genes or proteins, etc., or may include a global assessment for all detectable genes or proteins. For example, mass spectrometry may be used to identify and/or quantitate all detectable proteins expressed in a given sample or cell population, without prior dge as to whether any specific protein may be expressed in the sample or cell population.
As used herein, a “cell system” includes a population of homogeneous or heterogeneous cells. The cells within the system may be growing in vivo, under the natural or physiological environment, or may be g in vitro in, for example, lled tissue e environments. The cells within the system may be vely neous (e.g., no less than 70%, 80%, 90%, 95%, 99%, 99.5%, 99.9% homogeneous), or may contain two or more cell types, such as cell types usually found to grow in close proximity in vivo, or cell types that may interact with one another in vivo through, e. g., paracrine or other long distance inter-cellular ication. The cells within the cell system may be derived from ished cell lines, ing cancer cell lines, immortal cell lines, or normal cell lines, or may be primary cells or cells freshly isolated from live tissues or organs.
Cells in the cell system are typically in contact with a “cellular nment” that may provide nutrients, gases (oxygen or C02, etc), chemicals, or proteinaceous / non- proteinaceous stimulants that may define the conditions that affect cellular behavior.
The cellular environment may be a chemical media with defined chemical ents and/or less well-defined tissue extracts or serum components, and may include a specific pH, C02 content, pressure, and temperature under which the cells grow. Alternatively, the cellular environment may be the l or physiological environment found in vivo for the specific cell system. 3838 In certain embodiments, a cell environment comprises conditions that simulate an aspect of a ical system or process, e.g., simulate a disease state, process, or environment. Such culture conditions include, for example, hyperglycemia, hypoxia, or lactic-rich conditions. Numerous other such conditions are described .
In n embodiments, a cellular environment for a specific cell system also include certain cell surface es of the cell system, such as the types of receptors or ligands on the cell surface and their respective activities, the structure of carbohydrate or lipid molecules, ne polarity or fluidity, status of clustering of certain membrane proteins, etc. These cell surface features may affect the function of nearby cells, such as cells belonging to a ent cell system. In certain other embodiments, however, the cellular environment of a cell system does not include cell surface features of the cell system.
The cellular environment may be altered to become a “modified cellular nment.” Alterations may include changes (e. g. , increase or decrease) in any one or more component found in the cellular environment, including addition of one or more “external stimulus component” to the cellular environment. The environmental perturbation or external stimulus component may be endogenous to the cellular environment (e. g., the cellular environment contains some levels of the stimulant, and more of the same is added to increase its level), or may be exogenous to the cellular environment (e.g., the stimulant is largely absent from the cellular environment prior to the tion). The cellular nment may further be d by secondary changes resulting from adding the al stimulus component, since the external stimulus component may change the ar output of the cell system, including molecules secreted into the cellular environment by the cell system.
As used herein, “external us ent”, also ed to herein as “environmental perturbation”, include any external physical and/or chemical stimulus that may affect cellular function. This may include any large or small organic or nic molecules, natural or synthetic chemicals, temperature shift, pH change, radiation, light (UVA, UVB etc.), microwave, sonic wave, electrical current, modulated or unmodulated magnetic fields, etc.
The term “Multidimensional Intracellular Molecule (MIM)”, is an isolated version or synthetically produced version of an nous molecule that is naturally produced by the body and/or is present in at least one cell of a human. A MIM is 3939 e of entering a cell and the entry into the cell includes complete or partial entry into the cell as long as the biologically active portion of the molecule wholly enters the cell. MIMs are e of inducing a signal uction and/or gene sion mechanism within a cell. MIMs are multidimensional because the molecules have both a therapeutic and a carrier, e. g., drug delivery, effect. MIMs also are multidimensional because the molecules act one way in a disease state and a different way in a normal state. For example, in the case of CoQ-10, administration of CoQ-10 to a melanoma cell in the presence of VEGF leads to a decreased level of Bcl2 which, in turn, leads to a decreased oncogenic potential for the melanoma cell. In contrast, in a normal last, co-administration of CoQ-10 and VEFG has no effect on the levels of Bcl2.
In one embodiment, a MIM is also an epi-shifter In another embodiment, a MIM is not an epi-shifter. In another embodiment, a MIM is characterized by one or more of the foregoing functions. In another embodiment, a MIM is characterized by two or more of the foregoing ons. In a r embodiment, a MIM is characterized by three or more of the foregoing functions. In yet another embodiment, a MIM is characterized by all of the foregoing functions. The skilled artisan will iate that a MIM of the invention is also intended to encompass a mixture of two or more endogenous molecules, wherein the e is characterized by one or more of the foregoing functions. The endogenous molecules in the mixture are present at a ratio such that the mixture functions as a MIM.
MIMs can be lipid based or non-lipid based molecules. Examples of MIMs include, but are not d to, CoQ10, acetyl Co-A, yl Co-A, L-carnitine, amino acids such as, for example, tyrosine, alanine, and cysteine. In one embodiment, the MIM is a small molecule. In one embodiment of the invention, the MIM is not CoQ10. MIMs can be routinely identified by one of skill in the art using any of the assays described in detail herein. MIMs are described in further detail in US 12/777,902 (US 2011-0110914), the entire contents of which are expressly incorporated herein by reference.
As used herein, an tabolic shifter” (epi-shifter) is a molecule that modulates the metabolic shift from a healthy (or normal) state to a disease state and vice versa, thereby maintaining or reestablishing cellular, tissue, organ, system and/or host health in a human. Epi-shifters are capable of effectuating normalization in a tissue microenvironment. For example, an epi-shifter includes any molecule which is capable, 404O when added to or depleted from a cell, of affecting the microenvironment (e.g., the metabolic state) of a cell. The d artisan will appreciate that an epi-shifter of the invention is also intended to encompass a mixture of two or more molecules, wherein the e is characterized by one or more of the foregoing functions. The molecules in the mixture are present at a ratio such that the mixture ons as an epi-shifter.
Examples of epi-shifters include, but are not limited to, CoQ-lO; vitamin D3; ECM components such as fibronectin; immunomodulators, such as TNFa or any of the eukins, e. g., IL-5, IL-12, IL-23; angiogenic s; and apoptotic factors.
In one embodiment, the epi-shifter also is a MIM. In one embodiment, the epi- shifter is not CleO. Epi-shifters can be routinely identified by one of skill in the art using any of the assays described in detail herein. Epi-shifters are described in further detail in US 12/777,902 (US 2011-0110914), the entire contents of which are expressly orated herein by reference.
Other terms not explicitly d in the instant application have meaning as would have been understood by one of ordinary skill in the art. 111. Exemplary Steps and Components of the Platform Technology For illustration purpose only, the following steps of the subject Platform Technology may be described herein below as an ary utility for integrating data obtained from a custom built drug-induced toxicity model, and for identifying novel proteins / pathways driving the pathogenesis of drug-induced toxicity. Relational maps resulting from this analysis provides drug-induced toxicity treatment targets, as well as stic / prognostic markers associated with drug-induced toxicity. However, the subject Platform Technology has general applicability for any drug-induced toxicity, and is not limited to any particular drug-induced toxicityor other specific drug-induced toxicity models.
In addition, although the description below is presented in some portions as discrete steps, it is for ration purpose and simplicity, and thus, in reality, it does not imply such a rigid order and/or demarcation of steps. Moreover, the steps of the ion may be performed separately, and the invention provided herein is intended to encompass each of the individual steps separately, as well as combinations of one or 4141 more (e. g., any one, two, three, four, five, six or all seven steps) steps of the subject Platform Technology, which may be carried out independently of the remaining steps.
The invention also is intended to include all aspects of the Drug-induced Toxicity rm Technology as separate components and embodiments of the invention. For example, the generated data sets are intended to be embodiments of the invention. As further examples, the generated causal relationship ks, generated consensus causal relationship networks, and/or generated simulated causal onship networks, are also intended to be embodiments of the invention. The causal relationships identified as being unique in the drug-induced toxicity system are intended to be ments of the ion. Further, the custom built models for a particular drug-induced toxicity system are also intended to be embodiments of the invention. For example, custom built models for a drug-induced ty state or process, such as, e. g., a custom built model for toxicity (e. g., cardiotoxicity) of a drug, are also intended to be embodiments of the invention.
A. Custom Model Building The first step in the Platform Technology is the establishment of a model for a nduced toxicity system or process. An example of a drug-induced toxicity system or process is cardiotoxicity. As any other complicated biological process or system, cardiotoxicity is a complicated pathological condition terized by multiple unique aspects. For example, chronic imbalance in uptake, utilization, llar biogenesis and secretion in non-adipose tissue (heart and liver) is thought to be at the center of mitochondrial damage and dysfunction and a key player in drug induced cardiotoxicity.
To this end, a custom cardiotoxicity model comprising diabetic and normal cardiomyocytes may be established to simulate the environment of cardiotoxicity, e.g., by creating cell culture ions closely approximating the conditions of a cadiac cell experiencing cardiotoxicity. One or more relevant types of cells may be used in the model, such as, for e, cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neural cells, renal cells, or myoblasts.
One such “environment”, or growth stress condition, is hypoxia, a ion typically found in a number of disease states and in late stage diabetes or in cardiovascular disease due to ischemia and poor circulation. Hypoxia can be induced in 4242 cells in cells using art-recognized s. For example, hypoxia can be induced by placing cell systems in a Modular Incubator Chamber (MIC-101, Billups-Rothenberg Inc. Del Mar, CA), which can be flooded with an rial gas mix containing 5% C02, 2% Oz and 93% nitrogen. Effects can be measured after a pre-determined period, e. g., at 24 hours after hypoxia treatment, with and without onal external stimulus components (e.g., CleO at 0, 50, or 100 MM).
Likewise, lactic acid treatment of cells mimics a cellular environment where glycolysis activity is high. Lactic acid induced stress can be investigated at a final lactic acid concentration of about 12.5 mM at a pre-determined time, e. g., at 24 hours, with or without additional external stimulus components (e.g., CleO at 0, 50, or 100 MM).
Hyperglycemia is normally a condition found in diabetes. As high glucose is known to alter cellular metabolism, agents for the treatment of diabetes can be tested in cells cultured under hyperglycemic conditions. Exposing subject cells to a typical hyperglycemic condition may include adding 10% culture grade glucose to suitable media, such that the final concentration of glucose in the media is about 22 mM.
However, as subjects with type 2 diabetes, are frequently ight or obese, they are ntly treated for other diseases or conditions with other agents, e. g., arthritis with anti-inflammatory agents, cardiovascular disease with cholesterol lowering, blood pressure lowering, or blood thinning . Thus, custom built models can be used to assess drug toxicity in normal ts as compared to subjects to be treated for a first condition with a first agent that also have other diseases or conditions. For example, cells not exposed or exposed to hyperglycemic ions can be tested together to detect differential toxicities of agents in subjects with or without diabetes.
Hyperlipidemia is a condition found, for example, in obesity and cardiovascular disease. ipidemia is also a condition which mimics one aspect of cardiotoxicity.
The hyperlipidemic conditions can be provided by culturing cells in media ning 0.15 mM sodium palmitate.
Individual ions reflecting different aspects of toxicity may be investigated tely in the custom built toxicity model, and/or may be ed together. In one embodiment, combinations of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50 or more conditions ing or simulating different aspects of toxicity conditions are investigated in the custom built toxicity model. In one embodiment, individual conditions and, in addition, combinations of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 4343 , 40, 50 or more of the conditions reflecting or simulating different aspects of toxicity conditions are investigated in the custom built toxicity model. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, l and 10, l and 20, l and 30, 2 and 5, 2 and , 5 and 10, l and 20, 5 and 20, 10 and 20, 10 and 25, 10 and 30 or 10 and 50 different ions.
Listed herein below are a few exemplary combinations of ions that can be used to treat cells for building drug-induced toxicity models. Other combinations can be readily formulated depending on the specific interrogative biological assessment that is being conducted. 1. Media only 2 50 MM CTL Coenzyme Q10 (CleO) 3 100 MM CTL Coenzyme Q10 4 12.5 mM Lactic Acid . 125 mM Lactic Acid + 50 MM CTL Coenzyme Q10 6 12.5 mM Lactic Acid + 100 MM CTL Coenzyme Q10 7 Hypoxia 8 Hypoxia + 50 MM CTL Coenzyme Q10 9 a + 100 MM CTL Coenzyme Q10 . Hypoxia + 12.5 mM Lactic Acid ll. Hypoxia + 12.5 mM Lactic Acid + 50 MM CTL Coenzyme Q10 l2. Hypoxia + 12.5 mM Lactic Acid + 100 MM CTL Coenzyme Q10 l3. Media + 22 mM Glucose l4. 50 MM CTL Coenzyme Q10 + 22 mM Glucose l5. 100 MM CTL Coenzyme Q10 + 22 mM Glucose l6. 12.5 mM Lactic Acid + 22 mM e 17. 12.5 mM Lactic Acid + 22 mM Glucose + 50 MM CTL Coenzyme Q10 18. 12.5 mM Lactic Acid + 22 mM e +100 MM CTL Coenzyme Q10 19. Hypoxia + 22 mM Glucose . Hypoxia + 22 mM Glucose + 50 MM CTL me Q10 21. Hypoxia + 22 mM Glucose + 100 MM CTL Coenzyme Q10 22. Hypoxia +l2.5 mM Lactic Acid + 22 mM Glucose 4444 23. Hypoxia +12.5 mM Lactic Acid + 22 mM Glucose + 50 11M CTL Coenzyme Q10 24. Hypoxia + 12.5 mM Lactic Acid + 22 mM Glucose +100 11M CTL Coenzyme Q10 As a control one or more cell lines (e.g.,cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neural cells, renal cells, or myoblasts) are cultured under control conditions in order to identify toxicity unique proteins or pathways (see below). The control may be the comparison cell model described above.
Multiple cells of the same or different origin (for example, cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neural cells, renal cells, or myoblasts), as opposed to a single cell type, may be included in the toxicity model. In certain situations, cross talk or ECS experiments between different cells (cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neuro cells, renal cells, or myoblasts ) may be conducted for several inter-related purposes.
In some embodiments that e cross talk, experiments conducted on the cell models are ed to determine modulation of cellular state or function of one cell system or population (e.g.,cardiomyocytes) by r cell system or population (e.g., diabetic cardiomyocytes) under defined treatment conditions (e. g., hyperglycemia, a (ischemia)). According to a typical setting, a first cell system / population is contacted by an external stimulus components, such as a candidate le (e. g., a small drug molecule, a protein) or a candidate condition (e. g., hypoxia, high glucose environment). In se, the first cell system / population changes its transcriptome, proteome, metabolome, and/or interactome, leading to changes that can be y detected both inside and e the cell. For example, changes in transcriptome can be measured by the transcription level of a plurality of target mRNAs; changes in proteome can be measured by the expression level of a plurality of target proteins; and changes in metabolome can be measured by the level of a ity of target metabolites by assays designed specifically for given metabolites. atively, the above referenced changes in lome and/or proteome, at least with respect to certain secreted lites or proteins, can also be measured by their s on the second cell system / population, including the modulation of the transcriptome, proteome, metabolome, and interactome of the second cell system / population. Therefore, the experiments can be used to 4545 identify the effects of the molecule(s) of interest secreted by the first cell system / population on a second cell system / population under different treatment conditions.
The experiments can also be used to identify any proteins that are modulated as a result of signaling from the first cell system (in se to the external stimulus component treatment) to another cell system, by, for example, differential screening of proteomics.
The same experimental setting can also be adapted for a reverse setting, such that ocal effects between the two cell s can also be assessed. In general, for this type of experiment, the choice of cell line pairs is largely based on the s such as origin, toxicity state and cellular function.
Although two-cell systems are typically involved in this type of experimental setting, similar ments can also be designed for more than two cell systems by, for example, immobilizing each distinct cell system on a separate solid support.
Once the custom model is built, one or more rbations” may be applied to the system, such as genetic variation from patient to patient, or with / without treatment by certain drugs or ugs. See Figure 15D. The effects of such perturbations to the , including the effect on cells d to drug-induced toxicity, and normal control cells, can be measured using various art-recognized or proprietary means, as described in section III.B below.
In an exemplary experiment, myocytes are conditioned in hyperglycemia and hyperlipidemia conditions, and in addition with or without an environmental perturbation, specifically treatment by a diabetic drug known for inducing cardiotoxicity and/or a potential rescue agent CoenzymteO.
The custom built cell model may be established and used throughout the steps of the Platform Technology of the invention to ultimately identify a causal relationship unique in the drug-induced toxicity system, by carrying out the steps described herein.
It will be understood by the skilled artisan, however, that a custom built cell model that is used to generate an initial, “first generation” consensus causal relationship network for a drug-induced toxicity can continually evolve or expand over time, e. g., by the introduction of additional drug-induced toxicity related cell lines and/or additional drug- induced toxicity related ions. onal data from the d cell model, i.e., data from the newly added portion(s) of the cell model, can be collected. The new data collected from an expanded or evolved cell model, i.e., from newly added portion(s) of the cell model, can then be introduced to the data sets previously used to generate the 4646 “first generation” consensus causal relationship network in order to generate a more robust “second generation” consensus causal relationship network. New causal relationships unique to the drug-induced toxicity can then be identified from the d generation” consensus causal onship network. In this way, the evolution of the cell model provides an evolution of the consensus causal relationship networks, y providing new and/or more reliable insights into the modulators of the drug-induced toxicity.
Custom models can also be designed to assess toxicity of drugs used in combination. For example, therapeutic agents for the treatment of a number of conditions including cancer, auto-immune disease, or HIV are typically administered as cocktails of combinations of agents. Further, many subjects have multiple, unrelated conditions to be d simultaneously (e. g., diabetes, arthritis, cardiovascular disease).
Models can be built, either in normal cells or in cells subjected to various culture conditions, to identify combinations of agents that may result in toxicities when administered simultaneously. Thus, the methods provided include testing combinations of agents (e.g., 2, 3, 4, 5, 6, 7, 8 or more) together to ine if the combination results in drug related toxicities, including with agents that do not result in ties alone.
Models can also be built for “personalized medicine” applications in which the specific combination of drugs being administered or considered for administration can be tested using the methods provided herein to determine if the combination of drugs are likely to have unacceptable toxicities. Such combinations can be tested in various cell types (e.g., c cells, kidney cells, nerve cells, muscle cells, liver cells; either cell lines or primary cells cultured from the subject) grown under various conditions to mimic the subject of interest (e. g., grown in high glucose for a subject with diabetes or hypoxia for a subject with ischemia).
Additional es of custom built cell models are described in detail herein.
B. Data Collection In general, two types of data may be collected from any custom built model systems. One type of data (e. g., the first set of data, the third set of data) y relates to the level of certain macromolecules, such as DNA, RNA, protein, lipid, etc. An ary data set in this ry is proteomic data (e. g., qualitative and quantitative 4747 data concerning the expression of all or substantially all measurable proteins from a sample). The other type of data is generally onal data (e.g., the second set of data, the fourth set of data) that reflects the phenotypic changes resulting from the changes in the first type of data. Functional activity or cellular response of the cells can include any one or more of bioenergetics, cell proliferation, apoptosis, organellar function, a genotype-phenotype association actualized by functional models selected from ATP, ROS, OXPHOS, and Seahorse assays, global enzyme activity (e.g., global kinase activity), and an effect of global enzyme activity on the enzyme metabolic ates of cells associated with drug-induced toxicity (e. g., phosphoproteomic data).
With respect to the first type of data, in some example embodiments, tative polymerase chain on (qPCR) and proteomics are performed to profile s in cellular mRNA and protein expression by quantitative polymerase chain reaction (qPCR) and proteomics. Total RNA can be ed using a cial RNA isolation kit. Following cDNA synthesis, specific commercially ble qPCR arrays (e.g., those from SA Biosciences) for disease area or cellular processes such as angiogenesis, apoptosis, and diabetes, may be ed to profile a predetermined set of genes by following a manufacturer’s instructions. For example, the Biorad 4 amplification system can be used for all transcriptional profiling experiments. Following data collection (Ct), the final fold change over control can be ined using the 8Ct method as outlined in manufacturer’s protocol. Proteomic sample analysis can be performed as described in subsequent sections.
The subject method may employ large-scale high-throughput quantitative mic analysis of hundreds of samples of similar character, and provides the data necessary for identifying the cellular output entials.
There are numerous art-recognized technologies suitable for this purpose. An exemplary technique, iTRAQ analysis in combination with mass spectrometry, is briefly described below.
The quantitative proteomics approach is based on stable isotope labeling with the 8-plex iTRAQ reagent and 2D-LC MALDI MS/MS for peptide identification and quantification. Quantification with this technique is relative: peptides and proteins are assigned abundance ratios ve to a reference sample. Common reference samples in multiple iTRAQ experiments facilitate the comparison of samples across multiple iTRAQ experiments. 4848 For example, to implement this analysis scheme, six primary samples and two control pool samples can be combined into one 8-plex iTRAQ mix according to the cturer’s suggestions. This mixture of eight samples then can be fractionated by two-dimensional liquid chromatography; strong cation exchange (SCX) in the first dimension, and reversed-phase HPLC in the second dimension, then can be subjected to mass ometric analysis.
A brief overview of exemplary laboratory procedures that can be employed is provided herein.
Protein extraction: Cells can be lysed with 8 M urea lysis buffer with protease inhibitors (Thermo Scientific Halt Protease inhibitor EDTA-free) and incubate on ice for minutes with vertex for 5 seconds every 10 minutes. Lysis can be completed by ultrasonication in 5 seconds pulse. Cell lysates can be centrifuged at 14000 x g for 15 s (4 0C) to remove ar debris. Bradford assay can be performed to determine the protein tration. 100ug protein from each samples can be reduced (10mM Dithiothreitol (DTT), 55 OC, 1 h), ted (25 mM iodoacetamide, room temperature, minutes) and digested with Trypsin (1:25 W/W, 200 mM triethylammonium bicarbonate (TEAB), 37 0C, 16 h).
Secretome sample preparation: 1) In one embodiment, the cells can be cultured in serum free medium: Conditioned media can be concentrated by freeze dryer, reduced (10mM Dithiothreitol (DTT), 55 OC, 1 h), alkylated (25 mM iodoacetamide, at room temperature, te for 30 minutes), and then ed by actone precipitation. Equal amount of proteins from the trated conditioned media can be digested with Trypsin (1:25 W/W, 200 mM triethylammonium bicarbonate (TEAB), 37 0C, 16 h).
In one embodiment, the cells can be cultured in serum containing medium: The volume of the medium can be reduced using 3k MWCO Vivaspin columns (GE Healthcare Life Sciences), then can be reconstituted WithleBS (Invitrogen). Serum albumin can be depleted from all samples using AlbuVoid column ch Support Group, LLC) ing the manufacturer’s instructions with the modifications of buffer- exchange to ze for condition medium application. iTRAQ 8 Flex Labeling: Aliquot from each tryptic digests in each experimental set can be pooled together to create the pooled control sample. Equal aliquots from each sample and the pooled control sample can be labeled by iTRAQ 8 Flex reagents according to the manufacturer’s protocols (AB Sciex). The reactions can be combined, 4949 vacuumed to dryness, re-suspended by adding 0.1% formic acid, and analyzed by LC- MS/MS. 2D-Nan0LC-MS/MS: All labeled peptides mixtures can be separated by online 2D-nanoLC and analysed by electrospray tandem mass spectrometry. The experiments can be carried out on an Eksigent 2D NanoLC Ultra system connected to an LTQ Orbitrap Velos mass spectrometer equipped with a ectrospray ion source (Thermo Electron, Bremen, Germany).
The peptides mixtures can be injected into a 5 cm SCX column (300um ID, 5pm, PolySULFOETHYL Aspartamide column from PolyLC, Columbia, MD) with a flow of 4 uL / min and eluted in 10 ion exchange elution segments into a C18 trap column (2.5 cm, 100um ID, 5um, 300 A ProteoPep II from New Objective, Woburn, MA) and washed for 5 min with 1%FA. The separation then can be further carried out at 300 nL/min using a gradient of 2-45% B (H2O /0.1%FA (solvent A) and ACN /0.1%FA (solvent B)) for 120 minutes on a 15 cm fused silica column (75pm ID, 5um, 300 A ProteoPep II from New Objective, Woburn, MA).
Full scan MS spectra (m/z 300-2000) can be acquired in the ap with resolution of 30,000. The most intense ions (up to 10) can be sequentially isolated for fragmentation using High energy C-trap Dissociation (HCD) and dynamically exclude for 30 seconds. HCD can be conducted with an isolation width of 1.2 Da. The resulting fragment ions can be scanned in the ap with resolution of 7500. The LTQ Orbitrap Velos can be controlled by ur 2.1 with foundation 1.0.1. es/proteins identification and quantification: Peptides and proteins can be identified by automated database searching using Proteome Discoverer software o Electron) with Mascot search engine t SwissProt database. Search parameters can include 10 ppm for MS tolerance, 0.02 Da for MS2 nce, and full n digestion allowing for up to 2 missed cleavages. Carbamidomethylation (C) can be set as the fixed modification. Oxidation (M), TMT6, and deamidation (NQ) can be set as dynamic modifications. Peptides and protein identifications can be filtered with Mascot Significant Threshold (p<0.05). The filters can be allowed a 99% confidence level of protein identification (1% FDA).
The Proteome Discoverer software can apply correction factors on the reporter ions, and can reject all tation values if not all quantitation channels are present. ve n quantitation can be achieved by normalization at the mean intensity. 505O With t to the second type of data, in some exemplary embodiments, bioenergetics profiling of cancer and normal models may employ the SeahorseTM XF24 analyzer to enable the understanding of ysis and oxidative phosphorylation components .
Specifically, cells can be plated on Seahorse culture plates at optimal densities.
These cells can be plated in 100 pl of media or treatment and left in a 37°C incubator with 5% C02. Two hours later, when the cells are adhered to the 24 well plate, an additional 150 pl of either media or treatment solution can be added and the plates can be left in the culture incubator overnight. This two step seeding procedure allows for even bution of cells in the culture plate. Seahorse cartridges that contain the oxygen and pH sensor can be hydrated overnight in the calibrating fluid in a non-C02 incubator at 37°C. Three mitochondrial drugs are typically loaded onto three ports in the cartridge. ycin, a complex III inhibitor, FCCP, an uncoupler and Rotenone, a complex I inhibitor can be loaded into ports A, B and C respectively of the cartridge.
All stock drugs can be prepared at a 10x concentration in an unbuffered DMEM media.
The cartridges can be first incubated with the mitochondrial compounds in a non-C02 incubator for about 15 minutes prior to the assay. Seahorse culture plates can be washed in DMEM based unbuffered media that contains glucose at a concentration found in the normal growth media. The cells can be layered with 630 ul of the unbuffered media and can be equilibriated in a non-C02 tor before placing in the Seahorse instrument with a precalibrated cartridge. The instrument can be run for four loops with a mix, wait and measure cycle for get a baseline, before injection of drugs h the port is initiated. There can be two loops before the next drug is uced.
OCR (Oxygen consumption rate) and ECAR (Extracullular Acidification Rate) can be recorded by the odes in a 7 pl chamber and can be created with the cartridge pushing against the seahorse culture plate.
C. Data Integration and in silico Model Generation Once relevant data sets have been obtained, integration of data sets and generation of computer-implemented statistical models may be med using an AI- based informatics system or platform (e.g, the REFSTM platform). For example, an exemplary AI-based system may produce simulation-based networks of n associations as key drivers of metabolic end points (ECAR/OCR). See Figure 15. Some 5151 background details regarding the REFSTM system may be found in King et al., “Causal Modeling Using Network Ensemble Simulations of Genetic and Gene sion Data Predicts Genes Involved in toid Arthritis,” PLoS Computational Biology, vol. 7, issue. 3, 1-19 (March 2011) (e100105) and US. Patent 497 to Periwal, the entire contents of each of which is expressly incorporated herein by reference in its entirety. In essence, as described earlier, the REFSTM system is an ed system that employs mathematical algorithms to establish causal relationships among the input variables (e. g., protein expression levels, mRNA expression levels, and the corresponding functional data, such as the OCR / ECAR values measured on se culture plates).
This process is based only on the input data alone, without taking into eration prior existing knowledge about any potential, established, and/or verified biological relationships.
In particular, a significant advantage of the platform of the invention is that the AI-based system is based on the data sets obtained from the cell model, without resorting to or taking into consideration any existing knowledge in the art concerning the biological process. Further, preferably, no data points are statistically or artificially cut- off and, instead, all obtained data is fed into the AI-system for ining protein associations. Accordingly, the resulting statistical models generated from the rm are unbiased, since they do not take into consideration any known biological onships.
Specifically, data from the proteomics and ECAR/OCR can be input into the AI- based information system, which builds statistical models based on data ations, as described above. Simulation-based networks of protein associations are then derived for each disease versus normal scenario, including treatments and conditions using the following methods.
A detailed description of an exemplary process for building the generated (e. g., optimized or evolved) networks appears below with respect to Figure 16. As described above, data from the proteomics and functional cell data is input into the AI-based system (step 210). The input data, which may be raw data or minimally sed data, is pre-processed, which may include ization (e. g., using a quantile function or internal standards) (step 212). The pre-processing may also include imputing missing data values (e. g., by using the K-nearest neighbor (K-NN) algorithm) (step 212). 5252 The ocessed data is used to construct a network fragment library (step 214). The network fragments define quantitative, continuous relationships among all possible small sets (e. g., 2-3 member sets or 2-4 member sets) of ed variables (input data). The onships between the variables in a fragment may be linear, logistic, multinomial, nt or recessive homozygous, etc. The relationship in each fragment is assigned a Bayesian probabilistic score that reflect how likely the candidate relationship is given the input data, and also penalizes the relationship for its mathematical complexity. By g all of the possible pairwise and three-way relationships (and in some embodiments also four-way relationships) inferred from the input data, the most likely fragments in the library can be identified (the likely fragments). Quantitative parameters of the relationship are also computed based on the input data and stored for each fragment. Various model types may be used in fragment enumeration including but not limited to linear regression, logistic regression, (Analysis of Variance) ANOVA models, (Analysis of Covariance) ANCOVA models, non- linear/polynomial regression models and even non-parametric regression. The prior assumptions on model parameters may assume Gull distributions or Bayesian Information ion (BIC) penalties related to the number of parameters used in the model. In a k inference process, each network in an ensemble of initial trial networks is constructed from a subset of fragments in the nt library. Each initial trial network in the ensemble of initial trial networks is constructed with a different subset of the nts from the nt library (step 216).
An overview of the mathematical representations underlying the Bayesian networks and network fragments, which is based on Xing et al., “Causal Modeling Using Network Ensemble tions of Genetic and Gene sion Data Predicts Genes Involved in Rheumatoid Arthritis,” PLoS Computational y, vol. 7, issue. 3, 1-19 (March 2011) (e100105), is presented below.
A multivariate system with random variables X = X . . X be 1,. , n may characterized by a multivariate probability distribution function P(X1,. . ., Xn;®) that es a large number of parameters (9. The multivariate probability distribution function may be factorized and represented by a product of local conditional probability distributions: P(X1,...,Xn;®)= fig(x,|rfl,...,rflg ,o,) 5353 in which each variable X is independent from its non-descendent variables given its K l. 1. parent variables, which are Yj1,..., Y1K . After factorization, each local probability distribution has its own parameters (9,.
The multivariate probability distribution on may be factorized in different ways with each particular factorization and corresponding parameters being a ct ilistic model. Each particular ization (model) can be represented by a Directed Acrylic Graph (DAC) having a vertex for each variable X and directed edges between vertices representing dependences between variables in the local conditional distributions Pl. (Xile1,..., Y1K, ). phs of a DAG, each including a vertex and associated directed edges are network fragments.
A model is evolved or optimized by determining the most likely factorization and the most likely parameters given the input data. This may be described as “learning a Bayesian networ ,” or, in other words, given a training set of input data, finding a network that best s the input data. This is accomplished by using a g function that evaluates each network with respect to the input data.
A Bayesian framework is used to determine the likelihood of a factorization given the input data. Bayes Law states that the posterior probability, P(D|M), of a model M, given data D is proportional to the product of the product of the posterior probability of the data given the model assumptions, P(D|M), multiplied by the prior probability of the model, P(M ), assuming that the probability of the data, P(D), is constant across . This is expressed in the following equation: P(D|M)* P(M) P(M|D) = P(D) The posterior probability of the data assuming the model is the integral of the data likelihood over the prior bution of parameters: P(D|M) = j P(D|M(®))P(®|M )dG).
Assuming all models are equally likely (i.e., that P(M) is a constant), the posterior probability of model M given the data D may be factored into the product of integrals over parameters for each local network fragment Mi as follows: 5454 P):(M|DP1XY(X1“'YIJ'1’ ’ YjK;®81') Note that in the equation above, a leading constant term has been omitted. In some embodiments, a Bayesian Information Criterion (BIC), which takes a negative logarithm of the posterior ility of the model P(D|M) may be used to “Score” each model as follows: Sm (M) = -10gP(M|D)= ZS(M,-) i=1 ’ where the total score S,0, for a model M is a sum of the local scores S, for each local network fragment. The BIC further gives an expression for determining a score each individual k fragment: ’ logN where K(M,) is the number of fitting parameter in model M, and N is the number of samples (data points). SMLE(Mi) is the negative logarithm of the likelihood function for a network fragment, which may be calculated from the functional onships used for each network fragment. For a BIC score, the lower the score, the more likely a model fits the input data.
The ensemble of trial networks is globally optimized, which may be described as optimizing or evolving the networks (step 218). For example, the trial ks may be evolved and optimized according to a Metropolis Monte Carlo Sampling alogorithm.
Simulated annealing may be used to optimize or evolve each trial network in the ensemble through local transformations. In an example simulated ing processes, each trial network is changed by adding a network fragment from the library, by deleted a network fragment from the trial network, by substituting a k fragment or by otherwise changing k topology, and then a new score for the network is calculated. Generally ng, if the score improves, the change is kept and if the score worsens the change is rejected. A “temperature” parameter allows some local changes which worsen the score to be kept, which aids the optimization s in avoiding some local minima. The “temperature” parameter is decreased over time to allow the optimization/evolution process to converge. 5555 All or part of the network inference process may be conducted in parallel for the trial ent networks. Each network may be optimized in parallel on a te processor and/or on a separate computing device. In some embodiments, the optimization process may be conducted on a supercomputer incorporating hundreds to thousands of processors which operate in el. Information may be shared among the optimization processes conducted on parallel processors.
The optimization process may include a network filter that drops any networks from the ensemble that fail to meet a old standard for overall score. The dropped network may be replaced by a new initial network. Further any ks that are not “scale free” may be dropped from the ensemble. After the ensemble of networks has been optimized or evolved, the result may be termed an ensemble of generated cell model networks, which may be collectively referred to as the generated consensus network.
D. Simulation to Extract uantitative Relationshi Information and fol-g Prediction Simulation may be used to extract quantitative ter information regarding each onship in the generated cell model networks (step 220). For example, the simulation for quantitative information extraction may involve perturbing (increasing or decreasing) each node in the network by 10 fold and calculating the ior distributions for the other nodes (e. g., proteins) in the models. The endpoints are ed by t-test with the assumption of 100 samples per group and the 0.01 significance cut-off. The t-test statistic is the median of 100 t-tests. h use of this simulation technique, an AUC (area under the curve) representing the strength of prediction and fold change representing the in silico magnitude of a node driving an end point are generated for each relationship in the ensemble of networks.
A relationship quantification module of a local computer system may be employed to direct the AI-based system to perform the bations and to t the AUC information and fold information. The extracted quantitative information may e fold change and AUC for each edge connecting a parent note to a child node.
In some embodiments, a custom-built R program may be used to extract the quantitative information. 5656 In some embodiments, the ensemble of generated cell model networks can be used through simulation to predict responses to changes in conditions, which may be later verified though wet-lab cell-based, or animal-based, experiments.
The output of the AI—based system may be quantitative onship parameters and/or other simulation predictions (222).
E. Generation of Differential gDelta) Networks A ential network creation module may be used to generate differential (delta) networks between generated cell model ks and generated ison cell model networks. As described above, in some embodiments, the differential network compares all of the quantitative parameters of the relationships in the generated cell model networks and the generated comparison cell model network. The quantitative parameters for each onship in the differential network are based on the comparison.
In some embodiments, a differential may be performed between various differential networks, which may be termed a delta-delta network. An e of a delta-delta network is described below with t to Figure 18 in the Examples section. The differential k creation module may be a program or script written in PERL.
F. Visualization of ks The relationship values for the ensemble of networks and for the differential networks may be visualized using a k visualization program (e.g., Cytoscape open source platform for complex k analysis and visualization from the Cytoscape consortium). In the visual depictions of the networks, the thickness of each edge (e. g., each line connecting the proteins) represents the strength of fold change. The edges are also directional indicating causality, and each edge has an associated prediction confidence level.
G. Exemplary Computer System Figure 17 schematically depicts an exemplary computer system/environment that may be employed in some ments for communicating with the AI—based 5757 informatics , for generating ential networks, for visualizing ks, for saving and storing data, and/or for interacting with a user. As explained above, calculations for an AI-based informatics system may be performed on a separate supercomputer with hundreds or thousands of parallel processors that interacts, directly or ctly, with the exemplary computer system. The environment includes a computing device 100 with associated peripheral devices. ing device 100 is programmable to implement able code 150 for performing various methods, or portions of methods, taught herein. Computing device 100 es a storage device 116, such as a hard-drive, CD-ROM, or other non-transitory computer readable media.
Storage device 116 may store an operating system 118 and other related software.
Computing device 100 may further include memory 106. Memory 106 may comprise a computer system memory or random access memory, such as DRAM, SRAM, EDO RAM, etc. Memory 106 may comprise other types of memory as well, or combinations thereof. Computing device 100 may store, in storage device 116 and/or memory 106, ctions for implementing and processing each n of the executable code 150.
The executable code 150 may include code for communicating with the AI—based informatics system 190, for generating differential ks (e. g., a differential network creation module), for extracting quantitative relationship information from the AI-based informatics system (e.g., a relationship quantification module) and for visualizing networks (e. g., Cytoscape).
In some embodiments, the computing device 100 may communicate directly or indirectly with the AI-based atics system 190 (e. g., a system for executing REFS).
For example, the computing device 100 may communicate with the AI-based informatics system 190 by transferring data files (e. g., data frames) to the AI-based informatics system 190 h a network. Further, the computing device 100 may have executable code 150 that provides an interface and instructions to the AI-based informatics system 190.
In some embodiments, the computing device 100 may communicate directly or indirectly with one or more mental systems 180 that provide data for the input data set. Experimental systems 180 for generating data may include systems for mass spectrometry based proteomics, microarray gene expression, qPCR gene expression, mass spectrometry based metabolomics, and mass spectrometry based lipidomics, SNP 5858 rrays, a panel of functional assays, and other in-vitro biology platforms and technologies.
Computing device 100 also includes processor 102, and may include one or more additional processor(s) 102’, for executing software stored in the memory 106 and other programs for controlling system hardware, peripheral devices and/or eral hardware. Processor 102 and processor(s) 102’ each can be a single core processor or multiple core (104 and 104’) processor. Virtualization may be employed in computing device 100 so that infrastructure and resources in the computing device can be shared dynamically. Virtualized processors may also be used with executable code 150 and other software in storage device 116. A virtual machine 114 may be provided to handle a process running on multiple processors so that the process appears to be using only one computing resource rather than multiple. Multiple virtual machines can also be used with one sor.
A user may interact with computing device 100 through a visual display device 122, such as a computer monitor, which may display a user interface 124 or any other interface. The user interface 124 of the display device 122 may be used to display raw data, visual entations of networks, etc. The visual display device 122 may also y other aspects or elements of exemplary embodiments (e.g., an icon for e device 116). Computing device 100 may include other I/O devices such a keyboard or a multi-point touch interface (e. g., a touchscreen) 108 and a pointing device 110, (e. g., a mouse, trackball and/or trackpad) for ing input from a user. The keyboard 108 and the pointing device 110 may be connected to the visual display device 122 and/or to the computing device 100 via a wired and/or a wireless tion.
Computing device 100 may include a network interface 112 to interface with a network device 126 via a Local Area Network (LAN), Wide Area Network (WAN) or the Internet h a variety of connections including, but not d to, standard telephone lines, LAN or WAN links (e.g., 802.11, T1, T3, 56kb, X25), broadband connections (e. g., ISDN, Frame Relay, ATM), wireless connections, ller area network (CAN), or some combination of any or all of the above. The network interface 112 may comprise a built-in network adapter, network interface card, PCMCIA network card, card bus network adapter, wireless network adapter, USB k adapter, modem or any other device suitable for enabling computing device 100 to interface with any 5959 type of network capable of communication and ming the operations bed herein.
Moreover, computing device 100 may be any computer system such as a workstation, p computer, server, laptop, handheld computer or other form of computing or mmunications device that is capable of communication and that has ient processor power and memory capacity to perform the operations bed herein.
Computing device 100 can be running any operating system 118 such as any of the versions of the MICROSOFT WINDOWS operating systems, the different releases of the Unix and Linux operating systems, any version of the MACOS for Macintosh computers, any embedded ing system, any real-time operating system, any open source operating system, any proprietary operating system, any operating systems for mobile computing devices, or any other operating system capable of running on the computing device and performing the operations bed herein. The operating system may be g in native mode or emulated mode.
IV. Models for Drug-induced Toxicity and Uses Therefor Virtually all drug-induced toxicity involves complicated interactions among different cell types and/or organ systems. Perturbation of critical functions in one cell type or organ may lead to secondary effects on other interacting cells types and organs, and such ream changes may in turn feedback to the initial changes and cause further cations. Therefore, it is beneficial to dissect a given drug-induced toxicity to its components, such as interaction between pairs of cell types or organs, and systemically probe the interactions between these components in order to gain a more complete, global view of the drug-induced toxicity process.
Accordingly, the present invention provides cell models for drug-induced toxicity. To this end, Applicants have built cell models for an exemplary drug-induced toxicity (e. g., cardiotoxicity) which have been employed in the subject discovery Platform Technology. Applicants have conducted ments with the cell models using the subject discovery Platform Technology to generate consensus causal 606O relationship networks, including causal relationships unique in the drug-induced toxicity, and thereby identify “modulators” or critical molecular “drivers” important for the particular drug-induced toxicity.
One significant advantage of the rm Technology and its components, e. g., the custom built cell models and data sets obtained from the nduced toxicity cell models, is that an initial, “first generation” sus causal onship network generated for a drug-induced toxicity can continually evolve or expand over time, e. g., by the introduction of additional cell lines/types and/or additional conditions.
Additional data from the evolved cell model, i.e., data from the newly added portion(s) of the cell model, can be collected. The new data collected from an expanded or evolved cell model, i.e., from newly added portion(s) of the cell model, can then be introduced to the data sets previously used to generate the “first generation” consensus causal relationship network in order to generate a more robust “second generation” sus causal relationship network. New causal relationships unique to the drug- induced toxicity can then be identified from the “second generation” consensus causal relationship network. In this way, the evolution of the drug-induced toxicity cell model provides an ion of the sus causal relationship networks, thereby providing new and/or more reliable insights into the modulators of the drug-induced toxicity. In this way, both the drug-induced toxicity cell models, the data sets from the cell models, and the causal relationship networks generated from the drug-induced toxicity cell models by using the Platform Technology methods can constantly evolve and build upon us knowledge obtained from the Platform Technology.
Accordingly, the invention provides consensus causal relationship networks generated from the drug-induced ty cell models employed in the rm logy. These consensus causal relationship networks may be first generation consensus causal relationship networks, or may be multiple generation consensus causal relationship networks, e.g., 2““ 3““, 4““, 5““, 6““, 7““, 8““, 9““, 10““, 11““, 12““, 13““, 14““, 15““, 16m, 17m, 18m, 19““, 20th or greater generation sus causal relationship networks.
Further, the invention provides simulated consensus causal onship networks generated from the drug-induced toxicity cell models employed in the Platform Technology. These simulated consensus causal relationship networks may be first generation simulated consensus causal relationship networks, or may be multiple generation simulated consensus causal relationship networks, e.g., 2“CL 3rd, 4th, 5”“, 6th, 7th, 6161 8““, 9““, 10““, 11““, 12““, 13““, 14““, 15““, 16““, 17““, 18““, 19““, 20““ or greater simulated generation consensus causal relationship ks. The invention further provides delta networks and delta-delta networks generated from any of the consensus causal relationship networks of the ion.
A custom built cell model for a drug-induced toxicity comprises one or more cells associated with the drug-induced toxicity. The model for a drug-induced toxicity may be established to simulate an environment of the drug-induced toxicity, e. g., environment of drug-induced cardiotoxicity in vivo, by creating conditions (e. g., cell culture conditions) that mimic a characteristic aspect of the drug-induced toxicity.
Multiple cells of the same or different origin, as opposed to a single cell type, may be included in the cell model. In one embodiment, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, , ll, l2, l3, 14, 15, l6, l7, l8, 19, 20, 25, 30, 35, 40, 50 or more different cell lines or cell types are included in the drug-induced toxicity cell model. In one embodiment, the cells are all of the same type, e. g., all cardiomyocytes, but are different established cell lines, e. g., different established cell lines of cardiomyocytes. All values ted in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, l and 10, 2 and 5, or 5 and 15 different cell lines or cell types.
Examples of cell types that may be included in the cell models of the invention e, without limitation, human cells, animal cells, mammalian cells, plant cells, yeast, bacteria, or fungae. In one embodiment, cells of the cell model can include ed cells, such as cancer cells or bacterially or virally infected cells. In one embodiment, cells of the cell model can include drug-induced toxicity associated cells, such as cells involved in diabetes, obesity or vascular drug-induced toxicity state, e. g., aortic smooth muscle cells or hepatocytes. The d person would recognize those cells that are involved in or associated with a particular drug-induced toxicity, e. g., cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity, and any such cells may be included in a cell model of the invention, e. g., cardiomyocytes, ic cardiomyocytes, hepatocytes, kidney cells, neuro cells, renal cells, or myoblasts.
Cell models of the invention may include one or more ol cells.” In one embodiment, a control cell may be an untreated or unperturbed cell. In another embodiment, a “control cell” may be a normalcell, e.g., a cell that has not been exposed 6262 to a toxicity-causing agent or drug. In one embodiment, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, , 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50 or more different control cells are included in the cell model. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e. g., between 1 and 5, 1 and 10, 2 and 5, or 5 and 15 different control cell lines or control cell types. In one embodiment, the control cells are all of the same type but are different established cell lines of that cell type. In one embodiment, as a control, one or more normal, e. g., non-diseased, cell lines are ed under similar conditions, and/or are exposed to the same perturbation, as the primary cells of the cell model in order to identify proteins or pathways unique to the drug-induced toxicity.
A custom cell model of the invention may also comprise conditions that mimic a characteristic aspect of the drug-induced toxicity. For example, cell culture conditions may be selected that closely approximating the conditions of a cell in a diabetic environment in vivo for g diabetic drug d toxicity, or of an aortic smooth muscle cell of a patient suffering from drug-induced cardiotoxicity. In some instances, the conditions are stress conditions.Various conditions / stressors may be employed in the cell models of the ion. In one ment, these stressors / conditions may constitute the “perturbation”, e. g., external stimulus, for the cell systems. One exemplary stress condition is hypoxia, a condition typically found, for example, within patients with advanced stage of diabetes. Hypoxia can be d using art-recognized methods. For example, hypoxia can be induced by placing cell s in a r tor Chamber (MIC-101, s-Rothenberg Inc. Del Mar, CA), which can be flooded with an industrial gas mix containing 5% C02, 2% Oz and 93% nitrogen.
Effects can be measured after a pre-determined period, e.g., at 24 hours after hypoxia ent, with and without additional external stimulus components (e.g., CleO at 0, 50, or 100 MM). Likewise, lactic acid treatment mimics a cellular environment where glycolysis activity is high. Lactic acid induced stress can be investigated at a final lactic acid concentration of about 12.5 mM at a pre-determined time, e.g., at 24 hours, with or without additional external stimulus components (e.g., CleO at 0, 50, or 100 MM).
Hyperglycemia is a condition found in diabetes as well as in diabetic drug-induced toxicity. A typical hyperglycemic ion that can be used to treat the t cells include 10% culture grade e added to suitable media to bring up the final concentration of glucose in the media to about 22 mM. Hyperlipidemia is a condition 6363 found, for example, in obesity and cardiovascular disease, and can be used to simulate nduced cardiotoxicity. The hyperlipidemic conditions can be provided by culturing cells in media containing 0.15 mM sodium palmitate. Hyperinsulinemia is a condition found, for example, in diabetes, as well as in diabetic drug-induced toxicity.
The hyperinsulinemic conditions may be d by ing the cells in media containing 1000 nM insulin.
Individual conditions may be investigated separately in the custom built cell models of the invention, and/or may be combined together. In one embodiment, a combination of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50 or more conditions reflecting or simulating different characteristic aspects of the biological system are investigated in the custom built cell model. In one embodiment, individual conditions and, in addition, combinations of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, ll, l2, l3, 14, 15,20, , 30, 35, 40, 45, 50 or more of the conditions reflecting or simulating ent characteristic aspects of the drug-induced toxicity are investigated in the custom built drug-induced toxicity cell model. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, l and 10, l and 20, l and 30, 2 and 5, 2 and 10, 5 and 10, l and 20, 5 and 20, 10 and 20, 10 and 25, 10 and 30 or 10 and 50 different conditions.
Once the custom drug-induced ty cell model is built, one or more “perturbations” may be applied to the , such as genetic variation from patient to patient, or with / without treatment by certain drugs or pro-drugs. See Figure 15D. The effects of such perturbations to the cell model system can be measured using various art- ized or etary means, as described in n III.B below.
The custom built nduced toxicity cell model may be exposed to a perturbation, e.g., an “environmental perturbation” or nal stimulus component”.
The “environmental perturbation” or “external stimulus component” may be endogenous to the cellular environment (e.g., the ar environment contains some levels of the stimulant, and more of the same is added to increase its level), or may be exogenous to the cellular environment (e. g., the stimulant/perturbation is largely absent from the cellular environment prior to the tion). The cellular environment may further be altered by secondary changes resulting from adding the environmental perturbation or external stimulus component, since the external stimulus component may change the cellular output of the cell system, including molecules secreted into the cellular 6464 environment by the cell system. The nmental perturbation or external stimulus component may include any external physical and/or chemical us that may affect cellular function. This may include any large or small organic or inorganic molecules, l or synthetic chemicals, ature shift, pH change, radiation, light (UVA, UVB eta), microwave, sonic wave, electrical current, modulated or unmodulated magnetic fields, etc. The environmental perturbation or external us component may also include an introduced genetic modification or mutation or a vehicle (e. g., vector) that causes a genetic modification / mutation. (i) Cross-talk cell systems In certain situations, where interaction between two or more cell systems are desired to be investigated, a “cross-talking cell system” may be formed by, for example, bringing the ed ar environment of a first cell system into contact with a second cell system to affect the cellular output of the second cell system.
As used herein, “cross-talk cell system” comprises two or more cell systems, in which the cellular environment of at least one cell system comes into contact with a second cell system, such that at least one cellular output in the second cell system is changed or affected. In certain embodiments, the cell systems within the cross-talk cell system may be in direct contact with one another. In other embodiments, none of the cell systems are in direct t with one another.
For example, in certain embodiments, the cross-talk cell system may be in the form of a transwell, in which a first cell system is growing in an insert and a second cell system is growing in a corresponding well compartment. The two cell systems may be in contact with the same or different media, and may exchange some or all of the media components. External stimulus component added to one cell system may be substantially absorbed by one cell system and/or degraded before it has a chance to diffuse to the other cell system. Alternatively, the external stimulus component may eventually approach or reach an equilibrium within the two cell systems.
In certain embodiments, the talk cell system may adopt the form of tely cultured cell systems, where each cell system may have its own medium and/or culture conditions (temperature, C02 content, pH, etc), or r or identical culture conditions. The two cell systems may come into t by, for example, taking the conditioned medium from one cell system and bringing it into contact with another 6565 cell system. Direct cell-cell contacts between the two cell systems can also be effected if desired. For example, the cells of the two cell systems may be co-cultured at any point if desired, and the co-cultured cell systems can later be separated by, for example, FACS sorting when cells in at least one cell system have a sortable marker or label (such as a stably expressed cent marker protein GFP).
Similarly, in certain embodiments, the cross-talk cell system may simply be a co- e. Selective treatment of cells in one cell system can be effected by first treating the cells in that cell system, before culturing the treated cells in co-culture with cells in another cell system. The co-culture cross-talk cell system setting may be helpful when it is desired to study, for example, effects on a second cell system caused by cell surface changes in a first cell system, after stimulation of the first cell system by an external stimulus component.
The cross-talk cell system of the ion is particularly suitable for exploring the effect of certain pre-determined external stimulus component on the cellular output of one or both cell systems. The primary effect of such a stimulus on the first cell system (with which the us directly contact) may be determined by comparing cellular s (e.g., protein expression level) before and after the first cell system’s contact with the external stimulus, which, as used herein, may be referred to as “(significant) ar output entials.” The secondary effect of such a stimulus on the second cell system, which is mediated through the modified cellular environment of the first cell system (such as its secretome), can also be similarly measured. There, a comparison in, for example, proteome of the second cell system can be made n the proteome of the second cell system with the external stimulus treatment on the first cell system, and the proteome of the second cell system without the external stimulus treatment on the first cell system. Any significant changes observed (in proteome or any other cellular outputs of interest) may be referred to as a “significant cellular cross-talk differential.” In making cellular output ements (such as protein expression), either absolute expression amount or ve expression level may be used. For example, to ine the relative protein expression level of a second cell system, the amount of any given protein in the second cell system, with or without the external stimulus to the first cell system, may be compared to a le control cell line and mixture of cell lines and given a fold-increase or fold-decrease value. A pre-determined threshold level for 6666 such fold-increase (e.g., at least 1.2, 13,14, 151.6, 1.7, 1.8, 1.9, 2, 2.5, 3, 3.5, 4, 4.5, , 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 75 or 100 or more fold se) or fold- decrease (e.g., at least a decrease to 0.95, 0.9, 0.8, 0.75, 0.7, 0.6, 0.5, 0.45, 0.4, 0.35, 0.3, 0.25, 0.2, 0.15, 0.1 or 0.05 fold, or 90%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10% or 5% or less) may be used to select significant cellular cross-talk differentials. All values presented in the foregoing list can also be the upper or lower limit of ranges, e.g., between 1.5 and 5 fold, n 2 and 10 fold, between 1 and 2 fold, or between 0.9 and 0.7 fold, that are intended to be a part of this ion.
Throughout the t application, all values presented in a list, e. g., such as those above, can also be the upper or lower limit of ranges that are intended to be a part of this invention.
To illustrate, in one exemplary two-cell system established to imitate aspects of a drug-induced cardiotoxicity and nephrotoxicity model, a heart smooth muscle cell line (first cell system) may be treated with a hypoxia condition (an external stimulus component), and proteome changes in a kidney cell line (second cell system) resulting from ting the kidney cells with conditioned medium of the heart smooth muscle may be measured using conventional quantitative mass ometry. Significant cellular cross-talking differentials in these kidney cells may be ined, based on ison with a proper control (e. g., similarly cultured kidney cells contacted with conditioned medium from similarly cultured heart smooth muscle cells n_ot treated with hypoxia conditions).
Not every observed significant cellular talking differentials may be of biological significance. With respect to any given drug-induced toxicity for which the subject interrogative biological assessment is applied, some (or maybe all) of the significant cellular cross-talking differentials may be “determinative” with respect to the specific biological problem at issue, e. g., either sible for causing a drug-induced toxicity (a potential target for therapeutic intervention) or is a biomarker for the drug- induced toxicity (a potential diagnostic or prognostic factor).
Such determinative cross-talking differentials may be selected by an end user of the subject method, or it may be selected by a bioinformatics software program, such as DAVID-enabled comparative pathway analysis program, or the KEGG pathway analysis m. In certain embodiments, more than one bioinformatics software program is 6767 used, and consensus results from two or more bioinformatics software programs are preferred.
As used herein, “differentials” of cellular outputs e differences (e.g., increased or decreased levels) in any one or more parameters of the cellular s. For example, in terms of protein expression level, differentials between two ar outputs, such as the outputs associated with a cell system before and after the treatment by an external stimulus component, can be measured and quantitated by using art-recognized technologies, such as mass-spectrometry based assays (e.g., iTRAQ, MSMS, etc.) B. Use of Cell Models for Interrogative Biological Assessments The methods and cell models described herein, and further described in international Application No. PCT/U82012/027615, may be used for, or applied to, any number of rogative ical assessments.” Use of the s of the invention for an interrogative biological assessment tates the identification of “modulators” or determinative cellular process “drivers” of a drug-induced toxicity.
As used herein, an “interrogative biological assessment” may include the identification of one or more modulators of a biological system, e. g., inative cellular s “drivers,” (e. g., an increase or decrease in activity of a biological pathway, or key members of the pathway, or key regulators to members of the y) associated with the environmental perturbation or external stimulus component, or a unique causal onship unique in a biological system or process. It may further include additional steps designed to test or verify whether the identified determinative ar process drivers are necessary and/or sufficient for the downstream events ated with the environmental perturbation or external stimulus component, including in vivo animal models and/or in vitro tissue culture experiments.
In a preferred embodiment, the interrogative biological assessment is the assessment of the drug-induced toxicological profile of an agent, e.g., a drug, on a cell, tissue, organ or organism, wherein the identified modulators of a biological system, e. g., determinative cellular process driver (e. g., cellular cross-talk entials or causal relationships unique in a biological system or process) may be indicators of drug- induced toxicities, e. g., cytotoxicity, cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity, and may in turn be used to predict or 6868 identify the logical profile of the drug. In one embodiment, the identified modulators of a drug-induced toxicity, e. g., determinative cellular process driver (e. g., cellular cross-talk differentials or causal relationships unique in a drug-induced toxicity) is an indicator of cardiotoxicity of a drug or drug candidate, and may in turn be used to t or identify the cardiotoxicological e of the drug or drug candidate.
V. Proteomic Sample Analysis In certain embodiments, the subject method employs large-scale high-throughput quantitative proteomic analysis of ds of samples of similar character, and provides the data necessary for identifying the cellular output differentials.
There are us art-recognized technologies suitable for this purpose. An ary technique, iTRAQ analysis in combination with mass spectrometry, is briefly described below.
To provide reference samples for relative quantification with the iTRAQ technique, multiple QC pools are created. Two separate QC pools, consisting of aliquots of each sample, were generated from the Cell #1 and Cell #2 samples - these samples are denoted as QCSl and QCSZ, and QCPl and QCP2 for supernatants and pellets, respectively. In order to allow for protein concentration comparison across the two cell lines, cell pellet aliquots from the QC pools bed above are combined in equal volumes to generate reference samples (QCP).
The quantitative proteomics approach is based on stable isotope labeling with the 8-plex iTRAQ reagent and 2D-LC MALDI MS/MS for e identification and quantification. Quantification with this technique is ve: peptides and proteins are assigned abundance ratios relative to a reference sample. Common reference samples in le iTRAQ experiments facilitate the comparison of samples across multiple iTRAQ experiments.
To implement this analysis , six primary samples and two control pool samples are combined into one 8-plex iTRAQ mix, with the control pool samples d with 113 and 117 reagents according to the manufacturer’s suggestions. This mixture of eight s is then fractionated by two-dimensional liquid chromatography; strong cation exchange (SCX) in the first dimension, and reversedphase HPLC in the second dimension. The HPLC eluent is directly fractionated onto 6969 MALDI plates, and the plates are analyzed on an MDS SCIEX/AB 4800 MALDI TOF/TOF mass spectrometer.
In the absence of additional information, it is assumed that the most important changes in protein sion are those within the same cell types under different treatment conditions. For this reason, primary samples from Cell#l and Cell#2 are analyzed in separate iTRAQ mixes. To facilitate comparison of protein expression in Cell#l vs. Cell#2 samples, universal QCP samples are analyzed in the available “iTRAQ slots” not occupied by primary or cell line specific QC samples (QCl and QC2).
A brief overview of the laboratory procedures employed is provided herein.
A. Protein Extraction From Cell Supernatant Samples For cell supernatant samples (CSN), proteins from the e medium are present in a large excess over proteins secreted by the cultured cells. In an attempt to reduce this background, upfront abundant n ion was implemented. As specific affinity columns are not available for bovine or horse serum proteins, an anti- human Ing4 column was used. While the antibodies are directed against human proteins, the broad icity provided by the onal nature of the antibodies was anticipated to accomplish depletion of both bovine and equine proteins present in the cell culture media that was used.
A ZOO-pl aliquot of the CSN QC material is loaded on a 10-mL Ing4 depletion column before the start of the study to determine the total protein concentration (Bicinchoninic acid (BCA) assay) in the flow-through material. The loading volume is then ed to achieve a ed on containing approximately 40 ug total protein.
B. Protein Extraction From Cell Pellets An aliquot of Cell #1 and Cell #2 is lysed in the “standard” lysis buffer used for the analysis of tissue samples at BGM, and total protein content is determined by the BCA assay. Having established the protein content of these representative cell lystates, all cell pellet samples (including QC samples bed in Section 1.1) were processed to cell lysates. Lysate amounts of imately 40 pg of total protein were carried forward in the processing workflow.
C. Sample Preparation for Mass Spectrometry 707O Sample preparation follows standard ing procedures and tute of the following: 0 Reduction and alkylation of proteins 0 Protein clean-up on reversed-phase column (cell pellets only) 0 Digestion with trypsin 0 iTRAQ labeling 0 Strong cation exchange chromatography — collection of six fractions (Agilent 1200 system) 0 HPLC onation and spotting to MALDI plates (Dionex te3000/Probot system) D. MALDI MS and MS/MS HPLC-MS generally employs online ESI MS/MS strategies. BG Medicine uses an off-line LC-MALDI MS/MS platform that results in better concordance of ed protein sets across the primary samples t the need of injecting the same sample multiple times. Following first pass data tion across all iTRAQ mixes, since the peptide fractions are retained on the MALDI target plates, the samples can be analyzed a second time using a targeted MS/MS acquisition pattern derived from knowledge gained during the first acquisition. In this , maximum observation frequency for all of the identified proteins is accomplished (ideally, every protein should be measured in every iTRAQ mix).
E. Data Processing The data processing process within the BGM Proteomics workflow can be separated into those procedures such as preliminary peptide identification and quantification that are completed for each iTRAQ mix individually (Section 1.5.1) and those processes (Section 1.5.2) such as final assignment of peptides to proteins and final quantification of proteins, which are not ted until data acquisition is completed for the project.
The main data processing steps within the BGM Proteomics w are: 0 Peptide identification using the Mascot (Matrix Sciences) database search engine 0 Automated in house validation of Mascot IDs 0 Quantification of peptides and preliminary quantification of proteins 7171 0 Expert curation of final t 0 Final assignment of peptides from each mix into a common set of proteins using the automated PVT tool 0 r elimination and final quantification of ns (i) Data Processing of Individual iTRAQ Mixes As each iTRAQ mix is processed through the workflow the MS/MS spectra are analyzed using proprietary BGM software tools for peptide and protein fications, as well as initial assessment of quantification information. Based on the results of this preliminary analysis, the y of the workflow for each primary sample in the mix is judged t a set of BGM mance metrics. If a given sample (or mix) does not pass the specified minimal performance s, and additional material is available, that sample is repeated in its entirety and it is data from this second implementation of the workflow that is incorporated in the final t. (ii) Peptide Identification MS/MS spectra was searched against the Uniprot protein sequence database containing human, bovine, and horse sequences augmented by common contaminant sequences such as porcine trypsin. The details of the Mascot search parameters, including the complete list of modifications, are given in Table 1.
Table 1: Mascot Search Parameters Precursor mass tolerance 100 ppm Fragment mass tolerance Variable modifications N—term iTRAQ8 Lysine iTRAQ8 Cys carbamidomethyl Pyro-Glu (N-term) Pyro-Carbamidomethyl Cys (N-term) Deamidation (N only) Oxidation (M) Enzyme specificity Fully Tryptic Number of missed tryptic 2 sites allowed Peptide rank considered 1 7272 After the Mascot search is complete, an auto-validation ure is used to promote (i.e., validate) specific Mascot peptide matches. Differentiation between valid and invalid matches is based on the attained Mascot score relative to the expected Mascot score and the difference between the Rank 1 peptides and Rank 2 peptide Mascot scores. The criteria required for validation are somewhat relaxed if the peptide is one of several matched to a single protein in the iTRAQ mix or if the peptide is t in a catalogue of previously ted peptides. (iii) Peptide and Protein Quantification The set of validated peptides for each mix is utilized to calculate preliminary protein quantification metrics for each mix. Peptide ratios are calculated by dividing the peak area from the iTRAQ label (i.e., m/z 114, 115, 116, 118, 119, or 121) for each validated peptide by the best representation of the peak area of the reference pool (QCl or QC2). This peak area is the average of the 113 and 117 peaks ed both samples pass QC acceptance criteria. Preliminary protein ratios are determined by ating the median ratio of all “useful” validated es matching to that protein. “Useful” peptides are fully iTRAQ labeled (all N-terminal are labeled with either Lysine or PyroGlu) and fully Cysteine labeled (i.e., all Cys residues are alkylated with Carbamidomethyl or N-terminal Pyro-cmc). (iv) Post-acquisition Processing Once all passes of MS/MS data ition are complete for every mix in the project, the data is collated using the three steps discussed below which are aimed at enabling the results from each primary sample to be simply and meaningfully compared to that of another. (v) Global Assignment of e Sequences to Proteins Final assignment of peptide ces to protein accession numbers is carried out through the proprietary Protein Validation Tool (PVT). The PVT procedure determines the best, minimum non-redundant protein set to describe the entire tion of peptides identified in the project. This is an automated procedure that has been optimized to handle data from a homogeneous taxonomy. n assignments for the supernatant experiments were manually d in order to deal with the complexities of mixed taxonomies in the se. Since the automated paradigm is not valid for cell cultures grown in bovine and horse serum 7373 supplemented media, extensive manual curation is necessary to minimize the ambiguity of the source of any given protein. (vi) Normalization of Peptide Ratios The peptide ratios for each sample are normalized based on the method of Vandesompele et al. Genome Biology, 2002, 3(7), research 0034.1-11. This procedure is applied to the cell pellet measurements only. For the supernatant samples, quantitative data are not normalized considering the largest contribution to peptide identifications coming from the media. (vii) Final Calculation of n Ratios A standard statistical outlier elimination ure is used to remove outliers from around each protein median ratio, beyond the 1.96 6 level in the log-transformed data set. Following this elimination process, the final set of protein ratios are (re- )calculated.
VI. Markers of the Invention and Uses Thereof The present invention is based, at least in part, on the identification of novel biomarkers that are associated with drug-induced toxicities, such as a drug-induced toxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, oxicity, or myotoxicity, or response of a drug-induced toxicity to a perturbation, such as a therapeutic agent.
In particular, the ion relates to markers (hereinafter “markers” or “markers of the invention”), which are described in the examples. The ion provides nucleic acids and proteins that are encoded by or correspond to the markers (hereinafter “marker c acids” and r proteins,” respectively). These markers are particularly useful in diagnosing drug-induced toxicity states; sing drug-induced toxicity states; developing drug targets for varies drug-induced toxicity states; screening for the presence of toxicity, preferably drug-induced toxicities, e. g., cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity; identifying an agent that cause or is at risk for causing nduced toxicity; identifying an agent that can reduce or prevent drug-induced toxicity; ating, reducing or ting nducedtoxicity; and identifying markers predictive of drug-induced toxicity. 7474 A "marker" is a gene whose altered level of sion in a tissue or cell from its sion level in normal or healthy tissue or cell is associated with a toxicity state, such as a drug-induced toxicity, e. g., cardiotoxicity. A “marker nucleic acid” is a nucleic acid (e.g., mRNA, cDNA) encoded by or corresponding to a marker of the invention. Such marker nucleic acids include DNA (6. g., cDNA) sing the entire or a partial sequence of any of the genes that are markers of the invention or the complement of such a sequence. Such sequences are known to the one of skill in the art and can be found for example, on the NIH government pubmed e. The marker nucleic acids also include RNA comprising the entire or a partial sequence of any of the gene markers of the invention or the complement of such a sequence, wherein all thymidine residues are replaced with uridine residues. A “marker protein” is a protein encoded by or corresponding to a marker of the invention. A marker protein comprises the entire or a partial sequence of any of the marker ns of the invention. Such sequences are known to the one of skill in the art and can be found for example, on the NIH government pubmed website. The terms “protein” and “polypeptide’ are used interchangeably.
A “toxic state associated" body fluid is a fluid which, when in the body of a patient, contacts or passes through sarcoma cells or into which cells or proteins shed from sarcoma cells are capable of passing. Exemplary disease state or toxic state associated body fluids include blood fluids (e.g. whole blood, blood serum, blood having platelets removed therefrom), and are described in more detail below. Disease state or toxic state associated body fluids are not limited to, whole blood, blood having platelets removed therefrom, lymph, prostatic fluid, urine and semen.
The "normal" level of expression of a marker is the level of expression of the marker in cells of a human t or patient not afflicted with a toxicity state.
An “over-expression” or “higher level of expression” of a marker refers to an sion level in a test sample that is greater than the standard error of the assay employed to assess expression, and is preferably at least twice, and more preferably three, four, five, six, seven, eight, nine or ten times the expression level of the marker in a control sample (e. g., sample from a healthy subject not having the marker associated a drug-induce ty state, e.g., toxicit, hepatotoxicity, nephrotoxicity, neurotoxicity, oxicity, or myotoxicity) and preferably, the average expression level of the marker in l control samples. 7575 A “lower level of expression” of a marker refers to an expression level in a test sample that is at least twice, and more preferably three, four, five, six, seven, eight, nine or ten times lower than the expression level of the marker in a control sample (6. g., sample from a healthy subjects not having the marker associated a drug-induced toxicity state, e.g., cardiotoxicity, cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity) and preferably, the average expression level of the marker in l control samples.
A "transcribed polynucleotide" or “nucleotide transcript” is a polynucleotide (e. g. an mRNA, hnRNA, a cDNA, or an analog of such RNA or cDNA) which is mentary to or homologous with all or a portion of a mature mRNA made by transcription of a marker of the invention and normal ranscriptional processing (e. g. splicing), if any, of the RNA transcript, and reverse transcription of the RNA transcript.
"Complementary" refers to the broad concept of ce complementarity between regions of two nucleic acid strands or between two regions of the same nucleic acid strand. It is known that an adenine residue of a first nucleic acid region is capable of forming specific hydrogen bonds (”base pairing") with a e of a second nucleic acid region which is antiparallel to the first region if the residue is thymine or uracil.
Similarly, it is known that a cytosine residue of a first nucleic acid strand is capable of base pairing with a residue of a second nucleic acid strand which is antiparallel to the first strand if the e is guanine. A first region of a c acid is complementary to a second region of the same or a ent nucleic acid if, when the two regions are arranged in an antiparallel fashion, at least one nucleotide residue of the first region is capable of base pairing with a residue of the second region. Preferably, the first region comprises a first portion and the second region comprises a second portion, whereby, when the first and second portions are arranged in an antiparallel fashion, at least about 50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide residues of the first portion are capable of base pairing with nucleotide residues in the second portion. More preferably, all nucleotide residues of the first portion are capable of base pairing with nucleotide residues in the second portion.
"Homologous" as used , refers to nucleotide ce similarity between two regions of the same nucleic acid strand or n regions of two ent nucleic acid strands. When a nucleotide residue position in both regions is occupied by the 7676 same nucleotide e, then the regions are homologous at that on. A first region is gous to a second region if at least one nucleotide residue position of each region is occupied by the same residue. Homology n two regions is expressed in terms of the tion of nucleotide residue positions of the two regions that are occupied by the same nucleotide residue. By way of example, a region having the nucleotide ce 5'—ATTGCC—3' and a region having the nucleotide sequence 5'- TATGGC—3’ share 50% homology. Preferably, the first region comprises a first portion and the second region comprises a second portion, whereby, at least about 50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide residue positions of each of the portions are occupied by the same nucleotide residue.
More preferably, all nucleotide e positions of each of the portions are occupied by the same nucleotide residue.
“Proteins of the invention” ass marker ns and their fragments; variant marker proteins and their fragments; peptides and polypeptides comprising an at least 15 amino acid segment of a marker or variant marker protein; and fusion proteins comprising a marker or variant marker protein, or an at least 15 amino acid segment of a marker or variant marker protein.
The invention further provides antibodies, antibody derivatives and antibody fragments which specifically bind with the marker proteins and fragments of the marker proteins of the present ion. Unless otherwise specified herewithin, the terms “antibody” and “antibodies” broadly encompass naturally-occurring forms of antibodies (e.g., IgG, IgA, IgM, IgE) and recombinant antibodies such as single-chain antibodies, chimeric and humanized antibodies and multi-specific dies, as well as fragments and tives of all of the foregoing, which fragments and derivatives have at least an antigenic binding site. Antibody derivatives may comprise a protein or chemical moiety conjugated to an antibody.
In one embodiment, the markers of the invention are genes or proteins associated with or involved in drug-induced toxicity. Such genes or proteins involved in drug- induced toxicity include, for example, the markers listed in table 2. In some embodiments, the markers of the invention are a ation of at least two, three, four, five, six, seven, eight, nine, 10, ll, l2, l3, 14, 15, l6, l7, 18, 19,20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the foregoing genes (or proteins). All values presented in the foregoing list can also be the upper or lower limit 7777 of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, 1 and 10, 1 and 20, 1 and 30, 2 and 5, 2 and 10, 5 and 10, 1 and 20, 5 and 20, 10 and 20, 10 and , 10 and 30 of the foregoing genes (or proteins).
A. Cardiotoxicity Associated Markers The present invention is based, at least in part, on the identification of novel biomarkers that are associated with nduced toxicity. The invention is further based, at least in part, on the discovery that Coenzyme Q10 is capable of reducing or preventing drug-induced cardiotoxicity.
Accordingly, the invention provides methods for identifying an agent that causes or is at risk for g nduced cardiotoxicity. In one embodiment, the agent is a drug or drug candidate. In these methods, the amount of one or more biomarkers/proteins in a pair of samples (a first sample not subject to the drug treatment, and a second sample subjected to the drug treatment) is assessed. A modulation in the level of expression of the one or more biomarkers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug- induced cardiotoxicity. In one embodiment, the one or more biomarkers is ed from the markers listed in table 2. The methods of the t invention can be practiced in conjunction with any other method used by the skilled practitioner to identify a drug at risk for causing drug-induced cardiotoxocity.
Accordingly, in one aspect, the invention provides a method for identifying a drug that causes or is at risk for g drug-induced cardiotoxicity, comprising: comparing (i) the level of expression of one or more biomarkers present in a first cell sample ed prior to the treatment with the drug; with (ii) the level of expression of the one or more biomarkers present in a second cell sample obtained following the treatment with the drug; wherein the one or more biomarkers is selected from the s listed in table 2; wherein a modulation in the level of expression of the one or more biomarkers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
In one embodiment, the cells are cells of the cardiovascular system, e.g., cardiomyocytes. In one ment, the cells are diabetic cardiomyocytes. In one embodiment, the drug is a drug or candidate drug for treating diabetes, obesity or cardiovascular disease. 7878 In one embodiment, a modulation (e.g., an increase or a decrease) in the level of expression of one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, en, twenty, twenty-five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160 or more of the biomarkers selected from the markers listed in table 2 in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug- d cardiotoxicity.
In one embodiment, a modulation (e.g., an increase or a decrease) in the level of expression of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen, markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), lCl7, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4 in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
Methods for identifying a rescue agent that can reduce or prevent drug-induced cardiotoxicity are also provided by the invention. In one embodiment, the drug is a drug or drug candidate for treating es, obesity or a vascular disorder. In these s, the amount of one or more biomarkers in three samples (a first sample not subjected to the drug treatment, a second sample subjected to the drug treatment, and a third sample subjected both to the drug treatment and the agent) is assessed. imately a normalized level of expression of the one or more biomarkers, in the third sample as compared to the first , with a changed level of expression in the second , is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity. In one embodiment, the one or more biomarkers is selected from the markers listed in table 2.
Using the s described herein, a variety of molecules, particularly ing molecules iently small to be able to cross the cell membrane, may be screened in order to identify molecules which modulate, e.g., increase or decrease the expression and/or activity of a marker of the invention. Compounds so identified can be provided to a subject in order to reduce, alleviate or prevent drug-induced cardiotoxicity in the subject.
Accordingly, in another aspect, the invention provides a method for identifying an agent that can reduce or t drug-induced cardiotoxicity comprising: (i) determining a normal level of expression of one or more biomarkers present in a first 7979 cell sample obtained prior to the treatment with a toxicity ng drug; (ii) determining a treated level of expression of the one or more biomarkers present in a second cell sample obtained following the treatment with the toxicity inducing drug to identify one or more biomarkers with a change of expression in the d cell sample; (iii) determining the level of expression of the one or more biomarkers with a changed level of expression in the toxicity inducing drug treated sample present in a third cell sample obtained following the ent with the toxicity inducing drug and the rescue agent; and (iv) comparing the level of expression of the one or more biomarkers determined in the third sample with the level of expression of the one or more biomarkers determined in the first sample; and a ized level of expression of the one or more biomarkers in the third sample as compared to the first sample is an indication that the agent can reduce or prevent drug-induced cardiotoxicity. In one ment, the one or more biomarkers is selected from the s listed in table 2.
In one embodiment, the cells are cells of the cardiovascular system, e.g., cardiomyocytes. In one embodiment, the cells are diabetic cardiomyocytes. In one embodiment, the drug is a drug or candidate drug for treating diabetes, y or cardiovascular e. In one embodiment, the drug is Anthracyclines, 5-Fluorouracil, Cisplatin, Trastuzumab, Gemcitabine, Rosiglitazone, Pioglitazone, Troglitazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, or TNF nists.In one embodiment, a ized level of expression of one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty-five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2 in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
In one embodiment, a ized level of expression of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, PINC, CYB5, PAIl, IBP7 (IGFBP7), lCl7, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4, in the third sample as ed to the first sample is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
In one embodiment, the sample comprises a fluid obtained from the subject. In one embodiment, the fluid is selected from the group consisting of blood fluids, vomit, 808O saliva, lymph, cystic fluid, urine, fluids collected by bronchial lavage, fluids collected by peritoneal rinsing, and gynecological fluids. In one ment, the sample is a blood sample or a component thereof.
In another embodiment, the sample comprises a tissue or component f obtained from the subject. In one embodiment, the tissue is selected from the group consisting of bone, connective tissue, cartilage, lung, liver, kidney, muscle tissue, heart, pancreas, and skin.
In one embodiment, the subject is a human.
In one embodiment, the level of expression of the one or more markers in the biological sample is determined by assaying a transcribed polynucleotide or a portion thereof in the sample. In one embodiment, wherein assaying the transcribed polynucleotide comprises amplifying the transcribed polynucleotide.
In one embodiment, the level of expression of the marker in the subject sample is determined by assaying a protein or a n f in the . In one embodiment, the protein is assayed using a reagent which specifically binds with the n.
In one ment, the level of expression of the one or more markers in the sample is determined using a technique selected from the group consisting of polymerase chain reaction (PCR) ication reaction, reverse-transcriptase PCR analysis, -strand conformation polymorphism analysis (SSCP), mismatch cleavage detection, heteroduplex analysis, Southern blot analysis, Northern blot analysis, Western blot analysis, in situ hybridization, array analysis, deoxyribonucleic acid sequencing, restriction fragment length rphism analysis, and combinations or sub- combinations thereof, of said .
In one embodiment, the level of expression of the marker in the sample is determined using a que selected from the group consisting of immunohistochemistry, immunocytochemistry, flow cytometry, ELISA and mass spectrometry.
In one embodiment, the level of expression of a plurality of markers is determined.
The invention further provides methods for alleviating, reducing or preventing drug-induced toxicity in a subject in need thereof, comprising administering to a subject (e. g., a mammal, a human, or a non-human animal) an agent identified by the screening methods provided herein, thereby reducing or preventing drug-induced 8181 cardiotoxicity in the subject. In one embodiment, the agent is administered to a subject that has already been treated with a cardiotoxicity-inducing drug. In one embodiment, the agent is administered to a t at the same time as treatment of the subject with a toxicity-inducing drug. In one embodiment, the agent is stered to a subject prior to treatment of the subject with a cardiotoxicity-inducing drug.
The invention further provides methods for alleviating, reducing or preventing nduced cardiotoxicity in a subject in need thereof, comprising stering Coenzyme Q10 to the subject (e.g., a mammal, a human, or a non-human animal), thereby reducing or preventing drug-induced cardiotoxicity in the subject. In one embodiment, the Coenzyme Q10 is administered to a t that has already been treated with a cardiotoxicity-inducing drug. In one embodiment, the Coenzyme Q10 is administered to a subject at the same time as treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the Coenzyme Q10 is administered to a subject prior to ent of the subject with a cardiotoxicity-inducing drug. In one embodiment, the drug-induced cardiotoxicity is associated with modulation of expression of one, two, three, four, five, six, seven, eight, nine, ten, , twelve, thirteen, fourteen, fifteen, n, seventeen, eighteen, nineteen, twenty, twenty-five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2. All values presented in the foregoing list can also be the upper or lower limit of , that are intended to be a part of this invention, e.g., between 1 and 5, l and 10, 2 and 5, 2 and 10, or 5 and 10 of the foregoing genes (or proteins).
In one embodiment, the drug-induced toxicity is associated with modulation of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group ting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), lCl7, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4.
The invention further provides biomarkers (e.g, genes and/or proteins) that are useful as predictive markers for drug-induced cardiotoxicity. These biomarkers include the markers listed in table 2. In one embodiment, the predictive markers for drug- induced cardiotoxicity is a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen, markers selected from a group ting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), lCl7, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4. The ordinary skilled artisan would, however, be able to identify additional 8282 biomarkers predictive of drug-induced cardiotoxicity by employing the methods described herein, e. g., by carrying out the methods bed in Example 3 but by using a different drug known to induce cardiotoxicity. Exemplary drug-induced cardiotoxicity biomarkers of the invention are further described below.
GRP78 and GRP75 are also ed to as glucose response proteins. These proteins are associated with endo/sarcoplasmic reticulum stress (ER stress) of myocytes. SERCA, or sarcoendoplasmic reticulum calcium , tes Ca2+ homeostatsis in cardiac cells. Any disruption of these ATPase can lead to cardiac dysfunction and heart failure. Based upon the data provided herein, GRP75 and GRP78 and the edges around them are novel predictors of drug induced cardiotoxicity.
TIMPl, also referred to as TIMP metalloprotease inhibitor 1, is involved with remodeling of extra cellular matrix in association with MMPs. TIMPl expression is correlated with fibrosis of the heart, and a of vascular endothelial cells also induces TIMPl expression. Based upon the data ed herein, TIMPl is a novel predictor of drug induced cardiactoxicity PTX3, also referred to as Pentraxin 3, s to the family of C Reactive Proteins (CRP) and is a good marker of an inflammatory condition of the heart.
However, plasma PTX3 could also be representative of systemic inflammatory se due to sepsis or other medical conditions. Based upon the data ed , PTX3 may be a novel marker of cardiac function or toxicity. Additionally, the edges associated with PTX 3 in the network could form a novel panel of kers.
HSP76, also referred to as HSPA6, is only known to be expressed in endothelial cells and B lymphocytes. There is no known role for this n in cardiac function.
Based upon the data provided herein, HSP76 may be a novel predictor of drug induced cardiotoxicity PDIA4, PDIAl, also referred to as protein disulphide isomerase family A proteins, are associated with ER stress response, like GRPs. There is no known role for these proteins in cardiac function. Based upon the data provided herein, these proteins may be novel predictors of drug induced cardiotoxicity.
CA2Dl is also referred to as calcium channel, voltage-dependent, alpha 2/delta subunit. The alpha-2/delta subunit of voltage-dependent calcium channel regulates calcium current density and activation/inactivation kinetics of the calcium channel.
CA2Dl plays an important role in excitation-contraction coupling in the heart. There is 8383 no known role for this protein in cardiac on. Based upon the data provided herein, CA2Dl is a novel predictor of drug induced cardiotoxicity GPATl is one of four known glycerolphosphate acyltransferase isoforms, and is located on the mitochondrial outer membrane, allowing reciprocal regulation with carnitine palmitoyltransferase-l. GPATl is upregulated transcriptionally by insulin and SREBP-lc and downregulated acutely by AMP-activated protein kinase, consistent with a role in triacylglycerol synthesis. Based upon the data provided herein, GPATl is a novel predictor of drug induced cardiotoxicity.
TAZ, also referred to as Tafazzin, is highly expressed in c and skeletal muscle. TAZ is involved in the metabolism of cardiolipin and functions as a phospholipid-lysophospholipid transacylase. Tafazzin is responsible for remodeling of a phospholipid cardiolipin (CL), the signature lipid of the mitochondrial inner membrane.
Based upon the data provided herein, TAZ is a novel predictor of drug induced cardiotoxicity Various aspects of the invention are described in further detail in the following subsections.
B. Isolated Nucleic Acid Molecules One aspect of the invention pertains to isolated nucleic acid molecules, including nucleic acids which encode a marker protein or a n thereof. Isolated nucleic acids of the invention also include nucleic acid molecules ient for use as hybridization probes to identify marker nucleic acid molecules, and fragments of marker nucleic acid molecules, e.g., those suitable for use as PCR primers for the amplification or mutation of marker c acid molecules. As used herein, the term "nucleic acid le" is intended to e DNA molecules (e. g., cDNA or genomic DNA) and RNA molecules (e. g., mRNA) and analogs of the DNA or RNA generated using tide analogs. The c acid molecule can be single-stranded or -stranded, but preferably is double-stranded DNA.
An "isolated" c acid molecule is one which is separated from other nucleic acid les which are present in the natural source of the nucleic acid molecule. In one embodiment, an ted" nucleic acid molecule is free of sequences (preferably protein-encoding sequences) which naturally flank the nucleic acid (i.e., ces located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism 8484 from which the nucleic acid is derived. For example, in various embodiments, the isolated c acid molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is d. In another embodiment, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular material, or culture medium when produced by recombinant techniques, or ntially free of chemical precursors or other chemicals when ally synthesized. A c acid molecule that is substantially free of cellular al includes preparations having less than about 30%, 20%, 10%, or 5% of heterologous nucleic acid (also referred to herein as a minating nucleic acid").
A nucleic acid le of the present invention can be isolated using standard molecular biology techniques and the sequence information in the database records described herein. Using all or a portion of such nucleic acid sequences, nucleic acid molecules of the invention can be isolated using standard hybridization and cloning techniques (e.g., as described in Sambrook et al., ed., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989).
A c acid molecule of the invention can be amplified using cDNA, mRNA, or genomic DNA as a template and appropriate oligonucleotide primers according to standard PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, nucleotides corresponding to all or a portion of a nucleic acid molecule of the invention can be prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer.
In another preferred embodiment, an isolated c acid molecule of the invention ses a c acid molecule which has a nucleotide sequence complementary to the nucleotide sequence of a marker nucleic acid or to the nucleotide sequence of a nucleic acid encoding a marker n. A nucleic acid molecule which is complementary to a given tide sequence is one which is sufficiently complementary to the given nucleotide sequence that it can hybridize to the given nucleotide sequence thereby g a stable duplex.
Moreover, a nucleic acid molecule of the invention can comprise only a portion of a nucleic acid sequence, wherein the full length nucleic acid sequence comprises a 8585 marker nucleic acid or which encodes a marker protein. Such nucleic acids can be used, for example, as a probe or primer. The probe/primer typically is used as one or more substantially purified oligonucleotides. The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes under stringent conditions to at least about 7, preferably about 15, more preferably about 25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more utive nucleotides of a c acid of the invention.
Probes based on the sequence of a nucleic acid molecule of the invention can be used to detect transcripts or genomic sequences corresponding to one or more s of the invention. The probe comprises a label group attached thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as part of a diagnostic test kit for identifying cells or tissues which mis-express the protein, such as by measuring levels of a c acid molecule encoding the protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining whether a gene encoding the protein has been mutated or deleted.
The ion further encompasses nucleic acid molecules that differ, due to racy of the genetic code, from the nucleotide sequence of c acids encoding a marker protein, and thus encode the same protein.
It will be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid sequence can exist within a population (e. g., the human population). Such genetic polymorphisms can exist among individuals within a population due to natural allelic variation. An allele is one of a group of genes which occur alternatively at a given genetic locus. In on, it will be appreciated that DNA polymorphisms that affect RNA expression levels can also exist that may affect the l expression level of that gene (e. g., by affecting tion or degradation).
As used herein, the phrase "allelic variant" refers to a nucleotide sequence which occurs at a given locus or to a polypeptide encoded by the nucleotide sequence.
As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules sing an open reading frame ng a polypeptide corresponding to a marker of the invention. Such natural allelic variations can typically result in l-5% variance in the nucleotide sequence of a given gene. Alternative alleles can be identified by sequencing the gene of interest in a number of different individuals. This can be readily carried out by using hybridization probes to identify the same genetic locus in a 8686 variety of individuals. Any and all such nucleotide variations and resulting amino acid polymorphisms or variations that are the result of natural c variation and that do not alter the functional activity are ed to be within the scope of the invention.
In another embodiment, an isolated nucleic acid molecule of the invention is at least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500, 4000, 4500, or more tides in length and hybridizes under stringent conditions to a marker nucleic acid or to a nucleic acid encoding a marker protein. As used herein, the term "hybridizes under stringent conditions" is intended to describe conditions for hybridization and washing under which nucleotide sequences at least 60% (65%, 70%, preferably 75%) identical to each other typically remain hybridized to each other. Such stringent ions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6 of Current Protocols in Molecular Biology, John Wiley & Sons, NY. (1989). A preferred, non-limiting example of stringent hybridization conditions are hybridization in 6X sodium de/sodium e (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65°C.
In addition to naturally-occurring allelic variants of a nucleic acid molecule of the ion that can exist in the population, the skilled artisan will further appreciate that sequence changes can be introduced by mutation thereby g to changes in the amino acid sequence of the d protein, t altering the biological activity of the protein encoded thereby. For example, one can make nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid residues. A "non- essential" amino acid residue is a residue that can be altered from the wild-type sequence without altering the ical activity, whereas an "essential" amino acid residue is required for biological activity. For example, amino acid residues that are not conserved or only semi-conserved among gs of various species may be non-essential for activity and thus would be likely targets for alteration. Alternatively, amino acid residues that are conserved among the homologs of various species (e. g., murine and human) may be essential for activity and thus would not be likely targets for tion.
Accordingly, another aspect of the invention pertains to nucleic acid les encoding a variant marker protein that contain changes in amino acid residues that are not essential for activity. Such variant marker proteins differ in amino acid ce from the naturally-occurring marker proteins, yet retain biological activity. In one 8787 embodiment, such a variant marker protein has an amino acid sequence that is at least about 40% identical, 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to the amino acid sequence of a marker protein.
An isolated nucleic acid molecule encoding a variant marker protein can be created by introducing one or more nucleotide substitutions, additions or deletions into the nucleotide sequence of marker nucleic acids, such that one or more amino acid residue substitutions, additions, or deletions are introduced into the encoded protein.
Mutations can be introduced by rd techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid e having a similar side chain. Families of amino acid residues having r side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, ine, serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, phan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, alanine, tryptophan, histidine). Alternatively, ons can be introduced randomly along all or part of the coding sequence, such as by saturation mutagenesis, and the resultant mutants can be ed for biological activity to identify mutants that retain activity. Following mutagenesis, the encoded protein can be sed recombinantly and the activity of the n can be determined.
The present invention encompasses antisense nucleic acid molecules, i.e., molecules which are complementary to a sense nucleic acid of the ion, e. g., complementary to the coding strand of a double-stranded marker cDNA molecule or complementary to a marker mRNA sequence. Accordingly, an antisense c acid of the invention can hydrogen bond to (Le. anneal with) a sense c acid of the invention. The antisense nucleic acid can be complementary to an entire coding strand, or to only a portion thereof, e. g., all or part of the protein coding region (or open reading frame). An antisense c acid molecule can also be antisense to all or part of a non- coding region of the coding strand of a nucleotide sequence encoding a marker protein. 8888 The ding regions ("5' and 3' untranslated regions") are the 5' and 3' sequences which flank the coding region and are not translated into amino acids.
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention can be ucted using chemical sis and enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid (9. g., an nse oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or sly modified nucleotides designed to increase the biological ity of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e. g., phosphorothioate derivatives and acridine substituted nucleotides can be used. Examples of modified nucleotides which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethylthiouridine, 5- carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosquueosine, inosine, N6-isopentenyladenine, l-methylguanine, l-methylinosine, 2,2-dimethylguanine, 2- adenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethylthiouracil, beta- D-mannosquueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- N6-isopentenyladenine, uraciloxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methylthiouracil, 2-thiouracil, 4-thiouracil, 5- methyluracil, uraciloxyacetic acid methylester, uraciloxyacetic acid (v), yl- 2-thiouracil, 3-(3-aminoNcarboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine.
Alternatively, the nse nucleic acid can be produced biologically using an expression vector into which a c acid has been sub-cloned in an antisense orientation (i.e., RNA ribed from the ed nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection).
The antisense c acid molecules of the invention are typically administered to a subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a marker protein to thereby t expression of the marker, e. g., by inhibiting transcription and/or translation. The hybridization can be by tional nucleotide complementarity to form a stable duplex, or, for example, in the 8989 case of an antisense nucleic acid molecule which binds to DNA duplexes, through specific interactions in the major groove of the double helix. Examples of a route of administration of antisense nucleic acid molecules of the invention includes direct injection at a tissue site or on of the antisense nucleic acid into toxicity state associated body fluid. Alternatively, antisense nucleic acid molecules can be modified to target ed cells and then administered systemically. For example, for systemic administration, antisense molecules can be modified such that they specifically bind to receptors or antigens expressed on a selected cell surface, e. g., by linking the antisense nucleic acid molecules to es or antibodies which bind to cell e receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using the vectors described herein. To achieve sufficient intracellular concentrations of the antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter are preferred.
An antisense nucleic acid molecule of the invention can be an oc-anomeric nucleic acid molecule. An oc-anomeric nucleic acid molecule forms specific double- stranded s with complementary RNA in which, contrary to the usual (x-units, the strands run parallel to each other (Gaultier et al., 1987, Nucleic Acids Res. 15:6625- 6641). The antisense c acid le can also comprise a 2 methylribonucleotide (Inoue et al., 1987, Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327-330).
The ion also encompasses ribozymes. Ribozymes are tic RNA molecules with ribonuclease activity which are e of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in Haselhoff and Gerlach, 1988, Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby inhibit translation of the protein encoded by the mRNA. A ribozyme having icity for a nucleic acid molecule encoding a marker protein can be designed based upon the nucleotide ce of a cDNA corresponding to the . For example, a derivative of a Tetrahymena L—19 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary to the tide sequence to be cleaved (see Cech et a1. U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent No. 5,116,742).
Alternatively, an mRNA encoding a polypeptide of the invention can be used to select a 9090 catalytic RNA having a specific ribonuclease activity from a pool of RNA les (see, e.g., Bartel and Szostak, 1993, Science 261:1411-1418).
The invention also encompasses nucleic acid molecules which form triple helical structures. For example, expression of a marker of the invention can be ted by targeting nucleotide sequences complementary to the regulatory region of the gene encoding the marker nucleic acid or protein (e. g., the promoter and/or enhancer) to form triple helical structures that prevent transcription of the gene in target cells. See generally Helene (1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. NY.
Acad. Sci. 660:27-36; and Maher (1992) Bioassays 14(12):807-15.
In various ments, the nucleic acid molecules of the invention can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e. g., the stability, hybridization, or solubility of the molecule. For e, the deoxyribose phosphate backbone of the nucleic acids can be modified to te peptide nucleic acids (see Hyrup et al., 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, the terms "peptide c acids" or "PNAs" refer to nucleic acid , e. g., DNA mimics, in which the deoxyribose phosphate backbone is ed by a pseudopeptide backbone and only the four natural nucleobases are ed. The neutral backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996), supra; Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670- 675.
PNAs can be used in therapeutic and diagnostic applications. For e, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, e. g., inducing transcription or translation arrest or inhibiting replication. PNAs can also be used, e. g., in the analysis of single base pair mutations in a gene by, e. g., PNA directed PCR clamping; as artificial ction enzymes when used in combination with other enzymes, e. g., 81 ses (Hyrup (1996), supra; or as probes or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry- O'Keefe et al., 1996, Proc. Natl. Acad. Sci. USA 93:14670-675).
In another embodiment, PNAs can be modified, e. g., to enhance their ity or ar uptake, by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 9191 known in the art. For example, PNA-DNA chimeras can be generated which can combine the advantageous properties of PNA and DNA. Such chimeras allow DNA ition enzymes, e. g., RNase H and DNA rases, to interact with the DNA portion while the PNA portion would provide high binding affinity and specificity.
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup, 1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup (1996), supra, and Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63. For e, a DNA chain can be synthesized on a solid support using rd phosphoramidite ng chemistry and modified nucleoside s. Compounds such as 5'—(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite can be used as a link between the PNA and the 5' end of DNA (Mag et al., 1989, Nucleic Acids Res. 17:5973-88). PNA monomers are then coupled in a step-wise manner to e a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al., 1996, Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment (Peterser et al., 1975, Bioorganic Med. Chem. Lett. 5:1119-11124).
In other embodiments, the oligonucleotide can include other appended groups such as peptides (e. g., for ing host cell receptors in vivo), or agents facilitating transport across the cell membrane (see, e. g., Letsinger et al., 1989, Proc. Natl. Acad.
Sci. USA 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or the brain barrier (see, e. g., PCT Publication No. W0 89/10134). In addition, oligonucleotides can be modified with hybridization-triggered cleavage agents (see, e. g., Krol et al., 1988, Bio/Techniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, the oligonucleotide can be conjugated to another molecule, e. g., a peptide, hybridization triggered cross-linking agent, ort agent, hybridization-triggered cleavage agent, etc.
The invention also includes molecular beacon nucleic acids having at least one region which is mentary to a c acid of the invention, such that the lar beacon is useful for quantitating the presence of the nucleic acid of the invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a pair of complementary regions and having a fluorophore and a fluorescent quencher 9292 associated therewith. The fluorophore and quencher are ated with different portions of the nucleic acid in such an orientation that when the complementary regions are annealed with one another, fluorescence of the fluorophore is quenched by the quencher. When the complementary regions of the nucleic acid are not annealed with one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular beacon nucleic acids are described, for example, in US. Patent 5,876,930.
C. Isolated Proteins and dies One aspect of the invention pertains to isolated marker proteins and biologically active portions f, as well as ptide fragments suitable for use as immunogens to raise antibodies directed against a marker protein or a fragment thereof. In one embodiment, the native marker protein can be isolated from cells or tissue sources by an appropriate purification scheme using standard protein purification techniques. In another embodiment, a protein or peptide sing the whole or a segment of the marker protein is produced by recombinant DNA techniques. ative to recombinant expression, such protein or peptide can be synthesized chemically using standard peptide synthesis techniques.
An "isolated" or ied" protein or biologically active portion thereof is substantially free of cellular material or other contaminating proteins from the cell or tissue source from which the protein is derived, or substantially free of chemical precursors or other chemicals when chemically synthesized. The language "substantially free of ar material" includes preparations of protein in which the protein is separated from cellular components of the cells from which it is ed or recombinantly produced. Thus, protein that is substantially free of cellular material includes ations of n having less than about 30%, 20%, 10%, or 5% (by dry weight) of heterologous protein (also referred to herein as a "contaminating protein").
When the protein or biologically active portion thereof is recombinantly produced, it is also preferably substantially free of culture medium, i. 6., culture medium represents less than about 20%, 10%, or 5% of the volume of the protein ation. When the protein is produced by chemical synthesis, it is preferably substantially free of al precursors or other chemicals, i.e., it is ted from chemical precursors or other chemicals which are involved in the synthesis of the protein. Accordingly such 9393 preparations of the protein have less than about 30%, 20%, 10%, 5% (by dry weight) of chemical precursors or compounds other than the polypeptide of interest.
Biologically active ns of a marker protein include polypeptides comprising amino acid sequences sufficiently identical to or derived from the amino acid sequence of the marker n, which include fewer amino acids than the full length protein, and exhibit at least one activity of the corresponding full-length protein. Typically, biologically active portions comprise a domain or motif with at least one activity of the corresponding full-length protein. A biologically active portion of a marker protein of the invention can be a ptide which is, for example, 10, 25, 50, 100 or more amino acids in length. er, other biologically active portions, in which other regions of the marker protein are deleted, can be ed by inant techniques and evaluated for one or more of the functional activities of the native form of the marker protein.
Preferred marker proteins are encoded by nucleotide sequences comprising the sequences encoding any of the genes described in the examples. Other useful ns are substantially identical (e.g., at least about 40%, preferably 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%) to one of these sequences and retain the functional activity of the ponding naturally-occurring marker protein yet differ in amino acid sequence due to natural allelic variation or mutagenesis.
To determine the percent identity of two amino acid sequences or of two nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second amino or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is ed by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. Preferably, the percent identity between the two sequences is calculated using a global alignment. Alternatively, the percent ty between the two sequences is ated using a local alignment. The percent identity between the two ces is a function of the number of identical positions shared by the sequences (i.e., % identity = # of identical positions/total # of positions (e.g., overlapping ons) x100). In one embodiment the two sequences are the same length. In another ment, the two sequences are not the same length. 9494 The determination of percent identity between two sequences can be lished using a mathematical algorithm. A preferred, non-limiting example of a atical algorithm utilized for the comparison of two sequences is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is orated into the BLASTN and BLASTX programs of Altschul, et al. (1990) J. Mol. Biol. 215:403-410. BLAST tide searches can be performed with the BLASTN program, score = 100, ngth = 12 to obtain nucleotide sequences homologous to a nucleic acid molecules of the invention. BLAST protein searches can be med with the BLASTP program, score = 50, wordlength = 3 to obtain amino acid sequences homologous to a n molecules of the invention. To obtain gapped ents for comparison purposes, a newer version of the BLAST algorithm called Gapped BLAST can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. :3389-3402, which is able to perform gapped local alignments for the programs BLASTN, BLASTP and BLASTX. Alternatively, PSI-Blast can be used to perform an iterated search which detects distant relationships n molecules. When ing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the tive programs (e.g., BLASTX and BLASTN) can be used. See http://www.ncbi.nlm.nih.gov. Another preferred, miting example of a mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, (1988) CABIOS 4:11-17. Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package. When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. Yet another useful algorithm for identifying regions of local sequence similarity and alignment is the FASTA algorithm as described in Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 85:2444-2448. When using the FASTA thm for comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for example, be used with a k-tuple value of 2.
The percent identity between two sequences can be determined using techniques similar to those described above, with or without allowing gaps. In calculating percent identity, only exact matches are counted. 9595 The invention also provides chimeric or fusion proteins comprising a marker protein or a segment thereof. As used herein, a "chimeric protein" or "fusion protein" comprises all or part (preferably a biologically active part) of a marker n operably linked to a heterologous polypeptide (i.e., a polypeptide other than the marker protein).
Within the fusion protein, the term "operably linked" is intended to indicate that the marker protein or segment thereof and the heterologous polypeptide are fused in-frame to each other. The heterologous ptide can be fused to the amino-terminus or the carboxyl-terminus of the marker protein or t.
One useful fusion protein is a GST fusion protein in which a marker n or segment is fused to the carboxyl us of GST sequences. Such fusion proteins can facilitate the purification of a recombinant polypeptide of the ion.
In another embodiment, the fusion protein contains a heterologous signal sequence at its amino terminus. For example, the native signal sequence of a marker protein can be removed and replaced with a signal sequence from another protein. For example, the gp67 secretory sequence of the baculovirus envelope protein can be used as a heterologous signal sequence (Ausubel et al., ed., Current Protocols in lar Biology, John Wiley & Sons, NY, 1992). Other examples of eukaryotic heterologous signal sequences include the secretory sequences of melittin and human placental alkaline phosphatase (Stratagene; La Jolla, California). In yet another example, useful prokaryotic logous signal sequences include the phoA secretory signal (Sambrook et al., supra) and the protein A secretory signal (Pharmacia Biotech; away, New Jersey).
In yet another embodiment, the fusion protein is an immunoglobulin fusion protein in which all or part of a marker protein is fused to sequences derived from a member of the immunoglobulin protein family. The immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an ction between a ligand (soluble or membrane-bound) and a protein on the surface of a cell (receptor), to thereby suppress signal transduction in vivo.
The immunoglobulin fusion protein can be used to affect the bioavailability of a cognate ligand of a marker protein. Inhibition of ligand/receptor interaction can be useful therapeutically, both for treating proliferative and differentiative ers and for modulating (e. g. promoting or ting) cell survival. er, the globulin fusion proteins of the ion can be used as immunogens to produce antibodies 9696 directed against a marker protein in a subject, to purify ligands and in ing assays to identify molecules which inhibit the interaction of the marker protein with ligands.
Chimeric and fusion proteins of the invention can be produced by standard recombinant DNA techniques. In another embodiment, the fusion gene can be synthesized by conventional techniques including ted DNA sizers.
Alternatively, PCR amplification of gene nts can be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed and re-amplified to generate a chimeric gene sequence (see, e.g., Ausubel et (1]., . er, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST polypeptide).
A nucleic acid encoding a polypeptide of the invention can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the polypeptide of the invention.
A signal sequence can be used to facilitate secretion and isolation of marker proteins. Signal sequences are lly characterized by a core of hydrophobic amino acids which are generally d from the mature protein during secretion in one or more cleavage events. Such signal peptides contain processing sites that allow cleavage of the signal sequence from the mature proteins as they pass through the secretory y. Thus, the invention pertains to marker proteins, fusion proteins or segments thereof having a signal sequence, as well as to such proteins from which the signal sequence has been proteolytically cleaved (i.e., the ge products). In one embodiment, a c acid sequence encoding a signal sequence can be operably linked in an sion vector to a protein of interest, such as a marker protein or a segment thereof. The signal sequence directs secretion of the protein, such as from a eukaryotic host into which the expression vector is transformed, and the signal sequence is subsequently or rently cleaved. The protein can then be readily purified from the extracellular medium by art recognized methods. Alternatively, the signal sequence can be linked to the protein of interest using a sequence which facilitates purification, such as with a GST domain.
The present invention also pertains to variants of the marker ns. Such variants have an altered amino acid sequence which can function as either agonists (mimetics) or as antagonists. Variants can be generated by mutagenesis, e.g., discrete point mutation or truncation. An t can retain substantially the same, or a subset, 9797 of the biological activities of the naturally occurring form of the protein. An antagonist of a protein can inhibit one or more of the activities of the naturally occurring form of the protein by, for example, itively binding to a downstream or upstream member of a ar signaling cascade which es the protein of interest. Thus, specific biological effects can be elicited by treatment with a variant of limited function.
Treatment of a subject with a variant having a subset of the biological activities of the naturally occurring form of the n can have fewer side effects in a subject relative to ent with the naturally occurring form of the protein.
Variants of a marker protein which function as either agonists (mimetics) or as antagonists can be identified by ing combinatorial ies of s, e. g., truncation mutants, of the protein of the invention for agonist or antagonist activity. In one embodiment, a variegated library of variants is ted by combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated library of variants can be produced by, for example, enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of potential protein ces is expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins (e. g., for phage display). There are a variety of methods which can be used to produce libraries of potential variants of the marker proteins from a degenerate oligonucleotide sequence. Methods for synthesizing degenerate oligonucleotides are known in the art (see, e. g., Narang, 1983, Tetrahedron 39:3; Itakura et al., 1984, Annu. Rev. Biochem. 53:323; Itakura et al., 1984, Science 56; Ike et al., 1983 Nucleic Acid Res. 11:477).
In addition, libraries of segments of a marker protein can be used to generate a variegated population of polypeptides for screening and subsequent selection of variant marker proteins or segments thereof. For example, a library of coding sequence fragments can be ted by treating a double stranded PCR fragment of the coding sequence of st with a se under conditions wherein nicking occurs only about once per molecule, denaturing the double stranded DNA, ring the DNA to form double stranded DNA which can include sense/antisense pairs from different nicked products, removing single stranded portions from reformed duplexes by treatment with $1 nuclease, and ng the resulting fragment library into an expression vector. By this method, an sion library can be derived which encodes amino terminal and al fragments of various sizes of the protein of interest. 9898 Several techniques are known in the art for screening gene products of atorial libraries made by point mutations or truncation, and for screening cDNA libraries for gene products having a ed property. The most widely used techniques, which are amenable to high through-put analysis, for screening large gene libraries typically e cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the combinatorial genes under conditions in which detection of a desired activity facilitates isolation of the vector encoding the gene whose t was detected. Recursive ensemble mutagenesis (REM), a technique which enhances the frequency of functional mutants in the libraries, can be used in combination with the screening assays to identify variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl. Acad. Sci.
USA 89:7811-7815; Delgrave et al., 1993, Protein Engineering 6(3):327- 331).
Another aspect of the invention pertains to antibodies directed against a protein of the invention. In red embodiments, the antibodies specifically bind a marker protein or a nt thereof. The terms "antibody" and "antibodies" as used interchangeably herein refer to immunoglobulin les as well as fragments and derivatives f that comprise an immunologically active portion of an immunoglobulin molecule, (i.e., such a portion contains an antigen binding site which specifically binds an antigen, such as a marker protein, e. g., an e of a marker protein). An antibody which specifically binds to a protein of the invention is an antibody which binds the protein, but does not substantially bind other les in a sample, e. g., a biological sample, which naturally contains the protein. Examples of an immunologically active portion of an immunoglobulin molecule include, but are not limited to, -chain antibodies (scAb), F(ab) and F(ab’)2 fragments.
An isolated protein of the ion or a fragment thereof can be used as an immunogen to generate antibodies. The full-length protein can be used or, alternatively, the invention provides antigenic peptide nts for use as immunogens. The antigenic peptide of a protein of the invention comprises at least 8 (preferably 10, 15, 20, or 30 or more) amino acid residues of the amino acid ce of one of the proteins of the invention, and encompasses at least one e of the protein such that an antibody raised against the peptide forms a specific immune complex with the protein. Preferred epitopes encompassed by the nic peptide are regions that are located on the surface of the protein, e. g., hydrophilic regions. Hydrophobicity sequence analysis, 9999 hydrophilicity sequence analysis, or similar es can be used to identify hydrophilic regions. In preferred embodiments, an isolated marker protein or fragment thereof is used as an gen.
An immunogen typically is used to prepare antibodies by immunizing a suitable (i.e. immunocompetent) subject such as a rabbit, goat, mouse, or other mammal or vertebrate. An appropriate immunogenic preparation can contain, for example, recombinantly-expressed or chemically-synthesized protein or peptide. The preparation can further include an adjuvant, such as 's complete or incomplete adjuvant, or a similar immunostimulatory agent. Preferred immunogen compositions are those that contain no other human proteins such as, for example, gen compositions made using a non-human host cell for recombinant expression of a protein of the invention. In such a manner, the resulting antibody compositions have reduced or no binding of human proteins other than a protein of the invention.
The invention es onal and monoclonal antibodies. The term lonal antibody" or "monoclonal antibody composition", as used herein, refers to a population of dy molecules that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope. red polyclonal and monoclonal antibody compositions are ones that have been selected for antibodies directed t a protein of the invention. Particularly preferred polyclonal and monoclonal antibody preparations are ones that contain only antibodies directed against a marker n or fragment thereof. onal antibodies can be prepared by immunizing a suitable subject with a protein of the invention as an immunogen. The antibody titer in the immunized subject can be monitored over time by standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) using immobilized polypeptide. At an appropriate time after immunization, e. g., when the specific antibody titers are highest, antibody- ing cells can be obtained from the subject and used to prepare monoclonal antibodies (mAb) by standard techniques, such as the hybridoma technique originally described by Kohler and in (1975) Nature 256:495-497, the human B cell hybridoma technique (see Kozbor et al., 1983, Immunol. Today 4:72), the EBV- hybridoma technique (see Cole et al., pp. 77-96 In Monoclonal dies and Cancer Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing hybridomas is well known (see generally Current Protocols in Immunology, Coligan et 100100 al. ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a monoclonal antibody of the invention are detected by screening the oma culture supernatants for antibodies that bind the polypeptide of interest, e. g., using a standard ELISA assay.
Alternative to preparing onal antibody-secreting hybridomas, a monoclonal antibody directed against a protein of the invention can be identified and isolated by ing a recombinant combinatorial immunoglobulin library (e.g., an antibody phage display library) with the polypeptide of interest. Kits for generating and ing phage display libraries are commercially available (e. g., the Pharmacia inant Phage Antibody System, Catalog No. 2701; and the gene SuerAP Phage Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly le for use in generating and screening antibody display library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No.
W0 92/18619; PCT Publication No. W0 91/17271; PCT Publication No. WO 92/20791; PCT Publication No. W0 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275- 1281; Griffiths et al. (1993) EMBO J. 12:725-734.
The invention also provides recombinant antibodies that specifically bind a protein of the invention. In preferred embodiments, the recombinant antibodies specifically binds a marker n or fragment thereof. Recombinant antibodies e, but are not d to, chimeric and zed monoclonal antibodies, comprising both human and non-human portions, single-chain antibodies and multi- ic antibodies. A chimeric dy is a molecule in which different portions are derived from different animal species, such as those having a le region derived from a murine mAb and a human immunoglobulin constant region. (See, e. g., Cabilly et al., U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are incorporated herein by reference in their entirety.) Single-chain dies have an antigen binding site and consist of a single polypeptide. They can be produced by techniques known in the art, for example using methods described in Ladner et. al U.S.
Pat. No. 4,946,778 (which is incorporated herein by nce in its entirety); Bird et al., (1988) Science 242:423-426; Whitlow et al., (1991) Methods in Enzymology 2:1-9; 101101 Whitlow et al., (1991) Methods in Enzymology 2:97-105; and Huston et al., (1991) Methods in Enzymology Molecular Design and Modeling: Concepts and Applications 203:46-88. Multi-specific dies are antibody molecules having at least two antigen-binding sites that specifically bind different antigens. Such molecules can be produced by techniques known in the art, for example using methods described in Segal, U.S. Patent No. 4,676,980 (the disclosure of which is incorporated herein by reference in its entirety); Holliger et al., (1993) Proc. Natl. Acad. Sci. USA 90:6444-6448; Whitlow et al., (1994) Protein Eng. 7:1017-1026 and U.S. Pat. No. 424.
Humanized antibodies are antibody molecules from non-human species having one or more complementarity determining regions (CDRs) from the non-human species and a framework region from a human globulin molecule. (See, e. g., Queen, U.S. Patent No. 5,585,089, which is incorporated herein by reference in its ty.) Humanized monoclonal antibodies can be produced by recombinant DNA techniques known in the art, for e using methods described in PCT Publication No. WO 87/02671; European Patent Application 184,187; European Patent ation 171,496; European Patent Application 173,494; PCT ation No. WO 86/01533; U.S. Patent No. 4,816,567; European Patent Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521- 3526; Sun et al. (1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura et al. (1987) Cancer Res. 47:999-1005; Wood et al. (1985) Nature 314:446-449; and Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559); Morrison (1985) e 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214; U.S.
Patent 5,225,539; Jones et al. (1986) Nature 321:552—525; Verhoeyan et al. (1988) Science 239:1534; and Beidler et al. (1988) J. Immunol. 141:4053-4060.
More particularly, humanized antibodies can be produced, for example, using transgenic mice which are incapable of expressing endogenous immunoglobulin heavy and light chains genes, but which can express human heavy and light chain genes. The transgenic mice are immunized in the normal fashion with a selected n, e. g., all or a portion of a polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed against the antigen can be ed using conventional hybridoma technology. The human globulin enes harbored by the transgenic mice rearrange during B cell differentiation, and subsequently undergo class switching and somatic mutation. Thus, using such a technique, it is le to produce therapeutically 102102 useful IgG, IgA and IgE antibodies. For an overview of this technology for producing human antibodies, see g and Huszar (1995) Int. Rev. Immunol. 13:65-93). For a ed discussion of this technology for producing human antibodies and human monoclonal antibodies and protocols for producing such antibodies, see, e.g., U.S.
Patent 5,625,126; U.S. Patent 5,633,425; US. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 5,545,806. In addition, ies such as Abgenix, Inc. (Freemont, CA), can be engaged to provide human dies directed against a ed antigen using technology similar to that described above.
Completely human antibodies which recognize a selected epitope can be generated using a technique referred to as " guided selection." In this approach a selected non-human monoclonal dy, e. g., a murine antibody, is used to guide the selection of a completely human antibody recognizing the same epitope (Jespers et al., 1994, Bio/technology 12:899-903).
The antibodies of the ion can be isolated after production (e. g., from the blood or serum of the subject) or synthesis and further purified by well-known techniques. For example, IgG antibodies can be ed using protein A chromatography. dies specific for a protein of the invention can be ed or (e.g., partially purified) or purified by, e. g., affinity chromatography. For example, a recombinantly expressed and purified (or partially purified) protein of the invention is produced as described herein, and covalently or non-covalently coupled to a solid support such as, for example, a chromatography column. The column can then be used to affinity purify antibodies specific for the proteins of the invention from a sample containing antibodies directed t a large number of different epitopes, thereby generating a substantially purified antibody composition, i.e., one that is substantially free of contaminating antibodies. By a ntially ed dy ition is meant, in this context, that the antibody sample contains at most only 30% (by dry weight) of contaminating antibodies directed against epitopes other than those of the desired protein of the invention, and preferably at most 20%, yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the sample is contaminating antibodies. A purified antibody composition means that at least 99% of the antibodies in the composition are directed against the desired protein of the invention. 103103 In a preferred embodiment, the substantially purified antibodies of the invention may specifically bind to a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a cytoplasmic domain or cytoplasmic membrane of a protein of the invention. In a ularly preferred embodiment, the substantially purified antibodies of the invention ically bind to a secreted sequence or an extracellular domain of the amino acid ces of a protein of the invention. In a more preferred embodiment, the substantially purified antibodies of the invention specifically bind to a secreted sequence or an extracellular domain of the amino acid sequences of a marker protein.
An antibody directed against a protein of the ion can be used to isolate the protein by standard techniques, such as affinity chromatography or immunoprecipitation.
Moreover, such an antibody can be used to detect the marker protein or fragment thereof (6. g., in a cellular lysate or cell supernatant) in order to evaluate the level and pattern of expression of the marker. The antibodies can also be used diagnostically to monitor protein levels in tissues or body fluids (e. g. in ty state associated body fluid) as part of a al testing procedure, e. g., to, for example, determine the efficacy of a given treatment regimen. Detection can be facilitated by the use of an antibody derivative, which comprises an antibody of the invention coupled to a detectable substance.
Examples of able substances include s enzymes, prosthetic groups, cent materials, luminescent materials, inescent materials, and radioactive materials. Examples of le enzymes include horseradish peroxidase, alkaline phosphatase, ctosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or rythrin; an example of a luminescent material includes luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of suitable 125 35 131 radioactive material include. . . . I, I, S or 3H.
Antibodies of the invention may also be used as therapeutic agents in treating cancers. In a preferred embodiment, completely human antibodies of the invention are used for therapeutic treatment of human cancer patients, particularly those having a . In another preferred embodiment, antibodies that bind specifically to a marker protein or fragment thereof are used for eutic treatment. Further, such therapeutic antibody may be an antibody tive or immunotoxin comprising an antibody 104104 conjugated to a therapeutic moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or cytotoxic agent includes any agent that is ental to cells. Examples include taxol, cytochalasin B, gramicidin D, um bromide, emetine, cin, etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, drotestosterone, glucocorticoids, ne, tetracaine, lidocaine, nolol, and puromycin and analogs or homologs f. Therapeutic agents include, but are not limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents (e.g., mechlorethamine, thioepa chlorambucil, melphalan, carmustine (BSNU) and lomustine (CCNU), cyclothosphamide, an, omannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly mycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e. g., vincristine and vinblastine).
The ated antibodies of the invention can be used for modifying a given biological response, for the drug moiety is not to be construed as limited to classical chemical therapeutic agents. For example, the drug moiety may be a protein or polypeptide possessing a desired biological activity. Such proteins may include, for example, a toxin such as ribosome-inhibiting protein (see Better et al., U.S. Patent No. 6,146,631, the disclosure of which is incorporated herein in its entirety), abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor necrosis factor, .alpha.-interferon, B-interferon, nerve growth , et derived growth factor, tissue plasminogen activator; or, biological response modifiers such as, for example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte macrophase colony ating factor ("GM-CSF"), granulocyte colony ating factor ("G-CSF"), or other growth factors.
Techniques for conjugating such therapeutic moiety to antibodies are well known, see, e. g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. , pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer 105105 Therapy: A Review", in onal Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506 ; "Analysis, Results, And Future Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in Monoclonal Antibodies For Cancer Detection And y, Baldwin et al. (eds.), pp. 303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:119-58 (1982).
Accordingly, in one aspect, the invention provides substantially purified antibodies, antibody fragments and derivatives, all of which specifically bind to a protein of the invention and preferably, a marker protein. In various embodiments, the substantially purified antibodies of the invention, or fragments or derivatives thereof, can be human, non-human, chimeric and/or humanized dies. In another aspect, the invention provides non-human antibodies, dy fragments and derivatives, all of which specifically bind to a protein of the invention and preferably, a marker protein.
Such non-human dies can be goat, mouse, sheep, horse, chicken, rabbit, or rat antibodies. Alternatively, the non-human antibodies of the invention can be chimeric and/or humanized antibodies. In on, the non-human antibodies of the invention can be onal dies or monoclonal antibodies. In still a further aspect, the invention provides onal antibodies, antibody nts and tives, all of which specifically bind to a protein of the invention and preferably, a marker protein.
The monoclonal antibodies can be human, humanized, chimeric and/or non-human antibodies.
The invention also provides a kit containing an antibody of the invention conjugated to a detectable substance, and instructions for use. Still another aspect of the invention is a pharmaceutical composition comprising an antibody of the invention. In one embodiment, the pharmaceutical composition comprises an antibody of the invention and a pharmaceutically acceptable carrier.
D. Predictive Medicine The present invention pertains to the field of predictive ne in which diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical trails are used for prognostic (predictive) es to thereby treat an individual prophylactically. Accordingly, one aspect of the present invention relates to diagnostic assays for determining the level of sion of one or more marker proteins or nucleic 106106 acids, in order to determine whether an individual is at risk of developing drug-induced ty. Such assays can be used for stic or predictive es to thereby prophylactically treat an individual prior to the onset of the disorder.
Yet another aspect of the invention pertains to monitoring the influence of agents (6. g., drugs or other compounds administered either to inhibit or to treat or prevent or drug-induced toxicity {i.e. in order to understand any drug-induced toxic effects that such treatment may have}) on the expression or activity of a marker of the invention in clinical trials. These and other agents are described in further detail in the following E. Diagnostic Assays An exemplary method for detecting the presence or absence of a marker protein or nucleic acid in a biological sample involves ing a ical sample (e. g. toxicity-associated body fluid or tissue sample) from a test subject and contacting the biological sample with a compound or an agent capable of detecting the polypeptide or nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection methods of the invention can thus be used to detect mRNA, protein, cDNA, or c DNA, for example, in a biological sample in vitro as well as in vivo. For example, in vitro techniques for detection of mRNA include Northern hybridizations and in situ izations. In vitro techniques for detection of a marker protein include enzyme linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and fluorescence. In vitro techniques for detection of genomic DNA include Southern hybridizations. In vivo techniques for detection of mRNA include polymerase chain reaction (PCR), Northern hybridizations and in situ hybridizations. Furthermore, in vivo techniques for detection of a marker protein include introducing into a subject a labeled antibody directed against the protein or fragment thereof. For example, the antibody can be labeled with a radioactive marker whose presence and location in a subject can be ed by standard imaging techniques.
A l principle of such diagnostic and prognostic assays involves preparing a sample or reaction mixture that may n a marker, and a probe, under appropriate conditions and for a time sufficient to allow the marker and probe to interact and bind, thus forming a complex that can be removed and/or ed in the reaction mixture.
These assays can be conducted in a variety of ways. 107107 For example, one method to conduct such an assay would involve anchoring the marker or probe onto a solid phase support, also referred to as a substrate, and detecting target marker/probe complexes anchored on the solid phase at the end of the on. In one embodiment of such a method, a sample from a subject, which is to be assayed for presence and/or concentration of marker, can be anchored onto a carrier or solid phase support. In another embodiment, the reverse situation is possible, in which the probe can be anchored to a solid phase and a sample from a subject can be allowed to react as an unanchored component of the assay.
There are many established methods for anchoring assay ents to a solid phase. These include, without limitation, marker or probe molecules which are immobilized through conjugation of biotin and streptavidin. Such biotinylated assay components can be prepared from biotin-NHS (N-hydroxy-succinimide) using ques known in the art (e. g., biotinylation kit, Pierce Chemicals, Rockford, IL), and lized in the wells of streptavidin-coated 96 well plates e Chemical). In certain embodiments, the surfaces with immobilized assay components can be ed in advance and stored.
Other suitable carriers or solid phase supports for such assays include any material capable of binding the class of molecule to which the marker or probe s.
Well-known supports or carriers include, but are not limited to, glass, yrene, nylon, polypropylene, nylon, polyethylene, dextran, amylases, natural and modified celluloses, polyacrylamides, gabbros, and magnetite.
In order to conduct assays with the above mentioned approaches, the non- immobilized component is added to the solid phase upon which the second component is anchored. After the reaction is complete, uncomplexed components may be removed (6. g., by g) under conditions such that any complexes formed will remain immobilized upon the solid phase. The detection of marker/probe complexes ed to the solid phase can be accomplished in a number of methods outlined herein.
In a preferred embodiment, the probe, when it is the ored assay component, can be labeled for the purpose of detection and readout of the assay, either directly or indirectly, with detectable labels discussed herein and which are well-known to one skilled in the art.
It is also possible to directly detect /probe complex formation without further manipulation or labeling of either component (marker or probe), for example by 108108 utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et al., US. Patent No. 5,631,169; Stavrianopoulos, et al., US. Patent No. 4,868,103). A fluorophore label on the first, ‘donor’ molecule is ed such that, upon excitation with incident light of appropriate wavelength, its emitted fluorescent energy will be absorbed by a fluorescent label on a second ‘acceptor’ molecule, which in turn is able to fluoresce due to the ed energy. Altemately, the ‘donor’ n molecule may simply utilize the natural fluorescent energy of phan residues. Labels are chosen that emit different wavelengths of light, such that the ‘acceptor’ molecule label may be differentiated from that of the ‘donor’. Since the efficiency of energy transfer between the labels is related to the ce separating the les, spatial relationships between the molecules can be assessed. In a situation in which binding occurs between the molecules, the fluorescent emission of the ‘acceptor’ molecule label in the assay should be maximal. An FET binding event can be conveniently ed h standard fluorometric detection means well known in the art (e.g., using a fluorimeter).
In another embodiment, determination of the ability of a probe to recognize a marker can be accomplished without labeling either assay component (probe or marker) by utilizing a technology such as real-time Biomolecular Interaction is (BIA) (see, e.g., Sjolander, S. and Urbaniczky, C., 1991, Anal. Chem. 63:2338-2345 and Szabo et al., 1995, Curr. Opin. Struct. Biol. 5:699-705). As used , “BIA” or “surface plasmon resonance” is a technology for studying biospecific interactions in real time, without labeling any of the interactants (e. g., BIAcore). s in the mass at the binding surface (indicative of a binding event) result in alterations of the refractive index of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), resulting in a detectable signal which can be used as an tion of real-time reactions between biological molecules.
Alternatively, in another embodiment, analogous diagnostic and prognostic assays can be conducted with marker and probe as solutes in a liquid phase. In such an assay, the xed marker and probe are separated from uncomplexed components by any of a number of standard techniques, including but not limited to: differential centrifugation, tography, electrophoresis and immunoprecipitation. In differential centrifugation, marker/probe xes may be separated from uncomplexed assay components through a series of centrifugal steps, due to the different sedimentation equilibria of xes based on their different sizes and densities (see, 109109 for example, Rivas, G., and , AP, 1993, Trends Biochem Sci. 18(8):284-7).
Standard chromatographic techniques may also be utilized to separate complexed molecules from uncomplexed ones. For example, gel filtration chromatography separates les based on size, and through the utilization of an riate gel tion resin in a column format, for example, the relatively larger complex may be separated from the relatively smaller uncomplexed components. Similarly, the relatively different charge ties of the marker/probe complex as compared to the uncomplexed components may be exploited to differentiate the complex from uncomplexed components, for example through the utilization of ion-exchange chromatography resins. Such resins and chromatographic techniques are well known to one skilled in the art (see, e. g., Heegaard, NH, 1998, J. Mol. Recognit. Winter 11(1- -8; Hage, D.S., and Tweed, S.A. J Chromatogr B Biomed Sci Appl 1997 Oct ;699(1-2):499-525). Gel electrophoresis may also be employed to separate complexed assay components from d components (see, e. g., Ausubel et al., ed., Current ols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). In this technique, protein or nucleic acid complexes are separated based on size or charge, for example. In order to maintain the binding interaction during the electrophoretic process, non-denaturing gel matrix materials and conditions in the absence of reducing agent are typically preferred. Appropriate conditions to the particular assay and components thereof will be well known to one skilled in the art.
In a particular embodiment, the level of marker mRNA can be ined both by in situ and by in vitro formats in a ical sample using methods known in the art.
The term "biological sample" is intended to include tissues, cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, cells and fluids present within a subject. Many expression detection methods use isolated RNA. For in vitro methods, any RNA isolation technique that does not select t the isolation of mRNA can be utilized for the purification of RNA from cells (see, e. g., Ausubel et al., ed., Current Protocols in lar Biology, John Wiley & Sons, New York 1987-1999).
Additionally, large numbers of tissue samples can readily be processed using techniques well known to those of skill in the art, such as, for example, the single-step RNA isolation process of Chomczynski (1989, US. Patent No. 4,843,155).
The isolated mRNA can be used in ization or amplification assays that include, but are not limited to, rn or Northern analyses, rase chain reaction 110110 analyses and probe arrays. One preferred stic method for the detection of mRNA levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to a mRNA or genomic DNA encoding a marker of the present invention. Other suitable probes for use in the diagnostic assays of the ion are described herein. Hybridization of an mRNA with the probe indicates that the marker in question is being expressed.
In one format, the mRNA is lized on a solid surface and contacted with a probe, for example by running the isolated mRNA on an agarose gel and transferring the mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the probe(s) are immobilized on a solid surface and the mRNA is contacted with the probe(s), for e, in an Affymetrix gene chip array. A skilled artisan can y adapt known mRNA detection methods for use in detecting the level of mRNA encoded by the s of the present invention.
An alternative method for determining the level of mRNA marker in a sample involves the process of nucleic acid amplification, e. g., by RT-PCR (the experimental embodiment set forth in Mullis, 1987, U.S. Patent No. 4,683,202), ligase chain reaction y, 1991, Proc. Natl. Acad. Sci. USA, 88:189-193), self sustained sequence replication (Guatelli et al., 1990, Proc. Natl. Acad. Sci. USA 4-1878), transcriptional amplification system (Kwoh et al., 1989, Proc. Natl. Acad. Sci. USA 86: 1 173-1177), Q-Beta Replicase di et al., 1988, Bio/Technology 6: 1 197), rolling circle replication (Lizardi et al., U.S. Patent No. 5,854,033) or any other nucleic acid amplification method, followed by the detection of the amplified les using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers. As used herein, amplification primers are defined as being a pair of nucleic acid molecules that can anneal to 5’ or 3’ regions of a gene (plus and minus strands, respectively, or ersa) and contain a short region in n. In general, amplification primers are from about 10 to 30 nucleotides in length and flank a region from about 50 to 200 nucleotides in . Under appropriate conditions and 111111 with appropriate reagents, such primers permit the amplification of a nucleic acid molecule comprising the nucleotide sequence flanked by the primers.
For in situ methods, mRNA does not need to be isolated from the prior to detection. In such methods, a cell or tissue sample is prepared/processed using known histological methods. The sample is then lized on a support, typically a glass slide, and then contacted with a probe that can hybridize to mRNA that encodes the marker.
As an alternative to making determinations based on the te expression level of the marker, determinations may be based on the ized expression level of the . Expression levels are normalized by correcting the absolute expression level of a marker by comparing its expression to the expression of a gene that is not a marker, e.g., a housekeeping gene that is constitutively expressed. Suitable genes for normalization include housekeeping genes such as the actin gene, or epithelial cell- specific genes. This normalization allows the comparison of the expression level in one sample, e.g., a patient sample, to another sample, e. g. , a non-disease or non-toxic sample, or between samples from different sources.
Alternatively, the expression level can be provided as a relative expression level.
To determine a relative expression level of a marker, the level of expression of the marker is determined for 10 or more samples of normal versus disease or toxic cell es, preferably 50 or more samples, prior to the ination of the sion level for the sample in question. The mean expression level of each of the genes d in the larger number of s is determined and this is used as a ne expression level for the marker. The expression level of the marker determined for the test sample (absolute level of expression) is then divided by the mean expression value obtained for that marker. This provides a relative expression level.
Preferably, the samples used in the baseline determination will be from non-toxic cells. The choice of the cell source is dependent on the use of the relative expression level. Using expression found in normal tissues as a mean expression score aids in validating r the marker assayed is ty ic (versus normal cells). In addition, as more data is accumulated, the mean expression value can be revised, providing improved relative expression values based on accumulated data. Expression data from disesase cells or toxic cells provides a means for grading the severity of the disease or toxic state. 112112 In another embodiment of the present ion, a marker n is detected. A preferred agent for detecting marker protein of the invention is an antibody capable of binding to such a protein or a nt f, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment or derivative thereof (e. g., Fab or 2) can be used. The term "labeled", with regard to the probe or antibody, is ed to encompass direct ng of the probe or antibody by coupling (i.e., physically linking) a detectable substance to the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling include detection of a primary antibody using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be ed with cently labeled streptavidin.
Proteins from cells can be isolated using techniques that are well known to those of skill in the art. The protein isolation methods employed can, for example, be such as those described in Harlow and Lane (Harlow and Lane, 1988, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York).
A variety of formats can be employed to determine whether a sample contains a protein that binds to a given antibody. Examples of such formats include, but are not limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot analysis and enzyme linked immunoabsorbant assay (ELISA). A skilled artisan can readily adapt known n/antibody detection methods for use in determining whether cells express a marker of the present invention.
In one format, antibodies, or antibody fragments or derivatives, can be used in methods such as Western blots or immunofluorescence techniques to detect the sed proteins. In such uses, it is generally preferable to immobilize either the antibody or proteins on a solid support. le solid phase supports or carriers include any support capable of binding an n or an antibody. Well-known supports or carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, gabbros, and magnetite.
One skilled in the art will know many other suitable carriers for binding antibody or antigen, and will be able to adapt such support for use with the present invention. For example, protein isolated from disease or toxic cells can be run on a polyacrylamide gel electrophoresis and lized onto a solid phase t such as nitrocellulose. The 113113 support can then be washed with suitable buffers followed by treatment with the detectably labeled antibody. The solid phase support can then be washed with the buffer a second time to remove unbound antibody. The amount of bound label on the solid support can then be detected by conventional means.
The ion also asses kits for detecting the presence of a marker protein or nucleic acid in a biological sample. Such kits can be used to determine if a subject is suffering from or is at increased risk of developing drug-induced toxicity. For example, the kit can comprise a labeled compound or agent capable of detecting a marker protein or nucleic acid in a biological sample and means for determining the amount of the protein or mRNA in the sample (e.g., an antibody which binds the protein or a fragment thereof, or an ucleotide probe which binds to DNA or mRNA encoding the protein). Kits can also include ctions for interpreting the results obtained using the kit.
For antibody-based kits, the kit can comprise, for example: (1) a first dy (e.g., attached to a solid t) which binds to a marker protein; and, optionally, (2) a second, different antibody which binds to either the protein or the first antibody and is ated to a detectable label.
For oligonucleotide-based kits, the kit can comprise, for example: (1) an oligonucleotide, e. g., a detectably d oligonucleotide, which hybridizes to a nucleic acid ce encoding a marker protein or (2) a pair of primers useful for amplifying a marker nucleic acid molecule. The kit can also comprise, e.g., a buffering agent, a preservative, or a protein stabilizing agent. The kit can r comprise components necessary for detecting the able label (e.g., an enzyme or a ate). The kit can also contain a control sample or a series of control samples which can be assayed and compared to the test sample. Each component of the kit can be enclosed within an individual container and all of the various containers can be within a single package, along with instructions for interpreting the results of the assays performed using the kit.
F. Pharmacogenomics The markers of the invention are also useful as pharmacogenomic markers. As used herein, a “pharmacogenomic marker” is an objective biochemical marker whose expression level correlates with a specific clinical drug response or susceptibility in a patient (see, e.g., McLeod et a1. (1999) Eur. J. Cancer 35(12): 1650-1652). The 114114 presence or quantity of the pharmacogenomic marker expression is related to the predicted se of the patient and more ularly the t’s diseased or toxic cells to therapy with a specific drug or class of drugs. By assessing the presence or quantity of the expression of one or more pharmacogenomic markers in a patient, a drug therapy which is most appropriate for the patient, or which is predicted to have a greater degree of success, may be ed. For example, based on the presence or quantity of RNA or protein encoded by specific tumor markers in a patient, a drug or course of treatment may be selected that is optimized for the treatment of the specific tumor likely to be t in the patient. The use of pharmacogenomic markers therefore permits selecting or designing the most appropriate treatment for each cancer patient without trying different drugs or regimes.
Another aspect of pharmacogenomics deals with genetic conditions that alters the way the body acts on drugs. These pharmacogenetic conditions can occur either as rare s or as polymorphisms. For example, glucosephosphate ogenase (G6PD) deficiency is a common inherited enzymopathy in which the main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of fava beans.
As an rative ment, the activity of drug metabolizing enzymes is a major determinant of both the intensity and duration of drug action. The discovery of genetic polymorphisms of drug metabolizing enzymes (e. g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 s CYP2D6 and CYP2Cl9) has provided an explanation as to why some patients do not obtain the expected drug effects or show exaggerated drug response and serious toxicity after taking the standard and safe dose of a drug.
These polymorphisms are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among different populations. For example, the gene coding for CYP2D6 is highly rphic and several mutations have been identified in PM, which all lead to the absence of onal CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently experience exaggerated drug response and side effects when they receive standard doses.
If a metabolite is the active therapeutic , a PM will show no therapeutic response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who 115115 do not respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification.
Thus, the level of expression of a marker of the invention in an individual can be determined to thereby select appropriate s) for therapeutic or prophylactic treatment of the individual. In addition, cogenetic studies can be used to apply genotyping of rphic alleles encoding drug-metabolizing enzymes to the identification of an individual's drug responsiveness phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when treating a subject with a modulator of expression of a marker of the ion.
G. ring Clinical Trials Monitoring the influence of agents (e.g., drug compounds) on the level of sion of a marker of the invention can be applied not only in basic drug screening, but also in clinical . For example, the iveness of an agent to affect marker expression can be monitored in clinical trials of subjects receiving treatment for cardiotoxicity, or drug-induced toxicity. In a preferred embodiment, the present invention provides a method for monitoring the effectiveness of treatment of a t with an agent (e. g., an agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) sing the steps of (i) obtaining a pre-administration sample from a subject prior to administration of the agent; (ii) detecting the level of expression of one or more selected markers of the invention in the pre-administration sample; (iii) obtaining one or more post-administration samples from the subject; (iv) detecting the level of expression of the marker(s) in the post- stration samples; (v) comparing the level of expression of the marker(s) in the pre-administration sample with the level of sion of the marker(s) in the post- administration sample or samples; and (vi) altering the administration of the agent to the subject accordingly. For e, increased expression of the marker gene(s) during the course of treatment may indicate ineffective dosage and the desirability of increasing the dosage. Conversely, decreased expression of the marker gene(s) may indicate efficacious treatment and no need to change dosage. 116116 H. Arrays The ion also includes an array comprising a marker of the present invention. The array can be used to assay expression of one or more genes in the array.
In one embodiment, the array can be used to assay gene expression in a tissue to ascertain tissue specificity of genes in the array. In this manner, up to about 7600 genes can be simultaneously assayed for expression. This allows a profile to be developed showing a battery of genes specifically expressed in one or more tissues.
In addition to such ative determination, the invention allows the quantitation of gene expression. Thus, not only tissue specificity, but also the level of expression of a battery of genes in the tissue is ascertainable. Thus, genes can be grouped on the basis of their tissue sion per se and level of expression in that tissue. This is useful, for example, in ascertaining the onship of gene expression between or among tissues. Thus, one tissue can be perturbed and the effect on gene expression in a second tissue can be determined. In this context, the effect of one cell type on another cell type in response to a biological stimulus can be ined. Such a determination is , for example, to know the effect of cell-cell interaction at the level of gene expression. If an agent is administered therapeutically to treat one cell type but has an undesirable effect on another cell type, the invention provides an assay to determine the molecular basis of the undesirable effect and thus provides the opportunity to co-administer a counteracting agent or otherwise treat the undesired effect. Similarly, even within a single cell type, undesirable biological s can be determined at the lar level. Thus, the effects of an agent on expression of other than the target gene can be ascertained and counteracted.
In another embodiment, the array can be used to monitor the time course of sion of one or more genes in the array. This can occur in s biological contexts, as disclosed herein, for example development of drug-induced toxicity, progression of drug-induced toxicity, and processes, such a cellular transformation associated with drug-induced toxicity.
The array is also useful for ascertaining the effect of the expression of a gene on the expression of other genes in the same cell or in different cells. This provides, for example, for a selection of alternate molecular targets for eutic intervention if the ultimate or ream target cannot be regulated. 117117 The array is also useful for ascertaining differential expression patterns of one or more genes in normal and abnormal cells. This provides a y of genes that could serve as a molecular target for diagnosis or therapeutic intervention.
VII. s for Obtaining s Samples useful in the methods of the invention include any tissue, cell, biopsy, or bodily fluid sample that expresses a marker of the invention. In one embodiment, a sample may be a tissue, a cell, whole blood, serum, plasma, buccal scrape, saliva, cerebrospinal fluid, urine, stool, or bronchoalveolar lavage. In preferred embodiments, the tissue sample is a ty state sample. In more preferred embodiments, the tissue sample is a a cardiovascular sample or a drug-induced toxicity sample.
Body samples may be obtained from a subject by a variety of techniques known in the art including, for example, by the use of a biopsy or by scraping or swabbing an area or by using a needle to aspirate bodily fluids. Methods for collecting various body s are well known in the art.
Tissue samples suitable for detecting and quantitating a marker of the invention may be fresh, frozen, or fixed according to methods known to one of skill in the art.
Suitable tissue samples are preferably sectioned and placed on a microscope slide for further es. Alternatively, solid samples, i.e., tissue samples, may be lized and/or homogenized and subsequently analyzed as soluble extracts.
In one embodiment, a freshly obtained biopsy sample is frozen using, for example, liquid nitrogen or difluorodichloromethane. The frozen sample is mounted for sectioning using, for example, OCT, and serially sectioned in a cryostat. The serial ns are collected on a glass microscope slide. For immunohistochemical staining the slides may be coated with, for example, chrome-alum, gelatine or poly-L-lysine to ensure that the sections stick to the slides. In another embodiment, samples are fixed and embedded prior to sectioning. For example, a tissue sample may be fixed in, for example, formalin, ly dehydrated and embedded in, for example, paraffin.
Once the sample is obtained any method known in the art to be suitable for detecting and quantitating a marker of the invention may be used (either at the c acid or at the protein level). Such s are well known in the art and include but are not limited to western blots, northern blots, southern blots, immunohistochemistry, ELISA, e.g., ied ELISA, immunoprecipitation, immunofluorescence, flow 118118 cytometry, immunocytochemistry, mass spectrometrometric es, e.g., MALDI- TOF and SELDI—TOF, nucleic acid hybridization techniques, nucleic acid reverse transcription methods, and nucleic acid amplification methods. In particular embodiments, the expression of a marker of the invention is detected on a protein level using, for example, antibodies that specifically bind these proteins.
Samples may need to be modified in order to make a marker of the invention accessible to antibody binding. In a particular aspect of the immunocytochemistry or immunohistochemistry methods, slides may be erred to a pretreatment buffer and optionally heated to increase antigen accessibility. Heating of the sample in the pretreatment buffer rapidly ts the lipid bi-layer of the cells and makes the antigens (may be the case in fresh ens, but not typically what occurs in fixed ens) more accessible for antibody binding. The terms "pretreatment buffer" and "preparation buffer" are used interchangeably herein to refer to a buffer that is used to prepare cytology or histology samples for immunostaining, particularly by increasing the accessibility of a marker of the invention for antibody binding. The pretreatment buffer may comprise a pH-specific salt solution, a polymer, a detergent, or a nonionic or anionic surfactant such as, for example, an ethyloxylated anionic or nonionic surfactant, an alkanoate or an alkoxylate or even blends of these surfactants or even the use of a bile salt. The pretreatment buffer may, for example, be a solution of 0.1% to 1% of deoxycholic acid, sodium salt, or a solution of sodium laureth-l3-carboxylate (e. g., Sandopan LS) or and ethoxylated anionic complex. In some embodiments, the pretreatment buffer may also be used as a slide storage buffer.
Any method for making marker proteins of the invention more accessible for antibody binding may be used in the practice of the ion, including the antigen retrieval methods known in the art. See, for example, Bibbo, et al. (2002) Acta. Cytol. 29; Saqi, et al. (2003) Diagn. Cytopathol. 27:365-370; Bibbo, et al. (2003) Anal.
Quant. Cytol. Histol. 25:8-11, the entire contents of each of which are orated herein by reference.
Following pretreatment to increase marker protein accessibility, samples may be blocked using an appropriate blocking agent, e. g., a peroxidase ng reagent such as hydrogen peroxide. In some embodiments, the samples may be blocked using a n blocking t to t non-specific binding of the dy. The protein blocking reagent may comprise, for e, purified . An antibody, particularly a 119119 monoclonal or polyclonal antibody that specifically binds to a marker of the invention is then incubated with the sample. One of skill in the art will appreciate that a more accurate prognosis or diagnosis may be obtained in some cases by detecting multiple epitopes on a marker protein of the invention in a patient sample. Therefore, in particular embodiments, at least two antibodies ed to different epitopes of a marker of the invention are used. Where more than one antibody is used, these antibodies may be added to a single sample sequentially as dual antibody reagents or simultaneously as an antibody cocktail. Alternatively, each individual antibody may be added to a separate sample from the same patient, and the resulting data pooled.
Techniques for ing antibody binding are well known in the art. Antibody binding to a marker of the invention may be ed through the use of al reagents that generate a detectable signal that corresponds to the level of antibody binding and, accordingly, to the level of marker n expression. In one of the immunohistochemistry or immunocytochemistry s of the invention, antibody binding is detected through the use of a secondary antibody that is conjugated to a labeled polymer. es of d polymers include but are not limited to polymer- enzyme conjugates. The s in these complexes are typically used to catalyze the deposition of a chromogen at the antigen-antibody g site, thereby resulting in cell staining that corresponds to expression level of the biomarker of interest. Enzymes of particular interest include, but are not limited to, horseradish peroxidase (HRP) and alkaline phosphatase (AP).
In one particular immunohistochemistry or immunocytochemistry method of the invention, dy binding to a marker of the invention is detected through the use of an HRP-labeled polymer that is conjugated to a secondary antibody. Antibody binding can also be detected through the use of a species-specific probe reagent, which binds to monoclonal or polyclonal dies, and a polymer conjugated to HRP, which binds to the species specific probe reagent. Slides are stained for antibody binding using any chromagen, e.g., the chromagen 3,3-diaminobenzidine (DAB), and then rstained with hematoxylin and, optionally, a bluing agent such as ammonium hydroxide or TBS/Tween-20. Other suitable gens include, for example, 3-amino ethylcarbazole (AEC). In some aspects of the ion, slides are reviewed microscopically by a cytotechnologist and/or a pathologist to assess cell staining, e.g., fluorescent staining (i.e., marker expression). Alternatively, samples may be reviewed 120120 via automated microscopy or by personnel with the assistance of computer software that facilitates the identification of positive staining cells.
Detection of antibody binding can be facilitated by coupling the anti-marker antibodies to a detectable substance. es of detectable substances e various s, prosthetic groups, fluorescent materials, luminescent als, bioluminescent materials, and radioactive materials. Examples of suitable s include adish peroxidase, alkaline phosphatase, B-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an e of a luminescent material includes luminol; examples of inescent materials include luciferase, luciferin, and aequorin; and examples of le radioactive material include 125I, 131I, 35S, 14C, or 3H.
In one embodiment of the invention frozen samples are prepared as described above and subsequently stained with antibodies t a marker of the invention diluted to an appropriate concentration using, for example, Tris-buffered saline (TBS). Primary antibodies can be detected by incubating the slides in biotinylated anti-immunoglobulin.
This signal can optionally be amplified and visualized using diaminobenzidine precipitation of the antigen. rmore, slides can be optionally counterstained with, for example, hematoxylin, to visualize the cells.
In another embodiment, fixed and embedded samples are stained with antibodies against a marker of the invention and rstained as described above for frozen ns. In addition, samples may be optionally treated with agents to y the signal in order to visualize antibody staining. For example, a peroxidase-catalyzed deposition of biotinyl-tyramide, which in turn is reacted with peroxidase-conjugated avidin yzed Signal Amplification (CSA) System, DAKO, Carpinteria, CA) may be used.
Tissue-based assays (i.e., immunohistochemistry) are the preferred methods of detecting and quantitating a marker of the invention. In one embodiment, the presence or absence of a marker of the invention may be determined by immunohistochemistry.
In one embodiment, the immunohistochemical analysis uses low concentrations of an anti-marker antibody such that cells lacking the marker do not stain. In r embodiment, the presence or absence of a marker of the invention is determined using 121121 an immunohistochemical method that uses high concentrations of an anti-marker antibody such that cells lacking the marker protein stain heavily. Cells that do not stain contain either mutated marker and fail to produce antigenically recognizable marker n, or are cells in which the ys that regulate marker levels are dysregulated, resulting in steady state sion of negligible marker protein.
One of skill in the art will recognize that the concentration of a particular antibody used to practice the s of the invention will vary depending on such factors as time for binding, level of specificity of the antibody for a marker of the invention, and method of sample preparation. er, when multiple antibodies are used, the required tration may be affected by the order in which the antibodies are applied to the , e. g., simultaneously as a cocktail or sequentially as individual antibody reagents. Furthermore, the detection chemistry used to visualize antibody binding to a marker of the invention must also be optimized to e the desired signal to noise ratio.
In one embodiment of the invention, proteomic methods, e. g., mass spectrometry, are used for detecting and quantitating the marker proteins of the invention. For example, matrix-associated laser desorption/ionization time-of—flight mass spectrometry (MALDI-TOF MS) or surface-enhanced laser desorption/ionization time-of—flight mass spectrometry (SELDI-TOF MS) which involves the application of a biological sample, such as serum, to a protein-binding chip (Wright, G.L., Jr., et al. (2002) Expert Rev M01 Diagn 2:549; Li, J ., et al. (2002) Clin Chem 48:1296; Laronga, C., et al. (2003) Dis Markers ; Petricoin, E.F., et al. (2002) 359:572; Adam, B.L., et al. (2002) Cancer Res 9; Tolson, J., et al. (2004) Lab Invest 84:845; Xiao, Z., et al. (2001) Cancer Res 61:6029) can be used to detect and quantitate the PY-Shc and/or p66-Shc proteins. Mass spectrometric methods are described in, for example, U.S. Patent Nos. 5,622,824, 5,605,798 and 5,547,835, the entire contents of each of which are orated herein by reference.
In other embodiments, the expression of a marker of the invention is detected at the nucleic acid level. Nucleic acid-based techniques for assessing expression are well known in the art and include, for example, determining the level of marker mRNA in a sample from a subject. Many expression detection methods use isolated RNA. Any RNA isolation que that does not select against the isolation of mRNA can be utilized for the purification of RNA from cells that express a marker of the invention 122122 (see, e.g., Ausubel et al., ed., (1987-1999) Current ols in Molecular y (John Wiley & Sons, New York). onally, large numbers of tissue samples can readily be processed using ques well known to those of skill in the art, such as, for example, the single-step RNA isolation process of Chomczynski (1989, U.S. Pat. No. 4,843,155).
The term "probe" refers to any molecule that is capable of selectively binding to a marker of the ion, for example, a nucleotide transcript and/or protein. Probes can be synthesized by one of skill in the art, or derived from appropriate biological preparations. Probes may be specifically designed to be d. Examples of molecules that can be ed as probes include, but are not limited to, RNA, DNA, proteins, antibodies, and organic molecules.
Isolated mRNA can be used in hybridization or amplification assays that include, but are not limited to, Southern or Northern analyses, polymerase chain reaction analyses and probe arrays. One method for the detection of mRNA levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that can ize to the marker mRNA. The nucleic acid probe can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to ically hybridize under stringent conditions to marker genomic DNA.
In one embodiment, the mRNA is immobilized on a solid e and contacted with a probe, for example by running the isolated mRNA on an agarose gel and transferring the mRNA from the gel to a membrane, such as nitrocellulose. In an alternative embodiment, the probe(s) are immobilized on a solid surface and the mRNA is contacted with the probe(s), for example, in an Affymetrix gene chip array. A skilled artisan can readily adapt known mRNA detection methods for use in detecting the level of marker mRNA.
An alternative method for determining the level of marker mRNA in a sample involves the process of nucleic acid amplification, e.g., by RT-PCR (the experimental embodiment set forth in Mullis, 1987, U.S. Pat. No. 4,683,202), ligase chain reaction (Barany (1991) Proc. Natl. Acad. Sci. USA 88:189-193), self sustained sequence replication (Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi et al. (1988) Bio/Technology 6:1197), g circle replication (Lizardi et al., U.S. Pat. No. 033) or any other nucleic acid 123123 amplification , followed by the detection of the amplified molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers. In particular aspects of the invention, marker expression is assessed by quantitative fluorogenic RT-PCR (i.e., the TaqManTM System). Such methods typically e pairs of oligonucleotide s that are specific for a marker of the invention.
Methods for ing oligonucleotide primers specific for a known sequence are well known in the art.
The expression levels of a marker of the invention may be monitored using a membrane blot (such as used in hybridization is such as Northern, Southern, dot, and the like), or ells, sample tubes, gels, beads or fibers (or any solid support comprising bound nucleic acids). See US. Pat. Nos. 5,770,722, 5,874,219, 5,744,305, ,677,195 and 5,445,934, which are incorporated herein by reference. The detection of marker expression may also comprise using nucleic acid probes in solution.
In one embodiment of the invention, microarrays are used to detect the sion of a marker of the ion. Microarrays are ularly well suited for this purpose because of the reproducibility between different experiments. DNA microarrays provide one method for the simultaneous measurement of the expression levels of large numbers of genes. Each array consists of a reproducible pattern of capture probes attached to a solid support. d RNA or DNA is hybridized to complementary probes on the array and then detected by laser scanning. ization intensities for each probe on the array are determined and converted to a quantitative value representing relative gene expression levels. See, US. Pat. Nos. 6,040,138, 5,800,992 and 6,020,135, 6,033,860, and 6,344,316, which are incorporated herein by reference.
High-density oligonucleotide arrays are particularly useful for determining the gene expression profile for a large number of RNA's in a sample.
The amounts of marker, and/or a mathematical relationship of the amounts of a marker of the invention may be used to calculate the risk of a toxicity state, e. g., a drug- induced toxicity or cardiotoxicity, in a subject being treated with a drug, the efficacy of a treatment regimen for treating, ting or counteracting a toxicity state, and the like, using the methods of the invention, which may include methods of regression analysis known to one of skill in the art. For example, suitable regression models include, but are not limited to CART (e. g., Hill, T, and Lewicki, P. (2006) 124124 “STATISTICS Methods and Applications” StatSoft, Tulsa, OK), Cox (e.g., www.cvidence-based—medicinecouk), exponential, normal and log normal (e.g., www.obgyn.cam.ac.uk/mrg/statsbook/stsurvan.html), logistic (e.g., www.en.wikipedia.org/wiki/Logistic_regression), parametric, non-parametric, semi- parametric (e.g., www.socserv.mcmaster.ca/jfox/Books/Companion), linear (e.g., www.en.wikipedia.org/wiki/Linear_regression), or additive (e.g., www.en.wikipedia.org/wiki/Generalized_additive_model).
In one embodiment, a sion analysis includes the amounts of marker. In another embodiment, a regression analysis includes a marker mathematical relationship.
In yet another embodiment, a regression analysis of the amounts of marker, and/or a marker mathematical onship may e additional clinical and/or molecular co- variates. Such clinical co-variates include, but are not limited to, nodal status, tumor stage, tumor grade, tumor size, treatment regime, e.g., chemotherapy and/or radiation therapy, clinical outcome (e.g., relapse, e-specific survival, therapy failure), and/or clinical outcome as a function of time after diagnosis, time after initiation of y, and/or time after completion of treatment.
VIII. Kits The invention also provides compositions and kits for identifying an agent at risk for causing drug-induced toxicity, e. g., cardiotoxicity, for prognosing a cardiotoxic state, e. g., a drug-induced cardiotoxicity, recurrence of cardiotoxicity, or survival of a subject being treated for cardiotoxicity. These kits include one or more of the following: a detectable antibody that specifically binds to a marker of the invention, a able antibody that specifically binds to a marker of the invention, reagents for obtaining and/or preparing subject tissue samples for ng, and instructions for use.
The kits of the invention may optionally comprise additional ents useful for performing the methods of the invention. By way of example, the kits may se fluids (e.g., SSC buffer) suitable for annealing complementary nucleic acids or for binding an antibody with a protein with which it specifically binds, one or more sample compartments, an instructional material which describes mance of a method of the ion and tissue specific controls/standards. 125125 IX. Screening Assays Targets of the invention e, but are not limited to, the genes and/or ns listed . Based on the results of experiments bed by Applicants herein, the key proteins modulated in a toxicity state are associated with or can be classified into different pathways or groups of molecules, including cytoskeletal components, transcription factors, apoptotic response, pentose phosphate pathway, biosynthetic pathway, oxidative stress (pro-oxidant), membrane alterations, and oxidative phosphorylation metabolism. Accordingly, in one embodiment of the invention, a marker may include one or more genes (or ns) selected from the markers listed in table 2. In some embodiments, the markers are a combination of at least two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty-five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the foregoing genes (or proteins).
Screening assays useful for identifying modulators of identified markers are described below.
The invention also provides methods (also referred to herein as "screening assays") for identifying modulators, i.e., candidate or test compounds or agents (e.g., proteins, es, peptidomimetics, peptoids, small molecules or other drugs), which are useful for treating or preventing a toxicity state by modulating the expression and/or activity of a marker of the invention. Such assays typically comprise a reaction between a marker of the invention and one or more assay components. The other components may be either the test compound itself, or a combination of test compounds and a natural g partner of a marker of the ion. nds identified via assays such as those described herein may be useful, for example, for modulating, e. g., inhibiting, ameliorating, treating, or preventing aggressiveness of a e state or toxicity state.
The test compounds used in the screening assays of the present ion may be obtained from any available , including systematic libraries of natural and/or synthetic compounds. Test compounds may also be obtained by any of the numerous approaches in combinatorial library methods known in the art, including: biological libraries; peptoid libraries ries of molecules having the functionalities of peptides, but with a novel, non-peptide backbone which are resistant to tic degradation but which nevertheless remain bioactive; see, e.g., Zuckermann et al., 1994, J. Med. Chem. 8-85); spatially sable parallel solid phase or solution phase libraries; 126126 synthetic library methods ing deconvolution; the 'one-bead one-compound' library method; and synthetic library methods using affinity chromatography selection. The biological library and d library approaches are limited to peptide libraries, While the other four ches are applicable to peptide, non-peptide oligomer or small le libraries of compounds (Lam, 1997, Anticancer Drug Des. ).
Examples of methods for the synthesis of molecular libraries can be found in the art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. USA. 90:6909; Erb et al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994). J. Med.
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem.
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 332061; and in Gallop et al. (1994) J. Med. Chem. 37:1233.
Libraries of nds may be presented in solution (e. g., Houghten, 1992, Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 1993, Nature 364:555-556), bacteria and/or spores, (Ladner, USP 5,223,409), plasmids (Cull et al, 1992, Proc Natl Acad Sci USA 89: 1865-1869) or on phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 1990, Science 4-406; CWirla et al, 1990, Proc. Natl. Acad. Sci. 87:6378-6382; Felici, 1991, J. M0l. Biol. 222:301-310; Ladner, supra).
The screening methods of the invention se contacting a toxicity state cell with a test compound and determining the ability of the test compound to modulate the expression and/or activity of a marker of the invention in the cell. The expression and/or activity of a marker of the invention can be determined as described herein.
In another embodiment, the invention provides assays for screening candidate or test compounds which are substrates of a marker of the invention or biologically active portions thereof. In yet another embodiment, the invention provides assays for screening candidate or test compounds which bind to a marker of the invention or biologically active ns thereof. Determining the y of the test compound to directly bind to a marker can be accomplished, for example, by ng the compound with a radioisotope or enzymatic label such that binding of the compound to the marker can be determined by ing the labeled marker compound in a complex. For example, compounds (e.g., marker ates) can be labeled with 131I, 125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope ed by direct counting of radioemission or by scintillation ng. Alternatively, assay components can be 127127 enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by determination of conversion of an appropriate substrate to product.
This invention further pertains to novel agents fied by the above-described screening assays. Accordingly, it is Within the scope of this invention to further use an agent identified as described herein in an appropriate animal model. For example, an agent capable of modulating the expression and/or activity of a marker of the invention identified as described herein can be used in an animal model to determine the efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, an agent identified as described herein can be used in an animal model to ine the mechanism of action of such an agent. Furthermore, this invention pertains to uses of novel agents fied by the described screening assays for treatment as described above. 128128 Exemplification 0fthe Invention EXAMPLE 1: Employing Platform Technology to Build Models of Drug Induced Cardiotoxicity In this example, the platform technology described in detail in ational PCT Application No. PCT/U82012/027615 was employed to integrate data obtained from a custom built drug-induced cardiotoxicity model, and to identify novel proteins/pathways driving the pathogenesis/ cardiotoxicity of drugs. Relational maps resulting from this analysis have provided drug-induced cardiotoxicity biomarkers.
In the healthy heart contractile function depends on a balance of fatty acid and carbohydrate oxidation. Chronic imbalance in uptake, ation, organellar biogenesis and secretion in non-adipose tissue (heart and liver) is thought to be at the center of mitochondrial damage and dysfunction and a key player in drug induced cardiotoxicity.
Here Applicants describe a systems approach combining n and lipid signatures with functional end point assays specifically looking at ar bioenergetics and mitochondrial ne on. In vitro models comprising diabetic and normal cardiomyocytes supplemented with excessive fatty acid and hyperglycemia were d with a panel of drugs to create signatures and potential mechanisms of toxicity.
Applicants demonstrated the varied effects of drugs in destabilizing the mitochondria by ting the energy metabolism component at various levels ing (i) Dysregulation of transcriptional networks that controls expression of mitochondrial energy metabolism genes; (ii) Induction of GPATI and taffazin in diabetic cardiomyocytes thereby initiating de novo phospholipid synthesis and remodeling in the mitochondrial membrane; and (iii) Altered fate of fatty acid in diabetic cardiomyocytes, influencing uptake, fatty acid ion and ATP synthesis. r, Applicants combined the power of wet lab biology and AI based data mining platform to generate causal network based on bayesian models. Networks of proteins and lipids that are causal for loss of normal cell function were used to discern mechanisms of drug induced toxicity from ar protective mechanisms. This novel approach will serve as a powerful new tool to understand mechanism of toxicity while allowing for development of safer eutics that correct an altered phenotype. 129129 Human cardiomyocytes were subject to conditions simulating an diabetic environment experienced by the disease-relevant cells in vivo. Specifically, the cells were exposed to hyperglycemic conditions and hyperlipidemia conditions. The hyperglycemic condition was induced by culturing cells in media containing 22 mM glucose. The hyperlipidemia condition was induced by culturing the cells in media ning lmM L—carnitine, 0.7mM Oleic acid and 0.7mM Linoleic acid.
The cell model sing the above-mentioned cells, wherein the cells were exposed to each condition described above, was additionally “interrogated” by exposing the cells to an “environmental perturbation” by treating with a diabetic drug (T) which is known to cause cardiotoxicity, a rescue molecule (R) or both the diabetic drug and the rescue molecule (T+R). Specifically, the cells were treated with diabetic drug; or treated with rescue molecule Coenzyme Q10 at 0, SOMM, or lOOuM; or d with both of the diabetic drug and the rescue molecule me Q10.
Cell samples from each condition with each perturbation ent were collected at various times following treatment, including after 6 hours of treatment. For certain conditions, media samples were also collected and analyzed. ling of changes in total cellular protein expression by quantitative proteomics was performed for cell and media samples collected for each ion and with each “environmental perturbation”, i.e, diabetic drug treatment, Coenzyme Q10 treatment or both, using the techniques described above in the detailed description.
Transcriptional profiling experiments were carried out using the Biorad cfx-3 84 amplification system. Following data collection (Ct), the final fold change over l was determined using the 8Ct method as outlined in cturer’s protocol.
Lipidomics ments were carried out using mass spectrometry. Functional assays such as Oxygen consumption rate OCR were measured by employing the Seahorse analyzer essentially as recommended by the manufacturer. OCR was recorded by the electrodes in a 7 ul chamber created with the cartridge pushing t the seahorse culture plate.
As shown in Figure 20, riptional network and expression of human mitochondrial energy metabolism genes in diabetic cardiomyocytes (cardiomyocytes conditioned in hyperglycemic and hyperlipidemia) were compared between perturbed and unperturbed treatments. Specifically, data of transcriptional k and sion of human mitochondrial energy metabolism genes were compared between diabetic 130130 cardiomyocytes treated with diabetic drug (T) and untreated diabetic cardiomyocytes samples (UT). Data of Transcriptional network and expression of human mitochondrial energy metabolism genes were compared between diabetic myocytes treated with both diabetic drug and rescue molecule Coenzyme Q10 (T+R) and ted diabetic cardiomyocytes samples (UT). Comparing to data from untreated diabetic cardiomyocytes, certain genes expression and transcription were altered when diabetic cardiomyocytes were treated with diabetic drug. Rescue molecule Coenzyme Q10 was demonstrated to e the toxic effect of diabetic drug and ize gene expression and transcription.
As shown in Figure 2lA, cardiomyocytes were cultured either in normoglycemia (NG) or hyperglygemia (HG) condition and treated with either diabetic drug alone (T) or with both diabetic drug and rescue molecule Coenzyme Q10 (T+R) . Protein expression levels of GPATl and TAZ for each ion and each treatment were tested with western blotting. Both GPATl and TAZ were upregulated in hyperglycemia conditioned and diabetic drug treated cardiomyocytes. When hyperglycemia conditioned myocytes were treated with both diabetic drug and rescue molecule me Q10, the upregulated protein expression level of GPATl and TAZ were normalized.
As shown in Figure 22A, mitochondrial oxygen consumption rate (%) experiments were d out for hyperglycemia conditioned cardiomyocytes samples.
Hyperglycemia conditioned myocytes were either untreated (UT), treated with diabetic drug Tl which is known to cause cardiotoxicity, treated with diabetic drug T2 which is known to cause cardiotoxicity, treated with both diabetic drug T1 and rescue le Coenzyme Q10 (Tl+R), or treated with both diabetic drug T2 and rescue molecule Coenzyme Q10 (T2+R). Comparing to untreated l samples, mitochondrial OCR was decreased when hyperglycemia conditioned cardiomyocytes were treated with diabetic drug T1 or T2. However, mitochondrial OCR was normalized when hyperglycemia conditioned cardiomyocytes were treated with both diabetic drug and rescue molecule Coenzyme Q10 (Tl + R, or T2 + R).
As shown in Figure 22B, mitochondria ATP sis experiments were carried out for hyperglycemia conditioned myocytes samples. Hyperglycemia conditioned cardiomyocytes were either untreated (UT), treated with a diabetic drug (T), or treated with both diabetic drug and rescue molecule Coenzyme Q10 (T+R). 131131 Comparing to untreated l samples, mitochondrial ATP synthesis was repressed when hyperglycemia conditioned cardiomyocytes were treated with diabetic drug (T).
As shown in Figure 23, based on the collected proteomic data, proteins down regulated by drug treatment were annotated with G0 terms. Proteins ed in mitochondrial energy metabolism were down regulated when hyperglycemia conditioned cardiomyocytes were d with a diabetic drug which is known to cause cardiotoxicity.
Proteomics, lipidomics, transcriptional profiling, functional assays, and western blotting data collected for each condition and with each bation, were then processed by the REFSTM system. ite perturbed networks were generated from ed data obtained from one specific condition (e. g., hyperglycemia, or hyperlipidemia) exposed to each perturbation (e. g., diabetic drug, CleO, or both).
Composite urbed networks were ted from combined data obtained from the same one specific condition (e.g., hyperglycemia, or hyperlipidemia), without perturbation (untreated). Similarly, composite perturbed networks were generated from combined data obtained for a second, control ion (e. g., normal glycemia) d to each perturbation (e. g., diabetic drug, CleO, or both). Composite unperturbed networks were generated from combined data obtained from the same second, control condition (e. g., normal glycemia), without perturbation (untreated).
Each node in the consensus composite networks described above was simulated (by increasing or sing by 10-fold) to generate simulation networks using , as described in detail above in the detailed description.
The area under the curve and fold changes for each edge connecting a parent node to a child node in the simulation ks were extracted by a custom-built program using the R programming language, where the R programming language is an open source software environment for statistical computing and cs.
Delta networks were generated from the simulated ite networks. To generate a drug induced cardiotoxicity condition vs. normal condition differential network in response to the diabetic drug (delt network), steps of comparison were performed as illustrated in Figure 24, by a custom built program using the PERL programming language.
Specifically, as shown in Figure 24, Untreated refers to protein expression networks of untreated control cardiomyocytes in hyperglycemia condition. Drug refers 132132 to protein expression networks of ic drug treated cardiomyocytes in hyperglycemia condition. Unique edges from Drug in the Drug fl Untreated delta network are presented in Figure 25.
Specifically, a simulated composite map of untreated myocytes in hyperglycemia condition and a simulated ite map of diabetic drug treated cardiomyocytes in hyperglycemia condition were compared using a -made Perl program to generate unique edges of the diabetic drug treated cardiomyocytes in hyperglycemia condition. Output from the PERL and R ms were input into Cytoscape, an open source program, to generate a visual representation of the delta network. As shown in Figure 25, the network ents delta networks that are driven by the diabetic drug versus untreated in cardiomyocytes/ cardiotox models in hyperglycemia condition.
From the drug induced toxicity condition vs. normal condition differential network shown in Figure 25, proteins were identified which drive pathophysiology of drug induced cardiotoxicity, such as GRP78, GRP75, TIMPl, PTX3, HSP76, PDIA4, PDIAl, CA2Dl. These ns can function as biomarkers for identification of other cardiotoxicity inducing drugs. These proteins can also function as biomarkers for fication of agents which can alleviate cardiotoxicity.
The experiments bed in this Example demonstrate that perturbed membrane y and altered fate of free fatty acid in ic cardiomyocytes exposed to drug treatment represent the center piece of drug induced toxicity. Data integration and network biology have allowed for an enhanced understanding of cardiotoxicity, and identification of novel biomarkers predictive for cardiotoxicity.
EXAMPLE 2: Employing Models of Drug Induced Cardiotoxicity to fy Additional Markers of Cardiotoxicity The platform technology described above in Example 1 was similarly employed to integrate further data obtained from the same custom built cardiotoxicity model. Five patient cardiomyocyte lines were used to create a model of cardiotoxicity as explained in the above-detailed description. The five cardiomyocyte lines were then ted to a mitochondrial ATP assay to assay for mitochondrial dysfunction imposed by drug treatment or absence there of (as indicated as + and -) under diabetic conditions 133133 (hyperglycemia) and normal ions (normoglycemia). A reduction of ondrial ATP was observed under diabetic conditions upon drug treatment in only 2 out of the 5 cardiotoxicity model (see Figure 30). The s of these further experiments lead to the identification of additional novel proteins/pathways driving the pathogenesis of cardiotoxicity of drugs, as summarized in Figures 26-34.
The causal interaction network identified several novel biomarkers and potential therapeutic s for nduced cardiotoxicity. Relational maps resulting from this analysis as shown in Figures 28, 29, 31-33 have provided additional drug-induced toxicity biomarkers, which are listed below in Table 2. These biomarkers may be used for ting drug-induced cardiotoxicity of a drug, for diagnosis/prognosis of drug-induced cardiotoxicity, and for identifying a rescue agent which can reduce or alleviate drug-induced cardiotoxicity.
Table 2: biomarkers identified by the Interrogative Biology Discovery Platform lA69, lCl7, ACBD3, ACLY, ACTR2, ANXA6, ANXA7, AP2Al, ARCNl, ASNAl, ATAD3A, ATP5A, ATP5B, ATP5D, ATP5Fl, ATP5H, ATPIFl, BSG, Cl4orfl66, CA2Dl, CAPNl, CAPZA2, CARS, CCDC22, CCDC47, CCT7, CLIC4, CMPKl, CNN2, COlA2, CO6Al, COTLl, COX6Bl, CRTAP, CSOlO, CTSA, CTSB, CYB5, DDXl, DDXl7, DDXl8, DLD, EDIL3, EHD2, EIF4A3, ENOZ, EPHXl, ETFA, FERMT2, FINC, FKBlO, FKBP2, FLNC, G3BP2, GOLGA3, GPATl, GPSNZ, GRP75, GRP78, HSPAlA, HSPA4, HSPA9, IBP7, IDHl, IQGAPl, ITBl, ITGBl, MRS, KIF5B, KPNA3, KPNBl, LAMCl, LGALSl, LMO7, M6PRBP1, MACFl, MAPlB, MARS, MDHl, MPRl, MTHFDl, MYHlO, NCL, NHP2Ll, NUCBl,OLAl,P08621,P3H1,P4HA2,P4HB, SEC6lAl (P6l6l9), PAIl, PAPSSZ, PCBPZ, PDCD6, PDIAl, PDIA3, PDIA3, PDIA4, PDLIM7, PEBPl, PFKM, PH4B, PLIN2, POFUTl, PRKDC, PSMAl, PSMA7, PSMD12, PSMD3, PSMD4, PSMD6, PSMEZ, PTBPl, PTX3, Q9BQE5, Q9Y262, RABlB, RP515A, RPL32, RPL7A, RPL8, RPSZS, RPS6, RRASZ, RRPl, SARlB, SDHA, SENPl, , SEPT7, SERPH, SERPINEl, SFRSZ, S 134134 SYNCRIP, TAGLN, TAZ, TGM2, TIMPl, TLNl, TPM4, TRAPl, TSPl, TTLL12, TXNDClZ, UBAlC, UGDH, UGP2, UQCRH, VAMP3, VAPA In one ment, a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), lCl7, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4 can be used for predicting drug-induced cardiotoxicity of a drug, for diagnosis/prognosis of drug-induced cardiotoxicity, for identifying a rescue agent which can reduce or alleviate drug-induced cardiotoxicity.
Among the markers listed in Table 2, PTX3, PAIl, IBP7 7) have been reported as markers of cardiomyopathy previously. GRP78 and PDIA3 have been reported as serving important indications of ER stress and hypoxic . The fact that these markers have been identified by the above-descriped rm technology for drug- induced cardiotoxicity, have ted this platform technology for probing novel drug- induced cardiotoxicity biomarkers.
The sDNA sequences of the markers listed in Table 2 are set forth in Appendix A, and are known in the art. 135135 oration by Reference The contents of all cited references (including literature references, patents, patent applications, and websites) that maybe cited throughout this ation are hereby expressly orated by reference in their entirety, as are the references cited therein. The practice of the present ion will employ, unless otherwise indicated, conventional techniques of protein formulation, which are well known in the art.
Equivalents The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are ore to be considered in all respects illustrative rather than limiting of the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes that come within the meaning and range of equivalency of the claims are therefore intended to be embraced herein. 136136 Apmndix A Official Symbol: HSPA5 Official Name: heat shock 70kDa protein 5 (glucose-regulated protein, 78kDa) m:3309 Organism: Homo sapiens Other Aliases: BIP; MlF2; GRP78 Other Designations: 78 kDa glucose-regulated protein; endoplasmic reticulum lumenal Ca(2+)-binding protein grp78; immunoglobulin heavy chain-binding protein Nucleotide sauence: NCBI nce Seguence: NM_005347.4 LOCUS NM_005347 ACCESSION 347 1 gggctggggg agggtatata gtag gcgacggtga ggtcgacgcc ggccaagaca 61 gcacagacag attgacctat tggggtgttt cgcgagtgtg agagggaagc gccgcggcct 121 gtatttctag acctgccctt gttc cctt gtgaccccgg gcccctgccg 181 cctgcaagtc ggaaattgcg ctgtgctcct gtgctacggc ctgg actgcctgct 241 gctgcccaac gcaa gatgaagctc tccctggtgg ccgcgatgct gctgctgctc 301 agcgcggcgc gggccgagga ggaggacaag aaggaggacg tgggcacggt ggtcggcatc 361 qacctgggga ccacctactc ctgcqtcgqc gtqttcaaga acggccgcgt ggagatcatc 421 gccaacgatc agggcaaccg catcacgccg tcctatgtcg ccttcactcc tgaaggggaa 481 cgtctgattg gcgatgccgc caagaaccag thacctcca accccgagaa cacggtcttt 541 gacgccaagc ggctcatcgg ccgcacgtgg aatgacccgt ctgtgcagca ggacatcaag 601 ttcttgccgt tcaaggtggt tgaaaagaaa ccat acattcaagt tgatattgga 661 ggtgggcaaa caaagacatt tgctcctgaa gaaatttctg ccatggttct cactaaaatg 721 aaagaaaccg ctgaggctta tttgggaaag aaggttaccc atgcagttgt tactgtacca 781 gcctatttta ccca acgccaagca accaaagacg ctggaactat tgctggccta 841 aatgttatga ggatcatcaa cgagcctacq gcagctgcta ttgcttatgg cctggataag 9 0 1 aqggaggggg acat cctggtgttt gacctgggtg gcggaacctt cgatgtgtct 137137 96" cttctcacca ttgacaatgg tgtcttcgaa gttgtggcca ctaatggaga tactcatctg "02" ggtggagaag actttgacca gcgtgtcatg gaacacttca tcaaactgta caaaaagaag "08" acgggcaaag atgtcaggaa taga gctgtgcaga aactccggcg cgaggtagaa "14" aaggccaaac gggccctgtc ttctcagcat caagcaagaa ttga gtccttctat "20" gaaggagaag acttttctga gaccctgact cgggccaaat ttgaagagct ggat "26" ctgttccggt ctactatgaa gcccgtccag aaagtgttgg ctga tttgaagaag "32" tctgatattg atgaaattgt tcttgttggt ggctcgactc gaattccaaa gattcagcaa "38" ctggttaaag agttcttcaa tggcaaggaa ccatcccgtg gcataaaccc agct "44" gtagcgtatg g:gctgctgt ccaggctggt gtgctctctg gtga:caaga tacaggtgac "50" ctggtactgc t:gatgtatg tccccttaca cttggtattg aaac:gtggg aggtgtcatg "56" accaaactga t:ccaaggaa cacagtggtg aaga agtc:cagat cttttctaca "62" gcttctgata a :caaccaac tgttacaatc aaggtctatg aagg:gaaag acccctgaca "68" aaagacaatc a:cttctggg tacatttgat ctgactggaa ttcc:cctgc tcctcgtggg "74" gtcccacaga t:gaagtcac ctttgagata aatg gtat:cttcg agtgacagct "80" gaagacaagg g:acagggaa caaaaataag atca ccaa:gacca gaatcgcctg "86" gaag aaatcgaaag gatggttaat gatgctgaga agtt:gctga ggaagacaaa "92" aagctcaagg agcgcattga tactagaaat gaaa ccta ttctctaaag "98" aatcagattg gagataaaga aaagctggga ggtaaacttt cctc:gaaga taaggagacc 204" atggaaaaag ctgtagaaga aaagattgaa tggctggaaa gccaccaaga catt 210" gaagacttca aagctaagaa gaaggaactg gaagaaattg ttcaaccaat caaa 216" ctctatggaa gtgcaggccc tcccccaact ggtgaagagg atacagcaga aaaagatgag 222" ttgtagacac tgatctgcta gtgctgtaat attgtaaata ctggactcag gaacttttgt 228" taggaaaaaa ttgaaagaac ttaagtctcg aatgtaattg gaatcttcac ctcagagtgg 234" agttgaaact gctatagcct aagcggctgt ttactgcttt tcattagcag ttgctcacat 240" gggt gggggggaga agaagaattg gccatcttaa ggta aaaaacctgg 246" gttagggtgt gtgttcacct tgtt ctatttaaca tcat gtgcatctgg 252" tgtaggaagt tttttctacc ataagtgaca ccaataaatg tttgttattt acactggtct 258" aatgtttgtg agaagcttct aattagatca attacttatt ttaggaaatt taagactaga 264" tactcgtgtg tggggtgagg ggagggagta tttggtatgt tgggataagg ttct 270" atttaatgct tccagggatt tttttttttt tttttaaccc tcctgggccc aagtgatcct 138138 276" tccacctcag tctcccagct gacc acaggcttgt taccaccatg tttt 282" gcattaatct aagaaaaggg gagagaagtt aatccacatc tttactcagg caaggggcat 288" ttcacagtgc ccaagagtgg ggttttcttg aacatacttg gtttcctatt tccccttatc 294" tttctaaaac tgcctttctg gtggcttttt ttaaaattat tactaatgat gcttttatag 300" ctgcttggat tctctgagaa atgatgggga gtgagtgatc atta actttataca 306" cttggatttc atttgtaact ttaggatgta aaggtatatt gtgaacccta gctgtgtcag 312" aatctccatc cctgaaattt ctcattagtg gtactggggt gggatcttgg atggtgacat 318" tgaaactaca ctaaatcccc tcactatgaa tgggttgtta aaggcaatgg tttgtgtcaa 324" aactggttta ggattactta gattgtgttc ctgaagaaaa gagtccaggt aaatggtatg 330" atcaataaag gacaggctgg cata aata ttgtaatcct agcactttgg 336" gaggccaagg cgggtggatc acaaggtcaa gagatagaga ccatctttgc caacatggtg 342" aaactccatc tctactgaaa atacaaaaat tagctgggcg tgca agctgaaggc 348" tgaggcagga gaatcactcg aacccgggag gcagaggttg cagtgagccg agatcacacc 354" actgtactcc agcccggcac tccagcctgg gagt gagactccac ctcaaaaaaa 360" aaaaaaagaa tccaatactg cccaaggata ggtattttat agatgggcaa ctggctgaaa 366" ggttaattct ctagggctag tagaactgga tcccaacacc aaactcttaa ttagacctag 372" gcctcagctg cactgcccga aaagcatttg ggcagaccct gagcagaata ctggtctcag 378" gccaagccca atacagccat taaagatgac ctacagtgct gtgtaccctg gggcaatagg 384" gttaaatggt agttagcaac tagggctagt cttcccttac ctcaaaggct ctcactaccg 390" acct agtctgtaac tctttctgag gagctgttac tgaatattaa aaagatagac 396" ttcaactatg aaa Protein seguence: NCBI Reference ce: NP_005338.1 LOCUS NP_005338 ACCESSION NP_005338 1 mklslvaaml lllsaaraee vgtv vgidlgttys cvgvfkngrv eiiandqgnr 61 itpsyvaftp gdaa npen tvfdakrlig rtwndpquq dikflpfkvv 121 ekktkpyiqv digggqtktf apeeisamvl tkmketaeay lgkkvthavv ndaq 181 rqatkdagti aglnvmriin eptaaaiayg ldkregekni ggtf dvslltidng 139139 241 vfevvatngd thlggedqu rvmehfikly kkktgkdvrk dnravqklrr evekakrals 301 sqhqarieie sfyegedfse tltrakfeel nmdlfrstmk dsd lkksdideiv 361 1vggstripk effn gkepsrginp deavaygaav qagvlsgdqd tgdlvlldvc 421 pltlgietvg gvmtkliprn tvvptkksqi fstasdnqpt vtikvyeger pltkdnhllg 481 tfdltgippa prgquievt feidvngilr vtaedkgtgn knkititndq nrltpeeier 541 kfae edkklkerid trnelesyay slknqigdke klggklssed ketmekavee 601 kiewleshqd adiedfkakk keleeivqpi isklygsagp pptgeedtae kdel Grp75 g)_ffigigl §ymmlz HSPA9 Official Name: heat shock 70kDa protein 9 (mortalin) Gene ID: 3313 Organism: Homo sapiens Other Aliases: CSA; MOT; MOT2; GRP75; PBP74; ; HSPAQB; h r D n : 75 kDa glucose-regulated protein; heat shock 70kD protein SB; mortalin, perinuclear; mortalin-2; p66-mortalin; peptide-binding protein 74; stress-70 protein, mitochondrial Nucleotide sauence: NCBI Reference Seguence: NM_004134.6 LOCUS NM_0041 34 ION NM_004134 1 ttcctcccct ggactctttc tgagctcaga gcag ccgggacagg ggct 61 ttctccaacc ctgc ggagcatatt acctgtacgc cctggctccg ggagcggcag 121 atcc tctggtcagg cggcgcgggc ggcgcctcag cggaagagcg ggcctctggg 181 ccgcagtgac caacccccgc ccctcacccc acgtggttgg aggtttccag aagcgctgcc 241 gccaccgcat cgcgcagctc tttgccqth gagcgcttgt ttgctgcctc gtactcctcc 301 atttatccgc catgataagt gccagccgag ctgcagcagc ccgtctcgtg ggcgccgcag 361 cctcccgggg ccctacggcc gcccgccacc aggatagctg gaatggcctt agtcatgagg 421 cttttagact tgtttcaagg cgggattatg catcagaagc ggga gcagttgttg 140140 48" gtattgattt gggtactacc aactcctgcg ttat ggaaggtaaa caagcaaagg 54" tgctggagaa tgccgaaggt gccagaacca cagt tgtggccttt acagcagatg 60" gtgagcgact tgttggaatg ccggccaagc gacaggctgt caccaaccca aacaatacat 66" tttatgctac caagcgtctc attggccggc gatatgatga tcctgaagta cagaaagaca 72" ttaaaaatgt taaa attgtccgtg cctccaatgg tgatgcctgg gttgaggctc 78" aatt gtattctccg agtcagattg ttgt gttgatgaag atgaaagaga 84" ctgcagaaaa ttacttgggg cacacagcaa aaaatgctgt gatcacagtc ccagcttatt 90" tcaatgactc gcagagacag gccactaaag atgctggcca tgga gtgc 96" ttcgggtgat taatgagccc acagctgctg ctcttgccta tggtctagac gaag "02" acaaagtcat tgctgtatat gatttaggtg gtggaacttt tgatatttct atcctggaaa "08" ttcagaaagg agtatttgag gtgaaatcca caaatgggga taccttctta ggtggggaag "14" actttgacca ggccttgcta cggcacattg agtt caagagagag acaggggttg "20" atttgactaa agacaacatg gcacttcaga gggtacggga agctgctgaa aaggctaaat "26" gtgaactctc ctcatctgtg cagactgaca tcaatttgcc ctatcttaca atggattctt "32" ctggacccaa gcatttgaat atgaagttga cccgtgctca atttgaaggg attgtcactg "38" atctaatcag aaggactatc gctccatgcc aaaaagctat gcaagatgca agca "44" agagtgacat aggagaagtg attcttgtgg gtggcatgac taggatgccc aaggttcagc "50" agactgtaca ggatcttttt ggcagagccc aagc tgtcaatcct gatgaggctg "56" tggccattgg agctgccatt cagggaggtg tgttggccgg cgatgtcacg gatgtgctgc "62" tccttgatgt cactcccctg tctctgggta ttgaaactct aggaggtgtc tttaccaaac "68" ttattaatag gaataccact attccaacca agaagagcca ggtattctct actgccgctg "74" aaac gcaagtggaa gtgt gtcagggtga aagagagatg gctggagaca "80" acaaactcct tggacagttt actttgattg gaattccacc agcccctcgt ggagttcctc "86" aagt tacatttgac attgatgcca atgggatagt acatgtttct gctaaagata "92" aaggcacagg acgtgagcag cagattgtaa tccagtcttc tggtggatta gatg "98" atattgaaaa tatggttaaa aatgcagaga aatatgctga agaagaccgg aagg 204" aacgagttga agcagttaat atggctgaag gaatcattca cgacacagaa accaagatgg 210" aagaattcaa ggaccaatta cctgctgatg agtgcaacaa gctgaaagaa gagatttcca 216" aaatgaggga gctcctggct agaaaagaca gcgaaacagg tatt agacaggcag 222" ctct tcagcaggca tcactgaagc tgttcgaaat ggcatacaaa aagatggcat 141141 228" ctgagcgaga aggctctgga agttctggca ctggggaaca aaaggaagat caaaaggagg 234" aaaaacagta ataatagcag aaattttgaa gccagaagga caacatatga agcttaggag 240" tgaagagact tcctgagcag aaatgggcga acttcagtct ttttactgtg tttttgcagt 246" attctatata taatttcctt aatttgtaaa tttagtgacc attagctagt gatcatttaa 252" gtga ttctaacagt ataaagttca caatattcta tgtccctagc ctgtcatttt 258" tcagctgcat gtaaaaggag gtaggatgaa ttgatcatta ttta actattttat 264" gctgaagtga ccatattttc aaggggtgaa accatctcgc acacagcaat gaaggtagtc 270" atccatagac ttgaaatgag accacatatg gggatgagat ccttctagtt agcctagtac 276" tgctgtactg gcctgtatgt acatggggtc cttcaactga ggccttgcaa gtcaagctgg 282" ctgtgccatg tttgtagatg gggcagagga atctagaaca atgggaaact tagctattta 288" tattaggtac agctattaaa acaaggtagg aatgaggcta gacctttaac ttccctaagg 294" catacttttc tagctacctt ctgccctgtg tctggcacct acatccttga tgattgttct 300" cttacccatt ctggaatttt ttttttttta aataaataca atct tgatctcttg 306" tttgtgaggg gtgatgccct tagc ttcaagaata tgccatggct tccc 312" ccca gaaa tacaggattt gctaacactg gttaaaaatg caag 318" atttggaagg gctgttataa aatg agcagtatca gcatgtgcaa atcttgtttg 324" aaggatttta ttttctcccc ttagaccttt ggtacattta gaatcttgaa agtttctaga 330" acat gaaagtttct agatctctaa agtt tttagatctc taacatgaaa 336" gtgg ctattttcag gttgctttca gctccaagta gaaataacca gaattggctt 342" acattaaaga aactgcatct agaaataagt cctaagatac tatttctatg gctcaaaaat 348" aaaaggaacc cagatttctt tcccta Protein seguence: NCBI Reference Seguence: NP_004125.3 LOCUS NP_004125 ACCESSION NP_004125 1 misasraaaa rlvgaaasrg quw nglsheafrl vsrrdyasea ikgavvgidl 61 gttnscvavm egkqakvlen aegarttpsv vaftadgerl vgmpaquav tnpnntfyat 121 krligrrydd pevqkdiknv pfkivrasng dawveahgkl yspsqigafv taen 181 ylghtaknav itvpayfnds qrqatkdagq isglnvlrvi neptaaalay gldksedkvi 142142 241 avydlgggtf disileiqu vfevkstngd tflggedqu allrhivkef kretgvdltk 301 dnmalqrvre aaekakcels nlp yltmdssgpk hlnmkltraq fegivtdlir 361 ham qdaevsksdi gevilvggmt tvq dlfgrapska vnpdeavaig 421 aaiqggvlag dvtdvllldv tplslgietl ggvftklinr nttiptkksq vfstaadgqt 481 qveikchge remagdnkll gqftligipp aprgquiev tfdidangiv hvsakdkgtg 541 reqqiviqss gglskddien mvknaekyae edrrkkerve avnmaegiih dtetkmeefk 601 dqlpadecnk lkeeiskmre llarkdsetg enirqaass qqaslklfem aykkmasere 661 gsgssgtgeq kequeekq TIMP1 Official Symbol: TIMP1 Official Name: TIMP opeptidase inhibitor 1 m:Gene ID: 7076 Organism: Homo sapiens Other Aliases: RP1-230G1.3, CLGI, EPA, EPO, HCI, TIMP Other Designations: TIMP-1; collagenase inhibitor; erythroid potentiating activity; erythroid-potentiating activity; fibroblast collagenase inhibitor; metalloproteinase inhibitor 1; tissue inhibitor of oproteinases 1 Nucleotide sguence: NCBI nce Seguence: NM_003254.2 LOCUS NM_003254 ACCESSION NM_003254 l tttcgtcggc ccgccccttg gcttctgcac tgatggtggg tggatgagta atgcatccag 61 gaagcctgga ggcctgtggt ttccgcaccc gctgccaccc ccgcccctag cgtggacatt 121 ctag cgctcaggcc ctgccgccat agat ccagcgccca gagagacacc 181 agagaaccca ccatggcccc gccc ctggcttctg gcatcctgtt gttgctgtgg 241 gccc ccagcagggc ctgcacctgt gtcccacccc acccacagac ggccttctgc 301 aattccgacc tcgtcatcag ggccaagttc gtggggacac cagaagtcaa ccagaccacc 361 ttataccagc gttatgagat caagatgacc aagatgtata aagggttcca agccttaggg 421 gatgccgctg acatccggtt cgtctacacc cccgccatgg agagtgtctg cggatacttc 481 cacaggtccc gcag cgaggagttt gctg gaaaactgca ggatggactc 143143 54" ttgcacatca gcag ttttgtggct ccctggaaca gcctgagctt agctcagcgc 60" cggggcttca ccaagaccta cactgttggc tgtgaggaat gcacagtgtt tccctgttta 66" tccatcccct gcaaactgca gagtggcact cattgcttgt ggacggacca gctcctccaa 72" ggctctgaaa agggcttcca tcac cttgcctgcc ggga gccagggctg 78" tgcacctggc agtccctgcg gtcccagata atcc tgcccggagt ggaagctgaa 84" gcctgcacag tgtccaccct gttcccactc ccatctttct tccggacaat gaaataaaga 90" gttaccaccc agcagaaaaa aaaaaaaaaa Protein seguence: NCBI nce Seguence: NP_003245.1 LOCUS NP_003245 ACCESSION NP_003245 1 mapfeplasg illllwliap sractcvpph pqtafcnsdl vgtp evnqttlyqr 61 yeikmtkmyk gfqalgdaad irfvytpame svcgyfhrsh nrseefliag 121 tcsfvapwns rgft ktytvgceec tvfpclsipc qusgthclw tdqllqgsek 181 gqurhlacl prepglctwq slrsqia PTX3 Official Sym bol: PTX3 Official Name: pentraxin 3, long Gene ID: 5806 Organism: Homo sapiens Other Aliases: TNFAIP5, TSG-14 Other Designations: TNF alpha-induced protein 5; pentaxin-related gene, rapidly induced by lL-1 beta, tumor necrosis , alpha-induced protein 5; pentaxin- related protein PTX3; pentraxin-3; pentraxin-related gene, rapidly induced by IL- 1 beta; pentraxin-related protein PTX3; tumor is factor alpha-induced protein 5; tumor is factor, alpha-induced protein 5; tumor necrosis factor- inducible gene 14 protein; tumor necrosis factor-inducible protein TSG-14 Nucleotide seguence: NCBI Reference Seguence: NM_002852.3 LOCUS NM_002852 ACCESSION NM_002852 144144 l attcatcccc gctt tcctcagcat ttattaagga gctc ctca 6" ctctcactct cctccgctca aactcagctc acttgagagt ccgc cagctgtgga l2" tttg cgtctctcca gcaatgcatc tccttgcgat tctgttttgt tggt 18" ctgcagtgtt ggccgagaac tcggatgatt atgatctcat gtatgtgaat ttggacaacg 24" aaatagacaa tggactccat cccactgagg accccacgcc gtgcgcctgc ggtcaggagc " aatg ggacaagctc ttcatcatgc actc gcagatgaga gagcgcatgc 36" tgctgcaagc cacggacgac gtcctgcggg gcgagctgca gaggctgcgg gaggagctgg 42" gccggctcgc ggaaagcctg gcgaggccgt gcgcgccggg ggctcccgca gaggccaggc 48" tgaccagtgc tctggacgag cagg cgacccgcga CgnggCCgC aggctggcgc 54" gtatggaggg cgcggaggcg cagcgcccag aggaggcggg gcgcgccctg gccgcggtgc 60" tagaggagct gcggcagacg cgagccgacc tgcacgcggt gcagggctgg gctgcccgga 66" gctggctgcc ggcaggttgt gaaacagcta ttttattccc aatgcgttcc aagaagattt 72" ttggaagcgt gcatccagtg agaccaatga ggcttgagtc ttttagtgcc tgcatttggg 78" ccac agatgtatta aacaaaacca tcctgttttc ctatggcaca aagaggaatc 84" catatgaaat ccagctgtat tacc aatccatagt gtttgtggtg ggtggagagg 90" agaacaaact ggttgctgaa gccatggttt ccctgggaag gtggacccac ctgtgcggca 96" cctggaattc agaggaaggg ctcacatcct tgtgggtaaa tggtgaactg gcggctacca "O2" ctgttgagat ggccacaggt cacattgttc ctgagggagg aatcctgcag attggccaag "08" aaaagaatgg ctgctgtgtg ggtggtggct ttgatgaaac attagccttc tctgggagac "l4" tcacaggctt caa:atctgg gatagtgttc ttagcaatga agagataaga gagaccggag "20" gagcagagtc ttg:cacatc cgggggaata ggtg gggagtcaca cagc "26" cacatggagg agc:cagtat gtttcataaa tgttgtgaaa ctccacttga agccaaagaa "32" agaaactcac act:aaaaca catgccagtt gggaaggtct gaaaactcag atag "38" gaacacttga tgaa agagagagtt gagaccaatc tttatttgta ctggccaaat "44" actgaataaa cag:tgaagg attg gaaaaagctt ttgaggataa tgttactaga "50" ctttatgcca tgg:gctttc agtttaatgc tgtgtctctg aaac tctcaaataa "56" ttaaaaagga ctg:attgtt gaacagaggg acaattgttt tacttttctt tggttaattt "62" tgttttggcc agagatgaat tttacattgg aagaataaca aaataagatt ccat "68" tgttcattgt tattggtatg taccttatta caaaaaaaag atgaaaacat atttatacta "74" caaggtgact taacaactat aaatgtagtt tatgtgttat aatcgaatgt cacgtttttg 145145 1801 agaagatagt catataagtt atattgcaaa agggatttgt attaatttaa gactattttt 1861 gtaaagctct actgtaaata aaatatttta taaaactagc tcacgtcatt taattataaa 1921 tttaagagat gttttggaaa aaaaaaaaaa aaaaa Protein ce: NCBI nce Seguence: NP_002843.2 LOCUS N P_002843 ACCESSION NP_002843 1 lfca lwsavlaens ddydlmyvnl dneidnglhp tedptpcacg qehsewdka 6" imlensqmre rmllqatddv lrgelqure elgrlaesla rpcapgapae ldel 12" lqatrdagrr larmegaeaq rpeeagrala avleequtr adlhavqgwa arswlpagce 18" tailfpmrsk kifgsvhpvr fsac ikaatdvln ktilfsygtk rnpyeiqul 24" syqsivavg geenklvaea mvslgrwthl cgtwnseegl tslwvngela attvematgh " ivpeggilqi gqekngccvg ggfdetlafs niwd svlsneeire tggaeschir 36" gnivgwgvte iqphggaqu HSP76 Official Sym bol: HSPA6 Official Name: heat shock 70kDa protein 6 (HSP7OB') Gene ID: 3310 sm: Homo sapiens Othe r Aliases: Other Designations: heat shock 70 kDa protein 6; heat shock 70 kDa protein 8'; heat shock 70kD protein 6 (HSP7OB') Nucleotide seguence: NCBI Reference Seguence: NM_002155.3 LOCUS NM_002155 ACCESSION NM_002155 1 agagccagcc cggaggagct agaaccttcc ccgcatttct ttcagcagcc tgagtcagag 61 gcgggctggc ctggcgtagc cgcccagcct tcat gccccgatct gcccgaacct 146146 12" gggg tcagcgccgc gccgcgccac ccggctgagt cagcccgggc gggcgagagg 18" ctctcaactg ggcgggaagg tgcgggaagg tgcggaaagg ttcgcgaaag gcgg 24" nggggtcgg gtgaggcgca aaaggataaa aagcccgtgg aagcggagct gagcagatcc " gagccgggct ggctgcagag aaaccgcagg gagagcctca ctgctgagcg cccctcgacg 36" cggc agcagcctcc gtggcctcca gcatccgaca agaagcttca gccatgcagg 42" ccccacggga gctcgcggtg ggcatcgacc tgggcaccac ctactcgtgc gtgggcgtgt 48" ttcagcaggg ggag atcctggcca acgaccaggg caaccgcacc acgcccagct 54" acgtggcctt caccgacacc gagcggctgg tcggggacgc ggccaagagc caggcggccc 60" tgaaccccca caacaccgtg ttcgatgcca agcggctgat cgggcgcaag ttcgcggaca 66" ccacggtgca gtcggacatg aagcactggc ccttccgggt cgag ggcggcaagc 72" ccaaggtgcg cgtatgctac cgcggggagg acaagacgtt ctaccccgag gagatctcgt 78" ccatggtgct gatg acgg ccgaggcgta cctgggccag cccgtgaagc 84" tgat caccgtgccc gcctatttca atgactcgca gcgccaggcc accaaggacg 90" cgggggccat cgcggggctc aacgtgttgc ggatcatcaa cacg gcagctgcca 96" tcgcctatgg gctggaccgg cggggcgcgg gagagcgcaa cgtgctcatt tttgacctgg "O2" gtgggggcac cttcgatgtg ctct acgc tggtgtcttt gaggtgaaag "08" ccactgctgg agatacccac ctgggaggag aggacttcga caaccggctc gtgaaccact "l4" aaga attccggcgg aagcatggga aggacctgag cgggaacaag cgtgccctgc "20" gcaggctgcg cacagcctgt gcca agcgcaccct gtcctccagc acccaggcca "26" ccctggagat agactccctg ttcgagggcg tggacttcta cacgtccatc actcgtgccc "32" gctttgagga actgtgctca gacctcttcc ccct ggagccggtg gccc "38" tgcgggatgc caagctggac aaggcccaga ttcatgacgt cgtcctggtg gggggctcca "44" ctcgcatccc caaggtgcag aagttgctgc aggacttctt caacggcaag gagctgaaca "50" agagcatcaa ccctgatgag gctgtggcct ctgc tgtgcaggcg gccgtgttga "56" tgggggacaa atgtgagaaa gtgcaggatc tcctgctgct ggatgtggct cccctgtctc "62" tggggctgga gacagcaggt ggggtgatga ccacgctgat ccagaggaac gccactatcc "68" ccaccaagca gacccagact ttcaccacct actcggacaa ccagcctggg gtcttcatcc "74" aggtgtatga gggtgagagg gccatgacca aggacaacaa cctgctgggg cgttttgaac "80" gcat ccctcctgcc ccacgtggag agat agaggtgacc tttgacattg "86" atgctaatgg catcctgagc gtgacagcca ctgacaggag cacaggtaag gctaacaaga 147147 1921 tcaccatcac caatgacaag ctga gcaaggagga gagg atggttcatg 1981 aagccgagca gtacaaggct gaggatgagg cccagaggga cagagtggct gccaaaaact 2041 cgctggaggc ccatgtcttc catgtgaaag gttctttgca agaggaaagc cttagggaca 2101 agattcccga cagg cgcaaaatgc aagacaagtg tcgggaagtc cttgcctggc 2161 tggagcacaa ccagctggca gagaaggagg agtatgagca tcagaagagg gagc 2221 aaatctgtcg ccccatcttc tccaggctct ggcc ccct gggggcagca 2281 gttgtggcac tcaagcccgc cagggggacc ccagcaccgg ccccatcatt gaggaggttg 2341 attgaatggc ccttcgtgat aagtcagctg tgactgtcag ggctatgcta tgggccttct 2401 agactgtctt ctatgatcct gcccttcaga gatgaacttt ccctccaaag ctagaacttt 2461 cttcccagga aagt actt tttgggggga gggcggttca tcctcttctg 2521 cttcaaataa aaagtcatta atttattaaa acttgtgtgg cactttaaca ttgctttcac 2581 ctatattttg tgtactttgt catg tatgaatttt gttatgtaaa atatagttat 2641 agacctaaat aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_002146.2 LOCUS NP_002146 ACCESSION NP_002146 1 mqaprelavg idlgttyscv gquqgrvei landqgnrtt psyvaftdte rlvgdaaksq 61 aalnphntvf dakrligrkf adttvqsdmk hwpfrvvseg gkpkvrvcyr gedktfypee 121 issmvlskmk etaeaylgqp vkhavitvpa yfndsqrqat kdagaiagln vlriinepta 181 aaiaygldrr gagernvlif dlgggtfdvs vlsidagvfe vkatagdthl ggedfdnrlv 241 frrk hgkdlsgnkr alrrlrtace rakrtlssst qatleidslf egvdfytsit 301 rarfeelcsd lfrstlepve kalrdakldk aqihdvvlvg gstripkvqk llqdffngke 361 pdea vaygaavqaa vlmgdkcekv qdlllldvap tagg vmttliqrna 421 tqtf ttysdnquv fiquegera mtkdnnllgr felsgippap rgquievtf 481 didangilsv tatdrstgka nkititndkg rlskeeverm vheaeqykae deaqrdrvaa 541 knsleahvfh vkgslqeesl rdkipeedrr kqukcrevl awlehnqlae keeyehqkre 601 leqicrpifs upg gsscgtqarq gdpstgpiie evd PDIA4 148148 Official Symbol: PD|A4 Official Name: protein disulfide isomerase family A, member 4 m: 9601 Organism: Homo sapiens Other Aliases: ERP70, ERP72, ERp-72 Other Designations: ER protein 70; ER protein 72; endoplasmic reticulum resident n 70; asmic reticulum resident protein 72; protein disulfide isomerase related n (calcium-binding protein, intestinal-related); protein disulfide isomerase-associated 4; protein disulfide-isomerase A4 Nucleotide seguence: NCBI nce Seguence: 911.4 LOCUS NM_00491 1 ACCESSION NM_004911 l gttttaaacg cgcagccgag ggccgcgcgc aggagtaggg agggcctagg gcggcggagc 6" cgactcgtcg cggccgaggc gcgcgcggtc cgtgccggcg tcagtctggg attggccggc 12" ccgcgacttc ctccgccccc tgccaatcgc cggggacgac ttccgtgggt ttttccggct 18" cccccgcgtc gctaaggagc gacgggctgt gacc ccgagttctc ggtgcgctca 24" gcggccgccg ggag ctcc gcta ccatgaggcc ccggaaagcc " ttcctgctcc tgctgctctt ggggctggtg cagctgctgg ccgtggcggg tgccgagggc 36" ccggacgagg attcttctaa cagagaaaat gccattgagg atgaagagga ggaggaggag 42" gatg atgaggaaga agacgacttg gaagttaagg aagaaaatgg agtcttggtc 48" ctaaatgatg caaactttga tgtg gctgacaaag tgct gctggagttt 54" tatgctccat ggtgtggaca ttgcaagcag tttgctccgg aatatgaaaa aattgccaac 60" atattaaagg ataaagatcc tcccattcct gttgccaaga tcgatgcaac ctcagcgtct 66" gtgctggcca gcaggtttga tgtgagtggc taccccacca tcaagatcct taagaagggg 72" caggctgtag actacgaggg ctccagaacc caggaagaaa ttgttgccaa ggtcagagaa 78" gtctcccagc ccgactggac gcctccacca gaagtcacgc ttgtgttgac caaagagaac 84" tttgatgaag ttgtgaatga tgcagatatc attctggtgg agttttatgc cccatggtgt 90" ggacactgca agaaacttgc gtat gagaaggccg ccaaggagct cagcaagcgt 96" tctcctccaa ttcccctggc aaaggtcgac gccaccgcag aaacagacct ggccaagagg 149149 "O2" tttgatgtct ctggctatcc caccctgaaa attttccgca aaggaaggcc ttatgactac "08" aacggcccac gagaaaaata tggaatcgtt gattacatga tcgagcagtc cgggcctccc "l4" tccaaggaga ttctgaccct ggtc caggagttcc tgaaggatgg agacgatgtc "20" atcatcatcg ttaa gggggagagt gacccagcct accagcaata ccaggatgcc "26" gctaacaacc tgagagaaga ttacaaattt actt tcagcacaga aatagcaaag "32" ttcttgaaag tctcccaggg gcagttggtt gtaatgcagc ctgagaaatt ccagtccaag "38" tatgagcccc ggagccacat gatggacgtc tcca cccaggactc ggccatcaag "44" gacttcgtgc tgaagtacgc cctgcccctg gttggccacc gcaaggtgtc aaacgatgct "50" aagcgctaca ccaggcgccc cctggtggtc gtctactaca gtgtggactt cagctttgat "56" tacagagctg caactcagtt ttggcggagc aaagtcctag aggtggccaa ccct "62" acct ttgccattgc ggacgaagag gactatgctg gggaggtgaa ggacctgggg "68" ctcagcgaga gtggggagga tgtcaatgcc gccatcctgg acgagagtgg gttc "74" gagc cagaggagtt tgactctgac accctccgcg agtttgtcac tgctttcaaa "80" aaaggaaaac tgaagccagt catcaaatcc cagccagtgc acaa accc "86" gtcaaggtcg tggtgggaaa gacctttgac tccattgtga tggaccccaa gaaggacgtc "92" ctcatcgagt tctacgcgcc atggtgcggg cactgcaagc agctagagcc cgtgtacaac "98" agcctggcca agaagtacaa aaag ggcctggtca tcgccaagat ggacgccact 204" gccaacgacg tccccagcga ccgctataag gtggagggct tccccaccat ctacttcgcc 210" cccagtgggg acaaaaagaa cccagttaaa tttgagggtg gagacagaga tctggagcat 216" ttgagcaagt ttatagaaga acatgccaca aaactgagca ggaccaagga agagctttga 222" aggcctgagg tctgcggaag gtgggaggag gcagacgccc tgcgtggccc atggtcgggg 228" cgtccacgcc gaggccggca gaca gtatctcgga ttcctttttt ttttttttta 234" attttttata ctttggtgtt tcacttcatg ctctgaatac tgaataacca tgaatgactg 240" ttag tttt tacagaggat acatctattt ttatcattat ttggggtttg 246" aaaaattttt ttttacacct tctaatttct ttatttctca aagcagataa ttcttctgtg 252" tgaaaatgtt ttcttttttt aatttaaggt ttcc ttttccaaat catgttgatt 258" ttgctctttg ctttttcgtt gtctgagaaa ttgttggcgt agatttggct tctggtatgt 264" gtttctgatt gcttcctgtt gagcacaaag tgagagctgc cactgagcag ccctgccagg 270" ggtgctgttt caggctgggc atcgccaggc ggcctccctg caaaccaagg gctgggggca 276" catg atccagggtc ccccagggtg ggctcagctc cagggagagg ccacccacgt 150150 2821 ggcagcccca cctcttgaga gcccccagtg ccggagcaga aaggaccctg gacccagagg 2881 cagatactgc ggggtggtag aaaaggtaga gtaggctgtg gaat cgat 2941 cgtt aaaaaaaaaa aaaa n seguence: NCBI Reference Seguence: NP_004902.1 LOCUS NP_004902 ACCESSION NP_004902 1 mrprkaflll lllglvqlla vagaegpded aied eeeeeeeddd eeeddlevke 6" engvlvlnda adkd tvllefyapw cghckqfape ilkd kdppipvakl 12" datsasvlas rfdvsgypti kilkkgqavd yegsrtqeei vakvrevsqp dwtpppevtl 18" vltkenfdev vndadiilve fyapwcghck klapeyekaa kelskrsppi plakvdatae 24" tdlakrfdvs gyptlkifrk grpydyngpr ekygivdymi eqsgppskei ltlkqvqefl " kdgddviiig vfkgesdpay qqudaannl redykfhhtf steiakflkv snglvvmqp 36" equskyepr shmmdvqgst quaikdfvl kyalplvghr kvsndakryt rrplvvvyys 42" vdfsfdyraa tqfwrskvle vakdfpeytf aiadeedyag evkdlglses aiLd 48" esgkkfamep eefdsdtlre fvtafkkgkl kpviksqpvp knnkgpvkvv vgktfdsivm 54" dpkkdvlief yapwcghckq lepvynslak kykquglvi akmdatandv psdrykvegf 60" ptiyfapsgd kknpkaegg drdlehlskf ieehatklsr tkeel PD|A1 Official Symbol: P4HB Official Name: prolyl oxylase, beta polypeptide Gene ID: 5034 Organism: Homo sapiens Other Aliases: DSI, ERBA2L, GIT, P4Hbeta, PDI, PDIA1, PHDB, PO4DB, PO4HB, PROHB Other Designations: cellular thyroid hormone-binding protein; collagen prolyl 4- hydroxylase beta; glutathione-insulin transhydrogenase; p55; procollagenproline , 2-oxoglutarate ygenase (proline 4-hydroxylase), beta polypeptide; prolyl 4-hydroxylase t beta; protein disulfide isomerase family A, member 1; protein disulfide isomerase-associated 1; protein disulfide 151151 isomerase/oxidoreductase; protein disulfide-isomerase; protocollagen ylase; thyroid hormone-binding protein p55 Nucleotide seguence: NCBI nce Seguence: NM_000918.3 LOCUS NM_000918 ACCESSION NM_000918 l gagcctcgaa gtccgccggc caatcgaagg cgggccccag cggcgcgtgc gcgccgcggc 6" cagcgcgcgc gggg gcaggcgcgc cccggaccca ggatttataa aggcgaggcc 12" gggaccggcg cgcgctctcg tcgcccccgc tgtcccggcg accg aagcgccccg 18" cctgatccgt gtccgacatg cgcg ctctgctgtg cctggccgtg gccgccctgg 24" tgcgcgccga cgcccccgag gaggaggacc acgtcctggt gctgcggaaa agcaacttcg " cggaggcgct ggcggcccac aagtacctgc tggtggagtt ctatgcccct tggtgtggcc 36" actgcaaggc tctggcccct gcca ctgg gaagctgaag gcagaaggtt 42" ccgagatcag gttggccaag gtggacgcca cggaggagtc tgacctggcc cagcagtacg 48" gcgtgcgcgg ctatcccacc atcaagttct tcaggaatgg agacacggct tcccccaagg 54" aatatacagc tggcagagag gctgatgaca tcgtgaactg gctgaagaag cgcacgggcc 60" cggctgccac caccctgcct gacggcgcag ctgcagagtc cttggtggag gagg 66" tggctgtcat cttc aaggacgtgg agtcggactc tgccaagcag tttttgcagg 72" cagcagaggc catcgatgac ataccatttg ggatcacttc caacagtgac gtgttctcca 78" aataccagct cgacaaagat ggggttgtcc agaa gtttgatgaa ggccggaaca 84" actttgaagg ggaggtcacc aaggagaacc tgctggactt acac aaccagctgc 90" cccttgtcat cgagttcacc gagcagacag ccccgaagat ttttggaggt gaaatcaaga 96" ctcacatcct gctgttcttg cccaagagtg tgtctgacta tgacggcaaa ctgagcaact "02" tcaaaacagc agccgagagc ttcaagggca agatcctgtt catcttcatc gacagcgacc "08" acaccgacaa ccagcgcatc ctcgagttct ttggcctgaa agag tgcccggccg "14" tgcgcctcat caccctggag gaggagatga ccaagtacaa gcccgaatcg gaggagctga "20" cggcagagag gatcacagag ttctgccacc tgga gggcaaaatc aagccccacc "26" gcca ggagctgccg gaggactggg acaagcagcc tgtcaaggtg cttgttggga "32" agaactttga agacgtggct tttgatgaga aaaaaaacgt ctttgtggag ttctatgccc "38" gtgg caaa cagttggctc ccatttggga taaactggga gagacgtaca 152152 "44" aggaccatga gaacatcgtc atcgccaaga tggactcgac tgccaacgag gtggaggccg "50" tcaaagtgca cagcttcccc aagt tctttcctgc cagtgccgac aggacggtca "56" ttgattacaa cggggaacgc acgctggatg gttttaagaa attcctggag agcggtggcc "62" aggatggggc aggggatgat gacgatctcg aggacctgga agag gagccagaca "68" tggaggaaga tcag aaagctgtga aagatgaact gtaatacgca aagccagacc "74" cgggcgctgc ccct nggggctgc acacccagca gcagcgcacg aagc "80" ctgcggcctc gcttgaagga gggcgtcgcc ggaaacccag ggaacctctc tgaagtgaca "86" cctcacccct acacaccgtc cgttcacccc cgtctcttcc ttctgctttt cggtttttgg "92" aaagggatcc atctccaggc agcccaccct ggtggggctt gtttcctgaa atgt "98" actttttcat acatgagtct gtccagagtg cttgctaccg tgttcggagt ctcgctgcct 204" ccctcccgcg ggaggtttct cctctttttg ccgt ctgtgggatt tttagacatt 210" tttcgacatc agggtatttg ttccaccttg gccaggcctc ctcggagaag cttgtccccc 216" gtgtgggagg gacggagccg gactggacat ggtcactcag taccgcctgc agtgtcgcca 222" tgactgatca ttgc atttttgggt aaatggagac ttccggatcc tgtcagggtg 228" tcccccatgc ctggaagagg tggc tgccagccct cggc acaggcctgg 234" gccttcccct tccctcaagc cagggctcct cctcctgtcg tgggctcatt gtgaccactg 240" gcctctctac agcacggcct gtggcctgtt caaggcagaa ccacgaccct tgactcccgg 246" gtggggaggt ggccaaggat gctggagctg aatcagacgc tgacagttct tcaggcattt 252" ctatttcaca atcgaattga acacattggc caaataaagt tgaaatttta ccacctgtaa 258" aaaaaaaaaa aaaaaa Protein ce: NCBI Reference Seguence: NP_000909.2 LOCUS NP_000909 ACCESSION NP_000909 1 mlrrallcla vaalvradap eeedhvlvlr ksnfaealaa hkyllvefya pwcghckala 61 peyakaagkl kaegseirla kvdateesdl aqqygvrgyp ngdt aspkeytagr 121 eaddivnwlk krtgpaattl pdgaaaeslv essevavigf fkdvesdsak qflqaaeaid 181 dipfgitsns dvfskyqldk dgvvlfkkfd egrnnfegev tkenlldfik hnqlplvief 241 teqtapkifg geikthillf lpksvsdydg klsnfktaae sfkgkilfif idsdhtdnqr 301 ileffglkke ecpavrlitl eeemtkykpe seeltaerit legk ikphlmsqel 153153 361 pedwqupvk vlvgknfedv afdekknva cghc kqlapiwdkl getykdheni 421 viakmdstan eveavkvhsf ptlkffpasa drtvidynge rtldgfkkfl 481 dddledleea eepdmeeddd qkavkdel CA2D1 Official Symbol: CACNA2D1 Official Name: m channel, voltage-dependent, alpha 2/delta subunit 1 Gene ID: 781 Organism: Homo sapiens Other Aliases: H_DJ0560014.1, CACNA2, CACNL2A, CCHL2A Other Designations: calcium l, L type, alpha 2 polypeptide; dihydropyridine-sensitive L-type, calcium channel alpha-2/delta t; voltagedependent calcium channel subunit alpha-2/delta-1 ; voltage-gated calcium channel subunit alpha-2/delta-1 Nucleotide seguence: NCBI Reference Seguence: NM_000722.2 LOCUS NM_000722 ACCESSION NM_000722 1 cggcggaggc aaggcggccg cggcgcggag cagccgacgc acgctagtgg gtccgcccgc 6" caccgcccct tcctcggcgt ccgc ccttgccgtc ccccgcgcgg ctccgcgcct l2" nggCCCng gcgcagccag ccctccagac gcccgcggtc ccggcggcgt gtgctgctct 18" tcctccgccc gcggtttcca gcgccgctcc ttcccccgct tgggcaggga gggggcattg 24" atcttcgatc gcgaagatgg ctgctggctg cctgctggcc ttgactctga cacttttcca " atctttgctc atcggcccct agga gccgttccct tcggccgtca ctatcaaatc 36" atgggtggat aagatgcaag aagaccttgt cacactggca gcaa gtggagtcaa 42" tcagcttgtt tatg agaaatatca agatttgtat actgtggaac atgc 48" acgccagctg gtagaaattg ggga tattgagaaa cttctgagca acagatctaa 54" agccctggtg cgcctggcat tggaagcgga gaaagttcaa gcagctcacc agtggagaga 60.. agattttgca agcaatgaag ttgtctacta caatgcaaag gatgatctcg atcctgagaa 66" aaatgacagt gagccaggca ggat aaaacctgtt ttcattgaag atgctaattt 154154 72" acaa tatc agcacgcagc agtccatatt cctactgaca tctatgaggg 78" ctcaacaatt gtgttaaatg aactcaactg gacaagtgcc ttagatgaag aaaa 84" gaatcgcgag gaagaccctt cattattgtg gcaggttttt ggcagtgcca ctggcctagc 90" tcgatattat ccagcttcac catgggttga taatagtaga actccaaata acct 96" ttatgatgta cgcagaagac catggtacat ccaaggagct ccta aagacatgct "02" tattctggtg gatgtgagtg gaagtgttag tggattgaca cttaaactga tccgaacatc "08" tgtctccgaa atgttagaaa ccctctcaga tgatgatttc gtgaatgtag cttcatttaa "14" cagcaatgct caggatgtaa gctgttttca gcaccttgtc caagcaaatg taagaaataa "20" aaaagtgttg aaagacgcgg atat cacagccaaa ggaattacag agaa "26" gggctttagt tttgcttttg aacagctgct taattataat gtttccagag caaactgcaa "32" taagattatt atgctattca ngatggagg agaagagaga gcccaggaga acaa "38" atacaataaa gataaaaaag tacgtgtatt cacgttttca gttggtcaac atga "44" cagaggacct attcagtgga tggcctgtga aaacaaaggt tattattatg aaattccttc "50" cattggtgca atca atactcagga atatttggat gttttgggaa gaccaatggt "56" tttagcagga gacaaagcta agcaagtcca atggacaaat gtgtacctgg atgcattgga "62" actgggactt gtcattactg gaactcttcc ggtcttcaac ggcc aaaa "68" taagacaaac ttaaagaacc agctgattct gatg ggagtagatg tgtctttgga "74" agatattaaa agactgacac cacgttttac actgtgcccc aatgggtatt actttgcaat "80" cgatcctaat ggttatgttt tattacatcc aaatcttcag ccaaagaacc ccaaatctca "86" ggagccagta acattggatt tccttgatgc agag atta aagtggagat "92" tcgaaataag atgattgatg gggaaagtgg agaaaaaaca ttcagaactc tggttaaatc "98" tgag agatatattg acaaaggaaa caggacatac acatggacac ctgtcaatgg 204" cacagattac agtttggcct tggtattacc aacctacagt ttttactata taaaagccaa 210" actagaagag acaataactc aggccagatc aaaaaagggc aaaatgaagg attcggaaac 216" gcca gataattttg aagaatctgg ctatacattc atagcaccaa gagattactg 222" caatgacctg aaaatatcgg ataataacac tgaatttctt ttaaatttca acgagtttat 228" tgatagaaaa actccaaaca acccatcatg taacgcggat ttgattaata gagtcttgct 234" tgatgcaggc tttacaaatg aacttgtcca aaattactgg caga aaaatatcaa 240" gggagtgaaa gcacgatttg ttgtgactga tggtgggatt accagagttt atcccaaaga 246" ggctggagaa aattggcaag aaaacccaga gacatatgag gacagcttct ataaaaggag 155155 252" cctagataat gataactatg ttttcactgc tccctacttt aacaaaagtg gacctggtgc 258" ctatgaatcg ggcattatgg taagcaaagc tgtagaaata tatattcaag ggaaacttct 264" taaacctgca gttgttggaa ttaaaattga tgtaaattcc tggatagaga atttcaccaa 270" aacctcaatc ccgt gtgctggtcc agtttgtgac tgcaaaagaa acagtgacgt 276" aatggattgt ctgg atgatggtgg gtttcttctg atggcaaatc atgatgatta 282" tactaatcag agat gaga gattgatccc agcttgatga gacacctggt 288" taatatatca gtttatgctt ttaacaaatc ttat cagtcagtat gtgagcccgg 294" tgctgcacca aaacaaggag caggacatcg ctcagcatat gtgccatcag tagcagacat 300" attacaaatt ggctggtggg ccactgctgc tgcctggtct attctacagc agtttctctt 306" gagtttgacc tttccacgac tccttgaggc agttgagatg gaggatgatg acttcacggc 312" ctccctgtcc aagcagagct gcattactga acaaacccag tatttcttcg ataacgacag 318" taaatcattc agtggtgtat tagactgtgg aaactgttcc agaatctttc aaaa 324" gcttatgaac accaacttaa taat gagc aaagggacat gtga 330" cacacgactg ctcatacaag cggagcagac ttctgacggt ccaaatcctt gtgacatggt 336" taagcaaccc agataccgaa aagggcctga tgtctgcttt gataacaatg tcttggagga 342" ttatactgac tgtggtggtg tttctggatt aaatccctcc ctgtggtata tcattggaat 348" ccagtttcta tggc tggtatctgg cagcacacac cgcctgttat gaccttctaa 354" aaaccaaatc tgcatagtta aactccagac cctgccaaaa catgagccct gccctcaatt 360" acagtaacgt agggtcagct ataaaatcag acaaacatta gctgggcctg ggca 366" taacactaag gcgcagactc ctaaggcacc cactggctgc atgtcagggt gtcagatcct 372" taaacgtgtg tgaatgctgc atcatctatg tgtaacatca aagcaaaatc ctatacgtgt 378" cctctattgg aaaatttggg agtttgttgt tgcattgttg gt Protein seguence: NCBI Reference ce: NP_000713.2 LOCUS NP_000713 ACCESSION NP_000713 1 maagcllalt ltlqullig fpsa vtikswvdkm qedlvtlakt asgvnqlvdi 61 yekyqdlytv epnnarqlve iaardiekll snrskalvrl vqaa hqwredfasn 121 evvyynakdd ldpekndsep gsqrikpvfi edanfgrqis thaavhipt diyegstivl 156156 18" nelnwtsald evfkknreed psllqufgs atglaryypa spwvdnsrtp nkidlydvrr 24" rpwyiqgaas pkdmlilvdv sgsvsgltlk lirtsvseml etlsdddfvn naqd " vqa nvrnkkvlkd avnnitakgi tdykkgfsfa feqllnynvs rancnkiiml 36" ftdggeeraq eifnkynkdk kvrvftfsvg qhnydrgpiq wmacenkgyy yeipsigair 42" intqeyldvl grpmvlagdk akqvqwtnvy ldalelglvi tgtlpvfnit tnLk 48" nqlilgvmgv dvsledikrl tprftlcpng pngy vllhpnlqpk npksqepvtl 54" dfldaelend nkmi dgesgektfr tlvksqdery idkgnrtytw tpvngtdysl 60" alvlptysfy yikakleeti tqarskkgkm kdsetlkpdn feesgytfia prdycndlki 66" sdnnteflln fnefidrktp nnpscnadli nrvlldagft ywsk qknikgvkar 72" fvvtdggitr genw qenpetyeds fykrsldndn yvftapyfnk sgpgayesgi 78" eiyi qgkllkpavv gikidvnswi enftktsird pcagpvcdck rnsdvmdcvi 84" lddggfllma nhddytnqig dpsl mrhlvnisvy afnksydyqs vcepgaapkq 90" gaghrsayvp svadilqigw wataaawsil qqfllsltfp rlleavemed ddftaslskq 96" sciteqtqyf fdndsksfsg vldcgncsri fhgeklmntn lifimveskg tcpcdtrlLi 102" qaeqtsdgpn pcdmvkqpry rkgpdvcfdn tdcg gvsglnpslw yiigiqflLl 108" wlvsgsthrl 1 GPAT1 Official Symbol: GPAM Official Name: glycerolphosphate acyltransferase, mitochondrial Gene ID: 57678 Organism: Homo sapiens Other Aliases: RP11-426E5.2, GPAT, GPAT1 Other Designations: GPAT-f ; glycerol 3-phosphate acyltransferase, mitochondrial; glycerolphosphate acyltransferase 1, mitochondrial Nucleotide seguence: NCBI nce Seguence: NM_001244949.
LOCUS NM_001244949 ACCESSION NM_001244949 157157 l tgcgtcatca gggtgcgcca ctgcagctgg ccgg gactggaagt gcgggcttct 6" gcagcagccg aagctggagc tgctagggca gctc tgta ttct 12" gcacccgaaa ctgatagctg agtcctgaag ttttatgtta tgaaacagaa gaactttcat 18" cccagcacat gatttgggaa ttacactttg tgacatggat gaatctgcac tgacccttgg 24" tacaatagat gtttcttatc tgccacattc atcagaatac agtgttggtc gatgtaagca " cacaagtgag gaatggggtg agtgtggctt tagacccacc atcttcagat ctgcaacttt 36" aaaatggaaa gaaagcctaa tgagtcggaa aaggccattt gttggaagat gttgttactc 42" tccc cagagctggg acaaattttt caaccccagt atcccgtctt tgggtttgcg 48" gaatgttatt tatatcaatg aaactcacac ccgc ggatggcttg caagacgcct 54" ttcttacgtt attc aagagcgaga tgtgcataag ggcatgtttg ccaccaatgt 60" gactgaaaat gtgctgaaca gcagtagagt acaagaggca attgcagaag tggctgctga 66" attaaaccct gatggttctg cccagcagca atcaaaagcc gttaacaaag tgaaaaagaa 72" agctaaaagg attcttcaag aaatggttgc cactgtctca ccggcaatga tcagactgac 78" tgggtgggtg ctgctaaaac tgttcaacag cttcttttgg aacattcaaa ttcacaaagg 84" tcaacttgag atggttaaag ctgcaactga gacgaatttg ccgcttctgt ttctaccagt 90" atcc catattgact atctgctgct cactttcatt ctcttctgcc ataacatcaa 96" agcaccatac attgcttcag atct caacatccca atcttcagta ccttgatcca "02" taagcttggg ggcttcttca gaag gctcgatgaa acaccagatg gacggaaaga "08" tgttctctat agagctttgc tccatgggca tatagttgaa ttacttcgac agcagcaatt "l4" cttggagatc gaag gcacacgttc taggagtgga aaaacctctt gtgctcgggc "20" aggacttttg tcagttgtgg tagatactct gtctaccaat gtcatcccag acatcttgat "26" aatacctgtt ggaatctcct atgatcgcat tatcgaaggt cactacaatg gtgaacaact "32" gggcaaacct aagaagaatg agagcctgtg gagtgtagca agaggtgtta ttagaatgtt "38" acgaaaaaac tatggttgtg tccgagtgga ttttgcacag tcct taaaggaata "44" tttagaaagc caaagtcaga tgtc tgctctactt tccctggagc aagcgttgtt "50" accagctata cttccttcaa gacccagtga tgctgctgat gaaggtagag acacgtccat "56" gtcc agaaatgcaa cagatgaatc cctacgaagg aggttgattg caaatctggc "62" tgagcatatt ctattcactg ctagcaagtc ctgtgccatt atgtccacac acattgtggc "68" ttgcctgctc ctctacagac acaggcaggg aattgatctc ttgg actt "74" ctttgtgatg gaag tcctggctcg tgattttgac ctggggttct caggaaattc 158158 180" agaagatgta gtaatgcatg ccatacagct gctgggaaat tgtgtcacaa tcacccacac 186" tagcaggaac gatgagtttt ttatcacccc cagcacaact gtcccatcag tcttcgaact 192" caacttctac agcaatgggg tacttcatgt catg atca tagcttgcag 198" cctttatgca gttctgaaca agaggggact ggggggtccc actagcaccc cacctaacct 204" ccag gagcagctgg tgcggaaggc ggccagcctg tgctaccttc tctccaatga 210" aggcaccatc tcactgcctt gccagacatt ttaccaagtc tgccatgaaa cagtaggaaa 216" gtttatccag tatggcattc tggc cgat gaccaggaag atatcagtcc 222" tgct gagcagcagt gggacaagaa gcttccagaa cctttgtctt ggagaagtga 228" agat gaagacagtg actttgggga ggaacagcga gattgctacc tgaaggtgag 234" ccaatccaag gagcaccagc agtttatcac cttcttacag cttg tgct 240" ggaggcctac agctctgctg ccatctttgt tcacaacttc agtggtcctg ttccagaacc 246" tgagtatctg caaaagttgc acaaatacct aataaccaga acagaaagaa atgttgcagt 252" atatgctgag acat attgtcttgt gaagaatgct gtgaaaatgt ttaaggatat 258" tggggttttc aaggagacca aacaaaagag agtgtctgtt ttagaactga gcagcacttt 264" tctacctcaa tgcaaccgac aaaaacttct agaatatatt ctgagttttg tggtgctgta 270" ggtaacgtgt ggcactgctg gcaaatgaag gtcatgagat gagttccttg taggtaccag 276" cttctggctc aagagttgaa ggtgccatcg cagggtcagg ctgt cccgaagtga 282" tctcctggaa gacaagtgcc ttctccctcc atggatctgt gatcttccca gctctgcatc 288" aacacagcag cctgcagata acacttgggg ggacctcagc ctctattcgc aactcataat 294" ccgtagacta caagatgaaa tctcaataaa ttatttttga gtttattaaa gattgacatt 300" ttaagtacaa cttttaagga ctaattactg acac gtag ctgtgttctg 306" gaactgaatc ttacatggta tacttagtgc g:aa tttgttggta tattatctgg 312" ttagtggtta atgcttcctt taaaaataat tgagtca:cc attcactctt tttcagtttt 318" atctgtcaat agtagctaca tttttaatgg c:tt tatcccaaag tgctttataa 324" attgagtgga ctgatatata tcacacccag gtatcac:gt gctgtccttt gctgtcagat 330" ttagaaatgt ttttaagagc aaaa cagacaa:at tagtttaggt cgggaactga 336" gatattgtaa agtt aacatcagga agttaat:tg gctggcaaaa ttctagggaa 342" acttggccag aaaactggtg ttgaaggctt ttgctca:at aaacaagtgc cattgagttt 348" caaatgacca gcaaatatat ttagaaccct tcctgtt:ta tgtctgtacc tcgtccaccc 354" ctcaggtaat acctgcctct cacaggtaca gctgtttctt ggaaatcctc caaccaaata 159159 360" gcagttttcc taacttgatt agcttgagct gacagactgt tagaatacag ttctctggcc 366" acagctgatg agggctttct gtactgcaca gtgt cccc agtccaggtg 372" actggtaccc actcgagttg tgccgtgcac aacctgtcca gcat gtggtggccc 378" tactgactgg taatggttag aggcatttat ggatttttag ctttgaggaa aaaccatgac 384" ttttaacaaa tttttatggg ttatatgcct aaacccttat gccacatagt ggtaaataat 390" tatgaaaaat ggtctgttca taattggtag gtgccttttg tgagcaggga ttat 396" tggtttatta tggtaattat ggtgattttt taaatatcat gtaatg:taa aacgttttct 102" aacagtttac ttat ctccaagata ttatggaatt aagaat:ttt ccagatgagt 108" taga ttctttgaat ttagtataaa agtactgaga attaag:ttg cata 114" agcttggatt ttaaacactg atagtatctc aatg tgtgtt :tgg gagagggagg 120" gatgctgatt gatatttcac attgtatgaa tgtt tgaaac:cat agcaataatg 126" ctatgctgtt gtgatccctc tctg catttaaaat atattt:ttc tttataggaa Z32" ttgatgtata agtc attgtcagtt gtagtagctc tgatgt:gaa tgagatatca tgttttagca ttccatttta ctgactaggg tagaagaaca cttttc:tgg aggataccca gggagtcttg ggtgttcctt atctggggaa gcaaacattt tttttttcat cctttaaatt gtaaattaag gattactcaa gctcaccatt attcaagatt 156" gggactcgct tcccagtcga cactctgccc tgcctgtcat tgctgcaaag agctgctgct Z62" ttgccaacct aagcaaagaa aatacggctt ctcttgcatt attttccctt ttggttggtt Z68" tgttttctag aagtacgttc agatgctttg gcaa tgtatgattt gctagctctc Z74" ctta actcactgtg aggataaata tgcatgcttt ttgtaattaa ctggtgcttt 180" gaaaatcttt tttaagggag aaaaatctca accaaagtta tgctcatcca ctga 186" cctttgagtt aatttcagca caactcattc ttcagtgcct catgactgaa aacaaaaaac Z92" aaaaaaacga aagcatcttc acaatgaagc ttccagatag caccgttttg ctaaaagata Z98" cattctcatt gttttccaac agtgatggct tccacataag gttaaacaaa ctaggtgctt 504" gtaaataatt tattacagtt tactctatcg catttctgta aatg catgcccttc 510" ttcaggggaa gactgtggtc aagttaaaaa aaaaaaacaa tattaaacaa catgaaactg 516" cagtctgttt ttgaaaatga gaatgtccta agtgattcag ggag ggaagttgtg 522" cactctgaaa atgcatgaaa aacaaaggca aaaactagtg ggaaatgtgt agaactgtta 528" actgagatgg cttcgagtct tccttctgga atctgttaaa tttcacaaag tcatgagggt 534" aaatggagaa aatatttctg ggattacaat gaatgtaagc ccaaattgtg gaattgccag 160160 540" taacctggat ggggaaaagc atttcccata gcactccatg taatatgagt gctctgtgag 546" atgttcatca gtgttttata gaaatggtgt tgctgggaaa ccaagtttgc acctggaaac 552" ttacaatgca ctttagcgca gtaagggctt cggt agtgaaaaac tgtctaaccc 558" agcattgccc aaactatttt gacaccagga tctc gata cttatgaacc 564" tctcactaat gtcctgtgga gaacattttg ggaaacacta tgttagatag ttctttaagg 570" agacaaaacg gtaatgaaca actg gggcagaata tgcatgcatt ttgtaacgtc 576" cagtgtggcg agat ttcc gcag aaaataagca cagaaaatta 582" taatgtaggt agct ctttcctttg atagagagaa cagccccaat gatcctggct 588" ttttcactga acgtatcaga atacatggat gaattggggt aaataaggtt ttaattcaga 594" tctagaagaa agtattgtac gtttgaatgc agatttttat ccacagatag ttgtagtgtt 600" tagacatgac aggacctatc gttgaggttt ctta ctatgggctg tgtt 606" ttttaaaact attttagaaa cctgagactt gccgtctggc attttagttt aatacaaact 612" tgca tttgaaagag attcttgacc ctaa acgtctagag ctctgaaatg 618" tcttgatgga aggtattaaa ctatttgcct gttgtacaaa gaaatgttaa gactcgtgaa 624" aagaattact ataaggtact gtgaaataac tgcgattttg tgagcaaaac atacttggaa 630" atgctgattg atttttatgc ttgttagtgt attgcaagaa acacagaaaa tgtagttttg 636" ttttaataaa attg aacatacaaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_001231878.1 LOCUS NP_001231878 ACCESSION NP_001231878 l mdesaltlgt idvsylphss eysvgrckht seewgecgfr ptifrsatlk wkeslmsrkr 6" pfvgrccysc tpqswdkffn psipslglrn viyinethtr hrgwlarrls yvlfiqerdv 12" hkgmfatnvt envlnssrvq eaiaevaael npdgsaqqqs kavnkvkkka krilqemvat 18" vspamirltg wvllklfnsf fwniqihkgq lemvkaatet nlpllflpvh rshidyllLt 24" filfchnika pyiasgnnln ipifstlihk lggffirrrl rkdv lyrallhghi " velquqqfl eiflegtrsr sgktscarag llsvvvdtls tnvipdilii pvgisydrii 36" eghyngeqlg kpkkneslws vargvirmlr knygcvrvdf aqpfslkeyl esqsqkpvsa 42" llsleqallp ailpsrpsda tsin esrnatdesl rrrlianlae hilftasksc 161161 481 aimsthivac lllyrhrqgi dlstlvedff vmkeevlard fdlgfsgnse dvvmhaiqll 541 gncvtithts rndeffitps ttvpsvfeln fysngvlhvf acsl yavlnkrglg 601 gptstppnli sqeqlvrkaa sneg tislpcqtfy vgkf iqygiltvae 661 hddqedisps laequdkkl peplswrsde ededsdfgee qrdcylkvsq skehqqfitf 721 lqulgplle ayssaaifvh nfsgpvpepe quklhkyli trternvavy aesatyclvk 781 navkmfkdig vfketqurv svlelsstfl pqcnqulle yilsfvvl Official Symbol: TAZ Official Name: tafazzin Gene ID: 6901 Organism: Homo sapiens Other Aliases: XX-FW83563893, BTHS, CMD3A, EFE, EFE2, G4.5, LVNCX, Tazf Other Designations: protein G4.5 Nucleotide seguence: NCBI Reference Seguence: NM_000116.3 LOCUS NM_000116 ACCESSION NM_000116 1 tttccggcgg cggg ccggggtgcc agcgcccgcc ttcccgtttc ctcccgttcc 6" gcagcgcgcc cacggcctgt gaccccggcg accgctcccc agtgacgaga gagcggggcc 12" gggcgctgct ccggcctgac ctgcgaaggg acctcggtcc agtcccctgt tgcgccgcgc 18" cccctgtccg tccgtgcgcg ggccagtcag gtgt ctcgagcggt cgca 24" gacctagagg cgccccacag gCngCCng ggcgctggga gcgccggccg nggccgggt " ggggatgcct ctgcacgtga agtggccgtt ccccgcggtg ccgccgctca cctggaccct 36" cagc atgg gcttggtggg cacctacagc tgcttctgga acat 42" gaaccacctg accgtgcaca aggt gctgtacgag ctcatcgaga agcgaggccc 48" ggccacgccc ctcatcaccg tgtccaatca ccagtcctgc atggacgacc ctcatctctg 54" ggggatcctg aaactccgcc ggaa cctgaagttg atgcgttgga cccctgcagc 162162 60" tgcagacatc acca aggagctaca ctcccacttc ttcagcttgg gcaagtgtgt 66" gcctgtgtgc cgaggagcag aatttttcca agcagagaat gaggggaaag gtgttctaga 72" cacaggcagg cacatgccag gtgctggaaa aagaagagag aaaggagatg gcgtctacca 78" gatg gacttcattt tggagaagct caaccatggg gactgggtgc tccc 84" agaagggaaa gtgaacatga gttccgaatt cctgcgtttc ggaa gcct 90" gattgctgag tgtcatctca accccatcat cctgcccctg tggcatgtcg gaatgaatga 96" cgtccttcct aacagtccgc cctacttccc tgga cagaaaatca ctgtgctgat "02" cgggaagccc ttcagtgccc tgcctgtact cgagcggctc cgggcggaga acaagtcggc "08" tgtggagatg cggaaagccc tgacggactt agag cagc atctgaagac "14" tcaggcagag caca accacctcca gaga taggccttgc ttgctgcctt "20" ctggattctt ggcccgcaca gggc tgagggatgg actgatgctt ttagctcaaa "26" cgtggctttt agacagattt gttcatagac cctctcaagt gccctctccg agctggtagg "32" cattccagct cctccgtgct tcctcagtta cacaaaggac ctcagctgct actt "38" ggccaagcag ggaggaagaa gcttaggcag ggctctcttt ccttcttgcc ttcagatgtt "44" ctctcccagg ggctggcttc aggagggagc atagaaggca ggtgagcaac cagttggcta "50" ggggagcagg gggcccacca gagctgtgga gaggggaccc tcct cggcctggct "56" cctacccacc gcccttgccg aaccaggagc tgctcactac ctcctcaggg atggccgttg "62" gccacgtctt ccttctgcct gagcttcccc ccgaccacag gccctttcct caggcaaggt "68" ctggcctcag gtgggccgca ggcgggaaaa gcagcccttg gccagaagtc aagcccagcc "74" acgtggagcc tagagtgagg gcctgaggtc tggctgcttg cccccatgct ggcgccaaca "80" acttctccat cctttctgcc tctcaacatc acttgaatcc tagggcctgg gttttcatgt "86" ttttgaaaca gaaccataaa gcatatgtgt tggcttgttg taaaaaaaaa aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_000107.1 LOCUS NP_000107 ACCESSION NP_000107 1 mplhvkwpfp avppltwtla ssvvmglvgt yscfwtkymn hltvhnrevl yeliekrgpa 61 tplitvsnhq scmddphlwg ilklrhiwnl klmrwtpaaa dicftkelhs hffslgkcvp 121 vcrgaeffqa enegkgvldt grhmpgagkr quk gmdfilekln hgdwvhifpe 163163 181 sefl rfkwgigrli aechlnpiil plwhvgmndv lpnsppyfpr fquitvlig 241 kpfsalpvle rlraenksav emrkaltdfi qeefthktq aeqlhnhlqp gr Official Symbol: COL1A2 Official Name: collagen, type I, alpha 2 m: 1278 Organism: Homo sapiens Other Aliases: Ol4 Other Designations: alpha 2(l)-collagen; 2 type I collagen; en l, alpha-2 polypeptide; collagen alpha-2(l) chain; en of skin, tendon and bone, alpha-2 chain; type | procollagen Nucleotide seguence: NCBI Reference Seguence: NM_000089.3 LOCUS NM_000089 ACCESSION NM_000089 l gtgtcccata gtgtttccaa acttggaaag ggcgggggag ggcgggagga tgcggagggc 6" ggaggtatgc agacaacgag tcagagtttc cccttgaaag cctcaaaagt gtccacgtcc 12" tcaaaaagaa tggaaccaat ttaagaagcc agccccgtgg ccacgtccct tcccccattc 18" gctccctcct ctgcgccccc gcaggctcct cccagctgtg gctgcccggg gccc 24" cagccctccc attggtggag gcccttttgg aggcacccta gggccaggga aacttttgcc " gtataaatag ggcagatccg ggctttatta ttttagcacc gcag gaggtttcgg 36" ctaagttgga ggtactggcc gcat gcccgcgccc gccaggtgat gccg 42" gtgacccagg ggctctgcga cacaaggagt ctgcatgtct aagtgctaga catgctcagc 48" tttgtggata cgcggacttt gttgctgctt gcagtaacct tatgcctagc aacatgccaa 54" tctttacaag aggaaactgt aagaaagggc ccagccggag atagaggacc acgtggagaa 60" aggggtccac caggcccccc agat ggtgaagatg gtcccacagg ccctcctggt 66" ccacctggtc gccc ccctggtctc ggtgggaact ttgctgctca gtatgatgga 72" aaaggagttg gacttggccc tggaccaatg atgg gacctagagg cccacctggt 78" gcagctggag gccc tcaaggtttc caaggacctg ctggtgagcc tggtgaacct 164164 84" gg:caaactg gtcctgcagg tgctcgtggt ccagctggcc ctcctggcaa ggctggtgaa 90" ga:ggtcacc ctggaaaacc cggacgacct agag gagttgttgg accacagggt 96" ggtt tccctggaac actt ttca aaggcattag gggacacaat "02" gg:ctggatg gattgaaggg acagcccggt gctcctggtg tgaagggtga acctggtgcc "08" cc :ggtgaaa atggaactcc aggtcaaaca ggagcccgtg ggcttcctgg tgagagagga "l4" ggtg cccctggccc agctggtgcc Cgtggcagtg atggaagtgt gggtcccgtg "20" gg:cctgctg gtcccattgg gtctgctggc cctccaggct tcccaggtgc cccc "26" aagggtgaaa ttggagctgt tggtaacgct ggtcctgctg ccgg tccccgtggt "32" gaagtgggtc ttccaggcct cccc gttggacctc ctggtaatcc tggagcaaac "38" ggccttactg gtgccaaggg tgctgctggc cttcccggcg ttgctggggc cctc "44" cctggacccc gcggtattcc tggccctgtt ggtgctgccg gtgctactgg tgccagagga "50" cttgttggtg agcctggtcc agctggctcc aaaggagaga gcggtaacaa gggtgagccc "56" ggctctgctg ggccccaagg tcctcctggt cccagtggtg aagaaggaaa gagaggccct "62" aatggggaag ctgc cggccctcca ggacctcctg ggctgagagg tagtcctggt "68" tctcgtggtc ttcctggagc tgatggcaga gctggcgtca tgggccctcc tggtagtcgt "74" ggtgcaagtg gccctgctgg agtccgagga cctaatggag atgctggtcg ccctggggag "80" cctggtctca tgggacccag aggtcttcct ggttcccctg gaaatatcgg ccccgctgga "86" aaagaaggtc ctgtcggcct ccctggcatc aggc ctggcccaat tggcccagct "92" ggagcaagag gagagcctgg caacattgga ttccctggac ccaaaggccc cactggtgat "98" cc:ggcaaaa acggtgataa aggtcatgct ggtcttgctg gtgctcgggg tgctccaggt 204" cc:gatggaa acaatggtgc tcagggacct cctggaccac agggtgttca aggtggaaaa 210" gg:gaacagg gtccccctgg tcctccaggc ttccagggtc tgcctggccc tccc 216" gc:ggtgaag ttggcaaacc aggagaaagg ggtctccatg gtgagtttgg tctccctggt 222" cc :gctggtc ggga acgcggtccc ccaggtgaga gtgg1gctgc tact 228" gg:cctattg gaagccgagg tgga cccccagggc ctga:ggaaa caagggtgaa 234" cc:ggtgtgg ttggtgctgt gggcactgct tctg gtcc:agtgg actcccagga 240" gagaggggtg ctgctggcat acctggaggc aagggagaaa aggg:gaacc tggtctcaga 246" ggtgaaattg gtaaccctgg cagagatggt gctcgtggtg ctcc :ggtgc tgtaggtgcc 252" cctggtcctg ctggagccac aggtgaccgg ggcgaagctg gggc:gctgg tcctgctggt 258" cctgctggtc ctcggggaag ccctggtgaa Cgtggtgagg tcgg:cctgc tggccccaat 165165 264" ggatttgctg gtcctgctgg tggt caacctggtg ctaaaggaga aagaggagcc 270" aaagggccta agggtgaaaa cggtgttgtt ggtcccacag gccccgttgg agctgctggc 276" ccagctggtc caaatggtcc ccccggtcct gctggaagtc gtggtgatgg aggcccccc: 282" actg gtttccctgg tgctgctgga cggactggtc ccccaggacc ctctggtat: 288" tctggccctc ctggtccccc tggtcctgct gggaaagaag ggcttcgtgg tcctcgtgg: 294" gaccaaggtc cagttggccg aactggagaa gtaggtgcag ttggtccccc tggcttcgc: 300" aagg gtccctctgg agaggctggt actgctggac ctcctggcac tcc: 306" cagggtcttc ctcc tggtattctg ggtctccctg gctcgagagg tgaacgtgg: 312" ctaccaggtg ttgctggtgc tgtgggtgaa cctggtcctc ttggcat:gc cggccctcc: 318" ggggcccgtg gtcctcctgg tgctgtgggt agtcctggag tcaacgg:gc tcctggtgaa 324" gctggtcgtg atggcaaccc tgggaacgat ggtcccccag gtcgcga:gg tcaacccgga 330" cacaagggag agcgcggtta ccctggcaat attggtcccg ttggtgc:gc aggtgcacct 336" ggtcctcatg gccccgtggg tcctgc:ggc aaacatggaa accgtgg:ga aactggtcct 342" tctggtcctg ttggtcctgc tggtgc:gtt agag gtcctag:gg cccacaaggc 348" attcgtggcg gaga gcccgg:gaa aaggggccca gaggtct:cc aaag 354" ggacacaatg gattgcaagg tctgcc:ggt atcgctggtc accatgg:ga tcaaggtgct 360" cctggctccg tgggtcctgc tggtcc :agg ggccctgctg gtccttc:gg ccctgctgga 366" aaagatggtc gcactggaca tcctgg:aca gttggacctg ctggcat:cg aggccctcag 372" ggtcaccaag gccctgctgg cccccc:ggt ccccctggcc gacc tccaggtgta 378" ggtg actt tggttacgat ggagacttct acagggctga ccagcctcgc 384" tcagcacctt ctctcagacc caaggactat gaagttgatg ctactctgaa caac 390" aaccagattg agacccttct tactcctgaa ggctctagaa agaacccagc tcgcacatgc 396" cgtgacttga gactcagcca cccagagtgg agcagtggtt actactggat tgaccctaac 102" caaggatgca ctatggatgc tatcaaagta tactgtgatt tctctactgg cgaaacctgt 108" atccgggccc aacctgaaaa catcccagcc tggt gctc caag 114" aaacacgtct ggctaggaga aactatcaat gctggcagcc agtttgaata taatgtagaa 120" ggagtgactt ccaaggaaat ggctacccaa cttgccttca tgcgcctgct ggccaactat 126" gcctctcaga acatcaccta ccactgcaag aacagcattg catacatgga tgaggagact 132" ggcaacctga aaaaggctgt cattctacag ggctctaatg atgttgaact tgttgctgag 138" ggcaacagca ctta cactgttctt gtagatggct gctctaaaaa gacaaatgaa 166166 Z44" tggggaaaga caatcattga atacaaaaca aataagccat cacgcctgcc cttccttgat 150" attgcacctt tcgg tggtgctgac caggaattct ttgtggacat tggcccagtc 156" tgtttcaaat aaatgaactc aatt aaaaaagaaa gaaatttgaa aaaactttct Z62" ctttgccatt tcttcttctt cttttttaac tgaaagctga atccttccat ttcttctgca Z68" catctacttg cttaaattgt gggcaaaaga gaaaaagaag gattgatcag agcattgtgc Z74" aatacagttt cattaactcc ttcccccgct cccccaaaaa tttgaatttt aaca 180" ctcttacacc tgttatggaa aacc tttgtaagaa aaccaaaata aaaattgaaa 186" aataaaaacc ataaacattt gcaccacttg tggcttttga tcca cagagggaag Z92" tttaaaaccc aaacttccaa aggtttaaac tacctcaaaa cactt:ccca tgagtgtgat Z98" ccacattgtt aggtgctgac ctagacagag atgaactgag gttt tgttttgttc 504" ataatacaaa ggtgctaatt attt cagatacttg gttg atggtgctag 510" aagaatttga gaagaaatac tcctgtattg agttgtatcg tgtgg:gtat aaaa 516" atttgattta gcattcatat tttccatctt attcccaatt aaaag:atgc agattatttg 522" cccaaatctt cttcagattc agcatttgtt ctttgccagt ctcat:ttca tcttcttcca 528" tggttccaca gaagctttgt ttcttgggca aaaa ttaaa:tgta cctattttgt 534" atatgtgaga tgtttaaata aattgtgaaa aaaatgaaat aaagcatgtt tggttttcca 540" aaagaacata t Pnfleh1seguence: NCBI Reference Seguence: NP_000080.2 LOCUS NP_000080 ACCESSION 080 l mlsfvdtrtl lllavtlcla thslqeetv rkgpagdrgp pgpp grdgedgptg 6" ppgppgppgp pglggnfaaq ydgkgvglgp gpmglmgprg ppgaagapgp qgfquagep 12" gepgqtgpag argpagppgk agedghpgkp grpgergvvg pqgargfpgt pglpgfkgir 18" ghngldglkg quapgvkge pgapgengtp gqtgarglpg ergrvgapgp agargsdgsv 24" gpvgpagpig sagppgfpga pgpkgeigav gnagpagpag prgevglpgl sgpvgppgnp " gangltgakg aaglpgvaga pglpgprgip gpvgaagatg arglvgepgp agskgesgnk 36" gepgsagpqg ppgpsgeegk rgpngeagsa gppgppglrg Spgsrglpga mgpp 42" gsrgasgpag vrgpngdagr mgpr gnig pagkegpvgl pgidgrpgpi 167167 48" gpagargepg nigfpgpkgp tgdpgkngdk ghaglagarg apgpdgnnga qupgpqqu 54" ggkgquppg ppgfqglpgp sgpagevgkp gerglhgefg lpgpagprge rgppgesgaa 60" gptgpigsrg pdgn kgepgvvgav gtagpsgpsg lpgergaagi pggkgekgep 66" glrgeignpg rdgargapga vgapgpagat gdrgeagaag pagpagprgs pgergevgpa 72" gpngfagpag aagquakge rgakgpkgen gvvgptgpvg pngp pgpagsrgdg 78" gppgmtgfpg aagrtgppgp pgpp gpagkeglrg prgdquvgr tgevgavgpp 84" gfagekgpsg eagtagppgt pgpqgllgap gilglpgsrg erglpgvaga vgepgplgia 90" gppgargppg avgspgvnga pgeagrdgnp gndgppgrdg rgy pgnigpvgaa 96" gapgphgpvg pagkhgnrge tgpsgpvgpa gavgprgpsg pqgirgdkge pgekgprg;p 102" glkghnglqg lpgiaghhgd qgapgsvgpa gpsg rtgh pgtvgpagir 108" gpqghquag ppgppgppgp pgvsgggydf gydgdfyrad qprsapslrp kdyevdat;k 114" slnnqietll knpa rtcrdlrlsh yywi dpnqgctmda ikvycdfstg 120" etciraqpen ipaknwyrss kdkkhvwlge tinagsqfey nvegvtskem atqlafmrLl 126" anyasqnity hcknsiaymd eetgnlkkav ilqgsndvel vaegnsrfty tvlvdgcskk 132" tnewgktiie yktnkpsrlp fldiapldig gadqeffvdi gpvcfk LANIC1 OfibblSwnbokLAMC1 OfimmlNameHamungmnma1(kumenyLAMB2) Gene ID: 3915 Organism: Homo sapiens Other s: 81K3.1, LAMB2 Other Designations: S-LAM gamma; S-Iaminin subunit gamma; |aminin B2 chahu|amhflnsubunnganuna4;Ianunm-10subunnganunaflanunm-11subunn ganunaflanunm-2subunnganunaflanunm-Bsubunnganunaflanunm-4subunn ganunaflanunm-Gsubunnganunaflanunm-7subunnganunaflanunm-Ssubunh gamma; |aminin-9 subunit gamma Nucleotide seguence: NCBI Reference Seguence: NM_002293.3 LOCUS NM_002293 168168 ACCESSION NM_002293 1 gtgcaggctg ctcccggggt aggtgaggga gagg ngcgcgcgg gggcagtggt 6" cggcgagcag cgcggtcctc gctaggggcg cccacccgtc agtctctccg gcgcgagccg l2" ccgccaccgc ccgcgccgga cccc tgggccccca ggctcaagca gcgaagcggc 18" ctccggggga cgccgctagg cgagaggaac gcgccggtgc cttc gccgtgaccc 24" agcgtgcggg ngcgggatg agagggagcc atcgggccgc gccggccctg cggccccggg " ggcggctctg gcccgtgctg gccgtgctgg ngngCCgC cgcggcgggc tgtgcccagg 36" cagccatgga cgagtgcacg gacgagggcg ggcggccgca gcgctgcatg cccgagttcg 42" tcaacgccgc cttcaacgtg gtgg ccaccaacac gact gagg 48" aatactgtgt gcagaccggg gtgaccgggg tcaccaagtc ctgtcacctg tgcgacgccg 54" ggcagcccca cctgcagcac ggggcagcct tcctgaccga ctacaacaac gaca 60" ccacctggtg gcaaagccag accatgctgg ccggggtgca gtaccccagc tccatcaacc 66" tgca aaaa gcttttgaca tcacctatgt gcgtctcaag ttccacacca 72" gccgcccgga gagctttgcc atttacaagc ggga agacgggccc tggattcctt 78" accagtacta cagtggttcc tgtgagaaca cctactccaa ggcaaaccgc ggcttcatca 84" ggacaggagg ggacgagcag caggccttgt gtactgatga attcagtgac atttctcccc 90" tcactggggg caacgtggcc ttttctaccc tggaaggaag gcccagcgcc tataactttg 96" acaatagccc tgtgctgcag gaatgggtaa ctgccactga agta actcttaatc "O2" gcctgaacac ttttggagat gaagtgttta acgatcccaa agttctcaag tcctattatt "08" atgccatctc tgattttgct gtaggtggca gatgtaaatg taatggacac gcaagcgagt "l4" gtatgaagaa cgaatttgat aagctggtgt gtaattgcaa caca tatggagtag "20" actgtgaaaa tcct ttcttcaatg accggccgtg gaggagggca actgcggaaa "26" gtgccagtga atgcctgccc tgtgattgca atggtcgatc ccaggaatgc tacttcgacc "32" ctgaactcta tcgttccact gggg gccactgtac caactgccag gataacacag "38" atggcgccca ctgtgagagg tgccgagaga acttcttccg ccttggcaac aatgaagcct "44" gctcttcatg ccactgtagt cctgtgggct ctctaagcac acagtgtgat agttacggca "50" gatgcagctg taagccagga gtgatggggg acaaatgtga ccag cctggattcc "56" attctctcac tgaagcagga tgcaggccat gctcttgtga tggc agcatagatg "62" atat tgaaacagga agatgtgttt gcaaagacaa tgtcgaaggc ttcaattgtg "68" aaagatgcaa acctggattt tttaatctgg aatcatctaa tcctcggggt tgcacaccct 169169 174" gcttctgctt tgggcattct tctgtctgta caaacgctgt tggctacagt tcta 180" tctcctctac ctttcagatt gatg ggtggcgtgc ggaacagaga tctg 186" aagcatctct cgagtggtcc tctgagaggc aagatatcgc cgtgatctca gacagctact 192" ttcctcggta cttcattgct cctgcaaagt tcttgggcaa gcaggtgttg agttatggtc 198" agaacctctc cttctccttt cgagtggaca ggcgagatac ctct gcagaagacc 204" ttgtgcttga gggagctggc ttaagagtat ctgtaccctt gatcgctcag ggcaattcct 210" atccaagtga gaccactgtg aagtatgtct tcaggctcca tgaagcaaca gattaccctt 216" ggaggcctgc tcttacccct tttgaatttc agaagctcct cttg acctctatca 222" agatacgtgg gacatacagt gagagaagtg ctggatattt ggatgatgtc accctggcaa 228" gtgctcgtcc tgggcctgga gtccctgcaa cttgggtgga gtcctgcacc gtgg 234" gatatggagg gcagttttgt gagatgtgcc tctcaggtta cagaagagaa actcctaatc 240" ttggaccata cagtccatgt gtgctttgcg cctgcaatgg acacagcgag acctgtgatc 246" ctgagacagg tgtttgtaac tgcagagaca atacggctgg cccgcactgt gagaagtgca 252" gtgatgggta ctatggagat gcag gcacctcctc cgattgccaa ccctgtccgt 258" gtcctggagg ttcaagttgt gctgttgttc ccaagacaaa ggaggtggtg tgcaccaact 264" gtcctactgg caccactggt aagagatgtg agctctgtga tgatggctac tttggagacc 270" ccctgggtag aaacggccct gtgagacttt gccgcctgtg ccagtgcagt atcg 276" atcccaatgc agttggaaat tgcaatcgct tgacgggaga atgcctgaag tgcatctata 282" ctgg cttctattgt gaccggtgca aagacggatt aaat cccctggctc 288" ccaatccagc agacaaatgc aaagcctgca attgcaatct gtatgggacc atgaagcagc 294" agagcagctg taaccccgtg acggggcagt gtgaatgttt gcctcacgtg actggccagg 300" actgtggtgc ttgtgaccct ggattctaca agag tgggcaaggc tgtgagaggt 306" gtgactgcca gggc tccaccaatg ggcagtgtga catccgcacc ggccagtgtg 312" agtgccagcc cggcatcact ggtcagcact gctg tgaggtcaac gggt 318" ttggacctga aggctgcaaa ccctgtgact gtcatcctga tctt tcacttcagt 324" gcaaagatga tggtcgctgt gaatgcagag aaggctttgt gggaaatcgc tgtgaccagt 330" gtgaagaaaa ctatttctac aatcggtctt ggcctggctg ccaggaatgt ccagcttgtt 336" accggctggt aaaggataag gttgctgatc atagagtgaa gctccaggaa ttagagagtc 342" tcatagcaaa ccttggaact ggggatgaga tggtgacaga tcaagccttc agac 348" taaaggaagc ggaa gttatggacc tccttcgtga ggcccaggat gtcaaagatg 170170 354" ttgaccagaa tttgatggat cgcctacaga gagtgaataa cactctgtcc agccaaatta 360" gccgtttaca gaatatccgg aataccattg aagagactgg aaacttggct gaacaagcgc 366" gtgcccatgt agagaacaca gagcggttga ttgaaatcgc atccagagaa cttgagaaag 372" caaaagtcgc tgctgccaat gtgtcagtca ctcagccaga atctacaggg aaca 378" acatgactct tttggcagaa gaggctcgaa agcttgctga acgtcataaa caggaagctg 384" atgacattgt tcgagtggca aagacagcca atgatacgtc aactgaggca tacaacctgc 390" ttctgaggac actggcagga gaaaatcaaa cagcatttga gattgaagag cttaatagga 396" agtatgaaca agcgaagaac atctcacagg atctggaaaa tgcc cgagtacatg 102" aggaggccaa aagggccggt gacaaagctg tcta tgccagcgtg gctcagctga 108" gccctttgga ctctgagaca ctggagaatg aagcaaataa cataaagatg gaagctgaga 114" atctggaaca actgattgac cagaaattaa aagattatga ggacctcaga gaagatatga 120" gagggaagga acttgaagtc aagaaccttc tggagaaagg caagactgaa cagcagaccg 126" cagaccaact cctagcccga gctgatgctg ccaaggccct cgctgaagaa gctgcaaaga 132" agggacggga acaa gaagctaatg acattctcaa gaaa gattttgata 138" ggcgtgtgaa cgataacaag acggccgcag aggaggcact aaggaagatt cctgccatca Z44" accagaccat cactgaagcc aatgaaaaga ccagagaagc ccagcaggcc ctgggcagtg 150" ctgcggcgga tgccacagag gccaagaaca aggcccatga ggcggagagg atcgcgagcg 156" ctgtccaaaa cacc aagg cagaagctga aagaactttt gcagaagtta Z62" cagatctgga taatgaggtg aacaatatgt tgaagcaact gcaggaagca gagc Z68" taaagagaaa acaagatgac gctgaccagg acatgatgat ggcagggatg gcttcacagg Z74" ctgctcaaga agccgagatc aatgccagaa aaaa tact agcctcctca 180" gcattattaa tgacctcttg gagcagctgg ggcagctgga tacagtggac ctgaataagc 186" taaacgagat tgaaggcacc ctaaacaaag ccaaagatga ggtc agcgatcttg Z92" ataggaaagt cctg gagaatgaag ccaagaagca ggaggctgcc atcatggact Z98" ataaccgaga tatcgaggag atcatgaagg acattcgcaa tctggaggac aaga 504" ccttaccatc tggctgcttc ccgt ccattgaaaa gccctagtgt ctttagggct 510" ggaaggcagc atccctctga caggggggca gttgtgaggc cacagagtgc cttgacacaa 516" agattacatt tttcagaccc ccactcctct gctgctgtcc atgactgtcc ttttgaacca 522" gtca cagagtttaa gcaa catc cggg aacaaagggt 528" tttatctaat aaagtgtctc ttccattcac gttgctacct tacccacact ttcccttctg 171171 534" atttgcgtga ggacgtggca tcctacgtta agtg gcataagcac tgag 540" cccatgtatg ctggggtaga gcaagtagcc ctcccctgtc tcatcgatac cagcagaacc 546" tcctcagtct cagtactctt gtttctatga aggaaaagtt tggctactaa cagtagcatt 552" gtgatggcca gtatatccag tccatggata aagaaaatgc atctgcatct cctacccctc 558" ttccttctaa gcaaaaggaa ataaacatcc tgtgccaaag gtattggtca tttagaatgt 564" cggtagccat gtgc ttat tatgagtgta ggacactgag ccatccgtgg 570" gtcaggatgc aattatttat aaaagtctcc aggtgaacat ggctgaagat ttttctagta 576" tattaataat tgactaggaa gatgaacttt ttttcagatc tttgggcagc tgataattta 582" aatctggatg ggcagcttgc actcaccaat agaccaaaag acatcttttg atattcttat 588" aact gaag aaatagggat atgataacca ctaaaatttt gttttcaaaa 594" tcaaactaat tcttacagct tttttattag ttagtcttgg aactagtgtt aagtatctgg 600" cagagaacag ttaatcccta aggtcttgac aaaacagaag aaaaacaagc ctcctcgtcc 606" tagtcttttc tagcaaaggg ataaaactta gatggcagct tgtactgtca gaatcccgtg 612" tatccatttg ttcttctgtt ggagagatga gacatttgac ccttagctcc agttttcttc 618" ttcc caga atccctcaaa aaacattgtt tgccaaatcc tggtggcaaa 624" tacttgcact cagtatttca cacagctgcc aacgctatcg agttcctgca gatt 630" taaatccact ctaaaccttc cctctaagtg tagagggaag acccttacgt ggagtttcct 636" agtgggcttc tcaacttttg a:cctcagct ctgtggtttt aagaccacag tgtgacagtt 642" ccctgccaca cacccccttc c:cctaccaa cccacctttg agattcatat atagccttta 648" tgca actttgtact agca ggggcggggt ggggggaaag aaactattat 654" ctgacacact atta a:tatttcaa atttatattt ttgtgtgaat gttttgtgtt 660" ttgtttatca tgattataga a:aaggaatt tatgtaaata tacttagtcc tatttctaga 666" atgacactct gttcactttg c:caattttt cctcttcact ggcacaatgt atctgaatac 672" ctccttccct cccttctaga a:tctttgga ttgtactcca aagaattgtg ccttgtgttt 678" gcagcatctc cattctctaa aattaatata ttcc tccacaccca gccactgtaa 684" agaggtaact ctct tccattgcag tcctgatgat cctaacctgc agcacggtgg 690" ttttacaatg ttccagagca ggaacgccag aagc agga ttaggaaagt 696" ttgctgaaga ggatctttga cgccacagtg ggactagcca ggaatgaggg agaaatgccc 702" tttctggcaa ttgttggagc tggataggta agttttataa gggagtacat tttgactgag 708" cacttagggc atcaggaaca gtgctactta ctgatgggta gactgggaga ggtggtgtaa 172172 714" cttagttctt ccca cttcctgttt ccatctgctt tacc agagtttacc 720" acaagtgttt tgacgatata ctcctgagct ttcactctgc tgcttctccc aggcctcttc 726" tactatggca ggagatgtgg cgtgctgttg caaagttttc ttgt ttcctggcta 732" gttcatttca ttaagtggct acatcctaac atatgcattt ggtcaaggtt gcagaagagg 738" actgaagatt gactgccaag ctagtttggg tgaagttcac tccagcaagt ctcaggccac 744" aatggggtgg tttggtttgg tttcctttta actttctttt tgttatttgc ttttctcctc 750" cacctgtgtg gtatattttt taagcagaat tttatttttt aaaataaaag taca 756" agatgatacc ttaattacac tcccgcaaca cagccattat gtct agctccagtt 762" atctgtattt tatgtaatgt aattgacagg atggctgctg cagaatgctg gttgacacag 768" ggattattat actgctattt ttccctgaat ttttttcctt ccaa ctgtggacct 774" tttatatgtg ccttcacttt agctgtttgc cttaatctct acagccttgc tctccggggt 780" ggttaataaa atgcaacact tggcattttt atgttttaag aaaaacagta ttat 7861 aataaaatct gaatatttgt aacccttt Protein seguence: NCBI nce Seguence: NP_002284.3 LOCUS NP_002284 ACCESSION NP_002284 l mrgshraapa lrprgrlwpv lavlaaaaaa gcaqaamdec tdeggrpqrc mpefvnaazn 6" vtvvatntcg tppeeycvqt gvtgvtksch lcdagqphlq hgaafltdyn nqadttwqu l2" qtmlaquyp ssinltlhlg kafdityvrl kfhtsrpesf aiykrtredg pwipyqyysg 18" skan rgfirtggde qqalctdefs displtggnv afstlegrps aynfdnspvl 24" qewvtatdir vtlnrlntfg devfndpkvl ksyyyaisdf kcng hasecmknef " dklvcnckhn tygvdcekcl pffndrpwrr ataesasecl pcdcngrsqe cyfdpelyrs 36" tghgghctnc qdntdgahce rcrenffrlg nneacsschc spvgslstqc dsygrcsckp 42" gvmgdkcdrc qufhsltea gcrpcscdps gsidecniet grcvckdnve gfncerckpg 48" ffnlessnpr gctpcfcfgh avgy svysisstfq idedgwraeq rdgseaslew 54" sserqdiavi sdsyfpryfi apakflgkqv lsygqnlsfs frvdrrdtrl lega 60" glrvsvplia qgnsypsett vkyvfrlhea tdypwrpalt pfequllnn ltsikirgty 173173 661 sersagyldd vtlasarpgp gvpatwvesc tepvgygqqf cemclsgyrr etpnlgpysp 721 cvlcacnghs etcdpetgvc ncrdntagph cekcsdgyyg dstagtssdc qpcpcpggss 781 cavvpktkev vctncptgtt gkrcelcddg yfgdplgrnq pvrlcrlcqc sdnidpnavg 841 ncnrltgecl kciyntagfy gffg nplapnpadk ckacncnlyg tmqusscnp 901 vtgqceclph vtqucgacd pgfynlqsgq gcercdchal gstngchir tgqcecqui 961 tgthercev pegc kpcdchpegs lslqckddgr cecregfvgn rcdqceenyf 1021 ynrswpgcqe cpacyrlvkd kvadhrvqu eleslianlg tgdemvtdqa fedrlkeaer 1081 evmdllreaq dvkdvdqnlm drlqrvnntl lqni rntieetgnl aeqarahven 1141 terlieiasr elekakvaaa pest gdpnnmtlla eearklaerh kqeaddivrv 1201 aktandtste aynlllrtla genqtafeie elnrkyeqak nisqdlekqa arvheeakra 1261 gdkaveiyas vaqlspldse tleneannik meaenleqli qulkdyedl redmrgkele 1321 vknllekgkt eqqtadqlla radaakalae rdtl qeandilnnl vndn 1381 ktaaeealrk ipainqtite eaqq algsaaadat eaknkaheae riasavqkna 1441 tstkaeaert ldne vnnmlkqlqe aekelkrkqd dadqdmmmag qeae 1501 inarkaknsv tsllsiindl leqlgqldtv dlnklneieq tlnkakdemk vsdldrkvsd 1561 leneakkqea aimdynrdie eimkdirnle dirktlpsgc fntpsiekp SPRC Official Symbol: SPARC Official Name: secreted protein, acidic, cysteine-rich (osteonectin) Mi6678 Organism: Homo sapiens chgr Ali§§§§: ON Other Designations: BM-40; basement-membrane n 40; cysteine-rich protein; osteonectin; secreted protein acidic and rich in cysteine Nucleotide e: NCBI Reference Sequence: NM_003118.3 LOCUS NM_0031 1 8 ACCESSION NM_003118 1 gggagaagga ggaggccggg ggaaggagga gacaggagga ggagggacca cggggtggag 174174 6" taga cccagcccag agctctgagt ggtttcctgt tgcctgtctc taaacccctc 12" cacattcccg cggtccttca gactgcccgg cgct ctgcctgccg cctgcctgcc 18" tgccactgag ggttcccagc accatgaggg cctggatctt ctttctcctt tgcctggccg 24" ggagggcctt ggcagcccct cagcaagaag ccctgcctga tgagacagag gtggtggaag " aaactgtggc agaggtgact gaggtatctg tgggagctaa tcctgtccag gtggaagtag 36" gagaatttga tgatggtgca gaggaaaccg aagaggaggt ggtggcggaa aatccctgcc 42" agaaccacca ctgcaaacac ggcaaggtgt gcgagctgga caac atgt 48" gcgtgtgcca ggaccccacc agctgcccag cccccattgg tgag aaggtgtgca 54" acaa caagaccttc gactcttcct gccacttctt tgccacaaag tgcaccctgg 60" agggcaccaa ccac aagctccacc tggactacat ttgc aaatacatcc 66" ccccttgcct ggactctgag ctgaccgaat tccccctgcg catgcgggac tggctcaaga 72" acgtcctggt caccctgtat gagagggatg aggacaacaa ccttctgact gagaagcaga 78" agctgcgggt gaagaagatc catgagaatg agaagcgcct ggaggcagga gaccaccccg 84" tggagctgct ggcccgggac ttcgagaaga actataacat cttc cctgtacact 90" ggcagttcgg ccagctggac cagcacccca ttgacgggta cctctcccac accgagctgg 96" ctccactgcg tgctcccctc atccccatgg gcac cacccgcttt ttcgagacct "O2" gtgacctgga caag tacatcgccc tggatgagtg ggccggctgc ttcggcatca "08" agcagaagga tatcgacaag gatcttgtga tctaaatcca ctccttccac agtaccggat "l4" ttta accctcccct tcgtgtttcc cccaatgttt aaaatgtttg gatggtttgt "20" tgttctgcct ggagacaagg tgctaacata gatttaagtg aatacattaa cggtgctaaa "26" aatgaaaatt ctaacccaag acatgacatt cttagctgta acttaactat taaggccttt "32" tccacacgca ttaatagtcc catttttctc ttgccatttg tagctttgcc cattgtctta "38" catg ggtggacacg gatctgctgg gctctgcctt acat tgcagcttca "44" acttttctct ttagtgttct gtttgaaact aatacttacc gagtcagact ttgtgttcat "50" ttcatttcag ggtcttggct gcctgtgggc ttccccaggt ggcctggagg tgggcaaagg "56" gaagtaacag acacacgatg ttgtcaagga tggttttggg actagaggct cagtggtggg "62" agagatccct gcagaaccca agaa cgtggtttgc ctgaggctgt aactgagaga "68" aagattctgg ggctgtgtta tgaaaatata ctca ccca gttcatcacc "74" atttcctcct ttacctttca gtgcagtttc ttttcacatt aggctgttgg ttcaaacttt "80" tgggagcacg gactgtcagt tctctgggaa gtggtcagcg gcag ggcttctcct 175175 186" cctctgtctt ttggagaacc agggctcttc tcaggggctc tagggactgc gttt 192" cagccaggaa ggccaaaatc aagagtgaga tgtagaaagt tgtaaaatag aaaaagtgga 198" gttggtgaat cggttgttct ttcctcacat gatt gtcataaggt ttttagcatg 204" ttcctccttt tcttcaccct cccctttttt cttctattaa tcaagagaaa cttcaaagtt 210" aatgggatgg tcggatctca caggctgaga actcgttcac ctccaagcat ttcatgaaaa 216" agctgcttct tattaatcat acaaactctc accatgatgt gaagagtttc cctt 222" caaaataaaa agtaatgact tagaaactgc cttcctgggt gatttgcatg tgtcttagtc 228" ttagtcacct tattatcctg acacaaaaac acatgagcat acatgtctac acatgactac 234" acaaatgcaa acctttgcaa acacattatg cttttgcaca cacacacctg tacacacaca 240" tgtt tatacacagg gagtgtatgg ttcctgtaag cactaagtta gctgttttca 246" tttaatgacc tgtggtttaa cccttttgat cactaccacc attatcagca ccagactgag 252" cagctatatc cttttattaa tcatggtcat tcattcattc attcattcac aaaatattta 258" tgatgtattt cacc aggtcccatg ccaagcactg gggacacagt tatggcaaag 264" tagacaaagc atttgttcat ttggagctta gagtccagga ggaatacatt agataatgac 270" acaatcaaat ataaattgca agatgtcaca atga agggagagta ggagagacca 276" tgagtatgtg gagg acacagcatt gtgc gttc cgtacggcag 282" ccactaccca catgtaactt tttaagattt aaatttaaat tagttaacat tcaaaacgca 288" gctccccaat cacactagca acatttcaag agag ccatgcatga ttagtggtta 294" ccctattgaa taggtcagaa gtagaatctt atca cagaaagttc tattggacag 300" tgctcttcta gatcatcata agactacaga gcacttttca aagctcatgc atca 306" tgttagtgtc gtattttgag ctggggtttt cccc ttagagatag agac 312" ccaagaaatg tgctcaattg caatgggcca catacctaga tctccagatg tcatttcccc 318" tctcttattt taagttatgt taagattact aaaacaataa aagctcctaa aaaatcaaac 324" tgtattctgg cttc tacacagtgg gagggcgagc agtaggagag attggcccat 330" ctgg ccatttgagg aatgcaagcc cagcactagt ctcataatct ctaggaatct 336" gtagagagag gaattgaagt aaatttcagc attggctcat tcagtcattc ggcgacattc 342" atcaggtacc tgcaatgtgt taggggatct tatgagtagg cagcgtgcgt tgct 348" cccctggagc tttctaacat tctagcaggc agaccacaca taaatttgca atactgtttc 354" tgataaaaac gtgctgtaaa ggaaataaag cagagaacta tcatggaaaa aaaaaaaaaa 360" aaaa 176176 Protein seguence: NCBI Reference Seguence: NP_003109.1 LOCUS 109 ACCESSION NP_003109 1 mrawiffllc lagralaapq etev veetvaevte vsvganpqu evgefddgae 61 eteeevvaen pcqnhhckhg kvceldennt pmcchdpts cpapigefek vcsndnktfd 121 sschffatkc tlegtkkghk lhldyigpck yippcldsel tefplrmrdw lknvlvtlye 181 rdednnllte qulrvkkih enekrleagd hpvellardf eknynmyifp vhwquqldq 241 hpidgylsht elaplrapli pmehcttrff etcdldndky ialdewagcf giqudidkd 301 1vi P3H1 Official Symbol: LEPRE1 Official Name: Ieucine proline-enriched proteoglycan (Ieprecan) 1 m: 64175 Organism: Homo sapiens Other Aliases: PSECO109, GROS1, OI8, P3H1 Other Designations: growth suppressor 1; leprecan; Ieucine- and proline- enriched glycan 1; prolyl oxylase 1 tide seguence: NCBI Reference Seguence: NM_001146289.1 LOCUS NM_001146289 ACCESSION NM_001146289 1 atgcgccgcc cggcttggaa ggtggggctt cgcccggggg cgggccttcg ccgggggtag 61 gactccggcc ttggtggcgg gtggctggcg gttccgttag gtctgaggga gcgatggcgg 121 tacgcgcgtt gaagctgctg accacactgc tcgt ggccgctgcc gccg 181 aggtcgagtc cgaggcagga tggggcatgg ctga tctgctcttc gccgagggga 241 ccta cgcgcgcggg cccg gggtggtcct gagcatggaa cgggcgctgc 301 gctcccgggc agccctccgc gcccttcgcc tgcgctgccg cacccagtgt gccgccgact 177177 36" tcccgtggga gctggacccc gactggtccc ccagcccggc ccaggcctcg ggcgccgccg 42" ccctgcgcga cctgagcttc ttcgggggcc ttctgcgtcg ctgc ctgcgccgct 48" ggcc gccggccgcc ctca gcgaagagat ggagctggag ttccgcaagc 54" ggagccccta caactacctg caggtcgcct acttcaagat caacaagttg gagaaagctg 60" ttgctgcagc acacaccttc ttcgtgggca atcctgagca catggaaatg cagcagaacc 66" tagactatta ccaaaccatg gtga aggaggccga cttcaaggat cttgagactc 72" aaccccatat gcaagaattt cgactgggag tgcgactcta ctcagaggaa cagccacagg 78" aagctgtgcc ccacctagag gcggcgctgc actt tgtggcctat gaggagtgcc 84" gtgccctctg cgaagggccc tatgactacg atggctacaa ctaccttgag tacaacgctg 90" acctcttcca ggccatcaca gatcattaca tccaggtcct caactgtaag cagaactgtg 96" tcacggagct tgcttcccac ccaagtcgag agaagccctt tgaagacttc ctcccatcgc "02" attataatta tctgcagttt tata acattgggaa ttatacacag gctgttgaat "08" gtgccaagac ctatcttctc ttcttcccca atgacgaggt gatgaaccaa aatttggcct "l4" attatgcagc tgga caca ccagatccat cggcccccgt gagagtgcca "20" aggagtaccg acagcgaagc ctactggaaa aagaactgct tttcttcgct tatgatgttt "26" ttggaattcc ctttgtggat ccggattcat ggactccaga agaagtgatt cccaagagat "32" agaa acagaagtca gaacgggaaa cagccgtacg catctcccag gagattggga "38" accttatgaa ggaaatcgag gtgg aagagaagac caaggagtca ctggatgtga "44" gcagactgac ccgggaaggt ggccccctgc tgtatgaagg catcagtctc accatgaact "50" tcct gaatggttcc gtgg tgatggacgg cgtaatctct gaccacgagt "56" gtcaggagct gcagagactg accaatgtgg cagcaacctc aggagatggc taccggggtc "62" agacctcccc acatactccc aatgaaaagt tctatggtgt cactgtcttc ctca "68" agctggggca agaaggcaaa gttcctctgc ccca cctgtactac aacgtgacgg "74" agaaggtgcg gcgcatcatg tact tccgcctgga tacgcccctc tacttttcct "80" actctcatct ggtgtgccgc actgccatcg aagaggtcca ggcagagagg aaggatgata "86" gtcatccagt ccacgtggac aactgcatcc tgaatgccga gaccctcgtg tgtgtcaaag "92" agcccccagc ctacaccttc taca gcgccatcct ttacctaaat ggggacttcg "98" atggcggaaa cttttatttc actgaactgg atgccaagac cgtgacggca gaggtgcagc 204" ctcagtgtgg aagagccgtg ggattctctt ctga aaacccacat ggagtgaagg 210" ctgtcaccag ggggcagcgc tgtgccatcg ccctgtggtt caccctggac cctcgacaca 178178 216" gcgagcgggt gagagcagct cgagcgggac agggtgcagg cagatgacct ggtgaagatg 222" ctcttcagcc cagaagagat ggacctctcc caggagcagc ccctggatgc gggc 228" ccccccgaac ctgcacaaga ctca gaat cgaagcccaa ggatgagcta 234" tgacagcgtc caggtcagac ggatgggtga ctagacccat ggagaggaac gcac 240" tctgagctgg ccagcccctc ggggctgcag agcagtgagc ctacatctgc cactcagccg 246" aggggaccct gctcacagcc ttctacatgg tgctactgct cttggagtgg acatgaccag 252" acaccgcacc ccctggatct ggctgagggc tcaggacaca ggcccagcca cccccagggg 258" cctccacagg ccgctgcatg acagcgatac agtacttaag tgtctgtgta gacaaccaaa 264" gaataaatga ttcatggttt tttttacttg gtttgttcag acaatggaaa catt 270" ctgtcaaaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_001139761.1 LOCUS NP_001139761 ACCESSION NP_001139761 l mavralkllt tllavvaaas qaeveseagw gmvtpdllfa egtaayargd wpgvvlsmer 6" alrsraalra lrlrcrtqca adfpweldpd wspspaqasg aaalrdlsff ggllrraacl 12" rrclgppaah slseemelef rkrspynqu vayfkinkle kavaaahtff memq 18" qnldyyqtms fkdl qefr lgvrlyseeq pqeavphlea alqeyfvaye 24" ecralcegpy dydgynyley aitd nckq ncvtelashp srekpfedfl " ufa yynignytqa vecaktyllf fpndevmnqn layyaamlge ehtrsigpre 36" sakeyrqrsl lekellffay dvfgipfvdp dswtpeevip krlqequse retavrisqe 42" ignlmkeiet lveektkesl dvsrltregg pllyegislt mnskllngsq rvvmdgvisd 48" hecqelqut gdgy rgqtsphtpn ekfygvtvfk alklgqegkv plqsahlyyn 54" vtekvrrime syfrldtply fsyshlvcrt aieevqaerk ddshpvhvdn cilnaetlvc 60" vkeppaytfr dysailylng dfdggnfyft eldaktvtae vqpchravg nphg 66" vkavtrgqrc aialwftldp rhservraar agqgagr Official Symbol: COL6A1 179179 Official Name: collagen, type VI, alpha 1 Gene ID: 1291 Organism: Homo sapiens Other Aliases: OPLL Other Designations: alpha 1 (VI) chain (61 AA); collagen VI, alpha-1 polypeptide; collagen alpha-1(Vl) chain Nucleotide seguence: NCBI Reference ce: NM_001848.2 LOCUS NM_001848 ACCESSION NM_001848 l gctctcactc tggctgggag cagaaggcag cctcggtctc gcgg cggcggccca 6" ctctgccctg gccgcgctgt gtggtgaccg caggccccag acatgagggc tgct 12" ctgctgcccc tgctgctgca ggcctgctgg acagccgcgc aggatgagcc ggagaccccg 18" agggccgtgg ccttccagga cgtg gacctgttct ttgtgctgga tgag 24" agcgtggccc tgaggctgaa gccctacggg gccctcgtgg acaaagtcaa gtccttcacc " aagcgcttca tcgacaacct gagggacagg tactaccgct gtgaccgaaa cctggtgtgg 36" aacgcaggcg cgctgcacta cgag gtggagatca tccaaggcct cacgcgcatg 42" cctggcggcc gcgacgcact caaaagcagc gtggacgcgg tcaagtactt tgggaagggc 48" acctacaccg ctat caagaagggg ctggagcagc tcctcgtggg gggctcccac 54" ctgaaggaga ataagtacct gattgtggtg accgacgggc accccctgga gggctacaag 60" gaaccctgtg gggggctgga ggatgctgtg aacgaggcca agcacctggg cgtcaaagtc 66" ttctcggtgg ccatcacacc cgaccacctg gagccgcgtc tgagcatcat cgccacggac 72" cacacgtacc ggcgcaactt cacggcggct gactggggcc agagccgcga cgcagaggag 78" gccatcagcc tcga caccatcgtg gacatgatca aaaataacgt ggagcaagtg 84" tgctgctcct tcgaatgcca gcctgcaaga ggacctccgg ggctccgggg cgaccccggc 90" tttgagggag aacgaggcaa gccggggctc ccaggagaga agggagaagc cggagatcct 96" ggaagacccg gggacctcgg acctgttggg taccagggaa tgaagggaga aaaagggagc 102" Cgtggggaga agggctccag gggacccaag ggctacaagg gagagaaggg caagcgtggc 108" atcgacgggg tggacggcgt gaagggggag tacc tgcc aggctgcaag 114" cccg ggtttgacgg cattcaagga ccccctggcc ccaagggaga ccccggtgcc 180180 "20" tttggactga aaggagaaaa gggcgagcct ggagctgacg gggaggcggg gagaccaggg "26" agctcgggac gaga ccag ccgggagagc ctgggccccc cggagagaaa "32" ggagaggcgg gcgacgaggg agga cctgacggtg cccccgggga gcggggtggc "38" cctggagaga gaggaccacg ggggacccca ggcacgcggg gagg agaccctggt "44" gaagctggcc cgcagggtga tcagggaaga cccg ttggtgtccc tggagacccg "50" gctg gccctatcgg acctaaaggc taccgaggcg atgagggtcc cccagggtcc "56" gagggtgcca gaggagcccc aggacctgcc ggaccccctg gagacccggg gctgatgggt "62" gaaaggggag aagacggccc cgctggaaat ggcaccgagg gcttccccgg cttccccggg "68" tatccgggca acaggggcgc tcccgggata aacggcacga agggctaccc cggcctcaag "74" ggggacgagg gagaagccgg ggaccccgga gacgataaca acgacattgc accccgagga "80" gtcaaaggag ggta ccggggtccc gagggccccc agggaccccc aggacaccaa "86" ggaccgcctg ggccggacga atgcgagatt ttggacatca tcatgaaaat ttgc "92" tgtgaatgca agtgcggccc catcgacctc ctgttcgtgc tggacagctc agagagcatt "98" ggcctgcaga acttcgagat tgccaaggac ttcgtcgtca aggtcatcga ccggctgagc 204" cgggacgagc tggtcaagtt cgagccaggg cagtcgtacg ggt gcagtacagc 210" cacagccaga agca cgtgagcctg cgcagcccca gcatccggaa Cgtgcaggag 216" ctcaaggaag ccatcaagag cctgcagtgg atggcgggcg gcaccttcac gggggaggcc 222" ctgcagtaca cgcgggacca gctgctgccg cccagcccga acaaccgcat cgccctggtc 228" atcactgacg ggcgctcaga cactcagagg gacaccacac cgctcaacgt gctctgcagc 234" cccggcatcc aggtggtctc cgtgggcatc aaagacgtgt ttgacttcat ctca 240" gaccagctca atgtcatttc ttgccaaggc ccat cccagggccg gcccggcctc 246" tcgctggtca aggagaacta gctg ctggaggatg ccttcctgaa gaatgtcacc 252" gcccagatct acaa gaagtgtcca gattacacct gccccatcac gttctcctcc 258" ccggctgaca tcaccatcct cggc tccgccagcg tgggcagcca caactttgac 264" accaccaagc gcttcgccaa ggcc gagcgcttcc tcacagcggg ggac 270" cccgcccacg acgtgcgggt ggcggtggtg cagtacagcg gcacgggcca gcagcgccca 276" gagcgggcgt cgctgcagtt cctgcagaac tacacggccc tggccagtgc cgtcgatgcc 282" atggacttta tcaacgacgc caccgacgtc aacgatgccc tgggctatgt gacccgcttc 288" taccgcgagg cctcgtccgg cgctgccaag aagaggctgc tgctcttctc agatggcaac 294" tcgcagggcg ccacgcccgc tgccatcgag aaggccgtgc aggaagccca gcgggcaggc 181181 300" atcgagatct tggt cgtgggccgc aatg agccccacat ccgcgtcctg 306" gtcaccggca agacggccga gtacgacgtg gcctacggcg agagccacct gttccgtgtc 312" cccagctacc aggccctgct ccgcggtgtc ttccaccaga cagtctccag gaaggtggcg 318" ctgggctagc ccaccctgca cgccggcacc aaaccctgtc ctcccacccc tccccactca 324" tcactaaaca gagtaaaatg gaat gacc aacctgattc tttt 330" ttttaaggaa aagcttggaa agccaggaca caacgctgct gcctgctttg tgcagggtcc 336" tccggggctc agccctgagt tggcatcacc tgcgcagggc cctctggggc tcagccctga 342" gtca cctgcacagg gccctctgag gctcagccct gagctggcgt cacctgtgca 348" gggccctctg gggctcagcc ctgagctggc ctcacctggg ttccccaccc cgggctctcc 354" tgccctgccc tcctgcccgc cctccctcct gcctgcgcag ctccttccct aggcacctct 360" gtgctgcatc ccaccagcct gagcaagacg ccctctcggg gcctgtgccg cactagcctc 366" cctctcctct gtccccatag ctggtttttc ccaccaatcc tcacctaaca gttactttac 372" aattaaactc aaagcaagct cttctcctca gcttggggca gccattggcc tctgtctcgt 378" tttgggaaac caaggtcagg aggccgttgc agacataaat ctcggcgact cggccccgtc 384" gggt cctgctggtg accggcctgg accttggccc tacagccctg gaggccgctg 390" agca ctgaccccga cctcagagag tactcgcagg ggcgctggct gcactcaaga 396" ccctcgagat taacggtgct aaccccgtct gctcctccct cccgcagaga ctggggcctg 402" gactggacat gagagcccct tggtgccaca gagggctgtg taga aacaacgcaa 408" acctctcctt cctcagaata gtgatgtgtt cgacgtttta tcaaaggccc cctttctatg 414" ttcatgttag ttttgctcct tctgtgtttt tttctgaacc atatccatgt tgctgacttt 420" tccaaataaa ggttttcact cctctaaaaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference ce: NP_001839.2 LOCUS 839 ACCESSION NP_001839 1 mraarallpl llqacwtaaq depetprava fqdcpvdlff vldtsesval rlkpygalvd 61 kvksftkrfi yyrc drnlvwnaga veii qgltrmpggr dalkssvdav 121 kyfgkgtytd leql lvggshlken kylivvtdgh plegykepcg gledavneak 182182 18" hlgvkvfsva eprl htyr rnftaadwgq srdaeeaisq tidtivdmik 24" nnveqvccsf ecqpargppg lrgdpgfege rgkpglpgek geagdpgrpg dlgpvqugm 3o" kgekgsrgek gsrgpkgykg ekgkrgidgv dgvkgemgyp glpgckgSpg fdgiqupgp 36" kgdpgafglk gekgepgadg eagrpgssgp sgdegquep gppgekgeag degnpgpdga 42" pgerggpger gprgtpgtrg prgdpgeagp quqgregpv nggdpgeag pigpkgyrgd 48" egppgsegar gapgpagppg dpglmgerge dgpagngteg fpgfpgypgn rgapgingtk 54" gypglkgdeg eagdpgddnn diaprgvkga kgyrgpegpq gppghqupg pdeceildii 60" ceck cgpidllfvl dssesiglqn feiakdfvvk vidrlsrdel kaepgqsya 66" gvvqyshsqm qehvslrsps lkea ikslqwmagg tftgealqyt pspn 72" nrialvitdg rsdtqrdttp lnvlcspgiq vvsvgikdvf dfipgsdqln laps 78" qgrpglslvk enyaelleda flknvtaqic idkkcpdytc pitfsspadi tilldgsasv 84" gshnfdttkr erfl tagrtdpahd vrvavvqysg tgqqrperas qulqnytal 90" asavdamdfi ndatdvndal gyvtrfyrea ssgaakkrll lfsdgnsqga tpaaiekavq 96" eaqragieif vvvvgrqvne phirvlvtgk taeydvayge shlfrvpsyq 102" vsrkvalg ObelSwnbd:CRTAP Official Name: cartilage associated protein Gene ID: 10491 Organism: Homo sapiens Other Aliases: CASP, LEPRELB, O|7 Other Designations: cartilage-associated protein; leprecan-like 3 tide seguence: NCBI Reference Seguence: NM_006371.4 LOCUS NM_006371 ACCESSION NM_006371 l aggctggcgt ccccgccccg aaagcactgg gcccgccgcg tcgcaccgtc cctt 61 tccttctccc tccccttttc ccttccttcg tcccttcctt ccttcctttc gccgggcgcg 183183 l2" atggagccgg ggcgccgggg ggccgcggcg ctgctagcgc tgctgtgcgt ggcctgcgcg 18" ctgcgcgccg ggcgcgccca atacgaacgc tacagcttcc gcagcttccc acgggacgag 24" ccgc tcgagtcggc ctaccggcac gcgctggaca agtacagcgg cgagcactgg " agcg tgggctacct cagc ctgcggctgc accgcttgct cagc 36" ttct gccaccgcaa ctgcagcgcc gcgccgcagc ccgagcccgc cgccggcctc 42" gccagctatc ccgagctgcg cctcttcggg ggcctgctgc gccgcgcgca ctgcctcaag 48" cgctgcaagc agggcctgcc agccttccgc cagtcccagc ccagccgcga ggtgctggcg 54" gacttccagc gccgcgagcc ctacaagttc ttcg cttacttcaa taat 60" ctccccaaag ccatcgccgc tgctcacacc tttctactga agcatcctga tgacgaaatg 66" agga acatggcata ttataagagc ggtg ccgaggacta cattaaagac 72" ctggaaacca agtcatatga aagcctgttc atccgagcag tgcgggcata caacggtgag 78" aactggagaa tcac agacatggag ctggcccttc ccgacttctt caaagccttt 84" tacgagtgtc tcgcagcctg cgagggttcc agggagatca aggacttcaa ggatttctac 90" ctttccatag cagatcatta tgtagaagtt ctggaatgca aaatacagtg tgaagagaac 96" ctcaccccag gagg ctatccggtt gagaaatttg tggctaccat ttac "O2" ttgcagtttg cctattataa gttgaacgac ctgaagaatg cagccccctg tgcagtcagc "08" tatctgctct ttgatcagaa tgacaaggtc atgcagcaga acctggtgta ttaccagtac "l4" cacagggaca cttggggcct ctcggatgag cacttccagc ccagacctga agcagttcag "20" ttctttaatg tgaccacact ccagaaggag ctgtatgact ttgctaagga aaatataatg "26" gatgatgatg aagt tgtggaatat gtggatgacc tcttggaact ggaggagacc "32" agctagccca cagcaaccaa agagacttcc tcttggcgtt caggaaacac tttg "38" tccttttccc aacagcccag gctgttgata cctcagagcc ttac tctccaaagt "44" gaaagggaag cccccgtctc tctaactgca tgtcatcagg ggtgagcctg cctttcctat "50" cttcacacct gccacctcat gttcacacct atctttctca cctttttttt gagatggagt "56" ctcgctctct tgcccaggct ggagtgcaat ggcacgttct cagctcactg caacctccgc "62" ctcttgggtt caagcaattc tgctgcatca gcctcccgag tacctgggat tacaggcatg "68" cacg cccggctaat tttgtatttt tagtagagac ggggttttgc catgttggcc "74" aggctggtct cgaactcttg acttcagatg atccatctgc cttggcctcc gctg "80" ggattacagg cgtgagccac catgcccggc ctctttctca cctttacacc tgtcttctta "86" tcctcacatc tgttttcaca tccc tgtcttcctc atgttcacac ttgtcttccc 184184 192" catgttcata gctgcctttc ttaccatttt ggtttgaagg gcagtcttct ctggcttgtt 198" tttttgtttt tcccagaaaa ttat tttttaaata agaaaaacat tcctagaaga 204" tgataattgt ctcc tttggcttat ttgcttttcc agattttagt ctcctttctc 210" cccatccggg aaagatggtg gaagacatag gctaaatttc tccagcctca caatggtctt 216" cacttggtct gacttgtacc aattctagca cccactgaaa aacaagttga gtagagagtg 222" gcag aaatgtggct tttgccccac tttgcatctc caaaattaca acggttggcc 228" gatcccattt gaggacaatg cttagttata agtctccgag ttggaaaagg aagaaagcca 234" gagctgtcta gtttcattca ttctttcagt aaatatttat ccta ctgtgtgcta 240" ggcattgacc tgggaactag agatacttca aaca gggaaagttc cctgtgctca 246" tggagcttac attctacagg gagaaagaga tagccaatac ataggaataa atatatacaa 252" ggtatcatgt agtgataatt gctgtggaga aaaataaagc aggggaggga gtaagaaatc 258" ctggagatga ggctgcagtt ttaaatgggg cctcactggg aatgtgacgt tgagcagaga 264" cgttagggaa gtggatcctg gacaaggcat tccaggcaga ggaacaggat gtgcactgcc 270" ccaaagtgag aacttgctct acgtggtcag gaaagagcag caag cagagtcgtg 276" ggcaggggta gaatggaagg ggct ggggaggaca ggtggtggag ggccttggct 282" tctgctaagt ggaa ccactggagg gtttgaacag gcct tgattgattt 288" atattttgca agggtcattc tagctgccat attgtgaaaa actttagtgg acaagggcag 294" aaggaagagg gaagacctgt taggaagcta ctgcaaggtt ttgg gcca 300" cagcaacagc caaa tatctagatt gaaa agagccaata ggatttgctg 306" agagtttgaa tgtggagtgt aagagaagga agagttaatg atgacattaa ggtttttggc 312" agca atgg agttaccagt tactgaaata gggaaggatg ggctgggtaa 318" gtatggaatt tggtgcaaag caggctgtct gtggttggaa tgggaggttc tggctgcaaa 324" tcaaagtgga gagttctctc ggtc tgcagcagag ctcgagacag ggatctgaat 330" gcacttggtt tattgttggg ggtgctctca acct gtgaaagcct ttatcagtca 336" tttattggct gtgagaagtt ctctgggagt gtgggtacat ttgaaggcaa gtgacttcag 342" ttgagggcaa gtctctggaa aagaggctgt aggcatctgg cagctaccat gcatggtagt 348" gtgttggggg tgggggtcct gggcactggc tgtgtgaagg gatctggcag ggcaccacag 354" cgccccctac tgaaccatca gcatgtcagt ggcatttaaa gccatgcagc tggaggggcc 360" actgagattg tctctgagta ttactgagaa gcaacagaaa agagccatgg atggagccct 366" tgggctctct tggg aaatcagcca aaggactgag aaggagttac cttaaggtca 185185 372" gagaaaacca agagagtgtg gtgttctgga agctgagctt tctttattca acctcattcc 378" cttctccaaa taagccactt gtgtagttgg gcccctccag ggttgaaggc aagaggagaa 384" aggcacagcg tttgggaaac aagacttttc ctgcaatagc ctgggaagga ataaaaggat 390" agagtgtttg ggtttttgtg taatggtggt taattggggt ggaacactca cacgttgtgc 396" tggg cttcccttat cccccagaac ccaa cctcggggaa ctcgggcaca 102" tccttctgtt tctccttcag ctctatcctg ctttcctcat cccttctgac accacgtcct 108" cactcacctg aatc tcag gttctccttt gagggtaccc acccaggaca 114" gtcccctacc acttctgtct tgggctgaag ttgcccacgt ccacaaaatc tgtactccca 120" gcgggggtgt ttggcccgag gagtcagtgt tattactggt ggatgcaccg tgtccacagc 126" caat cccagcgatg atct tacgtggctt cctgctgggg gagatggcct 132" tcacccacgg gatgccgggt tctcctttct ttcctcaccc caacctttac agag 138" cctt ttgaactcag tggggaagag ggtgatgaga caggactaga aagtagtggg Z44" ggacccagcg agtggacgcc ctgctccggg attcctgagt ctgtaaatag tgtgcccagc 150" agctgtgaac tccccttata gcctcaggct gcagtgtcct tcccagctgt gtgagaaaat 156" gaaagccgac gtccacaggg acccaggcag ggttgggtgt tgtgactcac tccacctctg Z62" gcag aggtactgtt gggtccttgt cttgtgagcc tggggtgagc tctctgtaca Z68" tgttgttgtt ccacgtatgg gttgacttgg gggg cgtt cactctctga Z74" agttggcctc ctttcactgg ggattgaaaa gcacctccac ccctacccta gtgatgtccc 180" ctgaggaccc gggtgatagt acagtcaata ttgtcagtac tttgctttga gctg 186" tagagctgag ttaccaaaat ttctatttca aaggaaacca aaccttaaaa aaaaaaaaca Z92" gggc tgggtcttcc acca cctg gtgtgcaggc tgcactcaat Z98" gacctcaacc caacacctcc gtgc ttcttggaag agcctagaag attcctggat 504" ggagacccca ttggttcagc ctcaagtctg gcccgtcttc gaaaaaacaa acacatttgt 510" aagctttgtg ggagcttcca ggcctgctct aagatgcctt gcttgtcctt tgacccatca 516" gcatggagct ttgc tgtttggttc tgcaggctgg tggggaggcc gcccatcgtg 522" gtggggcatc tgtccagccc cattgccact cagggcatcc aaacaggagg cacccgctgg 528" gaagggtcta aagatactcc ttgtggccac tgctactgtt cacacttgac ttgtggagaa 534" gcgaagggct aggt ttgtgtacac ccatgtattt aaaagtgact gactgactga 540" aatgagcaca taccgacata tgcaacatac taataccttc ctgattttcg agactttcta 546" attactacaa ctaacctgtt gtgctcacct ctggaattca gaaagagagc cactgcgagc 186186 552" actgaccaca agggctgcct ggaa atgtgtgctt tcaggagttc ttattagggg 558" aattttaaga gcacaaaaat tttattgaac ccaccttagt gacaaacaga aaatgatttg 564" ttagattgtc agttggaagg ggtttattat gactgctggt gaaataaaca gtgaccagtt 570" tctgccaaca tttatggaaa taacgtttct aggttttaaa tgtgagccgt aaactgaagc 576" tacttcagtt aaaaaaaaaa aaatcaatac tgta gggaaaaggt ggattggtgc 582" cagagaaaac attatttaat agtgtcagaa catggaactg caacacagtt tgtagcaagg 588" cactgagaaa aacccacaac taatccttca tcgcagctgt ctgaactctt gatcatctgc 594" catcccccaa acagtgactt ctttttttct tcca ggcctgaatg acctagtgtg 600" gagagtttaa actatgtaca agggaggaaa gaaaaaaagg aaaggaacct agcc 606" tcagccaaag gttttatgca ttggacttcc tgtgcctgcc cccaggggca atcc 612" ccatgcctgt gcccatgaca tctccctgaa agaggacacc atgacagccc ggctttgcct 618" tgactgaccc actgctaccc cagaaataag aatcaagagc agctattgtt atccttagag 624" tgttttccgt ct agggccgg aaca gccacatatc cttgcacctg acactgtccc 630" cacacaaata gctggctttc gttgcttgtt gaatgaatga gtgagttggc tctatatccc 636" cttggagctg gccggtaaga tattagtgcc tcattttaca agaagagaat aggaaggaat 642" attt ggccaacaga tacaagatag attccagagt tttatctccc actttagggt 648" ggcagccagt aggccaaact ccaaagaccg ttgctgatgt ctttttctgc ctcccctctt 654" tgggttagtg tggtatgtac aagctcactc ttgttgaaaa ttagaaaata gttgaaaaca 660" aaaggttttt gtttttcttt caca ttaaatgttt tacattgctt aaaaaaaaaa 666" aaaaaaaa n seguence: NCBI Reference Seguence: NP_006362.1 LOCUS 362 ACCESSION NP_006362 1 mepgrrgaaa llallcvaca lragraqyer ysfrsfprde lmplesayrh gehw 61 aesvgyleis lrlhrllrds eafchrncsa apqpepaagl asypelrlfg hclk 121 rckqglpafr qsqpsrevla dfqrrepykf quayfkann lpkaiaaaht fllkhpddem 181 yyks yikd letksyeslf iravraynge nwrtsitdme lalpdffkaf 241 yeclaacegs reikdfkdfy yvev leckiqceen ltpviggypv ekfvatmyhy 187187 301 quayyklnd lknaapcavs yllqundkv mqqnlvyyqy hrdtwglsde hfqprpeavq 361 ffnvtthke lydfakenim dddegevvey vddlleleet s SERPH Official Symbol: SERPINH1 Official Name: serpin peptidase inhibitor, clade H (heat shock protein 47), member 1, (collagen binding n 1) Gene ID: 871 Organism: Homo sapiens Other Aliases: PlG14,AsTP3, CBP1, CBP2, HSP47,0|10, PPROM, RA-A47, SERPINH2, gp46 Other Designations: 47 kDa heat shock protein; arsenic-transactivated protein 3; cell proliferation-inducing gene 14 protein; colligin-1; in-2; toid arthritis antigen A-47; rheumatoid arthritis-related antigen RA-A47; serine (or cysteine) proteinase inhibitor, clade H (heat shock protein 47), member 1, gen binding n 1); serine (or ne) nase inhibitor, clade H (heat shock protein 47), member 2, (collagen-binding protein 2); serpin H1 Nucleotide seguence: NCBI Reference Seguence: 207014.1 LOCUS NM_001207014 ACCESSION NM_001207014 1 agtaggaccc aggggccggg aggcgccggc agagggaggg gccgggggcc ggggaggttt 6" tgagggaggt ctttggcttt ttttggcgga gctggggcgc cctccggaag caac l2" tttccagaag ggga cgggcaggag ggggtgggga ctgccatata tagatcccgg 18" gagcagggga gcgggctaag agtagaatcg tgtcgcggct cgagagcgag agtcacgtcc 24" cggcgctagc ccagcccgac ccagaatgaa aaaggcaggc attgacctcc ctctgaggca " gtttccaggc ccaccgtggt gcacgcaaac cacttcctgg ccatgcgctc cctcctgctt 36" ctcagcgcct tctgcctcct ggcc ctggccgccg aggtgaagaa acctgcagcc 42" gcagcagctc ctggcactgc ggagaagttg agccccaagg cggccacgct tgccgagcgc 48" agcgccggcc tggccttcag cttgtaccag gccatggcca aggaccaggc agtggagaac 54" atcctggtgt cacccgtggt ggtggcctcg tcgctagggc tcgtgtcgct gggcggcaag 60" gcgaccacgg cgtcgcaggc caaggcagtg ctgagcgccg agcagctgcg cgacgaggag 188188 66" gtgcacgccg gcctgggcga gctgctgcgc tcactcagca actccacggc gcgcaacgtg 72" acctggaagc tgggcagccg actgtacgga cccagctcag tgagcttcgc tgatgacttc 78" gtgcgcagca gcaagcagca ctacaactgc gagcactcca agatcaactt ccgcgacaag 84" cgcagcgcgc tgcagtccat caacgagtgg gccgcgcaga ccaccgacgg caagctgccc 90" gaggtcacca aggacgtgga gcgcacggac ggcgccctgc tagtcaacgc catgttcttc 96" cact gggatgagaa attccaccac gtgg acaaccgtgg cttcatggtg "02" actcggtcct ataccgtggg tgtcatgatg atgcaccgga caggcctcta caactactac "08" gacgacgaga aggaaaagct gcaaatcgtg gagatgcccc tggcccacaa gctctccagc "14" ctcatcatcc tcatgcccca tcacgtggag cctctcgagc gccttgaaaa gctgctaacc "20" aaagagcagc tctg gaag atgcagaaga aggctgttgc catctccttg "26" cccaagggtg tggtggaggt gacccatgac ctgcagaaac acctggctgg gctgggcctg "32" actgaggcca ttgacaagaa caaggccgac ttgtcacgca tgtcaggcaa gaaggacctg "38" tacctggcca gcgtgttcca cgcc ttgg acacagatgg caaccccttt "44" gaccaggaca tctacgggcg cgaggagctg cgcagcccca agctgttcta cgccgaccac "50" cccttcatct tcctagtgcg ggacacccaa tccc tgctattcat tgggcgcctg "56" gtccggccta agggtgacaa gatgcgagac gagttatagg gcctcagggt gcacacagga "62" tggcaggagg aagg ctcctgagac acatgggtgc tattggggtt gggggggagg .68" tgaggtacca gccttggata ctccatgggg tgggggtgga aaaacagacc ggggttcccg "74" tgtgcctgag cggaccttcc cagctagaat tcactccact tggacatggg ccccagatac "80" catgatgctg agcccggaaa ctccacatcc tgtgggacct gggccatagt cattctgcct "86" gccctgaaag tcccagatca agcctgcctc aatcagtatt catatttata gccaggtacc "92" ttctcacctg tgagaccaaa ttgagctagg ggggtcagcc cttc tgacactaaa "98" acacctcagc tgcctcccca tccc aacctctccc aactataaaa ctaggtgctg 204" cagcccctgg gaccaggcac ccccagaatg acctggccgc agtgaggcgg attgagaagg 210" agctcccagg ttct actc tggtcaagaa gcatcgtgtc tgtg 216" aact ttttgttttg tttcttcctt ttttagttct tcaaagatag ggagggaagg 222" gggaacatga gcctttgttg ctatcaatcc aagaacttat ttgtacattt tttttttcaa 228" tttt ccaatgacat tttgttggag cgtggaagaa aaaa aaa seguence: NCBI Reference Seguence: NP_001193943.1 189189 LOCUS N P_001 193943 ION NP_001193943 1 mrsllllsaf clleaalaae vkkpaaaaap gtaeklspka atlaersagl afslyqamak 61 dqavenilvs pvvvasslgl vslggkatta sqakavlsae qlrdeevhag lgellrslsn 121 starnvtwkl gsrlygpssv vrss kqhyncehsk infrdkrsal qsinewaaqt 181 tdgklpevtk dvertdgall vnamffkphw dekfhhkmvd nrgfmvtrsy tvgvmmmhrt 241 glynyyddek equivempl ahklssliil mphhvepler leklltkeql kiwmgkmqkk 301 avaislpkgv vevthdlqkh laglglteai dknkadlsrm sgkkdlylas vfhatafeld 361 udi ygreelrspk lfyadhpfif lvrdtqsgsl lfigrlvrpk gdkmrdel |TB1 Official : ITGB1 Official Name: integrin, beta 1 (fibronectin receptor, beta polypeptide, antigen CD29 includes MDF2, MSK12) Gene ID: 3688 Organism: Homo sapiens Other Aliases: RP11-479G22.2, CD29, FNRB, GPIIA, MDF2, MSK12, VLA- BETA, VLAB Other Designations: in VLA-4 beta subunit; integrin beta-1 ; very late activation protein, beta ptide tide seguence: NCBI Reference ce: NM_002211.3 LOCUS NM_00221 1 ACCESSION NM_002211 1 atcagacgcg cagaggaggc ggggccgcgg ctggtttcct gccggggggc ggctctgggc 61 cgccgagtcc cctcctcccg agga ggaggagccg ccgccacccg ccgcgcccga 121 cacccgggag gccccgccag cccgcgggag aggcccagcg ggagtcgcgg aacagcaggc 181 ccgagcccac CgCgCnggC cccggacgcc gcgcggaaaa gatgaattta caaccaattt 241 tctggattgg actgatcagt tcagtttgct gtgtgtttgc tcaaacagat gaaaatagat 301 gtttaaaagc aaatgccaaa tcatgtggag aatgtataca agcagggcca aattgtgggt 190190 36" ggtgcacaaa ttcaacattt ttacaggaag gaatgcctac ttctgcacga gatt 42" tagaagcctt aaaaaagaag ggttgccctc cagatgacat tccc agaggctcca 48" aagatataaa gaaaaataaa aatgtaacca accgtagcaa aggaacagca gagaagctca 54" agccagagga tattactcag atccaaccac agcagttggt tttgcgatta agatcagggg 60" agccacagac atttacatta aaattcaaga gagctgaaga catt gacctctact 66" accttatgga cctgtcttac tcaatgaaag tgga gaatgtaaaa agtcttggaa 72" cagatctgat gaatgaaatg aggaggatta cttcggactt cagaattgga tttggctcat 78" ttgtggaaaa gactgtgatg atta gcacaacacc agctaagctc aggaaccctt 84" gcacaagtga acagaactgc accagcccat acaa aaatgtgctc agtcttacta 90" ataaaggaga agtatttaat gttg gaaaacagcg tgga aatttggatt 96" ctccagaagg tggtttcgat gccatcatgc aagttgcagt ttgtggatca ggct "02" ggaggaatgt tacacggctg ctggtgtttt atgc cgggtttcac tttgctggag "08" atgggaaact tggtggcatt gttttaccaa atgatggaca atgtcacctg gaaaataata "l4" tgtacacaat gagccattat tatgattatc ttgc tcaccttgtc cagaaactga "20" gtgaaaataa tattcagaca atttttgcag ttactgaaga atttcagcct gtttacaagg "26" agctgaaaaa cttgatccct aagtcagcag taggaacatt atctgcaaat tctagcaatg "32" taattcagtt tgat gcatacaatt ccctttcctc agaagtcatt ttggaaaacg "38" gcaaattgtc agaaggcgta acaataagtt acaaatctta ctgcaagaac ggggtgaatg "44" gaacagggga aaatggaaga aaatgttcca atatttccat tggagatgag gttcaatttg "50" aaattagcat aacttcaaat aagtgtccaa aaaaggattc tgacagcttt aaaattaggc "56" ctctgggctt tacggaggaa gtagaggtta ttcttcagta catctgtgaa tgtgaatgcc "62" aaagcgaagg catccctgaa agtcccaagt gtcatgaagg aaatgggaca tttgagtgtg "68" gcgcgtgcag gtgcaatgaa gggcgtgttg gtagacattg tgaatgcagc acagatgaag "74" ttaacagtga agacatggat gcttactgca ggaaagaaaa cagttcagaa atctgcagta "80" acaatggaga gtgcgtctgc ggacagtgtg tttgtaggaa gagggataat acaaatgaaa "86" tttattctgg ctgc gagtgtgata atttcaactg tgatagatcc aatggcttaa "92" tttgtggagg aaatggtgtt tgtc gtgtgtgtga gtgcaacccc actg "98" gcagtgcatg tgactgttct ttggatacta gtacttgtga caac ggacagatct 204" gcaatggccg gggcatctgc gagtgtggtg agtg tacagatccg aagtttcaag 210" ggcaaacgtg gtgt cagacctgcc ttggtgtctg tgctgagcat aaagaatgtg 191191 216" ttcagtgcag agccttcaat aaaggagaaa agaaagacac atgcacacag gaatgttcct 222" attttaacat taccaaggta gaaagtcggg acaaattacc ccagccggtc caacctgatc 228" ctgtgtccca ttgtaaggag aaggatgttg gttg gttctatttt acgtattcag 234" tgaatgggaa caacgaggtc atggttcatg ttgtggagaa tccagagtgt cccactggtc 240" cagacatcat tccaattgta gtgg ttgctggaat tgttcttatt ggccttgcat 246" tactgctgat atggaagctt ttaatgataa ttcatgacag aagggagttt gctaaatttg 252" aaaaggagaa aatgaatgcc aaatgggaca cgggtgaaaa tcctatttat aagagtgccg 258" taacaactgt ggtcaatccg aagtatgagg gaaaatgagt actgcccgtg caaatcccac 264" aacactgaat gcaaagtagc aatttccata gtcacagtta ttta gggcaatatt 270" gccatggttt tactcatgtg caggttttga aaatgtacaa tatgtataat ttttaaaatg 276" tatt ttgaaaataa tgttgtaatt catgccaggg actgacaaaa gacttgagac 282" aggatggtta ctcttgtcag ctaaggtcac attgtgcctt tttgaccttt tcttcctgga 288" ctattgaaat caagcttatt ggattaagtg atatttctat agcgattgaa agggcaatag 294" taat gagcatgatg agagtttctg ttaatcatgt attaaaactg atttttagct 300" atat gtcagtttgc agttatgcag aatccaaagt aaatgtcctg ctagctagtt 306" aaggattgtt ttaaatctgt tattttgcta tttgcctgtt agacatgact gatgaca:at 312" ctgaaagaca agtatgttga gagttgctgg atac gtttgaaata c:ac 318" aaaggccatg ggaaaaattc agagagttag gaaggaaaaa ccaa:agctt taaaacc:gt 324" gtgccatttt aagagttact taatgtttgg taacttttat gcct:cactt tacaaat:ca 330" agccttagat aaaagaaccg agcaattttc aaag tcct:gattt agcacta:tt 336" aggc catactttac aaagtatttg ctgaatgggg tgag ttgaatt:at 342" attt ttattttgtt taatgtctgg tgctttctgt cacc :cttct aatcttt:aa 348" tgtatttgtt tgcaattttg gggtaagact ttttttatga gtac:ttttc g:tt 354" tcaa tttgcctttt taatgaacat gtgaagttat actg:ggcta tgcaacagct 360" ctcacctacg cgagtcttac tttgagttag tgccataaca gaccactgta tgtttacttc 366" tcaccatttg agttgcccat cttgtttcac caca ttcttgtttt cttt 372" agttttaaca gttcactttt tacagtgcta tttactgaag ttatttatta aatatgccta 378" aaatacttaa atcggatgtc ttgactctga tgtattttat caggttgtgt aatt 3841 tttatagatt aaagaagttg aggaaaagca aaaaaaaaa Protein seguence: 192192 NCBI Reference Seguence: NP_002202.2 LOCUS NP_002202 ACCESSION 202 l mnlqpifwig lissvccvfa qtdenrclka nakscgeciq agpncgwctn stflqegmpt 6" sarcddleal kkkgcppddi enprgskdik knknvtnrsk gtaeklkped itqiqpqquV l2" lrlrsgepqt ftlkfkraed ypidlyylmd lsysmkddle nvkslgtdlm nemrritsdf 18" rigfgsfvek sttp aklrnpctse qnctspfsyk nkge 24" isgnldspeg gfdaimqvav cgsligwrnv trllvfstda gfhfagdgkl ggivlpndgq " chlennmytm shyydypsia senn iqtifavtee fqpvykelkn lipksavgtl 36" sanssnviql iidaynslss evilengkls egvtisyksy ckngvngtge ngrkcsnisi 42" gdevqfeisi tsnkcpkkds dsfkirplgf teevevilqy qseg ipespkcheg 48" ngtfecgacr cnegrvgrhc ecstdevnse dmdaycrken sseicsnnge cvcgqcvcrk 54" rdntneiysg kfcecdnfnc drsnglicgg ngvckcrvce cnpnytgsac dcsldtstce 60" asngqicngr gicecgvckc tdpqugqtc emcqtclgvc aehkecvqcr afnkgekkdt 66" ctqecsyfni tkvesrdklp qpvqupvsh ckekdvddcw fyftysvngn nevmvhvven 72" pecptgpdii pivagvvagi vliglallli wkllmiihdr ekek mnakwdtgen 78" vttv vnpkyegk FKB10 Official Symbol: FKBP10 Official Name: FK506 binding n 10, 65 kDa Gene ID: 60681 Organism: Homo sapiens Other Aliases: PSEC0056, FKBP65, O|11, OI6, PPIASE, 5 Other Designations: 65 kDa FK506-binding protein; 65 kDa FKBP; FK506- binding protein 10; FKBP-10; FKBP-65; PPIase FKBP10; immunophilin FKBP65; yI-prolyl cis-trans isomerase FKBP10; rotamase Nucleotide seguence: NCBI Reference Seguence: NM_021939.3 LOCUS NM_021939 ACCESSION NM_021939 193193 l cctc tggc caggccccag gtctcgcagc cagggatgga gatgggggga 6" gggggaacct agagttcttt cctc cctcagactc taacacactc agcctggccc 12" cctcctccta ttgcaacccc cgct cctcccggcc aggccagctc agtcttccca 18" attc cacgtggacc agccagggcg ggggtaggga aagaggacag gaagaggggg 24" agccagttct gggaggcggg gggaaggagg ttggtggcga ctccctcgct cgccctcact " gccggcggtc ccaactccag gcaccatgtt ccccgcgggc &gCC acagcctcct 36" ccggctcccc ctgctgcagt tgctgctact ggtggtgcag gccgtgggga gggggctggg 42" ccgcgccagc ccggccgggg gccccctgga agatgtggtc aggt accacatccc 48" cagggcctgt ccccgggaag tgcagatggg ggattttgtg cgctaccact acaacggcac 54" ttttgaagat ggcaagaagt ttgattcaag ctatgatcgc aacaccttgg tggccatcgt 60" ggtgggtgtg gggcgcctca tcactggcat ggaccgaggc ctcatgggca tgtgtgtcaa 66" cgagcggcga cgcctcattg tgcctcccca cctgggctat atcg gcctggcggg 72" gctcattcca ccggatgcca ccctctactt cgatgtggtt ctgctggatg tgtggaacaa 78" ggaagacacc gtgcaggtga gcacattgct gcgcccgccc cactgccccc gcatggtcca 84" ggacggcgac tttgtccgct accactacaa tggcaccctg ctggacggca cctccttcga 90" caccagctac agtaagggcg gcacttatga cacctacgtc ggctctggtt ggctgatcaa 96" gggcatggac caggggctgc tgggcatgtg tcctggagag agaaggaaga ttatcatccc "O2" tccattcctg ggcg gcta tgggacagtg atccccccac aggcctcgct "08" ggtctttcac gtcctcctga ttgacgtgca caacccgaag gacgctgtcc agac "l4" gctggagctc ggct gtgtccgcag agccggggcc ggggacttca tgcgctacca "20" ctacaatggc tccttgatgg acggcaccct cttcgattcc agctactccc gcaaccacac "26" ctacaatacc tatatcgggc agggttacat catccccggg atggaccagg ggctgcaggg "32" tgcctgcatg ggggaacgcc ttac catccccccg cacctcgcct atggggagaa "38" tggaactgga gacaagatcc ctggctctgc aatc ttcaacgtcc atgtcattga "44" cttccacaac cctgcggatg aaat caggacactg tcccggccat ctgagacctg "50" caatgagacc accaagcttg gggactttgt ccat tacaactgtt ctttgctgga "56" cggcacccag ctgttcacct cgcatgacta cggggccccc caggaggcga ctctcggggc "62" caacaaggtg atcgaaggcc tggacacggg cctgcagggc atgtgtgtgg gagagaggcg "68" gcagctcatc gtgcccccgc acctggccca ngggagagt ggagcccggg gagtcccagg "74" cagtgctgtg ctgctgtttg aggtggagct ccgg gaggatgggc tgcccacagg 194194 180" ctacctgttt gtgtggcaca aggaccctcc tgccaacctg tttgaagaca tggacctcaa 186" caaggatggc gaggtccctc cggaggagtt ctccaccttc atcaaggctc aagtgagtga 192" gggcaaagga cgcctcatgc ctgggcagga ccctgagaaa accataggag acatgttcca 198" gaaccaggac cgcaaccagg acggcaagat cacagtcgac gagctcaagc tgaagtcaga 204" tgaggacgag gagcgggtcc acgaggagct ggca gggagcctgg ccaggcctga 210" gacacagagg gcga gggggacagt ggcggtggga ctgacctgct gacagtcacc 216" ctccctctgc tgggatgagg tccaggagcc aaca atggcagagg agacatctct 222" ggtgttccca ccaccctaga tgaaaatcca cagcacagac ctctaccgtg tttctcttcc 228" aaac cacttcctta aaatgtttgg atttgcaaag ccaatttggg gcctgtggag 234" cctggggttg gatagggcca tggctggtcc cccaccatac ctcccctcca catcactgac 240" acagctgagc ttgttatcca tctccccaaa ctttctcttt ctttgtactt cttgtcatcc 246" ccactcccag ccccttttcc tctatgtgac agctccctag gacccctctg ccttcctccc 252" caatcctgac tggctcctag ggaaggggaa ggctcctgga gggcagccct acctctccca 258" tgccctttgc cctcctccct cgcctccagt ggaggctgag ctgaccctgg gctgctggag 264" gccagactgg gctgtagtta gcttttcatc cctaaagaag gctcctttcc ctaaggaacc 270" atagaagaga ggaagaaaac catg tgtgagggaa gctgcttggg ttag 276" ggctatgaaa tcttggattt ggggctgagg ggtgggaggg agggcagagc tctgcacact 282" caaaggctaa actggtgtca gtcctttttt ttcc aaataaaaga ttaaaccaat 288" aa Protein seguence: NCBI Reference Seguence: 758.3 LOCUS NP_068758 ION NP_068758 1 mfpagppshs llrlpllqll llvvqavgrg lgraspaggp ledvvieryh ipracprevq 61 mgdfvryhyn gtfedgkkfd ssydrntlva ivvgvgrlit gmdrglmgmc vnerrrlivp 121 phlgygsigl aglippdatl yfdvvllde nkedtqust llrpphcprm qugdfvryh 181 yngtlldgts fdtsyskggt ydtyvgsgwl ikgmdqgllg rkii ippflaygek 241 ppqa slvfhvllid vhnpkdavql etlelppgcv rragagdfmr yhyngslmdg 301 tlfdssysrn htyntyigqg dqgl qgacmgerrr layg engtgdkipg 195195 361 savlifnvhv idfhnpadvv eirtlsrpse tcnettklgd fvryhyncsl ldgtqlftsh 421 dygapqeatl gankviegld tglqgmcvge rrqlivpphl ahgesgargv pgsavllfev 481 elvsredglp tgylfvwhkd ppanlfedmd vppe efstfikaqv segkgrlmpg 541 igdm nng lklk sdedeervhe e1 FINC Official Sym bol: FN1 Official Name: fibronectin 1 Gene ID: 2335 Organism: Homo sapiens Other Aliases: CIG, ED-B, FINC, FN, FNZ, GFND, GFND2, LETS, MSF Other Designations: cold-insoluble globulin; fibronectin; migration-stimulating facto r Nucleotide seguence: NCBI Reference Seguence: NM_002026.2 LOCUS NM_002026 ACCESSION NM_002026 1 gcccgcgccg gctgtgctgc ggag gagagggaac cccaggcgcg agcgggaaga 6" ggggacctgc aact tctctggtcc tctgcatccc ttctgtccct ccacccgtcc 12" ccttccccac cctctggccc ccaccttctt ggaggcgaca acccccggga ggcattagaa 18" gggatttttc ccgcaggttg cgaagggaag caaacttggt ggcaacttgc ctcccggtgc 24" gggcgtctct cccccaccgt ctcaacatgc ttaggggtcc ggggcccggg ctgctgctgc " tggccgtcca gtgcctgggg acagcggtgc cctccacggg agcctcgaag agcaagaggc 36" aggctcagca tcag ccccagtccc ctgt cagtcaaagc aagcccggtt 42" gttatgacaa tggaaaacac tatcagataa atcaacagtg ggagcggacc tacctaggca 48" tggt ttgtacttgt tatggaggaa gccgaggttt taactgcgag agtaaacctg 54" aagctgaaga gacttgcttt gacaagtaca ctgggaacac ttaccgagtg ggtgacactt 60" gtcc taaagactcc atgatctggg actgtacctg catcggggct gggcgaggga 196196 66" gaataagctg taccatcgca aaccgctgcc atgaaggggg tcagtcctac aagattggtg 72" acacctggag gagaccacat gagactggtg gttacatgtt agagtgtgtg ggta 78" atggaaaagg agaatggacc tgcaagccca tagctgagaa gtgttttgat catgctgctg 84" ggacttccta tgtggtcgga gaaacgtggg agaagcccta ccaaggctgg atgatggtag 90" attgtacttg cctgggagaa ggcagcggac gcatcacttg cacttctaga aatagatgca 96" acgatcagga cacaaggaca tcctatagaa ttggagacac ctggagcaag aaggataatc "02" gaggaaacct gctccagtgc atctgcacag gcaacggccg aggagagtgg aagtgtgaga "08" ggcacacctc tgtgcagacc acatcgagcg gatctggccc cttcaccgat gttcgtgcag "14" ctgtttacca accgcagcct cacccccagc ctcctcccta tggccactgt gtcacagaca "20" gtggtgtggt ctactctgtg gggatgcagt ggctgaagac acaaggaaat aagcaaatgc "26" tttgcacgtg cctgggcaac ggagtcagct gccaagagac agctgtaacc cagacttacg "32" gtggcaactc aaatggagag ccatgtgtct taccattcac tggc aggacgttct "38" actcctgcac cacagaaggg gacg tttg gtgcagcaca acttcgaatt "44" agga atac tctttctgca cagaccacac tgttttggtt cagactcgag "50" attc caatggtgcc cact tccccttcct atacaacaac cacaattaca "56" ctgattgcac ttctgagggc agaagagaca acatgaagtg gtgtgggacc acacagaact "62" atgatgccga ccagaagttt gggttctgcc ccatggctgc ccacgaggaa atctgcacaa "68" ccaatgaagg ggtcatgtac cgcattggag atcagtggga taagcagcat gacatgggtc "74" tgag gtgt gttgggaatg gtcgtgggga atggacatgc attgcctact "80" cgcagcttcg agatcagtgc attgttgatg acatcactta caatgtgaac gacacattcc "86" gtca tgaagagggg cacatgctga actgtacatg cttcggtcag ggtcggggca "92" agtg tgatcccgtc gaccaatgcc aggattcaga gactgggacg ttttatcaaa "98" ttggagattc gaag tatgtgcatg gtgtcagata ccagtgctac tgctatggcc 204" gtggcattgg ggagtggcat tgccaacctt tacagaccta tccaagctca cctg 210" tcgaagtatt tgag actccgagtc agcccaactc ccaccccatc cagtggaatg 216" caccacagcc atctcacatt tccaagtaca ttctcaggtg taaa aattctgtag 222" gccgttggaa ggaagctacc ataccaggcc acttaaactc catc aaaggcctga 228" agcctggtgt ggtatacgag ggccagctca tcagcatcca gcagtacggc caccaagaag 234" tgactcgctt tgacttcacc accaccagca cacc tgtgaccagc aacaccgtga 240" caggagagac gactcccttt tctcctcttg tggccacttc tgaatctgtg accgaaatca 197197 246" gtag ggtc tcctgggtct cagcttccga caccgtgtcg ggattccggg 252" tggaatatga gctgagtgag gagggagatg agccacagta cctggatctt ccaagcacag 258" ccacttctgt gaacatccct gacctgcttc ctggccgaaa atacattgta tatc 264" agatatctga ggatggggag cagagtttga tcctgtctac ttcacaaaca acagcgcctg 270" atgcccctcc tgacccgact gtggaccaag ttgatgacac tgtt gttcgctgga 276" gcagacccca ggctcccatc acagggtaca gaatagtcta ttcgccatca gtagaaggta 282" gcagcacaga actcaacctt cctgaaactg caaactccgt cagt gacttgcaac 288" ctggtgttca gtataacatc actatctatg ctgtggaaga aaatcaagaa agtacacctg 294" ttgtcattca acaagaaacc actggcaccc cacgctcaga tacagtgccc tctcccaggg 300" acctgcagtt tgtggaagtg acagacgtga aggtcaccat catgtggaca ccgcctgaga 306" gtgcagtgac cggctaccgt gtga tccccgtcaa cctgcctggc gagcacgggc 312" agaggctgcc catcagcagg aacacctttg cagaagtcac cgggctgtcc cctggggtca 318" cctattactt caaagtcttt gcagtgagcc atgggaggga gagcaagcct ctgactgctc 324" aacagacaac ggat gctcccacta acctccagtt tgtcaatgaa tcta 330" ctgtcctggt gagatggact ccacctcggg cccagataac aggataccga ctgaccgtgg 336" gccttacccg aagaggacag cccaggcagt acaatgtggg tccctctgtc tccaagtacc 342" cactgaggaa tctgcagcct gcatctgagt tatc cctcgtggcc ataaagggca 348" accaagagag ccccaaagcc actggagtct ttaccacact gcagcctggg agctctattc 354" caccttacaa caccgaggtg actgagacca ccattgtgat cacatggacg cctgctccaa 360" gaattggttt taagctgggt gtacgaccaa gccagggagg acca cgagaagtga 366" cttcagactc aggaagcatc gttgtgtccg ctcc aggagtagaa tacgtctaca 372" ccatccaagt cctgagagat ggacaggaaa gagatgcgcc aattgtaaac aaagtggtga 378" caccattgtc tccaccaaca aacttgcatc tggaggcaaa ccctgacact ggagtgctca 384" cagtctcctg ggagaggagc accaccccag ctgg ttatagaatt accacaaccc 390" ctacaaacgg ccagcaggga aattctttgg tggt ccatgctgat cagagctcct 396" gcacttttga taacctgagt cccggcctgg agtacaatgt cagtgtttac actgtcaagg 402" atgacaagga aagtgtccct atctctgata ccatcatccc tcct cctcccactg 408" acctgcgatt caccaacatt ggtccagaca ccatgcgtgt cacctgggct ccacccccat 414" attt aaccaacttc ctggtgcgtt actcacctgt gaaaaatgag gaagatgttg 420" cagagttgtc aatttctcct aatg cagtggtctt aacaaatctc ctgcctggta 198198 126" cagaatatgt agtgagtgtc tccagtgtct acgaacaaca tgagagcaca cctcttagag Z32" gaagacagaa aacaggtctt gattccccaa ctggcattga cttttctgat attactgcca Z38" actcttttac tgtgcactgg attgctcctc gagccaccat cactggctac aggatccgcc Z44" atcatcccga gcacttcagt gggagacctc gagaagatcg ggtgccccac tctcggaatt 150" ccct caccaacctc actccaggca cagagtatgt ggtcagcatc gttgctctta 156" atggcagaga ggaaagtccc ttattgattg gccaacaatc aacagtttct ccga Z62" gggacctgga tgct gcgaccccca ccagcctact gatcagctgg gatgctcctg Z68" ctgtcacagt gagatattac aggatcactt acggagagac aggaggaaat gtcc Z74" aggagttcac tgtgcctggg agcaagtcta cagctaccat cagcggcctt aaacctggag 180" ttgattatac catcactgtg tatgctgtca ctggccgtgg agacagcccc gcaagcagca 186" agccaatttc cattaattac cgaacagaaa ttgacaaacc atcccagatg caagtgaccg Z92" atgttcagga caacagcatt agtgtcaagt ggctgccttc aagttcccct gttactggtt Z98" acagagtaac caccactccc aaaaatggac caggaccaac aaaaactaaa actgcaggtc 504" cagatcaaac agaaatgact attgaaggct tgcagcccac agtggagtat gtggttagtg 510" tctatgctca gaatccaagc ggagagagtc tggt tcagactgca gtaaccaaca 516" ttgatcgccc actg gcattcactg atgtggatgt cgattccatc aaaattgctt 522" gggaaagccc acaggggcaa gtttccaggt acagggtgac gagc cctgaggatg 528" atga gctattccct gcacctgatg gtgaagaaga cactgcagag ggcc 534" tcagaccggg ttctgagtac acagtcagtg tggttgcctt gcacgatgat atggagagcc 540" agcccctgat tggaacccag tccacagcta ttcctgcacc aactgacctg aagttcactc 546" aggtcacacc cacaagcctg agcgcccagt ggacaccacc caatgttcag ctcactggat 552" tgcg ggtgaccccc aaga ccggaccaat gaaagaaatc aaccttgctc 558" ctgacagctc atccgtggtt gtatcaggac ttatggtggc caccaaatat gaagtgagtg 564" tctatgctct taaggacact ttgacaagca ctca gggagttgtc accactctgg 570" agaatgtcag cccaccaaga agggctcgtg atgc tactgagacc accatcacca 576" ttagctggag aaccaagact gagacgatca ctggcttcca tgcc gttccagcca 582" atggccagac tccaatccag agaaccatca atgt cagaagctac accatcacag 588" gtttacaacc tgac tacaagatct acctgtacac cttgaatgac aatgctcgga 594" gctcccctgt cgac gcctccactg ccattgatgc accatccaac ctgcgtttcc 600" tggccaccac acccaattcc ttgctggtat catggcagcc gccacgtgcc aggattaccg 199199 606" gctacatcat caagtatgag aagcctgggt ccag agaagtggtc cctcggcccc 612" gccctggtgt cacagaggct actattactg gcctggaacc gggaaccgaa tatacaattt 618" atgtcattgc cctgaagaat aatcagaaga gcgagcccct gattggaagg aaaaagacag 624" acgagcttcc ccaactggta acccttccac accccaatct tcatggacca gagatcttgg 630" atgttccttc cacagttcaa aagacccctt tcgtcaccca ccctgggtat gacactggaa 636" atggtattca gcttcctggc ggtc agcaacccag tgttgggcaa caaatgatct 642" ttgaggaaca tggttttagg cggaccacac cgcccacaac ggccaccccc ataaggcata 648" gacc atacccgccg aatgtaggac aagaagctct ctctcagaca accatctcat 654" gggccccatt ccaggacact tctgagtaca tcatttcatg tcatcctgtt ggcactgatg 660" aagaaccctt acagttcagg gttcctggaa cttctaccag tgccactctg acaggcctca 666" ccagaggtgc cacctacaac atcatagtgg aggcactgaa agaccagcag aggcataagg 672" ttcgggaaga ggttgttacc gtgggcaact ctgtcaacga aggcttgaac caacctacgg 678" cgtg ctttgacccc tacacagttt cccattatgc cgttggagat gagtgggaac 684" gaatgtctga atcaggcttt aaactgttgt gccagtgctt aggctttgga agtggtcatt 690" tcagatgtga taga catg acaatggtgt gaactacaag attggagaga 696" agtgggaccg agaa aatggccaga tgatgagctg cacatgtctt gggaacggaa 702" aaggagaatt caagtgtgac cctcatgagg caacgtgtta tgatgatggg tacc 708" acgtaggaga acagtggcag aaggaatatc tcggtgccat ttgctcctgc acatgctttg 714" agcg gcgc tgtgacaact gccgcagacc tgggggtgaa cccagtcccg 720" aaggcactac gtcc tacaaccagt attctcagag ataccatcag agaacaaaca 726" ctaatgttaa ttgcccaatt gagtgcttca tgcctttaga tgtacaggct gacagagaag 732" attcccgaga gtaaatcatc tttccaatcc agaggaacaa gcatgtctct ctgccaagat 738" ccatctaaac tggagtgatg ttagcagacc agag ttcttctttc tttcttaagc 744" cctttgctct ggaggaagtt ctccagcttc agctcaactc acagcttctc caagcatcac 750" cctgggagtt tcctgagggt tttctcataa atgagggctg cacattgcct gttctgcttc 756" gaagtattca ataccgctca gtattttaaa tgaagtgatt ctaagatttg gatc 762" aaag catatgcagc caaccaagat gcaaatgttt tgaaatgata tgaccaaaat 768" tttaagtagg aaagtcaccc aaacacttct gctttcactt ctgg cccgcaatac 774" tgtaggaaca agcatgatct tgttactgtg atattttaaa tatccacagt tttt 780" tgat cctagtaatt gcctagaaat ctct tacctgttat ttatcaattt 200200 786" ttcccagtat ttttatacgg aaaaaattgt attgaaaaca cttagtatgc agttgataag 792" aggaatttgg tataattatg gtgggtgatt attttttata gtgc caaagcttta 798" tgga aagacaactg ttttaataaa agatttacat tccacaactt gaagttcatc 804" tatttgatat aagacacctt cgggggaaat aattcctgtg aatattcttt ttcaattcag 810" caaacatttg aaaatctatg atgtgcaagt gttg atttcagtac aagattttct 816" aaatcagttg ctacaaaaac gttt ttgtcacttc atctcttcac taatggagat 822" agctttacac tttctgcttt aatagattta agtggacccc aatatttatt gcta 828" gtttaccgtt tata atagaaataa tctttagttg ctcttttcta accattgtaa 834" cctt cttccctcca cctttccttc attgaataaa cctctgttca aagagattgc 8401 ggga aataaaaatg atat taaaaaaaaa aaaaaaaaa Protein seguence: NCBI Reference ce: NP_002017.1 LOCUS NP_002017 ACCESSION NP_002017 l mlrgpgpgll llavqclgta vpstgasksk rqaqqqupq spvavsqskp gcydngkhyq 6" inquertyl gnalvctcyg gsrgfncesk peaeetcfdk rvgd tyerpkdsmi l2" gagr grisctianr cheggqsyki gdtwrrphet ggymlecvcl gngkgewtck 18" piaekcfdha vget wekpngwmm vdctclgegs gritctsrnr cndthrtsy 24" rigdtwskkd nrgnllqcic tgngrgewkc erhtqutts sgsgpftdvr aaqupqphp " hcvt dsgvvysvgm qwlktqgnkq mlctclgngv scqetavtqt yggnsngepc 36" vlpftyngrt fyscttegrq dghlwcstts nyeqquysf ctdhtvlvqt rggnsngaLc 42" hfpflynnhn ytdctsegrr dnmkwcgttq nydaqufgf cpmaaheeic ttnegvmyri 48" hdm ghmmrctcvg ngrgewtcia ysqlrdqciv dditynvndt fhkrheeghm 54" lnctcfgqgr grwkcdpvdq chsetgtfy qigdswekyv hgvrchycy grgigewhcq 60" plqtypsssg pvevfitetp sqpnshpiqw napqpshisk yilrwrpkns vgrwkeatip 66" ghlnsytikg lkpgvvyegq lisiqqyghq evtrfdfttt ststpvtsnt vtgettpfsp 72" lvatsesvte itassfvvsw vsasdtvsgf rveyelseeg depqyldlps tatsvnipdl 78" lpgrkyivnv yqisedgeqs lilstsqtta pdappdptvd qudtsivvr wsrpqapitg 84" yrivyspsve gsstelnlpe tansvtlsdl iti yaveenqest pvviqqettg 90" tprsdtvpsp rdquvevtd vkvtimwtpp esavtgyrvd vipvnlpgeh gqupisrnt 201201 96" faevtglspg vtyyfkvfav kplt aqqttkldap tnquvnetd stvlvrwtpp "02" raqitgyrlt vgltrrgqpr qynvgpsvsk yplrnlqpas eytvslvaik gnqespkatg "08" vftthpgss ippyntevte wtpa prigfklgvr psqggeapre vtsdsgsivv "14" sgltpgveyv ytiqvlrdgq erdapivnkv vtplspptnl hleanpdtgv ltvswerstt "20" pditgyritt tptngqqgns leevvhadqs sctfdnlspg vytv kddkesvpis "26" dtiipavppp tdlrftnigp dtmrvtwapp psidltnflv ryspvkneed vaelsispsd "32" navvltnllp gteyvvsvss vyeqhestpl lds ptgidfsdit ansftvhwia "38" pratitgyri rhhpehfsgr predrvphsr nsitltnltp gteyvvsiva lngreesle "44" igqqstvsdv prdlevvaat ptslliswda pavtvryyri gDSp vqeftvpgsk "50" statisglkp gvdytitvya vtgrgdspas skpisinyrt eidkpsqmqv tdqunsisv "56" kwlpssspvt gyrvtttpkn gpgptktkta gpdqtemtie glqptveyvv svyaqnpsge "62" sqplvqtavt nidrpkglaf tdvdvdsiki qu ryrvtysspe dgiheprap "68" dgeedtaelq glrpgseytv svvalhddme sqpligtqst aipaptdlkf tqvtptslsa "74" qwtppnvqlt gyrvrvtpke ktgpmkeinl apdsssvvvs glmvatkyev svyalkdtLt "80" vvtt lenvspprra rvtdatetti tiswrtktet itgfquavp angqtpiqrt "86" ikpdvrsyti tglqutdyk iylytlndna rsspvvidas taidapsnlr flattpnsLl "92" vswqpprari tgyiikyekp vvpr prpgvteati tglepgteyt iyviaLknnq "98" ksepligrkk tdelpqlvtl phpnlhgpei ldvpstvqkt pfvthpgydt gngiqugts 204" gqqpsvgqqm ifeehgfrrt tppttatpir hrprpyppnv gqealsqtti swapfthse 210" yiischpvgt deepqurvp gtstsatltg ltrgatynii vealkdqqrh kvreevvtvg 216" nsvneglnqp tddscfdpyt gdew ermsesgfkl lcqclgfgsg hfrcdssrwc 222" hdngvnykig ekwdrqgeng qmmsctclgn gkgefkcdph eatcyddgkt yhvgeqque 228" sctc wrcd ncrrpggeps pegttgqsyn hrt ntnvncpiec 234" fmpldvqadr edsre CYB5 OfibblSwnbd:CYBSA Official Name: rome b5 type A (microsomal) Gene ID: 1528 Organism: Homo sapiens 202202 Other Aliases: cvss, MCB5 Other Designations: cytochrome b5; type 1 cyt-b5 Note — there are three difference isoforms Nucleotide seguence: NCBI Reference Seguence: 923.3 LOCUS 923 ACCESSION 923 l gcgccccgcc cctgagccgg agcc cccagtgggg ttcccggcgc ggggaatgtc 6" ccgggtggag ctggctgagt cgcgcgctct cccg acggggctgt gtgtgctggg 12" cctggctcgc ggcgaaccga gatggcagag cagtcggacg aggccgtgaa gtactacacc 18" gaga ttcagaagca caaccacagc aagagcacct ggctgatcct gcaccacaag 24" gtgtacgatt tgaccaaatt tctggaagag catcctggtg gggaagaagt tttaagggaa " caagctggag gtgacgctac tgagaacttt gaggatgtcg ggcactctac agatgccagg 36" gaaatgtcca aaacattcat cattggggag ctccatccag atgacagacc aaac 42" aagcctccgg aaactcttat cactactatt gattctagtt ccagttggtg gaccaactgg 48" gtgatccctg ccatctctgc agtggccgtc gccttgatgt atcgcctata catggcagag 54" gactgaacac ctcctcagaa gtcagcgcag gaagagcctg ctttggacac gggagaaaag 60" aagccattgc taactacttc aactgacaga aaccttcact tgaaaacaat gattttaata 66" tatctctttc tttttcttcc gacattagaa acaaaacaaa aagaactgtc ctttctgcgc 72" tcaaattttt cgagtgtgcc tttttattca tctactttat tttgatgttt ccttaatgtg 78" taatttactt attataagca ttta tatt tggcttttaa agtatgcaaa 84" aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_683725.1 LOCUS NP_683725 ACCESSION NP_683725 l maeqsdeavk yytleeiqkh nhskstwlil hhkvydltkf leehpggeev lreqaggdat 203203 61 enfedvghst ktfi igelhpddrp klnkppetli ttidssssww tnwvipaisa 121 vavalmyrly maed Nucleotide seguence: NCBI Reference Seguence: 914.3 LOCUS NM_001914 ACCESSION NM_001914 1 gcgccccgcc cctgagccgg ccgcccagcc cccagtgggg ttcccggcgc ggggaatgtc 6" ccgggtggag ctggctgagt cgcgcgctct gctccacccg acggggctgt gtgtgctggg 12" tcgc ggcgaaccga gatggcagag cagtcggacg aggccgtgaa gtactacacc 18" ctagaggaga ttcagaagca caaccacagc acct ggctgatcct caag 24" gtgtacgatt tgaccaaatt tctggaagag ggtg gggaagaagt tttaagggaa " caagctggag gtgacgctac tgagaacttt gaggatgtcg ggcactctac agatgccagg 36" gaaatgtcca aaacattcat cattggggag ctccatccag gacc aaagttaaac 42" aagcctccgg aaccttaaag gcggtgtttc aaggaaactc ttatcactac tattgattct 48" agttccagtt ggtggaccaa gatc cctgccatct ctgcagtggc cgtcgccttg 54" atgtatcgcc tatacatggc agaggactga acacctcctc agaagtcagc agag 60" cctgctttgg acacgggaga aaagaagcca ttgctaacta cttcaactga cagaaacctt 66" cacttgaaaa caatgatttt aatatatctc tttctttttc ttccgacatt agaaacaaaa 72" caaaaagaac tgtcctttct gcgctcaaat ttttcgagtg tgccttttta tact 78" ttattttgat gtttccttaa tgtgtaattt acttattata atct aata 84" tatttggctt ttaaagtatg caaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_001905.1 LOCUS NP_001905 ACCESSION NP_001905 1 maeqsdeavk yytleeiqkh nhskstwlil hhkvydltkf leehpggeev lreqaggdat 61 enfedvghst daremsktfi igelhpddrp klnkppep 204204 Isoforn13 tide seguence: NCBI Reference Seguence: NM_001190807.2 LOCUS NM_001190807 ACCESSION NM_001190807 1 gcgccccgcc cctgagccgg ccgcccagcc cccagtgggg ttcccggcgc tgtc 6" ccgggtggag ctggctgagt cgcgcgctct gctccacccg acggggctgt gtgtgctggg 12" cctggctcgc ggcgaaccga gatggcagag cagtcggacg aggccgtgaa gtactacacc 18" gaga ttcagaagca caaccacagc aagagcacct ggctgatcct gcaccacaag 24" gtgtacgatt tgaccaaatt tctggaagag ggtg aagt tttaagggaa " caagctggag gtgacgctac tgagaacttt gaggatgtcg ggcactctac agatgccagg 36" gaaatgtcca aaacattcat cattggggag ctccatccag aaactcttat cactactatt 42" agtt ccagttggtg gaccaactgg gtgatccctg ccatctctgc agtggccgtc 48" gccttgatgt atcgcctata catggcagag gactgaacac ctcctcagaa gtcagcgcag 54" gaagagcctg ctttggacac aaag aagccattgc taactacttc aactgacaga 60" aaccttcact tgaaaacaat gattttaata tatctctttc ttcc gacattagaa 66" acaaaacaaa aagaactgtc ctttctgcgc tcaaattttt cgagtgtgcc tttttattca 72" tctactttat gttt ccttaatgtg taatttactt attataagca tgatctttta 78" aaaatatatt tggcttttaa agtatgcaaa aaaa Protein seguence: NCBI Reference Seguence: NP_001177736.1 LOCUS NP_001177736 ACCESSION NP_001177736 1 maeqsdeavk yytleeiqkh nhskstwlil hhkvydltkf leehpggeev lreqaggdat 61 enfedvghst daremsktfi igelhpetli ttidssssww tnwvipaisa vavalmyrly 121 maed 205205 PA|1 Official Symbol: SERPINE1 Official Name: serpin peptidase inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), member 1 Gene ID: 5054 Organism: Homo sapiens Other Aliases: PAI, PAI-1, PAI1, PLANH1 Other ations: elial plasminogen activator inhibitor; plasminogen activator inhibitor 1; serine (or cysteine) proteinase tor, clade E (nexin, plasminogen activator inhibitor type 1), member 1; serpin E1 Nucleotide ce rm 1): NCBI Reference Seguence: NM_000602.4 LOCUS NM_000602 ACCESSION 602 l ggcccacaga ggagcacagc tgtgtttggc tgcagggcca agagcgctgt caagaagacc 6" cacacgcccc cctccagcag ctgaattcct gcagctcagc agccgccgcc agagcaggac 12" gaaccgccaa tcgcaaggca agaa cttcaggatg cagatgtctc cagccctcac 18" ctgcctagtc ctgggcctgg cccttgtctt tggtgaaggg tctgctgtgc accatccccc 24" atcctacgtg gcccacctgg cctcagactt ngggtgagg gtgtttcagc cgca " ggcctccaag gaccgcaacg tggttttctc accctatggg gtggcctcgg tgttggccat 36" gctccagctg acaacaggag gagaaaccca gcagcagatt caagcagcta tgggattcaa 42" gattgatgac aagggcatgg cccccgccct ccggcatctg tacaaggagc ggcc 48" atggaacaag gatgagatca gcaccacaga cgcgatcttc gtccagcggg atctgaagct 54" ggtccagggc ttcatgcccc tcag gctgttccgg agcacggtca agcaagtgga 60" cttttcagag gtggagagag ccagattcat catcaatgac tgggtgaaga cacacacaaa 66" gatc agcaacttgc ttgggaaagg agccgtggac cagctgacac ggctggtgct 72" ggtgaatgcc ctctacttca acggccagtg gaagactccc ttccccgact ccagcaccca 78" ccgccgcctc ttccacaaat cagacggcag cactgtctct gtgcccatga tggctcagac 84" caacaagttc aactatactg agttcaccac gcccgatggc cattactacg acatcctgga 90" actgccctac cacggggaca ccctcagcat gttcattgct tatg aaaaagaggt 206206 96" gcctctctct acca acattctgag tgcccagctc atcagccact ggaaaggcaa "02" catgaccagg ctgccccgcc tcctggttct gcccaagttc tccctggaga ctgaagtcga "08" cctcaggaag cccctagaga acctgggaat gaccgacatg cagt ttcaggctga "14" cttcacgagt gacc aagagcctct ccacgtcgcg ctgc agaaagtgaa "20" gatcgaggtg aacgagagtg tggc ctcctcatcc gtca tagtctcagc "26" ccgcatggcc cccgaggaga tcatcatgga cagacccttc ctctttgtgg tccggcacaa "32" ccccacagga acagtccttt tcatgggcca agtgatggaa ccctgaccct ggggaaagac "38" gccttcatct gggacaaaac tgca tcgggaaaga agaaactccg aagaaaagaa "44" tgtt aatgactctt tctgaaggaa gagaagacat ttgccttttg ttaaaagatg "50" gtaaaccaga tcca agaccttggc ctctccttgg aggaccttta actc "56" ctcc acctgagacc ctgggagaga agtttgaagc acaactccct taaggtctcc "62" aaaccagacg gtgacgcctg cgggaccatc tggggcacct gcttccaccc gtctctctgc "68" ccactcgggt ctgcagacct ggttcccact gaggcccttt gcaggatgga actacggggc "74" ttacaggagc ttttgtgtgc ctggtagaaa ctatttctgt tccagtcaca ttgccatcac "80" tcttgtactg cctgccaccg cggaggaggc tggtgacagg ccaaaggcca gtggaagaaa "86" caccctttca tctcagagtc cactgtggca ctggccaccc ctccccagta caggggtgct "92" gcaggtggca gagtgaatgt cccccatcat gtggcccaac tctcctggcc tggccatctc "98" cctccccaga aacagtgtgc atgggttatt ttggagtgta ggtgacttgt ttactcattg 204" aagcagattt ctgcttcctt ttatttttat aggaatagag gaagaaatgt cgtg 210" cccagctctt caccccccaa tctcttggtg gggaggggtg tacctaaata tttatcatat 216" cctt gagtgcttgt tagagagaaa gagaactact aaggaaaata atattattta 222" aactcgctcc tagtgtttct ttgtggtctg tgtcaccgta tctcaggaag tccagccact 228" tgactggcac acacccctcc ggacatccag cgtgacggag cccacactgc gtgg 234" gaga ccctcgcgcc ccccgcgccc ctctttttcc ccttgatgga aattgaccat 240" acaatttcat cctccttcag gggatcaaaa ggacggagtg caga gactcagatg 246" aggacagagt caat gtgttcaata gatttaggag cagaaatgca aggggctgca 252" tgacctacca ggacagaact ttccccaatt acagggtgac tcacagccgc attggtgact 258" cacttcaatg tgtcatttcc ggctgctgtg tgtgagcagt gtga ggggggggtg 264" ggtgagagag acaggcagct cggattcaac taccttagat aatatttctg aaaacctacc 270" agccagaggg tagggcacaa agatggatgt aatgcacttt gggaggccaa ggcgggagga 207207 276" ttgcttgagc ccaggagttc agcc tgggcaacat accaagaccc ccgtctcttt 282" aaaaatatat atattttaaa tatacttaaa tatatatttc taatatcttt aaatatatat 288" atatatttta aagaccaatt tatgggagaa ttgcacacag atgtgaaatg aatgtaatct 294" aatagaagcc taatcagccc accatgttct ccactgaaaa atcctctttc tttggggttt 300" ttctttcttt cttttttgat tttgcactgg acgt cagccatgta caggatccac 306" ggtg tcaaatgcta ttgaaattgt gttgaattgt atgctttttc acttttgata 312" aataaacatg taaaaatgtt tcaaaaaaat ataa ataaatacga agaatatgtc 318" aggacagtca aaaaaaaaaa aaaaaaa Protein seguence rm 1): NCBI Reference Seguence: NP_000593.1 LOCUS NP_000593 ACCESSION NP_000593 1 mqmspaltcl vlglalvfge gsavhhppsy vahlasdfgv rquqvaqas kdrnvvfspy 61 gvasvlamlq 1ttggetqqq iqaamgfkid dkgmapalrh 1yke1mgpwn kdeisttdai 121 lvq gfmphffrlf rstvkqufs everarfiin kgm isnllgkgav 181 dqltrlvlvn alyfnngkt pfpdssthrr 1fhksdgstv svpmmaqtnk fnytefttpd 241 ghyydilelp yhgdtlsmfi aapyekevpl saltnilsaq lishwkgnmt rlprllvlpk 301 fsletevdlr kplenlgmtd mfrqfqadft slsdqeplhv aqalqukie vnesgtvass 361 stavivsarm apeeiimdrp flfvvrhnpt gtvlfmgqvm ep Nucleotide seguence (isoform 2): NCBI Reference Seguence: NM_001165413.2 LOCUS NM_001165413 ACCESSION NM_001165413 1 ggcccacaga cagc tgtgtttggc tgcagggcca agagcgctgt caagaagacc 61 cacacgcccc gcag ctgaattcct gcagctcagc agccgccgcc agagcaggac 121 gaaccgccaa ggca agaa cttcaggatg cagatgtctc cagccctcac 181 ctgcctagtc ctgggcctgg cccttgtctt tggtgaaggg tctgctgtgc accatccccc 241 atcctacgtg gcct ccaaggaccg caacgtggtt ttctcaccct atggggtggc 208208 " ctcggtgttg gccatgctcc agctgacaac agaa acccagcagc agattcaagc 36" agctatggga ttcaagattg atgacaaggg catggccccc gccctccggc atctgtacaa 42" ggagctcatg gggccatgga acaaggatga gatcagcacc acagacgcga tcttcgtcca 48" gcgggatctg aagctggtcc agggcttcat gccccacttc ttcaggctgt tccggagcac 54" gcaa gtggactttt cagaggtgga gagagccaga ttcatcatca atgactgggt 60" gaagacacac ggta tgatcagcaa cttgcttggg aaaggagccg tggaccagct 66" gacacggctg gtgctggtga atgccctcta cttcaacggc cagtggaaga ctcccttccc 72" cagc acccaccgcc gcctcttcca caaatcagac ggcagcactg tctctgtgcc 78" catgatggct cagaccaaca agttcaacta tactgagttc accacgcccg atggccatta 84" ctacgacatc ctggaactgc cctaccacgg ggacaccctc agcatgttca ttgctgcccc 90" aaaa cctc tctctgccct caccaacatt ctgagtgccc agctcatcag 96" ccactggaaa ggcaacatga ccaggctgcc ccgcctcctg gttctgccca agttctccct "02" ggagactgaa gtcgacctca ggaagcccct agagaacctg accg acatgttcag "08" acagtttcag ttca cgagtctttc agaccaagag cctctccacg tcgcgcaggc "14" gctgcagaaa gtgaagatcg aggtgaacga gagtggcacg gtggcctcct catccacagc "20" agtc tcagcccgca tggcccccga ggagatcatc atggacagac ccttcctctt "26" tgtggtccgg cacaacccca caggaacagt ccttttcatg ggccaagtga tggaaccctg "32" accctgggga aagacgcctt ggac aaaactggag atgcatcggg aaagaagaaa "38" ctccgaagaa aagaatttta atga ctga aggaagagaa gacatttgcc "44" taaa agatggtaaa ccagatctgt ctccaagacc ttggcctctc cttggaggac "50" ctttaggtca aactccctag tctccacctg agaccctggg agagaagttt gaagcacaac "56" tcccttaagg tctccaaacc agacggtgac gcctgcggga ccatctgggg cacctgcttc "62" cacccgtctc tctgcccact cgggtctgca gacctggttc ccactgaggc cagg "68" atggaactac ggggcttaca ggagcttttg tgtgcctggt agaaactatt tctgttccag "74" tcacattgcc atcactcttg ctgc caccgcggag gaggctggtg acaggccaaa "80" ggccagtgga agaaacaccc tttcatctca gagtccactg tggcactggc cacccctccc "86" cagtacaggg gtgctgcagg tggcagagtg aatgtccccc tggc ccaactctcc "92" tggcctggcc atctccctcc ccagaaacag tgtgcatggg ttattttgga gtgtaggtga "98" cttgtttact cattgaagca gatttctgct tccttttatt tttataggaa tagaggaaga 204" aatgtcagat gcgtgcccag ctcttcaccc ctct tggtggggag gggtgtacct 209209 210" aaatatttat catatccttg cccttgagtg cttgttagag agaaagagaa ctactaagga 216" aaataatatt atttaaactc gctcctagtg tttctttgtg gtctgtgtca ctca 222" ggaagtccag ccacttgact ggcacacacc cctccggaca tccagcgtga cggagcccac 228" actgccacct tgtggccgcc tgagaccctc gcgccccccg cgcccctctt tttccccttg 234" atggaaattg accatacaat ttcatcctcc ggat caaaaggacg gagtgggggg 240" acagagactc agatgaggac gttt ccaatgtgtt caatagattt agaa 246" atgcaagggg ctgcatgacc taccaggaca gaactttccc caattacagg gtgactcaca 252" gccgcattgg tgactcactt caatgtgtca gctg ctgtgtgtga gcagtggaca 258" gggg gggtgggtga gagagacagg cagctcggat tcaactacct tagataatat 264" ttctgaaaac ctaccagcca gagggtaggg cacaaagatg gatgtaatgc ggag 270" gccaaggcgg gaggattgct tgagcccagg agttcaagac cagcctgggc aacataccaa 276" gacccccgtc tctttaaaaa tatatatatt ttaaatatac ttaaatatat atttctaata 282" tctttaaata tatatatata ttttaaagac atgg gagaattgca cacagatgtg 288" aaatgaatgt aatctaatag aagcctaatc agcccaccat gttctccact gaaaaatcct 294" ctttctttgg ggtttttctt tttt ttgattttgc actggacggt agcc 300" atgtacagga tccacagggg tggtgtcaaa tgctattgaa attgtgttga attgtatgct 306" ttttcacttt tgataaataa acatgtaaaa atgtttcaaa aaaataataa aataaataaa 3121 tacgaagaat atgtcaggac aaaa aaaaaaaaaa aa Protein sequence (isoform 2): NCBI Reference Seguence: NP_001158885.1 LOCUS NP_001158885 ACCESSION NP_001158885 1 mqmspaltcl vlglalvfge gsavhhppsy vaqaskdrnv vfspygvasv lamlqlttgg 61 etqqqiqaam kgma palrhlykel mgpwnkdeis ttdaiqurd lklvqgfmph 121 ffrlfrstvk era rfiindkat htkgmisnll gkgavdqltr lvlvnalyfn 181 ngktpfpds sthrrlfhks dgstvsvpmm aqtnkfnyte fttpdghyyd ilelpyhgdt 241 lsmfiaapye kevplsaltn ishw kgnmtrlprl lvlpkfslet evdlrkplen 301 lgmtdmfrqf qadftslsdq eplhvaqalq kvkievnesg tavi vsarmapeei 361 imdrpflfvv rhnptgtvlf mgqvmep 210210 MPR1 Official : IGF2R Official Name: insulin-like growth factor 2 receptor m: 3482 Organism: Homo sapiens Other Aliases: CD222, CIMPR, M6P-R, MPR1, MPRI Other Designations: 300 kDa mannose phate receptor; Cl ManP receptor; Cl-MPR; lGF-ll receptor; M6P/IGF2 receptor; M6P/IGF2R; M6PR; MPR 300; cation-independent mannose-6 phosphate receptor; cation- ndent mannosephosphate receptor; n-like growth factor II receptor Nucleotide ce: NCBI Reference Seguence: NM_000876.2 LOCUS NM_000876 ACCESSION NM_000876 l cgagcccagt cgagccgcgc tcacctcggg ctcccgctcc gtctccacct ccgcctttgc 6" ggcg cgaccccgtc ccgggcgcgg cccccagcag tcgcgcgccg ttagcctcgc 12" gcccgccgcg cagtccgggc ccggcgcgat gggggccgcc gccggccgga gcccccacct 18" ggggcccgcg cccgcccgcc gcccgcagcg ctctctgctc ctgctgcagc tgctgctgct 24" cgtcgctgcc ccggggtcca cgcaggccca ggccgccccg ttccccgagc tgtgcagtta " tacatgggaa gctgttgata ccaaaaataa tgtactttat aaaatcaaca tctgtggaag 36" tgtggatatt gtccagtgcg ggccatcaag tgctgtttgt atgcacgact tgaagacacg 42" cacttatcat tcagtgggtg actctgtttt gagaagtgca accagatctc tcctggaatt 48" caacacaaca gtgagctgtg accagcaagg cacaaatcac agagtccaga gcagcattgc 54" gtgt gggaaaaccc tgggaactcc tgta actgcaacag tgca 60" ctactttgag tggaggacca ctgcagcctg caagaaagac atatttaaag caaataagga 66" ggtgccatgc tatgtgtttg atgaagagtt gaggaagcat gatctcaatc ctctgatcaa 72" tggt gcctacttgg tggatgactc cgatccggac acttctctat tcatcaatgt 78" ttgtagagac acac tacgagaccc aggttcacag ctgcgggcct gtccccccgg 84" cactgccgcc tgcctggtaa gaggacacca ggcgtttgat gttggccagc cccgggacgg 90" actgaagctg gtgcgcaagg acaggcttgt cctgagttac gtgagggaag aggcaggaaa 211211 96" gctagacttt ggtc acagccctgc tatt acatttgttt gcccgtcgga "02" gcggagagag ggcaccattc ccaaactcac atcc aactgccgct ttga "08" gtggattact gagtatgcct gccacagaga ttacctggaa agtaaaactt gttctctgag "14" gcag caggatgtct ccatagacct actt gcccagagcg gaggttcatc "20" ctatatttca gatggaaaag aatatttgtt gaat gtctgtggag aaactgaaat "26" acagttctgt aataaaaaac aagctgcagt ttgccaagtg aaaaagagcg atacctctca "32" agtcaaagca gcaggaagat accacaatca gaccctccga tattcggatg tcac "38" cttgatatat tttggaggtg atgaatgcag ctcagggttt cagcggatga gcgtcataaa "44" ctttgagtgc aataaaaccg caggtaacga tgggaaagga actcctgtat ggga "50" ggttgactgc acctacttct tcacatggga atac gcctgtgtta aggagaagga "56" agacctcctc tgcggtgcca ccgacgggaa gaagcgctat gacctgtccg cgctggtccg "62" ccatgcagaa ccagagcaga aagc tgtggatggc agtcagacgg aaacagagaa "68" gaagcatttt ttcattaata tttgtcacag agtgctgcag gaaggcaagg cacgagggtg "74" tcccgaggac gcggcagtgt gtgcagtgga taaaaatgga agtaaaaatc tgggaaaatt "80" tatttcctct cccatgaaag agaaaggaaa cattcaactc tcttattcag atggtgatga "86" tcat ggcaagaaaa ttaaaactaa tatcacactt gtatgcaagc caggtgatct "92" ggaaagtgca ccagtgttga gaacttctgg ggaaggcggt tgcttttatg agtttgagtg "98" gcacacagct gcggcctgtg tgctgtctaa gacagaaggg gagaactgca cggtctttga 204" ctcccaggca gggttttctt ttgacttatc acctctcaca aagaaaaatg gtgcctataa 210" agttgagaca aagaagtatg acttttatat aaatgtgtgt ggcccggtgt ctgtgagccc 216" ctgtcagcca ggag cctgccaggt ggcaaaaagt gatgagaaga cttggaactt 222" gggtctgagt aatgcgaagc tttcatatta tgatgggatg atccaactga actacagagg 228" cggcacaccc tataacaatg aaagacacac agct acgctcatca tctg 234" tgatcgagac gcgggagtgg gcttccctga atatcaggaa gaggataact acaa 240" cttccggtgg tacaccagct atgcctgccc ggaggagccc ctggaatgcg tagtgaccga 246" cccctccacg ctggagcagt acgacctctc cagtctggca aaatctgaag gtggccttgg 252" aggaaactgg atgg acaactcagg ggaacatgtc acgtggagga aatactacat 258" taacgtgtgt cggcctctga atccagtgcc caac cgatatgcat cggcttgcca 264" gatgaagtat gaaaaagatc agggctcctt cactgaagtg gtttccatca gtaacttggg 270" aatggcaaag accggcccgg tggttgagga cagcggcagc ctccttctgg aatacgtgaa 212212 276" tgggtcggcc tgcaccacca gcag acagaccaca tataccacga ggatccatct 282" cgtctgctcc aggggcaggc tgaacagcca cttt tctctcaact gggagtgtgt 288" ggtcagtttc ctgtggaaca cagaggctgc ctgtcccatt cagacaacga cggatacaga 294" ccaggcttgc tctataaggg atcccaacag tggatttgtg tttaatctta atccgctaaa 300" cagttcgcaa ggatataacg tctctggcat tgggaagatt tttatgttta atgtctgcgg 306" cacaatgcct gtctgtggga ccatcctggg aaaacctgct tctggctgtg aggcagaaac 312" ccaaactgaa gagctcaaga attggaagcc gcca gtcggaattg agaaaagcct 318" ccagctgtcc acagagggct tcatcactct gacctacaaa gggcctctct ctgccaaagg 324" taccgctgat gcttttatcg tccgctttgt ttgcaatgat gatgtttact cagggcccct 330" caaattcctg catcaagata tcgactctgg gcaagggatc cgaaacactt actttgagtt 336" tgaaaccgcg ttggcctgtg ttccttctcc agtggactgc caagtcaccg acctggctgg 342" gtac gacctgactg gcac agtcaggaaa acgg ctgttgacac 348" cgat gggagaaaga ggactttcta tttgagcgtt tgcaatcctc tcccttacat 354" tcctggatgc cagggcagcg cagtggggtc ttgcttagtg ggca atagctggaa 360" tctgggtgtg atga gtccccaagc cgcggcgaat ggatctttga gcatcatgta 366" tgtcaacggt gacaagtgtg ggaaccagcg cttctccacc acgt ttgagtgtgc 372" tcagatatcg ccag catttcagct tcaggatggt tgtgagtacg tgtttatctg 378" gagaactgtg gaagcctgtc tcag agtggaaggg gacaactgtg aaga 384" cccaaggcat ggcaacttgt atgacctgaa gcccctgggc ctcaacgaca ccatcgtgag 390" cgctggcgaa tacacttatt acttccgggt gaag ctttcctcag acgtctgccc 396" cacaagtgac aagtccaagg tggtctcctc atgtcaggaa aagcgggaac cgcagggatt 102" tcacaaagtg gcaggtctcc tgactcagaa gctaacttat gaaaatggct tgttaaaaat 108" gaacttcacg gaca cttgccataa ggtttatcag cgctccacag ccatcttctt 114" tgac cgcggcaccc agcggccagt atttctaaag tcag attgttccta 120" cttgtttgag tggcgaacgc agtatgcctg cccacctttc gatctgactg aatgttcatt 126" tggg gctggcaact ccttcgacct ctcgtccctg tcaaggtaca gtgacaactg Z32" ggaagccatc actgggacgg gggacccgga gcactacctc atcaatgtct gcaagtctct Z38" ggccccgcag gctggcactg agccgtgccc tccagaagca gccgcgtgtc tgctgggtgg Z44" ctccaagccc gtgaacctcg gcagggtaag ggacggacct cagtggagag atggcataat 150" tgtcctgaaa tacgttgatg gcgacttatg tccagatggg attcggaaaa agtcaaccac 213213 156" catccgattc acctgcagcg agagccaagt gaactccagg cccatgttca tcagcgccgt Z62" ggaggactgt gagtacacct ttgcctggcc cacagccaca gcctgtccca tgaagagcaa Z68" cgagcatgat gactgccagg tcaccaaccc aagcacagga cacctgtttg gctc Z74" cttaagtggc ggat tcacagctgc ttacagcgag aaggggttgg tttacatgag 180" catctgtggg gagaatgaaa actgccctcc tggcgtgggg gcctgctttg gacagaccag 186" gattagcgtg ggcaaggcca acaagaggct gagatacgtg gaccaggtcc tgcagctggt Z92" gtacaaggat gggtcccctt gtccctccaa atccggcctg agctataaga gtgtgatcag Z98" tttcgtgtgc aggcctgagg ccgggccaac caataggccc atgctcatct ccctggacaa 504" gcagacatgc actctcttct tctcctggca cacgccgctg gcctgcgagc aagcgaccga 510" atgttccgtg ggaa gctctattgt tgacttgtct attc atcgcactgg 516" tggttatgag gcttatgatg agga tgatgcctcc gataccaacc ctgatttcta 522" catcaatatt tgtcagccac taaatcccat gcacggagtg ccctgtcctg ccggagccgc 528" tgtgtgcaaa attg atggtccccc catagatatc gtag caggaccacc 534" caat gcaa atgagattta cttgaatttt agta ctccttgctt 540" agcggacaag aact acacctcgct catcgcgttt cactgtaaga gaggtgtgag 546" aacg cctaagctgt taaggaccag cgagtgcgac tttgtgttcg aatgggagac 552" tcctgtcgtc tgtcctgatg aagtgaggat ggatggctgt accctgacag atgagcagct 558" cctctacagc ttcaacttgt ccagcctttc cacgagcacc tttaaggtga ctcgcgactc 564" gcgcacctac agcgttgggg tgtgcacctt tgcagtcggg ccagaacaag gaggctgtaa 570" ggacggagga gtctgtctgc tctcaggcac caagggggca tcctttggac ggctgcaatc 576" aatgaaactg aggc accaggatga agcggtcgtt ttaagttacg tgaatggtga 582" tcgttgccct accg atgacggcgt cccctgtgtc ttccccttca tattcaatgg 588" gaagagctac gaggagtgca tcatagagag cagggcgaag ctgtggtgta gcacaactgc 594" ggactacgac agagaccacg agtggggctt ctgcagacac tcaaacagct accggacatc 600" cata tttaagtgtg atgaagatga ggacattggg aggccacaag tcttcagtga 606" agtgcgtggg tgtgatgtga catttgagtg gaaaacaaaa gttgtctgcc ctccaaagaa 612" gttggagtgc aaattcgtcc acaa aacctacgac ctgc tctcctctct 618" caccgggtcc tggtccctgg tccacaacgg agtctcgtac tatataaatc tgtgccagaa 624" aatatataaa gggcccctgg gctgctctga aagggccagc atttgcagaa ggaccacaac 630" tggtgacgtc caggtcctgg gactcgttca cacgcagaag ctgggtgtca taggtgacaa 214214 636" agttgttgtc acgtactcca aaggttatcc gtgtggtgga aataagaccg catcctccgt 642" gatagaattg acctgtacaa agacggtggg cagacctgca ttcaagaggt ttgatatcga 648" cagctgcact tactacttca gctgggactc ccgggctgcc tgcgccgtga agcctcagga 654" ggtgcagatg gtgaatggga ccatcaccaa ccctataaat ggcaagagct tcagcctcgg 660" agatatttat tttaagctgt tcagagcctc tggggacatg aggaccaatg gggacaacta 666" cctgtatgag atccaacttt cctccatcac aagctccaga aacccggcgt gctctggagc 672" caacatatgc caggtgaagc ccaacgatca gcacttcagt cggaaagttg gaacctctga 678" caagaccaag tactaccttc aagacggcga tctcgatgtc gtgtttgcct cttcctctaa 684" gtgcggaaag gataagacca agtctgtttc ttccaccatc ttcttccact gtgaccctct 690" ggtggaggac gggatccccg agttcagtca cgagactgcc gactgccagt acctcttctc 696" ttggtacacc tcagccgtgt gtcctctggg ggtgggcttt gacagcgaga atcccgggga 702" cgacgggcag atgcacaagg ggctgtcaga acggagccag gcagtcggcg cggtgctcag 708" cctgctgctg gtggcgctca cctgctgcct gctggccctg ttgctctaca agaaggagag 714" gagggaaaca gtgataagta agctgaccac ttgctgtagg agaagttcca acgtgtccta 720" ctca aaggtgaata aggaagaaga gacagatgag acag agtggctgat 726" ggaagagatc cagctgcctc ggca gggaaaggaa gggcaggaga acggccatat 732" taccaccaag tcagtgaaag gctc cctgcatggg gatgaccagg agga 738" tgaggttctg accatcccag aggtgaaagt tcactcgggc aggggagctg gggcagagag 744" ctcccaccca gtgagaaacg cacagagcaa tgcccttcag gagcgtgagg acgatagggt 750" ggggctggtc aggggtgaga aggcgaggaa agggaagtcc agctctgcac agcagaagac 756" agtgagctcc accaagctgg tgtccttcca tgacgacagc gacgaggacc tcttacacat 762" ccgc agtgcctgca ggggagcacg cggg acagccaagc acctccaacc 768" aaataagact tccactcgat gatgcttcta tgcc agaa actttcaaaa 774" gggaagagtt tttgtgatgg gggagagggt gaaggaggtc cact tgat 780" tgtttacagt cattggaata aggcatggct cagatcggcc acagggcggt accttgtgcc 786" cagggttttg ccccaagtcc tcatttaaaa gcataaggcc ggacgcatct agag 792" ggctgcattc gaagaaaccc ttgctgcttt agtcccgata ttga ccccgatata 798" ttttagcatt ttaattctct ccccctattt attgactttg acaattactc aggtttgaga 804" aaaa aaaaacagcc accgtttctt agca ggggtgtgat gtaccagttt 810" gtccatcttg agatggtgag agtg tatggggcag gcgg gatgttgaac 215215 816" tggtcattaa tgtgtcccct gagttggagc tcattctgtc tcttttctct tttgctttct 822" gtttcttaag ggcacacaca Cgtgcgtgcg agcacacaca cacatacgtg cacagggtcc 828" gcct aggttttgga gcct gttctatgcc tttagtcagg aatggctgca 834" cctttttgca tgatatcttc aagcctgggc gtacagagca tcag tatttttgcc 840" ggctggtgaa ttcaaacaac ctgcccaaag attgatttgt gtgtttgtgt gtgtgtgtgt 846" gtgtgtgtgt gtgtgtgtga gtggagttga ggtgtcagag aatt ttttccagat 852" ttggggtata ggtctcatct cttcaggttc tcatgatacc acctttactg tgcttatttt 858" tttaagaaaa aagtgttgat caaccattcg aaga agccttaatt tgcacagtgt 864" gtgacttaca gaaactgcat gaaaaatcat gggccagagc ccta gcattgcact 870" tggcctcatg ctggagggag gctgggcggg tacagcgcgg aggaggaggg ggcg 876" ggcatggcgt ggaggaggag ggaggccggg cggtcacagc atggaggagg agggaggcgc 882" tgctggtgtt cttattctgg cgcc tttcctgcca tgtttagtga atgacttttc 888" tcgcattgta gaattgtata tagactctgg tgttctattg ctgagaagca aaccgccctg 894" cagcatccct cagcctgtac cggtttggct ggcttgtttg atttcaacat attt 900" tttaaaattg atttttctct tcattttttt ttcaatcaac tttactgtaa tataaagtat 9061 tcaacaattt caataaaaga taaattatta aaa n seguence: NCBI Reference Seguence: NP_000867.2 LOCUS 867 ACCESSION NP_000867 l mgaaagrsph lgpaparrpq rsllllqlll lvaapgstqa qaapfpelcs ytweavdtkn 6" nvlykinicg svdivchps savcmhdlkt rtyhsvgdsv lrsatrslle fnttvscdqq 12" gtnhrvqssi aflcgktlgt pefvtatecv hyfewrttaa ckkdifkank evpcyvfdee 18" lrkhdlnpli klsgaylvdd sdpdtslfin vcrdidtlrd anp gtaaclvrgh 24" qafdvgqprd glklvrkdrl vlsyvreeag kldfcdghsp avtitfvcps erregtipkl " taksncryei achr dylesktcsl sgeqqdvsid ltplaqsggs syisdgkeyl 36" fylnvcgete iqfcnkkqaa chvkksdts ryhn qtlrysdgdl tliyfggdec 42" ssgfqrmsvi nfecnktagn dgkgtpvftg evdctyfftw vkek edllcgatdg 48" kkrydlsalv rhaepeqnwe avdgsqtete kkhffinich rvlqegkarg cpedaavcav 54" dkngsknlgk fisspmkekg niqlsysdgd dcghgkkikt nitlvckpgd lesapvlrts 216216 60" geggcfyefe whtaaacvls ktegenctvf dsqagfsfdl spltkkngay kvetkkydfy 66" invcgpvsvs acq vaksdektwn lglsnaklsy ydgmiqlnyr ggtpynnerh 72" tpratlitfl cdrdagvgfp eyqeednsty nfrwytsyac cvvt dpstleqydl 78" eggl ggnwyamdns rkyy invcrplnpv pgcnryasac quyekdqgs 84" ftevvsisnl pvve dsgsllleyv ngsacttsdg rqttyttrih rlns 90" hpifslnwec vvsflwntea acpiqtttdt dqacsirdpn sgfvfnlnpl nssqunvsg 96" igkifmfnvc gtmpvcgtil gkpasgceae knwk parpvgieks lqlstegfit "02" ltykgplsak gtadafivrf vcnddvysgp lkflhqdids gqgirntyfe fetalacvps "08" pvdcqvtdla gneydltgls tvrkpwtavd tsvdgrkrtf ylsvcnplpy ipgcqgsavg "14" sclvsegnsw nlgvvqmqu aaangslsim yvngdkcgnq rfstritfec aqisgspazq "20" lngceyvfi wrtveacpvv rvegdncevk dprhgnlydl kplglndtiv sageytyyzr "26" vcgklssdvc ptsdkskvvs scqekrepqg fhkvaglltq kltyengllk dtch "32" kqurstaif fycdrgtqrp sdcs ylfewrtqya cppfdltecs fkdgagnsfd "38" lsslsrysdn weaitgtgdp ehylinvcks tepc ppeaaacllg gskpvnlgrv "44" rdgpqwrdgi ivlkyvdgdl cpdgirkkst tirftcsesq vnsrpmfisa vedceytfaw "50" pmks nehddcqvtn pstghlfdls gfta aysekglvym sicgenencp "56" pgvgacfgqt risvgkankr lryvdqvlql vykdgspcps ksglsyksvi sfvcrpeagp "62" tnrpmlisld kqtctlffsw htplaceqat ecsvrngssi vdlsplihrt ggyeaydese "68" ddasdtnpdf yinicqplnp mhgvpcpaga avckvpidgp pidigrvagp pilnpianei "74" ylnfesstpc ladkhfnyts liafhckrgv smgtpkllrt fewe devr "80" mdgctltdeq llysfnlssl ststfkvtrd srtysvgvct favgpqugc kdggvcllsg "86" grlq smkldyrhqd eavvlsyvng drcppetddg vpcvfpfifn gksyeeciie "92" sraklwcstt adydrdhewg fcrhsnsyrt ssiifkcded edigrpqvfs evrgcdvtze "98" wktkvvcppk kleckqukh ktydlrllss ltgswslvhn gvsyyinlcq kiykgplgcs 204" erasicrrtt tgdqulglv htqklgvigd kvvvtyskgy pcggnktass vieltctktv 210" grpafkrfdi dsctyyfswd sraacavkpq evqmvngtit npingksfsl lfra 216" sgdmrtngdn ylyeiqlssi tssrnpacsg anicqvkpnd thsrkvgts dktkyqudg 222" dldvvfasss kcgkdktksv sstiffhcdp lvedgipefs hetadcqylf swytsavcpl 228" gvgfdsenpg ddgthkgls ersqavgavl ltcc ykke rretviskLt 234" tccrrssnvs ykyskvnkee etdenetewl meeiqlpppr qgkegqengh ittksvkaLs 217217 2401 slhgddque devltipevk vhsgrgagae sshpvrnaqs nalqereddr vglvrgekar 2461 kgksssaqqk tvsstklvsf hddsdedllh i 1 A69 Official Sym bol: HLA-A Official Name: major histocompatibility complex, class I, A Gene ID: 3105 Organism: Homo sapiens Other Aliases: DAQB-90011.16-002, HLAA Other Designations: HLA class I histocompatibility n, A-1 alpha chain; MHC class I n HLA-A heavy chain; antigen ting molecule; leukocyte antigen class l-A Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_002116.7 LOCUS NM_002116 ACCESSION NM_002116 l gagaagccaa tcagtgtcgt cgct gttctaaagc gcac ccaccgggac 6" tcagattctc cccagacgcc gaggatggcc gtcatggcgc cccgaaccct cctcctgcta l2" gggg ccctggccct gacccagacc tgggcgggct cccactccat gaggtatttc 18" ttcacatccg tgtcccggcc cggccgcggg gagccccgct tcatcgccgt gggctacgtg 24" gacgacacgc agttcgtgcg cagc gacgccgcga gccagaggat ggagccgcgg " gcgccgtgga tagagcagga ggggccggag tattgggacc aggagacacg gaatgtgaag 36" gcccagtcac agactgaccg agtggacctg gggaccctgc gcggctacta caaccagagc 42" ggtt ctcacaccat ccagataatg tatggctgcg acgtggggtc ggacgggcgc 48" ttcctccgcg ggtaccggca ggacgcctac gacggcaagg tcgc cctgaacgag 54" gacctgcgct cttggaccgc ggcggacatg gcggctcaga tcaccaagcg caagtgggag 60" gcggcccatg aggcggagca agcc tacctggatg gcacgtgcgt ggagtggctc 66" cgcagatacc acgg gaaggagacg ctgcagcgca cggacccccc caagacacat 72" atgacccacc accccatctc tgaccatgag gccaccctga ggtgctgggc cctgggcttc 78" taccctgcgg agatcacact gacctggcag nggatgggg aggaccagac ccaggacacg 218218 84" gagctcgtgg agaccaggcc tgcaggggat ggaaccttcc agaagtgggc ggctgtggtg 90" gtgccttctg gagaggagca gagatacacc tgccatgtgc agcatgaggg tctgcccaag 96" accc tgagatggga gctgtcttcc cagcccacca tccccatcgt gggcatcatt "02" gctggcctgg ttctccttgg agctgtgatc actggagctg ctgc cgtgatgtgg "08" aggaggaaga gctcagatag aggg actc aggctgcaag cagtgacagt "14" gcccagggct ctgatgtgtc cctcacagct tgtaaagtgt gagacagctg ccttgtgtgg "20" gagg caagagttgt tcctgccctt ccctttgtga cttgaagaac cctgactttg "26" tttctgcaaa ggcacctgca tgtgtctgtg ttcgtgtagg cataatgtga ggaggtgggg "32" agaccacccc acccccatgt ccaccatgac cctcttccca cgctgacctg tgctccctcc "38" ccaatcatct ttcctgttcc agagaggtgg ggctgaggtg tctccatctc tgtctcaact "44" tgca ctgagctgta acttcttcct tccctattaa aacc ttagtataaa "50" tttactttct caaattcttg ccatgagagg ttgatgagtt aattaaagga gaagattcct "56" aaaatttgag aata aatggaagac atgagaacct tccagagtcc aaaaaaaaaa "62" aaaaaaaaaa aaaaaa n seguence (variant 1): NCBI Reference Seguence: NP_002107.3 LOCUS NP_002107 ACCESSION NP_002107 1 mavmaprtll lllsgalalt thagshsmr yfftsvsrpg rgeprfiavg yvddtqfvrf 61 dsdaasqrme prapwieqeg peywdqetrn vkaqsqtdrv dlgtlrgyyn htiq 121 imygcdvgsd grflrgyrqd aydgkdyial nedlrswtaa dmaaqitkrk weaaheaeql 181 rayldgtcve wlrrylengk ethrtdppk thmthhpisd heatlrcwal itlt 241 wqrdgedqtq dtelvetrpa gdgtquwaa vvvpsgeeqr ytcthhegl pkpltlrwel 301 ssqptipivg llga vitgavvaav sdrk ggsytqaass dsaqgsdvsl 361 tackv Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001242758.1 LOCUS NM_001242758 ACCESSION NM_001242758 219219 l gagaagccaa tcagtgtcgt cgcggtcgct gttctaaagt ccgcacgcac ccaccgggac 6" tctc cccagacgcc gaggatggcc gtcatggcgc cccgaaccct cctcctgcta 12" gggg ccct gacccagacc tgggcgggct cccactccat gaggtatttc 18" ttcacatccg tgtcccggcc cggccgcggg gagccccgct ccgt gggctacgtg 24" gacgacacgc agttcgtgcg gttcgacagc gcga gccagaagat gcgg " gcgccgtgga tagagcagga ggggccggag tattgggacc aggagacacg gaatatgaag 36" gcccactcac agactgaccg agcgaacctg gggaccctgc acta caaccagagc 42" ggtt ctcacaccat ccagataatg tatggctgcg acgtggggcc ggacgggcgc 48" ttcctccgcg ggca ggacgcctac gacggcaagg attacatcgc cctgaacgag 54" gacctgcgct cttggaccgc ggcggacatg gcagctcaga tcaccaagcg caagtgggag 60" gcggtccatg cggcggagca gcggagagtc tacctggagg gccggtgcgt ggacgggctc 66" cgcagatacc tggagaacgg gaaggagacg cgca cggacccccc caagacacat 72" atgacccacc accccatctc tgaccatgag gccaccctga gggc cctgggcttc 78" taccctgcgg agatcacact gcag nggatgggg aggaccagac ccaggacacg 84" gagctcgtgg agaccaggcc tgcaggggat ggaaccttcc agaagtgggc ggctgtggtg 90" gtgccttctg gagaggagca gagatacacc tgccatgtgc agcatgaggg tctgcccaag 96" cccctcaccc tgagatggga gctgtcttcc cagcccacca tccccatcgt gggcatcatt "02" gctggcctgg ttctccttgg agctgtgatc actggagctg tggtcgctgc gtgg "08" aggaggaaga gctcagatag aaaaggaggg agttacactc aggctgcaag cagtgacagt "14" gcccagggct ctgatgtgtc tctcacagct tgtaaagtgt gagacagctg ccttgtgtgg "20" gactgagagg caagagttgt tcctgccctt ccctttgtga cttgaagaac cctgactttg "26" tttctgcaaa ggcacctgca tgtgtctgtg ttcgtgtagg cataatgtga ggaggtgggg "32" agagcacccc acccccatgt ccaccatgac cctcttccca cgctgacctg tgctccctct "38" ccaatcatct ttcctgttcc agagaggtgg ggctgaggtg tctccatctc tgtctcaact "44" tcatggtgca ctgagctgta tcct tccctattaa aattagaacc tgagtataaa "50" tttactttct caaattcttg gagg ttgatgagtt aattaaagga gaagattcct 1561 aaaatttgag agacaaaatt aatggaacgc atgagaacct tccagagtcc a Protein sequence nt 2): NCBI Reference Seguence: NP_001229687.1 LOCUS NP_001229687 220220 ACCESSION NP_001229687 1 mavmaprtll lllsgalalt thagshsmr yfftsvsrpg rgeprfiavg yvddtqfvrf 61 dsdaasqkme prapwieqeg peywdqetrn mkahsqtdra nlgtlrgyyn qsedgshtiq 121 imygcdvgpd grflrgyrqd aydgkdyial nedlrswtaa tkrk weavhaaeqr 181 rvylegrcvd glrrylengk ethrtdppk thmthhpisd heatlrcwal gfypaeitlt 241 wqrdgedqtq dtelvetrpa gdgtquwaa vvvpsgeeqr ytcthhegl rwel 301 ssqptipivg iiaglvllga vaav mwrrkssdrk ggsytqaass dvsl 361 tackv Official Symbol: P4HA2 Official Name: prolyl 4-hydroxylase, alpha ptide II M: 8974 Organism: Homo sapiens Other Aliases: UNQ290/PRO330 Other Designations: 4-PH alpha 2; 4-PH alpha-2; lpha(ll); collagen prolyl 4-hydroxylase alpha(ll); procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide ll; procollagen-proline,2-oxoglutarate- 4-dioxygenase subunit alpha-2; prolyl 4-hydroxylase subunit alpha-2 Nucleotide seguence (variant 1): NCBI nce Seguence: NM_004199.2 LOCUS NM_004199 ACCESSION NM_004199 1 agcgttgttt ttccttggca gctgcggaga cccgtgataa ttcgttaact aattcaacaa 6" acgggaccct tctgtgtgcc agaaaccgca tgct aacccagtgg gacaggcgga 12" ttggaagagc gggaaggtcc tggcccagag cagtgtggtg agcgctgtgc tggaagggaa 18" tgcgggcagt gggtacttgg tagagcactg tccg gccagaggac ttcccggagg 24" aggtgaccca tgagctggag gagg aaggctggca aaagggcatc gtggacagag " gaacagccta tgtgagtggg agcagagacc ttggccaatg ctta tggccttgta 36" gtggaagcaa ggtgatgggg aaggaacact gtaggggata gctgtccacg gacgctgtct 42" acaagaccct ggagtgagat cctg gtactgtgcc ctgcatgtgt aagatgccca 221221 48" gttgaccttc gcagcaggag cctggatcag ggcacttcct gcctcaggta ttgctggaca 54" gcccagacac ttccctctgt gaccatgaaa ctctgggtgt ctgcattgct gatggcctgg 60" tttggtgtcc tgagctgtgt gcaggccgaa ttcttcacct ctattgggca catgactgac 66" ctgatttatg cagagaaaga gctggtgcag aaag agtacatcct tgtggaggaa 72" gccaagcttt ccaagattaa gagctgggcc aacaaaatgg aagccttgac tagcaagtca 78" gctgctgatg ctgagggcta cctggctcac cctgtgaatg cctacaaact ggtgaagcgg 84" ctaaacacag actggcctgc gctggaggac cttgtcctgc aggactcagc tttt 90" atcgccaacc tctctgtgca gcggcagttc ttccccactg acga gataggagct 96" gccaaagccc tgatgagact tcaggacaca tacaggctgg acccaggcac caga "02" ggggaacttc ccaa gtaccaggca atgctgagtg tggatgactg gatg "08" ggccgctcgg cctacaatga aggggactat tatcatacgg tgttgtggat ggagcaggtg "l4" ctaaagcagc ttgatgccgg ggaggaggcc accacaacca agtcacaggt gctggactac "20" ctcagctatg ctgtcttcca gttgggtgat ctgcaccgtg ccctggagct cacccgccgc "26" ctgctctccc ttgacccaag ccacgaacga gctggaggga atctgcggta ctttgagcag "32" ttattggagg gaga aaaaacgtta caga cagaagctga gctagcaacc "38" ccagaaggca tctatgagag gcctgtggac tacctgcctg agagggatgt ttacgagagc "44" ctctgtcgtg gggagggtgt caaactgaca ccccgtagac agaagaggct tttctgtagg "50" taccaccatg gcaacagggc cccacagctg ctcattgccc ccttcaaaga ggaggacgag "56" tgggacagcc cgcacatcgt caggtactac gatgtcatgt ctgatgagga aatcgagagg "62" atcaaggaga tcgcaaaacc taaacttgca cgagccaccg ttcgtgatcc caagacagga "68" gtcctcactg tcgccagcta ccgggtttcc aaaagctcct ggctagagga tgac "74" gtgg cccgagtaaa tcgtcggatg cagcatatca cagggttaac agtaaagact "80" gcagaattgt tacaggttgc aaattatgga gtgggaggac agtatgaacc cgac "86" ttctctagga atgatgagcg agatactttc ttag ggacggggaa tcgtgtggct "92" actttcttaa actacatgag agaa gctggtggtg ccaccgtctt ccctgatctg "98" ggggctgcaa tttggcctaa gaagggtaca gctgtgttct ggtacaacct gagc 204" ggggaaggtg actaccgaac aagacatgct gcctgccctg tgcttgtggg ctgcaagtgg 210" gtctccaata agtggttcca tgaacgagga caggagttct tgagaccttg aaca 216" gaagttgact gacatccttt tctgtccttc cccttcctgg agcc catgtcaacg 222" tgacagacac ctttgtatgt tcctttgtat gttcctatca ggctgatttt tggagaaatg 222222 2281 aatgtttgtc tggagcagag ggagaccata ctagggcgac tcctgtgtga ctgaagtccc 2341 agcccttcca ttcagcctgt gccatccctg gccccaaggc taggatcaaa gtggctgcag 2401 tagc tgtctagcgc ctagcaaggt gcctttgtac ctcaggtgtt ttaggtgtga 2461 gatgtttcag tgaaccaaag tacc ttgtttacat gtttgttttt atggcatttc 2521 tatctattgt ggctttacca aaaaataaaa tgtccctacc agaagcctta aaaaaaaaaa 2581 aaaaaaaa n seguence (variant 1): NCBI Reference Seguence: NP_004190.1 LOCUS 190 ACCESSION NP_004190 1 mklwvsallm awfgvlscvq aefftsighm tdliyaekel vqslkeyilv eeaklskiks 6" wankmealts ksaadaegyl yklv krlntdwpal edlvlquaa gfianlqur 12" qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf gmgrsayneg 18" dyyhtvlwme qvlkqldage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 24" eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy egvk " ltprqurlf cryhhgnrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 36" laratvrdpk tgvltvasyr vsksswleed ddpvvarvnr rmqhitgltv ktaellqvan 42" ygvggqyeph fdfsrnderd tfkhlgtgnr vatflnymsd veaggatvfp dlgaaiwpkk 48" gtavfwynll yrtr lvgc kwvsnkwfhe rgqeflrpcg stevd Nucleotide ce (variant 2): NCBI Reference Seguence: 017973.1 LOCUS NM_001017973 ACCESSION NM_001017973 1 agcgttgttt ttccttggca gctgcggaga cccgtgataa ttcgttaact aattcaacaa 61 acgggaccct tctgtgtgcc agaaaccgca agcagttgct aacccagtgg gacaggcgga 121 ttggaagagc gggaaggtcc tggcccagag cagtgtggtg agcgctgtgc tggaagggaa 181 tgcgggcagt gggtacttgg tagagcactg actgcctccg ggac ttcccggagg 241 aggtgaccca tgagctggag tggtcagagg aaggctggca aaagggcatc gtggacagag 301 gaacagccta tgtgagtggg gacc ttggccaatg ccattcctta tggccttgta 223223 36" gtggaagcaa ggtgatgggg aaggaacact gtaggggata cacg gacgctgtct 42" acaagaccct agat aacgtgcctg gtactgtgcc ctgcatgtgt aagatgccca 48" gttgaccttc gcagcaggag cctggatcag ggcacttcct gcctcaggta ttgctggaca 54" gcccagacac ttccctctgt gaccatgaaa ctctgggtgt tgct gatggcctgg 60" tttggtgtcc tgagctgtgt gcaggccgaa ttcttcacct ctattgggca catgactgac 66" ctgatttatg cagagaaaga gctggtgcag tctctgaaag agtacatcct tgtggaggaa 72" gccaagcttt ccaagattaa gagctgggcc aacaaaatgg aagccttgac tagcaagtca 78" gctgctgatg ctgagggcta cctggctcac cctgtgaatg cctacaaact ggtgaagcgg 84" ctaaacacag actggcctgc gctggaggac cttgtcctgc cagc tgcaggtttt 90" atcgccaacc tctctgtgca gcggcagttc ttccccactg atgaggacga gataggagct 96" gccc tgatgagact tcaggacaca tacaggctgg acccaggcac aatttccaga "O2" cttc caggaaccaa gtaccaggca atgctgagtg tggatgactg gatg "08" ggccgctcgg cctacaatga aggggactat tatcatacgg tgttgtggat ggagcaggtg "l4" ctaaagcagc ttgatgccgg ggaggaggcc accacaacca agtcacaggt gctggactac "20" ctcagctatg ctgtcttcca gttgggtgat ctgcaccgtg ccctggagct cacccgccgc "26" ctgctctccc ttgacccaag ccacgaacga gctggaggga ggta ctttgagcag "32" ttattggagg aagagagaga gtta acaaatcaga cagaagctga gctagcaacc "38" ccagaaggca tctatgagag gcctgtggac tacctgcctg agagggatgt ttacgagagc "44" ctctgtcgtg gggagggtgt gaca ccccgtagac agaagaggct tttctgtagg "50" taccaccatg gggc cccacagctg ctcattgccc ccttcaaaga ggaggacgag "56" agcc cgcacatcgt caggtactac gatgtcatgt ctgatgagga aatcgagagg "62" gaga tcgcaaaacc taaacttgca cgagccaccg ttcgtgatcc caagacagga "68" gtcctcactg tcgccagcta ccgggtttcc aaaagctcct ggctagagga agatgatgac "74" cctgttgtgg cccgagtaaa tcgtcggatg cagcatatca cagggttaac agtaaagact "80" gcagaattgt tacaggttgc aaattatgga ggac agtatgaacc gcacttcgac "86" ttctctaggc gaccttttga cagcggcctc aaaacagagg ggaataggtt agcgacgttt "92" cttaactaca tgagtgatgt agaagctggt ggtgccaccg tcttccctga ggct "98" tggc ctaagaaggg tacagctgtg ttctggtaca acctcttgcg gagcggggaa 204" ggtgactacc gaacaagaca tgctgcctgc cctgtgcttg tgggctgcaa gtgggtctcc 210" tggt tccatgaacg aggacaggag ttcttgagac cttgtggatc aacagaagtt 224224 216" gactgacatc cttttctgtc cttccccttc ctggtccttc agcccatgtc aacgtgacag 222" acacctttgt atgttccttt gtatgttcct ctga tttttggaga aatgaatgtt 228" tgtctggagc agagggagac catactaggg cgactcctgt gtgactgaag tcccagccct 234" tccattcagc ctgtgccatc cctggcccca aggctaggat caaagtggct gcagcagagt 240" tagctgtcta gcgcctagca aggtgccttt gtacctcagg aggt gtgagatgtt 246" tcagtgaacc aaagttctga taccttgttt acatgtttgt ttttatggca tttctatcta 252" ttgtggcttt accaaaaaat aaaatgtccc taccagaagc cttaaaaaaa aaaaaaaaaa 258" aa Protein ce (variant 2): NCBI Reference Seguence: NP_001017973.1 LOCUS NP_001017973 ACCESSION NP_001017973 1 mklwvsallm awfgvlscvq aefftsighm tdliyaekel vqslkeyilv eeaklskiks 6" wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 12" qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf yneg 18" dyyhtvlwme qvlkqldage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 24" eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy eslcrgegvk " ltprqurlf nrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 36" laratvrdpk tgvltvasyr leed ddpvvarvnr rmqhitgltv ktaellqvan 42" ygvggqyeph pfds glktegnrla sdve aggatvfpdl gaaiwpkkgt 48" avfwynllrs trha gckw vsnkwfherg qeflrpcgst evd Nucleotide seguence (variant 3): NCBI Reference ce: NM_001017974.1 LOCUS NM_001017974 ACCESSION NM_001017974 1 aagggaggag gcgccgagct gaccgggcga cgccgcggga ggaa acgccgggag 61 ctgcgagtgt ccagacactt ccctctgtga ccatgaaact ctgggtgtct gcattgctga 121 tggcctggtt tggtgtcctg agctgtgtgc aggccgaatt cttcacctct attgggcaca 181 tgactgacct gatttatgca gagaaagagc tggtgcagtc agag tacatccttg 225225 24" aagc caagctttcc aagattaaga ccaa caaaatggaa gccttgacta " cagc tgctgatgct gagggctacc accc tgtgaatgcc tacaaactgg 36" tgaagcggct aaacacagac tggcctgcgc tggaggacct gcag gactcagctg 42" caggttttat cgccaacctc tctgtgcagc ggcagttctt ccccactgat gaggacgaga 48" taggagctgc caaagccctg atgagacttc aggacacata ggac ccaggcacaa 54" tttccagagg ggaacttcca ggaaccaagt caat tgtg tgct 60" ttgggatggg ggcc tacaatgaag gggactatta tcatacggtg ttgtggatgg 66" agcaggtgct aaagcagctt gatgccgggg aggaggccac cacaaccaag tcacaggtgc 72" tggactacct cagctatgct gtcttccagt tgggtgatct gcaccgtgcc ctggagctca 78" cccgccgcct gctctccctt gacccaagcc acgaacgagc tggagggaat ctgcggtact 84" ttgagcagtt attggaggaa gagagagaaa aaacgttaac aaatcagaca gaagctgagc 90" tagcaacccc agaaggcatc tatgagaggc ctgtggacta tgag agggatgttt 96" acgagagcct ctgtcgtggg gagggtgtca aactgacacc ccgtagacag aagaggcttt "02" tctgtaggta ccaccatggc aacagggccc tgct cattgccccc ttcaaagagg "08" aggacgagtg ggacagcccg cacatcgtca ggtactacga tgtcatgtct gatgaggaaa "14" tcgagaggat caaggagatc gcaaaaccta aacttgcacg agccaccgtt cgtgatccca "20" agacaggagt cctcactgtc gccagctacc gggtttccaa aagctcctgg ctagaggaag "26" atgatgaccc tgttgtggcc cgagtaaatc gtcggatgca gcatatcaca gggttaacag "32" taaagactgc agaattgtta caggttgcaa attatggagt gggaggacag tatgaaccgc "38" acttcgactt ctctaggcga ccttttgaca tcaa aacagagggg aataggttag "44" cgacgtttct taactacatg agtgatgtag aagctggtgg tgccaccgtc ttccctgatc "50" tgggggctgc aatttggcct aagaagggta cagctgtgtt caac ctcttgcgga "56" gcggggaagg tgactaccga acaagacatg ctgcctgccc tgtgcttgtg ggctgcaagt "62" gggtctccaa taagtggttc catgaacgag agtt cttgagacct tgtggatcaa "68" cagaagttga ctgacatcct tttctgtcct tccccttcct ggtccttcag cccatgtcaa "74" cgtgacagac acctttgtat gttcctttgt atgttcctat caggctgatt tttggagaaa "80" tgaatgtttg tctggagcag agggagacca tactagggcg gtgt gactgaagtc "86" ccagcccttc cattcagcct gtgccatccc tggccccaag gctaggatca aagtggctgc "92" gtta gctgtctagc gcctagcaag gtgcctttgt acctcaggtg ttttaggtgt "98" tttc agtgaaccaa agttctgata ccttgtttac atgtttgttt ttatggcatt 226226 2041 tctatctatt gtggctttac caaaaaataa aatgtcccta ccagaagcct taaaaaaaaa 2101 aaaaaaaaaa Protein seguence (variant 3): NCBI Reference Seguence: NP_001017974.1 LOCUS NP_001017974 ACCESSION NP_001017974 l allm awfgvlscvq ighm tdliyaekel vqslkeyilv eeaklskiks 6" wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 12" qffptdedei gaakalmrlq pgti gtky qamlsvddcf gmgrsayneg 18" dyyhtvlwme dage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 24" eraggnlryf erek tltnqteael atpegiyerp vdylperdvy egvk " ltprqurlf cryhhgnrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 36" rdpk tgvltvasyr vsksswleed ddpvvarvnr rmqhitgltv ktaellqvan 42" ygvggqyeph fdfsrrpfds glktegnrla tflnymsdve aggatvfpdl gaaiwpkkgt 48" avfwynllrs gegdyrtrha acpvlvgckw herg qeflrpcgst evd Nucleotide seguence (variant 4): NCBI Reference Seguence: NM_001142598.1 LOCUS NM_001142598 ACCESSION NM_001142598 l aagggaggag gcgccgagct gaccgggcga cgccgcggga ggttctggaa acgccgggag 6" ctgcgagtgt ccagctgcgg agacccgtga taattcgtta actaattcaa caaacgggac l2" ccttctgtgt gccagaaacc gcaagcagtt gctaacccag tgggacaggc ggattggaag 18" agcgggaagg tcctggccca tgtg ccct ctgtgaccat gaaactctgg 24" gtgtctgcat tggc ctggtttggt gtcctgagct gtgtgcaggc cgaattcttc " acctctattg ggcacatgac tgacctgatt tatgcagaga tggt gcagtctctg 36" aaagagtaca tccttgtgga ggaagccaag ctttccaaga ttaagagctg ggccaacaaa 42" atggaagcct tgactagcaa gtcagctgct gatgctgagg gctacctggc tcaccctgtg 48" aatgcctaca aactggtgaa gcggctaaac acagactggc ctgcgctgga ggaccttgtc 54" ctgcaggact cagctgcagg ttttatcgcc aacctctctg tgcagcggca gttcttcccc 227227 60" actgatgagg acgagatagg agctgccaaa gccctgatga gacttcagga cacatacagg 66" ctggacccag gcacaatttc cagaggggaa cttccaggaa ccaagtacca ggcaatgctg 72" agtgtggatg actgctttgg gatgggccgc tcggcctaca atgaagggga tcat 78" acggtgttgt ggatggagca ggtgctaaag cagcttgatg ccggggagga ggccaccaca 84" accaagtcac tgga ctacctcagc tatgctgtct tccagttggg tgatctgcac 90" ctgg agctcacccg ccgcctgctc tcccttgacc caagccacga acgagctgga 96" gggaatctgc ggtactttga gcagttattg gaggaagaga gagaaaaaac aaat "02" cagacagaag ctgagctagc aaccccagaa ggcatctatg agaggcctgt ggactacctg "08" aggg atgtttacga gagcctctgt Cgtggggagg aact ccgt "l4" agacagaaga ggcttttctg taggtaccac catggcaaca gggccccaca gctgctcatt "20" gcccccttca aagaggagga cgagtgggac agcccgcaca tcgtcaggta ctacgatgtc "26" atgtctgatg aggaaatcga gaggatcaag gagatcgcaa aacctaaact tgcacgagcc "32" accgttcgtg atcccaagac cctc actgtcgcca gctaccgggt ttccaaaagc "38" tcctggctag aggaagatga tgaccctgtt gtggcccgag taaatcgtcg gatgcagcat "44" atcacagggt taacagtaaa gactgcagaa ttgttacagg ttgcaaatta tggagtggga "50" ggacagtatg aaccgcactt cgacttctct cctt ttgacagcgg cctcaaaaca "56" gaggggaata ggttagcgac gtttcttaac tacatgagtg atgtagaagc tggtggtgcc "62" accgtcttcc ctgatctggg aatt tggcctaaga agggtacagc tgtgttctgg "68" tacaacctct gcgg ggaaggtgac taccgaacaa gacatgctgc ctgccctgtg "74" cttgtgggct gcaagtgggt ctccaataag tggttccatg aacgaggaca ggagttcttg "80" agaccttgtg gatcaacaga agttgactga catccttttc tgtccttccc cttcctggtc "86" cttcagccca tgtcaacgtg acagacacct gttc ctttgtatgt tcctatcagg "92" ctgatttttg tgaa tgtttgtctg aggg agaccatact agggcgactc "98" ctgtgtgact gaagtcccag cccttccatt gtgc catccctggc cccaaggcta 204" ggatcaaagt ggctgcagca gagttagctg tctagcgcct agcaaggtgc ctttgtacct 210" caggtgtttt aggtgtgaga agtg aaccaaagtt cctt gtttacatgt 216" ttgtttttat ggcatttcta tctattgtgg ctttaccaaa aaataaaatg tccctaccag 222" aagccttaaa aaaaaaaaaa aaaaaa Protein seguence (variant 4): NCBI Reference Seguence: NP_001136070.1 228228 LOCUS NP_001136070 ACCESSION NP_001136070 l mklwvsallm awfgvlscvq aefftsighm tdliyaekel vqslkeyilv eeaklskiks 6" wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 12" qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf gmgrsayneg 18" dyyhtvlwme qvlkqldage eatttksqvl dylsyaqul gdlhralelt dpsh 24" eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy eslcrgegvk " ltprqurlf cryhhgnrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 36" laratvrdpk tgvltvasyr vsksswleed rvnr rmqhitgltv qvan 42" yeph fdfsrrpfds glktegnrla tflnymsdve aggatvfpdl gaaiwpkkgt 48" avfwynllrs gegdyrtrha acpvlvgckw vsnkwfherg qeflrpcgst evd Nucleotide seguence: (variant 5) NCBI Reference Seguence: NM_001142599.1 LOCUS NM_001142599 ION NM_001142599 l ggag gcgccgagct gaccgggcga cgccgcggga ggttctggaa acgccgggag 6" ctgcgagtgt ccagctgcgg agacccgtga taattcgtta actaattcaa caaacgggac 12" ccttctgtgt gccagaaacc gcaagcagtt ccag tgggacaggc ggattggaag 18" agcgggaagg tcctggccca gagcagtgtg acacttccct ctgtgaccat gaaactctgg 24" gcat tgctgatggc ctggtttggt gtcctgagct gtgtgcaggc cttc " acctctattg tgac tgacctgatt tatgcagaga aagagctggt gcagtctctg 36" aaagagtaca tccttgtgga ggaagccaag aaga ttaagagctg ggccaacaaa 42" atggaagcct tgactagcaa gtcagctgct gatgctgagg gctacctggc tcaccctgtg 48" aatgcctaca aactggtgaa gcggctaaac acagactggc ctgcgctgga ggaccttgtc 54" ctgcaggact cagctgcagg ttttatcgcc aacctctctg ggca gttcttcccc 60" actgatgagg tagg agctgccaaa gccctgatga gacttcagga cacatacagg 66" ctggacccag gcacaatttc cagaggggaa cttccaggaa ccaagtacca ggcaatgctg 72" agtgtggatg actgctttgg gatgggccgc taca atgaagggga ctattatcat 78" acggtgttgt ggatggagca ggtgctaaag cagcttgatg ccggggagga ggccaccaca 84" accaagtcac tgga ctacctcagc tatgctgtct tccagttggg tgatctgcac 229229 90" cgtgccctgg agctcacccg ccgcctgctc tcccttgacc caagccacga acgagctgga 96" gggaatctgc ggtactttga gcagttattg gaggaagaga aaac gttaacaaat "02" cagacagaag ctgagctagc aaccccagaa tatg agaggcctgt ggactacctg "08" cctgagaggg atgtttacga gagcctctgt gagg gtgtcaaact gacaccccgt "l4" agacagaaga ggcttttctg taggtaccac catggcaaca gggccccaca gctgctcatt "20" gcccccttca aagaggagga cgagtgggac agcccgcaca tcgtcaggta tgtc "26" atgtctgatg aggaaatcga caag gagatcgcaa aacctaaact tgcacgagcc "32" accgttcgtg atcccaagac aggagtcctc actgtcgcca gctaccgggt ttccaaaagc "38" tcctggctag aggaagatga tgaccctgtt gtggcccgag taaatcgtcg gcat "44" atcacagggt taacagtaaa gactgcagaa ttgttacagg ttgcaaatta tggagtggga "50" ggacagtatg aaccgcactt cgacttctct aggaatgatg agcgagatac tttcaagcat "56" ttagggacgg ggaatcgtgt ggctactttc ttaaactaca tgagtgatgt tggt "62" ggtgccaccg tcttccctga tctgggggct gcaatttggc ctaagaaggg tgtg "68" ttctggtaca acctcttgcg gagcggggaa ggtgactacc gaacaagaca tgctgcctgc "74" cctgtgcttg tgggctgcaa gtgggtctcc aataagtggt tccatgaacg aggacaggag "80" ttcttgagac cttgtggatc aacagaagtt gactgacatc cttttctgtc cttccccttc "86" ctggtccttc agcccatgtc aacgtgacag acacctttgt atgttccttt gtatgttcct "92" ctga tttttggaga aatgaatgtt gagc agagggagac catactaggg "98" cgactcctgt gtgactgaag tcccagccct tccattcagc ctgtgccatc cctggcccca 204" aggctaggat caaagtggct gcagcagagt tagctgtcta gcgcctagca aggtgccttt 210" gtacctcagg tgttttaggt gtgagatgtt tcagtgaacc ctga taccttgttt 216" acatgtttgt ttttatggca tttctatcta ttgtggcttt aaat tccc 222" taccagaagc cttaaaaaaa aaaa aa Protein seguence: NCBI Reference Seguence: NP_001136071.1 LOCUS NP_001136071 ACCESSION NP_001136071 l mklwvsallm scvq aefftsighm tdliyaekel vqslkeyilv eeaklskiks 61 wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 230230 12" qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf gmgrsayneg 18" dyyhtvlwme dage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 24" eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy eslcrgegvk " ltprqurlf cryhhgnrap fkee dewdsphivr yydvmsdeei erikeiakpk 36" laratvrdpk tgvltvasyr vsksswleed ddpvvarvnr rmqhitgltv ktaellqvan 42" yeph fdfsrnderd tfkhlgtgnr vatflnymsd veaggatvfp dlgaaiwpkk 48" ynll rsgegdyrtr haacpvlvgc kwvsnkwfhe rgqeflrpcg stevd Official Symbol: RBMX Official Name: RNA binding motif protein, X-Iinked Gene ID: 27316 Organism: Homo sapiens Other Aliases: RP1 1-1 1 14A5.1, Other Designations: RNA binding motif protein, X chromosome; RNA-binding motif protein, X chromosome; rotein p43; heterogeneous nuclear ribonucleoprotein G; hnRNP G Nucleotide ce (variant 1): NCBI Reference Seguence: NM_002139.3 LOCUS NM_002139 ACCESSION NM_002139 l ggtccttcag cctcgttccc gggcagtata gctg ttgt tcgccctcgt 6" tgcgcagtag tgctagcggc gttc ggtcctcgca cccggcagcc gccactggtg 12" ctgagctgct cccc tatcgccgag ctcgttggag cttgaaccca ttgtcacccc 18" tccgactcac cggcccaaaa aaaaaaaaac atggttgaag cagatcgccc aggaaagctc 24" ttcattggtg ggcttaatac ggaaacaaat gagaaagctc ttgaagcagt atttggcaaa " tatggacgaa tagtggaagt gatg aaagaccgtg aaaccaacaa atcaagagga 36" tttgcttttg tcacctttga aagcccagca gacgctaagg atgcagccag agacatgaat 42" ggaaagtcat gaaa agccatcaag gtggaacaag ccaccaaacc atcatttgaa 231231 48" agtggtagac gtggaccgcc tccacctcca agaagtagag gccctccaag aggtcttaga 54" ggtggaagag gaggaagtgg aggaaccagg ggacctccct cacggggagg acacatggat 60" gacggtggat attccatgaa ttttaacatg agttcttcca ggggaccact cccagtaaaa 66" agaggaccac caccaagaag tgggggtcct cctcctaaga gatctgcacc ttcaggacca 72" gttcgcagta gcagtggaat aaga gctcctgtat cacgtggaag agatagttat 78" ccac ctcgaaggga accgctgccc tctcgtagag attt gtccccaaga 84" gggt attctactaa agacagctat tcaagcagag attacccaag ttctcgtgat 90" actagagatt atgcaccacc accacgagat tatacttacc gtgattatgg tcattccagt 96" tcacgtgatg actatccatc aagaggatat agcgatagag atggatatgg tcgtgatcgt "02" gactattcag atcatccaag ttcc tacagagatt catatgagag taac "08" agtg ctccacctac acgagggccc ccgccatctt atggtggaag cagtcgctat "l4" gatgattaca gcagctcacg tgacggatat ggtggaagtc gagacagtta ctcaagcagc "20" cgaagtgatc tctactcaag tggtcgtgat cgggttggca aaag agggcttccc "26" ccttctatgg aaagggggta ccctcctcca cgtgattcct acagcagttc aagccgcgga "32" gcaccaagag gtggtggccg tggaggaagc cgatctgata gagggggagg cagaagcaga "38" tactagaaac aaacaaaact ttggaccaaa atcccagttc aaagaaacaa aaagtggaaa "44" ctattctatc ataactaccc aaggactact aaaaggaaaa attgtgttac tttttttaaa "50" ttccctgtta agttcccctc cataattttt atgttcttgt aaaa gtaaaacatg "56" tttaatttta tttgactttc gcattgcttt tcaacaagca aatgttaaat gtgttaagac "62" ttgtactagt gttgtaactt taaa agtatcccct aaaggccact tcctatctga "68" tttttcccag caaatgaggc aggcaattct aagatcttcc catc tagccatcta "74" aaatggagag atgaatcatt ctacctatac gcta gctattagag ggtggttggg "80" gtatgctact cataagattt cagggtgtct tccaactgaa atctcaatgt tctcagtacg "86" aaaaacctga aatcacatgc ctatgtaagg ctat tcacccagta aacccaaaaa "92" agcaaatgga taatgctggc cattttgcct ttctgacatt tccttgggaa tctgcaagaa "98" cctccccttt cccttccccc aataagacca tttaagtgtg acaa aata 2041 ctaaataaaa agtttggcca acca tgaagctgca aaaaaaaaaa aaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_002130.2 LOCUS NP_002130 232232 ACCESSION NP_002130 1 mveadrpgkl figglntetn ekaleavfgk ygrivevllm kdretnksrg fafvtfespa 61 dakdaardmn gksldgkaik veqatkpsfe Sgrrgppppp rsrgpprglr ggrggsggtr 121 gppsrgghmd nfnm sssrgplpvk rgppprsggp ppkrsapsgp vrsssgmggr 181 apvsrgrdsy ggpprreplp srrdvylspr ddgystkdsy ssrdypssrd trdyappprd 241 ytyrdyghss srddypsrgy sdrdgygrdr dysdhpsggs yrdsyesygn srsapptrgp 301 ppsyggssry ddysssrdgy ggsrdsysss rsdlyssgrd rglp psmergyppp 361 rdsyssssrg aprgggrggs rsdrgggrsr Y Nucleotide seguence (variant 2): NCBI Reference ce: NM_001164803.1 LOCUS NM_001164803 ACCESSION NM_001164803 1 ggtccttcag cctcgttccc gggcagtata aagtttgctg tctcctttgt tcgccctcgt 6" tgcgcagtag tgctagcggc ttcgcggttc ggtcctcgca cccggcagcc gccactggtg 12" ctgagctgct cccc tatcgccgag ctcgttggag ccca ttgtcacccc 18" tccgactcac cggcccaaaa aaaaaaaaac gaag cagatcgccc aggaaagctc 24" ggtg ggcttaatac ggaaacaaat gagaaagctc ttgaagcagt atttggcaaa " tatggacgaa tagtggaagt actcttgatg aaagaccgtg aaaccaacaa atcaagagga 36" tttgcttttg tcacctttga aagcccagca gacgctaagg atgcagccag agacatgaat 42" ctcc acgt ggaagagata gttatggagg tccacctcga agggaaccgc 48" tgccctctcg tgtt tatttgtccc caagagatga tgggtattct actaaagaca 54" gctattcaag cagagattac ccaagttctc gtgatactag agattatgca ccaccaccac 60" gagattatac ttaccgtgat tatggtcatt ccagttcacg tgatgactat ccatcaagag 66" gatatagcga tagagatgga tatggtcgtg atcgtgacta tcat ccaagtggag 72" gttcctacag agattcatat gagagttatg gttggtgatt ttgctcatta tggtcgtgga 78" gtgctgattg attcacagta gataaagctg gcagtaagaa atgctaagag ttgttgaagc 84" agaaggcggc tgattgtcaa taagtcacta cagttgcata gctg tcagaattgg 90" tttggtgcag gcaatagatt ttgccttcag gggttcctgt ggatctgagg aaggcatcag 96" tgttgattaa taac tagggagtga ctggtagtta cttaaagcaa gtaattgacc 233233 "O2" aaatggaaaa ggggaagtaa ttaaggaaat tggtaagtgg aggtagtcag gaagttcttg "08" tggttcttca catagatttt acagctttgg ctttcatttt gtttagctaa agtcatgggg "14" acaactcttc aatttagaac ttaagttgaa ttataaaaat gatggatata agtggtagct "20" gtatctagtg aagtgtctgt cagtaagtga aacatttttt ggtggtggct tatccacaaa "26" cagtttagtt gtagaataaa acttatgagt gacatctgga aagtaaccat gctaagatgg "32" caagcacact ggaaacaatt aggccacttg gctttctttt gctgtattgt ttta:aagcc "38" acct cttg gaaacaagtt ttagtttttt attggtttgg agac:agagc "44" caatagtata atgttctcaa aggaaacaga cttgagttgt tggattagag gaac:aaccc "50" aacttatatg attttttttt tgtttttgtc gtgtagttat ggcactgtct tatt :ggaac "56" atttgcaact agggataata caacattttt aactctcatt tgacaaccta atca "62" cagaccacaa gggtaatgac caaatttatg tggtttttgc actccatagt tgtc:tagcc caatctttct atactcttac ttgg gttaacgctt ctgtgaggac cttc:ggctc "74" ttgagatacc ctaaatattt aagatattta gatatcttga agatagtata ggatatagag "80" attgtaccaa ataggaatat aaggagtatg ttaaaatgac cagatacctg tttgatagtt "86" tactgaccta gcagatgtgt ggaaaaggaa cttg attcttctgg actg "92" gttgtaaaac atac agaaaatgtt ttccttgttt aactggtagt taga "98" gtat tatagatcac ttttcacttt ttggaatgtt ttgtattgaa acttaataaa 204" actttaacat ggaaaaaaaa aaaaaaaaaa a Protein seguence (variant 2): NCBI Reference Seguence: NP_001158275.1 LOCUS NP_001158275 ACCESSION NP_001158275 1 mveadrpgkl figglntetn vfgk ygrivevllm kdretnksrg fafvtfespa 61 dakdaardmn veei vmevhlegnr cplvemficp qemmgillkt aiqaeitqvl 121 vileimhhhh eiiltvimvi pvhvmtihqe diaiemdmvv ivtiqiiqve mrvm 181 vgdfahygrg vlidsq |BP7 Official Symbol: IGFBP7 Official Name: insulin-like growth factor binding protein 7 234234 Gene ID: 3490 Organism: Homo sapiens Other Aliases: AGM, FSTL2, lBP-7, lGFBP-7, lGFBP-7v, IGFBPRP1, MAC25, PSF, RAMSVPS, TAF Other Designations: IGF-binding n 7; IGFBP-rPf; PGI2—stimulating factor; angiomodulin; insulin-like growth factor-binding protein 7; prostacyclin- stimulating factor; tumor-derived adhesion factor tide seguence (variant 1): NCBI Reference ce: NM_001553.2 LOCUS NM_001553 ACCESSION NM_001553 l actcgcgccc ctgc caccgcaccc cgccatggag cggccgtcgc tgcgcgccct 6" gctcctcggc gggc tgctgctcct gctcctgccc ctctcctctt cctcctcttc 12" ggacacctgc ggcccctgcg agccggcctc ctgcccgccc ctgcccccgc tgggctgcct 18" gctgggcgag acccgcgacg Cgtgcggctg ctgccctatg tgcgcccgcg gcgagggcga 24" cggg ggcg ccggcagggg gtactgcgcg ccgggcatgg agtgcgtgaa " gagccgcaag aggcggaagg gtaaagccgg agcc ggcggtccgg gtgtaagcgg 36" Cgtgtgcgtg tgcaagagcc gctacccggt gtgcggcagc gacggcacca cctacccgag 42" cggctgccag ctgcgcgccg ccagccagag ggccgagagc cgcggggaga aggccatcac 48" ccaggtcagc aagggcacct gcgagcaagg tccttccata gtgacgcccc ccaaggacat 54" ctggaatgtc actggtgccc aggtgtactt gagctgtgag gtcatcggaa cacc 60" catc tggaacaagg taaaaagggg tcactatgga gttcaaagga cagaactcct 66" gcctggtgac cgggacaacc tggccattca gacccggggt ggcccagaaa agcatgaagt 72" aactggctgg gtat ctcctctaag taaggaagat gctggagaat atgagtgcca 78" tgcatccaat tcccaaggac aggcttcagc atcagcaaaa attacagtgg ttgatgcctt 84" acatgaaata ccagtgaaaa aaggtgaagg tgccgagcta taaacctcca gaatattatt 90" agtctgcatg gttaaaagta gtcatggata actacattac ctgttcttgc ctaataagtt 96" tcttttaatc acta acactttagt tatattcact ggttttacac agagaaatac 102" aaga tcacacatca agactatcta caaaaattta ttatatattt acagaagaaa 108" agcatgcata tcattaaaca aataaaatac tttttatcac aacacagtaa aaaaaaa Protein seguence (variant 1): 235235 NCBI Reference Seguence: NP_001544.1 LOCUS NP_001544 ION NP_001544 1 rall lgaaglllll lplssssssd pasc pplpplgcll getrdacgcc 61 pmcargegep cggggagrgy capgmecvks rkrrkgkaga vsgv cvcksrypvc 121 gsdgttypsg cqlraasqra aitq vskgtcqup sivtppkdiw nvtgaquls 181 cevigiptpv liwnkvkrgh yqurtellp gdrdnlaiqt rggpekhevt nglvsplsk 241 edageyecha snsngasas akitvvdalh eipvkkgega e1 tide seguence (variant 2) NCBI Reference Seguence: NM_001253835.1 LOCUS NM_001253835 ACCESSION NM_001253835 1 actcgcgccc ttgccgctgc caccgcaccc cgccatggag cggccgtcgc tgcgcgccct 6" gctcctcggc gccgctgggc tgctgctcct gctcctgccc ctctcctctt cctcctcttc 12" ggacacctgc ggcccctgcg agccggcctc ctgcccgccc ctgcccccgc tgggctgcct 18" gctgggcgag acccgcgacg Cgtgcggctg ctgccctatg tgcgcccgcg gcga 24" gccgtgcggg ggtggcggcg ccggcagggg gtactgcgcg ccgggcatgg agtgcgtgaa " gagccgcaag aggcggaagg ccgg ggcagcagcc ggcggtccgg gtgtaagcgg 36" Cgtgtgcgtg tgcaagagcc gctacccggt cagc gacggcacca cctacccgag 42" cggctgccag ctgcgcgccg ccagccagag ggccgagagc cgcggggaga aggccatcac 48" ccaggtcagc acct gcgagcaagg tccttccata gtgacgcccc ccaaggacat 54" ctggaatgtc actggtgccc aggtgtactt gagctgtgag ggaa tcccgacacc 60" tgtcctcatc tggaacaagg taaaaagggg tcactatgga gttcaaagga cagaactcct 66" gcctggtgac cgggacaacc tggccattca gacccggggt ggcccagaaa agcatgaagt 72" aactggctgg gtgctggtat ctcctctaag taaggaagat gctggagaat atgagtgcca 78" tgcatccaat tcccaaggac aggcttcagc atcagcaaaa attacagtgg ttgatgcctt 84" acatgaaata ccagtgaaaa aaggtacaca ataaatctca ttta aaaatgacta 90" gtacatttgc tttaaaaaga ctaa gtatgaaagt atcagacgta gctattgatg 96" aaattctgta gttagcaacc ggca ttaagtatgc cattaaaatg tacagcatga 1021 gactccaaaa gattatctgg atgggtgact g 236236 Protein seguence (variant 2): NCBI Reference Seguence: NP_001240764.1 LOCUS NP_001240764 ACCESSION NP_001240764 1 merpslrall lgaaglllll lplssssssd pasc pplpplgcll getrdacgcc 61 pmcargegep grgy capgmecvks rkrrkgkaga aaggpgvsgv cvcksrypvc 121 gsdgttypsg sqra esrgekaitq vskgtcqup sivtppkdiw nvtgaquls 181 cevigiptpv liwnkvkrgh yqurtellp gdrdnlaiqt rggpekhevt nglvsplsk 241 edageyecha snsngasas akitvvdalh eipvkkgtq al Symbol: HLA-C Official Name: major ompatibility complex, class I, C m: 3107 Organism: Homo sapiens Other Aliases: XXbac-BCX101P6.2, D68204, HLA-JY3, HLC-C, PSORSf Other Designations: HLA class I histocompatibility antigen, G alpha chain; HLA class I histocompatibility antigen, Cw-1 alpha chain; MHC class I antigen heavy chain HLA-C; human leukocyte antigen-C alpha chain; major histocompatibility antigen HLA-C Nucleotide seguence (variant 1): NCBI nce Seguence: 117.5 LOCUS NM_002117 ACCESSION NM_002117 1 tccgcagtcc cggttctaaa gtccccagtc acccacccgg actcacattc tccccagagg 61 ccgagatgcg ggtcatggcg gccc tcctcctgct gctctcggga ggcctggccc 121 tgaccgagac ctgggcctgc tcccactcca tgaggtattt cgacaccgcc gtgtcccggc 181 gcgg agagccccgc ttcatctcag tgggctacgt ggacgacacg cagttcgtgc 241 ggttcgacag cgacgccgcg agtccgagag gggagccgcg ggcgccgtgg gtggagcagg 301 aggggccgga gtattgggac cgggagacac agaagtacaa ggca caggctgacc 237237 36" gagtgagcct gcggaacctg cgcggctact acaaccagag cgaggacggg tctcacaccc 42" tccagaggat gtctggctgc gacctggggc CCganggCg ccgc gggtatgacc 48" agtccgccta cgacggcaag gattacatcg ccctgaacga ggacctgcgc accg 54" ccgcggacac cgcggctcag atcacccagc gcaagttgga ggcggcccgt gcggcggagc 60" agctgagagc ggag ggcacgtgcg tggagtggct atac ctggagaacg 66" ggaaggagac gctgcagcgc cccc caaagacaca cgtgacccac caccccctct 72" atga ggccaccctg aggtgctggg ccctgggctt ctaccctgcg gagatcacac 78" tgacctggca gcgggatggg gaggaccaga cccaggacac cgagcttgtg gagaccaggc 84" cagcaggaga tggaaccttc cagaagtggg cagctgtggt ggtgccttct ggacaagagc 90" agagatacac gtgccatatg cagcacgagg ggctgcaaga gcccctcacc ctgagctggg 96" agccatcttc ccagcccacc atccccatca tgggcatcgt tgctggcctg gctgtcctgg "O2" ttgtcctagc tgtccttgga gctgtggtca ccgctatgat gtgtaggagg aagagctcag "08" aagg agggagctgc tctcaggctg cgtgcagcaa cagtgcccag ggctctgatg "14" agtctctcat cacttgtaaa gcctgagaca gctgcctgtg tgggactgag atgcaggatt "20" cacc tctcctttgt gacttcaaga gcctctggca tctctttctg caaaggcacc "26" tgaatgtgtc tgcgttcctg ttagcataat aggt ggagagacag cccacccccg "32" tgtccaccgt gacccctgtc cccacactga cctgtgttcc ctccccgatc atctttcctg "38" ttccagagag gtggggctgg atgtctccat ctctgtctca aattcatggt gcactgagct "44" tctt acttccctaa tgaagttaag aacctgaata taaatttgtg ttctcaaata "50" tttgctatga agcgttgatg gattaattaa ataagtcaat tcctagaagt tgagagagca 56" aataaagacc tgagaacctt ccagaa Protein seguence (variant 1): NCBI Reference Seguence: NP_002108.4 LOCUS NP_002108 ION 108 1 mrvmaprall lalt etwacshsmr yfdtavsrpg rgeprfisvg yvddtqfvrf 61 dsdaasprge prapwveqeg peywdretqk yquaqadrv slrnlrgyyn qsedgshth 121 rmsgcdlgpd grllrgydqs aydgkdyial nedlrswtaa dtaaqitqu leaaraaeql 181 tcve wlrrylengk ethraeppk thvthhplsd heatlrcwal gfypaeitlt 241 dqtq trpa gdgtquwaa vvvpsgqeqr ytchmqhegl qepltlswep 238238 301 ssqptipimg vlvv lavlgavvta mmcrrkssgg kggscsqaac snsaqgsdes 361 litcka Nucleotide seguence (variant 2): NCBI nce Seguence: NM_001243042.1 LOCUS NM_001243042 ACCESSION NM_001243042 l tccgcagtcc cggttctaaa gtccccagtc ccgg actcacattc tccccagagg 6" ccgagatgcg ggtcatggcg ccccgagccc tcctcctgct gctctcggga ggcctggccc 12" tgaccgagac ctgggcctgc tcccactcca tgaggtattt cgacaccgcc gtgtcccggc 18" CngCCgng agagccccgc ttcatctcag acgt ggacgacacg cagttcgtgc 24" ggttcgacag cgacgccgcg agtccgagag gggagccgcg ggcgccgtgg gtggagcagg " aggggccgga gtattgggac acac agaactacaa gcgccaggca caggctgacc 36" gagtgagcct gcggaacctg cgcggctact acaaccagag cgaggacggg tctcacaccc 42" tccagaggat gtatggctgc gacctggggc CCganggCg cctcctccgc gggtatgacc 48" agtccgccta caag gattacatcg ccctgaacga ggacctgcgc tcctggaccg 54" ccgcggacac cgcggctcag atcacccagc gcaagttgga ggcggcccgt gcggcggagc 60" agctgagagc ctacctggag ggcacgtgcg tggagtggct ccgcagatac ctggagaacg 66" ggaaggagac gctgcagcgc gcagaacccc caaagacaca ccac caccccctct 72" ctgaccatga ggccaccctg aggtgctggg ccctgggctt ctaccctgcg gagatcacac 78" tgacctggca gcgggatggg gaggaccaga cccaggacac cgagcttgtg gagaccaggc 84" cagcaggaga tggaaccttc cagaagtggg cagctgtggt ggtgccttct ggacaagagc 90" agagatacac gtgccatatg cagcacgagg ggctgcaaga gcccctcacc ctgagctggg 96" agccatcttc ccagcccacc atccccatca tgggcatcgt tgctggcctg gctgtcctgg "O2" ttgtcctagc tgtccttgga gctgtggtca ccgctatgat gtgtaggagg tcag "08" aagg agggagctgc tctcaggctg gcaa cagtgcccag ggctctgatg "l4" agtctctcat cacttgtaaa gcctgagaca gctgcctgtg tgag atgcaggatt "20" tcttcacacc tctcctttgt aaga ggca tctctttctg caaaggcgtc "26" tgaatgtgtc cctg ttagcataat gtgaggaggt ggagagacag cccacccccg "32" tgtccaccgt gacccctgtc ctga cctgtgttcc ctccccgatc atctttcctg "38" ttccagagag gtggggctgg atgtctccat ctctgtctca aattcatggt gcactgagct 239239 1441 gcaacttctt acttccctaa tgaagttaag aacctgaata taaatttgtg ttctcaaata 1501 tttgctatga agcgttgatg gattaattaa caat tcctagaagt tgagagagca 1561 aataaagacc tgagaacctt ccagaa Protein seguence nt 2): NCBI Reference Seguence: NP_001229971.1 LOCUS NP_001229971 ION NP_001229971 1 mrvmaprall lllsgglalt etwacshsmr yfdtavsrpg rgeprfisvg yvddtqfvrf 61 dsdaasprge prapwveqeg peywdretqn yquaqadrv slrnlrgyyn qsedgshth 121 rmygcdlgpd grllrgydqs aydgkdyial nedlrswtaa dtaaqitqu leaaraaeql 181 raylegtcve wlrrylengk ethraeppk plsd heatlrcwal gfypaeitlt 241 wqrdgedqtq dtelvetrpa gdgtquwaa qeqr ytchmqhegl qepltlswep 301 ssqptipimg ivaglavlvv lavlgavvta mmcrrkssgg kggscsqaac snsaqgsdes 361 litcka RRAS2 Official Sym bol: RRAS2 Official Name: d RAS viral (r-ras) oncogene homolog 2 Gene ID: 22800 Organism: Homo sapiens Other Aliases: TC21 Other Designations: ras-like protein TC21; ras-related protein R-Ras2; teratocarcinoma ne Nucleotide seguence (variant 1): NCBI Reference Seguence: 250.5 LOCUS NM_012250 ACCESSION NM_012250 1 cagacggcca tttgtggcgg cgctggaggc tgcgttcggc aggcgctgcg gagacgcgta 61 gaggagcgcg ccccccggcc gccc ctggcccgtg ccgtcacccc gcttctccgc 240240 12" gcctcgggcg agcc agtccccagc gccgcgctac cgcgctgacc ggccctccag 18" acgcctcccg gtacccggga ccccagcccg gccgctcgcc cgcagcccgc cggccgcaca 24" cgtccccgga gccgggccta gcgg cagcggcggc tcggcgcagt caggctgggc " tctgtagcgt ccccatggcc gcggccggct acgg ctccggccag gagaagtacc 36" ggctcgtggt ggtcggcggg ggcggcgtgg gcaagtcggc catc cagttcatcc 42" agtcctattt tgtaacggat ccaa ccattgaaga ttcttacaca aagcagtgtg 48" tgatagatga cagagcagcc cggctagata ttttggatac agcaggacaa gaagagtttg 54" gagccatgag agaacagtat actg gcgaaggctt ggtc ttttcagtca 60" cagatagagg cagttttgaa gaaatctata agtttcaaag acagattctc agagtaaagg 66" atcgtgatga gttcccaatg attttaattg gtaataaagc agatctggat catcaaagac 72" aggtaacaca ggaagaagga caacagttag cacggcagct taaggtaaca tacatggagg 78" catcagcaaa gattaggatg aatgtagatc aagctttcca tgtc cgggttatca 84" ggaaatttca agagcaggaa tgtcctcctt caccagaacc aacacggaaa gaaaaagaca 90" agaaaggctg ccattgtgtc attttctaga atcccttcag ttttagctac caacggccag 96" gaaaagccct catcttctct ttctctcctc catc ttgttggtac ctttctagcc "02" ttagacaaat gatcaccatg ttagccttag acgaagaagc tggctagtcc tttctgtgaa "08" gctaatacaa tggtcatttc aatt taaaggaaac actaaggctg cttcaaagat "14" tatctgattc aata tatgtctata tacacagaca tgctcttttt ttaagtgctt "20" acattttaat agagatgaat cagttttgga gctg tttgccaagc tgaagctaca "26" ggttgtgaaa taatttttaa cttttggaat catactgcct actgttactc taaatagaaa "32" tatagggttt tttttaatgt gaatttttgc ctatctttaa acatttcaat gtcagccttt "38" gttaacctta aatacactga attgaatcta caaaagtgaa ccatctcaga cct:tactga "44" tactacaact tttgttttct gatggccaaa ataccaaatg cctgttgtat tta:ggatta "50" aaaactgctt ataaaaccct gtgttactac tcctactctt gata ata:tctatg "56" tggtcaaata tttggactca tttaggactt agatatttca gtgtacttga ttt:ttaatt taactctttt tcacagccac gctaagggta aaaaggaata atttccttct gtc :tccttt "68" tcaagtattt ctgggtaagg gattcaaaaa actaaaactg tttttgtttg taa :ataaaa "74" tatggaattg atctttccag ggtcagagat gattaatgtt atat act :ttatac "80" ttct tatcaaacta gttaacaagt atat gtttgtaagc aga :atgctt "86" tcatagcata gtat atgtaaagat aagtatttaa ttctcactgt tcacttttaa 241241 192" ctgacaaaga aaaacaagtg gaaactacag tggt agaactttta cttgctggtc 198" tggt tgtacccatc tttggccagt acta ctcaagaaac cttcccaata 204" aaca ggatgagact ctgaaatcac tttcagtatt taga tattgattgt 210" tatttcaagt attaagtgta agcttttaat ggataattag tgtg gatggcatct 216" gattttgttt ttaattctgt ggattgtgtt taagcaattc aatagtatgt tcctgatttt 222" gagatgctaa gtggtattgc acagttgtca ctttatcaag tgtgtacaac agtcccatga 228" agtttataga cctt gtatagcttc aggtgctaga attaaaattg atctgttatc 234" acaagaaaaa aaaaaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_036382.2 LOCUS N P_036382 ION NP_036382 1 maaagwrdgs gqekyrlvvv ggggvgksal tiqfiqsyfv tdydptieds ytchviddr 61 aarldildta gqeefgamre qymrtgegfl 1vfsvtdrgs feeiyqurq ilrvkdrdef 121 nkad 1dhqrqvtqe egqqlarqlk vtymeasaki afhe 1vrvirque 181 qecppspept rkekdkkgch cvif Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001102669.2 LOCUS NM_001102669 ACCESSION NM_001102669 1 gggagggcgg agctggaagg gtgggaagca ccgatccacc ttattgctct ggccgaggcc 6" agagacctcc gggagaggct ccga gccgggcttt actgctccga gggtccgggc 12" gtggggctgg agctggagcc ccgcgcgctg cttttccagc cgcctgcggc cgcgccttca 18" ccgtcggggc ggtg gcaacttggc cgcggctccg cgtggtctcc gggcttcccc 24" gcgccgcctg agccggagct gcccgcttca atcctatttt gtaacggatt atgatccaac " cattgaagat tcttacacaa agcagtgtgt gatagatgac agagcagccc ggctagatat 36" tttggataca gcaggacaag aagagtttgg agccatgaga gaacagtata tgaggactgg 42" cgaaggcttc ctgttggtct tttcagtcac agatagaggc agttttgaag aaatctataa 242242 48" gtttcaaaga cagattctca gagtaaagga tcgtgatgag ttcccaatga ttttaattgg 54" taataaagca gatctggatc atcaaagaca ggtaacacag gaagaaggac aacagttagc 60" acggcagctt aaggtaacat acatggaggc atcagcaaag attaggatga atca 66" agctttccat gaacttgtcc gggttatcag gaaatttcaa gagcaggaat gtcctccttc 72" accagaacca acacggaaag aaaaagacaa gaaaggctgc cattgtgtca ttttctagaa 78" tcccttcagt tttagctacc aacggccagg aaaagccctc atcttctctt tctctcctca 84" gtttacatct tgttggtacc tttctagcct tagacaaatg atcaccatgt tagccttaga 90" cgaagaagct ggctagtcct ttctgtgaag ctaatacaat ggtcatttcc agacaaattt 96" aaaggaaaca ctaaggctgc ttcaaagatt atctgattcc atat atgtctatat "02" acacagacat gctctttttt taagtgctta cattttaata gagatgaatc ggaa "08" tctaagctgt ttgccaagct gaagctacag gttgtgaaat aatttttaac aatc "14" atactgccta ctgttactct aaatagaaat atagggtttt ttttaatgtg aatttttgcc "20" tatctttaaa catttcaatg tttg ttaaccttaa atacactgaa ttgaatctac "26" aaaagtgaac catctcagac tgat actacaactt ttgttttctg atggccaaaa "32" taccaaatgc ctgttgtatt ta:ggattaa aaactgctta taaaaccctg tgttactact "38" cctactcttg gagatgataa ta:tctatgt ggtcaaatat ttggactcat ttaggactta "44" tcag tgtacttgat tt:ttaattt aactcttttt cacagccacg ctaagggtaa "50" aaaggaataa tttccttctg tc:tcctttt caagtatttc tgggtaaggg aaaa "56" ctaaaactgt ttttgtttgt aa :ataaaat atggaattga tctttccagg gtcagagatg "62" attaatgttt ttgctatata ct:ttataca ttattttctt atcaaactag ttaacaagta "68" tatg agca ga:atgcttt catagcatac cttgtgtata tgtaaagata "74" agtatttaat tgtt cacttttaac tgacaaagaa aaacaagtgg aaactacaga "80" aactgtggta gaacttttac ttgctggtct ggtcttggtt gtacccatct ttggccagtc "86" ctac tcaagaaacc ttcccaatag agtacaacag gatgagactc tgaaatcact "92" ttcagtattc agat attgattgtt agta ttaagtgtaa gcttttaatg "98" tagt ataactgtgg atggcatctg attttgtttt taattctgtg gattgtgttt 204" ttca atagtatgtt cctgattttg agatgctaag tggtattgca cagttgtcac 210" tttatcaagt gtgtacaaca gtcccatgaa gtttatagag catacccttg tatagcttca 216" ggtgctagaa ttaaaattga tctgttatca caagaaaaaa aaaaaaaaa n seguence (varaiant 2): 243243 NCBI Reference ce: NP_001096139.1 LOCUS NP_001096139 ACCESSION NP_001096139 1 mreqymrtge gfllvfsvtd rgsfeeiykf qrqilrvkdr defpmilign kadldhqrqv 61 tqeegqqlar qlkvtymeas akirmnvdqa fhelvrvirk fqeqecppsp eptrkekdkk 121 gchcvif Nucleotide seguence nt 3): NCBI Reference Seguence: NM_001177314.1 LOCUS NM_001177314 ACCESSION NM_001177314 1 atctcagatg catgcagctc ctgctgggcg gtttcattct gcca ttcattcaca 6" ttaaaagcaa tggcccatta aggaaataag tggatgatgc tcctccatca cccatgtcct 12" attttgtaac ggattatgat ccaaccattg aagattctta cacaaagcag tgtgtgatag 18" atgacagagc agcccggcta gatattttgg cagg acaagaagag tttggagcca 24" tgagagaaca gtatatgagg actggcgaag gcttcctgtt ggtcttttca gtcacagata " gaggcagttt tgaagaaatc tataagtttc aaagacagat tctcagagta aaggatcgtg 36" atgagttccc ttta aata aagcagatct ggatcatcaa agacaggtaa 42" cacaggaaga aggacaacag ttagcacggc agcttaaggt aacatacatg gaggcatcag 48" caaagattag tgta gatcaagctt tccatgaact tgtccgggtt atcaggaaat 54" ttcaagagca ggaatgtcct ccttcaccag aaccaacacg aaaa gacaagaaag 60" gctgccattg tttc tagaatccct tcagttttag ctaccaacgg ccaggaaaag 66" ccctcatctt ctctttctct cctcagttta catcttgttg gtacctttct agccttagac 72" aaatgatcac catgttagcc ttagacgaag aagctggcta gtcctttctg tgaagctaat 78" acaatggtca tttccagaca aatttaaagg aaacactaag gctgcttcaa agattatctg 84" attcctttaa aatatatgtc tatatacaca gacatgctct ttttttaagt gcttacattt 90" taatagagat gaatcagttt tggaatctaa gctgtttgcc aagctgaagc tacaggttgt 96" gaaataattt ttaacttttg gaatcatact gcctactgtt actctaaata tagg 102" gtttttttta atgtgaattt ttgcctatct attt caatgtcagc taac 108" cttaaataca ctgaattgaa tctacaaaag tgaaccatct cagaccttta ctgatactac 114" aacttttgtt ttctgatggc caaaatacca aatgcctgtt gtatttatgg attaaaaact 244244 "20" gcttataaaa ccctgtgtta ctactcctac tcttggagat gataatattc gtca "26" aatatttgga ctcatttagg acttagatat ttcagtgtac ttgatttttt aatttaactc "32" tttttcacag ccacgctaag ggtaaaaagg aataatttcc ttctgtcttc cttttcaagt "38" atttctgggt aagggattca aaaaactaaa actgtttttg tttgtaatat aaaatatgga "44" attgatcttt ccagggtcag agatgattaa tgtttttgct atatactttt atacattatt "50" ttcttatcaa actagttaac aagtattttt atatgtttgt aagcagatat gctttcatag "56" cataccttgt gtatatgtaa agataagtat ttaattctca ctgttcactt ttaactgaca "62" aagaaaaaca agtggaaact acagaaactg tggtagaact tttacttgct gtct "68" tggttgtacc tggc cagtcacata caag aaaccttccc aatagagtac "74" aacaggatga gactctgaaa tcactttcag ctgc tagatattga ttgttatttc "80" aagtattaag tgtaagcttt taatggataa ttagtataac tgtggatggc atctgatttt "86" aatt ctgtggattg tgtttaagca attcaatagt atgttcctga ttttgagatg "92" ggta ttgcacagtt gtcactttat caagtgtgta caacagtccc atgaagttta "98" tagagcatac ccttgtatag gtgc tagaattaaa attgatctgt tatcacaaga 204" aaaaaaaaaa aaaa Protein seguence (variant 3): NCBI Reference Seguence: NP_001170785.1 LOCUS NP_001170785 ACCESSION NP_001170785 1 msyfvtdydp tch viddraarld ildtagqeef gamreqymrt gegfllvfsv 61 tdrgsfeeiy qurqilrvk drdefpmili gnkadldhqr qvtqeegqql arqlkvtyme 121 asakirmnvd qafhelvrvi rqueqecpp speptrkekd kkgchcvif Nucleotide seguence (variant 4): NCBI Reference ce : NM_001177315.1 LOCUS NM_001177315 ACCESSION NM_001177315 1 attgctctgg ccgaggccag ccgg gagaggctgg gccaccgagc cgggctttac 61 tgctccgagg gtccgggcgt ggggctggag cccc gcgcgctgct tttccagccg 121 cctgcggccg cgccttcacc gtcggggcga tagcggtggc aacttggccg cggctccgcg 181 ccgg gcttccccgc gccgcctgag ccggagctgc ccgcttcaag tactgtgtat 245245 24" ttctttgttc ctattttgta acggattatg atccaaccat ttct tacacaaagc " tgat agatgacaga gcagcccggc tagatatttt ggatacagca ggacaagaag 36" gagc catgagagaa cagtatatga ggactggcga aggcttcctg tttt 42" cagtcacaga cagt tttgaagaaa tctataagtt tcaaagacag attctcagag 48" taaaggatcg tgatgagttc ccaatgattt taattggtaa taaagcagat ctggatcatc 54" aaagacaggt ggaa gaaggacaac agttagcacg gcagcttaag gtaacataca 60" tggaggcatc agcaaagatt aggatgaatg aagc tttccatgaa cttgtccggg 66" ttatcaggaa atttcaagag caggaatgtc ctccttcacc agaaccaaca cggaaagaaa 72" aagacaagaa aggctgccat tgtgtcattt tctagaatcc cttcagtttt agctaccaac 78" ggccaggaaa agccctcatc ttctctttct ctcctcagtt tacatcttgt tggtaccttt 84" ctagccttag acaaatgatc ttag ccttagacga agaagctggc tagtcctttc 90" tgtgaagcta atacaatggt catttccaga caaatttaaa ggaaacacta aggctgcttc 96" aaagattatc tgattccttt aaaatatatg tctatataca cagacatgct ctttttttaa "02" gtgcttacat tttaatagag atgaatcagt tttggaatct aagctgtttg ccaagctgaa "08" gctacaggtt gtgaaataat ttttaacttt tggaatcata ctgcctactg ttactctaaa "14" tagaaatata gggttttttt taatgtgaat ttttgcctat ctttaaacat ttcaatgtca "20" gcctttgtta accttaaata cactgaattg caaa agtgaaccat ctcagacct: "26" tactgatact acaacttttg ttttctgatg gccaaaatac caaatgcctg ttgtattta: "32" ggattaaaaa ctgcttataa gtgt tactactcct actcttggag atgataata: "38" tggt caaatatttg gactcattta agat atttcagtgt ttt: "44" ttaatttaac tctttttcac agccacgcta agggtaaaaa ggaataattt ccttctgtc: "50" tccttttcaa gtatttctgg gtaagggatt caaaaaacta aaactgtttt tgtttgtaa: "56" ataaaatatg gaattgatct ggtc agagatgatt aatgtttttg ctatatact: "62" ttatacatta ttttcttatc aaactagtta acaagtattt ttatatgttt gtaagcaga: "68" atgctttcat agcatacctt gtgtatatgt aaagataagt atttaattct cactgttcac "74" ttttaactga aaaa caagtggaaa ctacagaaac tgtggtagaa cttttacttg "80" ctggtctggt cttggttgta cccatctttg gccagtcaca taactactca agaaaccttc "86" ccaatagagt acaacaggat gagactctga aatcactttc agtattccct gctagatatt "92" gattgttatt tcaagtatta agct tttaatggat aattagtata actgtggatg "98" gcatctgatt ttgtttttaa ttctgtggat tgtgtttaag aata gtatgttcct 246246 2041 gattttgaga tgctaagtgg tattgcacag cttt atcaagtgtg tacaacagtc 2101 agtt tatagagcat acccttgtat agcttcaggt gctagaatta aaattgatct 2161 gttatcacaa gaaaaaaaaa aaaaaa Protein sequence (variant 4): NCBI Reference Seguence: 170786.1 LOCUS NP_001170786 ACCESSION 170786 1 mreqymrtge gfllvfsvtd rgsfeeiykf qrqilrvkdr defpmilign kadldhqrqv 61 tqeegqqlar qlkvtymeas akirmnvdqa fhelvrvirk fqeqecppsp eptrkekdkk 121 gchcvif TSP1 Official Symbol: THBSf Official Name: thrombospondin 1 Gene ID: 7057 Organism: Homo sapiens Other Aliases: THBS, , TSP, TSP-1, TSP1 Other Designations: thrombospondin-1; thrombospondin-1p180 Nucleotide seguence: NCBI Reference Seguence: 246.2 LOCUS NM_003246 ACCESSION 246 1 agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 61 tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 121 ctcccggccg ccgcgctccg acag gatccctgct caac agctccacca 181 tggggctggc ctggggacta ggcgtcctgt tcctgatgca tgtgtgtggc accaaccgca 241 ttccagagtc tggcggagac aacagcgtgt ttgacatctt tgaactcacc ggggCCgCCC 301 gcaaggggtc tgggcgccga ctggtgaagg gccccgaccc ttccagccca gctttccgca 247247 36" tcgaggatgc caacctgatc ccccctgtgc ctgatgacaa gttccaagac ctggtggatg 42" ctgtgcgggc agaaaagggt ttcctccttc tggcatccct gaggcagatg aagaagaccc 48" ggggcacgct gctggccctg gagcggaaag accactctgg ccaggtcttc agcgtggtgt 54" ccaatggcaa cacc ctggacctca gcctgaccgt ccaaggaaag cagcacgtgg 60" tgtctgtgga agaagctctc ctggcaaccg gccagtggaa gagcatcacc ctgtttgtgc 66" aggaagacag ggcccagctg tacatcgact gtgaaaagat tgct gagttggacg 72" tccccatcca aagcgtcttc accagagacc tggccagcat cgccagactc cgcatcgcaa 78" aggggggcgt caatgacaat gggg tgctgcagaa tgtgaggttt gtctttggaa 84" ccacaccaga agacatcctc aggaacaaag gctgctccag ctctaccagt ctca 90" cccttgacaa caacgtggtg aatggttcca gccctgccat ccgcactaac tacattggcc 96" acaagacaaa ggacttgcaa gccatctgcg gcatctcctg tgatgagctg tccagcatgg "O2" tcctggaact caggggcctg cgcaccattg tgaccacgct gcaggacagc atccgcaaag "08" tgactgaaga gaacaaagag ttggccaatg agctgaggcg gcctccccta tgctatcaca "l4" acggagttca gtacagaaat aacgaggaat ggactgttga tagctgcact gagtgtcact "20" gtcagaactc catc tgcaaaaagg tgtcctgccc catcatgccc tgctccaatg "26" ttcc tgatggagaa tgctgtcctc gctgttggcc cagcgactct gcggacgatg "32" gctggtctcc atggtccgag tggacctcct gttctacgag ctgtggcaat ggaattcagc "38" agcgcggccg ctcctgcgat agcctcaaca accgatgtga gggctcctcg gtccagacac "44" gcca cattcaggag tgtgacaaga gatttaaaca ggatggtggc cact "50" ggtccccgtg gtcatcttgt tctgtgacat gtggtgatgg tgtgatcaca aggatccggc "56" tctgcaactc tcccagcccc cagatgaacg ggaaaccctg tgaaggcgaa gcgcgggaga "62" ccaaagcctg caagaaagac gcctgcccca tcaatggagg ctggggtcct tggtcaccat "68" gggacatctg ttctgtcacc tgtggaggag gggtacagaa acgtagtcgt ctctgcaaca "74" accccacacc tgga ggcaaggact gcgttggtga tgtaacagaa aaccagatct "80" agca ggactgtcca attgatggat gcctgtccaa cttt gccggcgtga "86" agtgtactag ctaccctgat ggcagctgga gtgc ttgtccccct ggttacagtg "92" gaaatggcat ccagtgcaca gatgttgatg agtgcaaaga agtgcctgat gcctgcttca "98" accacaatgg agagcaccgg tgtgagaaca cggaccccgg ctacaactgc ctgccctgcc 204" ccccacgctt caccggctca cagcccttcg gccagggtgt cgaacatgcc acggccaaca 210" aacaggtgtg ccgt tgca ggac ccacgactgc aacaagaacg 248248 216" ccaagtgcaa ctacctgggc cactatagcg accccatgta ccgctgcgag tgcaagcctg 222" gctacgctgg caatggcatc atctgcgggg caga cctggatggc tggcccaatg 228" agaacctggt gtgcgtggcc aatgcgactt accactgcaa taat tgccccaacc 234" ttcccaactc agggcaggaa gactatgaca aggatggaat tggtgatgcc tgtgatgatg 240" acgatgacaa tgataaaatt ccagatgaca gggacaactg tccattccat tacaacccag 246" ctcagtatga caga gatgatgtgg gagaccgctg ctgt ccctacaacc 252" acaacccaga tcaggcagac acagacaaca atggggaagg agacgcctgt gaca 258" ttgatggaga cggtatcctc aatgaacggg acaactgcca gtacgtctac aatgtggacc 264" agagagacac tgatatggat ggggttggag atcagtgtga caattgcccc ttggaacaca 270" atccggatca gctggactct gactcagacc gcattggaga tacctgtgac aacaatcagg 276" atattgatga agatggccac aatc tggacaactg tccctatgtg cccaatgcca 282" accaggctga ccatgacaaa aagg gagatgcctg tgaccacgat gatgacaacg 288" atggcattcc tgatgacaag gacaactgca gactcgtgcc caatcccgac cagaaggact 294" ctgacggcga tggtcgaggt tgca aagatgattt tgac agtgtgccag 300" acatcgatga catctgtcct gttg acatcagtga gaccgatttc cgccgattcc 306" agatgattcc tctggacccc aaagggacat cccaaaatga ctgg gttgtacgcc 312" atcagggtaa agaactcgtc cagactgtca actgtgatcc tggactcgct gtaggttatg 318" atgagtttaa tgctgtggac ttcagtggca ccttcttcat caacaccgaa agggacgatg 324" actatgctgg atttgtcttt ggctaccagt ccagcagccg cttttatgtt gtgatgtgga 330" agcaagtcac ctac tgggacacca accccacgag ggctcaggga tactcgggcc 336" tttctgtgaa aaac tccaccacag ggcctggcga gcacctgcgg aacgccctgt 342" ggcacacagg aaacacccct ggccaggtgc tgtg gcatgaccct cgtcacatag 348" gctggaaaga tttcaccgcc tacagatggc gtctcagcca caggccaaag acgggtttca 354" ttagagtggt gatgtatgaa gggaagaaaa tcatggctga ctcaggaccc atctatgata 360" atgc tggtggtaga ctagggttgt ttgtcttctc tcaagaaatg gtgttcttct 366" ctgacctgaa atacgaatgt agagatccct aatcatcaaa ttgttgattg aaagactgat 372" cataaaccaa tgctggtatt gcaccttctg gaactatggg cttgagaaaa cccccaggat 378" tcct cctt cttttctgtg cttgcatcag tgtggactcc tagaacgtgc 384" cctc aagaaaatgc agttttcaaa aacagactca gcattcagcc tccaatgaat 390" aagacatctt ccaagcatat aaacaattgc tttggtttcc ttttgaaaaa gcatctactt 249249 396" gcttcagttg ggaaggtgcc cattccactc tgcctttgtc acagagcagg gtgctattgt 102" gaggccatct ctgagcagtg gactcaaaag cattttcagg catgtcagag ggac 108" gaat tagcaaacaa aaccaccctg acatcctcct tcaggaacac ggggagcaga 114" ggccaaagca ctaaggggag ggcgcatacc cgagacgatt gtatgaagaa gagg 120" aactgttaca tgttcggtac taagtcattt tcaggggatt ctat tgctggattt 126" catgatgctg actggcgtta gctgattaac ccatgtaaat aggcacttaa atagaagcag 132" gaga caaagactgg cttctggact tcctccctga tccccaccct tactcatcac 138" ctgcagtggc cagaattagg gaatcagaat caaaccagtg taaggcagtg ctggctgcca Z44" ttgcctggtc acattgaaat tggtggcttc attctagatg tagcttgtgc agatgtagca 150" ggaaaatagg aaaacctacc atctcagtga gcaccagctg cctcccaaag gaggggcagc 156" cgtgcttata tttttatggt ggca caaaattatt ctaa ctaaaacatt Z62" ccttttctct tttttcctga atgg agttttctaa cttt tggaatgtag Z68" atttttttta aatgctttac gatgtaaaat atttattttt tacttattct tctg Z74" ggat tattcatgga acaggaagaa agac tatccatgtc atctttgttg 180" agagtcttcg tgactgtaag attgtaaata cagattattt attaactctg ttctgcctgg 186" aggc ttcatacgga aagtgtttga gtag ttgacattta tcagcaaatc Z92" tcttgcaaga caag gaaaatcagt ctaataagct gctctgcccc ttgtgctcag Z98" agtggatgtt atgggattct ttttttctct gttttatctt ttcaagtgga attagttggt 504" tttg caaatgtttt aaattgcaaa gaaagccatg aggtcttcaa tactgtttta 510" ccccatccct tgtgcatatt tccagggaga aggaaagcat atacactttt ttctttcatt 516" tttccaaaag agaaaaaaat gacaaaaggt gaaacttaca tacaaatatt acctcatttg 522" ttgtgtgact gagtaaagaa tttttggatc aagcggaaag agtttaagtg tctaacaaac 528" ttaaagctac tgtagtacct aaaaagtcag tgttgtacat agcataaaaa ctctgcagag 534" aagtattccc aataaggaaa tagcattgaa atgt:aaata caa:ttctga aagttatgtt 540" ttttttctat catctggtat accattgctt tatt:ttata aat:attttc tcattgccat 546" tggaatagat atctcagatt gtgtagatat gcta:ttaaa taa:ttatca ggaaatactg 552" cctgtagagt tagtatttct atttttatat aatg:ttgca cac:gaattg aagaattgtt 558" ggttttttct tgtt ttgttttttt tttt :ttttt ttt :gctttt gacctcccat 564" ttttactatt tgccaatacc tttttctagg aatg:gcttt ttt:tgtaca catttttatc 570" cattttacat tctaaagcag tgtaagttgt atat:actgt ttc:tatgta caaggaacaa 250250 5761 caataaatca tatgqaaatt tatatttata aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_003237.2 LOCUS NP_003237 ACCESSION N P_003237 l mglawglgvl flmhvcgtnr ipesggdnsv fdifeltgaa rkgsgrrlvk gpdpsspafr 61 iedanlippv pddqudlvd avraekgfll lasqumkkt rgtllalerk dhsgqvfsvv 121 sngkagtldl sltvqgkqhv vsveeallat lfv qedraquid cekmenaeld 181 vpiqsvftrd lasiarlria kggvndnfqg vlqnvrfvfg ttpedilrnk gcssstsvll 241 tldnnvvngs nyig hktkdlqaic giscdelssm vlelrglrti vtthdsirk 301 vteenkelan elrrpplcyh nquyrnnee wtvdsctech cqnsvtickk vscpimpcsn 361 atvpdgeccp rcwpsdsadd gwspwsewts ngiq qrgrscdsln nrcegsqut 421 rtchiqecdk rfknggwsh wspwsscsvt cgdgvitrir lcnspqumn gkpcegeare 481 tkackkdacp PWSP wdicsvtcgg qukrsrlcn nptquggkd cvgdvtenqi 541 cnkqdcpidg clsnpcfagv kctsypdgsw kcgacppqys gngiqctdvd eckevpdacf 601 nhngehrcen tdpgynclpc pprftgsqpf gqgvehatan kqvckprnpc tdgthdcnkn 661 akcnylghys dpmyrceckp qyagngiicq edtdldgwpn enlvcvanat yhckkdncpn 721 1pnsgqedyd kdgigdacdd dddndkipdd rdncpfhynp rddv gdrcdncpyn 781 dtdn ngeqdacaad idgdgilner dncquynvd qrdtdmdgvg dchncpleh 841 npdqldsdsd dnnq didedghqnn ldncpyvpna nqadhdkdgk dddn 901 dgipddkdnc rlvpnpqud sdgdgrgdac kddfdhdsvp diddicpenv frrf 961 qmipldpkgt wvvr hqgkelvqtv ncdpglavgy defnavdfsg tffinterdd 1021 fqu sssrfyvvmw kqvtqsywdt nptraqusg lsvkvvnstt rnal 1081 whtgntpgqv rtlwhdprhi gwkdftayrw rlshrpktgf irvvmyegkk imadsgpiyd 1141 ktyaggrlgl fvfsqemvff sdlkyecrdp EDIL3 Official Symbol: EDIL3 gtigial Name: EGF-Iike s and discoidin I-Iike domains 3 251251 Gene ID: 10085 Organism: Homo sapiens Other Aliases: DEL1 Other Designations: ke repeat and discoidin l-like domain-containing protein 3; developmental endothelial locus-1; developmentally-regulated endothelial cell locus 1 n; integrin-binding protein DEL1 Nucleotide seguence: NCBI Reference Seguence: NM_005711.3 LOCUS NM_00571 1 ACCESSION NM_005711 1 agaagccccg cagccgccgc aaca gcgacagccg agcgcccggt ccgcctgtct 6" gccggtgggt ctgcctgccc gcgcagcaga CCngggng ccgcgggagc ccgcgccccg 12" cccgccgcgc ctctgccggg acccacccgc agcggagggc tgagcccgcc ggcggctccc 18" cggagctcac ccacctccgc gcgccggagc gcaggcaaaa ggggaggaaa ggctcctctc 24" tttagtcacc actctcgccc tctccaagaa tttgtttaac aaagcgctga ggaaagagaa " cgtcttcttg aattctttag cgga gtctgctgct gccctgcgct gccacctcgg 36" ctacactgcc ctccgcgacg gacc agccggggtc acgtccggga gacgggatca 42" tgaagcgctc ggtagccgtc tggctcttgg tcgggctcag cctcggtgtc ccccagttcg 48" gcaaaggtga tatttgtgat cccaatccat gtgaaaatgg aggtatctgt ttgccaggat 54" tggctgatgg ttccttttcc tgtgagtgtc cagatggctt cacagacccc aactgttcta 60" gtgttgtgga ggttgcatca gatgaagaag aaccaacttc agcaggtccc tgcactccta 66" atccatgcca taatggagga gaaa taagtgaagc ataccgaggg gatacattca 72" taggctatgt ttgtaaatgt ccccgaggat ttaatgggat tcactgtcag cacaacataa 78" atgaatgcga agttgagcct tgcaaaaatg gtggaatatg tacagatctt gttgctaact 84" attcctgtga gtgcccaggc atgg gaagaaattg tcaatacaaa tgctcaggcc 90" cactgggaat tgaaggtgga tcaa aaat cacagcttcc tctactcacc 96" gagctctttt tggactccaa aaatggtatc atgc acgtcttaat aagaaggggc 102" ttataaatgc gtggacagct aatg acagatggcc gtggattcag ataaatttgc 108" aaaggaaaat gagagttact ggtgtgatta cccaaggagc caagaggatt ggaagcccag 114" agtatataaa atcctacaaa attgcctaca atgg ttgg gcaatgtaca 252252 "20" aagtgaaagg caccaatgaa gacatggtgt ttcgtggaaa taac aacactccat "26" actc tttcacaccc cccataaaag ctcagtatgt aagactctat ccccaagttt "32" gtcgaagaca ttgcactttg cgaatggaac ttcttggctg tgaactgtcg ggttgttctg "38" tggg tatgaaatca ggacatatac aagactatca gatcactgcc tccagcatct "44" tcagaacgct caacatggac atgttcactt gggaaccaag gaaagctcgg ctggacaagc "50" aaggcaaagt gaatgcctgg ggcc acaatgacca gtcacaatgg ttacaggtgg "56" atcttcttgt tccaaccaaa gtgactggca tcattacaca taaa gattttggtc "62" atgtacagtt tgttggctcc tacaaactgg cttacagcaa tgatggagaa cactggactg "68" tataccagga tgaaaagcaa agaaaagata aggttttcca gggaaatttt gacaatgaca "74" ctcacagaaa aaatgtcatc gaccctccca tctatgcacg acacataaga atccttcctt "80" ggtcctggta gatc acattgcggt tgct gggctgcaca gaggaggaat "86" gaggggaggc tacatttcac aaccctcttc cctatttccc taaaagtatc tccatggaat "92" gaactgtgca aaatctgtag gaaactgaat ggtttttttt tttttttcat gaaaaagtgc "98" tcaaattatg gtaggcaact gttt ttaagggggt ctaagcctgc cttttcaatg 204" atttaatttg attttatttt caaa tctcttaagt aacaacacat taagtgtgaa 210" ttacttttct gttt ttat tcgcattggt agaaatatat tagggaaaga 216" aagtagcctt ctttttatag caagagtaaa aaagtctcaa caaa taagagcaag 222" agttgataga gcttttacaa tcaatactca cctaattctg ataaaaggaa tactgcaatg 228" ttagcaataa gtttttttct tctgtaatga ctctacgtta ttcc ctgtgcctac 234" caaacactgt ttat tacaaaattt taaagaagaa tatgtaacat gcagtactga 240" tattataatt ctcattttac tttcattatt tctaataaga gattatgtga cttctttttc 246" ttttagttct attctacatt cttaatattg tatattacct gaataattca atttttttct 252" aattgaattt cctattagtt gactaaaaga agtgtcatgt ttactcatat atgtagaaca 258" tgactgccta tcagtagatt gatctgtatt taatattcgt taattaaatc tgcagtttta 264" tttttgaagg aagccataac tatttaattt ccaaataatt gcttcataaa gaatcccata 270" ctctcagttt gcacaaaaga acaaaaaata tatatgtctc tttaaattta aatcttcatt 276" tagatggtaa ttacatatcc ttac tttaaaaaat cggcttattt gtttatttta 282" taaaaaattt agcaaagaaa tattaatata gtgctgcata gtttggccaa gcatactcat 288" catttctttg ttcagctcca catttcctgt gaaactaaca tcttattgag atttgaaact 2941 ggtggtagtt tcccaggaag gcacaggtgg agtt Protein seguence: 253253 NCBI Reference Seguence: NP_005702.3 LOCUS NP_005702 ACCESSION NP_005702 1 vwll vglslgquf gkgdicdpnp cenggiclpg ladgsfscec pdgftdpncs 6" svvevasdee eptsagpctp npchnggtce iseayrgdtf igyvckcprg fngihcqhni 12" necevepckn lvan yscecpgefm grncqykcsg plgieggiis nqqitassth 18" ralfglqkwy pyyarlnkkg linawtaaen drwpwiqinl gvi tqgakrigsp 24" eyiksykiay wamy kvkgtnedmv frgnidnntp yansftppik " crrhctlrme llgcelsgcs eplgmksghi qdyqitassi frtlnmdmft weprkarldk 36" qgkvnawtsg wlqv dllvptkvtg iitqgakdfg thfvgsykl aysndgehwt 42" qudequkd kqugnfdnd thrknvidpp iyarhirilp wswygritlr sellgcteee HMOX1 al Sym bol: HMOX1 Official Name: heme oxygenase (decycling) 1 Gene ID: 3162 Organism: Homo s Other Aliases: CTA-28GB10.6, HO-1, HSP32, bK28GB10 Other Designations: heat shock protein, 32—kD; heme oxygenase 1 Nucleotide seguence: NCBI Reference Seguence: 133.2 LOCUS 133 ACCESSION NM_002133 1 aaatgtgacc ggccgcggct ccggcagtca acgcctgcct cctctcgagc gtcctcagcg 61 cagccgccgc ccgcggagcc agcacgaacg agcccagcac ngccggatg gagcgtccgc 121 aacccgacag catgccccag gatttgtcag aggccctgaa ggaggccacc aaggaggtgc 181 acacccaggc agagaatgct gagttcatga ggaactttca gaagggccag gtgacccgag 241 acggcttcaa gctggtgatg gcctccctgt accacatcta tgtggccctg gaggaggaga 301 gcaa caaggagagc ccagtcttcg cccctgtcta cttcccagaa gagctgcacc 254254 36" gcaaggctgc cctggagcag gacctggcct tctggtacgg gccccgctgg caggaggtca 42" tcccctacac accagccatg cagcgctatg tgaagcggct ccacgaggtg gggcgcacag 48" agcccgagct gctggtggcc cacgcctaca cccgctacct gggtgacctg tctgggggcc 54" tcaa aaagattgcc cagaaagccc tggacctgcc cagctctggc gagggcctgg 60" ccttcttcac cttccccaac attgccagtg ccaccaagtt caagcagctc taccgctccc 66" gcatgaactc cctggagatg actcccgcag tcaggcagag ggtgatagaa gaggccaaga 72" ctgcgttcct gctcaacatc cagctctttg tgca ggagctgctg acccatgaca 78" ccaaggacca gagcccctca cgggcaccag gcca gcgggccagc aacaaagtgc 84" aagattctgc ggag actcccagag ggaagccccc actcaacacc cgctcccagg 90" ctccgcttct ccgatgggtc cttacactca gctttctggt ggcgacagtt gctgtagggc 96" tttatgccat gtgaatgcag gcatgctggc tcccagggcc atgaactttg tccggtggaa "O2" ggccttcttt ctagagaggg aattctcttg gctggcttcc ttaccgtggg cactgaaggc "08" ggcc tccagccctc tgtc cctctctctg gaaaggagga aggagcctat "14" ttcc ccaacgaaaa gcacatccag gcct aaacttcaga gggggcgaag "20" ggatcagccc tgcccttcag catcctcagt agca gagcctggaa ctaa "26" tgtggcagct gtctcaaacc tccaaaagcc ctgagtttca agtatccttg ttgacacggc "32" catgaccact ttccccgtgg gccatggcaa tttttacaca aacctgaaaa gtgt "38" tttt tgtcttattt ttgttggagc gttc ctggctcagc ctcaaatgca "44" gtatttttgt tgtgttctgt tgtttttata gcagggttgg ggtggttttt gagccatgcg "50" tgggtgggga gggaggtgtt taacggcact gtggccttgg tctaactttt gtgtgaaata 1561 ataaacaaca gata gtagcttgaa aaaaaaaaaa aaaaaa Protein ce: NCBI Reference Seguence: NP_002124.1 LOCUS NP_002124 ACCESSION NP_002124 1 merpqusmp qdlsealkea tkevhtqaen qug qvtrdgfklv maslyhiyva 61 leeeiernke spvfapvyfp eelhrkaale qdlafwygpr wqevipytpa mqryvkrlhe 121 vgrtepellv ahaytrylgd lsggqvlkki aqkaldlpss geglafftfp niasatkfkq 181 lyrsrmnsle mtpavrqrvi eeaktaflln iqlfeelqel lthdtkdqsp srapgqura 241 snkqusapv etprgkppln trsqapllrw vltlsflvat vavglyam 255255 NUCB1 Official Symbol: NUCBf Official Name: nucleobindin 1 Gene ID: 4924 Organism: Homo sapiens Other Aliases: CALNUC, NUC Other Designations: nucleobindin-1 Nucleotide seguence: NCBI nce Seguence: 184.5 LOCUS NM_006184 ACCESSION NM_006184 l gcggaagtta tttttccccc ggccggcagg gagttgtagt tatctttgaa agccttctct 6" ctcttttggc ataggcggga aatgtggcgt caagaggctg ggattccgag aaagaagcga 12" ggctttcacg aaaatgcgag cggcttcggc aaggggcggg gccgctggac gtggatgaaa 18" gctacagagc cagcgtggac caatcagacc tctttggggc ggggcctctg tggataagtg 24" ggcgtggtct aggggggaag tcaccaagaa cgcaccggag gtccttgccc gaaa " acgccctctg agga gagaccacac tgccatgcct ccctctgggc cccgaggaac 36" cctccttctg ttgccgctgc tgctgctgct cctgcttcgc ctgg ctgtccccct 42" ggagcgaggg gcgcccaaca aggaggagac ccctgcgact gagagtcccg acacaggcct 48" gtactaccac cggtacctcc aggaggtcat cgatgtactg gagacggatg ggcatttccg 54" agagaagctg caggctgcca atgcggagga catcaagagc gggaagctga gccgagagct 60" ggactttgtc agccaccacg tccgcaccaa tgag ctcaagcgac aggaggtgtc 66" acggctgcgg atgctgctca aggccaagat ggacgccgag caggatccca atgtacaggt 72" ggatcatctg aatctcctga ttga acacctggac cctcagaacc agcatacatt 78" cgaggcccgc gacctggagc tgctgatcca gacggccacc cgggaccttg cccagtacga 84" cgcagcccat catgaagagt tcaagcgcta gctt aaggaacacg agagacggcg 90" ttatctggag ggag aggagcagag aaaggaggcg gagaggaagc tggaagagca 96" acagcgccgg caccgcgagc aagt caacgtgcct caag cccagttgaa 256256 "02" ggaggtgtgg gaggagctgg atggactgga ccccaacagg tttaacccca agaccttctt "08" catactgcat aaca gtgatggtgt cctggatgag caggagctgg aggcactctt "l4" caccaaggag ctggagaaag tgtacgaccc aaagaatgag gaggacgaca tgcgggagat "20" ggaggaggag cgactgcgca tgcgggagca tgtgatgaag aatgtggaca ccaaccagga "26" cgtg accctggagg agttcctcgc tcag aggaaggagt ttggggacac "32" cggggagggc tgggagacag tggagatgca ccctgcctac accgaggaag agctgaggcg "38" ctttgaagag gagctggctg cccgggaggc agagctgaat gccaaggccc agcgcctcag "44" ccaggagaca gaggctctag ccca gggccgcctg gaggcccaga agagagagct "50" gcagcaggct gtgctgcaca tggagcagcg gaagcagcag cagcagcagc agcaaggcca "56" caaggccccg cacc ctgaggggca gctcaagttc gaca atgt "62" acctgtccca gctccagccg gtgaccagaa ggaggtggac acttcagaaa agaaacttct "68" cgagcggctc gttg aggtgcccca gcatctgtga tcctccggga ccccagccct "74" tcct gatgctccaa ggcgactgat gggcgctgga tgaagtggca cagtcagctt "80" ccctgggggc tggtgtcatg ttgggctcct ggggcggggg cacggcctgg catttcacgc "86" gcca ccccaggtcc acctgtctcc acag cctccaagtc tgtggctctt "92" cccttctgtc ctccgagggg cttgccttct ctcgtgtcca gtgaggtgct cagtgatcgg "98" cttaacttag agaagcccgc cccctcccct tctccgtctg tcccaagagg tctg 204" agcctgcgtt cctaggtggc tcag ctgcctgggt tgtggccgcc ctagcatcct 210" gtatgcccac agctactgga atccccgctg ctgctccggg ccaagcttct ggttgattaa 216" tgagggcatg gggtggtccc tcaagacctt cccctacctt ttgtggaacc agtgatgcct 222" cagt gtcccctcca cagctgggtg ccaggggcag gggatcctca gtatagccgg 228" tgaaccctga taccaggagc ctgggcctcc ctgaacccct ggcttccagc catctcatcg 234" ccagcctcct cctggacctc ttggccccca gccccttccc gccc cagaagggtc 240" ccagagctga ccccactcca ggacctaggc ccagcccctc agcctcatct ggagcccctg 246" aagaccagtc ccacccacct ttctggcctc atctgacact gctccgcatc ctgctgtgtg 252" tcctgttcca tgttccggtt ccatccaaat acactttctg gaacaaatgc atggctccaa 2581 aaaaa Protein ce: NCBI Reference Seguence: NP_006175.2 LOCUS NP_006175 257257 ACCESSION NP_006175 1 mppsgprgtl lllpllllll lravlavple rgapnkeetp atespdtgly yhrquevid 6" vletdghfre quaanaedi ksgklsreld fvshhvrtkl delquevsr lrmllkakmd 12" aeqdpnqud hlnllkqfeh ldpqnqhtfe ardlelliqt atrdlaqyda ahheefkrye 18" mlkeherrry leslgeequ eaerkleeqq rrhrehpkvn vpgsqaqlke vweeldgldp 24" nrfnpktffi lhdinsdgvl deqelealft kelekvydpk needdmreme eerlrmrehv " mknvdtnqdr lvtleeflas tquefgdtg egwetvemhp ayteeelrrf reae 36" lnakaqusq etealgrsqg relq qavlhmequ qqqqqqqghk apaahpegql 42" kfhpdtddvp vpapagque klle rlpevequh 1 C3010 Official Symbol: C190rf10 Official Name: chromosome 19 open reading frame 10 Gene ID: 56005 sm: Homo s Other s: EUROIMAGE1875335, |L25, |L27, |L27W, R33729_1, SF20 Other Designations: UPF0556 protein CfQorffO; interleukin 25; interleukin 27 working designation; interleukin-25; stromal cell-derived growth factor SF20 Nucleotide seguence: NCBI Reference Seguence: 107.3 LOCUS NM_019107 ACCESSION NM_019107 1 ggcggacgct ccacgtgtcc ctcgccgcgc cccgtctacc cgcccctgcc ctgaggaccc 61 tagtccaaca tggcggcgcc cagcggaggg tggaacggcg tcggcgcgag cttgtgggcc 121 gcgctgctcc taggggccgt ggcgctgagg ccggcggagg cggtgtccga gcccacgacg 181 gtggcgtttg acgtgcggcc cgtc gtgcattcct tctcccataa cgtgggcccg 241 ggggacaaat atacgtgtat gttcacttac gcctctcaag gagggaccaa tgagcaatgg 301 cagatgagtc tggggaccag cgaagaccac cagcacttca ccat ctggaggccc 361 aagt cctatctgta cttcacacag gcag aggtgcgggg cgctgagatt 258258 421 gagtacgcca tggcctactc taaagccgca tttgaaaggq aaagtgatgt ccctctgaaa 481 actgaggaat ttgaagtgac agca gtggctcaca ggcccggggc attcaaagct 541 gagctgtcca agctggtgat tgtggccaag cgca tgtg accagcagcc 601 cggg tggcaccttc tcatctccgg tgaagctgaa ggggcctgtg tccctgaaag 661 ggccagcaca tcactggttt tctaggaggg actcttaagt tttctacctg ggctgacgtt 721 gccttgtccg gaggggcttg cagggtggct gaagccctgg ggcagagaac agagggtcca 781 tcct ggctcccaac agcttctcag ttcccacttc ctgctgagct cttctggact 841 cgca gatccggggc acaaagaggq tggggaacat gggggctatg aaag 901 cagccatgct ccccccgacc tccagccgag catccttcat gagcctgcag aactgctttc 961 ctatgtttac ccaggggacc tcctttcaga tgaactggga agagatgaaa tgttttttca 1021 tatttaaata aataagaaca ttaaaaagca aaaaaaaaaa aaaaaaa Protein seguence: NCBI Reference Seguence: NP_061980.1 LOCUS NP_061980 ACCESSION NP_061980 1 maapsggwng vgaslwaall lgavalrpae avsepttvaf dvrpggvvhs fshnvgpgdk 61 ytcmftyasq ggtneqwqms lgtsedhth pqgk sylyftqfka evrgaeieya 121 mayskaafer esdvplktee fevtktavah rpgafkaels klvivakasr tel Offigigl Symml: PLIN2 Official Name: perilipin 2 w: 123 Organism: Homo s Other Aliases: RP11-151J10.1, ADFP, ADRP Other Designations: adipophilin; adipose differentiation-related protein; pin- Nucleotide sguence: NCBI Reference Seguence: NM_001122.3 LOCUS NM_001122 259259 ACCESSION NM_001122 l gtga cactcgggct tgggacaggg cgtgctgccg cgggtcacgt gagg 6" cttggggagg ggcggcgagg cggggtttat agcccgggcg cccgcgggcc ccacgctttg l2" accgggtcgt ggcagccgga gtcgtcttcg ggacgcgcct gctcttcgcc tttcgctgca 18" gtccgtcgat ttctttctcc aggaagaaaa atggcatccg ttgcagttga tccacaaccg 24" agtgtggtga ctcgggtggt caacctgccc ttggtgagct ccacgtatga cctcatgtcc " tcagcctatc tcagtacaaa ggaccagtat ccctacctga agtctgtgtg tgagatggca 36" gagaacggtg tgaagaccat cacctccgtg gccatgacca gtgctctgcc catcatccag 42" aagctagagc cgcaaattgc agttgccaat acctatgcct gtaaggggct agacaggatt 48" gaggagagac tgcctattct gcca tcaactcaga ttgttgccaa tgccaaaggc 54" gctgtgactg gggcaaaaga tgctgtgacg actactgtga ctggggccaa ggattctgtg 60" gccagcacga tcacaggggt gatggacaag accaaagggg cagtgactgg cagtgtggag 66" aagaccaagt ctgtggtcag catt aacacagtct tggggagtcg gatgatgcag 72" ctcgtgagca taga aaatgcactc accaaatcag agctgttggt gtac 78" ctccctctca ctgaggaaga actagaaaaa gaagcaaaaa aagttgaagg atttgatctg 84" gttcagaagc caagttatta tgttagactg ggatccctgt agct tcactcccgt 90" gcctaccagc aggctctcag cagggttaaa aagc aaaaaagcca acagaccatt 96" ctcc attctactgt tcacctgatt gaatttgcca ggaagaatgt gtatagtgcc "02" aaaa ttcaggatgc tcaggataag ctct catgggtaga gtggaaaagg "08" agcattggat atgatgatac tgatgagtcc cactgtgctg agcacattga gtcacgtact "l4" cttgcaattg cccgcaacct gcag ctccagacca cgtgccacac cctcctgtcc "20" aacatccaag gtgtaccaca gaacatccaa gatcaagcca agcacatggg ggtgatggca "26" ggcgacatct actcagtgtt ccgcaatgct gcctccttta aagaagtgtc tgacagcctc "32" ctcacttcta gcaaggggca gctgcagaaa atgaaggaat ctttagatga ggat "38" tatcttgtta acaacacgcc ctgg ctggtaggtc ccttttatcc tcagctgact "44" gagtctcaga atgctcagga ccaaggtgca gaca agagcagcca ggagacccag "50" cgatctgagc ataaaactca ttaaacctgc ccctatcact agtgcatgct gtggccagac "56" agatgacacc ttttgttatg ttgaaattaa cttgctaggc aaccctaaat tgggaagcaa "62" gtagctagta taaaggccct caattgtagt tgtttccagc tgaattaaga gctttaaagt "68" ttctggcatt agcagatgat tcac ctggtaagaa aagaatgata ggcttgtcag 260260 1741 agcctatagc cagaactcag aaaaaattca aatgcactta tgttctcatt ctatggccat 1801 gcct ctgttactgt tgaa taaaaacatc ttcatgtggg ctggggtaga 1861 aactggtgtc tgctctggtg tgatctgaaa aggcgtcttc actgctttat ctcatgatgc 1921 ttgcttgtaa aacttgattt tagtttttca tttctcaaat aggaatacta cctttgaatt 1981 caataaaatt cactgcagga tagaccagtt aaaaaaaaaa aaaaaaaaa Protein seguence: NCBI nce Seguence: NP_001113.2 LOCUS NP_001113 ION NP_001113.2 1 masvavdpqp svvtrvvnlp lvsstydlms saylstkdqy pylksvcema engvktitsv 6" amtsalpiiq avan tyackgldri eerlpilnqp stqivanakg avtgakdavt 12" ttvtgakdsv astitgvmdk tkgavtgsve ktksvvsgsi ntvlgsrmmq lvssgvenal 18" tksellveqy lplteeelek eakkvegfdl vqkpsyyvrl gslstklhsr ayqqalsrvk 24" eaqusqqti sqlhstvhli efarknvysa nqkiqdaqdk lylswvewkr sigyddtdes " hcaehiesrt laiarnltqq lqttchtlls ningpqniq dqakhmgvma frna 36" sdsl ltsskgqlqk mkeslddvmd ylvnntplnw lvgpfypqlt esqnaqdqga 42" qetq rsehkth ATP5A Official Symbol: ATP5A1 al Name: ATP synthase, H+ transporting, ondrial F1 complex, alpha subunit 1, cardiac muscle Gene ID: 498 Organism: Homo sapiens Other Aliases: ATP5A, ATP5AL2, ATPM, MOM2, OMR, ORM, hATP1 Other Designations: ATP synthase alpha chain, mitochondrial; ATP synthase subunit alpha, mitochondrial; ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit, isoform 1, cardiac muscle; ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit, isoform 2, non-cardiac muscle-like 2; ATP sythase (F1-ATPase) alpha subunit; mitochondrial ATP synthetase, oligomycin-resistant 261261 Nucleotide seguence (variant 1): NCBI Reference ce: NM_001001937.1 LOCUS NM_001001937 ACCESSION NM_001001937 l tctggcattg caagcctcgc ttcgttgcca cttcccagct cttcccgcct tccgcggtat 6" aatcaacact acgagagata gagccgccta gaaccagtcc ggaggctgcg gctgcagaag 12" taccgcctgc ggagtaactg caaagatgct gcgc gttgctgcgg ccgtggtccg 18" cgcccttcct cggcgggccg gactggtctc cagaaatgct tcat ctttcattgc 24" tgcaaggaac ttccatgcct ctaacactca tcttcaaaag actgggactg ctgagatgtc " ctctattctt gaagagcgta ttcttggagc tgatacctct gttgatcttg aagaaactgg 36" gcgtgtctta agtattggtg ttgc ccgcgtacat agga atgttcaagc 42" agaagaaatg gtagagtttt cttcaggctt aaagggtatg tccttgaact tggaacctga 48" caatgttggt gttgtcgtgt ttggaaatga taaactaatt aaggaaggag atatagtgaa 54" gaggacagga gccattgtgg acgttccagt tggtgaggag ctgttgggtc gtgtagttga 60" tgcccttggt aatgctattg atggaaaggg tccaattggt tccaagacgc gtaggcgagt 66" tggtctgaaa gcccccggta tcattcctcg aatttcagtg cgggaaccaa tgcagactgg 72" cattaaggct gtggatagct tggtgccaat tggtcgtggt cagcgtgaac tgattattgg 78" tgaccgacag actgggaaaa cctcaattgc tattgacaca atcattaacc agaaacgttt 84" caatgatgga tctgatgaaa agaagaagct gtactgtatt tatgttgcta ttggtcaaaa 90" gagatccact gttgcccagt tggtgaagag acttacagat gcagatgcca tgaagtacac 96" cattgtggtg acgg cctcggatgc tgccccactt cagtacctgg ctccttactc "02" ttcc atgggagagt attttagaga caatggcaaa catgctttga tcatctatga "08" cgacttatcc aaacaggctg ttgcttaccg tcagatgtct ctgttgctcc gccgaccccc "l4" tggtcgtgag cctg gtgatgtgtt ctacctacac ttgc tggagagagc "20" agccaaaatg aacgatgctt ttggtggtgg ctccttgact gctttgccag tcatagaaac "26" acaggctggt gatgtgtctg cttacattcc aacaaatgtc atttccatca ctgacggaca "32" gatcttcttg gaaacagaat tgttctacaa aggtatccgc atta acgttggtct "38" atct cgtgtcggat ccgctgccca aaccagggct cagg tagcaggtac "44" catgaagctg gaattggctc agtatcgtga tgct tttgcccagt ctga "50" cctcgatgct gccactcaac tgag tcgtggcgtg cgtctaactg agttgctgaa 262262 "56" gcaaggacag tattctccca tggctattga agaacaagtg gctgttatct atgcgggtgt "62" aaggggatat cttgataaac tggagcccag caagattaca aagtttgaga atgctttctt "68" gtctcatgtc gtcagccagc accaagcctt gttgggcact atcagggctg atggaaagat "74" ctcagaacaa tcagatgcaa agctgaaaga gattgtaaca aatttcttgg ctggatttga "80" agcttaaact cctgtggatt cacatcaaat accagttcag ttttgtcatt gttctagtaa "86" attagttcca tttgtaaaag ggttactctc atactcctta tgtacagaaa tcacatgaaa "92" aataaaggtt ccataatgca aaaa Protein ce (variant 1): NCBI Reference Seguence: NP_001001937.1 LOCUS NP_001001937 ACCESSION NP_001001937 1 mlsvrvaaav vralprragl gssf iaarnfhasn thlqktgtae mssileeril 6" gadtsvdlee igdg iarvhglrnv qaeemvefss glkgmslnle pdnvgvvvfg 12" egdi vkrtgaivdv pvgeellgrv vdalgnaidg ktrr rvglkapgii 18" prisvrepmq dslv pigrgqreli igdrqtgkts iaidtiinqk rfndgsdekk 24" klyciyvaig qkrstvaqlv krltdadamk ytivvsatas daaplqylap ysgcsmgeyf " rdngkhalii yddlskqava yrqmslllrr ppgreaypgd vfylhsrlle raakmndazg 36" ggsltalpvi etqagdvsay iptnvisitd gqifletelf ykgirpainv glsvsrvgsa 42" kqva gtmklelaqy revaafaqu tqql lsrgvrltel lkngyspma 48" ieeqvaviya gvrgyldkle pskitkfena flshvvsqhq allgtiradg kiseqsdakl 54" keivtnflag fea Nucleotide ce (variant 2): NCBI Reference Seguence: NM_004046.5 LOCUS NM_004046 ACCESSION NM_004046 1 ggggcagtac ttccgggtca ggtgggccgg tgac cttctttgcg ccat 61 tttgtcccag tcagtccgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 121 agatgctgtc cgtgcgcgtt gctgcggccg tggtccgcgc tcgg cgggccggac 181 tggtctccag aaatgctttg ggttcatctt tcattgctgc aaggaacttc catgcctcta 263263 24" acactcatct tcaaaagact gggactgctg agatgtcctc tattcttgaa attc " ttggagctga tacctctgtt gatcttgaag aaactgggcg tgtcttaagt attggtgatg 36" gtattgcccg tggg ctgaggaatg ttcaagcaga agaaatggta gagttttctt 42" caggcttaaa gggtatgtcc ttgaacttgg aacctgacaa tgttggtgtt gtcgtgtttg 48" gaaatgataa actaattaag gaaggagata tagtgaagag gacaggagcc gacg 54" ttccagttgg tgaggagctg ttgggtcgtg atgc ccttggtaat gctattgatg 60" gaaagggtcc aattggttcc aagacgcgta ttgg tctgaaagcc atca 66" ttcctcgaat ttcagtgcgg gaaccaatgc agactggcat taaggctgtg gatagcttgg 72" tgccaattgg tcgtggtcag cgtgaactga ttattggtga ccgacagact gggaaaacct 78" caattgctat tgacacaatc attaaccaga tcaa tgatggatct gatgaaaaga 84" agaagctgta ctgtatttat gttgctattg gtcaaaagag atccactgtt gcccagttgg 90" gact tacagatgca gatgccatga agtacaccat tgtggtgtcg gctacggcct 96" cggatgctgc cccacttcag tacctggctc ctgg ctgttccatg tatt "02" ttagagacaa tggcaaacat gctttgatca tctatgacga cttatccaaa caggctgttg "08" cttaccgtca gatgtctctg ttgctccgcc gaccccctgg tcgtgaggcc tatcctggtg "14" atgtgttcta ctcc cggttgctgg agagagcagc caaaatgaac gatgcttttg "20" gtggtggctc cttgactgct ttgccagtca tagaaacaca ggctggtgat gtgtctgctt "26" acattccaac aaatgtcatt tccatcactg agat cttcttggaa acagaattgt "32" tctacaaagg tatccgccct aacg ttggtctgtc tgtatctcgt gtcggatccg "38" ctgcccaaac cagggctatg aagcaggtag caggtaccat gaagctggaa ttggctcagt "44" atcgtgaggt tgctgctttt gcccagttcg acct cgatgctgcc actcaacaac "50" gtcg tggcgtgcgt ctaactgagt tgctgaagca aggacagtat tctcccatgg "56" ctattgaaga acaagtggct gttatctatg cgggtgtaag gggatatctt gataaactgg "62" agcccagcaa gattacaaag tttgagaatg ctttcttgtc tcatgtcgtc agccagcacc "68" aagccttgtt gggcactatc agggctgatg gaaagatctc agaacaatca gatgcaaagc "74" tgaaagagat tgtaacaaat ttcttggctg gatttgaagc ttaaactcct gtggattcac "80" atcaaatacc agttcagttt tgtcattgtt ctagtaaatt attt gtaaaagggt "86" cata ctccttatgt acagaaatca catgaaaaat aaaggttcca taatgcatag 1921 ttaaaaa Protein sequence (variant 2): 264264 NCBI Reference ce: NP_004037.1 LOCUS NP_004037 ACCESSION NP_004037 l mlsvrvaaav ragl gssf iaarnfhasn thlqktgtae mssileeril 6" gadtsvdlee tgrvlsigdg iarvhglrnv qaeemvefss lnle pdnvgvvvzg l2" ndklikegdi vkrtgaivdv pvgeellgrv vdalgnaidg kgpigsktrr rvglkapgii 18" epmq tgikavdslv pigrgqreli igdrqtgkts iaidtiinqk rfndgsdekk 24" klyciyvaig qkrstvaqlv krltdadamk ytivvsatas daaplqylap ysgcsmgeyf " rdngkhalii yddlskqava yrqmslllrr ypgd vfylhsrlle raakmndafg 36" ggsltalpvi etqagdvsay sitd gqifletelf ykgirpainv glsvsrvgsa 42" aqtramkqva gtmklelaqy revaafaqu sdldaatqql lsrgvrltel lkngYSpma 48" ieeqvaviya gvrgyldkle pskitkfena sqhq allgtiradg kiseqsdakl 54" keivtnflag fea Nucleotide seguence (variant 3): NCBI Reference ce: NM_001257334.1 LOCUS NM_001257334 ACCESSION NM_001257334 l ggggcagtac ttccgggtca ggtgggccgg tgac cttctttgcg gctcggccat 6" tttgtcccag tcagtccgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 12" agatgctgtc cgtgcgcgtt gctgcggccg tggtccgcgc ccttcctcgg cgggccggac 18" tggtctccag aaatgctttg ggttcatctt ctgc aaggaacttc catgcctcta 24" acactcatct tcaaaagact gggactgctg agatgtcctc tattcttgaa gagcgtattc " ttggagctga tacctctgtt gatcttgaag aaactgggcg tgtcttaagt attggtgatg 36" gtattgcccg cgtacatggg ctgaggaatg ttcaagcaga agaaatggta gagttttctt 42" caggcttaaa gggtatgtcc ttgaacttgg aacctgacaa tgttggtgtt gtcgtgtttg 48" ataa actaattaag gaaggagata tagtgaagag gacaggagcc attgtggacg 54" gtccaattgg ttccaagacg cgtaggcgag tgaa agcccccggt cctc 60" gaatttcagt gcgggaacca atgcagactg gcattaaggc tgtggatagc ttggtgccaa 66" ttggtcgtgg tcagcgtgaa ctgattattg gtgaccgaca gactgggaaa acctcaattg 265265 72" ctattgacac aatcattaac cagaaacgtt tcaatgatgg atctgatgaa aagaagaagc 78" tgtactgtat ttatgttgct attggtcaaa agagatccac tgttgcccag ttggtgaaga 84" gacttacaga tgcagatgcc atgaagtaca ccattgtggt tacg gcctcggatg 90" ctgccccact tcagtacctg tact ctggctgttc catgggagag agag 96" acaatggcaa acatgctttg tatg acgacttatc ggct gttgcttacc "O2" gtcagatgtc tctgttgctc cgccgacccc ctggtcgtga ggcctatcct ggtgatgtgt "08" tctacctaca ctcccggttg ctggagagag cagccaaaat gaacgatgct tttggtggtg "14" gctccttgac tgctttgcca gtcatagaaa cacaggctgg tgatgtgtct gcttacattc "20" caacaaatgt catttccatc actgacggac agatcttctt ggaaacagaa ttgttctaca "26" aaggtatccg ccctgcaatt aacgttggtc tgtctgtatc tcgtgtcgga tccgctgccc "32" aaaccagggc tatgaagcag gtagcaggta agct ggaattggct cagtatcgtg "38" aggttgctgc ttttgcccag tctg acctcgatgc tgccactcaa ttga "44" gcgt gcgtctaact gagttgctga agcaaggaca gtattctccc atggctattg "50" aagaacaagt ggctgttatc tatgcgggtg taaggggata tcttgataaa ctggagccca "56" gcaagattac aaagtttgag aatgctttct tgtctcatgt cgtcagccag caccaagcct "62" tgttgggcac ggct gatggaaaga tctcagaaca atcagatgca aagctgaaag "68" agattgtaac aaatttcttg gctggatttg aagcttaaac tcctgtggat tcacatcaaa "74" taccagttca gttttgtcat tgttctagta aattagttcc atttgtaaaa gggttactct "80" catactcctt agaa atcacatgaa aaataaaggt tccataatgc atagttaaaa "86" a Protein seguence nt 3): NCBI Reference Seguence: NP_001244263.1 LOCUS N P_001244263 ACCESSION NP_001244263 1 mlsvrvaaav ragl vsrnalgssf iaarnfhasn thlqktgtae mssileeril 61 gadtsvdlee tgrvlsigdg iarvhglrnv qaeemvefss glkgmslnle pdnvgvvvfg 121 ndklikegdi vkrtgaivdg pigsktrrrv iipr isvrepmqtg ikavdslvpi 181 grgqreliig drqtgktsia idtiinqkrf ndgsdekkkl yciyvaiqu rstvaqlvkr 241 ltdadamkyt ivvsatasda apys gcsmgeyfrd ngkhaliiyd dlskqavayr 266266 301 qmslllrrpp gdvf lera akmndafggg sltalpviet qagdvsayip 361 tnvisitdgq ifletelfyk girpainvgl svsrvgsaaq tramkqvagt mklelaqyre 421 vaafaqusd ldaatqqlls rgvrltellk ngyspmaie eqvaviyagv rgyldkleps 481 kitkfenafl shvvsqhqal lgtiradgki seqsdaklke ivtnflagfe tide sequence nt 4): NCBI Reference ce: NM_001001935.2 LOCUS NM_001001935 ACCESSION NM_001001935 l ggggcagtac ttccgggtca ggtgggccgg ctgtcttgac cttctttgcg gctcggccat 6" tttgtcccag tcagtccgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 12" agatgctgtc cgtgcgcgtt gctgcggccg gcgc tcgg cgggccggac 18" tggtctccag aaatgctttg ggttcatctt tcattgctgc aaggaacttc catgcctcta 24" acactcatct tcaaaagact ggtaagttat tatttctcag tctacgccgc acttactaga " tgaagatata atac taac tgtgggactg ctgagatgtc ctctattctt 36" gaagagcgta ttcttggagc tgatacctct gttgatcttg ctgg gcgtgtctta 42" agtattggtg atggtattgc ccgcgtacat gggctgagga atgttcaagc agaagaaatg 48" gtagagtttt cttcaggctt aaagggtatg tccttgaact tggaacctga tggt 54" gttgtcgtgt ttggaaatga taaactaatt aaggaaggag atatagtgaa gaggacagga 60" gccattgtgg acgttccagt tggtgaggag ctgttgggtc gtgtagttga tgcccttggt 66" aatgctattg atggaaaggg tccaattggt tccaagacgc gtaggcgagt tggtctgaaa 72" gcccccggta tcattcctcg aatttcagtg cgggaaccaa tgcagactgg cattaaggct 78" gtggatagct caat tggtcgtggt gaac tgattattgg tgaccgacag 84" actgggaaaa cctcaattgc tattgacaca atcattaacc agaaacgttt caatgatgga 90" tctgatgaaa agct gtactgtatt tatgttgcta ttggtcaaaa gagatccact 96" gttgcccagt tggtgaagag acttacagat gcagatgcca tgaagtacac cattgtggtg 102" tcggctacgg cctcggatgc tgccccactt cagtacctgg ctccttactc tggctgttcc 108" atgggagagt attttagaga caatggcaaa catgctttga tcatctatga cgacttatcc 114" aaacaggctg ttgcttaccg tcagatgtct ctgttgctcc gccgaccccc tggtcgtgag 120" gcctatcctg gtgatgtgtt ctacctacac tcccggttgc tggagagagc agccaaaatg 267267 "26" aacgatgctt ttggtggtgg ctccttgact gctttgccag tcatagaaac tggt "32" gatgtgtctg cttacattcc aacaaatgtc atttccatca ctgacggaca gatcttcttg "38" gaaacagaat tgttctacaa aggtatccgc atta acgttggtct gtctgtatct "44" cgtgtcggat ccgctgccca aaccagggct atgaagcagg gtac catgaagctg "50" gaattggctc agtatcgtga ggttgctgct tttgcccagt tcggttctga tgct "56" gccactcaac aacttttgag tcgtggcgtg cgtctaactg agttgctgaa acag "62" tattctccca tggctattga agaacaagtg gctgttatct atgcgggtgt aaggggatat "68" cttgataaac tggagcccag caagattaca aagtttgaga tctt gtctcatgtc "74" gtcagccagc accaagcctt gttgggcact atcagggctg atggaaagat acaa "80" tcagatgcaa agctgaaaga gattgtaaca ttgg ctggatttga agcttaaact "86" cctgtggatt cacatcaaat accagttcag ttttgtcatt gttctagtaa attagttcca "92" aaag ggttactctc atactcctta tgtacagaaa tcacatgaaa aataaaggtt 1981 ccataatgca tagttaaaaa Protein seguence (variant 4): NCBI nce Seguence: NP_001001935.1 LOCUS NP_001001935 ACCESSION 001935.1 l mssileeril gadtsvdlee tgrvlsigdg iarvhglrnv qaeemvefss glkgmslnLe 6" pdnvgvvvfg ndklikegdi vkrtgaivdv pvgeellgrv vdalgnaidg kgpigsktrr 12" rvglkapgii prisvrepmq tgikavdslv pigrgqreli igdrqtgkts iaidtiinqk 18" rfndgsdekk klyciyvaig aqlv krltdadamk ytivvsatas daaplqylap 24" ysgcsmgeyf rdngkhalii yddlskqava llrr ppgreaypgd vfylhsrlLe " raakmndafg ggsltalpvi etqagdvsay iptnvisitd gqifletelf ykgirpainv 36" glsvsrvgsa aqtramkqva gtmklelaqy revaafaqu sdldaatqql lsrgvrltel 42" lkngyspma ieeqvaviya gvrgyldkle pskitkfena flshvvsqhq allgtiradg 48" kiseqsdakl keivtnflag fea Nucleotide seguence (variant 5): NCBI Reference Seguence: 257335.1 LOCUS NM_001257335 268268 ACCESSION NM_001257335 l ggggcagtac ttccgggtca ggtgggccgg ctgtcttgac cttctttgcg ccat 6" tttgtcccag tcagtccgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 12" agatgctgtc cgtgcgcgtt gctgcggccg tggtccgcgc ccttcctcgg cgggccggac 18" gcac cgaaggccgg catgatgcag gcggccgggt ggggctgcag ggtggtggtg 24" cgccggctcg ggcgctctct gcaggagggc gaggggctgt ggcgaatgcc gccatcttgc " acccgtggct tctccggctg gacagagcag gcgacacagg tgcccttttg ctcgtcacct 36" aggc agaatggtac agggcagaca tcga tggtgtccag agacagggcc 42" tcaagattcc tgtcttcggc tgacagcggc cctagaaggg ggatcttggg tgaaggtcag 48" ggcttgggcg ctagctctcc tgtt ctgaatcggt aaat gctttgggtt 54" catctttcat tgctgcaagg aacttccatg cctctaacac tcatcttcaa aagactggga 60" ctgctgagat gtcctctatt cttgaagagc gtattcttgg agctgatacc tctgttgatc 66" ttgaagaaac tgggcgtgtc ttaagtattg gtgatggtat tgcccgcgta catgggctga 72" ggaatgttca agcagaagaa atggtagagt tttcttcagg cttaaagggt atgtccttga 78" acttggaacc tgacaatgtt ggtgttgtcg tgtttggaaa acta attaaggaag 84" gagatatagt gaagaggaca ggagccattg tggacgttcc agttggtgag gagctgttgg 90" tagt tgatgccctt ggtaatgcta ttgatggaaa gggtccaatt ggttccaaga 96" Cgcgtaggcg agttggtctg aaagcccccg gtatcattcc tcgaatttca gtgcgggaac "02" caatgcagac tggcattaag gctgtggata gcttggtgcc aattggtcgt cgtg "08" aactgattat tggtgaccga cagactggga aaacctcaat tgctattgac acaatcatta "l4" accagaaacg tttcaatgat ggatctgatg aaaagaagaa gctgtactgt atttatgttg "20" gtca aaagagatcc actgttgccc agttggtgaa gagacttaca gatg "26" ccatgaagta caccattgtg gtgtcggcta cggcctcgga tgctgcccca cttcagtacc "32" tggctcctta ctctggctgt ggag agtattttag agacaatggc aaacatgctt "38" tcta tgacgactta tccaaacagg ctgttgctta ccgtcagatg tctctgttgc "44" tccgccgacc ccctggtcgt gaggcctatc atgt gttctaccta cactcccggt "50" tgctggagag caaa atgaacgatg cttttggtgg tggctccttg actgctttgc "56" cagtcataga aacacaggct ggtgatgtgt ctgcttacat tccaacaaat gtcatttcca "62" tcactgacgg acagatcttc ttggaaacag aattgttcta caaaggtatc gcaa "68" ttaacgttgg tctgtctgta tctcgtgtcg gatccgctgc ccaaaccagg gctatgaagc 269269 1741 aggtagcagg taccatgaag ctggaattgg ctcagtatcg tgct gccc 1801 gttc tgacctcgat gctgccactc aacaactttt gagtcgtggc gtgcgtctaa 1861 ctgagttgct gaagcaagga cagtattctc ccatggctat tgaagaacaa gtggctgtta 1921 tctatgcggg tgtaagggga tatcttgata aactggagcc cagcaagatt acaaagtttg 1981 agaatgcttt cttgtctcat gtcgtcagcc agcaccaagc cttgttgggc actatcaggg 2041 ctgatggaaa gatctcagaa caatcagatg caaagctgaa agagattgta acaaatttct 2101 tggctggatt tgaagcttaa actcctgtgg attcacatca aataccagtt tgtc 2161 attgttctag taaattagtt ccatttgtaa tact ctcatactcc ttatgtacag 2221 aaatcacatg aaaaataaag gttccataat gcatagttaa aaa Protein ce (variant 5): NCBI Reference Seguence: NP_001244264.1 1244264 ACCESSION N P_001244264 l mssileeril dlee tgrvlsigdg iarvhglrnv qaeemvefss lnle 61 pdnvgvvvfg ndklikegdi vkrtgaivdv pvgeellgrv vdalgnaidg kgpigsktrr 121 rvglkapgii prisvrepmq tgikavdslv pigrgqreli gkts iaidtiinqk 181 rfndgsdekk klyciyvaig qkrstvaqlv krltdadamk ytivvsatas daaplqylap 241 ysgcsmgeyf rdngkhalii yddlskqava yrqmslllrr ppgreaypgd rlle 301 raakmndafg ggsltalpvi etqagdvsay iptnvisitd gqifletelf ykgirpainv 361 glsvsrvgsa aqtramkqva gtmklelaqy revaafaqu sdldaatqql lsrgvrltel 421 1kngyspma ieeqvaviya gvrgyldkle pskitkfena flshvvsqhq allgtiradg 481 kiseqsdakl keivtnflag fea HSPA9 (See entry for GRP75 above) MARS Official Symbol: MARS Official Name: methionyI-tRNA synthetase 270270 Gene ID: 4141 Organism: Homo sapiens Other Aliases: METRS, MRS, MTRNS Other Designations: lic nyI-tRNA synthetase; methionine tRNA ligase 1, cytoplasmic; methionine--tRNA ligase, cytoplasmic Nucleotide seguence: NCBI Reference Seguence: NM_004990.3 LOCUS NM_004990 ACCESSION NM_004990 l aaatagtcta ctttccggta ccag ggcagtggcc taatacggaa ctccatttcc 6" cggcgtgcct cgcggaggcc gctgaactca gaagcgggag gccggttccg gttgcatcag 12" cgagggattc acggcgaaat gagactgttc gtgagtgatg gcgtcccggg ttgcttgccg 18" gtgctggccg ccgccgggag agcccggggc agagcagagg tgctcatcag cactgtaggc 24" ccggaagatt gtgtggtccc gttcctgacc cggcctaagg tccctgtctt gcagctggat " agcggcaact acctcttctc cactagtgca atctgccgat attttttttt gttatctggc 36" tgggagcaag atgacctcac taaccagtgg tggg aagcgacaga gctgcagcca 42" gctttgtctg ctgccctgta ctatttagtg ggca agaaggggga agatgttctt 48" ggttcagtgc ggagagccct gactcacatt gaccacagct tgagtcgtca tcct 54" gctg gggagacaga agcc gacattgttt tgtggggagc cctataccca 60" ttactgcaag atcccgccta tgag gagctgagtg ccctgcacag ctggttccag 66" acactgagta cccaggaacc atgtcagcga gctgcagaga ctgtactgaa acagcaaggt 72" gtcctggctc tccggcctta cctccaaaag cagccccagc ccagccccgc tgagggaagg 78" gctgtcacca atgagcctga ggaggaggag ctggctaccc tatctgagga ggagattgct 84" atggctgtta ctgcttggga gaagggccta gaaagtttgc ccccgctgcg gccccagcag 90" aatccagtgt tgcctgtggc tggagaaagg aatgtgctca tcaccagtgc cctcccttac 96" gtcaacaatg tcccccacct tgggaacatc attggttgtg tgctcagtgc cgatgtcttt 102" tact ctcgcctccg ccagtggaac accctctatc tgtgtgggac agatgagtat 108" ggtacagcaa cagagaccaa ggctctggag gagggactaa ccccccagga gatctgcgac 114" aagtaccaca tcatccatgc tgacatctac cgctggttta acatttcgtt tttt 120" ggtcgcacca ccactccaca caaa atcacccagg acattttcca gcagttgctg 271271 "26" aaacgaggtt ttgtgctgca agatactgtg gagcaactgc gatgtgagca ctgtgctcgc "32" ttcctggctg accgcttcgt ggagggcgtg tgtcccttct atga ggaggctcgg "38" ggtgaccagt agtg tggcaagctc atcaatgctg tcgagcttaa gaagcctcag "44" tgtaaagtct gccgatcatg ccctgtggtg agcc agcacctgtt cctg "50" cctaagctgg agaagcgact ggaggagtgg ttggggagga cattgcctgg cagtgactgg "56" acacccaatg cccagtttat cacccgttct tggcttcggg atggcctcaa gccacgctgc "62" ataacccgag acctcaaatg gggaacccct gtacccttag aaggttttga agacaaggta "68" ttctatgtct ggtttgatgc cactattggc tatctgtcca tcacagccaa ctacacagac "74" cagtgggaga gatggtggaa gaacccagag caagtggacc tgtatcagtt catggccaaa "80" gacaatgttc ctttccatag cttagtcttt ccttgctcag ccctaggagc tgaggataac "86" tataccttgg tcagccacct cattgctaca gagtacctga actatgagga tgggaaattc "92" tctaagagcc tggg agtgtttggg gacatggccc aggacacggg gatccctgct "98" tggc gcttctatct gctgtacatt ngcctgagg gccaggacag tgctttctcc 204" tggacggacc tgctgctgaa gaataattct gagctgctta acaacctggg catc 210" aacagagctg ggatgtttgt gtctaagttc tttgggggct atgtgcctga gatggtgctc 216" acccctgatg atcagcgcct gctggcccat gtcaccctgg agctccagca ctatcaccag 222" ctacttgaga aggttcggat ccgggatgcc ttgcgcagta tcctcaccat atctcgacat 228" ggcaaccaat atattcaggt gaatgagccc tggaagcgga gcag tgaggctgac 234" aggcaacggg caggaacagt gactggcttg gcagtgaata tagctgcctt gctctctgtc 240" cagc cttacatgcc cacggttagt gccacaatcc aggcccagct gcagctccca 246" cctccagcct gcagtatcct gctgacaaac ttcctgtgta ccttaccagc aggacaccag 252" attggcacag tcagtccctt gttccaaaaa ttggaaaatg ttga aagtttaagg 258" cagcgctttg gagggggcca ggcaaaaacg tccccgaagc cagcagttgt agagactgtt 264" gcca agccacagca gatacaagcg ctgatggatg aagtgacaaa acaaggaaac 270" cgag aactgaaagc acaaaaggca gacaagaacg aggttgctgc ggaggtggcg 276" ttgg atctaaagaa acagttggct gtagctgagg ggaaaccccc tgaagcccct 282" aaaggcaaga agaaaaagta aaagaccttg gctcatagaa agtcacttta aggg 2881 acagtaataa gtac aatctctata aaaa aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_004981.2 272272 LOCUS NP_004981 ACCESSION NP_004981 l mrlfvsdgvp gclpvlaaag rargraevli stvgpedcvv kvpv lqldsgnny 6" stsaicryff llsgweqddl tnqwleweat elqpalsaal yylvvqgkkg edvlgsvrra 12" lthidhslsr qncpflaget esladivlwg alypllqdpa ylpeelsalh swfqtlstqe 18" pcqraaetvl qugvlalrp qukqpqup aegravtnep tlse eeiamavtaw _ ekgleslppl rpqqnpvlpv agernvlits alpyvnnvph lgniigcvls advfarysrl " rqwntlylcg tdeygtatet kaleegltpq eicdkyhiih adiyrwfnis fdifgrtttp 36" qdif qqllkrgfvl thveqlrce hcarfladrf vegvcpfcgy 42" cgklinavel kkpqckvcrs cpvvqssth fldlpklekr leewlgrtlp gsdwtpnaqf 48" itrswlrdgl rdlk wgtpvplegf edkvfvafd atigylsita nytdqwerww 54" knpequlyq fmakdnvpfh slvfpcsalg aednytlvsh liateylnye dgkfsksrgv 60" gvfgdmath wrfy llyirpequ safswtdlll knnsellnnl gnfinragmf 66" gyvp emvltpddqr llahvtlelq hthllekvr irdalrsilt isrhgnqyiq 72" vnepwkrikg seadrqragt niaa llsvmlqpym ptvsatiqaq lqlpppacsi 78" lltnflctlp aghqigtvsp lqulendqi esqurfggg qaktspkpav vetvttakpq 84" qiqalmdevt relk aqkadkneva aevaklldlk kqlavaegkp peapkgkkkk Official Symbol: SENP1 Official Name: SUMOf/sentrin specific peptidase 1 Gene ID: 29843 Organism: Homo sapiens Other Aliases: SuPr-2 Other Designations: SUMOf/sentrin specific protease 1;sentrin-specific protease 1; n/SUMO-specific protease SENP1 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001267594.1 LOCUS NM_001267594 273273 ACCESSION NM_001267594 l attccgagta cgagaaagcg aaaaagccca gactgaaaag ggtactgaga aattacgact 6" agtcttaaat gctcccttcg cttctcgggc ctcgccacac cgcgcaggcg ccccactggt 12" ctct gttctttgac ctcctgcccc agccccctcc tcttcagcca gact 18" cttccggtgc tgtgaaggcg gttccggttc gcggcggttc ccgggttttg cgttccgcgc 24" ccggccggaa accccttcgc atggcagccg gttc ggactttgta tctttgctaa " agtcagtgat gtgaaaagac ttgaaatgga tgatattgct gataggatga ggatggatgc 36" tggagaagtg actttagtga accacaactc cgtattcaaa ctcc tgccacaaac 42" aggttttcca gaggaccagc tttcgctttc tgaccagcag attttatctt ccaggcaagg 48" acatttggac cgatctttta catgttccac aagaagtgca aatc caagctatta 54" ctcagataat ccttcctcag acagttttct tggctcaggc gatttaagaa cctttggcca 60" gagtgcaaat ggccaatgga gaaattctac cccatcgtca agctcatctt aatc 66" aagaaacagc cgaagtcttt acctcgaaac ccgaaagacc ggat tatcaaacag 72" ggga aagtcaaacc gcca tgtatctgca tatgaaaaat cttttcctat 78" taaacctgtt ccaagtccat gtgg ttcatgtcgt cgaagtcttt tgagccccaa 84" gaaaactcag aggcgacatg ttagtacagc agaagagaca gttcaagaag aagaaagaga 90" caga ctac agatggtcac agggaaacag tttactatag ccaaacccac 96" cacacatttt cctttacacc tgtctcgatg ttcc agtaaaaata ctttgaaaga "02" ctcactgttt aaaaatggaa actcttgtgc atctcagatc attggctctg atacttcatc "08" atctggatct gccagcattt taactaacca ggaacagctg tcccacagtg tatattccct "l4" atcttcttat accccagatg ttgcatttgg atccaaagat tctggtactc ttcatcatcc "20" ccatcatcac gttc cacatcagcc agataactta gcagcttcaa atacacaatc "26" tgaaggatca gactctgtga tgaa agtgaaagat tcccagactc caactcccag "32" ttctactttc ttccaggcag agctgtggat caaagaatta actagtgttt atgattctcg "38" agcacgagaa agattgcgcc agattgaaga acagaaggca ttggccttac agcttcaaaa "44" ccagagattg caggagcggg aacattcagt acatgattca gtagaactac atcttcgtgt "50" tgaa aaggagattc ctgttactgt tgtccaagaa acacaaaaaa aaggtcataa "56" attaactgat agtgaagatg aatttcctga agag gaaatggaga aagaaataaa "62" gaatgtattt cgtaatggga atcaggatga agttctcagt gaagcatttc gcctgaccat "68" tacacgcaaa gatattcaaa ctctaaacca tctgaattgg ctcaatgatg agatcatcaa 274274 174" tttctacatg aatatgctga tggagcgaag taaagagaag ggcttgccaa gtgtgcatgc 180" atttaatacc ttca ctaaattaaa aacggctggt tatcaggcag tgaaacgttg 186" gacaaagaaa gtagatgtat tttctgttga cattcttttg gtgcccattc acctgggagt 192" acactggtgt ctagctgttg tggactttag aaagaagaat attacctatt acgactccat 198" gggtgggata aacaatgaag cctgcagaat actcttgcaa tacctaaagc aagaaagcat 204" tgacaagaaa aggaaagagt ttgacaccaa tggctggcag cttttcagca gcca 210" ggagattcct cagcagatga atggaagtga ctgtgggatg tttgcctgca aatatgctga 216" ctgtattacc aaagacagac caatcaactt cacacagcaa cacatgccat acttccggaa 222" ggtc atcc tccaccgaaa actcttgtga agactgtctc acttagcaga 228" ccttgaccat gtgggggacc agctctttgt tgtctacagc cagagacctt ggaaacagct 234" gctcccagcc ctctgctgtt ccct tgatcctgga cctg gcgagatgca 240" ttcacaagca catctgcctt tccttttgta tctcagatac tatttttgca aagaaacttt 246" ggtgctgtga aaggggtgag ccct aagctgaaga gagagactgc ttttcacttc 252" ttcagttctg ccatcttgtt ttcaaagggc tccagcctca ctcagtccct aattatggga 258" ctgagaaaag aaga atcttggttt catataaatt gtta ggccttacta 264" agaagtagga aagggcatgg gcaaaaggta gggataaaaa ccaccagcat atacatggac 270" atacacacac acccacacac acaaacacac acacacacac aattttcacg atgtatggtc 276" aggaatgtga ctgtaaactg gactttgggg cccaggcata agtcccttcc tccaggacct 282" ttcctattta ccta tacaaaatcc atctgctttt atacgtagct gttttatcat 288" ctgtagcttc atcctatccg gaggcacagc acatgagccc tggacaggtc ccaaagttcc 294" aagcagtcct ttccgtgaaa gcaggggttt gcatgtgcta ccaacacatg atacggggaa 300" ccca gggagcggtt tcagtggcgc aacaaagcac cacttttact tact 306" tctgaccaag aagaaaaagg accttagtat ttagcataaa attccagcgc tggatgaatg 312" cagatctagt ttggtctgtg gctagtttaa atatgtttct aaccacagag aatttcatat 318" atatatacat atatatatac acatacatat atat atatgtatgt ataaaatttc 324" atat gctttttttt ttaaagactg aatgtgttca ccatttagcc tgtagattta 330" tttccatttt ccaaattcca gcacacagag atcccagccc ctatgagtag ggtgtttgtg 336" gactacctaa attt ttgaggcctg cttt gccatatggg tagaggttac 342" agagggaggt gatattttca gctaaaaaaa ggtg gagtttggac tgatcaactt 348" gagatttaaa aactgctatt ccttttgttc tttctagcat ctctccccac cctctgagag 275275 354" ctcctcaggc ttagatagtg aagtgatcaa atgccagtgt cattttgtac tcca 360" gaac attttatact tgta ttgtaatagg tagttttgta tgaaatcttt 366" tctcctctcc cgttgtaccg cattctt:cc agcattgtgc tttttccctg ggcttatttg 372" aaaattttac tgttttatac aagctcg:tt agtacatttt tctatgtttt accacaagtt 378" tgaa aagaaaacta t:aa catt gttaactgaa tgttactgtt 384" tccactccag caactacatg tcctccc:tc ctgc ggga aagaccacct 390" tttgtgtgtt tgttttttct ctctctt:ct ttccctttct ctttctatct ctctttattt 396" ttctttcttt ttctttgttt ttgagtt:tc tataggaaat aaatagcttt ctatatatga 102" gttgctgggg accttcacat tctcttt:ag gtgg catgcagtct cattgcagga 108" ctcctggaat tggt tatt tactgtatgt aagcaacaac ttgaaaggtg 114" gcaatatggt gtcgatttgg actatgaatc aaaagacctt tttcaggttc tttcactatt 120" gtctggggga ctcagaacaa gattgttctc tgtatttatt gtttgtccat ttaggtaaca 126" tctgtcttac cttcctcaca gactttgtac agaccaaagc aacaaatatt tattgccatg 132" tatagcagaa aatgaaacat gcaacaaaag cactttgaaa aatatataag gaattgttga 138" gcctgtctga atttgggccc cctttctgac taatgcagtt ttgcacaagg tagaagttag tgaccctgag ttac caccctggac ctggtccaaa tacagactta cattctttcc tgagctagcc aacaagagca ggagtagtat ctggaaactt ttaggggtag gctttgatga ccaggaaaaa aaaaaaggta tttctgcatt ttatggccca Z62" aaggcatgtt attaatatct tatgtaattt actttaaact aaataagact tttttctcct 4681 gtgtaaaaaa aaa Protein seguence (variant 1): NCBI Reference Seguence: NP_001254523.1 LOCUS N P_001254523 ACCESSION NP_001254523 1 mddiadrmrm dagevtlvnh nsvfkthllp qtgfpedqls lsdqqilssr qghldrsftc 61 ynps yysdnpssds flgsgdlrtf gqsanngrn stpssssslq ksrnsrslyl 121 etrktssgls snhh chvsayeksf pikpvpspsw sgscrrslls pkktqrrhvs 181 taeetvqeee reiyrqllqm tiak ptthfplhls rclssskntl kdslfkngns 241 casqiigsdt sssgsasilt nqeqlshsvy slssytpdva fgskdsgtlh hphhhhsvph 301 qunlaasnt qsegsdsvil 1kvkdsqtpt psstffqael wikeltsvyd srarerqui 276276 361 eeqkalalql qnquqereh svhdsvelhl rvplekeipv qkkg hkltdsedef 421 peiteemeke iknvfrngnq devlseafrl titrkdiqtl nhlnwlndei infymnmlme 481 rskekglpsv hafntffftk lktaquavk rwtkkvdvfs vdillvpihl gvhwclavvd 541 frkknityyd smgginneac rillqylkqe sidkkrkefd tngwafskk sqeipqqmng 601 sdcgmfacky adcitkdrpi nftqqhmpyf rkrmvweilh rkll Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001267595.1 LOCUS NM_001267595 ACCESSION NM_001267595 1 atggagggag cctggtccag gagactgtgt cggacccggt cagccggccc gggctggact 6" gggcggaagc ggggagcact ccgg cgccttttcc tctgccccgc cccctgggac 12" cacctcccct cccccctgct gtccggtggc Cgcgctgtgg ccgccggtgg ccgttaggct 18" acctgaggcc tctg ctct cctgggccgc ggagagaccg tctccctgcc 24" gttacagcag gccccatccc agcgcccagc cgtacttggg gaaaggccgg ttgcgattcc " ggggctttcc cgccagagct gggtcttctc tggggagagc tgttttcacc gggaagctcg 36" gctttctgtg gtaccggctt catctcccgc cttccttgag acccgagtga tatttcttga 42" ctacttctgc gtctcacgta aacatttctc tcct actctgtggt atctccctga 48" gatgtgatat cgctagtgcc accatcagaa agaaacgtct ggaccctcct gctcaggact 54" ttgtatcttt gctaaagtca gtgatgtgaa aagacttgaa atggatgata ttgctgatag 60" gatgaggatg gatgctggag aagtgacttt agtgaaccac aactccgtat tcaaaaccca 66" cctcctgcca caaacaggtt ttccagagga ccagctttcg ctttctgacc agcagatttt 72" atcttccagg catt tggaccgatc atgt tccacaagaa gtgcagctta 78" taatccaagc tattactcag ataatccttc ctcagacagt tttcttggct caggcgattt 84" aagaaccttt ggccagagtg caaatggcca atggagaaat ccat cgtcaagctc 90" atctttacaa aaatcaagaa acagccgaag cctc gaaacccgaa caag 96" tggattatca aacagttttg cgggaaagtc aaaccatcac gtat ctgcatatga 102" aaaatctttt cctattaaac ctgttccaag ttgg agtggttcat gtcgtcgaag 108" tcttttgagc cccaagaaaa ggcg acatgttagt acagcagaag ttca 114" agaagaagaa agagagattt acagacagct gctacagatg gtcacaggga aacagtttac 120" caaa cccaccacac attttccttt acacctgtct cgatgtctta gttccagtaa 277277 "26" aaatactttg aaagactcac tgtttaaaaa tggaaactct tgtgcatctc agatcattgg "32" tact tcatcatctg gatctgccag cattttaact aaccaggaac agctgtccca "38" cagtgtatat tccctatctt cttatacccc agatgttgca tttggatcca aagattctgg "44" tactcttcat catccccatc atcaccactc tgttccacat cagccagata acttagcagc "50" ttcaaataca caatctgaag gatcagactc tgtgatttta ctgaaagtga aagattccca "56" gactccaact cccagttcta tcca gctg tggatcaaag aattaactag "62" tgtttatgat tctcgagcac gagaaagatt gatt gaagaacaga aggcattggc "68" cttacagctt caaaaccaga gattgcagga gcgggaacat tcagtacatg attcagtaga "74" actacatctt cgtgtacctc ttgaaaagga gattcctgtt actgttgtcc aagaaacaca "80" aaaaaaaggt cataaattaa ctgatagtga agatgaattt cctgaaatta cagaggaaat "86" ggagaaagaa ataaagaatg tatttcgtaa tgggaatcag gatgaagttc tcagtgaagc "92" atttcgcctg accattacac gcaaagatat tcaaactcta aaccatctga attggctcaa "98" tgatgagatc atcaatttct acatgaatat gctgatggag cgaagtaaag agaagggctt 204" gccaagtgtg ttta ataccttttt cttcactaaa ttaaaaacgg atca 210" ggcagtgaaa acaa agaaagtaga tgtattttct gttgacattc ttttggtgcc 216" cattcacctg cact ggtgtctagc tgttgtggac tttagaaaga ttac 222" ctattacgac tccatgggtg acaa tgaagcctgc ctct tgcaatacct 228" aaagcaagaa agcattgaca agaaaaggaa agagtttgac accaatggct ggcagctttt 234" cagcaagaaa agccaggaga agca gatgaatgga tgtg ggatgtttgc 240" ctgcaaatat gctgactgta ttaccaaaga cagaccaatc aacttcacac agcaacacat 246" gccatacttc cggaagcgga tggtctggga gatcctccac cgaaaactct tgtgaagact 252" gtctcactta gcagaccttg accatgtggg gctc tttgttgtct acagccagag 258" accttggaaa cagctgctcc cagccctctg ctgttgtaac acccttgatc cagg 264" ccctggcgag atgcattcac aagcacatct gcctttcctt ttgtatctca gatactattt 270" ttgcaaagaa actttggtgc tgtgaaaggg gtgagggaca tccctaagct gaagagagag 276" actgcttttc acttcttcag ttctgccatc ttgttttcaa agggctccag cctcactcag 282" tccctaatta tgag aaaagcttgg aaagaatctt ggtttcatat aaattcttgt 288" tgttaggcct tactaagaag taggaaaggg catgggcaaa aggtagggat aaaaaccacc 294" agcatataca tggacataca cacacaccca caaa cacacacaca cacacaattt 300" tcacgatgta tggtcaggaa tgtgactgta aactggactt tggggcccag gcataagtcc 278278 306" cttcctccag gacctttcct atttatatgt ccctatacaa aatccatctg tacg 312" tagctgtttt atcatctgta gcttcatcct atccggaggc acagcacatg agccctggac 318" aggtcccaaa gttccaagca gtcctttccg cagg ggtttgcatg tgctaccaac 324" acatgatacg gggaagaccc acccagggag cggtttcagt ggcgcaacaa agcaccactt 330" ttactgttgc ctacttctga agaa aaaggacctt agtatttagc ataaaattcc 336" ggat gaatgcagat ctagtttggt ctgtggctag tatg tttctaacca 342" cagagaattt catatatata tacatatata tatacacata catatatata tatg 348" tatgtataaa atttcacagg gatatgcttt tttttttaaa gactgaatgt gttcaccatt 354" tagcctgtag atttatttcc attttccaaa ttccagcaca cagagatccc agcccctatg 360" agtagggtgt ttgtggacta cctaatggaa tatttttgag gcctggatga actttgccat 366" atgggtagag gttacagagg gaggtgatat tttcagctaa aaaaaaaaac gggtggagtt 372" tggactgatc aacttgagat ttaaaaactg ctattccttt tgttctttct agcatctctc 378" ctct gagagctcct caggcttaga tagtgaagtg atcaaatgcc agtgtcattt 384" tgtacttaag agta ggaacatttt atactttttt ctgtattgta ataggtagtt 390" ttgtatgaaa tcttttctcc tctcccgttg taccgcattc tt:ccagcat tgtgcttttt 396" ccctgggctt atttgaaaat tttactgttt tatacaagct cg:ttagtac ctat 102" gttttaccac aagttacaat ttgaaaagaa aactattttt tatt ccattgttaa 108" ctgaatgtta ctgtttccac tccagcaact acatgtcctc cc:tcaactg cctgcctttt 114" ggggaaagac ttgt gttt tttctctctc tt :ctttccc tttctctttc 120" tatctctctt tatttttctt tctttttctt tgag tt:tctatag gaaataaata 126" gctttctata tatgagttgc tggggacctt cacattctct tt:agaaagc tgtggcatgc Z32" agtctcattg caggactcct ggaatattgt ctggttcttg gtatttactg tatgtaagca Z38" acaacttgaa aggtggcaat atggtgtcga tttggactat gaatcaaaag acctttttca Z44" ggttctttca ctattgtctg ggggactcag aacaagattg ttctctgtat ttattgtttg 150" tccatttagg taacatctgt cttaccttcc tcacagactt tgtacagacc aaagcaacaa 156" atatttattg ccatgtatag cagaaaatga aacatgcaac aaaagcactt tgaaaaatat Z62" aatt gttgagcctg tttg ggcccccttt aatg cagttttgca Z68" caaggtagaa gttagtgacc ccat cttaccaccc tggacctggt ccaaatacag Z74" acttacacag tggaccattc tttcctgagc tagccaacaa gagcaggagt agtatctgga 180" aactttcccc tttgtttagg ggtaggcttt gatgaccagg aaaaaaaaaa aggtatttct 279279 4861 gcattttatg gcccaaaggc atgttattaa atgt aatttacttt aaactaaata 4921 agactttttt ctcctgtgta aa Protein seguence (variant 2): NCBI Reference ce: NP_001254524.1 LOCUS N P_001254524 ACCESSION NP_001254524 l mddiadrmrm dagevtlvnh nsvfkthllp qtgfpedqls lsdqqilssr qghldrsftc 6" strsaaynps yysdnpssds flgsgdlrtf gqsanngrn stpssssslq ksrnsrslyl 12" etrktssgls nsfagksnhh chvsayeksf pikpvpspsw sgscrrslls pkktqrrhvs 18" taeetvqeee reiyrqllqm vtgkqftiak ptthfplhls kntl kdslfkngns 24" casqiigsdt sssgsasilt nqeqlshsvy slssytpdva fgskdsgtlh hphhhhsvph " qunlaasnt qsegsdsvil lkvkdsqtpt psstffqael wikeltsvyd 36" eeqkalalql qnquqereh svhdsvelhl rvplekeipv tvvqetqkkg hkltdsedef 42" meke iknvfrngnq devlseafrl titrkdiqtl nhlnwlndei mlme 48" rskekglpsv hafntffftk lktaquavk rwtkkvdvfs vdillvpihl gvhwclavvd 54" frkknityyd smgginneac rillqylkqe sidkkrkefd tngwafskk sqeipqqmng 60" sdcgmfacky adcitkdrpi nftqqhmpyf eilh rkll ATP|F1 Official : ATP|F1 Official Name: ATPase inhibitory factor 1 Gene ID: 93974 Organism: Homo sapiens Other Aliases: RPS-1092A3.1, ATPI, ATPIP, IP Other Designations: ATP synthase inhibitor protein; ATPase tor protein; ATPase inhibitor, mitochondrial; |F(1); |F1; inhibitor of F(1)F(o)-ATPase Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_016311.4 LOCUS NM_016311 280280 ACCESSION NM_016311 l gaccagattg ggtgcttggc cgtccctgcc attagcgcgt aacgagagac tgcttgctgc 6" gacg ccagaggtgc agctccagca gcaatggcag tgacggcgtt ggcggcgcgg l2" acgtggcttg gcgtgtgggg cgtgaggacc atgcaagccc gaggcttcgg ctcggatcag 18" tccgagaatg tcgaccgggg cgcgggctcc gaag ccggtggggc cttcggaaag 24" agagagcagg ctgaagagga acgatatttc cgagcacaga gtagagaaca agct " ttgaaaaaac accatgaaga agaaatcgtt catcataaga aggagattga gcgtctgcag 36" aaagaaattg agcgccataa gcagaagatc aaaatgctaa aacatgatga gcac 42" accgtgtgcc atagaatggc acatgtcatt gcccacttct gtgtagacat ggtt 48" taactaatat ttgtctgtgt gctactaaca aata aattgtcatc agtgaactgt 54" gaaaaaaaaa aaaaaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_057395.1 LOCUS NP_057395 ACCESSION NP_057395 l mavtalaart wlnggvrtm qargfgsdqs envdrgagsi reaggafgkr eqaeeeryfr 61 aqsreqlaal kkhheeeivh hkkeierlqk eierhquik mlkhdd Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_178190.2 LOCUS NM_178190 ACCESSION NM_178190 l gaccagattg ggtgcttggc cgtccctgcc attagcgcgt agac tgcttgctgc 6" ggcagagacg ccagaggtgc agctccagca gcaatggcag tgacggcgtt ggcggcgcgg 12" acgtggcttg gcgtgtgggg gacc gccc gaggcttcgg tcag 18" tccgagaatg tcgaccgggg cgcgggctcc atccgggaag gggc cttcggaaag 24" cagg ctgaagagga acgatatttc cgacattaca gctt tgagatctct " ttggggtgaa ggattgaaat taaaccctga gccaccgtgt ccttgtagag cacagagtag 36" agaacaactg gcagctttga aaaaacacca tgaagaagaa atcgttcatc ataagaagga 42" gattgagcgt ctgcagaaag aaattgagcg ccataagcag aagatcaaaa tgctaaaaca 281281 481 tgatgattaa accg tgtgccatag acat gtcattgccc acttctgtgt 541 agacatggtt ctggtttaac taatatttgt ctgtgtgcta ctaacagatt ataataaatt 601 agtg aactgtgaaa aaaaaaaaaa aaaaaa n ce (variant 2): NCBI Reference Seguence: NP_835497.1 LOCUS NP_835497 ACCESSION NP_835497 l aart wlnggvrtm qargfgsdqs envdrgagsi reaggafgkr eqaeeeryfr 61 hyrlcfeisl g Nucleotide seguence nt 3) NCBI Reference Seguence: NM_178191.2 LOCUS NM_178191 ACCESSION NM_178191 l gaccagattg ggtgcttggc tgcc attagcgcgt aacgagagac tgcttgctgc 6" ggcagagacg ccagaggtgc agctccagca gcaatggcag tgacggcgtt ggcggcgcgg 12" acgtggcttg gcgtgtgggg cgtgaggacc atgcaagccc gaggcttcgg ctcggatcag 18" tccgagaatg tcgaccgggg cgcgggctcc gaag ccggtggggc cttcggaaag 24" agagagcagg ctgaagagga acgatatttc cggtgaggct caccgggtcc caagtccagc " cctggatctc ccaatggcct tccaatcctt aaactgccaa tcgccccacc cgttcctacc 36" tggtgccttg ggcgccccat cccccaacag aactcccggg ccccaatcca gtatacccta 42" acccttgatg ccgt tgccacgtat agggcactcc cagttacctg agtt 48" tcaggccccc aaaccgtttc caccggcggg tctccaaaac aacccacggc tcaactcctc 54" ctttatcatt accatctccc gcgtggagtt ctcctcaggt cgtgcgaaac acccccagat 60" tcttcgcaca gtgtctagat ccgaccgccc aacgtttgcc tcccagcctg actccctcgg 66" cccttaccca cctgtcaccc cctctacgct ctccttcctc gccagcacgc cttagctttg 72" caagcctgca tgcattcagg cttctcaggt gtttctagac ccccgactcc gcaagagtga 78" ggatgatggg agctggtcat gggagctact tgga caccatcttc taaaggcttt 84" tgccctactc agcccaacct agacctgtag ctct cctgcttagg agtatggagt 282282 90" gggctgggcc tccctttgcc agccttgagt tatctttaac tgacttctgt ccactctgga 96" gagcagtgag gaattaatct tgct tgtcctttgg cctttcactt ctgccttctg "02" ttgagaatta tcaccatgac cata ccgtatagag agccaaggta ttag "08" agactatcta attgagcccc tacattttgt agttaaggaa aactgaggcc tgac "l4" caaaccaaca ttgtaatcca gtcccttctt ggaacctaaa tgcc aagtactgcg "20" catgcaagag accctttatt ggccttacag tgggccattc atttctatag gcaaagaaag "26" ctctagacag attggaatag gaaatggata tttgcctttt agctacaccc ctttgtctgt "32" cttcctcatt ttgttccttt ttttttccct aaaggggagt caagttccct gggttgttcc "38" cctcataagg tattagggac ttgtgtcaca tctctctgga gttttctatt ttaaagagga "44" atctgaaagc aataagctct ttggtcttct taagatggct acacctcaat ttaagatggg "50" tttc actagttgag gagtagaaga ggatgaccag ctagactccc atggaattgg "56" aactcctatt ccttgcttag acattacagg ttatgctttg agatctcttt ggggtgaagg "62" atta aaccctgagc caccgtgtcc ttgtagagca cagagtagag aacaactggc "68" agctttgaaa aaacaccatg aagaagaaat cgttcatcat aagaaggaga ttgagcgtct "74" gcagaaagaa attgagcgcc ataagcagaa gatcaaaatg ctaaaacatg atgattaagt "80" gcacaccgtg tgccatagaa tggcacatgt cattgcccac ttctgtgtag acatggttct "86" ggtttaacta gtct gtgtgctact aacagattat aataaattgt catcagtgaa 1921 aaaa aaaa aaaa Protein sequence (variant 3): NCBI Reference Seguence: NP_835498.1 LOCUS NP_835498 ACCESSION NP_835498 1 mavtalaart ngvwgvrtm qargfgsdqs envdrgagsi reaggafgkr eqaeeeryfr VAM P3 Official Sym bol: VAMP3 Official Name: vesicle-associated membrane protein 3 (cellubrevin) Gene ID: 9341 Organism: Homo sapiens Other Aliases: CEB 283283 Other Designations: VAMP-3; revin; synaptobrevin-3; e-associated membrane protein 3 Nucleotide ce: NCBI Reference Seguence: NM_004781.3 LOCUS NM_004781 ACCESSION NM_004781 l agtgacgtct ttgccccgcg ccgcgccgtc ccacccatct ccctggcctc cggtcccaac 6" ttcgcttctc tgctgaccct tcgc cgctgccgcc gccgcagctg ccaaaatgtc 12" tacaggtcca actgctgcca gtaa tcgaagactt cagcagacac aaaatcaagt 18" agatgaggtg gtggacataa tgcgagttaa cgtggacaag gttctggaaa gagaccagaa 24" gctctctgag ttagacgacc gtgcagacgc actgcaggca ggcgcttctc aatttgaaac " agcc aagttgaaga attg gtggaagaat tgcaagatgt gggcaatcgg 36" gattactgtt ctggttatct tcatcatcat catcatcgtg tgggttgtct cttcatgaag 42" aaccagcgga aact gctgttcaag aaacctcttc aagacttttg acttagaacc 48" tgctatatta tcaagcttac ctactgttat ctctaaaatt ttttttgtgt taatgtaaag 54" ttct aggaaacgtg cctttgtttt ttaatatgca ctccaaatta gaaggccggc 60" cccgtccaca ttttgcacag tgcctttaca gatttacgta tgggctgatg aagaggcctt 66" cttaagttcc agagtgctat aatctagatg taatgttgtc actaattaat tgccattact 72" cccagttagt tacccttgtc atttggcatt agaa ccacatttta aacctttggg 78" taatcagatt tccaacttat gccttccaga aaaaaacact taac acaaatctgt 84" gataacaaca ggctgtgcct tattttgata attttctgat tccctagaag agaaccctct 90" actttttgta agcactactg gctg tatttaagat gctggtgaag agcttttgct 96" cttgcattag atttgaagat gtttacattg ttgttattgt tatgtatcac ttgctaaaaa "02" tattgtttta atcagagata acctctttaa aaaaattttt aaagaactat ggctatgacc "08" aaagcttcta ttttgccaaa aagttaaata ccgataaaat ggccttaagt gtattcctga "l4" cagttaaatt cagaaacgtg ccaaatggaa ctcaaggtgc cccttcagaa ttaaaatcat "20" taccttgtgt gtgaaccttc tacatcttca taggcctttc ttccttttga gtag "26" acagtgtggc tccccttctg attcagtatt ttgcatgggg gaag gtttgaggta "32" gactctgacc gtctcataaa agagttctac ccagcagttg gcagattatc agctgtggac "38" atgt ttctgataat tatgcaagca acaattctgt agcctcaagt aagaccacct 284284 1441 gtgaacttga tcattatctg gcccaaatat gaagataaac tataactttg gagtttgttt 1501 cctatttgta ttcacattct gcttcctaaa tcagttttct aaattatgcc tgcaattagq 1561 cattggtcag gggtgaatgg ctcttttcac agagagtagc caaccagaga cctttgcttt 1621 gatatcatca actgcagaga atgctgttga tgggaatgct ggaagcagaa actttgtcat 1681 cggaaaaact tttcttgtat gcatgagact caacatcagg agct taaagatggg 1741 aattcaggta tgaaagaaaa caggcaagga ggcactgagg gagaaagaca cagactttat 1801 cgctctgtgg ctcattgtta ctggaatatt ctaaaactct tgttcacatg ctattatgac 1861 ttataaagca gcaacagctg aggcgcacca ggacacagct tccatttctt taacgtctgt 1921 aaca tcgctgaaat gatttactgt tgaagagatg ccttgcggtg tggccagctg 1981 tgaggagaaa gcagctggca ggac ccac cttcagcgca ctgg 2041 ccgggtctga ctcagaaacc ttggtactcg ccccttggcc acagtgccca gacccatgta 2101 tggc tcctgcatta acccagaaat acctcgcttc tatctgtgca cttagctggg 2161 aacttaccca tcac ctaaataaag tgtttataaa catgaaaaaa aaaaaaaaaa 2221 aaaaa Protein seguence: NCBI nce Seguence: 772.1 LOCUS NP_004772 ACCESSION NP_004772 1 mstgptaatg snrrqutqn quevvdimr vnvdkvlerd qklselddra dalqagasqf 61 etsaaklkrk ywwknckmwa vifi iiiivwvvss VAPA Official Symbol: VAPA Official Name: VAMP (vesicle-associated ne protein)-associated protein A, 33kDa gene ID: 9218 Organism: Homo sapiens Other Aliases: VAP-33, VAP-A, VAP33, hVAP-33 Other Designations: 33 kDa VAMP—associated protein; VAMP-A; VAMP- associated protein A; vesicle-associated membrane protein-associated protein A Nucleotide sguence (variant 1): 285285 NCBI Reference Seguence: 574.5 LOCUS NM_003574 ACCESSION NM_003574 l agaagctccc ggccgggggc gcgcacgtag gcacgcagag gccgtcacgt gggtcgccga 6" ggctcgcaag tgcgcgtggc cgtggcggct ggtgtggggt tgagtcagtt gtgggacccg 12" gagctgctga cccagcgggt ggcccaccga accggtgaca cagcggcagg cgttagggct 18" cgggagccgc gagcctggcc taga gctcggccga gccg tccc 24" ccag tcagcaaacc gccgccgcgg gcgcgccccc gctctgcgct gtctctccga " tggcgtccgc ctcaggggcc atggcgaagc acgagcagat cctggtcctc gatccgccca 36" cagacctcaa attcaaaggc cccttcacag atgtagtcac tacaaatctt aaattgcgaa 42" atccatcgga agtg tgtttcaaag tgaagactac agcacctcgc cggtactgtg 48" tgaggcccaa cagtggaatt attgacccag ggtcaactgt gactgtttca gtaatgctac 54" agccctttga ctatgatccg aatgaaaaga gtaaacacaa gtttatggta cagacaattt 60" ttgctccacc aaacacttca gatatggaag ctgtgtggaa agaggcaaaa cctgatgaat 66" taatggattc caaattgaga tgcgtatttg aaatgcccaa tgat aaattgggta 72" taactccacc agggaatgct ccgactgtca cttcaatgag cagcatcaac gttg 78" caacacctgc cagttatcac acgaaggatg accccagggg actcagtgtg ttgaaacagg 84" agaaacagaa gaatgatatg gaacctagca ttcc actgaatgca caag 90" atggacctat gccaaaacca cacagtgttt cacttaatga taccgaaaca aggaaactaa 96" tggaagagtg taaaagactt cagggagaaa tgatgaagct atcagaagaa aatcggcacc "02" atga aggtttaagg ctcagaaagg tagcacattc ggataaacct ggatcaacct "08" caactgcatc cttcagagat aatgtcacca gtcctcttcc ttcacttctt gttgtaattg "l4" cagccatttt cattggattc tttctaggga aattcatctt gtagagtgaa gcatgcagag "20" ttct ttt: ttctcttgac cagaaaaaga tttgtttacc taccatttca "26" ttggtagtat ggcccacgg: gaccattttt ttgtgtgtac agcgtcatat aggctttgcc "32" tttaatgatc tcttacggt: agaaaacaca ataaaaacaa cggc tactggacag "38" gttgtatatt accagatca: cactagcaga tgtcagttgc acattgagtc ctttatgaaa "44" ttcataaata tgt: ctttctttgt ggttttaata agagttcaag aattgttcag "50" agtcttgtaa atgttattt: aataatccct ttaaatttta tctgttgctg ttacctcttg "56" gatt tatttagat: gctaatccca ctcattcagg aaatgccaag aggtattcct 286286 "62" tggggaaatg gtgcctctta cagtgtaaat ttttcctcct ttacctttgc taatatcatg "68" gcagaatttt tcttatccct tgtgaggcag actg agtttttcat ccttacaatc "74" ctgtcccatg gtatttaaca taaaaaaaaa taaaactgtt aacagattct tgctcgatag "80" cttgtttgtg gtgt tattagaggg aactccacta tatatggtca cttgaaatta "86" tgatgcaaag gtttctcttg aacc ctcttggata ttacagtatt tttaattgaa "92" agtcctaatt ctgttaagga aaggagttga ttaaatttta aggtaccact ggtattttgg "98" gagattataa tcagtttgtt ttcaagataa tagaaaataa ggtccatgag aatagaagtt 204" atgtgatttc agtgagttga tgtgtacagc atggctgtgc tccatctgat ttaccccatt 210" cttaagttct gagagtatgt tctcaaggaa gatttaactc tctttggttt taaattactt 216" tttaaccagc ctaataaata agtcttacta cttttcataa tatttcataa tagttaaaag 222" taggtgtttt tttcgtgctc aatttggcac tcaaaataat gttcattatg gaagtttggt 228" aatactgagc aagcctgtgg aattttcttt aatg attttagcc: ttgcaaatgt 234" taaccatgtg aaacacattt taag tatgcgttac agggtttga: actttcctgc 240" acttaggttt gtcctattct tcatttattc atactaggat agaaaattt: ggaatcagaa 246" tcca gtgtttagct acatacaatc tagtacaagt tta: tcttaaacat 252" aggtgtgttg gctctttttt taaaagatgc gctctacctg aaaaggaaa: tggattttag 258" aactggatgt ggtgcagtga agtattttag gcccaggtct gtgtacaca: tttatagaag 264" aatgaagtac tctgaagtat tttggttgcc ttttcatttc aactgtgtt: tgaatttgtc 270" agatcacaca tatattgtgt tattgggcgc tgtggtatct tttataaaac ctcttgcttg 276" tgtgcaaaag aaag gaaacacaag taatgcctat ccattactag catgctatgc 282" cttt actgccattg ctgtatgctt tactgtcttt gtaaaaatcc ccctctcccc 288" ttttctggta aaag catgctaaaa atagtcttat attttcaccc cataaatgca 294" gaatcagtaa ttccttggct taaagctctt atataatcaa tattattggt ggtaaatacc 300" aagtttggta tctcatagct atcttttttt aaagaaatta agttcttgaa aatttagcca 306" aatcccgttt tatgggaatg ctctttagaa ttgt cctt tgttctatgg 312" ttgagaaatc tgaggcctta cgaaggttaa gagaactttc cccgtgtctc aggt 318" agaggcagag ctggaactag atatctggtc ctct agctcagtgt cttctggtaa 324" aaaa ttgtcttagt ttgagagatg gctgaaataa tgaacataaa atgctattta 330" caag tatatgtgaa atttcttatt gtaagactac ctta ctgttgaata 336" ttat agtgtttagg ctagaaatgc ctcccacatt ggtaataaac attacaaaat 287287 342" acaatgtatt tttaggtagg cattttataa aatgcattat gccatggttg cttttgagat 348" tagt ctgggtagca tctttaaaat gtatgtgggc ttaactgttg ttcatatcag 354" gagatgctct gattgtatag ctct gtttctgtta tttttaattg ctgtatgaaa 360" tgtgatcaga tact accaacagtt atagtttgaa agtccaactg tattaattga 366" ctgataatat gataatatag agattaaatt gtttgtcttc attccttata tgtttagaag 372" tttttgcttt gtctgcctgc ttacttgtat atgtaagcat gagggaaata cactgttgct 378" aatactgaaa tcaa gtaactaagg ccttgagttc atatgtgaca ctgaatgcac 384" tagcttcctt cgttctataa ctaatgtacc ttaacttccc ccattcttat aaga 390" agctaagtca ttatgttctg agtgtgtggt atgttccctt aaaaaaaaat gacacttgga 396" agaaaaatgt atgaaattca gaaattccga tcaaagaaaa cttt cttttttttt 102" caga gtcttgcttt gttgcccagg ctggagggca gtggtgtgat ctcacctcac 108" tgcagcttcc gcctcctggg ttcaagtgat tctcatggct cagccgcctg agtagctggg 114" attacaggtg tgagccaaca agcccggcta atttttgtat ttttagtaga gacaaggttt 120" caccatgttg gtcaggctgg gctcaaactc ctgaatccgc ctgcctcggc aagt 126" gctgggatta caggtgtgag cacc cagccaagaa aaataatact cttaaatact Z32" tagatgttca cctaaagttg atattatttg gtatgggaat tacttttgaa ctgtaatctt Z38" tcagattaca ccactttgaa aacaagtttt aacagtaggg taaaaatata gtttttgagg Z44" gtattcccaa cttgtgatct tctaccactt tagagacatt caagtaatag ttttcttaga 150" gctttgcaca ttcctattca ctgagatttt aaaaatttca cctttattcg ttat ttgg gaag atcagaaggt aaatgatctt tattttctag Z62" ctttaaaggg aaattaaacc attcatgaat aaactttaaa aatgtgaagt gtccttttcc Z68" ttttcacaat acaaaaaaaa tttcaacaga ttgtgtggtt tgtgcattta tatcctgtta Z74" aata gctaatcact gggacttgaa ttctgatggc agatagtctc ttgcttagtg 180" agtt aactattttt tagtaggaag cagc tgattttcat gccacgtttc 186" atagccccac ttttggtaga ctaccaccac gcttcttcgc gtaagcagtg gcatcttggg Z92" aatgaatgcc cagccgctcg tgggttggtg caaagaagta taaacatata tcactaagga Z98" aaaagaaagt ttgtcttgcc cttctgacac agtgtgtgca cttcaggcaa tttttggaaa 504" atataaaaaa ttccaaattc tgcctttcag caat tgctaggaac atttcattca 510" tttccctgta atattaatgt tctttaagca taatcactaa ttataagttg tatcctattt 516" ttttccagct taatttctgt ggtttattga aaaccaagta taaatgtgac taaaagcatt 288288 522" tgtt gtta ttaa ggttatggac attttataat gtaacatttg 528" attggcctgg cctcttgaca attcccttct agttatgcat ctgt tgccacattt 534" cttgttttaa aactcagttt cttgttttcc agttgttgct atgtataaca cccatcttga 540" aagagagtat ataggaagtt attcagataa cttttgtagt agtgatattc aactatagca 546" gtaccttaac tcatgatgag cttaggaaca taaaagataa ttgttgcttg accc 552" ccagagatac tgacctaatt ggtctggggt ggagatctgg catggtagtt tttttcaagc 558" tccaatcatc ggccagacag ttgctttatg taggttttta aatgccaaag gcagatatga 564" agtagattta attaagactt gacttcagca atacagggga acttaaaata cttatttttc 570" tttaaactgc aggagtcact tatt gcttaaaaaa aattgcataa aagctttgct 576" tgtcaagtta ggattgctgg aataccacta tttg gaat aaatgagctg 582" tcatcgcaaa aaggcgattt gagaaatgtg ggcttcagta ttaattgcca ttttgctgac 588" acccagtgta cctacctacc attt attttgtcca tcatgtattt ctcaaagcaa 594" aaggtggttt tcaagtataa tgtcgttttc aacatgctta ttacttagtt ttacgtcagc 600" tcatttcatc atcattgata gaaa tacttatctc catcctatgg aataggggag 606" acgggtttag acaggttcaa ttagctcaag tctacacagc tgaagtagca gagaaagtgg 612" gatctagatg gtctgatcct agtgatctac catatgaagg acatagtttg tgtcctggtc 618" caagtcaaat attgactcct cacaaacagt aagtatggca attttgtgat gcctttgatt 624" ccactttaca tggagtacta ttatttgtga aatgtcttta agatttttgg tcttaaattt 630" ttgaagactg ctttccccct ccca gaaaattgag aagaagtaaa ctcctgccca 636" ctaacaatct cagtccgtga acaaaaccaa catgaacatt cctaaacaag agtgtgtgtt 642" actctaagaa gaaggctata gaatttatgg aaatggctta tgtaacctac ggag 648" aacagaatgt gactggcctt ttctaatggt cctttaagat ttaatgatta aagcaagagt 654" tttttataat tgactttgtg gtctaaattc ttgatactgt ttct acaaagaaca 660" gtta tgtactatag gcacttaaga accctgagga aaaataatac aatgtgtgtg 666" tgtgagagag agagtgagtt actgacattg ttccaaaaaa aaaaaaaaaa aaaaaaaaaa 672" tgtggagggt tgaaatggta aggaattgga atcttttgta ttttcgagca attc 678" ctattcttgt ttcaaataga ggtttgttag gaattacagt tgtggggagc aaactttctt 684" ttttgtgctg ttttaattca aaatgtatat ccttaattgt atataatatg tagataaata 690" tatgagggta tact ttgaattaaa tttaaggata tatttcacat gaaaacaaat 6961 acaaacgaga atcaaaataa agttttgcaa agta 289289 Protein seguence (variant 1): NCBI nce Seguence: NP_003565.4 LOCUS NP_003565 ACCESSION NP_003565 1 masasgamak heqilvldpp tdlkfkgpft dvvttnlklr npsdrkvcfk vkttaprryc 61 vrpnsgiidp gstvtvsvml qpfdydpnek skhkfquti fappntsdme avwkeakpde 121 lmdsklrcvf empnendklg aptv tsmssinntv atpasyhtkd dprglsvlkq 181 equndmeps kavplnaskq dgpmpkphsv slndtetrkl meeckrlqge mmklseenrh 241 lrdeglrlrk vahsdkpgst stasfrdnvt splpsllvvi fflg kfil Nucleotide seguence (variant 2): NCBI Reference ce: 434.2 LOCUS NM_1 94434 ION NM_194434 1 agaagctccc ggccgggggc gcgcacgtag gcacgcagag gccgtcacgt ccga 6" ggctcgcaag tgcgcgtggc Cgtggcggct ggtgtggggt tgagtcagtt gtgggacccg 12" gagctgctga cccagcgggt ggcccaccga accggtgaca cagcggcagg cgttagggct 18" cgggagccgc gagcctggcc tcgtcctaga gctcggccga gccgtcgccg ccgtcgtccc 24" ccgcccccag tcagcaaacc gccgccgcgg gcgcgccccc gctctgcgct gtctctccga " ccgc ctcaggggcc atggcgaagc acgagcagat cctggtcctc gatccgccca 36" cagacctcaa attcaaaggc cccttcacag atgtagtcac tacaaatctt aaattgcgaa 42" atccatcgga tagaaaagtg tgtttcaaag tgaagactac agcacctcgc cggtactgtg 48" tgaggcccaa cagtggaatt attgacccag ggtcaactgt gactgtttca gtaatgctac 54" agccctttga ctatgatccg aatgaaaaga gtaaacacaa gtttatggta attt 60" ttgctccacc aaacacttca gaag ctgtgtggaa agaggcaaaa cctgatgaat 66" taatggattc caaattgaga tgcgtatttg aaatgcccaa tgat aatg 72" atatggaacc agct gttccactga atgcatctaa gcaagatgga cctatgccaa 78" aaccacacag tgtttcactt aatgataccg aaacaaggaa actaatggaa gagtgtaaaa 84" gacttcaggg agaaatgatg aagctatcag aagaaaatcg gcacctgaga gatgaaggtt 90" taaggctcag aaaggtagca cattcggata aacctggatc aacctcaact gcatccttca 290290 96" gagataatgt caccagtcct cttccttcac ttgt aattgcagcc attttcattg "02" gattctttct agggaaattc atcttgtaga gtgaagcatg cagagtgctg tttctttttt "08" tttt:ttctc ttgaccagaa aaagatttgt ttacctacca tggt agtatggccc "14" acgg:gacca tttttttgtg tgtacagcgt catataggct ttgcctttaa tgatctctta "20" cggt:agaaa acacaataaa aacaaactgt tcggctactg ttgt ccag "26" atca:cacta gcagatgtca gttgcacatt gagtccttta tgaaattcat aaataaagaa "32" ttgt:ctttc tttgtggttt taataagagt tcaagaattg gtct tgtaaatgtt "38" attt:aataa tccctttaaa ttttatctgt tgctgttacc tcttgaaata tgatttattt "44" agat:gctaa tcccactcat tcaggaaatg ggta ttccttgggg aaatggtgcc "50" tcttacagtg taaatttttc ctcctttacc tttgctaata tcatggcaga atttttctta "56" tcccttgtga ggcagttgtt gactgagttt ttcatcctta caatcctgtc ccatggtatt "62" taacataaaa aaaaataaaa ctgttaacag attcttgctc gatagcttgt ttgtgtctgt "68" cgtgttatta gagggaactc cactatatat ggtcacttga aattatgatg caaaggtttc "74" attg aaaccctctt ggatattaca gtatttttaa ttgaaagtcc taattctgtt "80" aaggaaagga gttgattaaa ttttaaggta ccactggtat tttgggagat tataatcagt "86" ttgttttcaa gataatagaa aataaggtcc atgagaatag aagttatgtg gtga "92" gttgatgtgt acagcatggc tgtgctccat tacc ccattcttaa gttctgagag "98" tatgttctca aggaagattt aactctcttt ggttttaaat tactttttaa taat 204" aaataagtct tactactttt cataatattt cataatagtt aaaagtaggt gtttttttcg 210" tgctcaattt caaa ataatgttca ttatggaagt ttggtaatac tgagcaagcc 216" tgtggaattt tctttatgaa aaatgatttt tgca aatgttaacc atgtgaaaca 222" cattttcagt ataagtatgc gttacagggt ttga:acttt ctta ggtttgtcct 228" attcttcatt tattcatact aggatagaaa attt:ggaat cagaaaatag atccagtgtt 234" tagctacata caatctagta caagtgaatt ttta:tctta aacataggtg tgttggctct 240" ttttttaaaa gatgcgctct acctgaaaag gaaa :tggat tttagaactg gatgtggtgc 246" agtgaagtat tttaggccca ggtctgtgta caca:tttat atga agtactctga 252" agtattttgg ttgccttttc atttcaactg tgtt:tgaat ttgtcagatc acacatatat 258" tgtgttattg ggcgctgtgg tatcttttat aaaacctctt gcttgtgtgc aaaagttcct 264" aaaaggaaac acaagtaatg cctatccatt actagcatgc tatgctgcat gctttactgc 270" tgta tgctttactg tctttgtaaa aatccccctc tttc tggtaactgg 291291 276" aaaagcatgc taaaaatagt cttatatttt caccccataa atgcagaatc agtaattcct 282" tggcttaaag ctcttatata atcaatatta ttggtggtaa ataccaagtt tggtatctca 288" tagctatctt tttttaaaga aattaagttc ttgaaaattt agccaaatcc cgttttatgg 294" gaatgctctt tagaattcat tttgttcagc ccctttgttc tatggttgag aaatctgagg 300" gaag gttaagagaa ccgt gtctcacagg taggtagagg cagagctgga 306" actagatatc tggtctgttg actctagctc agtgtc:tct ggtaactgtt gaaaattgtc 312" ttagtttgag agatggctga aataatgaac ataaaa:gct atttataata acaagtatat 318" gtgaaatttc ttattgtaag accg gcttac:gtt gaatagtttg gttatagtgt 324" ttaggctaga aatgcctccc acattggtaa taaaca:tac aaaatacaat gtatttttag 330" gtaggcattt tataaaatgc attatgccat :ttt gatt gtagtctggg 336" tagcatcttt aaaatgtatg tgggcttaac tgttgt:cat atcaggagat gctctgattg 342" tataggtgag actctgtttc tgttattttt aattgc:gta tgaaatgtga tcagattatt 348" ttactaccaa cagttatagt ttgaaagtcc aactgtatta tgat aatatgataa 354" tatagagatt aaattgtttg tcttcattcc ttatatgttt agaagttttt gctttgtctg 360" cctgcttact tgtatatgta agcatgaggg aaatacactg ttgctaatac tgaaattaca 366" taac taaggccttg agttcatatg tgacactgaa tgcactagct tccttcgttc 372" taat gtaccttaac ttcccccatt cttatattta gcta agtcattatg 378" gtgt gtggtatgtt cccttaaaaa aaaatgacac ttggaagaaa aatgtatgaa 384" attcagaaat tccgatcaaa gaaaagtaat tctttctttt tttttttgag acagagtctt 390" gctttgttgc ccaggctgga tggt gtgatctcac ctcactgcag cctc 396" ctgggttcaa gtgattctca agcc gcctgagtag ctgggattac aggtgtgagc 102" caacaagccc ggctaatttt tgtattttta gtagagacaa ggtttcacca tgttggtcag 108" gctgggctca aactcctgaa tccgcctgcc tcggcctccc aaagtgctgg gattacaggt 114" gtgagctgcc gcacccagcc aagaaaaata atactcttaa atacttagat gttcacctaa agttgatatt atttggtatg ggaattactt ttgaactgta atctttcaga ttgaaaacaa acag tagggtaaaa atatagtttt tgagggtatt cccaacttgt 132" gatcttctac cactttagag acattcaagt aatagttttc ttagagcttt gcacattcct 138" attcactgag attttaaaaa tttcaccttt attcgaggga aggatcaatg cttattacca Z44" tttggaaaaa cgaagatcag aaggtaaatg atctttattt tctagcttta aatt 150" aaaccattca tgaataaact ttaaaaatgt gaagtgtcct tttccttttc acaatacaaa 292292 156" aaaaatttca acagattgtg tggtttgtgc atttatatcc tgttaagcat taatagctaa Z62" tcactgggac ttgaattctg atggcagata gtctcttgct tagtgagatg gagttaacta Z68" ttttttagta ggaagtgaga gatt ttcatgccac gtttcatagc tttg Z74" gtagactacc accacgcttc ttcgcgtaag cagtggcatc ttgggaatga atgcccagcc 180" gggt tggtgcaaag aagtataaac atatatcact aaggaaaaag aaagtttgtc 186" ttgcccttct gtgt gtgcacttca ggcaattttt ggaaaatata aaaaattcca Z92" aattctgcct ttcagcagca tcaattgcta ggaacatttc attcatttcc ctgtaatatt Z98" aatgttc:tt aagcataatc actaattata agttgtatcc tatttttttc cagcttaatt 504" tctgtgg:tt attgaaaacc aagtataaat aaaa gcattttgct ttgtttttat 510" agttaac:tt cttaaggtta tttt ataatgtaac atttgattgg cctggcctct 516" tgacaat:cc cttctagtta tgcatatcct cctgttgcca catttcttgt tttaaaactc 522" agtttct:gt gttg ttgctatgta taacacccat cttgaaagag agtatatagg 528" aagttat an gataactttt gtagtagtga tattcaacta tagcagtacc ttaactcatg 534" atgagct:ag gaacataaaa gataattgtt gcttgaatag cacccccaga gatactgacc 540" taattgg:ct ggggtggaga tctggcatgg tagttttttt caagctccaa gcca 546" gacagttgct ttatgtaggt ttttaaatgc caaaggcaga tatgaagtag atttaattaa 552" gacttgactt cagcaataca ggggaactta aaatacttat ttttctttaa actgcaggag 558" tcactgttag gtattgctta aaaaaaattg cataaaagct gtca agttaggatt 564" gctggaatac cactaaagat ttttgacttg tgaataaatg agctgtcatc gcaaaaaggc 570" gatttgagaa atgtgggctt cagtattaat tgccattttg ctgacaccca gtgtacctac 576" ctacctgaga aatttatttt gtccatcatg tatttctcaa agcaaaaggt ggttttcaag 582" tataatgtcg ttttcaacat gcttattact tacg tcagctcatt tcatcatcat 588" tgataacttg tgaaatactt atctccatcc tatggaatag cggg cagg 594" tagc tcaagtctac gaag agaa atct agatggtctg 600" atcctagtga tctaccatat gaaggacata gtttgtgtcc tggtccaagt caaatattga 606" ctcctcacaa acagtaagta tggcaatttt gtgatgcctt tgattccact ttacatggag 612" tactattatt tgtgaaatgt ctttaagatt tttggtctta aatttttgaa gactgctttc 618" cccctttatc tcccagaaaa agaa gtaaactcct gcccactaac agtc 624" cgtgaacaaa accaacatga acattcctaa acaagagtgt gtgttactct aagaagaagg 630" ctatagaatt tatggaaatg gcttatgtaa cctacaagac tggagaacag aatgtgactg 293293 636" gccttttcta cttt aagatttaat gattaaagca agagtttttt ataattgact 642" ttgtggtcta aattcttgat actgtttata attctacaaa gaacaaaaat tgttatgtac 648" tataggcact taagaaccct gaggaaaaat aatacaatgt gtgtgtgtga gagt 654" gagttactga tcca aaaaaaaaaa aaaaaaaaaa aaaaatgtgg agggttgaaa 660" tggtaaggaa ttggaatctt ttgtattttc gagcaataag aattcctatt cttgtttcaa 666" atagaggttt gttaggaatt acagttgtgg ggagcaaact tttg tgctgtttta 672" attcaaaatg tatatcctta attgtatata agat aaatatatga gggtattaag 678" ctactttgaa ttaaatttaa ggatatattt cacatgaaaa caaatacaaa cgagaatcaa 6841 aataaagttt tgcaaagta Protein seguence (variant 2): NCBI Reference Seguence: NP_919415.2 LOCUS NP_919415 ION NP_919415 1 masasgamak heqilvldpp tdlkfkgpft dvvttnlklr npsdrkvcfk vkttaprryc 61 iidp gstvtvsvml qpfdydpnek skhkfquti fappntsdme avwkeakpde 121 lmdsklrcvf empnendkln avpl pmp kphsvslndt etrklmeeck 181 rlqgemmkls eenrhlrdeg ahsd kpgststasf rdnvtsplps llvviaaifi 241 gfflgkfil HNRNPD Official Symbol: HNRNPD Official Name: heterogeneous nuclear ribonucleoprotein D (AU-rich element RNA binding protein 1, 37kDa Gene ID: 3184 Organism: Homo sapiens Other Aliases: AUF1, AUF1A, Other Designations: ARE-binding protein AUFI, type A; heterogeneous nuclear ribonucleoprotein D0; hnRNP D0 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_031370.2 294294 LOCUS NM_031370 ACCESSION NM_031370 l cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6" gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc l2" gcgcgagagt gggaggcgaa aggc cagggagagg cgcaggagcc gcca 18" cgcgcgcgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcg 24" gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc " ggcggagaca ctagcactat ggag cagttcggcg gggacggggc ggcggcagcg 36" gcaacggcgg cgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 42" cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54" gaggatgaag caaa ctcctcccca cgacactctg aagcagcgac ggcacagcgg 60" gaagaatgga ttat aggaggcctt agctgggaca ctacaaagaa gaag 66" gactactttt ccaaatttgg tgaagttgta gactgcactc taga tcctatcaca 72" gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat cggagagtgt agataaggtc 78" atggatcaaa aagaacataa attgaatggg aaggtgattg atcctaaaag ggccaaagcc 84" acaa aagagccggt taaaaaaatt tttgttggtg gcctttctcc acct 90" gaagagaaaa taagggagta ctttggtggt tttggtgagg tggaatccat agagctcccc 96" atggacaaca agaccaataa gaggcgtggg ttctgcttta ttacctttaa ggaagaagaa "O2" ccagtgaaga agataatgga aaagaaatac cacaatgttg gtcttagtaa atgtgaaata "08" aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc tagaggagga "l4" tttgcaggaa gagctcgtgg aagaggtggt ggccccagtc aaaactggaa ccagggatat "20" tatt ggaatcaagg ctatggcaac tatggatata acagccaagg ttacggtggt "26" tatggaggat atgactacac tggttacaac aactactatg gatatggtga ttatagcaac "32" cagcagagtg gttatgggaa ggtatccagg Cgaggtggtc atcaaaatag ctacaaacca "38" tactaaatta ttccatttgc aacttatccc caacaggtgg tgaagcagta ttttccaatt "44" tgaagattca tttgaaggtg gcca cctgctaata gcagttcaaa ctaaattttt "50" tgtatcaagt ccctgaatgg aagtatgacg ttgggtccct ctgaagttta attctgagtt "56" ctcattaaaa gaaatttgct gttt tatttcttaa ttgctatgct tcaa "62" tttgtgtttt atgccctttc ccccagtatt gtagagcaag tcttgtgtta aaagcccagt 295295 168" gtgacagtgt catgatgtag tagtgtctta ctggtttttt aataaatcct tttgtataaa 174" aatgtattgg ctcttttatc atcagaatag gaaaaattgt ttca agttattaaa 180" agcataagtt acag gcttgccgaa attgaggaca tgattaaaat tgcagtgaag 186" tttgaaatgt ttttagcaaa atctaatttt tgccataatg tgtcctccct gtccaaattg 192" actt aatgtcaatt tgtttgttgg ttgttttaat tcct tatgtagcca 198" ttaagattta tatgaatatt ttcccaaatg cccagttttt gcttaatatg tattgtgctt 204" tttagaacaa ataa atgtgcaaaa gtaccccttt gcacagatag ttaatgtttt 210" atgcttccat taaataaaaa ggacttaaaa tctgttaatt ataatagaaa tgcggctagt 216" tcagagagat ttttagagct gtggtggact tcatagatga attcaagtgt tgagggagga 2221 ttaaagaaat atataccgtg tttatgtgtg tgtgctt Protein seguence (variant 1): NCBI Reference ce: NP_112738.1 LOCUS NP_1 12738 ACCESSION NP_112738 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedeghs nssprhseaa taqreewkmf igglswdttk kdlkdyfskf 121 tlkl dpitgrsrgf gfvlfkeses vdkvmqueh klngkvidpk rakamktkep 181 vkkifvggls pdtpeekire yfggfgeves nktn krrgfcfitf keeepvkkim 241 ekkyhnvgls kceikvamsk equqqqug srggfagrar grgggpsqnw nqusnywnq 301 gygnygynsq gydy tgynnyygyg dyanqsgyg kvsrrgghqn sykpy Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_031369.2 LOCUS NM_031369 ACCESSION NM_031369 1 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 61 gcggccgccg ctggtgctta ttctttttta gtgcagcggg cggg agtgtgcgcc 121 gcgcgagagt cgaa gggggcaggc cagggagagg cgcaggagcc gcca 181 cgcgcgcgcc ttccctgtct tgtgtgcttc taga gcgggcgcgc ggcagcggcg 241 gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc 296296 " gaca ctagcactat gtcggaggag cagttcggcg gggacggggc agcg 36" gcaacggcgg ngtaggcgg gggc gagcaggagg gagccatggt ggcggcgaca 42" cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54" gaag ggaaaatgtt tataggaggc cttagctggg acactacaaa tctg 60" aaggactact aatt tggtgaagtt gtagactgca agtt agatcctatc 66" acagggcgat caaggggttt tggctttgtg ctatttaaag aatcggagag tgtagataag 72" gtcatggatc aaaaagaaca taaattgaat gggaaggtga ttgatcctaa aagggccaaa 78" gccatgaaaa caaaagagcc ggttaaaaaa atttttgttg tttc tccagataca 84" cctgaagaga aaataaggga gtactttggt ggttttggtg aggtggaatc catagagctc 90" cccatggaca acaagaccaa taagaggcgt gggttctgct ttattacctt agaa 96" gaaccagtga agaagataat ggaaaagaaa taccacaatg ttggtcttag taaatgtgaa "02" ataaaagtag ccatgtcgaa ggaacaatat cagcaacagc gggg atctagagga "08" ggatttgcag gaagagctcg tggaagaggt ggtggcccca gtcaaaactg gaaccaggga "14" tatagtaact attggaatca aggctatggc aactatggat ataacagcca aggttacggt "20" ggttatggag gatatgacta cactggttac aacaactact atggatatgg tgattatagc "26" aaccagcaga atgg gaaggtatcc aggcgaggtg gtcatcaaaa tagctacaaa "32" taaa catt tgcaacttat ccccaacagg tggtgaagca gtattttcca "38" agat tcatttgaag gtggctcctg ccacctgcta atagcagttc aaactaaatt "44" ttttgtatca agtccctgaa tggaagtatg acgttgggtc cctctgaagt ttaattctga "50" gttctcatta aaagaaattt gctttcattg ttttatttct taattgctat gcttcagaat "56" caatttgtgt tttatgccct ttcccccagt attgtagagc aagtcttgtg ttaaaagccc "62" acag tgtcatgatg tagtagtgtc ttactggttt tttaataaat ccttttgtat "68" aaaaatgtat tggctctttt atcatcagaa taggaaaaat tgtcatggat tcaagttatt "74" aaaagcataa gtttggaaga caggcttgcc gaaattgagg acatgattaa aattgcagtg "80" aagtttgaaa tgtttttagc aaaatctaat ttttgccata atgtgtcctc cctgtccaaa "86" ttgggaatga cttaatgtca atttgtttgt tggttgtttt aataatactt ccttatgtag "92" ccattaagat ttatatgaat attttcccaa atgcccagtt tttgcttaat tgtg "98" ctttttagaa caaatctgga tgca aaagtacccc tttgcacaga tagttaatgt 204" cttc cattaaataa aaaggactta aaatctgtta attataatag aaatgcggct 297297 2101 agttcagaga gatttttaga gctgtggtgg acttcataga caag tgttgaggga 2161 ggattaaaga aatatatacc gtgtttatgt gtgtgtgctt Protein seguence (variant 2): NCBI Reference ce: NP_112737.1 LOCUS NP_112737 ACCESSION NP_112737 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedegkm figglswdtt kkdlkdyfsk fgevvdctlk ldpitgrsrg 121 kese svdkvmque hklngkvidp krakamktke pvkkifvggl spdtpeekir 181 eyfggfgeve sielpmdnkt nkrrgfcfit fkeeepvkki mekkyhnvgl vams 241 kequqqqu gsrggfagra rgrgggpsqn wnqusnywn qugnygyns quggyggyd 301 ytgynnyygy gdyanqsgy gkvsrrgghq nsykpy Nucleotide ce (variant 3): NCBI Reference Seguence: NM_002138.3 LOCUS 138 ACCESSION NM_002138 1 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6" gccg ctggtgctta ttctttttta cggg agagagcggg agtgtgcgcc 12" gcgcgagagt cgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18" cgcgcgcgcc ttccctgtct tgtgtgcttc taga gcgc ggcagcggcg 24" gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc " ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 36" gcaacggcgg ngtaggcgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 42" cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag cgag 54" gaggatgaag gccattcaaa ctcctcccca cgacactctg aagcagcgac ggcacagcgg 60" gaagaatgga aaatgtttat aggaggcctt agctgggaca ctacaaagaa agatctgaag 66" gactactttt ccaaatttgg tgaagttgta gactgcactc tgaagttaga tcctatcaca 298298 72" gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat cggagagtgt agataaggtc 78" atggatcaaa aagaacataa attgaatggg aaggtgattg atcctaaaag ggccaaagcc 84" atgaaaacaa aagagccggt taaaaaaatt tttgttggtg gcctttctcc agatacacct 90" gaagagaaaa taagggagta ctttggtggt tttggtgagg tggaatccat agagctcccc 96" atggacaaca agaccaataa gaggcgtggg ttctgcttta ttacctttaa ggaagaagaa "O2" ccagtgaaga agataatgga aaagaaatac cacaatgttg gtcttagtaa atgtgaaata "08" aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc tagaggagga "l4" tttgcaggaa gagctcgtgg aagaggtggt gaccagcaga gtggttatgg gaaggtatcc "20" aggcgaggtg gtcatcaaaa tagctacaaa ccatactaaa ttattccatt tgcaacttat "26" cagg tggtgaagca gtattttcca atttgaagat tcatttgaag gtggctcctg "32" ccacctgcta atagcagttc aaactaaatt atca agtccctgaa tggaagtatg "38" acgttgggtc cctctgaagt ctga gttctcatta attt gctttcattg "44" ttttatttct taattgctat gcttcagaat caatttgtgt tttatgccct ttcccccagt "50" attgtagagc aagtcttgtg ttaaaagccc agtgtgacag tgtcatgatg tagtagtgtc "56" ttactggttt tttaataaat ccttttgtat aaaaatgtat tttt atcatcagaa "62" aaat tgtcatggat tcaagttatt aaaagcataa gtttggaaga caggcttgcc "68" gaaattgagg acatgattaa aattgcagtg gaaa tgtttttagc aaaatctaat "74" cata atgtgtcctc cctgtccaaa ttgggaatga cttaatgtca atttgtttgt "80" tggttgtttt aataatactt gtag ccattaagat ttatatgaat attttcccaa "86" atgcccagtt tttgcttaat atgtattgtg ctttttagaa caaatctgga taaatgtgca "92" aaagtacccc tttgcacaga tagttaatgt tttatgcttc cattaaataa aaaggactta "98" aaatctgtta attataatag ggct agttcagaga gatttttaga gctgtggtgg 204" acttcataga tgaattcaag tgttgaggga ggattaaaga aatatatacc gtgtttatgt 210" gtgtgtgctt Protein ce nt 3): NCBI Reference Seguence: NP_002129.2 LOCUS NP_002129 ACCESSION NP_002129 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 299299 61 aesegakida skneedeghs nssprhseaa taqreewkmf igglswdttk kdlkdyfskf 121 gevvdctlkl dpitgrsrgf gfvlfkeses vdkvmqueh klngkvidpk rakamktkep 181 vkkifvggls pdtpeekire yfggfgeves nktn krrgfcfitf keeepvkkim 241 ekkyhnvgls kceikvamsk ug srggfagrar grggdqqsgy gkvsrrgghq 301 nsykpy Nucleotide seguence (variant 4): NCBI nce Seguence: NM_001003810.1 LOCUS NM_001003810 ACCESSION NM_001003810 1 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6" gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12" gcgcgagagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18" cgcgcgcgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 24" gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc " ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc agcg 36" gcaacggcgg ngtaggcgg gggc gagcaggagg gagccatggt ggcggcgaca 42" cagggggcag ngngngC cgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag cgag 54" gaggatgaag ggaaaatgtt tataggaggc cttagctggg caaa gaaagatctg 60" aaggactact tttccaaatt tggtgaagtt gtagactgca agtt agatcctatc 66" acagggcgat caaggggttt tggctttgtg ctatttaaag aatcggagag tgtagataag 72" gtcatggatc aaaaagaaca gaat gggaaggtga ctaa caaa 78" gccatgaaaa caaaagagcc ggttaaaaaa atttttgttg gtggcctttc tccagataca 84" cctgaagaga aaataaggga gtactttggt ggttttggtg aggtggaatc catagagctc 90" cccatggaca acaagaccaa taagaggcgt tgct ttattacctt taaggaagaa 96" gaaccagtga agaagataat gaaa taccacaatg ttggtcttag taaatgtgaa 102" ataaaagtag ccatgtcgaa ggaacaatat cagcaacagc aacagtgggg atctagagga 108" ggatttgcag gaagagctcg tggaagaggt ggtgaccagc gtta tgggaaggta 114" tccaggcgag gtggtcatca aaatagctac aaaccatact aaattattcc atttgcaact 300300 "20" tatccccaac aggtggtgaa gcagtatttt ccaatttgaa gattcatttg aaggtggctc "26" cctg ctaatagcag ttcaaactaa tgta tcaagtccct gaatggaagt "32" atgacgttgg gtccctctga agtttaattc tgagttctca ttaaaagaaa tttgctttca "38" ttgttttatt ttgc tatgcttcag aatcaatttg tgttttatgc cctttccccc "44" agtattgtag agcaagtctt gtgttaaaag cccagtgtga cagtgtcatg atgtagtagt "50" gtcttactgg ttttttaata aatccttttg tataaaaatg tattggctct tttatcatca "56" gaataggaaa aattgtcatg gattcaagtt attaaaagca taagtttgga agacaggctt "62" gccgaaattg aggacatgat taaaattgca gtgaagtttg aaatgttttt agcaaaatct "68" aatttttgcc ataatgtgtc gtcc aaattgggaa tgacttaatg tcaatttgtt "74" tgttggttgt tttaataata cttccttatg tagccattaa tatg aatattttcc "80" caaatgccca gtttttgctt aatatgtatt gtgcttttta atct ggataaatgt "86" gcaaaagtac ccctttgcac agatagttaa tgttttatgc ttccattaaa taaaaaggac "92" ttaaaatctg ttaattataa tagaaatgcg tcag agagattttt agagctgtgg "98" tggacttcat attc aagtgttgag ggaggattaa agaaatatat accgtgttta 204" tgtgtgtgtg ctt Protein sequence (variant 4): NCBI Reference Seguence: NP_001003810.1 LOCUS NP_001003810 ACCESSION NP_001003810 1 mseequgdg aaaaataavg egam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedegkm figglswdtt kkdlkdyfsk fgevvdctlk ldpitgrsrg 121 fgfvlfkese svdkvmque hklngkvidp krakamktke pvkkifvggl ekir 181 eyfggfgeve sielpmdnkt nkrrgfcfit fkeeepvkki mekkyhnvgl vams 241 kequqqqu gsrggfagra rgrggdqqsg ygkvsrrggh qnsykpy OfibblSwnbd:BSG Official Name: basigin (Ok blood group) Gene ID: basigin (Ok blood group) 301301 Organism: Homo sapiens Other Aliases: UNQ6505/PRO21383, 5F7, CD147, EMMPRIN, M6, OK, TCSF Other Designations: CD147 antigen; OK blood group antigen; basigin; collagenase stimulatory factor; extracellular matrix metalloproteinase inducer; yte tion antigen M6; tumor cell-derived collagenase stimulatory iacto r Nucleotide seguence nt 1): NCBI Reference Seguence: NM_001728.3 LOCUS NM_001728 ACCESSION 728 l gtacatgcga gcgtgtgcgc gcgtgcgcag gcggggcgac cggcgtcccc ggcgctcgcc 6" ccgcccccga gatgacgccg tgcgtgcgcg cgcccggtcc gcgcctccgc cgctttttat l2" cgcg ggcggcggcg gcagcggttg gaggttgtag gaccggcgag gaataggaat 18" catggcggct gcgctgttcg tgctgctggg attcgcgctg ctgggcaccc acggagcctc 24" cggggctgcc ggcttcgtcc aggcgccgct gtcccagcag aggtgggtgg ggggcagtgt " ggagctgcac tgcgaggccg tgggcagccc cgag atccagtggt ggtttgaagg 36" gcagggtccc acct gctcccagct ctgggacggc gcccggctgg tcca 42" catccacgcc acctaccacc agcacgcggc cagcaccatc tccatcgaca cgctcgtgga 48" ggaggacacg ggcacttacg agtgccgggc cagcaacgac cgca accacctgac 54" ccgggcgccc aagt gggtccgcgc ccaggcagtc gtgctagtcc ccgg 60" cacagtcttc actaccgtag aagaccttgg ctccaagata ctcctcacct gctccttgaa 66" tgacagcgcc acagaggtca cagggcaccg ctggctgaag gggggcgtgg tgctgaagga 72" ggacgcgctg cccggccaga aaacggagtt caaggtggac tccgacgacc agtggggaga 78" gtactcctgc gtcttcctcc ccgagcccat gggcacggcc cagc tccacgggcc 84" agtg aaggctgtga caga acacatcaac gagggggaga cggccatgct 90" ggtctgcaag tcagagtccg tgccacctgt cactgactgg gcctggtaca ctga 96" ctctgaggac aaggccctca tgaacggctc cgagagcagg ttcttcgtga gttcctcgca 102" gggccggtca gagctacaca ttgagaacct gaacatggag gccgaccccg gccagtaccg 108" gtgcaacggc accagctcca agggctccga ccaggccatc atcacgctcc gcgtgcgcag 114" ccacctggcc gccctctggc ccttcctggg catcgtggct gaggtgctgg tgctggtcac 302302 "20" catcatcttc atctacgaga agcgccggaa gcccgaggac gtcctggatg atgacgacgc "26" cggctctgca cccctgaaga gcagcgggca gcaccagaat gacaaaggca agaacgtccg "32" ccagaggaac tcttcctgag gcaggtggcc cgaggacgct ccctgctcca cgcc "38" gccgccggag tccactccca gtgcttgcaa gattccaagt tctcacctct taaagaaaac "44" ccaccccgta gattcccatc atacacttcc ttta ttgg gttttctcca "50" ttcaggattc tgttccttag gtttttttcc ttctgaagtg tttcacgaga gcccgggagc "56" tgctgccctg cggccccgtc tttc agcctctggg tctgagtcat ggccgggtgg "62" gcggcacagc cttctccact ggccggagtc agtgccaggt ccttgccctt aagt "68" cacaggtcac ggcc ccgtgtcctg cctgtctgaa gccaatgctg tctggttgcg "74" ccatttttgt gcttttatgt ttaattttat gagggccacg ggtctgtgtt cgactcagcc "80" tcagggacga ctctgacctc ttggccacag aggactcact tgcccacacc gagggcgacc "86" ccgtcacagc ctcaagtcac tcccaagccc cctccttgtc tgtgcatccg gctc "92" tggagggggt ttgctgggga actggcgcca tcgccgggac tccagaaccg cagaagcctc "98" tcac ccctggagga cggccggctc tctatagcac cagggctcac gtgggaaccc 204" ccctcccacc caca ataaagatcg cccccacctc caccctcaaa aaaaaaaaaa 2101 aaaaaaa Protein seguence (variant 1): NCBI nce Seguence: NP_001719.2 LOCUS NP_001719 ACCESSION NP_001719 1 vllg fallgthgas gaagquapl sqqrwvggsv elhceavgsp 61 qundtcsql wdgarldrvh ihatthhaa stisidtlve edtgtyecra sndpdrnhlt 121 raprvkwvra qavvlvlepg tvfttvedlg skilltcsln dsatevtghr wlkggvvlke 181 dalpqutef kvdsddqwge pepm gtaniqlhgp prvkavksse taml 241 vcksesvppv tdwawykitd sedkalmngs esrffvsssq grselhienl nmeadpgqyr 301 cngtsskgsd qaiitlrvrs hlaalwpflg ivaevlvlvt iifiyekrrk pedvldddda 361 gsaplkssgq hqndkgknvr qrnss Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_198589.2 LOCUS NM_198589 303303 ION NM_198589 l gtacatgcga gcgtgtgcgc gcgtgcgcag gcggggcgac cggcgtcccc ggcgctcgcc 6" ccgcccccga gatgacgccg tgcgtgcgcg cgcccggtcc ccgc cgctttttat l2" cgcg ggcggcggcg gcagcggttg gtag gaccggcgag gaataggaat 18" catggcggct ttcg tgctgctggg attcgcgctg ctgggcaccc acggagcctc 24" cggggctgcc ggcacagtct tcactaccgt agaagacctt ggctccaaga tactcctcac " cttg aatgacagcg ccacagaggt gcac cgctggctga aggggggcgt 36" ggtgctgaag gaggacgcgc tgcccggcca gaaaacggag ttcaaggtgg acga 42" ccagtgggga gagtactcct gcgtcttcct ccccgagccc atgggcacgg ccaacatcca 48" gctccacggg cctcccagag tgaaggctgt gaagtcgtca gaacacatca acgaggggga 54" gacggccatg ctggtctgca agtcagagtc cgtgccacct gact ggta 60" caagatcact gactctgagg acaaggccct catgaacggc tccgagagca ggttcttcgt 66" gagttcctcg cagggccggt cagagctaca cattgagaac ctgaacatgg accc 72" cggccagtac cggtgcaacg gcaccagctc ctcc gaccaggcca tcatcacgct 78" ccgcgtgcgc agccacctgg ccgccctctg gcccttcctg ggcatcgtgg ctgaggtgct 84" ggtgctggtc accatcatct tcatctacga gaagcgccgg aagcccgagg acgtcctgga 90" tgatgacgac gccggctctg cacccctgaa gagcagcggg cagcaccaga atgacaaagg 96" caagaacgtc cgccagagga actcttcctg aggcaggtgg cccgaggacg ctccctgctc "O2" cacgtctgcg ccgccgccgg agtccactcc cagtgcttgc aagattccaa gttctcacct "08" cttaaagaaa acccaccccg tagattccca tcatacactt ccttcttttt taaaaaagtt "l4" gggttttctc ggat tctgttcctt aggttttttt ccttctgaag tgtttcacga "20" gagcccggga gctgctgccc tgcggccccg tctgtggctt tcagcctctg ggtctgagtc "26" atggccgggt gggcggcaca gccttctcca ctggccggag tcagtgccag gtccttgccc "32" tttgtggaaa gtcacaggtc gggg ccccgtgtcc tgcctgtctg aagccaatgc "38" tgtctggttg cgccattttt gtgcttttat gtttaatttt atgagggcca nggtctgtg "44" ttcgactcag cctcagggac gactctgacc tcttggccac agaggactca caca "50" ccgagggcga ccccgtcaca gcctcaagtc actcccaagc cccctccttg tctgtgcatc "56" cgggggcagc tctggagggg gtttgctggg gaactggcgc catcgccggg actccagaac "62" cgcagaagcc tccccagctc acccctggag gacggccggc tagc accagggctc "68" acgtgggaac ccccctccca cccaccgcca agat cgcccccacc tccaccctca 1741 aaaaaaaaaa aaaaaaaaa 304304 Protein seguence (variant 2): NCBI Reference Seguence: NP_940991.1 LOCUS NP_940991 ACCESSION NP_940991 1 maaalfvllg fallgthgas gaagtvfttv edlgskillt cslndsatev tghrwlkggv 61 lpgq ktefkvdsdd qwgeyscvfl pepmgtaniq lhgpprvkav nege 121 tamlvckses vppvtdwawy dkal mngsesrffv sssqgrselh ienlnmeadp 181 gqyrcngtss kgsdqaiitl rvrshlaalw pflgivaevl vlvtiifiye krrkpedvld 241 dddagsaplk ssgqhqndkg knvrqrnss Nucleotide seguence (variant 3): NCBI Reference Seguence: 590.2 LOCUS NM_198590 ACCESSION NM_198590 1 cccgccagtg tagccacatt cctgcccctt tccagttagc ccttcgcgtt cggcttagtc 6" tgcggtcctc ttgcattgcg actccgagtt taacttccaa cacacacttt caacctccaa 12" gagacgcccc cacctgtgtc gccccaatag cgacttttct ggtc gccgcggaac 18" ttcaagggtc cttcctaccc gcgttgctga gagtctgggt ttacgcgtca cctcgggcgg 24" atcc tccgctcctg aggcccccac aatgaagcag tcggacgcgt ctccccaaga " aagccggcac agtcttcact accgtagaag accttggctc caagatactc ctcacctgct 36" ccttgaatga cagcgccaca gaggtcacag ggcaccgctg gctgaagggg ggcgtggtgc 42" tgaaggagga cgcgctgccc aaaa cggagttcaa ggtggactcc gacgaccagt 48" ggggagagta ctcctgcgtc ttcctccccg agcccatggg cacggccaac atccagctcc 54" acgggcctcc gaag gctgtgaagt cgtcagaaca catcaacgag ggggagacgg 60" ccatgctggt gtca gagtccgtgc tcac tgactgggcc tggtacaaga 66" tcactgactc tgaggacaag atga acggctccga gagcaggttc ttcgtgagtt 72" cctcgcaggg ccggtcagag ctacacattg agaacctgaa catggaggcc gaccccggcc 78" agtaccggtg caacggcacc agctccaagg gctccgacca catc acgctccgcg 84" gcca cctggccgcc ctctggccct tcctgggcat Cgtggctgag gtgctggtgc 90" tggtcaccat catcttcatc tacgagaagc gccggaagcc cgaggacgtc ctggatgatg 305305 96" acgacgccgg ctctgcaccc ctgaagagca gcgggcagca ccagaatgac aaaggcaaga "02" acgtccgcca gaggaactct tcctgaggca ggtggcccga ggacgctccc tgctccacgt "08" ctgcgccgcc gccggagtcc actcccagtg cttgcaagat tccaagttct cacctcttaa "14" agaaaaccca ccccgtagat tcccatcata cacttccttc ttttttaaaa aagttgggtt "20" ttctccattc aggattctgt ggtt tttttccttc tgaagtgttt cacgagagcc "26" cgggagctgc tgccctgcgg ccccgtctgt ggctttcagc ctctgggtct gagtcatggc "32" nggtgggcg cctt ctccactggc cggagtcagt gccaggtcct tgccctttgt "38" ggaaagtcac aggtcacacg aggggccccg tgtcctgcct agcc gtct "44" ggttgcgcca tttttgtgct tttatgttta attttatgag ggccacgggt ctgtgttcga "50" ctcagcctca gggacgactc tgacctcttg gccacagagg actcacttgc ccacaccgag "56" cccg tcacagcctc aagtcactcc caagccccct ccttgtctgt gcatccgggg "62" gcagctctgg agggggtttg aact ggcgccatcg ccgggactcc agaaccgcag "68" cccc cccc tggaggacgg ctct atagcaccag ggctcacgtg "74" ggaacccccc tcccacccac cgccacaata aagatcgccc ccacctccac cctcaaaaaa 1801 aaaaaaaaaa aaaa Protein seguence (variant 3): NCBI Reference Seguence: NP_940992.1 LOCUS NP_940992 ACCESSION 992 1 mgtaniqlhg pprvkavkss ehinegetam svpp vtdwawykit dsedkalmng 61 sesrffvsss qgrselhien lnmeadpgqy rcngtsskgs lrvr shlaalwpfl 121 givaevlvlv tiifiyekrr kpedvldddd agsaplkssg qhqndkgknv rqrnss Nucleotide seguence nt 4): NCBI Reference Seguence: NM_198591.2 LOCUS NM_198591 ACCESSION NM_198591 1 cccgccagtg tagccacatt cctgcccctt tccagttagc ccttcgcgtt cggcttagtc 61 tgcggtcctc ttgcattgcg actccgagtt taacttccaa cacacacttt caacctccaa 121 gagacgcccc cacctgtgtc gccccaatag cgacttttct caccgtggtc gccgcggaac 306306 18" ttcaagggtc accc gcgttgctga gagtctgggt ttacgcgtca cctcgggcgg 24" gacccgatcc tccgctcctg aggcccccac aatgaagcag gcgt ctccccaaga " aagggtggac tccgacgacc agtggggaga gtactcctgc gtcttcctcc ccgagcccat 36" gggcacggcc aacatccagc tccacgggcc tcccagagtg aaggctgtga agtcgtcaga 42" acacatcaac gaga cggccatgct ggtctgcaag tcagagtccg ctgt 48" cactgactgg gcctggtaca agatcactga ctctgaggac aaggccctca gctc 54" cgagagcagg ttcttcgtga gttcctcgca gggccggtca gagctacaca ttgagaacct 60" ggag gccgaccccg gccagtaccg cggc accagctcca agggctccga 66" ccaggccatc atcacgctcc gcgtgcgcag ccacctggcc gccctctggc ccttcctggg 72" catcgtggct gaggtgctgg tgctggtcac catcatcttc atctacgaga agcgccggaa 78" gcccgaggac gtcctggatg atgacgacgc cggctctgca cccctgaaga gcagcgggca 84" gcaccagaat gacaaaggca tccg ccagaggaac tcttcctgag gcaggtggcc 90" cgaggacgct ccctgctcca cgtctgcgcc gccgccggag tccactccca gtgcttgcaa 96" gattccaagt ctct taaagaaaac ccaccccgta gattcccatc atacacttcc "O2" ttctttttta aaaaagttgg gttttctcca ttcaggattc tgttccttag gtttttttcc "08" ttctgaagtg tttcacgaga gcccgggagc tgctgccctg cggccccgtc tttc "l4" agcctctggg tctgagtcat ggccgggtgg gcggcacagc cttctccact agtc "20" agtgccaggt ccttgccctt tgtggaaagt cacaggtcac acgaggggcc ccgtgtcctg "26" cctgtctgaa gccaatgctg tctggttgcg ccatttttgt gcttttatgt ttaattttat "32" gagggccacg ggtctgtgtt cgactcagcc acga ctctgacctc ttggccacag "38" aggactcact tgcccacacc gagggcgacc ccgtcacagc ctcaagtcac tcccaagccc "44" cctccttgtc tgtgcatccg ggggcagctc tggagggggt ttgctgggga actggcgcca "50" ggac tccagaaccg cctc cccagctcac ccctggagga cggccggctc "56" tctatagcac cagggctcac gtgggaaccc cacc caccgccaca ataaagatcg "62" cccccacctc caccctcaaa aaaaaaaaaa aaaaaaa Protein sequence (variant 4): NCBI Reference Seguence: NP_940993.1 LOCUS NP_940993 ACCESSION NP_940993 307307 1 mkqsdaque rvdsddqwge yscvflpepm gtaniqlhgp ksse hinegetaml 61 vppv kitd sedkalmngs sssq grselhienl nmeadpgqyr 121 cngtsskgsd qaiitlrvrs hlaalwpflg ivaevlvlvt iifiyekrrk ddda 181 gsaplkssgq hqndkgknvr qrnss Official Symbol: EIF4A3 al Name: eukaryotic translation initiation factor 4A3 m: 9775 Organism: Homo sapiens Other Aliases: DDX48, NMP265, NUK34, elF4Alll Other Designations: ATP-dependent RNA helicase DDX48; ATP-dependent RNA helicase elF4A-3; DEAD lu-Ala-Asp) box polypeptide 48; DEAD box protein 48; NMP 265; elF-4A-lll; elF4A-lll; eukaryotic initiation factor 4A-lll; eukaryotic initiation factor 4A-like NUK-34; eukaryotic translation initiation factor 4A; hNMP 265; nuclear matrix protein 265 Nucleotide seguence: NCBI Reference Seguence: NM_014740.3 LOCUS 740 ACCESSION NM_014740 1 acgcacgcac cgct ttcgcatact taaggcgtct gttctcggca gcggcacagc 6" gaggtcggca gcggcacagc gaggtcggca gcggcacagc gaggtcggca gcggcacagc 12" gaggtcggca gcggcagcga ggtcggcagc ggcacagcga ggtcggcagc ggcagcgagg 18" tcggcagcgg cgcgcgctgt gctcttccgc ggactctgaa tcatggcgac cacggccacg 24" atggcgacct cgggctcggc gcgaaagcgg ctgctcaaag aggaagacat gactaaagtg " gaattcgaga ccagcgagga ggtggatgtg acccccacgt tcgacaccat gggcctgcgg 36" gaggacctgc tgcggggcat ctacgcttac ggttttgaaa aaccatcagc aatccagcaa 42" cgagcaatca agcagatcat caaagggaga gatgtcatcg ctca gtccggcaca 48" ggaaaaacag tcag tatctcagtc ctccagtgtt tggatattca ggttcgtgaa 54" actcaagctt tgatcttggc tcccacaaga gagttggctg tgcagatcca gaaggggctg 60" cttgctctcg gtgactacat gaatgtccag tgccatgcct gcattggagg caccaatgtt 308308 66" ggcgaggaca tcaggaagct ggattacgga cagcatgttg tcgcgggcac tccagggcgt 72" gtttttgata tgattcgtcg cagaagccta aggacacgtg ctatcaaaat gttggttttg 78" gatgaagctg atgaaatgtt gaataaaggt ttcaaagagc agatttacga tgtatacagg 84" tacctgcctc caca ggtggttctc atcagtgcca cgctgccaca cgagattctg 90" gagatgacca acaagttcat gaccgaccca atccgcatct tggtgaaacg tgatgaattg 96" gaag gcatcaagca atttttcgtg gcagtggaga gggaagagtg gaaatttgac "02" actctgtgtg acctctacga cacactgacc atcactcagg cggtcatctt ctgcaacacc "08" aaaagaaagg tggactggct gacggagaaa atgagggaag ccaacttcac tgtatcctca "14" atgcatggag acatgcccca gaaagagcgg gagtccatca tgaaggagtt gggc "20" gccagccgag tgcttatttc tacagatgtc tgggccaggg ggttggatgt ccctcaggtg "26" tccctcatca ttaactatga tctccctaat aacagagaat tgtacataca cagaattggg "32" ggtc gatacggccg gaagggtgtg gccattaact agaa tgacgacatc "38" cgcatcctca gagatatcga gcagtactat caga ttgatgagat gccgatgaac "44" gttgctgatc ttatctgaag cagcagatca gtgggatgag tgtt cacctgctgt "50" ctgt ttggaagtat ttagatccag attctactta atggggttta tatggacttt "56" cttctcataa atggcctgcc gtctcccttc agag gatatgggga ttctgctctc "62" ttttcttatt tacatgtaaa attg ttctaagtct ttttcattaa aaatttaaaa "68" cttttcccat aaactctata cttctaaggt gccaccacct tctctagtaa ctta n seguence: NCBI Reference Seguence: NP_055555.1 LOCUS NP_055555 ION NP_055555 1 mattatmats gsarkrllke edmtkvefet seevdvtptf dtmglredll rgiyaygfek 61 psaiqqraik qiikgrdvia qsqsgtgkta tfsisvlqcl diquetqal ilaptrelav 121 qiqullalg dymnvqchac iggtnvgedi rkldygqhvv agtpgrvfdm irrrslrtra 181 ikmlvldead emlnkgfkeq iydvyrylpp atqvvlisat emtn kfmtdpiril 241 vkrdeltleg ikqffvaver eewkfdtlcd lydtltitqa vifcntkrkv dwltekmrea 301 nftvssmhgd mqueresim kefrsgasrv listdearg ldquvslii nydlpnnrel 361 yihrigrsgr ygrkgvainf vknddirilr dieqyystqi dempmnvadl i 309309 MTHFD1 Official Symbol: MTHFD1 al Name: methylenetetrahydroiolate dehydrogenase (NADP+ ent) 1, methenyltetrahydrofolate cyclohydrolase, iormyltetrahydrofolate synthetase Gene ID: 4522 Organism: Homo sapiens Other Aliases: MTHFC, MTHFD Other Designations: 5,10-methylenetetrahydroiolate dehydrogenase, 5,10- methylenetetrahydroiolate cyclohydrolase, 10-iormyltetrahydroiolate synthetase; Ctetrahydroiolate synthase, cytoplasmic; C1 -THF synthase; cytoplasmic C tetrahydroiolate synthase Nucleotide seguence: NCBI Reference Seguence: NM_005956.3 LOCUS NM_005956 ACCESSION NM_005956 l aattacggcc ggattccgga gtcctttcca gctccctctt cggccgggtt tcccgccgaa 6" tacaaaggcg cactgtgaac tggctctttc tttccgccaa tcatttccgc cagccattca 12" tcaccgattt tcttcatctt cccctccctc ttccgtcccg cagtccccga cctgttagct 18" ctcggttagt taagggactc gggtccttcc cgca tgcgccaccg cgtctgcagg 24" gggagaagcg ggcaggggcg cagt atcc cctggccagt ccctaagcac " gtgggttggg ttgtcctgct tggctgcgga gggagtggaa cctcgatatt ggtggtgtcc 36" atcgtgggca taat catg gcgccagcag aaatcctgaa cgggaaggag 42" atctccgcgc aaataagggc gagactgaaa aatcaagtca ctcagttgaa agta 48" cctggtttca caccacgcct ggcaatatta caggttggca acagagatga ttccaatctt 54" tatataaatg tgaagctgaa ggctgctgaa gagattggga tcaaagccac tcacattaag 60" ttaccaagaa caaccacaga atctgaggtg taca ttacatcttt gaatgaagac 66" tctactgtac atgggttctt agtgcagcta gatt attc cattaacact 72" gaagaagtga tcaatgctat tgcacccgag aaggatgtgg atggattgac tagcatcaat 78" gctgggaaac gagg tgacctcaat gactgtttca ttccttgtac gcctaaggga 84" tgcttggaac aaga gacaggggtg ccgattgccg gaaggcatgc tgtggtggtt 90.. gggcgcagta aaatagttgg ggccccgatg catgacttgc ttctgtggaa caatgccaca 310310 96" acct gccactccaa gactgcccat ctggatgagg aggtaaataa aggtgacatc "02" ctggtggttg caactggtca gcctgaaatg gttaaagggg agtggatcaa acctggggca "08" atagtcatcg actgtggaat caattatgtc ccagatgata aaaaaccaaa tgggagaaaa "l4" gttgtgggtg atgtggcata cgacgaggcc aaagagaggg cgagcttcat cactcctgtt "20" cctggcggcg tagggcccat gacagttgca atgctcatgc agagcacagt agagagtgcc "26" aagcgtttcc tggagaaatt taagccagga aagtggatga ataa caaccttaac "32" ctcaagacac ctgttccaag tgacattgat atatcacgat cttgtaaacc gaagcccatt "38" ggtaagctgg ctcgagaaat tggtctgctg tctgaagagg tata tggtgaaaca "44" aaggccaaag ttctgctgtc agcactagaa cgcctgaagc accggcctga tgggaaatac "50" gtggtggtga ctggaataac tccaacaccc ctgggagaag ggaaaagcac aatc "56" gggctagtgc aagcccttgg tgcccatctc taccagaatg tctttgcgtg tgtgcgacag "62" ccttctcagg cctt tggaataaaa ggtggcgctg caggaggcgg ctactcccag "68" gtcattccta tggaagagtt taatctccac ctcacaggtg acatccatgc catcactgca "74" gctaataacc tcgttgctgc ggccattgat gctcggatat ttcatgaact gacccagaca "80" gacaaggctc tctttaatcg tttggtgcca tcagtaaatg gagtgagaag gttctctgac "86" atccaaatcc gaaggttaaa gagactaggc attgaaaaga ctgaccctac cacactgaca "92" gatgaagaga taaacagatt tgcaagattg gacattgatc cagaaaccat aacttggcaa "98" agagtgttgg ataccaatga tagattcctg aggaagatca cgattggaca ggctccaacg 204" gagaagggtc acacacggac ggcccagttt gatatctctg gtga aattatggct 210" gtcctggctc tcaccacttc tctagaagac atgagagaga gactgggcaa ggtg 216" gcatccagta agaaaggaga gcccgtcagt gccgaagatc tgggggtgag tggtgcactg 222" acagtgctta tgaaggacgc aatcaagccc aatctcatgc agacactgga gggcactcca 228" gtgtttgtcc gccc caac atcgcacatg cctc catcattgca 234" gaccggatcg agct tgttggccca gaagggtttg tagtgacgga attt 240" ggagcagaca ttggaatgga aaagtttttt aacatcaaat gccggtattc cggcctctgc 246" ccccacgtgg tggtgcttgt tgccactgtc agggctctca agatgcacgg gggcggcccc 252" acggtcactg tgcc tcttcccaag gcttacatac aggagaacct ggagctggtt 258" gaaaaaggct tcagtaactt acaa attgaaaatg ccagaatgtt tcca 264" gtagtagtgg ccgtgaatgc attcaagacg gatacagagt tgga cctcatcagc 270" cgcctttcca atgg ggcttttgat gccgtgaagt gcactcactg ggcagaaggg 311311 276" ggcaagggtg ccttagccct ggctcaggcc gtccagagag cagcacaagc acccagcagc 282" ttccagctcc tttatgacct caagctccca gttgaggata aaatcaggat cattgcacag 288" aagatctatg gagcagatga atta cttcccgaag ctcaacacaa agctgaagtc 294" tacacgaagc agggctttgg gaatctcccc atctgcatgg ctaaaacaca cttgtctttg 300" aacc cagagcaaaa ccct ttca ttctgcccat tcgcgacatc 306" cgcgccagcg ttggggctgg ttttctgtac cccttagtag gaacgatgag cacaatgcct 312" ggactcccca cccggccctg tttttatgat attgatttgg accctgaaac ggtg 318" aatggattat tctaaacaga tcaccatcca tcttcaagaa gctactttga aagtctggcc 324" agtgtctatt cact tagg aagtataagt aagccaagag aagtcagccc 330" ctgcccagaa gatctgaaac taatagtagg ccca gaagtcattt tcagccttaa 336" ttctcatcat gtataaatta acataaatca tgcatgtctg tttactttag tgacgttcca 3421 cagaataaaa ggaaacaagt ttgccatcaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference ce: NP_005947.3 LOCUS NP_005947 ACCESSION NP_005947 l mapaeilngk eisaqirarl knqvtqlkeq vpgftprlai lqvgnrddsn lkaa 6" eeigikathi klprtttese vmkyitslne dstvhgflvq lpldsensin teevinaiap l2" ekdvdgltsi nagklargdl ndcfipctpk gcleliketg vpiagrhavv vgrskivgap 18" mhdlllwnna tvttchskta hldeevnkgd ilvvatgqpe mvkgewikpg aividcginy 24" vpddkkpngr kvvgdvayde akerasfitp vpggvgpmtv amlmqstves akrflekfkp " gkwmiqynnl nlktpvpsdi disrsckpkp igklareigl lseevelyge tkakvllsal 36" erlkhrpdgk yvvvtgitpt sttt iglvqalgah lyqnvfacvr qpsqutfgi 42" kggaagggys efnl hltgdihait aannlvaaai darifheltq tdkalfanv 48" psvngvrrfs diqirrlkrl giektdpttl tdeeinrfar ldidpetitw qrvldtndrf 54" lrkitigqap tekghtrtaq seim avlalttsle dmrerlgkmv vasskkgepv 60" vsga ltvlmkdaik pnlmqtlegt pvahagpfa niahgnssii adrialklvg 66" pegfvvteag fgadigmekf fnikcrysgl cphvvvlvat vralkmhggg ptvtaglpr 72" kayiqenlel nlkk qienarmfgi pvvvavnafk tdteseldli srlsrehgaf 78" davkcthwae ggkgalalaq avqraaqaps sfqllydlkl pvedkiriia qkiygaddie 312312 841 llpeaqhkae vytkqgfgnl picmakthls lshnpequv ptgfilpird irasvgagfl 901 yplvgtmstm pglptrpcfy didldpeteq vnglf EN02 Official Symbol: ENO2 al Name: enolase 2 , neuronal) m: 2026 Organism: Homo sapiens Other s: NSE Other Designations: 2—phospho-D-glycerate hydro-lyase; 2—phospho-D-glycerate hydrolyase; enolase; neural enolase; neuron specific gamma enolase; neuron-specific enolase; neurone-specific enolase Nucleotide seguence: NCBI Reference ce: NM_001975.2 LOCUS NM_001975 ACCESSION NM_001975 l gctc gtacgtgcgc ctccgccggc agctcctgac gggg ctccgggtca 6" catgcgcccg cgcggcccta taggcgcctc ctccgcccgc cgcccgggag ccgcagccgc 12" cgccgccact gccactcccg ctctctcagc gccgccgtcg ccaccgccac cgccaccgcc 18" actaccaccg tctgagtctg cagtcccgag atcccagcca tcatgtccat agagaagatc 24" cggg agatcctgga ctcccgcggg aaccccacag tggaggtgga tctctatact " gccaaaggtc ttttccgggc tgcagtgccc agtggagcct ctacgggcat ctatgaggcc 36" ctggagctga gggatggaga caaacagcgt tacttaggca aaggtgtcct gaaggcagtg 42" gaccacatca ccat cgcgccagcc ctcatcagct caggtctctc tgtggtggag 48" caagagaaac tggacaacct gatgctggag ttggatggga ctgagaacaa atccaagttt 54" ggggccaatg ccatcctggg tgtgtctctg gccgtgtgta aggcaggggc agctgagcgg 60" gaactgcccc tgtatcgcca cattgctcag ctggccggga actcagacct catcctgcct 66" gcct tcaacgtgat caatggtggc tctcatgctg gcaacaagct ggccatgcag 72" gagttcatga tcctcccagt gggagctgag agctttcggg atgccatgcg actaggtgca 313313 78" gaggtctacc atacactcaa gggagtcatc aaggacaaat acggcaagga caat 84" gtgggggatg aaggtggctt tgcccccaat atcctggaga acagtgaagc cttggagctg 90" gtgaaggaag ccatcgacaa ggctggctac acggaaaaga tcgttattgg catggatgtt 96" gctgcctcag agttttatcg tgatggcaaa tatgacttgg acttcaagtc tcccactgat "02" ccttcccgat acatcactgg ggaccagctg ctct accaggactt tgtcagggac "08" tatcctgtgg tctccattga ggacccattt gaccaggatg attgggctgc ctggtccaag "l4" ttcacagcca atgtagggat ccagattgtg ggtgatgacc tgac caacccaaaa "20" cgtattgagc gggcagtgga agaaaaggcc tgcaactgtc tgctgctcaa ggtcaaccag "26" atcggctcgg tcactgaagc catccaagcg tgcaagctgg cccaggagaa tggctggggg "32" gtcatggtga gctc aggagagact gaggacacat tcattgctga cctggtggtg "38" gggctgtgca caggccagat caagactggt gccccgtgcc gttctgaacg tctggctaaa "44" tacaaccagc tcatgagaat tgaggaagag ctgggggatg aagctcgctt tgccggacat "50" aacttccgta atcccagtgt gctgtgattc ctctgcttgc ctggagacgt tctg "56" tctcatcctc ctggaacctt gctgtcctga tctgtgatag ttcaccccct gagatcccct "62" gagccccagg gtgcccagaa cttccctgat tgacctgctc cgctgctcct tggcttacct "68" ttgc tgtctctgct cgccctcctt tctgtgccct actcattggg gttccgcact "74" ttct tcctttctct ttctctcttc aaac tagaaatgtg gatt "80" attataaaag ggggtccgtg gaagaatgat cagcatctgt agcg tcagggttgg "86" tgtgctgagg tgttagagag ggaccatgtg tcacttgtgc tttgctcttg tcccacgtgt "92" cttccacttt gcatatgagc cgtgaactgt gcatagtgct gggatggagg ggagtgttgg "98" gcatgtgatc acgcctggct aataaggctt tagtgtattt atttatttat ttattttatt 204" tgtttttcat tcatcccatt ttcc ccataactca atggcctaaa actggcctga 210" ggaa cgatgtgtct gtatttcatg taga tcccaagatg actggggtgg 216" gaggtcttgc tagaatggga atag aaagggcctt gacatcagtt cctttgtgtg 222" tactcactga agcctgcgtt ggtccagagc ggaggctgtg gggg agttttcctc 228" tatacatctc tccccaaccc taggttccct gttcttcctc cacc agagcaacct 234" ctcactcccc atgccacgtt ccacagttgc caccacctct gtggcattga aatgagcacc 240" tccattaaag tctgaatcag tgc Protein seguence: NCBI Reference Seguence: NP_001966.1 314314 LOCUS NP_001966 ACCESSION 966 1 msiekiware ildsrgnptv evdlytakgl sgas tgiyealelr dgqurylgk 6" gvlkavdhin stiapaliss glsvveqekl dnlmleldgt enkskfgana ilgvslavck 12" agaaerelpl yrhiaqlagn sdlilpvpaf nvinggshag nklamqefmi 1pvgaesfrd 18" amrlgaevyh tlkgvikdky gkdatnvgde ggfapnilen sealelvkea idkagyteki 24" vigmdvaase fyrdgkydld fksptdpsry itgdqlgaly qdfvrdypvv siedpqudd " waawskftan gddl tvtnpkrier aveekacncl llkvnqigsv teaiqack1a 36" qengwgvmvs hrsgetedtf iadlvvglct gqiktgapcr serlakynql mrieeelgde 42" arfaghnfrn psvl ATP5H Official Sym bol: ATP5H al Name: ATP synthase, H+ transporting, mitochondrial Fo x, subunit d Gene ID: 10476 Organism: Homo sapiens Other Aliases: My032, ATPQ Other ations: ATP synthase D chain, mitochondrial; ATP synthase subunit d, mitochondrial; ATP synthase, H+ orting, mitochondrial F0 complex, subunit d; ATP se, H+ transporting, mitochondrial F1 F0, subunit d; ATPase subunit d; My032 protein Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_006356.2 LOCUS 356 ACCESSION NM_006356. 1 tgacccactt ccgttacttg ctgcggagga ccgtgggcag ccagggtcgg tgaaggatcc 61 caaaatggct gggcgaaaac ttgctctaaa tgac tgggtagctt ttgcagagat 121 cataccccag aagg ccattgctag ttccctgaaa tcctggaatg agaccctcac 181 ctccaggttg gctgctttac ctgagaatcc accagctatc gactgggctt actacaaggc 241 caatgtggcc aaggctggct tggtggatga ctttgagaag aagtttaatg cgctgaaggt 315315 301 tcccgtgcca gaggataaat atactgccca ggtggatgcc gaagaaaaag aagatgtgaa 361 atcttgtgct gagtgggtgt ctctctcaaa ggccaggatt gtagaatatg agaaagagat 421 ggagaagatg aagaacttaa ttccatttga gacc attgaggact tgaatgaagc 481 tttcccagaa accaaattag aaaa ctat tggcctcacc aaccaattga 541 gaatttataa aattgagtcc aggaggaagc tctggccctt gtattacaca ttctggacat 601 taaaaataat aattatacag ttaaaaaa Protein seguence (variant 1): NCBI nce Seguence: NP_006347.1 LOCUS NP_006347 ACCESSION NP_006347 1 magrklalkt idwvafaeii pqnqkaiass lkswnetlts rlaalpenpp aidwayykan 61 vakaglvddf ekkfnalkvp vpedkytaqv daeekedvks caewvslska keme 121 kmknlipqu mtiedlneaf petkldkkky pywphqpien 1 Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001003785.1 LOCUS NM_001003785 ACCESSION NM_001003785 1 tgacccactt ccgttacttg ctgcggagga gcag ccagggtcgg tgaaggatcc 6" caaaatggct gggcgaaaac ttgctctaaa aaccattgac tgggtagctt ttgcagagat 12" cataccccag aaccaaaagg ccattgctag gaaa tcctggaatg agaccctcac 18" ctccaggttg gctgctttac ctgagaatcc accagctatc gactgggctt actacaaggc 24" caatgtggcc aaggctggct tggtggatga ctttgagaag aaggtgaaat cttgtgctga " gtgggtgtct ctctcaaagg ccaggattgt agaatatgag aaagagatgg agaagatgaa 36" gaacttaatt ccatttgatc ccat tgaggacttg aatgaagctt tcccagaaac 42" caaattagac aagt atccctattg gcctcaccaa ccaattgaga atttataaaa 48" ccag gaggaagctc tggcccttgt attacacatt ctggacatta aaaataataa 54" ttatacagtt aaaaaa Protein seguence (variant 2): NCBI Reference Seguence: NP_001003785.1 LOCUS NP_001003785 316316 ION NP_001003785 1 alkt idwvafaeii pqnqkaiass lkswnetlts rlaalpenpp aidwayykan 61 vakaglvddf ekkvkscaew vslskarive yekemekmkn tie dlneafpetk 121 ldkkkypywp hqpienl TRAP1 Official Symbol: TRAP1 Official Name: TNF receptor-associated protein 1 Gene ID: 10131 Organism: Homo sapiens Other Aliases: HSP75, HSP90L Other Designations: HSP 75; TNFR-associated protein 1; TRAP-1; heat shock protein 75 kDa, mitochondrial; tumor is factor type 1 receptor associated protein; tumor necrosis factor type 1 receptor-associated protein Nucleotide seguence: NCBI Reference Seguence: NM_016292.2 LOCUS NM_016292 ACCESSION 292 1 gaggaagccc cgccccgcgc agccccgtcc cgccccttcc catcgtgtac ggtcccgcgt 6" ggctgcgcgc ggcgctctgg gagtacgaca tggcgcgcga gctgcgggcg ctgctgctgt 12" ggggccgccg cctgcggcct ttgctgcggg cgccggcgct ggcggccgtg ccgggaggaa 18" aaccaattct gtgtcctcgg aggaccacag cccagttggg ccccaggcga aacccagcct 24" tgca ggcaggacga ctgttcagca ccgc cgaggacaag cccc " cgat tatcagcagc acagagagcg tgcagggttc cacttccaaa catgagttcc 36" aggccgagac aaagaagctt ttggacattg ttgcccggtc cctgtactca gaaaaagagg 42" tgtttatacg ggagctgatc tccaatgcca gcgatgcctt ggaaaaactg cgtcacaaac 48" tggtgtctga cggccaagca ctgccagaaa tggagattca cttgcagacc aatgccgaga 54" aaggcaccat caccatccag gatactggta tgac acaggaagag ctggtgtcca 60" acctggggac gattgccaga tcggggtcaa aggccttcct ggatgctctg cagaaccagg 66" ctgaggccag cagcaagatc atcggccagt ttggagtggg tttctactca gctttcatgg 317317 72" tggctgacag agtggaggtc tattcccgct cggcagcccc ggggagcctg ggttaccagt 78" ggctttcaga tgga gtgtttgaaa tcgccgaagc ttcgggagtt agaaccggga 84" caaaaatcat cctg aaatccgact gcaaggagtt ttccagcgag gcccgggtgc 90" gagatgtggt aacgaagtac agcaacttcg tcagcttccc cttgtacttg aatggaaggc 96" ggatgaacac cttgcaggcc atga tggaccccaa ggatgtccgt gagtggcaac "O2" atgaggagtt ctac gtcgcgcagg ctcacgacaa gccccgctac accctgcact "08" ataagacgga cgcaccgctc aacatccgca tcta cgtgcccgac atgaaaccgt "l4" ccatgtttga tgtgagccgg gagctgggct ccagcgttgc actgtacagc cgcaaagtcc "20" tcatccagac caaggccacg gacatcctgc ccaagtggct gcgcttcatc Cgaggtgtgg "26" tggacagtga ggacattccc ctgaacctca gccgggagct gctgcaggag agcgcactca "32" tcaggaaact ccgggacgtt ttacagcaga ggctgatcaa attcttcatt gaccagagta "38" aaaaagatgc tgagaagtat gcaaagtttt ttgaagatta cggcctgttc atgcgggagg "44" gcattgtgac cgccaccgag caggaggtca aggaggacat gctg ctgcgctacg "50" cggc gctgccctcc gggcagctaa ccagcctctc cgcc agccgcatgc "56" gggccggcac ccgcaacatc tactacctgt gcgcccccaa ccgtcacctg gcagagcact "62" caccctacta tgaggccatg aagaagaaag acacagaggt tctcttctgc tttgagcagt "68" ttgatgagct caccctgctg caccttcgtg agtttgacaa gaagaagctg atctctgtgg "74" acat agtcgtggat cactacaagg aggagaagtt cagg tccccagccg "80" ccgagtgcct atcagagaag gagg agctcatggc ctggatgaga aatgtgctgg "86" ggtcgcgtgt caccaacgtg aaggtgaccc tccgactgga cacccaccct gccatggtca "92" ccgtgctgga gatgggggct gcccgccact gcat gcagcagctg gccaagaccc "98" aggaggagcg cgcacagctc ctgcagccca cgctggagat caaccccagg cacgcgctca 204" tcaagaagct gaatcagctg cgcgcaagcg agcctggcct ggctcagctg ctggtggatc 210" agatatacga gaacgccatg attgctgctg gacttgttga tagg gccatggtgg 216" gccgcttgaa tgagctgctt gtcaaggccc tggagcgaca gcca gggggccaga 222" aggactgaca ccacagatga cacc tccttgagct ttatttacct aaatttaaag 2281 gtatttctta aaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_057376.2 LOCUS NP_057376 318318 ACCESSION N 76 l alll wgrrlrpllr apalaavpgg kpilcprrtt aqlgprrnpa wslqagrlfs 61 tqtaedkeep lhsiisstes vqgstskhef qaetkklldi varslyseke vfirelisna 121 lrhk alpe meihlqtnae kgtitithg igmtqeelvs nlgtiarsgs 181 kafldalqnq aeasskiigq fgvgfysafm vadrvevysr lqu wlsdgsgvfe 241 iaeasgvrtg tkiiihlksd ckefssearv rdvvtkysnf vsfplylngr 301 mdpkdvrewq heefyryvaq ahdkprytlh yktdaplnir sifyvpdmkp smfdvsrelg 361 ssvalysrkv liqtkatdil pkwlrfirgv vdsediplnl srellqesal irklrdvqu 421 rlikffidqs kkdaekyakf fedyglfmre givtateqev kediakllry sgql 481 tslseyasrm ragtrniyyl capnrhlaeh spyyeamkkk cfeq fdeltllhlr 541 efdkkklisv etdivvdhyk eekfedrspa aeclsekete elmawmrnvl gsrvtnvkvt 501 lrldthpamv tvlemgaarh flrmqqlakt qeeraqllqp tleinprhal ikklnqlras 661 epglaqllvd qiyenamiaa glvddpramv grlnellvka lerh SDHA Official Symbol: SDHA Official Name: succinate dehydrogenase complex, subunit A, flavoprotein (Fp) w:6389 sm: Homo sapiens Other Aliases: CMD1GG, FP, PGL5, SDH1, SDH2, SDHF Other Designations: flavoprotein subunit of complex II; succinate dehydrogenase [ubiquinone] flavoprotein t, mitochondrial; succinate dehydrogenase complex flavoprotein subunit Nucleotide sguence: NCBI Reference Seguence: NM_004168.2 LOCUS NM_004168 ACCESSION NM_004168 1 tccggcgtgg tgcgcaggcg cggtatcccc cctcccccgc cagctcgacc tgqt 61 gcgcaggcgc agtctgcgca gggactgch ggactgcgcg gcggcaacag cagacatgtc 319319 12" gggggtccgg ggcctgtcgc ggctgctgag cgctcggcgc ctggcgctgg ccaaggcgtg 18" gccaacagtg ttgcaaacag gaacccgagg ttttcacttc actgttgatg ggaacaagag 24" ggcatctgct aaagtttcag attccatttc tgctcagtat ccagtagtgg atcatgaatt " tgatgcagtg gtggtaggcg ctggaggggc aggcttgcga gctgcatttg gcctttctga 36" ggcagggttt aatacagcat gtgttaccaa gctgtttcct accaggtcac acactgttgc 42" agcacaggga ggaatcaatg ctgctctggg gaacatggag gaggacaact ggaggtggca 48" tttctacgac accgtgaagg gctccgactg gctgggggac caggatgcca tccactacat 54" gacggagcag gcccccgccg ccgtggtcga gctagaaaat tatggcatgc cgtttagcag 60" aactgaagat gggaagattt atcagcgtgc tgga cagagcctca agtttggaaa 66" gggcgggcag gcccatcggt gctgctgtgt ggctgatcgg actggccact tgca 72" caccttatat ggaaggtctc tgcgatatga taccagctat tttgtggagt attttgcctt 78" cctg atggagaatg gggagtgccg tggtgtcatc gcactgtgca acgg 84" gtccatccat cgcataagag acac tgcc acaggaggct acgggcgcac 90" ctacttcagc tgcacgtctg cccacaccag cactggcgac ggcacggcca tgatcaccag 96" ggcaggcctt ccttgccagg acctagagtt tgttcagttc caccctacag gcatatatgg "02" tgctggttgt ctcattacgg gtcg tggagaggga ggcattctca ttaacagtca "08" aggcgaaagg tttatggagc gatacgcccc tgtcgcgaag gacctggcgt ctagagatgt "14" ggtgtctcgg tccatgactc tggagatccg agaaggaaga ggcc ctgagaaaga "20" tcacgtctac ctgcagctgc accacctacc gcag ctggccacgc gcctgcctgg "26" catttcagag acagccatga tcttcgctgg cgtggacgtc acgaaggagc cgatccctgt "32" cctccccacc gtgcattata acatgggcgg cacc aactacaagg ggcaggtcct "38" gaggcacgtg aatggccagg atcagattgt gcccggcctg tacgcctgtg ccgc "44" ctgtgcctcg gtacatggtg ccaaccgcct cggggcaaac tcgctcttgg ttgt "50" ctttggtcgg gcatgtgccc tgagcatcga agagtcatgc aggcctggag ataaagtccc "56" tccaattaaa ccaaacgctg aatc gaat cttgacaaat tgagatttgc "62" tgatggaagc ataagaacat cggaactgcg actcagcatg tcaa tgcaaaatca "68" tgctgccgtg ttccgtgtgg gaagcgtgtt gcaagaaggt tgtgggaaaa tcagcaagct "74" ctatggagac ctaaagcacc tgaagacgtt cgaccgggga atggtctgga acacggacct "80" ggtggagacc ctggagctgc agaacctgat gctgtgtgcg acca gagc "86" agaggcacgg aaggagtcac ggggcgcgca tgccagggaa gactacaagg tgcggattga 320320 1921 tgagtacgat tactccaagc ccatccaggg gcaacagaag aagccctttg aggagcactg 1981 gaggaagcac accctgtcct atgtggacgt tggcactggg aaggtcactc tggaatatag 2041 gatc gacaaaactt tgaacgaggc tgactgtgcc accgtcccgc cagccattcg 2101 ctcctactga tgagacaaga tgtggtgatg acagaatcag cttttgtaat taat 2161 agctcatgca tgtgtccatg tcataactgt cttcatacgc ttctgcactc tggggaagaa 2221 ggagtacatt gaagggagat tggcacctag tggctgggag cttgccagga acccagtggc 2281 cagggagcgt ggcacttacc tttgtccctt gcttcattct tgtgagatga taaaactggg 2341 cacagctctt aaataaaata taaatgaaca aactttcttt caaa aaaaaaaaaa 2401 aaaaa n ce: NCBI Reference Seguence: NP_004159.2 LOCUS NP_004159 AQQE§§IQN N P_004159 1 msgvrglsrl lsarrlalak awptvlqtgt rgfhftvdgn krasakvsds isaqypvvdh 61 efdavvvgag gaglraafgl seagfntacv tklfptrsht vaaqgginaa dnwr 121 whfydtvkgs dwlgdqdaih paav velenygmpf srtedgkiyq rafggqslkf 181 gkggqahrcc cvadrtghsl lhtlygrslr ydtsyfveyf aldllmenge crgvialcie 241 dgsihrirak ntvvatggyg rtyfsctsah tstgdgtami traglpchl equfhptgi 301 ygagcliteg crgeggilin sqgerfmery apvakdlasr dvvsrsmtle cgpe 361 lhh lppeqlatrl pgisetamif agvdvtkepi pvlptvhynm ggiptnykgq 421 vlrhvnquq ivpglyacge aacasvhgan rlganslldl cals ieescrpgdk 481 vppikpnage esvmnldklr fadgsirtse lrlsmqksmq nhaavfrvgs quegcgkis 541 klygdlkhlk vwnt dlvetlelqn lmlcalqtiy gaearkesrg aharedykvr 601 ideydyskpi ngqkkpfee hwrkhtlsyv dvgtgkvtle yrpvidktln eadcatvppa 661 irsy TPMA Official S mbol: TPM4 Official Name: tropomyosin 4 321321 m: 7171 Organism: Homo sapiens Other Aliases: Other Designations: TM30p1;tropomyosin alpha-4 chain; tropomyosin-4; Nucleotide seguence (variant 1): NCBI Reference ce: NM_001145160.1 LOCUS NM_001145160 ACCESSION NM_001145160 l ataaggccct ctcctccacc ctgccaggct cactctgccc cacagccaca gcccctgact 6" gccgcagccc ccacagagcc cgccgcgcac cccacgtccc ccacgccagc gcccagccat 12" ggaggccatc aagaagaaaa tgcagatgct gaagttggac aaggagaatg ccatcgaccg 18" cgcggagcag gcggaggcgg ataagaaagc cgctgaggac aagtgcaagc aggtggagga 24" ggagctgacg cacctccaga agaaactaaa agggacagag gacgagctgg attc " cgaggacctg aaggacgcgc aggagaagct ggagctcacg gagaagaagg cctccgacgc 36" tgaaggtgat gtggccgccc tcaaccgacg catccagctc gttgaggagg agttggacag 42" ggctcaggaa cgactggcca cggccctgca gaagctggag gaggcagaaa aagctgcaga 48" tgagagtgag agaggaatga aggtgataga ggcc gatg aggagaagat 54" ggagattcag gagatgcagc tcaaagaggc caagcacatt gcggaagagg ctgaccgcaa 60" atacgaggag cgta tcat cctggagggt gagctggaga gggcagagga 66" gcgtgcggag gtgtctgaac taaaatgtgg tgacctggaa gaagaactca agaatgttac 72" taacaatctg aaatctctgg aggctgcatc tgaaaagtat tctgaaaagg aata 78" tgaagaagaa attaaacttc tgtctgacaa actgaaagag gctgagaccc gtgctgaatt 84" tgcagagaga gcaa aactggaaaa gacaattgat gacctggaag agaaacttgc 90" caaa gaagagaacg tgggcttaca tcagacactg gatcagacac taaacgaact 96" taactgtata taagcaaaac agaagagtct aaca gaaactctgg gtgg 102" gtctttctct tctcttgtaa gaagttcctt ttgttattgc catcttcgct ttgctggaaa 108" tgtcaagcaa attatgaata catgaccaaa tattttgtat cggagaagct ttgagcacca 114" gttaaatctc attccttccc tttttttttc aaatggcacc agctttttca gctctcttat 120" tttttcctta agtagcattt attcctaagg taggcagggt tagt aagcatactt 322322 "26" tcttaagacg attt ggttcctggg agaataggca gccccacact ttgaagaata "32" cagaccccag tatctagtcg tggatataat taaaacgctg aagaccataa ccttttgggt "38" caactgttgg tcaaactata ggagagacca atca catgggtagg gattttccat "44" ccagagccaa taaaaggact ggtgggggcc gggggtggct attgtgggaa gtcataaccc "50" acagatagat aaga atcctggccc ttctccactc tccaccatgc aggacaaaca "56" tcttctcaag cagtcaacgt agaatgcttg ggaaatagtc ataattaccc acatatagta "62" attaatagat ggtaattaat tgatccttga tgtgatgttc atat catt "68" ctaaagttgt tccctggccg ggagcgttgg ctttcgcctg taatcccaac actttgggag "74" gccaggacag atcacttgag gtcaggagtt cgagaccagc ccagccaaca tggcgaaacc "80" atgtctctac taaaaataca aaaattatgg tgacgcctgc ctgtagtccc tcgg "86" gaggctgagg caggaggatc gcttgaaccc aggaagtgga gactgcagtg agccgatatc "92" gcaccacagc gcct ggtcgacaga gtgagactcc atctcaagaa aaaataaaaa "98" tgtt agag caaatgtctc gtaa tgacccactc agcaggaata 204" tggtggagtt cagtccaatt caggtcagcc atatccaaaa gaccacaagt cattactaag 210" aaaa gagtttttat ctattagcag cctc tctggcagca gagattaaaa 216" actggcccaa cttcatttcc atacttcagg gaacagcaaa ttgaggattt acttatctag 222" gacttgaatt ccttctttgg gaccaagtta ataaaagacc aagaaactcc tgattaaact 228" ggataatgaa ggattctgta gacagggctg cacgtatcgg ctttgtttga cttctctttt 234" ctcagttaac atctcagagc tagaacattc cacattcccc agcagcgtgt gggggctgac 240" taaagtttac aattccaact aaaaatcacc ctgcttctgg cttatctgaa tcccttaccc 246" accccacccc accaccctac tcctatttat tcagcaccac actacccagg aaatacacta 252" tgtg caatggaata aaatccacac tttagattct tgcaactgta tcatatgtaa 2581 tagtatcact ttttctacat tttggtcaaa taaattttta ctac Protein seguence (variant 1): NCBI Reference Seguence: NP_001138632.1 LOCUS NP_001 138632 ACCESSION NP_001138632 1 meaikkkmqm lkldkenaid raeqaeadkk aaedkckqve eelthlqkkl kgtedeldky 61 sedlkdaqek leltekkasd alnr eeld raqerlatal qkleeaekaa 121 desergmkvi enramkdeek meiqemqlke akhiaeeadr kyeevarklv ilegelerae 323323 181 eraevselkc gdleeelknv tnnlksleaa sekysekedk llsd klkeaetrae 241 faertvakle ktiddleekl aqakeenvgl hqtldqtlne lnci Nudeofideseguence: NCBI Reference Seguence (variant 2): NM_003290.2 LOCUS NM_003290 ACCESSION NM_003290 l tttccagcag ctgtggccag cggtgccgac ccct cccccagcgg tgctgacgtc 6" ggcggtccgg ccgggtgacc tcatcgcccc gacggcagcc ggcccggggg gcggggagag l2" gcgggggcgg cccccgcgca ggcaaaggct tggggggccg gggcgcggct gtgcagctct 18" cgccggagcc gagcccagcc gagcgtccgc ccgt gcgcctctgc gcctccgcgc 24" catggccggc ctcaactccc tggaggcggt gaaacgcaag atccaggccc agca " ggcggacgag gcggaagacc aggg cctgcagcgg gacg gcgagcgcga 36" gcggcgcgag aaagctgaag gtgatgtggc cgccctcaac cgacgcatcc agctcgttga 42" ggaggagttg gacagggctc aggaacgact ggccacggcc ctgcagaagc tggaggaggc 48" agaaaaagct gcagatgaga gtgagagagg ggtg atagaaaacc gggccatgaa 54" ggatgaggag aagatggaga ttcaggagat gcagctcaaa aagc cgga 60" agaggctgac tacg aggaggtagc tcgtaagctg gtcatcctgg agggtgagct 66" ggagagggca gaggagcgtg cggaggtgtc tgaactaaaa tgtggtgacc tggaagaaga 72" actcaagaat gttactaaca atctgaaatc tctggaggct gcatctgaaa agtattctga 78" aaaggaggac aaatatgaag aagaaattaa acttctgtct gacaaactga ctga 84" gacccgtgct gaatttgcag agagaacggt tgcaaaactg gaaaagacaa ttgatgacct 90" ggaagagaaa cttgcccagg ccaaagaaga gaacgtgggc ttacatcaga cactggatca 96" aaac gaacttaact gtatataagc aaaacagaag agtcttgttc caacagaaac 102" tctggagctc cgtgggtctt ctct tgtaagaagt tccttttgtt attgccatct 108" tcgctttgct ggaaatgtca agcaaattat gaatacatga ccaaatattt tgtatcggag 114" aagctttgag caccagttaa atctcattcc ttcccttttt ttttcaaatg gcaccagctt tttcagctct cttatttttt ccttaagtag catttattcc taaggtaggc agggtatttc 126" ctagtaagca tactttctta agacggaggc catttggttc gaat aggcagcccc 132" acactttgaa gaatacagac cccagtatct agtcgtggat ataattaaaa cgctgaagac 324324 "38" cataaccttt tgggtcaact gttggtcaaa gaga gaccagggac catcacatgg "44" gtagggattt tccatccaga gccaataaaa ggactggtgg gggccggggg tggctattgt "50" gggaagtcat aacccacaga tagatcaacc taagaatcct ggcccttctc cactctccac "56" catgcaggac aaacatcttc agtc aacgtagaat gcttgggaaa taat "62" tacccacata tagtaattaa tagatggtaa ttaattgatc cttgatgtga tgttcttttg "68" catatttcct tcattctaaa gttgttccct ggccgggagc tttc gcctgtaatc "74" ccaacacttt gggaggccag gacagatcac ttgaggtcag gagttcgaga ccagcccagc "80" caacatggcg aaaccatgtc tctactaaaa atacaaaaat gacg cctgcctgta "86" gtcccagcta ctcgggaggc tgaggcagga ggatcgcttg aacccaggaa gtggagactg "92" cagtgagccg atatcgcacc ctcc gtcg acagagtgag actccatctc "98" aagaaaaaat aaaaataaag ttgttctctg aagagcaaat gtctcattcc agtaatgacc 204" cactcagcag gaatatggtg gagttcagtc caattcaggt cagccatatc acca 210" caagtcatta ctaagttgag caaaagagtt tttatctatt agcagaaagg gcctctctgg 216" cagcagagat taaaaactgg cccaacttca tttccatact tcagggaaca gcaaattgag 222" gatttactta actt gaattccttc tttgggacca agttaataaa agaa 228" actcctgatt gata gatt ctgtagacag ggctgcacgt atcggctttg 234" tttgacttct cttttctcag ttaacatctc agagctagaa cattccacat tccccagcag 240" Cgtgtggggg ctgactaaag tttacaattc caactaaaaa tcaccctgct tctggcttat 246" ctgaatccct tacccacccc accccaccac cctactccta tttattcagc accacactac 252" ccaggaaata cactagcaaa ttgtgcaatg gaataaaatc cacactttag attcttgcaa 258" cata tgtaatagta tttc tacattttgg tcaaataaat ttttacataa 2641 actac Protein seguence (variant 2): NCBI Reference Seguence: NP_003281.1 LOCUS NP_003281 ACCESSION NP_003281 1 maglnsleav krkiqaquq adeaedraqg lqreldgere rrekaegdva alnrriqlve 61 eeldraqerl atalqkleea ekaadeserg mkvienramk deekmeiqem qlkeakhiae 121 eadrkyeeva rklvilegel eraeeraevs elkcgdleee 1knvtnn1ks leaasekyse 181 kedkyeeeik llsdklkeae traefaertv aklektiddl eeklaqakee nvglhqtldq 325325 241 tlnelnci Official Symbol: ETFA Official Name: electron-transfer-f|avoprotein, alpha ptide M1 2108 Organism: Homo sapiens Other Aliases: EMA, GA2, MADD Other Designations: alpha-ETF; electron transfer flavoprotein alpha-subunit; electron transfer flavoprotein subunit alpha, mitochondrial; electron transfer flavoprotein, alpha polypeptide; glutaric aciduria || Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_000126.3 LOCUS NM_000126 ACCESSION NM_000126 l attaggtgac tggctgaggc ggcgccagtt ggccgggcac ggggctgctg taaggccgag 6" gttgcggcgg aagcggagac catgttccga gctc agct ccggcgggcg 12" gcctcattgc tacgatttca gagtaccctg gtaatagctg agcatgcaaa tgattcccta 18" atta ctttaaatac cattactgca gccacacgcc ttggaggtga agtgtcctgc 24" ttagtagctg aatg ggtg gcacaagatc tctgtaaagt agcaggcata " gcaaaagttc tggtggctca gcatgatgtg tacaaaggcc tacttccaga ggaactgaca 36" ccattgattt tggcaactca gaagcagttc aattacacac acatctgtgc tggagcatct 42" ggaa agaacctttt gcccagagta gcagccaaac ttgc cccgatttct 48" gacatcattg caatcaagtc acctgacaca tttgtgagaa ctatttatgc aggaaatgct 54" ctatgtacag tgaagtgtga tgagaaagtg tttt ctgtccgtgg aacatccttt 60" gatgctgcag caacaagtgg cggtagtgcc agttcagaaa aggcatcaag tacttcacca 66" gtggaaatat cagagtggct tgaccagaaa ttaacaaaaa gtgatcgacc agagctaaca 72" ggtgccaaag tggtggtatc tggtggtcga aaga gtggagagaa ctttaagttg 78" ttatatgact tggcagatca actacatgct ggtg cttcccgtgc tgctgttgat 84" gctggctttg ttcccaatga catgcaagtt ggacagacgg gaaaaatagt agcaccagaa 326326 90" ctttatattg ctgttggaat atctggagcc atccaacatt tagctggga: gaaagacagc 96" aagacaattg tggcaattaa taaagaccca gaagctccaa ttttccaag: ggcagattat 102" ggaatagttg cagatttatt taaggtagtt cctgaaatga ctgagatat: aaaa 108" tgaatcagga ctta aaaagaaaac ttttgttaaa gtattccac: gaaatcacag 114" atatttgtgg gtattataac aatcattgga aagcatggag agctacatt: cataatttga 120" gggaaaattt ctaacagatg ccagaatgct tgtttatggg attgctgtg: ttccttttaa 126" ttatttgtgg ttccaaacaa ttattgtttg aactttttaa attctgtac: aaaatctata 132" ataaagcttt tccacagctt taaaactatc agaaaaaaaa aaaaaaaaa Protein seguence (variant 1 ): NCBI Reference Seguence: NP_000117.1 LOCUS NP_OOO117 ACCESSION NP_OOO117 1 mfraaapgql rraasllrfq stlviaehan dslapitlnt itaatrlgge vsclvagtkc 61 dkvaqdlckv agiakvlvaq hdvykgllpe eltplilatq kqfnythica knll 121 prvaakleva pisdiiaiks pdtfvrtiya gnalctvkcd ekvkvfsvrg tsfdaaatsg 181 gsassekass tspveisewl qultksdrp eltgakvvvs ggrglksgen ladq 241 lhaavgasra avdagfvpnd mqvgqtgkiv avgi sgaithagm kdsktivain 301 kdpeapifqv adlf kvvpemteil kkk Nucleotide sequence nt 2): NCBI Reference Seguence: NM_001127716.1 LOCUS NM_001127716 ACCESSION NM_001127716 1 attaggtgac tggctgaggc ggcgccagtt gcac ggggctgctg taaggccgag 61 gttgcggcgg aagcggagac catgttccga gcggcggctc cggggcagct ccggcgggcg 121 caag atctctgtaa agtagcaggc atagcaaaag ttctggtggc tcagcatgat 181 aaag gcctacttcc agaggaactg acaccattga ttttggcaac tcagaagcag 241 ttcaattaca cacacatctg tgctggagca tctgccttcg gaaagaacct tttgcccaga 301 gtagcagcca aacttgaggt tgccccgatt tctgacatca ttgcaatcaa gtcacctgac 327327 36" acatttgtga gaactattta tgcaggaaat gctctatgta cagtgaagtg tgatgagaaa 42" gtgaaagtgt tttctgtccg tggaacatcc tttgatgctg cagcaacaag tggcggtagt 48" gccagttcag aaaaggcatc aagtacttca ccagtggaaa tatcagagtg gcttgaccag 54" acaa aaagtgatcg accagagcta acaggtgcca aagtggtggt atctggtggt 60" cgaggcttga agagtggaga gaactttaag ttgttatatg acttggcaga tcaactacat 66" gctgcagttg gtgcttcccg tgctgctgtt gatgctggct ccaa tgacatgcaa 72" gttggacaga cgggaaaaat ag:agcacca gaactttata ttgctgttgg aatatctgga 78" gccatccaac atttagctgg agac acaa ttgtggcaat taataaagac 84" ccagaagctc caattttcca ag:ggcagat tatggaatag ttgcagattt atttaaggta 90" gttcctgaaa tgactgagat at:gaagaaa atca ggatcatgcc ttaaaaagaa 96" aacttttgtt aaagtattcc ac:gaaatca cagatatttg tgggtattat aacaatcatt 102" ggaaagcatg gagagctaca tt:cataatt tgagggaaaa tttctaacag atgccagaat 108" gcttgtttat gggattgctg tg:ttccttt taattatttg tggttccaaa caattattgt 114" tttt taaattctgt ac:aaaatct ataataaagc ttttccacag ctttaaaact 1201 atcagaaaaa aaaaaaaaaa aa Protein sequence (variant 2): NCBI Reference Seguence: NP_001121188.1 LOCUS 121188 ACCESSION NP_001121188 1 mfraaapgql rravaqdlck vlva gllp eeltplilat qkqfnythic 61 agasafgknl lprvaaklev apisdiiaik spdtfvrtiy agnalctvkc dekvkvfsvr 121 gtsfdaaats ggsassekas isew lqultksdr peltgakvvv sggrglksge 181 dlad qlhaavgasr aavdagfvpn quvgqtgki vapelyiavg isgaithag 241 ivai nkdpeapifq vadygivadl fkvvpemtei 1kkk RPL8 Official Sym bol: RPL8 Official Name: ribosomal protein L8 m: 6132 Organism: Homo sapiens 328328 Othe r Aliases: L8 Other Designations: 608 ribosomal protein L8 Nucleotide seguence (variant 1): NCBI Reference Seguence : NM_000973.3 LOCUS 973 ACCESSION NM_000973 1 agataaggcc gctcgctgac tttc ctctttcggc cgcgctggtg aacaggaccc 6" gtcgccatgg gccgtgtgat acag aggaagggcg ccgggtctgt gttccgcgcg 12" cacgtgaagc accgtaaagg gcgc ctgcgcgccg tggatttcgc tgagcggcac 18" ggctacatca agggcatcgt caaggacatc atccacgacc cgggccgcgg cctc 24" gccaaggtgg tcttccggga tccgtatcgg tttaagaagc ggacggagct gttcattgcc " gccgagggca ttcacacggg ccagtttgtg tattgcggca agaaggccca catt 36" ggcaatgtgc tccctgtggg caccatgcct gagggtacaa tcgtgtgctg cctggaggag 42" aagcctggag accgtggcaa gctggcccgg gcatcaggga actatgccac cgttatctcc 48" cacaaccctg agaccaagaa gacccgtgtg aagctgccct ccggctccaa gaaggttatc 54" tcctcagcca acagagctgt ggttggtgtg gtggctggag gtggccgaat tgacaaaccc 60" aagg ctggccgggc gtaccacaaa tataaggcaa agaggaactg ctggccacga 66" gtacggggtg tggccatgaa tcctgtggag catccttttg gcaa ccaccagcac 72" atcggcaagc cctccaccat agat gcccctgctg gccgcaaagt gggtctcatt 78" gctgcccgcc ggactggacg tctccgggga accaagactg tgcaggagaa agagaactag 84" tgctgagggc aaag tttgtgttta tgccaaaaaa aaaaaaaaaa aaaaaaaaaa 90" aaa Protein seguence (variant 1): NCBI Reference Seguence: NP_000964.1 LOCUS NP_000964 ACCESSION NP_000964 1 mgrvirgqu gagsvfrahv khrkgaarlr rhgy ikgivkdiih dpgrgaplak 61 vvfrdpyrfk krtelfiaae gihtgqfvyc gkkaqlnign vlpvgtmpeg tivccleekp 121 gdrgklaras gnyatvishn petkktrvkl psgskkviss anravvgvva gggridkpil 329329 181 kagrayhkyk akrncwprvr gvamnpvehp fgggnhqhig kpstirrdap agrkvgliaa 241 rrtgrlrgtk tvqeken Nucleotide seguence nt 2): NCBI Reference Seguence: 301.1 LOCUS NM_033301 ACCESSION NM_033301 1 gcggcatggg cagtatccgc cgccatcctc gagg cgcgctgaga cccggaccgg 6" ccctcctgag aggatgccgg tgcgggcgcc cgcggagagg tcgc catgggccgt 12" gtgatccgtg gacagaggaa gggcgccggg tctgtgttcc gcgcgcacgt gaagcaccgt 18" aaaggcgctg cgcgcctgcg cgccgtggat ttcgctgagc ggcacggcta catcaagggc 24" atcgtcaagg acatcatcca cgacccgggc cgcggcgcgc ccctcgccaa ggtggtcttc " cgggatccgt atcggtttaa gaagcggacg gagctgttca ttgccgccga gggcattcac 36" acgggccagt ttgtgtattg cggcaagaag gcccagctca acattggcaa tgtgctccct 42" gtgggcacca tgcctgaggg tacaatcgtg tgctgcctgg aggagaagcc tggagaccgt 48" ggcaagctgg cccgggcatc agggaactat gccaccgtta tctcccacaa ccctgagacc 54" aagaagaccc gtgtgaagct gccctccggc tccaagaagg ttatctcctc agccaacaga 60" gctgtggttg gtgtggtggc tggaggtggc cgaattgaca aacccatctt gaaggctggc 66" tacc acaaatataa gagg aactgctggc cacgagtacg gggtgtggcc 72" atgaatcctg tggagcatcc ttttggaggt ggcaaccacc agcacatcgg caagccctcc 78" accatccgca gagatgcccc tgctggccgc aaagtgggtc ctgc ccgccggact 84" ggacgtctcc ggggaaccaa gactgtgcag gagaaagaga actagtgctg agggcctcaa 90" taaagtttgt gtttatgcca aaaaaaaaaa aaaa aaaaaaaaaa aaaaaaaaaa 96" aaaaaaa Protein seguence (variant 2): NCBI nce Seguence: NP_150644.1 LOCUS NP_150644 ACCESSION NP_150644 1 gqu gagsvfrahv khrkgaarlr avdfaerhgy ikgivkdiih dpgrgaplak 61 yrfk krtelfiaae gihtgqfvyc gkkaqlnign vlpvgtmpeg tivccleekp 330330 121 gdrgklaras gnyatvishn petkktrvkl psgskkviss anravvgvva gggridkpil 181 kagrayhkyk akrncwprvr gvamnpvehp fgggnhqhig kpstirrdap agrkvgliaa 241 rrtgrlrgtk tvqeken Official Symbol: ARCN1 Official Name: archain 1 m: 372 Organism: Homo sapiens Other Aliases: COPD Other Designations: archain vesicle transport protein 1; er delta subunit; coatomer protein complex, subunit delta; er protein delta-COP; coatomer subunit delta; delta-COP; coat n Nucleotide seguence (variant 1): NCBI Reference Seguence: 655.4 LOCUS NM_001655 ACCESSION NM_001655 1 gaagacgtgg gccg ccatcttggc aagaggcgaa gcggcagcgg tcaa 6" gggggcagca ggtccagagc tgctggtgct cccgttcccc tacc cctatcccca 12" gtggagccgg agtgcgggcg cgccccacca ccgccctcac catggtgctg ttggcagcag 18" cggtctgcac aaaagcagga aaggctattg tttctcgaca gtttgtggaa atgacccgaa 24" ctcggattga gggcttatta tttc caaagctcat gaacactgga aaacaacata " cgtttgttga aacagagagt gtaagatatg tctaccagcc tatggagaaa ctgtatatgg 36" tactgatcac taccaaaaac agcaacattt tagaagattt ggagacccta aggctcttct 42" caagagtgat ccctgaatat tgccgagcct tagaagagaa tgaaatatct gagcactgtt 48" tgat tttt gatgaaattg tcgcactggg ataccgggag aatgttaact 54" tggcacagat cagaaccttc acagaaatgg atga ggagaaggtg ttcagagccg 60" tcagagagac tcaagaacgt gaagctaagg ctgagatgcg tcgtaaagca aaggaattac 66" aacaggcccg aagagatgca gagagacagg gcaaaaaagc accaggattt ggcggatttg 72" gcagctctgc agtatctgga ggcagcacag ctgccatgat cacagagacc atcattgaaa 331331 78" ctgataaacc ggca cctgcaccag ccaggccttc aggccccagc aaggctttaa 84" aacttggagc caaaggaaag gaagtagata actttgtgga caaattaaaa tctgaaggtg 90" aaaccatcat gtcctctagt atgggcaagc gtacttctga agcaaccaaa atgcatgctc 96" cacccattaa tatggaaagt gtacatatga agattgaaga aaagataaca ttaacctgtg "02" gacgagacgg aggattacag aatatggagt tgcatggcat gatcatgctt aggatctcag "08" atgacaagta tggccgaatt catg tggaaaatga agataagaaa cagc "14" tacagaccca tccaaatgtg aaac ttttcactgc agagtctcta attggcctga "20" caga gaagtcattt ccagtcaaca gtgacgtagg ggtgctaaag tggagactac "26" aaaccacaga ggaatctttt attccactga caattaattg ctggccctcg gagagtggaa "32" atggctgtga tgtcaacata gaatatgagc tacaagaaga taatttagaa ctgaatgatg "38" tggttatcac catcccactc ccgtctggtg tcggcgcgcc tgttatcggt gagatcgatg "44" gggagtatcg acatgacagt cgacgaaata agtg gtgcctgcct gtgattgatg "50" ccaaaaataa gagtggcagc ttta gcattgctgg gcagcccaat gacttcttcc "56" ctgttcaagt ttcctttgtc tccaagaaaa attactgtaa catacaggtt accaaagtga "62" cccaggtaga tggaaacagc aggt tttccacaga gaccactttc ctagtggata "68" agtatgaaat tctgtaatac caagaagagg aaaa tttt cagattaata "74" aagaagacgc caatgatggc tgaagagttt ttcccagatt tacaagccac tggagacccc "80" ttttttctga tacaatgcac gattctctgc ggac cctcgactca cccccatgtt "86" tcagtgtcac agagacattc aagg aaatggcaca aacataaagg gaaaggctgc "92" taattttctt tggcagattg tattggccag caggaaagca ccag agaatgcccc "98" cagttaaata cctcctctac ctttacctaa gttgctcctt tatttttatt ttattattat 204" tattattatt attattattt tttgagatgg agtctcactt tgtaacccag gctggaatgc 210" aatggcatga tctcagctca ctgcaacctc cgcctcctgg gcaa gtctcctgcc 216" tcagcctccg agtagctggg actacaggtg cacgccacca cgcctggcta attttttgta 222" ttttagtaga gacggggttt caccgtgttg cccaggctgg tcgcgaactc ctgagctcag 228" gcaatccgcc cacctcagcc tcccaaagtg ttgggattac gagc gccc 234" agctgctcct ttattttaat atat aatccctaaa tatagttata tttcatactt 240" tttt taaaaagttt tctctgtaga aaattttaat cattcatacc ctttaccttt 246" aggtttttct ttctatacat tcagtcaggc actgggatca tctgtttaca ggcattatat 252" ttatttggca ctcctggaac aagtatatct aacccattct tgatttttgg actattcagg 332332 258" tgaactattt gaggggtatg gggtctagaa gttaaaagat acgcatgtct tctgttcttt 264" atca attcattcct cttt gccaagttgt tttcctttca gggcctgtcc 270" ttccagttta gaacagtacc atgaatccca cttgtgtcaa agat agctgagaag 276" cacctttcaa atggcacagt ccctcttcaa gatgtctaaa agaatggtta tgtctgtcca 282" gttagggatt tcacatccac atgtaatcat gtctgctgct gttgctaccc aaattttcat 288" ttctccacat tttgggtact aaaa cgtaatggcc acagtctgta atccattcac 294" cagt ttcaccacct ccctcttcca gactgcactc tctgtcatca gtcccctcct 300" ttctaacaga aatggggtta tgaa ggctgtgggt tcagggagtc tttgccaatc 306" ctgttggccc taaactatca aggaggctcc atttcaccat ttgatttttt gcatttcagg 312" aggcaactga ttgtttcgat atgtacatat tactcacgta taccccattt ccttccagtc 318" agcccaacat tttccaccag tctgtcccca tctctgaaat ccttccttct ctttccccct 324" aagtcttttg agtgtcatca tgtactggtg gtttctcggt tccatctcat ccatttcctt 330" ttcaatggag actacagcgt cagccagctc agccttggct tttaactcaa tattccagtc 336" cataggggtg gttaaaagtt gctgcaaggc tgcaggcact ggcagtggga agaggcagac 342" gactagatga cttctgcact tttagctggt tgaaaagtac cactcccact ctgaacatct 348" ggccgtccct gcaaagagtg tactgtgctt gaagcagagc actcacacat aaatggctgt 354" gtgtggaatt gcttgccaaa gaagtttcta gcctttccct ttcccctaac tgcatcaggg 360" aagaattctt atctctagct tggtttccac atgaggtttt tctgagaagg gaca 366" agaagtctgt catgttagtt gcaa gaaatcctac agtt ttgtttgaaa 372" gttgtttgtc attt tttaaaagtc aagtttaatt tcaaaaaacc ttttttttct 378" gagattactt ttggggtaat atttaaaatg agagacattt cctg taaaatacat 384" agggaatata acattccagt gtatacaaag aaggcaaatt ctttaatcaa ataaagcgca 390" ttataaaatg agatgtttat tggattattg actcactttg gctt gttgattcag 396" gtaa tgggacctaa cattaaaaat taatgacatg ttaa gagaaaaaaa 4021 aaaaaaaaaa aa Protein sequence (variant 1): NCBI nce Seguence: NP_001646.2 LOCUS NP_001646 ACCESSION NP_001646 l mvllaaavct kagkaivsrq fvemtrtrie gllaafpklm ntgkqhtfve tesvryqup 333333 6" vlit tknsniledl etlrlfsrvi peycraleen eisehcfdli fafdeivaLg 12" yrenvnlaqi rtftemdshe ekvfravret qereakaemr rkakequar rdaerqgkka 18" pgfggfgssa vsggstaami tetiietdkp kvapaparps gpskalklga nfvd 24" klksegetim sssmgkrtse atkmhappin mesvhmkiee kitltcgrdg glqnmelhgm " imlrisddky grirlhvene dkkqulqth pnvdkklfta esliglknpe ksfpvnsdvg 36" qtte esfipltinc wpsesgngcd vnieyelqed nlelndvvit iplpsgvgap 42" vigeidgeyr hdsrrntlew clpvidaknk sgslefsiag qpndffpqu sfvskknycn 48" iqvtkvtqu gnspvrfste ttflvdkyei l tide seguence (variant 2): NCBI nce Seguence: NM_001142281.1 LOCUS NM_001142281 ACCESSION NM_001142281 l gaagacgtgg cttggggccg ccatcttggc aagaggcgaa gcggcagcgg ttcctgtcaa 6" gggggcagca ggtccagagc tgctggtgct cccgttcccc agaccctacc cctatcccca 12" gtggagccgg agtgcgggcg acca ccgccctcac catgatccct gaatattgcc 18" gagccttaga agagaatgaa atatctgagc actgttttga tttgattttt gatg 24" aaattgtcgc actgggatac cgggagaatg ttaacttggc acagatcaga accttcacag " attc tcatgaggag aaggtgttca gagccgtcag agagactcaa gaacgtgaag 36" ctaaggctga gatgcgtcgt aaagcaaagg aaca ggcccgaaga gatgcagaga 42" gacagggcaa aaaagcacca ggatttggcg gatttggcag ctctgcagta tctggaggca 48" gcacagctgc catgatcaca gagaccatca ttgaaactga aaaa gtggcacctg 54" caccagccag gccttcaggc cccagcaagg aact tggagccaaa gaag 60" tagataactt tgtggacaaa ttaaaatctg aaggtgaaac catcatgtcc atgg 66" gcaagcgtac ttctgaagca accaaaatgc atgctccacc cattaatatg gaaagtgtac 72" atatgaagat tgaagaaaag ataacattaa cctgtggacg agacggagga aata 78" tggagttgca tggcatgatc atgcttagga tctcagatga caagtatggc cgaattcgtc 84" ttcatgtgga agat aagaaagggg tgcagctaca gacccatcca aatgtggata 90" aaaaactttt cactgcagag tctctaattg gcctgaagaa tccagagaag tcatttccag 96" tcaacagtga Cgtaggggtg ctaaagtgga gactacaaac cacagaggaa tcttttattc 102" cactgacaat taattgctgg ccctcggaga gtggaaatgg ctgtgatgtc aacatagaat 334334 "08" atgagctaca agaagataat ttagaactga atgatgtggt tatcaccatc ccactcccgt "l4" tcgg cgcgcctgtt atcggtgaga tcgatgggga gtatcgacat gacagtcgac "20" gaaataccct ggagtggtgc ctgcctgtga ttgatgccaa aaataagagt ggcagcctgg "26" agtttagcat gcag cccaatgact tcttccctgt tcaagtttcc tttgtctcca "32" agaaaaatta cata caggttacca aagtgaccca ggtagatgga aacagccccg "38" tcaggttttc cacagagacc actttcctag tggataagta tctg taataccaag "44" aagagggagc tgaaaaggaa aattttcaga aaga agacgccaat tgaa "50" gagtttttcc cagatttaca agccactgga gacccctttt ttctgataca atgcacgatt "56" ctctgcgcgc aaggaccctc gactcacccc catgtttcag tgtcacagag acattctttg "62" aaat ggcacaaaca taaagggaaa ggctgctaat tggc agattgtatt "68" ggccagcagg aaagcaagct agaa tgcccccagt taaatacctc ctctaccttt "74" acctaagttg ctcctttatt tttattttat tattattatt attattatta ttattttttg "80" agatggagtc tcactttgta acccaggctg gaatgcaatg gcatgatctc agctcactgc "86" aacctccgcc tcctgggttc gtct cctgcctcag agta gctgggacta "92" caggtgcacg ccaccacgcc tggctaattt tttgtatttt agtagagacg gggtttcacc "98" gtgttgccca ggctggtcgc gaactcctga gctcaggcaa tccgcccacc tcagcctccc 204" ttgg gattacaggc atgagccacc atgcccagct gctcctttat tttaatccct 210" aaatataatc cctaaatata tttc atacttagtt tgtttttaaa aagttttctc 216" tgtagaaaat tttaatcatt catacccttt acctttaggt ttct atacattcag 222" tcaggcactg ggatcatctg tttacaggca ttatatttat ttggcactcc tggaacaagt 228" atatctaacc cattcttgat ttttggacta ttcaggtgaa ctatttgagg ggtatggggt 234" ctagaagtta aaagatacgc atgtcttctg ttcttttccc gtatcaattc attccttcat 240" ctctttgcca agttgttttc ctttcagggc ttcc agtttagaac agtaccatga 246" atcccacttg tatt aaagatagct gagaagcacc tttcaaatgg ccct 252" cttcaagatg tctaaaagaa tggttatgtc tgtccagtta gggatttcac atccacatgt 258" aatcatgtct gctgctgttg ctacccaaat tttcatttct ccacattttg ggtacttaag 264" ctaaaacgta atggccacag tctgtaatcc attcacattc ctcagtttca ccacctccct 270" cttccagact gcactctctg tcatcagtcc cctcctttct aacagaaatg gggttatgat 276" tttgaaggct tcag ggagtctttg ccaatcctgt tggccctaaa ctatcaagga 282" ggctccattt caccatttga ttttttgcat ttcaggaggc aactgattgt ttcgatatgt 335335 288" acatattact cacgtatacc ccatttcctt ccagtcagcc caacattttc caccagtctg 294" tccccatctc tgaaatcctt ccttctcttt ccccctaagt cttttgagtg tcatcatgta 300" ctggtggttt ctcggttcca tctcatccat ttccttttca atggagacta cagcgtcagc 306" cagctcagcc ttggctttta actcaatatt ccagtccata ggggtggtta aaagttgctg 312" caaggctgca ggcactggca gtgggaagag gcagacgact agatgacttc tgcactttta 318" gctggttgaa aagtaccact cccactctga acatctggcc gtccctgcaa agagtgtact 324" gtgcttgaag cagagcactc acacataaat ggctgtgtgt ggaattgctt gccaaagaag 330" tttctagcct ttccctttcc cctaactgca tcagggaaga attcttatct ctagcttggt 336" atga ggtttttctg agaagggctt gggacaagaa gtctgtcatg ttagttaagc 342" aggcaagaaa tcctactaat ttgt ttgaaagttg tttgtccgta tgatttttta 348" aagt ttaatttcaa aaaacctttt ttttctgaga ttacttttgg attt 354" agag acattttgta accctgtaaa atacataggg aatataacat tccagtgtat 360" acaaagaagg cttt aatcaaataa agcgcattat aaaatgagat tgga 366" ttattgactc actttggtgt ctgcttgttg attcaggatg ctgtaatggg acctaacatt 3721 aaaaattaat gacatgtttt ttttaagaga aaaaaaaaaa aaaaaaaa Protein sequence (variant 2): NCBI nce Seguence: NP_001135753.1 LOCUS NP_001135753 ACCESSION NP_001135753 l rale eneisehcfd lifafdeiva lgyrenvnla emds heekvfravr 6" etqereakae mrrkakequ arrdaerqgk kapgfggfgs savsggstaa mitetiietd 12" apar alkl vdnf vdklkseget imsssmgkrt seatkmhapp 18" inmesvhmki eekitltcgr dgglqnmelh gmimlrisdd kygrirlhve nedkkquLq 24" thpnvdkklf taesliglkn peksfpvnsd vgvlkwrlqt teesfiplti ncwpsesgng " cdvnieyelq ednlelndvv itiplpsgvg apvigeidge yrhdsrrntl idak 36" nksgslefsi agqpndffpv qufvskkny cniqvtkvtq vdgnspvrfs tettflvdky 42" eil DDX18 OfibblSwnbd:DDX18 336336 Official Name: DEAD (Asp-Glu-Ala-Asp) box polypeptide 18 Gene ID: 8886 Organism: Homo sapiens Other s: MrDb Other Designations: ATP-dependent RNA helicase DDX18; DEAD box protein 18; DEAD/H (Asp-Glu-Ala-Asp/His) box ptide 18 (Myc-regulated); Myc- regulated DEAD box protein tide seguence: NCBI Reference Seguence: NM_006773.3 LOCUS NM_006773 ACCESSION NM_006773 l gccgagctgc gcacgtgcgg ccggaaggga agtaacgtca gcctgagaac tgagtagctg 6" tactgtgtgg cgccttattc taggcacttg ttgggcagaa tgtcacacct gccgatgaaa 12" cgta agaagatcga gaagcggaac ctcaaattgc ggcagcggaa cctaaagttt 18" cagggggcct caaatctgac cctatcggaa actcaaaatg tatc tgaagaaaca 24" atgggaagta gaaaggttaa aaaatcaaaa caaaagccca tgaatgtggg cttatcagaa " actcaaaatg gaggcatgtc tcaagaagca gtgggaaata taaaagttac aaagtctccc 36" cagaaatcca ctgtattaac agaa gcagcaatgc agtcttccaa ttcagaatca 42" aaaaagaaaa agaagaaaaa gagaaaaatg gtgaatgatg ctgagcctga tacgaaaaaa 48" gcaaaaactg aagg gaaatctgaa gaagaaagtg ccgagactac taaagaaaca 54" gaaaataatg tggagaagcc agataatgat gaagatgaga gtgaggtgcc cagtctgccc 60" ctgggactga cttt tgaggatact tcgtttgctt ctctatgtaa tcttgtcaat 66" gaaaacactc tgaaggcaat aaaagaaatg ggttttacaa acatgactga aattcagcat 72" aaaagtatca gaccacttct ggaaggcagg gatcttctag cagctgcaaa aacaggcagt 78" ggtaaaaccc tggcttttct catccctgca gttgaactca ttgttaagtt catg 84" cccaggaatg gaacaggagt ccttattctc tcacctacta gagaactagc catgcaaacc 90" gttc ttaaggagct gatgactcac cacgtgcata cctatggctt gataatgggt 96" aaca gatctgctga agcacagaaa cttggtaatg ggatcaacat cattgtggcc 102" acaccaggcc gtctgctgga ccatatgcag aataccccag gatttatgta taaaaacctg 108" cagtgtctgg ttattgatga tcgt atcttggatg tggggtttga agaggaatta 337337 "l4" aagcaaatta ttaaactttt gccaacacgt agacagacta tgctcttttc tgccacccaa "20" actcgaaaag ttgaagacct ggcaaggatt tctctgaaaa aggagccatt gtatgttggc "26" gttgatgatg ataaagcgaa tgcaacagtg cttg aacagggata tgttgtttgt "32" ccttctgaaa agagattcct tctgctcttt acattcctta accg aaagaagaag "38" cttatggtct tcttttcatc ttgtatgtct gtgaaatacc actatgagtt gctgaactac "44" attgatttgc ccgtcttggc cattcatgga aagcaaaagc aaaataagcg tacaaccaca "50" cagt tctgcaatgc agattcggga acactattgt gtacggatgt ggcagcgaga "56" ggactagaca ttcctgaagt cgactggatt gttcagtatg accctccgga tgaccctaag "62" gaatatattc atcgtgtggg tagaacagcc agaggcctaa atgggagagg gcatgccttg "68" ctcattttgc gcccagaaga attgggtttt cttcgctact tgaaacaatc caaggttcca "74" ttaagtgaat ttgacttttc taaa atttctgaca ttcagtctca gcttgagaaa "80" ttgattgaaa agaattactt tcttcataag tcagcccagg aagcatataa cata "86" cgagcctatg attcccattc tctgaaacag atctttaatg ttaataacct aaatttgcct "92" caggttgctc ttgg tttcaaggtg cctcccttcg ttgatctgaa cgtcaacagt "98" aatgaaggca agcagaaaaa aggt ggat ttggctacca gaaaaccaag 204" aaagttgaga aatccaaaat ctttaaacac attagcaaga aatcatctga cagcaggcag 210" cact gaacacatgc ttca tcttgaataa ctttgtccta aaatgaattt 216" ccct tgatttaaca ggatttttgt agactttaga atttggactt caag 222" agtataaatt gacttgggtt gcaagcactg agcactgtta cttctatcac gtctctcttt 228" tatttctggg aaca ggctttaagt ttcttggttg cccaagggca gagcaaggaa 234" tatctggtgt ttcttgtgat gataatattt taattttaaa tatccctccc tcatacaagt 240" gtatgttacc attttaatat aattcttttt gtacctttcc ttcttgtttt gcgaagattt 246" catg gattgctgtg ctcactgctg taaaaggtga cctagtgtac tgggcagctg 252" gtggcggtgc agaaaagagt ctcaggttat tttttgtttt tagttatttc ttggaccttg 258" acagtatcta atgactcctc ctgaaaatgc ataa aagagcaaag agctttggga 264" aatacctaag aagcacctta agattagggt ggcattgctt ttatagattc ttgattttaa 270" agcaacaggc ctttctcagg tgttgcattt tttggagcaa aaactatggg ttgtaatttg 276" aataaagtgt cactaagcag ttataacgtt tgatggctgg ggggtaggaa gaggatggaa 282" ttgagatgtt tgagcctcat ttacatcaat agaggtgtaa tgtactgcat ttcttcattt 288" ggtaacataa caaagacttt aaga atgc tcctcattaa gatttgttta 338338 294" attcaaggtg gtttggattt cctt tgcactctgt agagtactta gaagacaagg 300" gcaacttact tggagttaga gccaagctgt gtgc ccagcacaca ttaatgttag 306" cttctttctg agaaaaaaat acctcttcca ggccctgaaa caaaaaatac atttgctgtg 312" aagattgaaa atgaacaaag ttagaaaaaa aaacagcaaa atcagtgatt tagtcagatg 318" agtttttcgt tgtaggagca cttgatttct agtgtgtttt tata taactacaag 324" atagtacatt ttgtagcagt tcaaagccaa agttgctagc atcattttgc tgttgtgcca 330" gttaatcata ggatcccatt aaataagtgt gctaacatcg aatatagaga aaactggtaa 336" agaacattcc agtaggaaaa aaca atcttccatt tctgggcttg gccaccatca 342" ccctggtcgg acctgtcctg gacttccaac cttgactgct gagctcctgg ttct 348" tgggttccta attcctggtg tttaataatt ctctccacga tcatgttttt ctgatttttt 354" ttttcagaaa taatgttttt taaaagacaa aaacaaaggg aagaatattt tgag 360" cagaagtaaa tactgttggc attttgtaca taatctaatt tttatatgca tgttcatgct 366" ttttaatttt tttatcaaaa attaagtcat ctacctacta cttgtaacca gcttgtttca 372" taacatgtta ttttcctgtg aata attacttcaa tgttgaaaaa aaaaaaaaaa 3781 aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_006764.3 LOCUS 764 ACCESSION NP_006764 l mshlpmkllr kkiekrnlkl rqrnlquga snltlsetqn gdvseetmgs rkvkksqup 6" mnvglsetqn ggmsqeavgn ikvtkquks tvltngeaam qssnseskkk kkkkrkmvnd l2" aepdtkkakt nkgks s a ttk t DD V kpdnd d s vpslplgl tgafedtsfa 18" slcnlvnent lkaikemgft nmteiqhksi rpllegrdll aaaktgsgkt laflipavel 24" ivklrfmprn gtgvlilspt relamqtfgv lkelmthhvh tyglimggsn rsaeaqklgn " giniivatpg rlldhmqntp gfmyknlqcl videadrild vgfeeelkqi ikllptrrqt 36" mlfsatqtrk vedlarislk keplyvgvdd dkanatvdgl quyvvcpse krflllftfl 42" klmv ffsscmsvky hyellnyidl pvlaihgqu anrtttffq gtLl 48" ctdvaargld ipevdwivqy keyi argl ngrghallil rpeelgflry 54" lkqskvplse fdfswskisd iqsqleklie ksaq eayksyiray dshslkqifn 60" pqva lsfgfkvppf vdlnvnsneg ggg fquktkkve kskifkhisk 339339 661 kssdsrqfsh Official Symbol: GBBP2 Official Name:GTPase activating protein (SH3 domain) binding protein 2 Gene ID: GTPase activating protein (SH3 domain) binding n 2 Organism: Homo sapiens Othe r Aliases: Other Designations: GBBP-2; GAP 8H3 domain-binding protein 2; Ras-GTPase activating protein 8H3 domain-binding protein 2; ras GTPase-activating protein- bindi ng protein 2 Nucleotide ce (variant 1): NCBI Reference Seguence: NM_203505.2 LOCUS 505 ACCESSION NM_203505 l gtgctcgggg gttccctggc cctttcggca ggggtaaaac aggg ggcggtggca 6" aagggggcgg gacgtccgtg gtccttgtcg cacgtcgcag gcgc ccgggaagag l2" gtggttgtga ggcagacgaa ctcgcggctc tccggcttcc tccg agttgtcgga 18" ggaagggggc ggcgagcaat aagaacccgc cgcacccggt cctcagcgac tcttctgacc 24" tccgcgcgac gtacccgccg ccgccgttgg ctggagcatt tgacattgtg cagcaaagaa " atggttatgg agaagcccag tccgctgctt gtagggcggg agtttgtgag gcaatattat 36" actttgctga ataaagctcc ggaatattta cacaggtttt atggcaggaa ttcttcctat 42" gttcatggtg gagtagatgc tagtggaaag ccccaggaag ctgtttatgg ccaaaatgat 48" caca aagtattatc tctgaacttc agtgaatgtc atactaaaat tcgtcatgtg 54" catg caaccttgag tgatggagta gttgtccagg tcatgggttt gctgtctaac 60" agtggacaac cagaaagaaa gtttatgcaa acctttgttc tggctcctga aggatctgtt 66" ccaaataaat tttatgttca caatgatatg tttcgttatg aagatgaagt gtttggtgat 72" tctgagcctg aacttgatga agaa gatgaagtag aaca agaagaaaga 78" tctc ctgaacctgt aaat gctaacagtg gttactatga agctcaccct 340340 84" gtgactaatg gcatagagga ggaa gaatcctctc ctga acctgagcca 90" gaatctgaaa caaagactga agagctgaaa ccacaagtgg aggagaagaa agaa 96" ctagaggaga aatctactac tcctcctccg gcagaacctg tttctctgcc acaagaacca "O2" gctt tctcctgggc ttcagtgacc agtaaaaacc ctag tggtactgtt "08" tcttcctctg gaattccacc ccatgttaaa gcaccagtct cacagccaag agtcgaagct "l4" aaaccagaag ttcaatctca gccacctcgt gtgcgtgaac aacgacctag agaacgacct "20" ggttttcctc ctagaggacc aagaccaggc agaggagata tggaacagaa tgactctgac "26" aaccgtagaa taattcgcta tccagatagt catcaacttt ttgttggtaa cttgccacat "32" gatattgatg aaaatgagct aaaggaattc ttcatgagtt ttggaaacgt tgtggaactt "38" cgcatcaata ccaagggtgt tgggggaaag cttccaaatt ttggttttgt ggtttttgat "44" gactctgaac cagttcagag aatcttaatt gcaaaaccga ttatgtttcg agta "50" cgtttaaatg tggaagagaa aaaaacaaga gctgcaagag agcgagaaac cagaggtggt "56" ggtgatgatc gcagggatat taggcgcaat gatcgaggtc ccggtggtcc acgtggaatt "62" gtgggtggtg gaatgatgcg tgatcgtgat ggaagaggac ctcctccaag gggtggcatg "68" aaac ttggctctgg aagaggaacc gggcaaatgg agggccgctt cacaggacag "74" tgaa gctccactgt tggcaaagtc ttggcagtgg tacattattc atcgtgtttg "80" cattcttgtt aatttttttt ttggctttgg aatgtgacac agcctttttg tctt "86" tgatgtgaaa agcatctttg gttatcagtt aaattgaggt ggacattatt tccccaattt "92" cacaacagga ttcacattgt taatttataa atctagactt ggagaattaa ggactgagaa "98" atgaccatat cttaaactat ctacgacaaa gtgaacttaa aaggacatgc ccactgaatt 204" caggtccttt gagtaaaaaa aaaatcttct gctgcacatt ttgtttaagt gttactgttt 210" ctgcctgtta atgctgggaa cacaaatagt tgtg caattggaga atcttgcctt 216" ttttcttggc tccccccaaa aacc aacagaaact tgttatgcac tcatcaaaat 222" gtactaatgg gtactctgaa aaca ttgacatctg caacaggagg caacagggaa 228" aaaatctcat cttcttttcc aaat agtttgtgaa atgatgaggg cattttatct 234" gcttgctgtg accagcgtgt gtacacataa acaa gactacaagt atattccaga 240" aggaaatcat tttagttatg aactaaataa taaaaattag aacttcaaat gcgatggtct 246" ttag accagattta gtagctccat atctaagatt tttctacctg cccctcttca 252" ggat ggctggctgc tcaacacact cctcctcccc ttttttcctt tctttaagct 258" gtgtacagtg aaaattgtct ttactgtatt tttgttctct ggtaatgtaa taagcatgat 341341 264" ggtgccttct attaatacat agtc ttgctggtaa ttttgtacag tatagtgtat 270" gaattgctgt aagc caaacagctg caaaatgttg aaaaatcatc tata 276" gcag tatctttaaa atcagtaaaa tggactagca tattatttat cttgttcttc 282" agttaacaac tttgtgttct ctgtgggagg gagggagtcc tgtgtgtttg tggggagagg 288" gaaggaggaa gtcagttatt tgagtaagcc tctagttgac ttttctctta gcctgaatgt 294" ggacgttgaa acatatcact tcagggcttg gaaaagtcag tcaacttgac gtacattttt 300" agtgacattt taaaagcagt cagattctat aaatggcaag taagcctgaa gtgaggatac 306" tgcaattttc ggagaaaaga acagcagctc tttaagtgtt tgcattttct atttgggggg 312" cagggaactg attt tgcacaattc ttgaactgat gtcagcaccc gagtggctcc 318" tgaatttaag tctgggacga catcttttat ttttacatga atctttaaac aattctgtga 324" gcaaagtttg tagctgctgg attattgtct gtctttatag caagttccag taaaccacaa 330" caaa gcttatccaa ttttatgctt ggagcagtca gtacatacca gtttctgatg 336" gcag gagtggggta aataagtgtg accacttaaa cgtt agcatggaag 342" ccat tctatctttg taaaacagac aagatatgca cttgacatag tagcaaattg 348" gttctgaatt atgcaactgt ttgctattta gtaaactagc aaatgatgca tgtattttgt 354" ttttcatgta ctgggcaata tgagtaaaat ctgtcccttt ttcccccttt gaatgaggtc 360" ttccatgttt gagggaaagt cttgcactat tgcatatatt ttggggacac agattttcat 366" agtttccatt tttggggggc ttaaggattt tttttttttc tgtttgaaac agttttatac 372" atat agtacttgaa attcttacca gaaaattact tttt gaagccttta 378" ttaatactac agaa gcagttgttt tattgtcaat gttttttttc ccccaagcat 384" ttgt atttctgttt ccatatatat atatatatat ataatttcca attcaggata 390" ttgccctgcc atccatgaaa actgttctgg caccaaaagt aatgacaaat gttaagtgta 396" ataatagaaa agtagagcaa agagccattc agcttcagtc tttacatacc atgaataaaa 402" aaca tcatatggag acat ggtgattgtt cacctgcagt actgtggagt 408" tttaacattt tgtcctcttt tcagtgaaac agagtaaaaa tattcatcta ccattactgt 414" tatttgctga ttttgtttta ttttttgatg gtaatattct atccttatga cactattgca 420" ttgg ctttaccatc ttggctttag taggtataga agacaatgga ttaccatctt 426" tattgctgta atgtgttaag cattatatgc tagtagaatc aatt gtttcaggtg 432" gaaagtattc tttgagtttc catattgaat gtgtttggac taaacaaaca ataaactact 4381 gatgtctgca atct atgtccctaa Protein seguence (variant 1): 342342 NCBI Reference Seguence: NP_987101.1 LOCUS NP_987101 ACCESSION NP_987101 l mvmekpspll vgrefvrqyy tllnkapeyl hrfygrnssy vhggvdasgk pqeavygqnd 6" ihhkvlslnf sechtkirhv dahatlsdgv vqumgllsn sgqperkfmq tfvlapegsv 12" pnkfyvhndm fryedevfgd sepeldeese deveeeqeer qpspepvqen ansgyyeahp 18" vtngi pl ssh p p p s tkt lk pqveeknlee leeksttppp pqep 24" pkafswasvt sknlppsgtv sssgipphvk rvea kpevqsqppr vreqrprerp " gfpprgprpg ndsd nrriirypds hqlfvgnlph didenelkef fmsfgnvvel 36" rintkgvggk lpnfgfvvfd dsepvqrili akpimfrgev rlnveekktr aareretrgg 42" gddrrdirrn drgpggprgi vgggmmrdrd grgppprggm aqklgsgrgt gqmegrftgq 48" rr Nucleotide seguence (variant 2): NCBI Reference ce: NM_012297.4 LOCUS NM_012297 ACCESSION NM_012297 l acattccatt cgcgctccgc ggcgcgaggc aatcgtccgg agcc cgggagccgg 6" aggtgtagcg gcagagacat tgttcttgcc ctac ggtgccgtgt gaga 12" gaagaccagt ctttcctcta gcatttgaca ttgtgcagca tggt gaag 18" cccagtccgc tagg gcgggagttt gtgaggcaat attatacttt gctgaataaa 24" gctccggaat atttacacag gttttatggc aggaattctt cctatgttca tggtggagta " gatgctagtg gaaagcccca ggaagctgtt tatggccaaa atgatataca ccacaaagta 36" ttatctctga acttcagtga atgtcatact aaaattcgtc atgtggatgc tcatgcaacc 42" ttgagtgatg gagtagttgt ccaggtcatg ggtttgctgt ctaacagtgg acaaccagaa 48" agaaagttta tgcaaacctt tgttctggct cctgaaggat ctgttccaaa taaattttat 54" gttcacaatg atatgtttcg ttatgaagat gaagtgtttg gtgattctga gcctgaactt 60" gaat cagaagatga agtagaagag gaacaagaag aaagacaacc atctcctgaa 66" cctgtgcaag aaaatgctaa cagtggttac tatgaagctc accctgtgac cata 72" gaggagcctt tggaagaatc ctctcatgaa cctg agccagaatc tgaaacaaag 343343 78" actgaagagc tgaaaccaca ggag aagaacttag aagaactaga ggagaaatct 84" actactcctc ctccggcaga acctgtttct ctgccacaag aaccaccaaa ggctttctcc 90" tgggcttcag tgaccagtaa gcct cctagtggta ctgtttcttc ctctggaatt 96" ccaccccatg ttaaagcacc agtctcacag ccaagagtcg aagctaaacc agaagttcaa "02" tctcagccac ctcgtgtgcg tgaacaacga cctagagaac gacctggttt tcctcctaga "08" ggaccaagac caggcagagg ggaa cagaatgact ctgacaaccg tagaataatt "14" cgctatccag atagtcatca actttttgtt ggtaacttgc cacatgatat tgatgaaaat "20" gagctaaagg aattcttcat gagttttgga aacgttgtgg aacttcgcat caataccaag "26" ggtgttgggg gaaagcttcc aaattttggt gttt ttgatgactc tgaaccagtt "32" cagagaatct taattgcaaa accgattatg tttcgagggg aagtacgttt aaatgtggaa "38" gagaaaaaaa caagagctgc aagagagcga gaaaccagag gtggtggtga tgatcgcagg "44" aggc gcaatgatcg aggtcccggt ggtccacgtg gaattgtggg tggtggaatg "50" atgcgtgatc gtgatggaag aggacctcct ccaaggggtg gcatggcaca gaaacttggc "56" tctggaagag gaaccgggca aatggagggc cgcttcacag gacagcgtcg ctgaagctcc "62" actgttggca aagtcttggc agtggtacat tattcatcgt gtttgcattc ttgttaattt "68" tttttttggc tttggaatgt gacacagcct ttttgatcat gatg tgaaaagcat "74" ttat cagttaaatt gaggtggaca ttatttcccc acaa caggattcac "80" attgttaatt tataaatcta gacttggaga attaaggact tgac ttaa "86" actatctacg acaaagtgaa cttaaaagga catgcccact gaattcaggt cctttgagta "92" aaaaaaaaat cttctgctgc acattttgtt taagtgttac tgtttctgcc tgct "98" gggaacacaa atagtgcaat ttgtgcaatt ggagaatctt gccttttttc cccc 204" ccaaaaatac acag aaacttgtta tgcactcatc aaaatgtact aatgggtact 210" ctgaactcat taacattgac atctgcaaca ggaggcaaca gggaaaaaat ctcatcttct 216" tttccagtag aaaatagttt gtgaaatgat attt tatctgcttg ctgtgaccag 222" cgtgtgtaca cataaacctt aacaagacta caagtatatt ccagaaggaa atcattttag 228" ttatgaacta aataataaaa attagaactt caaatgcgat ggtcttgact attagaccag 234" atttagtagc tccatatcta agatttttct acctgcccct cttcagtaca gggatggctg 240" caac ctcc tccccttttt tcctttcttt aagctgtgta cagtgaaaat 246" tgtctttact gtatttttgt tctctggtaa aagc atgatggtgc cttctattaa 252" tacatcattc cagtcttgct ggtaattttg tacagtatag tgtatgaatt gctgtgctgc 344344 258" aaagccaaac aaaa tgttgaaaaa tcatcgaaat gtataaaaat tgcagtatct 264" ttaaaatcag taaaatggac tagcatatta tttatcttgt tcttcagtta acaactttgt 270" gttctctgtg ggagggaggg agtcctgtgt gtttgtgggg agagggaagg aggaagtcag 276" ttatttgagt aagcctctag ttgacttttc tcttagcctg aatgtggacg ttgaaacata 282" tcacttcagg gcttggaaaa gtcagtcaac ttgacgtaca tttttagtga cattttaaaa 288" gcagtcagat tctataaatg gcaagtaagc ctgaagtgag gatactgcaa gaga 294" aaagaacagc agctctttaa gtgtttgcat tttctatttg gggggcaggg catt 300" cattttgcac aattcttgaa ctgatgtcag cacccgagtg gctcctgaat ttaagtctgg 306" gacgacatct tttattttta catgaatctt attc tgtgagcaaa gtttgtagct 312" gctggattat tgtctgtctt tatagcaagt tccagtaaac cacaagtatg gcaaagctta 318" tccaatttta tgcttggagc agtcagtaca taccagtttc tgatgtttca ggcaggagtg 324" gggtaaataa gtgtgaccac ttaaagctgc tcgttagcat ggaagacttc tccattctat 330" ctttgtaaaa cagacaagat atgcacttga agca ttct gaattatgca 336" actgtttgct atttagtaaa ctagcaaatg atgcatgtat tttgtttttc atgtactggg 342" caatatgagt aaaatctgtc cctttttccc cctttgaatg aggtcttcca tgtttgaggg 348" aaagtcttgc actattgcat atattttggg gacacagatt ttcatagttt ttgg 354" taag gatttttttt ttttctgttt gaaacagttt tatactttct gatatagtac 360" ttgaaattct taccagaaaa ttactttgga gttttgaagc ctttattaat actactttta 366" aagaagcagt tgttttattg tcaatgtttt ttttccccca agcatatttt cttgtatttc 372" tgtttccata tatatatata tatatataat ttccaattca tgcc ctgccatcca 378" tgaaaactgt tctggcacca atga caaatgttaa gtgtaataat agaaaagtag 384" agcaaagagc cattcagctt ttac tgaa taaaacatta aaacatcata 390" tggagaagtt tacatggtga ttgttcacct gcagtactgt ggagttttaa cattttgtcc 396" tcttttcagt gaaacagagt aaaaatattc atctaccatt actgttattt gctgattttg 402" ttttattttt taat attctatcct tatgacacta ttgcaaccaa attggcttta 408" ccatcttggc tttagtaggt gaca atggattacc atctttattg ctgtaatgtg 414" ttaagcatta tatgctagta gaatctagtt taattgtttc aggtggaaag tattctttga 420" gtttccatat tgaatgtgtt tggactaaac aaacaataaa ctactgatgt ctgcagcatt 4261 tatctatgtc cctaa Protein sequence (variant 2): 345345 NCBI nce Seguence: 429.2 LOCUS N P_036429 ACCESSION NP_036429 l spll vgrefvrqyy tllnkapeyl nssy vhggvdasgk pqeavygqnd 6" ihhkvlslnf sechtkirhv dahatlsdgv vqumgllsn sgqperkfmq tfvlapegsv 12" pnkfyvhndm fryedevfgd sepeldeese qeer qpspepvqen ansgyyeahp 18" vtngi pl ssh p p p s tkt lk nlee leeksttppp aepvslpqep 24" pkafswasvt sknlppsgtv sssgipphvk apvsqprvea kpevqsqppr vreqrprerp " gfpprgprpg rgdmeqndsd nrriirypds hqlfvgnlph didenelkef fmsfgnvvel 36" rintkgvggk lpnfgfvvfd dsepvqrili akpimfrgev rlnveekktr aareretrgg 42" gddrrdirrn drgpggprgi vgggmmrdrd grgppprggm aqklgsgrgt gqmegrftgq 48" rr Nucleotide seguence (variant 3): NCBI Reference Seguence: 504.2 LOCUS NM_203504 ACCESSION NM_203504 l gtgctcgggg gttccctggc cctttcggca ggggtaaaac aataagaggg ggcggtggca 6" aagggggcgg gacgtccgtg gtccttgtcg cacgtcgcag cgcctggcgc ccgggaagag l2" gtggttgtga ggcagacgaa ctcgcggctc tccggcttcc gaggcttccg agttgtcgga 18" ggaagggggc ggcgagcaat aagaacccgc cgcacccggt cctcagcgac tcttctgacc 24" tccgcgcgac gtacccgccg ccgccgttgg ctggagcatt tgacattgtg cagcaaagaa " atggttatgg agaagcccag tccgctgctt gtagggcggg agtttgtgag gcaatattat 36" actttgctga ataaagctcc ggaatattta tttt atggcaggaa ttcttcctat 42" gttcatggtg gagtagatgc tagtggaaag ccccaggaag ctgtttatgg ccaaaatgat 48" atacaccaca aagtattatc tctgaacttc tgtc aaat tcgtcatgtg 54" gatgctcatg caaccttgag agta gttgtccagg tcatgggttt taac 60" agtggacaac cagaaagaaa gtttatgcaa acctttgttc tggctcctga aggatctgtt 66" ccaaataaat tttatgttca caatgatatg tttcgttatg aagatgaagt gtttggtgat 72" cctg aacttgatga agaatcagaa gatgaagtag aagaggaaca agaagaaaga 346346 78" caaccatctc ctgaacctgt gcaagaaaat gctaacagtg atga agctcaccct 84" gtgactaatg gcatagagga gcctttggaa gaatcctctc atgaacctga acctgagcca 90" gaaa caaagactga agagctgaaa ccacaagtgg aggagaagaa cttagaagaa 96" ctagaggaga aatctactac tcctcctccg gcagaacctg tttctctgcc acaagaacca "02" ccaaagccaa gagtcgaagc taaaccagaa gttcaatctc agccacctcg tgtgcgtgaa "08" caacgaccta gagaacgacc tcct cctagaggac caagaccagg cagaggagat "14" atggaacaga atgactctga caaccgtaga ataattcgct atag tcatcaactt "20" tttgttggta acttgccaca tgatattgat gaaaatgagc taaaggaatt cttcatgagt "26" tttggaaacg ttgtggaact tcgcatcaat ggtg ttgggggaaa gcttccaaat "32" tttggttttg tggtttttga tgactctgaa caga gaatcttaat tgcaaaaccg "38" attatgtttc gaggggaagt acgtttaaat gtggaagaga caag agctgcaaga "44" gagcgagaaa ccagaggtgg tggtgatgat cgcagggata ttaggcgcaa aggt "50" cccggtggtc cacgtggaat tgtgggtggt ggaatgatgc gtgatcgtga tggaagagga "56" cctcctccaa ggggtggcat ggcacagaaa cttggctctg gaagaggaac cgggcaaatg "62" gagggccgct tcacaggaca gcgtcgctga agctccactg ttggcaaagt cttggcagtg "68" gtacattatt catcgtgttt gcattcttgt taattttttt tttggctttg gaatgtgaca "74" cagccttttt ttct ttgatgtgaa aagcatcttt ggttatcagt taaattgagg "80" tggacattat ttccccaatt tcacaacagg attcacattg ttaatttata aatctagact "86" tggagaatta aggactgaga aatgaccata tcttaaacta tctacgacaa agtgaactta "92" catg cccactgaat tcaggtcctt tgagtaaaaa aaaaatcttc tgctgcacat "98" taag tgttactgtt tgtt aatgctggga acacaaatag tgcaatttgt 204" gcaattggag aatcttgcct tttttcttgg ctccccccaa aaatacaaac caacagaaac 210" ttgttatgca ctcatcaaaa tgtactaatg ggtactctga actcattaac attgacatct 216" gcaacaggag gcaacaggga aaaaatctca tttc cagtagaaaa tagtttgtga 222" aatgatgagg gcattttatc ctgt gaccagcgtg tgtacacata aaccttaaca 228" agactacaag tatattccag atca ttttagttat gaactaaata ataaaaatta 234" gaacttcaaa tgcgatggtc ttgactatta gaccagattt agtagctcca tatctaagat 240" ttttctacct cttc ggga tggctggctg ctcaacacac tcctcctccc 246" cttttttcct ttctttaagc tgtgtacagt gaaaattgtc tttactgtat ttttgttctc 252" tggtaatgta ataagcatga tggtgccttc taca tcattccagt cttgctggta 347347 258" attttgtaca gtatagtgta tgaattgctg tgctgcaaag agct tgtt 264" gaaaaatcat cgaaatgtat aaaaattgca gtatctttaa aatcagtaaa atggactagc 270" atattattta tcttgttctt cagttaacaa ctttgtgttc tctgtgggag ggagggagtc 276" ctgtgtgttt gtggggagag ggaaggagga agtcagttat aagc ctctagttga 282" cttttctctt agcctgaatg tggacgttga aacatatcac ttcagggctt ggaaaagtca 288" gtcaacttga cgtacatttt tagtgacatt ttaaaagcag tcagattcta taaatggcaa 294" gtaagcctga agtgaggata ctgcaatttt cggagaaaag agct ctttaagtgt 300" ttgcattttc tatttggggg gcagggaact gtcattcatt ttgcacaatt ctga 306" cacc cgagtggctc ctgaatttaa gtctgggacg acatctttta tttttacatg 312" aatctttaaa caattctgtg agcaaagttt gtagctgctg gattattgtc tgtctttata 318" gcaagttcca gtaaaccaca agtatggcaa agcttatcca tgct tggagcagtc 324" agtacatacc agtttctgat ggca ggagtggggt aaataagtgt ttaa 330" agctgctcgt tagcatggaa gacttctcca ttctatcttt gtaaaacaga caagatatgc 336" cata gtagcaaatt ggttctgaat actg tttgctattt agtaaactag 342" caaatgatgc atgtattttg tttttcatgt actgggcaat atgagtaaaa tctgtccctt 348" tttccccctt tgaatgaggt cttccatgtt tgagggaaag tcttgcacta atat 354" tttggggaca cagattttca tagtttccat ttttgggggg cttaaggatt tttttttttt 360" ctgtttgaaa cagttttata ctttctgata tagtacttga aattcttacc agaaaattac 366" tttggagttt tgaagccttt attaatacta cttttaaaga agcagttgtt ttattgtcaa 372" tgtttttttt agca tattttcttg tatttctgtt tccatatata tatatatata 378" tataatttcc aattcaggat attgccctgc catccatgaa aactgttctg gcaccaaaag 384" caaa tgttaagtgt aataatagaa aagtagagca aagagccatt cagt 390" ctttacatac catgaataaa acattaaaac atcatatgga gaagtttaca tggtgattgt 396" tcacctgcag tactgtggag ttttaacatt ttgtcctctt gaaa cagagtaaaa 402" atattcatct accattactg ttatttgctg attttgtttt attttttgat ggtaatattc 408" tatccttatg acactattgc aaccaaattg gctttaccat cttggcttta gtaggtatag 414" aagacaatgg attaccatct ttattgctgt aatgtgttaa gcattatatg ctagtagaat 420" ctagtttaat aggt ggaaagtatt ctttgagttt ccatattgaa tgtgtttgga 4261 ctaaacaaac aataaactac tgatgtctgc agcatttatc tatgtcccta a Protein sequence (variant 3): 348348 NCBI Reference Seguence: NP_987100.1 LOCUS NP_987100 ACCESSION NP_987100 1 mvmekpspll vgrefvrqyy tllnkapeyl hrfygrnssy vhggvdasgk pqeavygqnd 6" ihhkvlslnf sechtkirhv dahatlsdgv vqumgllsn kfmq tfvlapegsv 12" pnkfyvhndm fryedevfgd eese deveeeqeer qpspepvqen ansgyyeahp 18" vtngi pl ssh p p p s tkt 1k nlee leeksttppp pqep 24" pkprveakpe vqsqpprvre qrprerpgfp prgprpgrgd meqndsdnrr shql " fvgnlphdid enelkeffms fgnvvelrin tkgvggklpn fgfvvfddse pvqriliakp 36" imfrgevrln veekktraar eretrgggdd rrdirrndrg pggprgivgg gmmrdrdgrg 42" ppprggmaqk lgsgrgtgqm egrftgqrr UQCRH al Symbol: UQCRH Official Name: ubiquinoI-cytochrome c reductase hinge protein Gene ID: 7388 Organism: Homo sapiens Other s: QCR6, UQCR8 Other Designations: complex III subunit 6; complex III subunit VIII; rome b-c1 complex subunit 6, ondrial; cytochrome c1 non-heme 11 kDa protein; mitochondrial hinge protein; ubiquinoI-cytochrome c reductase complex 11 kDa protein; ubiquinoI-cytochrome c reductase, compIex III subunit VIII tide seguence: NCBI Reference Seguence: NM_006004.2 LOCUS NM_006004 ACCESSION NM_006004 1 ctgaactggg ttaggtgccg ctgttgctgc tcgtgttgaa tctagaaccg tagccagaca 61 tgggactgga ggacgagcaa aagatgctta ccgaatccgg agatcctgag gaggaggaag 121 aggaagagga ggaattagtg gatcccctaa caacagtgag agagcaatgc gagcagttgg 181 agaaatgtgt aaaggcccgg gagcggctag agctctgtga tgagcgtgta tcctctcgat 349349 241 cacatacaga agaggattgc acggaggagc tctttgactt cttgcatgcg agggaccatt 301 gcgtggccca caaactcttt aacaacttga aataaatgtg tggacttaat tcaccccagt 361 cttcatcatc tgggcatcag aatatttcct tatggttttg gatgtaccat ttgtttctta 421 tttgtgtaac tgtaagttca cctc atgggtttgg cttaggctgg tagcttctat 481 gtaattcgca atgattccat ctaaataaaa gttctatgat aaaa aaaaaaaaaa 541 aaaaaa Protein seguence: NCBI Reference Seguence: NP_005995.2 LOCUS NP_005995 ACCESSION NP_005995 1 mgledeqkml tesgdpeeee eeeeelvdpl ttvreqceql ekcvkarerl elcdervssr 61 shteedctee lfdflhardh cvahklfnnl k HSPA4 Official Sym bol: HSPA4 Official Name: heat shock 70kDa protein 4 Gene ID: 3308 Organism: Homo sapiens Other Aliases: APG-2, 52, HSPH2, RY, hsp70, hsp70RY Other Designations: heat shock 70 kDa protein 4; heat shock 70-related protein APG-2; heat shock 70kD protein 4; heat shock protein, 110 kDa; hsp70 RY Nucleotide ce: NCBI Reference Seguence: 154.3 LOCUS NM_002154 ACCESSION NM_002154 1 gctctggtgc tgcggctccg tcgc aacgagatct ttcgagatct tctccgcccc 61 cgctaccggc gcctcctctg cggccactga gccggagccg gcctgagcag cgctctcggt 121 tgcagtaccc actggaagga cttaggcgct gaca ccgcaagccc ctcagtagcc 181 caag aggcctgctt tccactcgct gccg ggggtccgtg tcctgtctcg 350350 24" gtggccggac ccgggcccga gcccgagcag tagccggcgc catgtcggtg gtgggcatag " acctgggctt ccagagctgc tacgtcgctg tggcccgcgc cggcggcatc atcg 36" agta tagcgaccgc tgcacgccgg cttgcatttc ttttggtcct aagaatcgtt 42" caattggagc agcagctaaa agccaggtaa tttctaatgc aaagaacaca ggat 48" ttaaaagatt ccatggccga gcattctctg atccatttgt ggaggcagaa aacc 54" ttgcatatga tattgtgcag ttgcctacag gattaacagg tataaaggtg acatatatgg 60" aggaagagcg aaattttacc actgagcaag tgactgccat gcttttgtcc aaactgaagg 66" agacagccga aagtgttctt cctg tagttgactg tgttgtttcg gttccttgtt 72" ctga tgcagaaaga cgatcagtga tggatgcaac tgct ggtcttaatt 78" gcttgcgatt aatgaatgaa accactgcag ttgctcttgc atatggaatc tataagcagg 84" atcttcctgc cttagaagag aaaccaagaa atgtagtttt tgtagacatg ggccactctg 90" cttatcaagt ttctgtatgt gcatttaata gaggaaaact tctg gccactgcat 96" ttgacacgac attgggaggt agaaaatttg atgaagtgtt agtaaatcac ttctgtgaag "02" aatttgggaa gaaatacaag ctagacatta aaat atta ttacgactct "08" ctcaggagtg tgagaaactc aagaaattga tgagtgcaaa tgcttcagat ctccctttga "14" gcattgaatg ttttatgaat gatgttgatg tatctggaac tatgaataga ggcaaatttc "20" tggagatgtg caatgatctc ttagctagag tggagccacc acttcgtagt gttttggaac "26" agtt aaagaaagaa gatatttatg cagtggagat agttggtggt gctacacgaa "32" tccctgcggt aaaagagaag atcagcaaat ttttcggtaa agaacttagt acaacattaa "38" atgctgatga agctgtcact Cgaggctgtg agtg ctta tcgcctgctt "44" tcag agaattttct atcactgatg tagtaccata tccaatatct ctgagatgga "50" attctccagc tgaagaaggg gact gtgaagtctt ttccaaaaat catgctgctc "56" ctttctctaa agttcttaca agaa aggaaccttt cactcttgag gcctactaca "62" gctctcctca ggatttgccc tatccagatc ctgctatagc tcagttttca gttcagaaag "68" ctca gtctgatggc tccagttcaa aagtgaaagt caaagttcga gtaaatgtcc "74" atggcatttt cagtgtgtcc agtgcatctt tagtggaggt tcacaagtct gaggaaaatg "80" aggagccaat ggaaacagat cagaatgcaa aggaggaaga gaagatgcaa gtggaccagg "86" aggaaccaca tgttgaagag caacagcagc agacaccagc agaaaataag gcagagtctg "92" aagaaatgga gacctctcaa gctggatcca aggataaaaa gatggaccaa ccaccccaag "98" ccaagaaggc aaaagtgaag accagtactg tggacctgcc aatcgagaat cagctattat 351351 204" ggcagataga cagagagatg ctcaacttgt acattgaaaa tgagggtaag atgatcatgc 210" aggataaact ggagaaggag cggaatgatg ctaagaacgc agtggaggaa tatgtgtatg 216" aaatgagaga caagcttagt ggtgaatatg agaagtttgt gagtgaagat gatcgtaaca 222" gttttacttt gaaactggaa gaaa attggttgta tgaggatgga gaagaccagc 228" caaagcaagt ttatgttgat aagttggctg aattaaaaaa tctaggtcaa cctattaaga 234" tacgtttcca ggaatctgaa gaacgaccaa aattatttga aggg aaacagatcc 240" aacagtatat gaaaataatc agctctttca aaaacaagga gtat ttgg 246" atgctgctga catgacaaag gtagaaaaaa atga agcaatggag tggatgaata 252" acaagctaaa tctgcagaac aagcagagtt tgaccatgga tgtc aagtcaaaag 258" agattgaagc taaaattaag gagctgacaa gtacttgtag ccctataatt tcaaagccca 264" aagt ggaacctcca aaagaggaac aaaaaaatgc agagcagaat ggaccagtgg 270" atggacaagg agacaaccca ggcccccagg ctgctgagca gggtacagac acagctgtgc 276" cttcggattc agacaagaag gaaa tggacattga ttgattccaa cacttgtttc 282" aaca gactattata aagctttaag ttgtcaactt tgttctaaat atcaactagc 288" gcaagtgaat actgaagatt tcttagtcag tttttagggg attttcgggg aggggaaata 294" ggtaatgtat ggagcatttt cacttctaaa tagttagata cagaaattaa gtgcattgta 300" tcat aatggtacta tttagaagcc cagttagtct tactgagctt atgcttcact 306" cctttatgtt taaccatgtg agaa taagtttgtt agtt gagctatagc 312" tacagctcta gctatccagc agacttttca ttatgactta catggcagga gctctaatta 318" tgctttaaaa tgtg gagattgctt taaatgctcc ctgcctggtg tggggatggg 324" gtccccctct ttgtgagggc tggagcatgg cacggcatgg attaacacgg cagaggaaca 330" aaggtgtgct ctgagcttct tcatatttca ccttcaccct cacctgtgtt ctcttccctc 3361 tctcccaata aaagggctcc catta n seguence: NCBI Reference Seguence: NP_002145.3 LOCUS NP_002145 ACCESSION 145 1 msvvgidlgf qscyvavara ggietianey sdrctpacis fgpknrsiga aaksqvisna 61 kntvqgfkrf hgrafsdpfv eaeksnlayd ivqlptgltg ikvtymeeer vtam 121 llsklketae svlkkpvvdc vvsvpcfytd aerrsvmdat qiaglnclrl mnettavala 352352 18" ygiykqdlpa leekprnvvf ayqv svcafnrgkl kvlatafdtt lggrkfdevl 24" vnhfceefgk kykldikski rallrlsqec msan asdlplsiec fmndvdvsgt " mnrgkflemc ndllarvepp lrsvleqtkl avei vggatripav kekiskffgk 36" elsttlnade avtrgcalqc ailspafkvr efsitdvvpy pislrwnspa eegssdcevf 42" sknhaapfsk vltfyrkepf tleayysqu dlpypdpaia qfqukvtpq sdgssskvkv 48" kvrvnvhgif svssaslvev hkseeneepm etdqnakeee kmquqeeph veeqqqqtpa 54" enkaeseeme tsqagskdkk mdqppqakka kvktstvdlp ienqllwqid remlnlyien 60" egkmiqukl ekerndakna veeyvyemrd ekfv seddrnsftl kledtenwLy 66" edgedqpkqv yvdklaelkn lgqpikirfq eseerpklfe elgkqiqqym kiissfknke 72" dqydhldaad mtkvekstne nkln lanqsltmd pvvkskeiea kikeltstcs 78" piiskpkpkv qkna eqngpvdgqg dnpgpqaaeq psds dkklpemdid PSMA7 Official Sym bol: PSMA7 Official Name: proteasome (prosome, macropain) subunit, alpha type, 7 Gene ID: 5688 Organism: Homo sapiens Other Aliases: RPS-1005F21.4, 06, HSPC, RCS-f XAPC7 Other Designations: proteasome subunit ROG-1; proteasome subunit XAPC7; proteasome subunit alpha 4; proteasome subunit alpha type-7 Nucleotide seguence: NCBI nce Seguence: NM_002792.3 LOCUS NM_002792 ACCESSION NM_002792 1 gcct gacgccgccc gtcgccggca gcgcaggaca cggcgccgag ggtggggcgc 61 gggcgtagtg gcgccgggag tcgcgggtgc gcgcgggccg tgagtgtgcg cttttgagag 121 tcgcggcgga aggagcccgg ccgccgcccg ccggcatgag ctacgaccgc gccatcaccg 181 tcttctcgcc cgacggccac ctcttccaag tggagtacgc ggcc gtcaagaagg 241 gctcgaccgc ggttggtgtt cgaggaagag acattgttgt tcttggtgtg gagaagaagt 301 cagtggccaa actgcaggat acag tgcggaagat ctgtgctttg gatgacaacg 353353 361 tctgcatggc aggc ctcaccgccg ggat agtcatcaac agggcccggq 421 tggagtgcca gagccaccgg ctgactgtgg aggacccggt cactgtggag tacatcaccc 481 gctacatcgc cagtctgaag cagcgttata cgcagagcaa tgggcgcagg ccgtttggca 541 tctctgccct catcgtgggt ttcgactttg atggcactcc taggctctat cagactgacc 601 cctcgggcac ataccatgcc tggaaggcca atgccatagg tcggggtgcc aagtcagtgc 661 gcgagttcct ggagaagaac tatactgacg aagccattga tgat ctgaccatta 721 agctggtgat caaggcactc ctggaagtgg ttcagtcagg tggcaaaaac attgaacttg 781 ctgtcatgag gcgagatcaa tccctcaaga ttttaaatcc tgaagaaatt gagaagtatg 841 ttgctgaaat tgaaaaagaa aaagaagaaa acgaaaagaa gaaacaaaag aaagcatcat 901 gatgaataaa atgtctttgc ttgtaatttt taaattcata tcaatcatgg atgagtctcg 961 atgtgtaggc ctttccattc catttattca cactgagtgt cctacaataa acttccgtat 1021 ttttaacctg aaaa aaaaaaaaaa Protein ce: NCBI nce Seguence: NP_002783.1 LOCUS NP_002783 ACCESSION NP_002783 1 msydraitvf spdghlfqve yaqeavkkgs tavgvrgrdi kksv aqudertvr 61 kicalddnvc mafagltada rivinrarve tved pvtveyitry iaslquytq 121 sngrrpfgis alivgfdfdg tprlyqtdps gtyhawkana svre fleknytdea 181 ietddltikl vikallevvq sggknielav mrrdqslkil npeeiekyva eiekekeene 241 kkqukas KIFSB Official S mbol: Kl FSB Official Name: kinesin family member SB Gene ID: 3799 Qrganigm: Homo sapiens Other Aliases: KINH, KNS, KNSl, UKHC 354354 Other ations: conventional kinesin heavy chain; kinesin 1 (110-120kD); kinesin heavy chain; kinesin-1 heavy chain; tous kinesin heavy chain Nucleotide seguence: NCBI nce Seguence: NM_004521.2 LOCUS NM_004521 ACCESSION NM_004521 l ccgc accgccctgt cgcccaacgg cggcctcagg agtgatcggg tcgg 6" ccggccagcg gacggcagag cgggcggacg cccg gcctgctctt cgcgaggagg l2" aagaaggtgg ccactctccc ggtccccaga acctccccag cccccgcagt ccgcccagac 18" cgtaaagggg gacgctgagg agccgcggac gctctccccg gtgccgccgc cgctgccgcc 24" gccatggctg ccatgatgga tcggaagtga gcattagggt ctgc cggcgccggc " tcttcaagtc cccc ggccgcctcc acccggggaa gcgcagcgcg gcgcagctga 36" cctc tcacggccct cgcgaccaca agccctcagg tccggcgcgt tccctgcaag 42" actgagcggc ggggagtggc tcccggccgc cggccccggc tgcgagaaag gacc 48" tggccgagtg caacatcaaa gtgatgtgtc gcttcagacc tctcaacgag tctgaagtga 54" accgcggcga caagtacatc gccaagtttc agggagaaga cacggtcgtg atcgcgtcca 60" atgc atttgatcgg gtgttccagt caagcacatc tcaagagcaa gtgtataatg 66" actgtgcaaa gaagattgtt gtac ttgaaggata aaca atatttgcat 72" atggacaaac atcctctggg aagacacaca caatggaggg taaacttcat gatccagaag 78" gcatgggaat tattccaaga atagtgcaag atatttttaa ttatatttac tccatggatg 84" aaaatttgga atttcatatt aaggtttcat attttgaaat atatttggat aagataaggg 90" acctgttaga tgtttcaaag accaaccttt cagttcatga agacaaaaac cgagttccct 96" atgtaaaggg gtgcacagag cg:tttgtat gtagtccaga tgaagttatg gataccatag "O2" atgaaggaaa atccaacaga ca:gtagcag ttacaaatat gaatgaacat agctctagga "08" gtcacagtat atttcttatt aa:gtcaaac aagagaacac acaaacggaa caaaagctga "l4" gtggaaaact ttatctggtt gctg gtagtgaaaa ggttagtaaa actggagctg "20" ctgt gctggatgaa gc:aaaaaca agtc actttctgct cttggaaatg "26" ttatttctgc tttggctgag gg:agtacat atgttccata tcgagatagt aaaatgacaa "32" gaatccttca agattcatta gg:ggcaact gtagaaccac tattgtaatt tgctgctctc "38" catcatcata caatgagtct gaaacaaaat ctacactctt atttggccaa agggccaaaa 355355 "44" caattaagaa cacagtttgt gtcaatgtgg ctgc gtgg aaaaagaagt "50" atgaaaaaga aaaagaaaaa atcc tgcggaacac gtgg cttgaaaatg "56" agctcaacag atggcgtaat acgg tgcctattga tgaacagttt gacaaagaga "62" aagccaactt tttc acagtggata aagatattac tcttaccaat ccag "68" caaccgcaat tggagttata ttta ctgatgctga aagaagaaag tgtgaagaag "74" aaattgctaa attatacaaa cagcttgatg acaaggatga agaaattaac cagcaaagtc "80" aactggtaga gaaactgaag acgcaaatgt tggatcagga ggagcttttg gcatctacca "86" atca agacaatatg caagctgagc tgaatcgcct tcaagcagaa aatgatgcct "92" ctaaagaaga agtgaaagaa gttttacagg ccctagaaga acttgctgtc aattatgatc "98" agaagtctca ggaagttgaa gacaaaacta aggaatatga tagt ttga 204" atcagaaatc ggcaacttta gcgagtatag atgctgagct tcagaaactt aaggaaatga 210" ccaaccacca gaaaaaacga gcagctgaga tgatggcatc tttactaaaa gaccttgcag 216" aaataggaat tgctgtggga aataatgatg taaagcagcc tgagggaact ggcatgatag 222" atgaagagtt cactgttgca agactctaca ttagcaaaat gaagtcagaa gtaaaaacca 228" tggtgaaacg ttgcaagcag agca cacaaactga gagcaacaaa aaaatggaag 234" aaaatgaaaa ggagttagca cagc ttcgtatctc tcaacatgaa gccaaaatca 240" agtcattgac tgaatacctt caaaatgtgg aacaaaagaa gttg gaggaatctg 246" tcgatgccct cagtgaagaa ctagtccagc ttcgagcaca agagaaagtc catgaaatgg 252" aaaaggagca cttaaataag gttcagactg caaatgaagt taagcaagct gttgaacagc 258" agatccagag ccatagagaa actcatcaaa aacagatcag tagtttgaga gatgaagtag 264" aagcaaaagc aaaacttatt actgatcttc aagaccaaaa ccagaaaatg atgttagagc 270" aggaacgtct aagagtagaa catgagaagt tgaaagccac agatcaggaa aagagcagaa 276" aactacatga acttacggtt atgcaagata gacgagaaca agcaagacaa gacttgaagg 282" gtttggaaga gacagtggca aaagaacttc agactttaca caacctgcgc aaactctttg 288" ttcaggacct ggctacaaga gttaaaaaga agat tgat gacaccggag 294" gcagcgctgc tcagaagcaa aaaatctcct ttcttgaaaa taatcttgaa cagctcacta 300" aagtgcacaa acagttggta cgtgataatg cagatctccg ctgtgaactt cctaagttgg 306" aaaagcgact tcgagctaca gctgagagag cttt ggaatcagca ctgaaagaag 312" ctaaagaaaa tgcatctcgt gatcgcaaac gctatcagca agaagtagat cgcataaagg 318" aagcagtcag gaat atggccagaa gagggcattc tgcacagatt gctaaaccta 356356 324" ttcgtcccgg gcaacatcca gcagcttctc caactcaccc aagtgcaatt Cgtggaggag 330" gtgcatttgt tcagaacagc cagccagtgg cagtgcgagg tggaggaggc aaacaagtgt 336" aatcgtttat acatacccac aggtgttaaa aagtaatcga agtacgaaga ggacatggta 342" tcaagcagtc attcaatgac tataacctct actcccttgg gattgtagaa ttataacttt 348" taaaaaaaat gtataaatta tacctggcct gtacagctgt ttcctaccta ctcttcttgt 354" aaactctgct gcttcccaac acaactagag tgcaattttg gcatcttagg agggaaaaag 360" gacagtttac aactgtggcc ctatttatta cacagtttgt ctatcgtgtc ttaaatttag 366" tctttactgt ctaa ctgtacctta taggactgta ctttttgtat tttttgtgta 372" tgtttatttt ttaatctcag tttaaattac ctagctgcta ctgcttcttg tttt 378" cctattaaaa cgtcttcctt tttttttctt aagagaaaat ggaacattta ggttaaatgt 384" ctttaaattt taccacttaa caacactaca taaa atatatccag tcagtactgt 390" attttaaaat cccttgaaat atca aaat tacttgtatt gtttctgaag 396" tttgctcctg aaaactactg cact gaaacgttac ctaa taggcatttg 102" agca cttg ttatctcatg aaatgcctgt gtta ttttgaatag 108" aaatatttta aagtatcaaa agcagatctt agtttaaggg agtttggaaa aggaattata 114" tttctctttt tcctgattct gtactcaaca agtcttgatg gaattaaaat actctgcttt Z20" attctggtga gcctgctagc taatataagt attggacagg taataatttg ttaa Z26" tattagtaaa atgaattaag atattatagg attaaacata attttatacg gttagtactt Z32" tattggccga cctaaattta tagcgtgtgg aaattgagaa agaa acaggacaga Z38" tatatgatga attaaaaata tatataggtc aattttggtc tgaaatccct gaggtgtttt Z44" taacctgcta cactaatttg aatt tatttcttta gtctagaaat agtaaattgt 150" ttgcaagtca ctaataatca ttagataaat tattttcttg gccatagccg ataattttgt 156" aatcagtact aagtgtatac ttgc cactttttcc tcagatgatt aaagtaagtc Z62" aacagcttat tttaggaaac tgtaaaagta atagggaaag agatttcact ttca Z68" tcagtggtag gggggcggtg actgcaactg tgttagcaga aattcacaga gaatggggat Z74" ttaaggttag cagagaaact tggaaagttc tgtgttagga tcttgctggc agaattaact 180" ttttgcaaaa gttttataca cagatatttg tattaaattt ggagccatag cagatcataa tatt tttctatttc cgtaactatt gtaatttcca cttttgtaat Z92" aattttgatt taaaatataa atttatttat ttattttttt aatagtcaaa aatctttgct Z98" gttgtagtct gcaacctcta aaatgattgt gttgctttta ggattgatca gaagaaacac 357357 504" aatt gagatgaaat gttggtgcag ccagttataa gtaatatagt taacaagcaa 510" aaaaagtgct gccacctttt atgatgattt tctaaatgga gaaacatttg gc:gcatcca 516" catagacctt tatgttttgt tttcagttga aaacttgcct cctttggcaa ca:tcgtaaa 522" tgaagcagaa tttttttttc tcttttttcc aaatatgtta gttttgttct tg:aagatgt 528" atcatgggta ttggtgctgt aaca acgaatttta attagcatgt gg:tcagaat 534" tgtt aggtttttaa aaagtatctt gatggttctt ttctatttat aa:ttcagac 540" tttcataaag tgtaccaaga atttcataaa tttgttttca gtgaactgct tt :tgctatg 546" gtaggtcatt agca cttactctta aaaatgaaaa tttctgatca tc:aggatat 552" tgacacattt caatttgcag tgtctttttg actggatata ttaacgttcc tggc 558" attgatagat ggttcagaag agaaactcaa aaag agaatattta gcga 564" ttaattaaat tatttgccta acttaagaaa actactgtgc gtaactctca gtttgtgctt 570" aactccattt gacatgaggt gacagaagag agtctgagtc tacctgtgga atatgttggt 576" ttattttcag tgcttgaaga tacattcaca aatacttggt ttgggaagac taat 582" tttaagttaa cttgcatgtt gtaaatgcgt tttatgttta aataaagagg aaaatttttt 5881 gaaatgtaaa aaaaaaaaaa aaaaa Protein seguence: NCBI nce Seguence: NP_004512.1 LOCUS NP_004512 ION NP_004512 l madlaecnik vmcrfrplne sevnrgdkyi aqugedtvv iaskpyafdr qusstsqeq 6" vyndcakkiv kdvlegyngt ifaygqtssg kthtmegklh dpegmgiipr iquifnyiy 12" smdenlefhi kvsyfeiyld kirdlldvsk tnlsvhedkn rvpyvkgcte rfvcspdevm 18" dtidegksnr mneh ifli nvkqentqte qklsgklylv dlagsekvsk 24" tgaegavlde akninkslsa lgnvisalae gstyvpyrds kmtrilqul ggncrttivi " ccspssynes lfgq raktikntvc vnveltaeqw ekek nkilrntiqw 36" lenelnrwrn getvpideqf dkekanleaf tvdkditltn dkpataigvi gnftdaerrk 42" ceeeiaklyk qlddkdeein qqsqlveklk tqmldqeell astrrdqdnm lqae 48" ndaskeevke vlqaleelav nyqusqeve dktkeyells delnqksatl asidaelqkl 54" kemtnhqkkr aaemmasllk dlaeigiavg nndvkqpegt gmideeftva rlyiskmkse 60" vktmvkrckq lestqtesnk kmeenekela acqlrisqhe akikslteyl qnveqkqul 358358 661 eesvdalsee lvqlraqekv hemekehlnk vqtanevkqa veqqiqshre sslr 721 deveakakli tdlqdqnqkm mleqerlrve heklkatdqe ksrklheltv qurreqarq 781 dlkgleetva kelqtlhnlr klqudlatr vkksaeidsd dtggsaaqkq kisflennle 841 qltkvhkqlv rdnadlrcel pklekrlrat aervkalesa lkeakenasr drkryqqevd 901 rikeavrskn marrghsaqi akpirpgth aaspthpsai rgggaquns qpvavrgggg 961 kqv RP325 Official Sym bol: RPS25 Official Name: ribosomal protein 825 Gene ID: 6230 Organism: Homo sapiens Other Aliases: S25 Other Designations: 408 ribosomal protein 825 Nucleotide seguence: NCBI Reference Seguence: NM_001028.2 LOCUS 028 ACCESSION NM_001028 1 cttccttttt gtccgacatc ttgacgaggc tgcggtgtct gctgctattc tccgagcttc 6" gcaatgccgc ctaaggacga gaag aaggacgctg cggc caagaaagac 12" aaagacccag tgaacaaatc cgggggcaag gccaaaaaga agaagtggtc caaaggcaaa 18" gttcgggaca agctcaataa cttagtcttg tttgacaaag ctacctatga taaactctgt 24" aaggaagttc ccaactataa aacc ccagctgtgg tctctgagag actgaagatt " cgaggctccc tggccagggc agcccttcag gagctcctta gtaaaggact tatcaaactg 36" gtttcaaagc acagagctca ttac aata gtgg agatgctcca 42" gctgctggtg aagatgcatg aataggtcca accagctgta catttggaaa aataaaactt 48" tattaaatca aaaaaaaaaa aaaa aaaa Protein ce: NCBI Reference Seguence: NP_001019.1 359359 LOCUS NP_001019 ACCESSION NP_001019 1 mppkddkkkk dagksakkdk dpvnksggka kgkv rdklnnlvlf dkatydklck 61 evpnyklitp avvserlkir gslaraalqe llskgliklv skhraqviyt rntkggdapa 121 ageda Official Symbol: HSP90ABf Official Name: heat shock protein 90kDa alpha (cytosolic), class B member 1 m: 3326 Organism: Homo sapiens Other Aliases: 2G2.1, D6S182, HSP84, HSP90-BETA, HSPQOB, HSPC2, HSPCB Other Designations: 90-kda heat shock protein beta HSP90 beta; heat shock 84 kDa; heat shock 90kD protein 1, beta; heat shock 90kDa protein 1, beta; heat shock protein HSP 90-beta; heat shock protein beta Nucleotide seguence: NCBI Reference Seguence: NM_007355.2 LOCUS NM_007355 ACCESSION 355 1 cgca gtgttgggac tgtctgggta tcggaaagca agcctacgtt gctcactatt 6" acgtataatc cttttctttt caagatgcct gaggaagtgc accatggaga ggaggaggtg 12" gagacttttg cctttcaggc agaaattgcc atgt ccctcatcat caataccttc 18" tattccaaca aggagatttt ccttcgggag ttgatctcta atgcttctga tgccttggac 24" cgct atgagagcct gacagaccct tcgaagttgg gtaa agagctgaaa " attgacatca tccccaaccc tcaggaacgt accctgactt tggtagacac aggcattggc 36" atgaccaaag ctgatctcat aaataatttg ggaaccattg ccaagtctgg tactaaagca 42" ttcatggagg aggc tggtgcagac atctccatga ttgggcagtt tggtgttggc 48" tctg cctacttggt ggcagagaaa gtggttgtga tcacaaagca caacgatgat 54" gaacagtatg cttgggagtc ttctgctgga ggttccttca ctgtgcgtgc tgaccatggt 360360 60" gagcccattg gcaggggtac caaagtgatc ctccatctta aagaagatca gacagagtac 66" ctagaagaga ggcgggtcaa agaagtagtg aagaagcatt ctcagttcat aggctatccc 72" atcacccttt atttggagaa ggaacgagag aaggaaatta gtgatgatga ggcagaggaa 78" gagaaaggtg agaaagaaga ggaagataaa gatgatgaag ccaa gatcgaagat 84" gtgggttcag atgaggagga tgacagcggt aaga agaagaaaac taagaagatc 90" aaat atca ggaagaacta aacaagacca agcctatttg gaccagaaac 96" cctgatgaca tcacccaaga ggagtatgga gaattctaca agagcctcac taatgactgg "02" gaagaccact tggcagtcaa gcacttttct gtagaaggtc agttggaatt cagggcattg "08" ctatttattc ctcgtcgggc tccctttgac ctttttgaga agaa aaagaacaac "14" atcaaactct atgtccgccg tgtgttcatc atggacagct gtgatgagtt gataccagag "20" tatctcaatt ttatccgtgg tgtggttgac tctgaggatc tgcccctgaa catctcccga "26" gaaatgctcc agcagagcaa aatcttgaaa gtcattcgca aaaacattgt taagaagtgc "32" cttgagctct tctctgagct ggcagaagac aaggagaatt aatt ctatgaggca "38" ttctctaaaa atctcaagct tggaatccac tcca ctaaccgccg ccgcctgtct "44" gagctgctgc gctatcatac ctcccagtct ggagatgaga tgacatctct gtcagagtat "50" gtttctcgca tgaaggagac acagaagtcc atctattaca tcactggtga gagcaaagag "56" caggtggcca actcagcttt tgtggagcga gtgcggaaac ggggcttcga ggtggtatat "62" atgaccgagc ccattgacga gtactgtgtg cagcagctca aggaatttga tgggaagagc "68" ctggtctcag ttaccaagga gggtctggag gagg atgaggagga gaagaagaag "74" gaga gcaaggcaaa gtttgagaac ctctgcaagc tcatgaaaga aatcttagat "80" aagaaggttg agaaggtgac aatctccaat agacttgtgt cttcaccttg tgtg "86" accagcacct acggctggac tatg gagcggatca tgaaagccca ggcacttcgg "92" tcca ccatgggcta ggcc aaaaagcacc tggagatcaa ccac "98" cccattgtgg tgcg gcagaaggct gaggccgaca agaatgataa ggcagttaag 204" gacctggtgg tgctgctgtt tgaaaccgcc ctgctatctt ctggcttttc ggat 210" ccccagaccc actccaaccg catctatcgc atgatcaagc taggtctagg tattgatgaa 216" gatgaagtgg cagcagagga acccaatgct gcagttcctg atgagatccc ccctctcgag 222" ggcgatgagg atgcgtctcg catggaagaa gtcgattagg ttaggagttc atagttggaa 228" aacttgtgcc cttgtatagt gtccccatgg gctcccactg cagcctcgag tgcccctgtc 234" ccacctggct ccccctgctg gtgtctagtg tttttttccc tctcctgtcc ttgtgttgaa 361361 2401 ggcagtaaac taagggtgtc aagccccatt ccctctctac tcttgacagc aggattggat 2461 tatt gtggtttatt tctt cattttgttc tgaaattaaa gtatgcaaaa 2521 taaagaatat ttaa aaaaaaaaaa aaaaaaaaaa aaaaaaa Protein seguence: NCBI Reference Seguence: 381.2 LOCUS NP_031381 ACCESSION NP_031381 1 mpeevhhgee evetfafqae iaqlmsliin tfysnkeifl relisnasda ldkiryeslt 61 dpskldsgke pnpq vdtg igmtkadlin nlgtiaksgt kafmealqag 121 adismigqu vgfysaylva ekvvvitkhn ddeqyawess aggsftvrad hgepigrgtk 181 vilhlkedqt eyleerrvke vvkkhsqfig ypitlyleke rekeisddea eeekgekeee 241 dkddeekpki edvgsdeedd sgkdkkkktk kikekyidqe elnktkpiwt rnpdditqee 301 ygefyksltn dwedhlavkh fsvegqlefr allfiprrap fdlfenkkkk nniklyvrrv 361 fimdscdeli peylnfirgv vdsedlplni sremquski lkvirknivk kclelfsela 421 edkenykkfy eafsknlklg nrrr lsellryhts qsgdemtsls eyvsrmketq 481 ksiyyitges keqvansafv ervrkrgfev idey cqulkefdg kslvsvtkeg 541 lelpedeeek kkmeeskakf enlcklmkei ldkkvekvti snrlvsspcc ivtstygwta 601 nmerimkaqa lrdnstmgym makkhleinp dhpivetqu ndka llfe 661 tallssgfsl edpqthsnri yrmiklglgi dedevaaeep naavpdeipp legdedasrm 721 eevd LMO7 Official Symbol: LMO7 Offigigl Name: LIM domain 7 Gene ID: 4008 Organism: Homo sapiens Other s: RP11-332E32, FBX20, FBXOZO, LOMP Other Designations: F-box only protein 20; F-box protein Fbx20; LIM domain only 7 protein; LIM domain only protein 7; LMO-7; zinc-finger domain-containing protein 362362 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_005358.5 LOCUS NM_005358 ACCESSION NM_005358 l ggaaagaagt ggaataatta ggaacctagg gtggggtagg gtagcaggac atttcaaaca 6" ttaatgagca tatgagattc caggtcttgt taaaatgcaa attctgattc agctggtagg l2" tgaggtctga gattgtgcat ttctaacaag cactcagata atcttaaggc tgttggcccc 18" agggtcacac ttatagtgat tttctagaac ccagttgggg aagtgaatct tgggcaggag 24" aaatacacac catt gagtttggag atctcatctg atataacttt ttaagaaaga " aaaataattt tatc caattgataa gctttcccac :aagtggctt tcccactaag 36" tggctgcgtt atgaaaattg cttcactttg aaacttctgg :cttggtaat atagaatttc 42" tgtgttctca cagtgcttga ttgagaatat gatattgaga :tatggcata aaa :atagtg 48" gctgtacaaa aaaaaataca ttattaggat ctctaacaat :atgtaaaag tca :tgcttc 54" agag ctcaaacttt ggtgtgagac ctggttttat :cttggcact tac :ctgagt 60" tgtcttaggc aaattaatac cttaagcaaa aatattctca :gtacatttt aca :gagaat 66" tataaatgaa gtacataaag tccagcagtc acaaatgtta :ctattatta cca :cgtcct 72" aagactgcaa tcagctatag tgaaagtagt ctcaaagatt gtttcataaa tca :cagatt 78" cacctaattt tctaaagaat ttaaataagg agatggaatg aatagattgc att :tgtttc 84" cagg ggaactgtgc atatttcttc tgtgactcgg aaatggttta act :ttaaaa 90" atcccaaaat agctgaagtt agcagacatg caatttacca aggatgattg gaa :ttttat 96" tgta ataatactat acccaagcac actgctcatg aggaaaacat ttt a "O2" atcttttact cttgggggca aagaatgctg tttttctttt tgataactat gtt :atagaa "08" tctaaatcac caat aaca tctaaagtta ccat tca :gtttca "l4" tttatagcta tttgaatttt gatgaatttc aatatggtgc tacagtgata gggcaagtgc "20" aaataagttc tggg tacggtctaa agctatttta atttttttat tacaactgct "26" aaaa ttaggatatg ccatattttc acgttttaca gttggatgtc ctatgatgtt "32" ctcttccaga gaacagagct tctg tgga tgat atgtgctcat "38" gtctgcatct gtgtgggttg tctc agggacagag tctgcagcaa aaaagatata "44" attttgagga ctgaacaaaa ttcaggaagg actattctca ttaaggcagt aacagagaag 363363 "50" aattttgaaa caaaagattt tcgagcctct ctagaaaatg gtgttctgct gtgtgatttg "56" attaataagc ttaaacctgg cgtcattaag aagatcaata gactgtctac accaatagca "62" ggattggata acgt gaaa gcttgtgaac agattggatt gaaagaagcc "68" cagcttttcc atcctggaga tctacaggat ttatcaaatc gagtcactgt caagcaagaa "74" gagactgaca ggagagtgaa aaatgttttg ataacattgt actggctggg aagaaaagca "80" aacc cgtactataa tggtccccat cttaatttga aagcgtttga gaatctttta "86" ggacaagcac tgacgaaggc actcgaagac ttcc tgaaaagaag tggcagggac "92" agtggctacg gtgacatctg gtgtcctgaa cgtggagaat ttcttgctcc tccaaggcac "98" cataagagag aagattcctt tgaaagcttg ttgg gctcgaggtc aagc 204" tgctcctctg atatcacgtt gagagggggg cgtgaaggtt ttgaaagtga ttcg 210" gaatttacat ttaagatgca ggattataat aaagatgata tgtcgtatcg aaggatttcg 216" gctgttgagc caaagactgc gttacccttc aatcgttttt tacccaacaa aagtagacag 222" ccatcctatg taccagcacc tctgagaaag aaaaagccag acaaacatga ggataacaga 228" agaagttggg caagcccggt ttatacagaa gcagatggaa cattttcaag actctttcaa 234" tatg gtgagaatgg gtcc atgagtgatg caga agatgttcaa 240" aacttgcgtc agctgcgtta cgaggagatg cagaaaataa aatcacaatt aaaagaacaa 246" gatcagaaat atga ccttgcaaaa tggaaagatc gtcgaaaaag ttca 252" gatctgcaga agaaaaaaga agagagagaa gaaattgaaa agcaggcact gtct 258" aagagaagct ctaagacgtt taaggaaatg ctgcaggaca gggaatccca aaag 264" gttc cgtcaagaag gagaatgtat tcttttgatg atgtgctgga ggaaggaaag 270" cgacccccta caatgactgt gtcagaagca agttaccaga gtgagagagt agaagagaag 276" ggagcaactt atccttcaga aattcccaaa gaagattcta ccacttttgc aaaaagagag 282" gaccgtgtaa caactgaaat tcagcttcct tctcaaagtc ctgtggaaga acaaagccca 288" gcctctttgt cttctctgcg ttcacggagc acacaaatgg aatcaactcg tgtttcagct 294" tctctcccca gaagttaccg gaaaactgat acagtcaggt taacatctgt ggtcacacca 300" agaccctttg gctctcagac aaggggaatc ctcc ccagatctta cacgatggat 306" gatgcttgga agtataatgg agatgttgaa gacattaaga gaactccaaa caatgtggtc 312" cctg caccaagccc ggacgcaagc caactggctt caagcttatc tagccagaaa 318" gaggtagcag aaga agatgtgaca aggctgccct ctcctacatc ccccttctca 324" tctctttccc aagaccaggc tgccacttct aaagccacat tgtcttccac atctggtctt 364364 330" gatttaatgt ctgg agaaggggaa atctccccac aaagagaagt atcc 336" caggatcagt tcagtgatat gagaatcagc ataaaccaga cgcctgggaa gagtcttgac 342" tttgggttta caataaaatg ggatattcct gggatcttcg tagcatcagt tgaagcaggt 348" agcccagcag aattttctca gctacaagta gatgatgaaa ctat taacaacacc 354" aagttttcat ataacgattc aaaagagtgg gaggaagcca tggctaaggc tcaagaaact 360" ggacacctag tgatggatgt gaggcgctat ggaaaggctg gttcacctga aacaaagtgg 366" attgatgcaa cttctggaat ttacaactca tctt caaatctatc tgtaacaact 372" gatttctccg aaagccttca taat tcca aagaaatcaa tggaattcat 378" gatgaaagca atgcttttga atcaaaagca tctgaatcca tttctttgaa aaacttaaaa 384" tcac aattttttga acaaggaagc tctgattcgg tggttcctga agtt 390" ccaaccatca gtgccccgag tcgctgggtg tgggatcaag aggaggagcg gaagcggcag 396" gagaggtggc agaaggagca ggaccgccta ctgcaggaaa aatatcaacg tgagcaggag 102" aaactgaggg aagagtggca aagggccaaa caggaggcag agaa ttccaagtac 108" ttggatgagg tggt cctaagctca aacagcatgt ccac acgggagccc 114" tctcttgcca cctgggaagc tacctggagt gaagggtcca agtcttcaga cagagaagga 120" acccgagcag gagaagagga gaggagacag ccacaagagg aagttgttca tgaggaccaa 126" ggaaagaagc cgcaggatca gcttgttatt gagagagaga ggaaatggga gcaacagctt 132" caggaagagc aagagcaaaa tcag gctgaggctg aggagcagaa gcgtcctgcg 138" gaggagcaga agcgccaggc agagatagag cgggaaacat cagtcagaat gtac Z44" aggaggcctg ttgattccta tgatatacca aagacagaag aagcatcttc aggttttctt 150" cctggtgaca ggaataaatc cagatctact ctgg atgattactc cacaaataaa 156" aatggaaaca ataaatattt agaccaaatt gggaacatga cctcttcaca atcc Z62" aagaaagaac aagtaccatc aggagcagaa ttggagaggc aacaaatcct tcaggaaatg Z68" aggaagagaa caccccttca caatgacaac agctggatcc gacagcgcag tgccagtgtc Z74" aacaaagagc ctgttagtct tcctgggatc atgagaagag gcgaatcttt agataacctg 180" gactcccccc gatccaattc ttggagacag cctccttggc agcc cacaggattc 186" tatgcttctt cctctgtgca agactttagt cgcccaccac ctcagctggt gtccacatca Z92" aaccgtgcct acatgcggaa cccctcctcc agcgtgcccc caccttcagc tggctccgtg Z98" aagacctcca ccacaggtgt ggccaccaca cagtccccca ccccgagaag ccattcccct 504" tcagcttcac agtcaggctc tcagctgcgt tcag tcagtgggaa gcgcatatgc 365365 510" tcctactgca ttct gggcaaagga atga tcatcgagtc cctgggtctt 516" tgttatcatt tgcattgttt taagtgtgtt gcctgtgagt gtgacctcgg ttcc 522" tcaggagctg aagtcaggat cagaaaccac caactgtact gcaacgactg ctatctcaga 528" ttcaaatctg gacggccaac cgccatgtga tgtaagcctc catacgaaag cactgttgca 534" gatagaagaa gaggtggttg ctgctcatgt agatctataa atatgtgttg tatgtctttt 540" ttgctttttt tttaaaaaaa agaataactt tttttgcctc tttagattac atagaagcat 546" tgtagtcttg gtagaaccag tatttttgtt gtttatttat aaggtaattg tgtgtgggga 552" aaagtgcagt atttacctgt tgaattcagc atcttgagag cacaagggaa aaaataagaa 558" cctacgaata tttttgaggc agataatgat ctagtttgac tttctagtta gtggtgtttt 564" gaagagggta ttttattgtt ttttaaaaaa aggttcttaa acattatttg aaatagttaa 570" tataaataca taattgcatt tgctctgttt attgtaatgt attctaaatt aatgcagaac 576" catatggaaa atttcattaa aatctatccc caaatgtgct ttctgtatcc ttccttctac 582" ttct gatttttaaa aatgcagtta catt tatttgcttg atgaagggag 588" ctctattttc tttaccagaa atgttgctaa gtaattccca atagaaagct gcttattttc 594" attaatgaaa aataaccatg gtttgtatac tagaagtctt cttcagaaac tggtgagcct 600" ttctgttcaa ttgcatttgt aaataaactt gctgatgcat ttaacgagtg ggtcgtcttt 606" ttcttaggtg tatgtgtctg acctcaggcc ttttagccat atttcagtat gtggcctttt 612" ttgatgttat gttttatcca gtagctttac taaggtataa ttgatgtaat aaactgcata 618" tatttaaagt gtatactttg acaaattttg acatggtgta taccttcgaa actatgccac 624" gatg tgtttactga aacattttaa taaggaagtt tatttttgat atgt 630" ttttggatac tttg tatggtgaga gtgatgaatt gttggatcat ttgaataaaa 636" tcttttacta accccatgat aaaaggagaa gacaacagtg agcttagaat atctataaag 642" caaaaaatgt agtctcttgt ttaaaaaatc ggga ggat acaaaacttt 648" cttt gagcaaaaat ttaaacttac tggaatcttt tataataatg taagtggaat 654" ttct aggaactgag attg gttc aaaatatgta agaaatgcta 660" gaga taaaaatttt atttagtact tattctgatt attattaaag taataatgtg 666" ttccttgagg ataacttgtc aaatgcccca aagcataaag aatataattc tgaatcccaa 672" attccaaaga caagaactct gtgtttgaat tgca tataattatt tataagtata 678" gattgtgaat ttttccatgt tcttaaaatt atttttatct tttttcatgg ttgcatagtg 684" ctccattgtt tggccttggt aatatttagt tgataattcc attactgtgt atttttcact 366366 6901 tgtttctaag atcaaacatt ttaatatgtg catgttatat ataaatatgt aaattctgtg 6961 atactctatg atcatctctt tctttatatt attttcatag acatgaaata gttgctcaga 7021 gattatgcat tttaagacac tcatagtata tattgccaaa gtggtttcca gaaaggcact 7081 gctggcttcg ataa gcagcacgtg ttca tctcactgca tgtttatgaa 7141 gatacagttc ttttgccttg ttctctgcct gatgtgtatg cagaggcagc cctcaatatg 7201 cagtggttga ataaatgaat gaagaaacca aaaa aaaa aaa Protein seguence (variant 1): NCBI Reference Seguence: NP_005349.3 LOCUS NP_005349 ION NP_005349 1 mkkirichif tfyswmsydv lgal eiqulicah vcicvgwlyl rdrvcskkdi 6" ilrteqnsgr tilikavtek nfetkdfras lengvllcdl inklkpgvik kinrlstpia 12" gldninvflk aceqiglkea qlfhpgdlqd lsnrvtvkqe etdrrvknvl itlywlgrka 18" qsnpyyngph lnlkafenll gqaltkaled ssflkrsgrd sgygdiwcpe pprh 24" hkredsfesl dslgsrslts cssditlrgg dtds eftfkquyn kddmsyrris " avepktalpf nrflpnksrq psyvpaplrk kkpdkhednr rswaspvyte adgtfsrlfq 36" kiygengsks msdvsaedvq nqulryeem qkiksqlkeq quwqddlak wkdrrksy:s 42" dlqkkkeere leks krssktfkem lqdresqnqk stvpsrrrmy sfddvleegk 48" rpptmtvsea syqserveek gatypseipk edsttfakre drvtteiqlp sqspveeqsp 54" aslsslrsrs tqmestrvsa slprsyrktd tvrltsvvtp trgi sslprsytmd 60" gdve dikrtpnnvv stpapspdas qlasslssqk evaateedvt rlpsptspfs 66" slsqdqaats katlsstsgl dlmsesgege iqurevsrs mris inqtpgks;d 72" fgftikwdip gifvasveag qlqv ddeiiainnt kfsyndskew eeamakaqet 78" ghlvmdvrry gkagspetkw idatsgiyns ekssnlsvtt dfseslqssn ieskeingih 84" desnafeska sesislknlk rrsqffqus sdsvvpdlpv srwv wdqeeerqu 90" erqueqdrl lqekyqreqe klreewqrak qeaerensky ldeelmvlss nsmslttrep 96" slatweatws egskssdreg trageeerrq pqeevvhedq gkkpqdqlvi ererkweqql 102" qeeqeqkrlq aeaeeqkrpa eeqquaeie retsvriyqy rrpvdsydip kteeassgfl 108" pgdrnksrst telddystnk ldqi gnmtssqrrs kkeqvpsgae lerqqilqem 367367 1141 rkrtplhndn swirqrsasv nkepvslpgi mrrgesldnl dsprsnsqu ppwlnqptgf 1201 yassqudfs rpppqlvsts nraymrnpss svpppsagsv ktsttgvatt qsptprshsp 1261 sasqsgsqlr nrsvsgkric sycnnilgkg aamiieslgl cyhlhcfkcv acecdlggss 1321 sgaevrirnh qucndcylr tam Nucleotide sequence (variant 2): NCBI Reference Seguence: NM_015842.2 LOCUS 842 ACCESSION NM_015842 1 taat gtttaacgtg ccagtcacaa agatcacaga aacagtgtat gcccgggcat 6" aagatagcac gactgtgtat gctctggagg actgaaaggc tgtacaagcc ctatgtattt 12" tttttcaaat atacatatgc atgggtcttg ctgctgcctc ttttgctgac tgtaattgga 18" ctttgaagct tcgaagttat atcataaaaa tttgtaacct ttgtctgaga tcag 24" ctaagcaatc actttccact caca ggataatata aacgttttct tgaaagcttg " tgaacagatt ggattgaaag aagcccagct tttccatcct ggagatctac aggatttatc 36" aaatcgagtc actgtcaagc aagaagagac tgacaggaga gtgaaaaatg ttttgataac 42" attgtactgg ctgggaagaa aagcacaaag caacccgtac ggtc cccatcttaa 48" tttgaaagcg tttgagaatc ttttaggaca agcactgacg aaggcactcg aagactccag 54" cttcctgaaa agaagtggca gggacagtgg ctacggtgac atctggtgtc ctgaacgtgg 60" agaatttctt gctcctccaa ggcaccataa gagagaagat tcctttgaaa gcttggactc 66" tttgggctcg aggtcattga caagctgctc ctctgatatc acgttgagag gggggcgtga 72" aggttttgaa agtgacacag attcggaatt taag gatt ataataaaga 78" tgatatgtcg agga tttcggctgt tgagccaaag actgcgttac ccttcaatcg 84" ttttttaccc agta gacagccatc ctatgtacca ctga gaaagaaaaa 90" gccagacaaa catgaggata acagaagaag ttgggcaagc tata cagaagcaga 96" tggaacattt tcaagtaatc agaggaggat ttggggcacc aatgtggaga actggccaac 102" tgtacaagga acttcaaagt cctcttgtta tttggaagag gaaaaagcaa gaag 108" catacccaac attgtaaagg atgatcttta tgtgcgcaag ctcagtccag tcatgccaaa 114" cccagggaat gcttttgatc agtttcttcc caaatgttgg accccagaag atgtgaactg 120" gaaaagaata aaaagggaaa agcc atggtataaa cagg gattcagtca 368368 "26" gtttttactg cttcaggccc cata ctctgatgac atcttgtctt ctgaaacaca "32" taccaaaatt gatcccactt ctggcccaag gctcataacc cgcaggaaga ctta "38" tgcaccaggc tatagaagag tcga gatggcagcc ctggatcctg agaa "44" tgatgatttc tttgtcagaa agactggggc tttccatgca aatccatatg ttctccgagc "50" agac tttagaaagt tctctgagca agatgattct gtagagcgag atataatttt "56" acagtgtaga gaaggtgaac ttgtacttcc ggatttggaa aaagatgata tgattgttcg "62" ccgaattcca gcacagaaga aagaagtgcc tggg gccccagata gataccaccc "68" agtccctttt cccgaaccct ggactcttcc tccagaaatt caagcaaaat ttctctgtgt "74" acttgaaagg acatgcccat ccaaagaaaa aagtaatagc tgtagaatat tagttccttc "80" atatcggcag aagaaagatg acatgctgac acgtaagatt cagtcctgga aactgggaac "86" taccgtgcct cccatcagtt tcacccctgg cccctgcagt gaggctgact tgaagagatg "92" ggaggccatc cgggaggcca gcagacttag gcacaagaaa aggctgatgg tggagagact "98" ctttcaaaag atttatggtg agaatgggag taagtccatg agtgatgtca aaga 204" tgttcaaaac ttgcgtcagc tgcgttacga ggagatgcag aaaataaaat cacaattaaa 210" agaacaagat cagaaatggc acct tgcaaaatgg aaagatcgtc gaaaaagtta 216" cacttcagat ctgcagaaga aaaaagaaga gagagaagaa attgaaaagc aggcacttga 222" gaagtctaag tcta agacgtttaa ggaaatgctg caggacaggg aaaa 228" tcaaaagtct acagttccgt caagaaggag aatgtattct tttgatgatg tgctggagga 234" aggaaagcga ccccctacaa tgactgtgtc agaagcaagt taccagagtg taga 240" agagaaggga gcaacttatc cttcagaaat tcccaaagaa gattctacca cttttgcaaa 246" ggac acaa ctgaaattca gcttccttct caaagtcctg tggaagaaca 252" aagcccagcc tctttgtctt ctctgcgttc caca caaatggaat gtgt 258" ttcagcttct ctccccagaa gttaccggaa aactgataca gtcaggttaa catctgtggt 264" aaga ccctttggct ctcagacaag gggaatctca tcactcccca gatcttacac 270" gatggatgat gcttggaagt gaga tgttgaagac attaagagaa ctccaaacaa 276" tgtggtcagc acccctgcac caagcccgga cgcaagccaa ctggcttcaa gcttatctag 282" ccagaaagag gtagcagcaa cagaagaaga tgtgacaagg ctgccctctc ctacatcccc 288" cttctcatct ctttcccaag accaggctgc cacttctaaa gccacattgt cttccacatc 294" tggtcttgat ttaatgtctg aatctggaga aggggaaatc tccccacaaa gagaagtctc 300" aagatcccag gatcagttca gtgatatgag aatcagcata aaccagacgc ctgggaagag 369369 306" tcttgacttt gggtttacaa taaaatggga tattcctggg gtag catcagttga 312" agcaggtagc ccagcagaat tttctcagct acaagtagat gatgaaatta ttgctattaa 318" caacaccaag ttttcatata acgattcaaa agagtgggag gaagccatgg ctaaggctca 324" agaaactgga cacctagtga tggatgtgag tgga aaggctggtt cacctgaaac 330" aaagtggatt gatgcaactt ctggaattta caactcagaa aaatcttcaa atctatctgt 336" tgat ttctccgaaa gccttcagag ttctaatatt gaatccaaag aaatcaatgg 342" aattcatgat gaaagcaatg cttttgaatc atct gaatccattt ctttgaaaaa 348" cttaaaaagg cgatcacaat tttttgaaca aggaagctct gattcggtgg ttcctgatct 354" tccagttcca accatcagtg ccccgagtcg ctgggtgtgg gatcaagagg aggagcggaa 360" gcggcaggag caga aggagcagga ccgcctactg caggaaaaat atcaacgtga 366" gcaggagaaa ctgagggaag agtggcaaag acag gaggcagaga gagagaattc 372" caagtacttg gatgaggaac tcct aaac agcatgtctc tgaccacacg 378" ggagccctct cttgccacct gggaagctac ctggagtgaa gggtccaagt cttcagacag 384" agaaggaacc cgagcaggag aagaggagag gagacagcca caagaggaag ttgttcatga 390" ggaccaagga ccgc aggatcagct tgag agagagagga aatgggagca 396" acagcttcag gaagagcaag agcaaaagcg gcttcaggct gaggctgagg agcg 102" ggag gagcagaagc gccaggcaga gatagagcgg gaaacatcag tcagaatata 108" ccagtacagg aggcctgttg attcctatga tataccaaag acagaagaag catcttcagg 114" ttttcttcct ggtgacagga ataaatccag atctactact gaactggatg attactccac 120" aaataaaaat ggaaacaata aatatttaga ccaaattggg aacatgacct cttcacagag 126" gagatccaag aaagaacaag cagg agcagaattg caac aaatccttca 132" ggaaatgagg aagagaacac cccttcacaa tgacaacagc tggatccgac agcgcagtgc 138" cagtgtcaac aaagagcctg ttagtcttcc tgggatcatg agaagaggcg aatctttaga Z44" taacctggac tccccccgat ccaattcttg gagacagcct ccttggctca atcagcccac 150" aggattctat gcttcttcct aaga tcgc ccaccacctc agctggtgtc 156" cacatcaaac cgtgcctaca tgcggaaccc ctcctccagc gtgcccccac cttcagctgg Z62" ctccgtgaag acctccacca caggtgtggc caccacacag tcccccaccc cgagaagcca Z68" ttca gcttcacagt caggctctca gctgcgtaac agtgtgttgc ctgtgagtgt Z74" gacctcggag gctcttcctc aggagctgaa gtcaggatca gaaaccacca actgtactgc 180" aacgactgct atctcagatt caaatctgga accg ccatgtgatg taagcctcca 370370 486" tacgaaagca ctgttgcaga tagaagaaga ggtggttgct gtag atctataaat 492" atgtgttgta tgtctttttt gctttttttt taaaaaaaag aataactttt tttgcctctt 498" tagattacat agaagcattg tggt agaaccagta ttgt ttatttataa 504" ggtaattgtg tgtggggaaa agtgcagtat ttacctgttg aattcagcat cttgagagca 510" aaaa aataagaacc tacgaatatt tttgaggcag ataatgatct actt 516" tctagttagt ttga agagggtatt ttattgtttt ttaaaaaaag gttcttaaac 522" attatttgaa atagttaata taaatacata attgcatttg ctctgtttat tgtaatgtat 528" tctaaattaa tgcagaacca tatggaaaat ttcattaaaa tctatcccca aatgtgcttt 534" ctgtatcctt ccttctacct attattctga tttttaaaaa tgcagttaat gtaccattta 540" tttgcttgat gaagggagct ctattttctt taccagaaat gttgctaagt aattcccaat 546" agaaagctgc ttattttcat taatgaaaaa taaccatggt ttgtatacta ttct 552" tcagaaactg gtgagccttt ctgttcaatt gcatttgtaa ataaacttgc tgatgcattt 558" aacgagtggg tcgtcttttt cttaggtgta tgtgtctgac ctcaggcctt ttagccatat 564" atgt ggcctttttt gatgttatgt tttatccagt agctttacta aggtataatt 570" gatgtaataa tata tttaaagtgt atactttgac aaattttgac atggtgtata 576" ccttcgaaac tatgccacag tctggatgtg tttactgaaa cattttaata aggaagttta 582" tttttgataa agttatgttt ttggatacaa tgta tggtgagagt gatgaattgt 588" tggatcattt gaataaaatc ttttactaac cccatgataa aaggagaaga tgag 594" cttagaatat ctataaagca aaaaatgtag gttt aaaaaatctg gagcgggaat 600" gcaaggatac aaaactttag catgctttga gcaaaaattt aaacttactg gaatctttta 606" tgta agtggaatgg aggattctag gaactgagaa ctgtattgga tcaa 612" taag aaatgctaat gtgggagata aaaattttat ttagtactta ttat 618" tattaaagta ataatgtgtt ccttgaggat aacttgtcaa atgccccaaa gcataaagaa 624" tataattctg aatcccaaat tccaaagaca agaactctgt gtttgaattc attctgcata 630" taattattta taagtataga ttgtgaattt ttccatgttc ttaaaattat ttttatcttt 636" tttcatggtt gcatagtgct ccattgtttg gccttggtaa tatttagttg ataattccat 642" tactgtgtat ttttcacttg tttctaagat caaacatttt aatatgtgca tgttatatat 648" aaatatgtaa attctgtgat actctatgat catctctttc tttatattat tttcatagac 654" atgaaatagt tgctcagaga ttatgcattt taagacactc atagtatata ttgccaaagt 660" caga aaggcactgc tggcttcgac aagc agcacgtggg cttgttcatc 371371 6661 tcactgcatg tttatgaaga tacagttctt ttgccttgtt ctctgcctga tgtgtatgca 6721 gaggcagccc tcaatatgca gtggttgaat aaatgaatga agaaaccact atcaaaaaaa 6781 aaaaaaaaaa a Protein seguence (variant 2): NCBI Reference Seguence: NP_056667.2 LOCUS NP_056667 ACCESSION NP_056667 1 dms yrrisavepk rflp nksrqpsyvp aplrkkkpdk hednrrswas 6" pvyteadgtf sanrriwgt nvenwptvqg tsksscylee ekaktrsipn ivkddlyvrk 12" lspvmpnpgn aquflpkcw tpedvnwkri kretykpwyk efqgfsqfll lqalqtysdd 18" ilssethtki dptsgprlit rrknlsyapg yrrddlemaa nddf fvrktgafha 24" npyvlrafed frkfseqdds lqcr egelvlpdle kddmivrrip aqkkevplsg " pvpf pepwtlppei qakflcvler ksns crilvpsyrq kkddmltrki 36" qswklgttvp gpcs eadlkrweai reasrlrhkk lqu iygengsksm 42" sdvsaedvqn qulryeemq kiksqlkeqd qkwqddlakw kdrrksytsd lqkkkeeree 48" iekqaleksk rssktfkeml qdresqnqks tvpsrrrmys egkr pptmtvseas 54" yqserveekg atypseipke dsttfakred rvtteiqlps qspveeqspa slsslrsrst 60" qmestrvsas lprsyrktdt vrltsvvtpr pfgsqtrgis slprsytmdd awkyngdved 66" ikrtpnnvvs tpapspdasq lasslssqke vaateedvtr lpsptspfss lsqdqaatsk 72" atlsstsgld lmsesgegei qurevsrsq dqfsdmrisi nqtpgksldf gftikwdipg 78" ifvasveags paefsqlqu deiiainntk fsyndskewe qetg hlvmdvrryg 84" kagspetkwi datsgiynse kssnlsvttd fseslqssni eskeingihd esnafeskas 90" esislknlkr rsqffquss dsvvpdlpvp tisapsrwvw dqeeerque rqueqerl 96" qekyqreqek lreewqrakq eaerenskyl lssn reps latweatwse 102" gskssdregt rrqp qeevvhedqg lvie rerkweqqlq eeqeqkrlqa 108" a qkrpa qqua i r tsvriyqyr rpvdsydipk teeassgflp gdrnksrstt 114" elddystnkn gnnkyldqig nmtssqrrsk keqvpsgael erqqilqemr krtplhndns 120" wirqrsasvn kepvslpgim rrgesldnld sprsnsqup pwlnqptgfy assqudfsr 126" stsn raymrnpsss vpppsagsvk tsttgvattq sptprshsps asqsgsqlrn 132" svlpvsvtse alpqelksgs ettnctatta isdsnldgqp pcdvslhtka llqieeevva 372372 1381 ahvdl CARS al Symbol: CARS Official Name: cysteinyl-tRNA synthetase Gene ID: 833 Organism: Homo sapiens Other Aliases: CARS1, CYSFlS, MGC:11246 Other Designations: ne tRNA ligase 1, asmic; cysteine translase; cysteine--tRNA , cytoplasmic Nucleotide sauence (variant 1): NCBI Reference Sequence: NM_139273.3 LOCUS NM_1 39273 AQQE§§IQN NM_139273 1 gtggggcgcg acttccgggg cggcggttgc atcagattct aggaagtgtc tgtagccgca 61 gctgcgggtc cgggattccc agccatgqca gattcctccg ggcagcaggg caaaggccgg 121 cgtgtgcagc ggtc ccctcctgct gggacccagc catgcagact ccacctttac 181 aacagcctca acaa ggaagtgttc caag atgggaaaaa ggtgacgtgg 241 tattgctgtg ggccaaccgt ctatgacgca atgg ggcacgccag gtcctacatc 301 tcttttgata tcttgagaag agtgttgaag gattacttca aatttgatgt cttttattgc 361 atgaacatta cggatattga tgacaagatc atcaagaggg cccggcagaa ccacctgttc 421 gagcagtatc gggagaagag gcctgaagcg gcacagctct tggaggatgt tcaggccgcc 481 ctgaagccat tttcagtaaa attaaatgaq accacggatc ccgataaaaa gcagatgctc 541 gaacggattc agcacgcagt tgcc acagagccac ttgagaaagc gtcc 601 agactcacgg gagaggaagt caacagctgt gtggaggtgt tgctggaaga agccaaggat 661 ttgctctctg actggctgga ttctacactt ggctgtgatg tcactgacaa ttccatcttc 721 ctgc ccaagttctg ggagggggac ttccacagag acatggaagc tctgaatgtt 781 ctccctccag atgtcttaac ccgggttagt gagtatgtgc cagaaattgt gaactttgtc 373373 84" cagaagattg tggacaacgg ttacggctat gtctccaatg ggtctgtcta ctttgataca 90" gcgaagtttg cttctagcga gaagcactcc tatgggaagc tggtgcctga ggccgttgga 96" gatcagaaag cccttcaaga aggggaaggt gacctgagca tctctgcaga ccgcctgagt "O2" gagaagcgct ctcccaacga ctttgcctta tggaaggcct ctaagcccgg agaaccgtcc "08" tggccgtgcc cttggggaaa gggtcgtccg ggctggcata tcgagtgctc ggccatggca "l4" ctcc taggggcttc gatggacatt cacggaggtg ggttcgacct ccggttcccc .20" caccatgaca tggc acagtcggag gcctactttg aaaacgactg ctgggtcagg "26" tacttcctgc acacaggcca cctgaccatt gcaggctgca aaatgtcaaa gtcactaaaa "32" aacttcatca ccattaaaga tgccttgaaa aagcactcag cacggcagtt gcggctggcc "38" ttcctcatgc actcgtggaa ggacaccctg tcca gcaacaccat ggagtcagcg "44" cttcaatatg agaagttctt gaatgagttt ttcttaaatg tgaaagatat ccttcgcgct "50" cctgttgaca tcactggtca gtttgagaag gaag aagaagcaga taag "56" tatg acaagaagac agcaattcac aaagccctct gtgacaatgt tgacacccgc "62" accgtcatgg aagagatgcg ggccttggtc agtcagtgca acctctatat ggcagcccgg "68" aaagccgtga ggaagaggcc caaccaggct ctgctggaga ccct gtacctcacc "74" catatgctga agatctttgg agaa gaggacagct ccctgggatt cgga "80" gggcctggaa ccagcctcag tctcgaggcc acagtcatgc cctaccttca atca "86" gaattccgag aaggagtgcg tgcc cgagagcaaa aagtccctga gcag "92" gatg ggga caacatcctg cccgagcttg gggtgcggtt tgaagaccac "98" gaaggactgc ccacagtggt gaaactggta gacagaaaca taaa agagagagaa 204" gaaaagagac gggttgaaga ggagaagagg aagaagaaag aggaggcggc ccggaggaaa 210" caggaacaag aagcagcaaa gctggccaag atgaagattc cccccagtga gatgttcttg 216" tcagaaaccg acaaatactc caagtttgat gaaaatgtaa gcatggtctg cccacacatg 222" acatggaggg caaagagctc agcaaagggc aagccaagaa gctgaagaag ctcttcgagg 228" ctcaggagaa gctctacaag gaatatctgc agatggccca gaatggaagc ttccagtgag 234" ggggcacagg actgactttt taaaccattg tggactagtg gctgctgtct gtga 240" caatgtccca gcgctcctat catgtttaca gtcacccttg ggtcctaaat taagagttgt 246" gttcatgtag gttcgtgtcg tcgttggctc tgagacattg ataataaatt tttctcaaca 2521 gtgagaccct caaaaaaaaa aaaaaaaaaa aaaaaaaa Protein sequence (variant 1): 374374 NCBI Reference Seguence: 802.1 LOCUS N P_644802 ACCESSION NP_644802 l madssgqqgk grrvqpqwsp pagtqpcrlh lynsltrnke vfipngkkv twyccgptvy 6" dashmghars yisfdilrrv fdvf ycmnitdidd kiikrarqnh lfeqyrekrp 12" eaaqlledvq aalkpfsvkl nettdpdkkq mleriqhavq lateplekav eevn 18" scvevlleea kdllsdwlds tlgcdvtdns ifsklpkfwe gdfhrdmeal nvlppdvltr 24" vseyvpeivn qukivdngy gyvsngsvyf dtakfassek hsygklvpea vgqualqeg " egdlsisadr lsekrspndf alwkaskpge pSWpprgkg rpgwhiecsa magtllgasm 36" dihgggfdlr fphhdnelaq seayfendcw vryflhtghl tiagckmsks lknfitikda 42" lkkhsarqlr laflmhswkd tldyssntme salqyekfln efflnvkdil rapvditgqf 48" ekwgeeeael nknfydkkta ihkalcdnvd trtvmeemra lyma arkavrkrpn 54" qallenialy lthmlkifga veedsslgfp vggpgtslsl eatvmpquv gvrk 60" iarequpei lqlsdalrdn ilpelgvrfe tvvk lvdrntllke reekrrveee 66" krkkkeeaar rkqeqeaakl akmkippsem kysk fdenvsmvcp hmtwrakssa 72" kgkprs Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001751.5 LOCUS NM_001751 ACCESSION NM_001751 l gtggggcgcg acttccgggg cggcggttgc atcagattct aggaagtgtc tgtagccgca 6" gctgcgggtc cgggattccc agccatggca gattcctccg aggg caaaggccgg 12" cgtgtgcagc cccagtggtc ccctcctgct gggacccagc catgcagact ccacctttac 18" aacagcctca acaa ggaagtgttc atacctcaag atgggaaaaa ggtgacgtgg 24" tattgctgtg ggccaaccgt ctatgacgca tctcacatgg ggcacgccag gtcctacatc " tcttttgata gaag agtgttgaag gattacttca aatttgatgt cttttattgc 36" atgaacatta cggatattga tgacaagatc atcaagaggg cccggcagaa ccacctgttc 42" tatc gggagaagag gcctgaagcg gcacagctct tggaggatgt cgcc 48" ctgaagccat tttcagtaaa attaaatgag accacggatc ccgataaaaa gctc 54" gaacggattc agcacgcagt gcagcttgcc acagagccac ttgagaaagc tgtgcagtcc 375375 60" acgg gagaggaagt caacagctgt gtggaggtgt aaga agccaaggat 66" ttgctctctg actggctgga ttctacactt ggctgtgatg tcactgacaa ttccatcttc 72" tccaagctgc ccaagttctg ggagggggac ttccacagag acatggaagc tctgaatgtt 78" ctccctccag atgtcttaac ccgggttagt gagtatgtgc cagaaattgt gaactttgtc 84" cagaagattg tggacaacgg ttacggctat aatg ggtctgtcta taca 90" gcgaagtttg cttctagcga ctcc tatgggaagc tggtgcctga ggccgttgga 96" aaag aaga aggggaaggt gacctgagca tctctgcaga ccgcctgagt "O2" gagaagcgct ctcccaacga ctttgcctta tggaaggcct ctaagcccgg agaaccgtcc "08" tggccgtgcc cttggggaaa gggtcgtccg ggctggcata tcgagtgctc ggccatggca "14" ggcaccctcc cttc catt cacggaggtg ggttcgacct ccggttcccc .20" caccatgaca atgagctggc acagtcggag tttg aaaacgactg ctgggtcagg "26" tacttcctgc acacaggcca cctgaccatt gcaggctgca aaatgtcaaa gtcactaaaa "32" aacttcatca ccattaaaga tgccttgaaa aagcactcag cacggcagtt ggcc "38" ttcctcatgc actcgtggaa ggacaccctg gactactcca gcaacaccat ggagtcagcg "44" cttcaatatg agaagttctt gaatgagttt ttcttaaatg tgaaagatat ccttcgcgct "50" cctgttgaca tcactggtca gtttgagaag tggggagaag aagaagcaga actgaataag "56" aacttttatg acaagaagac agcaattcac ctct gtgacaatgt tgacacccgc "62" accgtcatgg aagagatgcg ggccttggtc agtcagtgca acctctatat ggcagcccgg "68" aaagccgtga ggaagaggcc caaccaggct ctgctggaga acatcgccct cacc "74" catatgctga agatctttgg ggccgtagaa gaggacagct ccctgggatt cccggtcgga .80" gggcctggaa ccagcctcag tctcgaggcc acagtcatgc cctaccttca ggtgttatca "86" gaattccgag aaggagtgcg gaagattgcc cgagagcaaa aagtccctga gcag "92" ctcagcgatg ggga cctg cccgagcttg gggtgcggtt tgaagaccac "98" ctgc ccacagtggt gaaactggta gacagaaaca ccttattaaa agagagagaa 204" gaaaagagac gggttgaaga ggagaagagg aagaagaaag aggaggcggc ccggaggaaa 210" caggaacaag aagcagcaaa gctggccaag atgaagattc cccccagtga gatgttcttg 216" tcagaaaccg acaaatactc caagtttgat gaaaatggtc tgcccacaca ggag 222" ggcaaagagc tcagcaaagg gcaagccaag aagctgaaga agctcttcga ggctcaggag 228" aagctctaca aggaatatct gcagatggcc cagaatggaa gcttccagtg agggggcaca 234" ggactgactt tttaaaccat tgtggactag tggctgctgt ctgcctcagt gacaatgtcc 376376 2401 cagcgctcct atcatgttta cagtcaccct tgggtcctaa attaagagtt gtgttcatgt 2461 aggttcgtgt cgtcgttggc tctgagacat tgataataaa tttttctcaa cagtgagacc 2521 ctcaaaaaaa aaaaaaaaaa aaaa Protein seguence nt 2): NCBI Reference Seguence: NP_001742.1 LOCUS NP_001742 ACCESSION 742 1 madssgqqgk grrvqpqwsp pagtqpcrlh lynsltrnke vfipngkkv ptvy 6" dashmghars yisfdilrrv lkdyfkfdvf ycmnitdidd kiikrarqnh lfeqyrekrp 12" eaaqlledvq aalkpfsvkl nettdpdkkq mleriqhavq lateplekav qsrltgeevn 18" scvevlleea wlds tlgcdvtdns ifsklpkfwe gdfhrdmeal nvlppdvltr 24" vseyvpeivn qukivdngy gyvsngsvyf dtakfassek hsygklvpea vgqualqeg " egdlsisadr lsekrspndf alwkaskpge pSWpprgkg rpgwhiecsa magtllgasm 36" dihgggfdlr fphhdnelaq seayfendcw vryflhtghl tiagckmsks 1knfitikda 42" lkkhsarqlr laflmhswkd tldyssntme salqyekfln efflnvkdil rapvditgqf 48" ekwgeeeael nknfydkkta dnvd emra lvsqcnlyma arkavrkrpn 54" qallenialy lthmlkifga lgfp vggpgtslsl eatvmpquv lsefregvrk 60" pei lqlsdalrdn ilpelgvrfe dheglptvvk lvdrntllke reekrrveee 66" krkkkeeaar rkqeqeaakl akmkippsem flsetdkysk fdenglpthd megkelskgq 72" akklkklfea qeklykequ maqngsfq Nucleotide seguence (variant 3): NCBI Reference Seguence: 014437.2 LOCUS NM_001014437 ION NM_001014437 1 gtggggcgcg acttccgggg cggcggttgc atcagattct aggaagtgtc tgtagccgca 61 gctgcgggtc cgggattccc agccatggca gattcctccg ggcagcaggc tcctgactac 121 aggtccattc tgagcattag tgacgaggca gccagggcac aagccctgaa cgagcacctc 181 agcacgcgta gctatgtcca ggggtactca cagg cagacgtgga cgcgttcagg 241 cagctctcgg ccccgcccgc ccag ctcttccacg tggctcggtg gttcaggcac 377377 " atagaagcgc tcctgggtag tggc aaaggccagc cctgcaggct ccaagcaagc 36" aaaggccggc gtgtgcagcc ccagtggtcc cctcctgctg ggacccagcc actc 42" cacctttaca acagcctcac caggaacaag gaagtgttca tacctcaaga tgggaaaaag 48" gtgacgtggt attgctgtgg gccaaccgtc tatgacgcat ctcacatggg gcacgccagg 54" tcctacatct cttttgatat cttgagaaga gtgttgaagg attacttcaa atttgatgtc 60" ttttattgca tgaacattac ggatattgat atca gggc ccggcagaac 66" cacctgttcg agcagtatcg ggagaagagg cctgaagcgg cacagctctt ggaggatgtt 72" caggccgccc tgaagccatt ttcagtaaaa ttaaatgaga ccacggatcc cgataaaaag 78" cagatgctcg aacggattca gcacgcagtg cagcttgcca cagagccact agct 84" gtgcagtcca gactcacggg agaggaagtc aacagctgtg tggaggtgtt gctggaagaa 90" gccaaggatt tgctctctga ggat tctacacttg gctgtgatgt cactgacaat 96" tccatcttct ccaagctgcc caagttctgg gagggggact tccacagaga agct "02" ctgaatgttc caga tgtcttaacc cgggttagtg agtatgtgcc agaaattgtg "08" aactttgtcc agaagattgt ggacaacggt tacggctatg tctccaatgg gtctgtctac "l4" tttgatacag cgaagtttgc ttctagcgag aagcactcct atgggaagct ggtgcctgag "20" gccgttggag atcagaaagc agaa ggggaaggtg acctgagcat ctctgcagac "26" cgcctgagtg agaagcgctc tcccaacgac tttgccttat ggaaggcctc taagcccgga "32" tcct ggccgtgccc ttggggaaag ggtcgtccgg gctggcatat cgagtgctcg "38" gccatggcag gcaccctcct aggggcttcg attc acggaggtgg gttcgacctc "44" cggttccccc accatgacaa tgagctggca cagtcggagg cctactttga ctgc "50" tgggtcaggt acttcctgca cacaggccac ctgaccattg caggctgcaa aatgtcaaag "56" tcactaaaaa acttcatcac cattaaagat gccttgaaaa agcactcagc gttg "62" gcct tcctcatgca gaag gacaccctgg actactccag caacaccatg "68" gagtcagcgc ttcaatatga gaagttcttg aatgagtttt atgt gaaagatatc "74" cttcgcgctc ctgttgacat cactggtcag tttgagaagt ggggagaaga agaa "80" ctgaataaga acttttatga caagaagaca gcaattcaca aagccctctg tgacaatgtt "86" gacacccgca ccgtcatgga agagatgcgg gccttggtca gtcagtgcaa tatg "92" gcagcccgga aagccgtgag gaagaggccc aaccaggctc tgctggagaa catcgccctg "98" tacctcaccc atatgctgaa gatctttggg gccgtagaag aggacagctc cctgggattc 204" ccggtcggag ggcctggaac cagcctcagt ctcgaggcca cagtcatgcc ctaccttcag 378378 210" gtgttatcag aattccgaga aggagtgcgg aagattgccc gagagcaaaa agtccctgag 216" attctgcagc tcagcgatgc cctgcgggac aacatcctgc ccgagcttgg ggtgcggttt 222" gaagaccacg aaggactgcc cacagtggtg aaactggtag acagaaacac aaaa 228" gagagagaag aaaagagacg ggttgaagag gagaagagga agaagaaaga ggaggcggcc 234" cggaggaaac aaga agcagcaaag ctggccaaga tgaagattcc ccccagtgag 240" atgttcttgt cagaaaccga caaatactcc aagtttgatg aaaatggtct gcccacacat 246" gacatggagg gcaaagagct cagcaaaggg aaga agctgaagaa gctcttcgag 252" gctcaggaga agctctacaa ggaatatctg cagatggccc agaatggaag gtga 258" gggggcacag gactgacttt ttaaaccatt gtggactagt ggctgctgtc tgcctcagtg 264" acaatgtccc agcgctccta tcatgtttac agtcaccctt gggtcctaaa ttaagagttg 270" tgttcatgta ggttcgtgtc gtcgttggct ctgagacatt gataataaat ttttctcaac 276" agtgagaccc tcaaaaaaaa aaaaaaaaaa aaaaaaaaa Protein sequence (variant 3): NCBI Reference Seguence: NP_001014437.1 LOCUS N P_001014437 ACCESSION NP_001014437 l madssgqqap dyrsilsisd alne hlstrsyvqg yslsqadvda frqlsappad 6" arwf rhieallgsp cgkgqpcrlq askgrrvqpq wsppagtqpc rlhlynsltr l2" nkevfipng kkvtwyccgp tvydashmgh arsyisfdil yfkf dvfycmnitd 18" iddkiikrar qnhlfeqyre krpeaaqlle dvqaalkpfs vklnettdpd kkqmleriqh 24" avqlateple ltge evll eeakdllsdw ldstlgcdvt dnsifsklpk " fwegdfhrdm ealnvlppdv ltrvseyvpe ivd sngs vyfdtakfas 36" gklv peavgqual qegegdlsis adrlsekrsp ndfalwkask pgepswpcpw 42" gkgrpgwhie csamagtllg asmdihgggf dlrfphhdne laqseayfen flht 48" ghltiagckm skslknfiti kdalkkhsar qlrlaflmhs wkdtldyssn tmesalqyek 54" flnefflnvk dilrapvdit gqfekwgeee aelnknfydk ktaihkalcd nvdtrtvmee 60" mralvsqcnl ymaarkavrk rpnqalleni mlki fgaveedssl gfpvggpgts 66" lsleatvmpy lqvlsefreg vrkiarequ peilqlsdal rdnilpelgv glpt 72" vvklvdrntl lkereekrrv eeekrkkkee aarrkqeqea aklakmkipp semflsetdk 78" yskfdenglp thdmegkels kgqakklkkl lyke qumaqngsf q 379379 Nucleotide seguence nt 5): NCBI Reference Seguence: NM_001194997.1 LOCUS NM_001194997 ACCESSION NM_001194997 l gtggggcgcg acttccgggg cggcggttgc atcagattct tgtc tgtagccgca 6" ggtc cgggattccc agccatggca tccg ggcagcaggc tcctgactac 12" aggtccattc tgagcattag tgacgaggca gccagggcac aagccctgaa cgagcacctc 18" agcacgcgta gctatgtcca ggggtactca ctgtcccagg cagacgtgga cgcgttcagg 24" cagctctcgg ccccgcccgc tgacccccag ctcttccacg tggctcggtg gttcaggcac " atagaagcgc tcctgggtag cccctgtggc aaaggccagc cctgcaggct aagc 36" aaaggccggc gtgtgcagcc ccagtggtcc cctcctgctg ggacccagcc atgcagactc 42" cacctttaca acagcctcac caggaacaag gaagtgttca tacctcaaga tgggaaaaag 48" gtgacgtggt attgctgtgg gccaaccgtc tatgacgcat ctcacatggg gcacgccagg 54" tcctacatct cttttgatat cttgagaaga gtgttgaagg attacttcaa atttgatgtc 60" ttttattgca tgaacattac ggatattgat gacaagatca tcaagagggc ccggcagaac 66" cacctgttcg agcagtatcg ggagaagagg cctgaagcgg cacagctctt ggaggatgtt 72" caggccgccc tgaagccatt ttcagtaaaa ttaaatgaga ccacggatcc cgataaaaag 78" cagatgctcg aacggattca gcacgcagtg cagcttgcca cagagccact tgagaaagct 84" gtgcagtcca cggg agaggaagtc aacagctgtg tggaggtgtt gctggaagaa 90" gccaaggatt ctga ctggctggat tctacacttg atgt cactgacaat 96" tccatcttct ccaagctgcc caagttctgg gagggggact tccacagaga agct "O2" ctgaatgttc tccctccaga tgtcttaacc cgggttagtg agtatgtgcc agaaattgtg "08" aactttgtcc agaagattgt ggacaacggt tacggctatg tctccaatgg gtctgtctac "l4" tttgatacag cgaagtttgc ttctagcgag tcct atgggaagct ggtgcctgag "20" ggag atcagaaagc ccttcaagaa ggggaaggtg acctgagcat ctctgcagac "26" cgcctgagtg agaagcgctc tcccaacgac tttgccttat ggaaggcctc taagcccgga "32" gaaccgtcct ggccgtgccc aaag ccgg gctggcatat cgagtgctcg "38" gccatggcag gcaccctcct aggggcttcg atggacattc acggaggtgg gttcgacctc "44" cggttccccc accatgacaa tgagctggca cagtcggagg ttga aaacgactgc "50" tgggtcaggt acttcctgca ccac ctgaccattg caggctgcaa aatgtcaaag 380380 "56" aaaa acttcatcac cattaaagat aaaa agcactcagc acggcagttg "62" gcct tcctcatgca ctcgtggaag gacaccctgg actactccag caacaccatg "68" gagtcagcgc ttcaatatga gaagttcttg aatgagtttt tcttaaatgt gaaagatatc "74" cttcgcgctc ctgttgacat cactggtcag tttgagaagt aaga agaagcagaa "80" ctgaataaga acttttatga caagaagaca caca aagccctctg tgacaatgtt "86" gacacccgca ccgtcatgga agagatgcgg gccttggtca gtcagtgcaa tatg "92" gcagcccgga aagccgtgag gaagaggccc aaccaggctc tgctggagaa catcgccctg "98" tacctcaccc atatgctgaa gatctttggg gccgtagaag aggacagctc cctgggattc 204" ccggtcggag ggcctggaac cagcctcagt ctcgaggcca cagtcatgcc tcag 210" gtgttatcag aattccgaga aggagtgcgg aagattgccc gagagcaaaa agtccctgag 216" attctgcagc tcagcgatgc cctgcgggac aacatcctgc ccgagcttgg ggtgcggttt 222" gaagaccacg tgcc cacagtggtg aaactggtag acagaaacac cttattaaaa 228" gagagagaag aaaagagacg ggttgaagag agga agaagaaaga ggaggcggcc 234" cggaggaaac aggaacaaga agcagcaaag ctggccaaga tgaagattcc ccccagtgag 240" atgttcttgt cagaaaccga caaatactcc aagtttgatg aaaatgtaag catggtctgc 246" ccacacatga catggagggc aaagagctca gcaaagggca agccaagaag ctgaagaagc 252" tcttcgaggc tcaggagaag ctctacaagg aatatctgca gatggcccag aatggaagct 258" tccagtgagg gggcacagga ctgacttttt aaaccattgt ggactagtgg ctgctgtctg 264" tgac aatgtcccag cgctcctatc atgtttacag tcacccttgg gtcctaaatt 270" aagagttgtg ttcatgtagg ttcgtgtcgt cgttggctct gagacattga taataaattt 2761 ttctcaacag cctc aaaaaaaaaa aaaaaaaaaa aaaaaaa n seguence (variant 5): NCBI Reference Seguence: NP_001181926.1 LOCUS 181926 ACCESSION NP_001181926 1 madssgqqap dyrsilsisd eaaraqalne hlstrsyvqg yslsqadvda frqlsappad 61 pqlfhvarwf rhieallgsp cgkgqpcrlq askgrrvqpq wsppagtqpc rlhlynsltr 121 nkevfipng ccgp tvydashmgh arsyisfdil rrvlkdyfkf dvfycmnitd 181 iddkiikrar qnhlfeqyre krpeaaqlle dvqaalkpfs vklnettdpd riqh 241 avqlateple kavqsrltge evll eeakdllsdw ldstlgcdvt dnsifsklpk 381381 " fwegdfhrdm ealnvlppdv ltrvseyvpe ivnqukivd sngs vyfdtakfas 36" sekhsygklv peavgqual qegegdlsis adrlsekrsp ndfalwkask pgepswpcpw 42" gkgrpgwhie csamagtllg asmdihgggf dlrfphhdne yfen dcwvryflht 48" ghltiagckm skslknfiti hsar qlrlaflmhs wkdtldyssn tmesalqyek 54" flnefflnvk dilrapvdit gqfekwgeee aelnknfydk ktaihkalcd nvdtrtvmee 60" mralvsqcnl ymaarkavrk rpnqalleni alylthmlki fgaveedssl gfpvggpgts 66" lsleatvmpy freg vrkiarequ peilqlsdal rdnilpelgv rfedheglpt 72" vvklvdrntl lkereekrrv eeekrkkkee aarrkqeqea aklakmkipp semflsetdk 78" nvsm vcphmtwrak ssakgkprs DDX1 Official Symbol: DDX1 Official Name: DEAD (Asp-Glu-Ala-Asp) box helicase 1 Gene ID: 1653 Organism: Homo s Other Aliases: DBP-RB, UKVH5d Other Designations: ATP-dependent RNA helicase DDX1; DEAD (Asp-Glu-Ala- Asp) box polypeptide 1; DEAD box ptide 1 ; DEAD box protein 1; DEAD box protein retinoblastoma; DEAD box-1; DEAD/H (Asp-Glu-Ala-Asp/His) box ptide 1 Nucleotide seguence: NCBI Reference Seguence: NM_004939.2 LOCUS NM_004939 ACCESSION NM_004939 1 ctaatcacca aacatctgct tccttctctg tagctgtgac cctgataccg Cgtggtgtgc 61 tccgaacaca tggtgcccag aacgaaggcg gcgtccagaa gccctaggtc ccagaggtcc 121 gctcagcggc aggcgcataa ggcggggccg gcgcgggcct ccat cggaaccgtt 181 ctcccggggc tgagtccctg cccggactcc cgaa gaccaggggc cggaagcgcg 241 cgccgccact gccacgccgt gtcagtcggg ggag cgagcaggcg aagccgcgga 301 ggacggggtg aagatggcgg ccttctccga gatgggtgta atgcctgaga ttgcacaagc 382382 36" tgtggaagag tggc tcctcccaac tgatatccag gctgaatcta tgat 42" cttaggagga ggtgatgtac ttatggctgc agaaacagga agtggcaaaa ctggtgcttt 48" tcca gttatccaga tagtttatga aactctgaaa gaccaacagg aaggcaaaaa 54" aggaaaaaca acaattaaaa ctggtgcttc agtgctgaac aaatggcaga tgaacccata 60" tgacagagga tctgcttttg caattgggtc agatggtctt tgttgtcaaa gcagagaagt 66" aaaggaatgg catgggtgta gagctactaa aggattaatg aaagggaaac actactatga 72" agtatcctgt catgaccaag gcag ggtcgggtgg tctaccatgc aggcctcttt 78" aggt actgacaagt ttggatttgg tgga acaggaaaga aatcccataa 84" caaacaattt gataattatg gagaggaatt cactatgcat gataccattg gatgttacct 90" ggatatagat catg tcaagttctc caaaaatgga aaagatcttg gtctggcatt 96" tgaaatacca ccacatatga aaaaccaagc cctctttcct gcctgtgttt tgaagaatgc "02" tgaactgaaa tttaacttcg gtgaagagga gttt aaag atggctttgt "08" tgctctttcc aaggcaccgg atggttacat tgtcaaatca cagcactcag gtaatgcaca "14" ggtgacacaa acaaagtttc tccccaatgc tccgaaagct ctcattgttg aaccttcccg "20" ggagttagct gaacaaactt tgaacaacat caagcagttt aagaaataca ttgataatcc "26" taaattaagg ctga taattggagg tgttgcagcc cgggatcagc tctctgtttt "32" tgga gtagatatag ttgtaggtac tccgggaaga ctagatgact tggtgtcaac "38" tggaaagctg aacttatctc aagttagatt cctggtcctg gatgaagctg atgggcttct "44" ttctcaaggt tattctgatt ttataaatag gatgcacaat cagattcctc aggttacctc "50" tgatggaaaa agacttcagg tgattgtttg ctctgccact ttgcattctt tcgatgtaaa "56" gtcc gagaagataa tgcattttcc tacatgggtt gacttaaaag gagaagactc "62" tgttccagat actgtacacc atgttgttgt aaat cccaaaactg acagactctg "68" ggaaaggctt ggaaagagcc acattagaac tgatgatgta catgcaaaag ataacacaag "74" acctggtgct aatagtccag agatgtggtc tgaagctatt aaaatcctga aaggggagta "80" ccgg gcaatcaagg aacataagat ggatcaagca attatcttct gtagaaccaa "86" aattgactgt gataacttgg agcagtactt acaa ggaggaggac ctgataaaaa "92" aggacaccag ttctcatgtg tttgtcttca tggtgacaga aagcctcatg agagaaagca "98" aaacttggaa agatttaaga aaggagatgt aagattcttg atttgcacag atgtagctgc 204" aatt gatatccacg gtgttcctta tgttataaat gtcactctgc aaaa 210" gcaaaactac gtacatcgaa ttggcagagt aggaagagct gaaaggatgg gtctggcaat 383383 216" ttccctggtg gcaacagaaa aagaaaaggt ttggtaccat gtatgtagca gccgtggaaa 222" agggtgttat aacacaagac tcaaggaaga ctgt accatatggt acaacgagat 228" gcagttacta atag aagaacacct gaactgtacc atttctcagg ttgagccgga 234" tataaaggta ccagtggatg aatttgatgg gaaagttacc tacggtcaga aaagggctgc 240" tggtggtgga agctataaag gccatgtgga tattttggca cctactgttc aagagttggc 246" tgcccttgaa aaggaggcgc agacatcttt cctgcatctt ggctaccttc ctaaccagct 252" aacc tttt tacatttact gaataagatt tgagtaatga aagtctgtag 258" tcttaaaact ctaaaacagt tgtactgctt ccaagcagca gtatttatag taacgtaagc 264" tattaatgct tgca tgtcaagaaa cattagtctt aggaattctt caaaaaatgg 2701 catcccaatg aatt tgatgactat attttcatga aaaaaaaaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_004930.1 LOCUS NP_004930 ACCESSION NP_004930 l maafsemgvm peiaqaveem dwllptdiqa esiplilggg dvlmaaetgs gktgafsipv 6" iqivyetlkd qqegkkgktt iktgasvlnk wqmnpydrgs afaigsdglc cqsrevkewh 12" glmk gkhyyevsch dqglcrvgws tmqasldlgt dkfgfgfggt gkkshnkqfd 18" nygeeftmhd tigcyldidk ghkaskngk dlglafeipp lfpa cvlknaelkf 24" nfgeeefkfp pkdgfvalsk apdgyivksq vtqt kflpnapkal ivepsrelae " qtlnnikqfk kyidnpklre lliiggvaar dqlsvlengv divvgtpgrl ddlvstgkLn 36" lsquflvld eadgllsqu sdfinrmhnq ipqvtsdgkr lqvivcsatl hsfdvkklse 42" kimhfptwvd lkgedsvpdt vhhvvvpvnp ktdrlwerlg kshirtddvh pgan 48" spemwseaik ilkgeyavra ikehkmdqai ifcrtkidcd nleqyfiqqg ggpdkkghqf 54" scvclhgdrk pherkqnler fkkgdvrfli ctdvaargid vinv tlpdekqnyv 60" hrigrvgrae rmglaislva tekekvahv cssrgkgcyn ggct iwynemqlLs 66" eieehlncti sqvepdikvp vdefdgkvty quraagggs ykghvdilap alek 72" eaqtsflhlg ylpnqlfrtf Official Symbol: CCDC22 384384 OfimmlName:mfibdcofldomaMcomaMMQ22 Gene ID: 28952 Organism: Homo sapiens Other Aliases: JM1, CXori37 Other Designations: coiled-coil domain-containing protein 22 tide seguence: NCBI Reference Seguence: NM_014008.3 LOCUS NM_014008 ACCESSION NM_014008 l ctcacatccg gcatgcgccg tgctcgctca cagaactaca ctttccaact ctccccacac 6" gacccgtgac actctgtgga ccgcgagcac ggagcagggt ttctacagct gctccccact 12" ttctcggacc cggtcctgga cccagccccc gactccgaca cggctccacc atggaggagg 18" cggaccgaat cctcatccat cgcc aggccggcac ggcagttcct ccagatgtgc 24" agaccttgcg cgccttcacc actgagctgg ttgtagaggc tgtggtccgc tgcctgcgtg " tgatcaaccc tgcggtgggc ctca gccctctgct tgcc atgtctgccc 36" ggttccgcct ggccatgagc ctggctcagg cctgcatgga cctgggctat cccttggagc 42" ttggctatca gaacttcctc taccccagtg agcctgacct ccgagacctg cttctcttct 48" tggctgagcg tctgcccacc gatgcctctg aggatgcaga ccagcctgca ggtgactcag 54" ctattctcct ccgggccatt gggagccaaa ttcgggacca gctggcactg ccttgggtcc 60" cgccccacct tcgcactccc aagctgcagc acctccaggg cctc cagaagcctt 66" tccatgccag caggctggtc gtgccagaat tgagttccag aggtgagcca cgggagttcc 72" aggcgagtcc cctgctgctt ccagtcccta tgcc tcagcctgtt gtgg 78.. cctcgctcct cgaacaccat gccctgcagc tctgccagca gacgggccgg gaccggccag 84" gggatgagga ccac cggacatccc gcctcccacc ccaggaggac acacgggctc 90" agcggcagcg gctgcagaag actg agcatctgcg ccaaagctgg cttg 96" gggcccccat acaagcccgg gacctgggag aactgctgca ggcctggggt gctggggcca 102" agactggtgc tcctaagggc tcccgcttca cgcactcaga gaagttcacc ttccatctgg 108" agccccaggc ccaggccact caggtgtcag atgtgccagc cacctcccgg cggcctgaac 114" aggtcacgtg ggcagctcag gaacaggagc ccct gcag ctggaaggag 385385 "20" tgaaccgcag ggag gttgaggccg acatgaagac cctgggcgtc agctttgtgc "26" aggcagagtc tgagtgccgg cacagcaagc tcagtacagc agagcgtgag caggccctgc "32" gcctgaagag ccgcgcggtg gagctgctgc ccgatgggac tgccaacctt gccaagctgc "38" agcttgtggt ggagaatagt gcccagcggg tcatccactt ggcgggtcag tgggagaagc "44" accgggtccc actcctcgct gagtaccgcc acctccgaaa gctgcaggat tgcagagagc "50" cttc tcgacggctg gcagagatcc aagaactgca ccagagtgtc nggcggctg "56" ctgaagaggc ccgcaggaag gaggaggtct ataagcagct agag ctggagactc "62" gaga tgtgtcccgg taca cccagcgcat cctggagatc gtgggcaaca "68" tccggaagca gaaggaagag atcaccaaga tcttgtctga tacgaaggag cttcagaagg "74" aaatcaactc cctatctggg gacc ggacgtttgc ggtgactgat gagcttgtgt "80" tcaaggatgc caagaaggac gatgctgttc ggaaggccta taagtatcta gctgctctgc "86" acgagaactg cagccagctc atccagacca tcgaggacac aggcaccatc atgcgggagg "92" ttcgagacct cgaggagcag atcgagacag gcaa gaagaccctc agcaacctgg "98" agaagatccg ggaggactac cgagccctcc gccaggagaa cgctggcctc ctaggccggg 204" tccgggaggc ctgaggagcc gccggcagag gtctctcccc agcctcaggc agggatttgg 210" ggtgctggag gcagtggcca agcacatgcc actt cctccgctgt ccagttcctc 216" ctgctgcggc cttggaccca gacccctgcc cactgaccgc aacccttata tggggtgata 222" gtccagcatg tggggagctc ggctgcagtt tattggggac ggtactgtgg gttgggggcc 2281 ttggatccca tgag tagttcctct gcagtctaaa aaaa aaa Protein seguence: NCBI Reference Seguence: 727.1 LOCUS NP_054727 ACCESSION NP_054727 l meeadrilih squagtavp pdvqtlraft telvveavvr clrvinpavg sglspllpLa 6" lams laqacmdlgy plelqunfl ypsepdlrdl llflaerlpt dasedadqpa 12" gdsaillrai gsqirdqlal pvaphlrtp quhlqgsal qkpfhasrlv vpelssrgep 18" refqasplll prthpqpv grvasllehh qtgr drpgdedwvh rtsrlppqed 24" traqrquqk qltehqusw gllgapiqar dlgellqawg agaktgapkg srfthsekft " fhlepqaqat tsr rpeqvtwaaq eqeleslreq legvnrsiee veadmktlgv 36" squaesecr hsklstaere qalrlksrav ellpdgtanl ens aqrvihlagq 386386 421 wekhrvplla qud crelessrrl aeiqelhqsv raaaeearrk eevykqlmse 481 letlprdvsr laytqrilei vgnirquee itkilsdtke lqkeinslsg avtd 541 elvfkdakkd davrkaykyl aalhencsql iqtiedtgti mrevrdleeq ietelgkktl 601 snlekiredy raquenagl lgrvrea CLIC4 Official Symbol: CLIC4 l Name: chloride intracellular channel 4 m:25932 Organism: Homo s Other Aliases: CLIC4L, H1, MTCLIC, huH1, p64H1 Other Designations: de intracellular channel 4 like; chloride intracellular channel protein 4; intracellular chloride ion channel protein p64H1 Nucleotide sauence: NCBI Reference Sequence: 943.2 LOCUS NM_O13943 ION NM_013943 1 ttattttccc cggagagtcc cgaggcgccg ggcc ctgcctacag cccgaggccc 61 cgcccccggc gcccctccca gccgtttgaa gcggctcggg ctgcggctgg ctcagagtgg 1 2 1 cgcggggggc gtggggcggt gctgaggagc tgaagccgtg gccagctcga cgccggacag 181 tccagcgagc agcacggcgg gaaccggcag ccggagcagt cccggagcag aagcagcagc 241 agcagcagca gccctcgccg gagc gcagccgagc cggccatggc gttgtcgatg 301 ccgctgaatg ggctgaagga ggaggacaaa gagcccctca tcgagctctt cgtcaaggct 361 ggcagtgatg gtgaaagcat aggaaactgc cccttttccc agaggctctt catgattctt 421 aaag gagttgtatt tagtgtgacg actgttgacc tgaaaaggaa gccagcagac 481 ctgcagaact tggctcccgg gacccaccca ccatttataa acag tgaagtcaaa 541 acggatgtaa ataagattga ggaatttctt gaagaagtct tatgccctcc caagtactta 601 aagctttcac caaaacaccc agaatcaaat actgctggaa tctt tgccaaattc 661 tctgcatata tcaagaattc aaggccagag gctaatgaag cactggagag gggtctcctg 721 aaaaccctgc agaaactgga tgaatatctg cctc tccctgatga aattgatgaa 387387 78" atgg aggacataaa gttttctaca cgtaaatttc tggatggcaa tgaaatgaca 84" ttagctgatt gcaacctgct gcccaaactg catattgtca aggtggtggc caaaaaatat 90" cgcaactttg atattccaaa agaaatgact ggcatctgga gatacctaac atac 96" agtagggacg agttcaccaa tacctgtccc agtgataagg aggttgaaat agcatatagt "02" gatgtagcca tcac caagtaaaat cgcgtttgta aaagagatgt cttcatgtct "08" tcccctaaga atacgctttt cctaacaggc tactccttcc tgtagagcag aaattgtatt "14" ttgcacgaac atgcagttat tgaagattag gatcaaggat agacaaggta tagtagttat "20" cttaaaatat ctaa gcagtattat tttaaaatcc tttaccc:gg ctacctcccc "26" tacccgggtt cccctctctt taatttggag acactccacc acaaact:tt cactttagag "32" gtagcttgcc atctctcagg agccctcacc attgtgtcca ttcactg:gt atagatggca "38" gaacttttga atgt ttaattgtta aaaatagtag ccacgac:tt atcaggcagc "44" cccaaactgg tgcataatgc atggtacaag ttat gtatttt:tg gaattttgta "50" atatttagta agagtatatg aaaggattgc tactgtatca gaaatat:gt ttcaatttag "56" tctatcctgg atatgtacta atta ccaccagaga agagagc:tt ctacaaaagt "62" cactacagat tttgctatat tgctttgtag atagattttt acttttgcct attt "68" atccttcata ccaattgtaa catctgacac catgtagaag ctaaaagttt agagggagtg "74" ttct caagaccttc ctcaagcatt ttatctttag aagagaaact gatgggcacc "80" tgatactctg tctaaatacg tttgttatat gtgttttgcc ctgtgccatt catttggaac "86" tttattgcat tctttatttt aaaaagcttg tttttacgta atcatagagc ttgctatttg "92" tacatctgtt gagcaacact acataactga tttttagttg acttagctat agcagtacaa "98" tgattagtaa tgtaaaaatt aacacagaaa taag gaatgaaggg tgggtttgtc 204" aaaatatcaa tttt gtttctaaag tacatttaat gtagatgacc taaagaatgc 210" catc aaaa gaaagataaa acacaggtca ccaattttct catttcaccc 216" catttacctt agga ttgttcattc ctttgggact aagttatagt tatggtgagt 222" gtgtatttac tgtagttttg cctgatctca ctcattgcac ttcctggagt taaattttcc 228" aacagccatg ttgaggaata gcactctgca tgtttttgtt ttgtttttcg gggttttttt 234" taattgaagc cctaaaccag gaattatttg aaca ggaggatgaa cttgctgaaa 240" ataaaacttt gctatgtatt tactcttttt taaaagacaa aagcaaaacc agactttcta 246" cgtactactc caaagactgt gact ataatacatt tttggtaatt tttttatacc 252" taatttgtat aggaagtgct atttctcata ttct tgaaatttta agtttattgc 388388 258" tttaaaatgg cagtgtttct ttga tatgctaaca tttagtaagc actggcttta 264" tgaaagcggc tttttataag tatactgcat tttttgagcc tatcattaat agta 270" tgaaagataa gaaaatctcc atgttgtatc catttggctc aggaagattc tttgccttac 276" ctttcttaga actctttatt caaa agtttgagta cccgcttggt ttttttttgg 282" taattaaata ttgtatgatt tatctggttc aaggaagatg cactattcag ttatctattg 288" agaaattatt ttgcagtggt tttagtgggt gaaaatgtcc catctgcacc agtacacagg 294" caggcattat tcac ctacttttta aatagtggca acttgggatt cattctggtg 300" aacc ttgcctcata gcttaaagta taaaaaagat tcaagagcag tgaggtttgt 306" tctttccagt gaatggtgga ctgagtggtg Cgaggtggag ggctaacaag aggaaagaac 312" tacattcttc agaatacagt gatgaaaatt cattttgaaa ctcaaatatt ttcattttgg 318" atattctcct gtttttatta aaccagtgat tacacctggc catccctcta aatgttctag 324" atgt ctattgtgat tttgatgaag acagaattat ctgt agaaacacag 330" ataccacttt atcagggaag ttagtcaaat gaaatggaaa ttggtaaatg gacaaaagct 336" agctagtaaa aaggacgacc cagcaacatg ctttaacccc attgtatgtt aaga 342" ttta acatcttgag aaatttggga cataaagttt tcatggtaga cagttcatgc 348" agtatatgaa taat ggaaataatc tgattttatt tttacaacta acatccattc 354" cccttcattt aaacaccttt tgtgttttac ttcagtgagg agattggagt ctgaatggat 360" ctgttttcca agagattctg agaaattttt gtattcagca gttggaaagc tctctattct 366" agttgataaa acttcccttt gtag atat tctatacagt tctgttgtct 372" tttactagga ctgtaaactt ttgtgataaa attcaaataa gattttattt cttggtaatt 378" ttggctttca caatttatct ttaaatcctt gagcaatctg tatacaatta agagatttct 384" gacatttatt cttacactaa atggatcaac tctaggattt aggcatgtta acttctgttg 390" tgttttgaat ctctccagag ttgcatgtag ttta tttctgtgcc cttaaaccca 396" tttagaaaat aactacaaag taaaaatgta gaggaaatag aaatgtattt gaac 402" attttgatac aaatttcatc atttaatgat tcaccaattt cttgcattaa tttgaattta 408" taat tcaaagagag gggagcatcc attattgata catgtgggct tttaaaaact 414" ccatccttta gtca aggtttgggc cacacaaagt atatttttat catggaaaaa 420" tttcaactcc tcaagccgta atgttgaaca gaattggagt attttcttta taatttcttg 426" aacaggcaaa ctta ttatagaatg catgtatttt cttttatctt tggaacatca 432" gcaccagtat attgctggca gctattgtat taaaaaataa agtatatttt cactatcata 389389 4381 aaggattctt ttttcccccc tcatgaaaat aaacaacaac ttggggtaaa agtgaaaaaa 4441 aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_039234.1 LOCUS N P_039234 ACCESSION NP_039234 1 malsmplngl keedkeplie lkaagsdge fsqr lfmilwlkgv vfsvttvdlk 61 rkpadlqnla pgthppfitf nsevktdvnk ieefleevlc ppkylklspk hpesntagmd 121 ifakfsayik nsrpeaneal thk ldeylnsplp deidensmed ikfstrkfld 181 gnemtladcn llpklhivkv vakkyrnfdi pkemtgiwry ltnaysrdef tntcpsdkev 241 eiaysdvakr ltk Official Symbol: DLD Official Name: olipoamide ogenase M1 1738 Organism: Homo sapiens Other Aliases: tcag7.39, DLDH, E3, GCSL, LAD, PHE3 Other Designations: E3 component of pyruvate dehydrogenase complex, 2—oxo- glutarate x, branched chain keto acid ogenase complex; rase; dihydrolipoyl dehydrogenase, mitochondrial; glycine cleavage system L protein; glycine cleavage system protein L; ide dehydrogenase; lipoamide reductase; lipoyl dehydrogenase Nucleotide seguence: NCBI nce Seguence: NM_000108.3 m NM_000108 ACCESSION NM_000108 1 gatgacgtag gctgcgcctg tgcatgcgca gggaggggag accttggcgg agcggcggag 61 gcgcccagcg gaggtgaaag tattggcgga aaggaaaata cagcggaaaa atgcagagct 121 ggagtcgtgt gtactgctcc ttggccaaga gaggccattt caatcgaata tctcatggcc 390390 18" tacagggact ttctgcagtg cctctgagaa cttacgcaga tcagccgatt gatgctgatg 24" taacagttat tggt cctggaggat atgttgctgc agct gcccagttag " gcttcaagac agtctgcatt gagaaaaatg ttgg tggaacatgc ttgaatgttg 36" gttgtattcc ttc:aaggct ttattgaaca actctcatta ttaccatatg gcccatggaa 42" aagattttgc atc:agagga atgt ccgaagttcg cttgaattta gacaagatga 48" tggagcagaa gag:actgca gtaaaagctt taacaggtgg aattgcccac ttattcaaac 54" agaataaggt :catgtc aatggatatg gaaagataac tggcaaaaat caagtcactg 60" ctacgaaagc tga:ggcggc actcaggtta ttgatacaaa gaacattctt atagccacgg 66" aagt tttt cctggaatca cgatagatga aga:acaata gtgtcatcta 72" caggtgcttt atc:ttaaaa aaagttccag aaaagatggt tgt:attggt gcaggagtaa 78" taggtgtaga attgggttca gtttggcaaa gacttggtgc aga:gtgaca gcagttgaat 84" ttttaggtca tgtaggtgga gttggaattg atatggagat atc:aaaaac cgca 90" tccttcaaaa acaggggttt aaatttaaat caaa ggt:actggt gctaccaaga 96" agtcagatgg aaaaattgat gtttctattg aagctgcttc tgg:ggtaaa gctgaagtta "02" tcacttgtga tgtactcttg gtttgcattg gccgacgacc ctt:actaag aatttgggac "08" tagaagagct gggaattgaa ctagatccca gaggtagaat caat accagatttc "l4" aaactaaaat tatc tatgccattg tagt tgctggtcca atgctggctc "20" acaaagcaga ggatgaaggc attatctgtg ttgaaggaat ggctggtggt gctgtgcaca "26" ttgactacaa gcca tcagtgattt acacacaccc tgaagttgct tgggttggca "32" aatcagaaga gcagttgaaa ggta ttgagtacaa agttgggaaa ttcccatttg "38" ctgctaacag cagagctaag acaaatgctg acacagatgg catggtgaag atccttgggc "44" cgac agacagagta ctgggagcac atattcttgg accaggtgct ggagaaatgg "50" taaatgaagc tgctcttgct ttggaatatg gagcatcctg tgaagatata gctagagtct "56" gtcatgcaca tccgacctta tcagaagctt ttagagaagc aaatcttgct gcgtcatttg "62" gcaaatcaat ttga agat tatatatatt tttttctgaa atttcctggg "68" agcttttgta gaagtcacat tcctgaacag gatattctca cagctccaag aatttctagg "74" actgaattat gaaacttttg gaaggtattt aataggtttg gacaaaatgg aatactctta "80" tatctatatt ttacataaat ttagtatttt gtttcagtgc actaatgtgt aagacaaaaa "86" gctacttatt gtagcatcct ggaatatctc cgtcaactca tattttcatg ctgttcatga "92" caat gcccctgaat ttaaatagct tttttctctg atacagaaaa gttgaatttt 391391 198" acatggctgg agctagaatt tgatatgtga acagttgtgt ttgaagcaca gtgatcaagt 204" tatttttaat ttggttttca cattggaaac aagtcagtca ttcagatatg attcaaatgt 210" ctataaaccg aactgatgta agtaaacggt ctctcacttg ttttatttaa cctctaaatt 216" ctttcatttt aggggtagca :ttgtgttga agaggtttta ccat tgttgtctgc 222" aactctgaag ggtaattata :agttaccca aattaagaga gtctatttac caaa 228" ggca ttcaaatgta :tacagtggg gaatgaagat actgaaataa acgtcttaaa 234" ttta ctggttatca :gagtacgtg ttgagatggt catagt:ttt tttatgacta 240" cttctagtgt atattctaat :tcttttcta ggcctgaatg tatctt:att ttat 246" aggacaatat taaggcattt tcat catcctttca tctatt:tag atacacctac 252" taaatgttta atatatactt :tggagaagt acaacataag ggagtc:tta atctgtgttt 258" tccttggctg ggttaatgac :gtttattta aagagtgttg taaaat :gga gtgt 264" ttaaaatggc catgtcctga ggaaacttaa gtaacaaagt actaaa:gct aagtaggctt 270" ttgcatattg taactaaatt taagaataat tcagattaag tagttc:gaa atttggtata 276" atag attgtctcat gctcatgagt gacataatga ccctggattc tgttacatac 282" ttctaaagaa aattgattgt agga ggcagttaac ttggctgaac tcca 288" cactctgtct tgtttgtagg tggcagcagc tgaaatctct tctcagttgt tttagcttta 294" gctatgctgc tggaagtctt tcccatgcaa gtgtgtagtt caggggtcaa ccagagtttg 300" ggcagaagga agtctgcccc ttctgtgcct cctgtttttt gggggtttcc cctttatgtt 306" gttg tggttgcccc atattctgcc ttctgatcct taaccaataa aacttggctt 312" ttgtttcccc ctcaagtgag aacccgttaa aaatgagaca ttgagccagt gctgttcact 318" ttttaagtgc caacttccct ctactttcca cttgtttata gttgtttcca gtgcctttag 324" ttttttctaa aatatatttg ttcagagttt gcta ggag ggttggtctg 330" atatctgtgt gctactttgc cattattgga agtgaactct gcatcttttt aaaaatttga 336" aatcccggta tcatgtgaag tgctgtttat gtaaatctca acatatccct tactcaggga 342" aaaaaaagtt tttagttagg gaatagtgaa atataattta atatggaatt ctagctgtag 348" agttaaatcc atctttaagt gtttacattc agtatgagaa ttta tctgtatggg 3541 gaataaagtc ctaggaataa aacaagtttt aagtgttca Protein seguence: NCBI Reference Seguence: 099.2 LOCUS NP_000099 392392 ACCESSION NP_000099 1 mqswsrvycs fnri lsav plrtyadqpi dadvtvigsg pggyvaaika 6" aqlgfktvci eknetlggtc lnvgcipska llnnshyyhm ahgkdfasrg iemsevrlnl 12" dkmmeqksta vkaltggiah lfkanvvhv ngygkitgkn qvtatkadgg tqvidtknil 18" vtpf pgitidedti vsstgalslk kvpekmvvig agvigvelgs qurlgadvt 24" aveflghvgg vgidmeiskn fqrilqkqgf kfklntkvtg atkksdgkid vsieaasggk " dvll vcigrrpftk lgie ldprgripvn trfqtkipni yaigdvvagp 36" mlahkaedeg iicvegmagg avhidyncvp sviythpeva wvgkseeqlk eegieykvgk 42" srak tnadtdgmvk ilqustdrv lgahilgpga gemvneaala leygascedi 48" arvchahptl seafreanla asfgksinf ATAD3A Official Sym bol: ATAD3A Official Name: ATPase family, AAA domain containing 3A m: 55210 Organism: Homo sapiens Other Aliases: RPS-83202.1 Other Designations: ATPase family AAA domain-containing protein 3A Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_018188.3 LOCUS NM_018188 ACCESSION NM_018188 1 gtgtgtgtgg cgcctgcgca gtggcggtga ccaccggctc cgtg gaggctgctc 61 gcgc gcgagtcaga ct nggtggg ggtcccggcg gcggtagcgg ngcggcggt 121 gcgagcatgt cgtggctctt cggcattaac aagggcccca agggtgaagg cgcggggccg 181 ccgccgcctt tgccgcccgc gcagcccggg gccgagggcg gcggggaccg nggttggga 241 ccgg cgcccaagga caaatggagc aacttcgacc ccaccggcct ggagcgcgcc 301 gccaaggcgg cgcgcgagct ggagcactcg cgttatgcca aggacgccct gaatctggca 393393 36" cagatgcagg cgct gcagttggag tcca agctcaaaat gcggctggaa 42" gccctgagcc tgctgcacac actagtctgg gcatggagtc tctgccgtgc ngagccgtg 48" cagacacagg agcggctgtc aggcagtgcc agccctgagc aagtgccagc gtgc 54" tgtgctctgc aggagtatga ggccgccgtg gagcagctca agagcgagca gatccgggcg 60" caggctgagg agaggaggaa gagc gaggagaccc ggcagcacca ggccagggcc 66" cagtatcaag acaagctggc ccggcagcgc tacgaggacc aactgaagca actt 72" ctcaatgagg agaatttacg gaagcaggag gagtccgtgc agaagcagga agccatgcgg 78" cgagccaccg ggga gatggagctg cggcacaaga atgagatgct gcgagtggag 84" gccgaggccc gggcgcgcgc caaggccgag cgggagaatg cagacatcat gcag 90" atccgcctga aggcggccga gcaccgtcag ttgg agtccatcag gacggctggc 96" accttgtttg gatt ccgtgccttt gtgacagact gggacaaagt gacagccacg "02" gtggctgggc tgct ggctgttggg gtctactcag ccaagaatgc cacgcttgtc "08" gccggccgct tcatcgaggc tcggctgggg aagccgtccc tagtgaggga gacgtcccgc "l4" atcacggtgc ttgaggcgct gcggcacccc atccaggtca gccggcggct cctcagtcga "20" ccccaggacg Cgctggaggg tgttgtgctc agtcccagcc tggaagcacg ggtgcgcgac "26" atcgccatag caacaaggaa caccaagaag aaccgcagcc tgtacaggaa catcctgatg "32" tacgggccac caggcaccgg gaagacgctg tttgccaaga aactcgccct gcactcaggc "38" atggactacg ccatcatgac ggac gtggccccca tggggcggga aggcgtgacc "44" gccatgcaca agctctttga ctgggccaat accagccggc gcggcctcct gctctttgtg "50" gatgaagcgg acgccttcct tcggaagcga gaga agataagcga ggacctcagg "56" gccacactga acgccttcct gtaccgcacg ggccagcaca gcaacaagtt catgctggtc "62" ctggccagca accaaccaga gcagttcgac tgggccatca atgaccgcat caatgagatg "68" gtccacttcg acctgccagg gcaggaggaa cgggagcgcc tggtgagaat gtattttgac "74" aagtatgttc cggc cacagaagga aagcagcgcc tgaagctggc ccagtttgac "80" tacgggagga agtgctcgga ggtcgctcgg ctgacggagg gcatgtcggg ccgggagatc "86" gctcagctgg ccgtgtcctg gcaggccacg gcgtatgcct ccgaggacgg ggtcctgacc "92" gaggccatga tggacacccg cgtgcaagat gctgtccagc agcaccagca gaagatgtgc "98" tggctgaagg cggaagggcc tgggcgtggg ccct ccccatcctg agtccacagg 204" gagatccaca ggag cctggccgcg tccc acccctgcct tgccggcccc 210" tgcacattta ggatatgctc ctgggtgggg actgggctgt gcccagggcc tctgtccccc 394394 216" aggatgtctt gtggtgcggg tcggccgttc tgccccccag ggcaccccct gttgtaggca 222" ctggctaggg aggggcaggc ctccttcctg cccctcgaga cactcttggg agatgcattt 228" tccgtctggc tcacaggggg agggtgaggc tttgcacccc agcccctgcc caggccactg 234" tgagggtggg tgctggctga gCCCCnggg cagcaggagc caggcaggtg atgtctttgt 240" tctcggctcc cacagcagag ccaggtgagg gggcgcctgc cagggccaga cccaggtggg 246" gcagcctgaa ccctgcttcc ccctgtggcc ggcatgcccc gatctttcac acactggtga 252" ccctgagaga ggagggagga gggaacctgg nggggtgtc tgaggccgca ctgtcagctg 258" gccggtccaa ggct ggagctgggg tctgtttacc taataaagtc ccacaggtgc 2641 aaaa aaaaaa Protein seguence (variant 1): NCBI Reference Seguence: 658.3 LOCUS NP_060658 ACCESSION NP_060658 l inkg pkgegagppp uae lgdr papkdkwsnf dptgleraak 6" hsry akdalnlaqm qeqthleqq leal sllhtlvwaw slcragavqt 12" qerlsgsasp eqvpagecca lqeyeaaveq lkseqiraqa eerrktlsee trqhqaraqy 18" qdklarqrye dqlquqlln eenlrkqees vqkqeamrra tveremelrh knemlrveae 24" ararakaere nadiireqir lkaaehrqtv agtl fgegfrafvt dwdkvtatva " gltllavgvy saknatlvag rfiearlgkp slvretsrit vlealrhpiq vsrrllsrpq 36" dalegvvlsp slearvrdia iatrntkknr slyrnilmyg tlfa kklalhsgmd 42" gdva pmgregvtam hklfdwants rrglllfvde adaflrkrat ekisedlrat 48" lnaflyrtgq hsnkfmlvla anpeqfdwa indrinemvh fdlpgqeere rlvrmyfdky 54" vlkpategkq rlklaqfdyg rkcsevarlt egmsgreiaq lavswqatay asedgvltea 60" mmdtrquav qqhqqkmcwl kaegpgrgde psps Nucleotide seguence nt 2): NCBI Reference Seguence: NM_001170535.1 LOCUS NM_001170535 ACCESSION 170535 l gtgtgtgtgg cgcctgcgca gtggcggtga ccaccggctc gcggcgcgtg gaggctgctc 395395 6" ccagccgcgc caga ctcgggtggg ggtcccggcg gcggtagcgg ngcggcggt 12" atgt cgtggctctt taac ccca agggtgaagg gccg 18" ccgccgcctt tgccgcccgc gcagcccggg gccgagggcg gcggggaccg nggttggga 24" gaccggccgg cgcccaagga caaatggagc aacttcgacc ccaccggcct ggagcgcgcc " gccaaggcgg cgcgcgagct ggagcactcg cgttatgcca aggacgccct gaatctggca 36" cagatgcagg agcagacgct gcagttggag caacagtcca agctcaaaga gtatgaggcc 42" gccgtggagc agctcaagag cgagcagatc cagg ctgaggagag gaggaagacc 48" ctgagcgagg agacccggca gcaccaggcc agggcccagt atcaagacaa gctggcccgg 54" cagcgctacg aact gaagcagcag caacttctca atgaggagaa tttacggaag 60" caggaggagt ccgtgcagaa gcaggaagcc atgcggcgag tgga gcgggagatg 66" gagctgcggc acaagaatga gatgctgcga gccg aggcccgggc gcgcgccaag 72" gccgagcggg agaatgcaga ccgc gagcagatcc gcctgaaggc ggccgagcac 78" cgtcagaccg tcttggagtc catcaggacg gctggcacct tgtttgggga aggattccgt 84" gcctttgtga cagactggga caaagtgaca gccacggtgg ctgggctgac gctgctggct 90" gttggggtct actcagccaa gaatgccacg cttgtcgccg gccgcttcat Cgaggctcgg 96" aagc cgtccctagt gagggagacg tcccgcatca cggtgcttga ggcgctgcgg "02" caccccatcc aggtcagccg gcggctcctc agtcgacccc aggacgcgct ggagggtgtt "O8" gtgctcagtc ccagcctgga agcacgggtg atcg ccatagcaac aaggaacacc "14" aagaagaacc gcagcctgta catc ctgatgtacg ggccaccagg caccgggaag "20" acgctgtttg ccaagaaact cgccctgcac tcaggcatgg actacgccat catgacaggc "26" ggggacgtgg cccccatggg gcgggaaggc gtgaccgcca tgcacaagct ctttgactgg "32" gccaatacca gccggcgcgg cctcctgctc gatg aagcggacgc cttccttcgg "38" aagcgagcca ccgagaagat aagcgaggac ctcagggcca cactgaacgc cttcctgtac "44" cgcacgggcc agcacagcaa caagttcatg ctggtcctgg ccagcaacca accagagcag "50" ttcgactggg ccatcaatga ccgcatcaat gagatggtcc acttcgacct gccagggcag "56" gaggaacggg agcgcctggt gagaatgtat tttgacaagt atgttcttaa gccggccaca "62" gaaggaaagc agcgcctgaa gctggcccag tttgactacg ggaggaagtg ctcggaggtc "68" gctcggctga cggagggcat gtcgggccgg gagatcgctc ccgt gtcctggcag "74" gccacggcgt ccga ggacggggtc ctgaccgagg ccatgatgga cgtg "80" caagatgctg tccagcagca ccagcagaag atgtgctggc tgaaggcgga agggcctggg 396396 186" gacg cccc atcctgagtc cacagggaga tccacagctc cctg 192" gccgcggacc cctcccaccc ctgccttgcc tgca ggat atgctcctgg 198" gtggggactg ggctgtgccc agggcctctg tcccccagga tgtcttgtgg tgcgggtcgg 204" ccgttctgcc ccccagggca ccccctgttg taggcactgg ctagggaggg gcaggcctcc 210" ttcctgcccc tcgagacact cttgggagat gcattttccg tctggctcac agggggaggg 216" tgaggctttg caccccagcc cctgcccagg ccactgtgag ggtgggtgct ggctgagccc 222" ccggggcagc aggagccagg caggtgatgt ctttgttctc ggctcccaca gcagagccag 228" gtgagggggc gcctgccagg gccagaccca ggtggggcag cctgaaccct gcttccccct 234" gtggccggca tgccccgatc acac tggtgaccct gagagaggag ggaggaggga 240" acctggcggg tgag gccgcactgt gccg gcct ggag 246" ctggggtctg tttacctaat aaagtcccac aggtgcctca ttaaaaaaaa aa Pnneh1seguence(vafiant2y NCBI Reference Seguence: 164006.1 LOCUS NP_001164006 ACCESSION NP_001164006 l mswlfginkg pkgegagppp plppaquae gggdrglgdr papkdkwsnf dptgleraak 6" aarelehsry akdalnlaqm qeqthleqq sklkeyeaav eqlkseqira qaeerrkth l2" qara qudklarqr yedqlquql lneenlrkqe equkqeamr ratveremel 18" rhknemlrve aeararakae renadiireq irlkaaehrq tvlesirtag tlfgegfraf 24" vtdwdkvtat vagltllavg vysaknatlv agrfiearlg kpslvretsr itvlealrhp " iqurrllsr pqdalegvvl spslearvrd iaiatrntkk nrslyrnilm ygppgtgktl 36" fakklalhsg mdyaimtggd vapmgregvt amhklfdwan tsrrglllfv deadaflrkr 42" atekisedlr atlnaflyrt gqhsnkfmlv qfd waindrinem vhfdlpgqee 48" rerlvrmyfd kyvlkpateg qulklaqfd ygrkcsevar ltegmsgrei aqlavswqat 54" ayasedgvlt eammdtrqu aquhqqkmc wlkaegpgrg depsps Nudeofideseguence(vafiantBt NCBI Reference Seguence: NM_001170536.1 LOCUS NhA_001170536 ACCESSION NM_001170536 397397 l gggagccctg gcccttgccg ctcctcgccg ctgtcggcag ccacttcccg ggcgagactg 6" cgga gcacccccgg ccggagccgt gtcgcgtgcc gggaggatcg gactctttcc 12" gtcacccgtt tgcacctctg cagctgtcag gtca gcca aggacgccct 18" gaatctggca cagatgcagg agcagacgct gcagttggag caacagtcca agctcaaaga 24" gtatgaggcc gccgtggagc agctcaagag cgagcagatc cgggcgcagg ctgaggagag " gaggaagacc gagg agacccggca gcaccaggcc agggcccagt atcaagacaa 36" gctggcccgg cagcgctacg aact gaagcagcag ctca atgaggagaa 42" tttacggaag caggaggagt agaa gcaggaagcc atgcggcgag ccaccgtgga 48" gcgggagatg gagctgcggc acaagaatga gatgctgcga gtggaggccg aggcccgggc 54" gcgcgccaag cggg agaatgcaga catcatccgc gagcagatcc gcctgaaggc 60" ggccgagcac cgtcagaccg tcttggagtc catcaggacg gctggcacct tgtttgggga 66" aggattccgt gcctttgtga cagactggga caaagtgaca gccacggtgg ctgggctgac 72" gctgctggct gttggggtct actcagccaa gaatgccacg cttgtcgccg gccgcttcat 78" Cgaggctcgg ctggggaagc cgtccctagt gagggagacg tcccgcatca cggtgcttga 84" ggcgctgcgg caccccatcc aggtcagccg cctc agtcgacccc aggacgcgct 90" ggagggtgtt gtgctcagtc ccagcctgga agcacgggtg cgcgacatcg caac 96" aaggaacacc aagaagaacc gcagcctgta caggaacatc ctgatgtacg ggccaccagg "02" gaag acgctgtttg ccaagaaact cgccctgcac tcaggcatgg actacgccat "08" catgacaggc ggggacgtgg cccccatggg gcgggaaggc gtgaccgcca tgcacaagct "14" ctttgactgg gccaatacca gccggcgcgg cctcctgctc tttgtggatg aagcggacgc "20" cttccttcgg aagcgagcca agat aagcgaggac ctcagggcca cactgaacgc "26" gtac cgcacgggcc agcacagcaa caagttcatg ctggtcctgg ccagcaacca "32" gcag ttcgactggg ccatcaatga ccgcatcaat gagatggtcc acttcgacct "38" gccagggcag gaggaacggg agcgcctggt gtat tttgacaagt atgttcttaa "44" gccggccaca gaaggaaagc tgaa gctggcccag tttgactacg ggaggaagtg "50" ctcggaggtc gctcggctga cggagggcat gtcgggccgg gagatcgctc agctggccgt "56" gtcctggcag gccacggcgt atgcctccga ggacggggtc ctgaccgagg ccatgatgga "62" cacccgcgtg caagatgctg tccagcagca ccagcagaag atgtgctggc tgaaggcgga "68" agggcctggg Cgtggggacg agccctcccc atcctgagtc cacagggaga tccacagctc "74" acggagcctg gccgcggacc cctcccaccc ctgccttgcc tgca catttaggat 398398 180" atgctcctgg gtggggactg ggctgtgccc agggcctctg tcccccagga tgtcttgtgg 186" tgcgggtcgg ccgttctgcc ccccagggca ccccctgttg ctgg ctagggaggg 192" ctcc ttcctgcccc tcgagacact cttgggagat gcattttccg tctggctcac 198" agggggaggg tttg caccccagcc cctgcccagg ccactgtgag ggtgggtgct 204" ggctgagccc cagc aggagccagg caggtgatgt ctttgttctc ggctcccaca 210" gcagagccag gtgagggggc gcctgccagg gccagaccca ggtggggcag cctgaaccct 216" ccct gtggccggca tgccccgatc tttcacacac tggtgaccct gagagaggag 222" ggaggaggga acctggcggg ggtgtctgag gccgcactgt cagctggccg gtccaagcct 228" ggag ctggggtctg tttacctaat aaagtcccac ctca aaaa 2341 aa Protein seguence (variant 3): NCBI Reference Seguence: NP_001164007.1 LOCUS NP_001164007 ACCESSION NP_001164007 l mqeqthleq qsklkeyeaa veqlkseqir aqaeerrktl seetrqhqar aqudklarq 6" ryedqlquq llneenlrkq eequkqeam rratvereme lrhknemlrv eaeararaka 12" erenadiire qirlkaaehr irta gtlfgegfra fvtdwdkvta tvagltllav 18" gvysaknatl vagrfiearl gkpslvrets ritvlealrh piqurrlls rpqdalegvv 24" lspslearvr diaiatrntk knrslyrnil mygppgtgkt lfakklalhs gmdyaimtgg " dvapmgregv tamhklfdwa lllf flrk ratekisedl ratlnaflyr 36" tgqhsnkfml vlaanpeqf dwaindrine mvhfdlpgqe ererlvrmyf dkyvlkpate 42" gqulklaqf dygrkcseva rltegmsgre iaqlavswqa dgvl teammdtrvq 48" daquhqqkm cwlkaegpgr gdepsps PCBP2 OfibblSwnbd:PCBP2 Official Name: po|y(rC) binding protein 2 Gene ID: 5094 Organism: Homo sapiens 399399 Other Aliases: Other Designations: alpha-0P2; heterogeneous nuclear ribonucleoprotein E2; heterogenous nuclear ribonucleoprotein E2; hnRNP E2; poly(rC)-binding protein Nucleotide seguence nt 1): NCBI Reference Seguence: NM_005016.5 LOCUS NM_005016.
ION NM_005016 1 cccagaccag cagaggcagc agccggagca gccgcagcct gcgccctctc ccgcccgccc 6" gccctccgcc cgcccgcccg ccctccgccg ccctccaccc gccccggggt ctctttcccc l2" cttcctcctc ctcctcctcc accccccctt cctcctccgc ccgcccgcgg ggcccccctc 18" gccttcccgc ccgcccctat tgttccgccc ccggcctccc cccc ttcccgcccg 24" ctcccctttt cccctcagtc gcctcgcgcc tgcagttttt ggctt:cacc cccaaccagt " gaccaaagac ttgaccactc aaagtccagc tccccagaac actgc:cgac atggacaccg 36" gtgtgattga aggtggatta aatgtcactc tcaccatccg gctac:tatg catggaaagg 42" gcag tatcatcgga aagaaaggag ttaa gaaga:gcgc gaggagagtg 48" gtat caacatctca gaagggaatt gtcctgagag cact ggac 54" ccactaatgc catcttcaaa gcta tgatcattga caaac:ggaa gaggacataa 60" gcagctctat gaccaa:agc acagctgcca gtagaccccc cctg aggctggtgg 66" tccctgctag tcagtg:ggc attg gaaaaggtgg atgcaagatc aaggaaatac 72" gagagagtac aggggc:cag gtccaggtgg caggggatat gctacccaac tcaactgagc 78" gggccatcac :ggc attccacaat ccatcattga gtgtgtcaaa cagatctgcg 84" tggtcatgtt ggagac :ctc tcccagtccc ccccgaaggg cgtgaccatc ccgtaccggc 90" ccaagccgtc cagctc:ccg gtcatctttg caggtggtca ggacaggtac agcacaggca 96" gcgacagtgc gagctt:ccc cacaccaccc cgtccatgtg cctcaaccct gacctggagg 102" gaccacctct agaggcctat accattcaag gacagtatgc cattccacag ccagatttga 108" ccaagctgca ccagttggca atgcaacagt ctcattttcc catgacgcat ggcaacaccg 114" gattcagtgg cattgaatcc agctctccag aagg ctattgggca ggtttggatg 120" catctgctca gactacttct catgaactca ccattccaaa gatt ggctgcataa 126" tcgggcgtca aggcgccaaa atcaatgaga tccgtcagat gtctggggcg cagatcaaaa 400400 "32" ttgcgaaccc agtggaagga tctactgata ttac catcactgga tctgctgcca "38" gcattagcct ggctcaatat ctaatcaatg tcaggctttc ctcggagacg ggtggcatgg "44" ggagcagcta gaacaatgca tcca taatcccttt ctgctgttca ccaccaccca "50" tgatccatct gtgtagtttc tgaacagtca gcgattccag gttttaaata aaat "56" tttcagtttc tacacacttt atcatccact cgtgattttt taattaaagc gttttaattc "62" ctgt tcagctgttg atgctgagat ccatatttag ttttataagc ctgg "68" tttttttttt ttggctcatg aatttttctg tttgtcatgg aaatgtaaga gtggaatatt "74" aatacatttc agtttagttc tgtaatgtca ggaatttttc atta aaagatggac "80" tggagctttt tctttgtgaa tagaaactgg atgccacagt gattcatgtg ggttttattc "86" ctcttgtctt gctgttattt ttgtaccttt tatccctcaa aggacccttc ttgggttttg "92" aatggaagcc tttattccgg ttaagatgtt ttcttctatt ttaccacttc catctttttt "98" tgtggccctc gatcctattt ttccctgact ttgg ttggccctta taaaacttgt 204" gcccaaaaga ttgaggatta gactttccga ggacttacct gtcctagggg agtaggcaag 210" cact agggaggggg tgggggaaag caca tgacatacat ggcatacaca 216" ttaagcagtt gatcatatgt ctgactgggt tccagtttct tgggaatgtt ggtccccttg 222" cttg catattttaa actaaaaatt tcagtctatt gtttttagta acttcattta 228" tagtcctcca taacaagtta gaaggatgta tctgctacca tttattccta taga 234" aagttggggc ttat actcatttag tgagagtaga tgcaaaaaag tggaggggca 240" cttc tccagacacc tcagataaag tccggagccc aaggctttat cttaaccatg 246" tatggtaccc cattcattca tcaagaaaac cctcaacagc tgggcctgca tggagtgtta 252" tatttcaagg tttttcacag gggttacagt aggacagtcc ccaccccaat caggcaccag 258" gataaaagca gggacttaaa cagcaccccg gttcttcagc ctgagccatc acatgctatc 264" agtctcctaa cctccccctg ggccttaaga cagggcttgg gcagagaaga taaatggtgg 270" gacaaaaaaa tgagttacat tgccacctga gaaacctcag aggggaggac ccagccttag 276" cctccctcct cccaagtgca aaatgtgtaa acagagtaaa cggaacagaa aagtgcagtc 282" gttt tctctcctgc ccctcccacc gcccctcccc ccacccccta ttatttgggg 288" ataaagaata taaagacaac cctggctttt ctattgcctt tgct gaatataagg 294" aatggggtgg ggcaggaagg ggcttgccct tagccacagc gctg tgcctcattc 300" atttccacag ctgccagtgt ccctagagtt tatcaggtga attggtcagg gtct 306" ccctcgagcc acgg ctgggacagc cccatctttc tgttgattat gtggcgcata 401401 3121 tatatatata tatgtatata tatataattt atataaatat ttctctatgt aaaaaaaaaa 3181 aaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: 007.2 LOCUS NP_005007 ACCESSION NP_005007 1 mdtgvieggl nvtltirllm hgkevgsiig kkgesvkkmr inis egncperiit 61 lagptnaifk afamiidkle edisssmtns taasrppvtl rlvvpasch sligkggcki 121 keirestgaq quagdmlpn steraitiag ipqsiiecvk qicvvmletl sqsppkgvti 181 sssp vifagqury stgsdsasfp httpsmclnp dlegppleay tingyaipq 241 pdltklhqla mqqshfpmth gies sspevkgywa gldasaqtts heltipndli 301 gciigrqgak ineirqmsga qikianpveg stdrqvtitg saasislaqy linvrlsset 361 ggmgss PDL|M7 Official Symbol: PDL|M7 Official Name: PDZ and LIM domain 7 Gene ID: 9260 sm: Homo sapiens Other s: LMP1, LMP3 Other Designations: 1110003801 Rik; LIM domain protein; LMP; Lim mineralization protein 3; PDZ and LIM domain protein 7; n enigma Nucleotide ce (variant 1): NCBI Reference Seguence: NM_005451.3 LOCUS NM_005451 ACCESSION NM_005451 1 agaacactgg cggccgatcc caacgaggct ccctggagcc cgacgcagag cagcgccctg 61 ccaa gcaggagccg gcatcatgga ttccttcaaa gtagtgctgg aggggccagc 121 accttggggc ttccggctgc aagggggcaa ggacttcaat gtgcccctct ccatttcccg 402402 18" gctcactcct gggggcaaag cggcgcaggc cggagtggcc gtgggtgact gggtgctgag 24" catcgatggc gagaatgcgg gtagcctcac acacatcgaa gctcagaaca agatccgggc " ctgcggggag cgcctcagcc tgggcctcag cagggcccag ccggttcaga gcaaaccgca 36" gaaggcctcc gccg ctcc gcggtacacc tttgcaccca gcgtctccct 42" caacaagacg gcccggccct ttggggcgcc cccgcccgct gacagcgccc cgcagcagaa 48" tggacagccg ctccgaccgc tggtcccaga tgccagcaag cagcggctga tggagaacac 54" agaggactgg cggc cggggacagg ccagtcgcgt tccttccgca tccttgccca 60" cctcacaggc accgagttca tgcaagaccc ggatgaggag cacctgaaga aatcaagcca 66" ggtgcccagg acagaagccc cagccccagc ctcatctaca gagc cctggcctgg 72" ccctaccgcc cccagcccta ccagccgccc gccctgggct gtggaccctg cgtttgccga 78" gcgctatgcc ccggacaaaa cgagcacagt gctgacccgg cacagccagc cggccacgcc 84" cacgccgctg cagagccgca cctccattgt gcaggcagct gccggagggg tgccaggagg 90" gggcagcaac aacggcaaga tgtg tcaccagtgc cacaaggtca tccggggccg 96" ctacctggtg ggcc acgcgtacca cccggaggag tttgtgtgta gccagtgtgg "02" gaaggtcctg gaagagggtg gcttctttga ggagaagggc gccatcttct gcccaccatg "08" ctatgacgtg cgctatgcac ccagctgtgc caagtgcaag aagaagatta caggcgagat "14" catgcacgcc ctgaagatga cctggcacgt gcactgcttt acctgtgctg cctgcaagac "20" gcccatccgg gcct tctacatgga ggagggcgtg ccctattgcg agcgagacta "26" gatg tttggcacga aatgccatgg ctgtgacttc aagatcgacg ctggggaccg "32" cttcctggag gccctgggct tcagctggca tgacacctgc ttcgtctgtg cgatatgtca "38" gatcaacctg gaaggaaaga ccttctactc caagaaggac aggcctctct gcca "44" tgccttctct catgtgtgag ccccttctgc ctgc ggcc cctagcctga "50" ggggcctgga gtcgtggccc tgcatttctg ggtagggctg gcaatggttg ccct "56" ggctcctggc ccgagcctgg ggctccctgg gccctgcccc acccacctta tcctcccacc "62" ccactccctc caccaccaca gcacaccggt caca ccagccccct ttcacctcca "68" gtgccacaat gtac ccagctgtg Protein sequence (variant 1): NCBI Reference Seguence: NP_005442.2 LOCUS N 42 ACCESSION NP_005442 403403 l vleg papwgfrlqg gkdfnvplsi srltpggkaa qagvavgdwv lsidgenags 6" lthieaani racgerlslg lsraqpvqsk quasapaad apsv slnktarng l2" sapq rplv pdasqulme ntedwrprpg tgqsrsfril ahltgtefmq 18" dpdeehlkks sqvprteapa passtpqepw pgptapSpts rppwavdpaf aeryapdkts 24" tvltrhsqpa tptplqsrts ivqaaaggvp gggsnngktp vchqchkvir grylvalgha " yhpeefvcsq cgkvleeggf feekgaifcp pcydvryaps cakckkkitg eimhalkmtw 36" hvhcftcaac ktpirnrafy ycer dyekmfgtkc hgcdfkidag drflealgfs 42" whdtcfvcai cqinlegktf yskkdrplck shafshv PDCD6 OfibblSwnbd:PDCD6 Official Name: programmed cell death 6 Gene ID: 10016 Organism: Homo sapiens Other Aliases: ALG-2, PEF1 B Other Designations: apoptosis-linked gene 2 protein; probable m-binding n ALG-2; programmed cell death protein 6 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_013232.3 LOCUS NM_01 3232 ACCESSION 232 l gataatgcca ggccctgccc ccggcagagg cggaagcgga gtcggcctga gaggtctctc 6" gtcgctgcag gcgcctcagc ccagccgcgt gccttggccc atggccgcct actcttaccg 12" ccccggccct ggcc ctgggcctgc tgcaggcgcg gcgctgccgg accagagctt 18" cctgtggaac gttttccaga gggtcgataa agacaggagt ggagtgatat cagacaccga 24" gcttcagcaa gctctctcca cgtg gactcccttt aatccagtga ctgtcaggtc " gatcatatcc gacc gtgagaacaa ggccggcgtg aacttcagcg agttcacggg 36" tgtgtggaag tacatcacgg actggcagaa cgtcttccgc acgtacgacc gggacaactc 42" cgggatgatc gataagaacg agctgaagca ggccctctca ggtttcggct accggctctc 48" tgaccagttc cacgacatcc tcattcgaaa gtttgacagg cagggacggg ggcagattgc 404404 54" cgac ttcatccagg gctgcatcgt cctgcagagg ttgacggata tattcagacg 60" ttacgacacg gatcaggacg gctggattca ggtgtcgtac gaacagtacc tgtccatggt 66" tatc gtatgaccct tcgt cagc acaacatgga caaa 72" atgtcacagt ctgt gagggaatgg agcacaggtg cagttagatg ctgttcttcc 78" tttagatttt gtcacgtggg gacccagctg tacatatgtg gataagctga ttaatggttt 84" tgta atagtagctg tatcgttcta atgcagacat tggatttggt gactgtctca 90" catg aggtaaatgt aatgtttcag gcattctgct tgcaaaaaaa tctatcatgt 96" gcttttctag atgtctctgg ttctatagtg caaatgcttt tattagccaa taggaatttt 102" aaaataacat ggaacttaca caaaaggctt ttcatgtgcc ttactttttt aaaaaggagt 108" tatt cattggaata tgtgacgtaa gcaataaagg gaatgttaga cgtgtaaaaa 1141 aaaaaaaaaa a Protein seguence ( variant 1): NCBI Reference Seguence: NP_037364.1 LOCUS N P_037364 ACCESSION NP_037364 1 rpgp gagpgpaaga alpdqsflwn qurvdkdrs gvisdtequ alsngtwtpf 61 npvtvrsiis mfdrenkagv nfseftngk yitdquvfr tydrdnsgmi dknelkqals 121 gfgyrlsdqf hdilirkfdr qgrgqiafdd fiqgcivlqr ltdifrrydt dngwiquy 181 eqylsmvfsi v ACTR2 Official Symbol: ACTR2 Official Name: ARP2 actin-related protein 2 g (yeast) Gene ID: 10097 Organism: Homo sapiens Other Aliases: ARP2 Other Designations: actin-like protein 2; actin-related protein 2 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001005386.2 405405 LOCUS 005386 ACCESSION NM_001005386 l gagctcaccg ctgccagtcg cgctgcctgc ccgtcccacc cttttcgtgc aggcattcag 6" ctaaatgacg ggcggagccc ggct cggg ggaaaaaagt tgggccgaag l2" gaggggccgg gaagacgcaa gaggaagaag agaaaacggc nggngng tggctgtagg 18" ttgtgcggct gctc ttccctgggc ggacgatgga cagccagggc aggaaggtgg 24" tggtgtgcga caacggcacc gggtttgtga agtgtggata tgcaggctct aactttccag " aacacatctt cccagctttg agac tcag atcaaccacc aaagtgggaa 36" acattgaaat caagaataac aaaaagatgg atcttatggt tggtgatgag gcaagtgaat 42" tacgatcaat gttagaagtt aactaccc:a tggaaaatgg catagtacga aattgggatg 48" acatgaaaca cctgtgggac tacacatt gaccagagaa acttaatata gataccagaa 54" attgtaaaat cttactcaca gaacctcc:a tgaacccaac caaaaacaga gagaagattg 60" tagaggtaat gtttgaaact taccagtt ccggtgtata tgtagccatc caggcagttc 66" tgactttgta cgctcaaggt ttattgac gtgtagtggt agactctgga gatggtgtga 72" ctcacatttg atat gaaggctt ctctccctca tcttaccagg gata 78" ttgctgggag ggatataact agatatct:a tcaagctact tctgttgcga ggatacgcct 84" tcaaccactc tgctgatttt gt:c gcatgattaa agaaaaactg tgttacgtgg 90" gatataatat tgagcaagag cagaaactgg ccttagaaac cacagtatta gttgaatctt 96" atacactccc agatggacgt atcatcaaag ttgggggaga gagatttgaa gcaccagaag "O2" ctttatttca gcctcacttg gttg aaggagttgg tgaa ttta "08" acacaattca tgac attgatacca gatctgaatt ctacaaacac attgtgcttt "l4" ctggagggtc tactatgtat cctggcctgc catcacggtt ggaacgagaa cagc "20" tttacttaga acgagttttg aagggtgatg tggaaaaact ttctaaattt aagatccgca "26" ttgaagaccc accccgcaga aagcacatgg tattcctggg tggtgcagtt ctagcggata "32" tcatgaaaga caaagacaac ttttggatga cccgacaaga gtaccaagaa aagggtgtcc "38" gtgtgctaga gaaacttggt gtgactgttc gataaactcc aaagcttgtt cccgtcatac "44" ccgtaatgct ttcttttttc ctttattgcc aatctttgaa caac tccaggacat "50" ggaagaggcc tctctctgcc ctttgactgg aaaggtcaag ttttattctg gtgtcttggg "56" gaagctttgt taaatttttg ttaatgtggg taaatctgag tttaattcaa ctgcttccct "62" acatagacta taag gattctgtct gctgctttgt ttcttctaag taggcattta 406406 168" gatcattcct ataggcttcc tattttcact ttactgctct aatgctgcta gtcgtagtct 174" ttagcacact aggtggtatg cctttattag cataaaacaa aaaaaacttt aacaggagct 180" tatt actgggatgg ggggtggttc gggatgggtg ggcagctgct gaacccttta 186" gggcatttcc tctgtaatgt ggcgctttca actgtactgc tgcagcttta agtaccttaa 192" agcttctcct gtgaacttct tagggaaatg ttaggttcag aactaaagtg ttttgggtgg 198" gttttgttgc gggggggagg gtaacaatgg ttct gatttttatt tttgaggttt 204" ctgg agtacgtaga ggaactttat ttacagtact ttgatttggc aggttttctt 210" ctacttgtgc tctgcctgga gctgtttcca tatgatataa aaagcaagtg tagtattcca 216" ttactatgtg gcttagggat ttatttgttt tttaaaatca accatgttag ctgggattag 222" actccctaca gtccttcaat ggaaaagtaa catttaaaaa tcctttgggt aattcgaatt 228" acagatttaa aagagcttaa gatctggtgt tttgttaatg cttctgttta ttccagaagc 234" attaaggtaa cccattgcca agtatcattc ttgcaaatta tata taactgacca 240" gtgcttaata aaacaagcag gtacttacaa actg gcagtaggtt ataattggtg 246" gtttaaaaat aacattggaa tacaggactt gttgccaatt gggtaatttt cattagttgt 252" tttgtttgtt ttgatttgaa acctggaaat acagtaaaat gttt aaaatgttgg 258" ccaaaaaaat ttaa tttttttatt tgtactgaaa aactaatcat taat 264" tctcagccat ctttgaagct tgaaagaaga gtctttggta ttttgtaaac gttagcagac 270" tttcctgcca gtgtcagaaa atcctattta tgaatcctgt cggtattcct tggtatctga 276" aaaaaatacc aaatagtacc atacatgagt tatttctaag tttgaaaaat aaaaagaaat 282" tgcatcacac taattacaaa atacaagttc tggaaaaaat atttttcttc attttaaaac 288" ttttttttaa atag ctttgaaaga ttaa ggtg gtaactaaaa 294" tcaaaagaaa actt gagggtctct gtttggtaag aatacatcat tagcttaaat 300" caga aggttagttt taattatgta gcttctgtta atattaagtg ttttttgtct 306" gttttacctc aatttgaaca gataagtttg cctgcatgct ggacatgcct cctg 312" aatagcccgt actagatctt gggaacatgg atcttagagt cactttggaa taagttctta 318" tacc cccagccttt tgagaacggg gcttgttaaa ggacgcgtat cccg 324" tacctactgg cagttgggtt cagggaaatg ggattgactt ggccttcagg ctcctttggt 330" cataatttta aaatatggga gtagaaaaca acaaagaatg gaatggactc ttaaaacaat 336" gaaagagcat ttatcgtttg tcccttgaat gtagaatttg tttttgattt cataattctg 342" aatg tgacagttaa aatggtgcat tata tattataatt tagaaatacc 407407 348" taat tttactattc cagggtgaca attt aaatttggga tttgggtgga 354" gtattatgtt taactggagt tgtcaagtat gagtccctca ggaaaaaaaa aaaattctgt 360" tttaaaaagc aatctgattc ttagctcttg aaactattgc tacttaaatt tccaataatt 366" aaaaatttaa aatttttaaa ttagaattgc caatacttct gaga agggtttttt 372" taca tttagtaaag agac attagtctta aact tttttcttta 378" aaacatggtt ttggtggtta acttttacac agttctgagt actgttaata tctggaaagt 384" atcttgagat atcagtggaa agctaaacag tctaaattaa catgaaatac ttcattttga 3901 ttgagaaaat aaaatcagat tttttcaaag tcaaaaaaaa aaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_001005386.1 LOCUS NP_001005386 ACCESSION NP_001005386 1 mdsqgrkvvv cdngtgkac fpeh ifpalvgrpi irsttkvgni eiknnkkmdl 61 mvgdeaselr smlevnypme ngivrnwddm khlwdytfgp eklnidtrnc killteppmn 121 ptknrekive vmfetyqfsg vyvaiqavlt lyaqglltgv vvdsgdgvth icpvyegfsl 181 phltrrldia grditrylik llllrgyafn hsadfetvrm ikeklcyvgy klal 241 ettvlvesyt ikvg gerfeapeal fqphlinveg lfnt iqaadidtrs 301 efykhivlsg gstmypglps rlerelkqu lervlkgdve klskfkirie dpprrkhmvf 361 lggavladim kdkdnfwmtr qeyqekgvrv leklgvtvr TXNDC12 Official Sym bol: TXNDC12 Official Name: thioredoxin domain containing 12 (endoplasmic reticulum) Gene ID: 51060 Organism: Homo sapiens Other Aliases: UNQ713/PRO1376, AG1, AGRl, ERP16, ERP18, ERP19, PD|A16,TLP19,hAG-1,hTLP19 Other Designations: ER protein 18; ER protein 19; or gradient homolog 1; endoplasmic reticulum n ERp19; endoplasmic reticulum resident protein 18; endoplasmic reticulum resident protein 19; endoplasmic lum thioredoxin superfamily member, 18 kDa; protein disulfide isomerase family A, 408408 member 16; thioredoxin domain-containing protein 12; thioredoxin-like protein Nucleotide seguence: NCBI Reference Seguence: NM_015913.3 LOCUS NM_015913 ACCESSION NM_015913 l agtgctagtg gcggcaggtg caggtggccg cgcggcatcc tggggcttgc agtctcccga 6" gcgttctgtt gtgtccctgc ctacaatttt agggaaccta ataaaagggt ggtcggtatg l2" tttttatttg tact tttgttaggt ttcg atta agtacggact 18" ttaggactca actagtacca aaaa gaccggacat tttcacctgt ttgttataca 24" gcgaagggga aaaattggga agaaatcctc aagctacaag aaaaataacc agaagcttta " cactttagcc tgcagtgact tatatcctgg tgtcctaagt ccacctaagt cagttttgca 36" ataagaggtc ccaagtttgg ttttcttgga gcttgcatca gttggttgca tcatccctga 42" gtagaagatt tgcggttgca aggaaaaata aggtacagag cttctccagc gggaaagtgc 48" atgtctgcac ggcacgagcc cacgcaccgc agaacaggct tgccaggtct cctcagagac 54" agga acctaacaat gaaatccagt gtct tgatttgtgg aggggtaagg 60" cgag gggc aatccgccca ctgttgggag cgactgacct cacgaatcaa 66" gctt ttgactagga agtgcagcgg ttcttggggg gaggggctgg actgggtggc 72" ggacgcgagg agcaacggtt ctcccgaacc tctcccccgc ccctactatc ttggcctaca 78" ttttcccgct ccgtcccggg acac ccagaatcca cgaaaagcaa ctcgcgctcg 84" agaacagctc tcgtaccctt ctacgtgatc tgcaccttta agctcactcc atcccaaacc 90" ggaccccgga ggcaccaccc acatccgtct aacatcactt gagt ttgaaaaaaa 96" aaaatctggg aagtagaggt gttgtgctga gcggcgctcg tgtg tggaccgtct "02" gctgggactc cggccctgcg cagc cccgtggccc cgcgcaccta ctgccatgga "08" gacgcggcct cgtctcgggg ccacctgttt gctgggcttc agtttcctgc tcctcgtcat "l4" ctcttctgat ggacataatg ggcttggaaa gggttttgga gatcatattc attggaggac "20" actggaagat gggaagaaag aagcagctgc cagtggactg cccctgatgg tgattattca "26" taaatcctgg tgtggagctt gcaaagctct aaagcccaaa tttgcagaat ctacggaaat "32" ttcagaactc tcccataatt ttgttatggt aaatcttgag gatgaagagg aacccaaaga "38" tgaagatttc agccctgacg ggggttatat tccacgaatc ctttttctgg atcccagtgg 409409 "44" caaggtgcat cctgaaatca agaa tggaaacccc agctacaagt atgt "50" cagtgccgag caagttgttc aggggatgaa ggaagctcag gaaaggctga atgc "56" cttcagaaag aaacatcttg aagatgaatt gtaacatgaa cctt ctttcatcag "62" agttagtgtt ctggaaggaa agcagcaggg aagggaatat tgaggaatca tctagaacaa "68" ttaagccgac caggaaacct cattcctacc tacactggaa ggagcgctct cactgtggaa "74" gagttctgct aacagaagct ggtctgcatg tttgtggatc cagcggagag tggcagactt "80" cctt ctca cctaaatgtc aacttgtcat tgaatgtaaa gaatgaaacc "86" caca aaacttgagc cacttggatg tttactcctc gcacttaagt atttgagtct "92" tttcccattt cctcccactt tactcacctt agtggtgaaa ggagactagt agcatctttt "98" ctacaacgtt aaaattgcag aagtagctta tcattaaaaa acaacaacaa caacaataac 204" aataaatcct aagtgtaaat cagttattct accccctacc aaggatatca gcctgttttt 210" tttt ctcctgggaa taattgtggg cttcttccca aatttctaca gcctctttcc 216" tcttctcatg cttgagcttc tgca cgcatgcgtg tgcaggactg gctgtgtgct 222" tggactcggc tccaggtgga agcatgcttt ttac tgttggagaa actcaaacct 228" tcaagcccta ggtgtagcca ttttgtcaag tcatcaactg tatttttgta ctggcattaa 234" caaaaaaaga gataaaatat tgtaccatta aactttaata aaactttaaa aggaaaaaaa 2401 aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_056997.1 LOCUS NP_056997 ACCESSION NP_056997 1 metrprlgat cllgfsflll vissdghngl gkgfgdhihw rtledgkkea aasglplmvi 61 ihkswcgack alkpkfaest eiselshnfv mvnledeeep kdedfspdgg yiprilfldp 121 sgkvhpeiin engnpsykyf yvsaeqvvqg mkeaqerltg dafrkkhled e1 Official : ANXA7 Official Name: annexin A7 m: 310 Organism: Homo sapiens 410410 Other Aliases: 37A6.8, ANX7, SNX, SYNEXIN CHherDesbnafions:annex”1VH;annexm-7 Nudeofideseguence(vafiant1t NCBI Reference Seguence: NM_001156.3 LOCUS NM_001156 ION NM_001156 l ccaccctggg cccg gctccatctt gcgggagacc gggttgggct ctgc 6" tgctggggtc tcat acccaggcta tcccccaaca ggctacccac ctttccctgg 12" atatcctcct gcaggtcagg agtcatcttt tcccccttct ggtcagtatc cttatcctag 18" tggctttcct ccaatgggag gaggtgccta cccacaagtg ccaagtagtg gctacccagg 24" agctggaggc taccctgcgc ctggaggtta tccagcccct tatc :ggtgcccc " acagccaggg ggagctccat cctatcccgg agttcctcca ggccaaggat :ggagtccc 36" accaggtgga gcaggctttt ctgggtatcc acagccacct tcacagtctt :ggaggtgg 42" tccagcacag gttccactac ctggtggctt tcctggagga cagatgcctt 48" tggaggacaa cctacttacc ctagtcagcc tgccacagtg actcaggtca 54" tatccgacca gctgccaact tcgatgctat aagagatgca gaaattcttc g :aaggcaat 60" tttt gggacagatg caat tgtggatgtg aacc g :tccaatga 66" tcagaggcaa aaaattaaag cagcatttaa gacctcctat ggcaaggatt taatcaaaga 72" tctcaaatca gagttaagtg gaaatatgga agaactgatc ctggccctct tcatgcctcc 78" tacgtattac gatgcctgga gcttacggaa gcag ggagcaggaa ctcaggaacg 84" tgtattgatt gagattttgt gcacaagaac aaatcaggaa atccgagaaa gatg 90" ttatcagtca gaatttggac gagaccttga aaaggacatt aggtcagata catcaggaca 96" ttttgaacgt ttacttgtgt ccatgtgcca gggaaatcgt gatgagaacc agagtataaa "02" ccaccaaatg gc:caggaag atgctcagcg tctctatcaa gctggtgagg ggagactagg "08" gaccgatgaa tc:tgcttta acatgatcct tgccacaaga agctttcctc agctgagagc "l4" taccatggag tcta ggatggctaa tcgagacttg ttaagcagtg tgagccgtga "20" gttttccgga ta:gtagaaa gtggtttgaa gaccatcttg cagtgtgccc tgaaccgccc "26" tgccttcttt gc:gagaggc tctactatgc tatgaaaggt gctggcacag atgactccac "32" cctggtccgg gtca ctcgaagtga gattgacctt gtacaaataa aacagatgtt "38" cgctcagatg ta:cagaaga ctctgggcac aatgattgca ggtgacacga gtggagatta 411411 1441 ccgaagactt cttctqgcta ttgtqggcca gtaggaggga tttttttttt tttaatgaaa 1501 aaaaatttct attcatagct tatccttcag agcaatgacc tgcatgcagc aaac 1561 atcagctaac cgaaagagct ttctgtcaag gaccgtatca gggtaatgtg cttggtttgc 1621 acatgttgtt attgccttaa ttctaatttt ttct ctacatacaa tcaatgtaaa 1681 gccatatcac aatgatacag taatattgca atgtttgtaa accttcattc ttactagttt 1741 cattctaatc aagatgtcaa attgaataaa aatcacagca atctctgatt ctgtgtaata 1801 atattgaata attttttaga aggttactga aagctctgcc ttccggaatc cctctaagtc 1861 tgcttgatag agtggatagt gtgttaaaac tgtgtacttt aaaaaaaaat tcaaccttta 1921 catctagaat aatttgcatc tcattttgcc taaattggtt ctgtattcat aaacactttc 1981 cacatagaaa atagattagt attacctgtg gcacctttta agaaagggtc aaatgtttat 2041 aaga tacatagcct actttttttt cgcagttgtt ttcttttttt agtt 2101 aata aaaaattgca tatatttaag gtgtacaaaa aaaaaa Protein seguence (variant 1): NCBI Reference ce: NP_001147.1 LOCUS NP_001147 ACCESSION N P_001147 1 msypqypptg yppfpgyppa gqessfppsg qypypsgfpp mgggaypqvp SSgYPgaggY 61 papggypapg qug apsypgvppg qngVPpgga qfsgypqpps qsygggpaqv 121 plpggfpggq mpsqypggqp typsqpatvt irpa anfdairdae ilrkamkgfg 181 tdeqaivdvv anrsndqqu ikaafktsyg kdlikdlkse elil alfmpptyyd 241 awslrkamqg agtqervlie ilctrtnqei reivrcyqse fgrdlekdir sdtsghferl 301 lvsmcqgnrd enqsinhqma qedaquyqa gequgtdes cfnmilatrs qulratmea 361 ysrmanrdll ssvsrefsgy vesglktilq calnrpaffa erlyyamkga gtddstlvri 421 vvtrseidlv qmy qktlgtmiag dtsgdyrrll laivgq PFKM Official Symbol: PFKM al Name: phosphofructokinase, muscle w:5213 412412 Organism: Homo sapiens Other Aliases: GSD7, PFK-1, PFK1, PFKA, PFKX Other Designations: 6-phosphofructokinase; 6-phosphoiructokinase, muscle type; PFK-A; phosphofructokinase isozyme A; phosphofructokinase 1; ofructokinase, polypeptide X; phosphofructokinase-M; phosphohexokinase Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001166686.1 LOCUS NM_001166686 ACCESSION NM_001166686 1 gtcccagggg gcggggcaga ggaaaaggcg ccggccccac agtgctcccc gcttccgccc 6" gccc gggccggctg accgggtccg acacagtctc ctggaccagg ctccctccat 12" cctcacccct cccccagctt gcca ctcaccgaac cggaaccggc tgccatgcga 18" aggggtttcc ggccgggcgc ggaacgcaaa acccgggaac cgccgcgaac cggaaccgcc 24" ttcacagcac cggaagagtc gctaggaggc agccatgcat aaagacgagt ttcatctgaa " atttttcatg tgtgtgattc gcca gttagtcagg actcctcaga gaacagctgg 36" ggaagcttct acttccagca tgctcatacc aaagccacca ccaaagacag acatcttgaa 42" gagtctagat actatggatg atccagacac cgtgggaagc atacctgttt tcaaaactga 48" gtggatcatg acccatgaag atgc agccaaaacc ctggggattg gcaaagccat 54" tgctgtctta acctctggtg ccca gaat gctgctgtca gggctgtggt 60" tcgagttggt atcttcaccg gtgcccgtgt cttctttgtc catgagggtt atcaaggcct 66" ggtggatggt ggagatcaca tcaaggaagc cacctgggag agcgtttcga tgatgcttca 72" gctgggaggc acggtgattg gaagtgcccg gtgcaaggac tttcgggaac gagaaggacg 78" actccgagct aacc tggtgaagcg tgggatcacc aatctctgtg tcattggggg 84" tgatggcagc ctcactgggg ctgacacctt ccgttctgag tggagtgact gtga 90" cctccagaaa gcaggtaaga tcacagatga ggaggctacg aagtccagct acctgaacat 96" tgtgggcctg gttgggtcaa atga cttctgtggc atga gcac 102" tgactctgcc ctgcatcgga tcatggaaat tgtagatgcc atcactacca ctgcccagag 108" gagg acatttgtgt tagaagtaat gggccgccac tgtggatacc tggcccttgt 114" cacctctctg tcctgtgggg ccgactgggt ttttattcct gaatgtccac cagatgacga 120" ctgggaggaa cacctttgtc tcag cgagacaagg acccgtggtt ctcgtctcaa 413413 "26" catcatcatt gtggctgagg gtgcaattga caagaatgga aaaccaatca cctcagaaga "32" catcaagaat ctggtggtta agcgtctggg atatgacacc cgggttactg tcttggggca "38" tgtgcagagg ggtgggacgc catcagcctt tgacagaatt ctgggcagca ggatgggtgt "44" ggaagcagtg atggcacttt tggaggggac cccagatacc ccagcctgtg tagtgagcct "50" ctctggtaac caggctgtgc gcctgcccct catggaatgt gtccaggtga ccaaagatgt "56" gaccaaggcc atggatgaga agaaatttga cgaagccctg aagctgagag gccggagctt "62" catgaacaac gtgt acaagcttct agctcatgtc agacccccgg tatctaagag "68" gcac acagtggctg tgatgaacgt gggggctccg ggca tgaatgctgc "74" tgttcgctcc actgtgagga ttggccttat ccagggcaac cgagtgctcg ttgtccatga "80" tggtttcgag ggcctggcca aggggcagat agaggaagct ggctggagct atgttggggg "86" ctggactggc caaggtggct ctaaacttgg aagg actctaccca agaagagctt "92" tgaacagatc agtgccaata taactaagtt taacattcag ggccttgtca tcattggggg "98" ctttgaggct tacacagggg gcctggaact gggc aggaagcagt ttgatgagct 204" ctgcatccca tttgtggtca ttcctgctac agtctccaac aatgtccctg gctcagactt 210" cagcgttggg gctgacacag cactcaatac tatctgcaca gacc gcatcaagca 216" gtcagcagct ggcaccaagc gtcgggtgtt tatcattgag actatgggtg gctactgtgg 222" ctacctggct accatggctg gactggcagc tggggccgat gctgcctaca tttttgagga 228" gcccttcacc attcgagacc tgcaggcaaa tgttgaacat ctggtgcaaa aaac 234" gaaa aggggcttgg tgttaaggaa tgaaaagtgc aatgagaact ataccactga 240" cttcattttc tact ctgaggaggg gaagggcatc ttcgacagca ggaagaatgt 246" gcttggtcac atgcagcagg gtgggagccc aaccccattt gataggaatt ttgccactaa 252" gatgggcgcc aaggctatga actggatgtc aatc aaagagagtt accgtaatgg 258" gcggatcttt gccaatactc cagattcggg ctgtgttctg gggatgcgta agagggctct 264" ccaa ccagtggctg agctgaagga ccagacagat tttgagcatc gaatccccaa 270" ggaacagtgg tggctgaaac tgaggcccat cctcaaaatc ctagccaagt acgagattga 276" cact tcagaccatg cccacctgga gcacatcacc cggaagcggt ccggggaagc 282" tgccgtctaa acctctctgg ggga atagattacc tgatcatggt cagctcacac 288" cctaataagt ccacatcttc tcagtgtttt agctgttttt aggt ttccttttat 294" tctgtacctt gcagccatga ctgg ccaggagctg gaggagcagg cagtgggtgg 300" cttt taggtagaat ttaacatgac ttctgcccca gctttatctg tcacacaagg 414414 306" ctgggcacct ctac tgctagatat cacttactca gttagaattt tcctaaaaat 312" aagctttatt tatttctttg tgataacaaa gagtcttggt tcctctacta cttttactac 318" agtgacaaat taca ctaataaatg ccaactggtc actgtgcttt tgcttctcct 324" gttatcatct tcctaagtgg aatgtaatac tgtcagcccc atgtatcaga cacttgtctg 330" atgaagcagt aaagacgtta agggtatcac agggggtgga ggaagggatt atctctagta 336" cactacttgc tggctgtctg tgtc actgccaaac tctaaaaaca gttctaaata 342" gtgactgaga aggtttgttg ctggagtcag ggaataaggc agccaaatac tctttgcaca 3481 tagt gggaagagaa attaacaata aatatcaagc actgtgaaaa aaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_001160158.1 LOCUS NP_001160158 ACCESSION NP_001160158 l mhkdefhlkf fmcviqsrql vrtpqrtage astssmlipk pppktdilks ldtmddpdtv 6" ktew imtheehhaa ktlgigkaia vltsggdaqg mnaavravvr arvf 12" fvhequglv dggdhikeat wesvsmmlql ggtvigsarc kdfreregrl vkrg 18" itnlcviggd gsltgadtfr sewsdllsdl qkagkitdee atkssylniv glvgsidndf 24" igtd meiv daitttaqsh qrtfvlevmg rhcgylalvt slscgadwvf " dddw eehlcrrlse trtrgsrlni iivaegaidk ngkpitsedi knlvvkrlgy 36" dtrvtvlghv safd mgve avmallegtp dtpacvvsls gnqavrlp;m 42" ecqutkdvt kamdekkfde alklrgrsfm nnwevyklla hvrppvsksg mnvg 48" apaagmnaav rstvrigliq gnrvlvvhdg feglakgqie eagwsyvggw tgqggsklgt 54" krtlpkksfe qisanitkfn iqglviiggf eaytgglelm egrkqfdelc ipfvvipatv 60" snnvpgsdfs vgadtalnti cttcdrikqs aagtkrrvfi ietmggycgy latmaglaag 66" adaayifeep ftirdlqanv ehlvqkmktt vkrglvlrne kcnenyttdf ifnlyseegk 72" knvl ghmqqggSpt pfdrnfatkm gakamnwmsg kikesyrngr ifantpdsgc 78" vlgmrkralv fqpvaelkdq tdfehripke qulklrpil kilakyeidl dtsdhahleh 84" itrkrsgeaa SUB1 OfibblSymbokSUB1 415415 Official Name: SUB1 g (S. cerevisiae) m: 10923 Organism: Homo sapiens Other Aliases: P15, P04, p14 Other Designations: ted RNA polymerase II transcription coiactor 4; activated RNA polymerase II transcriptional coactivator p15; positive coiactor 4 Nucleotide seguence: NCBI Reference Seguence: NM_006713.3 LOCUS NM_006713 ACCESSION NM_006713 l gccccatcac gtgaccgcag ccccagcgcg gcggggccgg cgtctcctgg ctgccgtcac 6" ttccggttct gtcg cgagcgaacg accaagaggg tgttcgactg ctagagccga 12" gcgaagcgat gcctaaatca aaggaacttg tttcttcaag ctcttctggc agtgattctg 18" acagtgaggt tgacaaaaag agga aaaagcaagt tgctccagaa aaacctgtaa 24" agaaacaaaa gacaggtgag acttcgagag ccctgtcatc ttctaaacag agcagcagca " gcagagatga taacatgttt cagattggga aaatgaggta cgttagtgtt cgcgatttta 36" aaggcaaagt gctaattgat attagagaat tgga tcctgaaggt gaaatgaaac 42" caggaagaaa aggtatttct ttaaatccag aacaatggag ccagctgaag gaacagattt 48" ctgacattga tgatgcagta agaaaactgt aaaattcgag aaat aaaacctgta 54" ctgttctagt tgttttaatc tgtcttttta cttt tgttttctaa atgttctcca 60" agctattgta gatt gcagaagaat ttgtaagatg tttt tttaatgtgc 66" attattaaaa atattgagtg attg tcaactttat taaggattac tttgtctgcc 72" caccacctag tgtaaaataa aatcaagtaa tacaatctta actgttgtgg ccttttttga 78" tcataagagt tggtactgtt taaggccaaa agtaacagtt tttatagatc ttttagtttc 84" aactcagctt ttacaataaa aaggatttgt attgcattga gtttataaac ttttggtttg 90" tgaacttcat atttgatctt ttctcttcca atcaaatgtc taggcttgtt tgacttccac 96" tggt ttttcactct ttac ttcc tttaataact taatctcttc 102" atgttcagtt tttacttcac tctttattct tttctttgat tatggtatgc ttatttggaa 108" agtcagtgaa actgtcaaaa tgttatctca ataagatact tatatgagaa ctacaatcac 114" cgaatctact gtattcaata ttagcagatc taatttgata aacaacatgg cttgtgtgaa 416416 "20" aactgagcag gtgtttgttt acccatagtg ttctgtgtag ttattgctta gtctgcagaa "26" aataatgact tagatgagat gtctgacttg ctttcactta ttaaacatgt tggg "32" atgatgtctg taacatcaga tattgttcaa ctagactagg atttaataaa aattgtgaaa "38" gcttactggc ctaacatttt attttataat attgggtatg aattatatgt agccagagat "44" gtcattaagc tttactgtta tagtaggtaa tatggttagt ggaa aagagcatat "50" gagcacatgc ttgtgtattt tggcctttgc cccagtagaa aatg gcattctaga "56" cttgatgata ctaagtttta gcagacacta gtaagtggtt tgtatttaac catactgatg "62" aagcagacag attgaggcac agattttagt ggctttgtgg caataaatag ggcatggtgt "68" gccttaggaa aagaatgttt ataaagggaa ttataactga aattaaagga ggcggcagtg "74" aagaggaaat aattctcttc tatctaaatg atatacatat gatattttga gatttttata "80" acagcagtgg aacacaattc taggtagagt agaaaaagga aagttttaaa gacatataaa "86" ttg: tgacaaatta tttttggtag caaatctcaa acct gctattaagg "92" tctgccata: tttg tttg ctaccaagtt tgattcatac atctaaaaca "98" ttttgtagt: acttgtcaag gacttaattt gaaaa:catt tgccaggcca catagttatc 204" aattttttt: tctatcagct attctgttgt aaaa cattttttag tttt 210" aaagtatat: tagcagtaac cttatgaggt tcaaa:tggt cttg taatttagcc 216" ttcatcgaa: tacc agtgtattaa gtat tttttgcagc cccttgaacc 222" agagtaggt: cagagaaact cccaaagttt gtact:taga cacatcatgc ttgattggta 228" acttccctcc ttttttgggg aacatgtttg tgtcc:atta acttaattgg atagattttt 234" aaatatttct tatttttggc acacggaaag ggtag:tcga gtacagaact ttgatttttg 240" gtgtagatgc agagggaatg atgggtaaat ttcctaggtt tatgtgaatt tagggggtgt 246" atgcattttg aaacaatcta ctaacagatg gtgctgaaat ccta catgttttct 252" agttgttcag cattatgtta atgaagcctc catataagga gtgtttctct ggcacagttg 258" gtaagttgac tgctaacttc atttaaatgt gttactggat atgcagtata ctgaaattat 264" taatcagttt gtgtatagga aaagagaact gggttaaaag caaattaact tgttctgaaa 270" agaaagtata tttt gttttctgtt taaattttat ggta aagatttttt 276" tttcctgggc agaaaacttg ttag gcgtagatac cttaccttac aatgccaaaa 282" tgaatttaat tccagtactc aggtttttcc ctttaacaga ctctatgtgt atcagggctt 288" tctaatgggt ttttcctctt cgtttttaaa atgtgagtag catttgacca atttccagtg 294" ctcttagcat tttacttaaa gaacaaccac tacaaaagaa aatctttgta atttgattgt 417417 300" cttttgcttt gcttcattaa tgcctaagaa aata ctcctacctc attagctact 306" gctg tgacgatcaa atctattcta cataatgcgt ttagaaacaa agacttgggt 312" gaaaaatgaa ataagtatat tctgacttgg ctattgaggg gaaaattcag tattaagtgt 318" tcctcacagg agatatgtta gcagaatact ataaaagttt gaaattttta aaaagtaaaa 324" gtacttaaat ttaggtatct ctcctgaaat tctttgcagt tcatttttta tggcagttaa 330" tccagtgaaa cactcaaaag tttttttttt tttaaaagtg tttttccaga taaactgtag 336" ggtgaacatt cacataatca caaatatgta attctgtaat atgc ttgtatgctt 342" tgttttcgta catcttccat ggagatgtct gaatataata ctccatctgt gaatatttta 3481 aatgttgaaa taaaagtaag aaatgtgaaa aaaaaaaaaa aa Protein ce: NCBI Reference Seguence: 704.3 LOCUS NP_006704.3 ACCESSION 704.3 1 mpkskelvss sssgsdsdse vdkklkrkkq vapekpvkkq ktgetsrals ssrd 61 dnquigkmr yvsvrdfkgk vlidireywm dpegemkpgr kgislnpeqw sqlkeqisdi 121 ddavrkl ACDB3 Official Symbol: ACBD3 Official Name: acyl-CoA binding domain containing 3 Gene ID: 64746 Organism: Homo sapiens Other Aliases: GCP60, GOCAPf, , PAP7 Other Designations: Golgi resident protein GCP60; PBR- and PKA-associated protein 7; PKA (Rlalpha)-associated protein; oenzyme A binding domain containing 3; golgi complex associated protein 1, 60kDa; golgi phosphoprotein 1; peripheral benzodiazepine receptor-associated n PAP7 Nucleotide seguence: NCBI Reference Seguence: NM_022735.3 LOCUS NM_022735 ACCESSION NM_022735 418418 l atacgtggct gccgtctgtc cccgctgagg aggtgcagca gatg gcggcggtgc 6" tgaacgcaga gcgactcgag gtcg acggcctcac gctcagcccg gacccggagg 12" agcggcctgg ggcggagggc gccccgctgc tgccgccacc gctgccaccg ccac 18" ctggatccgg tcgcggcccg ggcgcctcag gggagcagcc cgagcccggg gcgg 24" ctgggggcgc ggag gcgcggcggc tggagcagcg ctggggtttc ggcctggagg " agttgtacgg cctggcactg cgcttcttca aagaaaaaga tggcaaagca ccaa 36" cttatgaaga aaaattgaag cttgtggcac tgcataagca agttcttatg tata 42" atccagacac ttgtcctgag gttggattct ttgatgtgtt ggggaatgac aggaggagag 48" aatgggcagc cctgggaaac atgtctaaag aggatgccat ggtggagttt gtcaagctct 54" taaataggtg ttgccatctc ttttcaacat atgttgcgtc ccacaaaata gaag 60" agcaagaaaa aaaaaggaag gaggaagagg agcgaaggcg gcgtgaagag gaagaaagag 66" aacgtctgca aaaggaggaa gagaaacgta ggagagaaga aagg cttcgacggg 72" aggaagagga aaggagacgg atagaagaag aaaggcttcg gttggagcag caaaagcagc 78" agataatggc agctttaaac tcccagactg ccgtgcagtt ccagcagtat gcagcccaac 84" agtatccagg gaactacgaa cagcagcaaa ttctcatccg ccagttgcag gagcaacact 90" atcagcagta catgcagcag ttgtatcaag tccagcttgc acagcaacag gcagcattac 96" agaaacaaca ggaagtagta gtggctgggt cttccttgcc tacatcatca aaagtgaatg "02" tacc aagtaatatg atgtcagtta atggacaggc caaaacacac actgacagct "08" ccgaaaaaga actggaacca gaagctgcag aagaagccct ggagaatgga ccaaaagaat "l4" ctcttccagt agct ccatccatgt ggacacgacc tcagatcaaa gacttcaaag "20" agaagattca gcaggatgca gattccgtga ttacagtggg ccgaggagaa gtggtcactg "26" ttcgagtacc cacccatgaa gaaggatcat atctcttttg ggaatttgcc aatt "32" atgacattgg gtttggggtg tattttgaat ggacagactc tccaaacact gctgtcagcg "38" tgcatgtcag tgagtccagc gacg aaga agaaaacatc gaag "44" agaaagccaa aaagaatgcc aacaagcctt tgctggatga gattgtgcct gtgtaccgac "50" gggactgtca tgaggaggtg tatgctggca gccatcaata tccagggaga ggagtctatc "56" agtt tgacaactcc tactctttgt ggcggtcaaa atcagtctac tacagagtct "62" attatactag ataaaaatgt tgttacaaag tctggagtct gggc agaagatgac "68" atttaatttg gaaatttctt tttacttttg tggagcatta gagtcacagt ttaccttatt "74" gatattggtc tgatggtttg cttg ctgggaatca aaatttcctt gagactcttt 419419 180" agcattcata ctttggggtt aaaggagatt cctcagactc atccagccct tgggtgctga 186" ccagcagagt cactagtgga tgctgaagtt acatgagcta catgttaaat atttaaagtc 192" tccaaaataa aacaccccaa cgttgacctt acccggctga tggttagccc cttgctgcct 198" gctccatgtg tcttatgaga gcccgtagtt acagtgtcct tgaa atccataagt 204" taacaagtct ggtg cagctggctt tgattaaagg ccatttttaa aaaa 210" ctcaacacct cacagattat aaaa gaaatggcct gatc agaa 216" tgacccagat tgtttctgct gcag ctgtttagtt cagagttata ttacagagaa 222" ttattttctg agataatctt aaactagaat gttcaaaact aattgataat tgaagtatca 228" agatacgtag aacacctcag agatttttct tcaggaactt ccacaaactt tgaatccttg 234" tatc:ttatt tggtattcat actactagta taca ggttttttgt tttgttttgt 240" ttgg cttcatagag tatctcaaat tgaaactttt ctgcacaaag aataaaatta 246" agga:tttat aaactcaaat tggcacctac tgaattaaaa tacataaaat catttaaata 252" taat:cagca tatgggaagt aacattgcac ggaa atcactgcca gagacagtct 258" a:tt :ctttt aatttgttac tacttagtca caaaccccac attattccag atta 264" c:ta:taagg agaattggaa atacatatgc ccatgcttaa attttatagc tttaatttgt 270" g:ta:ttctt tattgacggg aagaggtaca tctttttttc cttactgaaa acaaatatgg 276" a:taattgcc tcaaatttgt gatt ggctagtgat tcttgttttc agaagggaga 282" g:ggtataga tagaaaatga caaagatggc aatatacact taatgttgtt attgtatgtt 288" gaag tacttagatt tttaaaattt caaatcctaa atcacttctt gtaggagggt 294" t:tcattaac tgcagtatat acagttcact acatatgggt tgtttgagtt gtgc 300" tgtatttctt tctgtttttt aatacctggt tttgtacata tctaactctg ttctcttttg 306" gttgttcaga aactggattt tttttttctt aagcagtgct taatttgtgt tttttaattt 312" tgattcagaa gtagtcccag ggtg ttcatactgt tacatccaga acatttgtca 318" ggctctctgt cagctttcat gtacatatgg tatagaaacc atggagttag gcacttcctg 324" gatttttttt ttatgagaaa aatactgtat ttaaaatgta aaataaactt ttaaaaagca 330" aata tatatttctt ccagcctttg attacaaatt tgtccttgca catgttaaga 336" tgaattatct cctaaaaata tcattgttct tgggagcagt gtatgttact ttacatagca 342" gcggttcctg tcatgtgttc atgtcagaat atttttggtt ttaaactttc ttattgcctt 348" tggctgttga ttagtacagt acaagtgcga tttcaaaaag atcttgaaag taatatattt 3541 aatcaattaa aatgtttatc tgtaaaaaaa aaaaaaaaaa a Protein seguence: 420420 NCBI Reference Seguence: NP_073572.2 LOCUS NP_073572 ACCESSION NP_073572 1 maavlnaerl evsvdgltls pdpeerpgae gapllppplp ppsppgsgrg pgasgeqpep 6" geaaaggaae earrleqrwg fgleelygla lrffkekdgk afhptyeekl klvalhkqvl 12" mgpynpdtcp evgffdvlgn drrrewaalg nmskedamve kallnrcch lfstyvashk 18" i k q kkr k rrrr r rlqk krrr rlrr rr rieeerlrLe 24-— qqquimaal qqu yaaqqypgny eqqqilirql qeqhyqqqu quqvqlaqq " qaalqquev vvagsslpts vpsn mmsvngqakt htdssekele peaaeealen 36" gpkeslpvia apsmwtrpqi kdfkekiqqd adsvitvgrg vpth eegsylfwef 42" atdnydigfg dspn tavsvhvses sdddeeeeen akkn ankplldeiv 48" pvyrrdchee vyagshqypg rgvyllkfdn syslwrsksv yyrvyytr ASNA1 Official : ASNA1 Official Name: arsA arsenite transporter, nding, homolog 1 (bacterial) Gene ID: 439 Organism: Homo sapiens Other Aliases: ARSA-I, ARSA1 I , ASNA-I, GET3, TRC40, Other Designations: ATPase ASNA1; arsenical pump-driving ATPase; arsenite- stimulated ATPase; golgi to ER traffic 3 homolog; transmembrane domain recognition complex 40 kDa ATPase subunit; transmembrane domain recognition complex, 40kDa Nucleotide seguence: NCBI Reference Seguence: NM_004317.2 LOCUS NM_004317 ION NM_004317 1 gagccagttc caaaatggcg gcaggggtgg ccgggtgggg ggca gaggagttcg 61 aagatgctcc tgatgtggag gagc ctacacttag caacatcatc gagcagcgca 121 gcctgaagtg gatcttcgtc gggggcaagg gtggtgtggg caagaccacc tgcagctgca 181 gcctggcagt ccagctctcc aaggggcgtg agagtgttct gatcatctcc acagacccag 421421 24" cacacaacat ctcagatgct tttgaccaga agttctcaaa ggtgcctacc aaggtcaaag " acaa cctctttgct attg accccagcct gggcgtggcg gagctgcctg 36" acgagttctt ggac aacatgctga gcatgggcaa gaagatgatg caggaggcca 42" tgagcgcatt tcccggcatc gatgaggcca tgagctatgc cgaggtcatg aggctggtga 48" agggcatgaa cttctcggtg gtggtatttg acacggcacc cacgggccac accctgaggc 54" tgctcaactt ccccaccatc gtggagcggg gcctgggccg gcttatgcag atcaagaacc 60" agatcagccc tttcatctca cagatgtgca acatgctggg cctgggggac atgaacgcag 66" accagctggc ctccaagctg acgc tgcccgtcat ccgctcagtc cagt 72" tcaaggaccc tgagcagaca actttcatct gcgtatgcat tgctgagttc ctgt 78" caga gaggctgatc caggagctgg ccaagtgcaa gattgacaca cacaatataa 84" ttgtcaacca gctcgtcttc cccgaccccg agaagccctg caagatgtgt gaggcccgtc 90" acaagatcca ggccaagtat ctggaccaga tggaggacct gtatgaagac ttccacatcg 96" tgaagctgcc gctgttaccc catgaggtgc ggggggcaga caaggtcaac accttctcgg 102" ccctcctcct ggagccctac aagcccccca gtgcccagta gcacagctgc cagccccaac 108" cgctgccatt tcacactcac cctccaccct ccccaccccc tcggggcaga gtttgcacaa 114" agtccccccc ataatacagg gggagccact tgggcaggag gcagggaggg gtccattccc 120" cctggtgggg ctggtgggga gctgtagttg ccccctacct ctcccacctc ttgctcttca 126" ataaaatgat cttaaactgc aaaaaaaaaa aaaaaaaa Protein ce: NCBI Reference ce: NP_004308.2 LOCUS NP_004308 ION NP_004308 1 maagvagwgv eaeefedapd vepleptlsn iieqrslkwi fvggkggvgk ttcscslavq 61 lskgresvli istdpahnis daqukfskv ptkvkgydnl pslg vaelpdeffe 121 ednmlsmgkk safp gideamsyae vmrlvkgmnf svvvfdtapt ghtlrllnfp 181 tiverglgrl mqiknqispf isqmcnmlgl gdmnadqlas kleetlpvir svseqfkdpe 241 qttficvcia eflslyeter liqelakcki dthniivnql vfpdpekpck mcearhkiqa 301 kyldqmedly edfhivklpl lphevrgadk vntfsallle pykppsaq PSM D3 422422 OfibblSwnbd:PSMD3 al Name: proteasome (prosome, macropain) 268 subunit, non-ATPase, 3 m: 5709 Organism: Homo sapiens Other Aliases: P58, RPN3, 83, TSTA2 Other Designations: 268 proteasome Pase regulatory subunit 3; 268 proteasome regulatory subunit RPN3; 268 proteasome regulatory subunit 83; proteasome subunit p58; tissue specific transplantation antigen 2 Nucleotide seguence: NCBI Reference Seguence: NM_002809.3 LOCUS NM_002809 ACCESSION NM_002809 l gttgactcgg ccatcggcct gccgggcctg ccca gaaggcccag cgccgggaag 6" gggtttgcag ctgctccgtc atcgtgcggc ccgacgctat ctcgcgctcg tgtgcaggcc 12" cggctcggct cctggtcccc ggtgcgaggg ttaacgcgag gccccggcct cggtccccgg 18" actaggccgt gaccccgggt gccatgaagc aggagggctc ggcgcggcgc cgcggcgcgg 24" acaaggcgaa gccc ggcggaggag aacaagaacc cccaccgccg ccggcccccc " aggatgtgga gatgaaagag gcga nggtggcgg gtcgacgggg gaggcagacg 36" cggc agcg gctgagcact cccagcgaga gctggacaca gtcaccttgg 42" aggacatcaa ggagcacgtg aaacagctag agaaagcggt ttcaggcaag gagccgagat 48" tcgtgctgcg ggccctgcgg atgctgcctt cacg ccgcctcaac cactatgttc 54" tgtataaggc tgtgcagggc ttcttcactt caaataatgc cactcgagac tttttgctcc 60" tgga agagcccatg gacacagagg ctgatttaca gttccgtccc cgcacgggaa 66" aagctgcgtc gacacccctc ctgcctgaag tggaagccta tctccaactc ctcgtggtca 72" tcttcatgat gaacagcaag cgctacaaag aggcacagaa gatctctgat gatctgatgc 78" agaagatcag tactcagaac cgccgggccc tagaccttgt agccgcaaag tgttactatt 84" atcacgcccg ggtctatgag ttcctggaca agctggatgt ggtgcgcagc catg 90" ctcggctccg gacagctacg cttcggcatg acgcagacgg cacc aacc 96" tcctgctgcg gaattaccta cactacagct tgtacgacca ggctgagaag ctggtgtcca 102" agtctgtgtt gcag gccaacaaca atgagtgggc caggtacctc tactacacag 423423 "08" ggcgaatcaa agccatccag ctggagtact cccg gagaacgatg accaacgccc "l4" ttcgcaaggc ccctcagcac acagctg:cg gcttcaaaca gacggtgcac aagcttctca "20" tcgtggtgga gctgttgctg ggggaga:cc ctgaccggct gcagttccgc cagccctccc "26" tcaagcgctc actcatgccc tatttcc:tc tgactcaagc tgtcaggaca ggaaacctag "32" tcaa ccaggtcctg t:tg gggagaagtt tcaagcagat gggacctaca "38" ccctaattat ccggctgcgg cacaacg:ga cagg tgtacgcatg atcagcctct "44" cctattcccg aatctccttg gctgaca:cg agct gcagttggat agccccgaag "50" atgcagagtt cattgttgcc aaggcca:cc gggatggtgt cattgaggcc agcatcaacc "56" acgagaaggg ctatgtccaa tccaaggaga acat ctattccacc cgagagcccc "62" agctagcctt ccaccagcgc atctccttct gcctagatat catg tctgtcaagg "68" ccatgaggtt tcctcccaaa aaca aggacttgga gtctgcagag gaacggcgtg "74" agcgagaaca gcaggacttg gagtttgcca aggagatggc agaagatgat gatgacagct "80" tcccttgagc tggggggctg gggaggggta atgg ggacaggctc tttccccctt "86" gggggtcccc gggc actgtcccca ttttcccaca cacagctcat atgctgcatt "92" gggg gtgggggtgc tgggagccag ccaccctgac ctcccccagg gctcctcccc "98" agccggtgac ttactgtaca gcaggcagga gcag gcaacctccc nggcagggt 204" cctggccagc agtgtgggag caggagggga aggatagttc tgtgtactcc tttagggagt 210" gggggactag aactgggatg tcttggcttg tatgtttttt gaagcttcga ttatgatttt 2161 taaacaataa aaagttctcc c Protein seguence: NCBI Reference Seguence: NP_002800.2 LOCUS NP_002800 ACCESSION NP_002800 l mkqegsarrr gadkakpppg ggeqeppppp apqdvemkee aatgggstge adgktaaaaa 6" ehsqreldtv tledikehvk qlekavsgke prfvlralrm lpstsrrlnh yvlykavqgf 12" trdf llpfleepmd teadqurpr tgkaastpll peveaqull vvifmmnskr 18" isdd lmqkistan raldlvaakc yyyharvyef ldkldvvrsf lharlrtatl 24" rhdadgqatl lnlllrnylh aekl vsksvfpeqa nnnewaryly ytgrikaiql " eysearrtmt nalrkapqht avgfkqtvhk llivvelllg eipdrqurq pslkrslmpy 36" flltqavrtg nlakfnqvld quequadg tytliirlrh nviktgvrmi slsysrisLa 424424 421 diaqqulds pedaefivak airdgvieas inhekgyvqs kemidiystr epqlathri 481 sfcldihnms vkamrfppks ynkdl sa rr r qqdl fak ma ddd dsfp |DH1 ymboleH1 Official Name: isocitrate dehydrogenase 1 (NADP+), soluble Gene ID: 3417 Organism: Homo sapiens Other Aliases: IDCD, IDH, IDP, IDPC, PICD Other Designations: NADP(+)-specific ICDH; ependent isocitrate dehydrogenase, cytosolic; NADP-dependent isocitrate dehydrogenase, peroxisomal; isocitrate dehydrogenase [NADP] cytoplasmic; uccinate decarboxylase Nucleotide seguence: NCBI Reference Seguence: NM_005896.2 LOCUS NM_005896 ACCESSION NM_005896 l cctgtggtcc cgggtttctg cagagtctac ttcagaagcg gaggcactgg ggtt 6" tgggattgcc aggctgtggt tgtgagtctg agcttgtgag cggctgtggc gccccaactc 12" ttcgccagca tatcatcccg gcaggcgata aactacattc agttgagtct gcaagactgg 18" gaggaactgg ggtgataaga aatctattca ctgtcaaggt ttattgaagt caaaatgtcc 24" aaaaaaatca gtggcggttc tgtggtagag atgcaaggag tgac acgaatcatt " ttga ttaaagagaa actcattttt ccctacgtgg aattggatct ctat 36" gatttaggca tagagaatcg tgatgccacc aacgaccaag tcaccaagga tgctgcagaa 42" gctataaaga agcataatgt tggcgtcaaa tgtgccacta tcactcctga tgagaagagg 48" gttgaggagt tgaa acaaatgtgg aaatcaccaa atggcaccat acgaaatatt 54" ggca cggtcttcag agaagccatt atctgcaaaa atatcccccg gcttgtgagt 60" ggatgggtaa aacctatcat cataggtcgt catgcttatg aata cagagcaact 66" gattttgttg ttcctgggcc tggaaaagta gagataacct acacaccaag tgacggaacc 72" caaaaggtga catacctggt acataacttt gaagaaggtg gtggtgttgc gatg 78" tataatcaag caat tgaagatttt gcacacagtt ccttccaaat ggctctgtct 425425 84" aagggttggc ctttgtatct gagcaccaaa aacactattc tgaagaaata tgatgggcgt 90" tttaaagaca tctttcagga tgac aagcagtaca agtcccagtt tgaagctcaa 96" aagatctggt atgagcatag gctcatcgac gacatggtgg cccaagctat gaaatcagag "02" ggaggcttca tctgggcctg taaaaactat gatggtgacg tgcagtcgga ctctgtggcc "08" caagggtatg gctctctcgg catgatgacc agcgtgctgg tttgtccaga tggcaagaca "14" gtagaagcag aggctgccca cgggactgta acccgtcact accgcatgta ccagaaagga "20" caggagacgt ccaccaatcc cattgcttcc atttttgcct ggaccagagg gttagcccac "26" agagcaaagc ttgataacaa taaagagctt gccttctttg cttt ggaagaagtc "32" tctattgaga caattgaggc tggcttcatg accaaggact cttg cattaaaggt "38" ttacccaatg tgcaacgttc tgactacttg aatacatttg agttcatgga taaacttgga "44" gaaaacttga agatcaaact agctcaggcc aaactttaag ttcatacctg agctaagaag "50" tgtc ttttggtaac taggtctaca ggtttacatt tttctgtgtt aagg "56" gcaa tttt gtaatttgtt ccag agtttatctt ttctataagt "62" ttacagcctt tttcttatat atacagttat cttt gtgaacatgg caagggactt "68" ttttacaatt tttattttat tttctagtac cagcctagga attcggttag tactcatttg "74" tattcactgt cactttttct catgttctaa ttataaatga tcaa gattgctcaa "80" aagggtaaat gatagccaca gtattgctcc ctaaaatatg cataaagtag actg "86" ccttcccctc ctgtccatga gcac agggaagttc tggtgtcata gatatcccgt "92" tttgtgaggt agagctgtgc attaaacttg cacatgactg gaacgaagta tgagtgcaac "98" tcaaatgtgt tgaagatact attt ttgtaaagac cttgctgaat gtttccaata 204" gactaaatac tgtttaggcc gcaggagagt ttggaatccg gaataaatac tacctggagg 210" tttgtcctct ttct ctttctcctc ctggcctggc ctgaatatta tactactcta 216" aatagcatat ttcatccaag tgcaataatg gaat cttttttgga cttctgctgg 222" cctgttttat ttcttttata taaatgtgat ttctcagaaa ttgatattaa acactatctt 2281 atcttctcct gaactgttga ttttaattaa aattaagtgc taattaccaa aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_005887.2 LOCUS NP_005887 ACCESSION NP_005887 426426 1 mskkisggsv vemquemtr iiwelikekl ifpyveldlh sydlgienrd atndqvtkda 61 hnvg vkcatitpde krveefklkq mwkspngtir nilggtvfre aiickniprl 121 vsngkpiii grhaygdqyr atdfvvpgpg kveitytpsd gtqutylvh nfeegggvam 181 ksie dfahssfqma lskgwplyls tkntilkkyd grfkdifqei yquyksqfe 241 aqkiwyehrl iddmvaqamk seggfiwack nydgdvqsds lgm mtsvlvcpdg 301 ktveaeaahg tvtrhyrmyq kgqetstnpi asifawtrgl ahrakldnnk elaffanale 361 evsietieag fmtkdlaaci kglpnvqrsd ylntfefmdk lgenlkikla qakl KPNB1 Official Symbol: KPNB1 Official Name: karyopherin (importin) beta 1 Gene ID: 3837 Organism: Homo sapiens Other Aliases: IMB1, lPOf, IPOB, lmpnb, NTF97 Other Designations: PTA097; importin 1; importin 90; importin beta-1 subunit; in subunit ; in-90; karyopherin subunit ; nuclear factor p97; pore targeting complex 97 kDa subunit Nucleotide seguence: NCBI nce Seguence: NM_002265.4 LOCUS NM_002265 ACCESSION NM_002265 1 ctccctcgct ccctccctgc gcgccgcctc tcactcacag cctcccttcc ttctttctcc 6" ctccgcctcc cgagcaccag cgcgctctga gctgccccca gggtccctcc cccgccgcca 12" gcagcccatt tggagggagg aagtaaggga agaggagagg aaggggagcc acta 18" cccagacaga gccggtgaat gggtttgtgg tgacccccgc CCCCC&CCCC accctccctt 24" cccacccgac ccccaacccc catccccagt tcgagccgcc gcccgaaagg ccgggccgtc " gtcttaggag gagtcgccgc cgccgccacc tccgccatgg agctgatcac cgag 36" aagaccgtgt ctcccgatcg gctggagctg gaagcggcgc agaagttcct tgcg 42" gccgtggaga acctgcccac tttccttgtg gaactgtcca gagtgctggc aaatccagga 48" aacagtcagg ttgccagagt tgcagctggt ctacaaatca agaactcttt gacatctaaa 427427 54" gatccagata tcaaggcaca atatcagcag aggtggcttg ctattgatgc taatgctcga 60" cgagaagtca atgt tttgcagaca ttgggtacag accg gcctagttct 66" gcctcacagt gtgtggctgg tattgcttgt gcagagatcc cagtaaacca gtggccagaa 72" ctcattcctc agctggtggc caatgtcaca aaccccaaca agca catgaaggag 78" ttgg tcgg ttatatttgc caagatatag acccagagca gctacaagat 84" aaatccaatg agattctgac tgccataatc caggggatga ggaaagaaga gcctagtaat 90" aatgtgaagc tagctgctac gaatgcactc ctgaactcat tggagttcac caaagcaaac 96" tttgataaag agtctgaaag gcactttatt atgcaggtgg tctgtgaagc cacacagtgt "02" ccagatacga gggtacgagt ggctgcttta cagaatctgg taat gtccttatat "08" tatcagtaca tggagacata tatgggtcct gctctttttg caatcacaat cgaagcaatg "14" aaaagtgaca ttgatgaggt acaa gggatagaat tctggtccaa tgtctgtgat "20" gaggaaatgg atttggccat tgaagcttca gaggcagcag aacaaggacg gccccctgag "26" agca agttttatgc gaagggagca ctacagtatc caat cctcacacag "32" acactaacta aacaggacga aaatgatgat gacgatgact ggaacccctg caaagcagca "38" ggggtgtgcc tcatgcttct ggccacctgc tgtgaagatg acattgtccc acatgtcctc "44" cccttcatta aagaacacat caagaaccca gattggcggt accgggatgc agcagtgatg "50" gcttttggtt gtatcttgga aggaccagag cccagtcagc tcaaaccact agttatacag "56" gctatgccca ccctaataga attaatgaaa gaccccagtg tagttgttcg agatacagct "62" gcatggactg taggcagaat ttgtgagctg cttcctgaag ctgccatcaa ctac "68" ttggctcccc tgctacagtg tctgattgag ggtctcagtg ctgaacccag agtggcttca "74" aatgtgtgct gggctttctc ggct gaagctgctt ctgc agacgttgct "80" gatgatcagg aagaaccagc tacttactgc ttatcttctt catttgaact catagttcag "86" aagctcctag caga cagacctgat ggacaccaga acaacctgag gagttctgca "92" tctc tgatggaaat tgtgaaaaac agtgccaagg attgttatcc tgctgtccag "98" aaaacgactt tggtcatcat ggaacgactg caacaggttc ttcagatgga gtcacatatc 204" cagagcacat ccgatagaat ccagttcaat gaccttcagt ctttactctg tgcaactctt 210" gttc ttcggaaagt gcaacatcaa gatgctttgc agatctctga tgtggttatg 216" gcctccctgt taaggatgtt ccaaagcaca gctgggtctg ggggagtaca agaggatgcc 222" ctgatggcag ttagcacact ggtggaagtg ggtg aattcctcaa gtacatggag 228" gcctttaaac ccttcctggg atta aaaaattatg acca ggtttgtttg 428428 234" gcagctgtgg gcttagtggg agacttgtgc cg:gccctgc aatccaacat catacctttc 240" tgtgacgagg tgatgcagct gcttctggaa aa:ttgggga atgagaacgt ccacaggtct 246" gtgaagccgc agattctgtc agtgtttggt ga:attgccc ttgctattgg aggagagttt 252" aaaaaatact tagaggttgt attgaatact cagg cctcccaagc ggac 258" aagtcagact atgacatggt tctg aa:gagctaa gggaaagctg cttggaagcc 264" tatactggaa tcgtccaggg attaaagggg ga:caggaga acgtacaccc ggatgtgatg 270" ctggtacaac ccagagtaga atttattctg tc:ttcattg ttgc tggagatgag 276" gatcacacag atggagtagt tgct gc:ggactaa taggggactt atgtacagca 282" tttgggaagg tgaa attagtagaa ccaa atga attgttaact 288" gaagggcgga gatcgaagac taacaaagca aaaacccttg ctacatgggc aacaaaagaa 294" ctgaggaaac tgaagaacca agcttgatct gttaccattg ggatgataac ctgaggaccc 300" ccactggaaa tctcccatct tttgaaaaac ctggaagtga ggagtgtgca ctga 306" ggga atgagaggat gagtgagtga ggcttgaaaa cacaccacat tgaaaatcct 312" gccacagcag cagc cagc agcgctgtta gtgagctaag taagcactga 318" cttcgtagaa aaccataaca tcggccatct tggaaaagag aaaaacaatg gagttactta 324" tttaaaaaaa aagaaagaaa gttatctctt cccaggagag gctagaagta gcttttctgt 330" cttttggcca gtgccgagtg gaatgcctgg tttgggggag gaggagggac tgggttcagc 336" tgtggtgctt tgttgtaaaa ggcagcctgg ctac tgaggagaaa gatggagcct 342" gggtctcaag cccaccttcg ctgtaccttt gccacatggt actgtatgct tgccagctag 348" aaggagggtc agggattttt tacagtctga gaatgagtgt gtgtgagtga ggcggtatcc 354" acattctcaa cttcaagtca ttgcagtttc tttttcccag aaaacaaggg gttagatgtt 360" cata aaactaaccg aagttctgtc tactgatgca gcacaagaga tgtaaaaaaa 366" aaaaaaaaaa aaaaaaaaaa aacacacaca cagaggaaag acgctcttta ggttttgttt 372" tgtttttttt ttttggtttt gttttttgtt ttttttactc tagggaaaac actgacgaat 378" ggtcagagct cctatcctga tcttttcatc aaggcgcctt taat atggttcaac 384" tgtgaatgta gaagtggggg ggagggggga gaaaaagaaa actctggcgt tagaggatat 390" agaaaaatat aagtacaatt gttacaaata acgcagactt caaaaacaaa aaaatcacaa 396" cccaaacaaa ccaaaattta aatgatcaga attggcagca aaac gccctctcct 402" gacttgtatt gtggcagtct gaacgccccc agaaaattgt gccaaagagt ttagaaaaat 408" aaatatacaa taaa cacatacaca agca aacttcaggt aactattttg 429429 4141 gattgcaaac aggataaatt aaatgttcaa acaatctgat aaaataacca tttggaaact 4201 gaaaa Protein seguence: NCBI Reference Seguence: NP_002256.2 LOCUS NP_002256 ACCESSION NP_002256 1 melitilekt elea aqkfleraav enlptflvel srvlanpgns qvarvaag;q 6" iknsltskdp dikaquqrw laidanarre vknyvlqtlg tetyrpssas qcvagiacae 12" ipvnqueli pqlvanvtnp nstehmkest leaigyich idpeqlqdks neiltaiiqg 18" mrkeepsnnv klaatnalln sleftkanfd fimq qcpd trvrvaalqn 24" lvkimslyyq gpal faitieamks didevalqgi efwsnvcdee mdlaieasea " aqurppeht skfyakgalq ylvpiltqtl tkqdendddd dwnpckaagv clmllatcce 36" ddivphvlpf ikehiknpdw vmaf peps qlkplviqam ptlielmkdp 42" svvvrdtaaw tvgricellp eaaindvyla pllqcliegl saeprvasnv cwafsslaea 48" ayeaadvadd qeepatycls ssfelivqkl lettdrpdgh qnnlrssaye slmeivknsa 54" kdcypavqkt tlvimerqu vlqmeshiqs tsdriqfndl qsllcathn vlrkvqhqda 60" lqisdvvmas llrqustag sgquedalm evlg geflkymeaf kpflgiglkn 66" yaeyqvclaa vglvgdlcra lqsniipfcd evmqlllenl gnenvhrsvk pqilsvfgdi 72" alaiggefkk ylevvlnth qasqaquks dydmvdylne lrescleayt givqglkgdq 78" envhpdvmlv qprvefilsf idhiagdedh tdgvvacaag ligdlctafg kdvlklvear 84" pmihellteg kakt latwatkelr klknqa DDX17 Official Sym bol: DDX17 Official Name: DEAD (Asp-Glu-Ala-Asp) box helicase 17 m: 10521 sm: Homo sapiens Other Aliases: RPS-434P1.1, P72, RH70 430430 Other Designations: DEAD (Asp-Glu-Ala-Asp) box polypeptide 17; DEAD box protein p72; DEAD/H lu-Ala-Asp/His) box polypeptide 17 (72kD); RNA- dependent helicase p72; probable ATP-dependent RNA helicase DDX17 Nucleotide ce (variant 1): NCBI Reference Seguence: NM_006386.4 LOCUS NM_006386 ACCESSION NM_006386 l gttaagttgg agccgactca gcggcggccg tgtg ctgg agga 6" gacgcctaaa ccgcggcact gcccggtttg gcca aacctgccca ccggctttgt 12" agccccgatt ctctgtgttt tgctcccgtc gaga gaggcggcga cggtggcgtc 18" tgcgacggga gacagcgcgt cggagcgaga gagcgctgcg cctgccgccg ccccaacagc 24" ggaggcgccg ccgccatcgg tcgtcaccag accggagccg caggccctcc cggc " catccgtgcc ccgctcccag atcc ttttgggacc atgcgcggag gaggctttgg 36" ggaccgggac cgggatcgtg accgtggagg atttggagca agaggtggtg gtggccttcc 42" cccgaagaaa tttggtaatc ctggggagcg tttgcgtaaa aaaaagtggg atttgagtga 48" gctccccaag tttgagaaaa atgt ggaacatccg gaagtagcaa ggctgacacc 54" atatgaggtt gatgagctac gccgaaagaa ggagattaca gtgagggggg gagatgtttg 60" tcctaaaccc gtgtttgcct tccatcatgc taacttccca caatatgtaa tggatgtgtt 66" gatggatcag cactttacag aaccaactcc gtgc cagggatttc cgttggctct 72" tagtggccgg gtgg gcattgctca gactggctct gggaagacgt tggcgtatct 78" cctgcctgca attgttcata ttaaccacca gccatacttg gaaaggggag atggcccaat 84" ctgtctagtt ctggctccta ccagagagct gcaa gtacagcagg tggccgatga 90" ctatggcaaa tgttctagat tgaagagtac ttgtatttat ggaggtgctc ctaaaggtcc 96" ccagattcga gacttggaaa gaggtgttga gatctgcata gccactcctg gacgtctgat "02" agatttcctg gagtcaggaa agacaaatct tcgccgatgt acttaccttg tattggacga "08" agctgacaga atgcttgata tggggtttga accccagatc cgtaaaattg ttgaccaaat "l4" caggcctgat aggcagacac tgatgtggag tgcaacctgg ccaaaagaag taagacagct "20" tgcagaggat ttccttcgtg attacaccca gatcaacgta ggcaatctgg agttgagtgc "26" caaccacaac atcctccaga tagtggatgt ctgcatggaa agtgaaaaag accacaagtt "32" gatccaacta atggaagaaa taatggctga aaaggaaaac aaaacaataa tgga "38" gacaaagaga cgctgtgatg atctgactcg gcgc agagatggtt ggccagctat 431431 "44" ccat ggagacaaga caga aagagattgg gtacttaatg agttccgttc "50" tggaaaggca ctta ttgctacaga tgtagcctcc cgtgggctag atgtggaaga "56" tgtcaagttt gtgatcaact atgactatcc aaacagctca gaggattatg tgcaccgtat "62" tggccgaaca gcccgtagca ccaacaaggg taccgcctat accttcttca ccccagggaa "68" cctaaaacag gccagagagc ttatcaaagt gctggaagag gccaatcagg ctatcaatcc "74" aaaactgatg cagcttgtgg accacagagg cgga ggcgggggtg gtcgttctcg "80" ttaccggacc acttcttcag ccaacaatcc caatctgatg tatcaggatg agtgtgaccg "86" aaggcttcga ggagtcaagg atggtggccg gagagactct gcaagctatc gggatcgtag "92" tgaaaccgat agagctggtt atgctaatgg ctat ggaagtccaa attctgcctt "98" tggagcacaa gcaggccaat acacctatgg tcaaggcacc tatggggcag atgg 204" tagc tatacagctc aagaatatgg tgctggcact tatggagcta gtagcaccac 210" ctcaactggg agaagttcac agagctctag ccagcagttt agtgggatag gccggtctgg 216" gcagcagcca cagccactga aaca acag cctccgggag ctaccaatat 222" gataggttac atggggcaga ctgcctacca ataccctcct cctcctcccc ctcctcctcc 228" ttcacgtaaa tgaaaccact caagtggtag tgactccagc agacttaatt taag 234" gaacactgtc tttccttttt ttttcctctt cgccttttct ttttttttcc ttttttcttt 240" tttttttttt aatttttccc cccaaccatc gtgatttgtc ttttcatgca ttag 246" aattcactgc caggtttctt ctgcccacca aaatgatcca gtctggaata acattttgta 252" aaaaaaaaaa aaatatatat atatatatat agctgactgg aagagattaa tttcttcccc 258" caacttcttg catgttgaag atatttgagc tatttttcat gagt aaggtattag 264" gcccttttgt gggagcccca tgttttgttt ttctgagttg gtggggaggg agggaggggg 270" agggctgaat tgttttgcag aggaagatgg catctgtgct ttaaatttct cattactggg 276" ttagaaaaca aagagggatt gccctgcaca tttg tgcttttaaa tgtttcttaa 282" gttggaacag gtttcctcgg gcctgttttg actgattgct ggagtgcatt tgatagttaa 288" aaattactaa ttggttttat ttcccttcac actctgcctc cccacttctc ttac 294" ataa ccattttagt gtcaggctag aaattgaatt gctgagtttt gtgtatcctt 300" aaaa accacaagtg tttattgtag tggttaaact gtagcatctc agcatctggg 306" tggaagctgc ctatatttct tcccagttta actggggacc atctgtgaaa ttaattttcc 312" atccagacag tgag caaatgaaca taaatgctcg ctggaaattt actaaccagt 318" ttttatattg acctgcagtg taaaaagcac atttaattat tata ttcaaaatgg 432432 324" gcaaatttta ttttcaaatg cagtgtagag taaa agcaactctt tgccacctac 330" tctgcccttt tggcaaagtt accttgaaca aagaatctta tatt aagaactctt 336" cttc ataccctgtt ctctgcagtg ctttctaaca gcttctgggt gcagattttc 342" atcc ttttgcactc agcttattac aggtaggtag tgcttaagaa aagtcatgga 348" ggactaaagc ctaagtcctt ttcacttttc ctccatctga aggtaggtga gttcatcctc 354" gtaa tgctgtttta ccaagacttt gatg gacccagaaa gaattttctg 360" ctattgtgtt cactacaaca ggatagggac atcagacagc cccagaaacc ccttccagat 366" ctgatatggg aatt tttatgctgt taattggtat acaa tgcagttgaa 372" gggggaaggc tccactgcat tctttggcta aggcctgaat gcttgctcat ctgtaagatc 378" tatactcgag gttttgtttt ccttttaaaa ttctttaggg agagagggat ggtttctgag 384" tgaa agtatgattc aatgtgcaac atacaggtag gtcttcagca taagctgaaa 390" tatatgcatg taaaaacttt gacatctttt tttttaattt tccactttct tcttaacttt 396" acttctcttt ttgtcccccc cccatcttac agaagttgag gccaagggag aatggtaggc 102" acagaagaaa catggcaaac tgctctgtgc tttcaaacca aagtgttccc cccaacccca 108" aatttgtcta agcactggcc agtctgttgt gggcattgtt ttctacaacc aaattctggg 114" tctt ctttctttaa acatagaggt accaccacaa gggatgccct actctctcgc 120" agctcttgaa agcatctgtt tgagggaaag gtctctgggc aagcaagtgg tgcttgcttc cctttttcca cctgggacat cata aaataacagt aaattccaaa 132" aaac tattatggcc tgagcacagc tgaaatctag cagagtttaa ctcttctgcc 138" tccatgtctg tcacttataa ttcaggttct gctgttggct tcagaacatg agcagaagaa Z44" tcgttttatg ctagttattg cattcatggt tgaaactcaa cttagggaaa gggttccaat 150" gtattaagca atgggctgct tctccccaat cctccctaac aattcgttgt gtggacttct 156" catctaaaag gttagtggct tggg atcagtgctc tctattgatg ttcttgctgg Z62" tctccagaca cattcctgtt gcattaagac ttgaaagact tgtagatgtg tgatgttcag Z68" gcacaggatg ctgaaagcta tatt cttagtttgt aaattgtcct tttgatacca Z74" tcatcttgtt ttctttttgt aggtataaat aaaaacactg ttgacaataa aaaa 4801 aaaaa Pnneh1seguence(vafiant1y NCBI Reference Seguence: NP_006377.2 LOCUS NP_006377 433433 ACCESSION NP_006377 l mptgfvapil cvllpsptre aatvasatgd saseresaap aaaptaeapp psvvtrpepq 6" alpspairap lpdlypfgtm rgggfgdrdr fgar gggglppkkf gnpgerlrkk l2" kwdlselpkf eknfyvehpe varltpyevd elrrkkeitv rggdvcpkpv fafhhanqu 18" yvmdvlmdqh fteptpich gfplalsgrd mvgiaqtgsg lpai vhinhqpyLe 24" rgdgpiclvl aptrelaqu quaddygkc srlkstciyg gapkgpqird lergveicia " tpgrlidfle sgktnlrrct adrm ldmgfepqir kivdqirpdr qtlmwsatwp 36" kevrqlaedf lrdytqinvg nlelsanhni cmes ekdhkliqlm kenk 42" tiifvetkrr cddltrrmrr dgwpamcihg dksqperdwv lnefrsgkap iliatdvasr 48" gldvedkav inydypnsse dyvhrigrta rstnkgtayt fftpgnlkqa relikvleea 54" nqainpklmq lvdhrggggg gggrsryrtt nlmy qdecdrrlrg vkdggrrdsa 60" etdr agyangsgyg spnsafgaqa gqytygqgty gaaaygtssy taqeygagty 66" gassttstgr ssqsssqqfs gigrsgqqpq plmsqqfaqp pgatnmigym gqtayqyppp 72" ppppppsrk M6PRBP1 Official Symbol: PLIN3 Official Name: perilipin 3 m: 10226 Organism: Homo sapiens Other Aliases: M6PRBP1, PP17, TIP47 Other Designations: 47 kDa nding protein; cargo selection protein TIP47; mannosephosphate receptor-binding protein 1 ; pin-3; placental protein 17; tail-interacting protein, 47 kD Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_005817.4 LOCUS NM_005817 ACCESSION NM_005817 l tggcgcgggc aatccctcaa cctgattggt cccctcgccc ccag tgcgccccca 434434 6" acctaccacg cagtaaaagc cacccccgcc tcggcccgga cggtttccaa gctggttttg 12" aagtcgcggc agctgttcct gggacgtccg gttgaccgcg cgtctgctgc agagaccatg 18" tctgccgacg gggcagaggc tgatggcagc acccaggtga aaga acag 24" cagcccagtg tggtggaccg tgtggccagc atgcctctga tcagctccac ctgcgacatg " gtgtccgcag cctatgcctc caccaaggag ccgc acatcaagac tgtctgcgac 36" gcagcagaga agggagtgag gaccctcacg gcggctgctg tcagcggggc tcagccgatc 42" ctctccaagc tggagcccca gattgcatca gccagcgaat acgcccacag ggggctggac 48" aagttggagg agaacctccc gcag acgg agaaggtcct ggcggacacc 54" aaggagcttg tgtcgtctaa ggtgtcgggg gaga tggtgtctag cgccaaggac 60" acggtggcca cccaattgtc ggaggcggtg gacgcgaccc gcggtgctgt gcagagcggc 66" gtggacaaga caaagtccgt agtgaccggc ggcgtccaat cggtcatggg ctcccgcttg 72" ggccagatgg tgttgagtgg ggtcgacacg gtgctgggga agtcggagga ggac 78" aaccacctgc cccttacgga tgccgaactg gcccgcatcg ccacatccct ggatggcttt 84" gacgtcgcgt ccgtgcagca gcagcggcag gaacagagct acttcgtacg tctgggctcc 90" ctgtcggaga ggctgcggca gcacgcctat gagcactcgc tgggcaagct tcgagccacc 96" aggg cacaggaggc tctgctgcag ctgtcgcagg tcctaagcct gatggaaact "O2" gtcaagcaag gcgttgatca gaagctggtg gaaggccagg agaagctgca ccagatgtgg "08" ctcagctgga accagaagca gctccagggc aagg agccgcccaa gccagagcag "l4" gtcgagtccc gggcgctcac catgttccgg gacattgccc agcaactgca ctgt "20" acctccctgg gcat tcagggcctc cccaccaatg tgaaggacca ggtgcagcag "26" gcccgccgcc aggtggagga cctccaggcc tcca gcatccactc cttccaggac "32" agca gcattctggc ccagagccgt gagcgtgtcg ccagcgcccg cctg "38" gaccacatgg tggaatatgt ggcccagaac acacctgtca tcgt gggacccttt "44" gcccctggaa tcactgagaa agccccggag gagaagaagt agggggagag gagaggactc "50" agcgggcccc gtctctataa tgcagctgtg ctctggagtc ccgg ggctcatttc "56" aaacttattt tctagccact cctcccagct cttctgtgct gtccacttgg aagg "62" ctctcaaaac gggcatcacc cagttgaccc atctctcagc ctctctgagc ttggaagaag "68" ctga gcctcaccct atcagtcagt agagagagat gtccagaaaa aatatctttc "74" aggaaagttc tcccctgcag aatttttttt ccttgttaaa tatcaggaat ataggccggg "80" tgcggtggct cacacctgta atcccagcac tttgggaggc tgaggcgggc ggaacacctg 435435 186" aggtcaggtg ttcgagacca gccaggccaa catggtgaaa ccccgtctct actaaaaata 192" caaaaaaaaa tgagccgggc atggtagcag gtgtctgtta ttag gaggctgagg 198" caagagaatc tcttgaacct gagaggcgga ggttgcagtg agccaagatc gcgccattgc 204" actccagcct gggggacaag agtgagactt agtctcaaaa aaaa agaaaaaaaa 210" atcagggata tagttcatat cccacttctt tgtttacacc gatgtccctg aatatcagcc 216" tgtagctaat ggacttggga tttctggtct aagtgggcct cctggggatg gggtggtaca 222" ctgagcttct gagcctcatt gtagagtaga aaggtactgg ggcctgtgtg gtaagccttg 228" ttgaaatgct ctggtattca gtattgcctt aataaacttc acccacaact gcatacaggc 2341 aaaaa Protein seguence (variant 1): NCBI nce ce: NP_005808.3 LOCUS NP_005808 ACCESSION NP_005808 l msadgaeadg stqvtveepv qqpsvvdrva smplisstcd mvsaayastk esyphiktvc 6" daaekgvrtl taaavsgaqp ilsklepqia saseyahrgl dkleenlpil qqptekvlad 12" tkelvsskvs gaqemvssak dtvatqlsea vdatrgavqs svvt gqusvmgsr 18" lgqmvlsgvd tvlgkseewa dnhlpltdae lariatsldg fdvaquqqr qeqsyfvr;g 24" slserquha yehslgklra tquaqeall qlsqvlslme tvkqgvqul vegqeklhqm " wlswnqkqlq gpekeppkpe ltmf rdiaqqlqat ctslgssiqg lptnvkdqvq 36" qarrqvedlq hsfq dlsssilaqs rervasarea ldhmveyvaq ntpvtwlvgp 42" fapgitekap eekk E|F4A3 Official Symbol: E| F4A3 al Name: eukaryotic translation initiation factor 4A3 Gene ID: 9775 Organism: Homo sapiens Other Aliases: DDX48, NMP265, NUK34, e|F4A||| Other Designations: ATP-dependent RNA helicase DDX48; pendent RNA helicase e|F4A-3; DEAD (Asp-Glu-Ala-Asp) box polypeptide 48; DEAD box 436436 protein 48; NMP 265; -lll; elF4A-lll; eukaryotic initiation factor 4A-lll; eukaryotic tion factor 4A-like NUK-34; eukaryotic translation initiation factor 4A; hNMP 265; nuclear matrix protein 265 Nucleotide seguence: NCBI Reference Seguence: NM_014740.3 LOCUS NM_014740 ACCESSION NM_014740 l gcac gtctctcgct ttcgcatact taaggcgtct gttctcggca gcggcacagc 6" gaggtcggca gcggcacagc gaggtcggca gcggcacagc gaggtcggca cagc l2" gaggtcggca gcggcagcga ggtcggcagc ggcacagcga ggtcggcagc ggcagcgagg 18" tcggcagcgg cgcgcgctgt gctcttccgc ggactctgaa tcatggcgac cacggccacg 24" atggcgacct cgggctcggc gcgaaagcgg ctgctcaaag acat gactaaagtg " gaattcgaga ccagcgagga ggtggatgtg acccccacgt tcgacaccat gggcctgcgg 36" gaggacctgc tgcggggcat ctacgcttac ggttttgaaa aaccatcagc aatccagcaa 42" cgagcaatca agcagatcat gaga gatgtcatcg cacagtctca gtccggcaca 48" ggaaaaacag ccaccttcag tatctcagtc ctccagtgtt tggatattca ggttcgtgaa 54" gctt tgatcttggc aaga gagttggctg tcca gaaggggctg 60" cttgctctcg gtgactacat gaatgtccag gcct gcattggagg caccaatgtt 66" ggcgaggaca tcaggaagct ggattacgga cagcatgttg gcac tccagggcgt 72" gtttttgata tgattcgtcg cagaagccta aggacacgtg ctatcaaaat gttggttttg 78" gatgaagctg atgaaatgtt gaataaaggt ttcaaagagc agatttacga cagg 84" tacctgcctc cagccacaca ggtggttctc atcagtgcca cgctgccaca cgagattctg 90" gagatgacca acaagttcat gaccgaccca atccgcatct tggtgaaacg tgatgaattg 96" actctggaag gcatcaagca atttttcgtg gcagtggaga gggaagagtg gaaatttgac "O2" actctgtgtg acctctacga cacactgacc atcactcagg cggtcatctt ctgcaacacc "08" aaaagaaagg tggactggct gacggagaaa atgagggaag ccaacttcac tgtatcctca "l4" atgcatggag acatgcccca gaaagagcgg gagtccatca tgaaggagtt ccggtcgggc "20" gccagccgag tgcttatttc tacagatgtc tgggccaggg ggttggatgt ccctcaggtg "26" tccctcatca ttaactatga taat aacagagaat tgtacataca cagaattggg "32" agatcaggtc gatacggccg gaagggtgtg gccattaact ttgtaaagaa tgacgacatc "38" ctca gagatatcga gcagtactat caga ttgatgagat gccgatgaac 437437 1441 gttgctgatc ttatctgaag cagcagatca gtgggatgag ggagactgtt cacctgctgt 1501 gtactcctgt ttggaagtat ttagatccag attctactta atggggttta tatggacttt 1561 cttctcataa atggcctgcc gtctcccttc ctttgaagag gatatgggga ttctgctctc 1621 tatt tacatgtaaa taatacattg ttctaagtct ttttcattaa aaatttaaaa 1681 cttttcccat aaactctata cttctaaggt gccaccacct tctctagtaa ctta Protein seguence: NCBI Reference Seguence: NP_055555.1 LOCUS NP_055555 ACCESSION NP_055555 1 mattatmats gsarkrllke edmtkvefet seevdvtptf dtmglredll rgiyaygfek 61 psaiqqraik dvia qsqsgtgkta tfsisvlqcl diquetqal ilaptrelav 121 alg chac iggtnvgedi qhvv agtpgrvfdm irrrslrtra 181 ikmlvldead emlnkgfkeq iydvyrylpp atqvvlisat emtn kfmtdpiril 241 vkrdeltleg ikqffvaver eewkfdtlcd lydtltitqa vifcntkrkv dwltekmrea 301 mhgd mqueresim kefrsgasrv listdearg lii nydlpnnrel 361 yihrigrsgr ygrkgvainf vknddirilr dieqyystqi dempmnvadl i IQGAP1 Official Symbol: IQGAP1 al Name: IQ motif containing GTPase activating n 1 m: 8826 Organism: Homo sapiens Other Aliases: HUMORFA01, SAR1, p195 Other Designations: RasGAP-like with IQ motifs; ras GTPase-activating-like protein IQGAP1 Nucleotide seguence: NCBI Reference Seguence: NM_003870.3 LOCUS NM_003870 ACCESSION NM_003870 1 ggaccccggc aagcccgcgc acttggcagg agctgtagct accgccgtcc gcgcctccaa 438438 6" ggtttcacgg cttcctcagc agagactcgg gctcgtccgc catgtccgcc gcagacgagg 12" ttgacgggct gggcgtggcc cggccgcact atggctctgt cctggataat gaaagactta 18" ctgcagagga tgaa aggagacgtc agaacgtggc ttatgagtac ctttgtcatt 24" tggaagaagc gtgg atggaagcat gcctagggga agatctgcct cccaccacag " aactggagga ggggcttagg aatggggtct ccaa actggggaac ttcttctctc 36" ccaaagtagt gtccctgaaa aaaatctatg atcgagaaca gaccagatac aaggcgactg 42" gcctccactt tagacacact gataatgtga ttcagtggtt gaatgccatg gatgagattg 48" gattgcctaa gattttttac ccagaaacta cagatatcta tgatcgaaag aacatgccaa 54" tcta ctgtatccat agtt tgtt caagctaggc cctc 60" agattcaaga cctatatgga aaggttgact tcacagaaga agaaatcaac aacatgaaga 66" ctgagttgga gaagtatggc atccagatgc ctgcctttag caagattggg ggcatcttgg 72" ctaatgaact gtcagtggat gaagccgcat tacatgctgc tgttattgct attaatgaag 78" ctattgaccg tagaattcca gccgacacat ttgcagcttt tccg atgc 84" ttgtaaatct tgaagagccc ttggcatcca cttaccagga tatactttac caggctaagc 90" aggacaaaat gacaaatgct aaaaacagga cagaaaactc agagagagaa agagatgttt 96" atgaggagct gctcacgcaa gctgaaattc aaggcaatat aaacaaagtc aatacatttt "02" tagc aaatatcgac ctggctttag aacaaggaga tgcactggcc ttgttcaggg "08" ctctgcagtc accagccctg gggcttcgag gactgcagca acagaatagc gactggtact "14" tgaagcagct cctgagtgat aaacagcaga agagacagag tggtcagact gaccccctgc "20" agaaggagga gctgcagtct ggagtggatg ctgcaaacag tgctgcccag caatatcaga "26" gaagattggc agcagtagca ctgattaatg ctgcaatcca tgtt gctgagaaga "32" ctgttttgga actgatgaat cccgaagccc agctgcccca ggtgtatcca tttgccgccg "38" atctctatca gaaggagctg gctaccctgc agcgacaaag tcctgaacat aatctcaccc "44" acccagagct cgca gtggagatgt tgtcatcggt ggccctgatc aacagggcat "50" tggaatcagg agatgtgaat acagtgtgga agcaattgag cagttcagtt actggtctta "56" ccaatattga ggaagaaaac tgtcagaggt atctcgatga gttgatgaaa ctgaaggctc "62" aggcacatgc agagaataat gaattcatta atga tatccaagct tgcgtggacc "68" atgtgaacct ggtggtgcaa gaggaacatg agaggatttt agccattggt ttaattaatg "74" tgga tgaaggtgat aaga ctctgcaggc cctacagatt cctgcagcta "80" aacttgaggg agtccttgca gaagtggccc agcattacca agacacgctg attagagcga 439439 186" agagagagaa agcccaggaa atccaggatg agtcagctgt gttatggttg gatgaaattc 192" gaat ctggcagtcc gaca cccaagaagc acagaagttt ggaa 198" tctttgccat taatgaggca gtagaaagtg gtgatgttgg caaaacactg agtgcccttc 204" gctcccctga tgttggcttg tatggagtca tccctgagtg tggtgaaact taccacagtg 210" atcttgctga agccaagaag aaaaaactgg cagtaggaga taataacagc aagtgggtga 216" agcactgggt aaaaggtgga tattattatt accacaatct ggagacccag gaaggaggat 222" gggatgaacc tccaaatttt gtgcaaaatt ctatgcagct ttctcgggag gagatccaga 228" gttctatctc tggggtgact gccgcatata accgagaaca gctgtggctg gccaatgaag 234" tcac caggctgcag gctcgctgcc gtggatactt agttcgacag gaattccgat 240" ccaggatgaa tttcctgaag aaacaaatcc ctgccatcac ctgcattcag tcacagtgga 246" gaggatacaa gcagaagaag gcatatcaag atcggttagc ttacctgcgc aaag 252" atgaagttgt aaagattcag tccctggcaa acca agctcgaaag cgctatcgag 258" atcgcctgca gtacttccgg gaccatataa atgacattat ccag gcttttattc 264" acaa agctcgggat gactacaaga ctctcatcaa ggat cctcctatgg 270" ttgtggtccg tgtc cacctgctgg accaaagtga ccaggatttt caggaggagc 276" ttgaccttat gaagatgcgg gaagaggtta tcaccctcat taac cagcagctgg 282" agaatgacct caatctcatg gatatcaaaa ttggactgct agtgaaaaat aagattacgt 288" tgcaggatgt ggtttcccac agtaaaaaac ttaccaaaaa aaataaggaa cagttgtctg 294" atatgatgat gataaataaa cagaagggag gtctcaaggc tttgagcaag gagaagagag 300" agaagttgga agcttaccag cacctgtttt atttattgca aaccaatccc ctgg 306" ccaagctcat gatg ccccagaaca agtccaccaa gttcatggac tctgtaatct 312" tcacactcta caactacgcg tccaaccagc gagaggagta cctg cggctcttta 318" agacagcact ccaagaggaa atcaagtcga aggtagatca gattcaagag attgtgacag 324" gaaatcctac ggttattaaa atggttgtaa gtttcaaccg tggtgcccgt ggccagaatg 330" ccctgagaca gatcttggcc ccagtcgtga aggaaattat caaa tctctcaaca 336" tcaaaactga ccctgtggat aaat cttgggttaa tcagatggag tctcagacag 342" gagaggcaag caaactgccc tatgatgtga cccctgagca ggcgctagct catgaagaag 348" tgaagacacg gctagacagc agga acatgcgggc tgtgacagac aagtttctct 354" cagccattgt cagctctgtg gacaaaatcc cttatgggat gcgcttcatt gccaaagtgc 360" tgaaggactc gttgcatgag aagttccctg atgctggtga ggatgagctg ctgaagatta 440440 366" ttggtaactt gctttattat cgatacatga atccagccat tgttgctcct gatgcctttg 372" acatcattga cctgtcagca ggaggccagc ttaccacaga ccaacgccga aatctgggct 378" ccattgcaaa tcag catgctgctt ccaataagat gtttctggga gataatgccc 384" acttaagcat cattaatgaa tatctttccc agtcctacca gaaattcaga cggtttttcc 390" aaactgcttg tgatgtccca gagcttcagg ttaa tgtggatgag tactctgatt 396" tagtaaccct caccaaacca gtaatctaca ttgg catc aacacccaca 102" ctctcctgtt ggatcaccag gatgccattg ctccggagca caatgatcca atccacgaac 108" tgctggacga cctcggcgag gtgcccacca tcgagtccct ggaa agctctggca 114" atttaaatga cccaaataag ctgg cgga agtgtctctc accctgacca 120" acaagttcga cgtgcctgga gatgagaatg cagaaatgga tgctcgaacc atcttactga 126" atacaaaacg tgtg atcc ggttccagcc aggagagacc ttgactgaaa 132" tcctagaaac accagccacc agtgaacagg aagcagaaca agcc atgcagagac 138" gtgctatccg tgatgccaaa acacctgaca agatgaaaaa gtcaaaatct gtaaaggaag Z44" acagcaacct cactcttcaa gagaagaaag agaagatcca gacaggttta aagaagctaa 150" cagagcttgg aaccgtggac ccaaagaaca aataccagga actgatcaac gacattgcca 156" gggatattcg gaatcagcgg cgac agaggagaaa ggccgaacta gtgaaactgc Z62" aacagacata cgctgctctg aactctaagg ccacctttta tggggagcag gtggattact Z68" ataaaagcta tatcaaaacc tgcttggata acttagccag caagggcaaa gtctccaaaa Z74" agcctaggga aatgaaagga aagaaaagca aaaagatttc atat acagcagcaa 180" gactacatga aaaaggagtt cttctggaaa ttgaggacct gcaagtgaat cagtttaaaa 186" atgttatatt tgaaatcagt ccaacagaag aagttggaga cttcgaagtg aaagccaaat Z92" tcatgggagt tcaaatggag acttttatgt tacattatca ggacctgctg cagctacagt Z98" atgaaggagt tgcagtcatg aaattatttg ctaa agtaaatgtc aacctcctga 504" tcttccttct aaag ttctacggga agtaattgat cgtttgctgc cagcccagaa 510" ggatgaagga aagaagcacc tcacagctcc tttctaggtc cttctttcct cattggaagc 516" aaagacctag acag cacctcaatc tgatacactc ccgatgccac atttttaact 522" cctctcgctc tgatgggaca tttgttaccc ttttttcata gtgaaattgt gtttcaggct 528" tagtctgacc gttt cttcattttc ttccattact taggaaagag tggaaactcc 534" actaaaattt ctctgtgttg ttacagtctt tgca gtactatatt gtaagctttg 540" gtgtttgttt aattagcaat agggatggta ggattcaaat gtgtgtcatt tagaagtgga 441441 546" agctattagc accaatgaca taaatacata caagacacac aactaaaatg tcatgttatt 552" aacagttatt tcat ttaaaaataa agttccttta tatttctgtc ccatcaggaa 558" aactgaagga tatggggaat cattggttat cttccattgt gtttttcttt atggacagga 564" ggaa gtgacagtca tgttcaaagg aagcatttct aagg agataatgtt 570" tttaaatttc attatcaaac ttgggcaatt ctgtttgtgt aactccccga ctagtggatg 576" ggagagtccc attgctaaaa ttcagctact cagataaatt cagaatgggt caaggcacct 582" gcctgttttt gttggtgcac agagattgac ttgattcaga gagacaattc actccatccc 588" tatggcagag gaatgggtta gccctaatgt tcat tgtttttaaa actgttttat 594" atcttaagag tgccttatta aagtatagat ctta aaatgtgggt gataggaatt 600" ttaaagattt atataatgca tcaaaagcct tagaataaga aaagcttttt ttaaattgct 606" ttatctgtat atctgaactc ttgaaactta tagctaaaac actaggattt atctgcagtg 612" ttcagggaga taattctgcc tttaattgtc taaaacaaaa acaaaaccag ccaacctatg 618" ttacacgtga gattaaaacc aattttttcc tttc tttc tcttgctgcc 624" cacattgtgc ctttatttta tgagccccag ttttctgggc ttagtttaaa aaaaaaatca 630" agtctaaaca ttgcatttag aaagcttttg ttcttggata aaaagtcata cactttaaaa 636" aaaaaaaaaa ctttttccag gaaaatatat tgaaatcatg ctgctgagcc tctattttct 642" ttctttgatg ttttgattca gtattctttt atcataaatt tttagcattt tcac 648" tgatgtacat aata aactgcttta atgaataaca aactatgtag tgtgtcccta 654" ttataaatgc attggagaag tatttttatg agactcttta ctcaggtgca tggttacagc 660" ccacagggag gcatggagtg ccatggaagg attcgccact acccagacct tgttttttgt 666" tgtattttgg aagacaggtt agaa tcct aaaa gatgatgcta 672" ttacaactag cattgcctca aaaactggga ccaaccaaag tgtgtcaacc ctgtttcctt 678" aaaagaggct atgaatccca aaggccacat ccaagacagg caataatgag cagagtttac 684" agctccttta tgtg tcagtaattt taaggtttat agttccctca acacaattgc 690" taatgcagaa tagtgtaaaa tgcgcttcaa gaatgttgat gatgatgata tagaattgtg 696" gctttagtag cacagaggat gccccaacaa actcatggcg ttgaaaccac acagttctca 702" ttactgttat ttattagctg tagcattctc ctct ctctcctcct ttgaccttct 708" cctcgaccag ccatcatgac atttaccatg aatttacttc ctcccaagag ctgc 714" ccgtcagatt gttgctgcac atagttgcct ctct gtatgaaata aaaggtcatt 7201 tgttcatgtt aaaaaaaaa Protein seguence: 442442 NCBI Reference Seguence: NP_003861.1 LOCUS NP_003861 ACCESSION NP_003861 l msaadevdgl gvarphygsv ldnerltaee mderrrqnva yeylchleea krwmeaclge 6" dlppttelee glrngvylak lgnffspkvv slkkiydreq trykatglhf rhtdnviqwl l2" namdeiglpk ifypettdiy drknmprciy cihalslylf klglapqiqd lygkvdftee 18" einnmktele pafs kiggilanel svdeaalhaa viaineaidr ripadtfaal 24" knpnamlvnl tyqd ilyqakqdkm tnaknrtens ererdvyeel ltqaeiqgni " nkvntfsala nidlalqud alalfralqs palglrgqu qnsdwylkql lsquqqus 36" qkee lqsgvdaans aaqqurrla aaiq tvle lmnpeaqlpq 42" vypfaadlyq rqs hpel svavemlssv alinralesg dvntvwkq; s 48" ssvtgltnie eencqrylde lmklkaqaha ennefitwnd iqacvdhvnl vvqeeheril 54" aiglineald egdaqktha lqipaakleg vlaevaqhyq krek aqeiqdesav 60" lwldeiqggi qunkdtqea qkfalgifai neavesgdvg ktlsalrspd vglygvipec 66" getyhsdlae akkkklavgd hwv kggyyyyhnl etqeggwdep pnqunsmql 72" ssis gvtaaynreq lwlaneglit rlqarcrgyl vrqefrsrmn flkkqipait 78" ciqsqwrgyk qkkayqdrla ylrshkdevv kiqslarmhq arkryrdrlq yfrdhindil 84" kiqafirank arddyktlin vvvr kfvhlldqsd qdfqeeldlm kmreevit; i 90" ranqlendl nlmdikigll hdv vshskkltkk dmmm inqugglka 96" lskekrekle athlfyllq tnptylakli fqmpanstk fmdsviftly nyaanreey "O2" ktal qeeikskvdq iqeivtgnpt sfnr gargqnaqu ilapvvkeim "08" ddkslniktd pvdiykswvn geas klpydvtpeq alaheevktr ldssirnmra "l4" vtdkflsaiv ssvdkipygm rfiakvlkds lhekfpdage dellkiignl lyyrymnpal "20" vapdafdiid lsaggqlttd qrrnlgsiak mlqhaasnkm flgdnahlsi ineylsqsyq "26" kfrrffqtac dvpelqdkfn vdeysdlvtl tkpviyisig eiinthtlll dhqdaiapeh "32" ndpihelldd lgevptiesl igessgnlnd pnkealakte vsltltnkfd vpgdenaemd "38" artillntkr livdvirfqp getlteilet patseqeaeh qramqrrair daktpdkmkk "44" sksvkedsnl thekkekiq tglkkltelg tvdpknkyqe lindiardir nqrryrqrrk "50" aelvquqty aalnskatfy gequyyksy iktcldnlas kgkvskkpre mkgkkskkis "56" lkytaarlhe kgvlleiedl qvanknvif eispteevgd fevkakfmgv qmetfmlhyq 443443 1621 dllqlqyegv avmklfdrak vnvnllifll k OfibblSymbokSRSF2 Official Name: serine/arginine-rich splicing factor 2 gflflyiflg:6427 Organism: Homo sapiens Other s: PR264, 80-35, 8035, SFRSZ, SFRS2A, SRpBOb Other Designations: SR splicing factor 2; splicing ent, 35 kDa; splicing factor SC35; splicing factor, arginine/serine-rich 2 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_003016.4 LOCUS NM_003016 ACCESSION NM_003016 l agaaggtttc atttccgggt ggcgcgggcg ccattttgtg aggagcgata taaacgggcg 6" cagaggccgg gccc agttgttact caggtgcgct agcctgcgga gcccgtccgt 12" gctgttctgc gcct ttcccagtgt ccccacgcgg aaggcaactg cctgagaggc 18" gcggcgtcgc accgcccaga gctgaggaag ccggcgccag ttcgcggggc tccgggccgc 24" cactcagagc tatgagctac ggccgccccc ctcccgatgt ggagggtatg acctccctca " aggtggacaa cctgacctac cgcacctcgc ccgacacgct gaggcgcgtc ttcgagaagt 36" acgggcgcgt cgtg tacatcccgc gggaccgcta caccaaggag tcccgcggct 42" tcgccttcgt tcgctttcac cgcg acgctgagga cgctatggat gccatggacg 48" gggccgtgct ggangCCgC gagctgcggg tgcaaatggc gcgctacggc cgccccccgg 54" actcacacca cagccgccgg ggaccgccac cccgcaggta nggggcggt ggctacggac 60" gccggagccg cagccctagg cggcgtcgcc gcagccgatc ccggagtcgg agccgttcca 66" ggtctcgcag ccgatctcgc tacagccgct cgaagtctcg cact cgttctcgat 72" ctcggtcgac ctccaagtcc agatccgcac gaaggtccaa gtccaagtcc tcgtcggtct 78" ctcg ttcgcggtcc aggtcccggt ctcggtccag gagtcctccc ccagtgtcca 84" agagggaatc caaatccagg tcgcgatcga agagtccccc caagtctcct gaag 90" gagcggtgtc ctcttaagaa aatggtaatg aatc cgagacacat aaccctaatt 444444 96" cataaatggg atttggggta ggtctttttg agtcgtgtta atgtaagaat gactcctatc "02" attaggagtg ctgctcggag gttactcacc tttgggagta atactgaaga gaggggtctg "08" cagaaaggat gtgtatgaag cttagataat aatggctgtt tcgtaaactg tttgagacct "l4" attaatgaaa atgactattt cttgctgttt ttatccaacg tttt ccccctttaa "20" agctgcggtc tcctgtttga aata ttggccagta ttgcagattt taactgattt "26" ggctgatcct ccagggacca gtttctgtgg gcgtgtattg gagcaggttt gtctttaaat "32" gttaaagatg cactatcctc ttagagaaac aatcagttca actattgttg ctgg "38" gacttcatat tctaatggat gtggcaaaag aattgcaata agaagcagtg aacatttgga "44" accccaaaag aaag:tacag gtattgcact gggtggggaa aggatagtgt gtctttaact "50" ttgt ttgg:cctat tttttaaaaa ggaaagggcc ctaagtagct cagatattaa "56" agtagtattc tcaa:tacca aatgtttca: ttgaaacaat ttatcttaat gaaatataga "62" ccaattctct gatc:cgagt tgt: tggatacagc cctttttttt ttcttttttt "68" ccct tacc :ttctt caccttggt: atttggccag gaatacgtaa actt "74" gtacatgctg atgg:agcct ttgtgaaat: ttcctaattg ggccttttaa aaacatggct "80" gggtggaaca tttc:gtacc ctactggtt: gaccagagcc ttagtaagta tgaa "86" actgaaacca cttt aatggaagg: aagctgaact tttt caaacctaga "92" tgtatcggca agcagtgtaa acggaggac: tggggaaaaa ggaccacata gtccatcgaa "98" gaagagtcct tggaacaagc aactggcta: tgaaaaggtt attttgtaac att :gtctaa 204" ctttttactt gtttaagctt tgcctcagt: ggcaaacttc attttatgtg cca :tttgtt 210" gctgttattc aaatttcttg taatttagtg aggtgaacga attt cat a 216" tttggatatt tgaggtaaaa tttcattttg agtg ctgacttttt gaaa 222" ttaaacagat tggtaaccta atttgtggcc tcctgacttt taaggaaaac gtg:gcagcc 228" attacacaca gcctaaagct gtcaagagat tgactcggca ttgccttcat tcc :taaaat 234" taaaaaccta caaaagttgg tgtaaatttg tatatgttat ttaccttcag atc :aaatgg 240" taatctgaac ccaaatttgt actt ttcaggtgaa aagacttgat ttt :tgaaag 246" gattgtttat caaacacaat tctaatctct tctcttatgt atttttgtgc actaggcgca 252" gttgtgtagc agttgagtaa tgctggttag ctgttaaggt ggcgtgttgc agtgcagagt 258" gcttggctgt ttcctgtttt ctcccgattg ctcctgtgta aagatgcctt gtcgtgcaga 264" aacaaatggc tgtccagttt attaaaatgc ctgacaactg cagt cacccgggcc 270" ttgcatataa ataacggagc atacagtgag cacatctagc tgatgataaa tacacctttt 445445 2761 tttccctctt ccccctaaaa atggtaaatc tatc tacatgtatg aacttaacat 2821 tgtt aaggaagcaa atggttgtaa ctttgtaagt acttataaca tggtgtatct 2881 ttttgcttat gaatattctg tattataacc tctg tagtttaatt tttt 2941 cttggtgtta gcttttctca gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3001 aaa Protein seguence (variant 1): NCBI Reference Seguence: NP_003007.2 LOCUS NP_003007 ACCESSION NP_003007 1 msygrpppdv egmtslkvdn ltyrtspdtl rrvfekygrv gdvyiprdry tkesrgfafv 61 rfhdkrdaed amdamdgavl dgrelrvqma dshh srrgppprry ggggygrrsr 121 sprrrrrsrs rsrsrsrsrs rsrysrsksr srtrsrsrst sksrsarrsk sksssvsrsr 181 srsrsrsrsr spppvskres kspp kspeeegavs s GOLGA3 Official Symbol: GOLGA3 Official Name: golgin A3 m: 2802 Organism: Homo sapiens Other Aliases: GCP170, MEA-2 Other Designations: Golgi ne associated protein; Golgi peripheral membrane protein; Golgin subfamily A member 3; SY2/SY10 protein; golgi autoantigen, golgin subfamily a, 3; golgi complex-associated n of 170 kDa; golgin-160; golgin-165; male enhanced antigen-2 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_005895.3 LOCUS NM_005895 ACCESSION NM_005895 1 ggcctgggcg cgtccctgca gcgtggcggg acggccccgt tccagtcacc cccgcctcgc 446446 6" tgcggtggcc tcgggcctgg gcgcccgcct tcagctgcgg cggagctggc tctgtaaatg 12" ccggtgcccg cgagccctcc tgaatgcttg tctgcgcccg gcgg cctgtcccga 18" agctgtccac tgccaccact cgggcagtgc ttgctttagc ctggcccttt gcagctgaaa 24" ggcgtgacat ggtgtcgggt ggttcgtggg aagtcggggt agtc cgtgtacttc " cttgtttgtc tttgtcgctg gccgacttgt cttcattcca ggtggccaga gcgagtgggg 36" ccgggcgttg tcacgggtat catgatatta ttga catcaagtca tttgtgagtc 42" atcagatctt ctcctgaaaa tgggagacac agtagggccc ggag ctcttggctg 48" ttgctgatgg ccaa gcttgtccaa ggttcacttg tagcccctca gctc 54" tgtc gtcctgacca gcgc cgag caagatggcc tccaggagga 60" cagatcccac agtggcccct cgtctctccc cgaggcccca ctgaagcccc cgggcccact 66" ggtgccacct gaccagcagg acaaagtcca gtgtgccgag gtaaacagag catccacgga 72" aggggaaagc ccggatggac ctggccaggg aggcctctgt cagaacgggc caacgccacc 78" cttcccagac cctccgtcgt ctctcgatcc caccacaagc ccagtgggcc ctgatgcctc 84" tccaggtgtg gctggtttcc atgacaacct aaggaagtct cagggaacta gtgctgaggg 90" cagtgttaga aaagaagctt tgcagtctct cagactcagt cttcctatgc aagaaacgca 96" actgtgctct acagattctc ccctgcccct ggag gagcaggtcc gacttcaggc "02" tcggaagtgg ctggaagagc agctcaaaca gtacagggtg aagcgccagc aggagaggtc "08" cagtcaacct gcaaccaaaa cgagactttt tagcacgctt gatcctgagc tcatgttaaa "l4" cccagaaaac aggg ccagtaccct ggctatgaca aaagaatatt ccttcctgcg "20" caccagtgtc gggc ctaaggtggg cagcctgggg cttccggcac atcctaggga "26" gaaaaaaact tccaaatcaa gcaaaatccg gtctctggcc gattacagaa ctgaagattc "32" gggg aattctgggg gaaatgtccc ggctcccgat tctaccaagg gttccctgaa "38" gcagaacaga agcagtgcgg cgtccgttgt gtctgagatc agcctgtccc ccgacactga "44" cgaccgtctg gagaacacct ccctggctgg agacagcgtg tctgaggtgg atga "50" cagcgacagc tcatcgtaca gcagcgcctc cacccgaggg acctatggca ttctgtcgaa "56" gggc acgcaggaca ccccctatat ggtcaacggc caggagattc ctgcggatac "62" ccag ttcccctcca ttaaggacgt cctccaggcc gcagccgctg agcaccaaga "68" ccaggggcag gaggtcaacg gggaggtgcg gagtcggaga gacagcatct gcagcagcgt "74" ggag agctctgcag cagaaacaca ggaggagatg ctgcaggtgc tcaaagagaa "80" aatgcgactc gaaggacagc tggaagcctt gtcactggag gcgagtcagg cacttaaaga 447447 186" gaaggctgag ctgcaggccc agctggccgc cctcagcacg aagctgcagg cgcaggtgga 192" gtgcagccac agcagccagc agcggcagga ttcgctgagc tcggaggtgg acaccctgaa 198" gcagtcgtgc tgggacctgg agcgagccat cctg cagaacatgc tggaggcaaa 204" aaatgccagc ctggcgtcgt acga cttgcaggtg gccgaggagc agtaccagag 210" gcttatggcc aaggtagagg acatgcagag gagcatgctc agcaaggaca acacagtgca 216" cgacctgcga cagcagatga cagccttgca gagccagctt cagcaggtgc agctggagcg 222" gacgacgctg accagcaagc tgaaggcgtc gcaggcggag atctcgtccc tgcagagtgt 228" ccggcagtgg taccagcagc ccct ggcacaggag gcccgcgtca ggctgcaggg 234" ggcc cacatccagg ttggacagat gacccaggca ggtctcctgg agcacctgaa 240" actcgagaat gtgtccctgt cccagcagct gacggaaact aggt ccatgaagga 246" gaaggggcgc atcgcggcac aggg cattgaggct gacatgttgg aagc 252" agccttcatg cagattcagg aggcaaagac gatggtggag gaggaccttc agaggaggct 258" gttt gaaggtgaga gggagcggct gcagaggatg gcggactcgg cggcatccct 264" ggagcagcag ctggagcagg tgaagttgac ccag cgagaccagc agcttgaggc 270" tttgcagcag gagcacctgg acctgatgaa acagctcacc ttgactcagg aggctctgca 276" gagcagggag cagtccctcg atgccctgca ctac gatgagctgc aggccaggct 282" gggggagctg cagggcgagg ccgcctccag ggaggacacg atctgcctcc tgcagaacga 288" catc gcgg ctttgcaggc ggccaagagt ggcaaggagg acag 294" aggagcaaga cgcttggaag aaggtaccga ggaaacgtcg gaaactttag agaagttaag 300" agaagaatta aaat ccggccaggt ggaacacctg cagcaggaga ctgctgctct 306" gaaaaagcaa atgcaaaaaa taaaggaaca gtttctccaa caaaaggtga tggtggaggc 312" ctaccggcgc gacgccacct ccaaagacca gctcatcagt gagctgaaag ccaccaggaa 318" gaggctggac tcggagctga aggagctgcg gcaggagctg atgcaagtgc acggggagaa 324" tgcc gaggcggagc tctcgcgcct gcacagagag gtggcccagg tccgtcagca 330" catggcggac cttgaagggc atctccagtc ggcgcagaag gagcgagacg agatggaaac 336" acacttgcag tcgttgcagt tcgataagga gcagatggtc gcggtcacag aggccaatga 342" ggcgctgaag aaacaaatcg aagagttgca gcaagaggcc cggaaggcca tcacggaaca 348" gaagcagaag atgaggcggc tgggctcaga cttgaccagc gcccagaagg agatgaagac 354" caaacataag gcctacgaga acgccgtggg catcctcagc cgccgcctgc aggaggccct 360" cgcggccaag gcgg acgcggagct gggccagctc cgagcccagg gtggcagcag 448448 366" tgacagcagc ctggctctac atgaaaggat cctg gaggcggagc tgcaggctgt 372" cagtcatagc aagacgctgc tggaaaagga actgcaggag gtcatagcgc tgaccagcca 378" ggagctggag cggg agaaggtgct ggagctggag gacgagcttc aagaatccag 384" aggctttagg aagaagataa aacgccttga ggagtcaaac aagaagttgg ctcttgaatt 390" agagcacgag aagc ttacgggcct cggtcagtcc aacgcagctc tgcgggaaca 396" caacagcatc ctagaaacag ctttggccaa gagggaggca gtcc agttgaacct 102" tcaggtgcag ttgc agcgcaaaga agaggaggat cgccagatga agcatcttgt 108" ccaggccctg caggcctcac tagagaagga gaaggagaag gtgaacagcc tcaaggagca 114" ggtggctgct gccaaggtgg aagccgggca taaccgccgc cacttcaagg cggcctcctt 120" ggagctgagt gaggtgaaga aggagctgca ggccaaggaa cacctggtgc agaagctgca 126" ggccgaggcc cttc agattcggga ggggaaacat tcccaggaga tagcacagtt Z32" ccaagcagag ctggccgagg cccgggcaca gctccagctc ctgcagaagc acga Z38" gcagctcagc aaacagcccg tgggaaacca agagatggaa aatctcaaat gggaggtgga Z44" tcagaaagaa agagaaatcc agtccttgaa gcagcagctg gacttgacgg agcagcaggg 150" caggaaggaa ctggaagggc tacagcagct gctgcagaac gtcaagtctg agttggagat 156" ggcccaggaa gacctgtcca tgacccagaa ggataaattt atgctccagg caaaagtgtc Z62" ggagctgaag aacaacatga agaccctgct ccagcagaac cagcagctca agctggacct Z68" acgccgcggc gcggccaaga agga gccgaaaggc gaggccagct cttccaaccc Z74" tgccacgccc atcc gccc cgcc tcgctgctgg aggagctgct 180" accg cccgccgtga gcaaggagcc cctcaagaac ctgaacagct gcctccagca 186" gctcaagcag gaca gcctgcagcg ccagatggag gagcacgccc tgacggtgca Z92" cgagtctctg tcctcgtgga cgccgctgga gccagccact gccagccctg tgcccccggg Z98" gggtcacgcc ggcccacgcg gcgacccaca cagt cagagcaggg cttccaaaga 504" ggga gagtgactgc tgtggactcg cctccgtgcg ccgctgcccc agaaggctct 510" tatcaatgtt atttatttga ttgtgtggtc gatgtttttc taagacatga aatttaagtt 516" ttgttttgcc tttaacaaga agtaaaatat atagcagaat gagagccaag gactagaaaa 522" acattcgaag atcacaatta caca tggaatgacc aactcttaaa agcctgatag 528" gctctcggcg aggagctttg aacgtgtctg tact tgtaggtcgt ggcttctgag 534" cggccaccga tgctgctctc tgcgggtgac agggagaggc tgcgtaactg ggagcagctg 540" tgtgacaggg tctgcggcac cgcgcctggc caggccggct gcagtttctc acttccctgt 449449 546" tccattcagt aagagcttta cttttccgca gaaatgaaat tttatctgta gctt 552" tttacttgtt tttttggata gccatcccac cataggatgt gtacatagat actgaatatc 558" ataatccaat ctttgttttt tttttttttt tttttttgag acagagtctc gctttgttgc 564" ccaggctgga gtgcagtggc acactctccg ctcactgcaa gctccgcctc ccaggttcat 570" gcgattctcc tgcctcagcc tctcgagtag ctgggattac aggcgtgcgc cactatgcca 576" ggctaatgtt tgtattttta gtagcaatgg ggtttcacca tgttggccag gatggtctcg 582" atctcctgac tgat ctgcccatct cagcctccca aagtgctggg attacaggcg 588" tgagcccctg cgcccggcct gtcacccagt ctttaagaag catatgctca tgttattgaa 594" gaagaaccta cttattattg attgcctttt gaaaatttgt tgggaataat ttacctgcag 600" ggga tagtcagaaa attctaagaa atataattat tttatttacc ttctaaagcc 606" aaatattctt aaag tgtt gttctggttt tactttgttg ctgaggatct 612" ttccttcctg ctggtctctt cctctcaggc cactggccct gtgtgattcc accgtggctg 618" gccactggga aggggcagct tggacccttg gtcaggcctg acggccatca ggaggcacaa 624" ggacactgag gccccatatc tgatctgacc tttggggggg cacagggaga ggccggtgga 630" ggaggaggag gagagcagac ctcc cgac tcccgcggtt tcccctggag 636" aggt gtaggtcgca ggcggtaaca acac tcctgttccc caagtgaaaa 642" ccat tgtctgtggg agcgcctgta ctcgtgtgta ggagcacctg tacttctgca 648" gtcatcgaga agtcctggat cttttgtggt tacaccagca tcatgtggca agcagaggcg 654" acttccggaa gagacaggca ggcaccgtga ggaaggtggc tgtgctctcc caggtgtctc 660" agagacagat gccttattta aaatcagcac gacatgtgtg agatcttctg tttcctaccc 666" caaatcctga aaccctgcag acactggctg actgggagag gtggggtctg taagttgtcc 672" ttgc taagaaaatc taaaataata tttattatat gagttaggag agagagaatg 678" ggtccgcgtg gcctcctctg cagatgtact ggtctgaaat gaggttctga gtcactggcc 684" aggccagatg tgctcatgtc ggtgtctggt gtctgttttg aaac agtatggtgt 690" gttttaagct atttgtgttc aata tacttttaga aggttaattg gtaaggttaa 696" atta accacaaaga tgtttggtat ttaaaaaata ttctctagca aatattggaa 702" tttccaaaat atatcatttg tacagggtta attttgaaat aatacttgaa aatttcatta 708" tatc ctacttttta tcttaagttg aagatgttat ttactaaatt gttcttgtac 714" cattagaaaa aaaaatacgg caatttacgt tcttatttat tttggctgta ctaccccttt 720" gttttaattt taaaatcaag aaatcgggcc gggcgcggtg gctcatgcct gtaatcccag 450450 726" cactttggga ggcg ggtggatcac ctgaggtcaa gaggtccaga ccatcctggc 732" caacatggca aaaccccgtc tttactaaac atacaaaaat tagccgggtg tcgtagtgcg 738" cacctataat cccagctact tgggaggctg aggcaggaga atcacttgaa cccaggaggc 744" ggagcttgca cgag atcgcgccac tgccctccag cctgggcaac agagcgagac 750" tccgtctcaa aaataaataa atgattttaa aaaatctaaa atcgagaaat cacacattca 756" gtggggagcg acttctcctt gcttatggga agtcctcaag tgagtgatgt tcaccatgta 762" tttttttttc tcttaggaca gactaattct gaaaataccg aaggaaaagt agctctatgt 768" tctcaccccg gttttcctgc gtgtgtgccc ttgggtgcga tgcctccccc agcgctctgt 774" cggt gccagggccc cctctggttt gcct ggctgccttt gctccctgca 780" gtgagtcttt tggtgttttc atgcacggct tgtgcttctg gatctgaggc ctctcgtgtt 786" gaca cttccttcct taagaagacg cctaaaagag gaagttggaa tttttttttt 792" ttga gacagagtct cgctctgtcg cccaggctgg agtgcagtgg cgtgatctct 798" gctcactgca agctccgcca ttca agcgattctc ctgcctcatt ctccccagta 804" atta caggtgcccg ccaccacacc agcctaattt ttgtattttt agaggggtgg 810" agttccacca ccag gctggtcttg aactcttgac ctcaggtgat cctgagcctc 816" agcctcccaa agtgctggaa ttataggcgt gaaccaccgc ccccggctgc agttggattt 822" ttaaattgct tttttttatt gttgaggttt ttttatctcc aagggactct cccggcactt 828" ctaccttcca gagttacttc agtgcataaa gtttgaatta ttttgttctt gtgggcagaa 834" gtgggaatga tggaatatcc aaaa ggcagtgaag ttgggagtac tgcttacaaa 840" acagggtcac cagtgcatta tgtggcgtgt ccac gccgtgtgtc acgggctagg 846" gcggcgtgtt catccccaca ccgtgtgtca gcta gggcacttca cgatgtcact 852" acttgttttt ctgatgttcc aaaaacaacg taacttggtt ttcatgtgtt tttccgtggt 858" atatgtgaga ttgatgctac gggtcttacg gactcacacc cgttcccact ctctgcaata 864" tggatcaggc agtgtttctg ataggatgtg aaatggactc tcctcgggtg ggtccagcag 870" gggccctgcc caccagaaca cagtccgtgc gcgc taaggagctg gccctcaact 876" ctccttggtg cagggttccc gagt tccc tgaggtcttt aaaaacaaaa 882" tgtt gtacgtgaag attctaggag gggagggacc agcaaatctg agagaaccgt 888" gcct cccttcgagg agccctctga tgtgaggagg gacttgagtt gagtgacgct 894" gtggtgtgag gtgttctgag ctcactgacc ggaaggtcca ggtgaatctc gtcataagtg 900" atctcaggct ctcacaggat ccggagggaa atgtgttaga gggtctggaa aattcagtgc 451451 9061 ttttgagtta cttgttttta ttaaaaattt cctcacaaaa cctc aagttgtggc 9121 tgttcttggg aaaggggtca ccgtgtctga caaagtgtaa ctttaaaaag cacgttgatt 9181 ttttacaaat gtaagtgtgc ttgggaattc cttaaatttt gtgcaataaa ctattttttg 9241 gtaaagattt to Protein seguence (variant 1): NCBI Reference Seguence: NP_005886.2 LOCUS NP_005886 ACCESSION NP_005886 1 eng lqedrshsgp sslpeaplkp pgplvppdqq dkvqcaevnr spdg 6" pgqgglcqng ptppfpdpps Spvg pdaspgvagf hdnlrksqgt saegsvrkea 12" lqslrlslpm qetqlcstds eeqv rlqarkwlee qlqurvqu qerssqpatk 18" trlfstldpe lmlnpenlpr astlamtkey sflrtsvprg pkvgslglpa hprekktsks 24" skirsladyr tedsnagnsg dstk gslkanssa isls pdtddrlent " slagdsvsev dgndsdsssy ssastrgtyg ilsktvgtqd tpymvngqei padtlgqus 36" ikdvlqaaaa ehqdngevn gevrsrrdsi cssvslessa aetqeemlqv lkekmrlegq 42" lealsleasq alkekaelqa qlaalstqu aqvecshssq qrqulssev dtlkqscwdl 48" eramtdlqnm leaknaslas snndlqvaee qurlmakve qursmlskd ntvhdquqm 54" talqsqquv qlerttltsk lkasqaeiss lqsvrquqq qlalaqearv rlqgemahiq 60" vgqmtqagll ehlklenvsl sqqltetth smkekgriaa adml dqeaafmqiq 66" eaktmveedl qrrleefege rerlqrmads aasleqqleq vkltllqrdq qleaquehl 72" dlmkqltltq ealqsreqsl dalqthydel qarlgelqge aasredticl lqnekiilea 78" alqaaksgke eldrgarrle egteetsetl eklreelaik sgqvehque taalkkqmqk 84" ukv mveayrrdat skdqliselk atrkrldsel kequelmqv hgekrtaeae 90" evaq vrqhmadleg hlqsaqkerd emethlqslq fdkeqmvavt eaneakaqi 96" eequearka iteqqumrr lgsdltsaqk emktkhkaye navgilsrrl qealaakeaa "02" daelgqlraq ggssdsslal heriqaleae sktl lekelqevia ltsqeLeesr "08" ekvleledel qesrgfrkki krleesnkkl alelehekgk ltglgqsnaa lrehnsilet "14" alakreadlv qlnlqvqavl queeedrqm khlvqalqas lekekekvns lkeqvaaakv "20" rhfk aaslelsevk kelqakehlv ddl qiregkhsqe iaqfqaelae "26" araqlqllqk qldeqlskqp vgnqemenlk wevquerei qslquldlt eqqgrkeleg 452452 1321 qullqnvks elemaqedls fmlq akvselknnm ktlqunqql kldlrrgaak 1381 trkepkgeas ssnpatpiki pdcpvpasll eellrpppav skeplknlns cqulkqemd 1441 slqrqmeeha ltvheslssw tplepatasp vppgghagpr gdpqrhsqsr askegpge PH4B Official Symbol: P4HB Official Name: prolyl 4-hydroxylase, beta polypeptide Gene ID: 5034 Organism: Homo sapiens Other Aliases: DSI, , GIT, P4Hbeta, PDI, PDIA1, PHDB, PO4DB, PO4HB, PROHB Other Designations: cellular thyroid e-binding n; en prolyl 4- hydroxylase beta; hione-insulin transhydrogenase; p55; procollagen- proline, 2—oxoglutarate 4-dioxygenase (proline 4-hydroxylase), beta polypeptide; prolyl 4-hydroxylase subunit beta; protein disulfide isomerase family A, member 1; protein disulfide ase-associated 1; protein disulfide isomerase/oxidoreductase; protein disulfide-isomerase; protocollagen hydroxylase; thyroid hormone-binding protein p55 Nucleotide seguence: NCBI nce ce: NM_000918.3 LOCUS NM_000918 ACCESSION NM_000918 1 gagcctcgaa gtccgccggc caatcgaagg cgggccccag cggcgcgtgc cggc 6" cagcgcgcgc gggcgggggg gcaggcgcgc cccggaccca ggatttataa aggcgaggcc 12" gggaccggcg ctcg tcgcccccgc tgtcccggcg gcgccaaccg aagcgccccg 18" cctgatccgt gtccgacatg ctgcgccgcg ctctgctgtg cctggccgtg gccgccctgg 24" tgcgcgccga cgcccccgag gaggaggacc acgtcctggt gctgcggaaa agcaacttcg " cggaggcgct ggcggcccac ctgc tggtggagtt ctatgcccct tggtgtggcc 36" actgcaaggc tctggcccct gagtatgcca aagccgctgg gaagctgaag gcagaaggtt 42" ccgagatcag gttggccaag gtggacgcca cggaggagtc tgacctggcc cagcagtacg 48" gcgtgcgcgg ctatcccacc atcaagttct tcaggaatgg agacacggct tcccccaagg 54" aatatacagc tggcagagag gctgatgaca tcgtgaactg gaag cgcacgggcc 453453 60" cggctgccac caccctgcct gacggcgcag ctgcagagtc cttggtggag tccagcgagg 66" tggctgtcat cggcttcttc aaggacgtgg agtcggactc tgccaagcag tttttgcagg 72" cagcagaggc catcgatgac ataccatttg ggatcacttc caacagtgac gtgttctcca 78" aataccagct cgacaaagat ggggttgtcc tctttaagaa gtttgatgaa ggccggaaca 84" actttgaagg ggaggtcacc aaggagaacc tgctggactt tatcaaacac aaccagctgc 90" cccttgtcat cgagttcacc gagcagacag ccccgaagat ttttggaggt gaaatcaaga 96" ctcacatcct gctgttcttg cccaagagtg acta tgacggcaaa ctgagcaact "02" tcaaaacagc agccgagagc ggca agatcctgtt catcttcatc gacagcgacc "08" acaccgacaa ccagcgcatc ctcgagttct ttggcctgaa gaaggaagag gccg "l4" tgcgcctcat caccctggag gaggagatga ccaagtacaa gcccgaatcg ctga "20" cggcagagag gatcacagag ttctgccacc gcttcctgga aatc aagccccacc "26" tgatgagcca ggagctgccg gaggactggg agcc tgtcaaggtg cttgttggga "32" agaactttga agacgtggct tttgatgaga aaaaaaacgt ctttgtggag ttctatgccc "38" catggtgtgg tcactgcaaa cagttggctc ccatttggga taaactggga gagacgtaca "44" aggaccatga gaacatcgtc aaga cgac cgag gtggaggccg "50" tcaaagtgca cagcttcccc acactcaagt tctttcctgc cagtgccgac aggacggtca "56" ttgattacaa cggggaacgc gatg gttttaagaa attcctggag agcggtggcc "62" aggatggggc aggggatgat gacgatctcg aggacctgga agaagcagag gagccagaca "68" tggaggaaga cgatgatcag aaagctgtga aagatgaact gtaatacgca gacc "74" ctgc cgagacccct nggggctgc acacccagca gcagcgcacg cctccgaagc "80" ctgcggcctc agga gggcgtcgcc ggaaacccag ggaacctctc tgaagtgaca "86" cctcacccct acacaccgtc cgttcacccc cgtctcttcc ttctgctttt cggtttttgg "92" aaagggatcc aggc agcccaccct ggtggggctt gtttcctgaa accatgatgt "98" actttttcat acatgagtct gtccagagtg cttgctaccg tgttcggagt ctcgctgcct 204" ccctcccgcg ggaggtttct cctctttttg aaaattccgt ctgtgggatt catt 210" tttcgacatc agggtatttg ttccaccttg gccaggcctc ctcggagaag cttgtccccc 216" gtgtgggagg gacggagccg gactggacat ggtcactcag ctgc agtgtcgcca 222" tgactgatca tggctcttgc atttttgggt aaatggagac ttccggatcc tgtcagggtg 228" tcccccatgc ctggaagagg agctggtggc tgccagccct ggggcccggc ctgg 234" gccttcccct tccctcaagc cagggctcct cctcctgtcg tgggctcatt gtgaccactg 454454 2401 gcctctctac agcacggcct gtggcctgtt caaggcagaa ccct tgactcccgg 2461 gtggggaggt ggat gctggagctg aatcagacgc tgacagttct tcaggcattt 2521 ctatttcaca atcgaattga acacattggc caaataaagt tgaaatttta ccacctgtaa 2581 aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_000909.2 LOCUS NP_000909 ACCESSION NP_000909 l mlrrallcla vaalvradap eeedhvlvlr ksnfaealaa efya pwcghckaLa 6" peyakaagkl kaegseirla kvdateesdl aqqygvrgyp tikffrngdt aspkeytagr l2" eaddivnwlk krtgpaattl pdgaaaeslv essevavigf fkdvesdsak qflqaaeaid 18" dipfgitsns dvfskyqldk dgvvlfkkfd egrnnfegev tkenlldfik hnqlplvief 24" kifg geikthillf lpksvsdydg klsnfktaae sfkgkilfif idsdhtdnqr " ileffglkke ecpavrlitl eeemtkykpe seeltaerit efchrflegk sqel 36" pedwqupvk fedv afdekknva efyapwcghc kqlapiwdkl getykdhenl 42" viakmdstan eveavkvhsf ptlkffpasa drtvidynge rtldgfkkfl esgqugagd 48" dddledleea eepdmeeddd qkavkdel HSPA1 A al Symbol: HSPA1 A Official Name: heat shock 70kDa protein 1A m: 3303 Organism: Homo sapiens Other Aliases: DAQB-147D11.1, HSP70-1, HSP70-1A, HSP7OI, HSP72, HSPA1 455455 CNherDesbnafions:HSP704/HSP702;HSP7OJ/HSP702;dnaK¢ype molecular chaperone HSP70-1; heat shock 70 kDa protein 1/2; heat shock 70 kDa protein 1A/1 B; heat shock 70kD protein 1A; heat shock-induced protein Nucleotide ce: NCBI Reference Seguence: NM_005345.5 LOCUS NM_005345 ACCESSION NM_005345 l ataaaagccc aggggcaagc ggtccggata acggctagcc gctg ctgcgacagt 6" ccactacctt tttcgagagt gactcccgtt gtcccaaggc ttcccagagc gaacctgtgc 12" aggc accggcgcgt cgagtttccg gcgtccggaa ggaccgagct cttctcgcgg 18" atccagtgtt ccgtttccag atct cagagcggag ccgacagaga aacc 24" ggcatggcca cggc gatcggcatc gacctgggca ccacctactc ctgcgtgggg " gtgttccaac acggcaaggt ggagatcatc gccaacgacc agggcaaccg caccaccccc 36" agctacgtgg ccttcacgga caccgagcgg ctcatcgggg atgcggccaa gaaccaggtg 42" gcgctgaacc acac cgtgtttgac gcgaagcggc gccg caagttcggc 48" gacccggtgg tgcagtcgga catgaagcac tggcctttcc aggtgatcaa cgacggagac 54" aagcccaagg tgcaggtgag gggg gagaccaagg cattctaccc cgaggagatc 60" atgg ccaa gatgaaggag atcgccgagg cgtacctggg ctacccggtg 66" accaacgcgg ccgt gccggcctac ttcaacgact cgcagcgcca ggccaccaag 72" gatgcgggtg tgatcgcggg gctcaacgtg ctgcggatca tcaacgagcc cacggccgcc 78" gccatcgcct tgga cagaacgggc aagggggagc gcaacgtgct catctttgac 84" ctgggcgggg gcaccttcga cgtgtccatc ctgacgatcg acgacggcat cttcgaggtg 90" aaggccacgg ccggggacac ccacctgggt ggggaggact ttgacaacag gctggtgaac 96" cacttcgtgg aggagttcaa gagaaaacac aagaaggaca tcagccagaa caagcgagcc "02" gtgaggcggc tgcgcaccgc ctgcgagagg gccaagagga ccctgtcgtc ccag "08" gccagcctgg agatcgactc cctgtttgag ggcatcgact tctacacgtc cagg "l4" gcgaggttcg aggagctgtg ctccgacctg ttccgaagca ccctggagcc cgtggagaag "20" gctctgcgcg acgccaagct ggacaaggcc cagattcacg acctggtcct ggtcgggggc "26" tccacccgca tccccaaggt gcagaagctg ctgcaggact tcttcaacgg gcgcgacctg "32" aacaagagca tcaaccccga Cgaggctgtg gcctacgggg cggcggtgca ggcggccatc "38" ctgatggggg acaagtccga gaacgtgcag ctgc tgctggacgt ggctcccctg 456456 "44" tcgctggggc cggc ngaggcgtg atgactgccc agcg caactccacc .50" atccccacca agcagacgca gatcttcacc acctactccg acaaccaacc ngggtgctg "56" atccaggtgt acgagggcga gagggccatg acgaaagaca acaatctgtt ggggcgcttc "62" gagctgagcg gcatccctcc ggcccccagg ggcgtgcccc agatcgaggt gaccttcgac "68" atcgatgcca acggcatcct gaacgtcacg gccacggaca agagcaccgg caaggccaac "74" acca tcaccaacga caagggccgc ctgagcaagg tcga ggtg "80" caggaggcgg agaagtacaa agcggaggac gaggtgcagc gcgagagggt gtcagccaag "86" aacgccctgg acgc cttcaacatg aagagcgccg tggaggatga ggggctcaag "92" ggcaagatca gcgaggcgga caagaagaag gtgctggaca aaga ggtcatctcg "98" tggctggacg cctt ggccgagaag gacgagtttg agcacaagag gaaggagctg 204" gagcaggtgt gtaaccccat catcagcgga ctgtaccagg gtgccggtgg tcccgggcct 210" gggggcttcg gggctcaggg tcccaaggga gggtctgggt ccac cattgaggag 216" gtagattagg ggcctttcca agattgctgt ttttgttttg gagcttcaag actttgcatt 222" tcctagtatt tctgtttgtc agttctcaat ttcctgtgtt tgcaatgttg aaattttttg 228" gtgaagtact gaacttgctt tttttccggt ttctacatgc gaat ttatactgcc 234" atcttacgac tatttcttct ttttaataca cttaactcag gccatttttt aagttggtta 240" cttcaaagta aataaacttt aaaattcaaa aaaaaaaaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_005336.3 LOCUS NP_005336 ACCESSION N 36 _ makaaaigid lgttyscvgv eiia ndqgnrttps yvaftdterl igdaaknqva 6" lnpqntvfda krligrkfgd pvvqsdmkhw pfqvindgdk pkqusykge tkafypeeis 12" smvltkmkei aeaylgypvt navitvpayf ndsqrqatkd agviaglnvl riineptaaa 18" iaygldrtgk gernvlifdl gggtfdvsil tiddgifevk atagdthlgg edfdnrlvnh 24" fveefkrkhk kdisanrav rrlrtacera krtlssstqa sleidslfeg idfytsitra " rfeelcsdlf rstlepveka lrdakldkaq ihdlvlvggs tripkvqkll qdffngrdLn 36" ksinpdeava ygaavqaail mgdksenqu lllldvapls lgletaggvm talikrnsti 42" ptkqtqiftt ysdnquvli amt kdnnllgrfe aprg quievtfdi 457457 481 dangilnvta kank ititndkgrl skeeierqu eaekykaede vqrervsakn 541 alesyafnmk savedeglkg kiseadkkkv ldkcqevisw ldantlaekd efehkrkele 601 qvcnpiisgl ngaggpgpg kgg sgsgptieev d Gene Official Symbol: HNRNPD Official Name: heterogeneous nuclear ribonucleoprotein D (AU-rich element RNA binding protein 1, 37kDa) Gene ID: 3184 Organism: Homo s Other Aliases: AUF1, AUF1A, Other Designations: nding protein AUFI, type A; heterogeneous nuclear ribonucleoprotein D0; hnRNP D0 Nucleotide seguence: M D NCBI Reference Seguence: NM_001003810.1 LOCUS NM_001003810 ACCESSION 003810 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga gaga 6" gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12" gcgcgagagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18" cgcgcgcgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgc ggcagcggcg 24" gggattactt tgctgctagt tcgc ggcagcggcg ggtgtagtct cggcggcagc " ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc agcg 36" gcaacggcgg ngtaggcgg ctcggcgggc gagg gagccatggt ggcggcgaca 42" cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54" gaggatgaag ggaaaatgtt tataggaggc cttagctggg acactacaaa gaaagatctg 60" aaggactact tttccaaatt tggtgaagtt gtagactgca ctctgaagtt agatcctatc 66" acagggcgat caaggggttt tggctttgtg ctatttaaag aatcggagag tgtagataag 458458 72" gtcatggatc aaaaagaaca taaattgaat gggaaggtga ttgatcctaa aagggccaaa 78" gccatgaaaa caaaagagcc aaaa atttttgttg gtggcctttc tccagataca 84" cctgaagaga aaataaggga gtactttggt ggtg aggtggaatc catagagctc 90" cccatggaca ccaa taagaggcgt gggttctgct ttattacctt taaggaagaa 96" gaaccagtga agaagataat ggaaaagaaa taccacaatg ttggtcttag taaatgtgaa "02" ataaaagtag ccatgtcgaa ggaacaatat cagcaacagc aacagtgggg atctagagga "08" ggatttgcag gaagagctcg tggaagaggt ggtgaccagc agagtggtta tgggaaggta "l4" tccaggcgag gtggtcatca aaatagctac aaaccatact aaattattcc atttgcaact "20" tatccccaac aggtggtgaa gcagtatttt ccaatttgaa gattcatttg aaggtggctc "26" ctgccacctg ctaatagcag ttcaaactaa attttttgta tcaagtccct gaatggaagt "32" atgacgttgg gtccctctga agtttaattc tgagttctca ttaaaagaaa tttgctttca "38" ttgttttatt tcttaattgc tatgcttcag aatcaatttg tgttttatgc cctttccccc "44" agtattgtag agcaagtctt gtgttaaaag cccagtgtga cagtgtcatg atgtagtagt "50" gtcttactgg ttttttaata aatccttttg tataaaaatg tattggctct tttatcatca "56" gaataggaaa aattgtcatg gattcaagtt attaaaagca taagtttgga agacaggctt "62" gccgaaattg tgat taaaattgca tttg aaatgttttt agcaaaatct "68" aatttttgcc ataatgtgtc ctccctgtcc aaattgggaa tgacttaatg tcaatttgtt "74" tgttggttgt tttaataata tatg tagccattaa gatttatatg ttcc "80" caaatgccca gtttttgctt aatatgtatt gtgcttttta gaacaaatct ggataaatgt "86" gcaaaagtac ccctttgcac ttaa tgttttatgc ttccattaaa taaaaaggac "92" ttaaaatctg ttaattataa tagaaatgcg gctagttcag agagattttt gtgg "98" tggacttcat agatgaattc tgag ggaggattaa atat accgtgttta 204" tgtgtgtgtg ctt n seguence: ISOFORM D NCBI Reference Seguence: NP_001003810.1 LOCUS NP_001003810 ION N P_001003810 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida egkm figglswdtt kkdlkdyfsk fgevvdctlk ldpitgrsrg 459459 121 fgfvlfkese svdkvmque hklngkvidp krakamktke pvkkifvggl spdtpeekir 181 eyfggfgeve sielpmdnkt nkrrgfcfit fkeeepvkki nvgl skceikvams 241 kequqqqu gsrggfagra rgrggdqqsg ygkvsrrggh qnsykpy Nucleotide seguence: ISOFORM C NCBI Reference ce: NM_002138.3 LOCUS NM_002138 ACCESSION NM_002138 1 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6" gcggccgccg ctggtgctta ttta gtgcagcggg agagagcggg agtgtgcgcc 12" gagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18" cgcgcgcgcc gtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 24" gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc " ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 36" gcaacggcgg ngtaggcgg ctcggcgggc gagcaggagg tggt ggcggcgaca 42" cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54" gaggatgaag gccattcaaa ctcctcccca cgacactctg aagcagcgac ggcacagcgg 60" gaagaatgga aaatgtttat aggaggcctt agctgggaca ctacaaagaa agatctgaag 66" gactactttt ccaaatttgg tgaagttgta gactgcactc tgaagttaga tcctatcaca 72" gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat gtgt ggtc 78" caaa aagaacataa attgaatggg aaggtgattg atcctaaaag ggccaaagcc 84" atgaaaacaa aagagccggt taaaaaaatt tttgttggtg gcctttctcc agatacacct 90" gaagagaaaa taagggagta ctttggtggt gagg tggaatccat agagctcccc 96" atggacaaca agaccaataa gaggcgtggg ttctgcttta ttacctttaa ggaagaagaa 102" ccagtgaaga agataatgga aaagaaatac cacaatgttg gtcttagtaa atgtgaaata 108" aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc tagaggagga 114" ggaa gagctcgtgg tggt gaccagcaga gtggttatgg gaaggtatcc 120" aggcgaggtg aaaa tagctacaaa ccatactaaa catt ttat 126" ccccaacagg tggtgaagca gtattttcca atttgaagat tcatttgaag gtggctcctg 460460 "32" ccacctgcta atagcagttc aaactaaatt ttttgtatca agtccctgaa tggaagtatg "38" acgttgggtc cctctgaagt ttaattctga gttctcatta attt gctttcattg "44" ttct taattgctat gcttcagaat gtgt tttatgccct cagt "50" attgtagagc aagtcttgtg ttaaaagccc agtgtgacag gatg tagtagtgtc "56" ttactggttt tttaataaat ccttttgtat aaaaatgtat tggctctttt atcatcagaa "62" aaat tgtcatggat tcaagttatt aaaagcataa gtttggaaga caggcttgcc "68" gagg acatgattaa aattgcagtg aagtttgaaa tgtttttagc aaaatctaat "74" cata atgtgtcctc cctgtccaaa atga cttaatgtca atttgtttgt "80" tggttgtttt aataatactt ccttatgtag ccattaagat ttatatgaat attttcccaa "86" atgcccagtt tttgcttaat atgtattgtg ctttttagaa caaatctgga taaatgtgca "92" aaagtacccc tttgcacaga tagttaatgt tttatgcttc cattaaataa aaaggactta "98" aaatctgtta attataatag aaatgcggct agttcagaga gatttttaga gctgtggtgg 204" acttcataga tgaattcaag tgttgaggga ggattaaaga aatatatacc gtgtttatgt 210" gtgtgtgctt Protein seguence: ISOFORM C NCBI nce Seguence: NP_002129.2 LOCUS NP_002129 ACCESSION NP_002129 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 kida skneedeghs nssprhseaa taqreewkmf igglswdttk kdlkdyfskf 121 gevvdctlkl dpitgrsrgf gfvlfkeses vdkvmqueh klngkvidpk rakamktkep 181 vkkifvggls pdtpeekire yfggfgeves ielpmdnktn krrgfcfitf keeepvkkim 241 ekkyhnvgls kceikvamsk equqqqug srggfagrar grggdqqsgy gkvsrrgghq 301 nsykpy Nucleotide seguence: M B NCBI Reference Seguence: NM_031369.2 LOCUS NM_031369 ACCESSION NM_031369 461461 l cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6" gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12" gcgcgagagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18" cgcgcgcgcc gtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 24" gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc " ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 36" gcaacggcgg ngtaggcgg ctcggcgggc gagcaggagg tggt ggcggcgaca 42" cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54" gaggatgaag ggaaaatgtt tataggaggc cttagctggg acactacaaa gaaagatctg 60" aaggactact tttccaaatt tggtgaagtt tgca ctctgaagtt tatc 66" acagggcgat caaggggttt tggctttgtg ctatttaaag aatcggagag tgtagataag 72" gtcatggatc aaaaagaaca taaattgaat gggaaggtga ttgatcctaa aagggccaaa 78" gccatgaaaa caaaagagcc ggttaaaaaa atttttgttg tttc tccagataca 84" cctgaagaga aaataaggga gtactttggt ggttttggtg aatc gctc 90" cccatggaca acaagaccaa taagaggcgt gggttctgct ttattacctt taaggaagaa 96" gaaccagtga agaagataat ggaaaagaaa taccacaatg ttggtcttag tgaa "02" gtag ccatgtcgaa ggaacaatat cagcaacagc aacagtgggg atctagagga "08" ggatttgcag gaagagctcg tggaagaggt ggtggcccca gtcaaaactg gaaccaggga "14" tatagtaact attggaatca aggctatggc aactatggat ataacagcca cggt "20" ggttatggag gatatgacta cactggttac aacaactact atggatatgg tgattatagc "26" aaccagcaga gtggttatgg gaaggtatcc aggcgaggtg gtcatcaaaa tagctacaaa "32" ccatactaaa ttattccatt tgcaacttat ccccaacagg agca gtattttcca "38" atttgaagat gaag gtggctcctg gcta atagcagttc aaactaaatt "44" ttttgtatca agtccctgaa tggaagtatg acgttgggtc aagt ttaattctga "50" gttctcatta aaagaaattt gctttcattg ttttatttct taattgctat gcttcagaat "56" caatttgtgt tttatgccct ttcccccagt gagc aagtcttgtg ttaaaagccc "62" agtgtgacag tgtcatgatg tagtagtgtc ttactggttt tttaataaat ccttttgtat "68" aaaaatgtat tttt atcatcagaa taggaaaaat tgtcatggat tcaagttatt "74" aaaagcataa aaga caggcttgcc gaaattgagg acatgattaa aattgcagtg 462462 180" gaaa tgtttttagc aaaatctaat ttttgccata cctc cctgtccaaa 186" ttgggaatga cttaatgtca atttgtttgt tggttgtttt aataatactt ccttatgtag 192" ccattaagat ttatatgaat attttcccaa atgcccagtt taat atgtattgtg 198" ctttttagaa caaatctgga taaatgtgca aaagtacccc tttgcacaga tagttaatgt 204" tttatgcttc cattaaataa aaaggactta aaatctgtta attataatag aaatgcggct 210" agttcagaga gatttttaga gtgg taga tgaattcaag ggga 216" ggattaaaga aatatatacc gtgtttatgt gtgtgtgctt Protein seguence: ISOFORM B NCBI Reference ce: NP_112737.1 LOCUS NP_1 12737 ACCESSION NP_112737 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedegkm figglswdtt kkdlkdyfsk fgevvdctlk ldpitgrsrg 121 kese svdkvmque hklngkvidp krakamktke pvkkifvggl spdtpeekir 181 eyfggfgeve sielpmdnkt cfit fkeeepvkki mekkyhnvgl skceikvams 241 kequqqqu gsrggfagra rgrgggpsqn wnqusnywn qugnygyns quggyggyd 301 ytgynnyygy sgy gkvsrrgghq nsykpy Nucleotide seguence: ISOFORM A NCBI Reference Seguence: NM_031370.2 LOCUS NM_031370 ACCESSION NM_031370 1 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 61 gcggccgccg ctggtgctta ttctttttta cggg cggg agtgtgcgcc 121 gcgcgagagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 181 cgcgcgcgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 241 gggattactt tgctgctagt tcgc ggcagcggcg ggtgtagtct cggcggcagc 301 ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 361 gcaacggcgg ngtaggcgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 463463 42" cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48" ggcaccgaag ggggcagcgc ggag ggggcgaaga ttgacgccag taagaacgag 54" gaggatgaag gccattcaaa ctcctcccca cgacactctg aagcagcgac ggcacagcgg 60" gaagaatgga aaatgtttat cctt gaca ctacaaagaa agatctgaag 66" gactactttt ccaaatttgg tgaagttgta gactgcactc tgaagttaga tcctatcaca 72" gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat cggagagtgt agataaggtc 78" atggatcaaa aagaacataa attgaatggg aaggtgattg atcctaaaag ggccaaagcc 84" atgaaaacaa aagagccggt taaaaaaatt tttgttggtg gcctttctcc acct 90" aaaa taagggagta ctttggtggt gagg tggaatccat agagctcccc 96" atggacaaca agaccaataa gaggcgtggg ttctgcttta ttacctttaa ggaagaagaa "02" aaga agataatgga aaagaaatac cacaatgttg gtcttagtaa aata "08" aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc tagaggagga "l4" tttgcaggaa gagctcgtgg aagaggtggt ggccccagtc aaaactggaa ccagggatat "20" agtaactatt ggaatcaagg ctatggcaac tatggatata aagg ttacggtggt "26" tatggaggat atgactacac tggttacaac aactactatg gatatggtga ttatagcaac "32" cagcagagtg gttatgggaa ggtatccagg cgaggtggtc atag ctacaaacca "38" tactaaatta ttccatttgc aacttatccc gtgg tgaagcagta ttttccaatt "44" tgaagattca tttgaaggtg gctcctgcca cctgctaata gcagttcaaa ctaaattttt "50" tgtatcaagt ccctgaatgg gacg ttgggtccct ctgaagttta attctgagtt "56" aaaa gaaatttgct ttcattgttt tatttcttaa ttgctatgct tcagaatcaa "62" tttgtgtttt tttc ccccagtatt gtagagcaag tcttgtgtta aaagcccagt "68" gtgacagtgt catgatgtag tagtgtctta ctggtttttt aataaatcct tttgtataaa "74" aatgtattgg ctcttttatc atcagaatag gaaaaattgt catggattca agttattaaa "80" agcataagtt tggaagacag gcttgccgaa attgaggaca tgattaaaat tgcagtgaag "86" tttgaaatgt ttttagcaaa atctaatttt aatg tgtcctccct gtccaaattg "92" ggaatgactt aatgtcaatt tgtttgttgg ttgttttaat aatacttcct tatgtagcca "98" ttaagattta tatgaatatt ttcccaaatg cccagttttt gcttaatatg tattgtgctt 204" tttagaacaa atctggataa atgtgcaaaa gtaccccttt gcacagatag tttt 210" atgcttccat taaataaaaa ggacttaaaa tctgttaatt ataatagaaa tgcggctagt 216" tcagagagat ttttagagct gtggtggact tcatagatga attcaagtgt tgagggagga 222" ttaaagaaat atataccgtg tttatgtgtg tgtgctt 464464 n seguence: ISOFORM A NCBI Reference Seguence: NP_112738.1 LOCUS NP_112738 ACCESSION N P_112738 1 mseequgdg aaaaataavg egam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedeghs nssprhseaa taqreewkmf igglswdttk kdlkdyfskf 121 gevvdctlkl dpitgrsrgf gfvlfkeses vdkvmqueh idpk rakamktkep 181 ggls pdtpeekire yfggfgeves ielpmdnktn fitf keeepvkkim 241 ekkyhnvgls kceikvamsk equqqqug srggfagrar grgggpsqnw nqusnywnq 301 gygnygynsq gyggyggydy ygyg dyanqsgyg kvsrrgghqn sykpy RPL32 Official Sym bol: RPL32 Official Name: ribosomal protein L32 Gene ID: 6161 Organism: Homo s Other Aliases: AU020185, rpL32-3A Other Designations: 608 ribosomal protein L32; snoRNA MBl-141 Nucleotide seguence: Transcript Variant 1 NCBI Reference Seguence: NM_000994.3 LOCUS NM_000994 ION 994 1 aggggttacg acccatcagc ccttgcgcgc caccgtccct tctctcttcc tcggcgctgc 61 ctacggaggt ggcagccatc tccttctcgg catcatggcc gccctcagac cccttgtgaa 121 gcccaagatc gtcaaaaaga gaaccaagaa gttcatccgg caccagtcag accgatatgt 181 caaaattaag cgtaactggc ggaaacccag aggcattgac aacagggttc gtagaagatt 465465 24" caagggccag atcttgatgc ccaacattgg aagc aacaaaaaaa caaagcacat " gctgcccagt ggcttccgga tggt ccacaacgtc aaggagctgg aagtgctgct 36" gatgtgcaac aaatcttact gtgccgagat cgctcacaat gtttcctcca agaaccgcaa 42" agccatcgtg gaaagagctg cccaactggc catcagagtc accaacccca atgccaggct 48" gcgcagtgaa gaaaatgagt aggcagctca tgtgcacgtt ttctgtttaa ataaatgtaa 54" ccat ctggcatctt ccttccttga ttttaagtct tcagcttctt ggccaactta 60" gtttgccaca gagattgttc ttttgcttaa gcccctttgg aatctcccat ttggagggga 66" tttgtaaagg acactcagtc cttgaacagg ggaatgtggc ctcaagtgca cagactagcc 72" ttagtcatct ccagttgagg ctgggtatga ggggtacaga cttggccctc acaccaggta 78" ggttctgaga cacttgaaga agcttgtggc tcccaagcca caagtagtca ttcttagcct 84" tgcttttgta aagttaggtg acaagttatt ccatgtgatg cttgtgagaa ttgagaaaat 90" atgcatggaa agat gaatttctta cacagattct tacgggatgc ctaaattgca 96" tcctgtaact caaa aagaacagga tgatgtacaa attgctcttc caggtaatcc "02" accacggtta actggaaaag cactttcagt ctcctataac cctcccacca gctgctgctt "08" taat gttacagcag tttgccaagg cggggaccta gaca attgagcctc "l4" ggta ctcagaattt agtgacacgt ggtcctgatt ttttttggag acggggtctt "20" gctctcaccc aggctgggag tgcagtggca cactgactac agccttgacc tccccaggct "26" caggtgatct ctca gccttccaag ggac tacagatgca cacctccaaa "32" cctgggtagt ttttgaagtt taga ggtggtctag tgcc taggctcccg "38" aactcctgag ctcaagcaat cctgcttcag cctcccaaag tactgggatt acaggcatct "44" tctgtagtat ataggtcatg agggatatgg gatgtggtac ttatgagaca gaaatgctta "50" caggatgttt ttctgtaacc atcctggtca acttagcaga aatgctgcgc tgggtataat "56" aaagcttttc tacttctagt ctagacagga atcttacaga ttgtctcctg ttcaaaacct "62" agtcataaat atttataatg caaactggtc aaaaaaaaaa aaaaaaaa Protein seguence: ript Variant 1.
NCBI Reference Seguence: NP_000985.1 LOCUS NP_000985 ACCESSION N P_000985 l maalrplvkp tkkf irhqsdryvk kprg idnrvrrrfk gqilmpnigy 466466 61 gsnkktkhml psgfrkflvh nvkelevllm cnksycaeia hnvssknrka qlai 121 rvtnpnarlr seene Nucleotide seguence: Transcript Variant 2.
NCBI Reference Seguence: NM_001007073.1 LOCUS NM_001007073 ION NM_001007073 1 aggggttacg acccatcagc ccttgcgcgc ccct ttcc tcggcgctgc 6" ctacggaggt ggcagccatc tccttctcgc tggcgattgg aagacactct gcgacagtgt 12" tcagtccctg ggcaggaaag cctccttcca ggattcttcc tcacctgggg ccgcttcttc 18" cccaaaaggc atcatggccg ccctcagacc ccttgtgaag atcg agag 24" aaccaagaag ttcatccggc accagtcaga ccgatatgtc aaaattaagc gtaactggcg " gaaacccaga ggcattgaca acagggttcg attc aagggccaga tcttgatgcc 36" caacattggt tatggaagca acaaaaaaac aaagcacatg ctgcccagtg gcttccggaa 42" gttcctggtc gtca aggagctgga agtgctgctg atgtgcaaca aatcttactg 48" gatc gctcacaatg tttcctccaa gaaccgcaaa gccatcgtgg aaagagctgc 54" ccaactggcc atcagagtca ccaaccccaa tgccaggctg cgcagtgaag aaaatgagta 60" ggcagctcat gtgcacgttt tctgtttaaa taaatgtaaa aactgccatc tggcatcttc 66" cttccttgat tttaagtctt cagcttcttg gccaacttag tttgccacag agattgttct 72" tttgcttaag cccctttgga atctcccatt tggaggggat ttgtaaagga cactcagtcc 78" ttgaacaggg gaatgtggcc tcaagtgcac agactagcct tagtcatctc cagttgaggc 84" tgggtatgag gggtacagac ttggccctca caccaggtag gttctgagac agaa 90" gcttgtggct cccaagccac aagtagtcat tcttagcctt gcttttgtaa agttaggtga 96" caagttattc catgtgatgc ttgtgagaat tgagaaaata tgcatggaaa tatccagatg 102" aatttcttac acagattctt acgggatgcc taaattgcat cctgtaactt ctgtccaaaa 108" agaacaggat gatgtacaaa ttgctcttcc aggtaatcca ccacggttaa ctggaaaagc 114" actttcagtc tcctataacc ctcccaccag ctgctgcttc aggtataatg cagt 120" ttgccaaggc ctaa ctggtgacaa ttgagcctct tgactggtac tcagaattta 126" cgtg gtcctgattt tttttggaga cttg ccca ggctgggagt 132" gcagtggcac actgactaca gccttgacct ccccaggctc aggtgatctt cccacctcag 467467 "38" ccttccaagt agctgggact acagatgcac acctccaaac agtt tttgaagttt "44" ttttgtagag tagc catgttgcct aggctcccga actcctgagc tcaagcaatc "50" ctgcttcagc ctcccaaagt actgggatta caggcatctt ctgtagtata taggtcatga "56" gggatatggg atgtggtact tatgagacag aaatgcttac aggatgtttt tctgtaacca "62" tcaa cttagcagaa atgctgcgct gggtataata aagcttttct acttctagtc "68" tagacaggaa tcttacagat tgtctcctgt tcaaaaccta gtcataaata tttataatgc "74" gtca aaaaaaaaaa aaaaaaa Protein seguence: Transcript Variant 2.
NCBI nce Seguence: NP_001007074.1 LOCUS NP_001007074 ACCESSION NP_001007074 1 maalrplvkp kivkkrtkkf irhqsdryvk ikrnwrkprg idnrvrrrfk gqilmpnigy 61 gsnkktkhml psgfrkflvh nvkelevllm cnksycaeia hnvssknrka iveraaqlai 121 rvtnpnarlr seene Nucleotide seguence: Transcript Variant 3.
NCBI Reference Seguence: NM_001007074.1 LOCUS NM_001007074 ION NM_001007074 1 gacctcctgg gatcgcatct ggagagtgcc tagtattctg ccagcttcgg aaagggaggg 6" aaagcaagcc tggcagaggc acccattcca ttcccagctt tagc tggcgattgg 12" aagacactct gcgacagtgt tcagtccctg ggcaggaaag cctccttcca ggattcttcc 18" tcacctgggg cttc cccaaaaggc atcatggccg ccctcagacc ccttgtgaag 24" cccaagatcg tcaaaaagag aaccaagaag ttcatccggc accagtcaga ccgatatgtc " aaaattaagc gtaactggcg gaaacccaga ggcattgaca acagggttcg tagaagattc 36" caga tcttgatgcc caacattggt agca aaac aaagcacatg 42" ctgcccagtg gcttccggaa gttcctggtc cacaacgtca aggagctgga agtgctgctg 48" atgtgcaaca aatcttactg tgccgagatc gctcacaatg tttcctccaa caaa 54" gccatcgtgg aaagagctgc ccaactggcc gtca ccaaccccaa tgccaggctg 468468 60" cgcagtgaag aaaatgagta tcat gtgcacgttt tctgtttaaa taaatgtaaa 66" aactgccatc tggcatcttc cttccttgat tttaagtctt cagcttcttg gccaacttag 72" tttgccacag agattgttct tttgcttaag cccctttgga atctcccatt tggaggggat 78" ttgtaaagga cactcagtcc ttgaacaggg gaatgtggcc tcaagtgcac agactagcct 84" tagtcatctc cagttgaggc tgggtatgag agac ttggccctca caccaggtag 90" gttctgagac acttgaagaa gcttgtggct cccaagccac aagtagtcat tcttagcctt 96" gcttttgtaa agttaggtga caagttattc catgtgatgc gaat tgagaaaata "02" tgcatggaaa tatccagatg aatttcttac acagattctt acgggatgcc gcat "08" cctgtaactt aaaa agaacaggat gatgtacaaa ttgctcttcc aggtaatcca "14" ttaa ctggaaaagc actttcagtc tcctataacc ctcccaccag ctgctgcttc "20" aggtataatg ttacagcagt ttgccaaggc ggggacctaa ctggtgacaa ttgagcctct "26" tgactggtac tcagaattta gtgacacgtg gtcctgattt tttttggaga ngggtcttg "32" ccca ggctgggagt gcac actgactaca gccttgacct ccccaggctc "38" aggtgatctt cccacctcag ccttccaagt agctgggact acagatgcac aaac "44" ctgggtagtt tttgaagttt ttttgtagag gtggtctagc catgttgcct aggctcccga "50" actcctgagc tcaagcaatc ctgcttcagc ctcccaaagt actgggatta caggcatctt "56" ctgtagtata atga gggatatggg atgtggtact tatgagacag ttac "62" aggatgtttt tctgtaacca tcctggtcaa cttagcagaa atgctgcgct gggtataata "68" aagcttttct acttctagtc tagacaggaa tcttacagat tgtctcctgt ccta "74" gtcataaata tttataatgc gtca aaaaaaaaaa aaaaaaa Protein seguence: Transcript t 3.
NCBI Reference Seguence: NP_001007075.1 LOCUS NP_001007075 ACCESSION N P_001007075 1 maalrplvkp kivkkrtkkf irhqsdryvk ikrnwrkprg idnrvrrrfk gqilmpnigy 61 gsnkktkhml psgfrkflvh nvkelevllm cnksycaeia hnvssknrka iveraaqlai 121 rvtnpnarlr seene 469469 Gene Official Sym bol: ATP5H al Name: ATP synthase, H+ orting, mitochondrial Fo complex, subunit d Gene ID: 10476 Organism: Homo sapiens Other Aliases: My032, ATPQ Other Designations: ATP synthase D chain, ondrial; ATP se subunit d, mitochondrial; ATP se, H+ transporting, mitochondrial F0 complex, t d; ATP synthase, H+ transporting, mitochondrial F1 F0, subunit d; ATPase subunit d; My032 protein Nucleotide seguence: M B NCBI Reference Seguence: NM_001003785.1 LOCUS NM_001003785 ACCESSION NM_001003785 l tgacccactt ccgttacttg ctgcggagga ccgtgggcag ccagggtcgg tgaaggatcc 6" caaaatggct gggcgaaaac ttgctctaaa aaccattgac tgggtagctt ttgcagagat 12" cataccccag aaccaaaagg ccattgctag ttccctgaaa tcctggaatg agaccctcac 18" ctccaggttg gctgctttac ctgagaatcc accagctatc gactgggctt actacaaggc 24" caatgtggcc aaggctggct tggtggatga ctttgagaag aaggtgaaat cttgtgctga " gtgggtgtct ctctcaaagg ccaggattgt agaatatgag aaagagatgg agaagatgaa 36" gaacttaatt ccatttgatc agatgaccat tgaggacttg aatgaagctt tcccagaaac 42" caaattagac aagt atccctattg gcctcaccaa ccaattgaga atttataaaa 48" ttgagtccag gaggaagctc tggcccttgt attacacatt ctggacatta aaaataataa 54" ttatacagtt aaaaaa n seguence: ISOFORM B NCBI Reference ce: NP_001003785.1 LOCUS NP_001003785 ACCESSION NP_001003785 l magrklalkt idwvafaeii pqnqkaiass lkswnetlts rlaalpenpp aidwayykan 470470 61 vddf ekkvkscaew vslskarive kmkn tie dlneafpetk 121 ldkkkypywp l Nucleotide seguence: ISOFORM A NCBI Reference Seguence: NM_006356.2 LOCUS 356 ION NM_006356 1 tgacccactt cttg ctgcggagga ccgtgggcag ccagggtcgg tgaaggatcc 6" caaaatggct aaac ttgctctaaa aaccattgac tgggtagctt ttgcagagat 12" cataccccag aaccaaaagg ccattgctag ttccctgaaa tcctggaatg agaccctcac 18" ctccaggttg ttac ctgagaatcc accagctatc gactgggctt actacaaggc 24" caatgtggcc aaggctggct tggtggatga ctttgagaag aagtttaatg cgctgaaggt " tcccgtgcca gaggataaat atactgccca ggtggatgcc gaagaaaaag aagatgtgaa 36" atcttgtgct gagtgggtgt ctctctcaaa ggccaggatt gtagaatatg agaaagagat 42" ggagaagatg aagaacttaa ttccatttga tcagatgacc attgaggact tgaatgaagc 48" tttcccagaa accaaattag acaagaaaaa gtatccctat tggcctcacc aaccaattga 54" gaatttataa aattgagtcc aggaggaagc tctggccctt gtattacaca ttctggacat 60" taaaaataat aattatacag ttaaaaaa Protein seguence: ISOFORM A NCBI Reference Seguence: 347.1 LOCUS NP_006347 ACCESSION NP_006347 1 magrklalkt idwvafaeii pqnqkaiass lkswnetlts rlaalpenpp aidwayykan 61 vakaglvddf ekkfnalkvp vpedkytaqv daeekedvks caewvslska keme 121 kmknlipqu mtiedlneaf petkldkkky pywphqpien 1 PSMA1 Official Sym bol: PSMA1 Official Name: proteasome (prosome, macropain) subunit, alpha type, 1 471471 Gene ID: 5682 Organism: Homo sapiens Other s: HC2, NU, PROS30 Other Designations: 30 kDa prosomal protein; 0; macropain subunit C2; macropain subunit nu; multicatalytic endopeptidase x subunit C2; some component C2; proteasome nu chain; proteasome subunit alpha type-1; proteasome subunit nu; some subunit, alpha-type, 1; protein P30- tide seguence: ISOFORM 3 NCBI Reference ce: NM_001143937.1 LOCUS NM_001143937 ACCESSION NM_001143937 l gatatctctg gaatagactg cgctaccctg cgccgccgcc gtcaaactcc cgcagacttc 6" tctgtagatc gctgagcgat actttcggca gcacctcctt gattctcagt tttgctggag 12" gccgcaacca ggcccgcgcc gccaccatgt ttcgaaatca gtatgacaat gatgtcactg 18" tttggagccc ccagggcagg attcatcaaa ttgaatatgc aatggaagct caag 24" gttcagccac agttggtctg aaatcaaaaa ctcatgcagt tttggttgca ttgaaaaggg " cgcaatcaga gcttgcagct catcagaaaa aaattctcca tgttgacaac catattggta 36" tctcaattgc ggggcttact gctgatgcta gactgttatg taattttatg cgtcaggagt 42" gtttggattc cagatttgta agac cactgcctgt gtctcgtctt gtatctctaa 48" ttggaagcag tatccttttt atgttagcat ttatggatat gaactttgaa gggttttgat 54" acttgtgtta attattagga atataataat aatatgacat aggtaagatt gtgaaaactt 60" taaaacaaca aattggattg ctctttcatt tata agcaatttat atttgctaga 66" cacaaataag cccaacttca ggaaaatcat ctaagcatct ttttagaggg gatttaaagt 72" ttcttaatgg ttctagatgt cccaagaaat cctagacccc ttgtatccaa aacaaatcag 78" gttttagatg aatt attttgctgg cactctttct taggttcggt agaaagtcaa 84" acattttata ttaggccaaa gaaatagtgt cctattgcat tatttctctg gtggattatg 90" caacaattaa agcc agagac Protein seguence: ISOFORM 3 NCBI Reference Seguence: NP_001137409.1 472472 LOCUS 137409 ACCESSION N 37409 1 dndv rih qieyameavk qgsatvglks kthavlvalk raqselaahq 61 kkilhvdnhi gisiagltad arllcnfmrq ecldsrfvfd rplpvsrlvs ligssilfml 121 afmdmnfegf Nucleotide seguence: ISOFORM 2 NCBI Reference Seguence: NM_002786.3 LOCUS NM_002786 ACCESSION NM_002786 1 tctg gaatagactg cgctaccctg cgccgccgcc gtcaaactcc cgcagacttc 6" gatc gctgagcgat actttcggca gcacctcctt gattctcagt tttgctggag 12" gccgcaacca ggcccgcgcc gccaccatgt ttcgaaatca gtatgacaat gatgtcactg 18" tttggagccc ccagggcagg attcatcaaa ttgaatatgc aatggaagct gttaaacaag 24" gttcagccac agttggtctg aaatcaaaaa ctcatgcagt tttggttgca aggg " caga gcttgcagct catcagaaaa aaattctcca tgttgacaac ggta 36" tctcaattgc ggggcttact gctgatgcta gactgttatg taattttatg gagt 42" gtttggattc cagatttgta ttcgatagac ctgt tctt gtatc :ctaa 48" ttggaagcaa gacccagata ccaacacaac gatatggccg gagaccatat ggtgt :ggtc 54" tccttattgc tggttatgat gatatgggcc ctcacatttt ccaaacctgt ccatc :gcta 60" actattttga ctgcagagcc atgtccattg gagcccgttc ccaatcagct cgtac :tact 66" tggagagaca tatgtctgaa tttatggagt gtaatttaaa tgaactagtt aaaca :ggtc 72" tgcgtgcctt aagagagacg cttcctgcag aacaggacct gactacaaag aatgt :tcca 78" ttggaattgt tggtaaagac ttggagttta caatctatga tgatgatgat gtgtc :ccat 84" aagg tcttgaagaa agaccacaga gaaaggcaca gcctgctcaa cctgc :gatg 90" aacctgcaga aaaggctgat gaaccaatgg aacattaagt gataagccag tctatatatg 96" tattatcaaa tatgtaagaa tacaggcacc acatactgat gacaataatc tatactttga 102" accaaaagtt gcagagtggt ggaatgctat gttttaggaa tcagtccaga tgtgagtttt ttccaagcaa cctcactgaa acctatataa tggaatacat ttttctttga aagggtctgt 473473 1141 ataatcattt tctagaaagt atgggtatct atactaatgt ttttatatga agaacatagg 1201 tgtctttgtg gttttaaaga caactgtgaa ataaaattgt ttcaccgcct ggtaaaaaaa 1261 aaaaaaaaaa aaaa a Protein seguence: ISOFORM 2 NCBI Reference Seguence: NP_002777.1 LOCUS NP_002777 ACCESSION N P_002777 1 mfrnqydndv rih qieyameavk qgsatvglks kthavlvalk raqselaahq 61 kkilhvdnhi gisiagltad arllcnfmrq ecldsrfvfd rplpvsrlvs ligsktqipt 121 qrygrrpygv glliagyddm gphifqtcps anyfdcrams sart ylerhmsefm 181 ecnlnelvkh etlp aeqdlttknv sigivgkdle ftiyddddvs eerp 241 quaqpaqpa depaekadep meh tide ce: ISOFORM 1 NCBI Reference Seguence: 976.2 LOCUS NM_148976 ACCESSION NM_148976 1 cggccgccca acagggacgc gagccgggac cacgccgacc cagcgtgccc aggccgagga 6" aagcgcggcg gtcc gaagacccac cgggactgaa agagaaggac gaggtcatct 12" tcggacggga ggggcaagcc agccatcctg ggaccccagg cgtgcaggtt ctctttgagg 18" gtattccacc ctgcaaaaag catgtattca tggtcagctc tcagcaaggc cagtagcaga 24" gtggtaaagg ccttggccct ccaaggctgg gaaaagacaa tgacaagtca aatccagacc " tatgttgtat gttggtctac taggtgactg tctcctggaa atgttatgca gctcagcaag 36" gtgaagtttc gaaatcagta tgacaatgat gtcactgttt ggagccccca gatt 42" catcaaattg aatatgcaat ggaagctgtt aaacaaggtt cagccacagt tggtctgaaa 48" tcaaaaactc atgcagtttt ggttgcattg aaaagggcgc aatcagagct tcat 54" cagaaaaaaa atgt tgacaaccat atct caattgcggg gcttactgct 60" gatgctagac tgttatgtaa ttttatgcgt caggagtgtt tggattccag atttgtattc 66" gatagaccac tgcctgtgtc tcgtcttgta tctctaattg gaagcaagac ccagatacca 474474 72" cgat atggccggag accatatggt gt:ggtctcc ttattgctgg ttatgatgat 78" cctc tcca aacctgtcca tc:gctaact attttgactg cagagccatg 84" tccattggag cccgttccca atcagctcgt ac:tacttgg agagacatat gtctgaattt 90" atggagtgta atttaaatga actagttaaa ca:ggtctgc gtgccttaag agagacgctt 96" cctgcagaac aggacctgac gaat gt:tccattg gaattgttgg taaagacttg "02" gagtttacaa tctatgatga tgatgatgtg tc:ccattcc tggaaggtct tgaagaaaga "08" ccacagagaa aggcacagcc tgctcaacct gc:gatgaac ctgcagaaaa ggctgatgaa "14" ccaatggaac attaagtgat aagccagtct atatatgtat tatcaaatat gtaagaatac "20" caca tactgatgac aataatctat actttgaacc aaaagttgca gagtggtgga "26" atgctatgtt ttaggaatca gtccagatgt tttc caagcaacct aacc "32" tatataatgg aatacatttt tctttgaaag ggtctgtata atcattttct agaaagtatg "38" ggtatctata ctaatgtttt tatatgaaga acataggtgt ctttgtggtt ttaaagacaa "44" ctgtgaaata aaattgtttc accgcctggt aaaaaaaaaa aaaaaaaaaa aaaaaaaa Protein ce: ISOFORM 1 NCBI Reference Seguence: NP_683877.1 LOCUS NP_683877 ACCESSION N P_683877 1 mqlskkarn qydndvtvws pqgrihqiey qgsa tvglksktha vlvalkraqs 61 elaahqkkil hvdnhigisi agltadarll cnfmrqecld rplp vsrlvsligs 121 ktqiptqryg rrpygvglli agyddmgphi fqtcpsanyf dcramsigar sqsartyler 181 hmsefmecnl nelvkhglra lretlpaeqd lttknvsigi ftiy ddddvspfle 241 gleerpqua qpaqpadepa ekadepmeh PTBP1 Official Sym bol: PTBP1 Official Name: polypyrimidine tract binding protein 1 475475 Gene ID: 5725 Organism: Homo s Other Aliases: PTB4,pPTB Other Designations: 57 kDa RNA-binding protein PPTB-1; RNA-binding protein; heterogeneous nuclear ribonucleoprotein I; heterogeneous nuclear ribonucleoprotein polypeptide l; hnRNP l; polypyrimidine tract binding protein (heterogeneous nuclear ribonucleoprotein l); polypyrimidine tract-binding protein Nucleotide seguence: ISOFORM A NCBI Reference Seguence: NM_002819.4 LOCUS NM_002819 ACCESSION 819 l tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct 6" attccggcgc ctccactccg tcccccgcgg tctg tgtgccatgg acggcattgt 12" cccagatata gccgttggta caaagcgggg cgag cttttctcta tcac 18" taacggaccg tttatcatga gcagcaactc ggcttctgca gcaaacggaa gcaa 24" gaagttcaaa ggtgacagcc gaagtgcagg ctct agagtgatcc acatccggaa " gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct ttgggaaggt 36" caccaacctc ctga aggggaaaaa ccaggccttc atcgagatga acacggagga 42" ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc gcggccagcc 48" catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 54" gcgggcccag ctgc aggcggtgaa ccag tcggggaacc tggccttggc 60" tgcctcggcg gcggccgtgg acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 66" gatcatcgtg gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc 72" caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 78" gtat cccg tgagcgccca gcacgccaag ctgg acgggcagaa 84" catctacaac gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt 90" caagtacaac aatgacaaga gccgtgacta cacacgccca gacctgcctt ccggggacag 96" ccagccctcg caga ccatggccgc ggccttcggt gcacctggta taatctcagc 102" ctctccgtat gcaggagctg gtttccctcc cacctttgcc attcctcaag ctgcaggcct 476476 "08" ttccgttccg aacgtccacg gcgccctggc ccccctggcc atcccctcgg ngngngC "14" agctgcggcg gcaggtcgga tcgccatccc gggcctggcg ggggcaggaa attctgtatt "20" gctggtcagc aacctcaacc cagagagagt ccaa agcctcttta ttcttttcgg "26" cgtctacggt gacgtgcagc agat cctgttcaat aagaaggaga tagt "32" gcagatggcg gacggcaacc agct ggccatgagc cacctgaacg ggcacaagct "38" gcacgggaag cccatccgca tcacgctctc gaagcaccag aacgtgcagc tgccccgcga "44" gggccaggag gaccagggcc tgaccaagga ctacggcaac tcacccctgc accgcttcaa "50" gaagccgggc tccaagaact tccagaacat attcccgccc tcggccacgc tgcacctctc "56" cccg ccctcagtct ccgaggagga tctcaaggtc ctgttttcca gcaatggggg "62" cgtcgtcaaa ggattcaagt tcttccagaa ggaccgcaag atggcactga tccagatggg "68" ctccgtggag gaggcggtcc aggccctcat gcac aaccacgacc tcggggagaa "74" cctg cgggtctcct tctccaagtc caccatctag gggcacaggc ccccacggcc "80" gggccccctg actt ttcc agagaaaagc cactttaaaa gaag "86" tagc agaccagaga ttttattttt ttaaagagaa atcagtttac ctgtttttaa "92" aaaaattaaa tctagttcac cttgctcacc ctgcggtgac agggacagct caggctcttg "98" gtgactgtgg cagcgggagt tcccggccct ccacacccgg ggccagaccc tcggggccat 204" gccttggtgg ggcc:gtgtc gggcgtgggg cctgcaggtg ggcgccccga ccacgacttg 210" gcttccttgt gcct:aaaaa acctgccttc ctgcagccac acacccaccc ggggtgtcct 216" ggggacccaa gggg:ggggg ggtcacacca gagagaggca gggggcctgg ccggctcctg 222" caggatcatg cagc:ggggc gcggcggccg cggctgcgac accc cagccctcta 228" atcaagtcac gtga:tctcc cccg cccccagggc cttcccttct gcccccaggc 234" gggctccccg ctgc:ccagc tgcggagctg gtcgacataa tctctgtatt atatactttg 240" cagttgcaga cgtc:gtgcc tagcaatat: tccagttgac caaatattct aatctttttt 246" catttatatg caaaagaaat agttttaag: aactttttat agcaagatga tacaatggta 252" tgagtgtaat ctaaacttcc ttgtggtat: accttgtatg ctgttacttt tattttattc 258" cttgtaatta agtcacaggc aggacccag: ttccagagag caggcggggc gtgg 264" gtcaggcaca gggagccccg gtcctatct: agagcccctg agcttcaggg aaggggcggg 270" cgtgtcgccg cctctggcat cgcctccgg: tgccttacac cacgccttca cctgcagtcg 276" cctagaaaac ttgctctcaa ggg: tttttcttcc ttcaaatttt ggaccaaagt 282" ctcatttctg tgttttgcct gcctctgatg cccg gaaggcgggc gctcctcctg 477477 288" tcttctctgt gctctttcta ccgcccccgc gtcctgtccc gggggctctc ctaggatccc 294" ctttccgtaa aagcgtgtaa caagggtgta aatatttata attttttata cctgttgtga 300" aggg gcgc ggttttttat ggtgacacaa attt tgctaacagc 306" aattccaggc tcagtattgt gaccgcggag ggga ccccacgcac attccgttgc 312" cttacccgat ggcttgtgac gcggagagaa ccgattaaaa ccgtttgaga aactcctccc 318" ttgtctagcc ctgtgttcgc tgtggacgct gtagaggcag gttggccagt ctgtacctgg 324" acttcgaata aatcttctgt atcctcgctc cgttccgcct taaaaaaaaa aaaaaaaaaa 330" aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa n seguence: ISOFORM A NCBI Reference Seguence: NP_002810.1 LOCUS NP_002810 ACCESSION N P_002810 l diav gtkrgsdelf stcvtngpfi saan gndskkfkgd srsagvpsrv 6" pidv slgl pfgkvtnllm lkgknqafie mnteeaantm vnyytsvtpv 12" lrgqpiyiqf snhkelktds spnqaraqaa lqavnqusg nlalaasaaa vdagmamagq 18" spvlriiven lfypvtldvl hqifskfgtv lkiitftknn qfqallqyad pvsaqhakLs 24" ldgqniynac ctlridfskl tslnvkynnd ksrdytrpdl psgdsqpsld qtmaaafgap " giisaspyag agfpptfaip qaaglsvpnv hgalaplaip saaaaaaaag riaipglaga 36" vsnl npervtpqsl filfgvygdv qrvkilfnkk enalvqmadg mshl 42" nghklhgkpi ritlskhqnv qlpregqedq gltkdygnsp lhrfkkpgsk nfqnifppsa 48" tlhlsnipps vseedlkvlf ssnggvvkgf kfqudrkma liqmgsveea vqalidlhnh 54" dlgenhhlrv sfsksti Nucleotide seguence: ISOFORM B NCBI Reference Seguence: NM_031990.3 LOCUS NM_031990 ACCESSION NM_031990 l tgcgggcgtc tccgccattt tgtgagtcta ggag ccgttgggtc ggttcctgct 61 attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 478478 12" cccagatata ggta caaagcgggg atctgacgag cttttctcta cttgtgtcac 18" taacggaccg tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa 24" gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc acatccggaa " gctccccatc acgg aagt catctccctg gggctgccct ttgggaaggt 36" caccaacctc ctgatgctga aggggaaaaa ccaggccttc atga acacggagga 42" ggctgccaac accatggtga actactacac gacc cctgtgctgc gcggccagcc 48" catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 54" gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 60" tgcctcggcg gcggccgtgg ggat ggcgatggcc gggcagagcc ccgtgctcag 66" cgtg ctct tctaccctgt gaccctggat gtgctgcacc agattttctc 72" cggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 78" gctgcagtat gcggaccccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 84" catctacaac gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt 90" caagtacaac aaga gccgtgacta cacacgccca gacctgcctt ccggggacag 96" ccagccctcg ctggaccaga ccatggccgc ggccttcgcc tatg caggagctgg "O2" tttccctccc acctttgcca ttcctcaagc tgcaggcctt tccgttccga acgtccacgg "08" cgccctggcc cccctggcca cggc ggcggcggca gctgcggcgg caggtcggat "l4" cgccatcccg ggcctggcgg gggcaggaaa ttctgtattg ctggtcagca acctcaaccc "20" agagagagtc acaccccaaa gcctctttat tcttttcggc gtctacggtg acgtgcagcg "26" cgtgaagatc aata agaaggagaa cgccctagtg cagatggcgg acggcaacca "32" ggcccagctg gccatgagcc acctgaacgg gcacaagctg cacgggaagc ccatccgcat "38" cacgctctcg aagcaccaga acgtgcagct gccccgcgag ggccaggagg accagggcct "44" gaccaaggac tacggcaact cacccctgca ccgcttcaag aagccgggct ccaagaactt "50" ccagaacata ccct cggccacgct gcacctctcc aacatcccgc cctcagtctc "56" cgaggaggat ctcaaggtcc tgttttccag caatgggggc gtcgtcaaag gattcaagtt "62" cttccagaag gaccgcaaga tggcactgat ccagatgggc gagg aggcggtcca .68" ggccctcatt caca accacgacct cggggagaac caccacctgc gggtctcctt "74" ctccaagtcc accatctagg ggcc cccacggccg ggccccctgg cgacaacttc "80" catcattcca agcc actttaaaaa cagctgaagt gaccttagca gaccagagat "86" tttatttttt taaagagaaa tcagtttacc tgtttttaaa aaaattaaat ctagttcacc 479479 192" ttgctcaccc tgcggtgaca gggacagctc aggctcttgg tgactgtggc agcgggagtt 198" cccggccctc cacacccggg gccagaccct catg ccttggtggg gcc g 204" ggcgtggggc ctgcaggtgg cgac cacgacttgg cttccttgtg cct :aaaaaa 210" ttcc tgcagccaca cacccacccg gggtgtcctg gggacccaag ggg :gggggg 216" gtcacaccag agagaggcag tggc ctgc aggatcatgc agc :ggggcg 222" cggcggccgc ggctgcgaca ccccaacccc agccctctaa tcaagtcacg :tctccc 228" ttcaccccgc ccccagggcc ttcccttctg cccccaggcg ggctccccgc :ccagct 234" gcggagctgg tcgacataat ctctgtatta tatactttgc agttgcagac :gtgcct 240" agcaatat:t ccagttgacc aaatattcta atcttttttc atttatatgc aaaagaaata 246" gttttaag:a actttttata gcaagatgat acaatggtat gagtgtaatc taaacttcct 252" tgtggtat ccttgtatgc tgttactttt attttattcc ttgtaattaa ggca 258" ggacccag tccagagagc ggcc gcccagtggg tcaggcacag ggagccccgg 264" tcctatct :a gagcccctga gcttcaggga aggggcgggc gtgtcgccgc ctctggcatc 270" gg:t gccttacacc acgccttcac tcgc ctagaaaact tgctctcaaa 276" cttcaggg:t ttttcttcct tcaaattttg gaccaaagtc tcatttctgt gttttgcctg 282" atgc tgggacccgg aaggcgggcg ctcctcctgt cttctctgtg ctctttctac 288" cgcg tcctgtcccg ggggctctcc taggatcccc tttccgtaaa agcgtgtaac 294" aagggtgtaa atatttataa ttttttatac ctgttgtgag acccgagggg cggcggcgcg 300" gttttttatg gtgacacaaa tgtatatttt gctaacagca attccaggct cagtattgtg 306" accgcggagc cacaggggac cccacgcaca ttccgttgcc ttacccgatg gcttgtgacg 312" cggagagaac cgattaaaac cgtttgagaa actcctccct tgtctagccc tgtgttcgct 318" gtggacgctg tagaggcagg ttggccagtc tgtacctgga cttcgaataa atcttctgta 324" tcctcgctcc gttccgcctt aaaaaaaaaa aaaa aaaaaaaaaa aaaaaaaaaa 330" aaaaaaaaaa aaaaaaaaa n seguence: ISOFORM B NCBI Reference Seguence: NP_114367.1 LOCUS NP_1 14367 ACCESSION N P_114367 480480 l mdgivpdiav delf stcvtngpfi mssnsasaan gndskkfkgd srsagvpsrv 6" ihirklpidv tegevislgl pfgkvtnllm lkgknqafie mnteeaantm vnyytsvtpv l2" lrgqpiyiqf snhkelktds spnqaraqaa lqavnqusg nlalaasaaa vdagmamagq 18" spvlriiven lfypvtldvl hqifskfgtv tknn qfqallqyad pvsaqhakLs 24" ldgqniynac fskl tslnvkynnd ksrdytrpdl psgdsqpsld qtmaaafasp " yagagfpptf aipqaaglsv pnvhgalapl aipsaaaaaa aagriaipgl agagnsvlLv 36" snlnpervtp qslfilfgvy gdvqrvkilf nkkenalvqm adgnqaqlam shlnghklhg 42" kpiritlskh qnvqlpregq kdyg nsplhrfkkp gsknfqnifp psatlhlsni 48" ppsvseedlk vlfssnggvv udr kmaliqmgsv eeavqalidl hnhdlgenhh 54" lrvsfsksti Nucleotide seguence: ISOFORM C NCBI nce Seguence: NM_031991.3 LOCUS NM_031991 ACCESSION NM_031991 l tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc tgct 6" attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 12" cccagatata gccgttggta caaagcgggg atctgacgag cttttctcta cttgtgtcac 18" taacggaccg atga gcagcaactc tgca gcaaacggaa gcaa 24" gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct atcc acatccggaa " gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct ttgggaaggt 36" caccaacctc ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga 42" ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc gcggccagcc 48" catctacatc cagttctcca accacaagga gacc gacagctctc ccaaccaggc 54" gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 60" tgcctcggcg gcggccgtgg acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 66" cgtg gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc 72" caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 78" gctgcagtat gcggaccccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 84" catctacaac gcctgctgca cgctgcgcat cgacttttcc acca gcctcaacgt 481481 90" caagtacaac aatgacaaga gccgtgacta cacacgccca gacctgcctt ccggggacag 96" ccagccctcg ctggaccaga ccatggccgc cggc ctttccgttc cgaacgtcca "O2" cggcgccctg gcccccctgg ccatcccctc ggcggcggcg gcagctgcgg ngcaggtcg "08" gatcgccatc ccgggcctgg cgggggcagg aaattctgta ttgctggtca gcaacctcaa "l4" cccagagaga gtcacacccc aaagcctctt tattcttttc ggcgtctacg gtgacgtgca "20" gcgcgtgaag atcctgttca ataagaagga gaacgcccta gtgcagatgg cggacggcaa "26" ccaggcccag ctggccatga gccacctgaa cgggcacaag ctgcacggga agcccatccg "32" catcacgctc tcgaagcacc agaacgtgca gctgccccgc gagggccagg aggaccaggg "38" cctgaccaag gactacggca actcacccct gcaccgcttc aagaagccgg agaa "44" cttccagaac atattcccgc cctcggccac gctgcacctc tccaacatcc cgccctcagt "50" ctccgaggag aagg tcctgttttc tggg ggcgtcgtca aaggattcaa "56" gttcttccag aaggaccgca agatggcact gatccagatg gtgg aggaggcggt "62" ccaggccctc attgacctgc acaaccacga cctcggggag aaccaccacc tgcgggtctc "68" cttctccaag tccaccatct aggggcacag gcccccacgg ccgggccccc tggcgacaac "74" ttccatcatt ccagagaaaa gccactttaa ctga agtgacctta gcagaccaga "80" attt ttttaaagag aaatcagttt acctgttttt aaaaaaatta gttc "86" accttgctca ccctgcggtg acagggacag ctcaggctct tggtgactgt ggcagcggga "92" gttcccggcc accc ggggccagac cctcggggcc atgccttggt :gtg "98" tcgggcgtgg ggcctgcagg tgggcgcccc gaccacgact tggcttcctt gtgcct:aaa 204" aaacctgcct tcctgcagcc acacacccac ccggggtgtc ctggggaccc aagggg:ggg 210" ggggtcacac cagagagagg cagggggcct ggccggctcc tgcaggatca tgcagc:ggg 216" gcgcggcggc Cgcggctgcg acaccccaac cccagccctc taatcaagtc acgtga:tct 222" cccttcaccc cgcccccagg gccttccctt ctgcccccag gcgggctccc cgctgc:cca 228" gctgcggagc tggtcgacat aatctctgta ttatatactt tgcagttgca gacgtc:gtg 234" cctagcaata tttccagttg tatt ctaatctttt tata tgcaaaagaa 240" atagttttaa gtaacttttt agat gatacaatgg tatgagtgta atctaaactt 246" ccttgtggta ttaccttgta tgctgttact tttattttat taat taagtcacag 252" ccca gtttccagag agcaggcggg gccgcccagt gggtcaggca cagggagccc 258" cggtcctatc ttagagcccc tgagcttcag ggaaggggcg ggcgtgtcgc cgcctctggc 264" tccg ttac accacgcctt cacctgcagt cgcctagaaa acttgctctc 482482 270" aaacttcagg gttttttctt ccttcaaatt ttggaccaaa gtctcatttc tgtgttttgc 276" ctgcctctga gacc cggaaggcgg gcgctcctcc tgtcttctct gtgctctttc 282" taccgccccc gcgtcctgtc ccgggggctc tcctaggatc ccctttccgt aaaagcgtgt 288" aacaagggtg taaatattta taatttttta tacctgttgt gagacccgag gggcggcggc 294" gcggtttttt atggtgacac aaatgtatat tttgctaaca gcaattccag gctcagtatt 300" gcgg agccacaggg gaccccacgc acattccgtt cccg atggcttgtg 306" acgcggagag ttaa aaccgtttga gaaactcctc ctag ccctgtgttc 312" gctgtggacg ctgtagaggc aggttggcca gtctgtacct ggacttcgaa taaatcttct 318" tcgc tccgttccgc cttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 324" aaaaaaaaaa aaaaaaaaaa aa n seguence: ISOFORM C NCBI Reference Seguence: NP_114368.1 LOCUS NP_114368 ACCESSION N 68 l mdgivpdiav gtkrgsdelf stcvtngpfi mssnsasaan fkgd srsagvpsrv 6" ihirklpidv tegevislgl pfgkvtnllm lkgknqafie mnteeaantm vnyytsvtpv 12" lrgqpiyiqf snhkelktds aqaa lqavnqusg nlalaasaaa vdagmamagq 18" spvlriiven lfypvtldvl hqifskfgtv lkiitftknn qfqallqyad pvsaqhakLs 24" ldgqniynac ctlridfskl tslnvkynnd ksrdytrpdl psgdsqpsld qtmaaafgLs " vpnvhgalap laipsaaaaa aaagriaipg lagagnsvll vsnlnpervt pqslfilfgv 36" ygdvqrvkil fnkkenalvq madgnqaqla mshlnghklh gkpiritlsk hqnvqlpreg 42" qedqgltkdy gnsplhrfkk pgsknfqnif ppsatlhlsn ippsvseedl kvlfssnggv 48" qud rkmaliqmgs veeavqalid lhnhdlgenh hlrvsfskst i AP2A1 Official Symbol: AP2A1 al Name: adaptor-related protein complex 2, alpha 1 subunit Gene ID: 160 Organism: Homo sapiens 483483 Other s: ADTAA, AP2—ALPHA, CLAPA1 Other Designations: 100 kDa coated vesicle protein A; AP-2 complex subunit alpha-1; adapter-related protein complex 2 1 subunit; adaptin, alpha A; r protein complex AP-2 subunit alpha-1 ; alpha-adaptin A; alpha1-adaptin; in assembly protein complex 2 alpha-A large chain; clathrin- associated/assembly/adaptor protein, large, alpha 1 ; plasma membrane adaptor HA2/AP2 adaptin alpha A subunit Nucleotide seguence: ISOFORM 1 NCBI Reference Seguence: NM_014203.2 LOCUS 203 ACCESSION NM_014203 1 cggctcagag ctccggaccg cgggcggagg gcag gtgc cacggcctgc 6" cagcccgccc gcccgcccgc cagccagccc tccccgcggc cggctcggct ccttggcgct 12" gcctggggtc ctttccgccc cgct tgccagcccc cgctgctctg gtcc 18" ggccaggcct ggagccgaca ccat catgccggcc gtgtccaagg gcgatgggat 24" gcgggggctc gcggtgttca tctccgacat ccggaactgt aaag aggcggaaat " taagagaatc aacaaggaac tggccaacat ccgctccaag ttcaaaggag acaaagcctt 36" ggatggctac agtaagaaaa aatatgtgtg taaactgctt ttcatcttcc tgcttggcca 42" tgacattgac tttgggcaca tggaggctgt gttg agttccaata aatacacaga 48" aata ggttacctgt tcatttctgt gctggtgaac tcgaactcgg agctgatccg 54" cctcatcaac aacgccatca agaatgacct ggccagccgc aaccccacct tcatgtgcct 60" ggccctgcac tgcatcgcca acgtgggcag ccgggagatg ggcgaggcct ttgccgctga 66" catcccccgc atcctggtgg ccggggacag catggacagt gtcaagcaga gtgcggccct 72" gtgcctcctt cgactgtaca aggcctcgcc tgacctggtg cccatgggcg agtggacggc 78" gcgtgtggta cacctgctca atgaccagca catgggtgtg gtcacggccg ccgtcagcct 84" catcacctgt ctctgcaaga agaacccaga tgacttcaag acgtgcgtct ctctggctgt 90" gtcgcgcctg agccggatcg tctcctctgc ctccaccgac ctccaggact acacctacta 96" cttcgtccca gcaccctggc tctcggtgaa gctcctgcgg ctgctgcagt gctacccgcc 102" tccagaggat gcggctgtga aggggcggct ggtggaatgt ctggagactg tgctcaacaa 108" ggcccaggag ccccccaaat ccaagaaggt gcagcattcc aaga acgccatcct 114" gacc atcagcctca tcatccacta tgacagtgag cccaacctcc tggttcgggc 484484 "20" ccag ctgggccagt tcctgcagca ccgggagacc aacctgcgct acctggccct "26" ggagagcatg tgcacgctgg ccagctccga gttctcccat gtca agacgcacat "32" tgacaccgtc atcaatgccc tcaagacgga gcgggacgtc agcgtgcggc agcgggcggc "38" tgacctcctc tacgccatgt gtgaccggag caag cagatcgtgt cggagatgct "44" gcggtacctg gagacggcag actacgccat ccgcgaggag atcgtcctga aggtggccat "50" cctggccgag aagtacgccg tggactacag ctggtacgtg gacaccatcc tcat "56" ccgcattgcg ggcgactacg tgagtgagga ggtgtggtac cgtgtgctac agatcgtcac "62" tgat cagg gctatgccgc caagaccgtc tttgaggcgc tccaggcccc "68" tcac gagaacatgg tgaaggttgg cggctacatc cttggggagt ttgggaacct "74" gattgctggg gacccccgct ccagcccccc gttc tccctgctcc actccaagtt "80" ccatctgtgc agcgtggcca cgcgggcgct gctgctgtcc acctacatca agttcatcaa "86" cctcttcccc gagaccaagg ccaccatcca cctg cgggccggct cccagctgcg "92" caatgctgac gtggagctgc agcagcgagc cgtggagtac ctca gctcagtggc "98" cagcaccgac gtcctggcca ngtgctgga ggagatgccg cccttccccg agcgcgagtc 204" gtccatcctg gccaagctga aacgcaagaa ggggccaggg gccggcagcg ccctggacga 210" tggccggagg gaccccagca gcaacgacat caacgggggc atggagccca cccccagcac 216" tgtgtcgacg ccctcgccct ccgccgacct cctggggctg gccc cggc 222" agcacccccg gcttctgcag gagcagggaa ccttctggtg gacgtcttcg atggcccggc 228" cgcccagccc agcctggggc ccacccccga ggaggccttc ctcagcgagc tggagccgcc 234" tgcccccgag agccccatgg ctttgctggc tgacccagct ccagctgctg acccaggtcc 240" tgaggacatc ccca ttccggaagc cgatgagttg ctgaataagt ttgtgtgtaa 246" gaacaacggg gtcctgttcg agaaccagct gctgcagatc ggagtcaagt cagagttccg 252" cctg ggccgcatgt atctcttcta tggcaacaag acctcggtgc agttccagaa 258" tttctcaccc actgtggttc acccgggaga gact cagctggctg tgcagaccaa 264" gcgcgtggcg gcgcaggtgg angngCgC gcag caggtgctca atatcgagtg 270" cctgcgggac ttcctgacgc tgct gtccgtgcgc ttccggtacg cccc 276" ccaggccctc accctgaagc tcccagtgac catcaacaag ttcttccagc ccaccgagat 282" ggcggcccag gatttcttcc agcgctggaa gcagctgagc ctccctcaac aggaggcgca 288" gaaaatcttc aaagccaacc accccatgga cgcagaagtt actaaggcca agcttctggg 294" gtttggctct gctctcctgg acaatgtgga ccccaaccct gagaacttcg tgggggcggg 485485 300" ccag actaaagccc tgcaggtggg ctgtctgctt cggctggagc ccaatgccca 306" ggcccagatg taccggctga ccctgcgcac cagcaaggag cccgtctccc gtcacctgtg 312" gctg gcacagcagt tctgagccct ggactctgcc ccgggggatg tggccggcac 318" tgggcagccc cttggactga ggcagttttg gtggatgggg gacctccact ggtgacagag 324" aagacaccag ggtttggggg atgcctggga ctttcctccg gccttttgta tttttatttt 330" tgttcatctg ctgctgttta cattctgggg ggttaggggg agtccccctc cctccctttc 336" ccccccaagc acagagggga gaggggccag ggaagtggat gtctcctccc ctcccacccc 342" accctgttgt agcccctcct tccc catccagggg ctgtgtatta ttgtgagcga 348" ataaacagag agacgctaa Protein seguence: ISOFORM 1 NCBI Reference Seguence: NP_055018.2 LOCUS NP_055018 ACCESSION N P_055018 l mpavskgdgm rglavfisdi rnckskeaei krinkelani rskfkgdkal dgyskkkyvc 6" kllfifllgh didfghmeav nllssnkyte kqigylfisv lvnsnselir linnaikndl 12" asrnptfmcl alhcianvgs remgeafaad iprilvagds mdsvkqsaal cllrlykasp 18" dlvpmgewta rvvhllndqh mgvvtaavsl itclckknpd dfktcvslav srlsrivssa 24" stdlqdytyy lsvk llrllqcypp pedaavkgrl vecletvlnk aqeppkskkv " qhsnaknail fetisliihy dsepnllvra cnqlgqflqh retnlrylal esmctlasse 36" fsheavkthi dtvinalkte rdvsvrqraa dllyamcdrs nakqivseml ryletadyai 42" kvai laekyavdys wyvdtilnli riagdyvsee varvlqivt 48" ktvfealqap achenmvkvg gyilgefgnl iagdprsspp hskf hlcsvatral 54" llstyikfin lfpetkatiq sqlr nadvequra ssva tvLe 60" emppfperes silaklkrkk gpgagsaldd grrdpssndi nggmeptpst vstpspsadl 66" lglraapppa appasagagn llvdvfdgpa aqpslgptpe eaflselepp apespmalLa 72" dpgp edigppipea dellnkfvck nngvlfenql lqigvksefr qnlgrmylzy 78" gnktqufqn fsptvvhpgd lqtqlavqtk rvaaqugga qquvlniec 84" svrfryggap qaltlklpvt inkffqptem qrwk qlslpqqeaq kifkanhpmd 90" kllg fgsalldnvd pnpenfvgag iiqtkalqvg pnaq aqmyrltlrt 96" skepvsrhlc ellaqqf 486486 Nucleotide seguence: ISOFORM 2 NCBI Reference Seguence: NM_130787.2 LOCUS NM_130787 ACCESSION NM_130787 1 cggctcagag ctccggaccg cgggcggagg ggaggggcag ggggcggtgc cacggcctgc 6" cagcccgccc gcccgcccgc cagccagccc tccccgcggc cggctcggct ccttggcgct 12" gcctggggtc ctttccgccc ggtccccgct tgccagcccc cgctgctctg tgccctgtcc 18" ggccaggcct ggagccgaca ccaccgccat catgccggcc gtgtccaagg ggat 24" gcgggggctc gcggtgttca acat ccggaactgt aagagcaaag aggcggaaat " taagagaatc aacaaggaac tggccaacat caag ttcaaaggag acaaagcctt 36" ggatggctac agtaagaaaa aatatgtgtg taaactgctt ttcatcttcc gcca 42" tgacattgac tttgggcaca tggaggctgt gaatctgttg agttccaata aatacacaga 48" gaagcaaata ggttacctgt tcatttctgt gctggtgaac tcgaactcgg agctgatccg 54" cctcatcaac aacgccatca agaatgacct ggccagccgc acct gcct 60" ggccctgcac tgcatcgcca acgtgggcag ccgggagatg ggcgaggcct ttgccgctga 66" catcccccgc atcctggtgg ccggggacag catggacagt gtcaagcaga gtgcggccct 72" gtgcctcctt cgactgtaca aggcctcgcc tgacctggtg cccatgggcg agtggacggc 78" gcgtgtggta cacctgctca atgaccagca catgggtgtg gtcacggccg gcct 84" catcacctgt ctctgcaaga agaacccaga tgacttcaag acgtgcgtct ctctggctgt 90" gtcgcgcctg agccggatcg tctcctctgc ctccaccgac ctccaggact acta 96" cttcgtccca gcaccctggc tctcggtgaa gctcctgcgg ctgctgcagt cgcc "02" tccagaggat gcggctgtga aggggcggct atgt ctggagactg tgctcaacaa "08" ggcccaggag ccccccaaat ccaagaaggt gcagcattcc aacgccaaga acgccatcct "14" cttcgagacc atcagcctca tcatccacta tgacagtgag cccaacctcc tggttcgggc "20" ctgcaaccag cagt tcctgcagca gacc aacctgcgct acctggccct "26" ggagagcatg tgcacgctgg ccagctccga gttctcccat gaagccgtca agacgcacat "32" tgacaccgtc atcaatgccc cgga gcgggacgtc agcgtgcggc agcgggcggc "38" tgacctcctc tacgccatgt gtgaccggag caatgccaag cagatcgtgt tgct "44" gcggtacctg gagacggcag actacgccat ccgcgaggag atcgtcctga aggtggccat 487487 "50" cctggccgag aagtacgccg tggactacag ctggtacgtg gacaccatcc tcaacctcat "56" ccgcattgcg ggcgactacg tgagtgagga ggtgtggtac cgtgtgctac agatcgtcac "62" caaccgtgat gacgtccagg gctatgccgc caagaccgtc tttgaggcgc tccaggcccc "68" tcac gagaacatgg tgaaggttgg catc cttggggagt ttgggaacct "74" tggg gacccccgct ccagcccccc agtgcagttc tccctgctcc actccaagtt "80" gtgc gcca cgct gctgctgtcc acctacatca tcaa "86" cctcttcccc gagaccaagg ccaccatcca gggcgtcctg cgggccggct cccagctgcg "92" caatgctgac gtggagctgc agcagcgagc cgtggagtac ctcaccctca gctcagtggc "98" cagcaccgac gtcctggcca ngtgctgga ggagatgccg cccttccccg agcgcgagtc 204" cctg gccaagctga aacgcaagaa ggggccaggg gccggcagcg acga 210" tggccggagg gaccccagca gcaacgacat caacgggggc atggagccca cccccagcac 216" tgtgtcgacg ccctcgccct ccgccgacct cctggggctg cgggcagccc cggc 222" agcacccccg gcttctgcag gagcagggaa ccttctggtg gacgtcttcg atggcccggc 228" cgcccagccc agcctggggc ccacccccga ggaggccttc ctcagcccag gtcctgagga 234" catcggccct cccattccgg aagccgatga gttgctgaat aagtttgtgt gtaagaacaa 240" cggggtcctg ttcgagaacc agctgctgca gatcggagtc aagtcagagt tccgacagaa 246" cctgggccgc atgtatctct tctatggcaa caagacctcg gtgcagttcc agaatttctc 252" acccactgtg ccgg gagacctcca gactcagctg gctgtgcaga gcgt 258" gcag gtggacggcg gcgcgcaggt gcagcaggtg ctcaatatcg agtgcctgcg 264" ggacttcctg acgcccccgc tgctgtccgt gcgcttccgg tacggtggcg ccccccaggc 270" cctcaccctg aagctcccag tgaccatcaa caagttcttc cagcccaccg agatggcggc 276" ccaggatttc ttccagcgct ggaagcagct gagcctccct caacaggagg cgcagaaaat 282" cttcaaagcc aaccacccca tggacgcaga agttactaag gccaagcttc tggggtttgg 288" ctctgctctc ctggacaatg tggaccccaa ccctgagaac ttcgtggggg cggggatcat 294" ccagactaaa cagg tgggctgtct gcttcggctg gagcccaatg cccaggccca 300" gatgtaccgg ctgaccctgc gcaccagcaa ggagcccgtc tcccgtcacc tgtgtgagct 306" gctggcacag cagttctgag ccctggactc tgccccgggg gatgtggccg gcactgggca 312" gccccttgga ctgaggcagt tttggtggat gggggacctc cactggtgac agagaagaca 318" ccagggtttg ggggatgcct ttcc tccggccttt tgtattttta ttca 324" tctgctgctg tttacattct ggggggttag ggggagtccc cctccctccc tttccccccc 488488 3301 aagcacagag gggagagggg ccagggaagt ctcc tcccctccca ccccaccctg 3361 ttgtagcccc tcctaccccc tcca ggggctgtgt attattgtga gcgaataaac 3421 agagagacgc taa Protein seguence: ISOFORM 2 NCBI Reference Seguence: NP_570603.2 LOCUS NP_570603 ACCESSION N P_570603 l gdgm rglavfisdi rnckskeaei krinkelani rskfkgdkal dgyskkkyvc 6" kllfifllgh didfghmeav nllssnkyte fisv lvnsnselir linnaikndl 12" asrnptfmcl alhcianvgs remgeafaad iprilvagds mdsvkqsaal cllrlykasp 18" dlvpmgewta rvvhllndqh mgvvtaavsl itclckknpd dfktcvslav srlsrivssa 24" stdlqdytyy fvpapwlsvk llrllqcypp pedaavkgrl vecletvlnk aqeppkskkv " qhsnaknail fetisliihy dsepnllvra cnqlgqflqh retnlrylal esmctlasse 36" fsheavkthi dtvinalkte rdvsvrqraa cdrs nakqivseml ryletadyai 42" reeivlkvai laekyavdys lnli riagdyvsee varvlqivt nrddvquaa 48" ktvfealqap achenmvkvg gyilgefgnl iagdprsspp vqfsllhskf hlcsvatral 54" llstyikfin lfpetkatiq sqlr nadvequra veyltlssva stdvlatvLe 60" emppfperes silaklkrkk gpgagsaldd grrdpssndi nggmeptpst vstpspsadl 66" lglraapppa appasagagn dgpa ptpe eaflspgped igppipeade 72" llnkfvcknn gvlfenqllq frqn lgrmylfygn ktqufqnfs ptvvhpgd;q 78" tqlavqtkrv aaquggaqv qulnieclr dfltppllsv rfryggapqa vtin 84" kffqptemaa qdffqrwkql slpqqeaqki fkanhpmdae vtkakllgfg salldnvdpn 90" penfvgagii qtkalqvgcl aqaq myrltlrtsk epvsrhlcel laqqf TTLL12 Official Sym bol: TTLL12 Official Name: tubulin tyrosine ligase-like family, member 12 Gene ID: 23170 489489 Organism: Homo sapiens Other Aliases: dJ526|14.2 Other Designations: tubulin--tyrosine ligase-Iike protein 12 Nucleotide seguence: NCBI Reference Seguence: NM_015140.3 LOCUS 140 ACCESSION NM_015140 1 gccgacggac ggcgggcggc ggcggcggtg ctgg agtcggcgcg ggtgctggcg 6" ccatggaggc Cgagcggggt cgcc ggcctgcgga gcgtagcagc ccgggccaga 12" cgccggagga gggcgcgcag gccttggccg agttcgcggc gctgcacggc ccggcgctgc 18" gcgcttcggg ggtccccgaa cgttactggg tcct gcacaagctg gagcacgagg 24" acgc tggggaagtg tttgggatca tgcaagtgga agaa gagg " acgaggcagc ccgggaggtg cggaagcagc agcccaaccc ggggaacgag ctgtgctaca 36" aggtcatcgt ggag agcgggctcc aggcagccca ccccaacagc atcttcctca 42" tcgaccacgc ctggacgtgc Cgtgtggagc acgcgcgcca gcagctgcag caggtgcccg 48" ggctgctgca ccgcatggcc aacctgatgg gcattgagtt ccacggtgag ctgcccagta 54" cagaggctgt ggtg ctggaggaga tgtggaagtt caaccagacc taccagctgg 60" cccatgggac agctgaggag aagatgccgg tgtggtatat catggacgag ttcggttcgc 66" ggatccagca Cgcggacgtg cccagcttcg ccacggcacc cttcttctac cagc 72" ccta cacgctgctg ctga gggacctgga cactggcgag gaggtgaccc 78" gagactttgc ctacggagag acggaccccc tgatccggaa gtgcatgctg ctgccctggg 84" cccccaccga catgctggac ctcagctctt gcacacccga gccgcccgcc gagcactacc 90" aggccattct ggaggaaaac aaggagaagc tgccacttga catcaacccc gtggtgcacc 96" cccacggcca catcttcaag gtctacacgg acgtgcagca ggtggccagc agcctcaccc 102" acccgcgctt caccctcacc cagagtgagg ccga catcctcttc aacttctcac 108" acttcaagga ctacaggaaa ctcagccagg agaggccagg cgtgctgctg aaccagttcc 114" agaa cctgctgact gtcaaggact gcctggcctc catcgcgcgc nggcaggtg 120" gccccgaggg cccaccctgg ctgccccgaa ccttcaacct gcgcactgag ctgccccagt 126" ttgtcagcta cttccagcag cgggaaaggt ggggcgagga caaccactgg atctgcaagc 490490 "32" cctggaacct ggcgcgcagc ctggacaccc acgtcaccaa gcac agcatcatcc "38" ggcaccgaga gagcaccccc aaggttgtgt ccaagtacat cgaaagtccc gtgttgttcc "44" ttcgagaaga cgtgggaaag gtcaagttcg acatccgcta catcgtgctg ctgcggtcag "50" tgaggcccct acggttgttc gtgtatgatg ggct gcggttctcc gcct "56" ttgcactcaa cgacctggat gactacgaga agcacttcac ggtcatgaac tatgacccgg "62" atgtggtgct gaagcaggtg cactgtgaag agttcatccc cgagtttgag tacc "68" cagaatttcc ctggacggac gtccaggctg agatcttccg ggccttcacg gagctgttcc "74" aggtggcctg tgccaagcca ccacccctgg gcga ctacccctca tcccgggcca "80" tgtatgccgt cgacctcatg ctgaagtggg acaacggccc aagg nggtgatgc "86" agccgcagat cctggaggtg aacttcaacc ccgactgtga gcgagcctgc aggtaccacc "92" ccaccttctt caacgacgtc ttcagcacct tgtttctgga ccagcccggt cacg "98" ttacctgcct tgtctaggca ctcgctgtcc ccaaaacctg tgcttggggc ccaa 204" cctcagttct ctgagctgct tctgcaaagg cccccatgtc cctccccaca ccggccctgg 210" gcatagcctc agccccaggc ctctgtcctg ccgagccatc ctcccggcgc cacactccgg 216" gagcacagca tcctcctctc acctgtgggt agga cagtgatggt aggg 222" ctgagcacca ccccacgccc tgccctcacc cctcaccacc atctgtgcac tgatgagtct 228" ccagtttagc caagggcttt gttcctggca tggagaattt gttcctggct gctgtgtttc 234" cagggggtgc tgggggaagg tgga gcgagacaag gtgtcctcgg gagcagggtt 240" ccaccgggaa gcgtttggga gccctgtatc acacggggca ggcgggtttc tcttccgggg 246" tctctgctct tatgcatcag gacgaccccg ggacggctgt ggggccccac actgcaccca 252" cagggctcta tgcgacaggg gcccaggaac aggc ccag caagcccgcc 258" ttatcaccca ttccagctca cccagaacct tcaccagcaa tgct gaggtcctgg 264" caggaggcca ccgtcttgtt accgtttcct ttgc tgagggtcac agaccccaac 270" agggaaatca gtatctgtct tcccagtggt tgccctgctc gccgggcact ccacggggtc 276" ccgcccttgt gtgagatggg ccaggatcct tcggcaaggg gcgcctgggg ctggggctga 282" ttgtgggcgg gcca gacagaaaag gattccaatg agaacttcag gttaaagtca 288" gatgccacct accagggtct acagtcaaaa tgttggcttt ttcttatttt ttaatgtatg 294" ggagaaaaat gtaaaattcc agttcttttc taattgtgtt tctgaaatta ggagtcagct 300" gccagcgttt ttgtgtggct gcagtgtgcc tgggcccagc tcacgggcag tgggtggacc 306" ccca ggcaggcgag agctacttcc agagccttcc agtgcatggg agggcagggc 491491 3121 taggtgtagc ggtgtctcct ctttgaaatt aagaactatc tttcttgtag caaagctgca 3181 gatg ctcc tctctgtgtt gtctgggccc ttgtttacaa gcacgcgtta 3241 cccttcctga ggggagccat gctctagccc ctggagggcc tgttgcaggg gcagggcggg 3301 cccgtcgcct ttggcagctc ctggagagct gtggacatgc agtccccctc agttcgtgct 3361 aagg ccatcttctc aaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3421 a Protein seguence: NCBI Reference Seguence: 955.1 LOCUS NP_055955 ION N P_055955 1 meaergperr paersspgqt peegaqalae faalhgpalr asgvperng rllhklehev 6" fgim qveeveeeed eaarevrqu pnpgnelcyk vivtresglq aahpnsifLi 12" dhawtcrveh arqqquVpg llhrmanlmg iefhgelpst eavalvleem wkfnqtqua 18" hgtaeekmpv wyimdefgsr iqhadvpsfa tapffympqq vaytllwplr dldtgeevtr 24" dfaygetdpl irkcmllpwa ptdmldlssc tpeppaehyq aileenkekl pldinpvvhp " vytd quvasslth prftltqsea dadilfnfsh fkdyrklsqe rpgvllanp 36" cenlltvkdc ragg pegppwlprt fnlrtelqu vsyqurerw gednhwic<p 42" wnlarsldth vtkslhsiir hrestpkvvs kyiespvlfl redvgkkad lrsv 48" rplrlfvydv fwlrfsnraf alndlddyek hftvmnydpd vvlkqvhcee fipefequp 54" efpwtdvqae ifraftelfq vacakppplg lcdypssram yavdlmlkwd ngpdgrrvmq 60" pqilevnfnp dceracryhp tffndvfstl fldqugchv tclv FERMT2 Official Symbol: FERMT2 Official Name: fermitin family member 2 Gene ID: 10979 Organism: Homo sapiens 492492 Other Aliases: KIND2, MIG2, PLEKHCl, UNCl 12, UNCl12B, mig-2 OflmrDeflgnmbnx McomamMgfamHyCrnmnbertfmnmmfmmw homolog 2; kindlin 2; kindlin-2; mitogen inducible gene 2 protein; mitogen- inducible gene 2 protein; pleckstrin homology domain containing, family C (with FERMdomaM)membmfi;pbd$vmhomobgydomamcomaMMgJamHyC membmfi;pbd$nmhomobgydomaMcomamMgfamWCDmembm Nucleotide seguence: NCBI Reference Seguence: NM_001134999.1 LOCUS 134999 ACCESSION 134999 l gggtggagcg cggggagcca ggcgaggggc cgcgacgacg ggactccatt agccgctccg 6" gccacaggca gcgcttcgcc agccgaggaa g acaccgccgc cccgcgagcc 12" ccct cgcctgttgc agtc CnggCCng agcgctagga gcgcgcggaa 18" ggagccatgg ctctggacgg gataaggatg ccagatggct gctacgcgga nggacgtgg 24" gaactgagtg tccatgtgac ggacctgaac cgcgatgtca ccctgagagt gaccggcgag " gtgcacattg gaggcgtgat gcttaagctg gtggagaaac tcgatgtaaa aaaagattgg 36" tctgaccatg ctctctggtg ggaaaagaag tggc ttctgaagac acattggacc 42" ttagataagt atggtattca ggcagatgct aagcttcagt tcacccctca actg 48" ctccgcctgc agcttcccaa catgaagtat gtgaaggtga aagtgaattt ctctgataga 54" gtcttcaaag ctgtttctga catctgtaag acttttaata tcagacaccc cgaagaactt 60" tctctcttaa agaaacccag agatccaaca aagaaaaaaa agaagaagct agatgaccag 66" gatg aggcacttga attagagggg cctcttatca ctcctggatc tata 72" agcc caggactgta tagtaaaaca ccca cttatgatgc tcatgatgga 78" agccccttgt caccaacttc tgcttggttt ggtgacagtg ctttgtcaga aggcaatcct 84" ggtatacttg ctgtcagtca accaatcacg tcaccagaaa tcttggcaaa aatgttcaag 90" cctcaagctc ttcttgataa agcaaaaatc aaccaaggat ggcttgattc ctcaagatct 96" ctcatggaac aagatgtgaa ggaaaatgag gccttgctgc tccgattcaa gtattacagc 102" ttttttgatt tgaatccaaa gtatgatgca atcagaatca atcagcttta cgagcaggcc 108" aaatgggcca ttctcctgga agagattgaa tgcacagaag aagaaatgat gatgtttgca 114" gccctgcagt atcatatcaa taagctgtca atcatgacat cagagaatca tttgaacaac 493493 "20" agtgacaaag aagttgatga agttgatgct gccctttcag acctggagat tactctggaa "26" gggggtaaaa cgtcaacaat tttgggtgac attacttcca ttcctgaact tgctgactac "32" attaaagttt tcaagccaaa aaagctgact ctgaaaggtt acaaacaata ttggtgcacc "38" ttcaaagaca catccatttc ttgttataag agcaaagaag aatccagtgg agct "44" catcagatga acctcagggg atgtgaagtt accccagatg taaacatttc aggccaaaaa "50" tttaacatta aactcctgat tccagttgca gaaggcatga atgaaatctg ttgt "56" gacaatgaaa aacagtatgc gatg gcagcctgca gattagcctc caaaggcaag "62" accatggcgg acagttctta caacttagaa gttcagaata ttctttcctt tctgaagatg "68" cagcatttaa acccagatcc tcagttaata ccagagcaga tcacgactga tataactcct "74" gaatgtttgg tgtctccccg ctatctaaaa aagtataaga acaagcagcc aggctatata "80" agagatttga taacagcgag aatcttggag gcccatcaga atgtagctca tcta "86" attgaagcca agatgagatt agct tggcagtcac tacctgaatt tggcatcact "92" cacttcattg caaggttcca agggggcaaa aaagaagaac gaat tgcatacaac "98" agactgattc ggatggatgc cagcactgga gatgcaatta aaacatggcg tttcagcaac 204" cagt ggaatgtcaa ctgggaaatc aaaatggtca ccgtagagtt tgcagatgaa 210" gtacgattgt ccttcatttg tactgaagta aaag tggttcatga attcattggt 216" ggctacatat ttctctcaac acgtgcaaaa gaccaaaacg agagtttaga tgaagagatg 222" ttctacaaac ttaccagtgg ttgggtgtga ataggaatac tgtttaatga aactccacgg 228" ccataacaat atttaacttt aaaagctgtt tgttatatgc tgcttaataa agtaagcttg 234" aaatttatca ttttatcatg aaaacttctt tgccttacca ttaa tatgtgcact 240" aaacaagcac taat tgtt taat aaacttgaat ttgtcacaca 246" ttccttaggg attg aaaactgaaa tagtgggcaa atcaggaaca aaccatcact 252" gatttactga tttaagctag ccaaactgta agaaacaagc catctatttt aaagctatcc 258" agggcttaac ctatatgaac tctatttatc atgtctaatg catgtgattt aatgtatgtt 264" gata tcatgtttta aaatatccta cttctggtag aatt cctcccccta 270" cccccaaata aatcaggcat gcaggaggcc tgatatttag catt gtgtttgacc 276" ttgaaggaaa ttag tccgtcgtgc ttgatttgtt tttgtccttg aataagcatg 282" ttatgtatat tgtctcgtgt tttt acaccatatt gtattacact tttagtattc 288" accagcataa tcactgtctg cctaaaatat gcaactcttt gcattacaat atgaagtaaa 294" gttctatgaa gtatgcattt tgtgtaacta atgtaaaaac acaaatttta taaaattgta 494494 300" cagtttttta aaaactactc acaactagca ttaa atgtagcaat ctctgcgtta 306" attaaatgcc tttaagagat ataattaacg tgcagtttta acta aattaagaat 312" gacttcatta tgatcatgat ttgccacaat gtccttaact ctaatgcctg gactggccat 318" gttctagtct ctgt tacaatctgt attggtgcta gtcagaaaat tcctagctca 324" ccaa aagggtgcga gggagaggtg cagt attgttcaat aatccatggt 330" tcaaagactg tgca ttttatttta aataaaagca aaacttttat ttaataaaaa 336" aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_001128471.1 LOCUS NP_001128471 ION N P_001128471 l maldgirmpd gcyadgtwel svhvtdlnrd vtlrvtgevh iggvmlklve kldvkkdwsd 6" halwwekkrt wllkthwtld kygiqadakl qftpqhkllr lqlpnmkyvk vkvnfsdrvf 12" kavsdicktf nirhpeelsl lkkprdptkk kkkklddqse dealelegpl itpgsgsiys 18" spglysktmt ptydahdgsp lsptsawfgd salsegnpgi lavsqpitsp eilakmfkpq 24" kinq gwldssrslm eqdvkeneal llrfkyysff dlnpkydair " ailleeiect faal thinklsim tsenhlnnsd kevdevdaal sdleitlegg 36" ktstilgdit dyik vfkpkkltlk gyquwctfk dtsiscyksk eessgtpahq 42" mnlrgcevtp dvnisqufn ikllipvaeg mneiwlrcdn equahwmaa crlaskgktm 48" adssynlevq nilsflkmqh lnpdpqlipe qittditpec lvsprylkky knkquyird 54" litarileah qnvaqmslie akmrfiqawq slpefgithf iarfqggkke eligiaynrl 60" irmdastgda iktwrfsnmk qwnvnweikm vtvefadevr lsfictevdc kvvhefiggy 66" iflstrakdq nesldeemfy kltsng ANXA6 495495 Official Sym bol: ANXA6 Official Name: annexin A6 Gene ID: 309 Organism: Homo s Other Aliases: ANX6, CBP68 Other Designations: 67 kDa calelectrin; CPB-II; annexin VI (p68); annexin-6; m-binding protein p68; calelectrin; calphobindin II; calphobindin-II; chromobindin-20; lipocortin VI; p68; p70 Nucleotide ce: NCBI Reference Seguence: NM_001155.4 LOCUS NM_001155 ACCESSION NM_001155 1 agaggggtgg ggtggaggag gggc gcgccggatt ggcctctgcg cgccacgtgt 6" ccggctcgga gcccacggct gtcctcccgg tccgccccgc gctgcggttg ctgctgggct 12" aacgggctcc gatccagcga cgtc ctcgagtccc tgcgcccgtg cgtccgtctg 18" cgacccgagg cctccgctgc gcgtggattc tgctgcgaac cggagaccat ggccaaacca 24" gcacagggtg ccaagtaccg gggctccatc catgacttcc caggctttga ccccaaccag " gatgccgagg ctctgtacac gaag ggctttggca gtgacaagga ggccatactg 36" gacataatca cctcacggag caacaggcag aggcaggagg tctgccagag ctacaagtcc 42" ctctacggca aggacctcat tgctgattta aagtatgaat tgacgggcaa acgg 48" ttgattgtgg gcctgatgag gccacctgcc tattgtgatg ccaaagaaat taaagatgcc 54" atctcgggca ttggcactga tgagaagtgc ctcattgaga tcttggcttc ccggaccaat 60" gagcagatgc accagctggt ggcagcatac aaagatgcct acgagcggga cctggaggct 66" gacatcatcg gcgacacctc tggccacttc cagaagatgc ttgtggtcct gctccaggga 72" accagggagg aggatgacgt agtgagcgag gacctggtac aacaggatgt ccta 78" gcag gggaactgaa atggggaaca gccc agttcattta catcttggga 84" aatcgcagca agcagcatct tcggttggtg ttcgatgagt atctgaagac cacagggaag 90" ccgattgaag tccg aggggagctg tctggggact ttgagaagct aatgctggcc 96" gtagtgaagt gtatccggag caccccggaa tattttgctg aaaggctctt tatg 102" aagggcctgg ggactcggga caacaccctg atccgcatca tggtctcccg tagtgagttg 496496 "08" gacatgctcg acattcggga gatcttccgg accaagtatg agaagtccct ctacagcatg "14" atcaagaatg acacctctgg cgagtacaag aagactctgc tgaagctgtc tgggggagat "20" gatgatgctg ctggccagtt cttcccggag gcagcgcagg tggcctatca gatgtgggaa "26" cttagtgcag gagt agagctgaag ggaactgtgc ccaa tgacttcaac "32" cctgacgcag atgccaaagc gctgcggaaa gccatgaagg gactcgggac tgacgaagac "38" acaatcatcg atatcatcac gcaccgcagc aatgtccagc ggcagcagat ccggcagacc "44" ttcaagtctc actttggccg ggacttaatg ctga agtctgagat ctctggagac "50" ctggcaaggc tgattctggg gctcatgatg ccaccggccc attacgatgc caagcagttg "56" aagaaggcca tggagggagc cggcacagat gaaaaggctc ttattgaaat cctggccact "62" cggaccaatg ctgaaatccg ggccatcaat gaggcctata aggaggacta tcacaagtcc "68" ctggaggatg ctctgagctc agacacatct ggccacttca ggaggatcct catttctctg "74" gccacggggc atcgtgagga agaa aacctggacc aggcacggga ccag "80" gtggctgctg agatcttgga aatagcagac acacctagtg gagacaaaac ggag "86" acacgtttca tgacgatcct gtgtacccgg agctatccgc acctccggag ag:cttccag "92" gagttcatca agatgaccaa ctatgacgtg gagcacacca tcaagaagga ga:gtctggg "98" gatgtcaggg atgcatttgt ggccattgtt caaagtgtca agaacaagcc tc:cttcttt 204" aaac tttacaaatc catgaagggt gctggcacag agac tc:gaccagg 210" atcatggtat cccgcagtga gattgacctg ctcaacatcc ggagggaatt gaaa 216" tatgacaagt acca agccattgag ggtgacacct actt cc:gaaggcc 222" ttgctggctc tctgtggtgg tgaggactag ggccacagct ttggcgggca ct :ctgccaa 228" gaaatggtta tcagcaccag ccgccatggc tgat tgttccagct ccagagacta 234" aggaaggggc aggggtgggg ggaggggttg ggttgggctc ttatcttcag tggagcttag 240" gaaacgctcc cactcccacg ggccatcgag ggcccagcac ggctgagcgg aacc 246" gtagccatag atcctgtcca cctccactcc cctctgaccc tcaggctttc ccagcttcct 252" ccccttgcta cagcctctgc cctggtttgg gctatgtcag atccaaaaac atcctgaacc 258" tctgtctgta aaatgagtag tgtctgtact ttgaatgagg gggttggtgg caggggccag 264" ttgaatgtgc tgggcggggt aagg atagtaaatg tgctggggca aactgacaaa 270" tcttcccatc catttcacca cccatctcca tccaggccgc gtac tggaccagga 276" atgc ctgggttcaa atctgcatct gccatgcact tgtttctgac cttaggccag 282" cccctttccc tccctgagtc tctattttct tatctacaat gttg gacaaaaaaa 497497 2881 tcttggcttc ccttctaaca ttaacttcct aaagtatgcc tccgattcat tcccttgaca 2941 ctttttattt ctaaggaaga aataaaaaga gatacacaaa cacataaaca caaaaaaaaa 3001 aa Protein seguence: NCBI Reference Seguence: NP_001146.2 LOCUS NP_001146 ACCESSION N P_001146 l makpaqgaky rgsihdfpgf dpnqdaealy tamkgfgsdk eaildiitsr snrqrqech 6" sykslygkdl iadlkyeltg kferlivglm dake ikdaisgigt dekcliei;a 12" srtnethql vaaykdayer dleadiigdt sghqumlvv llqgtreedd 18" qulyeagel kwgtdeaqfi yilgnrskqh lrlvfdeylk ttgkpieasi rgelsgdfek 24" lmlavvkcir stpeyfaerl fkamkglgtr dntlirimvs rseldmldir yeks " ndts geykktllkl sggdddaagq ffpeaaqvay qmwelsavar velkgtvrpa 36" ndfnpdadak alrkamkglg tdedtiidii thrsnvqrqq irqtfkshfg rdlmtdlkse 42" isgdlarlil glmmppahyd ameg agtdekalie ilatrtnaei raineayked 48" yhksledals sdtsghfrri ghre eggenldqar edaqvaaeil eiadtpsgdk 54" tsletrfmti lctrsyphlr rquefikmt nydvehtikk emsgdvrdaf vknk 60" plffadklyk smkgagtdek tltrimvsrs eidllnirre fiekydkslh qaiegdtsgd 66" flkallalcg ged PSM D4 Official Symbol: PSMD4 al Name: proteasome (prosome, macropain) 268 subunit, Pase, 4 Gene ID: 5710 Organism: Homo sapiens Other Aliases: RP11-126K1.1, AF, AF-1, ASF, MCB1, an10, 85A, pUB-R5 498498 Other Designations: 268 proteasome non-ATPase regulatory subunit 4; 268 proteasome regulatory subunit 85A; RPNf 0 homolog; SSa/antisecretory factor protein; angiocidin; antisecretory factor 1; multiubiquitin chain-binding protein Nucleotide seguence: NCBI Reference Seguence: NM_002810.2 LOCUS NM_002810 ACCESSION NM_002810 l aattggagga gttgttgtta ggccgtcccg gagacccggt cgggagggag gaaggtggca 6" agatggtgtt ggaaagcact tgtg tggacaacag tatg ngaatggag 12" acttcttacc gctg cagc aggatgctgt caacatagtt tgtcattcaa 18" agacccgcag caaccctgag aacaacgtgg gccttatcac taat gactgtgaag 24" tgctgaccac actcacccca gacactggcc gtatcctgtc caagctacat actgtccaac " ccaagggcaa gatcaccttc tgcacgggca tccgcgtggc ccatctggct ctgaagcacc 36" gacaaggcaa gaatcacaag atgcgcatca ttgcctttgt gggaagccca gtggaggaca 42" atgagaagga tctggtgaaa ctggctaaac agaa ggagaaagta aatgttgaca 48" ttatcaattt agag gaggtgaaca cagaaaagct gacagccttt gtaaacacgt 54" tgaatggcaa agatggaacc ggttctcatc tggtgacagt gcctcctggg cccagtttgg 60" ctgatgctct catcagttct ccgattttgg ctggtgaagg tggtgccatg ctgggtcttg 66" gtgccagtga attt ggagtagatc ccagtgctga tcctgagctg gccttggccc 72" ttcgtgtatc tatggaagag cagcggcagc ggcaggagga ggaggcccgg nggcagctg 78" cagcttctgc tgctgaggcc gggattgcta cgactgggac tgaagactca gacgatgccc 84" tgctgaagat gaccatcagc cagcaagagt ttggccgcac tcct gacctaagca 90" gtatgactga ggaagagcag attgcttatg ccatgcagat gtccctgcag ggagcagagt 96" ttggccaggc ggaatcagca gatg ccagctcagc caca tctgagccag 102" ccaaggagga ttac gacgtgatgc aggaccccga gttccttcag agtgtcctag 108" agaacctccc aggtgtggat cccaacaatg aagccattcg aaatgctatg ggctccctgg 114" cctcccaggc ggac ggcaagaagg acaagaagga ggaagacaag aagtgagact 120" ggagggaaag ggtagctgag tagg ggactgcatg ggaagcacgg aatatagggt 126" tagatgtgtg ttatctgtaa cagc ctaaataaag cttggcaact ttttttcctt ttttgcttca aa 499499 n seguence: NCBI Reference ce: NP_002801.1 LOCUS NP_002801 ACCESSION NP_002801 1 mvlestmvcv dnseymrngd flptrlqaqq davnivchsk trsnpennvg litlandcev 61 lttltpdtgr ilsklhtvqp kgkitfctgi rvahlalkhr qgknhkmrii afvgspvedn 121 ekdlvklakr lkkekvnvdi infgeeevnt ekltafvntl ngkdgtgshl vtvppgpsla 181 pila geggamlglg asdfefgvdp sadpelalal rvsmeeqrqr qeeearraaa 241 asaaeagiat tgtedsddal lkmtisqqef grtglpdlss mteeeqiaya gaef 301 gqaesadida ssamdtsepa keeddydvmq dpeflqsvle nlpgvdpnne airnamgsla 361 sqatkdgkkd kkeedkk Official Symbol: COTL1 Official Name: coactosin-like 1 (Dictyostelium) Gene ID: 23406 Organism: Homo sapiens Other Aliases: CLP Other ations: coactosin-like protein Nucleotide seguence: NCBI nce Seguence: NM_021149.2 LOCUS NM_021149 ACCESSION NM_021149 1 cgcgctcgca gctcgcaggc gccgcgtagc cgtcgccacc gccgccagcc cgtgcgccct 61 cggcgcgtac ccgccgcgct cccatccccg ccgccggcca ggggcgcgct cggccgcccc 500500 12" ggacagtgtc ccgctgcggc tccgcggcga tggccaccaa gatcgacaaa gaggcttgcc 18" gggcggcgta caacctggtg cgcgacgacg gctcggccgt catctgggtg acttttaaat 24" atgacggctc caccatcgtc cccggcgagc agggagcgga gtaccagcac ttcatccagc " agtgcacaga tgacgtccgg ttgtttgcct tcgtgcgctt cggg atga 36" gcaagaggtc caagtttgcc acgt ggatcggtga cagc gggctgcagc 42" gcgccaaaac cgggacggac aagaccctgg tgaaggaggt cgtacagaat ttcgctaagg 48" agtttgtgat cagtgatcgg aaggagctgg aggaagattt catcaagagc gagctgaaga 54" aggcgggggg agccaattac gacgcccaga cggagtaacc ccagcccccg ccacaccacc 60" caaa gtcatctgcc tgctccccgg gggagaggac cgccggcctc agctactagc 66" ccaccagccc accagggaga aaagaagcca tgagaggcag cgcccgccac cctgtgtcca 72" ccac gctt cccttagaac cctgccgtgt cctatctcat gacgctcatg 78" gaacctcttt ctttgatctt cttt tctccccctc :tttttgttc taaagaaaag 84" tcattttgat gcaaggtcct gcctgccatc agatccgagg :gcctcctgc ccct 90" tttcctggca tttctcttcc acgcgacgag gtctgcctag ctgc atgacctcac 96" gttgctttcc agagcccggg cctattttgc catctcagtt :tcctggacc ctgcttcctg "02" tgtaccactg aggggcagct ggag ctgtgcccgg :gcctgcagc cttcataagc "08" acacacgtcc attccctact aaggcccaga cctcctggta :ctgccccgg gctccctcat "l4" cccacctcca tccggagttg cctaagatgc atgtccagca :aggcaggat tgctcggtgg "20" tgagaaggtt aggtccggct cagactgaat aagaagagat aaaatttgcc ttaaaactta "26" cctggcagtg gctttgctgc acggtctgaa accacctgtt cccaccctct aaat "32" gtga cacagagaag ggcaaaggtc tgagcccaga gttgacggag ggagtatttc "38" agggttcact gctc ccaaagcgac aagatcgtta gggagagagg cccagggtgg "44" ggactgggaa tttaaggaga gctgggaacg gatcccttag gttcaggaag cttctgtgta "50" agctgcgagg atggcttggg ccgaagggtt gctctgcccg ccgcgctagc tgtgagctga "56" gcaaagccct gggctcacag caccccaaaa gcctgtggct tcagtcctgc gtctgcacca "62" cacattcaaa aggatcgttt tgttttgttt ttaaagaaag gtgagattgg cttggttctt "68" caca tttgatatag ctctttttct gtttttcctt gctcatttcg ttttggggaa "74" tgta ctgtattggg attgtaaaga acatctctgc actcagacag tttacagaaa "80" taaatgtttt ttttgttttt cagaaaaaaa aaaaaaaaaa aaaaaaaaaa 501501 Protein seguence: NCBI Reference Seguence: NP_066972.1 LOCUS NP_066972 ACCESSION N P_066972 1 matkidkeac raaynlvrdd gsaviwvtfk ydgstivpge qgaethfiq qctddvrlfa 61 fvrfttgdam skrskfalit wigenvsglq raktgtdktl vkevvqnfak efvisdrkel 121 eedfikselk kagganydaq te Official : ST13 al Name: suppression of tumorigenicity 13 m: 6767 Organism: Homo sapiens Other Aliases: AAG2, FAM10A1, 4, HIP, HOP, HSPABP, HSPABP1, P48, PROO786, SNCG Other Designations: Hsp70-interacting protein; aging-associated n 2; heat shock 70kD protein binding protein; hsc70-interacting protein; progesterone receptor-associated p48 protein; putative tumor suppressor ST13; rena| carcinoma antigen NY-REN-33; suppression of tumorigenicity 13 protein Nucleotide seguence: NCBI Reference Seguence: NM_003932.3 LOCUS 932 ACCESSION NM_003932 1 gagg agccagcggc ggtt ctagtctgtt tgcg gcagccgccc 61 ccttctgcgc ggtcacgccg cgcc tgggcctgga accgggccgt agccccccca 121 gtttcgccca ccacctccct accatggacc cccgcaaagt gaacgagctt cgggcctttg 181 tgaaaatgtg taagcaggat ccgagcgttc tgcacaccga ggaaatgcgc ttcctgaggg 241 agtgggtgga gagcatgggt ggtaaagtac cacctgctac tcagaaagct aaatcagaag 502502 " ccaa ggaagaaaaa cctgatagta agaaggtgga ggaagactta aaggcagacg 36" aaccatcaag tgaggaaagt gatctagaaa ttgataaaga aggtgtgatt gaaccagaca 42" ctgatgctcc tcaagaaatg ggagatgaaa atgcggagat ggag atgatggatc 48" aggcaaatga taaaaaagtg gctgctattg taaa tgatggtgaa ctccagaaag 54" ccattgactt attcacagat gccatcaagc tgaatcctcg cttggccatt ttgtatgcca 60" agagggccag tgtcttcgtc aaattacaga agccaaatgc tgccatccga gactgtgaca 66" gagccattga aataaatcct gattcagctc acaa gtggcggggg aaagcacaca 72" gacttctagg ccactgggaa gccc atgatcttgc ccttgcctgt aaattggatt 78" atgatgaaga tgctagtgca atgctgaaag aagttcaacc tagggcacag aaaattgcag 84" aacatcggag aaagtatgag cgaaaacgtg aagagcgaga gatcaaagaa gaac 90" agaa ggctcgagaa gagcatgaga agag ggaggaagaa gccagacgac 96" agtcaggagc tcagtatggc tcttttccag gtggctttcc tgggggaatg cctggtaatt "02" ttcccggagg aatgcctgga atgggagggg gcatgcctgg aatggctgga ggac "08" tcaatgaaat tcttagtgat ccagaggttc ttgcagccat gcaggatcca gaagttatgg "14" tggctttcca ggatgtggct cagaacccag caaatatgtc ccag agcaacccaa "20" aggttatgaa tctcatcagt aaattgtcag ccaaatttgg aggtcaagcg taatgtcctt "26" ctgataaata aagcccttgc tgaaggaaaa gcaacctaga tcaccttatg gatgtcgcaa "32" taatacaaac cagtgtacct ctgaccttct catcaagaga gctggggtgc tttgaagata "38" atccctaccc ctctccccca aatgcagctg aagcatttta cagtggtttg gggt "44" attcattcag ataatgtttt cctactagga attacaaact ttaaacactt tctt "50" caaaatattt aaaacaaatt taaagggcct gttaattctt atatttttct ttactaatca "56" ttttggattt ttttctttga ggca gggaatatac ttatgtatgg aagattactg "62" ctctgagtga aataaaagtt attagtgcga ggcaaacata actcatttga ggataaagtt "68" tgtgttggat atgtggttcc tgatgcattt tgacttgtct ttttaaatgc tttatctttt "74" tctttaaaga tttatttcaa taaaactaat tgggaccacc cgtatttcag taggacctgg "80" gtagggattg gaagtacttg gcagggcagc agcaatcttg ctgtgtttga tataacatgc "86" atccttgggc aggttgccct taaatcttac actgtggtga agggatgttt tttttgtaat "92" gctgcagtag agttggagta cttagttctc ttgttgtcca gtatatctaa taagtgtttt "98" tatt tccacgtaag ggaaataagg tttt ctttttatat ttctatgctt 204" aaaattctct ttcctagtca aaaattgccc aaatctgtgt ttgctttctg cttgctacat 503503 210" ttgtctccct tacttttctt gagctaaaga caggcttttt ccaccggcat catcactgct 216" atcatcatta acagcgtaat tatacaagca tatttaatgc tgagtttaat tgta 222" atacatatgg taattgtagg gtaataccca ctgt agtttcttac ttggccaaga 228" gaatgcttat ttaagtgtta gacttccatt ctggcaaaat cttgccttat cagaagacat 234" tggaaagagg gattcccttt gg:gtttggt cttctactta gaaaaaccta ttgcagttag 240" tttatcttgt agtattcatc tt:gtattct gaagataagg tttgaattaa attgatacac 246" ggga accgattttt caat tata aatgagataa tccacagtta 252" ttcattgtgg agttgttgag ac:atgaaag actcattgtc tttgtattca gctcttaaat 258" agtgtaacta tatccccacc tc:gcttgct ttctttccct cccctccaat gataaagaaa 264" atgataaatt ttctgttgtg ca:tcaattc ttattttaaa taagactaag tataggcatt 270" gtacctgaca ttgctacgtt tc:accagtg tttcaattta tagt gtttaaaaac 276" attttcaagg gataaggcct tc:gtacttt gcttatttga agaatcagtg gtaggagcag 282" aaat tctatggagt acatttctaa aataccacat ttctgaaatc ataaataagt 288" ttattcaggt tctaaccctt acac aagcagacag aaatgcatct gttacataaa 294" tgagaaaaag tgct gatggagcat gctttttaaa tcctttaaaa acactcacca 300" tataaacttg catttgagct tgtgtgttct tttgttaatg gttc tcctttctcg 306" aaattgccag cttg gcttaactca agaacagttt cttctggatt ccttatttga 312" tttatttaac ctaattatat tctaatattg caaatattac cataagtggg taaa 318" attcctcttc tgaaaaaaaa aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_003923.2 LOCUS N P_003923 ACCESSION N P_003923 1 mdprkvnelr akamckqdp svlhteemrf lrewvesmgg kvppatqkak eekp 61 dskkveedlk adepsseesd leidkegvie pdtdapqemg denaeiteem mdqandkkva 121 aiealndgel qkaidlftda iklnprlail yakrasvak lqkpnaaird cdraieinpd 181 saqpykwrgk ahrllghwee aahdlalack ldydedasam lkevqpraqk iaehrrkyer 241 kreereiker aree heraqreeea rrqsgaqygs fpggfpggmp gnfpggmpgm 504504 301 gggmpgmagm pglneilsdp evlaaqupe vmvafqdvaq npanmskyqs npkvmnlisk 361 lsakfggqa Gene Official Symbol: SRSF2 (also known as SFR32) Official Name: serine/arginine-rich splicing factor 2 Gene ID: 6427 sm: Homo s Other Aliases: PR264, 80-35, 8035, SFRSZ, SFRS2A, SRpBOb Other Designations: SR splicing factor 2; splicing component, 35 kDa; ng factor SC35; splicing factor, arginine/serine-rich 2 Nucleotide seguence: NCBI Reference Seguence: NM_001195427.1 LOCUS NM_001195427 ACCESSION NM_001195427 l agaaggtttc atttccgggt ggcg ccattttgtg aggagcgata taaacgggcg 6" cagaggccgg ctgcccgccc agttgttact caggtgcgct agcctgcgga gcccgtccgt 12" gctgttctgc ggcaaggcct gtgt ccccacgcgg aaggcaactg cctgagaggc 18" gcggcgtcgc accgcccaga gctgaggaag ccggcgccag ttcgcggggc tccgggccgc 24" cactcagagc tatgagctac ggccgccccc ctcccgatgt ggagggtatg acctccctca " aggtggacaa cctgacctac tcgc ccgacacgct cgtc ttcgagaagt 36" acgggcgcgt cggcgacgtg tacatcccgc gggaccgcta caccaaggag ggct 42" tcgccttcgt tcgctttcac gacaagcgcg agga cgctatggat gccatggacg 48" gggccgtgct ggangCCgC gagctgcggg tgcaaatggc gcgctacggc cgccccccgg 54" actcacacca cagccgccgg ggaccgccac cccgcaggta nggggcggt ggctacggac 60" gccggagccg cagccctagg cggcgtcgcc gcagccgatc ccggagtcgg agccgttcca 66" ggtctcgcag ccgatctcgc tacagccgct cgaagtctcg gtcccgcact cgttctcgat 505505 72" ctcggtcgac ctccaagtcc agatccgcac gaaggtccaa gtccaagtcc tcgtcggtct 78" ccagatctcg ttcgcggtcc aggtcccggt ctcggtccag gagtcctccc ccagtgtcca 84" agagggaatc cagg tcgcgatcga agagtccccc caagtctcct gaagaggaag 90" gagcggtgtc ctcttaagaa aatgatgtat cggcaagcag tgtaaacgga ggacttgggg 96" aaaaaggacc acatagtcca tcgaagaaga gtccttggaa caagcaactg gctattgaaa "O2" tttt tt tctaactttt tacttgttta agctttgcct cagttggcaa "08" acttcatttt atgtgcca:t ttgttgctgt tattcaaatt tcttgtaatt tagtgaggtg "l4" aacgacttca gatttcat ttggatttgg atatttgagg taaaatttca tata "20" tagtgctgac tttttttg:t tgaaattaaa cagattggta acctaatttg tggcctcctg "26" acttttaagg aaaacgtg:g cagccattac acacagccta aagctgtcaa gagattgact "32" cggcattgcc ttcattcc:t aaaattaaaa acctacaaaa gtaa atttgtatat "38" gttatttacc ttcagatc:a aatc tgaacccaaa tttgtataaa gacttttcag "44" gtgaaaagac ttgatttt:t gaaaggattg tttatcaaac acaattctaa tctcttctct "50" tatgtatttt tgtgcactag gcgcagttgt gtagcagttg agtaatgctg tgtt "56" aaggtggcgt gttgcagtgc agagtgcttg gctgtttcct gttttctccc gattgctcct "62" gtgtaaagat gccttgtcgt gcagaaacaa atggctgtcc agtttattaa aatgcctgac "68" aactgcactt ccagtcaccc gggccttgca tataaataac ggagcataca gtgagcacat "74" ctagctgatg ataaatacac ttcc ctcttccccc taaaaatggt aaatctgatc "80" atatctacat gtatgaactt aacatggaaa atgttaagga agcaaatggt tgtaactttg "86" taagtactta taacatggtg tatctttttg aata ttctgtatta taaccattgt "92" ttctgtagtt taattaaaac attttcttgg tgttagcttt tctcagaaaa aaaaaaaaaa "98" aaaaaaaaaa aaaaaaaaaa aaaaaaaa Protein seguence: NCBI Reference Seguence: NP_001182356.1 LOCUS 182356 ACCESSION N P_001182356 l ppdv egmtslkvdn ltyrtspdtl ygrv gdvyiprdry fafv 61 rfhdkrdaed amdamdgavl vqma rygrppdshh srrgppprry ggggygrrsr 506506 121 sprrrrrsrs rsrsrsrsrs rsrysrsksr srtrsrsrst sksrsarrsk sksssvsrsr 181 srsrsrsrsr spppvskres ksrsrskspp kspeeegavs s HNRNPH1 al Symbol: HNRNPH1 Official Name: heterogeneous nuclear ribonucleoprotein H1 (H) Gene ID: 3187 Organism: Homo sapiens Other Aliases: Other Designations: heterogeneous nuclear ribonucleoprotein H Nucleotide seguence: NCBI Reference ce: NM_001257293.1 LOCUS 257293 ACCESSION NM_001257293 1 acaagggacc ttatttaggt tgcgcaggcg cccgctggcc atttcgtctt agccacgcag 6" aagtcgcgtg tctaggtgag tcgcggtggg tcctcgcttg cagttcagcg accacgtttg 12" tttcgacgcc ggaccgcgta agagacgatg atgttgggca ngaaggtgg agagggattc 18" gtggtgaagg tccggggctt gccctggtct gccg atgaagtgca gaggtttttt 24" tctgactgca aaattcaaaa tggggctcaa ggtattcgtt acac cagagaaggc " agaccaagtg gcgaggcttt tgttgaactt gaatcagaag atgaagtcaa cctg 36" aaaaaagaca gagaaactat gggacacaga tatgttgaag tattcaagtc aaacaacgtt 42" gaaatggatt gggtgttgaa gcatactggt ccaaatagtc ctgacacggc caatgatggc 48" tttgtacggc ttagaggact tccctttgga tgtagcaagg aagaaattgt tcagttcttc 54" tcagggttgg aaatcgtgcc gata acattgccgg tggacttcca gagt 60" acgggggagg tgca gtttgcttca caggaaatag ctgaaaaggc tctaaagaaa 66" cacaaggaaa gaatagggca tatt gaaatcttta agagcagtag agctgaagtt 507507 72" agaactcatt atgatccacc acgaaagctt atggccatgc agcggccagg tccttatgac 78" agacctgggg ctggtagagg gtataacagc attggcagag gctt tgagaggatg 84" aggcgtggtg cttatggtgg aggctatgga ggctatgatg attacaatgg ctataatgat 90" ggctatggat ttgggtcaga tagatttgga agagacctca attactgttt ttcaggaatg 96" tctgatcaca gatacgggga tggtggctct actttccaga gcacaacagg acactgtgta "02" cacatgcggg gattacctta cagagctact gagaatgaca attt tttttcaccg "08" ctcaaccctg taca cattgaaatt ggtcctgatg gcagagtaac tggtgaagca "14" gatgtcgagt tcgcaactca tgaagatgct gtggcagcta tgtcaaaaga aaat "20" atgcaacaca gatatgtaga actcttcttg aattctacag caggagcaag cggtggtgct "26" tacgaacaca gatatgtaga actcttcttg aattctacag caggagcaag cggtggtgct "32" tatggtagcc aaatgatggg aggcatgggc ttgtcaaacc gcta cgggggccca "38" gccagccagc agctgagtgg gggttacgga ggcggctacg gtggccagag gagt "44" ggatacgacc aagttttaca ggaaaactcc agtgattttc aatcaaacat tgcataggta "50" accaaggagc agtgaacagc agctactaca gtagtggaag ccgtgcatct atgggcgtga "56" acggaatggg agggttgtct agcatgtcca gtatgagtgg tggatgggga ttga "62" tcgatcctga tcactgactc ttggtcaacc tttt tttttttttt ttctttaaga "68" aaacttcagt ttaacagttt ctgcaataca agcttgtgat ttatgcttac tctaagtgga "74" aatcaggatt gttatgaaga cttaaggccc agtatttttg atac tcatctagga "80" tgtaacagtg aagctgagta aactataact gttaaactta agct tttctcaagt "86" tagttatagg atgtacttaa gcagtaagcg tatttaggta aaagcagttg aattatgtta "92" aatgttgccc tttgccacgt taaattgaac actgttttgg atgcatgttg aaagacatgc "98" ttttattttt ttgtaaaaca atataggagc tgtgtctact attaaaagtg aaacattttg 204" gcatgtttgt taattctagt ttcatttaat aacctgtaag gcacgtaagt ttaagctttt 210" ttttttttta agttaatggg aaaaatttga gacgcaatac caatacttag ggtc 216" ttggtgtttg tatgaaattc tgaggccttg atct ttcattgtat tgtgatttcc 222" ttttaggtat ctaa gtgaaacttg tcaaataaat ttta aaaactgcaa 228" aaaaaaaaaa aaaaaaaaaa aaaa Protein seguence: NCBI Reference ce: NP_001244222.1 508508 LOCUS N P_001244222 ACCESSION N P_001244222 1 mmlgteggeg fvvkvrglpw scsadevqrf fsdckiqnga qgirfiytre grpsgeafve 6" lesedevkla lkkdretmgh ryvevfksnn vemdwvlkht gpnspdtand gfvrlrglpf 12" gcskeeivqf fsgleivpng itlpvdfqgr stgeaqufa sqeiaekalk khkerighry 18" ieifkssrae vrthydpprk lmamqrpgpy drpgagrgyn gfer mrrgaygggy 24" ggyddyngyn dgygfgsdrf grdlnycfsg msdhrygdgg stquttghc vhmrglpyra " tendiynffs vhie igpdgrvtge advefathed avaamskdka nmthyveLf 36" asgg ayehryvelf lnstagasgg aygsqmmggm glanssygg pasqqlsggy 42.. gggyggqssm sgydqvlqen ssdqunia Gene Official Symbol: IQGAP1 Official Name: IQ motif containing GTPase activating protein 1 Gene ID: 8826 Organism: Homo sapiens Other Aliases: HUMORFAOf, SAR1, p195 Other Designations: RasGAP-like with IQ motifs; ras GTPase-activating-like n IQGAPf Nucleotide seguence: NCBI Reference Seguence: NM_003870.3 LOCUS NM_003870 ACCESSION NM_003870 1 ggaccccggc gcgc acttggcagg agctgtagct accgccgtcc gcgcctccaa 61 ggtttcacgg cttcctcagc agagactcgg ccgc cgcc gcagacgagg 121 ttgacgggct gggcgtggcc cact atggctctgt cctggataat gaaagactta 181 ctgcagagga gatggatgaa aggagacgtc agaacgtggc ttatgagtac catt 241 tggaagaagc gaagaggtgg gcat gcctagggga agatctgcct cccaccacag 509509 " aactggagga ggggcttagg aatggggtct ccaa actggggaac ttcttctctc 36" ccaaagtagt gtccctgaaa aaaatctatg atcgagaaca gaccagatac aaggcgactg 42" gcctccactt tagacacact gataatgtga ttcagtggtt gaatgccatg gatgagattg 48" gattgcctaa gattttttac ccagaaacta cagatatcta tgatcgaaag ccaa 54" tcta ctgtatccat gcactcagtt tgtacctgtt caagctaggc ctggcccctc 60" agattcaaga tgga aaggttgact tcacagaaga agaaatcaac aacatgaaga 66" ctgagttgga gaagtatggc atccagatgc ctgcctttag caagattggg ggcatcttgg 72" aact gtcagtggat gaagccgcat tacatgctgc tgttattgct attaatgaag 78" ctattgaccg tagaattcca gccgacacat ttgcagcttt gaaaaatccg aatgccatgc 84" ttgtaaatct tgaagagccc ttggcatcca cttaccagga tatactttac caggctaagc 90" aggacaaaat gacaaatgct aaaaacagga cagaaaactc agagagagaa gttt 96" atgaggagct gctcacgcaa gctgaaattc aaggcaatat aaacaaagtc aatacatttt "02" tagc aaatatcgac ctggctttag aacaaggaga tgcactggcc ttgttcaggg "08" ctctgcagtc accagccctg gggcttcgag gactgcagca tagc gactggtact "14" tgaagcagct cctgagtgat caga agagacagag tggtcagact gaccccctgc "20" agaaggagga gctgcagtct ggagtggatg ctgcaaacag tgctgcccag caatatcaga "26" tggc agcagtagca ctgattaatg ctgcaatcca gaagggtgtt gctgagaaga "32" ctgttttgga gaat cccgaagccc agctgcccca ggtgtatcca tttgccgccg "38" atctctatca gctg gctaccctgc agcgacaaag tcctgaacat aatctcaccc "44" acccagagct ctctgtcgca atgt tgtcatcggt ggccctgatc aacagggcat "50" tggaatcagg agatgtgaat tgga agcaattgag cagttcagtt actggtctta "56" ccaatattga ggaagaaaac tgtcagaggt atctcgatga gttgatgaaa ctgaaggctc "62" aggcacatgc agagaataat gaattcatta catggaatga tatccaagct tgcgtggacc "68" atgtgaacct ggtggtgcaa gaggaacatg agaggatttt agccattggt ttaattaatg "74" aagccctgga tgat gcccaaaaga ctctgcaggc cctacagatt cctgcagcta "80" aacttgaggg agtccttgca gaagtggccc agcattacca agacacgctg attagagcga "86" agagagagaa agcccaggaa atccaggatg agtcagctgt gttatggttg gatgaaattc "92" aaggtggaat ctggcagtcc aacaaagaca cccaagaagc acagaagttt gccttaggaa "98" tctttgccat taatgaggca gtagaaagtg gtgatgttgg caaaacactg cttc 204" gctcccctga cttg tatggagtca tccctgagtg aact taccacagtg 510510 210" atcttgctga agccaagaag aaaaaactgg cagtaggaga taataacagc aagtgggtga 216" gggt aaaaggtgga tattattatt accacaatct ggagacccag gaaggaggat 222" gggatgaacc tccaaatttt gtgcaaaatt ctatgcagct ttctcgggag gagatccaga 228" gttctatctc tggggtgact gccgcatata accgagaaca gctgtggctg gccaatgaag 234" gcctgatcac caggctgcag gctcgctgcc gtggatactt acag gaattccgat 240" ccaggatgaa gaag aaacaaatcc ctgccatcac ctgcattcag tcacagtgga 246" gaggatacaa gcagaagaag gcatatcaag atcggttagc ttacctgcgc tcccacaaag 252" ttgt aaagattcag tccctggcaa ggatgcacca agctcgaaag cgctatcgag 258" atcgcctgca gtacttccgg gaccatataa atgacattat caaaatccag gcttttattc 264" gggcaaacaa agctcgggat gactacaaga ctctcatcaa tgctgaggat cctcctatgg 270" ttgtggtccg aaaatttgtc cacctgctgg accaaagtga ccaggatttt caggaggagc 276" ttgaccttat gaagatgcgg gaagaggtta tcaccctcat tcgttctaac cagcagctgg 282" agaatgacct caatctcatg gatatcaaaa ttggactgct agtgaaaaat aagattacgt 288" tgcaggatgt ggtttcccac agtaaaaaac ttaccaaaaa aaataaggaa cagttgtctg 294" atatgatgat taaa cagaagggag gtctcaaggc tttgagcaag gagaagagag 300" agaagttgga agcttaccag cacctgtttt atttattgca aaccaatccc acctatctgg 306" ccaagctcat ttttcagatg ccccagaaca ccaa gttcatggac tctgtaatct 312" tcacactcta caactacgcg tccaaccagc gagaggagta cctgctcctg cggctcttta 318" agacagcact ggaa atcaagtcga aggtagatca gattcaagag attgtgacag 324" gaaatcctac ggttattaaa atggttgtaa gtttcaaccg tggtgcccgt ggccagaatg 330" ccctgagaca gatcttggcc gtga aggaaattat ggatgacaaa tctctcaaca 336" tcaaaactga ggat aaat cttgggttaa tcagatggag tctcagacag 342" gagaggcaag caaactgccc tatgatgtga cccctgagca ggcgctagct catgaagaag 348" tgaagacacg gctagacagc agga acatgcgggc tgtgacagac aagtttctct 354" ttgt cagctctgtg gacaaaatcc ggat catt gccaaagtgc 360" tgaaggactc gttgcatgag aagttccctg atgctggtga ggatgagctg ctgaagatta 366" ttggtaactt ttat cgatacatga atccagccat tgttgctcct gatgcctttg 372" acatcattga cctgtcagca ggaggccagc ttaccacaga ccga aatctgggct 378" ccattgcaaa aatgcttcag catgctgctt ccaataagat gtttctggga gataatgccc 384" gcat cattaatgaa tccc agtcctacca gaaattcaga cggtttttcc 511511 390" aaactgcttg ccca gagcttcagg ttaa tgtggatgag tactctgatt 396" tagtaaccct caccaaacca gtaatctaca tttccattgg tgaaatcatc aacacccaca 102" ctctcctgtt ccag gatgccattg ctccggagca tcca atccacgaac 108" tgctggacga cctcggcgag gtgcccacca ccct ggaa agctctggca 114" atttaaatga cccaaataag ctgg ctaagacgga agtgtctctc accctgacca 120" acaagttcga cgtgcctgga gatgagaatg cagaaatgga tgctcgaacc atcttactga 126" atacaaaacg tttaattgtg atcc ggttccagcc aggagagacc ttgactgaaa 132" tcctagaaac accagccacc agtgaacagg aagcagaaca tcagagagcc atgcagagac 138" gtgctatccg tgatgccaaa acacctgaca agatgaaaaa gtcaaaatct gtaaaggaag Z44" acagcaacct cactcttcaa gagaagaaag agaagatcca ttta aagaagctaa 150" cagagcttgg aaccgtggac ccaaagaaca aataccagga actgatcaac gacattgcca 156" gggatattcg gaatcagcgg aggtaccgac agaggagaaa ggccgaacta gtgaaactgc Z62" aacagacata cgctgctctg aactctaagg ccacctttta tggggagcag gtggattact Z68" ataaaagcta tatcaaaacc tgcttggata acttagccag caagggcaaa aaaa Z74" agcctaggga aatgaaagga aagaaaagca aaaagatttc tctgaaatat acagcagcaa 180" gactacatga aaaaggagtt gaaa ttgaggacct gcaagtgaat cagtttaaaa 186" atgttatatt tgaaatcagt ccaacagaag aagttggaga cttcgaagtg aaagccaaat Z92" tcatgggagt ggag acttttatgt tacattatca ggacctgctg cagctacagt Z98" atgaaggagt tgcagtcatg aaattatttg atagagctaa agtaaatgtc aacctcctga 504" ttct caacaaaaag ttctacggga agtaattgat cgtttgctgc cagcccagaa 510" ggatgaagga aagaagcacc tcacagctcc tttctaggtc cttctttcct cattggaagc 516" aaagacctag ccaacaacag aatc tgatacactc ccgatgccac atttttaact 522" cctctcgctc tgatgggaca tttgttaccc ttttttcata gtgaaattgt gtttcaggct 528" gacc tttctggttt cttcattttc ttccattact taggaaagag tggaaactcc 534" actaaaattt ctctgtgttg ttacagtctt agaggttgca gtactatatt gtaagctttg 540" gtgtttgttt aattagcaat agggatggta ggattcaaat gtgtgtcatt tagaagtgga 546" agctattagc accaatgaca taaatacata caagacacac aatg tcatgttatt 552" aacagttatt aggttgtcat ttaaaaataa ttta tatttctgtc ccatcaggaa 558" aactgaagga tatggggaat cattggttat cttccattgt gtttttcttt atggacagga 564" gctaatggaa gtca tgttcaaagg aagcatttct agaaaaaagg agataatgtt 512512 570" tttaaatttc attatcaaac ttgggcaatt ctgtttgtgt aactccccga ctagtggatg 576" ggagagtccc attgctaaaa ttcagctact cagataaatt cagaatgggt caaggcacct 582" gcctgttttt gttggtgcac agagattgac caga gagacaattc actccatccc 588" tatggcagag gaatgggtta gccctaatgt agaatgtcat taaa actgttttat 594" atcttaagag tgccttatta aagtatagat gtatgtctta aaatgtgggt gataggaatt 600" ttaaagattt atataatgca tcaaaagcct tagaataaga aaagcttttt ttaaattgct 606" gtat atctgaactc ttgaaactta tagctaaaac actaggattt atctgcagtg 612" ttcagggaga taattctgcc tttaattgtc taaaacaaaa acaaaaccag ccaacctatg 618" ttacacgtga gattaaaacc aattttttcc tttc tccttttttc tcttgctgcc 624" cacattgtgc ctttatttta tgagccccag ttttctgggc ttagtttaaa aaaaaaatca 630" agtctaaaca ttgcatttag aaagcttttg ttcttggata aaaagtcata cactttaaaa 636" aaaaaaaaaa ctttttccag gaaaatatat tgaaatcatg ctgctgagcc tctattttct 642" ttctttgatg ttttgattca gtattctttt atcataaatt tttagcattt aaaaattcac 648" tgatgtacat taagccaata aactgcttta atgaataaca aactatgtag tgtgtcccta 654" ttataaatgc attggagaag tatg agactcttta ctcaggtgca tggttacagc 660" ccacagggag gcatggagtg ccatggaagg attcgccact acccagacct tgttttttgt 666" tgtattttgg aagacaggtt agaa acattttcct cagattaaaa gatgatgcta 672" ttacaactag cattgcctca aaaactggga ccaaccaaag aacc ctgtttcctt 678" aaaagaggct atgaatccca aaggccacat ccaagacagg caataatgag ttac 684" agctccttta ataaaatgtg tcagtaattt taaggtttat ctca ttgc 690" taatgcagaa tagtgtaaaa tgcgcttcaa gaatgttgat gatgatgata tgtg 696" gctttagtag cacagaggat gccccaacaa actcatggcg ttgaaaccac ctca 702" ttactgttat ttattagctg tagcattctc tgtctcctct ctctcctcct ttct 708" cctcgaccag ccatcatgac atttaccatg aatttacttc ctcccaagag tttggactgc 714" ccgtcagatt gttgctgcac atagttgcct ttgtatctct gtatgaaata aaaggtcatt 720" tgttcatgtt aaaaaaaaa Protein seguence: NCBI Reference Seguence: 861.1 LOCUS NP_003861 ION N P_003861 513513 l msaadevdgl gvarphygsv ldnerltaee mderrrqnva yeylchleea krwmeaclge 6" elee glrngvylak lgnffspkvv slkkiydreq trykatglhf rhtdnviqwl 12" namdeiglpk ifypettdiy rciy cihalslylf klglapqiqd lygkvdftee 18" einnmktele kygiqmpafs kiggilanel svdeaalhaa aidr ripadtfaal 24" knpnamlvnl eeplastyqd qdkm tnaknrtens ererdvyeel ltqaeiqgni " nkvntfsala nidlalqud alalfralqs gqu qnsdwylkql lsquqqus 36" gqtdplqkee lqsgvdaans aaqqurrla avalinaaiq kgvaektvle lmnpeaqlpq 42" dlyq kelathrqs pehnlthpel lssv alinralesg dvntvwqus 48" ssvtgltnie eencqrylde lmklkaqaha ennefitwnd iqacvdhvnl vvqeeheril 54" aiglineald egdaqktha lqipaakleg vlaevaqhyq dtlirakrek aqeiqdesav 60" lwldeiqggi qunkdtqea qkfalgifai neavesgdvg ktlsalrspd vglygvipec 66" getyhsdlae akkkklavgd nnskkahwv kggyyyyhnl etqeggwdep pnqunsmql 72" sreeiqssis gvtaaynreq lwlaneglit rlqarcrgyl vrqefrsrmn flkkqipait 78" ciqsqwrgyk qkkayqdrla devv kiqslarmhq arkryrdrlq yfrdhindii 84" kiqafirank arddyktlin aedppmvvvr dqsd qdfqeeldlm kmreevitLi 90" ranqlendl igll vknkithdv vshskkltkk nkeqlsdmmm 96" lskekrekle athlfyllq tnptylakli fqmpanstk fmdsviftly nyaanreey "02" lllrlfktal qeeikskvdq iqeivtgnpt vikmvvsfnr gargqnaqu ilapvvkeim "08" ddkslniktd pvdiykswvn qmesqtgeas klpydvtpeq alaheevktr ldssirnmra "l4" vtdkflsaiv ssvdkipygm rfiakvlkds lhekfpdage dellkiignl npai "20" diid lsaggqlttd qrrnlgsiak mlqhaasnkm flgdnahlsi ineylsqsyq "26" kfrrffqtac dvpelqdkfn vdeysdlvtl tkpviyisig eiinthtlll dhqdaiapeh "32" ndpihelldd lgevptiesl igessgnlnd pnkealakte vsltltnkfd vpgdenaemd "38" artillntkr livdvirfqp getlteilet patseqeaeh qramqrrair daktpdkmkk "44" sksvkedsnl thekkekiq tglkkltelg tvdpknkyqe lindiardir qrrk "50" aelvquqty aalnskatfy gequyyksy iktcldnlas kgkvskkpre mkgkkskkis "56" lkytaarlhe iedl qvanknvif eispteevgd fevkakfmgv qmetfmlhyq "62" dllqlqyegv avmklfdrak ifll nkkfygk 514514 OfibblSwnbd:TECR(aBoknownasGPSN2) Official Name: trans-2,3-enoyI-CoA reductase Qfiflkifl2:9524 Organism: Homo sapiens Other Aliases: GPSN2, MRT14, SC2, TER Other ations: glycoprotein, synaptic 2; synaptic glycoprotein SC2 tide seguence: NCBI Reference Seguence: NM_138501.5 LOCUS NM_138501 ACCESSION NM_138501 XM_001132190 XM_001132196 l ggaggggcgg ggcggacgca cgtt tagtctatcg ctgcggttgc gagcgctgta 6" gggagcctgt gctgtgccgc gcagttaggc agcagcagcc gcggagcagt agccgccgtg 12" ggagggagcc atgaagcatt acgaggtgga gattctggac gcaaagacaa gggagaagct 18" gtgtttcttg gacaaggtgg agccccacgc caccattgcg aaga acctcttcac 24" taagacccat ccgcagtggt accccgcccg ccagtccctc cgcctggacc ccaagggcaa " gtccctgaag gatgaggatg ttctgcagaa gctgcccgtg ggcaccacgg ccacactgta 36" cttccgggac gccc agatcagctg ggtgacggtc ttcctaacag cggg 42" tttc atctacctgc tcttctactt ccgagtgccc ttcatctatg gccacaaata 48" tacg tccagtcggc atacagtggt gcacctcgcc tgcatctgtc actcattcca 54" ctacatcaag cgcctgctgg agacgctctt cgtgcaccgc ttctcccatg gcactatgcc 60" tttgcgcaac atcttcaaga actgcaccta ctactggggc ttcgccgcgt ggatggccta 66" caat caccctctct acactccccc tacctacgga gctcagcagg tgaaactggc 72" gctcgccatc atct gccagctcgg caacttctcc atccacatgg ccctgcggga 78" cctgcggccc gctgggtcca agacgcggaa gatcccatac cccaccaaga tcac 84" gtggctcttc ctgctggtgt cctgccccaa ctacacctac gaggtggggt cctggatcgg 90" tttcgccatc atgacgcagt gtctcccagt ggccctgttc tccctggtgg gcttcaccca 96" gatgaccatc tgggccaagg gcaagcaccg cagctacctg aaggagttcc gggactaccc 102" gcgc atgcccatca tccccttcct gctctgagcg ctcacccctg ctgaggctca 515515 1081 gcccctcaac ccggtggcat ggag gagtggggcc cacagctctc cagcacccgg 1141 aataaagccc gcctgcccca gtcggaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_612510.1 LOCUS NP_612510 ION NP_612510 XP_001132190 XP_001132196 1 mkhyeveild aktreklcfl dkvephatia eiknlftkth pquparqsl rldpkgkslk 61 dedvlqklpv gttatlyfrd lgaqiswvtv flteyagplf iyllfyfrvp fiyghkydft 121 ssrhtvvhla cichsfhyik rlletlfvhr fshgtmplrn ifknctyng faawmayyin 181 ptyg aquklalai fvicqlgnfs ihmalrdlrp kipy ptknpftwlf 241 llvscpnyty evgswigfai mtqclpvalf slvgftqmti wakgkhrsyl kefrdypplr 301 mpiipfll al Sym bol: EH D2 Official Name: EH-domain containing 2 Gene ID: 30846 Organism: Homo sapiens Other Aliases: PAST2 Other Designations: EH domain ning 2; EH domain-containing protein 2; PAST homolog 2 Nucleotide seguence: NCBI Reference Seguence: NM_014601.3 m NM_014601 ACCESSION NM_014601 1 ttttgagggg tcgt cccgcccctc cctcctgtcc tccctcccgt cctccccgct 516516 6" ccgggcccca cccggctcag ccgg acgggaccgc gagcacaggc cgctccgcgg 12" gcgcttcgga cggg accccaccct ctcccagcct gcccagcccg ctgcagccgc 18" cagcgcgccc cgtcggcagc tctccatctg cacgtctctc cgtgaacccc gtgagcggtg 24" tgcagccacc atgttcagct agcg gggcggggca cggggccagc agcccgaggc " catccgcacg gtgacctcgg ccctcaagga gctgtaccgc acgaagctgc tgccgctgga 36" ggagcactac cgctttgggg ccttccactc gccggccctg gaggacgcag acttcgacgg 42" caagcccatg gtgctggtgg ccggccagta cagcacgggc aagaccagct tcatccagta 48" ggag caggaggtgc ccggctcccg cgtggggcct gagcccacca ccgactgctt 54" tgtggccgtc atgcacgggg acactgaggg caccgtgccc ggcaacgccc tcgtcgtgga 60" cccggacaag cccttccgca accc tttcggaaac ctca acaggttcat 66" gtgtgcccag ctccctaatc tgga gagcatcagc atcatcgaca ccccgggtat 72" cctgtcgggt gccaagcaga gagtgagccg cggctacgac ttcccggccg tgctgcgctg 78" gttcgcggag cgcgtggacc tcatcatcct tgat aagc tggagatctc 84" ggacgagttc tcagaggcca tcggcgcgtt gcggggccat gaggacaaga tccgcgtggt 90" gctcaacaag gccgacatgg tggagacgca gcagctgatg cgcgtctacg gcgcgctcat 96" gtgggcgctg ggcaaggtgg tgggcacgcc cgaggtgctg cgcgtctaca tcggctcctt "02" ctggtcccag cccctcctcg tgcccgacaa ccggcgcctc ttcgagctgg aggagcagga "08" cctcttccgc gacatccagg gcctgccccg gcacgcagcc ttgcgcaagc tcaacgacct "l4" ggtgaagagg gcccggctgg tgcgagttca cgcttacatc atcagctacc tgaagaagga "20" gatgccctct gtgtttggga aggagaacaa gaagaagcag ctgatcctca ccgt "26" catctttgcg aagattcagc tggaacatca catctcccct ggggactttc ctgattgcca "32" gaaaatgcag gagctgctga tggcgcacga cttcaccaag tttcactcgc tgaagccgaa "38" gctgctagag gacg tgac catc gccaagctca tgcccctgct "44" gcggcaggag gagctggaga gcaccgaggt gggcgtgcag gggggcgctt ttgagggcac "50" ccacatgggc ccgtttgtgg agcggggacc tgacgaggcc atggaggacg gcgaggaggg "56" ctcggacgac gaggccgagt gggtggtgac caaggacaag tccaaatacg tctt "62" ctacaacctg gcgcctgccg acggcaagct gagcggctcc aaga tggt "68" ggggaccaag ctccccaact cagtgctggg gcgcatctgg aagctcagcg atgtggaccg "74" cgacggcatg ctggatgatg aggagttcgc gctggccagc cacctcatcg aggccaagct "80" ccac gggctgcccg ccaacctgcc ccgtcgcctg gtgccaccct ccaagcgacg 517517 186" ccacaagggc gagt gagccggccc ccctcccatg gccctgctgt ggctccccag 192" ctccagtcgg ctgcacgcac acccctgctc cggctcacac tgcc tgccctccct 198" gcccagctgt aaggaccggg ggtctccctc ctcactaccg ccagacaccc ngtggaagc 204" atttagaggg gaccacggga gggacaaggc gtcc gcccttcaca cctccagcct 210" cacgttcact taggcacatc acacacacac acgc aggcatccat ccatccgtca 216" ttcattcaaa tatttattga gcacctacta tgtgcccagc cctgttctag gcactgggca 222" taga gaacaaaata gacaaataca tctgccctca tggaaggtga cgttcccagg 228" agagggcacc tacacagtca cgcaaacaca cactaattcc tggcagggcc cccagcccct 234" cccctggctg agcagccctg tggctgaaat gactagcaga taaacagacc cccttctgct 240" ccgcttcctc ctgcccagcc cacc ctcaaccggc tccatcacat cctcaggtct 246" cgggaccatg gggggctcag aggggagaca cacc:actgc ttcctcagat gggcccctcc 252" gcagcccctt ctcg gccc ccaa:tctgc ccacacccat ttatttcctt 258" ccttccttcc ttcttttctt tccttccttc cttc:ttttt gtttttgccc ccaattctgc 264" ccatacccat cttt ccttccttcc ttct:ttttg tttttgcccc cagttctgtc 270" cacacccctt cctg tcctgtcctt tctt:ctttt ttgatagaat cttgctctgt 276" cgcccaggct ggagtgcagt ggtgagatct cagc :cactg caacctccac ctcctgggtt 282" gaagtgattc tcgtgcctca gcctcctgag tagc:gggac tgcaggcacg cgccaccacg 288" cccagctaat ttttgtattt gagtagagac gggg:ttcac catgttggcc gtct 294" cgaactccgc atctcaggtg atctgctcgc cttggcctcc caaagtgatg ggattacagg 300" catgagccac cgtgcccggc ttcacaccca tttctttaaa aaggatcccg tagcaggcag 306" aaaagcccct tccatcctgc tcctctgata ctgtgccccc ttggagatat ttccgtcctc 312" cacccacgtg tctgtggctg gaactgccca gcctgctcct ggccccctgg aagcctcccc 318" acagctggta atctggactt aaggattgct gggccaccgc ctctctgcct attc 324" catatttaag tggagcccct acgtagaaag gCCCngggC ttag tctccttttc 330" agggatgtcg gggg agggggttct tggtgctaca gccctctccc cacccctaaa 336" gggacgccga cgctgtttgc cacc acatattagt ccct ggcaggggac 342" cccatggaaa agatggggaa gagcaaaata catggagacg acgcaccctc caggatgctc 348" gctgggattc ccacgcccac cactgtcccc caccccatgg ctgggagggg cctctgaacg 354" gaacagtgtc cccacagagc gaataaagcc aaggcttctt cccaaaaaaa aaaaaaaaaa 360" a 518518 Protein seguence: NCBI Reference Seguence: NP_055416.2 LOCUS NP_055416 ACCESSION N P_055416 l mfswlkrgga rgqqpeairt vtsalkelyr eehy rfgafhspal edadfdgkpm 6" vlvagqystg ktsfiqylle rvgp epttdcfvav mhgdtegtvp gnalvvdpdk 12" pfrklnpfgn tflnrfmcaq lpnqvlesis iidtpgilsg aquvsrgyd wfae 18" rvdliillfd ahkleisdef seaigalrgh edkirvvlnk admvetqqlm rvygalmwal 24" gkvvgtpevl rvyigsfwsq pllvpdnrrl feleeqdlfr rhaa lrklndlvkr " arlvrvhayi isylkkemps vfgkenkkkq lilklpvifa hisp gdfpdchmq 36" ellmahdftk fhslkpklle aldemlthdi aklmplque elestevqu ggafegthmg 42" pfv rgpd a m dg gsdd a wvvtkdk skydeifynl apadgklsgs kaktwmvgtk 48" griw klsdvdrdgm lddeefalas hlieaklegh glpanlprrl vppskrrhkg 54" sae UGP2 Official Sym bol: UGP2 Official Name: UDP-glucose pyrophosphorylase 2 Gene ID: \ Organism: Homo sapiens Other Aliases: UDPG, UDPGP2, UGP1, UGPP2,pHC379 Other Designations: UDP-glucose diphosphorylase; UDP-glucose pyrophosphorylase 1; UDPGP; UGPase 2; UTP--glucosephosphate uridylyltransferase; UTP--glucosephosphate uridylyltransferase 2; UTP- glucose-f -phosphate uridyltransferase; uridyl diphosphate glucose pyrophosphorylase 2 Nucleotide seguence: NCBI Reference Seguence: 001521.1 LOCUS 001521 ACCESSION NM_001001521 519519 l tttccgcatt gaaggggctg ctccgaatgg agggggaggg gaggtgttta ggagaaagta 6" ggggctgtgg ggag ccggctgacg ggtggacaag ggggggttag cagctgggct 12" gtta ggct caaggtgtgc atgtgtgagg gaagagagag agagagaagg 18" gcgcctcaga ggtgactttc agcctgcgag tccc ggggcgccat aaacgccccc 24" aatttcccag ctgctaaagg aagaggaaga caaa gcaatgtctc aagatggtgc " gttc caagaagtca ttcggcaaga gctagaatta aaga aggaactaga 36" aaaaatactc accacagcat catcacatga atttgagcac accaaaaaag acctggatgg 42" atttcggaag ctatttcata tgca agaaaagggg ccttctgtgg attggggaaa 48" aatccagaga ccccctgaag attcgattca accctatgaa aagataaagg ccaggggctt 54" gcctgataat atatcttccg tgttgaacaa actagtggtg gtgaaactca atggtggttt 60" gggaaccagc atgggctgca aaggccctaa aagtctgatt ggtgtgagga atgagaatac 66" ctttctggat ctgactgttc agcaaattga acatttgaat aaaacctaca atacagatgt 72" tcctcttgtt ttaatgaact cttttaacac ggatgaagat accaaaaaaa agaa 78" gtacaatcat tgtcgtgtga aaatctacac tttcaatcaa agcaggtacc cgaggattaa 84" taaagaatct ttacttcctg tagcaaagga cgtgtcttac tcaggggaaa atacagaagc 90" ttggtaccct ccaggtcatg gtgatattta cgccagtttc tctg gattgcttga 96" tacctttata ggagaaggca atat ttttgtgtct aacatagata atctgggtgc "02" cacagtggat ctgtatattc ttaatcatct aatgaaccca cccaatggaa aacgctgtga "08" atttgtcatg gaagtcacaa ataaaacacg tgcagatgta aagggcggga cactcactca "14" atatgaaggc aaactgagac tggtggaaat tgctcaagtg gcac atgtagacga "20" gttcaagtct gtatcaaagt tcaaaatatt taatacaaac aacctatgga tttctcttgc "26" agcagttaaa agactgcagg agcaaaatgc cattgacatg gaaatcattg tgaatgcaaa "32" gactttggat ggaggcctga atgtcattca attagaaact gcagtagggg ctgccatcaa "38" aagttttgag aattctctag gtattaatgt gagc cgttttctgc ctgtcaaaac "44" cacatcagat ctcttgctgg caaa tagt cttaatgcag gatctctgac "50" aatgagtgaa aagcgggaat ttcctacagt gcccttggtt aaattaggca gttcttttac "56" gaaggttcaa gattatctaa gaagatttga acca gatatgcttg atca "62" cctcacagtt tcaggagatg tgacatttgg aaaaaatgtt tcattaaagg gaacggttat "68" catcattgca aatcatggtg acagaattga tatcccacct gtat tagagaacaa "74" gattgtgtct ggaaaccttc gcatcttgga ccactgaaat gaaaaatact gtggacactt 520520 1801 aaataatggg ctagtttctt acaatgaaat gttctctagq attctaaaat aggcaggtac 1861 tttactatgt tactgtaccc tgcagtgttg atttttaaaa tagagttttc tgcagtatgc 1921 ttttagtcta agaaaagcac agatggagca atactttcct tctttgaaga gaatcccaaa 1981 agttagttca tcttaaagtg caatattgtt taatcttaaa actgggcaac tttggaagaa 2041 cttttaacag caat gatgatcact ttgaattgct tgtgatttca aaaataaagc 2101 agtgaagcaa taaaaaaaaa aaaaaaaaa Protein seguence: NCBI Reference ce: NP_001001521.1 LOCUS NP_001001521 ION NP_001001521 1 msngasqfq evirqelels vkkelekilt feht kkdldgfrkl fhrflqekgp 61 svdwgkiqrp pedsiqpyek ikarglpdni ssvlnklvvv klngglgtsm gckgpkslig 121 vrnentfldl tquiehlnk tyntdvplvl dedt kkilqkynhc rvkiytfnqs 181 kesl 1pvakdvsys genteawypp ghgdiyasfy nsglldtfig egkeyifvsn 241 idnlgatvdl yilnhlmnpp ngkrcefvme vtnktradvk ggtltqyegk lrlveiaqvp 301 kahvdefksv ntnn 1wislaavkr lqeqnaidme iivnaktldg glnviqleta 361 vgaaiksfen slginvprsr flpvkttsdl llvmsnlysl nagsltmsek refptvplvk 421 lgssftkqu ylrrfesipd mleldhltvs gdvtfgknvs lkgtviiian hgdridippg 481 avlenkivsg nlrildh UGDH Official : UGDH Official Name: UDP-glucose 6-dehydrogenase Gene ID: 7358 sm: Homo sapiens Other Aliases: GDH, UDP-GICDH, UDPGDH, UGD 521521 Other Designations: UDP-Glc dehydrogenase; UDP-glucose dehydrogenase; uridine diphospho-glucose dehydrogenase Nucleotide seguence: NCBI Reference Seguence: NM_001184700.1 LOCUS: NM_001184700 ACCESSION : NM_001184700 l gaaa tagggacctg ggcc agcg ggagggggag ctaggaagca 6" gctgagggca gaatccagga gggcctggct gcgggggaat gaagcctccg ccttcgcagg 12" caaaagcctt cggg ctcaggcccg ggactcagag tgtaacgcgt ggcagcctga 18" ggcg tgcgccgaga gggagctcag atcgagcggg gcgcgggtgg tgcg 24" gcggcgcggc ccgtaggaag gtgctgtccg aacgatcggg ataggagcgg tccctgcgct " tgctgctggg taca atcatgtttg agaa gatctgttgc atcggtgcag 36" gctatgttgg aggacccaca tgtagtgtca ttgctcatat gtgtcctgaa atcagggtaa 42" cggttgttga tgaa tcaagaatca atgcgtggaa taca cttcctattt 48" atgagccagg actaaaagaa gtggtagaat cctgtcgagg aaaaaatctt tttttttcta 54" ccaatattga tgatgccatc aaagaagctg atcttgtatt tatttctgtg ctgtccaacc 60" ctgagtttct ggga acagccatca taaa gaacccagac ctga 66" ttggagggga tgaaactcca gagggccaga gagctgtgca ggccctgtgt gctgtatatg 72" agcactgggt tcccagagaa aagatcctca ccactaatac ttggtcttca gagctttcca 78" cagc aaatgctttt cttgcccaga gaataagcag cattaactcc ataagtgctc 84" tgtgtgaagc aacaggagct gatgtagaag aggtagcaac agcgattgga atggaccaga 90" gaattggaaa caagtttcta aaagccagtg ttgggtttgg tgggagctgt ttccaaaagg 96" atgttctgaa tttggtttat ctctgtgagg ctctgaattt gccagaagta gctcgttatt 102" ggcagcaggt catagacatg aatgactacc agaggaggag gtttgcttcc cggatcatag 108" atagtctgtt taatacagta actgataaga agatagctat tttgggattt gcattcaaaa 114" aggacactgg tgatacaaga gaatcttcta gtatatatat tagcaaatat ttgatggatg 120" aaggtgcaca tctacatata tatgatccaa aagtacctag ggaacaaata gttgtggatc 126" tttctcatcc ttca gaggatgacc aagtgtcccg gctcgtgacc atttccaagg 132" atccatatga agcatgtgat ggtgcccatg ctgttgttat ttgcactgag tgggacatgt 522522 "38" ttaaggaatt ggattatgaa cgcattcata aaaaaatgct agcc tttatcttcg "44" atggacggcg ggat gggctccaca atgaactaca aaccattggc ttccagattg "50" aaacaattgg caaaaaggtg tcttcaaaga gaattccata tgctccttct ggtgaaattc "56" cgaagtttag tcttcaagat ccacctaaca agaaacctaa agtgtagaga ttgccatttt "62" tatttgtgat tttttttttt tttttttggt acttcaggat agcaaatatc tatctgctat "68" taaatggtaa atgaaccaag tgtttttttt tgtttttttt ttgagacaga gtctcactgt "74" tgcccaggct ggagtgcagt ggtgcaatct cggctcactg caagctctgc ttcccaggtt "80" cacgccattc tcctggctca gcctcccaag tagctgggac cacc cgccacagtg "86" cctggctaat tttttgtatt agag acagggtttc accatgtgag ccaggatggt "92" ctcaatctcc tgaccttgtg aaccacccgt ctcggcctcc caaagtgctg ggattacagg "98" tgtgagccac cacgcctggc acca agtgttttta aggaaacaaa actatttttt 204" taatcatcag atttatacta gctatatgga tattagcata tctggtaatt atgaatctag 210" aattttttta catattttta taatactgtt agctcagtta ttggatgagt taat 216" catgttggtt ttaatagtgt caatttttgt aaaataaaaa ttaaacttca aactctttac 222" tttataaatt gtccataggc cacactttaa tatcacatta taaagggaag cttc 228" cctg gttattggtt :gtttgtcat taaagatata ttttgaatcc atgaaattgc 234" tatgctaaac agcctttaca :gtatggtct ggttaaagtt cctttgttcc ttttgtttta 240" ataaaatgtg tcactgattt :ttagctcaa aatcatcact g:taatttcc agtcacccca 246" aatatggtta aaagattttt :ttttaatca tgaagagaaa a:tagtagca tttctttctc 252" tccccattat ttattggttt ctaa tttt t:tagtccaa aaaa 258" tatttatctt ggttttacat :ttaatttcc attcttaatt g:aatttttt tctttaaata 264" aggaaaccaa tataatctca :gtataaaaa cttaaatatt t:acaagtta catatagcat 270" cattctaaaa taagaatttt :tttgttttc tgtctgcttt t:tcttatgt ctcttgttga 276" gttttatatt ttcagtggtt atttttgctt gtgttagatc a:tattaaaa tatatccaat 282" gtccctttga tacttgtgct ctgctgagaa gttt gcattaaaca tcccaggtct 288" catccttcag gaattttgca gttcaatgag aagagggaga caaatataaa gatgaggaca 294" gaagcatctc tacagatgaa aattacataa catt ctccatcaac aaaa 300" aaaa aaa Protein seguence: NCBI Reference ce: NP_001171629.1 523523 LOCUS: NP_001171629 ACCESSION: NP_001171629 1 mfeikkicci gagyvggptc sviahmcpei rvtvvdvnes rinawnsptl piyepglkev 6" vescrgknlf fstniddaik eadlvfisvl snpeflaegt aikdlknpdr vliggdetpe 12" gqravqalca vyehvarek ilttntwsse lsklaanafl aqrissinsi salceatgad 18" veevataigm dqrignkflk asvgfggscf qkdvlnlvyl cealnlpeva rywquidmn 24" dyqrrrfasr iidslfntvt dkkiailgfa fkkdtgdtre sssiyiskyl mdegahlhiy " dpkvpreqiv vdlshpgvse ddqurlvti skdpyeacdg ahavvictew dmfkeldyer 36" ihkkmlkpaf ifdgrrvldg lhnelqtigf qietigkkvs skripyapsg eipkfslqdp 42" pnkkpkv Official Symbol: PL|N3 (also known as M6PRBP1) Official Name: perilipin 3 Gene ID: 10226 Organism: Homo sapiens Other s: M6PRBP1, PP17, T|P47 Other Designations: 47 kDa MPR-binding protein; cargo selection protein TlP47; mannosephosphate receptor-binding protein 1; pin-3; placental protein 17; tail-interacting protein, 47 kD Nucleotide seguence: NCBI Reference Seguence: NM_001164189.1 LOCUS: NM_001164189 ACCESSION : 164189 1 tggcgcgggc aatccctcaa tggt cccctcgccc gtcactccag tgcgccccca 61 cacg cagtaaaagc cacccccgcc tcggcccgga ccaa gctggttttg 121 aagtcgcggc agctgttcct gggacgtccg gttgaccgcg ctgc agagaccatg 524524 18" tctgccgacg gggcagaggc tgatggcagc acccaggtga cagtggaaga accggtacag 24" cagcccagtg tggtggaccg tgtggccagc atgcctctga tcagctccac ctgcgacatg " gtgtccgcag cctatgcctc caccaaggag agctacccgc acatcaagac tgtctgcgac 36" gcagcagaga agggagtgag gaccctcacg gcggctgctg tcagcggggc tcagccgatc 42" ctctccaagc tggagcccca gattgcatca gccagcgaat acgcccacag ggggctggac 48" aagttggagg agaacctccc catcctgcag cagcccacgg agaaggtcct ggcggacacc 54" aaggagcttg ctaa ggtgtcgggg gcccaagaga tggtgtctag ggac 60" acggtggcca cccaattgtc ggaggcggtg accc gcggtgctgt cggc 66" gtggacaaga caaagtccgt cggc ggcgtccaat cggtcatggg ctcccgcttg 72" ggccagatgg tgttgagtgg cacg gtgctgggga agtcggagga gtgggcggac 78" aaccacctgc cccttacgga tgccgaactg atcg ccacatccct ggatggcttt 84" gacgtcgcgt ccgtgcagca gcagcggcag gaacagagct acttcgtacg tctgggctcc 90" ctgtcggaga ggctgcggca gcacgcctat gagcactcgc agct tcgagccacc 96" aagcagaggg cacaggaggc tctgctgcag ctgtcgcagg tcctaagcct gatggaaact "O2" gtcaagcaag gcgttgatca gaagctggtg gaaggccagg agaagctgca ccagatgtgg "08" ctcagctgga accagaagca gctccagggc aagg agccgcccaa gccagaggtc "l4" gagtcccggg cgctcaccat gttccgggac attgcccagc aactgcaggc cacctgtacc "20" tccctggggt ccagcattca gggcctcccc gtga aggt gcagcaggcc "26" cgccgccagg tggaggacct ccaggccacg ttttccagca tccactcctt ccaggacctg "32" tccagcagca ttctggccca gagccgtgag cgtgtcgcca gcgcccgcga ggccctggac "38" cacatggtgg aatatgtggc ccagaacaca cctgtcacgt ggctcgtggg accctttgcc "44" cctggaatca ctgagaaagc cccggaggag aagaagtagg gggagaggag aggactcagc "50" gggccccgtc tctataatgc agctgtgctc tggagtcctc aacccggggc caaa "56" cttattttct agccactcct cccagctctt tgtc cacttgggaa gctc "62" tcaaaacggg catcacccag ttgacccatc tctcagcctc tctgagcttg gaagaagcct "68" gttctgagcc tcaccctatc taga gagagatgtc cagaaaaaat atctttcagg "74" aaagttctcc cctgcagaat tttttttcct tgttaaatat caggaatata ggccgggtgc "80" tcac acctgtaatc ccagcacttt gggaggctga ggcgggcgga acacctgagg "86" tcaggtgttc agcc aggccaacat ggtgaaaccc cgtctctact aaaaatacaa "92" aaaaaaatga gccgggcatg gtagcaggtg tctgttatcc cagttaggag gctgaggcaa 525525 198" gagaatctct tgaacctgag aggcggaggt tgcagtgagc caagatcgcg cact 204" ccagcctggg ggacaagagt gagacttagt ctcaaaaaaa aaaaaaaaga aatc 210" agggatatag ttcatatccc acttctttgt ttacaccgat gtccctgaat atcagcctgt 216" agctaatgga attt ctggtctaag tgggcctcct ggggatgggg tggtacactg 222" agcttctgag cctcattgta gagtagaaag gtactggggc ctgtgtggta agccttgttg 228" aaatgctctg gtattcagta ttgccttaat aaacttcacc cacaactgca caaa 234" aa Protein seguence: NCBI Reference Seguence: NP_001157661.1 LOCUS: NP_001157661 ACCESSION: N P_001157661 l msadgaeadg stqvtveepv qqpsvvdrva smplisstcd mvsaayastk esyphiktvc 6" daaekgvrtl taaavsgaqp ilsklepqia saseyahrgl dkleenlpil qqptekvlad 12" tkelvsskvs gaqemvssak dtvatqlsea vdatrgavqs svvt gqusvmgsr 18" lgqmvlsgvd tvlgkseewa dnhlpltdae lariatsldg fdvaquqqr qeqsyferg 24" slserquha klra tquaqeall qlsqvlslme qul vegqeklhqm " wlswnqkqlq pkpe vesraltmfr diaqqlqatc tslgssiqgl ptnvkdqqu 36" arrqvedlqa tfssihsfqd lsssilaqsr ervasareal dhmveyvaqn tpvtwlvgpf 42" apgitekape ekk C14orf166 Official Sym bol: C14orf166 Official Name: chromosome 14 open g frame 166 Gene ID: 51637 Organism: Homo sapiens Other Aliases: CGI-99, CGI99, CLE, CLE7, LCRP369, RLLM1 Other Designations: CLE7 homolog; RLL motif containing 1; UPF0568 protein C14orf166 tide seguence: 526526 NCBI Reference ce: NM_016039.2 LOCUS: NM_016039 ACCESSION : NM_016039 l cgccgtcatt tcggagcgac tcagcgcctg cccgccctct cgccgcgtcg ccggtgcctg 6" cgcctcccgc tcgc ttcttctctc ccggccgagg cccgggggac cagagcgaga 12" agcggggacc atgttccgac gcaagttgac ggctctcgac taccacaacc ccgccggctt 18" caactgcaaa gatgaaacag aatttagaaa cgtt tggcttgaag accagaaaat 24" caggcactac aagattgaag acagagggaa tttaagaaac atccacagca gcgactggcc " caagttcttt gaaaagtatc tcagagatgt taactgtcct ttcaagattc aagatcgaca 36" agaagctatt gactggcttc ttggtttagc tgttagactt gaatatggag ataatgctga 42" aaaatacaag gatttagtac ctgataattc aaaaactgct gcaa ctaaaaatgc 48" agaaccattg ttgg ataa tcctgatttt aaggctggtg tgatggcttt 54" ggctaacctg cttcagattc agcgtcatga tgattacctg gtaatgctta aggcaattcg 60" gattttggtt caggagcgcc tgacacagga tgcagttgct aatc aaacaaaaga 66" gggcttacct gttgctttag acaaacatat tcttggtttt gacacaggag atgcagttct 72" taatgaagct gctcaaattc tgcgattgct gcacatagag gagctcagag agctacagac 78" aaaaatcaac gaagccatag tagctgttca ggcaattatt gctgatccaa agacagacca 84" cagactggga aaagttggaa gatgaacact tgaggacttc tcac ctacttagta 90" cagttgggaa ccatacactt ctggcatgtt tggaaatcaa aatgtcacat ggga 96" ggaagcccag aaaattgggt atgttctaga gatttaccac cattgcttat tgcttttttc 102" tttaataaag aaag tagaattttt aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_057123.1 LOCUS: NP_057123 ION: N P_057123 l tald fnck detefrnfiv wlequirhy kiedrgnlrn ihssdwpkff 61 ekylrdvncp fkiqdrqeai dwllglavrl eygdnaekyk dlvpdnskta dnatknaepl 527527 121 inldvnnpdf lanl lqiqrhddyl vmlkairilv qerltqdava eglp 181 valdkhilgf dtgdavlnea aqilrllhie elrelqtkin eaivavqaii adpktdhrlg 241 kvgr SNRNP70 Official Sym bol: SNRN P70 Official Name: small nuclear ribonucleoprotein 70kDa (U1) m: 6625 Organism: Homo s Other Aliases: RNPU1Z, RPU1, SNRP70, Snp1, U1-70K, U170K, U1AP, U1RNP Other Designations: U1 small nuclear ribonucleoprotein 70 kDa; U1 snRNP 70 Nucleotide seguence: NCBI nce Seguence: NM_003089.4 LOCUS: NM_003089 ACCESSION : NM_003089 1 gcggttcggc gcggaaagcg ggaggtggag gggcggcttg gggcaagcgc gcgcgcgcag 6" tgcagaagcc agccccccgc ggta ctcaaggtgc ccaaaggcgg ggtagtgacc 12" tcgcgcgtgc gctgtgcccg cggcagcgcc gggtcctagt gtgtgggttg ttgttggcac 18" cgcacggcgc gtgcgcagtg aggacggcgg agggatttgc ggccgggacc caccccctgc 24" tccagtcgct atcggaggcc gcgcgggtgg ctgagcagcg gcctggtgcg ctcgcttagc " gggcgacgga atcagacgga cgtggacgcc cccggagtgg aagccgaagc aggagttgtt 36" gttgctgagg cgca gccgccgcga gcctccggac agacgccaga aggg 42" cgctacgcga cttggcaaga tgacccagtt gccc aaccttctgg ccctctttgc 48" cccccgtgac cctattccat acctgccacc cctggagaaa ctgccacatg aaaaacacca 54" caatcaacct tattgtggca ttgcgccgta cattcgagag tttgaggacc ctcgagatgc 60" ccctcctcca actcgtgctg aaacccgaga catg gagaggaaaa gacgggaaaa 66" gattgagcgg cgacagcaag aagtggagac agagcttaaa atgtgggacc ctcacaatga 528528 72" tcccaatgct cagggggatg ccttcaagac tctcttcgtg gcgagagtga attatgacac 78" aacagaatcc aagctccgga gagagtttga ggtgtacgga aaaa gaatacacat 84" ggtctacagt aagcggtcag gaaagccccg tggctatgcc ttcatcgagt acgaacacga 90" gcgagacatg cactccgctt acaaacacgc agatggcaag aagattgatg gcaggagggt 96" ccttgtggac gtggagaggg gccgaaccgt gaagggctgg aggccccggc ggctaggagg "02" aggcctcggt ggtaccagaa gaggaggggc tgatgtgaac atccggcatt caggccgcga "08" tgacacctcc cgctacgatg agaggcccgg cccctccccg cttccgcaca gggaccggga "14" ccgggaccgt gagcgggagc gcagagagcg gagccgggag cgagacaagg agcgagaacg "20" gcgacgctcc cgctcccggg accggcggag gcgctcacgg agtcgcgaca agcg "26" gaggcgctcc agggagcgga gcaaggacaa ggac cggaagcggc gaagcagccg "32" gagtcgggag nggCCngC gggagcggga gcgcaaggag gagctgcgtg gcggcggtgg "38" ggcg tccg aggcgggtga ccct gatgatgggc ctccagggga "44" gcct gacggccctg acggtccaga gggc cgggatcgtg accgggagcg "50" acggcggagc caccggagcg agcgcgagcg gcgccgggac cgggatcgtg accgtgaccg "56" tgaccgcgag cacaaacggg gggagcgggg cagtgagcgg gatg aggcccgagg "62" cggt ggccaggaca acgggctgga gggtctgggc aacgacagcc gagacatgta "68" catggagtct gagggcggcg acggctacct ggctccggag aatgggtatt aggc "74" tgcgccggag tgaagaggtc gtcctctcca tctgctgtgt ttggacgcgt tcctgcccag "80" ccccttgctg tcatcccctc ccccaacctt ggccacttga gtttgtcctc caagggtagg "86" tgtctcattt gttctggccc cttggattta aaaataaaat taatttcctg ttgatagtgg "92" gcaaaaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: 080.2 LOCUS: NP_003080 ACCESSION: N P_003080 1 mtqflppnll alfaprdpip ylppleklph ekhhnqpycg iapyirefed prdappptra 61 tr rm rk rr ki rrqq v t lkmwd phndpnaqu varv nydttesklr 121 refevygpik rihmvyskrs afie yeherdmhsa ykhadgkkid grrvlvdver 529529 181 grtvkgwrpr rlggglggtr nirh sgrddtsryd erpgpsplph rere 241 rrersrerdk ererrrsrsr drrrrsrsrd keerrrsrer skdkdrdrkr rssrsrerar 301 rererkeelr ggggdmaeps pddg ppgelgpdgp dgpeekgrdr drerrrshrs 361 ererrrdrdr drdrdrehkr gergsergrd gqu ngleglgnds rdmymesegg 421 engy lmeaape Official Sym bol: CN N2 Official Name: calponin 2 m: 1265 Organism: Homo sapiens Other s: none Other Designations: calponin H2, smooth muscle; calponin-2; neutral calponin Nucleotide seguence: NCBI Reference Seguence: NM_004368.2 LOCUS: NM_004368 ACCESSION : NM_004368 1 gtga gagccgccca cgagctctga gcagagagcc cgcaggagtg ccacgtcccg 6" gcggcctcgg cccctccctg cctcagtttc ccgtggcatc aaagggggcg aggggcccct 12" ccaggcctct ggtgacgggg gtgctgtgcc caggcggggg tccgggggcg accgaggggg 18" ctcaggaagt ccgcggccgc aggaattcgg cgcctccagg ccttataagg acatttgcgc 24" tccgggccaa tcagcggcgg gggcgtggcg cgcggagccc ggcgcgtccc aaccccgcgc " cagcccggcg gtcccgtccc gtcccgtcct gtgcggcccc gtcccgccgc ccgcccgcca 36" agct ccacgcagtt caacaagggc ccctcgtacg ggctgtcggc cgaggtcaag 42" aaccggctcc tgtccaaata tgacccccag aaggaggcag agctccgcac ctggatcgag 48" ggactcaccg gcctctccat cgac ttccagaagg gcctgaagga tggaactatc 530530 54" ttatgcacac tcatgaacaa gctacagccg ggctccgtcc ccaagatcaa ccgctccatg 60" cagaactggc accagctaga aaacctgtcc aacttcatca tggt cagctacggc 66" atgaaccctg tggacctgtt cgaggccaac tttg agagtgggaa catgacgcag 72" gtgcaggtgt ctcttctcgc cctggcgggg aaggccaaga ctaaggggct gcagagcggg 78" gtggacattg gcgtcaagta ctcggagaag caggagcgga atttcgacga tgccaccatg 84" aaggctggcc agtgcgtcat cgggctgcag atgggcacca acaaatgcgc cagccagtcg 90" ggcatgactg cctacggcac gagaaggcat ctctatgacc ccaagaacca tatcctgccc 96" cccatggacc actcgaccat cagcctccag acga acaagtgtgc cagccaggtg "02" acgg ctcccgggac ccggcggcac atctatgata ccaagctggg aaccgacaag "08" aact cctccatgtc cctgcagatg ggctacacgc agggcgccaa ccagagcggc "l4" ttcg gcctgggccg gcagatatat gaccccaagt cgca aggcacagtg "20" gccgatgggg ctccctcggg caccggcgac tgcccggacc nggggaggt ccctgaatat "26" cccccttact accaggagga ggccggctac tccc agcacgctct ctccccacat "32" cgtctgccca tctgggtttt tgggtttttc tgtgttttca tctttttttt ttttttctta "38" acccgttcag tgctgccagt caaccaaggg tctgtgagtg tcagcgtggg atcaggcagc "44" agagcttttt tcccctttgc cttgatcctt cgcaaggctg agccactggg ctgtggggga "50" aggggtcaag gccatatccc aatacgtgta gggcgagggt tggc acattcaggc "56" ggga agaagagacc tgggcttgga aggaaccggt ccccgacggt ttctgcttgc "62" ctcgcctctt cccccttttg tcagctgagc agtttgtggt ttctatgccc gcaagtttca "68" ggaagtattc acaaaagaaa aatacatttt ttcccccagg ggtggggcaa ggacagtgga "74" gagagtgcta ggaaatgagt cccctgggaa aggggaccgg gccgtgatgt taaatatctc "80" cggctcccaa gtgactggat ttgcctagga ccttcagatc aacagacttc agaccctcag "86" acctgccccg gggccaggtg gagaaagtga gggccgtaca aggaagtgaa attctgagtt "92" gttggggcta agcctgaccc cctctccatg gccc caactcactc tggcctcagt "98" agattttttt ttcagttgtg gttgttgccc gagt gcagtggcgc catcttggct 204" cactgcacct ccaccttccg ggctcaagcg attctccagc ctcagcctcc tgagtagcta 210" ggactgcagg tgctccacca cgcccggcta atttttgtat ttttagtaga gttt 216" ccccatgttg gccaggctgg tctcgaactc ctggcctcag gtgtgatccg cccgcctccg 222" cctccccaag cgctgagatt gtga gtgc cctc agtaggtttt 228" aaggagtccc cagccctcct tggg cccgaccagc ttatactgct ccatcttccc 531531 2341 cggccacatg ccccgccaag tactgcacag ggacccccca cccaggggcc ctgctccgtg 2401 agataatgtg aaatacgact gtggaccaaa cgcaataaaa cctctgtttg tacgaagaaa 2461 aaaaaaaaaa aaaaaaaa Protein seguence: NCBI Reference Seguence: 359.1 mNP_OO4359 ACCESSION: NP_OO4359 1 msstqfnkgp syglsaevkn dqu eaelrtwieg ltglsigpdf qulkdgtil 61 ctlmnqupg svpkinrsmq nlsn fikamvsygm npvdlfeand lfesgnmtqv 121 qullalagk aktkglqsgv digvkysekq ernfddatmk agqcviglqm gtnkcasqsg 181 mtaygtrrhl ydpknhilpp mdhstislqm gtnkcasqvg mtapgtrrhi ydtklgtdkc 241 dnssmslqmg ytqganqsgq vfglgrqiyd pkycpqgtva dgapsgtgdc pdpgeVpeyp 301 agy PEBP1 Official Symbol: PEBP1 al Name: phosphatidylethanolamine g protein 1 m: 5037 Organism: Homo sapiens Other Aliases: HCNP, HCNPpp, PBP, PEBP, PEBP-1, RKIP Other Designations: Raf kinase inhibitory protein; ampal ergic neurostimulating peptide; neuropolypeptide h3; phosphatidylethanolaminebinding protein 1; prostatic binding protein; prostatic-binding protein; raf kinase inhibitor protein Nucleotide seguence: NCBI Reference Seguence: NM_002567.2 LOCUS: NM_002567 ACCESSION : NM_002567 XR_109136 XR_109137 XR_111344 XR_1 14620 532532 l tgggcggcgg cgcg tgctctcgcg tggtcgctgg gtctgcgtct gcca 6" gtgtgctgag ctctccgcgt cgcctctgtc gcccgcgcct ggcctaccgc ggcactcccg 12" gctgcacgct ctgcttggcc tcgccatgcc ggtggacctc agcaagtggt ccgggccctt 18" gagcctgcaa gaagtggacg agcagccgca gctg catgtcacct acgccggggc 24" ggcggtggac gagctgggca aagtgctgac ccag gttaagaata gacccaccag " catttcgtgg gatggtcttg attcagggaa gctctacacc ttggtcctga cagacccgga 36" tgctcccagc aggaaggatc acag agaatggcat catttcctgg tggtcaacat 42" gaagggcaat gacatcagca gtggcacagt cctctccgat tatgtgggct cggggcctcc 48" caagggcaca ggcctccacc gctatgtctg gctggtttac gagcaggaca ggccgctaaa 54" gtgtgacgag cccatcctca gcaaccgatc tggagaccac cgtggcaaat tcaaggtggc 60" gtccttccgt aaaaagtatg agctcagggc cccggtggct ggcacgtgtt accaggccga 66" gtgggatgac tatgtgccca aactgtacga gcagctgtct gggaagtagg gggttagctt 72" ggggacctga actgtcctgg aggccccaag tccc cagttcagtg ttgcatgtat 78" aatagatttc tcctcttcct gccccccttg gcatgggtga gacctgacca gtcagatggt 84" agttgagggt gacttttcct tggc ctttataatt ttactcactc actctgattt 90" tgat caaatttgaa cttcattttg gggggtattt tgtg atggggtcat 96" caaattatta atctgaaaat agcaacccag aatgtaaaaa agaaaaaact ggggggaaaa "02" agaccaggtc tacagtgata gagcaaagca tcaaagaatc ggag gtttaaaaaa "08" aaaaaaaaaa aaaaagattg gttgcctctg cctttgtgat cctgagtcca gaatggtaca "l4" caatgtgatt ttatggtgat gtcactcacc tagacaacca gaggctggca ttgaggctaa "20" cctccaacac tctc agatgcctca gtaggcatca gtatgtcact ctggtccctt "26" taaagagcaa tcctggaaga agcaggaggg gctt tgtt gggacatggc "32" aatctagacc ggtagcagcg ctcgctgaca gcttgggagg aaacctgaga tctgtgtttt "38" ttaaattgat tcat gggggtaaga aaagctggtc tggagttgct gaatgttgca "44" ttaattgtgc tgtttgcttg tagttgaata aaaatagaaa cctgaatgaa gaaaaaaaaa "50" aaaaaaa Protein ce: NCBI Reference Seguence: NP_002558.1 LOCUS: NP_002558 533533 ACCESSION: N P_002558 1 mpvdlskwsg plslqevdeq pthlhvtya gaavdelgkv ltptqvknrp tsiswdglds 61 vltd pdapsrkdpk yrewhhflvv nmkgndissg tvlsdyvgsg ppkgtglhry 121 vwlvyeqdrp lkcdepilsn gkfk vasfrkkyel rapvagtcyq aewddyvpkl 181 yeqlsgk ACLY Official Sym bol: ACLY Official Name: ATP citrate Iyase m: 47 Organism: Homo sapiens Other Aliases: ACL, ATPCL, CLATP Other Designations: ATP citrate synthase; ATP-citrate -)-lyase; ATP- citrate synthase; citrate cleavage enzyme Nucleotide seguence: NCBI Reference Seguence: NM_001096.2 LOCUS: NM_001096 ACCESSION : NM_001096 1 agccgatggg ggcggggaaa agtccggctg ggccgggaca aaagccggat cccgggaagc 6" taccggctgc tggggtgctc cggattttgc ggggttcgtc gggcctgtgg aagaagcgcc 12" ggac ttcggcagag gtagagcagg tctctctgca gccatgtcgg ccaaggcaat 18" ttcagagcag acgggcaaag aactccttta caagttcatc tgtaccacct cagccatcca 24" gaatcggttc aagtatgctc gggtcactcc tgacacagac tgggcccgct tgctgcagga " ccacccctgg ctgctcagcc tggt gcca gaccagctga tcaaacgtcg 36" tggaaaactt ggtctcgttg gggtcaacct ggat ggggtcaagt cctggctgaa 42" gccacggctg ggacaggaag ccacagttgg caaggccaca ggcttcctca agaactttct 48" gccc ttcgtccccc acagtcaggc tgaggagttc tatgtctgca tctatgccac 534534 54" ccgagaaggg gtcc tgttccacca gggt gtggacgtgg gtgatgtgga 60" cgccaaggcc cagaagctgc ttgttggcgt gaaa ctgaatcctg tcaa 66" aaaacacctg ttggtccacg cccctgaaga caagaaagaa attctggcca gttttatctc 72" cggcctcttc aatttctacg aggacttgta cttcacctac ctcgagatca atccccttgt 78" agtgaccaaa gatggagtct atgtccttga cttggcggcc aaggtggacg ccactgccga 84" ctacatctgc aaagtgaagt ggggtgacat cgagttccct ccccccttcg ggcgggaggc 90" atatccagag gaagcctaca ttgcagacct cgatgccaaa agtggggcaa gcctgaagct 96" gaccttgctg aaccccaaag ggaggatctg gaccatggtg ggtg gcgcctctgt "02" cgtgtacagc gataccatct tagg gggtgtcaac gagctggcaa actatgggga "08" gtactcaggc gcccccagcg agcagcagac ctatgactat gccaagacta tcctctccct "l4" catgacccga gagaagcacc cagatggcaa gatcctcatc attggaggca gcatcgcaaa "20" cttcaccaac gtggctgcca cgttcaaggg catcgtgaga gcaattcgag attaccaggg "26" ccccctgaag gagcacgaag tcacaatctt tgtccgaaga ggtggcccca actatcagga "32" gggcttacgg gtgatgggag aagtcgggaa gaccactggg atccccatcc atgtctttgg "38" cacagagact acgg ccattgtggg catggccctg ggccaccggc ccatccccaa "44" ccagccaccc acagcggccc acactgcaaa cctc aacgccagcg ggagcacatc "50" gacgccagcc cccagcagga cagcatcttt gtcc agggccgatg aggtggcgcc "56" tgcaaagaag cctg ccatgccaca agtc ccaagtccaa gatccctgca "62" aggaaagagc accaccctct tcagccgcca caccaaggcc attgtgtggg gcatgcagac "68" ccgggccgtg caaggcatgc tggactttga ctatgtctgc tcccgagacg agccctcagt "74" ggctgccatg gtctaccctt tcactgggga ccacaagcag aagttttact gggggcacaa "80" cctg atccctgtct tcaagaacat ggctgatgcc atgaggaagc atccggaggt "86" gctc tttg cctctctccg ctctgcctat gacagcacca tggagaccat "92" gaactatgcc cgga ccatcgccat catagctgaa ggcatccctg aggccctcac "98" gagaaagctg atcaagaagg cggaccagaa gggagtgacc atcatcggac ctgccactgt 204" tggaggcatc aagcctgggt gctttaagat tggcaacaca ggtgggatgc tggacaacat 210" cctggcctcc aaactgtacc gcccaggcag cgtggcctat cgtt ccggaggcat 216" gtccaacgag ctcaacaata tcatctctcg gaccacggat ggcgtctatg agggcgtggc 222" cattggtggg gacaggtacc cgggctccac attcatggat catgtgttac gctatcagga 228" cactccagga gtcaaaatga ttgtggttct gatt gggggcactg aggaatataa 535535 234" ccgg ggcatcaagg agggccgcct cactaagccc atcgtctgct tcgg 240" tgcc accatgttct cctctgaggt ccagtttggc catgctggag cttgtgccaa 246" ccaggcttct gaaactgcag tagccaagaa ccaggctttg aaggaagcag gagtgtttgt 252" gccccggagc tttgatgagc ttggagagat catccagtct gtatacgaag atctcgtggc 258" caatggagtc attgtacctg cccaggaggt gccgccccca accgtgccca tggactactc 264" ctgggccagg gagcttggtt tgatccgcaa acctgcctcg ttcatgacca gcatctgcga 270" tgagcgagga caggagctca tctacgcggg catgcccatc actgaggtct tcaaggaaga 276" gatgggcatt ggcggggtcc tcggcctcct ctggttccag aaaaggttgc ctaagtactc 282" ttgccagttc attgagatgt gtctgatggt gacagctgat cacgggccag ccgtctctgg 288" agcccacaac accatcattt gtgcgcgagc tgggaaagac ctggtctcca gcctcacctc 294" ggggctgctc accatcgggg atcggtttgg gggtgccttg gatgcagcag ccaagatgtt 300" agcc agtg gcattatccc catggagttt gtgaacaaga tgaagaagga 306" agggaagctg atcatgggca ttggtcaccg gtcg ataaacaacc cagacatgcg 312" agtgcagatc ctcaaagatt acgtcaggca gcacttccct cctc atta 318" ggaa gtagagaaga ttaccacctc gaagaagcca aatcttatcc tgaatgtaga 324" tggtctcatc gcat ttgtagacat gcttagaaac tgtgggtcct ttactcggga 330" tgat attg acattggagc cctcaatggc atctttgtgc tgggaaggag 336" tatggggttc attggacact atcttgatca gaagaggctg aagcaggggc tgtatcgtca 342" tccgtgggat gatatttcat atgttcttcc ggaacacatg agcatgtaac agagccagga 348" accctactgc ctga agacaagatc tcttccccca agaaaaagtg tacagacagc 354" tggcagtgga gcctgcttta aggg gcctggaatg gcca ctggggtaca 360" ggcaccgaag accaacatcc acaggctaac accccttcag tccacacaaa gaagcttcat 366" atttttttta taagcataga aataaaaacc aagccaatat ttgtgacttt gctctgctac 372" ctgctgtatt tattatatgg aagcatctaa gtactgtcag gatggggtct tcctcattgt 378" agggcgttag gatgttgctt tctttttcca ttagttaaac ttct cctttggagg 384" aagggaatga aacatttatg gcctcaagat actatacatt taaagcaccc caatgtctct 390" cttttttttt ttttacttcc ctttcttctt ccttatataa catgaagaac attgtattaa 396" tttt aaagatcttt ttgtatgtta cgtgttaagg gcttgtttgg actg 402" aaatgttctg tgttgcagac cagagtctgt ttatgtcagg gggatggggc cattgcatcc 408" ttagccattg tcacaaaata tgtggagtag taacttaata tgtaaagttg taacatacat 536536 4141 aaaa tggaaatgca gaaagctgtg aaatgtcttg tgtcttatgt tctctgtatt 4201 tatgcagctg atttgtctgt ctgtaactga agtgtgggtc caaggactcc taactacttt 4261 gcatctgtaa tccacaaaga ttctgggcag ctgccacctc agtctcttct tatc 4321 atagtctggt ttaaataaac tatatagtaa caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4381 aaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4441 aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_001087.2 LOCUS: 087 ACCESSION: N P_001087 1 msakaiseqt gkellykfic ttsaianfk yarvtpdtdw arllqdhpwl vkpd 6" qlikrrgklg lvgvnltldg vkswlkprlg qeatvgkatg flknfliepf vphsqaeefy 12" vciyatregd yvlfhheggv dvgdvdakaq kllvgvdekl npedikkhll vhapedkkei 18" glfn fyedlyftyl einplvvtkd gvyvldlaak vdatadyick vkwgdiefpp 24" pfgreaypee ayiadldaks gaslkltlln pkgriwtmva gggasvvysd gvne " lanygeysga pseqqtydya ktilslmtre khpdgkilii ggsianftnv aatfkgivra 36" lke hevtifvrrg gpnyqeglrv mgevgkttgi pihvfgteth mtaivgmaLg 42" hrpipnqppt aahtanflln asgststpap srtasfsesr adevapakka kpampquvp 48" sprslqgkst tlfsrhtkai vwgmqtravq gmldfdyvcs rdepsvaamv ypftgdhqu 54" fnghkeili pvfknmadam dvli nfaslrsayd stmetmnyaq irtiaiiaeg 60" ipealtrkli kkaqugvti igpatvggik gntg gmldnilask lyrpgsvayv 66" srsggmsnel nniisrttdg vyegvaiggd rypgstfmdh vlrythpgv kmivvlgeig 72" icrg ikegrltkpi vcwcigtcat mfssevquh agacanqase tavaknqaLk 78" eagvaprsf delgeiiqsv yedlvangvi vpaqevpppt vpmdysware lglirkpasf 84" mtsicdergq eliyagmpit evfkeemgig gvlgllwqu rlpkyscqfi emclmvtadh 90" gpavsgahnt iicaragkdl vssltsgllt igdrfggald skaf mefv 96" nkmkkegkli mgighrvksi nnpdmrvqil kdyvrthpa tplldyalev ekittskkpn 102" lilnvdglig vafvdmlrnc gsftreeade yidigalngi fvlgrsmgfi ghylqurLk 108" qglyrhpwdd isyvlpehms 537537 SNX12 al Sym bol: SNX12 OfibblName:soMngnefln12 Gene ID: 29934 Organism: Homo sapiens Other Aliases: none Other Designations: sorting nexin-12 Nucleotide seguence: NCBI Reference Seguence: NM_001256185.1 LOCUS: NM_001256185 ACCESSION : NM_001256185 l ttttgattgc catttccttg ggacagcctg gaat cgaaagaagt tcttttcagt 6" atggttgcat aacatcgagt cggagattgt gaaatgtcct ttgagaaaaa tgtacgggca 12" ggaagatgaa taggtgtctg tttgattaag gctctccttc ggaaagatgt cggacacggc 18" agtagctgat acccggcgcc ttaactcgaa gccgcaggac ctgaccgacg cttacgggcc 24" gccaagtaac ttcctggaga tcgacatctt tcag acggtgggcg tgggacgcgc " cacc acctatgagg ttcgcatgcg gacaaaccta cctatcttca agctaaagga 36" gtcctgcgta cggcggcgct actt tgagtggctg aaaaatgagc tggagagaga 42" tagcaagatt gtagtaccac cactgcctgg cttg cagc tccctttccg 48" aggagatgaa gggatctttg aggagtcttt catcgaagaa aggaggcagg gcctcgagca 54" gtttattaac aaaattgctg ggcacccact ggctcagaat gaacgctgcc tacacatgtt 60" agag gaggcaattg acaggaacta cgtcccgggg aaggtgcgcc agtaggagcc 66" cctctcacca cctgccctct actttcctgc tgaaatgaca ttggttttta cactaagcct 72" ctctctgtct ttgatctgaa gttggctgcc ctgg cctgatagac tgtctggcat 78" tgtgctccct ggtacctgac tacaccatgt tctg ctaggatcct cttctctgag 84" gagaggtggg aacccacagg cagatgccct gggg ttggggtggg tgggtggggg 90.. gattagactg gaaggcaatt tcttgggcat ttacccatgc cagaaggcta acctgggggg 538538 96" aggggggcgc ttgtgctggt gaggcacttg gatacatact gatgctgcaa gttcagggga "02" tttttcttac gttt aaccaagaac actgagcagg gaaaaaccct gcctttccta "08" actgcatgta ttttttcctt tttggaaagg tggtagagac tcagaagctt tccttgtttt "l4" cttcaggcct gctcccagtt ttcttaacag tttcttttgt tgctttctct ctcccttgtt "20" gctttccatg gcagtaatcc tcctagagtc gtct gttgtatgga gcagggtgtg "26" tgggttttct gggcccatca ttatggctgc ttcagagtca gaagaaagcc atagggcagt "32" aggggagctc ctattgccta gcccctctcc ctttgtggct cccactctag ctgcctattt "38" atca gctggtgagt cagtatgggc ttct ccctccctaa gcccttgcta "44" ctttatgggt tagctttgca ggtttggtgg cttgaggggt gggggcaact caccactgcc "50" aggtaactcc ctgaagggtg gatt atcttctagg ctcttacccg ngtagggaa "56" gggcatcaac actgtcttcc ttccattctc ctttccccca tcccatttag tgctgccaca "62" gggcagaagc acacaaacca accacacagt ctctgacttc tcctaagcac tttgagttgt "68" tgaatggggc tcaggggcaa gagtttttgc tgccctcccc agcgtggtca cagggttatt "74" gaactgcctg cacttgtttc tcatgcaact tttt ccccagaagt tgaactatgg "80" atagcagctt ggtatggatt tcctaaatct taacatttga agcagcttct tggc "86" aactatcctg gtttctgtct gggt ggtttgtttg ctggggccca acgtctgtcc "92" caagtggtgg ggtgagagta agttaacttt ggtgccaggt gagaggtggg ggctctttgc "98" ttagactccc tatcatggaa agattggagt tttctatgca gggcactgtg gaaaaggatt 204" gctgattctg actgaccctg atcagagaga ttaggattgt attttgacat aggatttgga 210" acccatctaa atgttgaagt tccctgagac agctctccag ctgctgagcc tgcgccaggg 216" gctaagcagc ccctaatgag aggctctgct ccctttccca caat gttgttgttg 222" ctgccttttt gatttgtatc tata gacatttttt aaaaacgatt tcctctttca 228" ttgtgcacaa gtgctgagag tctgaggccc tgct gtgtatatat actc 234" ggggctttta ttcagcaaac tgttcattct tctgtcagac aatgtcatat tcaactctgt 240" tcatattaaa ccactgtgaa gcaaaaaaaa aaaaaaaaaa Protein seguence: NCBI nce Seguence: NP_001243114.1 LOCUS: NP_001243114 ION: NP_001243114 539539 1 msdtavadtr rlnskpqdlt daygppsnfl eidifnpqtv gvgrarftty evrmrtnlpi 61 fklkescvrr rysdfewlkn elerdskivv pplpgkalkr qlpfrgdegi feesfieerr 121 qgleqfinki aghplaqner clhmflqeea idrnyvpgkv rq Official Symbol: SYNCRIP Official Name: synaptotagmin binding, cytoplasmic RNA interacting protein Gene ID: 10492 Organism: Homo sapiens Other s: RP1-3J17.2, GRY-RBP, GRYRBP, hnRNP-Q Other ations: NS1-associated protein 1; e- and tyrosine-rich RNA- binding protein; heterogeneous nuclear ribonucleoprotein Q Nucleotide seguence: NCBI Reference Seguence: NM_001159673.1 LOCUS: NM_001159673 ACCESSION : NM_001159673 1 cggcgtgagc cgcc attttacaac agctccactc gcgccggaca cagggagcag 6" cgagcacgcg tttcccgcaa cccgatacca tcggacagga tttctccgcc tcagcccaac 12" ggct acat agtgatttag atgaaagagc tattgaagct ttaaaagaat 18" tcaatgaaga cggtgcattg gcagttcttc aacagtttaa agacagtgat ctctctcatg 24" acaa aagtgccttt ttatgtggag agac ttacaggcag agagaaaaac " aagggaccaa agtagcagat tctagtaaag gaccagatga ggcaaaaatt aaggcactct 36" tggaaagaac aggctacaca cttgatgtga ccactggaca gaggaagtat ggaggaccac 42" ctccagattc cgtttattca ggtcagcagc cttctgttgg cactgagata ggaa 48" agatcccaag agatctattt gaac ttgttccatt atttgagaaa gctggaccta 54" tatgggatct tcgtctaatg atggatccac tcactggtct caatagaggt tatgcgtttg 60" tcactttttg tacaaaagaa gcagctcagg aggctgttaa actgtataat aatcatgaaa 540540 66" ttcgttctgg aaaacatatt ggtgtctgca tctcagttgc caacaatagg ctttttgtgg 72" gctctattcc taagagtaaa accaaggaac agattcttga agaatttagc aaagtaacag 78" agggtcttac catt ttataccacc aaccggatga caagaaaaaa aacagaggct 84" tttgctttct tgaatatgaa gatcacaaaa cagctgccca gcgt aggttaatga 90" gtggtaaagt caaggtctgg gttg gaactgttga atgggctgat cctatagaag 96" atcctgatcc tgaggttatg gcaaaggtaa aagtgctgtt tgtacgcaac cttgccaata "02" ctgtaacaga agagatttta gaaaaggcat agtt tgggaaactg gaacgagtga "08" agaagttaaa agattatgcg ttcattcatt ttgatgagcg agatggtgct gtcaaggcta "14" tggaagaaat gaatggcaaa gacttggagg gagaaaatat tgaaattgtt tttgccaagc "20" caccagatca gaaaaggaaa gaaagaaaag ctcagaggca agcagcaaaa atgt "26" atgacgatta ctactattat ggtccacctc cccc tccaacaaga ggtcgagggc "32" gtggaggtag aggtggttat cctc cagattatta tggatatgaa gattattatg "38" attattatgg ttac cataactatc gtggtggata tgaagatcca tactatggtt "44" atgaagattt tgga gctagaggaa ggggtggtag aggagcaagg ggtgctgctc "50" catccagagg tcgtggggct gctcctcccc gcggtagagc ttca cagagaggag "56" gtcctggatc aggc gttcgaggtg cgagaggagg acaa caaagaggcc "62" gcgggcaggg aaaaggggtc gaggccggtc ctgacctgtt acaatgaaga ctgacttgct "68" atgtgggatt acaccagaag cttgcagtgg agtaatggta aggaaatcaa gcaaccttaa "74" atatgtcggc tgtataggag catattctat tgcagaagac cttcctatga agatcatgga "80" atcaaatacg ggacattgaa ctaatacttg gata tgaatttctt taacaatttt "86" ctctgcagtg caagttatta aactaaagct actctatttt caaaatgtgt tccaacagaa "92" atccttcata actcctagca tggtatctta ataaagaata aagttctttt aaaaatctgc "98" tctaagtaga tttttcccct aatt aaggatccca acagtggtat tttgaaatat 204" tctcttgaat ttgtgcattt aaattttatt gtat agatgaatgc cactgatggt 210" atccttaaat tttatttctg ctcaccaagg ttaatcatga ttgtctatat cttttttata 216" gtgatcactt tgtg ttcagatatg cagtttcagg tgtaatcatc agagctggtt 222" gcat tccagatagt ggttcttttc agaacctttt taaaagggtt ggttaactac 228" ctcagtagca gaggattgaa ctataccctg tctgtactgt acatagaaaa tctttgtaga 234" taaaagcaag gcttgttaaa tatgatatga gggtaagatt ttaatatacc aaatgtaaca 240" ttcttagttg cctttagttt cagaggcttg taagacttcc tcatgaccat cataacaggc 541541 246" cttgcttttg tcgtattttg tggctgaaaa agcagccttg cttcttcaga tattgtagtt 252" atttggatgt ataatagttt atgt tacttttgta agacatcaga tgttcaaaaa 258" agtgcatccg aacttgtact aaatactgca gtgtcccttt ataaaaagtc aaac 264" tgacaattgt aagc ctgacatttg gatattttga :tca taaatcatag 270" aaattagtat atggctgtag tttagctttt taggtaaaag gtatgt:tca ttagtgcatt 276" tcttcctgct gatcactgta aacatgtgaa tcagctttcc atttct:atg caggtcatga 282" taacttgtag gtac aatcatttgt gctatgtttt taattt:cta aagcaccttg 288" atgacagtga gtgtccagtg gtgaagcatc tgaa ccaccc:caa aaattttttt 294" gccaagtcct aagttgatag cttaaagtaa aaagtgaaaa :ttc attaggactt 300" ggtgtaaaga aatcccctcc ccccttcccc aaagggatac :ata tcacataccc 306" aataggcacc acgatgaaga tcagagctta tacttaatta aggttt:ata cacaccagtt 312" ccccagtaaa tgcaaattta acaagaaaat cagacatgtc atatgt:caa aatgctcatg 318" aatc attttgcatt cctgcaaata aaattgtttt atactg:aag ctggaggcga 324" gtgtaactta tttttgtaat aaagttttta ttttttttat gtgtca:taa tataaatgtg 330" tgta gaaatcttct ggtttaaaaa cttagaattg cacaca:ttc agtatgttta 336" tttgtactta cataatttta gaatagtggt tgccaatagc ctgtatgttt cacattaatt 342" ggttttttgt tatctaaata aatcatttta gtatgttgta tgtcagttac tgggatagct 348" gggacataga ttta gtca ataagtattc attggaatat atgtaaatgt 354" gccttgccgg ttattgaaac acaa aatgagtatg gggtgacaaa aattagttcc 360" tggtgcttaa tgaaactttc tgccactgat tttatatatt accccgtgct tttttaaagt 366" acatctctct caaaacttag tgtaagtttg agggctacac aaaacattta catttcattc 372" taacataatg aatataatag gaaa gtgggtaaac taaatgtagc cttcagtaaa 378" attgaatctc agtgtaatcc ttggtgctgg tcag ttccgaggag ttaaatgatc 384" ccatctaaga ggtcattgcc atgcctattg gcactttact gtcatagcat ttttaaggga 390" cactgtcaag gtgtttaagt tctcagaatt acttgttggg attttaggac aggtttgttt 396" acttaaagta agaactgcat tgtcaaagtt gaaagaggaa cacttttgtg agttcacaaa 402" tgtgttctta agaaaacatt aaaatatgga gctctgggtt ttcaagacta tttggcattc 408" tggg gacttgggag ggaaactgat aaaaagaaat tgaagaattg atggttatac 414" aagg gtaatgtaaa cagtggtgat gaaatatata cacatcaagt gaaattactt 420" gacagtgttc atttgaatga ctttgaattc aagccattat aattactttt aaaattaaat 542542 126" atcatttgca ctgttctgat aatgggtgca gtttttgagc aatataatca gagctaaata 132" tgcatgtagt gattagtgat gtgaacaatt aacgttctga atac taactgtggt 138" attttcaaac ttaaatttct gtagtaaaat cagtatcaaa gtcttatcag atcaaggaaa Z44" caat gcatataaac atacttttga atgttgtgtg gcctataaag caataatgca 150" atttatatgg aatgtcatgg gatatgagaa atggaaatgc aaaaataact ttag 156" taaaaatgtc aacatgttaa agggggaatg ttaactaatg taggttattg ctatttgtga Z62" tatg ggttcttggc tttgacagct aatg gacagtgata agttaaaaga Z68" aattttgtat attgtcaagg aaagggtctt aaatccgagt cctt ccttggggta Z74" aaaaatgtat tcttaaagca ttctgatgtt aaaaagaaaa cttaagttat ctaaccaaaa 180" cagacgcaag attttgtttc ctac ttggcaatca aaagtgatca taaatttagg 186" ttatcagttt tcagaaagtt gctttgtgag aaaattttgt tagatatatt ctcccaagca Z92" tgctttttgt ggaaggtttt cagccattgc cactgaatca gatgttaaaa atgaagggaa Z98" aattgagtgt acac aactgttgta cactcatgat tgcagttttt agcttaagaa 504" acttttctac cagttactgt gaatctgact taaaatgtaa agtttcctca tgataaaata 510" ggaacaacat agaaatggat tgatggggtg atctgagtta ttgtatataa aagtttttaa 516" agaatagaat gaacatcaag aggc aaaaattgac acattcagaa cagctttttt 522" gactgcgaag ccaaaagttg acag atcc cttattatta cagagtattt 528" tacgtagtct ctattttaag gagagaaatt aaatagaagg gcttcatgca tttaggggag 534" ggtgctaaaa cttctcaagt tcgtcaaact tacaggaata cccaccatga tcattttctc 540" tctaattatg tataccacaa aattttcatc tggccatagg aattcactgg tgggtgtaaa 546" attaatgact aaagaaatta agtgacaaat acataaaaga aacagacttg tggggatatt 552" gttttaaggt gtattaatta ctcagtgatg ataccactca atagggcatg ccactacttt 558" tcttaagatg ctaattatga agcagtgctc attt tttaactagc aaattagtag 564" atggactttt ggggtctgtc actttttaaa agtatttaag acttaaattc cacc 570" acagtctgcc ttcagtaata cacctaaaat atttttcagg agca ttcagtttga 576" aaatttgcag atgcaaacca gtattattac taacgctctg ggtcaaagat taggttttta 582" acag tagtctggta aatatttaga agtctggcat tgagaaacaa aagcttgtac 588" ctgactagta tttttattta aaaaaattag ttctgttagc ttatttaaat tgtgttttat 594" ttatccgtag tatt tatttcattc ctttcatctc actgaaaact aggc 600" attt ggattagatg tgtgaagtac tgtcttttgc caaaaacctc aaattacctg 543543 606" ttcttttcaa cgtagtgggt ttgtgcttgt ttggagatca gttcaaaaac tatctgtact 612" atctgtactg cctctgatgt taagatttta tgtatagcat aaggaagcta gctctgacta 618" tattttccta agaataaaga cctatttttg tagcatgtct taggatctcc ccaa 624" gaattattgt gggtgtcctc caattcatca ctcttcactt aacagctttt aagtagacac 630" ttggaatctt tagaggtctg tcgccctttg attatccata cattcgaagt aactagccaa 636" aaaa ttcctcaaga tatcctcagt tgcaatcaca ttactggaag atgaatagaa 642" taaatgtatt aggctggtct taatttttga tggaaatatt ctgttgtccc gtacttgcca 648" ttggatttga taaagttagt ggtaatttgg aaagaatcgg ggacttgcca atatatttgt 654" gggttttagc ttatacccct aggatttctt ggttgcggga cgagcagttt tggccacttc 660" catcaggaca agacttttta ttag tgcaggtttt agtttctatt ttggattaac 666" aacatttata ttgattatcg aaaagaagct ttcatcattt agtc ctggaagttt 672" gactttgagt gtgggagaag tcctaataaa ccattttgga aaaa aaaa Protein seguence: NCBI Reference Seguence: NP_001153145.1 LOCUS: NP_001153145 ACCESSION: NP_001153145 l rekq gtkvadsskg pdeakikall ldvt tgquyggpp gqqp 6" svgteifvgk iprdlfedel vplfekagpi wdlrlmmdpl tglnrgyafv tfctkeaaqe 12" avklynnhei rsgkhigvci svannrlfvg sipksktkeq ileefskvte gltdvilth 18" pddkkknrgf cfleyedhkt aaqarrrlms gkvkvwgnvg tvewadpied pdpevmakvk 24" vlfvrnlant vteeilekaf squklervk klkdyafihf derdgavkam eemngkdleg " enieivfakp pqurkerka qrqaaknqmy ddyyyygpph mppptrgrgr gypp 36" dyygyedyyd yygydyhnyr ggyedpyygy edfqvgargr ggrgargaap srgrgaappr 42" qrgg pgsargvrga qrgr gqgkgveagp dllq SAR1B OfibblSwnmfi:SAR1B Official Name: SAR1 homolog B (S. cerevisiae) 544544 Gene ID: 51128 Organism: Homo s Other Aliases: ANDD, CMRD, GTBPB, SARA2 Other Designations: 2310075M17Rik; GTP-binding protein B; GTP-binding protein SAR1 b; GTP-binding protein Sara; SAR1 a gene g 2 Nucleotide seguence: NCBI Reference Seguence: NM_001033503.2 LOCUS: NM_001033503 ACCESSION : NM_001033503 l gccggcccgg aaggggctga tgcgaactgg ggccacggca gccatcgcgc gttc 6" ggtctcctgg tgtacggcca acgccaagta ggggattgcg ttccctccag tcgcagagtt 12" ttgt cgcccaggct ggagtgaagt ggcacgatct actg caagctccgc 18" ctcccgggtt cacgccattc tcctgcctca cgag tagctgggac tacagaccct 24" atcagatttg gatatgtcct tcatatttga ttggatttac agtggtttca gcagtgtgct " acagttttta ggattatata ctgg taaactggta tttcttggat tggataatgc 36" aggaaaaaca acattgctac acatgctaaa agatgacaga cttggacaac atgtcccaac 42" attacatccc acttccgaag ccat tgctggcatg acgtttacaa cttttgatct 48" gggtggacat gttcaagctc gaagagtgtg gaaaaactac cttcctgcta tcaatggcat 54" tgtatttctg gtggattgtg cagaccacga aaggctgtta gagtcaaaag aagaacttga 60" ttcactaatg acagatgaaa ccattgctaa tgtgcctata ctgattcttg ggaataagat 66" cgacagacct gaagccatca gtgaagagag agag atgtttggtt tatatggtca 72" agga aaggggagta tatctctgaa agaactgaat ccct tagaagtttt 78" tagt gtgctcaaaa gacaaggtta cggagaaggc ttccgctgga tggcacagta 84" cattgattaa cacaaactca cattggttcc aggtctcaac gttcaggctt actcagagat 90" ttgattgctc aacatgcata acttgaattc cttt tgctggttat aaaacagatg 96" ttttttagat tattaatatt aaatcaactt aatttgaatg agaattgaaa actgattcaa 102" gtaagtttga gtatcacaat gttagctttc taattccata aaagtacttg gtttttacag 108" atct gacatcaccc cagcgccatt tgtaaagagc aactttccag cagtacattt 114" gaagcacttt ttaacaacat gaaactataa accatattta aaagctcatc atgttaaatt 545545 "20" ttttatgtac ttttctggaa ctagttttta aattttagat tatatgtcca cctatcttaa "26" gtgtacagtt aataattagc ttattcaatg attgcatgat gccttacagt tttcaataac "32" tttttttctt atgcaaacgt catgcaataa aacaaactct aatgtttggc atccttgttg "38" ggcaaatgtt tcatttaaat gtgtcttatc tagctagtat actctgaaaa tttgagtatt "44" taatattagg aagt ggttgttggg aaaggagatt ccttcagaat atttagaata "50" gcgtttaagg ctctcaaggc ttaggtattt agat ggttgtggtc agttgcacct "56" gataccatgt atccctgaat catgtgtatt tttattagta atgcagccac taccattgct "62" gatggggtct gtgtcctagt ccctgtggga ttggccttct gaagaaaagc acatttatgg "68" tacacaaggt tgattctcag tttggtaggc tctatagttc catcccagct gtctaattct "74" gaagtgctcc agtcttattg gtgcccaatg gggtaatctg tgcctcttcc gcaa "80" tgacaggcac cagtgtctcc acatttagat atattcccag tctctgtaca tttaacagta "86" gggg caagcaataa cctt gtgtgcagtt tcattgtaga gacatgtata "92" tctgaggaat aaaacagctt gttctgtgtc agtctgagat atgtggagat tttgttccta "98" agct cttttgtatt ctttggcata ttttatatcc tggtagaaga aaacaagtgc 204" ttgtttccaa tttt tcttatttgc tctcaggtag ctcc caaa 210" gactttttgt ggct tttttttttt tttttttttt gctctgtttc attttgtttt 216" agagacgggg tttcaccatg ttgtccaggc tggtctcaaa acct caagtgatct 222" gcccgccttg gcctcccaaa gttctgggat tataggcgtg catg cctgacctga 228" cttttgtttt tagatattat gcttgccatg tgatagggcc tgcaagcctc attgctaggc 234" ttactaagaa aattttagtt tttcaaaagc attataattt cctaagaaac tgaattcttt 240" ttttatatgt ttgagattcc catcattagt aatataagat gaaaggtaag tgccaaaaat 246" gtatttttaa agaccctcaa gatt tatcctgatt ataagccaag ttttatagta 252" tatttaacca attccatcaa tttt aaaa attagtgttt gcca 258" ttgtccatgt cagagttaca gtcctttttg tcattgataa tataacctat gaagcagata 264" aggattgagg aatatgactg gaaggaatta ctatttagct aagctgacaa ggtcgcttct 270" taagatgaca tttggtttca gtaatctgac tattctgttt tcactttcat cttctttcta 276" aatgaaaaca ctcc ctcccttcct ggaaacctca gtaacactat gggaaaagta 282" gaacatgaca ttgcagccta ttgatttctt cttccagata ggtttaaagt actccttaag 288" ttctgactaa atagaactaa ttaa aaataactgc ctcttgttca tgttatctgt 294" accttcaggg acctgccttt cttcaagtat ttcctagagt atctattatg atactgaaga 546546 300" agctaattat ttgtgttgta aatgggtata aatgaaaaaa aaacatactg gtttctctag 306" ccaggaaaaa tgctttctgg tgtaatatat cttgctccag catt ctaattgtaa 312" cactaggatc aaagaaacaa agtcactttg tggaccacag ctaaactgtg gatattttcc 318" caaagacata agatttttat ggcccgagcc aagg aagccatgtt taggagcaac 324" cagctttcct ttta gggggcagag ttcctgagcc agaggactta ctgtccagct 330" ttgagaacct ctccagagta tatgcactgg gtactgctct ttttcaagag aaccaaatta 336" agaggatggc aagaaacagt agaagcacag gaca actctgcatg tgcctgtgtg 342" aatgtgtgca tccatggagt cagg ctag tactggggac ataggctaat 348" tgtgtgtccc acactgcaag atgctagggc gtagttaaca ctgtggtata catacaaatc 354" aggcactgtc caaaagattt tttaaaatct aaagtctgaa atgtaaaaat ataaggtctc 360" aacccacttt tacactttta aagagatccc atacctgttt cactgactgc cgttaattac 366" acttttggat cacagctggt gata agtt tatctcagtg aatttttaga 372" atggagatta tagcattttt taattggaga acagacattt cctaaagtat atgaaaaaaa 378" attattcact gttggtttaa accagtatct ttgtatgagt gata tatgaacaca 384" gatactgcct gtgcagacct aaattttagt tttgtgtacc tggatccata tacaaatttt 390" ttgtggttta tagcataaaa gcagaacgtt gtttctttct tagttttcaa ccggctcatc 396" ttttgttttt gttttttgtt tttttgtttt ttgttttttt tgagatggag tcttgctttg 102" ttgcccaggc tagagtgcag tggcacaatc tctgcttact gcaacctcca cctccggggt 108" tcaagtgatt ctcctgcctc agcctcctga gtagctggga ctacaggcgc gcaccaccat 114" gcccggctaa tatt tttagtagag atggggtttc actatgttgg ccaggctggt 120" ctcgaactcc tgacctcgtg atctgcccac ctcagcctcc caaagtgctg ggattacagg 126" cgtgagccgc catgcccggc caattttcaa ctggcccata ctttatagtg atggaaagcg 132" cataaactac ttgtaaatca ttaaaatagg actg tgataatagt gtttcttgca 138" aaaa ttattttatt aactacattc aaaacccagc atttcacagg ttccatcatt Z44" agaaacagta tagttctagt taacatgatt tttc aggggaaagg tttacatttt 150" ctgaaactgt atttggtatg tgactcaatg tggtatttca gtcttgttag acat 156" gactgacgtt tgcaaggatt tattgccaag taaaatttga ccagagtgca ctgagaatag Z62" ctacataagg ggaaatctct caaaattcct tctgttcatt taatttggag catattgttt Z68" aaatcatttt aaacatatgt ttga aaaa aatcttcaag aaacaatgaa Z74" aaaatagaaa aaca taagtttctt aatgcaaaat taatagtgaa taaaatatag 547547 480" cctacattaa aagccagagg ctttgctata aatataagag tttagaaaaa cagtgtgctt 486" caattaagga ctaaattatc aaaactgcat gtttgttttt tcttttcttt tctttttttt 492" tgaa gtctcactct cagg ctggagtgca cgat cttggctcac 498" tgcaacctct gcttcccagg ttcaagcgat tctcgtgcct caccatcccg agtagctggg 504" attacaggtg caccacacca gtta atttttgtat ttttagtaga gacagggttt 510" cattatttgg ccaggctggt ctcgagctcc tgatcttaag tgatccactc gcctcggcct 516" cccaaagtgc tgggattata agcc actgtgccca gcctaaaact tgaa 522" gcttccggtc atttccatta ttatccttct tttgaaattc aagttagtgc acca 528" aataaaagaa gctc ttgggatatg tgactctgcc tctgtataaa gtgactggaa 534" ttttgttaaa accgtgtttc cacttctgaa ccctgttacc attccccctc acaaatcccc 540" acccaacacc tggattttaa agatcctcca gtgtcaaggg aagccacaga gtctattaaa 546" gaggcagttc tgaaccaatt aatttttgtc cttataattt ttaa atagctaata 552" tatttaatgg cactaattgt tgttcacggc tttcatcata cttttaaaca gaatccaaag 558" tattcaaagg aaagtaagcg aagttatcca aagccaactt tgtttcaggt gtgtcccctg 564" ccccaaatag attttagggc agaaatagaa aactgagttt acacagaact atttttggaa 570" aagctgcact ggagtagatg gattcttctt cagcatactt ttttgtttgt ttgtttgaga 576" tggagtcttg ctttgtcacc ggag tgcagtggtg tgatctccac tcactgcaac 582" ctccacctcc cagcttcaag tgattctcct acct tccaagtagc ttggattaca 588" ggcgtgcgcc accacagctg gctaatattt gtattgttag tagagacagg gtttcaccat 594" gttgtccagg cttgtcgaac ttctgacctc acgtgatcca cctgcctcag cctcccaaag 600" tgctagatta taggcgtgaa ccactgcgcc cggccagcat gcattttaaa ttag 606" atttagtttt aaatattttg gggtgaaagg cagt tctgtttttg acatacaggt 612" tttctttggg attgttttca ttttcaagta catg tcagaatggc caacttaacg 618" tgggtttctg tattccctgg tgttgctctt aacctgaact cataatcagt tgccatactg 624" aggcaagagc actcagggtg aacatagtca agttacttta aaagtgataa aagtgttttt 630" ccatggtgaa accttcagta tttggctgaa gtat gttgaagtgg tatattgatg 636" gtaagttgtt aatcactaac cttgtttgca cttttgtaca ccactgcttg cactaggatc 642" ttggtgtgaa ttttcaattg ttttacagtg tatacagatt attaaggata atttatataa 648" agatgtttct gtttaacttt gtgtgtttta caacaaagag taga tggttaaacg 654" tttttgaatt atat gttagtttga ttatgttcta ttatcttttc acctgccatg 660" aatttgagtg ttaggaaggg aaaa tactaatctg gtcttgaaga a 548548 Protein seguence: NCBI Reference Seguence: NP_001028675.1 LOCUS: NP_001028675 ION: N 28675 1 msfifdwiys gfssvqulg lykktgklvf lgldnagktt llhmlkddrl gqhvptlhpt 61 agmt fttfdlgghv qarrvwknyl vflv dcadherlle skeeldslmt 121 detianvpil ilgnkidrpe aiseerlrem fglygqttgk gsislkelna rplevfmcsv 181 egf rwmaqyid Official Symbol: CCDC47 Official Name: coiled-coil domain containing 47 Gene ID: 57003 Organism: Homo sapiens Other Aliases: GK001, MSTP041 Other Designations: coiled-coil domain-containing protein 47 Nucleotide seguence: NCBI Reference Seguence: NM_020198.2 LOCUS: NM_020198 ACCESSION : NM_020198 1 attatgtaat tttcccaaaa cctc gcctcagccg ggcgggagag agggaggtct 61 cgcgctttcc ccggggttgc gtcccgcccc gcaggctgcg cgcaggcgct gacgagccgc 121 tcgcattcta cgtaacggac ggcggaggct acgtgaagag aggcgcggcg tgactgagct 549549 18" acggttctgg ctgcgtccta tccg gggcagtaaa accgctgcga tcgcggaggc 24" ggcggccagg ccgagaggca ggccgggcag gggtgtcgga cgcagggcgc tgggccgggt " ttcggcttcg gccacagctt tttttctcaa ggtgcaatga aagccttcca cactttctgt 36" cttc ttgg gagtgtctct gaagccaagt ttgatgattt tgaggatgag 42" gaggacatag tagagtatga tgataatgac ttcgctgaat ttgaggatgt catggaagac 48" tctgttactg aatctcctca acgggtcata atcactgaag atgatgaaga tgagaccact 54" gtggagttgg aagggcagga tgaaaaccaa gaaggagatt ttgaagatgc agatacccag 60" gagggagata ctgagagtga tgat gatgaagaat ttgaaggtta tgaagacaaa 66" ccagatactt cttctagcaa aaataaagac ccaataacga ttgttgatgt tcctgcacac 72" aaca gctgggagag ttattatcta gaaattttga tggtgactgg tctgcttgct 78" tatatcatga attacatcat tgggaagaat aaaaacagtc gccttgcaca ggcctggttt 84" aacactcata tttt ggagagcaac tttactttag tgggggatga tggaactaac 90" gcca caagcacagg aaagttgaac caggagaatg agcacatcta taacctgtgg 96" tgttctggtc gagtgtgctg tgagggcatg cttatccagc tgaggttcct caagagacaa "02" gacttactga atgtcctggc ccggatgatg aggccagtga aagt gcaaataaaa "08" gtaaccatga atgatgaaga catggatacc tacgtatttg ctgttggcac acggaaagcc "14" ttggtgcgac aaga gatgcaggat ttgagtgagt tttgtagtga taag "20" tctggagcaa agtatggact gccggactct atcc tgtcagagat gggagaagtc "26" ggaa tgatggatac aaagatggtt cactttctta cacactatgc tgacaagatt "32" gaatctgttc atttttcaga ccagttctct ggtccaaaaa ttatgcaaga ggaaggtcag "38" aagc tacctgacac taagaggaca ctgttgttta catttaatgt gcctggctca "44" ggtaacactt acccaaagga tatggaggca ctgctacccc tgatgaacat ggtgatttat "50" tctattgata aagccaaaaa gttccgactc aacagagaag gcaaacaaaa taag "56" aaccgtgccc aaga gaacttcttg aaactgacac atgtgcaaag acaggaagca "62" gcacagtctc ggcgggagga gaaaaaaaga gcagagaagg agcgaatcat gaatgaggaa "68" gatcctgaga aacagcgcag gctggaggag gctgcattga ggcgtgagca aaagaagttg "74" gaaaagaagc aaatgaaaat gaaacaaatc aaagtgaaag ccatgtaaag ccatcccaga "80" gatttgagtt ccac ctgtaagctc tgaattcaca ggaaacatga aaaacgccag "86" tccatttctc aaat ttcagacagt cttgggcaac tgagaaatcc ttatttcatc "92" atctactctg tttggggttt ggggttttac agagattgaa gatacctgga aagggctctg 550550 198" tttcaagaat ttttttttcc agataatcaa attattttga ttattttata aaaggaatga 204" tctatgaaat ctgtgtaggt tatt ttaaaaatta taatacaaat catcagtgct 210" tttagtactt cagtgtttaa agaaataccg tgaaatttat aggtagataa ccagattgtt 216" gctttttgtt taaaccaagc agttgaaatg gctataaaga ctgactctaa accaagattc 222" tgcaaataat gattggaatt gcacaataaa ttga tgttttcttg tatgtctaca 228" ttaaacttga gaaaaagtaa aaattagaac actgtatgta gtaatgaaat ttcagggacc 234" cagaacataa tgtagtatat gtttttaggt gggagatgct gataacaaaa ttaataggaa 240" gtctgtaggc attaggatac tgacatgtac atggaaaatt acag gagcatcatt 246" ttttccttac ctgataccac gaaccagtga caacgtgaat gctgtatttt aagtggttgt 252" atgtttattt tcttgagtaa caaatgcatg aaaaattaat ccta ggtaagatca 258" ttggtctgtg tgaaatcaca aatgtttttt ccttcttggt tgctgcagcc tggtggatgt 264" tcatggagaa gctctgttct ctatattatg gctgtgtgcc gttgcttctc cttt 270" ttcc acagttgagg ctgggtatgt aaag aaatggccat gaatatgtgt 276" aagtatactt ttgaaaatga gctttcctaa gaga gttctttcca cctcttgcgg 282" aaccaactct tggaggagag gcccatgtat ctgcacgagc acttagcttg ttcagatctc 288" ttat aaatgcttct taccaagaaa gcatttttag gtcattgctt gtaccaggta 294" gccg gggatgggta agggttgggt tttctggtgg gagtggggtg gtgggtattt 300" tttgttgatg ctttagtgca ttct gaggcaataa caagttgctg tgaaaacagc 306" tgct gcctttgtaa ctgcatggaa caca tgggtttttc tccaagttaa 312" tacagaaata tgtaaactga gagatgcaaa tgtaatattt ttaacagttc atgaagttgt 318" tattaaaata actaacataa aacttaatta ctttaatatt atataattat agtagtggcc 324" ttgttttaca aacctttaaa ttacatttta gaaatcaaag ttgatagtct tagttatctt 330" ttgagtaaga aaagctttcc taaagtccca tacatttgga ccatggcagc taattttgta 336" acttaagcat tcatatgaac tacctatgga catctattaa agtgattgac aaaatctcaa 342" aaaaaaaaaa aaaaaaaaaa aaaaa Protein ce: NCBI Reference Seguence: NP_064583.2 LOCUS: NP_064583 ACCESSION: N P_064583 551551 1 mkafhtfcvv llvfgsvsea kfddfedeed iveyddndfa efedvmedsv tequrviit 6" dd d ttv l qu nq g df dadtq g dt 3 pydd f gy dkpd tsssknkdpi 12" tivdvpahlq nswesyylei lmvtgllayi mnyiigknkn srlaqawfnt hrellesnft 18" lvgddgtnke atstgklnqe nehiynlwcs grvccegmli udl lnvlarmmrp 24" vsdqvqikvt mndedmdtyv favgtrkalv rlqkequls pksg akyglpdsLa " ilsemgevtd mvhf lthyadkies vhfsdqfsgp kimqeegqpl k1pdtkrt11 36" ftfnvpgsgn typkdmeall plmnmviysi dkakkfrlnr egquadknr arveenflkl 42" aaq srr kkra k rimn dp qurl aa lrreqkklek kqumkqikv 48" kam PSM D12 Official Symbol: PSMD12 Official Name: proteasome (prosome, macropain) 268 subunit, non-ATPase, 12 Gene ID: 5718 Organism: Homo sapiens Other Aliases: an5, p55 Other Designations: 268 proteasome non-ATPase regulatory subunit 12; 268 proteasome tory subunit RPN5; 268 proteasome tory subunit p55 Nucleotide seguence: NCBI Reference Seguence: NM_002816.3 LOCUS: NM_002816 ACCESSION : NM_002816 XM_942494 XM_946044 XM_946047 XM_946049 XM_946052 XM_946055 XM_946058 1 gggt gcaa cttccggtgt gggtgacgag tggtggccga agcaggggga 61 cagcaaggga cgctcaggcg gggaccatgg cggacggcgg ctcggagcgg gctgacgggc 121 gcatcgtcaa gatggaggtg gactacagcg ccacggtgga tcagcgccta tgtg 181 cgaagctagc agga agacttcaag aagtcattga aacccttctc tctctggaaa 241 agcagactcg tactgcttcc gtat cgacatcccg tatcttagtt gcagtagtga 301 agatgtgcta tgaggctaaa gaatgggatt tacttaatga aaatattatg cttttgtcca 552552 36" aaaggcggag tcagttaaaa caagctgttg ccaaaatggt tcaacagtgc tgtacttatg 42" ttgaggaaat cacagacctt cctatcaaac ttcgattaat tcta cgaatggtta 48" ccgaaggcaa gatttatgtt gaaattgagc gtgcgcgact gactaaaaca acta 54" taaaagaaca aaatggtgat gagg cagcctccat tttacaggag ttacaggtgg 60" aaacctacgg gtcaatggaa aagaaagagc aatt tattttggag caaatgaggc 66" tctgcctagc ggat tacattcgaa cacaaatcat cagcaagaaa attaacacca 72" aatttttcca ggaagaaaat aaat taaagttgaa gtactataat ttaatgattc 78" agctggatca acatgaggga tcctatttgt ctatttgtaa gcactacaga gcaatatatg 84" atactccctg tatacaggca gaaagtgaaa aatggcagca ggctctgaag agtgttgtac 90" tctatgttat cctggctcct tttgacaatg aacagtcaga tcac cgaataagtg 96" agaa gttagaagaa attcccaaat acaaggatct tttaaagctt tttaccacaa "02" tggagttgat gcgttggtcc acacttgttg aggactatgg aatggaatta agaaaaggtt "08" cccttgagag tcctgcaacg gatgtttttg gttctacaga ggaaggtgaa aaaaggtgga "14" aagacttgaa gaacagagtt gttgaacata atattagaat aatggccaag tattatactc "20" caat gaaaaggatg gcacagcttc tggatctatc tgttgatgag tccgaagcct "26" ttctctcaaa tctagtagtt aacaagacca tctttgctaa agtagacaga ttagcaggaa "32" ttatcaactt ccagagaccc ccaa tatt aaatgactgg tctcagaaac "38" tgaactcatt aatgtctctg gttaacaaaa ctacgcatct catagccaaa atga "44" tacataatct acaataaggg tcttagtgct ttagaaaaaa attg gaagtcatta "50" aaaaaagact gttataatgg tgtatatgtt ggggtttttt ttctaagctt ctttgtctta "56" aattttaaaa tagtgaatat gtttgagact ccctttgacc tttcagttcc ccaagttcat "62" tgttaacttt gcatttgcaa caaa aatacagatt tctgtcgtct gaatacacaa "68" aaagttgtgt cataacttac ccagatatgt ttttctatca tttgaaacct ttttagctac "74" tgtttgtttt cattcaacta acaaacatat tccaataata aaagcagtat atacataaaa "80" aaaaaaaaaa aaaa Protein seguence: NCBI Reference ce: NP_002807.1 LOCUS: NP_002807 553553 ACCESSION: NP_002807 XP_947587 XP_951137 XP_951140 XP_951142 XP_951145 XP_951148 XP_951151 1 madggserad grivkmevdy satvdqupe caklakegrl qevietllsl ekqtrtasdm 6" vstsrilvav vkmcyeakew dllnenimll lkqa vakquqcct yveeitdlpi 12" klrlidtlrm vtegkiyvei erarltktla gdvk eaasilqelq vetygsmekk 18" ervefileqm rlclavkdyi rtqiiskkin tkffqeente klklkyynlm iqldqhegsy 24" lsickhyrai ydtpciqaes ekwqqalksv apfd vhri sgdkkleeip " kykdllklft tmelmrwstl vedygmelrk atdv fgsteegekr wkdlknrvve 36" hnirimakyy tritmkrmaq lldlsvdese aflsnlvvnk tifakvdrla giinfqrpkd 42" pnnllndwsq klnslmslvn ktthliakee mihnlq L5F1 Official : ATP5F1 Official Name: ATP synthase, H+ transporting, mitochondrial Fo complex, subunit B1 Gene ID: 515 Organism: Homo sapiens Other Aliases: RP11-552M11.5, P|G47 Other Designations: ATP synthase B chain, mitochondrial; ATP synthase subunit b, mitochondrial; ATP synthase, H+ transporting, mitochondrial F0 complex, subunit B1 ; ATP synthase, H+ transporting, mitochondrial F0 complex, subunit b; ATPase t b; H+-ATP synthase subunit b; cell proliferation- inducing protein 47 Nucleotide seguence: NCBI nce Seguence: 688.4 LOCUS NM_001688 ACCESSION NM_001688 1 actcccgggc cgccgggggc actagggggg gtggggtttc cttccgcatc tccacggttc 61 caactccaac ctagactcaa actggacgcc ggccggagac tccgctccgg cagcaaaccc 121 cacgtggtgc acctctgagc ccct ctcccgaggg aaccgcaact ctacttctcg 181 cgagaattgc ttctatggct ccatcctgct ttccggctgt catg cgataggctc 554554 24" tcagcgttac ttgactcttc tcgcgataat tttttttaaa aatctcccaa ggaaagttga " aggaagagta caaaattttc atctcgcgag acttgtgagc ggccatcttg ccct 36" gacagattct cctatcgggg tcacagggac gctaagattg ctacctggac tgac 42" catgctgtcc cgggtggtac tttccgccgc cgccacagcg gccccctctc tgaagaatgc 48" agccttccta ggtccagggg tattgcaggc aacaaggacc tttcatacag ggcagccaca 54" ccttgtccct gtaccacctc ttcctgaata cggaggaaaa gttcgttatg gactgatccc 60" tgaggaattc ttccagtttc tttatcctaa aactggtgta acaggaccct atgtactcgg 66" aactgggctt atcttgtacg ctttatccaa agaaatatat gtgattagcg cctt 72" ccta tcagtactag gtgtaatggt ctatggaatt aaaaaatatg gtccctttgt 78" tgcagacttt aaac tcaatgagca aaaacttgcc caactagaag aggcgaagca 84" ggcttccatc caacacatcc agaatgcaat tgatacggag aagtcacaac aggcactggt 90" tcagaagcgc cattaccttt ttgatgtgca aaggaataac attgctatgg ctttggaagt 96" tacttaccgg gaacgactgt atagagtata taaggaagta aagaatcgcc tggactatca "02" tatatctgtg cagaacatga tgcgtcgaaa ggaacaagaa cacatgataa attgggtgga "08" gaagcacgtg gtgcaaagca tctccacaca gcaggaaaag gagacaattg ccaagtgcat "14" ccta aagctgctgg caaagaaggc tcaagcacag ccagttatgt aaatgtatct "20" attg agacagctag aaacagttga ctgactaaat ggaaactagt ctatttgaca "26" aagtctttct gtgt ctactgaagt tatagtttac ccttcctaaa aatgaaaagt "32" cata tagtgagaga acgaaatctc tatcggccag tcagatgttt cttc "38" ttgctctgcc tttgagttgt tccgtgatca cttctgaata agcagtttgc ctttataaaa "44" acttgctgcc tgactaaaga ttaacaggtt atagtttaaa tttgtaatta attctaccat "50" cttgcaataa aatt gaatgaaaca ttca agttgtataa ttctctgaaa "56" tactcagctt ttgtcatatg ggtaaaaatt gtca ttgaactact gtcttgttta "62" tgagaccatt cagtggtgaa ctgtttctgg ctgataggtt atgagatatg taaagctttc "68" tagtactctt aaaataacta aatggagtat tatatatcaa ttcatatcat tgactttatt "74" attttagtag tata gaaaatatta tggactcaga gtgtcataaa atcactctta "80" agaatccatg cagcaggcca ggcacagtgg ctcacacctg taatgcctgc actttggaag "86" acag gcggatcact cagg agtttgaaac cagccaggcc gtga "92" aaccctgtct ctactaaaaa tacaaaaggt tagccgggca tggtggcagg cgcctgtaat "98" cccagctact caggaggctg aggcaggaga attgcttgaa cgcaggaggc aaaggttgca 555555 2041 gtgagctgag atcacgccac tgcactccag cctgggcaac agacctcgac tccatctaga 2101 aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_001679.2 LOCUS NP_001679 ACCESSION NP_001679 1 mlsrvvlsaa ataapslkna aflgpgvlqa trtfhtgqph lvpvpplpey ggkvryglip 61 eefqulypk tgvtgpyvlg tglilyalsk eiyvisaetf talsvlgvmv ygikkygpfv 121 adfadklneq klaqleeakq asiqhiqnai qalv qkrhylfdvq rnniamalev 181 yrvy kevknrldyh rrk eqehminwve khvvqsistq qeketiakci 241 adlkllakka qaqpvm Official Symbol: CMPK1 Official Name: cytidine monophosphate (UMP-CMP) kinase 1, cytosolic m: 51727 Organism: Homo s Other Aliases: RP1 1-511l2.1, CMK, CMPK, UMK, UMP-CMPK, UMPK Other Designations: UMP-CMP kinase; UMP/CMP kinase; late kinase; deoxycytidylate kinase; e monophosphate kinase; uridine monophosphate/cytidine monophosphate kinase Nucleotide seguence: NCBI Reference Seguence (variant 1): NM_016308.2 LOCUS NM_016308 ACCESSION NM_016308 1 gacagggccg cggacgcccg ggcagccacg gcggcggggc cgcggcgggc gccggctcag 61 cccgcccctt tctcccgccg cctccccgcc ccgccccgcg ccgcgccggc cgctgtcagc 121 tccctcagcg tccggccgag tgta tgctgagccg ctgccgcagc gggctgctcc 181 acgtcctggg ccttagcttc ctgctgcaga cccgccggcc gattctcctc tgctctccac 241 gtctcatgaa ggtc gtgttcgtcc tcggcggccc cggcgccggc aaggggaccc 556556 " agtgcgcccg catcgtcgag ggct acacacacct ttctgcagga gagctgcttc 36" gtgatgaaag gaagaaccca gattcacagt atggtgaact tattgaaaag tacattaaag 42" aaggaaagat agtt gagataacca tcagtttatt aaagagggaa atggatcaga 48" caatggctgc caatgctcag aagaataaat tcttgattga tgggtttcca agaaatcaag 54" acaaccttca aggatggaac aagaccatgg atgggaaggc agatgtatct ttcgttctct 60" tttttgactg taataatgag atttgtattg aacgatgtct tgagagggga aagagtagtg 66" gtga tgacaacaga gagagcttgg aaaagagaat tcagacctac cttcagtcaa 72" caat tattgactta tatgaagaaa tggggaaagt caagaaaata gatgcttcta 78" aatctgttga tgaagttttt gatgaagttg tgcagatttt tgacaaggaa ggctaattct 84" aaacctgaag gcatccttga aatcatgctt tgct ttgatagctg ctatcatgac 90" ccctttttaa ggcaattcta atctttcata actacatctc aattagtggc tggaaagtac 96" atggtaaaac aaagtaaatt tttttatgtt cttttttttg gtcacaggag tagacagtga "02" attcaggttt aacttcacct tggt gctcaccaaa gtat cagctatttt "08" ttttaaaatt caaaaagaat atccctttta tagtttgtgc cttctgtgag caaaactttt "14" tagtacgcgt atatatccct ctagtaatca caacatttta ggatttaggg atacccgctt "20" cctctttttc ttgcaagttt tcca accttaagtg aatttgtgga ccaaatttca "26" aaggaacttt ttgtgtagtc agttcttgca caatgtgttt ggtaaacaaa ctcaaaatgg "32" attcttagga tagt gtttattaaa taactgacca tttgctgtag aaagatgaga "38" aaacttaagc tttgttttac tacaacttgt acaaagttgt atgacagggc atattctttg "44" cttccaagat ttgggttggg ggcactaggg gttcagagcc tggcagaatt gtcagcttta "50" gtctgacata gggt atggggcaag gatcacatct aatgcttgtg ttccttatac "56" tctattatat agtgttattc atgattcagc tgatcttaac aaaattcgta gcagtggaac "62" cttgaaatgc atgtggctag atttatgcta aaatgattct gcat aaca "68" cttcaaaggt ttttttttgt ttgttttcta gacttaataa aagcttagga ttaattagaa "74" gaagcaatct agttaaattt cccatttgta tttc ttgaatactt ttttcatagt "80" tatttgttta aaaagattta attg cactttggtc agaaaaataa taaatatatc "86" ttataaatgt ttgattccct tccttgctat ttttattcag tagatttttg tttggcatca "92" tgttgaagca ccgaaagata aatgattttt aaaaggctat agagtccaaa ggaatattct "98" ccaa cttt ctct gaggaatttg ttttcgcctt actttttttt 204" cttctgtcac aagt ggtatccgag gttcttaata tgagatttaa aatcttaaaa 557557 210" tgtttcttat tttcagcact attt ggtacacagg gtcaaatagg gcaaataatt 216" ttgtctttgt ataatagatt tgatatttaa agtcactgga aataggacaa gttaatggat 222" gtttttatat tttaatagaa tcatttattt ctatgtgtta tgaaattcac ataa 228" atttttcaac atacttgcca ttagaaaaca aagtattgct aagtactata acatattggc 234" aatt catattgaga ttatcttggt ttcttggaag gaat gagttcttat 240" ctagtgttgc aggccagcaa atacagaggt ggtttaatca aacagctcta agca 246" aaga ctaaggtttc gagagcattc ctactcacat aagtgaagaa atctgtcaga 252" taggaatcta aatatttata gtgagattgt gaaagcaacc ttaaagtttt gaagaagact 258" gatgagacta ggtgctttgc ttcctttcat caggtatctt tctgtggcat ttgagaacag 264" aaaccaagaa acatggtaat tactaaatta tgaggctttg ctttttgttt gcttttaagt 270" agaaaaacat gttggcaaca ttgagttttg gagttgattg agataatatg acttaactag 276" ttttgtcatt ccatttgtta aagatacagt caccaagaat gttttgagtt aaga 282" ccccaattta agccttgctt atttttaaat tatttccatt tgtt ggatgtatat 288" cagttattta gtaaataatc tcaataaatt ttgtgctgtg gcctttgcta aaaaaaaaaa 2941 aaaaaaaaaa aaaaaa Protein sequence (variant 1): NCBI Reference Seguence: NP_057392.1 LOCUS 392 ACCESSION NP_057392 1 mlsrcrsgll fllq trrpillcsp rlmkplvva gkgt qcarivekyg 61 ythlsagell rderknpdsq ygeliekyik egkivpveit isllkremdq tmaanaqknk 121 flidgfprnq dnlqgwnktm dgkadvsfvl ffdcnneici erclergkss grsddnresl 181 ekriqtqus tkpiidlyee mgkvkkidas ksvdevfdev vqifdkeg Official Symbol: COXGBf Official Name: cytochrome c oxidase subunit Vlb polypeptide 1 (ubiquitous) m: 1340 Organism: Homo sapiens Other Aliases: COXSB, COXG, COXVlbf 558558 Other Designations: COX Vlb-1 ; cytochrome c oxidase t 681 Nucleotide seguence: NCBI Reference Seguence: NM_001863.4 LOCUS NM_001863 ACCESSION NM_001863 l tgggcgtggc ttgaatgact tcagtggcct cctcctggga gggagctgaa gccgctcgca 6" agactcccgt agtccccacc tctctcagct tccggctggt agtagttccg cttcctgtcc 12" gactgtggtg tctttgctga gggtcacatt cagg ttgaatccgg ggtgccttta 18" ggattcagca ccatggcgga agacatggag accaaaatca acaa gaccgcccct 24" tttgacagcc gcttccccaa ccagaaccag actagaaact gctggcagaa ctacctggac " ttccaccgct gtcagaaggc aatgaccgct aaaggaggcg atatctctgt gtgcgaatgg 36" taccagcgtg tgtaccagtc cctctgcccc acatcctggg tcacagactg ggatgagcaa 42" cgggctgaag gcacgtttcc cgggaagatc tgaactggct ccct ttcctctgtc 48" cctt ctcccaggat ggtgaagggg gacctggtac ccagtgatcc cagg 54" atcctaaatc atgacttacc tgctaataaa aactcattgg aaaagtgaga Protein seguence: NCBI Reference Seguence: NP_001854.1 LOCUS NP_001854 ACCESSION NP_001854 l maedmetkik nyktapfdsr trnc quyldfhrc qkamtakggd isvcequrv 61 yqslcptswv tdwdeqraeg tfpgki CTSA al Symbol: CTSA Official Name: cathepsin A Gene ID: 5476 Organism: Homo sapiens Other s: RP3-337018.1, GLB2, GSL, NGBE, PPCA, PPGB 559559 Other ations: beta-galactosidase 2; beta-galactosidase protective protein; carboxypeptidase C; carboxypeptidase L; carboxypeptidase Y-Iike se; carboxypeptidase-L; deamidase; lysosomal carboxypeptidase A; lysosomal protective protein; protective protein cathepsin A; y kininase Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_000308.2 LOCUS NM_000308 ACCESSION NM_000308 l agagtgcacc cgaatccacg ggctcggagg cagcagccat ctctcggcca tagggcaggc 6" cagctggcgc cgggggctat tttgggcggc gggcaatgat ggtgaccgca aggcgacctt 12" gtaaggcatt tcccccctga ctcccttccc cgagcctctg cccgggggtc ctagcgccgc 18" tttctcagcc atcccgccta caacttagcc aaca tctg atcgcgtgcg 24" cccgggctac gatctgcgag gcccgcggac cttgacccgg cattgaccgc caccgccccc " caggtccgta gggaccaaag aaggggcggg aggaagactg tcacgtggcg ccggagttca 36" cgtgactcgt acacatgact tccagtcccc gggcgcctcc tggagagcaa ggacgcgggg 42" agat gatccgagcc gcgccgccgc cgctgttcct gctgctgctg ctgctgctgc 48" tgctagtgtc ctgggcgtcc cgaggcgagg cagcccccga ccaggacgag atccagcgcc 54" ggct ggccaagcag ccgtctttcc gccagtactc cggctacctc aaaggctccg 60" gctccaagca cctccactac tggtttgtgg agaa ggatcccgag aacagccctg 66" tggtgctttg gctcaatggg ggtcccggct gcagctcact agatgggctc ctcacagagc 72" atggcccctt cctggtccag ccagatggtg tcaccctgga gtacaacccc tattcttgga 78" atctgattgc caatgtgtta tacctggagt ccccagctgg cttc tcctactccg 84" atgacaagtt ttatgcaact aatgacactg aggtcgccca gagcaatttt cttc 90" tctt cttt ccggagtaca agaacaacaa acttttcctg gaga 96" gctatgctgg catctacatc cccaccctgg ccgtgctggt catgcaggat cccagcatga 102" accttcaggg gctggctgtg ggcaatggac tctcctccta tgagcagaat gacaactccc 108" tggtctactt tgcctactac catggccttc tggggaacag gctttggtct tctctccaga 114" gctg ctctcaaaac aagtgtaact tctatgacaa caaagacctg gaatgcgtga 120" ccaatcttca ggaagtggcc cgcatcgtgg gcaactctgg cctcaacatc tacaatctct 126" atgccccgtg tgctggaggg gtgcccagcc attttaggta tgagaaggac actgttgtgg 132" tccaggattt gggcaacatc ttcactcgcc tgccactcaa gtgg catcaggcac 560560 "38" tgctgcgctc aggggataaa gtgcgcatgg accccccctg caccaacaca acagctgctt "44" ccacctacct caacaacccg tacgtgcgga aggccctcaa catcccggag ccac "50" aatgggacat cttt ctggtaaact tacagtaccg ccgtctctac cgaagcatga "56" actcccagta tctgaagctg cttagctcac agaaatacca gatcctatta tataatggag "62" atgtagacat ggcctgcaat ttcatggggg atgagtggtt tgtggattcc ctcaaccaga "68" agatggaggt gcagcgccgg ccctggttag tgaagtacgg ggacagcggg gagcagattg "74" ccggcttcgt gaaggagttc tcccacatcg tcac gatcaagggc gccggccaca "80" tggttcccac cgacaagccc ctcgctgcct tcaccatgtt cttc ctgaacaagc "86" agccatactg atgaccacag caaccagctc cacggcctga tgcagcccct cccagcctct "92" cccgctagga gagtcctctt ctaagcaaag tgcccctgca gttc tgccgccagg "98" actgccccct tcccagagcc ctgtacatcc cagactgggc tctc ccatagacag 204" cctgggggca agttagcact ttattcccgc agcagttcct gaatggggtg gcctggcccc 210" ttctctgctt aaagaatgcc ctttatgatg cactgattcc atcccaggaa cccaacagag 216" ctcaggacag cccacaggga ggtggtggac ggactgtaat tgatagattg attatggaat 222" taaattgggt tcaa aaaaaaaaaa aaaa Protein sequence (variant 1): NCBI Reference Seguence: NP_000299.2 LOCUS NP_000299 ACCESSION NP_000299 l mtssprappg qurggaemi raappplfll lllllllvsw asrgeaapdq deiqunga 6" kqpsfrqysg ylkgsgskhl hwavesqkd penspvvlwl nggpgcssld glltehgpfl 12" ley npyswnlian vlylespagv gfsysddkfy atndtevaqs nfealqdffr 18" lfpeyknnkl fltgesyagi yiptlavlvm qdpsmnlqgl avgnglssye qndnslvyfa 24" yyhgllgnrl wsslqthccs ancnfydnk dlecvtnlqe varivgnsgl niynlyapca " ggvpshfrye ulg niftrlplkr mwhqallrsg ppct tyLn 36" alni peqlpqwdmc qyrr lyrsmnsqyl kllssqkyqi vdma 42" cnfmgdewfv dslnqkmevq rrpwlvkygd sgeqiagka efshiaflti kgaghmvptd 48" kplaaftmfs rflnkqpy EPHX1 561561 Official Sym bol: EPHX1 Official Name: epoxide hydrolase 1, microsomal (xenobiotic) Gene ID: 2052 Organism: Homo sapiens Other Aliases: EPHX, EPOX, HYL1, MEH Other Designations: epoxide hydratase; epoxide hydrolase 1 Nucleotide seguence: (variant 1) NCBI Reference Seguence: NM_000120.3 LOCUS NM_000120 ACCESSION 120 l cagaaggccg tggggagtgg gggccagtgc ctgcagcctg ccctgcctct ctcacaggcc 6" cttagagcat cgccaggtgc agagctccac agctctcttt cccaaggagt aatcagaggg 12" cgtg gagcctggtg gacaggtgaa agcactggga tctttctgcc cagaaagggg 18" gcac atcc tagagggaag cgacagcagt gcttctccct gtgctgaggt 24" acaggagcca tgtggctaga aatcctcctc acttcagtgc tgggctttgc catctactgg " ttcatctccc gggacaaaga ggaaactttg gaag atgggtggtg ggggccaggc 36" acgaggtccg cagccaggga ggacgacagc atccgccctt tcaaggtgga aacgtcagat 42" gaggagatcc acgacttaca ccagaggatc gataagttcc gtttcacccc acctttggag 48" gacagctgct tccactatgg cttcaactcc aactacctga agaaagtcat ctcctactgg 54" cggaatgaat ttgactggaa gaagcaggtg gagattctca accc tcacttcaag 60" actaagattg aagggctgga catccacttc atccacgtga agccccccca cgca 66" ggccataccc cgaagccctt gctgatggtg cacggctggc ccggctcttt ctacgagttt 72" tataagatca tcccactcct cccc aagaaccatg gcga tgagcacgtt 78" tttgaagtca tctgcccttc catccctggc tatggcttct cagaggcatc ctccaagaag 84" gggttcaact cggtggccac cgccaggatc ttttacaagc tgatgctgcg gctgggcttc 90" caggaattct acattcaagg aggggactgg gggtccctga tctgcactaa tatggcccag 96" ctggtgccca gccacgtgaa aggcctgcac ttgaacatgg ctttggtttt aagcaacttc 102" ctga tcct gggacagcgt ttcgggaggt ttcttggcct cactgagagg 108" gatgtggagc tgctgtaccc cgtcaaggag ttct acagcctgat gagggagagc 562562 "l4" ggctacatgc agtg gcct gacaccgtag gctctgctct gaatgactct "20" cctgtgggtc tggctgccta tattctagag aagttttcca cctggaccaa tacggaattc "26" cgatacctgg aggatggagg cctggaaagg tccc tggacgacct gctgaccaac "32" gtcatgctct actggacaac aggcaccatc atctcctccc agcgcttcta caaggagaac "38" ctgggacagg gctggatgac gcat gagcggatga aggtctatgt gcccactggc "44" ttctctgcct tcccttttga gctattgcac acgcctgaaa tgag gttcaagtac "50" ccaaagctca tctcctattc ggtt cgtgggggcc actttgcggc ctttgaggag "56" ccggagctgc tcgcccagga catccgcaag ttcctgtcgg tgctggagcg accc "62" acccctctcc ccccgcctgc cacctccccc cacaagtgcc ctccaggctt ttcttgggga "68" agatacccct agga atgagtttgc ctccgtcccc tgcccatgct gggagcccac "74" gctcaccccc tcacccctcc aagctcactc cccaaccccc aactccgtgt ggtaagcaac 1801 atggctttga tgataaacga tcta aaaaaaaaaa aaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_000111.1 LOCUS NP_000111 ACCESSION NP_000111 l mwleilltsv lgfaiwais rdkeetlple dgwwgpgtrs aareddsirp fkvetsdeei 6" hdlhqridkf rftppledsc fhygfnsnyl kkvisywrne fdwkkqveil nryphfktki l2" egldihfihv kppqlpaght pkpllmvhgw pgsfyefyki pknh glsdehvfev 18" icpsipgygf seasskkgfn svatarifyk lmlrlgfqef yiqggdwgsl ictnmaqlvp 24" shvkglhlnm alvlsnfstl tlllgqrfgr flglterdve llypvkekvf yslmresgym " pdtv gsalndspvg laayilekfs twtntefryl edgglerkfs lddlltnvml 36" ywttgtiiss qrfykenlgq gwmtqkherm kvyvptgfsa fpfellhtpe kwvrfkypkl 42" isysymvrgg hfaafeepel laqdirkfls vlerq ATP5B Official Sym bol: ATPSB Official Name: ATP se, H+ transporting, mitochondrial F1 complex, beta polypeptide Gene ID: 506 563563 Organism: Homo sapiens Other Aliases: ATPMB, ATPSB Other Designations: ATP synthase t beta, mitochondrial; mitochondrial ATP synthase beta t; mitochondrial ATP tase, beta subunit Nucleotide seguence: NCBI Reference ce: NM_001686.3 LOCUS NM_001686 ACCESSION NM_001686 l agttcaccca atggacctgc ctactgcagc gtaggcctcg cctcaacggc aggagagcag 6" gcggctgcgg ttgctgcagc cttcagtctc cacccggact tgtt ggggtttgtg l2" gtgg ccgctgctcc ggcctccggg gccttgcgga gactcacccc ttcagcgtcg 18" ctgcccccag ctcagctctt actgcgggcc gctccgacgg cggtccatcc ggac 24" tatgcggcgc aaacatctcc ttcgccaaaa gcaggcgccg ccaccgggcg catcgtggcg " gtcattggcg cagtggtgga gttt gatgagggac taccaccaat tctaaatgcc 36" ctggaagtgc aaggcaggga gaccagactg gttttggagg tggcccagca tttgggtgag 42" agcacagtaa ggactattgc tatggatggt acagaaggct tggttagagg ccagaaagta 48" ctggattctg gtgcaccaat caaaattcct gttggtcctg tggg cagaatcatg 54" aatgtcattg gagaacctat tgatgaaaga ggtcccatca aaaccaaaca atttgctccc 60" attcatgctg aggctccaga gttcatggaa atgagtgttg agcaggaaat tctggtgact 66" aagg ttgtcgatct gctagctccc tatgccaagg gtggcaaaat tgggcttttt 72" ggtggtgctg gagttggcaa gactgtactg atcatggagt taatcaacaa tgtcgccaaa 78" gcccatggtg gttactctgt gtttgctggt gttggtgaga ggacccgtga aggcaatgat 84" ttataccatg aaatgattga atctggtgtt atcaacttaa aagatgccac ctctaaggta 90" gcgctggtat atggtcaaat gaatgaacca cctggtgctc gtgcccgggt agctctgact 96" gggctgactg tggctgaata cttcagagac caagaaggtc tact gctatttatt 102" gataacatct ttcgcttcac ccaggctggt tcagaggtgt ctgcattatt gggccgaatc 108" ccttctgctg tgggctatca gcctaccctg gccactgaca ctat gcaggaaaga 114" attaccacta ccaagaaggg atctatcacc tctgtacagg atgt gcctgctgat 120" gacttgactg accctgcccc tgctactacg tttgcccatt tggatgctac actg 126" tcgcgtgcca ttgctgagct gggcatctat ccagctgtgg atcctctaga ctccacctct 564564 "32" cgtatcatgg atcccaacat tgttggcagt gagcattacg atgttgcccg tggggtgcaa "38" aagatcctgc aggactacaa ccag gatatcattg ccatcctggg tatggatgaa "44" gagg aagacaagtt gaccgtgtcc cgtgcacgga aaatacagcg tttcttgtct "50" cagccattcc aggttgctga ggtcttcaca ggtcatatgg ggaagctggt acccctgaag "56" gagaccatca aaggattcca gcagattttg gcaggtgaat atgaccatct cccagaacag "62" gccttctata tggtgggacc cattgaagaa gctgtggcaa aagctgataa gctggctgaa "68" gagcattcat Cgtgaggggt ctttgtcctc tgtactgtct ctctccttgc ccca "74" aaaagcttca tttttctgtg taggctgcac aagagccttg attgaagata tattctttct 1801 gaacagtatt taaggtttcc aataaaatgt acacccctca gaaaaaaaaa aaaaaaa Protein seguence: NCBI Reference Seguence: NP_001677.2 LOCUS NP_001677 ION NP_001677 l mlgfvgrvaa lrrl tpsaslppaq lllraaptav hpvrdyaaqt spspkagaat 6" grivavigav vdvqfdeglp pilnalevqg retrlvleva thgestvrt iamdgteg;V 12" rquvldsga pikipvgpet lgrimnvige pidergpikt kqfapihaea pefmemsveq 18" eilvtgikvv dllapyakgg kiglfggagv gktvlimeli hggy svfagvgert 24" yhem iesgvinlkd atskvalvyg qmneppgara rvaltgltva eyfrdqequ " nifr ftqagsevsa llgripsavg yqptlatdmg tmqeritttk kgsitquai 36" yvpaddltdp apattfahld raia elgiypavdp ldstsrimdp nivgsehydv 42" arqukilqd ykslqdiiai seed kltvsrarki qrflsqpfqv aevftghmgk 48" lvplketikg quilageyd hlpeqafymv gpieeavaka dklaeehss ATP5D Official Sym bol: ATP5D Official Name: ATP se, H+ transporting, mitochondrial F1 complex, delta subunn Gene ID: 513 Organism: Homo s Other Aliases: None currently listed 565565 Other Designations: ATP synthase t delta, mitochondrial; F-ATPase delta subunit; mitochondrial ATP synthase complex delta-subunit or; mitochondrial ATP synthase, delta subunit Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001687.4 LOCUS NM_001687 ACCESSION NM_001687 1 cagacgtccc tgcgcgtcgt cctcctcgcc ctccaggccg cccgcgccgc gccggagtcc 6" cgcc agctacccgc ttcctgccgc ccgccgctgc catgctgccc gccgcgctgc 12" tccgccgccc gggacttggc gtcc gccacgcccg tgcctatgcc gaggccgccg 18" ccgccccggc tgccgcctct ggccccaacc agatgtcctt caccttcgcc acgc 24" aggtgttctt caacggtgcc aacgtccggc aggtggacgt gcccacgctg accggagcct " tcggcatcct ggcggcccac acgc tgcaggtcct gcggccgggg ctggtcgtgg 36" tgcatgcaga ggacggcacc acctccaaat actttgtgag cagcggttcc atcgcagtga 42" acgccgactc ttcggtgcag ttgttggccg aagaggccgt gacgctggac atgttggacc 48" cagc caaggcaaac ttggagaagg cccaggcgga gctggtgggg acagctgacg 54" aggccacgcg ggcagagatc cagatccgaa tcgaggccaa cgaggccctg gtgaaggccc 60" tggagtaggc ggtgcgtacc cggtgtcccg aggcccggcc tggg cagggatgcc 66" aggtgggccc agccagctcc tggggtcccg gccacctggg gaagccgcgc ctgccaagga 72" ggccaccaga gggcagtgca ggcttctgcc tgggccccag gccctgcctg tgttgaaagc 78" tctggggact gggccaggga agctcctcct cagctttgag ctgtggctgc cacccatggg 84" gctctccttc cgcctctcaa ccca gcctgacggg ccgcttacca tcccctctgc 90" cctgcagagc cagccgccaa ggttgacctc agcttcggag ccacctctgg atgaactgcc 96" cccc gccccattaa ggaa gcctgaaaaa aaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_001678.1 LOCUS NP_001678 ACCESSION NP_001678 1 mlpaallrrp glgrlvrhar ayaeaaaapa aasgpnqmsf tfasptqvff nganvrquv 61 ptltgafgil hvl rpglvvvhae yfvs sgsiavnads 121 tldmldlgaa kanlekaqae 1vgtadeatr aeiqiriean ealvkale 566566 OfibblSwnbd:CAPN1 Official Name: calpain 1, (mu/l) large t M1 823 sm: Homo sapiens Other Aliases: PlG30, CANP, CANP1, CANPL1, muCANP, muCL Other ations: CANP 1; calcium-activated neutral proteinase 1;ca|pain mu-type; calpain, large polypeptide L1; n-1 catalytic subunit; calpain-1 large subunit; cell proliferation-inducing gene 30 protein; cell proliferation- inducing protein 30; micromolar-calpain Nucleotide seguence (variant 1): NCBI nce Seguence: NM_001198868.1 LOCUS NM_001198868 ACCESSION 198868 l agggacttac ccaaggtcac gcagcgagcc cggtccccct gcgttccccg gggagcgctg 6" agccgggacg cggcggtggg gtggggaagg ggagtggcgc ggccctgcgg ggtgaggctg 12" ccgtttgctg cggc aggggtctgc tcgctgccag ccct cctcagagca 18" gctgccgcag cccgaggatg tcggaggaga tcatcacgcc ggtgtactgc gtgt 24" cagcccaagt gcagaagcag cgggccaggg agctgggcct gggccgccat gagaatgcca " tcaagtacct ggat tatgagcagc tgcgggtgcg atgcctgcag agtgggaccc 36" tcttccgtga tgaggccttc cccccggtac cccagagcct gggttacaag gacctgggtc 42" ccaattcctc caagacctat ggcatcaagt ggaagcgtcc cacggaactg ctgtcaaacc 48" cccagttcat tgtggatgga gctacccgca cagacatctg ccagggagca ctgggggact 54" gctggctctt catc gcctccctca ctctcaacga caccctcctg caccgagtgg 60" ttccgcacgg ccagagcttc cagaatggct atgccggcat cttccatttc cagctgtggc 66" aatttgggga gtgggtggac gtggtcgtgg atgacctgct gcccatcaag gacgggaagc 72" tagtgttcgt gcactctgcc gaaggcaacg agttctggag cgccctgctt gagaaggcct 78" atgccaaggt aaatggcagc tacgaggccc tgtcaggggg cagcacctca gagggctttg 567567 84" aggacttcac aggcggggtt accgagtggt tgcg caaggctccc agtgacctct 90" accagatcat cctcaaggcg ctggagcggg gctccctgct gggctgctcc atagacatct 96" ttct agacatggag gccatcactt agtt ggtgaagggc catgcctact "02" ccgg ggccaagcag gtgaactacc gaggccaggt ggtgagcctg atccggatgc "08" ggaacccctg ggtg gagtggacgg gagcctggag cgacagctcc tcagagtgga "l4" acaacgtgga cccatatgaa cgggaccagc tccgggtcaa gatggaggac ggggagttct "20" ggatgtcatt ccgagacttc atgcgggagt tcacccgcct ggagatctgc aacctcacac "26" ccgacgccct caagagccgg accatccgca aatggaacac ctac gaaggcacct "32" ggcggcgggg gagcaccgcg gggggctgcc gaaactaccc agccaccttc aacc "38" ctcagttcaa gatccggctg gatgagacgg atgacccgga cgactacggg gagt "44" caggctgcag cttcgtgctc gcccttatgc agaagcaccg tcgccgcgag cgccgcttcg "50" acat ggagactatt ggcttcgcgg aggt ccctccggag ctggtgggcc "56" agccggccgt acacttgaag cgtgacttct tcctggccaa tcgg gcgcgctcag "62" agcagttcat gcga gaggtcagca cccgcttccg cctgccaccc ggggagtatg "68" tggtggtgcc ctccaccttc gagcccaaca aggagggcga cttcgtgctg cgcttcttct "74" cagagaagag tgctgggact gtggagctgg atgaccagat ccaggccaat ctccccgatg "80" agcaagtgct ctcagaagag gagattgacg agaacttcaa ggccctcttc aggcagctgg "86" caggggagga catggagatc agcgtgaagg agttgcggac aatcctcaat aggatcatca "92" gcaaacacaa agacctgcgg accaagggct tcagcctaga gtcgtgccgc agcatggtga "98" acctcatgga tcgtgatggc aagc tggt ggagttcaac atcctgtgga 204" accgcatccg gaattacctg tccatcttcc ggaagtttga cctggacaag tcgggcagca 210" tgagtgccta Cgagatgcgg atggccattg agtcggcagg cttcaagctc aacaagaagc 216" tgtacgagct catcatcacc cgctactcgg agcccgacct ggcggtcgac tttgacaatt 222" tcgtttgctg cctggtgcgg ctagagacca tgttccgatt tttcaaaact ctggacacag 228" atgg agttgtgacc ttgt ggtt gcagctgacc atgtttgcat 234" gaggcaggga ctcggtcccc cttgccgtgc tcccctccct cctcgtctgc caagcctcgc 240" ctcctaccac accacaccag gccaccccag ctgcaagtgc cttccttgga gcagagaggc 246" agcctcgtcc tcctgtcccc tctcctccca gccaccatcg ttcatctgct ccgggcagaa 252" ctgtgtggcc cctgcctgtg atgg gctcgggatg gactccctgg gccccaccca 258" ttgccaagcc aggaaggcag ctttcgcttg ttcctgcctc gggacagccc cgggtttccc 568568 264" cagcatcctg atgtgtcccc tctccccact tcagaggcca cccactcagc accaccggcc 270" tgcc tgcagactat aaactataac ctcg acacagtctg cagtccaggc 276" gtgtggagcc cggc tcggggaggc cccggggctg ggaacgcctg tgccttcctg 282" agcc aacgccccct ctgtccttcc ctggccctgc tgccgaccag gagctgccca 288" gcctgtgggc ggtcggcctt ccctccttcg ctcctttttt atattagtga ttttaaaggg 294" gactcttcag ggacttgtgt actggttatg ggggtgccag aggcactagg cttggggtgg 300" ggaggtcccg tgttccatat agaggaaccc caaataataa aaggccccac ctgt 306" aaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3121 aaaaa Protein seguence (variant 1): NCBI Reference Seguence: 185797.1 LOCUS NP_001185797 ACCESSION NP_001185797 l mseeiitpvy ctgvsaqvqk qrarelglgr henaikylgq vrcl qsgtlfrdea 6" fppquslgy kdlgpnsskt ygikwkrpte llsnquivd gatrtdicqg algdcwllaa 12" iasltlndtl lhrvvphgqs fqngyagifh fqlwquewv dvvvddllpi kdgklvahs 18" aegnefwsal kvng syealsggst segfedftgg vtewyelrka psdlyqiiLk 24" alergsllgc sidissvldm eaitfkklvk ghaysvtgak qvnyrgqvvs lirmrnpwge " vewtgawsds ssewnnvdpy erdqlrvkme sfrd fmreftrlei cnltpdal<s 36" rtirkwnttl yegtwrrgst aggcrnypat fwvnqukir ldetddpddy gdresgcsfv 42" lalmqkhrrr dmet igfavyevpp elvgqpavhl krdfflanas rarseqfinl 48" revstrfrlp vpst fepnkegdfv lrffseksag tvelddqiqa nlpdeqvlse 54" eeidenfkal frqlagedme isvkelrtil nriiskhkdl rtkgfslesc rsmvnlmdrd 60" gngklglvef nilwnrirny lsifrkfdld ksgsmsayem rmaiesagfk lnkklyelii 66" trysepdlav dfdnfvcclv rletmfrffk tldtdldgvv tfdlfkwlql tmfa CAPZA2 OfibblSwnbd:CAPZA2 Official Name: capping protein (actin nt) muscle Z-Iine, alpha 2 569569 Gene ID: 830 Organism: Homo sapiens Other Aliases: CAPPA2, CAPZ Other Designations: F-actin capping protein alpha-2 subunit; n-capping protein subunit alpha-2; capZ 2 Nucleotide seguence: NCBI Reference Seguence: NM_006136.2 LOCUS NM_006136 ACCESSION NM_006136 l cccctccctt agcgggggcg cgcggcgctg aggaccgcac ggaaacgggg aagtcaggtg 6" gccgctgccg ccgccgccgc cgcggtttgt aagg aagatggcgg atctggagga 12" gcagttgtct gatgaagaga aggtgcgtat agcagcaaaa ttcatcattc atgcccctcc 18" tggagaattt gttt tcaatgatgt tcggttactg cttaataatg acaatcttct 24" cagggaagga gcagcccatg catttgcaca gtataacttg gaccagttta ctccagtaaa " aattgaaggt tatgaagatc aggtattgat acat ggcgacttgg gaaatggaaa 36" gtttttggat ccaaagaaca gaatctgttt taaatttgat cacttaagga aggaggcaac 42" tgatccaaga ccctgtgaag tagaaaatgc agttgaatca tggagaactt cagtagaaac 48" tgctctgaga gcttacgtaa aagaacatta cccgaatgga gtctgcactg tgtatggcaa 54" aaaaatagat ggacagcaaa ccattattgc atgcatagaa cagt caaa 60" aaatttttgg aatggtcgtt ggaggtcaga atggaagttt acaatcactc ccac 66" tcaagtggtt ggcatcttga aaattcaggt tcattattat gaagatggta agct 72" agtgagtcat aaagatatac aagattccct gtct aatgaagtgc aaacagcaaa 78" tata aagattgtag aagctgcaga aaatgaatac cagactgcca tcagtgagaa 84" ttatcagaca atgtcggaca ctactttcaa agccttacgt cgacagttgc cagttacacg 90" cactaagatt gattggaaca agatccttag ctacaagatt ggcaaagaga tgcagaatgc 96" ataagatgaa cattgcatga ccggatcatt ttagtgtctt tgcgttaaaa aatcattgca 102" aaagtattct gaactgtcaa gctgcccagt cagatgggct gttgccattt aaaatcactg 108" taattaatta gtttgattag agcacaaagc ttagctaatc aaccattatt tttcattttg 114" tttgttctaa gaggattgaa aatcagttta gtttaaatgt gtta ggcctttctt 120" aatg aagagatgat tcttctagtt tatggttaaa agtttttgaa gtgtctcaaa 570570 "26" aatattttac taactgtaac cctaaaattg atgtcttttg gtttatgaaa attt "32" ttgatatttc cccagttctt tttaatgggg atgg tagt ttaaggtggt "38" tgatggattt agccatatat gctgctaaag tcta ccttttcttc tgtt "44" ccatttatgt gaga ttagagggaa agcattttct atatcaattg tgtttaaacc "50" tttcaagaag gttatttagc tagcttagtg ttgaactaaa ttttttttaa acaaggcaag "56" gtctaatgct gttttgagat tctgaaatta atgaaaatac caga aatgcattta "62" atgctttttt tcttgtgaca caaa tcagcttgaa ttccatatgt ccctgagtta "68" tttttatcat aaagccacaa atgtattata acaaggcaaa ttgtaatata tataatcctg "74" aactcatgac tcgg tttatttttt ttttcttgga ttgaaaagta ctgaaattca "80" atgtgacatt aaaatgcaaa ttttcctatt tatttgagta gaaaatcact taccagtgag "86" catatatatt ttaaaatact ttctttggat attgtaattc ttaactggtt taga "92" aaagctggga ttacatatgg tgtgcggtta cagtctaaat tttttcatcc tcctatgcat "98" cataagcatg tt:gtaatat aaat agttctactg atgctacagg aatttcaagc 204" ctgtggtgaa tg:tagtatt taccataggg agtgaagtgg agttatggtt tcattcaata 210" gagtattgct ga:tatactt gagtggaatc ctttcctcac gtactcccac agacgtctgg 216" gcctggaaat tt:tttttta ttttatttta ttgttttttt ttttagaaaa acaccacttt 222" tattatgtac aa:aaaatat ttcattagct tgaattgtat agatttttaa aaattcaatg 228" aaagcatgtt gt:taatttc tttttaaaat cactgttggg ctttgaaagc attgagaata 2341 taatatgaaa t:atgaaaaa aaaaaaaaaa aaa Protein seguence: NCBI Reference ce: NP_006127.1 LOCUS NP_006127 ACCESSION NP_006127 1 madleeqlsd eekvriaakf iihappgefn evfndvrlll nndnllrega ahafaqynld 61 qftpvkiegy edqvlitehg dlgngkfldp knricfkfdh lrkeatdprp cevenavesw 121 rtsvetalra pngv ctvygkkidg qqtiiacies hqfqaknfwn grwrsewkft 181 itpsttqvvg ilkiqvhyye dgnvqlvshk diqultvsn evqtakefik iveaaeneyq 241 taisenyqtm sdttfkalrr qlpvtrtkid wnkilsykig kemqna 571571 Official Symbol: CCT7 al Name: chaperonin containing TCP1, subunit 7 (eta) Gene ID: 10574 Organism: Homo sapiens Other Aliases: CCTETA, CCTH, NIP7-1, TCP1 ETA Other Designations: CCT-eta; HIV-1 Nef interacting protein; HIV-1 Nef- interacting protein; T-complex protein 1 subunit eta; TCP-f-eta; chaperonin containing t-complex polypeptide 1, eta t Nucleotide seguence (variant 1): NCBI nce Seguence: NM_006429.3 LOCUS NM_006429 ACCESSION 429 l atagagtagc ggaagtggtc cgttctcttc ctctcccggc ccaagcttct gggtatttct 6" attgcgcgag gcattgtggg ttgctgggcg gcccggtctc ggagaagagg ggagagtggc l2" gggccgctga ataagcttcc aaaatgatgc cagt tatcctattg aaagagggga 18" ctgatagctc ccaaggcatc ccccagcttg tgagtaacat cagtgcctgc caggtgattg 24" ctgaggctgt aagaactacc ctgggtcccc gtggcatgga caagcttatt gtagatggca " gaggcaaagc aacaatttct aatgatgggg ccacaattct gaaacttctt gtcc 36" atcctgcagc aaagactttg gtagacattg ccaaatccca agatgctgag gtgggtgatg 42" gcaccacctc agtgaccttg ctggctgcag agtttctgaa gcaggtgaaa ccctatgtgg 48" aggaaggttt acacccccag atcatcattc gagctttccg cacagccacc cagctggcag 54" ttaacaagat caaagagatt gctgtgaccg tgaagaaggc agataaagtg gagcagagga 60" tgga tgcc atgaccgctc tgagctccaa ctcc cagcagaaag 66" ctttctttgc ggtg gtggatgcag tgatgatgct cgatgatttg ctgcagctta 72" aaatgattgg aatcaagaag gtacagggtg gagccctcga ggattctcag ctggtagctg 78" gtgttgcatt caagaagact ttctcttacg ttga aatgcaaccc aaaaagtacc 84" acaatcccaa gattgccctt ttgaatgtcg agctcgagtt tgag aaagacaatg 90" ctgagataag agtccacaca gttgaggatt atcaggcaat tgttgatgct gagtggaaca 96" ttctctatga caagttagag aagatccatc attctggagc tgtc ttgtccaaac 102" tccccattgg ggatgtggcc acccagtact ttgctgacag gttc tgtgctggcc 572572 "08" gagtacctga ggaggatctg aagaggacaa tgatggcctg tggaggctca acca "14" gtgtgaatgc tctgtcagca gatgtgctgg gtcgatgcca tgaa gagacccaga "20" ttggaggcga gaggtacaat ttttttactg ccaa gaca tgcaccttca "26" ttctccgtgg cggcgccgag cagtttatgg aggagacaga gcggtccctg catgatgcca "32" tcatgatcgt caggagggcc atcaagaatg attcagtggt ggctggtggc ggggccattg "38" agatggaact ctccaagtac ctgcgggatt actcaaggac tattccagga aaacagcagc "44" tgttgattgg ggcatatgcc aaggccttgg agattatccc acgccagctg tgtgacaatg "50" ttga tgccacaaac attctcaaca agctgcgggc tcggcatgcc caggggggta "56" atgg agtagacatc aacaacgagg acattgctga caactttgaa gctttcgtgt "62" gggagccagc tatggtgcgg gcgc tgacagcagc ggct gcgtgcctga "68" tcgtgtctgt agatgaaacc atcaagaacc cccgctcgac tgtggatgct cccacagcag "74" caggccgggg ccgtggtcgt ggccgccccc actgagaggc accccaccca tcacatggct "80" ggctggctgc tgggtgcact taccctcctt ggcttggtta cttcatttta caaggaaggg "86" gtagtaattg gcccactctc ttcttactgg aggctattta aataaaatgt aagacttcag "92" ataactttgt aaaa aaaaaaaa Protein sequence (variant 1): NCBI Reference Seguence: NP_006420.1 LOCUS 420 ACCESSION NP_006420 l mmptpvillk egtdssqgip qlvsnisacq rttl gprgmdkliv dgrgkatisn 6" dgatilklld vvhpaaktlv diaksqdaev gdgttsvtll aaeflkqvkp yveeglhpqi 12" iirafrtatq lavnkikeia vtvkkadkve qullekcam talssklisq qkaffakmvv 18" davmmlddll qlkmigikkv dsql vagvafkktf syagfemqpk kyhnpkiaLl 24" nvelelkaek vhtv edyqaivdae wnilydklek ihhsgakvvl sklpigdvat " qyfadrdmfc agrvpeedlk rtmmacggsi qtsvnalsad vlgrcqvfee tqiggerynf 36" ftgcpkaktc tfilrggaeq fmeeterslh daimivrrai kndsvvaggg aiemelskyl 42" ipgk qqlligayak aleiiprqlc dnagfdatni lnklrarhaq ggtwygvdin 48" nediadnfea fvwepamvri naltaaseaa clivsvdeti knprstvdap taagrgrgrg 54" rph 573573 OfibblSwnbd:CTSB Official Name: cathepsin B m: 1508 Organism: Homo sapiens Other Aliases: APPS, CPSB Other Designations: APP ase; amyloid precursor protein secretase; cathepsin B1; cysteine protease Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001908.3 LOCUS NM_001908 ACCESSION NM_001908 1 gggc nggagggta cttagggccg gggctggccc aggctacggc ggctgcaggg 6" ctccggcaac cgctccggca acgccaaccg ctccgctgcg cgcaggctgg gctgcaggct 12" ctcggctgca ggtg gatctaggat ccggcttcca acatgtggca gctctgggcc 18" tccctctgct gcctgctggt gttggccaat gcccggagca ggccctcttt ccatcccctg 24" tcggatgagc tggtcaacta tgtcaacaaa cggaatacca cgtggcaggc cgggcacaac " ttctacaacg tgag gaag aggctatgtg gtaccttcct gggtgggccc 36" aagccacccc agagagttat gtttaccgag gacctgaagc tgcctgcaag cttcgatgca 42" cgggaacaat ggccacagtg tcccaccatc aaagagatca gagaccaggg ctcctgtggc 48" tcctgctggg ccttcggggc agcc atctctgacc ggatctgcat ccacaccaat 54" gcgcacgtca gcgtggaggt gtcggcggag gacctgctca gtgg gtgt 60" ggggacggct gtaatggtgg tgct tgga acttctggac aagaaaaggc 66" ctggtttctg gtggcctcta tgaatcccat gtagggtgca gaccgtactc catccctccc 72" tgtgagcacc acgtcaacgg ctcccggccc ccatgcacgg gggagggaga tacccccaag 78" aaga tctgtgagcc tggctacagc ccgacctaca aacaggacaa gcactacgga 84" tacaattcct acagcgtctc caatagcgag aaggacatca agat ctacaaaaac 90" gtgg agggagcttt ctctgtgtat tcggacttcc tgctctacaa gtcaggagtg 96" taccaacacg tcaccggaga gatgatgggt ggccatgcca tccgcatcct gggctgggga 102" gtggagaatg gcacacccta ctggctggtt gccaactcct ggaacactga ctggggtgac 574574 "08" aatggcttct ttaaaatact cagaggacag gatcactgtg aatc agaagtggtg "l4" gctggaattc cacgcaccga tcagtactgg gaaaagatct aatctgccgt gggcctgtcg "20" tgccagtcct gggggcgaga taga aatgcatttt attctttaag ttcacgtaag "26" atacaagttt cagacagggt gact ggattggcca aacatcagac ctgtcttcca "32" aggagaccaa gtcctggcta catcccagcc tgtggttaca gtgcagacag tgag "38" ccaccgctgc cagcacagag cgtccttccc cctgtagact agtgccgtag ggagtacctg "44" ctgccccagc tggc cccctccgtg atccatccat ctccagggag caagacagag "50" acgcaggaat ggaaagcgga gttcctaaca ggatgaaagt tcccccatca gttcccccag "56" tacctccaag gctt tccacatttg tcacagaaat agag acggtgttgg "62" gagccctttg gagaacgcca gtctcccagg ccccctgcat ctatcgagtt tgcaatgtca "68" caacctctct gatcttgtgc tcagcatgat tctttaatag aagttttatt ttttcgtgca "74" ctctgctaat catgtgggtg agccagtgga acagcgggag acctgtgcta gttttacaga "80" ttgcctcctt cggc tcaaaaggaa tggt caggagttgt ttctgaccca "86" ctgatctcta ctaccacaag gaaaatagtt taggagaaac cagcttttac tgtttttgaa "92" aaattacagc ttcaccctgt caagttaaca aggaatgcct gtgccaataa aagttttctc "98" caacttgaag tctactctga tgggatctca gatcctttgt cactgcctat agacttgtag 204" ctgctgtctc tctttgtccc tgcagagaat cacgtcctgg aactgcatgt gact 210" cttgggactt catcttaact tctcgctgcc atgt tttcaaccat cctc 216" ccccaattag ttccctgtca tcctcgtcaa ccttctctgt aagtgcctgg taagcttgcc 222" cttgcttaag aactcaaaac atagctgtgc tctatttttt tgttgttgtt gtgactgaca 228" gagtgagatt ccgtctccca ggctggagtg cagtggcgcc ttctcagctc actgcaacct 234" gcagcctcct agattcaagc gattctcctg cttcagcctt ccgagtagct gggatgacag 240" gcactcacca atatgcctgg gtaatttttg tatttttaag tacatacagg atttcaccat 246" gttggccagg ctagtttcaa ggcc tcaggtggtc cctc agcctcccaa 252" agtgttggga ttacaggcgt gagccactgg gccctgcctg tattttttat cagccacaaa 258" aaca agctgaggat tcagctcata aaacaggctt ttgg tgatctcaca 264" taaccaagat gctaccccgt ggggaaccac ctgg atgccctcca gttt 270" gggctggagt cagggcctgt atacagtatt ttgaatttgt atgccactgg tttgcattgc 276" tggtcaggaa ctctagtgct ttgcatagcc ctggtttaga aacatgttat agcagttctt 282" ggtatagagc aaactagaag aaccagcaat cattccactg tcctgccaag gtacacctca 575575 288" gtactcccct tcccaactga agtggtatga ggctagctct ttccaaaagc attcaagttt 294" ggcttctgat gtgactcaga atttaggaac cagatgctag atcaaataag ctctgaaaat 300" ctgaggaaca ttgtaggaaa ggtttgttaa gcatctctta agtgccatga tgagcataac 306" agccggccgt cgtggctcac gcctgtaatc ccagcacttt gggaggccaa ggtgggagga 312" tgacaaggtc aggagttcaa gaccagcctg gccaacatgc tgaaacctca ctaa 318" aaatacaaaa attagctggg catggtggca catgcctgta atcccagcta cttgggaggc 324" tgaggcagga gaatcgcttg ggag gcggaggttg cagtgagcca agacagtgcc 330" agtgcactcc agcctcggtg caag gctccgtctc aataattaaa aaaaaaaaaa 336" aaaaaaaaaa ggccgggcgc agtggctcaa gcctgtaatc ccagcacttt gggaggctga 342" ggcgggcaga tcacctgagg tcaggagttt tgagatcagc cttggcaaca cggtgaaacc 348" ccatctctac taaaaataca aaattagcca agcatgctgg cacatgcctg taatcccagc 354" tactcgggag gctgaggtac gagaatcgct tgaacctggg aggcagagga tgcagtgagc 360" cgagatcacg ccattgcact ccagcctggg ggacaagagt gaatctgtgt aaaa 366" aaaaaaagaa aaagaaagat caaa ggttaccata agccacaaat tcataaccac 372" ttcc aagt agaatatatt cataacctca ataaagttct ccctgctccc 3781 aaa Protein sequence (variant 1): NCBI Reference Seguence: NP_001899.1 LOCUS 899 ACCESSION NP_001899 1 qulwaslcc llvlanarsr psfhplsdel vnyvnkrntt wqaghnfynv dmsylkrlcg 61 tflggpkppq rvmftedlkl pasfdareqw pqcptikeir scwa fgaveaisdr 121 icihtnahvs vevsaedllt ccgsmcgdgc nggypaeawn fwtrkglvsg glyeshvgcr 181 pysippcehh vngsrppctg cski cengSptyk qdkhygynsy svsnsekdim 241 gpve gafsvysdfl 1yksgquhv ghai rilgwgveng tpywlvansw 301 ntdwgdngff kilrquhcg iesevvagip rtdqyweki FKBP2 OfibblSwnbd:FKBP2 576576 Official Name: FK506 binding protein 2, 13kDa g:2286 Organism: Homo sapiens Other Aliases: FKBP-13, PPIase OflmrDesmnamnm:13kDaFK506bMdMgpnflem;13kDaFKBP;FK506 Unwngpnnmn2(13HM;FKBP2;PPMseFKBPzflmnumopMHnFKBP13 peptidyI-prolyl cis-trans isomerase FKBP2; proline isomerase; rapamycinbinding protein; se Nucleotide seguence nt 1): NCBI Reference Seguence: NM_004470.3 LOCUS NM_004470 ACCESSION NM_004470 1 gccggaagtg acgcagggca gcggcgtcgc gggggcgggg ctcgggaaag gcca 6" gcgggcgtgt gggt ttcgcacggt ccaataaggg agggcggcgt ggcccggcct 12" ggtagcgacg aggacgcgcc tgcgcagagg cggcagcacc accggggttg actccggggg 18" cgcggcgagg agagacatga ggctgagctg gttccgggtc ctgacagtac tgtccatctg 24" cctgagcgcc gtggccacgg ccacgggggc cgagggcaaa aggaagctgc agatcggggt " caagaagcgg gtggaccact gtcccatcaa atcgcgcaaa ggggatgtcc tgcacatgca 36" ctacacgggg aagctggaag atgggacaga gtttgacagc agcctgcccc agaaccagcc 42" ctttgtcttc tcccttggca caggccaggt catcaagggc tgggaccagg ggctgctggg 48" gatgtgtgag ggggaaaagc tggt gatcccatcc gagctagggt atggagagcg 54" gggagctccc attc gtgc aaccctggtg ttcgaggtgg agctgctcaa 60" aatagagcga cgaactgagc tgtaaccaga ctggggaggg gcagggggag aggcccccat 66" cagggaccag actgttccaa aaca aaaaacaaaa acaaacaaaa ttaa 72" aagcccaagg aaaaaaaaaa aaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_004461.2 LOCUS NP_004461 ACCESSION NP_004461 l rvlt vlsiclsava tatgaegkrk lqigvkkrvd hcpiksrkgd vlhmhytgkl 577577 61 edgtefdssl pqnqpfvfsl gtgqvikgwd qgllgmcege krklvipsel appk 121 ipggatlvfe errt e1 FLNC Official Sym bol: FLNC Official Name: filamin G, gamma Gene ID: 2318 Organism: Homo sapiens Other Aliases: ABP-280, ABP280A, ABPA, ABPL, FLN2, MFM5, MPD4 Other Designations: ABPlike protein; ABP-L, gamma filamin; FLN-C; actin binding protein 280; actin-binding-like protein; filamin 2; filamin-2; filamin-C Nucleotide seguence (variant 1): NCBI Reference ce: NM_001458.4 LOCUS NM_001458 ACCESSION NM_001458 1 ccctggaggg agagagagcc agagagcggc cgagcgccta ggaggcccgc cgagcctcgc 6" cgagccccgc cagccccggc gcgagagaag ttggagagga gagcagcgca cgag 12" tcccgtggtc gcgccccaac agcgcccgac agcccccgat agcccaaacc gcggccctag 18" ccccggccgc acccccagcc agca tgatgaacaa cagcggctac tcagacgccg 24" gcctcggcct gggcgatgag acagacgaga tgccgtccac ggagaaggac gagg " acgcgccgtg gatc cagcagaaca cattcacgcg ctggtgcaat gagcacctca 36" agtgcgtggg caagcgcctg accgacctgc agcgcgacct cagcgacggg ctca 42" tcgcgctgct cgaggtgctc agccagaagc gcatgtaccg caagttccat ccgcgcccca 48" acttccgcca aatgaagctg gagaacgtgt ccgtggccct cctc gagcgcgagc 54" acatcaagct cgtgtccata gacagcaagg ccatcgtgga tgggaacctg atcc 60" tgggcctgat gctg atcctgcact tctc catgcccatg tgggaggatg 66.. aagatgatga ggatgcccgc aaacagacgc ccaagcagcg gctgcttggc tggatccaga 72" acaaggtgcc ccagctgccc atcaccaact tcaaccgtga ctggcaggac ggcaaagctc 78" tgggcgccct ggtggacaac tgcgcccccg gtctctgccc cgactgggag gcctgggacc 578578 84" ccaaccagcc cgtggagaac gcccgggagg ccatgcagca ggccgacgac tggcttgggg 90" tgccccaggt cattgcccct gaggagattg tggaccccaa cgtggatgag cattctgtta 96" acct gtcccagttc cccaaggcca agctcaaacc tggtgcccct gttcgatcca "02" agcagctgaa ccccaagaaa gccatcgcct atgggcctgg catcgagcca cagggcaaca "08" ccgtgctgca gcctgcccac ttcaccgtgc agacggtgga Cgcgggcgtg ggcgaggtgc "14" tggtctacat cgaggaccct gaaggccaca aggc taaggtggtt cccaacaatg "20" acaaggatcg cacctatgct gtctcctatg tgcccaaggt cgctgggtta cacaaggtga "26" ccgtgctctt tgctggccag aacattgaac gcagtccctt tgaggtgaac gtgggcatgg "32" ccctgggaga tgccaacaag gtgtcagccc gtggccctgg acct gtgggcaatg "38" tggccaacaa acccacctac tttgacatct cggg ggccggcact gttg "44" ctgtggtgat cgtggaccca cagggccggc gggacacagt ggaggtggcc ctggaggaca "50" agggtgacag ccgc taca gacctgccat ggaggggcca cataccgtgc "56" atgtggcctt tgcgggtgcc cccatcaccc gcagtccctt ccctgtccat gtgtcggaag "62" cctgtaaccc ctgc tctg ggcgaggcct gcagcccaag ggtgttcgcg "68" tgaaagaggt ggctgacttc aaggtgttta gtgc cggcagcggg gagctcaagg "74" tcacggtcaa ggggccaaag gagg agccagtgaa ggtgcgggag gctggggatg "80" gtgtgttcga gtac tacccggtgg tgcctgggaa ggtg accatcacgt "86" ggggcggcta cgccatccct cgcagcccct ttgaggtaca ggtgagccca gaggcaggag "92" tgcaaaaggt ccgggcctgg ggtcctggtt tggagactgg ccaggtgggc aagtcagccg "98" attttgtggt ggaagccatt ggcaccgagg tggggacact gggcttctcc atcgaggggc 204" cctcacaagc caagatcgaa tgtgacgaca agggggatgg ctcctgcgat gtgcggtact 210" ggcccacgga gcctggggag tacgctgtgc acgtcatctg tgacgatgag gacatccgag 216" actcaccctt cattgcccac atcctgcccg ccccacctga ctgcttccca gataaggtga 222" aggcctttgg gcctggcctg gagcctaccg gctgcatcgt ggacaagccc gctgagttca 228" atgc tcgtgcagct ggcaagggag agct ccag gacgccgacg 234" gctgtcccat cgacatcaag gtgatcccca ng caccttccgc tgctcctacg 240" tgcccaccaa gcccattaag cacaccatca tcatctcctg gggaggcgta aacgtgccca 246" agagcccctt ccgggtgaac gtgggcgagg gcagccaccc ggta aaggtgtacg 252" gccccggagt ggagaagaca ggcctcaagg ccaatgagcc cacctacttc acggtggact 258" gcagcgaggc ggggcaaggc gacgtgagca tcggcatcaa gtgcgcccca ggcgtggtgg 579579 264" gccctgcaga ggctgacatt gacttcgaca tcatcaagaa tgacaacgac accttcaccg 270" tcaagtacac aggg gcgggccgct acaccatcat ggtgctgttt gccaaccagg 276" agatccccgc cagccccttc cacatcaagg tggacccatc ccacgatgcc agcaaagtca 282" aggccgaggg ccctgggctg aatcgcacag aagt cgggaagccc acccacttca 288" tgac caagggagcc gcca agctggatgt gcagtttgca gggacagcca 294" agggcgaggt tgtgcgggac tttgagatca tagacaacca tgactactcc tacactgtca 300" agtacaccgc tgtccagcag ggcaacatgg cagtgacagt gacttatggc ggggaccctg 306" tccccaagag tgtg gtgaatgtgg cacccccgct ggacctcagc aaaatcaaag 312" ttcagggcct taatagcaag gtggctgtgg gacaggaaca agcattctct gtgaacacac 318" gaggggctgg ngtcagggc gatg tgcggatgac ttcgccctct cgccggccca 324" tcccctgcaa gctggagcca ggcggtggag cggaagccca ggctgtgcgc tacatgcccc 330" cggaggaggg gccctacaag gtggatatca cctacgatgg tcacccggtg cctggcagcc 336" cgtttgctgt ggagggtgtc ctgccccctg atccctccaa ggtctgtgct tatggcccgg 342" gtctcaaggg tggactggta ggcacccccg cgccattctc cacc aagggggctg 348" gcacaggtgg cctggggctg accgtagagg gcccctgcga ggccaagatc gagtgccagg 354" gtga tggctcatgt gctgtcagct acctgcccac tggc gagtacacca 360" tcaacatcct gtttgctgag gcccacatcc ctggctcgcc cttcaaagcc cggc 366" ctgtgtttga cccgagcaag gtgcgggcca cggg cctggagcgc ggcaaggtcg 372" gtgaggcagc caccttcact gtggactgct cagaggcagg cgaggcggag ctgaccattg 378" agatcctgtc ggatgccggg gccg aggtgctgat ccacaacaac gcggatggca 384" cctaccacat cacctacagc cctgccttcc ctggcaccta tacc atcaagtatg 390" gcgggcatcc cgtgcccaaa ttccccaccc gtgtccatgt gcagcctgcg acca 396" gtggcgtcaa ggtctcaggg cctggtgttg agccacacgg tgtcctgcgg gaggtgacca 102" ctgagttcac tgtggatgca agatccctaa cagccacagg cggcaaccac gtgacggctc 108" gtgtgctcaa cccctcgggg gccaagacag acacctatgt gacagacaat ggggacggca 114" gagt gcagtacacc gcctacgagg agggcgtgca tctggtggag gtcctgtatg Z20" atgaggtcgc tgtgcccaag agccccttcc gagtgggcgt gaccgagggc tgtgatccca Z26" cccgcgtccg agccttcggg ccaggcctgg agggtggctt caag gccaaccgat Z32" tcactgtgga gaccagggga gcgggcaccg ggggccttgg cctagccatc ccct Z38" cggaagccaa gatgtcctgc aaggacaaca aggatggtag ctgcaccgtg gagtacatcc 580580 Z44" ccttcactcc tggagactat gacgtcaaca tcaccttcgg ggggcggccc atcccaggga 150" gcccgttccg cgtgccagtg aaggatgtgg tggaccctgg gaaggtgaag tgctcagggc 156" cagggctggg ggctggtgtc agggcccggg ttcctcagac cttcacagtg gactgcagtc Z62" aagctggccg cctg caggtggctg tgctgggccc cacaggtgtg gccgagcctg Z68" tggaggtgcg ggacaatgga accc acactgtcca ctacacccca gccactgacg Z74" ggccctacac ggtagccgtc gctg accaggaggt gccacgcagc cccttcaaga 180" tcaaggtcct cccagctcat agca aggtgcgggc cagcggccca ggcctcaacg 186" cctctggcat ccctgccagc ctgcctgtgg agttcaccat cgacgcacgg gacgcgggcg Z92" aggggttgct cactgtccag gacc ccgagggtaa gcccaagaag gccaacatcc Z98" gggacaatgg ggatggcacg tacactgtgt cctacctgcc gagt ggccggtaca 504" ccatcaccat caagtatggc ggtgatgaga tcccctactc gcccttccgc atccatgctc 510" ctgg ggatgccagc aagtgcctcg tcacagtgtc cattggaggc ctgg 516" gtgcctgcct gggccctcga atccagattg ggcaggagac ggtgatcacg gtggatgcca 522" ccgg tgaggggaag tgca cggtgtccac gccggatggg ctcg 528" atgtggatgt ggttgagaac catgacggta cctttgacat ctactacaca gcgcccgagc 534" cgggcaagta cgtcatcacc atccgcttcg ggggtgagca catccccaac agccccttcc 540" acgtgctggc cccc ctgccgcacg aggaggagcc ctctgaagtg ccacagctgc 546" gccagcccta cgctcctccc ngCCngCg cccgccccac acactgggcc acagaggagc 552" cagtggtgcc tgtggagcca atggagtcca tgctgaggcc cttcaacctg gtcatcccct 558" tcgcggtgca gaaaggggag ctcacaggag aggtgcggat gccctcgggg aagacggcac 564" ggcccaacat caccgacaac aaggacggca cggt gaggtatgca cccactgaga 570" aaggcctgca ccagatgggg atcaagtatg acca catccctggg agccccttac 576" agttctatgt ggatgccatc aacagccgcc atgtcagtgc ctatgggcca ggcctgagcc 582" atggcatggt caacaagcca gccaccttca ctattgtcac caaagatgct ggagaagggg 588" gtctgtcact ggccgtggag ggcccatcca aggcagagat cacctgtaag gacaacaagg 594" atggcacctg caccgtgtcc tatctgccga ctgg agactacagc atcatcgtgc 600" gcttcgatga caagcacatc agcc ccttcacagc caagatcaca ggtgatgact 606" ccatgaggac ctcacagctg aatgtgggca cctccacgga actg aagatcaccg 612" agagtgatct gagccagctg accgccagca tccgtgcccc ctcgggcaac gaggagccct 618" gcctgctgaa gcgcctgccc aaccggcaca ttgggatctc cccc aaggaggtcg 581581 624" gggagcacgt ggtgagcgtg cgcaagagtg gcaagcatgt caccaacagc cccttcaaga 630" tcctggtggg gccatctgag atcggggacg aggt ctgg ggcaaggggc 636" tttccgaggg acacacattc caggtggcag agttcatcgt ggacactcgc aatgcaggtt 642" atgggggctt gagt attgaaggcc caagcaaggt ggacatcaac tgtgaggaca 648" tggaggacgg gacatgcaaa gtcacctact gccccaccga gcccggcacc tacatcatca 654" acatcaagtt tgctgacaag cacgtgcctg gaagcccctt cactgtgaag gtgaccggcg 660" agggccgcat gaaggagagc atcacccggc ggagacaggc catc gccaccatcg 666" gcagcacctg caac ctcaagatcc caggaaactg gttccagatg gccc 672" aggagcgcct gacacgcacc ttcacacgca gcagccacac ctacacccgc acggagcgca 678" cggagatcag caagacgcgg ggcggggaga caaagcgcga ggtgcgggtg gaggagtcca 684" cccaggtcgg cggggacccc ttccctgctg tgtttgggga cttcctgggc cgggagcgcc 690" tgggatcctt cggcagcatc acccggcagc aggagggtga ggccagctct caggacatga 696" ctgcacaggt gaccagccca tcgggcaagg tggaagccgc agagatcgtc gagg 702" acagcgccta cagcgtgcgc tttgtgcccc aggaaatggg gccccatacg gtcgctgtca 708" agtaccgtgg cgtg cccggcagcc cctttcagtt cactgtgggg ggtg 714" aaggtggtgc ccacaaggtg ggag gcacagggct ggagcgaggt gtggccggcg 720" tgccagccga gttcagcatc tggacccggg aggctggcgc tgggggcctg tccattgctg 726" tggagggtcc tagcaaagcg gagattgcat ttgaggatcg tggc tcctgcggcg 732" tctcctatgt cgtccaggaa ccaggtgact atgaggtctc catcaagttc aatgatgagc 738" acatcccaga cagccccttt gtggtgcctg tggcctccct ctcggatgac gctcgccgtc 744" tcactgtcac cagcctccag gagacggggc tgaa ccagccagcg tcctttgccg 750" tgcagctgaa ngtgcccgg ggcgtgattg atgcccgggt gcacacaccc tcgggggctg 756" agtg ctacgtctct gagctggaca gtgacaagca caccatccgc ttcatccccc 762" acgagaatgg cgtccactcc atcgatgtca agttcaacgg tgcccacatc cctggaagtc 768" ccttcaagat ccgcgttggg gagcagagcc aggctgggga cttg gtgtcagcct 774" acggtcctgg gctcgaggga ggcactaccg gtgtgtcatc agagttcatc gtgaacaccc 780" ccgg ctcgggggcc ttgtctgtca ccattgatgg cccctccaag ctgg 786" actgtcggga gtgtcctgag ggccatgtgg tcacttatac tcccatggcc cctggcaact 792" acctcattgc catcaagtac ggtggccccc agcacatcgt gggcagcccc ttcaaggcca 798" aggtcactgg tccgaggctg tccggaggcc acagccttca cgaaacatcc acggttctgg 582582 804" tggagactgt gaccaagtcc tcctcaagcc ggggctccag ctacagctcc atccccaagt 810" tctcctcaga tgccagcaag gtggtgactc ggggccctgg ccag gccttcgtgg 816" gccagaagaa ctccttcacc gtggactgca gcaaagcagg caccaacatg atgatggtgg 822" gcgtgcacgg ccccaagacc ccctgtgagg aggtgtacgt gaagcacatg gggaaccggg 828" atgt cacctacact gtcaaggaga aaggggacta catcctcatt gtcaagtggg 834" gtgacgaaag tgga agccccttca aagtcaaggt cccttgaatc ccaaaagtgc 840" ctccccagcc tcagccccca cctccagcca cacacacatt acacacacac acacacacac 846" gtgc cacacccaga cacgcacaga atcagacact acaaacacct gccttggggg 852" tgaagtgaag gcccagcctc cccaccccac cgcgccccag gggttggagg accttgtctg 858" gaca gtgtccctcc ctgggaatgt gacatgaggg ccgactgggg ccaggctcag 864" gggcagaggc tgggacacaa ggggctggcg agggctgcga ggccagggaa agtt 870" tctggcgggg ctgagcagtg catt gtgttgtggg tgtctgtgtg tgaggtcacc 876" ctcaaactgc accgccggcc agataccctc ctgaccccga ggacttggtc tggtctctct 882" ggtggctaca accccagagt tttaaggact tggaaaggaa agcacaatca gagaagaaaa 888" cagcccccga accagcagga gtggcctggc acatggaccg gcctgagcga tgtgcactcc 894" acccaagcca ggctcccagg gggcctgatt tctctctcac tgtctctttt tttaaaatgg 900" ttgcacggct ctgccccatg gggggccttt tttacacact gcgaggccca gctttctagg 906" ggacttttgc acatgtcatg cagctcagct gggagctgct gaaa actccaaata 9121 aagtgcggct gaaa aaaaaaaaa Protein se uence variant : NCBI Reference Seguence: NP_001449.3 LOCUS NP_001449 ACCESSION NP_001449 1 ysda glglgdetde mpstekdlae iqqn tftrwcnehl kcvgkrltdl 61 qrdlsdglrl iallevlsqk hprp nfrqulenv svaleflere hiklvsidsk 121 lkli 1g1iwt1i1h ysismpmwed eddedarkqt pqullgwiq 181 gka lgalvdncap glcpdweawd pnqpvenare amqqaddwlg quviapeei 241 vdpnvdehsv mtylsquka klkpgapvrs kqlnpkkaia ygpgiepqgn tvlqpahftv 301 qtvdagvgev lvyiedpegh teeakvvpnn dkdrtyavsy vpkvaglhkv tvlfagqnie 361 rspfevnvgm algdankvsa rgpglepvgn vankptyfdi ytagagtgdv avvivdpqgr 583583 42" rdtvevaled kgdstfrcty rpamegphtv hvafagapit rspfpvhvse acnpnacras 48" grglqpkgvr vkevadfkvf tkgagsgelk vtvkgpkgte epvkvreagd yypv 54" vpgkyvvtit wggyaiprSp fequspeag vqurawgpg letgqvgksa igte 60" vgtlgfsieg psqakiecdd kgdgscdvry eyav hvicddedir dspfiahin 66" appdcfpdkv kafgpglept gcivdkpaef tidaraagkg dlklyaqdad gcpidikvip 72" ngdgtfrcsy vptkpikhti iiswggvnvp kspfrvnvge gshpervkvy gpgvektng 78" ftvd cseagquvs igikcapgvv gpaeadidfd iikndndtft vkytppgagr 84" ytimvlfanq eipaspfhik vdpshdaskv kaegpglnrt gvevgkpthf tvltkgagka 90" kldvqfagta kgevvrdfei sytv kytaqugnm ggdp vpkspfvvnv 96" appldlskik vqglnskvav gqeqafsvnt rgaggngld vrmtspsrrp ipcklepggg "O2" aeaqavrymp kvdi tydghpvpgs pfavegvlpp dpskvcaygp glkggngtp "08" apfsidtkga gtgglgltve gpceakiecq dngdgscavs ylptepgeyt inilfaeahi "l4" atir kvra sgpglergkv geaatftvdc seageaelti eilsdagvka "20" evlihnnadg tyhityspaf pgtytitiky gghpvpkfpt rvthpavdt sgvkvsgpgv "26" ephgvlrevt teftvdarsl tatggnhvta rvlnpsgakt dtyvtdngdg tyrvqytaye "32" egvhlvevly devavpkspf rvgvtegcdp trvrafgpgl egglvnkanr ftvetrgagt "38" gglglaiegp seakmsckdn kdgsctveyi pftpgdydvn itfggrpipg spfrvpvkdv "44" vdpgkvkcsg vrar qutftvdcs qagraplqva vlgptgvaep vevrdngdgt "50" htvhytpatd gpytvavkya dqevprspfk ikvlpahdas kvrasgpgln asgipaslpv "56" eftidardag eglltvqild pegkpkkani rdngdgtytv sylpdmsgry titikyggde "62" ipyspfriha skcl ghgl gaclgpriqi gqetvitvda kaagegkvtc "68" tvstpdgael dvdvvenhdg tape pgkyvitirf ggehipnspf hvlacdplph "74" eeepsequl rqpyapprpg arpthwatee pvvpvepmes mlrpfnlvip favqueltg "80" evrmpsgkta rpnitdnkdg titvryapte kglhqmgiky dgnhipgspl quvdainsr "86" hvsaygpgls hgmvnkpatf tivtkdageg glslavegps kaeitckdnk dgtctvspr "92" tapgdysiiv rfddkhipgs tgdd lnvg tstdvslkit esdlsqltas "98" irapsgneep pnrh igisftpkev gehvvsvrks gkhvtnspfk ilvgpseigd 204" askvrvwgkg lseghtfqva efivdtrnag ygglglsieg pskvdinced medgtckvty 210" cptepgtyii nikfadkhvp gspftvkvtg egrmkesitr rrqapsiati gstcdlnlki 216" mvsa qerltrtftr sshtytrter teisktrgge tkrevrvees tqvggdpfpa 584584 222" vfgdflgrer lgsfgsitrq sqdm taqvtspsgk veaaeivege dsaysvrfvp 228" qemgphtvav kyrgqhvpgs pqutvgplg eggahkvrag gtglergvag iwtr 234" eagagglsia vegpskaeia gscg epgd yevsikfnde hipdspfvvp 240" vaslsddarr ltvtslqetg lkvnqpasfa vqlngargvi darvhtpsga veecyvse;d 246" sdkhtirfip hengvhsidv kfngahipgs pfkirvgeqs qagdpglvsa ygpgleggtt 252" gvssefivnt lnagsgalsv tidgpskvql dcrecpeghv apgn yliaikyggp 258" qhivgspfka kvtgprlsgg hslhetstvl vetvtkssss rgssyssipk fssdaskvvt 264" rgpglsqafv qunsftvdc mmmv gvhgpktpce mgnr vynvtytvke 2701 kgdyilivkw gdesvpgspf kvkvp Official Sym bol: HPX Official Name: hemopexin Gene ID: 3263 Organism: Homo sapiens Other Aliases: HX Other Designations: beta-1 B-glycoprotein Nucleotide seguence: NCBI Reference Seguence: NM_000613.2 LOCUS NM_000613 ACCESSION NM_000613 l aactctatat agggagttca actggtcacc cagagctgtc cctc tgcagctcag 6" catggctagg gtactgggag cacccgttgc actggggttg tggagcctat gctggtctct 12" ggccattgcc acccctcttc ctag tgcccatggg aatgttgctg aaggcgagac 18" caagccagac gtga ctgaacgctg ctcagatggc tggagctttg atgctaccac 24" cctggatgac aatggaacca tgctgttttt taaaggggag tttgtgtgga agagtcacaa " atgggaccgg gagttaatct cagagagatg gaagaatttc cccagccctg tggatgctgc 36" attccgtcaa ggtcacaaca gtgtctttct gatcaagggg gacaaagtct gggtataccc 42" tcctgaaaag aaggagaaag gatacccaaa gttgctccaa gatgaatttc ctggaatccc 585585 48" atccccactg gatgcagctg gtca ccgtggagaa tgtcaagctg aaggcgtcct 54" cttcttccaa ggtgaccgcg agtggttctg ggacttggct acgggaacca tgaaggagcg 60" ttcctggcca gctgttggga actgctcctc tgccctgaga tggctgggcc gctactactg 66" cttccagggt aaccaattcc tcga ccctgtcagg ggagaggtgc ctcccaggta 72" cccgcgggat gtccgagact tgcc ctgccctggc agaggccatg gacacaggaa 78" tggc catgggaaca gtacccacca tggccctgag tatatgcgct gtagcccaca 84" tctagtcttg tctgcactga cgtctgacaa ccatggtgcc acctatgcct ggac 90" ccactactgg cgtctggaca ccagccggga tggctggcat agctggccca ttgctcatca 96" gtggccccag ggtccttcag cagtggatgc tgccttttcc tgggaagaaa aactctatct "02" ggtccagggc acccaggtat atgtcttcct gacaaaggga ggctataccc tagtaagcgg "08" ttatccgaag cggctggaga aggaagtcgg gacccctcat gggattatcc tggactctgt "14" ggatgcggcc tttatctgcc ctgggtcttc tcggctccat atcatggcag gacggcggct "20" gctg gacctgaagt caggagccca gtgg acagagcttc cttggcccca "26" tgagaaggta gacggagcct tgtgtatgga aaagtccctt ggccctaact catgttccgc "32" caatggtccc ggcttgtacc tcatccatgg tcccaatttg tactgctaca gtgatgtgga "38" gaaactgaat gcagccaagg cccttccgca accccagaat gtgaccagtc tcctgggctg "44" ctga ggggccttct gacatgagtc tggcctggcc ccacctccta gttcctcata "50" ataaagacag attgcttctt cgcttctcac tgaggggcct tctgacatga cctg "56" cctc cccagtttct cataataaag tgct tcttcacttg aatcaaggga 1621 cctaaaaaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_000604.1 LOCUS NP_000604 ACCESSION NP_000604 1 marvlgapva lglwslcwsl aiatplppts ahgnvaeget kpdpdvterc sdgwsfdatt 61 1ddngtm1ff kgefvwkshk wdreliserw knfpspvdaa frqghnsvfl ikgdkvwvyp 121 pekkekgypk llqdefpgip spldaavech rgecqaegvl wfw dlatgtmker 181 swpavgncss alrwlgryyc fqganlrfd pvrgevppry prdvrdyfmp ghrn 241 gtghgnsthh gpeymrcsph lvlsaltsdn hgatyafsgt hywrldtsrd gwhswpiahq 301 wpqusavda afsweeklyl vqgtquvfl tkggytlvsg ypkrlekevg tphgiildsv 586586 361 daaficpgss rlhimagrrl wwldlksgaq atwtelpwph ekvdgalcme kslgpnscsa 421 ngpglylihg pnlycysdve klnaakalpq pqnvtsllgc th TLN1 OfibblSwnbokTLN1 Ofimmlenatmm1 Gene ID: 7094 sm: Homo s Other Aliases: RP1 1-1 12J3.1, ILWEQ, TLN Other Designations: talin-1 Nucleotide seguence: NCBI Reference Seguence: NM_006289.3 LOCUS NM_006289 ACCESSION NM_006289 l ggaagagttc tagcctgaaa tcgg gcgtcgtcct ggcgtcctct ccggattgcg 6" ccgcaccctc gcttcctgcc agggaggggc tgccgggctt ctcc cgagcatcga l2" gaacggggcc agagcagctt cctgcctgcc ccccgcgacc atagagcgcg ggcccagggc 18" gccgcccgcg ggtgggggac gttcccagga cggaagtggc Cgagagagtg tcgaagggag 24" ggcgaggccg gagcccgagg cgag aagcggcggg gcggcgggcc cggg " agcc aggcagcgca ggtatagcca ggctggagaa aagaagctgc caccatggtt 36" gcactttcac tgaagatcag cattgggaat gtggtgaaga cgatgcagtt tgagccgtct 42" accatggtgt acgacgcctg ccgcatcatt Cgtgagcgga tcccagaggc cccagctggt 48" cctcccagcg actttgggct ctttctgtca gatgatgacc ccaaaaaggg tatatggctg 54" gaggctggga aagctttgga ctactacatg ctccgaaatg ctat ggagtacagg 60" aagaaacaga gacccctgaa gatccgtatg ctggatggaa ctgtgaagac gatcatggtg 66" gatgactcta agactgtcac tgacatgctc atgaccatct gtgcccgcat tggcatcacc 72" aatcatgatg catt ggttcgagag ctgatggaag agaaaaagga ggaaataaca 78" gggaccttaa gaaaggacaa gacattgctg cgagatgaaa agaagatgga gaaactaaag 84" cagaaattgc acacagatga tgagttgaac gacc atggtcggac actgagggag 587587 90" gtag aggagcacga gacgctgctg agga agttctttta ctcagaccag 96" aatgtggatt cccgggaccc tgtacagctg aacctcctgt atgtgcaggc acgagatgac "02" atcctgaatg gctcccaccc tgtctccttt gacaaggcct ttgc tggcttccaa "08" tgccagatcc agtttgggcc ccacaatgag cagaagcaca aggctggctt ccttgacctg "l4" aaggacttcc tgcccaagga gtatgtgaag ggag agcgtaagat cttccaggca "20" cacaagaatt gtgggcagat gagtgagatt gaggccaagg tccgctacgt gaagctagcc "26" cgttctctca agacttacgg tgtctccttc ttcctggtga aggaaaaaat gaaagggaag "32" aacaagctag tgcccaggct tctgggcatc accaaggagt gtgtgatgcg agtggatgag "38" aagaccaagg aagtgatcca ggagtggaac ctcaccaaca gctg ggctgcgtct "44" agct tcaccctgga ttttggagat gatg gctattactc agtacagaca "50" actgaagggg agcagattgc acagctcatt gccggctaca tcgatatcat cctgaagaag "56" aaaaaaagca aggatcactt tgggctggaa ggagatgagg agtctactat ggac "62" tcagtgtccc ccaaaaagtc aacagtcctg cagcagcaat acaaccgggt ggggaaagtg "68" gagcatggct ctgtggccct gcctgccatc atgcgctctg gagcctctgg gaat "74" ttccaggtgg gcagcatgcc ccag cagcagatta ccagcggcca gatgcaccga "80" ggacacatgc ctcctctgac ttcagcccag caggcactca ctggaaccat taactccagc "86" atgcaggccg tgcaggctgc ccaggccacc ctggatgact ttgacactct gccgcctctt "92" ggccaggatg ctgcctctaa gcgt aaaaacaaga tggatgaatc aaagcatgag "98" atccactctc aggtagatgc catcacagct ggtactgcgt ctgtggtgaa cctgacagca 204" ggggaccctg ctgagacaga ctataccgca gtgggctgtg cagtcaccac aatctcctcc 210" aacctgacgg agatgtcccg tggggtgaag gctg ccttgctgga ggacgaaggc 216" ggcagtggtc ggcccctgtt agca aagggccttg cgggagcagt gtcagaactg 222" ctgcgcagtg cccaaccagc cagtgctgag ccccgtcaga acctgctgca agcagctggg 228" aacgtgggcc aggccagtgg gttg caacaaattg gggaaagtga tactgacccc 234" cacttccagg atgcgctaat gcagctcgcc aaagctgtgg caagtgctgc agctgccctg 240" gtcctcaagg ccaagagtgt ggcccagcgg acagaggact ttca gacccaagtt 246" attgctgcag caacacagtg tgccctatcc acttcccaac tagtggcctg tactaaggtg 252" gtggcaccta caatcagctc acctgtctgc caagagcaac tggtggaggc tggacgactg 258" gtagccaaag ccgtggaggg gtct gcctcccagg cagctacaga ggatgggcaa 264" cgag gggtaggagc agcagccaca gctgtcaccc aggccctaaa tgagctgctg 588588 270" cagcatgtga aagcccatgc cacaggggct gggcctgctg atga ccaggctact 276" gacaccatcc taaccgtcac tgagaacatc tcca tgggtgatgc tggggagatg 282" gtgcgacagg cccgcatcct ggcccaagcc acatctgacc tggtcaatgc catcaaggct 288" gatgctgagg gggaaagtga tctggagaac aagc tcttaagtgc tgccaagatc 294" ctagctgatg ccacagccaa gatggtagag gctgccaagg ctgc ccaccctgac 300" gagc agcagcagcg gctgcgggag gcagctgagg ggctgcgcat caat 306" gcagctgcgc agaatgccat caagaaaaag ctggtgcagc gcctggagca tgcagccaag 312" caggctgcag cctcagccac acagaccatc gctgcagctc agcacgcagc ctctaccccc 318" aaggcctctg ccggccccca gcccctgctg gtgcagagct gcaaggcagt gcag 324" attccactgc tggtgcaggg cgtccgagga agccaagccc acag ccccagcgct 330" cagcttgccc tcattgctgc cagccagagc ttcctgcagc caggtgggaa gatggtggca 336" gctgcaaagg cctcagtgcc aacgattcag gaccaggctt cagccatgca tcag 342" tgtgccaaga acctgggcac cgcgctggct gaactccgga cggctgccca gaaggctcag 348" gaagcatgtg gacctttgga gatggattct gcactgagtg tggtacagaa tctagagaaa 354" gatctacagg aagtgaaggc agcagctcga gatggcaagc ttaaaccctt acctggggag 360" acaatggaga agtgtaccca ggacctgggc aacagcacca aagccgtgag ctcagccatc 366" gcccagctac tgggagaggt tgcccagggc aatgagaatt atgcaggtat tgcagctcgg 372" gatgtggcag gtgggctgcg gtcactggcc caggccgcta ggggagtcgc tgcactgacg 378" tcagatcctg cagtgcaggc cattgtactt gcca tgct ggacaaggcc 384" agcagcctca ttgaggaggc gaaaaaggca gctggccatc caggggaccc ccag 390" cagcggcttg cccaggtggc taaagcagtg acccaggctc tgaaccgctg tgtcagctgc 396" ctacctggcc agcgcgatgt ggataatgcc ctgagggcag ttggagatgc cagcaagcga 102" ctcctgagtg ttcc tcctagcact gggacatttc aagaagctca gagccggttg 108" aatgaagctg ctgctgggct ggca gccacagaac tggtgcaggc ctctcgggga 114" acccctcagg acctggctcg agcctcaggc cgatttggac aggacttcag caccttcctg 120" gaagctggtg tggagatggc aggccaggct ccgagccagg aggaccgagc tgtg 126" tccaacttga agggcatctc catgtcttca agcaaacttc ttctggctgc caaggccctg Z32" tccacggacc ctgctgcccc taacctcaag agtcagctgg ctgcagctgc agta Z38" actgacagca tcaatcagct tatg tgcacccagc aggcacccgg ccagaaggag Z44" tgtgataacg ccctgcggga attggagacg gtccgggaac tcctggagaa cccagtccag 589589 150" cccatcaatg acatgtccta ctttggttgc ctggacagtg taatggagaa ggtg 156" ctgggcgagg ccatgactgg catctcccaa aatgccaaga acct gccagagttt Z62" ggagatgcca tttccacagc ctcaaaggca ctttgtggct tcaccgaggc agctgcacag Z68" gctgcatatc tggttggtgt ctctgacccc aatagccaag ctggacagca agggctagtg Z74" gagcccacac agtttgcccg tgcaaaccag gcaattcaga tggcctgcca gagcctggct gtacccaggc ccaggtgctc tctgcagcca ccattgtggc tctgcactgt gtaacagctg tcgcctggct tctgcccgta ccaccaatcc tactgccaag Z92" cgccagtttg cagc caaggaggtg gccaacagca cagctaatct tgtcaagacc Z98" atcaaggcgc tagatggggc cttcacagag gagaaccgtg cccagtgccg agcagcaaca 504" gcccctctgc tggaggctgt ggacaatctg agtgcctttg cgtccaaccc tgagttctcc 510" agcattcctg cccagatcag ccctgagggt cgggctgcca tggagcccat tgtgatctct 516" gccaagacaa tgttagagag tgccggggga caga cagcccgggc agtc 522" aatccccggg accccccgag ctggtcggtg ctggccggcc actcccgtac tgtctcagac 528" tccatcaaga agctaattac aagcatgagg gacaaggctc cagggcagct ggagtgtgaa 534" acggccattg cagctctgaa cagttgtcta cgggacctag accaggcttc cctcgctgca 540" gtcagccagc agcttgctcc ccgtgaggga atctctcaag aggccttgca cactcagatg 546" ctcactgcag tccaagagat ctcccatctc attgagccgc tggccaatgc tgcccgggct 552" gaagcctccc gaca caaggtgtcc cagatggcgc agtactttga gccgctcacc 558" ctggctgcag tgggtgctgc ctccaagacc cacc cgcagcagat ggcactcctg 564" gaccagacta aaacattggc agagtctgcc ctgcagttgc tatacactgc caaggaggct 570" ggtggtaacc caaagcaagc agctcacacc caggaagccc aggc tgtgcagatg 576" atgaccgagg ccgtagagga aaca accctcaacg aggcagccag tgctgctggg 582" gtcgtgggtg gcatggtgga ctccatcacc caggccatca accagctaga tgaaggacca 588" atgggtgaac cagaaggttc cttcgtggat taccaaacaa tgcg caag 594" gccattgcag tgaccgttca ggagatggtt accaagtcaa acaccagccc agaggagctg 600" cttg agct gaccagtgac tatggccgtc tggcctcgga ggccaagcct 606" gtgg ctgctgaaaa tgaagagata ggttcccata tcaaacaccg ggtacaggag 612" ctgggccatg ccgc cacc aaggcaggcg agtg cagccccagt 618" gatgcctaca ccaagaagga gctcatagag tgtgcccgga gagtctctga gaaggtctcc 624" cacgtcctgg ctgcgctcca ggctgggaat cgtggcaccc aggcctgcat cacagcagcc 590590 630" agcgctgtgt ctggtatcat tgctgacctc gacaccacca tcatgttcgc cactgctggc 636" acgctcaatc gtgagggtac tgaaactttc gctgaccacc gggagggcat cctgaagact 642" gcgaaggtgc tggtggagga caccaaggtc ctggtgcaaa acgcagctgg gagccaggag 648" gcgc aggctgccca gtcctccgtg gcgaccatca cccgcctcgc tgatgtggtc 654" ggtg cagccagcct gggagctgag gaccctgaga cccaggtggt actaatcaac 660" gcagtgaaag ccaa agccctggga gacctcatca gtgcaacgaa ggctgcagct 666" ggcaaagttg gagatgaccc tgctgtgtgg cagctaaaga ccaa ggtgatggtg 672" accaatgtga catcattgct taagacagta aaagccgtgg aagatgaggc caccaaaggc 678" gccc tggaggcaac cacagaacac atacggcagg agctggcggt tttctgttcc 684" ccagagccac ctgccaagac ctctacccca gaagacttca tccgaatgac tatc 690" accatggcaa ccgccaaggc cgttgctgct tcct agga agatgtcatt 696" gccacagcca atctgagccg tatt gcagatatgc ttcgggcttg caaggaagca 702" gcttaccacc cagaagtggc ccctgatgtg cggcttcgag ccctgcacta tggccgggag 708" tgtgccaatg gctacctgga actgctggac catgtactgc tgaccctgca gaagccaagc 714" ccagaactga agcagcagtt gacaggacat tcaaagcgtg tggctggttc cgtcactgag 720" ctcatccagg ctgctgaagc catgaaggga acagaatggg caga ggaccccaca 726" gtcattgctg agaatgagct cctgggagct gcagccgcca ttgaggctgc agccaaaaag 732" ctagagcagc tgaagccccg ggccaaaccc aaggaggcag atgagtcctt gaactttgag 738" gagcagatac tagaagctgc caagtccatt gcagcagcca ccagtgcact ggtaaaggct 744" gcgtcggctg cccagagaga actagtggcc caagggaagg tgggtgccat tccagccaat 750" gacg atgggcagtg gtcccagggc ctcatttctg ctgcccggat ggtggctgcg 756" gccaccaaca atctgtgtga caat gcagctgtac aaggccatgc cagccaggag 762" aagctcatct catcagccaa gcaggtagct acag cccagctcct ctgc 768" aaggtcaagg ctgaccagga ctcggaggca atgaaacgac ttcaggctgc tggcaacgca 774" gtgaagcgag cctcagataa tctggtgaaa gcagcacaga cagc ctttgaagag 780" caggagaatg agacagtggt ggtgaaagag aagatggttg gcggcattgc ccagatcatc 786" cagg aagaaatgct tcggaaggaa Cgagagctgg aagaggcgcg gaagaaactg 792" gcccagatcc ggcagcagca gttt ctgccttcag agcttcgaga tgagcactaa 798" agaagcctct tctatttaat gcagacccgg cccagagact gtgcgtgcca ctaccaaagc 804" cttctgggct gtcggggccc aacctgccca accccagcac tccccaaagt gcctgccaaa 591591 8101 ccccagggcc tggccccgcc cagtcccgca gtacatcccc tgtcccctcc ccaa 8161 gtgccttcat gccctagggc cccccaagtg cctc cccagagtat taacgctcca 8221 agagtattat taacgctgct cgat ctgaatctgc cggggcccca gcccactcca 8281 ccctgccagc agcttccggc cagtccccac agcctcatca gctctcttca ccgttttttg 8341 atactatctt CCCCC&CCCC cagctaccca taggggctgc agagttataa gccccaaaca 8401 ggtcatgctc caataaaaat gattctacct acaa n seguence: NCBI Reference Seguence: NP_006280.3 LOCUS NP_006280 ACCESSION NP_006280 1 mvalslkisi gnvvktmqfe pstmvydacr iireripeap agppsdfglf lsdddpkkgi 6" wleagkaldy ymlrngdtme yrkquplki rmldgtvkti mvddsktvtd mlmticarig 12" yslv relmeekkee itgtlrkdkt llrdekkmek 1qu1htdde lnwldhgrtl 18" rquveehet lllrrkffys dqnvdsrdpv qlnllyvqar ddilngshpv sfdkacefag 24" chqiquph neqkhkagfl dlkdflpkey vqugerkif qahkncgqms ryvk " larslktygv sfflvkekmk gknklvprll gitkecvmrv dektkeviqe wnltnikrwa 36" aspksftldf gdyngyysv qttegeqiaq liagyidiil dhfg stml 42" edsvspkkst vquqynrvg kvehgsvalp aimrsgasgp enfqvgsmpp aqqqitsgqm 48" hrghmpplts aqqaltgtin ssmqavqaaq atlddfdtlp ska wrknkmdesk 54" heihsquai tagtasvvnl tagdpaetdy tavgcavtti ssnltemsrg vkllaalled 60" eggsgrpllq aakglagavs qpas aeprqnllqa agnvgqasge 66" dphfqdalmq lakavasaaa alvlkaksva qrtedsglqt qviaaatqca lstsqlvact 72" kvvaptissp cheqlveag rlvakavegc vsasqaated gqllrgvgaa atavtqalne 78" llqhvkahat gagpagrydq atdtiltvte gdag rila qatsdlvnai 84" kadaegesdl lsaa kiladatakm veaakgaaah pdseeqqqu lrma 90" tnaaaqnaik kklvqueha akqaaasatq tiaaaqhaas tpkasagpqp llvqsckava 96" eqipllvqgv rgsqaquSp saqlaliaas qsflqugkm vaaakasvpt amql 102" sqcaknlgta laelrtaaqk aqeacgplem dsalsvvqnl ekdlqevkaa ardgklkap 108" getmekctqd lgnstkavss aiaqllgeva qgnenyagia ardvagglrs laqaargvaa 114" ltsdpavqai vldtasdvld kasslieeak kaaghpgdpe sqquaqvak avtqalnrcv 592592 "20" rdvd nalravgdas krllsdslpp stgtfqeaqs rlneaaagln qaatequas "26" rgtpqdlara sgrfqufst fleagvemag draq vvsnlkgism ssslelaak "32" alstdpaapn aaar nqli tmctqqapgq lrel etvreLlenp "38" vqpindmsyf gcldsvmens kvlgeamtgi sqnakngnlp efgdaistas kalcgfteaa "44" aqaaylvgvs dpnsqagqqg lveptqfara nqaiqmacqs lgepgctqaq vlsaativak "50" htsalcnscr lasarttnpt ak evanstanlv ktikaldgaf teenraqcra "56" eavd nlsafasnpe fssipaqisp egraamepiv isaktmlesa ggliqtaral "62" avnprdppsw svlaghsrtv sdsikklits mrdkapgqle cetaiaalns clrdldqasl "68" aavsqqlapr egisqealht qmltavqeis hlieplanaa raeasqlghk vsqmaqyfep "74" ltlaavgaas ktlshpqqma lldqtktlae salqllytak eaggnpkqaa htqeaLeeav "80" qmmteavedl tttlneaasa agvvggmvds itqainqlde gpmgepegsf vdyqttmvrt "86" akaiavtvqe mvtksntspe elgplanqlt sdygrlasea kpaavaaene khrv "92" qelghgcaal vtkagalqcs psdaytkkel iecarrvsek vshvlaalqa gnrgtqacit "98" aasavsgiia dldttimfat agtlnregte tfadhregil ktakvlvedt kvlvqnaags 204" qeklaqaaqs svatitrlad vvklgaaslg qvvl inavkdvaka atka 210" aagkvgddpa qulknsakv mvtnvtsllk tvkavedeat kgtraleatt ehirqelavf 216" cspeppakts tpedfirmtk gitmatakav aagnscrqed viatanlsrr aiadmlrack 222" eaayhpevap dvrlralhyg recangylel ldhvllthk pspelqult ghskrvagsv 228" teliqaaeam kgtewvdped ptviaenell gaaaaieaaa kkleqlkpra kpkeadesLn 234" feeqileaak siaaatsalv kaasaaqrel vaqgkvgaip analddngs qglisaarmv 240" aaatnnlcea ghas qeklissakq vaastaqllv ackvkadqu eamkrlqaag 246" navkrasdnl vkaaqkaaaf eeqenetvvv giaq iiaaqeemlr kereleeark 2521 klaqirqqqy lrde h PSME2, Official Symbol: PSME2 and Name: proteasome (prosome, macropain) activator subunit 2 (PA28 beta) [Homo sapiens] Other Aliases: PA28B, PA28beta, REGbeta Other Designations: llS regulator complex beta subunit; llS regulator complex subunit beta; MCP activator, 31-kD subunit; ta; activator of multicatalytic protease subunit 2; cell migration-inducing protein 22; proteasome activator 28 subunit beta; proteasome activator 28-beta; proteasome activator complex subunit 2; proteasome tor hPA28 subunit beta 593593 LOCUS 818 ACCESSION NM_002818 VERSION 818.2 GI:30410791 tggggagtga aagcgaaagc ccgggcgact gaga atct agcgactgaa 6" gcagcatggc caagccgtgt ggggtgcgcc ggga agcccgcaaa caggtggagg 12" tcttcagaca gaatcttttc caggaggctg aggaattcct ctacagattc ttgccacaga 18" aaatcatata cctgaatcag ctcttgcaag aggactccct caatgtggct gacttgactt 24" ccctccgggc cccactggac atccccatcc cagaccctcc acccaaggat gatgagatgg " aaacagataa gcaggagaag aaagaagtcc ataagtgtgg atttctccct gggaatgaga 36" aagtcctgtc cctgcttgcc ctggttaagc cagaagtctg gactctcaaa gagaaatgca 42" tgat tacatggatc caacacctga tccccaagat tgaagatgga aatgattttg 48" gggtagcaat ccaggagaag gtgctggaga gggtgaatgc cgtcaagacc aaagtggaag 54" ctttccagac aaccatttcc aagtacttct cagaacgtgg ggatgctgtg gccaaggcct 60" ccaaggagac tcatgtaatg gattaccggg ccttggtgca tgagcgagat gaggcagcct 66" atggggagct catg gacc tgagggcctt ctatgctgag ctttatcata 72" tcatcagcag caacctggag aaaattgtca acccaaaggg tgaagaaaag ccatctatgt 78" actgaacccg ggactagaag gaaaataaat gatctatatg ttgtgtgga LOCUS NP_002809 ION NP_002809 VERSION NP_002809.2 10792 1 makpcgvrls gearkqvevf rqnlfqeaee flyrflqui iylnqllqed slnvadltsl 61 rapldipipd pppkddemet quekkevhk cgflpgnekv lsllalvkpe thlkekcil 121 vitwithip kiedgndfgv aiqekvlerv navktkveaf qttiskyfse rgdavakask 181 ethvmdyral aayg elramvldlr afyaelyhii ssnlekivnp kgeekpsmy Q9BQE5 Official Symbol: APOL2 and Name: apolipoprotein L, 2 Other Aliases: APOL-II, APOL3 594594 Other ations: apolipoprotein L-II; apolipoprotein L2 LOCUS NM_030882 ACCESSION NM_030882 VERSION NM_030882.2 GI:22035654 gtgctgggga gcagcgtgtt tactgtgctt ggtcatgagc tgctgggaag ttgtgacttt 6" cactttccct ttcgaattcc agggtatatc tgggaggccg gaggacgtgt ctggttatta 12" cacagatgca cagctggacg tgggatccac acagctcaga ggat cttgctcagt 18" ctctgtcaga ggaagatccc ttggacaaga ggaccctgcc ttggtgtgag agtgagggaa 24" gaggaagctg gaacgagggt taaggaaaac cttccagtct ggacagtgac tggagagctc " caaggaaagc ccctcggtaa cccagccgct ggcaccatga acccagagag cagtatcttt 36" attgaggatt accttaagta tttccaggac caagtgagca gagagaatct gctacaactg 42" ctgactgatg cctg gaatggattc gtggctgctg ctgaactgcc cagggatgag 48" gcagatgagc tccgtaaagc tctgaacaag cttgcaagtc acatggtcat gaaggacaaa 54" aaccgccacg ataaagacca gcagcacagg cagtggtttt tgaaagagtt tcctcggttg 60" aaaagggagc ttgaggatca cataaggaag ctccgtgccc ttgcagagga ggttgagcag 66" agag ccat tgccaatgtg gtgtccaact gcac tacctctggc 72" atcctgaccc tcctcggcct gggtctggca cccttcacag aaggaatcag ttttgtgctc 78" ttggacactg gcatgggtct gggagcagca gctgctgtgg ttac ctgcagtgtg 84" gtagaactag taaacaaatt gcgggcacga gcccaagccc gcaacttgga cggc 90" accaatgtag caaaggtgat gaaggagttt gtgggtggga acacacccaa tgttcttacc 96" ttagttgaca attggtacca agtcacacaa gggattggga ggaacatccg tgccatcaga "02" cgagccagag ccaaccctca gttaggagcg tatgccccac ccccgcatat cattgggcga "08" atctcagctg aaggcggtga acaggttgag agggttgttg aaggccccgc ccaggcaatg "l4" agcagaggaa ccatgatcgt gggtgcagcc actggaggca tcttgcttct gctggatgtg "20" cttg catatgagtc aaagcacttg cttgaggggg caaagtcaga tgag "26" gagctgaaga agcgggctca ggagctggag gggaagctca actttctcac caagatccat "32" ctgc agccaggcca agaccaatga ccccagagca gtgcagccac agaa .38" ggca caggccagga gcag actttttttt tttttttttt ttttttttga "44" gatggagtct cgctctatcg cccaggatgg agtgcagtgg ctcaatctcg gctcactgca "50" aactccgcct cccgggttca caccattctc cagt ctcccgagta gctgggacta 595595 "56" caggcacctg ccaccacgcc attt ttttgtattt tcactggaga cggggtttca "62" ctgtgttagc cacgatggtc tccatctcct gacctcgtga tctgcccacc tcggcctccc "68" aaagtgctgg gattacaggc gtgagccacc gcgcctggcc aaaatgcaga cattttatta "74" gggggataag gagggcaagg ttat ggaactgagt gttagtgact ttggcatttg "80" tgtagctgag cacagcaagg gaggggttaa tgcagatggc aagtgcacca aggagaaggc "86" aggaacactg gagcctgcaa agga gagaggactg gagagtgtgg ggaa "92" gaagtagttt actttggact aaagaatata ttgggcgaag aatagagggg gagcttgcag "98" gaaccagcaa tgagaaggcc aggaaaagaa agagctgaaa atggagaaaa ccagagttag 204" aactgttgga tacaggagaa gaaacagcag ctccactacc gacccccccc caggtttgat 210" gtccttccaa gaataaagtc tttccctggt gatggtctct cgctctgtct ttccagcatc 216" cactctccct tgtccttctg ggggtgtatc acagtcagcc ttct tcatgatggt 222" ggttggggtg gttgtcatgt gacgggtccc ctccaggtta ctaaagggtg catgtcccct 228" gcttgaaccc tgagaggcag gtggtaggcc atggccacaa tccccagctg aggagcaggt 234" gtccctgaga acccaaactt gagt atctgagaac caaccaatga aaacagtccc 240" atcgctctta gccggtaagt aaacagtcag aagattagca tgaaagcagt ttagcattgg 246" gaggaagcac ctag agctgtcctg tcgctgccca ggattgacct aagt 252" cccaataaac tcacctactc accaa LOCUS NP_112092 ACCESSION NP_112092 VERSION NP_112092.1GI:13562090 1 mnpessifie qdqv srenllqllt ddeawngfva aaelprdead elrkalnkla 61 shmvmkdknr hdkdqthqw rlkr eledhirklr alaeeveqvh rgttianvvs 121 nsvgttsgil tllglglapf vlld tgmglgaaaa svve lvnklraraq 181 arnldqsgtn efvg gntpnvltlv dnquvtqgi grnirairra ranpqlgaya 241 ppphiigris aeggeqverv vegpaqamsr gtmivgaatg gilllldvvs layeskhlle 301 gaksesaeel kkraqelegk lnfltkihem lquqdq Q9Y262 Official Symbol: EIF3L and Name: eukaryotic translation initiation factor 3, subunit L 596596 Other Aliases: AL022311.1, EIF3EIP, EIF3Sl 1, EIF3S6IP, HSPC021, HSPC025, MSTPOOS Other Designations: eIEF associated protein HSPC021; eukaryotic translation initiation factor 3 subunit 6-interacting protein; eukaryotic translation initiation factor 3 subunit E- interacting protein; eukaryotic translation initiation factor 3 subunit L LOCUS NM_001242923 ACCESSION NM_001242923 VERSION NM_001242923.1 GI:339275830 gctgaacttc cggcctcagg acgcaggcgc gggccgctca tttcgctctt tccggcggtg 6" ctcgcaagcg aggcagccat gtcttatccc gctgatgatt atgagtctga ggcggcttat 12" gacccctacg cttatcccag cgactatgat acag gagatccaaa gcaggacctt 18" gcttatgaac gtcagtatga aacc tatcaggtga tccctgaggt gatcaaaaac 24" cagt atttccacaa aactgtctca gatttgattg accagaaagt gtatgagcta " caggccagtc gtgtctccag tgatgtcatt gaccagaagg tgtatgagat ccaggacatc 36" tatgagaaca gctggaccaa gctgactgaa agattcttca agaatacacc ttggcccgag 42" gcca ttgctccaca ggttggcaat gatgctgtct tcctgatttt atacaaagaa 48" ttatactaca ggcacatata tgccaaagtc agttttcagt cattcagtca gtaccgctgt 54" aagactgcca agaagtcaga ggaggagatt cttc atcc caaaatctgg 60" aatgttcata gtgtcctcaa tgtccttcat gtag acaaatccaa catcaaccga 66" cagttggagg tatacacaag tgac cctgagagtg tggctgggga gtatgggcgg 72" cactccctct acaaaatgct tggttacttc agcctggtcg ggcttctccg ctcc 78" ctgttaggag attactacca ggccatcaag gtgctggaga acatcgaact gaacaagaag 84" agtatgtatt tgcc agagtgccag gtcaccacat atgt tgggtttgca 90" tatttgatga tgcgtcgtta ccaggatgcc atccgggtct tcgccaacat cctcctctac 96" atccagagga ccaagagcat gttccagagg accacgtaca agtatgagat gattaacaag 102" gagc atgc ggcc attgccctca cgatgtaccc catgcgtatt 108" gatgagagca ttcacctcca gctgcgggag aaatatgggg acaagatgtt gcgcatgcag 114" aaaggtgacc cacaagtcta tgaagaactt ttcagttact cctgccccaa gttcctgtcg 120" cctgtagtgc ccaactatga gcac tacc acaaagagcc cttcctgcag 126" cagctgaagg tgttttctga tgaagtacag cagcaggccc agctttcaac catccgcagc 132" ttcctgaagc tctacaccac catgcctgtg gccaagctgg ctggcttcct ggacctcaca 597597 "38" gagcaggagt tccggatcca gcttcttgtc caca agatgaagaa cctcgtgtgg "44" accagcggta tctcagccct ggatggtgaa tcag cctcagaggt tgacttctac "50" attgataagg acatgatcca catcgcggac accaaggtcg ccaggcgtta tggggatttc "56" ttcatccgtc agatccacaa atttgaggag cttaatcgaa ccctgaagaa gatgggacag "62" agaccttgat gatattcaca cacattcagg aacctgtttt gatgtattat aggcaggaag "68" tgtttttgct accgtgaaac ctttacctag atcagccatc agcctgtcaa ctcagttaac "74" aagttaagga ccgaagtgtt ggat aaag gatctttgga tttg "80" catt attgtaggag agaatttgtg ggttgtggca gtaatacatt tcccatgtgt "86" cctgatgctt tcaggataca tcagttgtta aaat tgagttattt ttattttgtg "92" cttttgagat ggagtctcac tctgtct LOCUS 229852 ACCESSION NP_001229852 VERSION NP_001229852.1 GI:339275831 _ msypaddyes eaaydpyayp sdydmhtgdp kqdlayerqy eqqtyqvipe viknfiqyfh 6" ktvsdliqu vyelqasrvs sdviquvye iqdiyenswt klterffknt pwpeaeaiap 12" qvgndavfli lykelyyrhi yakvsqufs qyrcktakks eeeidflrsn pkiwnvhsvl 18" nvlhslvdks evyt sggdpesvag lykm lgyfslvgll rlhsllgdyy 24" qaikvlenie lnkksmysrv pecqvttyyy vgfaylmmrr yqdairvfan illyiqrtks " qurttykye minkqneth allaialtmy pmridesihl qlrekygdkm lrmqudpqv 36" yeelfsyscp kflspvvpny dnvhpnyhke pfqulkvfs dequqaqls tirsflklyt 42" tmpvaklagf ldlteqefri qllvfkhkmk nlthsgisa ldgequase vdfyidkdmi 48" hiadtkvarr ygdffirqih kfeelnrtlk kmgqrp Official : RABlB and Name: RABlB, member RAS oncogene family [Homo sapiens] Other Designations: ras-related protein ; small GTP-binding protein LOCUS NM_030981 ACCESSION NM_030981 XM_001134089 VERSION NM_030981.2 Glill6014337 598598 gcgggcgggg cctctggggc ggagcggcca ccatcttgga acgggaggcg agtc 6" gactgggagc gaccgagcgg gccg ccgccatgaa ccccgaatat gactacctgt 12" ttaagctgct tttgattggc gactcaggcg tgggcaagtc atgcctgctc ctgcggtttg 18" ctgatgacac gtacacagag agctacatca gcaccatcgg ggtggacttc aagatccgaa 24" ccatcgagct ggatggcaaa actatcaaac ttcagatctg ggacacagcg ggccaggaac " ggttccggac catcacttcc agctactacc ggggggctca tggcatcatc gtggtgtatg 36" acgtcactga ccaggaatcc tacgccaacg agtg gctgcaggag attgaccgct 42" atgccagcga gaacgtcaat aagctcctgg acaa gagcgacctc accaccaaga 48" aggtggtgga caacaccaca gccaaggagt ttgcagactc tctgggcatc cccttcttgg 54" agacgagcgc caagaatgcc accaatgtcg cgtt catg gaaa 60" tcaaaaagcg gatggggcct ggagcagcct ctgggggcga gcggcccaat ctcaagatcg 66" acagcacccc tgtaaagccg gctggcggtg gctgttgcta ggaggggcac atggagtggg 72" acaggagggg gcaccttctc cagatgatgt aggg ggcaggaggt acctccctct 78" ccctctcctg ttga gtctgtggct ttggggtgtc ctgggctccc catctccttc 84" tggcccatct gcctgctgcc ctgagccccg gttctgtcag taag ggaggacact 90" cagggcctgt ggccaggcag ggcggaggcc tgctgtgctg ttgcctctag gtgactttcc 96" aagatgcccc cctacacacc tttctttgga acgagggctc cggt gtccctccca "02" cccccatgta tgctgcactg ggttctctcc ttcttcttcc tgctgtcctg cccaagaact .08" ctcc ccggcctcta ctgccctggc cagt gcccagggcg gtgg "14" ccaggggatc caggacctgg gatccagggc cctgggctgg acctcaggac aggcatggag "20" gccacagggg cccagcagcc caccctttcc tctccccact gcctcctctc ccttcctaca "26" ctcccagctc gagccgtcca gctgcggtgg gatctgagta tatctagggc gggtgggcgg "32" gtagcagtgc tgggcctgtg tcttgagcct ggagggagtc tgctcctgcc gccctctgcc "38" ctgccagaga catg cgctgcctgc ccaccgtgcc cctttgtccc catgtcaggc "44" ggaggcggaa ggcccaccgt ggct gggcaccagc cttaaccctc actctgctag "50" cacctcctcc ctttccccaa ggtagcacat ctggctcact ccccactccg tctctggagc "56" ccaccaggga aggccctcat cccctgccgc tacttctctg gggaatgtgg gttccatcca "62" ggattggggg tgct cacccactct gcacccagga tcctagtccc tctg "68" gcacagctgc ttcctgcaag aaagcaagtc tttggtctcc ctgagaagcc atgtccctcg "74" tgctgtctct tgcctgtccc acctgtgccc tgccctccag cttgtattta agtccctggg 599599 1801 ctgccccctt ggggtgcccc ccgctcccag gttcccctct ggtgtcatgt caggcatttt 1861 gcaaggaaaa gccacttggg gaaagatgga caaa aaaaattaat ccat 1921 tcgg gtgagctgag ggtttttgca aggaa LOCUS NP_112243 ACCESSION NP_112243 XP_001134089 VERSION NP_112243.1GI:13569962 1 mnpeydylfk llligdsgvg ksclllrfad dtytesyist igvdfkirti eldgktiqu 61 iwdtagqerf rtitssyyrg ahgiivvydv tdqesyanvk qwlqeidrya senvnkllvg 121 nksdlttkkv vdnttakefa dslgipflet saknatnveq afmtmaaeik krmgpgaasg 181 gerpnlkids gggc C Official Symbol: RPS6 ed by HGNC Official Full Name: ribosomal protein S6provided by HGNC Also known as: S6 LOCUS NM_001010 ACCESSION NM_001010 VERSION NM_001010.2 GI: 17158043 cctcttttcc gtggcgcctc gttc agctgcttca agatgaagct gaacatctcc 6" ttcccagcca ctggctgcca gaaactcatt gaagtggacg atgaacgcaa acttcgtact 12" ttctatgaga agcgtatggc cacagaagtt gctgctgacg ctctgggtga agaatggaag 18" gtgg tccgaatcag tggtgggaac gacaaacaag gtttccccat gggt 24" gtcttgaccc gtgt ccgcctgcta ctgagtaagg ggcattcctg ttacagacca " aggagaactg gagaaagaaa gagaaaatca gttcgtggtt gcattgtgga tgcaaatctg 36" agcgttctca acttggttat tgtaaaaaaa ggagagaagg atattcctgg actgactgat 42" actacagtgc ctcgccgcct gggccccaaa agagctagca gaatccgcaa acttttcaat 48" ctctctaaag aagatgatgt ccgccagtat gttgtaagaa agcccttaaa taaagaaggt 54" aagaaaccta ggaccaaagc acccaagatt cagcgtcttg ttactccacg tgtcctgcag 60" cacaaacggc ggcgtattgc gaag cagcgtacca agaaaaataa agaagaggct 66" gcagaatatg ctaaactttt ggccaagaga gagg agaa gcgccaggaa 600600 721 caaattgcga agagacgcag actttcctct ctgcgagctt ctaa gtctgaatcc 781 agtcagaaat aagatttttt gagtaacaaa taaataagat cagactctg LOCUS NP_001001 ACCESSION NP_001001 VERSION NP_001001.2 GI: 17158044 1 mklnisfpat gchlievdd erklrtfyek rmatevaada lgeewkgyvv risggnqug 61 fpmkqgvlth grvrlllskg rrtg erkrksvrgc ivdanlsvln lvivkkgekd 121 ipgltdttvp rrlgpkrasr irklfnlske ddvrquvrk plnkegkkpr tkapkiquv 181 tprvlqhkrr rialkqutk knkeeaaeya kllakrmkea kequeqiak rrrlsslras 241 tsksessqk Official SymbolRRPl Official Full Nameribosomal RNA processing 1 homolog (S. cerevisiae) Also known asNNP-1; NOP52; RRP1A; 56E LOCUS 683 ION NM_003683 N NM_003683.5 GI: 134304836 gccggggcca ggaggaggcg ggcgcaggag gcgcgtgctc agtgtgctgg gtaccaggcg 6" actccgggac agggggtctc ggccgtcggc gtcatggttt cgcgcgtgca gctcccgcct 12" gagatccagc tggctcagcg cctggcgggg aatgagcagg tgacccggga ggtg 18" aggaagctcc ggaaatacat cgtcgccagg actcagcggg ccgcaggtgg ttttacgcac 24" gacgagctgc tgaaggtgtg gaaaggactg tgca tgtggatgca ggacaagcca " ctcctccagg aagaattagg aaggactatt tcccagctcg ttcatgcttt cacg 36" gaggcgcagc acctgttcct tcaggccttc tggcagacca tgaatcgcga gtggacgggc 42" attgacaggc tgga taaattctac atgctcatgc ggatggtcct gaacgagtcc 48" ttgaaggttc tgca aggctgggaa gaaagacaga tcgaggagct gctagagctg 54" ctgatgactg agatcctgca ccccagcagc caggccccca acggtgtgaa gagccacttc 60" atcgagatct agga gctgaccaaa gtgggcgccg aggagcttac ggcagaccag 66" aacctgaagt tcatcgaccc cttctgcaga attgctgccc ggaccaagga ttccttggtt 601601 72" ttgaacaaca tcactcgagg catctttgag acgattgtgg agcaggcccc gcttgccatt 78" gaagacctcc tgaatgaact acag gatgaggagg tggcgtcgga cagtgatgag 84" tcctctgagg gtggtgagcg tggagacgcg ctgtcccaga agaggtctga gaagccgccc 90" gcaggctcca tctgcagggc tgaacctgag gctggtgagg agcaggcagg cagg 96" gacagtggcg gccccgttct ccagtttgac tacgaggcag ttgctaacag actgtttgaa "02" atggccagcc gccagagcac tcag aacagaaagc gtctctacaa agtgatccgg "08" aagctgcagg acctggcagg aggcattttc cctgaagatg agatcccaga gaaggcctgc "14" aggcgcctgc ttgaagggag gcggcagaag aagacgaaga agcagaagcg tctgctcagg "20" ttgcagcagg agagagggaa aggtgagaag gagcccccga gcccgggcat ggagaggaag "26" aggagcagga ggaggggtgt aggggccgac cccgaggcgc gggcagaggc tggtgagcag "32" ccaggcacag ctgagcgggc cctgctccga gatcagccca ggggccgtgg ccagagaggg "38" gctcgccaga gaaggaggac acctcggccc ctgaccagtg cccgagcaaa ggcggccaat "44" gtccaggagc cggagaagaa gaagaaacgc agggagtgat gtggccgggc caaggacagg "50" cagggaggga ggccaggcct cacc gcgggacgag gctgaccggg ctgttctgta "56" gactcaggac cgtggctcca gaactcctgt gccaggcggg agggaagggc ggcactggag "62" agatgggccc atcattaggg gccagcatcc caggaactgg acctttcccc tccg "68" cctgtggctg tgatgacctt gggccagaag gtcaaactcc tgaa actctgcctg "74" cagcaggact ggccgcccct gctgtggggg aaaa taaaatgccg cctt "80" gccagggaaa aaaaaaaaaa LOCUS NP_003674 ACCESSION 674 VERSION NP_003674.1 GI:4503247 _ mvsrvqlppe agn eqvtrdravr klrkyivart qraaggfthd ellkvwkgLf 6" ycmwqukpl lqeelgrtis qtte athflqafw qtmnrewtgi drlrldkfym 12" lmrmvlnesl kvlkmqgwee rqieellell mteilhpssq apngvkshfi eifleeltkv 18" gaeeltadqn lkfidpfcri aartkdslvl ifet iveqaplaie dtqd 24" eevasdsdes seggergdal sqkrsekppa gsicraepea geeqagddrd sggpvqudy " eavanrlfem asrqstpsqn rkrlykvirk lqdlaggifp edeipekacr rllegrquk 36" lrl geke ppspgmerkr srrrgvgadp earaeageqp gtaerallrd 42" qprgrgqrga rqrrrtprpl aanv qepekkkkrr e 602602 Sunnnmy(MfichSwnb0kSEPTll Official Full Name: septin 11 LOCUS NM_018243 ION 243 VERSION NM_018243.2 GI:38605734 ggcgtggggg gagcagatgc cgctggctgc cagcgggacg ccggcgagca gagcgcagcc 6" gcgagggagg cgcgagggag gcgagccgga gcccgagcac tagcagcagc cggagtcggc l2" gtaaagcacc cgggcgcagc cggt gccgcagctg cgatggccgt ggccgtgggg 18" agaccgtcta atgaagagct tcgaaacttg tctg gccatgtggg atttgacagc 24" ctccctgacc agctggtcaa caagtctact ggat tcaa catcctttgt " gttggtgaga caggcattgg caaatccacg ttaatggaca ctttgttcaa caccaaattt 36" gaaagtgacc ctca caatgaacca ggtgttcggt ccag aagttatgag 42" cttcaggaaa tacg gctgaagtta accattgttg acaccgtggg atttggagac 48" cagataaata aagatgacag ctataagccg atagtagaat atattgatgc cgag 54" gcctacctgc aagaggaatt gaagattaaa cgttctctct tcaactacca tgacacgagg 60" atccatgcct gcctctactt tattgcccct actggacatt cactaaagtc cctggatctg 66" gtcaccatga aaaagctgga cagtaaggtg aacatcattc caataattgc aaaagctgac 72" gcca agaatgaact attc aagagtaaga tcatgagtga actggtcagc 78" aatggggtcc agatatatca gtttcccact gatgaagaaa cggtggcaga gattaacgca 84" acaatgagtg tccatctccc atttgcagtg gttggcagca ccgaagaggt gaagattggc 90" aacaagatgg caaaggccag gcagtacccc tggggtgtgg tgcaggttga gaatgaaaat 96" cattgcgatt ttgtgaaact gatg ctgatccgcg tgaacatgga ggacttgcga 102" gagcagactc acacccgcca ctatgaattg taccgacgct gtaagcttga agagatgggg 108" ttcaaggaca ctgaccctga cagcaaaccc ttcagtcttc aggagacata tgaagcaaaa 114" aggaatgaat tcctgggaga actgcagaag aaagaagaag aaatgagaca aatgtttgtt 120" atgagagtga aggagaaaga agctgaactt aaggaggcag agaaagagct tcacgagaag 126" tttgaccttc taaagcggac agaa gaaaagaaga aagtggaaga caagaagaag 603603 "32" gagcttgagg aggaggtgaa caacttccag aagaagaaag cagcggctca gttactacag "38" tcccaggccc agcaatctgg ggcccagcaa accaagaaag acaaggataa gaaaaatgca "44" agcttcacat aaagcctggc aagccaagga tgttcccgca ttcacctgct tttgcagtaa "50" tatcgtatct ctgccatgtg tgttctttag ttttatttta ttttatttta tttttttacc "56" caaa caccagtaac tattattaac tcgttttgct gaatgttgtt gggtggtaga "62" aaatgataga gaat aaccgcgaat gctctgtgca ctct gtttccggaa "68" agtaaatgat ttgcttttta ttct cagc acgaagcagg cctgttactt "74" gtatgtcgct ttggacagag gaaagtgggg taaaatgcta cctgtacgtc tgacatgaaa "80" acttctcacc gcctcagcag ctgaactaaa aacctgaata gccatgacaa gagtttgcat "86" gatg attcatctcc gcac aatccctgaa ctcactgtct tttctccaca "92" cttgtcctaa gccaaggtag atttgtacgt agacagactg gtgagcaagc attatatttt "98" atttttaccc ttgcatgaca ttttcatttt aatcaataac tggc ctgagcttgt 204" gggtctgttc agactgtctc ctctcatggt ttgaaactgc atgc ctgccttcaa 210" tcctggccaa gttggagtag actggtatga gaaaactatg attagttcac atttactggt 216" gcatccttga tcctctcaca gatagaggtc ttaaaggttg gatcatgtaa cattgcttag 222" tagaagaatc ttcttctaag gggc tttctacagc acca ctaacagtaa 228" ggaatctttc ataaacacac ctcagtttgt tcccagtggg cttagaggga ggacctgatg 234" actgattcca ttgt acttctaata acatttttca tgaatcatga gaaaatttcc 240" acagatactt gaaa atttgctata aactctgtat cattggtagc acaaatttga 246" gcgaggcctt gtcaatttta aggtggaaat aggaaggacc acaacatgac gtca 252" taga catttcatat ccagcttcct tgcttagtct cctttcagta tttggcaata 258" aaagaaagaa gaaatagaac agctgaagtc tcaaatcatt gtctggaatt ttcctcacct 264" tggctagctc cacctgctct ttgtctaagg cccttgcctc atcagggatt agaactggcc 270" catatgccag aacctgtact aaatgcctaa tttgtatgga agagtgcata tttaatctct 276" tttctatact gctcctttct gatgcttatc ctttcatctg tgtgattgtt cctc 282" tactaacaag atcctcccag ctttctctct acatgtagaa acat ttctcatgaa 288" cccactgccc ctctgcattt tcctcactgg ttagagatta agtaaatagg atagaatatg 294" ctgcgtctcc cctgacacac actttctttt ttgaatgagc aagtctccat tttgatttca 300" gcaaagattt tttctccttt tctttgtcct caaccatact aaag aaggaatggt 306" cttccatgaa ctgattatgc ttaattaagc aaagtaagga aattagtttc atggaagcct 604604 312" aaacaaagct ggaatagaaa ctacacacta gcag tagtcatagt cttcacaggt 318" ttaggagcta ctggaccaac attcttgttt ttgcttttgt ttttttaaat aattctagtc 324" taac tgtggagcag ccaaatagta gctggcatgt tgattcaaac catgggctga 330" atttgctcat aggctgtgca tcagacaaaa gcttgaatat ttgtgttgta ttcc 336" aaccaccgct agca tggc ttgtacagaa agtacacttt taaattgtct 342" cttgcatcac taaaattttt ttaaaatgag cataacaacg aaaggcatcc agctgacttt 348" ttgattccaa gattattgat tggattgact catt aaatttttcc cagcaaaata 354" aatcatatgg aggg aataaaaagt caaaagaaac aaatagaagc tttttttttt 360" tgta ttgcttctga acttttttct gccactgctc cctagccctg tttagtttgt 366" tattgctgct tttctttttt ctttctgtat ctatgccttt ttttcacagt agtccttggc 372" tctgcacgga gata ccctcaaatc taattggatg tgctttcgcc tttgcatgta 378" agtacggtag taagaaacct ttgagatctt tctgactttt caaaattaga gaaagcaaat 384" gggatggata gatttttttt ttcttttcaa ggggggcagg aaggtaatgg tttgagtagc 390" ttaa aaaaaagact attt aaaaggccac atttatattt ttttcacaag 396" aaccacataa taaattccac ttcttgacct gaatttggaa atccgaaatt actaatccag 102" gccaggtgtg gtggctcatg cctgtaatcc cagcactttg agaggccgag gtgggcagat 108" cacttgaggc ctggagttca agaccacctt ggcgaacacg gtgaaacccc gtctctacaa 114" aaaatacaaa aattagccag gcgtggtggc acgtgcctgt agtcccagct acttgggagg Z20" ctaagtcagg agaattgctt gaacttggga gatggaggtt gcagtgagcc gcac Z26" cactgcattc caacctgggt gatgaagtga gactctccaa aaaaaaaaaa gaaattatta Z32" atccctgcct gtgctctaca catg ggcatcattg gatagctcag agggcccttg Z38" attctggcaa ggcaaataaa gccagaatga gaaattacca tcttctacta gagaaaacca Z44" aaat ttttatgcta ggatgccttt atgaccactt aattttttaa tcttagttta 150" atggtctctc cctggtgcta actgctgaca gtggccacct cttttttggg gattgagggg 156" cctacataac tagctggcct taccccatat cttttgttca aacataatac catctttttg Z62" cttcttctga actttagatc tccataacac atgtactgta gaatgtgatg gaaaagcatt Z68" aatt tattggcagt tcagattgtg caac tctt tattaattgg Z74" ttaaggtttt ctccaaaaag ggcatttcaa caatgggaat tatttaatgt aacagtgggc Z80" acagattact tatcttcctt ctctgctttg tgactcacca gcagtaacac acacaatcca Z86" catcttgtgc acctcaaatg aacagacttg gtttccttgc tttcttgaca tttccatgac 605605 492" tgtttcacat tatt gggtgaggtt tttcagctgt taccgaccca cgtcctgctg 498" tctctgtgtg gtcctacaaa aactgtccat tcccacccct tgcc atttgcaaga 504" aatt gtcaggtctc agcttcgaaa agtcctggtt ccactgacag gacacattct 510" ttagtgggaa ttaagaccta caaagtctag tttgtatgta ggtatgaagg gaatttttta 516" aataaattga aaagctgtga acagcattag aactttgtct atttcttaat tttaaaatat 522" gctgatatgc ctgt agttgtagat ccttgtcatt ttgctgtttg aaaataacca 528" atgtgttttc taaaactgtc gtgtaatcta ctttcattgt taatgcagaa ttgtcatata 534" tgtaagctgc atgttagaca tttgtctttt ttaaactaaa gtat tgatgtgaag 540" catatcattt tttcaaatat gaaagtgatc acttagcaac atgcttggta catc 546" tgttaaggta ggagagtggt gaacagataa tctatgcata tatcactagt gccaagacat 552" aaagcggggg aaaatatatt tttacccaaa cattaaaaaa aaaaaaaaaa aaaaaaaaaa 558" aa LOCUS NP_060713 ION NP_060713 VERSION NP_060713. 1 GI:8922712 _ mavavgrpsn eelrnlslsg hvgfdslpdq lvnkstsqgf cfnilcvget gigkstlmdt 6" lfntkfesdp athnepgvrl karsyelqes tivd tvgfgdqink ddsykpivey 12" idaqfeaqu eelkikrslf nyhdtrihac lyfiaptghs lksldlvtmk kldskvniip 18" iiakadtiak nelhkfkski mselvsnqu iyqutdeet tmsv hlpfavvgst 24" eevkignkma karqypwgvv hcdf vklremlirv nmedlreqth trhyelyrrc " kleemgfkdt dpdskpfslq etyeakrnef lgelqkkeee mrquvmrvk keae 36" kelhekfdll krthqeekkk vedkkkelee evnnqukka aaqllqsqaq qsgaqqtkkd 42" kdkknasft Official Symbol: SEPT7 and Name: septin 7 Other Aliases: CDC10, CDC3, NBLA02942, SEPT7A Other Designations: CDC10 (cell division cycle 10, S. cereVisiae, homolog); CDC10 n homolog; septin-7 LOCUS NM_001011553 ACCESSION NM_001011553 606606 VERSION NM_001011553.3 GI:339639595 gagatggaag ccagcctccg ctaggcccgg aagcctcgtc tgagggggcg ggggacggag 6" gagggagcgg gagtcgagcg tgtg gaggagtccg cctgctgtag cgtgcgtaag l2" caaggcagct anCnggCg gctacgctgc ggaatcggcg taggcgcctt atcg 18" gcgggctgcg ctccgctggg gctggtcgcg ggga ggggatgtcg gtcagtgcga 24" gatccgctgc tgctgaggag aggagcgtca acagcagcac catggctcaa cagaagaacc " ttgaaggcta tgtgggattt gccaatctcc aagt atacagaaaa tcggtgaaga 36" gaggttttga attcacgctt atggtagtgg gtgaatctgg attgggaaag tcgacattaa 42" tcaactcatt attcctcaca tatt agta tccaggtcct tctcatagaa 48" ttaaaaagac tgtacaggtg gaacaatcca aagttttaat aggt ggtgttcagt 54" tgctgctcac aatagttgat accccaggat ttggagatgc agtggataat agtaattgct 60" ggcagcctgt tatcgactac attgatagta aatttgagga ctacctaaat gcagaatcac 66" gagtgaacag acgtcagatg cctgataaca gggtgcagtg ttgtttatac gctc 72" cttcaggaca tggacttaaa gata ttgagtttat gaagcgtttg catgaaaaag 78" tgaatatcat cccacttatt gccaaagcag acacactcac accagaggaa tgccaacagt 84" ttaaaaaaca gataatgaaa gaaatccaag aacataaaat taaaatatac gaatttccag 90" aaacagatga tgaagaagaa aataaacttg ttaaaaagat aaaggaccgt ttacctcttg 96" ctgtggtagg tagtaatact gaag gcaa aagggtcaga ggaaggcagt "O2" atccttgggg tgttgctgaa aatg gtgaacattg tgattttaca atcctaagaa "08" atatgttgat aagaacacac atgcaggact tgaaagatgt tactaataat gtccactatg "l4" agaactacag aagcagaaaa cttgcagctg tgacttataa tggagttgat aacaacaaga "20" ataaagggca gctgactaag agccctctgg cacaaatgga agaagaaaga agggagcatg "26" tagctaaaat gaagaagatg gagatggaga tggagcaggt gtttgagatg aaggtcaaag "32" aaaaagttca aaaactgaag gactctgaag ctgagctcca gcggcgccat gagcaaatga "38" aaaagaattt ggaagcacag cacaaagaat aaaa acgtcgtcag ttcgaggatg "44" agaaagcaaa ctgggaagct caacaacgta ttttagaaca acagaactct tcaagaacct "50" tggaaaagaa gaaa atct tttaaactct ctattgacca ccagttaacg "56" tattagttgc caatatgcca acat cagtgtttgt tggatccgtt tgaccaattt "62" gcaccagttt tatccataat gatggattta tgac aaaaattatt tttttttttg "68" ttcttgatgg agattaagat gccttgaatt gtctagggtg ttctgtactt agaaagtaag 607607 174" agctctaagt acctttccta cattttcttt ttttattaaa cagatatctt cagtttaatg 180" caagagaaca ttttactgtt gtacaatcat gttctggtgg tttgattgtt atat 186" tccaaaataa aaggactctg gaagattttc attgaggata aattgccata atatgatgca 192" aactgtgctt ctctatgata attacaatac aaaggttcca ttcagtgcag caat 198" aatgtaattt agtctaacac agttgaccct attttttgac acttccattg tttaaaaata 204" cacatggaaa aaaaaaaaac cctatatgct gcac ctagagcttt tttataacaa 210" tttg tttgtttgtt ttggattctt taaatatata ttattctcat ttagtgccct 216" ctttagccag aatctcatta ctgcttcatt tttgtaataa catttaattt agatattttc 222" catatattgg cactgctaaa atagaatata gcatctttca tatggtagga accaacaagg 228" tcct ttaactccct ttttacactt tatggtaagt agcagggggg gaaatgcatt 234" tcat ttctaggcaa aattgtgaag ctaatgacca acctgtttct acctatatgc 240" agtctcttta ttttactaga aatgggaatc atggcctctt gaagagaaaa aagtcaccat 246" ttta gctgtattca tatattgcat ttctgtattt tttgtttgta ttgtaaaaaa 252" ttcacataat aaacgatgtt gtgatgtaat attgtgtgag gtcttaaata agtc 258" gatgtacaag agtagagtat gtttgggaag ttca gcttaagttt gcctcctcta 264" caatgacatc ttttatatgc ttgtctcatt tagaatgcat atgtgctgat attt 270" aagagatacc atatctctct attcatttct atctctcatt tgtatgctta tttttctgag 276" aacatttttt ttttccccca gacagggtct tgcttcattg cccaggctgg agtgcggtgg 282" cacaaacacg acttgactgc agcctcaacc ctctgggctc aagcagtcct cctgcctcag 288" ccccctgagt atctgggatt gcaggcgtgc accaccacgc ctggctaatt tttgtatttt 294" ttgcagcctc ccaaagttct gggattacag gcatgagccg tcatgcctgg cctctgagaa 300" cagtttctga ctcattcaga ttaggtatac tctcaagtcc ctggaaactg aaattttttt 306" taactgtaaa gagggtagtg tcatttcttt tcttaaggtc aagtgacata gattttaatg 312" taatgcataa tttaggtaag aaattaatta atgtagccta gtttattatc tgtt 318" ttaccctatt tactttttaa aattaatgac ctaagcggag atta taagtcaata 324" agat tgttgtttgg attt ttttcagttt tgag agattgggtt 330" aacacctcta gccaaaattg tttggtttta gggaggctaa caataaccta ttgg 336" aaaatgcaaa ggtaaaaaat gtatatagac tgcctgctga actggttaag tactactgct 342" tctgggaaat tcaa aattctatgt attataataa taaatttgta agacattcat 348" tattctacca tgaa aactttcaga agtctttctt tatccatggc atgcccaggg 608608 354" ttttacctga atctgataca ggatctatat aactttacta ggacttttga ttgttgactc 360" caggcttagg tatatcagaa tttt gccatttggc ctgtggatgt agat 366" cattcacaat acatgtaaaa ttcaggtagg cctaaggaaa ggccagcctg tagaaagcaa 372" aatggcagtg tctgttctcc ggag gcattatgta atttaagtat cctgttagcc 378" actgtctttc tgctaattaa gtggggctga acaagtaagc actaataata ccagtgaacc 384" acttgggcac ggta gagttttgct gccacctagt ggaatgggat gctt 390" ccatatcagg ttcacaagca agttaagtgg gcacagttta tttctgtgta gctcaggctg 396" taatcttgaa agctgaggag atacccatgc ctctcagact cattagctgg gtgtcacatt 402" accacctgca cattctgacc caccgcatct taatatgttt tgtcctcttg gagaaactag 408" aagt caggatatgg taggtaaggg ggaaaaagga aagacggctt gatagctatg 414" aatgcatgag aatg ttgactcagt tatctagatc atggtctcca atgc 420" tatttcctta caaaaatatt tgttgagcat gtgtccataa gtat atga 426" aaatatgtgt caacaaatgt actgctacac taatgtgaac attatggaac aaaatttgaa 432" agagtgaaat aaaaggttta cactttcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa LOCUS NP_001011553 ACCESSION NP_001011553 _ msvsarsaaa eersvnsstm aqqknlegyv gfanlpnqu rksvkrgfef tlmvvgesgl 6" gkstlinslf ltdlyspeyp gpshrikktv qveqskvlik egqulllti vdtpgfgdav 12" dnsncwqpvi dyidskfedy lnaesrvnrr qmpdnrvqcc lyfiapsghg lkpldiefmk 18" rlhekvniip liakadtltp eecqqfkkqi mkeiqehkik iyefpetdde eenklvkkik 24" drlplavvgs ntiievngkr vrgrqypwgv aevengehcd ftilrnmlir thqulkdvt " nnvhyenyrs rklaavtyng vdnnknkgql qmee errehvakmk kmememeqvf 36" emkvkekvqk lkdseaelqr rhequknle aqhkeleekr rqfedekanw eaqqrileqq 42" nssrtleknk kkgkif al Symbol: SH3BGRL and Name: SH3 domain binding glutamic acid-rich protein like [Homo sapiens] Other Aliases: SH3BGR 609609 Other Designations: SH3 domain-binding glutamic acid-rich-like protein; SH3-binding domain glutamic ich protein like LOCUS NM_003022 2090 bp mRNA linear PR1 27-JUN-2012 ACCESSION NM_003022 VERSION NM_003022.2 GI:21 1938420 ttttagtctc cgcgtgaaaa ggtccttcat gctaatctaa ttcaattcag ttctcctttt 6" tctttcttct tctcctcgcc ctcttctcag ggaaggaatc gtcaaaaatc aatgtttcaa 12" cagatctcgc ggtgctattc gagattccct attaaaaaaa aatagaattg atgcaaacag 18" cctgttcctt ccggggtttt gggctggaac tgcagcgctt agagagctcg gtggaagctg 24" ctaaaggcgg aggcggggct ctggcgagtt ctccttccac cttcccccac ccttctctgc " caaccgctgt ttcagcccct agctggattc cagccattgc tgcagctgct ccacagccct 36" tttcaggacc caaacaaccg cagccgctgt gatg gtgatccgtg tatatattgc 42" atcttcctct ggctctacag cgattaagaa gaaacaacaa gatgtgcttg gtttcctaga 48" caaa ataggatttg aagaaaaaga tattgcagcc aatgaagaga atcggaagtg 54" gatgagagaa aatgtacctg aaaatagtcg accagccaca ggttaccccc tgccacctca 60" gattttcaat gaaagccagt atcgcgggga ctatgatgcc ttctttgaag ccagagaaaa 66" agtg tatgccttct taggcttgac agccccacct ggttcaaagg aagcagaagt 72" gcaagcaaag cagcaagcat gaaccttaag cactgtgctt taagcatcct gaaaaatgag 78" tctccattgc ttttataaaa tagcagaatt agctttgctt caaaagaaat aatg 84" ttgaaataat agattagttg caca tgcaaacatt gaat ttaa 90" aatttgaaca tgat tatggtgagg agaatgggat attaacataa atta 96" ataagtagat atcgtagaaa tagtgttgtt acctgccaag tgta tacaccaatg "02" attttacaaa gaaaacaccc ttccctcctt ctgccattac tatggcaact taagtgtatc "08" tcta cattaaaaag gagaaagaga aataacctgt ctctcattcc taagttgcct "l4" tttt catgaacaag aatatgtacc tttttgatgc tatattactg cgattaaaaa "20" gttcttgcag gtaatgttta gtta aacgttgtaa tttcctatcg taattataac "26" attcccattc ttttgtagat gaaacttcta catattgaac tttt ctgagcttct "32" aaatgtagcc tttcattgca catttcagtg atcagaatag atatcctttt acacgcacaa "38" aagcaataga ttcattcagt ggacaagttc cttgtttaac tacacagcta tgatggaatg "44" atatatccaa gttccttgcc tcagtgaaat atgcatatgt atatcatgaa agtgggatgc 610610 "50" caagtaagct ggca ttctctagca aagagattag acttttaaat aactcttata "56" aaacaggttg gcgatcattt cccaagattg gtttcccttg agtttttgct aatc "62" ttagtagttt tgcccgttta aaacaactca caatcgtaaa tgctactatt cctaagatat "68" cttacctttt tatttcagtt tagccatgta ttgtatgagt gtattagtct aagcagtgag "74" aatcttttct atgcctctat tccagcaaaa agtagaagta tcaaataaaa agggcaactt "80" tatt aagcctgaag acttctaaaa agacaagaaa catggcctaa ataaccaaca "86" tagatttaca tagtaagttt cacactacct tattaccaaa agcaaacacc tcttacttta "92" aactacatta tcatgtatat ctattgtatg ctggtcttta ctttttgcca aaatcaacat "98" ataatgaaga gatgcctttg tttcatgaga ttcaaacttg atgctatgct ttaaaataaa 204" ctcagtactt ttagaaacat aaaaaaaaaa aaaaaaaaaa aaaa LOCUS NP_003013 ACCESSION NP_003013 VERSION 013.1 GI:4506925 1 mvirvyiass sgstaikkkq qdvlgflean kigfeekdia aneenrkwmr envpensrpa 61 tgyplppqif nesqyrgdyd affearenna ltap pgskeaevqa qua Official Symbol: SNRPB and Name: small nuclear ribonucleoprotein polypeptides B and B1 Other Aliases: COD, SNRPBl, Sm-B/B', SmB/B', SmB/SmB', snRNP-B Other Designations: B polypeptide of Sm protein; Sm protein B/B'; sm-B/Sm-B'; small nuclear cleoprotein polypeptide B; small r ribonucleoprotein polypeptides B and B'; small nuclear ribonucleoprotein-associated proteins B and B' LOCUS NM_003091 ACCESSION NM_003091 VERSION NM_003091.3 GI:38149990 1 aactccaggg ctagtgagct ggaccggaag tcta cccgaccgca ttttacgtgg 61 attt ccggtagcgg cggcgggaaa tcggctgtgg gagagaggct aggcctctga 121 ggaggcgaat ccggcgggta tcagagccat cagaaccgcc accatgacgg tgggcaagag 181 cagcaagatg ctgcagcata ttgattacag gatgaggtgc atcctgcagg acggccggat 241 tggc accttcaagg cttttgacaa gcacatgaat ctct gtgactgtga 611611 " tgagttcaga aagc caaagaactc caaacaagca gaaagggaag agaagcgagt 36" tctg ctgc gaggggagaa tctggtctca atgacagtag agggacctcc 42" tcccaaagat actggtattg ttcc acttgctgga gctgccgggg gcccagggat 48" ngcagggct gctggcagag gaatcccagc tggggttccc atgccccagg ctcctgcagg 54" acttgctggg ccagtccgtg gggttggcgg gccatcccaa caggtgatga ccccacaagg 60" aagaggtact gttgcagccg ctgcagctgc tgccacagcc agtattgccg gggctccaac 66" ccagtaccca cctggccgtg ggggtcctcc cccacctatg ggccgaggag caccccctcc 72" aggcatgatg ggcccacctc ctggtatgag acctcctatg ggtcccccaa tccc 78" ccctggaaga gggactccaa tgggcatgcc ccctccggga atgcggcctc ctccccctgg 84" gatgcgaggc cttctttgac ccttggccac agagtatgga agtagctccg cagaggcgtg 90" ggctcgattc ctcagggcca cgttaccaca gacctgtttg tttcttatgc tgttgttcgt 96" tcat gggattgtct ggtttccctt acagggcccc ctcccccggg aatgcgccca 102" ccaaggccct agactcatct tggccctcct cagctccctg cctgtttccc gtaaggctgt 108" acatagtcct tttatctcct tgtggcctat gaaactggtt tataataaac tcttaagaga 114" acattataat tgc LOCUS NP_003082 231 aa linear PRI 27-JUN-2012 ACCESSION NP_003082 VERSION 082.1 GI:4507125 1 mtvgksskml qhidyrmrci lngrifigt fkafdkhmnl ilcdcdefrk kqae 61 reekrvlglv llrgenlvsm tvegpppkdt giarvplaga aggpgigraa grgipangm 121 pqapaglagp vrgvggpsqq vmtpqgrgtv aaaaaaatas iagaptqypp grggppppmg 181 rgapppgmmg pppgmrppmg ppmgippgrg tpmgmpppgm rppppgmrgl 1 Official Symbol: SODl and Name: superoxide dismutase l, soluble Other Aliases: ALS, ALSl, IPOA, SOD, hSodl, mer Other Designations: Cu/Zn superoxide dismutase; SOD, soluble; indophenoloxidase A; superoxide dismutase ]; superoxide ase, cystolic LOCUS 454 ACCESSION NM_000454 VERSION NM_000454.4 GI:48762945 612612 1 gtttggggcc agagtgggcg aggcchgag gtctggccta taaagtagtc gcggagacgg 61 ggtgctggtt tgcgtcgtag tctcctgcag cgtctggggt ttccgttgca gtcctcggaa 121 ccaggacctc ggcgtggcct agcgagttat ggcgacgaag gccgtgtgcg tgctgaaggg 181 cgacggccca gtgcagggca tcatcaattt cgagcagaag gaaagtaatg gaccagtgaa 2 4 1 ggtgtgggga agcattaaag gactgactga aggcctgcat ggattccatg agtt 301 tggagataat ggct gtaccagtgc aggtcctcac tttaatcctc tatccagaaa 361 tggg ccaaaggatg aagagaggca tgttggagac ttgggcaatg ctga 421 caaagatggt gatg tgtctattga agattctgtg atctcactct caggagacca 481 ttgcatcatt acac tggtggtcca tgaaaaagca gatgacttgg gcaaaggtgg 541 agaa agtacaaaga caggaaacgc tggaagtcgt ttggcttgtg ttgg 601 gatcgcccaa taaacattcc cttggatgta gtctgaggcc ccttaactca atcc 661 ctgt agaaatgtat cctgataaac attaaacact gtaatcttaa aagtgtaatt 721 gtgtgacttt ttcagagttg ctttaaagta cctgtagtga gaaactgatt tatgatcact 781 tggaagattt gtatagtttt ataaaactca gttaaaatgt caat gacctgtatt 841 ttgccagact taaatcacag atggqtatta aacttgtcag aatttctttg tcattcaagc 901 ctgtgaataa aaaccctgta tggcacttat tatgaggcta ttaaaagaat ccaaattcaa 961 actaaaaaaa aaaaaaaaaa a LOCUS NP_000445 ACCESSION NP_000445 VERSION NP_000445.1 GI:4507149 1 matkavcvlk gdgpvqgiin feqkesngpv kvwgsikglt eglhgfhvhe fgdntagcts 61 agphfnplsr khggpkdeer hvgdlgnvta dkdgvadvsi edsvislsgd hciigrtlvv 121 hekaddlgkg gneestktgn agsrlacgvi giaq KARS Official Symbol: KARS Official Name: tRNA synthetase Gene ID: 3735 Organism: Homo sapiens 613613 Other Aliases: CMTRIB, KARS2, KRS esQnafionx WSRS;WSMetRNAIbaseHySMeuIRNAlbase Nucleotide seguence: NCBI nce Seguence: NM_001130089.1 LOCUS: NM_001130089 ACCESSION : NM_001130089 l aaattaacgt actatcctcc ttgg gtcgggccct ccgggaagat ggcggccgtg 6" caggcggccg aggtgaaagt ggatggcagc gagccgaaac tgagcaagaa gtaa 12" tcattagttc cagggtgctc tgccatgttg acgcaagctg ctgtaaggct tgttaggggg 18" tccctgcgca aaacctcctg ggcagagtgg ggtcacaggg aactgcgact gggtcaactt 24" gctcctttca cagcgcctca caaggacaag tcattttctg atcaaagaag tgagctgaag " agacgcctga aagctgagaa gaaagtagca gagaaggagg ccaaacagaa agagctcagt 36" gagaaacagc taagccaagc cactgctgct gccaccaacc acaccactga taatggtgtg 42" ggtcctgagg aagagagcgt ggacccaaat caatactaca gcag tcaagcaatt 48" catcagctga aggtcaatgg ggaagaccca tacccacaca agttccatgt agacatctca 54" ctcactgact tcatccaaaa atatagtcac cctg gggatcacct gactgacatc 60" accttaaagg gtag gatccatgcc aaaagagctt ctgggggaaa gctcatcttc 66" tatgatcttc gaggagaggg ggtgaagttg caagtcatgg ccaattccag aaattataaa 72" tcagaagaag aatttattca tattaataac aaactgcgtc acat aattggagtt 78" caggggaatc ctggtaaaac gggt gagctgagca tcattccgta tgagatcaca 84" ctgctgtctc cctgtttgca acct catcttcact ttggcctcaa agacaaggaa 90" acaaggtatc gccagagata cttggacttg atcctgaatg actttgtgag gcagaaattt 96" cgct ctaagatcat cacatatata agaagtttct tagatgagct gagattgaaa ctcccatgat gaacatcatc ccagggggag ccgtggccaa acttatcaca acgagctgga catgaactta tatatgagaa ttgctccaga actctatcat 114" aagatgcttg tggttggtgg catcgaccgg gtttatgaaa ttggacgcca gttccggaat 120" attg atttgacgca caatcctgag ttcaccacct gtgagttcta catggcctat 126" gcagactatc acgatctcat cacg gagaagatgg tttcagggat ggtgaagcat 614614 "32" attacaggca gttacaaggt cacctaccac ccagatggcc cagagggcca agcctacgat "38" gttgacttca ccccaccctt ccggcgaatc gtag aagagcttga gaaagccctg "44" aagc tgccagaaac gaacctcttt gaaactgaag aaactcgcaa tgat "50" gatatctgtg tggcaaaagc tgttgaatgc cctccacctc ggaccacagc caggctcctt "56" gacaagcttg ttggggagtt cctggaagtg acttgcatca atcctacatt catctgtgat "62" cacccacaga gccc tttggctaaa tggcaccgct ctaaagaggg tctgactgag "68" cgctttgagc tgtttgtcat gaagaaagag atatgcaatg cgtatactga gctgaatgat "74" cccatgcggc agcggcagct ttttgaagaa caggccaagg ccaaggctgc aggtgatgat "80" gaggccatgt tcatagatga aaacttctgt actgccctgg aatatgggct gccccccaca "86" gctggctggg gcatgggcat tgatcgagtc gccatgtttc tcacggactc catc "92" aaggaagtac ttctgtttcc tgccatgaaa cccgaagaca agaaggagaa tgtagcaacc "98" actgatacac tggaaagcac aacagttggc acttctgtct agaaaataat aagt 204" tgtataactc aggcgtcttt gcatttctgc gaaagatcaa ggtctgcaag cttg 210" tgtgctgctt tccatttgac gttc tgttcagcca tcagaagaga gacaaggaat 216" taaaaatttc tttttaatcc tgttaccaaa taaaaaa Protein seguence: NCBI Reference Seguence: NP_001123561.1 LOCUS: NP_001123561 ACCESSION: N P_001123561 l mltqaavrlv rgslrktswa ewghrelrlg qlapftaphk dksfsdqrse lkrrlkaekk 6" vaekeaque lsekqlsqat aaatnhttdn esvd pnqyykirsq aihqlkvnge 12" dpyphkfhvd isltdfiqky shlqudhlt ditlkvagri hakrasggkl ifydlrgegv 18" quvmansrn ykseeefihi nnklrrgdii qugnpgktk kgelsiipye itllspclhm 24" lphlhfglkd ketryrqryl dlilndfvrq kfiirskiit yirsfldelg fleietpmmn " vakp fityhneldm nlymriapel yhkmlvvggi grqf rnegidlthn 36" pefttcefym ayadyhdlme itekmvsgmv khitgsykvt yhpdgpegqa ydvdftppfr 42" elek algmklpetn lfeteetrki lddicvakav ecppprttar lldklvgefl 615615 481 evtcinptfi cdhpqimspl akwhrskegl terfelfvmk keicnaytel rqlf 541 eeqakakaag ddeamfiden fctaleyglp ptagwgmgid rvamfltdsn lfpa 601 kenv attdtlestt vgtsv KIFSB Official Symbol: KIFSB Official Name: kinesin family member SB m: 3799 sm: Homo sapiens Other Aliases: KINH, KNS, KNS1, UKHC Other Designations: conventional kinesin heavy chain; kinesin 1 (110-120kD); kinesin heavy chain; kinesin-1 heavy chain; ubiquitous kinesin heavy chain Nucleotide sguence: NCBI Reference Sequence: NM_0045212 LOCUS: NM_004521 ACCESSION : 521 1 ctcctcccgc accgccctgt cgcccaacgg cggcctcagg agtgatcggg cagcagtcgq 61 ccggccagcg gacggcagag cgggcggacg ggtaggcccg gcctgctctt cgcgaggagg 121 aagaaggtgg tccc caga acctccccag cccccgcagt ccgcccagac 181 cgtaaagggg gacgctgagg ggac gctctccccg gtgccgccgc cgctgccgcc 241 gccatggctg ccatgatgga tcggaagtga gcattagggt taacggctgc cggcgccggc 301 tcttcaagtc ccggctcccc ggccgcctcc acccgggqaa gcqcagcgcg gcgcagctga 361 ctgctgcctc tcacggccct cgcgaccaca agccctcagg tccggcgcgt tccctgcaag 421 actgagcggc gggqagtggc tcccggccgc cggc tgcgagaaag atggcgqacc 481 tggccgagtg caacatcaaa gtgatgtgtc gcttcagacc tctcaacgag tctgaagtga 541 gcga caagtacatc gccaagtttc agggagaaga cacggtcgtg atcgcgtcca 601 agccttatgc atttgatcgg gtgttccagt caagcacatc tcaagagcaa gtgtataatg 661 actgtgcaaa gaagattgtt aaagatgtac ttgaaggata taatggaaca atatttgcat 616616 72" atggacaaac atcctctggg aagacacaca caatggaggg taaacttcat gatccagaag 78" gcatgggaat tattccaaga atagtgcaag atatttttaa ttatatttac tccatggatg 84" aaaatttgga atttcatatt aaggtttcat attttgaaat atatttggat aagataaggg 90" acctgttaga tgtttcaaag accaaccttt cagttcatga agacaaaaac cgagttccct 96" atgtaaaggg gtgcacagag gtat caga tgaagttatg gataccatag "02" atgaaggaaa atccaacaga gcag ttacaaatat gaatgaacat agctctagga "08" gtcacagtat atttcttatt aaac aagagaacac acaaacggaa ctga "14" gtggaaaact ttatctggtt gctg gtagtgaaaa ggttagtaaa actggagctg "20" aaggtgctgt tgaa gC :aaaaaca tcaacaagtc actttctgct cttggaaatg "26" ttatttctgc tttggctgag gg:agtacat atgttccata tcgagatagt aaaatgacaa "32" gaatccttca agattcatta gg:ggcaact gtagaaccac tattgtaatt tgctgctctc "38" catcatcata caatgagtct gaaacaaaat ctacactctt atttggccaa agggccaaaa "44" caattaagaa cacagtttgt gtcaatgtgg agttaactgc agaacagtgg aagt "50" atgaaaaaga aaaagaaaaa aataagatcc tgcggaacac tattcagtgg cttgaaaatg "56" agctcaacag taat ggggagacgg tgcctattga tgaacagttt gacaaagaga "62" aagccaactt ggaagctttc acagtggata aagatattac caat gataaaccag "68" caaccgcaat tggagttata ggaaatttta ctgatgctga aagaagaaag tgtgaagaag "74" aaattgctaa attatacaaa cagcttgatg acaaggatga taac cagcaaagtc "80" aactggtaga gaaactgaag acgcaaatgt tggatcagga ggagcttttg acca "86" gaagggatca agacaatatg caagctgagc tgaatcgcct agaa gcct "92" aaga agtgaaagaa gttttacagg ccctagaaga acttgctgtc aattatgatc "98" agaagtctca ggaagttgaa gacaaaacta aggaatatga attgcttagt gatgaattga 204" atcagaaatc ggcaacttta gcgagtatag atgctgagct tcagaaactt aaggaaatga 210" ccaaccacca gaaaaaacga gcagctgaga catc tttactaaaa gaccttgcag 216" aaataggaat tgctgtggga aataatgatg taaagcagcc tgagggaact ggcatgatag 222" atgaagagtt cactgttgca agactctaca ttagcaaaat gaagtcagaa gtaaaaacca 228" tggtgaaacg ttgcaagcag ttagaaagca cacaaactga caaa aaaatggaag 234" aaaatgaaaa ggagttagca gcatgtcagc ttcgtatctc tcaacatgaa atca 240" agtcattgac tgaatacctt caaaatgtgg aacaaaagaa aagacagttg gaggaatctg 246" tcgatgccct cagtgaagaa ctagtccagc ttcgagcaca agagaaagtc catgaaatgg 617617 252" agca taag gttcagactg caaatgaagt taagcaagct gttgaacagc 258" agatccagag ccatagagaa actcatcaaa aacagatcag tagtttgaga gatgaagtag 264" aagcaaaagc aaaacttatt cttc aagaccaaaa ccagaaaatg gagc 270" aggaacgtct aagagtagaa catgagaagt tgaaagccac ggaa aagagcagaa 276" aactacatga acttacggtt atgcaagata gacgagaaca agcaagacaa gacttgaagg 282" gtttggaaga gacagtggca aaagaacttc agactttaca caacctgcgc aaactctttg 288" ttcaggacct ggctacaaga gttaaaaaga gtgctgagat tgattctgat gacaccggag 294" gcagcgctgc tcagaagcaa tcct ttcttgaaaa taatcttgaa cagctcacta 300" acaa acagttggta cgtgataatg cagatctccg ctgtgaactt cctaagttgg 306" aaaagcgact taca gctgagagag tgaaagcttt ggaatcagca ctgaaagaag 312" ctaaagaaaa tcgt gatcgcaaac gctatcagca agaagtagat cgcataaagg 318" aagcagtcag gtcaaagaat atggccagaa gagggcattc tgcacagatt gctaaaccta 324" ttcgtcccgg gcaacatcca gcagcttctc caactcaccc aagtgcaatt Cgtggaggag 330" gtgcatttgt tcagaacagc cagccagtgg cagtgcgagg tggaggaggc gtgt 336" aatcgtttat acatacccac aggtgttaaa aagtaatcga agtacgaaga ggacatggta 342" tcaagcagtc attcaatgac tataacctct actcccttgg agaa ttataacttt 348" taaaaaaaat gtataaatta tacctggcct gtacagctgt ttcctaccta ctcttcttgt 354" aaactctgct gcttcccaac acaactagag tgcaattttg gcatcttagg agggaaaaag 360" gacagtttac ggcc ctatttatta cacagtttgt ctatcgtgtc ttaaatttag 366" tctttactgt gccaagctaa ctgtacctta taggactgta ctttttgtat tttttgtgta 372" tgtttatttt ttaatctcag tttaaattac ctagctgcta ctgcttcttg tttttctttt 378" cctattaaaa cgtcttcctt tttttttctt aagagaaaat ggaacattta ggttaaatgt 384" ctttaaattt taccacttaa taca taaa atatatccag tcagtactgt 390" aaat cccttgaaat gatgatatca gggttaaaat tacttgtatt gtttctgaag 396" tttgctcctg aaaactactg cact gaaacgttac aaatgcctaa taggcatttg 402" agca aggctacttg catg aaatgcctgt tgccgagtta ttttgaatag 408" aaatatttta aagtatcaaa agcagatctt agtttaaggg agtttggaaa aggaattata 414" tttctctttt tcctgattct gtactcaaca agtcttgatg gaattaaaat actctgcttt 420" attctggtga gcctgctagc taatataagt attggacagg taataatttg tcatctttaa 426" tattagtaaa atgaattaag atattatagg attaaacata attttatacg gttagtactt 618618 132" tattggccga cctaaattta tagcgtgtgg agaa agaa caga 138" atga attaaaaata tatataggtc aattttggtc tgaaatccct gaggtgtttt Z44" taacctgcta cactaatttg tacactaatt ttta gtctagaaat agtaaattgt 150" ttgcaagtca ctaataatca ttagataaat tattttcttg gccatagccg ataattttgt 156" aatcagtact aagtgtatac gtatttttgc cactttttcc tcagatgatt aaagtaagtc Z62" aacagcttat tttaggaaac tgtaaaagta atagggaaag agatttcact atttgcttca Z68" tcagtggtag gggggcggtg actgcaactg tgttagcaga aattcacaga ttaaggttag cagagaaact tggaaagttc tgtgttagga tcttgctggc ttttgcaaaa gttttataca cagatatttg tattaaattt ggagccatag cagatcataa ttggcttatt tttctatttc cgtaactatt gtaatttcca aattttgatt taaaatataa atttatttat ttattttttt aatagtcaaa gttgtagtct gcaacctcta aaatgattgt ttta ggattgatca gaagaaacac 504" tccaaaaatt gagatgaaat gttggtgcag ccagttataa gtaatatagt taacaagcaa 510" aaaaagtgct gccacctttt atgatgattt tctaaatgga gaaacatttg gc:gcatcca 516" catagacctt ttgt ttga aaacttgcct cctttggcaa ca:tcgtaaa 522" tgaagcagaa tttttttttc tcttttttcc aaatatgtta gttttgttct tg:aagatgt 528" atcatgggta ttggtgctgt gtaatgaaca acgaatttta atgt gg:tcagaat 534" atacaatgtt aggtttttaa aaagtatctt gatggttctt ttctatttat aa:ttcagac 540" tttcataaag tgtaccaaga atttcataaa tttgttttca gtgaactgct tt :tgctatg 546" gtaggtcatt aaacacagca cttactctta aaaatgaaaa tttctgatca atat 552" tgacacattt caatttgcag tgtctttttg actggatata ttaacgttcc tc:gaatggc 558" attgatagat ggttcagaag agaaactcaa tgaaataaag ttta gcga 564" aaat ccta acttaagaaa gtgc gtaactctca gtttgtgctt 570" aactccattt gacatgaggt gacagaagag agtctgagtc tacctgtgga atatgttggt 576" ttattttcag tgcttgaaga tacattcaca aatacttggt ttgggaagac accgtttaat 582" tttaagttaa cttgcatgtt gtaaatgcgt tttatgttta aataaagagg aaaatttttt 588" gaaatgtaaa aaaaaaaaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_004512.1 619619 LOCUS: NP_004512 ACCESSION: N P_004512 l madlaecnik vmcrfrplne sevnrgdkyi aqugedtvv iaskpyafdr qusstsqeq 6" vyndcakkiv kdvlegyngt ifaygqtssg kthtmegklh dpegmgiipr iquifnyiy l2" smdenlefhi kvsyfeiyld kirdlldvsk tnlsvhedkn rvpyvkgcte rfvcspdevm 18" dtidegksnr hvavtnmneh ssrshsifli nvkqentqte lylv dlagsekvsk 24" tgaegavlde akninkslsa lgnvisalae gstyvpyrds kmtrilqul ggncrttivi " ccspssynes etkstllfgq raktikntvc vnveltaeqw kkkyekekek nkilrntiqw 36" lenelnrwrn getvpideqf dkekanleaf tltn dkpataigvi gnftdaerrk 42" ceeeiaklyk qlddkdeein qqsqlveklk tqmldqeell astrrdqdnm qaelnrlqae 48" ndaskeevke vlqaleelav nyqusqeve dktkeyells delnqksatl asidaelqkl 54" qkkr aaemmasllk dlaeigiavg nndvkqpegt gmideeftva rlyiskmkse 60" vktmvkrckq esnk kmeenekela acqlrisqhe akikslteyl qnveqkqul 66" eesvdalsee lvqlraqekv hemekehlnk vqtanevkqa veqqiqshre thqkqiser 72" akli tdlqdqnqkm mleqerlrve tdqe ksrklheltv 78" dlkgleetva hnlr klqudlatr vkksaeidsd dtggsaaqkq kisflennLe 84" qltkvhkqlv rdnadlrcel pklekrlrat aervkalesa lkeakenasr drkryqqevd 90" rikeavrskn saqi gth aaspthpsai rgggaquns gggg KPNA3 Official Sym bol: KPNA3 Official Name: karyopherin alpha 3 (importin alpha 4) m: 3839 Organism: Homo sapiens Other Aliases: RP11-432M24.3, IPOA4, SRP1, SRP1gamma, SRP4, hSRP1 Other Designations: SRP1-gamma; importin alpha 4; importin alpha Q2; importin alpha-3; importin subunit alpha-3; importin-alpha-Q2; karyopherin subunit alpha-3; qip2 Nucleotide seguence: 620620 NCBI nce Seguence: NM_002267.3 LOCUS: NM_002267 ACCESSION : NM_002267 l gccccgcgcc tgaggggcag taaaagtcgc caggtccggc tccatttctg gcacaaaact 6" tgcagcaccg aggggttgtg gagagccctt gcaggggaag agggcagggt catcccgaga 12" gggc acgtatagcc cggcgaacgc ccaagccggt caccgccccc ggtcacgtgt 18" cgccagcctc cgcg cgccgctctc agcaccgttc ccgccccacc cggcccggca 24" gtcggcccgc gcctcccccg gcgctactgc gcgc tcggaggcgt cacagaacgt " gctcttctct cccctccccc ctcccgctct cctc cccctcccgc tccaagattc 36" gccgccgccg ccgccgcagc cgcaggagta gccgccgccg gagccgcgcg cagccatggc 42" cgagaacccc agcttggaga accaccgcat caagagcttc aagg gccgcgatgt 48" ggaaacaatg cgaagacata gaaatgaagt gacagtggaa ctgcggaaga acaaaagaga 54" tgaacactta ttgaaaaaga gaaatgttcc ccaagaagaa gaag attcagatgt 60" tgatgctgat tttaaagcac aaaatgtaac cctagaagct atattgcaga atgccacaag 66" tgataaccca gtggtccaat tgagtgctgt ccaggcagca agaaaactgt tatccagtga 72" cagaaatcca ccgattgatg acttaataaa atctgggatt ttaccaattc tagtcaaatg 78" tctagaaagg gatgataatc cttcattaca gtttgaagct gcttgggcat taactaacat 84" agga acttctgcac agactcaagc tgttgtgcag tctaatgcag tacctctttt 90" tctgagactt cttcgttcac agaa tgtttgtgaa caagcagtat gggctttggg 96" aaacattata ggtgatggtc ctcaatgtag agattatgtc atatcactgg tcaa "02" acctcttctg tccttcatca gtccctccat ccccatcacc ttccttcgga acgtcacatg "08" ggtcattgtc aatctctgca ggaataagga tcccccgccg gaga agga "l4" gattttgcca gctttatgtg tata ccatacagat ataaacattc ttgtagacac "20" tgtttgggct ctgtcatact tgacagatgg aggtaatgaa cagatacaga tggttattga "26" ttcaggagtt gtgccctttc ttgtgcccct tctgagccat caggaagtca aagttcaaac "32" agcagccctc agagcagttg gcaacatagt cacc gacgagcaga cccaggttgt "38" ttgt gatgtcctgt cacacttccc aaatctctta tcacacccaa aagagaagat "44" aaataaggaa gcagtgtggt tcctttccaa cataacagca ggcaaccagc aacaagttca "50" agctgtaata gatgctggat taattcctat gataattcat cagcttgcta agggggactt 621621 "56" tggaacacaa aaagaagctg cttgggcaat cagcaactta acaataagtg gcagaaaaga "62" tcaggttgag gtac agcagaatgt aataccaccg ttctgtaatt tactgtcagt "68" gaaagattct caagtggttc aggtggttct agatggtcta aaaaacattc tgataatggc "74" cggtgatgaa gcaagcacaa tagctgaaat aatagaggaa tgtggaggtt tggagaaaat "80" tgaagtttta cagcaacatg aaaatgaaga catatataaa ttagcatttg aaatcataga "86" tcagtatttc tctggtgatg atattgatga agatccctgc ctcattcctg aagcaacaca "92" tacc tacaattttg atccaacagc tcaa acaaaagaat ttaattttta "98" aattcagttg agtgcagcat ctttcccaca ttcaatatga agcaccacca gatggctacc 204" aaatgataag aacaacagca acaaaaggct ccaaaacaca catgcctctt tgttttgatg 210" cttctaaagc aagccatgtc tcagtcactt tgcagttgcc aaaagtcact atcacatgga 216" ctgtaaatgc atatgcatga tttcctaaac agaa ctctccttaa caac 222" taccctattt ttccctgttc cctggtgcca caggctgaca actgcagtct ccagtttaga 228" ataaatattc catagtggtg acatgtcagc tgcccactga tactcctttg ggtg 234" cgctgtggat caagacactt tggtatgatg catatacaag ttggaagact aaagaggtgc 240" atct gagcctccat cattgtcctc cata ttttcatatt ctttatgtgg 246" aagaatagat tttaaagtac aatg attttcattg gtggaactga cacaaaaaaa 252" gtaacttaaa aacaagaaac ttggttattg caga taaa aaaaaaaaaa 258" aactacttca tctaccagta attgatgtgt ttattatctg cctcagaagc cagggttgga 264" ggaagaactt tgga tgct tttgccatta tacctaattt ttgagaacag 270" caagccctat ttgaccactc tcttcagcct gtgtgttcct gctgttttga agtaatcaaa 276" tgctgtgcat ggtattttac ctgagctgca acctgttatg gacttgaact tctgtttaag 282" gcaa gagtccctga gtataaagga aaaacagcaa aacaaaaagc aaacaaaaaa 288" caaa agtctaaaat acccattggt gatgtttttt aaaaaaatct tgctttcagc 294" tttcaggagt tctt tgttttaatt tgataattgg atatggttga ttgg 300" gtttaaactg tggagctttc atgtttactg taatttagtc ttaaaatatt ttttacttag 306" taaccagtgc ttttgataat gtggttggca acaaaccagc aactatttag aagtgtcata 312" agagttcatt ctttgagtat tgggaaagtt aattcagatc ctactcaaaa agcatcttca 318" catattaaaa gattcagaca gggatctgtg tagaggagta atttgcagtt atttaacata 324" aacctgattt gcagtgatct ctaagtaatt ctgcaaaatc cggtattact atgtcaagtt 330" attgcttttg gtaaattgtc tgacccagtt attaatgaaa ggat ttaaaaattt 622622 336" ttaaactaaa taatttgtgc tgtcacagaa atggtattgt tgctcttgtt tactgggtat 342" aatttcccaa tgcattgatg tgaagggata gaaaatctaa actaatttag ttatccattg 348" gggggtgtat ttactgtgat gaga cagatgccat cagagctttg tgaatcagct 354" ggggtgtttt cactgataaa caacacatag caggtgtgca ttcattacaa gtat 360" ctgccaaggt ggagccactt taaagagtga gttttgtctt gtatctaaag tggatacaag 366" cgtatgttta aactgcaaga cttg ctagagaatc tgttttaata tagtggtttg 372" gcctctgatt atttataggt tttataaatt ttagaatcaa tttctcttta aggtggctca 378" gatttttcaa ctcttgtgca cataaaattg agttgaagtt cattgtgcct ttttttcttt 384" atccaaattt tgagttaaag cttcatatgg catc ctgttcggac actatagtct 390" aaatttttga aactgtgtgg tgttcgctaa aagtaggaat aacaacgtaa aagctaatta 396" aggtcacaaa cttcggtgaa acccttaaaa atct tcttgatatt gtgaaccgta 102" ccccttccag tttagtttct tctggacttt ccttacttaa ctgacagtta ccttttaaaa 108" caca ttaa aattgggcct ctactgtgat gattcctatt tcctctcatg 114" ttttaaagtg caaactaaca tttaagtgaa cattagcatc aagtagtgca gacacttgta 120" tgcatttcct tgattcaatt tgtgacctta ccagttttga attggaattg caccatttcg 126" tagataaagg aaactaagta tattgctgca cttttaagtt ttcaaaacag aaaa ttgcattgtt attttttttt aaactcagtt taaaaagact ttct gcatctaaat gtgttcctaa ttttgtatat gggcttaggt tttgtaacca ctgctatcaa atatgataaa aaaa cttaaaaa Protein seguence: NCBI Reference Seguence: NP_002258.2 LOCUS: 258 ACCESSION: N P_002258 1 maenpslenh riksfknkgr dvetmrrhrn evtvelrknk kkrn queesleds 61 dvdadfkaqn vtleailqna tsdnpvvqls avqaarklls sdrnppiddl iksgilpilv 121 kclerddnps queaawalt niasgtsaqt qavvqsnavp lflrllrsph qnvceqavwa 181 lgniigdgpq slgv vkpllsfisp sipitflrnv twvivnlcrn kdppppmetv 241 qeilpalcvl iyhtdinilv sylt dggneqiqmv idsgvvpflv pllshqevkv 623623 301 qtaalravgn ivtgtdeqtq vvlncdvlsh fpnllshpke kinkeavwfl snitagnqqq 361 agli pmiihqlakg dfgtqkeaaw aisnltisgr kdqveylqu nvippfcnll 421 svkdsqvqu nili stia eiieecggle kievquhen ediyklafei 481 idqyfsgddi dedpclipea tqggtynfdp tanlqtkefn f LGALS1 Official : LGALS1 Official Name: lectin, galactoside-binding, soluble, 1 m: 3956 Organism: Homo sapiens Other Aliases: GAL1, GBP Other Designations: 14 kDa laminin-binding protein; 14 kDa lectin; HBL; HLBP14; HPL; S-Lac lectin 1; beta-galactoside-binding lectin LI; beta- galactoside-binding protein 14kDa; gal-1;galaptin;galectin 1; galectin-1; lactose-binding lectin 1; putative MAPK-activating protein PM12 Nucleotide seguence: NCBI Reference Seguence: NM_002305.3 LOCUS: NM_002305 ACCESSION : NM_002305 l agttaaaagg gtgggagcgt ccgggggccc atctctctcg ggtggagtct tctgacagct 6" ggtgcgcctg cccgggaaca tcctcctgga ctcaatcatg gcttgtggtc tggtcgccag 12" caacctgaat ctcaaacctg gagagtgcct tcgagtgcga ggcgaggtgg ctcctgacgc 18" cttc gtgctgaacc tgggcaaaga cagcaacaac ctgtgcctgc acttcaaccc 24" tcgcttcaac gcccacggcg acgccaacac catcgtgtgc aacagcaagg gggc " ctgggggacc gagcagcggg aggctgtctt tcccttccag cctggaagtg ttgcagaggt 36" gtgcatcacc ttcgaccagg ccaacctgac gctg ccagatggat acgaattcaa 42" gttccccaac cgcctcaacc tggaggccat caactacatg gcagctgacg gtgacttcaa 48" gatcaaatgt gtggcctttg actgaaatca gccagcccat ggcccccaat agct 54" gctc cctctgaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 624624 Protein ce: NCBI Reference Seguence: NP_002296.1 LOCUS: NP_002296 ACCESSION: NP_002296 1 macglvasnl nlkpgeclrv rgevapdaks kdsn nlclhfnprf nahgdantiv 61 cnskdggawg teqreavfpf qusvaevci tquanltvk lpdgyefkfp nrlnleainy 121 maadgdfkik cvafd MACF1 Official : MACF1 Official Name: ubuIe-actin crosslinking factor 1 m: 23499 Organism: Homo sapiens Other Aliases: ABP620, ACF7, MACF, OFC4 Other Designations: 620 kDa actin binding protein; actin cross-linking family protein 7; macrophin 1; microtubuIe-actin cross-linking factor 1; trabeculin-alpha Nucleotide seguence: NCBI Reference Seguence: NM_012090.4 LOCUS: NM_012090 ACCESSION : NM_012090 NM_033024 1 attgtgggag ccgctcccct cggctccgcc acgctcccct cgactgcgct ccagcctggg 61 gcgcgcccgg ccgc cttcgctgcc gccacgggcc cgtcttcttc ctccttcggc 121 tcccaggatg aagaaactga gaga ggtgaagtga cttgcccaag atcacagcaa 181 ttatcacttc tccctgggct cccaggccct cctgcagcag cctg ggccatgtct 625625 24" tcctcagatg aagagacgct cagtgagcgg cgga gtgagcggtc ttgtcggagt " gagcgatctt acaggagcga gcggtcgggg agcctgtctc cctgtccccc aggggacacc 36" ttgccctgga acctgccact gcatgagcag aaaaagcgga aaagccagga ttcggtgctg 42" gaccctgcag agcgtgctgt ggtcagagtc gctgatgaac gggt tcagaagaaa 48" acgttcacca agtgggtcaa caagcactta atgaaggtcc gcaagcacat caatgatctt 54" tatgaagatc tgcgggatgg ccataacctg atctctctgt tggaggtcct ctcaggcatc 60" cccc aggg caggatgcgt tttcataggc tgcagaatgt gcagattgcc 66" ctggacttcc taaagcagcg acaggtgaaa ctagtgaata ttcgcaatga tgacatcaca 72" gatggcaacc ccaagttgac cctgggtctg atctggacca tgca tttccagatc 78" tctgacatct acattagtgg agaatcaggg gatatgtcag ccaaggagaa actactcctg 84" tggacccaga aggtgacagc tggttacaca ggaatcaaat gcaccaactt ttcctcctgc 90" tggagtgatg ggaagatgtt caatgcactc attcaccgat accgacccga tctagtagac 96" atggagaggg tgcaaatcca aagtaaccga gagaatctgg aacaggcttt tgaagtggca "02" gaaagactgg gggtcactcg cctgctggat gcagaagatg tggatgtgcc atctccagat "08" gaaaagtctg taatcactta ttcg atttatgatg ccttccctaa agttcctgag "14" ggtggagaag ggatcagtgc tacggaagtg gactccaggt ggcaagaata ccaaagccga "20" gtggactccc tcattccctg acag catacaatac tgatgtcaga taaaactttt "26" ccccaaaacc ctgttgaact aaaggcactt tataaccaat actt caaagaaaca "32" gaaattctgg agag agaaaaagga agaattgagg aattatataa attactagag "38" gtgtggattg aatttggccg aattaaactg cctcaaggtt atcaccctaa tgatgtggaa "44" gaagagtggg gaaagctcat catagagatg ctggaacgag agaaatcact tcggccggct "50" gtggagaggc tggaattgct gctacagatt gcaaacaaaa atgg tgctttgaac "56" gaaa aactgacact agctaagaat acactgcagg ctgatgctgc tcacctggaa "62" tcaggacaac aatg agat gtcattatgt acattcagga gtgtgaaggt "68" ctcatcaggc agctgcaggt ggatctccag atcctgcggg atgagaatta ctaccagcta "74" ctgg cttttagggt tctt caggatgagc tggtcacctt gcgtctagag "80" tgtacaaacc tgtaccggaa gggtcatttc acttcacttg aattggttcc accctctact "86" ttaaccacca ctcatctgaa accc ttaaccaagg caacccattc ttcttctacc "92" ttcc gaaagcctat gactcgggct gaacttgtgg ccatcagctc ctctgaagat "98" gaaggcaatc tccgatttgt gtatgaacta ctgtcttggg tagaagagat gcagatgaaa 626626 204" ctggagcgag cagagtgggg caatgacctg cctagtgtgg agct agaaacacag 210" cagcacatcc gtgt agaagagctg ggctcaagtg tcaaggaggc caggttgtat 216" gagggaaaga tgtcccagaa tttccatacc gctg ttgg aaagctggag 222" tatt gtaaattgaa ggaaacttct agcttccgga tgaggcacct tcagagcctg 228" cataaatttg tttccagagc tacagctgag tggt tgaatgagaa ggag 234" gaactagcat atgactggag tgacaacaat tccaatatct cagccaagag aaattacttc 240" tctgagttga caatggaact ggaggagaaa caggatgtgt ctct acaagataca 246" gcagaactac tttcacttga gaaccaccca caga cagtggaggc ttacagtgct 252" gctgtccagt cccagttgca gtggatgaag cagctgtgcc tgtgtgttga gcagcatgtg 258" aaagagaata ctgcttattt tcagttcttc agtgatgcac gagagctgga gtcattcttg 264" aggaacctcc aagattccat taaacgaaaa tattcctgtg accacaacac cagcttatcc 270" cgccttgaag acctgctcca catg gatgaaaagg agcagcttat acagtccaag 276" agttccgttg ccagtctcgt tgggagatca aaaaccatcg ttcagctaaa cagt 282" ccagaccatg tgttaaagaa caccatttct gtcaaggctg tctgtgacta caggcagatc 288" gagattacta tttgcaaaaa tgatgaatgt gtgctagaag ataattctca gcggaccaaa 294" tggaaagtga tcagccccac agggaacgag gcaatggtgc cgtcagtctg cttcctcatc 300" cccccaccca ataaggatgc cattgagatg gccagcaggg aatc ttatcagaag 306" gttatggccc tttggcatca gctgcatgtt aacaccaaaa gccttatctc ttggaactat 312" ctgcgtaaag accttgacct tgtacagacc tggaacctag aaaagcttcg atcctcagca 318" ccaggggagt gccatcagat tatgaagaac cttcaggccc actatgaaga ctttctgcag 324" gatagtcgtg actctgtgct gttctcagtg gctgatcgct tgga agaggaggtg 330" gaagcttgta aagcccgctt ccagcacctg atgaagtcca tggagaatga ggacaaagag 336" gagactgtgg ccaagatgta catttcagag ttgaagaaca tccggctacg cctggaggag 342" tatgaacaga gggtggtcaa acgaattcag tctctagcca gctctaggac tgacagagat 348" cagg acaatgcatt aaggattgca gagcaagagc acacccagga ggatttacag 354" caattgaggt cagacttgga tgcagtttct atgaaatgtg acagctttct gtct 360" ccatctagtt caagtgtccc aactctgcgc tcagaactga atctgctggt ggagaagatg 366" gaccatgtct atggtctctc tactgtatat ctgaataagt taaagacagt tgatgttata 372" gtacgtagca tacaggatgc tgaactcttg ggtt atgagattaa gctgagtcaa 378" gaagaagtag tactggcaga tctctcagct ctggaggccc cgac attacggcac 627627 384" tggcttagtg atgtgaagga caagaattca gtgttttcag tcctggatga ggaaattgcc 390" aaggccaagg tagtggcaga gcagatgagt cgtctgacac cagagcgaaa tctggatttg 396" gagcgctatc aagg ctcccagctg caggagcgtt ggcaccgagt cattgcccag 102" ctcgagattc ctga gctagaaagt gaag ttctgggaga ttaccgagcc 108" tgccatggaa ctctcatcaa gtggattgag gaaaccactg cccagcagga aatgatgaag 114" ccaggccagg cagaggatag cagagtgctt cagc tcagccagca gacggcccta 120" tttgcagaaa ttgagagaaa tcagacaaaa ctggatcaat gtcaaaaatt ttcccagcag 126" tactctacta ttgtaaagga ctatgaattg caactgatga catacaaggc ctttgtggaa 132" tcgcagcaga aatcccctgg caagcgccgt cgcatgcttt cctcttcaga tgccatcact 138" caagagttca tggacttaag gactcgctac acggcattgg tgactttaac aactcagcac Z44" gtgaaataca tcagtgatgc actccggcgt ctggaggagg aggagaaagt ggtagaagag 150" gagaaacaag aacatgtgga taaa gaacttttgg gctgggtgtc taccctagcg 156" aggaatacac aaggaaaagc atcc gagaccaaag aatcaacaga aaaa Z62" gctattttgg aacagcaggt tctgtcagaa gagctgacaa caaagaaaga acaagtctct gaagctatta aaacatcaca cttg gccaagcatg gtcataagct gagaagaaac aaatatctga gaat gccctaaaca aggcttacca tgacctttgt 180" gatggttctg caaatcagct tcagcagctt cagagccagt tggctcacca gacagaacaa aagaccctgc agaaacaaca aaatacctgt caccagcaac tggaggatct ttgcagttgg Z92" cagg cagaaagagc actggcaggc caccaaggca gaaccaccca gcaggatctc Z98" tctgctttgc agaagaacca aagtgacttg ttac aggatgacat tcagaatcgt 504" gccacctcat ctgt tgtcaaggac attgaggggt tcatggaaga gaatcagacc 510" aagctgagcc cacgtgagtt gacagctctt cgggaaaagc ttcatcaggc taaggagcaa 516" gcgc tccaggaaga gacacgtgtg gcccagaagg aactggagga agcagtgacc 522" tccgccttac agcaggagac tgaaaagagt aaagcagcaa aggaactggc agagaacaag 528" aagaagatcg tcct ggattgggta acttcagtag gatcatctgg tggacagctg 534" ctgaccaacc gaat ggagcagctc tcgggagcta gcttggagaa aggagccttg 540" gacaccactg atggttacat gggggtgaat caagccccag agaaactgga caagcaatgt 546" gagatgatga aggcccgtca attg ctgtcccagc agcaaaattt ggcc 552" acccagtcag ctcaggcctt cttggatcag catggccaca atctcacacc tgaggagcaa 558" cagatgctgc aacagaagct gggagagcta aaggaacaat actctacttc cctggcccaa 628628 564" tcagaggcag aactgaagca ggtgcagaca cttcaggatg agttgcagaa gcag 570" gatcataaag agtttgaaag ctggttggaa cgatccgaga aagagctgga gaacatgcat 576" aagggaggca gcagccccga gacccttccc tccctgctaa agcggcaagg aagcttctca 582" gaggatgtca tttcccacaa aggagacttg agatttgtga ctatctcagg acagaaagtc 588" ttggacatgg aaaacagttt taaggaaggc aaagaaccat cagaaattgg aaacttagta 594" aaggacaagt tgaaggatgc aacagaaaga tacactgctc tccactcaaa gtgtacacga 600" ttaggatctc acctgaatat gctgttaggc cagtatcatc aattccaaaa cagtgctgac 606" agcctgcagg cctggatgca ggcttgtgag gccaacgtgg tcct ctcagatact 612" gttgcctctg gagt tctccaggag cagcttgcaa agca gttgcaggag 618" gaattggctg agcaccaagt acctgtggaa aaactccaaa aagtagctcg tgacataatg 624" gaaattgaag gggagccagc cccagaccac gttc aagaaactac agattccata 630" ctcagccact tccaaagcct ctcctatagc ctggctgagc gatcttctct gaaa 636" gccc aatctcagag tgtccaggaa agcctggaga gcctgttgca gtctattggg 642" gaagttgaac aaaacctgga agggaagcag gtgtcatcac tctcatcagg agtcatccag 648" gaagccttag ccacaaatat gaaattgaag caggacattg ctcggcaaaa gagcagcttg 654" gaggccaccc gtgagatggt attc atggagacag cagacagtac tacagcagca 660" gtgctgcagg gcaaactggc agaggtgagc cagcggttcg aacagctctg tctacagcag 666" caagaaaagg agagctccct aaagaagctt ctaccccagg cagagatgtt tgaacacctc 672" tctggtaagc tgcagcagtt aaac aaaagtcgga tgctggcctc tggaaatcag 678" ccagatcaag atattacaca tttcttccaa cagg agctcaattt ggaaatggaa 684" gaccaacagg agaacctaga tgag cacctggtca ctgaactgag tggc 690" ctgg acttgtgcca gcatcaggac agggtacaga atctaagaaa caca 696" gagctacaga ttaa agagagagag aaagatgcat catcttgcca gttg 702" gatgaattcc ggaagctggt caggaccttc cagaaatggt tgaaagaaac tgaagggagt 708" attccaccta cggaaacttc tatgagtgct aaagagttag aaaagcagat tgaacacctg 714" aagagtctac tagatgactg ggcaagtaag ggaactctgg tggaagaaat caattgcaaa 720" ggtacttctt tagaaaatct catcatggaa atcacagcac ccca aggcaagaca 726" ggttccatac tgccctctgt aggaagctct gtaggcagtg taaacggata ccacacctgc 732" aaagatctga cggagatcca gtgtgacatg tcagatgtaa acttgaagta tgagaaacta 738" gggggagtac ttcatgaacg ccaggaaagc cttcaggcta tcctcaacag aatggaggag 629629 744" gttcacaagg aggcaaactc tgtgctgcag tggctggaat caaaagagga agtcctgaaa 750" tccatggatg ccatgtcatc tccaaccaag acagaaacag tgaaagccca agctgaatct 756" aacaaggcct tcctggctga acag aattctccaa aaattcaaaa agtaaaggaa 762" gccctggctg gattactggt gacatatccc aactcacagg aagcagaaaa ttggaagaaa 768" gaag attc ccgatgggaa agggccactg aggttactgt ggctcggcaa 774" aggcagctag aggaatctgc aagtcatctg gcctgcttcc aggctgcaga atcccagctc 780" cggccgtggc tgatggagaa agaactgatg atgggagtgc tggggcccct gtctattgac 786" atgt tgaatgcaca aaagcaacag gtccagttta tgctaaagga atttgaagca 792" cgcaggcaac agcatgagca actgaatgag gcagctcagg gcatcctaac aggccctgga 798" gatgtctctc tgtccaccag ccaagtacag aaagaactcc agagcatcaa tcagaaatgg 804" gttgagctga aact ccgt tccagccaaa ttgaccaagc tattgttaag 810" agcacccagt accaggaact gctccaggac ttatcagaga aggtgagggc agttggacaa 816" cggctgagtg tccagtcagc tatcagcacc caaccagagg ctgtaaagca gcaattggaa 822" gagaccagtg aaattcgatc ggag cagttagacc acgaggttaa ggaggctcag 828" acactgtgcg atgaactctc agtgctcatt cagt acctcaagga tgaactgaag 834" aagcgtttgg agacagttgc cctgcctctc caaggtttag aagaccttgc agccgatcgc 840" agac tccaggcagc tcttgccagc acccagcagt tccagcaaat gtttgatgag 846" ttgaggacct ggttggatga taaacaaagc gcaa aaaactgccc aatttctgca 852" aaattggagc ggctacagtc tcagctacag gagaatgaag agtttcagaa aagtcttaat 858" caacacagtg gctcctatga ggtgattgtg gctgaagggg aatctctact tgta 864" cctcctggag aagagaaaag gactctacaa aaccagttgg ttgagctcaa ttgg 870" ctta gtaaaaaaac tgcagacaga caatccaggc tcaaggattg tatgcagaaa 876" gctcagaaat atcagtggca tgtggaagac cttgtgccat ggatagaaga ttgtaaagct 882" aagatgtctg agttgcgagt cactctggat cagc tagagtccag tctcctaaga 888" tcaaaggcta tgctgaatga gaag cgccgctccc tgctggaaat attgaatagt 894" gctgctgaca ttctgatcaa ttcttcagaa gcagatgagg atggaatccg gaag 900" gctgggatca accagaacat ggatgctgtt acagaagagc tgcaggccaa aacagggtca 906" ctcgaagaaa tgactcagag gctcagggag ttccaggaaa gctttaagaa tattgaaaag 912" aaggttgaag gagccaaaca ccaacttgag atctttgatg ctctgggttc tcaagcctgt 918" aaga acctggagaa gctaagagct caacaggaag tgctgcaggc cctagagcct 630630 924" caggtagact atctgaggaa tcag ggtctggtag aagatgcccc agatggatct 930" gatgcttctc aacttctcca ccaagctgag gtcgcccagc aagagttcct cgaagttaag 936" caaagagtga acagtggttg tgtgatgatg gaaaacaagc tggaggggat tggccagttt 942" cactgccggg tccgagagat gttctctcaa ttggcagacc tggatgatga gctagatggc 948" atgggtgcta ttggcagaga cactgatagc ctccagtccc aaatcgagga tgtccggcta 954" ttccttaaca aaattcacgt cctcaaatta gacatagagg cctctgaagc agagtgtcga 960" catatgctag aagaagaggg gactctggat ttgttaggtc tcaaaaggga gctagaagcc 966" ctgaacaaac agtgtggcaa actgacagag agggggaaag ctcgtcagga acagctggaa 972" ctgacactag gccgtgtaga ggacttctac aggaaattga aaggactcaa gacc 978" acagcagcag caga ggccctccag tgggtagtgg ggaccgaagt catc 984" aaccaacaat tagcagattt taaaatgttt cagaaagaac aagtggatcc tcttcagatg 990" cagc aggtgaatgg acttggccag ggattaattc cagg aaaagactgt 996" gatgtacagg gtttagaaca tgacatggaa aatg ctcgatggaa tacattgaat "002" aaaaaggtcg cacaaagaat tgcacagcta caggaagctt tgttgcattg tgggaagttt "008" gcct tggagccatt gctcagctgg ttggcagata ccgaggagct catagccaat "014" cagaaacctc catctgctga gtataaagtg gtgaaagcac agatccaaga gttg "020" ctccagcggc tcctagatga tcgaaaggcc acagtagaca tgcttcaagc agaaggaggc "026" agaatagccc agtcagcaga gctggctgat agagagaaaa tcactggaca gctggagagt "032" cttgaaagta gatggactga actactcagt aaggcagcag aaaa acagctggaa "038" gacatcctgg ttctggccaa acagttccat gagacagctg agcctatttc ctta "O44" tctgtcacag agaaaaagct tgctaactca gaacctgttg agac tgccaaaata "050" cagcagcaga tcattcggca caaggctctg gaagaagaca tagaaaacca tgcaacagat "056" cagg cagtcaaaat tgggcagtcc ctctcctccc tgacatctcc tgcagaacag "O62" ggtgtgctgt cagaaaagat agactcattg caggcccgat acagtgaaat tcaagaccgc "O68" tgttgtcgga aggcagccct acttgaccaa gctctgtcta atgctaggct gtttggggag "074" gatgaggtgg aggtgctcaa ctggctggct gaggttgagg acaagctcag ttcagtgttc "080" gtaaaggatt tcaaacagga tgtcctgcac catg ctgaccacct ggctttaaat "086" gaagaaattg ttaatagaaa gaagaatgta gatcaagcta ttaaaaatgg tcaggctctt "O92" caaa ccacaggtga ggaggtgtta cttatccagg aaaaactaga aaag "O98" actcgttacg cagacatcac agttactagc tccaaggccc tcagaacttt agagcaagcc 631631 ""04" ctgg ccaccaagtt ccagtctact gaac tgaccgggtg gctgagggag ""10" gtggaggagg agctggcaac cagtggagga cagtctccca caggggaaca gataccccag "16" tttcagcaga gacagaagga attaaagaag gaggtcatgg agcacaggct ggac ""22" aatg aggtgagccg tgctctctta gagctggtgc cctggagagc cagagaaggg ""28" ctggataaac ttgtgtccga tgctaacgag cagtacaaac tagtcagtga cactattgga ""34" caaagggtgg atgaaattga tgctgctatt cagagatcac aacagtatga gcaagctgcc ""40" gatgcagaac tagcttgggt tgctgaaaca aaacggaaac tgatggctct gggtccaatt ""46" gaac aggaccagac cacagctcag gtac agaaggcttt ctccattgac ""52" attattcgac acaaagattc aatggatgaa ctcttcagtc accgtagtga tggc ""58" acatgtgggg aggagcaaaa aactgtatta caggaaaaga cagagtctct aatacagcaa ""64" tatgaagcca ttagcctact caattcagag gccc agcg ggcccaggtc ""70" aacc agttttggga aacttatgaa gagctcagcc cctggattga ggaaactcgg ""76" gcactaatag cacagttacc ctctccagcc attgatcatg agcagctcag gcagcaacaa ""82" atga ggcaattaag ggaatctatt caca aacctcatat tgacaaacta ""88" ctaaagatag gcccacaact aaaggaatta aaccctgagg aaggggaaat ggtggaagaa ""94" aaataccaga aagcagaaaa catgtatgcc caaataaagg aggaggtgcg ccagcgagcc "200" ctgg atgaagccgt gtcccagtcc acacagatta cagagtttca tgataaaatt "206" gagcctatgt tggagacact ggagaatctt tcctctcgcc tgcgtatgcc accactgatc "212" cctgctgaag tagacaagat cagagagtgc atcagtgaca ataagagtgc caccgtggag "218" ctagaaaaac tgcagccatc ctttgaggcc ttgaagcgcc gtggagagga gcttattgga "224" cgatctcagg gagcagacaa ggatctggct gcaaaagaaa tccaggataa attggatcaa "230" atggtattct tctgggagga catcaaagct gaag aacgagaaat caaatttctt "236" gatgtccttg aattagcaga gaagttctgg tatgacatgg cagctctcct gaccaccatc "242" accc aggatattgt ccatgacttg gaaagcccag gcattgatcc ttccatcatc "248" cagg ttgaagctgc tgagactatt aaggaagaga cagatggtct gcatgaagag "254" ctggagttta ttcggatcct tggagcagat ttgatttttg cctgtggaga aactgagaag "260" cctgaagtga ggaagagcat tgatgagatg aataatgctt gggagaactt aaacaaaaca "266" tggaaagaga ggctagaaaa acttgaggat gctatgcaag ctgctgtgca gtatcaggac "272" actcttcagg ctatgtttga ctggctagat aacactgtga ttaaactctg caccatgccc "278" cctgttggca ctgacctcaa tactgttaaa gatcagttaa tgaa ggagttcaaa 632632 "284" gtagaagttt accaacagca aattgagatg gagaagctta atcaccaggg tgaactgatg "290" ttaaagaaag ctactgatga gacggacaga gacattatac gagaaccact gacagaactc "296" aaacacctct gggagaacct gaaa attgcccacc gacagcacaa actagaaggg "302" gctctgttgg cccttggtca gttccagcat gccttagagg aactaatgag ttggctgact "308" cataccgaag agttgttaga tgctcagaga ccaataagtg gagacccaaa agtcattgaa "314" gttgagctcg caaagcacca tgtcctaaaa gttt tggctcatca agccacagtg "320" gaaacagtca ctgg gctt cttgaatcca gtgctggaga tgatgccagc "326" agcttaagga gccgtttgga agccatgaac caatgctggg agtcagtgtt acagaaaaca "332" gaggagaggg agcagcagct tcagtcaact ctgcagcagg cccagggctt ccacagtgaa "338" attgaagatt tcctcttgga acttactaga atggagagcc agctttctgc gccc "344" ggac ttcctgaaac tgctagggaa cagcttgata cacatatgga actctattcc "350" cagctgaaag aaga gacttataat caactacttg acaagggcag actcatgctt "356" ctaagccgtg acgactctgg gtctggctcc aagacagaac agagtgtagc acttttggag "362" cagaagtggc tcag cagtaagatg agaa agtcaaagct ggaagaggcc "368" ctcaacttgg aatt ccagaattcc ctacaagaat ttatcaactg gctcactcta "374" gcagagcaga gtttaaacat cgcttctcca ccaagcctga ttctaaatac tgtcctttcc "380" cagatagaag agcacaaggt ttttgctaat gaagtaaatg ctcatcgaga ccagatcatt "386" gagctggatc aaactgggaa tcaattaaag ttccttagcc aaaagcagga tctg "392" atcaagaatt tgttggtgag gtct cgatgggaga aggttgtcca gcgatctatt "398" gaaagagggc gatcactaga tgatgccagg aagcgggcaa aacaattcca tgaagcttgg "104" aaaaaactga ggct agaagatgca gagagtcacc tggactcaga actagagata "110" tccaatgacc cagacaaaat taaacttcag ctttctaagc ataaggagtt tcagaagact "116" cttggtggca agcagcctgt gtatgatacc acaattagaa ctggcagagc actgaaagaa "122" aagactttgc aaga tagtcagaaa cttgacaatt tcctaggaga agtcagagac "128" aaatgggata ctgtttgtgg caagtctgtg gagcggcagc acaagttgga ggaagccctg "134" ctcttttcgg gtcagttcat tttg caggcattgg ttgactggtt atacaaggtg "Z40" gagccacagc tggctgagga ccagcccgtg cacggggacc ttgacctcgt catgaacctc "Z46" gcac acaaggtttt ccagaaggaa ctgggaaagc gaacaggaac cgttcaggtc "152" ctgaagcggt caggccgaga gctgattgag aatagtcgag atgacaccac ttgggtaaaa "158" ggacagctcc aggaactgag cactcgctgg gacactgtct gtaaactctc tgtttccaaa 633633 "464" caaagccggc ttgagcaggc cttaaaacaa gcggaagtgt ttcgagacac agtccacatg "470" ctgttggagt ctga agcagagcaa acgcttcgct ttcggggagc acttcctgat "476" gacacagagg ccctgcagtc tctcattgac acccataagg aattcatgaa gaaagtagaa "482" gaaaagcgag tggacgttaa ctcagcagta gccatgggag aagtcatcct ggctgtctgc "488" caccccgatt gcatcacaac catcaaacac tggatcacca tcatccgagc tcgcttcgag "494" gaggtcctga catgggctaa gcagcaccag cttg cctt gtcagaactg "500" gtggctaatg ctgagctcct ggaagaactt ctggcatgga tccagtgggc tgagaccacc "506" ctcattcagc gggatcagga gccaatcccg attg ttaa agcccttatc "512" gctgagcatc agacatttat ggaggagatg aaac acgt ggaccgggtc "518" accaagacat acaaaaggaa aaacatagag cctactcacg cgcctttcat atcc "524" cgcagcggag gcaggaaatc cctaagtcag ccaacccctc ctcccatgcc ttca "530" cagtctgaag caaaaaaccc acggatcaac cagctttctg cccgctggca gcaggtgtgg "536" ctgttagcac tggagcggca aaggaaactg aatgatgcct tggatcggct gttg "542" aaagaatttg ccaactttga ctttgatgtc tggaggaaaa agtatatgcg ttggatgaat "548" cacaaaaagt ctcgagtgat ggatttcttc cggcgcattg ataaggacca ggatgggaag "554" ataacacgtc ttat cgatggcatt ttagcatcca ccac caccaagtta "560" actg ctgtggctga cattttcgac gggg atggttacat tgattattat "566" gaatttgtgg ctgctcttca tcccaacaag gatgcgtatc gaccaacaac cgatgcagat "572" aaaatcgaag atgaggttac aagacaagtg gctcagtgca aatgtgcaaa aaggtttcag "578" gtggagcaga tcggagagaa taaataccgg tttggggatt ctcagcagtt gcggctggtc "584" cgtattctgc gcagcaccgt gatggttcgc gttggtggag gatggatggc cttggatgaa "590" tttttagtga aaaatgatcc ctgccgagca cgaggtagaa ctaacattga agag "596" aaattcatcc taccagaggg agcatcccag ggaatgaccc ccttccgctc acggggtcga "602" aggtccaaac catcttcccg ggcagcttcc cctactcgtt ccagctccag tgctagtcag "608" agtaaccaca catc catgccatct tctccagcca ccccagccag tggaaccaag "614" gttatcccat catcaggtag caagttgaaa cgaccaacac caacttttca ttctagtcgg "620" cttg ctggtgatac cagcaatagt tcttccccgg cctccacagg tgccaaaact "626" aatcgggcag accctaaaaa gtctgccagt cgccctggga gtcgggctgg gagtcgagcc "632" gggagtcgag gccg gcgaggaagt gacgcttctg actttgacct cttagagacg "638" cagtctgctt gttccgacac ttcagaaagc agcgctgcag ggggccaagg caactccagg 634634 "644" agagggctaa acaaaccttc caaaatccca accatgtcta agaagaccac cactgcctcc "650" cccaggactc caggtcccaa gcgataacac tgtctaagca cccccaagcc actatccact "656" ttgaatcctg ctccatacat tgggtgtata tttattctga acgggagaag ttatattgtt "662" aaaagtgtaa aagaataatt gtgttatgaa gctgccttat tctt tttgtaagtt "668" actattttca tgtgaatatt tatgtagata aaatttgcct cctggtaacc ctgtaatgga "674" tggggcccag aaatgaaata tttgagaaaa acaagtgaaa aggtcaagat acaaatgtgt "680" attaaaaaaa aaaaagccta ttaatagggt ttctgcgcgg tgcagggttg taaacctgct "686" ttatctttta ggattattcc catc ttctttataa acttgacttg ctatctcagc "692" aagataaatt aaaa aataagaatc ctgcagtgtt taaggaactc tttttttgta "698" aatcacggac acctcaatta gcaagaactg aggggagggc tttttccatt gtttaatgtt "704" ttgtgatttt tagctaaaga gagggaacct catctaagta acatttgcac atgatacagc "710" aaaaggagtt cattgcaata ctgtctttgg atattgtttc agtactgggt agga "716" caaatagctg ctagaattca ggggtaaatg taagtgttca gaaaacgtca gaacatttgg "722" ggttttaaac tgatttgttg ctccctatcc agcctagaca ccagtaactc ttgtgttcac "728" caggacccag acccttggca agggataggc tcgttggtga cattgtgaat ttcagatttg "734" ttttatccac tgct atttatttaa atggtcgatc aacttcccac aaactgagga "740" atgaattcca tgtt atgt aaga caaacacgtg ctcgtccttt "746" aatggagttc accagcacac ttgttaacca gtcctgtttg ctttcgtctt tttttgtgcg "752" taataaagtc aactgaccaa gtgaccatga gctg tctggggctc ctgtttttta "758" gctgctgttc tccg accatgttgc tgtgtgatta ttgg ttttaattga "764" ggcagaaact gaagctctac caatgaactg tttagaaaca actt ttgtattaaa "770" attgcttgca gtaacaaata ttttgtattt cctgattttc ttttcaacta ttaccttatc "776" tataaatgtt accctggggt ataatcatgt tgtaggtact taaatgcatt ccgcaaatca "782" cttg atggataaat tatagagctt aatagatctt gttttatttc aaaaaaaaaa "788" aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa Protein seguence: NCBI Reference Seguence: 222.3 LOCUS: NP_036222 ACCESSION: N P_036222 N P_148984 635635 l msssdeetls erscrsersc rser sgslspcppg dtlpwnlplh eqkkrksqu 6" vldpaeravv rvaderdrvq kktftkwvnk hlmkvrkhin dlyedlrdgh nlisllevLs 12" giklprekgr mrfhrlqnvq ialdflquq vklvnirndd kltl gliwtiilhf 18" qisdiyisge sgdmsakekl llwtqutag ytgikctnfs kmfn alihryrpdl 24" vdmervqiqs nrenleqafe vaerlgvtrl ldaedvdvps pdeksvityv ssiydafp<v " isat evdsrwqeyq ipwi kqhtilmsdk elk alynqyihfk 36" eteilakere kgrieelykl levwiefgri klpquhpnd veeewgklii emlereker 42" paverlelll qiankiqnga lnceekltla knthadaah lesgqpvqce sdvimyiqec 48" eglirqlqu lqilrdenyy qleelafrvm rlqdelvtlr lectnlyrkg lvpp 54" stlttthlka epltkathss stswfrkpmt raelvaisss edegnlrfvy eemq 60" mkleraewgn dlpsvelqle tqqhihtsve elgssvkear lyegkmsqnf htsyaetlgk 66" letqycklke tssfrmrhlq slhkfvsrat aeliwlneke eeelaydwsd nnsnisakrn 72" yfseltmele ekqdvfrslq dtaellslen hpakqtveay saavqsqlqw mkqlclcveq 78" hvkentayfq ffsdareles flrnlquik hnts lsrledllqd smdekeqliq 84" skssvaslvg rsktivqlkp rspdhvlknt isvkavcdyr qieiticknd ecvlednsqr 90" tkwkvisptg neamvpsvcf lipppnkdai emasrveqsy qumalwhql hvntkslisw 96" nylrkdldlv thnleklrs hqim knlqahyedf lqurdsvlf svaerrlee "02" eveackarfq hlmksmened keetvakmyi selknirlrl eeyeqrvvkr iqslassrtd "08" rdawqdnalr iaeqehtqed qulrsdlda vsmkcdsflh qspssssvpt lrselnllve "14" glst vylnklktvd vivrsiqdae llvkgyeikl sqeevvladl saleahwstl "20" rhwlsdvkdk nsvfsvldee iakakvvaeq ernl dleryqekgs qlqerwhrvi "26" aqleirqsel esiqevlgdy rachgtlikw ieettaqqem mkpgqaedsr vlsequqqt "32" alfaeiernq tkldchkfs vkdy elqlmtykaf vesqqkspgk rrrmlsssda "38" itqefmdlrt rytalvtltt qhvkyisdal rrleeeekvv eeekqehvek vkellngst "44" larntqgkat ssetkestdi ekailequl kkeq vseaiktsqi flakhghkLs "50" ekekkqiseq lnalnkayhd lcdgsanqlq qlqsqlahqt eqkthqun tchquedLC "56" swvgqaeral aghqgrttqq dlsalqknqs dlkdlqddiq nratsfatvv kdiegfmeen "62" qtklsprelt alreklhqak eqyealqeet rvaqkeleea vtsaquete kskaa<elae "68" alld wvtsvgssgg qlltnlpgme qlsgaslekg aldttdgymg vnqapekldk "74" qcemmkarhq qnfi qafl dqhghnltpe eqqmquklg elkeqystsl 636636 180" aqseaelkqv qthdelqkf lqdhkefesw lersekelen mhkggsspet lpsllqugs 186" shkg dlrfvtisgq kvldmensfk egkepseign lvkdklkdat erytalhskc 192" trlgshlnml lgthqfqns adslqawmqa ceanveklls dtvasdpgvl qeqlattkql 198" qeelaehqvp vequkvard imeiegepap dhrthettd silshquls yslaerssLl 204" qkaiaqsqsv qeslesllqs igeveqnleg kquslssgv iqealatnmk lkqdiarq<s 210" sleatremvt dstt aavlqgklae vsqrfeqlcl qqqekesslk kllpqaemfe 216" hlsgququ enksrmlasg nquqdithf quiqelnle medqqenldt lehlvtelss 222" cgfaldlcqh qdrvqnlrkd ftelqktvke scqe qldefrklvr tquwlkete 228" etsm sakelekqie hlksllddwa eein ckgtslenli meitapdsqg 234" ktgsilpsvg ssvgsvngyh tckdlteiqc dmsdvnlkye herq eslqailnrm 240" eevhkeansv lqwleskeev lksmdamssp tktetvkaqa esnkaflael eqnspkiqu 246" kealagllvt ypnsqeaenw kkiqeelnsr weratevtva rqrqleesas hlachaaes 252" meke lmmgvlgpls idpnmlnaqk ququlkef heql iltg 258" pgdvslstsq vqkelqsinq kwveltdkln srssqidqai vkstquell qdlsekvrav 264" gququsai stqpeavqu leetseirsd leqldhevke elsv ligeqylkde 270" tval plqgledlaa drinrlqaal astqqqumf delrtwlddk qsqqakncpi 276" saklerlqsq lqeneequs lnqhsgsyev ivaegeslll svppgeekrt lqnqlvelkn 282" hweelskkta drqsrlkdcm qkaqkquhv edc kakmselrvt ldpvqlessl 288" lrskamlnev ekrrslleil nsaadilins seadedgird ekaginqnmd avteelqakt 294" gsleemtqu refqesfkni ekkvegakhq leifdalgsq acsnknlekl raqqevlqal 300" epquylrnf tqglvedapd gsdasqllhq aevaqqefle vquvnsgcv mmenklegig 306" qfhcrvremf sqladlddel dgmgaigrdt dslqsqiedv rlflnkihvl kldieaseae 312" eegt krel ealnchgkl tergkarqeq leltlgrved fyrklkglnd 318" attaaeeaea lquvgteve iinqqladfk qukequpl ququvngl gqgliqsagk 324" dcdvqglehd meeinarwnt lnkkvaqria lhcg pll swladteeLi 330" anqkppsaey kvvkaqiqeq kllqulddr lqae sael adrekitgql 336" eslesrwtel lskaaaquq ledilvlakq fhetaepisd flsvtekkla nsepvgtqta 342" kiqqqiirhk aleedienha tdvhqavkig qslssltspa quvlsekid slqaryseiq 348" drccrkaall dqalsnarlf gedevevlnw laevedklss vakdfkqdv lhrqhadhLa 354" lneeivnrkk nvdqaikngq allkqttgee vlliqekldg iktryaditv tsskalrtLe 637637 360" qarqlatqu tgwl reveeelats gquptgeqi quqqquel kkevmehr;V 366" ldtvnevsra llelvarar egldklvsda neqyklvsdt igqrvdeida aiqrsqqyeq 372" aadaelawva etkrklmalg pirleqdqtt aqlqvqkafs idiirhkdsm delfshrsei 378" fgtcgeeqkt vlqektesli qqyeaislln seryarlera qvlvanwet yeelspwiee 384" traliaqlps paidheqqu qqeemrqlre siaehkphid kllkigpqlk elnpeegemv 390" eekquaenm yaqikeevrq ralaldeavs qstqitefhd kiepmletle nlssrlrmpp 396" lipaevdkir ecisdnksat velequpsf ealkrrgeel igrsqgadkd laakeiqdkl 102" dqmvffwedi karaeereik fldvlelaek fwydmaallt divh dlespgidps 108" aae tikeetdglh eelefirilg adlifacget ekpevrksid emnnawenLn 114" ktwkerlekl edamqaavqy fdw ldntviklct mppvgtdlnt vkdqlnemke 120" fkvequqqi emeklnhqge lmlkkatdet drdiireplt elkhlwenlg ekiahrqhkl 126" egallalgqf lmsw lthteellda qrpisgdpkv ievelakhhv lkndvlahqa 132" tvetvnkagn ellessagdd asslrsrlea mnchesvlq kteereqqlq 138" llel trmesqlsas kptgglpeta reqldthmel ysqlkakeet ynqlldkgrl Z44" mllsrddsgs sval leqkwhvvss kmeerkskle ealnlatefq nslqefinwl 150" tlaeqslnia sppslilntv lsqieehkvf anevnahrdq iieldqtgnq lkflsqkqdv 156" vliknllvsv qsrwekvvqr siergrsldd arkrakqfhe awkklidwle daeshldsel Z62" eisndpdkik lqlskhkefq ktlggkqpvy dttirtgral kektllpeds qkldnflgev Z68" rdkwdtvcgk klee allfsgqud alqalvdwly kvepqlaedq pvhgdldlvm Z74" nlmdahkqu kelgkrtgtv qvlkrsgrel iensrddttw vkgqlqelst rwdtvcklsv 180" eqal kqaevfrdtv hmllewlsea eqtlrfrgal pddtealqsl idthkefmkk 186" veekrvdvns avamgevila vchpdcitti khwitiirar feevltwakq hqquetaLs Z92" elvanaelle qwae ttliqrdqep ipqnidrvka liaehqtfme emtrkquvd Z98" rvtktykrkn iepthapfie rksl pmpi lsqseaknpr inqlsarwqq 504" vwllalerqr klndaldrle elkefanfdf mrw mnhkksrvmd kdqd 510" gkitrqefid gilaskfptt klemtavadi fdrdgdgyid yyefvaalhp pttd 516" adkiedevtr qvaqckcakr fqveqigenk yrfgdsqqlr lvrilrstvm vrvgggwmal 522" deflvkndpc rargrtniel rekfilpega frsr grrskpssra sssa 528" sqsnhsctsm psspatpasg tkvipssgsk lkrptptfhs srtslagdts nssspastga 534" ktnradpkks asrpgsrags ragsrassrr gsdasdfdll etqsacsdts essaaggqgn 540" srrglnkpsk iptmskkttt asprtpgpkr 638638 MAP1 B Official Sym bol: MAP1 B Official Name: microtubule-associated n 18 Gene ID: 4131 Organism: Homo sapiens Other Aliases: FUTSCH, MAPS Other Designations: MAP-1 B Nucleotide seguence: NCBI Reference Seguence: NM_019217.1 LOCUS:NM_m92W ACCESSION : NM_019217 XM_001061557 XM_215469 1 cgcgcaggga gagagcggag ggggaggcga ccgg gaggaggggg gacgcagtgg 6" gcggagcgga gacagcacct tcggagataa tcctttctcc tgccgcagag cagaggagcg l2" gcgggagagg aacacttctc ccaggcttta gcagagccgg caggatggcg accgtggtgg 18" tggaagccac cgagccggag ccatcgggca gcatcggcaa CCngngCg accacctcgc 24" ccagcctgtc gcaccgcttc ctagacagca agttctactt gctggtggtg gaga " cggtgaccga agagcacctg aggcgtgcca tcggcaacat Cgagctgggg atccgatcgt 36" gggacacaaa cctgatcgag tgcaacttgg accaagagct caaacttttc gtgtctcgac 42" actccgcgag attctctcct gaagttccag gacaaaagat cctccatcac cgaagtgacg 48" tcttagaaac tgtagttctg atcaaccctt cggatgaagc agtcagcacc gaggtgcgtt 54" tgatgatcac tgacgccgcc cgccataaac tgctggtgct caccggacag tgctttgaga 60" acactggaga gctcatcctc cagtcaggct ctttctcctt cttc atagagattt 66" tcaccgacca agagattggg gagctcctaa gcaccaccca tcctgccaac agcc 72" tcaccctctt ctgccctgag gact ggaagaactc caaccttgac agacacaatc 78" tccaagactt catc aagctcaact cagcttctat cttgccagaa atggagggac 639639 84" tttctgagtt gtac gagt ctgtcgaagt cccctccccc tttgacatcc 90" tggagccccc gacctcgggc ggatttctga agctctccaa gccttgttgt tacatttttc 96" cggggggccg cggggactct gccctgttcg cagtgaacgg attcaacatg ctcattaacg "02" gaggatcaga aagaaagtcc tgga agctcattcg gcacttggac nggtggact "08" ccatcctgct cacccacatt ggggatgaca acttgcccgg gatcaacagc atgttgcaac "l4" gcaagattgc ggaa gaggagcggt cccagggctc caac tgga "20" tgaaaaacct catctcccct gacttggggg ttct caatgtacct gaaaatctga "26" aaaacccaga acccaacatc aaga gaagtacaga agaagcatgc ttcaccctcc "32" agtacctaaa caaactgtcc atgaaaccag agcctttatt tagaagtgta ggcaatgcca "38" ttgagcctgt catcctgttc caaaaaatgg gagtgggtaa actggagatg tacgtgctta "44" acccagtcaa aagcagcaag gaaatgcagt atttcatgca gcagtggact ggaaccaaca "50" aagacaaggc tgaacttatc ctgcccaatg gtcaagaagt agacatcccg atttcctacc "56" tgacttccgt ctcgtctttg attgtgtggc acccagccaa ccctgctgag aaaatcatcc "62" gggttctgtt tcctggaaac agcacccagt acaacatcct agaagggctg gaaaaactca "68" aacatctaga cttcctaaag cagccactgg ccacccaaaa agatctcact ggccaggtgt "74" ccaccccccc agtgaaacag gtcaagttga aacagcgggc tgacagccga gagagtctga "80" agccagccac aaaaccactt tccagtaaat cagtgaggaa ggagtccaaa gaggaggccc "86" ccac aaaagccagc caagtggaaa aaacacccaa aagc aaagagaaag "92" tgatagtgaa aaaagacaag ccaggaaagg tagaaagtaa gccatcggtg acggaaaagg "98" aggtgcccag caaagaggag cagtcgcccg tcaaagctga tgag aaggcggcca 204" gcaa acccaaagtc gaca aagtggtaaa aaaggaaata aagacaaaac 210" ccgaagaaaa gaaagaggag aagcccaaga aggaagtggc taaaaaggaa gacaaaactc 216" ccctcaagaa agacgagaag cccaaaaagg aagaggcgaa gaaggagatc gaaa 222" tcaaaaagga agagaaaaag gagctgaaga aagaggtgaa gaaggaaacg cccctgaagg 228" agaa ggaggtgaag aaagacgaga agaaagaagt taaaaaggaa gagaaggaac 234" ccaaaaagga gattaagaag atctccaagg acataaagaa atccactcct ctgtcagaca 240" caaagaaacc ggctgcattg aaaccaaaag tagcaaagaa agaagagccc aagg 246" ttgc tgctgggaaa ctcaaggaca aggggaaggt catt aaaaaggaag 252" gcaagaccac agaggccgct gccacagctg ttggcactgc tgccgtggct gcagcagccg 258" gagtagcggc cagcggtcct gccaaagaac ttgaagctga gcggtccctc atgtcgtccc 640640 264" atct aaccaaggac tttgaggagc taaaggctga ggagatcgat gtagcgaagg 270" acatcaagcc tcagctggag ctcattgaag atgaagagaa actgaaggaa accgagccgg 276" gagaagccta tcag aaagagacgg aagtcagcaa aggttctgct gagtcacctg 282" ggat caccaccact gaag gcga gcaaaccccc gaggagctgg 288" agccagttga gggc gtggatgaca tcgagaagtt cgaggatgaa ggtt 294" ttgaagaatc ctcagaggcc ggagactacg aagagaaggc tgag gaggccgagg 300" agccggaaga agacggggaa gacaatgtga gcgggagcgc ctcgaagcac agccccacag 306" aagacgaaga aatcgctaag gctgaggcgg acgtacacat caaggagaag agggagtctg 312" tggccagcgg cgatgaccgg gccgaagaag atga agcgcttgag aaaggagaag 318" ctgaacagtc tgaggaggag ggtgaggagg aggaggacaa agcagaggac gccagagagg 324" aagaccatga gcccgacaaa actgaggctg aagattatgt gatggctgtg gttgacaagg 330" ccgcggaggc cggagtcacc gaggatcagt atgatttcct ggggacaccg gccaagcaac 336" tcca gtctcctagc cgagaacccg cgtcttcaat tcatgatgag accctacccg 342" gaggctccga gagcgaggcc actgcttcag atgaggagaa tcgagaagac cagcctgagg 348" ctgc tacctccgga tatactcagt ccaccatcga gatatctagt gagccgactc 354" caatggatga cact cctcgagatg tgatgaccga cgagaccaac aatgaggaga 360" cagagtcccc gtctcaggag aaca ttaccaaata cgagtcttcg ctgtactctc 366" aggagtactc caaacctgtg gttgcatcat tcaatggatt gtcagacggg tcaaagacag 372" acgccactga cggtagggat tacaacgctt ccgcctccac catatcacca ccttcgtcca 378" tggaagaaga caaattcagc aagtctgctc acgc ttaccgccca gaagagacgg 384" acgtgaaaac cggtgccgag ttggacatca aagatgtttc ggatgagaga cttagcccag 390" ccaagagtcc atccctgagt ccttctccac catcacccat agagaagact cccctgggtg 396" aacgtagcgt gaatttctct ctgacaccca acgagatcaa agcctctgca gagggagagg 102" caacagcagt agtgtccccc ggagtgaccc aagcagtagt tgaagaacac tgtgccagtc 108" ctgaggagaa gaccttggag gtagtgtcac cgtctcagtc tgtgacaggc agtgcgggcc 114" ctta ctaccaatct cccaccgacg aaaagtccag tcacctacct acagaagtca 120" ctgagaacgc gcaggcagtc ccggtgagct ttgaattcac tgaggccaaa gatgagaacg 126" agaggtcgtc catcagcccc atggatgaac ctgtgcctga gtct cctatcgaga 132" tgtc tccgttacgc agccctcccc ttattggatc cgagtccgca tatgaagact 138" tcctgagtgc ggatgacaag gctcttggca gacgttcaga aagccccttt gaagggaaga 641641 Z44" atggaaagca aggcttctca gacaaagaaa gcccagtttc tgacctgact tccgatcttt 150" accaagacaa agag aaaagcgcgg gcttcatacc gataaaggaa agtc 156" cagaaaagaa agccagcgat gctgaaatca tgagttctca atcagctctg gctttggatg Z62" aaaggaaact gggaggagat ggatctccaa cgcaagtaga tgtcagtcag tttggctctt Z68" tcaaagaaga caccaagatg tccatttcgg aaggcaccgt ttcagacaag tccgccacgc Z74" ctgtggatga gggcgtggcc gaagacacct attcacacat ggaaggtgtg gcctcagtgt 180" cctc tacc agctcgtttc cagagccaac cacagatgac gtgtctcctt 186" ctctccacgc tgaagtgggc tctccacatt ccacagaggt ggatgactcc ctgtcggtgt Z92" cggtggtgca aacaccaact actttccagg aaat gtctccgtct gagt Z98" gcccaagacc aatgtcgatt tctcctcctg acttctcccc agcc aaatccagga 504" caccagttca agatcaccga tccgaacagt cttcaatgtc tattgaattc ggtcaggaat 510" cccccgagca ttctcttgct atggacttta gtcggcagtc tccagaccac cctactgtgg 516" gtgctggtat gcttcacatc accgaaaatg ctga ggtggactac agtccctccg 522" atatccagga ctctagtttg tcacataaga ttccgccgac gcca tcctacaccc 528" aggataatga tctgtccgag ctcatctctg tgtctcaggt ggaggcttcc ccatccacct 534" cttctgctca cactccttct cagatagcct ctcctcttca ggaagacact ctctctgatg 540" tcgttcctcc cagagatatg tccttatatg ttgc gtctgagaaa gtgcagagcc 546" tggaaggaga gaaactctct tccg ctcc ccct tcct 552" cacctacata ttcacctggc ttttcagatt ctacctctgg agctaaagag agtacagcgg 558" cttaccaaac ctcctcttcc ccaccaatag atgcagcagc cgcagagccc tacggcttcc 564" gctcctcaat tgat acaatgcagc atcacctggc cttgagtaga gatttgacca 570" catctagtgt ggagaaggac aatggaggga agacacccgg tgactttaac tatgcctatc 576" aaaagcccga gagcaccacc gaatccccag atgaagaaga ttatgactat gaatctcacg 582" agaaaaccat ccaggcccac ggtg gttactacta tgagaagaca gagagaacca 588" taaaatcccc atgtgacagt ggatactcct atgagaccat tgagaagacc accaagaccc 594" cagaagatgg tggctactcc tgtgaaatta ccgagaaaac cactcggacc cctgaagagg 600" gcgggtactc gtatgagatc agcgagaaga caacacgaac ccctgaagta agtggctaca 606" cctatgagaa gagg tccagaaggc tcctcgatga cattagcaat ggctacgatg 612" acactgagga tggtggccac acacttggcg actgtagcta ttcctacgaa accactgaga 618" aaattaccag ctttcctgaa tctgaaagct atga gacaactaca aaaacaacac 642642 624" ggagtccaga cacctctgca tactgttacg agaccatgga gaagatcacc aagaccccac 630" aggcatccac atactcctat gagacctcag accgatgcta cactccagaa tccc 636" cctcggaggc acgccaggat gttgacttgt gtctggtgtc ctcctgtgaa ttcaagcatc 642" ccaagaccga gctctcacct tccttcatta atccaaaccc tctcgagtgg tttgctgggg 648" aagagcccac tgaagaatct gagaagcctc tcactcagtc tggaggagcc cctt 654" caggaggaaa acaacagggc agacaatgcg ctcc acccacctca gtcagtgagt 660" cagctccatc ccagacggac tctgatgttc ccccagagac agaagagtgc ccctccatca 666" cagctgatgc caacattgac tctgaagatg agtcagaaac caca acgg 672" ttacgtacaa acacatggac ccgcctccag cccccatgca agaccgaagc ccttctcctc 678" gccaccctga tgtgtccatg gtggatccag aggccttggc gcag aacctaggca 684" aggctctgaa aaaggatctg aaggagaagg ccaa gaaaccaggc acaaagacca 690" agtcctcttc acctgtcaaa aagggtgatg ggaagtccaa gccttcagca gcttccccca 696" aaccaggagc cttgaaggaa tcctctgaca aggtgtccag agtggcttct aaga 702" aagagtctgt ggagaaagct atgaagacca ccaccactcc tgaggtcaaa gccacacgag 708" gggaagagaa ggacaaggaa actaagaatg cagccaatgc ttctgcatcc gtga 714" agactgcaac agcaggacca ggaaccacta agacggccaa gtcgtccacc cccg 720" gcctccctgt gtatttggac ctctgctata ttcccaacca cagcaacagt aagaatgtcg 726" atgttgagtt tttcaagaga gtgaggtcat cttactacgt ggtgagtggg aacgaccctg 732" ccgcggagga gcccagccgg gctgtcctgg atgccttgtt ggaagggaag gctcagtggg 738" gaagcaacat gact ctgatcccaa cacatgactc tgaggtgatg agggagtggt 744" accaggagac ccacgagaag cagcaagacc tcat ggtcctagca agcagtagta 750" cagtggtcat gcaagacgag tccttccctg agat agaactgtag aaaccgcagc 756" cgaccacacc acaggatttg aactgtgttt ccagaaattc ctgaatttga cttt 762" tcttaaacgt ccattcatct aattacgtca ctgaacaagg acctgccaga tgctatacag 768" tgtcatggtg atca tttc tcaatttttg ccgaccgcta aggaaagtaa 774" ccatattccc acaatagatt tcaagttact gcaaaattac ctacccccgt tcatctctgc 780" tgaaatacgt ggaagccagg cactcgcaca cccaactgac ttctgctagg tattggcatt 786" tatcttagag agagaaagag agagggaaag agagtcagcg ggaggggagg gagagcggcg 792" ggagagaggg acaaagagac tgccggagag agagagagtg agagtgagag aggaagaatc 798" agaaagaaaa agaatgcaag acaaaaaagg tagagagttc tgatgagatg cccagggaga 643643 804" aagagtgggc gggt atagagaaga caccagcaac tgggtctgcg ttttcccaga 810" ccacagcgat tcattctgtg gtctacacag gtggagtttt ccattttcac cagagtcatc 816" agaaagagtc gatctcctaa agcttgtttc acag gaaggacgaa gcctgtcacg 822" agggcatgag atttttcacg ccttaattaa ttgc tataaaggct gcccacgcat 828" aatgtagcaa gtgtaggctg cgag tggttggagc cccgttcaga actctcagac 834" ttttcaaaca Cgtgagtagg ttgatctaag gcatgctccc agcatttgtc tacccaagtc 840" cacatcgagt caacccgcat gcagcaacac ccaaggccac cccagttaac tgaagcaaat 846" accaaagcag ttgggagaac atatgggaga cattttgcct gtga cttgaatgta 852" caaagttacc cgatgcactt attttttaac gtgagacggc ttaa aacatccgtg 858" taggattgta ccag gcgaaggagc atgcggggag ggaaggtact tcgt 864" ctgactggcc caacagttta gtgcagagtc atgcttggtg aacgtcacca cttctgatgt 870" acccgtggcc ccag tgca ccctaagtag cttctttaac gcttcaagtt 876" taagactgaa atggcttctc taatcagaac cagggaaaca atgaatctca ngtggaagg 882" ggttctcggc acag cctt cctttgtctt gcattgacta ttttaatttt 888" aatt ccaacacgtg ggaacacatg tacagaagat tttttttttt agatacatga 894" gaacttttca tagatgaact ttctaacgaa tgttttcatt tacagaaaaa tgcaaagaaa 900" aatttgaagc gatggtcttt ttttttaatt attattttaa gtgttttgta agacaaaaaa 906" gttt tttgaggttc tggaaagatt tgaagcctga agtc atat 912" ttatttaaaa acccgtcact actggaaacg gtggtacctc ccaccctttg actctcatat 918" tatgaaagtg tgagtccgtg ctgtttgaga gtgggtaggt ggcagggtag gctactgttc 924" agggtttcac agtgctattc cctcctcctt tttt ttcaccgtga ggtggaagag 930" ccaagttcag aagcacccta gcgccagctt gcttgggcct tttctggaaa acattcattg 936" aaagaaataa cagattgaaa acaaatgaat tctcagctcc tacgttcacc atgtagagag 942" ttcagacaca atgtccgtca ctgtcatcac tgaaccacaa actcgtaacg ccagatcatg 948" aggaaatctt tcgccaagtt tcaaacggca gatccatgta ccaggggttc agagttggca 954" atcttctcag tgacagccat gacagctcgt tcacgctgag tttcctgcag actcttaaga 960" tctccggagt agtgaacaat gacctcattt tattttctat gttagttatt tatttcaaaa 966" gttacatttt agtttacttt tcgtctgtga agtctatgtt tcgcactgct gtttactctg 972" agggtttaac aatatttctc cagggtcccc tcaccgagga cccatgcagt ctacttaatg 978" ctgtgaatca ccaa atgtttaatt ttttaaagaa aattaatatt ctatttttgt 644644 984" taggcgtctc taggaatgca gcttttattt ctat ttctttccaa ctcgtaaaag 990" ccacacatta aaggtacaga aatgtcctat atggtggaag agacattgag aacggagttc 996" attgagtgtc agctcacttg tttctcactt caggcaagct gaacacacac aagatggcaa "002" tctcatggta gggt tggtccacac aatacactca aagagaaaca gtttctagcc "008" ttcttgccaa atccagacct ctggttgact tttctttcct aaaagatgga gttactgccc "014" agttctacag cttaaattta tttagccttt tatattattt tgttttaaag atgcaaagac "020" cttagaagaa ccag ccttcagtta taaccactcg accccatacg tcttattgtt "026" tatacagaat ccgatgggga aaacactttt ttttttcaaa actgtcactg atccactggc "032" tcct acacaatctg gcagcaccat cctgacccct gtcctcagtg atggccatgt "038" agccgaggac gtga ttgg ctttgcaggt agctcctgat ggca "O44" cccctcctac ctgtgtggct gatctgatct tgcttgttcc atacatacac aattagttta "050" gaggtagcca ccacgatgga ctatgtacat ttgtggtgag agctcaaaac cgacacagac "056" ttctagaaga tttgtgcata caatccttga tccaattgta tagattgact atttgagtgg "O62" aaggcgtttc caccctgttt aaatgatgga attctattgt gctagcactc ccgatcatga "O68" cccttttggt agtatttgta aacaaaattc tacagagact aaatcttaga gataatcctc "O74" catttcaatt ttaatcaatt tcct tttttccaaa ccccgaaccc catgcatgct "080" ttcccagtct tgtgatggga ctggacacag tgaccgaccc cctcgcctca aacaccattt "086" tccatggaat tcaaaagaaa aaaatttttt ttcttaacct tacatatcat agtgaatggt "O92" ttccccggtg tatatgaatg ttttaagtgt ttccaatagc ttatgaaatt taggagcttt "O98" ctaatactcg ttttataaat ttaatcattt ggaa attttaccac ctcccatttg ""04" aaat cttagctcct ggagcggcac tacaattcag gagttgtttt ttctcacctc ""10" ctctgtcatt tgtcacagga ggtccctgct tggcaatgac atttgtgagt taggataatg ""16" acgttccttc tctccttttt ttttcctttc atacttcaga tttaggagaa aaagattctg ""22" tttccacgtg agaggaactg taagctttta tcacgtaacc acaa cacaccaaaa ""28" tagg gatgagcacc gcgctttggt agcgattagg ttttattcac ctggtattaa ""34" tcac aaaa atccggaact tttaagaatt catttcaaag gcagcatcaa ""40" aaactgaaaa ggaagggaaa aaaaaacaac agctaataat cggcttctcc gcacgcgtgg ""46" cgaa actggagccc ngagaagtg gctctgctca cgcc cacgccgcgg "52" cggtccttgc tttccccgca tgcgcccgca ggcagcgtgc agtcctaagc ccggctgtgg .58" agaagctcac tctt gttctgaatg gtgt cggtctgcct ctgtgtatgg 645645 11641 tattatgtct tataatcctg catcacttcc atcctatcca gtcatatcta atgtagaaaa 11701 attagtttcc agtgaaagta agtg cttttatggt atttgtgtgc aatatcccct 11761 cttctattga ggatatttga tgtaaaggaa aaaaaaaaag gaaa ctgagttcca 11821 caataaaata caaagtggca aaagttcact tgtgtgttga gacatcaaaa aaaaaaaaaa 11881 aaaa Protein seguence: NCBI Reference Seguence: NP_062090.1 LOCUS: NP_062090 ACCESSION: NP_062090 XP_001061557 XP_215469 1 eate pepsgsignp aattspslsh rfldskfyll vvvgetvtee hlrraignie 6" lgirswdtnl iecnldqelk lfvsrhsarf spevpquil hhrsdvletv deav 12" stevrlmitd aarhkllvlt gqcfentgel ilqsgsfsfq nfieiftdqe igellstthp 18" tlfc peegdwknsn ldrhnlqdfi niklnsasil seft eylsesvevp 24" spfdileppt sggflklskp ccyifpggrg vngf nmlinggser kscfwklirh " ldrvdsillt higddnlpgi nsmlquiae leeersqgst snsdwmknli spdlgvvan 36" vpenlknpep nikmkrstee acfthylnk lsmkpeplfr svgnaiepvi lqumgvgkl 42" emyvlnpvks fmqq wtgtnkdkae lilpngqevd ipisyltsvs slivwhpanp 48" aekiirvlfp gnstqynile gleklkhldf lkqplatqkd ltgqutppv kqvklquad 54" sreslkpatk plssksvrke skeeapeatk asqvektpkv eskekvivkk dkpgkveskp 60" svtekevpsk eeqspvkaev aekaateskp kvtkdkvvkk eiktkpeekk eekpkkevak 66" kedktplkkd kpkk akk ikk ikk kk 1kk vkk tplkdakke vkkdekkevk 72" keekepkkei kkiskdikks tplsdtkkpa alkpkvakke piaa gklkdkgkvk 78" ktte aaatavgtaa vaaaagvaas gpakeleaer slmsspedlt kaee 84" idvakdikpq leliedeekl ketepgeayv iqketevskg saespdegit eceq 90" tpeelepvek qgvddi kf d gagf ss agdy ka t a p d gednvsgsas 96" khsptedeei akaeadvhik ekresvasgd draeedmdea eqse eegeeeedka 102" dhep dkteaedyvm avvdkaaeag vtedqydflg tpakquvqs psrepassih 108" detlpggses eatasdeenr edqpeeftat sgytqstiei sseptpmdem stprdvmtde 646646 1141 tnneetesps qefvnitkye pvvasfngls dgsktdatdg rdynasasti 1201 sppssmeedk fsksalrday rpeetdvktg aeldikdvsd erlspaksps lspsppspie 1261 ktplgersvn fsltpneika saegeatavv spgvtqavve ehcaspeekt levvspsqsv 1321 tpyy qsptdekssh 1ptevtenaq avpvsfefte akdenerssi spmdepvpds 1381 espiekvlsp lrsppligse sayedflsad dkalgrrses pfegkngkqg fsdkespvsd 1441 ltsdlyqqu eeksagfipi kedfspekka sdaeimssqs alalderklg 1501 squsfkedt kmsisegtvs dksatpvdeg vaedtyshme gvasvstasv atssfpeptt 1561 ddvspslhae tevd dslsvsvvqt pttfqetems prpm sisppdfspk 1621 pqu hrseqssmsi pehs lamdfsrqsp dhptvgagml hitengptev 1681 dyspsdiqu slshkippte epsytqdndl selisvsqve aspstssaht psqiasplqe 1741 dtlsdvvppr dmslyaslas ekvqslegek lspksdispl tpressptys pgfsdstsga 1801 kestaayqts ssppidaaaa epygfrssml fdtmqhhlal ssve kdnggktpgd 1861 fnyaqupes ttespdeedy dyeshektiq ahdvggyyye ktertikspc dsgysyetie 1921 kttktpedgg ysceitektt rtpeeggysy eisekttrtp evsgytyekt ersrrllddi 1981 sngyddtedg ghtlgdcsys yettekitsf pesesysyet spdt saycyetmek 2041 itktpqasty syetsdrcyt sear qdvdlclvss cefkhpktel spsfinpnpl 2101 ewfageepte esekpltqsg gapppsqgkq qgrchetpp tsvsesapsq tdsdvppete 2161 ecpsitadan idsedeseti tykh mdpppapqu rspsprhpdv smvdpealai 2221 eqnlgkalkk dlkekaktkk sssp vkkgdgkskp saaspkpgal kessdkvsrv 2281 aspkkkesve kamkttttpe vkatrgeekd ketknaanas asksvktata gpgttktaks 2341 lpvy 1dlcyipnhs nsknvdveff krvrssyyvv sgndpaaeep sravldalle 2401 gkaqwgsnmq vtlipthdse vmrequeth imv lassstvvmq desfpackie 2461 l Official Symbol: MDH1 Official Name: malate dehydrogenase 1, NAD (soluble) Gene ID: 4190 647647 Organism: Homo sapiens Other Aliases: MDH-s, MDHA, MGC:1375, MOR2 Other Designations: cytosolic malate ogenase; malate dehydrogenase, cytoplasmic; soluble malate dehydrogenase Nucleotide seguence: NCBI Reference Seguence: NM_001199111.1 LOCUS: NM_001199111 ACCESSION : NM_001199111 l ccttcgcgcc ctttggcaag ctcggactca tcttctgggg attgccgcag tgacccagta 6" aggg attgatttcc accttgcggg ggcg ctcttaggag gactctggag 12" aagtagttgt cctgggagag gagcgatctt aatcctgctg catgacggga ggacaaaatg 18" cgacgctgca gctattttcc aaaggacgtt acggtgtttg acga taagtctgaa 24" ccaatcagag tccttgtgac tggagcagct ggtcaaattg cact gctgtacagt " attggaaatg gatctgtctt tggtaaagat cagcctataa ttcttgtgct gttggatatc 36" acccccatga tgggtgtcct ggacggtgtc ctaatggaac tgcaagactg tgcccttccc 42" ctcctgaaag atgtcatcgc aacagataaa gaagacgttg ccttcaaaga tgtg 48" cttg tgggctccat gccaagaagg gaaggcatgg agagaaaaga tttactgaaa 54" gcaaatgtga aaatcttcaa atcccagggt ttag ataaatacgc caagaagtca 60" gttaaggtta ttgttgtggg taatccagcc aataccaact gcctgactgc ttccaagtca 66" tcca tccccaagga gaacttcagt tgcttgactc gtttggatca caaccgagct 72" aaagctcaaa ttgctcttaa acttggtgtg actgctaatg atgtaaagaa tgtcattatc 78" tggggaaacc attcctcgac tcca aacc atgccaaggt gaaattgcaa 84" ggaaaggaag ttggtgttta tgaagctctg aaagatgaca gctggctcaa gggagaattt 90" gtcacgactg tgcagcagcg tggcgctgct gtcatcaagg ctcgaaaact atccagtgcc 96" atgtctgctg caaaagccat ctgtgaccac gtcagggaca tctggtttgg aaccccagag 102" tttg tgtccatggg tgttatctct gatggcaact gtgt tcctgatgat 108" ctgctctact cattccctgt tgtaatcaag aataagacct ggaagtttgt tgaaggtctc 114" cctattaatg atttctcacg tgagaagatg gatcttactg caaaggaact gacagaagaa 648648 1201 agtg cttttgaatt tctttcctct gcctgactag acaatgatgt tactaaatgc 1261 ttcaaagctg aagaatctaa atgtcgtctt tgactcaagt accaaataat aataatgcta 1321 tacttaaatt acttgtgaaa aacaacacat tttaaagatt acgtgcttct tggtacaggt 1381 ttgtgaatga atcg tgtt agtgtgcatt aaat atatattcaa 1441 atgaaaaaaa aaaaaaaaaa a Protein seguence: NCBI Reference Seguence: NP_001186040.1 LOCUS: NP_001186040 ACCESSION: N P_001186040 1 mrrcsyfpkd vtvfdkddks epirvlvtga agqiayslly signgsvfgk dqpiilvlld 61 itpmmgvldg vlmelqdcal pllkdviatd kedvafkdld vailvgsmpr regmerkdll 121 kanvkifksq gaaldkyakk vgnp antncltask kenf scltrldhnr 181 akaqialklg vtandvknvi iwgnhsstqy pdvnhakvkl qgkevgvyea lkddswlkge 241 fvttqurga avikarklss amsaakaicd hvrdiwfgtp egefvsmgvi sdgnsygvpd 301 dllysfpvvi knktwkfveg lpindfsrek mdltakelte ekesafefls sa NHP2L1 al Symbol: NHP2L1 Official Name: HP2 non-histone some protein 2-like 1 (S. cerevisiae) m: 4809 Organism: Homo sapiens Other Aliases: CTA-216E10.8, 15.5K, FA-1, FA1, NHPX, OTK27, SNRNP15-5, SNU13, SPAG12, SSFA1 Other Designations: NHP2-like protein 1; U4/U6.U5 tri-snRNP 15.5 kDa protein; [U4/U6.U5] tri-snRNP 15.5 kD RNA binding protein; high mobility group-like nuclear protein 2 homolog 1; non-histone chromosome protein 2-like 1; small nuclear cleoprotein 15.5kDa (U4/U6.U5); sperm specific antigen 1 tide seguence: NCBI nce Seguence: NM_001003796.1 649649 LOCUS: NM_001003796 ACCESSION : NM_001003796 l gccgcgcggg gccatttccg ctgctgcttc tgtgagtttt tccggtgcac gcgagtgctt 6" ctgaaacgtc agctgcgctc ccctaggagt gctgagcccg cggaaccgca gccatgactg 12" aggctgatgt gaatccaaag gcctatcccc ttgccgatgc ccacctcacc aagaagctac 18" tggacctcgt tcagcagtca tgtaactata ttcg gaaaggagcc aatgaggcca 24" ccaaaaccct caacaggggc atctctgagt tgat ggctgcagac gccgagccac " tcat tctgcacctg ctgt gtgaagacaa gaatgtgccc tacgtgtttg 36" tgcgctccaa gcaggccctg gggagagcct gtggggtctc caggcctgtc atcgcctgtt 42" ctgtcaccat aggc ctga aacagcagat ccaatccatt cagcagtcca 48" ttgaaaggct cttagtctaa acctgtggcc tctgccacgt gctccctgcc agcttccccc 54" ctgaggttgt tatt atctgtgtta gcatgtagta ttttcagcta ctctctattg 60" ttataaaatg tagtactaaa tctggtttct ggatttttgt gttgtttttg ttctgtttta 66" cagggttgct atcccccttc ctttcctccc tccctctgcc atccttcatc cttttatcct 72" ccctttttgg aacaagtgtt agac agaagcaggg tggtggcacc gttgaaaggc 78" agaaagagcc aggagaaagc tgatggagcc aggacagaga tctggttcca gctttcagcc 84" ttcc tgttgtgtgc ggggtgtggt ggaattaaac agcattcatt gtgtgtccct 90" gtgcctggca cacagaatca ttcatacgtg ttcaagtgat caaggggttt catttgctct 96" tgggggatta attt gaag catgtgttct gtgaggttgt tcggctatgt "02" ccaagtgtcg tttactaatg tacccctgct gtttgctttt ggtaatgtga tgttgatgtt "08" ctccccctac ccacaaccat gcccttgagg gtagcagggc agcagcatac caaagagatg "14" tgctgcagga ctccggaggc agcctgggtg ggtgagccat ggggcagttg acctgggtct "20" tgaaagagtc gggagtgaca agctcagaga gcatgaactg atgctggcat gaaggattcc "26" aggaagatca tggagacctg gctggtagct gtaacagaga agtc aaca "32" gcctgtctct ggtgaatggg actttctttg gtggacactt ggcaccagct ctgagagccc "38" ttcccctgtg tcctgccacc atgtgggtca gatgtactct ctgtcacatg aggagagtgc "44" tagttcatgt catt cttgtgagca tcctaataaa tctgttccat tttgatgaca "50" aaaaaaaaaa aaaaaaaaaa 650650 Protein seguence: NCBI Reference Seguence: NP_001003796.1 LOCUS: NP_001003796 ACCESSION: N 03796 1 mteadvnpka ypladahltk klldlqusc nykqlrkgan eatktlnrgi sefivmaada 61 epleiilhlp llcedknvpy varskqalg racgvsrpvi kegs qlquiqsiq 121 qsierllv OLA1 Official Sym bol: OLA1 Official Name: Obg-like ATPase 1 m: 29789 Organism: Homo sapiens Other Aliases: PTD004, DOC45, GBP45, GTBP9, GTPBP9 Other Designations: DNA damage-regulated overexpressed in cancer 45 protein; GTP-binding protein 9 (putative); GTP-binding protein PTD004; homologous yeast-44.2 protein; obg-like ATPase 1 Nucleotide ce: NCBI Reference Seguence: NM_001011708.1 LOCUS: NM_001011708 ION : NM_001011708 1 ggtctgcgcg ccgc tcggcgcccg gcccgcccgt tccgccgctg tcgccgccgt 61 cgtgcgtgcc gctcggcgga ggggacgggc tctc tcctccttcc tccccgcctc 121 cagctgccgg caggaccttt ctctcgctgc cgctgggacc ccgtgtcatc gcccaggccg 181 agcacggaaa tctactttct tcaatgtgtt tagt caggcttcag cagaaaactt 241 cccgttctgc actattgatc ctaatgagag cagagtacct gtgccagatg aaaggtttga 651651 " ctttctttgt caataccaca aaccagcaag caaaattcct gcctttctaa atgtggtgga 36" tattgctggc cttgtgaaag gagctcacaa tgggcagggc ctggggaatg cttttttatc 42" tcatattagt gcctgtgatg gcatctttca tctaacacgt gcttttgaag atat 48" cacgcacgtt gaaggaagtg tagatcctat tcgagatata gaaataatac atgaagagct 54" tcagcttaaa gatgaggaaa tgattgggcc cattatagat aaactagaaa aggtggctgt 60" gagaggagga gataaaaaac taaaacctga atatgatata atgtgcaaag cctg 66" ggttatagat caaaagaaac ctgttcgctt ctatcatgat gaca ttga 72" agtgttgaat aaacacttat ttttgacttc aaaaccaatg gtctacttgg ttaatctttc 78" tgaaaaagac tacattagaa agaaaaacaa atggttgata aaaattaaag agtgggtgga 84" caagtatgac gctt tggtcattcc ttttagtggg gccttggaac tcaagttgca 90" agaattgagt gctgaggaga agta tctggaagcg aacatgacac cttt 96" gccaaagatc attaaggctg ggtttgcagc actccaacta gaatactttt tcactgcagg "O2" cccagatgaa gtgcgtgcat tcag gaaagggact aaggctcctc aggctgcagg "08" aaagattcac acagattttg aaaagggatt cattatggct atga aatacgaaga "l4" ttttaaagag gaaggttctg cagt caaggctgct ggaaagtaca gacaacaagg "20" cagaaattat attgttgaag atggagatat tatcttcttc aaatttaaca cacctcaaca "26" accgaagaag aaataaaatt tagttattgc tcagataaac atacaacttc caaaaggcat "32" ttta aaaaattaaa atttctgaaa accaatgcga caaataaagt gatg "38" ggaatctttg acaaacaaat tatt tgttttaaaa ttaaaatact cccc "44" ccccccccat gaaatgcagg ttcactaaat gtgaacagct ttgcttttca cgtgattaag "50" accctactcc aaattgtaga tcag gaaccatatt atga tacttcatta "56" atctccatca tgtatgccaa gcctgacaca tttgacagtg aggacaatgt ggcttgctcc "62" tttttgaatc tacagataat gcatgtttta cagtactcca gatgtctaca ctcaataaaa "68" catttgacaa aaccagcctt ggtgtgtttg gggatgtctg tattgactga ctgtggtgtg "74" ctgaatgcga tacggcacct ggtggttgct gattacagaa ttttaaggcg tgtgtatgac "80" acagtaactg gcagtgtggg gcagctgcaa ttgtatctta aaagtgagtc tttcatggga "86" atcagaagaa tagtatacca gggatttgtc tcaaaaaagt taattaattt atagatcaac "92" atttattgaa agatgatagc atta tgagggatat gagataggtg caca "98" aagaagaaaa ttctgtctca aaaattaaag aattttgttt gttattgttg ttctgaccat 204" attgaaaaga ggttcacttt taatctttcc tttgaaatta ttaaattgta aaaactgacc 652652 210" cattgatgtc tggtgggtta tgttttgctt caattcagca atgtgtataa aagttcctac 216" actgattttg aaatactaag cttt gaaaatttta aaaattcttt cattcttact 222" tttcagatat ttgagagcag tcgaagtggt agtatgggta agttaagcac acat 228" agaa aattccaaaa gaaagactag aacaaattgg taatttggag accctttgcc 234" cctt ctcttgtgat gatggcctga tctc ggccctgagc ctccctttct 240" ctcccatgtt tccattcctg tgagacctcc cttccttgtc cttgtcctct gcccttttgt 246" gctcttctgt agga ccatttccat ggaatgcaga ttcataccca gccagccttt 252" gcctctactg tatcgctttg ggactttaaa gatgcagaag tattagaaca cacacaaact 258" taaaatggaa agttaccaaa tgtagcacta gaggcaagaa aaggggtgat tttttaaaga 264" tttggctata cttaagatat agga tgaggttgag tcgtgagtat aatttagaag 270" agtg agtatttttt tagtatcaaa tagtgatctt cttttgctaa cataaaaatg 276" agtaaactac agcagagaaa taagaataaa actaatgaag ggttttttaa aagaactaaa 282" taaaacaaat aaaactaggc tacaggtata gctgaccaag tttgctgtga aaaatccttg 288" ttatatctat ttctgtattg gagtaatgtg agtg gattgttagg gatttcagca 294" gtataagaaa gaatcaaata tcttgctttt gggtcttctc caaagtaaat tgcatggact 300" tcttaaagaa gtttgaaatc agtaaaaaga cattaggatc atgcaatttt actttgtcta 306" gcttctaatg attg gtgc aacagaacca ttaataaaag aaaggagtta 312" gttttgaggt aactgcattc attaggggct gaaaataaat gttaaaaatg tttaatttgg 318" ttagaagttc tacaacattg tatgtacata tacaaaattt ctct gaaagtcgag 324" aattccaaga actgttacat ataatcatat atatgcctat atctaaacgt tttccctcct 330" ttaatattga taaagttaat atattttagg atgttatgta aatatttttg catgtatctg 336" tagctccatc ttaatcaaat ttatggttaa aagagatttg aggccaggcg tggaggctca 342" cacctataat cccaacactt tgggaggccg aggtgggagg atcacttgag cacaggagtt 348" tgagaccagc ctgtgcaaca tcatgaaaca tcatctctac aaat acatatatac 354" aaaaattagc tgggcatcat ggcacacacc tgtagtccca gctacttagg aggctgagat 360" gggaggattg atcgatcgct tgagcctggg aggc tactgtgagc catgatcatg 366" ccactgcact ccagcctggg caacagagtg agaccctgtc tccaaagaga aact 372" tgagaaggct tgtgcccaaa gcttgaagaa taat agcatttagt ttat 378" catgttatct tcataacatg cattttgcat atggctcaga gcagagccgg atctcccaag 384" gaagcacaaa tagtttttgt cgctaactta gttatgagtg aagcctctgt tcacttataa 653653 3901 cttgccagtt tcattggtgg aataagtccc cttactcatg attcatcaat attcctatat 3961 taaaattcca gtttgatgtt tgtctctagt tgtctgtgta taatatttcc aaagctcctt 4021 tgtaaacccc gtctcctttc attcagagag tcagagacgg ctcc ctct 4081 tctattgttc tgattagtct acttaaattc acttgatgct ctttttattt tacaagagac 4141 ttgagaaatg tcggttgttc ctaatggcaa ttag ctgggaaaat tcatttcaac 4201 taag cttgtaaaat acactgtggt gataaaataa aagagctggg tttga Protein seguence: NCBI Reference Seguence: NP_001011708.1 LOCUS: NP_001011708 ACCESSION: NP_001011708 1 migpiidkle kvavrggdkk lkpeydimck vkswviquk pvrfyhdwnd keievlnkhl 61 fltskpmvyl vnlsekdyir kknkwlikik ewvdkydpga lvipfsgale lquelsaee 121 quyleanmt qsalpkiika gfaalqleyf ftagpdevra wtirkgtkap htdf 181 ekgfimaevm kyedfkeegs enavkaagky yive dgdiiffkfn POFUT1 Official Symbol: POFUT1 Official Name: protein O-fucosyltransferase 1 Gene ID: 23509 Organism: Homo sapiens Other Aliases: FUT12, O-FUT, O-Fuc-T, O-FucT-1 Other Designations: GDP-fucose protein O-fucosyltransferase 1; o- fucosyltransferase protein; peptide-O-iucosyltransferase 1 Nucleotide seguence: NCBI Reference Seguence: NM_015352.1 654654 LOCUS: NM_015352 ION : 352 l cttccctccc cgactgtgcg ccgcggctgg ctcgggttcc cgggccgaca tgggcgccgc 6" cgcgtgggca ctga gcgtgtcttt cctgctgctg cttctgccgc tcccggggat l2" gcctgcgggc tcctgggacc cggccggtta cctgctctac tgcccctgca tggggcgctt 18" tgggaaccag gccgatcact tcttgggctc tctggcattt ctgc taaaccgtac 24" cttggctgtc cctccttgga ttgagtacca gcatcacaag cctcctttca ccaacctcca " tgtgtcctac cagaagtact tcaagctgga gcccctccag gcttaccatc gggtcatcag 36" cttggaggat ttcatggaga agctggcacc cacccactgg ccccctgaga agcgggtggc 42" atactgcttt gaggtggcag cccagcgaag cccagataag aagacgtgcc ccatgaagga 48" aggaaacccc tttggcccat tctgggatca gtttcatgtg agtttcaaca agtcggagct 54" ttttacaggc atttccttca gtgcttccta cagagaacaa tggagccaga gattttctcc 60" aaaggaacat ccggtgcttg ccctgccagg agccccagcc cagttccccg tcctagagga 66" acacaggcca ctacagaagt acatggtatg gtcagacgaa atggtgaaga cgggagaggc 72" ccagattcat gcccaccttg tccggcccta tgtgggcatt cgca ttggctctga 78" gaac gcctgtgcca tgctgaagga cgggactgca ggctcgcact tcatggcctc 84" tccgcagtgt gtgggctaca gccgcagcac agcggccccc ctcacgatga ctatgtgcct 90" gcctgacctg aaggagatcc agagggctgt gaagctctgg gtgaggtcgc tggatgccca 96" gtcggtctac gttgctactg attccgagag ttatgtgcct gagctccaac agctcttcaa "O2" agggaaggtg gtga gcctgaagcc tgaggtggcc caggtcgacc tgtacatcct "08" cggccaagcc gaccacttta ttggcaactg tgtctcctcc ttcactgcct ttgtgaagcg "l4" ggagcgggac ctccagggga ggccgtcttc tttcttcggc atggacaggc cccctaagct "20" gcgggacgag ttctgattct ggccggagca ccagaccctc tgga gggaccagag "26" ctgg tccttccagc caggcctggc agccagaggt gctccgggat tgcaaactcc "32" tcttctcacc tgccaaagat ggagaagagt gccagggacc cctcaaggag ggagacgctc "38" catatcccag ggcataggac ttgcaggttc gcag gagcatctcc catcgcacgt "44" gctttctgct ggaa acac tggcaaagca cctc ctgg "50" tccactctgc tctgagcagc ctgggatgct gaactcttca tttt tttatagaga 655655 "56" gatttctata attttgatac aaggtcatga ctatcctaga actctctgtg gtttttgaaa "62" atcattgaat tctattaatg taggtaccta aagtgacctt atgt ggatgaggct "68" ggggctggtg tgggtctttt ggctgctttt caaggtgtcc cccaatgtgg ccctcaagag "74" ccatccccac tgcctggcca gagccattgt tgtcccctac ttcctaggcc atttctgggg "80" cttgggggat gaatgctgtc ctgtgctgta aacactatgc aaatggaagt tatcggttgt "86" ggtgctgtgc agcgctctgt gggcgactaa gtgccactca atgt tcctggcaag "92" gagcacatac catcaagcca cactatcatg gtattgttct cacagtcttt tggtggttga "98" tggccactgc aaacctggca ccatcagatc tcttctgatc ccca gtggggcctg 204" agaa tgttggcatt cggttgatat ccaaagcctg ttctcccagc cgtcctcctg 210" cagctggagc cttcaggccg tattctcacg agggaacgtt tgccaaggct ctgacctcac 216" agaagatgcc cagggcccag aagccatcag aattatcagt ggagaagcac cttttgactc 222" ttcccttcca atgtaatctc tgccaacacc ctta aggtgctcta agtcatgagt 228" gttttggtct caaatgctgc agttttaata atctgtgact agcc catggttttt 234" tgaccttgtg gttctaaaat tccttgtctg gtag atcttttcct tgccatgtca 240" ttgg cctttgatcc tggaaaggtg gcagagcctc cactgagcca ggcccagagc 246" tccttgcagt gccttcttcc ttgtttacct gtgggaggaa acactttttt tgtcaggggc 252" agcctggttc agagctcaga ggtcacactg tatcaaagat ctcaaacagc aaagtcagca 258" gtat agagctgcca cccaactcta agcaggagaa actgtacaga aagggctttg 264" ttcc cttttgggaa aacaatgaag tgttttaagt cctgggtgga ctgagagatg 270" gtttgcctgt ccagacttgc tctcaagcct catccagaga aggagctgca ggag 276" cccgtacact ccctgccacc actaggttgt aagcctgtag ctggctggct gatttcattt 282" tggaattcat ttgccatcca taca ctaggcacac gagt ctggggctcc 288" agtggggccc gcctaatttt ttttcccccc aagacagggc cttgctctgt ctcccaggct 294" ggagtgcagt ggcatgatca tggcttactg cagccttgat ctcccaggct atcc 300" ttctgcctca gcctctctgg tagctgagac ccca gctccaaatc accttgattc 306" atatcagcag taataatcac ttgtgttctg aaagaaaggg caccagaagt tctagcaaaa 312" ttcagttgtg ttctgtgagc tagcactttt tcctctgacc caattttctt acctataaaa 318" tggtgataaa aaccgacagg ttgttcaaag gcccagatca gctaaagcat gtatataaga 324" gcacgttgta aaag agacaaaggc acaaatgtgg ctgttgatta atttgactgc 330" ttctcgttgc tcgtcacctc catgccaggc actgtgcttg gctt tatgggggca 656656 336" ttctcttatt tattccccag ccctgggaaa taggagctgt cattatcctt ctctttctgc 342" acaaggaaaa attaatgccc tgagaattgt cataattttc ccaaggctgc ccagctggtg 348" gtgttaagcc agaatttgac ctcccagagc cagtttccat tagctgccat gctctgctgc 354" ctctaattca cagaatgcac ccct gtgtgccatg gagacctcct atggaaaaat 360" gatcagccac cttaccttct actgggtacc tgctgtgagt ctgcctatgc cagaaggatt 366" aaggagggga ggttacccaa aagc ctacatgccg cttacagccc gatg 372" gttgctcagt gtct tgcattcagc aggtgtttgt tcatcaccta ctatgtgtca 378" ggctctatgc taggtactgg ggatacagga caag cgtaaagtct ttgttctcaa 384" ggaatttgca ttctagaaag tagaagatgt aataaatgta ctgtgggaca tgttaataag 390" tgctataaag aaatataaag ggtttgggag caaaaagagg gagtggatct attttagatg 396" agcccaggta agacctctct ctgt catgaaggag ggagggagca cattcctggc 102" agagaaaaca gcaa aggccccgag actggagtgt gttcctgaag agcagccagg 108" aggccagcat ggctggagag gcaggcatag gcagggaacc cagg tcagagcagg 114" gaca agcc tggacggcca tggcaggaag cttttagttg gagagataca Z20" ggaagcctcc tagggttctg agcagaagag gagc tgattcacat tctgaaggac Z26" ctctctagct ggccagtgct gaggaggttg gagagagaaa gggtgaaagc agagagacca Z32" gtgcagggct gttaacaggg ttgcaggcga gagactgggg tgctgggctc ccctagacta Z38" ggactccagt gccctcctct cccaagagac catt gcattgaagg aggtgggaaa Z44" tgattagatt ctgaacatat gtaattattt ttcagtcttt ttcaaagata caaatattta 150" catagtttta atcatgtaat atatacaatt taatgtccta gtgttttact tgta 156" tcatgttttc cctgttggta tgtagcctgg ataaatgctc ttaattataa aaaattctgt Z62" cgaggagtgt tccatagttt attgttttcc tattatgaga atttaggcca agtgtggtgg Z68" cctg taatcccagc actttgcgag gccgaggtgg gcagatcact tgaggtgagg Z74" agttcaagac cagcctggcc aacatggtga attatctcta ctaaaaatac aaaaaaataa Z80" taataatagc caggcgtggt ggcacatgcc tgtattccca gctgcttggg aggctgaggc Z86" aggagaatgg cttgaacctg ggaggtggag gttgcagtga gccgagatgg tgccactgca Z92" ttccagcctg ggcaacagag cgagactcca tctcaaaaaa aaggagactt catgtgcccc Z98" caatttttca ttat ttgaaaaaat atttttattt gttt ttctttattt 504" aaaatgttca ttaataaagt tgttggacgg gaagcaaaaa aaaaaagttg tttaagataa 510" agaa gtgaatttgt tagatcaaac acttaaaact ttttgttatg gaagaattca 657657 5161 aatataaata aaaaattgtg agtaataaaa tgaactcaca gtttcaacaa tgacccacaa 5221 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_056167.1 LOCUS: NP_056167 ACCESSION: N P_056167 1 mgaaawarpl svsflllllp lpgmpagswd ycpc mgrfgnqadh flgslafakl 61 lnrtlavppw iethhkppf quy fkleplqayh rvisledfme klapthwppe 121 krvaycfeva aqrspdkktc pmkegnpfgp fwdqfhvsfn kselftgisf sasyreqwsq 181 rfspkehpvl alpgapaqu vleehrplqk ymvwsdemvk tgeaqihahl vrpyvgihlr 241 igsdwknaca mlkdgtagsh fmaqucvgy pltm tmclpdlkei qravklwvrs 301 ldaqsvyvat dsesyvpelq vkvv slkpevaqu lyilgqadhf igncvssfta 361 karerdlqg rpssffgmdr ppklrdef Official Symbol: PRKDC al Name: n kinase, DNA-activated, catalytic polypeptide m: 5591 Organism: Homo sapiens Other Aliases: DNA-PKcs, DNAPK, DNPK1, HYRC, HYRC1, XRCC7, p350 Other Designations: DNA-PK catalytic subunit; DNA-dependent protein kinase catalytic subunit; hyper-radiosensitivity of murine scid mutation, complementing 1; p460 tide seguence: NCBI Reference Seguence: NM_001081640.1 LOCUS: NM_001081640 ACCESSION : NM_001081640 658658 l ggggcatttc cgggtccggg ccgagcgggc gcacgcgcgg gagcgggact ngcggcatg 6" gcgggctccg gagccggtgt gcgttgctcc ctgctgcggc tgcaggagac cttgtccgct l2" gcggaccgct gcggtgctgc cctggccggt catcaactga tccgcggcct ggggcaggaa 18" tgcgtcctga gcagcagccc Cgcggtgctg gcattacaga tagt tttttccaga 24" gatttcggtt tgcttgtatt tgtccggaag tcactcaaca gtattgaatt tcgtgaatgt " agagaagaaa agtt tttatgtatt ttcttagaaa aaatgggcca gaagatcgca 36" ccttactctg ttgaaattaa gaacacttgt gttt atacaaaaga tagagctgct 42" aaatgtaaaa ttccagccct ggaccttctt attaagttac cttt tagaagttct 48" agactcatgg atgaatttaa aattggagaa ttatttagta aattctatgg agaacttgca 54" ttgaaaaaaa aaataccaga tacagtttta gaaaaagtat atgagctcct aggattattg 60" gttc atcctagtga aaat aatgcagaaa tccg cgcttttctg 66" ggtgaactta agacccagat gacatcagca gtaagagagc ccaaactacc tgttctggca 72" ggatgtctga aggggttgtc ctcacttctg tgcaacttca ctaagtccat ggaagaagat 78" ccccagactt caagggagat ttttaatttt gtactaaagg caattcgtcc tcagattgat 84" ctgaagagat atgctgtgcc ctcagctggc ttgcgcctat tgca tgcatctcag 90" tttagcacct gccttctgga caactacgtg tctctatttg aagtcttgtt aaagtggtgt 96" gcccacacaa atgtagaatt gaaaaaagct tcag aatc ctttctgaaa "O2" caggtttcta atatggtggc gaaaaatgca gaaatgcata aaaataaact gcagtacttt "08" atggagcagt tttatggaat catcagaaat gtggattcga acaacaagga gttatctatt "l4" gctatccgtg gatatggact ttttgcagga ccgtgcaagg ttataaacgc aaaagatgtt "20" gacttcatgt acgttgagct cattcagcgc tgcaagcaga tgttcctcac ccagacagac "26" actggtgacg accgtgttta tcagatgcca agcttcctcc agtctgttgc aagcgtcttg "32" ctgtaccttg acacagttcc tgaggtgtat gttc tggagcacct gatg "38" cagatagaca gtttcccaca gtacagtcca aaaatgcagc tggtgtgttg cata "44" gtgt tcctagcttt ggcagcaaaa gggccagttc tcaggaattg cattagtact "50" gtggtgcatc taat cagaatatgt tctaaaccag tggtccttcc aaagggccct "56" gagtctgaat ctgaagacca ccgtgcttca ggggaagtca gaactggcaa ggtg "62" cccacataca aagactacgt ggatctcttc agacatctcc tgagctctga ccagatgatg "68" gattctattt tagcagatga agcatttttc tctgtgaatt cctccagtga aagtctgaat 659659 174" catttacttt atgatgaatt tgtaaaatcc gttttgaaga ttgttgagaa attggatctt 180" gaaa tacagactgt tggggaacaa gagaatggag atgaggcgcc tggtgtttgg 186" atgatcccaa cttcagatcc agcggctaac ttgcatccag ctaaacctaa agatttttcg 192" gctttcatta tgga attttgcaga gagattctcc ctgagaaaca agcagaattt 198" tttgaaccat actc attttcatat gaattaattt tgcaatctac aaggttgccc 204" ctcatcagtg gtttctacaa attgctttct attacagtaa gaaatgccaa gaaaataaaa 210" tatttcgagg gagttagtcc aaagagtctg aaacactctc ctgaagaccc agaaaagtat 216" tcttgctttg ctttatttgt tggc aaagaggtgg cagttaaaat gaagcagtac 222" aaagatgaac ttttggcctc ttgtttgacc tttcttctgt ccttgccaca caacatcatt 228" gaactcgatg ttagagccta cgttcctgca ctgcagatgg ctttcaaact gggcctgagc 234" tataccccct aagt aggcctgaat gctctagaag aatggtcaat ttatattgac 240" gtaa tgcagcctta ttacaaagac attctcccct gcctggatgg atacctgaag 246" acttcagcct tgtcagatga gaat aactgggaag tgtcagctct ttctcgggct 252" gcccagaaag gatttaataa agtggtgtta aagcatctga agaagacaaa gaacctttca 258" gaag caatatcctt aata agag tagtacaaat gcttggatct 264" ctaggaggac aaataaacaa aaatcttctg acagtcacgt cctcagatga gatgatgaag 270" agctatgtgg acag agagaagcgg ctgagctttg cagtgccctt tagagagatg 276" aaacctgtca ttttcctgga cctg cctcgagtca cagaattagc gctcacagcc 282" agtgacagac aaactaaagt tgcagcctgt gaacttttac atagcatggt tatgtttatg 288" ttgggcaaag agat gccagaaggg ggacagggag ccccacccat gtaccagctc 294" tataagcgga cgtttcctgt gctgcttcga cttgcgtgtg atgttgatca ggtgacaagg 300" caactgtatg agccactagt tatgcagctg attcactggt tcactaacaa caagaaattt 306" gaaagtcagg atactgttgc agaa ttgg atggaattgt ggaccctgtt 312" actt taagagattt ttgtggtcgg tgtattcgag aattccttaa atggtccatt 318" aagcaaataa caccacagca gcaggagaag agtccagtaa acaccaaatc gcttttcaag 324" cgactttata gccttgcgct tcaccccaat gctttcaaga ggctgggagc atcacttgcc 330" tttaataata tctacaggga attcagggaa tctc tggtggaaca gtttgtgttt 336" gaagccttgg tgatatacat ggagagtctg gccttagcac atgcagatga gaagtcctta 342" ggtacaattc aacagtgttg tgatgccatt gatcacctat tcat tgaaaagaag 348" catgtttctt taaataaagc aaagaaacga cgtttgccgc gaggatttcc accttccgca 660660 354" tgtt tattggatct ggtcaagtgg cttttagctc attgtgggag gccccagaca 360" gaatgtcgac acaaatccat tgaactcttt tataaattcg ttcctttatt gccaggcaac 366" agatccccta atttgtggct gaaagatgtt ctcaaggaag aaggtgtctc ttttctcatc 372" aacacctttg aggggggtgg ctgtggccag ccctcgggca tcctggccca gcccaccctc 378" ttgtaccttc gggggccatt cagcctgcag gccacgctat gctggctgga cctgctcctg 384" gccgcgttgg agtgctacaa cacgttcatt ggcgagagaa ctgtaggagc gctccaggtc 390" actg aagcccagtc ttcacttttg aaagcagtgg ctttcttctt agaaagcatt 396" gccatgcatg acattatagc agcagaaaag tgctttggca ctggggcagc aggtaacaga 102" acaagcccac gaga aaggtacaac tacagcaaat gcaccgttgt ggtccggatt 108" atggagttta ccacgactct gctaaacacc tccccggaag gatggaagct cctgaagaag 114" gacttgtgta atacacacct agtc caga cgctgtgtga gcccgcaagc 120" ataggtttca acatcggaga cgtccaggtt atggctcatc ttcctgatgt ttgtgtgaat 126" ctgatgaaag ctctaaagat gtccccatac aaagatatcc tagagaccca tctgagagag 132" aaaataacag cacagagcat tgaggagctt tgtgccgtca acttgtatgg ccctgacgcg 138" caagtggaca ggagcaggct ggctgctgtt gtgtctgcct gtaaacagct tcacagagct Z44" gggcttctgc ataatatatt tcag tccacagatt attc tgttggcaca 150" gaacttcttt ttta taaaggcatt gcccctggag atgagagaca gtgtctgcct 156" tctctagacc tcagttgtaa ggcc agcggacttc tggagttagc ctttgctttt Z62" ctgt gtgagcgcct tgtgagtctt ctcctgaacc cagcggtgct gtccacggcg Z68" tccttgggca gctcacaggg cagcgtcatc cacttctccc atggggagta tttctatagc Z74" ttgttctcag aaacgatcaa atta ttgaaaaatc tggatcttgc tgtattggag 180" ctcatgcagt cttcagtgga taataccaaa atggtgagtg tgaa cggcatgtta 186" gaccagagct tcagggagcg agcaaaccag aaacaccaag gactgaaact tgcgactaca Z92" attctgcaac actggaagaa gtgtgattca tggtgggcca aagattcccc tctcgaaact Z98" aaaatggcag tgctggcctt actggcaaaa caga ttgattcatc tgtatctttt 504" aatacaagtc atggttcatt agtc tttacaacat atattagtct acttgctgac 510" ctgg attt aaagggccaa gctgtcactc ttcttccatt cttcaccagc 516" ctcactggag gcagtctgga taga cgtgttctgg agcagctcat cgttgctcac 522" ttccccatgc agtccaggga atttcctcca ggaactccgc ggttcaataa ttatgtggac 528" tgcatgaaaa agtttctaga tgcattggaa ttatctcaaa gccctatgtt gttggaattg 661661 534" atgacagaag ttctttgtcg ggaacagcag catgtcatgg aagaattatt tcaatccagt 540" ttcaggagga ttgccagaag gggttcatgt gtcacacaag taggccttct ggaaagcgtg 546" tatgaaatgt tcaggaagga tgacccccgc ctaagtttca cacgccagtc ctttgtggac 552" cgctccctcc tcactctgct gtggcactgt agcctggatg ctttgagaga attcttcagc 558" acaattgtgg ccat tgatgtgttg aagtccaggt ttacaaagct aaatgaatct 564" acctttgata tcac caagaagatg ggctactata agattctaga cgtgatgtat 570" tctcgccttc atga tgttcatgct aaggaatcaa aaattaatca agttttccat 576" ggctcgtgta ttacagaagg aaatgaactt acaaagacat tgattaaatt gtgctacgat 582" gcatttacag agaacatggc aggagagaat cagctgctgg gaag actttaccat 588" tgtgcagcat acaactgcgc catatctgtc atctgctgtg tcttcaatga gttaaaattt 594" taccaaggtt ttag tgaaaaacca gaaaagaact tgcttatttt tctg 600" atcgacctga agcgccgcta taattttcct gtagaagttg aggttcctat ggaaagaaag 606" aaaaagtaca ttgaaattag gaaagaagcc agagaagcag caaatgggga ttcagatggt 612" ccttcctata tgtcttccct gtcatatttg agta ccctgagtga ggaaatgagt 618" caatttgatt tctcaaccgg agttcagagc tattcataca aaga ccctagacct 624" gccactggtc gttttcggag acgggagcag cgggacccca cggtgcatga tgatgtgctg 630" gagctggaga tggacgagct caatcggcat gagtgcatgg cgcccctgac ggccctggtc 636" aagcacatgc acagaagcct gggcccgcct caaggagaag aggattcagt gccaagagat 642" cttccttctt ggatgaaatt cctccatggc aaactgggaa atccaatagt accattaaat 648" atccgtctct ccaa gcttgttatt aatacagaag aggtctttcg cgcg 654" aagcactggc ttagcccctt gctg gctgcttctg aaaacaatgg aggagaagga 660" taca tggtggttga gatagtggcc actattcttt catggacagg cttggccact 666" ccaacagggg tccctaaaga tgaagtgtta gcaaatcgat tgcttaattt cctaatgaaa 672" catgtctttc atccaaaaag agctgtgttt agacacaacc ttgaaattat aaagaccctt 678" gtcgagtgct ggaaggattg tttatccatc ccttataggt taatatttga aaagttttcc 684" ggtaaagatc ctaa agacaactca gtagggattc aattgctagg catcgtgatg 690" gccaatgacc tgcctcccta tgacccacag tgtggcatcc agagtagcga atacttccag 696" gtga tgtc aaga gaag tgtatgccgc agaa 702" gttctaggac ttatacttcg atatgttatg gagagaaaaa acatactgga ggagtctctg 708" tgtgaactgg ttgcgaaaca attgaagcaa catcagaata ctatggagga caagtttatt 662662 714" gtgtgcttga tgac caagagcttc cctcctcttg cagacaggtt catgaatgct 720" gtgttctttc caaa atttcatgga gtgttgaaaa cactctgtct ggta 726" ctttgtcgtg tggagggaat gacagagctg tacttccagt taaagagcaa ggacttcgtt 732" atga gacatagaga tgatgaaaga caaaaagtat gtttggacat aatttataag 738" atgatgccaa aacc actc cgagaacttc tgaaccccgt tgtggaattc 744" gtttcccatc cttctacaac atgtagggaa caaatgtata atattctcat gtggattcat 750" taca gagatccaga aagtgagaca gataatgact cccaggaaat atttaagttg 756" gcaaaagatg tgctgattca aggattgatc aacc ctggacttca attaattatt 762" cgaaatttct ggagccatga aactaggtta ccttcaaata ccttggaccg gttgctggca 768" ctaaattcct tatattctcc taagatagaa gtgcactttt taagtttagc aacaaatttt 774" ctgctcgaaa tgaccagcat gagcccagat aacc ccatgttcga gcatcctctg 780" tcagaatgcg aatttcagga atataccatt gattctgatt ggcgtttccg aagtactgtt 786" ctcactccga tgtttgtgga gacccaggcc tcccagggca ctctccagac ccgtacccag 792" gaagggtccc tctcagctcg ctggccagtg gcagggcaga taagggccac ccagcagcag 798" catgacttca cactgacaca gactgcagat ggaagaagct attg gctgaccggg 804" agcagcactg acccgctggt cgaccacacc agtccctcat ctgactcctt gctgtttgcc 810" cacaagagga gtgaaaggtt acagagagca cccttgaagt cagtggggcc tgattttggg 816" aaaaaaaggc tgggccttcc aggggacgag gtggataaca aagtgaaagg tgcggccggc 822" cggacggacc tactacgact acgg tttatgaggg accaggagaa gctcagtttg 828" atgtatgcca gaaaaggcgt tgctgagcaa aaacgagaga aggaaatcaa gagtgagtta 834" aaaatgaagc aggatgccca ggtcgttctg tacagaagct accggcacgg tcct 840" gacattcaga tcaagcacag catc accccgttac aggccgtggc ccagagggac 846" ccaataattg caaaacagct cagc ttgttttctg gaattttgaa agagatggat 852" aaatttaaga cactgtctga aaaaaacaac atcactcaaa agttgcttca agacttcaat 858" cgttttctta ataccacctt ctctttcttt ccaccctttg tctcttgtat catt 864" agctgtcagc acgcagccct gctgagcctc gacccagcgg ctgttagcgc tggttgcctg 870" gccagcctac agcagcccgt gggcatccgc ctgctagagg aggctctgct ccgcctgctg 876" cctgctgagc tgcctgccaa gcgagtccgt gggaaggccc ctcc tgatgtcctc 882" agatgggtgg agcttgctaa taga tcaattggag aatacgacgt cctccgtggg 888" atttttacca gtgagatagg aacaaagcaa atcactcaga gtgcattatt agcagaagcc 663663 894" agaagtgatt aagc tgctaagcag tatgatgagg ctctcaataa acaagactgg 900" ggtg agcccacaga agccgagaag gatttttggg catc ctgt 906" tacaaccacc ttgctgagtg gaaatcactt gaatactgtt ctacagccag tatagacagt 912" cccc cagacctaaa taaaatctgg agtgaaccat tttatcagga aacatatcta 918" ccttacatga tccgcagcaa gctgaagctg ctgctccagg gagaggctga ccagtccctg 924" ctgacattta ttgacaaagc tatgcacggg gagctccaga ttct agagcttcat 930" tacagtcaag agctgagtct gctttacctc ctgcaagatg atgttgacag agccaaatat 936" tacattcaaa ttca gagttttatg cagaattatt ctagtattga tgtcctctta 942" caccaaagta gactcaccaa attgcagtct gtacaggctt taacagaaat tcaggagttc 948" atcagcttta taagcaaaca ttta caag ttccccttaa gagacttctg 954" aacacctgga caaacagata tccagatgct aaaatggacc caatgaacat ctgggatgac 960" atcatcacaa atcgatgttt ctttctcagc aaaatagagg agaagcttac ccctcttcca 966" gaagataata gtatgaatgt ggatcaagat ggagacccca gtgacaggat ggaagtgcaa 972" gagcaggaag aagatatcag ctccctgatc aggagttgca agttttccat gaaaatgaag 978" atgatagaca gtgcccggaa gcagaacaat ttctcacttg ctatgaaact actgaaggag 984" ctgcataaag agtcaaaaac cagagacgat tggctggtga gctgggtgca gagctactgc 990" cgcctgagcc actgccggag ccag ggctgctctg agcaggtgct cactgtgctg 996" aaaacagtct ctttgttgga tgagaacaac agct acttaagcaa aaatattctg "002" gctttccgtg accagaacat tctcttgggt acaacttaca ggatcatagc tctc "008" agcagtgagc cagcctgcct tgctgaaatc gaggaggaca aggctagaag agag "014" ctttctggat ccagttcaga ggattcagag aaggtgatcg cgggtctgta ccagagagca "020" ttccagcacc tctctgaggc tgtgcaggcg gctgaggagg aggcccagcc tccctcctgg "026" agctgtgggc ctgcagctgg ggtgattgat gcttacatga cgctggcaga tttctgtgac "032" caacagctgc gcaaggagga agagaatgca tcagttattg attctgcaga actgcaggcg "038" tatccagcac ttgtggtgga gaaaatgttg aaagctttaa aattaaattc caatgaagcc "O44" agattgaagt ttcctagatt acttcagatt atagaacggt atccagagga gactttgagc "050" ctcatgacaa tctc ttccgttccc cagt gctg gatcagccac "056" atggtggcct acaa agaccaagcc gttgctgttc agcactctgt ggaagaaatc "O62" actgataact acccgcaggc tattgtttat atca taagcagcga aagctattcc "O68" ttcaaggata cttctactgg tcataagaat aaggagtttg tggcaaggat taaaagtaag 664664 "O74" ttggatcaag gaggagtgat tcaagatttt attaatgcct tagatcagct ctctaatcct "080" gaactgctct ttaaggattg tgat gtaagagctg aactagcaaa aacccctgta "086" aataaaaaaa acattgaaaa aatgtatgaa agaatgtatg cagccttggg tgacccaaag "O92" gctccaggcc tgggggcctt tagaaggaag tttattcaga cttttggaaa agaatttgat "O98" aaacattttg ggaaaggagg ttctaaacta ctgagaatga agctcagtga cgac ""04" attaccaaca tgctactttt gaac aaagactcaa agccccctgg gaatctgaaa ""10" gaatgttcac cctggatgag cgacttcaaa gtggagttcc tgagaaatga gctggagatt ""16" cccggtcagt gtag gggaaagcca ttgccagagt accacgtgcg aatcgccggg ""22" tttgatgagc gggtgacagt gtct ctgcgaaggc ccaagcgcat catcatccgt ""28" ggccatgacg agagggaaca cctg gtgaagggtg gcgaggacct ggac ""34" cagcgcgtgg agcagctctt ccaggtcatg aatgggatcc tggcccaaga ctccgcctgc ""40" agccagaggg ccctgcagct gaggacctat agcgttgtgc ccatgacctc cagtgatccc ""46" agggcaccgc cgtgtgaata taaagattgg ctgacaaaaa tgtcaggaaa acatgatgtt ""52" ggagcttaca tgctaatgta taagggcgct aatcgtactg aaacagtcac gtcttttaga ""58" aaacgagaaa gtaaagtgcc tctc ttaaagcggg ccttcgtgag gatgagtaca ""64" agccctgagg tggc gctccgctcc cacttcgcca gctctcacgc tctgatatgc ""70" atcagccact ggatcctcgg gattggagac agacatctga acaactttat ggtggccatg ""76" gagactggcg gcgtgatcgg gatcgacttt gggcatgcgt ttggatccgc tacacagttt ""82" ctgccagtcc ctgagttgat gccttttcgg ctaactcgcc agtttatcaa tctgatgtta ""88" ccaatgaaag aaacgggcct tatgtacagc atcatggtac acgcactccg ccgc ""94" tcagaccctg gcctgctcac caacaccatg gatgtgtttg tcaaggagcc ctcctttgat "200" tggaaaaatt ttgaacagaa aatgctgaaa aaaggagggt ttca agaaataaat "206" gttgctgaaa aaaattggta cccccgacag aaaatatgtt acgctaagag aaagttagca "212" aatc cagcagtcat tacttgtgat gagctactcc tgggtcatga gaaggcccct "218" gccttcagag actatgtggc tgtggcacga ggaagcaaag atcacaacat tcgtgcccaa "224" gaaccagaga tttc gact caagtgaagt gcctgatgga ccaggcaaca "230" gaccccaaca tccttggcag aacctgggaa ggatgggagc cctggatgtg aggtctgtgg "236" gagtctgcag atagaaagca ttacattgtt taaagaatct actatacttt cagc "242" attccatgag ctgattttcc tgaaacacta aagagaaatg tcttttgtgc ttcg "248" tagcatgagt ttaaatcaag attatgatga gtaaatgtgt atgggttaaa tcaaagataa 665665 "254" ggttatagta acatcaaaga ttaggtgagg tttatagaaa gatagatatc caggcttacc "260" aaagtattaa gtcaagaata taatatgtga tcagctttca aagcatttac aagtgctgca "266" agttagtgaa acagctgtct ccgtaaatgg tgtg gggaagcctt ggaatgccct "272" tctggttctg gcacattgga aagcacactc agaaggcttc atcaccaaga ttttgggaga "278" gtaaagctaa gtatagttga tgtaacattg tagaagcagc ataggaacaa taagaacaat "284" aggtaaagct ataattatgg ttta actg catttgatat tttaggatat "290" ttttctaggt tttttccttt cattttattc tcttctagtt ttgacatttt atgatagatt "296" tgctctctag aaggaaacgt ctttatttag aaaa gtca tagcattcac "302" ttttgctatt ccaatctaca agat acataaaagt gctttgcatt gaatttggga "308" taacttcaaa aatcccatgg ttgttgttag ggatagtact aagcatttca gttccaggag "314" aataaaagaa attcctattt gaaatgaatt cctcatttgg aggaaaaaaa gcatgcattc "320" tagcacaaca agatgaaatt atggaataca aaagtggctc cttcccatgt gcagtccctg "326" tccccccccg ccagtcctcc acacccaaac tgtttctgat ttag ctttttgttg "332" tttttttttt tccttctaac acttgtattt tctt ctgtgatttt gagaagtata "338" ctcttgagtg tttaataaag tttttttcca aaagta Protein ce: NCBI Reference Seguence: NP_001075109.1 LOCUS: NP_001075109 ACCESSION: N P_001075109 l magsgagvrc sllrlqetls aadrcgaala ghqlirglgq ecvlssspav lalqtslvzs 6" rdfgllvar kslnsiefre creeilkflc iflekmqui apysveiknt ctsvytkdra 12" akckipaldl likllqtfrs srlmdefkig elfskfygel alkkkipdtv lekvyellgl 18" lgevhpsemi nnaenlfraf qmts avrepklpvl agclkglssl lcnftksmee 24" dpqtsreifn fvlkairpqi dlkryavpsa glrlfalhas qfstclldny vslfevllkw " cahtnvelkk aalsalesfl akn aemhknquy fmequgiir nvdsnnkeLs 36" iairgyglfa gpckvinakd vdfmyveliq rckqultqt dtgddrqum psflqsvasv 42" llyldtvpev ytpvlehlvv mqidsquys ccra ivkvflalaa kgpvlrncis 48" tvvhqgliri lpkg pesesedhra sgevrtgkwk vptykdyvdl frhllssdqm 666666 54" mdsiladeaf fsvnsssesl nhllydeka svlkivekld ltleiqtvge apgv 60" wmiptsdpaa nlhpakpkdf vefc reilpekqae ffepwvysfs yelilqstrl 66" plisgfykll sitvrnakki kyfegvspks lkhspedpek yscfalkaf gkevavkmkq 72" ykdellascl tfllslphni ieldvrayvp alqmafklgl sytplaevgl naleewsiyi 78" drhvmqpyyk dilpcldgyl ktsalsdetk nnwevsalsr aaqufnkvv lkhlkktknl 84" ssneaislee irirvvqmlg slggqinknl ltvtssdemm ksyvawdrek rlsfavpfre 90" mkpvifldvf lprvtelalt asdrqtkvaa cellhsmvmf qmpe ggqgappmyq 96" pvll rlacdvdqvt rqueplvmq lihwftnnkk festhvall eaildgivdp "02" vdstlrdfcg rcireflkws ikqitpqqqe kspvntkslf krlyslalhp nafergasl "08" afnniyrefr eeeslveqfv ymes lalahadeks lgtiqqccda idhlcriiek "14" khvslnkakk fpps dlvk wllahcgrpq tecrhksiel fykfvpllpg "20" wlkd vlkeegvsfl intfegggcg qpsgilaqpt llylrgpfsl qatlcwldLl "26" laalecyntf igertvgalq vlgteaqssl lkavaffles iamhdiiaae kcfgtgaagn "32" rtquegery nyskctvvvr imeftttlln tspegwkllk kdlcnthlmr vlvthcepa "38" sigfnigdvq vmahlpdvcv nlmkalkmsp ykdilethlr ekitaqsiee lcaanygpd "44" aqursrlaa vvsackqlhr agllhnilps qstdlhhsvg tellslvykg iapgderqcl "50" psldlsckql lafa fgglcerlvs lllnpavlst qgsv ihfshgeyfy "56" slfsetinte llknldlavl elmqssvdnt kmvsavlngm ldqsfreran qkhquklat "62" tilqhwkkcd swwakdsple tkmavlalla kilqidssvs fntshgsfpe vfttyilea "68" dtkldlhlkg qavtllpfft sltggsleel rrvleqliva hfpmqsrefp pgtprznnyv "74" ldal elsqspmlle lmtevlcreq qhvmeelqu rrgs cvtqvglles "80" vyemfrkddp rlsftrqsfv drslltllwh csldalreff aidv klne "86" itkk mgyykildvm ysrlpkddvh akeskinqvf hgscitegne ltktliklcy "92" daftenmage nqllerrrly hcaayncais viccvfnelk fngflfsek peknlLifen "98" lidlkrrynf pvevevpmer kkkyieirke areaangdsd gpsymsslsy ladstheem 204" sqfdfstqu qdpr patgrfrrre hddv lelemdelnr hecmapltal 210" vkhmhrslgp pqgeedsvpr dlpswmkflh gklgnpivpl nirlflaklv inteevfrpy 216" akhwlspllq laasenngge gihymvveiv atilswtgla tptgvpkdev lanrllnf;m 222" khvfhpkrav frhnleiikt lvecwkdcls ipyrlifekf sgkdpnskdn svgiqllgiv 228" mandlppydp chiqsseyf qalvnnmsfv rykevyaaaa evlglilryv merknilees 667667 234" lcelvakqlk qhqntmedkf vtks fppladrfmn avffllpkfh gvlktlclev 240" vlcrvegmte skdf qumrhrdde quvcldiiy kmmpklkpve lrellnpvve 246" fvshpsttcr eqmynilmwi hdnyrdpese tdndsqeifk lakdvliqgl idenpgqui 252" irnfwshetr lpsntldrll alnslyspki evhflslatn fllemtsmsp dypnpmfehp 258" lsecefqeyt idsdwrfrst vltpmfvetq asqgthtrt qegslsarwp vagqiratqq 264" qhdftltqta dgrssfdwlt gsstdplvdh sllf ahkrserlqr aplksvgpdf 270" gkkrlglpgd evdnkvkgaa grtdllrlrr rfmrdqekls lmyarkgvae qkrekeikse 276" lkmkqdaqvv lyrsyrhgdl pdiqikhssl itplqavaqr dpiiakqlfs slfsgilkem 282" dkfktlsekn nitqkllqdf tfsf ciqd iscqhaalls sagc 288" lasqupvgi rlleeallrl lpaelpakrv rgkarlppdv lrwvelakly rsigeyder 294" giftseigtk qitqsallae arsdyseaak qydealnkqd wvdgepteae kdfwelas;d 300" cynhlaewks leycstasid senppdlnki wsepfyqety lpymirsklk adqs 306" lltfidkamh gelqkailel hysqelslly llqddvdrak yyiqngiqsf mqnyssidvl 312" lhqsrltqu qualteiqe kqgn lssqvplkrl lntwtnrypd akmdpmniwd 318" diitnrcffl skieekltpl pednsmnvdq dgdpsdrmev qeqeedissl irsckfsmkm 324" kmidsarkqn nfslamkllk elhkesktrd dwlvsqusy crlshcrsrs qgcseqvltv 330" lktvsllden nvssylskni lafrdqnill gttyriiana lssepaclae ieedkarril 336" elsgssseds ekviaglyqr afthseavq aaeeeaqpps wscgpaagvi daymtladfc 342" dqqlrkeeen aelq aypalvvekm lkalklnsne arlkfprllq iierypee:l 348" slmtkeissv pcwqfiswis hmvalldkdq avavqhsvee itdnypqaiv ypfiissesy 354" sfkdtstghk nkefvariks kldqggviqd finaldqlsn pellfkdwsn ak:p 360" vnkkniekmy ermyaalgdp kapglgafrr gkef dkhfgkggsk llrmklsdfn 366" ditnmlllkm nkdskppgnl msdf kveflrnele ipgqydgrgk plpeyhvria 372" gfdervtvma slrrpkriii rghderehpf lvkggedqu dqrveqlfqv mngilaqua 378" csqralqlrt ysvvpmtssd prappceykd wltkmsgkhd vgaymlmykg anrtetvtsf 384" rkreskvpad llkrafvrms tspeaflalr shfasshali cishwilgig fmva 390" igid fghafgsatq flpvpelmpf rltrqfinlm glmy simvhalraf 396" ltnt mdvakepsf dwknfeqkml kkggswiqei nvaeknwypr qkicyakrkl 402" aganpavitc delllgheka pafrdyvava rgskdhnira qepesglsee tqvkclmdqa 408" tdpnilgrtw egwepwm 668668 PSM D6 Official Symbol: PSM D6 Official Name: proteasome (prosome, macropain) 268 subunit, Pase, 6 Gene ID: 9861 Organism: Homo sapiens Other Aliases: an7, S10, SGA-113M, p44S10 Other Designations: 268 some Pase regulatory subunit 6; 268 proteasome regulatory subunit RPN7; 268 proteasome regulatory subunit S10; breast cancer-associated protein SGA-113M; p42A; phosphonoformate immuno-associated n 4; proteasome regulatory particle t p44S10 Nucleotide seguence: NCBI Reference ce: NM_014814.1 LOCUS: 814 ACCESSION : NM_014814 l gtcagccgct gtccccttag ccgcgatgcc gctggagaac ctggaggagg agggtctgcc 6" caagaacccc gacttgcgta tcgcgcagct gcgcttcctg ctcagcctgc ccgagcaccg 12" cggagacgct cgcg acgagctgat ggcggccgtc cgcgataaca acatggctcc 18" ttactatgaa gccttgtgca aatccctcga ctggcagata gacgtggacc tactcaataa 24" aatgaagaag gcaaatgaag atgagttgaa gcgtttggat gaggagctgg aagatgcaga " gaagaatcta ggagagagcg aaattcgcga tgcaatgatg gcaaaggccg agtacctctg 36" ccggataggt gacaaagagg gagctctgac agcctttcgc tatg acaaaactgt 42" ggccctgggt caccgattgg atattgtatt ctatctcctt aggattggct tattttatat 48" ggataatgat ctcatcacac gaaacacaga aaaggccaaa agcttaatag aagaaggagg 54" agactgggac aacc gcctaaaagt gtatcagggt ctttattgtg tggctattcg 60" tgatttcaaa caggcagctg aactcttcct tgtt tcaacattta catcctatga 66" actcatggat tataaaacat ttgtgactta tactgtctat gtcagtatga ttgccttaga 72" aagaccagat ctcagggaaa aggtcattaa aggagcagag attcttgaag tgttgcacag 669669 781 tcttccagca gttcggcagt atctgttttc actctatgaa tgccgttact ctgttttctt 841 ccaatcatta gcggttgtgg aacaggaaat gaaaaaggac tggctttttg atta 901 tcgatactat gtaagagaaa tgagaattca tgcatacagt cagctgctgg aatcatatag 961 gtcattaacc cttggctata tggcagaagc gtttggtgtt ggtgtggaat tcattgatca 1021 ggaactgtcc attg ctgccgggag actacactgc aaaatagata aagtgaatga 1081 aatagtagaa accaacagac ctgatagcaa gaactggcag taccaagaaa ctatcaagaa 1141 aggagatctg aaca gagttcaaaa actttccaga gtaattaata tgtaaagcca 1201 tgtaactaac aaaggatttg ctttagagat aattatttgq aatttttata gcttacttca 1261 caatgtgccc gctg ataa atactgcatt gttgtttc Protein seguence: NCBI Reference Seguence: NP_055629.1 LOCUS: NP_055629 ACCESSION: NP_055629 1 mplenleeeg lpknpdlria qlrfllslpe vrde lmaavrdnnm apyyealcks 61 ldwqidvdll nkmkkanede lkrldeeled aeknlgesei rdammakaey lcrigdkeqa 121 ltafrktydk tvalghrldi vfyllriglf ymdndlitrn tekaksliee ggdwdrrnrl 181 kquglycva irdfkqaael fldtvstfts yelmdyktfv tytvyvsmia 1erpd1rekv 241 ikgaeilevl hslpavrqyl rysv fqulavveq lfap hyryyvremr 301 ihaysqlles yrsltlgyma eafgvgvefi dqelsrfiaa grlhckidkv neivetnrpd 361 skanyqeti kkgdlllnrv qklsrvinm ITG B1 Official Symbol: ITGB1 Official Name: integrin, beta 1 nectin receptor, beta polypeptide, antigen CD29 includes MDF2, MSK12) [Homo sapiens] Gene ID: 3688 sm: Homo sapiens 670670 Other Aliases: RP11-479G22.2, CD29, FNRB, GPIIA, MDF2, MSK12, VLA- BETA, VLAB Other Designations: integrin VLA-4 beta subunit; integrin beta-1; very late activation n, beta polypeptide tide seguence: NCBI nce Seguence: NM_002211.3 Locus: NM_002211 ACCESSION : NM_002211 l atcagacgcg cagaggaggc ggggccgcgg ctggtttcct gccggggggc ggctctgggc 6" cgccgagtcc cctcctcccg cccctgagga ggaggagccg ccgccacccg ccgcgcccga 12" ggag gccccgccag cccgcgggag aggcccagcg ggagtcgcgg aacagcaggc 18" ccgagcccac CgCgCnggC cccggacgcc gcgcggaaaa gatgaattta caaccaattt 24" ttgg actgatcagt tgct gtgtgtttgc tcaaacagat gaaaatagat " gtttaaaagc caaa tcatgtggag aatgtataca agcagggcca aattgtgggt 36" caaa ttcaacattt ttacaggaag gaatgcctac ttctgcacga tgtgatgatt 42" tagaagcctt aaaaaagaag ggttgccctc cagatgacat agaaaatccc agaggctcca 48" taaa gaaaaataaa aatgtaacca accgtagcaa aggaacagca gagaagctca 54" agccagagga tattactcag atccaaccac agcagttggt tttgcgatta agatcagggg 60" agccacagac atttacatta aaga gagctgaaga ctatcccatt gacctctact 66" accttatgga ttac tcaatgaaag acgatttgga gaatgtaaaa agtcttggaa 72" cagatctgat gaatgaaatg aggaggatta cttcggactt cagaattgga tttggctcat 78" ttgtggaaaa gactgtgatg ccttacatta gcacaacacc agctaagctc aggaaccctt 84" gcacaagtga acagaactgc ccat ttagctacaa aaatgtgctc agtcttacta 90" ataaaggaga agtatttaat gttg gaaaacagcg catatctgga aatttggatt 96" ctccagaagg tggtttcgat gccatcatgc aagttgcagt ttgtggatca ctgattggct 102" ggaggaatgt tacacggctg ctggtgtttt ccacagatgc cgggtttcac tttgctggag 108" atgggaaact tggtggcatt gttttaccaa atgatggaca atgtcacctg gaaaataata 114" tgtacacaat gagccattat tatgattatc cttctattgc tcaccttgtc cagaaactga 120" ataa tattcagaca atttttgcag ttactgaaga atttcagcct aagg 126" agctgaaaaa cttgatccct aagtcagcag taggaacatt atctgcaaat tctagcaatg 671671 "32" taattcagtt gatcattgat gcatacaatt ccctttcctc agaagtcatt ttggaaaacg "38" gcaaattgtc agaaggcgta acaataagtt ctta ctgcaagaac ggggtgaatg "44" gaacagggga aaatggaaga aaatgttcca atatttccat tggagatgag gttcaatttg "50" aaattagcat aacttcaaat aagtgtccaa attc tgacagcttt aaaattaggc "56" ctctgggctt tacggaggaa gtagaggtta ttcttcagta catctgtgaa tgtgaatgcc "62" aaagcgaagg catccctgaa agtcccaagt gtcatgaagg aaatgggaca tttgagtgtg "68" gcgcgtgcag gtgcaatgaa gggcgtgttg gtagacattg tgaatgcagc gaag "74" ttaacagtga agacatggat gcttactgca ggaaagaaaa cagttcagaa atctgcagta "80" acaatggaga gtgcgtctgc ggacagtgtg tttgtaggaa gagggataat acaaatgaaa "86" tttattctgg caaattctgc gagtgtgata atttcaactg tgatagatcc aatggcttaa "92" tttgtggagg aaatggtgtt tgtc gtgtgtgtga gtgcaacccc aactacactg "98" gcagtgcatg tgactgttct acta gtacttgtga agccagcaac ggacagatct 204" gcaatggccg gggcatctgc gagtgtggtg tctgtaagtg tacagatccg aagtttcaag 210" ggcaaacgtg tgagatgtgt tgcc ttggtgtctg tgctgagcat aaagaatgtg 216" ttcagtgcag agccttcaat aaaggagaaa agaaagacac atgcacacag gaatgttcct 222" attttaacat taccaaggta gaaagtcggg acaaattacc ccagccggtc caacctgatc 228" ctgtgtccca ttgtaaggag aaggatgttg acgactgttg gttctatttt acgtattcag 234" tgaatgggaa caacgaggtc atggttcatg agaa tccagagtgt cccactggtc 240" cagacatcat tgta gtgg ttgctggaat tgttcttatt ggccttgcat 246" tactgctgat atggaagctt ttaatgataa ttcatgacag aagggagttt gctaaatttg 252" aaaaggagaa aatgaatgcc aaatgggaca cgggtgaaaa tcctatttat aagagtgccg 258" taacaactgt ggtcaatccg aagtatgagg gaaaatgagt actgcccgtg caaatcccac 264" aacactgaat gcaaagtagc aatttccata gtta ggtagcttta gggcaatatt 270" gccatggttt tgtg caggttttga aaatgtacaa tatgtataat ttttaaaatg 276" ttttattatt ttgaaaataa tgttgtaatt aggg actgacaaaa gacttgagac 282" aggatggtta ctcttgtcag ctaaggtcac attgtgcctt tttgaccttt tgga 288" ctattgaaat caagcttatt ggattaagtg atatttctat agcgattgaa atag 294" ttaaagtaat gagcatgatg agagtttctg atgt attaaaactg atttttagct 300" ttacaaatat gtcagtttgc agttatgcag aatccaaagt aaatgtcctg ctagctagtt 306" aaggattgtt ctgt tattttgcta tttgcctgtt agacatgact gatgacatat 672672 312" gaca agtatgttga gagttgctgg tgtaaaatac gtttgaaata gttgatc:ac 318" aaaggccatg ggaaaaattc agagagttag gaaggaaaaa ccaa:agctt taaaacc:gt 324" tttt aagagttact taatgtttgg ttat gcct:cactt t:ca 330" agccttagat aaaagaaccg tttc tgctaaaaag tcct:gattt a:tt 336" acatacaggc ttac aaagtatttg ctgaatgggg acct:ttgag ttgaatt:at 342" tttattattt ttattttgtt taatgtctgg tgctttctgt cacc:cttct aatcttt:aa 348" tgtatttgtt tgcaattttg gggtaagact ttttttatga gtac:ttttc tttgaag:tt 354" tagcggtcaa tttgcctttt taatgaacat gtgaagttat actg:ggcta tgcaacagct 360" ctcacctacg cgagtcttac tttgagttag tgccataaca gaccactgta cttc 366" tcaccatttg agttgcccat cttgtttcac actagtcaca ttcttgtttt aagtgccttt 372" agttttaaca gttcactttt tacagtgcta tttactgaag atta aatatgccta 378" aaatacttaa tgtc ttgactctga tgtattttat caggttgtgt gcatgaaatt 384" tttatagatt aaagaagttg aggaaaagca aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_002202.2 LOCUS: NP_002202 ACCESSION: N P 002202 1 mnlqpifwig lissvccvfa qtdenrclka nakscgeciq agpncgwctn stflqegmpt 6" sarcddleal kkkgcppddi enprgskdik knknvtnrsk gtaeklkped itqiqpquv l2" lrlrsgepqt ftlkfkraed ypidlyylmd lsysmkddle nvkslgtdlm nemrritsdf 18" rigfgsfvek tvmpyisttp aklrnpctse fsyk nvlsltnkge 24" isgnldspeg gfdaimqvav cgsligwrnv trllvfstda gfhfagdgkl ggivlpndgq " chlennmytm shyydypsia hlvqklsenn iqtifavtee fqpvykelkn lipksavgtl 36" sanssnviql iidaynslss evilengkls egvtisyksy ckngvngtge ngrkcsnisi 42" gdevqfeisi tsnkcpkkds dsfkirplgf teevevilqy icececqseg ipespkcheg 48" ngtfecgacr cnegrvgrhc ecstdevnse rken sseicsnnge cvcgqcvcrk 54" rdntneiysg kfcecdnfnc drsnglicgg ngvckcrvce cnpnytgsac dcsldtstce 60" cngr gicecgvckc tdpqugqtc emcqtclgvc aehkecvqcr afnkgekkdt 66" yfni dklp qpvqupvsh ckekdvddcw fyftysvngn nevmvhvven 72" pecptgpdii pivagvvagi vliglallli wkllmiihdr refakfekek mnakwdtgen 78" piyksavttv vnpkyegk 673673 Official Symbol: MYH10 Official Name: myosin, heavy chain 10, non-muscle [Homo sapiens] m: 4628 Organism: Homo s Other Aliases: NMMHC-IIB, NMMHCB Other Designations: cellular myosin heavy chain, type B; myosin heavy chain, nonmuscle type B; myosin, heavy polypeptide 10, non-muscle; myosin-10; nonmuscle myosin ll heavy chain-B; nonmuscle myosin heavy chain ”8 Nucleotide seguence: NCBI Reference Seguence: NM_001256012.1 LOCUS: NM 001256012 ACCESSION : 256012 l agcagtgcta aaggagcccg gcag cggtgggttt gagg cgctggatct 6" gtggtcgcgg ctggggacgt gcgcccgcgc caccatcttc ggctgaagag gctt 12" ttggatcgtt ccatttacaa tggcgcagag aactggactc gaggatccag agaggtatct 18" ctttgtggac agggctgtca tctacaaccc tcaa gctgattgga cagctaaaaa 24" gctagtgtgg attccatcag aacgccatgg ggca gctagtatca aagaagaacg " gggagatgaa gttatggtgg agttggcaga gaatggaaag aaagcaatgg tcaacaaaga 36" tgatattcag aagatgaacc cacctaagtt ttccaaggtg gaggatatgg tgac 42" atgcttgaat gaagcttccg ttttacataa tctgaaggat cgctactatt caggactaat 48" ctatacttat tctggactct tctgtgtagt tataaaccct aatc ttccaattta 54" ctctgagaat attattgaaa tgtacagagg gcgt catgagatgc ctccacacat 60" ctatgctata tctgaatctg cttacagatg catgcttcaa gagg accagtcaat 66" tctttgcacg ggtgagtcag gtgctgggaa gacagaaaat acaaagaaag ttattcagta 72" ccttgcccat gttgcttctt cacataaagg aagaaaggac cataatattc ctcaggaatc 78" gcctaaacca gtgaaacacc agggggaact tgaacggcag cttttgcaag caaatccaat 84" tctggaatca aatg cgaagactgt gaaaaatgat aactcatctc gttttggcaa 90" atttattcgg atcaactttg atgtaactgg ctatatcgtt ggggccaaca ttgaaacata 674674 96" ccttctggaa aagtctcgtg ctgttcgtca agcaaaagat gaacgtactt ttcatatctt "02" ttaccagttg ttatctggag caggagaaca cctaaagtct gatttgcttc ttgaaggatt "08" taataactac aggtttctct ccaatggcta tattcctatt ccgggacagc aagacaaaga "14" taatttccag gagaccatgg aagcaatgca cataatgggc ttctcccatg aagagattct "20" gtcaatgctt aaagtagtat cttcagtgct acagtttgga aatatttctt tcaaaaagga "26" tact gatcaagctt ccatgccaga agtt gcgcagaagc tctgccatct "32" tcttgggatg aatgtgatgg agtttactcg ggccatcctg actccccgga tcaaggtcgg "38" ccgagactat aaag cccagaccaa agaacaggca gcag tagaagcatt "44" ggcaaaagct acctatgagc ggctctttcg ctggctcgtt catcgcatca ataaagctct "50" ggataggacc aaacgtcagg gagcatcttt cattggaatc ctggatattg ttga "56" aatttttgag ctgaactcct ttgaacaact ttgcatcaac tacaccaatg agaagctgca "62" gcagctgttc aaccacacca tgtttatcct agaacaagag gaataccagc gcgaaggcat "68" cgagtggaac ttcatcgatt tcgggctgga tctgcagcca gacc taatagagag "74" acctgcgaac cctcctggtg tactggccct tttggatgaa gaatgctggt tccctaaagc "80" cacagataaa acctttgttg aaaaactggt gcaa ggttcccact ccaagtttca "86" gaaacctcga caattaaaag acaaagctga tttttgcatt atacattatg cagggaaggt "92" ggactataag gcagatgagt ggctgatgaa gaatatggac cccctgaatg acaacgtggc "98" cacccttttg caccagtcat cagacagatt agag ctttggaaag atgagattca 204" gaatattcag agagcttctt tctatgacag tgtttctggt cttcatgagc caccagtgga 210" ccgtatcgtg ggtctggatc aagtcactgg tatgactgag acagcttttg gctccgcata 216" taaaaccaag aagggcatgt ttcgtaccgt tgggcaactc tacaaagaat ctctcaccaa 222" ggca actctccgaa acaccaaccc taactttgtt cgttgtatca ttccaaatca 228" cgagaagagg gctggaaaat caca cctagtccta gatcagcttc gctgtaatgg 234" tgtcctggaa gggatccgaa tctgtcgcca gggcttccct aaccgaatag ttttccagga 240" attcagacag agatatgaga tcctaactcc aaatgctatt cctaaaggtt ttatggatgg 246" ggcc tgtgaacgaa tgatccgggc tttagaattg gacccaaact tgtacagaat 252" tggacagagc tttt tcagagctgg agttctggca cacttagagg aagaaagaga 258" tttaaaaatc accgatatca tctt cgtt tgcagaggtt acctggccag 264" aaaggccttt gccaagaagc agcagcaact aagtgcctta aaggtcttgc agcggaactg 270" gtac ttac ggcactggca gtggtggcga gtcttcacaa aggtgaagcc 675675 276" gcttctacaa gtgactcgcc aggaggaaga acttcaggcc aaagatgaag tgaa 282" ggtgaaggag aagcagacga aggtggaagg agagctggag gagc acca 288" gcagctttta gaagagaaga ttgc agaacaacta caagcagaga ctgagctctt 294" tgctgaagca gaagagatga gggcaagact tgctgctaaa aagcaggaat tagaagagat 300" tctacatgac tcta gggttgaaga agaagaagaa agaaaccaaa tcctccaaaa 306" tgaaaagaaa aaaatgcaag cacatattca ggacctggaa gaacagctag acgaggagga 312" aggggctcgg caaaagctgc agctggaaaa ggtgacagca gaggccaaga agat 318" ggaagaggag attctgcttc tcgaggacca aaattccaag ttcatcaaag aaaagaaact 324" catggaagat cgcattgctg agtgttcctc ggct gaagaggaag aaaaggcgaa 330" aaacttggcc aaaatcagga ataagcaaga agtgatgatc tcagatttag aagaacgctt 336" aaagaaggaa gaaaagactc gtcaggaact ggaaaaggcc aaaagaaaac ggga 342" gacgaccgac ctgcaggacc agatcgcaga gctgcaggcg cagattgatg agctcaagct 348" gcagctggcc aagaaggagg aggagctgca gggcgcactg gccagaggtg atgatgaaac 354" actccataag aacaatgccc ttaaagttgt gcgagagcta caagcccaaa ttgctgaact 360" tcaggaagac tttgaatccg agaaggcttc acggaacaag gccgaaaagc agaaaaggga 366" tgag gaag ctctgaaaac agagctggag gacacgctgg acaccacggc 372" agcccagcag gaactacgta caaaacgtga acaagaagtg gcagagctga agaaagctct 378" tgaggaggaa actaagaacc atgaagctca aatccaggac caaa gacacgcaac 384" agccctggag gagctctcag agcagctgga acaggccaag cggttcaaag caaatctaga 390" gaagaacaag ctgg agacagataa caaggagctg gcgtgtgagg tgaaggtcct 396" gcagcaggtc aaggctgagt ctgagcacaa gaggaagaag ctcgacgcgc aggtccagga 102" gctccatgcc aaggtctctg aaggcgacag gctcagggtg gagctggcgg agaaagcaag 108" taagctgcag aatgagctag ataatgtctc cacccttctg gaagaagcag agaagaaggg 114" tattaaattt gctaaggatg cagctagtct tgagtctcaa ctacaggata cacaggagct 120" ggag gagacacgcc agaaactaaa cctgagcagt cggatccggc agctggaaga 126" ggagaagaac agtcttcagg agga ggag agga agaacctgga Z32" gaagcaagtg ctggccctgc agtcccagtt ggctgatacc aagaagaaag tagatgacga Z38" cctgggaaca attgaaagtc tggaagaagc caagaagaag cttctgaagg acgcggaggc Z44" cctgagccag cgcctggagg cact ggcgtatgac aaactggaga agaccaagaa 150" ccgcctgcag caggagctgg acgacctcac ggtggacctg gaccaccagc gccaggtcgc 676676 156" ctccaacttg gagaagaagc agaagaagtt tgaccagctg ttagcagaag agaagagcat Z62" ctctgctcgc tatgccgaag agcgggaccg ggccgaagcc gaggccagag aaac Z68" caaagccctg tcactggccc gggccctcga ggaagccctg gaggccaagg aggagtttga Z74" gaggcagaac aagcagctcc gagcagacat ggaagacctc atgagctcca aagatgatgt 180" gggaaaaaac gttcacgaac ttgaaaaatc caaacgggcc ctagagcagc aggtggagga 186" gacc cagctggagg agctggaaga cgaactccag gccacggaag atgccaagct Z92" tcgtctggag gtcaacatgc aggccatgaa ggcgcagttc gagagagacc tgcaaaccag Z98" ggatgagcag aatgaagaga agaagcggct gctgatcaaa caggtgcggg agctcgaggc 504" ggagctggag gatgagagga gggc gcttgctgta gcttcaaaga aaaagatgga 510" gatagacctg ctcg aagcccaaat Cgaggctgcg aacaaagctc gggatgaggt 516" gcag ctccgcaagc tccaggctca gatgaaggat taccaacgtg aattagaaga 522" agctcgtgca tccagagatg agatttttgc tcaatccaaa gagagtgaaa agaaattgaa 528" gagtctggaa atcc tgca ggaggaactt gcctcatctg agcgagcccg 534" ccgacacgcc gagcaggaga agct ggcggacgag atcaccaaca gcgcctctgg 540" caagtccgcg ctgctggatg agaagcggcg tctggaagct cggatcgcac agctggagga 546" ggagctggaa gaggagcaga gcaacatgga gctgctcaac ttcc gcaagaccac 552" tctacaggtg gacacactga acgccgagct agcagccgag gccg agag 558" tgacaatgca cgccagcaac ggca gaacaaggag ctgaaggcca agctgcagga 564" actcgagggt gctgtcaagt ctaagttcaa ggccaccatc tcagccctgg agat 570" tgggcagctg gaggagcagc ttgagcagga agccaaggaa cgagcagccg aatt 576" agtccgtcgc aaga agctgaaaga aatcttcatg caggttgagg atgagcgtcg 582" acacgcggac aaag agcagatgga gaaggccaac gctcggatga agcagcttaa 588" acgccagctg gaggaagcag aagaagaagc gacgcgtgcc aacgcatctc ggcgtaaact 594" ccagcgggaa ctggatgatg ccaccgaggc caacgagggc ctgagccgcg aggtcagcac 600" cctgaagaac cggctgaggc ggggtggccc catcagcttc tcttccagcc gatctggccg 606" gcgccagctg caccttgaag gagcttccct ggagctctcc gacgatgaca cagaaagtaa 612" tgat gtcaacgaga cgcagccacc ccagtcagag taaagttgca ggaagccaga 618" ggaggcaata cagtgggaca gttaggaatg cacccggggc ctcctgcaga tttcggaaat 624" tggcaagcta cgggattcct tcctgaaaga tcaactgtgt cttaaggctc tccagcctat 630" tgta tcctgcttca gacttaggta caattgctcc cctttttata tatagacaca 677677 636" acac atatattaaa cagattgttt catcattgca tctattttcc atatagtcat 642" caagagacca ttttataaaa aaga ccctttttaa aacaaactcc aggcccttgg 648" gtcg tatt ggggcagcgc cgtggtcgtc actcagtcgc tctgcatgct 654" ctctgtcata cagacaggta acctagttct gtgttcacgt ggcccccgac tcctcagcca 660" catcaagtct cctagaccac tgtggactct aaactgcact tgtctctctc atttccttca 666" aataatgatc aatgctattt cagtgagcaa actgtgaaag gggctttgga aagagtagga 672" ggggtgggct ggatcggaag ccat ttggggttac catgtccatc gggg 678" ggccctgccc ctcgagtcga tggtgtcccg catctactca tgtgaactgg cgag 684" ggctggtctg tgcatagaag tggc cacactgcag ctgaggcccc aggtggcagc 690" catggatcat gtagacttcc agatggtctc cgcc tggctctgcc ggcgccctcc 696" cagg agcaagcagc cgtggacccc taagccgagc tggtggaagg cccctccccg 702" tcgccagccg ggccctcatg ctgaccttgc aaattcagcc gctgctttga gcccaaaatg 708" ggaatattgg gtcc gaggcttgtt ccaagtttgt caatgaggtt tatggagcct 714" ccagaacaga tgccatcttc ctgaatgttg acatgccagt gggtgtgact ccttcatttt 720" tccttctccc ttccctttgg acagtgttac agtgaacact tagcatcctg tttttggttg 726" gtagttaagc aaactgacat tacggaaagt gaca ctacagtact aagacaatgt 732" tgaatatatc attcgcctct ataacaattt aatgtattca gttttgactg tgcttcatat 738" catgtacctc tctagtcaaa gtggtattac agacattcag tgacaatgaa tcagtgttaa 744" ttctaaatcc ttgatcctct gcaatgtgct tgaaaacaca aaccttttgg gttaaaagct 750" ttaacatcta ttaggaagaa tttgtcctgt gggtttggaa tcttggattt tcccccttta 756" tgaactgtac tggctgttga ccaccagaca cctgaccgca aatatctttt cttgtattcc 762" catatttcta gacaatgatt tttgtaagac aataaattta ttcattatag atatttgcgc 768" ctgctctgtt tacttgaaga aaaaagcacc cgtggagaat aaagagacct caataaacaa 774" gaataatcat gtgaacgtgg aaaaaaaaaa aaaaaaaa Protein seguence: NCBI Reference ce: NP_001242941.1 LOCUS: NP_001242941 ACCESSION: NP 001242941 1 ledp erylfvdrav iynpatqadw takklvwips erhgfeaasi keergdevmv 61 elaengkkam vnkddiqkmn ppkfskvedm aeltclneas vlhnlkdryy sgliytysgl 678678 12" fcvvinpykn lpiyseniie myrgkkrhem pphiyaises ayrcmlqdre dqsilctges 18" gagktentkk hvas shkgrkdhni pqespkpvkh qgelerqllq anpilesfgn 24" aktvkndnss rfgkfirinf dvtgyivgan ietylleksr avrqakdert fhifyqllsg " agehlksdll legfnnyrfl sngyipipgq qdkdnfqetm eamhimgfsh eeilsmlkvv 36" ssvqugnis tdqa smpentvaqk lchllgmnvm eftrailtpr ikvgrdyvqk 42" aqtkeqadfa vealakatye rlfrwlvhri nkaldrtqu gasfigildi agfeifelns 48" feqlcinytn equqlfnht mfileqeeyq regiewnfid pcid lierpanppg 54" vlalldeecw fpkatdktfv eklvqqush squkprqlk dkadfciihy agkvdykade 60" dpln lhqs sdrfvaelwk deiqniqras fydsvsglhe ppvdringd 66" qvtgmtetaf gsayktkkgm frtvgquke sltklmatlr ntnpnfvrci ipnhekragk 72" ldphlvldql rcngvlegir icrqgfpnri rye iltpnaipkg fmdgkqacer 78" miraleldpn lyrigqskif ahle eerdlkitdi iiffqavcrg ylarkafakk 84" qqqlsalkvl ylkl rhwqurvft kvkpllqvtr qeeelqakde ellkvkekqt 90" kvegeleeme rkhqqlleek nilaeqlqae aeem rarlaakkqe leeilhdles 96" ernq ilqnekkkmq ahiqdleeql deeegaqul qlekvtaeak ikkmeeeiLl "02" ledqnskfik ekklmedria ecssqlaeee ekaknlakir nkqevmisdl eerlkkeekt "08" akrk ldgettdlqd qiaelqaqid elqulakke eelqgalarg ddetlhknna "14" lkvvrelqaq iaelqedfes ekasrnkaek qkrdlseele edtl dttaaqqeLr "20" tkreqevael kkaleeetkn heaqiqdmrq rhataleels eqleqakrfk anleknkqgl "26" etdnkelace vkvquvkae sehkrkklda qvqelhakvs egdrlrvela ekasqunel "32" dnvstlleea ekkgikfakd aaslesqlqd tqellqeetr srir qleeekns;q "38" eqqeeeeear vlal qsqladtkkk vdddlgties leeakkkllk daealsque "44" ekalaydkle ktknrquel ddltvdldhq rqvasnlekk qkkqullae eksisaryae "50" erdraeaear eketkalsla raleealeak eeferanql radmedlmss kddvgknvhe "56" lekskral q qv mrtql l d lqat daklrlevnm qamkaqferd lqtrdeqnee "62" kkrllikqu eleaeleder qualavask kkmeidlkdl eaqieaanka rdevikqlrk "68" yqr eleearasrd eifaqskese kklksleaei lqlqeelass erarrhaeqe "74" rdeladeitn sasgksalld ekrrlearia qleeeleeeq snmellndrf rktthvdtl "80" naelaaersa aqksdnarqq leranelka quelegavk skfkatisal eakigqleeq "86" eraa anklvrrtek klkeifmqve derrhadqyk eqmekanarm kqlquleea 679679 1921 anas rrqureldd glsr evstlknrlr rggpisfsss rsgrrqlhle 1981 gaslelsddd tesktsdvne e Official Sym bol: NCL Official Name: nucleolin [Homo sapiens] Gene ID: 4691 Organism: Homo sapiens Other Aliases: C23 Nucleotide seguence: NCBI Reference Seguence: NM_005381.2 LOCUS: NM 005381 ACCESSION : NM_005381 XM_002342275 ctttcgcctc agtctcgagc tctcgctggc cttcgggtgt tccg ggatcttcag 6" cacccgcggc cgccatcgcc tggc ttcttctgga ctcatctgcg ccacttgtcc 12" gcttcacact ccgccgccat catggtgaag ctcgcgaagg aaaa tcaaggtgac 18" aaaa tggctcctcc tccaaaggag gaag atagtgaaga tgaggaaatg 24" tcagaagatg aagaagatga tagcagtgga gaagaggtcg tcatacctca gaagaaaggc " aagaaggctg cctc agcaaagaag gtggtcgttt ccccaacaaa aaaggttgca 36" gttgccacac cagccaagaa agcagctgtc actccaggca aaaaggcagc agcaacacct 42" gccaagaaga cagttacacc agccaaagca gttaccacac ctggcaagaa gggagccaca 48" ccaggcaaag cattggtagc aactcctggt aagaaggg 1g ctgccatccc gggg 54" gcaaagaatg gcaagaatgc caagaaggaa gacagtga:g aagaggagga cagt 60" gaggaggatg aggaggatga cgaggacgag gatgagga:g aagatgaaat tgaaccagca 66" gcgatgaaag cagcagctgc tgcccctgcc tcagagga:g aggacgatga ggatgacgaa 72" gatgatgagg atgacgatga cgatgaggaa gatgactc:g aagaagaagc tatggagact 78" acaccagcca aaggaaagaa agctgcaaaa gttgttcc:g tgaaagccaa gaacgtggct 84" gaggatgaag atgaagaaga ggatgatgag gacgagga:g acgacgacga cgaagatgat 680680 90" gatg atgatgaaga tgatgaggag gaggaagaag aggaggagga agagcctgtc 96" aaagaagcac aacg aaagaaggaa aaac agaaagcagc tcctgaagcc "02" aagaaacaga aagtggaagg cacagaaccg actacggctt tcaatctctt tgttggaaac "08" ctaaacttta ctgc tcctgaatta aaaactggta tcagcgatgt ttttgctaaa "l4" aatgatcttg ctgttgtgga tgtcagaatt ggtatgacta ggaaatttgg ttatgtggat "20" tttgaatctg ctgaagacct ggagaaagcg ttggaactca tgaa agtctttggc "26" aatgaaatta aactagagaa accaaaagga aaagacagta agaaagagcg agatgcgaga "32" acacttttgg ctaaaaatct cccttacaaa gtcactcagg atgaattgaa agaagtgttt "38" gaagatgctg cggagatcag attagtcagc ggga aaagtaaagg gattgcttat "44" attgaattta aagc tgatgcagag aaaacctttg aagaaaagca gggaacagag "50" atcgatgggc tttc cctgtactat actggagaga aaggtcaaaa tcaagactat "56" agaggtggaa agaatagcac tggt aaaa ctctggtttt aagcaacctc "62" agtg caacagaaga aactcttcag gaagtatttg agaaagcaac ttttatcaaa "68" gtaccccaga accaaaatgg taaa gggtatgcat ttatagagtt tgcttcattc "74" gaagacgcta aagaagcttt aaattcctgt aataaaaggg aaattgaggg cagagcaatc "80" aggctggagt tgcaaggacc caggggatca cctaatgcca gaagccagcc atccaaaact "86" ctgtttgtca aaggcctgtc tgaggatacc actgaagaga cattaaagga gtcatttgac "92" ggctccgttc gggcaaggat agttactgac cgggaaactg ggtcctccaa agggtttggt "98" tttgtagact tcaacagtga ggaggatgcc aaagctgcca aggaggccat ggaagacggt 204" gaaattgatg gaaataaagt taccttggac tgggccaaac ctaagggtga aggtggcttc 210" gggggtcgtg gtggaggcag aggcggcttt ggaggacgag gtggtggtag aggaggccga 216" ggaggatttg gtggcagagg ccggggaggc tttggagggc gaggaggctt ccgaggaggc 222" agaggaggag gaggtgacca caagccacaa ggaaagaaga cgaagtttga tctg 228" tccctctgct tttc catttgaaag aaaggactct ggggttttta ctgttacctg 234" atcaatgaca gagccttctg aggacattcc aagacagtat acagtcctgt ggtctccttg 240" gaaatccgtc tagttaacat ttcaagggca ataccgtgtt ggttttgact ggatattcat 246" ataaactttt taaagagttg agtgatagag ctaaccctta tctgtaagtt ttgaatttat 252" attgtttcat cccatgtaca aaaccatttt caaa gggt gttg 258" tttctttttt ttgttttgtt tttgtttttt ttgc gttcgtgggg ttgtaaaaga 264" aaagaaagca gaatgtttta tcatggtttt tgcttcagcg gctttaggac aaattaaaag 270" tcaactctgg tgccagaaaa aaaaaaaaaa aa 681681 Protein seguence: NCBI Reference Seguence: NP_005372.2 LOCUS: NP_005372 ACCESSION: N P_005372 XP_002342316 _ mvklakagkn qupkkmapp pkeveedsed eemsedeedd ssgeevvipq kkgkkaaats 6" akkvvvsptk kvavatpakk aavtpgkkaa atpakktvtp akavttpgkk alva 12" tpgkkgaaip akgakngkna kk dsd d dds d dd d d d d i aaaa 18" apasededde ddeddedddd deeddseeea mettpakgkk aakvvpvkak nvaededeee _ ddededdddd dd dddd d d pvk apgkr kkemaquaa " tepttafnlf vgnlnfnksa pelktgisdv fakndlavvd vrigmtrkfg yvdfesaedl 36" ekaleltglk vfgneiklek pkgkdskker dartllaknl pykvtqdelk evfedaaeir 42" lvskdgkskg iayiefktea daektfeekq gteidgrsis lyytgekgqn qdyrggknst 48" wsgesktlvl atee thevfekat fikqunqng kskgyafief asfedakeal _ nscnkreieg lqu rgspnarsqp sktlkagls tlke sfdgsvrari 60" vtdretgssk gfgfvdfnse edakaakeam edgeidgnkv tldwakpkge ggfggrgggr 66.. ggfggrgggr ggrggfggrg rggfggrggf rggrggggdh kpqgkktkfe Official Symbol: SE061A1 Official Name: Sec61 alpha1 subunit (S. siae) [Homo sapiens] Gene ID: 29927 Organism: Homo sapiens Other Aliases: HSE061, SE061, SE061A Other Designations: Sec61 1;protein transport protein SE061 alpha subunit; protein transport protein Sec61 t alpha; protein ort protein Sec61 subunit alpha isoform 1; sec61 Nucleotide seguence: NCBI Reference Seguence: NM_013336.3 LOCUS: NM_013336 ACCESSION : NM_013336 NM_015968 682682 agcgatccga ggcccggccc cggccccgcc ccgcgccgcg ccgcgccgct tgccgccggg 6" ctagcactga cgtgtctctc ggcggagctg ctgtgcagtg gaacgcgctg ggccgcgggc 12" agcgtcgcct cacgcggagc agagctgagc ggga cccggagccc gagcagccgc 18" cgccatggca atcaaatttc tggaagtcat caagcccttc tgtgtcatcc tgccggaaat 24" tcagaagcca gagaggaaga ttcagtttaa ggagaaagtg ctgtggaccg ctatcaccct " ctttatcttc ttagtgtgct gccagattcc cctgtttggg atcatgtctt cagattcagc 36" tgaccctttc tattggatga gagtgattct agcctctaac agaggcacat tgatggagct 42" ctct cctattgtca cgtctggcct tataatgcaa ctcttggctg agat 48" aattgaagtt ggtgacaccc accg agctctcttc aacggagccc aaaagttatt 54" tggcatgatc attactatcg gccagtctat cgtgtatgtg atgaccggga tgtatgggga 60" tgaa atgggtgctg gaatttgcct gctaatcacc attcagctct ttgttgctgg 66" cttaattgtc ctacttttgg atgaactcct gcaaaaagga cttg gctctggtat 72" ttctctcttc attgcaacta acatctgtga aaccatcgta tggaaggcat tcagccccac 78" tactgtcaac actggccgag gaatggaatt tgaaggtgct atcatcgcac ttttccatct 84" gctggccaca cgcacagaca gagc ccttcgggag gcgttctacc gccagaatct 90" tcccaacctc atgaatctca ccat ctttgtcttt gcagtggtca tctatttcca 96" ccga gtggacctgc caatcaagtc ggcccgctac cgtggccagt acaacaccta "02" tcccatcaag ctcttctata cgtccaacat ccccatcatc ctgcagtctg ccctggtgtc "08" caacctttat gtcatctccc aaatgctctc cttc agtggcaact tgctggtcag "14" cctgctgggc tcgg acacgtcttc tgggggccca gcacgtgctt atccagttgg "20" ttgc tattacctgt caga atcttttggc tccgtgttag aagacccggt "26" ccatgcagtt gtatacatag tgttcatgct gggctcctgt gcattcttct ccaaaacgtg "32" gattgaggtc tcaggttcct ctgccaaaga tgttgcaaag cagctgaagg agcagcagat "38" ggtgatgaga ggccaccgag agacctccat ggtccatgaa ctcaaccggt ccac "44" agccgcggcc gggc tgtgcatcgg ctcg gtcctggctg acttcctagg "50" cgccattggg tctggaaccg ggatcctgct cgcagtcaca atcatctacc agtactttga "56" gatcttcgtt aaggagcaaa gcgaggttgg cagcatgggg gccctgctct cccg "62" tctcccggac aggttgagga agctgctcca gaagcgcctc ggaaggggag ctctcatcat "68" ggcgcgtgct gctgcggcat atggactttt aataatgttt ttgaatttcg tattctttca "74" tgtg taaagtgcta gacattttcc aatttaaaat tttgcttttt atcctggcac 683683 180" tggcaaaaag aactgtgaaa gtgaaatttt attcagccga ctgccagaga aatg 186" gatt gtccccaagt gtccatgtaa cttttgtttt aacctttgca ccttctcagt 192" gctgtatgcg gccg tctcacctgt ttccccacaa agggaatttc tcactctggt 198" tggaagcaca aacactgaaa tgtctacgtt tcattttggc agtagggtgt ggga 204" catg tatttcccgg agacgtggga ccttgctggc atgtctcctt cacaatcagg 210" cgtgggaata tctggcttag gactgtttct ctctaagaca ccattgtttt cccttatttt 216" aaaagtgatt tttttaagga cagaacttct tccaaaagag agggatggct ttcccagaag 222" acactcctgg ccatctgtgg atttgtctgt gcacctattg gctcttctag ctgactcttc 228" tggttgggct tagagtctgc ctgtttctgc tagctccgtg tttagtccac ttgggtcatc 234" agctctgcca agctgagcct ggccaagcta ggtggacaga cagt cgtt 240" tgtccagatt ctgccagtca tcactggaca cgtctcctcg cagctgccct agcaagggga 246" gacattgtgg tagctatcag acatggacag aaactgactt agtgctcaca agcccctaca 252" ccttctgggc tgaagatcac ccagctgtgt tcagaatttt cttactgtgc ttaggactgc 258" gtga gcagacacca ccgacttcct ttctgcgtca ccagtgtcgt cagcagagag 264" aggacagcac aggctcaagg ttggtagtga agtcaggttc ggggtgcatg ggtg 270" gtgttgatca gttgctccag tgtttgaaat aagaagactc atgtttatgt ctggaataag 276" ttctgtttgt gctgacaggt ggcctaggtc ctggagatga gcaccctctc tctggccttt 282" agggagtccc ctcttaggac aggcactgcc cagcagcaag ggcagcagag ttgggtgcta 288" agatcctgag gagctcgagg tttcgagctg gctttagaca ttggtgggac tgtt 294" ttgcaggatg ccctgatcct gggg cctgggggtg cgtgcagcct gtcggggaga 300" ccccactctg acagtgggca cacggcagcc tgcaaagcac agggccaccg ccacagcccg 306" gcagaggggc acactctgga gaccttgctg gcagtgctag ccaggaaaca gagtgaccaa 312" gggacaagaa gggacttgcc taaagccacc cagcaactca gcagcagaac caagatgggc 318" ctcc tccatatggc ccagggctta ccaccctatc acacgtggcc ttgtctagac 324" ccagtcctga gcaggggaga tgag acctgatgcc ccca catggttctc 330" ccactgccct gtctgctctg ctgctacaga gggc cagc ccacgcttag 336" gaatgcttgg cctctggcag gcaggcagct aagc tggtgggcag ggggctggaa 342" ggcaccaggc ctcaggagga gccccatagt cccgcctgca gcctgtaacc atcggctggg 348" ccctgcaagg cccacactca cgccctgtgg gtgatggtca ngtgggtgg gtgggggctg 354" accccagctt ccaggggact gtcactgtgg acgccaaaat ggcataactg agataaggtg 360" aataagtgac aaataaagcc agttttttac aaggtaaaaa aaaaaaaaaa aaaaaa 684684 Protein seguence: NCBI nce Seguence: NP_037468.1 LOCUS: NP_037468 ACCESSION: N P_037468 N P_057052 l maikflevik pfcvilpeiq kperkiqfke kvlwtaitlf iflvccqipl fgimssdsad 6" pfywmrvila snrgtlmelg ispivtsgli mqllagakii evgdtpkdra lfngaqklfg 12" miitigqsiv yvmtgmygdp semgagicll itiqlfvagl ellq sgis 18" lfiatnicet ivwkafsptt vntgrgmefe gaiialfhll atrtdkvral reafyranp 24" nlmnliatif vfavviyfqg frvdlpiksa ryrgqyntyp iklfytsnip iilqsalvsn " lyvisqmlsa rfsgnllvsl lgtwsdtssg gparaypvgg lcyylsppes fgsvledpvh 36" avvyivfmlg ktwi evsgssakdv akqlkeqqmv mrghretsmv helnryipta 42" aafgglciga flga igsgtgilla vtiiyqyfei kaeqsevgs mgallf PAPSSZ Official Sym bol: PAPSS2 Official Name: 3'-phosphoadenosine 5'-phosphosulfate se 2 [Homo sapiens] Gene ID: 9060 Organism: Homo sapiens Other Aliases: RP11-77F13.2, ATPSK2, 8K2 Other Designations: 3-prime-phosphoadenosine 5-prime-phosphosulfate synthase 2; ATP sulfurylase/APS kinase 2; ATP sulfurylase/adenosine 5'- phosphosulfate kinase; PAPS synthase 2; PAPS synthetase 2; PAPSS 2; SK 2; bifunctional 3'-phosphoadenosine 5'-phosphosulfate se 2; bifunctional 3'- phosphoadenosine 5'-phosphosulfate synthethase 2 Nucleotide seguence: NCBI nce ce: NM_001015880.1 LOCUS: NM_001015880 ACCESSION : NM_001015880 l ctaggcggcg gcggccgggt ccccaaggct gggcgctgct tgcggaaccg acggggcgga 61 gaggagcgtg gcgggaggag gagtaggaga agggggctgg gaag tgcgacgtgt 685685 12" ctgcggagcc tttttatacc ccgg gagtccggca gccgctgctg ctgctgctgc 18" tgctgctgcc gccgccgccg ccgccgtccc tgcgtccttc ggtctctgct cccgggaccc 24" cgcc gcagccagcc agcatgtcgg ggatcaagaa gcaaaagacg gagaaccagc " agaaatccac caatgtagtc tatcaggccc accatgtgag caggaataag agagggcaag 36" tggttggaac aaggggtggg ttccgaggat tgtg gctaacaggt ctctctggtg 42" ctggaaaaac aacgataagt tttgccctgg aggagtacct tgtctcccat gccatccctt 48" gttactccct ggatggggac aatgtccgtc atggccttaa cagaaatctc ggattctctc 54" acag agaggaaaat atccgccgga ttgctgaggt ggctaagctg tttgctgatg 60" ctggtctggt ctgcattacc agctttattt ctccattcgc aaaggatcgt gagaatgccc 66" gcaaaataca tgaatcagca gggctgccat tctttgaaat atttgtagat ctaa 72" atatttgtga aagcagagac gtaaaaggcc tctataaaag ggccagagct ggggagatta 78" aaggatttac tgat tctgattatg agaaacctga aactcctgag cgtgtgctta 84" aaaccaattt gtccacagtg agtgactgtg tccaccaggt agtggaactt ctgcaagagc 90" agaacattgt accctatact ataatcaaag atatccacga actctttgtg ccggaaaaca 96" aacttgacca cgtccgagct gaggctgaaa ctctcccttc attatcaatt actaagctgg "02" agtg ggtccaggtt ttgagcgaag gctgggccac tcccctcaaa atgc "08" gggagaagga acag gttatgcact ttgacaccct tggc atggcccttc "14" ctgatggcgt gatcaacatg agcatcccca ttgtactgcc cgtctctgca gaggataaga "20" cacggctgga agggtgcagc aagtttgtcc tggcacatgg tggacggagg gtagctatct "26" tacgagacgc tgaattctat gaacacagaa aagaggaacg ccgt gtttggggga "32" caacatgtac cccc catatcaaaa tggtgatgga aagtggggac tggctggttg "38" gtggagacct tcaggtgctg gagaaaataa gatggaatga tgggctggac cgtc "44" tgacacctct ggagctcaaa cagaaatgta aagaaatgaa tgctgatgcg gtgtttgcat "50" tccagttgcg caatcctgtc cacaatggcc atgccctgtt gatgcaggac actcgccgca "56" ggctcctaga gaggggctac aagcacccgg tact acaccctctg ggcggctgga "62" ccaaggatga gcct ctagactggc ggatgaagca ggct gtgctcgagg "68" aaggggtcct ggatcccaag tcaaccattg ttgccatctt tccgtctccc atgttatatg "74" ctggccccac agaggtccag tggcactgca ggtcccggat gattgcgggt gccaatttct "80" acattgtggg gagggaccct gcaggaatgc cccatcctga gaag gatctgtatg "86" aacccactca tgggggcaag gtcttgagca tggcccctgg cctcacctct gtggaaatca 686686 192" ttccattccg agtggctgcc aaag ccaaaaaagc catggacttc tatgatccag 198" caaggcacaa tgagtttgac ttcatctcag gaat gaggaagctc gcccgggaag 204" gagagaatcc cccagatggc ttcatggccc catg gaaggtcctg acagattatt 210" acaggtccct ggagaagaac taagcctttg gctccagagt ttctttctga agtgctcttt 216" gattaccttt tctattttta tgattagatg ctttgtatta aattgcttct caatgatgca 222" ttttaatctt tgaa gtaaaagttg tgtctataat taaaaaaaaa tatatatata 228" tacacacaca catatacata caaagtcaaa ctgaagacca aatcttagca agca 234" atattcttat acatttcata ataaaattag ctctatgtat tttctactgc acctgagcag 240" gcaggtccca gatttcttaa ggctttgttt gaccatgtgt ctagttactt gctgaaaagt 246" attt tccagcatgt cttgacaacc tgtactcttc caatgtcatt tatcagttgt 252" aaaatatatc tgtc ctcttctgta caattgacaa aaaaaaaaat ttttttttct 258" cactctaaaa gaggtgtggc tcacatcaag attcttcctg atattttacc tcatgctgta 264" caaagcctta atgttgtaat catatcttac gtgttgaaga cctgactgga gaaacaaaat 270" gtgcaataac gtgaatttta tcttagagat ctgtgcagcc tatttctgtc acaaaagtta 276" tattgtctaa taagagaagt cttaatggcc tctgtgaata atgtaactcc agttacacgg 282" tgacttttaa tagcatacag tgatttgatg aaaggacgtc aaacaatgtg gcgatgtcgt 288" ggaaagttat ctttcccgct ctttgctgtg gtcattgtgt cttgcagaaa ggatggccct 294" gatgcagcag cagcgccagc tgtaataaaa aataattcac actatcagac tagcaaggca 300" ctagaactgg ccac caaa gaatccaacc tctt acaggtgaac 306" aaactgtgat gatgcacatg tatgtgtttt gtaagctgtg agcaccgtaa caaaatgtaa 312" atttgccatt attaggaagt gctggtggca gtgaagaagc acccaggcca cttgactccc 318" agtctggtgc cctgtctaca ccagacaaca caggagctgg gtcagattcc cctcagctgc 324" ttaacaaagt tcctcgaaca gctt acaaagctgc cttctcggat actgaaaggt 330" ttct gaactgcact gattttattg cagttgaaaa aaaaaaaaag caaa 336" gatttcaagc tgttctgaga ctga tggctttact tcctgagagg caatgttttt 342" actttatgca taattcattg ttgccaagga ataaagtgaa gaaacagcac atat 348" ataggtctct ctggaagaga cctaaattag aaagagaaaa ctgtgacaat tttcatattc 354" tcattcttaa aaaacactaa tcttaactaa caaaagttct aata acac 360" aatggccaca gcagtttgtc tttaatagta tagtgcctat actcatgtaa tcggttactc 366" actactgcct ttaaaaaaaa aaaccagcat atttattgaa aacatgagac aggattatag 687687 3721 tgccttaacc gatatatttt gtgacttaaa aaatacattt aaaactgctc ttctgctcta 3781 gtaccatgct aaat gattatttct atgtacaact tgtt cttattttaa 3841 taaatttatc aaaa aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_001015880.1 LOCUS: NP_001015880 ACCESSION: N P_001015880 _ msgikqute nqqkstnvvy qahhvsrnkr rggf rgctvwltgl sgagkttisf 6" aleeylvsha ipcysldgdn vrhglnrnlg fspgdreeni rriaevaklf adaglvcits 12" fispfakdre narkihesag lpffeifvda plnicesrdv kglykrarag eikgftgids 18" dyekpetper vlktnlstvs dcvhqvvell qeqnivpyti ikdihelfvp enkldhvrae 24" lsit kldlquqvl segwatplkg fmrekequv mhfdtlldgm inms " vsae dktrlegcsk fvlahggrrv ailrdaefye hrkeercsrv wgttctkhph 36" ikmvmesgdw lvggdlqvle kirwndgldq yrltplelkq kckemnadav fafqlrnpvh 42" nghallqut rrrllergyk hpvlllhplg gwtkdddvpl dwrmkqhaav leegvldpks 48" tivaifpspm lyagptevqw hcrsrmiaga nfyivgrdpa gmphpetkkd ggkv 54" lsmapgltsv eiipfrvaay nkakkamdfy dparhnefdf isgtrmrkla regenppdgf 60" mapkawkvlt dyyrslekn 688688 We claim: 1. A method for identifying a tor of drug-induced toxicity, said method sing: (1) establishing a model for drug-induced toxicity using cells associated with druginduced toxicity to represent a characteristic aspect of drug-induced toxicity; (2) obtaining a first data set from the model for drug-induced toxicity, wherein the first data set represents one or more of genomics, lipidomics, proteomics, metabolomics, riptomics, and single nucleotide polymorphism (SNP) data characterizing the cells associated with drug-induced toxicity, wherein the drug is Anthracyclines, 5-Fluorouracil, Cisplatin, zumab, Gemcitabine, Rosiglitazone, Pioglitazone, tazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, or a TNF antagonist; (3) obtaining a second data set from the model for drug-induced toxicity, wherein the second data set represents a functional activity or a cellular response of the cells ated with drug-induced toxicity; (4) ting a consensus causal relationship network among the sion levels of the one or more of genomics, lipidomics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data and the functional activity or cellular response based solely on the first data set and the second data set using a programmed computing device, wherein the generation of the consensus causal relationship network is carried out by an artificial intelligence (AI) -based informatics rm, and wherein the generation of the consensus causal relationship network is not based on any known biological relationships other than the first data set and the second data set; and (5) identifying, from the consensus causal relationship network, a causal relationship unique in drug-induced toxicity, wherein a gene, lipid, protein, metabolite, transcript, or SNP associated with the unique causal onship is identified as a tor of drug-induced toxicity. 2. The method of claim 1, wherein second data set representing the functional activity or cellular response of the cells comprises one or more of bioenergetics, cell proliferation, (14289951_1):GGG sis, organellar function, a genotype-phenotype association actualized by functional models selected from ATP, ROS, OXPHOS, and oxygen consumption rate (OCR) assays, global enzyme activity, and an effect of global enzyme activity on the enzyme metabolic substrates of cells associated with drug-induced toxicity. 3. The method of claim 2, wherein the global enzyme activity is global kinase activity, and wherein the effect of global enzyme activity on the enzyme metabolic substrates is the phospho 4. The method of claim 1, wherein the first data set comprises two or more of genomics, lipidomics, mics, lomics, transcriptomics, and single nucleotide polymorphism (SNP) data.
. The method of claim 1, wherein the AI-based informatics platform receives all data input from the first data set and the second data set without applying a statistical cut-off point. 6. The method of claim 1, wherein the consensus causal relationship network established in step (4) is r refined to a simulation causal relationship network, before step (5), by in silico simulation based on input data, to provide a confidence level of prediction for one or more causal relationships within the consensus causal relationship network. 7. The method of claim 1, wherein the unique causal relationship is identified as part of a ential causal relationship network that is uniquely present in cells, and absent in the matching control cells. 8. The method of claim 1, wherein the unique causal relationship identified is a relationship between at least one pair selected from the group consisting of expression of a gene and level of a lipid; expression of a gene and level of a transcript; sion of a gene and level of a lite; expression of a first gene and a second gene; expression of a gene and presence of a SNP; expression of a gene and a functional activity; level of a lipid and level of a transcript; level of a lipid and level of a metabolite; level of a first lipid and a second lipid; level of a lipid and presence of a SNP; level of a lipid and a functional activity; level of a first transcript and level of a second transcript; level of a transcript and level of a lite; level of a transcript and (14289951_1):GGG presence of a SNP; level of a first ript and a onal activity; level of a first metabolite and level of a second metabolite; level of a metabolite and ce of a SNP; level of a metabolite and a functional activity; level of a first SNP and presence of a second SNP; and presence of a SNP and a functional activity. 9. The method of claim 8, wherein the functional activity is selected from the group consisting of bioenergetics, cell eration, apoptosis, organellar function, kinase activity, protease activity, and a genotype-phenotype ation actualized by functional models selected from ATP, ROS, OXPHOS, and oxygen consumption rate (OCR) assays.
. The method of claim 1, further comprising validating the identified unique causal relationship in drug-induced toxicity. 11. The method of claim 1, wherein the drug-induced toxicity is drug-induced cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity. 12. The method of claim 11, wherein the drug-induced cardiotoxicity is cardiomyopathy, heart failure, atrial fibrillation, cardiomyopathy and heart e, heart failure and LV dysfunction, atrial flutter and fibrillation, or heart valve damage and heart failure. 13. The method of claim 1, n the model for drug-induced toxicity comprises cell cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neuronal cells, renal cells, or sts. 14. The method of claim 1, n the model for drug-induced toxicity comprises a toxicity inducing drug, cancer drug, diabetic drug, neurological drug, or anti-inflammatory drug.
. The method of claim 1, wherein the modulator of drug-induced toxicity is an inhibitor of drug-induced toxicity. (14289951_1):GGG 16. The method of any one of claims 1-15, wherein step (4) comprises: generating a differential causal relationship network from the consensus causal relationship network based on the first data set and the second data set and a second consensus causal relationship network based on comparison cell data; and identifying the causal relationship unique in drug-induced ty as part of the differential causal relationship network. 17. The method of any one of claims 1-15, wherein step (3) ses: creating a set of network fragments and determining a probabilistic score associated with each network fragment based on the first data set and the second data set; ng an ensemble of trial networks, each trial network constructed from a ent subset of the set of k fragments; and optimizing the ensemble of trial networks by evolving each trial network through local transformations, wherein the consensus causal relationship network is based on the optimized ensemble of trial networks.
Berg LLC By the Attorneys for the Applicant SPRUSON & FERGUSON Per: (14289951_1):GGG 1/35 380$ uszwafimfi. coBmSQENE 3.2% $5393 Ucm $93 :23sz EBXwEOU cornuczcwwx .5 :8 An _ .8ch 803332 .9593? ”57.2:qu 3535 cozuczu E_:=mu-EE_ 28 - 53th -95.: 33.3958 *0 53> .9502 mwcmo 299a ' mezmcmfihmmueq 85530230 2939355336 mcosumhmt: 230 53:3 .- o [Annotation] ggg 2/35 Ceii cycie J : C etai V~ y GEucoge immune Metaboissm V V V } resanse g response """"" vvvvvv VVVVVV Fig. 2 [Annotation] ggg 3/35 $3202 222 co_uuc:n<mQ30cm;n_ aussuEEoRfi AIIIIIIIIV m 3850 9:06:35th E mEowonSwE BLUOZDQJBILH [Annotation] ggg 4/35 UwUDme “EwECOggcm 332:0 mm: cmmzxo 6389: 0 oncm 3832 mmmcaobmé [Annotation] ggg /35 Pre-clinical In vitro/ In vivo models Disease/Drug toxicity Clinical Gene expression Tissue specific gene expression % Proteomics Intracellular Probes &lor Stressors l Proteomics 1p onal Assays Survrval studies Fig.5 ation] ggg 6/35 awn—30:05; $3me 3me gmmmm 3me £82.; 53:8 mama <mjm mU_EOOHO&n_ WEE E52 mascuan o Commmwb 4‘sz m_w—UOE flmUOCL n flh 0mfiwm_n_ OPS , ZN E“BEG [Annotation] ggg 7/35 The Interrogative y Drug Discovery Platfrom Tissue specific gene Gene expression expressnon Data Omics Omics integration signature signatures Outcome measurement G Functional assays y of life Survival studies studies REFSTM Model building data mining Mechanisms Therapeutic target. of toxicity Fig. 7 [Annotation] ggg 8/35 The "Omics" Cascades ‘———————————————-> Interactome cs ‘—>~ Transcriptomics "\"< .5 Proteomics , Metaboiomics ation] ggg 9/35 In vitro modeling lnterrogative MIMS: T1 Conditions Hyperglycemia, hyperlipidemia, hyperinsulinemia, inflammation Transcriptional profiling Omics profiling Cell types Functional assays Adipocytes, myotubes, hepatocytes, aortic smooth muscle, al tubular kidney, and intracellular cross talk lnterrogative MlMS= T2 Fig. 9 /35 .0355 wabogfia Eve—2Jim: c2359.“. 3355 .9”. “swans“. {03.2 newsman.
{Aw 35E?“— fiflrfigfig.
KWOM €ng x5332 .. 32 £9532a xguz3 “carcassa. x5352 @.@.acmuso 2 S Eucanu o. “Wages..©. $2.85m, 2 mm“ =9.BS:& «E850 11/35 (Normal vs. ic) Vs. (Diabetic T1 vs. Diabetic) Fig. 11 ation] ggg 12/35 [Annotation] ggg 13/35 mRNA VSpEicing Lipfid GTPase Metaboiism ‘ Signaiing ; Nucfieoseme Protein Transiation WM V iytic Endosomai' Processing \varaffickin [Annotation] ggg 14/35 m W m W mm E20823 :0 :82: E558 WEEEUEE :8 EEEEEE E :8 :0 _E0_ EEEE; £30 8:0ng $5250; :8 SEE. 0853305 0238280 0:320:38: S€838 m 203 “:8 £50: S6on EEO: EEEEEEE 2200:): :0: 380$ :0 EEEQ BEE ESE: :Emgmm E8 mEEmmEmg SEEES EOE ESE :8 :Em E: m EREEEES EEEEEmsw EEwEEEEEEE88 “83065 :8 EEBgm EoE :0: :8 ESE E ESESES3% mEEmmE 0 CQMEUEEEEO E00 mES”E; 3% EEEE£8 :mgezucs: 86 EEEEEEE E3: EEEEEQ Emwmmvommm :0 E2008 :0 8%:me mEEEVE E38 : 5:33 $333283EEEEEEE $805 mEEmmEEE 95 EEEEEEESE E88: :8 m Eficasfimm E305 “808:3: 0200mm: EEEEEE V :8 E08 $832 :0 E532: mm EEEE :0: 30505::E358 cm :8 00883 .0: “@865 £8 GEEEEE 0.8 :0 m 24W EEmmwaxw WEE EEEQESEEES mEEmeQE 3E8§m :0 1\\ :8 03805: £0 E 3% 83$???Egg EEEO E550: 3me53 $330330:“£050: E558 E89; “8&205 E :05: 8:va .80 ESE E 3:00;: E0 ESE 38: E0335 EEEE: :00: fiEmEEQ 5:050 333,: mEEEUE :8 EEEEEEE E88 :8 E ESEEEE:8 2:8 :000E EEmmEm E88 9:8£5) EEE <2mE :0: gm. EOE ESE/0‘02 «08388 EEaEaxmmEEmmEgE EOE #8 mEEE :8 :0 E0 :0 “2&me E0: 858mm 8330330: $8. EEEUEE EEEEm: 3:050; EEEEEQ comcmanu 380:0 £820: :0 E8 EmEEa $0 E ”Em 0:8 EEEEQ ESE 388 EEEBEQMEEEEEE Egg \EEQE “208:0: EESEEEWnBEmcmm EEEmtE E 98 E330 E88 :8 gm ENE mwcmm [Annotation] ggg /35 A. Data Processing B. Bayesian Fragment Enumeration Drug and pe to Phenotype Network Fragments SNP SNP SNP SNPSNP (ta wag *IQ girl; 33 al Data to Molecular Measures Network Fragments C. Parallel Ensemble Sampling.
D. Model intervention Simulation et cell line—specific enotype or mutation Ensemble of Models Fig. 15 [Annotation] ggg 16/35 input Raw Data (Expression Levei at Piuraiity at Genes 84 Functionai Activity er Ceiiuiar Reappnge) Pre—process raw data (normaiizationi missing vaiues etc.) Generate netvvprk fragment iibrary from , ocessed data Generate ensembie 0t initiai tnai . a , . netvverks from iikeiy netvverk fragments Optimize/waive each triai k in Zia ensembie Simuiatipn for quantitative parameter 220 extraction and/0r predictive purpeses Output quantitative reiatipnship parameters and/pr ether tipn predictions Fig. 16 [Annotation] ggg 17/35 Camputing Device v \ \iGG Proceggor e 102 ”8 Operating System Visuai Dispiay ‘iflé i252 Device 228 Executabie Muiti—pointiouch“33 User Code interface 124 i ace Painting Device i9() 2 Ai-Based Netwerk iniornatics interface System ______________________________ H4 Virtueii‘v’iachine Experimentai Precesseris) 5 System(s) Coreis) imuuunuu“mm“-i : i04’ [Annotation] ggg 18/35 33%wa “mm/«Um . au mmug {E..1 xkOEMWC NF IIIJr EEEZ mwmwfl E Amxafim %me 3.5 fiaiwmz $3 fagww: mmug 3 EEG fig xkcguwz EELOZ 2 EQEES Ammgflm Emma magmmmm wamumagu C mmug :thch VEOEHWC fiafiwwz ghwucmu mumewQ wagging £5;me [Annotation] ggg 19/35 mEEoi mcmfi mu_E0U_Q__\mu_EoBoi \m>mmm< 0:505 720325“. 5332/ EmEummfi .9“. 220:: 320 mcEEmU EEwUEngrBEEz mma>uo>Eo_Emu :mEsz ation] ggg /35 225 mnz H3m>m+h :- 04- #3 ad #6 To- 0.0- F.»- 0.7 TN- 9N- Tm- QM- “Carma mnz o: .9; th>H :- 04. #3 _.m.
@.N. o. 3.». ad To ed- $.0- e.
[Annotation] ggg 21/35 NG HG NG HG G HG G HG Fig. 21A Glycerolphosphate r/‘AcyI-COA‘\ Acyl-CoA Mitochondria GPA“ GPAT3 Endoplasmic GPATZ GPAT4 reticulum Acyl—CoA Cell signalling AGPATl asmic AGPATZ reticulum Nucleus AGPAT3-5 TAZ riptional, .
Pl, PG,CL <———PA Coactivation , , Cells;. nallln. / m u g }\:Llpln3, mm g 1 - - Endoplasmic pi reticulum PC, PE, PS +—-DAG . . / Acyl-CoA Cell Signalling DGATl DGAT2 Fig. 218 [Annotation] ggg 22/35 XFAssay_92201 1_1745 ,xls 214 A818 C 66 83 99 TIME (min) Fig. 22A 140 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ 100 . -‘-““""‘"”" Patient 1 Patient 3 _____ [Annotation] ggg 23/35 maggu EEG?“ gofimméogamong mmmuea gogmficmmho mmmmgmmofi Egmmfiam togmcg $33 togmcmb Egg Egg £3? tommcmb £03353» £83on $3th 38 togmcmb ufimfiimofi Em sebum? mvcsoQES meES EEEUEEM E38 36%me gofimmbosamoxg a??? togmcm: gmmfixo mcmEEmE Q "93%on $333ng coaummm fimEEEmoE Bmggmoga .E E305 “we EEG cosmmsmg cobugm &3th cosmnmxo mmmwcwmozghofic mmmmfigm ESQEWE magma he E a? cefimémb “$35050me cosfiwcmm amaze [Annotation] ggg 24/35 EC enwmnmen t N G .fi%.fi.m w8:...itwh erflfiawe Cat5%mmCV5ua“ d.mhQ.huS baakgraund Fig. 24 ation] ggg /35 [Annotation] ggg 26/35 $508“..qu mEmEmgumE SECREQOEMQ 3.86232 ‘5me mEBQn. o. w. mxw:0 UWHDWQMEWSH.39¢ <zm ”/55 :2045”NI. mm .__ EL: \u«5‘ an ”5, .mm um: \0x 32:2 muamELQE OI IUcA~101quuimmUv 02800285 O o GouEonzozsgm: Ummmn OlmT—U omzu- wucmewE. y 02> w EDT—BE S 3350 . a. q”9,? .
[Annotation] ggg 27/35 co___E 2032328 4% >EEmmmm _ 6602 3:338 $393 .3253“. .3me 82:0 ® _< muzmctoué ® co nwmmm®fozsmz ® 29.5605 885%: .mzcmhmtfi .mcozazumcmc. @559“. \mu_EoBoi £60.29. EcEUcE 333.1 53mm; @353 conga? .9”. 95 n; 920 x “595mm; 230823 Eocov \. 3: Eu< 9:032 Eot cm 220:: >Eo_Emu 220 mcEEmu meE \oEEZ 283298;: mmiuo‘AEofihmu ation] ggg 28/35 29/35 m5... ~< 2 Am wmm_>xo‘_v>: F 3.3. 205.02; .mch 559.25% mamumu. iu ¢3.05I 25805: 3:333 Bing—.05 323. 3:03:33 Kan. :(m -mm>u N<Ivm -uz: . . . . . .325 we 9:033: «cwEacfiiu 532; [Annotation] ggg /35 szma mwu>uo>Eo§mu gm m .9”.
Emumnmu £3553 n_._.< mtgocuogé mo-mo.¢ 00-9mm 00-00%“ @9me oo-mo.m woémé 00-004 no-wo 00.2;v wo-wm.m oo-mo.m m moboN woémé 00-00; No-00 div mam div $882 $820 :9:mmOUEU ation] ggg 31/35 Ngrmai Synthesig ation] ggg 32/35 Refiucefi Synthesk fig, 3‘33 [Annotation] ggg tmgfimm V a“? am V EEEZ m?» mmmmgugm @ m <0 [Annotation] ggg 34/35 V (xwmmmaoé\\aj/ @mufifiwm mafia fimwgwc>m } x/i\x mammmeu \xgz/ fa as; .3 m<ogow 3&3 azwygu<, Vfigmmg @3ng “mgtfiz mag flmmgwc>m [Annotation] ggg /35 QZ wm LEJUw—OE .m: 39‘.va
NZ722231A 2012-05-22 2012-09-07 Interrogatory cell-based assays for identifying drug-induced toxicity markers NZ722231B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261650462P 2012-05-22 2012-05-22
US61/650,462 2012-05-22
NZ701908A NZ701908B2 (en) 2012-05-22 2012-09-07 Interrogatory cell-based assays for identifying drug-induced toxicity markers

Publications (2)

Publication Number Publication Date
NZ722231A NZ722231A (en) 2018-02-23
NZ722231B2 true NZ722231B2 (en) 2018-05-24

Family

ID=

Similar Documents

Publication Publication Date Title
AU2012381038B2 (en) Interrogatory cell-based assays for identifying drug-induced toxicity markers
KR20150043566A (en) Use of markers in the identification of cardiotoxic agents
AU2017267184B2 (en) Method for assessing a prognosis and predicting the response of patients with malignant diseases to immunotherapy
CN107077536B (en) Evaluation of activity of TGF-beta cell signaling pathway using mathematical modeling of target gene expression
RU2721130C2 (en) Assessment of activity of cell signaling pathways using a linear combination(s) of target gene expression
RU2719194C2 (en) Assessing activity of cell signaling pathways using probabilistic modeling of expression of target genes
KR102023584B1 (en) PREDICTING GASTROENTEROPANCREATIC NEUROENDOCRINE NEOPLASMS (GEP-NENs)
DK2644712T3 (en) A method for diagnosing neoplasms
AU2014299322B2 (en) Sepsis biomarkers and uses thereof
US20230416827A1 (en) Assay for distinguishing between sepsis and systemic inflammatory response syndrome
KR101421326B1 (en) Composition for predicting prognosis of breast cancer and kit comprising the same
KR20140140069A (en) Compositions and methods for diagnosis and treatment of pervasive developmental disorder
AU779411B2 (en) Biallelic markers derived from genomic regions carrying genes involved in arachidonic acid metabolism
WO2003042661A2 (en) Methods of diagnosis of cancer, compositions and methods of screening for modulators of cancer
AU2018210695A1 (en) Molecular subtyping, prognosis, and treatment of bladder cancer
CA2442820A1 (en) Microarray gene expression profiling in clear cell renal cell carcinoma: prognosis and drug target identification
AU2016331663A1 (en) Pathogen biomarkers and uses therefor
MXPA05005653A (en) Heart failure gene determination and therapeutic screening.
CA2403946A1 (en) Genes expressed in foam cell differentiation
KR20060045950A (en) Prognostic for hematological malignancy
KR20070099564A (en) Methods for assessing patients with acute myeloid leukemia
US20020137077A1 (en) Genes regulated in activated T cells
CN101778954A (en) Predictive markers for egfr inhibitor treatment
AU784761B2 (en) Biallelic markers related to genes involved in drug metabolism
JP2003235573A (en) Diabetic nephropathy marker and its utilization