NZ701908B2 - Interrogatory cell-based assays for identifying drug-induced toxicity markers - Google Patents

Interrogatory cell-based assays for identifying drug-induced toxicity markers Download PDF

Info

Publication number
NZ701908B2
NZ701908B2 NZ701908A NZ70190812A NZ701908B2 NZ 701908 B2 NZ701908 B2 NZ 701908B2 NZ 701908 A NZ701908 A NZ 701908A NZ 70190812 A NZ70190812 A NZ 70190812A NZ 701908 B2 NZ701908 B2 NZ 701908B2
Authority
NZ
New Zealand
Prior art keywords
drug
cell
biomarkers
level
protein
Prior art date
Application number
NZ701908A
Other versions
NZ701908A (en
Inventor
Niven Rajin Narain
Rangaprasad Sarangarajan
Vivek K Vishnudas
Original Assignee
Berg Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Berg Llc filed Critical Berg Llc
Priority to NZ722231A priority Critical patent/NZ722231B2/en
Priority claimed from PCT/US2012/054323 external-priority patent/WO2013176694A1/en
Publication of NZ701908A publication Critical patent/NZ701908A/en
Publication of NZ701908B2 publication Critical patent/NZ701908B2/en

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P39/00General protective or antinoxious agents
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P39/00General protective or antinoxious agents
    • A61P39/02Antidotes
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • A61P9/04Inotropic agents, i.e. stimulants of cardiac contraction; Drugs for heart failure
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P9/00Drugs for disorders of the cardiovascular system
    • A61P9/06Antiarrhythmics
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6834Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/142Toxicological screening, e.g. expression profiles which identify toxicity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6842Proteomic analysis of subsets of protein mixtures with reduced complexity, e.g. membrane proteins, phosphoproteins, organelle proteins
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks

Abstract

The instant application provides several biomarkers associated with drug induced toxicity which are useful in methods for predicting potential toxicity of a molecule or drug candidate, and as potential therapeutic targets for treating, preventing or counteracting drug-induced toxicity.

Description

INTERROGATORY CELL-BASED ASSAYS FOR IDENTIFYING DRUG- D TOXICITY MARKERS Cross-Reference to Related Applications This application claims priority to 11.8. Provisional Application Serial No., 6 2 filed May 22, 20 | 2, the entire content of which is incorporated .
Background ofthe Invention 'l‘he phannaceutical industry is currently witnessing a 90% attrition of potential compounds entering clinical development. 30% of which is owing to poor clinical safety (Kola et al.(2004) Nat Rev Drug Discovery-3 711-715) .
In the ll.S., fatal adverse drug reactions (ADRs) are the 4m to 6'h leading causes of death. Costs directly attributable to ADRs may lead to an additional $1.56 to $4 billion in direct al costs per year in the [1.8. (lazarou .I et al.(1998) JAM/1: 279(15):]200-1225). The cost of drug discovery and development has increased to about $1 billion. partly due to increased attrition of compounds and NME late in clinical development ( Adams CP. Brantner W (2010) “. pending on New Drug Development” Health Econ. 19: 130—141). The lack of reliable tools that can help with predicting toxicity early in drug development is partly to blame for increasing costs and lower return on investment. Further. drug safety issues are the leading cause of increased litigation and settlements in the pharmaceutical industry. Between January 2009 and May 2011 the industry has spent over [181) 8 billion on litigation cases related to drug safety issues.
In order to t a “kill early policy” of compounds in early clinical trials and drug development, the FDA is now encouraging the drug industry and the community to adopt a very tive strategy. FDA white paper Innovation or Stagnation: nges and Opportunity on the Critical Path to New l Projects states. “A new product development toolkit containing powerful new scientific and technical methods such as animal or computer-based predictive models, biomarkers for safety and effectiveness, and new clinical evaluation techniques—is urgently needed to improve predictability and efficiency along the critical path from laboratory t to commercial product” (FDA. 2005). The FDA ation clearly underscores the lack of innovative technologies that can aid in efficient decision making in drug development.
Cardiotoxicity refers to a broad range of adverse effects on heart function induced by therapeutic molecules. Cardiotoxicity may emerge early in pre-clinical studies or become apparent later in the clinical setting. It is a leading cause of drug withdrawal, accounting for over 45% of all drugs withdrawn since 1994, which results in significant financial burden for drug development. Cardiovascular toxicity includes increased QT duration, arrhythmias, myocardial ischemia, hypertension and oembolic cations, and myocardial dysfunction.
Cardiac safety biomarkers currently used by the FDA are QTc prolongation — lectrophysiological arrhythmias, circulating in 0, heart rate, blood pressure, lipids, troponin, C-reactive protein (CRP), brain ot B-type natriuretic peptide (BNP), ex vivo platelet ation, and imaging biomarkers (cardiac magnetic resonance imaging).
The QTc gation is a very robust but complex marker. However, a decision on r to kill or n a drug in early pment is hard to make based on QTc alone. In addition, QTc is subjective and is dependent upon underlying pathologies that can lead to tachyarrythmias.
In view of the foregoing, it is evident that new cardiac safety biomarkers, such as molecular cardiac safety biomarkers, are needed in the art.
Summary ofthe Invention The platform technology described herein is useful for identifying markers associated with nduced toxicity. This rm technology integrates molecular interactions within and across a hierarchy of models starting from primary human cell based model to human clinical samples. This approach leads to the identification of biomarkers that reflect an underlying toxicity caused by a compound or NME that is a potential drug, such as a drug candidate ready to enter phase I clinical trials. Drug induced toxicities can e cardiac, renal, hepatic and other tissue toxicity. The instant application provides several novel biomarkers associated with nduced toxicity, and which are useful in methods for predicting potential toxicity of a molecule or drug candidate, and as ial therapeutic targets for treating, preventing or counteracting drug-induced toxicity.
The invention bed herein is based, at least in part, on a novel, collaborative utilization of network biology, genomic, proteomic, lomic, transcriptomic, and bioinformatics tools and methodologies, which, when combined, may be used to study any biological system of interest, such as obtaining t into the molecular mechanisms associated with or causal for nduced toxicity. The platform technology is further described in international PCT Application PCT/U82012/027615, the entire contents of which are hereby expressly incorporated herein. Additional embodiments of the platform technology, including a description of how to carry out platform technology methods involving oration of enzyme (e.g., kinase) activity data, are described in US. Application Serial No. 13/607,587, filed on September 7, 2012, the entire contents of which are expressly incorporated herein by reference. In a first step, cellular modeling systems are developed to probe a drug-induced toxicities, such as cardiotoxicity, hepatotoxicity, toxicity, neurotoxicity, renaltoxicity or myotoxicity .
A cellular system modeling drug—induced toxicity can comprise toxicity- related cells subjected to various -relevant environment stimuli (e.g., hyperglycemia, hypoxia, immuno-stress, and lipid peroxidation, or exposure to a test molecule or drug ate). In some embodiments, the cellular modeling system es cellular cross- talk isms between various interacting cell types d to specific drug-induced toxicity, such as cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neuronal cells, renal cells, or myoblasts. High throughput biological readouts from the cell model system are ed by using a combination of techniques, including, for example, cutting edge mass spectrometry (LC/MSMS), flow cytometry, cell-based assays, and onal assays. The high throughput biological readouts are then subjected to a bioinformatic analysis to study congruent data trends by in vitro, in vivo, and in silico modeling. The resulting matrices allow for cross-related data mining where linear and non-linear regression analysis were developed to reach conclusive pressure points (or “hubs”). These “hubs”, as presented herein, are ates for drug discovery.
In particular, these hubs represent potential drug targets for reducing or alleviating drug- induced toxicity and/or drug-induced toxicity s.
The molecular signatures of the differentials allow for t into the isms that dictate the alterations in the tissue microenvironment that lead to drug- induced toxicity. Taken together, the combination of the entioned technology platform with strategic cellular modeling allows for robust intelligence that can be employed to further establish an tanding of the underlying mechanisms and molecular drivers buting to drug-induced toxicity, e.g., cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renal toxicity or myotoxicity while creating biomarker libraries that may allow early identification of drug candidates at risk for causing drug-induced toxic effects, as well as drug targets that may reduce or alleviate drug-induced toxicity.
A significant feature of the platform of the invention is that the AI-based system is based on the data sets obtained from the drug-induced toxicity cell model system, without resorting to or taking into eration any existing knowledge in the art, such as known biological relationships (i.e., no data points are cial), ning the drug- induced toxicity. Accordingly, the resulting statistical models generated from the platform are unbiased. Another significant feature of the platform of the ion and its components, e. g., the cell model systems and data sets obtained therefrom, is that it allows for continual building on the drug—induced toxicity cell models over time (e. g., by the introduction of new cells andfor conditions), such that an initial, “first generation” consensus causal onship k generated from a cell model for a drug-induced ty can evolve along with the evolution of the cell model itself to a multiple generation causal relationship network (and delta or delta-delta networks obtained therefrom). In this way, both the drug-induced toxicity cell models, the data sets from the drug-induced toxicity cell models, and the causal relationship networks generated from the drug-induced toxicity cell models by using the Platform Technology methods can constantly evolve and build upon previous knowledge obtained from the Platform Technology.
The present ion is based, at least in part, on the identification of novel biomarkers that are associated with drug-induced cardiotoxicity. The invention is further based, at least in part, on the discovery that Coenzyme Q10 is e of reducing or preventing drug-induced cardiotoxicity.
Accordingly, the invention es methods for identifying an agent that causes or is at risk for causing cardiotoxicity. In one embodiment, the agent is a drug or drug candidate. In one embodiment, the toxicity is drug-induced toxicity, e. g., toxicity.
In one embodiment, the agent is a drug or drug candidate for treating diabetes, obesity, a cardiovascular disorder, cancer, a neurological disorder, or an inflammatory er.
In these s, the amount of one or more biomarkers/proteins in a pair of samples (a first sample not subject to the drug ent, and a second sample subjected to the drug treatment) is assessed. A tion in the level, expression level, or ty of the one or more biomarkers in the second sample as compared to the level of expression of the one or more biomarkers in the first sample is an indication that the drug causes or is at risk for causing nduced cardiotoxicity. In one ment, the one or more biomarkers is selected from the markers listed in table 2. The methods of the present invention can be practiced in conjunction with any other method used by the skilled practitioner to identify a drug at risk for causing drug-induced cardiotoxocity.
In one embodiment, a drug that may be used in the s of the invention includes, but is not limited to, cyclines, 5-Fluorouracil, Cisplatin, Trastuzumab, Gemcitabine, Rosiglitazone, Pioglitazone, Troglitazone, oline, Pergolide, Sumatriptan, sphonates, and TNF antagonists.
In one aspect the invention provides a method for identifying a drug that causes or is at risk for causing drug-induced cardiotoxicity, comprising: (1) determining a level of expression of one or more biomarkers in a cell sample obtained following treatment with a drug; and (2) comparing the level of expression of the one or more biomarkers t in the cell sample obtained following treatment with the drug with a level of expression of the corresponding one or more biomarkers t in a cell sample obtained prior to treatment with the drug; wherein the one or more biomarkers comprises coiled-coil domain ning 47 (CCDC47); and wherein a modulation in the level of expression of the one or more biomarkers in the sample obtained following treatment with the drug as compared to the level of expression of the corresponding one or more biomarkers present in the sample obtained prior to treatment with the drug is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
In one aspect the invention provides a method for identifying a rescue agent that can reduce or prevent drug-induced cardiotoxicity comprising: (1) determining a level of expression of one or more biomarkers present in a cell sample obtained following treatment with a cardiotoxicity inducing drug and a candidate rescue agent; and (2) comparing the level of expression of the one or more biomarkers in the sample obtained following treatment with the cardiotoxicity inducing drug and the (11115688_1):JJP candidate rescue agent with the normal level of expression of the corresponding one or more kers present in a cell sample obtained prior to treatment with the cardiotoxicity inducing drug and candidate rescue agent; wherein the one or more kers comprises coiled-coil domain containing 47 (CCDC47); and n a normalized level of expression of the one or more biomarkers in the sample obtained following treatment with the cardiotoxicity inducing drug and the candidate rescue agent as compared to the normal level of sion of the corresponding one or more biomarkers in the sample obtained prior to treatment with the toxicity inducing drug and the candidate rescue agent is an indication that the candidate rescue agent is a rescue agent which can reduce or prevent drug-induced cardiotoxicity.
In one aspect, the invention provides the use of a rescue agent identified by the method of the invention for the preparation of a medicament for alleviating, reducing or preventing drug-induced cardiotoxicity in a subject.
In one aspect the ion provides a method for identifying a rescue agent for the prevention, reduction or ent of drug-induced cardiotoxicity, comprising: (a) determining a level of one or more biomarkers in a first cell sample obtained following treatment with a cardiotoxicity inducing drug; (b) determining the level of the one or more kers in a second cell sample obtained following treatment with the cardiotoxicity ng drug and a candidate rescue agent; and (c) ing the level of the one or more biomarkers in the second cell sample with the level of the corresponding one or more biomarkers in the first cell sample; wherein the one or more biomarkers comprises coiled-coil domain containing 47 (CCDC47), and wherein a modulation in the level of the one or more biomarkers in the second cell sample as compared to the first cell sample is an indication that the candidate rescue agent is a rescue agent for the prevention, reduction or ent of nduced cardiotoxicity.
In one aspect, the invention provides a method for identifying a drug that causes or is at risk for causing drug-induced cardiotoxicity, comprising: comparing (i) the level of expression of one or more biomarkers present in a first cell sample obtained prior to the treatment with the drug; with (ii) the level of expression of the one or more biomarkers present in a second cell sample obtained following the treatment with the drug; wherein 11544667 the one or more biomarkers is selected from the s listed in table 2; wherein a modulation in the level of expression of the one or more biomarkers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
In one embodiment, the cells are cells of the cardiovascular system, e.g., cardiomyocytes.
In one ment, the cells are diabetic cardiomyocytes. In one embodiment, the drug is a drug or candidate drug for ng diabetes, obesity,cardiovascular disease, cancer, neurological disorder, or inflammatory disorder. In one embodiment, the drug is any one of Anthracyclines, 5-Fluorouracil, tin, Trastuzumab, Gemcitabine, Rosiglitazone, Pioglitazone, tazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, and TNF antagonists.
In one embodiment, a modulation (e.g., an increase or a decrease) in the level of expression of one, two, three, four, five, six, seven, eight, nine, ten, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2 in the second 11544667 sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug-induced toxicity.
In one embodiment, a modulation (e.g., an increase or a decrease) in the level of expression of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), 1C1?, EDIL3, HMOXl, NUCB1, C8010, HSPA4 in the second sample as ed to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
Methods for fying a rescue agent that can reduce or prevent drug-induced cardiotoxicity are also provided by the invention. In these methods, the amount of one or more biomarkers in three samples (a first sample not subjected to the drug treatment, a second sample subjected to the drug treatment, and a third sample subjected both to the drug treatment and the agent) is assessed. A normalized level of expression of the one or more biomarkers in the third sample as compared to the first sample, with a change of expression in the second e d with the drug, is an indication that the rescue agent can reduce or prevent nduced cardiotoxicity. In one embodiment, the one or more biomarkers is selected from the markers listed in table 2.
Using the methods described herein, a variety of molecules, particularly ing molecules sufficiently small to be able to cross the cell membrane, may be screened in order to identify molecules which modulate, e.g., increase or decrease the expression and/or activity of a marker of the ion. Compounds so identified can be provided to a subject in order to reduce, ate or prevent drug-induced toxicity in the subject.
Accordingly, in another aspect, the invention provides a method for identifying a rescue agent that can reduce or prevent drug-induced cardiotoxicity comprising: (i) determining a normal level of expression of one or more biomarkers present in a first cell sample obtained prior to the treatment with a toxicity inducing drug; (ii) determining a treated level of expression of the one or more biomarkers present in a second cell sample ed following the treatment with the toxicity inducing drug to identify one or more biomarkers with a change of expression in the treated cell sample; (iii) ining the level of expression of the one or more biomarkers with a changed level of expression in the cardiotoxicity inducing drug treated sample present in a third cell sample obtained following the treatment with the cardiotoxicity inducing drug and the rescue agent; and (iv) comparing the level of expression of the one or more biomarkers determined in the third sample with the level of expression of the one or more biomarkers present in the first sample; wherein the one or more kers is selected from the markers listed in table 2; and wherein a normalized level of expression of the one or more biomarkers in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug—induced cardiotoxicity.
In one embodiment, the cells are cells of the cardiovascular system, e.g., cardiomyocytes. In one embodiment, the cells are diabetic myocytes. In one embodiment, the drug is a drug or candidate drug for treating diabetes, obesity,cardiovascular disease, cancer, neurological disorder, or inflammatory disorder.
In one embodiment, the drug is is any one of Anthracyclines, 5-Fluorouracil, Cisplatin, Trastuzumab, Gemcitabine, itazone, Pioglitazone, Troglitazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, and TNF antagonists. In one ment, about the same level of expression of one, two, three, four, five, six, seven, eight, nine, ,11, 12, 13,14, 15,16,17, 18,19, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the s listed in table 2 in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
In one embodiment, a ized level of expression of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen,markers selected from a group ting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), 1C17, EDIL3, HMOXl, NUCBl, C8010, HSPA4, in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent nduced cardiotoxicity.
The invention further provides methods for alleviating, ng or preventing drug-induced cardiotoxicity in a t in need thereof, comprising administering to a subject (e.g., a mammal, a human, or a non-human animal) an agent identified by the screening methods provided herein, thereby reducing or preventing drug-induced cardiotoxicity in the subject. In one embodiment, the agent is administered to a subject that has already been treated with a cardiotoxicity-inducing drug. In one ment, the agent is administered to a subject at the same time as treatment of the subject with a cardiotoxicity-inducing drug. In one ment, the agent is administered to a subject prior to treatment of the t with a cardiotoxicity-inducing drug.
The invention further provides methods for alleviating, ng or preventing drug-induced cardiotoxicity in a subject in need thereof, comprising administering Coenzyme Q10 to the subject (e.g., a mammal, a human, or a non-human animal), thereby ng or preventing drug—induced cardiotoxicity in the subject. In one embodiment, the Coenzyme Q10 is administered to a subject that has already been treated with a cardiotoxicity-inducing drug. In one ment, the Coenzyme Q10 is stered to a subject at the same time as ent of the subject with a cardiotoxicity-inducing drug. In one embodiment, the Coenzyme Q10 is administered to a subject prior to treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the nduced cardiotoxicity is associated with modulation of expression of one, two, three, four, five, six, seven, eight, nine, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, 1 and 10, 2 and 5, 2 and 10, or 5 and 10 of the foregoing genes (or proteins).
In one embodiment, the drug-induced cardiotoxicity is cardiomyopathy, heart failure, atrial fibrillation, cardiomyopathy and heart failure, heart failure and LV dysfunction, atrial flutter and fibrillation, or heart valve damage and heart failure.
The ion further provides biomarkers (e.g, genes and/or proteins) that are useful as predictive markers for drug-induced cardiotoxicity. These biomarkers include the markers listed in table 2.
In one embodiment, the drug-induced cardiotoxicity is associated with modulation of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen, markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP?), 1C17, EDIL3, HMOXl, NUCBl, C8010, HSPA4.
In one embodiment, the tive markers for drug-induced cardiotoxicity is a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), 1C17, EDIL3, HMOXI, NUCBl, C8010, HSPA4.
The ry skilled artisan would, however, be able to identify additional biomarkers predictive of drug-induced cardiotoxicity by employing the methods described herein, e. g., by carrying out the methods described in Example 3 but by using a different drug known to induce cardiotoxicity. ary drug-induced cardiotoxicity biomarkers of the invention are further described below.
In one aspect, the invention relates to a method for identifying a modulator of adrug-induced toxicity, said method comprising: (1) establishing a model for drug- induced toxicity, using cells associated with drug-induced toxicity, to represents a characteristic aspect of drug-induced toxicity; (2) obtaining a first data set from the model for drug-induced toxicity, wherein the first data set represents one or more of genomics, lipidomics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data characterizing the cells associated with nduced toxicity; (3) ing a second data set from the model for drug-induced toxicity, n the second data set represents a functional activity or a ar response of the cells associated with drug-induced ty; (4) generating a consensus causal relationship network among the expression levels of the one or more of genomics, lipidomics, proteomics, metabolomics, transcriptomics, and single nucleotide polymorphism (SNP) data and the onal activity or cellular response based solely on the first data set and the second data set using a programmed computing device, wherein the generation of the consensus causal relationship network is not based on any known biological relationships other than the first data set and the second data set; (5) identifying, from the consensus causal relationship network, a causal relationship unique in drug-induced toxicity, wherein a gene, lipid, protein, metabolite, transcript, or SNP associated with the unique causal onship is identified as a modulator of drug-induced ty.
In certain embodiments, the modulator stimulates or es the drug-induced toxicity.
In n embodiments, the modulator inhibits the drug-induced ty.
In certain embodiments, the model of the drug-induced toxicity comprises an in Vitro culture of cells associated with the drug-induced toxicity, optionally further comprising a matching in Vitro culture of control cells.
In certain ments, the in Vitro culture of the cells is subject to an environmental perturbation, and the in Vitro e of the matching control cells is identical cells not subject to the environmental perturbation.
In certain embodiments, the environmental perturbation comprises one or more of a contact with an agent, a change in culture condition, an introduced genetic modification / mutation, and a e (e. g., vector) that causes a genetic modification / mutation.
In certain embodiments, the first data set comprises protein and/or mRNA expression levels of the plurality of genes.
In certain embodiments, the first data set further ses two or more of genomics, lipidomics, proteomics, metabolomics, riptomics, and single nucleotide polymorphism (SNP) data. In certain ments, the first data set further comprises three or more of genomics, lipidomics, proteomics, metabolomics, riptomics, and single nucleotide polymorphism (SNP) data.
In n embodiments, the second data set representing the functional activity or cellular response of the cells ses one or more of bioenergetics, cell proliferation, apoptosis, organellar function, a genotype-phenotype association actualized by functional models selected from ATP, ROS, OXPHOS, and Seahorse assays, global enzyme activity, and an effect of global enzyme activity on the enzyme metabolic substrates of cells associated with drug-induced toxicity. In one embodiment, the global enzyme activity is global kinase activity. In one ment, the effect of global enzyme activity on the enzyme metabolic substrates is the phospho proteome of the cell.
In certain embodiments, step (4) is carried out by an artificial intelligence (AI) - based informatics platform.
In certain embodiments, the AI—based informatics platform comprises REFS(TM).
In certain embodiments, the ed informatics platform receives all data input from the first data set and the second data set without applying a statistical cut-off point.
In certain embodiments, the consensus causal relationship network established in step (4) is further refined to a simulation causal onship network, before step (5), by in silico simulation based on input data, to provide a confidence level of prediction for one or more causal relationships within the consensus causal relationship k.
In certain embodiments, the unique causal relationship is identified as part of a differential causal relationship k that is uniquely present in cells, and absent in the matching control cells.
In one embodiment, the unique causal relationship identified is a relationship between at least one pair selected from the group consisting of expression of a gene and level of a lipid; expression of a gene and level of a transcript; expression of a gene and level of a lite; expression of a first gene and a second gene; sion of a gene and presence of a SNP; expression of a gene and a functional activity; level of a lipid and level of a transcript; level of a lipid and level of a metabolite; level of a first lipid and a second lipid; level of a lipid and presence of a SNP; level of a lipid and a functional activity; level of a first transcript and level of a second transcript; level of a transcript and level of a metabolite; level of a transcript and presence of a SNP; level of a first transcript and a functional activity; level of a first metabolite and level of a second metabolite; level of a metabolite and presence of a SNP; level of a metabolite and a functional activity; level of a first SNP and presence of a second SNP; and presence of a SNP and a functional ty.
In one embodiment, the functional activity is selected from the group consisting of rgetics, cell proliferation, sis, organellar function, kinase activity, protease ty, and a genotype-phenotype association actualized by functional models selected from ATP, ROS, OXPHOS, and Seahorse assays. In certain embodiments, the method further comprising validating the identified unique causal relationship in a drug- indiced ty model.
In one embodiment, the drug—induced toxicity is drug-induced cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity or myotoxicity.
In one embodiment, the drug-induced cardiotoxicity is cardiomyopathy, heart failure, atrial fibrillation, cardiomyopathy and heart failure, heart failure and LV dysfunction, atrial flutter and fibrillation, or, heart valve damage and heart failure.
In one ment, the model for drug—induced ty comprises cardiomyocytes, diabetic cardiomyocytes, cytes, kidney cells, neuronal cells, renal cells, or myoblasts.
In one ment, the model for drug-induced toxicity comprises a toxicity inducing drug, cancer drug, diabetic drug, neurological drug, or anti-inflammatory drug.
In one embodiment, the drug is Anthracyclines, 5—Fluorouracil, Cisplatin, Trastuzumab, abine, itazone, Pioglitazone, Troglitazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, or TNF nists.
In one aspect, the invention provides a method for identifying a drug that causes or is at risk for causing drug-induced toxicity, comprising: comparing (i) a level of one or more biomarkers present in a first cell sample obtained prior to the ent with the drug; with (ii) a level of the one or more biomarkers present in a second cell sample obtained following the treatment with the drug; wherein the one or more biomarkers is selected from the tors identified by the methods described above; wherein a modulation in the level of the one or more kers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug- induced toxicity.
In one aspect, the invention provides a method for identifying a rescue agent that can reduce or prevent drug-induced toxicity comprising: (i) determining a normal level of one or more kers present in a first cell sample obtained prior to the treatment with a toxicity inducing drug; (ii) determining a treated level of the one or more biomarkers present in a second cell sample obtained ing the treatment with the toxicity inducing drug to identify one or more biomarkers with a change of level in the treated cell sample; (iii) determining the level of the one or more biomarkers with a d level in the toxicity inducing drug treated sample present in a third cell sample obtained following the treatment with the toxicity inducing drug and the rescue agent; and (iv) comparing the level of the one or more biomarkers determined in the third sample with the level of the one or more biomarkers present in the first sample; wherein the one or more biomarkers is selected from the modulators identified by the methods described above and wherein a normalized level of the one or more biomarkers in the third sample as compared to the first sample is an tion that the rescue agent can reduce or prevent drug-induced toxicity.
In another aspect, the invention relates to a method for ating, reducing or preventing drug-induced toxicity, comprising administering to a subject the rescue agent identified by the methods described above, thereby reducing or preventing drug-induced toxicity in the subject.
In another aspect, the ion relates to a method for providing a model for drug-induced toxicity for use in a platform method, comprising: establishing a drug- induced toxicity model, using cells associated with the nduced toxicity, to represent a characteristic aspect of the drug-induced toxicity, wherein the model for the drug-induced toxicity is useful for generating data sets used in the platform method; thereby providing a model for drug—induced toxicity for use in a platform method.
In one embodiment, the model for drug—induced toxicity comprises cardiomyocytes, diabetic cardiomyocytes, cytes, kidney cells, neuronal cells, renal cells, or myoblasts.
In another aspect, the invention relates to a method for obtaining a first data set and second data set from a model for drug-induced toxicity for use in a platform , comprising: (1) ing a first data set from the model for drug-induced toxicity for use in a platform method, n the model for the drug-induced toxicity comprises cells associated with the drug-induced toxicity, and wherein the first data set represents expression levels of a plurality of genes in the cells associated with the drug-induced toxicity; (2) obtaining a second data set from the model for nduced toxicity for use in the platform method, wherein the second data set represents a functional activity or a cellular se of the cells associated with the drug-induced toxicity; thereby obtaining a first data set and second data set from the model for the drug-induced toxicity for use in a platform method.
In r aspect, the invention s to a method for identifying a tor of drug-induced toxicity, said method comprising: (1) generating a consensus causal relationship network among a first data set and second data set obtained from a model for the drug-induced toxicity, wherein the model comprises cells ated with the drug-induced toxicity, and wherein the first data set ents expression levels of a plurality of genes in the cells and the second data set represents a functional activity or a cellular response of the cells, using a programmed computing device, wherein the generation of the consensus causal relationship network is not based on any known biological relationships other than the first data set and the second data set; (2) identifying, from the consensus causal relationship network, a causal relationship unique in the nduced toxicity, wherein a gene associated with the unique causal relationship is identified as a modulator of the drug-induced toxicity; thereby identifying a modulator of drug-induced toxicity.
In another aspect, the invention relates to a method for identifying a modulator of a drug-induced toxicity, said method comprising: 1) ing a consensus causal relationship network generated from a model for the nduced toxicity; 2) identifying, from the consensus causal relationship network, a causal relationship unique in the drug-induced toxicity, wherein a gene associated with the unique causal relationship is identified as a modulator of the drug—induced toxicity; thereby identifying a tor of a drug-induced toxicity.
In certain embodiments of the various s, the sus causal relationship network is generated among a first data set and second data set obtained from the model for the drug-induced toxicity, wherein the model comprises cells associated with the drug-induced toxicity, and wherein the first data set represents expression levels of a plurality of genes in the cells and the second data set represents a onal activity or a ar response of the cells, using a programmed computing device, wherein the generation of the consensus causal relationship k is not based on any known biological relationships other than the first data set and the second data set.
In certain embodiments, the “environmental perturbation”, also referred to herein as “external stimulus ent”, is a therapeutic agent. In certain embodiments, the external stimulus component is a small molecule (e.g., a small molecule of no more than kDa, 4 kDa, 3 kDa, 2 kDa, 1 kDa, 500 Dalton, or 250 Dalton). In certain ments, the external stimulus component is a biologic. In certain embodiments, the external stimulus component is a chemical. In certain embodiments, the external stimulus component is endogenous or exogenous to cells. In certain embodiments, the external stimulus component is a MIM or epishifter. In certain embodiments, the external stimulus component is a stress factor for the cell system, such as hypoxia, hyperglycemia, hyperlipidemia, hyperinsulinemia, and/or lactic acid rich ions.
In certain embodiments, the external stimulus ent may include a therapeutic agent or a candidate therapeutic agent for treating a drug-induced toxicity, including chemotherapeutic agent, protein-based biological drugs, antibodies, fusion ns, small molecule drugs, lipids, polysaccharides, nucleic acids, etc.
In certain embodiments, the external stimulus component may be one or more stress factors, such as those typically encountered in viva under the various drug-induced toxicities, including hypoxia, hyperglycemic conditions, acidic environment (that may be mimicked by lactic acid treatment), etc.
In other ments, the external stimulus component may include one or more MIMs and/or epishifters, as defined herein below. Exemplary MIMs include Coenzyme Q10 (also referred to herein as CleO) and compounds in the Vitamin B family, or nucleosides, mononucleotides or dinucleotides that comprise a compound in the Vitamin B family.
In making cellular output measurements (such as protein expression), either absolute amount (e. g., expression amount) or relative level (e.g., relative expression level) may be used. In one embodiment, absolute amounts (e.g., expression amounts) are used. In one embodiment, relative levels or amounts (e. g., relative expression ) are used. For example, to determine the relative protein expression level of a cell system, the amount of any given protein in the cell system, with or without the external stimulus to the cell system, may be ed to a suitable control cell line or mixture of cell lines (such as all cells used in the same experiment) and given a fold-increase or fold-decrease value. The d person will iate that absolute amounts or relative amounts can be ed in any cellular output measurement, such as gene and/or RNA transcription level, level of lipid, level of metabolite, or any onal output, 6. g., level of apoptosis, level of toxicity, level of enzyme (e.g., kinase) activity, or ECAR or OCR as bed herein. A pre-determined threshold level for a fold-increase (e.g., at least 1.2,1.3,1.4,1.5,1.6,1.7,1.8,1.9, 2, 2.5, 3, 3.5, 4, 4.5, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, , 40, 45, 50, 75 or 100 or more fold increase) or ecrease (e.g., at least a decrease to 0.9, 0.8, 0.75, 0.7, 0.6, 0.5, 0.45, 0.4, 0.35, 0.3, 0.25, 0.2, 0.15, 0.1 or 0.05 fold, or a decrease to 90%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, %, 15%, 10% or 5% or less) may be used to select icant differentials, and the cellular output data for the significant differentials may then be ed in the data sets (e. g., first and second data sets) utilized in the platform technology methods of the invention. All values presented in the foregoing list can also be the upper or lower limit of , e. g., between 1.5 and 5 fold, 5 and 10 fold, 2 and 5 fold, or between 0.9 and 0.7, 0.9 and 0.5, or 0.7 and 0.3 fold, are intended to be a part of this invention.
Throughout the present application, all values presented in a list, e.g., such as those above, can also be the upper or lower limit of ranges that are intended to be a part of this invention.
In one embodiment of the methods of the invention, not every observed causal relationship in a causal relationship network may be of biological significance. With respect to any given drug-induced toxicity for which the subject interrogative biological assessment is applied, some (or maybe all) of the causal onships (and the genes associated therewith) may be minative” with respect to the ic biological problem at issue, e.g., either sible for causing a drug—induced toxicity (a potential target for therapeutic intervention) or is a biomarker for the nduced toxicity (a potential diagnostic or prognostic factor). In one embodiment, an observed causal relationship unique in the drug-induced toxicity is inative with t to the specific biological problem at issue. In one embodiment, not every observed causal onship unique in the drug—induced toxicity is determinative with respect to the specific problem at issue.
Such determinative causal relationships may be selected by an end user of the subject method, or it may be selected by a bioinformatics software program, such as REFS, DAVID-enabled comparative pathway analysis program, or the KEGG pathway analysis program. In certain embodiments, more than one bioinformatics software program is used, and consensus s from two or more bioinformatics software programs are preferred.
As used herein, “differentials” of ar outputs include differences (6. g., increased or decreased levels) in any one or more parameters of the cellular outputs. In n embodiments, the differentials are each independently selected from the group consisting of differentials in mRNA transcription, protein expression, lipid expression, protein activity, kinase ty, lite / intermediate level, and/or ligand-target interaction. For example, in terms of protein expression level, differentials between two cellular outputs, such as the outputs associated with a cell system before and after the treatment by an external stimulus component, can be measured and quantitated by using art-recognized technologies, such as mass—spectrometry based assays (e. g., iTRAQ, 2D- LC-MSMS, etc.) In one , the cell model for a drug-induced toxicity comprises a cellular cross-talking system, wherein a first cell system having a first cellular environment with an external stimulus component generates a first modified cellular environment; such that a talking cell system is established by exposing a second cell system having a second cellular environment to the first modified cellular environment.
In one embodiment, at least one significant cellular cross-talking differential from the cross-talking cell system is generated; and at least one determinative cellular cross-talking differential is identified such that an interrogative biological assessment occurs. In certain embodiments, the at least one significant cellular cross-talking differential is a plurality of differentials.
In certain embodiments, the at least one determinative ar cross-talking differential is ed by the end user. Alternatively, in another embodiment, the at least one determinative cellular talking differential is selected by a bioinformatics software m (such as, e.g., REFS, KEGG pathway analysis or DAVID-enabled comparative pathway analysis) based on the quantitative proteomics data.
In certain embodiments, the method further comprises generating a significant cellular output differential for the first cell system.
In certain embodiments, the differentials are each independently selected from the group consisting of differentials in mRNA transcription, protein expression, lipid expression, n activity, metabolite / intermediate level, and/or ligand-target interaction.
In certain embodiments, the first cell system and the second cell system are independently selected from: a homogeneous population of primary cells, a drug- induced toxicity related cell line, or a normal cell line.
In certain embodiments, the first ed cellular environment comprises factors ed by the first cell system into the first cellular environment, as a result of contacting the first cell system with the external stimulus component. The factors may comprise secreted proteins or other signaling molecules. In n embodiments, the first modified cellular environment is substantially free of the original external stimulus component.
In certain embodiments, the cross-talking cell system comprises a transwell having an insert compartment and a well compartment separated by a membrane. For e, the first cell system may grow in the insert compartment (or the well tment), and the second cell system may grow in the well compartment (or the insert compartment).
In certain ments, the cross-talking cell system comprises a first culture for growing the first cell system, and a second e for g the second cell system.
In this case, the first modified cellular environment may be a conditioned medium from the first cell system.
In n embodiments, the first cellular environment and the second cellular environment can be identical. In certain embodiments, the first cellular environment and the second cellular environment can be different.
In certain ments, the cross—talking cell system comprises a co-culture of the first cell system and the second cell system.
The methods of the invention may be used for, or d to, any number of “interrogative biological assessments.” Application of the methods of the invention to an interrogative biological assessment allows for the identification of one or more modulators of a drug-induced toxicity or determinative cellular process “drivers” of a drug-induced toxicity.
In one embodiment, the interrogative biological assessment is the assessment of the toxicological profile of an agent, e.g., a drug, on a cell, tissue, organ or organism, wherein the identified modulators of drug—induced ty, e. g., determinative cellular process driver (e. g., cellular cross-talk differentials or causal relationships unique in nduced toxicity) may be tors of toxicity, e.g., cytotoxicity, cardiotoxicity, hepatotoxicity, nephrotoxicity, oxicity, renaltoxicity or myotoxicity, and may in turn be used to predict or identify the toxicological e of the agent. In one embodiment, the identified modulators of a drug-induced toxicity, e. g., determinative cellular process driver (e.g., cellular cross-talk entials or causal relationships unique in a drug-induced toxicity) is an indicator of cardiotoxicity of a drug or drug candidate, and may in turn be used to predict or identify the toxicological profile of the drug or drug candidate.
In another aspect, the invention provides a kit for conducting an interrogative biological assessment using a discovery rm Technology, comprising one or more reagents for detecting the ce of, and/or for quantitating the amount of, an analyte that is the subject of a causal relationship network generated from the methods of the invention. In one embodiment, said analyte is the subject of a unique causal relationship in the drug-induced toxicity, e.g., a gene associated with a unique causal relationhip in the drug-induced toxicity. In certain embodiments, the analyte is a n, and the reagents comprise an antibody against the protein, a label for the protein, and/or one or more agents for preparing the protein for high throughput analysis (e.g., mass spectrometry based sequencing).
It should be tood that all embodiments described herein, including those described only in es, are parts of the general description of the invention, and can be combined with any other embodiments of the invention unless explicitly disclaimed or inapplicable. escription ofthe Drawings Various embodiments of the present disclosure will be described herein below with reference to the figures wherein: Figure 1: Illustration of ch to identify therapeutics.
Figure 2: Illustration of systems biology of cancer and consequence of integrated multi-physiological interactive output tion.
Figure 3: Illustration of systematic interrogation of biological relevance using MIMS.
Figure 4: Illustration of modeling cancer network to enable interrogative ical query.
Figure 5: Illustration of the interrogative biology platform technology.
Figure 6: Illustration of technologies employed in the platform technology.
Figure 7: Schematic representation of the components of the rm ing data tion, data integration, and data mining.
Figure 8: Schematic representation of the systematic interrogation using MIMS and collection of se data from the “omics” cascade.
Figure 9: Sketch of the components employed to build the in vitro models representing normal and diabetic states.
Figure 10: Schematic entation of the informatics platform REFSTM used to generate causal networks of the protein as they relate to disease pathophysiology.
Figure 11: Schematic representation of the approach towards generation of differential network in diabetic versus normal states and diabetic nodes that are restored to normal states by treatment with MIMS.
Figure 12: A representative differential network in diabetic versus normal states.
Figure 13: A schematic representation of a node and associated edges of st (Nodelin the center). The cellular functionality associated with each edge is represented.
Figure 14: High level flow chart of an exemplary method, in accordance with some embodiments.
Figure ISA-15D: High level tic illustration of the components and process for an AI-based informatics system that may be used with exemplary embodiments.
Figure 16: Flow chart of process in ed informatics system that may be used with some ary embodiments.
Figure 17: Schematically depicts an exemplary computing environment suitable for practicing ary embodiments taught .
Figure 18: Illustration of the mathematical approach s generation of delta- delta networks.
Figure 19: A schematic representing experimental design and modeling parameters used to study drug induced toxicity in diabetic cardiomyocytes.
Figure 20: Dysregulation of transcriptional network and expression of human mitochondrial energy metabolism genes in diabetic cardiomyocytes by drug treatment (T): rescue molecule (R) normalizes gene expression.
Figure 21: A. Drug treatment (T) induced expression of GPATl and TAZ in ondria from cardiomyocytes conditioned in hyerglycemia. In combination with the rescue molecule (T+R) the levels of GPATl and TAZ were normalized. B.
Synthesis of TAG from G3P.
Figure 22: A. Drug treatment (T) decreases mitochondrial OCR (oxygen consumption rate) in cardiomyocytes conditioned in hyperglycemia. The rescue molecule (T+R) normalizes OCR. B. Drug treatment (T) represses mitochondrial ATP synthesis in cardiomyocytes conditioned in hyperglycemia.
Figure 23: GO Annotation of proteins down regulated by drug treatment. ns involved in mitochondrial energy metabolism were down regulated with drug treatment.
Figure 24: Illustration of the mathematical ch towards generation of delta ks. Compare unique edges from T versus UT both the models being in diabetic environment.
Figure 25: A tic representing potential protein hubs and ks that drive pathophysiology of drug induced toxicity.
Figure 26: Schematic representation of the Interrogative biology rm.
Figure 27: Illustration of cellular functional models, data integration and mathematical model Building.
Figure 28: Causal molecular interaction network that drives pathophysiology of drug-induced toxicity.
Figure 29: Causal lar interaction sub-network of PTX3 as the central hub that drives pathophysiology of drug-induced toxicity.
Figure 30: Mitochondria ATP synthesis capacity of cardiomyocutes in normal glucose and high glucose ions.
Figure 31: Causal molecular interaction network of ATP drivers.
Figure 32: Causal molecular ction sub—network of ATP drivers with P4HB as the central hub.
Figure 33: Unique edges of causal molecular interaction sub-network of ATP drivers with P4HB as the central hub.
Figure 34: Illustration of functional toxicomics: omics ation.
Attached herewith, as in Appendix A, are the sequences of all biomarkers referenced herein. All of the information associated with the Gene Bank accession numbers listed in Appendix A and through this application are incorporated herein by reference in the verions available on the filing date of this application.
Detailed Description ofthe Invention I. Overview Exemplary embodiments of the present invention incorporate methods that may be performed using an interrogative biology platform (“the Platform”) that is a tool for understanding a wide variety of drug—induced toxicities, such as cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity or myotoxicity, and the key molecular drivers underlying such drug-induced toxicities, including factors that enable a drug-induced toxicity. Some exemplary ments include systems that may incorporate at least a portion of, or all of, the Platform. Some exemplary methods may employ at least some of, or all of the Platform. Goals and objectives of some exemplary embodiments ing the platform are generally outlined below for rative purposes: i) to create specific molecular signatures as drivers of critical components of the drug-induced toxicity as they relate to overall pathophysiology of the nt cells, tissues, and/or organs; ii) to generate lar signatures or differential maps pertaining to the drug-induced toxicity, which may help to identify differential lar signatures that distinguishes one biological state (e.g., a drug—induced toxicity state) versus a different biological stage (e. g., a normal state), and develop understanding of signatures or molecular entities as they arbitrate mechanisms of change between the two biological states (e. g., from normal to drug-induced toxicity state); and, iii) to investigate the role of “hubs” of molecular activity as potential intervention targets for al control of the nduced toxicity (e. g., to use the hub as a potential therapeutic target), or as potential bio-markers for the nduced toxicity in question (e. g., drug-induced toxicity ic biomarkers, in prognostic and/or theranostics uses).
Some exemplary methods involving the Drug-induced Toxicity Platform may include one or more of the following features: 1) modeling the nduced toxicities (e.g., cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity) and/or components of the nduced toxicity (e.g., physiology & hysiology associated with toxicities) in one or more models, preferably in vitro models, using cells associated with the drug- induced toxicity. For example, the cells may be human derived cells which normally participate in the drug-induced toxicity in question (e.g., heat muscle cells involved in cardiotoxicity). The model may include various cellular cues / conditions / perturbations that are specific to the drug-induced ty. Ideally, the model represents various drug- induced toxicity states and flux components, instead of a static assessment of the drug- induced toxicity condition. 2) profiling mRNA and/or n signatures using any art-recognized means. For example, quantitative polymerase chain reaction (qPCR) & proteomics analysis tools such as Mass Spectrometry (MS). Such mRNA and n data sets represent biological reaction to environment / perturbation. Where applicable and possible, mics, metabolomics, and transcriptomics data may also be ated as supplemental or alternative measures for the drug-induced toxicity in question. SNP analysis is another component that may be used at times in the process. It may be helpful for investigating, for example, r the SNP or a specific mutation has any effect on the drug-induced toxicity. These variables may be used to describe the drug- induced toxicity, either as a static “snapshot,” or as a representation of a dynamic process. 3) assaying for one or more functional activities or ar responses to cues and bations, including but not limited to bioenergetics, cell proliferation, apoptosis, and organellar function. True pe—phenotype association is actualized by employment of functional models, such as ATP, ROS, OXPHOS, Seahorse assays, etc. Such functional activities can involve global enzyme activity, such as kinase activity, and/or effects of global enzyme activity or the enzyme metabolites or substrates in the cells, e. g., the phosphor proteome of the cells. Such cellular responses represent the reaction of the cells in the drug-induced toxicity process (or models thereof) in response to the corresponding drug—induced toxicity state(s) of the mRNA / protein expression, and any other related states in 2) above. 4) integrating functional assay data thus obtained in 3) with mics and other data obtained in 2), and determining protein, gene, lipid, enzyme activity and other functional acitivity associations as driven by causality, by employing artificial intelligence based (AI-based) atics system or platform. Such an ed system is based on, and preferably based only on, the data sets obtained in 2) and/or 3), without resorting to existing knowledge concerning the drug—induced toxicity process.
Preferably, no data points are statistically or artificially cut-off. Instead, all obtained data is fed into the AI-system for determining protein, gene, lipid, enzyme activity and other functional acitivity ations. One goal or output of the integration process is one or more differential networks (otherwise may be referred to herein as “delta networks,” or, in some cases, “delta—delta networks” as the case may be) between the ent biological states (e.g., nduced toxicity vs. normal states). ) profiling the s from the AI-based informatics platform to explore each hub of activity as a potential therapeutic target and/or biomarker. Such profiling can be done entirely in silico based on the obtained data sets, without resorting to any actual wet-lab experiments. 6) validating hub of activity by employing molecular and cellular techniques. Such post-informatic validation of output with wet-lab ased experiments may be optional, but they help to create a full—circle of interrogation.
Any or all of the approaches outlined above may be used in any specific application concerning any nduced toxicity, depending, at least in part, on the nature of the specific ation. That is, one or more ches outlined above may be omitted or modified, and one or more additional approaches may be employed, depending on specific application.
Various schematics illustrating the platform are provided. In ular, an illustration of an exemplary ch to fy therapeutics using the platform is depicted in Figure 1. An illustration of systems biology of cancer and the consequence of integrated multi-physiological interactive output regulation is depicted in Figure 2.
An illustration of a systematic interrogation of biological relevance using MIMS is depicted in Figure 3. An illustration of modeling a cancer network to enable an interrogative biological query is ed in Figure 4.
Illustrations of the interrogative biology rm and technologies employed in the platform are depicted in Figures 5 and 6. A schematic representation of the components of the platform including data collection, data integration, and data mining is depicted in Figure 7. A schematic representation of a systematic interrogation using MIMS and tion of response data from the “omics” cascade is depicted in Figure 8.
Figure 14 is a high level flow chart of an ary method 10, in which components of an exemplary system that may be used to perform the exemplary method are indicated. Initially, a model (e.g., an in vitro model) is established for a biological process (e. g., a drug-induced toxicityprocess) and/or components of the ical process (e. g., drug-induced toxicity physiology and pathophysiology) using cells normally associated with the process (step 12). For example, the cells may be human- derived cells that normally ipate in the biological process (e. g., drug-induced toxicity). The cell model may include various cellular cues, conditions, and/or perturbations that are specific to the biological process (e. g., drug-induced toxicity).
Ideally, the cell model represents various (drug-induced toxicity) states and flux components of the biological process (e.g., drug-induced toxicity), instead of a static assessment of the ical process. The comparison cell model may include control cells or normal cells, e. g., cells not exposed to a drug which induces toxicity. Additional description of the cell models appears below in sections III.A and IV.
A first data set is obtained from the cell model for the biological s (e. g. drug-induced toxicity), which includes information representing, by way of example, expression levels of a ity of genes (e.g., mRNA and/or protein signatures) (step 16) using any known process or system (e.g., quantitative polymerase chain reaction (qPCR) & proteomics analysis tools such as Mass Spectrometry (M8)).
A third data set is obtained from the comparison cell model for the biological s (e. g. drug-induced toxicity) (step 18). The third data set includes information representing, e. g., expression levels of a plurality of genes in the comparison cells from the comparison cell model.
In certain ments of the methods of the invention, these first and third data sets are collectively referred to herein as a “first data set” that represents, e.g., expression levels of a plurality of genes in the cells (all cells including comparison cells) associated with the biological system (e.g. drug-induced toxicity model).
The first data set and third data set may be obtained from one or more mRNA and/or Protein Signature Analysis System(s). The mRNA and protein data in the first and third data sets may ent biological reactions to environment and/or perturbation. Where applicable and possible, lipidomics, metabolomics, and transcriptomics data may also be integrated into the first data set as supplemental or alternative es for the biological process (e. g. nduced toxicity). The SNP analysis is another component that may be used at times in the process. It may be helpful for investigating, for example, whether a -nucleotide polymorphism (SNP) or a specific mutation has any effect on the ical process (e.g. drug—induced toxicity). The data variables may be used to describe the biological process (e.g. druginduced toxicity) either as a static “snapshot,” or as a representation of a dynamic process. Additional description regarding obtaining information representing sion levels of a plurality of genes in cells appears below in section 111B.
A second data set is obtained from the cell model for the ical process (e. g. drug-induced ty), which includes ation representing a functional activity or response of cells (step 20). Similarly, a fourth data set is ed from the comparison cell model for the biological process (e.g. drug-induced toxicity), which includes information enting a functional activity or response of the comparison cells (step 22).
In certain embodiments of the methods of the invention, these second and fourth data sets are collectively ed to herein as a “second data set” that represents a functional activity or a cellular response of the cells (all cells including comparison cells) associated with the biological system (e.g. drug-induced toxicity).
One or more functional assay systems may be used to obtain information regarding the functional activity or response of cells or of comparison cells. The information regarding functional cellular responses to cues and perturbations may include, but is not limited to, bioenergetics profiling, cell proliferation, apoptosis, and organellar function. Functional models for processes and pathways (e.g., adenosine triphosphate (ATP), ve oxygen species (ROS), oxidative phosphorylation (OXPHOS), Seahorse assays, etc.,) may be employed to obtain true genotype-phenotype association. Such functional activities can involve global enzyme activity, such as kinase activity, and/or effects of global enzyme activity, or the enzyme lites or substrates in the cells, e. g., the phosphor proteome of the cells. The functional activity or cellular responses represent the reaction of the cells in the biological process (or models thereof) in response to the corresponding s) of the mRNA / protein expression, and any other related applied conditions or perturbations. onal information regarding obtaining information representing functional activity or response of cells is provided below in section III.B.
The method also es generating computer-implemented models of the biological processes (e. g. nduced toxicity) in the cells and in the control cells.
For example, one or more (e.g., an ensemble of) Bayesian networks of causal relationships between the expression level of the plurality of genes and the functional activity or cellular response may be generated for the cell model (the “generated cell model ks”) from the first data set and the second data set (step 24). The generated cell model networks, dually or collectively, include quantitative probabilistic directional information regarding relationships. The generated cell model networks are not based on known biological relationships n gene expression and/or functional ty or cellular response, other than information from the first data set and second data set. The one or more generated cell model networks may collectively be referred to as a consensus cell model network.
One or more (e.g., an ensemble of) Bayesian networks of causal relationships between the expression level of the plurality of genes and the functional activity or ar response may be generated for the comparison cell model (the ated comparison cell model networks”) from the first data set and the second data set (step 26). The generated comparison cell model networks, individually or collectively, include quantitative probabilistic directional information regarding relationships. The generated cell networks are not based on known ical relationships between gene expression and/or functional activity or cellular response, other than the information in the first data set and the second data set. The one or more generated comparison model networks may collectively be refered to as a consensus cell model network.
The generated cell model networks and the generated comparison cell model networks may be created using an artificial intelligence based (AI-based) informatics platform. Further details regarding the creation of the generated cell model networks, the creation of the generated comparison cell model networks and the ed informatics system appear below in section III.C and in the ption of Figures 2A-3.
It should be noted that many different AI—based platforms or systems may be employed to generate the Bayesian ks of causal onships including tative probabilistic directional ation. Although certain examples described herein employ one specific commercially available system, i.e., REFSTM se Engineering/Forward Simulation) from GNS (Cambridge, MA), embodiments are not d. AI—Based Systems or Platforms suitable to ent some embodiments employ mathematical algorithms to establish causal relationships among the input variables (e.g., the first and second data sets), based only on the input data without taking into consideration prior existing knowledge about any potential, established, and/or verified biological onships.
For example, the REFSTM AI-based informatics platform utilizes experimentally derived raw (original) or minimally processed input ical data (e. g., genetic, genomic, epigenetic, proteomic, metabolomic, and clinical data), and rapidly ms trillions of calculations to determine how molecules interact with one another in a complete system. The REFSTM AI—based informatics platform performs a reverse engineering process aimed at creating an in silico computer-implemented cell model (e. g., generated cell model ks), based on the input data, that quantitatively represents the underlying biological system (e.g. drug-induced toxicity). Further, hypotheses about the underlying biological system can be developed and rapidly ted based on the computer-implemented cell model, in order to obtain predictions, accompanied by associated confidence levels, regarding the hypotheses.
With this approach, biological systems are represented by quantitative computer- ented cell models in which “interventions” are simulated to learn detailed mechanisms of the biological system (e.g., drug-induced toxicity), effective intervention strategies, and/or clinical biomarkers that determine which patients will respond to a given treatment regimen. Conventional bioinformatics and statistical approaches, as well as approaches based on the modeling of known biology, are typically unable to e these types of insights.
After the generated cell model networks and the generated comparison cell model networks are created, they are compared. One or more causal relationships present in at least some of the ted cell model networks, and absent from, or having at least one icantly different parameter in, the generated comparison cell model networks are identified (step 28). Such a comparison may result in the creation of a ential k. The comparison, identification, and/or differential ) network creation may be conducted using a differential network creation module, which is described in further detail below in section 111D and with respect to the description of Figure 18.
In some embodiments, input data sets are from one cell type and one comparison cell type, which creates an ensemble of cell model networks based on the one cell type and another ensemble of comparison cell model networks based on the one comparison control cell type. A differential may be performed between the le of networks of the one cell type and the ensemble of networks of the comparison cell type(s).
In other embodiments, input data sets are from multiple cell types (e. g., two or more cell types that are normally associated with the particular type of drug-induced toxicity and multiple ison cell types (e.g., two or more normal cell types, e. g., same cells which are not exposed to the drug). An ensemble of cell model networks may be ted for each cell types and each comparison cell type individually, and/or data from the multiple cell types and the multiple comparison cell types may be combined into respective composite data sets. The ite data sets produce an ensemble of networks corresponding to the multiple cell types (composite data) and r ensemble of networks ponding to the multiple comparison cell types (comparison composite data). A differential may be performed on the ensemble of networks for the composite data as compared to the ensemble of networks for the comparison composite data.
In some embodiments, a differential may be performed between two ent differential ks. This output may be referred to as a delta network, and is described below with respect to Figure 18.
Quantitative relationship information may be identified for each relationship in the generated cell model networks (step 30). Similarly, quantitative relationship information for each relationship in the generated ison cell model networks may be fied (step 32). The quantitative information regarding the relationship may include a direction indicating causality, a measure of the statistical uncertainty ing the relationship (e. g., an Area Under the Curve (AUC) statistical measurement), and/or an sion of the quantitative magnitude of the strength of the onship (e.g., a fold). The various relationships in the generated cell model networks may be profiled using the quantitative relationship information to explore each hub of activity in the networks as a ial therapeutic target and/or biomarker. Such profiling can be done entirely in silico based on the results from the ted cell model networks, without resorting to any actual wet-lab experiments.
In some embodiments, a hub of activity in the networks may be validated by employing molecular and cellular techniques. Such post-informatic validation of output with wet-lab cell based experiments need not be performed, but it may help to create a full-circle of interrogationFigure 15 schematically depicts a simplified high level representation of the functionality of an exemplary AI—based informatics system (e. g., REFSTM AI-based informatics system) and interactions between the AI-based system and other elements or portions of an interrogative biology platform (“the Platform”). In Figure 15A, various data sets obtained from a model for a biological s (e. g., a drug-induced toxicity model), such as drug , treatment dosage, protein expression, mRNA expression, lipid levels, metabolite , kinase activity and any of many other associated functional measures (such as OCR, ECAR) are fed into an AI- based system. As shown in Figure 15B, from the input data sets, the AI-system creates a library of “network fragments” that includes variables (e.g., proteins, lipids, kinases and metabolites) that drive molecular isms in the biological s (e. g., drug- induced toxicity), in a process referred to as Bayesian Fragment Enumeration (Figure 1513).
In Figure 15C, the AI-based system s a subset of the network fragments in the library and constructs an initial trial network from the fragments. The AI-based system also selects a different subset of the network fragments in the library to uct another initial trial network. Eventually an ensemble of initial trial networks are created (e.g., 1000 networks) from different subsets of network fragments in the library. This process may be termed parallel ensemble sampling. Each trial network in the ensemble is evolved or optimized by adding, subtracting and/or substitution additional network fragments from the library. If onal data is obtained, the additional data may be incorporated into the network fragments in the library and may be incorporated into the ensemble of trial networks through the evolution of each trial k. After completion of the zationievolution process, the ensemble of trial networks may be described as the generated cell model networks.
As shown in Figure 15D, the ensemble of generated cell model networks may be used to simulate the behavior of the ical system (e. g. drug-induced toxicity). The simulation may be used to predict behavior of the biological system (e.g. drug-induced toxicity) to changes in conditions, which may be experimentally verified using wet-lab cell-based, or animal-based, experiments. Also, tative parameters of relationships in the ted cell model networks may be extracted using the simulation functionality by applying simulated perturbations to each node individually while ing the effects on the other nodes in the generated cell model neworks. Further detail is provided below in section III.C.
The automated reverse engineering process of the AI-based informatics system, which is depicted in Figures 2A—2D, creates an ensemble of generated cell model networks ks that is an unbiased and systematic computer-based model of the cells.
The reverse ering determines the probabilistic directional k connections between the molecular measurements in the data, and the phenotypic outcomes of interest. The ion in the molecular measurements enables learning of the probabilistic cause and effect relationships between these entities and changes in endpoints. The machine learning nature of the platform also enables cross training and predictions based on a data set that is constantly evolving.
The network tions between the molecular measurements in the data are “probabilistic,” partly because the connection may be based on correlations between the observed data sets “learned” by the computer algorithm. For example, if the expression level of protein X and that of protein Y are vely or negatively ated, based on statistical is of the data set, a causal onship may be assigned to establish a k connection between proteins X and Y. The reliability of such a putative causal relationship may be r defined by a likelihood of the connection, which can be measured by p-value (e.g., p < 0.1, 0.05, 0.01, etc).
The network connections between the molecular measurements in the data are “directional,” partly because the k connections between the molecular measurements, as determined by the reverse—engineering process, reflects the cause and effect of the relationship between the connected gene / protein, such that raising the expression level of one protein may cause the expression level of the other to rise or fall, depending on whether the connection is stimulatory or inhibitory.
The network tions between the molecular ements in the data are “quantitative,” partly because the network connections n the molecular measurements, as determined by the process, may be simulated in silico, based on the existing data set and the probabilistic measures associated therewith. For example, in the established network connections between the molecular measurements, it may be possible to theoretically increase or decrease (e.g., by l, 2, 3, 5, 10, 20, 30, 50,100-fold or more) the expression level of a given protein (or a “node” in the network), and quantitatively simulate its effects on other connected proteins in the network.
The network connections between the molecular measurements in the data are “unbiased,” at least partly because no data points are tically or artificially cut-off, and partly because the network connections are based on input data alone, without referring to pre-existing knowledge about the biological process in question.
The network connections n the molecular ements in the data are “systemic” and (unbiased), partly because all potential connections among all input variables have been systemically explored, for example, in a pair-wise fashion. The reliance on computing power to execute such systemic probing exponentially ses as the number of input variables increases.
In general, an le of ~l,000 networks is usually sufficient to predict probabilistic causal quantitative relationships among all of the ed entities. The ensemble of networks captures uncertainty in the data and enables the calculation of confidence metrics for each model prediction. Predictions generated using the ensemble of networks together, where differences in the predictions from individual networks in the ensemble ent the degree of uncertainty in the prediction. This feature enables the ment of ence metrics for predictions of clinical response generated from the model.
Once the models are reverse-engineered, further simulation queries may be conducted on the ensemble of models to determine key molecular drivers for the biological process in question, such as a drug-induced ty condition.
Sketch of components employed to build examplary In vitro models representing normal and diabetic statesis is depicted in Figure 9. Schematic representation of an examplary informatics platform REFSTM used to generate causal networks of the protein as they relate to disease pathophysiology is depicted in Figure 10. Schematic representation of examplary approach s generation of differential k in diabetic versus normal states and diabetic nodes that are restored to normal states by treatment with MIMS is depicted in Figure 11. A representative differential network in diabetic versus normal states is depicted in Figure 12. A schematic representation of a node and associated edges of interest (Nodel in the center) and the cellular functionality associated with each edge is depicted in Figure 13.
The invention having been generally described above, the sections below provide more detailed description for s aspects or elements of the general ion, in conjunction with one or more specific biological systems (e.g. drug-induced toxicity) that can be analyzed using the methods herein. It should be noted, however, the specific drug-induced toxicity used for illustration purpose below are not limiting. To the contrary, it is intended that other distinct drug—induced ties, including any alternatives, cations, and equivalents thereof, may be analyzed rly using the subject Platform technology.
II. Definitions As used herein, certain terms intended to be specifically defined, but are not already defined in other sections of the ication, are defined herein.
The articles “a” and “an” are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of example, “an t” means one element or more than one element.
The term “including” is used herein to mean, and is used interchangeably with, the phrase “including but not limited to.” The term “or” is used herein to mean, and is used interchangeably with, the term “and/or,” unless t clearly tes otherwise.
The term “such as” is used herein to mean, and is used interchangeably, with the phrase “such as but not limited to.” “Metabolic pathway” refers to a sequence of enzyme-mediated reactions that transform one nd to another and provide intermediates and energy for cellular ons. The metabolic pathway can be linear or cyclic or branched.
“Metabolic state” refers to the molecular content of a particular cellular, multicellular or tissue environment at a given point in time as measured by various chemical and biological indicators as they relate to a state of health or disease.
The term “microarray” refers to an array of distinct polynucleotides, oligonucleotides, polypeptides (e.g., dies) or peptides synthesized on a substrate, such as paper, nylon or other type of membrane, filter, chip, glass slide, or any other suitable solid support.
The terms “disorders” and “diseases” are used inclusively and refer to any deviation from the normal structure or function of any part, organ or system of the body (or any combination thereof). A specific disease is manifested by characteristic symptoms and signs, including biological, chemical and physical changes, and is often associated with a variety of other factors including, but not limited to, demographic, nmental, ment, genetic and medically historical factors. n characteristic signs, symptoms, and related factors can be quantitated through a variety of methods to yield important diagnostic information.
The term “drug-induced toxicity” es but is not limited to cardiotoxicity, hepatotoxicity, hephrotoxicity, neurotoxicity, renaltoxicity or myotoxicity.
The term “cardiotoxicity” refers to a broad range of adverse s on heart function induced by eutic molecules. It may emerge early in pre-clinical studies or become apparent later in the clinical setting. Cardiovascular toxicity described herein includes, but is not limited to, any one or more of increased QT duration, arrhythmias, myocardial ischemia, hypertension and thromboembolic complications, myocardial dysfunction, cardiomyopathy, heart failure, atrial fibrillation, cardiomyopathy and heart e, heart e and LV ction, atrial flutter and fibrillation, and, heart valve damage and heart failure.
The term “expression” includes the process by which a polypeptide is produced from polynucleotides, such as DNA. The process may involves the transcription of a gene into mRNA and the ation of this mRNA into a polypeptide. Depending on the context in which it is used, “expression” may refer to the production of RNA, protein or both.
The terms “level of expression of a gene” or “gene expression level” refer to the level of mRNA, as well as pre-mRNA nascent transcript(s), transcript processing intermediates, mature mRNA(s) and degradation products, or the level of protein, encoded by the gene in the cell.
The term “modulation” refers to upregulation (i.e., activation or stimulation), downregulation (i.e., inhibition or suppression) of a response, or the two in combination or apart. A “modulator” is a compound or molecule that modulates, and may be, e.g., an agonist, antagonist, activator, stimulator, ssor, or inhibitor.
“Normal level” of a protein, a lipid, a transcript, a metabolite, or gene expression refers to the level of the protein, lipid, transcript, metabolite, or gene expression prior to contacting the cells with the drug with the potentially toxic drug. A “normal level” can be determined in cells grown under various conditions, e. g., hyperglycemia, hypoxia, if the toxicity of the drug is to be tested under the same conditions.
“Modulated level” refers to a changed value relative to the normal level which is based on historical normal control samples or preferably normal control s tested in the same experiment. The specific “normal” value will depend, for example, on the type of assay (e. g., ELISA, enzyme ty, immunohistochemistry, PCR), the sample to be tested (e. g., cell type and culture conditions), and other considerations known to those of skill in the art. l samples can be used to define cut-offs between normal and abnormal.
A drug is considered to be toxic if treatment of cells with the drug s in a statistically significant change in the level of at least one marker relative to a “normal” or appropriate control level. It is understood that not all concentrations of a drug must result in a statistically significant change in the level of the at least one marker. In a preferred ment, a drug is considered to potentially have toxicities if a therapeutically relevant concentration of the drug results in a tically significant change in the level of at least on marker.
A “rescue agent” is considered to be effective in reducing toxicity if the level of the marker is modulated in a statistically significant manner towards the marker level in the “normal cells” when the rescue agent is present at a therapeutically relevant concentration. In a preferred embodiment, the rescue agent returns the marker to a level that is not tically different from the level of the marker in the control cells.
The term “control level” refers to an ed or pre-determined level of a marker, or preferably the marker level determined in a control sample tested in parallel with the test sample, which is used to compare with the level of a marker in a sample derived from cells not treated with the potentially toxic drug or rescue agent. A “control level” is obtained from cells that are cultured under the same conditions, e.g., hypoxia, hyperglycemia, lactic acid, etc.
The term “Trolamine,” as used herein, refers to Trolamine NF, Triethanolamine, TEALAN®, TEAlan 99%, anolamine, 99%, anolamine, NF or Triethanolamine, 99%, NF. These terms may be used interchangeably herein.
The term “genome” refers to the entirety of a biological entity’s (cell, tissue, organ, system, organism) genetic information. It is encoded either in DNA or RNA (in certain viruses, for example). The genome includes both the genes and the non-coding sequences of the DNA.
The term ome” refers to the entire set of proteins expressed by a genome, a cell, a , or an sm at a given time. More specifically, it may refer to the entire set of expressed proteins in a given type of cells or an organism at a given time under defined conditions. Proteome may include protein variants due to, for example, alternative splicing of genes and/or post—translational modifications (such as ylation or phosphorylation).
The term “transcriptome” refers to the entire set of transcribed RNA molecules, including mRNA, rRNA, tRNA, microRNA and other non-coding RNA produced in one or a population of cells at a given time. The term can be applied to the total set of transcripts in a given organism, or to the specific subset of transcripts present in a particular cell type. Unlike the genome, which is roughly fixed for a given cell line (excluding mutations), the transcriptome can vary with external environmental conditions. e it includes all mRNA transcripts in the cell, the transcriptome reflects the genes that are being actively expressed at any given time, with the exception of mRNA degradation phenomena such as riptional attenuation.
The study of transcriptomics, also ed to as sion profiling, examines the expression level of mRNAs in a given cell population, often using high-throughput techniques based on DNA microarray technology.
The term “metabolome” refers to the complete set of small-molecule metabolites (such as metabolic intermediates, hormones and other signalling molecules, and ary metabolites) to be found within a biological sample, such as a single organism, at a given time under a given ion. The metabolome is dynamic, and may change from second to second.
The term “lipidome” refers to the complete set of lipids to be found within a biological sample, such as a single organism, at a given time under a given condition.
The lipidome is dynamic, and may change from second to second.
The term actome” refers to the whole set of molecular interactions in a biological system under study (e.g., cells). It can be displayed as a directed graph.
Molecular interactions can occur between molecules belonging to different biochemical es (proteins, nucleic acids, lipids, carbohydrates, etc.) and also within a given family. When spoken in terms of proteomics, interactome refers to protein-protein interaction network (PPI), or protein interaction network (PIN). Another extensively studied type of interactome is the n-DNA interactome (network formed by ription factors (and DNA or chromatin regulatory proteins) and their target genes.
The term “cellular output” includes a collection of ters, preferably measurable parameters, relating to cellullar status, including (without limiting): level of transcription for one or more genes (e.g., measurable by , qPCR, microarray, etc), level of expression for one or more proteins (e.g., measurable by mass spectrometry or Western blot), te activity (e.g., measurable as ate conversion rates) or ve activity (e.g., measurable as a % value compared to maximum activity) of one or more enzymes or proteins, level of one or more metabolites or intermediates, level of oxidative phosphorylation (e. g., able by Oxygen Consumption Rate or OCR), level of glycolysis (e.g., measurable by Extra Cellular Acidification Rate or ECAR), extent of ligand—target binding or interaction, activity of extracellular secreted molecules, etc. The cellular output may include data for a pre- determined number of target genes or proteins, etc., or may include a global assessment for all detectable genes or proteins. For example, mass spectrometry may be used to identify and/or quantitate all detectable proteins expressed in a given sample or cell population, without prior knowledge as to whether any specific protein may be expressed in the sample or cell population.
As used herein, a “cell system” includes a population of homogeneous or heterogeneous cells. The cells within the system may be growing in vivo, under the natural or physiological environment, or may be growing in vitro in, for example, controlled tissue culture environments. The cells within the system may be relatively neous (e.g., no less than 70%, 80%, 90%, 95%, 99%, 99.5%, 99.9% homogeneous), or may contain two or more cell types, such as cell types usually found to grow in close proximity in vivo, or cell types that may interact with one r in vivo through, 6. g., paracrine or other long distance inter-cellular communication. The cells within the cell system may be derived from established cell lines, including cancer cell lines, immortal cell lines, or normal cell lines, or may be primary cells or cells freshly ed from live tissues or organs.
Cells in the cell system are typically in contact with a “cellular environment” that may provide nutrients, gases (oxygen or C02, etc), chemicals, or naceous / non- proteinaceous stimulants that may define the conditions that affect cellular or.
The cellular environment may be a chemical media with defined chemical ents and/or less well-defined tissue extracts or serum components, and may include a specific pH, C02 t, pressure, and temperature under which the cells grow. Alternatively, the cellular environment may be the natural or physiological environment found in vivo for the ic cell system.
In certain embodiments, a cell environment comprises ions that simulate an aspect of a biological system or process, e.g., simulate a e state, s, or environment. Such e ions include, for example, hyperglycemia, hypoxia, or lactic-rich conditions. Numerous other such conditions are bed herein.
In certain embodiments, a cellular environment for a specific cell system also include certain cell surface features of the cell system, such as the types of receptors or ligands on the cell surface and their respective activities, the structure of carbohydrate or lipid molecules, membrane ty or fluidity, status of clustering of certain membrane proteins, etc. These cell surface features may affect the function of nearby cells, such as cells belonging to a ent cell system. In certain other embodiments, however, the cellular environment of a cell system does not include cell surface features of the cell system.
The cellular environment may be altered to become a “modified cellular environment.” Alterations may include changes (e.g., increase or decrease) in any one or more component found in the cellular environment, ing addition of one or more “external stimulus component” to the ar environment. The environmental perturbation or external stimulus component may be endogenous to the cellular environment (e. g., the cellular environment contains some levels of the stimulant, and more of the same is added to increase its level), or may be exogenous to the cellular environment (e.g., the stimulant is largely absent from the cellular nment prior to the tion). The cellular environment may further be altered by secondary changes resulting from adding the external stimulus component, since the external stimulus component may change the cellular output of the cell system, including molecules secreted into the cellular environment by the cell system.
As used herein, nal stimulus component”, also referred to herein as “environmental perturbation”, include any external physical and/or chemical stimulus that may affect cellular function. This may include any large or small organic or inorganic molecules, natural or tic chemicals, temperature shift, pH change, ion, light (UVA, UVB etc.), microwave, sonic wave, electrical t, modulated or lated magnetic fields, etc.
The term “Multidimensional Intracellular Molecule (MIM)”, is an isolated version or synthetically produced version of an endogenous molecule that is naturally produced by the body and/or is present in at least one cell of a human. A MIM is capable of entering a cell and the entry into the cell es complete or partial entry into the cell as long as the biologically active portion of the molecule wholly enters the cell. MIMs are e of inducing a signal transduction and/or gene expression mechanism within a cell. MIMs are multidimensional because the molecules have both a therapeutic and a carrier, e.g., drug delivery, effect. MIMs also are multidimensional e the molecules act one way in a disease state and a different way in a normal state. For example, in the case of CoQ—lO, administration of CoQ—10 to a melanoma cell in the presence of VEGF leads to a decreased level of B012 which, in turn, leads to a decreased oncogenic potential for the melanoma cell. In contrast, in a normal fibroblast, co-administration of CoQ-IO and VEFG has no effect on the levels of B012.
In one embodiment, a MIM is also an epi-shifter In another embodiment, a MIM is not an epi-shifter. In another embodiment, a MIM is characterized by one or more of the foregoing functions. In another embodiment, a MIM is characterized by two or more of the foregoing functions. In a further ment, a MIM is characterized by three or more of the foregoing functions. In yet another embodiment, a MIM is characterized by all of the foregoing functions. The d artisan will appreciate that a MIM of the invention is also intended to encompass a mixture of two or more endogenous molecules, wherein the mixture is characterized by one or more of the foregoing ons. The endogenous molecules in the mixture are t at a ratio such that the mixture functions as a MIM.
MIMs can be lipid based or non—lipid based les. Examples of MIMs include, but are not limited to, CleO, acetyl Co-A, yl Co-A, L-carnitine, amino acids such as, for example, tyrosine, phenylalanine, and cysteine. In one embodiment, the MIM is a small molecule. In one embodiment of the invention, the MIM is not CleO. MIMs can be routinely identified by one of skill in the art using any of the assays described in detail herein. MIMs are described in further detail in US 12/777,902 (US 2011-0110914), the entire contents of which are expressly incorporated herein by reference.
As used , an tabolic shifter” (epi-shifter) is a molecule that modulates the metabolic shift from a healthy (or normal) state to a disease state and vice versa, thereby maintaining or reestablishing cellular, tissue, organ, system and/or host health in a human. ifters are capable of effectuating ization in a tissue microenvironment. For example, an epi—shifter includes any molecule which is capable, when added to or ed from a cell, of affecting the microenvironment (e.g., the metabolic state) of a cell. The skilled artisan will appreciate that an epi-shifter of the invention is also intended to encompass a mixture of two or more molecules, wherein the e is characterized by one or more of the foregoing ons. The molecules in the mixture are present at a ratio such that the mixture functions as an epi-shifter.
Examples of ifters include, but are not d to, CoQ—10; vitamin D3; ECM components such as fibronectin; immunomodulators, such as TNFa or any of the eukins, e. g., IL-5, IL-12, IL-23; angiogenic factors; and apoptotic factors.
In one embodiment, the epi-shifter also is a MIM. In one embodiment, the epi- shifter is not CleO. Epi-shifters can be routinely identified by one of skill in the art using any of the assays described in detail herein. Epi—shifters are described in further detail in US 12/777,902 (US 2011—0110914), the entire contents of which are expressly incorporated herein by reference.
Other terms not explicitly defined in the instant application have g as would have been understood by one of ordinary skill in the art. 111. Exemplary Steps and Components of the Platform Technology For illustration purpose only, the following steps of the subject Platform Technology may be described herein below as an ary utility for integrating data obtained from a custom built drug—induced toxicity model, and for identifying novel proteins / pathways g the pathogenesis of drug-induced toxicity. Relational maps resulting from this analysis provides drug-induced toxicity ent targets, as well as diagnostic / prognostic markers associated with drug-induced toxicity. However, the subject rm Technology has general applicability for any drug-induced toxicity, and is not limited to any particular drug—induced toxicityor other specific drug-induced toxicity models.
In addition, gh the description below is presented in some portions as discrete steps, it is for illustration purpose and simplicity, and thus, in reality, it does not imply such a rigid order and/or demarcation of steps. Moreover, the steps of the invention may be performed separately, and the invention provided herein is intended to encompass each of the individual steps separately, as well as combinations of one or more (e. g., any one, two, three, four, five, six or all seven steps) steps of the subject Platform Technology, which may be carried out independently of the remaining steps.
The invention also is intended to include all aspects of the Drug-induced Toxicity Platform Technology as te ents and embodiments of the invention. For example, the ted data sets are ed to be embodiments of the invention. As further examples, the generated causal relationship ks, generated consensus causal relationship networks, and/or generated ted causal relationship ks, are also intended to be embodiments of the ion. The causal relationships identified as being unique in the drug-induced toxicity system are intended to be embodiments of the invention. Further, the custom built models for a particular drug-induced toxicity system are also intended to be embodiments of the ion. For example, custom built models for a drug—induced toxicity state or process, such as, e. g., a custom built model for toxicity (e.g., cardiotoxicity) of a drug, are also intended to be embodiments of the invention.
A. Custom Model Building The first step in the Platform Technology is the establishment of a model for a drug-induced toxicity system or process. An example of a drug-induced toxicity system or process is cardiotoxicity. As any other complicated biological process or system, cardiotoxicity is a complicated pathological condition characterized by multiple unique aspects. For example, chronic imbalance in uptake, utilization, organellar biogenesis and secretion in non-adipose tissue (heart and liver) is thought to be at the center of mitochondrial damage and dysfunction and a key player in drug induced cardiotoxicity.
To this end, a custom cardiotoxicity model comprising diabetic and normal cardiomyocytes may be established to simulate the environment of cardiotoxicity, e.g., by creating cell culture conditions closely approximating the conditions of a cadiac cell experiencing cardiotoxicity. One or more relevant types of cells may be used in the model, such as, for example, cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neural cells, renal cells, or sts.
One such “environment”, or growth stress condition, is hypoxia, a condition typically found in a number of disease states and in late stage diabetes or in cardiovascular disease due to ischemia and poor ation. Hypoxia can be induced in cells in cells using art-recognized s. For example, hypoxia can be induced by g cell systems in a Modular Incubator Chamber (MIC-101, Billups-Rothenberg Inc. Del Mar, CA), which can be flooded with an industrial gas mix ning 5% C02, 2% Oz and 93% nitrogen. Effects can be ed after a pre-determined period, 6. g., at 24 hours after hypoxia treatment, with and without additional external stimulus components (e.g., CleO at 0, 50, or 100 HM).
Likewise, lactic acid treatment of cells mimics a cellular environment where glycolysis activity is high. Lactic acid induced stress can be igated at a final lactic acid concentration of about 12.5 mM at a pre-determined time, e.g., at 24 hours, with or without additional external stimulus components (e.g., CleO at 0, 50, or 100 uM).
Hyperglycemia is normally a condition found in diabetes. As high glucose is known to alter cellular metabolism, agents for the treatment of diabetes can be tested in cells cultured under hyperglycemic conditions. Exposing subject cells to a l hyperglycemic condition may include adding 10% culture grade glucose to suitable media, such that the final tration of e in the media is about 22 mM.
However, as subjects with type 2 diabetes, are ntly overweight or obese, they are frequently treated for other diseases or conditions with other agents, e. g., arthritis with anti-inflammatory , cardiovascular disease with cholesterol lowering, blood pressure lowering, or blood thinning agents. Thus, custom built models can be used to assess drug toxicity in normal subjects as compared to subjects to be treated for a first condition with a first agent that also have other diseases or conditions. For example, cells not exposed or exposed to hyperglycemic conditions can be tested together to detect differential toxicities of agents in subjects with or without diabetes.
Hyperlipidemia is a condition found, for e, in obesity and cardiovascular disease. Hyperlipidemia is also a condition which mimics one aspect of cardiotoxicity.
The hyperlipidemic conditions can be provided by culturing cells in media containing 0.15 mM sodium palmitate.
Individual conditions ing different aspects of toxicity may be investigated separately in the custom built toxicity model, and/or may be combined together. In one embodiment, combinations of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50 or more ions reflecting or simulating different aspects of toxicity ions are investigated in the custom built toxicity model. In one embodiment, individual conditions and, in addition, combinations of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, , 40, 50 or more of the conditions reflecting or simulating different aspects of toxicity conditions are investigated in the custom built toxicity model. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, 1 and 10, l and 20, l and 30, 2 and 5, 2 and , 5 and 10, 1 and 20, 5 and 20, 10 and 20, 10 and 25, 10 and 30 or 10 and 50 different conditions.
Listed herein below are a few exemplary combinations of ions that can be used to treat cells for building drug-induced toxicity models. Other combinations can be readily formulated depending on the specific interrogative biological assessment that is being conducted. 1. Media only 2 50 11M CTL Coenzyme Q10 (CleO) 3 100 MM CTL Coenzyme Q10 4 12.5 mM Lactic Acid . 12.5 mM Lactic Acid + 50 11M CTL Coenzyme Q10 6 12.5 mM Lactic Acid + 100 uM CTL Coenzyme Q10 7 Hypoxia 8 a + 50 uM CTL me Q10 9 Hypoxia + 100 uM CTL Coenzyme Q10 . Hypoxia + 12.5 mM Lactic Acid 11. Hypoxia + 12.5 mM Lactic Acid + 50 MM CTL Coenzyme Q10 12. Hypoxia + 12.5 mM Lactic Acid + 100 MM CTL Coenzyme Q10 13. Media + 22 mM Glucose 14. 50 MM CTL Coenzyme Q10 + 22 mM Glucose . 100 HM CTL Coenzyme Q10 + 22 mM Glucose 16. 12.5 mM Lactic Acid + 22 mM Glucose 17. 12.5 mM Lactic Acid + 22 mM Glucose + 50 uM CTL me Q10 18. 12.5 mM Lactic Acid + 22 mM Glucose +100 uM CTL Coenzyme Q10 19. Hypoxia + 22 mM Glucose . Hypoxia + 22 mM Glucose + 50 uM CTL Coenzyme Q10 21. Hypoxia + 22 mM Glucose + 100 uM CTL Coenzyme Q10 22. Hypoxia +12.5 mM Lactic Acid + 22 mM Glucose 23. Hypoxia +12.5 mM Lactic Acid + 22 mM Glucose + 50 ttM CTL Coenzyme Q10 24. Hypoxia + 12.5 mM Lactic Acid + 22 mM Glucose +100 ttM CTL Coenzyme Q10 As a control one or more cell lines (e.g.,cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neural cells, renal cells, or myoblasts) are ed under l conditions in order to identify toxicity unique proteins or ys (see below). The control may be the comparison cell model described above. le cells of the same or different origin (for example, cardiomyocytes, diabetic cardiomyocytes, cytes, kidney cells, neural cells, renal cells, or myoblasts), as opposed to a single cell type, may be included in the toxicity model. In certain situations, cross talk or ECS experiments between different cells (cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neuro cells, renal cells, or myoblasts ) may be conducted for l related purposes.
In some embodiments that involve cross talk, experiments conducted on the cell models are designed to determine modulation of cellular state or function of one cell system or population (e.g.,cardiomyocytes) by another cell system or population (6. g., diabetic cardiomyocytes) under defined treatment ions (6. g., hyperglycemia, hypoxia (ischemia)). According to a typical setting, a first cell system / population is contacted by an external us components, such as a candidate molecule (6. g., a small drug molecule, a n) or a candidate condition (e.g., hypoxia, high glucose environment). In response, the first cell system / population changes its transcriptome, proteome, metabolome, and/or interactome, leading to changes that can be readily detected both inside and outside the cell. For example, s in transcriptome can be measured by the transcription level of a plurality of target mRNAs; changes in proteome can be measured by the expression level of a plurality of target proteins; and changes in metabolome can be measured by the level of a plurality of target metabolites by assays designed specifically for given metabolites. Alternatively, the above referenced changes in metabolome and/or proteome, at least with respect to n secreted metabolites or proteins, can also be measured by their effects on the second cell system / population, including the modulation of the transcriptome, proteome, metabolome, and interactome of the second cell system / population. Therefore, the experiments can be used to identify the effects of the molecule(s) of interest secreted by the first cell system / population on a second cell system / population under different treatment conditions.
The experiments can also be used to fy any proteins that are modulated as a result of signaling from the first cell system (in response to the external us component treatment) to another cell system, by, for example, differential screening of proteomics.
The same experimental setting can also be adapted for a e setting, such that reciprocal effects between the two cell s can also be assessed. In general, for this type of experiment, the choice of cell line pairs is largely based on the factors such as origin, toxicity state and cellular on.
Although two-cell systems are typically involved in this type of experimental setting, similar experiments can also be designed for more than two cell systems by, for e, immobilizing each ct cell system on a separate solid support.
Once the custom model is built, one or more “perturbations” may be applied to the system, such as genetic variation from patient to patient, or with / without treatment by certain drugs or pro-drugs. See Figure 15D. The effects of such perturbations to the system, including the effect on cells related to drug-induced toxicity, and normal control cells, can be measured using various cognized or proprietary means, as described in section III.B below.
In an exemplary experiment, cardiomyocytes are conditioned in hyperglycemia and ipidemia conditions, and in on with or without an environmental perturbation, specifically treatment by a diabetic drug known for inducing cardiotoxicity and/or a potential rescue agent CoenzymteO.
The custom built cell model may be established and used throughout the steps of the Platform Technology of the invention to ultimately identify a causal relationship unique in the drug-induced toxicity system, by carrying out the steps bed .
It will be understood by the skilled artisan, however, that a custom built cell model that is used to generate an l, “first generation” consensus causal relationship network for a drug-induced toxicity can continually evolve or expand over time, e. g., by the introduction of additional drug-induced toxicity related cell lines and/or additional drug- induced ty related conditions. Additional data from the evolved cell model, i.e., data from the newly added portion(s) of the cell model, can be collected. The new data collected from an expanded or evolved cell model, i.e., from newly added portion(s) of the cell model, can then be introduced to the data sets previously used to generate the “first generation” consensus causal relationship network in order to generate a more robust “second tion” sus causal relationship network. New causal relationships unique to the drug—induced toxicity can then be identified from the “second generation” consensus causal relationship network. In this way, the evolution of the cell model provides an evolution of the consensus causal onship networks, y providing new and/or more reliable insights into the modulators of the nduced toxicity Custom models can also be designed to assess toxicity of drugs used in combination. For example, therapeutic agents for the treatment of a number of conditions including cancer, auto-immune disease, or HIV are typically administered as cocktails of combinations of agents. Further, many subjects have multiple, unrelated conditions to be treated simultaneously (e.g., diabetes, arthritis, cardiovascular disease).
Models can be built, either in normal cells or in cells ted to various culture conditions, to identify combinations of agents that may result in toxicities when administered simultaneously. Thus, the methods provided include testing combinations of agents (e.g., 2, 3, 4, 5, 6, 7, 8 or more) er to determine if the combination results in drug d toxicities, including with agents that do not result in toxicities alone.
Models can also be built for “personalized medicine” applications in which the specific combination of drugs being stered or considered for administration can be tested using the methods provided herein to determine if the combination of drugs are likely to have unacceptable toxicities. Such combinations can be tested in various cell types (e.g., cardiac cells, kidney cells, nerve cells, muscle cells, liver cells; either cell lines or primary cells cultured from the subject) grown under various conditions to mimic the subject of interest (e.g., grown in high glucose for a subject with diabetes or hypoxia for a subject with ischemia).
Additional examples of custom built cell models are bed in detail herein.
B. Data Collection In general, two types of data may be collected from any custom built model systems. One type of data (e.g., the first set of data, the third set of data) usually s to the level of certain macromolecules, such as DNA, RNA, protein, lipid, etc. An exemplary data set in this ry is proteomic data (e. g., qualitative and quantitative data ning the expression of all or substantially all measurable proteins from a sample). The other type of data is generally functional data (e.g., the second set of data, the fourth set of data) that reflects the phenotypic changes resulting from the changes in the first type of data. Functional activity or cellular response of the cells can include any one or more of bioenergetics, cell eration, apoptosis, organellar function, a genotype-phenotype association actualized by functional models selected from ATP, ROS, OXPHOS, and Seahorse assays, global enzyme activity (e.g., global kinase activity), and an effect of global enzyme ty on the enzyme metabolic substrates of cells associated with drug-induced toxicity (e.g., phosphoproteomic data).
With respect to the first type of data, in some example embodiments, quantitative polymerase chain reaction (qPCR) and mics are performed to profile s in cellular mRNA and n expression by quantitative polymerase chain on (qPCR) and proteomics. Total RNA can be isolated using a commercial RNA isolation kit. Following cDNA synthesis, specific commercially available qPCR arrays (e.g., those from SA Biosciences) for disease area or cellular processes such as angiogenesis, sis, and diabetes, may be employed to profile a predetermined set of genes by ing a manufacturer’s instructions. For example, the Biorad cfx-384 amplification system can be used for all transcriptional profiling ments. Following data collection (Ct), the final fold change over control can be determined using the 8Ct method as outlined in manufacturer’s protocol. Proteomic sample analysis can be performed as described in subsequent sections.
The subject method may employ large-scale high-throughput quantitative proteomic analysis of hundreds of samples of similar character, and provides the data necessary for fying the cellular output differentials.
There are numerous art-recognized technologies suitable for this purpose. An exemplary technique, iTRAQ analysis in combination with mass spectrometry, is briefly bed below.
The quantitative proteomics approach is based on stable isotope labeling with the 8—plex iTRAQ reagent and 2D-LC MALDI MS/MS for e identification and quantification. Quantification with this que is relative: peptides and proteins are assigned abundance ratios relative to a nce sample. Common reference samples in multiple iTRAQ experiments facilitate the comparison of samples across multiple iTRAQ experiments.
For example, to implement this analysis scheme, six primary samples and two control pool samples can be combined into one 8-plex iTRAQ mix according to the manufacturer’s tions. This mixture of eight s then can be fractionated by two-dimensional liquid chromatography; strong cation exchange (SCX) in the first dimension, and reversed-phase HPLC in the second dimension, then can be subjected to mass spectrometric analysis.
A brief overview of exemplary laboratory procedures that can be employed is provided herein.
Protein extraction: Cells can be lysed with 8 M urea lysis buffer with protease inhibitors (Thermo Scientific Halt Protease inhibitor EDTA-free) and incubate on ice for minutes with vertex for 5 seconds every 10 minutes. Lysis can be completed by ultrasonication in 5 seconds pulse. Cell lysates can be centrifuged at 14000 x g for 15 minutes (4 0C) to remove cellular debris. rd assay can be performed to determine the n concentration. 100ug protein from each samples can be reduced (10mM Dithiothreitol (DTT), 55 0C, 1 h), alkylated (25 mM iodoacetamide, room ature, minutes) and digested with Trypsin (1:25 w/w, 200 mM triethylammonium bicarbonate (TEAB), 37 0C, 16 h). ome sample preparation: 1) In one embodiment, the cells can be cultured in serum free medium: Conditioned media can be concentrated by freeze dryer, reduced (10mM Dithiothreitol (DTT), 55 OC, 1 h), ted (25 mM iodoacetamide, at room temperature, incubate for 30 minutes), and then desalted by actone precipitation. Equal amount of proteins from the concentrated conditioned media can be digested with Trypsin (1:25 w/w, 200 mM triethylammonium bicarbonate (TEAB), 37 0C, 16 h).
In one ment, the cells can be cultured in serum ning medium: The volume of the medium can be reduced using 3k MWCO in columns (GE Healthcare Life Sciences), then can be reconstituted withleBS (Invitrogen). Serum albumin can be depleted from all samples using AlbuVoid column (Biotech Support Group, LLC) following the manufacturer’s instructions with the modifications of bufferexchange to optimize for condition medium ation. iTRAQ 8 Plex Labeling: Aliquot from each tryptic digests in each experimental set can be pooled together to create the pooled control sample. Equal ts from each sample and the pooled control sample can be labeled by iTRAQ 8 Flex reagents according to the cturer’s protocols (AB Sciex). The reactions can be combined, vacuumed to dryness, re-suspended by adding 0.1% formic acid, and analyzed by LC- MS/MS. 2D-Nan0LC-MS/MS: All labeled peptides mixtures can be separated by online 2D-nanoLC and analysed by electrospray tandem mass spectrometry. The experiments can be d out on an nt 2D NanoLC Ultra system connected to an LTQ Orbitrap Velos mass spectrometer equipped with a ectrospray ion source (Thermo Electron, Bremen, Germany).
The peptides mixtures can be injected into a 5 cm SCX column (300nm ID, 5pm, PolySULFOETHYL Aspartamide column from PolyLC, ia, MD) with a flow of 4 uL / min and eluted in 10 ion exchange elution segments into a C18 trap column (2.5 cm, 100nm ID, 5pm, 300 A ProteoPep II from New Objective, Woburn, MA) and washed for 5 min with H20/0.1%FA. The tion then can be further carried out at 300 nL/min using a gradient of 2—45% B (H20 /0.1%FA nt A) and ACN /0.1%FA (solvent B)) for 120 minutes on a 15 cm fused silica column (75pm ID, 5um, 300 A ProteoPep II from New Objective, Woburn, MA).
Full scan MS spectra (m/z 300-2000) can be ed in the Orbitrap with resolution of 30,000. The most intense ions (up to 10) can be sequentially isolated for fragmentation using High energy C-trap iation (HCD) and cally exclude for 30 seconds. HCD can be conducted with an isolation width of 1.2 Da. The resulting fragment ions can be scanned in the orbitrap with resolution of 7500. The LTQ Orbitrap Velos can be controlled by ur 2.1 with foundation 1.0.1.
Peptides/proteins identification and quantification: Peptides and proteins can be identified by automated database searching using Proteome Discoverer software (Thermo Electron) with Mascot search engine against SwissProt database. Search parameters can include 10 ppm for MS tolerance, 0.02 Da for MS2 tolerance, and full trypsin digestion allowing for up to 2 missed cleavages. Carbamidomethylation (C) can be set as the fixed modification. Oxidation (M), TMT6, and deamidation (NQ) can be set as c modifications. Peptides and protein identifications can be filtered with Mascot icant Threshold (p<0.05). The filters can be allowed a 99% confidence level of protein identification (1% FDA).
The Proteome Discoverer software can apply correction factors on the reporter ions, and can reject all quantitation values if not all quantitation channels are present.
Relative protein quantitation can be achieved by normalization at the mean intensity.
With respect to the second type of data, in some exemplary embodiments, bioenergetics profiling of cancer and normal models may employ the SeahorseTM XF24 analyzer to enable the understanding of glycolysis and oxidative phosphorylation components .
Specifically, cells can be plated on Seahorse culture plates at optimal densities.
These cells can be plated in 100 u] of media or treatment and left in a 37°C incubator with 5% C02. Two hours later, when the cells are adhered to the 24 well plate, an additional 150 pl of either media or ent solution can be added and the plates can be left in the culture incubator ght. This two step seeding procedure allows for even distribution of cells in the culture plate. se cartridges that contain the oxygen and pH sensor can be hydrated overnight in the calibrating fluid in a non-C02 incubator at 37°C. Three mitochondrial drugs are typically loaded onto three ports in the cartridge. Oligomycin, a complex [11 inhibitor, FCCP, an uncoupler and Rotenone, a x I inhibitor can be loaded into ports A, B and C respectively of the cartridge.
All stock drugs can be ed at a 10x concentration in an unbuffered DMEM media.
The cartridges can be first incubated with the mitochondrial compounds in a non-C02 incubator for about 15 minutes prior to the assay. Seahorse culture plates can be washed in DMEM based unbuffered media that ns glucose at a concentration found in the normal growth media. The cells can be layered with 630 111 of the unbuffered media and can be equilibriated in a non-C02 incubator before g in the Seahorse instrument with a precalibrated cartridge. The instrument can be run for three-four loops with a mix, wait and measure cycle for get a baseline, before injection of drugs through the port is ted. There can be two loops before the next drug is introduced.
OCR (Oxygen consumption rate) and ECAR (Extracullular Acidification Rate) can be ed by the electrodes in a 7 pl chamber and can be created with the cartridge pushing against the se culture plate.
C. Data Integration and in silico Model Generation Once relevant data sets have been obtained, integration of data sets and generation of computer-implemented statistical models may be performed using an AI- based informatics system or platform (e.g, the REFSTM platform). For example, an exemplary AI-based system may produce simulation—based networks of protein associations as key drivers of metabolic end points (ECAR/OCR). See Figure 15. Some background details regarding the REFSTM system may be found in Xing et al., “Causal Modeling Using Network Ensemble tions of Genetic and Gene Expression Data Predicts Genes Involved in Rheumatoid Arthritis,” PloS Computational Biology, vol. 7, issue. 3, 1-19 (March 2011) (e100105) and US. Patent 7,512,497 to Periwal, the entire contents of each of which is expressly incorporated herein by reference in its entirety. In essence, as described earlier, the REFSTM system is an AI—based system that employs atical algorithms to establish causal relationships among the input variables (e. g., protein expression levels, mRNA expression levels, and the corresponding functional data, such as the OCR / ECAR values measured on Seahorse culture plates).
This process is based only on the input data alone, t taking into consideration prior existing knowledge about any ial, established, and/or verified biological relationships.
In particular, a significant advantage of the platform of the invention is that the AI-based system is based on the data sets obtained from the cell model, t resorting to or taking into consideration any existing dge in the art concerning the biological process. Further, preferably, no data points are statistically or artificially cut- off and, instead, all ed data is fed into the AI-system for determining protein associations. Accordingly, the resulting statistical models generated from the platform are unbiased, since they do not take into consideration any known biological onships.
Specifically, data from the proteomics and ECAR/OCR can be input into the AI- based information system, which builds statistical models based on data ations, as described above. tion—based networks of protein associations are then derived for each disease versus normal io, including ents and conditions using the following s.
A detailed ption of an exemplary process for building the generated (e. g., optimized or evolved) networks appears below with respect to Figure 16. As described above, data from the proteomics and functional cell data is input into the AI—based system (step 210). The input data, which may be raw data or minimally processed data, is pre-processed, which may include normalization (e.g., using a quantile function or internal standards) (step 212). The pre-processing may also include imputing missing data values (e. g., by using the K—nearest neighbor (K—NN) algorithm) (step 212).
The pre-processed data is used to uct a network fragment library (step 214). The network fragments define quantitative, continuous relationships among all possible small sets (e. g., 2-3 member sets or 2-4 member sets) of measured variables (input data). The relationships between the variables in a fragment may be linear, logistic, multinomial, dominant or recessive gous, etc. The relationship in each fragment is assigned a Bayesian probabilistic score that reflect how likely the candidate onship is given the input data, and also penalizes the relationship for its mathematical complexity. By scoring all of the possible pairwise and three-way relationships (and in some embodiments also ay relationships) inferred from the input data, the most likely fragments in the library can be identified (the likely fragments). Quantitative parameters of the relationship are also computed based on the input data and stored for each fragment. Various model types may be used in fragment enumeration including but not limited to linear regression, logistic regression, (Analysis of Variance) ANOVA , sis of ance) ANCOVA models, non- linear/polynomial regression models and even non—parametric regression. The prior assumptions on model parameters may assume Gull distributions or Bayesian Information Criterion (BIC) penalties related to the number of ters used in the model. In a network inference process, each network in an ensemble of initial trial networks is constructed from a subset of fragments in the nt library. Each initial trial k in the ensemble of initial trial networks is ucted with a different subset of the fragments from the fragment library (step 216).
An overview of the mathematical representations underlying the Bayesian networks and network fragments, which is based on Xing et al., l Modeling Using Network Ensemble Simulations of Genetic and Gene Expression Data Predicts Genes Involved in Rheumatoid Arthritis,” PLoS Computational Biology, vol. 7, issue. 3, 1-19 (March 2011) (e100105), is presented below.
A multivariate system with random variables X = X 1 ,. . . X be , n may characterized by a multivariate probability distribution function P(X1 X ,. . . , n ; (9), that includes a large number of parameters (9. The multivariate probability distribution function may be factorized and represented by a t of local conditional probability distributions: P(X1,...,Xn;®) = fig(x,|rjl,...,YjKi ;o,) in which each variable X is independent from its non-descendent variables given its K l. 1. parent variables, which are Yjul,, YjK After ization, each local probability distribution has its own parameters (9,.
The multivariate probability distribution function may be factorized in ent ways with each particular factorization and corresponding parameters being a distinct probabilistic model. Each particular ization (model) can be represented by a Directed Acrylic Graph (DAC) having a vertex for each variable X and directed edges between vertices representing)dependences between variables in the local conditional distributions PX(XY‘Y 1‘1 9 sY ).Subgraphs of a DAG, each including a vertex and associated directed edges are network fragments.
A model is evolved or optimized by determining the most likely factorization and the most likely parameters given the input data. This may be described as “learning a Bayesian network,” or, in other words, given a training set of input data, finding a k that best matches the input data. This is accomplished by using a scoring function that evaluates each network with respect to the input data.
A Bayesian framework is used to determine the likelihood of a factorization given the input data. Bayes Law states that the posterior probability, P(DIM), of a model M, given data D is proportional to the product of the product of the posterior probability of the data given the model assumptions, , multiplied by the prior probability of the model, P(M ), assuming that the probability of the data, P(D), is constant across models. This is expressed in the following equation: P(DlM)* P(M) P(M|D) = P(D) The posterior probability of the data ng the model is the integral of the data hood over the prior distribution of parameters: P(D|M)= j P(D|M(®))P(®|M )dQ Assuming all models are equally likely (i.e., that P(M) is a constant), the ior probability of model M given the data D may be factored into the product of integrals over parameters for each local network nt Mi as follows: (MID)= elmml..- w Note that in the equation above, a leading constant term has been omitted. In some embodiments, a Bayesian Information Criterion (BIC), which takes a negative logarithm of the posterior probability of the model P(D|M) may be used to “Score” each model as follows: Stat (M) = —log = 25W where the total score SW for a model M is a sum of the local scores S, for each local network fragment. The BIC further gives an expression for determining a score each individual network fragment: i logN where K(Mi) is the number of fitting parameter in model M,- and N is the number of samples (data points). ,~) is the negative logarithm of the likelihood function for a network fragment, which may be ated from the functional relationships used for each network fragment. For a BIC score, the lower the score, the more likely a model fits the input data.
The ensemble of trial networks is globally optimized, which may be described as optimizing or evolving the networks (step 218). For example, the trial networks may be evolved and optimized according to a olis Monte Carlo Sampling alogorithm.
Simulated annealing may be used to optimize or evolve each trial network in the le through local transformations. In an example simulated annealing processes, each trial network is changed by adding a network fragment from the library, by deleted a network fragment from the trial network, by substituting a network fragment or by otherwise changing network topology, and then a new score for the k is ated. Generally speaking, if the score es, the change is kept and if the score worsens the change is rejected. A “temperature” parameter allows some local changes which worsen the score to be kept, which aids the zation process in avoiding some local minima. The “temperature” parameter is decreased over time to allow the optimization/evolution process to converge.
All or part of the network inference process may be conducted in parallel for the trial different networks. Each k may be optimized in parallel on a separate processor and/or on a separate computing device. In some ments, the optimization process may be conducted on a supercomputer incorporating hundreds to thousands of processors which operate in parallel. Information may be shared among the optimization ses conducted on el processors.
The optimization process may include a network filter that drops any networks from the ensemble that fail to meet a threshold standard for l score. The dropped network may be ed by a new initial network. Further any networks that are not “scale free” may be dropped from the ensemble. After the ensemble of networks has been zed or evolved, the result may be termed an ensemble of generated cell model networks, which may be collectively ed to as the ted consensus network.
D. Simulation to Extract uantitative Relationshi Information and fo’1 Prediction Simulation may be used to extract tative parameter information regarding each relationship in the generated cell model networks (step 220). For example, the simulation for quantitative information extraction may involve perturbing (increasing or decreasing) each node in the network by 10 fold and calculating the posterior distributions for the other nodes (e.g., proteins) in the models. The endpoints are compared by t-test with the assumption of 100 samples per group and the 0.01 significance cut-off. The t-test tic is the median of 100 t-tests. Through use of this simulation technique, an AUC (area under the curve) enting the strength of prediction and fold change representing the in silico magnitude of a node driving an end point are generated for each relationship in the ensemble of networks.
A relationship quantification module of a local computer system may be employed to direct the AI-based system to perform the perturbations and to extract the AUC information and fold information. The extracted quantitative information may include fold change and AUC for each edge connecting a parent note to a child node.
In some embodiments, a custom—built R program may be used to extract the quantitative information.
In some embodiments, the ensemble of ted cell model networks can be used through simulation to predict responses to changes in conditions, which may be later verified though wet-lab cell—based, or animal-based, experiments.
The output of the AI—based system may be quantitative relationship parameters and/or other simulation predictions (222).
E. Generation of Differential gDelta) ks A differential network creation module may be used to generate differential (delta) networks between generated cell model networks and generated comparison cell model networks. As described above, in some embodiments, the differential network compares all of the tative parameters of the relationships in the generated cell model networks and the generated comparison cell model network. The quantitative parameters for each relationship in the differential k are based on the comparison.
In some embodiments, a differential may be performed between various differential networks, which may be termed a delta—delta network. An e of a delta-delta network is described below with respect to Figure 18 in the Examples section. The differential network creation module may be a m or script written in PERL.
F. ization of Networks The relationship values for the ensemble of networks and for the differential networks may be visualized using a network visualization program (e.g., ape open source platform for complex network analysis and visualization from the ape consortium). In the visual depictions of the networks, the thickness of each edge (e. g., each line connecting the proteins) represents the strength of fold . The edges are also directional indicating causality, and each edge has an associated prediction confidence level.
G. Exemplary Computer System Figure 17 schematically depicts an exemplary computer system/environment that may be employed in some embodiments for communicating with the AI—based informatics system, for ting differential networks, for visualizing networks, for saving and storing data, and/or for interacting with a user. As explained above, calculations for an AI-based informatics system may be performed on a separate supercomputer with hundreds or thousands of parallel processors that interacts, directly or indirectly, with the ary computer system. The environment includes a computing device 100 with associated peripheral devices. Computing device 100 is programmable to implement executable code 150 for ming various methods, or portions of methods, taught herein. Computing device 100 es a storage device 116, such as a hard-drive, CD-ROM, or other non-transitory er readable media.
Storage device 116 may store an ing system 118 and other related software.
Computing device 100 may further include memory 106. Memory 106 may comprise a computer system memory or random access memory, such as DRAM, SRAM, EDO RAM, etc. Memory 106 may comprise other types of memory as well, or combinations thereof. Computing device 100 may store, in storage device 116 and/or memory 106, instructions for implementing and processing each portion of the executable code 150.
The executable code 150 may include code for communicating with the AI—based informatics system 190, for generating differential networks (e. g., a differential network creation ), for extracting quantitative relationship information from the AI-based informatics system (e.g., a relationship quantification module) and for visualizing networks (e. g., ape).
In some embodiments, the ing device 100 may communicate directly or indirectly with the AI-based atics system 190 (e. g., a system for executing REFS).
For e, the computing device 100 may icate with the AI-based informatics system 190 by transferring data files (e.g., data frames) to the AI-based informatics system 190 through a network. Further, the computing device 100 may have executable code 150 that provides an interface and instructions to the AI-based informatics system 190.
In some embodiments, the computing device 100 may communicate directly or indirectly with one or more experimental systems 180 that provide data for the input data set. Experimental s 180 for generating data may include systems for mass ometry based proteomics, microarray gene expression, qPCR gene expression, mass spectrometry based metabolomics, and mass spectrometry based lipidomics, SNP microarrays, a panel of functional , and other in-vitro biology platforms and technologies.
Computing device 100 also includes processor 102, and may include one or more onal processor(s) 102’, for executing software stored in the memory 106 and other programs for controlling system hardware, eral devices and/or peripheral hardware. Processor 102 and processor(s) 102’ each can be a single core processor or multiple core (104 and 104’) processor. Virtualization may be employed in computing device 100 so that infrastructure and resources in the computing device can be shared dynamically. Virtualized processors may also be used with executable code 150 and other software in storage device 116. A virtual machine 114 may be provided to handle a process running on multiple processors so that the process appears to be using only one computing ce rather than multiple. Multiple virtual es can also be used with one processor.
A user may interact with computing device 100 through a visual display device 122, such as a computer monitor, which may display a user interface 124 or any other interface. The user interface 124 of the display device 122 may be used to display raw data, visual representations of networks, etc. The visual display device 122 may also display other aspects or elements of exemplary embodiments (e.g., an icon for storage device 116). Computing device 100 may include other I/O devices such a keyboard or a multi-point touch ace (e.g., a touchscreen) 108 and a ng device 110, (e. g., a mouse, trackball and/or trackpad) for receiving input from a user. The keyboard 108 and the pointing device 110 may be ted to the visual display device 122 and/or to the computing device 100 via a wired and/or a wireless connection. ing device 100 may include a network interface 112 to interface with a network device 126 via a Local Area Network (LAN), Wide Area Network (WAN) or the Internet through a y of tions including, but not limited to, standard one lines, LAN or WAN links (e.g., 802.11, T1, T3, 56kb, X25), broadband connections (e. g., ISDN, Frame Relay, ATM), wireless connections, controller area network (CAN), or some combination of any or all of the above. The k interface 112 may comprise a built-in network adapter, network interface card, PCMCIA network card, card bus network adapter, wireless network adapter, USB network adapter, modem or any other device suitable for enabling computing device 100 to interface with any type of network capable of communication and performing the operations described herein.
Moreover, computing device 100 may be any computer system such as a workstation, desktop computer, server, laptop, handheld er or other form of computing or telecommunications device that is e of communication and that has sufficient processor power and memory capacity to perform the operations described herein.
Computing device 100 can be running any operating system 118 such as any of the versions of the MICROSOFT S ing systems, the different releases of the Unix and Linux operating systems, any version of the MACOS for Macintosh computers, any embedded operating system, any real—time operating system, any open source operating , any proprietary operating system, any operating systems for mobile computing devices, or any other operating system capable of g on the computing device and performing the operations described herein. The ing system may be running in native mode or emulated mode.
IV. Models for Drug-induced Toxicity and Uses Therefor Virtually all drug-induced toxicity involves complicated interactions among different cell types and/or organ systems. Perturbation of critical functions in one cell type or organ may lead to secondary effects on other interacting cells types and organs, and such downstream changes may in turn feedback to the initial changes and cause further cations. Therefore, it is beneficial to dissect a given drug-induced ty to its components, such as interaction between pairs of cell types or organs, and systemically probe the interactions n these components in order to gain a more complete, global View of the drug-induced ty process.
Accordingly, the t invention provides cell models for drug-induced toxicity. To this end, Applicants have built cell models for an exemplary drug-induced ty (e. g., cardiotoxicity) which have been employed in the subject discovery Platform Technology. Applicants have conducted experiments with the cell models using the subject discovery Platform Technology to generate consensus causal relationship networks, including causal relationships unique in the nduced toxicity, and thereby identify “modulators” or critical molecular “drivers” important for the particular drug-induced toxicity.
One significant advantage of the Platform Technology and its components, e. g., the custom built cell models and data sets obtained from the drug-induced toxicity cell models, is that an initial, “first generation” consensus causal relationship k generated for a drug-induced toxicity can continually evolve or expand over time, e. g., by the uction of additional cell lines/types and/or additional conditions.
Additional data from the d cell model, i.e., data from the newly added portion(s) of the cell model, can be collected. The new data collected from an expanded or evolved cell model, i.e., from newly added portion(s) of the cell model, can then be introduced to the data sets previously used to generate the “first generation” sus causal relationship network in order to generate a more robust “second generation” consensus causal relationship network. New causal relationships unique to the drug- induced toxicity can then be identified from the d generation” consensus causal relationship network. In this way, the evolution of the drug-induced toxicity cell model provides an evolution of the consensus causal relationship networks, thereby providing new and/or more reliable insights into the tors of the nduced ty. In this way, both the drug-induced toxicity cell models, the data sets from the cell models, and the causal relationship networks generated from the drug-induced toxicity cell models by using the rm Technology methods can constantly evolve and build upon previous knowledge obtained from the Platform Technology.
Accordingly, the invention provides consensus causal relationship networks generated from the drug-induced toxicity cell models ed in the Platform Technology. These consensus causal relationship networks may be first generation consensus causal relationship networks, or may be multiple generation consensus causal onship networks, e.g., 2“ 3rd, 4‘“, 5‘“, 6‘“, T“, 8‘“, 9”“, 10‘“, 11th, 12th, 13th, 14th, 15th, 16th, 17th, 18th, 19th, 20‘11 or greater generation consensus causal relationship networks.
Further, the invention provides simulated consensus causal relationship networks generated from the nduced toxicity cell models employed in the Platform Technology. These simulated consensus causal relationship networks may be first generation simulated consensus causal relationship ks, or may be multiple generation simulated consensus causal relationship networks, e.g., 2nd’ 3rd, 4”, 5m, 6th, 7th, 8th, 93‘, 103‘, 11th, 12h, 13m, 14m,15”‘,16'h, 17*, 18th, 19h, 20h or r simulated tion consensus causal relationship networks. The invention further provides delta networks and delta-delta networks generated from any of the consensus causal onship ks of the invention.
A custom built cell model for a drug-induced ty comprises one or more cells associated with the drug-induced toxicity. The model for a drug-induced toxicity may be established to simulate an nment of the drug-induced toxicity, e. g., environment of drug-induced cardiotoxicity in vivo, by creating conditions (e. g., cell culture conditions) that mimic a characteristic aspect of the drug-induced toxicity.
Multiple cells of the same or different origin, as opposed to a single cell type, may be included in the cell model. In one embodiment, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, , 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 50 or more different cell lines or cell types are included in the drug—induced toxicity cell model. In one embodiment, the cells are all of the same type, e.g., all cardiomyocytes, but are different established cell lines, e. g., different established cell lines of cardiomyocytes. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, 1 and 10, 2 and 5, or 5 and 15 ent cell lines or cell types. es of cell types that may be included in the cell models of the invention include, without limitation, human cells, animal cells, mammalian cells, plant cells, yeast, bacteria, or fungae. In one embodiment, cells of the cell model can e diseased cells, such as cancer cells or bacterially or virally infected cells. In one embodiment, cells of the cell model can include drug-induced toxicity associated cells, such as cells involved in diabetes, obesity or cardiovascular drug-induced toxicity state, e. g., aortic smooth muscle cells or hepatocytes. The skilled person would recognize those cells that are involved in or associated with a particular drug-induced toxicity, e. g., toxicity, toxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity, and any such cells may be included in a cell model of the invention, e. g., cardiomyocytes, diabetic cardiomyocytes, hepatocytes, kidney cells, neuro cells, renal cells, or myoblasts.
Cell models of the invention may include one or more “control cells.” In one embodiment, a control cell may be an untreated or unperturbed cell. In r embodiment, a “control cell” may be a normalcell, e.g., a cell that has not been exposed to a toxicity-causing agent or drug. In one embodiment, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, , 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50 or more different control cells are included in the cell model. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are ed to be a part of this invention, 6. g., between 1 and 5, 1 and 10, 2 and 5, or 5 and 15 ent control cell lines or control cell types. In one embodiment, the l cells are all of the same type but are different established cell lines of that cell type. In one embodiment, as a control, one or more normal, e.g., non-diseased, cell lines are ed under similar conditions, and/or are d to the same perturbation, as the primary cells of the cell model in order to identify proteins or pathways unique to the drug-induced toxicity.
A custom cell model of the invention may also comprise conditions that mimic a characteristic aspect of the nduced toxicity. For example, cell culture ions may be selected that closely approximating the conditions of a cell in a diabetic environment in vivo for probing diabetic drug induced toxicity, or of an aortic smooth muscle cell of a patient suffering from drug—induced cardiotoxicity. In some instances, the ions are stress conditions.Various conditions / stressors may be employed in the cell models of the invention. In one embodiment, these stressors / conditions may constitute the “perturbation”, e.g., external stimulus, for the cell systems. One exemplary stress condition is hypoxia, a condition typically found, for example, within patients with advanced stage of es. Hypoxia can be induced using art-recognized methods. For example, hypoxia can be induced by placing cell systems in a Modular Incubator Chamber (MIC-101, Billups-Rothenberg Inc. Del Mar, CA), which can be flooded with an industrial gas mix containing 5% C02, 2% Oz and 93% nitrogen.
Effects can be measured after a pre-determined period, e.g., at 24 hours after hypoxia treatment, with and without additional external stimulus components (e.g., CleO at 0, 50, or 100 MM). Likewise, lactic acid treatment mimics a cellular environment where glycolysis activity is high. Lactic acid induced stress can be investigated at a final lactic acid concentration of about 12.5 mM at a pre-determined time, e.g., at 24 hours, with or t additional external stimulus components (e.g., CleO at 0, 50, or 100 uM).
Hyperglycemia is a ion found in diabetes as well as in diabetic drug-induced toxicity. A typical hyperglycemic condition that can be used to treat the subject cells e 10% culture grade glucose added to suitable media to bring up the final concentration of glucose in the media to about 22 mM. Hyperlipidemia is a condition found, for example, in obesity and cardiovascular disease, and can be used to simulate drug-induced cardiotoxicity. The hyperlipidemic ions can be provided by culturing cells in media containing 0.15 mM sodium palmitate. Hyperinsulinemia is a condition found, for example, in diabetes, as well as in diabetic drug-induced toxicity.
The hyperinsulinemic conditions may be induced by culturing the cells in media containing 1000 nM insulin.
Individual conditions may be investigated tely in the custom built cell models of the ion, and/or may be combined together. In one embodiment, a combination of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50 or more ions reflecting or simulating different characteristic aspects of the biological system are investigated in the custom built cell model. In one embodiment, individual ions and, in addition, combinations of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, , 30, 35, 40, 45, 50 or more of the conditions reflecting or simulating different characteristic aspects of the drug—induced toxicity are igated in the custom built drug-induced toxicity cell model. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, 1 and 10, 1 and 20, 1 and 30, 2 and 5, 2 and 10, 5 and 10, 1 and 20, 5 and 20, 10 and 20, 10 and 25 and 30 or 10 and 50 different conditions. , 10 Once the custom nduced toxicity cell model is built, one or more “perturbations” may be applied to the system, such as genetic variation from patient to patient, or with / without treatment by certain drugs or pro-drugs. See Figure 15D. The effects of such bations to the cell model system can be measured using various art- recognized or proprietary means, as described in n III.B below.
The custom built drug-induced toxicity cell model may be exposed to a perturbation, e.g., an “environmental perturbation” or “external stimulus component”.
The “environmental perturbation” or “external stimulus component” may be endogenous to the cellular environment (e.g., the ar environment contains some levels of the ant, and more of the same is added to increase its , or may be exogenous to the cellular environment (e.g., the stimulant/perturbation is largely absent from the cellular environment prior to the alteration). The cellular environment may further be altered by secondary changes resulting from adding the environmental perturbation or external stimulus component, since the external stimulus component may change the cellular output of the cell , including molecules secreted into the cellular nment by the cell system. The nmental perturbation or external stimulus component may include any external al andfor chemical stimulus that may affect cellular function. This may include any large or small organic or inorganic molecules, l or synthetic chemicals, temperature shift, pH change, radiation, light (UVA, UVB eta), microwave, sonic wave, electrical current, modulated or unmodulated ic fields, etc. The environmental perturbation or external stimulus component may also include an introduced genetic cation or mutation or a vehicle (e. g., vector) that causes a genetic modification / mutation. (i) Cross-talk cell systems In n situations, where interaction between two or more cell systems are desired to be investigated, a “cross—talking cell system” may be formed by, for example, bringing the modified cellular environment of a first cell system into contact with a second cell system to affect the cellular output of the second cell system.
As used herein, -talk cell system” comprises two or more cell systems, in which the cellular environment of at least one cell system comes into contact with a second cell system, such that at least one cellular output in the second cell system is changed or affected. In certain embodiments, the cell systems within the cross-talk cell system may be in direct contact with one another. In other embodiments, none of the cell systems are in direct contact with one another.
For example, in certain embodiments, the cross-talk cell system may be in the form of a transwell, in which a first cell system is growing in an insert and a second cell system is growing in a corresponding well compartment. The two cell systems may be in contact with the same or different media, and may exchange some or all of the media components. External stimulus component added to one cell system may be substantially absorbed by one cell system and/or degraded before it has a chance to diffuse to the other cell system. Alternatively, the al stimulus component may eventually ch or reach an equilibrium within the two cell systems.
In certain embodiments, the talk cell system may adopt the form of separately cultured cell systems, where each cell system may have its own medium and/or culture conditions rature, C02 content, pH, etc), or similar or identical e conditions. The two cell systems may come into contact by, for example, taking the conditioned medium from one cell system and bringing it into contact with another cell . Direct cell-cell contacts between the two cell systems can also be effected if desired. For example, the cells of the two cell systems may be co-cultured at any point if desired, and the co-cultured cell systems can later be separated by, for example, FACS sorting when cells in at least one cell system have a le marker or label (such as a stably expressed fluorescent marker protein GFP).
Similarly, in certain embodiments, the cross—talk cell system may simply be a co- culture. Selective ent of cells in one cell system can be effected by first treating the cells in that cell system, before culturing the treated cells in co-culture with cells in another cell system. The ture cross-talk cell system setting may be helpful when it is desired to study, for example, effects on a second cell system caused by cell surface changes in a first cell system, after stimulation of the first cell system by an external stimulus component.
The cross-talk cell system of the invention is particularly le for exploring the effect of certain termined external stimulus component on the cellular output of one or both cell s. The primary effect of such a stimulus on the first cell system (with which the stimulus directly t) may be determined by comparing cellular outputs (e.g., protein expression level) before and after the first cell system’s contact with the al stimulus, which, as used herein, may be referred to as “(significant) cellular output differentials.” The secondary effect of such a stimulus on the second cell system, which is mediated through the modified cellular environment of the first cell system (such as its secretome), can also be similarly measured. There, a comparison in, for e, proteome of the second cell system can be made n the proteome of the second cell system with the external stimulus treatment on the first cell system, and the proteome of the second cell system without the external stimulus treatment on the first cell system. Any significant changes observed (in proteome or any other cellular outputs of interest) may be referred to as a “significant cellular talk differential.” In making cellular output measurements (such as protein expression), either absolute expression amount or relative expression level may be used. For e, to determine the relative protein expression level of a second cell system, the amount of any given protein in the second cell system, with or without the external stimulus to the first cell system, may be compared to a suitable control cell line and mixture of cell lines and given a fold-increase or fold—decrease value. A pre-determined threshold level for such fold-increase (e.g., at least 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.5, 3, 3.5, 4, 4.5, , 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 75 or 100 or more fold se) or fold- se (e.g., at least a decrease to 0.95, 0.9, 0.8, 0.75, 0.7, 0.6, 0.5, 0.45, 0.4, 0.35, 0.3, 0.25, 0.2, 0.15, 0.1 or 0.05 fold, or 90%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10% or 5% or less) may be used to select icant cellular cross-talk entials. All values presented in the ing list can also be the upper or lower limit of ranges, e.g., between 1.5 and 5 fold, between 2 and 10 fold, between 1 and 2 fold, or between 0.9 and 07 fold, that are intended to be a part of this invention.
Throughout the present application, all values presented in a list, e. g., such as those above, can also be the upper or lower limit of ranges that are intended to be a part of this invention.
To illustrate, in one exemplary two—cell system established to imitate aspects of a drug-induced cardiotoxicity and toxicity model, a heart smooth muscle cell line (first cell system) may be treated with a hypoxia condition (an external stimulus component), and proteome changes in a kidney cell line (second cell system) resulting from contacting the kidney cells with conditioned medium of the heart smooth muscle may be measured using conventional quantitative mass spectrometry. Significant cellular cross-talking differentials in these kidney cells may be ined, based on comparison with a proper control (e.g., similarly cultured kidney cells contacted with conditioned medium from similarly cultured heart smooth muscle cells n_ot treated with hypoxia conditions).
Not every observed significant cellular cross-talking differentials may be of biological significance. With respect to any given drug-induced toxicity for which the subject interrogative biological assessment is applied, some (or maybe all) of the significant cellular cross-talking differentials may be “determinative” with t to the specific biological problem at issue, 6. g. , either responsible for causing a drug-induced toxicity (a potential target for therapeutic intervention) or is a biomarker for the drug- induced toxicity (a ial diagnostic or stic factor).
Such determinative cross-talking differentials may be selected by an end user of the subject method, or it may be selected by a bioinformatics re program, such as DAVID-enabled comparative pathway analysis m, or the KEGG pathway analysis program. In certain embodiments, more than one bioinformatics software program is used, and consensus results from two or more bioinformatics software programs are preferred.
As used herein, rentials” of cellular outputs include differences (e.g., increased or sed levels) in any one or more parameters of the cellular outputs. For example, in terms of protein expression level, differentials between two cellular outputs, such as the outputs associated with a cell system before and after the ent by an external stimulus component, can be ed and quantitated by using art-recognized technologies, such as mass-spectrometry based assays (e.g., iTRAQ, 2D-LC-MSMS, etc.) B. Use of Cell Models for Interrogative Biological Assessments The methods and cell models described herein, and further described in international Application No. PCT/U32012/027615, may be used for, or applied to, any number of “interrogative biological assessments.” Use of the methods of the invention for an interrogative biological assessment tates the identification of “modulators” or determinative cellular process “drivers” of a drug-induced toxicity.
As used herein, an “interrogative biological assessment” may e the identification of one or more modulators of a biological system, e. g., determinative cellular process “drivers,” (e. g., an increase or decrease in activity of a biological pathway, or key members of the pathway, or key regulators to members of the pathway) associated with the environmental perturbation or external stimulus ent, or a unique causal relationship unique in a biological system or process. It may further include onal steps designed to test or verify whether the identified determinative cellular process drivers are necessary and/or sufficient for the downstream events associated with the nmental perturbation or external stimulus component, including in vivo animal models and/or in vitro tissue culture ments.
In a preferred embodiment, the interrogative biological assessment is the assessment of the drug-induced toxicological profile of an agent, e.g., a drug, on a cell, tissue, organ or organism, wherein the identified modulators of a biological system, e. g., determinative cellular process driver (e.g., cellular talk differentials or causal relationships unique in a biological system or process) may be indicators of drug- d toxicities, e. g., cytotoxicity, cardiotoxicity, hepatotoxicity, nephrotoxicity, oxicity, renaltoxicity, or myotoxicity, and may in turn be used to predict or identify the logical profile of the drug. In one embodiment, the identified modulators of a drug-induced toxicity, e.g., determinative cellular process driver (e. g., cellular cross-talk differentials or causal relationships unique in a drug-induced toxicity) is an indicator of cardiotoxicity of a drug or drug candidate, and may in turn be used to t or identify the cardiotoxicological profile of the drug or drug candidate.
V. mic Sample Analysis In certain embodiments, the subject method employs large-scale hroughput quantitative proteomic analysis of hundreds of samples of similar character, and es the data necessary for identifying the cellular output differentials.
There are numerous art-recognized technologies suitable for this purpose. An exemplary technique, iTRAQ analysis in combination with mass spectrometry, is briefly described below.
To provide reference samples for relative quantification with the iTRAQ technique, multiple QC pools are created. Two separate QC pools, consisting of aliquots of each sample, were generated from the Cell #1 and Cell #2 samples - these samples are denoted as QCSl and QCSZ, and QCPl and QCP2 for supernatants and pellets, respectively. In order to allow for protein concentration comparison across the two cell lines, cell pellet aliquots from the QC pools described above are combined in equal volumes to generate reference samples (QCP).
The tative proteomics approach is based on stable isotope labeling with the 8-plex iTRAQ t and 2D—LC MALDI MS/MS for peptide identification and fication. Quantification with this technique is relative: es and proteins are assigned nce ratios relative to a reference sample. Common reference samples in multiple iTRAQ experiments facilitate the comparison of s across multiple iTRAQ experiments.
To implement this analysis scheme, six primary samples and two control pool samples are ed into one 8-plex iTRAQ mix, with the l pool samples labeled with 113 and 117 reagents according to the manufacturer’s suggestions. This mixture of eight samples is then fractionated by two-dimensional liquid chromatography; strong cation exchange (SCX) in the first dimension, and reversed- phase HPLC in the second dimension. The HPLC eluent is directly fractionated onto MALDI , and the plates are analyzed on an MDS SCIEX/AB 4800 MALDI TOF/TOF mass ometer.
In the absence of additional information, it is assumed that the most important changes in protein expression are those within the same cell types under different treatment conditions. For this reason, primary samples from Cell#l and Cell#2 are analyzed in separate iTRAQ mixes. To facilitate comparison of protein expression in Cell#l vs. Cell#2 s, universal QCP samples are analyzed in the available “iTRAQ slots” not occupied by y or cell line ic QC samples (QCl and QC2).
A brief overview of the laboratory procedures employed is provided herein.
A. Protein Extraction From Cell Supernatant Samples For cell supernatant samples (CSN), proteins from the culture medium are present in a large excess over proteins secreted by the cultured cells. In an attempt to reduce this background, t abundant protein depletion was implemented. As specific affinity columns are not available for bovine or horse serum proteins, an anti- human Ing4 column was used. While the antibodies are directed against human proteins, the broad specificity provided by the polyclonal nature of the antibodies was anticipated to accomplish depletion of both bovine and equine proteins present in the cell culture media that was used.
A ZOO-til aliquot of the CSN QC material is loaded on a 10-mL Ing4 depletion column before the start of the study to determine the total protein concentration (Bicinchoninic acid (BCA) assay) in the flow-through material. The loading volume is then selected to achieve a ed fraction containing approximately 40 pg total B. n Extraction From Cell Pellets An aliquot of Cell #1 and Cell #2 is lysed in the “standard” lysis buffer used for the analysis of tissue samples at BGM, and total protein content is determined by the BCA assay. Having established the protein content of these representative cell lystates, all cell pellet samples (including QC s described in Section 1.1) were processed to cell lysates. Lysate amounts of approximately 40 ug of total n were carried forward in the processing workflow.
C. Sample Preparation for Mass Spectrometry Sample preparation follows standard operating procedures and constitute of the following: 0 Reduction and alkylation of proteins 0 Protein clean-up on reversed-phase column (cell pellets only) 0 Digestion with n 0 iTRAQ labeling 0 Strong cation exchange tography — collection of six fractions (Agilent 1200 system) 0 HPLC fractionation and spotting to MALDI plates (Dionex Ultimate3000/Probot D. MALDI MS and MSIMS HPLC-MS lly employs online ESI MS/MS strategies. BG Medicine uses an off-line LC-MALDI MS/MS platform that results in better concordance of observed protein sets across the primary samples without the need of injecting the same sample multiple times. Following first pass data tion across all iTRAQ mixes, since the peptide fractions are retained on the MALDI target plates, the samples can be analyzed a second time using a targeted MS/MS ition pattern derived from knowledge gained during the first acquisition. In this manner, maximum observation frequency for all of the fied proteins is accomplished (ideally, every protein should be measured in every iTRAQ mix).
E. Data Processing The data processing process within the BGM Proteomics workflow can be separated into those procedures such as inary peptide identification and fication that are completed for each iTRAQ mix individually (Section 1.5.1) and those processes (Section 1.5 .2) such as final assignment of peptides to proteins and final quantification of proteins, which are not completed until data acquisition is completed for the project.
The main data processing steps within the BGM Proteomics workflow are: 0 Peptide identification using the Mascot (Matrix Sciences) database search engine 0 Automated in house validation of Mascot IDs 0 Quantification of peptides and preliminary quantification of proteins 0 Expert on of final dataset 0 Final assignment of peptides from each mix into a common set of proteins using the automated PVT tool 0 Outlier elimination and final fication of proteins (i) Data Processing of Individual iTRAQ Mixes As each iTRAQ mix is processed through the workflow the MS/MS spectra are analyzed using proprietary BGM software tools for peptide and protein identifications, as well as initial assessment of quantification information. Based on the results of this preliminary analysis, the y of the workflow for each y sample in the mix is judged against a set of BGM performance metrics. If a given sample (or mix) does not pass the specified minimal performance metrics, and additional material is available, that sample is repeated in its entirety and it is data from this second implementation of the workflow that is incorporated in the final dataset. (ii) Peptide Identification MS/MS spectra was searched against the Uniprot protein ce database containing human, bovine, and horse sequences augmented by common contaminant sequences such as porcine trypsin. The details of the Mascot search parameters, including the complete list of modifications, are given in Table 1.
Table 1: Mascot Search Parameters Precursor mass tolerance 100 ppm Fragment mass tolerance le modifications N—term iTRAQ8 Lysine iTRAQ8 Cys carbamidomethyl Pyro—Glu (N—term) Pyro-Carbamidomethyl Cys m) ation (N only) Oxidation (M) —_sitesallowed Peptide rank considered After the Mascot search is complete, an auto-validation procedure is used to promote (i.e., validate) specific Mascot peptide matches. Differentiation between valid and invalid matches is based on the attained Mascot score relative to the expected Mascot score and the difference between the Rank 1 peptides and Rank 2 peptide Mascot scores. The criteria required for tion are at relaxed if the peptide is one of several d to a single n in the iTRAQ mix or if the peptide is present in a catalogue of previously validated peptides. (iii) Peptide and Protein Quantification The set of validated peptides for each mix is utilized to calculate preliminary protein quantification metrics for each mix. Peptide ratios are calculated by dividing the peak area from the iTRAQ label (526., m/z 114, 115, 116, 118, 119, or 121) for each validated peptide by the best representation of the peak area of the reference pool (QCl or QC2). This peak area is the e of the 113 and 117 peaks provided both samples pass QC acceptance criteria. Preliminary protein ratios are determined by calculating the median ratio of all “useful” validated peptides matching to that protein. “Useful” peptides are fully iTRAQ labeled (all N—terminal are labeled with either Lysine or PyroGlu) and fully Cysteine labeled (i.e., all Cys residues are alkylated with idomethyl or N-terminal Pyro-cmc). (iv) Post-acquisition Processing Once all passes of MS/MS data acquisition are complete for every mix in the project, the data is collated using the three steps discussed below which are aimed at ng the results from each primary sample to be simply and meaningfully compared to that of another. (v) Global Assignment of Peptide Sequences to Proteins Final assignment of e sequences to protein accession numbers is carried out through the proprietary Protein Validation Tool (PVT). The PVT procedure determines the best, minimum dundant protein set to describe the entire collection of peptides identified in the project. This is an automated ure that has been optimized to handle data from a homogeneous taxonomy.
Protein assignments for the supernatant ments were manually curated in order to deal with the complexities of mixed taxonomies in the database. Since the ted paradigm is not valid for cell cultures grown in bovine and horse serum supplemented media, extensive manual curation is necessary to minimize the ambiguity of the source of any given protein. (vi) ization of Peptide Ratios The e ratios for each sample are normalized based on the method of ompele et al. Genome Biology, 2002, 3(7), research 0034.1-11. This procedure is applied to the cell pellet measurements only. For the supernatant s, quantitative data are not normalized ering the t bution to peptide identifications coming from the media. (vii) Final Calculation of Protein Ratios A standard tical outlier elimination procedure is used to remove outliers from around each protein median ratio, beyond the 1.96 6 level in the log-transformed data set. Following this elimination process, the final set of protein ratios are (re- )calculated.
VI. Markers of the Invention and Uses Thereof The present invention is based, at least in part, on the identification of novel biomarkers that are associated with drug-induced toxicities, such as a drug-induced cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity, or response of a drug—induced toxicity to a bation, such as a therapeutic agent.
In ular, the invention relates to markers (hereinafter “markers” or rs of the invention”), which are described in the examples. The invention provides nucleic acids and proteins that are encoded by or correspond to the markers (hereinafter “marker nucleic acids” and “marker ns,” respectively). These markers are particularly useful in diagnosing drug-induced toxicity states; prognosing drug-induced toxicity ; developing drug targets for varies drug-induced toxicity states; screening for the presence of toxicity, preferably drug-induced toxicities, e.g., cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity; identifying an agent that cause or is at risk for causing drug-induced toxicity; identifying an agent that can reduce or prevent drug—induced toxicity; alleviating, reducing or preventing drug-inducedtoxicity; and identifying markers predictive of drug-induced toxicity.
A "marker" is a gene whose altered level of expression in a tissue or cell from its expression level in normal or healthy tissue or cell is associated with a toxicity state, such as a drug-induced toxicity, e.g., cardiotoxicity. A “marker nucleic acid” is a nucleic acid (e.g., mRNA, cDNA) encoded by or corresponding to a marker of the invention. Such marker nucleic acids include DNA (6. g., cDNA) comprising the entire or a partial sequence of any of the genes that are markers of the invention or the complement of such a sequence. Such ces are known to the one of skill in the art and can be found for example, on the NIH government pubmed website. The marker nucleic acids also include RNA comprising the entire or a partial sequence of any of the gene markers of the ion or the complement of such a sequence, wherein all thymidine residues are replaced with uridine residues. A “marker protein” is a protein encoded by or corresponding to a marker of the invention. A marker protein comprises the entire or a partial sequence of any of the marker proteins of the invention. Such sequences are known to the one of skill in the art and can be found for example, on the NIH government pubmed website. The terms “protein” and “polypeptide’ are used interchangeably.
A “toxic state associated" body fluid is a fluid which, when in the body of a patient, contacts or passes through sarcoma cells or into which cells or proteins shed from sarcoma cells are capable of passing. Exemplary e state or toxic state associated body fluids include blood fluids (6.g. whole blood, blood serum, blood having platelets removed rom), and are described in more detail below. Disease state or toxic state associated body fluids are not limited to, whole blood, blood having platelets d rom, lymph, prostatic fluid, urine and semen.
The "normal" level of expression of a marker is the level of expression of the marker in cells of a human subject or patient not afflicted with a toxicity state.
An “over-expression” or “higher level of expression” of a marker refers to an expression level in a test sample that is r than the standard error of the assay employed to assess expression, and is preferably at least twice, and more preferably three, four, five, six, seven, eight, nine or ten times the expression level of the marker in a control sample (e. g., sample from a healthy subject not having the marker associated a drug-induce toxicity state, e.g., cardiotoxicit, toxicity, nephrotoxicity, oxicity, renaltoxicity, or myotoxicity) and preferably, the average sion level of the marker in several control samples.
A “lower level of expression” of a marker refers to an expression level in a test sample that is at least twice, and more preferably three, four, five, six, seven, eight, nine or ten times lower than the expression level of the marker in a l sample (6. g., sample from a healthy subjects not having the marker associated a drug-induced toxicity state, e.g., cardiotoxicity, cardiotoxicity, hepatotoxicity, nephrotoxicity, neurotoxicity, renaltoxicity, or myotoxicity) and preferably, the average expression level of the marker in several control samples.
A "transcribed polynucleotide" or “nucleotide transcript” is a cleotide (e. g. an mRNA, hnRNA, a cDNA, or an analog of such RNA or cDNA) which is complementary to or homologous with all or a n of a mature mRNA made by transcription of a marker of the invention and normal ranscriptional processing (e. g. splicing), if any, of the RNA transcript, and reverse transcription of the RNA transcript.
"Complementary" refers to the broad concept of sequence mentarity between regions of two nucleic acid strands or between two regions of the same nucleic acid strand. It is known that an e residue of a first nucleic acid region is capable of forming specific hydrogen bonds ("base pairing") with a residue of a second nucleic acid region which is antiparallel to the first region if the residue is thymine or uracil.
Similarly, it is known that a ne residue of a first nucleic acid strand is capable of base pairing with a e of a second c acid strand which is antiparallel to the first strand if the residue is e. A first region of a nucleic acid is complementary to a second region of the same or a different nucleic acid if, when the two regions are arranged in an antiparallel n, at least one nucleotide residue of the first region is capable of base pairing with a e of the second region. Preferably, the first region comprises a first portion and the second region comprises a second n, whereby, when the first and second portions are arranged in an antiparallel fashion, at least about 50%, and preferably at least about ?5%, at least about 90%, or at least about 95% of the nucleotide residues of the first portion are capable of base pairing with nucleotide residues in the second portion. More preferably, all nucleotide residues of the first portion are capable of base pairing with nucleotide residues in the second portion.
"Homologous" as used herein, refers to nucleotide sequence similarity between two regions of the same nucleic acid strand or between regions of two different nucleic acid strands. When a nucleotide residue position in both regions is occupied by the same nucleotide e, then the regions are homologous at that position. A first region is homologous to a second region if at least one nucleotide residue position of each region is occupied by the same residue. gy between two regions is expressed in terms of the proportion of nucleotide residue ons of the two regions that are occupied by the same nucleotide residue. By way of example, a region having the nucleotide sequence 5‘-ATTGCC—3‘ and a region having the nucleotide sequence 5'- —3‘ share 50% homology. Preferably, the first region comprises a first portion and the second region comprises a second portion, whereby, at least about 50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide e positions of each of the portions are occupied by the same nucleotide residue.
More preferably, all nucleotide residue positions of each of the portions are occupied by the same nucleotide residue.
“Proteins of the invention” encompass marker proteins and their fragments; variant marker proteins and their fragments; peptides and polypeptides comprising an at least 15 amino acid segment of a marker or variant marker protein; and fusion ns comprising a marker or variant marker protein, or an at least 15 amino acid segment of a marker or variant marker protein.
The invention further es antibodies, antibody derivatives and antibody fragments which specifically bind with the marker proteins and fragments of the marker proteins of the present invention. Unless otherwise specified herewithin, the terms “antibody” and “antibodies” broadly encompass naturally-occurring forms of antibodies (e.g., IgG, IgA, IgM, IgE) and recombinant antibodies such as single-chain antibodies, chimeric and humanized antibodies and multi—specific antibodies, as well as fragments and tives of all of the foregoing, which fragments and derivatives have at least an antigenic binding site. Antibody derivatives may se a n or chemical moiety ated to an antibody.
In one embodiment, the markers of the invention are genes or proteins associated with or involved in nduced toxicity. Such genes or proteins involved in drug- induced toxicity e, for example, the markers listed in table 2. In some embodiments, the markers of the invention are a combination of at least two, three, four, five, six, seven, eight, nine, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the foregoing genes (or proteins). All values ted in the foregoing list can also be the upper or lower limit of ranges, that are ed to be a part of this invention, e.g., between 1 and 5, 1 and 10, 1 and 20, 1 and 30, 2 and 5, 2 and 10, 5 and 10, 1 and 20, 5 and 20, 10 and 20, 10 and , 10 and 30 of the foregoing genes (or proteins).
A. Cardiotoxicity ated Markers The present ion is based, at least in part, on the identification of novel biomarkers that are ated with drug—induced cardiotoxicity. The ion is further based, at least in part, on the discovery that Coenzyme Q10 is capable of reducing or preventing drug-induced cardiotoxicity.
Accordingly, the invention provides methods for identifying an agent that causes or is at risk for causing drug—induced cardiotoxicity. In one embodiment, the agent is a drug or drug candidate. In these methods, the amount of one or more biomarkers/proteins in a pair of samples (a first sample not subject to the drug treatment, and a second sample subjected to the drug ent) is assessed. A modulation in the level of expression of the one or more biomarkers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug- induced cardiotoxicity. In one embodiment, the one or more biomarkers is selected from the markers listed in table 2. The methods of the present invention can be practiced in conjunction with any other method used by the skilled practitioner to fy a drug at risk for causing drug-induced toxocity.
Accordingly, in one aspect, the invention provides a method for identifying a drug that causes or is at risk for causing drug-induced cardiotoxicity, comprising: comparing (i) the level of expression of one or more biomarkers present in a first cell sample obtained prior to the treatment with the drug; with (ii) the level of expression of the one or more biomarkers present in a second cell sample obtained following the treatment with the drug; wherein the one or more biomarkers is ed from the markers listed in table 2; wherein a modulation in the level of expression of the one or more biomarkers in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
In one embodiment, the cells are cells of the cardiovascular , e.g., cardiomyocytes. In one embodiment, the cells are diabetic cardiomyocytes. In one embodiment, the drug is a drug or candidate drug for treating diabetes, obesity or cardiovascular disease.
In one embodiment, a modulation (e.g., an increase or a decrease) in the level of expression of one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, twenty-five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160 or more of the biomarkers selected from the s listed in table 2 in the second sample as compared to the first sample is an indication that the drug causes or is at risk for g drug- induced toxicity.
In one embodiment, a modulation (e.g., an increase or a decrease) in the level of expression of a panel of two, three, four, five, six, seven, eight, nine, ten, , twelve, or thirteen, markers selected from a group ting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), 1C17, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4 in the second sample as compared to the first sample is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
Methods for identifying a rescue agent that can reduce or prevent drug-induced cardiotoxicity are also provided by the invention. In one embodiment, the drug is a drug or drug candidate for treating diabetes, obesity or a cardiovascular disorder. In these methods, the amount of one or more biomarkers in three samples (a first sample not subjected to the drug treatment, a second sample subjected to the drug treatment, and a third sample subjected both to the drug treatment and the agent) is assessed.
Approximately a normalized level of sion of the one or more biomarkers, in the third sample as compared to the first sample, with a changed level of expression in the second sample, is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity. In one ment, the one or more biomarkers is selected from the markers listed in table 2.
Using the methods described herein, a variety of molecules, particularly including molecules sufficiently small to be able to cross the cell ne, may be screened in order to identify molecules which modulate, e.g., increase or decrease the expression and/or activity of a marker of the invention. nds so identified can be provided to a subject in order to reduce, alleviate or prevent drug-induced cardiotoxicity in the subject.
Accordingly, in another , the invention es a method for identifying an agent that can reduce or prevent drug—induced cardiotoxicity comprising: (i) determining a normal level of expression of one or more biomarkers present in a first cell sample obtained prior to the treatment with a toxicity inducing drug; (ii) determining a treated level of sion of the one or more biomarkers present in a second cell sample obtained ing the ent with the toxicity inducing drug to identify one or more biomarkers with a change of expression in the treated cell ; (iii) ining the level of expression of the one or more biomarkers with a changed level of expression in the toxicity inducing drug treated sample present in a third cell sample obtained following the treatment with the toxicity inducing drug and the rescue agent; and (iv) comparing the level of expression of the one or more biomarkers ined in the third sample with the level of expression of the one or more kers determined in the first sample; and a normalized level of expression of the one or more biomarkers in the third sample as compared to the first sample is an indication that the agent can reduce or prevent drug—induced cardiotoxicity. In one embodiment, the one or more biomarkers is selected from the markers listed in table 2.
In one embodiment, the cells are cells of the cardiovascular system, e.g., cardiomyocytes. In one embodiment, the cells are diabetic cardiomyocytes. In one embodiment, the drug is a drug or candidate drug for treating diabetes, obesity or cardiovascular disease. In one embodiment, the drug is Anthracyclines, 5-Fluorouracil, Cisplatin, Trastuzumab, Gemcitabine, Rosiglitazone, Pioglitazone, Troglitazone, oline, Pergolide, Sumatriptan, sphonates, or TNF antagonists.In one embodiment, a normalized level of expression of one, two, three, four, five, six, seven, eight, nine, ten, , twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty, -five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the kers selected from the markers listed in table 2 in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
In one embodiment, a normalized level of expression of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYBS, PAIl, IBP7 (IGFBP7), 1C17, EDIL3, HMOXl, NUCB 1, CSOlO, HSPA4, in the third sample as compared to the first sample is an indication that the rescue agent can reduce or prevent drug-induced cardiotoxicity.
In one embodiment, the sample comprises a fluid obtained from the subject. In one embodiment, the fluid is selected from the group consisting of blood fluids, vomit, , lymph, cystic fluid, urine, fluids ted by bronchial , fluids collected by peritoneal rinsing, and gynecological fluids. In one embodiment, the sample is a blood sample or a component thereof.
In another embodiment, the sample comprises a tissue or component thereof ed from the subject. In one embodiment, the tissue is selected from the group consisting of bone, connective tissue, cartilage, lung, liver, kidney, muscle tissue, heart, pancreas, and skin.
In one embodiment, the subject is a human.
In one embodiment, the level of expression of the one or more markers in the biological sample is determined by assaying a transcribed polynucleotide or a portion f in the sample. In one embodiment, wherein assaying the transcribed polynucleotide comprises amplifying the transcribed polynucleotide.
In one embodiment, the level of expression of the marker in the subject sample is determined by assaying a protein or a portion thereof in the sample. In one embodiment, the n is assayed using a reagent which specifically binds with the protein.
In one embodiment, the level of sion of the one or more markers in the sample is determined using a technique selected from the group consisting of polymerase chain reaction (PCR) amplification reaction, reverse-transcriptase PCR is, single-strand conformation polymorphism analysis (SSCP), mismatch cleavage detection, heteroduplex analysis, Southern blot analysis, Northern blot analysis, Western blot analysis, in situ hybridization, array analysis, deoxyribonucleic acid sequencing, ction fragment length polymorphism is, and combinations or sub- combinations thereof, of said sample.
In one embodiment, the level of expression of the marker in the sample is determined using a technique selected from the group consisting of histochemistry, immunocytochemistry, flow cytometry, ELISA and mass spectrometry.
In one ment, the level of expression of a plurality of markers is determined.
The invention further provides methods for alleviating, reducing or preventing drug-induced cardiotoxicity in a subject in need thereof, comprising administering to a subject (e. g., a mammal, a human, or a non-human animal) an agent identified by the screening methods provided herein, thereby reducing or ting drug-induced cardiotoxicity in the subject. In one embodiment, the agent is administered to a subject that has already been treated with a cardiotoxicity-inducing drug. In one embodiment, the agent is administered to a subject at the same time as treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the agent is administered to a subject prior to treatment of the subject with a cardiotoxicity-inducing drug.
The invention further provides methods for alleviating, reducing or preventing drug-induced cardiotoxicity in a subject in need thereof, sing administering Coenzyme Q10 to the subject (e.g., a mammal, a human, or a non-human animal), thereby reducing or preventing drug-induced cardiotoxicity in the subject. In one embodiment, the Coenzyme Q10 is administered to a subject that has already been treated with a cardiotoxicity-inducing drug. In one embodiment, the Coenzyme Q10 is stered to a subject at the same time as treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the Coenzyme Q10 is administered to a subject prior to treatment of the subject with a cardiotoxicity-inducing drug. In one embodiment, the drug-induced cardiotoxicity is associated with modulation of sion of one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, n, sixteen, seventeen, eighteen, nineteen, twenty, -five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the biomarkers selected from the markers listed in table 2. All values presented in the foregoing list can also be the upper or lower limit of ranges, that are intended to be a part of this invention, e.g., between 1 and 5, l and 10, 2 and 5, 2 and 10, or 5 and 10 of the ing genes (or ns).
In one ment, the drug-induced toxicity is associated with modulation of a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP?), 1C17, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4.
The invention r provides biomarkers (e.g, genes and/or proteins) that are useful as predictive markers for drug-induced cardiotoxicity. These biomarkers e the markers listed in table 2. In one embodiment, the predictive markers for drug- induced cardiotoxicity is a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen, markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7I (IGFBP7), 1C17, EDIL3, HMOXl, NUCBl, CSOlO, HSPA4. The ordinary skilled artisan would, however, be able to fy additional biomarkers predictive of drug-induced cardiotoxicity by employing the methods described herein, e. g., by carrying out the methods described in Example 3 but by using a different drug known to induce cardiotoxicity. ary drug-induced cardiotoxicity kers of the invention are further bed below.
GRP78 and GRP75 are also referred to as glucose response proteins. These ns are ated with endofsarcoplasmic reticulum stress (ER stress) of cardiomyocytes. SERCA, or sarcoendoplasmic reticulum calcium ATPase, tes Ca2+ tatsis in cardiac cells. Any disruption of these ATPase can lead to cardiac dysfunction and heart failure. Based upon the data provided herein, GRP75 and GRP78 and the edges around them are novel predictors of drug induced cardiotoxicity.
TIMPl, also referred to as TIMP metalloprotease inhibitor 1, is involved with remodeling of extra cellular matrix in association with MMPs. TIMPl expression is ated with fibrosis of the heart, and hypoxia of vascular elial cells also induces TIMPl expression. Based upon the data provided herein, TIMPl is a novel predictor of drug induced cardiactoxicity PTX3, also ed to as Pentraxin 3, belongs to the family of C Reactive Proteins (CRP) and is a good marker of an inflammatory condition of the heart. r, plasma PTX3 could also be entative of systemic inflammatory response due to sepsis or other medical conditions. Based upon the data provided herein, PTX3 may be a novel marker of cardiac function or cardiotoxicity. Additionally, the edges associated with PTX 3 in the network could form a novel panel of biomarkers.
HSP76, also referred to as HSPA6, is only known to be expressed in endothelial cells and B lymphocytes. There is no known role for this protein in cardiac function.
Based upon the data provided herein, HSP76 may be a novel predictor of drug induced cardiotoxicity PDIA4, PDIAl, also referred to as protein disulphide isomerase family A proteins, are associated with ER stress response, like GRPs. There is no known role for these proteins in cardiac function. Based upon the data provided herein, these proteins may be novel predictors of drug induced cardiotoxicity.
CA2D1 is also referred to as calcium channel, voltage-dependent, alpha 2/delta subunit. The alpha-Z/delta subunit of voltage-dependent calcium channel regulates calcium t density and tion/inactivation kinetics of the calcium channel.
CA2D1 plays an important role in excitation—contraction coupling in the heart. There is no known role for this protein in cardiac function. Based upon the data ed herein, CA2Dl is a novel predictor of drug induced cardiotoxicity GPATl is one of four known glycerolphosphate acyltransferase isoforms, and is located on the mitochondrial outer membrane, allowing reciprocal tion with carnitine palmitoyltransferase-l. GPATl is upregulated transcriptionally by insulin and SREBP-lc and downregulated acutely by AMP-activated n kinase, consistent with a role in triacylglycerol synthesis. Based upon the data provided herein, GPATl is a novel predictor of drug induced toxicity.
TAZ, also referred to as Tafazzin, is highly sed in c and skeletal muscle. TAZ is ed in the lism of cardiolipin and functions as a phospholipid-lysophospholipid transacylase. Tafazzin is responsible for remodeling of a phospholipid cardiolipin (CL), the signature lipid of the mitochondrial inner membrane.
Based upon the data ed herein, TAZ is a novel predictor of drug induced cardiotoxicity Various aspects of the invention are described in further detail in the following subsections.
B. Isolated Nucleic Acid Molecules One aspect of the invention pertains to isolated nucleic acid molecules, including nucleic acids which encode a marker protein or a portion f. Isolated nucleic acids of the invention also include nucleic acid molecules sufficient for use as hybridization probes to identify marker nucleic acid molecules, and fragments of marker nucleic acid molecules, e.g., those suitable for use as PCR primers for the amplification or mutation of marker nucleic acid les. As used herein, the term "nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can be single-stranded or double—stranded, but preferably is double-stranded DNA.
An ted" c acid molecule is one which is separated from other nucleic acid molecules which are present in the natural source of the nucleic acid molecule. In one embodiment, an "isolated" nucleic acid molecule is free of sequences (preferably protein-encoding sequences) which naturally flank the nucleic acid (i.e., ces located at the 5‘ and 3‘ ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of nucleotide ces which naturally flank the nucleic acid le in c DNA of the cell from which the nucleic acid is derived. In another embodiment, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. A nucleic acid molecule that is substantially free of cellular material includes preparations having less than about 30%, 20%, 10%, or 5% of logous c acid (also referred to herein as a "contaminating nucleic acid").
A nucleic acid molecule of the present invention can be isolated using standard molecular biology techniques and the sequence information in the database records described herein. Using all or a portion of such nucleic acid ces, nucleic acid molecules of the invention can be isolated using standard hybridization and cloning techniques (e. g., as bed in Sambrook et al, ed., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989).
A nucleic acid molecule of the invention can be amplified using cDNA, mRNA, or genomic DNA as a template and appropriate ucleotide primers according to standard PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, nucleotides corresponding to all or a portion of a nucleic acid molecule of the invention can be prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer.
In another preferred embodiment, an isolated c acid molecule of the invention comprises a nucleic acid molecule which has a tide sequence complementary to the nucleotide sequence of a marker nucleic acid or to the nucleotide sequence of a nucleic acid encoding a marker n. A c acid molecule which is complementary to a given tide sequence is one which is sufficiently complementary to the given nucleotide sequence that it can hybridize to the given nucleotide sequence thereby forming a stable duplex.
Moreover, a nucleic acid molecule of the invention can comprise only a portion of a nucleic acid ce, wherein the full length nucleic acid ce comprises a marker nucleic acid or which encodes a marker protein. Such nucleic acids can be used, for example, as a probe or primer. The probe/primer typically is used as one or more substantially purified oligonucleotides. The oligonucleotide typically comprises a region of nucleotide sequence that izes under stringent conditions to at least about 7, preferably about 15, more preferably about 25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more consecutive tides of a nucleic acid of the invention.
Probes based on the sequence of a nucleic acid molecule of the invention can be used to detect transcripts or genomic sequences corresponding to one or more markers of the invention. The probe comprises a label group attached thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as part of a diagnostic test kit for identifying cells or s which mis-express the protein, such as by measuring levels of a nucleic acid molecule encoding the protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining whether a gene encoding the n has been mutated or deleted.
The ion further encompasses nucleic acid molecules that differ, due to degeneracy of the c code, from the nucleotide sequence of nucleic acids encoding a marker protein, and thus encode the same protein.
It will be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid sequence can exist within a population (6. g., the human population). Such genetic polymorphisms can exist among individuals within a population due to natural allelic variation. An allele is one of a group of genes which occur alternatively at a given genetic locus. In addition, it will be appreciated that DNA polymorphisms that affect RNA expression levels can also exist that may affect the overall expression level of that gene (6. g., by ing regulation or degradation).
As used herein, the phrase "allelic variant" refers to a nucleotide sequence which occurs at a given locus or to a polypeptide encoded by the tide sequence.
As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules comprising an open reading frame encoding a polypeptide ponding to a marker of the ion. Such natural allelic variations can typically result in 1-5% variance in the tide ce of a given gene. Alternative alleles can be identified by sequencing the gene of interest in a number of ent individuals. This can be readily carried out by using hybridization probes to identify the same genetic locus in a variety of individuals. Any and all such nucleotide variations and resulting amino acid polymorphisms or variations that are the result of natural allelic variation and that do not alter the functional activity are intended to be within the scope of the ion.
In r embodiment, an isolated nucleic acid molecule of the invention is at least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500, 4000, 4500, or more nucleotides in length and hybridizes under stringent conditions to a marker nucleic acid or to a nucleic acid encoding a marker protein. As used herein, the term "hybridizes under stringent conditions" is intended to describe conditions for ization and washing under which nucleotide ces at least 60% (65%, 70%, preferably 75%) identical to each other typically remain hybridized to each other. Such stringent conditions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6 of Current Protocols in Molecular Biology, John Wiley & Sons, NY. (1989). A preferred, non-limiting example of stringent hybridization conditions are ization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50—650C.
In addition to naturally-occurring allelic ts of a nucleic acid molecule of the invention that can exist in the population, the skilled artisan will further appreciate that sequence changes can be introduced by mutation y leading to changes in the amino acid sequence of the encoded protein, without altering the biological activity of the protein encoded thereby. For example, one can make nucleotide substitutions leading to amino acid substitutions at "non—essential" amino acid residues. A "non- essential" amino acid e is a residue that can be altered from the wild-type sequence without altering the biological activity, whereas an tial" amino acid residue is required for biological activity. For example, amino acid residues that are not conserved or only semi-conserved among homologs of various species may be non-essential for activity and thus would be likely targets for alteration. Alternatively, amino acid residues that are conserved among the gs of various species (e.g., murine and human) may be essential for activity and thus would not be likely targets for tion.
Accordingly, another aspect of the invention pertains to nucleic acid molecules encoding a variant marker protein that contain s in amino acid es that are not essential for activity. Such variant marker proteins differ in amino acid sequence from the naturally-occurring marker proteins, yet retain biological activity. In one embodiment, such a variant marker protein has an amino acid sequence that is at least about 40% identical, 50%, 60%, ?0%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to the amino acid sequence of a marker protein.
An isolated nucleic acid molecule ng a variant marker protein can be created by introducing one or more nucleotide substitutions, additions or ons into the nucleotide sequence of marker nucleic acids, such that one or more amino acid residue tutions, additions, or deletions are introduced into the encoded protein.
Mutations can be introduced by standard ques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted sential amino acid residues. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e. g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, , threonine, tyrosine, cysteine), non—polar side chains (e.g., alanine, , leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced randomly along all or part of the coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for biological activity to identify mutants that retain activity. Following mutagenesis, the encoded protein can be sed recombinantly and the activity of the protein can be determined.
The present invention encompasses antisense c acid molecules, i.e., molecules which are mentary to a sense nucleic acid of the invention, 6. g., mentary to the coding strand of a -stranded marker cDNA molecule or complementary to a marker mRNA sequence. ingly, an antisense nucleic acid of the invention can en bond to (5.6. anneal with) a sense nucleic acid of the invention. The antisense nucleic acid can be complementary to an entire coding strand, or to only a portion thereof, e. g., all or part of the protein coding region (or open reading . An antisense nucleic acid molecule can also be antisense to all or part of a non- coding region of the coding strand of a nucleotide sequence encoding a marker protein.
The non-coding s ("5‘ and 3‘ untranslated regions") are the 5' and 3' sequences which flank the coding region and are not translated into amino acids.
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention can be constructed using chemical synthesis and enzymatic ligation reactions using procedures known in the art. For example, an antisense c acid (e. g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or sly modified nucleotides ed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used. Examples of modified nucleotides which can be used to generate the antisense nucleic acid include 5—fluorouracil, 5-bromouracil, 5- chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4—acetylcytosine, 5- (carboxyhydroxylmethyl) uracil, 5—carboxymethylaminomethylthiouridine, 5- carboxymethylaminomethyluracil, dihydrouracil, -galactosquueosine, inosine, N6-isopentenyladenine, 1-methylguanine, l—methylinosine, 2,2-dimethylguanine, 2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- methylguanine, 5-methylaminomethyluracil, 5—methoxyaminomethylthiouracil, beta- D-mannosquueosine, 5‘-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- pentenyladenine, uraciloxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl—2—thiouracil, 2-thiouracil, 4-thiouracil, 5- methyluracil, uraciloxyacetic acid methylester, uraciloxyacetic acid (v), 5-methyl- 2-thiouracil, 3-(3-aminoN-2—carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. atively, the nse nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been sub-cloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described r in the following subsection).
The antisense nucleic acid molecules of the invention are typically administered to a t or generated in sin: such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a marker protein to thereby inhibit sion of the marker, e. g., by inhibiting ription and/or translation. The hybridization can be by conventional tide complementarity to form a stable duplex, or, for example, in the case of an nse nucleic acid molecule which binds to DNA duplexes, through specific interactions in the major groove of the double helix. Examples of a route of administration of antisense nucleic acid molecules of the invention includes direct injection at a tissue site or infusion of the antisense nucleic acid into toxicity state associated body fluid. Alternatively, antisense c acid les can be modified to target selected cells and then administered systemically. For example, for systemic administration, nse molecules can be modified such that they specifically bind to receptors or antigens expressed on a selected cell surface, e. g., by linking the antisense nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or antigens. The nse nucleic acid molecules can also be delivered to cells using the s bed herein. To achieve sufficient intracellular concentrations of the antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter are preferred.
An antisense nucleic acid molecule of the invention can be an (x-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double- stranded hybrids with complementary RNA in which, contrary to the usual a-units, the strands run parallel to each other (Gaultier et al., 1987, c Acids Res. 15:6625- 6641). The antisense nucleic acid molecule can also se a 2 methylribonucleotide (Inoue et al., 1987, Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 7-330).
The invention also encompasses ribozymes. Ribozymes are catalytic RNA molecules with ribonuclease activity which are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in off and Gerlach, 1988, Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby t translation of the protein encoded by the mRNA. A ribozyme having specificity for a nucleic acid molecule encoding a marker protein can be designed based upon the nucleotide ce of a cDNA corresponding to the marker. For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved (see Cech et al. US. Patent No. 071; and Cech et al. US. Patent No. 5,116,742).
Alternatively, an mRNA encoding a polypeptide of the invention can be used to select a catalytic RNA having a ic ribonuclease ty from a pool of RNA les (see, e.g., Bartel and Szostak, 1993, Science 261:1411-1418).
The invention also encompasses nucleic acid molecules which form triple helical structures. For example, expression of a marker of the invention can be inhibited by targeting nucleotide sequences complementary to the regulatory region of the gene encoding the marker nucleic acid or protein (e. g., the promoter and/or enhancer) to form triple helical structures that prevent transcription of the gene in target cells. See generally Helene (1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. N. Y.
Acad. Sci. 660227-36; and Maher (1992) Bioassays 14(12):807—15.
In various embodiments, the nucleic acid molecules of the invention can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be modified to te peptide nucleic acids (see Hyrup et al., 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, the terms de nucleic acids" or "PNAs" refer to nucleic acid mimics, e. g., DNA , in which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic th. The synthesis of PNA oligomers can be performed using standard solid phase peptide sis protocols as described in Hyrup et al. , supra; Perry-O‘Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670- 675.
PNAs can be used in therapeutic and diagnostic applications. For example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, e. g., inducing transcription or translation arrest or inhibiting replication. PNAs can also be used, e.g., in the is of single base pair mutations in a gene by, e. g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination with other enzymes, e.g., 31 nucleases (Hyrup ( 1996), supra; or as probes or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry- O‘Keefe et al., 1996, Proc. Natl. Acad. Sci. USA 70-675).
In another embodiment, PNAs can be modified, e. g., to enhance their stability or cellular , by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of mes or other techniques of drug delivery known in the art. For example, PNA—DNA chimeras can be ted which can combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, e. g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA portion would provide high binding affinity and specificity.
PNA-DNA chimeras can be linked using s of appropriate lengths selected in terms of base ng, number of bonds between the nucleobases, and orientation (Hyrup, 1996, . The synthesis of PNA—DNA chimeras can be performed as described in Hyrup , supra, and Finn et al. (1996) Nucleic Acids Res. :3357—63. For example, a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling chemistry and modified side analogs. Compounds such as 5‘-(4-methoxytrityl)amino—5‘—deoxy-thymidine phosphoramidite can be used as a link between the PNA and the 5‘ end of DNA (Mag et al., 1989, Nucleic Acids Res. 17:5973-88). PNA monomers are then coupled in a step-wise manner to produce a chimeric molecule with a 5‘ PNA segment and a 3' DNA segment (Finn et al., 1996, Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be synthesized with a 5‘ DNA segment and a 3‘ PNA segment (Peterser et al., 1975, anic Med. Chem. Lett. 5:1119-11124).
In other embodiments, the oligonucleotide can include other appended groups such as es (e. g., for targeting host cell ors in vivo), or agents facilitating transport across the cell ne (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad.
Sci. USA 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e. g., PCT Publication No. W0 34). In addition, oligonucleotides can be modified with hybridization-triggered cleavage agents (see, e.g., Krol et al., 1988, Bio/Fechniques 62958-976) or intercalating agents (see, e. g., Zon, 1988, Pharm. Res. 5:539-549). To this end, the oligonucleotide can be conjugated to another molecule, e. g., a peptide, hybridization triggered cross—linking agent, transport agent, hybridization-triggered cleavage agent, etc.
The invention also es molecular beacon nucleic acids having at least one region which is complementary to a nucleic acid of the invention, such that the molecular beacon is useful for quantitating the presence of the nucleic acid of the invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a pair of complementary regions and having a fluorophore and a fluorescent quencher ated therewith. The fluorophore and quencher are associated with different ns of the c acid in such an ation that when the complementary regions are ed with one another, fluorescence of the fluorophore is ed by the quencher. When the complementary regions of the nucleic acid are not annealed with one another, cence of the fluorophore is quenched to a lesser degree. lar beacon nucleic acids are described, for example, in US. Patent 5,876,930.
C. ed Proteins and Antibodies One aspect of the invention pertains to isolated marker proteins and biologically active portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise antibodies directed against a marker protein or a fragment thereof. In one embodiment, the native marker protein can be isolated from cells or tissue sources by an appropriate purification scheme using standard protein purification techniques. In another embodiment, a protein or peptide comprising the whole or a segment of the marker protein is produced by recombinant DNA techniques. Alternative to recombinant expression, such protein or peptide can be synthesized chemically using standard peptide synthesis ques.
An "isolated" or ied" protein or biologically active portion thereof is substantially free of cellular material or other contaminating proteins from the cell or tissue source from which the protein is derived, or substantially free of chemical precursors or other chemicals when chemically synthesized. The language "substantially free of cellular material" includes preparations of protein in which the protein is separated from cellular components of the cells from which it is isolated or recombinantly produced. Thus, protein that is substantially free of cellular material includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry weight) of heterologous protein (also referred to herein as a "contaminating protein”).
When the protein or ically active portion thereof is recombinantly produced, it is also preferably substantially free of culture medium, i.€., culture medium represents less than about 20%, 10%, or 5% of the volume of the n preparation. When the protein is ed by chemical synthesis, it is preferably substantially free of chemical precursors or other chemicals, i.e., it is separated from chemical precursors or other chemicals which are involved in the synthesis of the protein. Accordingly such preparations of the n have less than about 30%, 20%, 10%, 5% (by dry weight) of chemical precursors or compounds other than the polypeptide of interest.
Biologically active portions of a marker protein include polypeptides comprising amino acid ces sufficiently identical to or derived from the amino acid sequence of the marker protein, which include fewer amino acids than the full length protein, and exhibit at least one activity of the corresponding full—length protein. Typically, biologically active portions comprise a domain or motif with at least one activity of the corresponding full-length protein. A biologically active n of a marker protein of the invention can be a polypeptide which is, for e, 10, 25, 50, 100 or more amino acids in length. Moreover, other ically active ns, in which other regions of the marker protein are deleted, can be prepared by recombinant techniques and evaluated for one or more of the functional activities of the native form of the marker protein.
Preferred marker proteins are encoded by nucleotide sequences comprising the sequences encoding any of the genes described in the examples. Other useful proteins are substantially identical (e. g., at least about 40%, preferably 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%) to one of these sequences and retain the functional activity of the corresponding naturally-occurring marker protein yet differ in amino acid sequence due to natural allelic variation or nesis.
To determine the percent identity of two amino acid sequences or of two nucleic acids, the ces are aligned for l comparison purposes (6. g., gaps can be uced in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second amino or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide ons are then ed. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. Preferably, the percent identity between the two sequences is calculated using a global alignment. Alternatively, the percent identity between the two ces is calculated using a local alignment. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e., % identity = # of identical ons/total # of positions (e.g., overlapping positions) x100). In one embodiment the two sequences are the same length. In another embodiment, the two ces are not the same length.
The determination of percent identity between two sequences can be accomplished using a mathematical algorithm. A preferred, non-limiting example of a atical algorithm utilized for the comparison of two sequences is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is incorporated into the BLASTN and BLASTX programs of Altschul, et al. (1990) J. Mol. Biol. 3—410. BLAST nucleotide searches can be performed with the BLASTN program, score = 100, wordlength = 12 to obtain nucleotide sequences gous to a nucleic acid molecules of the invention. BLAST protein searches can be performed with the BLASTP program, score = 50, wordlength = 3 to obtain amino acid sequences gous to a protein molecules of the invention. To obtain gapped alignments for comparison purposes, a newer n of the BLAST algorithm called Gapped BLAST can be utilized as described in Altschul et al. (1997) c Acids Res. :3389-3402, which is able to perform gapped local alignments for the programs BLASTN, BLASTP and BLASTX. atively, PSI-Blast can be used to perform an iterated search which detects t relationships n molecules. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the respective programs (e.g., BLASTX and BLASTN) can be used. See http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, (1988) CABIOS 4:11—17. Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence alignment software e. When utilizing the ALIGN program for comparing amino acid ces, a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. Yet another useful algorithm for identifying s of local sequence similarity and alignment is the FASTA algorithm as described in Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 85:2444—2448. When using the FASTA thm for comparing tide or amino acid sequences, a PAM120 weight residue table can, for example, be used with a k-tuple value of 2.
The percent identity between two sequences can be determined using techniques similar to those described above, with or without allowing gaps. In calculating percent identity, only exact matches are counted.
The invention also provides ic or fusion proteins comprising a marker protein or a t thereof. As used herein, a "chimeric protein" or "fusion protein" comprises all or part (preferably a biologically active part) of a marker protein operably linked to a heterologous ptide (i.e., a polypeptide other than the marker protein).
Within the fusion protein, the term "operably linked" is intended to indicate that the marker protein or segment thereof and the heterologous polypeptide are fused in-frame to each other. The heterologous polypeptide can be fused to the amino—terminus or the carboxyl-terminus of the marker protein or segment.
One useful fusion protein is 3 GST fusion protein in which a marker protein or segment is fused to the carboxyl terminus of GST sequences. Such fusion proteins can facilitate the purification of a recombinant ptide of the invention.
In another embodiment, the fusion protein contains a heterologous signal sequence at its amino terminus. For example, the native signal sequence of a marker protein can be removed and replaced with a signal sequence from r protein. For example, the gp67 secretory sequence of the baculovirus envelope n can be used as a logous signal sequence (Ausubel et al., ed., Current Protocols in Molecular Biology, John Wiley & Sons, NY, 1992). Other examples of eukaryotic heterologous signal sequences e the secretory sequences of melittin and human placental alkaline phosphatase (Stratagene; La Jolla, California). In yet another example, useful prokaryotic heterologous signal sequences include the phoA secretory signal ook et al., supra) and the protein A secretory signal (Pharmacia Biotech; Piscataway, New Jersey).
In yet another embodiment, the fusion protein is an immunoglobulin fusion protein in which all or part of a marker protein is fused to sequences d from a member of the globulin protein . The immunoglobulin fusion proteins of the invention can be incorporated into ceutical compositions and administered to a subject to inhibit an interaction between a ligand (soluble or membrane-bound) and a protein on the surface of a cell (receptor), to thereby suppress signal transduction in vivo.
The immunoglobulin fusion protein can be used to affect the bioavailability of a cognate ligand of a marker protein. Inhibition of ligand/receptor interaction can be useful therapeutically, both for treating proliferative and entiative disorders and for modulating (e. g. promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be used as gens to produce antibodies directed against a marker protein in a subject, to purify ligands and in screening assays to identify molecules which inhibit the ction of the marker protein with ligands.
Chimeric and fusion proteins of the invention can be produced by rd recombinant DNA techniques. In another embodiment, the fusion gene can be synthesized by conventional techniques ing automated DNA synthesizers.
Alternatively, PCR amplification of gene fragments can be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can uently be annealed and lified to generate a chimeric gene sequence (see, e. g., Ausubel et (1]., supra). Moreover, many expression vectors are commercially ble that already encode a fusion moiety (e.g., a GST polypeptide).
A nucleic acid encoding a polypeptide of the invention can be cloned into such an expression vector such that the fusion moiety is linked me to the ptide of the invention.
A signal sequence can be used to facilitate secretion and isolation of marker ns. Signal sequences are lly characterized by a core of hydrophobic amino acids which are generally cleaved from the mature protein during secretion in one or more cleavage events. Such signal es contain sing sites that allow cleavage of the signal sequence from the mature proteins as they pass through the secretory pathway. Thus, the invention pertains to marker ns, fusion proteins or segments thereof having a signal sequence, as well as to such proteins from which the signal sequence has been proteolytically cleaved (i. e., the cleavage products). In one embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked in an expression vector to a protein of interest, such as a marker protein or a segment thereof. The signal sequence directs secretion of the protein, such as from a eukaryotic host into which the expression vector is ormed, and the signal sequence is subsequently or concurrently cleaved. The protein can then be readily purified from the extracellular medium by art recognized methods. Alternatively, the signal sequence can be linked to the n of interest using a sequence which facilitates purification, such as with a GST domain.
The present invention also pertains to variants of the marker proteins. Such variants have an altered amino acid sequence which can function as either agonists (mimetics) or as antagonists. Variants can be generated by mutagenesis, e.g., discrete point mutation or truncation. An agonist can retain substantially the same, or a subset, of the biological activities of the naturally occurring form of the protein. An nist of a protein can inhibit one or more of the activities of the naturally occurring form of the protein by, for example, competitively binding to a downstream or upstream member of a cellular signaling cascade which includes the protein of interest. Thus, ic biological effects can be elicited by treatment with a variant of limited function.
Treatment of a subject with a variant having a subset of the biological activities of the naturally ing form of the protein can have fewer side effects in a subject relative to treatment with the naturally occurring form of the protein.
Variants of a marker protein which function as either agonists (mimetics) or as antagonists can be identified by screening combinatorial libraries of mutants, e.g., truncation mutants, of the protein of the invention for agonist or antagonist activity. In one ment, a variegated library of variants is generated by combinatorial nesis at the nucleic acid level and is encoded by a variegated gene library. A variegated library of variants can be produced by, for example, enzymatically ligating a mixture of tic oligonucleotides into gene sequences such that a degenerate set of ial protein sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins (e. g. for phage display). There are a variety of methods which can be used to produce libraries of potential variants of the marker proteins from a degenerate oligonucleotide ce. s for synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, 1983, Tetrahedron 39:3; Itakura et al., 1984, Annu. Rev. Biochem. 53:323; a et al., 1984, Science 198:1056; Ike et al., 1983 Nucleic Acid Res. 11:47?) In addition, libraries of segments of a marker n can be used to generate a variegated population of polypeptides for screening and subsequent ion of variant marker proteins or ts thereof. For example, a library of coding sequence fragments can be generated by treating a double stranded PCR fragment of the coding sequence of interest with a nuclease under conditions n g occurs only about once per molecule, ring the double stranded DNA, renaturing the DNA to form double stranded DNA which can include sense/antisense pairs from different nicked products, removing single stranded portions from reformed duplexes by treatment with S1 nuclease, and ligating the resulting fragment library into an expression vector. By this method, an expression library can be d which encodes amino terminal and internal fragments of various sizes of the protein of interest.
Several ques are known in the art for screening gene products of atorial libraries made by point mutations or truncation, and for screening cDNA libraries for gene products having a selected ty. The most widely used techniques, which are amenable to high through-put analysis, for screening large gene libraries typically include cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the combinatorial genes under conditions in which detection of a desired activity facilitates isolation of the vector encoding the gene whose product was detected. Recursive ensemble mutagenesis (REM), a technique which enhances the frequency of functional s in the libraries, can be used in combination with the screening assays to identify variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl. Acad. Sci.
USA 89:7811-7815; Delgrave et at, 1993, n Engineering 6(3):327- 331). r aspect of the invention pertains to antibodies directed against a protein of the invention. In preferred embodiments, the antibodies specifically bind a marker protein or a fragment thereof. The terms "antibody" and odies" as used hangeably herein refer to immunoglobulin molecules as well as fragments and tives thereof that comprise an immunologically active n of an immunoglobulin molecule, (i.e., such a portion contains an antigen binding site which specifically binds an antigen, such as a marker protein, e. g., an epitope of a marker n). An antibody which specifically binds to a protein of the invention is an antibody which binds the protein, but does not substantially bind other molecules in a sample, e. g., a biological sample, which naturally contains the protein. Examples of an logically active portion of an immunoglobulin molecule include, but are not limited to, single-chain antibodies (scAb), F(ab) and 2 fragments.
An isolated protein of the invention or a fragment thereof can be used as an immunogen to generate antibodies. The full-length protein can be used or, alternatively, the invention provides antigenic peptide fragments for use as immunogens. The antigenic peptide of a protein of the invention comprises at least 8 (preferably 10, 15, 20, or 30 or more) amino acid residues of the amino acid sequence of one of the proteins of the invention, and encompasses at least one epitope of the n such that an antibody raised against the peptide forms a ic immune complex with the protein. Preferred epitopes encompassed by the antigenic peptide are regions that are located on the surface of the protein, e. g., hydrophilic regions. Hydrophobicity ce analysis, hydrophilicity sequence analysis, or similar es can be used to fy hydrophilic s. In preferred embodiments, an isolated marker protein or fragment thereof is used as an immunogen.
An immunogen typically is used to prepare antibodies by immunizing a le (i.e. competent) subject such as a rabbit, goat, mouse, or other mammal or vertebrate. An appropriate immunogenic preparation can contain, for example, recombinantly-expressed or chemically—synthesized protein or e. The preparation can further include an adjuvant, such as 's complete or incomplete nt, or a similar immunostimulatory agent. Preferred immunogen compositions are those that contain no other human proteins such as, for example, immunogen compositions made using a non-human host cell for recombinant expression of a protein of the invention. In such a manner, the resulting antibody compositions have reduced or no binding of human proteins other than a protein of the invention.
The invention es onal and monoclonal antibodies. The term lonal antibody" or "monoclonal antibody composition", as used herein, refers to a population of antibody les that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope. Preferred polyclonal and monoclonal antibody compositions are ones that have been selected for dies ed against a protein of the invention. Particularly preferred polyclonal and monoclonal antibody preparations are ones that contain only antibodies directed against a marker protein or fragment thereof.
Polyclonal antibodies can be prepared by immunizing a suitable subject with a protein of the invention as an immunogen. The antibody titer in the immunized subject can be monitored over time by standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) using immobilized polypeptide. At an appropriate time after immunization, e. g., when the specific antibody titers are highest, antibody- producing cells can be obtained from the subject and used to prepare monoclonal antibodies (mAb) by standard techniques, such as the hybridoma technique originally described by Kohler and Milstein (1975) Nature 256:495-497, the human B cell hybridoma technique (see Kozbor et al., 1983, Immunol. Today 4:72), the EBV- hybridoma technique (see Cole et at, pp. 77—96 In Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing hybridomas is well known (see generally Current ols in Immunology, COligan et al. ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a monoclonal antibody of the invention are detected by screening the hybridoma culture supernatants for antibodies that bind the ptide of interest, e. g., using a standard ELISA assay.
Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal antibody directed against a n of the invention can be identified and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage y library) with the polypeptide of interest. Kits for ting and screening phage display libraries are commercially available (e. g., the Pharmacia Recombinant Phage Antibody System, Catalog No. 2701; and the Stratagene SuerAP Phage Display Kit, Catalog No. ). Additionally, examples of methods and reagents particularly amenable for use in ting and screening antibody display library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No.
W0 92/18619; PCT Publication No. W0 91/17271; PCT Publication No. WO 92/20791; PCT Publication No. W0 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT Publication No. WO 90/02809; Fuchs et al. (1991) Bioflechnology 9:1370-1372; Hay et al. (1992) Hum. Antibod. Hybridomas 5; Huse et al. (1989) Science 246:1275- 1281; Griffiths et al. (1993) EMBO J. 12:725—734.
The invention also provides recombinant antibodies that specifically bind a protein of the ion. In red embodiments, the inant antibodies specifically binds a marker protein or fragment thereof. inant antibodies include, but are not limited to, ic and humanized monoclonal antibodies, comprising both human and non-human portions, -chain antibodies and multispecific antibodies. A chimeric antibody is a le in which different portions are derived from different animal species, such as those having a variable region derived from a murine mAb and a human immunoglobulin constant region. (See, e. g., Cabilly et al., U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are incorporated herein by reference in their entirety.) -chain dies have an antigen binding site and consist of a single polypeptide. They can be produced by techniques known in the art, for example using methods described in Ladner et. al U.S.
Pat. No. 4,946,778 (which is incorporated herein by reference in its entirety); Bird et al., (1988) Science 242:423-426; Whitlow et al., (1991) Methods in Enzymology 2:1-9; w et al., (1991) Methods in Enzymology 2:97-105; and Huston et al., (1991) Methods in Enzymology lar Design and Modeling: Concepts and Applications 203:46-88. Multi-specific antibodies are antibody molecules having at least two antigen-binding sites that specifically bind ent antigens. Such molecules can be produced by ques known in the art, for example using methods described in Segal, U.S. Patent No. 4,676,980 (the disclosure of which is incorporated herein by reference in its entirety); Holliger etal., (1993) Proc. Natl. Acad. Sci. USA 90:6444-6448; Whitlow et al., (1994) Protein Eng. 7:1017-1026 and U.S. Pat. No. 6,121,424.
Humanized antibodies are antibody molecules from non-human species having one or more complementarity determining regions (CDRs) from the non-human species and a framework region from a human immunoglobulin molecule. (See, e. g., Queen, U.S. Patent No. 5,585,089, which is incorporated herein by reference in its entirety.) Humanized monoclonal antibodies can be produced by recombinant DNA techniques known in the art, for example using methods described in PCT Publication No. WO 87/02671; European Patent Application 7; European Patent Application 171,496; European Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 4,816,567; an Patent Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA 9-3443; Liu et al. (1987) J. Immunol. 139:3521- 3526; Sun et al. (1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura et al. (1987) Cancer Res. 47:999-1005; Wood et al. (1985) Nature 314:446-449; and Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559); Morrison (1985) Science 229:1202—1207; Oi et al. (1986) Bio/Techniques 4:214; U.S.
Patent 5,225,539; Jones et al. (1986) Nature 321:552-525; Verhoeyan et al. (1988) Science 23921534; and r et al. (1988) J. Immunol. 53-4060.
More particularly, humanized antibodies can be produced, for e, using transgenic mice which are incapable of expressing endogenous immunoglobulin heavy and light chains genes, but which can express human heavy and light chain genes. The enic mice are immunized in the normal fashion with a selected antigen, e.g., all or a n of a polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed t the antigen can be obtained using conventional hybridoma technology. The human immunoglobulin transgenes harbored by the transgenic mice rearrange during B cell differentiation, and subsequently undergo class switching and somatic mutation. Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA and IgE antibodies. For an overview of this technology for producing human antibodies, see Lonberg and Huszar (1995) Int. Rev. l. 13:65-93). For a detailed discussion of this technology for producing human antibodies and human monoclonal antibodies and ols for producing such antibodies, see, 6. g., U.S.
Patent 5,625,126; US. Patent 5,633,425; US. Patent 5,569,825; U.S. Patent 5,661,016; and US. Patent 5,545,806. In addition, companies such as Abgenix, Inc. (Freemont, CA), can be engaged to provide human antibodies directed t a selected antigen using technology similar to that described above.
Completely human antibodies which ize a selected epitope can be generated using a technique referred to as d selection." In this approach a selected man monoclonal antibody, e.g., a murine antibody, is used to guide the selection of a completely human antibody recognizing the same epitope (Jespers et al., 1994, Bio/technology -903).
The antibodies of the invention can be ed after production (e.g., from the blood or serum of the subject) or synthesis and further purified by well-known techniques. For example, IgG antibodies can be purified using protein A chromatography. Antibodies ic for a protein of the invention can be selected or (e.g., partially purified) or purified by, e.g., affinity chromatography. For example, a recombinantly expressed and purified (or lly purified) protein of the invention is produced as described , and covalently or non-covalently coupled to a solid support such as, for e, a chromatography column. The column can then be used to affinity purify antibodies specific for the proteins of the invention from a sample containing antibodies directed against a large number of different epitopes, thereby generating a substantially purified antibody composition, i.e., one that is substantially free of contaminating antibodies. By a substantially purified antibody composition is meant, in this context, that the antibody sample ns at most only 30% (by dry weight) of contaminating antibodies directed against epitopes other than those of the desired protein of the invention, and preferably at most 20%, yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the sample is contaminating dies. A purified antibody composition means that at least 99% of the antibodies in the composition are directed against the desired protein of the invention.
In a preferred embodiment, the substantially purified antibodies of the invention may specifically bind to a signal peptide, a secreted sequence, an extracellular , a transmembrane or a cytoplasmic domain or cytoplasmic membrane of a protein of the invention. In a particularly preferred embodiment, the ntially purified antibodies of the invention specifically bind to a secreted sequence or an extracellular domain of the amino acid sequences of a protein of the invention. In a more preferred embodiment, the substantially purified antibodies of the invention specifically bind to a secreted sequence or an ellular domain of the amino acid sequences of a marker protein.
An antibody directed against a protein of the invention can be used to isolate the protein by standard techniques, such as affinity chromatography or immunoprecipitation. er, such an antibody can be used to detect the marker protein or fragment thereof (e. g., in a cellular lysate or cell supernatant) in order to te the level and pattern of sion of the marker. The antibodies can also be used diagnostically to r protein levels in tissues or body fluids (6. g. in ty state associated body fluid) as part of a clinical testing procedure, e. g., to, for example, determine the efficacy of a given treatment regimen. Detection can be tated by the use of an antibody derivative, which comprises an dy of the invention coupled to a detectable substance.
Examples of detectable substances include various enzymes, prosthetic groups, cent materials, luminescent materials, bioluminescent materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, B-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent als include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material es luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of suitable 125 131 35 3 radioactive matenal include. . . , I, I, S or H.
Antibodies of the invention may also be used as therapeutic agents in ng cancers. In a preferred embodiment, completely human antibodies of the invention are used for therapeutic treatment of human cancer patients, particularly those having a cancer. In another preferred ment, antibodies that bind ically to a marker protein or fragment thereof are used for therapeutic treatment. Further, such therapeutic antibody may be an antibody tive or immunotoxin comprising an antibody ated to a therapeutic moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or cytotoxic agent includes any agent that is ental to cells. Examples include taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, tenoposide, Vincristine, Vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy cin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone, glucocorticoids, ne, tetracaine, lidocaine, propranolol, and puromycin and analogs or gs thereof. Therapeutic agents include, but are not limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents (e.g., mechlorethamine, thioepa chlorambucil, melphalan, carmustine (BSNU) and lomustine (CCNU), cyclothosphamide, busulfan, dibromomannitol, ozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly ycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), cin, mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e. g., vincristine and vinblastine).
The conjugated antibodies of the invention can be used for modifying a given biological response, for the drug moiety is not to be construed as limited to classical chemical therapeutic . For example, the drug moiety may be a protein or ptide possessing a desired biological activity. Such proteins may include, for example, a toxin such as ribosome—inhibiting protein (see Better et al., U.S. Patent No. 6,146,631, the disclosure of which is incorporated herein in its entirety), abrin, ricin A, monas exotoxin, or diphtheria toxin; a protein such as tumor necrosis factor, alpha-interferon, B-interferon, nerve growth factor, platelet derived growth , tissue plasminogen activator; or, biological response modifiers such as, for e, lymphokines, interleukin-1 ("IL-1"), interleukin—2 ("IL—2"), interleukin-6 ("IL-6"), granulocyte macrophase colony stimulating factor ("GM—CSF”), granulocyte colony stimulating factor ("G-CSF"), or other growth factors.
Techniques for conjugating such therapeutic moiety to antibodies are well known, see, e. g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in Controlled Drug Delivery (2nd Ed), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, Inc. 1987); , "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic ties Of Antibody-Toxin Conjugates", Immunol. Rev., 622119-58 (1982).
Accordingly, in one aspect, the invention provides substantially purified antibodies, antibody nts and derivatives, all of which specifically bind to a protein of the invention and preferably, a marker protein. In s embodiments, the ntially purified antibodies of the invention, or fragments or derivatives thereof, can be human, non-human, ic and/or humanized antibodies. In another aspect, the invention es non—human antibodies, antibody fragments and derivatives, all of which specifically bind to a protein of the invention and preferably, a marker protein.
Such non-human antibodies can be goat, mouse, sheep, horse, n, rabbit, or rat antibodies. Alternatively, the non—human antibodies of the invention can be chimeric and/or humanized antibodies. In addition, the non—human dies of the ion can be polyclonal antibodies or monoclonal antibodies. In still a further aspect, the invention provides monoclonal antibodies, antibody fragments and derivatives, all of which ically bind to a protein of the invention and preferably, a marker protein.
The monoclonal antibodies can be human, zed, chimeric and/or non-human antibodies.
The invention also provides a kit containing an antibody of the invention conjugated to a detectable substance, and ctions for use. Still another aspect of the invention is a pharmaceutical composition comprising an antibody of the ion. In one embodiment, the pharmaceutical composition comprises an antibody of the invention and a pharmaceutically acceptable carrier.
D. Predictive Medicine The t invention pertains to the field of predictive medicine in which diagnostic assays, prognostic , pharmacogenomics, and monitoring al trails are used for prognostic (predictive) purposes to thereby treat an individual prophylactically. Accordingly, one aspect of the present invention relates to diagnostic assays for determining the level of expression of one or more marker proteins or nucleic acids, in order to determine whether an individual is at risk of developing nduced toxicity. Such assays can be used for prognostic or predictive purposes to thereby lactically treat an individual prior to the onset of the disorder.
Yet another aspect of the invention pertains to monitoring the influence of agents (6. g., drugs or other compounds administered either to inhibit or to treat or prevent or drug-induced toxicity {i.e. in order to understand any drug—induced toxic effects that such treatment may have}) on the expression or activity of a marker of the invention in clinical trials. These and other agents are bed in further detail in the following sections.
E. Diagnostic Assays An ary method for detecting the presence or absence of a marker protein or nucleic acid in a biological sample involves obtaining a biological sample (e.g. toxicity-associated body fluid or tissue ) from a test subject and contacting the ical sample with a compound or an agent capable of detecting the polypeptide or c acid (e.g., mRNA, genomic DNA, or cDNA). The detection s of the ion can thus be used to detect mRNA, protein, cDNA, or genomic DNA, for example, in a biological sample in vitra as well as in viva. For example, in vitra techniques for detection of mRNA e Northern hybridizations and in situ hybridizations. In vitra techniques for detection of a marker protein include enzyme linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence. In vitra techniques for detection of genomic DNA include Southern hybridizations. In viva techniques for detection of mRNA include polymerase chain reaction (PCR), Northern hybridizations and in situ hybridizations. rmore, in viva techniques for detection of a marker n include introducing into a subject a labeled antibody directed against the protein or fragment thereof. For example, the antibody can be labeled with a radioactive marker whose presence and location in a subject can be detected by rd imaging techniques.
A general principle of such diagnostic and prognostic assays involves preparing a sample or reaction mixture that may contain a marker, and a probe, under appropriate conditions and for a time sufficient to allow the marker and probe to interact and bind, thus forming a complex that can be d andior detected in the reaction mixture.
These assays can be conducted in a variety of ways.
For example, one method to conduct such an assay would involve anchoring the marker or probe onto a solid phase support, also referred to as a substrate, and detecting target /probe complexes anchored on the solid phase at the end of the reaction. In one embodiment of such a method, a sample from a subject, which is to be assayed for presence and/or concentration of marker, can be ed onto a carrier or solid phase support. In another embodiment, the reverse situation is possible, in which the probe can be anchored to a solid phase and a sample from a subject can be allowed to react as an unanchored component of the assay.
There are many ished methods for ing assay components to a solid phase. These include, without limitation, marker or probe molecules which are immobilized through conjugation of biotin and streptavidin. Such biotinylated assay components can be prepared from biotin—NHS roxy-succinimide) using techniques known in the art (e. g. , biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in the wells of streptavidin—coated 96 well plates (Pierce Chemical). In certain embodiments, the surfaces with lized assay components can be prepared in advance and stored.
Other suitable carriers or solid phase supports for such assays include any al e of binding the class of molecule to which the marker or probe belongs.
Well-known supports or carriers include, but are not limited to, glass, polystyrene, nylon, polypropylene, nylon, polyethylene, dextran, amylases, natural and modified celluloses, polyacrylamides, gabbros, and magnetite.
In order to conduct assays with the above mentioned approaches, the non- immobilized component is added to the solid phase upon which the second component is anchored. After the reaction is complete, uncomplexed components may be removed (6. g., by washing) under ions such that any complexes formed will remain immobilized upon the solid phase. The detection of marker/probe complexes anchored to the solid phase can be accomplished in a number of methods outlined herein.
In a preferred embodiment, the probe, when it is the ored assay component, can be labeled for the purpose of ion and readout of the assay, either directly or indirectly, with detectable labels discussed herein and which are nown to one skilled in the art.
It is also possible to directly detect markerfprobe complex formation without further manipulation or labeling of either component r or probe), for example by utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et al., US. Patent No. 5,631,169; Stavrianopoulos, et al., U.S. Patent No. 4,868,103). A fluorophore label on the first, ‘donor’ molecule is selected such that, upon excitation with incident light of appropriate wavelength, its emitted fluorescent energy will be absorbed by a fluorescent label on a second ‘acceptor’ le, which in turn is able to fluoresce due to the absorbed energy. Alternately, the ‘donor’ protein molecule may simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen that emit different wavelengths of light, such that the ‘acceptor’ le label may be differentiated from that of the ’. Since the ency of energy transfer between the labels is related to the distance separating the molecules, spatial relationships between the les can be assessed. In a situation in which binding occurs between the molecules, the fluorescent on of the ‘acceptor’ molecule label in the assay should be maximal. An FET binding event can be conveniently measured through standard fluorometric ion means well known in the art (e.g., using a fluorimeter).
In another embodiment, determination of the ability of a probe to recognize a marker can be accomplished without labeling either assay component (probe or marker) by utilizing a technology such as real-time Biomolecular Interaction Analysis (BIA) (see, e.g., Sjolander, S. and Urbaniczky, C., 1991, Anal. Chem. 63:2338—2345 and Szabo et al., 1995, Curr. Opin. Struct. Biol. 5:699-705). As used herein, “BIA” or “surface n resonance” is a technology for studying biospecific interactions in real time, without labeling any of the interactants (e.g., BIAcore). s in the mass at the binding surface (indicative of a binding event) result in alterations of the refractive index of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), resulting in a detectable signal which can be used as an indication of real-time reactions between biological molecules.
Alternatively, in r embodiment, ous diagnostic and prognostic assays can be conducted with marker and probe as solutes in a liquid phase. In such an assay, the complexed marker and probe are separated from uncomplexed components by any of a number of standard techniques, ing but not d to: differential centrifugation, chromatography, electrophoresis and immunoprecipitation. In differential centrifugation, markerfprobe complexes may be ted from uncomplexed assay components through a series of centrifugal steps, due to the ent sedimentation equilibria of complexes based on their different sizes and densities (see, for example, Rivas, G., and Minton, A.P., 1993, Trends Biochem Sci. 18(8):284-7).
Standard tographic techniques may also be utilized to separate complexed molecules from uncomplexed ones. For example, gel filtration chromatography separates molecules based on size, and through the utilization of an appropriate gel filtration resin in a column , for example, the relatively larger complex may be separated from the relatively smaller uncomplexed components. Similarly, the relatively different charge properties of the marker/probe complex as compared to the uncomplexed components may be exploited to differentiate the complex from uncomplexed ents, for example through the utilization of ion-exchange chromatography resins. Such resins and chromatographic techniques are well known to one skilled in the art (see, e.g., Heegaard, NH, 1998, J. Mol. Recognit. Winter 11(1- 6):141-8; Hage, D.S., and Tweed, S.A. J Chromatogr B Biomed Sci Appl 1997 Oct ;699(1-2):499-525). Gel electrophoresis may also be ed to separate complexed assay components from unbound ents (see, e.g., Ausubel et al., ed., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). In this technique, protein or nucleic acid xes are separated based on size or charge, for example. In order to maintain the binding interaction during the electrophoretic s, non-denaturing gel matrix materials and conditions in the absence of reducing agent are lly preferred. Appropriate conditions to the particular assay and components thereof will be well known to one d in the art.
In a particular embodiment, the level of marker mRNA can be determined both by in situ and by in vitro formats in a biological sample using s known in the art.
The term gical sample" is intended to include tissues, cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, cells and fluids present within a subject. Many expression detection methods use isolated RNA. For in vitro methods, any RNA isolation technique that does not select against the isolation of mRNA can be utilized for the purification of RNA from cells (see, e. g., Ausubel et al, ed., Current Protocols in Molecular Biology, John Wiley & Sons, New York 1987-1999).
Additionally, large numbers of tissue samples can readily be processed using ques well known to those of skill in the art, such as, for example, the single-step RNA isolation s of Chomczynski (1989, US. Patent No. 4,843,155).
The isolated mRNA can be used in hybridization or amplification assays that include, but are not limited to, Southern or Northern es, polymerase chain reaction es and probe arrays. One preferred diagnostic method for the detection of mRNA levels involves contacting the isolated mRNA with a c acid molecule (probe) that can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent ions to a mRNA or genomic DNA encoding a marker of the present invention. Other le probes for use in the diagnostic assays of the invention are described herein. Hybridization of an mRNA with the probe indicates that the marker in question is being expressed.
In one format, the mRNA is immobilized on a solid surface and contacted with a probe, for example by running the isolated mRNA on an agarose gel and transferring the mRNA from the gel to a ne, such as ellulose. In an alternative format, the probe(s) are immobilized on a solid surface and the mRNA is contacted with the probe(s), for example, in an Affymetrix gene chip array. A skilled artisan can readily adapt known mRNA detection methods for use in detecting the level of mRNA encoded by the markers of the present ion.
An alternative method for determining the level of mRNA marker in a sample involves the process of nucleic acid amplification, e.g., by RT-PCR (the experimental embodiment set forth in Mullis, 1987, U.S. Patent No. 4,683,202), ligase chain reaction y, 1991, Proc. Natl. Acad. Sci. USA, 88:189-193), self sustained sequence ation (Guatelli et al., 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al., 1989, Proc. Natl. Acad. Sci. USA 86: 1 173-1177), Q-Beta Replicase (Lizardi et al., 1988, Bio/1’echnology 6: 1 197), rolling circle replication (Lizardi et al., US. Patent No. 5,854,033) or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers. As used herein, amplification primers are defined as being a pair of nucleic acid molecules that can anneal to 5’ or 3’ regions of a gene (plus and minus strands, respectively, or vice-versa) and contain a short region in between. In general, amplification primers are from about 10 to 30 nucleotides in length and flank a region from about 50 to 200 nucleotides in . Under appropriate conditions and with appropriate reagents, such primers permit the amplification of a nucleic acid molecule comprising the nucleotide sequence flanked by the primers.
For in situ methods, mRNA does not need to be isolated from the prior to detection. In such methods, a cell or tissue sample is prepared/processed using known ogical s. The sample is then immobilized on a support, typically a glass slide, and then ted with a probe that can hybridize to mRNA that encodes the marker.
As an alternative to making inations based on the absolute expression level of the marker, determinations may be based on the normalized expression level of the marker. Expression levels are normalized by correcting the absolute expression level of a marker by comparing its expression to the expression of a gene that is not a marker, e. g., a housekeeping gene that is constitutively expressed. Suitable genes for normalization include housekeeping genes such as the actin gene, or epithelial cell- specific genes. This normalization allows the comparison of the expression level in one sample, e.g., a t sample, to r sample, e.g., a non-disease or non-toxic sample, or between samples from different sources.
Alternatively, the expression level can be provided as a relative expression level.
To determine a relative expression level of a marker, the level of expression of the marker is determined for 10 or more samples of normal versus e or toxic cell isolates, preferably 50 or more samples, prior to the ination of the expression level for the sample in on. The mean sion level of each of the genes assayed in the larger number of samples is determined and this is used as a baseline expression level for the marker. The expression level of the marker determined for the test sample (absolute level of expression) is then divided by the mean expression value obtained for that marker. This provides a relative expression level.
Preferably, the samples used in the baseline determination will be from non-toxic cells. The choice of the cell source is dependent on the use of the relative expression level. Using expression found in normal tissues as a mean expression score aids in validating whether the marker assayed is toxicity specific s normal cells). In on, as more data is accumulated, the mean expression value can be d, providing improved relative expression values based on accumulated data. Expression data from disesase cells or toxic cells provides a means for grading the severity of the disease or toxic state.
In another embodiment of the present invention, a marker n is detected. A preferred agent for detecting marker protein of the invention is an antibody capable of binding to such a protein or a fragment thereof, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more preferably, onal. An intact dy, or a fragment or derivative f (e.g., Fab or F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, is intended to encompass direct labeling of the probe or antibody by coupling (126., physically linking) a detectable substance to the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent that is directly labeled. Examples of indirect labeling include detection of a primary dy using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin such that it can be detected with fluorescently labeled avidin.
Proteins from cells can be isolated using techniques that are well known to those of skill in the art. The protein isolation methods employed can, for e, be such as those described in Harlow and Lane (Harlow and Lane, 1988, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring , New York).
A variety of formats can be employed to determine r a sample ns a protein that binds to a given antibody. Examples of such formats include, but are not limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot analysis and enzyme linked immunoabsorbant assay (ELISA). A skilled artisan can readily adapt known protein/antibody detection methods for use in determining whether cells s a marker of the present invention.
In one format, dies, or antibody fragments or derivatives, can be used in methods such as Western blots or immunofluorescence ques to detect the expressed proteins. In such uses, it is generally preferable to immobilize either the antibody or proteins on a solid support. Suitable solid phase supports or carriers include any support capable of binding an antigen or an antibody. Well—known supports or carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, es, natural and modified celluloses, polyacrylamides, gabbros, and magnetite.
One skilled in the art will know many other le carriers for g antibody or antigen, and will be able to adapt such support for use with the present invention. For example, protein isolated from disease or toxic cells can be run on a polyacrylamide gel electrophoresis and immobilized onto a solid phase support such as nitrocellulose. The support can then be washed with suitable buffers followed by treatment with the detectably d antibody. The solid phase t can then be washed with the buffer a second time to remove unbound antibody. The amount of bound label on the solid support can then be detected by conventional means.
The invention also encompasses kits for detecting the presence of a marker protein or c acid in a biological sample. Such kits can be used to determine if a subject is suffering from or is at increased risk of developing drug—induced toxicity. For example, the kit can comprise a labeled compound or agent e of detecting a marker protein or nucleic acid in a biological sample and means for determining the amount of the n or mRNA in the sample (e.g., an antibody which binds the protein or a fragment thereof, or an oligonucleotide probe which binds to DNA or mRNA encoding the protein). Kits can also e instructions for interpreting the results obtained using the kit.
For antibody-based kits, the kit can comprise, for example: (1) a first antibody (e.g., attached to a solid support) which binds to a marker protein; and, optionally, (2) a second, different antibody which binds to either the protein or the first antibody and is conjugated to a detectable label.
For oligonucleotide-based kits, the kit can se, for example: (1) an oligonucleotide, e. g., a detectably labeled oligonucleotide, which hybridizes to a nucleic acid sequence encoding a marker protein or (2) a pair of primers useful for amplifying a marker nucleic acid molecule. The kit can also comprise, e.g., a buffering agent, a vative, or a protein stabilizing agent. The kit can further comprise components necessary for detecting the detectable label (e.g. , an enzyme or a substrate). The kit can also n a control sample or a series of control samples which can be assayed and compared to the test sample. Each component of the kit can be enclosed within an dual ner and all of the various containers can be within a single package, along with ctions for interpreting the results of the assays performed using the kit.
F. Pharmacogenomics The markers of the invention are also useful as pharmacogenomic markers. As used herein, a “pharmacogenomic marker” is an ive biochemical marker whose expression level correlates with a specific clinical drug response or susceptibility in a patient (see, e.g., McLeod et a]. (1999) Eur. J. Cancer 35(12): 1650-1652). The presence or quantity of the pharmacogenomic marker expression is related to the predicted response of the patient and more particularly the patient’s diseased or toxic cells to therapy with a specific drug or class of drugs. By assessing the presence or quantity of the expression of one or more cogenomic markers in a patient, a drug y which is most appropriate for the patient, or which is ted to have a greater degree of success, may be selected. For example, based on the presence or quantity of RNA or protein encoded by specific tumor markers in a patient, a drug or course of treatment may be selected that is optimized for the ent of the ic tumor likely to be present in the patient. The use of pharmacogenomic markers therefore permits selecting or designing the most appropriate treatment for each cancer patient without trying different drugs or regimes.
Another aspect of pharmacogenomics deals with genetic conditions that alters the way the body acts on drugs. These pharmacogenetic conditions can occur either as rare defects or as polymorphisms. For example, glucosephosphate dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main al complication is hemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of fava beans.
As an illustrative embodiment, the activity of drug metabolizing s is a major determinant of both the intensity and duration of drug action. The discovery of genetic polymorphisms of drug metabolizing s (6. g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 s CYP2D6 and CYP2C19) has provided an explanation as to why some ts do not obtain the ed drug effects or show exaggerated drug response and serious ty after taking the standard and safe dose of a drug.
These polymorphisms are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among different populations. For example, the gene coding for CYP2D6 is highly rphic and several mutations have been identified in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently experience exaggerated drug response and side effects when they receive standard doses.
If a metabolite is the active therapeutic moiety, 3 PM will show no therapeutic response, as demonstrated for the sic effect of codeine mediated by its CYP2D6-formed metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who do not respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene ication.
Thus, the level of expression of a marker of the invention in an individual can be determined to thereby select appropriate s) for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used to apply genotyping of rphic alleles encoding drug—metabolizing enzymes to the identification of an dual‘s drug responsiveness phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when treating a subject with a modulator of expression of a marker of the invention.
G. Monitoring Clinical Trials Monitoring the influence of agents (e.g., drug nds) on the level of sion of a marker of the invention can be applied not only in basic drug ing, but also in al trials. For example, the effectiveness of an agent to affect marker expression can be monitored in clinical trials of subjects receiving treatment for cardiotoxicity, or drug-induced toxicity. In a preferred embodiment, the present invention provides a method for monitoring the iveness of treatment of a subject with an agent (6. g., an agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) comprising the steps of (i) obtaining a pre-administration sample from a subject prior to administration of the agent; (ii) detecting the level of expression of one or more selected markers of the invention in the pre-administration sample; (iii) obtaining one or more post-administration samples from the subject; (iv) detecting the level of expression of the marker(s) in the post- administration samples; (v) comparing the level of expression of the (s) in the ministration sample with the level of expression of the marker(s) in the post- administration sample or samples; and (vi) altering the administration of the agent to the subject accordingly. For example, increased expression of the marker gene(s) during the course of treatment may indicate ineffective dosage and the desirability of sing the dosage. Conversely, decreased expression of the marker gene(s) may te efficacious treatment and no need to change dosage.
H. Arrays The invention also includes an array comprising a marker of the present invention. The array can be used to assay expression of one or more genes in the array.
In one embodiment, the array can be used to assay gene expression in a tissue to ascertain tissue specificity of genes in the array. In this manner, up to about 7600 genes can be aneously assayed for expression. This allows a profile to be developed showing a battery of genes specifically expressed in one or more tissues.
In addition to such qualitative determination, the invention allows the quantitation of gene expression. Thus, not only tissue specificity, but also the level of expression of a battery of genes in the tissue is ascertainable. Thus, genes can be grouped on the basis of their tissue expression per se and level of expression in that . This is useful, for example, in aining the relationship of gene expression between or among tissues. Thus, one tissue can be bed and the effect on gene expression in a second tissue can be determined. In this context, the effect of one cell type on another cell type in response to a biological stimulus can be determined. Such a determination is useful, for example, to know the effect of cell-cell interaction at the level of gene expression. If an agent is stered therapeutically to treat one cell type but has an undesirable effect on another cell type, the invention es an assay to determine the lar basis of the undesirable effect and thus provides the opportunity to co-administer a counteracting agent or otherwise treat the undesired effect. Similarly, even within a single cell type, undesirable biological effects can be determined at the molecular level. Thus, the effects of an agent on expression of other than the target gene can be ascertained and racted.
In another embodiment, the array can be used to monitor the time course of expression of one or more genes in the array. This can occur in various biological contexts, as disclosed herein, for example development of drug-induced toxicity, ssion of drug-induced toxicity, and processes, such a cellular transformation associated with drug-induced toxicity.
The array is also useful for ascertaining the effect of the expression of a gene on the expression of other genes in the same cell or in ent cells. This es, for example, for a ion of alternate molecular targets for therapeutic intervention if the ultimate or downstream target cannot be regulated.
The array is also useful for ascertaining differential expression patterns of one or more genes in normal and abnormal cells. This provides a battery of genes that could serve as a molecular target for sis or therapeutic intervention.
VII. Methods for Obtaining Samples Samples useful in the methods of the invention include any tissue, cell, biopsy, or bodily fluid sample that expresses a marker of the invention. In one embodiment, a sample may be a tissue, a cell, whole blood, serum, plasma, buccal scrape, saliva, cerebrospinal fluid, urine, stool, or bronchoalveolar lavage. In preferred embodiments, the tissue sample is a toxicity state sample. In more preferred embodiments, the tissue sample is a a cardiovascular sample or a drug-induced toxicity sample.
Body samples may be obtained from a subject by a variety of techniques known in the art including, for example, by the use of a biopsy or by scraping or swabbing an area or by using a needle to aspirate bodily fluids. Methods for collecting various body samples are well known in the art.
Tissue samples suitable for ing and quantitating a marker of the invention may be fresh, , or fixed according to methods known to one of skill in the art.
Suitable tissue samples are ably sectioned and placed on a microscope slide for r analyses. Alternatively, solid samples, i.e., tissue samples, may be solubilized and/or homogenized and subsequently analyzed as soluble extracts.
In one embodiment, a freshly obtained biopsy sample is frozen using, for example, liquid nitrogen or odichloromethane. The frozen sample is mounted for sectioning using, for example, OCT, and serially sectioned in a cryostat. The serial sections are ted on a glass microscope slide. For immunohistochemical staining the slides may be coated with, for example, chrome-alum, gelatine or poly-L-lysine to ensure that the sections stick to the slides. In another embodiment, samples are fixed and embedded prior to sectioning. For example, a tissue sample may be fixed in, for e, formalin, ly dehydrated and ed in, for example, paraffin.
Once the sample is obtained any method known in the art to be suitable for detecting and quantitating a marker of the invention may be used r at the nucleic acid or at the protein level). Such methods are well known in the art and include but are not limited to western blots, northern blots, southern blots, immunohistochemistry, ELISA, e. g., amplified ELISA, precipitation, immunofluorescence, flow cytometry, immunocytochemistry, mass spectrometrometric analyses, e.g., MALDI- TOF and SELDI—TOF, nucleic acid hybridization techniques, nucleic acid e ription methods, and c acid amplification methods. In particular embodiments, the expression of a marker of the invention is detected on a protein level using, for example, antibodies that specifically bind these proteins.
Samples may need to be modified in order to make a marker of the invention accessible to antibody binding. In a ular aspect of the immunocytochemistry or immunohistochemistry methods, slides may be erred to a pretreatment buffer and optionally heated to increase antigen accessibility. Heating of the sample in the pretreatment buffer rapidly disrupts the lipid bi-layer of the cells and makes the antigens (may be the case in fresh specimens, but not typically what occurs in fixed specimens) more accessible for antibody binding. The terms "pretreatment buffer" and ration buffer" are used interchangeably herein to refer to a buffer that is used to prepare cytology or histology samples for immunostaining, particularly by sing the accessibility of a marker of the invention for antibody binding. The atment buffer may comprise a pH-specific salt solution, a polymer, a detergent, or a nonionic or anionic surfactant such as, for example, an xylated anionic or nonionic surfactant, an alkanoate or an alkoxylate or even blends of these surfactants or even the use of a bile salt. The pretreatment buffer may, for example, be a solution of 0.1% to 1% of deoxycholic acid, sodium salt, or a solution of sodium laureth-l3-carboxylate (e. g., Sandopan LS) or and ethoxylated anionic complex. In some embodiments, the pretreatment buffer may also be used as a slide storage buffer.
Any method for making marker proteins of the invention more ible for antibody binding may be used in the practice of the invention, including the antigen retrieval methods known in the art. See, for example, Bibbo, et al. (2002) Acta. Cytol. 46:25-29; Saqi, et al. (2003) Diagn. Cytopathol. 27:365—370; Bibbo, et al. (2003) Anal.
Quant. Cytol. Histol. 2528-11, the entire contents of each of which are orated herein by reference.
Following pretreatment to increase marker protein accessibility, samples may be d using an appropriate ng agent, e.g., a dase blocking reagent such as hydrogen peroxide. In some embodiments, the samples may be blocked using a protein blocking reagent to prevent non-specific binding of the antibody. The protein blocking reagent may comprise, for example, purified casein. An antibody, particularly a monoclonal or polyclonal dy that specifically binds to a marker of the invention is then ted with the sample. One of skill in the art will appreciate that a more accurate prognosis or diagnosis may be obtained in some cases by detecting multiple epitopes on a marker protein of the invention in a patient sample. Therefore, in particular embodiments, at least two dies directed to different es of a marker of the invention are used. Where more than one antibody is used, these antibodies may be added to a single sample sequentially as individual antibody reagents or simultaneously as an antibody cocktail. atively, each individual antibody may be added to a separate sample from the same patient, and the resulting data pooled. ques for detecting antibody binding are well known in the art. dy binding to a marker of the invention may be detected through the use of chemical reagents that generate a able signal that corresponds to the level of antibody binding and, accordingly, to the level of marker protein sion. In one of the immunohistochemistry or immunocytochemistry methods of the invention, antibody binding is detected through the use of a secondary antibody that is conjugated to a labeled polymer. Examples of labeled polymers include but are not limited to polymer- enzyme conjugates. The enzymes in these complexes are typically used to catalyze the deposition of a chromogen at the antigen-antibody g site, thereby resulting in cell staining that corresponds to sion level of the biomarker of interest. Enzymes of particular interest include, but are not limited to, horseradish peroxidase (HRP) and alkaline atase (AP).
In one particular immunohistochemistry or immunocytochemistry method of the invention, antibody binding to a marker of the invention is detected through the use of an HRP-labeled polymer that is conjugated to a secondary antibody. dy binding can also be detected through the use of a species-specific probe reagent, which binds to monoclonal or polyclonal antibodies, and a polymer conjugated to HRP, which binds to the species specific probe t. Slides are d for antibody binding using any chromagen, e. g., the chromagen 3,3-diaminobenzidine (DAB), and then counterstained with hematoxylin and, optionally, a bluing agent such as ammonium hydroxide or TBS/Tween-QO. Other suitable chromagens include, for e, 3-amino ethylcarbazole (AEC). In some aspects of the invention, slides are reviewed microscopically by a cytotechnologist and/or a pathologist to assess cell staining, e.g., fluorescent staining (i.e., marker expression). Alternatively, samples may be reviewed via automated microscopy or by nel with the assistance of er software that facilitates the identification of positive ng cells.
Detection of antibody binding can be facilitated by coupling the anti-marker antibodies to a detectable nce. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive als. Examples of suitable enzymes include adish peroxidase, alkaline phosphatase, B—galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein ocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; examples of inescent materials include rase, luciferin, and aequorin; and examples of le radioactive material include 125I, 131I, 358, 14C, or 3H.
In one embodiment of the invention frozen samples are prepared as described above and uently stained with antibodies t a marker of the invention diluted to an appropriate concentration using, for example, Tris-buffered saline (TBS). Primary antibodies can be detected by incubating the slides in biotinylated mmunoglobulin.
This signal can optionally be amplified and visualized using obenzidine precipitation of the antigen. Furthermore, slides can be optionally counterstained with, for example, hematoxylin, to visualize the cells.
In another embodiment, fixed and embedded samples are stained with dies t a marker of the invention and counterstained as described above for frozen sections. In addition, samples may be optionally treated with agents to amplify the signal in order to visualize antibody staining. For example, a peroxidase-catalyzed deposition of yl-tyramide, which in turn is reacted with peroxidase-conjugated streptavidin (Catalyzed Signal Amplification (CSA) System, DAKO, Carpinteria, CA) may be used.
Tissue-based assays (i.e., immunohistochemistry) are the preferred methods of detecting and quantitating a marker of the invention. In one embodiment, the presence or absence of a marker of the invention may be determined by immunohistochemistry.
In one embodiment, the immunohistochemical analysis uses low concentrations of an anti-marker antibody such that cells lacking the marker do not stain. In another embodiment, the presence or absence of a marker of the invention is determined using an immunohistochemical method that uses high concentrations of an anti-marker antibody such that cells lacking the marker protein stain heavily. Cells that do not stain contain either mutated marker and fail to produce antigenically recognizable marker protein, or are cells in which the pathways that regulate marker levels are dysregulated, resulting in steady state expression of negligible marker protein.
One of skill in the art will recognize that the concentration of a particular dy used to practice the methods of the invention will vary depending on such factors as time for binding, level of specificity of the dy for a marker of the invention, and method of sample preparation. Moreover, when multiple antibodies are used, the required concentration may be affected by the order in which the dies are applied to the sample, e. g., simultaneously as a cocktail or sequentially as individual antibody reagents. Furthermore, the detection chemistry used to visualize antibody binding to a marker of the invention must also be zed to produce the desired signal to noise ratio.
In one embodiment of the ion, proteomic methods, e.g., mass spectrometry, are used for detecting and quantitating the marker proteins of the invention. For e, matrix-associated laser desorption/ionization time-of—flight mass spectrometry -TOF MS) or surface-enhanced laser tion/ionization time-of—flight mass spectrometry (SELDI-TOF MS) which es the application of a biological sample, such as serum, to a protein-binding chip (Wright, G.L., Jr., et al. (2002) Expert Rev M01 Diagn 2:549; Li, J ., et al. (2002) Clin Chem 6; Laronga, C., et al. (2003) Dis Markers 19:229; Petricoin, EMF, et al. (2002) 359:572; Adam, B.L., et al. (2002) Cancer Res 6223609; , J et a1. (2004) Lab Invest 84:845; Xiao, Z., et al. (2001) Cancer Res 6126029) can be used to detect and quantitate the PY-Shc and/or p66-Shc proteins. Mass spectrometric methods are described in, for example, U.S. Patent Nos. 5,622,824, 5,605,?98 and 5,54?,835, the entire contents of each of which are incorporated herein by reference.
In other embodiments, the expression of a marker of the invention is detected at the nucleic acid level. Nucleic acid-based techniques for assessing expression are well known in the art and include, for example, determining the level of marker mRNA in a sample from a subject. Many expression detection methods use ed RNA. Any RNA isolation technique that does not select against the isolation of mRNA can be utilized for the purification of RNA from cells that express a marker of the invention (see, e.g., l et al., ed., 1999) Current Protocols in Molecular Biology (John Wiley & Sons, New York). onally, large numbers of tissue samples can readily be processed using techniques well known to those of skill in the art, such as, for example, the single-step RNA isolation process of Chomczynski (1989, U.S. Pat. No. 4,843,155).
The term "probe" refers to any molecule that is capable of selectively binding to a marker of the invention, for example, a nucleotide transcript and/or protein. Probes can be synthesized by one of skill in the art, or derived from appropriate biological preparations. Probes may be specifically designed to be d. Examples of molecules that can be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies, and organic molecules. ed mRNA can be used in hybridization or amplification assays that include, but are not limited to, Southern or Northern analyses, rase chain reaction analyses and probe arrays. One method for the detection of mRNA levels es contacting the isolated mRNA with a nucleic acid le (probe) that can hybridize to the marker mRNA. The nucleic acid probe can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to marker genomic DNA.
In one embodiment, the mRNA is immobilized on a solid surface and contacted with a probe, for example by running the isolated mRNA on an agarose gel and transferring the mRNA from the gel to a membrane, such as nitrocellulose. In an ative embodiment, the probe(s) are immobilized on a solid surface and the mRNA is contacted with the probe(s), for e, in an Affymetrix gene chip array. A skilled artisan can readily adapt known mRNA ion methods for use in detecting the level of marker mRNA.
An alternative method for determining the level of marker mRNA in a sample involves the process of nucleic acid amplification, e.g., by RT—PCR (the experimental embodiment set forth in Mullis, 198?, US. Pat. No. 4,683,202), ligase chain on (Barany (1991) Proc. Natl. Acad. Sci. USA 88:189-193), self sustained sequence replication (Guatelli et a1. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1 173-1177), Q—Beta Replicase (Lizardi et a1. (1988) Bio/Technology 6: 1 197), rolling circle ation (Lizardi et al., US. Pat. No. 5,854,033) or any other nucleic acid amplification method, followed by the detection of the ied molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of c acid molecules if such molecules are present in very low numbers. In particular aspects of the invention, marker sion is assessed by quantitative fluorogenic RT-PCR (5.6., the TaqManTM System). Such methods typically utilize pairs of oligonucleotide primers that are specific for a marker of the invention.
Methods for designing oligonucleotide s specific for a known sequence are well known in the art.
The expression levels of a marker of the invention may be red using a membrane blot (such as used in hybridization analysis such as Northern, Southern, dot, and the like), or microwells, sample tubes, gels, beads or fibers (or any solid support comprising bound nucleic acids). See US. Pat. Nos. 5,770,722, 5,874,219, 5,744,305, 195 and 5,445,934, which are incorporated herein by reference. The detection of marker expression may also comprise using nucleic acid probes in solution.
In one embodiment of the invention, microarrays are used to detect the expression of a marker of the invention. Microarrays are particularly well suited for this purpose e of the reproducibility between ent experiments. DNA microarrays provide one method for the simultaneous ement of the expression levels of large numbers of genes. Each array consists of a reproducible pattern of capture probes attached to a solid support. Labeled RNA or DNA is hybridized to complementary probes on the array and then detected by laser scanning. Hybridization intensities for each probe on the array are determined and converted to a quantitative value representing ve gene expression levels. See, US. Pat. Nos. 6,040,138, 5,800,992 and 6,020,135, 6,033,860, and 6,344,316, which are incorporated herein by reference.
High-density oligonucleotide arrays are particularly useful for determining the gene expression profile for a large number of RNA's in a .
The amounts of marker, andfor a mathematical relationship of the amounts of a marker of the invention may be used to calculate the risk of a toxicity state, e. g., a drug- d toxicity or cardiotoxicity, in a subject being treated with a drug, the efficacy of a treatment regimen for ng, ting or counteracting a toxicity state, and the like, using the methods of the invention, which may include methods of regression analysis known to one of skill in the art. For example, suitable regression models include, but are not limited to CART (e. g., Hill, T, and Lewicki, P. (2006) “STATISTICS Methods and Applications” StatSoft, Tulsa, OK), Cox (e.g., wwwevidence-based—medicine.co.uk), exponential, normal and log normal (e.g., www.obgyn.cam.ac.uk/mrg/statsbook/stsurvan.html), logistic (e.g., www.en.wikipedia.org/wiki/Logistic_regression), parametric, non-parametric, semi- parametric (e.g., cserv.mcmaster.ca/jfox/Books/Companion), linear (e.g., www.en.wikipedia.org/wiki/Linear_regression), or ve (e.g., www,en,wikipedia.org/wikiiGeneralized_additive_model).
In one embodiment, a regression analysis includes the s of marker. In another ment, a regression analysis includes a marker atical relationship.
In yet another embodiment, a regression analysis of the amounts of marker, and/or a marker mathematical relationship may include onal clinical and/or lar co- variates. Such clinical co-variates include, but are not limited to, nodal status, tumor stage, tumor grade, tumor size, treatment regime, e.g., chemotherapy and/or radiation therapy, clinical outcome (e. g., relapse, disease—specific survival, therapy failure), and/or clinical outcome as a function of time after diagnosis, time after initiation of therapy, and/or time after completion of treatment.
VIII. Kits The invention also provides compositions and kits for identifying an agent at risk for causing drug-induced toxicity, e.g., cardiotoxicity, for prognosing a cardiotoxic state, e. g., a drug-induced cardiotoxicity, recurrence of cardiotoxicity, or survival of a t being treated for cardiotoxicity. These kits include one or more of the following: a detectable dy that specifically binds to a marker of the invention, a able antibody that specifically binds to a marker of the invention, reagents for ing and/or preparing subject tissue samples for staining, and instructions for use.
The kits of the invention may optionally comprise additional components useful for performing the methods of the invention. By way of example, the kits may comprise fluids (e.g., SSC buffer) suitable for annealing complementary nucleic acids or for binding an antibody with a protein with which it specifically binds, one or more sample compartments, an instructional material which bes performance of a method of the invention and tissue specific controls/standards.
IX. ing Assays Targets of the invention include, but are not limited to, the genes and/or proteins listed herein. Based on the results of experiments described by Applicants herein, the key proteins modulated in a toxicity state are ated with or can be classified into different pathways or groups of les, ing cytoskeletal ents, transcription factors, apoptotic response, pentose ate pathway, biosynthetic pathway, ive stress (pro—oxidant), membrane alterations, and oxidative phosphorylation metabolism. Accordingly, in one embodiment of the invention, a marker may include one or more genes (or proteins) selected from the markers listed in table 2. In some embodiments, the markers are a combination of at least two, three, four, five, six, seven, eight, nine, ten, eleven, , en, fourteen, fifteen, sixteen, seventeen, eighteen, en, twenty, twenty—five, thirty, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, or more of the foregoing genes (or proteins).
Screening assays useful for identifying modulators of identified markers are described below.
The invention also provides methods (also referred to herein as "screening assays") for identifying modulators, i.e., candidate or test compounds or agents (e.g., proteins, peptides, peptidomimetics, peptoids, small molecules or other drugs), which are useful for treating or preventing a toxicity state by modulating the expression and/or activity of a marker of the invention. Such assays typically se a on between a marker of the invention and one or more assay components. The other ents may be either the test compound itself, or a combination of test compounds and a natural binding partner of a marker of the invention. Compounds identified via assays such as those described herein may be useful, for example, for modulating, e. g., inhibiting, ameliorating, treating, or preventing aggressiveness of a disease state or toxicity state.
The test compounds used in the screening assays of the present invention may be obtained from any available source, ing systematic libraries of natural and/or synthetic nds. Test compounds may also be obtained by any of the numerous ches in combinatorial library methods known in the art, including: biological libraries; peptoid libraries (libraries of molecules having the functionalities of peptides, but with a novel, non-peptide backbone which are resistant to enzymatic degradation but which nevertheless remain bioactive; see, e.g., Zuckermann er al., 1994, J. Med. Chem. 37:2678—85); spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the 'one-bead one-compound' library method; and synthetic library methods using affinity chromatography selection. The biological library and peptoid library approaches are limited to peptide libraries, while the other four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of compounds (Lam, 1997, Anticancer Drug Des. 12: 145).
Examples of methods for the synthesis of molecular libraries can be found in the art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U. SA. 90:6909; Erb et al. (1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994). J. Med.
Chem. 3722678; Cho et al. (1993) Science 261:1303; Carrel] et al. (1994) Angew. Chem.
Int. Ed. Engl. 3322059; Carell et al. ( 1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in Gallop et al. (1994) J. Med. Chem. 3721233.
Libraries of compounds may be presented in solution (e. g., en, 1992, Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips , 1993, Nature 364:555-556), bacteria and/or , r, USP 5,223,409), plasmids (Cull et al, 1992, Proc Natl Acad Sci USA 89: 869) or on phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 1990, e 249:404-406; Cwirla et al, 1990, Proc. Natl. Acad. Sci. 87:6378—6382; Felici, 1991, J. Mol. Biol. 222:301-310; Ladner, supra).
The screening methods of the invention comprise contacting a toxicity state cell with a test compound and determining the ability of the test compound to modulate the expression and/or activity of a marker of the invention in the cell. The sion and/or activity of a marker of the invention can be determined as bed herein.
In r ment, the invention provides assays for screening candidate or test nds which are substrates of a marker of the invention or biologically active portions thereof. In yet another embodiment, the invention provides assays for screening candidate or test compounds which bind to a marker of the invention or ically active portions f. Determining the ability of the test compound to directly bind to a marker can be accomplished, for example, by coupling the compound with a radioisotope or enzymatic label such that g of the compound to the marker can be determined by detecting the labeled marker compound in a complex. For example, compounds (e.g., marker substrates) can be labeled with 131I, 125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope detected by direct counting of radioemission or by scintillation counting. Alternatively, assay components can be enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by determination of conversion of an appropriate substrate to product.
This invention further pertains to novel agents identified by the above-described ing assays. Accordingly, it is within the scope of this invention to further use an agent identified as described herein in an appropriate animal model. For example, an agent capable of modulating the sion and/or activity of a marker of the invention fied as described herein can be used in an animal model to determine the efficacy, toxicity, or side effects of treatment with such an agent. atively, an agent identified as described herein can be used in an animal model to determine the mechanism of action of such an agent. Furthermore, this invention pertains to uses of novel agents identified by the above-described screening assays for treatment as described above. ification 0fthe Invention EXAMPLE 1: Employing Platform Technology to Build Models of Drug Induced toxicity In this example, the platform technology described in detail in international PCT ation No. PCT/U82012/02?615 was employed to integrate data obtained from a custom built drug-induced cardiotoxicity model, and to identify novel proteins/pathways g the pathogenesis/ toxicity of drugs. Relational maps resulting from this analysis have ed drug—induced cardiotoxicity biomarkers.
In the healthy heart contractile function depends on a balance of fatty acid and carbohydrate oxidation. Chronic imbalance in uptake, utilization, organellar biogenesis and secretion in ipose tissue (heart and liver) is thought to be at the center of mitochondrial damage and dysfunction and a key player in drug induced cardiotoxicity.
Here Applicants be a systems approach combining protein and lipid signatures with functional end point assays specifically looking at ar bioenergetics and mitochondrial membrane function. In vitro models comprising diabetic and normal cardiomyocytes supplemented with excessive fatty acid and lycemia were treated with a panel of drugs to create signatures and potential mechanisms of toxicity.
Applicants demonstrated the varied effects of drugs in destabilizing the ondria by disrupting the energy metabolism component at various levels including (i) Dysregulation of transcriptional networks that controls expression of mitochondrial energy metabolism genes; (ii) Induction of GPATl and taffazin in diabetic cardiomyocytes thereby initiating de novo phospholipid synthesis and remodeling in the mitochondrial membrane; and (iii) Altered fate of fatty acid in diabetic cardiomyocytes, influencing , fatty acid oxidation and ATP synthesis. Further, Applicants combined the power of wet lab biology and AI based data mining platform to generate causal network based on an models. Networks of proteins and lipids that are causal for loss of normal cell function were used to discern mechanisms of drug induced toxicity from cellular protective mechanisms. This novel approach will serve as a powerful new tool to understand mechanism of ty while allowing for development of safer therapeutics that correct an altered phenotype.
Human myocytes were subject to conditions simulating an diabetic nment experienced by the disease—relevant cells in vivo. Specifically, the cells were exposed to hyperglycemic conditions and hyperlipidemia conditions. The lycemic condition was induced by culturing cells in media containing 22 mM glucose. The hyperlipidemia ion was induced by culturing the cells in media containing 1mM L—carnitine, 0.?mM Oleic acid and 0.7mM Linoleic acid.
The cell model comprising the above—mentioned cells, wherein the cells were exposed to each condition described above, was additionally “interrogated” by exposing the cells to an “environmental perturbation” by treating with a diabetic drug (T) which is known to cause cardiotoxicity, a rescue molecule (R) or both the diabetic drug and the rescue molecule (T+R). Specifically, the cells were treated with ic drug; or treated with rescue molecule Coenzyme Q10 at 0, 5011M, or lOOuM; or treated with both of the diabetic drug and the rescue molecule Coenzyme Q10.
Cell samples from each condition with each perturbation treatment were collected at various times following treatment, including after 6 hours of treatment. For certain conditions, media samples were also collected and analyzed. iProfiling of changes in total cellular n expression by quantitative proteomics was performed for cell and media samples ted for each condition and with each “environmental perturbation”, i.e, diabetic drug treatment, me Q10 treatment or both, using the ques described above in the detailed description.
Transcriptional profiling experiments were carried out using the Biorad cfx-3 84 amplification system. Following data collection (Ct), the final fold change over control was ined using the 5Ct method as outlined in manufacturer’s protocol.
Lipidomics experiments were carried out using mass spectrometry. Functional assays such as Oxygen consumption rate OCR were measured by employing the Seahorse analyzer essentially as recommended by the cturer. OCR was recorded by the electrodes in a 7 ul chamber created with the cartridge pushing against the seahorse culture plate.
As shown in Figure 20, transcriptional network and expression of human mitochondrial energy metabolism genes in diabetic cardiomyocytes (cardiomyocytes conditioned in hyperglycemic and hyperlipidemia) were compared between perturbed and urbed treatments. Specifically, data of transcriptional network and expression of human mitochondrial energy lism genes were ed between diabetic cardiomyocytes treated with diabetic drug (T) and untreated diabetic cardiomyocytes samples (UT). Data of Transcriptional network and expression of human mitochondrial energy metabolism genes were compared n diabetic cardiomyocytes treated with both ic drug and rescue molecule Coenzyme Q10 (T+R) and untreated diabetic cardiomyocytes samples (UT). Comparing to data from untreated diabetic cardiomyocytes, certain genes expression and transcription were altered when diabetic cardiomyocytes were treated with diabetic drug. Rescue molecule Coenzyme Q10 was demonstrated to reverse the toxic effect of diabetic drug and ize gene expression and transcription.
As shown in Figure 21A, cardiomyocytes were ed either in normoglycemia (NG) or hyperglygemia (HG) condition and treated with either diabetic drug alone (T) or with both diabetic drug and rescue molecule Coenzyme Q10 (T+R) . Protein expression levels of GPATl and TAZ for each condition and each treatment were tested with western blotting. Both GPATl and TAZ were upregulated in hyperglycemia conditioned and diabetic drug treated cardiomyocytes. When hyperglycemia conditioned cardiomyocytes were treated with both diabetic drug and rescue molecule Coenzyme Q10, the upregulated protein expression level of GPATl and TAZ were normalized.
As shown in Figure 22A, mitochondrial oxygen consumption rate (%) experiments were carried out for hyperglycemia conditioned cardiomyocytes samples. lycemia conditioned cardiomyocytes were either untreated (UT), treated with diabetic drug Tl which is known to cause cardiotoxicity, d with diabetic drug T2 which is known to cause cardiotoxicity, d with both diabetic drug T1 and rescue molecule me Q10 (Tl+R), or treated with both diabetic drug T2 and rescue molecule Coenzyme Q10 (T2+R). Comparing to untreated control samples, ondrial OCR was decreased when hyperglycemia conditioned cardiomyocytes were treated with diabetic drug T1 or T2. However, mitochondrial OCR was ized when hyperglycemia conditioned cardiomyocytes were treated with both ic drug and rescue molecule Coenzyme Q10 (T1 + R, or T2 + R).
As shown in Figure 22B, mitochondria ATP sis ments were carried out for hyperglycemia conditioned cardiomyocytes samples. Hyperglycemia conditioned cardiomyocytes were either untreated (UT), treated with a diabetic drug (T), or treated with both diabetic drug and rescue molecule Coenzyme Q10 (T+R).
Comparing to untreated control samples, mitochondrial ATP synthesis was repressed when hyperglycemia conditioned cardiomyocytes were treated with diabetic drug (T).
As shown in Figure 23, based on the collected proteomic data, proteins down regulated by drug treatment were annotated with G0 terms. Proteins involved in mitochondrial energy metabolism were down regulated when hyperglycemia conditioned cardiomyocytes were treated with a diabetic drug which is known to cause cardiotoxicity.
Proteomics, lipidomics, transcriptional profiling, functional , and western blotting data collected for each condition and with each perturbation, were then processed by the REFSTM system. Composite perturbed networks were ted from combined data obtained from one ic condition (e. g., hyperglycemia, or hyperlipidemia) exposed to each perturbation (e.g., ic drug, CleO, or both). ite unperturbed networks were generated from combined data obtained from the same one specific condition (e.g., hyperglycemia, or hyperlipidemia), without perturbation (untreated). Similarly, composite perturbed networks were generated from ed data obtained for a second, control condition (e. g., normal glycemia) exposed to each perturbation (e. g., diabetic drug, CleO, or both). Composite unperturbed ks were generated from combined data obtained from the same second, control condition (e. g., normal glycemia), without perturbation (untreated).
Each node in the consensus composite networks described above was simulated (by increasing or decreasing by 10—fold) to generate simulation networks using REFSTM, as described in detail above in the detailed description.
The area under the curve and fold changes for each edge ting a parent node to a child node in the tion networks were extracted by a custom-built program using the R mming ge, where the R programming ge is an open source re environment for tical computing and graphics.
Delta networks were generated from the simulated composite networks. To generate a drug induced cardiotoxicity condition vs. normal condition differential network in response to the diabetic drug (delt network), steps of ison were performed as illustrated in Figure 24, by a custom built program using the PERL programming language.
Specifically, as shown in Figure 24, Untreated refers to protein expression networks of untreated control cardiomyocytes in hyperglycemia condition. Drug refers to protein expression networks of diabetic drug d cardiomyocytes in hyperglycemia condition. Unique edges from Drug in the Drug fl Untreated delta network are presented in Figure 25.
Specifically, a simulated composite map of untreated cardiomyocytes in hyperglycemia condition and a simulated composite map of diabetic drug treated cardiomyocytes in lycemia condition were compared using a custom-made Perl program to generate unique edges of the diabetic drug treated cardiomyocytes in hyperglycemia condition. Output from the PERL and R programs were input into Cytoscape, an open source program, to generate a visual representation of the delta network. As shown in Figure 25, the network represents delta ks that are driven by the ic drug versus untreated in cardiomyocytes/ cardiotox models in hyperglycemia condition.
From the drug induced toxicity condition vs. normal ion differential network shown in Figure 25, proteins were identified which drive pathophysiology of drug induced toxicity, such as GRP78, GRP75, TIMPl, PTX3, HSP76, PDIA4, PDIAl, CA2Dl. These proteins can function as biomarkers for fication of other cardiotoxicity inducing drugs. These ns can also function as biomarkers for identification of agents which can alleviate cardiotoxicity.
The experiments described in this Example demonstrate that perturbed membrane biology and altered fate of free fatty acid in diabetic cardiomyocytes exposed to drug ent represent the center piece of drug induced toxicity. Data integration and network biology have allowed for an enhanced understanding of cardiotoxicity, and fication of novel kers predictive for cardiotoxicity.
EXAMPLE 2: Employing Models of Drug Induced Cardiotoxicity to Identify Additional Markers of Cardiotoxicity The platform technology described above in Example 1 was similarly employed to integrate further data obtained from the same custom built cardiotoxicity model. Five t cardiomyocyte lines were used to create a model of cardiotoxicity as ned in the above-detailed description. The five cardiomyocyte lines were then subjected to a mitochondrial ATP assay to assay for mitochondrial dysfunction imposed by drug treatment or absence there of (as indicated as + and —) under diabetic conditions (hyperglycemia) and normal conditions (normoglycemia). A reduction of mitochondrial ATP was observed under diabetic conditions upon drug treatment in only 2 out of the 5 cardiotoxicity model (see Figure 30). The results of these further experiments lead to the identification of additional novel proteins/pathways driving the pathogenesis of cardiotoxicity of drugs, as summarized in Figures 26-34.
The causal ction network identified several novel biomarkers and potential therapeutic targets for drug-induced cardiotoxicity. Relational maps resulting from this analysis as shown in Figures 28, 29, 31-33 have provided additional drug-induced cardiotoxicity biomarkers, which are listed below in Table 2. These biomarkers may be used for predicting drug-induced cardiotoxicity of a drug, for diagnosis/prognosis of drug-induced cardiotoxicity, and for identifying a rescue agent which can reduce or alleviate drug-induced cardiotoxicity.
Table 2: kers identified by the Interrogative Biology ery rm lA69, lCl7, ACBD3, ACLY, ACTR2, ANXA6, ANXA7, AP2Al, ARCNl, ASNAl, , ATP5A, ATPSB, ATPSD, ATP5Fl, ATP5H, ATPIFl, BSG, Cl4orfl66, CA2D1, CAPNl, CAPZA2, CARS, CCDC22, CCDC47, CCT7, CLIC4, CMPKl, CNN2, COlA2, CO6Al, COTLl, COX6Bl, CRTAP, CSOlO, CTSA, CTSB, CYBS, DDXl, DDXl7, DDXl8, DLD, EDIL3, EHD2, , ENO2, EPHXl, ETFA, FERMT2, FINC, FKBlO, FKBP2, FLNC, G3BP2, GOLGA3, GPATl, GPSN2, GRP75, GRP78, HSPAlA, HSPA4, HSPA9, IBP7, IDHl, IQGAPl, ITBl, ITGBl, EARS, KIF5B, KPNA3, KPNBl, LAMCl, , LMO7, M6PRBPl, MACFl, MAPlB, MARS, MDHl, MPRl, MTHFDl, MYHlO, NCL, NHP2Ll, NUCB1,0LA1, P08621, P3H1, P4HA2, P4HB, SEC61A1 (P61619),PA11, PAPSS2, PCBP2, PDCD6, PDIAl, PDIA3, PDIA3, PDIA4, PDLIM7, PEBPl, PFKM, PH4B, PLIN2, POFUTl, PRKDC, PSMAl, PSMA7, PSMD12, PSMD3, PSMD4, PSMD6, PSME2, PTBPl, PTX3, Q9BQE5, , RABlB, RP515A, RPL32, RPL?A, RPL8, RPS25, RPS6, RRAS2, RRPl, SARlB, SDHA, SENPl, SEPTI 1, SEPT7, SERPH, SERPINEl, SFRS2, S SYNCRIP, TAGLN, TAZ, TGM2, TIMPl, TLNl, TPM4, TRAPl, TSPl, TTLL12, TXNDClZ, UBAlC, UGDH, UGP2, UQCRH, VAMP3, VAPA In one embodiment, a panel of two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen markers selected from a group consisting TIMPl, PTX3, HSP76, FINC, CYB5, PAIl, IBP7 (IGFBP7), 1C17, EDIL3, HMOXl, NUCB1, C8010, HSPA4 can be used for ting drug—induced cardiotoxicity of a drug, for diagnosis/prognosis of drug-induced toxicity, for identifying a rescue agent which can reduce or alleviate drug-induced cardiotoxicity.
Among the markers listed in Table 2, PTX3, PAIl, IBP7 (IGFBP7) have been reported as markers of cardiomyopathy previously. GRP78 and PDIA3 have been ed as serving important indications of ER stress and hypoxic insult. The fact that these markers have been identified by the above—descriped rm logy for drug- induced cardiotoxicity, have validated this platform technology for probing novel drug- induced cardiotoxicity biomarkers.
The sDNA sequences of the markers listed in Table 2 are set forth in Appendix A, and are known in the art.
Incorporation by Reference The contents of all cited references (including literature references, s, patent applications, and es) that maybe cited throughout this application are hereby expressly incorporated by reference in their entirety, as are the references cited therein. The practice of the present invention will employ, unless otherwise indicated, conventional ques of protein formulation, which are well known in the art.
Equivalents The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be ered in all respects illustrative rather than limiting of the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes that come within the meaning and range of equivalency of the claims are therefore ed to be embraced herein.
Appendix A Official Symbol: HSPAS Official Name: heat shock 70kDa protein 5 (glucose-regulated protein, 78kDa) m: 3309 Organism: Homo sapiens Other Aliases: BIP; MIF2; GRP78 Other Designations: 78 kDa glucose-regulated protein; asmic reticulum lumenal Ca(2+)-binding protein grp78; immunoglobulin heavy chain-binding protein Nucleotide ce: NCBI Reference Seguence: NM_005347.4 LOCUS NM_005347 ACCESSION NM_005347 l gggctggggg tata agccgagtag gcgacggtga cgcc ggccaagaca 6; gcacagacag attgacctat tggggtgttt cgcgagtgtg agagggaagc gccgcggcct 12; gtatttctag acctgccctt Cgcctggttc gtggcgcctt ccgg gcccctgccg 18; cctgcaagtc ggaaattgcg ctgtgctcct gtgctacggc ctgtggctgg tgct 24; gctgcccaac tggctggcaa gatgaagctc tccctggtgg ccgcgatgct gctgctgctc ; agcgcggcgc gggccgagga caag aaggaggacg tgggcacggt ggtcggcatc 36; ggga ccacctactc ctgcgtcggc gtgttcaaga acggccgcgt ggagatcatc 42; gatc agggcaaccg catcacgccg tcctatgtcg ccttcactcc tgaaggggaa 48; cgtctgattg gcgatgccgc caagaaccag ctcacctcca accccgagaa cacggtcttt 54; gacgccaagc tcgg ccgcacgtgg aatgacccgt ctgtgcagca ggacatcaag 60; ttcttgccgt tcaaggtggt tgaaaagaaa actaaaccat acattcaagt tgatattgga 66L ggtgggcaaa caaagacatt tgctcctgaa gaaatttctg ccatggttct cactaaaatg 72; aaagaaaccg ctgaggctta tttgggaaag aaggttaccc atgcagttgt tactgtacca 78L gcctatttta atgatgccca acgccaagca accaaagacg ctat tgctggccta 84; aatgttatga ggatcatcaa cgagcctacg gcagctgcta ttgcttatgg cctggataag 90; agggaggggg agaagaacat cctggtgttt gacctgggtg gcggaacctt cgatgtgtct 96; cttctcacca ttgacaatgg tgtcttcgaa gttgtggcca ctaatggaga tactcatctg L02; ggtggagaag actttgacca gcgtgtcatg gaacacttca tcaaactgta caaaaagaag L08; acgggcaaag atgtcaggaa agacaataga gctgtgcaga aactccggcg cgaggtagaa L14; aaggccaaac gggccctgtc ttctcagcat caagcaagaa ttgaaattga gtccttctat L20; gaaggagaag acttttctga gaccctgact cgggccaaat agct caacatggat L26; ctgttccggt tgaa gcccgtccag aaagtgttgg aagattctga tttgaagaag L32; tctgatattg atgaaattgt tcttgttggt ggctcgactc gaattccaaa agttcttcaa tggcaaggaa ccatcccgtg gcataaaccc gtgctgctgt ccaggctggt gtgctctctg gtgatcaaga ctggtactgc ttgatgtatg tccccttaca cttggtattg aaactgtggg aggtgtcatg L56; accaaactga ttccaaggaa cacagtggtg cctaccaaga agtctcagat gcttctgata atcaaccaac tgttacaatc aaggtctatg aaggtgaaag acccctgaca L68; aaagacaatc atcttctggg tacatttgat ctgactggaa ttcctcctgc L74; gtcccacaga ttgaagtcac ctttgagata gatgtgaatg gtattcttcg agtgacagct L80; gaagacaagg gtacagggaa caaaaataag atcacaatca ccaatgacca cctg L86; acacctgaag aaatcgaaag gatggttaat gatgctgaga agtttgctga ggaagacaaa L92; aagctcaagg agcgcattga tactagaaat gagttggaaa gctatgccta aaag L98; aatcagattg gagataaaga aaagctggga ggtaaacttt cctctgaaga taaggagacc 204; atggaaaaag ctgtagaaga aaagattgaa tggctggaaa aaga catt 210; gaagacttca aagctaagaa actg attg ttcaaccaat tatcagcaaa 216; ctctatggaa gtgcaggccc tcccccaact gagg atacagcaga aaaagatgag 222; ttgtagacac tgatctgcta gtgctgtaat attgtaaata ctggactcag gaacttttgt 228; taggaaaaaa ttgaaagaac ttaagtctcg aatgtaattg gaatcttcac ctcagagtgg 234; agttgaaact gctatagcct aagcggctgt ttactgcttt tcattagcag ttgctcacat 240; gtctttgggt gaga agaagaattg gccatcttaa aaagcgggta aaaaacctgg 2461 gttagggtgt gtgttcacct tcaaaatgtt ctatttaaca actgggtcat ctgg 252; tgtaggaagt tttttctacc ataagtgaca ccaataaatg attt acactggtct 258; aatgtttgtg agaagcttct aattagatca attacttatt ttaggaaatt taga 264; tactcgtgtg gagg ggagggagta tttggtatgt tgggataagg ttct 270; atttaatgct tccagggatt tttttttttt tttttaaccc tcctgggccc aagtgatcct 276; tccacctcag tctcccagct aattgagacc acaggcttgt taccaccatg ctcggctttt 282; gcattaatct aagaaaaggg gagagaagtt aatccacatc tttactcagg caaggggcat 288; ttcacagtgc ccaagagtgg ggttttcttg aacatacttg gtttcctatt tccccttatc 294; tttctaaaac tgcctttctg gtggcttttt ttaaaattat tactaatgat gcttttatag 300; ggat tctctgagaa atgatgggga gtgagtgatc actggtatta actttataca 306; cttggatttc atttgtaact ttaggatgta aaggtatatt gtgaacccta gctgtgtcag 3121 aatctccatc cctgaaattt ctcattagtg gtactggggt gggatcttgg atggtgacat 3181 tgaaactaca ctaaatcccc tcactatgaa tgggttgtta aaggcaatgg tttgtgtcaa 3241 aactggttta ggattactta gttc aaaa gagtccaggt aaatggtatg 330; atcaataaag gacaggctgg tgctaacata aaatccaata ttgtaatcct agcactttgg 336; gaggccaagg nggtggatc acaaggtcaa gagatagaga ccatctttgc caacatggtg 342; aaactccatc tctactgaaa atacaaaaat tagctgggcg tggtagtgca agctgaaggc 348; tgaggcagga ctcg aacccgggag gcagaggttg cagtgagccg cacc 354; ctcc agcccggcac tccagcctgg cgacaagagt gagactccac aaaa 360; aaaaaaagaa tccaatactg gata ggtattttat agatgggcaa ctggctgaaa 366; ggttaattct ctagggctag tagaactgga cacc aaactcttaa ttagacctag 372; gcctcagctg cactgcccga aaagcatttg ggcagaccct gagcagaata ctggtctcag 378; gccaagccca ccat taaagatgac ctacagtgct gtgtaccctg gggcaatagg 384; gttaaatggt agttagcaac tagggctagt cttcccttac ctcaaaggct ctcactaccg 390; tggaccacct agtctgtaac tctttctgag gagctgttac tgaatattaa aaagatagac 396; ttcaactatg aaa Protein seguence: NCBI Reference Seguence: NP_005338.1 LOCUS N P_005338 ACCESSION NP_005338 l mklslvaaml lllsaaraee edkkedvgtv vgidlgttys cvgvfkngrv eiiandqgnr 61 itpsyvaftp egerligdaa knqltsnpen tvfdakrlig rtwndpquq dikflpfkvv 121 ekktkpyiqv digggqtktf apeeisamvl tkmketaeay lgkkvthavv tvpayfndaq 181 agti aglnvmriin eptaaaiayg ldkregekni ggtf dvslltidng 24; vfevvatngd thlggedqu rvmehfikly kkktgkdvrk dnravqklrr evekakraLs ; sqhqarieie sfyegedfse tltrakfeel nmdlfrstmk pvquledsd lkksdideiv 36; lvggstripk effn gkepsrginp deavaygaav qagvlsgdqd tgdlvlldvc 42; pltlgietvg gvmtkliprn tvvptkksqi fstasdnqpt vtikvyeger pltkdnhng 48; tfdltgippa prgquievt feidvngilr vtaedkgtgn tndq eier 54; mvndaekfae erid trnelesyay slknqigdke klggklssed ketmekavee 60- kiewleshqd adiedfkakk keleeivqpi isklygsagp pptgeedtae kdel Grp75 Official : HSPA9 al Name: heat shock 70kDa protein 9 (mortalin) Gene ID: 3313 Organism: Homo sapiens Other Aliases: CSA; MOT; MOT2; GRP75; PBP74; GRP-75; HSPAQB; MTHSP75 Other Designations: 75 kDa glucose—regulated protein; heat shock 70kD protein QB; mortalin, perinuclear; mortalin-2; rtalin; peptide-binding protein 74; stress-70 protein, mitochondrial Nucleotide seguence: NCBI Reference Seguence: NM_004134.6 LOCUS NM_004134 ACCESSION NM_004134 l ttcctcccct ggactctttc tgagctcaga gccgccgcag ccgggacagg agggcaggct 6; ttctccaacc atcatgctgc ggagcatatt acctgtacgc cctggctccg ggagcggcag 12; atcc tctggtcagg ngCgnggC tcag cggaagagcg ggcctctggg l8; ccgcagtgac caacccccgc ccctcacccc acgtggttgg ccag tgcc 24L gccaccgcat cgcgcagctc tttgccgtcg gagcgcttgt ttgctgcctc gtactcctcc ; atttatccgc catgataagt gccagccgag ctgcagcagc ccgtctcgtg ggcgccgcag 36; cctcccgggg ccctacggcc gcccgccacc aggatagctg gaatggcctt agtcatgagg 42; cttttagact aagg cgggattatg catcagaagc aatcaaggga gcagttgttg 48; gtattgattt gggtactacc aactcctgcg tggcagttat ggaaggtaaa caagcaaagg 54; tgctggagaa aggt gccagaacca ccccttcagt tgtggccttt gatg 60; gtgagcgact tgttggaatg ccggccaagc ctgt caccaaccca aacaatacat 66; tttatgctac caagcgtctc attggccggc gatatgatga tcctgaagta cagaaagaca 72; ttaaaaatgt tccctttaaa attgtccgtg cctccaatgg tgatgcctgg gttgaggctc 78; aatt gtattctccg agtcagattg ttgt gttgatgaag atgaaagaga 84L ctgcagaaaa ttacttgggg cacacagcaa aaaatgctgt gatcacagtc ccagcttatt 90; tcaatgactc gcagagacag gccactaaag atgctggcca gatatctgga ctgaatgtgc 96L ttcgggtgat taatgagccc acagctgctg ctcttgccta agac aaatcagaag 102; acaaagtcat tgctgtatat gatttaggtg gtggaacttt tgatatttct ttcagaaagg agtatttgag gtgaaatcca caaatgggga taccttctta actttgacca ggccttgcta cggcacattg tgaaggagtt caagagagag acaggggttg L20; atttgactaa agacaacatg gcacttcaga gggtacggga agctgctgaa aaggctaaat L26; gtgaactctc ctcatctgtg cagactgaca tcaatttgcc ctatcttaca atggattctt L32; ctggacccaa gcatttgaat atgaagttga cccgtgctca atttgaaggg actg L38; atctaatcag aaggactatc tgcc aaaaagctat gcaagatgca gaagtcagca L44; agagtgacat aggagaagtg attcttgtgg gtggcatgac gccc aaggttcagc L50; agactgtaca ggatcttttt ggcagagccc caagtaaagc tgtcaatcct gatgaggctg L56; tggccattgg agctgccatt cagggaggtg tgttggccgg cgatgtcacg gatgtgctgc L62; tccttgatgt cactcccctg tctctgggta ttgaaactct aggaggtgtc aaac L68; ttattaatag gaataccact attccaacca gcca ggtattctct actgccgctg L74; atggtcaaac gcaagtggaa attaaagtgt gtcagggtga aagagagatg gctggagaca L80; acaaactcct tggacagttt actttgattg gaattccacc agcccctcgt ggagttcctc L86; agattgaagt tacatttgac gcca atgggatagt acatgtttct gctaaagata L92: aaggcacagg acgtgagcag cagattgtaa tccagtcttc tggtggatta agcaaagatg L981 atattgaaaa tatggttaaa aatgcagaga ctga agaagaccgg aagg 204; aacgagttga agcagttaat atggctgaag gaatcattca cgacacagaa accaagatgg 210; aagaattcaa ggaccaatta cctgctgatg agtgcaacaa gctgaaagaa tcca 216; aaatgaggga gctcctggct gaca gcgaaacagg agaaaatatt agacaggcag 222; catcctctct tcagcaggca tcactgaagc tgttcgaaat ggcatacaaa aagatggcat 228; ctgagcgaga aggctctgga agttctggca ctggggaaca aaaggaagat caaaaggagg 234; aaaaacagta ataatagcag aaattttgaa gccagaagga caacatatga agcttaggag 240; gact tcctgagcag aaatgggcga acttcagtct ttttactgtg tttttgcagt 246; attctatata taatttcctt aatttgtaaa tttagtgacc tagt gatcatttaa 252; tggacagtga ttctaacagt ataaagttca caatattcta tgtccctagc ctgtcatttt 258; tcagctgcat gtaaaaggag gtaggatgaa ttgatcatta taaagattta actattttat 2641 gctgaagtga ccatattttc aaggggtgaa accatctcgc caat gaaggtagtc 2701 atccatagac ttgaaatgag tatg gggatgagat ccttctagtt agcctagtac 2761 tgctgtactg gcctgtatgt acatggggtc cttcaactga ggccttgcaa gtcaagctgg 2821 ctgtgccatg gatg gggcagagga atctagaaca atgggaaact ttta 288; tattaggtac agctattaaa acaaggtagg aatgaggcta gacctttaac ttccctaagg 294; catacttttc tagctacctt ctgccctgtg acct acatccttga tgattgttct 300; cttacccatt ctggaatttt ttttttttta taca gaaagcatct tgatctcttg 306; tttgtgaggg gtgatgccct gagatttagc ttcaagaata tgccatggct catgcttccc 312; atatttccca aagagggaaa tacaggattt gctaacactg aatg caaattcaag 318; aagg gctgttataa tgaaataatg agcagtatca gcatgtgcaa atcttgtttg 324; aaggatttta cccc ttagaccttt ggtacattta gaatcttgaa agtttctaga 330; tctctaacat gaaagtttct agatctctaa catgaaagtt tttagatctc taacatgaaa 336; accaaggtgg ctattttcag gttgctttca gctccaagta gaaataacca gaattggctt 342; acattaaaga aactgcatct agaaataagt cctaagatac tatg gctcaaaaat 348; aaaaggaacc cagatttctt tcccta Protein seguence: NCBI Reference ce: NP_004125.3 LOCUS N P_0041 25 ACCESSION N P_004125 1 misasraaaa rlvgaaasrg ptaarhquw nglsheafrl vsrrdyasea ikgavvgidl 61 gttnscvavm egkqakvlen aegarttpsv vaftadgerl vgmpaquav tnpnntfyat 121 krligrrydd pevqkdiknv pfkivrasng dawveahgkl yspsqigafv taen 181 ylghtaknav itvpayfnds qrqatkdagq isglnvlrvi alay gldksedkvi 24; ggtf disileiqu vfevkstngd tflggedqu allrhivkef kretgvdltk ; dnmalqrvre aaekakcels squtdinlp yltmdssgpk hlnmkltraq fegivtdlir 36; rtiapcham ksdi gevilvggmt tvq dlfgrapska vnpdeavaig 42; aaiqggvlag dvtdvllldv tplslgietl ggvftklinr nttiptkksq vfstaadgqt 48; qveikchge remagdnkll gqftligipp aprgquiev tfdidangiv hvsakdkgtg 54; reqqiviqss gglskddien mvknaekyae edrrkkerve avnmaegiih eefk 60- dqlpadecnk lkeeiskmre llarkdsetg assl qqaslklfem aykkmasere 66; gsgssgtgeq kequeekq TIMP1 Official Symbol: TIMP1 Official Name: TIMP metallopeptidase inhibitor 1 m: Gene ID: 7076 Organism: Homo sapiens Other Aliases: RP1-230G1.3, CLGI, EPA, EPO, HCI, TIMP Other Designations: TIMP-1;collagenase inhibitor; erythroid potentiating activity; erythroid-potentiating activity; fibroblast collagenase inhibitor; metalloproteinase inhibitor 1; tissue inhibitor of metalloproteinases 1 Nucleotide ce: NCBI Reference Seguence: 254.2 LOCUS NM_003254 ACCESSION NM_003254 l tttcgtcggc ccgccccttg gcttctgcac tgatggtggg tggatgagta atgcatccag 6; gaagcctgga ggcctgtggt ttccgcaccc gctgccaccc ccgcccctag cgtggacatt 12; tatcctctag cgctcaggcc ctgccgccat cgccgcagat ccca gagagacacc l8; agagaaccca ccatggcccc ctttgagccc ctggcttctg gcatcctgtt gttgctgtgg 24; gccc ccagcagggc ctgcacctgt gtcccacccc acccacagac ggccttctgc ; aattccgacc tcgtcatcag ggccaagttc gtggggacac cagaagtcaa ccagaccacc 36; ttataccagc agat caagatgacc aagatgtata aagggttcca aggg 42; gatgccgctg acatccggtt cgtctacacc cccgccatgg agagtgtctg cggatacttc 48; tccc acaaccgcag cgaggagttt gctg gaaaactgca ggatggactc 54; ttgcacatca ctacctgcag ttttgtggct ccctggaaca gcctgagctt agctcagcgc 60; cggggcttca ccaagaccta cactgttggc tgtgaggaat gcacagtgtt tccctgttta 66; tccatcccct gcaaactgca gagtggcact cattgcttgt ggacggacca gctcctccaa 72; ggctctgaaa agggcttcca gtcccgtcac cttgcctgcc tgcctcggga gccagggctg 78; tgcacctggc agtccctgcg gtcccagata gcctgaatcc tgcccggagt ggaagctgaa 84; gcctgcacag tgtccaccct gttcccactc ccatctttct tccggacaat gaaataaaga 90- gttaccaccc agcagaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_003245.1 LOCUS 245 ACCESSION NP_003245 1 lasg liap sractcvpph pqtafcnsdl virakfvgtp lyqr 61 yeikmtkmyk daad pame svcgyfhrsh nrseefliag qudgllhit 121 tcsfvapwns lslaqrrgft ktytvgceec sipc qusgthclw tdqllqgsek 181 gqurhlacl prepglctwq slrsqia PTX3 al Sym bol: PTX3 Official Name: pentraxin 3, long Gene ID: 5806 Organism: Homo sapiens Other Aliases: TN FAIP5, TSG-14 Other Designations: TNF alpha-induced protein 5; pentaxin-related gene, rapidly induced by lL-1 beta, tumor necrosis factor, alpha-induced protein 5; pentaxin- related protein PTX3; pentraxin-3; xin-related gene, rapidly induced by IL- 1 beta; pentraxin-related protein PTX3; tumor necrosis factor induced protein 5; tumor necrosis factor, alpha-induced protein 5; tumor necrosis factor- inducible gene 14 protein; tumor necrosis factor-inducible protein TSG-14 Nucleotide seguence: NCBI Reference Seguence: NM_002852.3 LOCUS NM_002852 ACCESSION NM_002852 l attcatcccc attcaggctt tcctcagcat ttattaagga ctctctgctc cagcctctca 6; ctctcactct cctccgctca aactcagctc acttgagagt ctcctcccgc cagctgtgga 12; aagaactttg cgtctctcca gcaatgcatc tccttgcgat tctgttttgt gctctctggt 18; ctgcagtgtt ggccgagaac tcggatgatt atgatctcat gtatgtgaat ttggacaacg 24; aaatagacaa tggactccat cccactgagg accccacgcc gtgcgcctgc ggtcaggagc ; actcggaatg ggacaagctc ttcatcatgc tggagaactc gcagatgaga gagcgcatgc 36L tgctgcaagc cacggacgac gtcctgcggg gcgagctgca gaggctgcgg gaggagctgg 42L gccggctcgc ggaaagcctg gcgaggccgt gcgcgccggg ggctcccgca gaggccaggc 48L tgaccagtgc cgag ctgctgcagg cgacccgcga cgcgggccgc aggctggcgc 54L gtatggaggg cgcggaggcg cagcgcccag aggaggcggg gcgcgccctg gccgcggtgc 60; tagaggagct gcggcagacg cgagccgacc tgcacgcggt ctgg gctgcccgga 66; gctggctgcc ggcaggttgt gaaacagcta ttttattccc aatgcgttcc aagaagattt 72; ttggaagcgt gcatccagtg agaccaatga ggcttgagtc ttttagtgcc tgcatttggg 78; tcaaagccac agatgtatta aacaaaacca tttc ctatggcaca aagaggaatc 84; catatgaaat ccagctgtat tacc aatccatagt gtttgtggtg ggtggagagg 90; agaacaaact tgaa gccatggttt ccctgggaag gtggacccac ctgtgcggca 96; cctggaattc agaggaaggg ctcacatcct tgtgggtaaa tggtgaactg acca L02; agat ggccacaggt gttc ctgagggagg gcag attggccaag L08; aaaagaatgg tgtg ggtggtggct ttgatgaaac attagccttc tctgggagac L14; tcacaggctt caatatctgg gatagtgttc ttagcaatga agagataaga gagaccggag L20; gagcagagtc ttgtcacatc nggggaata ttgttgggtg gggagtcaca gagatccagc L26; cacatggagg agctcagtat taaa tgttgtgaaa ctccacttga agccaaagaa L32; agaaactcac acttaaaaca catgccagtt gggaaggtct gaaaactcag atag L38; gaacacttga gactaatgaa agtt gagaccaatc tttatttgta ctggccaaat L44; actgaataaa cagttgaagg aaagacattg gaaaaagctt ttgaggataa tggtgctttc agtttaatgc tctg tcagataaac tgtt aggg acaattgttt tacttttctt L62; tgttttgqcc agagatgaat tttacattgg aagaataaca aaataagatt tgttgtccat L68; ttgt tattggtatg taccttatta caaaaaaaag atgaaaacat atttatacta L74; caaggtgact taacaactat aaatgtagtt tatgtgttat aatcgaatgt cacgtttttg 1801 agaagatagt catataagtt caaa agggatttgt attaatttaa gactattttt 1861 gtaaagctct actgtaaata aaatatttta taaaactagc tcacgtcatt taattataaa 1921 agat gttttggaaa aaaaaaaaaa aaaaa Protein seguence: NCBI Reference ce: NP_002843.2 LOCUS N P_002843 ACCESSION NP_002843 l mhllailfca 1wsavlaens ddydlmyvnl dneidnglhp cacg qehsewdklf 6- imlensqmre rmllqatddv lrgelqure elgrlaesla rpcapgapae arltsaldel 12; agrr larmegaeaq rpeeagrala avleequtr adlhavqgwa arswlpagce 18; tailfpmrsk kifgsvhpvr pmrlesfsac ikaatdvln ktilfsygtk rnpyeiqul 24; syqsivavg geenklvaea mvslgrwthl cgtwnseegl tslwvngela atgh ; ivpeggilqi gqekngccvg ggfdetlafs grltgfniwd svlsneeire tggaeschir 361 gnivgwgvte iqphggaqu HSP76 Official Sym bol: HSPA6 Official Name: heat shock 70kDa n 6 B') Gene ID: 3310 Organism: Homo sapiens Othe r Aliases: Other Designations: heat shock 70 kDa protein 6; heat shock 70 kDa protein 8'; heat shock 70kD protein 6 (HSP7OB') Nucleotide seguence: NCBI Reference Seguence: NM_002155.3 LOCUS NM_0021 55 ACCESSION NM_002155 1 agagccagcc cggaggagct agaaccttcc ttct ttcagcagcc tgagtcagag 61 gcgggctggc ctggcgtagc cgcccagcct cgcggctcat gccccgatct gcccgaacct 12; tctcccgggg tcagcgccgc gccgcgccac ccggctgagt cagcccgggc gggcgagagg 18; ctctcaactg ggcgggaagg tgcgggaagg tgcggaaagg ttcgcgaaag ttcgcggcgg 24; nggggtcgg gtgaggcgca aaaggataaa aagcccgtgg aagcggagct gagcagatcc ; gagccgggct ggctgcagag aaaccgcagg gagagcctca ctgctgagcg gacg 36; gcggagcggc agcagcctcc gtggcctcca gcatccgaca agaagcttca cagg 42; ccccacggga ggtg ggcatcgacc tgggcaccac ctactcgtgc gtgggcgtgt 48L aggg ccgcgtggag atcctggcca aggg caaccgcacc acgcccagct 54L acgtggcctt caccgacacc gagcggctgg tcggggacgc ggccaagagc caggcggccc 60; tgaaccccca caacaccgtg ttcgatgcca agcggctgat cgggcgcaag gaca 66L ccacggtgca catg aagcactggc ccttccgggt ggtgagcgag ggcggcaagc 72; ccaaggtgcg ctac cgcggggagg acaagacgtt ctaccccgag gagatctcgt 78; ccatggtgct gagcaagatg acgg ccgaggcgta cctgggccag cccgtgaagc 84; acgcagtgat caccgtgccc gcctatttca atgactcgca gcgccaggcc accaaggacg 90; nggggccat Cgcggggctc aacgtgttgc ggatcatcaa tgagcccacg gcagctgcca 96; tcgcctatgg ccgg cggggcgcgg gagagcgcaa cgtgctcatt tttgacctgg L02; gtgggggcac cttcgatgtg tcggttctct ccattgacgc tggtgtcttt gaggtgaaag L08; ctgg agatacccac ctgggaggag aggacttcga caaccggctc gtgaaccact L14; tcatggaaga attccggcgg aagcatggga aggacctgag cgggaacaag cgtgccctgc L20; gcaggctgcg cacagcctgt gagcgcgcca agcgcaccct gtcctccagc gcca L26; ccctggagat agactccctg ttcgagggcg tcta cacgtccatc actcgtgccc L32; gctttgagga actgtgctca gacctcttcc gcagcaccct ggagccggtg gagaaggccc L38; tgcgggatgc caagctggac aaggcccaga ttcatgacgt cgtcctggtg tcca L44; ctcgcatccc caaggtgcag aagttgctgc aggacttctt caacggcaag gagctgaaca L50; agagcatcaa ccctgatgag gctgtggcct atggggctgc tgtgcaggcg tgggggacaa gaaa gtgcaggatc tcctgctgct ggatgtggct L621 tggggctgga gacagcaggt atga tgat ccagaggaac gccactatcc L68; ccaccaagca gacccagact ttcaccacct actcggacaa ccagcctggg aggtgtatga gggtgagagg gccatgacca aggacaacaa cctgctgggg cgttttgaac -80; tcagtggcat ccctcctgcc ccacgtggag tcccccagat agaggtgacc atgctaatgg catcctgagc gtgacagcca ctgacaggag cacaggtaag gctaacaaga 192; tcaccatcac caatgacaag ggccggctga gcaaggagga ggtggagagg atggttcatg 198; aagccgagca gtacaaggct gaggatgagg cccagaggga cagagtggct gccaaaaact 204; aggc ccatgtcttc catgtgaaag gttctttgca agaggaaagc cttagggaca 210; agattcccga agaggacagg atgc aagacaagtg tcgggaagtc cttgcctggc 216; tggagcacaa ccagctggca gagaaggagg agtatgagca gagg gagctggagc 222; aaatctgtcg ccccatcttc tccaggctct ggcc tggtgtccct gggggcagca 228: gttgtggcac tcaagcccgc cagggggacc ccagcaccgg ccccatcatt gaggaggttg 234; attgaatggc ccttcgtgat aagtcagctg tcag ggctatgcta tgggccttct 240; agactgtctt tcct gcccttcaga gatgaacttt ccctccaaag ctagaacttt 2461 cttcccagga aagt cttttgactt tttgggggga gggcggttca tcctcttctg 252; cttcaaataa aaagtcatta atttattaaa acttgtgtgg cactttaaca ttgctttcac 258; ctatattttg tgtactttgt tacttgcatg tatgaatttt gttatgtaaa atatagttat 264; agacctaaat aaaaaaaaaa aaaa Pnneh1seguence: NCBI Reference Seguence: NP_002146.2 LOCUS NP_002146 ACCESSION N P_002146 l mqaprelavg idlgttyscv gquqgrvei landqgnrtt psyvaftdte rlvgdaaksq 6; aalnphntvf dakrligrkf adttvqsdmk hwpfrvvseg gkpkvrvcyr gedktfypee 12; issmvlskmk etaeaylgqp vkhavitvpa yfndsqrqat kdagaiagln vlriinepta 18; aaiaygldrr gagernvlif fdvs vlsidagvfe vkatagdthl ggedfdnr;V 24; nhfmeefrrk gnkr alrrlrtace rakrtlssst qatleidslf egvdfytsit ; rarfeelcsd lfrstlepve kalrdakldk aqihdvvlvg gstripkvqk llqdffngke 36; lnksinpdea vaygaavqaa cekv dvap lslgletagg vmttliqrna 42L tiptkqtqtf ttysdnquv fiquegera mtkdnnllgr ppap rgquievtf 48L didangilsv tatdrstgka nkititndkg rlskeeverm vheaeqykae deaqrdrvaa 54L knsleahvfh vkgslqeesl rdkipeedrr kqukcrevl awlehnqlae keeyehqkre 60; pifs rlyggpgvpg gsscgtqarq gdpstgpiie evd PD|A4 Official Sym bol: PD|A4 Official Name: protein disulfide isomerase family A, member 4 m: 9601 Organism: Homo sapiens Other Aliases: ERP70, ERP72, ERp-72 Other Designations: ER protein 70; ER protein 72; endoplasmic reticulum resident protein 70; endoplasmic reticulum resident protein 72; protein disulfide isomerase related protein (calcium-binding protein, intestinal-related); protein disulfide ase-associated 4; protein disulfide-isomerase A4 Nucleotide seguence: NCBI nce Seguence: NM_004911.4 LOCUS NM_00491 1 ACCESSION NM_004911 l gttttaaacg cgcagccgag ggccgcgcgc aggagtaggg agggcctagg gcggcggagc 6; cgactcgtcg cggccgaggc gcgcgcggtc Cgtgccggcg tcagtctggg attggccggc 12; ccgcgacttc ctccgccccc tgccaatcgc cggggacgac ttccgtgggt ttttccggct 18; cccccgcgtc gctaaggagc gacgggctgt cggccagacc ccgagttctc ggtgcgctca 24; gngCCgCCg acgctaggag gccgcgctcc gcta ccatgaggcc ccggaaagcc ; ttcctgctcc tgctgctctt ggggctggtg cagctgctgg ccgtggcggg gggc 36; ccggacgagg attcttctaa aaat gccattgagg atgaagagga ggaggaggag 42; gaagatgatg atgaggaaga agacgacttg gaagttaagg aagaaaatgg agtcttggtc 48; ctaaatgatg caaactttga taattttgtg gctgacaaag acacagtgct gttt 54; ccat ggtgtggaca gcag ccgg aatatgaaaa aattgccaac 60; atattaaagg ataaagatcc tcccattcct gttgccaaga tcgatgcaac gtct 66; gtgctggcca gcaggtttga tgtgagtggc taccccacca tcaagatcct taagaagggg 72L caggctgtag actacgaggg ctccagaacc gaaa ttgttgccaa ggtcagagaa 78L gtctcccagc ccgactggac gcctccacca gaagtcacgc ttgtgttgac caaagagaac 84L tttgatgaag ttgtgaatga tgcagatatc attctggtgg agttttatgc cccatggtgt 90; ggacactgca agaaacttgc ccccgagtat gagaaggccg ccaaggagct cagcaagcgt 96; tctcctccaa ttcccctggc aaaggtcgac gccaccgcag aaacagacct ggccaagagg L02; tttgatgtct ctggctatcc caccctgaaa attttccgca aaggaaggcc ttatgactac L08; aacggcccac gagaaaaata tggaatcgtt gattacatga agtc cgggcctccc L14; tccaaggaga ttctgaccct gaagcaggtc caggagttcc tgaaggatgg tgtc L20; atcatcatcg gggtctttaa gggggagagt gcct accagcaata ccaggatgcc L26; gctaacaacc tgagagaaga ttacaaattt caccacactt tcagcacaga aatagcaaag L32; ttcttgaaag tctcccaggg gcagttggtt cagc ctgagaaatt ccagtccaag L38; cccc ggagccacat gatggacgtc cagggctcca cccaggactc ggccatcaag L44; gacttcgtgc tgaagtacgc cctgcccctg gttggccacc gcaaggtgtc aaacgatgct L50; aagcgctaca ccaggcgccc cctggtggtc gtctactaca gtgtggactt cagctttgat L56; tacagagctg caactcagtt ttggcggagc aaagtcctag aggtggccaa ccct L62; gagtacacct ttgccattgc ggacgaagag gctg gggaggtgaa ggacctgggg L68; gaga gtggggagga tgtcaatgcc gccatcctgg acgagagtgg gaagaagttc L74; gccatggagc cagaggagtt tgactctgac accctccgcg agtttgtcac tgctttcaaa L80; aaaggaaaac cagt catcaaatcc gtgc ccaagaacaa accc L86; gtcaaggtcg tggtgggaaa gacctttgac tccattgtga tggaccccaa gaaggacgtc L92; ctcatcgagt tctacgcgcc atggtgcggg aagc agctagagcc cgtgtacaac L98; agcctggcca agaagtacaa gggccaaaag ggcctggtca agat ggacgccact 204; gccaacgacg gcga ccgctataag gtggagggct tccccaccat cgcc 210; cccagtgggg acaaaaagaa cccagttaaa tttgagggtg gagacagaga tctggagcat 216; ttgagcaagt ttatagaaga acatgccaca aaactgagca ggaccaagga agagctttga 222; aggcctgagg tctgcggaag gtgggaggag gcagacgccc tgcgtggccc atggtcgggg 228; cgtccacgcc ggca acaaacgaca gtatctcgga ttcctttttt ttttttttta 234; attttttata ctttggtgtt tcacttcatg ctctgaatac tgaataacca tgaatgactg 240; aatagtttag tccagatttt tacagaggat acatctattt ttatcattat ttggggtttg 246; aaaaattttt ttttacacct tctaatttct ttatttctca aagcagataa ttcttctgtg 2521 tgaaaatgtt ttcttttttt aatttaaggt ttaaaattcc ttttccaaat catgttgatt 258; ttgctctttg ctttttcgtt gtctgagaaa ttgttggcgt agatttggct atgt 264; gtttctgatt gcttcctgtt gagcacaaag tgagagctgc cactgagcag ccctgccagg 270; ggtgctgttt caggctgggc atcgccaggc ggcctccctg caaaccaagg ggca 276; aaggggcatg atccagggtc ccccagggtg ggctcagctc cagggagagg ccacccacgt 2821 ggcagcccca cctcttgaga gcccccagtg ccggagcaga aaggaccctg gacccagagg 2881 cagatactgc ggggtggtag aaaaggtaga gtaggctgtg gcaatggaat aaaacacgat 2941 taaaaacgtt aaaa aaaaaaaaaa Protein seguence: NCBI Reference ce: NP_004902.1 LOCUS N P_004902 ACCESSION NP_004902 l flll lllglvqlla vagaegpded ssnrenaied eeeeeeeddd eeeddlevke 6L engvlvlnda nfdnfvadkd tvllefyapw cghckqfape yekianilkd kdppipvaki 121 vlas rfdvsgypti kilkkgqavd yegsrtqeei vakvrevsqp dwtpppevtl 181 vltkenfdev vndadiilve fyapwcghck klapeyekaa kelskrsppi plakvdatae 241 tdlakrfdvs gyptlkifrk ngpr dymi eqsgppskei 1t1kqvqef1 ; kdgddviiig pay qqudaannl redykfhhtf steiakflkv snglvvmqp 36; equskyepr shmmdvqgst quaikdfvl kyalplvghr kvsndakryt rrplvvvyys 42; vdfsfdyraa kvle vakdfpeytf aiadeedyag evkdlglses aiLd 48; esgkkfamep eefdsdtlre fvtafkkgkl kpviksqpvp knnkgpvkvv vgktfdsivm 54; dpkkdvlief hckq lepvynslak kykquglvi akmdatandv psdrykvegf 60; ptiyfapsgd kknpkaegg drdlehlskf ieehatklsr tkeel PD|A1 Official Symbol: P4HB Official Name: prolyl 4-hydroxylase, beta polypeptide Gene ID: 5034 Organism: Homo sapiens Other Aliases: DSI, ERBAZL, GIT, P4Hbeta, PDI, PDIA1, PHDB, PO4DB, PO4HB, PROHB Other Designations: cellular thyroid hormone-binding protein; en prolyl 4- hydroxylase beta; glutathione-insulin transhydrogenase; p55; procollagenproline , 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), beta polypeptide; prolyl 4-hydroxylase subunit beta; protein ide isomerase family A, member 1; protein disulfide isomerase-associated 1; protein disulfide isomerase/oxidoreductase; protein disulfide-isomerase; protocollagen hydroxylase; thyroid hormone—binding protein p55 Nucleotide seguence: NCBI Reference Seguence: NM_000918.3 LOCUS NM_00091 8 ION NM_000918 l gagcctcgaa gtccgccggc caatcgaagg cgggccccag cggcgcgtgc gcgccgcggc 6L cagcgcgcgc gggcgggggg gcaggcgcgc cccggaccca ggatttataa aggcgaggcc 12L gggaccggcg cgcgctctcg tcgcccccgc tgtcccggcg gcgccaaccg aagcgccccg l8; cctgatccgt gtccgacatg ctgcgccgcg ctctgctgtg cctggccgtg gccgccctgg 24; tgcgcgccga cgcccccgag gacc acgtcctggt gctgcggaaa agcaacttcg ; cggaggcgct ggcggcccac ctgc agtt ctatgcccct tggtgtggcc 36; actgcaaggc tctggcccct gagtatgcca aagccgctgg gaagctgaag gcagaaggtt 42; ccgagatcag gttggccaag gtggacgcca cggaggagtc tgacctggcc cagcagtacg 48; gcgg ctatcccacc atcaagttct tcaggaatgg agacacggct tcccccaagg 54; cagc tggcagagag gctgatgaca tcgtgaactg gaag cgcacgggcc 60; cggctgccac caccctgcct gacggcgcag ctgcagagtc cttggtggag gagg 66; tggctgtcat cggcttcttc aaggacgtgg agtcggactc tgccaagcag tttttgcagg 72; aggc catcgatgac ataccatttg ggatcacttc caacagtgac gtgttctcca 78; aataccagct cgacaaagat ggggttgtcc tctttaagaa gtttgatgaa ggccggaaca 84; actttgaagg ggaggtcacc aaggagaacc tgctggactt acac aaccagctgc 90; cccttgtcat cgagttcacc gagcagacag ccccgaagat ttttggaggt gaaatcaaga 96; ctcacatcct gctgttcttg cccaagagtg tgtctgacta tgacggcaaa ctgagcaact L02; tcaaaacagc agccgagagc ttcaagggca agatcctgtt catcttcatc gacagcgacc L08; acaccgacaa catc ctcgagttct ttggcctgaa gaaggaagag 114; tcat caccctggag gaggagatga ccaagtacaa gcccgaatcg gaggagctga 120; cggcagagag gatcacagag ttctgccacc gcttcctgga gggcaaaatc aagccccacc L26; tgatgagcca ggagctgccg gaggactggg acaagcagcc ggtg cttgttggga L32; agaactttga agacgtggct tttgatgaga aaaaaaacgt ctttgtggag ttctatgccc L38; catggtgtgg tcactgcaaa cagttggctc ccatttggga taaactggga gagacgtaca ;44; atga gaacatcgtc atcgccaaga tggactcgac tgccaacgag gtggaggccg ;50; tcaaagtgca cagcttcccc acactcaagt tctttcctgc cagtgccgac aggacggtca 156; ttgattacaa cggggaacgc acgctggatg gttttaagaa attcctggag agcggtggcc 162; aggatggggc aggggatgat gacgatctcg aggacctgga agaagcagag gagccagaca 168; tggaggaaga cgatgatcag aaagctgtga aagatgaact gtaatacgca aagccagacc 174; cgggcgctgc cgagacccct nggggctgc acacccagca gcagcgcacg ctgcggcctc gcttgaagga gggcgtcgcc ggaaacccag ggaacctctc tgaagtgaca 1861 cctcacccct acacaccgtc cgttcacccc cgtctcttcc ttctgctttt cggtttttgg 1921 aaagggatcc atctccaggc ccct ggtggggctt tgaa tcat acatgagtct gtccagagtg accg tgttcggagt gcct 204; ccctcccgcg ggaggtttct cctctttttg aaaattccgt ctgtgggatt tttagacatt 210; tttcgacatc agggtatttg ttccaccttg gccaggcctc ctcggagaag cttgtccccc 216; gagg gacggagccg gactggacat ggtcactcag ctgc agtgtcgcca 222; tgactgatca tggctcttgc atttttgggt aaatggagac ttccggatcc tgtcagggtg 228; tcccccatgc ctggaagagg agctggtggc tgccagccct ggggcccggc acaggcctgg 234; gccttcccct tccctcaagc cagggctcct cctcctgtcg tgggctcatt gtgaccactg 240; gcctctctac agcacggcct tgtt caaggcagaa ccacgaccct ccgg 246; gtggggaggt ggccaaggat gctggagctg aatcagacgc tgacagttct attt 252; ctatttcaca atcgaattga acacattggc caaataaagt tgaaatttta ccacctgtaa 258; aaaaaaaaaa aaaaaa Pnneh1seguence: NCBI Reference Seguence: NP_000909.2 LOCUS N P_000909 ACCESSION 909 1 mlrrallcla vaalvradap eeedhvlvlr ksnfaealaa hkyllvefya pwcghckala 61 peyakaagkl kaegseirla kvdateesdl aqqygvrgyp tikffrngdt aspkeytagr 121 eaddivnwlk krtgpaattl eslv essevavigf fkdvesdsak qflqaaeaid 181 dipfgitsns dvfskyqldk dgvvlfkkfd egrnnfegev tkenlldfik hnqlplvief 241 teqtapkifg geikthillf lpksvsdydg klsnfktaae sfkgkilfif idsdhtdnqr 301 ileffglkke ecpavrlitl ykpe seeltaerit efchrflegk ikphlmsqel 361 pedwqupvk fedv afdekknva efyapwcghc kqlapiwdkl getykdheni 421 viakmdstan eveavkvhsf ptlkffpasa drtvidynge rtldgfkkfl esgqugagd 481 dddledleea eepdmeeddd qkavkdel CA2D1 Official Symbol: CACNA2D1 Official Name: m channel, voltage-dependent, alpha 2/delta subunit 1 Gene ID: 781 Organism: Homo sapiens Other Aliases: H_DJ0560014.1, CACNA2, CACNL2A, CCHL2A Other Designations: calcium channel, L type, alpha 2 polypeptide; dihydropyridine-sensitive L-type, calcium channel alpha-2/delta subunit; voltagedependent calcium channel subunit alpha—2idelta-1 ; e-gated calcium channel subunit alpha-2fdelta-1 Nucleotide seguence: NCBI nce Seguence: NM_000722.2 LOCUS NM_000722 ACCESSION NM_000722 1 aggc aaggcggccg cggcgcggag acgc acgctagtgg gtccgcccgc 6" caccgcccct tcctcggcgt ccgctcccgc ccttgccgtc ccccgcgcgg ctccgcgcct 12; nggCCCng gcgcagccag ccctccagac gcccgcggtc ccggcggcgt gtgctgctct 18; tcctccgccc gcggtttcca gcgccgctcc ttcccccgct tgggcaggga gggggcattg 24; gatc gcgaagatgg ctgctggctg cctgctggcc ttgactctga cacttttcca ; atctttgctc atcggcccct cgtcggagga ccct tcggccgtca ctatcaaatc 36- atgggtggat aagatgcaag aagaccttgt cacactggca aaaacagcaa gtggagtcaa 42L tgtt gatatttatg agaaatatca agatttgtat actgtggaac caaataatgc 48L acgccagctg gtagaaattg cagccaggga gaaa agca acagatctaa 54L agccctggtg Cgcctggcat tggaagcgga gaaagttcaa gcagctcacc agtggagaga 60; agattttgca agcaatgaag ttgtctacta caatgcaaag gatgatctcg atcctgagaa 66; aaatgacagt gagccaggca ggat aaaacctgtt ttcattgaag atgctaattt 72; tggacgacaa atatcttatc agcacgcagc agtccatatt cctactgaca tctatgaggg 78; ctcaacaatt gtgttaaatg actg gacaagtgcc ttagatgaag aaaa 84; gaatcgcgag gaagaccctt cattattgtg tttt ggcagtgcca ctggcctagc 90; tcgatattat ccagcttcac catgggttga taatagtaga actccaaata agattgacct 96; ttatgatgta cgcagaagac catggtacat ccaaggagct gcatctccta aagacatgct L02; tattctggtg gatgtgagtg gaagtgttag tggattgaca cttaaactga L08; tgtctccgaa atgttagaaa ccctctcaga tgatgatttc gtgaatgtag cttcatttaa L14; cagcaatgct caggatgtaa gctgttttca gcaccttgtc caagcaaatg aaaagtgttg aaagacgcgg tgaataatat cacagccaaa ggaattacag L26; tagt tttgcttttg aacagctgct taattataat gtttccagag caaactgcaa L32; taagattatt atgctattca cggatggagg agaagagaga gcccaggaga atacaataaa gataaaaaag tacgtgtatt cacgttttca gttggtcaac acaattatga L44; acct attcagtgga tggcctgtga aaacaaaggt tattattatg cattggtgca ataagaatca atactcagga ggat gttttgggaa gaccaatggt L56; tttagcagga gacaaagcta tcca atggacaaat ctgg atgcattgga L62; actgggactt gtcattactg gaactcttcc ggtcttcaac ataaccggcc aatttgaaaa L68; taagacaaac ttaaagaacc agctgattct tggtgtgatg ggagtagatg tgtctttgga L74; agatattaaa agactgacac cacgttttac actgtgcccc aatgggtatt actttgcaat L80; cgatcctaat ggttatgttt atcc aaatcttcag ccaaagaacc ccaaatctca L86; ggagccagta acattggatt tccttgatgc agagttagag aatgatatta agat L92; tcgaaataag atgattgatg gggaaagtgg aaca ttcagaactc tggttaaatc L98; tgag agatatattg acaaaggaaa caggacatac acatggacac ctgtcaatgg 204; cacagattac agtttggcct tggtattacc aacctacagt ttttactata taaaagccaa 210; actagaagag acaataactc aggccagatc aaaaaagggc aaaatgaagg attcggaaac 216: cctgaagcca tttg aagaatctgg attc atagcaccaa gagattactg 2221 caatgacctg aaaatatcgg ataataacac tgaatttctt ttaaatttca acgagtttat 228; tgatagaaaa actccaaaca acccatcatg taacgcggat ttgattaata gagtcttgct 234; tgatgcaggc tttacaaatg aacttgtcca aaattactgg agtaagcaga aaaatatcaa 240; gaaa gcacgatttg ttgtgactga tggtgggatt accagagttt atcccaaaga 246; ggctggagaa aattggcaag aaaacccaga gacatatgag gacagcttct ataaaaggag 252; cctagataat gataactatg ttttcactgc tccctacttt aacaaaagtg gacctggtgc 258; atcg atgg taagcaaagc tgtagaaata tatattcaag ggaaacttct 264; taaacctgca gttgttggaa ttaaaattga tgtaaattcc tggatagaga atttcaccaa 270; aacctcaatc agagatccgt gtgctggtcc agtttgtgac tgcaaaagaa acagtgacgt 276; aatggattgt gtgattctgg atgatggtgg gtttcttctg atggcaaatc atgatgatta 282; tactaatcag attggaagat tttttggaga gattgatccc agcttgatga gacacctggt 2881 taatatatca gtttatgctt ttaacaaatc ttatgattat cagtcagtat gtgagcccgg 294; acca aaacaaggag caggacatcg ctcagcatat gtgccatcag tagcagacat 300; attacaaatt ggctggtggg ccactgctgc tgcctggtct cagc agtttctctt 3061 gagtttgacc cgac tccttgaggc agttgagatg gaggatgatg acttcacggc 312; ctccctgtcc aagcagagct gcattactga acaaacccag tatttcttcg acag 318; taaatcattc agtggtgtat tagactgtgg aaactgttcc agaatctttc atggagaaaa 324; gaac accaacttaa tattcataat ggttgagagc aaagggacat gtccatgtga 330; cacacgactg ctcatacaag cggagcagac ttctgacggt ccaaatcctt gtgacatggt 336; taagcaaccc cgaa ctga tgtctgcttt gataacaatg tcttggagga 342; tgac tgtggtggtg tttctggatt aaatccctcc ctgtggtata tcattggaat 348; ccagtttcta ctactttggc ctgg cagcacacac cgcctgttat gaccttctaa 354; aaaccaaatc tgcatagtta aactccagac cctgccaaaa catgagccct gccctcaatt 360; acagtaacgt agggtcagct ataaaatcag acaaacatta gctgggcctg ttccatggca 366; taacactaag gcgcagactc ctaaggcacc cactggctgc atgtcagggt gtcagatcct 372; taaacgtgtg tgaatgctgc atcatctatg tgtaacatca aagcaaaatc ctatacgtgt 378; cctctattgg tggg agtttgttgt tgcattgttg gt Protein seguence: NCBI Reference Seguence: NP_000713.2 LOCUS N P_00071 3 ACCESSION N P_000713 1 maagcllalt 1t1qullig psseepfpsa vdkm qedlvtlakt asgvnqlvdi 61 yekyqdlytv qlve iaardiekll snrskalvrl aleaekvqaa hqwredfasn 121 evvyynakdd ldpekndsep gsqrikpvfi edanfgrqis ipt diyegstivl 18; nelnwtsald evfkknreed psllqufgs atglaryypa spwvdnsrtp nkidlydvrr 24; rpwyiqgaas pkdmlilvdv sgsvsgltlk lirtsvseml etlsdddfvn naqd ; vschhlvqa nvrnkkvlkd avnnitakgi tdykkgfsfa ynvs rancnkiiml 36; ftdggeeraq eifnkynkdk kvrvftfsvg qhnydrgpiq wmacenkgyy yeipsigair 42; intqeyldvl grpmvlagdk tnvy ldalelglvi tgtlpvfnit gqfenktnLk 48; nqlilgvmgv dvsledikrl tprftlcpng yyfaidpngy vllhpnlqpk npksqepvtl 54L dfldaelend ikveirnkmi dgesgektfr tlvksqdery idkgnrtytw tpvngtdysl 60; alvlptysfy eeti tqarskkgkm kdsetlkpdn feesgytfia prdycndlki 66L flln fnefidrktp nnpscnadli nrvlldagft nelvqnywsk vkar 721 fvvtdggitr vypkeagenw yeds dndn yvftapyfnk sgpgayesgi 78; mvskaveiyi qgkllkpavv gikidvnswi enftktsird cdck rnsdvmdcvi 84; lddggfllma nhddytnqig rffgeidpsl mrhlvnisvy afnksydyqs vcepgaapkq 90; gaghrsayvp svadilqigw wataaawsil qqfllsltfp emed ddftaslskq 96; sciteqtqyf fdndsksfsg Vldcgncsri fhgeklmntn lifimveskg tcpcdtrlLi 102; qaeqtsdgpn pcdmvkqpry rkgpdvcfdn nvledytdcg gvsglnpslw yiigiqflLl 108; wlvsgsthrl 1 GPAT1 Official Symbol: GPAM Official Name: glycerol—3—phosphate acyltransferase, mitochondrial m: 57678 Organism: Homo sapiens Other Aliases: RP11-426E52, GPAT, GPAT1 Other ations: GPAT-1 ; glycerol 3-phosphate acyltransferase, mitochondrial; glycerolphosphate acyltransferase 1, mitochondrial Nucleotide seguence: NCBI Reference Seguence: NM_001244949.
LOCUS NM_001 244949 ACCESSION NM_001244949 l tgcgtcatca gggtgcgcca ctgcagctgg cattggccgg gactggaagt gcgggcttct 6" gcagcagccg gagc ggca gcagcggctc ccctgttgta tggacattct 12; gcacccgaaa ctgatagctg agtcctgaag ttttatgtta tgaaacagaa gaactttcat 18; cccagcacat gatttgggaa ttacactttg tgacatggat gaatctgcac tgacccttgg 24; tacaatagat gtttcttatc tgccacattc atcagaatac agtgttggtc gatgtaagca ; cacaagtgag gaatggggtg agtgtggctt tagacccacc atcttcagat ctgcaacttt 36L aaaatggaaa gaaagcctaa tgagtcggaa aaggccattt gttggaagat gttgttactc 42L ctgcactccc cagagctggg acaaattttt caaccccagt atcccgtctt tgggtttgcg 48L gaatgttatt tatatcaatg aaactcacac ccgc ggatggcttg caagacgcct 54L ttcttacgtt ctttttattc aagagcgaga tgtgcataag ggcatgtttg ccaccaatgt 60; gactgaaaat gtgctgaaca gcagtagagt acaagaggca attgcagaag tggctgctga 66; attaaaccct gatggttctg cccagcagca atcaaaagcc gttaacaaag agaa 72; agctaaaagg attcttcaag ttgc ctca ccggcaatga tcagactgac 78; tgggtgggtg ctgctaaaac tgttcaacag cttcttttgg aacattcaaa ttcacaaagg 84; tcaacttgag aaag ctgcaactga gacgaatttg ccgcttctgt ttctaccagt 90; tcatagatcc catattgact atctgctgct cactttcatt tgcc ataacatcaa 96; agcaccatac attgcttcag gcaataatct caacatccca agta ccttgatcca L02; taagcttggg ggcttcttca tacgacgaag gctcgatgaa acaccagatg gacggaaaga L08; ctat agagctttgc tccatgggca tatagttgaa ttacttcgac agcagcaatt L14; cttggagatc ttcctggaag gcacacgttc tgga aaaacctctt gtgctcgggc L20; aggacttttg tcagttgtgg tagatactct gtctaccaat gtcatcccag tgat L26; aatacctgtt ggaatctcct atgatcgcat tatcgaaggt cactacaatg gtgaacaact L32; gggcaaacct aagaagaatg agagcctgtg gagtgtagca agaggtgtta ttagaatgtt L38; acgaaaaaac tatggttgtg tccgagtgga ttttgcacag tcct L44; tttagaaagc caaagtcaga aaccggtgtc tgctctactt tccctggagc tgtt L501 accagctata cttccttcaa gacccagtga tgctgctgat gaaggtagag ccat L56; taatgagtcc agaaatgcaa cagatgaatc cctacgaagg aggttgattg ctattcactg ctagcaagtc catt atgtccacac gctc ctctacagac acaggcaggg aattgatctc tccacattgg ctttgtgatg aaagaggaag tcctggctcg tgattttgac ctggggttct caggaaattc 180; agaagatgta gtaatgcatg ccatacagct gctgggaaat tgtgtcacaa tcacccacac 186; tagcaggaac gatgagtttt ttatcacccc cagcacaact gtcccatcag tcttcgaact 192; caacttctac agcaatgggg tacttcatgt ctttatcatg gaggccatca tagcttgcag 198; cctttatgca gttctgaaca gact ggggggtccc accc cacctaacct 204; gatcagccag gagcagctgg aggc ggccagcctg tgctaccttc tctccaatga 210; aggcaccatc tcactgcctt gccagacatt ttaccaagtc tgccatgaaa cagtaggaaa 216; gtttatccag tatggcattc ttacagtggc agagcacgat gaccaggaag gtcc 222; tagtcttgct gagcagcagt gggacaagaa gcttccagaa cctttgtctt gtga 228; tgaagaagat gaagacagtg actttgggga ggaacagcga gattgctacc tgaaggtgag 234; ccaatccaag gagcaccagc agtttatcac cttcttacag cttg ggcctttgct 240; ggaggcctac agctctgctg ttgt tcacaacttc agtggtcctg ttccagaacc 246; tgagtatctg caaaagttgc acaaatacct aataaccaga acagaaagaa atgttgcagt 252; atatgctgag agtgccacat ttgt gaagaatgct gtgaaaatgt ttaaggatat 258; tggggttttc aaggagacca aacaaaagag agtgtctgtt ttagaactga gcagcacttt 264; tctacctcaa tgcaaccgac aaaaacttct agaatatatt ctgagttttg tggtgctgta 270; ggtaacgtgt ggcactgctg gcaaatgaag gtcatgagat gagttccttg taggtaccag 276; cttctggctc aagagttgaa ggtgccatcg cagggtcagg cctgccctgt gtga 282; tctcctggaa gacaagtgcc ttctccctcc atggatctgt gatcttccca gctctgcatc 288; aacacagcag cctgcagata acacttgggg ggacctcagc ctctattcgc taat 294; ccgtagacta caagatgaaa tctcaataaa ttatttttga gtttattaaa gattgacatt 300; ttaagtacaa cttttaagga ctaattactg tgatggacac agaaatgtag ctgtgttctg 306; gaactgaatc ttacatggta tacttagtgc tgctgggtaa ggta tattatctgg 312; ttagtggtta atgcttcctt taaaaataat tgagtcatcc attcactctt tttcagtttt 318; atctgtcaat taca atgg gagcaccttt tatcccaaag tgctttataa 324; attgagtgga ctgatatata tcacacccag gtatcactgt gctgtccttt agat 3301 ttagaaatgt ttttaagagc tatgtgaaaa cagacaatat tagtttaggt cgggaactga 336; gatattgtaa tcaaatagtt agga agttaatttg gctggcaaaa ttctagggaa 342; acttggccag aaaactggtg ttgaaggctt ttgctcatat aaacaagtgc cattgagttt 348; caaatgacca gcaaatatat ttagaaccct tcctgtttta tgtctgtacc accc 354; ctcaggtaat acctgcctct cacaggtaca gctgtttctt ggaaatcctc caaccaaata 360; gcagttttcc gatt agcttgagct gacagactgt tagaatacag ttctctggcc 366; acagctgatg agggctttct gtactgcaca cagattgtgt actgcacccc agtccaggtg 372; actggtaccc gttg tgccgtgcac aacctgtcca gtatatgcat gtggtggccc 378; tactgactgg ttag aggcatttat ggatttttag ggaa aaaccatgac 384; ttttaacaaa tttttatggg ttatatgcct aaacccttat tagt ggtaaataat 390; tatgaaaaat ggtctgttca taattggtag gtgccttttg tgagcaggga ttat 396; tggtttatta tggtaattat ggtgattttt taaatatcat gtaatgttaa aacgttttct 402; aacagtttac tgttgcttat ctccaagata ttatggaatt aagaattttt ccagatgagt 408; gttacataga ttctttgaat ttagtataaa agtactgaga attaagtttg tacttccata 414; gatt ttaaacactg atagtatctc atgagtaatg tgtgttttgg gagagggagg 420; gatt gatatttcac attgtatgaa ataccatgtt tgaaactcat agcaataatg 426; ctatgctgtt gtgatccctc tcaagttctg catttaaaat atattttttc tttataggaa 432; tata ccatgaagtc attgtcagtt gtagtagctc tgatgttgaa tgagatatca 438; tgttttagca ttccatttta aggg tagaagaaca cttttcttgg ctacatttgg 444; aggataccca gggagtcttg ggtgttcctt atctggggaa gcaaacattt cactagtctc 450; tcat cctttaaatt gtaaattaag gattactcaa gctcaccatt attcaagatt 456; gggactcgct tcccagtcga cactctgccc tgcctgtcat tgctgcaaag agctgctgct 462; ttgccaacct aagcaaagaa aatacggctt ctcttgcatt cctt ttggttggtt 468; tgttttctag gttc agatgctttg gggaatgcaa tgtatgattt tctc 474; tcaccactta actcactgtg aata tgcatgcttt ttgtaattaa ctggtgcttt 480; gaaaatcttt tttaagggag aaaaatctca accaaagtta tgctcatcca gacaagctga 486; cctttgagtt aatttcagca caactcattc ttcagtgcct catgactgaa aacaaaaaac 492; aaaaaaacga aagcatcttc acaatgaagc ttccagatag caccgttttg ctaaaagata 498; cattctcatt gttttccaac agtgatggct taag gttaaacaaa ctaggtgctt 504; aatt tattacagtt tactctatcg catttctgta acatgaaatg catgcccttc 5101 ttcagggqaa gactgtggtc aagttaaaaa aaaaaaacaa tattaaacaa actg 516; cagtctgttt ttgaaaatga gaatgtccta agtgattcag aagagaggag ggaagttgtg 522; cactctgaaa atgcatgaaa aacaaaggca aaaactagtg ggaaatgtgt agaactgtta 528; actgagatgg cttcgagtct tccttctgga atctgttaaa tttcacaaag tcatgagggt 534; aaatggagaa aatatttctg ggattacaat gaatgtaagc ccaaattgtg gaattgccag 540; taacctggat ggggaaaagc atttcccata gcactccatg taatatgagt tgag 546; atgttcatca gtgttttata gtgt gaaa ccaagtttgc acctggaaac 552; ttacaatgca ctttagcgca gtaagggctt ggcatccggt aaac tgtctaaccc 558; agcattgccc aaactatttt gacaccagga cctttttctc ctttgggata cttatgaacc 564; tctcactaat gtcctgtgga gaacattttg ggaaacacta tgttagatag ttctttaagg 570; agacaaaacg gtaatgaaca gatagcactg gggcagaata tgcatgcatt ttgtaacgtc 576: cagtgtggcg ttgaatagat gtgtatttcc tcccctgcag aaaataagca cagaaaatta 582; taatgtaggt gatcggagct ctttcctttg agaa cagccccaat gatcctggct 588; ttttcactga acgtatcaga atacatggat gaattggggt aaataaggtt ttaattcaga 5941 tctagaagaa gtac gtttgaatgc agatttttat ccacagatag ttgtagtgtt 600; tagacatgac aggacctatc gttgaggttt ctaagactta ctatgggctg taaacctgtt 606; ttttaaaact attttagaaa cctgagactt gccgtctggc attttagttt aatacaaact 612; aatgattgca tttgaaagag attcttgacc ttatttctaa acgtctagag ctctgaaatg 618; tcttgatgga aggtattaaa ctatttgcct gttgtacaaa gaaatgttaa gactcgtgaa 624; aagaattact ataaggtact gtgaaataac tgcgattttg tgagcaaaac atacttggaa 630; atgctgattg atttttatgc gtgt attgcaagaa acacagaaaa tgtagttttg 636; taaa ccaaaaattg aacatacaaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_001231878.1 LOCUS N P_001231878 ION N P_001231878 l mdesaltlgt phss eysvgrckht seewgecgfr ptifrsatlk srkr 6; pfvgrccysc tpqswdkffn psipslglrn viyinethtr hrgwlarrls yvlfiqerdv 12; hkgmfatnvt envlnssrvq eaiaevaael npdgsaqqqs kavnkvkkka krilqemvat l8; vspamirltg fnsf fwniqihkgq lemvkaatet nlpllflpvh rshidylllt 24L filfchnika pyiasgnnln lihk lggffirrrl detpdgrkdv lyrallhghi ; velquqqfl eiflegtrsr sgktscarag llsvvvdtls tnvipdilii pvgisydrii 36; eghyngeqlg kpkkneslws vargvirmlr knygcvrvdf aqpfslkeyl pvsa 42; llsleqallp psda adegrdtsin esrnatdesl rrrlianlae hilftasksc 481 aimsthivac 111yrhrqgi dlstlvedff vmkeevlard fdlgfsgnse dvvmhaiqll 541 gncvtithts rndeffitps ttvpsvfeln fysngvlhvf imeaiiacsl yavlnkrglg 601 gptstppnli sqeqlvrkaa slcyllsneg qtfy qvchetvgkf iqygiltvae 661 hddqedisps 1aequdkkl peplswrsde ededsdfgee qrdcylkvsq skehqqfitf 721 lqulgplle ayssaaifvh nfsgpvpepe yli trternvavy aesatyclvk 781 navkmfkdig vfketqurv svlelsstfl pqcnqulle yilsfvvl Official Symbol: TAZ Official Name: tafazzin Gene ID: 6901 Organism: Homo sapiens Other Aliases: XX-FW83563B9.3, BTHS, CMDBA, EFE, EFE2, G4.5, LVNCX, Taz1 Other Designations: protein G4.5 Nucleotide seguence: NCBI Reference Seguence: NM_000116.3 LOCUS 116 ACCESSION NM_000116 1 tttccggcgg ttgcaccggg ccggggtgcc agcgcccgcc ttcccgtttc ctcccgttcc 61 gcagcgcgcc cacggcctgt gaccccggcg accgctcccc agtgacgaga gagcggggcc 121 gggcgctgct ccggcctgac ctgcgaaggg acctcggtcc agtcccctgt tgcgccgcgc 181 cccctgtccg tccgtgcgcg ggccagtcag gggccagtgt cggt cgaggtcgca 24; gagg cgccccacag ccgg ggcgctggga gccg cgggccgggt ; ggggatgcct ctgcacgtga agtggccgtt ccccgcggtg ccgccgctca ccct 36L ggccagcagc gtcgtcatgg gcttggtggg cacctacagc tgcttctgga ccaagtacat 42; gaaccacctg accgtgcaca acagggaggt gctgtacgag ctcatcgaga agcgaggccc 481 ggccacgccc ctcatcaccg tgtccaatca ccagtcctgc atggacgacc ctcatctctg 541 ggggatcctg cgcc acatctggaa cctgaagttg atgcgttgga cccctgcagc 60; tgcagacatc tgcttcacca aggagctaca ctcccacttc ttcagcttgg gtgt 66; gtgc cgaggagcag aatttttcca agcagagaat gaggggaaag gtgttctaga 72; cacaggcagg cacatgccag gtgctggaaa aagaagagag aaaggagatg gcgtctacca 78; gaaggggatg gacttcattt tggagaagct caaccatggg gactgggtgc atatcttccc 84; agaagggaaa gtgaacatga gttccgaatt cctgcgtttc aagtggggaa tcgggcgcct 90; gattgctgag tgtcatctca accccatcat cctg tggcatgtcg gaatgaatga 96L cgtccttcct aacagtccgc cctacttccc ccgctttgga cagaaaatca ctgtgctgat 102; cgggaagccc ttcagtgccc tgcctgtact cgagcggctc cgggcggaga acaagtcggc 108; tgtggagatg cggaaagccc actt cattcaagag cagc cagctccaca accacctcca gcctgggaga taggccttgc tctt ggcccgcaca gagctggggc tgagggatgg actgatgctt caaa 126; cgtggctttt agacagattt gttcatagac cctctcaagt gccctctccg cattccagct cctccgtgct tcctcagtta cacaaaggac ctcagctgct " ggccaagcag ggaggaagaa gcttaggcag ggctctcttt ccttcttgcc ttcagatgtt 144; ctctcccagg ggctggcttc aggagggagc atagaaggca ggtgagcaac cagttggcta 150; ggggagcagg gggcccacca tgga gaggggaccc tcct cggcctggct 156; cctacccacc gcccttgccg aaccaggagc tgctcactac ctcctcaggg gttg 162; gccacgtctt ccttctgcct gagcttcccc ccgaccacag tcct caggcaaggt 168; ctggcctcag gtgggccgca ggcgggaaaa gcagcccttg gccagaagtc aagcccagcc 174; acgtggagcc tagagtgagg gcctgaggtc tggctgcttg cccccatgct ggcgccaaca 180; acttctccat cctttctgcc tctcaacatc acttgaatcc tagggcctgg gttttcatgt 186; ttttgaaaca gaaccataaa gcatatgtgt tggcttgttg taaaaaaaaa aaaaaaaaa Protein seguence: NCBI Reference Seguence: 107.1 LOCUS N P_0001 07 ACCESSION N P_000107 1 mplhvkwpfp avppltwtla ssvvmglvgt yscfwtkymn hltvhnrevl yeliekrgpa 61 tplitvsnhq scmddphlwg ilklrhiwnl klmrwtpaaa elhs kcvp 121 vcrgaeffqa enegkgvldt grhmpgagkr rekgdgquk gmdfilekln hgdwvhifpe 181 gkvnmssefl rfkwgigrli piil plwhvgmndv lpnsppyfpr fquitvlig 241 kpfsalpvle rlraenksav emrkaltdfi ktq aeqlhnhlqp gr OfibblSwnbd:COL1A2 Official Name: collagen, type I, alpha 2 m: 1278 Organism: Homo sapiens Other Aliases: Ol4 Other Designations: alpha 2(l)-collagen; 2 type I collagen; collagen l, alpha-2 polypeptide; collagen alpha-2(l) chain; collagen of skin, tendon and bone, alpha-2 chain; type | procollagen tide seguence: NCBI Reference Seguence: NM_000089.3 LOCUS NM_000089 ACCESSION NM_000089 l gtgtcccata gtgtttccaa acttggaaag ggcgggggag ggcgggagga gggc 6; ggaggtatgc agacaacgag tcagagtttc cccttgaaag cctcaaaagt gtccacgtcc 12; tcaaaaagaa tggaaccaat ttaagaagcc agccccgtgg ccacgtccct tcccccattc 18; gctccctcct ctgcgccccc gcaggctcct cccagctgtg gctgcccggg cccccagccc 24; cagccctccc attggtggag gcccttttgg aggcacccta gggccaggga aacttttgcc ; gtataaatag tccg ggctttatta ttttagcacc acggcagcag gaggtttcgg 36; ctaagttgga ggtactggcc acgactgcat gcccgcgccc gccaggtgat acctccgccg 42; gtgacccagg gcga cacaaggagt ctgcatgtct aagtgctaga catgctcagc 48; gata cgcggacttt gttgctgctt gcagtaacct tagc aacatgccaa 54L tctttacaag aggaaactgt aagaaagggc ccagccggag atagaggacc acgtggagaa 60; aggggtccac caggcccccc aggcagagat ggtgaagatg cagg ccctcctggt 66L ccacctggtc ctcctggccc ccctggtctc ggtgggaact ttgctgctca gtatgatgga 72; aaaggagttg gccc aatg ggcttaatgg gacctagagg cccacctggt 78; gcagctggag ccccaggccc tcaaggtttc caaggacctg ctggtgagcc tggtgaacct 84; ggtcaaactg cagg tgctcgtggt ccagctggcc ctcctggcaa ggctggtgaa 90; gatggtcacc ctggaaaacc cggacgacct ggtgagagag gagttgttgg accacagggt 96; gctcgtggtt tccctggaac tcctggactt cctggcttca aaggcattag gggacacaat L02; ggtctggatg gattgaaggg acagcccggt gctcctggtg gtga acctggtgcc L08; gaaa atggaactcc aggtcaaaca ggagcccgtg ggcttcctgg tgagagagga L14; cgtgttggtg cccctggccc agctggtgcc cgtggcagtg atggaagtgt gggtcccgtg L20; ggtcctgctg gtcccattgg gtctgctggc cctccaggct tcccaggtgc ttggagctgt tggtaacgct ggtcctgctg gtcccgccgg ttccaggcct cccc gttggacctc ctggtaatcc gtgccaaggg tgctgctggc cttcccggcg ttgctggggc cctggacccc ttcc tggccctgtt ggtgctgccg gtgctactgg cttgttggtg agcctggtcc agctggctcc aaaggagaga gcggtaacaa gctg ggccccaagg tcctcctggt cccagtggtg aagaaggaaa gagaggccct L62; aatggggaag ctggatctgc cggccctcca cctg ggctgagagg tagtcctggt L68; tctcgtggtc ttcctggagc tgatggcaga gctggcgtca tgggccctcc tggtagtcgt L74; ggtgcaagtg gccctgctgg agtccgagga cctaatggag atgctggtcg ccctggggag L80; cctggtctca tgggacccag aggtcttcct ggttcccctg gaaatatcgg ccccgctgga L86; aaagaaggtc ctgtcggcct ccctggcatc gacggcaggc caat tggcccagct L92; ggagcaagag gagagcctgg caacattgga ttccctggac ccaaaggccc tgat L98; cctggcaaaa acggtgataa aggtcatgct ggtcttgctg gtgctcgggg tgctccaggt 204; cctgatggaa acaatggtgc acct cctggaccac agggtgttca aggtggaaaa 210; ggtgaacagg gtccccctgg aggc ttccagggtc tgcctggccc ctcaggtccc 216; gctggtgaag ttggcaaacc aggagaaagg catg gtgagtttgg tggt 222; cctgctggtc caagagggga acgcggtccc ccaggtgaga gtggtgctgc cggtcctact 228; ggtcctattg gaagccgagg tgga cccccagggc ctgatggaaa caagggtgaa 2341 cctggtgtgg ttggtgctgt gggcactgct ggtccatctg gtgg actcccagga 2401 qaqaqqqqtq ctgctggcat acctggaggc aagggagaaa agggtgaacc tggtctcaga 246; ggtgaaattg gtaaccctgg cagagatggt gctcgtggtg ctcctggtgc tgtaggtgcc 252; cctggtcctg ctggagccac aggtgaccgg ggcgaagctg gggctgctgg tcctgctggt 258; cctgctggtc ctcggggaag ccctggtgaa gagg tcggtcctgc tggccccaat 264; ggatttgctg gtcctgctgg tgctgctggt ggtg ctaaaggaga aagaggagcc 270; aaagggccta agggtgaaaa cggtgttgtt ggtcccacag gccccgttgg agctgctggc 276; ccagctggtc caaatggtcc ccccggtcct gctggaagtc gtggtgatgg aggcccccct 282; ggtatgactg gtttccctgg tgctgctgga cggactggtc ccccaggacc ctctggtatt 288; tctggccctc ctggtccccc tggtcctgct gggaaagaag ggcttcgtgg tcctcgtggt 294; gaccaaggtc cagttggccg aactggagaa gtaggtgcag ttggtccccc tggcttcgct 300; aagg gtccctctgg agaggctggt actgctggac ctcctggcac tccaggtcct 306; cagggtcttc ttggtgctcc tctg ggtctccctg gctcgagagg tggt 312; ctaccagqtg ttgctggtgc tgtgggtgaa cctc ttggcattgc cggccctcct 318; ggggcccgtg ctgg tgctgtgggt agtcctggag gtgc tcctggtgaa 324; gctggtcgtg accc tgggaacgat ggtcccccag gtcgcgatgg tcaacccgga 330; cacaagggag agcgcggtta caat attggtcccg ctgc aggtgcacct 336; ggtcctcatg tggg tcctgctggc aaacatggaa accgtggtga aactggtcct 342; tctggtcctg ttggtcctgc tggtgctgtt ggcccaagag gtcctagtgg cccacaaggc 348; attcgtggcg ataagggaga gcccggtgaa aaggggccca gaggtcttcc tggcttaaag 354; ggacacaatg aagg tctgcctggt atcgctggtc accatggtga tcaaggtgct 360; cctggctccg tgggtcctgc tggtcctagg ggccctgctg gtccttctgg ccctgctgga 366; aaagatggtc gcactggaca tcctggtaca gttggacctg ctggcattcg aggccctcag 372; ggtcaccaag gccctgctgg cccccctggt ccccctggcc ctcctggacc tccaggtgta 378; agcggtggtg gttatgactt tggttacgat ttct acagggctga tcgc 384; tcagcacctt ctctcagacc caaggactat gaagttgatg ctactctgaa gtctctcaac 390; aaccagattg agacccttct tactcctgaa ggctctagaa agaacccagc tcgcacatgc 396; cgtgacttga gactcagcca cccagagtgg agcagtggtt actactggat tgaccctaac 402; caaggatgca ctatggatgc tatcaaagta tactgtgatt tctctactgg cgaaacctgt 408; atccgggccc aacctgaaaa catcccagcc aagaactggt ataggagctc caaggacaag 4141 aaacacgtct ggctaggaga aactatcaat agcc agtttgaata taatgtagaa 420; ggagtgactt ccaaggaaat ggctacccaa ttca tgcgcctgct ggccaactat 426; gcctctcaga acatcaccta caag aacagcattg catacatgga tgaggagact 432; ggcaacctga aaaaggctgt cattctacag ggctctaatg aact tgttgctgag 438; ggcaacagca ggttcactta cactgttctt gtagatggct gctctaaaaa gacaaatgaa 444; tggggaaaga caatcattga atacaaaaca aataagccat cacgcctgcc cttccttgat 450; cctt tggacatcgg tggtgctgac caggaattct ttgtggacat agtc 456; tgtttcaaat actc aatctaaatt aaaaaagaaa gaaatttgaa aaaactttct 462; ctttgccatt tcttcttctt cttttttaac tgaaagctga atccttccat ttcttctgca 468; cttg cttaaattgt gggcaaaaga gaaaaagaag gattgatcag agcattgtgc 474; aatacagttt cattaactcc ttcccccgct cccccaaaaa tttgaatttt tttttcaaca 480: ctcttacacc tgttatggaa aatgtcaacc tttgtaagaa aaccaaaata aaaattgaaa 486; aataaaaacc ataaacattt gcaccacttg tggcttttga atatcttcca cagagggaag 492; tttaaaaccc aaacttccaa aggtttaaac aaaa cactttccca tgagtgtgat 4981 ccacattgtt aggtgctgac ctagacagag tgag gtccttgttt gttc 504; caaa aatt aatagtattt cagatacttg aagaatgttg atggtgctag 510; aagaatttga gaagaaatac tcctgtattg agttgtatcg tgtggtgtat tttttaaaaa 516; atttgattta gcattcatat tttccatctt attcccaatt aaaagtatgc agattatttg 522; cccaaatctt cttcagattc agcatttgtt ctttgccagt ctcattttca tcttcttcca 528; tggttccaca gaagctttgt ttcttgggca agcagaaaaa ttaaattgta cctattttgt 534; atatgtgaga tgtttaaata aattgtgaaa aaaatgaaat aaagcatgtt tggttttcca 540; aaagaacata t seguence: NCBI Reference Seguence: NP_000080.2 LOCUS NP_000080 ACCESSION NP_000080 l mlsfvdtrtl lllavtlcla thslqeetv rkgpagdrgp rgergppgpp grdgedgptg 6; ppgppgppgp pglggnfaaq ydgkgvglgp gpmglmgprg ppgaagapgp qgfquagep 12; gepgqtgpag argpagppgk agedghpgkp grpgergvvg fpgt pglpgfkqir 18; ghngldglkg quapgvkge pgapgengtp gqtgarglpg ergrvgapgp agargsdgsv 24; qpvqpaqpiq sagpqupga pgpkgeigav gnagpagpag prgevglpgl sgpvgppqnp ; gangltgakg aaglpgvaga pglpgprgip gpvgaagatg arglvgepgp agskgesgnk 36; gepgsagpqg ppgpsgeegk agsa glrg Spgsrglpga dgragvmgpp 42; gsrgasgpag vrgpngdagr pgepglmgpr glpg8pgnig pagkegpvgl pgidgrpgpi 48; gpagargepg nigfpgpkgp ngdk ghaglagarg apgpdgnnga qupgpqqu 54; ggkgquppg ppgfqglpgp Sgpagevgkp gerglhgefg lpgpagprge rgppgesgaa 60; gptgpigsrg psgppgpdgn kgepgvvgav gtagpsgpsg lpgergaagi pggkgekgep 66; glrgeignpg rdgargapga vgapgpagat gdrgeagaag pagpagprgs pgergevgpa 72; gpngfagpag aagquakge rgakgpkgen gvvgptgpvg aagpagpngp pgpagsrgdg 78; gppgmtgfpg aagrtgppgp Sgisgppgpp glrg prgdquvgr tgevgavgpp 84- gfagekgpsg eagtagppgt pgpqgllgap gilglpgsrg erglpgvaga vgepgplgia 901 qppqarqppq avgspgvnga pgeagrdqnp grdg quhkgergy vgaa 961 qapqphqpvq pagkhgnrge tgpsgpvgpa gavgprgpsg pqgirgdkge pgekgprqlp 102; glkghnglqg lpgiaghhgd qgapgsvgpa gprgpagpsg pagkdgrtgh agir 108; gpqghquag ppgppgppgp gydf yrad qprsapslrp 114; slnnqietll tpegsrknpa rtcrdlrlsh yywi dpnqgctmda ikvycdfstg 120; etciraqpen ipaknwyrss kdkkhvwlge tinagsqfey nvegvtskem atqlafmrLl 126; anyasqnity hcknsiaymd eetgnlkkav ilqgsndvel vaegnsrfty tvlvdgcskk 132; tnewgktiie yktnkpsrlp fldiapldig gadqeffvdi gpvcfk LAMC1 OfibblSwnbokLAMC1 OmdmNameHammmgamma1UmmmWLAMBm Gene ID: 3915 CkgamsmzHomosapmns Other Aliases: RP11-181K3.1, LAMBZ Other Designations: S—LAM gamma; S-laminin subunit gamma; laminin 82 chahnIamhnnsubunfigannnaJhlannnm-10subunngannnaflanflnm-11subunn gannnaHanflnm-ZsubunhgannnaflanfinM-Bsubunhgannnaflanflnm-4subunH gannnaflanflnm-6subunfigannnaflanfinm¥7subunhgannnaflanfinm-Bsubunfl mmnmesummfigamma Nudeofideseguence: NCBI Reference Seguence: NM_002293.3 LOCUS NM_002293 ION 293 1 gtgcaggctg ctcccggggt aggtgaggga agcgcggagg cggcgcgcgg gggcagtggt 6; cggcgagcag cgcggtcctc ggcg cccacccgtc agtctctccg gcgcgagccg 12; ccgccaccgc ccgcgccgga gtcaggcccc tgggccccca ggctcaagca cggc 18; ggga tagg cgagaggaac gcgccggtgc ccttgccttc gccgtgaccc 24; agcgtgcggg ngcgggatg agagggagcc atcgggccgc gccggccctg cggccccggg 30L ggcggctctg gcccgtgctg gccgtgctgg ccgc cgcggcgggc tgtgcccagg 36L cagccatgga cgagtgcacg gacgagggcg ggcggccgca gcgctgcatg cccgagttcg 42L tcaacgccgc cgtg actgtggtgg acac gtgtgggact ccgcccgagg 48L aatactgtgt gcagaccggg gtgaccgggg tcaccaagtc ctgtcacctg tgcgacgccg 54; ggcagcccca cctgcagcac ggggcagcct tcctgaccga ctacaacaac gaca 60; ccacctggtg gcaaagccag accatgctgg ccggggtgca gtaccccagc tccatcaacc 66; tcacgctgca cctgggaaaa gcttttgaca tcacctatgt gcgtctcaag ttccacacca 72; gccgcccgga gagctttgcc atttacaagc gcacacggga agacgggccc tggattcctt 78; accagtacta cagtggttcc tgtgagaaca cctactccaa ggcaaaccgc ggcttcatca 84; ggacaggagg ggacgagcag caggccttgt gtactgatga attcagtgac atttctcccc 90; tcactggggg caacgtggcc ttttctaccc tggaaggaag gcccagcgcc tataactttg 96; acaatagccc tgtgctgcag gaatgggtaa ctgccactga catcagagta actcttaatc L02; gcctgaacac agat gaagtgttta acgatcccaa agttctcaag tcctattatt L08; atgccatctc tgattttgct gtaggtggca gatgtaaatg acac gcaagcgagt L14; gtatgaagaa cgaatttgat aagctggtgt gtaattgcaa caca tatggagtag L20; actgtgaaaa gtgtcttcct ttcttcaatg accggccgtg gaggagggca actgcggaaa L26; gtgccagtga atgcctgccc tgtgattgca atggtcgatc ccaggaatgc gacc L32; ctgaactcta cact ggccatgggg gccactgtac caactgccag gagg tgccgagaga acttcttccg ccttggcaac ccactgtagt cctgtgggct ctctaagcac acagtgtgat taagccagga gtgatggggg acaaatgtga ccgttgccag tgaagcagga tgcaggccat gctcttgtga tccctctggc aatgtaatat tgaaacagga agatgtgttt gcaaagacaa tgtcgaaggc ttcaattgtg L68; aaagatgcaa acctggattt tttaatctgg aatcatctaa tcctcggggt tgcacaccct 174; gcttctgctt tgggcattct tctgtctgta caaacgctgt tggctacagt gtttattcta 180; tctcctctac ctttcagatt gatg ggtggcgtgc ggaacagaga gatggctctg 186; aagcatctct cgagtggtcc tctgagaggc aagatatcgc cgtgatctca gacagctact 192; ttcctcggta cttcattgct cctgcaaagt gcaa gcaggtgttg agttatggtc 198; agaacctctc cttctccttt cgagtggaca ggcgagatac tcgcctctct gcagaagacc 204; ttgtgcttga gggagctggc ttaagagtat ctgtaccctt tcag ggcaattcct 210i gtga gaccactgtg aagtatgtct tcaggctcca tgaagcaaca cctt 216; ggaggcctgc tcttacccct tttgaatttc agaagctcct aaacaacttg acctctatca 222; agatacgtgg cagt gagagaagtg ctggatattt ggatgatgtc accctggcaa 228; gtgctcgtcc tgga gtccctgcaa cttgggtgga gtcctgcacc tgtcctgtgg 234; gatatggagg gcagttttgt gagatgtgcc tctcaggtta cagaagagaa actcctaatc 240; ttggaccata cagtccatgt gtgctttgcg cctgcaatgg acacagcgag acctgtgatc 246; ctgagacagg tgtttgtaac tgcagagaca atacggctgg cccgcactgt gagaagtgca 252; gtgatgggta ctatggagat tcaactgcag gcacctcctc cgattgccaa ccctgtccgt 258; gtcctggagg ttcaagttgt gctgttgttc ccaagacaaa ggaggtggtg tgcaccaact 264; gtcctactgg caccactggt tgtg agctctgtga tgatggctac tttggagacc 270; ccctgggtag ccct gtgagacttt gccgcctgtg ccagtgcagt gacaacatcg 276; atcccaatgc agttggaaat tgcaatcgct tgacgggaga gaag tata 282; acactgctgg cttctattgt gaccggtgca aagacggatt ttttggaaat cccctggctc 288; ccaatccagc agacaaatgc aaagcctgca attgcaatct gtatgggacc atgaagcagc 294; agagcagctg taaccccgtg acggggcagt gtgaatgttt gcctcacgtg actggccagg 300; actgtggtgc ttgtgaccct ggattctaca atctgcagag tgggcaaggc tgtgagaggt 306; gtgactgcca tgccttgggc aatg ggcagtgtga catccgcacc ggccagtgtg 312; agtgccagcc cggcatcact ggtcagcact gtgagcgctg tgaggtcaac cactttgggt 318; ttggacctga aggctgcaaa ccctgtgact gtcatcctga gggatctctt tcacttcagt 3241 gcaaagatga tggtcgctgt agag aaggctttgt gggaaatcgc tgtgaccagt 330; gtgaagaaaa ctatttctac aatcggtctt ggcctggctg ccaggaatgt ccagcttgtt 336; tggt aaaggataag gttgctgatc tgaa ggaa ttagagagtc 342; tcatagcaaa ccttggaact ggggatgaga tggtgacaga tcaagccttc gaggatagac 348; aagc agagagggaa gttatggacc tccttcgtga ggcccaggat gtcaaagatg 354; ttgaccagaa tttgatggat Cgcctacaga gagtgaataa cactctgtcc agccaaatta 360; gccgtttaca gaatatccgg aataccattg aagagactgg ggct gaacaagcgc 366; atgt agagaacaca gagcggttga ttgaaatcgc atccagagaa cttgagaaag 372; tcgc tgctgccaat gtca ctcagccaga atctacaggg gacccaaaca 378; acatgactct tttggcagaa gaggctcgaa agcttgctga acgtcataaa gctg 384; atgacattgt tcgagtggca aagacagcca cgtc aactgaggca tacaacctgc 390; ttctgagqac actggcagga gaaaatcaaa cagcatttga gattgaagag cttaatagga 396; agtatgaaca agcgaagaac atctcacagg atctggaaaa tgcc cgagtacatg 402; aggaggccaa aagggccggt gacaaagctg tggagatcta cgtg gctcagctga 408; gccctttgga gaca ctggagaatg aagcaaataa cataaagatg gaagctgaga 414; atctggaaca tgac cagaaattaa aagattatga ggacctcaga gaagatatga 420; gagggaagga acttgaagtc cttc aagg caagactgaa cagcagaccg 426; cagaccaact cctagcccga gctgatgctg ccaaggccct cgctgaagaa gctgcaaaga 432; agggacggga taccttacaa aatg acattctcaa caacctgaaa gattttgata 438; ggcgtgtgaa Cgataacaag acggccgcag aggaggcact gatt cctgccatca 444; accagaccat cactgaagcc aatgaaaaga ccagagaagc ccagcaggcc ctgggcagtg 450; ctgcggcgga agag gccaagaaca aggcccatga ggcggagagg atcgcgagcg 456; ctgtccaaaa gaatgccacc agcaccaagg cagaagctga aagaactttt gcagaagtta 462; cagatctgga taatgaggtg aacaatatgt tgaagcaact gcaggaagca gaaaaagagc 468; taaagagaaa acaagatgac gctgaccagg acatgatgat ggcagggatg cagg 474; ctgctcaaga agccgagatc aatgccagaa aagccaaaaa ctctgttact agcctcctca 480; gcattattaa tgacctcttg gagcagctgg ggcagctgga tacagtggac ctgaataagc 486; taaacgagat tgaaggcacc ctaaacaaag ccaaagatga aatgaaggtc agcgatcttg 492; ataggaaagt gtctgacctg gagaatgaag ccaagaagca ggaggctgcc atcatggact 498; ataaccgaga tatcgaggag atcatgaagg acattcgcaa tctggaggac atcaggaaga 5041 ccttaccatc tggctgcttc aacaccccgt ccattgaaaa gccctagtgt ctttagggct 510; ggaaggcagc atccctctga caggggggca gttgtgaggc cacagagtgc cttgacacaa 516; agattacatt tttcagaccc ccactcctct gtcc atgactgtcc ttttgaacca 522; ggaaaagtca cagagtttaa agagaagcaa attaaacatc ctgaatcggg aacaaagggt 528; tttatctaat aaagtgtctc ttccattcac gttgctacct cact ttcccttctg 534; gtga ggacgtggca tcctacgtta ctgtacagtg gcac atcgtgtgag 540; cccatgtatg ctggggtaga gcaagtagcc ctcccctgtc tcatcgatac cagcagaacc 546; tcctcagtct cagtactctt gtttctatga agtt tggctactaa cagtagcatt 552; gtgatggcca gtatatccag tccatggata aagaaaatgc atctgcatct cctacccctc 558; ttccttctaa gcaaaaggaa ataaacatcc tgtgccaaag gtca tttagaatgt 564; cggtagccat ccatcagtgc ttttagttat tatgagtgta ggacactgag ccatccgtgg 570i gtcaggatgc aattatttat aaaagtctcc aggtgaacat ggctgaagat ttttctagta 576; taat tgactaggaa gatgaacttt ttttcagatc tttgggcagc tgataattta 582; aatctggatg ggcagcttgc actcaccaat agaccaaaag acatcttttg atattcttat 588; aaatggaact tacacagaag aaatagggat atgataacca ctaaaatttt gttttcaaaa 594; tcaaactaat tcttacagct tttttattag ttagtcttgg aactagtgtt aagtatctgg 600; cagagaacag ttaatcccta aggtcttgac aaaacagaag aaaaacaagc ctcctcgtcc 606; tagtcttttc tagcaaaggg ataaaactta gatggcagct tgtactgtca gaatcccgtg 612; tatccatttg ttcttctgtt ggagagatga gacatttgac ccttagctcc agttttcttc 618; tgatgtttcc atcttccaga atccctcaaa aaacattgtt tgccaaatcc tggtggcaaa 624; tacttgcact cagtatttca tgcc atcg agttcctgca ctttgtgatt 630; taaatccact ctaaaccttc cctctaagtg tagagggaag acccttacgt ggagtttcct 636; agtgggcttc tcaacttttg atcctcagct ctgtggtttt aagaccacag agtt 642; caca cacccccttc ctcctaccaa cccacctttg agattcatat atagccttta 648; acactatgca actttgtact ttgcgtagca ggggcggggt ggggggaaag aaactattat 654; cact ggtgctatta attatttcaa atttatattt ttgtgtgaat gttttgtgtt 660; ttgtttatca tgattataga ataaggaatt tatgtaaata tacttagtcc tatttctaga 666; atgacactct gttcactttg ctcaattttt cctcttcact ggcacaatgt atctgaatac 672; ctccttccct cccttctaga attctttgga ttgtactcca aagaattgtg ccttgtgttt 678i gcagcatctc cattctctaa aattaatata attgctttcc tccacaccca gccactgtaa 6841 agaggtaact tgggtcctct tccattgcag tcctgatgat cctaacctgc agcacggtgg 690; ttttacaatg agca ggaacgccag gttgacaagc tatggtagga ttaggaaagt 696; aaga ggatctttga agtg gcca aggg agaaatgccc 702; tttctggcaa ttgttggagc tggataggta agttttataa gggagtacat tttgactgag 708; cacttagggc atcaggaaca ctta ggta gactgggaga ggtggtgtaa 714; tctt gatgatccca cttcctgttt ccatctgctt gggatatacc agagtttacc 720; acaagtgttt tgacgatata agct ttcactctgc tccc aggcctcttc 726; tactatggca ggagatgtgg cgtgctgttg caaagttttc acgtcattgt ttcctggcta 732; gttcatttca ttaagtggct acatcctaac atatgcattt ggtcaaggtt gcagaagagg 738; gatt gactgccaag tggg tgaagttcac tccagcaagt ctcaggccac 744; aatggggtgg tttggtttgg tttcctttta tttt tgttatttgc ttttctcctc 750: cacctgtgtg gtatattttt taagcagaat tttt aaaataaaag gttctttaca 756; agatgatacc ttaattacac tcccgcaaca cagccattat tttattgtct agctccagtt 762; atctgtattt tatgtaatgt aattgacagg atggctgctg cagaatgctg acag 768; ggattattat actgctattt ttccctgaat cctt tgaattccaa ctgtggacct 774; tttatatgtg ccttcacttt agctgtttgc cttaatctct acagccttgc tctccggggt 780; ggttaataaa atgcaacact tggcattttt atgttttaag aaaaacagta ttttatttat 7861 aataaaatct gaatatttgt aacccttt n sequence: NCBI Reference Seguence: NP_002284.3 LOCUS NP_002284 ACCESSION NP_002284 l mrgshraapa lrprgrlwpv lavlaaaaaa gcaqaamdec tdeggrpqrc mpefvnaafn 6; ntcg tppeeycvqt gvtgvtksch lcdagqphlq hgaafltdyn nqadttwqu 12; qtmlaquyp ssinltlhlg kafdityvrl kfhtsrpesf aiykrtredg pwipyqyysg 18; scentyskan rgfirtggde qqalctdefs displtggnv afstlegrps aynfdnspvl 24; qewvtatdir vtlnrlntfg devfndpkvl ksyyyaisdf avggrckcng hasecmknef 30L dklvcnckhn tygvdcekcl pffndrpwrr ataesasecl pcdcngrsqe cyfdpelyrs 36L tghgghctnc qdntdgahce rcrenffrlg nneacsschc spvgslstqc dsygrcsckp 42L gvmgdkcdrc qufhsltea gcrpcscdps gsidecniet grcvckdnve gfncerckpg 481 ffnlessnpr gctpcfcfgh ssvctnavgy svysisstfq idedgwraeq rdgseaslew 54; sserqdiavi sdsyfpryfi gkqv lsygqnlsfs frvdrrdtrl saedlvlega 60; glrvsvplia qgnsypsett hea tdypwrpalt pfequllnn ltsikirgty 66L sersagyldd vtlasarpgp gvpatwvesc thvgyggqf cemclsgyrr etpnlgpysp 72L cvlcacnghs etcdpetgvc ncrdntagph cekcsdgyyg dstagtssdc qpcpcpggss 78L cavvpktkev vctncptgtt cddg yfgdplgrng pvrlcrlcqc sdnidpnavg 84L ncnrltgecl kciyntagfy cdrckdgffg nplapnpadk ckacncnlyg tmqusscnp 90; vtgqceclph vtqucgacd pgfynlqsgq gcercdchal hir tgqcecqui 96L tgthercev nhfgfgpegc kpcdchpegs ddgr cecregfvgn rcdqceenyf L02; ynrswpgcqe cpacyrlvkd kvadhrvqu eleslianlg tgdemvtdqa fedrlkeaer L08; evmdllreaq dvkdvdqnlm drlqrvnntl lqni rntieetgnl aeqarahven elekakvaaa nvsvtqpest gdpnnmtlla eearklaerh aynlllrtla genqtafeie elnrkyeqak nisqdlekqa arvheeakra ;26; gdkaveiyas vaqlspldse tleneannik meaenleqli qulkdyedl redmrgkeLe L32; kgkt eqqtadqlla radaakalae eaakkgrdtl qeandilnnl kdfdrrvndn L38; ktaaeealrk ipainqtite anektreaqq algsaaadat eaknkaheae riasavqkna L44; tstkaeaert faevtdldne vnnmlkqlqe rkqd dadqdmmmag masqaaqeae L50; knsv tsllsiindl leqlgqldtv dlnklneieg tlnkakdemk vsdldrkvsd 1561 leneakkqea aimdynrdie eimkdirnle dirktlpsgc fntpsiekp SPRC al Symbol: SPARC Official Name: secreted protein, acidic, cysteine-rich (osteonectin) Gene ID: 6678 Organism: Homo s Other s: ON Other Designations: BM-40; basement-membrane protein 40; cysteine-rich protein; osteonectin; secreted protein acidic and rich in cysteine Nucleotide seguence: NCBI Reference Seguence: NM_003118.3 LOCUS NM_003118 ION NM_003118 1 gggagaagga ggaggccggg ggaaggagga gacaggagga ggagggacca ngggtggag 6; gggagataga cccagcccag agctctgagt ggtttcctgt tgcctgtctc taaacccctc 12; cacattcccg cggtccttca gactgcccgg agagcgcgct ctgcctgccg cctgcctgcc 18; tgccactgag ggttcccagc accatgaggg cctggatctt ctttctcctt tgcctggccg 24; ggagggcctt ggcagcccct cagcaagaag ccctgcctga tgagacagag gtggtggaag ; aaactgtggc agaggtgact gaggtatctg tgggagctaa tcctgtccag gtggaagtag 36; gagaatttga tgatggtgca gaggaaaccg aagaggaggt ggtggcggaa aatccctgcc 42L agaaccacca ctgcaaacac ggcaaggtgt gcgagctgga tgagaacaac acccccatgt 48L gcgtgtgcca ggaccccacc agctgcccag cccccattgg cgagtttgag aaggtgtgca 54L gcaatgacaa caagaccttc gactcttcct tctt tgccacaaag tgcaccctgg 60; agggcaccaa gaagggccac aagctccacc tggactacat cgggccttgc aaatacatcc 66; ccccttgcct ggactctgag ctgaccgaat tccccctgcg ggac aaga 72; acgtcctggt caccctgtat gagagggatg aggacaacaa ccttctgact caga 78; agctgcgggt gaagaagatc catgagaatg agaagcgcct ggaggcagga cccg 84; tggagctgct ggac ttcgagaaga actataacat cttc cctgtacact 90; ggcagttcgg ccagctggac cagcacccca ttgacgggta cctctcccac accgagctgg 96; tgcg tgctcccctc atccccatgg agcattgcac cacccgcttt ttcgagacct L02; gtgacctgga caatgacaag tacatcgccc tggatgagtg ggccggctgc ttcggcatca L08; agcagaagga caag gatcttgtga tctaaatcca ctccttccac agtaccggat L14; tctctcttta accctcccct tcgtgtttcc cccaatgttt tttg gatggtttgt L20; tgttctgcct ggagacaagg tgctaacata gatttaagtg aatacattaa cggtgctaaa L26; aatt ctaacccaag catt cttagctgta acttaactat taaggccttt L32; tccacacgca ttaatagtcc catttttctc tttg tagctttgcc cattgtctta L38; ttggcacatg ggtggacacg gatctgctgg gctctgcctt aaacacacat tgcagcttca L44; acttttctct ttagtgttct gtttgaaact aatacttacc gact ttgtgttcat L50; ttcatttcag ggtcttggct gcctgtgggc ttccccaggt ggcctggagg acacacgatg agga tggttttggg actagaggct agagatccct gcagaaccca ccaaccagaa cgtggtttgc ctgaggctgt aagattctgg gtta tgaaaatata gacattctca cataagccca cacc L74; atttcctcct ttacctttca gtgcagtttc ttttcacatt aggctgttgg tgggagcacg gactgtcagt tctctgggaa gtggtcagcg catcctgcag ggcttctcct 186; cctctgtctt ttggagaacc agggctcttc tcaggggctc tagggactgc caggctgttt 192; ggaa ggccaaaatc aagagtgaga tgtagaaagt tgtaaaatag aaaaagtgga 198; gttggtgaat cggttgttct ttcctcacat ttggatgatt gtcataaggt ttttagcatg 204; cttt tcttcaccct cccctttttt cttctattaa tcaagagaaa agtt 210; aatgggatgg ctca caggctgaga actcgttcac ctccaagcat ttcatgaaaa 216; agctgcttct tattaatcat acaaactctc accatgatgt gaagagtttc acaaatcctt 222i caaaataaaa agtaatgact tagaaactgc cttcctgggt gatttgcatg tgtcttagtc 228; ttagtcacct tattatcctg acacaaaaac acatgagcat acatgtctac acatgactac 234; acaaatgcaa acctttgcaa acacattatg cttttgcaca cacacacctg tacacacaca 240; ccggcatgtt tatacacagg atgg ttcctgtaag cactaagtta gctgttttca 246; tttaatgacc ttaa cccttttgat cactaccacc attatcagca tgag 252; cagctatatc cttttattaa tcatggtcat tcattcattc attcattcac aaaatattta 258; tgatgtattt actctgcacc aggtcccatg ccaagcactg gggacacagt aaag 264; tagacaaagc atttgttcat ttggagctta gagtccagga ggaatacatt agataatgac 270; acaatcaaat ataaattgca agatgtcaca ggtgtgatga agggagagta ggagagacca 276; tgagtatgtg taacaggagg acacagcatt attctagtgc tgtactgttc cgtacggcag 282; ccactaccca catgtaactt tttaagattt aaatttaaat tagttaacat tcaaaacgca 288; gctccccaat cacactagca acatttcaag tgcttgagag ccatgcatga gtta 294; ccctattgaa taggtcagaa tctt ttcatcatca cagaaagttc tattggacag 300; tgctcttcta cata agactacaga gcacttttca aagctcatgc atgttcatca 306; tgttagtgtc tgag ctggggtttt gagactcccc ttagagatag agaaacagac 312; ccaagaaatg tgctcaattg caatgggcca catacctaga tctccagatg tcatttcccc 318; tctcttattt taagttatgt taagattact ataa ctaa aaaatcaaac 324; tgtattctgg cttc tacacagtgg gagggcgagc agtaggagag attggcccat 330; ttggtgctgg ccatttgagg aatgcaagcc cagcactagt atct ctaggaatct 3361 gtagagagag gaattgaagt aaatttcagc attggctcat tcagtcattc ggcgacattc 342; atcaggtacc tgcaatgtgt taggggatct tatgagtagg cagcgtgcgt gatccttgct 348; cccctggagc tttctaacat tctagcaggc agaccacaca taaatttgca atactgtttc 354; aaac gtgctgtaaa ggaaataaag cagagaacta tcatggaaaa aaaaaaaaaa 360; aaaa Protein seguence: NCBI Reference Seguence: 109.1 LOCUS NP_003109 ACCESSION N P_003109 1 mrawiffllc lagralaapq qealpdetev evte vsvganpqu evgefddgae 61 eteeevvaen pcqnhhckhg kvceldennt pmcchdpts cpapigefek vcsndnktfd 121 sschffatkc tlegtkkghk lhldyigpck yippcldsel tefplrmrdw 1knv1vt1ye 181 rdednnllte qulrvkkih enekrleagd hpvellardf eknynmyifp vhwquqldq 241 hpidgylsht elaplrapli pmehcttrff etcdldndky ialdewagcf giqudidkd 301 1vi P3H1 Official Symbol: LEPRE1 Official Name: leucine e-enriched proteoglycan can) 1 m: 64175 Organism: Homo sapiens Other Aliases: PSECO109, GROS1, Ol8, P3H1 Other Designations: growth suppressor 1; leprecan; leucine- and proline- enriched proteoglycan 1; prolyl oxylase 1 Nucleotide seguence: NCBI Reference ce: NM_001146289.1 LOCUS NM_001146289 ACCESSION NM_001 146289 1 atgcgccgcc cggcttggaa ggtggggctt cgcccggggg cgggccttcg ccgggggtag 61 ggcc ttggtggcgg gtggctggcg gttccgttag gtctgaggga gcgatggcgg 121 tacgcgcgtt gaagctgctg accacactgc tcgt ggccgctgcc tcccaagccg 181 aggtcgagtc Cgaggcagga tggggcatgg tgacgcctga tctgctcttc gccgagggga 241 ccgcagccta Cgcgcgcggg gactggcccg tcct gagcatggaa cgggcgctgc 301 gctcccgggc agccctccgc gcccttcgcc tgcgctgccg cacccagtgt gccgccgact 36; tcccgtggga gctggacccc gactggtccc ccagcccggc ctcg ggcgccgccg 42; ccctgcgcga cctgagcttc ttcgggggcc ttctgcgtcg cgctgcctgc ctgcgccgct 48; gcctcgggcc gccggccgcc cactcgctca gcgaagagat ggagctggag ttccgcaagc 54; ggagccccta caactacctg gcct acttcaagat caacaagttg gagaaagctg 60; ttgctgcagc acacaccttc ttcgtgggca atcctgagca catggaaatg cagcagaacc 66; tagactatta ccaaaccatg tctggagtga aggaggccga cttcaaggat cttgagactc 72L aaccccatat attt ggag tgcgactcta ctcagaggaa cagccacagg 78L aagctgtgcc ccacctagag gcggcgctgc aagaatactt tgtggcctat gaggagtgcc 84L gtgccctctg cgaagggccc tatgactacg atggctacaa ctaccttgag tacaacgctg 90; acctcttcca ggccatcaca gatcattaca tccaggtcct caactgtaag cagaactgtg 96; tcacggagct tgcttcccac ccaagtcgag agaagccctt tgaagacttc ctcccatcgc L02; attataatta tctgcagttt gcctactata acattgggaa acag gctgttgaat L08; gtgccaagac ctatcttctc ttcttcccca atgacgaggt gatgaaccaa attatgcagc tatgcttgga gaagaacaca ccagatccat cggcccccgt gagagtgcca L20; aggagtaccg acagcgaagc ctactggaaa aagaactgct tttcttcgct gttt L26; ttggaattcc ggat ccggattcat caga agaagtgatt cccaagagat L32; tgcaagagaa gtca gaacgggaaa cagccgtacg catctcccag gagattggga L38; accttatgaa ggaaatcgag acccttgtgg aagagaagac caaggagtca ctggatgtga L44; gcagactgac ccgggaaggt ggccccctgc tgtatgaagg tctc accatgaact L50; ccaaactcct gaatggttcc cagcgggtgg tgatggacgg cgtaatctct gaccacgagt L56; gtcaggagct gcagagactg accaatgtgg cctc aggagatggc taccggggtc L62; agacctcccc tccc aagt tctatggtgt cactgtcttc aaagccctca L68; agctggggca agaaggcaaa gttcctctgc agagtgccca cctgtactac aacgtgacgg L74; agaaggtgcg gcgcatcatg gagtcctact tccgcctgga tacgcccctc tacttttcct L80: actctcatct ggtgtgccgc actgccatcg aagaggtcca ggcagagagg aaggatgata L86; gtcatccagt ccacgtggac atcc tgaatgccga gaccctcgtg agcccccagc ctacaccttc cgcgactaca gcgccatcct ttacctaaat ggggactth :98; atggcggaaa cttttatttc actgaactgg agac cgtgacggca gaggtgcagc 204; ctcagtgtgg aagagccgtg ggattctctt caggcactga aaacccacat ggagtgaagg 210; ccag ggggcagcgc tgtgccatcg ccctgtggtt caccctggac cctcgacaca 216; gcgagcgggt gagagcagct cgagcgggac agggtgcagg cagatgacct ggtgaagatg 222; agcc cagaagagat ggacctctcc caggagcagc ccctggatgc ccagcagggc 228; ccccccgaac ctgcacaaga gtctctctca ggcagtgaat cgaagcccaa gcta 234; tgacagcgtc agac gtga ctagacccat ggagaggaac tcttctgcac 240; tctgagctgg ccagcccctc gcag agcagtgagc ctacatctgc cactcagccg 246; aggggaccct gctcacagcc ttctacatgg tgctactgct cttggagtgg acatgaccag 252: acaccgcacc ccctggatct ggctgagggc tcaggacaca gcca cccccagggg 258; cctccacagg ccgctgcatg acagcgatac agtacttaag tgtctgtgta gacaaccaaa 264; gaataaatga ttcatggttt tttttacttg gtttgttcag acaatggaaa tttgcccatt 2701 aaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_001139761.1 LOCUS N P_001139761 ACCESSION N 39761 l kllt tllavvaaas qaeveseagw gmvtpdllfa egtaayargd wpgvvlsmer 6; alrsraalra lrlrcrtqca adfpweldpd wspspaqasg lsff ggllrraacl 12; rrclgppaah slseemelef rkrspynqu vayfkinkle kavaaahtff vgnpehmemq 18; qnldyyqtms gvkeadfkdl etqphmqefr lgvrlyseeq pqeavphlea alqeyfvaye 24; ecralcegpy dydgynyley nadlfqaitd hyiqvlnckq ncvtelashp srekpfedfl ; pshynqufa yynignytqa vecaktyllf fpndevmnqn layyaamlge ehtrsigpre 36; sakeyrqrsl Lekellffay dvfgipfvdp dswtpeevip krlqequse retavrisqe 42; ignlmkeiet Lveektkesl dvsrltregg pllyegislt mnskllngsq visd 48; qut nvaatsgdgy rgqtsphtpn ekfygvtvfk alklgqegkv plqsahlyyn 54; vtekvrrime tply fsyshlvcrt aieevqaerk ddshpvhvdn cilnaetlvc 60; vkeppaytfr dysailylng dfdggnfyft eldaktvtae vqpchravg fssgtenphg 66L vkavtrgqrc aialwftldp rhservraar agqgagr Official Symbol: COL6A1 Official Name: collagen, type VI, alpha 1 Gene ID: 1291 Organism: Homo sapiens Other Aliases: OPLL Other Designations: alpha1 (VI) chain (61 AA); collagen VI, alpha-1 polypeptide; collagen alpha-1(VI) chain Nucleotide ce: NCBI Reference Seguence: NM_001848.2 LOCUS NM_001 848 ACCESSION NM_001848 l gctctcactc tggctgggag cagaaggcag cctcggtctc tgggcggcgg ccca 6; ctctgccctg gccgcgctgt accg caggccccag acatgagggc ggcccgtgct 12; ctgctgcccc tgctgctgca ggcctgctgg acagccgcgc aggatgagcc ggagaccccg l8; agggccgtgg ccttccagga ctgccccgtg gacctgttct ttgtgctgga cacctctgag 24; gccc tgaggctgaa gccctacggg gccctcgtgg acaaagtcaa gtccttcacc ; aagcgcttca tcgacaacct gagggacagg tactaccgct gtgaccgaaa gtgg 36; aacgcaggcg cgctgcacta cagtgacgag gtggagatca tccaaggcct cacgcgcatg 42; cctggcggcc gcgacgcact cagc gtggacgcgg tcaagtactt tgggaagggc 48; acctacaccg actgcgctat caagaagggg ctggagcagc tcctcgtggg gggctcccac 54; ctgaaggaga ataagtacct gattgtggtg accgacgggc accccctgga gggctacaag 60; gaaccctgtg gggggctgga ggatgctgtg aacgaggcca agcacctggg cgtcaaagtc 66; gtgg cacc cgaccacctg cgtc tgagcatcat cgccacggac 72; cacacgtacc ggcgcaactt cacggcggct gactggggcc agagccgcga ggag 78; gccatcagcc agaccatcga caccatcgtg gacatgatca aaaataacgt ggagcaagtg 84L tgctgctcct tcgaatgcca gcctgcaaga ggacctccgg ggctccgggg cgaccccggc 90; tttgagggag aacgaggcaa gctc ccaggagaga agggagaagc cggagatcct 96L ggaagacccg gggacctcgg acctgttggg taccagggaa tgaagggaga aaaagggagc 1021 cgtggggaga agggctccag gggacccaag aagg gagagaaggg caagcgtggc 108; atcgacgggg tggacggcgt gaagggggag atggggtacc caggcctgcc caag 114; ggctcgcccg ggtttgacgg cattcaagga ccccctggcc ccaagggaga ccccggtgcc L20; tttggactga aaggagaaaa gggcgagcct ggagctgacg gggaggcggg gagaccaggg L26; agctcgggac catctggaga ccag ccgggagagc ctgggccccc cggagagaaa L32; ggagaggcgg gcgacgaggg gaacccagga cctgacggtg cccccgggga gcggggtggc L38; cctggagaga gaggaccacg ggggacccca ggcacgcggg gaccaagagg agaccctggt L44; ggcc cgcagggtga tcagggaaga gaaggccccg ttggtgtccc cccg L50; ggcgaggctg gccctatcgg acctaaaggc taccgaggcg gtcc cccagggtcc L56; gagggtgcca gaggagcccc aggacctgcc ggaccccctg cggg gctgatgggt L62; gaaagggqag aagacggccc cgctggaaat ggcaccgagg gcttccccgg L68; tatccggqca acaggggcgc tcccgggata aacggcacga agggctaccc cggcctcaag L74; ggggacgagg gagaagccgg ggaccccgga gacgataaca acgacattgc accccgagga L80; gtcaaaggag caaaggggta ccggggtccc gagggccccc cccc aggacaccaa L86; ggaccgcctg ggccggacga atgcgagatt ttggacatca tcatgaaaat tgtgaatgca agtgcggccc catcgacctc ctgttcgtgc tggacagctc agagagcatt L98; ggcctgcaga acttcgagat tgccaaggac gtca aggtcatcga ccggctgagc 204; cgggacgagc tggtcaagtt cgagccaggg cagtcgtacg nggtgtggt cagc 210; cacagccaga tgcaggagca Cgtgagcctg cgcagcccca gcatccggaa Cgtgcaggag 216; ctcaaggaag ccatcaagag cctgcagtgg atggcgggcg gcaccttcac gggggaggcc 222; ctgcagtaca cgcgggacca gctgctgccg cccagcccga acaaccgcat cgccctggtc 228; atcactgacg ggcgctcaga cactcagagg gacaccacac cgctcaacgt gctctgcagc 234; cccggcatcc aggtggtctc Cgtgggcatc gtgt ttgacttcat cccaggctca 240; gaccagctca atgtcatttc ttgccaaggc ccat gccg gcccggcctc 246; tcgctggtca aggagaacta tgcagagctg ctggaggatg ccttcctgaa gaatgtcacc 252; gcccagatct gcatagacaa gaagtgtcca gattacacct gccccatcac gttctcctcc 258; ccggctgaca tcct gctggacggc tccgccagcg tgggcagcca tgac 264: accaccaagc gcttcgccaa gcgcctggcc gagcgcttcc tcacagcggg caggacggac 2701 cccgcccacg gggt ggcggtggtg cagtacagcg gcacgggcca gcagcgccca 276; gagcgggcgt cgctgcagtt cctgcagaac tacacggccc tggccagtgc cgtcgatgcc 282; atggacttta acgc cgtc aacgatgccc tgggctatgt gacccgcttc 288; taccgcgagg cctcgtccgg cgctgccaag aagaggctgc tgctcttctc agatggcaac 294; tcgcagggcg ccacgcccgc tgccatcgag aaggccgtgc aggaagccca gcgggcaggc 300; atcgagatct tcgtggtggt Cgtgggccgc caggtgaatg agccccacat ccgcgtcctg 306; gtcaccggca agacggccga gtacgacgtg gcctacggcg agagccacct gttccgtgtc 312; cccagctacc aggccctgct ccgcggtgtc ttccaccaga cagtctccag gaaggtggcg 318; ctgggctagc ccaccctgca cgccggcacc aaaccctgtc ctcccacccc tccccactca 324; tcactaaaca gagtaaaatg tgatgcgaat tttcccgacc aacctgattc gctagatttt 330; ttttaaggaa aagcttggaa agccaggaca tgct gcctgctttg tgcagggtcc 3361 tccggggctc agccctgagt tggcatcacc tgcgcagggc cctctggggc tcagccctga 342; gctagtgtca cctgcacagg gccctctgag gctcagccct gagctggcgt cacctgtgca 348; gggccctctg gggctcagcc ctgagctggc ctcacctggg accc cgggctctcc 3541 tgccctgccc tcctgcccgc cctccctcct gcctgcgcag ctccttccct aggcacctct 360; catc ccaccagcct gacg ccctctcggg gcctgtgccg cactagcctc 366; cctctcctct gtccccatag ctggtttttc ccaccaatcc tcacctaaca gttactttac 372; aattaaactc agct cttctcctca gcttggggca gccattggcc tctgtctcgt 378; tttgggaaac caaggtcagg aggccgttgc agacataaat ctcggcgact cggccccgtc 384; tcctgagggt cctgctggtg accggcctgg accttggccc tacagccctg gaggccgctg 390; agca ctgaccccga cctcagagag tactcgcagg ggcgctggct gcactcaaga 396; ccctcgagat taacggtgct aaccccgtct ccct cccgcagaga ctggggcctg 4021 gactggacat gagagcccct tggtgccaca gagggctgtg tcttactaga aacaacgcaa 4081 acctctcctt aata tgtt cgacgtttta tcaaaggccc cctttctatg 4141 ttcatgttag ttttgctcct tctgtgtttt aacc atatccatgt tgctgacttt 4201 tccaaataaa ggttttcact cctctaaaaa aaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_001839.2 LOCUS N P_001 839 ACCESSION N P_001839 l mraarallpl taaq depetprava fqdcpvdlff sval alvd 61 kvksftkrfi dnlrdryyrc drnlvwnaga 1hysdeveii qgltrmpggr dalkssvdav 121 kyfgkgtytd caikkgleql lvggshlken kylivvtdgh plegykepcg gledavneak 18; hlgvkvfsva itpdhleprl siiatdhtyr rnftaadwgq srdaeeaisq tidtivdmik 24; nnveqvccsf ecqpargppg fege rgkpglpgek geagdpgrpg dlgpvqugm ; kgekgsrgek gsrgpkgykg ekgkrgidgv dgvkgemgyp glpgckgSpg fdgiqupgp 36; kgdpgafglk gekgepgadg eagrpgssgp Sgdegquep gppgekgeag degnpgpdga 42; pgerggpger gprgtpgtrg prgdpgeagp quqgregpv nggdpgeag pigpkgyrgd 48; egppgsegar gppg dpglmgerge dgpagngteg fpgfpgypgn rgapgingtk 54L gdeg ddnn diaprgvkga kgyrgpegpq gppghqupg pdeceildii 60; mkmcscceck cgpidllfvl dssesiglqn fvvk vidrlsrdel kaepgqsya 66L gvvqyshsqm qehvslrsps irnvqelkea ikslqwmagg tftgealqyt rdqllppspn 721 nrialvitdg rsdtqrdttp lnvlcspgiq vvsvgikdvf dfipgsdqln viscqglaps 78; qgrpglslvk enyaelleda flknvtaqic idkkcpdytc pitfsspadi tilldgsasv 84; ttkr fakrlaerfl tagrtdpahd vrvavvqysg tgqqrperas qulqnytal 90; asavdamdfi ndatdvndal yrea ssgaakkrll lfsdgnsqga tpaaiekavq 96; eaqragieif vvvvgrqvne phirvlvtgk taeydvayge shlfrvpsyq allrgvtht 102; vsrkvalg Official Symbol: CRTAP al Name: cartilage ated protein Gene ID: 10491 Organism: Homo sapiens Other Aliases: CASP, LEPREL3, O|7 Other Designations: cartilage-associated protein; leprecan-Iike 3 Nucleotide seguence: NCBI Reference Seguence: NM_006371.4 LOCUS NM_006371 ACCESSION NM_006371 l aggctggcgt ccccgccccg aaagcactgg gcccgccgcg tcgcaccgtc ctctttcctt 61 tccttctccc tccccttttc ccttccttcg tcccttcctt ccttcctttc gccgggcgcg 12; atggagccgg ggcgccgggg ggccgcggcg ctgctagcgc tgctgtgcgt ggcctgcgcg 18; ctgcgcgccg ggcgcgccca atacgaacgc tacagcttcc gcagcttccc acgggacgag 24; ctgatgccgc tcgagtcggc ctaccggcac gcgctggaca agtacagcgg cgagcactgg ; agcg tgggctacct ggagatcagc ctgcggctgc accgcttgct gcgcgacagc 36; gaggccttct gccaccgcaa ctgcagcgcc gcgccgcagc ccgagcccgc cgccggcctc 42; tatc tgcg cctcttcggg ggcctgctgc gccgcgcgca caag 48L cgctgcaagc agggcctgcc agccttccgc cagtcccagc ccagccgcga ggtgctggcg 54L gacttccagc gccgcgagcc ctacaagttc ctgcagttcg cttacttcaa ggcaaataat 60; ctccccaaag ccatcgccgc tgctcacacc tttctactga agcatcctga tgacgaaatg 66L atgaagagga cata ttataagagc ctgcctggtg ccgaggacta cattaaagac 72; acca agtcatatga aagcctgttc atccgagcag tgcgggcata caacggtgag 78; aactggagaa catccatcac agacatggag ctggcccttc ccgacttctt caaagccttt 84; tacgagtgtc tcgcagcctg cgagggttcc atca aggacttcaa ggatttctac 90; ctttccatag cagatcatta tgtagaagtt tgca aaatacagtg tgaagagaac 96; ctcaccccag ttataggagg ctatccggtt gagaaatttg tggctaccat gtatcattac L02; ttgcagtttg cctattataa gttgaacgac ctgaagaatg cagccccctg tgcagtcagc L08; ctct ttgatcagaa tgacaaggtc atgcagcaga acctggtgta ttaccagtac L14; cacagggaca cttggggcct ctcggatgag cacttccagc ctga agcagttcag L20; ttctttaatg tgaccacact ccagaaggag ctgtatgact ttgctaagga aaatataatg L26; gatgatgatg agggagaagt tgtggaatat gtggatgacc tcttggaact ggaggagacc L32; agctagccca cagcaaccaa agagacttcc cgtt caggaaacac agattctttg L38; tccttttccc aacagcccag gata cctcagagcc ttac tctccaaagt L44; gaaagggaag cccccgtctc tctaactgca tgtcatcagg ggtgagcctg cctttcctat L50; cttcacacct gccacctcat gttcacacct atctttctca cctttttttt gagatggagt L56: ctct tgcccaggct ggagtgcaat ggcacgttct cagctcactg caacctccgc L621 ctcttggqtt caagcaattc tgctgcatca gcctcccgag tacctgggat 168; tgccaccacg cccggctaat tttgtatttt tagtagagac ggggttttgc catgttggcc L74; aggctggtct cgaactcttg acttcagatg atccatctgc cttggcctcc cacagtgctg L80; ggattacagg Cgtgagccac catgcccggc ctctttctca cctttacacc tgtcttctta L86; tcctcacatc tgttttcaca ccttcatccc tgtcttcctc atgttcacac ttgtcttccc 192; catgttcata gctgcctttc ttaccatttt ggtttgaagg gcagtcttct ctggcttgtt 198; tttttgtttt aaaa tcagtattat tttttaaata agaaaaacat tcctagaaga 204; tgataattgt ctcc tttggcttat ttgcttttcc agattttagt ctcctttctc 210; cccatccggg aaagatggtg gaagacatag gctaaatttc tccagcctca caatggtctt 216; cacttggtct gacttgtacc agca cccactgaaa aacaagttga gtagagagtg 222; gcag aaatgtggct ccac tttgcatctc caaaattaca acggttggcc 228; gatcccattt gaggacaatg cttagttata agtctccgag ttggaaaagg aagaaagcca 234; gagctgtcta gtttcattca ttctttcagt aaatatttat tgagtaccta ctgtgtgcta 240; ggcattgacc tgggaactag agatacttca cagaataaca gggaaagttc cctgtgctca 246; tggagcttac attctacagg gagaaagaga atac ataggaataa atatatacaa 252; ggtatcatgt aatt gctgtggaga aaaataaagc aggggaggga gtaagaaatc 258; ctggagatga ggctgcagtt gggg cctcactggg aatgtgacgt tgagcagaga 264; cgttagggaa gtggatcctg gacaaggcat tccaggcaga ggaacaggat gtgcactgcc 270; ccaaagtgag aacttgctct acgtggtcag gaaagagcag ggagaccaag cagagtcgtg 276; ggcaggggta gaatggaagg agaggcggct ggggaggaca ggtggtggag ggct 282; tctgctaagt gagatgggaa ccactggagg gtttgaacag aggagtgcct tgattgattt 288; tgca agggtcattc tagctgccat attgtgaaaa gtgg acaagggcag 294; aaggaagagg gaagacctgt taggaagcta ctgcaaggtt ccaggcttgg gcctgggcca 300; cagcaacagc agtggtcaaa tatctagatt tattttgaaa agagccaata ggatttgctg 306; agagtttgaa tgtggagtgt aagagaagga agagttaatg atgacattaa ggtttttggc 312; ctgaatagca ggaaagatgg agttaccagt tactgaaata gggaaggatg ggctgggtaa 318; gtatggaatt tggtgcaaag caggctgtct gtggttggaa tgggaggttc tggctgcaaa 324; tcaaagtgga gagttctctc aggtcaggtc tgcagcagag ctcgagacag ggatctgaat 330; gcacttggtt tggg ggtgctctca gaaggaacct gtgaaagcct ttatcagtca 336; tttattggct gtgagaagtt ctctgggagt gtgggtacat ttgaaggcaa gtgacttcag 3421 ttgagggcaa gtctctggaa aagaggctgt ctgg ccat gcatggtagt 3481 qtqttqqqqq tgggggtcct gggcactggc tgtgtgaagg gatctggcag ggcaccacag 354; cgccccctac tgaaccatca gcatgtcagt ggcatttaaa gccatgcagc tggaggggcc 360; actgagattg agta ttactgagaa gcaacagaaa atgg atggagccct 366; tgggctctct gggaaatggg gcca aaggactgag aaggagttac cttaaggtca 372; gagaaaacca agagagtgtg gtgttctgga agctgagctt tctttattca acctcattcc 378; cttctccaaa actt gtgtagttgg gcccctccag ggttgaaggc aagaggagaa 384; aggcacagcg aaac aagacttttc ctgcaatagc ctgggaagga ggat 390; agagtgtttg ggtttttgtg taatggtggt taattggggt ggaacactca cacgttgtgc 396; tggg cttcccttat cccccagaac actctaccaa cctcggggaa ctcgggcaca 402; tccttctgtt tctccttcag ctctatcctg ctttcctcat cccttctgac accacgtcct 408; cctg cacaagaatc cctgcatcag gttctccttt accc acccaggaca 414; gtcccctacc acttctgtct tgggctgaag ttgcccacgt ccacaaaatc tgtactccca 4201 chqqqqtqt ttggcccgag gagtcagtgt tattactggt accg tgtccacagc 426; agcccccaat cccagcgatg cgtcagatct tacgtggctt cctgctgggg gcct 432; tcacccacgg gatgccgggt tctcctttct ttcctcaccc caacctttac tccaccagag 438; aaacttcctt ttgaactcag tggggaagag ggtgatgaga caggactaga aagtagtggg 444; ggacccagcg agtggacgcc ctgctccggg attcctgagt ctgtaaatag tgtgcccagc 450; agctgtgaac tccccttata gcctcaggct gcagtgtcct ctgt aaat 456; gaaagccgac gtccacaggg acccaggcag ggttgggtgt tgtgactcac tctg 462; tgccctgcag aggtactgtt gggtccttgt cttgtgagcc tggggtgagc tctctgtaca 468; tgttgttgtt ccacgtatgg gttgacttgg gggg ggtcctcgtt cactctctga 474; agttggcctc ctttcactgg ggattgaaaa ccac ccctacccta tccc 480; ctgaggaccc gggtgatagt acagtcaata ttgtcagtac tttgctttga ttgaaggctg 486; tagagctgag ttaccaaaat ttctatttca acca aaccttaaaa aaaaaaaaca 492; aaaactgggc tgggtcttcc aaacctacca tgaaaccctg gtgtgcaggc tgcactcaat 498; gacctcaacc caacacctcc ctgagtgtgc ttcttggaag agcctagaag attcctggat 504; ggagacccca ttggttcagc ctcaagtctg gcccgtcttc gaaaaaacaa acacatttgt 510; aagctttgtg ggagcttcca ggcctgctct aagatgcctt gcttgtcctt tgacccatca 516; gcatggagct ttgc tgtttggttc tgcaggctgg tggggaggcc gcccatcgtg 5221 gtggggcatc tgtccagccc cattgccact cagggcatcc aaacaggagg cacccgctgg 528; tcta aagatactcc ttgtggccac tgctactgtt cacacttgac ttgtggagaa 534; gcgaagggct gaggggaggt ttgtgtacac ccatgtattt aaaagtgact gactgactga 540; aatgagcaca taccgacata tgcaacatac taataccttc ttcg agactttcta 546; attactacaa ctaacctgtt gtgctcacct ctggaattca gaaagagagc cactgcgagc 552; actgaccaca agggctgcct taggaaggaa atgtgtgctt tcaggagttc gggg 558; aattttaaga gcacaaaaat tttattgaac ccaccttagt gacaaacaga aaatgatttg 564; ttagattgtc agttggaagg ggtttattat gactgctggt aaca agtt 570; tctgccaaca tttatggaaa taacgtttct aggttttaaa tgtgagccgt aagc 576; agtt aaaaaaaaaa atac ttaaactgta gggaaaaggt ggattggtgc 582; cagagaaaac taat agtgtcagaa catggaactg caacacagtt tgtagcaagg 5881 gaaa aacccacaac taatccttca tcgcagctgt ctgaactctt gatcatctgc 5941 catcccccaa acagtgactt ctttttttct ggtgactcca ggcctgaatg acctagtgtg 6001 gagagtttaa actatgtaca agggaggaaa gaaaaaaagg aaaggaacct taagtaagcc 6061 tcagccaaag gttttatgca ttggacttcc tgtgcctgcc cccaggggca agactgatcc 612; ccatgcctgt gcccatgaca tctccctgaa agaggacacc atgacagccc ggctttgcct 618; accc actgctaccc cagaaataag aatcaagagc agctattgtt atccttagag 624; tgttttccgt ctagggccgg aaca gccacatatc cttgcacctg acactgtccc 630; cacacaaata gctggctttc gttgcttgtt gaatgaatga gtgagttggc tctatatccc 636; cttggagctg gccggtaaga tattagtgcc tcattttaca agaagagaat aggaaggaat 642; taagcaattt ggccaacaga tacaagatag gagt tttatctccc actttagggt 648; ggcagccagt aggccaaact ccaaagaccg ttgctgatgt ctgc ctcccctctt 654; tgggttagtg tggtatgtac aagctcactc ttgttgaaaa ttagaaaata gttgaaaaca 660; aaaggttttt gtttttcttt ttaaatcaca ttaaatgttt tacattgctt aaaaaaaaaa 666; aaaaaaaa Protein seguence: NCBI Reference Seguence: NP_006362.1 LOCUS N P_006362 ACCESSION NP_006362 1 mepgrrgaaa llallcvaca lragraqyer ysfrsfprde 1mp1esayrh aldkysgehw 61 aesvgyleis lrlhrllrds eafchrncsa apqpepaagl asypelrlfg gllrrahclk 121 rckqglpafr qsqpsrevla dfqrrepykf quayfkann 1pkaiaaaht ddem 181 mkrnmayyks lpgaedyikd letksyeslf iravraynge nwrtsitdme lalpdffkaf 241 yeclaacegs reikdfkdfy yvev leckiqceen ltpviggypv ekfvatmyhy 301 quayyklnd lknaapcavs yllqundkv mqqnlvyyqy hrdtwglsde eavq 361 ffnvtthke lydfakenim dddegevvey vddlleleet s SERPH Official Symbol: SERPINH1 Official Name: serpin peptidase inhibitor, clade H (heat shock protein 47), member 1, (collagen binding protein 1) Gene ID: 871 Organism: Homo sapiens Other Aliases: PlG14, AsTP3, CBP1, CBP2, HSP47, OI10, PPROM, RA-A47, SERPINH2, gp46 Other Designations: 47 kDa heat shock protein; arsenic-transactivated protein 3; cell proliferation-inducing gene 14 protein; colligin-1 ; colligin-2; rheumatoid arthritis antigen A-47; rheumatoid arthritis-related antigen ; serine (or cysteine) proteinase tor, clade H (heat shock protein 47), member 1, (collagen binding protein 1); serine (or cysteine) proteinase inhibitor, clade H (heat shock protein 47), member 2, (collagen-binding protein 2); serpin H1 Nucleotide seguence: NCBI nce Seguence: NM_001207014.1 LOCUS NM_001207014 ACCESSION NM_001207014 1 agtaggaccc aggggccggg aggcgccggc agagggaggg gccgggggcc ggggaggttt 6; tgagggaggt ctttggcttt ttttggcgga gctggggcgc cctccggaag cgtttccaac 12; tttccagaag tttctcggga cgggcaggag ggggtgggga ctgccatata ccgg 18; gagcagggga taag agtagaatcg tgtcgcggct cgagagcgag agtcacgtcc 24; cggcgctagc ccagcccgac ccagaatgaa aaaggcaggc ctcc ctctgaggca ; gtttccaggc ccaccgtggt gcacgcaaac cacttcctgg ccatgcgctc cctcctgctt 36; ctcagcgcct tctgcctcct ggaggcggcc ctggccgccg aggtgaagaa acctgcagcc 42; gctc ctggcactgc ggagaagttg agccccaagg cggccacgct tgccgagcgc 48; ggcc tcag cttgtaccag gccatggcca aggaccaggc agtggagaac 54; atcctggtgt tggt ggtggcctcg tcgctagggc tcgtgtcgct gggcggcaag 60; gcgaccacgg cgtcgcaggc caaggcagtg ctgagcgccg tgcg cgacgaggag 66; gtgcacgccg gcctgggcga gcgc tcactcagca actccacggc gcgcaacgtg 72; acctggaagc tgggcagccg actgtacgga cccagctcag tgagcttcgc cttc 78; gtgcgcagca gcaagcagca ctacaactgc gagcactcca actt ccgcgacaag 84; gcgc ccat caacgagtgg gccgcgcaga ccaccgacgg caagctgccc 90; gaggtcacca tgga gcgcacggac ggcgccctgc tagtcaacgc catgttcttc 96; aagccacact gggatgagaa attccaccac aagatggtgg acaaccgtgg cttcatggtg L02: actcggtcct ataccgtggg tgtcatgatg cgga caggcctcta gacgacgaga aggaaaagct gcaaatcgtg gagatgcccc tggcccacaa ctcatcatcc tcatgcccca tcacgtggag cctctcgagc gccttgaaaa gctgctaacc 1201 aaagagcagc tgaagatctg gatggggaag atgcagaaga aggctgttgc cccaagggtg tggtggaggt gacccatgac aaac acctggctgg actgaggcca ttgacaagaa caaggccgac ttgtcacgca tgtcaggcaa gaaggacctg L38; tacctggcca gcgtgttcca cgccaccgcc tttgagttgg acacagatgg caaccccttt L44; gaccaggaca tctacgggcg cgaggagctg cgcagcccca agctgttcta cgccgaccac L50; atct tcctagtgcg ggacacccaa agcggctccc tcat tgggcgcctg L56; gtccggccta agggtgacaa gatgcgagac gagttatagg gcctcagggt gcacacagga L62; tggcaggagg catccaaagg ctcctgagac acatgggtgc ggtt gggggggagg L68; tgaggtacca gccttggata gggg tgggggtgga aaaacagacc ggggttcccg L74; tgtgcctgag cggaccttcc cagctagaat tcactccact tggacatggg ccccagatac L80; catgatgctg agcccggaaa ctccacatcc tgtgggacct gggccatagt gcct L86; gccctgaaag tcccagatca agcctgcctc aatcagtatt catatttata gccaggtacc L92; ttctcacctg tgagaccaaa ttgagctagg ggggtcagcc agccctcttc tgacactaaa L98; acacctcagc tgcctcccca gctctatccc aacctctccc aactataaaa ctaggtgctg 204; cagcccctgg gaccaggcac ccccagaatg acctggccgc agtgaggcgg attgagaagg 210; agctcccagg aggggcttct actc tggtcaagaa gcatcgtgtc tggcgttgtg 2161 gggatgaact ttttgttttg tttcttcctt ttttagttct tcaaagatag ggagggaagg 222; gggaacatga gcctttgttg ctatcaatcc aagaacttat ttgtacattt tttttttcaa 228; taaaactttt ccaatgacat tttgttggag agaa aaaaaaaaaa aaa Protein seguence: NCBI nce Seguence: NP_001193943.1 LOCUS N P_001 1 93943 ACCESSION N 93943 1 mrsllllsaf clleaalaae vkkpaaaaap gtaeklspka atlaersagl amak 61 dqavenilvs pvvvasslgl vslggkatta sqakavlsae qlrdeevhag 1ge11rslsn 121 starnvtwkl gsrlygpssv sfaddfvrss kqhyncehsk infrdkrsal qsinewaaqt 181 tdgklpevtk dvertdgall vnamffkphw dekfhhkmvd nrgfmvtrsy tvgvmmmhrt 241 glynyyddek mpl ahklssliil mphhvepler 1ek11tkeq1 kiwmgkmqkk 301 avaislpkgv vevthdlqkh laglglteai dknkadlsrm sgkkdlylas vfhatafeld 361 tdgnpqudi rspk lfyadhpfif 1vrdtqsgsl 1figr1vrpk |TB1 Official Symbol: lTGBf Official Name: integrin, beta 1 (fibronectin receptor, beta ptide, antigen CD29 includes MDF2, MSK12) Gene ID: 3688 Organism: Homo sapiens Other Aliases: RP11-479G22.2, CD29, FNRB, GPIIA, MDF2, MSK12, VLA- BETA, VLAB Other Designations: integrin VLA-4 beta subunit; integrin beta-1 ; very late activation protein, beta polypeptide Nucleotide seguence: NCBI Reference Seguence: NM_002211.3 LOCUS NM_00221 1 ACCESSION NM_002211 1 atcagacgcg cagaggaggc ggggccgcgg ctggtttcct gccggggggc ggctctgggc 61 cgccgagtcc cctcctcccg cccctgagga ggaggagccg ccgccacccg ccgcgcccga 121 ggag gccccgccag cccgcgggag aggcccagcg ggagtcgcgg aacagcaggc 181 ccac cgcgccgggc cccggacgcc gcgcggaaaa gatgaattta attt 241 tctggattgg actgatcagt tcagtttgct gtgtgtttgc agat gaaaatagat 301 gtttaaaagc aaatgccaaa tcatgtggag aatgtataca agcagggcca aattgtgggt 36; ggtgcacaaa ttcaacattt ttacaggaag gaatgcctac ttctgcacga gatt 42; tagaagcctt aaaaaagaag ggttgccctc cagatgacat agaaaatccc agaggctcca 48; aagatataaa gaaaaataaa aatgtaacca accgtagcaa aggaacagca gagaagctca 54; agccagagga tcag atccaaccac agcagttggt tttgcgatta agatcagggg 60; agccacagac atta aaattcaaga gagctgaaga ctatcccatt gacctctact 66; accttatgga ttac tcaatgaaag acgatttgga gaatgtaaaa agtcttggaa 72L cagatctgat gaatgaaatg aggaggatta actt cagaattgga tttggctcat 78L ttgtggaaaa gactgtgatg ccttacatta gcacaacacc agctaagctc aggaaccctt 84L gcacaagtga acagaactgc accagcccat ttagctacaa aaatgtgctc agtcttacta 90; ataaaggaga agtatttaat gaacttgttg gaaaacagcg catatctgga aatttggatt 96; ctccagaagg cgat gccatcatgc aagttgcagt atca ctgattggct L02; ggaggaatgt gctg ctggtgtttt ccacagatgc tcac tttgctggag L08; atgggaaact tggtggcatt gttttaccaa atgatggaca atgtcacctg gaaaataata L14; tgtacacaat gagccattat tatgattatc cttctattgc tcaccttgtc cagaaactga L20; ataa tattcagaca atttttgcag ttactgaaga atttcagcct gtttacaagg L26; agctgaaaaa cttgatccct aagtcagcag taggaacatt atctgcaaat tctagcaatg L32; taattcagtt gatcattgat gcatacaatt ccctttcctc agaagtcatt ttggaaaacg L38; gcaaattgtc agaaggcgta acaataagtt ctta ctgcaagaac ggggtgaatg L44; gaacagggga aaatggaaga aaatgttcca atatttccat tggagatgag gttcaatttg L50; aaattagcat aacttcaaat aagtgtccaa aaaaggattc tgacagcttt aaaattaggc L56; ctctgggctt tacggaggaa gtagaggtta ttcttcagta catctgtgaa tgtgaatgcc L62; aaagcgaagg catccctgaa agtcccaagt gtcatgaagg aaatgggaca tttgagtgtg L68; gcgcgtgcag gtgcaatgaa gggcgtgttg gtagacattg tgaatgcagc acagatgaag L74; ttaacagtga agacatggat gcttactgca aaaa cagttcagaa atctgcagta L80; gaga gtgcgtctgc ggacagtgtg tttgtaggaa gagggataat gaaa L861 ctgg caaattctgc gagtgtgata atttcaactg atcc aaatggtgtt tgcaagtgtc gtgtgtgtga gtgcaacccc L98; gcagtgcatg tgactgttct ttggatacta gtacttgtga agccagcaac ggacagatct 204; gcaatggccg ctgc gagtgtggtg tctgtaagtg tacagatccg aagtttcaag 210; ggcaaacgtg tgagatgtgt cagacctgcc ttggtgtctg tgctgagcat aaagaatgtg 216; ttcagtgcag caat aaaggagaaa agaaagacac atgcacacag gaatgttcct 222; attttaacat taccaaggta gaaagtcggg acaaattacc ccagccggtc caacctgatc 228; ctgtgtccca ttgtaaggag aaggatgttg acgactgttg gttctatttt acgtattcag 234; tgaatgggaa caacgaggtc atggttcatg agaa tccagagtgt cccactggtc 240; cagacatcat tccaattgta gtgg ttgctggaat tgttcttatt ggccttgcat 246; tactgctgat atggaagctt ttaatgataa acag aagggagttt gctaaatttg 252; aaaaggagaa aatgaatgcc gaca cgggtgaaaa tcctatttat aagagtgccg 258; taacaactgt ggtcaatccg aagtatgagg gaaaatgagt actgcccgtg ccac 264; aacactgaat gcaaagtagc aatttccata gtcacagtta ggtagcttta gggcaatatt 270; gccatggttt tactcatgtg caggttttga aaatgtacaa taat ttttaaaatg 276; ttttattatt ttgaaaataa tgttgtaatt catgccaggg actgacaaaa gacttgagac 282; aggatggtta ctcttgtcag ctaaggtcac attgtgcctt tttgaccttt tcttcctgga 288; ctattgaaat caagcttatt ggattaagtg atatttctat agcgattgaa agggcaatag 294; taat gagcatgatg tctg atgt attaaaactg atttttagct 300; ttacaaatat gtcagtttgc agttatgcag aatccaaagt aaatgtcctg ctagctagtt 306; aaggattgtt ttaaatctgt tattttgcta tttgcctgtt agacatgact gatgacatat 312; ctgaaagaca agtatgttga gagttgctgg tgtaaaatac gtttgaaata gttgatctac 318; aaaggccatg ggaaaaattc agagagttag gaaggaaaaa gctt taaaacctgt 324; gtgccatttt aagagttact taatgtttgg taacttttat gccttcactt tacaaattca 330; agccttagat aaaagaaccg agcaattttc tgctaaaaag tccttgattt agcactattt 336; aggc catactttac aaagtatttg gggg accttttgag ttgaatttat 342; tttattattt ttattttgtt taatgtctgg tgctttctgt cacctcttct aatcttttaa 348; tgtatttgtt tgcaattttg gggtaagact ttttttatga gtactttttc tttgaagttt 354; tagcggtcaa tttt taatgaacat gtgaagttat actgtggcta tgcaacagct 360; ctcacctacg cgagtcttac tttgagttag tgccataaca gaccactgta cttc 3661 tcaccatttg agttgcccat cttgtttcac actagtcaca ttcttgtttt aagtgccttt 372; agttttaaca gttcactttt tacagtgcta tttactgaag ttatttatta ccta 378; aaatacttaa atcggatgtc ttgactctga tgtattttat caggttgtgt gcatgaaatt 3841 tttatagatt aaagaagttg aggaaaagca aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_002202.2 LOCUS NP_002202 ACCESSION 202 l mnlqpifwig ;issvccvfa qtdenrclka eciq wctn stflqegmpt 6; sarcddlea; kkkgcppddi enprgskdik knknvtnrsk gtaeklkped qq;v 12; lrlrsgepqt ftlkfkraed ypidlyylmd lsysmkddle nvkslgtdlm nemrritsdf l8; rigfgsfvek tvmpyisttp aklrnpctse qnctspfsyk nvlsltnkge vfnelvgqu 24; isgnldspeg gfdaimqvav cgsligwrnv trllvfstda gfhfagdgkl ggivlpndgq ; chlennmytm shyydypsia hlvqklsenn iqtifavtee fqpvykelkn lipksavgtl 36; sanssnviql iidaynslss evilengkls egvtisyksy ckngvngtge ngrkcsnisi 42; gdevqfeisi tsnkcpkkds dsfkirplgf teevevilqy icececqseg cheg 48; ngtfecgacr grhc ecstdevnse dmdaycrken sseicsnnge cvcgqcvcrk 54; rdntneiysg kfcecdnfnc drsnglicgg ngvckcrvce cnpnytgsac dcsldtstce 60; asngqicngr gicecgvckc tdpqugqtc emcqtclgvc aehkecvqcr afnkgekkdt 66; ctqecsyfni dklp qpvqupvsh ckekdvddcw fyftysvngn nevmvhvven 72; pecptgpdii pivagvvagi vliglallli wkllmiihdr refakfekek tgen 78; piyksavttv vnpkyegk FKB10 al Symbol: FKBP10 Official Name: FK506 binding protein 10, 65 kDa Gene ID: 60681 sm: Homo sapiens Other Aliases: PSEC0056, , Ol11, Ol6, PPIASE, hFKBP65 Other Designations: 65 kDa FK506-binding protein; 65 kDa FKBP; FK506- binding protein 10; FKBP-10; FKBP-65; PPlase FKBP10; immunophilin ; peptidyl-prolyl cis-trans isomerase FKBP10; rotamase Nucleotide seguence: NCBI Reference Seguence: NM_021939.3 LOCUS NM_021 939 ACCESSION NM_021939 l cccgagcctc tctccctggc caggccccag gtctcgcagc cagggatgga ggga 6; gggggaacct agagttcttt gtagtgcctc cctcagactc taacacactc agcctggccc 12; cctcctccta ttgcaacccc ctcccccgct cctcccggcc aggccagctc agtcttccca 18; gcccccattc cacgtggacc agccagggcg ggggtaggga aagaggacag gaagaggggg 24; agccagttct gggaggcggg gagg ttggtggcga ctccctcgct cgccctcact ; gccggcggtc ccaactccag gcaccatgtt ccccgcgggc ccccccagcc acagcctcct 36L ccggctcccc ctgctgcagt tgctgctact ggtggtgcag gccgtgggga gggggctggg 42; ccgcgccagc ccggccgggg gccccctgga agatgtggtc atcgagaggt accacatccc 48L ctgt ccccgggaag tgcagatggg ggattttgtg cgctaccact acaacggcac 54L ttttgaagat aagt ttgattcaag ctatgatcgc aacaccttgg tggccatcgt 60; ggtgggtgtg gggcgcctca tcactggcat ggaccgaggc ctcatgggca tgtgtgtcaa 66; cgagcggcga cgcctcattg tgcctcccca ctat gggagcatcg cggg 72; tcca ccggatgcca ccctctactt Cgatgtggtt ctgctggatg tgtggaacaa 78; ggaagacacc gtgcaggtga gcacattgct gcgcccgccc cactgccccc gcatggtcca 84; ggacggcgac tttgtccgct accactacaa tggcaccctg ctggacggca cctccttcga 90; caccagctac agtaagggcg gcacttatga cacctacgtc ggctctggtt ggctgatcaa 96; gggcatggac caggggctgc tgggcatgtg tcctggagag aaga ttatcatccc L02; tccattcctg gcctatggcg agaaaggcta tgggacagtg ccac cgct L08; tcac gtcctcctga ttgacgtgca caacccgaag gtcc agctagagac L14; gctggagctc ccccccggct gtgtccgcag agccggggcc ggggacttca tgcgctacca L20; ctacaatggc tccttgatgg ccct cttcgattcc agctactccc gcaaccacac L26; ctacaatacc tatatcgggc agggttacat catccccggg atggaccagg ggctgcaggg L32; tgcctgcatg ggggaacgcc ggagaattac catccccccg cacctcgcct atggggagaa L38; tggaactgga gacaagatcc ctggctctgc cgtgctaatc ttcaacgtcc atgtcattga L44: caac gatg tggtggaaat caggacactg tcccggccat caatgagacc accaagcttg gggactttgt tcgataccat tacaactgtt cggcacccag ctgttcacct cgcatgacta cggggccccc caggaggcga caacaagqtg atcgaaggcc tggacacggg cctgcagggc atgtgtgtgg gagagaggcg L68; gcagctcatc gtgcccccgc acctggccca gagt ggagcccggg cagtgctgtg ctgctgtttg aggtggagct ggtgtcccgg gaggatgggc tgcccacagg 180; ctacctgttt caca aggaccctcc tgccaacctg tttgaagaca tggacctcaa 186; caaggatggc gaggtccctc ngaggagtt cttc atcaaggctc aagtgagtga 192; gggcaaagga atgc ctgggcagga ccctgagaaa accataggag acatgttcca 198; gaaccaggac cgcaaccagg agat cacagtcgac gagctcaagc tgaagtcaga 204; tgaggacgag gagcgggtcc agct ctgaggggca gggagcctgg ccaggcctga 210; gacacagagg cccactgcga gggggacagt ggcggtggga ctgacctgct gacagtcacc 2161 ctccctctgc tgggatgagg tccaggagcc aactaaaaca atggcagagg agacatctct 2221 ggtgttccca ccaccctaga tgaaaatcca cagcacagac ctctaccgtg tttctcttcc 2281 atccctaaac cacttcctta aaatgtttgg atttgcaaag ccaatttggg gcctgtggag 2341 cctggggttg gatagggcca tggctggtcc cccaccatac ctcccctcca tgac 240; acagctgagc ttgttatcca tctccccaaa ctttctcttt ctttgtactt cttgtcatcc 246; ccactcccag ccccttttcc tctatgtgac agctccctag tctg ccttcctccc 252; caatcctgac tggctcctag ggaaggggaa ggctcctgga gggcagccct acctctccca 258; tgccctttgc ccct cgcctccagt ggaggctgag ctgg gctgctggag 264; gccagactgg gctgtagtta gcttttcatc cctaaagaag gctcctttcc ctaaggaacc 270; atagaagaga ggaagaaaac aaagggcatg tgtgagggaa gctgcttggg tgggtgttag 276; ggctatgaaa tcttggattt ggggctgagg ggtgggaggg agggcagagc tctgcacact 282; caaaggctaa actggtgtca gtcctttttt cctttgttcc aaga ttaaaccaat 288; ggcaaaaa Protein seguence: NCBI Reference Seguence: NP_068758.3 LOCUS N 58 ACCESSION NP_068758 1 mfpagppshs llrlpllqll llvvqavgrg lgraspaggp ledvvieryh revq 61 mgdfvryhyn gtfedgkkfd ssydrntlva rlit gmdrglmgmc vnerrrlivp 121 phlgygsigl aglippdatl yfdvvllde ust 11rpphcprm qugdfvryh 181 yngtlldgts fdtsyskggt ydtyvgsgwl ikgmdqgllg mcpgerrkii ippflaygek 241 gygtvippqa slvfhvllid thpkdavql etlelppgcv dfmr yhyngslmdg 301 tlfdssysrn htyntyigqg yiipgmdqgl qgacmgerrr itipphlayg engtgdkipg 361 savlifnvhv idfhnpadvv eirtlsrpse tcnettklgd fvryhyncsl 1dgtq1ftsh 421 dygapqeatl gankviegld tglqgmcvge rrqlivpphl argv pgsavllfev 481 elvsredglp tgylfvwhkd ppanlfedmd lnkdgevppe efstfikaqv segkgrlmpg 541 qdpektigdm fqnqdrnng kitvdelklk sdedeervhe e1 FINC Official Symbol: FN1 Official Name: fibronectin 1 Gene ID: 2335 Organism: Homo sapiens Other Aliases: CIG, ED-B, FINC, FN, FNZ, GFND, GFND2, LETS, MSF Other Designations: cold-insoluble globulin; fibronectin; migration-stimulating facto r Nucleotide seguence: NCBI Reference Seguence: NM_002026.2 LOCUS NM_002026 ACCESSION NM_002026 1 gcccgcgccg gctgtgctgc acagggggag gaac cccaggcgcg aaga 61 ggggacctgc aact tctctggtcc tctgcatccc ccct ccacccgtcc 121 ccttccccac cctctggccc ccaccttctt ggaggcgaca acccccggga ggcattagaa 181 gggatttttc ccgcaggttg cgaagggaag caaacttggt ggcaacttgc ctcccggtgc 241 gggcgtctct cccccaccgt ctcaacatgc ttaggggtcc ggggcccggg ctgctgctgc 301 tggccgtcca gtgcctgggg acagcggtgc cctccacggg agcctcgaag aggc 361 aggctcagca aatggttcag ccccagtccc cggtggctgt cagtcaaagc ggtt 421 gttatgacaa tggaaaacac tatcagataa atcaacagtg gacc tacctaggca 481 atgcgttggt ttgtacttgt tatggaggaa gccgaggttt taactgcgag agtaaacctg 541 aagctgaaga gacttgcttt gacaagtaca ctgggaacac ttaccgagtg ggtgacactt 601 atgagcgtcc taaagactcc atgatctggg actgtacctg catcggggct gggcgaggga 66; gaataagctg cgca aaccgctgcc atgaaggggg tcagtcctac aagattggtg 72; acacctggag gagaccacat gagactggtg gttacatgtt agagtgtgtg tgtcttggta 78; atggaaaagg agaatggacc tgcaagccca tagctgagaa gtgttttgat catgctgctg 84; ggacttccta tgtggtcgga gaaacgtggg agaagcccta ctgg atgatggtag 90; attgtacttg cctgggagaa ggcagcggac gcatcacttg cacttctaga tgca 96; acgatcagga cacaaggaca tcctatagaa ttggagacac ctggagcaag aaggataatc L02; gaggaaacct gctccagtgc atctgcacag gccg aggagagtgg aagtgtgaga L08; ggcacacctc tgtgcagacc acatcgagcg gatctggccc cttcaccgat gttcgtgcag L14; ctgtttacca gcct cagc ctcctcccta ctgt L20; gtggtgtggt tgtg gggatgcagt ggctgaagac acaaggaaat aagcaaatgc L26; tttgcacgtg cctgggcaac ggagtcagct gccaagagac agctgtaacc cagacttacg L32; gtggcaactc aaatggagag ccatgtgtct taccattcac ctacaatggc ttct L38; actcctgcac cacagaaggg cgacaggacg gacatctttg gtgcagcaca atgagcagga ccagaaatac tctttctgca cagaccacac ggtt cagactcgag L50; gaggaaattc caatggtgcc ttgtgccact tccccttcct atacaacaac cacaattaca L56; ctgattgcac gggc agaagagaca acatgaagtg gtgtgggacc aact L62; atgatgccga ccagaagttt gggttctgcc ccatggctgc ggaa atctgcacaa L68; ccaatgaagg ggtcatgtac Cgcattggag atcagtggga taagcagcat gacatgggtc L74; tgag gtgcacgtgt gttgggaatg gtcgtgggga atggacatgc attgcctact L80; cgcagcttcg agatcagtgc attgttgatg acatcactta caatgtgaac gacacattcc L86; acaagcgtca tgaagagggg cacatgctga actgtacatg cttcggtcag ggtcggggca L92; ggtggaagtg tgatcccgtc gaccaatgcc aggattcaga gactgggacg ttttatcaaa L98; ttggagattc atgggagaag tatgtgcatg gtgtcagata ccagtgctac tgctatggcc 204; gtggcattgg ggagtggcat tgccaacctt tacagaccta tccaagctca agtggtcctg 210; tcgaagtatt tatcactgag actccgagtc agcccaactc ccaccccatc cagtggaatg 2161 caccacagcc atctcacatt tccaagtaca ggtg gagacctaaa aattctgtag 222; gccgttgqaa ggaagctacc ataccaggcc acttaaactc ctacaccatc aaaggcctga 228; agcctggtgt ggtatacgag ggccagctca tcagcatcca gcagtacggc caccaagaag 234; tgactcgctt tgacttcacc agca ccagcacacc tgtgaccagc aacaccgtga 240; caggagagac gactcccttt tctcctcttg tggccacttc tgaatctgtg accgaaatca 246; cagccagtag ctttgtggtc tcctgggtct cagcttccga caccgtgtcg ggattccggg 252; tggaatatga gctgagtgag gagggagatg agccacagta cctggatctt ccaagcacag 258; ccacttctgt gaacatccct gacctgcttc ctggccgaaa atacattgta aatgtctatc 264; agatatctga ggatggggag cagagtttga tcctgtctac ttcacaaaca cctg 270; atgcccctcc tgacccgact gtggaccaag ttgatgacac ctcaattgtt gttcgctgga 276; gcagacccca catc acagggtaca gaatagtcta ttcgccatca gtagaaggta 282: gcagcacaga actcaacctt cctgaaactg caaactccgt caccctcagt gacttgcaac 288; ctggtgttca gtataacatc actatctatg ctgtggaaga aaatcaagaa agtacacctg 294; ttgtcattca acaagaaacc actggcaccc caga tacagtgccc tctcccaggg 300; acctgcagtt tgtggaagtg acagacgtga ccat catgtggaca ccgcctgaga 306; gtgcagtgac ngctaccgt gtggatgtga tccccgtcaa tggc gagcacgggc 312; agaggctgcc catcagcagg aacacctttg cagaagtcac cgggctgtcc cctggggtca 318; cctattactt caaagtcttt gcagtgagcc atgggaggga gagcaagcct ctgactgctc 324; aacagacaac caaactggat gctcccacta acctccagtt tgtcaatgaa actgattcta 330; ctgtcctggt gagatggact ccacctcggg cccagataac ccga ctgaccgtgg 336; gccttacccg acag cccaggcagt acaatgtggg tccctctgtc tacc 342; cactgaggaa tctgcagcct gcatctgagt acaccgtatc cctcgtggcc ataaagggca 348; accaagagag ccccaaagcc actggagtct ttaccacact gcagcctggg agctctattc 354; caccttacaa ggtg actgagacca ccattgtgat cacatggacg cctgctccaa 360; gaattggttt taagctgggt gtacgaccaa gccagggagg acca cgagaagtga 366; cttcagactc aggaagcatc gttgtgtccg gcttgactcc aggagtagaa tacgtctaca 372; ccatccaagt cctgagagat ggacaggaaa gagatgcgcc aattgtaaac aaagtggtga 378; caccattgtc tccaccaaca aacttgcatc tggaggcaaa cact ggagtgctca 384; cagtctcctg ggagaggagc accaccccag ctgg ttatagaatt accacaaccc 390: ctacaaacgg ccagcaggga ttgg aagaagtggt tgat cagagctcct 3961 ttga taacctgagt cccggcctgg atgt cagtgtttac actgtcaagg 402; atgacaagga aagtgtccct atctctgata ccatcatccc agctgttcct cctcccactg 408; gatt caccaacatt ggtccagaca ccatgcgtgt cacctgggct ccacccccat 414; ccattgattt aaccaacttc ctggtgcgtt actcacctgt gaaaaatgag gaagatgttg 420; cagagttgtc aatttctcct tcagacaatg cagtggtctt aacaaatctc ctgcctggta 426; cagaatatgt agtgagtgtc tccagtgtct acgaacaaca tgagagcaca cctcttagag 432; gaagacagaa aacaggtctt ccaa ctggcattga cttttctgat attactgcca 438; ttac tgtgcactgg attgctcctc gagccaccat cactggctac aggatccgcc 444; atcatcccga cagt gggagacctc gagaagatcg ggtgccccac tctcggaatt 450; ccatcaccct caccaacctc actccaggca cagagtatgt ggtcagcatc gttgctctta 456; atggcagaga ggaaagtccc attg gccaacaatc aacagtttct gatgttccga 462; tgga agttgttgct gcgaccccca ccagcctact gatcagctgg gatgctcctg 468; ctgtcacagt gagatattac aggatcactt agac aggaggaaat gtcc 474; aggagttcac tgtgcctggg agcaagtcta cagctaccat cagcggcctt aaacctggag 480; ttgattatac catcactgtg tatgctgtca ctggccgtgg agacagcccc gcaagcagca 486; tttc cattaattac cgaacagaaa ttgacaaacc atcccagatg caagtgaccg 492; atgttcagga caacagcatt agtgtcaagt ggctgccttc aagttcccct gttactggtt 498; acagagtaac caccactccc ggac caggaccaac aaaaactaaa actgcaggtc 504; cagatcaaac agaaatgact attgaaggct tgcagcccac agtggagtat gtggttagtg 510; tctatgctca gaatccaagc ggagagagtc agcctctggt tcagactgca gtaaccaaca 516; ttgatcgccc taaaggactg actg atgtggatgt cgattccatc aaaattgctt 522; gggaaagccc acaggggcaa gtttccaggt acagggtgac ctactcgagc cctgaggatg 528; gaatccatga gctattccct gcacctgatg gtgaagaaga cactgcagag ctgcaaggcc 534; cggg ttctgagtac acagtcagtg tggttgcctt gcacgatgat atggagagcc 540; agcccctgat tggaacccag tccacagcta ttcctgcacc aactgacctg aagttcactc 546; aggtcacacc cacaagcctg agcgcccagt ggacaccacc caatgttcag ctcactggat 552; atcgagtgcg ggtgaccccc aaggagaaga ccggaccaat gaaagaaatc aaccttgctc 558; ctgacagctc atccgtggtt gtatcaggac ttatggtggc caccaaatat gaagtgagtg 564; tctatgctct cact agca gaccagctca gggagttgtc accactctgg 570; agaatgtcag cccaccaaga agggctcgtg tgacagatgc tactgagacc accatcacca 5761 ttagctgqag aaccaagact gagacgatca ctggcttcca agttgatgcc gttccagcca 582; agac tccaatccag atca agccagatgt cagaagctac accatcacag 588; aacc aggcactgac tacaagatct acctgtacac tgac aatgctcgga 594; gctcccctgt ggtcatcgac gcctccactg ccattgatgc accatccaac ttcc 600; tggccaccac acccaattcc ttgctggtat catggcagcc gccacgtgcc aggattaccg 606; gctacatcat tgag aagcctgggt ctcctcccag agaagtggtc cctcggcccc 612; gccctggtgt cacagaggct actattactg gcctggaacc gggaaccgaa tatacaattt 618; atgtcattgc cctgaagaat aatcagaaga gcgagcccct gattggaagg aaaaagacag 624; acgagcttcc ccaactggta acccttccac accccaatct tcatggacca gagatcttgg 630; atgttccttc cacagttcaa aagacccctt tcgtcaccca ccctgggtat gacactggaa 636; atggtattca gcttcctggc acttctggtc agcaacccag tgttgggcaa caaatgatct 642; ttgaggaaca tggttttagg cggaccacac cgcccacaac ggccaccccc ataaggcata 648; ggccaagacc gccg aatgtaggac aagaagctct ctctcagaca accatctcat 654; gggccccatt ccaggacact tctgagtaca tcatttcatg tcatcctgtt ggcactgatg 660; cctt acagttcagg gttcctggaa cttctaccag tgccactctg acaggcctca 666; ccagaggtgc cacctacaac atcatagtgg aggcactgaa agaccagcag aggcataagg 672; ttcgggaaga ggttgttacc gtgggcaact ctgtcaacga gaac acgg 678; atgactcgtg cccc tacacagttt cccattatgc cgttggagat gagtgggaac 684; ctga atcaggcttt ttgt gccagtgctt aggctttgga agtggtcatt 690; tcagatgtga ttcatctaga tggtgccatg acaatggtgt gaactacaag attggagaga 696; agtgggaccg tcagggagaa aatggccaga tgatgagctg cacatgtctt gggaacggaa 702; aaggagaatt caagtgtgac cctcatgagg gtta tgatgatggg aagacatacc 708; acgtaggaga gcag aaggaatatc tcggtgccat ttgctcctgc tttg 714; gaggccagcg gcgc tgtgacaact gccgcagacc tgggggtgaa cccagtcccg 720; aaggcactac tggccagtcc tacaaccagt attctcagag ataccatcag agaacaaaca 726; ctaatgttaa ttgcccaatt gagtgcttca tgcctttaga tgtacaggct gacagagaag 732; attcccgaga gtaaatcatc tttccaatcc agaggaacaa gcatgtctct ctgccaagat 738; ccatctaaac tggagtgatg ttagcagacc cagcttagag tttc tttcttaagc 744; cctttgctct ggaggaagtt ctccagcttc agctcaactc acagcttctc tcac 750i cctgggagtt tcctgagggt ataa atgagggctg gcct gttctgcttc 7561 gaagtattca ataccgctca gtattttaaa tgaagtgatt ctaagatttg gtttgggatc 762; aataggaaag catatgcagc caaccaagat gcaaatgttt tgaaatgata aaat 768; tttaagtagg aaagtcaccc aaacacttct gctttcactt aagtgtctgg cccgcaatac 774; tgtaggaaca agcatgatct tgttactgtg atattttaaa tatccacagt actcactttt 780; tccaaatgat cctagtaatt gcctagaaat atctttctct tacctgttat ttatcaattt 786; ttcccagtat ttttatacgg aaaaaattgt attgaaaaca cttagtatgc agttgataag 792; aggaatttgg tataattatg gtgggtgatt attttttata ctgtatgtgc caaagcttta 798; ctactgtgga aagacaactg ttttaataaa agatttacat tccacaactt gaagttcatc 804; tatttgatat aagacacctt aat aattcctgtg aatattcttt ttcaattcag 810; caaacatttg aaaatctatg atgtgcaagt ctaattgttg atttcagtac aagattttct 816; aaatcagttg ctacaaaaac gttt ttgtcacttc tcac taatggagat 822; agctttacac tttctgcttt aatagattta agtggacccc aatatttatt aaaattgcta 828; gtttaccgtt cagaagtata ataa tctttagttg ctcttttcta accattgtaa 834; ttcttccctt cttccctcca cttc attgaataaa cctctgttca aagagattgc 8401 ctgcaaggga aataaaaatg actaagatat taaaaaaaaa aaaaaaaaa seguence: NCBI Reference Seguence: NP_002017.1 LOCUS N P_00201 7 ACCESSION N P_002017 l mlrgpgpgll llavqclgta vpstgasksk rqaqqqupq spvavsqskp gcydngkhyq 6; inquertyl gnalvctcyg gsrgfncesk cfdk ytgntyrvgd dsmi 12; gagr grisctianr Cheggqsyki gdtwrrphet ggymlecvcl gngkgewtck 18; piaekcfdha agtsyvvget wekpngwmm vdctclgegs gritctsrnr cndthrtsy 24; skkd nrgnllqcic tgngrgewkc erhtqutts sgsgpftdvr aaqupqphp ; qpppyghcvt dsgvvysvgm qwlktqgnkq mlctclgngv scqetavtqt yggnsngepc 36; vlpftyngrt fyscttegrq dghlwcstts nyeqquysf ctdhtvlvqt rggnsngaLc 42; hfpflynnhn ytdctsegrr dnmkwcgttq nydaqufgf eeic ttnegvmyri 48; gdqwquhdm ghmmrctcvg ngrgewtcia ysqlrdqciv dditynvndt fhkrheeghm 54; lnctcfgqgr grwkcdpvdq chsetgtfy qigdswekyv hgvrchycy grgigewhcq 60L plqtypsssq pvevfitetp sqpnshpiqw hisk yilrwrpkns vgrwkeatip 66L tikg lkpqvvyegq lisiqqyghq evtrfdfttt tsnt vtgettpfsp 72; lvatsesvte itassfvvsw vsasdtvsgf rveyelseeg depqyldlps tatsvnipdl 781 lpgrkyivnv yqisedgeqs lilstsqtta pdappdptvd qudtsivvr wsrpqapitg 84; yrivyspsve gsstelnlpe tansvtlsdl quvqyniti yaveenqest pvviqqettg 90; tprsdtvpsp rdquvevtd Vkvtimwtpp esavtgyrvd vipvnlpgeh gqupisrnt 96; faevtglspg vtyyfkvfav shgreskplt aqqttkldap tnquvnetd stvlvrwtpp L02; raqitgyrlt vgltrrgqpr qynvgpsvsk yplrnlqpas eytvslvaik gnqespkatg L08; gss ippyntevte ttivitwtpa prigfklgvr psqggeapre vtsdsgsivv L14; sgltpgveyv ytiqvlrdgq erdapivnkv vtplspptnl hleanpdtgv ltvswerstt L20; pditgyritt tptngqqgns leevvhadqs sctfdnlspg leynvsvytv kddkesvpis ’26; dtiipavppp tdlrftnigp dtmrvtwapp psidltnflv ryspvkneed gteyvvsvss vyeqhestpl rgqutglds ptgidfsdit ansftvhwia L38; pratitgyri rhhpehfsgr predrvphsr nsitltnltp gteyvvsiva spll L44; igqqstvsdv prdlevvaat ptslliswda pavtvryyri tygetggnsp vqeftvpgsk 1501 statisglkp tvya vtgrgdspas skpisinyrt eidkpsqmqv tdqunsisv L56; kwlpssspvt tpkn gpgptktkta gpdqtemtie glqptveyvv svyaqnpsge tavt nidrpkglaf tdvdvdsiki awequgqu ryrvtysspe glrpgseytv svvalhddme sqpligtqst dlkf tqvtptslsa L74; qwtppnvqlt gyrvrvtpke einl apdsssvvvs glmvatkyev svyalkdtLt L80; srpaqgvvtt lenvspprra rvtdatetti tiswrtktet itgfquavp iqrt L86; ikpdvrsyti tglqutdyk iylytlndna idas snlr flattpnsLl L92; vswqpprari tgyiikyekp gspprevvpr prpgvteati teyt iyviaLknnq L98; ksepligrkk tdelpqlvtl phpnlhgpei ldvpstvqkt pfvthpgydt gngiqugts 204; gqqpsvgqqm ifeehgfrrt tppttatpir ppnv gqealsqtti swapfthse 210; yiischpvgt deepqurvp tltg ltrgatynii vealkdqqrh kvreevvtvg 216; nsvneglnqp tddscfdpyt vshyavgdew ermsesgfkl lcqclgfgsg hfrcdssrwc 222; hdngvnykig ekwdrqgeng qmmsctclgn gkgefkcdph eatcyddgkt 228; ylgaicsctc fggqrgwrcd ncrrpggeps pegttgqsyn qysqrthrt ntnvncpiec 234; fmpldvqadr edsre CYBS Official Symbol: CYB5A Official Name: cytochrome b5 type A (microsomal) Gene ID: 1528 Organism: Homo sapiens Other Aliases: cvss, MCB5 Other Designations: cytochrome b5; type 1 cyt-b5 Note — there are three difference isoforms Isoform1 Nucleotide ce: NCBI Reference ce: NM_148923.3 LOCUS NM_148923 ACCESSION NM_148923 l gcgccccgcc cctgagccgg ccgcccagcc cccagtgggg ttcccggcgc ggggaatgtc 6- ccgggtggag gagt cgcgcgctct gctccacccg acggggctgt gtgtgctggg 12; cctggctcgc ggcgaaccga gatggcagag cagtcggacg aggccgtgaa gtactacacc l8; ctagaggaga ttcagaagca caaccacagc aagagcacct ggctgatcct gcaccacaag 24; gtgtacgatt tgaccaaatt tctggaagag catcctggtg gggaagaagt tttaagggaa ; caagctggag gtgacgctac tgagaacttt gaggatgtcg ggcactctac agatgccagg 36; gaaatgtcca aaacattcat cattggggag ctccatccag atgacagacc aaagttaaac 42; aagcctccgg aaactcttat cactactatt agtt ccagttggtg gaccaactgg 48; gtgatccctg ctgc agtggccgtc gccttgatgt atcgcctata catggcagag 54; gactgaacac ctcctcagaa gtcagcgcag gaagagcctg ctttggacac gggagaaaag 60; ttgc taactacttc aactgacaga aaccttcact tgaaaacaat gattttaata 66; tatctctttc tttttcttcc gacattagaa acaaaacaaa aagaactgtc ctttctgcgc 72; tcaaattttt cgagtgtgcc tttttattca tctactttat tttgatgttt ccttaatgtg 78; taatttactt attataagca ttta aaaatatatt tggcttttaa agtatgcaaa 84; aaaaaaaaaa Protein seguence: NCBI Reference Seguence: 725.1 LOCUS N P_683725 ION 725 l eavk yytleeiqkh nhskstwlil hhkvydltkf leehpggeev lreqaggdat 61 enfedvghst daremsktfi ddrp klnkppetli ttidssssww tnwvipaisa 121 vavalmyrly maed Isoform2 Nucleotide seguence: NCBI nce Seguence: NM_001914.3 LOCUS 914 ACCESSION NM_001914 l gcgccccgcc cctgagccgg ccgcccagcc cccagtgggg ttcccggcgc ggggaatgtc 61 ccgggtggag ctggctgagt cgcgcgctct gctccacccg acggggctgt gtgtgctggg 121 cctggctcgc ggcgaaccga gatggcagag cagtcggacg aggccgtgaa gtactacacc 181 ctagaggaga ttcagaagca caaccacagc aagagcacct ggctgatcct gcaccacaag 241 gtgtacgatt tgaccaaatt agag catcctggtg gggaagaagt tttaagggaa 301 caagctggag gtgacgctac tgagaacttt gaggatgtcg ggcactctac agatgccagg 361 gaaatgtcca aaacattcat cattggggag ctccatccag atgacagacc aaagttaaac 421 aagcctccgg aaag gcggtgtttc aaggaaactc ttatcactac tattgattct 481 agtt ggtggaccaa ctgggtgatc cctgccatct ctgcagtggc cgtcgccttg 541 cgcc tatacatggc agaggactga acacctcctc agaagtcagc gcaggaagag 601 cctgctttgg acacgggaga aaagaagcca ttgctaacta cttcaactga cagaaacctt 661 cacttgaaaa caatgatttt aatatatctc tttctttttc ttccgacatt agaaacaaaa 721 caaaaagaac tgtcctttct gcgctcaaat ttttcgagtg ttta ttcatctact 781 ttattttgat gtttccttaa tgtgtaattt acttattata agcatgatct tttaaaaata 841 tatttggctt ttaaagtatg caaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_001905.1 LOCUS N P_001 905 ION N P_001905 1 maeqsdeavk yytleeiqkh nhskstwlil hhkvydltkf leehpggeev 1reqaggdat 61 enfedvghst ktfi igelhpddrp klnkppep Isoform 3 Nucleotide seguence: NCBI Reference Seguence: NM_001190807.2 LOCUS NM_001 190807 ACCESSION NM_001 190807 1 gcgccccgcc cctgagccgg ccgcccagcc cccagtgggg gcgc ggggaatgtc 6; ccgggtggag ctggctgagt cgcgcgctct gctccacccg acggggctgt gtgtgctggg 12; cctggctcgc ggcgaaccga gatggcagag cagtcggacg aggccgtgaa gtactacacc 18; ctagaggaga ttcagaagca caaccacagc aagagcacct ggctgatcct gcaccacaag 24; gtgtacgatt tgaccaaatt tctggaagag catcctggtg gggaagaagt tttaagggaa ; caagctggag gtgacgctac cttt gaggatgtcg ctac agatgccagg 36; gaaatgtcca aaacattcat cattggggag ccag ttat cactactatt 42; gattctagtt ccagttggtg gaccaactgg gtgatccctg ccatctctgc cgtc 48; gccttgatgt atcgcctata catggcagag gactgaacac ctcctcagaa gtcagcgcag 54; gaagagcctg ctttggacac gggagaaaag aagccattgc taactacttc caga 60; aaccttcact caat gattttaata tatctctttc tttttcttcc gacattagaa 66; acaaaacaaa aagaactgtc ctttctgcgc tcaaattttt cgagtgtgcc tttttattca 72; tctactttat tttgatgttt ccttaatgtg taatttactt attataagca tgatctttta 78; aaaatatatt tggcttttaa agtatgcaaa aaaaaaaaaa Protein seguence: NCBI nce Seguence: NP_001177736.1 LOCUS N P_001 177736 ACCESSION N P_001177736 1 maeqsdeavk yytleeiqkh nhskstwlil hhkvydltkf leehpggeev gdat 61 enfedvghst daremsktfi igelhpetli ttidssssww tnwvipaisa vavalmyrly 121 maed PA|1 Official Symbol: SERPINE1 Official Name: serpin peptidase inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), member 1 Gene ID: 5054 Organism: Homo sapiens Other s: PAI, PAl-1, PAI1, PLANH1 Other ations: endothelial plasminogen activator inhibitor; plasminogen activator inhibitor 1; serine (or cysteine) nase inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), member 1; serpin E1 Nucleotide ce (lsoform 1): NCBI Reference Seguence: NM_000602.4 LOCUS NM_000602 ACCESSION 602 l ggcccacaga cagc tgtgtttggc tgcagggcca agagcgctgt caagaagacc 6; cacacgcccc cctccagcag ctgaattcct gcagctcagc agccgccgcc agagcaggac 12; ccaa tcgcaaggca cctctgagaa gatg cagatgtctc tcac 18; ctgcctagtc ctgggcctgg cccttgtctt tggtgaaggg tctgctgtgc accatccccc 24; atcctacgtg gcccacctgg cctcagactt cggggtgagg gtgtttcagc aggtggcgca ; ggcctccaag gaccgcaacg tggttttctc accctatggg gtggcctcgg tgttggccat 36; gctccagctg acaacaggag gagaaaccca gcagcagatt caagcagcta tgggattcaa 42; gattgatgac aagggcatgg cccccgccct ccggcatctg tacaaggagc ggcc 48; atggaacaag gatgagatca gcaccacaga cgcgatcttc gtccagcggg atctgaagct 54; ggtccagggc ttcatgcccc acttcttcag gctgttccgg agcacggtca agcaagtgga 60; cttttcagag gtggagagag tcat catcaatgac tgggtgaaga cacacacaaa 66L aggtatgatc agcaacttgc ttgggaaagg ggac cagctgacac ggctggtgct 72L ggtgaatgcc ctctacttca agtg gaagactccc ttccccgact ccagcaccca 78L ccgccgcctc ttccacaaat cagacggcag cactgtctct gtgcccatga tggctcagac 84; caacaagttc aactatactg agttcaccac gcccgatggc cattactacg acatcctgga 90; actgccctac cacggggaca ccctcagcat gttcattgct gccccttatg aaaaagaggt 96; gcctctctct gccctcacca acattctgag tgcccagctc atcagccact ggaaaggcaa L02; catgaccagg ctgccccgcc tcctggttct gcccaagttc tccctggaga ctgaagtcga L08; cctcaggaag cccctagaga acctgggaat gaccgacatg ttcagacagt ttcaggctga L14; cttcacgagt ctttcagacc aagagcctct ccacgtcgcg caggcgctgc agaaagtgaa L20; gatcgaggtg aacgagagtg gcacggtggc ctcctcatcc acagctgtca tagtctcagc L26; ccgcatggcc cccgaggaga tcatcatgga cagacccttc ctctttgtgg tccggcacaa L32; ccccacagga acagtccttt tcatgggcca agtgatggaa ccct ggggaaagac L38; gccttcatct gggacaaaac tgca tcgggaaaga agaaactccg aagaaaagaa L44; ttttagtgtt aatgactctt tctgaaggaa gagaagacat ttgccttttg tctgtctcca agaccttggc ctctccttgg aggaccttta cctagtctcc acctgagacc ctgggagaga agtttgaagc acaactccct taaggtctcc L62; aaaccagacg gtgacgcctg cgggaccatc tggggcacct gcttccaccc ctgc L68; ccactcgggt ctgcagacct ggttcccact gaggcccttt gcaggatgga ttacaggagc ttttgtgtgc ctggtagaaa ctgt tccagtcaca ttgccatcac L80; tcttgtactg cctgccaccg cggaggaggc tggtgacagg ccaaaggcca gtggaagaaa L86; caccctttca tctcagagtc cactgtggca ctggccaccc agta tgct L92; gcaggtggca gagtgaatgt cccccatcat gtggcccaac tctcctggcc tggccatctc L98; cctccccaga aacagtgtgc tatt ttggagtgta ggtgacttgt ttactcattg 204; attt cctt ttatttttat aggaatagag gaagaaatgt cagatgcgtg 210; cccagctctt ccaa tctcttggtg gggaggggtg tacctaaata tttatcatat 216; ccttgccctt gagtgcttgt tagagagaaa gagaactact aaggaaaata ttta 222; aactcgctcc tagtgtttct ttgtggtctg tgtcaccgta gaag tccagccact 228; tgactggcac acacccctcc ccag cgtgacggag cccacactgc caccttgtgg 234; ccgcctgaga ccctcgcgcc ccccgcgccc ctctttttcc tgga aattgaccat 240: acaatttcat cctccttcag gggatcaaaa ggacggagtg gggggacaga gactcagatg 2461 aggacagagt ggtttccaat gtgttcaata gatttaggag cagaaatgca aggggctgca 252; tgacctacca ggacagaact ttccccaatt acagggtgac tcacagccgc attggtgact 258; cacttcaatg tgtcatttcc ggctgctgtg tgtgagcagt gtga ggggggggtg 264; ggtgagagag acaggcagct caac taccttagat aatatttctg aaaacctacc 270; aggg tagggcacaa agatggatgt aatgcacttt gggaggccaa ggcgggagga 276; ttgcttgagc ccaggagttc aagaccagcc tgggcaacat accaagaccc ccgtctcttt 282; aaaaatatat atattttaaa tatacttaaa tatatatttc taatatcttt aaatatatat 288; atatatttta aagaccaatt tatgggagaa ttgcacacag atgtgaaatg aatgtaatct 294; aatagaagcc taatcagccc ttct ccactgaaaa atcctctttc tttggggttt 300; ttctttcttt cttttttgat tttgcactgg acggtgacgt tgta caggatccac 306; aggggtggtg tcaaatgcta ttgaaattgt gttgaattgt atgctttttc acttttgata 3121 aataaacatg taaaaatgtt tcaaaaaaat aataaaataa ataaatacga agaatatgtc 3181 aggacagtca aaaaaaaaaa aaaaaaa Protein seguence (isoform 1): NCBI Reference Seguence: NP_000593.1 LOCUS N 93 ACCESSION 593 1 mqmspaltcl vfge gsavhhppsy vahlasdfgv rquqvaqas kdrnvvfspy 61 gvasvlamlq lttggetqqq iqaamgfkid dkgmapalrh gpwn tdai 121 qurdlklvq gfmphffrlf rstvkqufs everarfiin dkathtkgm isnllgkgav 181 dqltrlvlvn alyfnngkt pfpdssthrr gstv svpmmaqtnk fnytefttpd 241 ghyydilelp yhgdtlsmfi aapyekevpl saltnilsaq lishwkgnmt rlprllvlpk 301 fsletevdlr kplenlgmtd mfrqfqadft plhv aqalqukie vnesgtvass 361 stavivsarm apeeiimdrp flfvvrhnpt gtvlfmgqvm ep Nucleotide seguence (isoform 2): NCBI Reference Seguence: NM_001165413.2 LOCUS NM_001165413 ACCESSION NM_001165413 1 ggcccacaga ggagcacagc tgtgtttggc tgcagggcca agagcgctgt caagaagacc 61 cacacgcccc cctccagcag ctgaattcct gcagctcagc agccgccgcc agagcaggac 121 gaaccgccaa tcgcaaggca cctctgagaa cttcaggatg tctc cagccctcac 181 agtc ctgggcctgg cccttgtctt tggtgaaggg tctgctgtgc accatccccc 241 atcctacgtg gcct ccaaggaccg caacgtggtt ttctcaccct atggggtggc ; ctcggtgttg gccatgctcc agctgacaac aggaggagaa acccagcagc agattcaagc 36; agctatggga ttcaagattg aggg catggccccc gccctccggc acaa 42; ggagctcatg gggccatgga acaaggatga gatcagcacc acagacgcga tcttcgtcca 48; gcgggatctg aagctggtcc agggcttcat gccccacttc ttcaggctgt tccggagcac 54; ggtcaagcaa gtggactttt cagaggtgga gagagccaga ttcatcatca atgactgggt 60; gaagacacac acaaaaggta gcaa cttgcttggg aaaggagccg tggaccagct 66L gacacggctg gtgctggtga atgccctcta cggc cagtggaaga ctcccttccc 72; cgactccagc acccaccgcc gcctcttcca caaatcagac ggcagcactg tctctgtgcc 78L catgatgqct cagaccaaca agttcaacta tactgagttc cccg atggccatta 84L ctacgacatc ctggaactgc cctaccacgg ggacaccctc agcatgttca ttgctgcccc 90; ttatgaaaaa gaggtgcctc tctctgccct caccaacatt ctgagtgccc agctcatcag 96; ccactggaaa ggcaacatga ccaggctgcc cctg gttctgccca agttctccct L02; ggagactgaa gtcgacctca ggaagcccct agagaacctg ggaatgaccg acatgttcag L08; acagtttcag gctgacttca cgagtctttc agaccaagag cctctccacg tcgcgcaggc L14; gctgcagaaa atcg aggtgaacga cacg gtggcctcct catccacagc L20; tgtcatagtc tcagcccgca tggcccccga ggagatcatc atggacagac ccttcctctt L26; tgtggtccgg cacaacccca caggaacagt ccttttcatg gtga tggaaccctg L32; accctgggga aagacgcctt catctgggac aaaactggag atgcatcggg gaaa L38; agaa aagaatttta atga ctctttctga aggaagagaa gacatttgcc L44; ttttgttaaa agatggtaaa ccagatctgt ctccaagacc ttggcctctc cttggaggac L50; ctttaggtca aactccctag tctccacctg agaccctggg agagaagttt caac L56; tcccttaagg tctccaaacc tgac gcctgcggga ccatctgggg cacctgcttc L62; cacccgtctc tctgcccact cgggtctgca gacctggttc ccactgaggc cctttgcagg L68; atggaactac ggggcttaca tttg tgtgcctggt agaaactatt tctgttccag L74; tcacattgcc atcactcttg tactgcctgc caccgcggag gaggctggtg acaggccaaa L801 ggccagtgga agaaacaccc tttcatctca gagtccactg tggcactggc cacccctccc L86; cagtacaggg gtgctgcagg tggcagagtg aatgtccccc atcatgtggc ccaactctcc L92; tggcctgqcc atctccctcc ccagaaacag tgtgcatggg ttattttgga cttgtttact cattgaagca gatttctgct tccttttatt ggaa aaga 204; aatgtcagat gcgtgcccag ctcttcaccc cccaatctct tggtggggag gggtgtacct 210; aaatatttat catatccttg cccttgagtg cttgttagag agaaagagaa ctactaagga 216; aaataatatt atttaaactc gctcctagtg tgtg gtctgtgtca ccgtatctca 222; ggaagtccag ccacttgact ggcacacacc cctccggaca tccagcgtga ccac 228; actgccacct tgtggccgcc tgagaccctc gcgccccccg cgcccctctt cttg 234; atggaaattg accatacaat ttcatcctcc ttcaggggat caaaaggacg gagtgggggg 240; actc agatgaggac agagtggttt ccaatgtgtt caatagattt aggagcagaa 2461 atgcaagggg ctgcatgacc taccaggaca tccc caattacagg gtgactcaca 2521 gccgcattgg tgactcactt caatgtgtca tttccggctg ctgtgtgtga gcagtggaca 2581 cgtgaggqgg gggtgggtga gagagacagg cagctcggat tcaactacct tagataatat 2641 ttctgaaaac ctaccagcca gagggtaggg cacaaagatg gatgtaatgc actttgggag 270; gccaaggcgg gaggattgct tgagcccagg agttcaagac gggc aacataccaa 276; gacccccgtc tctttaaaaa tatatatatt ttaaatatac ttaaatatat atttctaata 282; tctttaaata tata ttttaaagac caatttatgg gagaattgca cacagatgtg 288; aaatgaatgt atag aagcctaatc ccat gttctccact gaaaaatcct 294; ctttctttgg ggtttttctt tctttctttt ttgattttgc actggacggt gacgtcagcc 300; atgtacagga tccacagggg tggtgtcaaa tgctattgaa attgtgttga attgtatgct 306; ttttcacttt tgataaataa acatgtaaaa atgtttcaaa aaaataataa aataaataaa 3121 tacgaagaat atgtcaggac agtcaaaaaa aaaaaaaaaa aa Protein seguence (isoform 2): NCBI Reference ce: NP_001158885.1 LOCUS N P_001 158885 ACCESSION NP_001158885 1 mqmspaltcl vlglalvfge gsavhhppsy drnv vfspygvasv 1am1q1ttgg 61 etqqqiqaam gfkiddkgma palrhlykel mgpwnkdeis ttdaiqurd 1k1vqgfmph 121 ffrlfrstvk qufsevera rfiindkat htkgmisnll gkgavdqltr 1v1vna1yfn 181 ngktpfpds sthrrlfhks dgstvsvpmm aqtnkfnyte fttpdghyyd ilelpyhgdt 241 lsmfiaapye kevplsaltn ilsaqlishw kgnmtrlprl 1v1pkfslet plen 301 1gmtdmfrqf qadftslsdq eplhvaqalq kvkievnesg tavi vsarmapeei 361 imdrpflfvv rhnptgtvlf mgqvmep MPR1 Official Symbol: IGF2R Official Name: insulin-like growth factor 2 receptor m: 3482 Organism: Homo sapiens Other Aliases: CD222, CIMPR, M6P-R, MPR1, MPRI Other ations: 300 kDa mannose 6-phosphate receptor; CI ManP receptor; Cl-MPR; IGF-II receptor; M6P/IGF2 receptor; M6P/IGF2R; M6PR; MPR 300; cation-independent e-6 phosphate receptor; cation- independent mannosephosphate receptor; insulin-like growth factor II receptor Nucleotide seguence: NCBI Reference ce: NM_000876.2 LOCUS NM_000876 ACCESSION NM_000876 l cgagcccagt cgagccgcgc tcacctcggg ctcccgctcc gtctccacct ccgcctttgc 6; cctggcggcg cgaccccgtc ccgggcgcgg cccccagcag tcgcgcgccg ttagcctcgc 12; gcccgccgcg cagtccgggc ccggcgcgat gggggccgcc cgga gcccccacct 18; ggggcccgcg cccgcccgcc gcccgcagcg ctctctgctc cagc tgctgctgct 24; cgtcgctgcc tcca ccca ggccgccccg ttccccgagc tgtgcagtta ; tacatgggaa gctgttgata ccaaaaataa tgtactttat aaaatcaaca gaag 36; tgtggatatt gtccagtgcg ggccatcaag tgctgtttgt atgcacgact tgaagacacg 42; cacttatcat tcagtgggtg actctgtttt gagaagtgca accagatctc tcctggaatt 48; caacacaaca tgtg aagg cacaaatcac agagtccaga gcagcattgc 54; cttcctgtgt gggaaaaccc tgggaactcc tgaatttgta actgcaacag aatgtgtgca 60; ctactttgag acca ctgcagcctg caagaaagac atatttaaag caaataagga 66L ggtgccatgc tatgtgtttg atgaagagtt gaggaagcat gatctcaatc ctctgatcaa 72L gcttagtggt gcctacttgg tggatgactc cgatccggac ctat tcatcaatgt 78L ttgtagagac atagacacac tacgagaccc aggttcacag ctgcgggcct gtccccccgg 84; cactgccgcc tgcctggtaa gaggacacca ggcgtttgat gttggccagc cccgggacgg 90; actgaagctg gtgcgcaagg acaggcttgt cctgagttac gaag aggcaggaaa 96; gctagacttt tgtgatggtc acagccctgc tatt acatttgttt gcccgtcgga L02; gcggagagag ggcaccattc ccaaactcac agctaaatcc aactgccgct atgaaattga L08; gtggattact gagtatgcct gccacagaga ttacctggaa actt gttctctgag L14; cggcgagcag caggatgtct ccatagacct cacaccactt gcccagagcg gaggttcatc L20; ctatatttca gatggaaaag aatatttgtt ttatttgaat ggag aaactgaaat L26; acagttctgt aataaaaaac aagctgcagt ttgccaagtg aaaaagagcg atacctctca L32; agca gcaggaagat accacaatca gaccctccga tattcggatg gagacctcac L38; cttgatatat tttggaggtg atgaatgcag ctcagggttt cagcggatga ctttgagtgc aataaaaccg acga tgggaaagga actcctgtat L50; ggttgactgc acctacttct tcacatggga cacggaatac gcctgtgtta aggagaagga L56; cctc tgcggtgcca ccgacgggaa gaagcgctat gacctgtccg ccatgcagaa ccagagcaga attgggaagc tggc agtcagacgg aaacagagaa L68; gaagcatttt ttcattaata acag agtgctgcag gaaggcaagg cacgagggtg L74; tcccgaggac gcggcagtgt gtgcagtgga taaaaatgga agtaaaaatc tgggaaaatt L80; ctct cccatgaaag agaaaggaaa cattcaactc tcttattcag atggtgatga L86; ttgtggtcat ggcaagaaaa ttaaaactaa tatcacactt gtatgcaagc caggtgatct L92; ggaaagtgca ccagtgttga gaacttctgg ggaaggcggt tgcttttatg agtttgagtg L98; gcacacagct gcggcctgtg tgctgtctaa gacagaaggg gagaactgca cggtctttga 204; ctcccaggca gggttttctt ttgacttatc caca aagaaaaatg gtgcctataa 210; agttgagaca aagaagtatg acttttatat aaatgtgtgt ggcccggtgt ctgtgagccc 216; ctgtcagcca ggag cctgccaggt ggcaaaaagt gatgagaaga cttggaactt 222; gggtctgagt aatgcgaagc tttcatatta tgatgggatg atccaactga actacagagg 228; cggcacaccc tataacaatg aaagacacac accgagagct acgctcatca tctg 234; tgatcgagac gcgggagtgg gcttccctga atatcaggaa gaggataact ccacctacaa 240: gtgg tacaccagct atgcctgccc ggaggagccc ctggaatgcg tagtgaccga 2461 cccctccacg ctggagcagt acgacctctc cagtctggca aaatctgaag gtggccttgg 252; aggaaactgg tatgccatgg acaactcagg ggaacatgtc acgtggagga aatactacat 258; taacgtgtgt ctga atccagtgcc gggctgcaac cgatatgcat cggcttgcca 264; gatgaagtat gatc agggctcctt cactgaagtg atca gtaacttggg 270; aatggcaaag accggcccgg tggttgagga cagcggcagc ctccttctgg aatacgtgaa 276; tgggtcggcc tgcaccacca gcgatggcag acagaccaca tataccacga ggatccatct 282; cgtctgctcc aggggcaggc tgaacagcca ccccatcttt tctctcaact gggagtgtgt 288; ggtcagtttc ctgtggaaca cagaggctgc ctgtcccatt cagacaacga cggatacaga 294; ccaggcttgc tctataaggg atcccaacag tggatttgtg tttaatctta atccgctaaa 300; cagttcgcaa aacg tctctggcat tgggaagatt ttta atgtctgcgg 306; cacaatgcct gtctgtggga ccatcctggg aaaacctgct tgtg aggcagaaac 312: ccaaactgaa gagctcaaga attggaagcc agcaaggcca gtcggaattg agaaaagcct 318; ccagctgtcc acagagggct tcatcactct gacctacaaa gggcctctct ctgccaaagg 324; taccgctgat gcttttatcg tccgctttgt ttgcaatgat gatgtttact cagggcccct 330; caaattcctg gata tcgactctgg gcaagggatc cgaaacactt actttgagtt 336; tgaaaccgcg ttggcctgtg ttccttctcc agtggactgc caagtcaccg acctggctgg 342; aaatgagtac gacctgactg gcctaagcac agtcaggaaa ccttggacgg ctgttgacac 348; ctctgtcgat gggagaaaga ggactttcta tttgagcgtt tgcaatcctc tcccttacat 354; tcctggatgc cagggcagcg cagtggggtc ttgcttagtg tcagaaggca atagctggaa 360; tctgggtgtg gtgcagatga gtccccaagc cgcggcgaat ggatctttga tgta 366; tgtcaacggt gacaagtgtg ggaaccagcg cttctccacc acgt ttgagtgtgc 372; atcg ggctcaccag catttcagct tggt tgtgagtacg tctg 378; gagaactgtg gaagcctgtc ccgttgtcag agtggaaggg gacaactgtg aggtgaaaga 384; gcat ggcaacttgt atgacctgaa gcccctgggc ctcaacgaca ccatcgtgag 390; cgctggcgaa tatt acttccgggt ctgtgggaag ctttcctcag acgtctgccc 396; cacaagtgac aagg tggtctcctc atgtcaggaa aagcgggaac cgcagggatt 402; tcacaaagtg gcaggtctcc tgactcagaa gctaacttat gaaaatggct tgttaaaaat 408; gaacttcacg gggggggaca cttgccataa ggtttatcag cgctccacag ccatcttctt 414; ctactgtgac cgcggcaccc agcggccagt atttctaaag tcag attgttccta 420: cttgtttgag tggcgaacgc agtatgcctg cccacctttc gatctgactg aatgttcatt 4261 caaagatggg gctggcaact acct cctg tcaaggtaca gtgacaactg 432; ggaagccatc actgggacgg gggacccgga gcactacctc atcaatgtct ctct 438; ggccccgcag gctggcactg agccgtgccc tccagaagca gccgcgtgtc tgctgggtgg 444; ctccaagccc gtgaacctcg gcagggtaag ggacggacct cagtggagag atggcataat 450; tgtcctgaaa tacgttgatg gcgacttatg tccagatggg attcggaaaa agtcaaccac 456; catccgattc agcg agagccaagt cagg cccatgttca tcagcgccgt 462; ggaggactgt gagtacacct ttgcctggcc cacagccaca ccca tgaagagcaa 468; cgagcatgat gactgccagg accc aagcacagga cacctgtttg atctgagctc 474; tggc agggcgggat tcacagctgc ttacagcgag aaggggttgg tttacatgag 480; catctgtggg gagaatgaaa actgccctcc gggg gcctgctttg gacagaccag 486; gattagcgtg ggcaaggcca acaagaggct gagatacgtg gaccaggtcc tgcagctggt 492; gtacaaggat gggtcccctt gtccctccaa atccggcctg agctataaga gtgtgatcag 498; tttcgtgtgc aggcctgagg ccgggccaac caataggccc atgctcatct ccctggacaa 504; gcagacatgc actctcttct tctcctggca cacgccgctg gcctgcgagc ccga 510; atgttccgtg aggaatggaa gctctattgt tgacttgtct ccccttattc atcgcactgg 516; tggttatgag gatg agagtgagga tgatgcctcc gataccaacc ctgatttcta 522; catcaatatt tgtcagccac taaatcccat gcacggagtg ccctgtcctg ccggagccgc 528; tgtgtgcaaa gttcctattg atggtccccc catagatatc ggccgggtag cacc 534; aatactcaat ccaatagcaa atgagattta cttgaatttt agta ctccttgctt 540; agcggacaag catttcaact cgct catcgcgttt cactgtaaga gaggtgtgag 546; catgggaacg cctaagctgt taaggaccag cgagtgcgac tttgtgttcg aatgggagac 552; tcctgtcgtc tgtcctgatg aagtgaggat ggatggctgt accctgacag atgagcagct 558; cctctacagc ttgt ccagcctttc cacgagcacc tttaaggtga ctcgcgactc 564; gcgcacctac agcgttgggg tgtgcacctt tgcagtcggg caag gtaa 570; ggacggagga ctgc tctcaggcac caagggggca tcctttggac ggctgcaatc 576; aatgaaactg gattacaggc accaggatga agcggtcgtt ttaagttacg tgaatggtga 582; tcgttgccct ccagaaaccg atgacggcgt cccctgtgtc ttccccttca tattcaatgg 588; gaagagctac gaggagtgca tcatagagag cagggcgaag tgta gcacaactgc 594; ggactacgac agagaccacg agtggggctt ctgcagacac tcaaacagct accggacatc 600; cagcatcata tttaagtgtg atgaagatga ggacattggg aggccacaag tcttcagtga 606; agtgcgtggg tgtgatgtga catttgagtg gaaaacaaaa gttgtctgcc ctccaaagaa 612; gttggagtgc aaattcgtcc agaaacacaa aacctacgac ctgcggctgc tctcctctct 618; caccgggtcc tggtccctgg tccacaacgg agtctcgtac tatataaatc tgtgccagaa 624; aatatataaa gggcccctgg gctgctctga aagggccagc agaa caac 630; tggtgacgtc caggtcctgg gactcgttca cacgcagaag ctgggtgtca taggtgacaa 636; agttgttgtc acgtactcca aaggttatcc gtgtggtgga accg catcctccgt 642; gatagaattg acctgtacaa agacggtggg cagacctgca ttcaagaggt ttgatatcga 648; cagctgcact tactacttca gctgggactc ccgggctgcc tgcgccgtga agcctcagga 654; ggtgcagatg gtgaatggga ccatcaccaa ccctataaat ggcaagagct tcagcctcgg 660; agatatttat tttaagctgt tcagagcctc catg aatg gggacaacta 666; cctgtatgag atccaacttt cctccatcac aagctccaga aacccggcgt gctctggagc 672; caacatatgc aagc ccaacgatca gcacttcagt cggaaagttg gaacctctga 678; caagaccaag tactaccttc aagacggcga tctcgatgtc gtgtttgcct cttcctctaa 684; gtgcggaaag gataagacca tttc ttccaccatc ttcttccact ctct 690; ggtggagqac gggatccccg agttcagtca cgagactgcc gactgccagt acctcttctc 696; cacc tcagccgtgt gtcctctggg ggtgggcttt gacagcgaga atcccgggga 702; cgacgggcag atgcacaagg ggctgtcaga acggagccag gcagtcggcg cggtgctcag 708; cctgctgctg gtggcgctca cctgctgcct gctggccctg ttgctctaca agaaggagag 714; gagggaaaca gtgataagta agctgaccac ttgctgtagg agaagttcca acgtgtccta 720; caaatactca aaggtgaata aggaagaaga tgag aatgaaacag agtggctgat 726; ggaagagatc cagctgcctc ctccacggca gggaaaggaa gggcaggaga acggccatat 732; taccaccaag tcagtgaaag ccctcagctc cctgcatggg gatgaccagg acagtgagga 738; tgaggttctg accatcccag aggtgaaagt tcactcgggc aggggagctg gggcagagag 744; ctcccaccca gtgagaaacg cacagagcaa tcag gagcgtgagg acgatagggt 750; ggggctggtc aggggtgaga aggcgaggaa agggaagtcc agctctgcac agcagaagac 756; agtgagctcc accaagctgg tgtccttcca tgacgacagc gacgaggacc tcttacacat 762; ctgactccgc agtgcctgca ggggagcacg cggg acagccaagc aacc 768; aaataagact tccactcgat gatgcttcta taattttgcc tttaacagaa actttcaaaa 774; gggaagagtt tttgtgatgg gggagagggt ggtc aggccccact ccttcctgat 780; cagt cattggaata aggcatggct cagatcggcc acagggcggt accttgtgcc 786; cagggttttg ccccaagtcc tcatttaaaa gcataaggcc ggacgcatct agag 792; ggctgcattc gaagaaaccc ttgctgcttt agtcccgata gggtatttga ccccgatata 798; ttttagcatt ttaattctct ccccctattt attgactttg actc aggtttgaga 804; aaaaggaaaa aaaaacagcc accgtttctt cctgccagca ggggtgtgat gtaccagttt 810; cttg agatggtgag gctgtcagtg tatggggcag cttccggcgg gatgttgaac 816; ttaa tgtgtcccct gagttggagc tcattctgtc tcttttctct tttgctttct 822; gtttcttaag ggcacacaca cgtgcgtgcg agcacacaca cacatacgtg cacagggtcc 828; gcct aggttttgga gagtttgcct gttctatgcc tttagtcagg aatggctgca 834; cctttttgca tgatatcttc gggc gtacagagca tcag tatttttgcc 840; ggctggtgaa ttcaaacaac ctgcccaaag attgatttgt gtgtttgtgt gtgtgtgtgt 846; gtgtgtgtgt gtgtgtgtga gtggagttga ggtgtcagag aatt ttttccagat 852; ttggggtata ggtctcatct cttcaggttc tcatgatacc acctttactg tgcttatttt 858; tttaagaaaa tgat caaccattcg acctataaga agccttaatt gtgt 864; gtgacttaca gcat gaaaaatcat gggccagagc ctcggcccta gcattgcact 870; tggcctcatg ctggagggag gctgggcggg tacagcgcgg aggaggaggg aggccaggcg 876; ggcatggcgt ggaggaggag ggaggccggg cggtcacagc atggaggagg agggaggcgc 882; tgctggtgtt ctgg cggcagcgcc tttcctgcca tgtttagtga atgacttttc 888; tcgcattgta gaattgtata tagactctgg tgttctattg ctgagaagca aaccgccctg 894; cagcatccct cagcctgtac cggtttggct ggcttgtttg atttcaacat gagtgtattt 900; tttaaaattg atttttctct tcattttttt ttcaatcaac tttactgtaa tataaagtat 9061 tcaacaattt aaga taaattatta aaa Protein seguence: NCBI Reference ce: NP_000867.2 LOCUS N P_000867 ACCESSION NP_000867 1 mgaaagrsph 1gpaparrpq rsllllqlll 1vaapgstqa qaapfpelcs ytweavdtkn 6; nvlykinicg svdivchps savcmhdlkt gdsv 1rsatrslle fnttvscdqq 12; gtnhrvqssi aflcgktlgt pefvtatecv hyfewrttaa ckkdifkank evpcyvfdee 18; 1rkhd1np1i klsgaylvdd sdpdtslfin vcrdidtlrd pgsqlranp gtaaclvrgh 24; qafdvgqprd kdrl vlsyvreeag kldfcdghsp avtitfvcps erregtipkl ; ryei ewiteyachr dylesktcsl sgeqqdvsid 1tp1aqsggs syisdgkeyl 36; fylnvcgete iqfcnkkqaa chvkksdts qvkaagryhn qtlrysdgdl tliyfggdec 42; ssgfqrmsvi nfecnktagn dgkgtpvftg evdctyfftw vkek edllcgatdg 48; kkrydlsalv rhaepeqnwe avdgsqtete kkhffinich rvlqegkarg cpedaavcav 54; nlgk fisspmkekg niqlsysdgd dcghgkkikt nitlvckpgd lesapvlrts 60; geggcfyefe whtaaacvls ktegenctvf dsqagfsfdl spltkkngay kvetkkydfy 66; invcgpvsvs pcqusgacq vaksdektwn lglsnaklsy ydgmiqlnyr nerh 72; tpratlitf; cdrdagvgfp eyqeednsty nfrwytsyac peeplecvvt dpstleqydl 78; sslaksegg; ggnwyamdns gehvtwrkyy invcrplnpv pgcnryasac quyekdqgs 84; ftevvsisn; gmaktgpvve dsgsllleyv ngsacttsdg trih lvcsrgrlns 90; hpifslnwec vvsflwntea acpiqtttdt rdpn sgfvfnlnpl 96; igkifmfnvc gtmpvcgtil gkpasgceae tqteelknwk parpvgieks lqlstegfit ;02; ltykgplsak gtadafivrf vcnddvysgp lkflhqdids gqgirntyfe gneydltgls tvrkpwtavd tsvdgrkrtf ylsvcnplpy ipgcqgsavg ;l4; sclvsegnsw nlgvvqmqu aaangslsim yvngdkcgnq rfstritfec aqisgspafq ;20; lngceyvfi wrtveacpvv cevk dprhgnlydl kplglndtiv sageytyyfr ;26; sdvc ptsdkskvvs scqekrepqg fhkvaglltq kltyengllk mnftggdtch ;32; kqurstaif fycdrgtqrp Vflketsdcs ylfewrtqya cppfdltecs fkdgagnsfd ;38; lsslsrysdn weaitgtgdp ehylinvcks lapqagtepc ppeaaacllg lgrv ;44; rdgpqwrdgi ivlkyvdgdl kst sesq vnsrpmfisa vedceytfaw ;50; ptatacpmks nehddcqvtn pstghlfdls gfta aysekglvym sicgenencp ;56; pgvgacfgqt ankr lryvdqvlql pcps ksglsyksvi sfvcrpeagp ;62; tnrpmlisld kqtctlffsw htplaceqat ecsvrngssi vdlsplihrt ggyeaydese ;68; ddasdtnpdf plnp mhgvpcpaga avckvpidgp pidigrvagp pilnpianei ;74; ylnfesstpc ;adkhfnyts liafhckrgv smgtpkllrt secdfvfewe tpvvcpdevr ;80; mdgctltdeq ;lysfnlssl ststfkvtrd srtysvgvct favgpqugc kdggvcllsg ;86; tkgasfgrlq smkldyrhqd eavvlsyvng drcppetddg vpcvfpfifn gksyeeciie ;92; sraklwcstt adydrdhewg fcrhsnsyrt ssiifkcded edigrpqvfs evrgcdvtfe ;98; wktkvvcppk kleckqukh llss ltgswslvhn gvsyyinlcq kiykgplgcs 204; erasicrrtt tgdqulglv htqklgvigd skgy pcggnktass vieltctktv 210; grpafkrfdi dsctyyfswd sraacavkpq evqmvngtit npingksfsl gdiyfklfra 216; sgdmrtngdn ylyeiqlssi tssrnpacsg anicqvkpnd thsrkvgts dktkyqudg 222; dldvvfasss kcgkdktksv sstiffhcdp lvedgipefs hetadcqylf vcpl 228; gvgfdsenpg ddgthkgls ersqavgavl slllvaltcc llalllykke rretvisk;t 234; tccrrssnvs ykyskvnkee etdenetewl meeiqlpppr qgkegqengh ittksvka;s 2401 slhgddque devltipevk vhsgrgagae naqs nalqereddr vglvrgekar 2461 kgksssaqqk tvsstklvsf hddsdedllh i 1 A69 Official Symbol: HLA-A Official Name: major histocompatibility complex, class I, A Gene ID: 3105 Organism: Homo s Other Aliases: DAQB-90011.16-002, HLAA Other Designations: HLA class I histocompatibility n, A-1 alpha chain; MHC class I antigen HLA-A heavy chain; antigen presenting molecule; yte n class l-A Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_002116.7 LOCUS NM_002116 ACCESSION NM_002116 l ccaa tcagtgtcgt cgcggtcgct gttctaaagc ccgcacgcac ccaccgggac 6; tcagattctc cccagacgcc gaggatggcc gtcatggcgc cccgaaccct cctcctgcta 12; ctctcggggg ccctggccct gacccagacc tgggcgggct cccactccat gaggtatttc 18; ttcacatccg tgtcccggcc cggccgcggg gagccccgct tcatcgccgt gggctacgtg 24; gacgacacgc agttcgtgcg gttcgacagc gacgccgcga gccagaggat ggagccgcgg ; gcgccgtgga tagagcagga ggggccggag tattgggacc aggagacacg gaatgtgaag 36; gcccagtcac accg agtggacctg ctgc gcggctacta caaccagagc 42; gaggccggtt ctcacaccat aatg tatggctgcg acgtggggtc ggacgggcgc 48; ttcctccgcg ggtaccggca ggacgcctac gacggcaagg attacatcgc cctgaacgag 54L gacctgcgct cttggaccgc catg gcggctcaga tcaccaagcg ggag 60; gcggcccatg aggcggagca gttgagagcc tacctggatg gcacgtgcgt ggagtggctc 66L cgcagatacc tggagaacgg gaaggagacg ctgcagcgca cggacccccc caagacacat 72; atgacccacc accccatctc tgaccatgag gccaccctga ggtgctgggc cctgggcttc 78; taccctgcgg agatcacact gacctggcag cgggatgggg agac ccaggacacg 841 gagctcgtgg agaccaggcc tgcaggggat ggaaccttcc agaagtgggc ggctgtggtg 90; tctg gagaggagca gagatacacc tgccatgtgc agcatgaggg tctgcccaag 961 cccctcaccc tgagatggga gctgtcttcc cagcccacca tccccatcgt gggcatcatt 1021 gctggcctgg ttctccttgg agctgtgatc actggagctg tggtcgctgc cgtgatgtgg 1081 aggaggaaga gctcagatag aaaaggaggg agttacactc caag cagtgacagt 1141 gcccagggct ctgatgtgtc cctcacagct tgtaaagtgt gagacagctg gtgg 1201 gactgagagg caagagttgt tcctgccctt ccctttgtga cttgaagaac 1261 tttctgcaaa ggcacctgca tgtgtctgtg ttcgtgtagg cataatgtga ggaggtqggg agaccacccc acccccatgt tgac cctcttccca cgctgacctg ccaatcatct ttcctgttcc agagaggtgg ggctgaggtg tctccatctc tcatggtgca ctgagctgta tcct tccctattaa aattagaacc tttactttct caaattcttg ccatgagagg ttgatgagtt aattaaagga gaagattcct 156; aaaatttgag agacaaaata agac atgagaacct tccagagtcc aaaa 162; aaaaaaaaaa aaaaaa Protein seguence nt 1): NCBI Reference Seguence: NP_002107.3 LOCUS NP_002107 ACCESSION NP_002107 1 mavmaprtll lllsgalalt thagshsmr yfftsvsrpg rgeprfiavg yvddtqfvrf 61 dsdaasqrme prapwieqeg peywdqetrn vkaqsqtdrv dlgtlrgyyn qseagshtiq 121 imygcdvgsd grflrgyrqd aydgkdyial nedlrswtaa dmaaqitkrk weaaheaeql 181 rayldgtcve wlrrylengk ppk thmthhpisd heatlrcwal itlt 241 wqrdgedqtq dtelvetrpa gdgtquwaa vvvpsgeeqr ytcthhegl pkpltlrwel 301 ssqptipivg iiaglvllga vitgavvaav mwrrkssdrk aass dsaqgsdvsl 361 tackv Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001242758.1 LOCUS NM_001242758 ACCESSION NM_001242758 l gagaagccaa tcgt cgcggtcgct gttctaaagt gcac ccaccgggac 6; tcagattctc cccagacgcc ggcc gtcatggcgc cccgaaccct cctcctgcta 12; ctctcggggg ccctggccct gacccagacc tgggcgggct cccactccat gaggtatttc 18; ttcacatccg tgtcccggcc ngCCgngg gagccccgct tcatcgccgt gggctacgtg 24; gacgacacgc agttcgtgcg gttcgacagc gacgccgcga gccagaagat ggagccgcgg ; gcgccgtgga tagagcagga ggggccggag gacc aggagacacg gaatatgaag 36L gcccactcac accg agcgaacctg gggaccctgc gcggctacta caaccagagc 42L gaggacggtt ctcacaccat ccagataatg tatggctgcg acgtggggcc ggacgggcgc 48L ttcctccgcg ggca ggacgcctac gacggcaagg attacatcgc cctgaacgag 54L gacctgcgct ccgc ggcggacatg gcagctcaga tcaccaagcg caagtgggag 60; gcggtccatg agca gcggagagtc tacctggagg gccggtgcgt ggacgggctc 66; cgcagatacc tggagaacgg gaaggagacg ctgcagcgca cggacccccc caagacacat 72; atgacccacc accccatctc tgaccatgag gccaccctga ggtgctgggc cctgggcttc 78; taccctgcgg agatcacact gacctggcag nggatgggg agac ccaggacacg 84; gagctcgtgg agaccaggcc tgcaggggat ggaaccttcc gggc ggctgtggtg 90; gtgccttctg gagaggagca gagatacacc tgccatgtgc agcatgaggg tctgcccaag 96; cccctcaccc tgagatggga gctgtcttcc acca tccccatcgt gggcatcatt L02; gctggcctgg ttctccttgg agctgtgatc actggagctg tggtcgctgc cgtgatgtgg L08; aggaggaaga gctcagatag aaaaggaggg agttacactc aggctgcaag cagtgacagt L14; gcccagggct ctgatgtgtc tctcacagct tgtaaagtgt gagacagctg ccttgtgtgg L20; gactgagagg caagagttgt tcctgccctt ccctttgtga cttgaagaac cctgactttg L26; tttctgcaaa ggcacctgca tgtgtctgtg ttcgtgtagg gtga ggaggtgggg L32; agagcacccc atgt ccaccatgac cctcttccca cgctgacctg tgctccctct L38; ccaatcatct ttcctgttcc agagaggtgg ggctgaggtg tctccatctc tgtctcaact L44; tcatggtgca ctgagctgta acttcttcct tccctattaa aattagaacc tttactttct caaattcttg ccatgagagg ttgatgagtt agga gaagattcct 1561 aaaatttgag agacaaaatt aatggaacgc atgagaacct tccagagtcc a Protein ce (variant 2): NCBI Reference Seguence: 229687.1 LOCUS N P_001229687 ACCESSION NP_001229687 1 mavmaprtll lllsgalalt thagshsmr yfftsvsrpg rgeprfiavg yvddtqfvrf 61 dsdaasqkme prapwieqeg peywdqetrn mkahsqtdra nlgtlrgyyn qsedgshtiq 121 imygcdvgpd yrqd aydgkdyial nedlrswtaa dmaaqitkrk weavhaaeqr 181 rvylegrcvd engk ethrtdppk thmthhpisd heatlrcwal gfypaeitlt 241 wqrdgedqtq dtelvetrpa gdgtquwaa vvvpsgeeqr egl pkpltlrwel 301 ssqptipivg iiaglvllga vitgavvaav mwrrkssdrk ggsytqaass dsaqgsdvsl 361 tackv P4HA2 Official Sym bol: P4HA2 Official Name: prolyl 4-hydroxylase, alpha ptide II M: 8974 sm: Homo s Other Aliases: UNOZQO/PROBBO Other Designations: 4-PH alpha 2; 4—PH alpha-2; C-P4Halpha(ll); en prolyl 4-hydroxylase alpha(l|); procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide ll; procollagen-proline,2-oxoglutarate- ygenase subunit alpha-2; prolyl 4-hydroxylase subunit alpha-2 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_004199.2 LOCUS NM_0041 99 ACCESSION NM_004199 1 agcgttgttt ttccttggca gctgcggaga cccgtgataa aact aattcaacaa 61 acgggaccct tctgtgtgcc agaaaccgca agcagttgct aacccagtgg gacaggcgga 12- ttggaagagc gggaaggtcc tggcccagag cagtgtggtg agcgctgtgc tggaagggaa l8; tgcgggcagt ttgg tagagcactg actgcctccg gccagaggac ttcccggagg 24L aggtgaccca tgagctggag tggtcagagg aaggctggca aaagggcatc gtggacagag 301 gaacagccta tgtgagtggg agcagagacc ttggccaatg ccattcctta tggccttgta 361 gcaa ggtgatgggg aaggaacact gtaggggata gctgtccacg gacgctgtct 421 acaagaccct ggagtgagat aacgtgcctg gtactgtgcc ctgcatgtgt aagatgccca 48; gttgaccttc gcagcaggag cctggatcag ggcacttcct gcctcaggta ttgctggaca 54; gcccagacac ctgt gaccatgaaa ctctgggtgt ctgcattgct gatggcctgg 60; tttggtgtcc tgagctgtgt gcaggccgaa ttcttcacct ctattgggca catgactgac 66; ctgatttatg cagagaaaga gctggtgcag aaag agtacatcct tgtggaggaa 72; gccaagcttt ccaagattaa gagctgggcc aacaaaatgg aagccttgac tagcaagtca 78; gctgctgatg ctgagggcta cctggctcac cctgtgaatg cctacaaact ggtgaagcgg 84L ctaaacacag ctgc gctggaggac cttgtcctgc aggactcagc tgcaggtttt 90; aacc tctctgtgca gcggcagttc actg atgaggacga gataggagct 96L gccaaagccc tgatgagact caca tacaggctgg acccaggcac aatttccaga L02; cttc caggaaccaa gtaccaggca atgctgagtg tggatgactg tcgg cctacaatga aggggactat tatcatacgg tgttgtggat ggagcaggtg L14; ctaaagcagc ttgatgccgg ggaggaggcc accacaacca agtcacaggt ctcagctatg ctgtcttcca gttgggtgat ctgcaccgtg ccctggagct cacccgccgc L26; ctgctctccc ttgacccaag ccacgaacga gctggaggga atctgcggta ctttgagcag L32; ttattggagg aagagagaga aaaaacgtta acaaatcaga cagaagctga gctagcaacc L38; ccagaaggca tctatgagag gcctgtggac tacctgcctg agagggatgt ttacgagagc L44; ctctgtcgtg gggagggtgt caaactgaca agac agaagaggct tttctgtagg L50; taccaccatg gcaacagggc cccacagctg gccc ccttcaaaga ggaggacgag L56; tgggacagcc Cgcacatcgt caggtactac gatgtcatgt ctgatgagga aatcgagagg L62; atcaaggaga tcgcaaaacc taaacttgca cgagccaccg ttcgtgatcc caagacagga L68; gtcctcactg tcgccagcta ccgggtttcc aaaagctcct ggctagagga agatgatgac L74; cctgttgtgg taaa tcgtcggatg cagcatatca cagggttaac agtaaagact L80; gcagaattgt tacaggttgc aaattatgga ggac agtatgaacc gcacttcgac L86; ttctctagga atgatgagcg agatactttc aagcatttag ggacggggaa tcgtgtggct L92; actttcttaa actacatgag tgatgtagaa gctggtggtg ccaccgtctt L981 ggggctgcaa tttggcctaa gaagggtaca gctgtgttct acct cttgcggagc 204; ggggaagqtg gaac aagacatgct cctg tgcttgtggg ctgcaagtgg 210; gtctccaata agtggttcca tgaacgagga caggagttct cttg tggatcaaca 216; gaagttgact cttt tctgtccttc cccttcctgg tccttcagcc catgtcaacg 222; tgacagacac ctttgtatgt tcctttgtat gttcctatca ggctgatttt tggagaaatg 2281 aatgtttgtc tggagcagag ggagaccata ctagggcgac tcctgtgtga ctgaagtccc 2341 agcccttcca ttcagcctgt gccatccctg gccccaaggc taggatcaaa gtggctgcag 2401 cagagttagc tgtctagcgc ctagcaaggt gcctttgtac ctcaggtgtt ttaggtgtga 2461 gatgtttcag tgaaccaaag ttctgatacc ttgtttacat gtttgttttt atggcatttc 2521 tatctattgt ggctttacca aaaaataaaa tgtccctacc agaagcctta aaaaaaaaaa 2581 aaaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: 190.1 LOCUS N P_0041 90 ACCESSION N P_004190 1 allm awfgvlscvq ighm tdliyaekel vqslkeyilv eeaklskiks 61 wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 121 qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf gmgrsayneg 181 lwme qvlkqldage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 241 eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy eslcrgegvk 301 rlf cryhhgnrap qlliapfkee dewdsphivr deei erikeiakpk 361 laratvrdpk tgvltvasyr vsksswleed ddpvvarvnr rmqhitgltv ktaellqvan 421 ygvggqyeph fdfsrnderd tfkhlgtgnr vatflnymsd veaggatvfp dlgaaiwpkk 481 gtavfwynll yrtr haacpvlvgc kwvsnkwfhe rgqeflrpcg stevd Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001017973.1 LOCUS NM_001017973 ACCESSION NM_001017973 1 agcgttgttt ttccttggca gctgcggaga ataa ttcgttaact aattcaacaa 61 ccct tctgtgtgcc agaaaccgca agcagttgct aacccagtgg cgga 121 ttggaagagc gggaaggtcc tggcccagag cagtgtggtg gtgc tggaagggaa 181 tgcgggcagt gggtacttgg tagagcactg tccg gccagaggac ttcccggagg 241 aggtgaccca tgagctggag tggtcagagg aaggctggca aaagggcatc gtggacagag 301 gaacagccta tgtgagtggg agcagagacc ttggccaatg ccattcctta tggccttgta 36; gtggaagcaa ggtgatgggg cact gtaggggata gctgtccacg gacgctgtct 42; acaagaccct ggagtgagat aacgtgcctg gtactgtgcc ctgcatgtgt aagatgccca 48; cttc gcagcaggag cctggatcag ggcacttcct gcctcaggta gaca 54; acac ttccctctgt gaccatgaaa ctctgggtgt ctgcattgct gatggcctgg 60; tttggtgtcc tgagctgtgt cgaa ttcttcacct ctattgggca catgactgac 66; ctgatttatg cagagaaaga gctggtgcag tctctgaaag agtacatcct tgtggaggaa 72L gccaagcttt ccaagattaa gagctgggcc aacaaaatgg aagccttgac gtca 78L gctgctgatg ctgagggcta cctggctcac cctgtgaatg cctacaaact ggtgaagcgg 84L ctaaacacag actggcctgc gctggaggac ctgc aggactcagc tgcaggtttt 90; atcgccaacc tctctgtgca gttc ttccccactg atgaggacga gataggagct 96; gccaaagccc tgatgagact tcaggacaca tacaggctgg gcac aatttccaga L02; ggggaacttc caggaaccaa gtaccaggca atgctgagtg actg ggccgctcgg cctacaatga aggggactat tatcatacgg tgttgtggat ggagcaggtg L14; ctaaagcagc ttgatgccgg ggaggaggcc accacaacca agtcacaggt gctggactac L20; ctcagctatg ctgtcttcca gttgggtgat ctgcaccgtg ccctggagct cacccgccgc L26; ctgctctccc ttgacccaag ccacgaacga gctggaggga atctgcggta ctttgagcag L32; ttattggagg aagagagaga aaaaacgtta acaaatcaga cagaagctga gctagcaacc L38; ccagaaggca tctatgagag gcctgtggac tacctgcctg agagggatgt ttacgagagc L44; ctctgtcgtg gggagggtgt caaactgaca ccccgtagac agaagaggct tttctgtagg L50; taccaccatg gcaacagggc cccacagctg ctcattgccc ccttcaaaga ggaggacgag L56; tgggacagcc cgcacatcgt ctac gatgtcatgt ctgatgagga aatcgagagg L62; atcaaggaga tcgcaaaacc taaacttgca accg ttcgtgatcc caagacagga L68; gtcctcactg tcgccagcta ccgggtttcc aaaagctcct ggctagagga tgac L74; cctgttgtgg cccgagtaaa tcgtcggatg cagcatatca taac agtaaagact L80; gcagaattgt tacaggttgc aaattatgga gtgggaggac agtatgaacc gcacttcgac L861 ttctctaggc gaccttttga cagcggcctc aaaacagagg ggaataggtt agcgacgttt L92; cttaactaca tgagtgatgt tggt ggtgccaccg tcttccctga L98; gcaatttggc ctaagaaggg tgtg ttctggtaca acctcttgcg gagcggggaa 204; tacc gaacaagaca tgctgcctgc cctgtgcttg tgggctgcaa gtgggtctcc 210; aataagtggt tccatgaacg aggacaggag ttcttgagac cttgtggatc aacagaagtt 216; gactgacatc cttttctgtc cttccccttc ctggtccttc agcccatgtc aacgtgacag 222; ttgt atgttccttt gtatgttcct atcaggctga tttttggaga aatgaatgtt 228; tgtctggagc agagggagac catactaggg cgactcctgt gtgactgaag tcccagccct 234; tccattcagc ctgtgccatc cctggcccca ggat caaagtggct gcagcagagt 240; tagctgtcta gcgcctagca aggtgccttt gtacctcagg tgttttaggt gtgagatgtt 246; tcagtgaacc ctga taccttgttt acatgtttgt ttttatggca tcta 2521 ttgtggcttt accaaaaaat aaaatgtccc taccagaagc cttaaaaaaa aaaaaaaaaa 2581 aa Protein seguence (variant 2): NCBI Reference Seguence: NP_001017973.1 LOCUS N P_001017973 ACCESSION N P_001017973 1 mklwvsallm awfgvlscvq aefftsighm tdliyaekel vqslkeyilv kiks 61 wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 121 qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf yneg 181 dyyhtvlwme qvlkqldage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 241 eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy eslcrgegvk 301 ltprqurlf cryhhgnrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 361 laratvrdpk tgvltvasyr vsksswleed rvnr rmqhitgltv qvan 421 ygvggqyeph fdfsrrpfds glktegnrla tflnymsdve aggatvfpdl gaaiwpkkgt 481 avfwynllrs gegdyrtrha acpvlvgckw vsnkwfherg qeflrpcgst evd Nucleotide seguence (variant 3): NCBI Reference Seguence: 017974.1 LOCUS NM_001017974 ACCESSION 017974 l aagggaggag agct gaccgggcga cgccgcggga ggttctggaa acgccgggag 61 ctgcgagtgt ccagacactt ccctctgtga ccatgaaact gtct gcattgctga 121 tggcctggtt tggtgtcctg agctgtgtgc aggccgaatt cttcacctct attgggcaca 181 tgactgacct tgca gagaaagagc tggtgcagtc tctgaaagag tacatccttg 24; tggaggaagc caagctttcc aagattaaga gctgggccaa caaaatggaa gccttgacta ; gcaagtcagc tgct gagggctacc tggctcaccc tgtgaatgcc tacaaactgg 36; tgaagcggct aaacacagac tggcctgcgc tggaggacct tgtcctgcag gactcagctg 42; caggttttat cgccaacctc cagc ggcagttctt ccccactgat gaggacgaga 48; ctgc caaagccctg cttc aggacacata caggctggac ccaggcacaa 54; tttccagagg ggaacttcca aagt caat gctgagtgtg gatgactgct 60L ttgggatggg ccgctcggcc gaag gggactatta tcatacggtg ttgtggatgg 66L agcaggtgct aaagcagctt gggg aggaggccac cacaaccaag tcacaggtgc 72L tggactacct cagctatgct gtcttccagt tgggtgatct gcaccgtgcc ctggagctca 78L gcct gctctccctt gacccaagcc acgaacgagc tggagggaat ctgcggtact 84; ttgagcagtt attggaggaa gaaa aaacgttaac aaatcagaca gaagctgagc 90; tagcaacccc agaaggcatc tatgagaggc ctgtggacta cctgcctgag agggatgttt 96; gcct ctgtcgtggg gagggtgtca aactgacacc ccgtagacag aagaggcttt L02; tctgtaggta tggc aacagggccc cacagctgct cattgccccc ttcaaagagg L08; aggacgagtg ggacagcccg cacatcgtca ggtactacga tgtcatgtct gatgaggaaa L14; tcgagaggat caaggagatc gcaaaaccta aacttgcacg agccaccgtt cgtgatccca L20; agacaggagt cctcactgtc gccagctacc gggtttccaa aagctcctgg gaag L26; atgatgaccc tgttgtggcc Cgagtaaatc gtcggatgca gcatatcaca gggttaacag L32; taaagactgc agaattgtta caggttgcaa attatggagt gggaggacag tatgaaccgc L38; acttcgactt ctctaggcga ccttttgaca gcggcctcaa aacagagggg aataggttag L44; cgacgtttct taactacatg agtgatgtag aagctggtgg tgccaccgtc ttccctgatc L50; tgggggctgc aatttggcct aagaagggta cagctgtgtt ctggtacaac ctcttgcgga L56; gcggggaagg tgactaccga acaagacatg ctgcctgccc tgtgcttgtg ggctgcaagt L62; ccaa taagtggttc catgaacgag agtt cttgagacct tgtggatcaa L68; cagaagttga ctgacatcct tttctgtcct tccccttcct tcag cccatgtcaa L741 cgtgacagac acctttgtat gttcctttgt atgttcctat caggctgatt tttggagaaa L80; tgaatgtttg tctggagcag agggagacca tactagggcg actcctgtgt ccagcccttc cattcagcct gtgccatccc tggccccaag gctaggatca agcagagtta gctgtctagc gcctagcaag gtgcctttgt acctcaggtg gagatgtttc agtgaaccaa agttctgata ttac atgtttgttt ttatggcatt 2041 tctatctatt gtggctttac caaaaaataa aatgtcccta ccagaagcct aaaa 2101 aaaaaaaaaa Protein seguence (variant 3): NCBI nce Seguence: NP_001017974.1 LOCUS N P_001017974 ACCESSION N P_001017974 l mklwvsallm awfgvlscvq aefftsighm tdliyaekel vqslkeyilv eeaklskiks 6L wankmealts ksaadaegyl ahpvnayklv krlntdwpal uaa gfianlqur 12; qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf gmgrsayneg 18; dyyhtvlwme qvlkqldage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 24; eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy eslcrgegvk ; ltprqurlf cryhhgnrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 36; laratvrdpk tgvltvasyr vsksswleed ddpvvarvnr gltv qvan 42; ygvggqyeph fdfsrrpfds glktegnrla tflnymsdve aggatvfpdl gaaiwpkkgt 48; avfwynllrs gegdyrtrha acpvlvgckw vsnkwfherg qeflrpcgst evd Nucleotide seguence (variant 4): NCBI Reference Seguence: NM_001142598.1 LOCUS NM_001 142598 ACCESSION NM_001142598 l aagggaggag gcgccgagct gcga cgccgcggga ggttctggaa acgccgggag 6; ctgcgagtgt gcgg agacccgtga taattcgtta actaattcaa caaacgggac 12; ccttctgtgt aacc gcaagcagtt gctaacccag tgggacaggc ggattggaag 18; agcgggaagg tcctggccca gagcagtgtg acacttccct ctgtgaccat gaaactctgg 24L gtgtctgcat tgctgatggc ctggtttggt gtcctgagct gtgtgcaggc cgaattcttc ; acctctattg ggcacatgac tgacctgatt tatgcagaga aagagctggt gcagtctctg 36L taca tccttgtgga ggaagccaag ctttccaaga ttaagagctg ggccaacaaa 42; atggaagcct tgactagcaa gtcagctgct gagg gctacctggc tcaccctgtg 48; aatgcctaca aactggtgaa gcggctaaac acagactggc ctgcgctgga ggaccttgtc 54; gact cagctgcagg ttttatcgcc aacctctctg ggca gttcttcccc 60; actgatgagg acgagatagg agctgccaaa gccctgatga gacttcagga cacatacagg 66; ctggacccag gcacaatttc cagaggggaa cttccaggaa ccaagtacca ggcaatgctg 72; agtgtggatg actgctttgg gatgggccgc tcggcctaca atgaagggga ctattatcat 78; acggtgttgt ggatggagca ggtgctaaag cagcttgatg ccggggagga ggccaccaca 84; accaagtcac tgga ctacctcagc tatgctgtct tccagttggg tgatctgcac 90; cgtgccctgg agctcacccg ccgcctgctc gacc caagccacga tgga 96L ctgc ggtactttga gcagttattg gaggaagaga gagaaaaaac gttaacaaat L02; cagacagaag ctgagctagc aaccccagaa ggcatctatg agaggcctgt cctgagaggg atgtttacga gagcctctgt cgtggggagg aact gacaccccgt 1141 aaga ggcttttctg taggtaccac catggcaaca gggccccaca catt L20; gcccccttca agga cgagtgggac agcccgcaca tcgtcaggta atgtctgatg aggaaatcga gaggatcaag gagatcgcaa aacctaaact accgttcgtg atcccaagac aggagtcctc gcca gctaccgggt ttccaaaagc L38; ctag aggaagatga tgaccctgtt gtggcccgag taaatcgtcg gatgcagcat L44; atcacagggt taacagtaaa gactgcagaa ttgttacagg ttgcaaatta tggagtggga L50; ggacagtatg aaccgcactt Cgacttctct aggcgacctt ttgacagcgg cctcaaaaca L56; gaggggaata ggttagcgac gtttcttaac tacatgagtg aagc tggtggtgcc L62; accgtcttcc ctgatctggg ggctgcaatt tggcctaaga cagc tgtgttctgg L68; tacaacctct tgcggagcgg ggaaggtgac taccgaacaa gacatgctgc tgtg L74; cttgtgggct gcaagtgggt ctccaataag tggttccatg aacgaggaca ggagttcttg L80; agaccttgtg caga agttgactga catccttttc tgtccttccc cttcctggtc L86; cttcagccca tgtcaacgtg acagacacct ttgtatgttc ctttgtatgt tcctatcagg L92; ctgatttttg gagaaatgaa tgtttgtctg gagcagaggg agaccatact agggcgactc L98; ctgtgtgact gaagtcccag cccttccatt cagcctgtgc catccctggc cccaaggcta 204; ggatcaaagt ggctgcagca gctg tctagcgcct agcaaggtgc ctttgtacct 2101 caggtgtttt aggtgtgaga tgtttcagtg aaccaaagtt ctgatacctt gtttacatgt 216; ttgtttttat ggcatttcta tctattgtgg ctttaccaaa aaataaaatg tccctaccag 222; aagccttaaa aaaaaaaaaa aaaaaa n seguence (variant 4): NCBI Reference Seguence: NP_001136070.1 LOCUS N P_001 136070 ACCESSION N P_001136070 l mklwvsallm awfgvlscvq aefftsighm ekel vqslkeyilv eeaklskiks 6; wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 12; qffptdedei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf gmgrsayneg 18; dyyhtvlwme qvlkqldage eatttksqvl dylsyaqul gdlhralelt rrllsldpsh 241 eraggnlryf eqlleeerek tltnqteael atpegiyerp vdylperdvy eslcrgegvk ; ltprqurlf cryhhgnrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 36L laratvrdpk tgvltvasyr vsksswleed ddpvvarvnr gltv ktaellqvan 421 ygvggqyeph fdfsrrpfds glktegnrla tflnymsdve aggatvfpdl kkgt 48; avfwynllrs gegdyrtrha acpvlvgckw vsnkwfherg qeflrpcgst evd Nucleotide seguence: (variant 5) NCBI Reference Seguence: NM_001142599.1 LOCUS NM_001 142599 ACCESSION NM_001 142599 l aagggaggag gcgccgagct gaccgggcga cgccgcggga ggttctggaa acgccgggag 6; ctgcgagtgt ccagctgcgg gtga taattcgtta actaattcaa caaacgggac 12; ccttctgtgt gccagaaacc agtt gctaacccag aggc ggattggaag 18; agcgggaagg tcctggccca gagcagtgtg acacttccct ctgtgaccat gaaactctgg 24; gtgtctgcat tgctgatggc ctggtttggt gtcctgagct gtgtgcaggc cgaattcttc ; acctctattg ggcacatgac tgacctgatt tatgcagaga aagagctggt gcagtctctg 36; aaagagtaca tccttgtgga ggaagccaag ctttccaaga ttaagagctg caaa 42; atggaagcct tgactagcaa gtcagctgct gatgctgagg gctacctggc tgtg 48; aatgcctaca aactggtgaa gcggctaaac acagactggc ctgcgctgga ggaccttgtc 54L ctgcaggact cagg ttttatcgcc aacctctctg tgcagcggca gttcttcccc 60; gagg acgagatagg caaa gccctgatga gacttcagga cacatacagg 66L ctggacccag gcacaatttc cagaggggaa ggaa ccaagtacca ggcaatgctg 721 agtgtggatg actgctttgg gatgggccgc tcggcctaca ggga ctattatcat 78; acggtgttgt ggatggagca ggtgctaaag cagcttgatg ccggggagga ggccaccaca 84; accaagtcac aggtgctgga ctacctcagc tatgctgtct tccagttggg tgatctgcac 90; cgtgccctgg agctcacccg ccgcctgctc tcccttgacc caagccacga acgagctgga 96; gggaatctgc ttga attg gaggaagaga gagaaaaaac gttaacaaat L02; gaag ctgagctagc aaccccagaa ggcatctatg agaggcctgt ggactacctg L08; cctgagaggg atgtttacga gagcctctgt cgtggggagg gtgtcaaact gacaccccgt L14; aaga ggcttttctg taggtaccac catggcaaca gggccccaca gctgctcatt L20; gcccccttca aagaggagga cgagtgggac agcccgcaca tcgtcaggta ctacgatgtc L26: atgtctgatg aggaaatcga caag gagatcgcaa aacctaaact atcccaagac aggagtcctc actgtcgcca gctaccgggt aggaagatga tgaccctgtt gtggcccgag taaatcgtcg gggt taacagtaaa gactgcagaa ttgttacagg atta tggagtggga ggacagtatg aaccgcactt cgacttctct aggaatgatg agcgagatac ttagggacgg ggaatcgtgt ggctactttc ttaaactaca tgagtgatgt agaagctggt L62; ggtgccaccg tcttccctga ggct gcaatttggc ctaagaaggg tacagctgtg L68; ttctggtaca acctcttgcg gagcggggaa ggtgactacc gaacaagaca tgctgcctgc L74; cctgtgcttg tgggctgcaa gtgggtctcc aataagtggt tccatgaacg aggacaggag L80; ttcttgagac cttgtggatc aacagaagtt gactgacatc tgtc cttccccttc L86; ctggtccttc agcccatgtc aacgtgacag acacctttgt atgttccttt gtatgttcct L92; atcaggctga tttttggaga aatgaatgtt tgtctggagc agagggagac catactaggg L98; ctgt gtgactgaag tcccagccct tccattcagc catc cctggcccca 204; aggctaggat caaagtggct gcagcagagt tagctgtcta gcgcctagca aggtgccttt 210; gtacctcagg tgttttaggt gtgagatgtt tcagtgaacc aaagttctga taccttgttt 216; ttgt ttttatggca tcta ttgtggcttt aaat aaaatgtccc 222; taccagaagc cttaaaaaaa aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_001136071.1 LOCUS N P_001 136071 ACCESSION N P_001136071 l mklwvsallm awfgvlscvq aefftsighm tdliyaekel vqslkeyilv eeaklskiks 61 wankmealts ksaadaegyl ahpvnayklv krlntdwpal edlvlquaa gfianlqur 12; edei gaakalmrlq dtyrldpgti srgelpgtky qamlsvddcf gmgrsayneg 18; dyyhtvlwme qv;kqldage eatttksqv; dylsyaqul gdlhralelt rrllsldpsh 24; eraggnlryf eq;leeerek tltnqteae; atpegiyerp vdylperdvy eslcrgegvk ; ltprqurlf cryhhgnrap qlliapfkee dewdsphivr yydvmsdeei erikeiakpk 36; laratvrdpk tgvltvasyr vsksswleed ddpvvarvnr rmqhitgltv qvan 42; ygvggqyeph fdfsrnderd tfkhlgtgnr ymsd veaggatvfp dlgaaiwpkk 48- gtavfwynll rsgegdyrtr haacpvlvgc wfhe rgqeflrpcg stevd Official : RBMX Official Name: RNA binding motif protein, ed Gene ID: 27316 Organism: Homo sapiens Other Aliases: RP11-1114A5.1, Other Designations: RNA binding motif protein, X chromosome; RNA-binding motif protein, X chromosome; glycoprotein p43; heterogeneous nuclear ribonucleoprotein G; hnRNP G Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_002139.3 LOCUS NM_002139 ACCESSION NM_002139 l ggtccttcag cctcgttccc gggcagtata aagtttgctg ttgt tcgccctcgt 6; tgcgcagtag tgctagcggc ttcgcggttc ggtcctcgca cccggcagcc gccactggtg 12; ctgagctgct cccc tatcgccgag ctcgttggag cttgaaccca cccc l8; tcac cggcccaaaa aaaaaaaaac atggttgaag cagatcgccc aggaaagctc 24; ttcattggtg ggcttaatac ggaaacaaat gagaaagctc ttgaagcagt caaa ; tatggacgaa tagtggaagt actcttgatg aaagaccgtg aaaccaacaa atcaagagga 36; tttgcttttg tcacctttga aagcccagca gacgctaagg atgcagccag agacatgaat 42; ggaaagtcat tagatggaaa agccatcaag gtggaacaag ccaccaaacc atcatttgaa 48; agtggtagac gtggaccgcc tcca agaagtagag gccctccaag aggtcttaga 54; ggtggaagag gaggaagtgg aggaaccagg ggacctccct cacggggagg acacatggat 60; gacggtggat attccatgaa ttttaacatg tcca ggggaccact cccagtaaaa 66; agaggaccac caccaagaag tgggggtcct cctcctaaga gatctgcacc ttcaggacca 72; gttcgcagta gcagtggaat gggaggaaga gctcctgtat cacgtggaag agatagttat 78; ggaggtccac ctcgaaggga accgctgccc tctcgtagag atgtttattt gtccccaaga 84L gatgatgggt attctactaa agacagctat tcaagcagag attacccaag ttctcgtgat 90; actagagatt atgcaccacc accacgagat tatacttacc gtgattatgg tcattccagt 96L tcacgtgatg catc aagaggatat agcgatagag atggatatgg tcgtgatcgt 1021 gactattcag atcatccaag tggaggttcc gatt catatgagag tcacgtagtg ctccacctac acgagggccc ccgccatctt atggtggaag gatgattaca gcagctcacg tgacggatat ggtggaagtc gagacagtta gatc tctactcaag tggtcgtgat gca gacaagaaag agggcttccc L26; atgg aaagggggta ccctcctcca cgtgattcct acagcagttc aagccgcgga L32; gcaccaagag gtggtggccg tggaggaagc cgatctgata gagggggagg cagaagcaga L38; tactagaaac aaacaaaact ttggaccaaa atcccagttc aaagaaacaa aaagtggaaa L44; ctattctatc accc aaggactact aaaaggaaaa attgtgttac tttttttaaa L50; ttccctgtta agttcccctc cataattttt atgttcttgt gaggaaaaaa gtaaaacatg L56; tttaatttta tttgactttc gcattgcttt tcaacaagca aatgttaaat gtgttaagac L62; ttgtactagt gttgtaactt tccaagtaaa agtatcccct aaaggccact tcctatctga L68; tttttcccag caaatgaggc aggcaattct aagatcttcc acaaaacatc tagccatcta L74; aaatggagag atgaatcatt ctacctatac aaacaagcta gctattagag ggtggttggg L80; gtatgctact attt cagggtgtct tccaactgaa atctcaatgt tctcagtacg L86; aaaaacctga aatcacatgc ctatgtaagg aaagtgctat tcacccagta aaaa L92; tgga taatgctggc cattttgcct ttctgacatt tccttgggaa cctccccttt cccttccccc aataagacca tttaagtgtg tgttaaacaa aata 2041 ctaaataaaa agtttggcca acca tgaagctgca aaaa aaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: 130.2 LOCUS N P_002130 ACCESSION NP_002130 1 mveadrpgkl tetn ekaleavfgk ygrivevllm kdretnksrg fafvtfespa 61 dakdaardmn gksldgkaik veqatkpsfe Sgrrgppppp rsrgpprglr ggrggsggtr 121 gppsrgghmd dggysmnfnm sssrgplpvk rgppprsggp ppkrsapsgp mggr 181 apvsrgrdsy ggpprrep1p srrdvylspr ddgystkdsy ssrdypssrd trdyappprd 241 ytyrdyghss srddypsrgy grdr dysdhpsggs yrdsyesygn srsapptrgp 301 ppsyggssry ddysssrdgy ggsrdsysss rsdlyssgrd rvgrqerglp psmergyppp 361 rdsyssssrg aprgggrggs rsdrgggrsr Y tide seguence (variant 2): NCBI Reference Seguence: NM_001164803.1 LOCUS NM_001 164803 ACCESSION NM_001 164803 1 ggtccttcag cctcgttccc tata aagtttgctg tctcctttgt tcgccctcgt 6" gtag tgctagcggc ttcgcggttc ggtcctcgca cccggcagcc ggtg 12; ctgagctgct aggaagcccc tatcgccgag ctcgttggag cttgaaccca ttgtcacccc 18; tccgactcac cggcccaaaa aaaaaaaaac atggttgaag cagatcgccc aggaaagctc 24; ttcattggtg ggcttaatac ggaaacaaat gagaaagctc ttgaagcagt atttggcaaa ; tatggacgaa aagt actcttgatg aaagaccgtg aaaccaacaa agga 36; tttgcttttg tcacctttga aagcccagca gacgctaagg atgcagccag agacatgaat 42; ggaaagctcc tgtatcacgt ggaagagata gttatggagg tccacctcga agggaaccgc 48; ctcg tgtt tatttgtccc atga tgggtattct actaaagaca 54; gctattcaag ttac ccaagttctc ctag agattatgca ccaccaccac 60; gagattatac ttaccgtgat tatggtcatt ccagttcacg tgatgactat ccatcaagag 66; gatatagcga tagagatgga tatggtcgtg atcgtgacta ttcagatcat ccaagtggag 72; gttcctacag agattcatat gagagttatg gttggtgatt ttgctcatta tggtcgtgga 78L gtgctgattg attcacagta gataaagctg gcagtaagaa atgctaagag ttgttgaagc 84L agaaggcggc tgattgtcaa taagtcacta cagttgcata agcagtgctg tcagaattgg 90; gcag gcaatagatt ttgccttcag gggttcctgt ggatctgagg aaggcatcag 96; tgttgattaa cactcataac tagggagtga ctggtagtta cttaaagcaa gtaattgacc 1021 aaatggaaaa gtaa ttaaggaaat gtgg aggtagtcag gaagttcttg 1081 tggttcttca catagatttt acagctttgg ctttcatttt gtttagctaa agtcatgggg 1141 acaactcttc aatttagaac ttaagttgaa ttataaaaat gatggatata agtggtagct 1201 gtatctagtg aagtgtctgt cagtaagtga aacatttttt ggtggtggct tatccacaaa 1261 cagtttagtt gtagaataaa acttatgagt tgga aagtaaccat gctaagatgg 1321 caagcacact aatt aggccacttg tttt ttgt tactttacct cccagtcttg gaaacaagtt ttagtttttt attggtttgg caatagtata atgttctcaa aggaaacaga cttgagttgt tggattagag aacttatatg attttttttt tgtttttgtc gtgtagttat ggcactgtct atttgcaact agggataata caacattttt aactctcatt tgacaaccta cagaccacaa gggtaatgac tatg tggtttttgc actccatagt tgtcttagcc 168; caatctttct atactcttac gattacttgg gttaacgctt ctgtgaggac ttgagatacc attt aagatattta gatatcttga agatagtata ggatatagag 180; attgtaccaa ataggaatat aaggagtatg ttaaaatgac cagatacctg tttgatagtt 1861 tactgaccta gcagatgtgt ggaaaaggaa tcagatcttg attcttctgg gtttatactg 1921 gttgtaaaac agaatgatac agaaaatgtt ttccttgttt aactggtagt tgaacataga 1981 acttgggtat tatagatcac ttttcacttt ttggaatgtt ttgtattgaa acttaataaa 2041 actttaacat ggaaaaaaaa aaaaaaaaaa a Protein seguence (variant 2): NCBI Reference Seguence: NP_001158275.1 LOCUS N P_001 158275 ION NP_001158275 1 mveadrpgkl figg:.ntetn ekaleavfgk vllm kdretnksrg fafvtfespa 61 dakdaardmn gk:.1yhveei vmevhlegnr cplvemficp qemmgillkt aiqaeitqvl 121 vileimhhhh eiiltvimvi pvhvmtihqe diaiemdmvv ivtiqiiqve vpteihmrvm 181 ygrg Vlidsq |BP7 Official Symbol: |GFBP7 Official Name: insulin-like growth factor binding protein 7 Gene ID: 3490 Organism: Homo sapiens Other Aliases: AGM, FSTL2, IBP—7, IGFBP-7, IGFBP-7v, IGFBPRP1, MAC25, PSF, RAMSVPS, TAF Other Designations: lGF-binding protein 7; rP1; PGI2—stimulating factor; odulin; insulin-like growth factor-binding protein 7; prostacyclinstimulating factor; tumor-derived adhesion factor Nucleotide ce (variant 1): NCBI Reference ce: 553.2 LOCUS NM_001 553 ACCESSION NM_001553 l actcgcgccc ttgccgctgc caccgcaccc cgccatggag cggccgtcgc tgcgcgccct 6; gctcctcggc gccgctgggc tgctgctcct gctcctgccc ctctcctctt cctcctcttc 12; ggacacctgc ggcccctgcg agccggcctc ctgcccgccc ctgcccccgc tgggctgcct l8; gctgggcgag acccgcgacg cgtgcggctg ctgccctatg tgcgcccgcg gcgagggcga 24; gccgtgcggg ggtggcggcg gggg cgcg ccgggcatgg agtgcgtgaa ; gagccgcaag aggcggaagg ccgg agcc ggcggtccgg gtgtaagcgg 36; cgtgtgcgtg tgcaagagcc gctacccggt gtgcggcagc gacggcacca cctacccgag 42; cggctgccag ctgcgcgccg ccagccagag ggccgagagc cgcggggaga aggccatcac 48; ccaggtcagc aagggcacct gcgagcaagg tccttccata gtgacgcccc ccaaggacat 54; ctggaatgtc actggtgccc aggtgtactt gagctgtgag gtcatcggaa tcccgacacc 60; tgtcctcatc tggaacaagg taaaaagggg tcactatgga gttcaaagga cagaactcct 66; gcctggtgac cgggacaacc tggccattca gacccggggt ggcccagaaa aagt 72; aactggctgg gtgctggtat ctcctctaag taaggaagat gctggagaat atgagtgcca 78; tgcatccaat tcccaaggac aggcttcagc atcagcaaaa attacagtgg ttgatgcctt 84L acatgaaata ccagtgaaaa aaggtgaagg tgccgagcta taaacctcca gaatattatt 90; agtctgcatg gttaaaagta gtcatggata actacattac ctgttcttgc ctaataagtt 96L tcttttaatc caatccacta acactttagt tatattcact ggttttacac agagaaatac 102; aaaataaaga atca agactatcta caaaaattta ttatatattt acagaagaaa 108; agcatgcata tcattaaaca aataaaatac tcac aacacagtaa aaaaaaa Protein ce (variant 1 ): NCBI Reference Seguence: NP_001544.1 LOCUS N P_001 544 ACCESSION 544 1 merpslrall 1gaaglllll lplssssssd tcgpcepasc pplpplgcll getrdacgcc 61 egep grgy capgmecvks rkrrkgkaga aaggpgvsgv ypvc 121 gsdgttypsg sqra esrgekaitq vskgtcqup sivtppkdiw nvtgaquls 181 cevigiptpv 1iwnkvkrgh yqurtellp gdrdnlaiqt rggpekhevt nglvsplsk 241 edageyecha snsngasas akitvvdalh eipvkkgega e1 tide seguence (variant 2) NCBI Reference Seguence: NM_001253835.1 LOCUS NM_001253835 ACCESSION NM_001253835 1 actcgcgccc ctgc caccgcaccc cgccatggag cggccgtcgc tgcgcgccct 6; gctcctcggc gccgctgggc tgctgctcct gctcctgccc ctctcctctt cctcctcttc 12; ggacacctgc tgcg agccggcctc ctgcccgccc ctgcccccgc tgggctgcct 18; gctgggcgag acccgcgacg cgtgcggctg ctgccctatg tgcgcccgcg gcgagggcga 24; gccgtgcggg ggtggcggcg gggg gtactgcgcg ccgggcatgg agtgcgtgaa ; gagccgcaag aggcggaagg gtaaagccgg ggcagcagcc ggcggtccgg gtgtaagcgg 36; cgtgtgcgtg tgcaagagcc gctacccggt gtgcggcagc acca cctacccgag 42; cggctgccag ctgcgcgccg agag ggccgagagc cgcggggaga aggccatcac 48; ccaggtcagc aagggcacct gcgagcaagg tccttccata gtgacgcccc ccaaggacat 54; ctggaatgtc actggtgccc aggtgtactt gagctgtgag gtcatcggaa tcccgacacc 60; catc tggaacaagg taaaaagggg tcactatgga gttcaaagga cagaactcct 661 gcctggtgac cgggacaacc tggccattca gacccggggt ggcccagaaa agcatgaagt 721 aactggctgg gtgctggtat ctcctctaag taaggaagat gctggagaat atgagtgcca 78L tgcatccaat tcccaaggac aggcttcagc aaaa attacagtgg ttgatgcctt 84L acatgaaata ccagtgaaaa caca ataaatctca cagccattta aaaatgacta 90; gtacatttgc tttaaaaaga acagaactaa gtatgaaagt atcagacgta gctattgatg 96; aaattctgta gttagcaacc cataagggca ttaagtatgc cattaaaatg tacagcatga 1021 gactccaaaa gattatctgg atgggtgact g Protein seguence (variant 2): NCBI Reference Seguence: NP_001240764.1 LOCUS N P_001240764 ACCESSION NP_001240764 1 merpslrall 1gaag11111 lplssssssd tcgpcepasc pplpplgcll getrdacgcc 61 pmcargegep cggggagrgy capgmecvks rkrrkgkaga aaggpgvsgv cvcksrypvc 121 gsdgttypsg cqlraasqra esrgekaitq vskgtcqup sivtppkdiw nvtgaquls 181 cevigiptpv 1iwnkvkrgh yqurtellp gdrdnlaiqt hevt nglvsplsk 241 echa snsngasas dalh eipvkkgtq Official Symbol: HLA-C Official Name: major histocompatibility complex, class I, C m: 3107 Organism: Homo sapiens Other Aliases: XXbac-BCX101P6.2, D68204, 3, HLC-C, PSORS1 Other Designations: HLA class I histocompatibility antigen, G alpha chain; HLA class I histocompatibility antigen, Cw—1 alpha chain; MHC class I antigen heavy chain HLA-C; human leukocyte antigen-C alpha chain; major histocompatibility antigen HLA-C Nucleotide seguence (variant 1): NCBI Reference ce: NM_002117.5 LOCUS NM_002117 ACCESSION 117 1 tccgcagtcc taaa gtccccagtc acccacccgg attc tccccagagg 61 ccgagatgcg ggtcatggcg ccccgagccc tcctcctgct gctctcggga ggcctggccc 121 tgaccgagac ctgggcctgc tcccactcca tgaggtattt cgcc gtgtcccggc 181 ccggccgcgg agagccccgc ttcatctcag tgggctacgt ggacgacacg cagttcgtgc 241 ggttcgacag cgacgccgcg agtccgagag cgcg ggcgccgtgg gtggagcagg 301 aggggccgga gtattgggac cgggagacac agaagtacaa gcgccaggca caggctgacc 36; gagtgagcct gcggaacctg cgcggctact acaaccagag cggg tctcacaccc 42; tccagaggat gtctggctgc gacctggggc ccgacgggcg cctcctccgc gggtatgacc 48; agtccgccta cgacggcaag gattacatcg ccctgaacga ggacctgcgc tcctggaccg 54; ccgcggacac cgcggctcag atcacccagc gcaagttgga ccgt gcggcggagc 60; agctgagagc ctacctggag ggcacgtgcg tggagtggct ccgcagatac ctggagaacg 66; ggaaggagac gctgcagcgc gcagaacccc caaagacaca ccac caccccctct 721 ctgaccatga ggccaccctg tggg ccctgggctt tgcg gagatcacac 781 tgacctggca gcgggatggg gaggaccaga cccaggacac cgagcttgtg gagaccaggc 841 cagcaggaga tggaaccttc cagaagtggg cagctgtggt ggtgccttct ggacaagagc 90; agagatacac tatg cagcacgagg ggctgcaaga gcccctcacc ctgagctggg 96; agccatcttc ccagcccacc atccccatca tgggcatcgt tgctggcctg gctgtcctgg 102; ttgtcctagc tgtccttgga gctgtggtca ccgctatgat gtgtaggagg aagagctcag 108; gtggaaaagg agggagctgc tctcaggctg cgtgcagcaa ccag ggctctgatg 114; agtctctcat cacttgtaaa gcctgagaca gctgcctgtg tgggactgag atgcaggatt 120; tcttcacacc tctcctttgt gacttcaaga gcctctggca tctctttctg caaaggcacc 126; tgaatgtgtc tgcgttcctg ttagcataat gtgaggaggt acag cccacccccg 132; tgtccaccgt gacccctgtc cccacactga cctgtgttcc ctccccgatc atctttcctg 138; ttccagagag gtggggctgg atgtctccat ctctgtctca aattcatggt gcactgagct 144; gcaacttctt acttccctaa tgaagttaag aacctgaata taaatttgtg ttctcaaata 150; tttgctatga agcgttgatg gattaattaa caat tcctagaagt tgagagagca 56; aataaagacc tgagaacctt ccagaa Protein seguence (variant 1): NCBI Reference Seguence: 108.4 LOCUS N P_0021 08 ION N P_002108 1 mrvmaprall lllsgglalt etwacshsmr yfdtavsrpg rgeprfisvg yvddtqfvrf 61 dsdaasprge prapwveqeg peywdretqk drv slrnlrgyyn qsedgshth 121 rmsgcdlgpd grllrgydqs aydgkdyial nedlrswtaa dtaaqitqu leaaraaeql 181 raylegtcve wlrrylengk ppk thvthhplsd heatlrcwal gfypaeitlt 241 dqtq dtelvetrpa gdgtquwaa vvvpsgqeqr ytchmqhegl qepltlswep 301 ssqptipimg ivaglavlvv lavlgavvta mmcrrkssgg kggscsqaac snsaqgsdes 361 litcka Nucleotide ce (variant 2): NCBI Reference Seguence: NM_001243042.1 LOCUS NM_001243042 ACCESSION NM_001243042 l tccgcagtcc cggttctaaa gtccccagtc acccacccgg actcacattc tccccagagg 6L ccgagatgcg ggtcatggcg ccccgagccc tcctcctgct gctctcggga gccc 12L tgaccgagac ctgggcctgc tcccactcca tgaggtattt cgacaccgcc gtgtcccggc 18; ccggccgcgg agagccccgc ttcatctcag tgggctacgt cacg cagttcgtgc 24; ggttcgacag cgacgccgcg agtccgagag gggagccgcg ggcgccgtgg gtggagcagg ; aggggccgga gtattgggac cgggagacac agaactacaa gcgccaggca caggctgacc 36; gagtgagcct gcggaacctg cgcggctact acaaccagag cgaggacggg accc 42; tccagaggat gtatggctgc gacctggggc ccgacgggcg cctcctccgc gggtatgacc 48; agtccgccta caag gattacatcg ccctgaacga ggacctgcgc tcctggaccg 54; ccgcggacac cgcggctcag atcacccagc gcaagttgga ggcggcccgt gcggcggagc 60; agctgagagc ctacctggag ggcacgtgcg tggagtggct ccgcagatac ctggagaacg 66; ggaaggagac gctgcagcgc gcagaacccc caaagacaca cgtgacccac caccccctct 72; ctgaccatga ggccaccctg aggtgctggg gctt ctaccctgcg gagatcacac 78; tgacctggca gcgggatggg gaggaccaga cccaggacac cgagcttgtg gagaccaggc 84; cagcaggaga tggaaccttc cagaagtggg cagctgtggt ttct ggacaagagc 90; agagatacac gtgccatatg gagg ggctgcaaga gcccctcacc ctgagctggg 96; cttc ccagcccacc atccccatca tgggcatcgt tgctggcctg gctgtcctgg 202; ttgtcctagc tgtccttgga gctgtggtca ccgctatgat gtgtaggagg aagagctcag L08: gtggaaaagg agggagctgc tctcaggctg cgtgcagcaa cagtgcccag ggctctgatg 114; agtctctcat cacttgtaaa gcctgagaca gctgcctgtg tgggactgag tcttcacacc ttgt aaga gcctctggca tctctttctg caaaggcgtc 1261 tgtc tgcgttcctg ttagcataat gtgaggaggt ggagagacag cccacccccg L32; tgtccaccgt gacccctgtc cccacactga cctgtgttcc ctccccgatc cctg L38; agag gtggggctgg atgtctccat ctctgtctca aattcatggt gcactgagct 1441 gcaacttctt acttccctaa taag aacctgaata taaatttgtg ttctcaaata 1501 tttgctatga agcgttgatg gattaattaa ataagtcaat tcctagaagt tgagagagca 1561 aataaagacc tgagaacctt ccagaa Protein ce (variant 2): NCBI Reference Seguence: NP_001229971.1 LOCUS NP_001229971 ACCESSION NP_001229971 l mrvmaprall lllsgglalt etwacshsmr yfdtavsrpg rgeprfisvg yvddtqfvrf 61 dsdaasprge prapwveqeg peywdretqn yquaqadrv slrnlrgyyn qsedgshth 121 rmygcdlgpd grllrgydqs aydgkdyial wtaa dtaaqitqu leaaraaeql 181 raylegtcve wlrrylengk ppk thvthhplsd heatlrcwal itlt 241 wqrdgedqtq dtelvetrpa gdgtquwaa qeqr ytchmqhegl swep 301 ssqptipimg ivaglavlvv lavlgavvta mmcrrkssgg kggscsqaac snsaqgsdes 361 litcka RRAS2 Official Sym bol: RRAS2 Official Name: related RAS viral (r-ras) oncogene homolog 2 Gene ID: 22800 Organism: Homo sapiens Other Aliases: TC21 Other Designations: ras-like protein TC21; ras-related protein R-Ras2; teratocarcinoma oncogene Nucleotide ce (variant 1): NCBI Reference Seguence: NM_012250.5 LOCUS NM_012250 ACCESSION NM_012250 1 cagacggcca tttgtggcgg cgctggaggc tgcgttcggc aggcgctgcg gagacgcgta 61 cgcg ccccccggcc gctgccgccc ctggcccgtg ccgtcacccc gcttctccgc 12; gcctcgggcg gtacccagcc agtccccagc ctac cgcgctgacc ggccctccag 18; acgcctcccg gtacccggga ccccagcccg gccgctcgcc cgcagcccgc caca 24; cgtccccgga gccgggccta gggcgggcgg cagcggcggc tcggcgcagt caggctgggc ; tctgtagcgt ccccatggcc gcggccggct ggcgggacgg ctccggccag gagaagtacc 36; ggctcgtggt cggg ggcggcgtgg gcaagtcggc gctcaccatc cagttcatcc 42; agtcctattt tgtaacggat tatgatccaa ccattgaaga ttcttacaca aagcagtgtg 48L atga cagagcagcc cggctagata ttttggatac agcaggacaa gaagagtttg 54L gagccatgag agaacagtat atgaggactg gcgaaggctt ggtc ttttcagtca 60; cagatagagg cagttttgaa gaaatctata agtttcaaag acagattctc agagtaaagg 66L atcgtgatga gttcccaatg attg gtaataaagc ggat catcaaagac 72; aggtaacaca ggaagaagga caacagttag agct aaca tacatggagg 78; catcagcaaa gattaggatg aatgtagatc aagctttcca tgtc cgggttatca 84; ggaaatttca agagcaggaa tgtcctcctt caccagaacc aacacggaaa gaaaaagaca 90; agaaaggctg ccattgtgtc attttctaga atcccttcag ttttagctac caacggccag 96; gaaaagccct ctct ttctctcctc agtttacatc ttgttggtac ctttctagcc L02; ttagacaaat gatcaccatg ttagccttag acgaagaagc tggctagtcc tttctgtgaa L08; gctaatacaa tggtcatttc cagacaaatt taaaggaaac actaaggctg cttcaaagat L14; tatctgattc ctttaaaata tatgtctata tacacagaca tgctcttttt ttaagtgctt L20; acattttaat agagatgaat cagttttgga atctaagctg tttgccaagc tgaagctaca L26; ggttgtgaaa taatttttaa cttttggaat catactgcct actgttactc gaaa L32; tatagggttt tttttaatgt gaatttttgc ttaa acatttcaat gtcagccttt L38; gttaacctta ctga attgaatcta caaaagtgaa ccatctcaga cctttactga L44; tactacaact tttgttttct gatggccaaa ataccaaatg cctgttgtat atta L50; aaaactgctt ataaaaccct gtgttactac tcctactctt ggagatgata atattctatg L56; tggtcaaata tttggactca tttaggactt agatatttca gtgtacttga ttttttaatt L621 taactctttt ccac gctaagggta aaaaggaata atttccttct L68; tcaagtattt ctgggtaagg gattcaaaaa actaaaactg tttttgtttg taatataaaa L74; tatggaattg ccag ggtcagagat gattaatgtt tttgctatat acttttatac L80; attattttct tatcaaacta gttaacaagt atttttatat gtttgtaagc agatatgctt L86; tcatagcata ccttgtgtat atgtaaagat aagtatttaa ttctcactgt tcacttttaa 1921 ctgacaaaga aaaacaagtg gaaactacag aaactgtggt agaactttta cttgctggtc 1981 tggtcttggt tgtacccatc tttggccagt cacataacta ctcaagaaac cttcccaata 2041 gagtacaaca ggatgagact ctgaaatcac tatt ccctgctaga tattgattgt 2101 tatttcaagt tgta agcttttaat ggataattag tataactgtg gatggcatct 2161 gattttgttt ttaattctgt ggattgtgtt taagcaattc aatagtatgt tcctgatttt 2221 gagatgctaa gtggtattgc acagttgtca ctttatcaag tgtgtacaac atga 2281 agtttataga gcataccctt gtatagcttc aggtgctaga attaaaattg atctgttatc 2341 acaagaaaaa aaaaaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_036382.2 LOCUS N P_036382 ACCESSION NP_036382 1 maaagwrdgs gqekyrlvvv ggggvgksal tiqfiqsyfv tdydptieds ytchviddr 61 aarldildta gqeefgamre qymrtgegfl 1vfsvtdrgs feeiyqurq ilrvkdrdef 121 pmilignkad 1dhqrqvtqe egqqlarqlk vtymeasaki rmnvdqafhe 1vrvirque 181 qecppspept rkekdkkgch cvif Nucleotide seguence (variant 2): NCBI Reference ce: NM_001102669.2 LOCUS NM_001 102669 ACCESSION NM_001 102669 1 gcgg agctggaagg gtgggaagca cacc ttattgctct ggccgaggcc 6; agagacctcc gggagaggct gggccaccga gccgggcttt actgctccga gggtccgggc 121 gtggggctgg agctggagcc ccgcgcgctg cttttccagc cgcctgcggc ttca l8; ccgtcggggc gatagcggtg gcaacttggc cgcggctccg cgtggtctcc gggcttcccc 24L gcgccgcctg agccggagct gcccgcttca atcctatttt gtaacggatt caac ; cattgaagat tcttacacaa agcagtgtgt gatagatgac agagcagccc ggctagatat 36; taca gcaggacaag aagagtttgg agccatgaga gaacagtata tgaggactgg 42; cgaaggcttc gtct tttcagtcac aggc agttttgaag aaatctataa 48; gtttcaaaga cagattctca gagtaaagga tcgtgatgag ttcccaatga ttttaattgg 54; taataaagca gatctggatc atcaaagaca ggtaacacag ggac aacagttagc 60; acggcagctt aaggtaacat acatggaggc atcagcaaag attaggatga atca 66; ccat gaacttgtcc gggttatcag gaaatttcaa gagcaggaat gtcctccttc 72; accagaacca acacggaaag aaaaagacaa gaaaggctgc cattgtgtca ttttctagaa 78; tcccttcagt tttagctacc cagg aaaagccctc atcttctctt tctctcctca 84L gtttacatct tgttggtacc tttctagcct tagacaaatg atcaccatgt tagccttaga 90; cgaagaagct ggctagtcct ttctgtgaag ctaatacaat ggtcatttcc agacaaattt 96L aaaggaaaca ctaaggctgc ttcaaagatt atctgattcc tttaaaatat atgtctatat 102; acacagacat gctctttttt taagtgctta cattttaata aatc agttttggaa L08; tctaagctgt ttgccaagct gaagctacag gttgtgaaat aatttttaac atactgccta ctgttactct aaatagaaat atagggtttt ttttaatgtg aatttttgcc L20; tatctttaaa catttcaatg tcagcctttg ttaaccttaa atacactgaa ttgaatctac L26; aaaagtgaac catctcagac ctttactgat actacaactt tctg atggccaaaa L32; taccaaatgc ctgttgtatt tatggattaa aaactgctta taaaaccctg tgttactact L38; cctactcttg gagatgataa atgt ggtcaaatat ttggactcat ctta L44; gatatttcag tgtacttgat attt aactcttttt cacagccacg ctaagggtaa L50; aaaggaataa tttccttctg tcttcctttt caagtatttc tgggtaaggg attcaaaaaa L56; ctaaaactgt ttttgtttgt aatataaaat atggaattga tctttccagg gtcagagatg L62; attaatgttt ttgctatata cttttataca ttattttctt ctag ttaacaagta L68; tttttatatg tttgtaagca gatatgcttt atac tata tgtaaagata L74; taat tctcactgtt cacttttaac tgacaaagaa aaacaagtgg aaactacaga L80; aactgtggta gaacttttac ttgctggtct ggtcttggtt gtacccatct ttggccagtc L86; acataactac aacc ttcccaatag agtacaacag gatgagactc tgaaatcact L92; ttcagtattc agat attgattgtt atttcaagta ttaagtgtaa gcttttaatg L981 gataattagt ataactgtgg tctg attttgtttt taattctgtg gattgtgttt 204; aagcaattca atagtatgtt cctgattttg agatgctaag tggtattgca cagttgtcac 210; tttatcaagt gtgtacaaca gtcccatgaa gtttatagag catacccttg tatagcttca 216; agaa ttaaaattga tctgttatca caagaaaaaa aaaaaaaaa Protein seguence (varaiant 2): NCBI Reference Seguence: NP_001096139.1 LOCUS N P_001096139 ACCESSION N 96139 1 mreqymrtge gfli.vfsvtd rgsfeeiykf qrqilrvkdr defpmilign kadldhqrqv 61 tqeegqqlar q:.kvtymeas akirmnvdqa fhelvrvirk fqeqecppsp kdkk 121 gchcvif Nucleotide seguence (variant 3): NCBI Reference Seguence: NM_001177314.1 LOCUS NM_001177314 ACCESSION NM_001177314 1 atctcagatg catgcagctc ctgctgggcg gtttcattct ctgccagcca ttcattcaca 6; gcaa tggcccatta aggaaataag tggatgatgc tcctccatca cccatgtcct 12; attttgtaac ggattatgat ccaaccattg aagattctta cacaaagcag tgtgtgatag 18; atgacagagc agcccggcta gatattttgg atacagcagg acaagaagag tttggagcca 24; tgagagaaca gtatatgagg actggcgaag gcttcctgtt ggtcttttca gtcacagata ; gaggcagttt tgaagaaatc tataagtttc aaagacagat agta aaggatcgtg 36; atgagttccc aatgatttta attggtaata aagcagatct ggatcatcaa gtaa 42; cacaggaaga aggacaacag ttagcacggc agcttaaggt aacatacatg gaggcatcag 48; caaagattag tgta gatcaagctt tccatgaact tgtccgggtt atcaggaaat 54; ttcaagagca ggaatgtcct ccttcaccag aaccaacacg gaaagaaaaa gacaagaaag 60; attg tgtcattttc ccct tcagttttag ctaccaacgg ccaggaaaag 66; ccctcatctt ctctttctct cctcagttta catcttgttg gtacctttct agccttagac 72; aaatgatcac catgttagcc ttagacgaag gcta gtcctttctg tgaagctaat 78; acaatggtca gaca aatttaaagg aaacactaag gctgcttcaa agattatctg 841 ttaa aatatatgtc tatatacaca gacatgctct ttttttaagt gcttacattt 90; taatagagat gaatcagttt tggaatctaa gctgtttgcc aagctgaagc tacaggttgt 961 gaaataattt ttaacttttg gaatcatact gcctactgtt actctaaata tagg 102; ttta atgtgaattt ttgcctatct ttaaacattt caatgtcagc ctttgttaac 108; cttaaataca ctgaattgaa tctacaaaag tgaaccatct cagaccttta ctgatactac 114; aacttttgtt ttctgatggc caaaatacca aatgcctgtt gtatttatgg attaaaaact 120; gcttataaaa ccctgtgtta ctactcctac tcttggagat gataatattc tatgtggtca 126; aatatttgga ctcatttagg acttagatat ttcagtgtac ttgatttttt aatttaactc L32; tttttcacag taag ggtaaaaagg ttcc ttctgtcttc cttttcaagt L38; atttctgggt aagggattca aaaaactaaa actgtttttg tttgtaatat tgga L44; attgatcttt ccagggtcag agatgattaa tgct atatactttt atacattatt 150; ttcttatcaa actagttaac aagtattttt ttgt aagcagatat gctttcatag :56: cataccttgt gtatatgtaa agataagtat ttaattctca ctgttcactt ttaactgaca L62; aagaaaaaca aact acagaaactg tggtagaact tttacttgct catctttggc cagtcacata actactcaag aaaccttccc gactctgaaa tcag ctgc tagatattga tgtaagcttt taatggataa ttagtataac tgtggatggc ctgtggattg agca attcaatagt atgttcctga ctaagtggta ttgcacagtt gtcactttat caagtgtgta caacagtccc atgaagttta L98; tagagcatac ccttgtatag gtgc tagaattaaa attgatctgt tatcacaaga 204; aaaaaaaaaa aaaa Protein seguence (variant 3): NCBI Reference Seguence: NP_001170785.1 LOCUS N P_001 170785 ACCESSION N P_001170785 1 msyfvtdydp tiedsytch viddraarld ildtagqeef gamreqymrt gegfllvfsv 61 tdrgsfeeiy qurqilrvk drdefpmili gnkadldhqr qvtqeegqql arqlkvtyme 121 asakirmnvd qafhelvrvi rqueqecpp speptrkekd kkgchcvif Nucleotide seguence (variant 4): NCBI Reference Seguence : NM_001177315.1 LOCUS NM_001177315 ACCESSION 177315 l attgctctgg ccgaggccag agacctccgg gagaggctgg gccaccgagc cgggctttac 61 tgctccgagg gtccgggcgt ggggctggag cccc gcgcgctgct tttccagccg 121 cctgcggccg Cgccttcacc gtcggggcga tagcggtggc aacttggccg cggctccgcg 181 tggtctccgg gcttccccgc gccgcctgag ccggagctgc ccgcttcaag tactgtgtat 24; gttc ctattttgta acggattatg atccaaccat tgaagattct tacacaaagc ; agtgtgtgat agatgacaga gcagcccggc tagatatttt ggatacagca ggacaagaag 36; agtttggagc catgagagaa cagtatatga ggactggcga aggcttcctg ttggtctttt 42; cagtcacaga cagt tttgaagaaa agtt tcaaagacag agag 48; taaaggatcg tgatgagttc ccaatgattt taattggtaa taaagcagat ctggatcatc 54; aaagacaggt aacacaggaa gaaggacaac agttagcacg gcagcttaag gtaacataca 60L tggaggcatc agcaaagatt aatg tagatcaagc tgaa cggg 66L ttatcagqaa agag caggaatgtc ctccttcacc agaaccaaca cggaaagaaa 72; aagacaagaa aggctgccat tgtgtcattt tctagaatcc cttcagtttt agctaccaac 78L ggccaggaaa agccctcatc ttctctttct ctcctcagtt tacatcttgt tggtaccttt 84; ttag acaaatgatc accatgttag acga agaagctggc tagtcctttc 90; tgtgaagcta atacaatggt catttccaga caaatttaaa ggaaacacta aggctgcttc 96; aaagattatc tgattccttt aaaatatatg tctatataca cagacatgct ctttttttaa L02; gtgcttacat tttaatagag atgaatcagt tttggaatct aagctgtttg tgaa L08; gctacaggtt gtgaaataat ttttaacttt tggaatcata ctgcctactg ttactctaaa L14; tata gggttttttt taatgtgaat ttttgcctat ctttaaacat ttcaatgtca L20; gcctttgtta accttaaata cactgaattg aatctacaaa agtgaaccat ctcagacctt L26; tactgatact acaacttttg ttttctgatg gccaaaatac caaatgcctg ttgtatttat L32; ggattaaaaa ctgcttataa aaccctgtgt tactactcct actcttggag atgataatat L38; tctatgtggt caaatatttg gactcattta ggacttagat gtgt tttt L44; ttaatttaac tctttttcac agccacgcta agggtaaaaa ggaataattt ccttctgtct L50; tccttttcaa gtatttctgg gtaagggatt caaaaaacta aaactgtttt tgtttgtaat L56; ataaaatatg gaattgatct ttccagggtc agagatgatt aatgtttttg ctatatactt L62; ttatacatta ttttcttatc aaactagtta attt ttatatgttt gtaagcagat L68: atgctttcat agcatacctt gtgtatatgt aaagataagt atttaattct cactgttcac L741 ttttaactga caaagaaaaa caagtggaaa ctacagaaac tgtggtagaa cttttacttg L80; ctggtctggt cttggttgta cccatctttg caca taactactca agaaaccttc L86; ccaatagagt acaacaggat gagactctga aatcactttc agtattccct gattgttatt tcaagtatta agtgtaagct tttaatggat aattagtata gcatctgatt ttgtttttaa ttctgtggat tgtgtttaag caattcaata gtatgttcct 2041 gattttgaga tgctaagtgg acag cttt atcaagtgtg tacaacagtc 2101 ccatgaagtt tatagagcat acccttgtat aggt gctagaatta aaattgatct 2161 acaa gaaaaaaaaa aaaaaa Protein seguence (variant 4): NCBI Reference Seguence: NP_001170786.1 LOCUS N P_001 170786 ACCESSION NP_001170786 l mreqymrtge gfllvfsvtd rgsfeeiykf qrqilrvkdr defpmilign kadldhqrqv 61 qlar qlkvtymeas akirmnvdqa virk fqeqecppsp eptrkekdkk 121 gchcvif TSP1 Official Symbol: THBS1 Official Name: thrombospondin 1 Gene ID: 7057 Organism: Homo sapiens Other Aliases: THBS, THBS-1, TSP, TSP-1, TSP1 Other Designations: thrombospondin-1; thrombospondin-1p180 tide seguence: NCBI nce Seguence: NM_003246.2 LOCUS NM_003246 ACCESSION NM_003246 1 agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 61 tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 121 ctcccggccg ccgcgctccg gtacacacag gatccctgct gggcaccaac agctccacca 181 tggggctggc ctggggacta ctgt tcctgatgca tgtgtgtggc accaaccgca 241 ttccagagtc tggcggagac aacagcgtgt ttgacatctt tgaactcacc ggggCCgCCC 301 gcaaggggtc tgggcgccga ctggtgaagg gccccgaccc ttccagccca gctttccgca 36; tcgaggatgc caacctgatc ccccctgtgc ctgatgacaa gttccaagac gatg 42; ctgtgcgggc agaaaagggt cttc tggcatccct gaggcagatg aagaagaccc 48; ggggcacgct cctg gagcggaaag accactctgg ccaggtcttc agcgtggtgt 54; ccaatggcaa ggcgggcacc ctggacctca gcctgaccgt ccaaggaaag cagcacgtgg 60; tgtctgtgga agaagctctc ctggcaaccg gccagtggaa gagcatcacc ctgtttgtgc 66; acag ggcccagctg tacatcgact gtgaaaagat ggagaatgct gagttggacg 72L tccccatcca aagcgtcttc accagagacc tggccagcat cgccagactc cgcatcgcaa 78L aggggggcgt caatgacaat ttccaggggg tgctgcagaa tgtgaggttt gtctttggaa 84L caga agacatcctc aggaacaaag gctgctccag ctctaccagt gtcctcctca 90; cccttgacaa caacgtggtg aatggttcca gccctgccat ccgcactaac tacattggcc 96; acaagacaaa ggacttgcaa gccatctgcg gcatctcctg gctg tccagcatgg L02; tcctggaact caggggcctg cgcaccattg tgaccacgct gcaggacagc tgactgaaga gaacaaagag ttggccaatg agctgaggcg gcctccccta caca L14; ttca gtacagaaat aacgaggaat ggactgttga tagctgcact gagtgtcact L20; gtcagaactc agttaccatc tgcaaaaagg tgtcctgccc gccc tgctccaatg L26; ccacagttcc tgatggagaa tgctgtcctc gctgttggcc cagcgactct gcggacgatg L32; gctggtctcc atggtccgag tggacctcct gttctacgag ctgtggcaat ggaattcagc L38; agcgcggccg ctcctgcgat agcctcaaca accgatgtga gggctcctcg gtccagacac L44; ggacctgcca cattcaggag tgtgacaaga gatttaaaca ggatggtggc tggagccact L50; ggtccccgtg gtcatcttgt tctgtgacat gtggtgatgg caca aggatccggc L56; tctgcaactc tcccagcccc cagatgaacg ggaaaccctg tgaaggcgaa gcgcgggaga L62; ccaaagcctg caagaaagac gcctgcccca gagg ctggggtcct ccat L68; gggacatctg ttctgtcacc tgtggaggag gggtacagaa acgtagtcgt ctctgcaaca L74; accccacacc ccagtttgga ggcaaggact gcgttggtga tgtaacagaa aaccagatct L80: agca ggactgtcca attgatggat gcctgtccaa tccctgcttt gccggcgtga L86; agtgtactag tgat ggcagctgga aatgtggtgc ttgtccccct ccagtgcaca gatgttgatg agtgcaaaga agtgcctgat agagcaccgg tgtgagaaca cggaccccgg ctacaactgc ctgccctgcc 204; ccccacgctt caccggctca cagcccttcg gtgt cgaacatgcc acggccaaca 210; tgtg caagccccgt aacccctgca cggatgggac ccacgactgc aacaagaacg 216; ccaagtgcaa ctacctgggc cactatagcg accccatgta ccgctgcgag tgcaagcctg 222; gctacgctgg caatggcatc atctgcgggg aggacacaga cctggatggc tggcccaatg 228; agaacctggt ggcc aatgcgactt accactgcaa aaaggataat tgccccaacc 234; ttcccaactc agggcaggaa gactatgaca aggatggaat tggtgatgcc gatg 240; acgatgacaa tgataaaatt ccagatgaca gggacaactg tccattccat tacaacccag 246; ctcagtatga caga gatgatgtgg gagaccgctg tgacaactgt ccctacaacc 252i acaacccaga tcaggcagac acagacaaca atggggaagg ctgt gaca 258; ttgatggaga cggtatcctc aatgaacggg acaactgcca gtacgtctac aatgtggacc 264; agagagacac tgatatggat ggggttggag atcagtgtga caattgcccc ttggaacaca 270; atccggatca gctggactct gacc gcattggaga tacctgtgac aacaatcagg 276; atattgatga agatggccac cagaacaatc actg tccctatgtg cccaatgcca 282; accaggctga ccatgacaaa gatggcaagg gagatgcctg tgaccacgat gatgacaacg 288; atggcattcc tgatgacaag gacaactgca gactcgtgcc caatcccgac cagaaggact 294; ctgacggcga tggtcgaggt gatgcctgca aagatgattt tgaccatgac agtgtgccag 300; acatcgatga catctgtcct gagaatgttg acatcagtga gaccgatttc cgccgattcc 306; agatgattcc tctggacccc aaagggacat cccaaaatga ccctaactgg gttgtacgcc 312; atcagggtaa agaactcgtc gtca actgtgatcc tggactcgct gtaggttatg 318; ttaa tgctgtggac ttcagtggca ccttcttcat cgaa agggacgatg 324; actatgctgg atttgtcttt ggctaccagt ccagcagccg cttttatgtt gtgatgtgga 330; tcac ccagtcctac tgggacacca accccacgag ggctcaggga tactcgggcc 336; tttctgtgaa agttgtaaac tccaccacag ggcctggcga gcacctgcgg aacgccctgt 342; ggcacacagg aaacacccct ggccaggtgc gcaccctgtg gcatgaccct cgtcacatag 348; gctggaaaga tttcaccgcc tacagatggc gtctcagcca caggccaaag acgggtttca 354; ttagagtggt gatgtatgaa aaaa tcatggctga ctcaggaccc atctatgata 360i aaacctatgc tggtggtaga ctagggttgt ttgtcttctc tcaagaaatg gtgttcttct 3661 ctgacctgaa atacgaatgt agagatccct caaa ttgttgattg aaagactgat 372; cataaaccaa tgctggtatt gcaccttctg gaactatggg cttgagaaaa cccccaggat 378; cacttctcct cctt cttttctgtg cttgcatcag tgtggactcc tagaacgtgc 384; gacctgcctc atgc agttttcaaa aacagactca agcc tccaatgaat 390; aagacatctt ccaagcatat aaacaattgc tttggtttcc ttttgaaaaa gcatctactt 396; gcttcagttg ggaaggtgcc cattccactc tgcctttgtc acagagcagg gtgctattgt 402; gaggccatct ctgagcagtg gactcaaaag cattttcagg catgtcagag aagggaggac 408; tcactagaat tagcaaacaa aaccaccctg acatcctcct tcaggaacac ggggagcaga 414; ggccaaagca ct aaggggag ggcgcatacc cgagacgatt agaa aatatggagg 420; aactgttaca tgttcggtac taagtcattt gatt gaaagactat tgctggattt 426; catgatgctg gtta gctgattaac ccatgtaaat aggcacttaa atagaagcag 432; gaga ctgg cttctggact tcctccctga tccccaccct tactcatcac 438; ctgcagtggc cagaattagg gaatcagaat caaaccagtg taaggcagtg ctggctgcca 444; ttgcctgqtc acattgaaat tggtggcttc attctagatg gtgc agatgtagca 450; ggaaaatagg aaaacctacc atctcagtga gcaccagctg cctcccaaag gaggggcagc 456; cgtgcttata tttttatggt tacaatggca caaaattatt atcaacctaa ctaaaacatt 462; ccttttctct tttttcctga attatcatgg agttttctaa ttctctcttt tggaatgtag 468; atttttttta aatgctttac aaat atttattttt tacttattct ggaagatctg 474; gctgaaggat tattcatgga acaggaagaa gcgtaaagac tatccatgtc atctttgttg 480; agagtcttcg tgactgtaag attgtaaata cagattattt attaactctg ttctgcctgg 486; aaatttaggc ttcatacgga aagtgtttga gagcaagtag ttta tcagcaaatc 492; tcttgcaaga acagcacaag gaaaatcagt ctaataagct gctctgcccc ttgtgctcag 498; agtggatgtt atgggattct ttttttctct gttttatctt ttcaagtgga tggt 504; tatccatttg caaatgtttt aaattgcaaa gaaagccatg aggtcttcaa tactgtttta 510; ccccatccct tgtgcatatt gaga gcat atacactttt ttctttcatt 516; tttccaaaag agaaaaaaat gacaaaaggt taca tacaaatatt acctcatttg 522; ttgtgtgact gagtaaagaa tttttggatc aagcggaaag agtttaagtg tctaacaaac 528; ttaaagctac tgtagtacct aaaaagtcag acat agcataaaaa ctctgcagag 534; aagtattccc aataaggaaa tagcattgaa atgttaaata caatttctga aagttatgtt 540; ttttttctat catctggtat accattgctt tatttttata aattattttc ccat 5461 agat atctcagatt gtgtagatat gctatttaaa taatttatca actg 552; cctgtagagt tagtatttct atttttatat aatgtttgca cactgaattg aagaattgtt 558; ttct tttttttgtt ttgttttttt tttttttttt ttttgctttt gacctcccat 564; ttttactatt tgccaatacc tttttctagg aatgtgcttt tttttgtaca catttttatc 570; cattttacat tctaaagcag ttgt atattactgt ttcttatgta caaggaacaa 5761 caataaatca tatggaaatt tatatttata aaaa aaaaaaaaaa aaaaaaaaaa Protein ce: NCBI Reference Seguence: NP_003237.2 LOCUS N P_003237 ACCESSION NP_003237 l mglawglgvl flmhvcgtnr ipesggdnsv fdifeltgaa rlvk gpdpsspafr 6; iedanlippv pddqudlvd avraekgfll lasqumkkt rgtllalerk dhsgqvfsvv 12; sngkagtldl sltvqgkqhv vsveeallat ngksitlfv qedraquid ae;d l8; ftrd lasiarlria nfqg vlqnvrfvfg ttpedilrnk gcssstsv;l 24; tldnnvvngs spairtnyig hktkdlqaic giscdelssm vlelrglrti vtthdsirk ; vteenkelan elrrpplcyh nquyrnnee wtvdsctech cqnsvtickk vscpimpcsn 36; atvpdgeccp rcwpsdsadd gwspwsewts cstscgngiq qrgrscdsln nrcegsqut 42; rtchiqecdk rfknggwsh csvt cgdgvitrir lcnspqumn gkpcegeare 48; tkackkdacp inggwgPWSP wdicsvtcgg qukrsrlcn nptquggkd cvgdvtenqi 54; cnkqdcpidg clsnpcfagv kctsypdgsw kcgacppgys tdvd eckevpdacf 60; nhngehrcen clpc pprftgsqpf gqgvehatan kqvckprnpc tdgthdcnkn 66; akcnylghys dpmyrceckp gyagngiicg edtdldgwpn enlvcvanat yhckkdncpn 72; lpnsgqedyd kdgigdacdd dddndkipdd rdncpfhynp aqydydrddv gdrcdncpyn 78; hnpdqadtdn ngegdacaad idgdgilner dncquynvd qrdtdmdgvg dchncpleh 84; npdqldsdsd rigdtcdnnq hqnn ldncpyvpna nqadhdkdgk gdacdhdddn 90; dgipddkdnc rlvpnpqud sdgdgrgdac kddfdhdsvp diddicpenv disetdfrrf 96; qmipldpkgt wvvr hqgkelvqtv ncdpglavgy defnavdfsg tffinterdd 102; dyagfvfqu sssrfyvvmw kqvtqsywdt nptraqusg lsvkvvnstt gpgehlrnal 108; whtgntpgqv rtlwhdprhi gwkdftayrw rlshrpktgf irvvmyegkk imadsgpiyd ll4; ktyaggrlgl mvff sdlkyecrdp EDIL3 Official Symbol: EDIL3 Official Name: EGF-Iike s and discoidin l—Iike domains 3 Gene ID: 10085 Organism: Homo sapiens Other Aliases: DEL1 Other Designations: EGF-like repeat and discoidin l-like domain-containing protein 3; developmental endothelial locus-1; pmentally-regulated elial cell locus 1 protein; integrin-binding protein DEL1 Nucleotide seguence: NCBI Reference Seguence: NM_005711.3 LOCUS NM_00571 1 ACCESSION NM_005711 1 agaagccccg cagccgccgc gcggagaaca gcgacagccg agcgcccggt ccgcctgtct 6; gccggtgggt ctgcctgccc gcgcagcaga CCngggng ccgcgggagc ccgcgccccg 12; cccgccgcgc ctctgccggg acccacccgc agcggagggc tgagcccgcc ggcggctccc l8; cggagctcac ccacctccgc gcgccggagc gcaggcaaaa ggggaggaaa ggctcctctc 24; tttagtcacc actctcgccc tctccaagaa taac aaagcgctga agaa ; cgtcttcttg ttag taggggcgga gtctgctgct gccctgcgct gccacctcgg 36; ctacactgcc ctccgcgacg acccctgacc agccggggtc acgtccggga atca 42; tgaagcgctc ggtagccgtc tggctcttgg tcgggctcag cctcggtgtc ttcg 48; gcaaaggtga tatttgtgat cccaatccat gtgaaaatgg aggtatctgt ttgccaggat 54; tggctgatgg ttccttttcc tgtc cagatggctt cacagacccc aactgttcta 60; tgga ggttgcatca gatgaagaag aaccaacttc agcaggtccc tgcactccta 66; gcca taatggagga acctgtgaaa taagtgaagc ataccgaggg gatacattca 72; taggctatgt atgt ccccgaggat ttaatgggat tcactgtcag ataa 78; atgaatgcga agttgagcct tgcaaaaatg gtggaatatg tacagatctt gttgctaact 84; attcctgtga gtgcccaggc gaatttatgg gaagaaattg caaa tgctcaggcc 90; cactgggaat tgaaggtgga attatatcaa accagcaaat cacagcttcc tctactcacc 96L tttt tggactccaa aaatggtatc cctactatgc acgtcttaat aagaaggggc 102; ttataaatgc gtggacagct gcagaaaatg acagatggcc gtggattcag ataaatttgc 108; aaaggaaaat gagagttact ggtgtgatta cccaaggagc caagaggatt ggaagcccag 114; agtatataaa atcctacaaa attgcctaca gtaatgatgg aaagacttgg gcaatgtaca L20; aagtgaaagg caccaatgaa gacatggtgt ttcgtggaaa cattgataac aacactccat L26; atgctaactc tttcacaccc cccataaaag ctcagtatgt aagactctat ccccaagttt L32; gtcgaagaca ttgcactttg cgaatggaac ttcttggctg tgaactgtcg ggttgttctg L38; agcctctggg tatgaaatca ggacatatac aagactatca gatcactgcc tccagcatct L44; tcagaacgct ggac atgttcactt gggaaccaag gaaagctcgg ctggacaagc L50; aaggcaaagt gaatgcctgg ggcc acaatgacca gtcacaatgg atcttcttgt tccaaccaaa gtgactggca tcattacaca taaa gattttggtc L62; atgtacagtt tgttggctcc tacaaactgg cttacagcaa tgatggagaa L68; tataccagga tgaaaagcaa gata aggttttcca gggaaatttt gacaatgaca L74; ctcacagaaa aaatgtcatc gaccctccca tctatgcacg acacataaga atccttcctt L80; ggtcctggta nggaggatc acattgcggt cagagctgct gggctgcaca gaggaggaat L86; gaggggaggc tacatttcac aaccctcttc cctatttccc tatc gaactgtgca aaatctgtag gaaactgaat ggtttttttt tttttttcat gaaaaagtgc L98; tcaaattatg gtaggcaact aacggtgttt ttaagggggt ctgc cttttcaatg 204; atttaatttg attttatttt atccgtcaaa tctcttaagt acat taagtgtgaa 210; ttacttttct ctcattgttt cctgaattat tcgcattggt agaaatatat tagggaaaga 216; aagtagcctt ctttttatag caagagtaaa aaagtctcaa agtcatcaaa caag 222; agttgataga gcttttacaa ctca cctaattctg ataaaaggaa tactgcaatg 228; ttagcaataa gtttttttct tctgtaatga ctctacgtta tcctgtttcc ctac 234; caaacactgt caatgtttat tacaaaattt taaagaagaa tatgtaacat gcagtactga 240; tattataatt ttac tttcattatt tctaataaga gattatgtga cttctttttc 246; ttttagttct attctacatt cttaatattg tatattacct gaataattca atttttttct 252; aattgaattt cctattagtt gactaaaaga agtgtcatgt ttactcatat atgtagaaca 258; tgactgccta tcagtagatt gatctgtatt taatattcgt taattaaatc tgcagtttta 264; tttttgaagg aagccataac tatttaattt ccaaataatt gcttcataaa cata 2701 ctctcagttt gcacaaaaga aata tatatgtctc tttaaattta aatcttcatt 276; tagatggtaa ttacatatcc ttatatttac tttaaaaaat cggcttattt gtttatttta 282; taaaaaattt agcaaagaaa tattaatata gtgctgcata gtttggccaa gcatactcat 288; tttg ttcagctcca catttcctgt gaaactaaca tgag aact 2941 ggtggtagtt tcccaggaag gcacaggtgg agtt Protein seguence: NCBI Reference Seguence: NP_005702.3 LOCUS N P_005702 ACCESSION NP_005702 1 mkrsvavwll vglslgquf gkgdicdpnp cenggiclpg ladgsfscec pdgftdpncs 6; sdee eptsagpctp npchnggtce gdtf igyvckcprg fngihcqhni 12; necevepckn ggictdlvan yscecpgefm grncqykcsg plgieggiis nqqitassth 18; ralfglqkwy pyyarlnkkg linawtaaen drwpwiqinl gvi igsp 24; eyiksykiay sndgktwamy kvkgtnedmv frgnidnntp yansftppik aqurlypqv ; crrhctlrme llgcelsgcs eplgmksghi qdyqitassi frtlnmdmft weprkarldk 36; qgkvnawtsg wlqv dllvptkvtg iitqgakdfg thfvgsykl aysndgehwt 42; qudequkd dnd thrknvidpp iyarhirilp wswygritlr sellgcteee HMOX1 Official Symbol: HMOX1 Official Name: heme oxygenase (decycling) 1 Gene ID: 3162 Organism: Homo sapiens Other Aliases: CTA-28GB10.6, HO-1, HSP32, 10 Other Designations: heat shock protein, 32-kD; heme oxygenase 1 Nucleotide seguence: NCBI Reference Seguence: NM_002133.2 LOCUS NM_002133 ACCESSION NM_002133 1 gacc ggccgcggct gtca acgcctgcct cctctcgagc gtcctcagcg 61 cagccgccgc ccgcggagcc agcacgaacg agcccagcac cggccggatg gagcgtccgc 121 acag catgccccag gatttgtcag aggccctgaa ggaggccacc aaggaggtgc 181 acacccaggc agagaatgct gagttcatga ggaactttca gaagggccag gtgacccgag 241 acggcttcaa gctggtgatg gcctccctgt accacatcta cctg gaggaggaga 301 ttgagcgcaa caaggagagc ccagtcttcg cccctgtcta cttcccagaa gagctgcacc 36; gcaaggctgc cctggagcag gacctggcct tctggtacgg gccccgctgg gtca 42; tcccctacac accagccatg cagcgctatg tgaagcggct ccacgaggtg gggcgcacag 48; agcccgagct gctggtggcc cacgcctaca cccgctacct gggtgacctg tctgggggcc 54; aggtgctcaa aaagattgcc cagaaagccc tggacctgcc cagctctggc gagggcctgg 60; ccttcttcac cttccccaac attgccagtg ccaccaagtt caagcagctc tccc 66; gcatgaactc cctggagatg gcag tcaggcagag ggtgatagaa gaggccaaga 721 tcct gctcaacatc cagctctttg aggagttgca ggagctgctg acccatgaca 78L ccaaggacca ctca cgggcaccag ggcttcgcca gcgggccagc aacaaagtgc 84L aagattctgc ccccgtggag actcccagag ggaagccccc actcaacacc cagg 90; ctccgcttct ggtc cttacactca gctttctggt ggcgacagtt gggc 96; tttatgccat gtgaatgcag gcatgctggc tcccagggcc atgaactttg ggaa 102; ggccttcttt ctagagaggg aattctcttg gctggcttcc tggg tttcagggcc tccagccctc tcactgtgtc cctctctctg gaaaggagga aggagcctat 114; ggcatcttcc ccaacgaaaa gcacatccag gcaatggcct aaacttcaga gggggcgaag 120; ggatcagccc tgcccttcag cagt tcctgcagca gagcctggaa gacaccctaa 126; tgtggcagct gtctcaaacc tccaaaagcc ctgagtttca agtatccttg ttgacacggc 132; catgaccact ttccccgtgg gccatggcaa tttttacaca aacctgaaaa gatgttgtgt 138; cttgtgtttt tgtcttattt ttgttggagc cactctgttc ctggctcagc ctcaaatgca 144; gtatttttgt tgtgttctgt tgtttttata gcagggttgg ggtggttttt tgcg 150; tgggtgggga gggaggtgtt taacggcact gtggccttgg tctaactttt gtgtgaaata 1561 ataaacaaca ttgtctgata gtagcttgaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_002124.1 LOCUS N P_002124 ACCESSION N P_002124 l merpqusmp qdlsealkea tkevhtqaen qug fklv maslyhiyva 61 leeeiernke spvfapvyfp eelhrkaale ygpr wqevipytpa mqryvkrlhe 121 vgrtepellv ahaytrylgd lsggqvlkki aqkaldlpss geglafftfp niasatkfkq 181 lyrsrmnsle mtpavrqrvi eeaktaflln iqlfeelqel 1thdtkdqsp srapgqura 241 snkqusapv etprgkppln trsqapllrw Vltlsflvat vavglyam NUCB1 OfibblSwnbd:NUCB1 CMfidalName:nudeomndmi Gene ID: 4924 Organism: Homo sapiens Other Aliases: , NUC Other Designations: nucleobindin-1 tide seguence: NCBI Reference ce: NM_006184.5 LOCUS NM_0061 84 ACCESSION NM_006184 l gcggaagtta tttttccccc ggccggcagg gagttgtagt tatctttgaa agccttctct 6; ctcttttggc ataggcggga aatgtggcgt caagaggctg ggattccgag aaagaagcga 12; ggctttcacg cgag ngcttcggc cggg ggac gtggatgaaa 18; gctacagagc cagcgtggac caatcagacc tctttggggc ggggcctctg tggataagtg 24; ggcgtggtct aggggggaag tcaccaagaa ggag gtccttgccc gccctggaaa ; acgccctctg cggtgaagga gagaccacac tgccatgcct ccctctgggc cccgaggaac 36; cctccttctg ttgccgctgc tgctgctgct cctgcttcgc gccgtgctgg ctgtccccct 42; ggagcgaggg gcgcccaaca aggaggagac ccctgcgact gagagtcccg acacaggcct 48; ccac cggtacctcc aggaggtcat cgatgtactg gagacggatg ggcatttccg 54; agagaagctg caggctgcca agga catcaagagc gggaagctga gccgagagct 60; ggactttgtc agccaccacg tccgcaccaa tgag ctcaagcgac aggaggtgtc 66L acggctgcgg atgctgctca aggccaagat ggacgccgag caggatccca atgtacaggt 72; ggatcatctg aatctcctga aacagtttga acacctggac cctcagaacc agcatacatt 78L cgaggcccgc gacctggagc tgctgatcca gacggccacc cgggaccttg cccagtacga 84; cgcagcccat gagt tcaagcgcta cgagatgctt aaggaacacg agagacggcg 90; ttatctggag tcactgggag aggagcagag aaaggaggcg gagaggaagc tggaagagca 96; acagcgccgg caccgcgagc accctaaagt caacgtgcct ggcagccaag cccagttgaa L02; ggaggtgtgg gaggagctgg tgga ccccaacagg tttaacccca agaccttctt L08; gcat gatatcaaca gtgatggtgt cctggatgag caggagctgg aggcactctt L14; caccaaggag ctggagaaag tgtacgaccc tgag gaca tgcgggagat L20; ggaggaggag cgactgcgca tgcgggagca tgtgatgaag gaca ccaaccagga L26; ccgcctcgtg accctggagg agttcctcgc atccactcag aggaaggagt ttggggacac L32; gggc tgggagacag tggagatgca ccctgcctac accgaggaag agag gagctggctg cccgggaggc agagctgaat gccaaggccc agcgcctcag L44; ccaggagaca gaggctctag ggcggtccca gggccgcctg gaggcccaga agagagagct L50; gcagcaggct gtgctgcaca tggagcagcg gaagcagcag cagc agcaaggcca 1561 caaggccccg gctgcccacc ctgaggggca gttc cacccagaca cagacgatgt L62; acctgtccca gctccagccg gtgaccagaa ggaggtggac acttcagaaa agaaacttct L68; cgagcggctc cctgaggttg aggtgcccca gcatctgtga tcctccggga ccct L74; caggattcct gatgctccaa ggcgactgat gggcgctgga tgaagtggca cagtcagctt L80; gggc tggtgtcatg ttgggctcct ggggcggggg cacggcctgg catttcacgc L86; attgctgcca ccccaggtcc acctgtctcc actttcacag cctccaagtc tgtggctctt L92; cccttctgtc ctccgagggg cttgccttct ctcgtgtcca gtgaggtgct cagtgatcgg L98; cttaacttag agaagcccgc cccctcccct tctccgtctg tcccaagagg gtctgctctg 204; cgtt cctaggtggc tcggcctcag ctgcctgggt tgtggccgcc ctagcatcct 210; gtatgcccac agctactgga atccccgctg ctgctccggg ccaagcttct ggttgattaa 216; tgagggcatg gggtggtccc tcaagacctt cctt aacc agtgatgcct 222; caaagacagt gtcccctcca cagctgggtg ccaggggcag gggatcctca gtatagccgg 228; tgaaccctga taccaggagc ctgggcctcc ctgaacccct ggcttccagc atcg 234; ccagcctcct cctggacctc ttggccccca gccccttccc cacacagccc cagaagggtc 240; ccagagctga ccccactcca ggacctaggc ccagcccctc agcctcatct ggagcccctg 246; aagaccagtc ccacccacct ttctggcctc atctgacact gctccgcatc ctgctgtgtg 252; tcctgttcca tgttccggtt ccatccaaat acactttctg gaacaaatgc atggctccaa 2581 aaaaa Protein seguence: NCBI Reference Seguence: NP_006175.2 LOCUS N P_0061 75 ACCESSION N P_006175 1 mppsgprgtl lllpllllll lravlavple rgapnkeetp atespdtgly yhrquevid 61 vletdghfre quaanaedi ksgklsreld fvshhvrtkl delquevsr 1rm11kakmd 121 aeqdpnqud h1nllkqfeh ldpqnqhtfe ardlelliqt atrdlaqyda ahheefkrye 181 mlkeherrry Leslgeequ eaerkleeqq rrhrehpkvn vpgsqaqlke vweeldgldp 241 tffi Lhdinsdgvl deqelealft kelekvydpk needdmreme eerlrmrehv 301 nqdr 1vtleeflas tquefgdtg egwetvemhp ayteeelrrf reae 361 1nakaqusq etealgrsqg rleaqkrelq qavlhmequ qghk apaahpegql 421 kfhpdtddvp vpapagque vdtsekklle rlpevequh 1 C3010 Official Symbol: C190rf10 al Name: chromosome 19 open reading frame 10 Gene ID: 56005 Organism: Homo sapiens Other Aliases: EUROIMAGE1875335, lL25, lL27, lL27w, R33729_1, SF20 Other Designations: UPF0556 protein C190rf10; interleukin 25; interleukin 27 g designation; interleukin-25; stromal cell-derived growth factor SF20 Nucleotide seguence: NCBI Reference Seguence: NM_019107.3 LOCUS NM_019107 ACCESSION NM_019107 1 ggcggacgct ccacgtgtcc ctcgccgcgc cccgtctacc cgcccctgcc ctgaggaccc 61 tagtccaaca tggcggcgcc aggg tggaacggcg tcggcgcgag cttgtgggcc 121 gcgctgctcc taggggccgt gagg ccggcggagg cggtgtccga gcccacgacg 181 gtggcgtttg ggcc cggcggcgtc tcct tctcccataa cgtgggcccg 241 ggggacaaat atacgtgtat gttcacttac gcctctcaag gagggaccaa tgagcaatgg 301 cagatgagtc tggggaccag cgaagaccac cagcacttca cctgcaccat ctggaggccc 361 caggggaagt cctatctgta cttcacacag ttcaaggcag aggtgcgggg cgctgagatt 421 gcca actc taaagccgca tttgaaaggg aaagtgatgt ccctctgaaa 481 gaat ttgaagtgac caaaacagca gtggctcaca ggcccggggc attcaaagct 541 gagctgtcca agctggtgat tgtggccaag gcatcgcgca ctgagctgtg accagcagcc 60; ctgttgcggg cttc tcatctccgg tgaagctgaa ggggcctgtg tccctgaaag 661 ggccagcaca tcactggttt tctaggaggg actcttaagt tttctacctg cgtt 721 gccttgtccg gaggggcttg cagggtggct gaagccctgg ggcagagaac agagggtcca 78- gggccctcct ggctcccaac agcttctcag ttcccacttc ctgctgagct cttctggact 84L caggatcgca qatccnggc acaaagaggg tggggaacat gggggctatg ctggggaaag 90; cagccatgct ccccccgacc tccagccgag catccttcat gagcctgcag aactgctttc 96; ctatgtttac ccaggggacc tcctttcaga ggga agagatgaaa tgttttttca 1021 tatttaaata aataagaaca ttaaaaagca aaaaaaaaaa aaaaaaa n seguence: NCBI Reference Seguence: NP_061980.1 LOCUS NP_061980 ACCESSION NP_061980 1 maapsggwng vgaslwaall lgavalrpae avsepttvaf dvrpggvvhs fshnvgpgdk 61 ytcmftyasq ggtneqwqms lgtsedhth tctiwrpqgk sylyftqfka ieya 121 mayskaafer esdvplktee fevtktavah rpgafkaels klvivakasr tel PL|N2 OfibblSwnbd:PUN2 Official Name: perilipin 2 gfiflgifl2:123 Organism: Homo sapiens Other Aliases: RP11-151J10.1, ADFP, ADRP Other Designations: adipophilin; adipose differentiation-related protein; perilipin- Nucleotide ce: NCBI Reference Seguence: NM_001122.3 LOCUS NM_001122 ION NM_001122 l ccgagggtga cactcgggct tgggacaggg gccg cgggtcacgt gctgcggagg 6; cttggggagg ggcggcgagg cggggtttat agcccgggcg cccgcgggcc ccacgctttg 12; accgggtcgt ggcagccgga gtcgtcttcg ggacgcgcct gctcttcgcc tgca 18; gtccgtcgat ttctttctcc aggaagaaaa atggcatccg ttgcagttga tccacaaccg 24; agtgtggtga ctcgggtggt caacctgccc ttggtgagct ccacgtatga gtcc 30L tcagcctatc tcagtacaaa ggaccagtat ccctacctga agtctgtgtg ggca 36L gagaacggtg tgaagaccat cacctccgtg gccatgacca gtgctctgcc catcatccag 42; aagctagagc cgcaaattgc agttgccaat acctatgcct gtaaggggct agacaggatt 48L gaggagagac tgcctattct gaatcagcca tcaactcaga ttgttgccaa aggc 54; gctgtgactg gggcaaaaga tgctgtgacg actactgtga ctggggccaa ggattctgtg 60; gccagcacga tcacaggggt gatggacaag accaaagggg cagtgactgg cagtgtggag 66; aagaccaagt ctgtggtcag tggcagcatt aacacagtct tggggagtcg gatgatgcag 72; ctcgtgagca gtggcgtaga aaatgcactc accaaatcag agctgttggt agaacagtac 78; ctccctctca ctgaggaaga actagaaaaa gaagcaaaaa aagg atttgatctg 84; gttcagaagc atta tgttagactg ggatccctgt ctaccaagct tcactcccgt 90; cagc aggctctcag cagggttaaa gaagctaagc aaaaaagcca acagaccatt 96; tctcagctcc attctactgt tcacctgatt gaatttgcca ggaagaatgt gtatagtgcc L02; aatcagaaaa ttcaggatgc tcaggataag ctctacctct catgggtaga gtggaaaagg L08; agcattggat atgatgatac tgatgagtcc cactgtgctg agcacattga gtcacgtact L14; cttgcaattg cccgcaacct gactcagcag ctccagacca cgtgccacac cctcctgtcc L20; aacatccaag gtgtaccaca gaacatccaa gatcaagcca agcacatggg ggtgatggca L26; ggcgacatct actcagtgtt ccgcaatgct gcctccttta aagaagtgtc tgacagcctc L32; ctcacttcta gcaaggggca gctgcagaaa atgaaggaat ctttagatga cgtgatggat L38; tatcttgtta acaacacgcc ctgg ctggtaggtc ccttttatcc tcagctgact L441 gagtctcaga atgctcagga ccaaggtgca gaca agagcagcca ggagacccag 150; cgatctgagc ctca ttaaacctgc ccctatcact agtgcatgct agatgacacc ttttgttatg ttaa cttgctaggc aaat agta taaaggccct caattgtagt tgtttccagc aaga ttctggcatt agcagatgat ttctgttcac ctggtaagaa aagaatgata ggcttgtcag 1741 agcctatagc cagaactcag ttca aatgcactta tgttctcatt ctatggccat 1801 tgtgttgcct ctgttactgt ttgtattgaa taaaaacatc ttcatgtggg ctggggtaga 1861 aactggtgtc tgctctggtg tgatctgaaa aggcgtcttc actgctttat ctcatgatgc 1921 ttgcttgtaa aacttgattt tagtttttca tttctcaaat aggaatacta cctttgaatt 1981 caataaaatt cactgcagga tagaccagtt aaaaaaaaaa aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_001113.2 LOCUS NP_001113 ACCESSION NP_001113.2 1 masvavdpqp svvtrvvnlp lvsstydlms saylstkdqy pylksvcema engvktitsv 61 amtsalpiiq klepqiavan tyackgldri eerlpilnqp nakg avtgakdavt 121 ttvtgakdsv astitgvmdk tkgavtgsve ktksvvsgsi ntvlgsrmmq 1vssgvena1 181 veqy lplteeelek eakkvegfdl yvrl gslstklhsr ayqqalsrvk 241 eaqusqqti sqlhstvhli efarknvysa nqkiqdaqdk 1ylswvewkr sigyddtdes 301 hcaehiesrt laiarnltqq lqttchtlls ningPqniq dqakhmgvma gdiysvfrna 361 asfkevsdsl 1tsskgqlqk dvmd ylvnntplnw 1vgpfypq1t esqnaqdqga 421 emdkssqetq rsehkth ATP5A Official Symbol: ATP5A1 Official Name: ATP synthase, H+ transporting, ondrial F1 complex, alpha subunit 1, cardiac muscle Gene ID: 498 Organism: Homo sapiens Other Aliases: ATP5A, ATP5AL2, ATPM, MOM2, OMR, ORM, hATP1 Other Designations: ATP synthase alpha chain, mitochondrial; ATP synthase subunit alpha, mitochondrial; ATP se, H+ transporting, mitochondrial F1 x, alpha subunit, isoform 1, cardiac muscle; ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit, isoform 2, rdiac muscle-like 2; ATP sythase Pase) alpha t; mitochondrial ATP synthetase, oligomycin-resistant Nucleotide seguence (variant 1): NCBI Reference ce: NM_001001937.1 LOCUS NM_001001937 ACCESSION NM_001001937 l tctggcattg caagcctcgc ttcgttgcca cttcccagct gcct tccgcggtat 6; aatcaacact acgagagata gagccgccta gaaccagtcc ggaggctgcg gctgcagaag 121 taccgcctgc ggagtaactg caaagatgct gtccgtgcgc gttgctgcgg ccgtggtccg l8; cgcccttcct cggcgggccg gactggtctc cagaaatgct ttgggttcat ctttcattgc 24L tgcaaggaac ttccatgcct ctaacactca tcttcaaaag actg ctgagatgtc ; ctctattctt gaagagcgta ttcttggagc tgatacctct gttgatcttg ctgg 36; gcgtgtctta agtattggtg atggtattgc acat gggctgagga atgttcaagc 42; agaagaaatg gtagagtttt cttcaggctt aaagggtatg tccttgaact tggaacctga 48; caatgttggt gttgtcgtgt ttggaaatga taaactaatt aaggaaggag atatagtgaa 54; gaggacagga gccattgtgg acgttccagt tggtgaggag ctgttgggtc gtgtagttga 60; tgcccttggt aatgctattg atggaaaggg tccaattggt tccaagacgc gtaggcgagt 66; tggtctgaaa gcccccggta tcattcctcg aatttcagtg cgggaaccaa tgcagactgg 72; cattaaggct gtggatagct tggtgccaat tggtcgtggt cagcgtgaac tgattattgg 78; tgaccgacag actgggaaaa cctcaattgc tattgacaca aacc agaaacgttt 84; caatgatgga tctgatgaaa agaagaagct gtactgtatt tatgttgcta aaaa 90; cact gttgcccagt tggtgaagag acttacagat gcagatgcca tgaagtacac 96; cattgtggtg tcggctacgg cctcggatgc tgccccactt ctgg ctccttactc L02; ttcc atgggagagt attttagaga caatggcaaa catgctttga tcatctatga L08; cgacttatcc aaacaggctg ttgcttaccg tcagatgtct ctgttgctcc gccgaccccc L14; tggtcgtgag gcctatcctg gtgatgtgtt acac ttgc tggagagagc L20; agccaaaatg aacgatgctt ttggtggtgg ctccttgact gctttgccag gatgtgtctg cttacattcc aacaaatgtc atttccatca gaat tgttctacaa aggtatccgc cctgcaatta cgtgtcggat ccgctgccca aaccagggct atgaagcagg catgaagctg gaattggctc agtatcgtga ggttgctgct tttgcccagt tcggttctga L50; cctcgatgct gccactcaac aacttttgag tcgtggcgtg cgtctaactg agttgctgaa 156; gcaaggacag tattctccca tggctattga agaacaagtg gctgttatct atgcgggtgt 162; aaggggatat cttgataaac tggagcccag caagattaca aagtttgaga tctt 168; gtctcatgtc gtcagccagc accaagcctt gttgggcact atcagggctg atggaaagat 174; ctcagaacaa tcagatgcaa agctgaaaga gattgtaaca aatttcttgg ctggatttga 180; agcttaaact cctgtggatt cacatcaaat accagttcag ttttgtcatt gttctagtaa 186; attagttcca tttgtaaaag tctc atactcctta tgtacagaaa tcacatgaaa L921 aataaaggtt ccataatgca tagttaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_001001937.1 LOCUS N P_001001937 ACCESSION N P_001001937 1 mlsvrvaaav ragl vsrnalgssf iaarnfhasn thlqktgtae mssileeril 6; gadtsvdlee tgrvlsigdg iarvhglrnv qaeemvefss glkgmslnle pdnvgvvvfg 121 ndklikegdi Vkrtgaivdv pvgeellgrv vdalgnaidg kgpigsktrr rvglkapgii 181 prisvrepmq tgikavdslv pigrgqreli igdrqtgkts iaidtiinqk rfndgsdekk 241 vaig qkrstvaqlv krltdadamk ytivvsatas daaplqylap geyf 301 rdngkhalii yddlskqava yrqmslllrr ppgreaypgd vfylhsrlle raakmndazg 36; ggsltalpvi etqagdvsay iptnvisitd gqifletelf ykgirpainv glsvsrvgsa 42; kqva gtmklelaqy revaafaqu sdldaatqql ltel 48; viya gvrgyldkle pskitkfena flshvvsqhq allgtiradg kiseqsdakl 54; keivtnflag fea Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_004046.5 LOCUS NM_004046 ACCESSION NM_004046 l ggggcagtac gtca ggtgggccgg ctgtcttgac cttctttgcg gctcggccat 61 tttgtcccag cgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 121 agatgctgtc Cgtgcgcgtt gccg tggtccgcgc ccttcctcgg cgggccggac 181 tggtctccag aaatgctttg tctt tcattgctgc aaggaacttc catgcctcta 24; acactcatct tcaaaagact gggactgctg agatgtcctc tattcttgaa gagcgtattc ; ttggagctga tacctctgtt gatcttgaag aaactgggcg tgtcttaagt attggtgatg 36; gtattgcccg cgtacatggg ctgaggaatg ttcaagcaga agaaatggta gagttttctt 42; caggcttaaa gggtatgtcc ttgaacttgg aacctgacaa tgttggtgtt gtcgtgtttg 48; gaaatgataa actaattaag gaaggagata agag gacaggagcc attgtggacg 54; ttccagttgg tgaggagctg ttgggtcgtg tagttgatgc ccttggtaat gctattgatg 60L gaaagggtcc aattggttcc aagacgcgta ttgg tctgaaagcc cccggtatca 66L ttcctcgaat ttcagtgcgg gaaccaatgc agactggcat tgtg gatagcttgg 72L tgccaattgg tcgtggtcag cgtgaactga ttattggtga gact acct 78L ctat tgacacaatc attaaccaga tcaa tgatggatct gatgaaaaga 84; agaagctgta ctgtatttat gttgctattg agag atccactgtt gcccagttgg 90; tgaagagact tacagatgca gatgccatga agtacaccat tgtggtgtcg gctacggcct 96; cggatgctgc cccacttcag tacctggctc cttactctgg ctgttccatg ggagagtatt L02; ttagagacaa tggcaaacat gctttgatca tctatgacga cttatccaaa caggctgttg L08; cttaccgtca gatgtctctg ttgctccgcc gaccccctgg tcgtgaggcc ggtg L14; atgtgttcta cctacactcc ngttgctgg agagagcagc caaaatgaac gatgcttttg L20; gtggtggctc cttgactgct ttgccagtca tagaaacaca ggctggtgat gtgtctgctt L26; acattccaac aaatgtcatt tccatcactg acggacagat cttcttggaa acagaattgt L32; tctacaaagg tatccgccct gcaattaacg ttggtctgtc tgtatctcgt gtcggatccg L38; ctgcccaaac cagggctatg aagcaggtag caggtaccat gaagctggaa ttggctcagt L44; atcgtgaggt tgctgctttt ttcg gttctgacct cgatgctgcc actcaacaac L50; ttttgagtcg tggcgtgcgt ctaactgagt tgctgaagca aggacagtat tctcccatgg L56; ctattgaaga acaagtggct gttatctatg cgggtgtaag tctt ctgg L62; agcccagcaa gattacaaag tttgagaatg ctttcttgtc tcatgtcgtc agccagcacc L68; aagccttgtt gggcactatc agggctgatg tctc atca gatgcaaagc L741 tgaaagagat tgtaacaaat ttcttggctg gatttgaagc tcct gtggattcac L80; tacc agttcagttt tgtcattgtt ctagtaaatt agttccattt tactctcata ctccttatgt acagaaatca catgaaaaat aaaggttcca taatgcatag 1921 ttaaaaa Protein seguence (variant 2): NCBI Reference Seguence: NP_004037.1 LOCUS N P_004037 ACCESSION NP_004037 l mlsvrvaaav vralprragl vsrnalgssf iaarnfhasn thlqktgtae mssileeril 6; gadtsvdlee tgrvlsigdg iarvhglrnv qaeemvefss glkgmslnle pdnvgvvvfg 12; ndklikegdi vkrtgaivdv pvgeellgrv vdalgnaidg kgpigsktrr pgii l8; epmq dslv pigrgqreli igdrqtgkts iaidtiinqk rfndgsdekk 24; klyciyvaig qkrstvaqlv krltdadamk ytivvsatas ylap ysgcsmgeyf ; rdngkhalii yddlskqava yrqmslllrr ypgd vfylhsrlle raakmndafg 36; ggsltalpvi etqagdvsay iptnvisitd telf ykgirpainv glsvsrvgsa 42; aqtramkqva gtmklelaqy revaafaqu sdldaatqql lsrgvrltel lkngyspma 48; ieeqvaviya dkle pskitkfena flshvvsqhq allgtiradg kiseqsdakl 54; keivtnflag fea Nucleotide seguence (variant 3): NCBI Reference Seguence: NM_001257334.1 LOCUS NM_001257334 ACCESSION NM_001257334 l ggggcagtac gtca ggtgggccgg ctgtcttgac cttctttgcg gctcggccat 6; tttgtcccag cgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 12; agatgctgtc cgtgcgcgtt gccg tggtccgcgc ccttcctcgg cgggccggac 18; tggtctccag aaatgctttg ggttcatctt tcattgctgc aaggaacttc catgcctcta 24; acactcatct tcaaaagact gggactgctg agatgtcctc tgaa gagcgtattc ; ttggagctga tacctctgtt gatcttgaag aaactgggcg tgtcttaagt attggtgatg 36; gtattgcccg cgtacatggg ctgaggaatg ttcaagcaga agaaatggta gagttttctt 42; caggcttaaa gggtatgtcc ttgaacttgg acaa tgttggtgtt gtcgtgtttg 48; gaaatgataa actaattaag gaaggagata tagtgaagag gacaggagcc attgtggacg 54; gtccaattgg ttccaagacg cgtaggcgag tgaa agcccccggt atcattcctc 60; gaatttcagt gcgggaacca atgcagactg gcattaaggc tgtggatagc ttggtgccaa 66; gtgg tcagcgtgaa ctgattattg gtgaccgaca gactgggaaa acctcaattg 72; ctattgacac taac cagaaacgtt tcaatgatgg atctgatgaa aagaagaagc 78; tgtactgtat ttatgttgct attggtcaaa agagatccac tgttgcccag ttggtgaaga 84; gacttacaga tgcagatgcc atgaagtaca ccattgtggt gtcggctacg gcctcggatg 90; ctgccccact tcagtacctg gctccttact ctggctgttc catgggagag tattttagag 96; gcaa acatgctttg atcatctatg acgacttatc caaacaggct gttgcttacc 102; gtcagatgtc tctgttgctc cgccgacccc ctggtcgtga ggcctatcct ggtgatgtgt 1081 tctacctaca ctcccggttg ctggagagag cagccaaaat gaacgatgct tgctttgcca gtcatagaaa cacaggctgg tgatgtgtct caacaaatgt catttccatc actgacggac agatcttctt ggaaacagaa aaggtatccg ccctgcaatt aacgttggtc tgtctgtatc tcgtgtcgga tccgctgccc 132; aaaccagggc tatgaagcag gtagcaggta ccatgaagct ggct cagtatcgtg 138; aggttgctgc ttttgcccag ttcggttctg acctcgatgc tgccactcaa caacttttga 144; gtcgtggcgt gcgtctaact gagttgctga agcaaggaca tccc atggctattg 150; aagaacaagt ggctgttatc tatgcgggtg taaggggata tcttgataaa ctggagccca 156; gcaagattac tgag aatgctttct tgtctcatgt cgtcagccag gcct 162; tgttgggcac tatcagggct gatggaaaga tctcagaaca atcagatgca aagctgaaag 168; agattgtaac cttg gctggatttg aaac tcctgtggat tcacatcaaa 174; taccagttca tcat tgttctagta aattagttcc atttgtaaaa gggttactct 180; catactcctt atgtacagaa atcacatgaa aaataaaggt tccataatgc atagttaaaa 186; a Protein seguence (variant 3): NCBI Reference Seguence: NP_001244263.1 LOCUS N 44263 ACCESSION N P_001244263 1 mlsvrvaaav vralprragl vsrnalgssf iaarnfhasn thlqktgtae eril 61 gadtsvdlee tgrvlsigdg iarvhglrnv qaeemvefss glkgmslnle pdnvgvvvfg 121 ndklikegdi ivdg pigsktrrrv glkapgiipr isvrepmqtg ikavdslvpi 181 grgqreliig drqtgktsia idtiinqkrf ndgsdekkkl yciyvaiqu rstvaqlvkr 241 ltdadamkyt ivvsatasda apys gcsmgeyfrd ngkhaliiyd dlskqavayr 301 qmslllrrpp greaypgdvf ylhsrllera akmndafggg sltalpviet qagdvsayip 361 tnvisitdgq ifletelfyk girpainvgl svsrvgsaaq tramkqvagt mklelaqyre 421 vaafaqusd 1daatqqlls rgvrltellk ngyspmaie eqvaviyagv rgyldkleps 481 kitkfenafl shvvsqhqal lgtiradgki seqsdaklke ivtnflagfe Nudeofideseguence(vafiant4y NCBI Reference Seguence: NM_001001935.2 LOCUS NM_001001935 ION NM_001001935 l ggggcagtac ttccgggtca ggtgggccgg ctgtcttgac cttctttgcg gctcggccat 6; tttgtcccag tcagtccgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 12; agatgctgtc Cgtgcgcgtt gctgcggccg gcgc ccttcctcgg cgggccggac 18; ccag aaatgctttg tctt tcattgctgc aaggaacttc catgcctcta 24; acactcatct tcaaaagact ggtaagttat tatttctcag tctacgccgc acttactaga ; tgaagatata atac atcgtataac tgtgggactg ctgagatgtc ctctattctt 36; gaagagcgta ttcttggagc tgatacctct gttgatcttg aagaaactgg gcgtgtctta 42; ggtg atggtattgc ccgcgtacat gggctgagga atgttcaagc agaagaaatg 48; gtagagtttt cttcaggctt tatg tccttgaact tggaacctga caatgttggt 54; gttgtcgtgt ttggaaatga taaactaatt aaggaaggag atatagtgaa gaggacagga 60; gccattgtgg acgttccagt tggtgaggag ggtc gtgtagttga tgcccttggt 66; aatgctattg atggaaaggg tccaattggt tccaagacgc gtaggcgagt tggtctgaaa 72; gcccccggta tcattcctcg aatttcagtg cgggaaccaa tgcagactgg cattaaggct 78; gtggatagct tggtgccaat tggt gaac tgattattgg tgaccgacag 84; actgggaaaa cctcaattgc tattgacaca atcattaacc agaaacgttt caatgatgga 90; tctgatgaaa agaagaagct tatt tatgttgcta ttggtcaaaa gagatccact 96L gttgcccagt tggtgaagag acttacagat gcagatgcca tgaagtacac cattgtggtg 102; tcggctacgg cctcggatgc actt cagtacctgg ctccttactc tggctgttcc 108; atgggagagt gaga caaa catgctttga tcatctatga cgacttatcc 114; aaacaggctg ttgcttaccg tcagatgtct ctgttgctcc gccgaccccc tggtcgtgag 120; gcctatcctg gtgatgtgtt acac tcccggttgc tggagagagc agccaaaatg L26; aacgatgctt ttggtggtgg ctccttgact gctttgccag tcatagaaac tggt L32; gatgtgtctg cttacattcc aacaaatgtc atttccatca ctgacggaca gatcttcttg L38; gaaacagaat tgttctacaa aggtatccgc cctgcaatta acgttggtct gtctgtatct L44; cgtgtcggat ccgctgccca aaccagggct atgaagcagg tagcaggtac gctg L50; gctc gtga ggttgctgct tttgcccagt tcggttctga cctcgatgct L56; gccactcaac aacttttgag tcgtggcgtg cgtctaactg agttgctgaa gcaaggacag L62: tattctccca tggctattga agaacaagtg gctgttatct atgcgggtgt aaggggatat L68; cttgataaac tggagcccag caagattaca aagtttgaga atgctttctt gtctcatgtc -74; gtcagccagc accaagcctt gttgggcact gctg atggaaagat agctgaaaga gattgtaaca aatttcttgg ctggatttga gatt cacatcaaat accagttcag ttttgtcatt gtaa attagttcca L92; tttgtaaaag ggttactctc atactcctta tgtacagaaa tcacatgaaa aataaaggtt 1981 ccataatgca tagttaaaaa Protein seguence (variant 4): NCBI Reference Seguence: NP_001001935.1 LOCUS N P_001001935 ACCESSION NP_001001935.1 l mssileeril gadtsvdlee tgrvlsigdg iarvhglrnv efss glkgmslnLe 6; pdnvgvvvfg ndklikegdi vkrtgaivdv pvgeellgrv vdalgnaidg kgpigsktrr 12; rvglkapgii prisvrepmq tgikavdslv pigrgqreli igdrqtgkts iaidtiinqk 18; rfndgsdekk klyciyvaig qkrstvaqlv krltdadamk ytivvsatas daaplqylap 24; ysgcsmgeyf alii yddlskqava yrqmslllrr ppgreaypgd vfylhsrlLe ; raakmndafg ggsltalpvi etqagdvsay iptnvisitd gqifletelf ainv 36L glsvsrvgsa aqtramkqva gtmklelaqy revaafaqu sdldaatqql lsrgvrltel 42L pma ieeqvaviya gvrgyldkle pskitkfena flshvvsqhq allgtiradg 48L kiseqsdakl keivtnflag fea tide seguence (variant 5): NCBI Reference Seguence: NM_001257335.1 LOCUS NM_001257335 ACCESSION NM_001257335 l ggggcagtac ttccgggtca ggtgggccgg ctgtcttgac cttctttgcg gctcggccat 6; tttgtcccag tcagtccgga ggctgcggct gcagaagtac cgcctgcgga gtaactgcaa 12; agatgctgtc cgtgcgcgtt gccg tggtccgcgc ccttcctcgg cgggccggac 18; tggtgagcac cgaaggccgg catgatgcag gcggccgggt ggggctgcag ggtggtggtg 24; cgccggctcg ggcgctctct gcaggagggc gaggggctgt ggcgaatgcc gccatcttgc 30L acccgtgqct tctccggctg gacagagcag gcgacacagg tgcccttttg ctcgtcacct 36L gcgcagaggc agaatggtac agggcagaca gttaactcga tggtgtccag agacagggcc 42L tcaagattcc tgtcttcggc tgacagcggc cctagaaggg ggatcttggg tgaaggtcag 48L ggcttgggcg ctagctctcc tgtt ctgaatcggt ctccagaaat gctttgggtt 54; catctttcat tgctgcaagg aacttccatg cctctaacac tcatcttcaa aagactggga 60; ctgctgagat gtcctctatt cttgaagagc gtattcttgg agctgatacc tctgttgatc 66; ttgaagaaac tgggcgtgtc ttaagtattg gtgatggtat tgcccgcgta catgggctga 72; ggaatgttca agcagaagaa atggtagagt tttcttcagg cttaaagggt atgtccttga 78; acttggaacc tgacaatgtt ggtgttgtcg tgtttggaaa tgataaacta attaaggaag 84; gagatatagt gaagaggaca ggagccattg tggacgttcc agttggtgag gagctgttgg 90; gtcgtgtagt tgatgccctt ggtaatgcta ttgatggaaa gggtccaatt ggttccaaga 96; cgcgtaggcg agttggtctg aaagcccccg gtatcattcc tcgaatttca gtgcgggaac L02; caatgcagac tggcattaag gctgtggata gcttggtgcc aattggtcgt ggtcagcgtg L08; aactgattat tggtgaccga cagactggga aaacctcaat tgctattgac atta L14; accagaaacg tttcaatgat ggatctgatg agaa gctgtactgt atttatgttg L20; gtca aaagagatcc actgttgccc agttggtgaa taca gatgcagatg L26; agta tgtg gcta cggcctcgga tgctgcccca cttcagtacc L32; tggctcctta ctctggctgt ggag ttag agacaatggc gctt L38; tgatcatcta tgacgactta tccaaacagg ctgttgctta ccgtcagatg tcgt gaggcctatc ctggtgatgt gttctaccta L50; tgctggagag agcagccaaa atgaacgatg cttttggtgg tggctccttg actgctttgc 156; cagtcataga aacacaggct ggtgatgtgt ctgcttacat tccaacaaat tcca L62; tcactgacgg acagatcttc ttggaaacag aattgttcta caaaggtatc gcaa L68; ttaacgttgg tctgtctgta tctcgtgtcg gatccgctgc ccaaaccagg gctatgaagc 174; aggtagcagg taccatgaag ctggaattgg ctcagtatcg tgaggttgct gcttttgccc 180; agttcggttc tgacctcgat gctgccactc aacaactttt gagtcgtggc gtgcgtctaa 186; ctgagttgct gaagcaagga cagtattctc ccatggctat tgaagaacaa gtta 192; cggg tgtaagggga tatcttgata aactggagcc cagcaagatt tttg 198; cttt cttgtctcat agcc agcaccaagc cttgttgggc actatcaggg 204; ctgatggaaa gatctcagaa caatcagatg caaagctgaa agagattgta acaaatttct 210- tggctggatt tgaagcttaa gtgg attcacatca aataccagtt cagttttgtc 216; attgttctag taaattagtt ccatttgtaa aagggttact ctcatactcc ttatgtacag 222; aaatcacatg aaaaataaag gttccataat gcatagttaa aaa Protein seguence (variant 5): NCBI Reference Seguence: NP_001244264.1 LOCUS N P_001 244264 ACCESSION N P_001244264 l eril dlee tgrvlsigdg iarvhglrnv qaeemvefss glkgmslnLe 6; pdnvgvvvfg ndklikegdi vkrtgaivdv pvgeellgrv vdalgnaidg kgpigsktrr 12; rvglkapgii prisvrepmq tgikavdslv pigrgqreli igdrqtgkts iaidtiinqk 18; rfndgsdekk klyciyvaig qkrstvaqlv krltdadamk ytivvsatas daaplqylap 24; ysgcsmgeyf rdngkhalii yddlskqava yrqmslllrr ppgreaypgd vfylhsrlLe ; dafg ggsltalpvi etqagdvsay iptnvisitd gqifletelf ykgirpainv 36; vgsa aqtramkqva gtmklelaqy revaafaqu sdldaatqql lsrgvrltel 42; lkngyspma ieeqvaviya gvrgyldkle pskitkfena flshvvsqhq radg 48; kiseqsdakl keivtnflag fea HSPA9 (See entry for GRP75 above) MARS OfibblSwnbszARS Official Name: methionyl-tRNA synthetase Gene ID: 4141 Organism: Homo sapiens Other Aliases: METRS, MRS, MTRNS Other Designations: cytosolic methionyl-tRNA synthetase; methionine tRNA ligase 1, cytoplasmic; methionine--tRNA ligase, asmic tide ce: NCBI Reference Seguence: NM_004990.3 LOCUS NM_004990 ACCESSION NM_004990 l aaatagtcta ctttccggta gcggtgccag ggcagtggcc taatacggaa ctccatttcc 6; cggcgtgcct cgcggaggcc gctgaactca gaagcgggag gccggttccg gttgcatcag 12; cgagggattc acggcgaaat gagactgttc gtgagtgatg gcgtcccggg ttgcttgccg l8; gtgctggccg ccgccgggag agcccggggc agagcagagg tgctcatcag cactgtaggc 24; ccggaagatt tccc gttcctgacc cggcctaagg tccctgtctt gcagctggat ; aact acctcttctc cactagtgca atctgccgat tttt gttatctggc 36; tgggagcaag atgacctcac gtgg ctggaatggg aagcgacaga gctgcagcca 42; gctttgtctg ctgccctgta ctatttagtg gtccaaggca agaaggggga agatgttctt 48; ggttcagtgc ggagagccct gactcacatt gaccacagct tgagtcgtca gaactgtcct 54; ttcctggctg gggagacaga atctctagcc gacattgttt tgtggggagc cctataccca 60; ttactgcaag atcccgccta cctccctgag gagctgagtg ccctgcacag ctggttccag 66; acactgagta cccaggaacc atgtcagcga gctgcagaga ctgtactgaa acagcaaggt 72; gtcctggctc tccggcctta cctccaaaag cagccccagc ccagccccgc tgagggaagg 78; gctgtcacca ctga ggaggaggag ctggctaccc tatctgagga ggagattgct 84; gtta ctgcttggga gaagggccta gaaagtttgc ccccgctgcg gccccagcag 90L gtgt tgcctgtggc tggagaaagg aatgtgctca tcaccagtgc ttac 96L gtcaacaatg tcccccacct tgggaacatc attggttgtg tgctcagtgc cgatgtcttt 102; gccaggtact ctcgcctccg ccagtggaac accctctatc tgtgtgggac agatgagtat 1081 ggtacagcaa cagagaccaa ggctctggag ctaa agga gatctgcgac 114; aagtaccaca tcatccatgc tgacatctac cgctggttta acatttcgtt tgatattttt 120; ggtcgcacca caca gcagaccaaa atcacccagg acattttcca gcagttgctg L26; aaacgaggtt ttgtgctgca agatactgtg gagcaactgc gatgtgagca ctgtgctcgc L32; ttcctggctg tcgt ggagggcgtg ttct gtggctatga ggaggctcgg L38; cagt gtgacaagtg tggcaagctc atcaatgctg tcgagcttaa gaagcctcag L44; tgtaaagtct gccgatcatg ccctgtggtg cagtcgagcc agcacctgtt tctggacctg L50; cctaagctgg agaagcgact ggaggagtgg ttggggagga cattgcctgg cagtgactgg L56; acacccaatg cccagtttat cacccgttct tggcttcggg atggcctcaa gccacgctgc L62; ataacccgag acctcaaatg gggaacccct gtacccttag ttga ggta L68; ttctatgtct ggtttgatgc cactattggc tatctgtcca tcacagccaa gatggtggaa gaacccagag caagtggacc tgtatcagtt ctttccatag cttagtcttt ccttgctcag ccctaggagc tataccttgg tcagccacct cattgctaca gagtacctga actatgagga tctaagagcc gcggtgtggg agtgtttggg gacatggccc aggacacggg L98; gacatctggc gcttctatct gctgtacatt ngcctgagg gccaggacag tgctttctcc 204; tggacggacc tgctgctgaa gaataattct gagctgctta tggg caacttcatc 210; aacagagctg ggatgtttgt gttc tttgggggct atgtgcctga gatggtgctc 216; acccctgatg atcagcgcct gctggcccat gtcaccctgg agca ccag 222; ctacttgaga aggttcggat ccgggatgcc ttgcgcagta tcctcaccat atctcgacat 228; ggcaaccaat atattcaggt gccc tggaagcgga ttaaaggcag tgaggctgac 234; aggcaacggg caggaacagt gactggcttg gcagtgaata cctt gctctctgtc 240; atgcttcagc cttacatgcc cacggttagt gccacaatcc agct gcagctccca 246; cctccagcct gcagtatcct gctgacaaac tgta ccttaccagc aggacaccag 252; attggcacag tcagtccctt gttccaaaaa aatg accagattga aagtttaagg 258; cagcgctttg gagggggcca aacg tccccgaagc cagcagttgt agagactgtt 264; acaacagcca agccacagca gatacaagcg ctgatggatg aagtgacaaa acaaggaaac 270; attgtccgag aactgaaagc acaaaaggca aacg aggttgctgc ggaggtggcg 2761 aaactcttgg atctaaagaa acagttggct gtagctgagg ggaaaccccc tgaagcccct 282; aaaggcaaga agaaaaagta aaagaccttg gctcatagaa agtcacttta atagataggg 2881 acagtaataa ataaatgtac aatctctata tacaaaaaaa aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_004981.2 LOCUS N P_004981 ION 981 l mrlfvsdgvp gc;pvlaaag rargraevli stvgpedcvv pfltrpkvpv lqldsgny;f 6; stsaicryff ;lsgweqddl tnqwleweat elqpalsaal yylvvqgkkg edvlgsvrra 12; lthidhslsr aget esladivlwg alypllqdpa ylpeelsalh swfqtlstqe l8; pcqraaetvl qugvlalrp qukqpqpsp aegravtnep eeeelatlse eeiamavtaw 24; ekgleslppl rpqqnpvlpv agernvlits alpyvnnvph lgniigcvls advfarysrl ; rqwntlylcg tdeygtatet ltpq eicdkyhiih adiyrwfnis fdifgrtttp 36; qqtkitqdif qqllkrgfvl thveqlrce hcarfladrf fcgy eeargdchk 42; cgklinavel kkpqckvcrs cpvvqssth fldlpklekr leewlgrtlp gsdwtpnaqf 48; rdgl kprcitrdlk wgtpvplegf edkvfvafd atigylsita nytdqwerww 54; knpequlyq fmakdnvpfh slvfpcsalg aednytlvsh liateylnye dgkfsksrgv 60; gvfgdmath gipadiwrfy llyirpequ safswtdlll knnsellnnl gnfinragmf 66; vskffggyvp emvltpddqr llahvtlelq hthllekvr silt isrhgnqyiq 72; vnepwkrikg seadrqragt niaa llsvmlqpym ptvsatiqaq lqlpppacsi 78; lltnflctlp aghqigtvsp lqulendqi esqurfggg qaktspkpav vetvttakpq 84; qiqalmdevt kqgnivrelk aqkadkneva aevaklldlk kqlavaegkp peapkgkkkk Official Symbol: SENP1 Official Name: SUMO1r‘sentrin specific peptidase 1 Gene ID: 29843 Organism: Homo sapiens Other Aliases: SuPr-2 Other ations: sentrin specific protease 1;sentrin-specific protease 1 ; sentrin/SUMO-specific protease SENP1 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001267594.1 LOCUS NM_001267594 ACCESSION NM_001267594 l attccgagta cgagaaagcg aaaaagccca gactgaaaag ggtactgaga aattacgact 6; aaat gctcccttcg cttctcgggc ctcgccacac cgcgcaggcg ccccactggt 12; ccttaactct gttctttgac ctcctgcccc agccccctcc tcttcagcca gact 18; cttccggtgc tgtgaaggcg gttccggttc gcggcggttc ccgggttttg gcgc 24; ccggccggaa accccttcgc atggcagccg gttc ggactttgta tctttgctaa 30L agtcagtgat gtgaaaagac ttgaaatgga tgatattgct atga ggatggatgc 36L tggagaagtg actttagtga accacaactc cgtattcaaa acccacctcc tgccacaaac 42L aggttttcca gaggaccagc tttcgctttc tgaccagcag attttatctt ccaggcaagg 48L ggac cgatctttta catgttccac aagaagtgca gcttataatc caagctatta 54; taat ccttcctcag acagttttct tggctcaggc gatttaagaa cctttggcca 60; gagtgcaaat ggccaatgga gaaattctac cccatcgtca agctcatctt tacaaaaatc 66; aagaaacagc Cgaagtcttt acctcgaaac ccgaaagacc tcaagtggat tatcaaacag 72; ttttgcggga aagtcaaacc atcactgcca tgtatctgca tatgaaaaat cttttcctat 78; taaacctgtt ccaagtccat cttggagtgg ttcatgtcgt cgaagtcttt tgagccccaa 84; gaaaactcag aggcgacatg cagc agaagagaca gttcaagaag aagaaagaga 90; gatttacaga cagctgctac agatggtcac agggaaacag tttactatag ccaaacccac 96; cacacatttt cctttacacc gatg tcttagttcc agtaaaaata ctttgaaaga L02; ctcactgttt aaaaatggaa actcttgtgc atctcagatc attggctctg atacttcatc L08; atctggatct gccagcattt taactaacca gctg tcccacagtg tatattccct L14; atcttcttat accccagatg ttgcatttgg atccaaagat tctggtactc ttcatcatcc L20; tcac cactctgttc cacatcagcc agataactta gcagcttcaa atacacaatc L26; tgaaggatca gactctgtga ttttactgaa agat tcccagactc caactcccag L32; tttc ttccaggcag agctgtggat caaagaatta actagtgttt atgattctcg L38; agaa agattgcgcc agattgaaga acagaaggca ttggccttac attg caggagcggg aacattcagt acatgattca gtagaactac acctcttgaa aaggagattc ctgttactgt tgtccaagaa acacaaaaaa attaactgat agtgaagatg aatttcctga aattacagag gaaatggaga aagaaataaa L62; gaatgtattt Cgtaatggga atga agttctcagt gaagcatttc tacacgcaaa gatattcaaa acca tctgaattgg ctcaatgatg agatcatcaa 174; tttctacatg aatatgctga tggagcgaag taaagagaag ggcttgccaa gtgtgcatgc 180; atttaatacc tttttcttca ctaaattaaa aacggctggt gcag tgaaacgttg 186; gacaaagaaa gtagatgtat ttga cattcttttg gtgcccattc acctgggagt 192; acactggtgt gttg ttag aaagaagaat attacctatt acgactccat 198; gggtgggata aacaatgaag cctgcagaat actcttgcaa tacctaaagc aagaaagcat 204; tgacaagaaa aggaaagagt ttgacaccaa tggctggcag cttttcagca agaaaagcca 210; ggagattcct cagcagatga atggaagtga ctgtgggatg tttgcctgca aatatgctga 216; tacc agac caatcaactt cacacagcaa cacatgccat acttccggaa 222; gcggatgqtc atcc tccaccgaaa actcttgtga agactgtctc acttagcaga 228; ccttgaccat gtgggggacc agctctttgt tgtctacagc cagagacctt ggaaacagct 234; gctcccagcc ctctgctgtt gtaacaccct tgatcctgga ccaggccctg gcgagatgca 240; ttcacaagca catctgcctt tccttttgta tctcagatac tatttttgca aagaaacttt 246; ggtgctgtga aaggggtgag ggacatccct aagctgaaga ctgc ttttcacttc 252; ttcagttctg ccatcttgtt ttcaaagggc tccagcctca ctcagtccct ggga 258; aaag cttggaaaga atcttggttt catataaatt cttgttgtta ggccttacta 264; agaagtagga aagggcatgg gcaaaaggta aaaa ccaccagcat atacatggac 270; atacacacac acccacacac acaaacacac acacacacac aattttcacg atgtatggtc 276; aggaatgtga ctgtaaactg gggg cccaggcata agtcccttcc tccaggacct 282; ttcctattta tatgtcccta tacaaaatcc atctgctttt atacgtagct gttttatcat 288; ctgtagcttc atcctatccg gaggcacagc acatgagccc tggacaggtc ccaaagttcc 294; aagcagtcct gaaa gcaggggttt gcatgtgcta ccaacacatg atacggggaa 300; gacccaccca gggagcggtt gcgc aacaaagcac cacttttact gttgcctact 306; tctgaccaag aagaaaaagg accttagtat ttagcataaa attccagcgc tggatgaatg 312; cagatctagt tgtg gctagtttaa atatgtttct aaccacagag aatttcatat 318; atatatacat atatatatac acatacatat atatatatat atatgtatgt tttc 3241 acagggatat gctttttttt ttaaagactg aatgtgttca ccatttagcc tgtagattta 330; tttccatttt ccaaattcca gcacacagag atcccagccc ctatgagtag ggtgtttqtq 336; gactacctaa tggaatattt ttgaggcctg gatgaacttt gccatatggg tagaggttac 342; agagggaggt gatattttca gctaaaaaaa aaaacgggtg gagtttggac tgatcaactt 348; gagatttaaa tatt ccttttgttc tttctagcat ctctccccac cctctgagag 354; ctcctcaggc ttagatagtg aagtgatcaa gtgt cattttgtac ttaagttcca 360; aagtaggaac attttatact tttttctgta tagg tagttttgta cttt 366; tctcctctcc cgttgtaccg cattctttcc agcattgtgc tttttccctg ggcttatttg 372; aaaattttac tgttttatac aagctcgttt agtacatttt tttt accacaagtt 378; acaatttgaa aagaaaacta ttttttttaa atattccatt gttaactgaa tgttactgtt 384; tccactccag caactacatg tcctcccttc aactgcctgc cttttgggga aagaccacct 3901 tttgtgtgtt tgttttttct ctctctttct ttccctttct atct ctctttattt 396; ttctttcttt ttctttgttt ttgagttttc tataggaaat aaatagcttt ctatatatga 402; gttgctgqgg acat ttag aaagctgtgg catgcagtct cattgcagga 4081 ctcctggaat attgtctggt tcttggtatt tactgtatgt aagcaacaac ttgaaaggtg 414; gcaatatggt gtcgatttgg actatgaatc aaaagacctt gttc tttcactatt 420; gtctggggga ctcagaacaa gattgttctc tatt ccat ttaggtaaca 426; tctgtcttac cttcctcaca gactttgtac agaccaaagc aacaaatatt tattgccatg 432; tatagcagaa aatgaaacat gcaacaaaag cactttgaaa aatatataag gaattgttga 438; gcctgtctga atttgggccc cctttctgac taatgcagtt aagg tagaagttag 444; tgaccctgag ttac caccctggac ctggtccaaa tacagactta cacagtggac 450; cattctttcc tgagctagcc aacaagagca ggagtagtat ctggaaactt tcccctttgt 456; ttaggggtag gctttgatga ccaggaaaaa aaaaaaggta tttctgcatt ttatggccca 462; aaggcatgtt attaatatct tatgtaattt actttaaact aaataagact tttttctcct 4681 gtgtaaaaaa aaa Protein seguence (variant 1): NCBI Reference Seguence: NP_001254523.1 LOCUS N P_001 254523 ACCESSION N P_001254523 1 mddiadrmrm dagevtlvnh nsvfkthllp qtgfpedqls lsdqqilssr qghldrsftc 61 strsaaynps yysdnpssds flgsgdlrtf gqsanngrn stpssssslq slyl 121 etrktssgls nsfagksnhh chvsayeksf pikpvpspsw sgscrrslls pkktqrrhvs 181 taeetvqeee reiyrqllqm vtgkqftiak ptthfplhls rclssskntl ngns 241 casqiigsdt sssgsasilt nqeqlshsvy pdva fgskdsgtlh hphhhhsvph 301 qunlaasnt qsegsdsvil lkvkdsqtpt psstffqael wikeltsvyd srarerqui 361 eeqkalalql qnquqereh elhl rvplekeipv tvvqetqkkg hkltdsedef 421 peiteemeke iknvfrngnq devlseafrl titrkdiqtl nhlnwlndei infymnmlme 481 rskekglpsv fftk lktaquavk rwtkkvdvfs vdillvpihl gvhwclavvd 541 frkknityyd smgginneac rillqylkqe sidkkrkefd tngwafskk sqeipqqmng 601 sdcgmfacky adcitkdrpi nftqqhmpyf rkrmvweilh rk11 deseguence(vafiantZr NCBI Reference Seguence: NM_001267595.1 LOCUS NM_001267595 ACCESSION 267595 1 atggagggag ccag gagactgtgt cggacccggt cagccggccc gggctggact 6; gggcggaagc ggggagcact ccgg cgccttttcc tctgccccgc cccctgggac 12; cacctcccct cccccctgct gtccggtggc gtgg ccgccggtgg ccgttaggct 18; acctgaggcc gttccttctg gtctctctct cctgggccgc ggagagaccg tctccctgcc 24; gttacagcag gccccatccc agcgcccagc cgtacttggg gaaaggccgg ttgcgattcc ; ggggctttcc agct gggtcttctc tggggagagc tgttttcacc gggaagctcg 36; gctttctgtg gtaccggctt catctcccgc cttccttgag acccgagtga tatttcttga 42; ctacttctgc gtctcacgta aacatttctc caactctcct actctgtggt atctccctga 48; atat cgctagtgcc accatcagaa agaaacgtct ggaccctcct gact 54; ttgtatcttt gctaaagtca gtgatgtgaa aagacttgaa atggatgata ttgctgatag 60; gatgaggatg gatgctggag aagtgacttt agtgaaccac aactccgtat tcaaaaccca 66; cctcctgcca caaacaggtt ttccagagga ttcg ctttctgacc tttt 72; atcttccagg caaggacatt tggaccgatc ttttacatgt tccacaagaa gtgcagctta 78; taatccaagc tattactcag ataatccttc ctcagacagt tttcttggct caggcgattt 84; aagaaccttt ggccagagtg caaatggcca aaat tctaccccat cgtcaagctc 901 atctttacaa aaatcaagaa acagccgaag tctttacctc gaaacccgaa agacctcaag 96L tggattatca aacagttttg cgggaaagtc aaaccatcac tgccatgtat ctgcatatga 102; aaaatctttt cctattaaac ctgttccaag tccatcttgg agtggttcat gtcgtcgaag 1081 tcttttgagc cccaagaaaa ctcagaggcg acatgttagt acagcagaag agacagttca 114; agaagaagaa agagagattt acagacagct gctacagatg gtcacaggga aacagtttac 120; tatagccaaa cccaccacac cttt gtct ctta gttccagtaa L26; aaatactttg aaagactcac tgtttaaaaa tggaaactct tgtgcatctc agatcattgg L32; ctctgatact tctg gatctgccag cattttaact aaccaggaac agctgtccca L38; cagtgtatat tccctatctt cttatacccc agatgttgca tttggatcca aagattctgg L44; tactcttcat catccccatc atcaccactc tgttccacat cagccagata acttagcagc L50; ttcaaataca caatctgaag gatcagactc tgtgatttta ctgaaagtga aagattccca L56; gactccaact cccagttcta ctttcttcca ggcagagctg tggatcaaag L62; tgtttatgat tctcgagcac gagaaagatt gcgccagatt gaagaacaga aggcattggc L68; gctt caaaaccaga gattgcagga gcgggaacat tcagtacatg actacatctt cgtgtacctc ttgaaaagga gattcctgtt actgttgtcc aagaaacaca L80; aaaaaaaggt cataaattaa ctgatagtga agatgaattt cctgaaatta cagaggaaat L86; ggagaaagaa ataaagaatg tatttcgtaa tgggaatcag gatgaagttc atttcgcctg accattacac gcaaagatat tcta aaccatctga tgatgagatc ttct acatgaatat gctgatggag cgaagtaaag agaagggctt 204; tgtg catgcattta ataccttttt cttcactaaa ttaaaaacgg ctggttatca 210; ggcagtgaaa Cgttggacaa agaaagtaga tgtattttct gttgacattc ttttggtgcc 216; cattcacctg ggagtacact ggtgtctagc tgttgtggac tttagaaaga agaatattac 222; cgac tccatgggtg ggataaacaa tgaagcctgc agaatactct tgcaatacct 228; aaagcaagaa gaca agaaaaggaa tgac ggct ggcagctttt 234; cagcaagaaa agccaggaga ttcctcagca gatgaatgga agtgactgtg ttgc 240; ctgcaaatat gctgactgta aaga cagaccaatc aacttcacac agcaacacat 246; cttc cggaagcgga tggtctggga gatcctccac cgaaaactct gact 252; gtctcactta gcagaccttg accatgtggg ggaccagctc tttgttgtct acagccagag 258; accttggaaa cagctgctcc cagccctctg ctgttgtaac acccttgatc ctggaccagg 264; ccctggcgag atgcattcac aagcacatct gcctttcctt ctca gatactattt 270; ttgcaaagaa actttggtgc tgtgaaaggg gtgagggaca tccctaagct gaagagagag 2761 actgcttttc acttcttcag ttctgccatc ttgttttcaa agggctccag cctcactcag 282; tccctaatta tgggactgag aaaagcttgg aaagaatctt atat aaattcttgt 288; tgttaggcct tactaagaag taggaaaggg catgggcaaa aggtagggat cacc 294; agcatataca tggacataca cacacaccca cacacacaaa cacacacaca cacacaattt 300; tcacgatgta tggtcaggaa tgtgactgta aactggactt tggggcccag gcataagtcc 306; ccag gacctttcct atttatatgt ccctatacaa aatccatctg cttttatacg 312; tagctgtttt atcatctgta gcttcatcct atccggaggc acagcacatg agccctggac 318; aggtcccaaa gttccaagca gtcctttccg tgaaagcagg catg tgctaccaac 324; acatgatacg gggaagaccc acccagggag cggtttcagt ggcgcaacaa agcaccactt 330; ttactgttgc ctacttctga ccaagaagaa aaaggacctt agtatttagc ataaaattcc 336; agcgctggat gaatgcagat ctagtttggt ctgtggctag tttaaatatg tttctaacca 342; cagagaattt catatatata tata tatacacata catatatata tatatatatg 348; tatgtataaa cagg gatatgcttt tttttttaaa gactgaatgt gttcaccatt 354; tagcctgtag ttcc attttccaaa ttccagcaca cagagatccc agcccctatg 360; agtagggtgt ttgtggacta ggaa tatttttgag gcctggatga actttgccat 366; atgggtagag gttacagagg gaggtgatat tttcagctaa aaaaaaaaac gggtggagtt 372; tggactgatc aacttgagat ttaaaaactg ctattccttt tgttctttct agcatctctc 378; ctct gagagctcct caggcttaga tagtgaagtg atcaaatgcc agtgtcattt 384; tgtacttaag ttccaaagta ggaacatttt atactttttt ctgtattgta ataggtagtt 390; ttgtatgaaa ctcc tctcccgttg taccgcattc tttccagcat tgtgcttttt 396; ccctgggctt atttgaaaat tttactgttt agct cgtttagtac atttttctat 402; gttttaccac aagttacaat ttgaaaagaa aactattttt tttaaatatt ccattgttaa 408; gtta ctgtttccac tccagcaact cctc ccttcaactg cctgcctttt 414; ggggaaagac caccttttgt gtgtttgttt tttctctctc tttctttccc tttctctttc 420; tatctctctt tatttttctt tctttttctt tgtttttgag ttttctatag gaaataaata 426; gctttctata tatgagttgc tggggacctt cacattctct aagc tgtggcatgc 432; agtctcattg caggactcct ggaatattgt ctggttcttg gtatttactg tatgtaagca 438; acaacttgaa aggtggcaat atggtgtcga tttggactat gaatcaaaag ttca 444; ggttctttca ctattgtctg tcag aacaagattg ttctctgtat ttattgtttg 450; tccatttagg taacatctgt cttaccttcc tcacagactt gacc aaagcaacaa 4561 atatttattg atag cagaaaatga aacatgcaac aaaagcactt tgaaaaatat 462; ataaggaatt gttgagcctg tctgaatttg ggcccccttt ctgactaatg cagttttgca 468; caaggtagaa gttagtgacc ctgagaccat cttaccaccc tggacctggt acag 474; acttacacag tggaccattc tttcctgagc tagccaacaa gagcaggagt agtatctgga 480; aactttcccc tttgtttagg ggtaggcttt gatgaccagg aaaaaaaaaa aggtatttct 4861 gcattttatg gcccaaaggc atgttattaa tatcttatgt cttt aaactaaata 4921 agactttttt ctcctgtgta aaaaaaaa Protein seguence (variant 2): NCBI Reference Seguence: NP_001254524.1 LOCUS N P_001254524 ACCESSION N P_001254524 l mddiadrmrm dagevtlvnh nsvfkthllp qtgfpedqls lsdqqilssr qghldrsftc 6L strsaaynps yysdnpssds lrtf gqsanngrn sslq ksrnsrslyl 12L etrktssgls nsfagksnhh chvsayeksf pikpvpspsw sgscrrslls pkktqrrhvs l8; taeetvqeee reiyrqllqm vtgkqftiak ptthfplhls rclssskntl kdslfkngns 24; casqiigsdt sssgsasilt nqeqlshsvy pdva gtlh hphhhhsvph ; qunlaasnt qsegsdsvil lkvkdsqtpt psstffqael wikeltsvyd srarerqui 36; eeqkalalql qnquqereh svhdsvelhl rvplekeipv tvvqetqkkg hkltdsedef 42; peiteemeke iknvfrngnq devlseafrl titrkdiqtl nhlnwlndei infymnmlme 48; rskekglpsv hafntffftk lktaquavk rwtkkvdvfs vdillvpihl gvhwclavvd 54; frkknityyd smgginneac lkqe sidkkrkefd tngwafskk sqeipqqmng 60; sdcgmfacky drpi nftqqhmpyf rkrmvweilh rkll ATPIF1 Official Symbol: ATPIF1 Official Name: ATPase inhibitory factor 1 Gene ID: 93974 Organism: Homo sapiens Other Aliases: RP5-1092A3.1, ATPI, ATPIP, IP Other ations: ATP synthase inhibitor protein; ATPase inhibitor protein; ATPase inhibitor, mitochondrial; |F(1); |F1; tor of F(1)F(o)-ATPase Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_016311.4 LOCUS NM_016311 ACCESSION NM_016311 l gaccagattg ggtgcttggc cgtccctgcc gcgt aacgagagac tgcttgctgc 6" ggcagagacg ccagaggtgc agctccagca gcaatggcag tgacggcgtt gcgg 12; cttg gcgtgtgggg cgtgaggacc atgcaagccc gaggcttcgg ctcggatcag 18; tccgagaatg gggg cgcgggctcc atccgggaag ccggtggggc aaag 24; agagagcagg agga acgatatttc cgagcacaga gtagagaaca actggcagct 30L ttgaaaaaac accatgaaga agaaatcgtt catcataaga aggagattga gcgtctgcag 36L aaagaaattg agcgccataa gcagaagatc aaaatgctaa aacatgatga ttaagtgcac 42; accgtgtgcc atagaatggc acatgtcatt gcccacttct gtgtagacat ggttctggtt 48L taactaatat ttgtctgtgt gctactaaca gattataata aattgtcatc agtgaactgt 54; gaaaaaaaaa aaaaaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_057395.1 LOCUS N P_057395 ACCESSION NP_057395 l mavtalaart rtm qargfgsdqs envdrgagsi reaggafgkr eqaeeeryfr 61 laal kkhheeeivh hkkeierlqk eierhquik mlkhdd Nucleotide seguence (variant 2): NCBI nce Seguence: NM_178190.2 LOCUS 190 ACCESSION NM_178190 l gaccagattg ggtgcttggc cgtccctgcc attagcgcgt aacgagagac tgcttgctgc 6" ggcagagacg ccagaggtgc agctccagca gcaatggcag tgacggcgtt ggcggcgcgg 12L acgtggcttg gcgtgtgggg cgtgaggacc atgcaagccc gaggcttcgg ctcggatcag l8; tccgagaatg tcgaccgggg cgcgggctcc gaag ccggtggggc cttcggaaag 24L agagagcagg agga tttc cgacattaca ggttatgctt tgagatctct ; ttggggtgaa ggattgaaat taaaccctga gccaccgtgt ccttgtagag cacagagtag 36; agaacaactg gcagctttga aaaaacacca tgaagaagaa atcgttcatc ataagaagga 42; gattgagcgt ctgcagaaag aaattgagcg ccataagcag aagatcaaaa tgctaaaaca 481 ttaa gtgcacaccg tgtgccatag aatggcacat gtcattgccc acttctgtgt 541 agacatggtt ctggtttaac taatatttgt ctgtgtgcta ctaacagatt aatt 601 gtcatcagtg aactgtgaaa aaaa aaaaaa Protein seguence (variant 2): NCBI Reference Seguence: NP_835497.1 LOCUS N 97 ION NP_835497 l mavtalaart wlnggvrtm qargfgsdqs agsi reaggafgkr eqaeeeryfr 61 hyrlcfeisl g Nucleotide seguence (variant 3) NCBI nce Seguence: NM_178191.2 LOCUS NM_178191 ACCESSION NM_178191 l gaccagattg ggtgcttggc cgtccctgcc attagcgcgt aacgagagac tgcttgctgc 6; ggcagagacg ccagaggtgc agca gcaatggcag tgacggcgtt ggcggcgcgg 12; acgtggcttg gcgtgtgggg gacc atgcaagccc gaggcttcgg ctcggatcag 18; tccgagaatg tcgaccgggg Cgcgggctcc atccgggaag ccggtggggc cttcggaaag 24; agagagcagg ctgaagagga acgatatttc cggtgaggct caccgggtcc caagtccagc ; cctggatctc ccaatggcct tccaatcctt aaactgccaa tcgccccacc cgttcctacc 36; tggtgccttg ggcgccccat cccccaacag aactcccggg ccccaatcca gtatacccta 42; acccttgatg tcccgaccgt tgccacgtat agggcactcc cagttacctg cacaacagtt 48; tcaggccccc aaaccgtttc cggg tctccaaaac aacccacggc tcaactcctc 54; ctttatcatt tccc gcgtggagtt ctcctcaggt cgtgcgaaac acccccagat 60; tcttcgcaca gtgtctagat ccgaccgccc tgcc tcccagcctg actccctcgg 66L cccttaccca cctgtcaccc cctctacgct ctccttcctc gccagcacgc cttagctttg 72; tgca tgcattcagg cttctcaggt gtttctagac ccccgactcc gcaagagtga 78; ggatgatggg agctggtcat gggagctact tatggttgga caccatcttc taaaggcttt 84; tgccctactc agcccaacct agacctgtag atttccctct cctgcttagg agtatggagt 90; gggctgggcc tccctttgcc agccttgagt tatctttaac tgacttctgt ccactctgga 96; gagcagtgag gaattaatct tgcttttgct tgtcctttgg cctttcactt ctgccttctg L02; ttgagaatta tcaccatgac cata ccgtatagag agccaaggta cagccgttag L08; agactatcta attgagcccc tacattttgt agttaaggaa aactgaggcc taaatgtgac L14; caaaccaaca tcca gtcccttctt ggaacctaaa ttgaactgcc aagtactgcg L20; agag accctttatt ggccttacag tgggccattc atttctatag gcaaagaaag L26- ctctagacag attggaatag gaaatggata tttgcctttt agctacaccc ctttgtctgt L32; cttcctcatt ttgttccttt ttttttccct aaaggggagt ccct cctcataagg tattagggac ttgtgtcaca tctctctgga gttttctatt atctgaaagc aataagctct ttggtcttct ggct acacctcaat gtattctttc actagttgag gagtagaaga ggatgaccag ctagactccc tatt ccttgcttag acattacagg ttatgctttg agatctcttt attgaaatta aaccctgagc caccgtgtcc ttgtagagca cagagtagag aacaactggc L68; agctttgaaa aaacaccatg aagaagaaat tcat aagaaggaga ttgagcgtct L74; gcagaaagaa attgagcgcc ataagcagaa gatcaaaatg ctaaaacatg aagt L80; gcacaccgtg tgccatagaa tggcacatgt cattgcccac ttctgtgtag acatggttct "86; ggtttaacta atatttgtct gtgtgctact aacagattat aataaattgt catcagtgaa 1921 ctgtgaaaaa aaaaaaaaaa aaaa Protein sequence (variant 3): NCBI Reference Seguence: NP_835498.1 LOCUS N 98 ACCESSION NP_835498 1 aart wlnggvrtm qargfgsdqs envdrgagsi reaggafgkr eqaeeeryfr VAM P3 Official Symbol: VAMP3 Official Name: vesicle-associated membrane protein 3 (cellubrevin) Gene ID: 9341 Organism: Homo s Other Aliases: CEB Other Designations: VAMP—3; cellubrevin; synaptobrevin-3; vesicle-associated membrane protein 3 Nucleotide seguence: NCBI Reference ce: NM_004781.3 LOCUS NM_004781 ION NM_004781 l agtgacgtct ttgccccgcg ccgcgccgtc ccacccatct ccctggcctc cggtcccaac 6L tctc tgctgaccct ctctcgtcgc cgcc gccgcagctg ccaaaatgtc 12L tacaggtcca actgctgcca ctggcagtaa actt cagcagacac aaaatcaagt l8; agatgaggtg ataa tgcgagttaa cgtggacaag gttctggaaa gagaccagaa 24; gctctctgag gacc gtgcagacgc actgcaggca ggcgcttctc aatttgaaac ; gagcgcagcc aagttgaaga ggaaatattg gtggaagaat tgcaagatgt gggcaatcgg 36; gattactgtt ctggttatct tcatcatcat catcatcgtg gtct cttcatgaag 42; aaccagcgga actcaaaact caag aaacctcttc aagacttttg aacc 48; tgctatatta tcaagcttac ctactgttat ctctaaaatt ttttttgtgt taatgtaaag 54; ttgaatttct aggaaacgtg cctttgtttt ttaatatgca ctccaaatta gaaggccggc 60; cccgtccaca ttttgcacag taca gatttacgta tgggctgatg aagaggcctt 66; cttaagttcc agagtgctat aatctagatg taatgttgtc actaattaat tgccattact 72; cccagttagt tacccttgtc atttggcatt attttcagaa ccacatttta aacctttggg 78; taatcagatt tccaacttat gccttccaga aaaaaacact actgcctaac acaaatctgt 84; gataacaaca ggctgtgcct tattttgata tgat tccctagaag agaaccctct 90; actttttgta agcactactg actctcgctg tatttaagat gctggtgaag agcttttgct 96; cttgcattag atttgaagat gtttacattg ttgttattgt tatgtatcac ttgctaaaaa L02; tattgtttta gata acctctttaa aaaaattttt ctat ggctatgacc L08: aaagcttcta ttttgccaaa aagttaaata ccgataaaat ggccttaagt cagttaaatt cagaaacgtg ccaaatggaa ctcaaggtgc cccttcagaa 1201 taccttgtgt gtgaaccttc tacatcttca taggcctttc ttccttttga gtag L26; acagtgtggc tccccttctg attcagtatt ttgcatgggg gttagagaag gactctgacc gtctcataaa agagttctac ccagcagttg gcagattatc agctgtggac L38; tccagcatgt ttctgataat tatgcaagca acaattctgt agcctcaagt aagaccacct 144; gtgaacttga tcattatctg gcccaaatat gaagataaac tataactttg gttt 150; tgta ttcacattct gcttcctaaa tcagttttct aaattatgcc tagg 156; cattggtcag gggtgaatgg ctcttttcac agagagtagc caaccagaga cctttgcttt 162; gatatcatca actgcagaga atgctgttga tgggaatgct ggaagcagaa actttgtcat 168; cggaaaaact tttcttgtat gcatgagact caacatcagg atccacagct taaagatggg 174; aattcaggta tgaaagaaaa caggcaagga ggcactgagg gagaaagaca cagactttat L80- cgctctgtgg gtta ctggaatatt ctaaaactct tgttcacatg tgac L86; ttataaagca gcaacagctg aggcgcacca ggacacagct tccatttctt ctgt tcgctgaaat gatttactgt tgaagagatg ccttgcggtg gcagctggca gtgttaggac attagtccac cttcagcgca gggtctctgg 204; ctga ctcagaaacc ttggtactcg ccccttggcc acagtgccca tgta 210; acccactggc tcctgcatta acccagaaat acctcgcttc tatctgtgca cttagctggg 216; aacttaccca ctgtaatcac ctaaataaag tgtttataaa aaaa aaaaaaaaaa 2221 aaaaa n seguence: NCBI Reference Seguence: NP_004772.1 LOCUS NP_004772 ACCESSION NP_004772 1 mstgptaatg snrrqutqn quevvdimr vnvdkvlerd qklselddra dalqagasqf 61 etsaaklkrk ywwknckmwa igitvlvifi iiiivwvvss VAPA Official Symbol: VAPA Official Name: VAMP (vesicle-associated membrane protein)-associated protein A, 33kDa Gene ID: 9218 Organism: Homo sapiens Other Aliases: VAP-33, VAP-A, VAP33, hVAP-33 Other Designations: 33 kDa VAMP-associated protein; VAMP-A; VAMP- associated protein A; vesicle-associated membrane protein-associated protein A Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_003574.5 LOCUS NM_003574 ACCESSION NM_003574 l agaagctccc ggccgggggc gcgcacgtag gcacgcagag gccgtcacgt gggtcgccga 6; caag tgcgcgtggc cgtggcggct ggtgtggggt tgagtcagtt gtgggacccg 12; gagctgctga cccagcgggt ggcccaccga accggtgaca cagg ggct 18L cgggagccgc gagcctggcc tcgtcctaga gctcggccga gccgtcgccg ccgtcgtccc 24L ccgcccccag tcagcaaacc gccgccgcgg gcgcgccccc gctctgcgct gtctctccga ; tggcgtccgc ctcaggggcc atggcgaagc acgagcagat cctggtcctc gatccgccca 36; cagacctcaa attcaaaggc cccttcacag atgtagtcac tacaaatctt cgaa 42; atccatcgga tagaaaagtg tgtttcaaag tgaagactac agcacctcgc cggtactgtg 48; tgaggcccaa cagtggaatt attgacccag ggtcaactgt gactgtttca gtaatgctac 54; ttga ctatgatccg aatgaaaaga gtaaacacaa gtttatggta cagacaattt 60; ttgctccacc ttca gatatggaag ctgtgtggaa agaggcaaaa cctgatgaat 66; taatggattc caaattgaga tgcgtatttg ccaa tgaaaatgat aaattgggta 72; taactccacc agggaatgct ccgactgtca cttcaatgag cagcatcaac aacacagttg 78; caacacctgc cagttatcac acgaaggatg accccagggg actcagtgtg ttgaaacagg 84; agaaacagaa gaatgatatg agca aagctgttcc actgaatgca tctaagcaag 90; atggacctat gccaaaacca cacagtgttt atga taccgaaaca aggaaactaa 96; tggaagagtg taaaagactt cagggagaaa tgatgaagct atcagaagaa aatcggcacc L02; tgagagatga aggtttaagg ctcagaaagg tagcacattc acct ggatcaacct L08; caactgcatc cttcagagat aatgtcacca gtcctcttcc ttcacttctt attg L14; cagccatttt cattggattc tttctaggga aattcatctt gtagagtgaa gcatgcagag L20; ttct tttttttttt ttctcttgac cagaaaaaga tttgtttacc taccatttca L26; ttggtagtat ggcccacggt gaccattttt ttgtgtgtac agcgtcatat aggctttgcc L32; tttaatgatc tcttacggtt agaaaacaca ataaaaacaa actgttcggc L38; gttgtatatt accagatcat cactagcaga ttgc acattgagtc ctttatgaaa L44; ttcataaata aagaattgtt ctttctttgt ggttttaata agagttcaag tcag L50; agtcttgtaa atgttatttt aataatccct ttaaatttta tctgttgctg cttg L56; gatt tatttagatt gctaatccca ctcattcagg aaatgccaag aggtattcct L62; tggggaaatg gtgcctctta cagtgtaaat ttttcctcct ttacctttgc catg L68; gcagaatttt tcttatccct tgtgaggcag ttgttgactg agtttttcat ccttacaatc L74; ctgtcccatg aaca aaaa taaaactgtt aacagattct tgctcgatag L80; cttgtttgtg gtgt tattagaggg aactccacta tatatggtca cttgaaatta L86; tgatgcaaag gtttctcttg cattgaaacc ctcttggata ttacagtatt tgaa L92; agtcctaatt ctgttaagga aaggagttga ttaaatttta aggtaccact ggtattttgg L98: ataa tcagtttgtt ttcaagataa tagaaaataa ggtccatgag aatagaagtt 204; tttc agtgagttga tgtgtacagc atggctgtgc tccatctgat ttaccccatt 210; cttaagttct gagagtatgt tctcaaggaa gatttaactc tctttggttt taaattactt 216; tttaaccagc ctaataaata agtcttacta cttttcataa tatttcataa tagttaaaag 222; taggtgtttt tttcgtgctc aatttggcac tcaaaataat gttcattatg gaagtttggt 228; aatactgagc aagcctgtgg aattttcttt atgaaaaatg attttagcct atgt 234; tgtg aaacacattt tcagtataag tatgcgttac agggtttgat actttcctgc 240; acttaggttt gtcctattct tcatttattc atactaggat agaaaatttt ggaatcagaa 246; tcca gtgtttagct acatacaatc tagtacaagt gaatttttat tcttaaacat 252; aggtgtgttg gctctttttt taaaagatgc gctctacctg aaaaggaaat tggattttag 258; aactggatgt ggtgcagtga agtattttag gcccaggtct gtgtacacat tttatagaag 264; aatgaagtac tctgaagtat tgcc ttttcatttc gttt tgaatttgtc 270; agatcacaca tatattgtgt tattgggcgc tgtggtatct tttataaaac ctcttgcttg 276; tgtgcaaaag ttcctaaaag gaaacacaag taatgcctat ccattactag catgctatgc 282; tgcatgcttt actgccattg ctgtatgctt cttt gtaaaaatcc ccctctcccc 288; ttttctggta actggaaaag catgctaaaa atagtcttat attttcaccc cataaatgca 294; gaatcagtaa ttccttggct tctt atataatcaa tggt ggtaaatacc 300; aagtttggta tctcatagct atcttttttt aaagaaatta agttcttgaa aatttagcca 306: aatcccgttt tatgggaatg agaa ttcattttgt tcagcccctt tgttctatgg 3121 ttgagaaatc tgaggcctta cgaaggttaa gagaactttc cccgtgtctc aggt 318; agaggcagag ctggaactag atatctggtc tgttgactct agctcagtgt cttctggtaa 324; ctgttgaaaa ttgtcttagt ttgagagatg gctgaaataa tgaacataaa atgctattta 330; taataacaag tgaa atttcttatt gtaagactac taccggctta ctgttgaata 336; gtttggttat agtgtttagg ctagaaatgc ctcccacatt ggtaataaac attacaaaat 342; acaatgtatt tttaggtagg cattttataa ttat gccatggttg cttttgagat 348; agattgtagt agca tctttaaaat gtatgtgggc ttaactgttg ttcatatcag 354; gagatgctct gattgtatag gtgagactct gtttctgtta tttttaattg ctgtatgaaa 360; tgtgatcaga ttattttact accaacagtt atagtttgaa agtccaactg tattaattga 366; ctgataatat gataatatag agattaaatt gtttgtcttc tata tgtttagaag 372; cttt gtctgcctgc ttacttgtat atgtaagcat gagggaaata tgct 378; aatactgaaa tcaa gtaactaagg ccttgagttc atatgtgaca ctgaatgcac 384; tagcttcctt cgttctataa ctaatgtacc tccc ccattcttat atttacaaga 390; agctaagtca ttatgttctg agtgtgtggt atgttccctt aaaaaaaaat gacacttgga 396; agaaaaatgt atgaaattca gaaattccga tcaaagaaaa gtaattcttt cttttttttt 402; ttgagacaga gtcttgcttt gttgcccagg ctggagggca gtggtgtgat ctcacctcac 408; tgcagcttcc gcctcctggg ttcaagtgat tctcatggct cagccgcctg agtagctggg 414; attacaggtg tgagccaaca agcccggcta atttttgtat ttttagtaga gacaaggttt 420; caccatgttg gtcaggctgg gctcaaactc ctgaatccgc ctgcctcggc ctcccaaagt 426; gctgggatta tgag cacc cagccaagaa aaataatact cttaaatact 432; tagatgttca cctaaagttg tttg gtatgggaat tacttttgaa ctgtaatctt 438; tcagattaca ccactttgaa aacaagtttt aacagtaggg taaaaatata gtttttgagg 444; gtattcccaa cttgtgatct tctaccactt tagagacatt caagtaatag ttttcttaga 450; gctttgcaca ttcctattca ctgagatttt ttca cctttattcg agggaaggat 456; caatgcttat taccatttgg aaaaacgaag atcagaaggt aaatgatctt tattttctag 462; ctttaaaggg aaattaaacc attcatgaat taaa aatgtgaagt gtccttttcc 468; ttttcacaat acaaaaaaaa tttcaacaga ttgtgtggtt ttta tatcctgtta 474; agcattaata gctaatcact gggacttgaa ttctgatggc tctc ttgcttagtg 480; agtt aactattttt gaag tgagaacagc tgattttcat gccacgtttc 486; atagccccac ttttggtaga ctaccaccac gcttcttcgc gtaagcagtg gcatcttggg 4921 aatgaatgcc cagccgctcg tgggttggtg caaagaagta taaacatata tcactaagga 498; aagt ttgtcttgcc cttctgacac agtgtgtgca cttcaggcaa tttttggaaa 504; atataaaaaa ttccaaattc tgcctttcag cagcatcaat tgctaggaac atttcattca 510; tttccctgta atattaatgt tctttaagca taatcactaa ttataagttg tatcctattt 516; ttttccagct taatttctgt ggtttattga aaaccaagta taaatgtgac taaaagcatt 522; ttgctttgtt tttatagtta actttcttaa ggttatggac attttataat gtaacatttg 528; attggcctgg cctcttgaca attcccttct agttatgcat atcctcctgt tgccacattt 534; cttgttttaa aactcagttt cttgttttcc agttgttgct atgtataaca cccatcttga 540; aagagagtat ataggaagtt attcagataa tagt agtgatattc aactatagca 546; gtaccttaac tcatgatgag cttaggaaca taaaagataa ttgttgcttg aatagcaccc 552; ccagagatac tgacctaatt ggtctggggt ggagatctgg catggtagtt tttttcaagc 558; tccaatcatc ggccagacag tatg taggttttta aaag gcagatatga 564; agtagattta attaagactt gacttcagca atacagggga aata cttatttttc 570; tttaaactgc aggagtcact gttaggtatt gcttaaaaaa aattgcataa aagctttgct 576; gtta ggattgctgg aataccacta aagatttttg acttgtgaat aaatgagctg 582; tcatcgcaaa aaggcgattt gagaaatgtg ggcttcagta ttaattgcca ttttgctgac 588; acccagtgta cctacctacc tgagaaattt attttgtcca tcatgtattt ctcaaagcaa 594; gttt tcaagtataa tgtcgttttc aacatgctta ttacttagtt cagc 600; tcatttcatc atcattgata acttgtgaaa tacttatctc catcctatgg aataggggag 606; acgggtttag acaggttcaa ttagctcaag tctacacagc tgaagtagca gagaaagtgg 612; gatctagatg gtctgatcct agtgatctac catatgaagg acatagtttg tgtcctggtc 618; caagtcaaat attgactcct cacaaacagt aagtatggca attttgtgat gcctttgatt 624; ccactttaca tggagtacta ttatttgtga aatgtcttta agatttttgg tcttaaattt 630; actg ctttccccct ttatctccca gaaaattgag aagaagtaaa ctcctgccca 636; ctaacaatct cagtccgtga acaaaaccaa catgaacatt cctaaacaag agtgtgtgtt 642; actctaagaa gaaggctata gaatttatgg aaatggctta ctac aagactggag 648; aacagaatgt gactggcctt ttctaatggt cctttaagat ttaatgatta aagcaagagt 654; tttttataat tgactttgtg gtctaaattc ctgt ttataattct acaaagaaca 660; aaaattgtta tgtactatag gcacttaaga accctgagga aaaataatac aatgtgtgtg 666; tgtgagagag agagtgagtt actgacattg ttccaaaaaa aaaaaaaaaa aaaa 6721 tgtggagggt tgaaatggta tgga atcttttgta ttttcgagca attc 678; ctattcttgt ttcaaataga ttag gaattacagt tgtggggagc aaactttctt 684; ttttgtgctg ttttaattca atat ttgt atataatatg tagataaata 690; tatgagggta ttaagctact ttgaattaaa gata tatttcacat gaaaacaaat 6961 acaaacgaga atcaaaataa agttttgcaa agta Protein seguence (variant 1): NCBI Reference Seguence: NP_003565.4 LOCUS N P_003565 ACCESSION NP_003565 1 masasgamak heqilvldpp tdlkfkgpft dvvttnlklr vcfk vkttaprryc 61 vrpnsgiidp gstvtvsvml qpfdydpnek uti fappntsdme avwkeakpde 121 lmdsklrcvf empnendklg itppgnaptv nntv atpasyhtkd dprglsvlkq 181 equndmeps kavplnaskq dgpmpkphsv slndtetrkl meeckrlqge mmklseenrh 241 rlrk vahsdkpgst stasfrdnvt splpsllvvi aaifigfflg kfil Nucleotide seguence nt 2): NCBI Reference Seguence: NM_194434.2 LOCUS NM_1 94434 ACCESSION NM_194434 1 agaagctccc ggccgggggc gcgcacgtag gcacgcagag acgt gggtcgccga 61 ggctcgcaag tgcgcgtggc Cgtggcggct ggtgtggggt tgagtcagtt gtgggacccg 121 gagctgctga cccagcgggt ggcccaccga accggtgaca cagg cgttagggct 181 cgggagccgc ggcc tcgtcctaga gctcggccga gccgtcgccg ccgtcgtccc 241 ccgcccccag tcagcaaacc gccgccgcgg gcgcgccccc gctctgcgct gtctctccga 301 tggcgtccgc ggcc atggcgaagc acgagcagat cctggtcctc gatccgccca 361 cagacctcaa attcaaaggc cccttcacag atgtagtcac tacaaatctt aaattgcgaa 421 atccatcgga tagaaaagtg tgtttcaaag tgaagactac agcacctcgc cggtactgtg 481 tgaggcccaa cagtggaatt attgacccag ggtcaactgt gactgtttca gtaatgctac 541 agccctttga ctatgatccg aatgaaaaga gtaaacacaa gtttatggta cagacaattt 601 ttgctccacc aaacacttca gatatggaag ctgtgtggaa agaggcaaaa cctgatgaat 661 taatggattc gaga tgcgtatttg aaatgcccaa tgaaaatgat aaattgaatg 721 atatggaacc tagcaaagct gttccactga ctaa gcaagatgga cctatgccaa 781 aaccacacag tgtttcactt aatgataccg aaacaaggaa actaatggaa gagtgtaaaa 841 gacttcaggg agaaatgatg aagctatcag aagaaaatcg gcacctgaga gatgaaggtt 901 taaggctcag agca cattcggata aacctggatc aacctcaact gcatccttca 96; gagataatgt caccagtcct cttccttcac ttcttgttgt aattgcagcc attttcattg L02; gattctttct agggaaattc atcttgtaga gtgaagcatg cagagtgctg tttctttttt L08; tttttttctc ttgaccagaa aaagatttgt ttacctacca tggt agtatggccc L14; acggtgacca tttttttgtg tgtacagcgt catataggct ttgcctttaa tgatctctta L20; cggttagaaa acacaataaa aacaaactgt tcggctactg gacaggttgt atattaccag L26; atcatcacta gcagatgtca gttgcacatt ttta tgaaattcat tttgtggttt taataagagt attg ttcagagtct taaa ttttatctgt tgctgttacc tcttgaaata tgatttattt L44; agattgctaa tcccactcat tcaggaaatg ccaagaggta ttccttgggg L50; tcttacagtg taaatttttc ctcctttacc tttgctaata tcatggcaga atttttctta L56; tcccttgtga tgtt gactgagttt ttcatcctta caatcctgtc aaaa aaaaataaaa ctgttaacag attcttgctc gatagcttgt ttgtgtctgt L68; cgtgttatta gagggaactc cactatatat ggtcacttga aattatgatg tttc L74; tcttgcattg aaaccctctt ggatattaca gtatttttaa gtcc taattctgtt L80; aaggaaagga gttgattaaa ttttaaggta ccactggtat agat tataatcagt L86; ttgttttcaa gataatagaa aataaggtcc atgagaatag aagttatgtg atttcagtga L92; gttgatgtgt acagcatggc tgtgctccat ctgatttacc ccattcttaa gttctgagag L98; tatgttctca aggaagattt aactctcttt ggttttaaat tactttttaa ccagcctaat 204; aaataagtct tactactttt attt cataatagtt aaaagtaggt gtttttttcg 210; tgctcaattt ggcactcaaa ataatgttca ttatggaagt ttggtaatac tgagcaagcc 216; tgtggaattt tctttatgaa aaatgatttt agcctttgca aacc atgtgaaaca 222; cattttcagt ataagtatgc gttacagggt ttgatacttt cctgcactta ggtttgtcct 228; attcttcatt tact aggatagaaa gaat cagaaaatag atccagtgtt 234; tagctacata caatctagta caagtgaatt tttattctta aacataggtg tgttggctct 240: aaaa gatgcgctct acctgaaaag gaaattggat actg gatgtggtgc 2461 agtgaagtat tttaggccca ggtctgtgta cacattttat agaagaatga agtactctga 252; agtattttgg ttgccttttc atttcaactg tgttttgaat ttgtcagatc acacatatat 258; tgtgttattg ggcgctgtgg tatcttttat aaaacctctt gcttgtgtgc aaaagttcct 264; aaaaggaaac acaagtaatg cctatccatt actagcatgc tatgctgcat gctttactgc 270; cattgctgta tgctttactg tctttgtaaa aatccccctc tccccttttc tggtaactgg 276; aaaagcatgc taaaaatagt cttatatttt caccccataa atgcagaatc agtaattcct 282; tggcttaaag tata atcaatatta ttggtggtaa ataccaagtt tggtatctca 288; tagctatctt tttttaaaga aattaagttc ttgaaaattt agccaaatcc cgttttatgg 294; gaatgctctt tagaattcat tttgttcagc ccctttgttc tatggttgag aaatctgagg 300; gaag gttaagagaa ctttccccgt gtctcacagg gagg cagagctgga 306; actagatatc tggtctgttg actctagctc agtgtcttct ggtaactgtt gaaaattgtc 312; ttagtttgag agatggctga aataatgaac ataaaatgct atttataata acaagtatat 318; gtgaaatttc taag actactaccg gcttactgtt gaatagtttg gtgt 324; ttaggctaga aatgcctccc acattggtaa taaacattac aaaatacaat gtatttttag 330; gtaggcattt tataaaatgc attatgccat ggttgctttt gagatagatt gtagtctggg 336; tagcatcttt aaaatgtatg tgggcttaac tgttgttcat atcaggagat gctctgattg 342; tataggtgag actctgtttc tgttattttt aattgctgta tgaaatgtga tcagattatt 348; ttactaccaa cagttatagt gtcc aactgtatta tgat aatatgataa 354; gatt aaattgtttg tcttcattcc ttatatgttt agaagttttt gctttgtctg 360; cctgcttact tgtatatgta agcatgaggg aaatacactg ttgctaatac tgaaattaca 366; atcaagtaac taaggccttg agttcatatg tgacactgaa tgcactagct tccttcgttc 372; tataactaat gtaccttaac ttcccccatt cttatattta caagaagcta agtcattatg 378; ttctgagtgt gtggtatgtt cccttaaaaa aaaatgacac ttggaagaaa aatgtatgaa 384; attcagaaat tccgatcaaa taat tttt tttttttgag acagagtctt 390; gctttgttgc ccaggctgga gggcagtggt gtgatctcac ctcactgcag cttccgcctc 396; ctgggttcaa gtgattctca tggctcagcc gcctgagtag ctgggattac aggtgtgagc 402; caacaagccc tttt ttta gtagagacaa ggtttcacca tgttggtcag 408; gctgggctca aactcctgaa tccgcctgcc tcggcctccc aaagtgctgg gattacaggt 414; gtgagctgcc gcacccagcc aagaaaaata atactcttaa atacttagat gttcacctaa 420; agttgatatt atttggtatg actt tgta atctttcaga ttacaccact 4261 ttgaaaacaa gttttaacag aaaa atatagtttt tgagggtatt cccaacttgt 432; gatcttctac cactttagag acattcaagt aatagttttc ttagagcttt tcct 438; attcactgag attttaaaaa tttcaccttt attcgaggga aggatcaatg acca 444; tttggaaaaa Cgaagatcag aaggtaaatg atctttattt ttta aagggaaatt 450; aaaccattca tgaataaact ttaaaaatgt gaagtgtcct tttccttttc acaatacaaa 456; aaaaatttca acagattgtg tggtttgtgc atttatatcc tgttaagcat ctaa 462; tcactgggac ttgaattctg atggcagata gtctcttgct tagtgagatg acta 468; ttttttagta ggaagtgaga acagctgatt ttcatgccac gtttcatagc cccacttttg 474; tacc accacgcttc ttcgcgtaag cagtggcatc ttgggaatga atgcccagcc 480; gctcgtgggt tggtgcaaag aagtataaac atatatcact aaggaaaaag aaagtttgtc 486; ttgcccttct gacacagtgt gtgcacttca ggcaattttt ggaaaatata tcca 492: aattctgcct ttcagcagca gcta ggaacatttc ttcc ctgtaatatt 498; cttt aagcataatc actaattata agttgtatcc tatttttttc cagcttaatt 504; tctgtggttt attgaaaacc aagtataaat gtgactaaaa tgct ttgtttttat 510; agttaacttt cttaaggtta tggacatttt ataatgtaac atttgattgg cctggcctct 516; ttcc cttctagtta tgcatatcct cctgttgcca catttcttgt tttaaaactc 522; agtttcttgt tttccagttg ttgctatgta taacacccat cttgaaagag tagg 528; ttca gataactttt gtagtagtga tattcaacta tagcagtacc ttaactcatg 534; atgagcttag gaacataaaa gataattgtt atag cacccccaga gatactgacc 540; taattggtct ggggtggaga tctggcatgg tagttttttt caagctccaa tcatcggcca 546; gacagttgct ttatgtaggt ttttaaatgc caaaggcaga tatgaagtag atttaattaa 552; gacttgactt cagcaataca ggggaactta aaatacttat ttttctttaa actgcaggag 558; tcactgttag gtattgctta aaaaaaattg cataaaagct ttgcttgtca agttaggatt 564; gctggaatac cactaaagat ttttgacttg tgaataaatg agctgtcatc gcaaaaaggc 570; gatttgagaa atgtgggctt cagtattaat tgccattttg ctgacaccca gtgtacctac 576; ctacctgaga aatttatttt gtccatcatg tatttctcaa agcaaaaggt ggttttcaag 582; tataatgtcg ttttcaacat gcttattact tagttttacg tcagctcatt tcatcatcat 588; tgataacttg tgaaatactt atctccatcc tatggaatag gggagacggg tttagacagg 594; ttcaattagc tcaagtctac acagctgaag tagcagagaa agtgggatct agatggtctg 600: atcctagtga tctaccatat gaaggacata gtttgtgtcc tggtccaagt ttga 6061 ctcctcacaa acagtaagta tggcaatttt gtgatgcctt tgattccact ttacatggag 612; tactattatt tgtgaaatgt ctttaagatt tttggtctta aatttttgaa gactgctttc 618; tatc tcccagaaaa ttgagaagaa gtaaactcct gcccactaac aatctcagtc 624; caaa accaacatga acattcctaa acaagagtgt ctct aagaagaagg 630; ctatagaatt tatggaaatg gcttatgtaa cctacaagac tggagaacag aatgtgactg 6361 gccttttcta atggtccttt aagatttaat agca agagtttttt ataattgact 6421 ttgtggtcta aattcttgat actgtttata attctacaaa gaacaaaaat tgttatgtac 6481 tataggcact taagaaccct gaggaaaaat aatacaatgt gtgtgtgtga gagagagagt 6541 gagttactga cattgttcca aaaaaaaaaa aaaaaaaaaa aaaaatgtgg agggttgaaa 6601 tggtaaggaa ttggaatctt ttgtattttc gagcaataag aattcctatt tcaa 6661 atagaggttt gttaggaatt acagttgtgg ggagcaaact ttcttttttg tgctgtttta 672- attcaaaatg tatatcctta attgtatata atatgtagat aaatatatga gggtattaag 6781 ctactttgaa ttaaatttaa ggatatattt cacatgaaaa caaatacaaa cgagaatcaa 6841 aataaagttt tgcaaagta Protein seguence nt 2): NCBI Reference ce: NP_919415.2 LOCUS NP_919415 ION N P_919415 1 masasgamak ldpp tdlkfkgpft dvvttnlklr npsdrkvcfk vkttaprryc 61 iidp gstvtvsvml qpfdydpnek skhkfquti fappntsdme avwkeakpde 121 lmdsklrcvf empnendkln dmepskavpl naskngpmp kphsvslndt etrklmeeck 181 rlqgemmkls eenrhlrdeg lrlrkvahsd kpgststasf rdnvtsplps llvviaaifi 241 gfflgkfil HNRNPD Official Symbol: HNRNPD al Name: heterogeneous nuclear cleoprotein D (AU-rich t RNA binding protein 1, 37kDa Gene ID: 3184 Organism: Homo sapiens Other Aliases: AUF1, AUF1A, Other Designations: ARE-binding n AUFI, type A; heterogeneous nuclear ribonucleoprotein DO; hnRNP D0 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_031370.2 LOCUS NM_031370 ACCESSION NM_031370 l cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6; gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12; gagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18; cgcgcgcgcc ttccctgtct cttc gcgaggtaga gcgggcgcgc ggcagcggcg 24L gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc ; ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 36L gcaacggcgg cggtaggcgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 421 cagggggcag cggcggcggc gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48; ggcaccgaag ggggcagcgc cgagtcggag ggggcgaaga ttgacgccag cgag 54; gaggatgaag gccattcaaa ctcctcccca cgacactctg aagcagcgac ggcacagcgg 60; gaagaatgga aaatgtttat aggaggcctt gaca ctacaaagaa agatctgaag 66; gactactttt ccaaatttgg tgaagttgta gactgcactc taga tcctatcaca 72; gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat cggagagtgt agataaggtc 78; atggatcaaa aagaacataa attgaatggg aaggtgattg atcctaaaag ggccaaagcc 84; atgaaaacaa cggt taaaaaaatt tttgttggtg gcctttctcc agatacacct 90; gaagagaaaa agta ctttggtggt tttggtgagg tggaatccat agagctcccc 96; atggacaaca agaccaataa gaggcgtggg ttctgcttta ttacctttaa agaa L02; ccagtgaaga agataatgga aaagaaatac cacaatgttg gtcttagtaa atgtgaaata L08; aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc tagaggagga L14; tttgcaggaa gagctcgtgg tggt ggccccagtc aaaactggaa ccagggatat L20; agtaactatt ggaatcaagg ctatggcaac tatggatata acagccaagg ttacggtggt L26; tatggaggat atgactacac tggttacaac aactactatg gatatggtga ttatagcaac L32; cagcagagtg gttatgggaa cagg ggtc atcaaaatag acca L38; tactaaatta ttccatttgc aacttatccc caacaggtgg tgaagcagta ttttccaatt L44; tgaagattca tttgaaggtg gctcctgcca cctgctaata gcagttcaaa ctaaattttt 150; tgtatcaagt ccctgaatgg aagtatgacg ttgggtccct ctgaagttta ctcattaaaa gaaatttgct gttt ttaa ttgctatgct tcagaatcaa L62; tttgtgtttt tttc ccccagtatt gtagagcaag gtta aaagcccagt 168; gtgacagtgt catgatgtag tagtgtctta ctggtttttt aataaatcct tttgtataaa 174; aatgtattgg ctcttttatc atcagaatag gaaaaattgt catggattca agttattaaa 180; agcataagtt tggaagacag gcttgccgaa attgaggaca tgattaaaat tgcagtgaag 186; tttgaaatgt ttttagcaaa atctaatttt tgccataatg tgtcctccct gtccaaattg 192; ggaatgactt aatgtcaatt tgtttgttgg taat aatacttcct tatgtagcca 198; ttaagattta tatgaatatt ttcccaaatg cccagttttt gcttaatatg gctt 204: tttagaacaa atctggataa atgtgcaaaa cttt gcacagatag ttaatgtttt 210; atgcttccat taaataaaaa ggacttaaaa aatt ataatagaaa tgcggctagt 216; tcagagagat agct gtggtggact tcatagatga gtgt tgagggagga 2221 ttaaagaaat atataccgtg tttatgtgtg tgtgctt Protein ce (variant 1 ): NCBI Reference Seguence: NP_112738.1 LOCUS N P_1 12738 ACCESSION N P_112738 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedeghs nssprhseaa wkmf igglswdttk kdlkdyfskf 121 gevvdctlkl dpitgrsrgf gfvlfkeses vdkvmqueh klngkvidpk rakamktkep 181 ggls pdtpeekire yfggfgeves ielpmdnktn krrgfcfitf keeepvkkim 241 ekkyhnvgls kceikvamsk equqqqug srggfagrar sqnw nqusnywnq 301 gygnygynsq gyggyggydy tgynnyygyg dyanqsgyg kvsrrgghqn sykpy Nucleotide seguence (variant 2): NCBI Reference ce: NM_031369.2 LOCUS NM_031 369 ACCESSION NM_031369 l cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 61 gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 121 gcgcgagagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 181 cgcgcgcgcc gtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 241 gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc ; ggcggagaca ctagcactat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 36; gcgg ngtaggcgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 42; cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac tgga 48; ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54; gaggatgaag ggaaaatgtt tataggaggc cttagctggg acactacaaa gaaagatctg 60; aaggactact tttccaaatt tggtgaagtt gtagactgca ctctgaagtt tatc 66L acagggcgat caaggggttt tggctttgtg ctatttaaag aatcggagag tgtagataag 72L gtcatggatc aaca gaat gggaaggtga ttgatcctaa aagggccaaa 78L gccatgaaaa caaaagagcc aaaa atttttgttg gtggcctttc tccagataca 84L cctgaagaga aaataaggga gtactttggt ggttttggtg aggtggaatc catagagctc 90; cccatggaca acaagaccaa gcgt gggttctgct ttattacctt taaggaagaa 96; gaaccagtga agaagataat ggaaaagaaa taccacaatg ttggtcttag taaatgtgaa L02; ataaaagtag cgaa ggaacaatat cagcaacagc gggg ggatttgcag ctcg tggaagaggt ggtggcccca gtcaaaactg gaaccaggga L14; tatagtaact attggaatca aggctatggc aactatggat ataacagcca aggttacggt L20; ggttatggag gatatgacta cactggttac aacaactact atggatatgg tgattatagc L26; aaccagcaga gtggttatgg gaaggtatcc aggcgaggtg gtcatcaaaa caaa L32; ccatactaaa ttattccatt tgcaacttat ccccaacagg tggtgaagca gtattttcca L38; atttgaagat tcatttgaag cctg gcta gttc aaactaaatt L44; ttttgtatca agtccctgaa tggaagtatg acgttgggtc cctctgaagt ttaattctga L50; gttctcatta aaagaaattt attg ttttatttct taattgctat gcttcagaat L56; caatttgtgt tttatgccct ttcccccagt attgtagagc aagtcttgtg ttaaaagccc L62; agtgtgacag tgtcatgatg tagtagtgtc ttactggttt tttaataaat ccttttgtat L68; aaaaatgtat tggctctttt atcatcagaa taggaaaaat tgtcatggat tcaagttatt L74; aaaagcataa gtttggaaga caggcttgcc gaaattgagg acatgattaa aattgcagtg L80; gaaa tgtttttagc aaaatctaat ttttgccata atgtgtcctc L86; ttgggaatga cttaatgtca atttgtttgt tggttgtttt aataatactt ccttatgtag 192; ccattaagat ttatatgaat attttcccaa atgcccagtt tttgcttaat atgtattgtg L98; ctttttagaa caaatctgga taaatgtgca aaagtacccc tttgcacaga tagttaatgt 204; tttatgcttc cattaaataa aaaggactta aaatctgtta attataatag aaatgcggct 2101 agttcagaga taga gctgtggtgg acttcataga caag tgttgaggga 2161 ggattaaaga aatatatacc gtgtttatgt gtgtgtgctt Protein seguence (variant 2): NCBI Reference Seguence: NP_112737.1 LOCUS NP_112?3? ACCESSION NP_112?3? l mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedegkm figglswdtt kkdlkdyfsk fgevvdctlk 1dpitgrsrg 121 fgfvlfkese svdkvmque hklngkvidp krakamktke pvkkifvggl ekir 181 eyfggfgeve sielpmdnkt nkrrgfcfit fkeeepvkki mekkyhnvgl skceikvams 241 kequqqqu gsrggfagra rgrgggpsqn wnqusnywn qugnygyns quggyggyd 301 ytgynnyygy sgy gkvsrrgghq nsykpy Nucleotide seguence (variant 3): NCBI Reference Seguence: NM_002138.3 LOCUS 138 ACCESSION 138 1 cttccgtcgg ccattttagg gcgg cggcgccatt aaagcgagga gaga 6" gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12; gcgcgagagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18; cgcgcgcgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 24; gggattactt tgctgctagt tcgc ggcagcggcg ggtgtagtct cggcggcagc ; ggcggagaca ctagcactat gtcggaggag cagttcggcg gggc ggcggcagcg 36; gcaacggcgg gcgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 42; cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48L ggcaccgaag ggggcagcgc cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54L gaggatgaag gccattcaaa ctcctcccca tctg aagcagcgac ggcacagcgg 60; tgga aaatgtttat aggaggcctt agctgggaca ctacaaagaa agatctgaag 661 gactactttt ccaaatttgg tgaagttgta gactgcactc tgaagttaga tcctatcaca 72; gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat cggagagtgt agataaggtc 78; atggatcaaa aagaacataa attgaatggg aaggtgattg atcctaaaag ggccaaagcc 84; atgaaaacaa aagagccggt taaaaaaatt tttgttggtg gcctttctcc agatacacct 90; gaagagaaaa taagggagta tggt tttggtgagg tggaatccat cccc 96; atggacaaca ataa gaggcgtggg ttctgcttta ttacctttaa ggaagaagaa L02; aaga agataatgga aaagaaatac cacaatgttg gtcttagtaa aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc L14; tttgcaggaa gagctcgtgg aagaggtggt gaccagcaga gtggttatgg gaaggtatcc L20; aggcgaggtg aaaa tagctacaaa ccatactaaa ttattccatt ccccaacagg tggtgaagca gtattttcca atttgaagat tcatttgaag gtggctcctg L32; ccacctgcta atagcagttc aaactaaatt ttttgtatca agtccctgaa acgttgggtc cctctgaagt ttaattctga gttctcatta aaagaaattt ttttatttct taattgctat gcttcagaat caatttgtgt tttatgccct ttcccccagt L50; attgtagagc aagtcttgtg ttaaaagccc agtgtgacag tgtcatgatg tagtagtgtc L56; ttactggttt tttaataaat gtat aaaaatgtat tggctctttt atcatcagaa L62; taggaaaaat ggat tatt aaaagcataa gtttggaaga caggcttgcc L68; gaaattgagg acatgattaa aattgcagtg aagtttgaaa tagc aaaatctaat L74; ttttgccata cctc cctgtccaaa ttgggaatga cttaatgtca ttgt L80; tggttgtttt aataatactt ccttatgtag ccattaagat ttatatgaat attttcccaa L86; atgcccagtt taat atgtattgtg ctttttagaa caaatctgga taaatgtgca L92; aaagtacccc tttgcacaga tagttaatgt tttatgcttc cattaaataa aaaggactta L98; aaatctgtta attataatag aaatgcggct gaga gatttttaga gctgtggtgg 204; acttcataga tgaattcaag tgttgaggga ggattaaaga tacc gtgtttatgt 210; gtgtgtgctt Pnneh1seguence(vafiant3r NCBI Reference Seguence: NP_002129.2 LOCUS N P_0021 29 ACCESSION NP_002129 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedeghs seaa taqreewkmf igglswdttk kdlkdyfskf 121 gevvdctlkl dpitgrsrgf gfvlfkeses vdkvmqueh klngkvidpk rakamktkep 181 vkkifvggls pdtpeekire eves ielpmdnktn krrgfcfitf keeepvkkim 241 ekkyhnvgls kceikvamsk equqqqug srggfagrar grggdqqsgy gkvsrrgghq 301 nsykpy Nucleotide seguence (variant 4): NCBI Reference Seguence: NM_001003810.1 LOCUS NM_001003810 ACCESSION 003810 1 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6; gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12; gagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18; cgcgcgcgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 24; gggattactt tagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc ; ggcggagaca ctagcactat gtcggaggag cagttcggcg gggc ggcggcagcg 36; gcaacggcgg cggtaggcgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 42; cagggggcag ngngngC cgga gccgggaccg ggggcggaac cgcgtctgga 48; ggcaccgaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54; gaggatgaag ggaaaatgtt aggc cttagctggg acactacaaa gaaagatctg 60; tact tttccaaatt tggtgaagtt gtagactgca ctctgaagtt agatcctatc 66; acagggcgat caaggggttt tggctttgtg ctatttaaag aatcggagag tgtagataag 72; gtcatggatc aaaaagaaca taaattgaat gggaaggtga ttgatcctaa aagggccaaa 78; aaaa caaaagagcc ggttaaaaaa atttttgttg tttc tccagataca 84; cctgaagaga aaataaggga gtactttggt ggttttggtg aggtggaatc gctc 90; cccatggaca acaagaccaa taagaggcgt gggttctgct ttattacctt taaggaagaa 96L gaaccagtga agaagataat ggaaaagaaa taccacaatg ttggtcttag taaatgtgaa 102; ataaaagtag ccatgtcgaa ggaacaatat cagcaacagc aacagtgggg atctagagga 108; ggatttgcag gaagagctcg tggaagaggt ggtgaccagc gtta tgggaaggta 114; tccaggcgag gtggtcatca aaatagctac aaaccatact aaattattcc atttgcaact 120; tatccccaac aggtggtgaa gcagtatttt ccaatttgaa gattcatttg aaggtggctc 126; ctgccacctg ctaatagcag ctaa attttttgta tcaagtccct gaatggaagt 132; atgacgttgg gtccctctga agtttaattc tgagttctca ttaaaagaaa tttgctttca 138; ttgttttatt tcttaattgc tatgcttcag aatcaatttg tgttttatgc cctttccccc 144; agtattgtag agcaagtctt gtgttaaaag gtga cagtgtcatg atgtagtagt 150; gtcttactgg ttttttaata aatccttttg tataaaaatg tattggctct tttatcatca 1561 gaataggaaa catg agtt attaaaagca taagtttgga agacaggctt L62; gccgaaattg aggacatgat taaaattgca gtgaagtttg tttt agcaaaatct L68; aatttttgcc ataatgtgtc ctccctgtcc aaattgggaa tgacttaatg tcaatttgtt L741 tgttggttgt tttaataata cttccttatg tagccattaa gatttatatg aatattttcc 180; caaatgccca gtttttgctt aatatgtatt gtgcttttta gaacaaatct gcaaaagtac ccctttgcac agatagttaa tgttttatgc ttccattaaa ttaaaatctg ttaattataa tagaaatgcg gctagttcag agagattttt agagctgtgg 198; tggacttcat agatgaattc aagtgttgag ggaggattaa atat accgtgttta 204; tgtgtgtgtg ctt Protein sequence (variant 4): NCBI Reference ce: NP_001003810.1 LOCUS N P_001003810 ACCESSION NP_001003810 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida egkm figglswdtt kkdlkdyfsk fgevvdctlk ldpitgrsrg 121 fgfvlfkese que hklngkvidp krakamktke pvkkifvggl spdtpeekir 181 eyfggfgeve sielpmdnkt nkrrgfcfit fkeeepvkki mekkyhnvgl vams 241 kequqqqu gsrggfagra rgrggdqqsg rggh qnsykpy al Symbol: BSG Official Name: basigin (Ok blood group) Gene ID: basigin (Ok blood group) Organism: Homo s Other Aliases: UNOGSOSXPR021383, 5F7, CD147, EMMPRIN, M6, OK, TCSF Other Designations: CD147 antigen; OK blood group antigen; basigin; collagenase stimulatory factor; extracellular matrix metalloproteinase inducer; leukocyte activation antigen M6; tumor erived enase stimulatory factor Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001728.3 LOCUS NM_001 728 ACCESSION NM_001728 l gcga gcgtgtgcgc gcgtgcgcag gcggggcgac cggcgtcccc cgcc 6; ccgcccccga gatgacgccg tgcgtgcgcg cgcccggtcc gcgcctccgc cgctttttat 12; agcggccgcg ggcggcggcg gcagcggttg gaggttgtag gaccggcgag gaataggaat l8; catggcggct gcgctgttcg tgctgctggg attcgcgctg ctgggcaccc acggagcctc 24; ngggctgcc ggcttcgtcc aggcgccgct gtcccagcag aggtgggtgg ggggcagtgt ; ggagctgcac gccg tgggcagccc ggtgcccgag atccagtggt ggtttgaagg 36; gcagggtccc aacgacacct gctcccagct ctgggacggc gcccggctgg accgcgtcca 42; catccacgcc acctaccacc agcacgcggc cagcaccatc gaca cgctcgtgga 48; ggaggacacg ggcacttacg agtgccgggc cagcaacgac ccggatcgca accacctgac 54; ccgggcgccc agggtcaagt gcgc agtc gtgctagtcc tggaacccgg 60; cacagtcttc actaccgtag aagaccttgg ctccaagata ctcctcacct gctccttgaa 66; tgacagcgcc acagaggtca cagggcaccg ctggctgaag gggggcgtgg tgctgaagga 72; ggacgcgctg cccggccaga aaacggagtt caaggtggac tccgacgacc agtggggaga 78; gtactcctgc gtcttcctcc ccat gggcacggcc aacatccagc ggcc 84; tcccagagtg aaggctgtga agtcgtcaga acacatcaac gagggggaga cggccatgct 90; caag tcagagtccg tgccacctgt cactgactgg gcctggtaca agatcactga 96L ggac aaggccctca tgaacggctc cgagagcagg ttcttcgtga gttcctcgca 102; gggccggtca gagctacaca ttgagaacct gaacatggag cccg gccagtaccg 108; gtgcaacggc accagctcca agggctccga ccaggccatc atcacgctcc gcgtgcgcag 114; ccacctggcc gccctctggc ccttcctggg catcgtggct gaggtgctgg tgctggtcac 120; catcatcttc atctacgaga agcgccggaa gcccgaggac gtcctggatg atgacgacgc 126; cggctctgca cccctgaaga gcagcgggca gcaccagaat gacaaaggca agaacgtccg 132; ccagaggaac tcttcctgag gcaggtggcc cgaggacgct ccctgctcca cgtctgcgcc 138; gccgccggag tccactccca gtgcttgcaa gattccaagt tctcacctct taaagaaaac 144; ccaccccgta gattcccatc atacacttcc ttctttttta aaaaagttgg gttttctcca 150; ttcaggattc tgttccttag gtttttttcc ttctgaagtg gaga gcccgggagc 1561 tgctgccctg cgtc tgtggctttc agcctctggg tcat ggccgggtgg L62; gcggcacagc cttctccact agtc agtgccaggt cctt tgtggaaagt L68; cacaggtcac acgaggggcc ccgtgtcctg cctgtctgaa gccaatgctg tctggttgcg 1741 ccatttttgt gcttttatgt ttaattttat gagggccacg ggtctgtgtt cctc ttggccacag aggactcact tgcccacacc gagggcgacc 186; ccgtcacagc ctcaagtcac tcccaagccc tgtc tgtgcatccg ggggcagctc 1921 tggagggggt ttgctgggga actggcgcca tcgccgggac tccagaaccg cagaagcctc 198; cccagctcac ccctggagga cggccggctc tctatagcac tcac gtgggaaccc 204; ccctcccacc caccgccaca ataaagatcg cccccacctc caccctcaaa aaaaaaaaaa 2101 aaaaaaa Protein ce (variant 1): NCBI nce Seguence: NP_001719.2 LOCUS NP_001719 ACCESSION N P_001719 1 maaalfvllg fallgthgas apl sqqrwvggsv elhceavgsp vpeiqufeg 61 qundtcsql drvh ihatthhaa stisidtlve edtgtyecra sndpdrnhlt 121 raprvkwvra qavvlvlepg tvfttvedlg skilltcsln dsatevtghr wlkggvvlke 181 dalpqutef kvdsddqwge yscvflpepm lhgp prvkavksse hinegetaml 241 vcksesvppv tdwawykitd sedkalmngs esrffvsssq grselhienl nmeadpgqyr 301 cngtsskgsd qaiitlrvrs pflg ivaevlvlvt iifiyekrrk pedvldddda 361 gsaplkssgq hqndkgknvr qrnss Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_198589.2 LOCUS NM_1 98589 ACCESSION NM_198589 l gtacatgcga gcgtgtgcgc gcag gcggggcgac cggcgtcccc ggcgctcgcc 6; ccgcccccga gatgacgccg tgcgtgcgcg cgcccggtcc gcgcctccgc cgctttttat 12; agcggccgcg ggcggcggcg gcagcggttg gaggttgtag gaccggcgag gaataggaat 18; catggcggct gcgctgttcg tgctgctggg attcgcgctg ctgggcaccc acggagcctc 24; ngggctgcc gtct ccgt agaagacctt ggctccaaga tactcctcac 30L ctgctccttg aatgacagcg ccacagaggt cacagggcac cgctggctga aggggggcgt 36L ggtgctgaag gaggacgcgc tgcccggcca gaaaacggag ttcaaggtgg actccgacga 42L ccagtggqga tcct gcgtcttcct ccccgagccc atgggcacgg ccaacatcca 48L gctccacggg cctcccagag tgaaggctgt gaagtcgtca gaacacatca ggga 54; gacggccatg ctggtctgca agtcagagtc cgtgccacct gtcactgact gggcctggta 60; caagatcact gactctgagg acaaggccct catgaacggc tccgagagca ggttcttcgt 66; ctcg cggt cagagctaca cattgagaac ctgaacatgg aggccgaccc 72; cggccagtac ngtgcaacg gctc caagggctcc gaccaggcca tcatcacgct 78; ccgcgtgcgc agccacctgg ccgccctctg gcccttcctg ggcatcgtgg ctgaggtgct 84; ggtc accatcatct tcatctacga gaagcgccgg aagcccgagg acgtcctgga 90; tgatgacgac gccggctctg cacccctgaa gagcagcggg caga atgacaaagg 96; caagaacgtc cgccagagga actcttcctg aggcaggtgg cccgaggacg ctccctgctc L02; cacgtctgcg ccgccgccgg agtccactcc cagtgcttgc aagattccaa gttctcacct L08; cttaaagaaa acccaccccg tagattccca tcatacactt ccttcttttt taaaaaagtt L14; gggttttctc ggat tctgttcctt aggttttttt gaag tgtttcacga L20; gagcccggga gctgctgccc tgcggccccg tctgtggctt tctg ggtctgagtc L26; atggccgggt gggcggcaca gccttctcca ctggccggag tcagtgccag gtccttgccc L32; tttgtggaaa gtcacaggtc acacgagggg ccccgtgtcc tgcctgtctg aagccaatgc L38; tgtctggttg cgccattttt gtgcttttat tttt atgagggcca cctcagggac gactctgacc tcttggccac agaggactca ccgagggcga ccccgtcaca gcctcaagtc actcccaagc cccctccttg cgggggcagc gggg tggg gaactggcgc catcgccggg cgcagaagcc tccccagctc acccctggag gacggccggc tctctatagc accagggctc L68; acgtgggaac ccccctccca cccaccgcca caataaagat C&CC tccaccctca 1741 aaaaaaaaaa aaaaaaaaa Protein seguence (variant 2): NCBI nce Seguence: NP_940991.1 LOCUS N P_940991 ACCESSION 991 1 maaalfvllg fallgthgas gaagtvfttv illt cslndsatev tghrwlkggv 61 lpgq ktefkvdsdd qwgeyscvfl pepmgtaniq 1hgpprvkav kssehinege 121 tamlvckses vppvtdwawy kitdsedkal mngsesrffv sssqgrselh ienlnmeadp 181 gqyrcngtss kgsdqaiitl aalw pflgivaevl vlvtiifiye krrkpedvld 241 dddagsaplk ssgqhqndkg knvrqrnss Nucleotide seguence nt 3): NCBI Reference Seguence: NM_198590.2 LOCUS NM_1 98590 ACCESSION NM_198590 1 cccgccagtg tagccacatt cctgcccctt tccagttagc ccttcgcgtt cggcttagtc 61 tgcggtcctc ttgcattgcg actccgagtt taacttccaa cacacacttt caacctccaa 121 gagacgcccc cacctgtgtc gccccaatag cgacttttct caccgtggtc gccgcggaac 181 ttcaagggtc cttcctaccc gcgttgctga gggt ttacgcgtca cctcgggcgg 241 gacccgatcc tccgctcctg ccac aatgaagcag tcggacgcgt ctccccaaga 301 aagccggcac agtcttcact accgtagaag accttggctc caagatactc ctcacctgct 361 ccttgaatga cagcgccaca gaggtcacag ggcaccgctg gctgaagggg ggcgtggtgc 421 tgaaggagga cgcgctgccc ggccagaaaa cggagttcaa ggtggactcc gacgaccagt 481 ggggagagta ctcctgcgtc cccg agcccatggg cacggccaac atccagctcc 541 acgggcctcc cagagtgaag gctgtgaagt cgtcagaaca cgag ggggagacgg 601 ccatgctggt ctgcaagtca gtgc cacctgtcac tgactgggcc aaga 661 tcactgactc tgaggacaag gccctcatga ccga gagcaggttc ttcgtgagtt 721 cctcgcaggg ccggtcagag ctacacattg agaacctgaa catggaggcc gaccccggcc 781 agtaccggtg caacggcacc agctccaagg gctccgacca ggccatcatc acgctccgcg 841 tgcgcagcca cctggccgcc ctctggccct tcctgggcat cgtggctgag gtgctggtgc 901 tggtcaccat catcttcatc tacgagaagc gccggaagcc cgaggacgtc ctggatgatg 96; acgacgccgg ctctgcaccc ctgaagagca gcgggcagca ccagaatgac aaaggcaaga 102; acgtccgcca gaggaactct ggca ggtggcccga ggacgctccc tgctccacgt 108; cgcc gccggagtcc agtg cttgcaagat tccaagttct cacctcttaa 114; agaaaaccca agat tcccatcata cttc aaaa aagttgggtt 120; ttctccattc aggattctgt tccttaggtt tttttccttc tgaagtgttt cacgagagcc 126; cgggagctgc tgccctgcgg ccccgtctgt ggctttcagc ctctgggtct gagtcatggc L321 cgggtgggcg gcacagcctt ctccactggc cggagtcagt gccaggtcct tgccctttgt L38; ggaaagtcac aggtcacacg aggggccccg tgtcctgcct gtctgaagcc aatgctgtct L44; ggttgcgcca tttttgtgct tttatgttta attttatgag ggccacgggt ctgtgttcga 1501 ctcagcctca gggacgactc tgacctcttg gccacagagg actcacttgc ccacaccgag 156; ggcgaccccg tcacagcctc aagtcactcc caagccccct ccttgtctgt gcagctctgg agggggtttg ctggggaact ggcgccatcg ccgggactcc agaaccgcag 168; aagcctcccc agctcacccc tggaggacgg ccggctctct atagcaccag ggctcacgtg 174; cccc tcccacccac cgccacaata aagatcgccc ccacctccac cctcaaaaaa 1801 aaaaaaaaaa aaaa Protein seguence (variant 3): NCBI Reference Seguence: NP_940992.1 LOCUS N P_940992 ACCESSION 992 1 mgtaniqlhg pprvkavkss ehinegetam lvcksesvpp vtdwawykit dsedkalmng 61 vsss qgrselhien lnmeadpgqy rcngtsskgs dqaiitlrvr shlaalwpfl 121 givaevlvlv ekrr kpedvldddd agsaplkssg qhqndkgknv rqrnss Nucleotide seguence nt 4): NCBI Reference Seguence: NM_198591.2 LOCUS NM_1 98591 ACCESSION NM_198591 l cccgccagtg tagccacatt cctgcccctt tccagttagc ccttcgcgtt agtc 61 tgcggtcctc ttgcattgcg actccgagtt taacttccaa cacacacttt caacctccaa 121 gagacgcccc cacctgtgtc gccccaatag cgacttttct caccgtggtc gccgcggaac 18; ttcaagggtc cttcctaccc gcgttgctga gagtctgggt ttacgcgtca cctcgggcgg 24; gacccgatcc tccgctcctg aggcccccac aatgaagcag tcggacgcgt ctccccaaga ; ggac tccgacgacc agtggggaga ctgc gtcttcctcc ccgagcccat 36; gggcacggcc aacatccagc tccacgggcc tcccagagtg aaggctgtga agtcgtcaga 42; acacatcaac gagggggaga cggccatgct ggtctgcaag tcagagtccg tgccacctgt 48; cactgactgg gcctggtaca ctga ctctgaggac aaggccctca tgaacggctc 54L cgagagcagg ttcttcgtga gttcctcgca gggccggtca gagctacaca ttgagaacct 60; gaacatggag gccgaccccg accg cggc accagctcca agggctccga 66L ccaggccatc atcacgctcc gcgtgcgcag ccacctggcc gccctctggc ccttcctggg 721 catcgtggct gaggtgctgg tcac catcatcttc atctacgaga agcgccggaa 78; gcccgaggac gatg atgacgacgc cggctctgca cccctgaaga gcagcgggca 84; gcaccagaat gacaaaggca agaacgtccg ccagaggaac tcttcctgag gcaggtggcc 90; cgaggacgct ccctgctcca cgtctgcgcc gccgccggag tccactccca gcaa 96; gattccaagt tctcacctct taaagaaaac ccaccccgta gattcccatc atacacttcc L02; ttctttttta aaaaagttgg gttttctcca attc tgttccttag gtttttttcc L08; ttctgaagtg tttcacgaga gcccgggagc tgctgccctg cggccccgtc tgtggctttc L14; agcctctggg tctgagtcat ggccgggtgg gcggcacagc cttctccact ggccggagtc L20; agtgccaggt ccttgccctt tgtggaaagt cacaggtcac acgaggggcc ccgtgtcctg L26; cctgtctgaa gccaatgctg tctggttgcg ccatttttgt gcttttatgt ttat L32; gagggccacg ggtctgtgtt Cgactcagcc tcagggacga ctctgacctc ttggccacag L38; aggactcact tgcccacacc gacc ccgtcacagc ctcaagtcac tcccaagccc L44; cctccttgtc tgtgcatccg ggggcagctc tggagggggt ttgctgggga actggcgcca L50; tcgccgggac tccagaaccg cagaagcctc cccagctcac ccctggagga gctc L56; tctatagcac cagggctcac gtgggaaccc ccctcccacc caccgccaca ataaagatcg L62; cccccacctc caccctcaaa aaaaaaaaaa aaaaaaa n seguence (variant 4): NCBI nce Seguence: NP_940993.1 LOCUS N P_940993 ACCESSION NP_940993 1 mkqsdaque rvdsddqwge yscvflpepm gtaniqlhgp prvkavksse hinegetaml 61 vcksesvppv tdwawykitd sedkalmngs esrffvsssq ienl nmeadpgqyr 121 cngtsskgsd qaiitlrvrs hlaalwpflg ivaevlvlvt iifiyekrrk pedvldddda 181 gsaplkssgq hqndkgknvr qrnss Official Symbol: EIF4A3 Official Name: eukaryotic translation initiation factor 4A3 m: 9775 Organism: Homo sapiens Other Aliases: DDX48, NMP265, NUK34, || Other Designations: ATP-dependent RNA helicase DDX48; ATP-dependent RNA helicase elF4A-3; DEAD (Asp-Glu-Ala—Asp) box polypeptide 48; DEAD box protein 48; NMP 265; eIF-4A-lll; lll; eukaryotic initiation factor 4A-III; otic initiation factor e NUK—34; eukaryotic translation initiation factor 4A; hNMP 265; nuclear matrix protein 265 Nucleotide seguence: NCBI Reference Seguence: NM_014740.3 LOCUS NM_014740 ACCESSION NM_014740 1 acgcacgcac gtctctcgct ttcgcatact taaggcgtct gttctcggca gcggcacagc 61 gaggtcggca gcggcacagc ggca gcggcacagc gaggtcggca gcggcacagc 12; gaggtcggca gcggcagcga ggtcggcagc ggcacagcga ggtcggcagc ggcagcgagg 18; tcggcagcgg cgcgcgctgt gctcttccgc tgaa cgac cacggccacg 24; atggcgacct nggctcggc gcgaaagcgg ctgctcaaag aggaagacat gactaaagtg - gaga ccagcgagga tgtg acccccacgt tcgacaccat gggcctgcgg 36L gaggacctgc tgcggggcat ctacgcttac ggttttgaaa aaccatcagc aatccagcaa 42; atca agcagatcat caaagggaga gatgtcatcg cacagtctca gtccggcaca 481 ggaaaaacag ccaccttcag tatctcagtc ctccagtgtt ttca ggttcgtgaa 541 actcaagctt tgatcttggc tcccacaaga gagttggctg tgcagatcca gaaggggctg 60; cttgctctcg gtgactacat gaatgtccag tgccatgcct gcattggagg caccaatgtt 66; ggcgaggaca agct ggattacgga cagcatgttg tcgcgggcac gcgt 72; gtttttgata tgattcgtcg cagaagccta aggacacgtg ctatcaaaat gttggttttg 78; gatgaagctg atgaaatgtt gaataaaggt ttcaaagagc agatttacga tgtatacagg 84; tacctgcctc cagccacaca ggtggttctc atcagtgcca cgctgccaca tctg 90; gagatgacca acaagttcat gaccgaccca atccgcatct tggtgaaacg tgatgaattg 96; actctggaag gcatcaagca atttttcgtg gcagtggaga gggaagagtg gaaatttgac 1021 actctgtgtg acctctacga gacc atcactcagg cggtcatctt aaaagaaagg tggactggct gacggagaaa atgagggaag ccaacttcac atgcatgqag acatgcccca gaaagagcgg atca tgaaggagtt gccagccgag tgcttatttc tacagatgtc tgggccaggg ggttggatgt tccctcatca ttaactatga tctccctaat aacagagaat tgtacataca tggg 132; agatcaggtc gatacggccg gaagggtgtg gccattaact ttgtaaagaa cgcatcctca gagatatcga gcagtactat tccactcaga ttgatgagat gccgatgaac 144; gttgctgatc gaag cagcagatca tgag ggagactgtt cacctgctgt 150; gtactcctgt ttggaagtat ttagatccag attctactta atggggttta tatggacttt 156; cttctcataa atggcctgcc cttc ctttgaagag gatatgggga ttctgctctc 162; ttttcttatt tacatgtaaa attg ttctaagtct ttttcattaa aaatttaaaa 168; cttttcccat aaactctata cttctaaggt gccaccacct tctctagtaa ctta Protein seguence: NCBI Reference Seguence: 555.1 LOCUS N P_055555 ACCESSION NP_055555 1 mattatmats gsarkrllke edmtkvefet seevdvtptf dtmglredll gfek 61 psaiqqraik qiikgrdvia qsqsgtgkta tfsisvlqcl diquetqal ilaptrelav 121 qiqullalg dymnvqchac iggtnvgedi rkldygqhvv agtpgrvfdm irrrslrtra 181 ikmlvldead emlnkgfkeq iydvyrylpp atqvvlisat 1phei1emtn kfmtdpiril 241 vkrdeltleg ikqffvaver eewkfdtlcd lydtltitqa vifcntkrkv dwltekmrea 301 nftvssmhgd mqueresim kefrsgasrv listdearg ldquvslii nydlpnnrel 361 yihrigrsgr ygrkgvainf anddirilr stqi dempmnvadl i MTHFD1 Official Symbol: MTHFD1 Official Name: methylenetetrahydrofolate dehydrogenase (NADP+ dependent) 1, methenyltetrahydroiolate cyclohydrolase, formyltetrahydrofolate synthetase Gene ID: 4522 Organism: Homo sapiens Other Aliases: MTHFC, MTHFD Other Designations: 5,10-methylenetetrahydrofolate ogenase, 5,10- methylenetetrahydrofolate cyclohydrolase, 10-formyltetrahydrofolate synthetase; Ctetrahydrofolate synthase, cytoplasmic; Cl-THF synthase; cytoplasmic C tetrahydrofolate synthase Nucleotide seguence: NCBI Reference Seguence: NM_005956.3 LOCUS NM_005956 ACCESSION NM_005956 l aattacggcc ggattccgga gtcctttcca gctccctctt cggccgggtt cgaa 6; ggcg gaac tggctctttc tttccgccaa ccgc cagccattca 12; tcaccgattt tcttcatctt cccctccctc ttccgtcccg cagtccccga cctgttagct 18; tagt taagggactc gggtccttcc gaactgcgca tgcgccaccg cgtctgcagg 24; gggagaagcg ggcaggggcg caggcgcagt agtgtgatcc cctggccagt gcac ; gtgggttggg ttgtcctgct tggctgcgga gggagtggaa cctcgatatt ggtggtgtcc 36; atcgtgggca gcggactaat aaaggccatg gcgccagcag aaatcctgaa cgggaaggag 42; atctccgcgc aaataagggc gagactgaaa aatcaagtca ctcagttgaa ggagcaagta 48; cctggtttca gcct ggcaatatta ggca atga ttccaatctt 54; tatataaatg tgaagctgaa ggctgctgaa gagattggga tcaaagccac tcacattaag 60; ttaccaagaa caaccacaga atctgaggtg atgaagtaca ttacatcttt gaatgaagac 66L tctactgtac atgggttctt agtgcagcta gatt cagagaattc cattaacact 72L gaagaagtga tcaatgctat tgcacccgag aaggatgtgg atggattgac tagcatcaat 78; gctgggaaac gagg tgacctcaat gactgtttca ttccttgtac gcctaaggga 84; gaac tcatcaaaga gacaggggtg ccgattgccg gaaggcatgc tgtggtggtt 90; gggcgcagta aaatagttgg ggccccgatg catgacttgc ttctgtggaa caatgccaca 96; gtgaccacct gccactccaa gactgcccat ctggatgagg aggtaaataa aggtgacatc L02; ctggtggttg caactggtca gcctgaaatg gttaaagggg agtggatcaa acctggggca L08; atagtcatcg actgtggaat caattatgtc ccagatgata aaaaaccaaa tgggagaaaa L14; ggtg atgtggcata cgacgaggcc aaagagaggg cgagcttcat cactcctgtt L20; ggcg tagggcccat gacagttgca atgctcatgc agagcacagt agagagtgcc L26; aagcgtttcc tggagaaatt taagccagga aagtggatga ttcagtataa caaccttaac L32; ctcaagacac ctgttccaag tgacattgat atatcacgat cttgtaaacc catt L38; ggtaagctgg ctcgagaaat tggtctgctg tctgaagagg tagaattata aaggccaaag ttctgctgtc agcactagaa cgcctgaagc accggcctga gtggtggtga ctggaataac tccaacaccc ctgggagaag ggaaaagcac gggctagtgc aagcccttgg tgcccatctc taccagaatg tctttgcgtg ccttctcagg cctt tggaataaaa ggtggcgctg caggaggcgg gtcattccta tggaagagtt taatctccac ggtg atgc gctaataacc tcgttgctgc tgat gctcggatat ttcatgaact gacccagaca L80; gacaaggctc tctttaatcg tttggtgcca tcagtaaatg gagtgagaag gttctctgac L86; atccaaatcc gaaggttaaa gagactaggc aaga ctac cacactgaca L92; gaga taaacagatt tgcaagattg gatc cagaaaccat aacttggcaa L98; agagtgttgg ataccaatga tagattcctg aggaagatca cgattggaca ggctccaacg 204; gagaagggtc acacacggac ggcccagttt tctg tggccagtga aattatggct 210; gtcctggctc tcaccacttc tctagaagac gaga gactgggcaa aatggtggtg 216; gcatccagta agaaaggaga gcccgtcagt gccgaagatc tgggggtgag tggtgcactg 222; acagtgctta tgaaggacgc aatcaagccc aatctcatgc agacactgga gggcactcca 228; gtgtttgtcc atgctggccc caac atcgcacatg gcaattcctc catcattgca 234; gaccggatcg cactcaagct tgttggccca gaagggtttg cgga agcaggattt 240: ggagcagaca ttggaatgga aaagtttttt aacatcaaat gccggtattc cggcctctgc 2461 ccccacgtgg tggtgcttgt tgccactgtc agggctctca agatgcacgg gggcggcccc 252; acggtcactg ctggactgcc caag gcttacatac aggagaacct ggagctggtt 258; gaaaaaggct tcagtaactt gaagaaacaa aatg ccagaatgtt tggaattcca 264; gtagtagtgg ccgtgaatgc attcaagacg gatacagagt ctgagctgga cctcatcagc 270; cgcctttcca gagaacatgg ggcttttgat gccgtgaagt gcactcactg ggcagaaggg 276; ggcaagggtg ccttagccct ggctcaggcc agag cagcacaagc cagc 282; ctcc tttatgacct caagctccca gttgaggata aaatcaggat cattgcacag 288; aagatctatg gagcagatga cattgaatta cttcccgaag ctcaacacaa agctgaagtc 294; tacacgaagc agggctttgg gaatctcccc atctgcatgg ctaaaacaca cttgtctttg 300; tctcacaacc cagagcaaaa aggtgtccct ttca ttctgcccat tcgcgacatc 306; cgcgccagcg ttggggctgg ttttctgtac cccttagtag gaacgatgag cacaatgcct 312; ggactcccca cccggccctg tgat ttgg accctgaaac agaacaggtg 318; aatggattat tctaaacaga tcaccatcca tcttcaagaa gctactttga aagtctggcc 324; agtgtctatt caggcccact gggagttagg aagtataagt aagccaagag aagtcagccc 3301 ctgcccagaa gatctgaaac taatagtagg ccca attt tcagccttaa 336; ttctcatcat gtataaatta atca tgcatgtctg ttag tgacgttcca 3421 cagaataaaa ggaaacaagt ttgccatcaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_005947.3 LOCUS N P_005947 ION NP_005947 l mapaeilngk eisaqirarl knqvtqlkeq vpgftprlai lqvgnrddsn lyinvklkaa 6" eeigikathi klprtttese vmkyitslne dstvhgflvq lpldsensin teevinaiap 12; ekdvdgltsi nagklargdl ndcfipctpk gcleliketg vpiagrhavv vgrskivgap l8; mhdlllwnna tvttchskta hldeevnkgd ilvvatgqpe mvkgewikpg aividcginy 24; vpddkkpngr kvvgdvayde akerasfitp Vngvgpmtv amlmqstves akrflekfkp ; gkwmiqynnl nlktpvpsdi disrsckpkp igklareigl lseevelyge lsal 36; erlkhrpdgk yvvvtgitpt plgegksttt iglvqalgah lyqnvfacvr qpsqutfgi 42; kggaagggys qvipmeefnl hltgdihait aannlvaaai darifheltq tdkalfanv 48L psvngvrrfs diqirrlkrl giektdpttl tdeeinrfar ldidpetitw qrvldtndrf 54L lrkitigqap tekghtrtaq fdisvaseim avlalttsle dmrerlgkmv vasskkgepv 60; vsga ltvlmkdaik pnlmqtlegt pvahagpfa niahgnssii adrialklvg 66L pegfvvteag fgadigmekf fnikcrysgl cphvvvlvat vralkmhggg ptvtaglpr 72; kayiqenlel nlkk qienarmfgi pvvvavnafk tdteseldli srlsrehgaf 78; hwae ggkgalalaq avqraaqaps sfqllydlkl pvedkiriia qkiygaddie 841 llpeaqhkae vytkqgfgnl picmakthls lshnpequv ptgfilpird irasvgagfl 901 yplvgtmstm pcfy didldpeteq vnglf EN02 OfibblSwnbdzENO2 Official Name: enolase 2 (gamma, neuronal) m: 2026 Organism: Homo s Other Aliases: NSE Other Designations: 2-phospho-D-glycerate hydro-Iyase; 2—phospho-D-glycerate hydrolyase; gamma-enolase; neural enolase; neuron ic gamma enolase; neuron-specific enolase; neurone-specific enolase Nucleotide ce: NCBI Reference Seguence: NM_001975.2 LOCUS NM_001 975 ACCESSION NM_001975 l acccgcgctc gtacgtgcgc ctccgccggc agctcctgac tcatcggggg ctccgggtca 6; catgcgcccg cgcggcccta taggcgcctc ccgc cgcccgggag ccgc 12; cgccgccact gccactcccg ctctctcagc gccgccgtcg ccaccgccac cgccaccgcc 18; actaccaccg tctgagtctg cagtcccgag atcccagcca tcatgtccat agagaagatc 24; tgggcccggg agatcctgga ctcccgcggg aaccccacag tggaggtgga tctctatact ; gccaaaggtc ttttccgggc tgcagtgccc agtggagcct ctacgggcat ggcc 36; ctggagctga gggatggaga caaacagcgt tacttaggca aaggtgtcct gaaggcagtg 42; gaccacatca actccaccat cgcgccagcc ctcatcagct tctc tgtggtggag 48L caagagaaac tggacaacct gatgctggag ttggatggga ctgagaacaa gttt 54L ggggccaatg ccatcctggg tgtgtctctg gccgtgtgta aggcaggggc agctgagcgg 60; gaactgcccc tgtatcgcca cattgctcag ctggccggga actcagacct catcctgcct 66; gtgccggcct tcaacgtgat caatggtggc tctcatgctg gcaacaagct ggccatgcag 72; atga tcctcccagt gggagctgag agctttcggg tgcg actaggtgca 78; tacc atacactcaa gggagtcatc aaggacaaat acggcaagga caat 84; gtgggggatg aaggtggctt tgcccccaat atcctggaga acagtgaagc cttggagctg 90; gtgaaggaag ccatcgacaa ggctggctac acggaaaaga tcgttattgg catggatgtt 96; tcag agttttatcg tgatggcaaa ttgg acttcaagtc tgat L02; ccttcccgat acatcactgg ggaccagctg ggggcactct accaggactt tgtcagggac L08; tatcctgtgg tctccattga attt gaccaggatg attgggctgc ctggtccaag L14; ttcacagcca atgtagggat ccagattgtg ggtgatgacc tgacagtgac caacccaaaa 120; cgtattgagc gggcagtgga agaaaaggcc tgcaactgtc tcaa atcggctcgg tcactgaagc catccaagcg tgcaagctgg cccaggagaa L321 gtcatggtga gtcatcgctc aggagagact gaggacacat tcattgctga cctggtggtg L38; gggctgtgca caggccagat caagactggt gccccgtgcc gttctgaacg tacaaccagc tcatgagaat tgaggaagag ctgggggatg aagctcgctt aacttccgta atcccagtgt gctgtgattc ctctgcttgc ctggagacgt ggaacctctg L56; cctc ctggaacctt gctgtcctga tctgtgatag ttcaccccct gagatcccct L62; gagccccagg gtgcccagaa cttccctgat tgacctgctc cgctgctcct tggcttacct L68; gacctcttgc tgtctctgct Cgccctcctt tctgtgccct actcattggg gttccgcact L74; ttccacttct tcctttctct ttctctcttc cctcagaaac tgtg aatgaggatt L80; attataaaag ggggtccgtg gaagaatgat ctgt gatgggagcg ttgg L86; tgtgctgagg agag ggaccatgtg tcacttgtgc cttg tcccacgtgt L92; cttccacttt gcatatgagc Cgtgaactgt gcatagtgct gggatggagg ggagtgttgg L98; gcatgtgatc acgcctggct aataaggctt tagtgtattt atttatttat ttattttatt 204; tgtttttcat catt aatcatttcc ccataactca atggcctaaa actggcctga 210; cttgggggaa cgatgtgtct gtatttcatg tggctgtaga tcccaagatg actggggtgg 216; gaggtcttgc tagaatggga agggtcatag aaagggcctt gacatcagtt cctttgtgtg 222; tactcactga agcctgcgtt ggtccagagc ggaggctgtg tgcctggggg agttttcctc 2281 tatacatctc tccccaaccc taggttccct gttcttcctc cagctgcacc agagcaacct 234; ctcactcccc atgccacgtt ttgc caccacctct gtggcattga aatgagcacc 240; tccattaaag tctgaatcag tgc Protein seguence: NCBI Reference Seguence: NP_001966.1 LOCUS NP_001966 ACCESSION N P_001966 1 msiekiware ildsrgnptv evdlytakgl sgas tgiyealelr dgqurylgk 6; gvlkavdhin stiapaliss glsvveqekl dnlmleldgt enkskfgana ilgvslavck 12; agaaerelpl yrhiaqlagn sdlilpvpaf nvinggshag nklamqefmi lpvgaesfrd 18; amrlgaevyh tlkgvikdky gkdatnvgde ggfapnilen sealelvkea idkagyteki 241 vigmdvaase fyrdgkydld fksptdpsry itgdqlgaly qdfvrdypvv siedpqudd 301 waawskftan vgiqivgddl tvtnpkrier aveekacncl llkvnqigsv teaiqackla 361 qengwgvmvs hrsgetedtf glct gqiktgapcr serlakynql mrieeelgde 42; arfaghnfrn psvl ATP5H Official Sym bol: ATP5H al Name: ATP synthase, H+ transporting, mitochondrial Fo complex, subunit d Gene ID: 10476 Organism: Homo sapiens Other Aliases: My032, ATPQ Other Designations: ATP synthase D chain, mitochondrial; ATP synthase subunit d, mitochondrial; ATP synthase, H+ transporting, ondrial F0 complex, subunit d; ATP synthase, H+ orting, mitochondrial F1 F0, t d; ATPase subunit d; My032 protein Nucleotide seguence nt 1): NCBI Reference Seguence: NM_006356.2 LOCUS 356 ACCESSION NM_006356. 1 tgacccactt cttg ctgcggagga ccgtgggcag ccagggtcgg tgaaggatcc 61 caaaatggct gggcgaaaac ttgctctaaa aaccattgac tgggtagctt ttgcagagat 121 cataccccag aaccaaaagg ccattgctag ttccctgaaa tcctggaatg agaccctcac 181 ctccaggttg gctgctttac ctgagaatcc accagctatc gactgggctt aggc 241 caatgtggcc ggct tggtggatga ctttgagaag aagtttaatg cgctgaaggt 301 gcca gaggataaat atactgccca ggtggatgcc gaagaaaaag tgaa 361 atcttgtgct gagtgggtgt ctctctcaaa ggccaggatt gtagaatatg agat 421 ggagaagatg aagaacttaa ttccatttga gacc gact tgaatgaagc 481 tttcccagaa accaaattag acaagaaaaa gtatccctat tggcctcacc aaccaattga 541 gaatttataa aattgagtcc aggaggaagc tctggccctt gtattacaca ttctggacat 601 taaaaataat aattatacag ttaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_006347.1 LOCUS N P_006347 ACCESSION NP_006347 1 magrklalkt idwvafaeii pqnqkaiass lkswnetlts rlaalpenpp aidwayykan 61 vakaglvddf lkvp vpedkytaqv daeekedvks caewvslska keme 121 kmknlipqu mtiedlneaf petkldkkky pywphqpien 1 Nucleotide seguence (variant 2): NCBI Reference ce: NM_001003785.1 LOCUS NM_001003785 ACCESSION NM_001003785 1 tgacccactt ccgttacttg ctgcggagga ccgtgggcag ccagggtcgg tgaaggatcc 61 caaaatggct gggcgaaaac ttgctctaaa aaccattgac tgggtagctt ttgcagagat 12; cataccccag aaccaaaagg ccattgctag gaaa aatg agaccctcac 18; ctccaggttg gctgctttac ctgagaatcc accagctatc gactgggctt aggc 24; caatgtggcc aaggctggct tggtggatga ctttgagaag aaat cttgtgctga ; gtgggtgtct ctctcaaagg ccaggattgt agaatatgag aaagagatgg agaagatgaa 36; gaacttaatt ccatttgatc agatgaccat tgaggacttg aatgaagctt tcccagaaac 42; caaattagac aagaaaaagt atccctattg gcctcaccaa ccaattgaga atttataaaa 48L ttgagtccag gaggaagctc tggcccttgt attacacatt ctggacatta aaaataataa 54L ttatacagtt aaaaaa Protein seguence (variant 2): NCBI Reference Seguence: NP_001003785.1 LOCUS N P_001003785 ACCESSION N P_001003785 1 magrklalkt idwvafaeii pqnqkaiass 1kswnet1ts enpp aidwayykan 61 vakaglvddf ekkvkscaew vslskarive yekemekmkn tie dlneafpetk 121 1dkkkypywp hqpienl Official Symbol: TRAP1 Official Name: TNF receptor-associated protein 1 W: 10131 Organism: Homo sapiens Other Aliases: HSP75, HSP90L Other Designations: HSP 75; TNFR-associated protein 1; ; heat shock protein 75 kDa, mitochondrial; tumor necrosis factor type 1 receptor associated protein; tumor necrosis factor type 1 receptor-associated protein Nucleotide seguence: NCBI Reference Seguence: NM_016292.2 LOCUS NM_01 6292 ION NM_016292 1 gaggaagccc cgccccgcgc agccccgtcc cgccccttcc catcgtgtac ggtcccgcgt 61 gcgc ctgg gagtacgaca tggcgcgcga gctgcgggcg ctgctgctgt 121 ggggccgccg cctgcggcct ttgctgcggg cgccggcgct cgtg ccgggaggaa 181 aaccaattct gtgtcctcgg aggaccacag cccagttggg ccccaggcga aacccagcct 241 ggagcttgca ggcaggacga ctgttcagca cgcagaccgc cgaggacaag gaggaacccc ; tgcactcgat tatcagcagc acagagagcg tgcagggttc cacttccaaa catgagttcc 36- aggccgagac aaagaagctt ttggacattg ttgcccggtc ctca gaaaaagagg 42; tgtttatacg ggagctgatc tccaatgcca gcgatgcctt ggaaaaactg cgtcacaaac 48L tggtgtctga cggccaagca ctgccagaaa tggagattca cttgcagacc aatgccgaga 54L ccat caccatccag gatactggta tcgggatgac acaggaagag ctggtgtcca 60; acctggggac gattgccaga tcggggtcaa aggccttcct ggatgctctg cagaaccagg 661 ctgaggccag gatc atcggccagt tggg tttctactca gctttcatgg 72; tggctgacag agtggaggtc tattcccgct cggcagcccc ggggagcctg ggttaccagt 78; ggctttcaga tggttctgga gaaa tcgccgaagc ttcgggagtt agaaccggga 84; caaaaatcat catccacctg gact gcaaggagtt ttccagcgag gcccgggtgc 90; gagatgtggt aacgaagtac agcaacttcg tcagcttccc cttgtacttg aatggaaggc 96; ggatgaacac cttgcaggcc atctggatga tggaccccaa ggatgtccgt gagtggcaac L02; atgaggagtt ctaccgctac gtcgcgcagg ctcacgacaa gccccgctac ataagacgga cgcaccgctc aacatccgca gcatcttcta cgtgcccgac ccatgtttga tgtgagccgg gagctgggct ccagcgttgc actgtacagc cgcaaagtcc L20; tcatccagac caaggccacg gacatcctgc ggct gcgcttcatc cgaggtgtgg 1261 tggacagtga ggacattccc ctgaacctca gccgggagct gctgcaggag agcgcactca L32; tcaggaaact ccgggacgtt ttacagcaga ggctgatcaa catt gaccagagta L38; aaaaagatgc tgagaagtat gcaaagtttt ttgaagatta cggcctgttc gcattgtgac cgccaccgag caggaggtca aggaggacat agcaaagctg ctgcgctacg L50; agtcctcggc gctgccctcc gggcagctaa ccagcctctc agaatacgcc atgc L56; gcac ccgcaacatc tactacctgt ccaa ccgtcacctg gcagagcact L62; caccctacta tgaggccatg aagaagaaag acacagaggt tctcttctgc cagt L68; ttgatgagct caccctgctg cgtg agtttgacaa gaagaagctg atctctgtgg L74; agacggacat agtcgtggat cactacaagg aggagaagtt cagg tccccagccg L80; ccgagtgcct atcagagaag gagacggagg agctcatggc ctggatgaga aatgtgctgg L86; ggtcgcgtgt cgtg accc tccgactgga cacccaccct gccatggtca L92; ccgtgctgga gatgggggct gcccgccact tcctgcgcat gcagcagctg gccaagaccc L98; agcg cgcacagctc ctgcagccca cgctggagat caaccccagg cacgcgctca 204; tcaagaagct gaatcagctg cgcgcaagcg agcctggcct ggctcagctg ctggtggatc 210; acga gaacgccatg attgctgctg gacttgttga cgaccctagg gccatggtgg 216; gccgcttgaa tgagctgctt gtcaaggccc tggagcgaca ctgacagcca gggggccaga 222; aggactgaca ccacagatga cagccccacc agct ttatttacct aaatttaaag 2281 gtatttctta acccgaaaaa aaaaaaaaaa Protein ce: NCBI Reference Seguence: NP_057376.2 LOCUS N P_057376 ACCESSION NP_057376 l marelralll wgrrlrpllr apalaavpgg kpilcprrtt aqlgprrnpa wslqagrlfs 6; tqtaedkeep ;hsiisstes vqgstskhef qaetkklldi varslyseke vfirelisna 12; sdaleklrhk ;vsdgqalpe meihlqtnae kgtitithg igmtqeelvs nlgtiarsgs 18; kafldalqnq iigq fgvgfysafm vadrvevysr saapgslqu gvfe 24; iaeasgvrtg tkiiihlksd earv rdvvtkysnf vsfplylngr ; mdpkdvrewq yvaq ahdkprytlh yktdaplnir sifyvpdmkp smfdvsre;g 36; ssvalysrkv liqtkatdil pkwlrfirgv vdsediplnl srellqesal irklrdvqu 42; rlikffidqs kkdaekyakf fedyglfmre givtateqev kediakllry essalpsgql 48; tslseyasrm ragtrniyyl capnrhlaeh spyyeamkkk dtevlfcfeq fdeltllh;r 54; lisv etdivvdhyk eekfedrspa aeclsekete elmawmrnvl gsrvtnvkvt 60; lrldthpamv tvlemgaarh flrmqqlakt qeeraqllqp tleinprhal ikklnqlras 66; epglaqllvd qiyenamiaa glvddpramv grlnellvka lerh SDHA Official Symbol: SDHA Official Name: succinate dehydrogenase x, subunit A, rotein (Fp) Gene ID: 6389 Organism: Homo sapiens Other Aliases: CMD1GG, FP, PGL5, SDH1, SDH2, SDHF Other Designations: rotein subunit of complex ll; succinate dehydrogenase [ubiquinone] flavoprotein subunit, mitochondrial; succinate dehydrogenase complex flavoprotein subunit Nucleotide seguence: NCBI Reference Seguence: NM_004168.2 % 168 ACCESSION NM_004168 l tccggcgtgg ggcg cggtatcccc cctcccccgc cagctcgacc ccggtgtggt 61 gcgcaggcgc agtctgcgca gggactggcg ggactgcgcg gcggcaacag cagacatgtc 12; gggggtccgg ggcctgtcgc ggctgctgag cgctcggcgc ctggcgctgg ccaaggcgtg 18; gccaacagtg ttgcaaacag gaacccgagg ttttcacttc actgttgatg ggaacaagag 24; ggcatctgct aaagtttcag attccatttc tgctcagtat ccagtagtgg atcatgaatt ; tgatgcagtg gtggtaggcg ctggaggggc aggcttgcga gctgcatttg gcctttctga 36; ggcagggttt aatacagcat gtgttaccaa gctgtttcct accaggtcac ttgc 42; ggga ggaatcaatg ctgctctggg gaacatggag gaggacaact ggaggtggca 48L tttctacgac accgtgaagg gctccgactg ggac caggatgcca tccactacat 54L gacggagcag gcccccgccg ccgtggtcga gctagaaaat tatggcatgc cgtttagcag 60; agat attt atcagcgtgc atttggtgga cagagcctca agtttggaaa 66L gggcgggcag gcccatcggt gctgctgtgt ggctgatcgg actggccact cgctattgca 72; caccttatat tctc tgcgatatga taccagctat tttgtggagt attttgcctt 78; ggatctcctg atggagaatg gggagtgccg tggtgtcatc gcactgtgca tagaggacgg 84; gtccatccat agag caaagaacac tgttgttgcc acaggaggct gcac 90; ctacttcagc tgcacgtctg cccacaccag cactggcgac ggcacggcca tgatcaccag 96; ggcaggcctt ccttgccagg acctagagtt tgttcagttc caccctacag gcatatatgg L02; tgctggttgt acgg aaggatgtcg tggagaggga ggcattctca ttaacagtca L08; aagg tttatggagc gatacgcccc gaag gacctggcgt atgt L14; ggtgtctcgg tccatgactc tggagatccg agaaggaaga ggcc ctgagaaaga L20; tcacgtctac ctgcagctgc accacctacc tccagagcag ctggccacgc ctgg L26; catttcagag acagccatga tcttcgctgg cgtggacgtc gagc ctgt L32; cctccccacc gtgcattata acatgggcgg cattcccacc aactacaagg ggcaggtcct L38; gaggcacgtg aatggccagg atcagattgt gcccggcctg tacgcctgtg gggaggccgc L44; ctgtgcctcg gtacatggtg ccaaccgcct cggggcaaac tcgctcttgg acctggttgt L50; ctttggtcgg gcatgtgccc tgagcatcga agagtcatgc aggcctggag ataaagtccc L56; tccaattaaa ccaaacgctg gggaagaatc tgtcatgaat cttgacaaat tgagatttgc L621 tgatggaagc ataagaacat cggaactgcg actcagcatg cagaagtcaa ttccgtgtgg gaagcgtgtt gcaagaaggt tgtgggaaaa ctaaagcacc cgtt cgaccgggga atggtctgga acacggacct L80; ggtggagacc ctggagctgc agaacctgat gctgtgtgcg ctgcagacca agaggcacgg aaggagtcac ggggcgcgca tgccagggaa gactacaagg tgcggattga 192; tgagtacgat tactccaagc ccatccaggg gcaacagaag aagccctttg aggagcactg 198; gaggaagcac accctgtcct atgtggacgt tggcactggg actc tggaatatag 204; acccgtgatc gacaaaactt tgaacgaggc tgactgtgcc accgtcccgc cagccattcg 210; ctcctactga aaga tgtggtgatg acagaatcag cttttgtaat tatgtataat 216; agctcatgca tgtgtccatg tcataactgt cttcatacgc ttctgcactc tggggaagaa 222; ggagtacatt agat tggcacctag tggctgggag cttgccagga acccagtggc 228- caqqqacht ggcacttacc tttgtccctt gcttcattct tgtgagatga tggg 234- tctt aaataaaata aaca aactttcttt tatttccaaa aaaaaaaaaa 240; aaaaa Protein seguence: NCBI Reference ce: NP_004159.2 LOCUS NP_004159 ACCESSION N P_004159 l msgvrglsrl lsarrlalak qtgt rgfhftvdgn krasakvsds isaqypvvdh 6; efdavvvgag gaglraafgl seagfntacv tklfptrsht vaaqgginaa lgnmeednwr 12; whfydtvkgs dwlgdqdaih ymteqapaav velenygmpf srtedgkiyq rafggqslkf 18; gkggqahrcc cvadrtghsl lhtlygrslr ydtsyfveyf aldllmenge crgvialcie 24; dgsihrirak ntvvatggyg rtyfsctsah tstgdgtami traglpchl equfhptgi ; iteg crgeggilin sqgerfmery lasr dvvsrsmtle iregrgcgpe 36; kdhvqulhh lppeqlatrl pgisetamif agvdvtkepi pvlptvhynm ggiptnykgq 42; vlrhvnquq ivpglyacge aacasvhgan rlganslldl vvfgracals ieescrpgdk 48; vppikpnage esvmnldklr fadgsirtse lrlsmqksmq nhaavfrvgs vlqegcgkis 54; klygdlkhlk tfdrgmvwnt elqn lmlcalqtiy gaearkesrg aharedykvr 60; ideydyskpi ngqkkpfee hwrkhtlsyv dvgtgkvtle yrpvidktln eadcatvppa 66L irsy TPMA OfibblSwnbszPM4 OfimmlName:Uopomy0§n4 m: 7171 Organism: Homo sapiens Othe r Aliases: Other Designations: TM30p1;tropomyosin alpha-4 chain; tropomyosin-4; Nucleotide seguence (variant 1): NCBI Reference Seguence: 145160.1 LOCUS NM_001145160 ION 145160 l ataaggccct ctcctccacc ctgccaggct cactctgccc caca gcccctgact 6; gccgcagccc ccacagagcc cgccgcgcac cccacgtccc ccacgccagc gcccagccat 12; ggaggccatc aagaagaaaa tgcagatgct gaagttggac aaggagaatg ccatcgaccg l8; cgcggagcag gcggaggcgg ataagaaagc cgctgaggac aagtgcaagc aggtggagga 24; ggagctgacg cacctccaga agaaactaaa agag gacgagctgg ataaatattc ; cgaggacctg aaggacgcgc aggagaagct ggagctcacg gagaagaagg cctccgacgc 36; tgaaggtgat gtggccgccc tcaaccgacg catccagctc gttgaggagg agttggacag 42; ggctcaggaa cgactggcca ngccctgca gaagctggag gaggcagaaa caga 48; tgagagtgag agaggaatga aggtgataga aaaccgggcc gatg aggagaagat 54; ggagattcag cagc tcaaagaggc caagcacatt gcggaagagg ctgaccgcaa 60; atacgaggag gtagctcgta agctggtcat cctggagggt gagctggaga gggcagagga 66; gcgtgcggag gtgtctgaac taaaatgtgg tgacctggaa gaagaactca agaatgttac 72; taacaatctg aaatctctgg aggctgcatc tgaaaagtat tctgaaaagg aggacaaata 78; tgaagaagaa attaaacttc tgtctgacaa agag gctgagaccc gtgctgaatt 84; tgcagagaga acggttgcaa aactggaaaa gacaattgat gacctggaag agaaacttgc 90; ccaggccaaa gaagagaacg tgggcttaca tcagacactg gatcagacac taaacgaact 96L taactgtata taagcaaaac gtct tgttccaaca gaaactctgg agctccgtgg 102; gtctttctct tctcttgtaa gaagttcctt ttgttattgc catcttcgct gaaa lO8L tgtcaagcaa aata catgaccaaa tattttgtat cggagaagct ttgagcacca 114; gttaaatctc attccttccc tttc aaatggcacc agctttttca gctctcttat 120; ctta agtagcattt attcctaagg taggcagggt atttcctagt aagcatactt 126; tcttaagacg gaggccattt tggg agaataggca gccccacact ttgaagaata 132; cagaccccag tatctagtcg tggatataat taaaacgctg aagaccataa ccttttgggt 138; caactgttgg tcaaactata ggagagacca gggaccatca catgggtagg gattttccat 144; ccagagccaa taaaaggact ggtgggggcc gggggtggct attgtgggaa gtcataaccc 150; acagatagat caacctaaga atcctggccc ttctccactc tccaccatgc aggacaaaca 156; tcttctcaag cagtcaacgt agaatgcttg ggaaatagtc ataattaccc acatatagta 1621 attaatagat ggtaattaat tgatccttga tgtgatgttc ttttgcatat ttccttcatt L681 ctaaagttgt tccctggccg ggagcgttgg ctttcgcctg taatcccaac gccaggacag tgag gtcaggagtt cgagaccagc aaca atgtctctac taaaaataca aaaattatgg tgacgcctgc ctgtagtccc gaggctgagg caggaggatc gcttgaaccc aggaagtgga gactgcagtg agccgatatc 192; cagc gctccagcct ggtcgacaga gtgagactcc atctcaagaa taaagttgtt ctctgaagag caaatgtctc attccagtaa tgacccactc agcaggaata 204; tggtggagtt aatt caggtcagcc atatccaaaa gaccacaagt cattactaag 210; ttgagcaaaa gagtttttat ctattagcag aaagggcctc tctggcagca aaaa 216; actggcccaa cttcatttcc atacttcagg gaacagcaaa ttgaggattt acttatctag 222; aatt ccttctttgg gaccaagtta ataaaagacc aagaaactcc tgattaaact 228; ggataatgaa ggattctgta gctg cacgtatcgg ctttgtttga cttctctttt 234; ctcagttaac gagc tagaacattc cacattcccc agcagcgtgt gggggctgac 240; taaagtttac aattccaact aaaaatcacc ctgcttctgg cttatctgaa tcccttaccc 246; accccacccc ctac ttat tcagcaccac actacccagg aaatacacta 252; tgtg caatggaata aaatccacac tttagattct tgcaactgta tcatatgtaa 2581 tagtatcact ttttctacat tttggtcaaa taaattttta cataaactac Protein seguence (variant 1): NCBI Reference Seguence: NP_001138632.1 LOCUS N P_001 138632 ACCESSION N P_001138632 1 meaikkkmqm 1k1dkenaid raeqaeadkk kqve eelthlqkkl kgtedeldky 61 sedlkdaqek leltekkasd aegdvaalnr riqlveeeld raqerlatal qkleeaekaa 121 desergmkvi deek meiqemqlke akhiaeeadr kyeevarklv ilegelerae 181 eraevselkc lknv tnnlksleaa sekysekedk yeeeikllsd klkeaetrae 241 faertvakle ktiddleekl aqakeenvgl hqtldqtlne lnci Nucleotide seguence: NCBI Reference Seguence (variant 2): NM_003290.2 LOCUS NM_003290 ACCESSION NM_003290 l tttccagcag ctgtggccag cgac gtcaggccct cccccagcgg tgctgacgtc 6L ggcggtccgg ccgggtgacc tcatcgcccc gacggcagcc ggcccggggg agag 12; gcgggggcgg cccccgcgca ggcaaaggct tggggggccg ggct gtgcagctct l8; cgccggagcc gagcccagcc ccgc cgctgcccgt gcgcctctgc gcctccgcgc 24; catggccggc ctcaactccc tggaggcggt gaaacgcaag atccaggccc agca ; ggcggacgag gcggaagacc gcgcgcaggg cctgcagcgg gagctggacg gcgagcgcga 36; gcggcgcgag aaagctgaag gtgatgtggc cgccctcaac cgacgcatcc agctcgttga 42; ggaggagttg gctc aggaacgact ggccacggcc ctgcagaagc tggaggaggc 48; agaaaaagct gaga gagg aatgaaggtg atagaaaacc gggccatgaa 54; ggatgaggag aagatggaga ttcaggagat gcagctcaaa gaggccaagc acattgcgga 60; agaggctgac cgcaaatacg aggaggtagc tcgtaagctg gtcatcctgg agggtgagct 66; ggagagggca gaggagcgtg gtc tgaactaaaa tgtggtgacc tggaagaaga 72; actcaagaat gttactaaca atctgaaatc tctggaggct gcatctgaaa agtattctga 78; aaaggaggac aaatatgaag aagaaattaa acttctgtct gacaaactga aagaggctga 84; tgct gaatttgcag agagaacggt tgcaaaactg gaaaagacaa ttgatgacct 90; ggaagagaaa cttgcccagg aaga gaacgtgggc caga cactggatca 96; gacactaaac gaacttaact gtatataagc aaaacagaag agtcttgttc caacagaaac 102; tctggagctc cgtgggtctt tctcttctct tgtaagaagt tccttttgtt attgccatct 108; tgct ggaaatgtca agcaaattat gaatacatga ccaaatattt tgtatcggag 114; aagctttgag ttaa atctcattcc ttcccttttt ttttcaaatg gcaccagctt 120; tttcagctct cttatttttt ccttaagtag catttattcc taaggtaggc ctagtaagca tactttctta agacggaggc catttggttc ctgggagaat aggcagcccc 132; acactttgaa gaatacagac cccagtatct agtcgtggat ataattaaaa cgctgaagac L38; cataaccttt tgggtcaact gttggtcaaa gaga gaccagggac catcacatgg L44; gtagggattt tccatccaga gccaataaaa ggactggtgg gggccggggg tggctattgt 150; gggaagtcat aacccacaga tagatcaacc taagaatcct ggcccttctc ccac 156; catgcaggac aaacatcttc tcaagcagtc gaat gaaa tagtcataat L62; tacccacata tagtaattaa tagatggtaa ttaattgatc cttgatgtga tgttcttttg L68; catatttcct tcattctaaa gttgttccct ggccgggagc tttc gcctgtaatc L74: ccaacacttt ccag gacagatcac ttgaggtcag gagttcgaga ccagcccagc L80; caacatggcg aaaccatgtc tctactaaaa atacaaaaat tatggtgacg ctcgggaggc tgaggcagga ggatcgcttg aacccaggaa atatcgcacc acagcgctcc agcctggtcg acagagtgag aagaaaaaat aaaaataaag ttgttctctg aagagcaaat gtctcattcc agtaatgacc 204; cactcagcag gaatatggtg gagttcagtc caattcaggt tatc caaaagacca 210; caagtcatta ctaagttgag caaaagagtt tttatctatt agcagaaagg gcctctctgg 216; cagcagagat taaaaactgg cccaacttca tttccatact tcagggaaca gcaaattgag 222; gatttactta tctaggactt gaattccttc tttgggacca taaa agaccaagaa 228; actcctgatt aaactggata atgaaggatt acag acgt atcggctttg 234; tttgacttct cttttctcag ttaacatctc agagctagaa cattccacat tccccagcag 240; Cgtgtggggg ctgactaaag tttacaattc caactaaaaa tcaccctgct tctggcttat 246; ctgaatccct tacccacccc accccaccac cctactccta tttattcagc ctac 252; ccaggaaata cactagcaaa ttgtgcaatg gaataaaatc cacactttag attcttgcaa 258; ctgtatcata tgtaatagta tcactttttc tacattttgg tcaaataaat ttttacataa 2641 actac Pnneh1seguence(vafiant2y NCBI Reference Seguence: NP_003281.1 LOCUS N P_003281 ACCESSION NP_003281 l leav krkiqaquq adeaedraqg lqreldgere rrekaegdva alnrriqlve 61 eeldraqerl atalqkleea ekaadeserg mkvienramk deekmeiqem qlkeakhiae 121 eadrkyeeva rklvilegel aevs elkcgdleee nlks leaasekyse 181 kedkyeeeik llsdklkeae traefaertv aklektiddl eeklaqakee nvglhqtldq 241 tlnelnci Official Symbol: ETFA Official Name: electron-transfer-flavoprotein, alpha polypeptide M: 2108 Organism: Homo s Other Aliases: EMA, GAZ, MADD Other Designations: alpha-ETF; electron transfer flavoprotein alpha-subunit; electron transfer flavoprotein subunit alpha, ondrial; electron transfer flavoprotein, alpha polypeptide; glutaric aciduria || Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_000126.3 LOCUS NM_000126 ACCESSION NM_000126 l attaggtgac aggc ggcgccagtt ggccgggcac ggggctgctg taaggccgag 63— gttgcggcgg aagcggagac catgttccga gcggcggctc cggggcagct ccggcgggcg 12; gcctcattgc tacgatttca gagtaccctg gtaatagctg agcatgcaaa tgattcccta 18; gcacccatta ctttaaatac cattactgca gccacacgcc ttggaggtga agtgtcctgc 24; ttagtagctg gaaccaaatg tgacaaggtg gcacaagatc tctgtaaagt agcaggcata ; gcaaaagttc tggtggctca gcatgatgtg tacaaaggcc tacttccaga ggaactgaca 36; ccattgattt tggcaactca gaagcagttc aattacacac acatctgtgc tggagcatct 42; gccttcggaa agaacctttt gcccagagta gcagccaaac ttgaggttgc cccgatttct 48; gacatcattg agtc acctgacaca tttgtgagaa ctatttatgc aggaaatgct 54; acag tgaagtgtga tgagaaagtg tttt gtgg aacatccttt 60; gatgctgcag caacaagtgg cggtagtgcc agttcagaaa caag tacttcacca 66L gtggaaatat cagagtggct tgaccagaaa ttaacaaaaa gtgatcgacc agagctaaca 72L ggtgccaaag tggtggtatc tggtggtcga ggcttgaaga agaa gttg 78; ttatatgact atca actacatgct ggtg cttcccgtgc tgctgttgat 84; gctggctttg ttcccaatga catgcaagtt ggacagacgg gaaaaatagt agcaccagaa 90; ctttatattg gaat atctggagcc atccaacatt tagctgggat gaaagacagc 96; aagacaattg tggcaattaa taaagaccca gaagctccaa ttttccaagt ggcagattat 102; ggaatagttg cagatttatt taaggtagtt atga tatt gaagaaaaaa 108; tgaatcagga tcatgcctta aaaagaaaac ttttgttaaa cact gaaatcacag 114; atatttgtgg gtattataac aatcattgga aagcatggag agctacattt cataatttga 120; gggaaaattt ctaacagatg ccagaatgct tgtttatggg attgctgtgt ttccttttaa 1261 ttatttgtgg ttccaaacaa tttg aactttttaa attctgtact aaaatctata 1321 ataaagcttt tccacagctt taaaactatc agaaaaaaaa aaaaaaaaa Protein seguence (variant 1 ): NCBI Reference Seguence: NP_000117.1 LOCUS NP_OOO117 ACCESSION NP_000117 1 mfraaapgql lrfq stlviaehan dslapitlnt itaatrlgge vsclvagtkc 61 dkvaqdlckv agiakvlvaq hdvykgllpe eltplilatq kqfnythica gasafgknll 121 prvaakleva aiks pdtfvrtiya gnalctvkcd ekvkvfsvrg tsfdaaatsg 181 kass tspveisewl qultksdrp eltgakvvvs ggrglksgen fkllydladq 241 lhaavgasra avdagfvpnd mqvgqtgkiv avgi sgaithagm kdsktivain 301 kdpeapifqv adygivadlf kvvpemteil kkk Nucleotide sequence (variant 2): NCBI Reference ce: NM_001127716.1 LOCUS NM_001127716 ACCESSION 127716 1 attaggtgac tggctgaggc ggcgccagtt ggccgggcac ggggctgctg taaggccgag 61 gttgcggcgg aagcggagac catgttccga gctc cggggcagct ccggcgggcg 121 gtggcacaag atctctgtaa agtagcaggc atagcaaaag ttctggtggc tcagcatgat 181 gtgtacaaag gcctacttcc agaggaactg acaccattga ttttggcaac tcagaagcag 241 ttcaattaca cacacatctg tgctggagca tctgccttcg gaaagaacct tttgcccaga 301 gtagcagcca aacttgaggt tgccccgatt tctgacatca ttgcaatcaa gtcacctgac 36; acatttgtga gaactattta tgcaggaaat gctctatgta cagtgaagtg tgatgagaaa 42; gtgaaagtgt tccg tggaacatcc tttgatgctg caag tggcggtagt 48; gccagttcag aaaaggcatc aagtacttca ccagtggaaa tatcagagtg gcttgaccag 54; aaattaacaa aaagtgatcg gcta acaggtgcca aagtggtggt atctggtggt 60; cgaggcttga agagtggaga gaactttaag tatg acttggcaga tcaactacat 66; gctgcagttg gtgcttcccg tgctgctgtt gatgctggct ccaa tgacatgcaa 72: gttggacaga cgggaaaaat agtagcacca gaactttata ttgctgttgg tgga 78L gccatccaac atttagctgg gatgaaagac agcaagacaa ttgtggcaat taataaagac 84L ccagaagctc caattttcca agtggcagat tatggaatag attt atttaaggta 90; gttcctgaaa tgactgagat gaaa aaatgaatca ggatcatgcc ttaaaaagaa 96; aacttttgtt aaagtattcc actgaaatca cagatatttg tgggtattat aacaatcatt 102; ggaaagcatg gagagctaca tttcataatt tgagggaaaa tttctaacag atgccagaat 108; gcttgtttat gggattgctg cttt taattatttg tggttccaaa caattattgt 114; ttgaactttt taaattctgt actaaaatct ataataaagc ttttccacag ctttaaaact 1201 atcagaaaaa aaaaaaaaaa aa Protein sequence (variant 2): NCBI Reference Seguence: NP_001121188.1 LOCUS NP_001121188 ACCESSION NP_001121188 1 mfraaapgql rravaqdlck vagiakvlva qhdvykgllp eeltplilat qkqfnythic 61 agasafgknl 1prvaaklev apisdiiaik rtiy agnalctvkc dekvkvfsvr 121 aats ggsassekas stspveisew 1qu1tksdr peltgakvvv sggrglksge 181 nfkllydlad qlhaavgasr fvpn quvgqtgki vapelyiavg isgaithag 241 mkdsktivai pifq vadygivadl fkvvpemtei 1kkk RPL8 Official Symbol: RPL8 Official Name: ribosomal protein L8 m: 6132 Organism: Homo sapiens Othe r Aliases: L8 Other Designations: 608 mal n L8 Nucleotide seguence nt 1): NCBI Reference Seguence : NM_000973.3 LOCUS 973 ACCESSION NM_000973 l agataaggcc gctcgctgac gccgtgtttc ctctttcggc cgcgctggtg aacaggaccc 6L gtcgccatgg gccgtgtgat ccgtggacag aggaagggcg ccgggtctgt gttccgcgcg 121 cacgtgaagc accgtaaagg cgctgcgcgc ctgcgcgccg tggatttcgc gcac 18; ggctacatca agggcatcgt caaggacatc atccacgacc cgggccgcgg cgcgcccctc 24; gccaaggtgg tcttccggga tccgtatcgg tttaagaagc ggacggagct gttcattgcc ; gccgagggca ttcacacggg ccagtttgtg tattgcggca agaaggccca gctcaacatt 36; ggcaatgtgc tccctgtggg caccatgcct gagggtacaa tcgtgtgctg cctggaggag 421 aagcctggag accgtggcaa gctggcccgg gcatcaggga actatgccac cgttatctcc 481 cacaaccctg agaccaagaa gacccgtgtg aagctgccct ccggctccaa tatc 541 tcctcagcca acagagctgt ggttggtgtg gtggctggag gtggccgaat tgacaaaccc 60; atcttgaagg ctggccgggc gtaccacaaa tataaggcaa agaggaactg ctggccacga 661 gtacggggtg tggccatgaa tcctgtggag catccttttg gaggtggcaa ccaccagcac 721 atcggcaagc cctccaccat ccgcagagat gctg gccgcaaagt gggtctcatt 781 gctgcccgcc ggactggacg tctccgggga accaagactg tgcaggagaa agagaactag 841 tgctgagggc ctcaataaag tttgtgttta aaaa aaaaaaaaaa aaaa 90; aaa Protein seguence (variant 1): NCBI nce Seguence: NP_000964.1 LOCUS N P_000964 ACCESSION 964 1 mgrvirgqu gagsvfrahv khrkgaarlr avdfaerhgy ikgivkdiih dpgrgaplak 61 yrfk krtelfiaae gihtgqfvyc gkkaqlnign vlpvgtmpeg tivccleekp 121 gdrgklaras gnyatvishn petkktrvkl psgskkviss anravvgvva gggridkpil 181 kagrayhkyk akrncwprvr gvamnpvehp fgggnhqhig kpstirrdap agrkvgliaa 241 rrtgrlrgtk tvqeken Nucleotide seguence (variant 2): NCBI nce ce: NM_033301.1 LOCUS NM_033301 ACCESSION NM_033301 l tggg cagtatccgc cgccatcctc ttccgtgagg cgcgctgaga cccggaccgg 6L ccctcctgag aggatgccgg cgcc cgcggagagg gacccgtcgc catgggccgt 121 gtgatccgtg gacagaggaa cggg tctgtgttcc gcgcgcacgt gaagcaccgt 18; aaaggcgctg Cgcgcctgcg cgccgtggat ttcgctgagc ggcacggcta gggc 24; atcgtcaagg acatcatcca cgacccgggc cgcggcgcgc ccctcgccaa ggtggtcttc ; cgggatccgt atcggtttaa gaagcggacg gagctgttca ccga gggcattcac 36; acgggccagt ttgtgtattg cggcaagaag gcccagctca acattggcaa tgtgctccct 42; gtgggcacca aggg tacaatcgtg tgctgcctgg aggagaagcc tggagaccgt 48; ggcaagctgg cccgggcatc agggaactat gccaccgtta tctcccacaa ccctgagacc 54; aagaagaccc gtgtgaagct gccctccggc tccaagaagg ttatctcctc agccaacaga 60; gctgtggttg gtgtggtggc tggaggtggc cgaattgaca aacccatctt gaaggctggc 66; cgggcgtacc acaaatataa ggcaaagagg aactgctggc cacgagtacg gggtgtggcc 72; atgaatcctg tggagcatcc ttttggaggt cacc agcacatcgg caagccctcc 78; accatccgca gagatgcccc tgctggccgc aaagtgggtc ctgc ccgccggact 84; ggacgtctcc ggggaaccaa gactgtgcag gagaaagaga actagtgctg agggcctcaa 90; taaagtttgt gtttatgcca aaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 96; aaaaaaa Protein seguence (variant 2): NCBI Reference Seguence: NP_150644.1 LOCUS N P_1 50644 ACCESSION N 44 1 mgrvirgqu gagsvfrahv khrkgaarlr avdfaerhgy ikgivkdiih dpgrgaplak 61 vvfrdpyrfk krtelfiaae gihtgqfvyc gkkaqlnign mpeg tivccleekp 121 gdrgklaras gnyatvishn rvkl psgskkviss anravvgvva gggridkpil 181 kagrayhkyk akrncwprvr gvamnpvehp fgggnhqhig kpstirrdap agrkvgliaa 241 rrtgrlrgtk tvqeken Official Symbol: ARCN1 Official Name: archain 1 m: 372 Organism: Homo sapiens Other Aliases: COPD Other Designations: archain vesicle transport protein 1; coatomer delta subunit; coatomer protein complex, t delta; coatomer protein delta-COP; coatomer subunit delta; delta-COP; delta-coat protein Nucleotide seguence (variant 1): NCBI Reference Seguence: 655.4 LOCUS NM_001655 ACCESSION NM_001655 1 gaagacgtgg cttggggccg ccatcttggc aagaggcgaa gcgg ttcctgtcaa 61 gggggcagca ggtccagagc tgct cccgttcccc agaccctacc ccca 121 gtggagccgg agtgcgggcg cgccccacca ccgccctcac catggtgctg ttggcagcag 181 cggtctgcac aaaagcagga aaggctattg tttctcgaca gtttgtggaa atgacccgaa 241 ctcggattga gggcttatta gcagcttttc caaagctcat gaacactgga aaacaacata ; cgtttgttga aacagagagt gtaagatatg tctaccagcc gaaa ctgtatatgg 361 tactgatcac taccaaaaac agcaacattt tagaagattt ggagacccta aggctcttct 42; tgat atat tgccgagcct tagaagagaa tgaaatatct 48L ttgatttgat ttttgctttt gatgaaattg tcgcactggg ataccgggag aatgttaact 54L tggcacagat cttc acagaaatgg attctcatga ggtg ttcagagccg 60; tcagagagac tcaagaacgt gaagctaagg ctgagatgcg tcgtaaagca aaggaattac 661 aacaggcccg aagagatgca gagagacagg gcaaaaaagc accaggattt ggcggatttg 721 gcagctctgc agtatctgga acag ctgccatgat cacagagacc atcattgaaa 78; ctgataaacc aaaagtggca cctgcaccag cttc aggccccagc aaggctttaa 84; aacttggagc caaaggaaag gaagtagata actttgtgga caaattaaaa tctgaaggtg 90; tcat gtcctctagt atgggcaagc gtacttctga agcaaccaaa gctc 96; cacccattaa tatggaaagt gtacatatga agattgaaga aaagataaca ttaacctgtg L02; gacgagacgg aggattacag aatatggagt tgcatggcat gatcatgctt aggatctcag L08; atgacaagta aatt cgtcttcatg tggaaaatga agataagaaa tccaaatgtg gataaaaaac ttttcactgc tcta gaagtcattt ccagtcaaca gtgacgtagg ggtgctaaag aaaccacaga ggaatctttt attccactga caattaattg ctggccctcg gagagtggaa L32; atggctgtga tgtcaacata gaatatgagc tacaagaaga taatttagaa ctgaatgatg L38; tggttatcac catcccactc ccgtctggtg tcggcgcgcc tgttatcggt gggagt at Cg acatgacagt cgacgaaata agtg gtgcctgcct gatg L50; ataa gagtggcagc ctggagttta gcattgctgg gcagcccaat ctgttcaagt ttcctttgtc tccaagaaaa attactgtaa catacaggtt accaaagtga L62; cccaggtaga tggaaacagc cccgtcaggt tttccacaga tttc ctagtggata L68; agtatgaaat tctgtaatac caagaagagg gagctgaaaa ggaaaatttt cagattaata L74; aagaagacgc caatgatggc tgaagagttt ttcccagatt tacaagccac tggagacccc L80; ttttttctga tacaatgcac gattctctgc gcgcaaggac cctcgactca cccccatgtt L86; tcagtgtcac agagacattc tttgataagg aaatggcaca aacataaagg gaaaggctgc L92; taattttctt tggcagattg tattggccag caggaaagca agctctccag agaatgcccc L98; aata cctcctctac ctttacctaa gttgctcctt tatttttatt ttattattat 204; tattattatt attattattt tttgagatgg agtctcactt ccag gctggaatgc 210; aatggcatga tctcagctca ctgcaacctc cgcctcctgg gttcaagcaa tgcc 216; tcagcctccg agtagctggg actacaggtg acca cgcctggcta attttttgta 222; ttttagtaga gttt caccgtgttg cccaggctgg tcgcgaactc ctgagctcag 2281 gcaatccgcc cacctcagcc tcccaaagtg ttgggattac aggcatgagc caccatgccc 234; agctgctcct ttattttaat ccctaaatat aatccctaaa tatagttata tttcatactt 240; agtttgtttt gttt tctctgtaga aaattttaat cattcatacc ctttaccttt 246; aggtttttct acat tcagtcaggc actgggatca tctgtttaca ggcattatat 252; ggca ctcctggaac aagtatatct aacccattct tgatttttgg actattcagg 258; tgaactattt gaggggtatg gggtctagaa gttaaaagat acgcatgtct tctgttcttt 264; tcccgtatca attcattcct tcatctcttt gccaagttgt tttcctttca gggcctgtcc 270; ttccagttta tacc atgaatccca cttgtgtcaa tattaaagat agctgagaag 276; cacctttcaa atggcacagt ccctcttcaa taaa agaatggtta tgtctgtcca 282; gttagggatt tcacatccac atgtaatcat gtctgctgct gttgctaccc aaattttcat 288; acat tttgggtact taagctaaaa cgtaatggcc acagtctgta atccattcac 294: attcctcagt ttcaccacct ccctcttcca gactgcactc tctgtcatca gtcccctcct 300; ttctaacaga aatggggtta tgattttgaa gggt tcagggagtc tttgccaatc 306; ctgttggccc taaactatca aggaggctcc atttcaccat ttgatttttt gcatttcagg 3121 aggcaactga ttgtttcgat atgtacatat tactcacgta taccccattt ccttccagtc 318; acat tttccaccag tctgtcccca tctctgaaat ccttccttct ccct 324; aagtcttttg agtgtcatca tgtactggtg gtttctcggt tccatctcat ccatttcctt 330; ttcaatggag actacagcgt cagccagctc agccttggct tttaactcaa tattccagtc 336; cataggggtg agtt aggc tgcaggcact ggcagtggga agaggcagac 342; gactagatga cttctgcact tttagctggt tgaaaagtac cactcccact atct 348; ggccgtccct gcaaagagtg tactgtgctt gaagcagagc actcacacat aaatggctgt 354; gtgtggaatt gcttgccaaa gaagtttcta gcctttccct ttcccctaac tgcatcaggg 360; aagaattctt atctctagct tggtttccac atgaggtttt aagg gcttgggaca 366; agaagtctgt catgttagtt aagcaggcaa gaaatcctac taatccagtt ttgtttgaaa 372; gttgtttgtc cgtatgattt tttaaaagtc aagtttaatt tcaaaaaacc ttttttttct 378; gagattactt ttggggtaat atttaaaatg agagacattt tgtaaccctg taaaatacat 384; agggaatata acattccagt gtatacaaag aaggcaaatt ctttaatcaa ataaagcgca 390; ttataaaatg agatgtttat tggattattg actcactttg gctt gttgattcag 396; gatgctgtaa ctaa cattaaaaat taatgacatg ttttttttaa aaaa 4021 aaaa aa Protein seguence (variant 1): NCBI Reference Seguence: NP_001646.2 LOCUS N P_001 646 ACCESSION NP_001646 l mvllaaavct kagkaivsrq fvemtrtrie gllaafpklm ntgkqhtfve tesvryqup 6; meklymvlit tknsniledl etlrlfsrvi peycraleen eisehcfdli fafdeivaLg 12; yrenvnlaqi rtftemdshe ekvfravret qereakaemr rkakequar gkka 18; pgfggfgssa vsggstaami tetiietdkp kvapaparps gpskalklga kgkevdnfvd 24; klksegetim sssmgkrtse atkmhappin mesvhmkiee kitltcgrdg glqnmelhgm ; imlrisddky grirlhvene dkkqulqth pnvdkklfta esliglknpe ksfpvnsdvg 36; vlkwrlqtte tinc wpsesgngcd vnieyelqed nlelndvvit iplpsgvgap 42: vigeidgeyr tlew clpvidaknk sgslefsiag qpndffpqu sfvskknycn 48L iqvtkvtqu gnspvrfste ttflvdkyei l Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001142281.1 LOCUS NM_001 142281 ACCESSION 142281 l gaagacgtgg cttggggccg ccatcttggc cgaa gcggcagcgg tcaa 6; gggggcagca ggtccagagc tgctggtgct cccgttcccc tacc ccca 12; ccgg agtgcgggcg cgccccacca ccgccctcac catgatccct gaatattgcc 18; gagccttaga agagaatgaa atatctgagc actgttttga tttgattttt gcttttgatg 24; aaattgtcgc actgggatac nggagaatg ttaacttggc acagatcaga accttcacag ; aaatggattc tcatgaggag ttca gagccgtcag agagactcaa gaag 36; ctaaggctga gatgcgtcgt aaagcaaagg aattacaaca ggcccgaaga gatgcagaga 42; gacagggcaa aaaagcacca ggatttggcg gatttggcag ctctgcagta tctggaggca 48; gcacagctgc catgatcaca gagaccatca ttgaaactga taaaccaaaa gtggcacctg 54; caccagccag gccttcaggc cccagcaagg ctttaaaact tggagccaaa gaag 60; tagataactt tgtggacaaa ttaaaatctg aaggtgaaac gtcc tctagtatgg 66; gcaagcgtac ttctgaagca accaaaatgc atgctccacc cattaatatg gaaagtgtac 72: atatgaagat tgaagaaaag ataacattaa cctgtggacg agacggagga ttacagaata 78L tggagttgca tggcatgatc atgcttagga tctcagatga tggc cgaattcgtc 84L ttcatgtgga aaatgaagat aagaaagggg tgcagctaca gacccatcca aatgtggata 90; aaaaactttt cactgcagag tctctaattg gcctgaagaa tccagagaag tcatttccag 96; tcaacagtga Cgtaggggtg ctaaagtgga gactacaaac cacagaggaa tcttttattc 102; cactgacaat taattgctgg ccctcggaga gtggaaatgg ctgtgatgtc aacatagaat L08; atgagctaca taat ttagaactga atgatgtggt tatcaccatc ccgt L14; ctggtgtcgg cgcgcctgtt atcggtgaga tcgatgggga gtatcgacat gacagtcgac L20; gaaataccct ggagtggtgc ctgcctgtga ttgatgccaa aaataagagt ggcagcctgg L26; agtttagcat tgctgggcag cccaatgact tcttccctgt tcaagtttcc tcca L32; agaaaaatta ctgtaacata caggttacca aagtgaccca ggtagatgga aacagccccg L38; tcaggttttc cacagagacc actttcctag agta tgaaattctg taataccaag L44; aagagggagc tgaaaaggaa aattttcaga ttaataaaga agacgccaat cagatttaca agccactgga gacccctttt ttctgataca gcgc aaggaccctc gactcacccc catgtttcag tgtcacagag ataaggaaat ggcacaaaca taaagggaaa ggctgctaat tttctttggc cagg aaagcaagct ctccagagaa tgcccccagt taaatacctc acctaagttg tatt tttattttat tattattatt atta ttattttttg L80; agatggagtc tcactttgta acccaggctg gaatgcaatg tctc aacctccgcc gttc aagcaagtct cctgcctcag agta gctgggacta L92; caggtgcacg ccaccacgcc tggctaattt tttgtatttt agtagagacg gggtttcacc L98; gtgttgccca ggctggtcgc gaactcctga gctcaggcaa tccgcccacc tcagcctccc 204; aaagtgttgg gattacaggc atgagccacc atgcccagct gctcctttat tttaatccct 210; aaatataatc cctaaatata gttatatttc atacttagtt tgtttttaaa tctc 216; tgtagaaaat tttaatcatt catacccttt acctttaggt ttttctttct atacattcag 222; tcaggcactg ggatcatctg tttacaggca ttatatttat ttggcactcc tggaacaagt 228; atatctaacc cattcttgat ttttggacta ttcaggtgaa gagg ggtatggggt 234; ctagaagtta aaagatacgc atgtcttctg ttcttttccc gtatcaattc attccttcat 240; ctctttgcca agttgttttc ctttcagggc ctgtccttcc gaac agtaccatga 246; atcccacttg tgtcaatatt aaagatagct gagaagcacc tttcaaatgg cacagtccct 252i cttcaagatg tctaaaagaa tggttatgtc tgtccagtta gggatttcac atccacatgt 2581 aatcatgtct gctgctgttg ctacccaaat tttcatttct ccacattttg ggtacttaag 264; ctaaaacgta atggccacag tctgtaatcc attcacattc ctcagtttca ccct 270; cttccagact gcactctctg tcatcagtcc ttct aacagaaatg gggttatgat 276; tttgaaggct gtgggttcag tttg ccaatcctgt tggccctaaa ctatcaagga 282; ggctccattt caccatttga ttttttgcat ttcaggaggc aactgattgt ttcgatatgt 288; acatattact cacgtatacc ccatttcctt ccagtcagcc caacattttc tctg 294; tccccatctc tgaaatcctt ccttctcttt ccccctaagt cttttgagtg tcatcatgta 300; ctggtggttt ctcggttcca tctcatccat ttccttttca atggagacta cagcgtcagc 306; cagctcagcc ttggctttta actcaatatt ccagtccata ggggtggtta aaagttgctg 312; caaggctgca ggca gtgggaagag gcagacgact agatgacttc tgcactttta 318; gctggttgaa aagtaccact cccactctga acatctggcc gtccctgcaa agagtgtact 324: gtgcttgaag cagagcactc acacataaat ggctgtgtgt ggaattgctt gccaaagaag 330; tttctagcct ttccctttcc cctaactgca tcagggaaga attcttatct ctagcttggt 336; ttccacatga ggtttttctg gctt gggacaagaa gtctgtcatg ttagttaagc 3421 aggcaagaaa tcctactaat ccagttttgt ttgaaagttg tttgtccgta ttta 348; aaagtcaagt tcaa aaaacctttt ttttctgaga ttacttttgg ggtaatattt 354; aaaatgagag tgta accctgtaaa atacataggg aatataacat tccagtgtat 360; acaaagaagg caaattcttt aatcaaataa agcgcattat aaaatgagat gtttattgga 366; ttattgactc gtgt ctgcttgttg attcaggatg ctgtaatggg acctaacatt 3721 taat gacatgtttt gaga aaaaaaaaaa aaaaaaaa Protein sequence (variant 2): NCBI Reference Seguence: NP_001135753.1 LOCUS N P_001 135753 ION N P_001135753 l mipeycrale eneisehcfd lifafdeiva lgyrenvnla qirtftemds heekvfravr 6; etqereakae mrrkakequ arrdaerqgk kapgfggfgs staa mitetiietd 12; kpkvapapar psgpskalkl gakgkevdnf vdklkseget imsssmgkrt seatkmhapp 18; inmesvhmki eekitltcgr dgglqnmelh gmimlrisdd kygrirlhve 24; thpnvdkklf taesliglkn peksfpvnsd vgvlkwrlqt teesfiplti ncwpsesgng 30L cdvnieyelq ednlelndvv itiplpsgvg apvigeidge yrhdsrrntl ewclpvidak 36L nksgslefsi agqpndffpv kny cniqvtkvtq vdgnspvrfs tettflvdky 42L eil DDX18 Official Sym bol: DDX18 Official Name: DEAD (Asp-Glu-Ala—Asp) box polypeptide 18 m: 8886 Organism: Homo sapiens Other s: MrDb Other Designations: ATP-dependent RNA helicase DDX18; DEAD box protein 18; DEAD/H (Asp-Glu-AIa—Asp/His) box polypeptide 18 (Myc-regulated); Myc- regulated DEAD box protein Nucleotide seguence: NCBI nce Seguence: NM_006773.3 LOCUS NM_006773 ACCESSION NM_006773 l gccgagctgc gcacgtgcgg ccggaaggga agtaacgtca gcctgagaac tgagtagctg 6; tactgtgtgg cgccttattc taggcacttg ttgggcagaa tgtcacacct gccgatgaaa 12; ctcctgcgta agaagatcga gaagcggaac ctcaaattgc ggcagcggaa cctaaagttt 18; cagggggcct caaatctgac ggaa actcaaaatg gagatgtatc tgaagaaaca 24; atgggaagta gaaaggttaa aaaatcaaaa caaaagccca tgaatgtggg cttatcagaa ; actcaaaatg gaggcatgtc tcaagaagca gtgggaaata ttac tccc 36; cagaaatcca ctgtattaac caatggagaa gcagcaatgc agtcttccaa ttcagaatca 42; aaaaagaaaa agaagaaaaa gagaaaaatg gtgaatgatg ctgagcctga tacgaaaaaa 48; gcaaaaactg aaaacaaagg gaaatctgaa gaagaaagtg ccgagactac taaagaaaca 54; gaaaataatg tggagaagcc agataatgat gaagatgaga gtgaggtgcc cagtctgccc 60; ctgggactga caggagcttt tgaggatact tcgtttgctt ctctatgtaa tcttgtcaat 66; gaaaacactc tgaaggcaat aaaagaaatg ggttttacaa ctga gcat 72; aaaagtatca gaccacttct ggaaggcagg gatcttctag cagctgcaaa aacaggcagt 78; ggtaaaaccc tggcttttct catccctgca gttgaactca ttgttaagtt aaggttcatg 84L aatg gaacaggagt ccttattctc tcacctacta gagaactagc aacc 90; tttggtgttc ttaaggagct gatgactcac cacgtgcata cctatggctt gataatgggt 96L ggcagtaaca gatctgctga agcacagaaa cttggtaatg ggatcaacat cattgtggcc 102; ggcc gtctgctgga ccatatgcag ccag gatttatgta cctg 108; cagtgtctgg ttattgatga agctgatcgt atcttggatg ttga agaggaatta L14; aagcaaatta ttaaactttt gccaacacgt agacagacta tgctcttttc tgccacccaa L20; actcgaaaag acct ggcaaggatt tctctgaaaa aggagccatt gtatgttggc L26; gttgatgatg ataaagcgaa tgcaacagtg gatggtcttg aacagggata tgttgtttgt L32; ccttctgaaa agagattcct tctgctcttt acattcctta agaagaaccg aaagaagaag L38; cttatggtct tcttttcatc ttgtatgtct tacc actatgagtt gctgaactac L44; attgatttgc ccgtcttggc cattcatgga aagcaaaagc aaaataagcg L50; ttcttccagt tctgcaatgc ggga acactattgt gtacggatgt ggcagcgaga L56; ggactagaca ttcctgaagt cgactggatt gttcagtatg accctccgga atcgtgtggg tagaacagcc agaggcctaa atgggagagg gcccagaaga attgggtttt cttcgctact tgaaacaatc caaggttcca L74; ttaagtgaat ttgacttttc ctggtctaaa atttctgaca ttcagtctca ttgattgaaa agaattactt tcttcataag tcagcccagg aagcatataa cgagcctatg attcccattc acag atctttaatg ttaataacct aaatttgcct L92; caggttgctc tgtcatttgg tttcaaggtg cctcccttcg ttgatctgaa cgtcaacagt L98; aatgaaggca agcagaaaaa gcgaggaggt ggat ttggctacca gaaaaccaag 204; aaagttgaga aaat ctttaaacac attagcaaga aatcatctga cagcaggcag 210; ttctctcact gaacacatgc cttcctttca tcttgaataa ccta attt 216; tttttcccct aaca ttgt agactttaga atttggactt acctaacaag 222; agtataaatt gacttgggtt gcaagcactg agcactgtta tcac gtctctcttt 228; tatttctggg atataaaaca ggctttaagt ttcttggttg cccaagggca gagcaaggaa 234; tatctggtgt ttcttgtgat gataatattt taattttaaa tatccctccc tcatacaagt 240; tacc attttaatat aattcttttt ttcc ttcttgtttt gcgaagattt 246; ttgtggcatg gattgctgtg ctcactgctg taaaaggtga cctagtgtac gctg 252; gtggcggtgc agaaaagagt ctcaggttat tttttgtttt tagttatttc ttggaccttg 258; tcta atgactcctc ctgaaaatgc tgcagtataa aagagcaaag agctttggga 2641 aatacctaag aagcacctta agattagggt ggcattgctt ttatagattc ttgattttaa 270; agcaacaggc ctttctcagg tgttgcattt tttggagcaa aaactatggg ttgtaatttg 276; aataaagtgt cactaagcag ttataacgtt tgatggctgg ggggtaggaa gaggatggaa 282; ttgagatgtt tgagcctcat caat agaggtgtaa tgtactgcat ttcttcattt 288; ggtaacataa cttt catacaaaga acgatgatgc tcctcattaa gatttgttta 294; attcaaggtg gtttggattt ggtaagcctt tgcactctgt agagtactta gaagacaagg 300; gcaacttact tggagttaga gccaagctgt cagacggtgc ccagcacaca ttaatgttag 306; cttctttctg agaaaaaaat acctcttcca ggccctgaaa caaaaaatac atttgctgtg 312; aagattgaaa atgaacaaag aaaa aaacagcaaa atcagtgatt tagtcagatg 318; agtttttcgt tgtaggagca cttgatttct agtgtgtttt gtacagtata taactacaag 324; atagtacatt ttgtagcagt tcaaagccaa agttgctagc atcattttgc tgttgtgcca 330; gttaatcata ggatcccatt gtgt gctaacatcg aatatagaga aaactggtaa 336; agaacattcc agtaggaaaa gaaaagaaca catt tctgggcttg gccaccatca 342; tcgg cctg gacttccaac cttgactgct gagctcctgg cttagcttct 348; tgggttccta attcctggtg tttaataatt ctctccacga tcatgttttt ctgatttttt 354; ttttcagaaa taatgttttt taaaagacaa aaacaaaggg aagaatattt aattactgag 360; cagaagtaaa tggc attttgtaca taatctaatt tttatatgca tgttcatgct 366; ttttaatttt aaaa attaagtcat ctacctacta cttgtaacca gcttgtttca 372; taacatgtta tgtg tcattaaata attacttcaa tgttgaaaaa aaaaaaaaaa 3781 aaaaaaaaaa Protein sequence: NCBI Reference Seguence: 764.3 LOCUS N P_006764 ACCESSION NP_006764 l mshlpmkllr kkiekrnlkl rqrnlquga snltlsetqn gdvseetmgs rkvkksqup 6" mnvglsetqn ggmsqeavgn uks tv._tngeaam qssnseskkk kkkkrkmvnd 12; aepdtkkakt enkgkseees aettketenn vekpdndede sevpslplgl tgafedtsfa 18; slcnlvnent lkaikemgft nmteiqhksi rdll aaaktgsgkt laflipavel 24; ivklrfmprn gtgvlilspt relamqtfgv lkelmthhvh tyglimggsn rsaeaqklgn 30L giniivatpg rlldhmqntp gfmyknlqcl videadrild vgfeeelkqi ikllptrrqt 36L mlfsatqtrk islk keplyvgvdd vdgl quyvvcpse krflllftfl 42L kknrkkklmv svky hyellnyidl pvlaihgqu anrtttffq fcnadsgtll 48L ctdvaargld ipevdwivqy dppddpkeyi hrvgrtargl ngrghallil rpeelgflry 54; lkqskvplse fdfswskisd iqsqleklie knyflhksaq eayksyiray qifn 60; vnnlnlpqva lsfgfkvppf lenvnsneg qukrggggg fquktkkve kskifkhisk 661 kssdsrqfsh Official Symbol: GSBPZ Official TPase activating protein (SH3 domain) binding protein 2 m: GTPase activating protein (SH3 domain) binding protein 2 Organism: Homo sapiens Other Aliases: Other Designations: ; GAP 8H3 domain-binding protein 2; Ras-GTPase activating protein 8H3 domain-binding protein 2; ras GTPase-activating nbinding protein 2 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_203505.2 LOCUS NM_203505 ACCESSION 505 l gtgctcgggg gttccctggc cctttcggca ggggtaaaac aataagaggg ggcggtggca 6L aagggggcgg cgtg gtccttgtcg cacgtcgcag cgcctggcgc ccgggaagag 12; gtggttgtga ggcagacgaa ctcgcggctc tccggcttcc gaggcttccg agttgtcgga 18; ggaagggggc ggcgagcaat aagaacccgc cggt cctcagcgac tcttctgacc 24; tccgcgcgac gtacccgccg ccgccgttgg ctggagcatt tgtg cagcaaagaa ; atggttatgg agaagcccag tccgctgctt gtagggcggg agtttgtgag gcaatattat 36; actttgctga ataaagctcc ggaatattta cacaggtttt atggcaggaa ttcttcctat 42; gttcatggtg gagtagatgc tagtggaaag gaag ctgtttatgg ccaaaatgat 48; atacaccaca aagtattatc tctgaacttc agtgaatgtc atactaaaat tcgtcatgtg 54L gatgctcatg caaccttgag tgatggagta gttgtccagg tcatgggttt gctgtctaac 60; agtggacaac gaaa gtttatgcaa acctttgttc ctga aggatctgtt 66L ccaaataaat tttatgttca caatgatatg tttcgttatg aagatgaagt gtttggtgat 72; tctgagcctg atga agaatcagaa gatgaagtag aagaggaaca agaagaaaga 78; caaccatctc ctgaacctgt gcaagaaaat gctaacagtg atga agctcaccct 84; gtgactaatg gcatagagga gcctttggaa gaatcctctc atgaacctga acctgagcca 90; gaatctgaaa caaagactga agagctgaaa ccacaagtgg aggagaagaa cttagaagaa 96; gaga ctac tcctcctccg gcagaacctg tttctctgcc acaagaacca L02; ccaaaggctt tctcctgggc ttcagtgacc agtaaaaacc tgcctcctag tggtactgtt L08; tcttcctctg gaattccacc ccatgttaaa gcaccagtct cacagccaag agtcgaagct L14; aaaccagaag ttcaatctca gccacctcgt gtgcgtgaac aacgacctag agaacgacct L20; cctc ctagaggacc aagaccaggc agaggagata tggaacagaa taattcgcta tccagatagt catcaacttt ttgttggtaa aaaatgagct aaaggaattc ttcatgagtt ttggaaacgt ccaagggtgt tgggggaaag cttccaaatt ttggttttgt ggtttttgat ;44; gactctgaac cagttcagag aatcttaatt gcaaaaccga ttatgtttcg aggggaagta ;50; cgtttaaatg tggaagagaa aaaaacaaga gctgcaagag agcgagaaac cagaggtggt L56; ggtgatgatc gcagggatat taggcgcaat gatcgaggtc ccggtggtcc gtgggtggtg gaatgatgcg tgatcgtgat ggaagaggac ctcctccaag gcacagaaac ttggctctgg aagaggaacc gggcaaatgg agggccgctt cacaggacag L74; cgtcgctgaa gctccactgt tggcaaagtc ttggcagtgg tacattattc atcgtgtttg L80; cattcttgtt aatttttttt ttggctttgg acac agcctttttg atcatttctt L86; tgatgtgaaa agcatctttg gttatcagtt aggt tatt tccccaattt L92; cacaacagga ttcacattgt taatttataa atctagactt ggagaattaa ggactgagaa L98; atgaccatat cttaaactat ctacgacaaa gtgaacttaa aaggacatgc aatt 204; caggtccttt gagtaaaaaa aaaatcttct gctgcacatt ttgtttaagt gttactgttt 210; gtta atgctgggaa tagt gcaatttgtg caattggaga atcttgcctt 216; ttttcttggc tccccccaaa aatacaaacc aacagaaact tgttatgcac aaat 222; gtactaatgg gtactctgaa ctcattaaca ttgacatctg gagg caacagggaa 228; tcat cttcttttcc agtagaaaat agtttgtgaa atgatgaggg cattttatct 2341 gcttgctgtg accagcgtgt gtacacataa accttaacaa gactacaagt atattccaga 240; tcat tttagttatg aactaaataa taaaaattag aacttcaaat gcgatggtct 246; tgactattag accagattta gtagctccat gatt tttctacctg cccctcttca 252; gtacagggat ggctggctgc tcaacacact cccc ttttttcctt tctttaagct 258; gtgtacagtg aaaattgtct tatt tttgttctct gtaa taagcatgat 264; ggtgccttct attaatacat cattccagtc ttgctggtaa ttttgtacag tatagtgtat 270; gaattgctgt gctgcaaagc caaacagctg caaaatgttg aaaaatcatc gaaatgtata 276; aaaattgcag tatctttaaa atcagtaaaa tggactagca tattatttat cttgttcttc 282; agttaacaac tttgtgttct ctgtgggagg gtcc tgtgtgtttg tggggagagg 288; gaaggaggaa gtcagttatt tgagtaagcc tgac ttttctctta gcctgaatgt 294; ggacgttgaa acatatcact tcagggcttg gaaaagtcag tcaacttgac gtacattttt 300: agtgacattt cagt cagattctat aaatggcaag taagcctgaa gtgaggatac 306; tgcaattttc ggagaaaaga acagcagctc tttaagtgtt tgcattttct atttgggggg 312; cagggaactg tcattcattt tgcacaattc tgat gtcagcaccc gagtggctcc 318; tgaatttaag tctgggacga catcttttat ttttacatga atctttaaac gtga 324; gcaaagtttg ctgg gtct gtctttatag caagttccag taaaccacaa 330; gtatggcaaa gcttatccaa ttttatgctt ggagcagtca gtacatacca gtttctgatg 336; tttcaggcag gagtggggta aataagtgtg accacttaaa gctgctcgtt agcatggaag 342; acttctccat tctatctttg taaaacagac aagatatgca cttgacatag tagcaaattg 348; gttctgaatt atgcaactgt ttgctattta gtaaactagc aaatgatgca tgtattttgt 354; ttttcatgta ctgggcaata aaat ctgtcccttt ttcccccttt ggtc 360; gttt gagggaaagt cttgcactat tgcatatatt ttggggacac agattttcat 366; agtttccatt tttggggggc ttaaggattt tttttttttc tgtttgaaac agttttatac 372; tttctgatat agtacttgaa attcttacca gaaaattact ttggagtttt gaagccttta 378; ttaatactac ttttaaagaa gcagttgttt caat gttttttttc ccccaagcat 384; attttcttgt atttctgttt ccatatatat atatatatat tcca attcaggata 390; tgcc atccatgaaa actgttctgg caccaaaagt aatgacaaat gttaagtgta 396; ataatagaaa agtagagcaa agagccattc agcttcagtc tttacatacc atgaataaaa 402; cattaaaaca tcatatggag aagtttacat tgtt cacctgcagt actgtggagt 408: tttaacattt tgtcctcttt tcagtgaaac agagtaaaaa tattcatcta ccattactgt 4141 tatttgctga ttttgtttta ttttttgatg gtaatattct atccttatga tgca 420; accaaattgg ctttaccatc ttag taggtataga agacaatgga ttaccatctt 426; tattgctgta atgtgttaag cattatatgc tagtagaatc tagtttaatt ggtg 432; gaaagtattc tttgagtttc catattgaat gtgtttggac taaacaaaca ataaactact 4381 gatgtctgca gcatttatct atgtccctaa Protein seguence (variant 1 ): NCBI Reference Seguence: NP_987101.1 LOCUS NP_987101 ACCESSION N P_987101 l mvmekpspll vgrefvrqyy tllnkapeyl hrfygrnssy vhggvdasgk pqeavygqnd 6; ihhkvlslnf irhv dahatlsdgv vqumgllsn sgqperkfmq tfvlapegsv 12; pnkfyvhndm vfgd sepeldeese deveeeqeer vqen ansgyyeahp l8; vtngieeple esshepepep esetkteelk pqveeknlee leeksttppp aepvslpqep 24; pkafswasvt sknlppsgtv sssgipphvk apvsqprvea kpevqsqppr vreqrprerp 3 0 '— quprqprpq rgdmeqndsd nrriirypds hqlfvgnlph didenelkef vvel 36; rintkgvggk lpnfgfvvfd dsepvqrili akpimfrgev rlnveekktr aareretrgg 42; gddrrdirrn drgpggprgi rdrd rggm aqklgsgrgt gqmegrftgq 48; rr Nucleotide ce (variant 2): NCBI Reference Seguence: NM_012297.4 LOCUS NM_012297 ACCESSION NM_012297 l acattccatt ccgc ggcgcgaggc ccgg tgtgtgagcc cgggagccgg 6; aggtgtagcg gcagagacat tgttcttgcc ggctccctac ggtgccgtgt gtgcgtgaga 12; gaagaccagt ctttcctcta gcatttgaca ttgtgcagca aagaaatggt gaag 18; cccagtccgc tgcttgtagg gcgggagttt gtgaggcaat attatacttt gctgaataaa 24; gctccggaat atttacacag gttttatggc aggaattctt cctatgttca tggtggagta ; gatgctagtg gaaagcccca ggaagctgtt caaa atgatataca ccacaaagta 36; ttatctctga acttcagtga atgtcatact aaaattcgtc atgtggatgc tcatgcaacc 42; ttgagtgatg gagtagttgt ccaggtcatg ggtttgctgt ctaacagtgg acaaccagaa 48; agaaagttta cctt tgttctggct cctgaaggat ctgttccaaa taaattttat 54; gttcacaatg atatgtttcg ttatgaagat gaagtgtttg gtgattctga gcctgaactt 60; gaat cagaagatga agtagaagag gaacaagaag aaagacaacc atctcctgaa 66; cctgtgcaag aaaatgctaa cagtggttac tatgaagctc accctgtgac taatggcata 72; gaggagcctt tggaagaatc ctctcatgaa cctgaacctg agccagaatc tgaaacaaag 78; actgaagagc tgaaaccaca agtggaggag aagaacttag aagaactaga ggagaaatct 84; actactcctc ctccggcaga acctgtttct ctgccacaag aaccaccaaa ggctttctcc 90; tgggcttcag gtaa gcct cctagtggta ctgtttcttc ctctggaatt 96; ccaccccatg ttaaagcacc agtctcacag ccaagagtcg aacc tcaa L02; tctcagccac ctcgtgtgcg tgaacaacga cctagagaac gttt tcctcctaga L08; ggaccaagac gagg agatatggaa cagaatgact ctgacaaccg tagaataatt :14: cgctatccag atagtcatca actttttgtt ggtaacttgc atat aattcttcat gagttttgga gtgg aacttcgcat gaaagcttcc aaattttggt tttgtggttt ttgatgactc cagagaatct taattgcaaa accgattatg tttcgagggg aagtacgttt aaatgtggaa L38; gagaaaaaaa caagagctgc aagagagcga gaaaccagag gtggtggtga gatattaggc gcaatgatcg aggtcccggt ggtccacgtg tggg atgcgtgatc gtgatggaag aggacctcct ccaaggggtg gcatggcaca gaaacttggc L56; agag gaaccgggca aatggagggc cgcttcacag gacagcgtcg ctgaagctcc L62; actgttggca aagtcttggc agtggtacat tattcatcgt gtttgcattc ttgttaattt L68; tttttttggc tttggaatgt gacacagcct ttttgatcat ttctttgatg tgaaaagcat L74; ctttggttat cagttaaatt gaggtggaca cccc aatttcacaa caggattcac L80; attgttaatt tataaatcta gaga attaaggact gagaaatgac catatcttaa L86; actatctacg tgaa cttaaaagga cact gaattcaggt cctttgagta L92; aaaaaaaaat cttctgctgc acattttgtt taagtgttac tgtttctgcc tgct L98; gggaacacaa atagtgcaat ttgtgcaatt ggagaatctt gccttttttc ttggctcccc 204; atac aaaccaacag aaacttgtta tgcactcatc aaaatgtact aatgggtact 210; ctgaactcat taacattgac atctgcaaca ggaggcaaca gggaaaaaat ctcatcttct 216; tttccagtag aaaatagttt gtgaaatgat gagggcattt tatctgcttg ctgtgaccag 222: cgtgtgtaca cataaacctt aacaagacta tatt ccagaaggaa atcattttag 2281 ttatgaacta aataataaaa attagaactt caaatgcgat ggtcttgact attagaccag 234; atttagtagc tccatatcta agatttttct acctgcccct cttcagtaca gggatggctg 240; gctgctcaac acactcctcc tccccttttt tcctttcttt aagctgtgta cagtgaaaat 246; tgtctttact gtatttttgt tctctggtaa tgtaataagc atgatggtgc cttctattaa 252; tacatcattc cagtcttgct ggtaattttg tacagtatag tgtatgaatt gctgtgctgc 258; aaagccaaac agctgcaaaa tgttgaaaaa tcatcgaaat gtataaaaat tgcagtatct 264; ttaaaatcag taaaatggac tagcatatta tttatcttgt tcttcagtta acaactttgt 270; gttctctgtg ggagggaggg agtcctgtgt gtttgtgggg agagggaagg aggaagtcag 276; ttatttgagt ctag ttgacttttc tcttagcctg aatgtggacg ttgaaacata 282; tcacttcagg gcttggaaaa caac ttgacgtaca tttttagtga cattttaaaa 288; gcagtcagat tctataaatg gcaagtaagc tgag gatactgcaa ttttcggaga 294i cagc agctctttaa gtgtttgcat tttctatttg gggggcaggg aactgtcatt 300; cattttgcac aattcttgaa ctgatgtcag cacccgagtg gctcctgaat ttaagtctgg 306; gacgacatct ttta catgaatctt taaacaattc tgtgagcaaa gtttgtagct 312; gctggattat tgtctgtctt tatagcaagt tccagtaaac cacaagtatg gcaaagctta 318; ttta tgcttggagc taca taccagtttc tgatgtttca ggcaggagtg 324; gggtaaataa gtgtgaccac ttaaagctgc tcgttagcat ggaagacttc tccattctat 330; ctttgtaaaa cagacaagat atgcacttga catagtagca aattggttct gaattatgca 336; actgtttgct taaa ctagcaaatg atgcatgtat tttgtttttc atgtactggg 342; caatatgagt aaaatctgtc cctttttccc cctttgaatg aggtcttcca aggg 348; aaagtcttgc actattgcat atattttggg gacacagatt ttcatagttt ccatttttgg 354; taag gatttttttt ttttctgttt gaaacagttt tatactttct gatatagtac 360; ttgaaattct taccagaaaa ttactttgga gttttgaagc ctttattaat actactttta 366; aagaagcagt tgttttattg tcaatgtttt ttttccccca agcatatttt cttgtatttc 372; tgtttccata tatatatata tatatataat ttca ggatattgcc tcca 378; ctgt acca aaagtaatga caaatgttaa gtgtaataat agaaaagtag 384; agcaaagagc cattcagctt cagtctttac ataccatgaa taaaacatta aaacatcata 390; tggagaagtt tacatggtga ttgttcacct gcagtactgt ggagttttaa cattttgtcc 396; cagt gaaacagagt aaaaatattc atctaccatt actgttattt gctgattttg 402; ttttattttt tgatggtaat attctatcct tatgacacta ttgcaaccaa attggcttta 408; ccatcttggc tttagtaggt atagaagaca atggattacc atctttattg ctgtaatgtg 414; ttaagcatta tatgctagta gaatctagtt taattgtttc aggtggaaag tattctttga 420; gtttccatat tgaatgtgtt tggactaaac aaacaataaa ctactgatgt ctgcagcatt 4261 tatctatgtc cctaa Protein seguence (variant 2): NCBI Reference Seguence: NP_036429.2 LOCUS N P_036429 ACCESSION NP_036429 l mvmekpspll vgrefvrqyy tllnkapeyl hrfygrnssy asgk pqeavygqnd 6; ihhkvlslnf sechtkirhv dahatlsdgv vqumgllsn sgqperkfmq tfvlapegsv 12; hndm fryedevfgd sepeldeese deveeeqeer qpspepvqen ansgyyeahp l8; vtngieeple esshepepep esetkteelk pqveeknlee leeksttppp aepvslpqep 24; pkafswasvt sknlppsgtv sssgipphvk apvsqprvea kpevqsqppr vreqrprerp 3 0 '— quprqprpq rgdmeqndsd nrriirypds hqlfvgnlph didenelkef fmsfgnvvel 36; rintkgvggk lpnfgfvvfd dsepvqrili akpimfrgev rlnveekktr trgg 42; gddrrdirrn drgpggprgi Vgggmmrdrd grgppprggm aqklgsgrgt gqmegrftgq 48; rr Nucleotide seguence (variant 3): NCBI Reference ce: NM_203504.2 LOCUS NM_203504 ACCESSION NM_203504 l gtgctcgggg gttccctggc cctttcggca ggggtaaaac aataagaggg ggcggtggca 6L aagggggcgg gacgtccgtg gtcg cacgtcgcag cgcctggcgc ccgggaagag 12; gtggttgtga ggcagacgaa ctcgcggctc tccggcttcc gaggcttccg agttgtcgga 18; ggaagggggc ggcgagcaat ccgc cgcacccggt cgac tcttctgacc 24; tccgcgcgac gtacccgccg ttgg ctggagcatt tgacattgtg cagcaaagaa ; atggttatgg agaagcccag tccgctgctt gtagggcggg agtttgtgag gcaatattat 36; actttgctga ataaagctcc ggaatattta cacaggtttt atggcaggaa ttcttcctat 42; gttcatggtg gagtagatgc tagtggaaag ccccaggaag ctgtttatgg ccaaaatgat 48; atacaccaca aagtattatc tctgaacttc agtgaatgtc atactaaaat tcgtcatgtg 54; catg caaccttgag tgatggagta gttgtccagg tcatgggttt gctgtctaac 60; agtggacaac cagaaagaaa gtttatgcaa acctttgttc tggctcctga aggatctgtt 66; ccaaataaat ttca caatgatatg tttcgttatg aagatgaagt gtttggtgat 72; cctg aacttgatga agaa gatgaagtag aagaggaaca agaagaaaga 78; caaccatctc ctgaacctgt gcaagaaaat gctaacagtg gttactatga agctcaccct 84; gtgactaatg gcatagagga gcctttggaa gaatcctctc atgaacctga acctgagcca 90; gaatctgaaa caaagactga gaaa ccacaagtgg aggagaagaa agaa 96; ctagaggaga aatctactac tccg gcagaacctg tttctctgcc acaagaacca L02; ccaaagccaa gagtcgaagc taaaccagaa gttcaatctc agccacctcg tgtgcgtgaa L08; ccta gagaacgacc tggttttcct cctagaggac caagaccagg cagaggagat L14; atggaacaga ctga caaccgtaga ataattcgct atccagatag L20; tttgttgqta acttgccaca tgatattgat gagc taaaggaatt cttcatgagt L26; tttggaaacg ttgtggaact tcgcatcaat accaagggtg ttgggggaaa ttga tgactctgaa caga gaatcttaat gaggggaagt acgtttaaat gtggaagaga aaaaaacaag gagcgagaaa coagaggtgg tggtgatgat cgcagggata ttaggcgcaa cccggtggtc cacgtggaat tgtgggtggt ggaatgatgc gtgatcgtga cctcctccaa gcat ggcacagaaa cttggctctg gaagaggaac cgggcaaatg L62; gagggccgct gaca gcgtcgctga actg ttggcaaagt cttggcagtg L68; gtacattatt catcgtgttt gcattcttgt taattttttt tttggctttg gaatgtgaca L74; cagccttttt gatcatttct ttgatgtgaa aagcatcttt ggttatcagt taaattgagg L80; tggacattat ttccccaatt tcacaacagg attcacattg ttaatttata aatctagact L86; tggagaatta aggactgaga aatgaccata tcttaaacta tctacgacaa agtgaactta L92; aaaggacatg cccactgaat cctt tgagtaaaaa aaaaatcttc tgctgcacat L98; tttgtttaag tgttactgtt tctgcctgtt aatgctggga acacaaatag tgcaatttgt 204; gcaattggag aatcttgcct tttttcttgg ctccccccaa aaatacaaac caacagaaac 210; ttgttatgca ctcatcaaaa tgtactaatg ggtactctga actcattaac atct 216; gcaacaggag gcaacaggga aaaaatctca tcttcttttc cagtagaaaa tagtttgtga 222; aatgatgagg tatc tgcttgctgt gaccagcgtg tgtacacata aaccttaaca 2281 agactacaag tatattccag atca ttttagttat gaactaaata ataaaaatta 234; gaacttcaaa tgcgatggtc ttgactatta gaccagattt agtagctcca tatctaagat 240; ttttctacct gcccctcttc ggga tggctggctg ctcaacacac tcctcctccc 246; cttttttcct ttctttaagc tgtgtacagt gaaaattgtc tttactgtat ttttgttctc 252; tggtaatgta ataagcatga tggtgccttc tattaataca tcattccagt cttgctggta 258; attttgtaca gtatagtgta tgaattgctg tgctgcaaag ccaaacagct gcaaaatgtt 264; gaaaaatcat cgaaatgtat aaaaattgca gtatctttaa aatcagtaaa atggactagc 270; atattattta tctt cagttaacaa ctttgtgttc tctgtgggag agtc 276; ctgtgtgttt gtggggagag ggaaggagga agtcagttat ttgagtaagc ctctagttga 282; cttttctctt agcctgaatg tggacgttga aacatatcac ttcagggctt ggaaaagtca 288; ttga cgtacatttt tagtgacatt ttaaaagcag tcagattcta taaatggcaa 294; gtaagcctga agtgaggata ctgcaatttt aaag aacagcagct ctttaagtgt 300; ttgcattttc tatttggggg gcagggaact gtcattcatt ttgcacaatt cttgaactga 306; tgtcagcacc cgagtggctc ctgaatttaa gtctgggacg acatctttta tttttacatg 312; aatctttaaa caattctgtg agcaaagttt gtagctgctg gattattgtc tgtctttata 318; gcaagttcca gtaaaccaca agtatggcaa tcca tgct tggagcagtc 324; agtacatacc agtttctgat ggca ggagtggggt gtgt gaccacttaa 330; agctgctcgt tagcatggaa gacttctcca ttctatcttt gtaaaacaga caagatatgc 336; cata gtagcaaatt ggttctgaat tatgcaactg tttgctattt agtaaactag 342; caaatgatgc atgtattttg tttttcatgt actgggcaat atgagtaaaa tctgtccctt 348; tttccccctt tgaatgaggt cttccatgtt tgagggaaag tcttgcacta ttgcatatat 354; tttggggaca cagattttca tagtttccat ttttgggggg cttaaggatt tttttttttt 360; ctgtttgaaa cagttttata ctttctgata tagtacttga aattcttacc agaaaattac 366; tttggagttt tgaagccttt attaatacta cttttaaaga agcagttgtt ttattgtcaa 372; tgtttttttt cccccaagca tattttcttg tatttctgtt tccatatata tatatatata 378; tataatttcc aattcaggat attgccctgc tgaa aactgttctg gcaccaaaag 384; caaa tgttaagtgt aataatagaa agca catt cagcttcagt 390; ctttacatac catgaataaa acattaaaac tgga gaagtttaca tggtgattgt 396; tcacctgcag tactgtggag ttttaacatt ttgtcctctt ttcagtgaaa cagagtaaaa 402; atattcatct accattactg ttatttgctg attttgtttt attttttgat ggtaatattc 4081 tatccttatg acactattgc aaccaaattg gctttaccat cttggcttta atag 414; aagacaatgg attaccatct ctgt aatgtgttaa gcattatatg ctagtagaat 420; ctagtttaat tgtttcaggt ggaaagtatt gttt ccatattgaa tgtgtttgga 4261 ctaaacaaac aataaactac tgatgtctgc agcatttatc tatgtcccta a Protein seguence (variant 3): NCBI Reference Seguence: NP_987100.1 LOCUS NP_987100 ACCESSION N P_987100 1 spll vgrefvrqyy tllnkapeyl hrfygrnssy vhggvdasgk pqeavygqnd 61 ihhkvlslnf sechtkirhv dahatlsdgv lsn sgqperkfmq egsv 12; pnkfyvhndm fryedevfgd sepeldeese deveeeqeer qpspepvqen ansgyyeahp 181 vtngieeple esshepepep esetkteelk pqveeknlee leeksttppp aepvslpqep 241 pkprveakpe vqsqpprvre qrprerqup grgd meqndsdnrr iirypdshql 301 fvgnlphdid enelkeffms fgnvvelrin tkgvggklpn fgfvvfddse pvqriliakp 36; imfrgevrln veekktraar eretrgggdd ndrg pggprgivgg gmmrdrdgrg 42'— ppprggmaqk lgsgrgtgqm egrftgqrr UQCRH Official Symbol: UQCRH Official Name: ubiquinoI-cytochrome 0 reductase hinge protein Gene ID: 7388 Organism: Homo sapiens Other Aliases: QCRG, UQCR8 Other Designations: complex I” subunit 6; complex ||| subunit Vlll; cytochrome b-cf complex subunit 6, mitochondrial;cytochrome c1 non-heme 11 kDa protein; mitochondrial hinge protein; nol-cytochrome 0 reductase complex 11 kDa protein; ubiquinol—cytochrome 0 reductase, complex I” subunit V||| Nucleotide seguence: NCBI nce Seguence: NM_006004.2 LOCUS NM_006004 ACCESSION NM_006004 1 ctgaactggg ttaggtgccg ctgttgctgc tcgtgttgaa tctagaaccg tagccagaca 61 tgggactgga ggacgagcaa aagatgctta ccgaatccgg tgag gaggaggaag 121 aggaagagga agtg gatcccctaa caacagtgag agagcaatgc gagcagttgg 181 agaaatgtgt aaaggcccgg gagcggctag gtga tgagcgtgta tcctctcgat 241 cacatacaga agaggattgc acggaggagc actt tgcg agggaccatt 301 gcgtggccca caaactcttt aacaacttga aataaatgtg tggacttaat tcaccccagt 361 cttcatcatc tgggcatcag aatatttcct tatggttttg gatgtaccat ttgtttctta 421 tttgtgtaac tgtaagttca catgaacctc atgggtttgg cttaggctgg ctat 481 gtaattcgca ccat aaaa gttctatgat ctgcaaaaaa aaaaaaaaaa 541 aaaaaa Protein seguence: NCBI Reference Seguence: NP_005995.2 LOCUS N P_005995 ACCESSION NP_005995 1 mgledeqkml tesgdpeeee eeeeelvdpl ttvreqceql ekcvkarerl elcdervssr 61 shteedctee lfdflhardh cvahklfnnl k HSPA4 Official Sym bol: HSPA4 Official Name: heat shock 70kDa protein 4 Gene ID: 3308 Organism: Homo sapiens Other Aliases: APG-2, H824/P52, HSPH2, RY, hsp70, hsp70RY Other Designations: heat shock 70 kDa protein 4; heat shock 70-related protein APO-2; heat shock 70kD protein 4; heat shock protein, 110 kDa; hsp70 RY Nucleotide ce: NCBI Reference Seguence: NM_002154.3 LOCUS NM_0021 54 ACCESSION NM_002154 l gctctggtgc tgcggctccg ctctcgtcgc aacgagatct ttcgagatct cccc 61 cggc gcctcctctg cggccactga gccggagccg gcctgagcag cggt 121 tgcagtaccc actggaagga cttaggcgct cgcgtggaca ccgcaagccc ctcagtagcc 181 tcggcccaag aggcctgctt tccactcgct agccccgccg ggggtccgtg tcctgtctcg 24; gtggccggac ccga gcccgagcag tagccggcgc catgtcggtg gtgggcatag ; acctgggctt ccagagctgc gctg tggcccgcgc cggcggcatc gagactatcg 36; ctaatgagta tagcgaccgc tgcacgccgg cttgcatttc ttttggtcct aagaatcgtt 42; caattggagc agcagctaaa gtaa tttctaatgc aaagaacaca gtccaaggat 48; ttaaaagatt ccatggccga gcattctctg atccatttgt ggaggcagaa aaatctaacc 54; ttgcatatga tattgtgcag ttgcctacag gattaacagg tataaaggtg acatatatgg 60L aggaagagcg aaattttacc caag tgactgccat gtcc aaactgaagg 66L agacagccga aagtgttctt cctg tagttgactg tgttgtttcg gttccttgtt 72L tctatactga tgcagaaaga cgatcagtga tggatgcaac acagattgct ggtcttaatt 78L gcttgcgatt aatgaatgaa gcag ttgctcttgc atatggaatc tataagcagg 84; atcttcctgc cttagaagag aaaccaagaa atgtagtttt tgtagacatg ggccactctg 90; cttatcaagt ttctgtatgt gcatttaata gaggaaaact gaaagttctg gccactgcat 96; ttgacacgac attgggaggt agaaaatttg atgaagtgtt tcac ttctgtgaag L02; aatttgggaa gaaatacaag ctagacatta agtccaaaat ccgtgcatta ttacgactct L08; ctcaggagtg tgagaaactc aagaaattga tgagtgcaaa tgcttcagat ctccctttga L14; gcattgaatg ttttatgaat gatg tatctggaac tatgaataga ggcaaatttc L20; tggagatgtg caatgatctc ttagctagag tggagccacc acttcgtagt gttttggaac L26; aaaccaagtt aaagaaagaa gatatttatg cagtggagat agttggtggt gctacacgaa L32; tccctgcggt aaaagagaag aaat ttttcggtaa agaacttagt acaacattaa L38; atgctgatga agctgtcact Cgaggctgtg cattgcagtg tgccatctta gctt L44; tcaaagtcag agaattttct gatg tagtaccata tccaatatct ctgagatgga L50; attctccagc tgaagaaggg tcaagtgact gtgaagtctt aaat catgctgctc L56; ctttctctaa agttcttaca ttttatagaa aggaaccttt cactcttgag gcctactaca L62; gctctcctca ggatttgccc tatccagatc ctgctatagc tcagttttca gttcagaaag L68: tcactcctca gtctgatggc tccagttcaa aagtgaaagt caaagttcga gtaaatgtcc L741 atggcatttt cagtgtgtcc agtgcatctt tagtggaggt tcacaagtct aatg L80; aggagccaat ggaaacagat cagaatgcaa aggaggaaga gaagatgcaa aggaaccaca tgttgaagag caacagcagc cagc agaaaataag gcagagtctg L92; aagaaatgga gacctctcaa gctggatcca aggataaaaa gatggaccaa caag L98; ccaagaaggc aaaagtgaag accagtactg tggacctgcc gaat cagctattat 204; ggcagataga cagagagatg ctcaacttgt acattgaaaa tgagggtaag atgatcatgc 210; aggataaact ggagaaggag cggaatgatg ctaagaacgc agtggaggaa tatgtgtatg 216; aaatgagaga caagcttagt ggtgaatatg agaagtttgt gagtgaagat aaca 222; gttttacttt gaaactggaa gatactgaaa attggttgta tgaggatgga gaagaccagc 228; caaagcaagt ttatgttgat aagttggctg aattaaaaaa tcaa cctattaaga 234; tacgtttcca ggaatctgaa gaacgaccaa aattatttga agaactaggg aaacagatcc 2401 atat gaaaataatc agctctttca aaaacaagga ggaccagtat gatcatttgg 246; atgctgctga catgacaaag gtagaaaaaa gcacaaatga agcaatggag aata 252; acaagctaaa tctgcagaac aagcagagtt tgaccatgga tccagttgtc aagtcaaaag 2581 agattgaagc taaaattaag gagctgacaa gtag ccctataatt tcaaagccca 264; aacccaaagt ggaacctcca aaagaggaac aaaaaaatgc agagcagaat ggaccagtgg 270; atggacaagg agacaaccca ggcccccagg ctgctgagca gggtacagac acagctgtgc 276; cttcggattc agacaagaag cttcctgaaa tggacattga ttgattccaa cacttgtttc 282; tattaaaaca gactattata aagctttaag ttgtcaactt tgttctaaat tagc 288; gcaagtgaat actgaagatt tcag tttttagggg attttcgggg aggggaaata 294; ggtaatgtat ggagcatttt cacttctaaa tagttagata cagaaattaa gtgcattgta 300; tctttttcat acta tttagaagcc gtct tactgagctt atgcttcact 306; cctttatgtt tgtg tctacaagaa taagtttgtt ttggaaagtt gagctatagc 312; tacagctcta gctatccagc agacttttca ttatgactta catggcagga gctctaatta 318; aaaa atctgttgtg gagattgctt taaatgctcc ggtg tggggatggg 324; gtccccctct gggc tggagcatgg cacggcatgg attaacacgg cagaggaaca 330; aaggtgtgct ctgagcttct tcatatttca ccttcaccct cacctgtgtt ctcttccctc 3361 tctcccaata aaagggctcc catta Protein seguence: NCBI Reference Seguence: NP_002145.3 LOCUS N P_002145 ACCESSION N P_002145 1 msvvgidlgf vara ggietianey sdrctpacis fgpknrsiga aaksqvisna 61 fkrf hgrafsdpfv eaeksnlayd ivqlptgltg eeer nftteqvtam 121 llsklketae svlkkpvvdc vvsvpcfytd aerrsvmdat qiaglnclrl mnettavala 18; ygiykqdlpa leekprnvvf ayqv rgkl kvlatafdtt lggrkfdevl 24; vnhfceefgk kykldikski rallrlsqec eklkklmsan asdlplsiec fmndvdvsgt ; mnrgkflemc ndllarvepp lrsvleqtkl kkediyavei vggatripav ffgk 36; elsttlnade avtrgcalqc ailspafkvr efsitdvvpy pislrwnspa eegssdcevf 42; sknhaapfsk vltfyrkepf tleayysqu paia qfqukvtpq sdgssskvkv 48; kvrvnvhgif svssaslvev hkseeneepm etdqnakeee kmquqeeph veeqqqqtpa 541 enkaeseeme tsqagskdkk mdqppqakka kvktstvdlp ienqllwqid remlnlyien 601 ukl ekerndakna veeyvyemrd klsgeyekfv seddrnsftl kledtenwly 661 edgedqpkqv yvdklaelkn lgqpikirfq eseerpklfe qqym kiissfknke 721 dqydhldaad mtkvekstne amewmnnkln lanqsltmd pvvkskeiea kikeltstcs 78; piiskpkpkv eppkeeqkna eqngpvdgqg dnpgpqaaeq gtdtavpsds dkklpemdid PSMA7 Official Symbol: PSMA7 Official Name: proteasome (prosome, macropain) subunit, alpha type, 7 Gene ID: 5688 Organism: Homo sapiens Other Aliases: RPS-1005F21.4, 06, HSPC, XAPC7 Other Designations: proteasome subunit RCG-f ; proteasome subunit XAPC7; proteasome subunit alpha 4; proteasome subunit alpha type-7 Nucleotide ce: NCBI Reference Seguence: NM_002792.3 LOCUS NM_002792 ACCESSION NM_002792 1 gtcgccgcct qacgccgccc gtcgccggca gcgcaggaca cggcgccgag ggtggggcgc 61 gggcgtagtg gcgccgggag thnggth gccg tgagtgtgcg agag 121 cgga aggagcccgg ccgccgcccg ccggcatgag ctacgaccgc gccatcaccg 181 tcttctcgcc cgacggccac ctcttccaag acgc gcaggaggcc gtcaagaagg 241 ccgc ggttggtgtt cgaggaagag acattgttgt tcttggtgtg gagaagaagt 301 cagtggccaa actgcaggat gaaagaacag tgcggaagat ctgtgctttg gatgacaacg 36; tctgcatggc aggc ctcaccgccg atgcaaggat agtcatcaac cggg 42; tggagtgcca gagccaccgg ctgactgtgg aggacccggt cactgtggag tacatcaccc 48; gctacatcgc cagtctgaag cagcgttata cgcagagcaa tgggcgcagg ccgtttggca 54; ccct catcgtgggt ttcgactttg atggcactcc taggctctat cagactgacc 60; cctcgggcac ataccatgcc tggaaggcca atgccatagg tcggggtgcc aagtcagtgc 66; gcgagttcct ggagaagaac tatactgacg ttga aacagatgat ctgaccatta 721 agctggtgat caaggcactc ctggaagtgg ttcagtcagg tggcaaaaac attgaacttg 78L ctgtcatgag gcgagatcaa tccctcaaga ttttaaatcc tgaagaaatt gagaagtatg 84L ttgctgaaat agaa aaagaagaaa acgaaaagaa gaaacaaaag aaagcatcat 90; gatgaataaa atgtctttgc ttgtaatttt taaattcata atgg ctcg 96; atgtgtaggc ctttccattc catttattca cactgagtgt cctacaataa gtat 102; ttttaacctg ttaaaaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_002783.1 LOCUS N P_002783 ACCESSION NP_002783 1 msydraitvf spdghlfqve yaqeavkkgs tavgvrgrdi vvlgvekksv aqudertvr 61 kicalddnvc mafagltada rivinrarve cqshrltved pvtveyitry iaslquytq 121 sngrrpfgis dfdg tprlyqtdps gtyhawkana igrgaksvre fleknytdea 181 ietddltikl vikallevvq sggknielav mrrdqslkil npeeiekyva eiekekeene 241 kkqukas K|F5B Official Symbol: KIFSB Official Name: n family member SB Gene ID: 3799 Organism: Homo sapiens Other Aliases: KINH, KNS, KNS1, UKHC Other Designations: conventional kinesin heavy chain; kinesin 1 (110-120kD); Mnefinheavychmn;mneSM—1heavycham;ubmuHOUSKMe§nimavycham Nucleotide seguence: NCBI Reference Seguence: NM_004521.2 LOCUS NM_004521 ACCESSION NM_004521 l ctcctcccgc accgccctgt cgcccaacgg cggcctcagg agtgatcggg tcgg 6L ccggccagcg gacggcagag cgggcggacg ggtaggcccg gcctgctctt cgcgaggagg 12L aagaaggtgg ccactctccc ggtccccaga acctccccag cccccgcagt ccgcccagac l8; Cgtaaagggg gacgctgagg ggac gctctccccg gtgccgccgc cgctgccgcc 24; gccatggctg ccatgatgga tcggaagtga gcattagggt taacggctgc cggcgccggc ; agtc ccggctcccc ggccgcctcc acccggggaa gcgcagcgcg gcgcagctga 36; ctgctgcctc tcacggccct cgcgaccaca agccctcagg tccggcgcgt tccctgcaag 42; actgagcggc ggggagtggc tcccggccgc cggccccggc tgcgagaaag atggcggacc 48; tggccgagtg caacatcaaa gtgatgtgtc gcttcagacc cgag tctgaagtga 54; accgcggcga caagtacatc tttc agggagaaga cacggtcgtg atcgcgtcca 60; agccttatgc atttgatcgg gtgttccagt caagcacatc tcaagagcaa aatg 66; actgtgcaaa gaagattgtt aaagatgtac ttgaaggata taatggaaca atatttgcat 72; atggacaaac atcctctggg aagacacaca caatggaggg taaacttcat gatccagaag 78; gcatgggaat tattccaaga atagtgcaag atatttttaa ttatatttac tccatggatg 84; aaaatttgga atttcatatt aaggtttcat attttgaaat ggat aagataaggg 90; acctgttaga tgtttcaaag accaaccttt cagttcatga agacaaaaac cgagttccct 96; aggg gtgcacagag cgttttgtat gtagtccaga tgaagttatg gataccatag L02; atgaaggaaa atccaacaga catgtagcag ttacaaatat gaatgaacat agctctagga L08: gtcacagtat atttcttatt aatgtcaaac aagagaacac ggaa caaaagctga L14; aact ttatctggtt gctg gtagtgaaaa ggttagtaaa aaggtgctgt gctggatgaa gctaaaaaca tcaacaagtc actttctgct ctgc tgag acat atgttccata tcgagatagt gaatccttca agattcatta ggtggcaact gtagaaccac tattgtaatt catcatcata caatgagtct gaaacaaaat ctacactctt atttggccaa agggccaaaa L44; caattaagaa ttgt gtcaatgtgg agttaactgc agaacagtgg aaaaagaagt L50; atgaaaaaga aaaagaaaaa atcc tgcggaacac tattcagtgg cttgaaaatg L56; agctcaacag atggcgtaat ggggagacgg tgcctattga tgaacagttt gacaaagaga L62; aagccaactt ggaagctttc acagtggata aagatattac tcttaccaat gataaaccag L68; caaccgcaat tggagttata ggaaatttta ctgatgctga aagaagaaag gaag L74; aaattgctaa attatacaaa cagcttgatg atga agaaattaac cagcaaagtc L80; aactggtaga gaaactgaag acgcaaatgt tggatcagga tttg gcatctacca L86; gaagggatca agacaatatg caagctgagc tgaatcgcct tcaagcagaa aatgatgcct L92; ctaaagaaga agtgaaagaa cagg ccctagaaga acttgctgtc aattatgatc L98; agaagtctca ggaagttgaa gacaaaacta aggaatatga attgcttagt gatgaattga 204; atcagaaatc ggcaacttta gcgagtatag atgctgagct tcagaaactt aaggaaatga 210; ccaaccacca gaaaaaacga gcagctgaga tgatggcatc tttactaaaa gaccttgcag 216; aaataggaat tgctgtggga aataatgatg taaagcagcc aact ggcatgatag 222; atgaagagtt cactgttgca agactctaca ttagcaaaat agaa gtaaaaacca 228; tggtgaaacg ttgcaagcag ttagaaagca cacaaactga gagcaacaaa gaag 234; aaaatgaaaa ggagttagca gcatgtcagc tctc tcaacatgaa gccaaaatca 240; agtcattgac tgaatacctt caaaatgtgg aacaaaagaa gttg tctg 246; tcgatgccct cagtgaagaa ctagtccagc ttcgagcaca agagaaagtc atgg 252; aaaaggagca cttaaataag actg caaatgaagt taagcaagct gttgaacagc 258; agatccagag ccatagagaa actcatcaaa aacagatcag tagtttgaga gatgaagtag 264; aagcaaaagc aaaacttatt actgatcttc aagaccaaaa ccagaaaatg atgttagagc 270; aggaacgtct aagagtagaa catgagaagt tgaaagccac agatcaggaa aagagcagaa 276; aactacatga acttacggtt gata gacgagaaca agcaagacaa gacttgaagg 282; gtttggaaga gacagtggca aaagaacttc agactttaca gcgc aaactctttg 288: ttcaggacct ggctacaaga gttaaaaaga gtgctgagat tgattctgat gacaccggag 2941 gcagcgctgc gcaa aaaatctcct ttcttgaaaa taatcttgaa cagctcacta 300; aagtgcacaa acagttggta cgtgataatg cagatctccg ctgtgaactt cctaagttgg 306; aaaagcgact tcgagctaca gctgagagag tgaaagcttt ggaatcagca ctgaaagaag 312; ctaaagaaaa tgcatctcgt gatcgcaaac gctatcagca agaagtagat cgcataaagg 318; aagcagtcag gtcaaagaat atggccagaa gagggcattc tgcacagatt gctaaaccta 324; ccgg gcaacatcca gcagcttctc caactcaccc aagtgcaatt Cgtggaggag 330; gtgcatttgt tcagaacagc gtgg cagtgcgagg tggaggaggc aaacaagtgt 336; aatcgtttat acatacccac aggtgttaaa tcga aaga ggacatggta 342; tcaagcagtc attcaatgac tataacctct actcccttgg gattgtagaa ttataacttt 348; taaaaaaaat gtataaatta tacctggcct gtacagctgt ttcctaccta ctcttcttgt 354; aaactctgct gcttcccaac acaactagag tgcaattttg gcatcttagg agggaaaaag 360; gacagtttac aactgtggcc ctatttatta cacagtttgt ctatcgtgtc ttaaatttag 366; tctttactgt gccaagctaa ctgtacctta taggactgta ctttttgtat tttttgtgta 372; tttt ttaatctcag tttaaattac ctagctgcta cttg tttttctttt 378; cctattaaaa cgtcttcctt tttttttctt aagagaaaat ggaacattta ggttaaatgt 384; ctttaaattt taccacttaa taca tgcccataaa atatatccag tcagtactgt 390; aaat cccttgaaat gatgatatca gggttaaaat tacttgtatt gtttctgaag 396; tttgctcctg aaaactactg tttgagcact gaaacgttac aaatgcctaa taggcatttg 402; agactgagca aggctacttg ttatctcatg aaatgcctgt tgccgagtta ttttgaatag 408; aaatatttta caaa agcagatctt agtttaaggg agtttggaaa aggaattata 414; tttctctttt tcctgattct gtactcaaca agtcttgatg gaattaaaat actctgcttt 420; attctggtga gcctgctagc taatataagt attggacagg tttg tcatctttaa 426; tattagtaaa atgaattaag atattatagg attaaacata tacg gttagtactt 432; tattggccga cctaaattta tagcgtgtgg aaattgagaa aaatgaagaa acaggacaga 438; tatatgatga attaaaaata tatataggtc aattttggtc tgaaatccct gaggtgtttt 444; taacctgcta tttg tacactaatt tatttcttta gtctagaaat agtaaattgt 450; ttgcaagtca ctaataatca ttagataaat tattttcttg gccatagccg ataattttgt 456; aatcagtact aagtgtatac gtatttttgc cactttttcc tcagatgatt aaagtaagtc 462; aacagcttat aaac tgtaaaagta aaag agatttcact atttgcttca 468; gtag gggggcggtg actg tgttagcaga aattcacaga gaatggggat 4741 ttaaggttag cagagaaact tggaaagttc tgtgttagga tcttgctggc aact 480; ttttgcaaaa gttttataca cagatatttg tattaaattt ggagccatag tcagaagact 486; cagatcataa ttggcttatt tttctatttc tatt gtaatttcca cttttgtaat 492; aattttgatt taaaatataa atttatttat ttattttttt aatagtcaaa aatctttgct 498; gttgtagtct gcaacctcta aaatgattgt gttgctttta ggattgatca gaagaaacac 504; tccaaaaatt gagatgaaat gcag ccagttataa gtaatatagt taacaagcaa 510; aaaaagtgct gccacctttt atgatgattt tctaaatgga gaaacatttg gctgcatcca 516; catagacctt tatgttttgt tttcagttga aaacttgcct cctttggcaa cattcgtaaa 522; tgaagcagaa tttc tcttttttcc aaatatgtta gttttgttct tgtaagatgt 528; atcatgggta ttggtgctgt gtaatgaaca acgaatttta attagcatgt ggttcagaat 534; atacaatgtt aggtttttaa aaagtatctt gatggttctt ttctatttat aatttcagac 540; tttcataaag tgtaccaaga taaa tttgttttca gtgaactgct ttttgctatg 546; gtaggtcatt aaacacagca cttactctta aaaatgaaaa tttctgatca tctaggatat 552; tgacacattt caatttgcag tgtctttttg actggatata ttaacgttcc tctgaatggc 558; attgatagat ggttcagaag agaaactcaa tgaaataaag ttta gcga 564; aaat tatttgccta acttaagaaa actactgtgc gtaactctca gtttgtgctt 570; aactccattt gacatgaggt gacagaagag agtctgagtc tacctgtgga atatgttggt 576; ttattttcag tgcttgaaga caca tggt ttgggaagac accgtttaat 582; tttaagttaa cttgcatgtt gtaaatgcgt tttatgttta aataaagagg aaaatttttt 5881 gaaatgtaaa aaaaaaaaaa aaaaa Protein sequence: NCBI Reference Seguence: NP_004512.1 LOCUS NP_004512 ACCESSION N P_004512 l madlaecnik vmcrfrplne sevnrgdkyi aqugedtvv iaskpyafdr qusstsqeq 6; vyndcakkiv yngt ifaygqtssg kthtmegklh dpegmgiipr iquifnyiy 12; smdenlefhi kvsyfeiyld kirdlldvsk tnlsvhedkn rvpyvkgcte devm 18; ksnr hvavtnmneh ssrshsifli nvkqentqte qklsgklylv dlagsekvsk 24; tgaegavlde akninkslsa lgnvisalae gstyvpyrds kmtrilqul ggncrttivi ; ccspssynes lfgq raktikntvc vnveltaeqw kkkyekekek nkilrntiqw 36; lenelnrwrn getvpideqf leaf tvdkditltn dkpataigvi gnftdaerrk 42; ceeeiaklyk qlddkdeein eklk tqmldqeell astrrdqdnm qaelnrlqae 48; ndaskeevke vlqaleelav nyqusqeve dktkeyells delnqksatl asidaelqkl 54; kemtnhqkkr aaemmasllk iavg nndvkqpegt gmideeftva rlyiskmkse 60; vktmvkrckq lestqtesnk kmeenekela acqlrisqhe akikslteyl qnveqkqul 661 lsee 1vqlraqekv hemekehlnk vqtanevkqa veqqiqshre thqkqisslr 721 akli nqkm mleqerlrve heklkatdqe eltv qurreqarq 781 dlkgleetva ke1qtlhnlr klqudlatr vkksaeidsd dtggsaaqkq kisflennle 841 qltkvhkqlv rdnadlrcel lrat aervkalesa lkeakenasr drkryqqevd 901 rikeavrskn marrghsaqi akpirpgth aaspthpsai rgggaquns qpvavrgggg 961 kqv RP325 Official Symbol: RP825 Official Name: ribosomal protein 825 Gene ID: 6230 Organism: Homo sapiens Other Aliases: S25 Other Designations: 408 ribosomal protein 825 Nucleotide seguence: NCBI Reference Seguence: NM_001028.2 LOCUS NM_001 028 ACCESSION NM_001028 1 cttccttttt gtccgacatc ttgacgaggc tgcggtgtct attc tccgagcttc 61 gcaatgccgc ctaaggacga caagaagaag aaggacgctg gaaagtcggc caagaaagac 121 aaagacccag tgaacaaatc nggggcaag gccaaaaaga agaagtggtc caaaggcaaa 181 gttcgggaca agctcaataa cttagtcttg tttgacaaag ctacctatga ctgt 241 aaggaagttc ccaactataa acttataacc ccagctgtgg tctctgagag actgaagatt 301 cgaggctccc tggccagggc agcccttcag gagctcctta gtaaaggact tatcaaactg 361 aagc acagagctca agtaatttac accagaaata gtgg agatgctcca 421 qctqctqqtq aagatgcatg aataggtcca accagctgta gaaa aaaaaaaaaa aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_001019.1 LOCUS NP_001019 ACCESSION N P_001019 1 mppkddkkkk dagksakkdk dpvnksggka kkkkwskgkv lvlf dkatydklck 61 litp lkir gslaraalqe llskgliklv viyt rntkggdapa 121 ageda HSP90AB1 Official Symbol: HSP90AB1 Official Name: heat shock protein 90kDa alpha (cytosolic), class B member 1 m: 3326 Organism: Homo sapiens Other Aliases: RP1-302G2.1, , HSP84, HSP90-BETA, HSPQOB, HSPC2, HSPCB Other Designations: 90-kda heat shock protein beta HSP90 beta; heat shock 84 kDa; heat shock 90kD protein 1, beta; heat shock 90kDa protein 1, beta; heat shock protein HSP 90-beta; heat shock protein beta Nucleotide seguence: NCBI Reference Seguence: NM_007355.2 LOCUS NM_007355 ACCESSION NM_007355 1 ctccggcgca gtgttgggac ggta tcggaaagca agcctacgtt gctcactatt 61 aatc cttttctttt caagatgcct gaggaagtgc accatggaga ggaggaggtg 121 gagacttttg cctttcaggc agaaattgcc atgt ccctcatcat cttc 181 tattccaaca aggagatttt ccttcgggag ttgatctcta ctga tgccttggac 24- aagattcgct atgagagcct gacagaccct tcgaagttgg acagtggtaa agagctgaaa ; attgacatca tccccaaccc tcaggaacgt accctgactt tggtagacac aggcattggc 36L atgaccaaag ctgatctcat aaataatttg ggaaccattg ccaagtctgg tactaaagca 42; ttcatggagg ctcttcaggc tggtgcagac atctccatga ttgggcagtt tggtgttggc 481 ttttattctg cctacttggt ggcagagaaa gtggttgtga tcacaaagca caacgatgat 541 gaacagtatg cttgggagtc ttctgctgga ggttccttca ctgtgcgtgc tgaccatggt 60; gagcccattg gcaggggtac caaagtgatc ctccatctta aagaagatca gacagagtac 66; gaga ggcgggtcaa agtg aagaagcatt ctcagttcat tccc 72; cttt agaa ggaacgagag aaggaaatta gtgatgatga ggcagaggaa 78; gagaaaggtg agaaagaaga ggaagataaa gatgatgaag aaaaacccaa gatcgaagat 84; gtgggttcag atgaggagga tgacagcggt aaggataaga agaagaaaac taagaagatc 90; aaagagaaat acattgatca ggaagaacta aacaagacca agcctatttg gaccagaaac 96L cctgatgaca tcacccaaga ggagtatgga gaattctaca tcac taatgactgg L02; gaagaccact tggcagtcaa gcacttttct gtagaaggtc aatt cagggcattg L08; ctatttattc ctcgtcgggc tccctttgac ctttttgaga acaagaagaa aaagaacaac 114; atcaaactct gccg tgtgttcatc atggacagct gtgatgagtt tatctcaatt ttatccgtgg tgtggttgac gatc tgcccctgaa gaaatgctcc agcagagcaa gaaa gtcattcgca aaaacattgt cttgagctct tctctgagct ggcagaagac aaggagaatt acaagaaatt ttctctaaaa atctcaagct tggaatccac gaagactcca ctaaccgccg ccgcctgtct L44; ctgc gctatcatac ctcccagtct ggagatgaga tgacatctct gtcagagtat L50; gtttctcgca tgaaggagac acagaagtcc atctattaca tcactggtga gagcaaagag L56; caggtggcca actcagcttt tgtggagcga gtgcggaaac ggggcttcga ggtggtatat L62; atgaccgagc ccattgacga gtactgtgtg cagcagctca aggaatttga tgggaagagc L68; ctggtctcag ttaccaagga gggtctggag gagg atgaggagga gaagaagaag L74; atggaagaga gcaaggcaaa gtttgagaac ctctgcaagc aaga aatcttagat L80; aagaaggttg agaaggtgac aatctccaat agacttgtgt cttcaccttg ctgcattgtg L86; accagcacct acggctggac agccaatatg gagcggatca tgaaagccca ggcacttcgg L92; gacaactcca ccatgggcta tatgatggcc aaaaagcacc tggagatcaa ccctgaccac L98; cccattgtgg agacgctgcg gcagaaggct gaggccgaca agaatgataa ggcagttaag 204; gacctggtgg tgctgctgtt tgaaaccgcc ctgctatctt tttc ccttgaggat 2101 ccccagaccc actccaaccg catctatcgc atgatcaagc taggtctagg tattgatgaa 216; gatgaagtgg cagcagagga acccaatgct gcagttcctg atgagatccc ccctctcgag 222; ggcgatgagg atgcgtctcg catggaagaa gtcgattagg ttaggagttc atagttggaa 228; tgcc cttgtatagt gtccccatgg gctcccactg cagcctcgag tgcccctgtc 234; ccacctggct ccccctgctg gtgtctagtg tttttttccc tctcctgtcc ttgtgttgaa 2401 ggcagtaaac taagggtgtc aagccccatt ccctctctac tcttgacagc aggattggat 2461 tatt gtggtttatt ttattttctt cattttgttc tgaaattaaa gtatgcaaaa 2521 taaagaatat gccgttttaa aaaaaaaaaa aaaaaaaaaa aaaaaaa Protein seguence: NCBI nce Seguence: NP_031381.2 LOCUS NP_031381 ACCESSION N P_031381 l mpeevhhgee evetfafqae iaqlmsliin tfysnkeifl relisnasda ldkiryeslt 6L dpskldsgke lkidiipnpq ertltlvdtg igmtkadlin nlgtiaksgt kafmealqag l2; adismigqu ngysaylva ekvvvitkhn ddeqyawess aggsftvrad hgepigrgtk l8; vilhlkedqt eyleerrvke qfig leke rekeisddea eeekgekeee 24; dkddeekpki edvgsdeedd sgkdkkkktk kikekyidqe elnktkpiwt rnpdditqee ; ygefyksltn dwedhlavkh fsvegqlefr allfiprrap fdlfenkkkk nniklyvrrv 36; fimdscdeli peylnfirgv plni sremquski lkvirknivk kclelfseLa 42; edkenykkfy eafsknlklg ihedstnrrr lsellryhts qsgdemtsls eyvsrmketq 48; ksiyyitges keqvansafv ervrkrgfev vymtepidey fdg kslvsvtkeg 54; lelpedeeek kkmeeskakf enlcklmkei kvti snrlvsspcc gwta 60; nmerimkaqa lrdnstmgym makkhleinp dhpivetqu kaeadkndka vkdlvvllfe 66; gfsl edpqthsnri yrmiklglgi dedevaaeep naavpdeipp legdedasrm 72; eevd LMO7 Official Symbol: LMO7 Official Name: LIM domain 7 Gene ID: 4008 Organism: Homo sapiens Other Aliases: RP11-332E32, FBX20, FBXOZO, LOMP Other Designations: F-box only protein 20; F-box protein Fbx20; LIM domain only 7 protein; LIM domain only protein 7; LMO-7; zinc-finger -containing protein Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_005358.5 LOCUS NM_005358 ACCESSION NM_005358 l ggaaagaagt ggaataatta ggaacctagg tagg gtagcaggac atttcaaaca 6L agca tatgagattc caggtcttgt taaaatgcaa attctgattc tagg 12L tgaggtctga gattgtgcat ttctaacaag cactcagata aggc tgttggcccc l8; agggtcacac ttatagtgat tttctagaac ccagttgggg aagtgaatct tgggcaggag 24L acac ctcttgcatt gagtttggag atctcatctg atataacttt ttaagaaaga ; aaaataattt tccaaatatc caattgataa gctttcccac taagtggctt tcccactaag 36; tggctgcgtt atgaaaattg cttcactttg aaacttctgg tcttggtaat atagaatttc 42; tgtgttctca cagtgcttga ttgagaatat gatattgaga ttatggcata aaatatagtg 48; gctgtacaaa aaaaaataca ttattaggat ctctaacaat tatgtaaaag tca :tgcttc 54; atgggtagag ctcaaacttt ggtgtgagac ctggttttat tcttggcact tac :ctgagt 60; tgtcttaggc aaattaatac cttaagcaaa aatattctca tgtacatttt aca :gagaat 66; tataaatgaa gtacataaag tccagcagtc acaaatgtta tctattatta cca :cgtcct 72; aagactgcaa tcagctatag tgaaagtagt ctcaaagatt gtttcataaa tca :cagatt 78; cacctaattt tctaaagaat ttaaataagg agatggaatg aatagattgc att :tgtttc 84; cagg ggaactgtgc atatttcttc tgtgactcgg aaatggttta act :ttaaaa 90; atcccaaaat agctgaagtt agcagacatg caatttacca aggatgattg gaa :ttttat 96; ctttcctgta ataatactat gcac actgctcatg aggaaaacat ttttatgtga L02; atcttttact ggca aagaatgctg tttt tgataactat gtttatagaa L08; tcac cctgagcaat tatttcaaca tctaaagtta ttattaccat tcatgtttca L14; gcta tttt gatgaatttc aatatggtgc tacagtgata gggcaagtgc L201 aaataagttc aatatatggg tacggtctaa agctatttta atttttttat atgaagaaaa tatg ccatattttc acgttttaca gttggatgtc ctcttccaga gaacagagct tctg gaaatttgga ggcaactgat gtctgcatct gtgtgggttg tctc agggacagag tctgcagcaa aaaagatata L44; attttgagga ctgaacaaaa ttcaggaagg actattctca ttaaggcagt aacagagaag L50; aattttgaaa caaaagattt tcgagcctct ctagaaaatg gtgttctgct gtgtgatttg L56; attaataagc ctgg cgtcattaag aata gactgtctac accaatagca L62; ggattggata atataaacgt tttcttgaaa gcttgtgaac agattggatt gaaagaagcc L68; cagcttttcc atcctggaga tctacaggat ttatcaaatc gagtcactgt caagcaagaa L74; gagactgaca tgaa aaatgttttg ataacattgt actggctggg aagaaaagca L80; caaagcaacc cgtactataa tggtccccat cttaatttga ttga gaatctttta L86: ggacaagcac tgacgaaggc actcgaagac tccagcttcc tgaaaagaag agtggctacg gtgacatctg gtgtcctgaa cgtggagaat ttcttgctcc cataagagag aagattcctt tgaaagcttg gactctttgg gctcgaggtc attgacaagc 204; tgctcctctg atatcacgtt gagagggggg cgtgaaggtt gtga cacagattcg 210; acat ttaagatgca ggattataat aaagatgata tgtcgtatcg aaggatttcg 216; gagc ctgc cttc aatcgttttt tacccaacaa aagtagacag 222; ccatcctatg taccagcacc tctgagaaag ccag acaaacatga ggataacaga 228; agaagttggg caagcccggt ttatacagaa gcagatggaa cattttcaag actctttcaa 234; aagatttatg gtgagaatgg gagtaagtcc atgagtgatg tcagcgcaga agatgttcaa 240; cgtc agctgcgtta Cgaggagatg cagaaaataa aatcacaatt aaaagaacaa 246; gatcagaaat ggcaggatga ccttgcaaaa tggaaagatc gtcgaaaaag ttacacttca 252; gatctgcaga agaaaaaaga agagagagaa gaaattgaaa agcaggcact tgagaagtct 258; agct ctaagacgtt taaggaaatg ctgcaggaca gggaatccca aaatcaaaag 264; tctacagttc cgtcaagaag gagaatgtat tcttttgatg atgtgctgga ggaaggaaag 270; cgacccccta caatgactgt gtcagaagca agttaccaga gtgagagagt agaagagaag 276; ggagcaactt atccttcaga aattcccaaa gaagattcta ccacttttgc aaaaagagag 282; gaccgtgtaa caactgaaat tcagcttcct tctcaaagtc ctgtggaaga acaaagccca 288; gcctctttgt cttctctgcg ttcacggagc acacaaatgg ctcg tgtttcagct 294: tctctcccca gaagttaccg gaaaactgat acagtcaggt taacatctgt ggtcacacca 3001 agaccctttg gctctcagac aaggggaatc tcatcactcc ccagatctta cacgatggat 306; gatgcttgga agtataatgg agatgttgaa gacattaaga gaactccaaa caatgtggtc 312; agcacccctg caccaagccc ggacgcaagc caactggctt caagcttatc tagccagaaa 318; gaggtagcag caacagaaga agatgtgaca aggctgccct catc ccccttctca 324; tctctttccc aagaccaggc tgccacttct aaagccacat tgtcttccac atctggtctt 330; gatttaatgt ctgaatctgg agaaggggaa atctccccac aaagagaagt ctcaagatcc 336; caggatcagt tcagtgatat gagaatcagc ataaaccaga ggaa gagtcttgac 342; tttgggttta caataaaatg ggatattcct gggatcttcg tagcatcagt tgaagcaggt 348; agcccagcag ctca gctacaagta gaaa ttattgctat taacaacacc 354; aagttttcat ataacgattc aaaagagtgg gaggaagcca tggctaaggc tcaagaaact 360; ggacacctag tgatggatgt gaggcgctat ggaaaggctg gttcacctga aacaaagtgg 366; attgatgcaa cttctggaat ttacaactca gaaaaatctt caaatctatc tgtaacaact 372; gatttctccg aaagccttca gagttctaat attgaatcca aagaaatcaa tggaattcat 378; gatgaaagca ttga atcaaaagca tctgaatcca tgaa aaaa 384; aggcgatcac aattttttga acaaggaagc tctgattcgg tggttcctga tcttccagtt 390; ccaaccatca gtgccccgag tcgctgggtg tgggatcaag aggaggagcg gcag 396; gagaggtggc agaaggagca ggaccgccta ctgcaggaaa aatatcaacg tgagcaggag 402; aggg aagagtggca aagggccaaa gcag agagagagaa ttccaagtac 408; ttggatgagg aactgatggt cctaagctca aacagcatgt ctctgaccac acgggagccc 414; tctcttgcca cctgggaagc tacctggagt gaagggtcca agtcttcaga cagagaagga 420; acccgagcag gagaagagga gaggagacag ccacaagagg aagttgttca tgaggaccaa 426; ggaaagaagc Cgcaggatca gcttgttatt gaga ggaaatggga gcaacagctt 432; caggaagagc aagagcaaaa gcggcttcag gctgaggctg aggagcagaa gcgtcctgcg 438; gaggagcaga agcgccaggc agagatagag cgggaaacat cagtcagaat ataccagtac 444; aggaggcctg ttgattccta tgatatacca aagacagaag aagcatcttc aggttttctt 450; cctggtgaca ggaataaatc cagatctact actgaactgg atgattactc cacaaataaa 456; aatggaaaca ataaatattt agaccaaatt gggaacatga caca gaggagatcc 462; aagaaagaac aagtaccatc aggagcagaa ttggagaggc aacaaatcct tcaggaaatg 468; agaa ttca caatgacaac agctggatcc gcag tgccagtgtc 474; aacaaagagc ctgttagtct tcctgggatc atgagaagag gcgaatcttt agataacctg 4801 gactcccccc gatccaattc ttggagacag cctccttggc tcaatcagcc cacaggattc 486; tatgcttctt cctctgtgca agactttagt ccac ctcagctggt gtccacatca 492; aaccgtgcct acatgcggaa cccctcctcc agcgtgcccc caccttcagc tggctccgtg 498; tcca ccacaggtgt ggccaccaca cagtccccca ccccgagaag ccattcccct 504; tcagcttcac gctc tcagctgcgt aacaggtcag tcagtgggaa gcgcatatgc 510; tcctactgca ataacattct gggcaaagga gccgccatga tcatcgagtc cctgggtctt 516; tgttatcatt tgcattgttt taagtgtgtt gcctgtgagt gtgacctcgg aggctcttcc 522; tcaggagctg aagtcaggat cagaaaccac caactgtact gcaacgactg ctatctcaga 528; tctg gacggccaac gtga tgtaagcctc catacgaaag cactgttgca 534; gatagaagaa gaggtggttg ctgctcatgt agatctataa atatgtgttg tatgtctttt 540; ttgctttttt tttaaaaaaa agaataactt tttttgcctc tttagattac atagaagcat 546: tgtagtcttg gtagaaccag tatttttgtt gtttatttat aaggtaattg tgtgtgggga 552; aaagtgcagt atttacctgt tgaattcagc atcttgagag cacaagggaa agaa 558; cctacgaata tttttgaggc agataatgat ctagtttgac tttctagtta tttt 564; gaagagggta ttttattgtt ttttaaaaaa aggttcttaa acattatttg aaatagttaa 570; tataaataca taattgcatt tgctctgttt attgtaatgt attctaaatt aatgcagaac 576; catatggaaa atttcattaa aatctatccc caaatgtgct ttctgtatcc ttccttctac 582; ctattattct gatttttaaa aatgcagtta atgtaccatt tatttgcttg atgaagggag 588; ctctattttc tttaccagaa atgttgctaa ccca atagaaagct gcttattttc 594; attaatgaaa aataaccatg gtttgtatac tagaagtctt cttcagaaac tggtgagcct 600; ttctgttcaa ttgcatttgt aaataaactt gctgatgcat ttaacgagtg ggtcgtcttt 606; ttcttaggtg tatgtgtctg ggcc ccat atttcagtat gtggcctttt 612; ttgatgttat tcca gtagctttac taaggtataa ttgatgtaat aaactgcata 618; tatttaaagt gtatactttg acaaattttg acatggtgta taccttcgaa actatgccac 624; agtctggatg ctga aacattttaa taaggaagtt tatttttgat aaagttatgt 630; ttttggatac aatatatttg tatggtgaga gtgatgaatt gttggatcat ttgaataaaa 636; tcttttacta accccatgat agaa agtg gaat atctataaag 642; caaaaaatgt agtctcttgt aatc tggagcggga atgcaaggat acaaaacttt 648; agcatgcttt gagcaaaaat ttaaacttac cttt tataataatg taagtggaat 654: ggaggattct aggaactgag aactgtattg gaataggttc aaaatatgta agaaatgcta 6601 atgtgggaga taaaaatttt atttagtact tattctgatt attattaaag taataatgtg 666; ttccttgagg ataacttgtc aaatgcccca aagcataaag aatataattc tgaatcccaa 672; attccaaaga caagaactct gtgtttgaat tcattctgca tatt tataagtata 678; gattgtgaat ttttccatgt aatt atttttatct atgg ttgcatagtg 684; ctccattgtt tggccttggt aatatttagt tgataattcc gtgt atttttcact 6901 tgtttctaag atcaaacatt ttaatatgtg catgttatat ataaatatgt aaattctgtg 6961 atactctatg atcatctctt tctttatatt attttcatag acatgaaata gttgctcaga 7021 gattatgcat tttaagacac tcatagtata tattgccaaa gtggtttcca gaaaggcact 7081 gctggcttcg actcctataa gcagcacgtg ggcttgttca tctcactgca tgtttatgaa 7141 gatacagttc ttttgccttg gcct gatgtgtatg cagaggcagc cctcaatatg 7201 ttga ataaatgaat gaagaaacca ctatcaaaaa aaaaaaaaaa aaa n seguence (variant 1): NCBI Reference Seguence: NP_005349.3 LOCUS N P_005349 ACCESSION NP_005349 1 mkkirichif tfyswmsydv lfqrtelgal eiqulicah vcicvgwlyl kkdi 6; ilrteqnsgr tilikavtek nfetkdfras lengvllcdl inklkpgvik kinrlstpia 121 gldninvflk aceqiglkea qlfhpgdlqd lsnrvtvkqe knvl itlywlgrka 18; qsnpyyngph 1nlkafen11 gqaltkaled ssflkrsgrd sgygdiwcpe rgeflapprh 24; hkredsfesl dslgsrslts cssditlrgg regfesdtds eftfkquyn kddmsyrris ; avepktalpf nrflpnksrq psyvpaplrk kkpdkhednr rswaspvyte adgtfsrlfq 36; kiygengsks edvq nqulryeem qkiksqlkeq quwqddlak wkdrrksy:s 42; dlqkkkeere leks krssktfkem lqdresqnqk stvpsrrrmy sfddvleegk 48; rpptmtvsea syqserveek gatypseipk edsttfakre drvtteiqlp sqspveeqsp 54; aslsslrsrs rvsa slprsyrktd tvrltsvvtp rpfgsqtrgi sslprsytmd 60; dawkyngdve dikrtpnnvv stpapspdas qlasslssqk evaateedvt rlpsptspfs 66; slsqdqaats katlsstsgl dlmsesgege iqurevsrs qdqfsdmris inqtpgksLd 72; fgftikwdip gifvasveag spaefsqlqv ddeiiainnt kfsyndskew aqet 78L ghlvmdvrry gkagspetkw idatsgiyns ekssnlsvtt dfseslqssn ieskeingih 84L desnafeska sesislknlk qus sdsvvpdlpv ptisapsrwv wdqeeerqu 90; erqueqdrl lqekyqreqe klreewqrak qeaerensky 1dee1mvlss nsmslttrep 961 slatweatws egskssdreg trageeerrq pqeevvhedq gkkpqdqlvi ererkweqql 1021 qeeqeqkrlq krpa eeqquaeie retsvriyqy rrpvdsydip kteeassgfl 1081 pgdrnksrst telddystnk ngnnkyldqi gnmtssqrrs sgae 1erqqi1qem 1141 rkrtplhndn swirqrsasv nkepvslpgi mrrgesldnl dsprsnsqu ppwlnqptgf 1201 yassqudfs rpppqlvsts nraymrnpss svpppsagsv ktsttgvatt qsptprshsp 1261 sasqsgsqlr nrsvsgkric sycnnilgkg aamiieslgl cyhlhcfkcv acecdlggss 1321 sgaevrirnh qucndcylr fksgrptam Nudeofideseguence(vafiantZy NCBI Reference Seguence: NM_015842.2 LOCUS NM_01 5842 ACCESSION 842 l aacaggtaat cgtg ccagtcacaa caga aacagtgtat gcccgggcat 6; aagatagcac gactgtgtat gctctggagg actgaaaggc tgtacaagcc attt 12; aaat atacatatgc atgggtcttg ctgctgcctc ttttgctgac tgtaattgga 18; agct tcgaagttat atcataaaaa tttgtaacct ttgtctgaga gagagctcag 24; ctaagcaatc actttccact tcttttcaca ggataatata aacgttttct tgaaagcttg ; tgaacagatt aaag aagcccagct tttccatcct ggagatctac aggatttatc 36; aaatcgagtc aagc agac tgacaggaga gtgaaaaatg ttttgataac 42; attgtactgg ctgggaagaa aagcacaaag caacccgtac tataatggtc cccatcttaa 48; tttgaaagcg tttgagaatc ttttaggaca agcactgacg ctcg aagactccag 54; cttcctgaaa agaagtggca gggacagtgg ctacggtgac atctggtgtc ctgaacgtgg 60; agaatttctt gctcctccaa ggcaccataa gagagaagat tcctttgaaa gcttggactc 66; tttgggctcg aggtcattga caagctgctc tatc acgttgagag gggggcgtga 72; aggttttgaa agtgacacag attcggaatt tacatttaag atgcaggatt ataataaaga 78; tgatatgtcg tatcgaagga tttcggctgt tgagccaaag actgcgttac ccttcaatcg 84; ttttttaccc aacaaaagta gacagccatc ctatgtacca gcacctctga gaaagaaaaa 90; gccagacaaa catgaggata acagaagaag ttgggcaagc ccggtttata cagaagcaga 96L tggaacattt tcaagtaatc agaggaggat ttggggcacc aatgtggaga actggccaac 102; tgtacaagga acttcaaagt cctcttgtta agag gaaaaagcaa agacaagaag 108; caac attgtaaagg atgatcttta tgtgcgcaag ctcagtccag caaa 114; gaat gcttttgatc agtttcttcc caaatgttgg accccagaag atgtgaactg 120; gaaaagaata aaaagggaaa cttataagcc atggtataaa gaatttcagg gattcagtca L26; gtttttactg cttcaggccc tccaaacata ctctgatgac atcttgtctt caca L32; taccaaaatt gatcccactt ctggcccaag gctcataacc aaga ctta L38; tgcaccaggc tatagaagag atgacctcga gatggcagcc ctggatcctg acttagagaa L44; tgatgatttc tttgtcagaa agactggggc tttccatgca aatccatatg ttctccgagc L50; ttttgaagac tttagaaagt tctctgagca agatgattct gtagagcgag atataatttt L56; acagtgtaga gaaggtgaac ttgtacttcc ggatttggaa aaagatgata ttcg L62: ccgaattcca aaga aagaagtgcc gctgtctggg gccccagata agtccctttt cccgaaccct ggactcttcc tccagaaatt caagcaaaat ttctctgtgt L74; acttgaaagg acatgcccat ccaaagaaaa aagtaatagc tgtagaatat atatcggcag aagaaagatg acatgctgac acgtaagatt cagtcctgga taccgtgcct cccatcagtt tcacccctgg cccctgcagt gaggctgact ggaggccatc gcca gcagacttag gcacaagaaa aggctgatgg aaag atttatggtg ggag taagtccatg gtca gcgcagaaga 204; tgttcaaaac ttgcgtcagc tgcgttacga ggagatgcag aaaataaaat cacaattaaa 210; agaacaagat cagaaatggc aggatgacct atgg aaagatcgtc gaaaaagtta 216; cacttcagat ctgcagaaga aaaaagaaga gagagaagaa attgaaaagc aggcacttga 222; gaagtctaag agaagctcta ttaa ggaaatgctg caggacaggg aatcccaaaa 228; tcaaaagtct acagttccgt caagaaggag aatgtattct tttgatgatg tgctggagga 234; aggaaagcga ccccctacaa tgactgtgtc agaagcaagt taccagagtg agagagtaga 240; agagaaggga gcaacttatc cttcagaaat tcccaaagaa gattctacca cttttgcaaa 246; ggac cgtgtaacaa ctgaaattca gcttccttct cctg tggaagaaca 252; aagcccagcc tctttgtctt ctctgcgttc acggagcaca caaatggaat caactcgtgt 258; ttcagcttct ctccccagaa gttaccggaa aactgataca gtcaggttaa catctgtggt 264; cacaccaaga ccctttggct ctcagacaag gggaatctca tcactcccca gatcttacac 270; gatggatgat gcttggaagt gaga tgttgaagac attaagagaa ctccaaacaa 2761 tgtggtcagc acccctgcac caagcccgga cgcaagccaa tcaa gcttatctag 282; agag gtagcagcaa cagaagaaga tgtgacaagg ctgccctctc ctacatcccc 288; cttctcatct ctttcccaag accaggctgc cacttctaaa gccacattgt cttccacatc 294; tggtcttgat ttaatgtctg aatctggaga aggggaaatc tccccacaaa gagaagtctc 300; aagatcccag gatcagttca gtgatatgag cata aaccagacgc ctgggaagag 306; tcttgacttt gggtttacaa taaaatggga tattcctggg atcttcgtag catcagttga 312; agcaggtagc ccagcagaat tttctcagct acaagtagat gatgaaatta ttgctattaa 318; caacaccaag ttttcatata acgattcaaa agagtgggag gaagccatgg ctaaggctca 324; agaaactgga cacctagtga tggatgtgag tgga aaggctggtt cacctgaaac 330; aaagtggatt actt ctggaattta caactcagaa aaatcttcaa atctatctgt 336; aacaactgat ttctccgaaa gccttcagag ttctaatatt gaatccaaag aaatcaatgg 342i aattcatgat gaaagcaatg cttttgaatc aaaagcatct gaatccattt ctttgaaaaa 348; cttaaaaagg cgatcacaat tttttgaaca aggaagctct gattcggtgg ttcctgatct 354; tccagttcca accatcagtg ccccgagtcg ctgggtgtgg gagg aggagcggaa 360; gcggcagqag aggtggcaga aggagcagga actg caggaaaaat atcaacgtga 366; gaaa ctgagggaag agtggcaaag ggccaaacag gaggcagaga gagagaattc 372; caagtacttg gaac tgatggtcct aagctcaaac agcatgtctc tgaccacacg 378; ctct acct gggaagctac ctggagtgaa gggtccaagt cttcagacag 384; agaaggaacc cgagcaggag aagaggagag gagacagcca gaag ttgttcatga 390; ggaccaagga aagaagccgc aggatcagct tgttattgag agagagagga aatgggagca 396; acagcttcag gaagagcaag agcaaaagcg gcttcaggct gaggctgagg agcagaagcg 402; tcctgcggag gagcagaagc gccaggcaga gcgg gaaacatcag tcagaatata 408; ccagtacagg aggcctgttg attcctatga tataccaaag acagaagaag catcttcagg 414; ttttcttcct ggtgacagga ataaatccag atctactact gaactggatg ccac 420; aaataaaaat ggaaacaata aatatttaga ccaaattggg aacatgacct cttcacagag 426; gagatccaag caag taccatcagg agcagaattg gagaggcaac aaatccttca 432; ggaaatgagg aagagaacac cccttcacaa tgacaacagc tggatccgac agcgcagtgc 438; caac cctg ttagtcttcc tgggatcatg agaagaggcg aatctttaga 444; taacctggac cgat ccaattcttg gagacagcct ccttggctca atcagcccac 450i aggattctat gcttcttcct aaga ctttagtcgc ccaccacctc agctggtgtc 4561 cacatcaaac cgtgcctaca tgcggaaccc ctcctccagc gtgcccccac cttcagctgg 462; gaag acctccacca caggtgtggc caccacacag tcccccaccc cgagaagcca 468; ttccccttca gcttcacagt caggctctca gctgcgtaac agtgtgttgc ctgtgagtgt 474; gacctcggag gctcttcctc aggagctgaa gtcaggatca acca actgtactgc 480; aacgactgct atctcagatt caaatctgga cggccaaccg ccatgtgatg taagcctcca 486; tacgaaagca ctgttgcaga tagaagaaga ggtggttgct gctcatgtag atctataaat 492; atgtgttgta tgtctttttt gctttttttt taaaaaaaag aataactttt tttgcctctt 498; tagattacat agaagcattg tagtcttggt agaaccagta tttttgttgt ttatttataa 504; ggtaattgtg tgtggggaaa agtgcagtat ttacctgttg aattcagcat cttgagagca 510; caagggaaaa aataagaacc tatt tttgaggcag ataatgatct agtttgactt 516; tctagttagt ggtgttttga agagggtatt ttattgtttt ttaaaaaaag gttcttaaac 522: attatttgaa atagttaata taaatacata attgcatttg ctctgtttat tgtaatgtat 528; tctaaattaa tgcagaacca tatggaaaat ttcattaaaa tctatcccca aatgtgcttt 534; ctgtatcctt ccttctacct attattctga aaaa tgcagttaat gtaccattta 540; tttgcttgat gaagggagct ctattttctt aaat gttgctaagt aattcccaat 546; agaaagctgc ttattttcat taatgaaaaa taaccatggt ttgtatacta gaagtcttct 552; tcagaaactg gtgagccttt aatt gcatttgtaa ataaacttgc tgatgcattt 558; aacgagtggg tcgtcttttt cttaggtgta tgtgtctgac ctcaggcctt ttagccatat 564; ttcagtatgt tttt gatgttatgt tttatccagt agctttacta aggtataatt 570; gatgtaataa tata tttaaagtgt atactttgac tgac atggtgtata 576; ccttcgaaac tatgccacag tctggatgtg tttactgaaa aata aggaagttta 582; tttttgataa agttatgttt ttggatacaa tgta tggtgagagt gatgaattgt 588; tggatcattt gaataaaatc ttttactaac cccatgataa aaggagaaga caacagtgag 594; cttagaatat ctataaagca aaaaatgtag tctcttgttt aaaaaatctg gagcgggaat 600; atac aaaactttag ttga gcaaaaattt aaacttactg gaatctttta 606; taataatgta agtggaatgg aggattctag gaactgagaa ctgtattgga ataggttcaa 612; aatatgtaag aaatgctaat gtgggagata ttat ttagtactta ttctgattat 618; tattaaagta ataatgtgtt ccttgaggat aacttgtcaa atgccccaaa gcataaagaa 624; tataattctg aatcccaaat gaca agaactctgt attc attctgcata 630: taattattta taga ttgtgaattt ttccatgttc ttaaaattat ttttatcttt 6361 tttcatggtt gcatagtgct ccattgtttg gccttggtaa tatttagttg ataattccat 642; gtat ttttcacttg tttctaagat caaacatttt tgca tgttatatat 648; aaatatgtaa attctgtgat actctatgat catctctttc tttatattat tttcatagac 654; atgaaatagt tgctcagaga ttatgcattt taagacactc atagtatata ttgccaaagt 660; ggtttccaga aaggcactgc tggcttcgac tcctataagc agcacgtggg cttgttcatc 6661 tcactgcatg tttatgaaga tacagttctt ttgccttgtt ctctgcctga tgtgtatgca 6721 gccc tcaatatgca gtggttgaat aaatgaatga agaaaccact atcaaaaaaa 6781 aaaaaaaaaa a Pnneh1seguence(vafiant2y NCBI Reference Seguence: NP_056667.2 LOCUS N P_056667 ACCESSION NP_056667 1 quynkddms yrrisavepk talpfnrflp nksrqpsyvp kpdk hednrrswas 61 pvyteadgtf sanrriwgt nvenwptvqg tsksscylee ekaktrsipn yvrk 121 lspvmpnpgn aquflpkcw tpedvnwkri kretykpwyk efqgfsqfll lqalqtysdd 181 ilssethtki dptsgprlit rrknlsyapg yrrddlemaa nddf afha 241 npyvlrafed frkfseqdds verdiilqcr egelvlpdle kddmivrrip aqkkevplsg 301 apdryhpvpf pepwtlppei qakflcvler tcpskeksns crilvpsyrq kkddmltrki 361 qswklgttvp pisftpgpcs eadlkrweai reasrlrhkk rlmverlqu iygengsksm 421 sdvsaedvqn qulryeemq kiksqlkeqd qkwqddlakw kdrrksytsd lqkkkeeree 481 iekqaleksk rssktfkeml qdresqnqks tvpsrrrmys fddvleegkr pptmtvseas 541 yqserveekg atypseipke dsttfakred qlps qspa slsslrsrst 601 qmestrvsas 1prsyrktdt vrltsvvtpr pfgsqtrgis slprsytmdd awkyngdved 661 ikrtpnnvvs tpapspdasq lasslssqke vaateedvtr 1psptspfss lsqdqaatsk 721 atlsstsgld 1msesgegei rsq dqfsdmrisi nqtpgksldf gftikwdipg 781 ifvasveags lqu deiiainntk fsyndskewe eamakaqetg hlvmdvrryg 841 kagspetkwi datsgiynse kssnlsvttd ssni eskeingihd esnafeskas 901 esislknlkr rsqffquss dsvvpdlpvp tisapsrwvw dqeeerque rqueqdr11 961 qekyqreqek 1reewqrakq skyl deelmvlssn smslttreps latweatwse 1021 gskssdregt rageeerrqp qeevvhedqg kkpqdqlvie rerkweqqlq rlqa 1081 eaeeqkrpae eqquaeier etsvriyqyr rpvdsydipk teeassgflp gdrnksrstt 1141 elddystnkn gnnkyldqig nmtssqrrsk keqvpsgael erqqilqemr krtplhndns 1201 asvn kepvslpgim rrgesldnld sprsnsqup pwlnqptgfy assqudfsr 1261 pppqlvstsn raymrnpsss vpppsagsvk attq sptprshsps asqsgsqlrn 1321 svlpvsvtse alpqelksgs ettnctatta dgqp pcdvslhtka llqieeevva 1381 ahvdl CARS Official Symbol: CARS Official Name: cysteinyl-tRNA synthetase m: 833 Organism: Homo sapiens Other Aliases: CARSf, CYSRS, MGC:11246 Other Designations: cysteine tRNA ligase 1, cytoplasmic; cysteine translase; cysteine--tRNA , cytoplasmic Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_139273.3 LOCUS NM_139273 ACCESSION NM_139273 l gtggggcgcg acttccgggg cggcggttgc atcagattct aggaagtgtc tgtagccgca 6" gctgcgggtc cgggattccc ggca gattcctccg ggcagcaggg caaaggccgg 12; cgtgtgcagc cccagtggtc ccctcctgct gggacccagc catgcagact ccacctttac 18; aacagcctca ccaggaacaa ggaagtgttc atacctcaag atgggaaaaa ggtgacgtgg 24; tattgctgtg ggccaaccgt ctatgacgca tctcacatgg ggcacgccag catc ; tcttttgata tcttgagaag agtgttgaag gattacttca aatttgatgt cttttattgc 36; atgaacatta ttga tgacaagatc atcaagaggg cccggcagaa ccacctgttc 42; tatc gggagaagag agcg gcacagctct tggaggatgt tcaggccgcc 48; ctgaagccat tttcagtaaa attaaatgag accacggatc ccgataaaaa gcagatgctc 54L attc cagt gcagcttgcc acagagccac ttgagaaagc tgtgcagtcc 60; agactcacgg gagaggaagt ctgt gtggaggtgt aaga ggat 66L ttgctctctg actggctgga ttctacactt ggctgtgatg tcactgacaa ttccatcttc 72; tccaagctgc ccaagttctg ggagggggac ttccacagag acatggaagc tctgaatgtt 78; ctccctccag atgtcttaac ccgggttagt gagtatgtgc cagaaattgt gaactttgtc 84; cagaagattg tggacaacgg ttacggctat gtctccaatg ggtctgtcta ctttgataca 90; gcgaagtttg gcga gaagcactcc aagc ctga tgga 96; gatcagaaag cccttcaaga aggggaaggt gacctgagca tctctgcaga ccgcctgagt L02; gagaagcgct ctcccaacga ctttgcctta tggaaggcct ctaagcccgg agaaccgtcc L08; tggccgtgcc cttggggaaa gggtcgtccg ggctggcata tcgagtgctc ggccatggca L14; ggcaccctcc taggggcttc gatggacatt cacggaggtg ggttcgacct ccggttcccc L20; caccatgaca atgagctggc acagtcggag gcctactttg actg ctgggtcagg L26; tacttcctgc acacaggcca cctgaccatt gcaggctgca aaatgtcaaa gtcactaaaa L32; aacttcatca ccattaaaga tgccttgaaa tcag cacggcagtt gcggctggcc L38; ttcctcatgc actcgtggaa ggacaccctg gactactcca gcaacaccat cttcaatatg agaagttctt gaatgagttt ttcttaaatg tgaaagatat cctgttgaca tcactggtca gtttgagaag tggggagaag aagaagcaga tatg acaagaagac agcaattcac aaagccctct gtgacaatgt accgtcatgg aagagatgcg ggccttggtc tgca acctctatat ggcagcccgg L68; aaagccgtga ggaagaggcc caaccaggct ctgctggaga acatcgccct gtacctcacc L74; catatgctga agatctttgg ggccgtagaa gaggacagct ccctgggatt cccggtcgga L80; gggcctggaa ccagcctcag tctcgaggcc acagtcatgc ttca atca L86; gaattccgag aaggagtgcg gaagattgcc caaa ctga gattctgcag L92; ctcagcgatg ccctgcggga caacatcctg cccgagcttg gggtgcggtt tgaagaccac L98; gaaggactgc ccacagtggt gaaactggta gacagaaaca ccttattaaa agagagagaa 204; gaaaagagac gggttgaaga ggagaagagg aagaagaaag aggaggcggc ccggaggaaa 210; caggaacaag aagcagcaaa gctggccaag atgaagattc cccccagtga gatgttcttg 216; tcagaaaccg acaaatactc caagtttgat gaaaatgtaa gcatggtctg cccacacatg 222; acatggaggg caaagagctc agcaaagggc aagccaagaa gctgaagaag ctcttcgagg 228; ctcaggagaa gctctacaag gaatatctgc agatggccca gaatggaagc ttccagtgag 2341 ggggcacagg actgactttt attg tggactagtg gctgctgtct gcctcagtga 240; ccca gcgctcctat catgtttaca gtcacccttg aaat taagagttgt 246; gttcatgtag gttcgtgtcg tcgttggctc tgagacattg ataataaatt tttctcaaca 2521 gtgagaccct caaaaaaaaa aaaaaaaaaa aaaaaaaa Protein seguence (variant 1 1: NCBI Reference Seguence: NP_644802.1 LOCUS N P_644802 ACCESSION NP_644802 l madssgqqgk grrvqpqwsp crlh lynsltrnke vfipngkkv twyccgptvy 6; dashmghars yisfdilrrv lkdyfkfdvf ycmnitdidd kiikrarqnh lfeqyrekrp 12; eaaqlledvq aalkpfsvkl nettdpdkkq mleriqhavq lateplekav qsrltgeevn l8; scvevlleea kdllsdwlds tdns ifsklpkfwe gdfhrdmeal nvlppdvltr 24; vseyvpeivn qukivdngy gyvsngsvyf dtakfassek hsygklvpea vgqualqeg ; egdlsisadr lsekrspndf alwkaskpge psprpwgkg rpgwhiecsa magtllgasm 36; dihgggfdlr fphhdnelaq seayfendcw vryflhtghl tiagckmsks ikda 42; lkkhsarqlr laflmhswkd tldyssntme salqyekfln efflnvkdil rapvditgqf 48; ekwgeeeael nknfydkkta ihkalcdnvd trtvmeemra lyma arkavrkrpn 54; qallenialy lthmlkifga veedsslgfp vggpgtslsl quv lsefregvrk 60; iarequpei lqlsdalrdn ilpelgvrfe dheglptvvk lvdrntllke reekrrveee 66; krkkkeeaar rkqeqeaakl akmkippsem flsetdkysk fdenvsmvcp hmtwrakssa 72; kgkprs Nucleotide seguence (variant 2): NCBI Reference Seguence: NM_001751.5 LOCUS NM_001751 ACCESSION NM_001751 l gtggggcgcg acttccgggg cggcggttgc atcagattct aggaagtgtc tgtagccgca 6; gctgcgggtc nggattccc agccatggca gattcctccg ggcagcaggg caaaggccgg 12; cgtgtgcagc cccagtggtc ccctcctgct cagc catgcagact ccacctttac 18; ctca ccaggaacaa ggaagtgttc atacctcaag atgggaaaaa ggtgacgtgg 24; tgtg ccgt ctatgacgca atgg ggcacgccag gtcctacatc ; tcttttgata tcttgagaag agtgttgaag gattacttca aatttgatgt cttttattgc 36; atgaacatta cggatattga gatc atcaagaggg cccggcagaa ccacctgttc 42; gagcagtatc gggagaagag gcctgaagcg gcacagctct tggaggatgt cgcc 48; ctgaagccat tttcagtaaa attaaatgag accacggatc ccgataaaaa gcagatgctc 54; gaacggattc agcacgcagt tgcc ccac ttgagaaagc tgtgcagtcc 60; agactcacgg aagt caacagctgt gtggaggtgt tgctggaaga ggat 66; ttgctctctg actggctgga ttctacactt ggctgtgatg tcactgacaa ttccatcttc 72; tccaagctgc tctg ggagggggac ttccacagag acatggaagc tctgaatgtt 78; ctccctccag atgtcttaac ccgggttagt gagtatgtgc ttgt gaactttgtc 84; cagaagattg tggacaacgg ttacggctat gtctccaatg ggtctgtcta ctttgataca 90; gcgaagtttg cttctagcga gaagcactcc tatgggaagc tggtgcctga tgga 96L gatcagaaag cccttcaaga aggggaaggt gacctgagca tctctgcaga ccgcctgagt L02; gagaagcgct ctcccaacga ctttgcctta tggaaggcct ctaagcccgg gtcc L08; tggccgtgcc cttggggaaa gggtcgtccg ggctggcata tcgagtgctc ggccatggca L14; ggcaccctcc taggggcttc gatggacatt cacggaggtg ggttcgacct ccggttcccc L20; caccatgaca atgagctggc acagtcggag gcctactttg aaaacgactg ctgggtcagg L26; tacttcctgc acacaggcca cctgaccatt gcaggctgca aaatgtcaaa gtcactaaaa L32; aacttcatca ccattaaaga tgccttgaaa aagcactcag cacggcagtt gcggctggcc L38; ttcctcatgc actcgtggaa ggacaccctg gactactcca gcaacaccat ggagtcagcg L44; cttcaatatg agaagttctt gaatgagttt ttcttaaatg tgaaagatat ccttcgcgct L50; cctgttgaca tcactggtca gtttgagaag gaag aagaagcaga taag L56; aacttttatg acaagaagac agcaattcac aaagccctct atgt tgacacccgc L62; accgtcatgg aagagatgcg ggccttggtc agtcagtgca acctctatat ccgg L68; aaagccgtga ggaagaggcc caaccaggct ctgctggaga acatcgccct gtacctcacc L74; catatgctga agatctttgg ggccgtagaa gaggacagct ccctgggatt cccggtcgga L80; ggaa ccagcctcag tctcgaggcc acagtcatgc cctaccttca ggtgttatca L86; gaattccgag aaggagtgcg gaagattgcc cgagagcaaa aagtccctga gattctgcag L92; gatg ccctgcggga cctg cccgagcttg gggtgcggtt tgaagaccac L98; gaaggactgc ccacagtggt gaaactggta gacagaaaca ccttattaaa agagagagaa 204i gaaaagagac gggttgaaga ggagaagagg aagaagaaag aggaggcggc gaaa 2101 caggaacaag aagcagcaaa gctggccaag atgaagattc cccccagtga gatgttcttg 216; accg actc tgat gaaaatggtc tgcccacaca tgacatggag 222; ggcaaagagc tcagcaaagg gcaagccaag aagctgaaga agctcttcga ggctcaggag 228; aagctctaca aggaatatct gcagatggcc cagaatggaa gcttccagtg agggggcaca 234; ggactgactt tttaaaccat tgtggactag tggctgctgt ctgcctcagt gacaatgtcc 2401 cagcgctcct atcatgttta cagtcaccct tgggtcctaa attaagagtt gtgttcatgt 2461 aggttcgtgt cgtcgttggc tctgagacat tgataataaa tttttctcaa cagtgagacc 2521 ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa Protein seguence (variant 2): NCBI Reference Seguence: NP_001742.1 LOCUS N P_001 742 ACCESSION NP_001742 1 madssgqqgk grrvqpqwsp pagtqpcrlh lynsltrnke vfipngkkv twyccgptvy 61 dashmghars yisfdilrrv lkdyfkfdvf ycmnitdidd kiikrarqnh lfeqyrekrp 121 eaaqlledvq aalkpfsvkl nettdpdkkq mleriqhavq lateplekav qsrltgeevn 181 scvevlleea kdllsdwlds tlgcdvtdns ifsklpkfwe meal nvlppdvltr 241 vseyvpeivn qukivdngy gyvsngsvyf dtakfassek hsygklvpea vgqualqeg ; egdlsisadr lsekrspndf kpge pSWpCprkg rpgwhiecsa magtllgasm 361 dihgggfdlr elaq seayfendcw tghl tiagckmsks lknfitikda 421 lkkhsarqlr laflmhswkd tldyssntme kfln efflnvkdil rapvditgqf 481 eael nknfydkkta ihkalcdnvd trtvmeemra lvsqcnlyma arkavrkrpn 541 qallenialy lthmlkifga veedsslgfp vggpgtslsl eatvmpquv lsefregvrk 60; iarequpei lqlsdalrdn ilpelgvrfe dheglptvvk lvdrntllke veee 661 krkkkeeaar rkqeqeaakl akmkippsem flsetdkysk fdenglpthd megkelskgq 721 akklkklfea qeklykequ maqngsfq Nucleotide seguence (variant 3): NCBI Reference Seguence: NM_001014437.2 LOCUS NM_001014437 ACCESSION NM_001014437 1 cgcg acttccgggg ttgc atcagattct aggaagtgtc tgtagccgca 61 gctgcgggtc cgggattccc ggca gattcctccg ggcagcaggc tcctgactac 121 aggtccattc ttag tgacgaggca gccagggcac aagccctgaa cgagcacctc 181 agcacgcgta gctatgtcca ggggtactca ctgtcccagg cagacgtgga cgcgttcagg 241 cagctctcgg ccgc tgacccccag ctcttccacg tggctcggtg gttcaggcac ; atagaagcgc tcctgggtag tggc aaaggccagc cctgcaggct ccaagcaagc 36; aaaggccggc gtgtgcagcc ccagtggtcc cctcctgctg ggacccagcc atgcagactc 42; cacctttaca tcac caggaacaag gaagtgttca tacctcaaga tgggaaaaag 48; gtgacgtggt attgctgtgg gccaaccgtc tatgacgcat ctcacatggg gcacgccagg 54; tcctacatct cttttgatat cttgagaaga aagg attacttcaa atttgatgtc 60; ttttattgca tgaacattac ggatattgat gacaagatca tcaagagggc ccggcagaac 66L cacctgttcg agcagtatcg gagg cctgaagcgg cacagctctt ggaggatgtt 72L gccc tgaagccatt aaaa ttaaatgaga ccacggatcc cgataaaaag 78L cagatgctcg aacggattca gcacgcagtg cagcttgcca cagagccact tgagaaagct 84L gtgcagtcca gactcacggg agaggaagtc tgtg tggaggtgtt gctggaagaa 90; gccaaggatt tgctctctga ctggctggat tctacacttg gctgtgatgt cactgacaat 96; tccatcttct ccaagctgcc caagttctgg gagggggact tccacagaga catggaagct L02; ctgaatgttc tccctccaga tgtcttaacc nggttagtg agtatgtgcc agaaattgtg L08; aactttgtcc agaagattgt ggacaacggt tacggctatg tctccaatgg gtctgtctac L14; tttgatacag Cgaagtttgc ttctagcgag aagcactcct atgggaagct ggtgcctgag L20; gccgttggag atcagaaagc ccttcaagaa ggggaaggtg acctgagcat ctctgcagac L26; agtg agaagcgctc tcccaacgac tttgccttat ggaaggcctc taagcccgga L32; gaaccgtcct ggccgtgccc ttggggaaag ggtcgtccgg gctggcatat cgagtgctcg L38; gccatggcag gcaccctcct aggggcttcg attc acggaggtgg gttcgacctc L44; cggttccccc accatgacaa tgagctggca cagtcggagg cctactttga aaacgactgc L50; aggt acttcctgca cacaggccac ctgaccattg gcaa aatgtcaaag L56; tcactaaaaa acttcatcac agat aaaa agcactcagc acggcagttg L62; cggctggcct tcctcatgca ctcgtggaag gacaccctgg actactccag caacaccatg L68; gagtcagcgc atga gaagttcttg aatgagtttt tcttaaatgt gaaagatatc L74: cttcgcgctc ctgttgacat cactggtcag tttgagaagt ggggagaaga agaagcagaa L80; ctgaataaga acttttatga caagaagaca gcaattcaca tctg gacacccgca ccgtcatgga agagatgcgg gccttggtca gtcagtgcaa gcagcccgga aagccgtgag gaagaggccc aaccaggctc tgctggagaa tacctcaccc atatgctgaa gatctttggg gccgtagaag aggacagctc cctgggattc 204; ccggtcggag ggcctggaac cagcctcagt ctcgaggcca cagtcatgcc ctaccttcag 210; tcag aattccgaga aggagtgcgg aagattgccc gagagcaaaa agtccctgag 216; attctgcagc tcagcgatgc cctgcgggac aacatcctgc ccgagcttgg ggtgcggttt 222; gaagaccacg aaggactgcc ggtg aaactggtag acagaaacac cttattaaaa 228; gagagagaag aaaagagacg ggttgaagag gagaagagga agaagaaaga ggcc 234; cggaggaaac aggaacaaga agcagcaaag ctggccaaga tgaagattcc ccccagtgag 240; atgttcttgt cagaaaccga caaatactcc aagtttgatg aaaatggtct acat 246; gagg gcaaagagct aggg caagccaaga agctgaagaa gctcttcgag 252; gctcaggaga agctctacaa ggaatatctg cagatggccc gaag cttccagtga 258; gggggcacag gactgacttt ttaaaccatt gtggactagt ggctgctgtc tgcctcagtg 2641 acaatgtccc agcgctccta tcatgtttac agtcaccctt gggtcctaaa ttaagagttg 270; tgttcatgta tgtc gtcgttggct ctgagacatt gataataaat ttttctcaac 276; agtgagaccc tcaaaaaaaa aaaaaaaaaa aaaaaaaaa Pnneh1seguence(vafiant3y NCBI nce Seguence: 014437.1 LOCUS N P_001014437 ACCESSION N P_001014437 l qqap dyrsilsisd eaaraqalne yvqg yslsqadvda frqlsappad 6" pqlfhvarwf rhieallgsp Cgkgqpcrlq askgrrvqpq wsppagtqpc sltr 12; nkevfipng kkvtwyccgp tvydashmgh arsyisfdil rrvlkdyfkf dvfycmnitd 18; iddkiikrar qyre krpeaaqlle dvqaalkpfs tdpd kkqmleriqh 24; avqlateple kavqsrltge evnscvevll eeakdllsdw ldstlgcdvt dnsifsklpk ; fwegdfhrdm ealnvlppdv ltrvseyvpe ivnqukivd ngygyvsngs vyfdtakfas 36; sekhsygklv peavgqual qegegdlsis adrlsekrsp ndfalwkask pgepswpcpw 42; gkgrpgwhie tllg asmdihgggf dlrfphhdne laqseayfen dcwvryflht 48L ghltiagckm skslknfiti kdalkkhsar qlrlaflmhs wkdtldyssn tmesalqyek 54L flnefflnvk dilrapvdit gqfekwgeee aelnknfydk ktaihkalcd nvdtrtvmee 60; mralvsqcnl ymaarkavrk rpnqalleni alylthmlki fgaveedssl gfpvggpgts 66; lsleatvmpy lqvlsefreg vrkiarequ peilqlsdal rdnilpelgv rfedheglpt 72; vvklvdrntl lkereekrrv eeekrkkkee aarrkqeqea aklakmkipp semflsetdk 78; yskfdenglp thdmegkels kgqakklkkl feaqeklyke qumaqngsf q Nucleotide seguence (variant 5): NCBI Reference Seguence: NM_001194997.1 LOCUS NM_001 194997 ACCESSION NM_001 194997 1 gtggggcgcg gggg cggcggttgc atcagattct aggaagtgtc tgtagccgca 6; gctgcgggtc cgggattccc agccatggca gattcctccg ggcagcaggc tcctgactac 121 aggtccattc tgagcattag tgacgaggca gcac tgaa cctc l8; agcacgcgta gctatgtcca ggggtactca ctgtcccagg cagacgtgga cgcgttcagg 24L cagctctcgg ccccgcccgc tgacccccag ctcttccacg tggctcggtg gcac ; atagaagcgc tcctgggtag cccctgtggc aaaggccagc cctgcaggct aagc 36; aaaggccggc gtgtgcagcc ccagtggtcc cctcctgctg agcc atgcagactc 42; cacctttaca tcac caggaacaag gaagtgttca tacctcaaga tgggaaaaag 48; gtgacgtggt attgctgtgg gccaaccgtc tatgacgcat ctcacatggg gcacgccagg 54; atct cttttgatat cttgagaaga gtgttgaagg attacttcaa atttgatgtc 60; ttttattgca tgaacattac ggatattgat gacaagatca tcaagagggc ccggcagaac 66; cacctgttcg agcagtatcg ggagaagagg cctgaagcgg cacagctctt ggaggatgtt 72; caggccgccc tgaagccatt ttcagtaaaa ttaaatgaga ccacggatcc cgataaaaag 78; cagatgctcg aacggattca gcacgcagtg cagcttgcca cact tgagaaagct 84; gtgcagtcca gactcacggg agaggaagtc aacagctgtg tggaggtgtt gctggaagaa 90; gccaaggatt tgctctctga ctggctggat tctacacttg atgt cactgacaat 96; tccatcttct ccaagctgcc caagttctgg gagggggact tccacagaga catggaagct L02; ctgaatgttc tccctccaga aacc cgggttagtg agtatgtgcc agaaattgtg L08; gtcc agaagattgt ggacaacggt tacggctatg tctccaatgg gtctgtctac L14; acag cgaagtttgc ttctagcgag tcct atgggaagct ggtgcctgag L20; gccgttggag atcagaaagc ccttcaagaa ggggaaggtg acctgagcat agaagcgctc tcccaacgac tttgccttat ggaaggcctc ggccgtgccc ttggggaaag ggtcgtccgg gctggcatat gcaccctcct aggggcttcg atggacattc acggaggtgg cggttccccc accatgacaa tgagctggca cagtcggagg cctactttga aaacgactgc L50; tgggtcaggt acttcctgca cacaggccac ctgaccattg caggctgcaa aatgtcaaag 156; tcactaaaaa acttcatcac cattaaagat gccttgaaaa agcactcagc acggcagttg 162; cggctggcct tcctcatgca ctcgtggaag ctgg actactccag caacaccatg 168; gagtcagcgc ttcaatatga gaagttcttg tttt tcttaaatgt gaaagatatc 174; cttcgcgctc ctgttgacat cactggtcag tttgagaagt aaga agaagcagaa 180; ctgaataaga acttttatga caagaagaca gcaattcaca aagccctctg tgacaatgtt 186; gacacccgca ccgtcatgga agagatgcgg gccttggtca gtcagtgcaa cctctatatg 1921 gcagcccgga aagccgtgag gaagaggccc aaccaggctc tgctggagaa 1981 tacctcaccc atatgctgaa gatctttggg gccgtagaag aggacagctc cctgggattc 2041 ccggtcggag ggcctggaac cagt gcca cagtcatgcc tcag 2101 gtgttatcag aattccgaga gcgg aagattgccc aaaa tgag 216; attctgcagc tcagcgatgc cctgcgggac aacatcctgc ccgagcttgg ggtgcggttt 222; gaagaccacg aaggactgcc cacagtggtg aaactggtag acagaaacac cttattaaaa 228; gagagagaag aaaagagacg ggttgaagag gagaagagga agaagaaaga ggaggcggcc 234; cggaggaaac aggaacaaga agcagcaaag ctggccaaga tgaagattcc tgag 240; atgttcttgt cagaaaccga caaatactcc aagtttgatg aaaatgtaag catggtctgc 246; ccacacatga catggagggc aaagagctca ggca agccaagaag ctgaagaagc 252; tcttcgaggc tcaggagaag ctctacaagg aatatctgca gatggcccag aatggaagct 258; tccagtgagg gggcacagga ctgacttttt aaaccattgt ggactagtgg ctgctgtctg 264; cctcagtgac aatgtcccag Cgctcctatc atgtttacag ttgg gtcctaaatt 270; aagagttgtg ttcatgtagg ttcgtgtcgt cgttggctct gagacattga taataaattt 2761 ttctcaacag cctc aaaaaaaaaa aaaaaaaaaa aaaaaaa Protein seguence (variant 5): NCBI Reference Seguence: 181926.1 LOCUS NP_001181926 ACCESSION NP_001181926 1 madssgqqap dyrsilsisd eaaraqalne hlstrsyvqg yslsqadvda frqlsappad 61 pqlfhvarwf rhieallgsp cgkgqpcrlq askgrrvqpq tqpc rlhlynsltr 121 nkevfipng kkvtwyccgp tvydashmgh arsyisfdil rrvlkdyfkf dvfycmnitd 181 iddkiikrar qnhlfeqyre krpeaaqlle dvqaalkpfs vklnettdpd kkqmleriqh 241 avqlateple kavqsrltge evnscvevll eeakdllsdw ldstlgcdvt dnsifsklpk 301 fwegdfhrdm ealnvlppdv ltrvseyvpe ivnqukivd DgygyVSDgS vyfdtakfas 361 sekhsygklv peavgqual qegegdlsis krsp kask pgepswpcpw 421 gkgrpgwhie csamagtllg asmdihgggf dlrfphhdne laqseayfen flht 481 ghltiagckm skslknfiti hsar qlrlaflmhs wkdtldyssn tmesalqyek 541 flnefflnvk dilrapvdit gqfekwgeee aelnknfydk ktaihkalcd nvdtrtvmee 60; mralvsqcnl ymaarkavrk rpnqalleni alylthmlki fgaveedssl gfpvggpgts 66- lsleatvmpy freg vrkiarequ peilqlsdal rdnilpelgv glpt 721 vvklvdrntl krrv eeekrkkkee aarrkqeqea aklakmkipp semflsetdk 781 yskfdenvsm vcphmtwrak ssakgkprs DDX1 Official Symbol: DDX1 Official Name: DEAD (Asp-Glu-Ala-Asp) box helicase 1 Gene ID: 1653 Organism: Homo sapiens Other Aliases: DBP-RB, UKVH5d Other Designations: ATP-dependent RNA helicase DDX1; DEAD (Asp-Glu-Ala- Asp) box polypeptide 1; DEAD box polypeptide 1 ; DEAD box protein 1; DEAD box protein retinoblastoma; DEAD box-1; DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 Nucleotide seguence: NCBI nce Seguence: 939.2 LOCUS NM_004939 ACCESSION NM_004939 1 ctaatcacca aacatctgct tctg tagctgtgac cctgataccg cgtggtgtgc 61 caca tggtgcccag aacgaaggcg gcgtccagaa gccctaggtc ccagaggtcc 121 gctcagcggc aggcgcataa ggcggggccg gcgcgggc ct ttccttccat cggaaccgtt 181 ctcccggggc tgagtccctg cccggactcc gaacgccgaa gaccaggggc cggaagcgcg 241 cgccgccact gccacgccgt gtcagtcggg agggagggag cgagcaggcg aagccgcgga 3 0 1 ggacggggtg aagatggcgg ccttctccga gatgggtgta atgcctgaga ttgcacaagc 36; tgtggaagag atggattggc tcctcccaac tgatatccag gctgaatcta tgat 42; cttaggagga ggtgatgtac ttatggctgc agaaacagga agtggcaaaa ctggtgcttt 48; tagtattcca gttatccaga tagtttatga aactctgaaa gaccaacagg aaggcaaaaa 54; aggaaaaaca acaattaaaa ctggtgcttc gaac aaatggcaga tgaacccata 60; tgacagagga tctgcttttg caattgggtc agatggtctt tgttgtcaaa gcagagaagt 66; aaaggaatgg catgggtgta gagctactaa aatg aaagggaaac actactatga 72L agtatcctgt catgaccaag ggttatgcag ggtcgggtgg atgc aggcctcttt 78L ggacctaggt actgacaagt ttggatttgg ctttggtgga acaggaaaga aatcccataa 84L caaacaattt gataattatg aatt cactatgcat attg gatgttacct 90; ggatatagat aagggacatg tcaagttctc caaaaatgga aaagatcttg gtctggcatt 96; tgaaatacca ccacatatga aaaaccaagc cctctttcct gcctgtgttt atgc L02; tgaactgaaa ttcg gtgaagagga atttaagttt ccaccaaaag ttgt L08; tgctctttcc aaggcaccgg acat tgtcaaatca cagcactcag ggtgacacaa acaaagtttc tccccaatgc tccgaaagct ctcattgttg aaccttcccg L20; ggagttagct gaacaaactt tgaacaacat caagcagttt aagaaataca ttgataatcc L26; taaattaagg gagcttctga gagg agcc cgggatcagc tctctgtttt L32; ggaaaatgga gtagatatag ttgtaggtac tccgggaaga ctagatgact tggtgtcaac L38; tggaaagctg aacttatctc aagttagatt cctggtcctg gatgaagctg atgggcttct L44; aggt tattctgatt ttataaatag gatgcacaat cagattcctc aggttacctc L50; tgatggaaaa agacttcagg tgattgtttg ctctgccact ttgcattctt tcgatgtaaa L56; gaaactgtcc gagaagataa tgcattttcc ggtt gacttaaaag gagaagactc L62; tgttccagat actgtacacc atgttgttgt cccagtaaat cccaaaactg acagactctg L68; ggaaaggctt ggaaagagcc acattagaac tgatgatgta catgcaaaag ataacacaag L74; acctggtgct aatagtccag agatgtggtc tgaagctatt aaaatcctga aaggggagta L80: tgctgtccgg gcaatcaagg aacataagat ggatcaagca attatcttct aattgactgt gataacttgg actt tatacaacaa ggaggaggac aggacaccag ttctcatgtg tttgtcttca tggtgacaga aagcctcatg agagaaagca L98; aaacttgqaa agatttaaga aaggagatgt aagattcttg atttgcacag atgtagctgc 204; tagaggaatt gatatccacg gtgttcctta aaat gtcactctgc ccgatgaaaa 210; gcaaaactac gtacatcgaa ttggcagagt aggaagagct gaaaggatgg gtctggcaat 216; ttccctggtg gcaacagaaa aagaaaaggt ttggtaccat gtatgtagca gccgtggaaa 222; agggtgttat aacacaagac tcaaggaaga tggaggctgt accatatggt acaacgagat 228; gcagttacta tctgagatag aagaacacct gaactgtacc atttctcagg ttgagccgga 234; tataaaggta ccagtggatg atgg tacc tacggtcaga aaagggctgc 240; tgga aaag gccatgtgga tattttggca cctactgttc aagagttggc 246; tgcccttgaa aaggaggcgc agacatcttt cctgcatctt ggctaccttc ctaaccagct 252: gttcagaacc ttctgatttt tacatttact gaataagatt tgagtaatga aagtctgtag 258; tcttaaaact ctaaaacagt tgtactgctt ccaagcagca gtatttatag taacgtaagc 264; tattaatgct aactcttgca tgtcaagaaa cattagtctt aggaattctt caaaaaatgg 2701 catcccaatg aatt tgatgactat attttcatga aaaaaaaaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_004930.1 LOCUS N P_004930 ION NP_004930 l maafsemgvm veem dwllptdiqa esiplilggg dvlmaaetgs gktgafsipv 6; tlkd gktt vlnk wqmnpydrgs afaigsdglc cqsrevkewh 12; gcratkglmk gkhyyevsch dqglcrvgws tmqasldlgt dkfgfgfggt gkkshnkqfd 18; nygeeftmhd tigcyldidk ghkaskngk dlglafeipp hmknqalfpa cvlknaelkf 24; fkfp pkdgfvalsk vksq hsgnaqvtqt kflpnapkal ivepsrelae ; qtlnnikqfk klre lliiggvaar dqlsvlengv divvgtpgrl ddlvstgkLn 36; lsquflvld eadgllsqu sdfinrmhnq ipqvtsdgkr lqvivcsatl hsfdvkklse 42; kimhfptwvd lkgedsvpdt vhhvvvpvnp ktdrlwerlg kshirtddvh akdntrpgan 48; spemwseaik ilkgeyavra ikehkmdqai ifcrtkidcd nleqyfiqqg ggpdkkghqf 54; scvclhgdrk pherkqnler fkkgdvrfli ctdvaargid ihgvpyvinv tlpdekqnyv 60L hrigrvgrae rmglaislva tekekvahv cssrgkgcyn trlkedggct iwynemqlls 66L eieehlncti sqvepdikvp vdefdgkvty quraagggs ykghvdilap tvqelaalek 72L eaqtsflhlg ylpnqlfrtf Official Symbol: CCD022 Official Name: coiled-coil domain containing 22 Gene ID: 28952 Organism: Homo sapiens Other Aliases: JM1, CXorf37 Other Designations: coiled-coil domain-containing protein 22 Nucleotide seguence: NCBI Reference Seguence: NM_014008.3 LOCUS NM_014008 ACCESSION NM_014008 l tccg gcatgcgccg tgctcgctca cagaactaca ctttccaact ctccccacac 6; gacccgtgac actctgtgga ccgcgagcac ggagcagggt ttctacagct gctccccact 12; ttctcggacc ngtcctgga cccagccccc gactccgaca cggctccacc atggaggagg l8; cggaccgaat cctcatccat tcgctgcgcc aggccggcac ggcagttcct ccagatgtgc 24; agaccttgcg cgccttcacc actgagctgg ttgtagaggc tgtggtccgc tgcctgcgtg ; tgatcaaccc tgcggtgggc tctggcctca gccctctgct gcctcttgcc atgtctgccc 36; gcct ggccatgagc ctggctcagg cctgcatgga cctgggctat cccttggagc 42; ttggctatca gaacttcctc taccccagtg agcctgacct ccgagacctg cttctcttct 48; tggctgagcg tctgcccacc gatgcctctg aggatgcaga ccagcctgca ggtgactcag 54; ctattctcct ccgggccatt gggagccaaa ttcgggacca gctggcactg ccttgggtcc 60; cgccccacct tcgcactccc aagctgcagc acctccaggg ctcggccctc cagaagcctt 66; tccatgccag caggctggtc gaat tgagttccag aggtgagcca cgggagttcc 72; aggcgagtcc cctgctgctt ccta cccaggtgcc tcagcctgtt ggaagggtgg 78; cctcgctcct cgaacaccat gccctgcagc tctgccagca gacgggccgg gaccggccag 84; gggatgagga ctgggtccac cggacatccc gcctcccacc ccaggaggac acacgggctc 90; agcggcagcg gctgcagaag caactgactg agcatctgcg ccaaagctgg ggcctgcttg 96L ccat acaagcccgg ggag aactgctgca ggcctggggt gctggggcca 102; agactggtgc tcctaagggc ttca cgcactcaga cacc ttccatctgg 108; agccccaggc ccaggccact tcag atgtgccagc cacctcccgg cggcctgaac 114; cgtg ggcagctcag gaacaggagc ccct tcgggagcag ctggaaggag L20; tgaaccgcag cattgaggag gttgaggccg acatgaagac cctgggcgtc agctttgtgc L26; aggcagagtc tgagtgccgg aagc tcagtacagc agagcgtgag caggccctgc L32; gcctgaagag CCgCgngtg gagctgctgc ccgatgggac tgccaacctt gccaagctgc L38; agcttgtggt ggagaatagt cggg tcatccactt ggcgggtcag tgggagaagc L44; accgggtccc actcctcgct gagtaccgcc acctccgaaa gctgcaggat tgcagagagc L50; tggaatcttc tcgacggctg gcagagatcc aagaactgca ccagagtgtc cgggcggctg L56: ctgaagaggc ccgcaggaag gaggaggtct ataagcagct gatgtcagag tgtgtcccgg ctggcctaca cccagcgcat cctggagatc gaaggaagag atcaccaaga tcttgtctga tacgaaggag cctatctggg aagctggacc ggacgtttgc ggtgactgat gagcttgtgt L80; atgc caagaaggac gatgctgttc ggaaggccta taagtatcta ctgc L86; acgagaactg cagccagctc atccagacca tcgaggacac aggcaccatc ttcgagacct cgaggagcag atcgagacag gcaa gaagaccctc agcaacctgg L98; agaagatccg ctac cgagccctcc gccaggagaa cctc ctaggccggg 204; tccgggaggc ctgaggagcc gccggcagag gtctctcccc agcctcaggc agggatttgg 210; ggtgctggag gcagtggcca agcacatgcc actt cctccgctgt ccagttcctc 216; cggc cttggaccca gacccctgcc cactgaccgc aacccttata tggggtgata 222; gtccagcatg tggggagctc agtt tattggggac ggtactgtgg gttgggggcc 2281 ttggatccca aataaatgag tagttcctct gcagtctaaa aaaaaaaaaa aaa Protein sequence: NCBI Reference Seguence: 727.1 LOCUS NP_054727 ACCESSION NP_054727 l meeadrilih squagtavp pdvqtlraft telvveavvr clrvinpavg sglspllpLa 6L msarfrlams laqacmdlgy plelqunfl ypsepdlrdl llflaerlpt dasedadqpa 12L lrai gsqirdqlal pvaphlrtp quhlqgsal qkpfhasrlv vpelssrgep l8; refqasplll prthpqu grvasllehh alqlcqqtgr drpgdedwvh rtsrlppqed 24L traqrquqk qltehqusw gllgapiqar dlgellqawg apkg srfthsekft ; fhlepqaqat qudvpatsr rpeqvtwaaq eqeleslreq legvnrsiee veadmktlgv 36; squaesecr aere qalrlksrav tanl aqulvvens aqrvihlagq 421 wekhrvplla eyrhlrqud crelessrrl aeiqelhqsv raaaeearrk eevykqlmse 481 letlprdvsr ilei vgnirquee itkilsdtke lqkeinslsg kldrtfavtd 541 akkd davrkaykyl aalhencsql iqtiedtgti mrevrdleeq ietelgkktl 601 snlekiredy raquenagl lgrvrea CL|C4 Official Symbol: CLIC4 Official Name: chloride intracellular channel 4 Gene ID: 25932 Organism: Homo sapiens Other Aliases: CLIC4L, H1, , huH1, p64H1 Other Designations: de intracellular channel 4 like; de intracellular l protein 4; intracellular chloride ion channel protein p64H1 Nucleotide seguence: NCBI Reference Seguence: NM_013943.2 LOCUS 943 ACCESSION NM_013943 1 ttattttccc cggagagtcc cgaggcgccg cgccttggcc ctgcctacag cccgaggccc 61 cggc gcccctccca gccgtttgaa gcggctcggg ctgcggctgg ctcagagtgg 12; cgcggggggc gtggggcggt gctgaggagc tgaagccgtg gccagctcga cgccggacag 18; tccagcgagc agcacggcgg gaaccggcag ccggagcagt cccggagcag aagcagcagc 24; agcagcagca gccctcgccg ttcgcggagc gcagccgagc cggccatggc gttgtcgatg ; ccgctgaatg ggctgaagga ggaggacaaa gagcccctca tcgagctctt cgtcaaggct 36; ggcagtgatg gtgaaagcat aggaaactgc cccttttccc agaggctctt catgattctt 421 tggctcaaag gagttgtatt tagtgtgacg actgttgacc tgaaaaggaa gccagcagac 48L ctgcagaact tggctcccgg gacccaccca ataa ctttcaacag tgaagtcaaa 54L acggatgtaa ttga ggaatttctt gaagaagtct tatgccctcc caagtactta 60; aagctttcac caaaacaccc agaatcaaat actgctggaa tggacatctt tgccaaattc 661 tctgcatata tcaagaattc aaggccagag gctaatgaag agag cctg 72; aaaaccctgc agaaactgga tgaatatctg aattctcctc tccctgatga aattgatgaa 78; aatagtatgg aggacataaa gttttctaca cgtaaatttc tggatggcaa tgaaatgaca 84; gatt gcaacctgct gcccaaactg catattgtca aggtggtggc caaaaaatat 90; cgcaactttg atattccaaa agaaatgact ggcatctgga gatacctaac taatgcatac 96; agtagggacg agttcaccaa tacctgtccc agtgataagg aggttgaaat agcatatagt L02; gatgtagcca aaagactcac caagtaaaat tgta aaagagatgt cttcatgtct L08; tcccctaaga atacgctttt cctaacaggc tactccttcc tgtagagcag aaattgtatt L14; ttgcacgaac ttat ttag gatcaaggat ggta cttaaaatat acactcctaa gcagtattat tttaaaatcc tttaccctgg cccc L26; ggtt cccctctctt taatttggag acactccacc acaaactttt cactttagag L32; gtagcttgcc atctctcagg agccctcacc attgtgtcca ttcactgtgt atagatggca L38; gaacttttga ggtgcaatgt ttaattgtta gtag ccacgacttt ctgg atgc atggtacaag ttat gtattttttg gaattttgta L50; atatttagta agagtatatg aaaggattgc tactgtatca ttgt ttcaatttag L56; tctatcctgg atatgtacta acgaatatta ccaccagaga agagagcttt ctacaaaagt L62; cactacagat tttgctatat tgctttgtag atagattttt acttttgcct attt L68; atccttcata ccaattgtaa catctgacac gaag ctaaaagttt agagggagtg L74; agggttttct cttc ctcaagcatt ttatctttag aagagaaact gatgggcacc L80; tgatactctg tctaaatacg tttgttatat tgcc ctgtgccatt catttggaac L86; tttattgcat tctttatttt aaaaagcttg tttttacgta atcatagagc ttgctatttg L92; tacatctgtt gagcaacact acataactga tttttagttg acttagctat agcagtacaa L98; tgattagtaa tgtaaaaatt aacacagaaa ttaacctaag gaatgaaggg tgggtttgtc 204; aaaatatcaa gtaaattttt gtttctaaag tacatttaat gtagatgacc taaagaatgc 210; gttatccatc ctatataaaa gaaagataaa acacaggtca ccaattttct catttcaccc 216; catttacctt gtatagagga ttgttcattc ctttgggact aagttatagt tatggtgagt 222; gtgtatttac tgtagttttg cctgatctca ctcattgcac ttcctggagt taaattttcc 228; aacagccatg ttgaggaata gcactctgca tgtttttgtt ttgtttttcg gggttttttt 234; taattgaagc cctaaaccag gaattatttg tgttctaaca ggaggatgaa cttgctgaaa 240; ataaaacttt gctatgtatt tactcttttt taaaagacaa aagcaaaacc agactttcta 246; cgtactactc caaagactgt gattgtgact ataatacatt tttggtaatt tttttatacc 252; taatttgtat aggaagtgct atttctcata ggctgtttct ttta agtttattgc 258; tttaaaatgg cagtgtttct cccactttga tatgctaaca tttagtaagc actggcttta 264; tgaaagcggc tttttataag tatactgcat tttttgagcc tatcattaat tagcttagta 270; tgaaagataa gaaaatctcc atgttgtatc catttggctc attc tttgccttac 276; ctttcttaga actctttatt gcttatcaaa agtttgagta cccgcttggt ttttttttgg 282; taattaaata gatt tatctggttc aaggaagatg cactattcag ttatctattg 288; agaaattatt ttgcagtggt tttagtgggt gaaaatgtcc catctgcacc agtacacagg 294: caggcattat cattcttcac ctacttttta aatagtggca acttgggatt cattctggtg 300; attctgaacc ttgcctcata gcttaaagta taaaaaagat tcaagagcag tgaggtttgt 306; tctttccagt gaatggtgga ctgagtggtg cgaggtggag caag aggaaagaac 312; tacattcttc cagt gatgaaaatt cattttgaaa ctcaaatatt ttcattttgg 318; atattctcct gtttttatta aaccagtgat tacacctggc catccctcta aatgttctag 324; gaaggcatgt ctattgtgat tttgatgaag acagaattat ctgt agaaacacag 330; ataccacttt atcagggaag aaat gaaatggaaa ttggtaaatg gacaaaagct 336; agctagtaaa aaggacgacc cagcaacatg ctttaacccc attgtatgtt tgtggaaaga 342; gcatagttta acatcttgag aaatttggga cataaagttt tcatggtaga cagttcatgc 348; tgaa ttgccataat ggaaataatc tgattttatt tttacaacta acatccattc 354; cccttcattt cttt tgtgttttac ttcagtgagg gagt ctgaatggat 360; ctgttttcca agagattctg tttt agca gttggaaagc tctctattct 366; agttgataaa acttcccttt tttgatgtag atgcagatat tctatacagt tctgttgtct 372; tttactagga ctgtaaactt ttgtgataaa attcaaataa gattttattt cttggtaatt 378; ttggctttca caatttatct ttaaatcctt gagcaatctg tatacaatta agagatttct 384; gacatttatt cttacactaa atggatcaac tctaggattt gtta gttg 390; tgttttgaat ctctccagag gtag atagcattta tttctgtgcc cttaaaccca 396; tttagaaaat aactacaaag taaaaatgta atag aaatgtattt tttcatgaac 402: attttgatac aaatttcatc atttaatgat tcaccaattt cttgcattaa tttgaattta 4081 agcatttaat tcaaagagag gggagcatcc gata catgtgggct tttaaaaact 414; ccatccttta taaatagtca aggtttgggc cacacaaagt atatttttat catggaaaaa 420; tttcaactcc tcaagccgta atgttgaaca gagt attttcttta cttg 426; aacaggcaaa tgaaagctta ttatagaatg catgtatttt cttttatctt tggaacatca 432; gcaccagtat attgctggca gctattgtat taaaaaataa agtatatttt cactatcata 4381 aaggattctt cccc tcatgaaaat aaacaacaac ttggggtaaa agtgaaaaaa 4441 aaaaaaaaaa aa Protein ce: NCBI Reference Seguence: NP_039234.1 mNP_039234 ION NP_039234 1 malsmplngl keedkeplie dge signcpfsqr 1fmi1w1kgv vfsvttvdlk 61 rkpadlqnla fitf nsevktdvnk evlc ppkylklspk hpesntagmd 121 ifakfsayik nsrpeaneal ergllkthk 1dey1nsp1p deidensmed kfld 181 gnemtladcn llpklhivkv vakkyrnfdi pkemtgiwry ltnaysrdef tntcpsdkev 241 vakr ltk Official Symbol: DLD Official Name: dihydrolipoamide dehydrogenase m: 1738 Organism: Homo sapiens Other Aliases: tcag7.39, DLDH, E3, GCSL, LAD, PHE3 Other Designations: E3 component of pyruvate dehydrogenase complex, 2-oxo- glutarate complex, branched chain keto acid dehydrogenase complex; diaphorase; dihydrolipoyl dehydrogenase, mitochondrial; glycine cleavage system L protein; glycine cleavage system protein L; lipoamide dehydrogenase; lipoamide reductase; lipoyl dehydrogenase Nucleotide seguence: NCBI Reference Seguence: NM_000108.3 m NM_000108 ION NM_000108 1 gatgacgtag gctgcgcctg tgcatgcgca gggaggggag accttggcgg agcggcggag 61 gcgcccagcg gaggtgaaag tattggcgga aaggaaaata cagcggaaaa atgcagagct 121 ggagtcgtgt gtactgctcc ttggccaaga gaggccattt aata tctcatggcc 18; tacagggact ttctgcagtg cctctgagaa cttacgcaga tcagccgatt gatgctgatg 24; taacagttat aggttctggt cctggaggat atgttgctgc tattaaagct gcccagttag ; gcttcaagac agtctgcatt gagaaaaatg aaacacttgg tggaacatgc gttg 36; gttgtattcc ttctaaggct ttattgaaca actctcatta ttaccatatg gcccatggaa 42; aagattttgc atctagagga attgaaatgt ccgaagttcg cttgaattta gacaagatga 48; tggagcagaa gagtactgca gtaaaagctt taacaggtgg aattgcccac ttattcaaac 54L agaataaggt tgtc aatggatatg gaaagataac tggcaaaaat caagtcactg 60; aagc tgatggcggc actcaggtta ttgatacaaa gaacattctt atagccacgg 66L aagt tactcctttt cctggaatca atga agatacaata gtgtcatcta 72; caggtgcttt atctttaaaa aaagttccag aaaagatggt tgttattggt gcaggagtaa 78; taggtgtaga attgggttca gtttggcaaa gtgc agatgtgaca gcagttgaat 84; ttttaggtca tgtaggtgga attg atatggagat atctaaaaac tttcaacgca 90; aaaa acaggggttt aaatttaaat tgaatacaaa ggttactggt gctaccaaga 96; agtcagatgg aaaaattgat gtttctattg cttc taaa gctgaagtta L02; tcacttgtga tgtactcttg gtttgcattg gccgacgacc ctttactaag aatttgggac L08; tagaagagct gggaattgaa ccca gaggtagaat tccagtcaat accagatttc L14; aaactaaaat tccaaatatc tatgccattg gtgatgtagt tgctggtcca atgctggctc L20; acaaagcaga ggatgaaggc attatctgtg ttgaaggaat ggctggtggt gctgtgcaca L26; acaa ttgtgtgcca tcagtgattt acacacaccc tgaagttgct tgggttggca L32; aaga gcagttgaaa gaagagggta ttgagtacaa agttgggaaa tttg L38; ctgctaacag cagagctaag acaaatgctg acacagatgg catggtgaag atccttgggc L44; agaaatcgac agacagagta ctgggagcac atattcttgg accaggtgct ggagaaatgg L50; taaatgaagc tgctcttgct ttggaatatg gagcatcctg tgaagatata gctagagtct L56; gtcatgcaca tccgacctta tcagaagctt aagc aaatcttgct caacttttga attagaagat tatatatatt tttttctgaa agcttttgta gaagtcacat tcctgaacag ctca cagctccaag actgaattat gaaacttttg gaaggtattt aataggtttg gacaaaatgg aatactctta L80; tatctatatt ttacataaat ttagtatttt gtttcagtgc actaatgtgt aagacaaaaa L86; gctacttatt gtagcatcct ggaatatctc cgtcaactca catg aagattcaat gcccctgaat ttaaatagct tttttctctg atacagaaaa gttgaatttt 198; acatggctgg agctagaatt tgatatgtga acagttgtgt ttgaagcaca gtgatcaagt 204; tatttttaat ttggttttca cattggaaac aagtcagtca ttcagatatg attcaaatgt 210; ctataaaccg aactgatgta agtaaacggt ctctcacttg ttttatttaa cctctaaatt 216; ctttcatttt aggggtagca ttga agaggtttta aagcttccat tgttgtctgc 222; aactctgaag ggtaattata tagttaccca aattaagaga gtctatttac ggaactcaaa 228; tacgtgggca tgta ttacagtggg gaatgaagat actgaaataa acgtcttaaa 234: tattcattta ctggttatca tgagtacgtg ttgagatggt catagttttt tttatgacta 240; cttctagtgt atattctaat ttcttttcta ggcctgaatg tatctttatt ttcatgttat 246; aggacaatat taaggcattt taaaggtcat catcctttca tctattttag atacacctac 2521 taaatgttta actt ttggagaagt acaacataag ggagtcttta atctgtgttt 258; tccttggctg tgac tgtttattta aagagtgttg taaaattgga tgtgtggtgt 264; tggc catgtcctga ttaa gtaacaaagt actaaatgct aagtaggctt 270; attg taactaaatt taagaataat tcagattaag tgaa atttggtata 276; gatagcatag attgtctcat gctcatgagt gacataatga ccctggattc tgttacatac 282; agaa aattgattgt tgtcttagga ggcagttaac ttggctgaac accaactcca 288; cactctgtct tgtttgtagg tggcagcagc tgaaatctct tctcagttgt tttagcttta 294; gctatgctgc tggaagtctt tcccatgcaa gtgtgtagtt caggggtcaa ccagagtttg 300; agga agtctgcccc ttctgtgcct cctgtttttt gggggtttcc cctttatgtt 306; ccagctgttg tggttgcccc atattctgcc ttctgatcct taaccaataa aacttggctt 312; ttgtttcccc ctcaagtgag aacccgttaa aaatgagaca ttgagccagt gctgttcact 318; ttttaagtgc caacttccct ctactttcca cttgtttata gttgtttcca gtgcctttag 324; ttttttctaa tttg ttcagagttt gcta tcagcaggag ggttggtctg 330; gtgt gctactttgc cattattgga agtgaactct gcatcttttt aaaaatttga 336; ggta tcatgtgaag tgctgtttat gtaaatctca acatatccct tactcaggga 342; aaaaaaagtt tttagttagg gaatagtgaa atataattta atatggaatt gtag 3481 agttaaatcc atctttaagt gtttacattc agtatgagaa tgcaaattta tctgtatggg 3541 gaataaagtc ctaggaataa aacaagtttt aagtgttca Protein seguence: NCBI Reference Seguence: NP_000099.2 LOCUS N P_000099 ACCESSION 099 1 mqswsrvycs lakrghfnri shglqglsav plrtyadqpi dadvtvigsg pggyvaaika 6; aqlgfktvci eknetlggtc pska llnnshyyhm ahgkdfasrg iemsevrlnl 12; ksta vkaltggiah lfkanvvhv ngygkitgkn qvtatkadgg tqvidtknil 18; iatgsevtpf pgitidedti vsstgalslk kvpekmvvig agvigvelgs qurlgadvt 24; aveflghvgg vgidmeiskn fqrilqkqgf kfklntkvtg atkksdgkid vsieaasggk 301 aevitcdvll vcigrrpftk nlgleelgie ldprgripvn trfqtkipni yaigdvvagp 361 mlahkaedeg iicvegmagg avhidyncvp sviythpeva wvgkseeqlk eegieykvgk 421 fpfaansrak tnadtdgmvk ilqustdrv gpga gemvneaala 1eygascedi 48; arvchahptl seafreanla asfgksinf ATAD3A Official Symbol: ATAD3A Official Name: ATPase family, AAA domain containing 3A m: 55210 Organism: Homo sapiens Other Aliases: RPS-83202.1 Other Designations: ATPase family AAA domain-containing n 3A tide seguence (variant 1): NCBI Reference Seguence: NM_018188.3 LOCUS 188 ACCESSION NM_018188 1 gtgtgtgtgg cgcctgcgca gtggcggtga ccaccggctc gcggcgcgtg gaggctgctc 61 gcgc gcgagtcaga ctcgggtggg ggtcccggcg gcggtagcgg cggcggcggt 121 gcgagcatgt cgtggctctt cggcattaac aagggcccca aagg cgcggggccg 181 ccgccgcctt tgccgcccgc gcagcccggg gccgagggcg gcggggaccg nggttggga 241 ccgg cgcccaagga caaatggagc aacttcgacc ccaccggcct ggagcgcgcc 301 gccaaggcgg cgcgcgagct ggagcactcg cgttatgcca aggacgccct gaatctggca 36; cagatgcagg agcagacgct gcagttggag caacagtcca agctcaaaat gcggctggaa 42; gccctgagcc tgctgcacac actagtctgg gcatggagtc tctgccgtgc cggagccgtg 48; cagacacagg agcggctgtc aggcagtgcc agccctgagc aagtgccagc tggtgagtgc 54; tgtgctctgc aggagtatga ggccgccgtg gagcagctca agagcgagca gatccgggcg 60; caggctgagg agaggaggaa gaccctgagc gaggagaccc acca ggccagggcc 66; cagtatcaag acaagctggc ccggcagcgc tacgaggacc aactgaagca gcagcaactt 721 ctcaatgagg agaatttacg gaagcaggag gagtccgtgc agga agccatgcgg 78L cgagccaccg tggagcggga gatggagctg cggcacaaga atgagatgct gcgagtggag 84L gccgaggccc gggcgcgcgc cgag aatg cagacatcat ccgcgagcag 90; atccgcctga aggcggccga gcaccgtcag accgtcttgg agtccatcag gacggctggc 96; accttgtttg gggaaggatt ccgtgccttt gtgacagact gggacaaagt gacagccacg L02; gtggctgggc tgacgctgct ggctgttggg gtctactcag ccaagaatgc cacgcttgtc L08; gccggccgct tcatcgaggc tcggctgggg aagccgtccc tagtgaggga atcacggtgc cgct gcggcacccc atccaggtca gccggcggct cctcagtcga L20; ccccaggacg Cgctggaggg tgttgtgctc agtcccagcc tggaagcacg cgac L26; atag caacaaggaa caccaagaag aaccgcagcc tgtacaggaa catcctgatg L32; tacgggccac caggcaccgg gaagacgctg tttgccaaga aactcgccct gcactcaggc L38; atggactacg tgac ggac gtggccccca tggggcggga aggcgtgacc L44; gccatgcaca agctctttga ctgggccaat accagccggc gcggcctcct gctctttgtg L50; gatgaagcgg acgccttcct tcggaagcga gccaccgaga agataagcga ggacctcagg L56; gccacactga tcct cacg ggccagcaca gcaacaagtt catgctggtc L62; ctggccagca accaaccaga gcagttcgac tgggccatca atgaccgcat caatgagatg L68; gtccacttcg acctgccagg gcaggaggaa cgggagcgcc tggtgagaat tgac L74; aagtatgttc ttaagccggc cacagaagga aagcagcgcc tggc ccagtttgac L80; tacgggagga agtgctcgga ggtcgctcgg ctgacggagg gcatgtcggg ccgggagatc L861 gctcagctgg ccgtgtcctg gcaggccacg gcct ccgaggacgg ggtcctgacc L92; gaggccatga tggacacccg cgtgcaagat gctgtccagc agcaccagca gaagatgtgc L98; tggctgaagg ggcc tgggcgtggg gacgagccct ccccatcctg agtccacagg 204; gagatccaca gctcacggag cctggccgcg gacccctccc acccctgcct tgccggcccc 210; tgcacattta ggatatgctc ctgggtgggg actgggctgt ggcc tctgtccccc 216; aggatgtctt gtggtgcggg tcggccgttc tgccccccag ggcaccccct gttgtaggca 222; ctggctaggg aggggcaggc ctccttcctg cccctcgaga tggg attt 228; tccgtctggc tcacaggggg agggtgaggc tttgcacccc agcccctgcc caggccactg 234; tgagggtggg tgctggctga gcccccgggg cagcaggagc caggcaggtg atgtctttgt 240; tctcggctcc cacagcagag ccaggtgagg gggcgcctgc cagggccaga cccaggtggg 246; gcagcctgaa ttcc ccctgtggcc ggcatgcccc gatctttcac acactggtga 252: ccctgagaga ggagggagga gggaacctgg cgggggtgtc tgaggccgca ctgtcagctg 258; ccaa gcctgtggct gggg tctgtttacc taataaagtc ccacaggtgc 2641 ctcattaaaa aaaaaa Protein seguence (variant 1 ): NCBI Reference Seguence: NP_060658.3 LOCUS N P_060658 ACCESSION NP_060658 l mswlfginkg pkgegagppp plppaquae gggdrglgdr papkdkwsnf dptgleraak 6; aarelehsry akdalnlaqm qeqthleqq sklkmrleal sllhtlvwaw slcragavqt 12; qerlsgsasp eqvpagecca aveq lkseqiraqa eerrktlsee trqhqaraqy 18; qdklarqrye dqlquqlln eenlrkqees vqkqeamrra tveremelrh knemlrveae 24; ararakaere nadiireqir lkaaehrqtv lesirtagtl fgegfrafvt dwdkvtatva ; gltllavgvy saknatlvag rfiearlgkp slvretsrit vlealrhpiq srpq 36; dalegvvlsp slearvrdia iatrntkknr slyrnilmyg ppgtgktlfa kklalhsgmd 42; yaimtggdva pmgregvtam hklfdwants rrglllfvde adaflrkrat ekisedlrat 48; lnaflyrtgq hsnkfmlvla dwa indrinemvh fdlpgqeere rlvrmyfdky 54; vlkpategkq fdyg arlt egmsgreiaq lavswqatay asedgvltea 60; mmdtrquav qqhqqkmcwl rgde psps Nucleotide seguence (variant 2): NCBI nce Seguence: NM_001170535.1 LOCUS NM_001 170535 ACCESSION NM_001 170535 l gtgtgtgtgg cgcctgcgca gtggcggtga ccaccggctc gcggcgcgtg gaggctgctc 6; ccagccgcgc gcgagtcaga ctcgggtggg ggtcccggcg gcggtagcgg ngcggcggt 12; gcgagcatgt cgtggctctt cggcattaac aagggcccca agggtgaagg cgcggggccg 18; ccgccgcctt tgccgcccgc gcagcccggg gccgagggcg gcggggaccg nggttggga 24; gaccggccgg cgcccaagga caaatggagc gacc ccaccggcct ggagcgcgcc ; gccaaggcgg cgcgcgagct ggagcactcg cgttatgcca aggacgccct gaatctggca 36; cagatgcagg agcagacgct gcagttggag caacagtcca agctcaaaga gtatgaggcc 42L gccgtggagc agctcaagag cgagcagatc cgggcgcagg ctgaggagag gaggaagacc 48L gagg agacccggca gcaccaggcc agggcccagt atcaagacaa gctggcccgg 54L cagcgctacg aact gaagcagcag caacttctca atgaggagaa tttacggaag 60; caggaggagt ccgtgcagaa gcaggaagcc atgcggcgag ccaccgtgga gcgggagatg 66; gagctgcggc acaagaatga gatgctgcga gtggaggccg gggc gcgcgccaag 72; gccgagcggg agaatgcaga catcatccgc gagcagatcc gcctgaaggc ggccgagcac 78; cgtcagaccg tcttggagtc catcaggacg gctggcacct tgtttgggga aggattccgt 84; gcctttgtga cagactggga caaagtgaca gtgg ctgggctgac gctgctggct 90; gttggggtct actcagccaa gaatgccacg cttgtcgccg gccgcttcat Cgaggctcgg 96; ctggggaagc Cgtccctagt gagggagacg tcccgcatca ttga ggcgctgcgg L02; caccccatcc aggtcagccg gcggctcctc agtcgacccc aggacgcgct ggagggtgtt L08; agtc ccagcctgga agcacgggtg cgcgacatcg ccatagcaac aaggaacacc L14; aagaagaacc gcagcctgta caggaacatc ctgatgtacg ggccaccagg caccgggaag L20; tttg ccaagaaact Cgccctgcac tcaggcatgg actacgccat catgacaggc L26; ggggacgtgg tggg gcgggaaggc gtgaccgcca tgcacaagct ctgg L32; acca gccggcgcgg gctc tttgtggatg aagcggacgc cttccttcgg L38; gcca ccgagaagat aagcgaggac ctcagggcca cactgaacgc cttcctgtac L44; cgcacgggcc agcacagcaa caagttcatg ctggtcctgg ccagcaacca accagagcag L50; tggg ccatcaatga ccgcatcaat gagatggtcc acttcgacct gccagggcag L561 gaggaacggg agcgcctggt gagaatgtat tttgacaagt atgttcttaa gccggccaca L62; gaaggaaagc agcgcctgaa gctggcccag tttgactacg ggaggaagtg gcat gtcgggccgg gagatcgctc ccgt gccacggcgt atgcctccga ggacggggtc ctgaccgagg ccatgatgga cacccgcgtg L80; caagatgctg agca ccagcagaag atgtgctggc tgaaggcgga agggcctggg 186; cgtggggacg agccctcccc atcctgagtc cacagggaga tccacagctc cctg 192; gccgcggacc cctcccaccc ctgccttgcc ggcccctgca catttaggat atgctcctgg 198; gtggggactg ggctgtgccc agggcctctg tcccccagga tgtcttgtgg tgcgggtcgg 204; ccgttctgcc ccccagggca ccccctgttg ctgg ctagggaggg ctcc 210; ttcctgcccc tcgagacact cttgggagat tccg tctggctcac agggggaggg 216; tgaggctttg caccccagcc cctgcccagg ccactgtgag ggtgggtgct ggctgagccc 222: ccggggcagc aggagccagg caggtgatgt tctc ggctcccaca gcagagccag 228; gtgagggggc gcctgccagg gccagaccca gcag cctgaaccct gcttccccct 234; gtggccggca tgccccgatc tttcacacac ccct gagagaggag ggaggaggga 2401 acctggcggg ggtgtctgag gccgcactgt cagctggccg gtccaagcct gtggctggag 246; ctggggtctg tttacctaat aaagtcccac aggtgcctca ttaaaaaaaa aa Protein seguence (variant 2): NCBI Reference Seguence: NP_001164006.1 LOCUS N P_001 164006 ACCESSION N P_001164006 l mswlfginkg pkgegagppp plppaquae gggdrglgdr papkdkwsnf dptgleraak 6; aarelehsry akdalnlaqm qeqthleqq sklkeyeaav qira qaeerrkth l2; eetrqhqara qudklarqr yedqlquql lneenlrkqe equkqeamr ratveremel 18; rhknemlrve aeararakae ireq irlkaaehrq tvlesirtag tlfgegfraf 24; vtdwdkvtat vagltllavg vysaknatlv agrfiearlg kpslvretsr itvlealrhp ; iqurrllsr pqdalegvvl rvrd iaiatrntkk nrslyrnilm ygppgtgktl 36; fakklalhsg tggd vapmgregvt amhklfdwan tsrrglllfv deadaflrkr 42; atekisedlr atlnaflyrt gqhsnkfmlv qfd waindrinem vhfdlpgqee 48; rerlvrmyfd kyvlkpateg qulklaqfd ygrkcsevar ltegmsgrei aqlavswqat 54L ayasedgvlt eammdtrqu aquhqqkmc wlkaegpgrg depsps Nucleotide seguence (variant 3): NCBI Reference Seguence: NM_001170536.1 LOCUS NM_001 170536 ACCESSION NM_001 170536 l cctg gcccttgccg ctcctcgccg gcag ccacttcccg ggcgagactg 6; cgcccccgga gcacccccgg ccgt gtcgcgtgcc gggaggatcg ttcc 12; gtcacccgtt tgcacctctg cagctgtcag gagcgggtca ggttatgcca aggacgccct 18; gaatctggca cagatgcagg agcagacgct gcagttggag caacagtcca agctcaaaga 24; gtatgaggcc gccgtggagc agctcaagag cgagcagatc cgggcgcagg agag ; gaggaagacc ctgagcgagg agacccggca gcaccaggcc agggcccagt atcaagacaa 36L gctggcccgg cagcgctacg aact gaagcagcag caacttctca atgaggagaa 42L tttacggaag caggaggagt ccgtgcagaa gcaggaagcc atgcggcgag ccaccgtgga 48L gcgggagatg gagctgcggc acaagaatga gatgctgcga gtggaggccg aggcccgggc 54L gcgcgccaag gccgagcggg agaatgcaga ccgc gagcagatcc gcctgaaggc 60; ggccgagcac Cgtcagaccg tcttggagtc gacg gctggcacct tgtttgggga 66; aggattccgt gcctttgtga cagactggga caaagtgaca gccacggtgg ctgggctgac 72; gctgctggct gttggggtct actcagccaa gaatgccacg cttgtcgccg gccgcttcat 78; Cgaggctcgg ctggggaagc cgtccctagt gagggagacg tcccgcatca cggtgcttga 84; ggcgctgcgg caccccatcc aggtcagccg gcggctcctc agtcgacccc aggacgcgct 90; ggagggtgtt gtgctcagtc ccagcctgga agcacgggtg cgcgacatcg ccatagcaac 96; aaggaacacc aacc gcagcctgta caggaacatc ctgatgtacg ggccaccagg L02; caccgggaag acgctgtttg ccaagaaact cgccctgcac tcaggcatgg ccat L08; catgacaggc ggggacgtgg cccccatggg gcgggaaggc gtgaccgcca tgcacaagct L14; ctttgactgg gccaatacca gccggcgcgg cctcctgctc tttgtggatg aagcggacgc L20; cttccttcgg gcca ccgagaagat aagcgaggac ctcagggcca cactgaacgc L26; cttcctgtac cgcacgggcc agcacagcaa catg ctggtcctgg ccagcaacca L32; accagagcag ttcgactggg ccatcaatga ccgcatcaat gagatggtcc acttcgacct L38; gccagggcag gaggaacggg agcgcctggt gagaatgtat tttgacaagt atgttcttaa L44; gccggccaca gaaggaaagc agcgcctgaa gctggcccag tttgactacg ggaggaagtg L501 ctcggagqtc gctcggctga cggagggcat ccgg gagatcgctc gccacggcgt atgcctccga ggacggggtc ctgaccgagg caagatgctg tccagcagca gaag atgtgctggc Cgtggggacg agccctcccc agtc cacagggaga gccgcggacc cctcccaccc tgcc ggcccctgca catttaggat 180; atgctcctgg gtggggactg ggctgtgccc agggcctctg tcccccagga tgtcttgtgg 186; tgcgggtcgg ccgttctgcc ccccagggca ccccctgttg taggcactgg ctagggaggg 192; gcaggcctcc ttcctgcccc tcgagacact cttgggagat gcattttccg tcac 198; agggggaggg tttg caccccagcc cctgcccagg ccactgtgag ggtgggtgct 204; gccc ccggggcagc aggagccagg caggtgatgt ctttgttctc ggctcccaca 210; gcagagccag gtgagggggc gcctgccagg gccagaccca ggtggggcag cctgaaccct 216: gcttccccct gtggccggca tgccccgatc tttcacacac tggtgaccct gagagaggag 2221 qqaqqaqqqa acctggcggg ggtgtctgag gccgcactgt cagctggccg gtccaagcct 228; gtggctggag ctgngtctg tttacctaat aaagtcccac aggtgcctca aaaa 2341 aa Protein seguence nt 3): NCBI Reference ce: NP_001164007.1 LOCUS N P_001 164007 ACCESSION N P_001164007 l mqeqthleq qsklkeyeaa veqlkseqir aqaeerrktl seetrqhqar aqudklarq 6" ryedqlquq llneenlrkq eequkqeam rratvereme lrhknemlrv eaeararaka 12; erenadiire qirlkaaehr qtvlesirta gtlfgegfra fvtdwdkvta tvagltllav 18; gvysaknatl vagrfiearl gkpslvrets ritvlealrh piqurrlls rpqdalegvv 24; lspslearvr diaiatrntk knrslyrnil mygppgtgkt lfakklalhs mtgg ; dvapmgregv tamhklfdwa ntsrrglllf vdeadaflrk ratekisedl ratlnaflyr 36; tgqhsnkfml eqf dwaindrine mvhfdlpgqe rmyf dkyvlkpate 42; gqulklaqf seva rltegmsgre iaqlavswqa tayasedgvl teammdtrvq 48; daquhqqkm cwlkaegpgr gdepsps Official Sym bol: PCBP2 al Name: poly(rC) binding protein 2 Gene ID: 5094 Organism: Homo sapiens Other Aliases: Other Designations: alpha—0P2; heterogeneous nuclear ribonucleoprotein E2; heterogenous nuclear ribonucleoprotein E2; hnRNP E2; poly(rC)-binding protein tide ce (variant 1): NCBI Reference Seguence: NM_005016.5 LOCUS NM_00501 6.
ACCESSION NM_005016 l cccagaccag cagaggcagc agccggagca gccgcagcct gcgccctctc ccgcccgccc 6L gccctccgcc cgcccgcccg ccctccgccg ccctccaccc gccccggggt ctctttcccc 12; cttcctcctc ctcctcctcc accccccctt cctcctccgc ccgcccgcgg ggcccccctc l8; gccttcccgc ccgcccctat tgttccgccc ccggcctccc gcccttcccc ttcccgcccg 24; ctcccctttt cccctcagtc gcctcgcgcc tgcagttttt ggctttcacc cagt ; gaccaaagac ttgaccactc aaagtccagc tccccagaac actgctcgac atggacaccg 36; gtgtgattga aggtggatta aatgtcactc tccg gctacttatg catggaaagg 42; gcag tatcatcgga aagaaaggag ttaa gaagatgcgc gaggagagtg 48; gtgcacgtat caacatctca gaagggaatt gtcctgagag aattatcact ttggctggac 54; ccactaatgc catcttcaaa gcta ttga caaactggaa gaggacataa 60; gcagctctat gaccaatagc acagctgcca gtagaccccc ggtcaccctg aggctggtgg 66; tccctgctag tcagtgtggc tctctcattg gaaaaggtgg atgcaagatc aaggaaatac 72; gagagagtac aggggctcag gtccaggtgg atat gctacccaac tcaactgagc 78; gggccatcac tattgctggc attccacaat ccatcattga gtgtgtcaaa cagatctgcg 84; tggtcatgtt ggagactctc tcccagtccc ccccgaaggg cgtgaccatc ccgtaccggc 90; cgtc cagctctccg gtcatctttg gtca ggacaggtac agcacaggca 96; gcgacagtgc gagctttccc cacaccaccc cgtccatgtg ccct gacctggagg 102; gaccacctct agaggcctat accattcaag gacagtatgc cattccacag ccagatttga 108; tgca ccagttggca atgcaacagt ctcattttcc catgacgcat ggcaacaccg 114; gattcagtgg cattgaatcc agctctccag aggtgaaagg ctattgggca ggtttggatg 120; catctgctca gactacttct catgaactca ccattccaaa cgatttgatt ggctgcataa 126; tcgggcgtca caaa atcaatgaga agat gtctggggcg cagatcaaaa L32; ttgcgaaccc agtggaagga tctactgata ggcaggttac tgga tctgctgcca L38; gcattagcct ggctcaatat ctaatcaatg tcaggctttc ctcggagacg ggtggcatgg L44; ggagcagcta gaacaatgca tcca taatcccttt ctgctgttca ccaccaccca L50; tgatccatct gtgtagtttc tgaacagtca gcgattccag gttttaaata gtttgtaaat L56; tttcagtttc tacacacttt atcatccact cgtgattttt taattaaagc gttttaattc L62; ctttctctgt tcagctgttg atgctgagat ccatatttag ttttataagc ttctccctgg L68: tttttttttt ttggctcatg aatttttctg tttgtcatgg aaatgtaaga gtggaatatt L74; aatacatttc agtttagttc tgtaatgtca ggaatttttc aaaaaaatta aaagatggac L80; tggagctttt tgaa tagaaactgg atgccacagt tgtg ggttttattc L86; ctcttgtctt gctgttattt ttgtaccttt tatccctcaa aggacccttc ttgggttttg L92; aatggaagcc ccgg ttaagatgtt ttcttctatt ttaccacttc catctttttt L98; tgtggccctc gatcctattt ttccctgact ccatgcttgg ttggccctta taaaacttgt 204; gcccaaaaga ttgaggatta gactttccga ggacttacct gtcctagggg agtaggcaag 210; cacttccact agggaggggg tgggggaaag gaatgacaca tgacatacat ggcatacaca 216; ttaagcagtt gatcatatgt ctgactgggt tccagtttct tgggaatgtt ggtccccttg 222; ttcaggcttg catattttaa aatt tcagtctatt gtttttagta acttcattta 228; tagtcctcca taacaagtta gaaggatgta acca tttattccta taattttaga 234; aagttggggc ttgacattat actcatttag taga tgcaaaaaag tggaggggca 240; ggagaacttc tccagacacc tcagataaag tccggagccc aaggctttat catg 246; accc cattcattca aaac cctcaacagc tgggcctgca gtta 252; tatttcaagg tttttcacag gggttacagt aggacagtcc ccaccccaat caggcaccag 258; gataaaagca gggacttaaa cagcaccccg gttcttcagc ctgagccatc acatgctatc 264; agtctcctaa cctccccctg ggccttaaga cagggcttgg gcagagaaga taaatggtgg 270; gacaaaaaaa tgagttacat tgccacctga gaaacctcag aggggaggac ttag 276; cctccctcct cccaagtgca aaatgtgtaa acagagtaaa cggaacagaa agtc 2821 taagtggttt tctctcctgc ccctcccacc gcccctcccc ccacccccta ttatttgggg 288; ataaagaata taaagacaac cctggctttt ctattgcctt gttgcttgct aagg 294; aatggggtgg ggcaggaagg ggcttgccct tagccacagc gctg tgcctcattc 300; atttccacag ctgccagtgt ccctagagtt tatcaggtga attggtcagg ggatcagtct 306; ccctcgagcc tgacttacgg ctgggacagc tttc ttat gtggcgcata 3121 tatatatata tatgtatata tatataattt atataaatat ttctctatgt aaaa 3181 aaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_005007.2 LOCUS N P_005007 ACCESSION NP_005007 l mdtgvieggl nvtltirllm hgkevgsiig kkgesvkkmr eesgarinis egncperiit 6l lagptnaifk dkle edisssmtns pvtl sch sligkggcki 121 keirestgaq quagdmlpn steraitiag ipqsiiecvk qicvvmletl sqsppkgvti 181 pyrpkpsssp Vifagqury stgsdsasfp httpsmclnp dlegppleay tingyaipq 241 pdltklhqla mqqshfpmth gntgfsgies sspevkgywa gldasaqtts heltipndli 301 gciigrqgak ineirqmsga qikianpveg titg saasislaqy linvrlsset 361 ggmgss PDL|M7 Official Symbol: PDL|M7 Official Name: PDZ and LIM domain 7 Gene ID: 9260 Organism: Homo sapiens Other Aliases: LMP1, LMP3 Other ations: 1110003801 Rik; LIM domain protein; LMP; Lim mineralization protein 3; PDZ and LIM domain protein 7; protein enigma Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_005451.3 LOCUS NM_005451 ACCESSION NM_005451 1 agaacactgg cggccgatcc caacgaggct ccctggagcc cgacgcagag cagcgccctg 61 gccgggccaa gcaggagccg gcatcatgga caaa gtagtgctgg aggggccagc 121 accttggggc ttccggctgc aagggggcaa caat ctct ccatttcccg 18; gctcactcct gggggcaaag cggcgcaggc cggagtggcc gtgggtgact gggtgctgag 24; catcgatggc gagaatgcgg gtagcctcac acacatcgaa gctcagaaca agatccgggc ; ctgcggggag cgcctcagcc tgggcctcag cagggcccag ccggttcaga gcaaaccgca 36; gaaggcctcc gccg cggaccctcc gcggtacacc tttgcaccca gcgtctccct 42; caacaagacg gcccggccct ttggggcgcc cccgcccgct gacagcgccc cgcagcagaa 48; tggacagccg ctccgaccgc tggtcccaga tgccagcaag cagcggctga tggagaacac 54L agaggactgg cggccgcggc cggggacagg ccagtcgcgt cgca tccttgccca 60; cctcacaggc accgagttca tgcaagaccc ggatgaggag cacctgaaga aatcaagcca 66L ggtgcccagg acagaagccc cagccccagc ctcatctaca ccccaggagc cctggcctgg 721 ccctaccgcc cccagcccta ccagccgccc gccctgggct cctg cgtttgccga 78; gcgctatgcc aaaa cgagcacagt gctgacccgg cacagccagc cgcc 84; cacgccgctg cagagccgca cctccattgt gcaggcagct gccggagggg tgccaggagg 90; gggcagcaac aacggcaaga ctcccgtgtg gtgc cacaaggtca tccggggccg 96; ggtg gcgctgggcc acgcgtacca cccggaggag tttgtgtgta gccagtgtgg L02; cctg gaagagggtg gcttctttga gggc gccatcttct gcccaccatg L08; ctatgacgtg Cgctatgcac ccagctgtgc caagtgcaag aagaagatta caggcgagat L14; catgcacgcc ctgaagatga cctggcacgt gcactgcttt acctgtgctg cctgcaagac L20; gcccatccgg aacagggcct tctacatgga cgtg ccctattgcg acta L26; tgagaagatg tttggcacga aatgccatgg ctgtgacttc aagatcgacg ctggggaccg L32; cttcctggag gccctgggct tcagctggca tgacacctgc ttcgtctgtg cgatatgtca L38; gatcaacctg aaga ccttctactc caagaaggac aggcctctct gcaagagcca L44; tgccttctct catgtgtgag ccccttctgc ctgc cgcggtggcc cctagcctga L50; ggggcctgga gtcgtggccc tgcatttctg ggtagggctg gcaatggttg ccttaaccct L56; ggctcctggc ccgagcctgg ggctccctgg gccctgcccc acccacctta cacc L62; ccactccctc caccaccaca gcacaccggt gctggccaca ccagccccct L68; gtgccacaat aaacctgtac ccagctgtg Protein seguence (variant 1): NCBI Reference Seguence: NP_005442.2 LOCUS N P_005442 ACCESSION NP_005442 l vleg papwgfrlqg plsi srltpggkaa qagvavgdwv lsidgenags 6; lthieaani racgerlslg lsraqpvqsk quasapaad pprytfapsv rpfg 12; apppadsapq qngqplrp1v pdasqulme ntedwrprpg fril ahltgtefmq 18; dpdeehlkks sqvprteapa passtpqepw pgptaPSpts rppwavdpaf aeryapdkts 24; tvltrhsqpa srts ivqaaaggvp gktp vchqchkvir grylvalgha ; yhpeefvcsq eggf feekgaifcp pcydvryaps cakckkkitg eimhalkmtw 36L caac ktpirnrafy ycer dyekmfgtkc hgcdfkidag drflealgfs 42L whdtcfvcai cqinlegktf yskkdrplck shafshv PDCD6 Official Symbol: PDCD6 Official Name: programmed cell death 6 Gene ID: 10016 Organism: Homo sapiens Other Aliases: ALG-2, PEF1 B Other Designations: apoptosis-linked gene 2 protein; probable m-binding protein ALG-2; programmed cell death protein 6 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_013232.3 LOCUS NM_01 3232 ACCESSION NM_013232 l gataatgcca ggccctgccc ccggcagagg cggaagcgga gtcggcctga gaggtctctc 6; gtcgctgcag gcgcctcagc ccagccgcgt gccttggccc atggccgcct actcttaccg 12; ccccggccct ggggccggcc ctgggcctgc tgcaggcgcg gcgctgccgg gctt l8; cctgtggaac gttttccaga gggtcgataa agacaggagt ggagtgatat cagacaccga 24L gcttcagcaa gctctctcca acggcacgtg gactcccttt aatccagtga ctgtcaggtc ; gatcatatcc atgtttgacc gtgagaacaa ggccggcgtg aacttcagcg agttcacggg 36; tgtgtggaag tacatcacgg actggcagaa cgtcttccgc acgtacgacc gggacaactc 42; cgggatgatc gataagaacg agctgaagca ggccctctca ggtttcggct accggctctc 48; tgaccagttc cacgacatcc tcattcgaaa gtttgacagg cagggacggg ggcagattgc 54; cgac ttcatccagg gctgcatcgt gagg ttgacggata tattcagacg 60; ttacgacacg gatcaggacg gctggattca ggtgtcgtac gaacagtacc tgtccatggt 66; cttcagtatc gtatgaccct ggcctctcgt gaagagcagc tgga caaa 72; atgtcacagt tcctatctgt gagggaatgg agcacaggtg cagttagatg ctgttcttcc 78; tttagatttt gtcacgtggg gacccagctg tacatatgtg gataagctga ttaatggttt 84; tgta atagtagctg tatcgttcta atgcagacat tggatttggt gactgtctca 90- ttgtgccatg aggtaaatgt aatgtttcag gcattctgct tgcaaaaaaa tctatcatgt 96L gcttttctag atgtctctgg ttctatagtg caaatgcttt tattagccaa taggaatttt 102; aaaataacat ggaacttaca caaaaggctt ttcatgtgcc ttactttttt aaaaaggagt 108; tatt cattggaata gtaa gcaataaagg gaatgttaga cgtgtaaaaa 1141 aaaaaaaaaa a Protein seguence ( variant 1): NCBI Reference Seguence: NP_037364.1 LOCUS N P_037364 ACCESSION NP_037364 1 maaysyrpgp gagpgpaaga alpdqsflwn qurvdkdrs gvisdtequ alsngtwtpf 61 npvtvrsiis mfdrenkagv nfseftngk yitdquvfr tydrdnsgmi dknelkqals 121 gfgyrlsdqf hdilirkfdr qgrgqiafdd fiqgcivlqr ltdifrrydt 181 eqylsmvfsi v ACTR2 Official Sym bol: ACTR2 Official Name: ARP2 actin-related protein 2 g (yeast) Gene ID: 10097 sm: Homo sapiens Other Aliases: ARP2 Other Designations: actin-Iike protein 2; actin-related protein 2 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001005386.2 LOCUS NM_001005386 ACCESSION NM_001005386 l gagctcaccg ctgccagtcg cgctgcctgc cacc cttttcgtgc aggcattcag 6; ctaaatgacg ggcggagccc ggcggcggct tccggtcggg ggaaaaaagt tgggccgaag 12; ccgg gaagacgcaa gaag cggc cgggcggcgg tggctgtagg 18; ggct gctc gggc ggacgatgga cagccagggc aggaaggtgg 24L tggtgtgcga caacggcacc gggtttgtga agtgtggata ctct aactttccag ; aacacatctt cccagctttg gttggaagac ctattatcag atcaaccacc aaagtgggaa 36L acattgaaat caagaataac aaaaagatgg atcttatggt tggtgatgag gaat 421 tacgatcaat gttagaagtt aactacccta tggaaaatgg acga aattgggatg 48; acatgaaaca cctgtgggac tacacatttg gaccagagaa acttaatata gataccagaa 54; attgtaaaat cttactcaca gaacctccta tgaacccaac caaaaacaga tagaggtaat gtttgaaact taccagtttt ccggtgtata tgtagccatc tgactttgta cgctcaaggt ttattgactg tggt agactctgga gatggtgtga 72; ctcacatttg cccagtatat gaaggctttt ctctccctca tcttaccagg agactggata 78; ttgctgggag ggatataact agatatctta tcaagctact tctgttgcga ggatacgcct 84; tcaaccactc tgctgatttt gaaacggttc ttaa agaaaaactg tgttacgtgg 90; gatataatat tgagcaagag cagaaactgg ccttagaaac cacagtatta gttgaatctt 96; atacactccc acgt atcatcaaag ttgggggaga gagatttgaa gcaccagaag L02; ctttatttca gcctcacttg atcaatgttg aaggagttgg tgttgctgaa ttgcttttta L08; acacaattca ggcagctgac attgatacca gatctgaatt ctacaaacac attgtgcttt L14; ctggagggtc tactatgtat cctggcctgc catcacggtt ggaacgagaa cttaaacagc L20; tttacttaga acgagttttg gatg tggaaaaact ttctaaattt aagatccgca L26; ttgaagaccc accccgcaga aagcacatgg tattcctggg tggtgcagtt ctagcggata L32; tcatgaaaga caaagacaac ttttggatga cccgacaaga gtaccaagaa aagggtgtcc L381 gtgtgctaga gaaacttggt gttc gataaactcc aaagcttgtt cccgtcatac L44; ccgtaatgct ttcttttttc ctttattgcc aatctttgaa ctcattcaac tccaggacat 150; ggaagagqcc tctctctgcc ctgg aaaggtcaag ttttattctg gaagctttgt taaatttttg ttaatgtggg taaatctgag tttaattcaa acatagacta gagggctaag gattctgtct gctgctttgt ttcttctaag taggcattta 168; gatcattcct ataggcttcc tattttcact ttactgctct aatgctgcta gtcgtagtct 174; ttagcacact aggtggtatg cctttattag cataaaacaa aaaaaacttt aacaggagct 180; tttacatatt actgggatgg ggggtggttc gggatgggtg tgct gaacccttta 186; gggcatttcc tctgtaatgt ggcgctttca ctgc ttta agtaccttaa 192; agcttctcct gtgaacttct tagggaaatg ttaggttcag aactaaagtg ttttgggtgg 198; gttttgttgc gggggggagg gtaacaatgg gtggtcttct gatttttatt tttgaggttt 204: tgtcaactgg agtacgtaga ttat ttacagtact ttgatttggc tctt 210; ctacttgtgc tctgcctgga gctgtttcca tatgatataa aaagcaagtg tagtattcca 216; ttactatgtg gcttagggat gttt tttaaaatca accatgttag ttag 222; actccctaca gtccttcaat ggaaaagtaa catttaaaaa tcctttgggt aattcgaatt 228; acagatttaa aagagcttaa gatctggtgt tttgttaatg cttctgttta ttccagaagc 234; attaaggtaa cccattgcca agtatcattc ttgcaaatta ttcttttata acca 240; gtgcttaata aaacaagcag gtacttacaa ataattactg ggtt ggtg 246; gtttaaaaat aacattggaa tacaggactt gttgccaatt gggtaatttt cattagttgt 252; tttgtttgtt tgaa acctggaaat acagtaaaat ttgactgttt aaaatgttgg 258; ccaaaaaaat caagatttaa tttttttatt tgtactgaaa aactaatcat aactgttaat 264; tctcagccat ctttgaagct tgaaagaaga gtctttggta ttttgtaaac gttagcagac 270; tttcctgcca gtgtcagaaa atcctattta tgaatcctgt cggtattcct tggtatctga 276; aaaaaatacc aaatagtacc atacatgagt tatttctaag tttgaaaaat aaaaagaaat 282; tgcatcacac taattacaaa atacaagttc tggaaaaaat cttc attttaaaac 288; ttttttttaa ctaataatag aaga ttaa tttgggggtg gtaactaaaa 294; tcaaaagaaa tgattgactt gagggtctct gtttggtaag aatacatcat tagcttaaat 300; aagcagcaga aggttagttt taattatgta gcttctgtta agtg gtct 306; gttttacctc aatttgaaca gataagtttg cctgcatgct ggacatgcct cagaaccctg 312: aatagcccgt actagatctt gggaacatgg atcttagagt cactttggaa ctta 3181 tataaatacc cccagccttt tgagaacggg gcttgttaaa ggacgcgtat gtagggcccg 324; tacctactgg cagttgggtt cagggaaatg ggattgactt ggccttcagg ctcctttggt 330; cataatttta aaatatggga gtagaaaaca acaaagaatg gaatggactc ttaaaacaat 336; gaaagagcat ttatcgtttg tcccttgaat tttg tttttgattt cataattctg 342; ctggtaaatg tgacagttaa aatggtgcat tatgtatata tattataatt tagaaatacc 348; attttataat tttactattc cagggtgaca taatgcattt aaatttggga tttgggtgga 354; gtattatgtt gagt tgtcaagtat gagtccctca ggaaaaaaaa aaaattctgt 360; aagc aatctgattc ttagctcttg aaactattgc tacttaaatt tccaataatt 366; aaaaatttaa aatttttaaa ttagaattgc caatacttct acatttgaga agggtttttt 372; tagaaataca tttagtaaag tccccaagac attagtctta catttaaact tttttcttta 378; aaacatggtt gtta acttttacac agttctgagt actgttaata tctggaaagt 384- atcttgagat ggaa agctaaacag tctaaattaa atac ttcattttga 3901 ttgagaaaat aaaatcagat tttttcaaag tcaaaaaaaa aaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_001005386.1 LOCUS N P_001005386 ACCESSION N P_001005386 1 mdsqgrkvvv cdngtgkac gyagsnfpeh ifpalvgrpi irsttkvgni eiknnkkmdl 61 mvgdeaselr smlevnypme ngivrnwddm khlwdytfgp eklnidtrnc killteppmn 121 ptknrekive vmfetyqfsg vyvaiqavlt lyaqglltgv vvdsgdgvth icpvyegfsl 181 phltrrldia ylik llllrgyafn hsadfetvrm ikeklcyvgy nieqeqklal 241 ettvlvesyt lpdgriikvg gerfeapeal fqphlinveg vgvaellfnt iqaadidtrs 301 efykhivlsg gstmypglps kqu gdve klskfkirie dpprrkhmvf 361 lggavladim kdkdnfwmtr qeyqekgvrv leklgvtvr TXNDC12 Official Symbol: TXNDC12 Official Name: thioredoxin domain containing 12 (endoplasmic lum) Gene ID: 51060 sm: Homo sapiens Other Aliases: UNQ713/PRO1376, AG1, AGR1, ERP16, ERP18, ERP19, PDIA16, TLP19, hAG-1,hTLP19 Other Designations: ER protein 18; ER protein 19; anterior gradient homolog 1; endoplasmic reticulum protein ERp19; asmic reticulum resident protein 18; endoplasmic reticulum resident protein 19; endoplasmic lum thioredoxin superfamily member, 18 kDa; protein ide isomerase family A, member 16; thioredoxin domain-containing protein 12; thioredoxin-like protein Nucleotide ce: NCBI Reference Seguence: NM_015913.3 LOCUS NM_015913 ACCESSION NM_015913 l agtgctagtg gcggcaggtg caggtggccg cgcggcatcc ttgc agtctcccga 6L gcgttctgtt gtgtccctgc ctacaatttt ccta ataaaagggt ggtcggtatg 12L tttttatttg ggtgtgtact tttgttaggt cgctttttcg ctatgcatta agtacggact l8; ttaggactca actagtacca ggaagaaaaa gaccggacat ctgt ttgttataca 24; gcgaagggga ggga agaaatcctc aagctacaag aacc agaagcttta ; cactttagcc tgcagtgact tatatcctgg tgtcctaagt ccacctaagt cagttttgca 36; ataagaggtc ccaagtttgg ttttcttgga gcttgcatca gttggttgca tcatccctga 42; gtagaagatt tgcggttgca aggaaaaata aggtacagag cttctccagc gggaaagtgc 48; atgtctgcac ggcacgagcc cacgcaccgc agaacaggct tgccaggtct cctcagagac 54; cctcgcagga acctaacaat gaaatccagt tgtccagtct tgatttgtgg aggggtaagg 60; agaatccgag gccagtgggc aatccgccca ctgttgggag acct cacgaatcaa 66; taatttgctt ttgactagga agtgcagcgg ttcttggggg gaggggctgg actgggtggc 72; ggacgcgagg agcaacggtt ctcccgaacc tctcccccgc ccctactatc ttggcctaca 78; ttttcccgct ccgtcccggg acctggacac ccagaatcca cgaaaagcaa ctcgcgctcg 84; agaacagctc tcgtaccctt ctacgtgatc ttta agctcactcc atcccaaacc 90; ggaccccgga ggcaccaccc acatccgtct aacatcactt ccttcagagt ttgaaaaaaa 96; aaaatctggg aagtagaggt gttgtgctga gcggcgctcg gcgaactgtg tggaccgtct L02; gctgggactc cggccctgcg tccgctcagc gccc cgcgcaccta ctgccatgga L08; gacgcggcct cgtctcgggg ccacctgttt gctgggcttc agtttcctgc ggacataatg ggcttggaaa tgga gatcatattc aaag aagcagctgc actg atgg tgtggagctt gcaaagctct aaagcccaaa tttgcagaat tcccataatt ttgttatggt aaatcttgag gatgaagagg aacccaaaga L38; tgaagatttc agccctgacg ggggttatat aatc ctttttctgg atcccagtgg 144; caaggtgcat cctgaaatca tcaatgagaa tggaaacccc aagt atgt 150; cagtgccgag caagttgttc aggggatgaa ggaagctcag gaaaggctga cgggtgatgc 156; cttcagaaag cttg aagatgaatt gtaacatgaa tgtgcccctt ctttcatcag 162; agttagtgtt ctggaaggaa agcagcaggg aagggaatat tgaggaatca tctagaacaa 168; ttaagccgac caggaaacct cattcctacc tacactggaa ggagcgctct cactgtggaa 174; gagttctgct aacagaagct ggtctgcatg tttgtggatc cagcggagag tcttctcctt ttccctctca cctaaatgtc aacttgtcat tgaatgtaaa aaacttgagc cacttggatg tttactcctc gcacttaagt cctcccactt tactcacctt agtggtgaaa tagt ctacaacgtt aaaattgcag ctta tcattaaaaa acaacaacaa caacaataac 204; aataaatcct aagtgtaaat cagttattct accccctacc aaggatatca gcctgttttt 210; tccctttttt ctcctgggaa taattgtggg cttcttccca aatttctaca gcctctttcc 216; tcttctcatg cttgagcttc cctgtttgca cgcatgcgtg tgcaggactg tgct 222; cggc tccaggtgga agcatgcttt ttac tgttggagaa acct 228; tcaagcccta ggtgtagcca ttttgtcaag tcatcaactg tatttttgta ctggcattaa 234; caaaaaaaga gataaaatat tgtaccatta aactttaata aaactttaaa aggaaaaaaa 2401 aaaa aa Protein seguence: NCBI Reference Seguence: NP_056997.1 LOCUS N P_056997 ACCESSION NP_056997 1 metrprlgat cllgfsflll vissdghngl gkgfgdhihw rtledgkkea aasglplmvi 61 ihkswcgack aest eiselshnfv mvnledeeep kdedfspdgg yiprilfldp 121 sgkvhpeiin engnpsykyf vvqg mkeaqerltg dafrkkhled e1 ANXA7 Official Symbol: ANXA7 Official Name: annexin A7 Gene ID: 310 Organism: Homo sapiens Other Aliases: RP11-537A6.8, ANX7, SNX, SYNEXIN CNherDesbnaflons:annex"1VH;annexm¥7 deseguence(vafiant1y NCBI Reference Seguence: NM_001156.3 LOCUS NM_001156 ACCESSION NM_001156 l ccaccctggg cccgcccccg gctccatctt gacc gggttgggct gtgacgctgc 6L tgctggggtc agaatgtcat acccaggcta tcccccaaca ggctacccac ctttccctgg 12L atatcctcct gcaggtcagg cttt tcccccttct ggtcagtatc cttatcctag l8; tggctttcct ccaatgggag gaggtgccta cccacaagtg ccaagtagtg gctacccagg 24; agctggaggc taccctgcgc ctggaggtta tccagcccct ggaggctatc ctggtgcccc ; acagccaggg ccat cctatcccgg agttcctcca ggccaaggat ttggagtccc 36; accaggtgga gcaggctttt atcc acagccacct tcacagtctt atggaggtgg 42; tccagcacag gttccactac ctggtggctt tcctggagga cagatgcctt ctcagtatcc 48; tggaggacaa cctacttacc ctagtcagcc agtg actcaggtca ctcaaggaac 54; tatccgacca gctgccaact tcgatgctat aagagatgca cttc gtaaggcaat 60; gaagggtttt gggacagatg agcaggcaat tgtggatgtg gtggccaacc gttccaatga 66; tcagaggcaa aaag ttaa gacctcctat ggcaaggatt taatcaaaga 72; tctcaaatca gagttaagtg gaaatatgga agaactgatc ctggccctct tcatgcctcc 78; tacgtattac gatgcctgga gcttacggaa agcaatgcag ggagcaggaa ctcaggaacg 84; tgtattgatt gagattttgt gcacaagaac aaatcaggaa atccgagaaa ttgtcagatg 90; ttatcagtca gaatttggac gagaccttga aaaggacatt aggtcagata catcaggaca 96; ttttgaacgt gtgt gcca gggaaatcgt gatgagaacc agagtataaa L02; ccaccaaatg gctcaggaag atgctcagcg tctctatcaa gctggtgagg ggagactagg L08: gaccgatgaa tcttgcttta acatgatcct tgccacaaga agctttcctc gcttattcta ggatggctaa tcgagacttg agtg tatgtagaaa gtggtttgaa gaccatcttg cagtgtgccc 1261 cttt gctgagaggc atgc tatgaaaggt gctggcacag atgactccac L32; cctggtccgg attgtggtca ctcgaagtga gattgacctt gtacaaataa aacagatgtt L38; cgctcagatg tatcagaaga ctctgggcac aatgattgca acga gtggagatta L44; ccgaagactt cttctggcta ttgtgggcca gtaggaggga tttttttttt tttaatgaaa L50; aaaaatttct attcatagct tatccttcag agcaatgacc tgcatgcagc aatatcaaac L56; atcagctaac cgaaagagct ttctgtcaag gaccgtatca gggtaatgtg cttggtttgc L62; acatgttgtt attgccttaa ttctaatttt attttgttct ctacatacaa tcaatgtaaa L68; gccatatcac aatgatacag taatattgca atgtttgtaa accttcattc ttactagttt L74; cattctaatc aagatgtcaa attgaataaa aatcacagca atctctgatt ctgtgtaata L80: atattgaata attttttaga aggttactga aagctctgcc ttccggaatc agtggatagt aaac tgtgtacttt aaaaaaaaat catctagaat aatttgcatc tcattttgcc taaattggtt ctgtattcat aaacactttc 1981 cacatagaaa atagattagt attacctgtg gcacctttta agaaagggtc aaatgtttat 204; atgcttaaga tacatagcct actttttttt cgcagttgtt ttcttttttt aaattgagtt 2101 atgacaaata aaaaattgca taag gtgtacaaaa aaaaaa Protein ce (variant 1 ): NCBI Reference Seguence: NP_001147.1 LOCUS NP_001147 ACCESSION NP_001147 1 msypgypptg yppa ppsg qypypsgfpp mgggayqup ssgypgaggy 6L papggypapg gypgapqug apsypgvppg qgfgvppgga gfsgypqpps qsygggpaqv 12; pggq mpsqypggqp typsqpatvt qvtqgtirpa anfdairdae ilrkamkgfg 18; vdvv anrsndqqu ikaafktsyg kdlikdlkse lsgnmeelil alfmpptyyd 24; awslrkamqg agtqervlie ilctrtnqei reivrcyqse fgrdlekdir sdtsghferl ; lvsmcqgnrd enqsinhqma qedaquyqa gegrlgtdes atrs qulratmea 36; ysrmanrdLl fsgy vesglktilq calnrpaffa erlyyamkga gtddstlvri 42; vvtrseidLv qikquaqmy qktlgtmiag rrll laivgq PFKM Official : PFKM Official Name: phosphofructokinase, muscle Gene ID: 5213 Organism: Homo sapiens Other Aliases: GSD7, PFK-1, PFK1, PFKA, PFKX Other Designations: 6—phosphofructokinase; 6-phosphofructokinase, muscle type; PFK-A; phosphofructokinase isozyme A; phosphofructokinase 1; phosphofructokinase, polypeptide X; phosphofructokinase-M; phosphohexokinase Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001166686.1 LOCUS NM_001 166686 ACCESSION NM_001 166686 1 gtcccagggg gcggggcaga ggcg ccggccccac agtgctcccc gcttccgccc 6; agtccagccc gggccggctg accgggtccg acacagtctc ctggaccagg ctccctccat 12; cctcacccct cccccagctt cccgccgcca ctcaccgaac cggaaccggc tgccatgcga l8; aggggtttcc ggccgggcgc ggaacgcaaa acccgggaac cgccgcgaac cggaaccgcc 24; gcac cggaagagtc aggc agccatgcat aaagacgagt ttcatctgaa ; atttttcatg tgtgtgattc agtctcgcca gttagtcagg actcctcaga gaacagctgg 36; ggaagcttct acttccagca tgctcatacc aaagccacca ccaaagacag acatcttgaa 42; gagtctagat actatggatg atccagacac aagc atacctgttt tcaaaactga 48; gtggatcatg acccatgaag agcaccatgc agccaaaacc ctggggattg gcaaagccat 54; tgctgtctta acctctggtg gagatgccca aggtatgaat gctgctgtca gggctgtggt 60; tcgagttggt atcttcaccg gtgcccgtgt tgtc catgagggtt atcaaggcct 66; ggtggatggt caca tcaaggaagc cacctgggag agcgtttcga tgatgcttca 72; gctgggaggc acggtgattg gaagtgcccg ggac tttcgggaac gagaaggacg 78; actccgagct gcctacaacc tggtgaagcg tgggatcacc aatctctgtg tcattggggg 84; tgatggcagc gggg ctgacacctt ccgttctgag tggagtgact gtga 90- cctccagaaa gcaggtaaga tcacagatga tacg aagtccagct acctgaacat 96L tgtgggcctg gttgggtcaa ttgacaatga cttctgtggc accgatatga ccattggcac 102; tgactctgcc ctgcatcgga tcatggaaat tgtagatgcc atcactacca ctgcccagag 108; gagg acatttgtgt tagaagtaat gggccgccac tgtggatacc tggcccttgt 114; cacctctctg tcctgtgggg gggt ttttattcct gaatgtccac cagatgacga 120; ctgggaggaa cacctttgtc gccgactcag cgagacaagg acccgtggtt ctcgtctcaa L26; catcatcatt gtggctgagg ttga caagaatgga aaaccaatca cctcagaaga L32; catcaagaat ctggtggtta agcgtctggg atatgacacc cgggttactg tcttggggca L38; tgtgcagagg ggtgggacgc cctt tgacagaatt agca ggatgggtgt L44; ggaagcagtg atggcacttt tggaggggac cccagatacc ccagcctgtg tagtgagcct L50; ctctggtaac caggctgtgc gcctgcccct catggaatgt gtga ccaaagatgt L56; gaccaaggcc atggatgaga agaaatttga cgaagccctg aagctgagag gccggagctt L62: catgaacaac tgggaggtgt acaagcttct agctcatgtc agacccccgg acagtggctg tgatgaacgt gggggctccg gctgcaggca actgtgagga ttggccttat ccagggcaac cgagtgctcg ggcctggcca aggggcagat agaggaagct ggctggagct ctggactggc caaggtggct ttgg gactaaaagg actctaccca agaagagctt -92; tgaacagatc agtgccaata taactaagtt taacattcag ggccttgtca ctttgaggct tacacagggg gcctggaact gatggagggc cagt ttgatgagct 204; ctgcatccca tttgtggtca ctac agtctccaac aatgtccctg gctcagactt 210; cagcgttggg gctgacacag cactcaatac tatctgcaca acctgtgacc agca 216; gtcagcagct ggcaccaagc gtcgggtgtt tatcattgag actatgggtg gctactgtgg 222; ctacctggct accatggctg cagc tggggccgat gctgcctaca tttttgagga 228; gcccttcacc attcgagacc tgcaggcaaa tgttgaacat ctggtgcaaa agatgaaaac 234; gaaa aggggcttgg tgttaaggaa tgaaaagtgc aatgagaact ataccactga 240; cttcattttc tact ctgaggaggg gaagggcatc ttcgacagca ggaagaatgt 246; gcttggtcac atgcagcagg gccc aaccccattt gataggaatt ctaa 252; gatgggcgcc aaggctatga actggatgtc tgggaaaatc aaagagagtt accgtaatgg 258; gcggatcttt gccaatactc cagattcggg ctgtgttctg gggatgcgta agagggctct 264; ggtcttccaa ccagtggctg agctgaagga ccagacagat tttgagcatc gaatccccaa 270; ggaacagtgg tggctgaaac tgaggcccat aatc ctagccaagt acgagattga 2761 cttggacact tcagaccatg cccacctgga gcacatcacc cggaagcggt ccggggaagc 282; tgccgtctaa acctctctgg agtgagggga atagattacc tgatcatggt cagctcacac 288; cctaataagt ccacatcttc tcagtgtttt agctgttttt ttcattaggt ttccttttat 294; tctgtacctt gcagccatga ccagttctgg ccaggagctg gaggagcagg cagtgggtgg 300; gagctccttt gaat ttaacatgac ttctgcccca gctttatctg tcacacaagg 306; ctgggcacct ctagtgctac tgctagatat cacttactca gttagaattt tcctaaaaat 312; aagctttatt tatttctttg tgataacaaa tggt tcctctacta cttttactac 318; agtgacaaat tgtaactaca ctaataaatg ccaactggtc actgtgcttt tgcttctcct 324; gttatcatct tcctaagtgg aatgtaatac tgtcagcccc atgtatcaga tctg 330; atgaagcagt aaagacgtta agggtatcac agggggtgga ggaagggatt atctctagta 336; cactacttgc tggctgtctg aaaaattgtc actgccaaac tctaaaaaca gttctaaata 342: gtgactgaga gttg ctggagtcag ggaataaggc agccaaatac caca 3481 gttctttagt gggaagagaa attaacaata aagc actgtgaaaa aaaaaa Protein seguence (variant 1): NCBI Reference ce: NP_001160158.1 LOCUS N P_001160158 ACCESSION 160158 l mhkdefhlkf fmcviqsrql vrtpqrtage astssmlipk pppktdilks ldtmddpdtv 6; gsipvfktew imtheehhaa ktlgigkaia vltsggdaqg mnaavravvr vgiftgarvf 12; fvhequglv dggdhikeat wesvsmmlql ggtvigsarc kdfreregrl vkrg 18; itnlcviggd gsltgadtfr sewsdllsdl qkagkitdee atkssylniv glvgsidndf 24; igtd salhrimeiv daitttaqsh qrtfvlevmg rhcgylalvt slscgadwvf ; ipecppdddw rlse trtrgsrlni iivaegaidk ngkpitsedi knlvvkrlgy 36; dtrvtvlghv safd rilgsrmgve egtp dtpacvvsls 42; ecqutkdvt kamdekkfde alklrgrsfm nnwevyklla hvrppvsksg shtvavmnvg 48; apaagmnaav rstvrigliq gnrvlvvhdg feglakgqie eagwsyvggw tgqggsklgt 54; krtlpkksfe qisanitkfn iqglviiggf eaytgglelm egrkqfdelc ipfvvipatv 60; snnvpgsdfs vgadtalnti cttcdrikqs aagtkrrvfi ietmggycgy latmaglaag 66; adaayifeep ftirdlqanv ehlvqkmktt vkrglvlrne kcnenyttdf ifnlyseegk 721 gifdsrknvl qhmqqgg8pt pfdrnfatkm gakamnwmsg kikesyrngr ifantpdsgc 78L vlgmrkralv lkdq tdfehripke qulklrpil kilakyeidl dtsdhahleh 84L itrkrsgeaa V SUB1 Official Symbol: SUB1 Official Name: SUB1 homolog (S. cerevisiae) Gene ID: 10923 Organism: Homo sapiens Other Aliases: P15, P04, p14 Other Designations: activated RNA rase II transcription cofactor 4; activated RNA polymerase II transcriptional coactivator p15; positive cofactor 4 Nucleotide seguence: NCBI Reference Seguence: 713.3 LOCUS NM_00671 3 ION NM_006713 l gccccatcac gtgaccgcag ccccagcgcg gcggggccgg cgtctcctgg ctgccgtcac 6; ttccggttct ctgtcagtcg cgagcgaacg accaagaggg tgttcgactg ctagagccga 12; gcgaagcgat gcctaaatca aaggaacttg tttcttcaag tggc agtgattctg 18; acagtgaggt tgacaaaaag ttaaagagga aaaagcaagt tgctccagaa aaacctgtaa 24; agaaacaaaa gacaggtgag acttcgagag ccctgtcatc ttctaaacag agcagcagca ; gcagagatga taacatgttt cagattggga aaatgaggta cgttagtgtt cgcgatttta 36; aaggcaaagt gctaattgat attagagaat attggatgga tcctgaaggt gaaatgaaac 42; caggaagaaa aggtatttct ttaaatccag aacaatggag gaag gaacagattt 48; ctgacattga tgatgcagta agaaaactgt aaaattcgag ccatataaat aaaacctgta 54; ctgttctagt tgttttaatc tgtcttttta cttt ctaa tcca 60; agctattgta tgtttggatt gcagaagaat gatg aatacttttt gtgc 66; attattaaaa atattgagtg aagctaattg tcaactttat taaggattac tttgtctgcc 72; caccacctag tgtaaaataa aatcaagtaa tacaatctta actgttgtgg ccttttttga 78; tcataagagt tggtactgtt taaggccaaa agtaacagtt tttatagatc ttttagtttc 84- aactcagctt taaa aaggatttgt attgcattga gtttataaac ttttggtttg 90; tgaacttcat atttgatctt ttctcttcca tgtc taggcttgtt tgacttccac 96L ccccaatggt ttttcactct ttttatttac ttcattttcc tttaataact taatctcttc 102; atgttcagtt tttacttcac tctttattct tttctttgat tatggtatgc ggaa 108; agtcagtgaa actgtcaaaa tgttatctca ataagatact tatatgagaa ctacaatcac 114; cgaatctact gtattcaata ttagcagatc taatttgata aacaacatgg cttgtgtgaa L20; aactgagcag gtgtttgttt acccatagtg ttctgtgtag ttattgctta gtctgcagaa L26; aataatgact tagatgagat gtctgacttg ctttcactta ttaaacatgt tggg L32; atgatgtctg taacatcaga tattgttcaa ctagactagg taaa aattgtgaaa L38; gcttactggc tttt attttataat attgggtatg aattatatgt agccagagat L44; gtcattaagc tttactgtta tagtaggtaa tatggttagt ggaa atat L50; gagcacatgc ttgtgtattt tggcctttgc cccagtagaa cagaccaatg gcattctaga L56: cttgatgata ctaagtttta gcagacacta gtaagtggtt tgtatttaac catactgatg L62; acag attgaggcac agattttagt ggctttgtgg caataaatag L68; gccttagqaa aagaatgttt ataaagggaa ttataactga aattaaagga ggcggcagtg L74; aagaggaaat aattctcttc tatctaaatg atatacatat gatattttga gatttttata L80; acagcagtgg aacacaattc taggtagagt agaaaaagga aagttttaaa agattcttgt tgacaaatta tttttggtag caaatctcaa atggttacct aagg tctgccatat tagagttttg cactattttg ctaccaagtt tgattcatac ttttgtagtt acttgtcaag gacttaattt gaaaatcatt tgccaggcca catagttatc 204; aatttttttt tctatcagct ttgt aaaa cattttttag atgacttttt 210; tatt tagcagtaac cttatgaggt tcaaattggt aaatctcttg taatttagcc 216; ttcatcgaat aataggtacc agtgtattaa aaatgtgtat tttttgcagc cccttgaacc 222; agagtaggtt cagagaaact cccaaagttt gtactttaga cacatcatgc ttgattggta 228; acttccctcc ttttttgggg tttg tgtcctatta acttaattgg atagattttt 234; ttct tatttttggc acacggaaag ggtagttcga gtacagaact ttgatttttg 240; gtgtagatgc agagggaatg atgggtaaat ttcctaggtt tatgtgaatt tagggggtgt 246; atgcattttg tcta gatg gtgctgaaat ctattaccta catgttttct 252; agttgttcag cattatgtta atgaagcctc agga gtgtttctct ggcacagttg 258; gtaagttgac tgctaacttc atttaaatgt gttactggat atgcagtata ctgaaattat 264; taatcagttt gtgtatagga aact gggttaaaag caaattaact tgttctgaaa 270; agaaagtata gattaatttt gttttctgtt taaattttat ctccttggta aagatttttt 276; tttcctgqgc agaaaacttg gcatttttag gcgtagatac cttaccttac aatgccaaaa 282; tgaatttaat tccagtactc aggtttttcc ctttaacaga ctctatgtgt atcagggctt 288; tctaatgggt ttttcctctt cgtttttaaa atgtgagtag catttgacca atttccagtg 294; ctcttagcat tttacttaaa gaacaaccac tacaaaagaa tgta atttgattgt 300; cttttgcttt gcttcattaa tgcctaagaa cttaagaata cctc attagctact 306; caagatgctg tgacgatcaa atctattcta cataatgcgt acaa agacttgggt 312; gaaaaatgaa ataagtatat tctgacttgg ctattgaggg gaaaattcag tattaagtgt 318; tcctcacagg agatatgtta gcagaatact ataaaagttt gaaattttta aaaagtaaaa 324; gtacttaaat ttaggtatct ctcctgaaat tctttgcagt tcatttttta tggcagttaa 330; tccagtgaaa cactcaaaag tttttttttt tttaaaagtg tttttccaga gtag 336- ggtgaacatt cacataatca caaatatgta attctgtaat tgtggaatgc ttgtatgctt 3421 tgttttcgta catcttccat gtct aata ctccatctgt gaatatttta 3481 aatgttgaaa taag aaatgtgaaa aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_006704.3 LOCUS NP_006704.3 ION NP_006704.3 1 mpkskelvss sssgsdsdse deklkrkkq vapekpvkkq ktgetsrals sskqssssrd 61 dnquigkmr yvsvrdfkgk vlidireywm dpegemkpgr kgislnpeqw sqlkeqisdi 121 ddavrkl ACDB3 Official Symbol: ACBD3 Official Name: acyl-CoA binding domain containing 3 Gene ID: 64746 Organism: Homo sapiens Other Aliases: GCP60, GOCAP1, GOLPH1, PAP7 Other Designations: Golgi resident protein GCP60; PBR- and PKA-associated protein 7; PKA (RIalpha)-associated protein; acyI-Coenzyme A binding domain containing 3; golgi complex ated protein 1, 60kDa; golgi phosphoprotein 1; peripheral benzodiazepine or-associated protein PAP7 tide seguence: NCBI Reference Seguence: 735.3 LOCUS NM_022735 ACCESSION NM_022735 l atacgtggct gccgtctgtc cccgctgagg aggtgcagca gccggagatg gcggcggtgc 6; caga gcgactcgag gtgtccgtcg acggcctcac gctcagcccg gagg 12; agcggcctgg ggcggagggc gccccgctgc tgccgccacc gctgccaccg ccctcgccac 18; ctggatccgg tcgcggcccg ggcgcctcag gggagcagcc cgagcccggg gaggcggcgg 24; ctgggggcgc ggcggaggag gcgcggcggc tggagcagcg ctggggtttc ggcctggagg ; agttgtacgg cctggcactg cgcttcttca aagaaaaaga tggcaaagca tttcatccaa 36L cttatgaaga aaaattgaag cttgtggcac tgcataagca agttcttatg ggcccatata 42; atccagacac ttgtcctgag gttggattct ttgatgtgtt tgac aggaggagag 48L aatgggcagc cctgggaaac atgtctaaag ccat ggtggagttt gtcaagctct 54L taaatagqtg ttgccatctc ttttcaacat atgttgcgtc ccacaaaata gagaaggaag 60; agcaagaaaa aaaaaggaag gaggaagagg agcgaaggcg gcgtgaagag gaagaaagag 66; aacgtctgca aaaggaggaa gagaaacgta ggagagaaga agaggaaagg cttcgacggg 72; aggaagagga aaggagacgg atagaagaag ttcg gttggagcag caaaagcagc 78; agataatggc agctttaaac tcccagactg ccgtgcagtt ccagcagtat gcagcccaac 84; agtatccagg gaactacgaa cagcagcaaa ttctcatccg ccagttgcag gagcaacact 90; atcagcagta catgcagcag ttgtatcaag tccagcttgc acagcaacag gcagcattac 96; agaaacaaca agta gtggctgggt cttccttgcc tacatcatca aaagtgaatg L02; tacc aagtaatatg atgtcagtta atggacaggc acac actgacagct L08; ccgaaaaaga actggaacca gaagctgcag aagaagccct ggagaatgga gaat L14; ctcttccagt aatagcagct ccatccatgt ggacacgacc tcagatcaaa gacttcaaag L20; agaagattca gcaggatgca gattccgtga ttacagtggg ccgaggagaa gtggtcactg L26; ttcgagtacc cacccatgaa tcat atctcttttg tgcc acagacaatt L32; atgacattgg ggtg tattttgaat ggacagactc tccaaacact gctgtcagcg L38; tgcatgtcag tgagtccagc gatgacgacg aggaggaaga agaaaacatc ggttgtgaag L44; agaaagccaa aaagaatgcc aacaagcctt atga gattgtgcct tgaggaggtg tatgctggca gccatcaata gaga tgacaactcc tactctttgt caaa atcagtctac attatactag ataaaaatgt tgttacaaag tctggagtct agggttgggc agaagatgac L68; atttaatttg gaaatttctt tttacttttg tggagcatta gagtcacagt ttaccttatt L74; gatattggtc tgatggtttg tgaactcttg atca aaatttcctt gagactcttt 180; agcattcata ctttggggtt aaaggagatt cctcagactc atccagccct tgggtgctga 186; gagt cactagtgga tgctgaagtt acatgagcta catgttaaat atttaaagtc 192; tccaaaataa aacaccccaa cgttgacctt acccggctga tggttagccc cttgctgcct 198; gctccatgtg tcttatgaga gcccgtagtt acagtgtcct ctaatttgaa atccataagt 204; taacaagtct atatcaggtg cagctggctt tgattaaagg ccatttttaa aacttaaaaa 210; ctcaacacct ttat aatagaaaaa gaaatggcct cagtttgatc tcgttcagaa 216; tgacccagat tgtttctgct ttgggtgcag ctgtttagtt cagagttata ttacagagaa 222; ttattttctg agataatctt aaactagaat aact aattgataat tgaagtatca 228; agatacgtag aacacctcag agatttttct tcaggaactt ccacaaactt tgaatccttg 234; tatctttatt tggtattcat actactagta gcaaaataca ggttttttgt tttgttttgt 240; tttgttttgg cttcatagag tatctcaaat tgaaactttt ctgcacaaag aataaaatta 246; ttat aaactcaaat tggcacctac tgaattaaaa aaat catttaaata 252; taattcagca tatgggaagt aacattgcac ggaa atcactgcca gagacagtct 258; attttctttt aatttgttac tacttagtca caaaccccac attattccag tttggaatta 264; cttattaagg agaattggaa atacatatgc ccatgcttaa attttatagc tttaatttgt 270; gttatttctt tattgacggg aagaggtaca tctttttttc cttactgaaa acaaatatgg 276; tgcc tcaaatttgt ataagtgatt ggctagtgat tcttgttttc agaagggaga 282; gtggtataga atga caaagatggc aatatacact tgtt attgtatgtt 288; gttactgaag gatt tttaaaattt ctaa atcacttctt gggt 294; tttcattaac tgcagtatat acagttcact acatatgggt tgtttgagtt gtgc 300; tgtatttctt tctgtttttt aatacctggt tttgtacata tctaactctg ttctcttttg 306; caga aactggattt tttttttctt tgct taatttgtgt tttttaattt 312; tgattcagaa gtagtcccag ctcataggtg ttcatactgt tacatccaga acatttgtca 318; ggctctctgt cagctttcat gtacatatgg tatagaaacc atggagttag gcacttcctg 324; gatttttttt ttatgagaaa aatactgtat ttaaaatgta aaataaactt ttaaaaagca 3301 aata tatatttctt ccagcctttg attacaaatt tgtccttgca aaga 336; tgaattatct cctaaaaata tcattgttct tgggagcagt gtatgttact ttacatagca 342; cctg tcatgtgttc atgtcagaat atttttggtt ttaaactttc ttattgcctt 348; tggctgttga ttagtacagt acaagtgcga tttcaaaaag atcttgaaag taatatattt 3541 aatcaattaa aatgtttatc tgtaaaaaaa aaaaaaaaaa a Protein seguence: NCBI Reference Seguence: NP_073572.2 LOCUS NP_073572 ION NP_073572 1 maavlnaerl evsvdgltls pdpeerpgae gapllppplp ppsppgsgrg pgasgeqpep 61 geaaaggaae qrwg fgleelygla kdgk afhptyeekl kqvl 121 mgpynpdth evgffdvlgn drrrewaalg nmskedamve kallnrcch 1fstyvashk 181 iekeeqekkr keeeerrrre eeererlqke eekrrreeee rlrreeeerr rieeerlrle 241 qqquimaal nsqtavqqu yaaqqypgny eqqqilirql qeqhyqqqu quqvqlaqq - qaalqquev vvagsslpts skvnatvpsn mmsvngqakt htdssekele peaaeealen 36; gpkeslpvia apsmwtrpqi kdfkekiqqd adsvitvgrg evvtvrvpth eegsylfwef 42; atdnydigfg dspn tavsvhvses sdddeeeeen igceekakkn ankplldeiv 48; pvyrrdchee Vyagshqypg rgvyllkfdn syslwrsksv yyrvyytr ASNA1 Official Symbol: ASNA1 Official Name: arsA arsenite transporter, nding, homolog 1 (bacterial) Gene ID: 439 Organism: Homo sapiens Other s: , ARSA1, ASNA-l, GETS, TRC40, hASNA-I Other Designations: ATPase ASNA1; arsenical pump-driving ATPase; arsenite- stimulated ATPase; golgi to ER traffic 3 homolog; transmembrane domain recognition complex 40 kDa ATPase subunit; embrane domain recognition complex, 40kDa Nucleotide seguence: NCBI Reference Seguence: NM_004317.2 LOCUS NM_00431 7 ACCESSION NM_004317 l gagccagttc caaaatggcg gcaggggtgg gggg ggttgaggca gaggagttcg 61 aagatgctcc tgatgtggag ccgctggagc ctacacttag caacatcatc gagcagcgca 121 gcctgaagtg gatcttcgtc gggggcaagg tggg cacc tgcagctgca 181 gcctggcagt ccagctctcc aaggggcgtg agagtgttct gatcatctcc acagacccag 24; cacacaacat ctcagatgct tttgaccaga agttctcaaa ggtgcctacc aaggtcaaag ; gctatgacaa cctctttgct atggagattg gcct gggcgtggcg gagctgcctg 36; acgagttctt cgaggaggac aacatgctga gcatgggcaa gaagatgatg caggaggcca 42; tgagcgcatt catc gcca atgc cgaggtcatg aggctggtga 48; agggcatgaa cttctcggtg gtggtatttg acacggcacc cacgggccac accctgaggc 54; actt ccccaccatc gtggagcggg gcctgggccg gcttatgcag atcaagaacc 601 agatcagccc tttcatctca cagatgtgca acatgctggg cctgggggac gcag 661 accagctggc ctccaagctg gaggagacgc tgcccgtcat ccgctcagtc agcgaacagt 721 tcaaggaccc tgagcagaca actttcatct gcgtatgcat tgctgagttc ctgtccctgt 781 atgagacaga gaggctgatc caggagctgg ccaagtgcaa gattgacaca cacaatataa 84; ttgtcaacca gctcgtcttc cccg agaagccctg caagatgtgt gaggcccgtc 90; acaagatcca ggccaagtat ctggaccaga tggaggacct gtatgaagac ttccacatcg 96; tgaagctgcc gctgttaccc catgaggtgc ggggggcaga caaggtcaac accttctcgg 102; ccctcctcct ggagccctac ccca gtgcccagta gcacagctgc cagccccaac 108; cgctgccatt tcacactcac ccct ccccaccccc tcggggcaga gtttgcacaa 114; agtccccccc cagg gggagccact tgggcaggag gcagggaggg gtccattccc 120; cctggtgggg ctggtgggga gctgtagttg ccccctacct ctcccacctc ttgctcttca 126; ataaaatgat cttaaactgc aaaaaaaaaa aa n seguence: NCBI nce Seguence: NP_004308.2 LOCUS N P_004308 ACCESSION NP_004308 1 maagvagwgv eaeefedapd vepleptlsn iieqrslkwi fvggkggvgk ttcscslavq 61 svli istdpahnis daqukfskv ptkvkgydnl fameidpslg vaelpdeffe 121 ednmlsmgkk mmqeamsafp gideamsyae vmrlvkgmnf svvvfdtapt ghtlrllnfp 181 tiverglgrl mqiknqispf isqmcnmlgl gdmnadqlas kleetlpvir svseqfkdpe 241 qttficvcia eflslyeter liqelakcki dthniivnql vfpdpekpck mcearhkiqa 301 kyldqmedly edfhivklpl lphevrgadk vntfsallle pykppsaq PSM D3 OfibblSwnbd:PSMD3 Official Name: proteasome (prosome, macropain) 268 subunit, non-ATPase, 3 m: 5709 Organism: Homo sapiens Other Aliases: P58, RPN3, S3, TSTA2 Other Designations: 26S proteasome non-ATPase regulatory subunit 3; 268 proteasome regulatory subunit RPN3; 268 proteasome regulatory subunit 83; some subunit p58; tissue ic transplantation n 2 Nucleotide seguence: NCBI Reference Seguence: NM_002809.3 LOCUS NM_002809 ACCESSION NM_002809 l gttgactcgg ccatcggcct gccgggcctg gcgtttccca gaaggcccag cgccgggaag 6; gggtttgcag ctgctccgtc atcgtgcggc ccgacgctat ctcgcgctcg tgtgcaggcc 12; cggctcggct cctggtcccc ggtgcgaggg ttaacgcgag gccccggcct cggtccccgg 18; actaggccgt gaccccgggt gccatgaagc aggagggctc ggcgcggcgc cgcggcgcgg 24; acaaggcgaa accgccgccc ggcggaggag aacaagaacc cccaccgccg ccggcccccc ; aggatgtgga agag gaggcagcga cgggtggcgg gggg gacg 36; gcaagacggc ggcggcagcg gctgagcact cccagcgaga gctggacaca gtcaccttgg 42; aggacatcaa ggagcacgtg aaacagctag agaaagcggt ttcaggcaag gagccgagat 48; tcgtgctgcg ggccctgcgg atgctgcctt cacg ccgcctcaac cactatgttc 54; tgtataaggc tgtgcagggc ttcttcactt caaataatgc cactcgagac tttttgctcc 60; tgga agagcccatg gacacagagg ctgatttaca gttccgtccc cgcacgggaa 66; aagctgcgtc gacacccctc ctgcctgaag tggaagccta tctccaactc ctcgtggtca 72; tgat gaacagcaag cgctacaaag aggcacagaa tgat gatctgatgc 78L agaagatcag tactcagaac cgccgggccc ttgt aaag tgttactatt 84L atcacgcccg ggtctatgag ttcctggaca agctggatgt ggtgcgcagc ttcttgcatg 90; ctcggctccg gacagctacg cttcggcatg acgcagacgg gcaggccacc aacc 96; tcctgctgcg gaattaccta agct tgtacgacca ggctgagaag ctggtgtcca 102; agtctgtgtt cccagagcag gccaacaaca atgagtgggc caggtacctc tactacacag L08; ggcgaatcaa agccatccag ctggagtact cagaggcccg gagaacgatg accaacgccc L14; ttcgcaaggc ccctcagcac acagctgtcg gcttcaaaca gacggtgcac aagcttctca L20; tcgtggtgga gctgttgctg ggggagatcc ctgaccggct gcagttccgc cagccctccc L26; gctc actcatgccc cttc tgactcaagc tgtcaggaca ggaaacctag L32; ccaagttcaa ccaggtcctg gatcagtttg agtt tcaagcagat gggacctaca L38; ccctaattat ccggctgcgg cacaacgtga ttaagacagg tgtacgcatg atcagcctct L44: cctattcccg aatctccttg gctgacatcg cccagaagct gcagttggat agccccgaag L50; atgcagagtt cattgttgcc atcc gggatggtgt cattgaggcc acgagaaggg ccaa gaga tgattgacat ctattccacc cgagagcccc 1621 agctagcctt ccaccagcgc atctccttct gcctagatat ccacaacatg ccatgaggtt tcctcccaaa tcgtacaaca aggacttgga gtctgcagag gaacggcgtg L74; agcgagaaca gcaggacttg gagtttgcca aggagatggc agaagatgat tcccttgagc tggggggctg gggaggggta gggggaatgg ggacaggctc gggggtcccc tgcccagggc actgtcccca ttttcccaca cacagctcat atgctgcatt L92; Cgtgcagggg gtgggggtgc tgggagccag tgac ctcccccagg gctcctcccc L98; agccggtgac ttactgtaca gcaggcagga gggtgggcag gcaacctccc nggcagggt 204; cctggccagc ggag caggagggga aggatagttc tgtgtactcc tttagggagt 210; gggggactag aactgggatg tcttggcttg tatgtttttt gaagcttcga ttatgatttt 2161 taaacaataa aaagttctcc acagtgc Protein sequence: NCBI Reference Seguence: NP_002800.2 LOCUS N P_002800 ACCESSION NP_002800 l mkqegsarrr gadkakpppg ggeqeppppp apqdvemkee aatgggstge aaaa 6L ehsqreldtv tledikehvk sgke alrm lpstsrrlnh yvlykavqgf 12L ftsnnatrdf llpfleepmd teadqurpr tgkaastpll peveaqull vvifmmnskr l8; ykeaqkisdd tan raldlvaakc yyyharvyef ldkldvvrsf lharlrtatl 241 rhdadgqatl lnlllrnylh yslydqaekl vsksvfpeqa nnnewaryly ytgrikaiql ; eysearrtmt nalrkapqht avgfkqtvhk llivvelllg urq pslkrslmpy 36; flltqavrtg nlakfnqvld dg tytliirlrh nviktgvrmi slsysrisLa 421 diaqqulds pedaefivak airdgvieas inhekgyvqs kemidiystr epqlathri 481 sfcldihnms ppks ynkdlesaee rrereqqdle fakemaeddd dsfp |DH1 Official : IDH1 Official Name: rate dehydrogenase 1 ), soluble m: 3417 Organism: Homo sapiens Other Aliases: IDCD, IDH, IDP, IDPC, PICD Other ations: NADP(+)-specific ICDH; NADP-dependent isocitrate dehydrogenase, cytosolic; NADP-dependent isocitrate dehydrogenase, peroxisomal; isocitrate dehydrogenase [NADP] cytoplasmic; oxalosuccinate decarboxylase tide ce: NCBI Reference Seguence: NM_005896.2 LOCUS NM_005896 ACCESSION NM_005896 l cctgtggtcc cgggtttctg cagagtctac ttcagaagcg gaggcactgg gagtccggtt 6; tgggattgcc aggctgtggt tctg agcttgtgag ngctgtggc gccccaactc 12; ttcgccagca tatcatcccg gcaggcgata aactacattc agttgagtct gcaagactgg 18; gaggaactgg ggtgataaga aatctattca ctgtcaaggt ttattgaagt caaaatgtcc 24; aaaaaaatca gtggcggttc tgtggtagag atgcaaggag atgaaatgac catt ; tgggaattga ttaaagagaa actcattttt ccctacgtgg aattggatct acatagctat 36; gatttaggca tagagaatcg tgatgccacc aacgaccaag tcaccaagga tgctgcagaa 42; gctataaaga agcataatgt tggcgtcaaa tgtgccacta tcactcctga tgagaagagg 48; gttgaggagt tcaagttgaa acaaatgtgg aaatcaccaa atggcaccat acgaaatatt 54L ctgggtggca cggtcttcag agaagccatt atctgcaaaa atatcccccg gcttgtgagt 60; ggatgggtaa aacctatcat cataggtcgt catgcttatg gggatcaata cagagcaact 66L gattttgttg ttcctgggcc agta gagataacct acacaccaag tgacggaacc 72; caaaaggtga catacctggt acataacttt ggtg gtggtgttgc catggggatg 78; tataatcaag caat tgaagatttt gcacacagtt ccttccaaat ggctctgtct 84; aagggttggc ctttgtatct gagcaccaaa aacactattc tgaagaaata tgatgggcgt 90; tttaaagaca tctttcagga tgac aagcagtaca agtcccagtt tgaagctcaa 96; tggt atgagcatag gctcatcgac gacatggtgg cccaagctat agag L02; ggaggcttca tctgggcctg taaaaactat gatggtgacg tgcagtcgga ctctgtggcc L08; caagggtatg gctctctcgg catgatgacc agcgtgctgg tttgtccaga gaca L14; gtagaagcag aggctgccca cgggactgta acccgtcact accgcatgta ccagaaagga L20: caggagacgt ccaccaatcc cattgcttcc atttttgcct ggaccagagg agagcaaagc ttgataacaa taaagagctt gccttctttg caaatgcttt ggaagaagtc L32; tctattgaga aggc tggcttcatg gact tggctgcttg 1381 ttacccaatg gttc tgactacttg aatacatttg agttcatgga taaacttgga L44; gaaaacttga aact agctcaggcc aaactttaag ttcatacctg gataattgtc ttttggtaac taggtctaca ggtttacatt tttctgtgtt ataaaggcaa aatcaatttt gtaatttgtt tagaagccag agtttatctt ttctataagt L62; ttacagcctt tttcttatat atacagttat tgccaccttt gtgaacatgg caagggactt L68; ttttacaatt tttattttat tttctagtac cagcctagga ttag tactcatttg L74; tattcactgt ttct catgttctaa ttataaatga ccaaaatcaa gattgctcaa L80; aagggtaaat gatagccaca ctcc ctaaaatatg cataaagtag aaattcactg L86; ccttcccctc ctgtccatga ccttgggcac agggaagttc tggtgtcata gatatcccgt L92; tttgtgaggt agagctgtgc attaaacttg cacatgactg gaacgaagta tgagtgcaac L98; gtgt tgaagatact gcagtcattt ttgtaaagac cttgctgaat gtttccaata 204; gactaaatac tgtttaggcc gcaggagagt ttggaatccg gaataaatac tacctggagg 210; tttgtcctct ccatttttct ctttctcctc ctggcctggc ctgaatatta tactactcta 216; aatagcatat ttcatccaag aatg taagctgaat cttttttgga cttctgctgg 222; cctgttttat ttcttttata taaatgtgat ttctcagaaa ttaa acactatctt 2281 atcttctcct gaactgttga ttttaattaa gtgc taattaccaa aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_005887.2 LOCUS N P_005887 ACCESSION NP_005887 1 mskkisggsv vemquemtr iiwelikekl ifpyveldlh enrd atndqvtkda 61 aeaikkhnvg vkcatitpde krveefklkq mwkspngtir nilggtvfre aiickniprl 121 vsngkpiii grhaygdqyr atdfvvpgpg tpsd gtqutylvh nfeegggvam 181 ksie dfahssfqma lskgwplyls tkntilkkyd grfkdifqei yquyksqfe 241 aqkiwyehrl iddmvaqamk seggfiwack nydgdvqsds vaqugslgm mtsvlvcpdg 301 ktveaeaahg tvtrhyrmyq tnpi trgl ahrakldnnk elaffanale 361 evsietieag fmtkdlaaci kglpnvqrsd ylntfefmdk lgenlkikla qakl KPNB1 Official Symbol: KPNB1 Official Name: karyopherin (importin) beta 1 Gene ID: 3837 Organism: Homo sapiens Other s: IMBf, IPO1, IPOB, Impnb, NTF97 Other Designations: PTA097; importin 1; importin 90; importin beta-1 subunit; importin subunit beta-1; importin-90; karyopherin subunit beta-1; nuclear factor p97; pore targeting complex 97 kDa subunit tide seguence: NCBI Reference Seguence: NM_002265.4 LOCUS NM_002265 ACCESSION NM_002265 1 cgct ccctccctgc gcgccgcctc tcactcacag cctcccttcc ctcc 61 ctccgcctcc cgagcaccag cgcgctctga gctgccccca gggtccctcc cccgccgcca 12; gcagcccatt tggagggagg aagtaaggga agaggagagg aaggggagcc ggaccgacta 18; cccagacaga gccggtgaat gggtttgtgg tgacccccgc cccccacccc accctccctt 24L cccacccgac ccccaacccc catccccagt tcgagccgcc gcccgaaagg ccgggccgtc ; gtcttaggag gagtcgccgc cgccgccacc tccgccatgg agctgatcac cattctcgag 36L aagaccgtgt ctcccgatcg gctggagctg gaagcggcgc agaagttcct ggagcgtgcg 421 gccgtggaga acctgcccac tttccttgtg gaactgtcca gagtgctggc aaatccagga 481 aacagtcagg ttgccagagt tgcagctggt atca agaactcttt gacatctaaa 54; gatccagata tcaaggcaca atatcagcag aggtggcttg ctattgatgc taatgctcga 60; cgagaagtca agaactatgt tttgcagaca ttgggtacag aaacttaccg gcctagttct 66; gcctcacagt gtgtggctgg ttgt atcc acca gtggccagaa 72; ctcattcctc agctggtggc caatgtcaca aaccccaaca gcacagagca catgaaggag 78; tcgacattgg aagccatcgg ttatatttgc caagatatag agca gctacaagat 84; aaatccaatg agattctgac aatc caggggatga aaga gcctagtaat 90L aatgtgaagc tagctgctac gaatgcactc ctgaactcat tggagttcac caaagcaaac 96L tttgataaag agtctgaaag gcactttatt atgcaggtgg tctgtgaagc gtgt L02; ccagatacga gggtacgagt ggctgcttta cagaatctgg tgaagataat gtccttatat 108; tatcagtaca tggagacata tatgggtcct gctctttttg caatcacaat cgaagcaatg L14; aaaagtgaca ttgatgaggt ggctttacaa gaat tctggtccaa gaggaaatgg atttggccat tgaagcttca gaggcagcag aacaaggacg gccccctgag L26; cacaccagca agttttatgc gaagggagca tatc tggttccaat L32; acactaacta aacaggacga aaatgatgat gacgatgact ggaacccctg caaagcagca L38; ggggtgtgcc tcatgcttct ggccacctgc gatg acattgtccc acatgtcctc L44; cccttcatta aagaacacat caagaaccca gattggcggt accgggatgc agcagtgatg L50; gcttttggtt gtatcttgga aggaccagag cccagtcagc tcaaaccact agttatacag L56; gctatgccca ccctaataga attaatgaaa gaccccagtg tagttgttcg agatacagct L62; gcatggactg taggcagaat gctg cttcctgaag ctgccatcaa ctac L68; ttggctcccc tgctacagtg tctgattgag ggtctcagtg ctgaacccag agtggcttca L74; aatgtgtgct gggctttctc cagtctggct gaagctgctt atgaagctgc agacgttgct L80; gatgatcagg aagaaccagc tacttactgc ttatcttctt catttgaact catagttcag L86; aagctcctag agactacaga cagacctgat ggacaccaga acaacctgag gagttctgca L92; tatgaatctc tgatggaaat tgtgaaaaac agtgccaagg atcc tgctgtccag L98: aaaacgactt tggtcatcat ggaacgactg gttc ttcagatgga gtcacatatc 2041 cagagcacat ccgatagaat ccagttcaat gaccttcagt ctttactctg tgcaactctt 210; cagaatgttc aagt gcaacatcaa gatgctttgc agatctctga tgtggttatg 216; gcctccctgt taaggatgtt ccaaagcaca gctgggtctg ggggagtaca tgcc 222; ctgatggcag ttagcacact agtg ttgggtggtg aattcctcaa gtacatggag 228; gcctttaaac ccttcctggg cattggatta aaaaattatg ctgaatacca ggtttgtttg 234; gcagctgtgg gcttagtggg agacttgtgc ctgc aatccaacat catacctttc 240; tgtgacgagg tgatgcagct gcttctggaa aatttgggga atgagaacgt ccacaggtct 246; gtgaagccgc agattctgtc agtgtttggt gatattgccc ttgctattgg aggagagttt 252; aaaaaatact tagaggttgt attgaatact cttcagcagg cctcccaagc ccaggtggac 258; aagtcagact atgacatggt ggattatctg aatgagctaa gggaaagctg cttggaagcc 264; tatactggaa tcgtccaggg attaaagggg gatcaggaga acgtacaccc ggatgtgatg 270; ctggtacaac ccagagtaga atttattctg tctttcattg accacattgc tggagatgag 276; gatcacacag atggagtagt agcttgtgct ctaa taggggactt atgtacagca 282; tttgggaagg atgtactgaa attagtagaa gctaggccaa tgatccatga attgttaact 288; gaagggcgga gatcgaagac agca aaaacccttg ctacatgggc aacaaaagaa 294; aaac tgaagaacca agcttgatct gttaccattg ggatgataac accc 300; ccactggaaa tctcccatct tttgaaaaac ctggaagtga ggagtgtgca ctga 306; atgtttggga atgagaggat gagtgagtga ggcttgaaaa cacaccacat tgaaaatcct 312; gccacagcag cagc cgccaacagc agcgctgtta gtgagctaag taagcactga 318; cttcgtagaa aaccataaca tcggccatct tggaaaagag aaaaacaatg gagttactta 324; tttaaaaaaa aagaaagaaa gttatctctt cccaggagag gctagaagta gcttttctgt 330; cttttggcca gtgccgagtg gaatgcctgg tttgggggag gaggagggac tgggttcagc 336; tgtggtgctt tgttgtaaaa ctgg cctttgctac tgaggagaaa gatggagcct 342; gggtctcaag cccaccttcg cttt gccacatggt actgtatgct tgccagctag 348; aaggagggtc tttt tacagtctga gaatgagtgt gtgtgagtga atcc 354; acattctcaa cttcaagtca ttgcagtttc tttttcccag aaaacaaggg gttagatgtt 360; cata aaactaaccg aagttctgtc tgca gcacaagaga tgtaaaaaaa 366; aaaaaaaaaa aaaaaaaaaa aacacacaca cagaggaaag acgctcttta ggttttgttt 372; tgtttttttt ttttggtttt gttttttgtt ttttttactc tagggaaaac actgacgaat 378; ggtcagagct cctatcctga catc aaggcgcctt tcctaataat atggttcaac 3841 tgtgaatgta gaagtggggg ggagggggga gaaa actctggcgt atat 390; agaaaaatat aagtacaatt gttacaaata acgcagactt caaaaacaaa aaaatcacaa 396; cccaaacaaa ttta aatgatcaga attggcagca caaagaaaac gccctctcct 402; gacttgtatt gtggcagtct gaacgccccc agaaaattgt gccaaagagt ttagaaaaat 408; aaatatacaa taaaagtaaa cacatacaca caaaacagca aacttcaggt aactattttg 4141 gattgcaaac aggataaatt tcaa tgat aaaataacca tttggaaact 4201 gaaaa n seguence: NCBI Reference Seguence: NP_002256.2 LOCUS N P_002256 ACCESSION NP_002256 l melitilekt vspdrlelea aqkfleraav lvel srvlanpgns qvarvaag_q 6L iknsltskdp dikaquqrw laidanarre vknyvlqtlg tetyrpssas qcvagiacae 12; eli pqlvanvtnp nstehmkest leaigyich idpeqlqdks neiltaiiqg 181 mrkeepsnnv klaatnalln sleftkanfd keserhfimq vvceatqcpd alqn 24; 1vkimslyyq ymetymgpal amks lqgi efwsnvcdee mdlaieasea ; aqurppeht skfyakgalq ylvpiltqtl tkqdendddd dwnpckaagv clmllatcce 361 ddivphvlpf ikehiknpdw ryrdaavmaf gcilegpeps qlkplviqam ptlielmkdp 42; taaw tvgricellp eaaindvyla pllqcliegl saeprvasnv cwafsslaea 48; ayeaadvadd qeepatycls ssfelivqkl lettdrpdgh qnnlrssaye slmeivknsa 54; kdcypavqkt tlvimerqu vlqmeshiqs tsdriqfndl qsllcathn vlrkvqhqda 60; lqisdvvmas llrqustag sgquedalm avstlvevlg geflkymeaf kpflgiglkn 66; yaeyqvclaa vglvgdlcra lqsniipfcd evmqlllenl gnenvhrsvk pqilsvfgdi 72; efkk ylevvlnth qasqaquks dydmvdylne 1rescleayt givqglkgdq 78; envhpdvmlv qprvefilsf idhiagdedh tdgvvacaag ligdlctafg kdvlklvear 84; pmihellteg rrsktnkakt latwatkelr klknqa DDX17 Official Symbol: DDX17 Official Name: DEAD (Asp-Glu-Ala-Asp) box helicase 17 m: 10521 Organism: Homo sapiens Other Aliases: RPS-434P1.1, P72, RH70 Other Designations: DEAD (Asp-Glu-Ala—Asp) box ptide 17; DEAD box protein p72; DEAD/H (Asp—Glu-Ala—Asp/His) box polypeptide 17 (72kD); RNA- dependent helicase p72; probable ATP-dependent RNA helicase DDX17 Nucleotide seguence (variant 1): NCBI Reference Seguence: 386.4 LOCUS NM_006386 ACCESSION NM_006386 l gttaagttgg agccgactca gcggcggccg ccattttgtg cagtcgctgg gaaggaagga 6L gacgcctaaa ccgcggcact gcccggtttg agcgtagcca aacctgccca ccggctttgt 12L agccccgatt ctctgtgttt tgctcccgtc tccgacgaga gcga cggtggcgtc l8; tgcgacggga gacagcgcgt cggagcgaga gagcgctgcg cctgccgccg ccccaacagc 24; ggaggcgccg ccgccatcgg tcgtcaccag accggagccg ctcc cgagcccggc ; catccgtgcc ccgctcccag atctctatcc ttttgggacc atgcgcggag gaggctttgg 36; ggaccgggac nggatcgtg accgtggagg atttggagca ggtg gtggccttcc 42; cccgaagaaa tttggtaatc ctggggagcg tttgcgtaaa aaaaagtggg atttgagtga 48; gctccccaag tttgagaaaa atgt ggaacatccg gaagtagcaa ggctgacacc 54; ggtt gatgagctac gccgaaagaa ggagattaca gggg gagatgtttg 60; tcctaaaccc gtgtttgcct tccatcatgc taacttccca caatatgtaa tggatgtgtt 66; gatggatcag cactttacag aaccaactcc aattcagtgc cagggatttc cgttggctct 72; tagtggccgg gatatggtgg gcattgctca gactggctct gggaagacgt tggcgtatct 78; cctgcctgca attgttcata ttaaccacca gccatacttg gaaaggggag atggcccaat 84; ctgtctagtt ctggctccta ccagagagct tgcccagcaa gtacagcagg tggccgatga 90; ctatggcaaa tgttctagat tgaagagtac ttgtatttat ggaggtgctc ctaaaggtcc 96; ccagattcga gacttggaaa gaggtgttga gatctgcata gccactcctg tgat L02; agatttcctg gagtcaggaa agacaaatct tcgccgatgt acttaccttg atgcttgata tggggtttga accccagatc cgtaaaattg aggcagacac ggag tgcaacctgg gaag cgtg attacaccca gatcaacgta ggcaatctgg L26; caaccacaac atcctccaga tagtggatgt ctgcatggaa aaag accacaagtt L32; gatccaacta atggaagaaa taatggctga aaaggaaaac aaaacaataa gacaaagaga Cgctgtgatg atctgactcg aaggatgcgc agagatggtt ggccagctat L44; gtgtatccat ggagacaaga gtcaaccaga aagagattgg gtacttaatg agttccgttc L50; tggaaaggca cccatcctta ttgctacaga tgtagcctcc cgtgggctag atgtggaaga L56; tgtcaagttt gtgatcaact atgactatcc aaacagctca gaggattatg tgcaccgtat L62; tggccgaaca gcccgtagca ccaacaaggg taccgcctat accttcttca ccccagggaa L68; cctaaaacag gccagagagc ttatcaaagt gctggaagag cagg ctatcaatcc L74; aaaactgatg cagcttgtgg accacagagg aggcggcgga ggcgggggtg ctcg L80; ttaccggacc acttcttcag atcc caatctgatg tatcaggatg ggagtcaagg atggtggccg gagagactct gcaagctatc agagctggtt atgctaatgg cagtggctat ggaagtccaa L98; tggagcacaa gcaggccaat acacctatgg cacc tatggggcag ctgcttatgg 204; caccagtagc tatacagctc aagaatatgg tgctggcact tatggagcta gtagcaccac 210; ctcaactggg agaagttcac agagctctag ccagcagttt agtgggatag ctgg 216; gcagcagcca cagccactga tgtcacaaca gtttgcacag cctccgggag ctaccaatat 222; gataggttac atggggcaga ctgcctacca ataccctcct cctcctcccc ctcctcctcc 228; ttcacgtaaa tgaaaccact gtag tgactccagc agacttaatt acattttaag 234; gaacactgtc tttccttttt ttttcctctt cgccttttct ttttttttcc ttttttcttt 240; tttttttttt aatttttccc cccaaccatc gtgatttgtc ttttcatgca gattagttag 246; aattcactgc caggtttctt ctgcccacca aaatgatcca gtctggaata acattttgta 252; aaaaaaaaaa aaatatatat atatatatat agctgactgg aagagattaa tttcttcccc 258; caacttcttg catgttgaag atatttgagc tatttttcat ctaaaagagt aaggtattag 264; gcccttttgt ccca gttt ttctgagttg gtggggaggg agggaggggg 270; agggctgaat tgttttgcag aggaagatgg catctgtgct ttaaatttct cattactggg 276; aaca aagagggatt caca ttttcttttg tgcttttaaa tgtttcttaa 282; gttggaacag gtttcctcgg gcctgttttg actgattgct ggagtgcatt tgatagttaa 288; aaattactaa ttggttttat ttcccttcac actctgcctc cccacttctc ttac 2941 ataa tagt gtcaggctag aaattgaatt gctgagtttt gtgtatcctt 300; taaattaaaa agtg tttattgtag tggttaaact gtagcatctc tggg 306; tggaagctgc ctatatttct tcccagttta actggggacc atctgtgaaa ttaattttcc 312; atccagacag ctgctgtgag caaatgaaca taaatgctcg ctggaaattt cagt 318; ttttatattg acctgcagtg taaaaagcac atttaattat aaacaatata ttcaaaatgg 324; gcaaatttta ttttcaaatg cagtgtagag ctagattaaa agcaactctt tgccacctac 330; tctgcccttt tggcaaagtt accttgaaca aagaatctta agggtttatt aagaactctt 336; tattttcttc ataccctgtt ctctgcagtg ctttctaaca gcttctgggt gcagattttc 342; ttcggcatcc ttttgcactc agcttattac aggtaggtag tgcttaagaa aagtcatgga 348; ggactaaagc ctaagtcctt ttcacttttc ctccatctga aggtaggtga gttcatcctc 354; ttcatggtaa tgctgtttta ccaagacttt atagcagatg gacccagaaa tctg 360: ctattgtgtt aaca ggatagggac atcagacagc cccagaaacc ccttccagat 366; ctgatatggg actattaatt tttatgctgt taattggtat tcattcacaa tgcagttgaa 372; gggggaaggc tccactgcat tctttggcta aggcctgaat gcttgctcat ctgtaagatc 3781 tatactcgag gttttgtttt ccttttaaaa aggg agagagggat ggtttctgag 384; gggttctgaa agtatgattc aatgtgcaac atacaggtag gtcttcagca taagctgaaa 390; tatatgcatg taaaaacttt tttt tttttaattt tccactttct tcttaacttt 396; acttctcttt ttgtcccccc cccatcttac agaagttgag gccaagggag aatggtaggc 402; acagaagaaa aaac tgctctgtgc tttcaaacca aagtgttccc cccaacccca 408; aatttgtcta agcactggcc agtctgttgt gggcattgtt ttctacaacc aaattctggg 414; tctt ctttctttaa acatagaggt accaccacaa gggatgccct tcgc 420; agctcttgaa agcatctgtt tgagggaaag gtctctgggc gtgg ttatttggat 426; tgcttgcttc cctttttcca acat tgtaatcata aaataacagt aaattccaaa 432; cctcaaaaac tattatggcc tgagcacagc tgaaatctag cagagtttaa tgcc 438; tccatgtctg tcacttataa ttcaggttct gctgttggct tcagaacatg agcagaagaa 444; tcgttttatg ctagttattg cattcatggt tgaaactcaa cttagggaaa gggttccaat 450; gtattaagca atgggctgct tctccccaat cctccctaac aattcgttgt gtggacttct 456; catctaaaag gttagtggct tttgcttggg atcagtgctc tctattgatg ctgg 462; tctccagaca cattcctgtt agac gact tgtagatgtg tgatgttcag 468; gcacaggatg ctgaaagcta tgttactatt ttgt aaattgtcct tttgatacca 474; tcatcttgtt ttctttttgt aaat aaaaacactg ttgacaataa aaaaaaaaaa 4801 aaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_006377.2 LOCUS N P_006377 ACCESSION NP_006377 l mptgfvapil CV:.lpsptre aatvasatgd saseresaap aaaptaeapp pepq 6; alpspairap ;pdlypfgtm rgggfgdrdr fgar gggglppkkf lrkk 12; kwdlselpkf eknfyvehpe varltpyevd elrrkkeitv rggdvcpkpv fafhhanqu 18; yvmdvlmdqh fteptpich gfplalsgrd mvgiaqtgsg ktlayllpai vhinhqpy;e 24; rgdgpiclvl aptrelaqu quaddygkc srlkstciyg gapkgpqird icia ; tpgrlidfle sgktnlrrct ylvldeadrm ldmgfepqir kivdqirpdr qtlmwsatwp 36; kevrqlaedf lrdytqinvg nlelsanhni lqivdvcmes ekdhkliqlm eeimaekenk 42; tiifvetkrr cddltrrmrr dgwpamcihg dksqperdwv lnefrsgkap iliatdvasr 48; gldvedkav inydypnsse dyvhrigrta tayt fftpgnlkqa relikvleea 54; nqainpklmq lvdhrggggg gggrsryrtt ssannpnlmy qdecdrrlrg vkdggrrdsa 60; etdr agyangsgyg spnsafgaqa gqytygqgty gaaaygtssy taqeygagty 66; gassttstgr qqfs gigrsgqqpq plmsqqfaqp pgatnmigym gqtayqyppp 72; ppppppsrk M6PRBP1 Official Symbol: PLIN3 Official Name: perilipin 3 m: 10226 Organism: Homo sapiens Other Aliases: M6PRBP1, PP17, T|P47 Other Designations: 47 kDa MPR-binding protein; cargo selection protein TlP47; mannosephosphate receptor-binding n 1 ; perilipin-3; tal n 17; tail-interacting protein, 47 kD tide seguence (variant 1): NCBI Reference Seguence: NM_005817.4 LOCUS NM_00581 7 ACCESSION NM_005817 l tggcgcgggc aatccctcaa cctgattggt cccctcgccc gtcactccag tgcgccccca 6; acctaccacg cagtaaaagc cgcc tcggcccgga cggtttccaa gctggttttg 12; aagtcgcggc agctgttcct gggacgtccg gttgaccgcg cgtctgctgc agagaccatg 18; gacg gggcagaggc tgatggcagc acccaggtga cagtggaaga accggtacag 24; cagcccagtg tggtggaccg cagc atgcctctga tcagctccac ctgcgacatg ; gtgtccgcag cctatgcctc caccaaggag agctacccgc acatcaagac tgtctgcgac 36; gcagcagaga agggagtgag gaccctcacg gcggctgctg tcagcggggc tcagccgatc 42L ctctccaagc tggagcccca gattgcatca gaat acgcccacag ggac 48L aagttggagg agaacctccc catcctgcag cagcccacgg agaaggtcct ggcggacacc 54L aaggagcttg tgtcgtctaa gggg gcccaagaga tggtgtctag cgccaaggac 60; gcca cccaattgtc ggaggcggtg gacgcgaccc gcggtgctgt gcagagcggc 66; gtggacaaga caaagtccgt agtgaccggc caat ngtcatggg ctcccgcttg 72; ggccagatgg tgttgagtgg ggtcgacacg gtgctgggga agtcggagga gtgggcggac 78; aaccacctgc cccttacgga tgccgaactg gcccgcatcg ccacatccct ggatggcttt 84; gacgtcgcgt ccgtgcagca gcagcggcag gaacagagct acttcgtacg tctgggctcc 90; ctgtcggaga ggctgcggca gcacgcctat gagcactcgc tgggcaagct tcgagccacc 96; aagcagaggg cacaggaggc gcag ctgtcgcagg gcct gatggaaact L02; gtcaagcaag atca gaagctggtg gaaggccagg agaagctgca ccagatgtgg L08; tgga agca gctccagggc cccgagaagg agccgcccaa gccagagcag L14; gtcgagtccc gggcgctcac catgttccgg gacattgccc agcaactgca ggccacctgt L20; ctgg ggtccagcat tcagggcctc cccaccaatg tgaaggacca ggtgcagcag L26; gcccgccgcc aggtggagga ggcc acgttttcca gcatccactc cttccaggac L32; ctgtccagca gcattctggc ccagagccgt gagcgtgtcg ccagcgcccg cgaggccctg L38; gaccacatgg tggaatatgt ggcccagaac acacctgtca cgtggctcgt cttt L44; gcccctggaa tcactgagaa ggag gagaagaagt agggggagag gagaggactc L50; agcgggcccc gtctctataa tgcagctgtg ctctggagtc ctcaacccgg aaacttattt tctagccact cctcccagct cttctgtgct gtccacttgg gaagctaagg L62; ctctcaaaac gggcatcacc cagttgaccc atctctcagc ctctctgagc cctgttctga gcctcaccct atcagtcagt agagagagat gtccagaaaa aggaaagttc tcccctgcag aatttttttt ccttgttaaa tatcaggaat tgcggtggct cacacctgta atcccagcac tttgggaggc tgaggcgggc ggaacacctg 186; aggtcaggtg acca gccaggccaa catggtgaaa ccccgtctct actaaaaata 192; caaaaaaaaa tgagccgggc atggtagcag gtgtctgtta tcccagttag gaggctgagg 198; caagagaatc tcttgaacct gagaggcgga ggttgcagtg agccaagatc gcgccattgc 204; actccagcct gggggacaag agtgagactt agtctcaaaa aaaaaaaaaa agaaaaaaaa 210; gata tagttcatat cccacttctt tgtttacacc gatgtccctg aatatcagcc 216; tgtagctaat ggacttggga tttctggtct aagtgggcct cctggggatg gggtggtaca 222- ctgagcttct gagcctcatt gtagagtaga ctgg tgtg gtaagccttg 228; ttgaaatgct ctggtattca gtattgcctt aataaacttc acccacaact gcatacaggc 2341 aaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_005808.3 LOCUS N P_005808 ION NP_005808 l msadgaeadg stqvtveepv qqpsvvdrva smplisstcd mvsaayastk esyphiktvc 6; daaekgvrtl taaavsgaqp ilsklepqia hrgl dkleenlpil qqptekvlad 12; tkelvsskvs gaqemvssak dtvatqlsea vdatrgavqs gvdktksvvt gqusvmgsr 18; lgqmvlsgvd tvlgkseewa dnhlpltdae lariatsldg fdvaquqqr qeqsyfvr;g 24; uha yehslgklra tquaqeall qlsqvlslme tvkqgvqul vegqeklhqm ; wlswnqkqlq gpekeppkpe qvesraltmf rdiaqqlqat ctslgssiqg lptnvkdqvq 36; qarrqvedlq atfssihsfq dlsssilaqs rervasarea ldhmveyvaq ntpvtwlvgp 42; fapgitekap eekk E|F4A3 Official Symbol: El F4A3 Official Name: otic ation initiation factor 4A3 m: 9775 Organism: Homo sapiens Other Aliases: DDX48, NMP265, NUK34, elF4A||| Other Designations: ATP-dependent RNA helicase DDX48; ATP-dependent RNA helicase 3; DEAD lu-Ala-Asp) box polypeptide 48; DEAD box protein 48; NMP 265; elF—4A—lll; elF4A-lll; eukaryotic initiation factor 4A-lll; eukaryotic initiation factor 4A—like ; eukaryotic translation tion factor 4A; hNMP 265; nuclear matrix protein 265 Nucleotide seguence: NCBI Reference Seguence: NM_014740.3 LOCUS NM_014740 ION NM_014740 l acgcacgcac gtctctcgct ttcgcatact taaggcgtct gttctcggca gcggcacagc 6L gaggtcggca gcggcacagc gaggtcggca gcggcacagc gaggtcggca gcggcacagc 12L gaggtcggca gcggcagcga ggtcggcagc ggcacagcga ggtcggcagc ggcagcgagg l8; tcggcagcgg Cgcgcgctgt gctcttccgc ggactctgaa tcatggcgac cacggccacg 24; atggcgacct nggctcggc gcgaaagcgg aaag aggaagacat gactaaagtg ; gaattcgaga ccagcgagga ggtggatgtg acgt tcgacaccat gggcctgcgg 36; gaggacctgc tgcggggcat ctacgcttac ggttttgaaa aaccatcagc aatccagcaa 42; cgagcaatca agcagatcat caaagggaga gatgtcatcg cacagtctca gtccggcaca 48; ggaaaaacag ccaccttcag tatctcagtc ctccagtgtt ttca ggttcgtgaa 54; actcaagctt tgatcttggc tcccacaaga gagttggctg tgcagatcca gctg 60; cttgctctcg gtgactacat gaatgtccag tgccatgcct gagg tgtt 66; ggcgaggaca tcaggaagct ggattacgga cagcatgttg tcgcgggcac tccagggcgt 72; gtttttgata tgattcgtcg cagaagccta cgtg ctatcaaaat gttggttttg 78; gatgaagctg atgaaatgtt gaataaaggt ttcaaagagc agatttacga tgtatacagg 84; tacctgcctc cagccacaca ggtggttctc atcagtgcca cgctgccaca tctg 90; gagatgacca tcat gaccgaccca atccgcatct tggtgaaacg tgatgaattg 96; actctggaag gcatcaagca atttttcgtg gcagtggaga gggaagagtg gaaatttgac L02; actctgtgtg acga cacactgacc atcactcagg cggtcatctt ctgcaacacc L08: aaaagaaagg tggactggct gacggagaaa atgagggaag tcac atgcatggag acatgcccca gaaagagcgg gagtccatca tgaaggagtt gccagccgag tgcttatttc tacagatgtc tgggccaggg ggttggatgt tccctcatca ttaactatga tctccctaat aacagagaat tgtacataca cagaattggg L32; agatcaggtc gatacggccg gaagggtgtg gccattaact ttgtaaagaa cgcatcctca tcga gcagtactat tccactcaga ttgatgagat gccgatgaac 1441 gttgctgatc ttatctgaag cagcagatca tgag ggagactgtt cacctgctgt 1501 gtactcctgt ttggaagtat ccag attctactta ttta tatggacttt 1561 cttctcataa atggcctgcc gtctcccttc agag gatatgggga ttctgctctc 1621 ttttcttatt tacatgtaaa taatacattg ttctaagtct ttttcattaa aaatttaaaa 1681 cttttcccat tata cttctaaggt gccaccacct tctctagtaa ctta Protein seguence: NCBI Reference Seguence: NP_055555.1 LOCUS NP_055555 ACCESSION 555 1 mattatmats gsarkrllke efet seevdvtptf dtmglredll rgiyaygfek 61 psaiqqraik qiikgrdvia qsqsgtgkta tfsisvlqcl diquetqal ilaptrelav 121 qiqullalg dymnvqchac iggtnvgedi rkldygqhvv agtpgrvfdm irrrslrtra 181 ikmlvldead emlnkgfkeq iydvyrylpp atqvvlisat 1phei1emtn kfmtdpiril 241 vkrdeltleg ikqffvaver eewkfdtlcd 1ydt1titqa vifcntkrkv dwltekmrea 301 nftvssmhgd mqueresim kefrsgasrv listdearg 1dquvslii nydlpnnrel 361 yihrigrsgr ygrkgvainf vknddirilr stqi vadl i Official Symbol: IQGAP1 Official Name: IO motif containing GTPase activating protein 1 m: 8826 Organism: Homo sapiens Other Aliases: HUMORFA01, SAR1, p195 Other Designations: RasGAP-Iike with IQ motifs; ras GTPase-activating-like n IOGAP1 Nucleotide seguence: NCBI Reference Seguence: NM_003870.3 LOCUS NM_003870 ACCESSION NM_003870 1 ggaccccggc aagcccgcgc acttggcagg agctgtagct accgccgtcc gcgcctccaa 6; ggtttcacgg cttcctcagc agagactcgg gctcgtccgc cgcc gcagacgagg 12; ttgacgggct gggcgtggcc cact atggctctgt cctggataat gaaagactta 18; ctgcagagga gatggatgaa aggagacgtc agaacgtggc ttatgagtac ctttgtcatt 24; tggaagaagc gaagaggtgg atggaagcat gcctagggga agatctgcct cccaccacag ; aactggagga ggggcttagg aatggggtct accttgccaa actggggaac ttcttctctc 36; ccaaagtagt gtccctgaaa aaaatctatg atcgagaaca gaccagatac aaggcgactg 42L gcctccactt tagacacact gataatgtga ttcagtggtt gaatgccatg gatgagattg 48L gattgcctaa ttac ccagaaacta cagatatcta tgatcgaaag ccaa 54L gatgtatcta ctgtatccat agtt tgtacctgtt caagctaggc ctggcccctc 60; agattcaaga cctatatgga aaggttgact tcacagaaga caac aaga 66; ctgagttgga gaagtatggc atccagatgc ttag caagattggg ttgg 72; ctaatgaact gtcagtggat gaagccgcat tacatgctgc tgttattgct attaatgaag 78; accg tagaattcca gccgacacat ttgcagcttt gaaaaatccg aatgccatgc 84; ttgtaaatct tgaagagccc ttggcatcca cttaccagga tatactttac caggctaagc 90; aggacaaaat gacaaatgct aaaaacagga cagaaaactc agagagagaa agagatgttt 96; atgaggagct gctcacgcaa gctgaaattc atat aaacaaagtc tttt L02; ctgcattagc aaatatcgac ctggctttag aacaaggaga tgcactggcc ttgttcaggg L08; ctctgcagtc accagccctg gggcttcgag gactgcagca acagaatagc gactggtact L14; tgaagcagct cctgagtgat aaacagcaga agagacagag tggtcagact gaccccctgc L20; agaaggagga gctgcagtct ggagtggatg ctgcaaacag tgctgcccag caatatcaga L26; gaagattggc agcagtagca ctgattaatg ctgcaatcca gaagggtgtt gctgagaaga L32; ctgttttgga actgatgaat cccgaagccc agctgcccca tcca tttgccgccg L38; atctctatca gaaggagctg gctaccctgc aaag tcctgaacat aatctcaccc L44; acccagagct ctctgtcgca gtggagatgt tgtcatcggt ggccctgatc aacagggcat L50; tggaatcagg agatgtgaat acagtgtgga agcaattgag cagttcagtt actggtctta L561 ccaatattga ggaagaaaac tgtcagaggt atctcgatga gttgatgaaa aggcacatgc taat gaattcatta catggaatga tatccaagct atgtgaacct ggtggtgcaa gaggaacatg agaggatttt agccattggt ttaattaatg aagccctgga tgaaggtgat gcccaaaaga ctctgcaggc cctacagatt aggg agtccttgca gaagtggccc agcattacca agacacgctg attagagcga 186; agagagagaa agcccaggaa gatg agtcagctgt gttatggttg gatgaaattc 192; aaggtggaat ctggcagtcc aacaaagaca cccaagaagc acagaagttt gccttaggaa 198; tctttgccat taatgaggca agtg gtgatgttgg caaaacactg agtgcccttc 204; gctcccctga tgttggcttg tatggagtca tccctgagtg tggtgaaact taccacagtg 210; atcttgctga agccaagaag aaaaaactgg cagtaggaga taataacagc gtga 216; agcactgggt aaaaggtgga tattattatt accacaatct ggagacccag gaaggaggat 222; gggatgaacc tccaaatttt gtgcaaaatt ctatgcagct ttctcgggag gagatccaga 228; gttctatctc tggggtgact gccgcatata accgagaaca gctgtggctg gccaatgaag 234; gcctgatcac caggctgcag gctcgctgcc gtggatactt agttcgacag gaattccgat 240; tgaa tttcctgaag aaacaaatcc ctgccatcac ctgcattcag tcacagtgga 246; gaggatacaa gcagaagaag caag atcggttagc ttacctgcgc tcccacaaag 252; atgaagttgt aaagattcag tccctggcaa ggatgcacca agctcgaaag cgctatcgag 258; atcgcctgca gtacttccgg gaccatataa atgacattat caaaatccag gcttttattc 264; gggcaaacaa agctcgggat gactacaaga ctctcatcaa tgctgaggat cctcctatgg 270; ttgtggtccg aaaatttgtc cacctgctgg accaaagtga ccaggatttt caggaggagc 276; ttgaccttat gaagatgcgg gaagaggtta tcaccctcat tcgttctaac cagcagctgg 282; agaatgacct caatctcatg gatatcaaaa ttggactgct agtgaaaaat aagattacgt 288; atgt ggtttcccac aaac ttaccaaaaa ggaa cagttgtctg 294; atatgatgat gataaataaa ggag aggc caag gagaagagag 300; agaagttgga agcttaccag cacctgtttt tgca aaccaatccc acctatctgg 306; ccaagctcat ttttcagatg ccccagaaca agtccaccaa gttcatggac tctgtaatct 312; tcacactcta caactacgcg tccaaccagc gagaggagta cctgctcctg cggctcttta 318; agacagcact ccaagaggaa atcaagtcga aggtagatca gattcaagag attgtgacag 324; gaaatcctac ggttattaaa atggttgtaa gtttcaaccg tggtgcccgt ggccagaatg 330; ccctgagaca gatcttggcc gtga aggaaattat ggatgacaaa tctctcaaca 3361 tcaaaactga ccctgtggat atttacaaat ttaa tcagatggag tctcagacag 342; gagaggcaag caaactgccc gtga cccctgagca ggcgctagct gaag 348; tgaagacacg gctagacagc tccatcagga acatgcgggc tgtgacagac aagtttctct 354; cagccattgt cagctctgtg gacaaaatcc cttatgggat gcgcttcatt gtgc 360; tgaaggactc gttgcatgag aagttccctg gtga ggatgagctg ctgaagatta 366; ttggtaactt gctttattat Cgatacatga atccagccat tgttgctcct gatgcctttg 372; acatcattga cctgtcagca ggaggccagc ttaccacaga ccaacgccga aatctgggct 378; ccattgcaaa aatgcttcag catgctgctt ccaataagat gtttctggga gataatgccc 384; acttaagcat cattaatgaa tatctttccc agtcctacca caga cggtttttcc 390; aaactgcttg tgatgtccca gagcttcagg ataaatttaa tgag tactctgatt 396; tagtaaccct acca gtaatctaca tttccattgg tgaaatcatc aacacccaca 402: ctctcctgtt ggatcaccag gatgccattg ctccggagca caatgatcca atccacgaac 408; tgctggacga cctcggcgag gtgcccacca tcgagtccct gataggggaa agctctggca 414; atttaaatga cccaaataag gaggcactgg ctaagacgga agtgtctctc accctgacca 420; acaagttcga cgtgcctgga gatgagaatg cagaaatgga tgctcgaacc atcttactga 426; aacg tttaattgtg gatgtcatcc ggttccagcc aggagagacc ttgactgaaa 432; tcctagaaac accagccacc agtgaacagg aagcagaaca agcc atgcagagac 438; tccg tgatgccaaa acacctgaca agatgaaaaa gtcaaaatct gtaaaggaag 444; acagcaacct cactcttcaa gagaagaaag agaagatcca ttta aagaagctaa 450; cagagcttgg ggac ccaaagaaca aataccagga actgatcaac gacattgcca 456; gggatattcg gaatcagcgg aggtaccgac agaggagaaa ggccgaacta gtgaaactgc 462; aacagacata Cgctgctctg aactctaagg ccacctttta tggggagcag gtggattact 468; ataaaagcta tatcaaaacc tgcttggata ccag caagggcaaa gtctccaaaa 474; agcctaggga aatgaaagga aagaaaagca aaaagatttc tctgaaatat acagcagcaa 480; gactacatga aaaaggagtt cttctggaaa ttgaggacct gcaagtgaat cagtttaaaa 486; atgttatatt tgaaatcagt ccaacagaag aagttggaga cttcgaagtg aaagccaaat 492; tcatgggagt tcaaatggag acttttatgt tacattatca ggacctgctg cagctacagt 498; atgaaggagt tgcagtcatg aaattatttg atagagctaa agtaaatgtc aacctcctga 504; tcttccttct aaag ttctacggga agtaattgat cgtttgctgc cagcccagaa 510: ggatgaagga aagaagcacc tcacagctcc tttctaggtc cttctttcct cattggaagc 5161 aaagacctag acag cacctcaatc tgatacactc ccgatgccac atttttaact 522; cctctcgctc gaca tttgttaccc ttttttcata ttgt gtttcaggct 528; tagtctgacc tttctggttt tttc ttccattact taggaaagag tggaaactcc 534; attt ctctgtgttg ttacagtctt agaggttgca tatt gtaagctttg 540; gtgtttgttt caat agggatggta ggattcaaat gtgtgtcatt tagaagtgga 546; tagc accaatgaca taaatacata caagacacac aactaaaatg tcatgttatt 552; aacagttatt aggttgtcat ttaaaaataa agttccttta tatttctgtc ccatcaggaa 558; aactgaagga tatggggaat cattggttat cttccattgt gtttttcttt atggacagga 564; gctaatggaa gtgacagtca tgttcaaagg aagcatttct agaaaaaagg agataatgtt 570; tttaaatttc aaac ttgggcaatt ctgtttgtgt aactccccga ctagtggatg 576; ggagagtccc attgctaaaa ttcagctact cagataaatt cagaatgggt caaggcacct 582; tttt gttggtgcac agagattgac ttgattcaga gagacaattc actccatccc 588; tatggcagag gaatgggtta gccctaatgt agaatgtcat taaa actgttttat 594; atcttaagag tgccttatta aagtatagat gtatgtctta aaatgtgggt gataggaatt 600; ttaaagattt atataatgca gcct tagaataaga aaagcttttt tgct 606; ttatctgtat actc ttgaaactta tagctaaaac actaggattt atctgcagtg 612; ttcagggaga taattctgcc tttaattgtc taaaacaaaa ccag ccaacctatg 618; ttacacgtga gattaaaacc aattttttcc ccattttttc tccttttttc tcttgctgcc 624; cacattgtgc ctttatttta ccag ttttctgggc ttagtttaaa aaaaaaatca 630; agtctaaaca ttag aaagcttttg ttcttggata aaaagtcata cactttaaaa 636; aaaaaaaaaa ctttttccag gaaaatatat tgaaatcatg ctgctgagcc tctattttct 642; ttctttgatg ttttgattca gtattctttt atcataaatt tttagcattt aaaaattcac 648; tgatgtacat aata aactgcttta atgaataaca aactatgtag tgtgtcccta 654; ttataaatgc attggagaag tatttttatg agactcttta ctcaggtgca tggttacagc 660; ccacagggag gcatggagtg aagg attcgccact acct tgttttttgt 666; tgtattttgg aagacaggtt ttttaaagaa acattttcct cagattaaaa gatgatgcta 672; ttacaactag cattgcctca aaaactggga aaag tgtgtcaacc ctgtttcctt 678; aaaagaggct atgaatccca aaggccacat ccaagacagg caataatgag cagagtttac 684; agctccttta ataaaatgtg tcagtaattt taaggtttat agttccctca acacaattgc 690; taatgcagaa tagtgtaaaa tgcgcttcaa gaatgttgat gatgatgata tagaattgtg 6961 gctttagtag cacagaggat gccccaacaa actcatggcg ttgaaaccac acagttctca 702; ttactgttat gctg tagcattctc tgtctcctct ctctcctcct ttgaccttct 708; cctcgaccag ccatcatgac atttaccatg aatttacttc ctcccaagag ctgc 714; ccgtcagatt gttgctgcac atagttgcct ttgtatctct gtatgaaata catt 7201 tgttcatgtt aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_003861.1 LOCUS 861 ACCESSION NP_003861 l msaadevdg; gvarphygsv ldnerltaee qnva yeylchleea krwmeaclge 6; dlpptte;ee ylak lgnffspkvv slkkiydreq trykatglhf rhtdnviqwl 12; g;pk ifypettdiy drknmprciy cihalslylf klglapqiqd lygkvdftee l8; einnmktele kygiqmpafs kiggilanel svdeaalhaa viaineaidr ripadtfaal 24; knpnamlvnl eeplastyqd ilyqakqdkm tnaknrtens ererdvyeel ltqaeiqgni ; nkvntfsala nidlalqud alalfralqs palglrgqu qnsdwylkql lsquqqus 36; gqtdplqkee lqsgvdaans aaqqurrla avalinaaiq kgvaektvle lmnpeaqlpq 42; vypfaadlyq kelathrqs pehnlthpel svavemlssv alinralesg dvntvwkq;s 48; ssvtgltnie eencqrylde lmklkaqaha ennefitwnd hvnl vvqeeheril 54; aiglineald egdaqktha lqipaakleg Vlaevaqhyq dtlirakrek aqeiqdesav 60; lwldeiqggi qunkdtqea qkfalgifai neavesgdvg ktlsalrspd vglygvipec 66; getyhsdlae akkkklavgd nnskkahwv kggyyyyhnl etqeggwdep pnqunsmql 72; sreeiqssis nreq lwlaneglit rlqarcrgyl vrqefrsrmn flkkqipait 78; ciqsqwrgyk qkkayqdrla ylrshkdevv kiqslarmhq arkryrdrlq yfrdhindii 84; kiqafirank arddyktlin aedppmvvvr kfvhlldqsd qdfqeeldlm kmreevit;i 90; ranqlendl nlmdikigll vknkithdv vshskkltkk nkeqlsdmmm inqugglka 96; lskekrekle athlfyllq tnptylakli fqmpanstk fmdsviftly nyaanreey ;02; lllrlfktal kvdq iqeivtgnpt vikmvvsfnr gargqnaqu ilapvvkeim ;08; ddkslniktd pvdiykswvn qmesqtgeas klpydvtpeq alaheevktr ldssirnmra ;14; vtdkflsaiv ssvdkipygm rfiakvlkds lhekfpdage dellkiignl lyyrymnpai '20; vapdafdiid lsaggqlttd qrrnlgsiak mlqhaasnkm flgdnahlsi dvpelqdkfn vdeysdlvtl tkpviyisig tlll ;32; ndpihelldd iesl igessgnlnd pnkealakte vsltltnkfd vpgdenaemd ;38; artillntkr livdvirfqp getlteilet eaeh qramqrrair sksvkedsnl thekkekiq tglkkltelg tvdpknkyqe lindiardir qrrk ;50; aelvquqty atfy gequyyksy iktcldnlas kpre ;56; lkytaarlhe iedl qvanknvif eispteevgd fevkakfmgv qmetfmlhyq 1621 dllqlqyegv avmklfdrak vnvnllifll k SFRSZ OfibblSymbokSRSF2 Official Name: serinefarginine-rich splicing factor 2 Gene ID: 6427 Organism: Homo sapiens Other Aliases: PR264, 80-35, 8035, SFRS2, SFRS2A, SRpSOb Other Designations: SR ng factor 2; splicing component, 35 kDa; ng factor 8035; splicing factor, arginine/serine-rich 2 Nucleotide seguence (variant 1): NCBI Reference ce: NM_003016.4 LOCUS 01 6 ACCESSION NM_003016 l agaaggtttc atttccgggt ggcgcgggcg ccattttgtg gata taaacgggcg 6; cagaggccgg ctgcccgccc agttgttact cgct agcctgcgga gcccgtccgt 12; gctgttctgc ggcaaggcct ttcccagtgt gcgg aaggcaactg cctgagaggc 18; gcggcgtcgc accgcccaga gctgaggaag ccggcgccag ttcgcggggc ccgc 24; gagc tatgagctac ggccgccccc ctcccgatgt ggagggtatg acctccctca ; aggtggacaa cctgacctac cgcacctcgc ccgacacgct gaggcgcgtc ttcgagaagt 36; acgggcgcgt cgtg ccgc gggaccgcta caccaaggag tcccgcggct 42; tcgccttcgt tcgctttcac gacaagcgcg acgctgagga cgctatggat gccatggacg 48; gggccgtgct ggacggccgc gagctgcggg tgcaaatggc gcgctacggc cgccccccgg 54; actcacacca cagccgccgg ggaccgccac cccgcaggta cgggggcggt ggctacggac 60; gccggagccg cagccctagg cggcgtcgcc gcagccgatc ccggagtcgg agccgttcca 66L ggtctcgcag ccgatctcgc tacagccgct cgaagtctcg gtcccgcact cgttctcgat 72L ctcggtcgac ctccaagtcc agatccgcac gaaggtccaa gtccaagtcc tcgtcggtct 78L ccagatctcg ttcgcggtcc aggtcccggt ctcggtccag gagtcctccc ccagtgtcca 84; agagggaatc caaatccagg tcgcgatcga agagtccccc caagtctcct gaagaggaag 90; gagcggtgtc ctcttaagaa aatggtaatg tctgggaatc cgagacacat aaccctaatt 96; cataaatggg atttggggta ggtctttttg gtta atgtaagaat gactcctatc L02; attaggagtg ctgctcggag gttactcacc tttgggagta atactgaaga tctg L08; cagaaaggat gtgtatgaag taat aatggctgtt tcgtaaactg tttgagacct L14; attaatgaaa atgactattt cttgctgttt ttatccaacg tctgcatttt ccccctttaa L20; agctgcggtc ttga taaaagaata ttggccagta ttgcagattt taactgattt L26; ggctgatcct ccagggacca gtttctgtgg attg gagcaggttt gtctttaaat L32; gttaaagatg cactatcctc ttagagaaac aatcagttca actattgttg L38; gacttcatat ggat gtggcaaaag aattgcaata agaagcagtg aacatttgga L44; accccaaaag aaagttacag gtattgcact gggtggggaa gtgt gtctttaact 150; cttaaattgt ttggtcctat tttttaaaaa ggaaagggcc ctaagtagct agtagtattc tcaattacca aatgtttcat ttgaaacaat ttatcttaat ccaattctct gatctcgagt tgtttttgtt tggatacagc cctttttttt ttcttttttt L68; ttcttcccct tacctttctt caccttggtt atttggccag gaatacgtaa gtacatgctg atggtagcct ttgtgaaatt ttcctaattg ggccttttaa aaacatggct L80; aaca tttctgtacc ctactggttt gaccagagcc ttagtaagta cgtgcctgaa L86; actgaaacca cttt aggt aagctgaact tctttctttt caaacctaga L92; tgtatcggca agcagtgtaa acggaggact tggggaaaaa ggaccacata gtccatcgaa L98; gaagagtcct tggaacaagc ctat tgaaaaggtt attttgtaac ctaa 204; ctttttactt gtttaagctt tgcctcagtt ggcaaacttc attttatgtg ccattttgtt 210; gctgttattc aaatttcttg taatttagtg acga cttcagattt cattattgga 216; tttggatatt tgaggtaaaa tttcattttg ttatatagtg ctgacttttt ttgtttgaaa 222; ttaaacagat tggtaaccta atttgtggcc tcctgacttt taaggaaaac gtgtgcagcc 228; attacacaca gcctaaagct gtcaagagat tgactcggca ttgccttcat tccttaaaat 234; taaaaaccta caaaagttgg tttg tatatgttat ttaccttcag atctaaatgg 240; taatctgaac ccaaatttgt ataaagactt ttcaggtgaa aagacttgat tttttgaaag 2461 gattgtttat caaacacaat ctct tctcttatgt atttttgtgc actaggcgca 252; gttgtgtagc agttgagtaa tgctggttag ctgttaaggt ggcgtgttgc gagt 258; gcttggctgt ttcctgtttt ctcccgattg ctcctgtgta aagatgcctt gtcgtgcaga 264; aacaaatggc tgtccagttt attaaaatgc ctgacaactg cacttccagt cacccgggcc 270; ttgcatataa ataacggagc atacagtgag cacatctagc tgatgataaa tacacctttt 2761 tttccctctt ccccctaaaa atggtaaatc tgatcatatc tacatgtatg aacttaacat 2821 ggaaaatgtt aaggaagcaa atggttgtaa ctttgtaagt acttataaca tggtgtatct 2881 ttttgcttat gaatattctg tattataacc attgtttctg tagtttaatt aaaacatttt 2941 cttggtgtta gcttttctca gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3001 aaa Protein seguence (variant 1): NCBI Reference Seguence: NP_003007.2 LOCUS NP_003007 ACCESSION NP_003007 1 msygrpppdv kvdn ltyrtspdtl rrvfekygrv gdvyiprdry tkesrgfafv 61 rfhdkrdaed gavl dgrelrvqma rygrppdshh srrgppprry rrsr 121 sprrrrrsrs rsrsrsrsrs rsrysrsksr srtrsrsrst sksrsarrsk srsr 181 srsrsrsrsr spppvskres ksrsrskspp kspeeegavs s GOLGA3 Official Symbol: GOLGA3 Official Name: golgin A3 m: 2802 Organism: Homo sapiens Other Aliases: GCP170, MEA-2 Other Designations: Golgi membrane associated protein; Golgi peripheral membrane protein; Golgin subfamily A member 3; 10 protein; golgi autoantigen, golgin subfamily a, 3; golgi complex-associated n of 170 kDa; golgin-160; golgin-165; male enhanced antigen-2 Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_005895.3 LOCUS NM_005895 ION NM_005895 1 ggcg cgtccctgca gcgtggcggg acggccccgt tccagtcacc cccgcctcgc 6; tgcggtggcc tcgggcctgg gcgcccgcct tcagctgcgg cggagctggc tctgtaaatg 12; ccggtgcccg cgagccctcc tgaatgcttg tctgcgcccg acgagcgcgg cctgtcccga 18; agctgtccac cact cgggcagtgc tagc ctggcccttt gcagctgaaa 24; ggcgtgacat ggtgtcgggt ggttcgtggg aagtcggggt ttcaggagtc cgtgtacttc ; cttgtttgtc tttgtcgctg gccgacttgt cttcattcca ggtggccaga gcgagtgggg 36; ccgggcgttg tcacgggtat catgatatta gctggtttga catcaagtca tttgtgagtc 42L atcagatctt aaaa tgggagacac gccc ctcccaggag ctcttggctg 48L ttgctgatgg cagaagccaa gcttgtccaa ggttcacttg tagcccctca gcgtcagctc 54L agctggtgtc gtcctgacca tggacggcgc gtcggccgag caagatggcc tccaggagga 60; ccac agtggcccct cgtctctccc cgaggcccca ctgaagcccc cgggcccact 66; ggtgccacct gaccagcagg acaaagtcca gtgtgccgag gtaaacagag catccacgga 72; aggggaaagc ccggatggac ctggccaggg aggcctctgt cagaacgggc caacgccacc 78; cttcccagac cctccgtcgt ctctcgatcc caccacaagc ccagtgggcc ctgatgcctc 84; tgtg gctggtttcc atgacaacct aaggaagtct cagggaacta gtgctgaggg 90; cagtgttaga gctt tgcagtctct cagactcagt cttcctatgc aagaaacgca 96; actgtgctct acagattctc ccctgcccct ggagaaggag gagcaggtcc gacttcaggc L02; tcggaagtgg ctggaagagc agctcaaaca gtacagggtg aagcgccagc aggagaggtc L08; cagtcaacct gcaaccaaaa cgagactttt tagcacgctt gatcctgagc tcatgttaaa L14; aaac ttaccaaggg ccagtaccct ggctatgaca aaagaatatt ccttcctgcg L20; caccagtgtc cctcgggggc ctaaggtggg cagcctgggg cttccggcac atcctaggga L26; gaaaaaaact tccaaatcaa gcaaaatccg gtctctggcc gattacagaa ctgaagattc L32; aaatgcgggg aattctgggg gaaatgtccc ggctcccgat tctaccaagg gttccctgaa L38; caga agcagtgcgg cgtccgttgt gtctgagatc tccc ctga L44; cgaccgtctg gagaacacct ctgg agacagcgtg tctgaggtgg atga L50: cagcgacagc tcatcgtaca gcagcgcctc cacccgaggg acctatggca cgaa L561 gacagtgqgc acgcaggaca atat ggtcaacggc caggagattc cctgggccag ttcccctcca ttaaggacgt cctccaggcc gcagccgctg agcaccaaga L68; gcag gaggtcaacg tgcg gagtcggaga gacagcatct gcagcagcgt L74; gtccttggag agctctgcag cagaaacaca ggaggagatg ctgcaggtgc aatgcgactc gaaggacagc tggaagcctt gtcactggag gcgagtcagg cacttaaaga 186; gaaggctgag ctgcaggccc agctggccgc cctcagcacg aagctgcagg cgcaggtgga 192; gtgcagccac agcagccagc agcggcagga ttcgctgagc tcggaggtgg acaccctgaa 198; gtgc tgggacctgg agcgagccat gactgacctg cagaacatgc tggaggcaaa 204; aaatgccagc ctggcgtcgt ccaacaacga cttgcaggtg gccgaggagc agtaccagag 210; gcttatggcc aaggtagagg acatgcagag gagcatgctc agcaaggaca acacagtgca 216; cgacctgcga cagcagatga cagccttgca gagccagctt cagcaggtgc agctggagcg 222i gacgacgctg accagcaagc tgaaggcgtc gcaggcggag atctcgtccc tgcagagtgt 228; ccggcagtgg cagc ccct ggcacaggag gcccgcgtca ggctgcaggg 234; tgagatgqcc cacatccagg agat ggca ctgg agcacctgaa 240; actcgagaat ctgt cccagcagct gacggaaact aggt ccatgaagga 246; gaaggggcgc atcgcggcac agctgcaggg cattgaggct ttgg atcaggaagc 252; agccttcatg cagattcagg aggcaaagac gatggtggag gaggaccttc agaggaggct 258; ggaagagttt gaaggtgaga ggct gcagaggatg gcggactcgg cggcatccct 264; ggagcagcag ctggagcagg tgaagttgac tttactccag cgagaccagc agcttgaggc 270; tttgcagcag gagcacctgg acctgatgaa acagctcacc ttgactcagg aggctctgca 276; gagcagggag cagtccctcg atgccctgca gacacactac gatgagctgc aggccaggct 282; gggggagctg cagggcgagg ccgcctccag ggaggacacg atctgcctcc tgcagaacga 288; gaagatcatc ttggaggcgg ctttgcaggc gagt ggcaaggagg agcttgacag 294; aggagcaaga Cgcttggaag aaggtaccga ggaaacgtcg gaaactttag agaagttaag 300; agaagaatta gctatcaaat ccggccaggt ggaacacctg gaga ctgctgctct 306; gaaaaagcaa atgcaaaaaa taaaggaaca gtttctccaa caaaaggtga tggtggaggc 312; ctaccggcgc gacgccacct acca gctcatcagt gagctgaaag ccaccaggaa 318; gaggctggac tcggagctga aggagctgcg gcaggagctg atgcaagtgc acggggagaa 324; gcggactgcc gaggcggagc tctcgcgcct gcacagagag gtggcccagg tccgtcagca 330; catggcgqac cttgaagggc agtc ggcgcagaag gagcgagacg agatggaaac 3361 acacttgcag tcgttgcagt tcgataagga gcagatggtc gcggtcacag aggccaatga 342; ggcgctgaag aaacaaatcg aagagttgca gcaagaggcc cggaaggcca tcacggaaca 348; gaagcagaag atgaggcggc caga cttgaccagc gcccagaagg agatgaagac 354; caaacataag gaga acgccgtggg catcctcagc cgccgcctgc aggaggccct 360; cgcggccaag gaggctgcgg acgcggagct gggccagctc cgagcccagg gtggcagcag 366; tgacagcagc ctggctctac ggat ccaggccctg gaggcggagc tgcaggctgt 372; cagtcatagc aagacgctgc tggaaaagga actgcaggag gtcatagcgc tgaccagcca 378; ggagctggag gagtcccggg agaaggtgct ggagctggag gacgagcttc ccag 384; aggctttagg aagaagataa aacgccttga aaac aagaagttgg ctcttgaatt 390; agagcacgag aaagggaagc ttacgggcct cggtcagtcc aacgcagctc tgcgggaaca 396; caacagcatc ctagaaacag ctttggccaa gagggaggca gacctagtcc agttgaacct 402; gcag gcagttttgc agcgcaaaga ggat cgccagatga agcatcttgt 408; ccaggccctg caggcctcac tagagaagga gaaggagaag gtgaacagcc tcaaggagca 414; ggtggctgct gccaaggtgg aagccgggca taaccgccgc cacttcaagg cggcctcctt 420; gagt aaga aggagctgca ggccaaggaa gtgc agaagctgca 426; ggccgaggcc gacgaccttc agattcggga acat tcccaggaga tagcacagtt 432; ccaagcagag ctggccgagg cccgggcaca gctccagctc ctgcagaagc agctggacga 438; gcagctcagc aaacagcccg tgggaaacca agagatggaa aatctcaaat gggaggtgga 444; tcagaaagaa agagaaatcc agtccttgaa gctg gacttgacgg agcagcaggg 450; caggaaggaa ctggaagggc tacagcagct gctgcagaac gtcaagtctg agttggagat 456; ggcccaggaa gacctgtcca tgacccagaa ggataaattt atgctccagg caaaagtgtc 462; ggagctgaag aacaacatga agaccctgct ccagcagaac ctca agctggacct 468; acgccgcggc gcggccaaga cgagaaagga gccgaaaggc gaggccagct cttccaaccc 474; tgccacgccc atcaagatcc ngactgccc agttcccgcc tcgctgctgg aggagctgct 480; gagaccaccg cccgccgtga gcaaggagcc cctcaagaac ctgaacagct gcctccagca 486; gctcaagcag gagatggaca gcctgcagcg ccagatggag gagcacgccc tgacggtgca 492; tctg tcctcgtgga cgccgctgga gccagccact gccagccctg tgcccccggg 498; gggtcacgcc ggcccacgcg gcgacccaca gagacacagt cagagcaggg cttccaaaga 504; agggccggga gagtgactgc tgtggactcg cctccgtgcg ccgctgcccc agaaggctct 510; tgtt atttatttga ttgtgtggtc gatgtttttc taagacatga aatttaagtt 5161 tgcc tttaacaaga agtaaaatat atagcagaat gagagccaag gactagaaaa 522; gaag atcacaatta gcttttcaca tggaatgacc aactcttaaa agcctgatag 528; gctctcgch aggagctttg aacgtgtctg tact tgtaggtcgt tgag 534; cggccaccga tgctgctctc tgcgggtgac agggagaggc tgcgtaactg ggagcagctg 540; tgtgacaggg tctgcggcac cgcgcctggc caggccggct tctc acttccctgt 546; tccattcagt aagagcttta cgca aaat tttatctgta cctttggctt 552; tttacttgtt tttttggata gccatcccac cataggatgt gtacatagat actgaatatc 558; ataatccaat ctttgttttt tttttttttt tttttttgag acagagtctc gctttgttgc 564; ccaggctgga gtgcagtggc acactctccg ctcactgcaa gctccgcctc ccaggttcat 570; ctcc tgcctcagcc tctcgagtag ctgggattac aggcgtgcgc cactatgcca 576; ggctaatgtt tgtattttta gtagcaatgg ggtttcacca tgttggccag gatggtctcg 582; atctcctgac ctcaagtgat ctgcccatct cagcctccca aagtgctggg attacaggcg 588; tgagcccctg cgcccggcct gtcacccagt ctttaagaag catatgctca tgttattgaa 594; gaagaaccta cttattattg attgcctttt gaaaatttgt tgggaataat ttacctgcag 600; gatttagqga tagtcagaaa agaa atataattat tttatttacc ttctaaagcc 606; aaatattctt acacagaaag gtcctctgtt gttctggttt tactttgttg ctgaggatct 612; ttccttcctg ctggtctctt cctctcaggc cactggccct ttcc accgtggctg 618; gccactggga aggggcagct cttg cctg acggccatca ggaggcacaa 624; ggacactgag gccccatatc tgatctgacc tttggggggg cacagggaga ggccggtgga 630; ggaggaggag gagagcagac caggggctcc ctgcagcgac tcccgcggtt tcccctggag 636; tcagccaggt gtaggtcgca ggcggtaaca aacctcacac tcctgttccc caagtgaaaa 642; tctttaccat tgtctgtggg agcgcctgta ctcgtgtgta ggagcacctg tacttctgca 648; gtcatcgaga agtcctggat tggt tacaccagca ggca agcagaggcg 654; acttccggaa gagacaggca ggcaccgtga ggaaggtggc tgtgctctcc tctc 660; agagacagat gccttattta aaatcagcac tgtg agatcttctg tttcctaccc 666; caaatcctga gcag acactggctg actgggagag gtggggtctg taagttgtcc 672; cctagtttgc taagaaaatc taaaataata tttattatat gagttaggag agagagaatg 678; ggtccgcgtg gcctcctctg cagatgtact ggtctgaaat gaggttctga gtcactggcc 684; aggccagatg tgctcatgtc ggtgtctggt gtctgttttg aaac agtatggtgt 690; gttttaagct atttgtgttc tgttgtaata tacttttaga aggttaattg gtaaggttaa 6961 atta accacaaaga tgtttggtat ttaaaaaata ttctctagca ggaa 702; aaat atatcatttg tacagggtta aaat aatacttgaa aatttcatta 708; taaatatatc ctacttttta tcttaagttg aagatgttat ttactaaatt gttcttgtac 714; cattagaaaa aaaaatacgg caatttacgt tcttatttat tttggctgta ctaccccttt 720; gttttaattt taaaatcaag aaatcgggcc ggtg gctcatgcct gtaatcccag 726; cactttggga ggccgaggcg ggtggatcac ctgaggtcaa gaggtccaga tggc 732; caacatggca cgtc tttactaaac atacaaaaat tagccgggtg tcgtagtgcg 738; cacctataat cccagctact tgggaggctg aggcaggaga atcacttgaa cccaggaggc 744; ggagcttgca gtgagccgag atcgcgccac tgccctccag cctgggcaac agagcgagac 750; tccgtctcaa aaataaataa atgattttaa aaaatctaaa atcgagaaat cacacattca 756; gtggggagcg acttctcctt gcttatggga agtcctcaag tgagtgatgt tcaccatgta 762: tttttttttc tcttaggaca gactaattct gaaaataccg aaggaaaagt atgt 768; tctcaccccg gttttcctgc gtgtgtgccc ttgggtgcga tgcctccccc agcgctctgt 774; ggtcgccggt gccagggccc cctctggttt ggcagggcct ggctgccttt gctccctgca 780; cttt tggtgttttc atgcacggct tgtgcttctg gatctgaggc ctctcgtgtt 786; cacgcggaca cttccttcct taagaagacg agag gaagttggaa tttttttttt 792; ttttttttga gacagagtct cgctctgtcg ctgg agtgcagtgg cgtgatctct 798; gctcactgca agctccgcca tctgggttca agcgattctc catt ctccccagta 804; atta caggtgcccg cacc agcctaattt ttgtattttt gtgg 810; agttccacca ccag gctggtcttg aactcttgac ctcaggtgat cctgagcctc 816; agcctcccaa agtgctggaa ttataggcgt gaaccaccgc ccccggctgc attt 822; ttaaattgct tttttttatt gttgaggttt ttttatctcc aagggactct cccggcactt 828; ctaccttcca gagttacttc taaa gtttgaatta ttttgttctt gtgggcagaa 834; gtgggaatga tggaatatcc tcacggaaaa ggcagtgaag ttgggagtac tgcttacaaa 840; acagggtcac cagtgcatta tgtggcgtgt tcatccccac gccgtgtgtc acgggctagg 846; tgtt catccccaca ccgtgtgtca caacaggcta gggcacttca cgatgtcact 852; acttgttttt ctgatgttcc aaaaacaacg taacttggtt ttcatgtgtt tggt 858; atatgtgaga ttgatgctac gggtcttacg cacc cgttcccact ctctgcaata 864; tggatcaggc agtgtttctg ataggatgtg aaatggactc tcctcgggtg gcag 870; gggccctgcc caccagaaca cagtccgtgc tgtgctgcgc gctg gccctcaact 876; ctccttgqtg cagggttccc acaaccgagt tctagttccc tgaggtcttt aaaaacaaaa 882; acagaatgtt gtacgtgaag attctaggag gggagggacc agcaaatctg agagaaccgt 888; cctggggcct cccttcgagg agccctctga tgtgaggagg gacttgagtt gagtgacgct 894; gtggtgtgag gtgttctgag ctcactgacc ggaaggtcca ggtgaatctc gtcataagtg 900; atctcaggct ctcacaggat ccggagggaa atgtgttaga gggtctggaa aattcagtgc 9061 ttttgagtta cttgttttta ttaaaaattt cctcacaaaa cctc aagttgtggc 9121 tgttcttggg aaaggggtca ctga caaagtgtaa ctttaaaaag cacgttgatt 9181 ttttacaaat gtaagtgtgc ttgggaattc cttaaatttt gtgcaataaa ctattttttg 9241 gtaaagattt tc Pnneh1seguence(vafiant1y NCBI Reference Seguence: 886.2 LOCUS NP_005886 ACCESSION NP_005886 l mdgasaeng lqedrshsgp sslpeaplkp pgplvppdqq dkvqcaevnr astegespdg 6— pgqgglcqng ptppfpdpps sldpttsmg pdaspgvagf sqgt saegsvrkea 12; lqslrlslpm qetqlcstds plplekeeqv rlqarkwlee qlqurvqu qerssqpatk 181 trlfstldpe lmlnpenlpr astlamtkey sflrtsvprg pkvgslglpa hprekktsks 24; skirsladyr tedsnagnsg gnvpapdstk gslkanssa asvvseisls pdtddrlent ; slagdsvsev dgndsdsssy ssastrgtyg ilsktvgtqd tpymvngqei padtlgqus 36; aaaa ehqdngevn gevrsrrdsi cssvslessa aetqeemlqv 1kekmr1egq 42; lealsleasq alkekaelqa qlaalstqu aqvecshssq qrqulssev dtlkqscwdl 48; eramtdlqnm leaknaslas snndlqvaee kve qursmlskd 54; talqsqquv qlerttltsk 1kasqaeiss lqsvrquqq qlalaqearv rlqgemahiq 60; vgqmtqagll ehlklenvsl sqqltetth smkekgriaa qlqgieadml dqeaafmqiq 66; eaktmveedl qrrleefege rerlqrmads aasleqqleq vkltllqrdq qleaquehl 72; tltq ealqsreqsl dalqthydel qarlgelqge aasredticl lqnekiilea 78; alqaaksgke eldrgarrle egteetsetl eklreelaik sgqvehque taalkkqmqk 84; ikeqfqukv mveayrrdat skdqliselk atrkrldsel mqv hgekrtaeae 90; lsrlhrevaq dleg hlqsaqkerd emethlqslq fdkeqmvavt eaneakaqi 96L eequearka mrr lgsdltsaqk emktkhkaye navgilsrrl ggssdsslal heriqaleae lqavshsktl 1eke1qevia qesrgfrkki nkkl alelehekgk 1tg1gqsnaa qlnlqvqavl queeedrqm khlvqalqas 1ekekekvns sevk kelqakehlv qquaeaddl qiregkhsqe 1261 araqlqllqk qldeqlskqp Vgnqemenlk rei qslquldlt eqqgrkeleg 1321 qullqnvks elemaqedls mtqkdkfmlq akvselknnm qql kldlrrgaak 1381 trkepkgeas ssnpatpiki pdcpvpasll eellrpppav skeplknlns cqulkqemd 1441 slqrqmeeha 1tvheslssw tplepatasp vppgghagpr gdpqrhsqsr askegpge PH4B Official Symbol: P4HB Official Name: prolyl 4-hydroxylase, beta polypeptide Gene ID: 5034 Organism: Homo sapiens Other s: DSI, ERBAZL, GIT, P4Hbeta, PDI, PDIA1, PHDB, PO4DB, PO4HB, PROHB Other Designations: cellular thyroid hormone—binding protein; collagen prolyl 4- hydroxylase beta; glutathione-insulin transhydrogenase; p55; procollagen- proline, 2—oxoglutarate 4-dioxygenase ne oxylase), beta polypeptide; prolyl 4-hydroxylase subunit beta; protein disulfide isomerase family A, member 1; protein disulfide ase-associated 1; protein disulfide isomerase/oxidoreductase; protein disulfide-isomerase; protocollagen hydroxylase; thyroid hormone-binding protein p55 Nucleotide seguence: NCBI Reference Seguence: NM_000918.3 LOCUS NM_00091 8 ACCESSION NM_000918 1 gagcctcgaa gtccgccggc caatcgaagg cgggccccag cggcgcgtgc cggc 6" cagcgcgcgc gggcgggggg gcaggcgcgc ccca ggatttataa aggcgaggcc 12; gggaccggcg cgcgctctcg tcgcccccgc tgtcccggcg gcgccaaccg aagcgccccg 18; cctgatccgt catg ctgcgccgcg ctctgctgtg cctggccgtg gccgccctgg 24- tgcgcgccga cgcccccgag gaggaggacc acgtcctggt gctgcggaaa agcaacttcg ; cggaggcgct ggcggcccac aagtacctgc tggtggagtt ctatgcccct tggtgtggcc 36L actgcaaggc ccct gagtatgcca ctgg gaagctgaag gcagaaggtt 42; tcag gttggccaag gcca cggaggagtc tgacctggcc cagcagtacg 481 gcgtgcgcgg ctatcccacc atcaagttct tcaggaatgg ggct tcccccaagg 541 aatatacagc tggcagagag gctgatgaca tcgtgaactg gctgaagaag cgcacgggcc 60; ccac caccctgcct gacggcgcag ctgcagagtc ggag tccagcgagg 66; tggctgtcat cggcttcttc gtgg actc tgccaagcag cagg 72; cagcagaggc catcgatgac ataccatttg ggatcacttc caacagtgac gtgttctcca 78; aataccagct cgacaaagat gtcc tctttaagaa gtttgatgaa ggccggaaca 84; actttgaagg ggaggtcacc aaggagaacc tgctggactt tatcaaacac aaccagctgc 90; cccttgtcat cgagttcacc gagcagacag ccccgaagat ttttggaggt gaaatcaaga 96L tcct gctgttcttg cccaagagtg acta tgacggcaaa ctgagcaact L02; tcaaaacagc agccgagagc ttcaagggca agatcctgtt catcttcatc gacagcgacc L08; acaccgacaa catc ctcgagttct ttggcctgaa gaaggaagag L14; tgcgcctcat caccctggag gaggagatga ccaagtacaa gcccgaatcg gaggagctga L20; agag gatcacagag ttctgccacc gcttcctgga gggcaaaatc aagccccacc L26; tgatgagcca ggagctgccg gaggactggg acaagcagcc tgtcaaggtg cttgttggga L32; agaactttga agacgtggct tttgatgaga aaaaaaacgt ctttgtggag ttctatgccc L38; catggtgtgg tcactgcaaa cagttggctc ccatttggga taaactggga gagacgtaca L44; aggaccatga gaacatcgtc atcgccaaga tggactcgac tgccaacgag gtggaggccg L50; tcaaagtgca cagcttcccc acactcaagt tctttcctgc cagtgccgac aggacggtca L56; ttgattacaa cggggaacgc acgctggatg agaa attcctggag agcggtggcc L62; aggatggggc aggggatgat gacgatctcg aggacctgga agaagcagag gagccagaca L68; tggaggaaga Cgatgatcag aaagctgtga aagatgaact gtaatacgca aagccagacc L74; nggcgctgc cgagacccct nggggctgc acacccagca gcagcgcacg cctccgaagc L80; cctc gcttgaagga gggcgtcgcc ggaaacccag ggaacctctc tgaagtgaca L86; cctcacccct acacaccgtc cgttcacccc cgtctcttcc ttctgctttt cggtttttgg L92; aaagggatcc atctccaggc agcccaccct gctt gtttcctgaa accatgatgt L98; actttttcat acatgagtct gtccagagtg accg tgttcggagt ctcgctgcct 204: cgcg ggaggtttct cctctttttg aaaattccgt gatt tttagacatt 2101 tttcgacatc agggtatttg ttccaccttg gccaggcctc ctcggagaag cttgtccccc 2161 qtqtqqqagq gacggagccg gactggacat ggtcactcag taccgcctgc agtgtcgcca 222; tgactgatca tggctcttgc atttttgggt aaatggagac ttccggatcc tgtcagggtg 228; tcccccatgc ctggaagagg agctggtggc tgccagccct ggggcccggc acaggcctgg 234; gccttcccct tccctcaagc cagggctcct gtcg tgggctcatt gtgaccactg 2401 ctac agcacggcct gtggcctgtt caaggcagaa ccacgaccct ccgg 2461 gtggggaggt ggccaaggat gctggagctg aatcagacgc tgacagttct tcaggcattt 2521 ctatttcaca atcgaattga acacattggc caaataaagt tgaaatttta ccacctgtaa 2581 aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: 909.2 LOCUS N P_000909 ACCESSION NP_000909 l mlrrallcla vaalvradap eeedhvlvlr ksnfaealaa hkyllvefya pwcghckala 6; peyakaagkl kaegseirla kvdateesdl aqqygvrgyp tikffrngdt tagr 12; eaddivnwlk krtgpaattl pdgaaaeslv essevavigf fkdvesdsak eaid l8; dipfgitsns dvfskyqldk dgvvlfkkfd egrnnfegev tkenlldfik hnqlplvief 24; teqtapkifg illf lpksvsdydg klsnfktaae sfkgkilfif idsdhtdnqr ; ileffglkke ecpavrlitl eeemtkykpe seeltaerit efchrflegk ikphlmsqel 36; pedwqupvk vlvgknfedv afdekknva efyapwcghc wdkl getykdheni 42; viakmdstan eveavkvhsf ptlkffpasa drtvidynge kkfl esgqugagd 48; dddledleea eepdmeeddd qkavkdel Official Symbol: HSPA1 A Official Name: heat shock 70kDa protein 1A Gene ID: 3303 Organism: Homo sapiens Other Aliases: DAQB-147D11.1, HSP70-1, HSP70-1A, HSP7OI, HSP72, HSPA1 Other Designations: HSP70—1/HSP70-2; 1/HSP70.2; dnaK-type molecular one HSP70—1 ; heat shock 70 kDa protein 1/2; heat shock 70 kDa protein 1A/1 B; heat shock 70kD protein 1A; heat shock-induced protein Nucleotide ce: NCBI Reference Seguence: NM_005345.5 LOCUS NM_005345 ACCESSION NM_005345 l ataaaagccc aggggcaagc ggtccggata acggctagcc gctg ctgcgacagt 6L ccactacctt tttcgagagt gactcccgtt gtcccaaggc ttcccagagc gaacctgtgc 12L ggctgcaggc accggcgcgt cgagtttccg gcgtccggaa ggaccgagct cttctcgcgg 18; atccagtgtt ccgtttccag cccccaatct cagagcggag ccgacagaga aacc 24; ggcatggcca aagccgcggc gatcggcatc gacctgggca ccacctactc ctgcgtgggg ; gtgttccaac acggcaaggt ggagatcatc gccaacgacc agggcaaccg caccaccccc 36; gtgg ccttcacgga caccgagcgg gggg atgcggccaa ggtg 42; gcgctgaacc cgcagaacac cgtgtttgac gcgaagcggc tgattggccg caagttcggc 48; gacccggtgg tgcagtcgga catgaagcac tggcctttcc tcaa cgacggagac 54; aagcccaagg tgcaggtgag ctacaagggg gagaccaagg cattctaccc cgaggagatc 60; tcgtccatgg tgctgaccaa gatgaaggag atcgccgagg cgtacctggg ctacccggtg 66; accaacgcgg tgatcaccgt gccggcctac ttcaacgact gcca ggccaccaag 72; gatgcgggtg tgatcgcggg gctcaacgtg ctgcggatca tcaacgagcc cacggccgcc 78; gccatcgcct acggcctgga cagaacgggc aagggggagc gcaacgtgct catctttgac 84; ctgggcgggg gcaccttcga catc ctgacgatcg gcat cttcgaggtg 90; aaggccacgg ccggggacac ccacctgggt ggggaggact ttgacaacag gctggtgaac 96; cacttcgtgg aggagttcaa gagaaaacac aagaaggaca tcagccagaa caagcgagcc L02; cggc tgcgcaccgc ctgcgagagg agga ccctgtcgtc cagcacccag L08: gccagcctgg agatcgactc cctgtttgag ggcatcgact tctacacgtc aggagctgtg ctccgacctg ttccgaagca ccctggagcc acgccaagct ggacaaggcc cagattcacg acctggtcct L26; tccacccgca aggt gcagaagctg ctgcaggact tcttcaacgg gcgcgacctg L32; aacaagagca tcaaccccga cgaggctgtg gcctacgggg cggcggtgca ggcggccatc L38; ctgatggggg acaagtccga gaacgtgcag gacctgctgc tgctggacgt ggctcccctg ;44; tcgctggggc tggagacggc ngaggcgtg atgactgccc tgatcaagcg cacc ;50; atccccacca cgca gatcttcacc acctactccg acaaccaacc cggggtgctg ;56; atccaggtgt acgagggcga gagggccatg acgaaagaca acaatctgtt ggggcgcttc ;62; agcg gcatccctcc cagg ggcgtgcccc agatcgaggt gaccttcgac ;68; atcgatgcca acggcatcct gaacgtcacg gccacggaca agagcaccgg caaggccaac ;74; aagatcacca tcaccaacga caagggccgc ctgagcaagg aggagatcga gcgcatggtg ;80; caggaggcgg agaagtacaa agcggaggac gaggtgcagc gcgagagggt aacgccctgg agtcctacgc cttcaacatg aagagcgccg tggaggatga ggggctcaag ;92; ggcaagatca gcgaggcgga caagaagaag gtgctggaca agtgtcaaga ggtcatctcg ;98; gacg ccaacacctt ggccgagaag gacgagtttg agcacaagag gaaggagctg 204; gagcaggtgt gtaaccccat catcagcgga ctgtaccagg gtgccggtgg tcccgggcct 210; gggggcttcg gggctcaggg ggga gggtctgggt caggccccac cattgaggag 216; gtagattagg ggcctttcca agattgctgt ttttgttttg caag actttgcatt 222; tcctagtatt tctgtttgtc agttctcaat ttcctgtgtt tgcaatgttg aaattttttg 228; gtgaagtact gaacttgctt tttttccggt ttctacatgc agagatgaat ttatactgcc 234; atcttacgac tatttcttct ttttaataca cttaactcag gccatttttt aagttggtta 240; cttcaaagta aataaacttt caaa aaaa aaaaa Protein ce: NCBI Reference Seguence: NP_005336.3 LOCUS 336 ACCESSION NP_005336 ; makaaaigid lgttyscvgv fqhgkveiia ndqgnrttps yvaftdterl igdaaknqva 6; lnpqntvfda krligrkfgd pvvqsdmkhw pfqvindgdk pkqusykge tkafypeeis 12; smvltkmkei aeaylgypvt navitvpayf ndsqrqatkd agviaglnvl taaa l8; iaygldrtgk gernvlifdl vsil tiddgifevk atagdthlgg edfdnrlvnh 24; fveefkrkhk kdisanrav rrlrtacera krtlssstqa sleidslfeg idfytsitra ; rfeelcsdlf rstlepveka lrdakldkaq vggs tripkvqkll qdffngrd;n 36; ksinpdeava ygaavqaail mgdksenqu lllldvapls lgletaggvm nsti 42; ptkqtqiftt ysdnquvli quegeramt kdnnllgrfe lsgippaprg quievtfdi 481 dangilnvta tdkstgkank ititndkgrl skeeierqu eaekykaede vqrervsakn 541 alesyafnmk savedeglkg kiseadkkkv ldkcqevisw ldantlaekd efehkrkele 601 qvcnpiisgl ngaggpgpg gfgaqukgg sgsgptieev d Gene Official Symbol: HNRNPD Official Name: heterogeneous nuclear ribonucleoprotein D (AU-rich t RNA binding protein 1, 37kDa) Gene ID: 3184 Organism: Homo sapiens Other Aliases: AUF1, AUF1A, Other ations: ARE-binding protein AUFI, type A; heterogeneous r ribonucleoprotein D0; hnRNP D0 Nucleotide seguence: ISOFORM D NCBI Reference Seguence: NM_001003810.1 LOCUS NM_001003810 ACCESSION NM_001003810 ; cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga gaga 6; gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12; gagt gggaggcgaa gggggcaggc gagg cgcaggagcc tttgcagcca 18; cgcgcgcgcc gtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 24; gggattactt tgctgctagt ttcggttcgc ggcg ggtgtagtct cggcggcagc ; ggcggagaca ctagcactat gtcggaggag ggcg gggacggggc ggcggcagcg 36- gcaacggcgg cggtaggcgg ctcggcgggc gagcaggagg gagccatggt gaca 42; cagggggcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48; ggcaccgaag ggggcagcgc cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54; gaggatgaag ggaaaatgtt tataggaggc cttagctggg acactacaaa gaaagatctg 60; aaggactact tttccaaatt tggtgaagtt gtagactgca ctctgaagtt agatcctatc 66; acagggcgat caaggggttt tggctttgtg aaag aatcggagag tgtagataag 72; gtcatggatc aaaaagaaca taaattgaat gggaaggtga ttgatcctaa aagggccaaa 78; aaaa agcc ggttaaaaaa atttttgttg gtggcctttc tccagataca 84; cctgaagaga aaataaggga gtactttggt ggttttggtg aggtggaatc catagagctc 90; gaca acaagaccaa gcgt gggttctgct ttattacctt taaggaagaa 96; gaaccagtga agaagataat ggaaaagaaa taccacaatg ttggtcttag taaatgtgaa L02; ataaaagtag ccatgtcgaa ggaacaatat cagcaacagc gggg atctagagga L08: ggatttgcag gaagagctcg tggaagaggt ggtgaccagc agagtggtta gtggtcatca aaatagctac aaaccatact aaattattcc aggtggtgaa gcagtatttt ccaatttgaa tttg ctgccacctg ctaatagcag ttcaaactaa attttttgta tcaagtccct atgacgttgg gtccctctga agtttaattc tgagttctca ttaaaagaaa ttca L38; ttgttttatt tcttaattgc tatgcttcag aatcaatttg atgc agtattgtag agcaagtctt gtgttaaaag cccagtgtga cagtgtcatg atgtagtagt L50; ctgg ttttttaata tttg tataaaaatg tattggctct tttatcatca L56; gaataggaaa aattgtcatg gattcaagtt attaaaagca taagtttgga agacaggctt L62; gccgaaattg aggacatgat taaaattgca gtgaagtttg aaatgttttt agcaaaatct L68; aatttttgcc ataatgtgtc ctccctgtcc aaattgggaa tgacttaatg tcaatttgtt L74; tgttggttgt tttaataata cttccttatg tagccattaa gatttatatg aatattttcc L80; caaatgccca gtttttgctt aatatgtatt gtgcttttta gaacaaatct ggataaatgt L86; gcaaaagtac ccctttgcac agatagttaa tgttttatgc ttccattaaa taaaaaggac L92; ttaaaatctg ttaattataa tagaaatgcg tcag agagattttt agagctgtgg L98; tggacttcat attc aagtgttgag ggaggattaa agaaatatat accgtgttta 204; tgtgtgtgtg ctt Protein seguence: ISOFORM D NCBI Reference Seguence: NP_001003810.1 LOCUS NP_001003810 ACCESSION NP_001003810 1 mseequgdg aaaaataavg egam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedegkm figglswdtt kkdlkdyfsk ctlk ldpitgrsrg 121 kese svdkvmque hklngkvidp krakamktke pvkkifvggl spdtpeekir 181 geve sielpmdnkt nkrrgfcfit fkeeepvkki mekkyhnvgl skceikvams 241 kequqqqu gsrggfagra rgrggdqqsg ygkvsrrggh qnsykpy Nucleotide ce: ISOFORM C NCBI Reference ce: NM_002138.3 LOCUS NM_0021 38 ACCESSION NM_002138 l cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6- gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12; gcgcgagagt gggaggcgaa gggggcaggc cagggagagg agcc tttgcagcca 18; cgcgcgcgcc ttccctgtct cttc gcgaggtaga gcgggcgcgc ggcagcggcg 24; gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc ; ggcggagaca ctagcactat ggag cagttcggcg gggacggggc ggcggcagcg 36; gcaacggcgg ngtaggcgg ctcggcgggc gagcaggagg gagccatggt ggcggcgaca 42; cagggggcag cggcggcggc gggaagcgga gccgggaccg gaac cgcgtctgga 48; gaag ggggcagcgc Cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54; gaggatgaag gccattcaaa ccca cgacactctg aagcagcgac ggcacagcgg 60; gaagaatgga aaatgtttat aggaggcctt agctgggaca ctacaaagaa agatctgaag 66; gactactttt ccaaatttgg tgaagttgta gactgcactc tgaagttaga tcctatcaca 72; gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat gtgt agataaggtc 78; atggatcaaa ataa attgaatggg aaggtgattg atcctaaaag ggccaaagcc 84; atgaaaacaa aagagccggt taaaaaaatt tttgttggtg gcctttctcc agatacacct 90; gaagagaaaa taagggagta ctttggtggt tttggtgagg tggaatccat cccc 96; atggacaaca agaccaataa gaggcgtggg ttctgcttta ttacctttaa ggaagaagaa 1021 aaga agataatgga aaagaaatac gttg gtcttagtaa atgtgaaata 1081 aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc tagaggagga 114; tttgcaggaa gagctcgtgg aagaggtggt gaccagcaga gtggttatgg gaaggtatcc 120; aggcgaggtg gtcatcaaaa tagctacaaa ccatactaaa ttattccatt tgcaacttat 126; ccccaacagg tggtgaagca gtattttcca atttgaagat tcatttgaag gtggctcctg 132; ccacctgcta atagcagttc aaactaaatt ttttgtatca agtccctgaa tggaagtatg 138; acgttgggtc aagt ttaattctga gttctcatta aaagaaattt gctttcattg 144; ttttatttct ctat gcttcagaat caatttgtgt tttatgccct ttcccccagt 150; attgtagagc aagtcttgtg ttaaaagccc agtgtgacag tgtcatgatg tagtagtgtc 156; ttactggttt tttaataaat ccttttgtat aaaaatgtat tggctctttt agaa 162; taggaaaaat ggat tcaagttatt aaaagcataa gtttggaaga caggcttgcc L681 gaaattgagg acatgattaa agtg aagtttgaaa tgtttttagc aaaatctaat L74; ttttgccata atgtgtcctc cctgtccaaa ttgggaatga cttaatgtca atttgtttgt L80; tggttgtttt aataatactt ccttatgtag ccattaagat ttatatgaat atgcccagtt tttgcttaat atgtattgtg agaa caaatctgga aaagtacccc tttgcacaga tagttaatgt tttatgcttc cattaaataa aaaggactta 198; aaatctgtta attataatag aaatgcggct agttcagaga gatttttaga gctgtggtgg 204; acttcataga tgaattcaag tgttgaggga ggattaaaga aatatatacc atgt 210; gctt Protein ce: ISOFORM C NCBI nce Seguence: NP_002129.2 LOCUS N P_002129 ACCESSION NP_002129 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedeghs nssprhseaa taqreewkmf igglswdttk fskf 121 gevvdctlkl dpitgrsrgf gfvlfkeses vdkvmqueh klngkvidpk rakamktkep 181 vkkifvggLs pdtpeekire yfggfgeves nktn krrgfcfitf keeepvkkim 241 ekkyhnngs kceikvamsk equqqqug srggfagrar grggdqqsgy gkvsrrgghq 301 nsykpy Nucleotide seguence: ISOFORM B NCBI Reference Seguence: NM_031369.2 LOCUS NM_031369 ACCESSION NM_031369 l cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 6; gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 12; gcgcgagagt gggaggcgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 18; cgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 24; gggattactt tgctgctagt ttcggttcgc ggcagcggcg gtct cagc ; ggcggagaca ctat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 36L gcaacggcgg cggtaggcgg ctcggcgggc gagg gagccatggt ggcggcgaca 42L gcag ngngngC gggaagcgga gccgggaccg ggggcggaac cgcgtctgga 48L ggcaccgaag ggggcagcgc cgagtcggag ggggcgaaga ttgacgccag taagaacgag 54L gaggatgaag tgtt aggc cttagctggg acactacaaa gaaagatctg 60; aaggactact tttccaaatt tggtgaagtt gtagactgca ctctgaagtt agatcctatc 66; acagggcgat gttt tggctttgtg aaag aatcggagag tgtagataag 72; gatc aaaaagaaca taaattgaat gggaaggtga ttgatcctaa aagggccaaa 78; gccatgaaaa caaaagagcc ggttaaaaaa atttttgttg gtggcctttc tccagataca 84; cctgaagaga aaataaggga gtactttggt ggttttggtg aggtggaatc gctc 90; cccatggaca acaagaccaa taagaggcgt gggttctgct ttattacctt taaggaagaa 96; gaaccagtga agaagataat ggaaaagaaa taccacaatg ttggtcttag taaatgtgaa L02; ataaaagtag ccatgtcgaa ggaacaatat cagcaacagc aacagtgggg atctagagga L08; ggatttgcag gaagagctcg tggaagaggt ggtggcccca gtcaaaactg gaaccaggga L14; tatagtaact attggaatca aggctatggc aactatggat ataacagcca aggttacggt L20; ggttatggag acta cactggttac tact atggatatgg tgattatagc L26; caga gtggttatgg gaaggtatcc aggcgaggtg gtcatcaaaa tagctacaaa L32; ccatactaaa ttattccatt tgcaacttat ccccaacagg tggtgaagca gtattttcca L38; atttgaagat tcatttgaag gtggctcctg ccacctgcta atagcagttc L44; ttttgtatca agtccctgaa tggaagtatg acgttgggtc cctctgaagt ttaattctga L501 gttctcatta aaagaaattt gctttcattg ttttatttct taattgctat caatttgtgt tttatgccct ttcccccagt attgtagagc aagtcttgtg agtgtgacag tgtcatgatg tagtagtgtc ttactggttt tttaataaat ccttttgtat L68; gtat tggctctttt atcatcagaa taggaaaaat tgtcatggat tcaagttatt L74; aaaagcataa gtttggaaga caggcttgcc gaaattgagg acatgattaa aattgcagtg 180; aagtttgaaa tgtttttagc aaaatctaat ttttgccata atgtgtcctc cctgtccaaa 186; ttgggaatga gtca atttgtttgt tggttgtttt aataatactt ccttatgtag 192; agat ttatatgaat attttcccaa atgcccagtt tttgcttaat atgtattgtg 198; ctttttagaa caaatctgga taaatgtgca aaagtacccc tttgcacaga tagttaatgt 204; cttc cattaaataa aaaggactta aaatctgtta attataatag ggct 210; agttcagaga gatttttaga gctgtggtgg acttcataga tgaattcaag tgttgaggga 2161 ggattaaaga tacc gtgtttatgt gtgtgtgctt Protein seguence: ISOFORM B NCBI nce Seguence: NP_112737.1 LOCUS NP_1 12737 ACCESSION NP_112737 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedegkm figglswdtt kkdlkdyfsk fgevvdctlk ldpitgrsrg 121 fgfvlfkese svdkvmque hklngkvidp krakamktke pvkkifvggl spdtpeekir 181 eyfggfgeve sielpmdnkt nkrrgfcfit fkeeepvkki mekkyhnvgl skceikvams 241 qu gsrggfagra rgrgggpsqn wnqusnywn qugnygyns quggyggyd 301 ytgynnyygy gdyanqsgy gkvsrrgghq nsykpy tide seguence: ISOFORM A NCBI Reference Seguence: NM_031370.2 LOCUS NM_031370 ACCESSION NM_031370 1 cttccgtcgg ccattttagg tggtccgcgg cggcgccatt aaagcgagga ggaggcgaga 61 gcggccgccg ctggtgctta ttctttttta gtgcagcggg agagagcggg agtgtgcgcc 121 gagt cgaa gggggcaggc cagggagagg cgcaggagcc tttgcagcca 181 cgcgcgcgcc ttccctgtct tgtgtgcttc gcgaggtaga gcgggcgcgc ggcagcggcg 241 gggattactt tgctgctagt ttcggttcgc ggcagcggcg ggtgtagtct cggcggcagc 301 gaca ctagcactat gtcggaggag cagttcggcg gggacggggc ggcggcagcg 361 gcgg ngtaggcgg gggc gagcaggagg gagccatggt ggcggcgaca 42; cagggggcag ngngngC gggaagcgga accg ggggcggaac cgcgtctgga 48; ggcaccgaag ggggcagcgc cgagtcggag ggggcgaaga ccag taagaacgag 54; gaggatgaag gccattcaaa ctcctcccca cgacactctg aagcagcgac ggcacagcgg 60; gaagaatgga aaatgtttat aggaggcctt agctgggaca ctacaaagaa agatctgaag 66; gactactttt ccaaatttgg tgaagttgta gactgcactc tgaagttaga tcctatcaca 72; gggcgatcaa ggggttttgg ctttgtgcta tttaaagaat cggagagtgt agataaggtc 78L atggatcaaa aagaacataa tggg attg atcctaaaag ggccaaagcc 84L atgaaaacaa cggt taaaaaaatt tttgttggtg gcctttctcc agatacacct 90; gaagagaaaa agta ctttggtggt tttggtgagg tggaatccat agagctcccc 96L aaca agaccaataa gaggcgtggg ttta ttacctttaa ggaagaagaa L02; ccagtgaaga agataatgga aaagaaatac cacaatgttg gtcttagtaa aaagtagcca tgtcgaagga acaatatcag caacagcaac agtggggatc tttgcaggaa gagctcgtgg aagaggtggt ggccccagtc aaaactggaa ccagggatat L20; agtaactatt aagg ctatggcaac tatggatata acagccaagg ttacggtggt L26; tatggaggat atgactacac tggttacaac aactactatg gatatggtga caac L32; agtg gttatgggaa ggtatccagg cgaggtggtc atcaaaatag ctacaaacca L38; tactaaatta ttccatttgc aacttatccc caacaggtgg tgaagcagta ttttccaatt L44; tgaagattca tttgaaggtg gctcctgcca aata gcagttcaaa ctaaattttt L50; tgtatcaagt ccctgaatgg aagtatgacg ttgggtccct ctgaagttta attctgagtt L56; ctcattaaaa gaaatttgct ttcattgttt tatttcttaa ttgctatgct tcagaatcaa L62; tttt atgccctttc ccccagtatt gtagagcaag tcttgtgtta aaagcccagt L68; gtgacagtgt catgatgtag ctta ctggtttttt aataaatcct tttgtataaa L74; aatgtattgg ctcttttatc atag gaaaaattgt catggattca agttattaaa L80; agcataagtt tggaagacag gcttgccgaa attgaggaca tgattaaaat tgcagtgaag L86: tttgaaatgt ttttagcaaa atctaatttt tgccataatg tgtcctccct aatgtcaatt tgtttgttgg ttgttttaat aatacttcct L98; ttaagattta tatgaatatt ttcccaaatg cccagttttt gcttaatatg tattgtgctt 204; tttagaacaa atctggataa atgtgcaaaa gtaccccttt gcacagatag ttaatgtttt 210; atgcttccat taaataaaaa ggacttaaaa tctgttaatt ataatagaaa tgcggctagt 216; tcagagagat ttttagagct gtggtggact atga attcaagtgt tgagggagga 222; ttaaagaaat atataccgtg tttatgtgtg tgtgctt Protein seguence: M A NCBI Reference Seguence: NP_112738.1 LOCUS NP_112738 ACCESSION NP_112738 1 mseequgdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 61 aesegakida skneedeghs nssprhseaa taqreewkmf igglswdttk kdlkdyfskf 121 tlkl srgf gfvlfkeses vdkvmqueh klngkvidpk rakamktkep 181 vkkifvggls pdtpeekire yfggfgeves ielpmdnktn krrgfcfitf kkim 241 vgls kceikvamsk equqqqug srggfagrar sqnw nqusnywnq 301 gygnygynsq gyggyggydy tgynnyygyg dyanqsgyg kvsrrgghqn sykpy RPL32 Official Sym bol: RPL32 Official Name: ribosomal protein L32 Gene ID: 6161 Organism: Homo sapiens Other Aliases: AU020185, 3A Other Designations: 608 ribosomal n L32; snoRNA MBl-141 Nucleotide seguence: Transcript Variant 1 NCBI Reference Seguence: NM_000994.3 LOCUS NM_000994 ACCESSION NM_000994 1 aggggttacg acccatcagc ccttgcgcgc caccgtccct tctctcttcc tcggcgctgc 61 ctacggaggt ggcagccatc tccttctcgg catcatggcc gccctcagac cccttgtgaa 121 gcccaagatc aaga gaaccaagaa gttcatccgg caccagtcag accgatatgt 181 caaaattaag Cgtaactggc ggaaacccag aggcattgac aacagggttc gtagaagatt 24; caagggccag atcttgatgc ccaacattgg ttatggaagc aacaaaaaaa caaagcacat ; gctgcccagt ggcttccgga agttcctggt ccacaacgtc ctgg aagtgctgct 36; gatgtgcaac aaatcttact gtgccgagat cgctcacaat gtttcctcca agaaccgcaa 42; agccatcgtg gaaagagctg cccaactggc catcagagtc accaacccca atgccaggct 48; gcgcagtgaa gaaaatgagt aggcagctca tgtgcacgtt ttctgtttaa ataaatgtaa 54; aaactgccat ctggcatctt ccttccttga ttttaagtct tcagcttctt ggccaactta 60L gtttgccaca gagattgttc ttttgcttaa gcccctttgg ccat ttggagggga 66L tttgtaaagg acactcagtc cttgaacagg ggaatgtggc ctcaagtgca cagactagcc 72L ttagtcatct ccagttgagg atga ggggtacaga cttggccctc ggta 78L ggttctgaga cacttgaaga tggc tcccaagcca caagtagtca ttcttagcct 84; tgcttttgta aagttaggtg acaagttatt ccatgtgatg cttgtgagaa ttgagaaaat 90; atgcatggaa atatccagat gaatttctta cacagattct tacgggatgc ctaaattgca 96; aact tctgtccaaa aagaacagga tgatgtacaa attgctcttc caggtaatcc L02; accacggtta actggaaaag cactttcagt ctcctataac cctcccacca gctgctgctt L08; caggtataat gttacagcag tttgccaagg cggggaccta actggtgaca attgagcctc L14; ttgactggta ctcagaattt agtgacacgt ggtcctgatt ttttttggag acggggtctt L20; gctctcaccc aggctgggag tgcagtggca cactgactac agccttgacc tccccaggct L26; caggtgatct tcccacctca gccttccaag tagctgggac tacagatgca cacctccaaa L32; cctgggtagt ttttgaagtt tttttgtaga ggtggtctag ccatgttgcc taggctcccg L38; aactcctgag ctcaagcaat cctgcttcag cctcccaaag tactgggatt acaggcatct L44; tctgtagtat ataggtcatg agggatatgg gatgtggtac ttatgagaca gaaatgctta L50; caggatgttt ttctgtaacc atcctggtca acttagcaga aatgctgcgc tgggtataat L56; aaagcttttc tagt agga atcttacaga cctg ttcaaaacct L62; aaat atttataatg caaactggtc aaaaaaaaaa aaaaaaaa Protein seguence: Transcript t 1.
NCBI Reference ce: 985.1 LOCUS 985 ACCESSION NP_000985 l maalrplvkp kivkkrtkkf irhqsdryvk ikrnwrkprg idnrvrrrfk gqilmpnigy 61 gsnkktkhml psgfrkflvh nvkelevllm cnksycaeia hnvssknrka iveraaqlai 121 rvtnpnarlr seene Nucleotide ce: Transcript Variant 2.
NCBI Reference ce: 007073.1 LOCUS NM_001007073 ION NM_001007073 1 aggggttacg acccatcagc ccttgcgcgc caccgtccct tctctcttcc tcggcgctgc 61 ctacggaggt ggcagccatc tccttctcgc ttgg aagacactct gcgacagtgt 121 tcagtccctg aaag cctccttcca ggattcttcc tcacctgggg ccgcttcttc 181 cccaaaaggc atcatggccg gacc ccttgtgaag cccaagatcg tcaaaaagag 241 aaccaagaag ttcatccggc accagtcaga ccgatatgtc aaaattaagc gtaactggcg 301 gaaacccaga ggcattgaca acagggttcg tagaagattc aagggccaga tcttgatgcc 361 caacattggt tatggaagca acaaaaaaac aaagcacatg ctgcccagtg gcttccggaa 421 gttcctggtc cacaacgtca tgga agtgctgctg atgtgcaaca aatcttactg 481 tgccgagatc gctcacaatg tttcctccaa gaaccgcaaa gccatcgtgg aaagagctgc 541 ccaactggcc atcagagtca ccaa tgccaggctg cgcagtgaag aaaatgagta 601 ggcagctcat gtgcacgttt tctgtttaaa taaatgtaaa catc tggcatcttc 661 cttccttgat tttaagtctt cagcttcttg gccaacttag tttgccacag agattgttct 721 tttgcttaag cccctttgga atctcccatt tggaggggat ttgtaaagga cactcagtcc 781 ttgaacaggg gaatgtggcc tcaagtgcac agactagcct tagtcatctc cagttgaggc 841 tgggtatgag gggtacagac ttggccctca caccaggtag agac acttgaagaa 901 ggct cccaagccac aagtagtcat tcttagcctt gcttttgtaa agttaggtga 961 caagttattc catgtgatgc ttgtgagaat tgagaaaata tgcatggaaa tatccagatg 1021 aatttcttac acagattctt acgggatgcc taaattgcat cctgtaactt ctgtccaaaa 1081 ggat gatgtacaaa ttgctcttcc aggtaatcca ttaa ctggaaaagc 1141 agtc tcctataacc ctcccaccag ctgctgcttc aggtataatg ttacagcagt 1201 ttgccaaggc ggggacctaa ctggtgacaa ttgagcctct tgactggtac tcagaattta 1261 gtgacacgtg gtcctgattt tttttggaga cggggtcttg ctctcaccca ggctgggagt 1321 gcagtggcac actgactaca gccttgacct ccccaggctc aggtgatctt cccacctcag 138; ccttccaagt agctgggact acagatgcac acctccaaac ctgggtagtt tttgaagttt 144; ttttgtagag gtggtctagc catgttgcct aggctcccga actcctgagc tcaagcaatc 150; cagc ctcccaaagt atta caggcatctt ctgtagtata taggtcatga 156; tggg atgtggtact tatgagacag aaatgcttac aggatgtttt tctgtaacca 162; tcctggtcaa cttagcagaa cgct gggtataata ttct agtc 168; tagacaggaa tcttacagat tgtctcctgt tcaaaaccta gtcataaata tttataatgc 1741 aaactggtca aaaaaaaaaa aaaaaaa Protein seguence: Transcript Variant 2.
NCBI Reference Seguence: NP_001007074.1 LOCUS N P_001007074 ION N P_001007074 1 maalrplvkp kivkkrtkkf irhqsdryvk ikrnwrkprg idnrvrrrfk gqilmpnigy 61 khml psgfrkflvh nvkelevllm cnksycaeia hnvssknrka iveraaqlai 121 rvtnpnarlr seene Nucleotide seguence: Transcript Variant 3.
NCBI Reference Seguence: NM_001007074.1 LOCUS NM_001007074 ACCESSION NM_001007074 1 gacctcctgg gatcgcatct ggagagtgcc tagtattctg ccagcttcgg aaagggaggg 6; aaagcaagcc tggcagaggc acccattcca ttcccagctt gctccgtagc tggcgattgg 12; aagacactct gtgt cctg ggcaggaaag cctccttcca ggattcttcc 18; tcacctgggg ccgcttcttc cccaaaaggc atcatggccg ccctcagacc ccttgtgaag 24; cccaagatcg agag aaccaagaag ttcatccggc accagtcaga ccgatatgtc ; aaaattaagc gtaactggcg gaaacccaga ggcattgaca acagggttcg tagaagattc 36L aagggccaga tcttgatgcc caacattggt tatggaagca acaaaaaaac catg 42; ctgcccagtg gcttccggaa gttcctggtc cacaacgtca aggagctgga agtgctgctg 48; atgtgcaaca aatcttactg tgccgagatc gctcacaatg tttcctccaa gaaccgcaaa 54; gccatcgtgg aaagagctgc ccaactggcc atcagagtca ccaaccccaa tgccaggctg 60; cgcagtgaag aaaatgagta ggcagctcat gtgcacgttt tctgtttaaa taaatgtaaa 66; aactgccatc tggcatcttc cttccttgat tttaagtctt cagcttcttg gccaacttag 72; tttgccacag agattgttct tttgcttaag cccctttgga atctcccatt tggaggggat 78; ttgtaaagga cactcagtcc ttgaacaggg gaatgtggcc tcaagtgcac agactagcct 84; tagtcatctc cagttgaggc tgggtatgag gggtacagac ttggccctca caccaggtag 90; gttctgagac acttgaagaa gcttgtggct cccaagccac aagtagtcat tcttagcctt 961 gcttttgtaa agttaggtga caagttattc catgtgatgc ttgtgagaat tgagaaaata 1021 tgcatggaaa gatg aatttcttac acagattctt acgggatgcc taaattgcat 1081 cctgtaactt ctgtccaaaa agaacaggat gatgtacaaa ttgctcttcc ccacggttaa aagc actttcagtc tcctataacc ctcccaccag aggtataatg ttacagcagt ttgccaaggc ctaa ctggtgacaa tgactggtac tcagaattta gtgacacgtg gtcctgattt gaga ngggtcttg 132; ctctcaccca ggctgggagt gcagtggcac actgactaca gccttgacct ccccaggctc 138; aggtgatctt cccacctcag aagt gact acagatgcac aaac 144; ctgggtagtt tttgaagttt ttttgtagag gtggtctagc catgttgcct aggctcccga 150; actcctgagc tcaagcaatc ctgcttcagc ctcccaaagt actgggatta tctt 156; ctgtagtata taggtcatga tggg atgtggtact tatgagacag aaatgcttac 162; aggatgtttt tctgtaacca tcctggtcaa cttagcagaa atgctgcgct gggtataata 168; aagcttttct agtc tagacaggaa agat tgtctcctgt tcaaaaccta 174; gtcataaata tttataatgc gtca aaaaaaaaaa aaaaaaa Protein seguence: Transcript Variant 3.
NCBI Reference Seguence: NP_001007075.1 LOCUS NP_001007075 ACCESSION NP_001007075 1 maalrplvkp kivkkrtkkf irhqsdryvk ikrnwrkprg rrfk gqilmpnigy 61 gsnkktkhml psgfrkflvh nvkelevllm cnksycaeia hnvssknrka iveraaqlai 121 rvtnpnarlr seene Gene Official : ATP5H Official Name: ATP synthase, H+ transporting, mitochondrial Fo x, subunit d Gene ID: 10476 Organism: Homo sapiens Other Aliases: My032, ATPQ Other Designations: ATP synthase D chain, mitochondrial; ATP se subunit d, mitochondrial; ATP synthase, H+ transporting, mitochondrial F0 complex, subunit d; ATP synthase, H+ transporting, mitochondrial F1 F0, subunit d; ATPase subunit d; My032 protein Nucleotide seguence: ISOFORM B NCBI Reference Seguence: NM_001003785.1 m NM_001003785 ION NM_001003785 l tgacccactt ccgttacttg ctgcggagga ccgtgggcag ccagggtcgg atcc 6; caaaatggct gggcgaaaac ttgctctaaa tgac tgggtagctt ttgcagagat 12; cataccccag aaccaaaagg ctag gaaa tcctggaatg agaccctcac 18; ctccaggttg gctgctttac ctgagaatcc accagctatc gactgggctt actacaaggc 24; caatgtggcc aaggctggct tggtggatga ctttgagaag aaggtgaaat cttgtgctga ; gtgggtgtct ctctcaaagg ccaggattgt agaatatgag aaagagatgg agaagatgaa 36; gaacttaatt ccatttgatc agatgaccat tgaggacttg aatgaagctt tcccagaaac 42; caaattagac aagaaaaagt atccctattg gcctcaccaa ccaattgaga atttataaaa 48; ttgagtccag gctc tggcccttgt attacacatt ctggacatta ataa 54; ttatacagtt aaaaaa Protein seguence: ISOFORM B NCBI Reference Seguence: NP_001003785.1 MNP_001003785 ACCESSION NP_001003785 l magrklalkt idwvafaeii pqnqkaiass lkswnetlts rlaalpenpp aidwayykan 61 vakaglvddf ekkvkscaew vslskarive yekemekmkn lipqumtie dlneafpetk 121 1dkkkypywp hqpienl Nucleotide seguence: ISOFORM A NCBI Reference Seguence: NM_006356.2 LOCUS NM_006356 ACCESSION NM_006356 l tgacccactt ccgttacttg ctgcggagga ccgtgggcag ccagggtcgg tgaaggatcc 6L caaaatggct gggcgaaaac ttgctctaaa aaccattgac tgggtagctt ttgcagagat 12; cataccccag aaccaaaagg ccattgctag ttccctgaaa tcctggaatg agaccctcac 18; ctccaggttg gctgctttac ctgagaatcc accagctatc gactgggctt actacaaggc 24; ggcc aaggctggct tggtggatga ctttgagaag aagtttaatg cgctgaaggt ; gcca gaggataaat atactgccca ggtggatgcc aaag aagatgtgaa 36; atcttgtgct gagtgggtgt caaa gatt gtagaatatg agaaagagat 42; gatg aagaacttaa ttccatttga tcagatgacc attgaggact tgaatgaagc 48; agaa accaaattag acaagaaaaa gtatccctat tggcctcacc aaccaattga 54; gaatttataa aattgagtcc aggaggaagc tctggccctt gtattacaca ttctggacat 60; taaaaataat aattatacag aa Protein seguence: ISOFORM A NCBI Reference Seguence: NP_006347.1 LOCUS N P_006347 ACCESSION NP_006347 1 magrklalkt idwvafaeii pqnqkaiass 1kswnet1ts rlaalpenpp aidwayykan 61 vakaglvddf ekkfnalkvp vpedkytaqv daeekedvks caewvslska riveyekeme 121 kmknlipqu mtiedlneaf kkky pien 1 Official Symbol: PSMA1 Official Name: proteasome (prosome, macropain) subunit, alpha type, 1 Gene ID: 5682 sm: Homo sapiens Other Aliases: HC2, NU, PROS30 Other Designations: 30 kDa prosomal protein; PROS-30; macropain subunit C2; macropain subunit nu; multicatalytic endopeptidase complex subunit 02; proteasome component 02; proteasome nu chain; proteasome subunit alpha type-1; proteasome subunit nu; proteasome subunit, alpha-type, 1; protein P30- Nucleotide seguence: ISOFORM 3 NCBI Reference Seguence: NM_001143937.1 LOCUS NM_001 143937 ACCESSION 143937 l gatatctctg gaatagactg cgctaccctg cgccgccgcc ctcc cgcagacttc 6; gatc cgat actttcggca gcacctcctt gattctcagt tttgctggag 12; gccgcaacca ggcccgcgcc gccaccatgt ttcgaaatca gtatgacaat gatgtcactg 18; tttggagccc ccagggcagg attcatcaaa ttgaatatgc aatggaagct gttaaacaag 24; gttcagccac agttggtctg aaatcaaaaa ctcatgcagt tttggttgca ttgaaaaggg ; cgcaatcaga agct catcagaaaa aaattctcca tgttgacaac catattggta 36; tctcaattgc ggggcttact gctgatgcta gactgttatg taattttatg cgtcaggagt 42; gtttggattc cagatttgta ttcgatagac cactgcctgt tctt gtatctctaa 48; ttggaagcag tatccttttt atgttagcat ttatggatat gaactttgaa gggttttgat 54; acttgtgtta attattagga atataataat aatatgacat aggtaagatt gtgaaaactt 60; taaaacaaca aattggattg ctctttcatt agcctttata ttat atttgctaga 66; cacaaataag ttca ggaaaatcat ctaagcatct ttttagaggg gatttaaagt 72; ttcttaatgg ttctagatgt cccaagaaat cctagacccc ttgtatccaa aacaaatcag 78- gttttagatg ggaagaaatt attttgctgg cactctttct taggttcggt tcaa 84L acattttata ttaggccaaa gaaatagtgt cctattgcat tatttctctg gtggattatg 90; caacaattaa agaataagcc agagac n seguence: ISOFORM 3 NCBI Reference ce: NP_001137409.1 LOCUS N P_001 137409 ION NP_001 137409 1 mfrnqydndv tvwqugrih qieyameavk qgsatvglks kthavlvalk raqselaahq 61 kkilhvdnhi gisiagltad fmrq ecldsrfvfd rplpvsrlvs ligssilfml 121 fegf Nucleotide seguence: ISOFORM 2 NCBI Reference Seguence: NM_002786.3 LOCUS NM_002786 ACCESSION NM_002786 1 gatatctctg gaatagactg cgctaccctg cgccgccgcc gtcaaactcc cgcagacttc 61 tctgtagatc gctgagcgat ggca gcacctcctt cagt ggag 121 gccgcaacca ggcccgcgcc gccaccatgt ttcgaaatca caat gatgtcactg 181 tttggagccc ccagggcagg caaa ttgaatatgc aatggaagct gttaaacaag 241 gttcagccac agttggtctg aaatcaaaaa ctcatgcagt tttggttgca ttgaaaaggg 301 cgcaatcaga agct catcagaaaa aaattctcca tgttgacaac catattggta 361 tctcaattgc ggggcttact gctgatgcta gactgttatg taattttatg cgtcaggagt 421 gtttggattc cagatttgta ttcgatagac cactgcctgt gtctcgtctt gtatctctaa 481 ttggaagcaa gacccagata ccaacacaac gatatggccg gagaccatat ggtgttggtc 541 tccttattgc tggttatgat gatatgggcc ctcacatttt ccaaacctgt ccatctgcta 601 actattttga ctgcagagcc atgtccattg gagcccgttc ccaatcagct cgtacttact 661 gaca tatgtctgaa tttatggagt gtaatttaaa tgaactagtt aaacatggtc 721 tgcgtgcctt aagagagacg cttcctgcag aacaggacct gactacaaag aatgtttcca 781 ttggaattgt tggtaaagac ttggagttta caatctatga tgatgatgat gtgtctccat 841 tcctggaagg tcttgaagaa agaccacaga gaaaggcaca gcctgctcaa aacctgcaga aaaggctgat gaaccaatgg aagt gataagccag tattatcaaa tatgtaagaa tacaggcacc tgat gacaataatc tatactttga 1021 accaaaagtt gcagagtggt ggaatgctat gttttaggaa tcagtccaga ttccaagcaa cctcactgaa acctatataa tggaatacat ttttctttga aagggtctgt 1141 ataatcattt tctagaaagt atgggtatct atactaatgt atga agaacatagg 1201 tgtctttgtg gttttaaaga caactgtgaa ataaaattgt ttcaccgcct ggtaaaaaaa 1261 aaaaaaaaaa aaaaaaaaaa a Protein seguence: ISOFORM 2 NCBI Reference Seguence: NP_002777.1 LOCUS NP_002777 ACCESSION NP_002777 l mfrnqydndv tvwqugrih qieyameavk qgsatvglks kthavlvalk raqselaahq 61 kkilhvdnhi gisiagltad fmrq ecldsrfvfd rplpvsrlvs ligsktqipt 121 qrygrrpygv glliagyddm gphifqtcps anyfdcrams igarsqsart ylerhmsefm 181 ecnlnelvkh glralretlp aeqdlttknv kdle ftiyddddvs pflegleerp 241 quaqpaqpa depaekadep meh tide seguence: ISOFORM 1 NCBI Reference Seguence: NM_148976.2 LOCUS 976 ION NM_148976 1 cggccgccca acagggacgc gagccgggac cacgccgacc cagcgtgccc aggccgagga 6; aagcgcggcg gcggcagtcc gaagacccac cgggactgaa agagaaggac gaggtcatct 12; tcggacggga ggggcaagcc cctg ggaccccagg cgtgcaggtt ctctttgagg 18; gtattccacc ctgcaaaaag catgtattca tggtcagctc aggc cagtagcaga 24; gtggtaaagg ccct ccaaggctgg gaaaagacaa tgacaagtca aatccagacc ; tatgttgtat gttggtctac taggtgactg tctcctggaa atgttatgca gctcagcaag 361 tttc gaaatcagta tgacaatgat gtcactgttt ggagccccca gggcaggatt 42L catcaaattg aatatgcaat ggaagctgtt aaacaaggtt cagccacagt tggtctgaaa 48L tcaaaaactc atgcagtttt ggttgcattg aaaagggcgc aatcagagct tgcagctcat 541 cagaaaaaaa ttctccatgt tgacaaccat attggtatct caattgcggg gcttactgct 60; gatgctagac tgttatgtaa ttttatgcgt caggagtgtt ccag atttgtattc 66; gatagaccac tgcctgtgtc tcgtcttgta attg gaagcaagac ccagatacca 721 acacaacgat atggccggag accatatggt ctcc ttattgctgg ttatgatgat 781 cctc acattttcca tcca tctgctaact actg cagagccatg 841 tccattggag cccgttccca atcagctcgt acttacttgg agagacatat gtctgaattt 90; atggagtgta atttaaatga actagttaaa catggtctgc taag agagacgctt 961 cctgcagaac tgac tacaaagaat gtttccattg gaattgttgg taaagacttg 1021 acaa tctatgatga tgatgatgtg tctccattcc tggaaggtct tgaagaaaga 1081 ccacagagaa aggcacagcc tgctcaacct gctgatgaac ctgcagaaaa ccaatggaac attaagtgat aagccagtct atatatgtat tatcaaatat aggcaccaca tgac aataatctat actttgaacc aaaagttgca atgctatgtt ttaggaatca gtccagatgt gagttttttc caagcaacct tatataatgg aatacatttt tctttgaaag ggtctgtata atcattttct agaaagtatg -381 ggtatctata ctaatgtttt tatatgaaga acataggtgt ctttgtggtt ctgtgaaata aaattgtttc accgcctggt aaaa aaaaaaaaaa aaaaaaaa Protein seguence: ISOFORM 1 NCBI Reference ce: NP_683877.1 LOCUS NP_683877 ACCESSION NP_683877 1 mqlskkarn qydndvtvws pqgrihqiey ameavkqgsa tvglksktha vlvalkraqs 61 elaahqkkil hvdnhigisi agltadarll cnfmrqecld srfvfdrplp vsrlvsligs 121 ktqiptqryg rrpygvglli agyddmgphi fqtcpsanyf dcramsigar sqsartyler 181 hmsefmecnl nelvkhglra 1ret1paeqd 1ttknvsigi vgkdleftiy pfle 241 gleerpqua qpaqpadepa ekadepmeh PTBP1 Official Symbol: PTBP1 Official Name: polypyrimidine tract binding protein 1 Gene ID: 5725 sm: Homo sapiens Other Aliases: PTB4,pPTB Other Designations: 57 kDa RNA-binding protein PPTB-1; RNA-binding protein; heterogeneous nuclear ribonucleoprotein I; heterogeneous nuclear ribonucleoprotein polypeptide I; hnRNP I; polypyrimidine tract binding protein (heterogeneous r ribonucleoprotein I); polypyrimidine tract-binding protein Nucleotide seguence: ISOFORM A NCBI Reference Seguence: NM_002819.4 LOCUS NM_002819 ACCESSION NM_002819 l tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct 6; gcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 12; cccagatata gccgttggta caaagcgggg cgag cttttctcta cttgtgtcac 18; taacggaccg tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa 24; caaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc acatccggaa ; gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct ttgggaaggt 36; caccaacctc ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga 42; caac accatggtga actactacac ctcggtgacc cctgtgctgc gcggccagcc 48; catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 54; gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 60; tgcctcggcg gcggccgtgg acgcagggat ggcgatggcc agcc ccgtgctcag 66; gatcatcgtg gagaacctct tctaccctgt ggat gtgctgcacc tctc 72L caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 78L gctgcagtat cccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 84L catctacaac gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt 90; caagtacaac aatgacaaga acta cacacgccca gacctgcctt ccggggacag 96; ccagccctcg ctggaccaga ccatggccgc cggt gcacctggta taatctcagc 102; ctctccgtat gcaggagctg ctcc cacctttgcc attcctcaag ctgcaggcct L08; ttccgttccg aacgtccacg gcgccctggc ccccctggcc atcccctcgg cggcggcggc L14; agctgcggcg gcaggtcgga tcgccatccc gggcctggcg ggggcaggaa attctgtatt L20; gctggtcagc aacctcaacc cagagagagt ccaa ttta ttcttttcgg L26; cgtctacggt gacgtgcagc gcgtgaagat cctgttcaat aagaaggaga acgccctagt L32; gcagatggcg gacggcaacc aggcccagct ggccatgagc cacctgaacg ggcacaagct L38; gcacgggaag cccatccgca tcacgctctc gaagcaccag aacgtgcagc tgccccgcga L44; gggccagqag gaccagggcc tgaccaagga caac tcacccctgc accgcttcaa L50; gaagccgqgc tccaagaact tccagaacat attcccgccc tcggccacgc caacatcccg ccctcagtct ccgaggagga tctcaaggtc ctgttttcca caaa ggattcaagt agaa ggaccgcaag atggcactga ctccgtggag gaggcggtcc aggccctcat tgacctgcac aaccacgacc ccaccacctg nggtctcct tctccaagtc caccatctag aggc ccccacggcc L80; gggccccctg actt ccatcattcc agagaaaagc cactttaaaa acagctgaag L86; tgaccttagc agaccagaga ttttattttt ttaaagagaa atcagtttac ctgtttttaa L92; aaaaattaaa tctagttcac cttgctcacc ctgcggtgac agggacagct caggctcttg L98; gtgactgtgg cagcgggagt tcccggccct ccacacccgg ggccagaccc tcggggccat 204; gccttggtgg tgtc gggcgtgggg cctgcaggtg ggcgccccga ccacgacttg 210; gcttccttgt gccttaaaaa cttc ctgcagccac acacccaccc ggggtgtcct 216; ggggacccaa ggggtggggg ggtcacacca gagagaggca gggggcctgg ccggctcctg 222; caggatcatg cagctggggc gccg cggctgcgac accccaaccc cagccctcta 228; atcaagtcac gtgattctcc cttcaccccg cccccagggc cttcccttct gcccccaggc 234; gggctccccg cagc tgcggagctg gtcgacataa tctctgtatt atatactttg 240; cagttgcaga cgtctgtgcc tatt tgac ttct aatctttttt 246; tatg caaaagaaat agttttaagt aactttttat agcaagatga tacaatggta 252; tgagtgtaat ttcc ttgtggtatt accttgtatg ctgttacttt tattttattc 2581 cttgtaatta agtcacaggc aggacccagt ttccagagag caggcggggc cgcccagtgg 264; gtcaggcaca gggagccccg gtcctatctt agagcccctg agcttcaggg aaggggcggg 270; cgtgtcgccg cctctggcat cgcctccggt tgccttacac cacgccttca cctgcagtcg 276; cctagaaaac ttgctctcaa acttcagggt tttttcttcc ttcaaatttt ggaccaaagt 282; ctcatttctg tgttttgcct gcctctgatg cccg gaaggcgggc gctcctcctg 288; ctgt gctctttcta ccgcccccgc gtcctgtccc gggggctctc ctaggatccc 294; ctttccgtaa aagcgtgtaa caagggtgta aatatttata attttttata cctgttgtga 300; gacccgaggg gcggcggcgc ggttttttat acaa atgtatattt tgctaacagc 306; aattccaggc tcagtattgt gaccgcggag ccacagggga ccccacgcac attccgttgc 312; cttacccgat ggcttgtgac agaa ccgattaaaa ccgtttgaga aactcctccc 318; ttgtctagcc ctgtgttcgc tgtggacgct gcag gttggccagt ctgtacctgg 324: acttcgaata aatcttctgt atcctcgctc cgttccgcct taaaaaaaaa aaaaaaaaaa 330; aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa Protein ce: ISOFORM A NCBI Reference ce: NP_002810.1 LOCUS NP_00281 0 ACCESSION NP_002810 l mdgivpdiav gtkrgsdelf stcvtngpfi mssnsasaan gndskkfkgd srsagvpsrv 6; ihirklpidv tegevislgl pfgkvtnllm afie mnteeaantm vnyytsvtpv 12; lrgqpiyiqf ktds spnqaraqaa lqavnqusg nlalaasaaa vdagmamagq 18; iven lfypvtldvl hqifskfgtv lkiitftknn qfqallqyad pvsaqhakLs 24; ldgqniynac ctlridfskl tslnvkynnd ksrdytrpdl psgdsqpsld qtmaaafgap ; giisaspyag agfpptfaip qaaglsvpnv hgalaplaip saaaaaaaag riaipglaga 36; gnsvllvsnl npervtpqsl filfgvygdv qrvkilfnkk enalvqmadg nqaqlamshl 42; gkpi ritlskhqnv qlpregqedq gltkdygnsp lhrfkkpgsk nfqnifppsa 48; tlhlsnipps vseedlkvlf ssnggvvkgf kma liqmgsveea vqalidlhnh 54; dlgenhhlrv sfsksti Nucleotide seguence: ISOFORM B NCBI Reference Seguence: NM_031990.3 LOCUS NM_031990 ACCESSION 990 l tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct 61 attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 12; cccagatata gccgttggta caaagcgggg atctgacgag tcta cttgtgtcac 18; taacggaccg tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa 24; gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc acatccggaa ; gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct aggt 36; caccaacctc ctgatgctga aaaa ccaggccttc atcgagatga acacggagga 42; ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc gcggccagcc 48L catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 54L gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 60; tgcctcgch gcggccgtgg acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 66L gatcatcgtg gagaacctct tctaccctgt ggat gtgctgcacc agattttctc 72; caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 78; gtat gcggaccccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 84; caac gcctgctgca gcat cgacttttcc aagctcacca gcctcaacgt 90; caac aatgacaaga gccgtgacta cacacgccca gacctgcctt ccggggacag 96; ccagccctcg ctggaccaga ccgc ggccttcgcc tatg caggagctgg L02; tttccctccc acctttgcca ttcctcaagc cctt tccgttccga acgtccacgg L08; cgccctggcc cccctggcca tcccctcggc ggcggcggca gcgg caggtcggat L14; cgccatcccg ggcctggcgg gggcaggaaa ttctgtattg ctggtcagca acctcaaccc L20; agagagagtc acaccccaaa gcctctttat cggc gtctacggtg acgtgcagcg L26; cgtgaagatc ctgttcaata agaa cgccctagtg cagatggcgg acggcaacca L32; ggcccagctg gccatgagcc acctgaacgg gcacaagctg cacgggaagc gcat L38; ctcg aagcaccaga acgtgcagct gccccgcgag ggccaggagg accagggcct L44; gaccaaggac tacggcaact cacccctgca ccgcttcaag aagccgggct ccaagaactt L50; ccagaacata ttcccgccct cggccacgct gcacctctcc aacatcccgc cctcagtctc L56; cgaggagqat ctcaaggtcc tgttttccag caatgggggc gtcgtcaaag cttccagaag gaccgcaaga tggcactgat ccagatgggc tccgtggagg aggcggtcca L681 ggccctcatt gacctgcaca accacgacct cggggagaac caccacctgc gggtctcctt L74; ctccaagtcc accatctagg ggcacaggcc cccacggccg ggccccctgg cgacaacttc L80; catcattcca gagaaaagcc actttaaaaa cagctgaagt gaccttagca gaccagagat L86; tttatttttt taaagagaaa tcagtttacc tgtttttaaa aaaattaaat ctagttcacc 192; ttgctcaccc tgcggtgaca gggacagctc aggctcttgg tgactgtggc agcgggagtt 198; cccggccctc cacacccggg gccagaccct cggggccatg ccttggtggg gcctgtgtcg 204; ggcgtggggc ctgcaggtgg gcgccccgac cacgacttgg cttccttgtg aaaa 210; cctgccttcc tgcagccaca cacccacccg gggtgtcctg gggacccaag gggg 216; gtcacaccag gcag ggggcctggc cggctcctgc aggatcatgc ggcg 222; ngngCCg’C ggctgcgaca ccccaacccc agccctctaa tcaagtcacg tgattctccc 228: ttcaccccgc ccccagggcc ttcccttctg cccccaggcg ggctccccgc tgctccagct 234; gcggagctgg tcgacataat ctctgtatta ttgc agttgcagac gtctgtgcct 240; agcaatattt gacc aaatattcta atcttttttc atttatatgc aaaagaaata 2461 gttttaagta actttttata gcaagatgat gtat gagtgtaatc taaacttcct 252; tgtggtatta ccttgtatgc tgttactttt attttattcc ttgtaattaa gtcacaggca 258; ggacccagtt tccagagagc aggcggggcc gcccagtggg tcaggcacag ggagccccgg 264; tcctatctta gagcccctga gcttcaggga aggggcgggc ccgc ctctggcatc 270; gcctccggtt gccttacacc acgccttcac ctgcagtcgc ctagaaaact tgctctcaaa 276; cttcagggtt ttttcttcct tcaaattttg gaccaaagtc tcatttctgt gttttgcctg 282; cctctgatgc tgggacccgg ggcg ctcctcctgt cttctctgtg ctctttctac 288; cgcccccgcg tcctgtcccg ggggctctcc taggatcccc tttccgtaaa agcgtgtaac 294; aagggtgtaa atatttataa ttttttatac ctgttgtgag acccgagggg cggcggcgcg 300; tatg gtgacacaaa tttt gctaacagca attccaggct cagtattgtg 306; accgcggagc cacaggggac cccacgcaca ttccgttgcc ttacccgatg gcttgtgacg 312; cggagagaac cgattaaaac cgtttgagaa actcctccct tgtctagccc tgtgttcgct 318; gtggacgctg tagaggcagg ttggccagtc tgtacctgga cttcgaataa atcttctgta 324; tcctcgctcc gttccgcctt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 330; aaaaaaaaaa aaaaaaaaa n seguence: ISOFORM B NCBI nce Seguence: NP_114367.1 LOCUS NP_1 14367 ACCESSION NP_114367 l mdgivpdiav gtkrgsdelf stcvtngpfi mssnsasaan gndskkfkgd psrv 6; ihirklpidv tegevislgl pfgkvtnllm "kgknqafie mnteeaantm vnyytsvtpv 12; lrgqpiyiqf snhkelktds spnqaraqaa "qavnqusg saaa vdagmamagq 18; spvlriiven ldvl hqifskfgtv "kiitftknn qyad pvsaqhakLs 24; ldgqniynac ctlridfskl tslnvkynnd ksrdytrpdl psgdsqpsld qtmaaafasp ; yagagfpptf aipqaaglsv pnvhgalapl aipsaaaaaa aagriaipgl agagnsvlLv 36L snlnpervtp qslfilfgvy gdvqrvkilf nkkenalvqm adgnqaqlam shlnghklhg 42L kpiritlskh qnvqlpregq kdyg nsplhrfkkp gsknfqnifp psatlhlsni 48L edlk vlfssnggvv kgfkfqudr kmaliqmgsv eeavqalidl hnhdlgenhh 541 lrvsfsksti Nucleotide seguence: ISOFORM C NCBI Reference Seguence: NM_031991.3 LOCUS 991 ACCESSION NM_031991 l tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc tgct 6; attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 12; cccagatata ggta caaagcgggg cgag cttttctcta tcac 18; accg tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa 24; gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc acatccggaa ; gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct ttgggaaggt 36; caccaacctc ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga 42; caac accatggtga actactacac ctcggtgacc ctgc gcggccagcc 48; catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 54L gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 60; tgcctcgch gcggccgtgg acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 66L gatcatcgtg gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc 721 caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 78; gctgcagtat gcggaccccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 84; caac gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt 90; caagtacaac aaga gccgtgacta cacacgccca gacctgcctt ccggggacag 96; ccagccctcg caga ccatggccgc ggccttcggc ctttccgttc cgaacgtcca L02; cggcgccctg gcccccctgg ccatcccctc ggcggcggcg gcagctgcgg L08; gatcgccatc ccgggcctgg cgggggcagg aaattctgta ttgctggtca gcaacctcaa L14; cccagagaga gtcacacccc aaagcctctt tattcttttc ggcgtctacg gtgacgtgca L20; gcgcgtgaag ttca ataagaagga gaacgcccta gtgcagatgg gcaa L26; ccaggcccag ctggccatga gccacctgaa cgggcacaag ctgcacggga agcccatccg L32; catcacgctc tcgaagcacc agaacgtgca gctgccccgc cagg aggg L38; cctgaccaag gactacggca actcacccct gcaccgcttc aagaagccgg cttccagaac atattcccgc cctcggccac gctgcacctc tccaacatcc cgccctcagt L50; ctccgaggag gatctcaagg tcctgttttc cagcaatggg ggcgtcgtca aaggattcaa L56; gttcttccag aaggaccgca agatggcact gatccagatg ggctccgtgg aggaggcggt L62; ccaggccctc attgacctgc acaaccacga cctcggggag aaccaccacc caag tccaccatct aggggcacag gcccccacgg cccc ttccatcatt ccagagaaaa gccactttaa aaacagctga agtgacctta gcagaccaga L80; gattttattt ttttaaagag aaatcagttt acctgttttt aaaaaaatta aatctagttc L86; accttgctca ccctgcggtg acagggacag ctcaggctct tggtgactgt ggcagcggga L92; gttcccggcc ctccacaccc ggggccagac cctcggggcc tggt ggggcctgtg L98; tcgggcgtgg ggcctgcagg tgggcgcccc gaccacgact tggcttcctt gtgccttaaa 204; aaacctgcct tcctgcagcc acacacccac ccggggtgtc ctggggaccc aaggggtggg 210; ggggtcacac cagagagagg cagggggcct ggccggctcc tgcaggatca tgcagctggg 216; gcgcggcggc cgcggctgcg acaccccaac cccagccctc taatcaagtc acgtgattct 222; cccttcaccc cgcccccagg gccttccctt ctgcccccag gcgggctccc tcca 228; gctgcggagc tggtcgacat aatctctgta actt tgcagttgca gacgtctgtg 234; cctagcaata tttccagttg accaaatatt ctaatctttt ttcatttata tgcaaaagaa 2401 atagttttaa gtaacttttt atagcaagat atgg tgta actt 246; ccttgtgqta ttaccttgta tgctgttact tttattttat tccttgtaat taagtcacag 252; gcaggaccca gtttccagag agcaggcggg gccgcccagt gggtcaggca gccc 258; cggtcctatc ttagagcccc tgagcttcag ggaaggggcg ggcgtgtcgc cgcctctggc 264; atcgcctccg gttgccttac cctt cacctgcagt cgcctagaaa acttgctctc 270; aaacttcagg gttttttctt aatt ttggaccaaa gtctcatttc tgtgttttgc 276; ctgcctctga tgctgggacc ngaaggcgg gcgctcctcc tgtcttctct gtgctctttc 282; cccc gcgtcctgtc gctc tcctaggatc ccgt aaaagcgtgt 288; aacaagggtg taaatattta taatttttta tacctgttgt gagacccgag gggcggcggc 294; gcggtttttt atggtgacac aaatgtatat tttgctaaca gcaattccag gctcagtatt 300; gtgaccgcgg agccacaggg gaccccacgc acattccgtt gccttacccg atggcttgtg 306: acgcggagag aaccgattaa aaccgtttga gaaactcctc ccttgtctag ccctgtgttc 312; gctgtggacg ctgtagaggc aggttggcca acct ggacttcgaa taaatcttct 318; gtatcctcgc tccgttccgc cttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3241 aaaaaaaaaa aaaaaaaaaa aa Protein seguence: ISOFORM C NCBI Reference Seguence: NP_114368.1 LOCUS NP_114368 ACCESSION NP_114368 l mdgivpdiav gtkrgsdelf stcvtngpfi mssnsasaan fkgd srsagvpsrv 6; pidv tegevislgl nllm lkgknqafie mnteeaantm vnyytsvtpv 12; lrgqpiyiqf snhkelktds spnqaraqaa lqavnqusg nlalaasaaa vdagmamagq 18; spvlriiven lfypvtldvl hqifskfgtv lkiitftknn qfqallqyad pvsaqhakLs 24; ldgqniynac ctlridfskl tslnvkynnd ksrdytrpdl psld qtmaaafgLs ; vpnvhgalap laipsaaaaa aaagriaipg lagagnsvll ervt pqslfilfgv 36; ygdvqrvkil fnkkenalvq madgnqaqla mshlnghklh gkpiritlsk hqnvqlpreg 42; qedqgltkdy rfkk pgsknfqnif ppsatlhlsn ippsvseedl kvlfssnggv 48; vkgfkfqud rkmaliqmgs veeavqalid lhnhdlgenh hlrvsfskst i AP2A1 Official Symbol: AP2A1 al Name: adaptor-related protein complex 2, alpha 1 subunit Gene ID: 160 Organism: Homo sapiens Other Aliases: ADTAA, AP2—ALPHA, CLAPA1 Other Designations: 100 kDa coated vesicle n A; AP-2 complex subunit 1; r-related protein complex 2 alpha-1 subunit; n, alpha A; adaptor protein complex AP-2 t alpha-1; alpha-adaptin A; alpha1-adaptin; clathrin assembly protein complex 2 alpha-A large chain; clathrin- associated/assemblyfadaptor protein, large, alpha 1 ; plasma membrane adaptor HA2/AP2 adaptin alpha A subunit Nucleotide seguence: M 1 NCBI Reference Seguence: NM_014203.2 LOCUS NM_014203 ACCESSION 203 1 cggctcagag ctccggaccg cgggcggagg ggaggggcag ggggcggtgc cacggcctgc 6; cagcccgccc gcccgcccgc cagccagccc tccccgcggc cggctcggct ccttggcgct 12; gcctggggtc ctttccgccc ggtccccgct tgccagcccc cgctgctctg tgccctgtcc l8; ggccaggcct ggagccgaca ccaccgccat catgccggcc gtgtccaagg gcgatgggat 24; gcgggggctc gcggtgttca tctccgacat ccggaactgt aagagcaaag aggcggaaat ; taagagaatc aacaaggaac tggccaacat ccgctccaag ggag acaaagcctt 36; ggatggctac agtaagaaaa aatatgtgtg taaactgctt ttcatcttcc tgcttggcca 42; tgacattgac tttgggcaca tggaggctgt gaatctgttg agttccaata aatacacaga 48; gaagcaaata ggttacctgt tcatttctgt gctggtgaac tcgaactcgg agctgatccg 54; cctcatcaac atca agaatgacct ggccagccgc aaccccacct tcatgtgcct 60; ggccctgcac tgcatcgcca acgtgggcag ccgggagatg ggcgaggcct ttgccgctga 66; ccgc atcctggtgg ccggggacag catggacagt gtcaagcaga gtgcggccct 72; gtgcctcctt cgactgtaca aggcctcgcc tgacctggtg cccatgggcg agtggacggc 78; gcgtgtggta cacctgctca atgaccagca tgtg gtcacggccg ccgtcagcct 84; catcacctgt ctctgcaaga agaacccaga tgacttcaag acgtgcgtct ctctggctgt 90; gtcgcgcctg agccggatcg tctcctctgc ctccaccgac ctccaggact acacctacta 96L cttcgtccca gcaccctggc tctcggtgaa gctcctgcgg cagt gctacccgcc 102; tccagaggat gtga aggggcggct ggtggaatgt ctggagactg tgctcaacaa 108; ggcccaggag ccccccaaat ccaagaaggt gcagcattcc aacgccaaga acgccatcct 114; cttcgagacc ctca tcatccacta tgacagtgag cccaacctcc tggttcgggc L20; ctgcaaccag cagt tcctgcagca ccgggagacc aacctgcgct acctggccct L26; ggagagcatg tgcacgctgg ccga gttctcccat gaagccgtca agacgcacat L32; tgacaccgtc atcaatgccc cgga gcgggacgtc agcgtgcggc agcgggcggc L38; tgacctcctc tacgccatgt gtgaccggag caatgccaag cagatcgtgt cggagatgct L44; gcggtacctg gagacggcag actacgccat ccgcgaggag atcgtcctga aggtggccat L50; cctggccgag aagtacgccg tggactacag ctggtacgtg gacaccatcc tcaacctcat L56; ccgcattgcg ggcgactacg tgagtgagga gtac cgtgtgctac gacgtccagg gctatgccgc caagaccgtc tttgaggcgc gagaacatgg tgaaggttgg cggctacatc cttggggagt gacccccgct ccagcccccc agtgcagttc tccctgctcc ccatctgtgc agcgtggcca cgcgggcgct gctgctgtcc acctacatca L86; cctcttcccc aagg ccaccatcca gggcgtcctg cgggccggct cccagctgcg L92; caatgctgac gtggagctgc agcagcgagc cgtggagtac ctcaccctca L98; cagcaccgac gtcctggcca cggtgctgga gccg cccttccccg agcgcgagtc 204; gtccatcctg gccaagctga aacgcaagaa ggggccaggg agcg ccctggacga 210; tggccggagg gaccccagca gcaacgacat caacgggggc atggagccca cccccagcac 216; gacg ccctcgccct ccgccgacct cctggggctg cgggcagccc ctcccccggc 222; agcacccccg gcttctgcag gagcagggaa ccttctggtg gacgtcttcg atggcccggc 228; cgcccagccc gggc ccacccccga ggaggccttc ctcagcgagc tggagccgcc 234; tgcccccgag agccccatgg tggc agct ccagctgctg acccaggtcc 240; tgaggacatc ggccctccca aagc cgatgagttg ctgaataagt ttgtgtgtaa 246; gaacaacggg gtcctgttcg agaaccagct gctgcagatc ggagtcaagt cagagttccg 252; acagaacctg ggccgcatgt atctcttcta tggcaacaag acctcggtgc agaa 258; tttctcaccc actgtggttc acccgggaga cctccagact gctg tgcagaccaa 264; gcgcgtgch gcgcaggtgg acggcggcgc gcag caggtgctca atatcgagtg 270; cctgcggqac ttcctgacgc ccccgctgct gtccgtgcgc ttccggtacg gtggcgcccc 276; ccaggccctc accctgaagc tcccagtgac catcaacaag ttcttccagc ccaccgagat 282; ggcggcccag gatttcttcc agcgctggaa gcagctgagc ctccctcaac aggaggcgca 288; gaaaatcttc aaagccaacc accccatgga cgcagaagtt actaaggcca tggg 294; gtttggctct gctctcctgg acaatgtgga ccccaaccct gagaacttcg tgggggcggg 300; gatcatccag actaaagccc tgcaggtggg ctgtctgctt cggctggagc ccaatgccca 306; ggcccagatg taccggctga ccctgcgcac ggag cccgtctccc gtcacctgtg 312; tgagctgctg gcacagcagt tctgagccct ggactctgcc ccgggggatg tggccggcac 318; tgggcagccc cttggactga ggcagttttg gtggatgggg gacctccact ggtgacagag 324; aagacaccag ggtttggggg ggga ctttcctccg gccttttgta tttttatttt 330; tctg ttta cattctgggg ggttaggggg agtccccctc cctccctttc 336; aagc acagaggqga gaggggccag ggaagtggat gtctcctccc ctcccacccc 342; accctgttgt agcccctcct accccctccc catccagggg ctgtgtatta ttgtgagcga 348; ataaacagag agacgctaa Protein seguence: ISOFORM 1 NCBI Reference Seguence: NP_055018.2 LOCUS 01 8 ACCESSION NP_055018 l mpavskgdgm rglavfisdi eaei krinkelani rskfkgdkal kyvc 6; kllfifllgh meav nllssnkyte kqigylfisv lvnsnselir linnaikndl 12; asrnptfmcl alhcianvgs remgeafaad iprilvagds mdsvkqsaal cllrlykasp 18; dlvpmgewta rvvhllndqh mgvvtaavsl itclckknpd dfktcvslav srlsrivssa 24; stdlqdytyy fvpapwlsvk llrllqcypp pedaavkgrl vecletvlnk aqeppkskkv ; qhsnaknail fetisliihy dsepnllvra flqh retnlrylal asse 36; kthi dtvinalkte rdvsvrqraa dllyamcdrs nakqivseml ryletadyai 42; reeivlkvai laekyavdys wyvdtilnli riagdyvsee varvlqivt nrddvquaa 48; ktvfealqap achenmvkvg gyilgefgnl iagdprsspp vqfsllhskf hlcsvatral 54; llstyikfin lfpetkatiq gvlragsqlr nadvequra veyltlssva stdvlatvLe 60; emppfperes silaklkrkk gpgagsaldd grrdpssndi nggmeptpst vstpspsadl 66L lglraapppa appasagagn dgpa aqpslgptpe eaflselepp apespmalla 72; dpapaadpqp edigppipea dellnkfvck nngvlfenql lqigvksefr qnlgrmylfy 78L gnktqufqn fsptvvhpgd lqtqlavqtk gga qquvlniec lrdfltppll 84; svrfryggap qaltlklpvt inkffqptem qrwk qlslpqqeaq kifkanhpmd 90; aevtkakllg fgsalldnvd pnpenfvgag iiqtkalqvg cllrlepnaq aqmyrltlrt 96; skepvsrhlc ellaqqf Nucleotide ce: ISOFORM 2 NCBI Reference ce: NM_130787.2 LOCUS NM_130787 ACCESSION NM_130787 1 cggctcagag ctccggaccg nggcggagg ggaggggcag ggggcggtgc cacggcctgc 6L cagcccgccc gcccgcccgc cagccagccc tccccgcggc cggctcggct ccttggcgct 12; qcctqqqqtc ctttccgccc ggtccccgct cccc cgctgctctg tgccctgtcc l8; ggccaggcct ggagccgaca ccaccgccat catgccggcc gtgtccaagg gcgatgggat 24; gcgggggctc gcggtgttca tctccgacat ccggaactgt aaag aggcggaaat ; taagagaatc aacaaggaac tggccaacat ccgctccaag ttcaaaggag acaaagcctt 36; ggatggctac agtaagaaaa aatatgtgtg taaactgctt ttcatcttcc tgcttggcca 42; tgacattgac tttgggcaca tggaggctgt gaatctgttg agttccaata aatacacaga 48; gaagcaaata ctgt tcatttctgt gctggtgaac tcgaactcgg agctgatccg 54; cctcatcaac atca agaatgacct ggccagccgc aaccccacct tcatgtgcct 60; ggccctgcac gcca gcag ccgggagatg ggcgaggcct ttgccgctga 66; catcccccgc atcctggtgg ccggggacag catggacagt gtcaagcaga gtgcggccct 72; gtgcctcctt cgactgtaca aggcctcgcc tgacctggtg cccatgggcg agtggacggc 78; gcgtgtggta cacctgctca atgaccagca catgggtgtg gtcacggccg ccgtcagcct 84; catcacctgt ctctgcaaga agaacccaga tgacttcaag acgtgcgtct ctctggctgt 90; gtcgcgcctg agccggatcg tctcctctgc ctccaccgac gact acacctacta 96; cttcgtccca gcaccctggc tctcggtgaa gcgg ctgctgcagt gctacccgcc L02; tccagaggat gcggctgtga aggggcggct ggtggaatgt ctggagactg tgctcaacaa L08; ggcccaggag ccccccaaat ccaagaaggt gcagcattcc aacgccaaga acgccatcct L14: cttcgagacc atcagcctca tcatccacta tgag ctcc ctgcaaccag ctgggccagt tcctgcagca ccgggagacc aacctgcgct L26; ggagagcatg tgcacgctgg ccagctccga gttctcccat gaagccgtca agacgcacat L321 tgacaccgtc atcaatgccc tcaagacgga gcgggacgtc agcgtgcggc agcgggcggc L38; cctc tacgccatgt ggag caatgccaag cagatcgtgt cggagatgct L44; gcggtacctg gagacggcag actacgccat ccgcgaggag ctga aggtggccat L50; cctggccgag aagtacgccg acag ctggtacgtg gacaccatcc tcaacctcat L56; ccgcattgcg ggcgactacg tgagtgagga ggtgtggtac cgtgtgctac agatcgtcac L62; caaccgtgat cagg gctatgccgc cgtc tttgaggcgc tccaggcccc L68; tcac gagaacatgg tgaaggttgg cggctacatc cttggggagt ttgggaacct L74; gattgctggg gacccccgct ccagcccccc agtgcagttc tccctgctcc actccaagtt L80; ccatctgtgc agcgtggcca cgcgggcgct gctgctgtcc acctacatca agttcatcaa L86; cctcttcccc gagaccaagg ccaccatcca gggcgtcctg cgggccggct cccagctgcg L92; caatgctgac gtggagctgc gagc cgtggagtac ctcaccctca cagcaccgac gtcctggcca cggtgctgga ggagatgccg cccttccccg agcgcgagtc 204; gtccatcctg gccaagctga aacgcaagaa ggggccaggg gccggcagcg ccctggacga 210; tggccggagg gaccccagca gcaacgacat caacgggggc atggagccca cccccagcac 216; tgtgtcgacg ccctcgccct acct gctg gccc ctcccccggc 222; agcacccccg gcttctgcag gagcagggaa ccttctggtg gacgtcttcg atggcccggc 228; cgcccagccc agcctggggc ccacccccga ggaggccttc ctcagcccag gtcctgagga 234; ccct cccattccgg atga gttgctgaat aagtttgtgt gtaagaacaa 240; ngggtcctg ttcgagaacc agctgctgca gatcggagtc aagtcagagt tccgacagaa 246; ccgc atgtatctct tctatggcaa caagacctcg gtgcagttcc agaatttctc 252; acccactgtg gttcacccgg gagacctcca gactcagctg gctgtgcaga ccaagcgcgt 258; ggcggcgcag gtggacggcg gcgcgcaggt gcagcaggtg ctcaatatcg agtgcctgcg 264; ggacttcctg acgcccccgc tgctgtccgt gcgcttccgg tacggtggcg ccccccaggc 270; cctcaccctg aagctcccag tgaccatcaa caagttcttc cagcccaccg cggc 276; ccaggatttc ttccagcgct ggaagcagct gagcctccct caacaggagg cgcagaaaat 282; cttcaaagcc aaccacccca tggacgcaga agttactaag gccaagcttc tggggtttgg 288; ctctgctctc ctggacaatg tggaccccaa ccctgagaac ttcgtggggg cggggatcat 294: ccagactaaa gccctgcagg gtct gcttcggctg gagcccaatg cccaggccca 3001 gatgtaccgg ctgaccctgc gcaccagcaa ggagcccgtc tcccgtcacc tgtgtgagct 306; gctggcacag cagttctgag ccctggactc tgccccgggg gatgtggccg gcactgggca 312; gccccttgga ctgaggcagt tttggtggat gggggacctc cactggtgac gaca 318; ccagggtttg ggggatgcct gggactttcc tccggccttt tgtattttta ttca 324; tctgctgctg tttacattct ggggggttag ggggagtccc cctccctccc tttccccccc 3301 aagcacagag gggagagggg ccagggaagt ctcc tcccctccca ccccaccctg 3361 ttgtagcccc tcctaccccc tccccatcca ggggctgtgt attattgtga aaac 3421 agagagacgc taa Protein seguence: M 2 NCBI Reference Seguence: NP_570603.2 LOCUS NP_570603 ACCESSION NP_570603 l mpavskgdgm rglavfisdi rnckskeaei krinkelani rskfkgdkal dgyskkkyvc 6; llgh didfghmeav nllssnkyte kqigylfisv lvnsnselir linnaikndl 12; asrnptfmcl alhcianvgs remgeafaad iprilvagds mdsvkqsaal cllrlykasp 18; dlvpmgewta rvvhllndqh mgvvtaavsl itclckknpd dfktcvslav srlsrivssa 24; stdlqdytyy fvpapwlsvk llrllqcypp pedaavkgrl vecletvlnk aqeppkskkv ; qhsnaknail fetisliihy dsepnllvra cnqlgqflqh retnlrylal esmctlasse 36; fsheavkthi lkte rdvsvrqraa dllyamcdrs nakqivseml ryletadyai 42; reeivlkvai laekyavdys wyvdtilnli riagdyvsee varvlqivt nrddvquaa 48; ktvfealqap achenmvkvg fgnl iagdprsspp vqfsllhskf hlcsvatral 54; llstyikfin lfpetkatiq gvlragsqlr nadvequra veyltlssva tvLe 60; emppfperes silaklkrkk gpgagsaldd grrdpssndi nggmeptpst vstpspsadl 66; lglraapppa appasagagn llvdvfdgpa aqpslgptpe gped igppipeade 72; llnkfvcknn gvlfenqllq igvksefrqn lgrmylfygn ktqufqnfs ptvvhpgd;q 78; tqlavqtkrv aaquggaqv qulnieclr dfltppllsv rfryggapqa vtin 84; kffqptemaa qdffqrwkql slpqqeaqki fkanhpmdae vtkakllgfg salldnvdpn 90; penfvgagii qtkalqvgcl lrlepnaqaq rtsk epvsrhlcel laqqf TTLL12 Official Symbol: TTLL12 Official Name: tubulin tyrosine -like family, member 12 Gene ID: 23170 Organism: Homo sapiens Other Aliases: dJ526|14.2 Other Designations: tubulin—-tyrosine ligase-like protein 12 Nucleotide seguence: NCBI Reference Seguence: NM_015140.3 LOCUS NM_015140 ACCESSION NM_015140 l gccgacggac ggcgggcggc ggcggcggtg gcggcgctgg agtcggcgcg ggtgctggcg 6L ccatggaggc cgagcggggt cccgagcgcc ggcctgcgga gcgtagcagc ccgggccaga 12; cgccggagga gcag gccttggccg agttcgcggc gctgcacggc ccggcgctgc l8; gcgcttcggg ggtccccgaa cgttactggg gccgcctcct gcacaagctg gagcacgagg 24; ttttcgacgc tggggaagtg tttgggatca tgcaagtgga ggaggtagaa gaggaggagg ; acgaggcagc ccgggaggtg cggaagcagc agcccaaccc ggggaacgag ctgtgctaca 36; tcgt gaccagggag agcgggctcc aggcagccca ccccaacagc atcttcctca 42; tcgaccacgc ctggacgtgc Cgtgtggagc acgcgcgcca gcagctgcag caggtgcccg 48; ggctgctgca ccgcatggcc aacctgatgg gcattgagtt ccacggtgag ctgcccagta 54; cagaggctgt ggccctggtg ctggaggaga agtt gacc taccagctgg 60; ggac agctgaggag aagatgccgg tgtggtatat catggacgag ttcggttcgc 66; ggatccagca cgcggacgtg cccagcttcg ccacggcacc ctac atgccgcagc 72; aggtggccta cacgctgctg tggcccctga tgga cactggcgag accc 78; ttgc ctacggagag acggaccccc ggaa gtgcatgctg ctgccctggg 84; cccccaccga catgctggac tctt gcacacccga gccgcccgcc gagcactacc 90; aggccattct ggaggaaaac aaggagaagc tgccacttga catcaacccc gtggtgcacc 96; gcca catcttcaag gtctacacgg acgtgcagca ggtggccagc accc 1021 acccgcgctt caccctcacc cagagtgagg cggacgccga catcctcttc aacttctcac 108; acttcaagga ctacaggaaa ctcagccagg agaggccagg cgtgctgctg aaccagttcc 114; cctgcgagaa cctgctgact gtcaaggact gcctggcctc catcgcgcgc gccccgaggg cccaccctgg ctgccccgaa ccttcaacct gcgcactgag ctgccccagt 126; ttgtcagcta cttccagcag cgggaaaggt ggggcgagga caaccactgg atctgcaagc L32; cctggaacct ggcgcgcagc ctggacaccc acgtcaccaa gagcctgcac agcatcatcc L38; gaga gagcaccccc aaggttgtgt ccaagtacat cgaaagtccc gtgttgttcc L44; ttcgagaaga Cgtgggaaag gtcaagttcg acatccgcta gctg ctgcggtcag L50; tgaggcccct acggttgttc gtgtatgatg tgttctggct ctcc aaccgggcct L56; ttgcactcaa cgacctggat gactacgaga agcacttcac ggtcatgaac tatgacccgg L62; atgtggtgct gaagcaggtg cactgtgaag agttcatccc cgagtttgag aagcaatacc L68; cagaatttcc ctggacggac gtccaggctg agatcttccg ggccttcacg gagctgttcc L74; aqqtqqcctq tgccaagcca ccacccctgg gcctctgcga ctacccctca gcca L80; tgtatgccgt cgacctcatg tggg acaacggccc aagg agccgcagat cctggaggtg aacttcaacc ccgactgtga gcgagcctgc ccaccttctt caacgacgtc ttcagcacct tgtttctgga ccagcccggt ttacctgcct tgtctaggca ctcgctgtcc ccaaaacctg tgcttggggc aggattccaa 204; cctcagttct ctgagctgct tctgcaaagg cccccatgtc cctccccaca ccggccctgg 210; gcatagcctc agccccaggc ctctgtcctg ccgagccatc ctcccggcgc cacactccgg 216; gagcacagca tcctcctctc acctgtgggt cagagcagga tggt gtccccaggg 222; ctgagcacca ccccacgccc tgccctcacc cctcaccacc atctgtgcac tgatgagtct 228; tagc cttt gttcctggca tggagaattt gttcctggct gctgtgtttc 234; cagggggtgc tgggggaagg gttccgtgga gcgagacaag gtgtcctcgg gagcagggtt 240; ccaccgggaa ggga gccctgtatc ggca ggcgggtttc tcttccgggg 246; tctctgctct tcag gacgaccccg ggacggctgt ggggccccac actgcaccca 252; tcta tgcgacaggg gcccaggaac agcctgaggc caccacccag caagcccgcc 258; ttatcaccca ttccagctca cccagaacct tcaccagcaa acctcctgct gaggtcctgg 264; caggaggcca ccgtcttgtt accgtttcct tttcgtttgc tgagggtcac agaccccaac 270; agggaaatca gtatctgtct tcccagtggt tgccctgctc gccgggcact ccacggggtc 276i ccgcccttgt gtgagatggg ccaggatcct tcggcaaggg gcgcctgggg ctggggctga 282- gcgg tggagcgcca gacagaaaag gattccaatg tcag gttaaagtca 288; gatgccacct accagggtct acagtcaaaa tgttggcttt ttcttatttt ttaatgtatg 294; ggagaaaaat gtaaaattcc agttcttttc taattgtgtt tctgaaatta ggagtcagct 300; gccagcgttt ttgtgtggct gcagtgtgcc cagc tcacgggcag tgggtggacc 306; taactgccca ggcaggcgag agctacttcc agagccttcc agtgcatggg agggcagggc 3121 taggtgtagc ggtgtctcct aatt aagaactatc tttcttgtag tgca 3181 cctgatgatg ctcc tctctgtgtt gtctgggccc ttgtttacaa gcacgcgtta 3241 ctga ggggagccat gctctagccc ctggagggcc tgttgcaggg gcagggcggg 3301 cccgtcgcct ttggcagctc ctggagagct gtggacatgc agtccccctc agttcgtgct 3361 gcaataaagg ccatcttctc ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3421 a n seguence: NCBI Reference Seguence: NP_055955.1 LOCUS NP_055955 ACCESSION NP_055955 1 meaergperr paersspgqt peegaqalae faalhgpalr asgvperng rllhklehev 61 fdagevfgim qveeveeeed eaarevrqu pnpgnelcyk vivtresglq aahpnsif1i 121 dhawtcrveh arqqquVPg llhrmanlmg iefhgelpst eavalvleem wkfnqtyq1a 181 hgtaeekmpv wyimdefgsr iqhadvpsfa tapffympqq vaytllwplr dldtgeevtr 241 dfaygetdpl irkcmllpwa ptdmldlssc tpeppaehyq aileenkekl pldinpvvhp 301 hghifkvytd lth prftltqsea dadilfnfsh fkdyrklsqe rpgvllanp 361 cenlltvkdc lasiarragg pegppwlprt lqu vsyqurerw gednhwickp 421 wnlarsldth vtkslhsiir hrestpkvvs kyiespvlfl redvgkkad iryivllrsv 481 vydv fwlrfsnraf alndlddyek hftvmnydpd vvlkqvhcee fipefequp 541 efpwtdvqae ifraftelfq vacakppplg lcdypssram yavdlmlkwd ngpdgrrvmq 601 pqilevnfnp ryhp tffndvfstl fldqugchv tclv FERMT2 Official Symbol: FERMT2 Official Name: fermitin family member 2 Gene ID: 10979 Organism: Homo sapiens Other Aliases: KIND2, MIG2, PLEKHC1, UN0112, UNC112B, mig-2 Other Designations: PH -containing family C member 1; fermitin family homolog 2; kindlin 2; kindlin—2; mitogen inducible gene 2 protein; mitogen- inducible gene 2 protein; pleckstrin homology domain containing, family C (with FERM domain) member 1; pleckstrin homology domain containing, family C member 1; pleckstrin homology domain-containing family C member Nucleotide seguence: NCBI Reference Seguence: NM_001134999.1 LOCUS NM_001 134999 ACCESSION NM_001 134999 1 gggtggagcg cggggagcca ggcgaggggc cgcgacgacg ggactccatt agccgctccg 6; gccacaggca gcgcttcgcc agccgaggaa Cnganng acaccgccgc cccgcgagcc 12; tccagcccct cgcctgttgc cgcgcgagtc ccgggcccgg agcgctagga gcgcgcggaa 18; ggagccatgg ctctggacgg gataaggatg ccagatggct gctacgcgga nggacgtgg 24; gaactgagtg tccatgtgac ggacctgaac gtca ccctgagagt gaccggcgag ; gtgcacattg gaggcgtgat gcttaagctg gtggagaaac tcgatgtaaa aaaagattgg 36; tctgaccatg ggtg ggaaaagaag agaacttggc ttctgaagac acattggacc 42; ttagataagt atggtattca ggcagatgct aagcttcagt tcacccctca gcacaaactg 48; ctccgcctgc ccaa catgaagtat gtgaaggtga aagtgaattt ctctgataga 54; gtcttcaaag ctgtttctga catctgtaag acttttaata tcagacaccc cgaagaactt 60; tctctcttaa agaaacccag agatccaaca aagaaaaaaa agaagaagct agatgaccag 66; tctgaagatg aggcacttga attagagggg cctcttatca gatc aggaagtata 72; tattcaagcc caggactgta tagtaaaaca atgaccccca cttatgatgc tcatgatgga 78; agccccttgt caccaacttc tgcttggttt agtg ctttgtcaga aggcaatcct 84L ggtatacttg ctgtcagtca accaatcacg tcaccagaaa tcttggcaaa aatgttcaag 90; cctcaagctc ttcttgataa agcaaaaatc aaccaaggat ggcttgattc ctcaagatct 96L ctcatggaac aagatgtgaa tgag gccttgctgc tcaa gtattacagc 102; ttttttgatt tgaatccaaa gtatgatgca atca ttta cgagcaggcc 108; aaatgggcca ttctcctgga agagattgaa gaag aagaaatgat gatgtttgca 114; gccctgcagt atcatatcaa taagctgtca acat cagagaatca tttgaacaac L20; agtgacaaag aagttgatga agttgatgct gccctttcag acctggagat tactctggaa L26; gggggtaaaa cgtcaacaat tttgggtgac attacttcca ttcctgaact ctac L32; attaaagttt tcaagccaaa aaagctgact ctgaaaggtt acaaacaata ttggtgcacc L38; ttcaaagaca catccatttc taag gaag aatccagtgg cacaccagct L44; catcagatga acctcagggg atgtgaagtt accccagatg taaacatttc aggccaaaaa L50; tttaacatta aactcctgat tccagttgca gaaggcatga atgaaatctg gcttcgttgt L56; gacaatgaaa aacagtatgc acactggatg gcagcctgca cctc caaaggcaag L62; gcgg ctta caacttagaa gttcagaata ttctttcctt acccagatcc tcagttaata ccagagcaga tcacgactga tgtctccccg ctatctaaaa aagtataaga acaagcagcc agagatttga cgag aatcttggag gcccatcaga atgtagctca attgaagcca agatgagatt tattcaagct tggcagtcac tacctgaatt tggcatcact -92; cacttcattg caaggttcca agggggcaaa gaac ttattggaat agactgattc ggatggatgc cagcactgga gatgcaatta aaacatggcg tttcagcaac 204; atgaaacagt ggaatgtcaa ctgggaaatc aaaatggtca ccgtagagtt tgcagatgaa 210; gtacgattgt ccttcatttg tactgaagta gattgcaaag atga attcattggt 216; ggctacatat ttctctcaac acgtgcaaaa gaccaaaacg agagtttaga tgaagagatg 222; ttctacaaac ttaccagtgg gtga ataggaatac tgtttaatga aactccacgg 228; ccataacaat atttaacttt aaaagctgtt tgttatatgc tgcttaataa agtaagcttg 234; aaatttatca ttttatcatg tctt tgccttacca gaccagttaa tatgtgcact 240; aaacaagcac gactattaat ctatcatgtt atgatataat gaat ttgtcacaca 246; ttccttaggg ccatgaattg aaaactgaaa gcaa atcaggaaca cact 252; gatttactga tttaagctag ccaaactgta agaaacaagc catctatttt aaagctatcc 258; agggcttaac ctatatgaac tctatttatc atgtctaatg catgtgattt aatgtatgtt 264; taatttgata tcatgtttta aaatatccta cttctggtag ccatttaatt cctcccccta 2701 cccccaaata aatcaggcat gcaggaggcc tgatatttag taatgtcatt gtgtttgacc 276; ttgaaggaaa atgctattag tccgtcgtgc ttgatttgtt tttgtccttg aataagcatg 282; atat tgtctcgtgt tttt acaccatatt gtattacact tttagtattc 288; accagcataa tcactgtctg cctaaaatat gcaactcttt caat atgaagtaaa 294; gttctatgaa gtatgcattt tgtgtaacta atgtaaaaac acaaatttta taaaattgta 300; cagtttttta aaaactactc acaactagca gatggcttaa atgtagcaat ctctgcgtta 306; tgcc tttaagagat ataattaacg tgcagtttta atatctacta aattaagaat 312; gacttcatta tgat ttgccacaat gtccttaact ctaatgcctg gactggccat 318; gttctagtct gttgcgctgt tacaatctgt gcta gtcagaaaat tcctagctca 324; catagcccaa aagggtgcga ggtg gattaccagt attgttcaat aatccatggt 330; tcaaagactg tataaatgca ttttatttta aataaaagca aaacttttat ttaataaaaa 336: aaaaaaaaaa aa Protein seguence: NCBI Reference Seguence: NP_001128471.1 LOCUS NP_001128471 ACCESSION NP_001128471 l maldgirmpd twel svhvtdlnrd vtlrvtgevh klve kldvkkdwsd 6; halwwekkrt wllkthwtld kygiqadakl qftpqhkllr lqlpnmkyvk vkvnfsdrvf 12; kavsdicktf nirhpeelsl lkkprdptkk kkkklddqse dealelegpl itpgsgsiys 18; spglysktmt ptydahdgsp lsptsawfgd salsegnpgi lavsqpitsp eilakmfkpq 24; alldkakinq gwldssrslm eqdvkeneal llrfkyysff dair inqueqakw ; ailleeiect eeemmmfaal sim tsenhlnnsd kevdevdaal sdleitlegg 36; ktstilgdit sipeladyik vfkpkkltlk gyquwctfk dtsiscyksk pahq 42; mnlrgcevtp dvnisqufn ikllipvaeg mneiwlrcdn equahwmaa crlaskgktm 48; adssynlevq nilsflkmqh lnpdpqlipe qittditpec lvsprylkky knkquyird 54; litarileah qnvaqmslie akmrfiqawq slpefgithf gkke eligiaynrl 60; irmdastgda iktwrfsnmk qwnvnweikm vtvefadevr lsfictevdc kvvhefiggy 6 6 ‘_ iflstrakdq nesldeemfy kltsng ANXA6 Official Symbol: ANXA6 Official Name: annexin A6 m: 309 Organism: Homo s Other Aliases: ANX6, CBP68 Other Designations: 67 kDa ctrin; CPB-ll; annexin V| (p68); annexin-6; calcium-binding protein p68; calelectrin; calphobindin II; calphobindin-II; chromobindin-20; rtin VI; p68; p70 Nucleotide seguence: NCBI Reference Seguence: NM_001155.4 LOCUS NM_001155 ACCESSION 155 1 agaggggtgg gqag ggaggcgggc gcgccggatt ggcctctgcg cgccacgtgt 6; ccggctcgga gcccacggct gtcctcccgg tccgccccgc gctgcggttg ctgctgggct 12; aacgggctcc gatccagcga gcgctgcgtc ctcgagtccc tgcgcccgtg cgtccgtctg 18; cgacccgagg cctccgctgc gcgtggattc tgctgcgaac cggagaccat ggccaaacca 24; gcacagggtg ccaagtaccg gggctccatc catgacttcc caggctttga ccccaaccag ; gatgccgagg ctctgtacac tgccatgaag ggctttggca gtgacaagga actg 36; gacataatca cctcacggag caacaggcag aggcaggagg tctgccagag ctacaagtcc 42; ctctacggca aggacctcat tgctgattta aagtatgaat tgacgggcaa gtttgaacgg 48; ttgattgtgg gcctgatgag gccacctgcc tattgtgatg ccaaagaaat taaagatgcc 54; atctcgggca ttggcactga tgagaagtgc ctcattgaga tcttggcttc ccggaccaat 60; gagcagatgc accagctggt ggcagcatac aaagatgcct acgagcggga cctggaggct 66; gacatcatcg gcgacacctc tggccacttc cagaagatgc ttgtggtcct gctccaggga 72L accagggagg aggatgacgt agtgagcgag gacctggtac aacaggatgt ccaggaccta 78L gcag gggaactgaa atggggaaca gatgaagccc agttcattta catcttggga 84L aatcgcagca agcagcatct ggtg ttcgatgagt atctgaagac gaag 90; ccgattgaag tccg aggggagctg tctggggact ttgagaagct ggcc 96; gtagtgaagt gtatccggag caccccggaa tattttgctg aaaggctctt caaggctatg 102; aagggcctgg ggga cctg atccgcatca tggtctcccg tagtgagttg L08; gacatgctcg acattcggga gatcttccgg accaagtatg agaagtccct ctacagcatg L14; atcaagaatg acacctctgg cgagtacaag aagactctgc tgaagctgtc tgggggagat L20; gatgatgctg ctggccagtt cttcccggag gcagcgcagg atca gatgtgggaa L26; cttagtgcag tggcccgagt agagctgaag ggaactgtgc gcccagccaa tgacttcaac L32; gcag aagc gctgcggaaa gccatgaagg gactcgggac tgacgaagac L38; acaatcatcg atatcatcac gcaccgcagc aatgtccagc ggcagcagat ccggcagacc L44; ttcaagtctc actttggccg ggacttaatg actgacctga agtctgagat ctggcaaggc tgattctggg gctcatgatg ccaccggccc attacgatgc caagcagttg L56; aagaaggcca tggagggagc agat gaaaaggctc ttattgaaat cggaccaatg ctgaaatccg ggccatcaat tata aggaggacta gatg ctctgagctc agacacatct ttca ggaggatcct gccacggggc atcgtgagga gggaggagaa aacctggacc aggcacggga gtggctgctg agatcttgga aatagcagac acacctagtg gagacaaaac ttccttggag L86; acacgtttca tgacgatcct gtgtacccgg agctatccgc acctccggag ag:cttccag L92; gagttcatca agatgaccaa ctatgacgtg gagcacacca tcaagaagga tggg L98; gatgtcaggg atgcatttgt ggccattgtt gtca agaacaagcc tc:cttcttt 204; aaac tttacaaatc catgaagggt acag atgagaagac tc:gaccagg 210; atcatggtat cccgcagtga gattgacctg ctcaacatcc ggagggaatt ca:tgagaaa 216; tatgacaagt ctctccacca tgag ggtgacacct ccggagactt cc:gaaggcc 222; ttgctggctc tctgtggtgg tgaggactag ggccacagct ttggcgggca ct:ctgccaa 228; gaaatggtta tcagcaccag ccgccatggc caagcctgat tgttccagct ccagagacta 234; aggaaggggc aggggtgggg ggaggggttg ggttgggctc ttatcttcag tggagcttag 240; gaaacgctcc cacg ggccatcgag ggcccagcac ggctgagcgg ctgaaaaacc 246; gtagccatag atcctgtcca cctccactcc cctctgaccc tcaggctttc ccagcttcct 252; ccccttgcta cagcctctgc cctggtttgg gctatgtcag aaac atcctgaacc 2581 tctgtctgta aaatgagtag tgtctgtact ttgaatgagg gggttggtgg caggggccag 264; ttgaatgtgc gggt ggtgggaagg atagtaaatg tgctggggca aactgacaaa 270; tcttcccatc catttcacca cccatctcca tccaggccgc gtac agga 276; atttggatgc ctgggttcaa atctgcatct gccatgcact tgtttctgac cttaggccag 282; cccctttccc tccctgagtc tctattttct tatctacaat gagacagttg gacaaaaaaa 2881 tcttggcttc ccttctaaca ttaacttcct aaagtatgcc tccgattcat tcccttgaca 2941 ctttttattt ctaaggaaga aataaaaaga gatacacaaa cacataaaca caaaaaaaaa 3001 aa Protein seguence: NCBI Reference Seguence: NP_001146.2 LOCUS NP_001146 ACCESSION NP_001146 l makpaqgaky fpgf dpnqdaealy tamkgfgsdk eaildiitsr snrqrqech 6; sykslygkdl iadlkyeltg kferlivglm dake ikdaisgigt dekclieiLa 12; srtnethql vaaykdayer dleadiigdt sghqumlvv eedd vvsedlqud 18; qulyeagel kwgtdeaqfi skqh lrlvfdeylk ttgkpieasi rgelsgdfek 24; kcir stpeyfaerl fkamkglgtr dntlirimvs rseldmldir eifrtkyeks ; lysmikndts geykktllkl sggdddaagq ffpeaaqvay qmwelsavar velkgtvrpa 36; ndfnpdadak alrkamkglg tdedtiidii thrsnvqrqq irqtfkshfg lkse 42; isgdlarlil glmmppahyd akqlkkameg agtdekalie naei raineayked 48; yhksledals sdtsghfrri lislatghre eggenldqar edaqvaaeil eiadtpsgdk 54; tsletrfmti phlr rquefikmt nydvehtikk emsgdvrdaf vaivqsvknk 60; plffadklyk smkgagtdek tltrimvsrs eidllnirre fiekydkslh qaiegdtsgd 66; alcg ged PSM D4 Official Symbol: PSMD4 Official Name: proteasome (prosome, macropain) 268 subunit, non-ATPase, 4 Gene ID: 5710 Organism: Homo sapiens Other Aliases: RP11-126K1.1, AF, AF-1, ASF, MCB1, an10, 85A, pUB-RS Other Designations: 268 proteasome non-ATPase regulatory subunit 4; 268 some regulatory subunit 35A; RPN10 homolog; SSa/antisecretory factor protein; angiocidin; antisecretory factor 1; multiubiquitin chain-binding protein Nucleotide ce: NCBI Reference Seguence: NM_002810.2 LOCUS NM_00281 0 ACCESSION NM_002810 l agga gttgttgtta ggccgtcccg gagacccggt cgggagggag gaaggtggca 6L agatggtgtt cact atggtgtgtg tggacaacag tgagtatatg ngaatggag 12; tacc caccaggctg caggcccagc aggatgctgt caacatagtt tgtcattcaa 18; agacccgcag caaccctgag aacaacgtgg gccttatcac actggctaat gactgtgaag 24; tgctgaccac actcacccca gacactggcc gtatcctgtc caagctacat actgtccaac ; ccaagggcaa gatcaccttc tgcacgggca tccgcgtggc ccatctggct ctgaagcacc 36; gacaaggcaa gaatcacaag atca ttgt gggaagccca gtggaggaca 42; atgagaagga tctggtgaaa ctggctaaac gcctcaagaa ggagaaagta gaca 48; attt tggggaagag gaggtgaaca cagaaaagct gacagccttt gtaaacacgt 54; tgaatggcaa agatggaacc ggttctcatc tggtgacagt gcctcctggg cccagtttgg 60; ctgatgctct catcagttct ccgattttgg ctggtgaagg tggtgccatg ctgggtcttg 66; gtgccagtga ctttgaattt ggagtagatc ccagtgctga tcctgagctg gccttggccc 72; ttcgtgtatc tatggaagag cagcggcagc ggcaggagga ggaggcccgg cgggcagctg 78; cagcttctgc tgctgaggcc gggattgcta cgactgggac ctca gacgatgccc 84; tgctgaagat gaccatcagc cagcaagagt ttggccgcac tcct gacctaagca 90; gtatgactga ggaagagcag attgcttatg ccatgcagat gcag ggagcagagt 96; ttggccaggc ggaatcagca gacattgatg ccagctcagc tatggacaca tctgagccag 1021 ccaaggagga ggatgattac gacgtgatgc aggaccccga gttccttcag agtgtcctag 108; tccc aggtgtggat cccaacaatg aagccattcg aaatgctatg ctgg 114; cctcccaggc ggac ggcaagaagg acaagaagga ggaagacaag aagtgagact 120; ggagggaaag ggtagctgag tctgcttagg ggactgcatg ggaagcacgg aatatagggt 126; tagatgtgtg ttatctgtaa ccattacagc ctaaataaag cttggcaact ttttgcttca aa Protein seguence: NCBI Reference Seguence: NP_002801.1 LOCUS NP_002801 ACCESSION NP_002801 1 mvlestmvcv dnseymrngd flptrlqaqq davnivchsk trsnpennvg dcev 61 1tt1tpdtgr ilsklhtvqp kgkitfctgi rvahlalkhr qgknhkmrii vedn 121 ekdlvklakr 1kkekvnvdi infgeeevnt ekltafvntl ngkdgtgshl vtvppgpsla 181 dalisspila geggamlglg asdfefgvdp sadpelalal rvsmeeqrqr qeeearraaa 241 giat tgtedsddal lkmtisqqef grtglpdlss mteeeqiaya mqmslqgaef 301 gqaesadida ssamdtsepa keeddydvmq dpeflqsvle nlpgvdpnne airnamgsla 361 sqatkdgkkd kkeedkk Official Symbol: COTL1 Official Name: coactosin—like 1 (Dictyostelium) Gene ID: 23406 Organism: Homo sapiens Other s: CLP Other Designations: coactosin-like protein Nucleotide ce: NCBI Reference Seguence: NM_021149.2 LOCUS NM_021149 ACCESSION NM_021149 1 cgcgctcgca gctcgcaggc gccgcgtagc cgtcgccacc gccgccagcc ccct 61 cggcgcgtac ccgccgcgct cccatccccg ccgccggcca cgct cggccgcccc 12; ggacagtgtc ccgctgcggc tccgcggcga tggccaccaa gatcgacaaa tgcc 18; gggcggcgta caacctggtg cgcgacgacg gctcggccgt catctgggtg acttttaaat 24; atgacggctc caccatcgtc cccggcgagc agggagcgga gtaccagcac ttcatccagc ; agtgcacaga tgacgtccgg gcct gctt caccaccggg gatgccatga 36; gcaagaggtc caagtttgcc ctcatcacgt ggatcggtga cagc gggctgcagc 42; gcgccaaaac cgggacggac aagaccctgg tgaaggaggt cgtacagaat ttcgctaagg 48L tgat cagtgatcgg aaggagctgg aggaagattt gagc gagctgaaga 54L aggcgggggg agccaattac gacgcccaga aacc ccagcccccg ccacaccacc 60; ccttgccaaa gtcatctgcc tgctccccgg gggagaggac cgccggcctc agctactagc 66L ccaccagccc accagggaga aaagaagcca tgagaggcag cgcccgccac cctgtgtcca 72; cagcccccac cttcccgctt cccttagaac cctgccgtgt cctatctcat gacgctcatg 78; cttt ctttgatctt ctttttcttt tctccccctc ttttttgttc taaagaaaag 84; tcattttgat gcaaggtcct gcctgccatc agatccgagg tgcctcctgc agtgacccct 90; tttcctggca tttctcttcc acgcgacgag gtctgcctag tgagatctgc atgacctcac 96; gttgctttcc agagcccggg cctattttgc catctcagtt ttcctggacc ctgcttcctg L02; tgtaccactg aggggcagct gggccaggag ctgtgcccgg tgcctgcagc cttcataagc L08; acacacgtcc attccctact caga cctcctggta tctgccccgg gctccctcat L14; cccacctcca tccggagttg cctaagatgc atgtccagca taggcaggat tgctcggtgg L20; tgagaaggtt aggtccggct cagactgaat aagaagagat aaaatttgcc ttaaaactta L26; cctggcagtg ctgc acggtctgaa accacctgtt cccaccctct tgaccgaaat L32; ttccttgtga cacagagaag ggcaaaggtc tgagcccaga gttgacggag ggagtatttc L38; agggttcact tcaggggctc ccaaagcgac aagatcgtta gggagagagg cccagggtgg L44; ggactgggaa tttaaggaga gctgggaacg gatcccttag gttcaggaag tgta L50; agctgcgagg atggcttggg ccgaagggtt gctctgcccg ccgcgctagc tgtgagctga :56: gcaaagccct gggctcacag aaaa ggct tcagtcctgc L621 cacattcaaa aggatcgttt tgttttgttt ttaaagaaag gtgagattgg cttggttctt L68; catgagcaca atag ctctttttct cctt ttcg ctgtattggg attgtaaaga acatctctgc actcagacag ttttgttttt cagaaaaaaa aaaaaaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_066972.1 LOCUS NP_066972 ACCESSION NP_066972 1 matkidkeac raaynlvrdd gsaviwvtfk ydgstivpge qgaethfiq qctddvrlfa 61 fvrfttgdam skrskfalit wigenvsglq raktgtdktl nfak efvisdrkel 121 eedfikselk ydaq te Official Symbol: ST13 Official Name: suppression of tumorigenicity 13 m: 6767 Organism: Homo sapiens Other Aliases: AAG2, FAM10A1, FAM10A4, HIP, HOP, HSPABP, 1, P48, PROO786, SNCG Other Designations: Hsp70—interacting protein; aging-associated protein 2; heat shock 70kD protein binding protein; hsc70-interacting protein; progesterone receptor-associated p48 protein; putative tumor suppressor ST13; renal carcinoma antigen NY—REN-33; suppression of genicity 13 protein tide seguence: NCBI Reference ce: NM_003932.3 LOCUS NM_003932 ACCESSION NM_003932 1 gggtgggagg agccagcggc cggggaggtt ctagtctgtt ctgtcttgcg gcagccgccc 61 ccttctgcgc ggtcacgccg agccagcgcc tgggcctgga accgggccgt agccccccca 121 gtttcgccca ccacctccct accatggacc cccgcaaagt gaacgagctt cgggcctttg 181 tgaaaatgtg ggat ccgagcgttc tgcacaccga ggaaatgcgc ttcctgaggg 241 agtgggtgga gagcatgggt ggtaaagtac cacctgctac tcagaaagct aaatcagaag ; aaaataccaa ggaagaaaaa cctgatagta agaaggtgga ggaagactta gacg 36; aaccatcaag tgaggaaagt gatctagaaa ttgataaaga aggtgtgatt gaaccagaca 42; ctgatgctcc aatg ggagatgaaa atgcggagat ggag atgatggatc 48; aggcaaatga taaaaaagtg gctgctattg aagccctaaa tgatggtgaa ctccagaaag 54; ccattgactt attcacagat gccatcaagc tgaatcctcg cttggccatt ttgtatgcca 60; agagggccag cgtc aaattacaga agccaaatgc tgccatccga gactgtgaca 66L gagccattga aataaatcct gattcagctc agccttacaa gtggcggggg aaagcacaca 72L gacttctagg ccactgggaa gaagcagccc atgatcttgc ctgt aaattggatt 78L atgatgaaga tgctagtgca aaag aacc acag aaaattgcag 84L aacatcgqag aaagtatgag cgaaaacgtg aagagcgaga gatcaaagaa agaatagaac 90; gagttaagaa ggctcgagaa gagcatgaga gagcccagag ggaggaagaa gccagacgac 96; agtcaggagc tggc tcttttccag gtggctttcc tgggggaatg cctggtaatt L02; ttcccggagg aatgcctgga gggg gcatgcctgg tgga atgcctggac L08; tcaatgaaat tcttagtgat ccagaggttc ttgcagccat gcaggatcca gaagttatgg L14; tggctttcca ggatgtggct cagaacccag caaatatgtc aaaataccag ccaa L20; tgaa tctcatcagt aaattgtcag ttgg aggtcaagcg taatgtcctt L26; ctgataaata aagcccttgc tgaaggaaaa gcaacctaga tcaccttatg gatgtcgcaa L32; aaac cagtgtacct ctgaccttct catcaagaga gctggggtgc tttgaagata L38; atccctaccc ctctccccca aatgcagctg aagcatttta cagtggtttg ccattagggt L44; attcattcag ataatgtttt cctactagga attacaaact ttaaacactt tttaaatctt L50; caaaatattt aaaacaaatt gcct gttaattctt atatttttct ttactaatca L56; ttttggattt ttttctttga attattggca gggaatatac ttatgtatgg aagattactg L62; ctctgagtga aataaaagtt attagtgcga ggcaaacata actcatttga ggataaagtt L68; tgtgttggat atgtggttcc tgatgcattt tgacttgtct ttttaaatgc tttatctttt L74; tctttaaaga tttatttcaa taaaactaat tgggaccacc cgtatttcag gaagtacttg gcagggcagc agcaatcttg ttga aggttgccct taaatcttac actgtggtga agggatgttt agttggagta cttagttctc ttgttgtcca gtatatctaa tcatattatt tccacgtaag ggaaataagg tagtactttt ctttttatat ttctatgctt 204; aaaattctct ttcctagtca aaaattgccc aaatctgtgt ttgctttctg cttgctacat 210; ttgtctccct tctt gagctaaaga caggcttttt ccaccggcat catcactgct 216; atcatcatta acagcgtaat tatacaagca tatttaatgc tgagtttaat ttaatatgta 222; atacatatgg taattgtagg gtaataccca caacaactgt agtttcttac ttggccaaga 228; ttat ttaagtgtta gacttccatt ctggcaaaat cttgccttat cagaagacat 234; tggaaagagg gattcccttt ggtgtttggt ctta gaaaaaccta ttgcagttag 240; tttatcttgt agtattcatc tttgtattct gaagataagg tttgaattaa attgatacac 2461 acagagggga accgattttt tttatccaat gtgaattata aatgagataa tccacagtta 2521 ttcattgtgg agttgttgag actatgaaag actcattgtc tttgtattca gctcttaaat 2581 agtgtaacta cacc tctgcttgct ttctttccct cccctccaat gataaagaaa 2641 atgataaatt ttctgttgtg cattcaattc ttattttaaa taagactaag tataggcatt 270; gtacctgaca ttgctacgtt tctaccagtg ttta aagtgctagt gtttaaaaac 276; attttcaagg gataaggcct tctgtacttt gcttatttga agaatcagtg gtaggagcag 282; tgaagtaaat tctatggagt acatttctaa aataccacat ttctgaaatc ataaataagt 288; ttattcaggt tctaaccctt tgctgtacac aagcagacag aaatgcatct gttacataaa 294; tgagaaaaag ctattatgct gatggagcat taaa aaaa acactcacca 300; tataaacttg catttgagct tgtgtgttct tttgttaatg tgtagagttc tcctttctcg 306; aaattgccag tgtgtacttg ctca agaacagttt cttctggatt ccttatttga 312; taac ctaattatat tctaatattg caaatattac cataagtggg taaa 318; attcctcttc tgaaaaaaaa aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_003923.2 LOCUS 923 ACCESSION NP_003923 1 mdprkvnelr akamckqdp svlhteemrf lrewvesmgg kvppatqkak seentkeekp 61 dskkveedlk adepsseesd leidkegvie pdtdapqemg denaeiteem mdqandkkva 121 dgel ftda iklnprlail vak lqkpnaaird cdraieinpd 181 saqpykwrgk ahrllghwee aahdlalack ldydedasam lkevqpraqk iaehrrkyer 241 kreereiker iervkkaree heraqreeea rrqsgaqygs fpggfpggmp gnfpggmpgm 301 gggmpgmagm pglneilsdp evlaaqupe vmvafqdvaq kyqs npkvmnlisk 361 lsakfggqa Gene OfibblSwnmfi:SRSF2(aBoknownasSFR32) Official Name: serine/arginine-rich ng factor 2 Gene ID: 6427 Organism: Homo sapiens Other Aliases: PR264, 80-35, 8035, SFRSZ, SFRSZA, SRpBOb Other Designations: SR splicing factor 2; splicing component, 35 kDa; splicing factor SC35; splicing factor, arginine/serine-rich 2 Nucleotide seguence: NCBI Reference Seguence: NM_001195427.1 LOCUS 195427 ACCESSION NM_001195427 l agaaggtttc atttccgggt ggcg ccattttgtg aggagcgata taaacgggcg 6; cagaggccgg ctgcccgccc agttgttact caggtgcgct agcctgcgga gcccgtccgt 12; gctgttctgc ggcaaggcct ttcccagtgt ccccacgcgg actg cctgagaggc 18; tcgc accgcccaga gctgaggaag ccggcgccag ttcgcggggc tccgggccgc 24; cactcagagc tatgagctac ggccgccccc atgt ggagggtatg acctccctca ; aggtggacaa cctgacctac tcgc ccgacacgct gaggcgcgtc ttcgagaagt 36; acgggcgcgt cggcgacgtg tacatcccgc gggaccgcta caccaaggag tcccgcggct 42; tcgccttcgt tcgctttcac gacaagcgcg acgctgagga cgctatggat gccatggacg 48; gggccgtgct ggacggccgc gagctgcggg tgcaaatggc gcgctacggc cgccccccgg 54; actcacacca cagccgccgg ggaccgccac cccgcaggta cgggggcggt ggctacggac 60; gccggagccg cagccctagg cggcgtcgcc gcagccgatc tcgg agccgttcca 66; ggtctcgcag ccgatctcgc tacagccgct cgaagtctcg gtcccgcact cgttctcgat 72; cgac ctccaagtcc agatccgcac gaaggtccaa gtcc tcgtcggtct 78; ccagatctcg ttcgcggtcc cggt ctcggtccag gagtcctccc ccagtgtcca 84; agagggaatc caaatccagg tcgcgatcga agagtccccc caagtctcct gaagaggaag 90; gagcggtgtc ctcttaagaa aatgatgtat cggcaagcag tgtaaacgga ggacttgggg 96; aaaaaggacc acatagtcca tcgaagaaga gtccttggaa caagcaactg gctattgaaa L02; aggttatttt gtaacatttg tctaactttt ttta agctttgcct cagttggcaa L08: tttt atgtgccatt ttgttgctgt tattcaaatt tcttgtaatt aacgacttca gatttcatta ttggatttgg atatttgagg taaaatttca tata L20; tagtgctgac tttttttgtt tgaaattaaa ggta acctaatttg cctg 1261 acttttaagg aaaacgtgtg cagccattac acacagccta aagctgtcaa gagattgact L32; cggcattgcc ttcattcctt aaaattaaaa acctacaaaa gttggtgtaa atttgtatat L38; tacc ttcagatcta aatggtaatc caaa tttgtataaa gacttttcag L44; agac tttt gaaaggattg tttatcaaac acaattctaa tctcttctct L50; tatgtatttt tgtgcactag gcgcagttgt gtagcagttg agtaatgctg gttagctgtt L56; aaggtggcgt gttgcagtgc agagtgcttg gctgtttcct gttttctccc gattgctcct L62; gtgtaaagat gccttgtcgt gcagaaacaa atggctgtcc agtttattaa aatgcctgac L68; aactgcactt ccagtcaccc gggccttgca tataaataac ggagcataca gtgagcacat L74; ctagctgatg ataaatacac ctttttttcc ctcttccccc taaaaatggt aaatctgatc L80; atatctacat gtatgaactt aacatggaaa atgttaagga agcaaatggt tgtaactttg L86; taagtactta taacatggtg tatctttttg cttatgaata ttctgtatta taaccattgt L92; ttctgtagtt taattaaaac attttcttgg tgttagcttt tctcagaaaa aaaaaaaaaa L98; aaaaaaaaaa aaaaaaaaaa aaaaaaaa n seguence: NCBI Reference ce: NP_001182356.1 LOCUS NP_001 182356 ACCESSION NP_001 182356 l msygrpppdv egmtslkvdn ltyrtspdtl rrvfekygrv gdvyiprdry tkesrgfafv 61 rfhdkrdaed amdamdgavl dgrelrvqma rygrppdshh srrgppprry ggggygrrsr 121 sprrrrrsrs rsrsrsrsrs rsrysrsksr srtrsrsrst sksrsarrsk sksssvsrsr 181 srsr spppvskres ksrsrskspp kspeeegavs s HNRNPH1 Official Symbol: HNRNPH1 Official Name: heterogeneous nuclear ribonucleoprotein H1 (H) m: 3187 Organism: Homo sapiens Other Aliases: Other Designations: heterogeneous nuclear ribonucleoprotein H tide seguence: NCBI Reference Seguence: NM_001257293.1 LOCUS 257293 ACCESSION NM_001257293 1 acaagggacc ttatttaggt tgcgcaggcg cccgctggcc atttcgtctt agccacgcag 61 aagtcgcgtg tctaggtgag tcgcggtggg tcctcgcttg cagttcagcg accacgtttg 12; tttcgacgcc cgta agagacgatg atgttgggca cggaaggtgg agagggattc 18; gtggtgaagg tccggggctt gccctggtct tgctcggccg atgaagtgca gaggtttttt 24; tctgactgca aaaa tggggctcaa ggtattcgtt tcatctacac cagagaaggc ; agaccaagtg gcgaggcttt actt gaatcagaag tcaa attggccctg 36; aaaaaagaca ctat gggacacaga tatgttgaag tattcaagtc aaacaacgtt 42; gaaatggatt gggtgttgaa gcatactggt ccaaatagtc ctgacacggc caatgatggc 48L tttgtacggc ttagaggact tccctttgga tgtagcaagg aagaaattgt tcagttcttc 54L tcagggttgg aaatcgtgcc aaatgggata acattgccgg tggacttcca ggggaggagt 60; acgggggagg ccttcgtgca gtttgcttca caggaaatag ctgaaaaggc tctaaagaaa 66; cacaaggaaa gaatagggca caggtatatt gaaatcttta gtag agctgaagtt 72; catt atgatccacc acgaaagctt atggccatgc agcggccagg tccttatgac 78; agacctgggg gagg gtataacagc attggcagag gagctggctt tgagaggatg 84; ggtg cttatggtgg tgga ggctatgatg attacaatgg ctataatgat 90; ggctatggat ttgggtcaga tagatttgga agagacctca attactgttt ttcaggaatg 96; tctgatcaca ggga tggtggctct actttccaga gcacaacagg acactgtgta L02; cacatgcggg gattacctta cagagctact gagaatgaca tttataattt ctcaaccctg tgagagtaca cattgaaatt ggtcctgatg gcagagtaac L14; gatgtcgagt tcgcaactca tgaagatgct gtggcagcta tgtcaaaaga caaagcaaat L20; atgcaacaca gatatgtaga actcttcttg aattctacag caggagcaag gatatgtaga actcttcttg aattctacag caggagcaag aaatgatggg aggcatgggc ttgtcaaacc gcta cgggggccca L38; gccagccagc agctgagtgg gggttacgga ggcggctacg agag cagcatgagt L44; ggatacgacc aagttttaca ggaaaactcc agtgattttc aatcaaacat tgcataggta L50; gagc agtgaacagc agctactaca gtagtggaag ccgtgcatct atgggcgtga L56; acggaatggg agggttgtct agcatgtcca gtatgagtgg tggatgggga atgtaattga L62; tcgatcctga tcactgactc ttggtcaacc tttttttttt tttt aaga L68; aaacttcagt ttaacagttt taca agcttgtgat ttatgcttac tgga L74; aatcaggatt gttatgaaga cttaaggccc agtatttttg aatacaatac tcatctagga L80; tgtaacagtg aagctgagta aactataact gttaaactta agttccagct tttctcaagt L86; tagttatagg atgtacttaa gcagtaagcg tatttaggta aaagcagttg aattatgtta L92; aatgttgccc tttgccacgt taaattgaac actgttttgg atgcatgttg aaagacatgc L98; ttttattttt ttgtaaaaca atataggagc tgtgtctact attaaaagtg aaacattttg 204; gcatgtttgt taattctagt ttcatttaat aacctgtaag gcacgtaagt ttaagctttt 210; ttttttttta agttaatggg aaaaatttga gacgcaatac caatacttag gattttggtc 216; ttggtgtttg tatgaaattc tgaggccttg atttaaatct ttcattgtat ttcc 222; ttttaggtat attgcgctaa gtgaaacttg aaat cctcctttta aaaactgcaa 228; aaaaaaaaaa aaaaaaaaaa aaaa Protein seguence: NCBI Reference ce: NP_001244222.1 LOCUS NP_001 244222 ACCESSION NP_001244222 1 mmlgteggeg glpw scsadevqrf fsdckiqnga ytre grpsgeafve 6; lesedevkla lkkdretmgh ryvevfksnn vemdwvlkht gpnspdtand gfvrlrglpf 12; gcskeeivqf fsgleivpng fqgr stgeaqufa sqeiaekalk khkerighry 18; ieifkssrae vrthydpprk lmamqrpgpy drpgagrgyn sigrgagfer mrrgaygggy 241 qqyddynqyn dgygfgsdrf grdlnycfsg msdhrygdgg stquttghc vhmrglpyra ; tendiynffs plnpvrvhie igpdgrvtge advefathed avaamskdka nmthyvelf 361 1nstagasgg ayehryvelf lnstagasgg aygsqmmggm glanssygg pasqqlsgqy 42; gggyggqssm sgydqvlqen ia Gene Official Symbol: IQGAP1 Official Name: IO motif containing GTPase activating protein 1 Gene ID: 8826 Organism: Homo sapiens Other Aliases: A01, SAR1, p195 Other Designations: RasGAP-like with IQ motifs; ras GTPase-activating-like protein IQGAPf tide seguence: NCBI Reference Seguence: NM_003870.3 LOCUS NM_003870 ACCESSION NM_003870 1 ggaccccggc aagcccgcgc acttggcagg agctgtagct accgccgtcc gcgcctccaa 61 ggtttcacgg cagc agagactcgg gctcgtccgc catgtccgcc gcagacgagg 121 ttgacgggct gggcgtggcc cggccgcact atggctctgt cctggataat gaaagactta 181 ctgcagagga gatggatgaa aggagacgtc agaacgtggc ttatgagtac catt 241 tggaagaagc gaagaggtgg atggaagcat gcctagggga agatctgcct cccaccacag ; aactggagga tagg aatggggtct accttgccaa actggggaac ttcttctctc 36; ccaaagtagt gtccctgaaa aaaatctatg atcgagaaca gaccagatac aaggcgactg 42; gcctccactt tagacacact gataatgtga ttcagtggtt gaatgccatg gatgagattg 48; ctaa gattttttac ccagaaacta cagatatcta tgatcgaaag ccaa 54; gatgtatcta ctgtatccat gcactcagtt tgtacctgtt caagctaggc ctggcccctc 60; agattcaaga cctatatgga aaggttgact tcacagaaga agaaatcaac aacatgaaga 66L ctgagttgga gaagtatggc atccagatgc ctgcctttag caagattggg ggcatcttgg 72; ctaatgaact gtcagtggat gaagccgcat tacatgctgc tgttattgct attaatgaag 78L ctattgaccg tagaattcca gccgacacat ttgcagcttt gaaaaatccg aatgccatgc 84L ttgtaaatct tgaagagccc ttggcatcca cttaccagga ttac caggctaagc 90; aggacaaaat gacaaatgct aaaaacagga cagaaaactc agagagagaa agagatgttt 96; agct gctcacgcaa gctgaaattc aaggcaatat aaacaaagtc aatacatttt L02; ctgcattagc aaatatcgac ctggctttag aacaaggaga tgcactggcc ctctgcagtc accagccctg gggcttcgag gactgcagca acagaatagc gactggtact L14; tgaagcagct cctgagtgat aaacagcaga agagacagag tggtcagact gaccccctgc L20; agaaggagga gctgcagtct ggagtggatg acag tgctgcccag caatatcaga L26; gaagattggc agca ctgattaatg ctgcaatcca tgtt aaga L32; ctgttttgga actgatgaat cccgaagccc agctgcccca ggtgtatcca gccg L38; atctctatca gaaggagctg gctaccctgc agcgacaaag tcctgaacat aatctcaccc L44; acccagagct ctctgtcgca atgt tgtcatcggt ggccctgatc gcat L50; tggaatcagg agatgtgaat acagtgtgga tgag cagttcagtt actggtctta L56; ccaatattga ggaagaaaac tgtcagaggt atctcgatga gttgatgaaa ctgaaggctc L62; aggcacatgc agagaataat gaattcatta catggaatga tatccaagct tgcgtggacc L68; atgtgaacct ggtggtgcaa gaggaacatg tttt agccattggt ttaattaatg L74; tgga tgaaggtgat gcccaaaaga ctctgcaggc cctacagatt cctgcagcta L80; aacttgaggg agtccttgca gaagtggccc agcattacca agacacgctg agagagagaa agcccaggaa atccaggatg agtcagctgt gttatggttg aaggtggaat gtcc aacaaagaca cccaagaagc acagaagttt tctttgccat taatgaggca gtagaaagtg gtgatgttgg caaaacactg agtgcccttc 204; gctcccctga tgttggcttg tatggagtca tccctgagtg tggtgaaact taccacagtg 210; atcttgctga agccaagaag aaaaaactgg cagtaggaga taataacagc aagtgggtga 216; agcactgggt aaaaggtgga tattattatt accacaatct ggagacccag gaaggaggat 222; gggatgaacc tccaaatttt aatt ctatgcagct ttctcgggag gagatccaga 228; gttctatctc tggggtgact gccgcatata accgagaaca gctgtggctg gccaatgaag 234; tcac caggctgcag gctcgctgcc gtggatactt agttcgacag gaattccgat 240; tgaa tttcctgaag aaacaaatcc ctgccatcac ctgcattcag tcacagtgga 246: gaggatacaa gcagaagaag gcatatcaag atcggttagc ttacctgcgc tcccacaaag 252; atgaagttgt aaagattcag tccctggcaa ggatgcacca agctcgaaag cgctatcgag 258; atcgcctgca gtacttccgg gaccatataa atgacattat caaaatccag gcttttattc 264; gggcaaacaa agctcgggat gactacaaga ctctcatcaa tgctgaggat cctcctatgg 270; ttgtggtccg aaaatttgtc cacctgctgg accaaagtga ccaggatttt caggaggagc 276; ttat gaagatgcgg gaagaggtta tcaccctcat tcgttctaac cagcagctgg 282; agaatgacct caatctcatg aaaa ttggactgct agtgaaaaat aagattacgt 288; tgcaggatgt ggtttcccac agtaaaaaac ttaccaaaaa aaataaggaa cagttgtctg 294; atatgatgat gataaataaa cagaagggag gtctcaaggc tttgagcaag gagaagagag 300; agaagttgga agcttaccag cacctgtttt atttattgca aaccaatccc acctatctgg 306; ccaagctcat ttttcagatg ccccagaaca agtccaccaa gttcatggac tctgtaatct 312; tcacactcta caactacgcg tccaaccagc gagaggagta cctgctcctg cggctcttta 318; cact ccaagaggaa atcaagtcga aggtagatca gattcaagag acag 324; gaaatcctac taaa atggttgtaa gtttcaaccg tggtgcccgt ggccagaatg 330; ccctgagaca gatcttggcc ccagtcgtga aggaaattat ggatgacaaa tctctcaaca 336; tcaaaactga ccctgtggat atttacaaat cttgggttaa tcagatggag tctcagacag 342; gagaggcaag caaactgccc tatgatgtga cccctgagca ggcgctagct catgaagaag 348; tgaagacacg gctagacagc tccatcagga acatgcgggc tgtgacagac ctct 354: ttgt cagctctgtg gacaaaatcc cttatgggat gcgcttcatt gccaaagtgc 3601 tgaaggactc gttgcatgag aagttccctg atgctggtga ggatgagctg ctgaagatta 366; ttggtaactt ttat cgatacatga ccat tcct gatgcctttg 372; acatcattga agca ggaggccagc caga ccaacgccga ggct 378; ccattgcaaa aatgcttcag catgctgctt ccaataagat gtttctggga gataatgccc 384; acttaagcat cattaatgaa tatctttccc agtcctacca caga cggtttttcc 390; aaactgcttg tgatgtccca gagcttcagg ataaatttaa tgtggatgag tactctgatt 396; ccct acca gtaatctaca ttgg tgaaatcatc aacacccaca 402; ctctcctgtt ggatcaccag gatgccattg ctccggagca caatgatcca atccacgaac 408; acga cctcggcgag acca ccct gataggggaa agctctggca 414; atttaaatga cccaaataag gaggcactgg ctaagacgga agtgtctctc accctgacca 420; acaagttcga cgtgcctgga gatgagaatg cagaaatgga tgctcgaacc atcttactga 426; atacaaaacg tttaattgtg gatgtcatcc ggttccagcc gacc ttgactgaaa 432; aaac accagccacc cagg aagcagaaca tcagagagcc atgcagagac 438; gtgctatccg tgatgccaaa acacctgaca agatgaaaaa gtcaaaatct gtaaaggaag 444; acagcaacct cactcttcaa gagaagaaag agaagatcca gacaggttta aagaagctaa 450; cagagcttgg aaccgtggac ccaaagaaca aataccagga actgatcaac gacattgcca 456; ttcg gaatcagcgg aggtaccgac agaggagaaa ggccgaacta gtgaaactgc 462; aacagacata Cgctgctctg aactctaagg ccacctttta tggggagcag gtggattact 468; ataaaagcta tatcaaaacc tgcttggata acttagccag caagggcaaa gtctccaaaa 474; agcctaggga aatgaaagga aagaaaagca aaaagatttc tctgaaatat acagcagcaa 480; gactacatga aaaaggagtt cttctggaaa ttgaggacct gcaagtgaat cagtttaaaa 486; atgttatatt tgaaatcagt ccaacagaag aagttggaga cttcgaagtg aaat 492; gagt ggag acttttatgt tacattatca ggacctgctg cagctacagt 498; atgaaggagt tgcagtcatg aaattatttg atagagctaa agtaaatgtc aacctcctga 504; tcttccttct caacaaaaag ttctacggga agtaattgat ctgc cagcccagaa 510; ggatgaagga aagaagcacc tcacagctcc tttctaggtc cttctttcct cattggaagc 516; aaagacctag ccaacaacag cacctcaatc tgatacactc ccgatgccac atttttaact 522; cctctcgctc tgatgggaca tttgttaccc ttttttcata gtgaaattgt ggct 528; tagtctgacc tttctggttt cttcattttc ttccattact taggaaagag tggaaactcc 534; actaaaattt ctctgtgttg ttacagtctt agaggttgca gtactatatt gtaagctttg 5401 gtgtttgttt aattagcaat agggatggta ggattcaaat gtgtgtcatt tagaagtgga 546; tagc accaatgaca taaatacata caagacacac aactaaaatg tcatgttatt 552; tatt aggttgtcat ttaaaaataa agttccttta tatttctgtc ccatcaggaa 558; aactgaagga tatggggaat cattggttat cttccattgt gtttttcttt atggacagga 564; gctaatggaa gtgacagtca tgttcaaagg aagcatttct agaaaaaagg agataatgtt 570; tttaaatttc attatcaaac ttgggcaatt ctgtttgtgt aactccccga ctagtggatg 576; ggagagtccc attgctaaaa ttcagctact cagataaatt cagaatgggt caaggcacct 582; gcctgttttt gttggtgcac agagattgac ttgattcaga attc actccatccc 588; tatggcagag gaatgggtta gccctaatgt agaatgtcat tgtttttaaa actgttttat 594; atcttaagag tgccttatta aagtatagat gtatgtctta aaatgtgggt aatt 600; attt atataatgca gcct tagaataaga aaagcttttt ttaaattgct 606: ttatctgtat atctgaactc ttgaaactta tagctaaaac actaggattt atctgcagtg 612; ttcagggaga taattctgcc tgtc taaaacaaaa acaaaaccag ccaacctatg 618; ttacacgtga gattaaaacc ttcc ccattttttc tccttttttc tcttgctgcc 6241 cacattgtgc ttta ccag ttttctgggc ttagtttaaa aaaaaaatca 630; agtctaaaca ttgcatttag aaagcttttg ttcttggata cata cactttaaaa 636; aaaaaaaaaa ctttttccag gaaaatatat tgaaatcatg ctgctgagcc tctattttct 642; ttctttgatg ttttgattca gtattctttt atcataaatt attt aaaaattcac 648; tgatgtacat taagccaata aactgcttta atgaataaca gtag tgtgtcccta 654; atgc attggagaag tatg agactcttta ctcaggtgca cagc 660; ccacagggag gcatggagtg ccatggaagg attcgccact acccagacct tgttttttgt 666; tgtattttgg aagacaggtt ttttaaagaa acattttcct cagattaaaa gatgatgcta 672; ttacaactag cattgcctca aaaactggga ccaaccaaag tgtgtcaacc ctgtttcctt 678; aaaagaggct atgaatccca aaggccacat ccaagacagg caataatgag cagagtttac 684; agctccttta ataaaatgtg tcagtaattt taaggtttat agttccctca acacaattgc 690; taatgcagaa tagtgtaaaa tgcgcttcaa gaatgttgat gatgatgata tagaattgtg 696; gctttagtag cacagaggat gccccaacaa actcatggcg ttgaaaccac acagttctca 702; ttactgttat ttattagctg tagcattctc tgtctcctct ctctcctcct ttct 708; cctcgaccag ccatcatgac atttaccatg aatttacttc ctcccaagag tttggactgc 714; ccgtcagatt gttgctgcac atagttgcct ttgtatctct gtatgaaata aaaggtcatt 720; tgttcatgtt aaaaaaaaa Protein seguence: NCBI Reference Seguence: 861.1 LOCUS NP_003861 ACCESSION NP_003861 l msaadevdgl gvarphygsv taee mderrrqnva yeylchleea krwmeaclge 6; dlppttelee glrngvylak lgnffspkvv slkkiydreq trykatglhf rhtdnviqwl 12; namdeiglpk ifypettdiy drknmprciy cihalslylf klglapqiqd lygkvdftee 18; einnmktele kygiqmpafs anel svdeaalhaa viaineaidr ripadtfaal 24; knpnamlvnl eeplastyqd ilyqakqdkm tnaknrtens ererdvyeel qgni ; nkvntfsala nidlalqud alalfralqs palglrgqu qnsdwylkql lsquqqus 36; gqtdplqkee lqsgvdaans aaqqurrla avalinaaiq kgvaektvle lmnpeaqlpq 42; vypfaadlyq kelathrqs pehnlthpel lssv alinralesg dvntvwkq;s 48; ssvtgltnie eencqrylde lmklkaqaha twnd iqacvdhvnl vvqeeheril 54; aiglineald egdaqktha lqipaakleg vlaevaqhyq dtlirakrek esav 60; lwldeiqggi qunkdtqea qkfalgifai neavesgdvg ktlsalrspd vglygvipec 66; getyhsdlae akkkklavgd nnskkahwv kggyyyyhnl etqeggwdep pnqunsmql 72; ssis gvtaaynreq lwlaneglit rlqarcrgyl vrqefrsrmn flkkqipait 78; ciqsqwrgyk qkkayqdrla ylrshkdevv rmhq drlq yfrdhindii 84; kiqafirank arddyktlin aedppmvvvr kfvhlldqsd qdfqeeldlm kmreevit;i 90; ranqlendl igll vknkithdv vshskkltkk nkeqlsdmmm inqugglka 96; lskekrekle athlfyllq tnptylakli fqmpanstk fmdsviftly nyaanreey ;02; lllrlfktal kvdq gnpt vikmvvsfnr gargqnaqu ilapvvkeim ;08; ddkslniktd pvdiykswvn qmesqtgeas klpydvtpeq vktr ldssirnmra ;14; vtdkflsaiv ssvdkipygm rfiakvlkds lhekfpdage dellkiignl npai ;20; vapdafdiid lsaggqlttd qrrnlgsiak mlqhaasnkm flgdnahlsi qsyq ;26; kfrrffqtac dvpelqdkfn vdeysdlvtl tkpviyisig eiinthtlll dhqdaiapeh ;32; ndpihelldd ;gevptiesl igessgnlnd pnkealakte vsltltnkfd vpgdenaemd ;38; artillntkr ;ivdvirfqp getlteilet patseqeaeh qramqrrair daktpdkmkk ;44; sksvkedsnl thekkekiq tglkkltelg tvdpknkyqe lindiardir nqrryrqrrk ;50; aelvquqty aalnskatfy gequyyksy iktcldnlas kgkvskkpre kgvlleiedl qvanknvif eispteevgd fmgv avmklfdrak vnvnllifll nkkfygk Official Symbol: TECR (also known as GPSN2) Official Name: trans-2,3—enoyl-CoA ase Qfiflfiifl2:9524 Organism: Homo sapiens Other Aliases: GPSN2, MRT14, 802, TER Other Designations: glycoprotein, ic 2; ic glycoprotein 802 Nucleotide seguence: NCBI Reference Seguence: NM_138501.5 LOCUS NM_1 38501 ACCESSION NM_138501 XM_001132190 XM_001132196 l ggaggggcgg ggcggacgca gagccgcgtt tagtctatcg ctgcggttgc gagcgctgta 6; gggagcctgt ccgc gcagttaggc agcagcagcc gcggagcagt agccgccgtg 12; ggagggagcc atgaagcatt acgaggtgga gattctggac gcaaagacaa gggagaagct 18; gtgtttcttg gacaaggtgg agccccacgc caccattgcg gagatcaaga acctcttcac 24; taagacccat ccgcagtggt accccgcccg ccagtccctc cgcctggacc ccaagggcaa ; gtccctgaag gatgaggatg ttctgcagaa gctgcccgtg acgg tgta 36; cttccgggac ctgggggccc agatcagctg ggtgacggtc ttcctaacag agtacgcggg 42; gccccttttc ctgc tcttctactt ccgagtgccc ttcatctatg gccacaaata 48; tgactttacg tccagtcggc atacagtggt gcacctcgcc tgcatctgtc actcattcca 54; caag cgcctgctgg agacgctctt cgtgcaccgc ttctcccatg gcactatgcc 60; tttgcgcaac atcttcaaga actgcaccta gggc ttcgccgcgt ggatggccta 66; caat caccctctct acactccccc tacctacgga gctcagcagg tgaaactggc 721 gctcgccatc tttgtgatct gccagctcgg caacttctcc atccacatgg ccctgcggga 78L cctgcggccc gctgggtcca agacgcggaa gatcccatac cccaccaaga accccttcac 84L gtggctcttc ctgctggtgt cctgccccaa ctacacctac gggt cctggatcgg 90; tttcgccatc atgacgcagt gtctcccagt ggccctgttc tccctggtgg gcttcaccca 96; gatgaccatc tgggccaagg gcaagcaccg cagctacctg ttcc gggactaccc 102; gcccctgcgc atgcccatca tccccttcct gctctgagcg ctcacccctg ctgaggctca 1081 gcccctcaac ccggtggcat ggag gagtggggcc cacagctctc cagcacccgg 1141 aataaagccc gcctgcccca gtcggaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_612510.1 LOCUS NP_612510 ACCESSION NP_612510 132190 XP_001132196 l mkhyeveild aktreklcfl dkvephatia tkth pquparqsl rldpkgkslk 61 dedvlqklpv gttatlyfrd lgaqiswvtv flteyagplf frvp fiyghkydft 121 ssrhtvvhla cichsfhyik rlletlfvhr fshgtmplrn ifknctyng faawmayyin 181 hplytpptyg aquklalai fvicqlgnfs ihmalrdlrp agsktrkipy twlf 241 llvscpnyty evgswigfai mtqclpvalf slvgftqmti wakgkhrsyl kefrdypplr 301 mpiipfll Official Symbol: EHD2 Official Name: EH-domain containing 2 m: 30846 Organism: Homo sapiens Other Aliases: PAST2 Other Designations: EH domain containing 2; EH domain-containing protein 2; PAST homolog 2 Nucleotide seguence: NCBI Reference Seguence: NM_014601.3 M NM_014601 ACCESSION NM_014601 1 ttttgagggg ggcgcctcgt cccgcccctc cctcctgtcc tccctcccgt cctccccgct 6; ccgggcccca cccggctcag acggctccgg acgggaccgc gagcacaggc cgctccgcgg 12; gcgcttcgga cggg accccaccct gcct gcccagcccg ctgcagccgc 18; cagcgcgccc cgtcggcagc tctccatctg cacgtctctc cgtgaacccc gtgagcggtg 24; tgcagccacc atgttcagct ggctgaagcg ggca cggggccagc agcccgaggc ; catccgcacg gtgacctcgg ccctcaagga gctgtaccgc acgaagctgc tgccgctgga 36; ggagcactac Cgctttgggg ccttccactc gccggccctg gaggacgcag acgg 42L caagcccatg gtgctggtgg ccggccagta cagcacgggc aagaccagct tcatccagta 48L cctgctgqag gtgc ccggctcccg cgtggggcct gagcccacca ccgactgctt 54L tgtggccgtc atgcacgggg acactgaggg gccc ggcaacgccc tcgtcgtgga 60; cccggacaag cccttccgca aactcaaccc tttcggaaac accttcctca acaggttcat 66; ccag ctccctaatc aggtcctgga cagc atcatcgaca ccccgggtat 72; cctgtcgggt gccaagcaga gagtgagccg cggctacgac ttcccggccg tgctgcgctg 78; gttcgcggag Cgcgtggacc tcatcatcct gctctttgat gcgcacaagc tggagatctc 84; ggacgagttc tcagaggcca tcggcgcgtt gcggggccat gaggacaaga tccgcgtggt 90; gctcaacaag gccgacatgg tggagacgca gcagctgatg cgcgtctacg gcgcgctcat 96; gtgggcgctg ggcaaggtgg tgggcacgcc Cgaggtgctg taca tcggctcctt L02; ctggtcccag ctcg tgcccgacaa ccggcgcctc ttcgagctgg aggagcagga L08; cctcttccgc cagg cccg gcacgcagcc ttgcgcaagc tcaacgacct L14; ggtgaagagg gcccggctgg tgcgagttca cgcttacatc tacc tgaagaagga L20; gatgccctct gtgtttggga aggagaacaa gaagaagcag ctgatcctca ccgt L26; tgcg aagattcagc tggaacatca catctcccct ggggactttc ctgattgcca L32; gaaaatgcag ctga tggcgcacga cttcaccaag tttcactcgc tgaagccgaa L38; gctgctagag gcactggacg agatgctgac gcacgacatc gccaagctca tgcccctgct L44; gcggcaggag gagctggaga gcaccgaggt gggcgtgcag gggggcgctt ttgagggcac L50: ccacatgqgc ccgtttgtgg agcggggacc tgacgaggcc atggaggacg gcgaggaggg L561 ctcggacgac gaggccgagt gggtggtgac caaggacaag tccaaatacg acgagatctt L62; ctacaacctg gcgcctgccg acggcaagct gagcggctcc aaggccaaga ggggaccaag ctccccaact cagtgctggg gcgcatctgg aagctcagcg cgacggcatg ctggatgatg aggagttcgc gctggccagc cacctcatcg aggccaagct L80; ggaaggccac gggctgcccg ccaacctgcc ccgtcgcctg gtgccaccct ccaagcgacg 186; ccacaagggc tccgccgagt gagccggccc ccctcccatg gccctgctgt ggctccccag 192; tcgg ctgcacgcac gctc cggctcacac tgcc tgccctccct 198; ctgt aaggaccggg ggtctccctc ctcactaccg ccagacaccc ngtggaagc 204; atttagaggg ggga gggacaaggc ttctctgtcc gcccttcaca cctccagcct 210; cacgttcact taggcacatc acacacacac tggcacacgc aggcatccat ccatccgtca 216; ttcattcaaa tatttattga gcacctacta tgtgcccagc cctgttctag gcactgggca 222; ttaccataga gaacaaaata taca tctgccctca tggaaggtga cgttcccagg 228; agagggcacc tacacagtca cgcaaacaca cactaattcc tggcagggcc cccagcccct 234; cccctggctg agcagccctg aaat caga taaacagacc cccttctgct 240; ccgcttcctc ctgcccagcc aggcaacacc ctcaaccggc tccatcacat cctcaggtct 246; cgggaccatg gggggctcag aggggagaca cacctactgc ttcctcagat gggcccctcc 252; cctt cccttgctcg gggaaagccc ccaattctgc ccacacccat ttatttcctt 258; ccttccttcc ttcttttctt tccttccttc cttctttttt gtttttgccc ccaattctgc 264; ccat ttctttcttt ccttccttcc ttcttttttg tttttgcccc cagttctgtc 270; cacacccctt ccctttcctg tcctgtcctt tctttctttt ttgatagaat cttgctctgt 276; cgcccaggct ggagtgcagt ggtgagatct cagctcactg ccac ctcctgggtt 282; gaagtgattc tcgtgcctca gcctcctgag tagctgggac tgcaggcacg cacg 288; cccagctaat ttttgtattt gagtagagac ggggtttcac catgttggcc aggctggtct 294; cgaactccgc atctcaggtg atctgctcgc cttggcctcc caaagtgatg ggattacagg 300; catgagccac cgtgcccggc ttcacaccca tttctttaaa aaggatcccg tagcaggcag 306; aaaagcccct tccatcctgc gata ctgtgccccc ttggagatat ttccgtcctc 312; cacccacgtg tctgtggctg ccca gcctgctcct ggccccctgg aagcctcccc 318; acagctggta actt aaggattgct gggccaccgc ctctctgcct accaccattc 324; catatttaag tggagcccct aaag gCCCngggC tttattttag tctccttttc 330; agggatgtcg tgggcggggg agggggttct tggtgctaca gccctctccc cacccctaaa 3361 gggacgccga cgctgtttgc tgccttcacc acatattagt ccct ggcaggggac 342; cccatggaaa agatggggaa gagcaaaata catggagacg acgcaccctc caggatgctc 348; gctgggattc ccacgcccac cactgtcccc caccccatgg ctgggagggg cctctgaacg 354; gaacagtgtc cccacagagc gaataaagcc aaggcttctt cccaaaaaaa aaaaaaaaaa 360; a n seguence: NCBI Reference Seguence: NP_055416.2 LOCUS NP_05541 6 ACCESSION NP_055416 l rgga rgqqpeairt vtsalkelyr eehy rfgafhspal edadfdgkpm 6; vlvagqystg ktsfiqylle qevpgsrvgp epttdcfvav mhgdtegtvp gnalvvdpdk 12; pfrklnpfgn tflnrfmcaq lpnqvlesis iidtpgilsg aquvsrgyd fpavlrwfae l8; rvdliillfd ahkleisdef seaigalrgh edkirvvlnk admvetqqlm rvygalmwal 24; gkvvgtpevl fwsq nrrl feleeqdlfr diqglprhaa lrklndlvkr ; arlvrvhayi isylkkemps vfgkenkkkq lilklpvifa kiqlehhisp gdfpdchmq 36; ellmahdftk fhslkpklle aldemlthdi aklmplque vqu ggafegthmg 42; pfv rgpd a m dg gsdd a wvvtkdk skydeifynl apadgklsgs kaktwmvgtk 48; lpnsvlgriw klsdvdrdgm lddeefalas hlieaklegh glpanlprrl vppskrrhkg 54; sae UGP2 Official Sym bol: UGP2 Official Name: UDP-glucose pyrophosphorylase 2 Gene ID: \ sm: Homo sapiens Other Aliases: UDPG, , UGP1, UGPPl, UGPP2, pHC379 Other Designations: UDP-glucose diphosphorylase; UDP-glucose pyrophosphorylase 1; UDPGP; UGPase 2; UTP--glucosephosphate uridylyltransferase; UTP--glucosephosphate uridylyltransferase 2; UTP- glucose-1 -phosphate uridyltransferase; uridyl diphosphate glucose osphorylase 2 Nucleotide seguence: NCBI Reference Seguence: NM_001001521.1 LOCUS NM_001001521 ACCESSION NM_001001521 l tttccgcatt gaaggggctg ctccgaatgg agggggaggg gaggtgttta ggagaaagta 6; gtgg gtgtcgggag ccggctgacg ggtggacaag ggggggttag cagctgggct 12; gcgaccgtta gggaggggct gtgc atgtgtgagg gaagagagag agagagaagg 18; caga ggtgactttc agcctgcgag tccc ggggcgccat aaacgccccc 24; aatttcccag ctgctaaagg aagaggaaga tcttagcaaa gcaatgtctc aagatggtgc ; ttctcagttc caagaagtca ttcggcaaga gctagaatta tctgtgaaga aggaactaga 36L aaaaatactc accacagcat catcacatga atttgagcac accaaaaaag acctggatgg 42L atttcggaag ctatttcata tgca agaaaagggg ccttctgtgg attggggaaa 48L aatccagaga ccccctgaag attcgattca accctatgaa aagataaagg ccaggggctt 54L gcctgataat atatcttccg tgttgaacaa actagtggtg gtgaaactca atggtggttt 60; gggaaccagc atgggctgca aaggccctaa aagtctgatt ggtgtgagga atac 66; ctttctggat ctgactgttc agcaaattga acatttgaat aaaacctaca atacagatgt 72; tcctcttgtt ttaatgaact cttttaacac ggatgaagat accaaaaaaa tactacagaa 78; gtacaatcat tgtcgtgtga acac tcaa agcaggtacc cgaggattaa 84; taaagaatct cctg tagcaaagga cgtgtcttac tcaggggaaa atacagaagc 90; ttggtaccct ccaggtcatg gtgatattta cgccagtttc tacaactctg ttga 96; tacctttata ggagaaggca aagagtatat ttttgtgtct aacatagata atctgggtgc L02; cacagtggat ctgtatattc ttaatcatct aatgaaccca cccaatggaa aacgctgtga L08; atttgtcatg acaa ataaaacacg tgcagatgta aagggcggga cactcactca L14; atatgaaggc aaactgagac tggtggaaat tgctcaagtg ccaaaagcac atgtagacga L20; gtct gtatcaaagt tcaaaatatt taatacaaac aacctatgga tttctcttgc L26; agcagttaaa agactgcagg agcaaaatgc cattgacatg gaaatcattg tgaatgcaaa L32; gactttggat ggaggcctga atgtcattca aact gcagtagggg ctgccatcaa L38; aagttttgag aattctctag gtattaatgt gccaaggagc cgttttctgc cacatcagat ctcttgctgg tgatgtcaaa cctctatagt cttaatgcag aatgagtgaa gaat ttcctacagt gcccttggtt aaattaggca gttcttttac 156; gaaggttcaa gattatctaa gaagatttga aagtatacca gatatgcttg cctcacagtt gatg tgacatttgg aaaaaatgtt tcattaaagg gaacggttat L68; tgca aatcatggtg acagaattga tatcccacct ggagcagtat gtct ggaaaccttc gcatcttgga ccactgaaat gaaaaatact gtggacactt 1801 aaataatggg ctagtttctt acaatgaaat gttctctagg attctaaaat aggcaggtac 1861 tttactatgt tactgtaccc tgcagtgttg atttttaaaa tagagttttc tgcagtatgc 1921 ttttagtcta agaaaagcac agatggagca atactttcct tctttgaaga gaatcccaaa 1981 agttagttca tcttaaagtg caatattgtt taaa actgggcaac tttggaagaa 2041 cttttaacag aagcctcaat gatgatcact tgct tgtgatttca aaaataaagc 2101 agtgaagcaa aaaa aaaaaaaaa Protein seguence: NCBI nce Seguence: NP_001001521.1 LOCUS NP_001001521 ION NP_001001521 1 msngasqfq evirqelels ilt tasshefeht frkl fhrflqekgp 61 svdwgkiqrp pedsiqpyek ikarglpdni ssvlnklvvv klngglgtsm gckgpkslig 121 vrnentfldl tquiehlnk tyntdvplvl mnsfntdedt kkilqkynhc rvkiytfnqs 181 ryprinkesl lpvakdvsys genteawypp ghgdiyasfy nsglldtfig egkeyifvsn 241 idnlgatvdl yilnhlmnpp ngkrcefvme advk ggtltqyegk 1r1veiaqvp 301 kahvdefksv skfkifntnn lwislaavkr lqeqnaidme iivnaktldg glnviqleta 361 vgaaiksfen s1ginvprsr flpvkttsdl llvmsnlysl nagsltmsek refptvplvk 421 1gssftkqu sipd mleldhltvs gdvtfgknvs lkgtviiian hgdridippg 481 avlenkivsg n1rildh Official Symbol: UGDH Official Name: UDP-glucose drogenase Gene ID: 7358 Organism: Homo sapiens Other Aliases: GDH, UDP-GICDH, UDPGDH, UGD Other Designations: UDP—Glc dehydrogenase; UDP-glucose dehydrogenase; uridine pho-glucose dehydrogenase Nucleotide seguence: NCBI nce Seguence: NM_001184700.1 LOCUS: NM_001184700 ACCESSION : NM_001184700 l gtgaaggaaa tagggacctg gccctgggcc ttgtgtagcg ggagggggag ctaggaagca 6L gctgagggca gaatccagga gggcctggct gaat gaagcctccg ccttcgcagg 12; caaaagcctt taaatacggg ctcaggcccg ggactcagag tgtaacgcgt ggcagcctga l8; ggcg tgcgccgaga gggagctcag atcgagcggg gtgg agaagctgcg 24; gcggcgcggc ccgtaggaag gtgctgtccg aacgatcggg ataggagcgg tccctgcgct ; tgctgctggg aagtggtaca atcatgtttg aaattaagaa gatctgttgc atcggtgcag 36; gctatgttgg aggacccaca gtca ttgctcatat tgaa atcagggtaa 42; cggttgttga tgtcaatgaa tcaagaatca atgcgtggaa ttctcctaca cttcctattt 48; atgagccagg actaaaagaa gtggtagaat cctgtcgagg aaaaaatctt tttttttcta 54; ccaatattga tgatgccatc aaagaagctg atcttgtatt tatttctgtg ctgtccaacc 60; ctgagtttct ggcagaggga acagccatca aggacctaaa agac agagtactga 66; ttggagggga tgaaactcca caga gagctgtgca ggccctgtgt gctgtatatg 72; agcactgggt tcccagagaa aagatcctca ccactaatac ttggtcttca gagctttcca 78; aactggcagc aaatgctttt cttgcccaga gaataagcag cattaactcc ataagtgctc 84; aagc aacaggagct gatgtagaag caac agcgattgga atggaccaga 90; gaattggaaa caagtttcta aaagccagtg ttgggtttgg tgggagctgt ttccaaaagg 96; atgttctgaa tttggtttat ctctgtgagg ctctgaattt gccagaagta gctcgttatt 102; ggcagcaggt catagacatg aatgactacc ggag gtttgcttcc cggatcatag 108; atagtctgtt taatacagta actgataaga agatagctat attt gcattcaaaa 114; aggacactgg tgatacaaga tcta gtatatatat tagcaaatat ttgatggatg 120; aaggtgcaca tctacatata tatgatccaa aagtacctag ggaacaaata gttgtggatc 126; atcc aggtgtttca gaggatgacc aagtgtcccg gctcgtgacc atttccaagg 132; atccatatga agcatgtgat ggtgcccatg ctgttgttat ttgcactgag tgggacatgt L38; ttaaggaatt ggattatgaa Cgcattcata aaaaaatgct aaagccagcc tttatcttcg L44; atggacggcg tgtcctggat gggctccaca taca aaccattggc ttccagattg L50; aaacaattgg caaaaaggtg tcttcaaaga cata tgctccttct ggtgaaattc L56; cgaagtttag tcttcaagat ccacctaaca agaaacctaa agtgtagaga ttgccatttt L62; tatttgtgat tttttttttt tttttttggt acttcaggat agcaaatatc tatctgctat L68; gtaa atgaaccaag tgtttttttt tgtttttttt ttgagacaga gtctcactgt L74; tgcccaggct ggagtgcagt ggtgcaatct cggctcactg caagctctgc ggtt L801 cacgccattc tcctggctca gcctcccaag tagctgggac tacaggcacc cgccacagtg L861 taat tttttgtatt tttagtagag acagggtttc accatgtgag ccaggatggt 1921 ctcaatctcc tgaccttgtg aaccacccgt ctcggcctcc gctg tgtgagccac cacgcctggc ccatgaacca agtgttttta aggaaacaaa actatttttt 204; tcag atttatacta gctatatgga cata tctggtaatt atgaatctag 210; aattttttta catattttta taatactgtt agctcagtta ttggatgagt gaaagataat 216; catgttggtt ttaatagtgt ttgt aaaataaaaa ttaaacttca aactctttac 222; tttataaatt gtccataggc cacactttaa tatcacatta taaagggaag gacagtcttc 228; attcctcctg gttattggtt tcat taaagatata ttttgaatcc atgaaattgc 234; tatgctaaac agcctttaca tgtatggtct ggttaaagtt cctttgttcc ttttgtttta 240; ataaaatgtg tcactgattt tttagctcaa aatcatcact gttaatttcc agtcacccca 246; aatatggtta tttt tttttaatca tgaagagaaa attagtagca tttctttctc 252; tccccattat ttattggttt ctaa tctttttttt tttagtccaa aagccaaaaa 258; tatttatctt ggttttacat tttaatttcc attcttaatt gtaatttttt tctttaaata 264; aggaaaccaa tataatctca tgtataaaaa tatt ttacaagtta catatagcat 270; cattctaaaa taagaatttt ttttgttttc tgtctgcttt tttcttatgt ctcttgttga 276; gttttatatt ggtt atttttgctt gatc attattaaaa tatatccaat 282; gtccctttga tacttgtgct ctgctgagaa tgtacagttt aaca tcccaggtct 2881 catccttcag gaattttgca gttcaatgag aagagggaga caaatataaa gatgaggaca 294; gaagcatctc tacagatgaa aattacataa ataaaacatt ctccatcaac aactaaaaaa 300; aaaaaaaaaa aaa Protein seguence: NCBI Reference Seguence: NP_001171629.1 LOCUS: NP_001171629 ACCESSION: NP_001171629 1 mfeikkicci gagyvggptc sviahmcpei rvtvvdvnes rinawnsptl piyepglkev 6; vescrgknlf fstniddaik eadlvfisvl snpeflaegt aikdlknpdr vliggdetpe 12; gqravqalca vyehvarek ilttntwsse lsklaanafl aqrissinsi salceatgad l8; veevataigm dqrignkflk asvgfggscf qkdvlnlvyl cealnlpeva rywquidmn 24L dyqrrrfasr iidslfntvt dkkiailgfa fkkdtgdtre skyl mdegahlhiy ; eqiv vdlshpgvse ddqurlvti skdpyeacdg ahavvictew dmfkeldyer 36; ihkkmlkpaf ifdgrrvldg lhnelqtigf qietigkkvs apsg lqdp 42; pnkkpkv Official Symbol: PL|N3 (also known as M6PRBP1) al Name: pin 3 Gene ID: 10226 sm: Homo sapiens Other Aliases: M6PRBP1, PP17, T|P47 Other Designations: 47 kDa MPR-binding protein; cargo selection protein TlP47; mannosephosphate receptor-binding protein 1; perilipin-3; placental protein 17; tail-interacting protein, 47 kD tide seguence: NCBI Reference Seguence: NM_001164189.1 LOCUS: NM_001164189 ACCESSION : NM_001164189 1 tggcgcgggc aatccctcaa cctgattggt cccctcgccc gtcactccag tgcgccccca 61 acctaccacg cagtaaaagc cacccccgcc tcggcccgga cggtttccaa gctggttttg 121 aagtcgcggc agctgttcct tccg gttgaccgcg cgtctgctgc agagaccatg 18; tctgccgacg gggcagaggc tgatggcagc acccaggtga cagtggaaga acag 24; cagcccagtg accg tgtggccagc atgcctctga tcagctccac ctgcgacatg ; gtgtccgcag cctatgcctc caccaaggag agctacccgc acatcaagac cgac 36; gcagcagaga agggagtgag gaccctcacg gcggctgctg tcagcggggc tcagccgatc 42; ctctccaagc tggagcccca gattgcatca gccagcgaat acgcccacag ggggctggac 48; gagg agaacctccc catcctgcag cagcccacgg agaaggtcct ggcggacacc 54L aaggagcttg tgtcgtctaa ggtgtcgggg gcccaagaga tggtgtctag cgccaaggac 60; acggtggcca cccaattgtc ggaggcggtg gacgcgaccc gcggtgctgt gcagagcggc 66L gtggacaaga caaagtccgt cggc ggcgtccaat cggtcatggg ctcccgcttg 72; ggccagatgg tgttgagtgg ggtcgacacg gtgctgggga agtcggagga gtgggcggac 78; aaccacctgc cccttacgga actg gcccgcatcg ccacatccct ggatggcttt 84; gacgtcgcgt ccgtgcagca gcag gaacagagct acttcgtacg ctcc 90; ctgtcggaga ggctgcggca gcacgcctat gagcactcgc tgggcaagct tcgagccacc 96; aagcagaggg cacaggaggc tctgctgcag cagg tcctaagcct gatggaaact L02; gtcaagcaag gcgttgatca gaagctggtg gaaggccagg agaagctgca ccagatgtgg L08; tgga agca gctccagggc cccgagaagg agccgcccaa ggtc L14; gagtcccggg Cgctcaccat gttccgggac attgcccagc aactgcaggc cacctgtacc L20; tccctggggt ccagcattca gggcctcccc accaatgtga aggaccaggt gcagcaggcc L26; cgccgccagg tggaggacct ccaggccacg ttttccagca cctt ccaggacctg L32; tccagcagca ttctggccca gagccgtgag cgtgtcgcca gcgcccgcga ggccctggac L38; cacatggtgg tggc ccagaacaca cctgtcacgt ggctcgtggg accctttgcc L44; cctggaatca ctgagaaagc cccggaggag aagaagtagg gggagaggag aggactcagc L50; gggccccgtc tctataatgc agctgtgctc tggagtcctc aacccggggc caaa L56; cttattttct agccactcct cccagctctt ctgtgctgtc cacttgggaa L62; tcaaaacggg catcacccag ttgacccatc cctc tctgagcttg gaagaagcct L681 gttctgagcc tcaccctatc agtcagtaga gagagatgtc cagaaaaaat atctttcagg L74; aaagttctcc cctgcagaat tttttttcct tgttaaatat caggaatata ggccgggtgc L80; ggtggctcac acctgtaatc ccagcacttt gggaggctga cgga acacctgagg L86; tcaggtgttc gagaccagcc aggccaacat ggtgaaaccc cgtctctact aaaaatacaa L92; aaaaaaatga gccgggcatg gtagcaggtg tctgttatcc cagttaggag gctgaggcaa 198; gagaatctct tgaacctgag aggcggaggt tgcagtgagc caagatcgcg ccattgcact 204; ccagcctggg ggacaagagt gagacttagt ctcaaaaaaa aaga aaaaaaaatc 210; agggatatag ttcatatccc acttctttgt cgat gtccctgaat atcagcctgt 216; agctaatgga attt ctggtctaag tgggcctcct ggggatgggg actg 222; agcttctgag cctcattgta gagtagaaag gtactggggc ctgtgtggta agccttgttg 228; aaatgctctg gtattcagta ttgccttaat aaacttcacc tgca tacaggcaaa 234- aa Protein seguence: NCBI Reference Seguence: NP_001157661.1 LOCUS: 157661 ACCESSION: NP_001157661 l msadgaeadg stqvtveepv qqpsvvdrva smplisstcd mvsaayastk esyphiktvc 6; daaekgvrtl taaavsgaqp ilsklepqia saseyahrgl dkleenlpil qqptekvlad 12; tkelvsskvs gaqemvssak dtvatqlsea vdatrgavqs gvdktksvvt gqusvmgsr 18; lgqmvlsgvd tvlgkseewa tdae lariatsldg fdvaquqqr 24; slserquha yehslgklra tquaqeall qlsqvlslme tvkqgvqul vegqeklhqm ; wlswnqkqlq gpekeppkpe vesraltmfr diaqqlqatc tslgssiqgl ptnvkdqqu 36; arrqvedlqa tfssihsfqd lsssilaqsr ervasareal dhmveyvaqn tpvtwlvgpf 42; apgitekape ekk C14orf166 Official Sym bol: C14orf166 Official Name: chromosome 14 open reading frame 166 m: 51637 Organism: Homo sapiens Other Aliases: CGl-99, CGI99, CLE, CLE7, LCRP369, RLLM1 Other Designations: CLE7 g; RLL motif ning 1; UPF0568 protein C14orf166 Nucleotide seguence: NCBI Reference Seguence: NM_016039.2 LOCUS: NM_016039 ION : NM_016039 l cgccgtcatt tcggagcgac tcagcgcctg cccgccctct cgccgcgtcg ccggtgcctg 6; cgcctcccgc tcgc ttcttctctc gagg cccgggggac cagagcgaga 121 agcggggacc atgttccgac gcaagttgac ggctctcgac taccacaacc ccgccggctt l8; caaa gatgaaacag gaaa cttcatcgtt tggcttgaag accagaaaat 24L caggcactac aagattgaag acagagggaa aaac atccacagca gcgactggcc ; caagttcttt gaaaagtatc tcagagatgt taactgtcct ttcaagattc aagatcgaca 36; agaagctatt gactggcttc ttggtttagc actt gaatatggag ataatgctga 42; caag gatttagtac ctgataattc aaaaactgct gacaatgcaa ctaaaaatgc 48; agaaccattg atcaatttgg atgtaaataa tcctgatttt aaggctggtg tgatggcttt 54; ggctaacctg cttcagattc agcgtcatga tgattacctg gtaatgctta aggcaattcg 60; ggtt caggagcgcc tgacacagga tgct aaggcaaatc aaacaaaaga 66; gggcttacct gttgctttag acaaacatat tcttggtttt gacacaggag atgcagttct 72; taatgaagct gctcaaattc tgcgattgct gcacatagag gagctcagag agctacagac 78; aaaaatcaac gaagccatag tagctgttca tatt gctgatccaa agacagacca 84; cagactggga aaagttggaa gatgaacact tgaggacttc tcac ctacttagta 90; cagttgggaa ccatacactt ctggcatgtt tggaaatcaa aatgtcacat tctcggggga 96; ggaagcccag aaaattgggt atgttctaga gatttaccac cattgcttat tgcttttttc 102; tttaataaag tttaggaaag tagaattttt aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_057123.1 LOCUS: NP_057123 ACCESSION: NP_057123 l mfrrkltald fnck detefrnfiv wlequirhy kiedrgnlrn ihssdwpkff 61 ekylrdvncp fkiqdrqeai dwllglavrl eygdnaekyk dlvpdnskta dnatknaepl 121 inldvnnpdf kagvmalanl lqiqrhddyl vmlkairilv qerltqdava kanqtkeglp 181 valdkhilgf dtgdavlnea aqilrllhie elrelqtkin eaivavqaii adpktdhrlg 241 kvgr SNRNP70 Official Symbol: SNRNP70 Official Name: small nuclear ribonucleoprotein 70kDa (U1) Gene ID: 6625 Organism: Homo sapiens Other Aliases: RNPU1Z, RPU1, SNRP70, Snp1, , U170K, U1AP, U1RNP Other Designations: U1 small nuclear ribonucleoprotein 70 kDa; U1 snRNP 70 Nucleotide seguence: NCBI Reference Seguence: NM_003089.4 LOCUS: NM_003089 ION : 089 1 gcggttcggc gcggaaagcg ggaggtggag gggcggcttg gggcaagcgc gcgcgcgcag 6; tgcagaagcc agccccccgc ggctgaggta ctcaaggtgc ccaaaggcgg ggtagtgacc 12; tcgcgcgtgc gctgtgcccg cggcagcgcc gggtcctagt gttg ttgttggcac 18; cgcacggcgc gtgcgcagtg aggacggcgg agggatttgc ggCnggaCC ctgc 24; tccagtcgct atcggaggcc gcgcgggtgg ctgagcagcg gcctggtgcg ctcgcttagc ; gggcgacgga atcagacgga cgtggacgcc gtgg aagccgaagc aggagttgtt 36L gttgctgagg ggctgccgca gccgccgcga gcctccggac agacgccaga gcgaggaggg 421 cgctacgcga cttggcaaga agtt cctgccgccc aaccttctgg ccctctttgc 48L cccccgtgac cctattccat acctgccacc cctggagaaa ctgccacatg acca 54; caatcaacct tattgtggca ttgcgccgta cattcgagag tttgaggacc ctcgagatgc 60; tcca actcgtgctg aaacccgaga ggagcgcatg gagaggaaaa gacgggaaaa 66; gattgagcgg cgacagcaag aagtggagac agagcttaaa atgtgggacc ctcacaatga 72; tcccaatgct cagggggatg agac tctcttcgtg gcgagagtga acac 78; aacagaatcc aagctccgga gagagtttga ggtgtacgga cctatcaaaa gaatacacat 84; ggtctacagt aagcggtcag gaaagccccg tggctatgcc gagt acgaacacga 90; gcgagacatg cactccgctt acaaacacgc agatggcaag aagattgatg gcaggagggt 96; ccttgtggac gtggagaggg gccgaaccgt gaagggctgg cggc ggctaggagg L02; aggcctcggt ggtaccagaa gaggaggggc tgatgtgaac atccggcatt caggccgcga 1081 tgacacctcc cgctacgatg agaggcccgq cccctccccg caca gggaccggga 114; ccgggaccgt gagcgggagc gcagagagcq gagccgggag cgagacaagg agcgagaacg 120; ctcc cgctcccggg accggcggaq gcgctcacgg agtcgcgaca aggaggagcg 1261 gaggcgctcc agggagcgga gcaaggacaa ggaccgggac cggaagcggc gaagcagccg L32; gagtcgggag cgggcccggc gggagcggga gcgcaaggag gagctgcgtg gcggcggtgg L38; cgacatggcg gagccctccg aggcgggtga cgcgccccct gatgatgggc gctcgggcct gacggccctg acggtccaga gggc nggatcgtg accgggagcg 150; acggcggagc caccggagcg agcgcgagcg gcgccgggac gtg accgtgaccg 156; tgaccgcgag cacaaacggg gggagcgggg cagtgagcgg ggcagggatg aggcccgagg 162; tgggggcggt ggccaggaca acgggctgga gggtctgggc aacgacagcc tgta 168; catggagtct ggcg acggctacct ggctccggag aatgggtatt tgatggaggc 174; tgcgccggag tgaagaggtc gtcctctcca tctgctgtgt ttggacgcgt ccag 180; ccccttgctg tcatcccctc ccccaacctt ggccacttga gtttgtcctc caagggtagg 186; tgtctcattt gttctggccc cttggattta aaaataaaat cctg ttgatagtgg 192; gcaaaaaaaa aaaaaaaaaa Pnneh1seguence: NCBI nce Seguence: NP_003080.2 LOCUS: NP_003080 ACCESSION: NP_003080 l mtqflppnll alfaprdpip ylppleklph ekhhnqpycg iapyirefed prdappptra 61 tr rm rk rr ki rrqq V t lkmwd phndpnaqu afktlfvarv nydttesklr 121 refevygpik rihmvyskrs gkprgyafie yeherdmhsa kkid grrvlvdver 181 grtvkgwrpr rlggglggtr rggadvnirh sgrddtsryd erpgpsplph rdrdrdrere 241 rrersrerdk ererrrsrsr drrrrsrsrd keerrrsrer skdkdrdrkr rssrsrerar 301 rererkeelr ggggdmaeps eagdappddg ppgelgpdgp dgpeekgrdr drerrrshrs 361 ererrrdrdr drdrdrehkr gergsergrd earggggqu ngleglgnds rdmymesegg 421 dgylapengy e Official Symbol: CNN2 Official Name: calponin 2 m:1265 Organism: Homo sapiens Other Aliases: none Other ations: calponin H2, smooth muscle; calponin-2; neutral calponin Nucleotide seguence: NCBI Reference Seguence: NM_004368.2 LOCUS: NM_004368 ACCESSION : NM_004368 1 gaaagagtga gagccgccca ctga gcagagagcc cgcaggagtg ccacgtcccg 61 gcggcctcgg cccctccctg tttc catc aaagggggcg aggggcccct 121 ccaggcctct ggtgacgggg gtgctgtgcc caggcggggg tccgggggcg accgaggggg 18; ctcaggaagt ccgcggccgc tcgg cgcctccagg ccttataagg acatttgcgc 24L tccgggccaa tcagcggcgg gggcgtggcg cgcggagccc ggcgcgtccc aaccccgcgc ; cagcccggcg gtcccgtccc gtcccgtcct gtgcggcccc gtcccgccgc ccgcccgcca 36L agct ccacgcagtt caacaagggc ccctcgtacg ggctgtcggc caag 421 ctcc tgtccaaata tgacccccag aaggaggcag agctccgcac ctggatcgag 481 ggactcaccg ccat cggccccgac ttccagaagg gcctgaagga tggaactatc 54; ttatgcacac tcatgaacaa gctacagccg ggctccgtcc ccaagatcaa ccgctccatg 60; cagaactggc taga aaacctgtcc atca aggccatggt cagctacggc 66; atgaaccctg tggacctgtt cgaggccaac gacctgtttg agagtgggaa catgacgcag 72; gtgcaggtgt ctcttctcgc gggg aaggccaaga ctaaggggct gcagagcggg 78; gtggacattg gcgtcaagta ctcggagaag cgga atttcgacga catg 84; aaggctggcc agtgcgtcat nggctgcag atgggcacca acaaatgcgc cagccagtcg 90L ggcatgactg cctacggcac gagaaggcat ctctatgacc ccaagaacca tatcctgccc 96L cccatggacc actcgaccat cagcctccag atgggcacga acaagtgtgc cagccaggtg L02; ggcatgacgg ctcccgggac ccggcggcac atctatgata ccaagctggg aaccgacaag L08; tgtgacaact cctccatgtc cctgcagatg ggctacacgc agggcgccaa ccagagcggc L14; caggtcttcg gcctgggccg gcagatatat gaccccaagt actgcccgca aggcacagtg L20; gccgatgggg ctccctcggg caccggcgac gacc nggggaggt cccccttact accaggagga ggccggctac tgaggctccc agcacgctct cgtctgccca tctgggtttt tgggtttttc ttca tctttttttt ttttttctta L38; acccgttcag tgctgccagt caaccaaggg agtg tcagcgtggg atcaggcagc L44; agagcttttt tcccctttgc cttgatcctt cgcaaggctg agccactggg ctgtggggga L50; aggggtcaag gccatatccc aatacgtgta gggcgagggt ccctgctggc aggc L56; ggga agaagagacc tgggcttgga aggaaccggt ccccgacggt ttctgcttgc L62; ctcgcctctt cccccttttg tcagctgagc agtttgtggt ttctatgccc gcaagtttca L68; ggaagtattc acaaaagaaa aatacatttt cagg ggtggggcaa ggacagtgga L74; gcta ggaaatgagt cccctgggaa aggggaccgg gccgtgatgt taaatatctc L80; ccaa gtgactggat ttgcctagga ccttcagatc aacagacttc agaccctcag L86; acctgccccg gggccaggtg gagaaagtga gggccgtaca aggaagtgaa attctgagtt L92; gttggggcta accc cctctccatg ctccccgccc caactcactc tggcctcagt L98: agattttttt ttcagttgtg gttgttgccc aggctggagt gcgc catcttggct 2041 cactgcacct ccaccttccg ggctcaagcg cagc ctcagcctcc tgagtagcta 210; ggactgcagg tgctccacca cgcccggcta atttttgtat ttttagtaga gatggggttt 216; ccccatgttg gccaggctgg tctcgaactc ctggcctcag gtgtgatccg cccgcctccg 222; cctccccaag Cgctgagatt acaggtgtga gccaccgtgc ccaggccctc agtaggtttt 228; aaggagtccc cagccctcct cccttctggg cccgaccagc ttatactgct ccatcttccc 2341 cggccacatg caag tactgcacag ggacccccca cccaggggcc ctgctccgtg 2401 agataatgtg gact gtggaccaaa cgcaataaaa cctctgtttg gaaa 2461 aaaaaaaaaa aaaaaaaa Protein seguence: NCBI Reference Seguence: NP_004359.1 LOCUS: NP_004359 ACCESSION: NP_004359 l msstqfnkgp syglsaevkn rllskydqu eaelrtwieg 1tglsigpdf qulkdgtil 61 ctlmnqupg svpkinrsmq nwhqlenlsn fikamvsygm npvdlfeand 1fesgnmtqv 121 qullalagk aktkglqsgv digvkysekq ernfddatmk agqcviglqm sqsg 181 mtaygtrrhl ydpknhilpp slqm gtnkcasqvg mtapgtrrhi ydtklgtdkc 241 dnssmslqmg qsgq Vfglgrqiyd pkycpqgtva dgapsgtgdc pdpgeVPeyp 301 pyyqeeagy PEBP1 Official Symbol: PEBP1 Official Name: phosphatidylethanolamine binding protein 1 m: 5037 Organism: Homo sapiens Other Aliases: HCNP, HCNPpp, PBP, PEBP, PEBP-1, RKIP Other Designations: Raf kinase inhibitory protein; hippocampal cholinergic neurostimulating e; neuropolypeptide h3; phosphatidylethanolaminebinding protein 1; prostatic binding n; tic-binding protein; raf kinase inhibitor protein Nucleotide seguence: NCBI Reference Seguence: NM_002567.2 LOCUS: NM_002567 ACCESSION I NM_002567 XR_109136 XR_109137 XR_111344 XR_1 14620 l tgggcggcgg ctgaggcgcg tgctctcgcg tggtcgctgg gtctgcgtct tcccgagcca 6; tgag ctctccgcgt cgcctctgtc gcccgcgcct ggcctaccgc ggcactcccg 12; gctgcacgct ctgcttggcc tgcc ggtggacctc tggt ccgggccctt 18; gcaa gaagtggacg agcagccgca gcacccgctg catgtcacct acgccggggc 24; ggcggtggac gagctgggca aagtgctgac gcccacccag gttaagaata gacccaccag 30L catttcgtgg gatggtcttg attcagggaa gctctacacc ttggtcctga cagacccgga 36L tgctcccagc aggaaggatc ccaaatacag agaatggcat catttcctgg tggtcaacat 42L gaagggcaat gacatcagca gtggcacagt cctctccgat tatgtgggct cggggcctcc 48L caagggcaca ggcctccacc gctatgtctg gctggtttac gagcaggaca taaa 54; gtgtgacgag cccatcctca gcaaccgatc tggagaccac cgtggcaaat tcaaggtggc 60; gtccttccgt tatg agctcagggc cccggtggct ggcacgtgtt accaggccga 66; gtgggatgac tatgtgccca aactgtacga gcagctgtct gggaagtagg gggttagctt 72; ggggacctga actgtcctgg aggccccaag ccatgttccc cagttcagtg ttgcatgtat 78; aatagatttc tcctcttcct gccccccttg gcatgggtga acca gtcagatggt 84; agttgagggt gacttttcct gctgcctggc ctttataatt ttactcactc actctgattt 90; atgttttgat caaatttgaa cttcattttg gggggtattt tggtactgtg atggggtcat 96; atta aaat ccag aatgtaaaaa agaaaaaact ggggggaaaa L02; agaccaggtc tacagtgata gagcaaagca aatc tttaagggag gtttaaaaaa L08; aaaaaaaaaa aaaaagattg gttgcctctg cctttgtgat cctgagtcca gaatggtaca L14; caatgtgatt tgat gtcactcacc tagacaacca gaggctggca ttgaggctaa L20; cctccaacac agtgcatctc agatgcctca gtaggcatca gtatgtcact ctggtccctt L26; taaagagcaa tcctggaaga agcaggaggg agggtggctt tgctgttgtt gggacatggc L32; aatctagacc ggtagcagcg ctcgctgaca gcttgggagg aaacctgaga tctgtgtttt L38; ttaaattgat cgttcttcat gggggtaaga aaagctggtc tggagttgct gaatgttgca L44; ttaattgtgc tgtttgcttg tagttgaata aaaatagaaa tgaa gaaaaaaaaa L50; aaaaaaa Protein seguence: NCBI Reference Seguence: NP_002558.1 LOCUS: NP_002558 ACCESSION: NP_002558 1 mpvdlskwsg plslqevdeq pthlhvtya gaavdelgkv 1tptqvknrp tsiswdglds 61 gklytlvltd pdapsrkdpk yrewhhflvv nmkgndissg tvlsdyvgsg ppkgtglhry 121 vwlvyeqdrp 1kcdepilsn rsgdhrgkfk vasfrkkyel rapvagtcyq aewddyvpkl 181 yeqlsgk ACLY al Symbol: ACLY al Name: ATP citrate lyase m: 47 sm: Homo s Other Aliases: ACL, ATPCL, CLATP Other Designations: ATP citrate synthase; ATP-citrate (pro-S-)-lyase; ATP- citrate synthase; citrate cleavage enzyme Nucleotide seguence: NCBI Reference Seguence: NM_001096.2 LOCUS: NM_001096 ACCESSION : NM_001096 1 agccgatggg ggcggggaaa agtccggctg ggccgggaca aaagccggat cccgggaagc 61 taccggctgc tggggtgctc cggattttgc ggggttcgtc gggcctgtgg aagaagcgcc 121 gcgcacggac ttcggcagag gt agagcagg tctctctgca gccatgtcgg ccaaggcaat 181 ttcagagcag acgggcaaag aactccttta caagttcatc tgtaccacct cagccatcca 24L gaatcggttc gctc gggtcactcc tgacacagac tgggcccgct tgctgcagga ; ctgg ctgctcagcc agaacttggt agtcaagcca gaccagctga tcaaacgtcg 361 tggaaaactt ggtctcgttg acct cactctggat ggggtcaagt cctggctgaa 421 gccacggctg gaag ccacagttgg caaggccaca ggcttcctca agaactttct 481 gatcgagccc cccc acagtcaggc tgaggagttc tatgtctgca tctatgccac 54; ccgagaaggg gactacgtcc tgttccacca Cgaggggggt gtggacgtgg tgga 60; cgccaaggcc cagaagctgc ttgttggcgt ggatgagaaa ctgaatcctg aggacatcaa 66; aaaacacctg ttggtccacg cccctgaaga caagaaagaa attctggcca gttttatctc 72; cggcctcttc aatttctacg aggacttgta cttcacctac ctcgagatca ttgt 78; agtgaccaaa gatggagtct atgtccttga cttggcggcc aaggtggacg ccactgccga 84; ctacatctgc aaagtgaagt ggggtgacat cgagttccct ccccccttcg ggcgggaggc 90L atatccagag gaagcctaca ttgcagacct cgatgccaaa gcaa gcctgaagct 96L gaccttgctg aaccccaaag ggaggatctg gaccatggtg gccgggggtg ctgt L02; cgtgtacagc gataccatct gtgatctagg gggtgtcaac gagctggcaa L08; gtactcaggc gcccccagcg agcagcagac ctatgactat gccaagacta tcctctccct L14; catgacccga gagaagcacc cagatggcaa gatcctcatc attggaggca cttcaccaac gtggctgcca cgttcaaggg catcgtgaga cgag gaag gagcacgaag tcacaatctt tgtccgaaga ggtggcccca gggcttacgg gtgatgggag ggaa gaccactggg atccccatcc atgtctttgg L38; cacagagact cacatgacgg ccattgtggg catggccctg ggccaccggc ccatccccaa L44; ccagccaccc acagcggccc caaa cttcctcctc aacgccagcg ggagcacatc L50; gacgccagcc cccagcagga cagcatcttt ttctgagtcc agggccgatg aggtggcgcc L56; tgcaaagaag gccaagcctg caca agtc ccaagtccaa gatccctgca L62; aggaaagagc accaccctct tcagccgcca caccaaggcc attgtgtggg gcatgcagac L68; ccgggccgtg caaggcatgc tggactttga ctatgtctgc tcccgagacg agccctcagt L74; ggctgccatg gtctaccctt tcactgggga ccacaagcag aagttttact gggggcacaa L80; agagatcctg atccctgtct tcaagaacat ggctgatgcc atgaggaagc atccggaggt L86; agatgtgctc atcaactttg cctctctccg ctctgcctat acca tggagaccat L92; gaactatgcc cagatccgga ccatcgccat catagctgaa ggcatccctg aggccctcac L98; gagaaagctg aagg cggaccagaa gggagtgacc atcatcggac ctgccactgt 2041 tggaggcatc aagcctgggt gctttaagat tggcaacaca atgc tggacaacat 210; cctggcctcc aaactgtacc gcccaggcag ctat cgtt ccggaggcat 216; gtccaacgag ctcaacaata tcatctctcg gaccacggat tatg agggcgtggc 222; cattggtggg tacc cgggctccac attcatggat catgtgttac gctatcagga 228; cactccagga gtcaaaatga ttgtggttct tggagagatt gggggcactg aggaatataa 234; gatttgccgg ggcatcaagg agggccgcct cactaagccc atcgtctgct ggtgcatcgg 240; gacgtgtgcc accatgttct cctctgaggt ccagtttggc catgctggag cttgtgccaa 246; ccaggcttct gaaactgcag tagccaagaa ccaggctttg aaggaagcag gagtgtttgt 252; gccccggagc tttgatgagc ttggagagat catccagtct gtatacgaag atctcgtggc 258; caatggagtc attgtacctg cccaggaggt gccgccccca ccca tggactactc 264; ctgggccagg gagcttggtt tgatccgcaa acctgcctcg ttcatgacca gcatctgcga 270; tgagcgagga caggagctca tctacgcggg catgcccatc actgaggtct aaga 276; gatgggcatt ggcggggtcc tcggcctcct ctggttccag aaaaggttgc ctaagtactc 282; ttgccagttc atgt gtctgatggt gacagctgat cacgggccag ctgg 288; caac accatcattt gtgcgcgagc tgggaaagac ctggtctcca gcctcacctc 294; ggggctgctc accatcgggg atcggtttgg gggtgccttg gatgcagcag ccaagatgtt 300; cagtaaagcc tttgacagtg gcattatccc gttt gtgaacaaga tgaagaagga 306; gctg ggca accg agtgaagtcg ataaacaacc cagacatgcg 312; agtgcagatc ctcaaagatt acgtcaggca gcacttccct gccactcctc tgctcgatta 318; tgcactggaa gtagagaaga ttaccacctc gaagaagcca aatcttatcc tgaatgtaga 324; tggtctcatc ggagtcgcat ttgtagacat aaac tgtgggtcct ttactcggga 330; ggaagctgat gaatatattg acattggagc cctcaatggc atctttgtgc tgggaaggag 336; gttc attggacact atca gaagaggctg aagcaggggc tgtatcgtca 342; tccgtgggat gatatttcat atgttcttcc ggaacacatg taac agagccagga 348; accctactgc agtaaactga agacaagatc tcttccccca agaaaaagtg tacagacagc 354; tggcagtgga gcctgcttta tttagcaggg gcctggaatg taaacagcca ctggggtaca 360; ggcaccgaag accaacatcc acaggctaac accccttcag tccacacaaa gaagcttcat 366; atttttttta taagcataga aataaaaacc aagccaatat ttgtgacttt gctctgctac 372; ctgctgtatt atgg aagcatctaa gtactgtcag gatggggtct tcctcattgt 378i agggcgttag gatgttgctt tctttttcca ttagttaaac atttttttct cctttggagg 384; aagggaatga aacatttatg gcctcaagat actatacatt taaagcaccc caatgtctct 390; cttttttttt ttttacttcc ctttcttctt ccttatataa catgaagaac attgtattaa 396; tctgattttt aaagatcttt ttgtatgtta aagg gcttgtttgg tatcccactg 402; aaatgttctg tgttgcagac cagagtctgt ttatgtcagg gggatggggc cattgcatcc 408; attg tcacaaaata tgtggagtag aata tgtaaagttg taacatacat 4141 acatttaaaa tggaaatgca gaaagctgtg cttg tgtcttatgt tctctgtatt 4201 tatgcagctg atttgtctgt ctga agtgtgggtc caaggactcc taactacttt 4261 gcatctgtaa tccacaaaga ttctgggcag ctgccacctc agtctcttct ctgtattatc 4321 atagtctggt ttaaataaac tatatagtaa caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4381 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4441 aaaaaaaaaa Protein seguence: NCBI Reference ce: NP_001087.2 LOCUS: NP_001087 ACCESSION: NP_001087 1 msakaiseqt gkellykfic ttsaianfk yarvtpdtdw arllqdhpwl lsqnlvvkpd 61 qlikrrgklg lvgvnltldg Vkswlkprlg qeatvgkatg flknfliepf vphsqaeefy 121 vciyatregd yvlfhheggv dvgdvdakaq kllvgvdekl npedikkhll vhapedkkei 18; lasfisglfn fyedlyftyl einplvvtkd gvyvldlaak vdatadyick vkwgdiefpp 24; pfgreaypee daks gaslkltlln tmva gggasvvysd ticdlggvne ; lanygeysga pseqqtydya ktilslmtre khpdgkilii ftnv aatfkgivra 36; irdngplke hevtifvrrg gpnyqeglrv mgevgkttgi pihvfgteth mtaivgma;g 42; hrpipnqppt aahtanflln tpap srtasfsesr akka kpampquvp 48; sprslqgkst tlfsrhtkai vwgmqtravq gmldfdyvcs rdepsvaamv ypftgdhqu 54; fnghkeili pvfknmadam rkhpevdvli nfaslrsayd stmetmnyaq iaeg 60; ipealtrkli kkaqugvti igpatvggik pgcfkigntg gmldnilask vayv 66; srsggmsnel nniisrttdg vyegvaiggd rypgstfmdh vlrythpgv kmivvlgeig 72; gteeykicrg ikegrltkpi vcwcigtcat mfssevquh qase tavaknqaLk 78L eagvaprsf delgeiiqsv yedlvangvi vpaqevpppt vpmdysware 1g1irkpasf 84L mtsicdergq eliyagmpit evfkeemgig gvlgllwqu rlpkyscqfi tadh 90; gpavsgahnt iicaragkdl vssltsgllt igdrfggald aaakmfskaf dsgiipmefv 961 nkmkkegkli vksi nnpdmrvqil kdyvrthpa tplldyalev ekittskkpn 102; glig vafvdmlrnc gsftreeade yidigalngi fvlgrsmgfi ghylqurLk 1081 qglyrhpwdd isyvlpehms SNX12 Official Symbol: SNX12 Official Name: sorting nexin 12 Gene ID: 29934 Organism: Homo sapiens Other Aliases: none Other Designations: sorting nexin-12 Nucleotide seguence: NCBI Reference Seguence: NM_001256185.1 LOCUS: NM_001256185 ACCESSION : 256185 l ttttgattgc catttccttg ggacagcctg aagagagaat cgaaagaagt cagt 6; atggttgcat aacatcgagt ngagattgt tcct ttgagaaaaa tgtacgggca 12; ggaagatgaa taggtgtctg tttgattaag gctctccttc ggaaagatgt cggacacggc 18; agtagctgat acccggcgcc ttaactcgaa gccgcaggac ctgaccgacg cttacgggcc 24; gccaagtaac ttcctggaga tcgacatctt taatcctcag acggtgggcg tgggacgcgc ; gcgcttcacc acctatgagg ttcgcatgcg ccta cctatcttca agctaaagga 36; gtcctgcgta cggcggcgct acagtgactt tgagtggctg aaaaatgagc tggagagaga 42; tagcaagatt gtagtaccac cactgcctgg gaaagccttg aagcggcagc tccctttccg 48; aggagatgaa gggatctttg aggagtcttt catcgaagaa aggaggcagg gcctcgagca 54; taac aaaattgctg ggcacccact ggctcagaat gaacgctgcc tacacatgtt 60; cctgcaagag attg acaggaacta cgtcccgggg aaggtgcgcc agtaggagcc 66L cctctcacca cctgccctct actttcctgc tgaaatgaca ttta cactaagcct 72L ctctctgtct ttgatctgaa gttggctgcc catctcctgg cctgatagac gcat 78L tgtgctccct ggtacctgac tacaccatgt tctg ctaggatcct cttctctgag 84; gagaggtggg aacccacagg cagatgccct gggg ttggggtggg tgggtggggg 90; gattagactg gaaggcaatt tcttgggcat ttacccatgc cagaaggcta acctgggggg 96; aggggggcgc ttgtgctggt cttg gatacatact gcaa gttcagggga L02; ttac tcttaggttt gaac cagg gaaaaaccct gcctttccta L08; actgcatgta ttttttcctt tttggaaagg tggtagagac tcagaagctt tccttgtttt L14; gcct gctcccagtt ttcttaacag tttcttttgt tgctttctct ctcccttgtt L20; gctttccatg gcagtaatcc tcctagagtc caagcagtct gttgtatgga gcagggtgtg L26; tgggttttct gggcccatca ttatggctgc ttcagagtca gaagaaagcc atagggcagt L32: gctc ctattgccta gcccctctcc ctttgtggct cccactctag ctgcctattt L38; ttgctcatca gctggtgagt cagtatgggc cagcagttct ccctccctaa gcta L44; ctttatgggt tagctttgca ggtttggtgg cttgaggggt gggggcaact caccactgcc 1501 aggtaactcc ctgaagggtg ggagtggatt atcttctagg ctcttacccg gggcatcaac actgtcttcc ttccattctc ctttccccca tcccatttag gggcagaagc acca accacacagt cttc tcctaagcac tgaatggggc gcaa gagtttttgc tgccctcccc agcgtggtca cagggttatt "74; gaactgcctg cacttgtttc tcatgcaact ccagcatttt ccccagaagt tgaactatgg L80; atagcagctt ggtatggatt tcctaaatct taacatttga agcagcttct tgaggctggc L86; aactatcctg gtttctgtct tggagggggt ggtttgtttg ctggggccca acgtctgtcc L92; caagtggtgg ggtgagagta agttaacttt ggtgccaggt gagaggtggg ggctctttgc L98; ttagactccc tatcatggaa agattggagt tttctatgca tgtg gaaaaggatt 204; gctgattctg actgaccctg atcagagaga ttaggattgt attttgacat aggatttgga 210; acccatctaa atgttgaagt tccctgagac agctctccag ctgctgagcc tgcgccaggg 216; gctaagcagc ccctaatgag aggctctgct ccca cctcgccaat gttgttgttg 222; tttt gatttgtatc ctctgttata gacatttttt aaaaacgatt tcctctttca 228; ttgtgcacaa gtgctgagag tctgaggccc catttctgct gtgtatatat atcctgactc 234; ggggctttta ttcagcaaac tgttcattct tctgtcagac aatgtcatat tcaactctgt 240; tcatattaaa tgaa gcaaaaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_001243114.1 LOCUS: NP_001243114 ACCESSION: NP_001243114 1 msdtavadtr rlnskpqdlt daygppsnfl eidifnpqtv gvgrarftty evrmrtnlpi 61 cvrr rysdfewlkn elerdskivv pplpgkalkr qlpfrgdegi eerr 121 qgleqfinki aghplaqner clhmflqeea idrnyvpgkv rq SYNCRIP Official Symbol: SYNCRIP Official Name: otagmin binding, cytoplasmic RNA cting protein Gene ID: 10492 Organism: Homo sapiens Other Aliases: RP1-3J172, P, GRYRBP, , NSAP1, PP68, hnRNP-Q Other Designations: NS1-associated protein 1; glycine- and tyrosine-rich RNA- binding protein; heterogeneous nuclear ribonucleoprotein Q Nucleotide seguence: NCBI Reference Seguence: NM_001159673.1 LOCUS: NM_001159673 ACCESSION : NM_001159673 1 cggcgtgagc ttcggccgcc attttacaac agctccactc gcgccggaca cagggagcag 6; cgagcacgcg tttcccgcaa cccgatacca tcggacagga tttctccgcc tcagcccaac 12; ggggagggct agttgcacat agtgatttag atgaaagagc tattgaagct ttaaaagaat 18; aaga cggtgcattg cttc aacagtttaa agacagtgat ctctctcatg 24; ttcagaacaa aagtgccttt ttatgtggag tcatgaagac ttacaggcag agagaaaaac ; aagggaccaa agat tctagtaaag gaccagatga ggcaaaaatt aaggcactct 36L tggaaagaac aggctacaca cttgatgtga ccactggaca gtat ggaggaccac 42; ctccagattc cgtttattca ggtcagcagc cttctgttgg cactgagata tttgtgggaa 48L agatcccaag agatctattt gaggatgaac ttgttccatt atttgagaaa gctggaccta 54; tatgggatct tcgtctaatg ccac tcactggtct caatagaggt tatgcgtttg 60; tcactttttg tacaaaagaa gcagctcagg aggctgttaa actgtataat aatcatgaaa 66; ttcgttctgg aaaacatatt ggtgtctgca tctcagttgc caacaatagg ctttttgtgg 72; gctctattcc taagagtaaa accaaggaac agattcttga agaatttagc aaagtaacag 78; agggtcttac agacgtcatt ttataccacc aaccggatga caagaaaaaa aacagaggct 84; tttgctttct tgaatatgaa gatcacaaaa ccca ggcaaggcgt aggttaatga 90; gtggtaaagt ctgg gggaatgttg gaactgttga atgggctgat cctatagaag 96; atcctgatcc tgaggttatg gcaaaggtaa aagtgctgtt tgtacgcaac cttgccaata L02; ctgtaacaga agagatttta gaaaaggcat ttagtcagtt tgggaaactg gaacgagtga 108; agaagttaaa agattatgcg ttcattcatt ttgatgagcg agatggtgct gaatggcaaa gacttggagg gagaaaatat tgaaattgtt gaaaaggaaa gaaagaaaag ctcagaggca agcagcaaaa atgacgatta ctactattat ggtccacctc atatgccccc tccaacaaga gtggaggtag aggtggttat ggatatcctc cagattatta tgaa gattattatg L38; attattatgg ttac cataactatc gtggtggata tgaagatcca tactatggtt L44; atgaagattt tcaagttgga gctagaggaa ggggtggtag aagg ggtgctgctc L50; catccagagg tcgtggggct gctcctcccc gcggtagagc cggttattca cagagaggag L56; gtcctggatc agcaagaggc gttcgaggtg cgagaggagg tgcccaacaa caaagaggcc L62; gcgggcaggg aaaaggggtc gaggccggtc ctgacctgtt acaatgaaga ctgacttgct L68; atgtgggatt acaccagaag cttgcagtgg agtaatggta aggaaatcaa gcaaccttaa L74; atatgtcggc tgtataggag catattctat tgcagaagac atga agatcatgga L80; atcaaatacg ggacattgaa ctaatacttg gactttgata tgaatttctt taacaatttt L86; ctctgcagtg atta aactaaagct actctatttt caaaatgtgt agaa L92; atccttcata actcctagca tggtatctta ataaagaata tttt aaaaatctgc L98; tctaagtaga tttttcccct aatt aaggatccca acagtggtat tttgaaatat 204; tctcttgaat ttgtgcattt aaattttatt gcagtggtat agatgaatgc cactgatggt 210: atccttaaat tttatttctg ctcaccaagg ttaatcatga ttgtctatat cttttttata 2161 gtgatcactt ttgaattgtg ttcagatatg cagtttcagg tgtaatcatc ggtt 222; agtcaggcat tccagatagt ggttcttttc agaacctttt taaaagggtt ggttaactac 228; ctcagtagca tgaa cctg tctgtactgt acatagaaaa tctttgtaga 234; taaaagcaag taaa tatgatatga gggtaagatt ttaatatacc aaca 240; ttcttagttg cctttagttt cagaggcttg taagacttcc tcatgaccat cataacaggc 246; cttgcttttg tcgtattttg tggctgaaaa agcagccttg cttcttcaga tattgtagtt 252; atttggatgt ataatagttt agcaagatgt tacttttgta agacatcaga tgttcaaaaa 258; agtgcatccg aacttgtact aaatactgca gtgtcccttt ataaaaagtc agactaaaac 264; tgacaattgt acagcgaagc ctgacatttg gatattttga agttttttca taaatcatag 270; aaattagtat atggctgtag tttagctttt aaag gtatgtttca ttagtgcatt 276; tcttcctgct gatcactgta aacatgtgaa tcagctttcc atttcttatg caggtcatga 282; taacttgtag agtagagtac aatcatttgt gctatgtttt taattttcta aagcaccttg 288; atgacagtga gtgtccagtg catc ctctattgaa ccaccctcaa aaattttttt 294; gccaagtcct atag cttaaagtaa aaaa ttatagtttc actt 300; ggtgtaaaga aatcccctcc ccccttcccc aaagggatac tgcagttata accc 306; aataggcacc acgatgaaga tcagagctta atta tata cacaccagtt 312; ccccagtaaa ttta acaagaaaat cagacatgtc atatgttcaa aatgctcatg 318; gcaaacaatc attttgcatt cctgcaaata aaattgtttt atactgtaag gcga 324; ctta tttttgtaat aaagttttta ttttttttat gtgtcattaa tataaatgtg 330; tgttagtgta gaaatcttct ggtttaaaaa cttagaattg cacacatttc agtatgttta 336; tttgtactta cataatttta tggt tgccaatagc ctgtatgttt cacattaatt 342; ggttttttgt aata aatcatttta gtatgttgta tgtcagttac agct 348; gggacataga gtgtaattta aaatttgtca ataagtattc attggaatat atgtaaatgt 354; gccttgccgg ttattgaaac ttatctacaa aatgagtatg gggtgacaaa aattagttcc 360; tggtgcttaa tgaaactttc tgccactgat tttatatatt accccgtgct tttttaaagt 366; acatctctct caaaacttag tgtaagtttg agggctacac aaaacattta catttcattc 372; taacataatg aatataatag gttgtggaaa gtgggtaaac taaatgtagc cttcagtaaa 378; attgaatctc agtgtaatcc ttggtgctgg catttctcag ttccgaggag ttaaatgatc 384; ccatctaaga ggtcattgcc atgcctattg gcactttact gtcatagcat ttttaaggga 390; cactgtcaag gtgtttaagt tctcagaatt acttgttggg attttaggac aggtttgttt 3961 acttaaagta agaactgcat tgtcaaagtt gaaagaggaa cacttttgtg agttcacaaa 402; tgtgttctta agaaaacatt aaaatatgga gctctgggtt ttcaagacta tttggcattc 408; tggg gacttgggag ggaaactgat aaaaagaaat attg atggttatac 414; ttaaagaagg gtaatgtaaa cagtggtgat gaaatatata cacatcaagt actt 420; gacagtgttc atttgaatga ctttgaattc aagccattat aattactttt aaaattaaat 426; atcatttgca ctgttctgat aatgggtgca gtttttgagc aatataatca gagctaaata 432; tgcatgtagt tgat gtgaacaatt aacgttctga gaagaaatac taactgtggt 438; attttcaaac ttaaatttct gtagtaaaat cagtatcaaa gtcttatcag atcaaggaaa 444; aacaggcaat gcatataaac atacttttga atgttgtgtg gcctataaag caataatgca 450; atgg aatgtcatgg gatatgagaa atggaaatgc aact aatcctttag 456; taaaaatgtc aacatgttaa agggggaatg ttaactaatg taggttattg ctatttgtga 462: tatg ggttcttggc tttgacagct tcaaagaatg gata agttaaaaga 468; aattttgtat attgtcaagg aaagggtctt gagt caagtccctt ccttggggta 474; aaaaatgtat tcttaaagca ttctgatgtt aaaaagaaaa cttaagttat ctaaccaaaa 480; caag attttgtttc tgcagactac ttggcaatca aaagtgatca taaatttagg 486; ttatcagttt tcagaaagtt gctttgtgag aaaattttgt tagatatatt ctcccaagca 492; tgctttttgt ggaaggtttt cagccattgc cactgaatca gatgttaaaa atgaagggaa 498; aattgagtgt gcacacacac aactgttgta cactcatgat tgcagttttt agaa 504; acttttctac cagttactgt gaatctgact taaaatgtaa agtttcctca tgataaaata 510; ggaacaacat agaaatggat tgatggggtg atctgagtta ttgtatataa aagtttttaa 516; agaatagaat gaacatcaag ctagataggc aaaaattgac acattcagaa cagctttttt 522; gactgcgaag ccaaaagttg tcagaaacag caaaagatcc atta cagagtattt 528; tacgtagtct ctattttaag gagagaaatt aaatagaagg gcttcatgca tttaggggag 534; ggtgctaaaa cttctcaagt tcgtcaaact tacaggaata cccaccatga tcattttctc 540; tctaattatg tataccacaa aattttcatc tggccatagg aattcactgg tgggtgtaaa 546; attaatgact aaagaaatta agtgacaaat aaga aacagacttg tggggatatt 552; aggt gtattaatta ctcagtgatg ataccactca atagggcatg ccactacttt 558; tcttaagatg ctaattatga agcagtgctc attt tttaactagc aaattagtag 564; atggactttt ggggtctgtc actttttaaa agtatttaag acttaaattc tattagcacc 570: tgcc ttcagtaata cacctaaaat atttttcagg accagaagca ttcagtttga 5761 aaatttgcag acca gtattattac taacgctctg ggtcaaagat taggttttta 582; atattaacag tagtctggta aatatttaga agtctggcat tgagaaacaa aagcttgtac 588; agta tttttattta ttag ttctgttagc ttatttaaat ttat 594; ttatccgtag tatt tatttcattc ctttcatctc actgaaaact gtctgcaggc 600; cctttgattt ggattagatg tgtgaagtac tgtcttttgc caaaaacctc aaattacctg 606; tcaa cgtagtgggt ttgtgcttgt ttggagatca gttcaaaaac tatctgtact 612; atctgtactg cctctgatgt taagatttta tgtatagcat aaggaagcta acta 618; tattttccta agaataaaga cctatttttg tagcatgtct taggatctcc aggagtccaa 624; gaattattgt gggtgtcctc caattcatca ctcttcactt aacagctttt aagtagacac 630; ttggaatctt tagaggtctg tcgccctttg attatccata cattcgaagt aactagccaa 636; tggtgaaaaa ttcctcaaga tatcctcagt tgcaatcaca gaag atgaatagaa 642: taaatgtatt aggctggtct taatttttga tggaaatatt ctgttgtccc gtacttgcca 648; ttggatttga taaagttagt ggtaatttgg aaagaatcgg ggacttgcca atatatttgt 654; gggttttagc ttatacccct aggatttctt ggttgcggga cgagcagttt tggccacttc 6601 catcaggaca agacttttta ggtcacttag tgcaggtttt agtttctatt ttggattaac 666; aacatttata ttgattatcg aaaagaagct ttcatcattt cagaacagtc gttt 672; gactttgagt gtgggagaag tcctaataaa ccattttgga aattaaaaaa aaaa seguence: NCBI Reference Seguence: NP_001153145.1 LOCUS: NP_001153145 ACCESSION: 153145 l mktyrqrekq gtkvadsskg pdeakikall ertgytldvt tgquyggpp pdsvysgqqp 6; svgteifvgk iprdlfedel agpi wdlrlmmdpl tglnrgyafv aaqe 12; avklynnhei rsgkhigvci svannrlfvg sipksktkeq ileefskvte 18; pddkkknrgf cfleyedhkt aaqarrrlms gkvkvwgnvg tvewadpied akvk 24; vlfvrnlant vteeilekaf squklervk klkdyafihf derdgavkam eemngkdleg ; enieivfakp pqurkerka qrqaaknqmy ddyyyygpph mppptrgrgr ggrggygypp 36; dyyqyedyyd yygydyhnyr ggyedpyygy edfqvgargr gaap srgrgaappr 42- qrgq pgsargvrga rggaqqqrgr gqgkgveagp dllq SAR13 Official Sym bol: SAR1 B Official Name: SAR1 homolog B (S. cerevisiae) Gene ID: 51128 Organism: Homo sapiens Other s: ANDD, CMRD, GTBPB, SARA2 Other Designations: 2310075M17Rik; GTP-binding protein B; GTP-binding protein SAR1 b; GTP-binding protein Sara; SAR1 a gene homolog 2 Nucleotide ce: NCBI nce Seguence: NM_001033503.2 LOCUS: NM_001033503 ACCESSION : NM_001033503 l gccggcccgg ctga tgcgaactgg ggccacggca gccatcgcgc tttgcagttc 6; ggtctcctgg tgtacggcca acgccaagta ggggattgcg ttccctccag tcgcagagtt 12; ttgt Cgcccaggct ggagtgaagt ggcacgatct cggcttactg caagctccgc 18; ctcccgggtt cacgccattc tcctgcctca gcctcccgag tagctgggac tacagaccct 24; atcagatttg gatatgtcct ttga ttggatttac agtggtttca gcagtgtgct ; acagttttta ggattatata agaaaactgg taaactggta tttcttggat tggataatgc 36; aggaaaaaca acattgctac acatgctaaa agatgacaga cttggacaac atgtcccaac 42; attacatccc acttccgaag aactgaccat catg acgtttacaa cttttgatct 48; gggtggacat gttcaagctc gaagagtgtg gaaaaactac cttcctgcta tcaatggcat 54; tgtatttctg gtggattgtg cagaccacga aaggctgtta aaag aagaacttga 60; ttcactaatg acagatgaaa ccattgctaa tgtgcctata cttg ggaataagat 66; cgacagacct gaagccatca gtgaagagag gttgcgagag atgtttggtt gtca 72; gacaacagga aaggggagta tatctctgaa agaactgaat gcccgaccct tagaagtttt 78; catgtgtagt gtgctcaaaa gtta cggagaaggc ttccgctgga tggcacagta 84L cattgattaa cacaaactca cattggttcc aggtctcaac gttcaggctt actcagagat 90; ttgattgctc aacatgcata acttgaattc cttt tgctggttat aaaacagatg 96L ttttttagat tattaatatt aaatcaactt aatg agaattgaaa actgattcaa 102; gtaagtttga gtatcacaat gttagctttc taattccata aaagtacttg gtttttacag 108; tttataatct gacatcaccc cagcgccatt tgtaaagagc ccag cagtacattt 114; gaagcacttt ttaacaacat gaaactataa accatattta aaagctcatc atgttaaatt L20; ttttatgtac ttttctggaa ctagttttta aattttagat tatatgtcca cctatcttaa L26; gtgtacagtt aataattagc ttattcaatg attgcatgat gccttacagt tttcaataac L32; tttttttctt atgcaaacgt catgcaataa aacaaactct aatgtttggc atccttgttg L38; ggcaaatgtt tcatttaaat tatc gtat actctgaaaa tttgagtatt L44; taatattagg catatgaagt tggg aaaggagatt ccttcagaat atttagaata L50; gcgtttaagg ctctcaaggc ttaggtattt accatgagat ggttgtggtc L56; gataccatgt atccctgaat tatt tttattagta atgcagccac tgct L62; qatqqqqtct gtgtcctagt ccctgtggga ttggccttct gaagaaaagc acatttatgg L68; tacacaaggt tgattctcag tttggtaggc tctatagttc catcccagct L74; gaagtgctcc attg gtgcccaatg gggtaatctg tgcctcttcc ccaaaagcaa L80; tgacaggcac ctcc acatttagat atattcccag tctctgtaca gggg caagcaataa gtaggacctt gtgtgcagtt taga tctgaggaat aaaacagctt gttctgtgtc agtctgagat atgtggagat tttgttccta L98; tcctatagct cttttgtatt ctttggcata ttttatatcc aaga aaacaagtgc 204; ttgtttccaa ttttcttttt tcttatttgc tctcaggtag ttcttactcc atacaacaaa 210; gactttttgt ggct tttttttttt tttttttttt gctctgtttc attttgtttt 216; agagacgggg tttcaccatg ttgtccaggc tggtctcaaa ctcctgacct caagtgatct 222; gcccgccttg gcctcccaaa gttctgggat tataggcgtg agctaccatg cctgacctga 228; tttt tagatattat catg tgatagggcc tgcaagcctc attgctaggc 234; ttactaagaa aattttagtt tttcaaaagc attataattt cctaagaaac tgaattcttt 240; ttttatatgt ttgagattcc catcattagt aatataagat gaaaggtaag tgccaaaaat 246; gtatttttaa tcaa gtttaagatt tatcctgatt ataagccaag ttttatagta 252; tatttaacca attccatcaa gaataatttt aatatcaaaa attagtgttt tctgtagcca 258; ttgtccatgt taca gtcctttttg tcattgataa tataacctat gaagcagata 264; aggattgagg aatatgactg gaaggaatta ctatttagct aagctgacaa ggtcgcttct 2701 taagatgaca tttggtttca gtaatctgac tattctgttt tcactttcat cttctttcta 276; aatgaaaaca aaagtgctcc ctcccttcct ggaaacctca gtaacactat gggaaaagta 282; gaacatgaca ttgcagccta ttgatttctt cttccagata ggtttaaagt actccttaag 288; ttctgactaa atagaactaa gccttattaa aaataactgc ctcttgttca tgttatctgt 294; accttcaggg cttt gtat ttcctagagt atctattatg atactgaaga 300; agctaattat tgta aatgggtata aatgaaaaaa aaacatactg gtttctctag 306; ccaggaaaaa ctgg tgtaatatat cttgctccag aaccctcatt ctaattgtaa 312; cactaggatc aaagaaacaa agtcactttg tggaccacag ctaaactgtg gatattttcc 318; caaagacata agatttttat ggcccgagcc tctagaaagg aagccatgtt taggagcaac 324; cagctttcct cccagcttta gggggcagag ttcctgagcc agaggactta ctgtccagct 330; ttgagaacct ctccagagta tatgcactgg gtactgctct ttttcaagag atta 336: agaggatggc aagaaacagt agaagcacag aggaaagaca actctgcatg tgcctgtgtg 342; aatgtgtgca tccatggagt atttcccagg taaatactag tactggggac ataggctaat 348; tgtgtgtccc acactgcaag atgctagggc gtagttaaca ctgtggtata catacaaatc 354; aggcactgtc caaaagattt atct aaagtctgaa atgtaaaaat ataaggtctc 360; cttt tacactttta aagagatccc atacctgttt cactgactgc cgttaattac 366; acttttggat cacagctggt taaattgata agtt tatctcagtg aatttttaga 372; atggagatta tagcattttt taattggaga acagacattt cctaaagtat atgaaaaaaa 378; attattcact gttggtttaa accagtatct ttgtatgagt gccaaagata tatgaacaca 384; gatactgcct gtgcagacct aaattttagt tttgtgtacc cata tacaaatttt 390; ttgtggttta tagcataaaa cgtt gtttctttct tagttttcaa ccggctcatc 396; ttttgttttt gttttttgtt tttttgtttt ttgttttttt tgagatggag tcttgctttg 402; ttgcccaggc tagagtgcag tggcacaatc tctgcttact gcaacctcca cctccggggt 408; tcaagtgatt ctcctgcctc agcctcctga gtagctggga ctacaggcgc gcaccaccat 414; gcccggctaa tttttgtatt tttagtagag atggggtttc actatgttgg ccaggctggt 420; ctcc tgacctcgtg atctgcccac ctcagcctcc caaagtgctg cagg 426; cgtgagccgc catgcccggc caattttcaa ctggcccata ctttatagtg atggaaagcg 432; cataaactac ttgtaaatca ttaaaatagg gtgataactg tgataatagt gtttcttgca 438; ttctagaaaa ttattttatt attc aaaacccagc atttcacagg ttccatcatt 444: agta tagttctagt taacatgatt ggagagtttc aagg tttacatttt 4501 ctgt atttggtatg aatg tggtatttca gtcttgttag tcacttacat 456; gactgacgtt tgcaaggatt tattgccaag taaaatttga ccagagtgca atag 462; ctacataagg ggaaatctct caaaattcct tctgttcatt taatttggag catattgttt 468; aaatcatttt aaacatatgt aaaaagttga agcattaaaa caag aaacaatgaa 474; aaaatagaaa ttagcaaaca taagtttctt aatgcaaaat taatagtgaa taaaatatag 480; cctacattaa aagccagagg ctttgctata aatataagag tttagaaaaa cagtgtgctt 486; caattaagga ctaaattatc aaaactgcat gtttgttttt tcttttcttt tctttttttt 492; ttgagatgaa gtctcactct gttgcccagg ctggagtgca gtggtgcgat tcac 498; tgcaacctct gcttcccagg ttcaagcgat gcct caccatcccg tggg 504; attacaggtg caccacacca tgcccagtta atttttgtat taga gacagggttt 510; cattatttgg ccaggctggt ctcc tgatcttaag tgatccactc gcctcggcct 516: cccaaagtgc tgggattata agcc actgtgccca gcctaaaact acatgttgaa 522; gcttccgqtc atta ttatccttct tttgaaattc aagttagtgc tttttaacca 528; aataaaagaa gaaccagctc tatg tgactctgcc tctgtataaa gtgactggaa 534; ttttgttaaa accgtgtttc cacttctgaa ccctgttacc cctc acaaatcccc 540; acccaacacc tggattttaa agatcctcca gtgtcaaggg aagccacaga gtctattaaa 546; gaggcagttc tgaaccaatt aatttttgtc cttataattt agagcattaa aata 552; tatttaatgg cactaattgt tgttcacggc tttcatcata cttttaaaca gaatccaaag 558; tattcaaagg aaagtaagcg aagttatcca aagccaactt tgtttcaggt cctg 564; ccccaaatag attttagggc agaaatagaa aactgagttt acacagaact atttttggaa 570; aagctgcact ggagtagatg gattcttctt cagcatactt ttttgtttgt ttgtttgaga 576; tggagtcttg ctttgtcacc caggctggag tgcagtggtg tgatctccac tcactgcaac 582; ctccacctcc cagcttcaag tgattctcct gcctcaacct tccaagtagc ttggattaca 588; cgcc accacagctg gctaatattt gtattgttag tagagacagg gtttcaccat 594; gttgtccagg cttgtcgaac ttctgacctc acgtgatcca cctgcctcag cctcccaaag 600; tgctagatta taggcgtgaa ccactgcgcc cggccagcat gcattttaaa agtggcttag 606; atttagtttt aaatattttg gggtgaaagg caggaacagt tctgtttttg acatacaggt 612; tttctttggg attgttttca ttttcaagta tagattcatg tcagaatggc caacttaacg 618; tgggtttctg tattccctgg tgttgctctt aacctgaact cagt tgccatactg 624: aggcaagagc actcagggtg aacatagtca agttacttta aaagtgataa aagtgttttt 6301 ccatggtgaa accttcagta tttggctgaa tgtaaagtat gttgaagtgg gatg 636; gtaagttgtt aatcactaac cttgtttgca cttttgtaca ccactgcttg cactaggatc 642; ttggtgtgaa ttttcaattg ttttacagtg tatacagatt attaaggata atttatataa 648; agatgtttct gtttaacttt gtgtgtttta caacaaagag ctataataga tggttaaacg 654; tttttgaatt gtgtttatat gttagtttga ttatgttcta ttatcttttc acctgccatg 660; aatttgagtg ttaggaaggg aaaaataaaa tctg gtcttgaaga a Protein seguence: NCBI nce Seguence: NP_001028675.1 LOCUS: NP_001028675 ACCESSION: NP_001028675 l msfifdwiys gfssvqulg lykktgklvf lgldnagktt llhmlkddrl gqhvptlhpt 61 seeltiagmt fttfdlgghv qarrvwknyl paingivflv dcadherlle skeeldslmt 121 vpil ilgnkidrpe aiseerlrem fglygqttgk gsislkelna rplevfmcsv 181 1qugygegf id CCDC47 Official Symbol: CCDC47 Official Name: coiled-coil domain containing 47 m: 57003 Organism: Homo sapiens Other Aliases: GKOOf, MSTP041 Other Designations: coiled-coil domain-containing protein 47 Nucleotide seguence: NCBI nce Seguence: NM_020198.2 LOCUS: NM_020198 ACCESSION : NM_020198 1 attatgtaat tttcccaaaa gccccacctc gcctcagccg ggcgggagag agggaggtct 61 ttcc ccggggttgc gtcccgcccc gcaggctgcg cgcaggcgct gacgagccgc 121 tcgcattcta cgtaacggac ggcggaggct acgtgaagag aggcgcggcg tgactgagct 18; ctgg ctgcgtccta gaggcatccg taaa accgctgcga tcgcggaggc 24; ggcggccagg ccgagaggca ggccgggcag gggtgtcgga cgcagggcgc tgggccgggt ; ttcggcttcg gctt tttttctcaa ggtgcaatga aagccttcca cactttctgt 36; gttgtccttc tggtgtttgg gagtgtctct gaagccaagt ttgatgattt tgaggatgag 42; gaggacatag tagagtatga tgataatgac ttcgctgaat ttgaggatgt catggaagac 48; tctgttactg aatctcctca acgggtcata atcactgaag atgatgaaga tgagaccact 54L ttgg agga tgaaaaccaa gaaggagatt ttgaagatgc agatacccag 60; gagggagata ctgagagtga accatatgat gatgaagaat ttgaaggtta tgaagacaaa 66L ccagatactt cttctagcaa aaataaagac ccaataacga ttgttgatgt tcctgcacac 72; ctccagaaca gctgggagag ttattatcta gaaattttga tggtgactgg tctgcttgct 78; tatatcatga attacatcat tgggaagaat aaaaacagtc caca ggcctggttt 84; cata gggagctttt ggagagcaac tttactttag tgggggatga tggaactaac 90; aaagaagcca caagcacagg aaagttgaac caggagaatg agcacatcta gtgg 96; tgttctggtc gagtgtgctg tgagggcatg cttatccagc tgaggttcct caagagacaa L02; gacttactga atgtcctggc ccggatgatg gtga gtgatcaagt gcaaataaaa L08; gtaaccatga atgatgaaga catggatacc tacgtatttg ctgttggcac acggaaagcc L14; ttggtgcgac tacagaaaga gatgcaggat ttgagtgagt tttgtagtga taaacctaag L20; tctggagcaa agtatggact gccggactct ttggccatcc tgtcagagat gggagaagtc L26; acagacggaa tgatggatac ggtt cactttctta cacactatgc tgacaagatt L32; gaatctgttc atttttcaga ccagttctct ggtccaaaaa ttatgcaaga ggaaggtcag L38; cctttaaagc acac taagaggaca ctgttgttta catttaatgt gcctggctca L44; ggtaacactt acccaaagga tatggaggca ctgctacccc tgatgaacat ggtgatttat L50; tctattgata aagccaaaaa gttccgactc aacagagaag gcaaacaaaa agcagataag L56; aaccgtgccc gagtagaaga gaacttcttg aaactgacac atgtgcaaag acaggaagca L62; gcacagtctc ggcgggagga gaaaaaaaga aagg agcgaatcat L681 gatcctgaga aacagcgcag gctggaggag ttga ggcgtgagca aaagaagttg L74; gaaaagaagc aaatgaaaat gaaacaaatc aaag ccatgtaaag L80; agtt ctgatgccac ctgtaagctc tgaattcaca ggaaacatga aaaacgccag L86; tccatttctc aaat ttcagacagt cttgggcaac tgagaaatcc ttatttcatc L92; atctactctg tttggggttt ggggttttac agagattgaa gatacctgga aagggctctg 198; tttcaagaat ttttttttcc agataatcaa attattttga ttattttata aaaggaatga 204; tctatgaaat ctgtgtaggt tttaaatatt ttaaaaatta taatacaaat catcagtgct 210; tttagtactt cagtgtttaa agaaataccg tgaaatttat aggtagataa ccagattgtt 216; gctttttgtt taaaccaagc agttgaaatg gctataaaga ctgactctaa attc 222; tgcaaataat gattggaatt gcacaataaa ttga tgttttcttg tatgtctaca 228; ttaaacttga gaaaaagtaa aaattagaac tgta gtaatgaaat ttcagggacc 234: cagaacataa tgtagtatat gtttttaggt gggagatgct aaaa ttaataggaa 240; gtctgtaggc attaggatac tgacatgtac atggaaaatt ctagggacag gagcatcatt 246; ttttccttac ctgataccac gaaccagtga caacgtgaat gctgtatttt aagtggttgt 2521 atgtttattt tcttgagtaa catg aaaaattaat gcttcaccta ggtaagatca 258; ttggtctgtg tgaaatcaca aatgtttttt ccttcttggt agcc tggtggatgt 264; tcatggagaa ttct ctatattatg gctgtgtgcc tctc cctctgcttt 270; tatcttttcc acagttgagg ctgggtatgt tctttcaaag ccat gaatatgtgt 276; aagtatactt ttgaaaatga gctttcctaa actattgaga gttctttcca cctcttgcgg 282; aaccaactct tggaggagag gcccatgtat ctgcacgagc acttagcttg ttcagatctc 288; tgcattttat aaatgcttct taccaagaaa ttag gtcattgctt gtaccaggta 294; atttttgccg gggatgggta agggttgggt tttctggtgg gagtggggtg gtgggtattt 300; tttgttgatg ctttagtgca ggcctgttct gaggcaataa caagttgctg cagc 306; atgtgctgct gcctttgtaa ctgcatggaa acttttcaca tgggtttttc tccaagttaa 312; tacagaaata tgtaaactga gagatgcaaa tgtaatattt ttaacagttc atgaagttgt 318; tattaaaata actaacataa aacttaatta ctttaatatt atataattat agtagtggcc 324; ttgttttaca aacctttaaa ttacatttta gaaatcaaag ttgatagtct tagttatctt 330; ttgagtaaga aaagctttcc taaagtccca tacatttgga ccatggcagc taattttgta 336; acttaagcat gaac tacctatgga catctattaa agtgattgac aaaatctcaa 342; aaaaaaaaaa aaaaaaaaaa aaaaa n seguence: NCBI Reference Seguence: NP_064583.2 LOCUS: NP_064583 ION: NP_064583 l mkafhtfcvv llvfgsvsea kfddfedeed iveyddndfa efedvmedsv tequrviit 6; eddedettve lequenqeg dfedadtqeg dtesepydde efegyedkpd tsssknkdpi 121 tivdvpahlq nswesyylei lmvtgllayi mnyiigknkn wfnt hrellesnft 181 lvgddgtnke atstgklnqe nehiynlwcs grvccegmli udl lnvlarmmrp 241 vsdqvqikvt mndedmdtyv favgtrkalv rlqkequls efcsdkpksg akyglpdsLa ; ilsemgevtd gmmdtkmvhf lthyadkies vhfsdqfsgp kimqeegqpl klpdtkrtLl 36- ftfnvpgsgn typkdmeall plmnmviysi dkakkfrlnr egquadknr arveenflkl 421 tthrqeaaq krae kerimneedp equrleeaa lrreqkklek kqumkqikv 48L kam PSMD12 Official Symbol: PSM D12 Official Name: proteasome (prosome, macropain) 26S subunit, non-ATPase, 12 Gene ID: 5718 Organism: Homo sapiens Other Aliases: an5, p55 Other Designations: 26S proteasome non-ATPase regulatory t 12; 268 proteasome regulatory subunit RPN5; 268 proteasome regulatory subunit p55 Nucleotide seguence: NCBI Reference Seguence: NM_002816.3 LOCUS: NM_002816 ACCESSION : NM_002816 494 XM_946044 XM_946047 XM_946049 XM_946052 055 XM_946058 1 ctgagcgggt gcaa cttccggtgt gggtgacgag tggtggccga ggga 61 cagcaaggga ggcg gggaccatgg cggacggcgg ctcggagcgg gctgacgggc 121 gcatcgtcaa gatggaggtg gactacagcg tgga ccta cccgagtgtg 181 cgaagctagc caaggaagga agacttcaag aagtcattga aacccttctc tctctggaaa 241 agcagactcg tactgcttcc gatatggtat cgacatcccg tatcttagtt gtga 301 agatgtgcta tgaggctaaa gaatgggatt tacttaatga aaatattatg cttttgtcca 36; aaaggcggag tcagttaaaa gttg ccaaaatggt gtgc tgtacttatg 42; ttgaggaaat cacagacctt cctatcaaac ttcgattaat tgatactcta cgaatggtta 48; ccgaaggcaa gatttatgtt gaaattgagc gtgcgcgact gactaaaaca ttagcaacta 54; taaaagaaca aaatggtgat gtgaaagagg cagcctccat tttacaggag ttacaggtgg 60; aaacctacgg gtcaatggaa aagaaagagc gagtggaatt tattttggag caaatgaggc 66; tctgcctagc tgtgaaggat cgaa cacaaatcat cagcaagaaa attaacacca 72L aatttttcca ggaagaaaat acagagaaat taaagttgaa gtactataat ttaatgattc 78L agctggatca acatgaggga ttgt ctatttgtaa gcactacaga tatg 84L atactccctg tatacaggca gaaagtgaaa aatggcagca ggctctgaag agtgttgtac 90; ttat cctggctcct tttgacaatg aacagtcaga tttggttcac cgaataagtg 96; gtgacaagaa gttagaagaa aaat acaaggatct tttaaagctt tttaccacaa L02; tgat gcgttggtcc acacttgttg aggactatgg aatggaatta agaaaaggtt L08; cccttgagag tcctgcaacg gatgtttttg gttctacaga ggaaggtgaa aaaaggtgga L14; aagacttgaa gaacagagtt gttgaacata atattagaat aatggccaag tattatactc L20; ggataacaat gaaaaggatg gcacagcttc tggatctatc tgttgatgag tccgaagcct L26; ttctctcaaa tctagtagtt aacaagacca tctttgctaa agtagacaga ttagcaggaa L32; ttatcaactt ccagagaccc aaggatccaa ataatttatt aaatgactgg tctcagaaac L38; tgaactcatt tctg gttaacaaaa ctacgcatct catagccaaa atga L44; tacataatct acaataaggg tcttagtgct ttagaaaaaa gttaaaattg gaagtcatta L50; aaaaaagact gttataatgg tgtatatgtt ggggtttttt ttctaagctt ctttgtctta L56; aattttaaaa atat gtttgagact ccctttgacc ttcc ccaagttcat L62; tgttaacttt gcatttgcaa ttggtgcaaa aatacagatt tctgtcgtct gaatacacaa L68; aaagttgtgt cataacttac ccagatatgt ttttctatca tttgaaacct ctac L74; tgtttgtttt cattcaacta atat tccaataata aaagcagtat atacataaaa L80; aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_002807.1 LOCUS: NP_002807 ION: NP_002807 XP_947587 XP_951137 XP_951140 XP_951142 145 148 XP_951151 1 madggserad grivkmevdy satvdqupe caklakegrl qevietllsl ekqtrtasdm 6; vstsrilvav akew dllnenimll skrrsqlkqa vakquqcct yveeitdlpi 12; klrlidtlrm vtegkiyvei erarltktla tikeqngdvk eaasilqelq vetygsmekk 18; ervefileqm rlclavkdyi rtqiiskkin tkffqeente ynlm iqldqhegsy 241 lsickhyrai ydtpciqaes ekwqqalksv vlyvilapfd neqsdlvhri sgdkkleeip 301 kykdllklft tmelmrwstl vedygmelrk gslespatdv fgsteegekr wkdlknrvve 361 hnirimakyy tritmkrmaq lldlsvdese aflsnlvvnk tifakvdrla giinfqrpkd 42; pnnllndwsq klnslmslvn ktthliakee mihnlq fl5F1 Official Symbol: ATP5F1 Official Name: ATP synthase, H+ orting, mitochondrial Fo complex, subunit B1 Gene ID: 515 sm: Homo sapiens Other Aliases: RP11-552M11.5, PIG47 Other Designations: ATP synthase B chain, mitochondrial; ATP synthase subunit b, mitochondrial; ATP synthase, H+ transporting, mitochondrial F0 complex, subunit B1 ; ATP synthase, H+ orting, mitochondrial F0 complex, subunit b; ATPase subunit b; H+-ATP synthase subunit b; cell proliferation- ng protein 47 Nucleotide seguence: NCBI Reference Seguence: NM_001688.4 LOCUS NM_001 688 ACCESSION NM_001688 1 gggc cgccgggggc actagggggg gtggggtttc cttccgcatc tccacggttc 61 caactccaac ctagactcaa actggacgcc ggccggagac tccgctccgg cagcaaaccc 121 cacgtggtgc acctctgagc ctccgcccct aggg aaccgcaact ctacttctcg 181 cgagaattgc ttctatggct ccatcctgct ttccggctgt cgccctcatg cgataggctc 24; tcagcgttac ttgactcttc tcgcgataat tttttttaaa aatctcccaa ggaaagttga ; aggaagagta caaaattttc atctcgcgag acttgtgagc ggccatcttg gtcctgccct 36; gacagattct cctatcgggg tcacagggac gctaagattg ctacctggac tttcgttgac 42; catgctgtcc cgggtggtac tttccgccgc cgccacagcg gccccctctc tgaagaatgc 48; agccttccta ggtccagggg tattgcaggc aacaaggacc tttcatacag ggcagccaca 54; ccttgtccct gtaccacctc ttcctgaata aaaa gttcgttatg gactgatccc 60L tgaggaattc ttccagtttc tttatcctaa aactggtgta acaggaccct atgtactcgg 66L aactgggctt atcttgtacg ctttatccaa agaaatatat gtgattagcg cagagacctt 72; cactgcccta tcagtactag gtgtaatggt aatt aaaaaatatg gtccctttgt 78L tgcagacttt gctgataaac tcaatgagca aaaacttgcc caactagaag aggcgaagca 84; ggcttccatc caacacatcc agaatgcaat tgatacggag aagtcacaac aggcactggt 90; tcagaagcgc cattaccttt ttgatgtgca aaggaataac attgctatgg ctttggaagt 96; tacttaccgg gaacgactgt atagagtata taaggaagta aagaatcgcc tggactatca L02; tatatctgtg atga tgcgtcgaaa ggaacaagaa cacatgataa attgggtgga L08; gaagcacgtg gtgcaaagca caca gcaggaaaag gagacaattg ccaagtgcat L14; tgcggaccta aagctgctgg aggc tcaagcacag ccagttatgt aaatgtatct L20; attg agacagctag aaacagttga ctgactaaat ggaaactagt ctatttgaca L26; aagtctttct gtgttggtgt ctactgaagt tatagtttac ccttcctaaa aatgaaaagt L32; ttgtttcata tagtgagaga acgaaatctc tatcggccag tcagatgttt ctcatccttc L38; ttgctctgcc tttgagttgt tccgtgatca aata agcagtttgc ctttataaaa L44; acttgctgcc tgactaaaga ttaacaggtt atagtttaaa tttgtaatta attctaccat L50; cttgcaataa agtgacaatt gaatgaaaca gggtttttca agttgtataa ttctctgaaa L56; gctt tatg ggtaaaaatt aaagatgtca ttgaactact ttta L62; tgagaccatt cagtggtgaa ctgtttctgg ggtt atgagatatg taaagctttc L68; tctt aaaataacta aatggagtat tatatatcaa ttcatatcat tgactttatt L741 attttagtag tatgcctata gaaaatatta tggactcaga gtgtcataaa ctta L80; agaatccatg cagcaggcca ggcacagtgg ctcacacctg taatgcctgc gccgagacag gcggatcact tgaggtcagg agtttgaaac cagccaggcc aacacagtga L92; gtct ctactaaaaa tacaaaaggt tagccgggca cagg cccagctact caggaggctg gaga attgcttgaa cgcaggaggc aaaggttgca 2041 gtgagctgag atcacgccac tgcactccag cctgggcaac agacctcgac tccatctaga 2101 aaaaaaaaaa aaaaaa n seguence: NCBI Reference Seguence: 679.2 LOCUS NP_001679 ACCESSION N P_001679 l mlsrvvlsaa ataapslkna aflgpgvlqa trtfhtgqph 1vpvpp1pey ggkvryglip 61 eefqulypk tgvtgpyvlg tglilyalsk eiyvisaetf talsvlgvmv ygikkygpfv 121 adfadklneq klaqleeakq qnai dteksqqalv fdvq rnniamalev 181 tyrerlyrvy kevknrldyh iqunmmrrk eqehminwve khvvqsistq qeketiakci 241 adlkllakka qaqpvm Official Symbol: CMPK1 Official Name: cytidine monophosphate (UMP-CMP) kinase 1, cytosolic m: 51727 Organism: Homo sapiens Other Aliases: RP11-511l2.1, CMK, CMPK, UMK, PK, UMPK Other Designations: UMP-CMP kinase; UMP/CMP kinase; cytidylate kinase; deoxycytidylate kinase; uridine monophosphate ; uridine monophosphate/cytidine monophosphate kinase Nucleotide seguence: NCBI Reference ce (variant 1): NM_016308.2 LOCUS NM_016308 ACCESSION NM_016308 l gacagggccg cggacgcccg ggcagccacg gggc cgcggcgggc gccggctcag 61 cccgcccctt tctcccgccg cctccccgcc ccgccccgcg ccgcgccggc cgctgtcagc 121 tccctcagcg tccggccgag gcgcggtgta tgctgagccg ctgccgcagc gggctgctcc 181 tggg ccttagcttc ctgctgcaga cccgccggcc cctc tgctctccac 241 gtctcatgaa gccgctggtc gtgttcgtcc tcggcggccc cggcgccggc aaggggaccc ; agtgcgcccg catcgtcgag aaatatggct acacacacct ttctgcagga gagctgcttc 36; gtgatgaaag gaagaaccca cagt atggtgaact tattgaaaag tacattaaag 42; aaggaaagat tgtaccagtt acca tcagtttatt aaagagggaa atggatcaga 48; caatggctgc caatgctcag aagaataaat tcttgattga tgggtttcca agaaatcaag 54; acaaccttca aggatggaac aagaccatgg atgggaaggc agatgtatct ttcgttctct 60; tttttgactg taataatgag atttgtattg aacgatgtct tgagagggga aagagtagtg 66L gtaggagtga tgacaacaga gagagcttgg aaaagagaat tcagacctac cttcagtcaa 72L caaagccaat tattgactta tatgaagaaa tggggaaagt caagaaaata gatgcttcta 78L ttga tgaagttttt gttg tgcagatttt tgacaaggaa ggctaattct 84L aaacctgaag gcatccttga aatcatgctt gaatattgct ttgatagctg ctatcatgac 90; ccctttttaa ggcaattcta atctttcata actacatctc aattagtggc tggaaagtac 96; atggtaaaac aaagtaaatt tttttatgtt cttttttttg gtcacaggag tagacagtga L02; attcaggttt aacttcacct tagttatggt gctcaccaaa cgaagggtat ttttaaaatt caaaaagaat atccctttta tagtttgtgc cttctgtgag caaaactttt L14; tagtacgcgt atatatccct ctagtaatca caacatttta ggatttaggg atacccgctt L20; cctctttttc ttgcaagttt tcca accttaagtg tgga ccaaatttca L26; aaggaacttt ttgtgtagtc agttcttgca caatgtgttt ggtaaacaaa ctcaaaatgg L32; attcttagga gcattttagt gtttattaaa taactgacca tttgctgtag aaagatgaga L38; aaacttaagc tttgttttac tacaacttgt acaaagttgt atgacagggc atattctttg L44; cttccaagat tggg aggg gttcagagcc tggcagaatt gtcagcttta L50; gtctgacata atctaagggt caag gatcacatct aatgcttgtg ttccttatac L56; tctattatat agtgttattc atgattcagc tgatcttaac aaaattcgta gaac L62; atgc atgtggctag atttatgcta aaatgattct cagttagcat tttagtaaca L68; aggt ttttttttgt ttgttttcta gacttaataa aagcttagga ttaattagaa L74; gaagcaatct agttaaattt cccatttgta ttttattttc ttgaatactt aaaagattta aaaatcattg cactttggtc agaaaaataa ttgattccct ctat ttttattcag tagatttttg ccgaaagata aatgattttt aaaaggctat agagtccaaa L98; tttacaccaa ttcttccttt ctct gaggaatttg cctt actttttttt 204; cttctgtcac aagt ggtatccgag gttcttaata tgagatttaa aatcttaaaa 210; tgtttcttat tttcagcact tacatcattt ggtacacagg gtcaaatagg gcaaataatt 216; ttgtctttgt gatt tgatatttaa agtcactgga aataggacaa gttaatggat 222; gtttttatat tttaatagaa tcatttattt ctatgtgtta tgaaattcac ttaatgataa 228; atttttcaac atacttgcca ttagaaaaca aagtattgct aagtactata acatattggc 234; cactaaaatt catattgaga ttatcttggt gaag agataggaat gagttcttat 240; ctagtgttgc aggccagcaa atacagaggt ggtttaatca aacagctcta gtatgaagca 246: agagtaaaga ctaaggtttc gagagcattc ctactcacat aagtgaagaa atctgtcaga 252; tcta aatatttata gtgagattgt gaaagcaacc ttaaagtttt gaagaagact 258; gatgagacta ggtgctttgc ttcctttcat caggtatctt tctgtggcat ttgagaacag 2641 aaaccaagaa taat tactaaatta tttg ctttttgttt aagt 270; acat gttggcaaca ttgagttttg gagttgattg agataatatg acttaactag 276; catt ccatttgtta aagatacagt caccaagaat gttttgagtt ttttgaaaga 282; ccccaattta agccttgctt atttttaaat tatttccatt cagtgatgtt ggatgtatat 288; cagttattta gtaaataatc tcaataaatt ttgtgctgtg gcctttgcta aaaaaaaaaa 2941 aaaaaaaaaa aaaaaa Protein ce (variant 1): NCBI Reference Seguence: NP_057392.1 LOCUS NP_057392 ACCESSION NP_057392 1 mlsrcrsgll fllq trrpillcsp rlmkplvva 1ggpgagkgt qcarivekyg 61 ythlsagell pdsq ygeliekyik egkivpveit isllkremdq tmaanaqknk 121 flidgfprnq dnlqgwnktm dgkadvsfvl ffdcnneici erclergkss grsddnresl 181 ekriqtqus tkpiidlyee mgkvkkidas fdev vqifdkeg COX631 Official Symbol: COXSB1 Official Name: cytochrome c oxidase subunit Vlb polypeptide 1 (ubiquitous) m: 1340 Organism: Homo sapiens Other Aliases: COXGB, COXG, COXVIb1 Other Designations: COX Vlb—1 ; cytochrome c oxidase subunit 681 Nucleotide seguence: NCBI Reference Seguence: 863.4 LOCUS NM_001 863 ACCESSION NM_001863 l tgggcgtggc gact tcagtggcct cctcctggga gggagctgaa gccgctcgca 6L agactcccgt agtccccacc tctctcagct tccggctggt agtagttccg cttcctgtcc 12L gactgtggtg tctttgctga gggtcacatt gagctgcagg ttgaatccgg ggtgccttta 18; ggattcagca ccatggcgga agacatggag accaaaatca acaa gaccgcccct 24; tttgacagcc ccaa ccagaaccag aact gctggcagaa ctacctggac ; ttccaccgct gtcagaaggc aatgaccgct aaaggaggcg ctgt atgg 36; taccagcgtg tgtaccagtc cctctgcccc acatcctggg tcacagactg ggatgagcaa 42; nggctgaag gcacgtttcc cgggaagatc tgaactggct gcatctccct ttcctctgtc 48; ctccatcctt ctcccaggat ggtgaagggg gtac ccagtgatcc ccaccccagg 54; atcctaaatc atgacttacc tgctaataaa aactcattgg aaaagtgaga Protein seguence: NCBI Reference Seguence: NP_001854.1 LOCUS 854 ACCESSION N P_001854 l maedmetkik nyktapfdsr fpnqnqtrnc quyldfhrc qkamtakggd isvcequrv 61 yqslcptswv tdwdeqraeg tfpgki CTSA Official Symbol: CTSA Official Name: cathepsin A Gene ID: 5476 Organism: Homo sapiens Other Aliases: RPS-337018.1, GLB2, GSL, NGBE, PPCA, PPGB Other Designations: beta—galactosidase 2; beta-galactosidase protective protein; oarboxypeptidase C; oarboxypeptidase L; oarboxypeptidase Y-Iike kininase; oarboxypeptidase-L; deamidase; lysosomal oarboxypeptidase A; lysosomal protective protein; protective n cathepsin A; urinary kininase Nucleotide seguence (variant 1): NCBI nce Seguence: 308.2 LOCUS NM_000308 ACCESSION NM_000308 l cacc cgaatccacg ggctcggagg cagcagccat ctctcggcca tagggcaggc 6L gcgc cgggggctat tttgggcggc gggcaatgat ggtgaccgca aggcgacctt 12; gtaaggcatt tcccccctga ctcccttccc cgagcctctg cccgggggtc ctagcgccgc l8; tttctcagcc ccta caacttagcc gtccacaaca ggatcatctg atcgcgtgcg 24; cccgggctac gatctgcgag gcccgcggac cttgacccgg cattgaccgc caccgccccc ; caggtccgta gggaccaaag aaggggcggg aggaagactg tcacgtggcg ccggagttca 36; cgtgactcgt acacatgact tccagtcccc gggcgcctcc tggagagcaa ggacgcgggg 42; agat gatccgagcc gcgccgccgc cgctgttcct gctgctgctg ctgctgctgc 48; tgctagtgtc ctgggcgtcc cgaggcgagg cagcccccga ccaggacgag atccagcgcc 54; tccccgggct ggccaagcag ccgtctttcc gccagtactc cggctacctc aaaggctccg 60; gctccaagca cctccactac tggtttgtgg agtcccagaa ggatcccgag aacagccctg 66; tggtgctttg gctcaatggg ggtcccggct gcagctcact agatgggctc ctcacagagc 72; atggcccctt cctggtccag ccagatggtg tcaccctgga gtacaacccc tattcttgga 78; ttgc caatgtgtta tacctggagt ccccagctgg ggtgggcttc tcctactccg 84; agtt ttatgcaact aatgacactg aggtcgccca gagcaatttt gaggcccttc 90; aagatttctt ccgcctcttt ccggagtaca agaacaacaa acttttcctg accggggaga 96; ctgg catctacatc ctgg ccgtgctggt catgcaggat atga 102; accttcaggg gctggctgtg ggcaatggac tctcctccta tgagcagaat gacaactccc 108; tggtctactt tgcctactac catggccttc tggggaacag gctttggtct tctctccaga 114; cccactgctg aaac aagtgtaact tctatgacaa caaagacctg gaatgcgtga 120; ccaatcttca ggaagtggcc cgcatcgtgg gcaactctgg catc tacaatctct 126; atgccccgtg aggg gtgcccagcc attttaggta tgagaaggac actgttgtgg 132; tccaggattt gggcaacatc ttcactcgcc tgccactcaa gcggatgtgg catcaggcac L38; tgctgcgctc aggggataaa gtgcgcatgg accccccctg caccaacaca acagctgctt L44; ccacctacct caacaacccg tacgtgcgga aggccctcaa ggag cagctgccac L50; aatgggacat gtgcaacttt ctggtaaact tacagtaccg ccgtctctac cgaagcatga 156; actcccagta tctgaagctg cttagctcac agaaatacca gatcctatta tataatggag L62; atgtagacat ggcctgcaat ttcatggggg atgagtggtt tgtggattcc ctcaaccaga L68; agatggaggt gcagcgccgg ccctggttag tgaagtacgg ggacagcggg gagcagattg L74; ccggcttcgt gaaggagttc tcccacatcg cctttctcac gatcaagggc gccggccaca cgacaagccc ctcgctgcct tcaccatgtt ctcccgcttc agccatactg acag caaccagctc cacggcctga tgcagcccct cccagcctct L921 cccgctagga gagtcctctt ctaagcaaag tgcccctgca gttc L98; actgccccct tcccagagcc ctgtacatcc gggc ccagggtctc ccatagacag 204; cctgggggca agttagcact ttattcccgc agcagttcct gaatggggtg gcctggcccc 210; ttctctgctt aaagaatgcc ctttatgatg cactgattcc atcccaggaa cccaacagag 216; ctcaggacag cccacaggga ggtggtggac ggactgtaat tgatagattg attatggaat 222; gggt acagcttcaa aaaaaaaaaa aaaa Pnneh1seguence(vafiant1y NCBI Reference Seguence: NP_000299.2 LOCUS N P_000299 ION NP_000299 l mtssprappg qurggaemi lfll lllllllvsw asrgeaapdq deiqunga 6; kqpsfrqysg ylkgsgskhl qkd penspvvlwl nggpgcssld glltehgpfl 12; vqugvtley npyswnlian vlylespagv gfsysddkfy atndtevaqs nfealqdffr 18; lfpeyknnkl fltgesyagi yiptlavlvm qdpsmnlqgl avgnglssye qndnslvyfa 24; yyhgllgnrl wsslqthccs dnk dlecvtnlqe varivgnsgl niynlyapca ; ggvpshfrye kdtvvqulg niftrlplkr mwhqallrsg ppct nttaastyln 36L npyvrkalni peqlpqwdmc nflvnlqyrr lyrsmnsqyl kllssqkyqi llyngdvdma 42L ewfv dslnqkmevq rrpwlvkygd gka efshiaflti kgaghmvptd 48; kplaaftmfs rflnkqpy EPHX1 OfibblSwnbd:EPHX1 Official Name: epoxide hydrolase 1, microsomal (xenobiotic) Gene ID: 2052 sm: Homo sapiens Other Aliases: EPHX, EPOX, HYL1, MEH Other ations: e hydratase; epoxide hydrolase 1 Nucleotide seguence: (variant 1) NCBI Reference Seguence: NM_000120.3 LOCUS NM_0001 20 ACCESSION NM_000120 l cagaaggccg tggggagtgg gggccagtgc ctgcagcctg ccctgcctct ctcacaggcc 6; cttagagcat Cgccaggtgc agagctccac agctctcttt cccaaggagt aatcagaggg 12; tgagaacgtg gagcctggtg gacaggtgaa agcactggga tctttctgcc cagaaagggg 18; aaagttgcac atttatatcc gaag cgacagcagt gcttctccct gtgctgaggt 24; acaggagcca taga aatcctcctc acttcagtgc tgggctttgc catctactgg ; ttcatctccc gggacaaaga ggaaactttg ccacttgaag atgggtggtg ggggccaggc 36; acgaggtccg cagccaggga ggacgacagc atccgccctt tcaaggtgga aacgtcagat 42; gaggagatcc acgacttaca ccagaggatc gataagttcc cccc acctttggag 48; gacagctgct tccactatgg ctcc aactacctga agaaagtcat ctcctactgg 54; cggaatgaat ttgactggaa gaagcaggtg gagattctca accc tcacttcaag 60; actaagattg aagggctgga catccacttc atccacgtga agccccccca gctgcccgca 66; ggccataccc cgaagccctt gctgatggtg cacggctggc ccggctcttt ctacgagttt 72; tataagatca tcccactcct gactgacccc aagaaccatg gcctgagcga tgagcacgtt 78; gtca tctgcccttc catccctggc tatggcttct cagaggcatc ctccaagaag 84L gggttcaact cggtggccac cgccaggatc aagc tgatgctgcg gctgggcttc 90; caggaattct acattcaagg ctgg gggtccctga tctgcactaa tatggcccag 96L ctggtgccca tgaa aggcctgcac ttgaacatgg ctttggtttt aagcaacttc 102; tctaccctga ccctcctcct gggacagcgt ttcgggaggt ttcttggcct cactgagagg 108; gagc tgctgtaccc cgtcaaggag aaggtattct acagcctgat gagggagagc L14; ggctacatgc acatccagtg caccaagcct gacaccgtag gctctgctct gaatgactct L20; cctgtgggtc tggctgccta tattctagag aagttttcca cctggaccaa tacggaattc L26; cgatacctgg aggatggagg cctggaaagg aagttctccc tggacgacct gctgaccaac L32; gtcatgctct actggacaac catc atctcctccc agcgcttcta caaggagaac L38; ctgggacagg gctggatgac ccagaagcat gagcggatga aggtctatgt gcccactggc L44; ttctctgcct tcccttttga gctattgcac acgcctgaaa agtgggtgag gttcaagtac L50: ctca tctcctattc ctacatggtt cgtgggggcc actttgcggc ccggagctgc tcgcccagga catccgcaag tcgg tgctggagcg acccctctcc ccccgcctgc cacctccccc cacaagtgcc ctccaggctt ccct tttctgagga atgagtttgc ctccgtcccc tgcccatgct gggagcccac L74; gctcaccccc tcacccctcc aagctcactc cccaaccccc aactccgtgt ggtaagcaac 1801 atggctttga tgataaacga ctttactcta aaaaaaaaaa aaaaaaa Protein seguence nt 1 ): NCBI Reference Seguence: NP_000111.1 LOCUS 111 ACCESSION NP_000111 l mwleilltsv lgfaiwais rdkeetlple dgwwgpgtrs aareddsirp deei 6; hdlhqridkf rftppledsc fhygfnsnyl wrne fdwkkqveil nryphfktkl 12; fihv kppqlpaght pkpllmvhgw pgsfyefyki iplltdpknh glsdehvfev 18; iCpsipgygf kgfn svatarifyk lmlrlgfqef yiqggdwgsl ictnmaqlvp 24; shvkglhlnm alvlsnfstl tlllgqrfgr rdve llypvkekvf yslmresgym ; pdtv gsalndspvg laayilekfs twtntefryl edgglerkfs lddlltnvml 36; ywttgtiiss qrfykenlgq gwmtqkherm kvyvptgfsa fpfellhtpe kwvrfkypkl 42; isysymvrgg hfaafeepel laqdirkfls vlerq ATPSB al Symbol: ATP5B Official Name: ATP synthase, H+ transporting, mitochondrial F1 complex, beta polypeptide Gene ID: 506 Organism: Homo sapiens Other Aliases: ATPMB, ATPSB Other Designations: ATP synthase subunit beta, mitochondrial; mitochondrial ATP synthase beta subunit; mitochondrial ATP synthetase, beta subunit Nucleotide ce: NCBI nce Seguence: NM_001686.3 LOCUS NM_001 686 ACCESSION 686 l agttcaccca atggacctgc ctactgcagc gtaggcctcg cctcaacggc aggagagcag 6; gcggctgcgg ttgctgcagc cttcagtctc cacccggact acgccatgtt ggggtttgtg 12; ggtcgggtgg ctcc ggcctccggg gccttgcgga gactcacccc ttcagcgtcg 18; ctgcccccag ctcagctctt actgcgggcc gctccgacgg cggtccatcc tgtcagggac 24; tatgcggcgc aaacatctcc ttcgccaaaa gcaggcgccg ccaccgggcg catcgtggcg ; gtcattggcg cagtggtgga cgtccagttt gatgagggac taccaccaat tctaaatgcc 36; gtgc aaggcaggga gaccagactg gttttggagg tggcccagca tttgggtgag 42; agcacagtaa ggactattgc tatggatggt acagaaggct tggttagagg ccagaaagta 48; ctggattctg gtgcaccaat caaaattcct gttggtcctg agactttggg cagaatcatg 54; aatgtcattg gagaacctat tgatgaaaga ggtcccatca aaaccaaaca atttgctccc 60; attcatgctg aggctccaga gttcatggaa atgagtgttg agcaggaaat tctggtgact 66; ggtatcaagg ttgtcgatct gctagctccc tatgccaagg gtggcaaaat tgggcttttt 72; gctg gagttggcaa gactgtactg atcatggagt taatcaacaa tgtcgccaaa 78; ggtg gttactctgt gtttgctggt gttggtgaga gtga aggcaatgat 84; ttataccatg aaatgattga atctggtgtt atcaacttaa aagatgccac ctctaaggta 90; gcgctggtat aaat gaatgaacca cctggtgctc gtgcccgggt agctctgact 96; gggctgactg tggctgaata cttcagagac caagaaggtc aagatgtact gctatttatt 102; gataacatct ttcgcttcac ccaggctggt tcagaggtgt tatt gggccgaatc 108; ccttctgctg tgggctatca gcctaccctg gccactgaca ctat gcaggaaaga 114; attaccacta ccaagaaggg atctatcacc tctgtacagg ctatctatgt gcctgctgat 120; actg cccc tgctactacg tttgcccatt tggatgctac cactgtactg 126; tcgcgtgcca ttgctgagct ctat ccagctgtgg atcctctaga ctccacctct L32; cgtatcatgg atcccaacat tgttggcagt gagcattacg atgttgcccg gcaa L38; aagatcctgc acaa atccctccag gatatcattg ccatcctggg tatggatgaa L44; ctttctgagg aagacaagtt gaccgtgtcc cgtgcacgga aaatacagcg gtct L50; cagccattcc aggttgctga caca ggtcatatgg ggaagctggt acccctgaag L56; gagaccatca aaggattcca gcagattttg gcaggtgaat atgaccatct cccagaacag L62; gccttctata tggtgggacc cattgaagaa gctgtggcaa aagctgataa cgtgaggggt ctttgtcctc tgtactgtct ctctccttgc aaaagcttca tttttctgtg taggctgcac aagagccttg attgaagata tattctttct l80l gaacagtatt taaggtttcc aataaaatgt acacccctca gaaaaaaaaa aaaaaaa n seguence: NCBI nce Seguence: NP_001677.2 LOCUS NP_001677 ACCESSION N P_001677 l mlgfvgrvaa apasgalrrl ppaq lllraaptav hpvrdyaaqt spspkagaat 6; grivavigav vdvqfdeglp pilnalevqg retrlvleva thgestvrt eg;V 12; rquvldsga pikipvgpet lgrimnvige pidergpikt kqfapihaea pefmemsveq 18; eilvtgikvv dllapyakgg kiglfggagv meli nnvakahggy svfagvgert 24; regndlyhem iesgvinlkd atskvalvyg gara rvaltgltva eyfrdqequ ; vllfidnifr ftqagsevsa llgripsavg yqptlatdmg tmqeritttk kgsitquai 36; yvpaddltdp apattfahld attvlsraia engiypavdp ldstsrimdp nivgsehydv 42; arqukilqd ykslqdiiai lgmdelseed kltvsrarki qrflsqpfqv aevftghmgk 48; lvplketikg quilageyd hlpeqafymv gpieeavaka dklaeehss ATP5D Official Symbol: ATP5D al Name: ATP se, H+ transporting, mitochondrial F1 complex, delta subunn Gene ID: 513 Organism: Homo sapiens Other Aliases: None currently listed Other ations: ATP synthase subunit delta, mitochondrial; F-ATPase delta subunit; mitochondrial ATP synthase complex delta-subunit precusor; mitochondrial ATP synthase, delta subunit Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001687.4 LOCUS NM_001 687 ACCESSION 687 l cagacgtccc tgcgcgtcgt cctcctcgcc ctccaggccg cccgcgccgc gccggagtcc 6L gctgtccgcc agctacccgc ttcctgccgc ccgccgctgc catgctgccc gccgcgctgc 121 tccgccgccc tggc cgcctcgtcc cccg tgcctatgcc gccg 18; ccgccccggc tgccgcctct ggccccaacc agatgtcctt caccttcgcc tctcccacgc 24; aggtgttctt caacggtgcc aacgtccggc aggtggacgt gcccacgctg accggagcct ; tcggcatcct ggcggcccac gtgcccacgc tgcaggtcct gcggccgggg ctggtcgtgg 36; tgcatgcaga ggacggcacc acctccaaat actttgtgag ttcc atcgcagtga 42; acgccgactc ttcggtgcag ttgttggccg aagaggccgt gacgctggac atgttggacc 48; tgggggcagc aaac ttggagaagg cccaggcgga gctggtgggg acagctgacg 54; aggccacgcg ggcagagatc cagatccgaa tcgaggccaa cgaggccctg gtgaaggccc 60; tggagtaggc ggtgcgtacc ngtgtcccg aggcccggcc aggggctggg cagggatgcc 66; gccc ctcc tggggtcccg gccacctggg gaagccgcgc ctgccaagga 72; ggccaccaga gggcagtgca ggcttctgcc tgggccccag gccctgcctg tgttgaaagc 78; tctggggact gggccaggga agctcctcct cagctttgag ctgtggctgc cacccatggg 84; gctctccttc cgcctctcaa gatcccccca gcctgacggg ccgcttacca tcccctctgc 90; cctgcagagc cagccgccaa ggttgacctc ggag ccacctctgg atgaactgcc 96; cccagccccc gccccattaa agacccggaa gcctgaaaaa aaaaa n seguence (variant 1): NCBI Reference ce: NP_001678.1 LOCUS NP_001678 ACCESSION N P_001678 1 mlpaallrrp glgrlvrhar aapa aasgpnqmsf tfasptqvff nganvrquv 61 ptltgafgil aahvpthvl rpglvvvhae dgttskyfvs sgsiavnads qullaeeav 121 tldmldlgaa kanlekaqae lvgtadeatr aeiqiriean ealvkale OfibblSwnbd:CAPN1 Official Name: calpain 1, (mu/l) large subunit M: 823 sm: Homo sapiens Other Aliases: CANP, CANP1,CANPL1, , muCL Other Designations: CANP 1; calcium-activated neutral proteinase 1;ca|pain mu-type; n, large polypeptide L1; calpain-1 catalytic subunit; calpain-1 large subunit; cell proliferation-inducing gene 30 protein; cell proliferationinducing protein 30; micromolar-calpain Nucleotide seguence (variant 1): NCBI nce Seguence: NM_001198868.1 LOCUS NM_001 198868 ACCESSION NM_001198868 l agggacttac ccaaggtcac gcagcgagcc cggtccccct gcgttccccg gctg 6" agccgggacg cggcggtggg gtggggaagg ggagtggcgc gcgg ggtgaggctg 12; ccgtttgctg agtgtccggc aggggtctgc tcgctgccag cccggcccct cctcagagca 18; gctgccgcag cccgaggatg tcggaggaga tcatcacgcc ggtgtactgc actggggtgt 24; cagcccaagt gcagaagcag cgggccaggg agctgggcct gggccgccat gagaatgcca ; tcaagtacct gggccaggat tatgagcagc tgcgggtgcg atgcctgcag agtgggaccc 36; tcttccgtga tgaggccttc cccccggtac cccagagcct gggttacaag gacctgggtc 42; ccaattcctc caagacctat ggcatcaagt ggaagcgtcc cacggaactg ctgtcaaacc 48; cccagttcat tgtggatgga gctacccgca cagacatctg ccagggagca ctgggggact 54L gctggctctt ggcggccatc gcctccctca ctctcaacga caccctcctg caccgagtgg 60; ttccgcacgg ccagagcttc cagaatggct atgccggcat cttccatttc tggc 66L aatttgggga gtgggtggac gtggtcgtgg tgct caag gacgggaagc 72; tagtgttcgt gcactctgcc gaaggcaacg agttctggag cgccctgctt gagaaggcct 78; atgccaaggt aaatggcagc gccc gggg cagcacctca gagggctttg 84; tcac aggcggggtt accgagtggt acgagttgcg caaggctccc agtgacctct 90; accagatcat cctcaaggcg ctggagcggg gctccctgct gggctgctcc atagacatct 96; ttct agacatggag gccatcactt tcaagaagtt gggc tact L02; ccgg ggccaagcag gtgaactacc gaggccaggt ggtgagcctg atccggatgc L08; ggaacccctg ggtg gagtggacgg gagcctggag cgacagctcc tcagagtgga L14; tgga cccatatgaa cgggaccagc tccgggtcaa gatggaggac ggggagttct L20; ggatgtcatt ccgagacttc atgcgggagt tcacccgcct ggagatctgc ccgacgccct caagagccgg accatccgca aatggaacac cacactctac gaaggcacct ggcggcgqgg gagcaccgcg gggggctgcc gaaactaccc cttc ctcagttcaa gatccggctg gatgagacgg atgacccgga cgactacggg gaccgcgagt L44; caggctgcag cttcgtgctc atgc agaagcaccg tcgccgcgag cgccgcttcg L50; gccgcgacat ggagactatt ggcttcgcgg tctacgaggt ccctccggag agccggccgt acacttgaag cgtgacttct tcctggccaa tgcgtctcgg gcgcgctcag L62; agcagttcat caacctgcga gaggtcagca cccgcttccg accc ggggagtatg L68; tggtggtgcc ctccaccttc gagcccaaca aggagggcga cttcgtgctg cgcttcttct L74; cagagaagag tgctgggact gtggagctgg atgaccagat ccaggccaat ctccccgatg L80; agcaagtgct ctcagaagag gagattgacg agaacttcaa ggccctcttc aggcagctgg L86; caggggagga catggagatc agcgtgaagg agttgcggac aatcctcaat aggatcatca L92; gcaaacacaa agacctgcgg accaagggct tcagcctaga gtcgtgccgc agcatggtga L98; acctcatgga tcgtgatggc aagc tgggcctggt ggagttcaac atcctgtgga 204; accgcatccg gaattacctg tccatcttcc ggaagtttga cctggacaag agca 210; tgagtgccta gcgg attg agtcggcagg cttcaagctc aacaagaagc 216; tgtacgagct catcatcacc cgctactcgg agcccgacct ggcggtcgac tttgacaatt 222; tcgtttgctg gcgg ctagagacca tgttccgatt tttcaaaact ctggacacag 228: atctggatgg agttgtgacc tttgacttgt ttaagtggtt gcagctgacc atgtttgcat 2341 gaggcaggga ctcggtcccc cttgccgtgc tcccctccct cctcgtctgc caagcctcgc 240; ctcctaccac accacaccag gccaccccag ctgcaagtgc cttccttgga gcagagaggc 246; agcctcgtcc tcctgtcccc tctcctccca gccaccatcg ttcatctgct ccgggcagaa 252; ctgtgtggcc cctgcctgtg ccagccatgg gctcgggatg gactccctgg gccccaccca 258; ttgccaagcc aggaaggcag ctttcgcttg ttcctgcctc gccc cgggtttccc 264; cagcatcctg atgtgtcccc tctccccact tcagaggcca cccactcagc accaccggcc 270; tggccttgcc tgcagactat aaactataac cactagctcg acacagtctg cagtccaggc 276; gtgtggagcc gcctcccggc aggc cccggggctg ggaacgcctg tgccttcctg 282; cgccgaagcc aacgccccct ctgtccttcc ctggccctgc tgccgaccag gagctgccca 288; gcctgtgggc ggtcggcctt ccctccttcg ctcctttttt atattagtga ttttaaaggg 294; gactcttcag ggacttgtgt actggttatg ggggtgccag aggcactagg cttggggtgg 300: ggaggtcccg atat agaggaaccc ataa aaggccccac atctgtctgt 306; gaaaaaaaaa aaaaaaaaaa aaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3121 aaaaa n seguence (variant 1): NCBI Reference Seguence: NP_001185797.1 LOCUS N P_001 185797 ACCESSION N P_001185797 l mseeiitpvy ctgvsaqvqk qrarelglgr henaikylgq dyeqlrvrcl qsgtlfrdea 63— fpprqslgy kdlgpnsskt rpte llsnquivd gatrtdicqg algdcwllaa 12; iasltlndtl lhrvvphgqs gifh fqlwquewv dvvvddllpi kdgklvahs 18; aegnefwsal lekayakvng syealsggst segfedftgg vtewyelrka psdlyqiiLk 24; alergsllgc sidissvldm eaitfkklvk ghaysvtgak qvnyrgqvvs lirmrnpwge ; vewtgawsds ssewnnvdpy erdqlrvkme dgefwmsfrd fmreftrlei cnltpdalks 36; rtirkwnttl yegtwrrgst aggcrnypat fwvnqukir ldetddpddy gdresgcsfv 42; lalmqkhrrr dmet igfavyevpp elvgqpavhl krdfflanas rarseqfinl 48; revstrfrlp pgeyvvvpst gdfv lrffseksag tvelddqiqa nlpdeqvlse 54; eeidenfkal frqlagedme isvkelrtil nriiskhkdl rtkgfslesc rsmvnlmdrd 60; gngklglvef nilwnrirny lsifrkfdld ayem rmaiesagfk elii 66L trysepdlav dfdnfvcclv rffk tldtdldgvv tfdlfkwlql tmfa CAPZA2 Official Symbol: CAPZA2 Official Name: capping protein (actin filament) muscle Z-line, alpha 2 Gene ID: 830 sm: Homo sapiens Other Aliases: CAPPA2, CAPZ Other Designations: F-actin capping protein alpha-2 subunit; F-actin-capping protein subunit alpha-2; capZ alpha-2 Nucleotide seguence: NCBI Reference ce: NM_006136.2 LOCUS NM_0061 36 ACCESSION NM_006136 l cccctccctt agcgggggcg gctg aggaccgcac ggaaacgggg aagtcaggtg 6; gccgctgccg ccgccgccgc cgcggtttgt cgccagaagg gcgg atctggagga 12; gcagttgtct gatgaagaga aggtgcgtat agcagcaaaa attc atgcccctcc 18; tggagaattt aatgaggttt tcaatgatgt tcggttactg cttaataatg acaatcttct 24; cagggaagga gcagcccatg catttgcaca gtataacttg gaccagttta ctccagtaaa ; aattgaaggt tatgaagatc aggtattgat aacagaacat ggcgacttgg gaaatggaaa 36; gtttttggat ccaaagaaca gaatctgttt taaatttgat agga aggaggcaac 42; tgatccaaga ccctgtgaag tagaaaatgc agttgaatca tggagaactt cagtagaaac 48; tgctctgaga gcttacgtaa aagaacatta cccgaatgga actg tgtatggcaa 54; aaaaatagat ggacagcaaa ccattattgc agaa agccatcagt tccaagcaaa 60; ttgg aatggtcgtt ggaggtcaga atggaagttt acaatcactc cttcaaccac 66; tcaagtggtt ggcatcttga aaattcaggt tcattattat gaagatggta atgttcagct 72; agtgagtcat aaagatatac aagattccct aacagtgtct aatgaagtgc aaacagcaaa 78; agaatttata aagattgtag aagctgcaga aaatgaatac cagactgcca tcagtgagaa 84; gaca atgtcggaca ctactttcaa acgt cgacagttgc cagttacacg 90: cactaagatt gattggaaca agatccttag ctacaagatt ggcaaagaga tgcagaatgc 96L ataagatgaa cattgcatga ccggatcatt ttagtgtctt tgcgttaaaa aatcattgca 102; aaagtattct gaactgtcaa gctgcccagt cagatgggct gttgccattt actg 1081 taattaatta gtttgattag agcacaaagc ttagctaatc aaccattatt tttcattttg 114; tttgttctaa gaggattgaa aatcagttta gtttaaatgt ctttctgtta ggcctttctt 120; tcttacaatg aagagatgat tcttctagtt tatggttaaa agtttttgaa gtgtctcaaa 126; aatattttac taactgtaac cctaaaattg atgtcttttg gtttatgaaa tcagtaattt 132; ttgatatttc cccagttctt tttaatgggg tcaataatgg tagt ttaaggtggt 138; tgatggattt agccatatat gctgctaaag aaattgtcta ccttttcttc ctcacctgtt 144; ccatttatgt aaagttgaga ttagagggaa agcattttct attg tgtttaaacc 150; tttcaagaag gttatttagc tagcttagtg taaa ttaa acaaggcaag 156; gtctaatgct gttttgagat atta atac ttatttcaga aatgcattta 1621 tttt tcttgtgaca gttacgcaaa tgaa ttccatatgt tttttatcat aaagccacaa atgtattata acaaggcaaa ttgtaatata cctg L741 aactcatgac catgtctcgg tttatttttt ttttcttgga ttgaaaagta atgtgacatt aaaatgcaaa ttttcctatt tatttgagta gaaaatcact catatatatt ttaaaatact ttctttggat attgtaattc ttaactggtt gtaaattaga 192; aaagctggga ttacatatgg tgtgcggtta cagtctaaat tttttcatcc tcctatgcat 198; cataagcatg atat tttcaaaaat agttctactg atgctacagg aatttcaagc 204; tgaa tgttagtatt taccataggg agtgaagtgg agttatggtt tcattcaata 210; gagtattgct actt gagtggaatc tcac gtactcccac agacgtctgg 216; gcctggaaat ttttttttta ttttatttta ttgttttttt ttttagaaaa acaccacttt 222; tattatgtac aataaaatat ttcattagct tgaattgtat agatttttaa aaattcaatg 228; aaagcatgtt gtttaatttc tttttaaaat cactgttggg ctttgaaagc attgagaata 2341 taatatgaaa ttatgaaaaa aaaaaaaaaa aaa Protein sequence: NCBI Reference Seguence: NP_006127.1 LOCUS N P_006127 ACCESSION N P_006127 1 madleeqlsd eekvriaakf iihappgefn evfndvrlll nndnllrega ahafaqynld 61 qftpvkiegy edqvlitehg dlgngkfldp knricfkfdh 1rkeatdprp cevenavesw 121 rtsvetalra yvkehypngv ctvygkkidg qqtiiacies hqfqaknfwn grwrsewkft 181 itpsttqvvg ilkiqvhyye dgnvqlvshk vsn evqtakefik iveaaeneyq 241 taisenyqtm sdttfkalrr qlpvtrtkid wnkilsykig kemqna Official Symbol: CCT7 Official Name: chaperonin containing TCP1, subunit 7 (eta) m: 10574 Organism: Homo sapiens Other Aliases: CCTETA, CCTH, NIP7-1, TCP1 ETA Other Designations: OCT-eta; HIV-1 Net interacting protein; HIV-1 Nef- interacting protein; lex protein 1 subunit eta; TCPeta; chaperonin containing t-complex polypeptide 1, eta subunit Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_006429.3 LOCUS NM_006429 ACCESSION NM_006429 l atagagtagc ggaagtggtc cgttctcttc cggc ccaagcttct ttct 6; attgcgcgag gcattgtggg ttgctgggcg tctc ggagaagagg ggagagtggc 12; gggccgctga ataagcttcc aaaatgatgc ccacaccagt attg aaagagggga 18; ctgatagctc ccaaggcatc ccccagcttg tgagtaacat ctgc caggtgattg 24; ctgaggctgt aagaactacc ctgggtcccc gtggcatgga caagcttatt gtagatggca ; gaggcaaagc aacaatttct aatgatgggg ccacaattct gaaacttctt gatgttgtcc 36; atcctgcagc aaagactttg gtagacattg ccaaatccca agatgctgag gtgggtgatg 42; gcaccacctc agtgaccttg ctggctgcag agtttctgaa gcaggtgaaa ccctatgtgg 48; aggaaggttt acacccccag atcatcattc gagctttccg cacagccacc cagctggcag 54; ttaacaagat caaagagatt gctgtgaccg tgaagaaggc agataaagtg gagcagagga 60; agctgctgga aaagtgtgcc atgaccgctc tgagctccaa ctcc cagcagaaag 66; ctttctttgc taagatggtg gtggatgcag tgct cgatgatttg ctgcagctta 72; ttgg aatcaagaag gtacagggtg gagccctcga ggattctcag ctggtagctg 78L catt caagaagact ttctcttacg ctgggtttga aatgcaaccc tacc 84L acaatcccaa gattgccctt ttgaatgtcg agctcgagtt tgag aaagacaatg 90; ctgagataag agtccacaca gttgaggatt atcaggcaat tgttgatgct gagtggaaca 96; ttctctatga caagttagag aagatccatc attctggagc caaagttgtc aaac 102; tccccattgg ggatgtggcc acccagtact ttgctgacag ggacatgttc tgtgctggcc ;O8; gagtacctga tctg aagaggacaa tgatggcctg tggaggctca atccagacca ;l4; gtgtgaatgc tctgtcagca gatgtgctgg gtcgatgcca ggtgtttgaa gagacccaga ;20; ttggaggcga gaggtacaat ttttttactg gctgccccaa ggccaagaca tgcaccttca ;26; ttctccgtgg cggcgccgag cagtttatgg aggagacaga gcggtccctg catgatgcca ;32; tcatgatcgt ggcc atcaagaatg attcagtggt tggc attg ;38; agatggaact ctccaagtac ctgcgggatt actcaaggac tattccagga aaacagcagc ggcatatgcc aaggccttgg agattatccc acgccagctg ctggctttga tgccacaaac attctcaaca agctgcgggc tcggcatgcc caggggggta ;56; catggtatgg agtagacatc aacaacgagg acattgctga caactttgaa gctttcgtgt ;62; gggagccagc gcgg atcaatgcgc tgacagcagc ctctgaggct agatgaaacc atcaagaacc cccgctcgac tgtggatgct cccacagcag ;74; caggccgggg ccgtggtcgt ggccgccccc actgagaggc accccaccca tcacatggct ;80; ctgc tgggtgcact taccctcctt ggcttggtta cttcatttta caaggaaggg ;86; gtagtaattg gcccactctc ttcttactgg aggctattta aataaaatgt aagacttcag ;92; ataactttgt aaattaaaaa aaaaaaaa n ce (variant 1): NCBI Reference Seguence: 420.1 LOCUS NP_006420 ION NP_006420 l illk egtdssqgip qlvsnisacq viaeavrttl gprgmdkliv dgrgkatisn 6; dgatilklld vvhpaaktlv diaksqdaev gdgttsvtll aaeflkqvkp yveeglhpql 12; iirafrtatq lavnkikeia vtvkkadkve qullekcam talssklisq qkaffakmvv 18; davmmlddll ikkv qggaledsql vagvafkktf syagfemqpk kyhnpkia;l 24; nvelelkaek dnaeirvhtv edyqaivdae klek ihhsgakvvl sklpigdvat ; qyfadrdmfc agrvpeedlk rtmmacggsi qtsvnalsad vlgrcqvfee tqiggerynf 36; ftgcpkaktc tfilrggaeq fmeeterslh daimivrrai kndsvvaggg aiemelskyl 42; rdysrtipgk qqlligayak aleiiprqlc dnagfdatni lnklrarhaq ggtwygvdin 48; nediadnfea fvwepamvri naltaaseaa clivsvdeti knprstvdap taagrgrgrg 54; rph OfibblSwnbd:CTSB Official Name: cathepsin B m: 1508 Organism: Homo sapiens Other Aliases: APPS, CPSB Other Designations: APP secretase; amyloid precursor protein secretase; cathepsin B1; cysteine protease Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001908.3 LOCUS NM_001 908 ACCESSION NM_001908 1 ggggcggggc gta cttagggccg gccc aggctacggc ggctgcaggg 6; ctccggcaac cgctccggca acgccaaccg ctccgctgcg Cgcaggctgg gctgcaggct 12; ctcggctgca gcgctgggtg gatctaggat ccggcttcca acatgtggca gctctgggcc 18; tccctctgct gcctgctggt gttggccaat gcccggagca ggccctcttt ccatcccctg 24; tcggatgagc tggtcaacta caaa cggaatacca Cgtggcaggc cgggcacaac ; ttctacaacg tggacatgag gaag aggctatgtg gtaccttcct gggtgggccc 36; aagccacccc ttat gtttaccgag gacctgaagc caag cttcgatgca 42; cgggaacaat ggccacagtg tcccaccatc atca gagaccaggg ctcctgtggc 48; tcctgctggg ccttcggggc tgtggaagcc atctctgacc ggatctgcat ccacaccaat 54; gtca gcgtggaggt gtcggcggag gacctgctca catgctgtgg cagcatgtgt 60; ggggacggct gtgg ctatcctgct tgga acttctggac aagaaaaggc 66; ctggtttctg gtggcctcta tgaatcccat gtagggtgca gaccgtactc catccctccc 721 tgtgagcacc acgg ctcccggccc ccatgcacgg gggagggaga tacccccaag 78L tgtagcaaga tctgtgagcc tggctacagc ccgacctaca aacaggacaa gcactacgga 84L tacaattcct tctc caatagcgag aaggacatca agat ctacaaaaac 90; ggccccgtgg agggagcttt ctctgtgtat tcggacttcc tgctctacaa gtcaggagtg 96; taccaacacg tcaccggaga gatgatgggt ggccatgcca tccgcatcct gggctgggga 102; gtggagaatg gcacacccta ctggctggtt gccaactcct ggaacactga ctggggtgac L08; ttct ttaaaatact cagaggacag gatcactgtg gaatcgaatc agaagtggtg L14; gctggaattc ccga tcagtactgg gaaaagatct aatctgccgt gggcctgtcg L20; tgccagtcct gaga tcggggtaga aatgcatttt attctttaag ttcacgtaag L26; atacaagttt cagacagggt ctgaaggact ggattggcca aacatcagac ctgtcttcca L32; aggagaccaa gcta catcccagcc tgtggttaca gtgcagacag gccatgtgag L38; ccaccgctgc cagcacagag cgtccttccc cctgtagact agtgccgtag ctgccccagc tggc cccctccgtg atccatccat ctccagggag caagacagag L50; acgcaggaat ggaaagcgga gttcctaaca ggatgaaagt tcccccatca caagtagctt tccacatttg tcacagaaat cagaggagag L62; tttg gagaacgcca cagg ccccctgcat ctatcgagtt tgcaatgtca L68; caacctctct gatcttgtgc tcagcatgat tctttaatag aagttttatt ctctgctaat catgtgggtg agccagtgga ggag acctgtgcta ttgcctcctt atgacgcggc tcaaaaggaa accaagtggt caggagttgt L86; ctgatctcta ctaccacaag agtt taggagaaac cagcttttac tgtttttgaa L92; aaattacagc ttcaccctgt caagttaaca aggaatgcct gtgccaataa aagttttctc L98; caacttgaag tctactctga tgggatctca gatcctttgt cactgcctat agacttgtag 204; ctgctgtctc tctttgtccc tgcagagaat cacgtcctgg aactgcatgt tcttgcgact 210; cttgggactt aact tctcgctgcc ccagccatgt tttcaaccat ggcatccctc 216; ccccaattag ttccctgtca tcaa ccttctctgt aagtgcctgg taagcttgcc 222; taag aactcaaaac atagctgtgc tctatttttt tgttgttgtt gtgactgaca 228; gagtgagatt ccgtctccca ggctggagtg cagtggcgcc ttctcagctc acct 234; tcct agattcaagc gattctcctg cctt ccgagtagct gggatgacag 240; gcactcacca atatgcctgg gtaatttttg tatttttaag tacatacagg atttcaccat 246; gttggccagg ctagtttcaa actcccggcc tcaggtggtc tgcctgcctc agcctcccaa 252; gqga ttacaggcgt gagccactgg gccctgcctg tattttttat cagccacaaa 2581 tccagcaaca agctgaggat tcagctcata aaacaggctt ggtgtcttgg tgatctcaca 264; taaccaagat gctaccccgt ggggaaccac atccccctgg atgccctcca gccttggttt 270; gagt cagggcctgt atacagtatt ttgaatttgt atgccactgg tttgcattgc 276; tggtcaggaa ctctagtgct ttgcatagcc ctggtttaga aacatgttat agcagttctt 282; ggtatagagc aaactagaag aaccagcaat cattccactg tcctgccaag gtacacctca 288; gtactcccct tcccaactga agtggtatga ggctagctct ttccaaaagc attcaagttt 294; ggcttctgat gtgactcaga atttaggaac ctag atcaaataag ctctgaaaat 300; ctgaggaaca ttgtaggaaa ggtttgttaa gcatctctta agtgccatga tgagcataac 306; agccggccgt cgtggctcac aatc ccagcacttt gggaggccaa ggtgggagga 312; tgacaaggtc aggagttcaa gaccagcctg gccaacatgc tgaaacctca cctctactaa 318; aaatacaaaa attagctggg catggtggca catgcctgta atcccagcta cttgggaggc 324: tgaggcagga gaatcgcttg aacccgggag gcggaggttg gcca agacagtgcc 330; agtgcactcc agcctcggtg acagcgcaag gctccgtctc aataattaaa aaaaaaaaaa 336; aaaaaaaaaa ggccgggcgc agtggctcaa gcctgtaatc ccagcacttt gggaggctga 3421 ggcgggcaga tcacctgagg tcaggagttt tgagatcagc cttggcaaca cggtgaaacc 348; ccatctctac taca aaattagcca agcatgctgg cacatgcctg taatcccagc 354; tactcgggag gctgaggtac gagaatcgct tggg agga gagc 360; cgagatcacg ccattgcact ccagcctggg ggacaagagt gaatctgtgt ctcaccaaaa 366; aaaaaaagaa aaagaaagat gcttaacaaa ggttaccata aaat tcataaccac 372; ttatccttcc agtttcaagt tatt cataacctca ataaagttct ccctgctccc 3781 aaa n sequence nt 1): NCBI Reference Seguence: NP_001899.1 LOCUS NP_001899 ACCESSION N P_001899 1 qulwaslcc 11v1anarsr psfhplsdel vnyvnkrntt wqaghnfynv dmsylkrlcg 61 tflggpkppq rvmftedlkl reqw keir dqgscgscwa fgaveaisdr 121 icihtnahvs vevsaedllt ccgsmcgdgc nggypaeawn fwtrkglvsg glyeshvgcr 181 pysippcehh vngsrppctg egdtpkcski cepgysptyk qdkhygynsy svsnsekdim 241 aeiykngpve gafsvysdfl lyksgquhv tgemmgghai rilgwgveng tpywlvansw 301 ntdwgdngff kilrquhcg iesevvagip rtdqyweki FKBP2 Official Sym bol: FKBP2 Official Name: FK506 binding protein 2, 13kDa gflyyiflg:2286 Organism: Homo sapiens Other Aliases: FKBP-13, PPlase OHmrDesbnamMm:13kDaFK506bMdmgpnfiem;13kDaFKBP;FK506 binding protein 2 (13kD); FKBP-2; PPlase FKBP2; immunophilin FKBP13; peptidyI-prolyl cis-trans isomerase FKBP2; proline isomerase; rapamycin- binding protein; se Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_004470.3 LOCUS NM_004470 ACCESSION NM_004470 1 gccggaagtg acgcagggca gcggcgtcgc gggggcgggg ctcgggaaag gcca 6; gcgggcgtgt ggccgcgggt cggt ccaataaggg agggcggcgt ggcccggcct 12; ggtagcgacg aggacgcgcc tgcgcagagg cggcagcacc accggggttg actccggggg 18; cgcggcgagg agagacatga ggctgagctg ggtc ctgacagtac tgtccatctg 24; cctgagcgcc gtggccacgg ccacgggggc cgagggcaaa aggaagctgc agatcggggt ; caagaagcgg gtggaccact gtcccatcaa caaa ggggatgtcc tgcacatgca 36; ctacacgggg gaag atgggacaga gtttgacagc cccc agaaccagcc 42; ctttgtcttc tcccttggca caggccaggt catcaagggc tgggaccagg ggctgctggg 48; gatgtgtgag ggggaaaagc gcaagctggt gatcccatcc gagctagggt atggagagcg 54; gggagctccc attc gtgc aaccctggtg ttcgaggtgg agctgctcaa 60; aatagagcga cgaactgagc tgtaaccaga ctggggaggg gcagggggag ccat 66; cagggaccag actgttccaa aaaaaaaaca aaaaacaaaa acaaacaaaa aaacacttaa 72; aagcccaagg aaaaaaaaaa aaaaaaa Protein seguence (variant 1): NCBI Reference Seguence: NP_004461.2 LOCUS N P_004461 ACCESSION NP_004461 l rvlt sava tatgaegkrk lqigvkkrvd hcpiksrkgd vlhmhytgkl 61 edgtefdssl pqnqpfvfsl gtgqvikgwd qgllgmcege krklvipsel gygergappk 121 ipggatlvfe vellkierrt e1 FLNC Official Sym bol: FLNC Official Name: filamin C, gamma m: 2318 Organism: Homo sapiens Other Aliases: ABP-280, ABP280A, ABPA, ABPL, FLN2, MFM5, MPD4 Other Designations: ABPlike protein; ABP-L, gamma filamin; FLN-C; actin binding protein 280; actin-binding-like protein; filamin 2; filamin-2; filamin-C Nucleotide seguence (variant 1): NCBI Reference Seguence: NM_001458.4 LOCUS NM_001458 ION NM_001458 1 ccctggaggg agagagagcc agagagcggc cgagcgccta ccgc tcgc 61 cgagccccgc cagccccggc gcgagagaag ttggagagga gagcagcgca cgag 121 ggtc caac agcgcccgac agcccccgat agcccaaacc ctag 181 ccccggccgc acccccagcc cgcgccagca tgatgaacaa cagcggctac gccg 241 gcctcggcct gggcgatgag acagacgaga tgccgtccac ggagaaggac ctggcggagg 301 acgcgccgtg gaagaagatc cagcagaaca cattcacgcg ctggtgcaat gagcacctca 361 agtgcgtggg caagcgcctg accgacctgc agcgcgacct cagcgacggg ctccggctca 421 tcgcgctgct cgaggtgctc agccagaagc gcatgtaccg caagttccat ccgcgcccca 481 acttccgcca aatgaagctg gagaacgtgt ccct cgagttcctc gagcgcgagc 541 acatcaagct cgtgtccata gacagcaagg ccatcgtgga cctg aagctgatcc 601 tgggcctgat ctggacgctg atcctgcact actccatctc catgcccatg tgggaggatg 661 atga ggatgcccgc aaacagacgc ccaagcagcg gctgcttggc tggatccaga 721 acaaggtgcc gccc atcaccaact tcaaccgtga ctggcaggac ggcaaagctc 781 tgggcgccct ggtggacaac tgcgcccccg gtctctgccc cgactgggag gcctgggacc 84; ccaaccagcc Cgtggagaac gcccgggagg ccatgcagca ggccgacgac gggg 90; tgccccaggt cattgcccct gaggagattg tggaccccaa cgtggatgag cattctgtta 96; tgacctacct gtcccagttc cccaaggcca agctcaaacc tggtgcccct gttcgatcca L02; agcagctgaa ccccaagaaa gccatcgcct atgggcctgg catcgagcca aaca L08; ccgtgctgca gcctgcccac ttcaccgtgc agacggtgga cgcgggcgtg ggcgaggtgc L14; tggtctacat cgaggaccct gaaggccaca CCgaggaggC taaggtggtt cccaacaatg L20; acaaggatcg cacctatgct gtctcctatg tgcccaaggt gtta cacaaggtga 126; ccgtgctctt ccag gaac gcagtccctt tgaggtgaac ccctgggaga tgccaacaag gtgtcagccc gtggccctgg cctggaacct L38; tggccaacaa acccacctac tttgacatct acactgcggg ggccggcact ggcgatgttg L44; ctgtggtgat Cgtggaccca cagggccggc gggacacagt ggcc agggtgacag cacgttccgc tgcacataca gacctgccat ggaggggcca atgtggcctt tgcgggtgcc cccatcaccc gcagtccctt ccctgtccat cctgtaaccc caacgcctgc cgcgcctctg ggcgaggcct gcagcccaag ggtgttcgcg L68; tgaaagaggt ggctgacttc ttta ccaagggtgc ggg gagctcaagg L74; tcacggtcaa ggggccaaag ggcacagagg agccagtgaa ggtgcgggag gctggggatg L80; gtgtgttcga gtgcgagtac tacccggtgg tgcctgggaa gtatgtggtg accatcacgt L86; ggggcggcta Cgccatccct Cgcagcccct ttgaggtaca ggtgagccca ggag L92; tgcaaaaggt ccgggcctgg ggtcctggtt tggagactgg ccaggtgggc aagtcagccg L98; attttgtggt ggaagccatt ggcaccgagg tggggacact gggcttctcc atcgaggggc 204; cctcacaagc caagatcgaa tgtgacgaca agggggatgg ctcctgcgat gtgcggtact 210; ggcccacgga gcctggggag tacgctgtgc acgtcatctg tgacgatgag gacatccgag 216; actcaccctt cattgcccac cccg ccccacctga ctgcttccca gataaggtga 222; aggcctttgg gcctggcctg gagcctaccg gctgcatcgt ggacaagccc gctgagttca 228i ccattgatgc tcgtgcagct ggcaagggag acctgaagct ctatgcccag gacgccgacg 2341 ccat cgacatcaag gtgatcccca acggcgacgg caccttccgc tgctcctacg 240; ccaa gcccattaag cacaccatca cctg gggaggcgta aacgtgccca 246; agagcccctt ccgggtgaac gtgggcgagg gcagccaccc ggta aaggtgtacg 252; gccccggagt ggagaagaca aagg ccaatgagcc cacctacttc acggtggact 258; gcagcgaggc ggggcaaggc gacgtgagca tcggcatcaa ccca ggcgtggtgg 264; gccctgcaga ggctgacatt gacttcgaca tcatcaagaa tgacaacgac accttcaccg 270; tcaagtacac gccaccaggg gcgggccgct acaccatcat gttt gccaaccagg 276; agatccccgc cttc cacatcaagg tggacccatc tgcc agcaaagtca 282; aggccgaggg ccctgggctg aatcgcacag gtgtggaagt cgggaagccc ttca 288; cggtgctgac caagggagcc gcca agctggatgt gcagtttgca gggacagcca 294; aggt tgtgcgggac tttgagatca tagacaacca tgactactcc tacactgtca 300; agtacaccgc tgtccagcag ggcaacatgg cagtgacagt gacttatggc ggggaccctg 306; tccccaagag cccctttgtg gtgaatgtgg cacccccgct ggacctcagc aaaatcaaag 312; ttcagggcct caag gtggctgtgg gacaggaaca agcattctct gtgaacacac 318; gaggggctgg cggtcagggc caactggatg tgcggatgac ttcgccctct ccca 324; tcccctgcaa gctggagcca ggcggtggag cggaagccca ggctgtgcgc tacatgcccc 330; cggaggaggg gccctacaag gtggatatca cctacgatgg tcacccggtg cctggcagcc 336; cgtttgctgt ggagggtgtc ctgccccctg atccctccaa ggtctgtgct tatggcccgg 342; gtctcaaggg tggactggta ggcacccccg cgccattctc catcgacacc aagggggctg 348; gcacaggtgg cctggggctg accgtagagg gcccctgcga ggccaagatc gagtgccagg 354; acaatggtga tggctcatgt gctgtcagct acctgcccac ggagcctggc gagtacacca 360; tcct gtttgctgag atcc ctggctcgcc agcc accattcggc 366; ctgtgtttga cccgagcaag gtgcgggcca gtggaccggg cctggagcgc ggcaaggtcg 372; gtgaggcagc caccttcact gtggactgct cagaggcagg cgaggcggag ctgaccattg 378; tgtc ggatgccggg gtcaaggccg aggtgctgat ccacaacaac gcggatggca 384; cctaccacat cacctacagc cctgccttcc ctggcaccta caccattacc atcaagtatg 390; atcc cgtgcccaaa ttccccaccc gtgtccatgt gcagcctgcg gtcgatacca 396; gtggcgtcaa ggtctcaggg cctggtgttg agccacacgg tgtcctgcgg gaggtgacca 402; ctgagttcac tgtggatgca agatccctaa cagccacagg cggcaaccac gtgacggctc 408; gtgtgctcaa cccctcgggg gccaagacag acacctatgt gacagacaat ggggacggca 414; cctaccgagt gcagtacacc gcctacgagg agggcgtgca tctggtggag gtcctgtatg 420; tcgc tgtgcccaag agccccttcc gagtgggcgt gggc tgtgatccca 426; cccgcgtccg agccttcggg ccaggcctgg agggtggctt ggtcaacaag cgat 432; tcactgtgga gaccagggga gcgggcaccg ggggccttgg cctagccatc gagggtccct 438; cggaagccaa gatgtcctgc aaggacaaca aggatggtag ctgcaccgtg gagtacatcc 444; ccttcactcc tggagactat gacgtcaaca tcaccttcgg ggggcggccc ggga 450; gcccgttccg cgtgccagtg aaggatgtgg tggaccctgg gaaggtgaag tgctcagggc 456; cagggctggg ggctggtgtc agggcccggg agac cttcacagtg agtc 462; aagctggccg cctg caggtggctg tgctgggccc cacaggtgtg gccgagcctg 468; tggaggtgcg ggacaatgga gatggcaccc acactgtcca ctacacccca gccactgacg 474; ggccctacac ggtagccgtc aagtatgctg accaggaggt gccacgcagc cccttcaaga 480; tcaaggtcct cccagctcat gatgccagca aggtgcgggc cagcggccca ggcctcaacg 486; cctctggcat ccctgccagc ctgcctgtgg agttcaccat cgacgcacgg gacgcgggcg 492; aggggttgct cactgtccag atcttggacc ccgagggtaa gcccaagaag gccaacatcc 498; atgg ggatggcacg tacactgtgt cctacctgcc ggacatgagt ggccggtaca 504; ccatcaccat caagtatggc gaga actc gcccttccgc atccatgctc 510; tgcccactgg cagc aagtgcctcg tcacagtgtc cattggaggc catggcctgg 516; gcct gggccctcga attg ggcaggagac ggtgatcacg gtggatgcca 522; aggcagccgg tgaggggaag gtgacatgca cggtgtccac gccggatggg gcagagctcg 528; atgtggatgt ggttgagaac catgacggta cctttgacat ctactacaca gcgcccgagc 534; cgggcaagta Cgtcatcacc atccgcttcg ggggtgagca catccccaac agccccttcc 540; acgtgctggc gtgtgacccc ctgccgcacg aggaggagcc ctctgaagtg ccacagctgc 546; ccta Cgctcctccc cggcccggcg cccgccccac acactgggcc acagaggagc 552; cagtggtgcc tgtggagcca atggagtcca tgctgaggcc cctg gtcatcccct 558; tcgcggtgca gaaaggggag ctcacaggag aggtgcggat gccctcgggg aagacggcac 564; ggcccaacat caccgacaac aaggacggca ccatcacggt gaggtatgca cccactgaga 570; aaggcctgca ccagatgggg atcaagtatg acggcaacca tggg agccccttac 576; agttctatgt ggatgccatc aacagccgcc atgtcagtgc gcca ggcctgagcc 582; atggcatggt caacaagcca gccaccttca tcac caaagatgct ggagaagggg 588; gtctgtcact ggccgtggag ggcccatcca aggcagagat taag gacaacaagg 5941 atggcacctg caccgtgtcc tatctgccga ctgcgcctgg agactacagc atcatcgtgc 600; gcttcgatga caagcacatc ccggggagcc ccttcacagc caagatcaca ggtgatgact 606; ccatgagqac ctcacagctg aatgtgggca cctccacgga actg aagatcaccg 612; agagtgatct gagccagctg accgccagca tccgtgcccc ctcgggcaac gaggagccct 618; gcctgctgaa gcgcctgccc aaccggcaca ttgggatctc cttcaccccc aaggaggtcg 624; gggagcacgt ggtgagcgtg Cgcaagagtg gcaagcatgt caccaacagc cccttcaaga 630; tcctggtggg gccatctgag atcggggacg ccagcaaggt ctgg ggcaaggggc 636; tttccgaggg acacacattc caggtggcag agttcatcgt ggacactcgc aatgcaggtt 642; atgggggctt ggggctgagt attgaaggcc caagcaaggt ggacatcaac tgtgaggaca 648; tggaggacgg gacatgcaaa gtcacctact gccccaccga gcccggcacc tacatcatca 654; acatcaagtt tgctgacaag cacgtgcctg gaagcccctt cactgtgaag gtgaccggcg 660; agggccgcat gaaggagagc cggc ggagacaggc accttccatc gccaccatcg 666; gcagcacctg tgacctcaac ctcaagatcc caggaaactg gttccagatg gtgtctgccc 672; aggagcgcct cacc ttcacacgca gcagccacac ctacacccgc acggagcgca 678; cggagatcag caagacgcgg gaga caaagcgcga ggtgcgggtg gaggagtcca 684; cccaggtcgg cggggacccc ttccctgctg tgtttgggga gggc cgggagcgcc 690; tgggatcctt ngcagcatc cagc gtga ggccagctct caggacatga 696; ctgcacaggt gaccagccca tcgggcaagg tggaagccgc agagatcgtc gagggcgagg 702; acagcgccta cagcgtgcgc cccc aggaaatggg gccccatacg gtcgctgtca 708; agtaccgtgg ccagcacgtg cccggcagcc cctttcagtt cactgtgggg ccgctgggtg 714; aaggtggtgc ccacaaggtg cgggccggag gcacagggct ggagcgaggt gtggccggcg 720; tgccagccga gttcagcatc tggacccggg aggctggcgc tgggggcctg tccattgctg 726; tggagggtcc tagcaaagcg gagattgcat atcg caaagatggc ggcg 732; tctcctatgt cgtccaggaa ccaggtgact atgaggtctc catcaagttc aatgatgagc 738; acatcccaga cagccccttt gtggtgcctg tggcctccct ctcggatgac gctcgccgtc 744; tcactgtcac cagcctccag gggc tcaaggtgaa ccagccagcg tcctttgccg 750; tgcagctgaa ngtgcccgg ggcgtgattg atgcccgggt gcacacaccc tcgggggctg 756; tggaggagtg ctacgtctct gagctggaca gtgacaagca caccatccgc ttcatccccc 762; acgagaatgg cgtccactcc atcgatgtca agttcaacgg tgcccacatc cctggaagtc 768; ccttcaagat tggg gagcagagcc aggctgggga cccaggcttg gtgtcagcct 7741 acggtcctgg gctcgaggga ggcactaccg gtgtgtcatc agagttcatc accc 780; tgaatgccgg ctcgggggcc ttgtctgtca ccattgatgg cccctccaag gtgcagctgg 786; gqga gtgtcctgag ggccatgtgg tcacttatac tcccatggcc aact 792; acctcattgc catcaagtac ggtggccccc agcacatcgt gggcagcccc ttcaaggcca 798; aggtcactgg tccgaggctg ggcc ttca cgaaacatcc acggttctgg 8041 tggagactgt gaccaagtcc tcctcaagcc ggggctccag ctacagctcc atccccaagt 8101 tctcctcaga tgccagcaag gtggtgactc ggggccctgg gctgtcccag gccttcgtgg 8161 gccagaagaa ctccttcacc gtggactgca gcaaagcagg caccaacatg atgatggtgg 8221 gcgtgcacgg ccccaagacc ccctgtgagg aggtgtacgt gaagcacatg gggaaccggg 8281 tgtacaatgt cacctacact gtcaaggaga aaggggacta catcctcatt gtcaagtggg 8341 gtgacgaaag tgtccctgga agccccttca aagtcaaggt cccttgaatc ccaaaagtgc 8401 ctccccagcc tcagccccca cctccagcca cacacacatt acacacacac acacacacac 8461 acaaatgtgc caga cacgcacaga cact acaaacacct gccttggggg 8521 gaag gcccagcctc cccaccccac cgcgccccag gggttggagg accttgtctg 8581 tgtcaggaca gtgtccctcc atgt gacatgaggg ccgactgggg ccaggctcag 864; gggcagaggc tgggacacaa ggggctggcg agggctgcga ggccagggaa gccctgagtt 870; tctggcgggg ctgagcagtg ggggagcatt tggg tgtctgtgtg tgaggtcacc 876; ctcaaactgc accgccggcc agataccctc ctgaccccga ggacttggtc tggtctctct 882; ggtggctaca accccagagt tttaaggact ggaa agcacaatca gagaagaaaa 8881 cagcccccga accagcagga gtggcctggc acatggaccg gcctgagcga tgtgcactcc 8941 acccaagcca ggctcccagg gggcctgatt tctctctcac tgtctctttt tttaaaatgg 9001 ttgcacggct catg gggggccttt cact gcgaggccca gctttctagg 9061 ttgc acatgtcatg cagctcagct gggagctgct taggtggaaa actccaaata 9121 aagtgcggct gtcgcagaaa aaaaaaaaa Pnneh13e uence vafiant: NCBI Reference Seguence: NP_001449.3 LOCUS N P_001449 ACCESSION N P_001449 1 mmnnsgysda etde mpstekdlae dapwkkiqqn tftrwcnehl kcvgkrltdl 61 qrdlsdglrl lsqk rmyrkfhprp nfrqulenv svaleflere hiklvsidsk 121 aivdgnlkli 1g1iwtli1h ysismpmwed eddedarkqt wiq nkqulpitn 181 fnrdwngka 1galvdncap glcpdweawd pnqpvenare amqqaddwlg 241 vdpnvdehsv mtylsquka klkpgapvrs kqlnpkkaia ygpgiepqgn tvlqpahftv 301 qtvdagvgev lvyiedpegh teeakvvpnn dkdrtyavsy vpkvaglhkv qnie 361 nvgm algdankvsa rgpglepvgn vankptyfdi ytagagtgdv avvivdpqgr 42; rdtvevaled kgdstfrcty rpamegphtv hvafagapit rspfpvhvse acnpnacras 48; grglqpkgvr vkevadfkvf tkgagsgelk vtvkgpkgte epvkvreagd gvfeceyypv 54; vpgkyvvtit prSp fequspeag vqurawgpg letgqvgksa igte 60; vgtlgfsieg psqakiecdd kgdgscdvry wptepgeyav hvicddedir dspfiahin 66; appdcfpdkv kafgpglept gcivdkpaef tidaraagkg dlklyaqdad gcpidikvip 72; ngdgtfrcsy vptkpikhti iiswggvnvp nvge gshpervkvy gpgvektng 78L ftvd cseagquvs pgvv gpaeadidfd iikndndtft vkytppqaqr 84L ytimvlfanq eipaspfhik vdpshdaskv kaegpglnrt gvevgkpthf agka 90; kldvqfagta kgevvrdfei idnhdysytv kytaqugnm avtvtyggdp vpkspfvvnv 96L appldlskik vqglnskvav gqeqafsvnt rgaggngld vrmtspsrrp ipcklepggg L02; aeaqavrymp peegpykvdi vpgs pfavegvlpp dpskvcaygp gtgglgltve gpceakiecq dngdgscavs ylptepgeyt kvra sgpglergkv geaatftvdc seageaelti tyhityspaf pgtytitiky gghpvpkfpt rvthpavdt _ ephgvlrevt arsl tatggnhvta rvlnpsgakt dtyvtdngdg tyrvqytaye L32; egvhlvevly devavpkspf rvgvtegcdp trvrafgpgl egglvnkanr gagt L38; gglglaiegp seakmsckdn kdgsctveyi pftpgdydvn itfggrpipg spfrvpvkdv L44; vdpgkvkcsg pglgagvrar qutftvdcs qagraplqva vlgptgvaep vevrdngdgt L50; htvhytpatd gpytvavkya dqevprspfk ikvlpahdas kvrasgpgln asgipaslpv L56; eftidardag eglltvqild pegkpkkani rdngdgtytv sylpdmsgry titikyggde L62; ipyspfriha lptgdaskcl vtvsigghgl gaclgpriqi tvda kaagegkvtc L68; tvstpdgae; dvdvvenhdg tape pgkyvitirf ggehipnspf hvlacdplph L74; eeepsequ; rqpyapprpg arpthwatee pvvpvepmes mlrpfnlvip favqueltg L80; gkta rpnitdnkdg titvryapte kglhqmgiky gspl hgmvnkpatf tivtkdageg glslavegps kaeitckdnk rfddkhipgs pftakitgdd smrtsqlnvg tstdvslkit L98; irapsgneep cllkrlpnrh pkev gehvvsvrks gkhvtnspfk ilvgpseigd 204; askvrvwgkg lseghtfqva efivdtrnag ygglglsieg pskvdinced medgtckvty 210; cptepgtyii nikfadkhvp gspftvkvtg egrmkesitr iati gstcdlnlki 216; pgnwfqmvsa qerltrtftr sshtytrter teisktrgge tkrevrvees tqvggdpfpa 222; vfgdflgrer itrq qegeassqdm taqvtspsgk veaaeivege dsaysvrfvp 228; qemgphtvav kyrgqhvpgs pqutvgplg eggahkvrag gvag iwtr 234; eagagglsia aeia fedrkdgscg vsyvvqepgd yevsikfnde hipdspfvvp 240; vaslsddarr ltvtslqetg lkvnqpasfa vqlngargvi darvhtpsga veecyvse;d 246; sdkhtirfip hengvhsidv kfngahipgs pfkirvgeqs qagdpglvsa ggtt 252; gvssefivnt lnagsgalsv tidgpskvql dcrecpeghv vtytpmapgn yliaikyggp 258: qhivgspfka kvtgprlsgg hslhetstvl vetvtkssss rgssyssipk fssdaskvvt 264; rgpglsqafv qunsftvdc skagtnmmmv tpce evyvkhmgnr tvke 2701 kgdyilivkw gdesvpgspf kvkvp Official Sym bol: HPX Official Name: hemopexin Gene ID: 3263 Organism: Homo sapiens Other Aliases: HX Other Designations: beta—1B-glycoprotein Nucleotide seguence: NCBI Reference Seguence: NM_000613.2 LOCUS NM_000613 ION NM_000613 l aactctatat agggagttca actggtcacc cagagctgtc ctgtggcctc tgcagctcag 6; catggctagg gtactgggag cacccgttgc actggggttg tggagcctat gctggtctct 12; ggccattgcc acccctcttc ctccgactag tgcccatggg aatgttgctg aaggcgagac l8; caagccagac ccagacgtga ctgaacgctg ctcagatggc tggagctttg atgctaccac 24L cctggatgac aatggaacca tgctgttttt taaaggggag tttgtgtgga agagtcacaa ; ccgg gagttaatct cagagagatg gaagaatttc cccagccctg tggatgctgc 36; attccgtcaa ggtcacaaca gtgtctttct gatcaagggg gtct gggtataccc 42; tcctgaaaag aaggagaaag gatacccaaa gttgctccaa gatgaatttc ctggaatccc 48; atccccactg gatgcagctg tggaatgtca agaa tgtcaagctg aaggcgtcct 54; cttcttccaa ggtgaccgcg agtggttctg ggacttggct acca tgaaggagcg 60; ttcctggcca gctgttggga actgctcctc gaga tggctgggcc gctactactg 66; cttccagggt aaccaattcc tgcgcttcga ccctgtcagg gtgc ctcccaggta 72; cccgcgggat gtccgagact acttcatgcc ctgccctggc agaggccatg gacacaggaa 78; tgggactggc catgggaaca gtacccacca tggccctgag tatatgcgct gtagcccaca 841 tctagtcttg tctgcactga cgtctgacaa ccatggtgcc acctatgcct tcagtgggac 90; ccactactgg cgtctggaca ccagccggga tggctggcat agctggccca ttgctcatca 96L ccag ggtccttcag cagtggatgc tgccttttcc tgggaagaaa aactctatct 1021 gggc acccaggtat atgtcttcct gacaaaggga ggctataccc ttatccgaag aga aggaagtcgg gacccctcat gggattatcc tggactctgt 114; ggatgcggcc tttatctgcc ctgggtcttc tcggctccat gcag gacggcggct 120; gtggtggctg gacctgaagt caggagccca agccacgtgg acagagcttc ccca 126; tgagaaggta gcct tgga aaagtccctt ggccctaact catgttccgc 132; caatggtccc ggcttgtacc tcatccatgg tcccaatttg tactgctaca gtgatgtgga 138; gaaactgaat aagg cccttccgca accccagaat gtgaccagtc tcctgggctg 144; cactcactga ggggccttct gacatgagtc tggcctggcc ccacctccta gttcctcata 150; ataaagacag attgcttctt Cgcttctcac tgaggggcct tctgacatga gtctggcctg 156; gccccacctc cccagtttct cataataaag acagattgct tcttcacttg aatcaaggga 1621 cctaaaaaaa aaaaa Protein seguence: NCBI Reference Seguence: NP_000604.1 LOCUS NP_000604 ACCESSION NP_000604 1 marvlgapva 1g1wslcwsl aiatplppts ahgnvaeget kpdpdvterc sdgwsfdatt 61 1ddngtm1ff kgefvwkshk wdreliserw vdaa frqghnsvfl ikgdkvwvyp 121 pekkekgypk llqdefpgip spldaavech rgecqaegvl ffqurewfw dlatgtmker 181 swpavgncss alrwlgryyc fqganlrfd pvrgevppry yfmp cpgrghghrn 241 gtghgnsthh gpeymrcsph lvlsaltsdn hgatyafsgt hywrldtsrd gwhswpiahq 301 wpqusavda afsweeklyl vqgtquvfl tkggytlvsg ypkrlekevg tphgiildsv 361 daaficpgss rlhimagrrl wwldlksgaq atwtelpwph ekvdgalcme kslgpnscsa 421 ngpglylihg pnlycysdve klnaakalpq pqnvtsllgc th TLN1 Official Symbol: TLN1 Official Name: talin 1 Gene ID: 7094 Organism: Homo sapiens Other Aliases: RP11-112J3.1, ILWEQ, TLN Other Designations: talin-1 Nucleotide seguence: NCBI Reference Seguence: NM_006289.3 LOCUS NM_006289 ACCESSION NM_006289 l ggaagagttc tagcctgaaa gggaactcgg tcct ggcgtcctct ccggattgcg 6; ccgcaccctc tgcc agggaggggc tgccgggctt tggcggctcc cgagcatcga 12; gaacggggcc agagcagctt cctgcctgcc ccccgcgacc atagagcgcg ggcccagggc 18; gccgcccgcg ggac gttcccagga cggaagtggc cgagagagtg tcgaagggag 24; gccg gagcccgagg gcgacccgag aagcggcggg gcggcgggcc ggcgggcggg ; gcgcagagcc aggcagcgca ggtatagcca ggctggagaa aagaagctgc ggtt 36; gcactttcac tcag cattgggaat gtggtgaaga agtt tgagccgtct 42; accatggtgt acgacgcctg ccgcatcatt cgtgagcgga tcccagaggc cccagctggt 48; cctcccagcg actttgggct ctttctgtca gatgatgacc ccaaaaaggg gctg 54; gaggctggga aagctttgga ctactacatg ctccgaaatg gggacactat ggagtacagg 60; aagaaacaga gacccctgaa gatccgtatg ggaa ctgtgaagac gatcatggtg 66L gatgactcta agactgtcac tgacatgctc atgaccatct gtgcccgcat tggcatcacc 72L aatcatgatg aatattcatt ggttcgagag ctgatggaag agaaaaagga aaca 78; gggaccttaa gaaaggacaa gacattgctg cgagatgaaa agaagatgga gaaactaaag 84; cagaaattgc acacagatga gaac tggctggacc atggtcggac actgagggag 90; gtag aggagcacga gacgctgctg agga agttctttta ctcagaccag 96; aatgtggatt cccgggaccc tgtacagctg aacctcctgt atgtgcaggc acgagatgac L02; atcctgaatg gctcccaccc tgtctccttt gacaaggcct gtgagtttgc tggcttccaa L08; tgccagatcc agtttgggcc ccacaatgag cagaagcaca aggctggctt ccttgacctg L14; aaggacttcc tgcccaagga gaag cagaagggag agcgtaagat cttccaggca L20; aatt gtgggcagat gagtgagatt gaggccaagg tccgctacgt gaagctagcc L26; cgttctctca agacttacgg tgtctccttc ttcctggtga aaat gaaagggaag L32; aacaagctag ggct catc accaaggagt gtgtgatgcg aagaccaagg aagtgatcca ggagtggaac ctcaccaaca gctg cccaaaagct tcaccctgga ttttggagat taccaagatg gctattactc actgaagggg agcagattgc acagctcatt gccggctaca tcgatatcat aaaaaaagca aggatcactt tgggctggaa ggagatgagg agtctactat tcagtgtccc ccaaaaagtc aacagtcctg cagcagcaat acaaccgggt ggggaaagtg L68; gagcatggct ctgtggccct gcctgccatc atgcgctctg gagcctctgg tcctgagaat L74; ttccaggtgg gcagcatgcc ccctgcccag cagcagatta ccagcggcca gatgcaccga L80; ggacacatgc ctcctctgac ttcagcccag caggcactca ctggaaccat taactccagc L86; gccg tgcaggctgc ccaggccacc ctggatgact ttgacactct gccgcctctt L92; ggccaggatg ctgcctctaa ggcctggcgt aaaaacaaga tggatgaatc aaagcatgag L98; atccactctc aggtagatgc catcacagct ggtactgcgt ctgtggtgaa cctgacagca 204; ggggaccctg ctgagacaga ctataccgca gtgggctgtg cagtcaccac aatctcctcc 210; aacctgacgg agatgtcccg tggggtgaag ctgctggctg ccttgctgga ggacgaaggc 216; ggcagtggtc tgtt gcaggcagca aagggccttg cgggagcagt actg 222; ctgcgcagtg cccaaccagc cagtgctgag caga acctgctgca agcagctggg 228; aacgtgggcc aggccagtgg gttg caacaaattg gggaaagtga tactgacccc 234; cacttccagg atgcgctaat gcagctcgcc aaagctgtgg caagtgctgc agctgccctg 2401 gtcctcaagg gtgt ggcccagcgg acagaggact cgggacttca agtt 246; attgctgcag caacacagtg tgccctatcc acttcccaac tagtggcctg tactaaggtg 252; gtggcaccta caatcagctc acctgtctgc caagagcaac tggtggaggc tggacgactg 258; gtagccaaag aggg ctgtgtgtct gcctcccagg cagctacaga ggatgggcaa 264; ctgttgcgag gggtaggagc caca gctgtcaccc aggccctaaa tgagctgctg 270; cagcatgtga aagcccatgc cacaggggct gggcctgctg gccgttatga ccaggctact 276; gacaccatcc taaccgtcac tgagaacatc tttagctcca tgggtgatgc tggggagatg 282; gtgcgacagg cccgcatcct ggcccaagcc acatctgacc tggtcaatgc catcaaggct 288; gatgctgagg gggaaagtga tctggagaac tcccgcaagc gtgc tgccaagatc 294; ctagctgatg ccacagccaa gatggtagag gctgccaagg gagcagctgc ccaccctgac 300; agtgaggagc agcagcagcg gctgcgggag gcagctgagg ggctgcgcat ggccaccaat 306; gcagctgcgc ccat caagaaaaag ctggtgcagc gcctggagca tgcagccaag 312; caggctgcag cctcagccac acagaccatc gctgcagctc agcacgcagc cccc 318; aaggcctctg ccggccccca gctg gtgcagagct gcaaggcagt ggcagagcag 324; attccactgc tggtgcaggg cgtccgagga agccaagccc acag cgct 330; cagcttgccc tcattgctgc cagccagagc ttcctgcagc caggtgggaa gatggtggca 336; gctgcaaagg cctcagtgcc aacgattcag gaccaggctt cagccatgca gctgagtcag 342; tgtgccaaga acctgggcac cgcgctggct gaactccgga cggctgccca gaaggctcag 348; gaagcatgtg gacctttgga gatggattct agtg tggtacagaa tctagagaaa 354; gatctacagg aagtgaaggc agcagctcga gatggcaagc ttaaaccctt acctggggag 360; acaatggaga agtgtaccca ggacctgggc acca aagccgtgag ctcagccatc 366; gcccagctac tgggagaggt tgcccagggc aatgagaatt atgcaggtat tgcagctcgg 372; gatgtggcag gtgggctgcg gtcactggcc caggccgcta ggggagtcgc tgcactgacg 378; tcagatcctg cagtgcaggc cattgtactt gatacggcca gtgatgtgct ggacaaggcc 384; ctca ttgaggaggc gaaaaaggca gctggccatc caggggaccc tgagagccag 390; cagcggcttg cccaggtggc agtg acccaggctc tgaaccgctg tgtcagctgc 396; ctacctggcc agcgcgatgt ggataatgcc ctgagggcag atgc cagcaagcga 402; ctcctgagtg actcgcttcc tcctagcact gggacatttc aagaagctca gagccggttg 408; gctg ctgctgggct gaatcaggca gccacagaac tggtgcaggc ctctcgggga 414; acccctcagg acctggctcg agcctcaggc ggac aggacttcag caccttcctg 4201 gaagctgqtg tggagatggc aggccaggct ccgagccagg aggaccgagc ccaagttgtg 426; tccaacttga agggcatctc catgtcttca agcaaacttc ctgc caaggccctg 432; tccacggacc cccc taacctcaag agtcagctgg ctgcagctgc cagggcagta 438; agca agct catcactatg tgcacccagc aggcacccgg ccagaaggag 444; tgtgataacg ccctgcggga attggagacg gtccgggaac tcctggagaa cccagtccag 450; cccatcaatg acatgtccta ctttggttgc ctggacagtg taatggagaa ctcaaaggtg 456; ctgggcgagg ctgg catctcccaa aatgccaaga acggaaacct gccagagttt 462; ggagatgcca tttccacagc ctcaaaggca ctttgtggct aggc agctgcacag 468; gctgcatatc tggttggtgt ctctgacccc caag ctggacagca agggctagtg 474; gagcccacac agtttgcccg tgcaaaccag gcaattcaga tggcctgcca gagtttggga 480; gagcctggct gtacccaggc ccaggtgctc tctgcagcca ccattgtggc taaacacacc 486; tctgcactgt gtaacagctg tcgcctggct tctgcccgta atcc tactgccaag 492; cgccagtttg tacagtcagc caaggaggtg gccaacagca atct tgtcaagacc 498; atcaaggcgc tagatggggc cttcacagag gagaaccgtg cccagtgccg agcagcaaca 504; gcccctctgc tggaggctgt ggacaatctg agtgcctttg cgtccaaccc ctcc 510; agcattcctg cccagatcag ccctgagggt gcca tggagcccat tgtgatctct 516; gccaagacaa tgttagagag tgccggggga ctcatccaga cagcccgggc cctcgcagtc 522; aatccccggg accccccgag ggtg ctggccggcc actcccgtac tgtctcagac 528; aaga agctaattac aagcatgagg gacaaggctc cagggcagct ggagtgtgaa 534; acggccattg cagctctgaa cagttgtcta cgggacctag cttc cctcgctgca 540; gtcagccagc agcttgctcc ccgtgaggga atctctcaag aggccttgca cactcagatg 546; ctcactgcag tccaagagat ctcccatctc attgagccgc atgc tgcccgggct 552; gaagcctccc agctgggaca caaggtgtcc cagatggcgc agtactttga cacc 558; gcag tgggtgctgc ctccaagacc ctgagccacc cgcagcagat ggcactcctg 564; gaccagacta aaacattggc agagtctgcc ctgcagttgc tatacactgc caaggaggct 570; ggtggtaacc aagc agctcacacc gccc tggaggaggc gatg 576; gagg ccgtagagga cctgacaaca accctcaacg aggcagccag tgctgctggg 582; ggtg gcatggtgga ctccatcacc caggccatca accagctaga tgaaggacca 588; atgggtgaac cagaaggttc cttcgtggat taccaaacaa ctatggtgcg gacagccaag 594; gccattgcag tgaccgttca ggagatggtt accaagtcaa acaccagccc agaggagctg 6001 ggccctcttg ctaaccagct gaccagtgac tatggccgtc tggcctcgga ggccaagcct 606; gcagcggtgg ctgctgaaaa tgaagagata ggttcccata tcaaacaccg ggtacaggag 612; ctgggccatg gctgtgccgc tctggtcacc aaggcaggcg ccctgcagtg cagccccagt 618; gatgcctaca ccaagaagga gctcatagag tgtgcccgga gagtctctga gaaggtctcc 624; cacgtcctgg ctgcgctcca ggctgggaat cgtggcaccc aggcctgcat cacagcagcc 630; agcgctgtgt ctggtatcat cctc gacaccacca tcatgttcgc tggc 636; acgctcaatc gtgagggtac tgaaactttc gctgaccacc gggagggcat cctgaagact 642; gcgaaggtgc tggtggagga caccaaggtc ctggtgcaaa acgcagctgg gagccaggag 648; aagttggcgc aggctgccca cgtg gcgaccatca cccgcctcgc tgatgtggtc 654; aagctgggtg gcct tgag gaccctgaga cccaggtggt actaatcaac 660; gcagtgaaag atgtagccaa agccctggga gacctcatca gtgcaacgaa ggctgcagct 666i ggcaaagttg gagatgaccc tgctgtgtgg cagctaaaga actctgccaa ggtgatggtg 672; accaatgtga tgct agta aaagccgtgg aagatgaggc caccaaaggc 678; actcgggccc caac cacagaacac cagg agctggcggt tttctgttcc 684; ccagagccac ctgccaagac ctctacccca gaagacttca tccgaatgac caagggtatc 690; accatggcaa ccgccaaggc cgttgctgct ggcaattcct gtcgccagga agatgtcatt 696; gccacagcca atctgagccg ccgtgctatt atgc ttcgggcttg caaggaagca 702; gcttaccacc cagaagtggc ccctgatgtg ngcttcgag ccctgcacta tggccgggag 708; tgtgccaatg gctacctgga actgctggac catgtactgc tgaccctgca aagc 714; ccagaactga agcagcagtt gacaggacat tcaaagcgtg tggctggttc cgtcactgag 720; ctcatccagg ctgctgaagc catgaaggga acagaatggg tagacccaga ggaccccaca 726; gtcattgctg agaatgagct cctgggagct gcagccgcca ttgaggctgc agccaaaaag 732; ctagagcagc tgaagccccg ggccaaaccc aaggaggcag atgagtcctt gaactttgag 738; gagcagatac tagaagctgc caagtccatt gcagcagcca ccagtgcact ggtaaaggct 744; gcgtcggctg cccagagaga actagtggcc caagggaagg tgggtgccat tccagccaat 750; gcactggacg atgggcagtg gtcccagggc ctcatttctg ctgcccggat ggtggctgcg 756; aaca atctgtgtga ggcagccaat gcagctgtac aaggccatgc cagccaggag 762; aagctcatct catcagccaa gcaggtagct gcctccacag cccagctcct tgtggcctgc 768; aaggtcaagg agga ctcggaggca atgaaacgac ttcaggctgc tggcaacgca 774; cgag cctcagataa tctggtgaaa caga aggctgcagc ctttgaagag 7801 caggagaatg agacagtggt agag aagatggttg gcggcattgc ccagatcatc 786; gcagcacagg aagaaatgct tcggaaggaa cgagagctgg aagaggcgcg gaagaaactg 792; gcccagatcc ggcagcagca gttt ctgccttcag agcttcgaga tgagcactaa 798; agaagcctct tctatttaat gcagacccgg cccagagact gtgcgtgcca aagc 804; cttctgggct gtcggggccc aacctgccca accccagcac tccccaaagt gcctgccaaa 8101 ccccagggcc tggccccgcc cagtcccgca cccc tgtcccctcc ccaaccccaa 8161 gtgccttcat gccctagggc cccccaagtg cctgcccctc cccagagtat tcca 8221 agagtattat taacgctgct gtacctcgat ctgaatctgc ccca gcccactcca 8281 ccctgccagc cggc cagtccccac agcctcatca gctctcttca ccgttttttg 8341 tctt CCCCC&CCCC cagctaccca taggggctgc agagttataa aaca 8401 ggtcatgctc caataaaaat gattctacct acaa Protein seguence: NCBI Reference Seguence: 280.3 LOCUS N P_006280 ACCESSION NP_006280 1 mvalslkisi gnvvktmqfe pstmvydacr iireripeap agppsdfglf lsdddpkkgi 61 wleagkaldy ymlrngdtme yrkquplki rmldgtvkti mvddsktvtd mlmticarig 12; itnhdeyslv relmeekkee itgtlrkdkt kmek 1qu1htdde 1nw1dhgrt1 181 het ffys dqnvdsrdpv qlnllyvqar shpv sfdkacefag 24; chqiquph neqkhkagfl dlkdflpkey vqugerkif qahkncgqms eieakvryvk ; larslktygv sfflvkekmk gknklvprll gitkecvmrv dektkeviqe wnltnikrwa 36; tldf gdyngyysv qttegeqiaq liagyidiil kkkkskdhfg legdeestml 42; edsvspkkst vquqynrvg kvehgsvalp aimrsgasgp enfqvgsmpp aqqqitsgqm 48; hrghmpplts aqqaltgtin ssmqavqaaq atlddfdtlp plquaaska wrknkmdesk 54; heihsquai vvnl tagdpaetdy tavgcavtti ssnltemsrg vkllaalled 60; eggsgrpllq aakglagavs ellrsaqpas aeprqnllqa agnvgqasge lquigesdt 66; dphfqdalmq lakavasaaa alvlkaksva qrtedsglqt qviaaatqca lstsqlvact 72; kvvaptissp cheqlveag rlvakavegc vsasqaated gqllrgvgaa atavtqalne 781 llqhvkahat gagpagrydq atdtiltvte nifssmgdag emvrqarila qatsdlvnai 84L kadaegesdl ensrkllsaa kiladatakm veaakgaaah pdseeqqqu reaaeglrma 90; tnaaaqnaik kklvqueha akqaaasatq tiaaaqhaas tpkasagpqp llvqsckava 96L eqipllvqgv usp saqlaliaas qsflqugkm vaaakasvpt iqdqasamql 1021 sqcaknlgta 1ae1rtaaqk aqeacgplem dsalsvvqnl ekdlqevkaa ardgklkap 1081 getmekctqd lgnstkavss aiaqllgeva qgnenyagia ardvagglrs 1aqaargvaa 114; 1tsdpavqai dvld kasslieeak kaaghpgdpe sqquaqvak avtqalnrcv L20; sclpgqrdvd nalravgdas krllsdslpp eaqs agln L26; rgtpqdlara sgrfqufst fleagvemag qapsqedraq vvsnlkgism ssslelaak L32; alstdpaapn lksqlaaaar nqli tmctqqapgq kecdnalrel etvreLlenp L38; vqpindmsyf gcldsvmens mtgi sqnakngnlp efgdaistas teaa L44; aqaaylvgvs dpnsqagqqg lveptqfara nqaiqmacqs lgepgctqaq vlsaativak L50; htsalcnscr lasarttnpt aququsak evanstanlv ktikaldgaf teenraqcra L56; ataplleavd nlsafasnpe fssipaqisp egraamepiv isaktmlesa svlaghsrtv sdsikklits mrdkapgqle cetaiaalns egisqealht qmltavqeis hlieplanaa raeasqlghk vsqmaqyfep L74; ltlaavgaas qqma lldqtktlae salqllytak eaggnpkqaa htqeaLeeav L80; qmmteavedl tttlneaasa agvvggmvds itqainqlde gpmgepegsf vdyqttmvrt L86; akaiavtvqe mvtksntspe elgplanqlt sdygrlasea kpaavaaene vtkagalqcs psdaytkkel iecarrvsek alqa L98; giia dldttimfat agtlnregte tfadhregil ktakvlvedt kvlvqnaags 204; qeklaqaaqs svatitrlad vvklgaaslg aedpetqvvl inavkdvaka lgdlisatka 210; aagkvgddpa qulknsakv mvtnvtsllk tvkavedeat kgtraleatt ehirqelavf 216; cspeppakts tpedfirmtk gitmatakav aagnscrqed viatanlsrr aiadmlrack 222; eaayhpevap dvrlralhyg recangylel ldhvllthk ult ghskrvagsv 228; teliqaaeam kgtewvdped ptviaenell eaaa kkleqlkpra kpkeadesLn 234; feeqileaak siaaatsalv qrel vaqgkvgaip analddngs qglisaarmv 240; aaatnnlcea anaavqghas qeklissakq vaastaqllv ackvkadqu eamkrlqaag 246; navkrasdnl vkaaqkaaaf tvvv giaq iiaaqeemlr kereleeark 2521 klaqirqqqy kflpselrde h PSME2, Official Symbol: PSME2 and Name: some (prosome, macropain) activator subunit 2 (PA28 beta) [Homo sapiens] Other Aliases: PA28B, PA28beta, REGbeta Other Designations: 11S regulator complex beta subunit; 11$ regulator complex subunit beta; MCP activator, 31-kD subunit; REG-beta; activator of multicatalytic protease subunit 2; cell migration-inducing protein 22; proteasome activator 28 subunit beta; some activator 28-beta; proteasome activator complex subunit 2; proteasome activator hPA28 subunit beta LOCUS NM_002818 ACCESSION NM_002818 VERSION NM_002818.2 GI:30410791 1 tggggagtga aagcgaaagc ccgggcgact agccgggaga ccagagatct agcgactgaa 6; gcagcatggc caagccgtgt ggggtgcgcc tgagcgggga agcccgcaaa caggtggagg 12; tcttcagaca gaatcttttc caggaggctg aggaattcct ctacagattc ttgccacaga l8; aaatcatata cctgaatcag ctcttgcaag ccct caatgtggct gacttgactt 24L ccctccgggc cccactggac atccccatcc cagaccctcc acccaaggat atgg ; aaacagataa gcaggagaag aaagaagtcc ataagtgtgg ccct gggaatgaga 36; aagtcctgtc cctgcttgcc ctggttaagc cagaagtctg caaa gagaaatgca 42; ttctggtgat tacatggatc caacacctga tccccaagat tgaagatgga aatgattttg 48; gggtagcaat ccaggagaag gtgctggaga gggtgaatgc cgtcaagacc gaag 54; ctttccagac aaccatttcc aagtacttct gtgg ggatgctgtg gccaaggcct 60; ccaaggagac tcatgtaatg cggg ccttggtgca agat gcct 66; atggggagct catg gtgctggacc tgagggcctt ctatgctgag ctttatcata 72; tcatcagcag caacctggag gtca acccaaaggg tgaagaaaag ccatctatgt 78; actgaacccg ggactagaag gaaaataaat gatctatatg ttgtgtgga LOCUS NP_002809 ACCESSION NP_002809 VERSION NP_002809.2 GI:30410792 l makpcgvrls gearkqvevf rqnlfqeaee flyrflqui iylnqllqed slnvadltsl 61 rapldipipd pppkddemet quekkevhk cgflpgnekv lsllalvkpe thlkekcil 121 hip kiedgndfgv aiqekvlerv navktkveaf qttiskyfse rgdavakask 181 ethvmdyral vherdeaayg elramvldlr afyaelyhii ssnlekivnp kgeekpsmy Q9BQE5 Official Symbol: APOL2 and Name: apolipoprotein L, 2 Other Aliases: APOL—II, APOL3 Other Designations: oprotein L—II; apolipoprotein L2 LOCUS NM_030882 ACCESSION NM_030882 VERSION NM_030882.2 GI:22035654 gtgctgggga gcagcgtgtt tactgtgctt ggtcatgagc tgctgggaag cactttccct ttcgaattcc agggtatatc tgggaggccg gaggacgtgt cacagatgca cagctggacg tgggatccac acagctcaga acagttggat cttgctcagt l8; caga ggaagatccc ttggacaaga ggaccctgcc ttggtgtgag agtgagggaa 24; gctg gggt taaggaaaac cttccagtct ggacagtgac caaggaaagc ccctcggtaa cccagccgct ggcaccatga acccagagag gatt accttaagta tttccaggac caagtgagca gagagaatct ctgactgatg atgaagcctg gaatggattc gtggctgctg ctgaactgcc cagggatgag 48; gcagatgagc tccgtaaagc tctgaacaag cttgcaagtc acatggtcat caaa 54; aaccgccacg ataaagacca gcagcacagg cagtggtttt agtt tcctcggttg 60; aaaagggagc atca cataaggaag gccc ttgcagagga ggttgagcag 66; gtccacagag gcaccaccat tgccaatgtg gtgtccaact gcac tacctctggc 72; accc tcctcggcct gggtctggca cccttcacag aaggaatcag gctc 78; ttggacactg gcatgggtct gggagcagca gctgctgtgg ctgggattac tgtg 84; gtagaactag taaacaaatt gcgggcacga gcccaagccc gcaacttgga ccaaagcggc 90; accaatgtag caaaggtgat gaaggagttt gtgggtggga acacacccaa tgttcttacc 96; ttagttgaca attggtacca agtcacacaa gggattggga ggaacatccg tgccatcaga L02; cgagccagag ccaaccctca gttaggagcg tatgccccac ccccgcatat cattgggcga L08; atctcagctg aaggcggtga acaggttgag agggttgttg aaggccccgc ccaggcaatg L14; agcagaggaa ccatgatcgt gggtgcagcc actggaggca tcttgcttct gctggatgtg L20: gtcagccttg catatgagtc aaagcacttg cttgaggggg caaagtcaga gtcagctgag 126; gagctgaaga agcgggctca ggagctggag gggaagctca actttctcac caagatccat L32; gagatgctgc agccaggcca agaccaatga ccccagagca gtgcagccac cagggcagaa 138; atgccgggca caggccagga caaaatgcag tttt tttttttttt ttttttttga L44; gatggagtct Cgctctatcg cccaggatgg agtgcagtgg ctcaatctcg aactccgcct cccgggttca caccattctc cggcctcagt ctcccgagta gctgggacta 1561 caggcacctg ccaccacgcc ngctaattt ttttgtattt tcactggaga cggggtttca 1621 ctgtgttagc cacgatggtc tccatctcct gacctcgtga tctgcccacc tcggcctccc 1681 aaagtgctgg gattacaggc gtgagccacc gcgcctggcc aaaatgcaga cattttatta 1741 gggggataag gagggcaagg taaagcttat ggaactgagt gttagtgact ttggcatttg 1801 tgtagctgag cacagcaagg ttaa tgcagatggc aagtgcacca aggagaaggc 1861 aggaacactg gagcctgcaa agga gagaggactg gagagtgtgg 1921 gaagtagttt actttggact tata ttgggcgaag aatagagggg gagcttgcag L981 gaaccagcaa tgagaaggcc aggaaaagaa agagctgaaa atggagaaaa ccagagttag 2041 aactgttgga tacaggagaa gaaacagcag ctccactacc gacccccccc caggtttgat 2101 gtccttccaa gaataaagtc tttccctggt gatggtctct cgctctgtct ttccagcatc 216; cactctccct tgtccttctg ggggtgtatc acagtcagcc agtggcttct tcatgatggt 222; ggttggggtg gttgtcatgt gacgggtccc ctccaggtta ctaaagggtg catgtcccct 228; gcttgaaccc tgagaggcag gtggtaggcc atggccacaa tccccagctg aggagcaggt 234; gtccctgaga actt gagt gaac caaccaatga aaacagtccc 240; atcgctctta aagt aaacagtcag aagattagca cagt ttagcattgg 246; gaggaagcac agatctctag agctgtcctg tcgctgccca acct gtgtgtaagt 252; cccaataaac tcacctactc accaa LOCUS NP_112092 ACCESSION 092 N NP_112092.1 GI:13562090 1 mnpessifie dylkyfqdqv srenllqllt ddeawngfva aaelprdead elrkalnkla 61 shmvmkdknr hdkdqthqw flkefprlkr eledhirklr alaeeveqvh rgttianvvs 121 nsvgttsgil lapf tegisfvlld tgmglgaaaa vagitcsvve lvnklraraq 181 arnldqsgtn vakvmkefvg gntpnvltlv dnquvtqgi grnirairra ranpqlgaya 241 ppphiigris aeggeqverv vegpaqamsr gtmivgaatg gilllldvvs layeskhlle 301 gaksesaeel kkraqelegk lnfltkihem lquqdq Q9Y262 al Symbol: EIF3L and Name: eukaryotic translation initiation factor 3, subunit L Other Aliases: AL022311.1, EIF3EIP, EIF3S11, EIF3S6IP, HSPC021, HSPC025, MSTP005 Other ations: eIEF associated protein HSPC021; eukaryotic translation initiation factor 3 subunit 6-interacting n; eukaryotic translation initiation factor 3 subunit E- interacting protein; eukaryotic translation initiation factor 3 subunit L LOCUS 242923 ION NM_001242923 VERSION NM_001242923.1 GII339275830 L gctgaacttc cggcctcagg acgcaggcgc gggccgctca tttcgctctt tccggcggtg 6L ctcgcaagcg aggcagccat gtcttatccc gctgatgatt atgagtctga ggcggcttat 12; tacg cttatcccag cgactatgat atgcacacag gagatccaaa gcaggacctt 18; gcttatgaac gtcagtatga acagcaaacc tatcaggtga aggt gatcaaaaac 24; ttcatccagt atttccacaa aactgtctca gatttgattg accagaaagt gtatgagcta ; caggccagtc gtgtctccag tgatgtcatt gaccagaagg tgtatgagat ccaggacatc 36; aaca gctggaccaa gctgactgaa agattcttca agaatacacc ttggcccgag 42; gctgaagcca ttgctccaca ggttggcaat gtct tcctgatttt atacaaagaa 48; ttatactaca ggcacatata tgccaaagtc agttttcagt gtca gtaccgctgt 54; aagactgcca agaagtcaga ggaggagatt gactttcttc gttccaatcc caaaatctgg 60; aatgttcata gtgtcctcaa tgtccttcat tccctggtag acaaatccaa catcaaccga 66; cagttggagg tatacacaag ngaggtgac cctgagagtg tggctgggga gtatgggcgg 72; ctct acaaaatgct tggttacttc agcctggtcg tccg ctcc 78; ctgttaggag attactacca ggccatcaag gtgctggaga acatcgaact gaag 84; agtatgtatt cccgtgtgcc agagtgccag gtcaccacat actattatgt tgggtttgca 90; tatttgatga tgcgtcgtta ccaggatgcc atccgggtct tcgccaacat cctcctctac 96; atccagagga gcat gttccagagg accacgtaca agtatgagat gattaacaag 102: cagaatgagc agatgcatgc gctgctggcc ctca cgatgtaccc catgcgtatt 108; gatgagagca ttcacctcca gctgcgggag gggg acaagatgtt gcgcatgcag 114; aaaggtgacc cacaagtcta actt ttcagttact cctgccccaa gttcctgtcg 120; cctgtagtgc ccaactatga taatgtgcac cccaactacc acaaagagcc cttcctgcag 126; cagctgaagg tgttttctga tgaagtacag cagcaggccc agctttcaac catccgcagc 132; ttcctgaagc tctacaccac catgcctgtg gccaagctgg ctggcttcct ggacctcaca L38; gagcaggagt tccggatcca gcttcttgtc ttcaaacaca agatgaagaa cctcgtgtgg L44; accagcggta tctcagccct ggatggtgaa tttcagtcag cctcagaggt tgacttctac L50; attgataagg acatgatcca catcgcggac accaaggtcg ccaggcgtta tggggatttc L56; ttcatccgtc agatccacaa ggag cttaatcgaa ccctgaagaa gatgggacag L62; agaccttgat gatattcaca cacattcagg aacctgtttt ttat aggcaggaag L68; tgtttttgct accgtgaaac ctttacctag atcagccatc tcaa ctcagttaac L74: aagttaagga ccgaagtgtt tcaagtggat aaag tgga gccagatttg L80; tcgtctcatt attgtaggag agaatttgtg ggttgtggca gtaatacatt gtgt L86; cctgatgctt tcaggataca tcagttgtta gtgtttaaat tgagttattt ttattttgtg L921 cttttgagat tcac tctgtct LOCUS NP_001229852 ACCESSION NP_001229852 VERSION NP_001229852.1 275831 I msypaddyes yayp tgdp kqdlayerqy eqqtyqvipe viknfiqyfh 6; ktvsdliqu vyelqasrvs sdviquvye iqdiyenswt klterffknt pwpeaeaiap 12; qvgndavfli lykelyyrhi ufs qyrcktakks eeeidflrsn pkiwnvhsvl 18; nvlhslvdks ninrqlevyt sggdpesvag eygrhslykm lgyfslvgll rlhsllgdyy 24; qaikvlenie lnkksmysrv pecqvttyyy vgfaylmmrr yqdairvfan illyiqrtks ; qurttykye minkqneth allaialtmy pmridesihl qlrekygdkm lrmqudpqv 36; yeelfsyscp kaSpvvpny dnvhpnyhke pfqulkvfs dequqaqls tirsflklyt 42; tmpvaklagf lsteqefri qllvfkhkmk nlthsgisa ldgequase kdmi 48; hiadtkvarr ygdffirqih kfeelnrtlk kmgqrp Official Symbol: RABIB and Name: RABlB, member RAS oncogene family [Homo sapiens] Other Designations: ras-related protein Rab-1B; small GTP-binding protein LOCUS NM_030981 ION NM_030981 XM_001134089 VERSION NM_030981.2 GI:116014337 gggg cctctggggc ggagcggcca ccatcttgga acgggaggcg gagcagagtc 6; gactgggagc gaccgagcgg gccgccgccg ccgccatgaa atat gactacctgt 12; ttaagctgct tttgattggc gactcaggcg tgggcaagtc atgcctgctc ctgcggtttg 18; ctgatgacac gtacacagag agctacatca gcaccatcgg ggtggacttc aagatccgaa 24; ccatcgagct ggatggcaaa actatcaaac tctg ggacacagcg ggccaggaac ; ggttccggac catcacttcc agctactacc ggggggctca tggcatcatc gtggtgtatg 36L ctga ccaggaatcc tacgccaacg tgaagcagtg gctgcaggag attgaccgct 42; atgccagcga gaacgtcaat aagctcctgg tgggcaacaa gagcgacctc accaccaaga 48L aggtggtgga caacaccaca gccaaggagt ttgcagactc tctgggcatc cccttcttgg 54L agacgagcgc caagaatgcc accaatgtcg agcaggcgtt catgaccatg gctgctgaaa 60; tcaaaaagcg gatggggcct ggagcagcct gcga gcggcccaat ctcaagatcg 66; acagcacccc tgtaaagccg gctggcggtg gctgttgcta gcac atggagtggg 72; acaggagggg gcaccttctc cagatgatgt ccctggaggg ggcaggaggt acctccctct 78; ccctctcctg gggcatttga gtctgtggct ttggggtgtc ctgggctccc cttc 84; tggcccatct gcctgctgcc ctgagccccg gttctgtcag ggtccctaag ggaggacact 90; cagggcctgt gcag ggcggaggcc tgctgtgctg ttgcctctag gtgactttcc 96; aagatgcccc cctacacacc tttctttgga gctc ttctgtcggt gtccctccca L02; cccccatgta tgctgcactg ggttctctcc ttcttcttcc tgctgtcctg cccaagaact L08; gagggtctcc tcta ctgccctggc tgcagtcagt gcccagggcg aggaatgtgg L14; ccaggggatc ctgg gatccagggc cctgggctgg acctcaggac aggcatggag L20; gccacagggg cccagcagcc caccctttcc tctccccact gcctcctctc ccttcctaca L26; gctc gagccgtcca gctgcggtgg gatctgagta tatctagggc gggtgggcgg L32; gtagcagtgc tgggcctgtg tcttgagcct ggagggagtc tgctcctgcc gccctctgcc L38; gaga cagacccatg cgctgcctgc ccaccgtgcc cctttgtccc catgtcaggc L44: ggaggcggaa ccgt gccagaggct gggcaccagc cttaaccctc ctcc ctttccccaa ggtagcacat ctggctcact tccg aggccctcat cccctgccgc tacttctctg gggaatgtgg cctctctgct cacccactct gcacccagga tcctagtccc caag aaagcaagtc tttggtctcc ctgagaagcc tgcctgtccc acctgtgccc tgccctccag cttgtattta agtccctggg 1801 ctgccccctt ggggtgcccc ccgctcccag gttcccctct ggtgtcatgt caggcatttt 1861 gcaaggaaaa gccacttggg gaaagatgga aaaggacaaa aaaaattaat aaatttccat 1921 tggccctcgg gtgagctgag ggtttttgca aggaa LOCUS NP_112243 ACCESSION NP_112243 XP_001134089 VERSION NP_112243.1 GI:13569962 1 mnpeydylfk llligdsgvg ksclllrfad dtytesyist igvdfkirti eldgktiqu 61 iwdtagqerf yyrg ahgiivvydv tdqesyanvk qwlqeidrya senvnkllvg 121 tkkv vdnttakefa dslgipflet saknatnveq afmtmaaeik krmgpgaasg 181 gerpnlkids tpvkpagggc Official Symbol: RPS6 provided by HGNC Official Full Name: ribosomal protein S6provided by HGNC Also known as: S6 LOCUS NM_001010 ACCESSION NM_001010 VERSION NM_001010.2 GI: 43 cctcttttcc gtggcgcctc ggaggcgttc agctgcttca agatgaagct gaacatctcc 6; ttcccagcca ctggctgcca gaaactcatt gaagtggacg atgaacgcaa acttcgtact 12; ttctatgaga agcgtatggc cacagaagtt gctgctgacg ctctgggtga agaatggaag 18; ggttatgtgg tccgaatcag tggtgggaac gacaaacaag gtttccccat gggt 24; gtcttgaccc atggccgtgt ccgcctgcta aagg ggcattcctg ttacagacca ; aggagaactg gagaaagaaa gagaaaatca gttcgtggtt gcattgtgga tgcaaatctg 361 agcgttctca acttggttat tgtaaaaaaa ggagagaagg ctgg actgactgat 421 actacagtgc ctcgccgcct gggccccaaa agca gaatccgcaa acttttcaat 481 ctctctaaag atgt gtat gttgtaagaa agcccttaaa taaagaaggt 541 aagaaaccta ggaccaaagc acccaagatt cagcgtcttg ttactccacg tgtcctgcag 60; cacaaacggc ggcgtattgc gaag cagcgtacca agaaaaataa agaagaggct 66; gcagaatatg ctaaactttt ggccaagaga atgaaggagg agaa gcgccaggaa 721 caaattgcga agagacgcag ctct ctgcgagctt ctacttctaa gtctgaatcc 781 agtcagaaat aagatttttt gagtaacaaa taaataagat cagactctg LOCUS NP_001001 ACCESSION 001 VERSION NP_001001.2 GI: 17158044 1 mklnisfpat vdd erklrtfyek rmatevaada lgeewkgyvv risggnqug 61 fpmkqgvlth grvrlllskg hscyrprrtg erkrksvrgc svln lvivkkgekd 121 ipgltdttvp rrlgpkrasr irklfnlske ddvrquvrk plnkegkkpr tkapkiquv 181 tprvlqhkrr rialkqutk knkeeaaeya kllakrmkea kequeqiak rrrlsslras 241 tsksessqk Official SymbolRRPl Official Full bosomal RNA processing 1 homolog (S. cerevisiae) Also known asNNP-1; NOP52; RRPlA; D21S2056E LOCUS NM_003683 ION 683 VERSION NM_003683.5 GI:134304836 1 gccggggcca ggaggaggcg ggcgcaggag gcgcgtgctc agtgtgctgg gtaccaggcg 6; actccgggac agggggtctc ggccgtngc gtcatggttt cgcgcgtgca gctcccgcct 12; gagatccagc tggctcagcg cctggcggqg aatgagcagg tgacccggga ccgggcggtg 18; aggaagctcc ggaaatacat cgtcgccagg actcagcggg ccgcaggtgg gcac 24; gacgagctgc tgaaggtgtg gaaaggactg tgca tgtggatgca ggacaagcca ; ctcctccagg aagaattagg aaggactatt tcccagctcg ttcatgcttt tcagaccacg 361 gaggcgcagc acctgttcct tcaggccttc tggcagacca tgaatcgcga gtggacgggc 42L attgacaggc tgcgcctgga ctac atgctcatgc ggatggtcct gaacgagtcc 481 ttgaaggttc tgaagatgca ggaa gaaagacaga tcgaggagct gctagagctg 541 ctgatgactg agatcctgca ccccagcagc caggccccca acggtgtgaa gagccacttc 60; atcgagatct agga gctgaccaaa gtgggcgccg aggagcttac ggcagaccag 66; aacctgaagt tcatcgaccc cttctgcaga attgctgccc ggaccaagga ttccttggtt 72; ttgaacaaca tcactcgagg catctttgag acgattgtgg agcaggcccc gcttgccatt 78; gaagacctcc tgaatgaact ggacacacag gatgaggagg tggcgtcgga cagtgatgag 84; tcctctgagg gtggtgagcg tggagacgcg ctgtcccaga agaggtctga gaagccgccc 90; gcaggctcca tctgcagggc tgaacctgag gagg agcaggcagg tgacgacagg 96; gacagtggcg ttct ccagtttgac tacgaggcag acag tgaa L02; atggccagcc gccagagcac cccttctcag aacagaaagc gtctctacaa agtgatccgg L08; aagctgcagg acctggcagg aggcattttc gatg agatcccaga gaaggcctgc L14; aggcgcctgc ttgaagggag gcggcagaag aagacgaaga agcagaagcg tctgctcagg L20; ttgcagcagg agagagggaa aggtgagaag gagcccccga gcccgggcat ggagaggaag 126; aggagcagga ggaggggtgt cgac cccgaggcgc gggcagaggc ccaggcacag ctgagcgggc cctgctccga gatcagccca ggggccgtgg ccagagaggg L38; gctcgccaga gaaggaggac acctcggccc ctgaccagtg cccgagcaaa ggcggccaat L44; gtccaggagc cggagaagaa gaagaaacgc agggagtgat gtggccgggc caaggacagg L50; cagggaggga ggccaggcct cgcttgcacc gcgggacgag gctgaccggg ctgttctgta L56; gactcaggac Cgtggctcca gaactcctgt cggg agggaagggc ggcactggag L62; agatgggccc atcattaggg gccagcatcc caggaactgg acctttcccc tccg L68; cctgtggctg tgatgacctt gggccagaag gtcaaactcc gaagactgaa actctgcctg L74; gact ggccgcccct gctgtggggg gttcagaaaa taaaatgccg cgcagccctt L80; gccagggaaa aaaaaaaaaa LOCUS NP_003674 ACCESSION NP_00367’4 VERSION NP_003674.1 GI:4503247 I lppe iqlaquagn eqvtrdravr klrkyivart qraaggfthd kgLf 6L ycmwqukpl lqeelgrtis qlvhafqtte afw qtmnrewtgi drlrldkfym 121 lmrmvlnesl kvlkmqgwee rqieellell mteilhpssq apngvkshfi eifleeltkv l8; gaeeltadqn lkfidpfcri aartkdslvl nnitrgifet iveqaplaie dllneldtqd 24L eevasdsdes seggergdal sqkrsekppa gsicraepea geeqagddrd sggpvqudy ; eavanrlfem asrqstpsqn rkrlykvirk lqdlaggifp edeipekacr rllegrquk 36; lrl qqergkgeke ppspgmerkr srrrgvgadp earaeageqp llrd 42; qprgrgqrga prpl tsarakaanv qepekkkkrr e SunnnaQ/OffidaISynflxfl:SEPTll Official Full Name: septin 11 LOCUS NM_018243 ACCESSION NM_018243 VERSION NM_018243.2 GII38605734 ggcgtggggg atgc ctgc cagcgggacg ccggcgagca gagcgcagcc 61 gcgagggagg Cgcgagggag gcgagccgga gcccgagcac tagcagcagc cggagtcggc 12; gtaaagcacc cgggcgcagc cggagccggt gccgcagctg cgatggccgt ggccgtgggg 18; agaccgtcta atgaagagct tcgaaacttg tctttgtctg gccatgtggg atttgacagc 24; ctccctgacc agctggtcaa caagtctact ggat tctgtttcaa catcctttgt ; gttggtgaga caggcattgg cacg ttaatggaca ctttgttcaa caccaaattt 36; gaaagtgacc cagctactca acca ggtgttcggt taaaagccag aagttatgag 42; cttcaggaaa gcaatgtacg gtta accattgttg acaccgtggg agac 48; cagataaata acag ctataagccg atagtagaat atattgatgc ccagttcgag 54; ctgc aagaggaatt gaagattaaa cgttctctct tcaactacca tgacacgagg 60; atccatgcct gcctctactt tattgcccct actggacatt cactaaagtc cctggatctg 66; gtcaccatga aaaagctgga cagtaaggtg aacatcattc caataattgc aaaagctgac 72; accattgcca agaatgaact gcacaaattc aagagtaaga tcatgagtga actggtcagc 78; aatggggtcc agatatatca gtttcccact gatgaagaaa cggtggcaga gattaacgca 84; acaatgagtg tccatctccc atttgcagtg gttggcagca ccgaagaggt gaagattggc 90; aacaagatgg caaaggccag gcagtacccc tggggtgtgg tgcaggttga aaat 96; cattgcgatt ttgtgaaact tcgagagatg cgcg tgaacatgga ggacttgcga 102; gagcagactc acacccgcca ctatgaattg taccgacgct gtaagcttga agagatgggg 108; ttcaaggaca ctgaccctga accc ttcagtcttc aggagacata tgaagcaaaa 114; aggaatgaat tcctgggaga actgcagaag aaagaagaag aaatgagaca tgtt 120; atgagagtga aggagaaaga agctgaactt aaggaggcag agaaagagct tcacgagaag 126; tttgaccttc taaagcggac agaa gaaaagaaga aagtggaaga caagaagaag L32; gagcttgagg aggaggtgaa caacttccag aagaagaaag cagcggctca gttactacag L38; tcccaggccc agcaatctgg ggcccagcaa accaagaaag acaaggataa tgca L44; agcttcacat tggc aagccaagga tgttcccgca ttcacctgct tttgcagtaa L50; tatcgtatct ctgccatgtg tgttctttag ttttatttta ttttatttta tttttttacc L56; cttcctcaaa caccagtaac tattattaac tgct tgtt gggtggtaga L62; aaatgataga acaagggaat aaccgcgaat tgca ctct gtttccggaa L68: agtaaatgat ttgcttttta tgcctgttct gaatggcagc acgaagcagg cctgttactt L74; gtatgtcgct ttggacagag gaaagtgggg taaaatgcta cctgtacgtc tgacatgaaa L80; cacc gcctcagcag ctgaactaaa aacctgaata gccatgacaa gagtttgcat L86; tttcttgatg attcatctcc atgagtgcac tgaa ctcactgtct ctaa gccaaggtag atttgtacgt agacagactg gtgagcaagc L98; atttttaccc ttgcatgaca ttttcatttt aatcaataac attatttggc ctgagcttgt 204; gggtctgttc agactgtctc ctctcatggt ttgaaactgc atctgaatgc ctgccttcaa 210; tcctggccaa gttggagtag actggtatga gaaaactatg attagttcac atttactggt 216; gcatccttga tcctctcaca ggtc gttg gatcatgtaa cattgcttag 222; tagaagaatc ttcttctaag gatgatgggc tttctacagc ctgcttacca ctaacagtaa 228; ggaatctttc ataaacacac ttgt tcccagtggg cttagaggga ggacctgatg 234; actgattcca ggatacttgt acttctaata acatttttca tgaatcatga gaaaatttcc 240; acagatactt cccttagaaa atttgctata aactctgtat cattggtagc acaaatttga 246; gcgaggcctt gtcaatttta aggtggaaat aggaaggacc acaacatgac ccgtaagtca 252; agaaggtaga catttcatat ccagcttcct tgcttagtct cctttcagta tttggcaata 258; aaagaaagaa gaaatagaac agctgaagtc tcaaatcatt gtctggaatt ttcctcacct 264; gctc cacctgctct ttgtctaagg cccttgcctc atcagggatt agaactggcc 270; catatgccag aacctgtact aaatgcctaa tttgtatgga agagtgcata tttaatctct 276: tttctatact gctcctttct gatgcttatc ctttcatctg tgtgattgtt ttttcccctc 2821 tactaacaag ccag ctttctctct acatgtagaa acat ttctcatgaa 288; cccactgccc ctctgcattt tcctcactgg ttagagatta agtaaatagg tatg 294; ctgcgtctcc cctgacacac actttctttt ttgaatgagc aagtctccat tttgatttca 300; gcaaagattt cttt tctttgtcct caaccatact tagaggaaag aaggaatggt 306; cttccatgaa ctgattatgc ttaattaagc aaagtaagga aattagtttc atggaagcct 312; aaacaaagct ggaatagaaa ctacacacta gacacagcag tagtcatagt aggt 318; ttaggagcta ctggaccaac attcttgttt ttgcttttgt ttttttaaat aattctagtc 324; tggagctaac tgtggagcag ccaaatagta gctggcatgt tgattcaaac ctga 330; atttgctcat aggctgtgca tcagacaaaa gcttgaatat ttgtgttgta tgcttgttcc 336; aaccaccgct tgtgtgagca tttttgtggc ttgtacagaa agtacacttt taaattgtct 342; cttgcatcac taaaattttt ttaaaatgag cataacaacg aaaggcatcc agctgacttt 348; ttgattccaa gattattgat tggattgact tttttgcatt aaatttttcc cagcaaaata 354; aatcatatgg cgagtcaggg aataaaaagt caaaagaaac aaatagaagc tttttttttt 360; aaaaaatgta ttgcttctga acttttttct gccactgctc cctagccctg tttagtttgt 366; tattgctgct tttctttttt ctttctgtat ctatgccttt ttttcacagt tggc 372; tctgcacgga ataaatgata ccctcaaatc taattggatg tgctttcgcc tttgcatgta 378; agtacggtag taagaaacct tctt tctgactttt caaaattaga aaat 384; gggatggata gatttttttt ttcttttcaa ggggggcagg aaggtaatgg tttgagtagc 390; ctttgtttaa aaaaaagact aaatatattt aaaaggccac atttatattt ttttcacaag 396; aaccacataa taaattccac ttcttgacct gaatttggaa atccgaaatt actaatccag 402; gccaggtgtg gtggctcatg cctgtaatcc cagcactttg cgag gtgggcagat 408; cacttgaggc ctggagttca cctt ggcgaacacg gtgaaacccc gtctctacaa 414; aaaatacaaa aattagccag gcgtggtggc acgtgcctgt agtcccagct acttgggagg 420; ctaagtcagg agaattgctt gaacttggga gatggaggtt gcagtgagcc aagattgcac 426; cactgcattc caacctgggt gatgaagtga gactctccaa aaaaaaaaaa gaaattatta 432; atccctgcct gtgctctaca tagcctcatg attg tcag agggcccttg 438; attctggcaa ggcaaataaa atga gaaattacca tcttctacta gagaaaacca 444; aaat ttttatgcta ggatgccttt atgaccactt aattttttaa tcttagttta 450; atggtctctc cctggtgcta actgctgaca gtggccacct tggg gattgagggg 456; cctacataac tagctggcct taccccatat cttttgttca aacataatac catctttttg 4621 ctga actttagatc tccataacac atgtactgta gaatgtgatg gaaaagcatt 468; gatgagaatt tattggcagt tgtg ttttcccaac ttaggctctt tattaattgg 474; ttaaggtttt ctccaaaaag ggcatttcaa caatgggaat atgt aacagtgggc 480; acagattact tatcttcctt ctctgctttg tgactcacca gcagtaacac acacaatcca 486; catcttgtgc acctcaaatg aacagacttg gtttccttgc tttcttgaca tttccatgac 492; tgtttcacat acaaactatt gggtgaggtt tttcagctgt taccgaccca cgtcctgctg 498; tctctgtgtg gtcctacaaa aactgtccat ccct ttgctttgcc atttgcaaga 504; gtctggaatt gtcaggtctc agcttcgaaa agtcctggtt ccactgacag gacacattct 510; ttagtgggaa ttaagaccta caaagtctag tttgtatgta ggtatgaagg gaatttttta 516; aataaattga aaagctgtga acagcattag aactttgtct atttcttaat tttaaaatat 522; gctgatatgc cttaaactgt agttgtagat ccttgtcatt ttgctgtttg aaaataacca 528: atgtgttttc taaaactgtc gtgtaatcta ctttcattgt taatgcagaa ttgtcatata 534; tgtaagctgc atgttagaca tttgtctttt ttaaactaaa gtat tgatgtgaag 540; catatcattt tttcaaatat gaaagtgatc acttagcaac atgcttggta atttggcatc 5461 tgttaaggta ggagagtggt gaacagataa tctatgcata tagt gccaagacat 552; aaagcggggg aaaatatatt tttacccaaa cattaaaaaa aaaaaaaaaa aaaa 558; aa LOCUS NP_060713 ACCESSION NP_060713 VERSION NP_060713. 1 2712 I mavavgrpsn eelrnlslsg hvgfdslpdq lvnkstsqgf cfnilcvget gigkstlmdt 6; lfntkfesdp gvrl karsyelqes nvrlkltivd tvgfgdqink ddsykpivey 12; idaqfeaqu rslf nyhdtrihac lyfiaptghs lksldlvtmk kldskvniip 18; iiakadtiak nelhkfkski mselvsnqu eet vaeinatmsv hlpfavvgst 24; eevkignkma karqypwgvv qvenenhcdf vklremlirv nmedlreqth trhyelyrrc ; fkdt dpdskpfslq etyeakrnef lgelqkkeee mrquvmrvk ekeaelkeae 36; kelhekfdll krthqeekkk elee evnnqukka aaqllqsqaq qsgaqqtkkd 42; kdkknasft al Symbol: SEPT7 and Name: septin 7 Other Aliases: CDC10, CDC3, NBLA02942, SEPT7A Other Designations: CDC10 (cell division cycle 10, S. cerevisiae, homolog); CDC10 protein homolog; septin-"f LOCUS NM_001011553 ION NM_001011553 VERSION NM_001011553.3 GI:339639595 gagatggaag ccagcctccg ctaggcccgg aagcctcgtc tgagggggcg ggggacggag 6; gcgg gagtcgagcg agagcctgtg gaggagtccg cctgctgtag cgtgcgtaag 12; caaggcagct acgccgggcg gctacgctgc ggaatcggcg taggcgcctt tggagaatcg 18; tgcg ctccgctggg gctggtcgcg gaggggggga gtcg gtcagtgcga 24; gatccgctgc ggag aggagcgtca acagcagcac catggctcaa aacc 30L ttgaaggcta tgtgggattt ctcc caaatcaagt atacagaaaa tcggtgaaga 36L gaggttttga attcacgctt atggtagtgg gtgaatctgg attgggaaag tcgacattaa 42L tcaactcatt attcctcaca gatttgtatt ctccagagta tccaggtcct tctcatagaa 481 ttaaaaagac ggtg gaacaatcca aagttttaat aggt ggtgttcagt 54; tgctgctcac aatagttgat accccaggat ttggagatgc agtggataat agtaattgct 60; ggcagcctgt tatcgactac attgatagta aatttgagga ctacctaaat gcagaatcac 66; acag acgtcagatg cctgataaca gggtgcagtg ttgtttatac ttcattgctc 72; cttcaggaca tggacttaaa ccattggata ttgagtttat gaagcgtttg catgaaaaag 78; tgaatatcat cccacttatt gccaaagcag acacactcac accagaggaa tgccaacagt 84; ttaaaaaaca gataatgaaa gaaatccaag aacataaaat taaaatatac gaatttccag 90; aaacagatga tgaagaagaa aataaacttg ttaaaaagat ccgt ttacctcttg 96; ctgtggtagg tagtaatact atcattgaag ttaatggcaa aagggtcaga ggaaggcagt L02; atccttgggg tgaa gttgaaaatg gtgaacattg tgattttaca atcctaagaa L08; atatgttgat aagaacacac atgcaggact tgaaagatgt tactaataat gtccactatg L14; agaactacag aagcagaaaa cttgcagctg tgacttataa tgat aacaacaaga L20; ataaagggca gctgactaag agccctctgg cacaaatgga agaagaaaga agggagcatg L26; tagctaaaat gaagaagatg gagatggaga tggagcaggt gtttgagatg aaggtcaaag L32; aaaaagttca aaaactgaag gactctgaag ctgagctcca gcggcgccat gagcaaatga L38; aaaagaattt ggaagcacag cacaaagaat tggaggaaaa acgtcgtcag agaaagcaaa ctgggaagct caacaacgta ttttagaaca acagaactct 1501 agaa gaaa gggaagatct tttaaactct ctattgacca ccagttaacg 156; tattagttgc caatatgcca gcttggacat cagtgtttgt tggatccgtt gttt tatccataat gatggattta acagcatgac aaaaattatt tttttttttg L68; ttcttgatgg agattaagat gccttgaatt gtctagggtg ttctgtactt agaaagtaag 174; agctctaagt acctttccta cttt ttttattaaa tctt cagtttaatg 180; caagagaaca ttttactgtt gtacaatcat gttctggtgg tttgattgtt tacaggatat 186; ataa aaggactctg gaagattttc attgaggata aattgccata atatgatgca 192; aactgtgctt gata attacaatac aaaggttcca ttcagtgcag caat 198; aatgtaattt agtctaacac agttgaccct attttttgac acttccattg tttaaaaata 204; cacatggaaa aaaaaaaaac cctatatgct tactgtgcac ctagagcttt tttataacaa 210: cgtctttttg tttgtttgtt ttggattctt taaatatata ttattctcat ttagtgccct 216; ctttagccag aatctcatta ctgcttcatt tttgtaataa catttaattt agatattttc 222; catatattgg cactgctaaa tata gcatctttca tatggtagga accaacaagg 228; aaactttcct ttaactccct ttttacactt tatggtaagt agcagggggg gaaatgcatt 234; tatagatcat ttctaggcaa aattgtgaag ctaatgacca acctgtttct acctatatgc 240; agtctcttta ttttactaga aatgggaatc atggcctctt gaagagaaaa aagtcaccat 246; tctgcattta gctgtattca tatattgcat ttctgtattt tttgtttgta ttgtaaaaaa 252; ttcacataat aaacgatgtt gtgatgtaat attgtgtgag gtcttaaata tcctacagtc 258; gatgtacaag agtagagtat gtttgggaag aaacttttca gcttaagttt gcctcctcta 264; caatgacatc ttttatatgc catt tagaatgcat atgtgctgat tttctaattt 270; aagagatacc atatctctct attcatttct atctctcatt tgtatgctta tttttctgag 276; tttt ttttccccca gtct tgcttcattg cccaggctgg gtgg 282; cacaaacacg acttgactgc agcctcaacc ctctgggctc aagcagtcct tcag 288; ccccctgagt atctgggatt gtgc accaccacgc ctggctaatt tttgtatttt 294; ttgcagcctc ccaaagttct gggattacag gcatgagccg tcatgcctgg cctctgagaa 300; cagtttctga ctcattcaga ttaggtatac tctcaagtcc ctggaaactg tttt 306; taactgtaaa gagggtagtg cttt tcttaaggtc aagtgacata gattttaatg 312; taatgcataa tttaggtaag aaattaatta atgtagccta gtttattatc ttgaaatgtt 318: ttaccctatt tactttttaa aattaatgac ctaagcggag ggaataatta taagtcaata 3241 gcagagagat tgttgtttgg gtgtttattt gttt ttgttttgag agattgggtt 330; aacacctcta gccaaaattg tttggtttta gggaggctaa caataaccta ctgaatttgg 336; aaaatgcaaa aaat gtatatagac tgcctgctga actggttaag tactactgct 342; tctgggaaat actatttcaa aattctatgt attataataa taaatttgta tcat 348; tattctacca tcctaatgaa aactttcaga agtctttctt tatccatggc atgcccaggg 354; ttttacctga atctgataca ggatctatat aactttacta ggacttttga ttgttgactc 360; caggcttagg tatatcagaa ggttcttttt tggc ctgtggatgt ctgagaagat 366; cattcacaat aaaa ttcaggtagg cctaaggaaa ggccagcctg tagaaagcaa 372; aatggcagtg tctgttctcc actgttggag gcattatgta atttaagtat cctgttagcc 378; actgtctttc ttaa gtggggctga aagc actaataata ccagtgaacc 384; acttgggcac cttgtgggta gagttttgct gccacctagt ggaatgggat atcattgctt 390: ccatatcagg ttcacaagca agttaagtgg gcacagttta tttctgtgta gctcaggctg 396; taatcttgaa agctgaggag atacccatgc ctctcagact cattagctgg gtgtcacatt 402; accacctgca cattctgacc caccgcatct gttt tgtcctcttg gagaaactag 4081 aagt caggatatgg taggtaaggg ggaaaaagga aagacggctt gatagctatg 414; aatgcatgag gagcgaaatg ttgactcagt tatctagatc atggtctcca atgc 420; ctta caaaaatatt tgttgagcat gtgtccataa ttatatgtat atga 426; aaatatgtgt caacaaatgt acac taatgtgaac attatggaac aaaatttgaa 432; agagtgaaat aaaaggttta cactttcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa LOCUS NP_001011553 ACCESSION NP_001011553 I msvsarsaaa eersvnsstm aqqknlegyv gfanlpnqu rksvkrgfef tlmvvgesgl 6; gkstlinslf ltdlyspeyp gpshrikktv qveqskvlik egqulllti vdtpgfgdav 12; qpvi dyidskfedy lnaesrvnrr qmpdnrvqcc lyfiapsghg lkpldiefmk 18; rlhekvniip liakadtltp eecqqfkkqi mkeiqehkik iyefpetdde eenklvkkik 24; drlplavvgs ntiievngkr vrgrqypwgv aevengehcd ftilrnmlir thqulkdvt ; nnvhyenyrs rklaavtyng vdnnknkgql qmee errehvakmk kmememeqvf 36; emkvkekvqk lkdseaelqr rhequknle aqhkeleekr rqfedekanw eaqqrileqq 42L nssrtleknk kkgkif Official Symbol: SH3BGRL and Name: SH3 domain binding glutamic acid-rich protein like [Homo sapiens] Other Aliases: SH3BGR Other Designations: SH3 domain—binding glutamic acid-rich-like protein; SH3-binding domain glutamic acid-rich protein like LOCUS NM_003022 2090 bp mRNA linear PR127-JUN-2012 ION NM_003022 VERSION NM_003022.2 GI:211938420 ttttagtctc cgcgtgaaaa ggtccttcat gctaatctaa ttcaattcag ttctcctttt 6L tctttcttct tctcctcgcc ctcttctcag ggaaggaatc gtcaaaaatc aatgtttcaa 12L cagatctcgc ggtgctattc gagattccct attaaaaaaa aatagaattg atgcaaacag l8; cctgttcctt ccggggtttt gggctggaac tgcagcgctt agagagctcg gtggaagctg 24L ctaaaggcgg aggcggggct ctggcgagtt ctccttccac cttcccccac ccttctctgc ; caaccgctgt ttcagcccct agctggattc cagccattgc tgcagctgct ccacagccct 36; tttcaggacc caaacaaccg cagccgctgt tcccaggatg gtgatccgtg ttgc 42; atcttcctct ggctctacag cgattaagaa acaa gatgtgcttg gtttcctaga 48; agccaacaaa ataggatttg aagaaaaaga tattgcagcc aatgaagaga atcggaagtg 54; gatgagagaa aatgtacctg aaaatagtcg caca ggttaccccc tgccacctca 60; gattttcaat gaaagccagt atcgcgggga ctatgatgcc ttctttgaag ccagagaaaa 66; taatgcagtg tatgccttct taggcttgac acct ggttcaaagg aagt 72; gcaagcaaag cagcaagcat gaaccttaag cactgtgctt taagcatcct gaaaaatgag 78; tctccattgc ttttataaaa aatt agctttgctt aaat aatg 84; ttgaaataat agattagttg caca tgcaaacatt caaaatgaat ttaa 90; aatttgaaca ttatggtgat tatggtgagg ggat attaacataa aattatatta 96; ataagtagat atcgtagaaa tagtgttgtt acctgccaag tgta tacaccaatg L02; attttacaaa gaaaacaccc ttccctcctt ctgccattac tatggcaact taagtgtatc L08; tgcagctcta cattaaaaag gagaaagaga aataacctgt ctctcattcc gcct L14: cattaatttt catgaacaag aatatgtacc tttttgatgc tatattactg gtaatgttta tgatatgtta aacgttgtaa tttcctatcg ttttgtagat gaaacttcta catattgaac cacagatttt aaatgtagcc tttcattgca catttcagtg atcagaatag atatcctttt acacgcacaa L38; aagcaataga ttcattcagt ggacaagttc cttgtttaac tacacagcta L44; atatatccaa tgcc tcagtgaaat atgcatatgt atatcatgaa agtgggatgc 150; agct taaaatggca ttctctagca aagagattag acttttaaat aactcttata 156; aaacaggttg gcgatcattt attg gtttcccttg agtttttgct aatc 162; ttagtagttt tgcccgttta aaacaactca caatcgtaaa tgctactatt cctaagatat 168; cttacctttt tatttcagtt tagccatgta gagt gtattagtct aagcagtgag 174; aatcttttct atgcctctat tccagcaaaa agtagaagta tcaaataaaa agggcaactt 180; ttaaaatatt aagcctgaag acttctaaaa agacaagaaa catggcctaa tagtaagttt cacactacct tattaccaaa agcaaacacc tcatgtatat ctattgtatg ctggtcttta ctttttgcca ataatgaaga gatgcctttg tttcatgaga ttcaaacttg atgctatgct ttaaaataaa 2041 ctcagtactt ttagaaacat aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa LOCUS NP_003013 ACCESSION NP_003013 VERSION NP_003013.1 6925 1 mvirvyiass sgstaikkkq lean kigfeekdia aneenrkwmr envpensrpa 61 tgyplppqif nesqyrgdyd affearenna vyaflgltap pgskeaevqa qua Official Symbol: SNRPB and Name: small nuclear ribonucleoprotein polypeptides B and B1 Other Aliases: COD, SNRPBl, Sm—B/B', SmB/B', SmB/SmB', snRNP-B Other Designations: B polypeptide of Sm protein; Sm protein B/B'; sm-B/Sm-B'; small nuclear ribonucleoprotein polypeptide B; small nuclear cleoprotein ptides B and B‘; small nuclear ribonucleoprotein-associated ns B and B' LOCUS NM_003091 ACCESSION NM_003091 VERSION NM_003091.3 GI:38149990 1 aggg ctagtgagct ggaccggaag taggtttcta cccgaccgca ttttacgtgg 61 tgctgcattt ccggtagcgg cggcgggaaa tcggctgtgg ggct aggcctctga 121 ggaggcgaat ccggcgggta tcagagccat cagaaccgcc accatgacgg tgggcaagag 181 cagcaagatg ctgcagcata acag gatgaggtgc atcctgcagg acggccggat 241 cttcattggc accttcaagg acaa gcacatgaat ttgatcctct gtgactgtga 301 tgagttcaga aagatcaagc caaagaactc caaacaagca gaaagggaag agaagcgagt 361 cctcggtctg gtgctgctgc gaggggagaa tctggtctca atgacagtag ctcc 421 tcccaaagat actggtattg ctcgagttcc acttgctgga gctgccgggg gcccagggat 481 ngcagggct gctggcagag gaatcccagc tggggttccc atgccccagg ctcctgcagg 541 acttgctggg ccagtccgtg gggttggcgg gccatcccaa caggtgatga ccccacaagg 60; aagaggtact gttgcagccg ctgcagctgc tgccacagcc agtattgccg gggctccaac 661 ccagtaccca cctggccgtg ggggtcctcc cccacctatg ggccgaggag caccccctcc 721 aggcatgatg ggcccacctc tgag acctcctatg ggtcccccaa tggggatccc 781 ccctggaaga gggactccaa tgggcatgcc ccctccggga atgcggcctc ctccccctgg 841 gatgcgaggc cttctttgac ccttggccac agagtatgga agtagctccg cgtg 90; ggctcgattc ctcagggcca cgttaccaca tttg tttcttatgc tcgt 96; tcat gggattgtct ggtttccctt cccc ctcccccggg aatgcgccca 102; ccaaggccct agactcatct tggccctcct cagctccctg cctgtttccc gtaaggctgt 108; acatagtcct tttatctcct tgtggcctat gaaactggtt tataataaac tcttaagaga 1141 acattataat tgc LOCUS NP_003082 231 aa linear PRI 27-JUN-2012 ION NP_003082 VERSION NP_003082.1 GI:4507125 1 mtvgksskml qhidyrmrci lngrifigt fkafdkhmnl ilcdcdefrk kqae 61 reekrvlglv 11rgenlvsm tvegpppkdt giarvplaga graa grgipangm 121 pqapaglagp vrgvggpsqq vmtpqgrgtv aaaaaaatas iagaptqypp grggppppmg 181 rgapppgmmg pppgmrppmg ppmgippgrg tpmgmpppgm rppppgmrgl 1 Official : SOD1 and Name: superoxide dismutase 1, soluble Other Aliases: ALS, ALSI, IPOA, SOD, hSodl, homodimer Other Designations: Cu/Zn superoxide dismutase; SOD, soluble; indophenoloxidase A; superoxide dismutase [Cu-Zn]; superoxide dismutase, cystolic LOCUS NM_000454 ACCESSION NM_000454 VERSION NM_000454.4 GI:48762945 gtttggggcc agagtgggcg aggcgcggag gtctggccta taaagtagtc gcggagacgg 6; ggtgctggtt tgcgtcgtag tctcctgcag cgtctggggt ttccgttgca gtcctcggaa 12; ccaggacctc gcct agcgagttat ggcgacgaag gccgtgtgcg tgctgaaggg 18; cgacggccca gtgcagggca tcatcaattt cgagcagaag gaaagtaatg gaccagtgaa 24; ggtgtgggga aaag gactgactga aggcctgcat ggattccatg agtt ; tggagataat ggct gtaccagtgc aggtcctcac tttaatcctc tatccagaaa 361 acacggtggg ccaaaggatg aagagaggca tgttggagac ttgggcaatg tgactgctga 421 caaagatggt gtggccgatg tgtctattga agattctgtg atctcactct caggagacca 48L catt ggccgcacac tggtggtcca tgaaaaagca gatgacttgg gcaaaggtgg 541 aaatgaagaa agtacaaaga caggaaacgc tggaagtcgt ttggcttgtg gtgtaattgg 60; gatcgcccaa taaacattcc cttggatgta gtctgaggcc ccttaactca tctgttatcc 66; tgctagctgt agaaatgtat cctgataaac attaaacact gtaatcttaa aatt 72; gtgtgacttt ttcagagttg agta cctgtagtga gaaactgatt tatgatcact 78; tggaagattt tttt ataaaactca gttaaaatgt ctgtttcaat gacctgtatt 84; gact taaatcacag atgggtatta aacttgtcag aatttctttg tcattcaagc 90; ctgtgaataa aaaccctgta tggcacttat tatgaggcta ttaaaagaat ccaaattcaa 96; actaaaaaaa aaaaaaaaaa LOCUS NP_000445 ACCESSION 445 VERSION NP_000445.1 GI:4507149 1 matkavcvlk gdgpvqgiin feqkesngpv kvwgsikglt eglhgfhvhe fgdntagcts 61 plsr khggpkdeer hvgdlgnvta dkdgvadvsi edsvislsgd hciigrtlvv 121 hekaddlgkg gneestktgn agsrlacgvi giaq KARS Official Sym bol: KARS Official Name: lysyl-tRNA synthetase Gene ID: 3735 sm: Homo sapiens Other Aliases: CMTRIB, KARS2, KRS Other Designations: lysRS; lysine tRNA ligase; --tRNA ligase Nucleotide seguence: NCBI Reference ce: NM_001130089.1 LOCUS: NM_001130089 ACCESSION 2 NM_001130089 l aaattaacgt actatcctcc ttacttttgg gtcgggccct ccgggaagat cgtg 6L caggcggccg aggtgaaagt cagc gagccgaaac tgagcaagaa gtggtggtaa 12; tcattagttc cagggtgctc tgccatgttg acgcaagctg ctgtaaggct tgttaggggg 18; tccctgcgca aaacctcctg gtgg aggg aactgcgact gggtcaactt 24; gctcctttca cagcgcctca caaggacaag tcattttctg gaag tgagctgaag ; agacgcctga aagctgagaa gaaagtagca gagaaggagg ccaaacagaa agagctcagt 36; gagaaacagc taagccaagc cactgctgct gccaccaacc acaccactga taatggtgtg 42; ggtcctgagg aagagagcgt ggacccaaat taca aaatccgcag tcaagcaatt 48; catcagctga aggtcaatgg ggaagaccca tacccacaca agttccatgt agacatctca 54; gact tcatccaaaa atatagtcac ctgcagcctg gggatcacct gactgacatc 60; accttaaagg tggcaggtag gatccatgcc aaaagagctt ctgggggaaa gctcatcttc 66; tatgatcttc gaggagaggg gttg caagtcatgg ccaattccag aaattataaa 72; tcagaagaag aatttattca tattaataac aaactgcgtc ggggagacat aattggagtt 78; caggggaatc ctggtaaaac caagaagggt gagctgagca tcattccgta tgagatcaca 84; ctgctgtctc cctgtttgca tatgttacct catcttcact tcaa agacaaggaa 90; acaaggtatc gccagagata cttggacttg atcctgaatg actttgtgag gcagaaattt 96; atcatccgct ctaagatcat cacatatata agaagtttct tagatgagct gggattccta 1021 gagattgaaa ctcccatgat gaacatcatc ccagggggag ccgtggccaa gcctttcatc 108; acttatcaca acgagctgga ctta tatatgagaa ttgctccaga actctatcat 114; aagatgcttg tggttggtgg catcgaccgg gtttatgaaa ttggacgcca gttccggaat 120; gaggggattg atttgacgca caatcctgag ttcaccacct gtgagttcta catggcctat 126; tatc acgatctcat ggaaatcacg gagaagatgg tttcagggat ggtgaagcat ;32; attacaggca aggt cacctaccac ccagatggcc cagagggcca agcctacgat ;38; ttca ccccaccctt ccggcgaatc gtag aagagcttga gaaagccctg ;44; gggatgaagc tgccagaaac gaacctcttt gaaactgaag aaactcgcaa aattcttgat ;50; gatatctgtg tggcaaaagc tgttgaatgc cctccacctc ggaccacagc caggctcctt ;56; gacaagcttg ttggggagtt cctggaagtg acttgcatca atcctacatt catctgtgat ;62; cacccacaga taatgagccc tttggctaaa tggcaccgct ctaaagaggg tctgactgag ;68; cgctttgagc tgtttgtcat gaagaaagag aatg cgtatactga gctgaatgat ;74; cccatgcggc agcggcagct ttttgaagaa caggccaagg ccaaggctgc tgat ;80; gaggccatgt tcatagatga aaacttctgt ctgg aatatgggct gccccccaca -86; gctggctggg gcatgggcat tgatcgagtc gccatgtttc tcacggactc caacaacatc ;92; aaggaagtac ttctgtttcc tgccatgaaa gaca agaaggagaa acac tggaaagcac aacagttggc acttctgtct agaaaataat aattgcaagt 204; tgtataactc aggcgtcttt gcatttctgc gaaagatcaa ggtctgcaag ggaattcttg 210; tgtgctgctt tccatttgac accgcagttc tgttcagcca tcagaagaga gacaaggaat 216; taaaaatttc tttttaatcc tgttaccaaa taaaaaa Protein seguence: NCBI Reference Seguence: NP_001123561.1 LOCUS: NP_001123561 ACCESSION: NP_001123561 l mltqaavrlv tswa ewghrelrlg qlapftaphk dksfsdqrse lkrrlkaekk 6; vaekeaque lsekqlsqat ttdn gvgpeeesvd irsq aihqlkvnge 12; dpyphkfhvd isltdfiqky shlqudhlt ditlkvagri hakrasggkl ifydlrgegv l8; quvmansrn ykseeefihi nnklrrgdii qvqgnpgktk ipye itllspclhm 24; lphlhfglkd ketryrqryl dlilndfvrq kfiirskiit yirsfldelg fleietpmmn ; iipggavakp fityhneldm nlymriapel yhkmlvvggi drvyeigrqf rnegidlthn 36; pefttcefym ayadyhdlme itekmvsgmv ykvt yhpdgpegqa ydvdftppfr 42; rinmveelek algmklpetn lfeteetrki lddicvakav ecppprttar lldklvgefl 481 evtcinptfi cdhpqimspl akwhrskegl terfelfvmk keicnaytel ndpmrqrqlf 541 eeqakakaag ddeamfiden fctaleyglp ptagwgmgid rvamfltdsn nikevllfpa 601 mkpedkkenv attdtlestt vgtsv KIFSB Official Symbol: K|F5B al Name: kinesin family member SB Gene ID: 3799 Organism: Homo sapiens Other s: KINH, KNS, KNS1, UKHC Other ations: conventional kinesin heavy chain; kinesin 1 (110-120kD); kinesin heavy chain; kinesin-1 heavy chain; ubiquitous n heavy chain Nucleotide seguence: NCBI nce Seguence: 521.2 LOCUS: NM_004521 ACCESSION : NM_004521 l ctcctcccgc accgccctgt acgg cggcctcagg agtgatcggg cagcagtcgg 6; ccggccagcg gacggcagag gacg ggtaggcccg gcctgctctt cgcgaggagg 12; aagaaggtgg ccactctccc ggtccccaga acctccccag cccccgcagt ccgcccagac 18; Cgtaaagggg gacgctgagg ggac gctctccccg gtgccgccgc cgctgccgcc 24; gccatggctg ccatgatgga tcggaagtga gcattagggt taacggctgc cggcgccggc ; tcttcaagtc ccggctcccc ggccgcctcc acccggggaa gcgcagcgcg gcgcagctga 36L ctgctgcctc tcacggccct cgcgaccaca agccctcagg tccggcgcgt caag 42; actgagcggc ggggagtggc tcccggccgc cggccccggc tgcgagaaag atggcggacc 48L tggccgagtg caacatcaaa gtgatgtgtc gcttcagacc tctcaacgag tctgaagtga 54; accgcggcga caagtacatc gccaagtttc agggagaaga cacggtcgtg atcgcgtcca 60; agccttatgc atttgatcgg gtgttccagt caagcacatc gcaa gtgtataatg 66; actgtgcaaa gaagattgtt aaagatgtac ttgaaggata taatggaaca atatttgcat 72; atggacaaac atcctctggg aagacacaca aggg taaacttcat gatccagaag 78; gcatgggaat tattccaaga atagtgcaag atatttttaa ttac tccatggatg 84; tgga atttcatatt aaggtttcat attttgaaat atatttggat aagataaggg 90; taga tgtttcaaag accaaccttt cagttcatga agacaaaaac cgagttccct 96; atgtaaaggg gtgcacagag gtat gtagtccaga tgaagttatg gataccatag L02; atgaaggaaa atccaacaga catgtagcag ttacaaatat gaatgaacat agctctagga L08; gtcacagtat atttcttatt aatgtcaaac aagagaacac acaaacggaa caaaagctga L14; gtggaaaact ttatctggtt gatttagctg gtagtgaaaa ggttagtaaa gctggatgaa gctaaaaaca agtc actttctgct tgag ggtagtacat atgttccata tagt gaatccttca agattcatta ggtggcaact gtagaaccac tattgtaatt catcatcata caatgagtct gaaacaaaat ctacactctt atttggccaa agggccaaaa L44; caattaagaa cacagtttgt gtcaatgtgg agttaactgc agaacagtgg aaaaagaagt L50; atgaaaaaga aaaagaaaaa aataagatcc tgcggaacac tattcagtgg cttgaaaatg L56; agctcaacag taat ggggagacgg ttga tgaacagttt gacaaagaga L62; aagccaactt tttc acagtggata aagatattac tcttaccaat ccag L68; caaccgcaat tggagttata ttta ctgatgctga aagaagaaag tgtgaagaag L74; aaattgctaa attatacaaa cagcttgatg acaaggatga agaaattaac cagcaaagtc L80; aactggtaga gaaactgaag acgcaaatgt tggatcagga ggagcttttg gcatctacca L86; gaagggatca agacaatatg caagctgagc tgaatcgcct tcaagcagaa aatgatgcct L92; ctaaagaaga agtgaaagaa gttttacagg aaga acttgctgtc aattatgatc L98; agaagtctca ggaagttgaa acta aggaatatga attgcttagt gatgaattga 204; atcagaaatc ggcaacttta gcgagtatag atgctgagct tcagaaactt aaggaaatga 210; ccaaccacca gaaaaaacga gcagctgaga tgatggcatc tttactaaaa gaccttgcag 216: aaataggaat ggga aataatgatg taaagcagcc tgagggaact ggcatgatag 2221 atgaagagtt cactgttgca agactctaca ttagcaaaat gaagtcagaa gtaaaaacca 228; tggtgaaacg ttgcaagcag ttagaaagca cacaaactga gagcaacaaa aaaatggaag 234; aaaa ggagttagca gcatgtcagc ttcgtatctc tcaacatgaa gccaaaatca 240; agtcattgac tgaatacctt caaaatgtgg aacaaaagaa aagacagttg gaggaatctg 246; tcgatgccct cagtgaagaa ctagtccagc ttcgagcaca agagaaagtc catgaaatgg 252; aaaaggagca cttaaataag gttcagactg caaatgaagt taagcaagct gttgaacagc 258; agatccagag ccatagagaa caaa aacagatcag tagtttgaga gatgaagtag 264; aagcaaaagc aaaacttatt actgatcttc aagaccaaaa ccagaaaatg atgttagagc 270; aggaacgtct agaa catgagaagt tgaaagccac agatcaggaa aagagcagaa 276; atga acttacggtt atgcaagata gacgagaaca agcaagacaa gacttgaagg 282; gtttggaaga gacagtggca aaagaacttc agactttaca caacctgcgc aaactctttg 288; ttcaggacct ggctacaaga gttaaaaaga gtgctgagat tgattctgat gacaccggag 294; gcagcgctgc tcagaagcaa aaaatctcct ttcttgaaaa taatcttgaa cagctcacta 300; aagtgcacaa acagttggta cgtgataatg cagatctccg ctgtgaactt cctaagttgg 306; aaaagcgact tcgagctaca gctgagagag tgaaagcttt ggaatcagca ctgaaagaag 312; ctaaagaaaa tgcatctcgt gatcgcaaac agca agat cgcataaagg 318; aagcagtcag gtcaaagaat atggccagaa gagggcattc tgcacagatt ccta 324; ttcgtcccgg gcaacatcca gcagcttctc caactcaccc aagtgcaatt cgtggaggag 330; gtgcatttgt cagc cagccagtgg cagtgcgagg tggaggaggc aaacaagtgt 336; aatcgtttat acatacccac aggtgttaaa aagtaatcga agtacgaaga ggacatggta 342; tcaagcagtc attcaatgac tataacctct actcccttgg gattgtagaa ttataacttt 348; taaaaaaaat gtataaatta tacctggcct gtacagctgt ttcctaccta ctcttcttgt 354; aaactctgct gcttcccaac acaactagag tttg gcatcttagg agggaaaaag 360; gacagtttac aactgtggcc ctatttatta cacagtttgt tgtc ttaaatttag 366; tctttactgt gccaagctaa ctta tgta ctttttgtat tttttgtgta 372; tgtttatttt tcag tttaaattac ctagctgcta ctgcttcttg tttttctttt 378; cctattaaaa cgtcttcctt tttttttctt aagagaaaat ggaacattta ggttaaatgt 384; ctttaaattt ttaa caacactaca tgcccataaa ccag tcagtactgt 390; attttaaaat cccttgaaat gatgatatca gggttaaaat tacttgtatt gtttctgaag 396; tttgctcctg aaaactactg cact gaaacgttac aaatgcctaa taggcatttg 4021 agactgagca aggctacttg ttatctcatg aaatgcctgt tgccgagtta ttttgaatag 408; aaatatttta aagtatcaaa agcagatctt agtttaaggg agtttggaaa tata 414; tttctctttt tcctgattct aaca agtcttgatg gaattaaaat actctgcttt 420; attctggtga gcctgctagc taatataagt attggacagg taataatttg tcatctttaa 426; tattagtaaa atgaattaag atattatagg attaaacata attttatacg gttagtactt 432; tattggccga cctaaattta tagcgtgtgg aaattgagaa agaa acaggacaga 438; atga attaaaaata tatataggtc aattttggtc tgaaatccct gaggtgtttt 444; gcta cactaatttg tacactaatt tatttcttta gtctagaaat agtaaattgt 450; ttgcaagtca ctaataatca ttagataaat tattttcttg gccatagccg ataattttgt 456; tact aagtgtatac ttgc cactttttcc tcagatgatt aaagtaagtc 462; aacagcttat tttaggaaac tgtaaaagta atagggaaag agatttcact ttca 468; tcagtggtag gggggcggtg actgcaactg tgttagcaga aattcacaga gaatggggat 474; ttaaggttag cagagaaact gttc agga tcttgctggc agaattaact 480; ttttgcaaaa gttttataca tttg tattaaattt ggagccatag tcagaagact 4861 cagatcataa ttggcttatt tttctatttc cgtaactatt gtaatttcca cttttgtaat 492; aattttgatt taaaatataa atttatttat ttattttttt aatagtcaaa tgct 498; gttgtagtct gcaacctcta aaatgattgt gttgctttta ggattgatca gaagaaacac 504; tccaaaaatt gagatgaaat gttggtgcag ccagttataa gtaatatagt taacaagcaa 510; aaaaagtgct tttt atgatgattt tgga gaaacatttg gc:gcatcca 516; cctt tatgttttgt tttcagttga aaacttgcct cctttggcaa ca:tcgtaaa 522; tgaagcagaa tttttttttc tcttttttcc aaatatgtta gttttgttct tg:aagatgt 528; atcatgggta ttggtgctgt gtaatgaaca acgaatttta attagcatgt gg:tcagaat 534; atacaatgtt aggtttttaa aaagtatctt gatggttctt ttctatttat aa:ttcagac 540; tttcataaag tgtaccaaga atttcataaa tttgttttca gtgaactgct tt :tgctatg 546; gtaggtcatt aaacacagca cttactctta aaaatgaaaa atca tc:aggatat 552; tgacacattt caatttgcag tgtctttttg actggatata ttaacgttcc tc:gaatggc 558; attgatagat ggttcagaag agaaactcaa tgaaataaag agaatattta ttcatggcga 564; ttaattaaat tatttgccta acttaagaaa actactgtgc gtaactctca gtttgtgctt 570; attt gacatgaggt gacagaagag agtctgagtc tacctgtgga atatgttggt 576; ttattttcag tgcttgaaga tacattcaca aatacttggt ttgggaagac accgtttaat 5821 tttaagttaa cttgcatgtt gtaaatgcgt tttatgttta aataaagagg aaaatttttt 588; gaaatgtaaa aaaaaaaaaa aaaaa Protein seguence: NCBI nce Seguence: NP_004512.1 LOCUS: NP_004512 ACCESSION: NP_004512 l madlaecnik vmcrfrplne sevnrgdkyi aqugedtvv iaskpyafdr qusstsqeq 6; vyndcakkiv kdvlegyngt ifaygqtssg kthtmegklh dpegmgiipr iquifnyiy 12; smdenlefhi kvsyfeiyld kirdlldvsk tnlsvhedkn rvpyvkgcte rfvcspdevm 18; dtidegksnr hvavtnmneh ssrshsifli nvkqentqte qklsgklylv kvsk 24; tgaegavlde akninkslsa lgnvisalae gstyvpyrds kmtrilqul ggncrttivi ; ccspssynes lfgq raktikntvc vnveltaeqw kkkyekekek nkilrntiqw 36; lenelnrwrn getvpideqf dkekanleaf tvdkditltn dkpataigvi gnftdaerrk 42; ceeeiaklyk qlddkdeein qqsqlveklk eell astrrdqdnm qaelnrlqae 48; evke qualeelav nyqusqeve dktkeyells delnqksatl asidaelqkl 54; kemtnhqkkr aaemmasllk dlaeigiavg nndvkqpegt gmideeftva rlyiskmkse 60; rckq lestqtesnk kmeenekela acqlrisqhe akikslteyl qnveqkqul 66; lsee lvqlraqekv hemekehlnk vqtanevkqa veqqiqshre thqkqiss;r 72; deveakakli tdlqdqnqkm mleqerlrve heklkatdqe ksrklheltv qurreqarq 78; dlkgleetva kelqtlhnlr klqudlatr vkksaeidsd dtggsaaqkq nn; e 84; qltkvhkqlv rdnadlrcel pklekrlrat aervkalesa lkeakenasr drkryqqevd 90; rikeavrskn marrghsaqi akpirpgth aaspthpsai rgggaquns qpvavrgggg 96; kqv Official Symbol: KPNA3 Official Name: karyopherin alpha 3 (importin alpha 4) m: 3839 sm: Homo sapiens Other Aliases: RP11-432M24.3, IPOA4, SRP1, SRP1gamma, SRP4, hSRP1 Other Designations: amma; importin alpha 4; importin alpha Q2; importin alpha-3; importin subunit alpha-3; importin-alpha-Q2; karyopherin t 3; qip2 Nucleotide seguence: NCBI Reference Seguence: NM_002267.3 LOCUS: NM_002267 ACCESSION : 267 l gccccgcgcc tgaggggcag taaaagtcgc caggtccggc tccatttctg aact 6; tgcagcaccg aggggttgtg gagagccctt gcaggggaag agggcagggt catcccgaga 121 accaacgqgc agcc cggcgaacgc ccaagccggt caccgccccc ggtcacgtgt l8; cgccagcctc cgcggccgcg cgccgctctc agcaccgttc cacc cggcccggca 24L gtcggcccgc gcctcccccg gcgctactgc cacctcgcgc tcggaggcgt acgt ; gctcttctct cccctccccc ctcccgctct ccccctcctc cccctcccgc tccaagattc 36; gccgccgccg ccgccgcagc agta gccgccgccg gagccgcgcg cagccatggc 42; cgagaacccc agcttggaga accaccgcat caagagcttc aagaacaagg gccgcgatgt 48; ggaaacaatg Cgaagacata gaaatgaagt gacagtggaa ctgcggaaga acaaaagaga 54; tgaacactta aaga gaaatgttcc ccaagaagaa agtctagaag attcagatgt 60; tgat tttaaagcac aaaatgtaac cctagaagct atattgcaga atgccacaag 66; tgataaccca gtggtccaat tgagtgctgt ccaggcagca agaaaactgt tatccagtga 72; cagaaatcca ccgattgatg acttaataaa atctgggatt ttaccaattc aatg 78; tctagaaagg aatc cttcattaca gtttgaagct gcttgggcat taactaacat 84; agcatcagga gcac agactcaagc tgttgtgcag tctaatgcag tttt 90; actt cttcgttcac cacatcagaa tgtttgtgaa caagcagtat gggctttggg 96; aaacattata ggtgatggtc ctcaatgtag agattatgtc atatcactgg gagttgtcaa L02; acctcttctg tccttcatca gtccctccat ccccatcacc ttccttcgga acgtcacatg L08; ggtcattgtc aatctctgca ggaataagga tcccccgccg cctatggaga cagttcagga L14; gattttgcca gctttatgtg tata ccatacagat ataaacattc ttgtagacac L20; tgtttgggct ctgtcatact tgacagatgg aggtaatgaa cagatacaga gtgccctttc ttgtgcccct tctgagccat caggaagtca agcagccctc agagcagttg gcaacatagt gactggcacc gacgagcaga cccaggttgt L38; tctcaattgt gatgtcctgt cacacttccc aaatctctta tcacacccaa aagagaagat L44; aaataaggaa gcagtgtggt tcctttccaa cataacagca ggcaaccagc aacaagttca L50; aata gatgctggat taattcctat gataattcat cagcttgcta agggggactt L56; tggaacacaa gctg cttgggcaat cagcaactta acaataagtg gcagaaaaga L62; tcaggttgag taccttgtac agcagaatgt aataccaccg ttctgtaatt tactgtcagt L68; gaaagattct caagtggttc aggtggttct tcta aaaaacattc tgataatggc L74; cggtgatgaa gcaagcacaa tagctgaaat aatagaggaa tgtggaggtt aaat L80; tgaagtttta cagcaacatg aaaatgaaga catatataaa ttagcatttg aaatcataga L86; tcagtatttc tctggtgatg atattgatga agatccctgc ctcattcctg aagcaacaca L92: aggaggtacc tacaattttg atccaacagc caaccttcaa acaaaagaat ttaattttta L98; aattcagttg gcat caca ttcaatatga agcaccacca gatggctacc 204; aaatgataag aacaacagca acaaaaggct ccaaaacaca catgcctctt tgttttgatg 210; cttctaaagc aagccatgtc tcagtcactt tgcagttgcc cact atcacatgga 216; ctgtaaatgc atatgcatga tttcctaaac tgttttagaa ctctccttaa caatctcaac 222; taccctattt ttccctgttc cctggtgcca caggctgaca actgcagtct ccagtttaga 228; ataaatattc catagtggtg acatgtcagc tgcccactga tactcctttg gaaaatggtg 234; cgctgtggat caagacactt tggtatgatg catatacaag ttggaagact aaagaggtgc 240; agtgtgatct gagcctccat cattgtcctc cata ttttcatatt ctttatgtgg 246; agat tttaaagtac aatg attttcattg gtggaactga aaaa 252; gtaacttaaa aacaagaaac ttggttattg aataaacaga taagtttaaa aaaaaaaaaa 258; aactacttca agta attgatgtgt ttattatctg cctcagaagc tgga 264; ggaagaactt tagatatgga tattaatgct tttgccatta tacctaattt ttgagaacag 270; caagccctat ttgaccactc tcttcagcct gtgtgttcct gctgttttga agtaatcaaa 276; tgctgtgcat ggtattttac ctgagctgca acctgttatg gacttgaact tctgtttaag 282; ttgaaagcaa ctga gtataaagga aaaacagcaa aacaaaaagc aaacaaaaaa 288; aaactgcaaa agtctaaaat tggt gatgtttttt atct tgctttcagc 294; tttcaggagt taatattctt tgttttaatt tgataattgg atatggttga tttatattgg 300: gtttaaactg tggagctttc atgtttactg taatttagtc ttaaaatatt ttag 3061 taaccagtgc ttttgataat gtggttggca acaaaccagc ttag aagtgtcata 312; catt ctttgagtat tgggaaagtt aattcagatc ctactcaaaa agcatcttca 318; catattaaaa gattcagaca gggatctgtg tagaggagta atttgcagtt atttaacata 324; aacctgattt gcagtgatct ctaagtaatt ctgcaaaatc cggtattact atgtcaagtt 330; attgcttttg gtaaattgtc tgacccagtt attaatgaaa gaatatggat ttaaaaattt 336; ttaaactaaa taatttgtgc tgtcacagaa atggtattgt tgctcttgtt tactgggtat 342; aatttcccaa tgcattgatg tgaagggata gaaaatctaa actaatttag ttatccattg 348; gggggtgt at ttactgtgat gaagatgaga cagatgccat cagagctttg tgaatcagct 354; ggggtgtttt cactgataaa caacacatag caggtgtgca ttcattacaa atatatgtat 360; ctgccaaggt ggagccactt taaagagtga gttttgtctt aaag tggatacaag 366; cgtatgttta aactgcaaga tttttacttg ctagagaatc tgttttaata tagtggtttg 3721 gcctctgatt atttataggt tttataaatt ttagaatcaa ttta aggtggctca 3781 gatttttcaa ctcttgtgca cataaaattg agttgaagtt cattgtgcct cttt 3841 atccaaattt tgagttaaag cttcatatgg taactgcatc ctgttcggac actatagtct 3901 aaatttttga aactgtgtgg tgttcgctaa gaat gtaa aagctaatta 396; aggtcacaaa cttcggtgaa acccttaaaa gtccaaatct tatt cgta 402; ccccttccag ttct cttt ccttacttaa ctgacagtta ccttttaaaa 408; tttgcacaca ttatgattaa aattgggcct ctactgtgat gattcctatt catg 414; agtg caaactaaca tttaagtgaa cattagcatc aagtagtgca gacacttgta 420; tgcatttcct tgattcaatt tgtgacctta ccagttttga attg caccatttcg 426; tagataaagg aaactaagta tattgctgca cttttaagtt ttcaaaacag tgtttaaaaa 432; ttgcattgtt attttttttt aaactcagtt taaaaagact aaaacgttct agag 438; gcatctaaat gtgttcctaa ttttgtatat gggcttaggt tttgtaacca ataaaaaaag 444; ctgctatcaa atatgataaa acattgaaaa cttaaaaa Protein seguence: NCBI Reference Seguence: NP_002258.2 LOCUS: NP_002258 ACCESSION: NP_002258 1 maenpslenh riksfknkgr dvetmrrhrn evtvelrknk kkrn queesleds 61 dvdadfkaqn vtleailqna tsdnpvvqls avqaarklls sdrnppiddl iksgilpilv 121 kclerddnps queaawalt niasgtsaqt qavvqsnavp 1f1r11rsph qnvceqavwa 181 lgniigdgpq crdyvislgv Vkpllsfisp sipitflrnv twvivnlcrn kdppppmetv 241 lcvl iyhtdinilv dtvwalsylt dggneqiqmv idsgvvpflv pllshqevkv 301 qtaalravgn ivtgtdeqtq vvlncdvlsh fpnllshpke vwfl snitagnqqq 361 vqavidagli pmiihqlakg dfgtqkeaaw aisnltisgr kdqveylqu nvippfcnll 421 vqu vldglknili magdeastia eiieecggle kievquhen ediyklafei 481 idqyfsgddi dedpclipea tqggtynfdp tanlqtkefn LGALS1 Official Symbol: LGALS1 Official Name: lectin, galactoside-binding, soluble, 1 Gene ID: 3956 Organism: Homo sapiens Other s: GAL1, GBP Other Designations: 14 kDa laminin-binding protein; 14 kDa lectin; HBL; HLBP14; HPL; S-Lac lectin 1; alactoside-binding lectin Ll; beta- oside-binding protein 14kDa; gal-1;galaptin; galectin 1; galectin-1; lactose-binding lectin 1; putative MAPK-activating protein PM12 Nucleotide sequence: NCBI nce Seguence: NM_002305.3 LOCUS: NM_002305 ACCESSION : NM_002305 1 agttaaaagg gcgt ccgggggccc atctctctcg ggtggagtct tctgacagct 6; ggtgcgcctg cccgggaaca tcctcctgga ctcaatcatg gcttgtggtc tggtcgccag 12; caacctgaat ctcaaacctg gagagtgcct tcgagtgcga ggcgaggtgg ctcctgacgc 18; cttc gtgctgaacc tgggcaaaga cagcaacaac ctgtgcctgc acttcaaccc 24; tcgcttcaac gcccacggcg acgccaacac catcgtgtgc aacagcaagg acggcggggc ; ctgggggacc gagcagcggg aggctgtctt tcccttccag cctggaagtg aggt 36L gtgcatcacc ttcgaccagg ccaacctgac cgtcaagctg ccagatggat acgaattcaa 421 caac cgcctcaacc tggaggccat caactacatg gcagctgacg gtgacttcaa 48; gatcaaatgt gtggcctttg actgaaatca gccagcccat ggcccccaat aaaggcagct 54; gcctctgctc aaaa aaaaaaaaaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: NP_002296.1 LOCUS: NP_002296 ACCESSION: NP_002296 1 macglvasnl clrv rgevapdaks fvlnlgkdsn nlclhfnprf nahgdantiv 61 cnskdggawg teqreavfpf qusvaevci tquanltvk fkfp nrlnleainy 121 maadgdfkik cvafd MACF1 Official Symbol: MACF1 Official Name: ubule-actin crosslinking factor 1 m: 23499 Organism: Homo sapiens Other Aliases: ABP620, ACF7, MACF, OFC4 Other Designations: 620 kDa actin binding n; actin cross-linking family protein 7; macrophin 1; microtubule-actin cross-linking factor 1; trabeculin-alpha Nucleotide seguence: NCBI Reference Seguence: NM_012090.4 LOCUS: NM_012090 ACCESSION : NM_012090 024 1 ggag ccgctcccct cggctccgcc acgctcccct cgactgcgct ccagcctggg 61 gcgcgcccgg ccgccgccgc cttcgctgcc gccacgggcc cttc ctccttcggc 121 tcccaggatg aagaaactga gtctcagaga ggtgaagtga cttgcccaag atcacagcaa 181 ttatcacttc tccctgggct cccaggccct cctgcagcag cccccgcctg ggccatgtct 24; tcctcagatg aagagacgct cagtgagcgg tcatgtcgga gtgagcggtc ttgtcggagt ; gagcgatctt acaggagcga gcggtcgggg agcctgtctc cctgtccccc cacc 36; ttgccctgga acctgccact gcatgagcag aaaaagcgga aaagccagga ttcggtgctg 42; gaccctgcag agcgtgctgt ggtcagagtc gctgatgaac gggaccgggt tcagaagaaa 48; acgttcacca agtgggtcaa caagcactta atgaaggtcc gcaagcacat caatgatctt 54; tatgaagatc tgcgggatgg ccataacctg atctctctgt tggaggtcct ctcaggcatc 60L aaactgcccc gggagaaggg caggatgcgt tttcataggc tgcagaatgt gcagattgcc 66L ctggacttcc taaagcagcg acaggtgaaa ctagtgaata ttcgcaatga tgacatcaca 72L gatggcaacc ccaagttgac cctgggtctg atctggacca tgca gatc 78L tctgacatct acattagtgg agaatcaggg gatatgtcag ccaaggagaa cctg 84; caga aggtgacagc tggttacaca ggaatcaaat gcaccaactt ttcctcctgc 90; tggagtgatg ggaagatgtt caatgcactc attcaccgat accgacccga tctagtagac 96; atggagaggg tgcaaatcca aagtaaccga ctgg cttt tgaagtggca L02; gaaagactgg gggtcactcg ggat gcagaagatg tggatgtgcc atctccagat L08; gaaaagtctg taatcactta tgtgtcttcg atttatgatg ccttccctaa agttcctgag L14; ggtggagaag ggatcagtgc tacggaagtg gactccaggt aata ccaaagccga L20; gtggactccc tcattccctg gatcaaacag catacaatac tgatgtcaga taaaactttt L26; ccccaaaacc ctgttgaact aaaggcactt caat atatacactt caaagaaaca L32; gaaattctgg ccaaggagag agaaaaagga agaattgagg aattatataa attactagag L38; gtgtggattg aatttggccg aattaaactg cctcaaggtt atcaccctaa tgatgtggaa L44; gaagagtggg gaaagctcat catagagatg ctggaacgag agaaatcact ggct L50; gtggagaggc tggaattgct gctacagatt gcaaacaaaa tccagaatgg tgctttgaac L56; tgtgaagaaa aactgacact agctaagaat acactgcagg ctgatgctgc tcacctggaa L62; tcaggacaac cggtacaatg tgagtcagat gtcattatgt agga gtgtgaaggt L68; aggc agctgcaggt ccag cggg atgagaatta ctaccagcta L741 gaagagctgg cttttagggt catgcgtctt gagc tggtcacctt gcgtctagag L80; tgtacaaacc tgtaccggaa gggtcatttc acttcacttg aattggttcc ttaaccacca ctcatctgaa agcagaaccc ttaaccaagg caacccattc ttcc gaaagcctat gactcgggct gaacttgtgg ccatcagctc gaaggcaatc tccgatttgt gtatgaacta ctgtcttggg tagaagagat gcagatgaaa 204; ctggagcgag gggg caatgacctg cctagtgtgg agttgcagct agaaacacag 210; cagcacatcc atacgagtgt agaagagctg ggctcaagtg tcaaggaggc caggttgtat 216; gagggaaaga agaa tttccatacc agctatgctg aaactcttgg ggag 222; acacagtatt tgaa ggaaacttct agcttccgga tgaggcacct tcagagcctg 228; cataaatttg tttccagagc tacagctgag ttgatctggt tgaatgagaa ggaggaggag 234; gaactagcat atgactggag tgacaacaat tccaatatct cagccaagag aaattacttc 240; tctgagttga caatggaact ggaggagaaa caggatgtgt ttcgttctct acaagataca 246; gcagaactac tttcacttga gaaccaccca gccaagcaga cagtggaggc ttacagtgct 252; gctgtccagt cccagttgca gtggatgaag cagctgtgcc tgtgtgttga gcagcatgtg 258; aaagagaata ctgcttattt tcagttcttc agtgatgcac gagagctgga gtcattcttg 264; aggaacctcc aagattccat taaacgaaaa tattcctgtg accacaacac cagcttatcc 270; cgccttgaag acctgctcca ggactccatg gatgaaaagg agcagcttat acagtccaag 276; agttccgttg ccagtctcgt tgggagatca aaaaccatcg ttcagctaaa cagt 282; catg tgttaaagaa caccatttct gtcaaggctg tctgtgacta caggcagatc 288; gagattacta aaaa tgatgaatgt gtgctagaag ataattctca gcggaccaaa 294; tggaaagtga tcagccccac cgag gcaatggtgc cgtcagtctg cttcctcatc 300; cccccaccca ataaggatgc cattgagatg aggg aatc gaag 306; gttatggccc tttggcatca gctgcatgtt aacaccaaaa gccttatctc ttggaactat 312; ctgcgtaaag accttgacct tgtacagacc tggaacctag aaaagcttcg atcctcagca 318; ccaggggagt gccatcagat tatgaagaac cttcaggccc actatgaaga gcag 324; gatagtcgtg tgct gttctcagtg gctgatcgct tgga agaggaggtg 330; tgta aagcccgctt ccagcacctg tcca tggagaatga ggacaaagag 336; gagactgtgg ccaagatgta catttcagag ttgaagaaca tccggctacg cctggaggag 342; tatgaacaga gggtggtcaa acgaattcag tctctagcca gctctaggac tgacagagat 348; gcctggcagg acaatgcatt aaggattgca gagcaagagc acacccagga ggatttacag 3541 caattgaggt cagacttgga tgcagtttct atgaaatgtg acagctttct ccatcagtct 360; ccatctagtt caagtgtccc aactctgcgc tcagaactga atctgctggt ggagaagatg 366; gaccatgtct atggtctctc tactgtatat ctgaataagt taaagacagt tata 372; agca tacaggatgc tgaactcttg gtcaaaggtt atgagattaa gctgagtcaa 378; gaagaagtag tactggcaga tctctcagct ctggaggccc attggtcgac attacggcac 384; tggcttagtg agga caagaattca gtgttttcag tcctggatga ggaaattgcc 390; aaggccaagg tagtggcaga gcagatgagt cgtctgacac gaaa tctggatttg 396; gagcgctatc aggaaaaagg ctcccagctg cgtt gagt cattgcccag 402; ctcgagattc gccaatctga gctagaaagt atccaggaag ttctgggaga ttaccgagcc 408; tgccatggaa ctctcatcaa gtggattgag gaaaccactg cccagcagga aatgatgaag 414; ccaggccagg cagaggatag cagagtgctt tcggagcagc tcagccagca gacggcccta 420; tttgcagaaa ttgagagaaa tcagacaaaa ctggatcaat gtcaaaaatt ttcccagcag 426; tactctacta ttgtaaagga ctatgaattg caactgatga catacaaggc ctttgtggaa 432; tcgcagcaga aatcccctgg caagcgccgt cgcatgcttt cctcttcaga tgccatcact 438; caagagttca tggacttaag ctac acggcattgg tgactttaac aactcagcac 444; gtgaaataca tcagtgatgc actccggcgt ctggaggagg aggagaaagt ggtagaagag 450; gagaaacaag aacatgtgga gaaggttaaa gaacttttgg gctgggtgtc taccctagcg 456; acac aaggaaaagc tacctcatcc gagaccaaag aatcaacaga cattgaaaaa 462; gctattttgg aacagcaggt tctgtcagaa gagctgacaa caaagaaaga ctct 468; atta aaacatcaca gatcttcttg gccaagcatg gtcataagct ctcagaaaaa 474; gagaagaaac aaatatctga gcaattgaat gccctaaaca aggcttacca tgacctttgt 480; gatggttctg caaatcagct tcagcagctt cagagccagt tggctcacca gacagaacaa 486; aagaccctgc aaca aaatacctgt caac atct ttgcagttgg 492; gtaggacagg cagaaagagc actggcaggc caccaaggca gaaccaccca tctc 498; tctgctttgc acca aagtgacttg aaggatttac aggatgacat tcagaatcgt 504; gccacctcat ttgccactgt tgtcaaggac attgaggggt tcatggaaga gaatcagacc 510; aagctgagcc cacgtgagtt gacagctctt cgggaaaagc ttcatcaggc taaggagcaa 516; tatgaggcgc tccaggaaga gacacgtgtg aagg aactggagga agcagtgacc 522; tccgccttac agcaggagac tgaaaagagt aaagcagcaa aggaactggc agagaacaag 528; aagaagatcg atgctctcct ggattgggta acttcagtag gatcatctgg tggacagctg 534; aacc ttccaggaat ggagcagctc tcgggagcta gcttggagaa aggagccttg 540; gacaccactg atggttacat gggggtgaat caagccccag agaaactgga caagcaatgt 546; gagatgatga aggcccgtca attg ctgtcccagc agcaaaattt cattctggcc 552; acccagtcag ctcaggcctt cttggatcag catggccaca atctcacacc tgaggagcaa 558; cagatgctgc aacagaagct gggagagcta caat actctacttc cctggcccaa 564; tcagaggcag aactgaagca ggtgcagaca gatg agttgcagaa atttctgcag 570; gatcataaag aaag ctggttggaa cgatccgaga aagagctgga gaacatgcat 576; aagggaggca gcagccccga gacccttccc tccctgctaa agcggcaagg aagcttctca 582; gaggatgtca acaa aggagacttg agatttgtga ctatctcagg acagaaagtc 588; ttggacatgg gttt taaggaaggc aaagaaccat cagaaattgg aaacttagta 594; aagt tgaaggatgc aacagaaaga tacactgctc tccactcaaa gtgtacacga 600; ttaggatctc acctgaatat aggc cagtatcatc aaaa cagtgctgac 606; agcctgcagg cctggatgca ggcttgtgag gccaacgtgg agaagctcct ctcagatact 612; gttgcctctg accctggagt tctccaggag cagcttgcaa caacaaagca gttgcaggag 618; gaattggctg agcaccaagt acctgtggaa aaactccaaa aagtagctcg tgacataatg 624; gaaattgaag gggagccagc cccagaccac aggcatgttc aagaaactac agattccata 630; ctcagccact tccaaagcct ctcctatagc ctggctgagc gatcttctct gctgcagaaa 636; gcaattgccc aatctcagag tgtccaggaa gaga gcctgttgca gtctattggg 642; gaac tgga agggaagcag gtgtcatcac tctcatcagg agtcatccag 648; gaagccttag ccacaaatat gaaattgaag caggacattg ctcggcaaaa gagcagcttg 654; gaggccaccc gtgagatggt gacccgattc atggagacag gtac tacagcagca 660; gtgctgcagg gcaaactggc agaggtgagc cagcggttcg aacagctctg tctacagcag 666; caagaaaagg agagctccct aaagaagctt ctaccccagg cagagatgtt cctc 672; aagc tgcagcagtt catggaaaac aaaagtcgga tgctggcctc tggaaatcag 678; ccagatcaag atattacaca tttcttccaa cagatccagg agctcaattt ggaaatggaa 684; gaccaacagg taga tactcttgag cacctggtca ctgaactgag ctcttgtggc 690; ctgg acttgtgcca gcatcaggac agggtacaga atctaagaaa agacttcaca 696; gagctacaga agacagttaa agagagagag aaagatgcat catcttgcca ggaacagttg 702; gatgaattcc ggaagctggt caggaccttc cagaaatggt tgaaagaaac tgaagggagt 708; attccaccta cttc tatgagtgct aaagagttag aaaagcagat tgaacacctg 714; aagagtctac tagatgactg ggcaagtaag ggaactctgg tggaagaaat caattgcaaa 720; tctt tagaaaatct catcatggaa atcacagcac ctgattccca aggcaagaca 726; ggttccatac tgccctctgt aggaagctct agtg taaacggata ccacacctgc 732; aaagatctga cggagatcca gtgtgacatg tcagatgtaa acttgaagta tgagaaacta 738; gggggagtac ttcatgaacg ccaggaaagc cttcaggcta tcctcaacag aatggaggag 744; gttcacaagg aggcaaactc tgtgctgcag tggctggaat caaaagagga agtcctgaaa 750; tccatggatg ccatgtcatc tccaaccaag acagaaacag tgaaagccca agctgaatct 756; aacaaggcct tcctggctga gttggaacag aattctccaa aaattcaaaa agtaaaggaa 762; gccctggctg gattactggt gacatatccc aactcacagg aagcagaaaa ttggaagaaa 768; attcaggaag aactcaattc ccgatgggaa agggccactg aggttactgt ggctcggcaa 774; aggcagctag aggaatctgc aagtcatctg gcctgcttcc aggctgcaga atcccagctc 780; cggccgtggc tgatggagaa agaactgatg atgggagtgc tggggcccct gtctattgac 786; cccaacatgt tgaatgcaca acag gtccagttta tgctaaagga atttgaagca 792; cgcaggcaac agcatgagca actgaatgag gcagctcagg gcatcctaac aggccctgga 798; gatgtctctc tgtccaccag ccaagtacag aaagaactcc agagcatcaa tcagaaatgg 804; gttgagctga ctgacaaact caactcccgt tccagccaaa aagc tattgttaag 810; agcacccagt accaggaact ggac ttatcagaga aggtgagggc acaa 816; ngctgagtg tccagtcagc tatcagcacc caaccagagg ctgtaaagca gcaattggaa 822; gagaccagtg gatc tgacttggag cagttagacc acgaggttaa tcag 828; acactgtgcg atgaactctc agtgctcatt ggtgagcagt agga gaag 834; aagcgtttgg agacagttgc cctgcctctc caaggtttag aagaccttgc agccgatcgc 840; attaacagac tccaggcagc tcttgccagc acccagcagt tccagcaaat gtttgatgag 846; ttgaggacct ggttggatga taaacaaagc cagcaagcaa aaaactgccc aatttctgca 852; aaattggagc ggctacagtc tcagctacag gagaatgaag agaa aagtcttaat 858; caacacagtg gctcctatga ggtgattgtg gctgaagggg aatctctact tgta 864; cctcctggag aagagaaaag gactctacaa aaccagttgg ttgagctcaa aaaccattgg 870; gaagagctta gtaaaaaaac caga caatccaggc tcaaggattg tatgcagaaa 876; gctcagaaat atcagtggca tgtggaagac cttgtgccat ggatagaaga ttgtaaagct 882; aagatgtctg agttgcgagt cactctggat ccagtgcagc tagagtccag tctcctaaga 888; tcaaaggcta tgctgaatga ggtggagaag cgccgctccc tgctggaaat attgaatagt 8941 gctgctgaca ttctgatcaa ttcttcagaa gcagatgagg tccg ggatgagaag 900; gctgggatca accagaacat ggatgctgtt acagaagagc tgcaggccaa aacagggtca 906; ctcgaagaaa agag gctcagggag ttccaggaaa gctttaagaa aaag 912; gaag gagccaaaca ccaacttgag atctttgatg gttc tcaagcctgt 918; agcaacaaga acctggagaa gctaagagct gaag tgctgcaggc cctagagcct 924; caggtagact atctgaggaa ctttactcag ggtctggtag aagatgcccc agatggatct 930; tctc aacttctcca ccaagctgag gtcgcccagc aagagttcct cgaagttaag 936; caaagagtga acagtggttg tgtgatgatg gaaaacaagc tggaggggat tggccagttt 942; cactgccggg tccgagagat gttctctcaa ttggcagacc atga gctagatggc 948; atgggtgcta ttggcagaga cactgatagc ctccagtccc aaatcgagga gcta 954; ttccttaaca aaattcacgt cctcaaatta gagg cctctgaagc tcga 960i catatgctag aagaagaggg gactctggat ttgttaggtc tcaaaaggga agcc 966; aaac agtgtggcaa actgacagag agggggaaag ctcgtcagga acagctggaa 972; ctag gccgtgtaga ggacttctac aggaaattga aaggactcaa tgacgcgacc 978; acagcagcag aggaggcaga ggccctccag tgggtagtgg ggaccgaagt ggaaatcatc 984; aaccaacaat tagcagattt taaaatgttt cagaaagaac aagtggatcc tcttcagatg 990; aaattgcagc aggtgaatgg acttggccag ggattaattc agagtgcagg aaaagactgt 996; cagg gtttagaaca tgacatggaa gagatcaatg ctcgatggaa tacattgaat L002; aaaaaggtcg cacaaagaat tgcacagcta caggaagctt tgttgcattg tgggaagttt L008; caagatgcct tggagccatt gctcagctgg gata ccgaggagct catagccaat L014; cctc catctgctga agtg gtgaaagcac agatccaaga acagaagttg L020; cggc tcctagatga tcgaaaggcc acagtagaca tgcttcaagc aggc L026; agaatagccc agtcagcaga gctggctgat agagagaaaa tcactggaca gctggagagt L032; cttgaaagta gatggactga actactcagt aaggcagcag ccaggcaaaa acagctggaa L038; gacatcctgg ttctggccaa acagttccat gagacagctg agcctatttc tgacttctta L044; tctgtcacag agct tgctaactca gaacctgttg gcactcagac tgccaaaata L050; cagcagcaga tcattcggca caaggctctg gaagaagaca tagaaaacca tgcaacagat L056; gtgcaccagg cagtcaaaat tgggcagtcc ctctcctccc tgacatctcc tgcagaacag L062; ggtgtgctgt agat agactcattg caggcccgat acagtgaaat aggcagccct acttgaccaa gctctgtcta atgctaggct aggtgctcaa ctggctggct gaggttgagg tcag L080; gtaaaggatt tcaaacagga tgtcctgcac aggcagcatg ctgaccacct ggctttaaat 1086; gaagaaattg ttaatagaaa gaagaatgta gatcaagcta ttaaaaatgg ctaaaacaaa ccacaggtga ggaggtgtta cttatccagg aaaaactaga actcgttacg cagacatcac agttactagc tccaaggccc tcagaacttt agagcaagcc LL04; ngcagctgg ccaccaagtt ccagtctact gaac tgaccgggtg gctgagggag LL10; gtggaggagg agctggcaac cagtggagga cagtctccca caggggaaca gataccccag LL16; tttcagcaga gacagaagga attaaagaag gaggtcatgg agcacaggct ggac LL22; acagtgaatg aggtgagccg tgctctctta gtgc cctggagagc cagagaaggg LL28; ctggataaac ttgtgtccga tgctaacgag cagtacaaac tagtcagtga cactattgga LL34; caaagggtgg atgaaattga tgctgctatt tcac aacagtatga gcaagctgcc LL40; gatgcagaac tagcttgggt tgctgaaaca aaacggaaac tgatggctct gggtccaatt LL46; cgcctggaac aggaccagac cacagctcag cttcaggtac agaaggcttt LL52; attattcgac acaaagattc tgaa ctcttcagtc accgtagtga aatctttggc 1158; acatgtgqgg aggagcaaaa aactgtatta caggaaaaga cagagtctct tatgaagcca ttagcctact caattcagag cgttatgccc gcctagagcg ggcccaggtc LL70; ttagtaaacc agttttggga aacttatgaa gagctcagcc cctggattga tcgg LL76; gcactaatag cacagttacc ctctccagcc attgatcatg agcagctcag gcagcaacaa LL82; gaggaaatga ggcaattaag ggaatctatt gctgaacaca aacctcatat acta LL88; ctaaagatag gcccacaact aaaggaatta aaccctgagg aaggggaaat ggtggaagaa LL94; aaataccaga aagcagaaaa catgtatgcc caaataaagg tgcg ccagcgagcc L200; ctggctctgg ccgt gtcccagtcc acacagatta cagagtttca tgataaaatt L206; gagcctatgt tggagacact ggagaatctt tcctctcgcc tgcgtatgcc accactgatc L212; cctgctgaag agat cagagagtgc atcagtgaca ataagagtgc caccgtggag L218; ctagaaaaac tgcagccatc ctttgaggcc ttgaagcgcc gtggagagga gcttattgga L224; cgatctcagg gagcagacaa ggatctggct gcaaaagaaa tccaggataa attggatcaa L230; atggtattct tctgggagga agct cgggctgaag aacgagaaat caaatttctt L236; gatgtccttg aattagcaga ctgg tatgacatgg tcct gaccaccatc L242; aaagacaccc aggatattgt ccatgacttg gaaagcccag gcattgatcc ttgaagctgc tgagactatt gaga cagatggtct ttcggatcct tggagcagat tttg cctgtggaga ggaagagcat tgatgagatg aataatgctt gggagaactt ggctagaaaa acttgaggat gctatgcaag ctgctgtgca ctatgtttga ctggctagat aacactgtga ttaaactctg cctgttggca ctgacctcaa tactgttaaa gatcagttaa atgaaatgaa ggagttcaaa L284; gtagaagttt agca aattgagatg gagaagctta atcaccaggg tgaactgatg L290; ttaaagaaag ctactgatga gacggacaga gacattatac gagaaccact gacagaactc L296; aaacacctct gggagaacct gggtgagaaa cacc gacagcacaa actagaaggg L302; gctctgttgg cccttggtca gttccagcat gccttagagg aactaatgag ttggctgact L308; cataccgaag agttgttaga tgctcagaga ccaataagtg caaa agtcattgaa L314; gttgagctcg caaagcacca tgtcctaaaa aatgatgttt tggctcatca acaaagctgg caatgagctt cttgaatcca gtgctggaga gccgtttgga agccatgaac caatgctggg agtcagtgtt gaggagaggg agcagcagct tcagtcaact ctgcagcagg cccagggctt attgaagatt tcctcttgga acttactaga atggagagcc agctttctgc acaggaggac ttcctgaaac tgctagggaa cagcttgata cacatatgga actctattcc L350; cagctgaaag aaga gacttataat cttg gcag ctaagccgtg acgactctgg gtctggctcc aagacagaac tagc acttttggag L362; tggc atgtggtcag cagtaagatg agaa agtcaaagct ggaagaggcc L368; ctcaacttgg caacagaatt ccagaattcc ctacaagaat ttatcaactg gctcactcta L374; gcagagcaga gtttaaacat Cgcttctcca ctga ttctaaatac tgtcctttcc L380; cagatagaag agcacaaggt ttttgctaat gaagtaaatg ctcatcgaga ccagatcatt L386; gagctggatc aaactgggaa tcaattaaag ttccttagcc aaaagcagga tgttgttctg L392; atcaagaatt tgttggtgag Cgtgcagtct cgatgggaga aggttgtcca gcgatctatt L398; gaaagagggc gatcactaga cagg gcaa aacaattcca tgaagcttgg L404; aaaaaactga ttgactggct agaagatgca gagagtcacc tggactcaga actagagata L410; gacc aaat taaacttcag ctttctaagc ataaggagtt tcagaagact L416; cttggtggca agcagcctgt gtatgatacc acaattagaa ctggcagagc actgaaagaa L422; aagactttgc ttcccgaaga tagtcagaaa cttgacaatt tcctaggaga aaatgggata ctgtttgtgg caagtctgtg gagcggcagc acaagttgga ctcttttcgg gtcagttcat ggatgctttg caggcattgg ttgactggtt gagccacagc tggctgagga ccagcccgtg cacggggacc ttgacctcgt atggatgcac acaaggtttt ccagaaggaa ctgggaaagc gaac ctgaagcggt caggccgaga gctgattgag aatagtcgag atgacaccac ggacagctcc aggaactgag cactcgctgg gtct gtaaactctc tgtttccaaa L464; caaagccggc ttgagcaggc acaa gcggaagtgt ttcgagacac agtccacatg L470; gagt ggctttctga agcagagcaa acgcttcgct ttcggggagc acttcctgat L476; gacacagagg ccctgcagtc tctcattgac acccataagg aattcatgaa gaaagtagaa L482; gaaaagcgag tggacgttaa ctcagcagta gccatgggag aagtcatcct ctgc L488; caccccgatt gcatcacaac catcaaacac tggatcacca tcatccgagc tcgcttcgag L494; gaggtcctga catgggctaa gcagcaccag cttg cctt ctgagctcct ggaagaactt ctggcatgga tccagtgggc gggatcagga gccaatcccg cagaacattg accgagttaa agacatttat ggaggagatg actcgcaaac agcctgacgt acaaaaggaa aaacatagag cctactcacg cgcctttcat cgcagcggag gcaggaaatc cctaagtcag cctc ctcccatgcc gaag caaaaaaccc caac cagctttctg cccgctggca ctgttagcac tggagcggca aaggaaactg aatgatgcct tggatcggct aaagaatttg ttga ctttgatgtc tggaggaaaa agtatatgcg ttggatgaat L548; cacaaaaagt ctcgagtgat ggatttcttc cggcgcattg ataaggacca ggatgggaag L554; ataacacgtc aggagtttat Cgatggcatt ttagcatcca ccac caccaagtta L560; actg ctgtggctga cattttcgac Cgagatgggg acat tgattattat L566; gaatttgtgg ctgctcttca tcccaacaag gatgcgtatc gaccaacaac cgatgcagat L572; aaaatcgaag atgaggttac aagacaagtg gctcagtgca aatgtgcaaa aaggtttcag L578; gtggagcaga agaa taaataccgg tttggggatt ctcagcagtt gcggctggtc L584; cgtattctgc gcagcaccgt gatggttcgc gttggtggag gatggatggc cttggatgaa L590; tttttagtga aaaatgatcc ctgccgagca cgaggtagaa ctaacattga acttagagag L596; aaattcatcc taccagaggg agcatcccag ggaatgaccc ccttccgctc tcga L602; aggtccaaac catcttcccg ggcagcttcc cctactcgtt ccagctccag gctgtacatc catgccatct tctccagcca ccccagccag catcaggtag caagttgaaa cgaccaacac caacttttca acatcccttg ctggtgatac cagcaatagt tcttccccgg cctccacagg aatcgggcag accctaaaaa cagt cgccctggga ctgg gggagtcgag ccagcagccg gcgaggaagt gacgcttctg actttgacct cagtctgctt gttccgacac ttcagaaagc agcgctgcag ggggccaagg caactccagg L644; agagggctaa acaaaccttc caaaatccca accatgtcta agaagaccac cactgcctcc L650; cccaggactc caggtcccaa gcgataacac tgtctaagca cccccaagcc actatccact L656; ttgaatcctg ctccatacat tgggtgtata tttattctga acgggagaag ttatattgtt L662; aaaagtgtaa aagaataatt gtgttatgaa ttat tttttttctt tttgtaagtt L668; actattttca tatt tatgtagata aaatttgcct cctggtaacc ctgtaatgga L674; tggggcccag aata tttgagaaaa acaagtgaaa agat attaaaaaaa aaaaagccta ttaatagggt ttctgcgcgg tgcagggttg ttatctttta ggattattcc taaatgcatc ttctttataa acttgacttg aagataaatt atattaaaaa aataagaatc ctgcagtgtt actc tttttttgta L698; ggac acctcaatta gcaagaactg aggggagggc tttttccatt ttgtgatttt tagctaaaga gagggaacct catctaagta acatttgcac atgatacagc L710; aaaaggagtt cattgcaata ctgtctttgg tttc gggt caaatagctg ctagaattca ggggtaaatg taagtgttca gaaaacgtca gaacatttgg L722; ggttttaaac tgatttgttg ctccctatcc agcctagaca ccagtaactc ttgtgttcac L728; caggacccag acccttggca agggataggc tcgttggtga cattgtgaat ttcagatttg L734; ttttatccac tttttttgct atttatttaa atggtcgatc aacttcccac aaactgagga L740; atgaattcca cgagcctgtt ctgaaaatgt ggacgtaaga caaacacgtg ctcgtccttt L746; aatggagttc accagcacac ttgttaacca gtcctgtttg ctttcgtctt tttttgtgcg L752; taataaagtc aactgaccaa gtgaccatga aaaggggctg tctggggctc ctgtttttta L758; gctgctgttc tccg accatgttgc tgtgtgatta tctcaattgg ttttaattga L764; ggcagaaact gaagctctac caatgaactg tttagaaaca agacacactt ttgtattaaa L770; attgcttgca gtaacaaata ttttgtattt cctgattttc ttttcaacta ttaccttatc L776; tataaatgtt gggt ataatcatgt tgtaggtact taaatgcatt ccgcaaatca L782; aaatatcttg atggataaat gctt aatagatctt gttttatttc aaaa L788; aaaa aaaaaaaaaa aaaaaaaaaa aaaa aaaaaaaa Protein seguence: NCBI Reference Seguence: NP_036222.3 LOCUS: 222 ACCESS| ON: N P_036222 N P_148984 l msssdeetls erscrsersc rsersyrser sgslspcppg dtlpwnlplh eqkkrksqu 6; vldpaeravv rvaderdrvq kktftkwvnk hlmkvrkhin dlyedlrdgh nlisllev;s 12; giklprekgr mrfhrlqnvq ialdflquq vklvnirndd itdgnpkltl gliwtiilhf 18; qisdiyisge sgdmsakekl llwtqutag tnfs scwsdgkmfn alihryrpdl 24; vdmervqiqs nrenleqafe vaerlgvtrl ldaedvdvps pdeksvityv ssiydafpkv ; peggegisat evdsrwqeyq srvdslipwi msdk tqunpvelk alynqyihfk 36; eteilakere kgrieelykl levwiefgri klpquhpnd klii emlereks;r 42; paverlelll qnga lnceekltla knthadaah lesgqpvqce sdvimyiqec 48; eglirqlqu lqilrdenyy qleelafrvm rlqdelvtlr lectnlyrkg hftslelvpp 54; stlttthlka epltkathss stswfrkpmt raelvaisss edegnlrfvy ellswveemq 60; mkleraewgn dlpsvelqle tqqhihtsve elgssvkear lyegkmsqnf tlgk 66; letqycklke tssfrmrhlq slhkfvsrat aeliwlneke eeelaydwsd nnsnisakrn 72; yfseltmele ekqdvfrslq slen hpakqtveay saavqsqlqw mkqlclcveq 78; hvkentayfq eles flrnlquik rkyscdhnts lsrledllqd smdekeqliq 84; skssvaslvg rsktivqlkp lknt isvkavcdyr cknd ecvlednsqr 90; tkwkvisptg svcf kdai emasrveqsy qumalwhql hvntkslisw 96; nylrkdldlv thnleklrs sapgechqim knlqahyedf lqurdsvlf svadr;rlee ;02; eveackarfq hlmksmened keetvakmyi selknirlrl eeyeqrvvkr iqslassrtd ;08; rdawqdnalr iaeqehtqed qulrsdlda vsmkcdsflh qspssssvpt lrselnllve ;14; kmdhvyglst vylnklktvd vivrsiqdae eikl sqeevvladl saleahwstl ;20; rhwlsdvkdk nsvfsvldee iakakvvaeq msrltpernl dleryqekgs hrvi ;26; qsel esiqevlgdy rachgtlikw ieettaqqem mkpgqaedsr vlseq;sqqt ;32; alfaeiernq tkldchkfs qqystivkdy elqlmtykaf vesqqkspgk rrrmlsssda ;38; itqefmdlrt rytalvtltt qhvkyisdal rrleeeekvv eeekqehvek vkellngst ;44; larntqgkat ssetkestdi ekailequl seelttkkeq tsqi lnalnkayhd lcdgsanqlq qlqsqlahqt un ;56; swvgqaeral ttqq dlsalqknqs dlkdlqddiq nratsfatvv kdiegfmeen ;62; qtklsprelt alreklhqak eqyealqeet rvaqkeleea vtsaquete kskaakelae ;68; nkkkidalld wvtsvgssgg qlltnlpgme qlsgaslekg aldttdgymg vnqapekldk ;74; qcemmkarhq ellsqqqnfi latqsaqafl dqhghnltpe eqqmquklg elkeqystsl 180; aqseaelkqv qthdelqkf lqdhkefesw lersekelen mhkggsspet lpsllqugs 186; fsedvishkg dlrfvtisgq kvldmensfk egkepseign lvkdklkdat erytalhskc 192; trlgshlnml qns adslqawmqa klls dtvasdpgvl qeqlattkql 198; hqvp vequkvard imeiegepap dhrthettd silshquls yslaerssLl 204; qkaiaqsqsv qeslesllqs igeveqnleg kquslssgv tnmk 210; sleatremvt dstt aavlqgklae vsqrfeqlcl qqqekesslk kllpqaemfe 216; hlsgququ enksrmlasg nquqdithf quiqelnle nldt lehlvtelss 222; cgfaldlcqh qdrvqnlrkd tvke rekdasscqe qldefrklvr tquwlkete 228; gsipptetsm sakelekqie hlksllddwa eein ckgtslenli meitapdsqg 234; ktgsilpsvg ssvgsvngyh tckdlteiqc lkye klggvlherq eslqailnrm 240; ansv lqwleskeev lksmdamssp tktetvkaqa esnkaflael eqnspkiqu 246; kealagllvt ypnsqeaenw lnsr weratevtva rqrqleesas hlachaaes 252; qlrpwlmeke lmmgvlgpls idpnmlnaqk ququlkef earrqqheql neaaqgiltg 258; pgdvslstsq vqkelqsinq kwveltdkln srssqidqai vkstquell qdlsekvrav 264; gququsai stqpeavqu leetseirsd leqldhevke aqtlcdelsv ligeqylkde 270; lkkrletval plqgledlaa drinrlqaal astqqqumf delrtwlddk qsqqakncpi 276; saklerlqsq lqeneequs syev ivaegeslll svppgeekrt lqnqlvelkn 282; hweelskkta drqsrlkdcm qkaqkquhv edlvaiedc kakmselrvt ldpvqlessl 288; lrskamlnev ekrrslleil nsaadilins seadedgird qnmd avteelqakt 294; gsleemtqu refqesfkni ekkvegakhq leifdalgsq acsnknlekl raqqevlqal 300; epquylrnf tqglvedapd gsdasqllhq aevaqqefle vquvnsgcv mmenklegig 306; qfhcrvremf sqladlddel dgmgaigrdt iedv rlflnkihvl kldieaseae 312; crhmleeegt Ldllglkrel ealnchgkl tergkarqeq leltlgrved fyrklkglnd 318; attaaeeaea quvvgteve iinqqladfk qukequpl ququvngl gqgliqsagk 324; dcdvqglehd meeinarwnt lnkkvaqria qlqeallhcg qudalepll swladteeli 3301 anqkppsaey kvvkaqiqeq kllqulddr katvdmlqae sael adrekitgql 336; wtel lskaaaquq ledilvlakq fhetaepisd flsvtekkla nsepvgtqta 342; kiqqqiirhk aleedienha tdvhqavkig qslssltspa quvlsekid slqaryseiq 348; aall dqalsnarlf gedevevlnw laevedklss vakdfkqdv lhrqhadhLa 354; lneeivnrkk nvdqaikngq allkqttgee vlliqekldg iktryaditv tsskalrtLe 360; qarqlatqu styeeltgwl reveeelats gquPtgeqi quqqquel kkevmehr;V 366; ldtvnevsra llelvarar vsda neqyklvsdt igqrvdeida aiqrsqqyeq 372; awva etkrklmalg pirleqdqtt aqlqvqkafs idiirhkdsm delfshrsei 378; eqkt vlqektesli qqyeaislln seryarlera wet yeelspwiee 384; traliaqlps paidheqqu qqeemrqlre siaehkphid kllkigpqlk elnpeegemv 390; eekquaenm yaqikeevrq ralaldeavs qstqitefhd kiepmletle nlssrlrmpp 396i lipaevdkir ecisdnksat velequpsf ealkrrgeel igrsqgadkd laakeiqdkl 402; dqmvffwedi karaeereik fldvlelaek fwydmaallt tikdtqdivh dlespgidps 408; iiquveaae tikeetdglh eelefirilg adlifacget ekpevrksid emnnawen_n 414; ktwkerlekl edamqaavqy thlqamfdw ldntviklct mppvgtdlnt vkdqlnemke 420; fkvequqqi emeklnhqge lmlkkatdet drdiireplt elkhlwenlg ekiahrqhkl 426; egallalgqf qhaleelmsw lthteellda dpkv ievelakhhv lkndvlahqa 432; tvetvnkagn ellessagdd asslrsrlea mnchesvlq kteereqqlq sthqaqgfh 438; seiedfllel trmesqlsas kptgglpeta hmel ysqlkakeet ynqlldkgrl 444; mllsrddsgs gskteqsval leqkwhvvss kmeerkskle ealnlatefq inwl 450; tlaeqslnia sppslilntv lsqieehkvf hrdq iieldqtgnq lkflsqkqdv 456; vliknllvsv qsrwekvvqr siergrsldd arkrakqfhe awkklidwle daeshldsel 462; eisndpdkik lqlskhkefq ktlggkqpvy dttirtgral kektllpeds lgev 468; vcgk sverqhklee allfsgqud alqalvdwly aedq pvhgdldlvm 474; nlmdahkqu kelgkrtgtv qvlkrsgrel iensrddttw vkgqlqelst rwdtvcklsv 480; eqal kqaevfrdtv hmllewlsea eqtlrfrgal lqsl idthkefmkk 486; dvns avamgevila vchpdcitti khwitiirar feevltwakq hqquetaLs 492; elvanaelle ellawiqwae ttliqrdqep ipqnidrvka liaehqtfme 498; rvtktykrkn iepthapfie ksrsggrksl sqptpppmpi lsqseaknpr inqlsarwqq 504; vwllalerqr klndaldrle elkefanfdf derkkymrw mnhkksrvmd ffrridkdqd 5101 gkitrqefid gilaskfptt klemtavadi fdrdgdgyid yyefvaalhp nkdayrpttd 516; adkiedevtr qvaqckcakr fqveqigenk yrfgdsqqlr lvrilrstvm wmal 522; deflvkndpc rargrtniel pega sqgmtpfrsr grrskpssra asptrssssa 528; sqsnhsctsm pasg tkvipssgsk lkrptptfhs srtslagdts nssspastga 534; ktnradpkks asrpgsrags ragsrassrr gsdasdfdll etqsacsdts essaaggqgn 540; srrglnkpsk iptmskkttt asprtpgpkr MAP1 B al : MAP1 B Official Name: microtubule-associated protein 18 Gene ID: 4131 Organism: Homo sapiens Other Aliases: , MAPS Other Designations: MAP-1B Nucleotide seguence: NCBI Reference Seguence: NM_019217.1 LOCUS: NM_019217 ACCESSION : 217 XM_001061557 XM_215469 1 cgcgcaggga gagagcggag ggggaggcga cgcgcgccgg gaggaggggg gacgcagtgg 6; gcggagcgga gacagcacct tcggagataa tcctttctcc tgccgcagag cagaggagcg 12; gcgggagagg aacacttctc ccaggcttta gcagagccgg ggcg accgtggtgg 18; tggaagccac cgagccggag ccatcgggca gcatcggcaa cccggcggcg accacctcgc 24; tgtc gcaccgcttc ctagacagca agttctactt gctggtggtg gtcggcgaga ; cggtgaccga agagcacctg aggcgtgcca tcggcaacat cgagctgggg tcgt 36; gggacacaaa cgag tgcaacttgg accaagagct caaacttttc gtgtctcgac 42; cgag attctctcct gaagttccag gacaaaagat cctccatcac cgaagtgacg 48; tcttagaaac tgtagttctg atcaaccctt cggatgaagc agtcagcacc gaggtgcgtt 54L tgatgatcac tgacgccgcc aaac tgctggtgct caccggacag tgctttgaga 60; acactggaga gctcatcctc cagtcaggct ctttctcctt ccagaacttc atagagattt 66L tcaccgacca agagattggg gagctcctaa gcaccaccca tcctgccaac aaagccagcc 72; tcaccctctt ctgccctgag gaaggagact ggaagaactc caaccttgac agacacaatc 78; tccaagactt catcaacatc aagctcaact cagcttctat cttgccagaa atggagggac 84; tttctgagtt caccgagtac ctctcggagt ctgtcgaagt cccctccccc tttgacatcc 90; tggagccccc gacctcgggc ggatttctga agctctccaa gccttgttgt tacatttttc 96; cggggggccg cggggactct gccctgttcg cagtgaacgg attcaacatg ctcattaacg L02; gaggatcaga aagaaagtcc tgcttctgga agctcattcg gcacttggac nggtggact L08; ccatcctgct catt ggggatgaca acttgcccgg gatcaacagc atgttgcaac L14; gcaagattgc agagctggaa gaggagcggt cccagggctc caccagcaac L20; tgaaaaacct catctcccct gacttggggg ttgtgtttct caatgtacct gaaaatctga L26; aaaacccaga acccaacatc aagatgaaga gaagtacaga agaagcatgc agtacctaaa caaactgtcc ccag agcctttatt tagaagtgta ggcaatgcca L38; ttgagcctgt catcctgttc caaaaaatgg gagtgggtaa actggagatg L44; acccagtcaa aagcagcaag gaaatgcagt atttcatgca gcagtggact ggaaccaaca L50; aagacaaggc tgaacttatc aatg gtcaagaagt cccg tgacttccgt ctcgtctttg attgtgtggc acccagccaa ccctgctgag gggttctgtt tcctggaaac cagt acaacatcct agaagggctg gaaaaactca L68; taga cttcctaaag cagccactgg ccacccaaaa agatctcact ggccaggtgt L74; ccaccccccc agtgaaacag gtcaagttga aacagcgggc ccga gagagtctga L80; agccagccac aaaaccactt tccagtaaat cagtgaggaa ggagtccaaa gaggaggccc L86; ctgaagccac aaaagccagc gaaa aaacacccaa agttgaaagc aaagagaaag L92; tgatagtgaa aaaagacaag ccaggaaagg tagaaagtaa gccatcggtg aagg L98; aggtgcccag caaagaggag cagtcgcccg tcaaagctga tgag aaggcggcca 204; cggagagcaa acccaaagtc accaaagaca aagtggtaaa aaaggaaata aagacaaaac 210; ccgaagaaaa gaaagaggag aagcccaaga aggaagtggc taaaaaggaa gacaaaactc 216; ccctcaagaa agacgagaag cccaaaaagg aagaggcgaa gaaggagatc aagaaagaaa 222; tcaaaaagga agagaaaaag gagctgaaga aagaggtgaa gaaggaaacg cccctgaagg 228i acgccaagaa ggaggtgaag aaagacgaga agaaagaagt taaaaaggaa gagaaggaac 2341 ccaaaaagga gattaagaag atctccaagg acataaagaa atccactcct ctgtcagaca 240; caaagaaacc ggctgcattg aaaccaaaag tagcaaagaa agaagagccc accaagaagg 246; ttgc tgctgggaaa ctcaaggaca aggggaaggt caaagtcatt aaaaaggaag 252; gcaagaccac cgct gccacagctg ttggcactgc tgccgtggct gcagcagccg 258; gagtagcggc tcct gaac ttgaagctga gcggtccctc atgtcgtccc 264; ctgaggatct aaccaaggac tttgaggagc taaaggctga ggagatcgat gtagcgaagg 270; acatcaagcc tcagctggag ctcattgaag atgaagagaa actgaaggaa accgagccgg 276; gagaagccta cgtcattcag aaagagacgg aagtcagcaa aggttctgct gagtcacctg 282; ggat caccaccact gagggggaag gggagtgcga gcaaaccccc gaggagctgg 288; agccagttga gaagcagggc gtggatgaca tcgagaagtt cgaggatgaa ggcgctggtt 294; ttgaagaatc ctcagaggcc ggagactacg aagagaaggc agaaactgag gaggccgagg 300: agccggaaga agacggggaa gacaatgtga gcgggagcgc ctcgaagcac agccccacag 306; aagacgaaga aatcgctaag gctgaggcgg acgtacacat caaggagaag agggagtctg 312; tggccagcgg cgatgaccgg gccgaagaag acatggatga agcgcttgag aaaggagaag 318; ctgaacagtc tgaggaggag gagg acaa agcagaggac gccagagagg 324; aagaccatga gcccgacaaa actgaggctg aagattatgt gatggctgtg gttgacaagg 330; ccgcggaggc ngagtcacc gaggatcagt atgatttcct ggggacaccg caac 336; ctggagtcca gtctcctagc cgagaacccg caat tcatgatgag accctacccg 342; gaggctccga gagcgaggcc actgcttcag atgaggagaa tcgagaagac cagcctgagg 348; aattcactgc tacctccgga tatactcagt ccaccatcga gatatctagt gagccgactc 354; caatggatga gatgtccact cctcgagatg tgatgaccga cgagaccaac aatgaggaga 360; cagagtcccc gtctcaggag ttcgtgaaca ttaccaaata cgagtcttcg ctgtactctc 366; actc caaacctgtg gttgcatcat tcaatggatt cggg tcaaagacag 372; acgccactga ngtagggat tacaacgctt ccgcctccac catatcacca ccttcgtcca 378; tggaagaaga caaattcagc aagtctgctc ttcgtgacgc ttaccgccca gaagagacgg 384; aaac cggtgccgag ttggacatca aagatgtttc ggatgagaga cttagcccag 390; gtcc atccctgagt ccttctccac catcacccat gact cccctgggtg 396; aacgtagcgt gaatttctct ctgacaccca acgagatcaa agcctctgca gagggagagg 402; cagt agtgtccccc ggagtgaccc aagcagtagt tgaagaacac tgtgccagtc 408: ctgaggagaa gaccttggag gtagtgtcac cgtctcagtc aggc agtgcgggcc 4141 acacacctta ctaccaatct cccaccgacg aaaagtccag tcacctacct acagaagtca 4201 ctgagaacgc gcaggcagtc agct ttgaattcac tgaggccaaa gatgagaacg 426; agaggtcgtc catcagcccc atggatgaac ctgtgcctga ctcagagtct gaga 432; aagttctgtc acgc agccctcccc gatc cgagtccgca tatgaagact 438; tcctgagtgc ggatgacaag ggca gacgttcaga aagccccttt gaagggaaga 444; atggaaagca aggcttctca gacaaagaaa tttc tgacctgact tccgatcttt 450; accaagacaa gcaggaagag aaaagcgcgg gcttcatacc gataaaggaa gactttagtc 456; cagaaaagaa agccagcgat gctgaaatca tgagttctca atcagctctg gatg 462; aaaggaaact gggaggagat ggatctccaa cgcaagtaga tgtcagtcag tttggctctt 468; tcaaagaaga caccaagatg tccatttcgg aaggcaccgt ttcagacaag tccgccacgc 474; ctgtggatga gggcgtggcc acct acat ggaaggtgtg gcctcagtgt 480: caaccgcctc tgtggctacc agctcgtttc cagagccaac cacagatgac gtgtctcctt 486; ctctccacgc tgaagtgggc tctccacatt ccacagaggt ggatgactcc ctgtcggtgt 492; cggtggtgca aact actttccagg aaacagaaat gtctccgtct aaagaagagt 498; gcccaagacc aatgtcgatt tctcctcctg acttctcccc taagacagcc aaatccagga 504; caccagttca agatcaccga tccgaacagt cttcaatgtc tattgaattc ggtcaggaat 510; cccccgagca ttctcttgct atggacttta gtcggcagtc tccagaccac cctactgtgg 516; gtgctggtat gcttcacatc accgaaaatg ggccaactga ggtggactac tccg 522; atatccagga ctctagtttg tcacataaga ttccgccgac agaagagcca tcctacaccc 528; aggataatga tctgtccgag tctg tgtctcaggt ggaggcttcc ccatccacct 534; cttctgctca cactccttct cagatagcct ctcctcttca ggaagacact ctctctgatg 540; tcgttcctcc cagagatatg tatg cctcgcttgc gtctgagaaa gtgcagagcc 546; tggaaggaga gaaactctct ccaaaatccg atatttctcc gctcacccct cgagagtcct 552; cacctacata ttcacctggc ttttcagatt ctacctctgg agctaaagag agtacagcgg 558; cttaccaaac ctcctcttcc ccaccaatag atgcagcagc cgcagagccc ttcc 564; gctcctcaat tgat acaatgcagc atcacctggc cttgagtaga gatttgacca 570; catctagtgt ggac aatggaggga ccgg taac tatgcctatc 576; aaaagcccga gagcaccacc gaatccccag atgaagaaga ctat gaatctcacg 582; agaaaaccat ccaggcccac gatgtgggtg gttactacta tgagaagaca gagagaacca 588: taaaatcccc atgtgacagt ggatactcct atgagaccat tgagaagacc accaagaccc 5941 atgg tggctactcc tgtgaaatta ccgagaaaac cactcggacc cctgaagagg 600; actc gtatgagatc agcgagaaga caacacgaac ccctgaagta agtggctaca 606; cctatgagaa gagg tccagaaggc tcctcgatga cattagcaat ggctacgatg 612; acactgagga tggtggccac acacttggcg actgtagcta ttcctacgaa accactgaga 618; aaattaccag tgaa tctgaaagct attcctatga gacaactaca aaaacaacac 624; ggagtccaga cacctctgca tactgttacg agaccatgga gaagatcacc aagaccccac 630; aggcatccac atactcctat gagacctcag accgatgcta cactccagaa aggaagtccc 636; cctcggaggc acgccaggat gttgacttgt gtctggtgtc tgaa ttcaagcatc 642; ccaagaccga gctctcacct atta atccaaaccc tctcgagtgg tttgctgggg 648; aagagcccac tgaagaatct gagaagcctc tcactcagtc tggaggagcc cccccacctt 654; caggaggaaa acaacagggc agacaatgcg atgaaactcc acccacctca gtcagtgagt 660; cagctccatc ccagacggac tctgatgttc ccccagagac gtgc ccctccatca 666; cagctgatgc caacattgac tctgaagatg agtcagaaac catccccaca gacaaaacgg 672; ttacgtacaa acacatggac ccgcctccag cccccatgca agaccgaagc ccttctcctc 678; gccaccctga tgtgtccatg gtggatccag aggccttggc gcag aacctaggca 684; tgaa aaaggatctg aaggagaagg ccaagaccaa gaaaccaggc acaaagacca 690; agtcctcttc acctgtcaaa aagggtgatg ggaagtccaa gccttcagca gcttccccca 696; aaccaggagc cttgaaggaa tcctctgaca aggtgtccag agtggcttct cccaagaaga 702; aagagtctgt ggagaaagct acca ccaccactcc caaa gccacacgag 708; gggaagagaa ggacaaggaa actaagaatg cagccaatgc ttctgcatcc gtga 714; agactgcaac agcaggacca ggaaccacta agacggccaa cacc cccg 720; gcctccctgt gtatttggac ctctgctata ttcccaacca cagcaacagt aagaatgtcg 726; atgttgagtt tttcaagaga gtgaggtcat cttactacgt ggtgagtggg aacgaccctg 732; ccgcggagga gcccagccgg gctgtcctgg atgccttgtt ggaagggaag gctcagtggg 738; gaagcaacat gcaggtgact ccaa cacatgactc tgaggtgatg agggagtggt 744; accaggagac ccacgagaag cagcaagacc tcaacatcat ggtcctagca agcagtagta 750; cagtggtcat gcaagacgag tccttccctg catgcaagat agaactgtag aaaccgcagc 756; cgaccacacc acaggatttg aactgtgttt ccagaaattc ctgaatttga aactaccttt 762; tcttaaacgt atct aattacgtca ctgaacaagg acctgccaga tgctatacag 768; tgtcatgqtg atgcaaatca ctgatatttc tcaatttttg ccgaccgcta gtaa 7741 ccatattccc acaatagatt tcaagttact gcaaaattac ctacccccgt tcatctctgc 780; tgaaatacgt ggaagccagg caca tgac ttctgctagg catt 786; tatcttagag agagaaagag agagggaaag agagtcagcg ggaggggagg gagagcggcg 792; ggagagaggg acaaagagac tgccggagag agagagagtg agagtgagag aatc 798; agaaagaaaa agaatgcaag acaaaaaagg tagagagttc gatg cccagggaga 804; aagagtgggc aggatggggt atagagaaga caccagcaac tgggtctgcg ttttcccaga 810; ccacagcgat tcattctgtg gtctacacag gtggagtttt ccattttcac cagagtcatc 816; agaaagagtc gatctcctaa agcttgtttc taaagaacag gaaggacgaa gcctgtcacg 822; agggcatgag atttttcacg ccttaattaa atgcctttgc ggct gcccacgcat 828; aatgtagcaa gtgtaggctg gagaagcgag gagc cccgttcaga agac 834; ttttcaaaca Cgtgagtagg taag gcatgctccc agcatttgtc tacccaagtc 840; cacatcgagt caacccgcat acac ccaaggccac cccagttaac tgaagcaaat 846; accaaagcag ttgggagaac atatgggaga cattttgcct gtga cttgaatgta 852; caaagttacc cgatgcactt attttttaac gtgagacggc aagtttttaa aacatccgtg 858; taggattgta ccag gcgaaggagc atgcggggag ggaaggtact agaaactcgt 864; ctgactggcc caacagttta gtgcagagtc atgcttggtg aacgtcacca atgt 870; acccgtggcc tctcagccag tgca ccctaagtag taac gcttcaagtt 876; taagactgaa atggcttctc taatcagaac cagggaaaca atgaatctca cggtggaagg 882; ggttctcggc aagtgtacag tgtctgcctt cctttgtctt gcattgacta ttttaatttt 888; tccattaatt ccaacacgtg ggaacacatg tacagaagat tttttttttt atga 894; gaacttttca tagatgaact ttctaacgaa tgttttcatt tacagaaaaa tgcaaagaaa 900; aatttgaagc gatggtcttt ttttttaatt attattttaa gtgttttgta agacaaaaaa 906; attgaagttt tttgaggttc tggaaagatt tgaagcctga tattgaagtc gtgatgatat 912; ttatttaaaa acccgtcact actggaaacg gtggtacctc tttg actctcatat 918; tatgaaagtg tgagtccgtg ctgtttgaga gtgggtaggt ggcagggtag gctactgttc 924; agggtttcac agtgctattc cctcctcctt tcaagatttt ttcaccgtga ggtggaagag 930; ccaagttcag aagcacccta gcgccagctt gcttgggcct tttctggaaa acattcattg 936; ataa cagattgaaa acaaatgaat tctcagctcc tacgttcacc atgtagagag 942; ttcagacaca gtca ctgtcatcac tgaaccacaa actcgtaacg ccagatcatg 948; aggaaatctt tcgccaagtt tcaaacggca gatccatgta ccaggggttc agagttggca 9541 atcttctcag tgacagccat gacagctcgt tcacgctgag tttcctgcag actcttaaga 960; tctccggagt agtgaacaat gacctcattt tattttctat gttagttatt tatttcaaaa 966; gttacatttt agtttacttt tcgtctgtga agtctatgtt tcgcactgct gtttactctg 972; agggtttaac aatatttctc cagggtcccc tcaccgagga cccatgcagt ctacttaatg 978; atca ccaa aatt ttttaaagaa aattaatatt ctatttttgt 984; taggcgtctc taggaatgca gcttttattt attttcctat ttctttccaa aaag 990; atta caga ctat atggtggaag agacattgag aacggagttc 996; attgagtgtc cttg tttctcactt caggcaagct gaacacacac aagatggcaa L002; tctcatggta gctgttgggt tggtccacac aatacactca aagagaaaca gtttctagcc L008; ttcttgccaa atccagacct ctggttgact tttctttcct aaaagatgga gttactgccc L014; agttctacag cttaaattta tttagccttt tatattattt tgttttaaag ccatagccag ccttcagtta taaccactcg accccatacg ccgatgggga aaacactttt ttttttcaaa actgtcactg acacaatctg gcagcaccat cctgacccct agtg agccgagqac accagggtga agtacattgg ctttgcaggt agctcctgat cccctcctac ctgtgtggct gatctgatct tgcttgttcc atacatacac gaggtagcca tgga ctatgtacat ttgtggtgag agctcaaaac ttctagaaga tttgtgcata caatccttga tccaattgta tagattgact aaggcgtttc caccctgttt aaatgatgga attctattgt gctagcactc ccgatcatga L068; cccttttggt agtatttgta aacaaaattc tacagagact aaatcttaga gataatcctc L074; catttcaatt ttaatcaatt ctgtcctcct tttttccaaa ccccgaaccc catgcatgct L080; ttcccagtct tgtgatggga ctggacacag accc cctcgcctca aacaccattt L086; tccatggaat tcaaaagaaa aaaatttttt ttcttaacct tacatatcat agtgaatggt L092; ttccccggtg tatatgaatg gtgt ttccaatagc aatt taggagcttt L098; ctaatactcg ttttataaat ttaatcattt ggaa attttaccac tttg LL04; tgttacaaat cttagctcct ggagcggcac tcag gagttgtttt ttctcacctc LL10; catt tgtcacagga ggtccctgct tggcaatgac atttgtgagt taggataatg LL16; acgttccttc tctccttttt ttttcctttc atacttcaga tttaggagaa aaagattctg ’122; tttccacgtg agaggaactg taagctttta tcacgtaacc agctgaacaa gatgagcacc gcgctttggt agcgattagg ttttattcac tatttcaaaa atccggaact tttaagaatt catttcaaag aaactgaaaa ggaagggaaa aaaaaacaac agctaataat cggcttctcc agctcgcgaa actggagccc cggagaagtg gctctgctca gccgcccgcc gcgg LL52; cggtccttgc tttccccgca tgcgcccgca ggcagcgtgc agtcctaagc ccggctgtgg LL58; tcac tctctctctt gttctgaatg gtgtttgtgt cggtctgcct ctgtgtatgg 11641 tattatgtct tataatcctg catcacttcc tcca tcta atgtagaaaa 11701 attagtttcc agtgaaagta atatgtagtg cttttatggt atttgtgtgc aatatcccct 11761 cttctattga ggatatttga tgtaaaggaa aaaaaaaaag aaaaaagaaa ctgagttcca 11821 caataaaata caaagtggca aaagttcact tgtgtgttga gacatcaaaa aaaaaaaaaa 11881 aaaa Protein seguence: NCBI Reference Seguence: NP_062090.1 LOCUS: NP_062090 ACCESSION: 090 XP_001061557 XP_215469 1 eate pepsgsignp aattspslsh rfldskfyll vvvgetvtee hlrraignie 6; 1girswdtn1 iecnldqelk lfvsrhsarf spevpquil letv vlinpsdeav 12; mitd aarhkllvlt gqcfentgel ilqsgsfsfq nfieiftdqe igellstthp 18; ankasltlfc peegdwknsn 1drhnlqdfi asil pemeglseft eylsesvevp 24; spfdileppt sggflklskp ccyifpggrg dsalfavngf nmlinggser kscfwklirh ; 1drvdsi11t higddnlpgi nsmlquiae Leeersqgst snsdwmknli spdlgvvan 36; vpenlknpep nikmkrstee acfthylnk Lsmkpeplfr svgnaiepvi 1qumgvgk1 42; emyvlnpvks skemqyfmqq wtgtnkdkae "ilpngqevd ipisyltsvs slivwhpanp 48; vlfp gnstqynile gleklkhldf "kqplatqkd 1tgqutppv kqvklquad 54; sreslkpatk plssksvrke skeeapeatk asqvektpkv eskekvivkk dkpgkveskp 60; vpsk eeqspvkaev aekaateskp kvtkdkvvkk eiktkpeekk eekpkkevak 66; kedktplkkd ekpkkeeakk eikkeikkee kkelkkevkk etplkdakke vkkdekkevk 72; keekepkkei kkiskdikks tplsdtkkpa alkpkvakke eptkkepiaa gkvk 78L vikkegktte aaatavgtaa vaaaagvaas gpakeleaer slmsspedlt kdfeelkaee 84L idvakdikpq 1eliedeekl ketepgeayv iqketevskg saespdegit eceq 90; tpeelepvek qgvddiekfe degagfeess eagdyeekae teeaeepeed gednvsgsas 961 khsptedeei akaeadvhik ekresvasgd draeedmdea 1ekgeaeqse edka 1021 edareedhep dkteaedyvm avvdkaaeag vtedqydflg tpakquvqs psrepassih 1081 gses eatasdeenr edqpeeftat tiei sseptpmdem stprdvmtde L14; tnneetesps qefvnitkye eysk pvvasfngls dgsktdatdg rdynasasti L20; sppssmeedk fsksalrday rpeetdvktg aeldikdvsd erlspaksps lspsppspie L26; ktplgersvn fsltpneika tavv spgvtqavve ehcaspeekt levvspsqsv L32; tpyy qsptdekssh lptevtenaq avpvsfefte akdenerssi spmdepvpds L38; espiekvlsp lrsppligse lsad dkalgrrses gkqg fsdkespvsd L44; ltsdlyqqu eeksagfipi ekka sdaeimssqs alalderklg gdgsptquv L50: squsfkedt kmsisegtvs dksatpvdeg vaedtyshme gvasvstasv vgsphstevd dslsvsvvqt pttfqetems pskeecprpm hrseqssmsi pehs lamdfsrqsp dhptvgagml slshkippte epsytqdndl selisvsqve saht dmslyaslas ekvqslegek lspksdispl tpressptys ssppidaaaa epygfrssml fdtmqhhlal srdlttssve tpgd L86; fnyaqupes ttespdeedy dyeshektiq ahdvggyyye ktertikspc dsgysyetie L92; kttktpedgg ysceitektt rtpeeggysy eisekttrtp evsgytyekt ersrrledi L98; sngyddtedg ghtlgdcsys yettekitsf pesesysyet ttkttrspdt tmek 204; itktpqasty syetsdrcyt perkspsear qdvdlclvss cefkhpktel spsfinpnpl 210; ewfageepte esekpltqsg gapppsggkq qgrchetpp tsvsesapsq tdsdvppete 216; ecpsitadan idsedeseti ptdktvtykh mdpppapqu rspsprhpdv smvdpealai 222; eqnlgkalkk dlkekaktkk pgtktksssp vkkgdgkskp saaspkpgal vsrv 228; aspkkkesve kamkttttpe vkatrgeekd ketknaanas asksvktata gpgttktaks 234; sthpglpvy ldlcyipnhs nsknvdveff krvrssyyvv sgndpaaeep sravldalLe 240; gkaqwgsnmq vtlipthdse vmrequeth equdlnimv lassstvvmq desfpackie 246; l MDH1 Official Symbol: MDH1 Official Name: malate ogenase 1, NAD (soluble) Gene ID: 4190 Organism: Homo sapiens Other Aliases: MDH-s, MDHA, MGC:1375, MOR2 Other Designations: cytosolic malate ogenase; malate dehydrogenase, cytoplasmic; soluble malate dehydrogenase Nucleotide seguence: NCBI Reference ce: NM_001199111.1 LOCUS: NM_001199111 ACCESSION : NM_001199111 l cgcc caag ctcggactca tcttctgggg gcag tgacccagta 6; atgggaaggg attgatttcc accttgcggg gt atggggcg ctcttaggag gactctggag 12; aagtagttgt cctgggagag gagcgatctt aatcctgctg catgacggga ggacaaaatg 18; cgacgctgca gctattttcc aaaggacgtt acggtgtttg ataaggacga tgaa 24; ccaatcagag tccttgtgac tggagcagct ggtcaaattg catattcact gctgtacagt ; attggaaatg gatctgtctt tggtaaagat cagcctataa ttcttgtgct gttggatatc 36; acccccatga tgggtgtcct tgtc ctaatggaac tgcaagactg tgcccttccc 42; ctcctgaaag atgtcatcgc aacagataaa gaagacgttg ccttcaaaga cctggatgtg 48; gccattcttg tgggctccat gccaagaagg gaaggcatgg agagaaaaga tttactgaaa 54; gcaaatgtga aaatcttcaa atcccagggt gcagccttag ataaatacgc caagaagtca 60; gttaaggtta ttgttgtggg taatccagcc aataccaact gcctgactgc ttccaagtca 66; gctccatcca tccccaagga gaacttcagt tgcttgactc gtttggatca caaccgagct 72; caaa ttgctcttaa acttggtgtg actgctaatg atgtaaagaa tgtcattatc 78; aacc attcctcgac tcagtatcca gatgtcaacc atgccaaggt gaaattgcaa 84; ggaaaggaag ttta tgaagctctg aaagatgaca gctggctcaa gggagaattt 90; gtcacgactg tgcagcagcg tggcgctgct aagg aact atccagtgcc 96L atgtctgctg caaaagccat ctgtgaccac gtcagggaca tctggtttgg aaccccagag 102; ggagagtttg tgtccatggg tgttatctct gatggcaact cctatggtgt tcctgatgat 108; ctgctctact cattccctgt tgtaatcaag aataagacct ggaagtttgt tctc 114; cctattaatg atttctcacg tgagaagatg gatcttactg caaaggaact gacagaagaa 1201 aaagaaagtg cttttgaatt tctttcctct gcctgactag acaatgatgt tactaaatgc 1261 ttcaaagctg aagaatctaa tctt tgactcaagt accaaataat aataatgcta 1321 tacttaaatt acttgtgaaa aacaacacat tttaaagatt acgtgcttct tggtacaggt 1381 ttgtgaatga atcg tgtt agtgtgcatt ctaaataaat atatattcaa 1441 atgaaaaaaa aaaaaaaaaa a Protein seguence: NCBI Reference Seguence: NP_001186040.1 LOCUS: NP_001186040 ION: NP_001186040 1 mrrcsyfpkd vtvfdkddks epirvlvtga agqiayslly signgsvfgk dqpiilvlld 61 itpmmgvldg dcal pllkdviatd kedvafkdld vailvgsmpr regmerkdll 121 kanvkifksq gaaldkyakk vgnp antncltask sapsipkenf scltrldhnr 181 akaqialklg vtandvknvi iwgnhsstqy pdvnhakvkl qgkevgvyea 1kddsw1kge 241 fvttqurga avikarklss amsaakaicd hvrdiwfgtp egefvsmgvi sdgnsygvpd 301 dllysfpvvi knktwkfveg 1pindfsrek mdltakelte ekesafefls sa NHP2L1 Official : NHP2L1 Official Name: HP2 non-histone chromosome protein 2-like 1 (S. cerevisiae) m: 4809 Organism: Homo sapiens Other Aliases: CTA—216E10.8, 15.5K, FA—1, FA1, NHPX, OTK27, SNRNP15-5, SNU13, SPAG12, SSFA1 Other Designations: NHP2-like n 1; U4/U6.U5 tri-snRNP 15.5 kDa n; [U4/U6.U5] tri-snRNP 15.5 kD RNA binding protein; high mobility group-like r n 2 homolog 1; non-histone chromosome protein 2-Iike 1; small nuclear ribonucleoprotein 15.5kDa (U4fU6.U5); sperm specific antigen 1 Nucleotide seguence: NCBI Reference Seguence: NM_001003796.1 LOCUS: NM_001003796 ACCESSION : NM_001003796 l gccgcgcggg gccatttccg ctgctgcttc tgtgagtttt tccggtgcac gcgagtgctt 6" ctgaaacgtc gctc ccctaggagt gctgagcccg cggaaccgca gccatgactg 12; aggctgatgt gaatccaaag gcctatcccc ttgccgatgc ccacctcacc aagaagctac 18L tcgt tcagcagtca tgtaactata agcagcttcg gaaaggagcc aatgaggcca 24L ccaaaaccct caacaggggc atctctgagt tcatcgtgat ggctgcagac gccgagccac ; tcat cctg ccgctgctgt gtgaagacaa gaatgtgccc tacgtgtttg 36L tgcgctccaa cctg gggagagcct gtggggtctc caggcctgtc atcgcctgtt 42; ctgtcaccat caaagaaggc tcgcagctga aacagcagat ccaatccatt cagcagtcca 48; ttgaaaggct cttagtctaa acctgtggcc tctgccacgt gctccctgcc agcttccccc 54; ctgaggttgt gtatcatatt atctgtgtta gcatgtagta ttttcagcta ctctctattg 60; ttataaaatg tagtactaaa ttct ggatttttgt gttgtttttg ttctgtttta 66; cagggttgct atcccccttc ctttcctccc tccctctgcc atccttcatc cttttatcct 72; ccctttttgg aacaagtgtt cagagcagac agaagcaggg tggtggcacc gttgaaaggc 78; agaaagagcc aggagaaagc tgatggagcc aggacagaga tctggttcca gctttcagcc 84; ttcc tgttgtgtgc ggggtgtggt ggaattaaac agcattcatt gtgtgtccct 90; gtgcctggca cacagaatca ttcatacgtg ttcaagtgat gttt catttgctct 96; atta ggtatcattt ggggaggaag catgtgttct gtgaggttgt atgt L02; ccaagtgtcg tttactaatg tacccctgct gtttgctttt ggtaatgtga tgttgatgtt L08; ctccccctac ccacaaccat gcccttgagg gtagcagggc agcagcatac caaagagatg L14; tgctgcagga ctccggaggc agcctgggtg ggtgagccat ggggcagttg acctgggtct L20; tgaaagagtc gggagtgaca agctcagaga gcatgaactg atgctggcat gaaggattcc L26; aggaagatca tggagacctg gctggtagct gaga agtc caaggaaaca L32; gcctgtctct ggtgaatggg actttctttg gtggacactt ggcaccagct L38; ttcccctgtg tcctgccacc atgtgggtca gatgtactct ctgtcacatg aggagagtgc L44; tagttcatgt gttctccatt cttgtgagca tcctaataaa tctgttccat tttgatgaca L50; aaaaaaaaaa aaaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_001003796.1 LOCUS: NP_001003796 ACCESSION: NP_001003796 1 mteadvnpka pradahltk klldlqusc nykqlrkgan eatktlnrgi sefivmaada 61 epleiilhlp Llcedknvpy varskqalg racgvsrpvi acsvtikegs qlquiqsiq 121 qsierllv OLA1 Official : OLA1 Official Name: Obg-like ATPase 1 m: 29789 Organism: Homo s Other Aliases: PTD004, DOC45, GBP45, GTBP9, GTPBP9 Other Designations: DNA damage-regulated overexpressed in cancer 45 protein; GTP-binding protein 9 (putative); GTP-binding protein PTD004; homologous yeast-44.2 protein; obg-like ATPase 1 Nucleotide seguence: NCBI Reference ce: NM_001011708.1 LOCUS: NM_001011708 ACCESSION : NM_001011708 l cgcg caggtgccgc tcggcgcccg gcccgcccgt tccgccgctg tcgccgccgt 61 cgtgcgtgcc cgga ggggacgggc ctgcgttctc tcctccttcc tccccgcctc 121 ccgg caggaccttt ctctcgctgc cgctgggacc ccgtgtcatc gcccaggccg 181 agcacggaaa tctactttct tcaatgtgtt aaccaatagt caggcttcag actt 241 cccgttctgc actattgatc ctaatgagag cagagtacct gtgccagatg aaaggtttga ; ctttctttgt caataccaca aaccagcaag caaaattcct gcctttctaa atgtggtgga 36; tattgctggc cttgtgaaag gagctcacaa tgggcagggc ctggggaatg tatc 42; tcatattagt gcctgtgatg gcatctttca tctaacacgt gcttttgaag atgatgatat 48; cacgcacgtt gaaggaagtg tagatcctat tcgagatata gaaataatac atgaagagct 54; tcagcttaaa gatgaggaaa tgattgggcc cattatagat aaactagaaa aggtggctgt 60; gagaggagga gataaaaaac ctga atatgatata atgtgcaaag taaaatcctg 66L ggttatagat caaaagaaac ctgttcgctt ctatcatgat tggaatgaca aagagattga 72; gaat aaacacttat ttttgacttc aaaaccaatg gtctacttgg ttaatctttc 78L tgaaaaagac agaa agaaaaacaa atggttgata aaaattaaag agtgggtgga 84L caagtatgac ccaggtgctt tggtcattcc ttttagtggg gccttggaac tcaagttgca 90; agaattgagt gctgaggaga gacagaagta tctggaagcg aacatgacac aaagtgcttt 96; gccaaagatc attaaggctg ggtttgcagc actccaacta gaatactttt tcactgcagg L02; cccagatgaa gtgcgtgcat ggaccatcag gaaagggact cctc aaagattcac acagattttg aaaagggatt cattatggct gaagtaatga ttttaaagag gaaggttctg aaaatgcagt caaggctgct ggaaagtaca gacaacaagg L20; cagaaattat gaag atggagatat tatcttcttc aaatttaaca cacctcaaca L26; accgaagaag aaataaaatt tagttattgc aaac atacaacttc caaaaggcat L32; ctgattttta aaaaattaaa atttctgaaa accaatgcga caaataaagt tggggagatg L38; ggaatctttg acaaacaaat tatt tgttttaaaa ttaaaatact gtgtaccccc L44; ccat gaaatgcagg ttcactaaat gtgaacagct ttgcttttca cgtgattaag L50; accctactcc aaattgtaga agcttttcag gaaccatatt actctcatga atta L56; atctccatca tgtatgccaa gcctgacaca tttgacagtg aggacaatgt ggcttgctcc L62; tttttgaatc tacagataat gcatgtttta cagtactcca gatgtctaca ctcaataaaa L68; catttgacaa aaccagcctt ggtgtgtttg gggatgtctg ctga ctgtggtgtg L74; ctgaatgcga acct tgct gattacagaa ggcg acagtaactg gcagtgtggg gcagctgcaa ttgtatctta aaagtgagtc tttcatggga L86; atcagaagaa tagtatacca gggatttgtc tcaaaaaagt taattaattt atagatcaac L92; atttattgaa agatgatagc aatgatatta tgagggatat gagataggtg aagaagaaaa ttctgtctca aaaattaaag gttt gttattgttg ttctgaccat 204; attgaaaaga ggttcacttt taatctttcc tttgaaatta ttaaattgta aaaactgacc 210; cattgatgtc tggtgggtta tgttttgctt caattcagca ataa aagttcctac 216; actgattttg aaatactaag cttt gaaaatttta cttt cattcttact 222; tttcagatat ttgagagcag tggt agtatgggta agttaagcac ttcttaacat 228; tgtagcagaa aattccaaaa gaaagactag aacaaattgg taatttggag accctttgcc 234; acttagcctt ctcttgtgat gatggcctga aagctctctc ggccctgagc ctccctttct 240; ctcccatgtt tccattcctg tgagacctcc cttccttgtc cttgtcctct gcccttttgt 246; gctcttctgt agga ccatttccat ggaatgcaga ttcataccca gccagccttt 252; gcctctactg tatcgctttg ggactttaaa gatgcagaag tattagaaca cacacaaact 258; taaaatgqaa caaa tgtagcacta gaggcaagaa aaggggtgat tttttaaaga 264; tttggctata cttaagatat ttaaagagga tgaggttgag tcgtgagtat aatttagaag 270; ctctccagtg agtatttttt tagtatcaaa tagtgatctt cttttgctaa cataaaaatg 276; agtaaactac agcagagaaa taagaataaa actaatgaag ggttttttaa taaa 282; taaaacaaat aggc tacaggtata gctgaccaag tttgctgtga aaaatccttg 288; ttatatctat ttctgtattg gagtaatgtg tagattagtg gattgttagg gatttcagca 294; gaaa gaatcaaata tcttgctttt gggtcttctc caaagtaaat tgcatggact 300; tcttaaagaa gtttgaaatc agtaaaaaga gatc atgcaatttt actttgtcta 306; gcttctaatg tggtaaattg cacattgtgc aacagaacca ttaataaaag aaaggagtta 312; gttttgaggt aactgcattc attaggggct aaat gttaaaaatg tttaatttgg 318; ttagaagttc tacaacattg tatgtacata tacaaaattt cattttctct gaaagtcgag 324; aattccaaga actgttacat ataatcatat atatgcctat atctaaacgt tttccctcct 330; ttga taaagttaat atattttagg atgttatgta aatatttttg catgtatctg 336; tagctccatc ttaatcaaat ttatggttaa aagagatttg aggccaggcg tggaggctca 342; cacctataat cccaacactt tgggaggccg aggtgggagg atcacttgag cacaggagtt 348; tgagaccagc ctgtgcaaca tcatgaaaca tcatctctac ataaaaaaat acatatatac 354; aaaaattagc tcat ggcacacacc ccca gctacttagg aggctgagat 3601 gggaggattg atcgatcgct tgagcctggg aggtggaggc tactgtgagc catg 366; ccactgcact ccagcctggg caacagagtg agaccctgtc tccaaagaga aact 372; tgagaaggct tgtgcccaaa gcttgaagaa agattgtaat agcatttagt tgtgttttat 378; catgttatct tcataacatg cattttgcat caga gcagagccgg atctcccaag 384; gaagcacaaa tagtttttgt cgctaactta gttatgagtg aagcctctgt tcacttataa 3901 cttgccagtt tcattggtgg aataagtccc cttactcatg attcatcaat atat 3961 taaaattcca tgtt tgtctctagt tgtctgtgta taatatttcc aaagctcctt 4021 tgtaaacccc gtctcctttc attcagagag acgg tatctcctcc tcctgcctct 4081 tctattgttc tgattagtct acttaaattc acttgatgct ctttttattt tacaagagac 4141 ttgagaaatg tcggttgttc ctaatggcaa actactttag ctgggaaaat tcatttcaac 4201 ttattttaag cttgtaaaat acactgtggt gataaaataa aagagctggg tttga Protein seguence: NCBI Reference Seguence: NP_001011708.1 LOCUS: NP_001011708 ION: NP_001011708 1 migpiidkle kvavrggdkk lkpeydimck vkswviquk pvrfyhdwnd keievlnkhl 61 fltskpmvyl vnlsekdyir kknkwlikik ewvdkydpga 1vipfsga1e 1k1qelsaee 121 quyleanmt qsalpkiika gfaalqleyf ftagpdevra wtirkgtkap qaagkihtdf 181 aevm kyedfkeegs enavkaagky rqqgrnyive dgdiiffkfn POFUT1 Official Symbol: POFUT1 Official Name: protein O-fucosyltransferase 1 Gene ID: 23509 sm: Homo sapiens Other Aliases: FUT12, O-FUT, O-Fuc-T, O-FucT-1 Other Designations: GDP-fucose protein O-fucosyltransferase 1;o- fucosyltransferase protein; peptide-O-fucosyltransferase 1 Nucleotide seguence: NCBI Reference Seguence: NM_015352.1 LOCUS: NM_015352 ACCESSION : NM_015352 l cttccctccc Cgactgtgcg ccgcggctgg ctcgggttcc cgggccgaca tgggcgccgc 6; cgcgtgggca cggccgctga gcgtgtcttt cctgctgctg cttctgccgc tcccggggat 12L gcctgcgggc tcctgggacc cggccggtta cctgctctac tgcccctgca tggggcgctt l8; tgggaaccag gccgatcact tcttgggctc attt gcaaagctgc taaaccgtac 24L cttggctgtc tgga ttgagtacca gcatcacaag cctcctttca ccaacctcca ; tgtgtcctac cagaagtact tcaagctgga gcccctccag gcttaccatc tcag 36; cttggaggat ttcatggaga agctggcacc cacccactgg ccccctgaga agcgggtggc 42; atactgcttt gaggtggcag cccagcgaag cccagataag aagacgtgcc ccatgaagga 48; aggaaacccc tttggcccat tctgggatca gtttcatgtg agtttcaaca agtcggagct 54; ttttacaggc atttccttca gtgcttccta cagagaacaa tggagccaga ctcc 60; aaaggaacat ccggtgcttg ccctgccagg agccccagcc cagttccccg tcctagagga 66; acacaggcca ctacagaagt acatggtatg gtcagacgaa aaga cgggagaggc 72; ccagattcat gcccaccttg tccggcccta tgtgggcatt catctgcgca ttggctctga 78; ctggaagaac gcctgtgcca agga cgggactgca ggctcgcact tcatggcctc 84; tccgcagtgt gtgggctaca gccgcagcac agcggccccc ctcacgatga gcct 90; gcctgacctg aaggagatcc agagggctgt gaagctctgg gtgaggtcgc tggatgccca 96; gtcggtctac gttgctactg attccgagag ttatgtgcct gagctccaac agctcttcaa L02; agggaaggtg aaggtggtga agcc tgaggtggcc gacc tgtacatcct L08; cggccaagcc ttta ttggcaactg tgtctcctcc ttcactgcct ttgtgaagcg L14; ggagcgggac ggga ggccgtcttc tttcttcggc atggacaggc cccctaagct L20; gcgggacgag ttctgattct ggccggagca ccagaccctc tgatcctgga gggaccagag L26: tctgagctgg tccttccagc caggcctggc aggt ggat tcttctcacc tgccaaagat ggagaagagt gccagggacc cctcaaggag ggagacgctc L38; catatcccag ggcataggac ttgcaggttc ctaggagcag gagcatctcc L44; gctttctgct cttctgggaa acac tggcaaagca gtccagcctc ctgg L50; tccactctgc tctgagcagc ctgggatgct gaactcttca gagagatttt tttatagaga L56; gatttctata attttgatac aaggtcatga ctatcctaga actctctgtg gtttttgaaa L62; atcattgaat tctattaatg taggtaccta aagtgacctt aactgaatgt ggatgaggct L68; ggggctggtg tgggtctttt ggctgctttt caaggtgtcc cccaatgtgg ccctcaagag L74; ccatccccac tgcctggcca gagccattgt tgtcccctac ttcctaggcc atttctgggg L80; cttgggggat gaatgctgtc ctgtgctgta aacactatgc aaatggaagt tatcggttgt L86; ggtgctgtgc agcgctctgt gggcgactaa gtgccactca cgcagcatgt tcctggcaag L92; gagcacatac catcaagcca cactatcatg gtattgttct cacagtcttt L98; tggccactgc aaacctggca ccatcagatc tcttctgatc tcttgcccca cctg 204; gttggtagaa catt cggttgatat ccaaagcctg ttctcccagc cgtcctcctg 210; cagctggagc cttcaggccg cacg agggaacgtt tgccaaggct ctgacctcac 216; agaagatgcc cagggcccag aagccatcag aattatcagt ggagaagcac cttttgactc 222; ttcccttcca atgtaatctc cacc ctta aggtgctcta agtcatgagt 228; gttttggtct caaatgctgc agttttaata atctgtgact cctgagagcc catggttttt 234; tgtg gttctaaaat tccttgtctg gtag tcct tgccatgtca 240; cctcccttgg cctttgatcc tggaaaggtg gcagagcctc cactgagcca ggcccagagc 246; cagt gccttcttcc ttgtttacct ggaa acactttttt tgtcaggggc 252; agcctggttc agagctcaga ggtcacactg tatcaaagat ctcaaacagc aaagtcagca 258; tttgctgtat gcca cccaactcta agaa actgtacaga aagggctttg 264; ctatttttcc cttttgggaa aacaatgaag tgttttaagt cctgggtgga ctgagagatg 270; gtttgcctgt ccagacttgc tctcaagcct catccagaga aggagctgca gatgagggag 276; cccgtacact ccctgccacc ttgt aagcctgtag ctggctggct gatttcattt 282; tggaattcat ttgccatcca cagccttaca ctaggcacac actttagagt ctcc 288; agtggggccc gcctaatttt ttttcccccc aagacagggc cttgctctgt ctcccaggct 294; ggagtgcagt ggcatgatca tggcttactg cagccttgat ctcccaggct caagcgatcc 300; ctca gcctctctgg agac tgcatgccca gctccaaatc accttgattc 3061 atatcagcag taataatcac ttgtgttctg aaagaaaggg caccagaagt tctagcaaaa 312; ttcagttgtg ttctgtgagc tagcactttt tcctctgacc caattttctt acctataaaa 318; tggtgataaa cagg ttgttcaaag gcccagatca gctaaagcat gtatataaga 324; gcacgttgta aacttgaaag agacaaaggc acaaatgtgg ctgttgatta atttgactgc 330; ttctcgttgc tcgtcacctc catgccaggc actgtgcttg ctaattgctt tatgggggca 336; ttctcttatt tattccccag ccctgggaaa ctgt cattatcctt ctctttctgc 342; acaaggaaaa attaatgccc tgagaattgt cataattttc ccaaggctgc ccagctggtg 348; gtgttaagcc agaatttgac ctcccagagc cagtttccat tagctgccat gctctgctgc 354; ctctaattca cagaatgcac tttctaccct gtgtgccatg gagacctcct atggaaaaat 360; gatcagccac cttaccttct actgggtacc tgctgtgagt ctgcctatgc cagaaggatt 366; aaggagggga ggttacccaa gaaacaaagc gccg cttacagccc ccgttggatg 372; gttgctcagt acaacagtct tgcattcagc aggtgtttgt tcatcaccta ctatgtgtca 378; ggctctatgc taggtactgg agga gagaatcaag gtct ttgttctcaa 384; ggaatttgca ttctagaaag tagaagatgt aataaatgta ctgtgggaca tgttaataag 390; tgctataaag aaatataaag ggtttgggag caaaaagagg gagtggatct attttagatg 396; agcccaggta agacctctct gaagagctgt catgaaggag ggagggagca cattcctggc 402; agagaaaaca gcaa cgag actggagtgt gttcctgaag agcagccagg 408; aggccagcat ggctggagag gcaggcatag gcagggaacc cagg tcagagcagg 414; cgagctgaca ttctgcagcc tggacggcca tggcaggaag cttttagttg gagagataca 420; ggaagcctcc tagggttctg agcagaagag gggcatgagc tgattcacat tctgaaggac 426; ctctctagct ggccagtgct gttg gagagagaaa gggtgaaagc agagagacca 432; gtgcagggct gttaacaggg ttgcaggcga gagactgggg tgctgggctc ccctagacta 438; ggactccagt gccctcctct cccaagagac aaaggccatt gcattgaagg aggtgggaaa 444; tgattagatt ctgaacatat gtaattattt ttcagtcttt ttcaaagata caaatattta 450; catagtttta atcatgtaat aatt ccta gtgttttact tgta 456; tcatgttttc cctgttggta tgtagcctgg ataaatgctc ataa aaaattctgt 462; cgaggagtgt tccatagttt attgttttcc tattatgaga atttaggcca agtgtggtgg 468; ctcatgcctg taatcccagc actttgcgag gccgaggtgg gcagatcact tgaggtgagg 474; agttcaagac cagcctggcc aacatggtga tcta ctaaaaatac aaaaaaataa 480; taataatagc caggcgtggt ggcacatgcc tgtattccca gctgcttggg aggctgaggc 4861 aggagaatgg cttgaacctg ggaggtggag gttgcagtga gccgagatgg tgccactgca 492; ttccagcctg ggcaacagag cgagactcca aaaa aaggagactt catgtgcccc 498; caatttttca ctattgttat ttgaaaaaat atttttattt gtaagagttt ttctttattt 504; ttca ttaataaagt acgg gaagcaaaaa aaaaaagttg tttaagataa 510; attcccagaa gtgaatttgt tagatcaaac acttaaaact ttttgttatg gaagaattca 5161 aatataaata aaaaattgtg agtaataaaa tgaactcaca gtttcaacaa tgacccacaa 5221 aaaa aaaaaaaaaa aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_056167.1 LOCUS: NP_056167 ACCESSION: 167 l mgaaawarpl svsflllllp lpgmpagswd pagyllycpc mgrfgnqadh flgslafakl 61 lnrtlavppw iethhkppf tnlhvsquy fkleplqayh rvisledfme klapthwppe 121 krvaycfeva aqrspdkktc pmkegnpfgp fwdqfhvsfn kselftgisf sasyreqwsq 181 hpvl aqu plqk ymvwsdemvk tgeaqihahl vrpyvgihlr 241 igsdwknaca mlkdgtagsh vgy srstaapltm lkei qravklwvrs 301 1daqsvyvat dsesyvpelq qlfkgkvkvv slkpevaqu lyilgqadhf igncvssfta 361 karerdlqg rpssffgmdr ppklrdef PRKDC Official Symbol: PRKDC Official Name: protein kinase, DNA-activated, catalytic polypeptide m:5591 Organism: Homo sapiens Other Aliases: DNA-PKcs, DNAPK, DNPK1, HYRC, HYRC1, XRCC7, p350 Other Designations: DNA-PK catalytic subunit; DNA-dependent protein kinase catalytic subunit; hyper-radiosensitivity of murine scid mutation, complementing 1; p460 Nucleotide seguence: NCBI Reference Seguence: 081640.1 LOCUS: NM_001081640 ACCESSION : NM_001081640 l ggggcatttc cggg ccgagcgggc gcacgcgcgg gact cggcggcatg 6; gcgggctccg gagccggtgt gcgttgctcc ctgctgcggc tgcaggagac cgct 12; gcggaccgct gcggtgctgc cctggccggt catcaactga tccgcggcct ggggcaggaa 18; tgcgtcctga gcagcagccc cgcggtgctg gcattacaga catctttagt tttttccaga 24; gatttcggtt tgcttgtatt gaag aaca gtattgaatt tcgtgaatgt 30L agagaagaaa tcctaaagtt tttatgtatt ttcttagaaa gcca gaagatcgca 36L ccttactctg ttgaaattaa gaacacttgt accagtgttt atacaaaaga tagagctgct 42; aaatgtaaaa ttccagccct ggaccttctt attaagttac cttt ttct 48L agactcatgg atgaatttaa aattggagaa agta aattctatgg agaacttgca 54; ttgaaaaaaa aaataccaga tacagtttta gaaaaagtat atgagctcct aggattattg 60; ggtgaagttc atcctagtga gatgataaat aatgcagaaa tccg cgcttttctg 66; ggtgaactta agacccagat gacatcagca gtaagagagc ccaaactacc tgttctggca 72; ggatgtctga aggggttgtc tctg tgcaacttca ctaagtccat ggaagaagat 78; ccccagactt caagggagat ttttaatttt gtactaaagg caattcgtcc tcagattgat 84; ctgaagagat atgctgtgcc ctcagctggc ttgcgcctat ttgccctgca tgcatctcag 90; tttagcacct gccttctgga caactacgtg tctctatttg aagtcttgtt aaagtggtgt 96; gcccacacaa atgtagaatt gaaaaaagct gcactttcag ccctggaatc ctttctgaaa L02; caggtttcta atatggtggc gaaaaatgca gaaatgcata aaaataaact gcagtacttt L08; atggagcagt tttatggaat catcagaaat gtggattcga acaacaagga gttatctatt L14; cgtg gatatggact ttttgcagga ccgtgcaagg ttataaacgc aaaagatgtt L20; gacttcatgt acgttgagct cattcagcgc tgcaagcaga tgttcctcac ccagacagac L26; actggtgacg accgtgttta tcagatgcca agcttcctcc agtctgttgc aagcgtcttg L32; ctgtaccttg acacagttcc tgaggtgtat actccagttc tggagcacct cgtggtgatg L38: cagatagaca gtttcccaca gtacagtcca aaaatgcagc tggtgtgttg cagagccata L441 gtgaaggtgt tcctagcttt aaaa gggccagttc tcaggaattg 150; gtggtgcatc agggtttaat cagaatatgt tctaaaccag tggtccttcc aaagggccct 156; gagtctgaat ctgaagacca ttca ggggaagtca gaactggcaa L62; cccacataca aagactacgt cttc agacatctcc tgagctctga ccagatgatg L68; gattctattt tagcagatga agcatttttc tctgtgaatt cctccagtga aagtctgaat 174; catttacttt atgatgaatt tgtaaaatcc aaga ttgttgagaa attggatctt 180; acacttgaaa tacagactgt tggggaacaa gagaatggag atgaggcgcc tggtgtttgg 186; atgatcccaa atcc agcggctaac ttgcatccag ctaa agatttttcg 192; atta acctggtgga attttgcaga gagattctcc ctgagaaaca agcagaattt 198; tttgaaccat gggtgtactc attttcatat gaattaattt tgcaatctac aaggttgccc 204; ctcatcagtg gtttctacaa attgctttct attacagtaa gaaatgccaa gaaaataaaa 210; tatttcgagg gagttagtcc aaagagtctg aaacactctc ctgaagaccc agaaaagtat 216; tcttgctttg ctttatttgt gaaatttggc aaagaggtgg cagttaaaat gaagcagtac 222; aaagatgaac ttttggcctc ttgtttgacc tttcttctgt ccttgccaca caacatcatt 228; gaactcgatg ttagagccta cgttcctgca ctgcagatgg ctttcaaact gggcctgagc 234; tataccccct tggcagaagt aggcctgaat gctctagaag aatggtcaat ttatattgac 240; agacatgtaa tgcagcctta ttacaaagac attctcccct atgg atacctgaag 246; acttcagcct tgtcagatga gaccaagaat aactgggaag tgtcagctct ttctcgggct 252; gcccagaaag gatttaataa agtggtgtta aagcatctga agaagacaaa gaacctttca 258; tcaaacgaag caatatcctt agaagaaata agaattagag tagtacaaat gcttggatct 264; ctaggaggac aaataaacaa aaatcttctg acagtcacgt cctcagatga gaag 270; agctatgtgg cctgggacag agagaagcgg ctgagctttg cagtgccctt tagagagatg 276; aaacctgtca ttttcctgga tgtgttcctg cctcgagtca tagc gctcacagcc 282; agtgacagac aaactaaagt tgcagcctgt gaacttttac tggt tatgtttatg 288; ttgggcaaag ccacgcagat gccagaaggg ggacagggag ccccacccat gtaccagctc 294; cgga cgtttcctgt tcga cttgcgtgtg atgttgatca ggtgacaagg 300; caactgtatg agccactagt tatgcagctg attcactggt tcactaacaa caagaaattt 306; gaaagtcagg atactgttgc cttactagaa gctatattgg atggaattgt ggaccctgtt 312; actt taagagattt ttgtggtcgg tgtattcgag aattccttaa atggtccatt 318; aagcaaataa caccacagca gaag gtaa aatc gcttttcaag 324; cgactttata gccttgcgct caat aaga ggctgggagc atcacttgcc 330; tttaataata tctacaggga attcagggaa gaagagtctc tggtggaaca gtttgtgttt 336; gaagccttgg tgatatacat ggagagtctg gccttagcac atgcagatga gaagtcctta 342; ggtacaattc aacagtgttg tgatgccatt gatcacctat tcat tgaaaagaag 348; catgtttctt taaataaagc aaagaaacga cgtttgccgc gaggatttcc accttccgca 354; tgtt tattggatct ggtcaagtgg cttttagctc attgtgggag gccccagaca 360; gaatgtcgac acaaatccat cttt ttcg ttcctttatt gccaggcaac 366; agatccccta atttgtggct gaaagatgtt ctcaaggaag aaggtgtctc ttttctcatc 372; aacacctttg gtgg ctgtggccag ccctcgggca tcctggccca gcccaccctc 378; ttgtaccttc gggggccatt cagcctgcag gccacgctat gctggctgga cctgctcctg 384; gccgcgttgg agtgctacaa cacgttcatt ggcgagagaa ctgtaggagc gctccaggtc 390; ctaggtactg aagcccagtc ttcacttttg aaagcagtgg ctttcttctt agaaagcatt 396; gccatgcatg acattatagc aaag tgctttggca ctggggcagc caga 402; acaagcccac aagagggaga aaggtacaac tacagcaaat gcaccgttgt ggtccggatt 408; atggagttta ccacgactct gctaaacacc tccccggaag gatggaagct cctgaagaag 414; gacttgtgta atacacacct gatgagagtc ctggtgcaga cgctgtgtga gcccgcaagc 420; ataggtttca gaga cgtccaggtt atggctcatc ttcctgatgt gaat 426; ctgatgaaag ctctaaagat gtccccatac atcc tagagaccca tctgagagag 432; aaaataacag cacagagcat tgaggagctt tgtgccgtca acttgtatgg ccctgacgcg 438; caagtggaca ggct ggctgctgtt gtgtctgcct gtaaacagct tcacagagct 444; gggcttctgc ataatatatt accgtctcag tccacagatt tgcatcattc tgttggcaca 450; gaacttcttt ccctggttta taaaggcatt gcccctggag atgagagaca gtgtctgcct 456; tctctagacc tcagttgtaa gcagctggcc agcggacttc tagc ctttgctttt 462; ggaggactgt gtgagcgcct tgtgagtctt ctcctgaacc cagcggtgct gtccacggcg 468; tccttgggca gctcacaggg cagcgtcatc cacttctccc atggggagta tttctatagc 474; ttgttctcag aaacgatcaa cacggaatta ttgaaaaatc tggatcttgc tgtattggag 480; ctcatgcagt cttcagtgga taataccaaa atggtgagtg tgaa cggcatgtta 486; gaccagagct tcagggagcg agcaaaccag aaacaccaag gactgaaact tgcgactaca 492; attctgcaac actggaagaa gtgtgattca tggtgggcca cccc tctcgaaact 498; aaaatggcag tgctggcctt actggcaaaa attttacaga ttgattcatc tgtatctttt 5041 aatacaagtc atggttcatt ccctgaagtc tttacaacat atattagtct acttgctgac 510; acaaagctgg attt aaagggccaa gctgtcactc catt cttcaccagc 516; ctcactgqag gcagtctgga ggaacttaga cgtgttctgg agcagctcat cgttgctcac 522; ttccccatgc agtccaggga atttcctcca ggaactccgc ggttcaataa ggac 528; aaaa agtttctaga tgcattggaa ttatctcaaa gccctatgtt gttggaattg 534; atgacagaag gtcg ggaacagcag atgg aagaattatt tcaatccagt 540; ttcaggagga gaag gggttcatgt caag taggccttct ggaaagcgtg 546; tatgaaatgt tcaggaagga tgacccccgc ttca cacgccagtc ggac 552; cgctccctcc tcactctgct gtggcactgt agcctggatg ctttgagaga attcttcagc 558; acaattgtgg tggatgccat tgatgtgttg aagtccaggt ttacaaagct aaatgaatct 564; acctttgata ctcaaatcac caagaagatg ggctactata agattctaga gtat 570; cttc ccaaagatga tgttcatgct aaggaatcaa aaattaatca agttttccat 576; ggctcgtgta ttacagaagg aaatgaactt acaaagacat tgattaaatt gtgctacgat 582; gcatttacag agaacatggc aggagagaat cagctgctgg agaggagaag actttaccat 588; tgtgcagcat acaactgcgc catatctgtc tgtg tcttcaatga gttaaaattt 594; taccaaggtt ttctgtttag tgaaaaacca gaaaagaact tgcttatttt tgaaaatctg 600; atcgacctga agcgccgcta taattttcct gtagaagttg aggttcctat ggaaagaaag 606; aaaaagtaca ttgaaattag gaaagaagcc agagaagcag caaatgggga ttcagatggt 612; ccttcctata tgtcttccct gtcatatttg gcagacagta ccctgagtga ggaaatgagt 618; caatttgatt tctcaaccgg agttcagagc tattcataca gctcccaaga ccctagacct 624; ggtc gttttcggag acgggagcag cgggacccca cggtgcatga tgatgtgctg 630; gagctggaga tggacgagct caatcggcat gagtgcatgg tgac ggccctggtc 636; aagcacatgc acagaagcct gggcccgcct caaggagaag aggattcagt gccaagagat 642; cttccttctt ggatgaaatt tggc aaactgggaa atccaatagt accattaaat 648; atccgtctct tcttagccaa gcttgttatt aatacagaag aggtctttcg cgcg 654; tggc ttagcccctt gctgcagctg gctgcttctg aaaacaatgg aggagaagga 660; attcactaca tggtggttga gatagtggcc actattcttt catggacagg cttggccact 666; ccaacagggg tccctaaaga tgaagtgtta gcaaatcgat tgcttaattt cctaatgaaa 672; catgtctttc atccaaaaag agctgtgttt agacacaacc ttgaaattat aaagaccctt 678; gtcgagtgct ggaaggattg tttatccatc ccttataggt taatatttga aaagttttcc 6841 ggtaaagatc ctaa agacaactca attc aattgctagg catcgtgatg 690; gccaatgacc tgcctcccta tgacccacag tgtggcatcc agagtagcga atacttccag 696; gctttggtga tgtc ctttgtaaga tataaagaag tgtatgccgc tgcagcagaa 702; ggac ttatacttcg atatgttatg gagagaaaaa acatactgga ggagtctctg 708; tgtgaactgg ttgcgaaaca attgaagcaa catcagaata ctatggagga caagtttatt 714; gtgtgcttga acaaagtgac caagagcttc cctcctcttg cagacaggtt catgaatgct 720; gtgttctttc tgctgccaaa atttcatgga gtgttgaaaa cactctgtct ggaggtggta 726; ctttgtcgtg tggagggaat gacagagctg tacttccagt taaagagcaa ggacttcgtt 732; caagtcatga gacatagaga aaga caaaaagtat gtttggacat aatttataag 738; atgatgccaa agttaaaacc agtagaactc cgagaacttc tgaaccccgt tgtggaattc 744; gtttcccatc cttctacaac atgtagggaa caaatgtata atattctcat gtggattcat 750; gataattaca gagatccaga aagtgagaca gataatgact cccaggaaat gttg 756; gcaaaagatg tgctgattca aggattgatc aacc ctggacttca attaattatt 762; ttct ggagccatga aactaggtta ccttcaaata ccttggaccg gttgctggca 768; ctaaattcct tatattctcc taagatagaa tttt taagtttagc aacaaatttt 774; ctgctcgaaa tgaccagcat gagcccagat tatccaaacc ccatgttcga gcatcctctg 780; tcagaatgcg aatttcagga atataccatt gatt ggcgtttccg tgtt 786; ccga tgtttgtgga gacccaggcc ggca ctctccagac ccgtacccag 792; gaagggtccc tctcagctcg ctggccagtg gcagggcaga taagggccac gcag 798; catgacttca cactgacaca gactgcagat ggaagaagct catttgattg gctgaccggg 804; agcagcactg acccgctggt cgaccacacc agtccctcat ctgactcctt gctgtttgcc 810; cacaagagga gtgaaaggtt acagagagca cccttgaagt cagtggggcc tgattttggg 816; aaaaaaaggc tgggccttcc aggggacgag gtggataaca aagtgaaagg tgcggccggc 822; cggacggacc tactacgact gcgcagacgg tttatgaggg accaggagaa gctcagtttg 828; atgtatgcca gaaaaggcgt tgctgagcaa aaacgagaga aggaaatcaa gtta 834; aaaatgaagc aggatgccca ggtcgttctg tacagaagct accggcacgg agaccttcct 840; gacattcaga tcaagcacag cagcctcatc accccgttac aggccgtggc ccagagggac 846; ccaataattg caaaacagct ctttagcagc ttgttttctg gaattttgaa agagatggat 852; aaatttaaga cactgtctga caac atcactcaaa agttgcttca agacttcaat 858i cgttttctta cctt ctctttcttt ccaccctttg tctcttgtat tcaggacatt 8641 agctgtcagc acgcagccct gctgagcctc gacccagcgg ctgttagcgc tggttgcctg 870; ctac agcagcccgt gggcatccgc ctgctagagg aggctctgct ccgcctgctg 876; cctgctgagc tgcctgccaa ccgt gggaaggccc ctcc tgatgtcctc 882; gtgg agcttgctaa gctgtataga tcaattggag aatacgacgt tggg 888; atttttacca gtgagatagg aacaaagcaa atcactcaga gtgcattatt agcagaagcc 894; agaagtgatt aagc tgctaagcag tatgatgagg ctctcaataa acaagactgg 900; gtagatggtg agcccacaga agccgagaag gatttttggg aacttgcatc ccttgactgt 906; tacaaccacc ttgctgagtg gaaatcactt tgtt ctacagccag tatagacagt 912; gagaaccccc cagacctaaa taaaatctgg agtgaaccat agga aacatatcta 918; ccttacatga tccgcagcaa gctgaagctg ctgctccagg gagaggctga cctg 924; ctgacattta ttgacaaagc cggg gagctccaga aggcgattct agagcttcat 930; tacagtcaag agctgagtct gctttacctc ctgcaagatg atgttgacag agccaaatat 936; tacattcaaa ttca gagttttatg cagaattatt ctagtattga tgtcctctta 942; caccaaagta ccaa attgcagtct gtacaggctt aaat tcaggagttc 948; atcagcttta taagcaaaca aggcaattta tcatctcaag ttccccttaa gagacttctg 954; aacacctgga caaacagata tccagatgct aaaatggacc caatgaacat ctgggatgac 960; acaa atcgatgttt ctttctcagc gagg ttac ccctcttcca 966; gaagataata gtatgaatgt ggatcaagat ggagacccca gtgacaggat ggaagtgcaa 972; gagcaggaag aagatatcag ctccctgatc aggagttgca agttttccat gaaaatgaag 978; atgatagaca gtgcccggaa gcagaacaat ttctcacttg ctatgaaact actgaaggag 984; ctgcataaag agtcaaaaac cagagacgat tggctggtga gctgggtgca gagctactgc 990; cgcctgagcc actgccggag ccggtcccag ggctgctctg agcaggtgct cactgtgctg 996; aaaacagtct ctttgttgga tgagaacaac gtgtcaagct acttaagcaa aaatattctg L002; gctttccgtg accagaacat tctcttgggt acaacttaca ggatcatagc gaatgctctc L008; agcagtgagc gcct tgctgaaatc gaca aggctagaag aatcttagag L014; ctttctggat ccagttcaga ggattcagag aaggtgatcg cgggtctgta ccagagagca L020; ttccagcacc tctctgaggc tgtgcaggcg gctgaggagg aggcccagcc tccctcctgg L026; agctgtgggc ctgcagctgg ggtgattgat gcttacatga cgctggcaga tttctgtgac L032; caacagctgc gcaaggagga agagaatgca tcagttattg attctgcaga ttgtggtgga gaaaatgttg aaagctttaa aattaaattc agattgaagt ttcctagatt acttcagatt atagaacggt atccagagga gagc -050; ctcatgacaa aagagatctc ttccgttccc tgctggcagt tcatcagctg atggtggcct tactggacaa agaccaagcc gttgctgttc agcactctgt aact acccgcaggc tattgtttat cccttcatca taagcagcga aagctattcc L068; ttcaaggata cttctactgg gaat aaggagtttg tggcaaggat taaaagtaag L074; ttggatcaag gaggagtgat tcaagatttt attaatgcct tagatcagct ctctaatcct L080; gaactgctct ttaaggattg gagcaatgat gtaagagctg aactagcaaa aacccctgta L086; aataaaaaaa acattgaaaa aatgtatgaa agaatgtatg tggg tgacccaaag L092; gctccaggcc cctt gaag caga gaaa agaatttgat L098; aaacattttg ggaaaggagg ttctaaacta ctgagaatga agctcagtga cttcaacgac LL04; attaccaaca tgctactttt aaaaatgaac aaagactcaa agccccctgg gaatctgaaa LL10; gaatgttcac cctggatgag cgacttcaaa gtggagttcc tgagaaatga gctggagatt LL16; cccggtcagt atgacggtag gggaaagcca ttgccagagt accacgtgcg aatcgccggg LL22; tttgatgagc gggtgacagt catggcgtct ctgcgaaggc ccaagcgcat agagggaaca ccctttcctg gtgaagggtg gcgaggacct agcagctctt ccaggtcatg aatgggatcc tggcccaaga ctccgcctgc LL40; agccagaggg ccctgcagct gaggacctat agcgttgtgc ccatgacctc cagtgatccc LL46; agggcaccgc Cgtgtgaata taaagattgg ctgacaaaaa tgtcaggaaa acatgatgtt LL52; ggagcttaca tgta taagggcgct aatcgtactg aaacagtcac gtcttttaga LL58; aaacgagaaa gtaaagtgcc tgctgatctc ttaaagcggg tgag gatgagtaca LL64; agccctgagg ctttcctggc gctccgctcc cacttcgcca gctctcacgc tctgatatgc LL70; cact ggatcctcgg gattggagac agacatctga acaactttat ggtggccatg LL76; gagactggcg gcgtgatcgg gatcgacttt gggcatgcgt ttggatccgc tacacagttt LL82; ctgccagtcc ctgagttgat gccttttcgg ctaactcgcc tcaa tctgatgtta LL88; ccaatgaaag aaacgggcct tatgtacagc atcatggtac acgcactccg ggccttccgc LL94; tcagaccctg gcctgctcac catg gatgtgtttg tcaaggagcc ctcctttgat L200; aatt ttgaacagaa aatgctgaaa aaaggagggt catggattca agaaataaat L206; gttgctgaaa aaaattggta cccccgacag aaaatatgtt acgctaagag aaagttagca L212; ggtgccaatc cagcagtcat tgat gagctactcc tgggtcatga actatgtggc tgtggcacga aaag atcacaacat gtgggctttc agaagagact caagtgaagt gcctgatgga tccttggcag aacctgggaa ggatgggagc cctggatgtg atagaaagca ttacattgtt taaagaatct cttt ttcc tgaaacacta aagagaaatg tcttttgtgc ttaaatcaag attatgatga gtaaatgtgt atgggttaaa tcaaagataa ;254; ggttatagta acatcaaaga ttaggtgagg tttatagaaa gatagatatc caggcttacc ;260; aaagtattaa gtcaagaata taatatgtga tcagctttca ttac aagtgctgca ;266; agttagtgaa acagctgtct ccgtaaatgg aggaaatgtg gggaagcctt ggaatgccct ;272; tctggttctg gcacattgga aagcacactc agaaggcttc aaga ttttgggaga ;278; gtaaagctaa gtatagttga tgtaacattg tagaagcagc ataggaacaa taagaacaat ;284; aggtaaagct ataattatgg cttatattta gaaatgactg catttgatat tttaggatat ;290; ttttctaggt cttt attc tcttctagtt ttgacatttt atgatagatt aaggaaacgt ttag gagggcaaaa attttggtca ccaatctaca actggaagat acataaaagt gctttgcatt gaatttggga ;308; taacttcaaa aatcccatgg ttgttgttag ggatagtact aagcatttca aataaaagaa attcctattt gaaatgaatt cctcatttgg aggaaaaaaa gcatgcattc ;320; tagcacaaca agatgaaatt atggaataca aaagtggctc cttcccatgt gcagtccctg ;326; tccccccccg ccagtcctcc acacccaaac tgtttctgat tggcttttag ctttttgttg ;332; tttttttttt tccttctaac acttgtattt ggaggctctt ctgtgatttt gagaagtata ;338; ctcttgagtg tttaataaag tttttttcca aaagta Protein seguence: NCBI Reference Seguence: 075109.1 LOCUS: 075109 ACCESSION: NP_001075109 l magsgagvrc etls aadrcgaala ghqlirglgq spav lalqtslvfs 6; rdfgllvar kslnsiefre creeilkflc iflekmqui apysveiknt ctsvytkdra 12; akckipaldl likllqtfrs srlmdefkig elfskfygel pdtv lekvyellgl l8; lgevhpsemi nnaenlfraf lgelktqmts avrepklpvl agclkglssl lcnftksmee 24; dpqtsreifn fvlkairpqi dlkryavpsa glrlfalhas qfstclldny vslfevllkw ; cahtnvelkk aalsalesfl kqunmvakn aemhknquy fmequgiir nvdsnnke;s 36; iairgyglfa gpckvinakd vdfmyveliq tqt qum psflqsvasv 42; llyldtvpev ytpvlehlvv mqidsquys pkmqlvccra ivkvflalaa kgpvlrncis 48; liri cskpvvlpkg pesesedhra sgevrtgkwk vptykdyvdl frhllssdqm 54; mdsiladeaf fsvnsssesl eka svlkivekld ltleiqtvge qengdeapgv 60; wmiptsdpaa nlhpakpkdf safinlvefc reilpekqae ffepwvysfs yelilqstrl 66; ykl; sitvrnakki spks lkhspedpek yscfalkaf gkevavkmkq 72; ykdellasc; tfllslphni ieldvrayvp alqmafklgl sytplaevgl naleewsiyi 78; pyyk dilpcldgyl ktsalsdetk nnwevsalsr aaqufnkvv lkhlkktknl 84; ssneaislee irirvvqmlg nknl ltvtssdemm ksyvawdrek rlsfavpfre 90; mkpvifldvf lprvtelalt asdrqtkvaa cellhsmvmf mlgkatqmpe ggqgappmyq 96; lykrtfpvll rlacdvdqvt rqueplvmq lihwftnnkk festhvall rcireflkws ikqitpqqqe kspvntkslf krlyslalhp ;08; afnniyrefr eeeslveqfv fealviymes lalahadeks ccda idhlcriiek ;14; khvslnkakk rrlprgfpps aslclldlvk wllahcgrpq tecrhksiel fykfvpllpg ;20; nrspnlwlkd Vlkeegvsfl intfegggcg qpsgilaqpt llylrgpfsl qatlcwld;l ;26; laalecyntf galq Vlgteaqssl lkavaffles iamhdiiaae kcfgtgaagn ;32; rtquegery nyskctvvvr imeftttlln tspegwkllk kdlcnthlmr vlvqt;cepa ;38; sigfnigdvq vmahlpdvcv nlmkalkmsp ykdilethlr ekitaqsiee lcavn;ygpd ;44; aqursrlaa vvsackqlhr agllhnilps qstdlhhsvg tellslvykg iapgderqcl ;50; psldlsckql asgllelafa fgglcerlvs lllnpavlst aslgssqgsv ihfshgey’:y ;56; slfsetinte llknldlavl elmqssvdnt kmvsavlngm ldqsfreran klat ;62; kkcd swwakdsple tkmavlalla kilqidssvs fntshgsfpe vfttyisl;a ;68; dtkldlhlkg qavtllpfft sltggsleel liva hfpmqsrefp pgtpr:nnyv ;74; dcmkkfldal elsqspmlle lmtevlcreq qhvmeelqu sfrriarrgs cvtqvglles ;80; kddp rlsftrqsfv drslltllwh csldalreff stivvdaidv lksrftklne ;86; stfdtqitkk mgyykildvm ddvh akeskinqvf hgscitegne ltktliklcy ;92; daftenmage nqllerrrly hcaayncais viccvfnelk sek peknl;ifen ;98; rynf pvevevpmer kkkyieirke areaangdsd gpsymsslsy seem 204; sqfdfstqu sysyssqdpr rrre qrdptvhddv lelemdelnr hecmapltal 210; vkhmhrslgp pqgeedsvpr dlpswmkflh gklgnpivpl nirlflaklv inteevfrpy 216; akhwlspllq laasenngge gihymvveiv atilswtgla tptgvpkdev lanrllnf;m 222; khvfhpkrav frhnleiikt lvecwkdcls ipyrlifekf skdn svgiqllgiv 228; mandlppydp chiqsseyf qalvnnmsfv rykevyaaaa evlglilryv merknilees 234; kqlk qhqntmedkf vtks fppladrfmn avffllpkfh gvlktlclev 240; vlcrvegmte lyfqlkskdf qumrhrdde quvcldiiy kmmpklkpve lrellnpvve 246; fvshpsttcr eqmynilmwi hdnyrdpese tdndsqeifk lakdvliqgl idenpglq;i 252; irnfwshetr drll alnslyspki evhflslatn fllemtsmsp dypnpmfehp 258; lsecefqeyt idsdwrfrst vltpmfvetq asqgthtrt qegslsarwp vagqiratqq 264; qhdftltqta dgrssfdwlt gsstdplvdh tspssdsllf ahkrserlqr aplksvgpdf 270; gkkrlglpgd evdnkvkgaa grtdllrlrr rfmrdqekls gvae qkrekeikse 276; lkmkqdaqvv lyrsyrhgdl pdiqikhssl itplqavaqr dpiiakqlfs slfsgilkem 282; dkfktlsekn nitqkllqdf nrflnttfsf fppfvsciqd iscqhaalls sagc 288; lasqupvgi rlleeallrl lpaelpakrv rgkarlppdv lrwvelakly dv;r 294; igtk qitqsallae arsdyseaak qydealnkqd wvdgepteae kdfwelas;d 300; cynhlaewks leycstasid senppdlnki wsepfyqety lpymirsklk lllqgeadqs 306; lltfidkamh gelqkailel hysqelslly llqddvdrak yyiqngiqsf mqnyssidvl 312; lhqsrltqu qualteiqe fisfiskqgn lssqvplkrl lntwtnrypd akmdpmniwd 318; diitnrcffl skieekltpl pednsmnvdq dgdpsdrmev qeqeedissl irsckfsmkm 324; kmidsarkqn nfslamkllk elhkesktrd dwlvsqusy crlshcrsrs qgcseqvltv 330; lktvsllden nvssylskni lafrdqnill gttyriiana lssepaclae ieedkarril 336; elsgssseds ekviaglyqr afthseavq qpps wscgpaagvi daymtladfc 342; dqqlrkeeen asvidsaelq aypalvvekm nsne arlkfprllq iierypee:l 348; issv swis dkdq avavqhsvee itdnypqaiv ypfiissesy 354; sfkdtstghk nkefvariks kldqggviqd finaldqlsn pellfkdwsn dvraelaktp 360; vnkkniekmy lgdp kapglgafrr kfiqtfgkef dkhfgkggsk llrmklsdfn 366; ditnmlllkm nkdskppgnl kecspwmsdf kveflrnele ipgqydgrgk plpeyhvria 372; gfdervtvma slrrpkriii rghderehpf lvkggedqu lfqv mngilaqua 378; csqralqlrt ysvvpmtssd prappceykd wltkmsgkhd vgaymlmykg anrtetvtsf 384; rkreskvpad llkrafvrms tspeaflalr shfasshali cishwilgig drhlnnfmva 390; metggvigid fghafgsatq flpvpelmpf rltrqfinlm glmy simvhalraf 396; rsdpglltnt mdvakepsf dwknfeqkml kkggswiqei nvaeknwypr qkicyakrkl 402; aganpavitc heka pafrdyvava rgskdhnira qepesglsee mdqa 408; tdpnilgrtw egwepwm Official Symbol: PSMDS al Name: some (prosome, macropain) 268 subunit, non-ATPase, 6 m:9861 Organism: Homo sapiens Other Aliases: an7, S10, 3M, p44810 Other Designations: 268 some non-ATPase regulatory subunit 6; 268 proteasome regulatory subunit RPN7; 26S proteasome regulatory subunit S10; breast cancer-associated protein SGA-113M; p42A; phosphonoformate immuno-associated protein 4; proteasome regulatory particle t p44S10 Nucleotide seguence: NCBI nce Seguence: NM_014814.1 LOCUS: NM_014814 ACCESSION : NM_014814 l gtcagccgct gtccccttag ccgcgatgcc gctggagaac gagg agggtctgcc 6; caagaacccc gacttgcgta tcgcgcagct gcgcttcctg ctcagcctgc ccgagcaccg 12; cggagacgct gccgtgcgcg acgagctgat ggcggccgtc cgcgataaca acatggctcc 18; ttactatgaa gccttgtgca aatccctcga ctggcagata gacgtggacc tactcaataa 24; aatgaagaag gcaaatgaag atgagttgaa gcgtttggat gaggagctgg aagatgcaga ; gaagaatcta ggagagagcg aaattcgcga gatg gcaaaggccg tctg 36; ccggataggt gagg gagctctgac agcctttcgc aagacatatg acaaaactgt 42L ggccctgggt caccgattgg atattgtatt ctatctcctt aggattggct tattttatat 48L ggataatgat ctcatcacac gaaacacaga aaaggccaaa agcttaatag aagaaggagg 54L agactgggac aggagaaacc gcctaaaagt gtatcagggt ctttattgtg tggctattcg 60; tgatttcaaa caggcagctg aactcttcct tgacactgtt ttta catcctatga 66; actcatggat tataaaacat ttgtgactta tactgtctat gtcagtatga ttgccttaga 72; aagaccagat ctcagggaaa aggtcattaa aggagcagag attcttgaag tgttgcacag 78; tcttccagca gttcggcagt tttc actctatgaa tgccgttact tctt 84; ccaatcatta gcggttgtgg aacaggaaat ggac tggctttttg cccctcatta 90; tcgatactat gtaagagaaa tgagaattca tgcatacagt cagctgctgg aatcatatag 96; gtcattaacc cttggctata tggcagaagc gtttggtgtt ggtgtggaat tcattgatca 102; ggaactgtcc aggtttattg ctgccgggag ctgc aaaatagata aagtgaatga 108; aatagtagaa accaacagac ctgatagcaa gaactggcag taccaagaaa ctatcaagaa 114; aggagatctg ctactaaaca gagttcaaaa actttccaga aata tgtaaagcca 120; tgtaactaac aaaggatttg ctttagagat aattatttgg aatttttata gcttacttca 126; caatgtgccc aggtcagctg tataaaataa atactgcatt tc Protein seguence: NCBI Reference Seguence: 629.1 LOCUS: NP_055629 ACCESSION: NP_055629 1 mplenleeeg 1pknpdlria qlrfllslpe hrgdaavrde 1maavrdnnm apyyealcks 61 vd11 nkmkkanede 1kr1dee1ed aeknlgesei rdammakaey lcrigdkega 121 1tafrktydk tvalghrldi vfyllriglf ymdndlitrn tekaksliee ggdwdrrnrl 181 kquglycva irdfkqaael fldtvstfts yelmdyktfv tytvyvsmia rekv 241 ikgaeilevl hslpavrqyl fslyecrysv fqulavveq emkkdwlfap hyryyvremr 301 ihaysqlles yrsltlgyma eafgvgvefi dqelsrfiaa grlhckidkv neivetnrpd 361 skanyqeti kkgdlllnrv qklsrvinm ITGB1 al Symbol: ITGB1 Official Name: integrin,beta1 (tibronectin receptor, beta polypeptide, antigen CD29 includes MDF2, MSK12) [Homo sapiens] Gene ID: 3688 sm: Homo sapiens Other Aliases: RP11-479G22.2, CD29, FNRB, GPIIA, MDF2, MSK12, VLA- BETA, VLAB Other Designations: integrin VLA-4 beta subunit; integrin beta-1; very late activation protein, beta polypeptide Nucleotide seguence: NCBI Reference Seguence: NM_002211.3 LOCUS: NM_002211 ACCESSION : NM_002211 l atcagacgcg cagaggaggc ggggccgcgg ctggtttcct gccggggggc ggctctgggc 6; cgccgagtcc cctcctcccg cccctgagga ggaggagccg ccgccacccg ccgcgcccga 12; cacccgggag gccccgccag cccgcgggag aggcccagcg ggagtcgcgg aacagcaggc l8; ccgagcccac cgcgccgggc cgcc aaaa gatgaattta caaccaattt 24; tctggattgg actgatcagt tcagtttgct gtgtgtttgc tcaaacagat gaaaatagat ; aagc aaatgccaaa tcatgtggag aatgtataca agcagggcca aattgtgggt 36; ggtgcacaaa ttcaacattt ttacaggaag gaatgcctac acga tgtgatgatt 42; cctt aaaaaagaag ggttgccctc cagatgacat agaaaatccc agaggctcca 48; aagatataaa gaaaaataaa aatgtaacca accgtagcaa aggaacagca gagaagctca 54; agccagagga tattactcag atccaaccac agcagttggt tttgcgatta agatcagggg 60; agccacagac atttacatta aaattcaaga gagctgaaga ctatcccatt gacctctact 66; accttatgga cctgtcttac tcaatgaaag acgatttgga gaatgtaaaa agtcttggaa 72; cagatctgat gaatgaaatg aggaggatta cttcggactt cagaattgga tttggctcat 78; ttgtggaaaa gatg atta gcacaacacc gctc aggaaccctt 84; gcacaagtga acagaactgc accagcccat acaa aaatgtgctc agtcttacta 90; gaga agtatttaat gaacttgttg gaaaacagcg tgga aatttggatt 96; ctccagaagg tggtttcgat gccatcatgc aagttgcagt ttgtggatca ctgattggct 1021 ggaggaatgt gctg ctggtgtttt ccacagatgc cgggtttcac tttgctggag 108; atgggaaact tggtggcatt gttttaccaa gaca atgtcacctg gaaaataata 114; tgtacacaat gagccattat tatgattatc cttctattgc tcaccttgtc cagaaactga 120; gtgaaaataa tattcagaca atttttgcag aaga atttcagcct gtttacaagg 126; agctgaaaaa cttgatccct aagtcagcag taggaacatt aaat tctagcaatg L32; taattcagtt gatcattgat gcatacaatt ccctttcctc agaagtcatt ttggaaaacg L38; gcaaattgtc agaaggcgta acaataagtt acaaatctta ctgcaagaac ggggtgaatg L44; gaacagggga aaatggaaga aaatgttcca atatttccat tggagatgag gttcaatttg L50; aaattagcat aacttcaaat aagtgtccaa aaaaggattc tgacagcttt aaaattaggc L56; ctctgggctt tacggaggaa gtta ttcttcagta catctgtgaa tgtgaatgcc L62; aaagcgaagg catccctgaa agtcccaagt gtcatgaagg aaatgggaca tgtg L68; gcag gtgcaatgaa gggcgtgttg gtagacattg cagc acagatgaag L74; ttaacagtga agacatggat gcttactgca aaaa cagttcagaa atctgcagta L80; acaatggaga gtgcgtctgc ggacagtgtg ggaa gagggataat acaaatgaaa L86; tttattctgg caaattctgc gagtgtgata atttcaactg tgatagatcc tttgtggagg aaatggtgtt tgcaagtgtc gtgtgtgtga cccc gcagtgcatg tgactgttct ttggatacta gtacttgtga agccagcaac ggacagatct 204; gccg gggcatctgc gagtgtggtg tctgtaagtg tacagatccg aagtttcaag 210; ggcaaacgtg tgagatgtgt cagacctgcc ttggtgtctg tgctgagcat aaagaatgtg 216; ttcagtgcag agccttcaat aaaggagaaa acac atgcacacag gaatgttcct 222; attttaacat ggta gaaagtcggg acaaattacc ccagccggtc caacctgatc 228; ctgtgtccca ttgtaaggag aaggatgttg acgactgttg gttctatttt acgtattcag 234; tgaatgggaa ggtc atggttcatg ttgtggagaa tccagagtgt cccactggtc 240; tcat tccaattgta gctggtgtgg gaat tgttcttatt ggccttgcat 246; tactgctgat atggaagctt ataa ttcatgacag aagggagttt gctaaatttg 252; aaaaggagaa aatgaatgcc aaatgggaca cgggtgaaaa tcctatttat gccg 258; taacaactgt ggtcaatccg aagtatgagg gaaaatgagt actgcccgtg caaatcccac 264; aacactgaat gcaaagtagc aatttccata gtcacagtta ggtagcttta gggcaatatt 270; gccatggttt tactcatgtg caggttttga aaatgtacaa tatgtataat ttttaaaatg 276i ttttattatt ttgaaaataa tgttgtaatt catgccaggg actgacaaaa gacttgagac 2821 aggatggtta tcag ctaaggtcac attgtgcctt tttgaccttt tcttcctgga 288; ctattgaaat caagcttatt ggattaagtg atatttctat agcgattgaa agggcaatag 294; ttaaagtaat gagcatgatg tctg ttaatcatgt attaaaactg atttttagct 300; ttacaaatat gtcagtttgc agttatgcag aatccaaagt aaatgtcctg ctagctagtt 306; aaggattgtt ttaaatctgt tattttgcta tttgcctgtt agacatgact gatgacatat 312; ctgaaagaca agtatgttga gagttgctgg tgtaaaatac gtttgaaata gttgatctac 318; aaaggccatg ggaaaaattc agagagttag gaaggaaaaa ccaatagctt taaaacctgt 324; gtgccatttt aagagttact taatgtttgg taacttttat gccttcactt tacaaattca 330; agccttagat aaaagaaccg agcaattttc tgctaaaaag tccttgattt agcactattt 336; acatacaggc catactttac aaagtatttg ctgaatgggg accttttgag ttgaatttat 342; tttattattt ttattttgtt taatgtctgg tgctttctgt ttct aatcttttaa 348; tgtatttgtt tgcaattttg gggtaagact ttttttatga gtactttttc gttt 354; tagcggtcaa tttgcctttt taatgaacat gtgaagttat actgtggcta tgcaacagct 360; ctcacctacg cgagtcttac tttgagttag aaca gaccactgta tgtttacttc 366; tcaccatttg agttgcccat tcac caca ttcttgtttt aagtgccttt 372; agttttaaca gttcactttt tacagtgcta tttactgaag ttatttatta aatatgccta 378; aaatacttaa atcggatgtc ttgactctga tgtattttat caggttgtgt gcatgaaatt 384; tttatagatt aaagaagttg agca aaaaaaaaa Protein seguence: NCBI Reference Seguence: NP_002202.2 LOCUS: NP_002202 ION: NP 002202 1 mnlqpifwig lissvccvfa qtdenrclka nakscgeciq agpncgwctn stflqegmpt 6; sarcddleal kkkgcppddi enprgskdik knknvtnrsk gtaeklkped itqiqpqu—v 12; lrlrsgepqt ftlkfkraed ypidlyylmd lsysmkddle nvkslgtdlm nemrritsdf 18; rigfgsfvek tvmpyisttp aklrnpctse qnctspfsyk nvlsltnkge vfnelvgqu 24; isgnldspeg qvav cgsligwrnv trllvfstda gfhfagdgkl ggivlpndgq ; chlennmytm shyydypsia hlvqklsenn vtee fqpvykelkn lipksavgtl 36; sanssnviql iidaynslss evilengkls egvtisyksy ckngvngtge ngrkcsnisi 42; gdevqfeisi tsnkcpkkds plgf teevevilqy icececqseg ipespkcheg 48L gacr cnegrvgrhc vnse dmdaycrken sseicsnnge cvcgqcvcrk 54L rdntneiysg nfnc drsnglicgg ngvckcrvce cnpnytgsac dcsldtstce 60; asngqicngr gicecgvckc qtc emcqtclgvc aehkecvqcr afnkgekkdt 66; ctqecsyfni tkvesrdklp qpvqupvsh ckekdvddcw fyftysvngn nevmvhvven 72; pecptgpdii pivagvvagi Vliglallli wkllmiihdr refakfekek mnakwdtgen 78; piyksavttv vnpkyegk OfibblSymbok MYH10 Official Name: myosin, heavy chain 10, non-muscle [Homo s] m: 4628 Organism: Homo sapiens Other Aliases: NMMHC-IIB, NMMHCB Other ations: cellular myosin heavy chain, type B; myosin heavy chain, nonmuscle type B; , heavy polypeptide 10, non-muscle; myosin-10; nonmuscle myosin || heavy chain-B; nonmuscle myosin heavy chain ”8 Nucleotide ce: NCBI Reference Seguence: NM_001256012.1 LOCUS: NM 001256012 ACCESSION : NM_001256012 l agcagtgcta aaggagcccg gcag cggtgggttt gggactgagg cgctggatct 6; gcgg ctggggacgt gcgcccgcgc caccatcttc ggctgaagag gcaattgctt 12; ttggatcgtt ccatttacaa tggcgcagag aactggactc gaggatccag agaggtatct 18; ctttgtggac gtca tctacaaccc tgccactcaa gctgattgga cagctaaaaa 24; gctagtgtgg attccatcag aacgccatgg ttttgaggca gctagtatca aagaagaacg ; gggagatgaa gttatggtgg agttggcaga gaatggaaag aaagcaatgg aaga 36; tgatattcag aagatgaacc cacctaagtt ttccaaggtg gaggatatgg cagaattgac 42; atgcttgaat gaagcttccg ttttacataa tctgaaggat tatt caggactaat 48; ctatacttat tctggactct tctgtgtagt tataaaccct tacaagaatc ttta 54; ctctgagaat attattgaaa tgtacagagg gaagaagcgt catgagatgc ctccacacat 60; ctatgctata tctgaatctg cttacagatg catgcttcaa gatcgtgagg accagtcaat 66L tctttgcacg ggtgagtcag gtgctgggaa aaat acaaagaaag ttattcagta 72L ccttgcccat gttgcttctt cacataaagg aagaaaggac cataatattc ctcaggaatc 78L gcctaaacca gtgaaacacc agggggaact tgaacggcag cttttgcaag caaatccaat 84; tctggaatca tttggaaatg cgaagactgt gaaaaatgat tctc gttttggcaa 90; atttattcgg atcaactttg atgtaactgg ctatatcgtt ggggccaaca ttgaaacata 96; ggaa aagtctcgtg ctgttcgtca agcaaaagat gaacgtactt tctt L02; ttaccagttg ttatctggag caggagaaca cctaaagtct gatttgcttc ttgaaggatt L08; taataactac aggtttctct ccaatggcta tattcctatt ccgggacagc aaga L14; taatttccag gagaccatgg aagcaatgca cataatgggc catg aagagattct L20; gtcaatgctt gtat cttcagtgct acagtttgga aatatttctt tcaaaaagga L26; tact gatcaagctt ccatgccaga aaatacagtt gcgcagaagc tctgccatct L32; gatg aatgtgatgg agtttactcg ggccatcctg actccccgga gtgcaaaaag cccagaccaa agaacaggca gattttgcag gagc ggctctttcg ctggctcgtt catcgcatca L50; ggataggacc aaacgtcagg gagcatcttt cattggaatc ctggatattg ctggatttga L56; aatttttgag ctgaactcct ttgaacaact ttgcatcaac tacaccaatg agaagctgca L62; gcagctgttc aaccacacca tgtttatcct agaacaagag gaataccagc gcgaaggcat L68; cgagtggaac ttcatcgatt tcgggctgga tctgcagcca tgcatcgacc acctgcgaac cctcctggtg tactggccct tttggatgaa gaatgctggt aagc L80; cacagataaa acctttgttg aaaaactggt tcaagagcaa ggttcccact ccaagtttca L86; gaaacctcga caattaaaag acaaagctga tttttgcatt atacattatg cagggaaggt L92; ggactataag gcagatgagt ggctgatgaa gaatatggac cccctgaatg acaacgtggc L98; cacccttttg caccagtcat cagacagatt tgtggcagag ctttggaaag atgagattca 204; gaatattcag agagcttctt tctatgacag tgtttctggt cttcatgagc caccagtgga 210; cgtg ggtctggatc aagtcactgg tatgactgag acagcttttg gctccgcata 216; taaaaccaag aagggcatgt ccgt tgggcaactc tacaaagaat ctctcaccaa 222; gctgatggca actctccgaa acaccaaccc taactttgtt cgttgtatca ttccaaatca 228; cgagaagagg gctggaaaat tggatccaca cctagtccta gatcagcttc gctgtaatgg 234; tgtcctggaa gggatccgaa tctgtcgcca gggcttccct aaccgaatag ttttccagga 240; attcagacag agatatgaga tcctaactcc aaatgctatt cctaaaggtt atgg 2461 gqcc tgtgaacgaa tgatccgggc attg gacccaaact tgtacagaat 252; tggacagagc aagatatttt tcagagctgg agttctggca cacttagagg aagaaagaga 258; tttaaaaatc accgatatca ttatcttctt ccaggccgtt tgcagaggtt acctggccag 264; aaaggccttt gccaagaagc agcagcaact aagtgcctta aaggtcttgc agcggaactg 270; tgccgcgtac ctgaaattac ggca gtggtggcga gtcttcacaa aggtgaagcc 276; gcttctacaa gtgactcgcc aaga acttcaggcc aaagatgaag agctgttgaa 282; ggtgaaggag aagcagacga aagg agagctggag gagatggagc ggaagcacca 288; ttta gaagagaaga atatccttgc agaacaacta caagcagaga ctgagctctt 294; tgctgaagca gaagagatga gggcaagact tgctgctaaa gaat tagaagagat 300; tctacatgac ttggagtcta gggttgaaga agaagaagaa agaaaccaaa tcctccaaaa 306; tgaaaagaaa aaaatgcaag cacatattca ggacctggaa gaacagctag acgaggagga 312; aggggctcgg caaaagctgc agctggaaaa agca gaggccaaga tcaagaagat 318; ggaagagqag attctgcttc tcgaggacca aaattccaag ttcatcaaag aaaagaaact 324; catggaagat cgcattgctg agtgttcctc tcagctggct gaagaggaag aaaaggcgaa 330; aaacttgqcc aaaatcagga ataagcaaga agtgatgatc ttag aagaacgctt 336; aaagaaggaa gaaaagactc gtcaggaact ggaaaaggcc aaaagaaaac ggga 342; gacgaccgac ctgcaggacc agatcgcaga ggcg cagattgatg agctcaagct 348; gcagctggcc aagaaggagg aggagctgca gggcgcactg gccagaggtg atgatgaaac 354; actccataag aacaatgccc ttaaagttgt gcgagagcta caagcccaaa ttgctgaact 360; agac tttgaatccg cttc acggaacaag gccgaaaagc agaaaaggga 366; cttgagtgag gaactggaag ctctgaaaac agagctggag gacacgctgg acaccacggc 372; agcccagcag gaactacgta caaaacgtga acaagaagtg gcagagctga agaaagctct 378; tgaggaggaa actaagaacc atgaagctca aatccaggac atgagacaaa gacacgcaac 384; ggag gagctctcag agcagctgga acaggccaag cggttcaaag caaatctaga 390; gaagaacaag cagggcctgg agacagataa caaggagctg gcgtgtgagg tcct 396; gcagcaggtc aaggctgagt ctgagcacaa gaggaagaag ctcgacgcgc aggtccagga 402; gctccatgcc aaggtctctg aaggcgacag gctcagggtg gagctggcgg agaaagcaag 408; taagctgcag aatgagctag ataatgtctc cacccttctg gaagaagcag agaagaaggg 414; tattaaattt gctaaggatg gtct tgagtctcaa ctacaggata agct 420; tcttcagqag gagacacgcc taaa cctgagcagt cggc agctggaaga 4261 ggagaagaac agtcttcagg agcagcagga ggaggaggag gaggccagga agaacctgga 432; agtg ctggccctgc agtcccagtt ggctgatacc aagaagaaag tagatgacga 438; cctgggaaca attgaaagtc tggaagaagc caagaagaag cttctgaagg acgcggaggc 444; cctgagccag Cgcctggagg agaaggcact ggcgtatgac aaactggaga agaccaagaa 450; ccgcctgcag ctgg acgacctcac ggtggacctg gaccaccagc gccaggtcgc 456; ctccaacttg gagaagaagc agaagaagtt tgaccagctg ttagcagaag agaagagcat 462; tcgc tatgccgaag agcgggaccg ggccgaagcc gaggccagag aaac 468; caaagccctg tcactggccc gggccctcga ggaagccctg gaggccaagg aggagtttga 474; gaggcagaac aagcagctcc gagcagacat ggaagacctc atgagctcca atgt 480; gggaaaaaac gttcacgaac aatc caaacgggcc ctagagcagc aggtggagga 486; aatgaggacc cagctggagg agctggaaga cgaactccag gccacggaag atgccaagct 492; tcgtctgqag atgc aggccatgaa ggcgcagttc gagagagacc tgcaaaccag 498; ggatgagcag aatgaagaga agaagcggct gctgatcaaa caggtgcggg agctcgaggc 504; ggagctgqag gatgagagga aacagcgggc gcttgctgta gcttcaaaga aaaagatgga 510; gatagacctg aaggacctcg aagcccaaat cgaggctgcg aacaaagctc gggatgaggt 516; gattaagcag ctccgcaagc tccaggctca gatgaaggat taccaacgtg aattagaaga 522; agctcgtgca tccagagatg agatttttgc tcaatccaaa gagagtgaaa agaaattgaa 528; gagtctggaa gcagaaatcc ttcaattgca actt gcctcatctg agcgagcccg 534; ccgacacgcc gagcaggaga gagatgagct ggcggacgag atcaccaaca gcgcctctgg 540; caagtccgcg ctgctggatg agaagcggcg tctggaagct cggatcgcac agctggagga 546; ggagctggaa gaggagcaga gcaacatgga gctgctcaac gaccgcttcc gcaagaccac 552; ggtg gacacactga acgccgagct agcagccgag cgcagcgccg cccagaagag 558; tgca cgccagcaac tggagcggca gaacaaggag ctgaaggcca agctgcagga 564; actcgagggt gctgtcaagt ctaagttcaa ggccaccatc tcagccctgg aggccaagat 570; tgggcagctg gaggagcagc ttgagcagga ggaa cgagcagccg aatt 576; agtccgtcgc actgagaaga agctgaaaga aatcttcatg gagg atgagcgtcg 582; ggac cagtataaag agcagatgga gaaggccaac gctcggatga agcagcttaa 588; acgccagctg gaggaagcag aagaagaagc gacgcgtgcc aacgcatctc ggcgtaaact 594; ccagcgggaa ctggatgatg ccaccgaggc caacgagggc ctgagccgcg aggtcagcac 600; cctgaagaac cggctgaggc ggggtggccc catcagcttc tcttccagcc gatctggccg 6061 gcgccagctg caccttgaag gagcttccct ggagctctcc gacgatgaca cagaaagtaa 612; gaccagtgat gtcaacgaga cgcagccacc ccagtcagag taaagttgca ggaagccaga 618; ggaggcaata gaca gttaggaatg gggc ctcctgcaga tttcggaaat 624; tggcaagcta nggattcct tcctgaaaga tcaactgtgt cttaaggctc tccagcctat 630; gcatactgta tcctgcttca gacttaggta ctcc cctttttata tatagacaca 636; cacaggacac taaa cagattgttt catcattgca tctattttcc atatagtcat 642; caagagacca ttttataaaa catggtaaga ccctttttaa aacaaactcc aggcccttgg 648; ttgcgggtcg ctgggttatt ggggcagcgc cgtggtcgtc actcagtcgc tctgcatgct 654; ctctgtcata cagacaggta ttct gtgttcacgt ggcccccgac tcctcagcca 660; catcaagtct cctagaccac tgtggactct aaactgcact tgtctctctc atttccttca 666; aataatgatc attt cagtgagcaa actgtgaaag tgga aagagtagga 672: qqqqtqqqct ggatcggaag caacacccat ttggggttac catgtccatc ccccaagggg 678; ggccctgccc ctcgagtcga tggtgtcccg catctactca tgtgaactgg ccttggcgag 684; ggctggtctg tgcatagaag ggatagtggc cacactgcag ctgaggcccc aggtggcagc 6901 catggatcat gtagacttcc agatggtctc ccgaaccgcc tggctctgcc ggcgccctcc 696; tcacgtcagg agcaagcagc cgtggacccc taagccgagc tggtggaagg cccg 702; tcgccagccg catg ctgaccttgc agcc gctgctttga gcccaaaatg 708; ggaatattgg ttttgtgtcc gaggcttgtt ccaagtttgt caatgaggtt tatggagcct 714; ccagaacaga tgccatcttc ctgaatgttg acatgccagt gact ccttcatttt 720; tccttctccc ttccctttgg ttac agtgaacact tagcatcctg gttg 726; gtagttaagc aaactgacat tacggaaagt gccttagaca ctacagtact aagacaatgt 732; tgaatatatc attcgcctct ataacaattt aatgtattca actg tgcttcatat 738; catgtacctc tctagtcaaa gtggtattac agacattcag tgacaatgaa tcagtgttaa 744; ttctaaatcc ttgatcctct gcaatgtgct tgaaaacaca aaccttttgg gttaaaagct 750; tcta ttaggaagaa tttgtcctgt gggtttggaa tcttggattt tcccccttta 756; tgaactgtac ttga ccaccagaca cctgaccgca aatatctttt cttgtattcc 762; catatttcta gacaatgatt tttgtaagac aataaattta ttcattatag atatttgcgc 768; ctgctctgtt tacttgaaga cacc cgtggagaat aaagagacct caataaacaa 774; gaataatcat gtgaacgtgg aaaaaaaaaa aaaaaaaa Protein seguence: NCBI Reference Seguence: NP_001242941.1 LOCUS: NP_001242941 ACCESSION: NP 001242941 1 maqrtgledp drav iynpatqadw takklvwips erhgfeaasi keergdevmv 61 elaengkkam vnkddiqkmn ppkfskvedm aeltclneas vlhnlkdryy sgliytysgl 12; fcvvinpykn lpiyseniie myrgkkrhem ises ayrcmlqdre dqsilctges 18; gagktentkk viqylahvas shkgrkdhni pqespkpvkh qgelerqllq anpilesfgn 24; dnss rfgkfirinf dvtgyivgan ietylleksr avrqakdert fhifyqllsg ; agehlksdll legfnnyrfl sngyipipgq qdkdnfqetm eamhimgfsh eeilsmlkvv 36; ssvqugnis fkkerntdqa smpentvaqk lchllgmnvm eftrailtpr yvqk 42; aqtkeqadfa vealakatye rlfrwlvhri tqu gasfigildi agfeifelns 48L feqlcinytn equqlfnht mfileqeeyq regiewnfid fgldlqpcid lierpanppg 54L vlalldeecw fpkatdktfv eklvqqush squkprqlk iihy agkvdykade 60; wlmknmdpln dnvatllhqs sdrfvaelwk deiqniqras fydsvsglhe ppvdrivgld 66L qvtgmtetaf kkgm frtvgquke sltklmatlr ntnpnfvrci ipnhekragk 72; ldphlvldql rcngvlegir icrqgfpnri quefrqrye iltpnaipkg acer 78; miraleldpn lyrigqskif fragvlahle eerdlkitdi iiffqavcrg ylarkafakk 84; qqqlsalkvl qrncaaylkl rhwqurvft kvkpllqvtr qeeelqakde ellkvkekqt 90; kvegeleeme rkhqqlleek nilaeqlqae telfaeaeem rarlaakkqe leeilhdles 96; rveeeeernq ilqnekkkmq ahiqdleeql deeegaqul qlekvtaeak ikkmeeeiLl L02; ledqnskfik ekklmedria aeee ekaknlakir nkqevmisdl eekt L08; rqelekakrk ldgettdlqd qiaelqaqid kke eelqgalarg ddetlhknna L14; lkvvrelqaq iaelqedfes ekasrnkaek qkrdlseele alkteledtl dttaaqqeLr L20; tkreqevael kkaleeetkn heaqiqdmrq eels eqleqakrfk anleknkqgl L26; etdnkelace vkvquvkae sehkrkklda qvqelhakvs egdrlrvela ekasqunel L32; dnvstlleea ekkgikfakd aaslesqlqd tqellqeetr qklnlssrir qleeeknqu L38; eqqeeeeear knlekqvlal qsqladtkkk vdddlgties leeakkkllk L44; ekalaydkle ktknrquel ddltvdldhq rqvasnlekk qkkqullae eksisaryae L50; aear lsla raleealeak eeferanql radmedlmss kddvgknvhe L56; lekskraleq qveemrtqle eledelqate daklrlevnm qamkaqferd lqtrdeqnee L621 kkrllikqu eleaeleder qualavask lkdl eaqieaanka rdevikqlrk L68; lqaqudyqr eleearasrd eifaqskese eaei lqlqeelass erarrhaeqe L74; rdeladeitn sasgksalld ekrrlearia qleeeleeeq snmellndrf rktthvdtl L80; naelaaersa aqksdnarqq leranelka quelegavk skfkatisal eakigqleeq L86; eraa anklvrrtek klkeifmqve derrhadqyk eqmekanarm kqlquleea 1921 eeeatranas rrqureldd ateaneglsr nrlr rggpisfsss rsgrrqlhle 1981 gaslelsddd tesktsdvne tqppqse Official Symbol: NCL Official Name: nucleolin [Homo sapiens] Gene ID: 4691 Organism: Homo sapiens Other s: 023 Nucleotide seguence: NCBI Reference Seguence: NM_005381.2 LOCUS: NM 005381 ACCESSION : NM_005381 XM_002342275 ctttcgcctc agtctcgagc tctcgctggc cttcgggtgt acgtgctccg ggatcttcag 61 cacccgcggc cgccatcgcc gtcgcttggc ttcttctgga ctcatctgcg gtcc 12; gcttcacact ccgccgccat catggtgaag ctcgcgaagg caggtaaaaa tcaaggtgac 18; cccaagaaaa tggctcctcc tccaaaggag gtagaagaag atagtgaaga tgaggaaatg 24; tcagaagatg aagaagatga tagcagtgga gtcg tcatacctca gaagaaaggc ; aagaaggctg ctgcaacctc agcaaagaag gtggtcgttt ccccaacaaa aaaggttgca 36; gttgccacac cagccaagaa agcagctgtc actccaggca aaaaggcagc agcaacacct 42; aaga cagttacacc agca gttaccacac ctggcaagaa gggagccaca 48; ccaggcaaag cattggtagc aactcctggt aagaagggtg ctgccatccc agccaagggg 54; gcaaagaatg gcaagaatgc caagaaggaa gacagtgatg aagaggagga tgatgacagt 60; gaggaggatg aggaggatga cgaggacgag gatgaggatg aagatgaaat tgaaccagca 66L gcgatgaaag ctgc tgcccctgcc tcagaggatg aggacgatga ggatgacgaa 72; gatgatgagq atgacgatga cgatgaggaa tctg aagaagaagc tatggagact 78; acaccagcca aaggaaagaa agctgcaaaa gttgttcctg ccaa ggct 841 gaag atgaagaaga ggatgatgag gacgaggatg acgacgacga cgaagatgat 90; gaagatgatg aaga tgatgaggag gaggaagaag aggaggagga agagcctgtc 96; gcac ctggaaaacg aaagaaggaa atggccaaac agaaagcagc tcctgaagcc L02; aagaaacaga aagtggaagg cacagaaccg gctt tcaatctctt tgttggaaac L08; ctaaacttta acaaatctgc tcctgaatta aaaactggta atgt ttttgctaaa L14; aatgatcttg ctgttgtgga tgtcagaatt ggtatgacta ggaaatttgg ttatgtggat L20; tttgaatctg ctgaagacct ggagaaagcg ttggaactca tgaa agtctttggc L26: aatgaaatta aactagagaa accaaaagga aaagacagta agaaagagcg L32; acacttttgg ctaaaaatct cccttacaaa gtcactcagg atgaattgaa agaagtgttt L38; gaagatgctg tcag attagtcagc aaggatggga aaagtaaagg attgaattta agacagaagc tgatgcagag aaaacctttg aagaaaagca gggaacagag L50; atcgatgggc gatctatttc cctgtactat actggagaga aaggtcaaaa agaggtggaa gcac ttggagtggt gaatcaaaaa ctctggtttt cctc L62; tcctacagtg caacagaaga aactcttcag gaagtatttg agaaagcaac ttttatcaaa L68; gtaccccaga accaaaatgg caaatctaaa gggtatgcat ttatagagtt tgcttcattc L74; gaagacgcta aagaagcttt aaattcctgt aataaaaggg aaattgaggg cagagcaatc L80; aggctggagt tgcaaggacc caggggatca cctaatgcca gaagccagcc atccaaaact L86; ctgtttgtca aaggcctgtc tgaggatacc actgaagaga cattaaagga gtcatttgac L92; ggctccgttc gggcaaggat agttactgac cgggaaactg ggtcctccaa agggtttggt L98; tttgtagact gtga ggaggatgcc aaagctgcca aggaggccat ggaagacggt 204; gaaattgatg gaaataaagt taccttggac tgggccaaac ctaagggtga aggtggcttc 210; gggggtcgtg gtggaggcag aggcggcttt ggaggacgag gtggtggtag ccga 216; ggaggatttg gtggcagagg ccggggaggc tttggagggc gaggaggctt ccgaggaggc 222; agaggaggag gaggtgacca acaa ggaaagaaga cgaagtttga atagcttctg 228; tccctctgct ttcccttttc catttgaaag ctct ggggttttta ctgttacctg 234: atcaatgaca gagccttctg aggacattcc aagacagtat ctgt ggtctccttg 2401 gaaatccgtc tagttaacat ttcaagggca ataccgtgtt ggttttgact ggatattcat 246; ataaactttt taaagagttg agtgatagag ctaaccctta tctgtaagtt ttgaatttat 252; tcat taca aaaccatttt ttcctacaaa gggt tttgttgttg 258; tttt ttgttttgtt tttgtttttt ttttttttgc gttcgtgggg ttgtaaaaga 264; aaagaaagca gaatgtttta tcatggtttt tgcttcagcg ggac aaattaaaag 270; tcaactctgg tgccagaaaa aaaaaaaaaa aa n ce: NCBI Reference Seguence: NP_005372.2 LOCUS: NP_005372 ACCESSION: NP_005372 XP_002342316 ; mvklakagkn qupkkmapp pkeveedsed eemsedeedd ssgeevvipq kkgkkaaats 6; akkvvvsptk kvavatpakk aavtpgkkaa tvtp akavttpgkk gatpgkalva 12; tpgkkgaaip gkna kkedsdeeed ddseedeedd ededededei aaaa l8; apasededde ddeddedddd deeddseeea mettpakgkk aakvvpvkak eddedddded deeeeeeeee epvkeapgkr kkemaquaa ; tepttafnlf Vgnlnfnksa pelktgisdv fakndlavvd vrigmtrkfg yvdfesaedl 36; ekaleltglk Vfgneiklek kker dartllaknl pykvtqdelk evfedaaeir 42; lvskdgkskg iayiefktea daektfeekq gteidgrsis lyytgekgqn knst 48; tlvl snlsysatee thevfekat fikqunqng kskgyafief asfedakeal ; nscnkreieg rairlelqu rgspnarsqp sktlkagls edtteetlke sfdgsvrari 60; vtdretgssk gfgfvdfnse edakaakeam edgeidgnkv tldwakpkge gggr 66; ggfggrgggr ggrggfggrg rggfggrggf rggrggggdh kpqgkktkfe Official Symbol: SE061A1 Official Name: Sec61 alpha1 subunit (S. cerevisiae) [Homo sapiens] Gene ID: 29927 Organism: Homo sapiens Other Aliases: HSE061, SE061, SECGfA Other Designations: Sec61 alpha-1;protein transport protein SECGf alpha subunit; protein transport protein 8e061 subunit alpha; protein transport protein Sec61 subunit alpha isoform 1; sech Nucleotide seguence: NCBI nce Seguence: NM_013336.3 LOCUS: NM_013336 ACCESSION : NM_013336 NM_015968 agcgatccga ggcccggccc cggccccgcc ccgcgccgcg ccgcgccgct tgccgccggg 6; ctga cgtgtctctc ggcggagctg ctgtgcagtg gaacgcgctg gggc 12; agcgtcgcct cacgcggagc agagctgagc tgaagcggga cccggagccc gagcagccgc 18; cgccatggca atcaaatttc tggaagtcat caagcccttc tgtgtcatcc tgccggaaat 24; tcagaagcca gagaggaaga ttcagtttaa ggagaaagtg ctgtggaccg ctatcaccct ; ctttatcttc ttagtgtgct ttcc cctgtttggg atcatgtctt cagattcagc 36L tgaccctttc tattggatga gagtgattct agcctctaac agaggcacat tgatggagct 42; agggatctct cctattgtca cgtctggcct tataatgcaa ctcttggctg gcgccaagat 48L agtt ggtgacaccc caaaagaccg agctctcttc aacggagccc aaaagttatt 54L tggcatgatc attactatcg gccagtctat tgtg atgaccggga ggga 60; tgaa atgggtgctg gaatttgcct cacc attcagctct ttgttgctgg 66; tgtc ctacttttgg atgaactcct agga cttg gctctggtat 72; ttctctcttc attgcaacta acatctgtga aaccatcgta tggaaggcat tcagccccac 78; tactgtcaac actggccgag gaatggaatt tgaaggtgct atcatcgcac ttttccatct 84; gctggccaca cgcacagaca aggtccgagc ccttcgggag gcgttctacc gccagaatct 90; tcccaacctc ctca tcgccaccat ctttgtcttt gcagtggtca tctatttcca 96; gggcttccga gtggacctgc caatcaagtc ggcccgctac cgtggccagt acaacaccta L02; caag ctcttctata cgtccaacat ccccatcatc ctgcagtctg ccctggtgtc L08; caacctttat gtcatctccc aaatgctctc agctcgcttc agtggcaact tgctggtcag L14; cctgctgggc acctggtcgg acacgtcttc tgggggccca gcacgtgctt atccagttgg L20; tggcctttgc tattacctgt cccctccaga atcttttggc tccgtgttag aagacccggt L26; ccatgcagtt gtatacatag tgttcatgct gggctcctgt ttct ccaaaacgtg L32; gattgaggtc tcaggttcct ctgccaaaga tgttgcaaag cagctgaagg agcagcagat L38; ggtgatgaga ggccaccgag agacctccat ggtccatgaa ctcaaccggt acatccccac L44; agccgcgqcc tttggtgggc tgtgcatcgg ggccctctcg gtcctggctg cgccattggg tctggaaccg ggatcctgct cgcagtcaca atcatctacc agtactttga L56; gatcttcgtt aaggagcaaa gcgaggttgg cagcatgggg gccctgctct aggttgagga agctgctcca gaagcgcctc ggaaggggag ggcgcgtgct gctgcggcat atggactttt aataatgttt ttgaatttcg ttca L74; ttccactgtg taaagtgcta gacattttcc aatttaaaat tttt atcctggcac 180; aaag aactgtgaaa gtgaaatttt attcagccga ctgccagaga agtgggaatg 186; gtataggatt gtccccaagt gtccatgtaa cttttgtttt aacctttgca ccttctcagt 192; gctgtatgcg gctgcagccg ctgt ttccccacaa agggaatttc tggt 198; tggaagcaca aacactgaaa cgtt tcattttggc agtagggtgt gaagctggga 204; gcagatcatg tatttcccgg agacgtggga ccttgctggc atgtctcctt cacaatcagg 210; cgtgggaata tctggcttag gactgtttct ctctaagaca ccattgtttt cccttatttt 216; aaaagtgatt tttttaagga cagaacttct tccaaaagag agggatggct ttcccagaag 222; acactcctgg gtgg atttgtctgt gcacctattg gctcttctag ctgactcttc 228; tggttggqct ctgc ctgtttctgc tagctccgtg tttagtccac ttgggtcatc 234; agctctgcca agctgagcct ggccaagcta caga cccttgcagt gatgtccgtt 240; gatt ctgccagtca tcactggaca cgtctcctcg cagctgccct agcaagggga 246; gacattgtgg tagctatcag acatggacag aaactgactt agtgctcaca agcccctaca 252; ccttctgggc tgaagatcac ccagctgtgt tcagaatttt cttactgtgc ttaggactgc 258; acgcaagtga gcagacacca ccgacttcct ttctgcgtca ccagtgtcgt cagcagagag 264; aggacagcac aagg gtga agtcaggttc ggggtgcatg ggctgtggtg 270; gtgttgatca gttgctccag tgtttgaaat aagaagactc atgtttatgt ctggaataag 276; ttctgtttgt gctgacaggt ggcctaggtc ctggagatga gcaccctctc tctggccttt 282; agggagtccc ctcttaggac aggcactgcc cagcagcaag ggcagcagag ttgggtgcta 288; agatcctgag gagctcgagg tttcgagctg gctttagaca ttggtgggac caaggatgtt 294; ttgcaggatg ccctgatcct aagaaggggg cctgggggtg cgtgcagcct gtcggggaga 300; ccccactctg acagtgggca cacggcagcc tgcaaagcac agggccaccg ccacagcccg 306; gcagaggggc acactctgga gaccttgctg ctag ccaggaaaca gagtgaccaa 312; gggacaagaa gggacttgcc cacc cagcaactca gcagcagaac caagatgggc 318; cccaggctcc tccatatggc ctta tatc acacgtggcc ttgtctagac 324; ccagtcctga gaga ggctcttgag acctgatgcc ctcctaccca catggttctc 330; ccct gtctgctctg ctgctacaga ggggcagggc ctcccccagc ccacgcttag 336; gaatgcttgg cctctggcag gcaggcagct gtacccaagc tggtgggcag ggggctggaa 342; ggcaccaggc ctcaggagga gccccatagt cccgcctgca gcctgtaacc atcggctggg 348; ccctgcaagg cccacactca cgccctgtgg gtgatggtca ngtgggtgg gtgggggctg 354; accccagctt ccaggggact gtcactgtgg acgccaaaat ggcataactg agataaggtg 360; aataagtgac aaataaagcc agttttttac aaggtaaaaa aaaaaaaaaa aaaaaa Protein seguence: NCBI Reference Seguence: 468.1 LOCUS: NP_037468 ACCESSION: NP_037468 NP_057052 l maikflevik pfcvilpeiq kperkiqfke kvlwtaitlf iflvccqipl fgimssdsad 6; pfywmrvila snrgtlmelg ispivtsgli mqllagakii evgdtpkdra lfngaqklfg 12; qsiv yvmtgmygdp semgagicll itiqlfvagl ivllldellq kgyglgsgis l8; lfiatnicet ivwkafsptt vntgrgmefe gaiialfhll atrtdkvral qn;p 24; nlmnliatif Vfavviyfqg frvdlpiksa ryrgqyntyp iklfytsnip iilqsalvsn ; lyvisqmlsa rfsgnllvsl lgtwsdtssg gparaypvgg lcyylsppes fgsvledpvh 36; avvyivfmlg scaffsktwi evsgssakdv qqmv mrghretsmv helnryipta 42; aafgglciga lsvladflga igsgtgilla vtiiyqyfei kaeqsevgs mgallf PAPSSZ Official Sym bol: PAPSSZ Official Name: 3'-phosphoadenosine 5'—phosphosulfate se 2 [Homo sapiens] Gene ID: 9060 Organism: Homo sapiens Other Aliases: RP11-77F13.2, ATPSK2, 8K2 Other Designations: 3—prime-phosphoadenosine 5-prime-phosphosulfate synthase 2; ATP ylase/APS kinase 2; ATP sulfurylase/adenosine 5'- phosphosulfate kinase; PAPS synthase 2; PAPS synthetase 2; PAPSS 2; SK 2; bifunctional 3'-phosphoadenosine 5'-phosphosu|fate synthase 2; bifunctional 3'- phosphoadenosine 5‘-phosphosulfate synthethase 2 Nucleotide seguence: NCBI nce Seguence: NM_001015880.1 LOCUS: NM_001015880 ACCESSION I NM_001015880 l ctaggcggcg gcggccgggt ccccaaggct gggcgctgct tgcggaaccg acggggcgga 61 cgtg gcgggaggag gagtaggaga agggggctgg tcaagggaag tgcgacgtgt 12; agcc tttttatacc tccttcccgg gagtccggca gccgctgctg ctgctgctgc 18; tgctgctgcc gccgccgccg ccgccgtccc tgcgtccttc ggtctctgct cccgggaccc 24; gggctccgcc gcagccagcc agcatgtcgg ggatcaagaa gcaaaagacg gagaaccagc ; agaaatccac caatgtagtc tatcaggccc accatgtgag taag agagggcaag 36; tggttggaac aaggggtggg ttccgaggat gtaccgtgtg gctaacaggt ctctctggtg 42; ctggaaaaac aacgataagt tttgccctgg acct tgtctcccat gccatccctt 48L gttactccct ggatggggac aatgtccgtc atggccttaa cagaaatctc ggattctctc 54L ctggggacag agaggaaaat atccgccgga ttgctgaggt ggctaagctg tttgctgatg 60; ctggtctggt ctgcattacc agctttattt tcgc aaaggatcgt gagaatgccc 66L taca tgaatcagca gggctgccat aaat atttgtagat gcacctctaa 72; atatttgtga aagcagagac gtaaaaggcc tctataaaag ggccagagct ggggagatta 78; aaggatttac aggtattgat tctgattatg agaaacctga aactcctgag ctta 84; aaaccaattt gtccacagtg agtgactgtg tccaccaggt actt gagc 90; agaacattgt accctatact ataatcaaag atatccacga actctttgtg ccggaaaaca 96; aacttgacca Cgtccgagct gaggctgaaa ctctcccttc attatcaatt actaagctgg L02; atctccagtg ggtccaggtt ttgagcgaag gctgggccac tcccctcaaa ggtttcatgc L08; gggagaagga gtacttacag gttatgcact ttgacaccct gctagatggc cttc L14; ctgatggcgt gatcaacatg agcatcccca ttgtactgcc cgtctctgca gaggataaga L20; cacggctgga agggtgcagc aagtttgtcc atgg tggacggagg gtagctatct L26; tacgagacgc tgaattctat gaacacagaa aagaggaacg ctgttcccgt gtttggggga L32; caacatgtac cccc catatcaaaa tggtgatgga aagtggggac tggctggttg L38; gtggagacct tcaggtgctg gagaaaataa atga tgggctggac caataccgtc L44; tgacacctct ggagctcaaa cagaaatgta aagaaatgaa tgctgatgcg gtgtttgcat L50; tccagttgcg caatcctgtc cacaatggcc atgccctgtt gatgcaggac actcgccgca L56; ggctcctaga gaggggctac aagcacccgg tcctcctact acaccctctg ggcggctgga L621 ccaaggatga cgatgtgcct ctagactggc ggatgaagca gcacgcggct aaggggtcct ggatcccaag tcaaccattg tctt tccgtctccc ctggccccac agaggtccag tggcactgca ggtcccggat gattgcgggt gccaatttct L80; acattgtggg gagggaccct gcaggaatgc cccatcctga gaag aacccactca tgggggcaag gtcttgagca tggcccctgg cctcacctct gtggaaatca 192; ttccattccg agtggctgcc tacaacaaag ccaaaaaagc catggacttc tatgatccag 198; caaggcacaa tgagtttgac ttcatctcag gaactcgaat gaggaagctc gaag 204; gagagaatcc tggc gccc ccaaagcatg gaaggtcctg acagattatt 210; acaggtccct ggagaagaac taagcctttg gagt ttctttctga agtgctcttt 216; gattaccttt tctattttta tgattagatg atta aattgcttct caatgatgca 222; ttttaatctt ttataatgaa gttg tgtctataat taaaaaaaaa tata 228: tacacacaca catatacata caaagtcaaa ctgaagacca agca ggtaaaagca 234; atattcttat acatttcata ataaaattag ctctatgtat tttctactgc acctgagcag 240; gcaggtccca gatttcttaa ggctttgttt gaccatgtgt ctagttactt gctgaaaagt 246; gaatatattt tccagcatgt cttgacaacc cttc caatgtcatt tatcagttgt 252; aaaatatatc agattgtgtc ctcttctgta caattgacaa aaaaaaaaat ttttttttct 258; cactctaaaa gaggtgtggc tcacatcaag attcttcctg atattttacc tcatgctgta 264; caaagcctta atgttgtaat catatcttac gtgttgaaga cctgactgga gaaacaaaat 270; gtgcaataac gtgaatttta tcttagagat ctgtgcagcc tgtc acaaaagtta 276; ctaa taagagaagt cttaatggcc tctgtgaata atgtaactcc agttacacgg 282; tgacttttaa tagcatacag tgatttgatg aaaggacgtc aaacaatgtg gcgatgtcgt 288; ggaaagttat ctttcccgct tgtg gtcattgtgt cttgcagaaa ggatggccct 294; gatgcagcag cagcgccagc tgtaataaaa aataattcac actatcagac tagcaaggca 300; ctagaactgg aaaagaccac agaaaacaaa gaatccaacc ctttcatctt acaggtgaac 306; tgat gatgcacatg tatgtgtttt gtaagctgtg agcaccgtaa caaaatgtaa 312; atttgccatt attaggaagt gctggtggca gtgaagaagc acccaggcca cttgactccc 318; agtctggtgc cctgtctaca ccagacaaca caggagctgg gtcagattcc cctcagctgc 324; aagt tcctcgaaca gaaagtgctt acaaagctgc cttctcggat actgaaaggt 330; cgagttttct gaactgcact attg cagttgaaaa aaaaaaaaag ctattccaaa 336: gatttcaagc tgttctgaga catcttctga tggctttact gagg caatgttttt 3421 actttatgca taattcattg ttgccaagga ataaagtgaa gaaacagcac cttttaatat 348; ataggtctct ctggaagaga cctaaattag aaagagaaaa ctgtgacaat tttcatattc 354; tcattcttaa aaaacactaa tcttaactaa caaaagttct tttgagaata acac 360; aatggccaca gcagtttgtc tttaatagta tagtgcctat actcatgtaa tcggttactc 366; actactgcct ttaaaaaaaa aaaccagcat atttattgaa aacatgagac aggattatag 3721 tgccttaacc gatatatttt gtgacttaaa aaatacattt aaaactgctc tcta 3781 gtaccatgct tagtgcaaat gattatttct atgtacaact gatgcttgtt cttattttaa 3841 taaatttatc agagtgaaaa aaaaaaaaaa aaaa Protein seguence: NCBI Reference Seguence: NP_001015880.1 LOCUS: NP_001015880 ACCESSION: NP_001015880 L msgikqute nqqkstnvvy qahhvsrnkr gqvvgtrggf rgctvwltgl sgagkttisf 6L aleeylvsha ipcysldgdn vrhglnrnlg fspgdreeni rriaevaklf adaglvcits 12L fispfakdre narkihesag 1pffeifvda plnicesrdv kglykrarag eikgftgids 181 dyekpetper Vlktnlstvs dcvhqvvell qeqnivpyti ikdihelfvp enkldhvrae 241 lsit kldlquqvl plkg fmrekequv mhfdtlldgm alpdgvinms ; ipivlpvsae dktrlegcsk fvlahggrrv ailrdaefye hrkeercsrv wgttctkhph 36; ikmvmesgdw qvle kirwndgldq yrltplelkq kckemnadav npvh 42; nghallqut rrrllergyk hpvlllhplg gwtkdddvpl haav dpks 48; tivaifpspm 1yagptevqw hcrsrmiaga nfyivgrdpa gmphpetkkd lyepthggkv 54; lsmapgltsv eiipfrvaay nkakkamdfy dparhnefdf isgtrmrkla pdgf 60; mapkawkvlt dyyrslekn We

Claims (29)

claim:
1. A method for identifying a drug that causes or is at risk for causing druginduced cardiotoxicity, comprising: (1) determining a level of expression of one or more biomarkers in a cell sample 5 obtained following ent with a drug; and (2) comparing the level of expression of the one or more biomarkers present in the cell sample obtained ing treatment with the drug with a level of expression of the corresponding one or more biomarkers present in a cell sample obtained prior to treatment with the drug; 10 wherein the one or more biomarkers comprises coiled-coil domain containing 47 (CCDC47); and n a modulation in the level of expression of the one or more biomarkers in the sample obtained following treatment with the drug as compared to the level of expression of the corresponding one or more biomarkers present in the sample ed 15 prior to treatment with the drug is an indication that the drug causes or is at risk for causing drug-induced cardiotoxicity.
2. A method for identifying a rescue agent that can reduce or prevent druginduced cardiotoxicity comprising: (1) determining a level of sion of one or more biomarkers present in a cell 20 sample obtained following treatment with a cardiotoxicity inducing drug and a candidate rescue agent; and (2) comparing the level of expression of the one or more kers in the sample obtained following treatment with the cardiotoxicity inducing drug and the candidate rescue agent with the normal level of expression of the ponding one or 25 more biomarkers present in a cell sample obtained prior to treatment with the cardiotoxicity ng drug and candidate rescue agent; wherein the one or more biomarkers comprises coiled-coil domain ning 47 (CCDC47); and wherein a normalized level of expression of the one or more biomarkers in the 30 sample obtained following treatment with the cardiotoxicity inducing drug and the ate rescue agent as compared to the normal level of expression of the corresponding one or more biomarkers in the sample obtained prior to treatment with the cardiotoxicity inducing drug and the candidate rescue agent is an indication that the (11544621 candidate rescue agent is a rescue agent which can reduce or prevent nduced cardiotoxicity.
3. Use of a rescue agent identified by the method of claim 2 for the preparation of a medicament for alleviating, reducing or preventing drug-induced cardiotoxicity in a 5 subject.
4. The method of claim 1, wherein the one or more biomarkers further ses one or more biomarkers selected from the group ting of TIMP metallopeptidase inhibitor 1 (TIMP1), pentraxin 3 long (PTX3), heat shock 70kDa protein 6 (HSP76), fibronectin 1 (FINC), cytochrome b5 type A (CYB5), serpin ase tor clade E 10 member 1(PAI1), insulin-like growth factor binding protein 7 (IBP7 or IGFBP7), major histocompatibility complex class I C (1C17), EGF-like repeats and discoidin I-like domains 3 (EDIL3), heme oxygenase (decycling) 1 (HMOX1), nucleobindin 1 (NUCB1), some 19 open reading frame 10 (CS010), and heat shock 70kDa protein 4 15
5. The method of claim 4, wherein the drug-induced cardiotoxicity is cardiomyopathy, heart failure, atrial fibrillation, cardiomyopathy and heart failure, heart failure and LV dysfunction, atrial flutter and fibrillation, or heart valve damage and heart
6. The method of any one of claims 1, 2, 4 or 5, wherein the cell samples are 20 cardiomyocytes or diabetic cardiomyocytes.
7. The method of any one of claims 1, 2, or 4-6, wherein the drug is a cancer drug, diabetic drug, neurological drug, or anti-inflammatory drug.
8. The method of any one of claims 1, 2, or 4-7, wherein the drug is Anthracyclines, rouracil, Cisplatin, Trastuzumab, Gemcitabine, Rosiglitazone, 25 Pioglitazone, Troglitazone, Cabergoline, Pergolide, Sumatriptan, Bisphosphonates, or TNF antagonists.
9. The use of claim 3, wherein the subject is a mammal, a human, or a nonhuman animal.
10. The use of claim 3, wherein the rescue agent is for administration to a subject 30 that has already been treated with a cardiotoxicityinducing drug.
11. The use of claim 3, wherein the rescue agent is for administration to a subject at the same time as treatment of the subject with a cardiotoxicity-inducing drug.
12. The use of claim 3, wherein the rescue agent is for administration to a subject prior to treatment of the t with a cardiotoxicityinducing drug. (11544621
13. The use of claim 3, wherein the rescue agent is Coenzyme Q10.
14. The use of claim 3, wherein the rescue agent is not Coenzyme Q10.
15. The method of claim 2, wherein the one or more biomarkers further comprises one or more biomarkers selected from the group consisting of TIMP opeptidase 5 inhibitor 1 (TIMP1), pentraxin 3 long (PTX3), heat shock 70kDa protein 6 (HSP76), fibronectin 1 (FINC), cytochrome b5 type A (CYB5), serpin peptidase inhibitor clade E member 1(PAI1), insulin-like growth factor binding protein 7 (IBP7 or IGFBP7), major histocompatibility complex class I C (1C17), EGF-like repeats and discoidin I-like domains 3 (EDIL3), heme oxygenase (decycling) 1 (HMOX1), nucleobindin 1 (NUCB1), 10 chromosome 19 open g frame 10 (CS010), and heat shock 70kDa protein 4 (HSPA4).
16. A method for identifying a rescue agent for the prevention, reduction or treatment of drug-induced toxicity, comprising: (a) determining a level of one or more biomarkers in a first cell sample obtained 15 following treatment with a cardiotoxicity inducing drug; (b) ining the level of the one or more biomarkers in a second cell sample obtained following treatment with the cardiotoxicity inducing drug and a candidate rescue agent; and (c) comparing the level of the one or more biomarkers in the second cell sample 20 with the level of the corresponding one or more kers in the first cell sample; n the one or more biomarkers comprises coiled-coil domain containing 47 (CCDC47), and wherein a tion in the level of the one or more biomarkers in the second cell sample as compared to the first cell sample is an indication that the candidate rescue agent 25 is a rescue agent for the prevention, reduction or treatment of drug-induced cardiotoxicity.
17. The method of claim 16, r comprising comparing the level of the one or more biomarkers in the first and/or second cell sample with the level of the one or more biomarkers in a control cell , wherein the control cell sample is obtained prior to treatment with the cardiotoxicity inducing drug or the candidate rescue agent. 30
18. The method of claim 17, n a normalization of the level of the one or more biomarkers in the second cell sample as compared to the control cell sample is an indication that the candidate rescue agent is a rescue agent for the prevention, ion or treatment of drug-induced cardiotoxicity. (11544621
19. The method of claim 16, wherein the one or more biomarkers further comprises one or more biomarkers selected from the group consisting of TIMP metallopeptidase inhibitor 1 (TIMP1), pentraxin 3 long (PTX3), heat shock 70kDa protein 6 (HSP76), fibronectin 1 (FINC), cytochrome b5 type A (CYB5), serpin peptidase 5 inhibitor clade E member 1 (PAI1), insulin-like growth factor binding protein 7 (IBP7 or IGFBP7), major histocompatibility complex class I C (1C17), EGF-like repeats and discoidin I-like domains 3 (EDIL3), heme oxygenase (decycling) 1 (HMOX1), nucleobindin 1 (NUCB1), chromosome 19 open g frame 10 (CS010), and heat shock 70kDa protein 4 (HSPA4). 10
20. The method of any one of claims 1, 2 and 16, n the one or more biomarkers further comprises pentraxin 3 long (PTX3) or serpin peptidase tor clade E member 1 .
21. The method of any one of claims 1, 2 and 16, wherein the level of sion of the one or more biomarkers in the sample is determined using a technique to detect 15 mRNA, protein, cDNA, or genomic DNA.
22. The method of any one of claims 1, 2 and 16, wherein the level of expression of the one or more biomarkers in the sample is determined using a technique selected from the group consisting of polymerase chain reaction (PCR) amplification reaction, reverse-transcriptase PCR analysis, single-strand conformation rphism analysis 20 (SSCP), mismatch cleavage ion, heteroduplex analysis, Southern blot analysis, Northern blot analysis, Western blot is, in situ ization, array analysis, deoxyribonucleic acid sequencing, restriction fragment length polymorphism analysis, immunohistochemistry, immunocytochemistry, flow cytometry, ELISA, mass spectrometry, and combinations thereof. 25
23. The method of any one of claims 1, 2 and 16, wherein the treatment is d out in vitro.
24. The method of any one of claims 1, 2 and 16, n the ent is carried out in vivo.
25. The method of any one of claims 1, 2 and 16, wherein the cardiac cell sample 30 comprises cardiomyocytes.
26. The method of any one of claims 1, 2 and 16, wherein the level of CCDC47 protein expression is determined by using an antibody to CCDC47.
27. The method of claim 1, wherein an increase in level of expression of CCDC47 is an indication of drug-induced cardiotoxicity. (11544621
28. The method of claim 16, wherein a se in the level of expression of CCDC47 in the second cell sample as compared to the first cell sample is an indication that the candidate rescue agent is a rescue agent for the prevention, reduction or treatment of drug-induced toxicity. 5
29. The method of any one of claims 1, 2 and 16, wherein the drug-induced cardiotoxicity is drug-induced cardiomyopathy. Berg LLC By the Attorneys for the Applicant 10 SPRUSON & FERGUSON Per:
NZ701908A 2012-05-22 2012-09-07 Interrogatory cell-based assays for identifying drug-induced toxicity markers NZ701908B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
NZ722231A NZ722231B2 (en) 2012-05-22 2012-09-07 Interrogatory cell-based assays for identifying drug-induced toxicity markers

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261650462P 2012-05-22 2012-05-22
US61/650,462 2012-05-22
PCT/US2012/054323 WO2013176694A1 (en) 2012-05-22 2012-09-07 Interrogatory cell-based assays for indentifying drug-induced toxicity markers

Publications (2)

Publication Number Publication Date
NZ701908A NZ701908A (en) 2016-08-26
NZ701908B2 true NZ701908B2 (en) 2016-11-29

Family

ID=

Similar Documents

Publication Publication Date Title
AU2012381038B2 (en) Interrogatory cell-based assays for identifying drug-induced toxicity markers
AU2017267184B2 (en) Method for assessing a prognosis and predicting the response of patients with malignant diseases to immunotherapy
CN107255721A (en) Mark is used for the purposes for recognizing cardiac toxic agents
RU2721130C2 (en) Assessment of activity of cell signaling pathways using a linear combination(s) of target gene expression
RU2719194C2 (en) Assessing activity of cell signaling pathways using probabilistic modeling of expression of target genes
CN107077536B (en) Evaluation of activity of TGF-beta cell signaling pathway using mathematical modeling of target gene expression
KR102023584B1 (en) PREDICTING GASTROENTEROPANCREATIC NEUROENDOCRINE NEOPLASMS (GEP-NENs)
US20230416827A1 (en) Assay for distinguishing between sepsis and systemic inflammatory response syndrome
KR101421326B1 (en) Composition for predicting prognosis of breast cancer and kit comprising the same
KR20140140069A (en) Compositions and methods for diagnosis and treatment of pervasive developmental disorder
WO2003042661A2 (en) Methods of diagnosis of cancer, compositions and methods of screening for modulators of cancer
CA2442820A1 (en) Microarray gene expression profiling in clear cell renal cell carcinoma: prognosis and drug target identification
AU779411B2 (en) Biallelic markers derived from genomic regions carrying genes involved in arachidonic acid metabolism
AU2016331663A1 (en) Pathogen biomarkers and uses therefor
MXPA05005653A (en) Heart failure gene determination and therapeutic screening.
KR20220054401A (en) Systems, methods and compositions for rapid early-detection of host RNA biomarkers of infection and early identification of COVID-19 coronavirus infection in humans
KR20060045950A (en) Prognostic for hematological malignancy
JP2003304888A (en) Method for toxicity prediction of compound
KR20070099564A (en) Methods for assessing patients with acute myeloid leukemia
CN114127314A (en) Genetic genomes, methods and kits for identifying or classifying subtypes (subtypes) of breast cancer
KR20160059446A (en) Cancer panel for identification of genomic variations in cancer
US20020137077A1 (en) Genes regulated in activated T cells
JP2003235573A (en) Diabetic nephropathy marker and its utilization
KR20120119215A (en) Composition or kit and method for predicting prognosis of liver cancer
CN100516876C (en) Methods for diagnosing RCC and other solid tumors