CA2429860A1 - Endogenous and non-endogenous versions of human g protein-coupled receptors - Google Patents

Endogenous and non-endogenous versions of human g protein-coupled receptors Download PDF

Info

Publication number
CA2429860A1
CA2429860A1 CA002429860A CA2429860A CA2429860A1 CA 2429860 A1 CA2429860 A1 CA 2429860A1 CA 002429860 A CA002429860 A CA 002429860A CA 2429860 A CA2429860 A CA 2429860A CA 2429860 A1 CA2429860 A1 CA 2429860A1
Authority
CA
Canada
Prior art keywords
leu
val
ser
ile
ala
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002429860A
Other languages
French (fr)
Inventor
Ruoping Chen
Zhi Liang Chu
Huong T. Dang
Kevin P. Lowitz
Cameron Pride
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Arena Pharmaceuticals Inc
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CA2429860A1 publication Critical patent/CA2429860A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/705Receptors; Cell surface antigens; Cell surface determinants

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Molecular Biology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Immunology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Medicinal Chemistry (AREA)
  • Toxicology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Cell Biology (AREA)
  • Peptides Or Proteins (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention disclosed in this patent document relates to transmembrane receptors, more particularly to a human G protein-coupled receptor for which the endogenous ligand is unknown, and to mutated (non-endogenous) versions of the human GPCRs for evidence of constitutive activity.

Description

ENDOGENOUS AND NON-ENDOGENOUS VERSIONS OF
HUMAN G PROTEIN-COUPLED RECEPTORS
CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation-in-part of U.S. Serial Number 09/170,496, filed with the United States Patent and Trademark Office on October 13, 1998 and its corresponding PCT application number PCT/LJS99/23938, published as WO 00/22129 on April 20, 2000. This document claims the benefit of priority from the following provisional applications, all filed via U.S.~ Express Mail with the United States Patent and Trademark Office on the indicated dates: U.S. Provisional Number 60/253,404, filed November 27, 2000; U.S. Provisional Number 60/255,366, filed December 12, 2000;
U.S. Provisional Number 60/270,286 filed February 20, 2001; U.S. Provisional Number 60/282,356, filed April 6, 2001, which claims priority from U.S. Provisional Number 60/270,266, filed February 20, 2001; U.S. Provisional Number 60/282,032, filed April 6, 2001; U.S. Provisional Number 60/282,358, filed April 6, 2001; U. S.
Provisional Number 60/282,365, filed April 6, 2001; U.S. Provisional Number 60/290,917, filed May 14, 2001; U.S. Provisional Number 60/309,208, filed July 31, 2001; the disclosures of which are incorporated in their entirety by reference.
FIELD OF THE INVENTION
The present invention relates to transmembrane receptors, in some embodiments to G protein-coupled receptors and, in some preferred embodiments, to endogenous GPCRs that are altered to establish or enhance constitutive activity of the receptor. In some embodiments, the constitutively activated GPCRs will be used for the direct identification of candidate compounds as receptor agonists or inverse agonists having applicability as therapeutic agents.

BACKGROUND OF THE INVENTION
PATENT
Although a number of receptor classes exist in humans, by far the most abundant and therapeutically relevant is represented by the G protein-coupled receptor (GPCR) class.
It is estimated that there are some 30,000-40,000 genes within the human genome, and of these, approximately 2% are estimated to code for GPCRs. Receptors, including GPCRs, for which the endogenous ligand has been identified, are referred to as "known" receptors, while receptors for which the endogenous ligand has not been identified are referred to as "orphan" receptors.
GPCRs represent an important area for the development of pharmaceutical products: from approximately 20 of the 100 known GPCRs, approximately 60% of all prescription pharmaceuticals have been developed. For example, in 1999, of the top 100 brand name prescription drugs, the following drugs interact with GPCRs (diseases and/or disorders treated are indicated in parentheses):
Claritin~ (allergies)Prozac~ (depression)Vasotec~ (hypertension) Paxil~ (depression) Zoloftfl (depression)Zyprexa ~ (psychotic disorder) Cozaar~ (hypertension)Imitrex~ (migraine)Zantac~ (reflux) Propulsid~ (reflux Risperdal~ (schizophrenia)Serevent~ (astlnna) disease) Pepcid~ (reflux) Gaster~ (ulcers) Atrovenifl (bronchospasm) Effexor~ (depression)Depakote~ (epilepsy)Cardura~ (prostatic hypertrophy) Allegra~ (allergies) Lupron~ (prostate Zoladex~ (prostate cancer) cancer) Diprivan~ (anesthesia)BuSpar~ (anxiety) Ventolin~ (bronchospasm) Hytrin~ (hypertension)Wellbutrin~ (depression)Zyrtec~ (rhinitis) Plavix~ (MI/stroke) Toprol-XL~ (hypertension)Tenormin~ (angina) Xalatan~ (glaucoma)Singulair~ (asthma)Diovan~ (hypertension) Harnal~ (prostatic hyperplasia) (Med Ad News 1999 Data).
GPCRs share a common structural motif, having seven sequences of between 22 to 24 hydrophobic amino acids that form seven alpha helices, each of which spans the membrane (each span is identified by number, i.e., transmembrane-1 (TM-1), transmebrane-2 (TM-2), etc.). The transmembrane helices are joined by strands of amino acids between transmembrane-2 and transmembrane-3, transmembrane-4 and transmembrane-5, and transmembrane-6 and transmembrane-7 on the exterior, or "extracellular" side, of the cell membrane (these are referred to as "extracellular"
regions 1, 2 and 3 (EC-1, EC-2 and EC-3), respectively). The transmembrane helices are also joined by strands of amino acids between transmembrane-1 and transmembrane-2, transmembrane-3 and transmembrane-4, and transmembrane-5 and transmembrane-6 on the interior, or "intracellular" side, of the cell membrane (these are referred to as "intracellular"
regions 1, 2 and 3 (IC-1, IC-2 and IC-3), respectively). The "carboxy" ("C") terminus of the receptor lies in the intracellular space withili the cell, and the "amino"
("N") terminus of the receptor lies in the extracellular space outside of the cell.
Generally, when an endogenous ligand binds with the receptor (often referred to as "activation" of the receptor), there is a change in the conformation of the intracellular region that allows for coupling between the intracellular region and an intracellular "G-protein." It has been reported that GPCRs are "promiscuous" with respect to G proteins, i.
e., that a GPCR can interact with more than one G protein. See, I~enal~in, T., 43 Life Sciences 1095 (1988). Although other G proteins exist, currently, Ga, GS, G;, GZ and Go are G proteins that have been identified. Ligand-activated GPCR coupling with the G-protein initiates a signaling cascade process (referred to as "signal transduction"). Under normal conditions, signal transduction ultimately results in cellular activation or cellular inlubition. Although not wishing to be bound to theory, it is thought that the IC-3 loop as well as the carboxy terminus of the receptor interact with the G protein.
Under physiological conditions, GPCRs exist in the cell membrane in equilibrium between two different conformations: an "inactive" state and an "active"
state. A receptor in an inactive state is unable to link to the intracellular signaling transduction pathway to initiate signal transduction leading to a biological response. Changing the receptor conformation to the active state allows linkage to the transduction pathway (via the G-protein) and produces a biological response.
A receptor may be stabilized in an active state by a ligand or a compound such as a drug. Recent discoveries, including but not exclusively limited to modifications to the amino acid sequence of the receptor, provide means other than ligands or drugs to promote and stabilize the receptor in the active state conformation. These means effectively stabilize the receptor in an active state by simulating the effect of a ligand binding to the receptor.
Stabilization by such ligand-independent means is termed "constitutive receptor activation."
Disclosed herein are endogenous and non-endogenous versions of human GPCRs and uses thereof.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.N0.:2, non-endogenous, constitutively activated versions of the same, and host cells comprising the same..
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.N0.:1 and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.N0.:4, non-endogenous, constitutively activated versions of the same, and host cells comprising the same..
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.N0.:3, non-endogenous, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.N0.:6, non-endogenous, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.NO.:S and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:B, non-endogenous, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.N0.:7, non-endogenous, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:10, non-endogenous, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.N0.:9 and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.N0.:12, non-endogenous, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.NO.:11, and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.N0.:14, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.NO.:13 and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.N0.:16, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.NO.:15 and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.1D.N0.:18, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.NO.:17 and host cells comprising the same.
Some embodiments of the present invention relate to a G protein-coupled receptor encoded by an amino acid sequence of SEQ.1D.NO.:20, constitutively activated versions of the same, and host cells comprising the same.
Some embodiments of the present invention relate to a plasmid comprising a vector and the cDNA of SEQ.ID.N0.:19 and host cells comprising the same.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 is a graphic representation of activation of RUP32, Ga(del)/G, RUP32 co-transfected with Gg(del)/G;, and CMV (control; expression vector) in a second messenger assay measuring the accumulation of inositol phosphate (IP3) utilizing 293 cells.
Figure 2 provides an illustration of second messenger IP3 production from endogenous version RUP35 and RUP36 as compared with the control ("CMV").
DETA~ED DESCRIPTION
The scientific literature that has evolved around receptors has adopted a number of terms to refer to ligands having various effects on receptors. For clarity and consistency, the following definitions will be used throughout this patent document. To the extent that these definitions conflict with other definitions for these terms, the following definitions shall control:
AGONISTS shall mean materials (e.g., ligands, candidate compounds) that activate the intracellular response when they bind to the receptor, or enhance GTP
binding to membranes. In some embodiments, AGONISTS are those materials not previously known to activate the intracellular response when they bind to the receptor or to enhance GTP
binding to membranes.
AMINO ACID ABBREVIATIONS used herein are set out in Table A:
TABLE A
ALANINE ALA A

ARGININE ARG R

ASPARAGINE ASN N

ASPARTIC ACID ASP D

CYSTEINE CYS C

GLUTAMIC ACID GLU E

GLYCINE GLY G

HISTIDINE HIS H

ISOLEUCINE ILE I

LYSINE LYS K
METHIONINE MET M

PHENYLALANINE PHE F

PROLINE PRO P

SERINE SER S

THREONINE THR T

TRYPTOPHAN TRP W

TYROSINE TYR Y

VALINE VAL V

ANTAGONIST shall mean materials (e.g., ligazids, candidate compounds) that competitively bind to the receptor at the same site as the agonists but which do not activate the intracellular response initiated by the active form of the receptor, and can thereby inhibit the intracellular responses by agonists. ANTAGOI~IISTS do not diminish the baseline intracellular response in the absence of an agonist. In some embodiments, ANTAGOI~IISTS are those materials not previously known to activate the intracellular response when they bind to the receptor or to enhance GTP binding to membranes.
CANDIDATE COMPOUND shall mean a molecule (for example, and not limitation, a chemical compound) that is amenable to a screening technique.
Preferably, the phrase "candidate compound" does not include compounds which were publicly known to be compounds selected from the group consisting of inverse agonist, agonist or antagonist to a receptor, as previously determined by an indirect identification process ("indirectly identified compound"); more preferably, not including an indirectly identified compound which has previously been determined to have therapeutic efficacy in at least one mammal;
and, most preferably, not including an indirectly identified compound which has previously been determined to have therapeutic utility in humans.
COMPOSITION means a material comprising at least one component; a "pharmaceutical composition" is an example of a composition.
COMPOUND EFFICACY shall mean a measurement of the ability of a compound to inhibit or stimulate receptor functionality; i.e. the ability to activate/inhibit a signal transduction pathway, as opposed to receptor binding affinity.
Exemplary means of detecting compound efficacy are disclosed in the Example section of this patent document.
CODON shall mean a grouping of three nucleotides (or equivalents to nucleotides) which generally comprise a nucleoside (adenosine (A), guanosine (G), cytidine (C), uridine (U) and thymidine (T)) coupled.to a phosphate group and which, when translated, encodes an amino acid.
CONSTITUTIVELY ACTIVATED RECEPTOR shall mean a receptor subjected to constitutive receptor activation. A constitutively activated receptor can be endogenous or non-endogenous.
CONSTITUTIVE RECEPTOR ACTIVATION shall mean stabilization of a receptor in the active state by means other than binding of the receptor with its ligand or a chemical equivalent thereof.
CONTACT or CONTACTING shall mean bringing at least two moieties together, whether in an in vitro system or an in vivo system.
DIRECTLY IDENTIFYING or DIRECTLY IDENTIFIED, in relationship to the phrase "candidate compound", shall mean the screening of a candidate compound against a constitutively activated receptor, preferably a constitutively activated orphan receptor, and most preferably against a coiistitutively activated G protein-coupled cell surface orphan receptor, and assessing the compound efficacy of such compound.
This phrase is, under no circumstances, to be interpreted or understood to be encompassed by or to encompass the phrase "indirectly identifying" or "indirectly identified."
ENDOGENOUS shall mean a material that a mammal naturally produces.
ENDOGENOUS in reference to, for example and not limitation, the term "receptor," shall mean that which is naturally produced by a mammal (for example, and not limitation, a human) or a virus. By contrast, the term NON-ENDOGENOUS in this context shall mean that which is not naturally produced by a mammal (for example, and not limitation, a human) or a virus. For example, and not limitation, a receptor which is not constitutively active in its endogenous form, but when manipulated becomes constitutively active, is most preferably referred to herein as a "non-endogenous, constitutively activated receptor." Both terms can be utilized to describe both "in vivo" and "in vitro" systems. For example, and not limitation, in a screening approach, the endogenous or non-endogenous receptor may be in reference to an in vitro screening system. As a further example and not limitation, where the genome of a mammal has been manipulated to include a non-endogenous constitutively activated receptor, screening of a candidate compound by means of an in vivo system is viable.
G PROTEIN COUPLED RECEPTOR FUSION PROTEIN and GPCR
FUSION PROTEIN, in the context of the invention disclosed herein, each mean a non-endogenous protein comprising an endogenous, constitutively activate GPCR or a non-endogenous, constitutively activated GPCR fused to at least one G protein, most preferably the alpha (oc) subunit ~of such G protein (this being the subunit that binds GTP), with the G
protein preferably being of the same type as the G protein that naturally couples with endogenous orphan GPCR. For example, and not limitation, in an endogenous state, if the G protein "Gsoc" is the predominate G protein that couples with the GPCR, a GPCR Fusion Protein based upon the specific GPCR would be a non-endogenous protein comprising the GPCR fused to Gsa; in some circumstances, as will be set forth below, a non-predominant G protein can be fused to the GPCR. The G protein can be fused directly to the C-terminus of the constitutively active GPCR or there may be spacers between the two.
HOST CELL shall mean a cell capable of having a Plasmid and/or Vector incorporated therein. In the case of a prokaryotic Host Cell, a Plasmid is typically replicated as a autonomous molecule as the Host Cell replicates (generally, the Plasmid is thereafter isolated, for introduction into a eukaryotic Host Cell); in the case of a eulcaryotic Host Cell, a Plasmid is integrated into the cellular DNA of the Host Cell such that when the eukaryotic Host Cell replicates, the Plasmid replicates. In some embodiments the Host Cell is eukaryotic, more preferably, mammalian, and most preferably selected from the group consisting of 293, 293T and COS-7 cells.
INDIRECTLY IDENTIFYING or INDIRECTLY IDENTIFIED means the traditional approach to the drug discovery process involving identification of an endogenous ligand specific for an endogenous receptor, screening of candidate compounds against the receptor for determination of those which interfere andlor compete with the ligand-receptor interaction, and assessing the efficacy of the compound for affecting at least one second messenger pathway associated with the activated receptor.
INHIBIT or INHIBITING, in relationship to the term "response" shall mean that a response is decreased or prevented in the presence of a compound as opposed to in the absence of the compound.
INVERSE AGONISTS shall mean materials (e.g., ligand, candidate compound) which bind to either the endogenous form of the receptor or to the constitutively activated form of the receptor, and which inhibit the baseline intracellular response initiated by the active form of the receptor below the normal base level of activity which is observed in the absence of agonists, or decrease GTP binding to membranes. Preferably, the baseline intracellular response is inhibited in the presence of the inverse agonist by at least 30%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, and most preferably at least 99% as compared with the baseline response in the absence of the inverse agonist.
KNOWN RECEPTOR shall' mean an endogenous receptor for which the endogenous ligand specific for that receptor has been identified.
LIGAND shall~mean a molecule specific for a naturally occurring receptor.
MUTANT or MUTATION in reference to an endogenous receptor's nucleic acid andlor amino acid sequence shall mean a specified change or changes to such endogenous sequences such that a mutated form of an endogenous, non-constitutively activated receptor evidences constitutive activation of the receptor. In terms of equivalents to specific sequences, a subsequent mutated form of a human receptor is considered to be equivalent to a first mutation of the human receptor if (a) the level of constitutive activation of the subsequent mutated form of a human receptor is substantially the same as that evidenced by the first mutation of the receptor; and (b) the percent sequence (amino acid and/or nucleic acid) homology between the subsequent mutated form of the receptor and the first mutation of the receptor is at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, and most preferably at least 99%. In some embodiments, owing to the fact that some preferred cassettes disclosed herein for achieving constitutive activation include a single amino acid and/or codon change between the endogenous and the non-endogenous forms of the GPCR, it is preferred that the percent sequence homology should be at least 98%.

NON-ORPHAN RECEPTOR shall mean an endogenous naturally occurring molecule specific for an identified ligand wherein the binding of a ligand to a receptor activates an intracellular signaling pathway.
ORPHAN RECEPTOR shall mean an endogenous receptor for which the ligand specific for that receptor has not been identified or is not known.
PHARMACEUTICAL COMPOSITION shall mean a composition comprising at least one active ingredient, whereby the composition is amenable to investigation for a specified, efficacious outcome in a mammal (for example, and not limitation, a human).
Those of ordinary skill in the art will understand and appreciate the techniques appropriate for determining whether an active ingredient has a desired efficacious outcome based upon the needs of the artisan.
PLASMID shall mean the combination of a Vector and cDNA. Generally, a Plasmid is introduced into a Host Cell for the purposes of replication and/or expression of the cDNA as a protein.
SECOND MESSENGER shall mean an intracellular response produced as a result of receptor activation. A second messenger can include, for example, inositol triphosphate (1P3), diacycglycerol (DAG), cyclic AMP (cAMP), and cyclic GMP (cGMP). Second messenger response can be measured for a determination of receptor activation.
1n addition, second messenger response can be measured for the direct identification of candidate compounds, including for example, inverse agonists, agonists, and antagonists.
SIGNAL TO NOISE RATIO shall mean the signal generated in response to activation, amplification, or stimulation wherein the signal is above the baclcground noise or the basal level in response to non-activation, non-amplification, or non-stimulation.
SPACER shall mean a translated number of amino acids that are located after the last codon or last amino acid of a gene, for example a GPCR of interest, but before the start codon or beginning regions of the G protein of interest, wherein the translated number amino acids are placed in-frame with the beginnings regions of the G protein of interest.
The number of translated amino acids can be tailored according to the needs of the skilled artisan and is generally from about one amino acid, preferably two amino acids, more preferably three amino acids, more preferably four amino acids, more preferably five amino acids, more preferably six amino acids, more preferably seven amino acids, more preferably eight amino acids, more preferably. nine amino acids, more preferably ten amino acids, more preferably eleven amino acids, and even more preferably twelve amino acids.
STIMULATE or STIMULATING, in relationship to the term "response" shall mean that a response is increased in the presence of a compound as opposed to in the absence of the compound.
SUBSTANTIALLY shall refer to a result which is within 40% of a control result, preferably within 35%, more preferably within 30%, more preferably within 25%, more preferably within 20%, more preferably within 15%, more preferably within 10%, more preferably within 5%, more preferably within 2%, and most preferably within 1%
of a control result. For example, in the context of receptor functionality, a test receptor may exhibit substantially similar results to a control receptor if the transduced signal, measured using a method taught herein or similar method known to the art-skilled, if within 40% of the signal produced by a control signal.
VECTOR in reference to cDNA shall mean a circular DNA capable of incorporating at least one cDNA and capable of incorporation into a Host Cell.
The order of the following sections is set forth for presentational efficiency and is not intended, nor should be construed, as a limitation on the disclosure or the claims to follow.
A. Introduction The traditional study of receptors has typically proceeded from the a priori assumption (historically based) that the endogenous ligand must first be identified before discovery could proceed to find antagonists and other molecules that could affect the receptor. Even in cases where an antagonist might have been known first, the search immediately extended to looking for the endogenous ligand. This mode of thinking has persisted in receptor research even after the discovery of constitutively activated receptors.
What has not been heretofore recognized is that it is the active state of the receptor that is most useful for discovering agonists and inverse agonists of the receptor. For those diseases which result from an overly active receptor or an under-active receptor, what is desired in a therapeutic drug is a compound which acts to diminish the active state of a receptor or enhance the activity of the receptor, respectively, not necessarily a drug which is an antagonist to the endogenous ligand. This is because a compound that reduces or enhances the activity of the active receptor state need not bind at the same site as the endogenous ligand. Thus, as taught by a method of this invention, any search for therapeutic compounds should start by screening compounds against the ligand-independent active state.
B. Identification of Human GPCRs The efforts of the Human Genome project has led to the identification of a plethora of information regarding nucleic acid sequences located within the human genome; it has been the case in this endeavor that genetic sequence information has been made available without an understanding or recognition as to whether or not any particular genomic sequence does or may contain open-reading frame information that translate human proteins. Several methods of identifying nucleic acid sequences within the human genome are within the purview of those having ordinary shill in the art. For example, and not limitation, a variety of human GPCRs, disclosed herein, were discovered by reviewing the GenBankTM database. Table B, below, lists several endogenous GPCRs that we have discovered, along with other GPCRs that are homologous to the disclosed GPCR.
TABLE S
Disclosed Accession Open Reading Reference Per Cent Human Number Frame To Homology Orphan GPCRsIdentified (Base Pairs) Homologous To Designated GPCR GPCR

hRUP28 AC073957 1,002bp hGPR30 34%

liRUP29 AC083865 918bp hGPRl8 27%

hRUP30 AC055863 1,125bp hBlZB1 27%

hRUP31 AL356214 1,086bp hGALR-1 31%

liRUP32 AL513524 1,038bp hPNR 43%

hRUP33 AL513524 1,020bp GPR57 50%
GPR58 51%

hRUP34 AL513524 1,029bp hPNR 45%

hRUP35 AC021089 1,062bp hK-type 27%

opioid hRUP36 AC090099 969bp GPR90 42%

hRUP37 ~ AC090099 969bp hMRG 41%
Receptor homology is useful in terms of gaining an appreciation of a role of the receptors within the human body. As the patent document progresses, techniques for mutating these receptors to establish non-endogenous, constitutively activated versions of these receptors will be discussed.
The techniques disclosed herein are also applicable to other human GPCRs known to the art, as will be apparent to those skilled in the art.
C. Receptor Screening Screening candidate compounds against a non-endogenous, constitutively activated version of the GPCRs disclosed herein allows for the direct identification of candidate compounds which act at the cell surface receptor, without requiring use of the receptor's endogenous ligand. Using routine, and often commercially available techniques, one can determine areas within the body where the endogenous version of human GPCRs disclosed herein is expressed and/or over-expressed. The expression location of a receptor in a specific tissue provides a scientist with the ability to assign a physiological functional role of the receptor. It is also possible using these techniques to determine related disease/disorder states which are associated with the expression and/or over-expression of the receptor; such an approach is disclosed in this patent document.
Furthermore, expression of a receptor in diseased organs can assist one in determinilig the magnitude of the clinical relevance of the receptor.
Constitutive activation of the GPCRs disclosed herein is based upon the distance from the proline residue at which is presumed to be located within TM6 of the GPCR; this algorithmic technique is disclosed in co-pending and commonly assigned patent document PCT Application Number PCT/US99/2393~, published as WO 00/22129 on April 20, 2000, wluch, along with the other patent documents listed herein, is incorporated herein by reference. The algorithmic technique is not predicated upon traditional sequence "alignment" but rather a specified distance from the aforementioned TM6 proline residue (or, of course, endogenous constitutive substitution for such proline residue). By mutating the amino acid residue located 16 amino acid residues from this residue (presumably located in the IC3 region of the receptor) to, most preferably, a lysine residue, constitutive activation of the receptor may be obtained. Other amino acid residues may be useful in the mutation at this position to achieve this objective and will be discussed in detail, below.
D. Disease/Disorder Identification and/or Selection As will be set forth in greater detail below, inverse agonists and agonists to the non-endogenous, constitutively activated GPCR can be identified by the methodologies of this invention. Such inverse agonists and agonists are ideal candidates as lead compounds in drug discovery programs for treating diseases related to this receptor.
Because of the ability to directly identify inverse agonists to the GPCR, thereby allowing for the development of pharmaceutical compositions, a search for diseases and disorders associated with the GPCR
is relevant. The expression location of a receptor in a specific tissue provides a scientist with the ability to assign a physiological function to the receptor. For example, scanning both diseased and normal tissue samples for the presence of the GPCR now becomes more than an academic exercise or one which might be pursued along the path of identifying an endogenous ligand to the specific GPCR. Tissue scans can be conducted across a broad range of healthy and diseased tissues. Such tissue scans provide a potential first step in associating a specific receptor with a disease and/or disorder. Furthermore, expression of a receptor in diseased organs can assist one in determining the magnitude of clinical relevance of the receptor.
The DNA sequence of the GPCR can be used to make a probe/primer. In some preferred embodiments the DNA sequence is used to make a probe for (a) dot-blot analysis against tissue-mRNA, and/or (b) RT-PCR identification of the expression of the receptor in tissue samples. The presence of a receptor in a tissue source, or a diseased tissue, or the presence of the receptor at elevated concentrations in diseased tissue compared to a normal tissue, can be used to correlate location to function and indicate the receptor's physiological role/function and create a treatment regimen, including but not limited to, a disease associated with that function/role. Receptors can also be localized to regions of organs by this technique. Based on the known or assumed roles/functions of the specific tissues to which the receptor is localized, the putative physiological function of the receptor can be deduced. For example and not limitation, proteins located/expressed in areas of the thalamus are associated with sensorimotor processing and arousal (see, Goodman &
Gilman's, The Pharmacological Basis of Therapeutics, 9th Edition, page 465 (1996)).
Proteins expressed in the hippocampus or in Schwann cells are associated with learning and memory, and myelination of peripheral nerves, respectively (see, Kandel, E. et al., Essentials of Neural Science and Behavior pages 657, 680 and 28, respectively (1995)).
E. Screening of Candidate Compounds 1. Generic GPCR screening assay techniques When a G protein receptor becomes constitutively active, it binds to a G
protein (e.g., Gq, GS, G, GZ, Go) and stimulates the binding of GTP to the G protein.
The G protein then acts as a GTPase and hydrolyzes the GTP to GDP, whereby the receptor, under normal conditions, becomes deactivated. However, constitutively activated receptors continue to exchange GDP to GTP. A non-hydrolyzable analog of GTP, [35S]GTPyS, can be.
used to monitor enhanced binding to membranes which express constitutively activated receptors.
It is reported that [35S]GTPyS can be used to monitor G protein coupling to membranes in the absence and presence of ligand. An example of this monitoring, among other examples well-knomn and available to those in the art, was reported by Traynor and Nahorski in 1995. The use of this assay system is typically for initial screening of candidate compounds because the system is generically applicable to all G protein-coupled receptors regardless of the particular G protein that interacts with the intracellular domain of the receptor.
2. Specific GPCR screening assay techniques Once candidate compounds are identified using the "generic" G protein-coupled receptor assay (i.e., an assay to select compounds that are agonists or inverse agonists), further screening to confirm that the compounds have interacted at the receptor site is preferred. For example, a compound identified by the "generic" assay may not bind to the receptor, but may instead merely "uncouple" the G protein from the intracellular domain.
a. GS, GZ atzd G~.
GS stimulates the enzyme adenylyl cyclase. G; (and GZ and Go), on the other hand, inhibits adenylyl cyclase. Adenylyl cyclase catalyzes the conversion of ATP to cAMP;
thus, constitutively activated GPCRs that couple the GS protein are associated with increased cellular levels of cAMP. On the other hand, constitutively activated GPCRs that couple Gi (or GZ, Go) protein are associated with decreased cellular levels of cAMP. See, generally, "Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3rd Ed.) Nichols, J.G. et al eds. Sinauer Associates, Inc. (1992).
Thus, assays that detect cAMP can be utilized to determine if a candidate compound is, e.g., an inverse agonist to the receptor (i.e., such a compound would decrease the levels of cAMP). A
variety of approaches known in the art for measuring cAMP can be utilized; a most preferred approach relies upon the use of anti-cAMP antibodies in an ELISA-based format.
Another type of assay that can be utilized is a whole cell second messenger reporter system assay. Promoters on genes drive the expression of the proteins that a particular gene encodes. Cyclic AMP drives gene expression by promoting the binding of a cAMP-responsive DNA binding protein or transcription factor (CREB) that then binds to the promoter at specific sites (CAMP response elements) and drives the expression of the gene.
Reporter systems can be constructed which have a promoter containing multiple cAMP
response elements before the reporter gene, e.g., (3-galactosidase or luciferase. Thus, a constitutively activated GS linked receptor causes the accumulation of cAMP
that then activates the gene and leads to the expression of the reporter protein. The reporter protein such as (3-galactosidase or luciferase can then be detected using standard biochemical assays (Chen et al. 1995).
b. Go ahd G9.
Gq and Go are associated with activation of the enzyme phospholipase C, which in turn hydrolyzes the phospholipid PIPZ, releasing two intracellular messengers:
diacycloglycerol (DAG) and inositol 1,4,5=triphoisphate (IP3). Increased accumulation of IP3 is associated with activation of Gq and Go-associated receptors. See, gehe~ally, "Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3rd Ed.) Nichols, J.G. et al eds. Siilauer Associates, Inc. (1992). Assays that detect IP3 accumulation can be utilized to determine if a candidate compound is, e.g., an inverse agonist to a Gq or Go-associated receptor (i. e., such a compound would decrease the levels of IP3). Gq associated receptors can also be examined using an AP1 reporter assay wherein Gq dependent phospholipase C causes activation of genes containing AP1 elements;
thus, activated Gg associated receptors will evidence an increase in the expression of such genes, whereby inverse agonists thereto will evidence a decrease in such expression, and agonists will evidence an increase in such expression. Commercially available assays for such detection are available.
3. GPCR Fusion Protein The use of an endogenous, constitutively activated GPCR or a non-endogenous, constitutively activated GPCR, for use in screening of candidate compounds for the direct identification of inverse agonists, agoiusts provide an interesting screening challenge in that, by definition, the receptor is active even in the absence of an endogenous ligand bound thereto. Thus, in order to differentiate between, e.g., the non-endogenous receptor in the presence of a candidate compound and the non-endogenous receptor in the absence of that compound, twith an aim of such a differentiation to allow for an understanding as to whether such compound may be an inverse agonist or agonist or have no affect on such a receptor, it is preferred that an approach be utilized that can enhance such differentiation. A preferred approach is the use of a GPCR Fusion Protein.
Generally, once it is determined that a non-endogenous GPCR has been constitutively activated using the assay techniques set forth above (as well as others), it is possible to determine the predominant G protein that couples with the endogenous GPCR.
Coupling of the G protein to the GPCR provides a signaling pathway that can be assessed.
In some embodiments it is preferred that screening take place using a mammalian expression system, such a system will be expected to have endogenous G protein therein.
Thus, by definition, in such a system, the non-endogenous, constitutively activated GPCR
will continuously signal. In some embodiments it is preferred that this signal be enhanced such that in the presence of, e.g., an inverse agonist to the receptor, it is more likely that it will be able to more readily differentiate, particularly in the context of screening, between the receptor when it is contacted with the inverse agonist.
The GPCR Fusion Protein is intended to enhance the efficacy of G protein coupling with the non-endogenous GPCR. The GPCR Fusion Protein is preferred for screening with either an endogenous, constitutively active GPCR or a non-endogenous, constitutively activated GPCR because such an approach increases the signal that is utilized in such screening techniques. This is important in facilitating a significant "signal to noise" ratio;
such a significant ratio is preferred for the screening of candidate compounds as disclosed herein.
The construction of a construct useful for expression of a GPCR Fusion Protein is within the purview of those having ordinary skill in the art. Commercially available expression vectors and systems offer a variety of approaches that can fit the particular needs of an investigator. Important criteria on the construction of such a GPCR
Fusion Protein construct include but are not limited to, that the endogenous GPCR sequence and the G
protein sequence both be in-frame (preferably, the sequence for the endogenous GPCR is upstream of the G protein sequence), and that the "stop" codon of the GPCR be deleted or replaced such that upon expression of the GPCR, the G protein can also be expressed.
Other embodiments include constructs wherein the endogenous GPCR sequence and the G
protein sequence are not in-frame and/or the "stop" codon is not deleted or replaced. The GPCR can be linked directly to the G protein, or there can be spacer residues between the two (preferably, no more than about 12, although this number can be readily ascertained by one of ordinary skill in the art). Based upon convenience it is preferred to use a spacer.
Preferably, the G protein that couples to the non-endogenous GPCR will have been identified prior to the creation of the GPCR Fusion Protein construct. Because there are only a few G proteins that have been identified, it is preferred that a construct comprising the sequence of the G protein (i.e., a universal G protein construct (see Examples)) be available for insertion of an endogenous GPCR sequence therein; this provides for further efficiency in the context of large-scale screening of a variety of different endogenous GPCRs having different sequences.
As noted above, constitutively activated GPCRs that couple to G, GZ and Go are expected to inhibit the formation of cAMP making assays based upon these types of GPCRs challenging (i.e., the cAMP signal decreases upon activation thus malting the direct identification of, e.g., inverse agonists (which would further decrease this signal), challenging. As will be disclosed herein, we have ascertained that for these types of receptors, it is possible to create a GPCR Fusion Protein that is not based upon the GPCRs endogenous G protein, in an effort to establish a viable cyclase-based assay.
Thus, for example, an endogenous G; coupled receptor can be fused to a GS protein -such a fusion construct, upon expression, "drives" or "forces" the endogenous GPCR to couple with, e.g., GS rather than the "natural" G, protein, such that a cyclase-based assay can be established.
Thus, for G, G~ and Go coupled receptors, in some embodiments it is preferred that when a GPCR Fusion Protein is used and the assay is based upon detection of adenylyl cyclase activity, that the fusion construct be established with GS (or an equivalent G
protein that stimulates the formation of the enzyme adenylyl cyclase).
G Effect of CAMP Effect of IP3 Effect of Effect on Il'3 protein Production Accumulation cAMP Accumulation upon Activation upon ActivationProductionupon contact of GPCR (i.e., of GPCR (i.e.,upon with an Inverse constitutive constitutive contact Agonist activation activation with an or or agonist binding)agonist binding)Inverse Agonist GS Increase N/A Decrease N/A

G; Decrease N/A Increase N/A

GZ Decrease N/A Increase N/A

Go Decrease Increase Increase Decrease Gq N/A Increase NIA Decrease Equally effective is a G Protein Fusion construct that utilizes a Gq Protein fused with a GS, G, GZ or Go Protein. In some embodiments a preferred fusion construct can be accomplished with a Gg Protein wherein the first six (6) amino acids of the G-protein a-subunit ("Gaq") is deleted and the last five (5) amino acids at the C-terminal end of Gaq is replaced with the corresponding amino acids of the Ga of the G protein of interest. For example, a fusion construct can have a Gq (6 amino acid deletion) fused with a G; Protein, resulting in a "Gq/G Fusion Construct". This fusion construct will forces the endogenous G; coupled receptor to couple to its non-endogenous G protein, Gg, such that the second messenger, for example, inositol triphosphate or diacylgycerol, can be measured ifZ lieu of cAMP production.
4. Co-transfection of a Target G; Coupled GPCR with a Signal-Enhancer GS Coupled GPCR (CAMP Based Assays) A G; coupled receptor is known to inhibit adenylyl cyclase, and, therefore, decreases the level of cAMP production, which can malce assessment of cAMP levels challenging. An effective technique in measuring the decrease in production of cAMP as an indication of constitutive activation of a receptor that predominantly couples G; upon activation can be accomplished by co-transfecting a signal enhancer, e.g., a non-endogenous, constitutively activated receptor that predominantly couples with GS upon activation (e.g., TSHR-A623I, disclosed below), with the G; linked GPCR. As is apparent, constitutive activation of a GS

coupled receptor can be determined based upon an increase in production of cAMP.
Constitutive activation of a G coupled receptor leads to a decrease in production cAMP.
Thus, the co-transfection approach is intended to advantageously exploit these "opposite"
affects. For example, co-transfection of a non-endogenous, constitutively activated GS
coupled receptor (the "signal enhancer") with the endogenous G; coupled receptor (the "target receptor") provides a baseline cAMP signal (i. e., although the G;
coupled receptor will decrease cAMP levels, this "decrease" will be relative to the substantial increase in cAMP levels established by constitutively activated GS coupled signal enhancer). By then co-transfecting the signal enhancer with a constitutively activated version of the target receptor, cAMP would be expected to fiufiher decrease (relative to base line) due to the increased functional activity of the G; target (i.e., which decreases cAMP).
Screening of candidate compounds using a cAMP based assay can then be accomplished, with two 'changes' relative to the use of the endogenous receptor/G-protein fusion: first, relative to the G, coupled target receptor, "opposite" effects will result, i. e., an inverse agonist of the G, coupled target receptor will increase the measured cAMP signal, while an agonist of the G; coupled target receptor will decrease this signal;
second, as would be apparent, candidate compounds that are directly identified using this approach should be assessed iildependently to ensure that these do not target the signal enhancing receptor (this can be done prior to or after screening against the co-transfected receptors).
F. Medicinal Chemistry Generally, but not always, direct identification of candidate compounds is conducted in conjunction with compounds generated via combinatorial chemistry techniques, whereby thousands of compounds are randomly prepared for such analysis.
Generally, the results of such screening will be compounds having unique core structures; thereafter, these compounds may be subjected to additional chemical modification around a preferred core structures) to further enhance the medicinal properties thereof. Such techniques are known to those in the art and will not be addressed in detail in this patent document.
G. Pharmaceutical compositions Candidate compounds selected for further development can be formulated into pharmaceutical compositions using techniques well known to those in the art.
Suitable pharmaceutically-acceptable carriers are available to those in the art; for example, see Remington's Pharmaceutical Sciences, 16t1' Edition, 1980, Mack Publishing Co., (Osol et al., eds.).
H. Other Utilities Although a preferred use of the non-endogenous versions of the GPCRs disclosed herein may be for the direct identification of candidate compounds as inverse agonists or agonists (preferably for use as pharmaceutical agents), other uses of these versions of GPCRs exist. For example, ih vitro and in vivo systems incorporating GPCRs can be utilized to further elucidate and understand the roles these receptors play i11 the human condition, both normal and diseased, as well as understanding the role of constitutive activation as it applies to understanding the signaling cascade. In some embodiments it is preferred that the endogenous receptors be "orphan receptors", i.e., the endogenous ligand for the receptor has not been identified. In some embodiments, therefore, the modified, non-endogenous GPCRs can be used to understand the role of endogenous receptors in the human body before the endogenous ligand therefore is identified. Such receptors can also be used to further elucidate known receptors and the pathways through which they transduce a signal. Other uses of the disclosed receptors will become apparent to those in the art based upon, ifzte~ alia, a review of this patent document.
EXAMPLES
The following examples are presented for purposes of elucidation, and not limitation, of the present invention. While specific nucleic acid and amino acid sequences are disclosed herein, those of ordinary skill in the art are credited with the ability to make minor modifications to these sequences while achieving the same or substantially similar results reported below. The traditional approach to application or understanding of sequence cassettes from one sequence to another (e.g. from rat receptor to human receptor or from human receptor A to human receptor B) is generally predicated upon sequence alignment techniques whereby the sequences are aligned in an effort to determine areas of commonality. The mutational approach disclosed herein does not rely upon this approach but is instead based upon an algorithmic approach and a positional distance from a conserved proline residue located within the TM6 region of human GPCRs. Once this approach is secured, those in the art are credited with the ability to make minor modifications thereto to achieve substantially the same results (i.e., constitutive activation) disclosed herein. Such modified approaches are considered within the purview of this disclosure.
Example 1 ENDOGENOUS HUMAN GPCRS
1. Identification of Human GPCRs The disclosed endogenous human GPCRs were identified based upon a review of the GenBankTM database information. While searching the database, the following cDNA clones were identified as evidenced below (Table C).
TABLE C
DisclosedAccessionOpen Reference Nucleic AcidAmino Human Number Reading To SEQ.ID. NO. Acid Orphan IdentifiedFrame Homologous SEQ.H).NO.
GPCRs (Ease GPCR
Pairs) hRUP28 AC073957 1,002bp hGPR30 1 2 hRUP29 AC083865 918bp hGPRl8 3 4 hRUP30 AC055863 1,125bp hBRBl 5 6 hRUP31 AL356214 1,086bp hGALR-1 7 8 hRUP32 AL513524 1,038bp hPNR 9 l0 hRUP33 AL513524 1,020bp GPR57 1l 12 hRUP34 AL513524 1,029bp hPNR 13 14 hRUP35 AC021089 1,062bp hx-type 15 16 opioid hRUP36 AC090099 969bp GPR90 17 18 hRUP37 AC090099 969bp hMRG 19 20 2. Full Length Cloning a. hRUP28 (Seq. Id. Nos. l & 2) The disclosed human RUP28 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AC073957 was identified as a human genomic sequence from chromosome 7.
The full length RUP28 was cloned by PCR using primers:

5'- CAGAGCTCTGGTGGCCACCTCTGTCC-3' (SEQ.ID.N0.:21; sense, 5' of initiation codon), 5'-CTGCGTCCACCAGAGTCACGTCTCC-3' (SEQ.ID.N0.:22; antisense, 3' of stop codon), and human adult liver Marathon-ReadyTM cDNA (Clontech) as template.
AdvantageTM cDNA polymerase (Clontech) was used for the amplification in a 50w1 reaction by the following cycle with step 2 to 4 repeated 35 times:
95°C for 5 min; 94°C
for 30 sec; 58°C for 30 sec; 72°C for 1 min 30 sec; and 72°C for 7 min.
A 1.16kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems). See, SEQ.ID.NO.:l for the nucleic acid sequence and SEQ.ID.N0.:2 for the putative amino acid sequence.
b. hRUP29 (Seq. Id. Nos. 3 & 4) The disclosed human RUP29 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AC0083865 was identified as a human genomic sequence from chromosome 7.
The full length RUP29 was cloned by PCR using primers:
5'-GTATGCCTGGCCACAATACCTCCAGG-3' (SEQ.ID.N0.:23; sense, containing the initiation codon), 5'-GTTTGTGGCTAACGGCACAAAACACAATTCC-3' (SEQ.117.N0.:24; antisense, containing the stop codon) and human genomic DNA as template. TaqPlus~
Precision DNA polymerase (Stratagene) was used for the amplification in a 50,1 reaction by the following cycle with step 2 to 4 repeated 35 times: 94°C for 5 min;
94°C for 30 sec; 54°C
for 30 sec; 72°C for 1 min 30 sec; and 72°C for 7 min.
A 930bp PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems).
Rapid amplification of cDNA ends (R.ACE) was performed using human leukocyte and ovary Marathon-ReadyTM cDNA (Clontech) to determine the precise 5' end of RUP29 cDNA. RUP29 specific primer (1) having the sequence:
5-GGTACCACA.ATGACAATCACCAGCGTCC-3'(SEQ.ID.N0.:25) and AP 1 primer (Clontech) were used for the first-round PCR reaction, and specific primer (2) having the following sequence:

5'-GGAACGTGAGGTACATGTGGATGTGCAGC-3' (SEQ.ID.N0.:26) and AP2 primer (Clontech) were used for the second-round PCR reaction. The products of the RACE reactions were isolated and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced. See, SEQ.ID.N0.:3 for the nucleic acid sequence and SEQ.ID.N0.:4 for the putative amino acid sequence. ' c. hRUP30 (Seq. Id. Nos. 5 & 6) The disclosed human RUP30 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AC055863 was identified as a human genomic sequence from chromosome 17.
The full length RUP30 was cloned by 5'R.ACE -PCR with a human pancreas Marathon-ReadyTM cDNA (Clontech) as template and the following oligonucleotide:
5'-GCAGTGTAGCGGTCAACCGTGAGCAGG-3'(SEQ.ID.N0.:27; sense, containing the initiation codon), and AP1 primer (Clontech) were used for the first round of RT-PCR and oligonucleotide:
5'-TGAGCAGGATGGCGATCCAGACTGAGGCGTGG-3'(SEQ.ID.NO.:28; antisense, containing the stop codon) and AP2 primer (Clontech) were used for the second round of PCR. DNA fragments generated by the 5' RACE-PCR were cloned into the pCRII-TOPO
vector (Invitrogen) and sequenced using the SP6/T7 primers (Stratagene).
Based on the sequence of the 5' RACE products, the full length RUP30 was cloned by RT-PCR, using primers:
5'-GAGGTACAGCTGGCGATGCTGACAG-3' (SEQ.ID.N0.:29; sense, ATG as the initiation codon);
5'-GTGGCCATGAGCCACCCTGAGCTCC-3' (SEQ.D~.N0.:30; antisense, 3' of the stop codon) and human pancreas Marathon-ReadyTM cDNA (Clontech) as template. Taq DNA
polymerase (Stratagene) was used for the amplification in 50 ~.l reaction by the following cycle with step 2 to step 4 repeated 35 times: 94°C for 40 seconds;
94°C for 20 seconds;
64°C for 20 seconds; 72°C for 2 minutes; and 72°C for 5 minutes.
A 1.2 Kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and several clones were sequenced using the ABI
Big Dye Terminator kit (P.E. Biosystems). See, SEQ.117.N0.:5 for the nucleic acid sequence and SEQ.ID.N0.:6 for the putative amino acid sequence.

d. hRUP31 (Seq. Id. Nos. 7 & 8) The disclosed human RUP31 was identified based upon the use of GenBank database information. While searclung the database, a cDNA clone with Accession Number AL356214 was identified as a human genomic sequence from chromosome 10.
The full length RUP31 was cloned by RT-PCR using primers:
5'-GGAATGTCCACTGAATGCGCGCGG-3' (SEQ.D~.N0.:31; sense, containing the initiation codon), 5'-AGCTCGCCAGGTGTGAGAAACTCGG-3' (SEQ.ID.NO.:32; antisense, 3' of stop codon) and human mammary gland Marathon-ReadyTM cDNA (Clontech) as template.
AdvantageTM cDNA polymerase (Clontech) was used for the amplification in 50 ~.1 reaction by the following cycle with step 2 to step 4 repeated 35 times: 94°C
for 40 sec; 94°C for 20 sec; 66°C for 20 sec; 72°C for 1 min 30 sec; and 72°C for 5 min.
A 1.1 kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems). See, SEQ.ID.NO.:7 for the nucleic acid sequence and SEQ.ID.N0.:8 for the putative amino acid sequence.
e. hRUP32 (Seq. Id. Nos. 9 & 10) The disclosed human RUP32 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AL513524 was identified as a human genomic sequence from chromosome 6.
The full length RUP32 was cloned by PCR using primers:
5'-GCGTTATGAGCAGCAATTCATCCCTGCTGG-3' (SEQ.ID.N0.:33; sense, containing the initiation codon), 5'-GTATCCTGAACTTCGTCTATACAACTGC-3' (SEQ.ID.N0.:34; antisense) and human genomic DNA (Clontech) as template. TaqPlus~ Precision DNA
polymerase (Stratagene) was used for the amplification by the following cycle with step 2 to step 4 repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 58°C
for 20 sec; 72°C for 1 min 30 sec;
and 72°C for 7 min.
A 1.06 kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems). See, SEQ.ID.N0.:9 for the nucleic acid sequence and SEQ.ID.NO.:10 for the putative amino acid sequence.

f. hRUP33 (Seq. Id. Nos. 1l & 12) The disclosed human RUP33 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AL513524 was identified as a human genomic sequence from chromosome 6.
The full length RUP33 was cloned by PCR using primers:
5'-CCCTCAGGAATGATGCCCTTTTGCCACAA-3' (SEQ.ID.N0.:35; sense, containing the initiation codon), 5'-ATCCATGTGGTTGGTGCATGTGGTTCGT-3' (SEQ.ID.N0.:36; antisense) and human genomic DNA (Clontech) as template. TaqPlus~ Precision DNA
polymerase (Stratagene) was used for the amplification by the following cycle with step 2 to step 4 repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 56°C
for 20 sec; 72°C for 1 min 30 sec;
and 72°C for 7 min.
A 1.1 kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems). See, SEQ.ID.NO.:11 for the nucleic acid sequence and SEQ.ID.NO.:12 for the putative amino acid sequence.
g. hRUP34 (Seq. Id. Nos.13 & 14) The disclosed human RUP34 was identified based upon the use of GenBank database information. While searching the database, ' a cDNA clone with Accession Number AL513524 was identified as a human genomic sequence from chromosome 6.
The full length RUP34 was cloned by PCR using primers:
5'-AAACAACAAACAGCAGAACCATGACCAGC-3' (SEQ.ID.N0.:37; sense, containing the initiation codon), 5'-ACATAGAGACAAGTGACATGTGTGAACCAC-3' (SEQ.ID.N0.:38; antisense) and human genomic DNA (Clontech) as template. TaqPlus~ Precision DNA
polymerase (Stratagene) was used for the amplification by the following cycle with step 2 to step 4 repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 60°C
for 20 sec; 72°C for 1 min 30 sec;
and 72°C for 7 min.
A 1.27 kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems). See, SEQ.ID.N0.:13 for the nucleic acid sequence and SEQ.ID.N0.:14 for the putative amino acid sequence.

h. hRUP35 (Seq. Id. Nos.15 & 16) The disclosed human RUP35 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AC021089 was identified as a human genomic sequence from chromosome 16.
The 5' sequence of RUP35 was determined by 5' RACE- PCR with a human fetal brain Marathon-ReadyTM cDNA (Clontech) as template. Oligonucleotide 5'-GGTATGAGACCGTGTGGTACTTGAGC-3' (SEQ.ID.NO.:39; sense) and APl primer (Clontech) were used for the first round of RT-PCR and oligonucleotide 5'-GTGGCAGACAGCGATATACCTGTCAATGG-3' (SEQ.ID N0.:40; antisense) and AP2 primer (Clontech) were used for the second round of PCR. DNA fragments generated by the 5' RACE-PCR were cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the SP6/T7 primers (Stratagene).
Based upon the sequence of the 5' RACE products, the full length RUP35 was cloned by RT-PCR, using primers 5'-GCGCTCATGGAGCACACGCACGCCCAC-3' (SEQ.ID.N0.:41; sense, ATG as the initiation codon) and 5'-GAGGCAGTAGTTGCCACACCTATGG-3' (SEQ.ID.N0.:42; antisense, 3' of the stop codon) and human brain Marathon-ReadyTM cDNA (Clontech) as template.
AdvantageTM
cDNA polymerase (Clontech) was used for the amplification in 100 ~,1 reaction by the following cycle with step 2 to step 4 repeated 45 times: 95°C for 2 min; 95°C for 20 sec;
60°C for 20 sec; 72°C for 1 min 30 sec; and 72°C for 5 min.
A 1.0 kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems). See, SEQ.ID.NO.:15 for the nucleic acid sequence and SEQ.ID.N0.:16 for the putative amino acid sequence.
i. hRUP36 (Seq. Id. Nos.17 & 18) The disclosed human RUP36 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AC090099 was identified as a human genomic sequence from chromosome 11.
The full length RUP36 was cloned by PCR using primers:
5'- CATCTGGTTTGTGTTCCCAGGGGCACCAG -3' (SEQ.ID.N0.:43; sense, 5' of start codon), 5'-GACAGTGTTGCTCTCAAAGTCCCGTCTGACTG -3' (SEQ.ID.N0.:44; antisense, 3' of stop codon) and human genomic DNA (Clontech) as template. TaqPlus~ Precision DNA
polymerase (Stratagene) was used for the amplification in a SOp,I reaction by the following cycle with step 2 to step 4 repeated 30 times: 95°C, 5 min; 95°C
for 30 sec; 70°C for 30 sec;
72°C for 1 min 30 sec; and 72°C for 7 min.
A 1.0 kb PCR fragment was isolated from a 1% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. Biosystems). See, SEQ.ID.N0.:17 for the nucleic acid sequence and SEQ.ID.NO.:1 ~ for the putative amino acid sequence.
j. hRUP37 (Seq. Id. Nos.19 & 20) The disclosed human RUP37 was identified based upon the use of GenBank database information. While searching the database, a cDNA clone with Accession Number AC090099 was identified as a human genomic sequence from chromosome 11.
The full length RUP37 was cloned by PCR using primers:
5'-CTGTTTCCAGGGTCATCAGACTGGG-3' (SEQ.ll~.N0.:45; sense);
5'-GCAGCATTGCTCTCAAAGTCCTGTCTG-3' (SEQ.ID.N0.:46; antisense) and human genomic DNA (Clontech) as template. TaqPlus~ Precision DNA
polymerase (Stratagene) was used for the amplification by the following cycle with step 2 to step 4 repeated 35 times: 95°C for 5 min; 95°C for 30 sec; 62°C
for 30 sec; 72°C for 1 min 30 sec;
and 72°C for 7 min.
A 969 base pair was isolated from a 1%% agarose gel and cloned into the pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E.
Biosystems). See, SEQ.ID.N0.:19 for the nucleic acid sequence and SEQ.ID.NO.:20 for the putative amino acid sequence.
Example 2 PREPARATION OF NON-ENDOGENOUS, CONSTITUTIVELY ACTIVATED GPCRS
Those skilled in the art are credited with the ability to select techniques for mutation of a nucleic acid sequence. Presented below are approaches utilized to create non-endogenous versions of several of the human GPCRs disclosed above. The mutations disclosed below are based upon an algorithmic approach whereby the 16th amino acid (located in the IC3 region of the GPCR) from a conserved proline (or an endogenous, conservative substitution therefore) residue (located in the TM6 region of the GPCR, near the TM6/IC3 interface) is mutated, preferably to an alanine, histimine, arginine or lysine amino acid residue, most preferably to a lysine amino acid residue.
1. Transformer Site-Directed TM Mutagenesis Preparation of non-endogenous human GPCRs may be accomplished on human GPCRs using, ihte~ alia, Transformer Site-DirectedTM Mutagenesis Kit (Clontech) according to the manufacturer instructions. In some embodiments two mutagenesis primers are used, preferably a lysine mutagenesis oligonucleotide that creates the lysine mutation, and a selection marker oligonucleotide. For convenience, the codon mutation to be incorporated into the human GPCR is also noted, in standard form (Table D):
TABLE D
Receptor Identifier Codon Mutation hRUP28 V274K

hRUP29 T249K

hRUP30 1Z232K

hRUP31 M294K

hRUP32 F220K

hRUP34 A238K

hRUP35 Y215K

hRUP36 L294K

hRUP37 T219K

Example 3 RECEPTOR EXPRESSION
Although a variety of cells are available to the art-skilled for the expression of proteins, it is preferred that mammalian cells be utilized. The primary reason for this is predicated upon practicalities, i.e., utilization of, e.g., yeast cells for the expression of a GPCR, while possible, introduces into the protocol a non-mammalian cell which may not (indeed, in the case of yeast, does not) include the receptor-coupling, genetic-mechanism AREN-0309 , PATENT
and secretary pathways that have evolved fox mammalian systems - thus, results obtained in non-mammalian cells, while of potential use, are not as preferred as those obtained using mammalian cells. Of the mammalian cells, COS-7, 293 and 293T
cells are particularly preferred, although the specific mammalian cell utilized can be predicated upon the particular needs of the artisan.
a. Transient Transfection On day one, 6x10 cells/10 cm dish of 293 cells well were plated out. On day two, two reaction tubes were prepared (the proportions to follow for each tube are per plate):
tube A was prepared by mixing 4~.g DNA (e.g., pCMV vector; pCMV vector with receptor cDNA, ete.) in 0.5 ml serum free DMEM (Gibco BRL); tube B was prepared by mixing 24,1 lipofectamine (Gibco BRL) in O.Sml serum free DMEM. Tubes A and B were admixed by inversion (several times),. followed by incubation at room temperature for 30-45min. The admixture is referred to as the "transfection mixture". Plated 293 cells were washed with 1XPBS, followed by addition of 5 ml serum free DMEM. One ml of the transfection mixture were added to the cells, followed by incubation for 4hrs at 37°C/5%
C02. The transfection mixture was removed by aspiration, followed by the addition of lOml of DMEM/10% Fetal Bovine Serum. Cells were incubated at 37°C/5%
CO2. After 48hr incubation, cells were harvested and utilized for analysis.
b. Stable Cell Lines Approximately 12x10 293 cells will be plated on a l5cm tissue culture plate, and grown in DME High Glucose Medium containing 10% fetal bovine serum and one percent sodium pyruvate, L-glutamine, and antibiotics. Twenty-four hours following plating of 293 cells (to approximately ~80% confluency), the cells will be transfected using 12~,g of DNA.
The 12~,g of DNA is combined with 60,1 of lipofectamine and 2mL of DME High Glucose Medium without serum. The medium will be aspirated from the plates and the cells washed once with medium without serum. The DNA, lipofectamine, and medium mixture will be added to the plate along with lOmL of medium without serum. Following incubation at 37°C for four to five hours, the medium will be aspirated and 25m1 of medium containing serum will be added. Twenty-four hours following transfection, the medium will be aspirated again, and fresh medium with serum will be added. Forty-eight hours following transfection, the medium will be aspirated and medium with serum will be added containing geneticin (G418 drug) at a final concentration of SOO~,g/mL. The transfected cells will then undergo selection for positively transfected cells containing the G41 ~
resistant gene.
The medium will be replaced every four to five days as selection occurs.
During selection, cells will be grown to create stable pools, or split for stable clonal selection.
Example 4 S ASSAYS FOR DETERMINATION OF CONSTITUTIVE ACTIVITY
OF NON-ENDOGENOUS GPCRS
A variety of approaches are available for assessment of constitutive activity of the non-endogenous human GPCRs. The following are illustrative; those of ordinary skill in the art are credited with the ability to determine those techniques that are preferentially beneficial for the needs of the artisan.
1. Membrane Binding Assays: [35S]GTPyS Assay When a G protein-coupled receptor is in its active state, either as a result of ligand binding or constitutive activation, the receptor couples to a G protein and stimulates the release of GDP and subsequent binding of GTP to the G protein. The alpha subunit of the G protein-receptor complex acts as a GTPase and slowly hydrolyzes the GTP to GDP, at which point the receptor normally is deactivated. Constitutively activated receptors continue to exchange GDP for GTP. The non-hydrolyzable GTP analog, [35S]GTPyS, can be utilized to demonstrate enhanced binding of [35S]GTPyS to membranes expressing constitutively activated receptors. Advantages of using [35S]GTPyS binding to measure constitutive activation include but are not limited to the following: (a) it is generically applicable to all G protein-coupled receptors; (b) it is proximal at the membrane surface making it less likely to pick-up molecules which affect the intracellular cascade.
The assay takes advantage of the ability of G protein coupled receptors to stimulate [ssS]GTPyS binding to membranes expressing the relevant receptors. The assay can, therefore, be used in the direct identification method to screen candidate compounds to constitutively activated G protein-coupled receptors. The assay is generic and has application to drug discovery at all G protein-coupled receptors.
The [35S]GTPyS assay is incubated in 20 mM HEPES and between 1 and about 20mM MgCl2 (this amount can be adjusted for optimization of results, although 20mM is preferred) pH 7.4, binding buffer with between about 0.3 and about 1.2 nM
[35S]GTPyS

(this amount can be adjusted for optimization of results, although 1.2 is preferred ) and 12.5 to 75 ~,g membrane protein (e.g., 293 cells expressing the GS Fusion Protein; this amount can be adjusted for optimization) and 10 p.M GDP (this amount can be changed for optimization) for 1 hour. Wheatgerm agglutinin beads (25 ~.1; Amersham) will then be added and the mixture incubated for another 30 minutes at room temperature.
The tubes will be then centrifuged at 1500 x g for 5 minutes at room temperature and then counted in a scintillation counter.
2. Adenylyl Cyclase A Flash PlateTM Adenylyl Cyclase kit (New England Nuclear; Cat. No. SMP004A) designed for cell-based assays can be modified for use with crude plasma membranes. The Flash Plate wells can contain a scintillant coating which also contains a specific antibody recognizing cAMP. The cAMP generated in the wells can be quantitated by a direct competition for binding of radioactive cAMP tracer to the cAMP antibody. The following serves as a brief protocol for the measurement of changes in CAMP levels in whole cells that express the receptors.
Transfected cells will be harvested approximately twenty four hours after transient transfection. Media will be carefully aspirated and discarded. Ten ml of PBS
will gently be added to each dish of cells followed by careful aspiration. One ml of Sigma cell dissociation buffer and 3m1 of PBS will be added to each plate. Cells will be pipetted off the plate and the cell suspension collected into a SOmI conical centrifuge tube. Cells will be centrifuged at room temperature at 1,100 rpm for 5 min. The cell pellet will be carefully re-suspended into an appropriate volume of PBS (about 3m1/plate). The cells will be then counted using a hemocytometer and additional PBS will be added to give the appropriate nmnber of cells (to a final volume of about SOp,I/well).
cAMP standards and Detection Buffer (comprising 1 ~,Ci of tracer [lzsI cAMP
(50 ~,1~ to 11 ml Detection Buffer) will be prepared and maintained in accordance with the manufacturer's instructions. Assay Buffer will be prepared fresh for screening and contained 50,1 of Stimulation Buffer, 3~,1 of test compound (12~,M final assay concentration) and 50,1 cells, Assay Buffer will be stored on ice until utilized. The assay will be initiated by addition of SOwI of cAMP standards to appropriate wells followed by addition of 50.1 of PBSA to wells H-11 and H12. Fifty w1 of Stimulation Buffer will be added to all wells. DMSO (or selected candidate compounds) will be added to appropriate wells using a pin tool capable of dispensing 3~,1 of compound solution, with a final assay concentration of 12~,M test compound and 100.1 total assay volume. The cells will then be added to the wells and incubated for 60 min at room temperature. One hundred ~.l of Detection Mix containing tracer cAMP will then be added to the wells. Plates will be incubated for an additional 2 hours followed by counting in a Wallac MicroBetaTM
scintillation counter. Values of cAMP/well will then be extrapolated from a standard cAMP curve which will be contained within each assay plate.
3. Cell-Based CAMP for G; Coupled Target GPCRs TSHR is a GS coupled GPCR that causes the accumulation of cAMP upon activation. TSHR will be constitutively activated by mutating amino acid residue 623 (i.e., changing an alanine residue to an isoleucine residue). A G, coupled receptor is expected to inhibit adenylyl cyclase, and, therefore, decrease the level of cAMP
production, which can male assessment of cAMP levels challenging. An effective technique for measuring the decrease in production of cAMP as an indication of constitutive activation of a G, coupled receptor can be accomplished by co-transfecting, most preferably, non-endogenous, constitutively activated TSHR (TSHR-A623I) (or an endogenous, constitutively active GS
coupled receptor) as a "signal enhancer" with a G, linked target GPCR to establish a baseline level of cAMP. TJpon creating a non-endogenous version of the G, coupled receptor, this non-endogenous version of the target GPCR is then co-transfected with the signal enhancer, and it is this material that can be used for screening. This approach will be utilized to effectively generate a signal when a CAMP assay is used; this approach is preferably used in the direct identification of candidate compounds against G;
coupled receptors. It is noted that for a G; coupled GPCR, when this approach is used, an inverse agonist of the target GPCR will increase the cAMP signal and an agonist will decrease the cAMP signal.
On day one, 2x104 293 cells/well will be plated out. On day two, two reaction tubes will be prepared (the proportions to follow for each tube are per plate): tube A will be prepared by mixing tug DNA of each receptor transfected into the mammalian cells, for a total of 4ug DNA (e.g., pCMV vector; pCMV vector with mutated THSR (TSHR-A6231);
TSHR-A623I and GPCR, etc.) in 1.2m1 serum free DMEM (Irvine Scientific, Irvine, CA);
tube B will be prepared by mixing 120.1 lipofectamine (Gibco BRL) in 1.2m1 serum free DMEM. Tubes A and B will then be admixed by inversion (several times), followed by incubation at room temperature for 30-45min. The admixture is referred to as the "transfection mixture". Plated 293 cells will be washed with 1XPBS, followed by addition of l Oml serum free DMEM. 2.4m1 of the transfection mixture will then be added to the cells, followed by incubation for 4hrs at 37°C/5% C02. The transfection mixture will then be removed by aspiration, followed by the addition of 25m1 of DMEM/10%
Fetal Bovine Serum. Cells will then be incubated at 37°C/5% C02. After 24hr incubation, cells will be harvested and utilized for analysis.
A Flash PlateTM Adenylyl Cyclase kit (New England Nuclear; Cat. No. SMP004A) although designed for cell-based assays, can be modified for use with crude plasma membranes depending on the need of the skilled artisan. The Flash Plate wells will contain a scintillant coating which also contains a specific antibody recognizing cAMP. The cAMP
generated in the wells can be quantified by a direct competition for binding of radioactive cAMP tracer to the cAMP antibody. The following serves as a brief protocol for the measurement of changes in cAMP levels in whole cells that express the receptors.
Transfected cells will be harvested approximately twenty four hours after transient transfection. Media will be carefully aspirated and discarded. Ten ml of PBS
will be gently added to each dish of cells followed by careful aspiration. One ml of Sigma cell dissociation buffer and 3m1 of PBS will be added to each plate.
Cells will be pipetted off the plate and the cell suspension will be collected into a SOmI
conical centrifuge tube. Cells will be centrifuged at room temperature at 1,100 rpm for 5 min.
The cell pellet will be carefully re-suspended into an appropriate volume of PBS (about 3ml/plate). The cells will then be counted using a hemocytometer and additional PBS is added to give the appropriate number of cells (to a final volume of about 50~,1/well).
cAMP standards and Detection Buffer (comprising 1 ~,Ci of tracer ~l2sI cAMP
(50 ~1] to 11 ml Detection Buffer) will be prepared and maintained in accordance with the manufacturer's instructions. Assay Buffer should be prepared fresh for screening and contained 50,1 of Stimulation Buffer, 3~,1 of test compound (12~,M final assay concentration) and SOp,I cells, Assay Buffer can be stored on ice until utilized. The assay can be initiated by addition of 50,1 of cAMP standards to appropriate wells followed by addition of SOpI of PBSA to wells H-11 and H12. Fifty p.1 of Stimulation Buffer will be added to all wells'. Selected compounds (e.g., TSH) will be added to appropriate wells AREN=0309 PATENT
using a pin tool capable of dispensing 3~,1 of compound solution, with a final assay concentration of 12~,M test compound and 100,1 total assay volume. The cells will then be added to the wells and incubated for 60 min at room temperature. One hundred p,1 of Detection Mix containing tracer cAMP will then be added to the wells. Plates will then be incubated additional 2 hours followed by counting in a Wallac MicroBeta scintillation counter. Values of cAMP/well will then be extrapolated from a standard cAMP
curve which is contained within each assay plate.
4. Reporter-Based Assays a. Cup-Luc Reporter Assay (GS -associated receptors) 293 and 293T cells will be plated-out on 96 well plates at a density of 2 x cells per well and will be transfected using Lipofectamine Reagent (BRL) the following day according to manufacturer instructions. A DNA/lipid mixture will be prepared for each 6-well transfection as follows: 260ng of plasmid DNA in 100,1 of DMEM are gently mixed with 2~,1 of lipid in 100.1 of DMEM (the 260ng of plasmid DNA
consisted of 200ng of a 8xCRE-Luc reporter plasmid, SOng of pCMV comprising endogenous receptor or non-endogenous receptor or pCMV alone, and long of a GPRS
expression plasmid (GPRS in pcDNA3 (Invitrogen)). The 8XCRE-Luc reporter plasmid is prepared as follows: vector SRIF-(3-gal will be obtained by cloning the rat somatostatin promoter (-71/+51) at BglV-HindIII site in the p(3gal-Basic Vector (Clontech). Eight (8) copies of cAMP response element will be obtained by PCR from an adenovirus template AdpCF126CCRE8 (see, 7 Human Gene Therapy 1883 (1996)) and cloned into the SRIF-(3-gal vector at the Kpn-BglV site, resulting in the 8xCRE-(3-gal reporter vector. The 8xCRE-Luc reporter plasmid will be generated by replacing the beta-galactosidase gene in the 8xCRE-(3-gal reporter vector with the luciferase gene obtained from the pGL3-basic vector (Promega) at the HindIII-BamHI site. Following 30 min. incubation at room temperature, the DNA/lipid mixture will be diluted with 400 ~,1 of DMEM
and 100,1 of the diluted mixture will be added to each well. One hundred p1 of DMEM with 10% FCS will be added to each well after a 4hr incubation in a cell culture incubator.
The following day the transfected cells will be changed with 200 ~.1/well of DMEM with 10% FCS. Eight hours later, the wells will be changed to 100 ~,1 /well of DMEM
without phenol red, after one wash with PBS. Luciferase activity will be measured the next day using the LucLiteTM reporter gene assay kit (Packard) following manufacturer's instructions and read on a 1450 MicroBetaTM scintillation and luminescence counter (Wallac).
b. APl reporter assay (Gq associated receptors) A method to detect Gq stimulation depends on the known property of Gq dependent phospholipase C to cause the activation of genes containing AP1 elements in their promoter. A PathdetectTM AP-1 cis-Reporting System (Stratagene, Catalogue #
219073) can be utilized following the protocol set forth above with respect to the CREB
reporter assay, except that the components of the calcium phosphate precipitate were 410 ng pAPl-Luc, 80 ng pCMV-receptor expression plasmid, and 20 ng CMV-SEAP.
c. S~-Luc Reporter Assay (Gg- associated receptors) One method to detect Gq stimulation depends on the known property of G9-dependent phospholipase C to cause the activation of genes containing serum response factors in their promoter. A PathdetectTM SRF-Luc-Reporting System (Stratagene) can be utilized to assay for Gq coupled activity in, e.g., COS7 cells. Cells are transfected with the plasmid components of the system and the indicated expression plasmid encoding endogenous or non-endogenous GPCR using a Mammalian TransfectionTM Kit (Stratagene, Catalogue #200285) according to the manufacturer's instructions.
Briefly, 410 ng SRF-Luc, 80 ng pCMV-receptor expression plasmid and 20 ng CMV-SEAP
(secreted alkaline phosphatase expression plasmid; alkaline phosphatase activity is measured in the media of transfected cells to control for variations in transfection efficiency between samples) are combined in a calcium phosphate precipitate as per the manufacturer's instructions. Half of the precipitate is equally distributed between 3 wells in a 96-well plate, kept on the cells in a serum free media for 24 hours. The last 5 hours the cells are incubated with lp,M Angiotensin, where indicated. Cells are then lysed and assayed for luciferase activity using a LucliteTM Kit (Packard, Cat.
# 6016911) and "Trilux 1450 Microbeta" liquid scintillation and luminescence counter (Wallac) as per the manufacturer's instructions. The data can be analyzed using GraphPad PrismTM
2.0a (GraphPad Software Inc.).
d. Intracellular IP3 Accumulation Assay (Gq-associated receptors) On day 1, cells comprising the receptors (endogenous and/or non-endogenous) are plated onto 24 well plates, usually 1x105 cells/well (although his number can be optimized. On day 2 cells are transfected by firstly mixing 0.25ug DNA
in 50 p,1 serum free DMEM/well and 2 ~.1 lipofectamine in 50 ~,1 serum free DMEM/well.
The solutions are gently mixed and incubated for 15-30 min at room temperature.
Cells are then washed with 0.5 ml PBS aizd 400 ~ul of serum free media and then mixed with the transfection media and added to the cells. The cells are incubated for 3-4 hrs at 37°C/5%C02 and then the transfection media is removed and replaced with lmllwell of regular growth media. On day 3 the cells are labeled with 3H-myo-inositol.
Briefly, the media is removed and the cells are washed with 0.5 ml PBS. Then 0.5 ml inositol-free/serum free media (GIBCO BRL) are added/well with 0.25 ~,Ci of 3H-myo-inositol/
well and the cells incubated for 16-18 hrs overnight at 37°C/5%C02. On Day 4 the cells are washed with 0.5 ml PBS acid 0.45 ml of assay medium is added containing inositol-free/serum free media 10 wM pargyline 10 mM lithium chloride or 0.4 ml of assay medium and 50 p.1 of lOx l~etanserin (let) to final concentration of 10~,M. The cells are then incubated for 30 min at 37°C. The cells are then washed with 0.5 ml PBS
and 200 p,1 of fresh/ice cold stop solution (1M I~OH; 18 mM Na-borate; 3.8 mM EDTA) is added to each well. The solution is Dept on ice for 5-10 min (or until cells are lysed) and then neutralized by 200 ~,1 of fresh/ice cold neutralization solution (7.5 % HCL). The lysate is then transferred into 1.5 ml Eppendorf tubes and 1 ml of chloroform/methanol (1:2) is added/tube. The solution is vortexed for 15 sec and the upper phase is applied to a Biorad AGl-XBTM anion exchange resin (100-200 mesh). First, the resin is washed with water at 1:1.25 W/V and 0.9 ml of upper phase is loaded onto the column. The column is then washed with 10 ml of 5 mM myo-inositol and 10 ml of 5 mM Na-borate/60mM Na-formate. The inositol tris phosphates are eluted into scintillation vials containing 10 ml of scintillation cocktail with 2 ml of 0.1 M formic acid/ 1 M ammonium formate.
The columns are regenerated by washing with 10 ml of 0.1 M formic acid/3M
axmnonium formate and rinsed twice with dd HBO and stored at 4°C in water.
Reference is made to Figure 1. In Figure 1, 293 cells were transfected with Gq protein containing a six amino acid deletion, "Gq(del)"; Gg protein fused to a G protein, "Gq(del)/G"; endogenous RUP32; and RUP32 with Gq(del) ("RUP32+ Gq(del)/G").
The data indicate, based upon measuring IP3 accumulation of RUP32 co-transfection of Gq(del)/G, that RUP32 does not endogenously couple to Gg protein.
However when RUP32 was co-transfected with Gq(del)/G fusion protein, RUP32 was forced to couple to Gq protein. RUP27+ Gq(del)/G; evidence about a nine (9) fold increase in IP3 accumulation when compared to endogenous RUP32. This data demonstrates that the Gq(del)/G Fusion Construct can be co-transfected with a GPCR and used to screen for agonists or inverse agonists.
Reference is made to Figure 2. In Figure 2, 293 cells were transfected with and RUP36 receptor and compared to the control, pCMV. The data indicate that both RUP35 and RUP36 receptor are endogenously, constitutively active. RUP35 evidences about a six (6) fold increase in intracellular inositol phosphate accumulation when compared to pCMV and RUP36 evidences about a four (4) fold increase when compared to pCMV.
Example 5 FUSION PROTEIN PREPARATION
a. GPCR: GS Fusion Construct The design of the constitutively activated GPCR-G protein fusion construct can be accomplished as follows: both the 5' and 3' ends of the rat G protein Gsa (long form; Itoh, H. et al., 83 PN~1S 3776 (1986)) is engineered to include a HindllI (5'-AAGCTT-3') sequence thereon. Following confirmation of the correct sequence (including the flanking HindIII sequences), the entire sequence is shuttled into pcDNA3.1(-) (Invitrogen, cat. no.
V795-20) by subcloning using the HilzdIII restriction site of that vector. The correct orientation for the Gsa sequence will be determined after subcloning into pcDNA3.1(-) The modified pcDNA3.1 (-) containing the rat Gsa gene at HindIII sequence is then verified;
this vector will then be available as a "universal" Gsa protein vector. The pcDNA3.1(-) vector contains a variety of well-known restriction sites upstream of the HindIII site, thus beneficially providing the ability to insert, upstream of the GS protein, the coding sequence of an endogenous, constitutively active GPCR. This same approach can be utilized to create other "universal" G protein vectors, and, of course, other commercially available or proprietary vectors known to the artisan can be utilized. In some embodiments, the important criteria is that the sequence for the GPCR be upstream and in-frame with that of the G protein.

Spacers in the restriction sites between the G protein and the GPCR are optional. The sense and anti-sense primers included the restriction sites for XbaI and EcoRV, respectively, such that spacers (attributed to the restriction sites) exist between the G protein and the GPCR.
PCR will then be utilized to secure the respective receptor sequences for fusion witlvn the Gsa universal vector disclosed above, using the following protocol for each:
100ng cDNA for GPCR will be added to separate tubes containing 2~,1 of each primer (sense and anti-sense), 3~.1 of lOmM dNTPs, 10,1 of lOXTaqPIusTM Precision buffer, lwl of TaqPlusTM Precision polymerase (Stratagene: #600211), and 80.1 of water.
Reaction temperatures and cycle times for the GPCR will be as follows with cycle steps 2 through 4 were repeated 35 times: 94°C for 1 min; 94°C for 30 seconds;
62°C for 20 sec; 72°C 1 min 40sec; and 72°C 5 min. PCR products will be run on a 1% agarose gel and then purified.
The purified products will be digested with XbaI and EcoRV and the desired inserts purified and ligated into the GS unversal vector at the respective restriction sites. The positive clones will be isolated following transformation and determined by restriction enzyme digestion; expression using 293 cells will be accomplished following the protocol set forth infra. Each positive clone for GPCR- GS Fusion Protein will be sequenced to verify correctness.
b. Gq(6 amino acid deletion)/G; Fusion Construct The design of a Gq (del)/G fusion construct was accomplished as follows: the N-terminal six (6) amino acids (amino acids 2 through 7), having the sequence of TLESIM
(SEQ.ID.NO.:47) Gaq-subunit was deleted and the C-terminal five (5) amino acids, having the sequence EYNLV (SEQ.ID.N0.:48) was replaced with the corresponding amino acids of the Gai Protein, having the sequence DCGLF (SEQ.ID.N0.:49). This fusion construct was obtained by PCR using the following primers:
5'-gatcAAGCTTCCATGGCGTGCTGCCTGAGCGAGG-3' (SEQ.ID.NO.:50) and 5'-gatcGGATCCTTAGAACAGGCCGCAGTCCTTCAGGTTCAGCTGCAGGATGGTG-3' (SEQ.ID.N0.:51) and Plasmid 63313 which contains the mouse Gaq-wild type version with a hemagglutinin_ tag as template. Nucleotides in lower caps are included as spacers.
TaqPlus~ Precision DNA polymerase (Stratagene) was utilized for the amplification by the following cycles, with steps 2 through 4 repeated 35 times: 95°C for 2 min; 95°C for 20 sec; 56°C for 20 sec; 72°C for 2 min;
and 72°C for 7 min. The PCR

product will be cloned into a pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator kit (P.E. Biosystems). Inserts from a TOPO clone containing the sequence of the fusion construct will be shuttled into the expression vector pcDNA3.1(+) at the HindIIIBamHI site by a 2 step cloning process.
Example 6 TISSUE DISTRIBUTION OF THE DISCLOSED HUMAN GPCRS: RT-PCR
RT-PCR was applied to confine the expression and to detennine the tissue distribution of several novel human GPCRs. Oligonucleotides utilized were GPCR-specific and the human multiple tissue cDNA panels (MTC, Clontech) as templates. Taq DNA polymerase (Stratagene) were utilized for the amplification in a 40,1 reaction according to the manufacturer's instructions. Twenty w1 of the reaction will be loaded on a 1.5% agarose gel to analyze the RT-PCR products. Table E, below, lists the receptors, the cycle conditions and the primers utilized, and also lists exemplary diseases/disorders linked to the receptors.
TABLE E
ReceptorCycle 5' Primer 3' Primer DNA Tissue Expression IdentifierConditions (SEQ.ID.NO.)(SEQ.ID.NO.)Fragment Min ('), Sec (") Cycles 2-4 repeated times hRUP28 94C for GTCCTCACT CTGCGTCCAC710bp heart; kidney;
5 min; liver; lung 94C for GGTGGCCAT CAGAGTCAC and pancreas 30 sec;

58C for GTACTCC GTCTCC
30 sec, (52) (53) 72C for 1 min, and 72C
for 7 min hRUP29 94C for CTTGGATGTT GTTTGTGGCT690bp leukocyte and 5 min; ovary 94C for TGGGCTGCC AACGGCACA
30 sec;

58C for CTTCTGC AAACACAAT
30 sec, (54) 72C for TCC (55) 1 min, and 72C for min hRUP30 94C for 2 CTGCTCACG GTGGCCATG 690bp pancreas min;

94C for 15 GTTGACCGC AGCCACCCT
sec;

58C for 20 TACACTGC GAGCTCC
sec, (57) 72C for 1 (56) min, and 72C for min hRUP31 95C for 4 CTTCTTCTCCCCAAATCA 516bp colon, lung, min; pancreas, 95C for 1 GACGTCAAG GTGTGCAA thymus; cerebral min; cortex, G (58) hippocampus 52C for 30 ATCG (59) ofbrain, sec, and fat cells 72C for 1 min, and 72C for min hRUP32 95C for 4 TGAATGGGT CAACGGTCT 527bp thymus min;

95C for 1 CCTGTGTGA GACAACCTC
min;

52C for 30 ~ (60) CT (61) sec, 72C for 1 min, and 72C for min hRUP34 95C for 4 TTGCTGTGATCAGGAAGCC 534bp peripheral min; blood 95C for 1 GTGGCATTT"TCATAAAGGC leukocyte min; ("PBL"), 52C for 30 G (62) ATCAA (63) prostate and sec, kidney 72C for 1 min, and 72C for min hRUP35 95C for 4 ACATCACCT CCAGCATCTT 557bp thalamus min;

95C for 1 GCTTCCTGA GATGCAGTG
min;

52C for 30 CC (64) T (65) sec, 72C for 1 min, and 72C for min hRUP37 95C for 4 CCATCTCCA GCTGTTAAG 517bp testis, cerebral min; cortex 95C for 1 AAATCCTCA AGCGGACAG and hippocampus min;

52C for 30 GTC (66) GAAA (67) sec, 72C for 1 min, and 72°C for 7 min Diseases and disorders related to receptors located in these tissues or regions include, but are not limited to, cardiac disorders and diseases (e.g.
thrombosis, myocardial infarction; atherosclerosis; cardiomyopathies); kidney disease/disorders (e.g., renal failure;
renal tubular acidosis; renal glycosuria; nephrogenic diabetes insipidus;
cystinuria;
polycystic kidney disease); eosinophilia; leukocytosis; leukopenia; ovarian cancer; sexual dysfunction; polycystic ovarian syndrome; pancreatitis and pancreatic cancer;
irritable bowel syndrome; colon cancer; Crohn's disease; ulcerative colitis;
diverticulitis; Chronic Obstructive Pulmonary Disease (COPD); Cystic Fibrosis; pneumonia; pulmonary hypertension; tuberculosis and lung cancer; Parkinson's disease; movement disorders and ataxias; learning and memory disorders; eating disorders (e.g., anorexia;
bulimia, etc.);
obesity; cancers; thymoma; myasthenia gravis; circulatory disorders; prostate cancer;
prostatitis; kidney diseaseldisorders(e.g., renal failure; renal tubular acidosis; renal glycosuria; nephrogenic diabetes insipidus; cystinuria; polycystic l~idney disease);
sensorimotor processing and arousal disorders; obsessive-compulsive disorders;
testicular cancer; priapism; prostatitis; hernia; endocrine disorders; sexual dysfunction; allergies;
depression; psychotic disorders; migraine; reflux; schizophrenia; ulcers;
bronchospasm;
epilepsy; prostatic hypertrophy; anxiety; rhinitis; angina; and glaucoma.
Accordingly, the methods of the present invention may also be useful in the diagnosis and/or treatment of these and other diseases and disorders.
Example 7 Protocol: Direct Identification of Inverse Agonists and Agonists A. [35S]GTPyS Assay Although endogenous, constitutively active GPCRs have been used for the direct identification of candidate compounds as, e.g., inverse agonists, for reasons that are not altogether understood, infra-assay variation can become exacerbated. In some embodiments a GPCR Fusion Protein, as disclosed above, is also utilized with a non-endogenous, constitutively activated GPCR. When such a protein is used, infra-assay variation appears to be substantially stabilized, whereby an effective signal-to-noise ratio is obtained. This has the beneficial result of allowing for a more robust identification of candidate compounds. Thus, in some embodiments it is preferred that for direct identification, a GPCR Fusion Protein be used and that when utilized, the following assay protocols be utilized.
1. Membrane Preparation Membranes comprising the constitutively active orphan GPCR Fusion Protein of interest and for use in the direct identification of candidate compounds as inverse agonists or agoiusts are preferably prepared as follows:
a. Materials "Membrane Scrape Buffer" is comprised of 20mM HEPES and lOmM EDTA, pH 7.4; "Membrane Wash Buffer" is comprised of 20 mM HEPES and 0.1 mM EDTA, pH 7.4; "Binding Buffer" is comprised of 20mM HEPES, 100 mM NaCl, and 10 mM
MgCl2, pH 7.4 b. Procedure All materials will be kept on ice throughout the procedure. Firstly, the media will be aspirated from a confluent monolayer of cells, followed by rinse with 10m1 cold PBS, followed by aspiration. Thereafter, Sml of Membrane Scrape Buffer will be added to scrape cells; this will be followed by transfer of cellular extract into SOmI
centrifizge tubes (centrifuged at 20,000 rpm for 17 minutes at 4°C). Thereafter, the supernatant will be aspirated and the pellet will be resuspended in 30m1 Membrane Wash Buffer followed by centrifugation at 20,000 rpm for 17 minutes at 4°C. The supernatant will then be aspirated and the pellet resuspended in Binding Buffer. The resuspended pellet will then be homogenized using a Brinkman PolytronTM homogenizes (15-20 second bursts until the material is in suspension). This is referred to herein as "Membrane Protein".
2. Bradford Protein Assay Following the homogenization, protein concentration of the membranes will be determined, for example, using the Bradford Protein Assay (protein can be diluted to about l.Smg/ml, aliquoted and frozen (-80°C) for later use; when frozen, protocol for use will be as follows: on the day of the assay, frozen Membrane Protein is thawed at room temperature, followed by vortex and then homogenized with a Polytron at about 12 x 1,000 rpm for about 5-10 seconds; it was noted that for multiple preparations, the homogenizes is thoroughly cleaned between homogenization of different preparations).

a. Materials Binding Buffer (as discussed above); Bradford Dye Reagent; Bradford Protein Standard will be utilized, following manufacturer instructions (Biorad, cat.
no. 500-0006).
~ b. Procedure Duplicate tubes will be prepared, one including the membrane, and one as a control "blank". Each contains 800,1 Binding Buffer. Thereafter, lOwl of Bradford Protein Standard (lmg/ml) will be added to each tube, and lOwl of membrane Protein will then be added to just one tube (not the blank). Thereafter, 200,1 of Bradford Dye Reagent will be added to each tube, followed by vortexing. After five minutes, the tubes will be re-vortexed and the material therein will be transferred to cuvettes.
The cuvettes will then be read using a CECIL 3041 spectrophotometer, at wavelength 595.
3. Direct Identification Assay a. Materials GDP Buffer consisted of 37.5 ml Binding Buffer and 2mg GDP (Sigma, cat. no. G-7127), followed by a series of dilutions in Binding Buffer to obtain 0.2 ~,M
GDP (final concentration of GDP in each well was 0.1 ~,M GDP); each well comprising a candidate compound, has a final volume of 200.1 consisting of 100,1 GDP Buffer (final concentration, 0.1 ~.M GDP), 50,1 Membrane Protein in Binding Buffer, and 50,1 [35S]GTPyS (0.6 nM) in Binding Buffer (2.5 ~,1 [35S]GTPyS per 10m1 Binding Buffer).
b. Procedure Candidate compounds will be preferably screened using a 96-well plate format (these can be frozen at -80°C). Membrane Protein (or membranes with expression vector excluding the GPCR Fusion Protein, as control), will be homogenized briefly until in suspension. Protein concentration will then be determined using, for example, the Bradford Protein Assay set forth above. Membrane Protein (and controls) will then be diluted to 0.25mg/ml in Binding Buffer (final assay concentration, 12.S~,g/well).
Thereafter, 100 ~,1 GDP Buffer is added to each well of a Wallac ScintistripTM (Wallet). A 5~,1 pin-tool will then be used to transfer 5 ~,l of a candidate compound into such well (i.e., 5~,1 in total assay volume of 200 ~,1 is a 1:40 ratio such that the final screening concentration of the candidate compound is 10~,M). Again, to avoid contamination, after each transfer step the pin tool is rinsed in three reservoirs comprising water (1X), ethanol (1X) and water (2X) -excess liquid is shaken from the tool after each rinse and the tool is dried with paper and Kim__ wipes. Thereafter, 50 p,1 of Membrane Protein will be added to each well (a control well comprising membranes without the GPCR Fusion Protein was also utilized), and pre-incubated for 5-10 minutes at room temperature. Thereafter, 50 p1 of [3sS]GTPySo (0.6 nM) in Binding Buffer will be added to each well, followed by incubation on a shaker for 60 minutes at room temperature (again, in this example, plates were covered with foil). The assay will be stopped by spinnuzg the plates at 4000 RPM for 15 minutes at 22°C. The plates will then be aspirated with an 8 channel manifold and sealed with plate covers. The plates will then be read on a Wallac 1450 using setting "Prot. #37" (as per manufacturer's instructions).
B. Cyclic AMP Assay Another assay approach to directly identify candidate compound will be accomplished utilizing a cyclase-based assay. In addition to direct identification, this assay approach can be utilized as an independent approach to provide confinnation of the results from the [3sS]GTPyS approach as set forth above.
A modified Flash PlateTM Adenylyl Cyclase kit (New England Nuclear; Cat. No.
SMP004A) will be preferably utilized for direct identification of candidate compounds as inverse agonists and agonists to GPCRs in accordance with the following protocol.
Transfected cells will be harvested approximately three days after transfection.
Membranes will be prepared by homogenization of suspended cells in buffer containing 20mM HEPES, pH 7.4 and lOmM MgCl2. Homogenization will be performed on ice using a Brinkman PolytronTM for approximately 10 seconds. The resulting homogenate will be centrifuged at 49,000 X g for 15 minutes at 4°C. The resulting pellet will then be resuspended in buffer containing 20mM HEPES, pH 7.4 and 0.1 mM EDTA, homogenized for 10 seconds, followed by centrifugation at 49,000 X g for 15 minutes at 4°C. The resulting pellet will then be stored at -80°C until utilized. On the day of direct identification screening, the membrane pellet will slowly be thawed at room temperature, resuspended in buffer containing 20mM HEPES, pH 7.4 and lOmM MgCl2, to yield a final protein concentration of 0.60mg/ml (the resuspended membranes will be placed on ice until use).
cAMP standards and Detection Buffer (comprising 2 ~,Ci of tracer [lzsI cAMP
(100 p.1] to 11 ml Detection Buffer) will be prepared and maintained in accordance with the manufacturer's instructions. Assay Buffer will be prepared fresh for screening and contain 20mM HEPES, pH 7.4, lOmM MgCl2, 20mM phosphocreatine (Sigma), 0.1 units/ml creatine phosphokinase (Sigma), 50 p,M GTP (Sigma), and 0.2 mM ATP
(Sigma);
Assay Buffer will be stored on ice until utilized.
Candidate compounds identified as per above (if frozen, thawed .at room temperature) will be added, preferably, to 96-well plate wells (3~1/well; 12~M
final assay concentration), together with 40 ~,1 Membrane Protein (30~g/well) and SOp,I of Assay Buffer. This admixture will be incubated for 30 minutes at room temperature, with gentle shaking.
Following the incubation, 100,1 of Detection Buffer will be added to each well, followed by incubation for 2-24 hours. Plates will then be counted in a Wallac MicroBetaTM plate reader using "Prot. #31" (as per manufacturer instructions).
C. Melanophore Screening Assay A method for identifying candidate agonists or inverse agonists for a GPCR can be preformed by introducing tests cells of a pigment cell line capable of dispersing or aggregating their pigment in response to a specific stimulus and expressing an exogenous clone coding for the GCPR. A stimulant, e.g., light, sets an initial state of pigment disposition wherein the pigment is aggregated within the test cells if activation of the GPCR
induces pigment dispersion. However, stimulating the cell with a stimulant to set an initial state of pigment disposition wherein the pigment is dispersed if activation of the GPCR
induces pigment aggregation. The tests cells are then contacted with chemical compounds, and it is determined whether the pigment disposition in the cells changed from the initial state of pigment disposition. Dispersion of pigments cells due to the candidate compound coupling to the GPCR will appear dark on a petri dish, while aggregation of pigments cells will appear light.
Materials and methods will be followed according to the disclosure of U.S.
Patent Number 5,462,856 and U.S. Patent Number 6,051,386, each of which are incorporated by reference.
Although a variety of expression vectors are available to those in the art, for purposes of utilization for both the endogenous and non-endogenous human GPCRs, in some embodiments it is preferred that the vector utilized be pCMV. This vector was deposited with the American Type Culture Collection (ATCC) on October 13, 1998 (10801 University Blvd., Manassas, VA 20110-2209 USA) under the provisions of the Budapest Treaty for the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. The DNA was tested by the ATCC and determined to be viable. The ATCC has assigned the following deposit number to pCMV: ATCC #203351.
References cited throughout this patent document, including co-pending and related patent applications, unless otherwise indicated, are fully incorporated herein by reference.
Modifications and extension of the disclosed inventions that are within the purview of the skilled artisan are encompassed within the above disclosure and the claims that follow.

SEQUENCE LISTING
<110> Arena Pharmaceuticals, Inc.
<120> Endogenous And Non-Endogenous, Constitutively Activated Human G Protein-Coupled Receptors <130> AREN-0309 <150> 09/170,496 <151> 1998-10-13 <150> PCT/US99/23938 <151> 1998-10-13 <150> 60/253,404 <151> 2000-11-27 <150> 60/255,366 <151> 2000-12-12 <150> 60/270,286 <151> 2001-02-20 <150> 60/282,365 <151> 2001-04-06 <150> 60/270,266 <151> 2001-02-20 <150> 60/282,032 <151> 2001-04-06 <150> 60/282,358 <151> 2001-04-06 <150> 60/282,356 <151> 2001-04-06 <150> 60/290,917 <151> 2001-05-14 <150> 60/309,208 <151> 2001-07-31 <160> 67 <170> Patentln version.3.1 <210> 1 <211> 1002 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 1 atgtggagct gcagctggtt caacggcaca gggctggtgg aggagctgcc tgcctgccag 60 gacctgcagc tggggctgtc actgttgtcg ctgctgggcc tggtggtggg cgtgccagtg 120 ggcctgtgct acaacgccct gctggtgctg gccaacctac acagcaaggc cagcatgacc 180 atgccggacg tgtactttgt caacatggca gtggcaggcc tggtgctcag cgccctggcc 240 cctgtgcacctgctcggccccccgagctcccggtgggcgctgtggagtgtgggcggcgaa300 gtccacgtggcactgcagatccccttcaatgtgtcctcactggtggccatgtactccacc360 gccctgctgagcctcgaccactacatcgagcgtgcactgccgcggacctacatggccagc420 gtgtacaacacgcggcacgtgtgcggcttcgtgtggggtggcgcgctgctgaccagcttc480 tcctcgctgctcttctacatctgcagccatgtgtccacccgcgcgctagagtgcgccaag540 atgcagaacgcagaagctgccgacgccacgctggtgttcatcggctacgtggtgccagca600 ctggccaccctctacgcgctggtgctactctcccgcgtccgcagggaggacacgcccctg660 gaccgggacacgggccggctggagccctcggcacacaggctgctggtggccaccgtgtgc720 acgcagtttgggctctggacgccacactatctgatcctgctggggcacacggtcatcatc780 tcgcgagggaagcccgtggacgcacactacctggggctactgcactttgtgaaggatttc840 tccaaactcctggccttctccagcagctttgtgacaccacttctctaccgctacatgaac900 cagagcttccccagcaagctccaacggctgatgaaaaagctgccctgcggggaccggcac960 tgctccccggaccacatgggggtgcagcaggtgctggcgtcg 1002 <210> 2 <211> 333 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 2 Met Trp Ser Cys Ser Trp Phe Asn Gly Thr Gly Leu Val Glu Glu Leu Pro Ala Cys Gln Asp Leu Gln Leu Gly Leu Ser Leu Leu Ser Leu Leu Gly Leu Val Val Gly Val Pro Val Gly Leu Cys Tyr Asn Ala Leu Leu Val Leu Ala Asn Leu His Ser Lys Ala Ser Met Thr Met Pro Asp Val Tyr Phe Val Asn Met Ala Val Ala Gly Leu Val Leu Ser Ala Leu Ala Pro Val His Leu Leu Gly Pro Pro Ser Ser Arg Trp Ala Leu Trp Ser Val Gly Gly Glu Val His Val Ala Leu Gln Ile Pro Phe Asn Val Ser Ser Leu Val Ala Met Tyr Ser Thr Ala Leu Leu Ser Leu Asp His Tyr Ile Glu Arg Ala Leu Pro Arg Thr Tyr Met Ala Ser Val Tyr Asn Thr l30 135 140 Arg His Val Cys Gly Phe Val Trp Gly Gly A1a Leu Leu Thr Ser Phe Ser Ser Leu Leu Phe Tyr Ile Cys Ser His Val Ser Thr Arg Ala Leu Glu Cys Ala Lys Met Gln Asn Ala Glu Ala Ala Asp Ala Thr Leu Val Phe Tle Gly Tyr Val Val Pro Ala Leu Ala Thr Leu Tyr Ala Leu Val Leu Leu Ser Arg Val Arg Arg Glu Asp Thr Pro Leu Asp Arg Asp Thr Gly Arg Leu Glu Pro Ser Ala His Arg Leu Leu Val Ala Thr Va1 Cys Thr Gln Phe Gly Leu Trp Thr Pro His Tyr Leu Ile Leu Leu Gly His Thr Val Ile Ile Ser Arg G1y Lys Pro Val Asp Ala His Tyr Leu Gly Leu Leu His Phe Val Lys Asp Phe Ser Lys Leu Leu Ala Phe Ser Ser Ser Phe Val Thr Pro Leu Leu Tyr Arg Tyr Met Asn Gln Ser Phe Pro Ser Lys Leu Gln Arg Leu Met Lys Lys Leu Pro Cys Gly Asp Arg His Cys Ser Pro Asp His Met Gly Val Gln Gln Val Leu Ala <210> 3 <211> 918 <212> DNA
<213> Homo Sapiens <400> 3 atgcctggcc acaatacctc caggaattcc tcttgcgatc ctatagtgac accccactta 60 atcagcctct acttcatagt gcttattggc gggctggtgg gtgtcatttc cattcttttc 120 ctcctggtga aaatgaacac ccggtcagtg accaccatgg cggtcattaa cttggtggtg 180 gtccacagcgtttttctgctgacagtgccatttcgcttgacctacctcatcaagaagact240 tggatgtttgggctgcccttctgcaaatttgtgagtgccatgctgcacatccacatgtac300 ctcacgttcctattctatgtggtgatcctggtcaccagatacctcatcttcttcaagtgc360 aaagacaaagtgg attctacagaaaactgcatgctgtggctgccagtgctggcatgtgg420 acgctggtgattgtcattgtggtacccctggttgtctcccggtatggaatccatgaggaa480 tacaatgaggagcactgttttaaatttcacaaagagcttgcttacacatatgtgaaaatc540 atcaactatatgatagtcatttttgtcatagccgttgctgtgattctgttggtcttccag600 gtcttcatcattatgttgatggtgcagaagctacgccactctttactatcccaccaggag660 ttctgggctcagctgaaaaacctattttttataggggtcatccttgtttgtttccttccc720 taccagttctttaggatctattacttgaatgttgtgacgcattccaatgcctgtaacagc780 aaggttgcattttataacgaaatcttcttgagtgtaacagcaattagctgctatgatttg840 cttctctttgtctttgggggaagccattggtttaagcaaaagataattggcttatggaat900 tgtgttttgtgccgttag <210> 4 <211> 305 <2l2> PRT
<213> Homo Sapiens <400> 4 Met Pro Gly His Asn Thr Ser Arg Asn Ser Ser Cys Asp Pro Ile Val Thr Pro His Leu I1e Ser Leu Tyr Phe Ile Val Leu Ile Gly Gly Leu Val Gly Val Ile Ser I1e Leu Phe Leu Leu Val Lys Met Asn Thr Arg Ser Val Thr Thr Met Ala Val Ile Asn Leu Val Val Val His Ser Val Phe Leu Leu Thr Val Pro Phe Arg Leu Thr Tyr Leu Ile Lys Lys Thr Trp Met Phe Gly Leu Pro Phe Cys Lys Phe Val Ser Ala Met Leu His Ile His Met Tyr Leu Thr Phe Leu Phe Tyr Val Val Ile Leu Va1 Thr Arg Tyr Leu Ile Phe Phe Lys Cys Lys Asp Lys Val Glu Phe Tyr Arg Lys Leu His Ala Val Ala A1a Ser Ala Gly Met Trp Thr Leu Val Ile Val Ile Val Val Pro Leu Val Val Ser Arg Tyr Gly Ile His Glu Glu Tyr Asn Glu Glu His Cys Phe Lys Phe His Lys Glu Leu Ala Tyr Thr Tyr Val Lys Tle Ile Asn Tyr Met Ile Val Ile Phe Val 21e Ala Val Ala Val Ile Leu Leu Val Phe Gln Val Phe Tle Ile Met Leu Met Val Gln Lys Leu Arg His Ser Leu Leu Ser His Gln Glu Phe Trp Ala Gln Leu Lys Asn Leu Phe Phe Ile Gly Val Ile Leu Val Cys Phe Leu Pro Tyr Gln Phe Phe Arg Ile Tyr Tyr Leu Asn Val Val Thr His Ser Asn Ala Cys Asn Ser Lys Val Ala Phe Tyr Asn Glu Ile Phe Leu Ser Val Thr Ala Ile Ser Cys Tyr Asp Leu Leu Leu Phe Val Phe Gly Gly Ser His Trp Phe Lys Gln Lys Ile Ile Gly Leu Trp Asn Cys Val Leu Cys Arg <210> 5 <211> 1125 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 5 atgctgacag ggagctgcgg ggaccctcag aaaaagccac aggtgaccca ggactcaggg 60 ccccagagca tggggcttga gggacgagag acagctggcc agccacgagt gaccctgctg 120 cccacgccca acgtcagcgg gctgagccag gagtttgaaa gccactggcc agagatcgca 180 gagaggtccc cgtgtgtggc tggcgtcatc cctgtcatct actacagtgt cctgctgggc 240 ttggggctgcctgtcagcctcctgaccgcagtggccctggcgcgccttgccaccaggacc300 aggaggccctcctactactaccttctggcgctcacagcctcggatatcatcatccaggtg360 gtcatcgtgttcgcgggcttcctcctgcagggagcagtgctggcccgccaggtgccccag420 gctgtggtgcgcacggccaacatcctggagtttgctgccaaccacgcctcagtctggatc480 gccatcctgctcacggttgaccgctacactgccctgtgccaccccctgcaccatcgggcc540 gcctcgtccccaggccggacccgccgggccattgctgctgtcctgagtgctgccctgttg600 accggcatccccttctactggtggctggacatgtggagagacaccgactcacccagaaca660 ctggacgaggtcctcaagtgggctcactgtctcactgtctatttcatcccttgtggcgtg720 ttcctggtcaccaactcggccatcatccaccggctacggaggaggggccggagtgggctg780 cagccccgggtgggcaagagcacagccatcctcctgggcatcaccacactgttcaccctc840 ctgtgggcgccccgggtcttcgtcatgctctaccacatgtacgtggcccctgtccaccgg900 gactggagggtccacctggccttggatgtggccaatatggtggccatgctccacacggca960 gccaacttcggcctctactgctttgtcagcaagactttccgggccactgtccgacaggtc1020 atccacgatgcctacctgccctgcactttggcatcacagccagagggcatggcggcgaag1080 cctgtgatggagcctccgggactccccacaggggcagaagtgtag 1125 <210> 6 <211> 374 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 6 Met Leu Thr Gly Ser Cys Gly Asp Pro Gln Lys Lys Pro Gln Val Thr Gln Asp Ser Gly Pro Gln Ser Met Gly Leu Glu Gly Arg Glu Thr Ala Gly Gln Pro Arg Val Thr Leu Leu Pro Thr Pro Asn Val Ser G1y Leu Ser Gln Glu Phe Glu Ser His Trp Pro Glu Ile Ala Glu Arg Ser Pro Cys Val Ala Gly Val Ile Pro Val Ile Tyr Tyr Ser Val Leu Leu Gly Leu Gly Leu Pro Val Ser Leu Leu Thr Ala Val Ala Leu Ala Arg Leu Ala Thr Arg Thr Arg Arg Pro Ser Tyr Tyr Tyr Leu Leu Ala Leu Thr Ala Ser Asp Ile Ile Ile Gln Val Val Ile Val Phe Ala Gly Phe Leu Leu Gln Gly Ala Va1 Leu Ala Arg Gln Val Pro Gln Ala Val Val Arg Thr Ala Asn Ile Leu Glu Phe Ala Ala Asn His Ala Ser Val Trp Ile Ala Ile Leu Leu Thr Val Asp Arg Tyr Thr Ala Leu Cys His Pro Leu His His Arg Ala Ala Ser Ser Pro Gly Arg Thr Arg Arg Ala Ile Ala Ala Val Leu Ser Ala Ala Leu Leu Thr Gly Ile Pro Phe Tyr Trp Trp Leu Asp Met Trp Arg Asp Thr Asp Ser Pro Arg Thr Leu Asp Glu Val Leu Lys Trp Ala His Cys Leu Thr Val Tyr Phe Ile Pro Cys Gly Val Phe Leu Val Thr Asn Ser Ala Ile Ile His Arg Leu Arg Arg Arg Gly Arg Ser Gly Leu Gln Pro Arg Val Gly Lys Ser Thr Ala Ile Leu Leu Gly I12 Thr Thr Leu Phe Thr Leu Leu Trp Ala Pro Arg Val Phe Val Met Leu Tyr His Met Tyr Val Ala Pro Val His Arg Asp Trp Arg Val His Leu Ala Leu Asp Va1 Ala Asn Met Val Ala Met Leu His Thr Ala Ala Asn Phe Gly Leu Tyr Cys Phe Val Ser Lys Thr Phe Arg Ala Thr Val Arg Gln Val Ile His Asp Ala Tyr Leu Pro Cys Thr Leu Ala Ser Gln Pro Glu Gly Met Ala Ala Lys Pro Val Met Glu Pro Pro Gly Leu Pro Thr Gly Ala Glu Val <210> 7 <211> 1086 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400>

atgtccactgaatgcgcgcgggcagcgggcgacgcgcccttgcgcagcctggagcaagcc60 aaccgcacccgctttcccttcttctccgacgtcaagggcgaccaccggctggtgctggcc120 gcggtggagacaaccgtgctggtgctcatctttgcagtgtcgctgctgggcaacgtgtgc180 gccctggtgctggtggcgcgccgacgacgccgcggcgcgactgcctgcctggtactcaac240 ctcttctgcgcggacctgctcttcatcagcgctatccctctggtgctggccgtgcgctgg300 actgaggcctggctgctgggccccgttgcctgccacctgctcttctacgtgatgaccctg360 agcggcagcgtcaccatcctcacgctggccgcggtcagcctggagcgcatggtgtgcatc420 gtgcacctgcagcgcggcgtgcggggtcctgggcggcgggcgcgggcagtgctgctggcg480 ctcatatggggctattcggcggtcgccgctctgcctctatgcgtcttcttccgagtcgtc540 ccgcaacggctccccggcgccgaccaggaaatttcgatttgcacactgatttggcccacc600 attcctggagagatctcgtgggatgtctcttttgttactttgaacttcttggtgccagga660 ctggtcattgtgatcagttactccaaaattttacagatcacaaaggcatcaaggaagagg720 ctcacggtaagcctggcctactcggagagccaccagatccgcgtgtcccagcaggacttc780 cggctcttccgcaccctcttcctcctcatggtctccttcttcatcatgtggagccccatc840 atcatcaccatcctcctcatcctgatccagaacttcaagcaagacctggtcatctggccg900 tccctcttcttctgggtggtggccttcacatttgctaattcagccctaaaccccatcctc960 tacaacatgacactgtgcaggaatgagtggaagaaaattttttgctgcttctggttccca1020 gaaaagggagccattttaacagacacatctgtcaaaagaaatgacttgtcgattatttct1080 ggctaa 1086 <210> 8 <211> 361 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 8 Met Ser Thr Glu Cys Ala Arg Ala Ala Gly Asp Ala Pro Leu Arg Ser Leu Glu Gln Ala Asn Arg Thr Arg Phe Pro Phe Phe Ser Asp Val Lys Gly Asp His Arg Leu Val Leu Ala Ala Val Glu Thr Thr Val Leu Val Leu Ile Phe Ala Val Ser Leu Leu Gly Asn Val Cys Ala Leu Val Leu Val Ala Arg Arg Arg Arg Arg Gly Ala Thr Ala Cys Leu Val Leu Asn Leu Phe Cys Ala Asp Leu Leu Phe Ile Ser A1a Ile Pro Leu Val Leu Ala Val Arg Trp Thr Glu Ala Trp Leu Leu Gly Pro Val Ala Cys His Leu Leu Phe Tyr Val Met Thr Leu Ser Gly Ser Val Thr Ile Leu Thr Leu Ala Ala Val Ser Leu Glu Arg Met Val Cys Ile Val His Leu Gln Arg Gly Val Arg Gly Pro Gly Arg Arg Ala Arg Ala Val Leu Leu Ala Leu Ile Trp Gly Tyr Ser Ala Val Ala Ala Leu Pro Leu Cys Val Phe Phe Arg Val Val Pro Gln Arg Leu Pro Gly Ala Asp G1n Glu Ile Ser Ile Cys Thr Leu Ile Trp Pro Thr Ile Pro Gly Glu Ile Ser Trp Asp Val Ser Phe Val Thr Leu Asn Phe Leu Val Pro Gly Leu Val Ile Val Ile Ser Tyr Ser Lys Ile Leu Gln Tle Thr Lys Ala Ser Arg Lys Arg Leu Thr Val Ser Leu Ala Tyr Ser Glu Ser His Gln Ile Arg Val Ser Gln Gln Asp Phe Arg Leu Phe Arg Thr Leu Phe Leu Leu Met Val Ser Phe Phe Ile Met Trp Ser Pro Ile Ile Ile Thr Ile Leu Leu Ile Leu Ile Gln Asn Phe Lys Gln Asp Leu Val Ile Trp Pro Ser Leu Phe Phe Trp Val Val Ala Phe Thr Phe Ala Asn Ser Ala Leu Asn Pro Ile Leu Tyr Asn Met Thr Leu Cys Arg Asn Glu Trp Lys Lys Ile Phe Cys Cys Phe Trp Phe Pro Glu Lys Gly Ala Ile Leu Thr Asp Thr Ser Val Lys Arg Asn Asp Leu Ser Ile Ile Ser Gly <210> 9 <211> 1038 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400>

atgagcagcaattcatccctgctggtggctgtgcagctgtgctacgcgaacgtgaatggg60 tcctgtgtgaaaatccccttctcgccgggatcccgggtgattctgtacatagtgtttggc120 tttggggctgtgctggctgtgtttggaaacctcctggtgatgatttcaatcctccatttc180 aagcagctgcactctccgaccaattttctcgttgcctctctggcctgcgctgatttcttg240 gtgggtgtgactgtgatgcccttcagcatggtcaggacggtggagagctgctggtatttt300 gggaggagtttttgtactttccacacctgctgtgatgtggcattttgttactcttctctc360 tttcacttgtgcttcatctccatcgacaggtacattgcggttactgaccccctggtctat420 cctaccaagttcaccgtatctgtgtcaggaatttgcatcagcgtgtcctggatcctgccc480 ctcatgtacagcggtgctgtgttctacacaggtgtctatgacgatgggctggaggaatta540 tctgatgccctaaactgtataggaggttgtcagaccgttgtaaatcaaaactgggtgttg600 acagattttctatccttctttatacctacctttattatgataattctgtatggtaacata660 tttcttgtggctagacgacaggcgaaaaagatagaaaatactggtagcaagacagaatca720 tcctcagagagttacaaagccagagtggccaggagagagagaaaagcagctaaaaccctg780 ggggtcacagtggtagcatttatgatttcatggttaccatatagcattgattcattaatt840 gatgcctttatgggctttataacccctgcctgtatttatgagatttgctgttggtgtgct900 tattataactcagccatgaatcctttgatttatgctttattttacccatggtttaggaaa960 gcaataaaagttattgtaactggtcaggttttaaagaacagttcagcaaccatgaatttg1020 ttttctgaac atatataa 1038 <210> 10 <211> 345 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 10 Met Ser Ser Asn Ser Ser Leu Leu Val Ala Val Gln Leu Cys Tyr Ala Asn Val Asn Gly Ser Cys Val Lys Ile Pro Phe Ser Pro Gly Ser Arg Val Ile Leu Tyr Ile Val Phe Gly Phe Gly Ala Val Leu Ala Val Phe Gly Asn Leu Leu Val Met Ile Ser Ile Leu His Phe Lys Gln Leu His Ser Pro Thr Asn Phe Leu Val Ala Ser Leu Ala Cys Ala Asp Phe Leu Val Gly Val Thr Val Met Pro Phe Ser Met Val Arg Thr Val Glu Ser Cys Trp Tyr Phe Gly Arg Ser Phe Cys Thr Phe His Thr Cys Cys Asp Val Ala Phe Cys Tyr Ser Ser Leu Phe His Leu Cys Phe Ile Ser Ile Asp Arg Tyr Ile Ala Val Thr Asp Pro Leu Val Tyr Pro Thr Lys Phe Thr Val Ser Val Ser Gly Ile Cys Ile Ser Val Ser Trp Ile Leu Pro Leu Met Tyr Ser Gly Ala Val Phe Tyr Thr Gly Val Tyr Asp Asp Gly Leu Glu Glu Leu 5er Asp Ala Leu Asn Cys Ile Gly Gly Cys Gln Thr Val Val Asn Gln Asn Trp Val Leu Thr Asp Phe Leu Ser Phe Phe Ile Pro Thr Phe Ile Met Ile Ile Leu Tyr Gly Asn Tle Phe Leu Val Ala Arg Arg Gln Ala Lys Lys Ile Glu Asn Thr Gly Ser Lys Thr Glu Ser Ser Ser G1u Ser Tyr Lys Ala Arg Val Ala Arg Arg Glu Arg Lys Ala Ala Lys Thr Leu Gly Val Thr Val Val Ala Phe Met Ile Ser Trp Leu Pro Tyr Ser Ile Asp Ser Leu Ile Asp Ala Phe Met Gly Phe Ile Thr Pro Ala Cys Ile Tyr Glu Ile Cys Cys Trp Cys Ala Tyr Tyr Asn Ser Ala Met Asn Pro Leu Ile Tyr Ala Leu Phe Tyr Pro Trp Phe Arg Lys Ala Ile Lys Val Ile Val Thr Gly Gln Val Leu Lys Asn Ser Ser Ala Thr Met Asn Leu Phe Ser Glu His Ile <210>
<211>

<212>
DNA

<213>
Artificial Sequence <220>

<223>
Novel Sequence <400>

atgatgcccttttgccacaatataattaatatttcctgtgtgaaaaacaactggtcaaat60 gatgtccgtgettccctgtacagtttaatggtgctcataattctgaccacactcgttggc120 aatctgatagttattgtttctatatcacacttcaaacaacttcataccccaacaaattgg180 ctcattcattccatggccactgtggactttcttctggggtgtctggtcatgccttacagt240 atggtgagatctgctgagcactgttggtattttggagaagtcttctgtaaaattcacaca300 agcaccgacattatgctgagctcagcctccattttccatttgtctttcatctccattgac360 cgctactatgctgtgtgtgatccactgagatataaagccaagatgaatatcttggttatt420 tgtgtgatgatcttcattagttggagtgtccctgctgtttttgcatttggaatgatcttt480 ctggagctaaacttcaaaggcgctgaagagatatattacaaacatgttcactgcagagga540 ggttgctctgtcttctttagcaaaatatctggggtactgacctttatgacttctttttat600 atacctggatctattatgttatgtgtctattacagaatatatcttatcgctaaagaacag660 gcaagattaattagtgatgccaatcagaagctccaaattggattggaaatgaaaaatgga720 atttcacaaagcaaagaaaggaaagctgtgaagacattggggattgtgatgggagttttc780 ctaatatgctggtgccctttctttatctgtacagtcatggacccttttcttcactacatt840 attccacctactttgaatgatgtattgatttggtttggctacttgaactctacatttaat900 ccaatggtttatgcatttttctatccttggtttagaaaagcactgaagatgatgctgttt960 ggtaaaattttccaaaaagattcatccaggtgtaaattatttttggaattgagttcatag1020 <210> 12 <211> 339 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 12 Met Met Pro Phe Cys His Asn Ile Tle Asn Tle Ser Cys Val Lys Asn Asn Trp Ser Asn Asp Val Arg Ala Ser Leu Tyr Ser Leu Met Val Leu Ile Ile Leu Thr Thr Leu Val Gly Asn Leu Ile Val Ile Val Ser Ile Ser His Phe Lys Gln Leu His Thr Pro Thr Asn Trp Leu Ile His Ser Met Ala Thr Val Asp Phe Leu Leu Gly Cys Leu Val Met Pro Tyr Ser Met Val Arg Ser Ala Glu His Cys Trp Tyr Phe Gly Glu Val Phe Cys Lys Ile His Thr Ser Thr Asp Ile Met Leu Ser Ser Ala Ser Ile Phe His Leu Ser Phe Ile Ser Ile Asp Arg Tyr Tyr Ala Val Cys Asp Pro Leu Arg Tyr Lys Ala Lys Met Asn Ile Leu Val Ile Cys Val Met Ile Phe Ile Ser Trp Ser Val Pro Ala Val Phe Ala Phe Gly Met Ile Phe Leu Glu Leu Asn Phe Lys Gly Ala Glu Glu Ile Tyr Tyr Lys His Val His Cys Arg Gly Gly Cys Ser Val Phe Phe Ser Lys Ile Ser Gly Val Leu Thr Phe Met Thr Ser Phe Tyr Ile Pro Gly Ser I1e Met Leu Cys Val Tyr Tyr Arg Tle Tyr Leu Ile Ala Lys Glu Gln Ala Arg Leu Ile Ser Asp Ala Asn Gln Lys Leu Gln Ile Gly Leu Glu Met Lys Asn Gly Ile Ser Gln Ser Lys Glu Arg Lys Ala Val Lys Thr Leu Gly Ile Val Met Gly Val Phe Leu Ile Cys Trp Cys Pro Phe Phe Ile Cys Thr Val Met Asp Pro Phe Leu His Tyr Ile Ile Pro Pro Thr Leu Asn Asp Val Leu Ile Trp Phe Gly Tyr Leu Asn Ser Thr Phe Asn Pro Met Val Tyr Ala Phe Phe Tyr Pro Trp Phe Arg Lys Ala Leu Lys Met Met Leu Phe Gly Lys Ile Phe Gln Lys Asp Ser Ser Arg Cys Lys Leu Phe Leu Glu Leu Ser Ser <210> 13 <211> 1029 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400>

atgaccagcaatttttcccaacctgttgtgcagctttgctatgaggatgtgaatggatct60 tgtattgaaactccctattctcctgggtcccgggtaattctgtacacggcgtttagcttt120 gggtctttgctggctgtatttggaaatctcttagtaatgacttctgttcttcattttaag180 cagctgcactctccaaccaattttctcattgcctctctggcctgtgctgacttcttggta240 ggtgtgactgtgatgcttttcagcatggtcaggacggtggagagctgctggtattttgga300 gccaaattttgtactcttcacagttgctgtgatgtggcattttgttactcttctgtcctc360 cacttgtgcttcatctgcatcgacaggtacattgtggttactgatcccctggtctatgct420 accaagttcaccgtgtctgtgtcgggaatttgcatcagcgtgtcctggattctgcctctc480 acgtacagcggtgctgtgttctacacaggtgtcaatgatgatgggctggaggaattagta540 agtgctctcaactgcgtaggtggctgtcaaattattgtaagtcaaggctgggtgttgata600 gattttctgttattcttcatacctacccttgttatgataattctttacagtaagattttt660 cttatagctaaacaacaagctataaaaattgaaactactagtagcaaagtagaatcatcc720 tcagagagttataaaatcagagtggccaagagagagaggaaagcagctaaaaccctgggg780 gtcacggtactagcatttgttatttcatggttaccgtatacagttgatatattaattgat840 gcctttatgggcttcctgacccctgcctatatctatgaaatttgctgttggagtgcttat900 tataactcagccatgaatcctttgatttatgctctattttatccttggtttaggaaagcc960 ataaaacttattttaagtggagatgttttaaaggctagttcatcaaccattagtttattt1020 ttagaataa 1029 <210> 14 <211> 342 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 14 Met Thr Ser Asn Phe Ser Gln Pro Val Val Gln Leu Cys Tyr Glu Asp Val Asn Gly Ser Cys Ile Glu Thr Pro Tyr Ser Pro Gly Ser Arg Val Ile Leu Tyr Thr Ala Phe Ser Phe Gly Ser Leu Leu Ala Val Phe Gly Asn Leu Leu Val Met Thr Ser Val Leu His Phe Lys Gln Leu His Ser Pro Thr Asn Phe Leu Ile,Ala Ser Leu Ala Cys Ala Asp Phe Leu Val G1y Val Thr Val Met Leu Phe Ser Met Val Arg Thr Val Glu Ser Cys Trp Tyr Phe Gly Ala Lys Phe Cys Thr Leu His Ser Cys Cys Asp Val Ala Phe Cys Tyr Ser Ser Va1 Leu His Leu Cys Phe Ile Cys Ile Asp Arg Tyr Ile Val Val Thr Asp Pro Leu Val Tyr Ala Thr Lys Phe Thr Val Ser Val Ser Gly Ile Cys Ile Ser Val Ser Trp Tle Leu Pro Leu Thr Tyr Ser G1y Ala Val Phe Tyr Thr Gly Val Asn Asp Asp Gly Leu l65 170 175 Glu Glu Leu Val Ser Ala Leu Asn Cys Val Gly Gly Cys Gln Ile Ile Val Ser Gln Gly Trp Val Leu Ile Asp Phe Leu Leu Phe Phe Ile Pro Thr Leu Val Met Ile Ile Leu Tyr Ser Lys Ile Phe Leu Ile Ala Lys Gln Gln Ala Ile Lys Tle Glu Thr Thr Ser Ser Lys Val Glu Ser Ser Ser Glu Ser Tyr Lys Ile Arg Val Ala Lys Arg Glu Arg Lys Ala Ala Lys Thr Leu Gly Val Thr Val Leu Ala Phe Val Ile Sex Trp Leu Pro Tyr Thr Val Asp Ile Leu Ile Asp Ala Phe Met G1y Phe Leu Thr Pro Ala Tyr Ile Tyr Glu Ile Cys Cys Trp Ser Ala Tyr Tyr Asn Ser A1a Met Asn Pro Leu Ile Tyr Ala Leu Phe Tyr Pro Trp Phe Arg Lys Ala Ile Lys Leu Ile Leu Ser Gly Asp Val Leu Lys Ala Ser Ser Ser Thr Tle Ser Leu Phe Leu Glu <210> 15 <211> 1062 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 15 atggagcacacgcacgcccacctcgcagccaacagctcgctgtcttggtggtcccccggc60 tcggcctgcggcttgggtttcgtgcccgtggtctactacagcctcttgctgtgcctcggt120 ttaccagcaaatatcttgacagtgatcatcctctcccagctggtggcaagaagacagaag180 tcctcctacaactatctcttggcactcgctgctgccgacatcttggtcctctttttcata240 gtgtttgtggacttcctgttggaagatttcatcttgaacatgcagatgcctcaggtcccc300 gacaagatcatagaagtgctggaattctcatccatccacacctccatatggattactgta360 ccgttaaccattgacaggtatatcgctgtctgccacccgctcaagtaccacacggtctca420 tacccagcccgcacccggaaagtcattgtaagtgtttacatcacctgcttcctgaccagc480 atcccctattactggtggcccaacatctggactgaagactacatcagcacctctgtgcat540 cacgtcctcatctggatccactgcttcaccgtctacctggtgccctgctccatcttcttc600 atcttgaactcaatcattgtgtacaagctcaggaggaagagcaattttcgtctccgtggc660 tactccacggggaagaccaccgccatcttgttcaccattacctccatctttgccacactt720 tgggccccccgcatcatcatgattctttaccacctctatggggcgcccatccagaaccgc780 tggctggtgcacatcatgtccgacattgccaacatgctagcccttctgaacacagccatc840 aacttcttcctctactgcttcatcagcaagcggttccgcaccatggcagccgccacgctc900 aaggctttcttcaagtgccagaagcaacctgtacagttctacaccaatcataacttttcc960 ataacaagtagcccctggatctcgccggcaaactcacactgcatcaagatgctggtgtac1020 cagtatgacaaaaatggaaaacctataaaagtatccccgtga 1062 <210> 16 <211> 353 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 16 Met Glu His Thr His Ala His Leu Ala Ala Asn Ser Ser Leu Ser Trp Trp Ser Pro Gly Ser Ala Cys Gly Leu Gly Phe Val Pro Val Val Tyr Tyr,Ser Leu Leu Leu Cys Leu Gly Leu Pro Ala Asn Ile Leu Thr Val Ile Ile Leu Ser Gln Leu Val Ala Arg Arg Gln Lys Ser Ser Tyr Asn Tyr Leu Leu Ala Leu Ala Ala Ala Asp Ile Leu Val Leu Phe Phe Ile 65 70 75 _ 80 Val Phe Val Asp Phe Leu Leu Glu Asp Phe Ile Leu Asn Met Gln Met Pro Gln Val Pro Asp Lys Ile Ile Glu Val Leu Glu Phe Ser Ser Ile His Thr Ser Ile Trp Ile Thr Val Pro Leu Thr Ile Asp Arg Tyr Ile Ala Val Cys His Pro Leu Lys Tyr His Thr Val Ser Tyr Pro Ala Arg Thr Arg Lys Val Ile Val Ser Val Tyr Ile Thr Cys Phe Leu Thr 5er Ile Pro Tyr Tyr Trp Trp Pro Asn Ile Trp Thr G1u Asp Tyr Ile Ser Thr Ser Val His His Val Leu Ile Trp Ile His Cys Phe Thr Val Tyr Leu Val Pro Cys Ser Ile Phe Phe Ile Leu Asn Ser Ile Ile Val Tyr Lys Leu Arg Arg Lys Ser Asn Phe Arg Leu Arg Gly Tyr Ser Thr Gly Lys Thr Thr Ala Ile Leu Phe Thr Ile Thr Ser Ile Phe Ala Thr Leu 225 230 235 ' 240 Trp Ala Pro Arg Ile Ile Met Ile Leu Tyr His Leu Tyr Gly A1a Pro Ile Gln Asn Arg Trp Leu Val His Ile Met Ser Asp Ile Ala Asn Met Leu Ala Leu Leu Asn Thr Ala Ile Asn Phe Phe Leu Tyr Cys Phe Ile Ser Lys Arg Phe Arg Thr Met Ala A1a Ala Thr Leu Lys Ala Phe Phe Lys Cys Gln Lys Gln Pro Val Gln Phe Tyr Thr Asn His Asn Phe Ser Tle Thr 5er Ser Pro Trp Ile Ser Pro Ala Asn Ser His Cys Ile Lys Met Leu Val Tyr Gln Tyr Asp Lys Asn Gly Lys Pro Ile Lys Val Ser Pro <2l0> 17 <211> 969 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400>

atggatccaaccgtcccagtcttcggtacaaaactgacaccaatcaacggacgtgaggag60 actccttgctacaatcagaccctgagcttcacggtgctgacgtgcatcatttcccttgtc120 ggactgacaggaaacgcggtagtgctctggctcctgggctaccgcatgcgcaggaacgct180 gtctccatctacatcctcaacctggccgcagcagacttcctcttcctcagcttccagatt240 atacgttcgccattacgcctcatcaatatcagccatctcatccgcaaaatcctcgtttct300 gtgatgacctttccctactttacaggcctgagtatgctgagcgccatcagcaccgagcgc360 tgcctgtctgttctgtggcccatctggtaccgctgccgccgccccacacacctgtcagcg420 gtcgtgtgtgtcctgctctggggcctgtccctgctgtttagtatgctggagtggaggttc480 tgtgacttcctgtttagtggtgctgattctagttggtgtgaaacgtcagatttcatccca540 gtcgcgtggctgatttttttatgtgtggttctctgtgtttccagcctggtcctgctggtc600 aggatcctctgtggatcccggaagatgccgctgaccaggctgtacgtgaccatcctgctc660 acagtgctggtcttcctcctctgcggcctgcccttcggcattctgggggccctaatttac720 aggatgcacctgaatttggaagtcttatattgtcatgtttatctggtttgcatgtccctg780 tcctctctaaacagtagtgccaaccccatcatttacttcttcgtgggctcctttaggcag840 cgtcaaaataggcagaacctgaagctggttctccagagggctctgcaggacaagcctgag900 gtggataaaggtgaagggcagcttcctgaggaaagcctggagctgtcgggaagcagattg960 gggccatga 969 <210> 18 <211> 322 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 18 Met Asp Pro Thr Val Pro Val Phe Gly Thr Lys Leu Thr Pro Ile Asn Gly Arg Glu Glu Thr Pro Cys Tyr Asn Gln Thr Leu Ser Phe Thr Val Leu Thr Cys Ile Ile Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val Leu Trp Leu~Leu Gly Tyr Arg Met Arg Arg Asn Ala Val Ser Ile Tyr Ile Leu Asn Leu Ala Ala Ala Asp Phe Leu Phe Leu Ser Phe Gln Ile Ile Arg Ser Pro Leu Arg Leu Ile Asn Ile Ser His Leu Ile Arg Lys I1e Leu Val Ser Val Met Thr Phe Pro Tyr Phe Thr Gly Leu Ser Met Leu Ser Ala Ile Ser Thr Glu Arg Cys Leu Ser Val Leu Trp Pro Ile Trp Tyr Arg Cys Arg Arg Pro Thr His Leu Ser Ala Val Val Cys Val Leu Leu Trp Gly Leu Ser Leu Leu Phe Ser Met Leu Glu Trp Arg Phe Cys Asp Phe Leu Phe Ser G1y Ala Asp Ser Ser Trp Cys Glu Thr Ser Asp Phe Ile Pro Val Ala Trp Leu Ile Phe Leu Cys Val Va1 Leu Cys Val Ser Ser Leu Val Leu Leu Val Arg Ile Leu Cys Gly Ser Arg Lys Met Pro Leu Thr Arg Leu Tyr Val Thr Ile Leu Leu Thr Val Leu Val 2l0 215 220 Phe Leu Leu Cys Gly Leu Pro Phe Gly Ile Leu Giy Ala Leu Ile Tyr Arg Met His Leu Asn Leu Glu Val Leu Tyr Cys His Val Tyr Leu Val Cys Met Ser Leu Ser Ser Leu Asn Ser Ser Ala Asn Pro Ile Ile Tyr Phe Phe Val Gly Ser Phe Arg Gln Arg Gln Asn Arg Gln Asn Leu Lys Leu Val Leu Gln Arg Ala Leu Gln Asp Lys Pro Glu Val Asp Lys Gly Glu Gly Gln Leu Pro Glu Glu Ser Leu Glu Leu Ser Gly Ser Arg Leu Gly Pro <210> 19 <211> 969 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400>

atggattcaaccatcccagtcttgggtacagaactgacaccaatcaacggacgtgaggag60 actccttgctacaagcagaccctgagcttcacggggctgacgtgcatcgtttcccttgtb220 gcgctgacaggaaacgcggttgtgctctggctcctgggctgccgcatgcgcaggaacgct180 gtctccatctacatcctcaacctggtcgcggccgacttcctcttccttagcggccacatt240 atatgttcgccgttacgcctcatcaatatccgccatcccatctccaaaatcctcagtcct300 gtgatgacctttccctactttataggcctaagcatgctgagcgccatcagcaccgagcgc360 tgcctgtccatcctgtggcccatctggtaccactgccgccgccccagatacctgtcatcg420 gtcatgtgtgtcctgctctgggccctgtccctgctgcggagtatcctggagtggatgttc480 tgtgacttcctgtttagtggtgctgattctgtttggtgtgaaacgtcagatttcattaca540 atcgcgtggctggtttttttatgtgtggttctctgtgggtccagcctggtcctgctggtc600 aggattctctgtggatcccggaagatgccgctgaccaggctgtacgtgaccatcctcctc660 acagtgctggtcttcctcctctgtggcctgccctttggcattcagtgggccctgttttcc720 aggatccacctggattggaaagtcttattttgtcatgtgcatctagtttccattttcctg780 tccgctcttaacagcagtgccaaccccatcatttacttcttcgtgggctcctttaggcag840 cgtcaaaataggcagaacctgaagctggttctccagagggctctgcaggacacgcctgag900 gtggatgaaggtggagggtggcttcctcaggaaaccctggagctgtcgggaagcagattg960 gagcagtga 969 <210> 20 <211> 322 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 20 Met Asp Ser Thr Ile Pro Val Leu Gly Thr Glu Leu Thr Pro Ile Asn Gly Arg Glu Glu Thr Pro Cys Tyr Lys Gln Thr Leu Ser Phe Thr Gly Leu Thr Cys Ile Val Ser Leu Val Ala Leu Thr Gly Asn Ala Val Val Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Val Ser Tle Tyr 21e Leu Asn Leu Val Ala Ala Asp Phe Leu Phe Leu Ser Gly His 21e Ile Cys Ser Pro Leu Arg Leu Ile Asn Ile Arg His Pro Ile Ser Lys Ile Leu Ser Pro Val Met Thr Phe Pro Tyr Phe Ile Gly Leu Ser Met Leu Ser Ala Ile Ser Thr Glu Arg Cys Leu 5er Ile Leu Trp Pro Tle Trp Tyr His Cys Arg Arg Pro Arg Tyr Leu Ser Ser Val Met Cys Val Leu Leu Trp Ala Leu Ser Leu Leu Arg Ser Ile Leu Glu Trp Met Phe Cys Asp Phe Leu Phe Ser Gly Ala Asp Ser Val Trp Cys Glu Thr Ser 165 170 l75 Asp Phe Ile Thr Ile Ala Trp Leu Val Phe Leu Cys Val Val Leu Cys 180 185 l90 Gly Ser Ser Leu Val Leu Leu Val Arg Ile Leu Cys Gly Ser Arg Lys Met Pro Leu Thr Arg Leu Tyr Val Thr Ile Leu Leu Thr Val Leu Val Phe Leu Leu Cys Gly Leu Pro Phe Gly Ile Gln Trp Ala Leu Phe Ser Arg Ile His Leu Asp Trp Lys Val Leu Phe Cys His Val His Leu Val Ser Ile Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro Ile Ile Tyr Phe Phe Val Gly Ser Phe Arg Gln Arg Gln Asn Arg Gln Asn Leu Lys Leu Val Leu Gln Arg Ala Leu Gln Asp Thr Pro Glu Val Asp Glu Gly Gly Gly Trp Leu Pro Gln Glu Thr Leu Glu Leu Ser Gly Ser Arg Leu Glu Gln <210> 21 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 21 cagagctctg gtggccacct ctgtcc 26 <210> 22 <21l> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 22 ctgcgtccac cagagtcacg tctcc 25 <210> 23 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 23 gtatgcctgg ccacaatacc tccagg 26 <210> 24 <21l> 31 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence .
<400> 24 gtttgtggct aacggcacaa aacacaattc c 31 <210> 25 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 25 ggtaccacaa tgacaatcac cagcgtcc 28 <210> 26 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 26 ggaacgtgag gtacatgtgg atgtgcagc 2g <220> 27 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 27 gcagtgtagc ggtcaaccgt gagcagg 27 <210> 28 <211> 32 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 28 tgagcaggat ggcgatccag actgaggcgt gg 32 <210> 29 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 29 gaggtacagc tggcgatgct gacag 25 <210> 30 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 30 gtggccatga gccaccctga gctcc 25 <210> 3l <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 31 ggaatgtcca ctgaatgcgc gcgg 24 <210> 32 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 32 agctcgccag gtgtgagaaa ctcgg 25 <210> 33 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 33 gcgttatgag cagcaattca tccctgctgg 30 <210> 34 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 34 gtatcctgaa cttcgtctat acaactgc 28 <210> 35 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 35 ccctcaggaa tgatgccctt ttgccacaa 29 <210> 36 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 36 atccatgtgg ttggtgcatg tggttcgt 28 <210> 37 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 37 aaacaacaaa cagcagaacc atgaccagc 29 <210> 38 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 38 acatagagac aagtgacatg tgtgaaccac 30 <210> 39 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 39 ggtatgagac cgtgtggtac ttgagc 26 <210> 40 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 40 gtggcagaca gcgatatacc tgtcaatgg 29 <210> 41 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 41 gcgctcatgg agcacacgca cgcccac 27 <210> 42 <211> 25 <2l2> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 42 gaggcagtag ttgccacacc tatgg 25 <210> 43 <2l1> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 43 catctggttt gtgttcccag gggcaccag 29 <210> 44 <2l1> 32 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 44 gacagtgttg ctctcaaagt cccgtctgac tg 32 <210> 45 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 45 ctgtttccag ggtcatcaga ctggg 25 <210> 46 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 46 gcagcattgc tctcaaagtc ctgtctg 27 <210> 47 <211> &
<212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 47 Thr Leu Glu Ser Ile Met <210> 48 <211> 5 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 48 Glu Tyr Asn Leu Val <210> 49 <211> 5 <212> PRT
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 49 Asp Cys Gly Leu Phe <210> 50 <211> 34 <222> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 50 gatcaagctt ccatggcgtg ctgcctgagc gagg 34 <210> 51 <211> 53 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 51 gatcggatcc ttagaacagg ccgcagtcct tcaggttcag ctgcaggatg gtg 53 <210> 52 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 52 gtcctcactg gtggccatgt actcc 25 <210> 53 <21l> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 53 ctgcgtccac cagagtcacg tctcc 25 <210> 54 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 54 cttggatgtt tgggctgccc ttctgc 26 <210> 55 <211> 31 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 55 gtttgtggct aacggcacaa aacacaattc c 31 <210> 56 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 56 ctgctcacgg ttgaccgcta cactgc 26 <210> 57 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 57 gtggccatga gccaccctga gctcc 25 <210> 58 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 58 .
cttcttctcc gacgtcaagg 20 <210> 59 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 59 cttcttctcc gacgtcaagg 20 <210> 60 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 60 cttcttctcc gacgtcaagg , 20 <2l0> 61 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 61 caacggtctg acaacctcct 20 <210> 62 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 62 ttgctgtgat gtggcatttt g 21 <210> 63 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 63 caggaagccc ataaaggcat caa 23 <210> 64 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 64 acatcacctg cttcctgacc 20 <210> 65 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 65 ccagcatctt gatgcagtgt 20 <210> 66 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 66 ccatctccaa aatcctcagt c 21 <210> 67 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Novel Sequence <400> 67 gctgttaaga gcggacagga as 22

Claims (40)

What is claimed is:
1. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:2.
2. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 1.
3. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:1.
4. A host cell comprising the plasmid of claim 3.
5. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:4.
6. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 5.
7. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:3.
8. A host cell comprising the plasmid of claim 7.
9. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:6.
10. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 9.
11. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:5.
12. A host cell comprising the plasmid of claim 11.
13. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:8.
14. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 13.
15. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:7.
16. A host cell comprising the plasmid of claim 15.
17. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:10.
18. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 17 .
19. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:9.
20. A host cell comprising the plasmid of claim 19.
21. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:12.
22. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 21.
23. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:11.
24. A host cell comprising the plasmid of claim 23.
25. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:14.
26. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 25.
27. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:13.
28. A host cell comprising the plasmid of claim 27.
29. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:16.
30. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 29.
31. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:15.
32. A host cell comprising the plasmid of claim 31.
33. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:18.
34. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 33.
35. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:17.
36. A host cell comprising the plasmid of claim 35.
37. A G protein-coupled receptor encoded by an amino acid sequence of SEQ.ID.NO.:20.
38. A non-endogenous, constitutively activated version of the G protein-coupled receptor of claim 37.
39. A plasmid comprising a vector and the cDNA of SE.ID.NO.:19.
40. A host cell comprising the plasmid of claim 39.
CA002429860A 2000-11-27 2001-11-26 Endogenous and non-endogenous versions of human g protein-coupled receptors Abandoned CA2429860A1 (en)

Applications Claiming Priority (21)

Application Number Priority Date Filing Date Title
US25340400P 2000-11-27 2000-11-27
US60/253,404 2000-11-27
US25536600P 2000-12-12 2000-12-12
US60/255,366 2000-12-12
US27026601P 2001-02-20 2001-02-20
US27028601P 2001-02-20 2001-02-20
US60/270,266 2001-02-20
US60/270,286 2001-02-20
US28203201P 2001-04-06 2001-04-06
US28235601P 2001-04-06 2001-04-06
US28235801P 2001-04-06 2001-04-06
US28236501P 2001-04-06 2001-04-06
US60/282,032 2001-04-06
US60/282,365 2001-04-06
US60/282,358 2001-04-06
US60/282,356 2001-04-06
US29091701P 2001-05-14 2001-05-14
US60/290,917 2001-05-14
US30920801P 2001-07-31 2001-07-31
US60/309,208 2001-07-31
PCT/US2001/044386 WO2002042461A2 (en) 2000-11-27 2001-11-26 Endogenous and non-endogenous versions of human g protein-coupled receptors

Publications (1)

Publication Number Publication Date
CA2429860A1 true CA2429860A1 (en) 2002-05-30

Family

ID=27581193

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002429860A Abandoned CA2429860A1 (en) 2000-11-27 2001-11-26 Endogenous and non-endogenous versions of human g protein-coupled receptors

Country Status (6)

Country Link
EP (1) EP1364024A2 (en)
JP (3) JP2004533211A (en)
CN (1) CN1549858A (en)
AU (2) AU1989002A (en)
CA (1) CA2429860A1 (en)
WO (1) WO2002042461A2 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7115375B2 (en) 2000-11-14 2006-10-03 Bristol-Myers Squibb Methods of diagnosing renal tumors by determining the expression level of RNA encoding the HGPRBMY18 polypeptide
WO2002042461A2 (en) * 2000-11-27 2002-05-30 Arena Pharmaceuticals, Inc. Endogenous and non-endogenous versions of human g protein-coupled receptors
GB0101739D0 (en) * 2001-01-23 2001-03-07 Pfizer Ltd Novel polypeptide
US20030087249A1 (en) * 2001-03-01 2003-05-08 Millennium Pharmaceuticals, Inc. 93870, a human G-protein coupled receptor and uses therefor
WO2002101043A2 (en) * 2001-06-08 2002-12-19 Bayer Aktiengesellschaft Regulation of human ta4 receptor
WO2003060150A2 (en) * 2002-01-04 2003-07-24 Metabolex, Inc. Nucleic acids encoding a g-protein coupled receptor involved in islet cell signaling
US20040161799A1 (en) * 2003-02-14 2004-08-19 Murphy Andrew J. KOR3like-proteins and methods of modulating KOR3L-mediated activity
WO2004104596A2 (en) * 2003-05-22 2004-12-02 Bayer Healthcare Ag Diagnostics and therapeutics for diseases associated with igs70 (igs70)
DOP2006000008A (en) 2005-01-10 2006-08-31 Arena Pharm Inc COMBINED THERAPY FOR THE TREATMENT OF DIABETES AND RELATED AFFECTIONS AND FOR THE TREATMENT OF AFFECTIONS THAT IMPROVE THROUGH AN INCREASE IN THE BLOOD CONCENTRATION OF GLP-1
CA2605228A1 (en) 2005-04-27 2007-03-08 Arena Pharmaceuticals, Inc. Combination therapy for the treatment of obesity and diabetes and conditions related thereto and for the treatment of conditions ameliorated by increasing a blood glp-1 level
GB0520568D0 (en) * 2005-10-10 2005-11-16 Paradigm Therapeutics Ltd Receptor
NZ571110A (en) 2006-04-11 2011-03-31 Arena Pharm Inc Methods of using GPR119 receptor to identify Glucose-dependent Insulinotropic Peptide (GIP) secretagogue compounds useful for increasing bone mass in an individual
EP2146210A1 (en) 2008-04-07 2010-01-20 Arena Pharmaceuticals, Inc. Methods of using A G protein-coupled receptor to identify peptide YY (PYY) secretagogues and compounds useful in the treatment of conditions modulated by PYY
WO2013151982A1 (en) 2012-04-03 2013-10-10 Arena Pharmaceuticals, Inc. Methods and compounds useful in treating pruritus, and methods for identifying such compounds

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000000515A2 (en) * 1998-06-29 2000-01-06 Hyseq, Inc. A CHEMOKINE RECEPTOR OBTAINED FROM A cDNA LIBRARY OF FETAL LIVER-SPLEEN
CA2331886A1 (en) * 1998-06-30 2000-01-06 Maria Alexandra Glucksmann 14273 receptor, a g-protein coupled receptor
CA2342833A1 (en) * 1998-09-17 2000-03-23 Incyte Pharmaceuticals, Inc. Human gpcr proteins
DK1133559T3 (en) * 1998-11-20 2006-04-10 Arena Pharm Inc Human G protein-coupled orphan receptor RUP3
JP2001211885A (en) * 2000-02-02 2001-08-07 Kyowa Hakko Kogyo Co Ltd New polypeptide
WO2001062797A2 (en) * 2000-02-23 2001-08-30 Pharmacia & Upjohn Company G protein-coupled receptors
CA2374073A1 (en) * 2000-03-20 2001-09-27 Curagen Corporation Novel polypeptides and nucleic acids encoding same
JP2002355050A (en) * 2000-06-15 2002-12-10 Takeda Chem Ind Ltd New g protein-coupled receptor protein and its dna
JP2002355053A (en) * 2000-07-14 2002-12-10 Takeda Chem Ind Ltd New g protein-coupled receptor protein and its dna
WO2002042461A2 (en) * 2000-11-27 2002-05-30 Arena Pharmaceuticals, Inc. Endogenous and non-endogenous versions of human g protein-coupled receptors

Also Published As

Publication number Publication date
AU2002219890B2 (en) 2007-06-14
EP1364024A2 (en) 2003-11-26
JP2010166925A (en) 2010-08-05
WO2002042461A3 (en) 2003-09-12
CN1549858A (en) 2004-11-24
WO2002042461A2 (en) 2002-05-30
AU1989002A (en) 2002-06-03
JP2008154594A (en) 2008-07-10
JP2004533211A (en) 2004-11-04

Similar Documents

Publication Publication Date Title
AU2005244540B2 (en) Endogenous and non-endogenous versions of human G protein-coupled receptors
US20060240523A1 (en) Endogenous and non-endogenous versions of human G protein-coupled receptors
KR20010080882A (en) Non-Endogenous, Constitutively Activated Human G Protein-Coupled Receptors
NZ521158A (en) Non-endogenous, constitutively activated known G protein-coupled receptors
JP2008154594A (en) Endogenous version and non-endogenous version of human g protein-coupled receptor
EP1412372A2 (en) Endogenous and non-endogenous versions of human g protein-coupled receptors
US20140162358A2 (en) Endogenous and Non-Endogenous Versions of Human G Protein-Coupled Receptors
US20130165633A1 (en) Endogenous and Non-Endogenous Versions of Human G Protein-Coupled Receptors
AU2002219890A1 (en) Endogenous and non-endogenous versions of human G protein-coupled receptors
US7119190B2 (en) Endogenous and non-endogenous versions of human G protein-coupled receptors
US20100317046A1 (en) Non-Endogenous, Constitutively Activated Versions of Human G Protein Coupled Receptor: FSHR
AU2007216751A1 (en) Endogenous and non-endogenous versions of the human G protein-coupled receptor hRUP30
US20150153327A1 (en) Endogenous and Non-Endogenous Versions of Human G Protein-Coupled Receptors
AU6299199A (en) Non-endogenous, constitutively activated human g protein-coupled receptors
AU2002248497A1 (en) Endogenous and non-endogenous versions of human G protein-coupled receptors
CA2732120A1 (en) Endogenous and non-endogenous versions of human g protein-coupled receptors
EP2264068A1 (en) Non-endogenous, constitutively activated human G protein-coupled receptors

Legal Events

Date Code Title Description
EEER Examination request
FZDE Discontinued