EP0623141A1 - Selection of binding-molecules - Google Patents

Selection of binding-molecules

Info

Publication number
EP0623141A1
Authority
EP
European Patent Office
Prior art keywords
test
molecule
dna
sequence
molecules
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP93903482A
Other languages
German (de)
French (fr)
Inventor
Gregory L. Verdine
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harvard College
Original Assignee
Harvard College
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harvard College

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6811 Selection methods for production or design of target specific oligonucleotides or binding molecules
    • C CHEMISTRY; METALLURGY
    • C07 ORGANIC CHEMISTRY
    • C07K PEPTIDES
    • C07K1/00 General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
    • C07K1/14 Extraction; Separation; Purification
    • C07K1/16 Extraction; Separation; Purification by chromatography
    • C07K1/22 Affinity chromatography or related techniques based upon selective absorption processes

Definitions

  • the present invention relates to methods of designing and producing a member of a binding pair which specifically binds to its partner. It further relates to the products resulting from the methods. Such members are referred to herein as specific binding molecules. It particularly relates to designing and synthesizing molecules which specifically bind a desired target, such as a DNA sequence; these molecules are referred to as sequence-specific DNA binding molecules and are also the subject matter of the present invention.
  • Molecules such as the sequence-specific binding molecules (also referred to herein as specific binding molecules) designed by the present method can be a peptide (D-, L- or a mixture of D- and L-), a peptidomimetic, a complex carbohydrate or other oligomer of individual units or monomers which binds specifically to its binding partner (e.g., to DNA).
  • the present invention further relates to molecules, particularly sequence-specific DNA binding molecules, designed and produced by the present method and to uses therefor.
  • Specific binding molecules produced by the present method can be used in any application in which predictable or specific joining of two members of a binding pair is desired.
  • sequence-specific DNA binding molecules produced by the methods described herein are useful as gene regulatory molecules, such as molecules which mimic the tight and specific DNA binding characteristics of transcription factors, which play important roles in regulation of gene transcription by increasing or decreasing the rate of mRNA synthesis.
  • genes are regulated at the level of transcription by proteins, referred to as transcription factors, which bind promoter DNA.
  • a critical step in gene regulation by transcription factors is binding a factor to its specific, or target, DNA sequences in the promoter.
  • Sequence-specific DNA binding molecules designed and produced by the present method can be used as molecules which mimic the tight and specific DNA binding characteristics of transcription factors and, as a result, exert control over gene expression.
  • Sequence specific DNA binding molecules can be used, for example, to control (enhance or repress) gene expression in vivo and, thus, serve as the basis for development of new therapeutic strategies for treating diseases or conditions in which there is a genetic defect.
  • a sequence-specific DNA binding molecule of the present invention can be used as an artificial or synthetic transcription repressor which is designed to bind a particular promoter and inhibit transcription of the gene under its control.
  • An artificial or synthetic transcription repressor can be used to inhibit expression of a gene whose over-expression is associated with a disease or condition. Genetic diseases showing dominant inheritance, such as Huntington's disease, are promising candidates for counteraction by transcriptional inhibitors designed and produced by the method of the present invention.
  • the present method of designing and producing a sequence-specific binding molecule is exemplified herein by the method of designing and producing a sequence-specific DNA binding molecule, particularly, a sequence-specific DNA binding peptide.
  • the following steps are carried out:
  • a desired or target molecule (e.g., a desired or target DNA sequence, or molecule) is synthesized or otherwise provided, which contains a first moiety capable of forming a reversible bond with a second moiety.
  • the target DNA sequence is one for which a sequence specific binding molecule, particularly a sequence specific DNA binding peptide, is to be designed and produced.
  • the target DNA sequence is combined with a test-binding molecule, which contains a moiety capable of forming a reversible bond with the moiety present on the target sequence, such as the target DNA sequence.
  • the test-binding molecule (also referred to herein as test-molecule) comprises a unit such as an amino acid residue, to be assessed for its ability to bind to the desired DNA sequence.
  • the resulting combination of target DNA sequences and test-molecules is maintained under conditions that are appropriate for the formation of a reversible bond between the first moiety (i.e., on the DNA sequence) and the second moiety (i.e., on the test-molecule) and binding of the unit being assessed to a region of the target sequence.
  • DNA sequence-test-binding molecule complexes are formed, or produced.
  • a mixture which contains complexes of the test-molecule bound to the desired target sequence, uncomplexed target molecules and uncomplexed test-molecules.
  • a sequence-specific DNA-binding molecule e.g., a DNA binding peptide
  • the resulting mixture contains complexes, uncomplexed target DNA sequence and uncomplexed test molecules.
  • the identity of the test-molecule present in the complexes, and the order of the units comprising the test-molecule, is determined by the present method by carrying out the above-described process.
  • the process is carried out a sufficient number of times to identify a binding partner, such as a DNA binding protein, of appropriate makeup and sufficient length to bind to the target DNA and remain bound to the DNA, and subsequently determining the identity and order of the units (e.g., amino acid residues) in the binding partner produced.
  • the test-molecule includes one more unit to be assessed than the test-molecule of the previous cycle; the test-molecule in the complex which is formed also has one additional unit than the complex in the previous cycle.
  • a sequence-specific DNA binding molecule is designed and produced.
  • the moiety present on the target DNA and on the test-molecule is a thiol group
  • the reversible bond formed between the two moieties is a disulfide bond
  • the test-molecule is a peptide
  • the unit to be assessed is an amino acid residue.
  • a DNA molecule of a desired sequence which contains a thiol group attached at a specific site on the sequence is combined with a synthetic peptide which also contains a thiol group.
  • the peptide has the formula CO2H-Cys-Xaa-NH2.
  • the DNA molecule and the peptide bind, or associate, via the formation of a reversible disulfide bond, thus, forming a DNA-peptide complex.
  • a mixture of peptides can be used, all of which have the formula CO2H-Cys-Xaa-NH2 and each of which differs in the amino acid residue Xaa (Xaa can be any amino acid residue which lacks an -SH group).
  • each peptide will have a different association constant for the DNA sequence, and these differences will affect the reversibility, or reducibility, of the disulfide bond.
  • Under reversing conditions, such as subjecting the formed complexes to a thiol gradient, the peptides are released from the DNA sequence according to their DNA association constants.
  • the strength of the disulfide bond in a disulfide-linked peptide-DNA complex is directly related to the strength of the peptide-DNA association. This relationship permits screening of tight-binding peptides from a mixture of peptides. It is reasonable to expect that the peptide that remains complexed to the DNA sequence under conditions using the highest concentration of thiol binds tightest to the DNA. This screening process can be repeated in subsequent cycles with a peptide which has one additional amino acid residue, designated Xaa, in each cycle.
  • each Xaa residue can be determined by conventional methods, such as peptide sequencing or UV absorption. The order of the next residue of the peptide, resulting in the tightest binding to the DNA sequence, is determined.
  • binding molecules include oligomeric molecules in which units can be added or removed (e.g., D-, L-, or DL-peptides, peptidomimetic compounds or complex carbohydrates).
  • Molecules made by the methods of the invention can be used to regulate a wide variety of biological processes which depend on the site-specific interaction of one molecule with another molecule. Examples include processes mediated by the binding of a peptide with a nucleic acid, or of a peptide with a peptide.
  • Binding molecules which bind with a nucleic acid can be used to prevent gene activation by blocking the access of an activating factor to its sequence element, repress transcription by stabilizing duplex DNA or interfering with the transcriptional machinery, or carry out targeted DNA modification by delivering a reagent to a specific sequence.
  • Binding molecules which bind to peptides can be used to mediate, or otherwise participate in, various processes such as antibody-antigen interactions, enzyme-substrate interactions, hormone-receptor interactions, and lymphokine-receptor interactions.
  • Because the methods of the invention are chemical rather than biological, they can be used to select or discover binding molecules which are not normally synthesized by living organisms, such as peptides which include D-amino acids or nonbiogenic polymers (e.g., polymers derived from polyethylene glycol or nonnatural carbohydrates).
  • Figure 1 is a schematic representation of the reaction between a thiol-tethered oligonucleotide and a mixture of -SH-containing peptides.
  • Figure 2 is a graph of a hypothetical reduction-elution profile.
  • Figure 3 shows the components of the GCN4 binding system, including the oligonucleotides GCN4-1 (SEQ ID NO:1); GCN4-2 (SEQ ID NO:2); GCN4-3 (SEQ ID NO:3); GCN4-4 (SEQ ID NO:4) and the GCN4-derived peptide, including the disulfide tether (SEQ ID NO:5).
  • the clear boxed area indicates the location of the tethered disulfide.
  • Figure 4 shows the results of coupling the disulfide-linked GCN4 peptide (SEQ ID NO:5) with the GCN4 oligonucleotides (SEQ ID NOS:1-4) as analyzed by denaturing polyacrylamide gel electrophoresis.
  • X indicates what appears to be peptide-DNA complexes of differing mobility.
  • the present invention relates to methods of designing and producing a member of a binding pair which specifically binds to its partner as well as to the products resulting from these methods.
  • Such members are referred to herein as specific binding molecules. It particularly relates to methods of designing and synthesizing molecules which specifically bind a desired DNA sequence (i.e., sequence-specific or site-specific DNA binding molecules).
  • Specific binding molecule refers to an entity, e.g., a molecule, or a portion of a molecule, which binds to a target.
  • a specific binding molecule is susceptible to a plurality of successive or serial modifications, e.g., in the case of a polymeric molecule, the addition of monomeric units to the polymeric chain.
  • the binding affinity of a specific binding molecule with the target can be evaluated before and/or after successive modification of the specific binding molecule.
  • a specific binding molecule is capable of reversible attachment to a target, preferably via a tether.
  • Test-binding molecule refers to a specific binding molecule, some or all of the structure of which is evaluated for inclusion in the final structure of a specific binding molecule.
  • the specific binding molecule, e.g., a final full length peptide, which is the product of the entire process, can be referred to as a final or finished specific binding molecule.
  • Target refers to an entity with which a specific binding molecule binds. Methods of the invention optimize binding affinity between a target and a specific binding molecule.
  • a target can be a molecule, a portion of a molecule, or an aggregate of molecules.
  • a target and a specific binding molecule can be separate molecules, or they may be different moieties on one molecule.
  • a target includes a target site.
  • a target is capable of reversible attachment to a binding molecule via a tether.
  • targets include: nucleic acids (e.g., RNA or DNA, double stranded DNA, single stranded DNA, or supercoiled DNA), peptides or proteins (e.g., enzymes, receptors or antibodies), carbohydrates, and other molecular structures, such as nucleic acid-protein complexes, chromatin or ribosomes, lipid-bilayer containing structures, such as membranes, or structures derived from membranes, such as vesicles.
  • Target site or specific site refers to a site on a target to which a specific binding molecule binds.
  • Methods of the invention optimize binding affinity between a specific binding molecule and a target site on a target.
  • a target site will usually include a specific sequence of monomeric subunits or a three dimensional structure.
  • the actual structure (e.g., the chemical structure, or three dimensional structure) of the target site need only be known with enough particularity to allow formation of a reversible bond to the target.
  • the molecular interactions between a binding molecule and a target site are noncovalent and have energies of less than 25 kcal/mol at 25°C. These molecular interactions include hydrogen bonds, van der Waals interactions and electrostatic interactions.
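To give the energy figures above a sense of scale, the sketch below converts between a binding free energy and an association constant using the standard relation ΔG° = -RT ln K_a. Treating the quoted interaction energy as a standard free energy of binding is an assumption made here for illustration; the text does not state it, and the function names are hypothetical.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def association_constant(delta_g_kcal_per_mol, temp_c=25.0):
    """Association constant K_a for a binding free energy.

    delta_g_kcal_per_mol is negative for favorable binding.
    """
    temp_k = temp_c + 273.15
    return math.exp(-delta_g_kcal_per_mol / (R_KCAL * temp_k))


def binding_free_energy(k_a, temp_c=25.0):
    """Binding free energy (kcal/mol) for an association constant K_a."""
    temp_k = temp_c + 273.15
    return -R_KCAL * temp_k * math.log(k_a)
```

Under this assumption, a tenfold change in association constant corresponds to roughly 1.4 kcal/mol at 25°C, so differences between candidate residues can be small compared with the strength of the reversible bond itself.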
  • Aggregate of molecules refers to two or more molecules which are connected by covalent or noncovalent interactions.
  • Tether refers to a structure which includes a moiety capable of forming a reversible bond with another moiety (e.g., a moiety on another tether) and (optionally) a spacer element. Alkane chains are suitable spacer moieties.
  • Reversible bond refers to a bond linking a binding molecule and a target (i.e., a binding pair) which is thermodynamically stable but capable of being broken by a reversing agent which is a physical or chemical agent capable of breaking the bond. For any given bond an appropriate reversing agent can be readily chosen based on the chemical nature of the bond.
  • a reversing agent for a disulfide bond is a reducing agent such as thiol.
  • the reversible bond can be a bond between a tether on a specific binding molecule and a tether on a target, a bond between a tether on a specific binding molecule and a target, a bond between a specific binding molecule and a tether on a target, or a bond directly between a target and a specific binding molecule.
  • By thermodynamically stable is meant a bond whose strength is greater than 10, preferably greater than 20, more preferably greater than 50, even more preferably greater than 65, but preferably less than 100 kcal/mol at 25°C.
  • Suitable examples of reversible bonds include: R1-S-S-R2, R1-S-Cd-S-R2, and R1-S-Hg-S-R2, wherein R1 includes a binding molecule or entity and R2 includes a target and the reversible bond is within the underlined area.
  • bonds in which a metal e.g., Fe3+, Co2+, Ni2+, Cu2+, Zn2+, Cd2+, or Hg2+
  • a multidentate ligand i.e., a ligand having two (or more) moieties with which to complex an atom or group, preferably a metal atom
  • a moiety on the binding molecule can be, e.g., S, N, or an imidazole group
  • a multidentate ligand on a target wherein a moiety on the target can be S, N, or an imidazole group.
  • multidentate ligands follow: SH
  • R can be either a binding molecule or a target.
  • multidentate ligands and monodentate ligands i.e., a ligand having one moiety with which to complex a metal or other atom or group
  • a binding molecule having a multidentate ligand and a target having a multidentate ligand, a binding molecule having a multidentate ligand and a target having a monodentate ligand, or a binding molecule having a monodentate ligand and a target having a multidentate ligand can be used.
  • Methods of the invention can be used to design specific binding molecules which bind to a target site (i.e., a specific sequence) on a target molecule. These methods include an iterative process comprising successive cycles of: (1) modifying a test-binding molecule (also referred to as a test-molecule); and (2) evaluating the affinity of the modified test-binding molecule for a target site on the target molecule.
  • the evaluation includes evaluating the relative affinity of a test-binding molecule for a target site as compared with other test-binding molecules in a pool, or mixture of test-binding molecules.
  • the affinity of the test-binding molecule for the target can be determined by forming a reversible bond between the test-binding molecule and the target.
  • the susceptibility of the reversible bond to reversal is related to the affinity of the test-binding molecule for the target site on the target.
  • a number of species of test-binding molecules, representing alternative modifications of a test-binding molecule (i.e., modifications of the initial test-binding molecule or a test-binding molecule from the previous cycle of the method)
  • the structure of the species (at each cycle) which gives the optimum results is chosen to supply an element of the structure of the final specific binding molecule.
  • a moiety capable of forming a reversible bond with a moiety on the test-binding molecule is attached to target DNA molecules.
  • a sulfhydryl group is tethered by an alkane chain to a site such as a site in a major or minor groove in a DNA molecule.
  • the DNA- [C] n -SH is then attached to an immobilizing matrix.
  • the DNA-[C] n -SH molecules are then complexed, via a disulfide bond, to a mixture of synthetic peptides and placed in a chromatography column as shown in Figure 1.
  • X in Figure 1 represents the number of species of peptides in a mixture of peptides.
  • the curved line connecting the peptide to the DNA target represents the tether.
  • the vertical arrows between the peptide and the DNA target represent the specific binding molecule/target site interaction, which, preferably, is the interaction the method optimizes.
  • the synthetic peptides are all of the formula CO2H-Cys-Xaa-NH2 (where Xaa equals any amino acid residue which lacks an -SH group).
  • N or C terminal can be modified, or blocked, as in the structure HN2CO2-Cys-Xaa-NHCO2CH3, to prevent unwanted interaction between the specific binding molecule and the target.
  • Amino acids may be added at either end of the molecule.
  • the mixture of synthetic peptides includes a variety of species (i.e., a plurality of peptides of different sequences) with differences in sequences arising from various candidate residues occupying the second (Xaa) position in different peptides.
  • the candidate residues may be any moiety which lacks an -SH group and which can be incorporated into the peptide chain, including, for example, D- or L-amino acids, naturally occurring or non-naturally occurring amino acids, or α-, β-, or γ-amino acids.
  • the test-binding molecules will have different binding affinities for the target DNA sequence, and these differences will affect the reducibility of the disulfide bond between each peptide and the DNA molecule with which it is complexed.
  • passage of a thiol gradient through the peptide-DNA column results in the release of the peptides according to the susceptibility of the binding molecule-target disulfide bond to reduction (i.e., reversal).
  • Figure 2 shows a hypothetical elution profile.
  • the concentration of thiol is represented by a dashed line and the elution profile by a solid line.
  • the peak labeled A represents the species with the highest binding affinity for the target.
  • CO2H-Cys-XAA-Xaa-NH2, where XAA is the optimum second position residue and Xaa is defined as above, is cycled through the process to determine the optimum residue for the third position in the binding peptide.
  • Subsequent cycles extend the sequence of the binding peptide to the desired length.
  • the desired length can be a predetermined number of amino acid residues, or can be a length at which the binding molecule exhibits useful or optimum binding affinity and/or sequence specificity.
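The cycle-by-cycle extension described above can be read as a greedy search: at each cycle, every candidate residue is appended to the current best peptide, the pool is evaluated, and the best species becomes the starting point for the next cycle. The following is a minimal sketch under stated assumptions. The `score` callable is a hypothetical stand-in for the experimental readout (for instance, the thiol concentration at which a species elutes); the candidate set is limited here to the 19 standard residues other than cysteine, written in one-letter code; and the peptide is extended at one end only, whereas the text allows addition at either end.

```python
from typing import Callable, Sequence

# The 19 standard residues other than cysteine (Xaa must lack an -SH group).
CANDIDATES = "ADEFGHIKLMNPQRSTVWY"


def select_peptide(
    score: Callable[[str], float],
    length: int,
    candidates: Sequence[str] = CANDIDATES,
    start: str = "C",
) -> str:
    """Greedily extend a Cys-anchored peptide one residue per cycle.

    score(peptide) is a hypothetical placeholder for the measured
    affinity of that species; higher means tighter binding.
    """
    peptide = start
    while len(peptide) < length:
        # One cycle: build the pool of one-residue extensions ...
        pool = [peptide + xaa for xaa in candidates]
        # ... and keep the species that scores best.
        peptide = max(pool, key=score)
    return peptide
```

With a toy scoring function such as `lambda p: sum(a == b for a, b in zip(p, "CKRA"))`, calling `select_peptide(toy, 4)` walks through three cycles and returns the four-residue string that matches the toy optimum. In practice the stopping rule could also be a plateau in measured affinity rather than a fixed length, as the text notes.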
  • the site at which the reversible bond or tether is placed should be chosen so as to allow a specific binding molecule coupled to the target unhindered access to the target site on the target.
  • Steric hindrance imposed by the location or structure of the bond or tether(s) can interfere with the correlation between bond reversibility and binding molecule-target site affinity.
  • the inclusion of a spacer element can reduce steric hindrance.
  • an alkane of appropriate length can be used to provide both flexibility and sufficient separation between the binding molecule and the target site.
  • When a nucleic acid is the target molecule, a nucleic acid of any strandedness and of any topology can be used in methods of the invention.
  • In the case of double stranded DNA, the tether can be located in a major or minor groove close to the target sequence, but not so close as to result in steric hindrance to binding from strain on the bond between the binding peptide and the target.
  • the reversible bond or tether can be located such that either binding molecule-target interactions or binding molecule-solution interactions are favored.
  • the reversible bond or tether can be placed at or near a terminus of the molecule to favor binding molecule-solution interactions, or in the central areas (away from the termini), to favor binding molecule-target interactions.
  • a tether can be attached to DNA, or the reversible bond formed, on a base at any exocyclic amine or any vinyl carbon, such as the 5 or 6 position of pyrimidines, 8 or 2 positions of purines, at the ultimate 5' or 3' carbons, at the sugar phosphate backbone, or at internucleotide phosphorus atoms.
  • the binding molecule is conjugated to, or associated with, the target by a reversible bond.
  • the reversible bond is between a tether on the target and a tether on the specific binding molecule.
  • the tether on the binding molecule can be the same as the tether used on the target. Alternatively, different tethers can be used on each. In other embodiments only one tether is used, and in some embodiments the reversible bond is formed directly between the binding molecule and the target.
  • the tethers and the reversible bond should have the following characteristics.
  • a tether should be capable of attachment to the target without substantial alteration of the three dimensional structure of the target.
  • the reversible bond or tether-bearing-target should remain similar enough in conformation to the in vivo target so that the binding molecules generated will recognize and bind to the in vivo target with a useful affinity and site specificity.
  • the reversible bond formed between the target and the binding molecule should reversibly couple, by a covalent or ionic bond, the target to the binding molecule.
  • the susceptibility to reversal, or breakage, of the reversible bond formed between the target and the binding molecule should vary with the affinity of the binding molecule for the target site on the target.
  • the tether or tethers should be of appropriate length and flexibility such that the binding molecule has free access to the target site, and under the conditions used in methods of the invention, the reversible bond and/or tethers should be substantially unreactive with other sites on the binding molecule or target molecule.
  • Thiol groups are suitable moieties for forming a reversible bond.
  • a reversible bond e.g., a disulfide or metal-bridged disulfide bond, formed between -SH groups can be broken by contacting the bond with a reducing agent.
  • the reversible bond can be reversed with a ligand which competes with the metal atom for its position in the bridge.
  • the binding molecule is a peptide
  • the amino acid residue, cysteine is a convenient source of an -SH group for use as the binding molecule tether.
  • Alkane chains are suitable spacer moieties.
  • the reversible bond between the binding molecule and the target is disrupted with a reversing agent
  • immobilize the target molecule before exposure to the reversing agent. This can be done by attaching, or linking, the target to a matrix, such as a resin. Methods for attaching molecules to resins are known to those skilled in the art.
  • Test-binding molecules i.e., putative or candidate binding molecules
  • a derivative of the DNA binding protein GCN4 (O'Shea, E. K., et al., Science 243:538-542 (1989); Talanian, R. V., et al., Science 249:769-771 (March 1990); Talanian, R. V., et al., Biochem. 31:6871-6875 (1992)) was synthesized.
  • the GCN4-derived peptide is a monomer, comprised of 24 amino acid residues (SEQ ID NO:5).
  • the peptide was reduced, also as described in the Example, and, using the reaction conditions described in the Example, formation of the disulfide bond between the GCN4-derived peptide and the four DNA oligonucleotides was carried out. After incubation of the coupling reaction mixture, aliquots were taken and analyzed on polyacrylamide gels under denaturing or native conditions.
  • Figure 4 shows the results of the analysis of aliquots from the four reaction mixtures containing the GCN4-derived peptide and the modified DNA sequences, on a denaturing gel. In all four reaction mixtures, a disulfide-linked GCN4 peptide-DNA complex was formed, as indicated by the arrows denoting uncomplexed DNA and peptide-DNA complexes.
  • the structures of the disulfide-linked GCN4-DNA complexes were also analyzed to determine whether the peptides associated with the DNA oligonucleotides in a way that mimics their natural counterparts, or at least to discern that the peptide is bound in a sequence-specific manner.
  • Preliminary data using DNA footprinting techniques indicate that three out of the four modified DNA oligonucleotides bound the GCN4-derived peptide in the anticipated region. That is, the data is strongly suggestive that the peptide bound to three DNA sequences in a site-specific manner.
  • binding of peptides to thiol-tethered DNA via formation of a disulfide bond can be performed as follows. Peptides can be bound quantitatively to a thiol-tethered DNA molecule that is bound to a polymer resin, by formation of a disulfide bond between the DNA and the peptides. In these experiments, the object is to bind approximately 100% of the peptides to the resin-bound DNA; hence, an excess (2-10-fold mole excess based on the thiol-containing DNA strand) of resin-bound DNA, relative to moles of thiol groups (or disulfide groups) on the peptides, is used.
  • the resin-bound DNA is prepared in the reduced state by treatment with common disulfide-reducing agents (alkanethiols or borohydride compounds).
  • This incubation can be done in a batch mode or by passage of reagents through a column containing the resin-bound DNA.
  • the excess reducing agents can be removed by filtration (batch mode) or elution (column mode).
  • Charging of the peptides onto the resin can either be done in batch mode or column mode.
  • the thiol group of the peptides will first be activated by conversion to the corresponding 2-thiopyridyl or 5-thio-2-nitrobenzoyl disulfide, using standard methods.
  • the activated peptides, in deaerated buffer, pH 7-9 (for example 50 mM Tris, pH 8.0) will be incubated with the reduced DNA-bound resin either with shaking or stirring (batch mode) or with recirculation (column mode).
  • the resin-bound DNA can be prepared as the 2-thiopyridyl or 5-thio-2-nitrobenzoyl disulfide, and the reduced peptides bound as described above.
  • the binding reactions can be quantified by UV measurements, monitoring release of the pyridine-2-thione or 5-thio-2-nitrobenzoate chromophores.
  • the amount of peptides bound to the resin or free in solution can be quantified by a routine ninhydrin test.
  • the presence of free thiol groups on any material at any stage of the experiments can be monitored by alkylation with 14C-iodoacetamide.
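The UV quantification mentioned above can be sketched with the Beer-Lambert law, A = ε·c·l. The molar absorptivities below are commonly quoted literature values and are assumptions here, not figures from this document; they should be checked against the buffer and wavelength actually used. The sketch also assumes one chromophore is released per disulfide formed. All function names are hypothetical.

```python
# Assumed molar absorptivities in M^-1 cm^-1 (typical literature values,
# not taken from the source text).
EPSILON = {
    "pyridine-2-thione": 8080.0,        # read near 343 nm
    "5-thio-2-nitrobenzoate": 14150.0,  # read near 412 nm
}


def moles_released(absorbance, chromophore, volume_l, path_cm=1.0):
    """Moles of chromophore in solution from a single absorbance reading."""
    concentration = absorbance / (EPSILON[chromophore] * path_cm)  # mol/L
    return concentration * volume_l


def percent_bound(absorbance, chromophore, volume_l, moles_loaded, path_cm=1.0):
    """Percent of loaded peptide coupled, assuming 1:1 chromophore release."""
    released = moles_released(absorbance, chromophore, volume_l, path_cm)
    return 100.0 * released / moles_loaded
```

A reading taken before adding the resin-bound DNA would normally be subtracted as a blank; that correction is omitted here to keep the sketch short.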
  • Binding can be optimized by examination of % peptides bound versus method of activation (DNA-disulfide or peptide-disulfide), activating agent (2-thiopyridyl or 5-thio-2-nitrobenzoyl), binding mode (batch or column), time of incubation, temperature, and structure of the thiol-containing tether in the DNA.
  • In another embodiment, equilibrium binding of peptides to thiol-tethered DNA via formation of a disulfide bond can be performed.
  • Peptides can be bound under equilibrium conditions to a thiol-tethered DNA molecule that is bound to a polymer resin, by formation of a disulfide bond between the DNA and the peptides.
  • the disulfide bond between the DNA and peptides can be formed under freely reversible conditions, so the noncovalent interaction of the peptide with DNA will cooperate with the covalent interaction (i.e., disulfide bond formation) to establish a stable complex.
  • the thiol-tethered DNA is mixed with a stoichiometric amount of the peptides in a deaerated redox buffer.
  • the redox buffer can be the same as the redox eluent described above.
  • the most important components are the reduced and oxidized forms of a thiol reducing agent, such as 2-thiopyridine, 5-thio-2-nitrobenzoate, dithiothreitol, 2-mercaptoethanol, and N,N'-dimethyl-N,N'-bis(mercaptoacetyl)hydrazine (DMH).
  • the reactants are allowed sufficient time to reach equilibrium.
  • DNA-bound peptides are then eluted by incubation of the resin under strongly reducing conditions (such as 100 mM dithiothreitol).
  • parallel incubations should be set up and analyzed separately.
  • the following conditions can be varied to optimize the system: chemical structure of redox eluent, concentration of redox eluent, temperature, flow rate, buffer conditions (pH, ionic strength, addition of organic co-solvents such as trifluoroethanol).
  • Peptides can be quantified by amino acid analysis and sequenced by automated phenylthiohydantoin methods.
  • Binding Molecule-Target Site Binding Affinity. The affinity of a specific binding molecule for the target site on a target can be determined by evaluating the ease with which a reversible bond between the binding molecule and the target can be reversed. These determinations can be made by immobilizing the binding molecule-target complex, such as on a matrix or a resin, and passing a gradient of a reversing agent (an agent which reverses, that is, breaks or disrupts, the reversible bond and thus releases the binding molecule from the target site) over the immobilized complexes.
  • In most embodiments of the methods described herein, several species (also referred to herein as a plurality) of test-binding molecules will be screened simultaneously to determine which test-molecule possesses the optimum binding properties.
  • the elution profile allows determination and comparison of the binding affinities of various species of test-binding molecule and selection of the species which represents the optimum or desired structure for the final specific binding molecule.
  • the resin-bound peptide-DNA complexes are placed in a chromatography column.
  • a gradient of a reducing agent, e.g., a thiol reagent, is applied to the column. This results in the release of peptides according to their DNA association constants, producing a reductive elution profile. The peptide that elutes last has the highest affinity for the target DNA. This chemical screening process thus provides the optimal residue at the tested position.
  • Elution of peptides coupled to a target by a disulfide bond can be performed, either in batch or column mode, as follows. Column mode allows more precise control over the elution conditions, since the column can be attached to a commercially available gradient elution system, such as the Fast Protein Liquid Chromatograph (FPLC, Pharmacia).
  • Batch mode operation may be necessary if the conditions required for elution (e.g., high temperatures, long elution times) are incompatible or inconvenient with FPLC.
  • a redox gradient is passed through the column, causing peptides to be released depending on their redox potential.
  • the redox gradient consists of mixtures of a thiol or dithiol compound and its corresponding disulfide. In the beginning of the gradient, the redox eluent contains 100% of the disulfide form, and at the end of the gradient, 100% of the thiol (or dithiol) form.
  • Typical redox eluents consist of the thiol and disulfide forms of 2-thiopyridine, 5-thio-2-nitrobenzoate, dithiothreitol, 2-mercaptoethanol, and the N,N'-dimethyl-N,N'-bis(mercaptoacetyl)hydrazine (DMH) reagent recently reported by Whitesides (J. Org. Chem. 56:2332-2337 (1991)).
  • the latter may be preferable because of its exceptionally fast kinetics of disulfide reduction. Elution of peptides from the column is monitored by on-line UV detection at 214 nm and post-column derivatization with ninhydrin.
  • Peptides are quantified by amino acid analysis and sequenced by automated phenylthiohydantoin methods.
  • the following conditions can be varied to optimize elution for speed, ease, or resolution: chemical structure of redox eluent, concentration of redox eluent, slope of gradient, shape of gradient (linear, step, exponential), temperature, flow rate, buffer conditions (pH, ionic strength, addition of organic co-solvents such as trifluoroethanol).
  • the resin containing DNA-bound peptides is incubated in an Eppendorf tube with deoxygenated buffer containing the redox eluent.
  • Redox eluents, quantification and identification of peptides are the same as described above for the column mode.
  • the following conditions can be varied to optimize elution: chemical structure of redox eluent, concentration of redox eluent, number and spacing of stepwise elutions, elution time, temperature, buffer conditions (pH, ionic strength, addition of organic co-solvents such as trifluoroethanol).
  • a second modification can be performed on the test-binding molecule (e.g., the addition of a subsequent residue to a polymeric binding molecule) and the process of evaluating the binding affinity of the newly modified test-binding molecule repeated. This cycle may be repeated a number of times.
  • test-binding molecules representing a number of different modifications
  • a number of species (i.e., a plurality)
  • a set of tripeptides of the formula CO2H-Cys-XAA-Xaa-NH2 (where XAA is the optimum second position amino acid and Xaa represents any amino acid which lacks an -SH group), is synthesized.
  • Each peptide of the set differs at Xaa.
  • the elution and determination of binding affinity is repeated with the tripeptide to yield the optimum amino acid residue at the third position. The process is repeated until the desired length is reached.
  • modifications can be performed on the binding molecule. These modifications may be in the form of a second round of selected optimizations of a different binding molecule characteristic. For example, after an initial determination of the optimum primary sequence of a peptide, a second iterative selection can be applied to determine an optimum level of glycosylation, the effect of cofactors, the effect of homo- or heterodimerization, or the effect of inter- or intra-chain cross linking. These, or other modifications may be tested for their effect on binding by non-iterative methods as well.
  • a second iterative selection can be performed to select a second specific binding molecule to form a heterodimer with the binding molecule selected in the first iterative cycle.
  • These two specific binding molecules may be cross-linked by conventional methods. Modifications such as the formation of homo- or heterodimers, may require alteration of a selected binding molecule. For example, new peptides may be constructed to optimize the spacing of binding units relative to each other and the center of target sites in the DNA, or to allow the introduction of specifically desired residues. Molecular modeling can be used to facilitate the choice of modifications.
  • dimerized peptides can be tested by methods known to those skilled in the art (e.g., by competition electrophoretic mobility shift assays, PCR-based target detection assay, or chemical or enzymatic footprinting).
  • the X-ray crystal structures of the bacteriophage λ repressor (Jordan et al., Science 242:893 (1988)) and the murine Zif268 protein (Pavletich et al., Science 252:809 (1991)) bound to their respective DNA sites are deposited in the Brookhaven Protein Data Bank.
  • These can also be retrieved and molecular modeling methods used to trim the structures down to a peptide-bound DNA core structure, as was done with GCN4.
  • Disulfide tethers can be designed to link the resulting peptides to DNA, bearing in mind that the connector should be as short as possible without generating strain.
  • the λ repressor and Zif268 systems are favorable for optimization because they represent, respectively, examples of extended and α-helical peptides that bind DNA as isolated units and for which high-resolution structures in the DNA-bound form are available.
  • DNA-binding peptides designed on the basis of X-ray structures can be synthesized by standard methodology.
  • Thiol-tethered oligonucleotides designed similarly (“wild-type” oligonucleotides) can be synthesized by methods and linked to a resin, as described above.
  • the peptides can be tethered to DNA both in solution (for use in high-resolution structural studies) and on a solid matrix (for reductive elution studies).
  • the conditions for forming and releasing the peptide-DNA reversible bond can be optimized using these molecules, as described in the Example.
  • the structures of the DNA-tethered peptide systems constructed in the previous state can be evaluated to discern whether the peptides are associated with DNA in a way that mimics their natural counterparts, or at least in a way that is discernibly sequence-specific.
  • ¹H-NMR, ¹⁵N-NMR, chemical footprinting, and circular dichroism spectroscopy can be used to evaluate these molecules.
  • Wild-type and mutant peptide-DNA systems, assembled on a solid matrix in a column, can be subjected to reductive elution by a thiol gradient. Parameters affecting elution, such as reducing agent, temperature, pH and slope of the gradient, can be optimized. For example, this approach can be used to find conditions in which wild-type λ and Zif268 peptides are strongly retained (elute late in the gradient) while peptides from mutant systems are not strongly retained (elute early).
  • the wild-type peptides can be elongated by one peptide unit, using a mixture of any amino acids that lack an -SH group.
  • This 19-peptide mixture can then be coupled to the solid matrix, loaded into a column, and eluted reductively.
  • the late-eluting peptides will be sequenced (e.g., by fast atom bombardment mass spectrometry and/or phenylthiohydantoin degradation) . This synthesis and screening process can be repeated iteratively until either the efficiency of synthesis or resolution of the column procedure falls off.
  • Elongated peptides that are obtained by iterative selection should bind selectively to longer target DNA sequences than the starting peptides.
  • the interaction of these peptides with DNA can be studied by the same methods as described above for the starting peptides.
  • the three dimensional molecule can serve as a guide in choosing the modifications. This can allow the optimization of residues on the same face or side of a structure. For example, in the case of a binding molecule which is a helical molecule, it may be desirable to add subunits in groups of n, where n is the number of subunits involved in one full turn of the helix.
  • the desired three-dimensional structure of the binding molecule can also influence choice of modification in other ways.
  • residues which promote the formation of a helical structure, such as 2-aminoisobutyric acid or α-methyl amino acids, can be added.
  • pro-gly could be added to a sequence to interrupt a helical structure.
  • a pro-gly series can be added to a peptide sequence to introduce a fold in a β-sheet or β-ribbon structure.
  • Peptide-on-phage libraries can be used to supply the binding entities in methods of the invention.
  • a fully degenerate phage library could include all peptide test-binding entities to be tested in one batch.
  • the peptides could be coupled to the target and eluted as a batch.
  • oligonucleotides were synthesized on an Applied Biosystems DNA synthesizer Model 381A using conventional and modified phosphoramidites according to the "convertible nucleoside approach" described in MacMillan, A. M. and Verdine, G. L., J. Org. Chem. 55:5931 (1990) and Ferentz, A. E., and Verdine, G. L., J. Am. Chem. Soc. 113:4000-4002 (1991).
  • the lyophilized GCN4-derived peptide was dissolved in 0.1 ml of 1xTE8 (Tris-EDTA buffer, pH 8) and peptide concentration determined by UV spectroscopy (210 and 220 nm) was 3 mM.
  • the peptide was reduced by the addition of 1 microliter of 1:10 dilution of 2-mercaptoethanol stock (14.4M, obtained from Bio-Rad Laboratories) and incubated at 50° for 30 minutes.
  • the reaction mixture was subsequently lyophilized in the speedvac concentrator (Savant) to evaporate 2-mercaptoethanol and the dry pellet was dissolved in 0.1 ml of 10xTE8.
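
The quantities in the peptide reduction step above can be checked with a short calculation. The following is a minimal sketch, assuming that "1:10 dilution" means a ten-fold dilution of the 14.4 M stock and that volumes are additive; the variable and function names are illustrative and do not come from the text.

```python
# Worked arithmetic for the peptide reduction step.
# Values from the text: 14.4 M 2-mercaptoethanol stock, 1:10 dilution,
# 1 microliter added to 0.1 ml of 3 mM peptide.
# Assumption: "1:10" is read as a ten-fold dilution; volumes are additive.

STOCK_BME_M = 14.4         # 2-mercaptoethanol stock, mol/L
DILUTION_FACTOR = 10.0     # assumed ten-fold dilution
ADDED_VOLUME_UL = 1.0      # volume of diluted reductant added
PEPTIDE_VOLUME_UL = 100.0  # 0.1 ml of peptide solution
PEPTIDE_CONC_MM = 3.0      # peptide concentration before addition


def final_concentrations():
    """Return (reductant mM, peptide mM) after mixing."""
    total_ul = PEPTIDE_VOLUME_UL + ADDED_VOLUME_UL
    diluted_bme_mm = STOCK_BME_M / DILUTION_FACTOR * 1000.0
    bme_mm = diluted_bme_mm * ADDED_VOLUME_UL / total_ul
    peptide_mm = PEPTIDE_CONC_MM * PEPTIDE_VOLUME_UL / total_ul
    return bme_mm, peptide_mm


bme_mm, peptide_mm = final_concentrations()
print(f"2-mercaptoethanol: {bme_mm:.2f} mM")
print(f"peptide: {peptide_mm:.2f} mM")
print(f"molar excess of reductant: {bme_mm / peptide_mm:.1f}x")
```

Under these assumptions the reductant is present at roughly a five-fold molar excess over the peptide.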

Abstract

The invention relates to methods of designing and producing sequence-specific DNA binding proteins, methods of determining the affinity of a specific binding molecule for a target, and products obtained by these methods. The methods include forming a reversible bond between a specific binding molecule and the target and determining the susceptibility of that bond to reversal as a measure of the binding molecule's affinity for the target.

Description

SELECTION OF BINDING-MOLECULES
Background of the Invention
Small molecules which bind to other molecules with specific affinity are important in many biological processes. The importance of sequence-specific DNA-binding proteins in biology became apparent in the 1960s with the establishment of models for gene regulation. Because of their important roles, it would be useful to be able to design small molecules which can mimic or replace naturally-occurring molecules. However, despite considerable interest in the design and production of small binding molecules, a rational process for the design, synthesis and selection of such molecules has not yet been developed.
Summary of the Invention
The present invention relates to methods of designing and producing a member of a binding pair which specifically binds to its partner. It further relates to the products resulting from the methods. Such members are referred to herein as specific binding molecules. It particularly relates to designing and synthesizing molecules which specifically bind a desired target, such as a DNA sequence; these molecules are referred to as sequence-specific DNA binding molecules and are also the subject matter of the present invention. Molecules, such as the sequence-specific binding molecules (also referred to herein as specific binding molecules) designed by the present method can be a peptide (D-, L- or a mixture of D- and L-), a peptidomimetic, a complex carbohydrate or other oligomer of individual units or monomers which binds specifically to its binding partner (e.g., to DNA). The present invention further relates to molecules, particularly sequence-specific DNA molecules, designed and produced by the present method and to uses therefor. Specific binding molecules produced by the present method can be used in any application in which predictable or specific joining of two members of a binding pair is desired.
In one embodiment, sequence-specific DNA binding molecules produced by the methods described herein are useful as gene regulatory molecules, such as molecules which mimic the tight and specific DNA binding characteristics of transcription factors, which play important roles in regulation of gene transcription by increasing or decreasing the rate of mRNA synthesis. Most commonly, genes are regulated at the level of transcription by proteins, referred to as transcription factors, which bind promoter DNA. A critical step in gene regulation by transcription factors is binding a factor to its specific, or target, DNA sequences in the promoter. Sequence-specific DNA binding molecules designed and produced by the present method can be used as molecules which mimic the tight and specific DNA binding characteristics of transcription factors and, as a result, exert control over gene expression. Sequence-specific DNA binding molecules can be used, for example, to control (enhance or repress) gene expression in vivo and, thus, serve as the basis for development of new therapeutic strategies for treating diseases or conditions in which there is a genetic defect. For example, a sequence-specific DNA binding molecule of the present invention can be used as an artificial or synthetic transcription repressor which is designed to bind a particular promoter and inhibit transcription of the gene under its control. An artificial or synthetic transcription repressor can be used to inhibit expression of a gene whose over-expression is associated with a disease or condition. Genetic diseases showing dominant inheritance, such as Huntington's disease, are promising candidates for counteraction by transcriptional inhibitors designed and produced by the method of the present invention.
The present method of designing and producing a sequence-specific binding molecule is exemplified herein by the method of designing and producing a sequence-specific DNA binding molecule, particularly, a sequence-specific DNA binding peptide. In the present method of designing and producing a sequence-specific DNA binding peptide, the following steps are carried out:
A desired or target molecule (e.g., a desired or target DNA sequence, or molecule) is synthesized or otherwise provided, which contains a first moiety capable of forming a reversible bond with a second moiety. The target DNA sequence is one for which a sequence-specific binding molecule, particularly a sequence-specific DNA binding peptide, is to be designed and produced. The target DNA sequence is combined with a test-binding molecule, which contains a moiety capable of forming a reversible bond with the moiety present on the target sequence, such as the target DNA sequence. The test-binding molecule (also referred to herein as test-molecule) comprises a unit, such as an amino acid residue, to be assessed for its ability to bind to the desired DNA sequence. The resulting combination of target DNA sequences and test-molecules is maintained under conditions that are appropriate for the formation of a reversible bond between the first moiety (i.e., on the DNA sequence) and the second moiety (i.e., on the test-molecule) and binding of the unit being assessed to a region of the target sequence. Thus, under the appropriate conditions, DNA sequence-test-binding molecule complexes are formed, or produced.
These complexes are then subjected to conditions under which the reversible bond between the moiety on the DNA sequence and the moiety on the test-molecule is reversed (i.e., disrupted or broken). Under a set of specified conditions, if the unit of the test-molecule is bound tightly to the DNA sequence (i.e., in a site-specific manner) the test-molecule will remain bound to, or associated with, the desired DNA sequence. However, if the unit of the test-molecule is weakly bound to the DNA sequence, under the same specified conditions, the test-molecule will easily dissociate from the desired DNA sequence. Thus, a mixture is produced which contains complexes of the test-molecule bound to the desired target sequence, uncomplexed target molecules and uncomplexed test-molecules. In the case in which a sequence-specific DNA-binding molecule (e.g., a DNA binding peptide) is being produced, the resulting mixture contains complexes, uncomplexed target DNA sequence and uncomplexed test-molecules.
The identity of the test-molecule present in the complexes, and the order of the units comprising the test-molecule, is determined by the present method by carrying out the above-described process. The process is carried out a sufficient number of times to identify a binding partner, such as a DNA binding protein, of appropriate makeup and sufficient length to bind to the target DNA and remain bound to the DNA, and subsequently determining the identity and order of the units (e.g., amino acid residues) in the binding partner produced. With each subsequent cycle, the test-molecule includes one more unit to be assessed than the test-molecule of the previous cycle; the test-molecule in the complex which is formed also has one more unit than the complex in the previous cycle. Thus, following the method described herein, a sequence-specific DNA binding molecule is designed and produced.
In a preferred embodiment, the moiety present on the target DNA and on the test-molecule is a thiol group, the reversible bond formed between the two moieties is a disulfide bond, the test-molecule is a peptide and the unit to be assessed is an amino acid residue. In this embodiment, a DNA molecule of a desired sequence which contains a thiol group attached at a specific site on the sequence is combined with a synthetic peptide which also contains a thiol group. The peptide has the formula CO2H-Cys-Xaa-NH2. The DNA molecule and the peptide bind, or associate, via the formation of a reversible disulfide bond, thus forming a DNA-peptide complex.
In another embodiment, a mixture of peptides can be used, all of which have the formula CO2H-Cys-Xaa-NH2 and each of which differs in the amino acid residue Xaa (Xaa can be any amino acid residue which lacks an -SH group). In either embodiment, each peptide will have a different association constant for the DNA sequence, and these differences will affect the reversibility, or reducibility, of the disulfide bond.
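
The peptide mixture described above can be enumerated programmatically. The following is a minimal sketch, assuming Xaa is drawn only from the 20 standard amino acids (the text allows any residue lacking an -SH group, so this restriction is an assumption for illustration); the function names are hypothetical.

```python
# Enumerate the test peptides CO2H-Cys-Xaa-NH2, one per candidate Xaa.
# Assumption: candidates are the 20 standard amino acids minus cysteine,
# the only one of them that carries an -SH group.

STANDARD_AMINO_ACIDS = [
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
]


def candidate_residues():
    """Residues allowed at Xaa: every standard residue without a thiol."""
    return [aa for aa in STANDARD_AMINO_ACIDS if aa != "Cys"]


def dipeptide_set():
    """One peptide of the stated formula for each candidate Xaa."""
    return [f"CO2H-Cys-{xaa}-NH2" for xaa in candidate_residues()]


peptides = dipeptide_set()
print(len(peptides))
print(peptides[:3])
```

With this restriction the mixture contains 19 species, each differing only at Xaa.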
Under reversing conditions, such as subjecting the formed complexes to a thiol gradient, the peptides are released from the DNA sequence according to their DNA association constants. The strength of the disulfide bond in a disulfide-linked peptide-DNA complex is directly related to the strength of the peptide-DNA association. This relationship permits screening of tight-binding peptides from a mixture of peptides. It is reasonable to expect that the peptide that remains complexed to the DNA sequence under conditions using the highest concentration of thiol binds most tightly to the DNA. This screening process can be repeated in subsequent cycles with a peptide which has one additional amino acid residue, designated Xaa, in each cycle. The identification of each Xaa residue can be determined by conventional methods, such as peptide sequencing or UV absorption. The order of the next residue of the peptide, resulting in the tightest binding to the DNA sequence, is determined.
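
The selection rule in the preceding paragraph (the peptide released at the highest thiol concentration is taken as the tightest binder) can be sketched as a simple ranking. This is an illustrative sketch only: the elution values below are invented placeholders, not measurements, and the function names are hypothetical.

```python
# Rank peptides by the thiol concentration at which each was released.
# The data are invented solely to show the ranking step.

HYPOTHETICAL_ELUTION_MM = {
    "Cys-Glu": 2.0,
    "Cys-Ala": 5.0,
    "Cys-Lys": 30.0,
    "Cys-Arg": 42.0,
}


def rank_by_elution(elution_mm):
    """Return peptide names ordered from earliest to latest elution."""
    return sorted(elution_mm, key=elution_mm.get)


def tightest_binder(elution_mm):
    """The last peptide to elute is taken as the tightest binder."""
    return rank_by_elution(elution_mm)[-1]


print(rank_by_elution(HYPOTHETICAL_ELUTION_MM))
print(tightest_binder(HYPOTHETICAL_ELUTION_MM))
```

In practice the identity of each eluted species would come from sequencing the collected fractions, as the text describes.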
Thus, the method described herein is a rational method for the design, selection and production of molecules that bind in a site-specific manner to desired DNA sequences. Examples of binding molecules include oligomeric molecules in which units can be added or removed (e.g., D-, L-, or DL-peptides, peptidomimetic compounds or complex carbohydrates).
Molecules made by the methods of the invention can be used to regulate a wide variety of biological processes which depend on the site-specific interaction of one molecule with another molecule. Examples include processes mediated by the binding of a peptide with a nucleic acid, or of a peptide with a peptide. Binding molecules which bind with a nucleic acid can be used to prevent gene activation by blocking the access of an activating factor to its sequence element, repress transcription by stabilizing duplex DNA or interfering with the transcriptional machinery, or carry out targeted DNA modification by delivering a reagent to a specific sequence. Binding molecules which bind to peptides can be used to mediate, or otherwise participate in, various processes such as antibody-antigen interactions, enzyme-substrate interactions, hormone-receptor interactions, and lymphokine-receptor interactions.
Because the methods of the invention are chemical rather than biological, they can be used to select or discover binding molecules which are not normally synthesized by living organisms, such as peptides which include D-amino acids or nonbiogenic polymers (e.g., polymers derived from polyethylene glycol or nonnatural carbohydrates).
Methods of the invention described herein can be used to optimize a single or small number of modifications, such as a single or small number of positions in a polymer, at each cyclic step and thus avoid steps in which extremely large numbers of species are screened. Other advantages and features will become apparent from the following descriptions and from the claims.
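
The screening burden avoided by optimizing one position per cycle can be made concrete with a short count. The sketch below assumes 19 candidate residues per position (the standard amino acids lacking an -SH group) and compares position-by-position screening with screening every full-length sequence at once; the function names and the choice of six positions are illustrative.

```python
# Compare the number of species screened position by position with the
# number of species in a fully degenerate library of the same length.
# Assumption: 19 candidate residues at every variable position.


def iterative_species(n_positions, residues_per_position=19):
    """Total species screened when one position is optimized per cycle."""
    return n_positions * residues_per_position


def exhaustive_species(n_positions, residues_per_position=19):
    """Species in a library covering every combination at once."""
    return residues_per_position ** n_positions


n = 6
print(iterative_species(n))   # 114
print(exhaustive_species(n))  # 47045881
```

The iterative count grows linearly with peptide length, while the exhaustive count grows exponentially.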
Brief Description of the Drawings
Figure 1 is a schematic representation of the reaction between a thiol-tethered oligonucleotide and a mixture of -SH-containing peptides.
Figure 2 is a graph of a hypothetical reduction-elution profile.
Figure 3 shows the components of the GCN4 binding system, including the oligonucleotides GCN4-1 (SEQ ID NO:1); GCN4-2 (SEQ ID NO:2); GCN4-3 (SEQ ID NO:3); GCN4-4 (SEQ ID NO:4) and the GCN4-derived peptide, including the disulfide tether (SEQ ID NO:5). The clear boxed area indicates the location of the tethered disulfide.
Figure 4 shows the results of coupling the disulfide-linked GCN4 peptide (SEQ ID NO:5) with the GCN4 oligonucleotides (SEQ ID NOS:1-4) as analyzed by denaturing polyacrylamide gel electrophoresis. X indicates what appears to be peptide-DNA complexes of differing mobility.
Detailed Description of the Invention
The present invention relates to methods of designing and producing a member of a binding pair which specifically binds to its partner as well as to the products resulting from these methods. Such members are referred to herein as specific binding molecules. It particularly relates to methods of designing and synthesizing molecules which specifically bind a desired DNA sequence (i.e., sequence-specific or site-specific DNA binding molecules).
Specific binding molecule (also referred to herein as binding molecule), as used herein, refers to an entity, e.g., a molecule, or a portion of a molecule, which binds to a target. Preferably, a specific binding molecule is susceptible to a plurality of successive or serial modifications, e.g., in the case of a polymeric molecule, the addition of monomeric units to the polymeric chain. Preferably, the binding affinity of a specific binding molecule with the target can be evaluated before and/or after successive modification of the specific binding molecule. A specific binding molecule is capable of reversible attachment to a target, preferably via a tether.
Test-binding molecule (or test-molecule), as used herein, refers to a specific binding molecule, some or all of the structure of which is evaluated for inclusion in the final structure of a specific binding molecule. For example, in determining the structure of a peptide, the intermediate or candidate peptides screened for binding affinity are referred to as test-binding peptides. The specific binding molecule, e.g., a final full length peptide, which is the product of the entire process, can be referred to as a final or finished specific binding molecule.
Target, as used herein, refers to an entity with which a specific binding molecule binds. Methods of the invention optimize binding affinity between a target and a specific binding molecule. A target can be a molecule, a portion of a molecule, or an aggregate of molecules. A target and a specific binding molecule can be separate molecules, or they may be different moieties on one molecule. A target includes a target site. A target is capable of reversible attachment to a binding molecule via a tether. Examples of targets include: nucleic acids (e.g., RNA or DNA, double stranded DNA, single stranded DNA, or supercoiled DNA), peptides or proteins (e.g., enzymes, receptors or antibodies), carbohydrates, and other molecular structures, such as nucleic acid-protein complexes, chromatin or ribosomes, lipid-bilayer containing structures, such as membranes, or structures derived from membranes, such as vesicles.
Target site or specific site, as used herein, refers to a site on a target to which a specific binding molecule binds. Methods of the invention optimize binding affinity between a specific binding molecule and a target site on a target. In the case of polymeric target molecules, a target site will usually include a specific sequence of monomeric subunits or a three dimensional structure. The actual structure (e.g., the chemical structure, or three dimensional structure) of the target site need only be known with enough particularity to allow formation of a reversible bond to the target. Preferably, the molecular interactions between a binding molecule and a target site are noncovalent and have energies of less than 25 kcal/mol at 25°C. These molecular interactions include hydrogen bonds, van der Waals interactions and electrostatic interactions.
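
Interaction energies such as those quoted above are commonly related to association constants through the standard thermodynamic relation ΔG° = -RT ln(Ka). The following is a minimal sketch of that conversion, assuming the energies can be read as standard free energies of binding (an interpretive assumption that the text does not state); the function names are illustrative.

```python
import math

# Convert between a standard free energy of binding and an association
# constant using delta_G = -R * T * ln(Ka).
# Assumption: the energy is treated as a standard free energy change.

R_KCAL_PER_MOL_K = 1.987204e-3  # gas constant in kcal/(mol*K)


def association_constant(delta_g_kcal, temp_c=25.0):
    """Ka (relative to a 1 M standard state) for a given delta G."""
    temp_k = temp_c + 273.15
    return math.exp(-delta_g_kcal / (R_KCAL_PER_MOL_K * temp_k))


def free_energy(ka, temp_c=25.0):
    """Delta G in kcal/mol for a given association constant."""
    temp_k = temp_c + 273.15
    return -R_KCAL_PER_MOL_K * temp_k * math.log(ka)


# Each ten-fold gain in Ka at 25 C corresponds to about -1.36 kcal/mol.
print(f"{free_energy(10.0):.2f} kcal/mol")
```

On this reading, modest differences in noncovalent interaction energy between test peptides translate into large ratios of association constants, which is what a reductive elution gradient is meant to resolve.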
Aggregate of molecules, as used herein, refers to two or more molecules which are connected by covalent or noncovalent interactions.
Tether, as used herein, refers to a structure which includes a moiety capable of forming a reversible bond with another moiety (e.g., a moiety on another tether) and (optionally) a spacer element. Alkane chains are suitable spacer moieties.
Reversible bond, as used herein, refers to a bond linking a binding molecule and a target (i.e., a binding pair) which is thermodynamically stable but capable of being broken by a reversing agent, which is a physical or chemical agent capable of breaking the bond. For any given bond an appropriate reversing agent can be readily chosen based on the chemical nature of the bond. For example, a reversing agent for a disulfide bond is a reducing agent such as a thiol. The reversible bond can be a bond between a tether on a specific binding molecule and a tether on a target, a bond between a tether on a specific binding molecule and a target, a bond between a specific binding molecule and a tether on a target, or a bond directly between a target and a specific binding molecule. By thermodynamically stable is meant a bond whose strength is greater than 10, preferably greater than 20, more preferably greater than 50, even more preferably greater than 65, but preferably less than 100 kcal/mol at 25°C. Suitable examples of reversible bonds include: R1-S-S-R2, R1-S-Cd-S-R2, and R1-S-Hg-S-R2, wherein R1 includes a binding molecule or entity and R2 includes a target and the reversible bond is within the underlined area. Also included are bonds in which a metal (e.g., Fe3+, Co2+, Ni2+, Cu2+, Zn2+, Cd2+, or Hg2+) is complexed between a multidentate ligand (i.e., a ligand having two (or more) moieties with which to complex an atom or group, preferably a metal atom) on a binding molecule, wherein a moiety on the binding molecule can be, e.g., S, N, or an imidazole group, and, e.g., a multidentate ligand on a target, wherein a moiety on the target can be S, N, or an imidazole group. Examples of multidentate ligands follow:
[Structural formulas of the example multidentate ligands are not reproduced here. The recoverable labels are R, SH, N, and CO2H groups, and the peptide ligands R-His and R-His-Gly-Gly.]
wherein R can be either a binding molecule or a target. Any combination of multidentate ligands and monodentate ligands (i.e., a ligand having one moiety with which to complex a metal or other atom or group) can be used in the invention. For example, a binding molecule having a multidentate ligand and a target having a multidentate ligand, a binding molecule having a monodentate ligand and a target having a monodentate ligand, or a binding molecule having a monodentate ligand and a target having a multidentate ligand can be used.
Methods of the invention can be used to design specific binding molecules which bind to a target site (i.e., a specific sequence) on a target molecule. These methods include an iterative process comprising successive cycles of: (1) modifying a test-binding molecule (also referred to as a test-molecule); and (2) evaluating the affinity of the modified test-binding molecule for a target site on the target molecule. The evaluation includes evaluating the relative affinity of a test-binding molecule for a target site as compared with other test-binding molecules in a pool, or mixture, of test-binding molecules. The affinity of the test-binding molecule for the target can be determined by forming a reversible bond between the test-binding molecule and the target. The susceptibility of the reversible bond to reversal is related to the affinity of the test-binding molecule for the target site on the target. In most applications a number of species of test-binding molecules, representing alternative modifications of a test-binding molecule (i.e., modifications of the initial test-binding molecule or a test-binding molecule from the previous cycle of the method), are evaluated simultaneously at each cycle. The structure of the species (at each cycle) which gives the optimum results is chosen to supply an element of the structure of the final specific binding molecule.
Thus, application of the method described herein results in the elucidation of a preferred structure for the final binding molecule. While any molecule or combination of molecules which can be subjected to such a process can be used as a test-binding molecule, a particularly useful application of the methods described herein is the generation of DNA binding peptides.
The synthesis and identification of a peptide which can bind to a sequence-specific target site on a target DNA molecule can be performed as follows. A moiety capable of forming a reversible bond with a moiety on the test-binding molecule is attached to target DNA molecules. For example, a sulfhydryl group is tethered by an alkane chain to a site such as a site in a major or minor groove in a DNA molecule. In one embodiment, the DNA-[C]n-SH is then attached to an immobilizing matrix. The DNA-[C]n-SH molecules are then complexed, via a disulfide bond, to a mixture of synthetic peptides and placed in a chromatography column as shown in Figure 1. X in Figure 1 represents the number of species of peptides in a mixture of peptides. The curved line connecting the peptide to the DNA target represents the tether. The vertical arrows between the peptide and the DNA target represent the specific binding molecule/target site interaction, which, preferably, is the interaction the method optimizes. The synthetic peptides are all of the formula CO2H-Cys-Xaa-NH2 (where Xaa equals any amino acid residue which lacks an -SH group). Either or both of the N and C termini can be modified, or blocked, as in the structure HN2CO2-Cys-Xaa-NHCO2CH3, to prevent unwanted interaction between the specific binding molecule and the target. Amino acids may be added at either end of the molecule.
The mixture of synthetic peptides includes a variety of species (i.e., a plurality of peptides of different sequences) with differences in sequences arising from various candidate residues occupying the second (Xaa) position in different peptides. The candidate residues may be any moiety which lacks an -SH group and which can be incorporated into the peptide chain, including, for example, D- or L-amino acids, naturally occurring or non-naturally occurring amino acids, or α-, β-, or γ-amino acids.
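As an illustration of how such a mixture can be enumerated, the following minimal sketch builds the set of Cys-Xaa dipeptides. It assumes, purely for illustration, that the candidate residues are restricted to the 20 standard L-amino acids in one-letter code; the function names are hypothetical, and the description above permits a much wider range of candidate residues.

```python
# One-letter codes for the 20 standard amino acids (an illustrative
# restriction; the text also allows D-, non-natural, and beta/gamma residues).
STANDARD_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"


def candidate_residues(alphabet=STANDARD_RESIDUES):
    """Return the residues lacking a free -SH group (cysteine is excluded)."""
    return [residue for residue in alphabet if residue != "C"]


def build_library(prefix="C"):
    """Extend a fixed prefix by one variable (Xaa) position."""
    return [prefix + xaa for xaa in candidate_residues()]


dipeptides = build_library("C")
print(len(dipeptides), dipeptides[:5])
```

Under this assumption the mixture contains 19 species, one for each standard residue other than cysteine.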
The test-binding molecules will have different binding affinities for the target DNA sequence, and these differences will affect the reducibility of the disulfide bond between each peptide and the DNA molecule with which it is complexed. In one embodiment, passage of a thiol gradient through the peptide-DNA column results in the release of the peptides according to the susceptibility of the binding molecule-target disulfide bond to reduction (i.e., reversal). This results in an elution profile which reflects the differences in susceptibility to reduction and thus the differences in the target DNA binding constants between the various dipeptides and the target. The later a dipeptide elutes, the higher its binding affinity for the target DNA sequence. Inspection of the elution profile of the dipeptides allows determination of the optimal residue at the second position. Figure 2 shows a hypothetical elution profile. The concentration of thiol is represented by a dashed line and the elution profile by a solid line. The peak labeled A represents the species with the highest binding affinity for the target.
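The rule that later elution indicates higher affinity can be sketched as a simple ranking. The elution values below are invented for illustration and are not data from this disclosure; the function name is likewise hypothetical.

```python
def rank_by_elution(elution_thiol_mM):
    """Order peptides from latest-eluting (highest inferred affinity) to earliest.

    elution_thiol_mM maps each peptide to the thiol concentration (mM)
    at which its peak appears in the reductive elution profile.
    """
    return sorted(elution_thiol_mM, key=elution_thiol_mM.get, reverse=True)


# Hypothetical peak positions for four Cys-Xaa dipeptides.
profile = {"CA": 2.0, "CK": 9.5, "CR": 14.0, "CG": 1.2}

ranking = rank_by_elution(profile)
best_second_residue = ranking[0][1]  # second letter of the last-eluting dipeptide
print(ranking, best_second_residue)
```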
The entire process is repeated with a set of tripeptides. For example, CO2H-Cys-XAA-Xaa-NH2, where XAA is the optimum second position residue and Xaa is defined as above, is cycled through the process to determine the optimum residue for the third position in the binding peptide. Subsequent cycles extend the sequence of the binding peptide to the desired length. The desired length can be a predetermined number of amino acid residues, or can be a length at which the binding molecule exhibits useful or optimum binding affinity and/or sequence specificity.
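The cycle of extension and selection described above can be summarized in the following sketch. The measurement step is replaced by a toy scoring function, since in practice the value would come from the reductive elution profile; the function names, the toy "preferred" sequence, and the restriction to 19 standard residues are all assumptions made so that the loop can run.

```python
RESIDUES = "ADEFGHIKLMNPQRSTVWY"  # 19 standard residues lacking an -SH group


def mock_elution_position(peptide):
    """Hypothetical stand-in for the measured elution position of a peptide.

    A toy score counts matches against an arbitrary sequence; it has no
    chemical meaning and only makes the selection step deterministic.
    """
    preferred = "CRKNA"
    return sum(1 for a, b in zip(peptide, preferred) if a == b)


def extend_peptide(start="C", target_length=5, measure=mock_elution_position):
    """Add one residue per cycle, keeping the latest-eluting species each time."""
    peptide = start
    while len(peptide) < target_length:
        pool = [peptide + xaa for xaa in RESIDUES]  # one species per candidate
        peptide = max(pool, key=measure)            # select the optimum species
    return peptide


print(extend_peptide())
```

Each pass through the loop corresponds to one cycle of synthesis, coupling, elution, and selection.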
While the peptides are lengthened by one residue per cycle in the above example, it is also possible to perform more than one modification (e.g., to add 1, 2, 3, 4, or more residues) per cycle. When used in conjunction with conventional solid-phase peptide synthesis technology, this strategy allows the generation of DNA binding peptides of desired lengths.
Choice of the Reversible Bond or Tether Sites
The site at which the reversible bond or tether is placed (on both specific binding molecule and target) should be chosen so as to allow a specific binding molecule coupled to the target unhindered access to the target site on the target. Steric hindrance imposed by the location or structure of the bond or tether(s) can interfere with the correlation between bond reversibility and binding molecule-target site affinity. The inclusion of a spacer element can reduce steric hindrance. For example, an alkane of appropriate length can be used to provide both flexibility and sufficient separation between the binding molecule and the target site.
When a nucleic acid is the target molecule, a nucleic acid of any strandedness and of any topology can be used in methods of the invention. In the case of double stranded DNA, the tether can be located in a major or minor groove close to the target sequence, but not so close as to result in steric hindrance to binding from strain on the bond between the binding peptide and the target.
The reversible bond or tether can be located such that either binding molecule-target interactions or binding molecule-solution interactions are favored. For example, in the case of an essentially linear target, such as double stranded DNA, the reversible bond or tether can be placed at or near a terminus of the molecule to favor binding molecule-solution interactions, or in the central areas (away from the termini), to favor binding molecule-target interactions.
A tether can be attached to DNA, or the reversible bond formed, on a base at any exocyclic amine or any vinyl carbon, such as the 5 or 6 position of pyrimidines, 8 or 2 positions of purines, at the ultimate 5' or 3' carbons, at the sugar phosphate backbone, or at internucleotide phosphorus atoms.
Choice of Reversible Bonds and Tethers
In methods of the invention described herein, the binding molecule is conjugated to, or associated with, the target by a reversible bond. In some embodiments the reversible bond is between a tether on the target and a tether on the specific binding molecule. In embodiments with two tethers, the tether on the binding molecule can be the same as the tether used on the target. Alternatively, different tethers can be used on each. In other embodiments only one tether is used, and in some embodiments the reversible bond is formed directly between the binding molecule and the target. The tethers and the reversible bond should have the following characteristics. A tether (or reversible bond) should be capable of attachment to the target without substantial alteration of the three dimensional structure of the target. For example, the reversible bond or tether-bearing-target should remain similar enough in conformation to the in vivo target so that the binding molecules generated will recognize and bind to the in vivo target with a useful affinity and site specificity. Additionally, the reversible bond formed between the target and the binding molecule should reversibly couple, by a covalent or ionic bond, the target to the binding molecule. The susceptibility to reversal, or breakage, of the reversible bond formed between the target and the binding molecule should vary with the affinity of the binding molecule for the target site on the target. The tether or tethers should be of appropriate length and flexibility such that the binding molecule has free access to the target site, and under the conditions used in methods of the invention, the reversible bond and/or tethers should be substantially unreactive with other sites on the binding molecule or target molecule.
Thiol groups are suitable moieties for forming a reversible bond. A reversible bond, e.g., a disulfide or metal-bridged disulfide bond, formed between -SH groups can be broken by contacting the bond with a reducing agent. In the case of a metal bridged disulfide, the reversible bond can be reversed with a ligand which competes with the metal atom for its position in the bridge. When the binding molecule is a peptide, the amino acid residue, cysteine, is a convenient source of an -SH group for use as the binding molecule tether. Alkane chains are suitable spacer moieties.
Methods for attaching tethers to targets, such as nucleic acid molecules, are known to those skilled in the art. (MacMillan et al., Tetrahedron 47:2603-2616 (1991); MacMillan et al., J. Org. Chem. 55:5931-5933 (1990); Ferentz et al., J. Am. Chem. Soc. 113:4000-4002 (1991); Zuckermann et al., Nuc. Acid Res. 15:5305 (1987); Connolly et al., Nuc. Acid Res. 13:4485 (1985); Letsinger et al., J. Am. Chem. Soc. 103:7394-7396 (1981); Fidanza et al., J. Am. Chem. Soc. 111:9117-9119 (1989)).
In one embodiment of the method described herein, where the reversible bond between the binding molecule and the target is disrupted with a reversing agent, it is convenient to immobilize the target molecule before exposure to the reversing agent. This can be done by attaching, or linking the target to a matrix, such as a resin. Methods for attaching molecules to resins are known to those skilled in the art.
Formation of Test Binding Molecule-Target Complexes
Test-binding molecules (i.e., putative or candidate binding molecules) can be synthesized by methods known to those skilled in the art. As described in the Example, a derivative of the DNA binding protein, GCN4, (O'Shea, E. K., et al., Science 243:538-542 (1989); Talanian, R. V., et al., Science 249:769-771 (August 1990); Talanian, R. V., et al., Biochem. 31:6871-6875 (1992)) was synthesized. The GCN4-derived peptide is a monomer, comprised of 24 amino acid residues (SEQ ID NO:5).
Also as described in the Example, four modified DNA oligonucleotides, carrying a tethered disulfide at four different positions with respect to the GCN4-binding site (Figure 3, SEQ ID NOS:1-4), were synthesized using known methods (MacMillan, A. M., and Verdine, G. L., J. Org. Chem. 55:5931 (1990); Ferentz, A. E., and Verdine, G. L., J. Am. Chem. Soc. 113:4000-4002 (1991)).
The peptide was reduced, also as described in the Example, and, using the reaction conditions described in the Example, formation of the disulfide bond between the GCN4-derived peptide and the four DNA oligonucleotides was carried out. After incubation of the coupling reaction mixture, aliquots were taken and analyzed on polyacrylamide gels under denaturing or native conditions.
Figure 3 shows the results of the analysis of aliquots from the four reaction mixtures containing the GCN4-derived peptide and the modified DNA sequences, on a denaturing gel. In all four reaction mixtures, a disulfide-linked GCN4 peptide-DNA complex was formed, as indicated by the arrows denoting uncomplexed DNA and peptide-DNA complexes.
The structures of the disulfide-linked GCN4-DNA complexes were also analyzed to determine whether the peptides associated with the DNA oligonucleotides in a way that mimics their natural counterparts, or at least to discern that the peptide is bound in a sequence-specific manner. Preliminary data using DNA footprinting techniques (Galas, D. J. and Schmitz, A., Nucleic Acid Res. 5:3157-3170 (1978)) indicate that three out of the four modified DNA oligonucleotides bound the GCN4-derived peptide in the anticipated region. That is, the data strongly suggest that the peptide bound to three DNA sequences in a site-specific manner.
In one embodiment, binding of peptides to thiol-tethered DNA via formation of a disulfide bond can be performed as follows. Peptides can be bound quantitatively to a thiol-tethered DNA molecule that is bound to a polymer resin, by formation of a disulfide bond between the DNA and the peptides. In these experiments, the object is to bind approximately 100% of the peptides to the resin-bound DNA; hence, an excess (2-10-fold mole excess based on the thiol-containing DNA strand) of resin-bound DNA, relative to moles of thiol groups (or disulfide groups) on the peptides, is used.
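The 2-10-fold mole excess stated above translates into a simple range of resin-bound DNA amounts. The sketch below is only a bookkeeping aid; the function name and the 100 pmol example quantity are hypothetical.

```python
def resin_dna_range_pmol(peptide_thiol_pmol, low_fold=2, high_fold=10):
    """Return the (minimum, maximum) pmol of resin-bound thiol-DNA strand
    corresponding to a low_fold to high_fold mole excess over peptide thiols."""
    return (peptide_thiol_pmol * low_fold, peptide_thiol_pmol * high_fold)


# Example: 100 pmol of peptide thiol groups to be bound quantitatively.
print(resin_dna_range_pmol(100))
```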
The resin-bound DNA is prepared in the reduced state by treatment with common disulfide-reducing agents (alkanethiols or borohydride compounds). This incubation can be done in a batch mode or by passage of reagents through a column containing the resin-bound DNA. The excess reducing agents can be removed by filtration (batch mode) or elution (column mode).
Charging of the peptides onto the resin can be done either in batch mode or in column mode. In either case, the thiol group of the peptides will first be activated by conversion to the corresponding 2-thiopyridyl or 5-thio-2-nitrobenzoyl disulfide, using standard methods. The activated peptides, in deaerated buffer, pH 7-9 (for example 50 mM Tris, pH 8.0), will be incubated with the reduced DNA-bound resin either with shaking or stirring (batch mode) or with recirculation (column mode). Alternatively, the resin-bound DNA can be prepared as the 2-thiopyridyl or 5-thio-2-nitrobenzoyl disulfide, and the reduced peptides bound as described above.
The binding reactions can be quantified by UV measurements, monitoring release of the pyridine-2-thione or 5-thio-2-nitrobenzoate chromophores. Alternatively, the amount of peptides bound to the resin or free in solution can be quantified by a routine ninhydrin test. The presence of free thiol groups on any material at any stage of the experiments can be monitored by alkylation with 14C-iodoacetamide.
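The UV quantification mentioned above can be sketched with the Beer-Lambert relation, A = ε·c·l. The wavelengths and molar extinction coefficients below are commonly cited literature values and are not given in this disclosure; they should be treated as assumptions and checked for the buffer actually used. The function name is hypothetical.

```python
# Assumed (wavelength in nm, molar extinction coefficient in M^-1 cm^-1);
# commonly cited values, not taken from this document.
EXTINCTION = {
    "pyridine-2-thione": (343, 8080.0),
    "5-thio-2-nitrobenzoate": (412, 14150.0),
}


def released_nmol(chromophore, absorbance, volume_ml, path_cm=1.0):
    """Convert an absorbance reading into nmol of released chromophore.

    One chromophore is released per disulfide formed, so this value is
    also an estimate of the nmol of peptide coupled.
    """
    _, epsilon = EXTINCTION[chromophore]
    molar = absorbance / (epsilon * path_cm)  # concentration in mol/L
    return molar * volume_ml * 1e-3 * 1e9     # mol/L * L -> mol -> nmol


# Example: A343 = 0.404 in 1.0 ml with a 1 cm path length.
print(released_nmol("pyridine-2-thione", 0.404, 1.0))
```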
Binding can be optimized by examination of % peptides bound versus method of activation (DNA-disulfide or peptide-disulfide), activating agent (2-thiopyridyl or 5-thio-2-nitrobenzoyl), binding mode (batch or column), time of incubation, temperature, and structure of the thiol-containing tether in the DNA.
In another embodiment, equilibrium binding of peptides to thiol-tethered DNA via formation of a disulfide bond can be performed. Peptides can be bound under equilibrium conditions to a thiol-tethered DNA molecule that is bound to a polymer resin, by formation of a disulfide bond between the DNA and the peptides. The disulfide bond between the DNA and peptides can be formed under freely reversible conditions, so the noncovalent interaction of the peptide with DNA will cooperate with the covalent interaction (i.e., disulfide bond formation) to establish a stable complex. These experiments can be carried out in a batch mode.
The thiol-tethered DNA is mixed with a stoichiometric amount of the peptides in a deaerated redox buffer. The redox buffer can be the same as the redox eluent described herein. The most important components are the reduced and oxidized forms of a thiol reducing agent, such as 2-thiopyridine, 5-thio-2-nitrobenzoate, dithiothreitol, 2-mercaptoethanol, and N,N'-dimethyl-N,N'-bis(mercaptoacetyl)hydrazine (DMH). The reactants are allowed sufficient time to reach equilibrium. Alternatively, if the DNA is resin-bound, then the resin is pelleted by centrifugation, and the supernatant is removed. The pellet is washed with buffer (lacking added thiols or disulfides) and pelleted again. DNA-bound peptides are then eluted by incubation of the resin under strongly reducing conditions (such as 100 mM dithiothreitol). Ordinarily, parallel incubations (containing different relative amounts of the reduced and oxidized forms of the thiol reducing agent) should be set up and analyzed separately.
The following conditions can be varied to optimize the system: chemical structure of redox eluent, concentration of redox eluent, temperature, flow rate, buffer conditions (pH, ionic strength, addition of organic co-solvents such as trifluoroethanol).
Peptides can be quantified by amino acid analysis and sequenced by automated phenylthiohydantoin methods.
Determination of Binding Molecule-Target Site Binding Affinity
The affinity of a specific binding molecule for the target site on a target can be determined by evaluating the ease with which a reversible bond between the binding molecule and the target can be reversed. These determinations can be made by immobilizing the binding molecule-target complex, such as on a matrix or a resin, and passing a gradient of a reversing agent (an agent which reverses, that is, breaks or disrupts, the reversible bond and thus releases the binding molecule from the target site) over the immobilized complexes.
In most embodiments of the methods described herein, several species (also referred to herein as a plurality) of test-binding molecules will be screened simultaneously to determine which test-molecule possesses the optimum binding properties. The elution profile allows determination and comparison of the binding affinities of various species of test-binding molecule and selection of the species which represents the optimum or desired structure for the final specific binding molecule.
In the case of a peptide binding molecule complexed to a DNA target molecule by a disulfide bond, the resin-bound peptide-DNA complexes are placed in a chromatography column. A gradient of a reducing agent, e.g., a thiol reagent, is applied to the column. This results in the release of peptides according to their DNA association constants, producing a reductive elution profile. The peptide that elutes last has the highest affinity for the target DNA. This chemical screening process thus provides the optimal residue at the tested position.
Elution of peptides coupled to a target by a disulfide bond can be performed, either in batch or column mode, as follows. Column mode allows more precise control over the elution conditions, since the column can be attached to a commercially available gradient elution system, such as the Fast Protein Liquid Chromatograph (FPLC, Pharmacia) or any similar apparatus. Batch mode operation may be necessary if the conditions required for elution (e.g., high temperatures, long elution times) are incompatible or inconvenient with FPLC.
In the column mode, a redox gradient is passed through the column, causing peptides to be released depending on their redox potential. In the simplest case, the redox gradient consists of mixtures of a thiol or dithiol compound and its corresponding disulfide. In the beginning of the gradient, the redox eluent contains 100% of the disulfide form, and at the end of the gradient, 100% of the thiol (or dithiol) form. Typical redox eluents consist of the thiol and disulfide forms of 2-thiopyridine, 5-thio-2-nitrobenzoate, dithiothreitol, 2-mercaptoethanol, and the N,N'-dimethyl-N,N'-bis(mercaptoacetyl)hydrazine (DMH) reagent recently reported by Whitesides (J. Org. Chem. 56:2332-2337 (1991)). The latter may be preferable because of its exceptionally fast kinetics of disulfide reduction.
Elution of peptides from the column is monitored by on-line UV detection at 214 nm and post-column derivatization with ninhydrin. Peptides are quantified by amino acid analysis and sequenced by automated phenylthiohydantoin methods.
The following conditions can be varied to optimize elution for speed, ease, or resolution: chemical structure of redox eluent, concentration of redox eluent, slope of gradient, shape of gradient (linear, step, exponential), temperature, flow rate, buffer conditions (pH, ionic strength, addition of organic co-solvents such as trifluoroethanol).
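The redox gradient described above, running from 100% disulfide form to 100% thiol form with a linear, step, or exponential shape, can be sketched as a function of normalized elution time. The parameterization (number of steps, exponential constant k) and the function names are illustrative assumptions, not values from this disclosure.

```python
import math


def thiol_fraction(t, shape="linear", steps=5, k=4.0):
    """Fraction of the redox eluent in the thiol form at normalized time t.

    t runs from 0.0 (start of gradient, all disulfide) to 1.0 (end of
    gradient, all thiol). steps and k are illustrative shape parameters.
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie between 0 and 1")
    if shape == "linear":
        return t
    if shape == "step":
        return math.ceil(t * steps) / steps
    if shape == "exponential":
        return (math.exp(k * t) - 1.0) / (math.exp(k) - 1.0)
    raise ValueError("unknown gradient shape: " + shape)


def eluent_percentages(t, shape="linear"):
    """Return (% thiol form, % disulfide form) of the eluent at time t."""
    f = thiol_fraction(t, shape)
    return 100.0 * f, 100.0 * (1.0 - f)


for shape in ("linear", "step", "exponential"):
    print(shape, [round(thiol_fraction(t / 4, shape), 3) for t in range(5)])
```

An exponential shape keeps the eluent weakly reducing for most of the run, which may help resolve early-eluting species; whether that is useful for a given system would have to be determined empirically.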
In the batch mode, the resin containing DNA-bound peptides is incubated in an Eppendorf tube with deoxygenated buffer containing the redox eluent. Redox eluents, quantification and identification of peptides are the same as described above for the column mode. The following conditions can be varied to optimize elution: chemical structure of redox eluent, concentration of redox eluent, number and spacing of stepwise elutions, elution time, temperature, buffer conditions (pH, ionic strength, addition of organic co-solvents such as trifluoroethanol).
After the determination of a first optimum modification (i.e., the determination of the optimum residue at a given position of a specific binding molecule) has been made, a second modification can be performed on the test-binding molecule (e.g., the addition of a subsequent residue to a polymeric binding molecule) and the process of evaluating the binding affinity of the newly modified test-binding molecule repeated. This cycle may be repeated a number of times.
As in the first cycle, it will usually be desirable to simultaneously evaluate a number of species (i.e., a plurality) of test-binding molecules (representing a number of different modifications) at each cycle or iteration. For example, in the case of a peptide binding molecule, a plurality of peptide species, differing by the residue at the position (or positions) being optimized, are tested simultaneously. The structure (e.g., in the case of a peptide binding molecule, the particular residue) giving optimum results is selected.
In the case of a peptide binding molecule, a DNA target molecule, and -SH tethers, the following protocol can be used. After the optimum amino acid residue at the second position is determined, a set of tripeptides of the formula CO2H-Cys-XAA-Xaa-NH2 (where XAA is the optimum second position amino acid and Xaa represents any amino acid which lacks an -SH group) is synthesized. Each peptide of the set differs at Xaa. The elution and determination of binding affinity are repeated with the tripeptides to yield the optimum amino acid residue at the third position. The process is repeated until the desired length is reached.
After the iterative methods of synthesis and selection described above have been used to generate the sequence order and structure of a binding molecule, further modifications can be performed on the binding molecule. These modifications may be in the form of a second round of selected optimizations of a different binding molecule characteristic. For example, after an initial determination of the optimum primary sequence of a peptide, a second iterative selection can be applied to determine an optimum level of glycosylation, the effect of cofactors, the effect of homo- or heterodimerization, or the effect of inter- or intra-chain cross linking. These or other modifications may be tested for their effect on binding by non-iterative methods as well. Additionally, a second iterative selection can be performed to select a second specific binding molecule to form a heterodimer with the binding molecule selected in the first iterative cycle. These two specific binding molecules may be cross-linked by conventional methods. Modifications such as the formation of homo- or heterodimers may require alteration of a selected binding molecule.
For example, new peptides may be constructed to optimize the spacing of binding units relative to each other and the center of target sites in the DNA, or to allow the introduction of specifically desired residues. Molecular modeling can be used to facilitate the choice of modifications. The sequence specificity of dimerized peptides can be tested by methods known to those skilled in the art (e.g., by competition electrophoretic mobility shift assays, PCR-based target detection assay, or chemical or enzymatic footprinting).
Optimization of Conditions for Determining Binding Affinity
General conditions under which the reversible bond between the binding molecule and the target is formed and broken, and the methods of evaluation of the relationship between reversible bond breakage and binding molecule/target site binding affinity, can be determined by practicing the methods described above with relatively well characterized molecules, as is exemplified in the Example with the GCN4 system.
In addition to the GCN4 system, the X-ray crystal structures of the bacteriophage λ repressor (Jordan et al., Science 242:893 (1988)) and the murine Zif268 protein (Pavletich et al., Science 252:809 (1991)) bound to their respective DNA sites are deposited in the Brookhaven Protein Data Bank. These can also be retrieved and molecular modeling methods used to trim the structures down to a peptide-bound DNA core structure, as was done with GCN4. Disulfide tethers can be designed to link the resulting peptides to DNA, bearing in mind that the connector should be as short as possible without generating strain. The λ repressor and Zif268 systems are favorable for optimization because they represent, respectively, examples of extended and α-helical peptides that bind DNA as isolated units and for which high-resolution structures in the DNA-bound form are available. The α-helices of Zif268, while being part of a zinc finger structural motif, possess all of the residues of that motif that are involved in base-contacts.
DNA-binding peptides designed on the basis of X-ray structures (hereafter referred to as "wild-type" peptides) can be synthesized by standard methodology. Thiol-tethered oligonucleotides designed similarly ("wild-type" oligonucleotides) can be synthesized and linked to a resin, as described above. The peptides can be tethered to DNA both in solution (for use in high-resolution structural studies) and on a solid matrix (for reductive elution studies). The conditions for forming and releasing the peptide-DNA reversible bond can be optimized using these molecules, as described in the Example. Systems having sequence changes in the DNA or peptide ("mutant" oligonucleotides or peptides) that should disrupt sequence-specific peptide-DNA interactions can be synthesized in parallel for use as controls or to further investigate elution conditions.
The structures of the DNA-tethered peptide systems constructed in the previous step can be evaluated to discern whether the peptides are associated with DNA in a way that mimics their natural counterparts, or at least in a way that is discernibly sequence-specific. 1H-NMR, 15N-NMR, chemical footprinting, and circular dichroism spectroscopy can be used to evaluate these molecules.
Wild-type and mutant peptide-DNA systems, assembled on a solid matrix in a column, can be subjected to reductive elution by a thiol gradient. Parameters affecting elution, such as reducing agent, temperature, pH and slope of the gradient, can be optimized. For example, this approach can be used to find conditions in which wild-type λ and Zif268 peptides are strongly retained (elute late in the gradient) while peptides from mutant systems are not strongly retained (elute early).
Following optimization of the reductive elution conditions for the elongation of wild-type peptides, screening of peptide mixtures can be optimized. The wild-type peptides can be elongated by one peptide unit, using a mixture of any amino acids that lack an -SH group. This 19-peptide mixture can then be coupled to the solid matrix, loaded into a column, and eluted reductively. The late-eluting peptides will be sequenced (e.g., by fast atom bombardment mass spectrometry and/or phenylthiohydantoin degradation). This synthesis and screening process can be repeated iteratively until either the efficiency of synthesis or the resolution of the column procedure falls off.
Elongated peptides that are obtained by iterative selection should bind selectively to longer target DNA sequences than the starting peptides. The interaction of these peptides with DNA can be studied by the same methods as described above for the starting peptides.
Moreover, the three dimensional structure of the molecule can serve as a guide in choosing the modifications. This can allow the optimization of residues on the same face or side of a structure. For example, in the case of a binding molecule which is a helical molecule, it may be desirable to add subunits in groups of n, where n is the number of subunits involved in one full turn of the helix. In the case of an α-helical protein, wherein n = 3.6, residues could be added in groups of 3, with the first two of the three being held constant (e.g., the first two residues being predetermined residues), or in groups of 4, with the first three of the four being held constant (e.g., consisting of predetermined residues), with the final residue, in either case, being varied.
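The reasoning behind adding residues in groups of 3 or 4 can be illustrated with the periodicity of an ideal α-helix, in which 3.6 residues per turn corresponds to a rotation of 100° per residue. The sketch below lists the sequence offsets that fall on roughly the same face as a reference residue; the 60° tolerance and the function names are assumptions chosen for illustration.

```python
DEG_PER_RESIDUE = 100  # 360 degrees per turn / 3.6 residues per turn


def angular_distance(offset):
    """Smallest angle (degrees) between residue i and residue i + offset
    when both are projected onto the helical wheel of an ideal alpha-helix."""
    angle = (offset * DEG_PER_RESIDUE) % 360
    return min(angle, 360 - angle)


def same_face_offsets(max_offset, tolerance_deg=60):
    """Offsets whose side chains point within tolerance_deg of residue i."""
    return [j for j in range(1, max_offset + 1)
            if angular_distance(j) <= tolerance_deg]


print(same_face_offsets(8))
```

With this tolerance the same-face offsets up to eight residues are 3, 4, and 7, which is consistent with varying only the last residue of each group of 3 or 4.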
An analogous method can be used to optimize the residues on one face of a β-sheet or β-ribbon structure. Since residues i, i + 2, i + 4, i + x, will be on the same surface of a β-ribbon or a β-sheet structure, residues can be added as a tripeptide, with the final residue of the peptide being varied.
The desired three-dimensional structure of the binding molecule can also influence choice of modification in other ways. For example, in the case of a peptide, residues which promote the formation of a helical structure, such as 2-aminoisobutyric acid or α-methyl amino acids, can be added. Similarly, pro-gly could be added to a sequence to interrupt a helical structure. A pro-gly series can be added to a peptide sequence to introduce a fold in a β-sheet or β-ribbon structure.
Peptide-on-phage libraries can be used to supply the binding entities in methods of the invention. For example, a fully degenerate phage library could include all peptide test-binding entities to be tested in one batch. The peptides could be coupled to the target and eluted as a batch.
The invention will now be illustrated further and more specifically by the following Exemplification.
Example: Formation of Disulfide-linked-peptide-DNA Complexes
1. Synthesis and purification of peptides
All GCN4-derived peptides were synthesized on an Applied Biosystems Model 431A peptide synthesizer with standard reaction cycles. Peptides were deprotected and cleaved from the resin by incubation in a mixture of trifluoroacetic acid:phenol:anisole:ethanedithiol (94:2:2:2) for 4 hours at room temperature. The peptide solution was precipitated and washed 4-5 times with ice-cold diethyl ether. The pellet was dried with air, dissolved in 1 ml of 10% acetic acid and lyophilized. The peptide was purified by HPLC with a ZORBAX reverse-phase C-8 semi-preparative column (DuPont Instruments) and a linear gradient of acetonitrile-water with 0.1% TFA.
Fast atom bombardment mass spectroscopy revealed a peak at 2613.07 which agrees with the calculated mass of 2611.97. Collected fractions were lyophilized and stored at -20°C.
2. Synthesis and purification of DNA oligonucleotides
All oligonucleotides were synthesized on an Applied Biosystems DNA synthesizer Model 381A using conventional and modified phosphoramidites according to the "convertible nucleoside approach" described in MacMillan, A. M. and Verdine, G. L., J. Org. Chem. 55:5931 (1990) and Ferentz, A. E. and Verdine, G. L., J. Am. Chem. Soc. 113:4000-4002 (1991). The displacement reaction was done with the disulfide of aminepropanethiol to yield modified oligonucleotides with N6-thioalkyl-dA or N4-thioalkyl-dC, protected as mixed disulfides. Both modified and unmodified oligonucleotides were purified by polyacrylamide gel electrophoresis (PAGE) on 20% denaturing gels. Annealing of different modified oligonucleotides with the corresponding complementary strands produced four double-stranded probes carrying the tethered disulfide at four different positions with respect to the GCN4-binding half-site (Figure 2; GCN4-binding half-site shaded in gray).
3. Reduction of peptides
The lyophilized GCN4-derived peptide was dissolved in 0.1 ml of 1xTE8 (Tris-EDTA buffer, pH 8); the peptide concentration, determined by UV spectroscopy (210 and 220 nm), was 3 mM. The peptide was reduced by the addition of 1 microliter of a 1:10 dilution of 2-mercaptoethanol stock (14.4 M, obtained from Bio-Rad Laboratories) and incubated at 50°C for 30 minutes. The reaction mixture was subsequently lyophilized in the speedvac concentrator (Savant) to evaporate the 2-mercaptoethanol, and the dry pellet was dissolved in 0.1 ml of 10xTE8.
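The quantities in the reduction step imply a modest molar excess of reductant over peptide. The short Python calculation below works this out from the figures quoted above; it assumes that "1:10 dilution" means one-tenth of the stock concentration (1.44 M), and the variable names are illustrative only.

```python
# Amounts implied by the reduction step, using only figures quoted in the text.
PEPTIDE_CONC_M = 3e-3      # 3 mM peptide
PEPTIDE_VOLUME_L = 0.1e-3  # 0.1 ml
BME_STOCK_M = 14.4         # 2-mercaptoethanol stock concentration
BME_DILUTION = 10          # assumption: 1:10 means one-tenth of stock
BME_VOLUME_L = 1e-6        # 1 microliter added


def moles(concentration_m: float, volume_l: float) -> float:
    """Amount of substance (mol) from molar concentration and volume in liters."""
    return concentration_m * volume_l


peptide_nmol = moles(PEPTIDE_CONC_M, PEPTIDE_VOLUME_L) * 1e9
bme_nmol = moles(BME_STOCK_M / BME_DILUTION, BME_VOLUME_L) * 1e9
molar_excess = bme_nmol / peptide_nmol

if __name__ == "__main__":
    print(f"peptide: {peptide_nmol:.0f} nmol")
    print(f"2-mercaptoethanol: {bme_nmol:.0f} nmol")
    print(f"molar excess of reductant: {molar_excess:.1f}x")
```

On these assumptions the reaction contains about 300 nmol of peptide and about 1440 nmol of 2-mercaptoethanol, roughly a five-fold molar excess of reductant.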
4. Coupling reaction and the analysis of results
The disulfide bond between the peptide and DNA was formed by mixing 5-10 pmol (20-80K CPM) of the 32P end-labeled double-stranded DNA probe with different amounts (5 pmol-5 nmol) of reduced GCN4-derived peptide in a buffer containing 50 mM KCl, 20 mM Tris pH 7.5 and 10% glycerol. The coupling reaction mixture (20 microliters) was incubated at room temperature for 8-48 hours. Aliquots (2-4K CPM) from each reaction were analyzed on denaturing (Figure 3) or native 20% acrylamide gels, and by DNA footprinting.
Equivalents
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following claims.

Claims

The invention claimed is:
1. A method of designing and producing a specific binding molecule, comprising the steps of:
a) combining: 1) a desired target containing a first moiety capable of forming a reversible bond with a second moiety; and 2) a test-molecule comprising a unit to be assessed for its ability to bind a region of the desired target and containing the second moiety, thereby producing a combination;
b) maintaining the combination produced in (a) under conditions appropriate for formation of a reversible bond between the first moiety and the second moiety, and binding of the unit to be assessed with a region of the desired target and the test-molecule, thereby producing desired target - test-molecule complexes;
c) subjecting complexes produced in (b) to conditions which result in reversal of the reversible bond, thereby producing a mixture which contains complexes, uncomplexed desired target molecules, and test-molecules;
d) determining the identity and order of test-molecules present in the complexes; and
e) repeating steps a) through d) in a series of cycles, wherein in each subsequent cycle, test-molecules in step (a) comprise one unit more than in the preceding cycle and the test-molecules in complexes formed in step (b) comprise one unit more than test-molecules present in complexes formed in step (b) of the preceding cycle.
2. A specific binding molecule produced by the method of Claim 1.
3. A method of designing and producing a sequence-specific DNA binding molecule, comprising the steps of:
a) combining: 1) a desired DNA sequence containing a first moiety capable of forming a reversible bond with a second moiety; and 2) a test-molecule comprising a unit to be assessed for its ability to bind a region of the desired DNA sequence and containing the second moiety, thereby producing a combination;
b) maintaining the combination produced in (a) under conditions appropriate for formation of a reversible bond between the first moiety and the second moiety, and binding of the unit to be assessed with a region of the desired DNA sequence and the test-molecule, thereby producing desired DNA sequence - test-molecule complexes;
c) subjecting complexes produced in (b) to conditions which result in reversal of the reversible bond, thereby producing a mixture which contains complexes, uncomplexed target DNA sequences, and uncomplexed test-molecules;
d) determining the identity and order of test-molecules present in the complexes; and
e) repeating steps a) through d) in a series of cycles, wherein in each subsequent cycle, test-molecules in step (a) comprise one unit more than in the preceding cycle and the test-molecules in complexes formed in step (b) comprise one unit more than test-molecules present in complexes formed in step (b) of the preceding cycle.
4. A sequence-specific DNA binding molecule produced by the method of Claim 3.
5. A method of Claim 2 wherein the test-molecule of step a) is a peptide and the unit to be assessed is an amino acid residue.
6. A sequence-specific DNA binding molecule produced by the method of Claim 5.
7. A method of Claim 5 wherein the reversible bond of step b) is a disulfide bond formed between an -SH group on the test-molecule and an -SH group on the desired DNA sequence.
8. A sequence-specific DNA binding molecule produced by the method of Claim 7.
9. A method of Claim 3 wherein step c) further comprises subjecting complexes to a reversing agent.
10. A method of Claim 5 wherein the reversing agent is a reducing agent.
11. A sequence-specific DNA binding molecule produced by the method of Claim 10.
12. The method of Claim 3, wherein the desired DNA sequence comprises a DNA molecule comprising an -SH group, the test-molecule comprises an -SH group, the reversible bond formed between the -SH groups is a disulfide bond and the reversing conditions comprise subjecting complexes to a reducing agent to break the disulfide bond.
13. A sequence-specific DNA binding molecule produced by the method of Claim 12.
14. The method of Claim 12, further comprising attaching the DNA molecule to an immobilizing matrix, and wherein subjecting complexes to the reducing agent comprises contacting the complex with a concentration gradient of the reducing agent, and determining the ability of the reducing agent to disrupt the disulfide bond comprises determining the ability of the reducing agent to elute the test-molecule from the immobilized DNA.
15. The method of Claim 14, wherein the test-molecule comprises a peptide comprising a first and second subunit, the first subunit comprises a first amino acid residue comprising an -SH group and the second subunit comprises a second amino acid residue which does not contain an -SH group.
16. The method of Claim 15, wherein the first subunit comprises cysteine.
17. The method of Claim 12, wherein
step a) further comprises providing a plurality of test-molecules comprising a plurality of sequences, each of the test-molecules comprising a first subunit comprising an -SH group and a second subunit which does not contain an -SH group,
step b) further comprises maintaining a plurality of the test-molecules with a plurality of the DNA molecules to form a plurality of complexes, each of the complexes comprising a test-molecule linked by a disulfide bond to a DNA molecule,
step c) further comprises subjecting a plurality of the complexes to a reducing agent to break the disulfide bonds; and
step d) further comprises determining the susceptibility of the bonds to the reducing agent as an inverse measure of the ability of a test-molecule to bind to the DNA molecule, the sequence of the test-molecule comprising the sequence of the test-molecule of the complex with the disulfide bond most resistant to breakage by the reducing agent.
18. The method of Claim 3, wherein the test-molecule is of a predetermined length and the method further comprises comparing the length of the sequence generated in step (d) with the predetermined length and if the desired length has not been reached, then adding another subunit to the subsequent test-molecule and repeating steps (a) through (d).
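The cyclic procedure recited in Claims 1, 3, 17 and 18 can be pictured as a loop that lengthens the test-molecule by one unit per cycle. The Python sketch below is only an illustration under simplifying assumptions: it keeps a single best candidate per cycle, and the `resistance` callable is a hypothetical stand-in for the experimental readout (for example, how much reducing agent is needed to release the test-molecule). None of the names come from the specification.

```python
from typing import Callable, Sequence


def select_binder(
    units: Sequence[str],
    target_length: int,
    resistance: Callable[[str], float],
) -> str:
    """Grow a test-molecule one unit per cycle, keeping the best candidate.

    `resistance` is a hypothetical stand-in for the measurement; a higher
    value is taken to mean a bond more resistant to reversal, i.e. tighter
    binding.
    """
    selected = ""
    while len(selected) < target_length:  # length check, as in Claim 18
        candidates = [selected + unit for unit in units]  # step (a)
        selected = max(candidates, key=resistance)        # steps (b)-(d)
    return selected


if __name__ == "__main__":
    # Toy readout: count positions matching an arbitrary made-up sequence.
    made_up_optimum = "KRA"

    def toy_resistance(candidate: str) -> float:
        return sum(a == b for a, b in zip(candidate, made_up_optimum))

    print(select_binder("AKR", 3, toy_resistance))
```

Keeping only the top candidate in each cycle is a design simplification; a procedure that carried several candidates forward would need a list of survivors in place of the single `selected` string.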
EP93903482A 1992-01-13 1993-01-13 Selection of binding-molecules Ceased EP0623141A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US81985592A 1992-01-13 1992-01-13
US819855 1992-01-13
PCT/US1993/000321 WO1993014108A1 (en) 1992-01-13 1993-01-13 Selection of binding-molecules

Publications (1)

Publication Number Publication Date
EP0623141A1 true EP0623141A1 (en) 1994-11-09

Family

ID=25229263

Family Applications (1)

Application Number Title Priority Date Filing Date
EP93903482A Ceased EP0623141A1 (en) 1992-01-13 1993-01-13 Selection of binding-molecules

Country Status (3)

Country Link
EP (1) EP0623141A1 (en)
CA (1) CA2128016A1 (en)
WO (1) WO1993014108A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2702219C (en) 1996-11-06 2013-01-08 Sequenom, Inc. High density immobilization of nucleic acids
US7285422B1 (en) 1997-01-23 2007-10-23 Sequenom, Inc. Systems and methods for preparing and analyzing low volume analyte array elements
US6015709A (en) * 1997-08-26 2000-01-18 Ariad Pharmaceuticals, Inc. Transcriptional activators, and compositions and uses related thereto
AU2002245047A1 (en) 2000-10-30 2002-07-24 Sequenom, Inc. Method and apparatus for delivery of submicroliter volumes onto a substrate
WO2009039122A2 (en) 2007-09-17 2009-03-26 Sequenom, Inc. Integrated robotic sample transfer device
EP3645546A4 (en) 2017-06-30 2021-12-01 Solstice Biologics, Ltd. Chiral phosphoramidite auxiliaries and methods of their use

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4182654A (en) * 1974-09-18 1980-01-08 Pierce Chemical Company Production of polypeptides using polynucleotides
US5010175A (en) * 1988-05-02 1991-04-23 The Regents Of The University Of California General method for producing and selecting peptides with specific properties
DE69128350T2 (en) * 1990-06-11 1998-03-26 Nexstar Pharmaceuticals Inc NUCLEIC ACID LIGANDS

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9314108A1 *

Also Published As

Publication number Publication date
WO1993014108A1 (en) 1993-07-22
CA2128016A1 (en) 1993-07-22

Similar Documents

Publication Publication Date Title
US5783384A (en) Selection of binding-molecules
Halpin et al. DNA display III. Solid-phase organic synthesis on unprotected DNA
CA2132103C (en) Encoded combinatorial chemical libraries
US11753744B2 (en) DNA barcoding of designer mononucleosome and chromatin array libraries for the profiling of chromatin readers, writers, erasers, and modulators thereof
US6436665B1 (en) Methods for encoding and sorting in vitro translated proteins
US6423493B1 (en) Combinatorial selection of oligonucleotide aptamers
US5109124A (en) Nucleic acid probe linked to a label having a terminal cysteine
JPH04504409A (en) Continuous peptide and oligonucleotide synthesis using immunoaffinity technology
EP0405913B1 (en) Hydrophobic nucleic acid probe
Lovrinovic et al. Synthesis of protein–nucleic acid conjugates by expressed protein ligation
JPH0644880B2 (en) Test method and kit for target nucleic acid sequence
Schürer et al. Aptamers that bind to the antibiotic moenomycin A
WO2001062968A2 (en) Mutant nucleic binding enzymes and use thereof in diagnostic, detection and purification methods
WO2017028548A1 (en) Mirror nucleic acid replication system
EP1625230A2 (en) Selection and evolution of chemical libraries
CN103882532B (en) A kind of synthesis of lead compound and screening method and test kit
EP0623141A1 (en) Selection of binding-molecules
US20040091874A1 (en) Sensor chip for nucleic acid selection
JP2008253176A (en) Linker for obtaining highly affinitive molecule
McDougall et al. Tertiary structure of the eukaryotic ribosomal 5 S RNA. Accessibility of phosphodiester bonds to ethylnitrosourea modification.
US6982145B1 (en) Isolation and identification of control sequences and genes modulated by transcription factors
JP3853161B2 (en) Method for amplifying trace amounts of mRNA and cDNA
JP2003516159A (en) Products comprising a support on which nucleic acids are immobilized, and their use as DNA chips
WO2001068807A2 (en) Identification of in vivo dna binding loci of chromatin proteins using a tethered nucleotide modification enzyme
WO2008124111A2 (en) System for pulling out regulatory elements in vitro

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19940812

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LI LU MC NL PT SE

17Q First examination report despatched

Effective date: 19970130

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20010318