WO2015166036A1 - Peptide libraries - Google Patents

Peptide libraries Download PDF

Info

Publication number
WO2015166036A1
WO2015166036A1 PCT/EP2015/059496 EP2015059496W WO2015166036A1 WO 2015166036 A1 WO2015166036 A1 WO 2015166036A1 EP 2015059496 W EP2015059496 W EP 2015059496W WO 2015166036 A1 WO2015166036 A1 WO 2015166036A1
Authority
WO
WIPO (PCT)
Prior art keywords
library
peptides
amino acids
peptide
phage
Prior art date
Application number
PCT/EP2015/059496
Other languages
French (fr)
Inventor
Katja SIEGERS
Jan Van Den Brulle
Original Assignee
Morphosys Ag
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Morphosys Ag filed Critical Morphosys Ag
Priority to US15/306,538 priority Critical patent/US10822604B2/en
Priority to EP15720692.1A priority patent/EP3137482A1/en
Publication of WO2015166036A1 publication Critical patent/WO2015166036A1/en
Priority to US17/014,084 priority patent/US11352620B2/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1037Screening libraries presented on the surface of microorganisms, e.g. phage display, E. coli display
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K1/00General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
    • C07K1/04General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length on carriers
    • C07K1/047Simultaneous synthesis of different peptide species; Peptide libraries
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6845Methods of identifying protein-protein interactions in protein mixtures

Definitions

  • the invention relates to novel libraries containing both linear and cyclic peptides, and methods of generating and screening such libraries for biological, pharmaceutical and other uses.
  • Peptide libraries have many uses. Such libraries can be used to identify therapeutically relevant molecules, or can serve other purposes, such as, the epitope mapping and characterization of therapeutically relevant molecules.
  • peptides have certain advantages over small molecules and large molecule inhibitors, such as, antibodies. As compared to small molecules, peptides typically have a larger interface with an antigen, which interface comprises hydrogen bonds and van der Waals forces. This leads to high binding affinities, a high specificity for the antigen and typically a high potency. As compared to antibodies, peptides are much smaller and therefore typically penetrate tissue more easily. Certain tumors are inaccessible for antibody therapy.
  • Linear peptides may be used in libraries.
  • Linear peptides for example, at times have certain disadvantages. They are highly flexible and do not typically adopt unique, reproducible conformations. The lack of fixed structure reduces the affinity a peptide might have for an antigen and makes determination of the active conformation of the peptide extremely difficult. In addition, linear peptides are more easily susceptible to proteases, therefore, may be degraded in the human body.
  • Constrained peptides have much higher reproducible conformations and are generally more resistant to proteases.
  • Peptides which are presented in a constrained manner may be generated by various means.
  • cyclic peptides may be utilized, e.g. cyclic peptides which are formed by disulfide bonds.
  • a complementary method for peptide library based lead discovery is the display of libraries on filamentous bacteriophages. This method allows the preparation of libraries as large as 10 10 unique peptide members, many orders of magnitude larger than libraries that may be prepared synthetically.
  • advantages of phage display include ease of library construction, coupling of the binding entity (displayed peptide) to a unique identifier (its DNA sequence), a selection protocol for amplifying binding clones in a pool, and the high fidelity of biosynthesis (compared to synthetic methods). Furthermore, rapid and inexpensive selection protocols are available for identifying those library members that bind to a target of interest.
  • libraries displaying constrained, for example, cyclic, peptides can be distinguished based upon whether they display large single loop peptides, e.g. having cyclic structures, or peptides having multiple smaller loops.
  • Large single loop libraries have the advantage of presenting many different conformations. While multiple, smaller loop libraries offer more constrained peptides with reproducible conformations, and peptides having multiple binding sites may have higher binding affinity and specificity than single larger loop peptides.
  • Ph.DTM Phage Display Libraries Two of these libraries are libraries of randomized linear peptides having either 7 or 12 amino acids in length. Other libraries of this system comprise fixed length cyclic peptides, each having an N and C terminal cysteine residue. Cyclic structures are formed via the disulfide bonds between the cysteine residues.
  • Bicycle Therapeutics provides a phage display peptide library having randomized constrained peptides of 16 amino acids in length.
  • the peptides comprise three fixed cysteine residues at positions 2, 9, 16.
  • the constrained structure is formed via a bond between a cysteine residue and a chemical moiety, thus resulting in each member having two fixed length randomized loops (see WO2009098450).
  • Genentech describes phage display libraries of fixed length randomized cyclic peptides where each member has an N and C terminal cysteine which form a disulfide bond (see WO200077194). These libraries each have different fixed lengths, the different libraries ranging from 5-16 amino acids.
  • epitope mapping technologies which utilize libraries of linear peptides. These peptides are tested with the binding molecule of interest thereby allowing the determination of linear epitopes (see Pepscan WO 84/03564 and WO 93/09872).
  • the a target protein of interest is split into a set of overlapping linear oligopeptides, which are separately produced and immobilized by chemical synthesis on solid support systems.
  • the use of such target-specific custom made peptide libraries is, however, a rather cost and labor/time intensive process which is limited to linear peptide sequences and, therefore, does not allow the identification of conformational epitopes.
  • the peptide libraries of the present disclosure comprise linear and cyclic, constrained peptides, wherein in embodiments the ratio of cyclic to linear peptides can be specifically designed.
  • said peptide libraries are phage display libraries.
  • the peptides are translated from nucleic acids.
  • Such libraries can be used to identify therapeutically relevant and therapeutically active molecules, or can be used to characterize such molecules by means, such as, epitope mapping.
  • the disclosed library incorporates a) linear and cyclic peptides, b) constrained, for example, cyclic, peptides having a range of different loop lengths, and c) constrained, for example, cyclic, peptides having one or two or more loops.
  • a small percentage of molecules may have at least two cysteine molecules, forming a small percentage of constrained peptides. This is however a random process and the libraries have a different composition than those disclosed herein.
  • the libraries of the present disclosure have utility in both situations, i.e. in situations where either a linear or constrained peptide are sought in one screening.
  • the library is designed to have a predictable proportion of both linear and constrained peptides.
  • the present disclosure provides such a design as specific positions are selected to encode either a cysteine residue or other amino acid, wherein the ratio selected enables a higher proportion of constrained, for example, cyclic, peptides to be displayed and expressed as compared to the known randomized linear peptide libraries.
  • the presently described libraries allow for a very broad diversity, as compared to the state of the art.
  • the state of the art linear peptides are known to often fail to maintain reproducible conformations.
  • the state of the art cyclic peptides are formed by N and C terminal disulfide bonds, with fixed length cyclic regions, which also limits the diversity of conformations presented.
  • the present disclosure provides peptide libraries having a high diversity of conformations useful in many situations.
  • the library presents both linear, and constrained, for example, cyclic peptides, where the peptides have multiple different loop lengths of disulfide bond formed (cyclic) loops ranging from, e.g., 3-17 amino acids in length, thus allowing for a large diversity of conformations being presented.
  • cyclic peptides where the peptides have multiple different loop lengths of disulfide bond formed (cyclic) loops ranging from, e.g., 3-17 amino acids in length, thus allowing for a large diversity of conformations being presented.
  • such design produces peptides having 0, -1 , 2 or more cysteines,-allowing for the production of peptides having even more than one or two loops, thus further diversifying the conformations presented even more.
  • the libraries comprise (a) linear and cyclic peptides, (b) cyclic. peptides having different loop lengths, and (c) cyclic peptides having two or more loops.
  • This allows for the presentation of both linear and cyclic peptides in one screening.
  • the libraries comprise peptides having different loop lengths, a wide variety of conformations can be displayed in one screening.
  • the libraries comprise peptides having two or more loops, further conformations can be displayed in one screening.
  • Such a diversified library is not yet known, and for the first time provides all of the above features in one library where one screening can be used to more quickly identify important molecules.
  • the state of the art teaches small peptides ranging from 5-16 amino acids in length.
  • the present disclosure provides libraries of peptides of 15 amino acids or more in length.
  • An embodiment of the present disclosure provides a peptide library, wherein each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids, b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 5-50% cysteine and an equal mixture of the remaining natural occurring amino acids, and d) m and n are both, and independently from each other, 3-20.
  • the natural occurring amino acids are selected from A, C, D, E, F, G, H, I, K, L, , N, P, Q, R, S, T, V, W, and Y.
  • An embodiment of the present disclosure provides a peptide library as shown in Figure 1.
  • the peptide libraries of the present disclosure are displayed on bacteriophage.
  • Phage display is known to have significant advantages in allowing the rapid selection of useful molecules. This method allows the preparation of libraries as large as 10 10 unique peptide members, many orders of magnitude larger than libraries that may be prepared synthetically. Using such a robust platform allows for the display of large, diverse libraries.
  • inventions of the present disclosure provide the nucleic acids encoding the peptide libraries of the present diclosure.
  • the present disclosure also provides vectors comprising the nucleic acids encoding the peptide library the present disclosure.
  • the vector is a display vector.
  • the vector is an expression vector.
  • Embodiments of the present disclosure provide methods of identifying a peptide specific for an antigen, comprising contacting an antigen with the peptide library of the present invention, and selecting one or more peptides specific for said antigen.
  • Embodiments of the present disclosure also provide the peptides identified using the peptide libraries of the present invention. DESCRIPTION OF THE DRAWINGS
  • Figure 1 shows a design of a peptide library disclosed herein which expresses a peptide library according to the present disclosure. (SEQ ID No: 6.)
  • Figures 2A-C shows a quality assessment of the peptide library of Figure 1 .
  • This figure shows the position and distribution of each amino acid, including the cysteines, which form the cyclic peptides disclosed herein. Therefore, this figure shows that the design of Figure 1 successfully produces a library with the desired positions and distributions of cysteine residues.
  • Figure 2A shows the amino acid distribution of 99 individually sampled clones using Sanger sequencing.
  • Figure 2B shows the expected amino acid distribution.
  • Figure 2C shows the actual amino acid distribution as evaluated using Next Generation Sequencing.
  • Figure 3A-C shows how the peptide library of Figure 1 expresses clones having 0, 1 , 2 or more cysteine residues.
  • Figure 3A shows the evaluation of 99 individually sampled clones using Sanger sequencing. On average, 2.27 cysteines were identified per clone. Of the 99 clones sampled, 33% were linear and 67% were cyclic.
  • Figure 3B shows an evaluation using Next Generation Sequencing.
  • Figure 3C shows the expected versus obtained cysteines per clone as evaluated using Next Generation Sequencing.
  • Figures 4A-B shows how the peptide library of Figure 1 expresses clones having at least two cysteines, thus forming cyclic structures, and the length distribution of ring sizes.
  • Figure 4 A shows the evaluation of 99 individually sampled clones using Sanger sequencing.
  • Figure 4B shows the ring sizes as evaluated using Next Generation Sequencing.
  • the cyclic peptides comprised loops ranging in size from 3-17 amino acids in length.
  • Figures 5-6 show example peptides (Figure 5: SEQ ID NOS 7-1 1 , respectively, in order of appearance, Figure 6: SEQ ID NOS 12-16, respectively, in order of appearance) expressed from the library of Figure 1. These examples have at least two cysteines and some even four cysteines per peptide, which result in various sized loops and even multiple loops within one peptide.
  • Figure 7 shows a pill display vector for use in displaying the peptide libraries disclosed herein.
  • Figure 8 shows a pVIII display vector for use in displaying the peptide libraries disclosed herein.
  • Figure 9 shows an expression vector for use in expressing the peptides disclosed herein:
  • Figure 10 shows a simplified view of the display and expression vectors for use in displaying the libraries disclosed herein.
  • Figure 11 shows the sequencing results of peptides (SEQ ID NOS 18-40, respectively, in order of appearance) identified in a screening with the peptide library of Figure 1 against streptavidin. This result confirms the utility of the library of Figure 1 as a tool for epitope mapping. The results confirm that the known epitope of streptavidin, HPQ, was to a high confidence level identified in both linear and cyclic peptides.
  • Figure 12 shows the sequencing results of peptides (SEQ ID NOS 41-64, respectively, in order of appearance) identified in a screening with the peptide library of Figure 1 against the anti-c- Myc antibody. This result confirms that a diverse number of specific peptides can be identified, wherein the peptides selected are both linear, constrained, and have a wide range of confirmations.
  • Figure 13 shows a pictorial representation of a portion of the Slonomics method.
  • Figure 14 shows a pictorial representation of a portion of the Slonomics method.
  • Library means an entity comprising more than one member. In the context of the present disclosure this term refers to a library of peptides, wherein said library comprises at least two different peptides.
  • Synthetic means not physically derived from naturally occurring DNA.
  • Peptide means a molecule having less than or equal to 50 amino acids.
  • Peptides "translated from nucleic acids” means peptides that are created using biological processes where the starting material is a nucleic acid, either DNA or RNA and the resulting material are amino acids.
  • the biological process may include intermediary steps, such as transcription from DNA to RNA, and/or translation from RNA to amino acid.
  • Such libraries displaying peptides translated from nucleic acids could be bacteriophage or ribosomal display libraries.
  • Linear refers to a stretch of amino acids or a peptide that does not include any circular structure.
  • Cyclic or “circular” or “loop” as used in the present disclosure refers to a stretch of amino acids or a peptide which includes a circular structure. Not the entire stretch of amino acids or peptide needs to be circular. Cyclic peptides may be formed by covalent or by non-covalent bonds.
  • a typical covalent bond that is utilized within the present disclosure to form cyclic peptides is a disulfide bond, which is formed between two cysteine residues of the peptide.
  • Other covalent bonds that are used within the present disclosure are thioether bonds, such as the thioether bonds which are formed and/or which are present in lanthionines. In vivo lanthionines are formed enzymatically via the dehydration of serine or threonine to yield dehydroalanine and dehydrobutyrine, respectively. These products then react with cysteine thiol to from lanthionine and methyllanthionine, respectively. Chemical synthesis is possible as well.
  • covalent bonds include lysinoalanine linkage between a dehydrated serine to yield dehydroalanine which alkylates a lysine in the same polypeptide.
  • Non-covalent bonds that are used within the present disclosure to from cyclic peptides are typically formed via protein domains, such as zinc-finger domains, a jun-fos interaction or a leucine zipper. Other non-covalent bonds may be used as well, such as hydrogen bonds, dipolar bonds or van der Waals forces.
  • Consstrained refers to a peptide in which the three- dimensional structure is maintained substantially in one spatial arrangement over time.
  • the cyclic peptides within the present disclosure have a constrained conformation.
  • peptides are constrained. For cyclic peptides this can in certain cases be deduced from the analysis of the primary amino acid sequence, for example, by the identification of cysteines. Another way is the addition of a protease to the displayed peptide library. Conformationally constrained peptides are usually not cut by the protease. Reduction in size after cleavage can be detected using mass spectrometry. Finally, mass spectrometry can also be used to analyze the library as such. Many cyclic peptides, especially those that are formed by dehydration, will have a lower mass than the corresponding linear peptides.
  • Member is one molecule forming part of a library. In the context of the present disclosure this term refers to one peptide which is part of the peptide library.
  • “Equal mixture” means that each codon encoding an amino acid has the same probability of occurring as any other codon encoding a different amino acid. As an example, if X1 represents an equal mixture of the naturally occurring amino acids, then each of the 20 naturally occurring amino acids has the same probability of occurring at that position, i.e. 5%. "Natural occurring amino acids” means the following amino acids:
  • vector refers to a polynucleotide molecule capable of transporting another polynucleotide to which it has been linked.
  • Preferred vectors are those capable of autonomous replication and/or expression of nucleic acids to which they are linked.
  • plasmid refers to a circular double stranded DNA loop into which additional DNA segments may be ligated.
  • viral vector Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome.
  • Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and mammalian vectors).
  • vectors can be integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome.
  • Vectors may be compatible with prokaryotic or eukaryotic cells.
  • Prokaryotic vectors typically include a prokaryotic replicon which may include a prokaryotic promoter capable of directing the expression (transcription and translation) of the peptide in a bacterial host cell, such as Escherichia coli transformed therewith.
  • a promoter is an expression control element formed by a DNA sequence that permits binding of RNA polymerase and transcription to occur.
  • Promoter sequences compatible with bacterial hosts are typically provided in plasmid vectors containing convenience restriction sites for insertion of a DNA segment. Examples of such vector plasmids include pUC8, pUC9, pBR322, and pBR329, pPL and pKK223, available commercially.
  • “Expression vectors” are those vectors capable of directing the expression of nucleic acids to which they are operatively linked and is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- associated viruses), which serve equivalent functions.
  • viral vectors e.g., replication defective retroviruses, adenoviruses and adeno- associated viruses
  • Display vector includes a DNA sequence having the ability to direct replication and maintenance of the recombinant DNA molecule extra chromosomally in a host cell, such as a bacterial host cell, transformed therewith. Such DNA sequences are well known in the art.
  • Display vectors can for example be phage vectors or phagemid vectors originating from the class of fd, M13, or fl filamentous bacteriophage. Such vectors are capable of facilitating the display of a protein including, for example, a binding protein or a fragment thereof, on the surface of a filamentous bacteriophage.
  • Display vectors suitable for display on phage, ribosomes, DNA, bacterial cells or eukaryotic cells, for example yeast or mammalian cells are also known in the art, for example, as are viral vectors or vectors encoding chimeric proteins.
  • recombinant host cell refers to a cell into which a recombinant expression vector has been introduced. It should be understood that such terms are intended to refer not only to the particular subject cell but to the progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term "host cell” as used herein.
  • Typical host cells are prokaryotic (such as bacterial, including but not limited to E. coli) or eukaryotic (which includes yeast, mammalian cells, and more).
  • Bacterial cells are preferred prokaryotic host cells and typically are a strain of Escherichia coli (E. coli) such as, for example, the E. coli strain DH5 available from Bethesda Research Laboratories, Inc., Bethesda, Md.
  • E. coli Escherichia coli
  • Preferred eukaryotic host cells include yeast and mammalian cells including murine and rodents, preferably vertebrate cells such as those from a mouse, rat, monkey or human cell line, for example HKB1 1 cells, PERC.6 cells, or CHO cells.
  • vectors into host cells may be accomplished by a number of transformation or transfection methods known to those skilled in the art, including calcium phosphate precipitation, electroporation, microinjection, liposome fusion, RBC ghost fusion, protoplast fusion, viral infection and the like.
  • transformation or transfection methods including calcium phosphate precipitation, electroporation, microinjection, liposome fusion, RBC ghost fusion, protoplast fusion, viral infection and the like.
  • monoclonal full-length antibodies, Fab fragments, Fv fragments and scFv fragments is well known.
  • Transformation of appropriate cell hosts with a recombinant DNA molecule is accomplished by methods that typically depend on the type of vector and cells used.
  • transformation of prokaryotic host cells see, for example, Cohen et al., Proceedings National Academy of Science, USA, Vol. 69, P. 21 10 (1972); and Maniatis et al., Molecular Cloning, a Laboratory Manual, Cold spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1982).
  • retroviral vectors containing rDNAs see for example, Sorge et al., Mol. Cell. Biol., 4:1730-1737 (1984); Graham et al., Virol., 52:456 (1973); and Wigler et al., Proceedings National Academy of Sciences, USA, Vol. 76, P. 1373- 1376 (1979).
  • epitope refers to an antigenic determinant, i.e. the part of an antigen that is recognized by a binding molecule, such as an antibody or a peptide.
  • “Phage display” is a technique by which variant polypeptides are displayed as fusion proteins to a coat protein on the surface of phage, e g filamentous phage particles.
  • a utility of phage display lies in the fact that large libraries of randomized protein variants can be rapidly and efficiently sorted for those sequences that bind to a target molecule with high affinity display of peptides and proteins libraries on phage has been used for screening millions of polypeptides for ones with specific binding properties.
  • Polyvalent phage display methods have been used for displaying small random peptides and small proteins through fusions to either gene III or gene VIII of filamentous phage Wells and Lowman ( 1992) Curr Opin Struct Biol B 355-362 and references cited therein.
  • monovalent phage display a protein or peptide library is fused to a gene III or a portion thereof and expressed at low levels in the presence of wild type gene III protein so that phage particles display one copy or none of the fusion proteins.
  • Phage display describes a selection technique in which a library of peptide or protein variants is expressed on the outside of a phage virion, while the genetic material encoding each variant resides on the inside. This creates a physical linkage between each variant protein sequence and the DNA encoding it, which allows rapid partitioning based on binding affinity to a given target molecule (antibodies, enzymes, cell-surface receptors, etc.) by an in vitro selection process called panning.
  • panning is carried out by incubating a library of phage-displayed peptides on a plate (or bead) coated with the target, washing away the unbound phage, and eluting the specifically bound phage. The eluted phage are then amplified and taken through additional binding/amplification cycles to enrich the pool in favor of binding sequences. After a few rounds, individual clones are characterized by DNA sequencing and ELISA.
  • a "phagemid” is a plasmid vector having a bacterial origin of replication, e g , ColE 1 , and a copy of an intergenic region of a bacteriophage.
  • the phagemid may be based on any known bacteriophage including filamentous bacteriophage.
  • the plasmid will also generally contain a selectable marker for antibiotic resistance. Segments of DNA cloned into these vectors can be propagated as plasmids. When cells harboring these vectors are provided with all genes necessary for the production of phage particles, the mode of replication of the plasmid changes to rolling circle replication to generate copies of one strand of the plasmid DNA and package phage particles.
  • the phagemid may form infectious or non-infectious phage particles
  • This term includes phagemids which contain a phage coat protein gene or fragment thereof linked to a heterologous polypeptide gene as a gene fusion such that the heterologous polypeptide is displayed on the surface of the phage particle Sambrook et. al. 417.
  • phage vector means a double stranded replicative form of a bacteriophage containing a heterologous gene and capable of replication.
  • the phage vector has a phage origin of replication allowing phage replication and phage particle formation.
  • the phage is preferably a filamentous bacteriophage, such as, an M I3 fl . fd, Pf3 phage or a derivative thereof, a lambdoid phage, such as lambda, 21 , phi80, phi81 . 82, 424.
  • Preparation of DNA from cells means isolating the plasmid DNA from a culture of the host cells. Commonly used methods for DNA preparation are the large- and small-scale plasmid preparations described in sections 125- 133 of in Sambrook et al. After preparation of the DNA it can be purified by methods well known in the art such as that described in section 140 of Sambrook et. al.
  • coat protein means a protein, at least a portion of which is present on the surface of the virus particle. From a functional perspective, a coat protein is any protein which associates with a virus particle during the viral assembly process in a host cell, and remains associated with the assembled virus until it infects another host cell.
  • the coat protein may be the major coat protein or may be a minor coat protein.
  • a "major” coat protein is a coat protein which is present in the viral coat at 10 copies of the protein or more, e.g. major coat protein p V III.
  • a major coat protein may be present in tens, hundreds or even thousands of copies per virion.
  • a minor coat protein is present in the viral coat at less than 10 copies per phage, e.g. minor coat protein pill.
  • a "fusion protein” is a polypeptide having two portions covalently linked together, where each of the portions is a polypeptide having a different property
  • the property may be a biological property, such as activity in vitro or in vivo
  • the property may also be a simple chemical or physical property, such as binding to a target molecule, catalysis of a reaction, etc.
  • the two portions may be linked directly by a single peptide bond or through a peptide linker containing one or more ammo acid residues Generally, the two portions and the linker will be in reading frame with each other
  • PCR Polymerase chain reaction
  • RNA and/or DNA are amplified as described in U S Patent No 4 683, 195 issued 28 July 1987
  • sequence information from the ends of the region of interest or beyond needs to be available, such that ohgonucleotide primers can be designed, these primers will be identical or similar in sequence to opposite strands of the template to be amplified
  • the 5' terminal nucleotides of the two primers may coincide with the ends of the amplified material
  • PCR can be used to amplify specific RNA sequences, specific DNA sequences from total genomic DNA, and cDNA transcribed from total cellular RNA, bactenophage or plasmid sequences, etc See generally Mullis et al ( 1987) Cold Spring Harbor S ⁇ mp Quant Biol 51 263 , Erlich.
  • PCR is considered to be one, but not the only, example of a nucleic acid polymerase reaction method for amplifying a nucleic acid test sample comprising the use of a known nucleic acid as a primer and a nucleic acid polymerase to amplify or generate a specific piece of nucleic acid Detailed Description of the Invention
  • the present disclosure is directed to peptide libraries comprising constrained peptides and linear peptides.
  • the present disclosure is directed to peptide libraries comprising cyclic peptides and linear peptides.
  • Such libraries are useful for numerous purposes; including epitope mapping and the identification of peptides with pharmaceutical properties, such as anti-microbial or anti-viral peptides, material-specific peptides, small molecule binders, novel enzyme substrates and other peptides useful for drug lead discovery.
  • a library may be designed to have a predictable proportion of both linear and constrained, for example, cyclic, peptides presented.
  • the peptide libraries comprise linear and cyclic peptides, wherein the cyclic peptides are formed by one or more covalent or one or more non-covalent bonds.
  • the covalent bond is a disulfide bond or a bond between two non- naturally occurring amino acids, such as, a thioether bond.
  • the thioether bond is a lanthiopeptide, such as a lanthiopeptide formed between a dehydrated serine and a cysteine or a dehydrated threonine and a cysteine.
  • the covalent bond is a disulfide bond. In certain embodiments, the disulfide bond is formed by two cysteine residues.
  • the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 cysteine residues.
  • Figure 3A shows the sequencing results of a peptide library of the present disclosure which comprises peptides having 0, 1 , 2, 3, 4, 5, 6 or 7 cysteines per peptide. . In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3 or more cysteine residues.
  • the library comprises peptides comprising 0, 1 , 2, and 3 cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3 and 4 cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 and 5 cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or more cysteine residues.
  • the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 and 6 cysteine residues.
  • the covalent bond is a lysinoalanine linkage formed between a dehydrated serine and a lysine.
  • the non-covalent bond is a formed via a protein domain, such as, a zinc-finger domain, a jun-fos interaction or a leucine zipper.
  • the disclosed library incorporates a) linear and cyclic peptides, b) constrained and non-constrained peptides, c) cyclic peptides having a range of different loop lengths, and d) cyclic peptides having one or two or more loops.
  • a library of peptides comprises (a) linear and cyclic peptides, (b) cyclic peptides having different loop lengths, and (c) cyclic peptides having two or more loops.
  • the library comprises synthetic peptides. In embodiments, the library comprises peptides translated from nucleic acids.
  • the library comprises cyclic peptides with loop lengths ranging from 3-17 amino acids in length. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, and 17 amino acids.
  • Figures 4A and 4B shows the sequencing results of a peptide library of the present disclosure which comprises peptides having comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids.
  • the library comprises cyclic peptides comprising loop lengths of 3, or 4, or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, and 4 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4 and 5 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5 and 6 amino acids.
  • the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6 and 7 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7 and 8 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8 and 9 amino acids.
  • the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9 and 10 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10 and 11 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12 or more amino acids.
  • the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 and 12 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12 and 13 amino acids. In
  • the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13 and 14 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8; 9, 10, 1 1 , 12, 13, 14, 15 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14 and 15 amino acids.
  • the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7,8, 9, 10, 11. ⁇ 2, 13, 14, 15, 16 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15 and 16 amino acids.
  • the library comprises cyclic peptides having two or more loops.
  • the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, or more loops. In embodiments, the library comprises peptides comprising 0, 1 , and 2 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2 and 3 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 or more loops.
  • the library comprises peptides comprising 0, 1 , 2, 3 and 4 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 and 5 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 and 6 loops.
  • the present disclosure provides such a design as specific positions are selected to encode either a cysteine residue or another amino acid, wherein the ratio selected enables a higher proportion of cyclic peptides as compared to linear peptides to be displayed and expressed as compared to randomized linear peptide libraries.
  • a linear peptide library of 20 codons randomized using NNN or NNK technology has a probability of 12.8% that each member contains two or more cysteine residues.
  • An embodiment of the present disclosure therefore provides a library comprising linear and constrained, for example, cyclic, peptides, wherein the proportion of members comprising constrained peptides is 13% or more, 14% or more, 15% or more, 16% or more, 17% or more, 18% or more, 19% or more, 20% or more, 21% or more, 22% or more, 23% or more, 24% or more, or 25% or more.
  • the present disclosure provides a library comprising linear and cyclic peptides, wherein the proportion of members comprising cyclic peptides is 13% or more, 14% or more, 15% or more, 16% or more, 17% or more, 18% or more, 19% or more, 20% or more, 21 % or more, 22% or more, 23% or more, 24% or more, or 25% or more.
  • the present disclosure provides peptide libraries that have a high diversity of conformations useful in many situations, as the library presents both linear, and cyclic peptides, where the cyclic peptides have multiple different lengths of disulfide bond formed loops ranging from, e.g., 3-17 amino acids in length, thus allowing for a large diversity of conformations being presented in the library.
  • such design produces peptides having 0, 1 , 2 or more cysteine residues, allowing for the production of peptides having even more than one or even more than two loops, thus diversifying the conformations presented even more.
  • each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein
  • Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids
  • X are each an equal mixture of the natural occurring amino acids, excluding cysteine
  • Amix are each a mixture of 5-50% cysteine and an equal mixture of the remaining natural occurring amino acids
  • m and n are both, and independently from each other, 3-20.
  • m is 3 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 4 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17,
  • n is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15,
  • m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 7 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 8 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 9 and n is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 10 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 1 1 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 12 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 13 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 14 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 18,
  • n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16,
  • m is 16 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 17 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 18 and n is 3, 4, 5, 6, 7, 8,
  • n 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 19 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • m is 20 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 3 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 4 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17,
  • n is 5 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 6 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 7 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 8 and m is 3, 4, 5, 6, 7, 8, 9,
  • n is 9 and m is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 10 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 1 1 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 12 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 13 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 14 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 14 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 14 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14,
  • n is 15 and m is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 16 and m is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 17 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 18 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 19 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 20 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • (Amixj comprises 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, or 50% cysteine, and an equal mixture of the remaining natural occurring amino acids.
  • the library comprises cyclic peptides having disulfide formed loops of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20 amino acids in length.
  • the library comprises synthetic peptides that are 9 or more amino acids in length, 10 or more amino acids in length, 11 or more amino acids in length, 12 or more amino acids in length, 13 or more amino acids in length, 14 or more amino acids in length, 15 or more amino acids in length, 16 or more amino acids in length, 17 or more amino acids in length, 18 or more amino acids in length, 19 or more amino acids in length, 20 or more amino acids in length, 21 or more amino acids in length, 22 or more amino acids in length, 23 or more amino acids in length, 24 or more amino acids in length, 25 or more amino acids in length, 26 or more amino acids in length, 27 or more amino acids in length, 28 or more amino acids in length, 29 or more amino acids in length, 30 or more amino acids in length, 31 or more amino acids in length, 32 or more amino acids in length, 33 or more amino acids in length, 34 or more amino acids in length, 35 or more amino acids in length, 36 or more amino acids in length , 37 or more amino acids in length, 38 or more amino acids in length,
  • each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein
  • Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids
  • X are each an equal mixture of the natural occurring amino acids, excluding cysteine
  • Amix are each a mixture of 10-20% cysteine and an equal mixture of the remaining natural occurring amino acids
  • n is 3-20.
  • m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • (Amix) comprises a mixture of 10-20% cysteine and an equal mixture of the remaining natural occurring amino acids. In a preferred embodiment, (Amix) comprises a mixture of 15% cysteine and an equal mixture of the remaining natural occurring amino acids.
  • n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • the natural occurring amino acids are selected from A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y.
  • each member of the library comprises an amino acid sequence (X) I Cmix - (X) m - (Amix) n, wherein
  • Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids
  • X are each an equal mixture of the natural occurring amino acids, excluding cysteine
  • Amix are each a mixture of 10-20% cysteine and an equal mixture of the remaining natural occurring amino acids
  • n 3-20
  • I is 1 , 2 or 3.
  • I is 1 and m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 1 and m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • I is 2 and m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 2 and m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 3 and m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 3 and m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
  • An embodiment of the present disclosure provides a peptide library having a design as shown in Figure 1 .
  • each member of the library comprises an amino acid sequence (X) l-Cmix- (X) m-(Amix)n, wherein
  • Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids
  • X are each an equal mixture of the natural occurring amino acids, excluding cysteine, and
  • Amix are each a mixture of 15% cysteine and an equal mixture of the remaining
  • n 10.
  • the peptide library is displayed on bacteriophage.
  • Phage display is known to have significant advantages in allowing the rapid selection of useful molecules. This method allows the preparation of libraries as large as 10 10 unique peptide members, many orders of magnitude larger than libraries that may be prepared synthetically. Using such a robust platform allows for the display of large, diverse libraries.
  • the library of the instant invention contains at least about 10 7 member peptides, each of which has at least one amino acid variation from others.
  • the library contains at least about 10 8 peptides, or at least about 10 9 peptides.
  • the library comprises constrained and/or cyclic peptides which are formed by disulfide bonds between two or more cysteines, zinc-finger domains, a jun-fos interaction, a leucine zipper, thioether bonds, such as the thioether bonds which are formed and/or which are present in lanthionines, or lysinoalanine linkages.
  • the peptide library comprises constrained, for example, cyclic, peptides which are formed by disulfide bonds between two or more cysteine residues. In certain embodiments, the peptide library comprises cyclic peptides which are formed by disulfide bonds between two or more cysteine residues.
  • the library comprises cyclic or constrained peptides which are formed by disulfide bonds between two or more cysteine residues, wherein the two or more cysteine residues are not located at the N or the C-terminus of the peptide, or are located at either the N or the C-terminus of the peptide, but seldom both.
  • the present invention provides peptide libraries comprising peptides greater than 16 amino acids in length. In other embodiments the present invention provides libraries comprising peptides having 20 amino acids in length.
  • the present invention provides peptide libraries wherein a portion of the peptides comprise 0, 1 , or 2 or more cysteine residues.
  • the present invention provides a library of nucleic acids encoding the libraries of peptides of the present disclosure.
  • the present invention provides a vector comprising the nucleic acids encoding the libraries of peptides of the present disclosure.
  • the vector is a display vector. In other embodiments, the vector is an expression vector.
  • the present disclosure provides a method of identifying a peptide specific for an antigen, comprising a) contacting an antigen with a library of peptides disclosed herein, and
  • the present disclosure provides a peptide identified using the library of peptides disclosed herein.
  • Phage display methods for proteins, peptides and mutated variants thereof including constructing a family of variant replicable vectors containing a transcription regulatory element operably linked to a gene fusion encoding a fusion polypeptide, transforming suitable host cells, culturing the transformed cells to form phage particles which display the fusion polypeptide on the surface of the phage particle, contacting the recombinant phage particles with a target molecule so that at least a portion of the particle bind to the target, separating the particles which bind from those that do not bind, are known and may be used with the libraries disclosed herein.
  • the peptides are fused to at least a portion of a phage coat protein to form a fusion protein containing the peptide disclosed herein.
  • the fusion protein can be made by expressing a gene fusion encoding the fusion protein using known techniques of phage display.
  • the fusion protein may form part of a phage or phagemid particle in which one or more copies of the peptide are displayed on the surface of the particle.
  • An embodiment includes a nucleic acid encoding the peptide or the fusion proteins described herein.
  • the present disclosure provides vectors comprising the fusion genes noted above, as well as a library of these vectors.
  • the library of vectors may be in the form of a DNA library, a library of virus (phage or phagemid) particles containing the library of fusion genes or in the form of a library of host cells containing a library of the expression vectors or virus particles.
  • the present disclosure provides a method comprising the steps of preparing a library containing a plurality of vectors, each vector comprising a transcription regulatory element operably linked to a gene fusion encoding a fusion protein, wherein the gene fusion comprises a first gene encoding a peptide disclosed herein and a second gene encoding at least a portion of a phage coat protein, wherein the library comprises a plurality of genes encoding peptide fusion proteins.
  • the gene encoding the coat protein of the phage and the gene encoding the desired polypeptide portion of the fusion protein of the invention can be obtained by methods known in the art (see generally, Sambrook et al)
  • the DNA encoding the gene may be chemically synthesized (Merrfield ( 1963) 7 Am Chem Soc 85:2149).
  • the phage coat protein is preferably the gene III or gene VIII coat protein of a filamentous phage, such as, M13.
  • Suitable gene III vectors for display of peptides include fUSE5 (Scott. J K . and Smith G P (1990) Science 249 386-390), fAFFI (Cwirla et al ( 1990) Proc Natl Acad Set U S A 87 6378- 6382), fd-CATI (McCafferty et al ( 1990) Nature (London) 348 552-554), m663 (Fowlkes et al ( 1992) Biotechniques 13 422-427), tdtetDOG.
  • Suitable phage and phagemid vectors for use in this invention include all known vectors for phage display Additional examples include pCombo (Gram et al. ( 1992) Proc. Natl. Acad. Sci. USA 89:3576-3580), pC89 (Fehci et al. ( 1991 ) 7.
  • helper phage any known helper phage may be used when a phagemid vector is employed in the phage display system.
  • suitable helper phage include M 13-K07 (Pharmacia), M 13-VCS (Stratagene), and R408 (Stratagene).
  • Suitable host cells which can be transformed by electroporation include gram negative bacterial cells such as E. coli.
  • Suitable E. coli strains include TG1 F+, TG1 F-, JM 101 , E. coli K 12 strain 294 (ATCC number 3 1.446), E. coll strain W31 10 (ATCC number 27.325), E. coli X1776 (ATCC number 31 ,537), E. coli XL- 1 Blue (Stratagene). and E. coli B; however many other strains of E. coli, such as XL I -Blue MRF' , SURE. ABLE C. ABLE K.
  • WM 1 100, MC 1061 , HB 101 , CJ 136. MV 1 190. JS4, JS5, NM522. NM538, and NM539 may be used as well.
  • Cells are made competent using known procedures. Sambrook et al, 1.76- 1 .81 ,16.30.
  • the host cell for electroporation is a competent E. coli strain containing a phage F episome. Any P episome which enables phage replication in the strain may be used in the invention.
  • Phage or phagemid vector DNA can be isolated using methods known in the art, for example, as described in Sambrook et al.
  • the isolated DNA can be purified by methods known in the art such as that described in section 140 of Sambrook et al.
  • This purified DNA can then be analyzed by DNA sequencing DNA sequencing may be performed by the method of Messing et al ( 1981 ) Nucleic Acids Res 9 309. The method of Maxam et al (1980) Meth Enzymol 65 499 , or by any other known method. Method of generating diversified libraries
  • the Slonomics method uses a defined number of standardized building blocks is chemically synthesized as single-stranded oligonucleotides containing self-complementary regions. Intra- strand base pairing of these regions leads to the formation of a stable hairpin-like secondary structure comprising a short loop of four nucleotides, a double-stranded stem region assuring the stability of the molecule, and a three-nucleotide single-stranded overhang.
  • Two different classes of building blocks are defined as "splinkers” and “anchors.” All splinker molecules share the same scaffold structure and differ only in their variable three-base single-stranded overhangs. In contrast, the anchor oligonucleotides differ in the overhang and also in the directly adjacent base triplet.
  • each anchor molecule harbors an additional biotin modification in the loop region, allowing the oligonucleotide to be coupled to a streptavid in -coated surface with high affinity.
  • the two types of oligonucleotides are further characterized by the presence of different recognition sites for type IIS restriction enzymes within their stem regions.
  • the anchor oligonucleotide contains a recognition site for Earn 1 104I (CTCTTC[1/4], generating a three-base overhang) and the corresponding splinker molecule harbors a recognition site for Esp3l (CGTCTC[1/5], generating a four-base overhang).
  • the sequence is first assembled as smaller sub-fragments of 18 bp. These so-called “elongation blocks” can be synthesized in parallel reactions.
  • one anchor and one splinker molecule are ligated via hybridization (Watson-Crick base pairing) of complementary single- stranded overhangs.
  • this step is performed in solution, since enzymatic reactions in solution occur at much faster rates than those on solid supports, where diffusion pathways are much longer.
  • the resulting product is immobilized on a streptavidin- coated 96-well plate via the biotin modification of the anchor molecule. No reacted material is removed in a washing step.
  • the remaining surface-bound ligation products are subsequently cleaved by Eam1104I, which is specific for the anchor that donates the base triplet block.
  • the cleavage of the ligation product by this restriction enzyme releases an elongated, highly pure "intermediate product" that has a new three-base single-stranded overhang and serves as an acceptor for the next anchor molecule.
  • this reaction cycle results in the incorporation of three new bases to the growing chain and a shortened anchor that remains bound to the surface.
  • This reaction cycle is repeated five times to produce an 18 bp DNA fragment.
  • the anchor molecules and the intermediate products should be present in equimolar concentrations.
  • unligated intermediate products can be removed efficiently by washing, while uhligated anchors remain bound to the surface after cleavage.
  • the complete synthesis process comprises two distinct phases. During the initial "elongation" phase, short sub-sequences of the target molecule are produced as described above, resulting in individual elongation blocks with 18 independently definable base; pairs. Since . many of these reactions can be performed in parallel, the entire target sequence is already constructed during the initial elongation process, albeit as a series of short sub-fragments.
  • the socalled "transposition” the pre-assembled elongation blocks are connected in a pair-wise fashion after each block has been cleaved with the appropriate type IIS restriction enzyme. Restriction with Eam1 1041 results in the release of the elongation block from the surface and thereby generates a three-base overhang. Cleavage with Esp3l removes the splinker-component of the molecule, leading to a four-base overhang.
  • the resulting molecules can be assembled in a highly selective manner due to the different length and specific sequences of their overhangs.
  • the ligation reactions can be performed at the solid surface or in solution, which allows for the focus to be on either product purity or yield. Since the resulting molecules still harbor the constant anchor and splinker regions at their terminal ends, this reaction cycle (including washing steps) can be repeated several times, each round resulting in DNA molecules that have doubled in length with respect to those of the previous round.
  • the resulting constructs are "harvested" from the solid surface by cleavage and transferred from the automated production platform to a second standardized system for the final assembly and quality control. Each transposition can be developed robustly up to the T5 level (5 transposition rounds), corresponding to a fragment length of 462 bp. If necessary, these "T5- building blocks" can be further assembled by standard recombinant DNA technology.
  • the Slonomics® technology is also highly efficient and cost-effective.
  • our building blocks are used for multiple reactions over the course of several synthesis projects.
  • all steps of the process can be done in parallel, which allows for the simultaneous production of several gene constructs and enables the transfer of every working step to a robotic platform.
  • the complete synthesis is performed in multi-well plates, and hardware components with demonstrated suitability for robust production processes have been combined in an entirely computer-controlled system. This permits the fully-automated synthesis of any 462 bp DNA fragment, from design to end product, within a time frame of 44 hours.
  • Oligonucleotide-mediated mutagenesis is another method for preparing diversified gene libraries. This technique is well known in the art as described by Zoller et al (1987) Nucleic Acids. Res. 10 6487-6504.
  • Cassette mutagenesis is also a method for preparing the diversified gene libraries. The method is based on that described by Wells et al. (1985) Gene 34:315.
  • An aspect comprises a library of synthetic peptides comprising linear and cyclic peptides, wherein the proportion of cyclic peptides within said library is greater than 13%.
  • a library of peptides comprises (a) linear and cyclic peptides, (b) cyclic peptides having different loop lengths, and (c) cyclic peptides having two or more loops.
  • a library of peptides comprises linear and cyclic peptides
  • the library comprises synthetic peptides. In embodiments, the library comprises peptides translated from nucleic acids.
  • the library comprises peptides having a controlled ratio of cysteines at certain positions.
  • a library of peptides consists of (a) linear and cyclic peptides, (b) cyclic peptides having different loop lengths, and (c) cyclic peptides having two or more loops.
  • the library comprises cyclic peptides with loop lengths ranging from 3-17 amino acids in length. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, and 17 amino acids.
  • the library consists of cyclic peptides with loop lengths ranging from 3-17 amino acids in length. In embodiments, the library consists of cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids. In
  • the library consists of cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, and 17 amino acids.
  • the library comprises cyclic peptides having two or more loops.
  • the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 loops.
  • the library consists of cyclic peptides having two or more loops.
  • the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more loops. In embodiments, the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 loops.
  • the synthetic peptides of the library are displayed on bacteriophage. In embodiments, the synthetic peptides of the library are displayed on ribosomes.
  • the library comprises at least 1 x 10 7 members. In embodiments, the library consists of at least 1 x 10 7 members. In embodiments of the library, the cyclic peptides are formed by one or more covalent or one or more non-covalent bonds.
  • the covalent bond is a disulfide bond or a bond between two non-naturally occurring amino acids, such as, a thioether bond.
  • the thioether bond is a lanthiopeptide, such as a lanthiopeptide formed between a dehydrated serine and a cysteine or a dehydrated threonine and a cysteine.
  • the covalent bond is a lysinoalanine linkage formed between a dehydrated serine and a lysine.
  • the non-covalent bond is a formed via a protein domain, such as, a zinc-finger domain, a jun-fos interaction or a leucine zipper.
  • the covalent bond is a disulfide bond. In certain embodiments ⁇ the disulfide bond is formed by two cysteine residues.
  • the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 cysteine residues.
  • the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more cysteine residues. In certain embodiments, the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 cysteine residues.
  • the synthetic peptides are 9 or more amino acids in length.
  • each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids, b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 5-50% cysteine and an equal mixture of the remaining natural occurring amino acids, and d) m and n are both, and independently from each other, 3-20.
  • each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein a) Amix are each a mixture of 10-20% cysteine and an equal mixture of the
  • each member of the library comprises an amino acid sequence (X) I - Cmix - (X) m - (Amix) n, wherein a) I is 1 -3.
  • each member of the library comprises an amino acid sequence (X) l-Cmix- (X) m- (Amix) n, wherein a) Amix are each a mixture of 15% cysteine and an equal mixture of the remaining natural occurring amino acids, b) I is 3, c) m is 6, and d) n is 10.
  • the library has a design as shown in Figure 1 .
  • Amix comprises 15% cysteine, and an equal mixture of the remaining natural occurring amino acids.
  • An aspect includes a library of nucleic acids encoding the libraries disclosed herein.
  • An aspect includes vector comprising the nucleic acids disclosed herein. In embodiments the vector is a display vector or an expression vector.
  • An aspect includes a method of identifying a peptide specific for an antigen, comprising
  • An aspect includes peptide identified using the method described herein. Examples
  • a phagemid vector containing a N-terminal fusion of the peptide library sequence to the gene of the minor coat protein pill, in combination with Hyperphages harboring a pill gene deletion results in a pentavalent display of the peptides.
  • the pill protein is present in 5 copies at the distal end of the phage particles and its function is required for phage infectivity by binding to the F-pilus of bacterial cells. Although it is thought that if the displayed peptides are sufficiently short enough ( ⁇ 50 residues) the function of the pill-fusion would not to be negatively affected and all five copies of the pill protein can carry displayed peptides, functional impairment of the plll-peptide fusion cannot be completely excluded.
  • pVIII is a major capsid protein, therefore, if multivalent display of the peptide library is desired then fusion of the variable peptide to pVIII should be used.
  • Exemplary display vectors for displaying the libraries disclosed herein are shown in Figures 7- 8 and 10.
  • Exemplary expression vectors for expressing the peptides disclosed herein are shown in Figure 9.
  • cyclic and constrained peptides In addition to the use of linear peptides, it is advantageous to present cyclic and constrained peptides to facilitate the identification of specific interaction partners in Phage Display experiments. This can be realized by two cysteine residues flanking the variable region resulting in the generation of disulfide bond stabilized circular peptides of fixed size.
  • the Slonomics Technology allows the controlled introduction of such cysteine residues at various desired positions within the variable peptides.
  • the percentage of the cysteine residues at the chosen variable positions can be designed to result in the generation of an average of one to two or more cysteine residues per molecule.
  • encoded peptides without cysteine or containing only one cysteine residue will be linear, while the encoded peptides with two cysteine residues or more at desired positions will generate cysteine bridged circles of various lengths.
  • this design for example the design of Figure 1 , it is possible to generate a universal peptide library containing a defined mixture of linear and circular peptides, thereby offering the possibility of a simultaneous presentation of both alternatives in the same screening experiment.
  • Evidence of a library comprising peptides have a controlled frequency of cysteines at certain positions is shown in Figures 2A-C, and 11 and 12.
  • a library having the design shown in Figure 1 was chosen for display and testing.
  • the DNA fragments containing the peptide library sequence were synthesized as follows: The flanking constant regions comprising a signal sequence, epitope tag and spacer regions were synthesized by gene synthesis.
  • the peptide library encoding sequence with a randomized stretch of 20 amino acids was synthesized by Slonomics.
  • the resulting 333 bp completely synthetic linear DNA fragment comprising the peptide library and flanking constant regions was cloned via Xbal and Sail into the pill and pVIII display vectors, respectively ( Figure 7 and Figure 8).
  • Typically 0.25 to 2 of the ligated phagemid DNA of the libraries were used to transform E.
  • coli MC1061 F electrocompetent cells and transformants were collected in TB medium and shaken for at 37 ⁇ C for 1 h. Dilutions of the outgrowth medium were plated on LB/Chloramphenicol/Glucose. Amplification of the libraries was performed by shaking o/n in appropriate amounts of LB/Ghloramphenicol/1%Glucose. Library sizes for the cloned peptide library-L20-plll- and -pVIII-fusions ranged between 1.2 E+09 and 4.4E+09.
  • Phage displaying the pill- and the pVIII-fusions of the L20-peptide library were prepared as follows. For each library phage preparation 80 ml 2x YT/Chloramphenicol/Glucose medium were inoculated with bacteria from the corresponding library glycerol stock resulting in an OD600nm of 0.2 - 0.3. Cultures were shaken until an OD600nm of 0.45 - 0.55 was reached. Then helper phage was added at a multiplicity of infection of 10 to the bacterial culture followed by an incubation for 45 min at 37"C without shaking and then for 45 min at 37"C shaking at 120 rpm. Bacteria were spun down and helper phage containing supernatant was discarded.
  • Phage-infected bacteria were resuspended in 400 ml 2x YT/Chlorarhphenicol/ Kanamycin /IPTG medium and incubated overnight at 22 ⁇ with s haking at 120 rpm. The next day bacteria from the overnight culture were pelleted by centrifugation and the supernatant containing the peptide- presenting phage was collected. Phage precipitation was performed by adding PEG/NaCl to the phage-containing supernatant. The sample was incubated for at least 30 min on ice. Precipitated phage were spun down and resuspended in PBS. The sample was rotated slowly to obtain a homogeneous suspension and residual bacterial debris was pelleted and discarded.
  • phage were precipitated again using PEG/NaCl. Finally, the phage pellet was resuspended in PBS, transferred to a sterile tube and incubated with gentle agitation to obtain a homogeneous suspension. Phage titers were determined by spot titration and UV absorbance (Nanodrop) at OD268nm, and ELISA.
  • the anti-M13 antibody (Santa Cruz) was used for capturing, as it captures phage particles via the major coat protein g8p. For detection three different antibodies were used. A monoclonal anti-M13 (directed against the major coat protein of M13 phage, g8p) conjugated to HRP (Amersham), and a monoclonal antibody against the FLAG epitope conjugated to AP (AP27, Sigma) or monoclonal anti Histidine antibody conjugated to HRP (R&D Systems), as both epitope tags are encoded by the pill- and pVIII-peptide libraries and therefore part of the displayed peptides ( Figure 10) .
  • the capture antibody was immobilized by dispensing antibody solution for the anti-M13 antibody into the wells of a 96-well axisorp plate, sealing the plate with laminated foil and incubating overnight. The next day, the plates were washed 3 times with PBST, and each well was blocked with blocking buffer for at least 1 h at room temperature. After blocking and washing of the plates, dilutions of phage containing supernatants were added to the wells and incubated for 1 h at room temperature. For detection washed plates were either analyzed using QuantaBlu (Pierce) for HRP conjugated antibodies or Attophos fluorescence substrate (Roche, #11681982001 ) for the AP-conjugated anti-FLAG antibody.
  • Another important aspect is the evaluation of the quality and functionality of the peptide library.
  • a qualitative assessment of the phage library, with respect to amino acid distribution, frequency and redundancy was carried out using Sanger sequencing and Next Generation Sequencing.
  • Figure 2A shows the position and distribution of each amino acid, including the cysteines, which form the cyclic peptides disclosed herein. Therefore, this figure shows that the design of Figure 1 successfully produces a library with the desired positions and proportions of cysteines.
  • cyclic peptides comprised loops ranging in size from 3-17 amino acids in length, see Figures 4A-B.
  • Figures 5-6 show example peptides that result from the library of Figure 1 , which have at least two disulfide bonds, and may result in various sized loops within one peptide.
  • 42 of the 99 had a cysteine at position 4, which allowed for the formation of a large loop.
  • 16 had a total of 2 Cys forming one large loop; 18 of the 42 had 3 cysteines forming either one large or one small loop; 3 of the 42 had 4 cysteines forming one large and one small loop; 1 of the 42 had 5 cysteines, 1 of the 42 had 6 cysteines, 1 of the 42 had 7 cysteines, each forming one large and several small loops.
  • 10 of the 24 had 2 cysteines forming one loop; 8 of the 24 had 3 cysteines forming one loop; 6 of the 24 had 4 cysteines and were able to form up to two loops; 1 of the 24 had 5 cysteines, and 1 of the 24 had 6 cysteines and were able to form multiple loops.
  • Streptavidin was obtained from IBA (Goettingen, Germany).
  • the anti-cMyc mouse mAb was obtained from Santa Cruz Biotechnology (Heidelberg, Germany).
  • Streptavidin was used as antigen for test panning because the binding consensus motif HPQ/M is well described and used in several studies for test selection of peptide libraries (Devlin, J. J., Panganiban, L. C. and Devlin, P. E. (1990) Science 249, 404-406; Lam, K.S. and Lebl, M: Streptavidin and Avidin Recognize Peptide Ligands with Different Motifs. Immuno Methods 1 : 11 -15, 1992).
  • the target proteins (Streptavidin and anti-cMyc mlgG) were diluted in PBS for direct coating the surface of a microtiter plate with a protein concentration of 100 nM.
  • phage blocking mixtures were incubated in 2 ml reaction tubes for 2 h at RT shaking gently. After the blocking procedure the wells were washed 2 times with 400 ⁇ PBS and the 300 ⁇ of the pre-blocked phage mix transferred into each blocked well. It was incubated for 2 h at RT on a microtiter plate shaker. After that the phage solution from the target protein coated wells were removed by rapidly inverting the plate over a plastic tray and plates were washed with the following washing conditions (Table 1 and 2).
  • the expected range goes from 1 x10 10 -1 x10 12 phage/ml for the input and 10 4 -10 9 phage/ml for the output.
  • Table 3 shows the input and the output after each round of panning and all values are in the expected range.
  • phage output pools were analyzed with respect to their binding specificity in an additional differential panning round and ELISA against the specific and unrelated target proteins, such as VEGF-165.
  • the check for specificity of binding by ELISA was carried out by using direct coated target proteins and peptide displaying phage' from panning outputs. Bound phages were detected by the additionally encoded Flag tag using anti- Flag detection. To analyze phage expression anti- 13 capture and anti-Flag detection was used.
  • NGS analysis and ELISA of the phage outputs from subsequent and differential panning rounds revealed an enrichment of specific binders, that bound to the specific target proteins but did not show binding to unrelated target proteins.
  • Primary hits were defined as an ELISA signal of at least 5-fold above the background.

Abstract

The invention relates to novel libraries of linear and cyclic peptides, and methods of generating and screening such libraries for biological, pharmaceutical and other uses.

Description

PEPTIDE LIBRARIES
SEQUENCE LISTING
The instant application contains a Sequence Listing which is in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on May 30, 2014, is named MS212PCTSL.txt and is 23,198 bytes in size.
Field of the Invention
The invention relates to novel libraries containing both linear and cyclic peptides, and methods of generating and screening such libraries for biological, pharmaceutical and other uses.
Background of the Invention
Peptide libraries have many uses. Such libraries can be used to identify therapeutically relevant molecules, or can serve other purposes, such as, the epitope mapping and characterization of therapeutically relevant molecules.
Therapeutically, peptides have certain advantages over small molecules and large molecule inhibitors, such as, antibodies. As compared to small molecules, peptides typically have a larger interface with an antigen, which interface comprises hydrogen bonds and van der Waals forces. This leads to high binding affinities, a high specificity for the antigen and typically a high potency. As compared to antibodies, peptides are much smaller and therefore typically penetrate tissue more easily. Certain tumors are inaccessible for antibody therapy.
Different peptides may be used in libraries. Linear peptides, for example, at times have certain disadvantages. They are highly flexible and do not typically adopt unique, reproducible conformations. The lack of fixed structure reduces the affinity a peptide might have for an antigen and makes determination of the active conformation of the peptide extremely difficult. In addition, linear peptides are more easily susceptible to proteases, therefore, may be degraded in the human body.
As a result, strategies have been described to introduce constraints into peptides. Constrained peptides have much higher reproducible conformations and are generally more resistant to proteases. Peptides which are presented in a constrained manner may be generated by various means. For example, cyclic peptides may be utilized, e.g. cyclic peptides which are formed by disulfide bonds.
A complementary method for peptide library based lead discovery is the display of libraries on filamentous bacteriophages. This method allows the preparation of libraries as large as 1010 unique peptide members, many orders of magnitude larger than libraries that may be prepared synthetically. In addition to large library sizes, advantages of phage display include ease of library construction, coupling of the binding entity (displayed peptide) to a unique identifier (its DNA sequence), a selection protocol for amplifying binding clones in a pool, and the high fidelity of biosynthesis (compared to synthetic methods). Furthermore, rapid and inexpensive selection protocols are available for identifying those library members that bind to a target of interest.
Additionally, libraries displaying constrained, for example, cyclic, peptides can be distinguished based upon whether they display large single loop peptides, e.g. having cyclic structures, or peptides having multiple smaller loops. Large single loop libraries have the advantage of presenting many different conformations. While multiple, smaller loop libraries offer more constrained peptides with reproducible conformations, and peptides having multiple binding sites may have higher binding affinity and specificity than single larger loop peptides.
Multiple research groups have provided libraries having either linear peptides or cyclic peptides, but not libraries having both. Most commonly the length of the randomized peptides ranges between 5 and 16 amino acids.
New England Biolabs provides the Ph.D™ Phage Display Libraries. Two of these libraries are libraries of randomized linear peptides having either 7 or 12 amino acids in length. Other libraries of this system comprise fixed length cyclic peptides, each having an N and C terminal cysteine residue. Cyclic structures are formed via the disulfide bonds between the cysteine residues.
Bicycle Therapeutics provides a phage display peptide library having randomized constrained peptides of 16 amino acids in length. The peptides comprise three fixed cysteine residues at positions 2, 9, 16. The constrained structure is formed via a bond between a cysteine residue and a chemical moiety, thus resulting in each member having two fixed length randomized loops (see WO2009098450).
Genentech describes phage display libraries of fixed length randomized cyclic peptides where each member has an N and C terminal cysteine which form a disulfide bond (see WO200077194). These libraries each have different fixed lengths, the different libraries ranging from 5-16 amino acids.
Additionally, epitope mapping technologies are known which utilize libraries of linear peptides. These peptides are tested with the binding molecule of interest thereby allowing the determination of linear epitopes (see Pepscan WO 84/03564 and WO 93/09872). The a target protein of interest is split into a set of overlapping linear oligopeptides, which are separately produced and immobilized by chemical synthesis on solid support systems. The use of such target-specific custom made peptide libraries is, however, a rather cost and labor/time intensive process which is limited to linear peptide sequences and, therefore, does not allow the identification of conformational epitopes.
Libraries of randomized cyclic peptides which are fixed to a solid support and which may be used for epitope mapping are also described (see Pepscan WO2002031510).
Sang Hoon Joo et al., High Throughput Sequence Determination of Cyclic Peptide Library Members by Partial Edman Degradation/Mass Spectrometry, J. Am. Chem. Soc. (2006) 128, 13000-13009 describes libraries of chemically synthesized peptides which displayed cyclic peptides 6n beads, where a percentage of the beads presented were linear due to cyclization failure, where the displayed cyclic peptides had only a single loop.
Bedard et al., A Convenient Approach to Prepare Topological^ Segregated Bilayer Beads for One-Bead Two-Compound Combinatorial Peptide Libraries, Int. J. Pept. Res. Ther. (2013) 19:13-23 13009 describes libraries of chemically synthesized peptides which displayed cyclic peptides on beads, where a percentage of the beads presented were linear due to cyclization failure, where the displayed cyclic peptides had only a single loop.
Fumiaki Uchiyama et al., Designing Scaffolds of Peptides for Phage Display Libraries, Journal of Bioscience and Bioengineering (2005) Vol. 99, No. 5, 448-456 describes libraries of cyclic peptides having one fixed loop length and only a single loop.
Perosa et al., Generation of biologically active linear and cyclic peptides has revealed a unique fine specificity of rituximab and its possible cross-reactivity with acid sphingomyelinase-like phosphodiesterase 3b precursor, Blood (2006) 107:3, 1070-1077 discloses a phage display peptide library expressing a 7-mer cyclic (c7c) library having one fixed loop length and only a single loop.
Many of the known libraries generate the randomization of amino acids using degenerate oligonucleotides (NNN and NNK technology). This, however, may result in the generation of unwanted stop codons, or undesired motifs, such as protease cleavage sites or restriction sites.
Multiple approaches exist, but it is clear, that there is a high need to advance the utility of peptide phage display libraries for the use in identifying and/or characterizing therapeutically relevant molecules and other purposes, such as epitopes mapping.
Summary of the Invention
Many of the aforementioned shortcomings are solved by the peptide libraries of the present disclosure. The peptide libraries of the present disclosure comprise linear and cyclic, constrained peptides, wherein in embodiments the ratio of cyclic to linear peptides can be specifically designed. Preferably, said peptide libraries are phage display libraries.
In an aspect, the peptides are translated from nucleic acids.
Such libraries can be used to identify therapeutically relevant and therapeutically active molecules, or can be used to characterize such molecules by means, such as, epitope mapping.
In many cases it is advantageous to present structurally constrained, for example, cyclic, peptides as well as linear peptides in one library.
In addition, there is utility in a library comprising constrained, for example, cyclic, peptides having a range of different loop lengths. Such libraries comprise a wide range of conformations.
In addition, there is utility in screening a library comprising constrained, for example, cyclic, peptides where some members have single loops and other members have multiple loops. Such libraries further comprise a wider range of conformations. With the currently available peptide libraries the simultaneous presentation of a) linear and cyclic peptides, b) constrained peptides having a range of different loop lengths, and c) constrained peptides having one or two or more loops is not feasible. As a result, screening experiments must be done by alternative or successive screenings using two or more different libraries, which is laborious.
The disclosed library, however, incorporates a) linear and cyclic peptides, b) constrained, for example, cyclic, peptides having a range of different loop lengths, and c) constrained, for example, cyclic, peptides having one or two or more loops. In a state of the art linear randomized peptide library, depending upon the length, of the peptides, a small percentage of molecules may have at least two cysteine molecules, forming a small percentage of constrained peptides. This is however a random process and the libraries have a different composition than those disclosed herein.
The libraries of the present disclosure have utility in both situations, i.e. in situations where either a linear or constrained peptide are sought in one screening.
In aspects, the library is designed to have a predictable proportion of both linear and constrained peptides. The present disclosure provides such a design as specific positions are selected to encode either a cysteine residue or other amino acid, wherein the ratio selected enables a higher proportion of constrained, for example, cyclic, peptides to be displayed and expressed as compared to the known randomized linear peptide libraries.
The presently described libraries allow for a very broad diversity, as compared to the state of the art. The state of the art linear peptides are known to often fail to maintain reproducible conformations. The state of the art cyclic peptides are formed by N and C terminal disulfide bonds, with fixed length cyclic regions, which also limits the diversity of conformations presented.
By controlling the ratio of cysteines at various positions, the present disclosure provides peptide libraries having a high diversity of conformations useful in many situations. The library presents both linear, and constrained, for example, cyclic peptides, where the peptides have multiple different loop lengths of disulfide bond formed (cyclic) loops ranging from, e.g., 3-17 amino acids in length, thus allowing for a large diversity of conformations being presented. In addition, such design produces peptides having 0, -1 , 2 or more cysteines,-allowing for the production of peptides having even more than one or two loops, thus further diversifying the conformations presented even more.
Accordingly, the libraries comprise (a) linear and cyclic peptides, (b) cyclic. peptides having different loop lengths, and (c) cyclic peptides having two or more loops. This allows for the presentation of both linear and cyclic peptides in one screening. As, the libraries comprise peptides having different loop lengths, a wide variety of conformations can be displayed in one screening. And, as the libraries comprise peptides having two or more loops, further conformations can be displayed in one screening. Such a diversified library is not yet known, and for the first time provides all of the above features in one library where one screening can be used to more quickly identify important molecules. In addition, the state of the art teaches small peptides ranging from 5-16 amino acids in length. The present disclosure in certain embodiments provides libraries of peptides of 15 amino acids or more in length.
An embodiment of the present disclosure provides a peptide library, wherein each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids, b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 5-50% cysteine and an equal mixture of the remaining natural occurring amino acids, and d) m and n are both, and independently from each other, 3-20.
In embodiments of the present disclosure, the natural occurring amino acids are selected from A, C, D, E, F, G, H, I, K, L, , N, P, Q, R, S, T, V, W, and Y.
An embodiment of the present disclosure provides a peptide library as shown in Figure 1.
In certain embodiments the peptide libraries of the present disclosure are displayed on bacteriophage. Phage display is known to have significant advantages in allowing the rapid selection of useful molecules. This method allows the preparation of libraries as large as 1010 unique peptide members, many orders of magnitude larger than libraries that may be prepared synthetically. Using such a robust platform allows for the display of large, diverse libraries.
Certain embodiments of the present disclosure provide the nucleic acids encoding the peptide libraries of the present diclosure. The present disclosure also provides vectors comprising the nucleic acids encoding the peptide library the present disclosure. In certain embodiments, the vector is a display vector. In other embodiments, the vector is an expression vector.
Embodiments of the present disclosure provide methods of identifying a peptide specific for an antigen, comprising contacting an antigen with the peptide library of the present invention, and selecting one or more peptides specific for said antigen.
Embodiments of the present disclosure also provide the peptides identified using the peptide libraries of the present invention. DESCRIPTION OF THE DRAWINGS
Figure 1 shows a design of a peptide library disclosed herein which expresses a peptide library according to the present disclosure. (SEQ ID No: 6.)
Figures 2A-C shows a quality assessment of the peptide library of Figure 1 . This figure shows the position and distribution of each amino acid, including the cysteines, which form the cyclic peptides disclosed herein. Therefore, this figure shows that the design of Figure 1 successfully produces a library with the desired positions and distributions of cysteine residues. Figure 2A shows the amino acid distribution of 99 individually sampled clones using Sanger sequencing. Figure 2B shows the expected amino acid distribution. Figure 2C shows the actual amino acid distribution as evaluated using Next Generation Sequencing.
Figure 3A-C shows how the peptide library of Figure 1 expresses clones having 0, 1 , 2 or more cysteine residues. Figure 3A shows the evaluation of 99 individually sampled clones using Sanger sequencing. On average, 2.27 cysteines were identified per clone. Of the 99 clones sampled, 33% were linear and 67% were cyclic. Figure 3B shows an evaluation using Next Generation Sequencing. Figure 3C shows the expected versus obtained cysteines per clone as evaluated using Next Generation Sequencing.
Figures 4A-B shows how the peptide library of Figure 1 expresses clones having at least two cysteines, thus forming cyclic structures, and the length distribution of ring sizes. Figure 4 A shows the evaluation of 99 individually sampled clones using Sanger sequencing. Figure 4B shows the ring sizes as evaluated using Next Generation Sequencing. The cyclic peptides comprised loops ranging in size from 3-17 amino acids in length.
Figures 5-6 show example peptides (Figure 5: SEQ ID NOS 7-1 1 , respectively, in order of appearance, Figure 6: SEQ ID NOS 12-16, respectively, in order of appearance) expressed from the library of Figure 1. These examples have at least two cysteines and some even four cysteines per peptide, which result in various sized loops and even multiple loops within one peptide.
Figure 7 shows a pill display vector for use in displaying the peptide libraries disclosed herein.
Figure 8 shows a pVIII display vector for use in displaying the peptide libraries disclosed herein.
Figure 9 shows an expression vector for use in expressing the peptides disclosed herein: Figure 10 shows a simplified view of the display and expression vectors for use in displaying the libraries disclosed herein.
Figure 11 shows the sequencing results of peptides (SEQ ID NOS 18-40, respectively, in order of appearance) identified in a screening with the peptide library of Figure 1 against streptavidin. This result confirms the utility of the library of Figure 1 as a tool for epitope mapping. The results confirm that the known epitope of streptavidin, HPQ, was to a high confidence level identified in both linear and cyclic peptides.
Figure 12 shows the sequencing results of peptides (SEQ ID NOS 41-64, respectively, in order of appearance) identified in a screening with the peptide library of Figure 1 against the anti-c- Myc antibody. This result confirms that a diverse number of specific peptides can be identified, wherein the peptides selected are both linear, constrained, and have a wide range of confirmations.
Figure 13 shows a pictorial representation of a portion of the Slonomics method. Figure 14 shows a pictorial representation of a portion of the Slonomics method. DEFINITIONS
"Library" means an entity comprising more than one member. In the context of the present disclosure this term refers to a library of peptides, wherein said library comprises at least two different peptides.
"Synthetic" means not physically derived from naturally occurring DNA.
"Peptide" means a molecule having less than or equal to 50 amino acids.
Peptides "translated from nucleic acids" means peptides that are created using biological processes where the starting material is a nucleic acid, either DNA or RNA and the resulting material are amino acids. The biological process may include intermediary steps, such as transcription from DNA to RNA, and/or translation from RNA to amino acid.
Such libraries displaying peptides translated from nucleic acids could be bacteriophage or ribosomal display libraries.
"Linear" as used in the present disclosure refers to a stretch of amino acids or a peptide that does not include any circular structure.
"Cyclic" or "circular" or "loop" as used in the present disclosure refers to a stretch of amino acids or a peptide which includes a circular structure. Not the entire stretch of amino acids or peptide needs to be circular. Cyclic peptides may be formed by covalent or by non-covalent bonds.
A typical covalent bond that is utilized within the present disclosure to form cyclic peptides is a disulfide bond, which is formed between two cysteine residues of the peptide. Other covalent bonds that are used within the present disclosure are thioether bonds, such as the thioether bonds which are formed and/or which are present in lanthionines. In vivo lanthionines are formed enzymatically via the dehydration of serine or threonine to yield dehydroalanine and dehydrobutyrine, respectively. These products then react with cysteine thiol to from lanthionine and methyllanthionine, respectively. Chemical synthesis is possible as well.
Other covalent bonds include lysinoalanine linkage between a dehydrated serine to yield dehydroalanine which alkylates a lysine in the same polypeptide.
Non-covalent bonds that are used within the present disclosure to from cyclic peptides are typically formed via protein domains, such as zinc-finger domains, a jun-fos interaction or a leucine zipper. Other non-covalent bonds may be used as well, such as hydrogen bonds, dipolar bonds or van der Waals forces.
"Constrained" as used in the present disclosure refers to a peptide in which the three- dimensional structure is maintained substantially in one spatial arrangement over time. The cyclic peptides within the present disclosure have a constrained conformation.
Methods of determining whether peptides are constrained are known in the art. For cyclic peptides this can in certain cases be deduced from the analysis of the primary amino acid sequence, for example, by the identification of cysteines. Another way is the addition of a protease to the displayed peptide library. Conformationally constrained peptides are usually not cut by the protease. Reduction in size after cleavage can be detected using mass spectrometry. Finally, mass spectrometry can also be used to analyze the library as such. Many cyclic peptides, especially those that are formed by dehydration, will have a lower mass than the corresponding linear peptides.
"Member" is one molecule forming part of a library. In the context of the present disclosure this term refers to one peptide which is part of the peptide library.
"Equal mixture" means that each codon encoding an amino acid has the same probability of occurring as any other codon encoding a different amino acid. As an example, if X1 represents an equal mixture of the naturally occurring amino acids, then each of the 20 naturally occurring amino acids has the same probability of occurring at that position, i.e. 5%. "Natural occurring amino acids" means the following amino acids:
Figure imgf000011_0001
The term "vector" refers to a polynucleotide molecule capable of transporting another polynucleotide to which it has been linked. Preferred vectors are those capable of autonomous replication and/or expression of nucleic acids to which they are linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments may be ligated. Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and mammalian vectors). Other vectors can be integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Vectors may be compatible with prokaryotic or eukaryotic cells. Prokaryotic vectors typically include a prokaryotic replicon which may include a prokaryotic promoter capable of directing the expression (transcription and translation) of the peptide in a bacterial host cell, such as Escherichia coli transformed therewith. A promoter is an expression control element formed by a DNA sequence that permits binding of RNA polymerase and transcription to occur. Promoter sequences compatible with bacterial hosts are typically provided in plasmid vectors containing convenience restriction sites for insertion of a DNA segment. Examples of such vector plasmids include pUC8, pUC9, pBR322, and pBR329, pPL and pKK223, available commercially.
"Expression vectors" are those vectors capable of directing the expression of nucleic acids to which they are operatively linked and is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- associated viruses), which serve equivalent functions.
"Display vector" includes a DNA sequence having the ability to direct replication and maintenance of the recombinant DNA molecule extra chromosomally in a host cell, such as a bacterial host cell, transformed therewith. Such DNA sequences are well known in the art. Display vectors can for example be phage vectors or phagemid vectors originating from the class of fd, M13, or fl filamentous bacteriophage. Such vectors are capable of facilitating the display of a protein including, for example, a binding protein or a fragment thereof, on the surface of a filamentous bacteriophage. Display vectors suitable for display on phage, ribosomes, DNA, bacterial cells or eukaryotic cells, for example yeast or mammalian cells are also known in the art, for example, as are viral vectors or vectors encoding chimeric proteins.
The term "recombinant host cell" (or simply "host cell") refers to a cell into which a recombinant expression vector has been introduced. It should be understood that such terms are intended to refer not only to the particular subject cell but to the progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term "host cell" as used herein. Typical host cells are prokaryotic (such as bacterial, including but not limited to E. coli) or eukaryotic (which includes yeast, mammalian cells, and more). Bacterial cells are preferred prokaryotic host cells and typically are a strain of Escherichia coli (E. coli) such as, for example, the E. coli strain DH5 available from Bethesda Research Laboratories, Inc., Bethesda, Md. Preferred eukaryotic host cells include yeast and mammalian cells including murine and rodents, preferably vertebrate cells such as those from a mouse, rat, monkey or human cell line, for example HKB1 1 cells, PERC.6 cells, or CHO cells.
The introduction of vectors into host cells may be accomplished by a number of transformation or transfection methods known to those skilled in the art, including calcium phosphate precipitation, electroporation, microinjection, liposome fusion, RBC ghost fusion, protoplast fusion, viral infection and the like. The production of monoclonal full-length antibodies, Fab fragments, Fv fragments and scFv fragments is well known.
Transformation of appropriate cell hosts with a recombinant DNA molecule is accomplished by methods that typically depend on the type of vector and cells used. With regard to transformation of prokaryotic host cells, see, for example, Cohen et al., Proceedings National Academy of Science, USA, Vol. 69, P. 21 10 (1972); and Maniatis et al., Molecular Cloning, a Laboratory Manual, Cold spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1982). With regard to the transformation of vertebrate cells with retroviral vectors containing rDNAs, see for example, Sorge et al., Mol. Cell. Biol., 4:1730-1737 (1984); Graham et al., Virol., 52:456 (1973); and Wigler et al., Proceedings National Academy of Sciences, USA, Vol. 76, P. 1373- 1376 (1979).
The term "epitope" refers to an antigenic determinant, i.e. the part of an antigen that is recognized by a binding molecule, such as an antibody or a peptide. "Phage display" is a technique by which variant polypeptides are displayed as fusion proteins to a coat protein on the surface of phage, e g filamentous phage particles. A utility of phage display lies in the fact that large libraries of randomized protein variants can be rapidly and efficiently sorted for those sequences that bind to a target molecule with high affinity display of peptides and proteins libraries on phage has been used for screening millions of polypeptides for ones with specific binding properties. Polyvalent phage display methods have been used for displaying small random peptides and small proteins through fusions to either gene III or gene VIII of filamentous phage Wells and Lowman ( 1992) Curr Opin Struct Biol B 355-362 and references cited therein. In monovalent phage display, a protein or peptide library is fused to a gene III or a portion thereof and expressed at low levels in the presence of wild type gene III protein so that phage particles display one copy or none of the fusion proteins. Avidity effects are reduced relative to polyvalent phage so that sorting is on the basis of intrinsic ligand affinity, and phagemid vectors are used, which simplify DNA manipulations Lowman and Wells (1991 ) Methods A companion to Methods in Enzymology 3205-216. In phage display, the phenotype of the phage particle, including the displayed polypeptide corresponds to the genotype inside the phage particle, the DNA enclosed by the phage coat proteins.
Phage display describes a selection technique in which a library of peptide or protein variants is expressed on the outside of a phage virion, while the genetic material encoding each variant resides on the inside. This creates a physical linkage between each variant protein sequence and the DNA encoding it, which allows rapid partitioning based on binding affinity to a given target molecule (antibodies, enzymes, cell-surface receptors, etc.) by an in vitro selection process called panning. In its simplest form, panning is carried out by incubating a library of phage-displayed peptides on a plate (or bead) coated with the target, washing away the unbound phage, and eluting the specifically bound phage. The eluted phage are then amplified and taken through additional binding/amplification cycles to enrich the pool in favor of binding sequences. After a few rounds, individual clones are characterized by DNA sequencing and ELISA.
A "phagemid" is a plasmid vector having a bacterial origin of replication, e g , ColE 1 , and a copy of an intergenic region of a bacteriophage. The phagemid may be based on any known bacteriophage including filamentous bacteriophage. The plasmid will also generally contain a selectable marker for antibiotic resistance. Segments of DNA cloned into these vectors can be propagated as plasmids. When cells harboring these vectors are provided with all genes necessary for the production of phage particles, the mode of replication of the plasmid changes to rolling circle replication to generate copies of one strand of the plasmid DNA and package phage particles. The phagemid may form infectious or non-infectious phage particles This term includes phagemids which contain a phage coat protein gene or fragment thereof linked to a heterologous polypeptide gene as a gene fusion such that the heterologous polypeptide is displayed on the surface of the phage particle Sambrook et. al. 417.
The term "phage vector" means a double stranded replicative form of a bacteriophage containing a heterologous gene and capable of replication. The phage vector has a phage origin of replication allowing phage replication and phage particle formation. The phage is preferably a filamentous bacteriophage, such as, an M I3 fl . fd, Pf3 phage or a derivative thereof, a lambdoid phage, such as lambda, 21 , phi80, phi81 . 82, 424. 434, etc , or a derivative thereof, a Baculovirus or a derivative thereof, a T4 phage or a derivative thereof , a T7 phage virus or a derivative thereof. Preparation of DNA from cells means isolating the plasmid DNA from a culture of the host cells. Commonly used methods for DNA preparation are the large- and small-scale plasmid preparations described in sections 125- 133 of in Sambrook et al. After preparation of the DNA it can be purified by methods well known in the art such as that described in section 140 of Sambrook et. al.
The term "coat protein" means a protein, at least a portion of which is present on the surface of the virus particle. From a functional perspective, a coat protein is any protein which associates with a virus particle during the viral assembly process in a host cell, and remains associated with the assembled virus until it infects another host cell. The coat protein may be the major coat protein or may be a minor coat protein. A "major" coat protein is a coat protein which is present in the viral coat at 10 copies of the protein or more, e.g. major coat protein p V III. A major coat protein may be present in tens, hundreds or even thousands of copies per virion. A minor coat protein is present in the viral coat at less than 10 copies per phage, e.g. minor coat protein pill.
A "fusion protein" is a polypeptide having two portions covalently linked together, where each of the portions is a polypeptide having a different property The property may be a biological property, such as activity in vitro or in vivo The property may also be a simple chemical or physical property, such as binding to a target molecule, catalysis of a reaction, etc. The two portions may be linked directly by a single peptide bond or through a peptide linker containing one or more ammo acid residues Generally, the two portions and the linker will be in reading frame with each other
"Polymerase chain reaction" or "PCR" refers to a procedure or technique in which minute amounts of a specific piece of nucleic acid. RNA and/or DNA. are amplified as described in U S Patent No 4 683, 195 issued 28 July 1987 Generally, sequence information from the ends of the region of interest or beyond needs to be available, such that ohgonucleotide primers can be designed, these primers will be identical or similar in sequence to opposite strands of the template to be amplified The 5' terminal nucleotides of the two primers may coincide with the ends of the amplified material PCR can be used to amplify specific RNA sequences, specific DNA sequences from total genomic DNA, and cDNA transcribed from total cellular RNA, bactenophage or plasmid sequences, etc See generally Mullis et al ( 1987) Cold Spring Harbor S\mp Quant Biol 51 263 , Erlich. ed . PCR Technology (Stockton Press, NY, 1989) As used herein, PCR is considered to be one, but not the only, example of a nucleic acid polymerase reaction method for amplifying a nucleic acid test sample comprising the use of a known nucleic acid as a primer and a nucleic acid polymerase to amplify or generate a specific piece of nucleic acid Detailed Description of the Invention
The present disclosure is directed to peptide libraries comprising constrained peptides and linear peptides. The present disclosure is directed to peptide libraries comprising cyclic peptides and linear peptides. Such libraries are useful for numerous purposes; including epitope mapping and the identification of peptides with pharmaceutical properties, such as anti-microbial or anti-viral peptides, material-specific peptides, small molecule binders, novel enzyme substrates and other peptides useful for drug lead discovery.
In order for a library to have utility in both situations where either a linear or constrained, for example, cyclic, peptide is sought, a library may be designed to have a predictable proportion of both linear and constrained, for example, cyclic, peptides presented.
In certain embodiments, the peptide libraries comprise linear and cyclic peptides, wherein the cyclic peptides are formed by one or more covalent or one or more non-covalent bonds.
In certain embodiments, the covalent bond is a disulfide bond or a bond between two non- naturally occurring amino acids, such as, a thioether bond. In certain embodiments, the thioether bond is a lanthiopeptide, such as a lanthiopeptide formed between a dehydrated serine and a cysteine or a dehydrated threonine and a cysteine.
In certain embodiments, the covalent bond is a disulfide bond. In certain embodiments, the disulfide bond is formed by two cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 cysteine residues. Figure 3A shows the sequencing results of a peptide library of the present disclosure which comprises peptides having 0, 1 , 2, 3, 4, 5, 6 or 7 cysteines per peptide. . In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, and 3 cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3 and 4 cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 and 5 cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 and 6 cysteine residues. In certain embodiments, the covalent bond is a lysinoalanine linkage formed between a dehydrated serine and a lysine.
In certain embodiments, the non-covalent bond is a formed via a protein domain, such as, a zinc-finger domain, a jun-fos interaction or a leucine zipper.
In an aspect, the disclosed library incorporates a) linear and cyclic peptides, b) constrained and non-constrained peptides, c) cyclic peptides having a range of different loop lengths, and d) cyclic peptides having one or two or more loops.
In an aspect, a library of peptides comprises (a) linear and cyclic peptides, (b) cyclic peptides having different loop lengths, and (c) cyclic peptides having two or more loops. In
embodiments, the library comprises synthetic peptides. In embodiments, the library comprises peptides translated from nucleic acids.
In embodiments, the library comprises cyclic peptides with loop lengths ranging from 3-17 amino acids in length. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, and 17 amino acids. Figures 4A and 4B shows the sequencing results of a peptide library of the present disclosure which comprises peptides having comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, or 4, or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, and 4 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4 and 5 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5 and 6 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6 and 7 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7 and 8 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8 and 9 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9 and 10 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10 and 11 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 and 12 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12 and 13 amino acids. In
embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13 and 14 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8; 9, 10, 1 1 , 12, 13, 14, 15 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14 and 15 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7,8, 9, 10, 11.Ί 2, 13, 14, 15, 16 or more amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15 and 16 amino acids.
In an aspect, the library comprises cyclic peptides having two or more loops. In
embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, or more loops. In embodiments, the library comprises peptides comprising 0, 1 , and 2 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2 and 3 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3 and 4 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4 and 5 loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5 and 6 loops. The present disclosure provides such a design as specific positions are selected to encode either a cysteine residue or another amino acid, wherein the ratio selected enables a higher proportion of cyclic peptides as compared to linear peptides to be displayed and expressed as compared to randomized linear peptide libraries.
A linear peptide library of 20 codons randomized using NNN or NNK technology has a probability of 12.8% that each member contains two or more cysteine residues.
An embodiment of the present disclosure therefore provides a library comprising linear and constrained, for example, cyclic, peptides, wherein the proportion of members comprising constrained peptides is 13% or more, 14% or more, 15% or more, 16% or more, 17% or more, 18% or more, 19% or more, 20% or more, 21% or more, 22% or more, 23% or more, 24% or more, or 25% or more.
In another embodiment the present disclosure provides a library comprising linear and cyclic peptides, wherein the proportion of members comprising cyclic peptides is 13% or more, 14% or more, 15% or more, 16% or more, 17% or more, 18% or more, 19% or more, 20% or more, 21 % or more, 22% or more, 23% or more, 24% or more, or 25% or more.
By controlling the ratio of cysteines at various positions, the present disclosure provides peptide libraries that have a high diversity of conformations useful in many situations, as the library presents both linear, and cyclic peptides, where the cyclic peptides have multiple different lengths of disulfide bond formed loops ranging from, e.g., 3-17 amino acids in length, thus allowing for a large diversity of conformations being presented in the library. In addition, such design produces peptides having 0, 1 , 2 or more cysteine residues, allowing for the production of peptides having even more than one or even more than two loops, thus diversifying the conformations presented even more.
In certain embodiments the present disclosure provides a library of synthetic peptides, wherein each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein
a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids,
b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 5-50% cysteine and an equal mixture of the remaining natural occurring amino acids, and
d) m and n are both, and independently from each other, 3-20. In certain embodiments m is 3 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 4 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17,
18, 19, or 20. In certain embodiments m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15,
16, 17, 18, 19, or 20. In certain embodiments m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 7 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 8 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 9 and n is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 10 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 1 1 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 12 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 13 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 14 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18,
19, or 20. In certain embodiments m is 15 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16,
17, 18, 19, or 20. In certain embodiments m is 16 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 17 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 18 and n is 3, 4, 5, 6, 7, 8,
9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 19 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 20 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
In certain embodiments n is 3 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 4 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17,
18, 19, or 20. In certain embodiments n is 5 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 6 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 7 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 8 and m is 3, 4, 5, 6, 7, 8, 9,
10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 9 and m is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 10 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 1 1 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 12 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 13 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 14 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18,
19, or 20. In certain embodiments n is 15 and m is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 16 and m is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 17 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodimerits n is 18 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 19 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments n is 20 and m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
In certain embodiments, (Amixj comprises 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, or 50% cysteine, and an equal mixture of the remaining natural occurring amino acids.
In certain embodiments the library comprises cyclic peptides having disulfide formed loops of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20 amino acids in length.
In certain embodiments the library comprises synthetic peptides that are 9 or more amino acids in length, 10 or more amino acids in length, 11 or more amino acids in length, 12 or more amino acids in length, 13 or more amino acids in length, 14 or more amino acids in length, 15 or more amino acids in length, 16 or more amino acids in length, 17 or more amino acids in length, 18 or more amino acids in length, 19 or more amino acids in length, 20 or more amino acids in length, 21 or more amino acids in length, 22 or more amino acids in length, 23 or more amino acids in length, 24 or more amino acids in length, 25 or more amino acids in length, 26 or more amino acids in length, 27 or more amino acids in length, 28 or more amino acids in length, 29 or more amino acids in length, 30 or more amino acids in length, 31 or more amino acids in length, 32 or more amino acids in length, 33 or more amino acids in length, 34 or more amino acids in length, 35 or more amino acids in length, 36 or more amino acids in length , 37 or more amino acids in length, 38 or more amino acids in length, 39 or more amino acids in length, 40 or more amino acids in length, 41 or more amino acids in length or 42 or more amino acids in length.
In certain embodiments the present disclosure provides a library of synthetic peptides, wherein each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein
a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids,
b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 10-20% cysteine and an equal mixture of the remaining natural occurring amino acids,
d) m is 5 or 6, and
e) n is 3-20. In certain embodiments m is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
In certain embodiments n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
In certain embodiments, (Amix) comprises a mixture of 10-20% cysteine and an equal mixture of the remaining natural occurring amino acids. In a preferred embodiment, (Amix) comprises a mixture of 15% cysteine and an equal mixture of the remaining natural occurring amino acids.
In certain embodiments m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
In certain embodiments, the natural occurring amino acids are selected from A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y.
In certain embodiments the present disclosure provides a library of synthetic peptides, wherein each member of the library comprises an amino acid sequence (X) I Cmix - (X) m - (Amix) n, wherein
a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids,
b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 10-20% cysteine and an equal mixture of the remaining natural occurring amino acids,
d) m is 5 or 6,
e) n is 3-20, and
f) I is 1 -3.
In certain embodiments, I is 1 , 2 or 3.
In certain embodiments I is 1 and m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 1 and m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
In certain embodiments I is 2 and m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 2 and m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 3 and m is 5 and n is 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments I is 3 and m is 6 and n is 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20.
An embodiment of the present disclosure provides a peptide library having a design as shown in Figure 1 .
In certain embodiments the present disclosure provides a library of synthetic
peptideswherein each member of the library comprises an amino acid sequence (X) l-Cmix- (X) m-(Amix)n, wherein
a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids,
b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, and
c) Amix are each a mixture of 15% cysteine and an equal mixture of the remaining
natural occurring amino acids
d) I is 3
e) m is 6, and
f) n is 10.
In certain embodiments the peptide library is displayed on bacteriophage. Phage display is known to have significant advantages in allowing the rapid selection of useful molecules. This method allows the preparation of libraries as large as 1010 unique peptide members, many orders of magnitude larger than libraries that may be prepared synthetically. Using such a robust platform allows for the display of large, diverse libraries.
In one embodiment, the library of the instant invention contains at least about 107 member peptides, each of which has at least one amino acid variation from others. Alternatively, the library contains at least about 108 peptides, or at least about 109 peptides.
In certain embodiments the library comprises constrained and/or cyclic peptides which are formed by disulfide bonds between two or more cysteines, zinc-finger domains, a jun-fos interaction, a leucine zipper, thioether bonds, such as the thioether bonds which are formed and/or which are present in lanthionines, or lysinoalanine linkages.
In certain embodiments, the peptide library comprises constrained, for example, cyclic, peptides which are formed by disulfide bonds between two or more cysteine residues. In certain embodiments, the peptide library comprises cyclic peptides which are formed by disulfide bonds between two or more cysteine residues.
In certain embodiments, the library comprises cyclic or constrained peptides which are formed by disulfide bonds between two or more cysteine residues, wherein the two or more cysteine residues are not located at the N or the C-terminus of the peptide, or are located at either the N or the C-terminus of the peptide, but seldom both.
In certain embodiments the present invention provides peptide libraries comprising peptides greater than 16 amino acids in length. In other embodiments the present invention provides libraries comprising peptides having 20 amino acids in length.
In certain embodiments the present invention provides peptide libraries wherein a portion of the peptides comprise 0, 1 , or 2 or more cysteine residues.
In certain embodiments the present invention provides a library of nucleic acids encoding the libraries of peptides of the present disclosure.
In certain embodiments the present invention provides a vector comprising the nucleic acids encoding the libraries of peptides of the present disclosure.
In certain embodiments, the vector is a display vector. In other embodiments, the vector is an expression vector.
In certain embodiments the present disclosure provides a method of identifying a peptide specific for an antigen, comprising a) contacting an antigen with a library of peptides disclosed herein, and
b) selecting one or more peptides specific for said antigen.
In certain embodiments the present disclosure provides a peptide identified using the library of peptides disclosed herein.
Phage display methods
Phage display methods for proteins, peptides and mutated variants thereof, including constructing a family of variant replicable vectors containing a transcription regulatory element operably linked to a gene fusion encoding a fusion polypeptide, transforming suitable host cells, culturing the transformed cells to form phage particles which display the fusion polypeptide on the surface of the phage particle, contacting the recombinant phage particles with a target molecule so that at least a portion of the particle bind to the target, separating the particles which bind from those that do not bind, are known and may be used with the libraries disclosed herein.
In certain embodiments the peptides are fused to at least a portion of a phage coat protein to form a fusion protein containing the peptide disclosed herein. The fusion protein can be made by expressing a gene fusion encoding the fusion protein using known techniques of phage display. The fusion protein may form part of a phage or phagemid particle in which one or more copies of the peptide are displayed on the surface of the particle. An embodiment includes a nucleic acid encoding the peptide or the fusion proteins described herein.
In certain embodiments the present disclosure provides vectors comprising the fusion genes noted above, as well as a library of these vectors. The library of vectors may be in the form of a DNA library, a library of virus (phage or phagemid) particles containing the library of fusion genes or in the form of a library of host cells containing a library of the expression vectors or virus particles.
In certain embodiments the present disclosure provides a method comprising the steps of preparing a library containing a plurality of vectors, each vector comprising a transcription regulatory element operably linked to a gene fusion encoding a fusion protein, wherein the gene fusion comprises a first gene encoding a peptide disclosed herein and a second gene encoding at least a portion of a phage coat protein, wherein the library comprises a plurality of genes encoding peptide fusion proteins.
The gene encoding the coat protein of the phage and the gene encoding the desired polypeptide portion of the fusion protein of the invention (the peptide of the invention fused to at least a portion of a phage coat protein) can be obtained by methods known in the art (see generally, Sambrook et al) The DNA encoding the gene may be chemically synthesized (Merrfield ( 1963) 7 Am Chem Soc 85:2149).
The phage coat protein is preferably the gene III or gene VIII coat protein of a filamentous phage, such as, M13.
Suitable gene III vectors for display of peptides include fUSE5 (Scott. J K . and Smith G P (1990) Science 249 386-390), fAFFI (Cwirla et al ( 1990) Proc Natl Acad Set U S A 87 6378- 6382), fd-CATI (McCafferty et al ( 1990) Nature (London) 348 552-554), m663 (Fowlkes et al ( 1992) Biotechniques 13 422-427), tdtetDOG. pHEN I (Hoogenboom et al ( 1991 ) Nucleic Acids Res 19 4133-4137) pComb3 (Gram et al ( 1992) Proc Natl Acad Sc i U S A 89 3576- 3580), pCANTAB 5E (Pharmacia), and LamdaSurt ap (Hogrefe ( 1993)Gene 137 85-91 ) Suitable phage and phagemid vectors for use in this invention include all known vectors for phage display Additional examples include pCombo (Gram et al. ( 1992) Proc. Natl. Acad. Sci. USA 89:3576-3580), pC89 (Fehci et al. ( 1991 ) 7. Mol. Biol. 222:310-310); plF4 (Bianchi et al. ( 1995) 7. Mol. Biol. 247: 154- 160); PM48. PM52. and PM54 (lannolo. ( 1995) 7. Mol. Biol 248:835-844); fdH (Greenwood et al. ( 1991 ) 7. Mol. Biol. 220:821 -827); pfd5SHU. pfd8SU, pfd8SY, and fdlSPLAY8 (Malik & Perham ( 1996) Gene 171 :49-51 ); "88" (Smith ( 1993) Gene 128: 1 -2); f88.4 (Zhong et al. ( 1994) 7. Biol. Chem, 269:24183-24188); p8V5 (Affymax); MB 1 , MB20, MB26, MB27. MB28, MB42. MB48. MB49. MB56: (Markland et al. ( 1991 ) Gene 109: 13- 19). Similarly, any known helper phage may be used when a phagemid vector is employed in the phage display system. Examples of suitable helper phage include M 13-K07 (Pharmacia), M 13-VCS (Stratagene), and R408 (Stratagene).
Any suitable cells which can be transformed by electroporation may be used as host cells in the method of the present invention. Suitable host cells which can be transformed include gram negative bacterial cells such as E. coli. Suitable E. coli strains include TG1 F+, TG1 F-, JM 101 , E. coli K 12 strain 294 (ATCC number 3 1.446), E. coll strain W31 10 (ATCC number 27.325), E. coli X1776 (ATCC number 31 ,537), E. coli XL- 1 Blue (Stratagene). and E. coli B; however many other strains of E. coli, such as XL I -Blue MRF' , SURE. ABLE C. ABLE K. WM 1 100, MC 1061 , HB 101 , CJ 136. MV 1 190. JS4, JS5, NM522. NM538, and NM539 may be used as well. Cells are made competent using known procedures. Sambrook et al, 1.76- 1 .81 ,16.30.
In certain embodiments the host cell for electroporation is a competent E. coli strain containing a phage F episome. Any P episome which enables phage replication in the strain may be used in the invention.
After selection of the transformed cells, these cells are grown in culture and the vector DNA may then be isolated. Phage or phagemid vector DNA can be isolated using methods known in the art, for example, as described in Sambrook et al. The isolated DNA can be purified by methods known in the art such as that described in section 140 of Sambrook et al. This purified DNA can then be analyzed by DNA sequencing DNA sequencing may be performed by the method of Messing et al ( 1981 ) Nucleic Acids Res 9 309. The method of Maxam et al (1980) Meth Enzymol 65 499 , or by any other known method. Method of generating diversified libraries
Methods of generating diversified gene libraries, such as the Slonomics technology, are disclosed in US 12/414,174, which is incorporated by reference in its entirety, and J. Van den Brulle, M. Fischer, T. Langmann, G. Horn, T. Waldmann, S. Arnold, M. Fuhrmann, O. Schatz, T. O'Connell, D. O'Connell et al. (2008), A novel solid phase technology for high-throughput gene synthesis, Biotechniques, 45, pp. 340-343, which is incorporated by reference in its entirety.
The Slonomics method uses a defined number of standardized building blocks is chemically synthesized as single-stranded oligonucleotides containing self-complementary regions. Intra- strand base pairing of these regions leads to the formation of a stable hairpin-like secondary structure comprising a short loop of four nucleotides, a double-stranded stem region assuring the stability of the molecule, and a three-nucleotide single-stranded overhang. Two different classes of building blocks are defined as "splinkers" and "anchors." All splinker molecules share the same scaffold structure and differ only in their variable three-base single-stranded overhangs. In contrast, the anchor oligonucleotides differ in the overhang and also in the directly adjacent base triplet.
A portion of the Slonomics method is shown in Figure 13.
In order of their appearance, the above sequences are SEQ ID Nos: 1-5.
In order to create a library representing all possible permutations, 64 (43) different splinkers and 4096 (46) different anchor oligonucleotides are required. Each anchor molecule harbors an additional biotin modification in the loop region, allowing the oligonucleotide to be coupled to a streptavid in -coated surface with high affinity. The two types of oligonucleotides are further characterized by the presence of different recognition sites for type IIS restriction enzymes within their stem regions. The anchor oligonucleotide contains a recognition site for Earn 1 104I (CTCTTC[1/4], generating a three-base overhang) and the corresponding splinker molecule harbors a recognition site for Esp3l (CGTCTC[1/5], generating a four-base overhang).
To construct a large double-stranded DNA fragment from these molecular building blocks, the sequence is first assembled as smaller sub-fragments of 18 bp. These so-called "elongation blocks" can be synthesized in parallel reactions. In the first step, one anchor and one splinker molecule are ligated via hybridization (Watson-Crick base pairing) of complementary single- stranded overhangs. Generally, this step is performed in solution, since enzymatic reactions in solution occur at much faster rates than those on solid supports, where diffusion pathways are much longer. Following ligation, the resulting product is immobilized on a streptavidin- coated 96-well plate via the biotin modification of the anchor molecule. No reacted material is removed in a washing step. The remaining surface-bound ligation products are subsequently cleaved by Eam1104I, which is specific for the anchor that donates the base triplet block. The cleavage of the ligation product by this restriction enzyme releases an elongated, highly pure "intermediate product" that has a new three-base single-stranded overhang and serves as an acceptor for the next anchor molecule. Thus, this reaction cycle results in the incorporation of three new bases to the growing chain and a shortened anchor that remains bound to the surface. This reaction cycle is repeated five times to produce an 18 bp DNA fragment. For optimal reaction performance, the anchor molecules and the intermediate products should be present in equimolar concentrations. If one of the ligation partners is in excess, resulting in a mixture of correct, immobilized higher level ligation products and unreacted precursors, unligated intermediate products can be removed efficiently by washing, while uhligated anchors remain bound to the surface after cleavage.
The complete synthesis process comprises two distinct phases. During the initial "elongation" phase, short sub-sequences of the target molecule are produced as described above, resulting in individual elongation blocks with 18 independently definable base; pairs. Since . many of these reactions can be performed in parallel, the entire target sequence is already constructed during the initial elongation process, albeit as a series of short sub-fragments. In the second reaction phase, the socalled "transposition," the pre-assembled elongation blocks are connected in a pair-wise fashion after each block has been cleaved with the appropriate type IIS restriction enzyme. Restriction with Eam1 1041 results in the release of the elongation block from the surface and thereby generates a three-base overhang. Cleavage with Esp3l removes the splinker-component of the molecule, leading to a four-base overhang. The resulting molecules can be assembled in a highly selective manner due to the different length and specific sequences of their overhangs.
A further portion of the Slonomics method is shown in Figure 14.
Additionally, depending on the restriction by Eam11041, the ligation reactions can be performed at the solid surface or in solution, which allows for the focus to be on either product purity or yield. Since the resulting molecules still harbor the constant anchor and splinker regions at their terminal ends, this reaction cycle (including washing steps) can be repeated several times, each round resulting in DNA molecules that have doubled in length with respect to those of the previous round. At different transposition stages, the resulting constructs are "harvested" from the solid surface by cleavage and transferred from the automated production platform to a second standardized system for the final assembly and quality control. Each transposition can be developed robustly up to the T5 level (5 transposition rounds), corresponding to a fragment length of 462 bp. If necessary, these "T5- building blocks" can be further assembled by standard recombinant DNA technology.
The Slonomics® technology is also highly efficient and cost-effective. In contrast to classical strategies, where each oligonucleotide is individually designed and used for a single synthesis reaction, our building blocks are used for multiple reactions over the course of several synthesis projects. In addition, all steps of the process can be done in parallel, which allows for the simultaneous production of several gene constructs and enables the transfer of every working step to a robotic platform. The complete synthesis is performed in multi-well plates, and hardware components with demonstrated suitability for robust production processes have been combined in an entirely computer-controlled system. This permits the fully-automated synthesis of any 462 bp DNA fragment, from design to end product, within a time frame of 44 hours.
Oligonucleotide-mediated mutagenesis is another method for preparing diversified gene libraries. This technique is well known in the art as described by Zoller et al (1987) Nucleic Acids. Res. 10 6487-6504.
Cassette mutagenesis is also a method for preparing the diversified gene libraries. The method is based on that described by Wells et al. (1985) Gene 34:315.
The following examples are provided by way of illustration and not by way of limitation. All disclosures of the references cited herein are expressly incorporated herein by reference in their entirety.
Embodiments
An aspect comprises a library of synthetic peptides comprising linear and cyclic peptides, wherein the proportion of cyclic peptides within said library is greater than 13%.
In an aspect, a library of peptides comprises (a) linear and cyclic peptides, (b) cyclic peptides having different loop lengths, and (c) cyclic peptides having two or more loops.
In an aspect, a library of peptides, comprises linear and cyclic peptides,
wherein the cyclic peptides have different loop lengths, and
wherein the cyclic peptides have two or more loops.
In embodiments, the library comprises synthetic peptides. In embodiments, the library comprises peptides translated from nucleic acids.
In embodiments, the library comprises peptides having a controlled ratio of cysteines at certain positions.
In an aspect, a library of peptides consists of (a) linear and cyclic peptides, (b) cyclic peptides having different loop lengths, and (c) cyclic peptides having two or more loops.
In embodiments, the library comprises cyclic peptides with loop lengths ranging from 3-17 amino acids in length. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids. In embodiments, the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, and 17 amino acids.
In embodiments, the library consists of cyclic peptides with loop lengths ranging from 3-17 amino acids in length. In embodiments, the library consists of cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids. In
embodiments, the library consists of cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, and 17 amino acids.
In an aspect, the library comprises cyclic peptides having two or more loops. In
embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more loops. In embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 loops.
In an aspect, the library consists of cyclic peptides having two or more loops. In
embodiments, the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more loops. In embodiments, the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 loops.
In embodiments, the synthetic peptides of the library are displayed on bacteriophage. In embodiments, the synthetic peptides of the library are displayed on ribosomes.
In embodiments, the library comprises at least 1 x 107 members. In embodiments, the library consists of at least 1 x 107 members. In embodiments of the library, the cyclic peptides are formed by one or more covalent or one or more non-covalent bonds.
In embodiments of the library, the covalent bond is a disulfide bond or a bond between two non-naturally occurring amino acids, such as, a thioether bond.
In embodiments of the library, the thioether bond is a lanthiopeptide, such as a lanthiopeptide formed between a dehydrated serine and a cysteine or a dehydrated threonine and a cysteine.
In embodiments of the library, the covalent bond is a lysinoalanine linkage formed between a dehydrated serine and a lysine.
In embodiments of the library, the non-covalent bond is a formed via a protein domain, such as, a zinc-finger domain, a jun-fos interaction or a leucine zipper.
In certain embodiments, the covalent bond is a disulfide bond. In certain embodiments^ the disulfide bond is formed by two cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more cysteine residues. In certain embodiments, the library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 cysteine residues.
In certain embodiments, the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more cysteine residues. In certain embodiments, the library consists of peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 cysteine residues.
In embodiments of the library, the synthetic peptides are 9 or more amino acids in length.
In embodiments of the library, each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids, b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 5-50% cysteine and an equal mixture of the remaining natural occurring amino acids, and d) m and n are both, and independently from each other, 3-20.
In embodiments of the library, each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein a) Amix are each a mixture of 10-20% cysteine and an equal mixture of the
remaining natural occurring amino acids, b) m is 5-6, and c) n is 3-20.
In embodiments of the library, each member of the library comprises an amino acid sequence (X) I - Cmix - (X) m - (Amix) n, wherein a) I is 1 -3.
In embodiments of the library, each member of the library comprises an amino acid sequence (X) l-Cmix- (X) m- (Amix) n, wherein a) Amix are each a mixture of 15% cysteine and an equal mixture of the remaining natural occurring amino acids, b) I is 3, c) m is 6, and d) n is 10.
In embodiments of the library, the library has a design as shown in Figure 1 .
In embodiments of the library, Amix comprises 15% cysteine, and an equal mixture of the remaining natural occurring amino acids. An aspect includes a library of nucleic acids encoding the libraries disclosed herein. An aspect includes vector comprising the nucleic acids disclosed herein. In embodiments the vector is a display vector or an expression vector.
An aspect includes a method of identifying a peptide specific for an antigen, comprising
(a) contacting an antigen with a library as disclosed herein, and
(b) selecting one or more peptides specific for said antigen.
An aspect includes peptide identified using the method described herein. Examples
Example 1 : Selection of the appropriate vectors
First we had to decide on the display vector into which the DNA-library encoding the peptides of the present disclosure will be cloned. Optimal display modes/rates were taken into consideration in the choice of the display vector (gill- or gVIII-fusion), as well as the optional use of structured or unstructured linkers and additional affinity tags for detection and immobilization. For cloning of the variable library fragment into alternative display vectors suitable restriction enzyme sites were identified for the generation of a fusion with the selected phage proteins.
The use of a phagemid vector, containing a N-terminal fusion of the peptide library sequence to the gene of the minor coat protein pill, in combination with Hyperphages harboring a pill gene deletion results in a pentavalent display of the peptides. The pill protein is present in 5 copies at the distal end of the phage particles and its function is required for phage infectivity by binding to the F-pilus of bacterial cells. Although it is thought that if the displayed peptides are sufficiently short enough (<50 residues) the function of the pill-fusion would not to be negatively affected and all five copies of the pill protein can carry displayed peptides, functional impairment of the plll-peptide fusion cannot be completely excluded. In general, reduced functionality of pill-fusion proteins, which might result in reduced phage infectivity, can be compensated by the use of Helperphage with a wild-type pill gene. pVIII is a major capsid protein, therefore, if multivalent display of the peptide library is desired then fusion of the variable peptide to pVIII should be used. Exemplary display vectors for displaying the libraries disclosed herein are shown in Figures 7- 8 and 10. Exemplary expression vectors for expressing the peptides disclosed herein are shown in Figure 9.
Example 2: Design of the peptide library
In addition to the use of linear peptides, it is advantageous to present cyclic and constrained peptides to facilitate the identification of specific interaction partners in Phage Display experiments. This can be realized by two cysteine residues flanking the variable region resulting in the generation of disulfide bond stabilized circular peptides of fixed size. The Slonomics Technology allows the controlled introduction of such cysteine residues at various desired positions within the variable peptides. The percentage of the cysteine residues at the chosen variable positions can be designed to result in the generation of an average of one to two or more cysteine residues per molecule. Therefore, encoded peptides without cysteine or containing only one cysteine residue will be linear, while the encoded peptides with two cysteine residues or more at desired positions will generate cysteine bridged circles of various lengths. Thus with this design, for example the design of Figure 1 , it is possible to generate a universal peptide library containing a defined mixture of linear and circular peptides, thereby offering the possibility of a simultaneous presentation of both alternatives in the same screening experiment. Evidence of a library comprising peptides have a controlled frequency of cysteines at certain positions is shown in Figures 2A-C, and 11 and 12.
Example 3: Library Generation
A library having the design shown in Figure 1 was chosen for display and testing.
The DNA fragments containing the peptide library sequence were synthesized as follows: The flanking constant regions comprising a signal sequence, epitope tag and spacer regions were synthesized by gene synthesis. The peptide library encoding sequence with a randomized stretch of 20 amino acids was synthesized by Slonomics. The resulting 333 bp completely synthetic linear DNA fragment comprising the peptide library and flanking constant regions was cloned via Xbal and Sail into the pill and pVIII display vectors, respectively (Figure 7 and Figure 8). Typically 0.25 to 2 of the ligated phagemid DNA of the libraries were used to transform E. coli MC1061 F electrocompetent cells and transformants were collected in TB medium and shaken for at 37<C for 1 h. Dilutions of the outgrowth medium were plated on LB/Chloramphenicol/Glucose. Amplification of the libraries was performed by shaking o/n in appropriate amounts of LB/Ghloramphenicol/1%Glucose. Library sizes for the cloned peptide library-L20-plll- and -pVIII-fusions ranged between 1.2 E+09 and 4.4E+09.
To analyze the quality of the engineered sub-libraries at least 90 clones for each library were picked and the Xbal/Hindlll region was sequenced to determine correctness and uniqueness of the sequences. The libraries were stored as E. coli glycerol cultures.
Phage displaying the pill- and the pVIII-fusions of the L20-peptide library were prepared as follows. For each library phage preparation 80 ml 2x YT/Chloramphenicol/Glucose medium were inoculated with bacteria from the corresponding library glycerol stock resulting in an OD600nm of 0.2 - 0.3. Cultures were shaken until an OD600nm of 0.45 - 0.55 was reached. Then helper phage was added at a multiplicity of infection of 10 to the bacterial culture followed by an incubation for 45 min at 37"C without shaking and then for 45 min at 37"C shaking at 120 rpm. Bacteria were spun down and helper phage containing supernatant was discarded. Phage-infected bacteria were resuspended in 400 ml 2x YT/Chlorarhphenicol/ Kanamycin /IPTG medium and incubated overnight at 22Ό with s haking at 120 rpm. The next day bacteria from the overnight culture were pelleted by centrifugation and the supernatant containing the peptide- presenting phage was collected. Phage precipitation was performed by adding PEG/NaCl to the phage-containing supernatant. The sample was incubated for at least 30 min on ice. Precipitated phage were spun down and resuspended in PBS. The sample was rotated slowly to obtain a homogeneous suspension and residual bacterial debris was pelleted and discarded.
From the phage-containing supernatant the phage were precipitated again using PEG/NaCl. Finally, the phage pellet was resuspended in PBS, transferred to a sterile tube and incubated with gentle agitation to obtain a homogeneous suspension. Phage titers were determined by spot titration and UV absorbance (Nanodrop) at OD268nm, and ELISA.
Display of peptide on the produced phage was evaluated by ELISA. The anti-M13 antibody (Santa Cruz) was used for capturing, as it captures phage particles via the major coat protein g8p. For detection three different antibodies were used. A monoclonal anti-M13 (directed against the major coat protein of M13 phage, g8p) conjugated to HRP (Amersham), and a monoclonal antibody against the FLAG epitope conjugated to AP (AP27, Sigma) or monoclonal anti Histidine antibody conjugated to HRP (R&D Systems), as both epitope tags are encoded by the pill- and pVIII-peptide libraries and therefore part of the displayed peptides (Figure 10) . The capture antibody was immobilized by dispensing antibody solution for the anti-M13 antibody into the wells of a 96-well axisorp plate, sealing the plate with laminated foil and incubating overnight. The next day, the plates were washed 3 times with PBST, and each well was blocked with blocking buffer for at least 1 h at room temperature. After blocking and washing of the plates, dilutions of phage containing supernatants were added to the wells and incubated for 1 h at room temperature. For detection washed plates were either analyzed using QuantaBlu (Pierce) for HRP conjugated antibodies or Attophos fluorescence substrate (Roche, #11681982001 ) for the AP-conjugated anti-FLAG antibody.
Example 4: Quality Control
Another important aspect is the evaluation of the quality and functionality of the peptide library. A qualitative assessment of the phage library, with respect to amino acid distribution, frequency and redundancy was carried out using Sanger sequencing and Next Generation Sequencing.
99 clones were analyzed from the library design of Figure 1 using Sanger sequencing, the results of which are shown in Figure 2A. Figure 2A shows the position and distribution of each amino acid, including the cysteines, which form the cyclic peptides disclosed herein. Therefore, this figure shows that the design of Figure 1 successfully produces a library with the desired positions and proportions of cysteines.
Figure 2B and 2C confirm these results further, as Next Generation Sequencing shows that the cysteine distribution within the library highly correlates with the predicted cysteine distribution.
Of the 99 individual clones sampled from the library of Figure 1 , on average, 2.27 cysteines were identified per clone. In addition, clones were shown to have 0, 1 , 2 or more cysteines, and of the clones sampled 33% were linear and 67% were cyclic, see Figures 3A-3C.
Of the 99 individual clones sampled from the library of Figure 1 , 67% were cyclic peptides. The cyclic peptides comprised loops ranging in size from 3-17 amino acids in length, see Figures 4A-B. Figures 5-6 show example peptides that result from the library of Figure 1 , which have at least two disulfide bonds, and may result in various sized loops within one peptide.
Of the 99 sampled peptides: 42 of the 99 had a cysteine at position 4, which allowed for the formation of a large loop. Of the 42 having a cysteine at position 4: 16 had a total of 2 Cys forming one large loop; 18 of the 42 had 3 cysteines forming either one large or one small loop; 3 of the 42 had 4 cysteines forming one large and one small loop; 1 of the 42 had 5 cysteines, 1 of the 42 had 6 cysteines, 1 of the 42 had 7 cysteines, each forming one large and several small loops. Of the 24 cyclic peptides having no cysteine at position 4, which therefore can only form small loops, 10 of the 24 had 2 cysteines forming one loop; 8 of the 24 had 3 cysteines forming one loop; 6 of the 24 had 4 cysteines and were able to form up to two loops; 1 of the 24 had 5 cysteines, and 1 of the 24 had 6 cysteines and were able to form multiple loops.
Example 5: Panninas
The suitability of the peptide libraries disclosed herein for epitope mapping, and for the identification of therapeutic peptides was analyzed using available model antigens and antibodies whose epitopes are known.
Both libraries (pill and pVIII) were used for solid phase and solution pannings with Streptavidin and the anti c-Myc antibody. Streptavidin was obtained from IBA (Goettingen, Germany). The anti-cMyc mouse mAb was obtained from Santa Cruz Biotechnology (Heidelberg, Germany).
Test selection with disulfide-constrained peptide libraries pill (L20) and pVIII (L20) against Streptavidin beads and anti c-Myc antibody.
Streptavidin was used as antigen for test panning because the binding consensus motif HPQ/M is well described and used in several studies for test selection of peptide libraries (Devlin, J. J., Panganiban, L. C. and Devlin, P. E. (1990) Science 249, 404-406; Lam, K.S. and Lebl, M: Streptavidin and Avidin Recognize Peptide Ligands with Different Motifs. Immuno Methods 1 : 11 -15, 1992).
Here both peptide libraries were handled according to published standard protocols for phage display based peptide selections (Zwick, M. B., Menendez, A., Bonnycastle, L. L. C. and Scott, J.K. (2001 ). In C. F. Barbas, D. R. Burton, J.K. Scott and G. J. Silverman, (Eds.), Phage Display: A Laboratory Manual (pp.18.1-18.44). New York: Cold Spring Harbor Laboratory Press; ) with minor adjustments in terms of selection stringency and adaptation to phagemide vector system. The test selections were performed over 3 subsequent enrichment rounds with monitoring of specific sequences by conventional and next generation sequencing. In short, all pannings were completed with various antigen concentrations (100 nM for round 1 , 50 nM for round 2, and 25 nM for round 3) under standard and less stringent washing conditions. After incubation of the phage with antigens, unspecific bound phages were washed off using PBST and subsequently, the specifically bound phage were eluted using Glycine/HCI.
For solid phase pannings the target proteins (Streptavidin and anti-cMyc mlgG) were diluted in PBS for direct coating the surface of a microtiter plate with a protein concentration of 100 nM. For each sublibrary 2 wells of a microtiter plate were coated with the target proteins using 300 ul protein solution per well. The plate was stored overnight at 4"C. Then the protein solution was removed from the coated wells by rapidly inverting the plate over a plastic tray. The coated wells were washed twice with 400 μΙ PBS and blocked with 400 μΙ blocking buffer for 2 h at RT on a microtiter plate shaker.
Meanwhile the phage blocking mixtures were incubated in 2 ml reaction tubes for 2 h at RT shaking gently. After the blocking procedure the wells were washed 2 times with 400 μΙ PBS and the 300 μΙ of the pre-blocked phage mix transferred into each blocked well. It was incubated for 2 h at RT on a microtiter plate shaker. After that the phage solution from the target protein coated wells were removed by rapidly inverting the plate over a plastic tray and plates were washed with the following washing conditions (Table 1 and 2).
Table 1 (Standard washing conditions)
1 st round 2nd round 3rd round
3x PBST quick 1 x PBST quick 10x PBST quick
2x PBST for 5 min 4x PBST for 5 min 5x PBST for 5 min
3x PBS quick 1 x PBS quick 10x PBS quick
2x PBS for 5 min 4x PBS for 5 min 5x PBS for 5 min
Table 2 (Less stringent washing conditions)
1 st round 2nd round 3rd round
5x PBST quick 4x PBST quick 5x PBST quick
5x PBS quick 1 x PBST for 5 min 3x PBST for 5 min
4x PBS quick 5x PBS quick
1 x PBS for 5 min 3x PBS for 5 min All washing steps were done at RT. After the washing steps all traces of the wash solution were removed by carefully tapping the microtiter plate on a new stack of paper towels. For the elution of specifically bound phage, 300 μΙ 100 mM Glycine/HCI, pH 2.2, 1 mg/ml BSA were added to each selection well and incubated at room temperature for 10- 15 min without shaking. The eluates of each selection were collected and immediately neutralized with 40μΙ 1 M Tris pH 8.
For bead based pannings Streptavindin-coated Dynabeads M280 (Invitrogen) or for binding of the anit-c yc-antibody, Dynabeads-ProteinG (Life Technologies) were used. After blocking and washing of the beads the target proteins were incubated with pre-adsorbed phage. Washing of the coated magnetic Dynabeads was carried out with a magnetic particle separator and incubations were done by over head rotation in low binding tubes. Washing and elution conditions for the bead based solution pannings were identical to the solid phase pannings listed above (Table 1 and Table 2).
E. CO// TG1 F' with an OD600nm of 0.6-0.8 was added to the phage eluates of each selection and was incubated in an incubator without shaking. After infection bacteria were plated out evenly on two large LB/Chloramphenicol/Glucose agar plates for each selection and incubated overnight at 37*0 and Glycerol phage stocks were pr epared.
For the following panning rounds bacterial suspensions of each pool were collected and used to propagate phages for an additional panning round as described above.
After each round of panning the phage titer was determined. The expected range goes from 1 x1010-1 x1012 phage/ml for the input and 104-109 phage/ml for the output. Table 3 shows the input and the output after each round of panning and all values are in the expected range.
Table 3
Figure imgf000039_0001
Figure imgf000040_0001
After completion of the panning rounds phage output pools were analyzed with respect to their binding specificity in an additional differential panning round and ELISA against the specific and unrelated target proteins, such as VEGF-165. The check for specificity of binding by ELISA was carried out by using direct coated target proteins and peptide displaying phage' from panning outputs. Bound phages were detected by the additionally encoded Flag tag using anti- Flag detection. To analyze phage expression anti- 13 capture and anti-Flag detection was used.
NGS analysis and ELISA of the phage outputs from subsequent and differential panning rounds revealed an enrichment of specific binders, that bound to the specific target proteins but did not show binding to unrelated target proteins. Primary hits were defined as an ELISA signal of at least 5-fold above the background.
Example 6: Results of Pannings
Results of Sequencing of Streptayidin binders
Sanger Sequencing was completed on single clones from the panning outputs. A sample of the sequences identified from the Streptavidin binders are shown in Figure 1 1. The known consensus motif, HPQ/M, binding to Streptavidin is clearly shown in both linear and cyclic peptides. In cyclic peptides the motifs were found in small and large rings. This confirms the utility of such a library and of the libraries' resultant peptides, as they are useful in accurately mapping the epitope of binding molecules and/or the region of interest on antigens.
Example 7: Results of Sequencing of c-Mvc binders
Selections against an anti-c-Myc antibody were completed as described above in Example 5 using the library described in Example 3.
Sanger Sequencing was completed on single clones from the panning outputs. A sample of the sequences identified from the Streptavidin binders are shown in Figure 12. This result confirms that a diverse number of specific peptides were identified, wherein the peptides selected are both linear, constrained, and have a wide range of confirmations, and loop lengths.
In early selection rounds it is desirable to identify a wide range of different peptide conformations that are specific for the antigen of interest, here c-myc. As a next step, the identified peptides could be characterized for their functional properties, such as, affinity, function in relevant assay, etc. A person of skill in the art would immediately realize that this library could be used in the discovery phase to identify potential therapeutically relevant peptides that could be developed for pharmaceutical use.

Claims

1 . A library of synthetic peptides, comprising
linear and cyclic peptides,
wherein the cyclic peptides have different loop lengths, and
wherein the cyclic peptides have two or more loops.
2. The library according to claim 1 , wherein the loop lengths range from 3-17 amino acids in length.
3. The library according to claim 2, wherein the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, or 17 amino acids.
4. The library according to claim 3, wherein the library comprises cyclic peptides comprising loop lengths of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, and 17 amino acids.
5. The library according to claim 1 , where the library comprises peptides
comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more loops.
6. The library according to claim 5, where the library comprises peptides
comprising 0, 1 , 2, 3, 4, 5, 6 and 7 loops.
7. A library according to any one of the preceding claims, wherein the synthetic peptides of said library are translated from nucleic acids.
8. A library according to any one of the preceding claims, wherein the synthetic peptides of said library are displayed on bacteriophage.
9. A library according to any one of the preceding claims, wherein the synthetic peptides of said library comprise a controlled ratio of cysteines at certain positions.
10. A library according to any one of the preceding claims, wherein the cyclic peptides are formed by one or more covalent or one or more non-covalent bonds.
1 1 . A library according to claim 10, wherein said covalent bond is a disulfide bond or a bond between two non-naturally occurring amino acids.
12. A library according to claim 1 1 , wherein said bond between two non-naturally occurring amino acids is a thioether bond.
13. A library according to claim 1 1 , wherein said covalent bond is a disulfide bond.
14. A library according to claim 13, wherein said disulfide bond is formed by two cysteine residues.
15. A library according to claim 14, wherein said library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 or 7 or more cysteine residues.
16. A library according to claim 15, wherein said library comprises peptides comprising 0, 1 , 2, 3, 4, 5, 6 and 7 cysteine residues.
17. A library according to any one of the preceding claims, wherein each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein a) Cmix is a mixture of 50% cysteine and an equal mixture of the remaining natural occurring amino acids, b) X are each an equal mixture of the natural occurring amino acids, excluding cysteine, c) Amix are each a mixture of 5-50% cysteine and an equal mixture of the remaining natural occurring amino acids, and d) m and n are both, and independently from each other, 3-20.
18. A library according to claim 17, wherein each member of the library comprises an amino acid sequence Cmix - (X) m - (Amix) n, wherein d) Amix are each a mixture of 10-20% cysteine and an equal mixture of the
remaining natural occurring amino acids, e) m is 5-6, and f) n is 3-20.
19. A library according to claim 18, wherein each member of the library comprises an amino acid sequence (X) I - Cmix - (X) m - (Amix) n, wherein b) I is 1 -3.
20. A library according to claim 19, wherein each member of the library comprises an amino acid sequence (X) l-Cmix- (X) m- (Amix) n, wherein e) Amix are each a mixture of 15% cysteine and an equal mixture of the remaining natural occurring amino acids, f) I is 3, g) m is 6, and h) n is 10.
21 . A library according to claim 20, wherein the library has a design as shown in Figure 1 .
22. A library of nucleic acids encoding the library of peptides according to any one of the preceding claims.
23. A vector comprising the nucleic acids of claim 22.
24. The vector according to claim 23, wherein said vector is a display vector or an expression vector.
25. A method of identifying a peptide specific for an antigen, comprising
(a) contacting an antigen with a library of any of claims 1 -21 , and
(b) selecting one or more peptides specific for said antigen.
26. A peptide identified using the method of claim 25.
PCT/EP2015/059496 2014-05-02 2015-04-30 Peptide libraries WO2015166036A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US15/306,538 US10822604B2 (en) 2014-05-02 2015-04-30 Peptide libraries
EP15720692.1A EP3137482A1 (en) 2014-05-02 2015-04-30 Peptide libraries
US17/014,084 US11352620B2 (en) 2014-05-02 2020-09-08 Peptide libraries

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP14166898 2014-05-02
EP14166898.8 2014-05-02

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US15/306,538 A-371-Of-International US10822604B2 (en) 2014-05-02 2015-04-30 Peptide libraries
US17/014,084 Continuation US11352620B2 (en) 2014-05-02 2020-09-08 Peptide libraries

Publications (1)

Publication Number Publication Date
WO2015166036A1 true WO2015166036A1 (en) 2015-11-05

Family

ID=50679879

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2015/059496 WO2015166036A1 (en) 2014-05-02 2015-04-30 Peptide libraries

Country Status (3)

Country Link
US (2) US10822604B2 (en)
EP (1) EP3137482A1 (en)
WO (1) WO2015166036A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017149117A1 (en) 2016-03-04 2017-09-08 Morphosys Ag Polypeptide library
CN111727194A (en) * 2017-04-26 2020-09-29 湖南中晟全肽生化有限公司 Method for constructing peptide library

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10822604B2 (en) 2014-05-02 2020-11-03 Morphosys Ag Peptide libraries

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NZ207394A (en) 1983-03-08 1987-03-06 Commw Serum Lab Commission Detecting or determining sequence of amino acids
NL9101953A (en) 1991-11-21 1993-06-16 Seed Capital Investments TESTING DEVICE CONTAINING A PLATE WITH A MULTIPLE OF WELLS WITH AN ASSOCIATED DOSING DEVICE, AND A KIT INCLUDING THESE DEVICES AND USE OF THE DEVICES.
US20020068301A1 (en) 1997-05-28 2002-06-06 Hung-Sen Lai Cyclic peptide libraries and methods of use thereof to identify binding motifs
US6441140B1 (en) 1998-09-04 2002-08-27 Cell Signaling Technology, Inc. Production of motif-specific and context-independent antibodies using peptide libraries as antigens
ATE296350T1 (en) 1999-06-14 2005-06-15 Genentech Inc STRUCTURED PEPTIDE SCAFFOLD FOR DISPLAYING TURNED LIBRARIES ON PHAGE
US20030166003A1 (en) 1999-06-14 2003-09-04 Cochran Andrea G. Structured peptide scaffold for displaying turn libraries on phage
EP1197755A1 (en) 2000-10-11 2002-04-17 Pepscan Systems B.V. Identification of protein binding sites
AU2006206848B2 (en) * 2005-01-24 2012-05-31 Pepscan Systems B.V. Binding compounds, immunogenic compounds and peptidomimetics
EP2653544A1 (en) 2008-02-05 2013-10-23 Bicycle Therapeutics Limited Methods and compositions
US10822604B2 (en) 2014-05-02 2020-11-03 Morphosys Ag Peptide libraries

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
DEVLIN J J ET AL: "RANDOM PEPTIDE LIBRARIES: A SOURCE OF SPECIFIC PROTEIN BINDING MOLECULES", SCIENCE, AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE, US, vol. 249, 27 July 1990 (1990-07-27), pages 404 - 406, XP000616029, ISSN: 0036-8075, DOI: 10.1126/SCIENCE.2143033 *
FRANÇOIS BÉDARD, ANICK GIRARD, ÉRIC BIRON: "A Convenient Approach to Prepare Topologically Segregated Bilayer Beads for One-Bead Two-Compound Combinatorial Peptide Libraries", INTERNATIONAL JOURNAL OF PEPTIDE RESEARCH AND THERAPEUTICS, vol. 19, 18 July 2012 (2012-07-18), pages 13 - 23, XP002730970 *
H. VAN DE LANGEMHEEN, M. VAN HOEKE, H.C. QUARLES VAN UFFORD, J.A.KRUIJTZER, R.M.J.LISKAMP: "scaffolded multiple cyclic peptide libraries for protein mimics by native chemical ligation", ORGANIC AND B IOMOLECULAR CHEMISTRY, vol. 12, 24 April 2014 (2014-04-24), pages 4471 - 4478, XP002742187 *
SANG HOON JOO, QING XIAO, YUN LING, BHASKAR GAPISHETTY, DEHUA PEI: "High-Throughput Sequence Determination of Cyclic Peptide Library Members by Partial Edman Degradation/Mass Spectrometry", JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, vol. 128, 9 August 2006 (2006-08-09), pages 13000 - 13009, XP002730969 *
See also references of EP3137482A1 *
UCHIYAMA F ET AL: "Designing scaffolds of peptides for phage display libraries", JOURNAL OF BIOSCIENCE AND BIOENGINEERING, ELSEVIER, AMSTERDAM, NL, vol. 99, no. 5, 1 May 2005 (2005-05-01), pages 448 - 456, XP027707156, ISSN: 1389-1723, [retrieved on 20050501] *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017149117A1 (en) 2016-03-04 2017-09-08 Morphosys Ag Polypeptide library
CN111727194A (en) * 2017-04-26 2020-09-29 湖南中晟全肽生化有限公司 Method for constructing peptide library
CN111727194B (en) * 2017-04-26 2023-11-03 湖南中晟全肽生化有限公司 Method for constructing peptide library

Also Published As

Publication number Publication date
EP3137482A1 (en) 2017-03-08
US10822604B2 (en) 2020-11-03
US20170166886A1 (en) 2017-06-15
US11352620B2 (en) 2022-06-07
US20200399630A1 (en) 2020-12-24

Similar Documents

Publication Publication Date Title
Bratkovič Progress in phage display: evolution of the technique and its applications
US11352620B2 (en) Peptide libraries
T O'Neil et al. Phage display: protein engineering by directed evolution
US5625033A (en) Totally synthetic affinity reagents
JP4312403B2 (en) Novel method for displaying (poly) peptide / protein on bacteriophage particles via disulfide bonds
JP5944029B2 (en) pVII phage display
US7256038B2 (en) Polypeptide display libraries and methods of making and using thereof
Frei et al. Protein and antibody engineering by phage display
WO2002103363A2 (en) Selection by avidity capture
Felici et al. Peptide and protein display on the surface of filamentous bacteriophage
EP0883686A1 (en) Novel method for the identification of nucleic acid sequences encoding two or more interacting (poly)peptides
JP2011507529A (en) Alternative scaffold protein fusion phage display via fusion of M13 phage to pIX
Adda et al. Random sequence libraries displayed on phage: identification of biologically important molecules
CA2586028A1 (en) Ultra high throughput capture lift screening methods
Cesareni et al. Phage displayed peptide libraries
US8969253B2 (en) Method for screening phage display libraries against each other
Crameri pJuFo: a phage surface display system for cloning genes based on protein-ligand interaction
McConnell et al. Construction and screening of M13 phage libraries displaying long random peptides
Garufi et al. Display libraries on bacteriophage lambda capsid
Kay et al. Principles and applications of phage display
Petrenko et al. Vectors and modes of display
Smith Principles of affinity selection
Rader The pComb3 Phagemid Family of Phage Display Vectors
US20050130124A1 (en) Phagemid display system
Gazarian Drug discovery and design via high throughput screening of combinatorial phage-display protein-peptide libraries

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15720692

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 15306538

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2015720692

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2015720692

Country of ref document: EP