EP1119530A2 - Selecting codes to be used for encoding combinatorial libraries - Google Patents

Selecting codes to be used for encoding combinatorial libraries

Info

Publication number
EP1119530A2
EP1119530A2 EP99950264A EP99950264A EP1119530A2 EP 1119530 A2 EP1119530 A2 EP 1119530A2 EP 99950264 A EP99950264 A EP 99950264A EP 99950264 A EP99950264 A EP 99950264A EP 1119530 A2 EP1119530 A2 EP 1119530A2
Authority
EP
European Patent Office
Prior art keywords
codes
code
reaction conditions
group
tags
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP99950264A
Other languages
German (de)
French (fr)
Inventor
Lawrence Wayne Dillard
James Andrew Connelly
John J. Baldwin
Eric George Horlbeck
Gregory L. Kirk
Giorgio Lauri
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pharmacopeia Drug Discovery Inc
Original Assignee
Pharmacopeia LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pharmacopeia LLC filed Critical Pharmacopeia LLC
Publication of EP1119530A2 publication Critical patent/EP1119530A2/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B70/00Tags or labels specially adapted for combinatorial chemistry or libraries, e.g. fluorescent tags or bar codes
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J19/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J19/0046Sequential or parallel reactions, e.g. for the synthesis of polypeptides or polynucleotides; Apparatus and devices for combinatorial chemistry or for making molecular arrays
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J2219/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J2219/00274Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
    • B01J2219/00277Apparatus
    • B01J2219/0054Means for coding or tagging the apparatus or the reagents
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J2219/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J2219/00274Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
    • B01J2219/00277Apparatus
    • B01J2219/0054Means for coding or tagging the apparatus or the reagents
    • B01J2219/00542Alphanumeric characters
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J2219/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J2219/00274Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
    • B01J2219/00277Apparatus
    • B01J2219/0054Means for coding or tagging the apparatus or the reagents
    • B01J2219/00572Chemical means
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J2219/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J2219/00274Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
    • B01J2219/0068Means for controlling the apparatus of the process
    • B01J2219/00686Automatic
    • B01J2219/00689Automatic using computers
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J2219/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J2219/00274Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
    • B01J2219/0068Means for controlling the apparatus of the process
    • B01J2219/00695Synthesis control routines, e.g. using computer programs

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biochemistry (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Codes to be used for encoding combinatorial libraries are selectively chosen based on one or more predefined function or criterion. In particular, a subset of N possible codes is selected based on some criterion. In one example, the codes are binary codes, and each code represents the tags used during a particular stage of synthesis of members of a combinatorial library. The tags define the reaction condition used during that particular stage of synthesis. In one embodiment, the predefined criterion ensures that each code includes more than one tag. This helps eliminate ambiguity during a decoding process in which the tags are identified to determine the reaction history during synthesis.

Description

SELECTING CODES TO BE USED FOR ENCODING COMBINATORIAL LIBRARIES
TECHNICAL FIELD
This invention relates, in general, to the encoding of combinatorial libraries and, in particular, to selectively choosing codes to be assigned to reaction conditions used during synthesis of a combinatorial library.
BACKGROUND ART
Combinatorial techniques of chemical synthesis allow the creation of molecular libraries having immense diversity. These techniques entail a series of chemical steps with multiple choices of reaction conditions (e.g., reagents, temperature changes, etc.) for each step. The complexity, or number of members in a combinatorial library, is given by the product of the number of reaction conditions for each step of the synthesis, and can therefore, be quite large. The challenge in using combinatorial libraries is the characterization of members of the library with particular desired properties.
One solution to the above challenge is to use a split synthesis or direct divide technique to perform chemical synthesis on solid particles, such as beads. Through a protocol of separating and mixing beads during the synthesis, each bead in the final library has a product from a single, specific reaction sequence chemically bound to it, and that product is likely to differ from that bound to another bead.
During each step of the synthesis, zero or more tags (e.g., tagging molecules) are attached to each bead in order to encode the reaction condition used in that step, as well as the step number. In one embodiment, the tags are used in combination with one another to form a binary record of the synthetic steps for each bead. For example, assume a combinatorial synthesis using any of seven different reagents in each of N steps is to be carried out. Such a combinatorial synthesis would yield 7N different final products. As an example, the various reagents which can be used in any step are designated as binary 001 (Reagent 1), 010(Reagent 2), 011 (Reagent 3),... 111 (Reagent 7). Thus, a binary synthesis code describing any complete N-STEP synthesis using 3 x N binary digits can be written.
For instance, if Reagent 3 is used in the first step, the binary numerical description is 011. If Reagent 1 is used in the second step, the description is 001 011. Further, if Reagent 6 is used in the third step, the description is 110 001 Oi l. This 9-bit binary synthesis code describes the synthesis, and can be read from right to left in 3 -bit blocks to decode the reagents used in each step of the synthesis.
To represent such a synthesis code chemically, a set of distinguishable, sensitively detectable molecules is used as tags, and the presence of a particular tag represents a binary " 1 " for the corresponding bit. Using a set of nine tagging molecules, T9-T1, for the above example, where T9 represents the leftmost binary bit and Tl represents the rightmost bit, the tag mixture containing only T9, T8, T4, T2 and Tl represents the 110 001 011 synthesis code.
The use of tags and various encoding techniques are described in detail in one or more of the following references, each of which is hereby incorporated herein by reference in its entirety: Ohlmeyer et al., "Complex synthetic chemical libraries indexed with molecular tags", Proceedings Of The National Academy Of Sciences Of The United States Of America, Vol. 90, No. 23, pp. 10922-10926 (December 1993); J.J. Baldwin, "Design, synthesis and use of binary encoded synthetic chemical libraries", Molecular Diversity, Vol. 2, No. 1/2, pp. 81-88 (October 1996); Burbaum et al., "A paradigm for drug discovery employing encoded combinatorial libraries", Proceedings Of The National Academy Of Sciences Of The United States Of America, Nol. 92, No. 13, pp. 6027-6031 (June 1995); Still et al., U.S. Patent No. 5,565,324, entitled "Complex Combinatorial Chemical Libraries Encoded With Tags", issued on October 15, 1996; Baldwin et al., U.S. Patent No. 5,618,825, entitled "Combinatorial Sulfonamide Library", issued on April 08, 1997; Baldwin et al, U.S. Patent No. 5,663,046, entitled
"Synthesis Of Combinatorial Libraries", issued on September 02, 1997; Still et al., International Publication No. WO 94/08051, entitled "Complex Combinatorial Chemical Libraries Encoded With Tags", International Publication Date April 14, 1994; Dower et al., U.S. Patent No. 5,639,603, entitled "Synthesizing And Screening Molecular Diversity", issued on June 17, 1997; and Dower et al.,
International Publication No. WO 93/06121, entitled "Method Of Synthesizing Diverse Collections Of Oligomers", International Publication Date April 01, 1993.
While binary coding has been established as a viable technique in encoding complex combinatorial libraries, there are some shortcomings with the present techniques, especially during decoding of the tags.
Decoding is performed in order to determine the reaction history of a particular bead. In particular, during decoding, any tag(s) attached to a bead is detached and identified to determine the particular conditions that occurred during synthesis. One technique for separating and identifying tags is known as Capillary Gas Chromatography (GC).
During decoding, it is sometimes difficult to determine whether a tag is present due to impurities of similar retention times or tags present in low amounts. Thus, the decoding becomes ambiguous, and it is difficult, under those circumstances, to determine the appropriate binary code that represents the reaction history.
Based on the foregoing, a need exists for a coding technique that eliminates ambiguous code reading. A further need exists for a capability that enables the selective choosing of codes, from N possible codes, to be assigned to reaction conditions. A yet further need exists for a capability that guarantees the presence of enough tag peaks in a chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined. That is, a need exists for the presence of at least two tag peaks, so that the time distance (i.e., relative timing) between those peaks can be determined, even if there is significant variability in their positions (i.e., absolute timing). A further need exists for a capability that uses this time distance to confirm whether a particular peak represents the presence of a tag or a contaminant (e.g., substantially constant time spacing indicates the presence of a tag). The use of time spacing as a confirmation of whether a peak represents a tag or a contaminant is referred to herein as "self-clocking".
SUMMARY OF THE INVENTION
The shortcomings of the prior art are overcome and additional advantages are provided through the provision of a method of determining codes usable in encoding combinatorial libraries. The method includes, for instance, selecting a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library. Each of the plurality of codes includes a plurality of tags, wherein none of the plurality of codes includes only a single tag. The method further includes assigning selected codes to reaction conditions.
In one embodiment, each of the plurality of codes is a binary code, and a binary "one" within the binary code represents the presence of a particular tag.
In another embodiment of the invention, the plurality of codes is selected using a predefined criterion. The predefined criterion specifies at least one of the following: each of the plurality of codes includes an even number of tags present therein; each of the plurality of codes includes an odd number of tags present therein; each of the plurality of codes includes up to a maximal number of tags present therein; each of the plurality of codes includes up to a maximal number of "zero" bits; and each of the plurality of codes does not include a predetermined pattern of bits.
In another embodiment, the plurality of codes is selected using a parity bit.
In a further aspect of the present invention, a method of determining codes usable in encoding chemical libraries is provided. The method includes selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library. The selecting includes using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes. The method further includes assigning selected codes to reaction conditions.
In yet a further aspect of the present invention, a method of determining codes usable in encoding chemical libraries is provided. The method includes selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion. The predefined criterion is other than excluding an "all zeroes" code. The method further includes assigning selected codes to reaction conditions.
In a further aspect of the present invention, a system of determining codes usable in encoding combinatorial libraries is provided. The system includes means for selecting a plurality of codes to be assigned to a plurality of reaction conditions, in which each of the plurality of codes includes a plurality of tags, such that none of the plurality of codes includes only a single tag. The system further includes means for assigning selected codes to reaction conditions.
In another aspect of the present invention, a system of determining codes usable in encoding chemical libraries is provided. The system includes means for selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library. The means for selecting includes means for using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes. The system further includes means for assigning selected codes to reaction conditions.
In yet a further aspect of the present invention, a system of determining codes usable in encoding chemical libraries is provided. The system includes means for selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion, wherein the predefined criterion is other than excluding an "all zeroes" code. The system further includes means for assigning selected codes to reaction conditions.
In another aspect of the present invention, an article of manufacture is provided, including at least one computer usable medium having computer readable program code means embodied therein for causing the determining of codes usable in encoding combinatorial libraries. The computer readable program code means includes computer readable program code means for causing a computer to select a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of the plurality of codes including a plurality of tags, wherein none of the plurality of codes includes only a single tag; and computer readable program code means for causing a computer to assign selected codes to reaction conditions.
In yet another aspect of the present invention, at least one program storage device is provided, which is readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries. The method includes selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library. The selecting includes using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes. The method further includes assigning selected codes to reaction conditions.
In a further aspect of the present invention, at least one program storage device is provided, which is readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries. The method includes selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion. The predefined criterion is other than excluding an "all zeroes" code. The method further includes assigning selected codes to reaction conditions.
In accordance with the principles of the present invention, a coding capability is provided that eliminates ambiguous code reading. Further, the capability of the present invention advantageously enables the selective choosing of codes, from N possible codes, to be assigned to reaction conditions. Further, the capability of the present invention guarantees the presence of enough tag peaks in a chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined. Additionally, the present invention is advantageously self-clocking. The use of the present invention in each synthesis step can result in the avoidance of single bit codes for reaction conditions.
Additional features and advantages are realized through the techniques of the present invention. Other embodiments and aspects of the invention are described in detail herein and are considered a part of the claimed invention.
BRIEF DESCRIPTION OF THE DRAWINGS The subject matter which is regarded as the invention is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the invention will be apparent from the following detailed description taken in conjunction with the accompanying drawings in which:
FIG. 1 depicts a generalized method of the present invention;
FIG. 2 depicts one embodiment of the logic associated with the selection capability of the present invention;
FIG. 3 depicts one example of N possible codes, in which a subset is selected (or "accepted") therefrom, in accordance with the principles of the present invention;
FIG. 4 depicts one example of a table of accepted codes, in accordance with the principles of the present invention;
FIG. 5 depicts one embodiment of the table of accepted codes of FIG. 4, in which the codes are assigned to reaction conditions, in accordance with the principles of the present invention;
FIG. 6 depicts another example of accepted codes, in accordance with the principles of the present invention;
FIG. 7a depicts another example of N possible codes, in which a subset is selected (or accepted) therefrom, in accordance with the principles of the present invention;
FIG. 7b depicts one example of adding a parity bit to the N possible codes of FIG. 7a in order to make a selection of the codes to be included in the subset of accepted codes, in accordance with the principles of the present invention;
FIG. 8 depicts one example in which it is difficult to determine whether a tag is present in a sample;
FIG. 9 illustrates one example of output from a gas chromatograph, in accordance with the principles of the present invention;
FIG. 10 depicts another example of output from a gas chromatograph, in accordance with the principles of the present invention; and
FIG. 11 depicts one embodiment of a computer environment providing and/or using the capability of the present invention.
BEST MODE FOR CARRYING OUT THE INVENTION
In accordance with the principles of the present invention, a capability is provided in which codes to be assigned to reaction conditions are selectively chosen based on one or more criterion. That is, out of N possible codes, a subset of the N codes is selected based on some constraint or some predefined function.
The codes are, for instance, binary codes, and each code represents the tags used during a particular stage of synthesis of members of a combinatorial library. Specifically, the tags define the reaction condition used during that particular stage of synthesis.
A generalized technique of one embodiment of the present invention is described with reference to FIG. 1. Initially, a group of acceptable codes is selectively chosen, from N possible codes, based upon a predefined function or one or more criterion, as described in detail below, STEP 100. Thereafter, the selected codes are assigned to reaction conditions, and those conditions may be used during synthesis of library members, STEP 102. The assignment may be arbitrary or may be based on one or more factors. For instance, the first code may be assigned to the first condition to be used during synthesis, etc.
One embodiment of the selection process used to choose a group of codes to be assigned to the reaction conditions is described in detail with reference to FIG. 2. Initially, a decision is made as to the N possible codes that could be used for a particular synthesis step, STEP 200. For example, a determination is made as to how many bits are sufficient to represent the number of reaction conditions to be used during that step. Assume for this one example that four bits are sufficient. Thus, in a binary scheme, there are 24 possible codes (i.e., N=16), as shown in FIG. 3.
Out of the possible codes, only those codes that satisfy one or more criterion (or predefined function) are selected. The criterion can include many different possibilities. As examples, the criterion can select codes having an even number of "one" bits (i.e., an even number of tags); an odd number of "one" bits; an odd number, greater than one, of "one" bits; more than one tag (i.e., more than one binary "one" bit); up to a total number of "zero" bits; up to a total number of "one" bits; up to a maximal allowed sequence of "zero" or "one" bits; an even parity; an odd parity; codes that do not include a predefined sequence of bits (e.g., the codes that do not include a sequence of 101), etc. The above criteria are only provided as examples. There are many more possibilities, and each of those possibilities is considered a part of the claimed invention.
For illustration purposes only, the predefined function, in this example, includes all those codes that have up to a total of two "zeroes". Thus, the process continues with determining which of the codes of FIG. 3 meet this criterion. Returning to FIG. 2, a code is considered from the possible codes, STEP 202. For example, code 0000 is considered from the codes depicted in FIG. 3. A determination is made as to whether the considered code meets the criterion of having no more than two zeroes, STEP 204. Since code 0000 has more than two zeroes, it does not meet the criterion, and thus, is not an acceptable code.
Therefore, if there are further codes to be considered, INQUIRY 206, a new code is considered, STEP 202.
Proceeding sequentially down the list of the possible codes illustrated in FIG. 3, the next code is 0001. Again, since code 0001 does not meet the criterion, STEP 204, and there are more codes, INQUIRY 206, another code is selected. This procedure continues. At some point, a code is selected that does meet the criterion, INQUIRY 204. For instance, code 0011 meets the criterion of having at a maximum two "zeroes", thus, this code is selected as an acceptable code.
In one example, the acceptable code is placed in a table of acceptable codes, STEP 208. One instance of such a table is depicted in FIG. 4. The above process continues until all of the acceptable codes are selected from the possible codes based on the predefined criterion.
As described with reference to FIG. 1 , after the acceptable codes are determined, each of the acceptable codes can be assigned, respectively, to one of a plurality of reaction conditions, as depicted in FIG. 5. In this particular case, ten reaction conditions are represented by ten unique codes.
Another example of a predefined function used to select acceptable codes from N possible codes is one in which all codes having an even number of tags is selected. This is depicted in FIG. 6. In this particular example, five bits are considered necessary to represent the various reaction conditions to be used during synthesis. Thus, there are 32 possible codes. Out of the 32 possible codes, 15 codes are acceptable (designated by "used" in the table). That is, 15 codes meet the criterion of having an even number of tags.
In yet another embodiment of the present invention, parity bits may be added to the N possible codes in order to determine the acceptable codes. For example, assume there are eight possible codes, as shown in FIG. 7a. Further, assume that even parity is to be used, in this example. Thus, for each code in which the addition of a binary "one" provides an even number of "one" bits, a parity "one" bit is added, as shown in FIG. 7b.
Thereafter, the codes having the parity "one" bit are selected from the N possible codes. In the example depicted in FIG. 7b, the following codes are chosen: 0011, 0101, 1001 and 1111.
In the above examples in which each code is a binary code, each binary "one" represents the existence of a tag and each binary "zero" represents the absence of a tag. When a reaction condition is used in a particular synthesis step, the tags represented by the binary code associated with that reaction condition are also added during that synthesis step. This provides a record of the conditions used during synthesis of a particular library member.
For example, assume Reaction Condition 3 (FIG. 5) is used during a first synthesis step, Reaction Condition 1 is used during a second step, and Reaction Condition 6 is used during a third step, then the synthesis code representing the three reaction conditions is 0111 0011 0110. This synthesis code can be read from right to left, in 4-bit blocks, to decode the reaction conditions used during each step of the synthesis. (In another embodiment, the 10 different reaction conditions in FIG. 5 would be alternatives for a single reaction step. A different set of reaction conditions, encoded by a different set of tags, would be used for the next reaction step.) To represent the synthesis code chemically, a set of distinguishable tags is used, in which the presence of a particular tag is represented by a binary " 1 ". In this specific example, twelve (12) bits were used to represent 3 reaction conditions and 3 steps and, thus, a set of twelve tagging molecules are used, T12-T1. T12 represents the leftmost binary bit and Tl represents the rightmost bit of the synthesis code. Therefore, the following tags would represent the 0111 0011 0110 synthesis code: Ti l, T10, T9, T6, T5, T3 and T2.
The manner in which tagging molecules are prepared and the manner in which tagging molecules relate to the binary bits of a synthesis code, are known in the art and described in various of the above references, each of which has been incorporated herein by reference in its entirety.
When the reaction history of a particular bead is desired, the tags associated with that bead are decoded. In one example, a chromatogram is produced showing peaks where tags are present. From the peaks, a binary code is determined. For instance, in the above example, since a chromatogram would show that T2 and T3 are present and Tl and T4 are absent, the binary code 0110 is provided, This code indicates that Reaction Condition 3 was used during the first synthesis step.
Sometimes the peaks of the chromatogram are not well defined, and thus, it is difficult to determine whether a tag is present. For instance, it is possible that during the tagging reaction, one tag may not be incorporated in a particular bead at the same quantitative level as others within the set. It is also possible that during the detagging/gas chromatograph (GC) portion of the process, some impurities can be introduced into the tag mixture, which may have retention times similar to that of one of the tags in the set. Both of these occurrences can result in chromatogram data in which the tagging code is ambiguous. An example of such is depicted in FIG. 8. In the example of FIG. 8, a 5 place binary code is represented. Clearly, tags 5 and 2 are present, while tags 1 and 3 are not. However, because the peak having the retention time in the expected range for tag 4 is of much lower height than either of the other two, its identity is in question. As a result, the binary code, from left to right, can be either 11010 or 10010.
To eliminate this type of ambiguous code reading, the encoding protocol of the present invention is used, in which a specially chosen subset of the N possible binary codes is employed. For example, if a group of codes is selected from the 32 (25) (see FIG. 6) possible codes, based on a selection criterion of only those codes where the sum of the individual digits is even, then 11010 would not be an acceptable code. Thus, 10010 must be the code that represents the sample depicted in FIG. 8. With this strategy, 2(N"1)-1 unique sets of conditions can be encoded with N tags.
Further examples in which the selective code capability of the present invention is used are depicted in FIGs. 9-10. Each of the figures depicts a sample chromatogram, and each tag within the chromatogram is labelled by CnnCLn. For example, in FIG. 9, peak 900 has tag C12CL5.
In FIG. 9, there is an ambiguity as to whether peak 902 represents the presence of a particular tag or not. Thus, the binary code for that step may be 1101 or 1001. However, based on the criterion for acceptable codes used in this example (e.g., an even number of tags), binary code 1001 (reference number 904) must be the correct code. Therefore, in accordance with the present invention, that peak does not have a tag.
Similarly, in FIG. 10, peak 1000 is in question. However, based on the present invention in which the criterion specifies no single tags, the peak is determined to be tagged. Thus, a binary "one" represents that peak (see reference number 1002). The same holds true for peak 1004. The capability of the present invention can readily be automated by creating a suitable program, in software, hardware, microcode, firmware or any combination thereof. Further, any type of computer or computer environment can be employed to provide, incorporate and/or use the capability of the present invention. One such environment is depicted in FIG. 11 and described in detail below.
In one embodiment, a computer environment 1100 includes, for instance, at least one central processing unit 1102, a main storage 1104, and one or more input/output devices 1106, each of which is described below.
As is known, central processing unit 1102 is the controlling center of computer environment 1100 and provides the sequencing and processing facilities for instruction execution, interruption action, timing functions, initial program loading and other machine related functions. The central processing unit executes at least one operating system, which as known, is used to control the operation of the computing unit by controlling the execution of other programs, controlling communication with peripheral devices and controlling use of the computer resources.
Central processing unit 1102 is coupled to main storage 1104, which is directly addressable and provides for high speed processing of data by the central processing unit. Main storage may be either physically integrated with the CPU or constructed in stand alone units.
Main storage 1104 is also coupled to one or more input/output devices 1106. These devices include, for instance, keyboards, communications controllers, teleprocessing devices, printers, magnetic storage media (e.g., tape, disks), direct access storage devices, and sensor based equipment. Data is transferred from main storage 1104 to input/output devices 1106, and from the input/output devices back to main storage. Described in detail above is an improvement of the coding process of combinatorial libraries, in which group coding is used. Group coding includes selectively choosing a subset of codes, from N possible codes, that meets one or more predetermined criterion. The subset can include codes that are selected based on a parity bit or by some other mechanism.
In one example, the code selection capability of the present invention guarantees the presence of enough tag peaks in the chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined and the "zero" bits identified reliably. The group coding thus is considered self-clocking.
In one embodiment, the selection capability of the present invention allows more than 2(N"1)-1 possibilities for N bits. Further, no bit can be claimed to be merely extraneous to the code. A bit is considered extraneous to the code when the bit is added to the code just for the sake of adding a bit and that bit can really be ignored when the code is read. With the present invention, these extraneous bits are avoided and thus, more efficient use of tags can be made.
The use of the present invention at each synthesis step can result in avoiding single-bit codes for reaction conditions. Additionally, it advantageously allows for more reliable interpretation of an individual chromatogram by using the guaranteed presence of a minimum number of tags to create an "internal standard" for the shifts in that chromatogram. Further, it allows for independent error checking of the validity of the tags at each step based on the frequency of the code occurrence and the identification of invalid codes.
Although the examples described above reference binary coding, the present invention is also applicable to higher order coding or other types of coding. Thus, these are considered a part of the claimed invention. The present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media. The media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention. The article of manufacture can be included as a part of a computer system or sold separately.
Additionally, at least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.
The flow diagrams depicted herein are just exemplary. There may be many variations to these diagrams or the steps (or operations) described therein without departing from the spirit of the invention. For instance, the steps may be performed in a differing order, or steps may be added, deleted or modified. All of these variations are considered a part of the claimed invention.
Although preferred embodiments have been depicted and described in detail herein, it will be apparent to those skilled in the relevant art that various modifications, additions, substitutions and the like can be made without departing from the spirit of the invention and these are therefore considered to be within the scope of the invention as defined in the following claims.

Claims

CLAIMSWhat is claimed is:
1. A method of determining codes usable in encoding combinatorial libraries, said method comprising:
selecting a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of said plurality of codes comprising a plurality of tags, wherein none of said plurality of codes comprises only a single tag; and
assigning selected codes to reaction conditions.
2. The method of claim 1 , wherein each of said plurality of codes is a binary code, and wherein a binary "one" within said binary code represents the presence of a particular tag.
3. The method of claim 1, wherein said selecting comprises using a predefined criterion to select said plurality of codes from N possible codes.
4. The method of claim 3, wherein said predefined criterion specifies at least one of the following:
each of said plurality of codes includes an even number of tags present therein;
each of said plurality of codes includes an odd number of tags present therein, wherein said odd number is greater than one;
each of said plurality of codes includes up to a maximal number of tags present therein;
each of said plurality of codes includes up to a maximal number of "zero" bits; and
each of said plurality of codes does not include a predetermined pattern of bits.
5. The method of claim 1, wherein said selecting comprises using a parity bit to determine which codes of N possible codes are to be selected as said plurality of codes.
6. A method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, said selecting comprising using a predefined function to select said group of codes, wherein said predefined function selects fewer than N-1 codes from said N possible codes; and
assigning selected codes to reaction conditions.
7. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of "zero" bits.
8. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of tags.
9. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of "one" bits.
10. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has an even number of tags present.
11. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has an odd number of tags present.
12. The method of claim 11, wherein said predefined function specifies that each code of said group of codes has an odd number, greater than one, of tags present.
13. The method of claim 6, wherein said predefined function comprises not including in said group of codes any code having a predetermined pattern.
14. The method of claim 6, wherein said predefined function comprises using a parity bit to select said group of codes.
15. The method of claim 6, wherein each of said plurality of codes is a binary code, and wherein a binary "one" within said binary code represents the presence of a particular tag.
16. A method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, wherein said plurality of codes satisfies a predefined criterion, said predefined criterion being other than excluding an "all zeroes" code; and
assigning selected codes to reaction conditions.
17. The method of claim 16, wherein each of said plurality of codes is a binary code, and wherein a binary "one" within said binary code represents the presence of a particular tag.
18. The method of claim 16, wherein said predefined criterion specifies at least one of the following:
each of said plurality of codes includes an even number of tags present therein;
each of said plurality of codes includes an odd number of tags present therein;
each of said plurality of codes includes up to a maximal number of tags present therein;
each of said plurality of codes includes up to a maximal number of "zero" bits; and
each of said plurality of codes does not include a predetermined pattern of bits.
19. The method of claim 16, wherein said selecting comprises using a parity bit to determine which codes of N possible codes are to be selected as said plurality of codes.
20. A system of determining codes usable in encoding combinatorial libraries, said system comprising:
means for selecting a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of said plurality of codes comprising a plurality of tags, wherein none of said plurality of codes comprises only a single tag; and
means for assigning selected codes to reaction conditions.
21. The system of claim 20, wherein said means for selecting comprises means for using a predefined criterion to select said plurality of codes from N possible codes.
22. A system of determining codes usable in encoding chemical libraries, said system comprising:
means for selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, said means for selecting comprising means for using a predefined function to select said group of codes, wherein said predefined function selects fewer than N-1 codes from said N possible codes; and
means for assigning selected codes to reaction conditions.
23. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of "zero" bits.
24. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of tags.
25. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has an even number of tags present.
26. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has an odd number of tags present.
27. The system of claim 22, wherein said predefined function comprises not including in said group of codes any code having a predetermined pattern.
28. The system of claim 22, wherein said predefined function comprises using a parity bit to select said group of codes.
29. A system of determining codes usable in encoding chemical libraries, said system comprising:
means for selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, wherein said plurality of codes satisfies a predefined criterion, said predefined criterion being other than excluding an "all zeroes" code; and
means for assigning selected codes to reaction conditions.
30. The system of claim 29, wherein said predefined criterion specifies at least one of the following:
each of said plurality of codes includes an even number of tags present therein;
each of said plurality of codes includes an odd number of tags present therein;
each of said plurality of codes includes up to a maximal number of tags present therein;
each of said plurality of codes includes up to a maximal number of "zero" bits; and
each of said plurality of codes does not include a predetermined pattern of bits.
31. The system of claim 29, wherein said means for selecting comprises means for using a parity bit to determine which codes of N possible codes are to be selected as said plurality of codes.
32. An article of manufacture, comprising:
at least one computer usable medium having computer readable program code means embodied therein for causing the determining of codes usable in encoding combinatorial libraries, the computer readable program code means in said article of manufacture comprising:
computer readable program code means for causing a computer to select a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of said plurality of codes comprising a plurality of tags, wherein none of said plurality of codes comprises only a single tag; and
computer readable program code means for causing a computer to assign selected codes to reaction conditions.
33. At least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, said selecting comprising using a predefined function to select said group of codes, wherein said predefined function selects fewer than N-1 codes from said N possible codes; and
assigning selected codes to reaction conditions.
34. At least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, wherein said plurality of codes satisfies a predefined criterion, said predefined criterion being other than excluding an "all zeroes" code; and
assigning selected codes to reaction conditions.
EP99950264A 1998-10-09 1999-10-07 Selecting codes to be used for encoding combinatorial libraries Withdrawn EP1119530A2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16942698A 1998-10-09 1998-10-09
US169426 1998-10-09
PCT/US1999/023444 WO2000021909A2 (en) 1998-10-09 1999-10-07 Selecting codes to be used for encoding combinatorial libraries

Publications (1)

Publication Number Publication Date
EP1119530A2 true EP1119530A2 (en) 2001-08-01

Family

ID=22615638

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99950264A Withdrawn EP1119530A2 (en) 1998-10-09 1999-10-07 Selecting codes to be used for encoding combinatorial libraries

Country Status (3)

Country Link
EP (1) EP1119530A2 (en)
AU (1) AU6295999A (en)
WO (1) WO2000021909A2 (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002102820A1 (en) 2001-06-20 2002-12-27 Nuevolution A/S Nucleoside derivatives for library preparation
US7413854B2 (en) 2002-03-15 2008-08-19 Nuevolution A/S Method for synthesising templated molecules
US10730906B2 (en) 2002-08-01 2020-08-04 Nuevolutions A/S Multi-step synthesis of templated molecules
AU2003266932B2 (en) 2002-09-27 2009-04-09 Mpm-Holding Aps Spatially encoded polymer matrix
EP3299463B1 (en) 2002-10-30 2020-10-21 Nuevolution A/S Enzymatic encoding
WO2004056994A2 (en) 2002-12-19 2004-07-08 Nuevolution A/S Quasirandom structure and function guided synthesis methods
WO2004074429A2 (en) 2003-02-21 2004-09-02 Nuevolution A/S Method for producing second-generation library
DE602004019764D1 (en) 2003-03-20 2009-04-16 Nuevolution As LIGATION-RELATED CODING OF SMALL MOLECULES
WO2005026387A1 (en) 2003-09-18 2005-03-24 Nuevolution A/S A method for obtaining structural information concerning an encoded molecule and method for selecting compounds
EP1730277B1 (en) 2004-03-22 2009-10-28 Nuevolution A/S Ligational encoding using building block oligonucleotides
LT3305900T (en) 2005-12-01 2021-11-10 Nuevolution A/S Enzymatic encoding methods for efficient synthesis of large libraries
US11225655B2 (en) 2010-04-16 2022-01-18 Nuevolution A/S Bi-functional complexes and methods for making and using such complexes

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5565324A (en) * 1992-10-01 1996-10-15 The Trustees Of Columbia University In The City Of New York Complex combinatorial chemical libraries encoded with tags

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0021909A3 *

Also Published As

Publication number Publication date
WO2000021909A3 (en) 2001-01-04
WO2000021909A2 (en) 2000-04-20
AU6295999A (en) 2000-05-01

Similar Documents

Publication Publication Date Title
WO2000021909A2 (en) Selecting codes to be used for encoding combinatorial libraries
EP0454985B1 (en) Scalable compound instruction set machine architecture
EP0454984B1 (en) General purpose compounding technique for instruction-level processors
CN100440138C (en) Selective writing of data elements from packed data based upon mask using predication
JP2021532799A (en) Systems and methods for storing and reading nucleic acid-based data with error protection
WO1997012508B1 (en) Transforming and manipulating program object code
CN101438230B (en) Method and system compatible for different form of addressing different address space
US6199017B1 (en) Biochemical information processing apparatus, biochemical information processing method, and biochemical information recording medium
Yi et al. Nonrandom missing data can bias principal component analysis inference of population genetic structure
JP2022531790A (en) Data structures and behaviors for search, calculation, and indexing in DNA-based data storage
Sheridan et al. Toward structure-based predictive tools for the selection of chiral stationary phases for the chromatographic separation of enantiomers
Rosenstein et al. Principles of information storage in small-molecule mixtures
Furka Forty years of combinatorial technology
US6725450B1 (en) Program conversion apparatus, processor, and record medium
Vierstraete et al. Amplicon_sorter: A tool for reference‐free amplicon sorting based on sequence similarity and for building consensus sequences
US7395523B2 (en) Method and apparatus for efficiently developing encoded instructions by tracking multiple unverified instances of repetitive code segments
CN111460849A (en) Method compatible with multiple sets of setting codes and code scanning equipment
CN101719915B (en) Method and device for realizing field decoding
Rose et al. DNA implementation of finite-state machines
CN111475234B (en) Character string transmission method and device, computer and readable storage medium
Gueudré et al. Predicting interacting protein pairs by coevolutionary paralog matching
RU2071112C1 (en) Control apparatus
Knill et al. Group testing problems in experimental molecular biology
US20080052666A1 (en) Apparatus and method for making build-block component
EP1079919A1 (en) Method of synthesizing a plurality of products

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20010502

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

17Q First examination report despatched

Effective date: 20010803

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: PHARMACOPEIA DRUG DISCOVERY, INC.

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20060926