EP1119530A2 - Selecting codes to be used for encoding combinatorial libraries - Google Patents
Selecting codes to be used for encoding combinatorial librariesInfo
- Publication number
- EP1119530A2 EP1119530A2 EP99950264A EP99950264A EP1119530A2 EP 1119530 A2 EP1119530 A2 EP 1119530A2 EP 99950264 A EP99950264 A EP 99950264A EP 99950264 A EP99950264 A EP 99950264A EP 1119530 A2 EP1119530 A2 EP 1119530A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- codes
- code
- reaction conditions
- group
- tags
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000006243 chemical reaction Methods 0.000 claims abstract description 73
- 238000000034 method Methods 0.000 claims abstract description 60
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 56
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 54
- 239000000126 substance Substances 0.000 claims description 26
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 230000008569 process Effects 0.000 abstract description 6
- 239000011324 bead Substances 0.000 description 11
- 239000003153 chemical reaction reagent Substances 0.000 description 11
- 238000012545 processing Methods 0.000 description 7
- 230000014759 maintenance of location Effects 0.000 description 3
- 238000007792 addition Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000004817 gas chromatography Methods 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 238000003965 capillary gas chromatography Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B70/00—Tags or labels specially adapted for combinatorial chemistry or libraries, e.g. fluorescent tags or bar codes
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J19/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J19/0046—Sequential or parallel reactions, e.g. for the synthesis of polypeptides or polynucleotides; Apparatus and devices for combinatorial chemistry or for making molecular arrays
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/0054—Means for coding or tagging the apparatus or the reagents
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/0054—Means for coding or tagging the apparatus or the reagents
- B01J2219/00542—Alphanumeric characters
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/0054—Means for coding or tagging the apparatus or the reagents
- B01J2219/00572—Chemical means
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/0068—Means for controlling the apparatus of the process
- B01J2219/00686—Automatic
- B01J2219/00689—Automatic using computers
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/0068—Means for controlling the apparatus of the process
- B01J2219/00695—Synthesis control routines, e.g. using computer programs
Definitions
- This invention relates, in general, to the encoding of combinatorial libraries and, in particular, to selectively choosing codes to be assigned to reaction conditions used during synthesis of a combinatorial library.
- Combinatorial techniques of chemical synthesis allow the creation of molecular libraries having immense diversity. These techniques entail a series of chemical steps with multiple choices of reaction conditions (e.g., reagents, temperature changes, etc.) for each step.
- reaction conditions e.g., reagents, temperature changes, etc.
- the complexity, or number of members in a combinatorial library is given by the product of the number of reaction conditions for each step of the synthesis, and can therefore, be quite large.
- the challenge in using combinatorial libraries is the characterization of members of the library with particular desired properties.
- tags e.g., tagging molecules
- the tags are used in combination with one another to form a binary record of the synthetic steps for each bead.
- the various reagents which can be used in any step are designated as binary 001 (Reagent 1), 010(Reagent 2), 011 (Reagent 3),... 111 (Reagent 7).
- a binary synthesis code describing any complete N-STEP synthesis using 3 x N binary digits can be written.
- Reagent 3 is used in the first step, the binary numerical description is 011. If Reagent 1 is used in the second step, the description is 001 011. Further, if Reagent 6 is used in the third step, the description is 110 001 Oi l.
- This 9-bit binary synthesis code describes the synthesis, and can be read from right to left in 3 -bit blocks to decode the reagents used in each step of the synthesis.
- a set of distinguishable, sensitively detectable molecules is used as tags, and the presence of a particular tag represents a binary " 1 " for the corresponding bit.
- T9-T1 a set of nine tagging molecules, for the above example, where T9 represents the leftmost binary bit and Tl represents the rightmost bit
- the tag mixture containing only T9, T8, T4, T2 and Tl represents the 110 001 011 synthesis code.
- Decoding is performed in order to determine the reaction history of a particular bead.
- any tag(s) attached to a bead is detached and identified to determine the particular conditions that occurred during synthesis.
- One technique for separating and identifying tags is known as Capillary Gas Chromatography (GC).
- the use of time spacing as a confirmation of whether a peak represents a tag or a contaminant is referred to herein as "self-clocking".
- the shortcomings of the prior art are overcome and additional advantages are provided through the provision of a method of determining codes usable in encoding combinatorial libraries.
- the method includes, for instance, selecting a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library.
- Each of the plurality of codes includes a plurality of tags, wherein none of the plurality of codes includes only a single tag.
- the method further includes assigning selected codes to reaction conditions.
- each of the plurality of codes is a binary code, and a binary "one" within the binary code represents the presence of a particular tag.
- the plurality of codes is selected using a predefined criterion.
- the predefined criterion specifies at least one of the following: each of the plurality of codes includes an even number of tags present therein; each of the plurality of codes includes an odd number of tags present therein; each of the plurality of codes includes up to a maximal number of tags present therein; each of the plurality of codes includes up to a maximal number of "zero" bits; and each of the plurality of codes does not include a predetermined pattern of bits.
- the plurality of codes is selected using a parity bit.
- a method of determining codes usable in encoding chemical libraries includes selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library.
- the selecting includes using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes.
- the method further includes assigning selected codes to reaction conditions.
- a method of determining codes usable in encoding chemical libraries includes selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion.
- the predefined criterion is other than excluding an "all zeroes" code.
- the method further includes assigning selected codes to reaction conditions.
- a system of determining codes usable in encoding combinatorial libraries includes means for selecting a plurality of codes to be assigned to a plurality of reaction conditions, in which each of the plurality of codes includes a plurality of tags, such that none of the plurality of codes includes only a single tag.
- the system further includes means for assigning selected codes to reaction conditions.
- a system of determining codes usable in encoding chemical libraries includes means for selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library.
- the means for selecting includes means for using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes.
- the system further includes means for assigning selected codes to reaction conditions.
- a system of determining codes usable in encoding chemical libraries includes means for selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion, wherein the predefined criterion is other than excluding an "all zeroes" code.
- the system further includes means for assigning selected codes to reaction conditions.
- an article of manufacture including at least one computer usable medium having computer readable program code means embodied therein for causing the determining of codes usable in encoding combinatorial libraries.
- the computer readable program code means includes computer readable program code means for causing a computer to select a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of the plurality of codes including a plurality of tags, wherein none of the plurality of codes includes only a single tag; and computer readable program code means for causing a computer to assign selected codes to reaction conditions.
- At least one program storage device is provided, which is readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries.
- the method includes selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library.
- the selecting includes using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes.
- the method further includes assigning selected codes to reaction conditions.
- At least one program storage device is provided, which is readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries.
- the method includes selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion.
- the predefined criterion is other than excluding an "all zeroes" code.
- the method further includes assigning selected codes to reaction conditions.
- a coding capability is provided that eliminates ambiguous code reading. Further, the capability of the present invention advantageously enables the selective choosing of codes, from N possible codes, to be assigned to reaction conditions. Further, the capability of the present invention guarantees the presence of enough tag peaks in a chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined. Additionally, the present invention is advantageously self-clocking. The use of the present invention in each synthesis step can result in the avoidance of single bit codes for reaction conditions.
- FIG. 1 depicts a generalized method of the present invention
- FIG. 2 depicts one embodiment of the logic associated with the selection capability of the present invention
- FIG. 3 depicts one example of N possible codes, in which a subset is selected (or "accepted”) therefrom, in accordance with the principles of the present invention
- FIG. 4 depicts one example of a table of accepted codes, in accordance with the principles of the present invention.
- FIG. 5 depicts one embodiment of the table of accepted codes of FIG. 4, in which the codes are assigned to reaction conditions, in accordance with the principles of the present invention
- FIG. 6 depicts another example of accepted codes, in accordance with the principles of the present invention.
- FIG. 7a depicts another example of N possible codes, in which a subset is selected (or accepted) therefrom, in accordance with the principles of the present invention
- FIG. 7b depicts one example of adding a parity bit to the N possible codes of FIG. 7a in order to make a selection of the codes to be included in the subset of accepted codes, in accordance with the principles of the present invention
- FIG. 8 depicts one example in which it is difficult to determine whether a tag is present in a sample
- FIG. 9 illustrates one example of output from a gas chromatograph, in accordance with the principles of the present invention.
- FIG. 10 depicts another example of output from a gas chromatograph, in accordance with the principles of the present invention.
- FIG. 11 depicts one embodiment of a computer environment providing and/or using the capability of the present invention.
- a capability in which codes to be assigned to reaction conditions are selectively chosen based on one or more criterion. That is, out of N possible codes, a subset of the N codes is selected based on some constraint or some predefined function.
- the codes are, for instance, binary codes, and each code represents the tags used during a particular stage of synthesis of members of a combinatorial library. Specifically, the tags define the reaction condition used during that particular stage of synthesis.
- a generalized technique of one embodiment of the present invention is described with reference to FIG. 1.
- a group of acceptable codes is selectively chosen, from N possible codes, based upon a predefined function or one or more criterion, as described in detail below, STEP 100.
- the selected codes are assigned to reaction conditions, and those conditions may be used during synthesis of library members, STEP 102.
- the assignment may be arbitrary or may be based on one or more factors. For instance, the first code may be assigned to the first condition to be used during synthesis, etc.
- the criterion can include many different possibilities.
- the criterion can select codes having an even number of "one" bits (i.e., an even number of tags); an odd number of "one” bits; an odd number, greater than one, of "one” bits; more than one tag (i.e., more than one binary "one” bit); up to a total number of "zero” bits; up to a total number of "one” bits; up to a maximal allowed sequence of "zero” or "one” bits; an even parity; an odd parity; codes that do not include a predefined sequence of bits (e.g., the codes that do not include a sequence of 101), etc.
- the above criteria are only provided as examples. There are many more possibilities, and each of those possibilities is considered a part of the claimed invention.
- the predefined function includes all those codes that have up to a total of two "zeroes".
- the process continues with determining which of the codes of FIG. 3 meet this criterion.
- a code is considered from the possible codes, STEP 202.
- code 0000 is considered from the codes depicted in FIG. 3.
- a determination is made as to whether the considered code meets the criterion of having no more than two zeroes, STEP 204. Since code 0000 has more than two zeroes, it does not meet the criterion, and thus, is not an acceptable code.
- next code is 0001. Again, since code 0001 does not meet the criterion, STEP 204, and there are more codes, INQUIRY 206, another code is selected. This procedure continues. At some point, a code is selected that does meet the criterion, INQUIRY 204. For instance, code 0011 meets the criterion of having at a maximum two "zeroes", thus, this code is selected as an acceptable code.
- the acceptable code is placed in a table of acceptable codes, STEP 208.
- a table of acceptable codes STEP 208.
- One instance of such a table is depicted in FIG. 4. The above process continues until all of the acceptable codes are selected from the possible codes based on the predefined criterion.
- each of the acceptable codes can be assigned, respectively, to one of a plurality of reaction conditions, as depicted in FIG. 5.
- ten reaction conditions are represented by ten unique codes.
- Another example of a predefined function used to select acceptable codes from N possible codes is one in which all codes having an even number of tags is selected. This is depicted in FIG. 6. In this particular example, five bits are considered necessary to represent the various reaction conditions to be used during synthesis. Thus, there are 32 possible codes. Out of the 32 possible codes, 15 codes are acceptable (designated by "used" in the table). That is, 15 codes meet the criterion of having an even number of tags.
- parity bits may be added to the N possible codes in order to determine the acceptable codes. For example, assume there are eight possible codes, as shown in FIG. 7a. Further, assume that even parity is to be used, in this example. Thus, for each code in which the addition of a binary "one" provides an even number of "one” bits, a parity "one” bit is added, as shown in FIG. 7b.
- the codes having the parity "one" bit are selected from the N possible codes.
- the following codes are chosen: 0011, 0101, 1001 and 1111.
- each code is a binary code
- each binary "one" represents the existence of a tag and each binary "zero" represents the absence of a tag.
- the tags represented by the binary code associated with that reaction condition are also added during that synthesis step. This provides a record of the conditions used during synthesis of a particular library member.
- Reaction Condition 3 (FIG. 5) is used during a first synthesis step
- Reaction Condition 1 is used during a second step
- Reaction Condition 6 is used during a third step
- the synthesis code representing the three reaction conditions is 0111 0011 0110.
- This synthesis code can be read from right to left, in 4-bit blocks, to decode the reaction conditions used during each step of the synthesis.
- the 10 different reaction conditions in FIG. 5 would be alternatives for a single reaction step.
- a different set of reaction conditions, encoded by a different set of tags, would be used for the next reaction step.
- T12-T1 twelve (12) bits were used to represent 3 reaction conditions and 3 steps and, thus, a set of twelve tagging molecules are used, T12-T1.
- T12 represents the leftmost binary bit and Tl represents the rightmost bit of the synthesis code. Therefore, the following tags would represent the 0111 0011 0110 synthesis code: Ti l, T10, T9, T6, T5, T3 and T2.
- the tags associated with that bead are decoded.
- a chromatogram is produced showing peaks where tags are present. From the peaks, a binary code is determined. For instance, in the above example, since a chromatogram would show that T2 and T3 are present and Tl and T4 are absent, the binary code 0110 is provided, This code indicates that Reaction Condition 3 was used during the first synthesis step.
- the peaks of the chromatogram are not well defined, and thus, it is difficult to determine whether a tag is present. For instance, it is possible that during the tagging reaction, one tag may not be incorporated in a particular bead at the same quantitative level as others within the set. It is also possible that during the detagging/gas chromatograph (GC) portion of the process, some impurities can be introduced into the tag mixture, which may have retention times similar to that of one of the tags in the set. Both of these occurrences can result in chromatogram data in which the tagging code is ambiguous. An example of such is depicted in FIG. 8. In the example of FIG. 8, a 5 place binary code is represented. Clearly, tags 5 and 2 are present, while tags 1 and 3 are not. However, because the peak having the retention time in the expected range for tag 4 is of much lower height than either of the other two, its identity is in question. As a result, the binary code, from left to right, can be either 11010 or 10010.
- the encoding protocol of the present invention is used, in which a specially chosen subset of the N possible binary codes is employed. For example, if a group of codes is selected from the 32 (2 5 ) (see FIG. 6) possible codes, based on a selection criterion of only those codes where the sum of the individual digits is even, then 11010 would not be an acceptable code. Thus, 10010 must be the code that represents the sample depicted in FIG. 8. With this strategy, 2 (N"1) -1 unique sets of conditions can be encoded with N tags.
- FIGs. 9-10 depict a sample chromatogram, and each tag within the chromatogram is labelled by CnnCLn.
- peak 900 has tag C12CL5.
- binary code 1001 reference number 904 must be the correct code. Therefore, in accordance with the present invention, that peak does not have a tag.
- peak 1000 is in question.
- the peak is determined to be tagged.
- a binary "one" represents that peak (see reference number 1002).
- peak 1004. The capability of the present invention can readily be automated by creating a suitable program, in software, hardware, microcode, firmware or any combination thereof.
- any type of computer or computer environment can be employed to provide, incorporate and/or use the capability of the present invention.
- One such environment is depicted in FIG. 11 and described in detail below.
- a computer environment 1100 includes, for instance, at least one central processing unit 1102, a main storage 1104, and one or more input/output devices 1106, each of which is described below.
- central processing unit 1102 is the controlling center of computer environment 1100 and provides the sequencing and processing facilities for instruction execution, interruption action, timing functions, initial program loading and other machine related functions.
- the central processing unit executes at least one operating system, which as known, is used to control the operation of the computing unit by controlling the execution of other programs, controlling communication with peripheral devices and controlling use of the computer resources.
- Central processing unit 1102 is coupled to main storage 1104, which is directly addressable and provides for high speed processing of data by the central processing unit.
- Main storage may be either physically integrated with the CPU or constructed in stand alone units.
- Main storage 1104 is also coupled to one or more input/output devices 1106. These devices include, for instance, keyboards, communications controllers, teleprocessing devices, printers, magnetic storage media (e.g., tape, disks), direct access storage devices, and sensor based equipment. Data is transferred from main storage 1104 to input/output devices 1106, and from the input/output devices back to main storage. Described in detail above is an improvement of the coding process of combinatorial libraries, in which group coding is used. Group coding includes selectively choosing a subset of codes, from N possible codes, that meets one or more predetermined criterion. The subset can include codes that are selected based on a parity bit or by some other mechanism.
- the code selection capability of the present invention guarantees the presence of enough tag peaks in the chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined and the "zero" bits identified reliably.
- the group coding thus is considered self-clocking.
- the selection capability of the present invention allows more than 2 (N"1) -1 possibilities for N bits. Further, no bit can be claimed to be merely extraneous to the code. A bit is considered extraneous to the code when the bit is added to the code just for the sake of adding a bit and that bit can really be ignored when the code is read. With the present invention, these extraneous bits are avoided and thus, more efficient use of tags can be made.
- the use of the present invention at each synthesis step can result in avoiding single-bit codes for reaction conditions. Additionally, it advantageously allows for more reliable interpretation of an individual chromatogram by using the guaranteed presence of a minimum number of tags to create an "internal standard" for the shifts in that chromatogram. Further, it allows for independent error checking of the validity of the tags at each step based on the frequency of the code occurrence and the identification of invalid codes.
- the present invention is also applicable to higher order coding or other types of coding. Thus, these are considered a part of the claimed invention.
- the present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media.
- the media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention.
- the article of manufacture can be included as a part of a computer system or sold separately.
- At least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biochemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Codes to be used for encoding combinatorial libraries are selectively chosen based on one or more predefined function or criterion. In particular, a subset of N possible codes is selected based on some criterion. In one example, the codes are binary codes, and each code represents the tags used during a particular stage of synthesis of members of a combinatorial library. The tags define the reaction condition used during that particular stage of synthesis. In one embodiment, the predefined criterion ensures that each code includes more than one tag. This helps eliminate ambiguity during a decoding process in which the tags are identified to determine the reaction history during synthesis.
Description
SELECTING CODES TO BE USED FOR ENCODING COMBINATORIAL LIBRARIES
TECHNICAL FIELD
This invention relates, in general, to the encoding of combinatorial libraries and, in particular, to selectively choosing codes to be assigned to reaction conditions used during synthesis of a combinatorial library.
BACKGROUND ART
Combinatorial techniques of chemical synthesis allow the creation of molecular libraries having immense diversity. These techniques entail a series of chemical steps with multiple choices of reaction conditions (e.g., reagents, temperature changes, etc.) for each step. The complexity, or number of members in a combinatorial library, is given by the product of the number of reaction conditions for each step of the synthesis, and can therefore, be quite large. The challenge in using combinatorial libraries is the characterization of members of the library with particular desired properties.
One solution to the above challenge is to use a split synthesis or direct divide technique to perform chemical synthesis on solid particles, such as beads. Through a protocol of separating and mixing beads during the synthesis, each bead in the final library has a product from a single, specific reaction sequence chemically bound to it, and that product is likely to differ from that bound to another bead.
During each step of the synthesis, zero or more tags (e.g., tagging molecules) are attached to each bead in order to encode the reaction condition used in that step, as well as the step number.
In one embodiment, the tags are used in combination with one another to form a binary record of the synthetic steps for each bead. For example, assume a combinatorial synthesis using any of seven different reagents in each of N steps is to be carried out. Such a combinatorial synthesis would yield 7N different final products. As an example, the various reagents which can be used in any step are designated as binary 001 (Reagent 1), 010(Reagent 2), 011 (Reagent 3),... 111 (Reagent 7). Thus, a binary synthesis code describing any complete N-STEP synthesis using 3 x N binary digits can be written.
For instance, if Reagent 3 is used in the first step, the binary numerical description is 011. If Reagent 1 is used in the second step, the description is 001 011. Further, if Reagent 6 is used in the third step, the description is 110 001 Oi l. This 9-bit binary synthesis code describes the synthesis, and can be read from right to left in 3 -bit blocks to decode the reagents used in each step of the synthesis.
To represent such a synthesis code chemically, a set of distinguishable, sensitively detectable molecules is used as tags, and the presence of a particular tag represents a binary " 1 " for the corresponding bit. Using a set of nine tagging molecules, T9-T1, for the above example, where T9 represents the leftmost binary bit and Tl represents the rightmost bit, the tag mixture containing only T9, T8, T4, T2 and Tl represents the 110 001 011 synthesis code.
The use of tags and various encoding techniques are described in detail in one or more of the following references, each of which is hereby incorporated herein by reference in its entirety: Ohlmeyer et al., "Complex synthetic chemical libraries indexed with molecular tags", Proceedings Of The National Academy Of Sciences Of The United States Of America, Vol. 90, No. 23, pp. 10922-10926 (December 1993); J.J. Baldwin, "Design, synthesis and use of binary encoded synthetic chemical libraries", Molecular Diversity, Vol. 2, No. 1/2, pp. 81-88 (October 1996); Burbaum et al., "A paradigm for drug discovery employing encoded combinatorial libraries", Proceedings Of The National Academy Of
Sciences Of The United States Of America, Nol. 92, No. 13, pp. 6027-6031 (June 1995); Still et al., U.S. Patent No. 5,565,324, entitled "Complex Combinatorial Chemical Libraries Encoded With Tags", issued on October 15, 1996; Baldwin et al., U.S. Patent No. 5,618,825, entitled "Combinatorial Sulfonamide Library", issued on April 08, 1997; Baldwin et al, U.S. Patent No. 5,663,046, entitled
"Synthesis Of Combinatorial Libraries", issued on September 02, 1997; Still et al., International Publication No. WO 94/08051, entitled "Complex Combinatorial Chemical Libraries Encoded With Tags", International Publication Date April 14, 1994; Dower et al., U.S. Patent No. 5,639,603, entitled "Synthesizing And Screening Molecular Diversity", issued on June 17, 1997; and Dower et al.,
International Publication No. WO 93/06121, entitled "Method Of Synthesizing Diverse Collections Of Oligomers", International Publication Date April 01, 1993.
While binary coding has been established as a viable technique in encoding complex combinatorial libraries, there are some shortcomings with the present techniques, especially during decoding of the tags.
Decoding is performed in order to determine the reaction history of a particular bead. In particular, during decoding, any tag(s) attached to a bead is detached and identified to determine the particular conditions that occurred during synthesis. One technique for separating and identifying tags is known as Capillary Gas Chromatography (GC).
During decoding, it is sometimes difficult to determine whether a tag is present due to impurities of similar retention times or tags present in low amounts. Thus, the decoding becomes ambiguous, and it is difficult, under those circumstances, to determine the appropriate binary code that represents the reaction history.
Based on the foregoing, a need exists for a coding technique that eliminates ambiguous code reading. A further need exists for a capability that
enables the selective choosing of codes, from N possible codes, to be assigned to reaction conditions. A yet further need exists for a capability that guarantees the presence of enough tag peaks in a chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined. That is, a need exists for the presence of at least two tag peaks, so that the time distance (i.e., relative timing) between those peaks can be determined, even if there is significant variability in their positions (i.e., absolute timing). A further need exists for a capability that uses this time distance to confirm whether a particular peak represents the presence of a tag or a contaminant (e.g., substantially constant time spacing indicates the presence of a tag). The use of time spacing as a confirmation of whether a peak represents a tag or a contaminant is referred to herein as "self-clocking".
SUMMARY OF THE INVENTION
The shortcomings of the prior art are overcome and additional advantages are provided through the provision of a method of determining codes usable in encoding combinatorial libraries. The method includes, for instance, selecting a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library. Each of the plurality of codes includes a plurality of tags, wherein none of the plurality of codes includes only a single tag. The method further includes assigning selected codes to reaction conditions.
In one embodiment, each of the plurality of codes is a binary code, and a binary "one" within the binary code represents the presence of a particular tag.
In another embodiment of the invention, the plurality of codes is selected using a predefined criterion. The predefined criterion specifies at least one of the following: each of the plurality of codes includes an even number of tags present therein; each of the plurality of codes includes an odd number of tags present therein; each of the plurality of codes includes up to a maximal number of tags
present therein; each of the plurality of codes includes up to a maximal number of "zero" bits; and each of the plurality of codes does not include a predetermined pattern of bits.
In another embodiment, the plurality of codes is selected using a parity bit.
In a further aspect of the present invention, a method of determining codes usable in encoding chemical libraries is provided. The method includes selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library. The selecting includes using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes. The method further includes assigning selected codes to reaction conditions.
In yet a further aspect of the present invention, a method of determining codes usable in encoding chemical libraries is provided. The method includes selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion. The predefined criterion is other than excluding an "all zeroes" code. The method further includes assigning selected codes to reaction conditions.
In a further aspect of the present invention, a system of determining codes usable in encoding combinatorial libraries is provided. The system includes means for selecting a plurality of codes to be assigned to a plurality of reaction conditions, in which each of the plurality of codes includes a plurality of tags, such that none of the plurality of codes includes only a single tag. The system further includes means for assigning selected codes to reaction conditions.
In another aspect of the present invention, a system of determining codes usable in encoding chemical libraries is provided. The system includes means for selecting, from N possible codes, a group of codes to be assigned to a plurality of
reaction conditions usable during synthesis of a chemical library. The means for selecting includes means for using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes. The system further includes means for assigning selected codes to reaction conditions.
In yet a further aspect of the present invention, a system of determining codes usable in encoding chemical libraries is provided. The system includes means for selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion, wherein the predefined criterion is other than excluding an "all zeroes" code. The system further includes means for assigning selected codes to reaction conditions.
In another aspect of the present invention, an article of manufacture is provided, including at least one computer usable medium having computer readable program code means embodied therein for causing the determining of codes usable in encoding combinatorial libraries. The computer readable program code means includes computer readable program code means for causing a computer to select a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of the plurality of codes including a plurality of tags, wherein none of the plurality of codes includes only a single tag; and computer readable program code means for causing a computer to assign selected codes to reaction conditions.
In yet another aspect of the present invention, at least one program storage device is provided, which is readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries. The method includes selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library. The selecting
includes using a predefined function to select the group of codes, wherein the predefined function selects fewer than N-1 codes from the N possible codes. The method further includes assigning selected codes to reaction conditions.
In a further aspect of the present invention, at least one program storage device is provided, which is readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries. The method includes selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions, wherein the plurality of codes satisfies a predefined criterion. The predefined criterion is other than excluding an "all zeroes" code. The method further includes assigning selected codes to reaction conditions.
In accordance with the principles of the present invention, a coding capability is provided that eliminates ambiguous code reading. Further, the capability of the present invention advantageously enables the selective choosing of codes, from N possible codes, to be assigned to reaction conditions. Further, the capability of the present invention guarantees the presence of enough tag peaks in a chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined. Additionally, the present invention is advantageously self-clocking. The use of the present invention in each synthesis step can result in the avoidance of single bit codes for reaction conditions.
Additional features and advantages are realized through the techniques of the present invention. Other embodiments and aspects of the invention are described in detail herein and are considered a part of the claimed invention.
BRIEF DESCRIPTION OF THE DRAWINGS
The subject matter which is regarded as the invention is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the invention will be apparent from the following detailed description taken in conjunction with the accompanying drawings in which:
FIG. 1 depicts a generalized method of the present invention;
FIG. 2 depicts one embodiment of the logic associated with the selection capability of the present invention;
FIG. 3 depicts one example of N possible codes, in which a subset is selected (or "accepted") therefrom, in accordance with the principles of the present invention;
FIG. 4 depicts one example of a table of accepted codes, in accordance with the principles of the present invention;
FIG. 5 depicts one embodiment of the table of accepted codes of FIG. 4, in which the codes are assigned to reaction conditions, in accordance with the principles of the present invention;
FIG. 6 depicts another example of accepted codes, in accordance with the principles of the present invention;
FIG. 7a depicts another example of N possible codes, in which a subset is selected (or accepted) therefrom, in accordance with the principles of the present invention;
FIG. 7b depicts one example of adding a parity bit to the N possible codes of FIG. 7a in order to make a selection of the codes to be
included in the subset of accepted codes, in accordance with the principles of the present invention;
FIG. 8 depicts one example in which it is difficult to determine whether a tag is present in a sample;
FIG. 9 illustrates one example of output from a gas chromatograph, in accordance with the principles of the present invention;
FIG. 10 depicts another example of output from a gas chromatograph, in accordance with the principles of the present invention; and
FIG. 11 depicts one embodiment of a computer environment providing and/or using the capability of the present invention.
BEST MODE FOR CARRYING OUT THE INVENTION
In accordance with the principles of the present invention, a capability is provided in which codes to be assigned to reaction conditions are selectively chosen based on one or more criterion. That is, out of N possible codes, a subset of the N codes is selected based on some constraint or some predefined function.
The codes are, for instance, binary codes, and each code represents the tags used during a particular stage of synthesis of members of a combinatorial library. Specifically, the tags define the reaction condition used during that particular stage of synthesis.
A generalized technique of one embodiment of the present invention is described with reference to FIG. 1. Initially, a group of acceptable codes is selectively chosen, from N possible codes, based upon a predefined function or
one or more criterion, as described in detail below, STEP 100. Thereafter, the selected codes are assigned to reaction conditions, and those conditions may be used during synthesis of library members, STEP 102. The assignment may be arbitrary or may be based on one or more factors. For instance, the first code may be assigned to the first condition to be used during synthesis, etc.
One embodiment of the selection process used to choose a group of codes to be assigned to the reaction conditions is described in detail with reference to FIG. 2. Initially, a decision is made as to the N possible codes that could be used for a particular synthesis step, STEP 200. For example, a determination is made as to how many bits are sufficient to represent the number of reaction conditions to be used during that step. Assume for this one example that four bits are sufficient. Thus, in a binary scheme, there are 24 possible codes (i.e., N=16), as shown in FIG. 3.
Out of the possible codes, only those codes that satisfy one or more criterion (or predefined function) are selected. The criterion can include many different possibilities. As examples, the criterion can select codes having an even number of "one" bits (i.e., an even number of tags); an odd number of "one" bits; an odd number, greater than one, of "one" bits; more than one tag (i.e., more than one binary "one" bit); up to a total number of "zero" bits; up to a total number of "one" bits; up to a maximal allowed sequence of "zero" or "one" bits; an even parity; an odd parity; codes that do not include a predefined sequence of bits (e.g., the codes that do not include a sequence of 101), etc. The above criteria are only provided as examples. There are many more possibilities, and each of those possibilities is considered a part of the claimed invention.
For illustration purposes only, the predefined function, in this example, includes all those codes that have up to a total of two "zeroes". Thus, the process continues with determining which of the codes of FIG. 3 meet this criterion.
Returning to FIG. 2, a code is considered from the possible codes, STEP 202. For example, code 0000 is considered from the codes depicted in FIG. 3. A determination is made as to whether the considered code meets the criterion of having no more than two zeroes, STEP 204. Since code 0000 has more than two zeroes, it does not meet the criterion, and thus, is not an acceptable code.
Therefore, if there are further codes to be considered, INQUIRY 206, a new code is considered, STEP 202.
Proceeding sequentially down the list of the possible codes illustrated in FIG. 3, the next code is 0001. Again, since code 0001 does not meet the criterion, STEP 204, and there are more codes, INQUIRY 206, another code is selected. This procedure continues. At some point, a code is selected that does meet the criterion, INQUIRY 204. For instance, code 0011 meets the criterion of having at a maximum two "zeroes", thus, this code is selected as an acceptable code.
In one example, the acceptable code is placed in a table of acceptable codes, STEP 208. One instance of such a table is depicted in FIG. 4. The above process continues until all of the acceptable codes are selected from the possible codes based on the predefined criterion.
As described with reference to FIG. 1 , after the acceptable codes are determined, each of the acceptable codes can be assigned, respectively, to one of a plurality of reaction conditions, as depicted in FIG. 5. In this particular case, ten reaction conditions are represented by ten unique codes.
Another example of a predefined function used to select acceptable codes from N possible codes is one in which all codes having an even number of tags is selected. This is depicted in FIG. 6. In this particular example, five bits are considered necessary to represent the various reaction conditions to be used during synthesis. Thus, there are 32 possible codes. Out of the 32 possible codes,
15 codes are acceptable (designated by "used" in the table). That is, 15 codes meet the criterion of having an even number of tags.
In yet another embodiment of the present invention, parity bits may be added to the N possible codes in order to determine the acceptable codes. For example, assume there are eight possible codes, as shown in FIG. 7a. Further, assume that even parity is to be used, in this example. Thus, for each code in which the addition of a binary "one" provides an even number of "one" bits, a parity "one" bit is added, as shown in FIG. 7b.
Thereafter, the codes having the parity "one" bit are selected from the N possible codes. In the example depicted in FIG. 7b, the following codes are chosen: 0011, 0101, 1001 and 1111.
In the above examples in which each code is a binary code, each binary "one" represents the existence of a tag and each binary "zero" represents the absence of a tag. When a reaction condition is used in a particular synthesis step, the tags represented by the binary code associated with that reaction condition are also added during that synthesis step. This provides a record of the conditions used during synthesis of a particular library member.
For example, assume Reaction Condition 3 (FIG. 5) is used during a first synthesis step, Reaction Condition 1 is used during a second step, and Reaction Condition 6 is used during a third step, then the synthesis code representing the three reaction conditions is 0111 0011 0110. This synthesis code can be read from right to left, in 4-bit blocks, to decode the reaction conditions used during each step of the synthesis. (In another embodiment, the 10 different reaction conditions in FIG. 5 would be alternatives for a single reaction step. A different set of reaction conditions, encoded by a different set of tags, would be used for the next reaction step.)
To represent the synthesis code chemically, a set of distinguishable tags is used, in which the presence of a particular tag is represented by a binary " 1 ". In this specific example, twelve (12) bits were used to represent 3 reaction conditions and 3 steps and, thus, a set of twelve tagging molecules are used, T12-T1. T12 represents the leftmost binary bit and Tl represents the rightmost bit of the synthesis code. Therefore, the following tags would represent the 0111 0011 0110 synthesis code: Ti l, T10, T9, T6, T5, T3 and T2.
The manner in which tagging molecules are prepared and the manner in which tagging molecules relate to the binary bits of a synthesis code, are known in the art and described in various of the above references, each of which has been incorporated herein by reference in its entirety.
When the reaction history of a particular bead is desired, the tags associated with that bead are decoded. In one example, a chromatogram is produced showing peaks where tags are present. From the peaks, a binary code is determined. For instance, in the above example, since a chromatogram would show that T2 and T3 are present and Tl and T4 are absent, the binary code 0110 is provided, This code indicates that Reaction Condition 3 was used during the first synthesis step.
Sometimes the peaks of the chromatogram are not well defined, and thus, it is difficult to determine whether a tag is present. For instance, it is possible that during the tagging reaction, one tag may not be incorporated in a particular bead at the same quantitative level as others within the set. It is also possible that during the detagging/gas chromatograph (GC) portion of the process, some impurities can be introduced into the tag mixture, which may have retention times similar to that of one of the tags in the set. Both of these occurrences can result in chromatogram data in which the tagging code is ambiguous. An example of such is depicted in FIG. 8.
In the example of FIG. 8, a 5 place binary code is represented. Clearly, tags 5 and 2 are present, while tags 1 and 3 are not. However, because the peak having the retention time in the expected range for tag 4 is of much lower height than either of the other two, its identity is in question. As a result, the binary code, from left to right, can be either 11010 or 10010.
To eliminate this type of ambiguous code reading, the encoding protocol of the present invention is used, in which a specially chosen subset of the N possible binary codes is employed. For example, if a group of codes is selected from the 32 (25) (see FIG. 6) possible codes, based on a selection criterion of only those codes where the sum of the individual digits is even, then 11010 would not be an acceptable code. Thus, 10010 must be the code that represents the sample depicted in FIG. 8. With this strategy, 2(N"1)-1 unique sets of conditions can be encoded with N tags.
Further examples in which the selective code capability of the present invention is used are depicted in FIGs. 9-10. Each of the figures depicts a sample chromatogram, and each tag within the chromatogram is labelled by CnnCLn. For example, in FIG. 9, peak 900 has tag C12CL5.
In FIG. 9, there is an ambiguity as to whether peak 902 represents the presence of a particular tag or not. Thus, the binary code for that step may be 1101 or 1001. However, based on the criterion for acceptable codes used in this example (e.g., an even number of tags), binary code 1001 (reference number 904) must be the correct code. Therefore, in accordance with the present invention, that peak does not have a tag.
Similarly, in FIG. 10, peak 1000 is in question. However, based on the present invention in which the criterion specifies no single tags, the peak is determined to be tagged. Thus, a binary "one" represents that peak (see reference number 1002). The same holds true for peak 1004.
The capability of the present invention can readily be automated by creating a suitable program, in software, hardware, microcode, firmware or any combination thereof. Further, any type of computer or computer environment can be employed to provide, incorporate and/or use the capability of the present invention. One such environment is depicted in FIG. 11 and described in detail below.
In one embodiment, a computer environment 1100 includes, for instance, at least one central processing unit 1102, a main storage 1104, and one or more input/output devices 1106, each of which is described below.
As is known, central processing unit 1102 is the controlling center of computer environment 1100 and provides the sequencing and processing facilities for instruction execution, interruption action, timing functions, initial program loading and other machine related functions. The central processing unit executes at least one operating system, which as known, is used to control the operation of the computing unit by controlling the execution of other programs, controlling communication with peripheral devices and controlling use of the computer resources.
Central processing unit 1102 is coupled to main storage 1104, which is directly addressable and provides for high speed processing of data by the central processing unit. Main storage may be either physically integrated with the CPU or constructed in stand alone units.
Main storage 1104 is also coupled to one or more input/output devices 1106. These devices include, for instance, keyboards, communications controllers, teleprocessing devices, printers, magnetic storage media (e.g., tape, disks), direct access storage devices, and sensor based equipment. Data is transferred from main storage 1104 to input/output devices 1106, and from the input/output devices back to main storage.
Described in detail above is an improvement of the coding process of combinatorial libraries, in which group coding is used. Group coding includes selectively choosing a subset of codes, from N possible codes, that meets one or more predetermined criterion. The subset can include codes that are selected based on a parity bit or by some other mechanism.
In one example, the code selection capability of the present invention guarantees the presence of enough tag peaks in the chromatogram that, even in the presence of significant variability in the absolute timing of the run, the relative timing can be determined and the "zero" bits identified reliably. The group coding thus is considered self-clocking.
In one embodiment, the selection capability of the present invention allows more than 2(N"1)-1 possibilities for N bits. Further, no bit can be claimed to be merely extraneous to the code. A bit is considered extraneous to the code when the bit is added to the code just for the sake of adding a bit and that bit can really be ignored when the code is read. With the present invention, these extraneous bits are avoided and thus, more efficient use of tags can be made.
The use of the present invention at each synthesis step can result in avoiding single-bit codes for reaction conditions. Additionally, it advantageously allows for more reliable interpretation of an individual chromatogram by using the guaranteed presence of a minimum number of tags to create an "internal standard" for the shifts in that chromatogram. Further, it allows for independent error checking of the validity of the tags at each step based on the frequency of the code occurrence and the identification of invalid codes.
Although the examples described above reference binary coding, the present invention is also applicable to higher order coding or other types of coding. Thus, these are considered a part of the claimed invention.
The present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media. The media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention. The article of manufacture can be included as a part of a computer system or sold separately.
Additionally, at least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.
The flow diagrams depicted herein are just exemplary. There may be many variations to these diagrams or the steps (or operations) described therein without departing from the spirit of the invention. For instance, the steps may be performed in a differing order, or steps may be added, deleted or modified. All of these variations are considered a part of the claimed invention.
Although preferred embodiments have been depicted and described in detail herein, it will be apparent to those skilled in the relevant art that various modifications, additions, substitutions and the like can be made without departing from the spirit of the invention and these are therefore considered to be within the scope of the invention as defined in the following claims.
Claims
1. A method of determining codes usable in encoding combinatorial libraries, said method comprising:
selecting a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of said plurality of codes comprising a plurality of tags, wherein none of said plurality of codes comprises only a single tag; and
assigning selected codes to reaction conditions.
2. The method of claim 1 , wherein each of said plurality of codes is a binary code, and wherein a binary "one" within said binary code represents the presence of a particular tag.
3. The method of claim 1, wherein said selecting comprises using a predefined criterion to select said plurality of codes from N possible codes.
4. The method of claim 3, wherein said predefined criterion specifies at least one of the following:
each of said plurality of codes includes an even number of tags present therein;
each of said plurality of codes includes an odd number of tags present therein, wherein said odd number is greater than one;
each of said plurality of codes includes up to a maximal number of tags present therein;
each of said plurality of codes includes up to a maximal number of "zero" bits; and
each of said plurality of codes does not include a predetermined pattern of bits.
5. The method of claim 1, wherein said selecting comprises using a parity bit to determine which codes of N possible codes are to be selected as said plurality of codes.
6. A method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, said selecting comprising using a predefined function to select said group of codes, wherein said predefined function selects fewer than N-1 codes from said N possible codes; and
assigning selected codes to reaction conditions.
7. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of "zero" bits.
8. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of tags.
9. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of "one" bits.
10. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has an even number of tags present.
11. The method of claim 6, wherein said predefined function specifies that each code of said group of codes has an odd number of tags present.
12. The method of claim 11, wherein said predefined function specifies that each code of said group of codes has an odd number, greater than one, of tags present.
13. The method of claim 6, wherein said predefined function comprises not including in said group of codes any code having a predetermined pattern.
14. The method of claim 6, wherein said predefined function comprises using a parity bit to select said group of codes.
15. The method of claim 6, wherein each of said plurality of codes is a binary code, and wherein a binary "one" within said binary code represents the presence of a particular tag.
16. A method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, wherein said plurality of codes satisfies a predefined criterion, said predefined criterion being other than excluding an "all zeroes" code; and
assigning selected codes to reaction conditions.
17. The method of claim 16, wherein each of said plurality of codes is a binary code, and wherein a binary "one" within said binary code represents the presence of a particular tag.
18. The method of claim 16, wherein said predefined criterion specifies at least one of the following:
each of said plurality of codes includes an even number of tags present therein;
each of said plurality of codes includes an odd number of tags present therein;
each of said plurality of codes includes up to a maximal number of tags present therein;
each of said plurality of codes includes up to a maximal number of "zero" bits; and
each of said plurality of codes does not include a predetermined pattern of bits.
19. The method of claim 16, wherein said selecting comprises using a parity bit to determine which codes of N possible codes are to be selected as said plurality of codes.
20. A system of determining codes usable in encoding combinatorial libraries, said system comprising:
means for selecting a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of said plurality of codes comprising a plurality of tags, wherein none of said plurality of codes comprises only a single tag; and
means for assigning selected codes to reaction conditions.
21. The system of claim 20, wherein said means for selecting comprises means for using a predefined criterion to select said plurality of codes from N possible codes.
22. A system of determining codes usable in encoding chemical libraries, said system comprising:
means for selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, said means for selecting comprising means for using a predefined function to select said group of codes, wherein said predefined function selects fewer than N-1 codes from said N possible codes; and
means for assigning selected codes to reaction conditions.
23. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of "zero" bits.
24. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has up to a maximal number of tags.
25. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has an even number of tags present.
26. The system of claim 22, wherein said predefined function specifies that each code of said group of codes has an odd number of tags present.
27. The system of claim 22, wherein said predefined function comprises not including in said group of codes any code having a predetermined pattern.
28. The system of claim 22, wherein said predefined function comprises using a parity bit to select said group of codes.
29. A system of determining codes usable in encoding chemical libraries, said system comprising:
means for selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, wherein said plurality of codes satisfies a predefined criterion, said predefined criterion being other than excluding an "all zeroes" code; and
means for assigning selected codes to reaction conditions.
30. The system of claim 29, wherein said predefined criterion specifies at least one of the following:
each of said plurality of codes includes an even number of tags present therein;
each of said plurality of codes includes an odd number of tags present therein;
each of said plurality of codes includes up to a maximal number of tags present therein;
each of said plurality of codes includes up to a maximal number of "zero" bits; and
each of said plurality of codes does not include a predetermined pattern of bits.
31. The system of claim 29, wherein said means for selecting comprises means for using a parity bit to determine which codes of N possible codes are to be selected as said plurality of codes.
32. An article of manufacture, comprising:
at least one computer usable medium having computer readable program code means embodied therein for causing the determining of codes usable in encoding combinatorial libraries, the computer readable program code means in said article of manufacture comprising:
computer readable program code means for causing a computer to select a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a combinatorial library, each of said plurality of codes comprising a plurality of tags, wherein none of said plurality of codes comprises only a single tag; and
computer readable program code means for causing a computer to assign selected codes to reaction conditions.
33. At least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a group of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, said selecting comprising using a predefined function to select said group of codes, wherein said predefined function selects fewer than N-1 codes from said N possible codes; and
assigning selected codes to reaction conditions.
34. At least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform a method of determining codes usable in encoding chemical libraries, said method comprising:
selecting, from N possible codes, a plurality of codes to be assigned to a plurality of reaction conditions usable during synthesis of a chemical library, wherein said plurality of codes satisfies a predefined criterion, said predefined criterion being other than excluding an "all zeroes" code; and
assigning selected codes to reaction conditions.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16942698A | 1998-10-09 | 1998-10-09 | |
US169426 | 1998-10-09 | ||
PCT/US1999/023444 WO2000021909A2 (en) | 1998-10-09 | 1999-10-07 | Selecting codes to be used for encoding combinatorial libraries |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1119530A2 true EP1119530A2 (en) | 2001-08-01 |
Family
ID=22615638
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP99950264A Withdrawn EP1119530A2 (en) | 1998-10-09 | 1999-10-07 | Selecting codes to be used for encoding combinatorial libraries |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP1119530A2 (en) |
AU (1) | AU6295999A (en) |
WO (1) | WO2000021909A2 (en) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002102820A1 (en) | 2001-06-20 | 2002-12-27 | Nuevolution A/S | Nucleoside derivatives for library preparation |
EP2186897B1 (en) | 2002-03-15 | 2016-02-17 | Nuevolution A/S | An improved method for synthesising templated molecules |
EP1539980B1 (en) | 2002-08-01 | 2016-02-17 | Nuevolution A/S | Library of complexes comprising small non-peptide molecules and double-stranded oligonucleotides identifying the molecules |
WO2004028682A2 (en) | 2002-09-27 | 2004-04-08 | Carlsberg A/S | Spatially encoded polymer matrix |
ES2368215T3 (en) | 2002-10-30 | 2011-11-15 | Nuevolution A/S | ENZYMATIC CODING. |
EP2175019A3 (en) | 2002-12-19 | 2011-04-06 | Nuevolution A/S | Quasirandom structure and function guided synthesis methods |
US20070026397A1 (en) | 2003-02-21 | 2007-02-01 | Nuevolution A/S | Method for producing second-generation library |
US7915201B2 (en) | 2003-03-20 | 2011-03-29 | Nuevolution A/S | Ligational encoding of small molecules |
DE602004023960D1 (en) | 2003-09-18 | 2009-12-17 | Nuevolution As | Method for obtaining structural information of encoded molecules and for selection of compounds |
DK1730277T3 (en) | 2004-03-22 | 2010-03-01 | Nuevolution As | Coding by ligation using building block oligonucleotides |
DE602006018648D1 (en) | 2005-12-01 | 2011-01-13 | Nuevolution As | ENZYMMETRING CODING METHODS FOR EFFICIENT SYNTHESIS OF LARGE LIBRARIES |
DK2558577T3 (en) | 2010-04-16 | 2019-04-01 | Nuevolution As | Bi-functional complexes and methods for the preparation and use of such complexes |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5565324A (en) * | 1992-10-01 | 1996-10-15 | The Trustees Of Columbia University In The City Of New York | Complex combinatorial chemical libraries encoded with tags |
-
1999
- 1999-10-07 AU AU62959/99A patent/AU6295999A/en not_active Abandoned
- 1999-10-07 EP EP99950264A patent/EP1119530A2/en not_active Withdrawn
- 1999-10-07 WO PCT/US1999/023444 patent/WO2000021909A2/en not_active Application Discontinuation
Non-Patent Citations (1)
Title |
---|
See references of WO0021909A3 * |
Also Published As
Publication number | Publication date |
---|---|
WO2000021909A2 (en) | 2000-04-20 |
AU6295999A (en) | 2000-05-01 |
WO2000021909A3 (en) | 2001-01-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1119530A2 (en) | Selecting codes to be used for encoding combinatorial libraries | |
Nikolaiev et al. | Peptide-encoding for structure determination of nonsequenceable polymers within libraries synthesized and tested on solid-phase supports. | |
EP0454985B1 (en) | Scalable compound instruction set machine architecture | |
Stuck et al. | A computer and communications network performance analysis primer | |
EP0454984B1 (en) | General purpose compounding technique for instruction-level processors | |
JP2021532799A (en) | Systems and methods for storing and reading nucleic acid-based data with error protection | |
Yi et al. | Nonrandom missing data can bias principal component analysis inference of population genetic structure | |
JP2022531790A (en) | Data structures and behaviors for search, calculation, and indexing in DNA-based data storage | |
CN101438230B (en) | Method and system compatible for different form of addressing different address space | |
US6199017B1 (en) | Biochemical information processing apparatus, biochemical information processing method, and biochemical information recording medium | |
Rinehart et al. | A machine-learning tool to predict substrate-adaptive conditions for Pd-catalyzed C–N couplings | |
Rosenstein et al. | Principles of information storage in small-molecule mixtures | |
Vierstraete et al. | Amplicon_sorter: A tool for reference‐free amplicon sorting based on sequence similarity and for building consensus sequences | |
Furka | Forty years of combinatorial technology | |
CN111460849A (en) | Method compatible with multiple sets of setting codes and code scanning equipment | |
CN112015529B (en) | Data task scheduling method, system, electronic device and storage medium | |
CN111475234B (en) | Character string transmission method and device, computer and readable storage medium | |
JPS63500547A (en) | Circular context addressable memory | |
Corel et al. | MS4-Multi-Scale Selector of Sequence Signatures: An alignment-free method for classification of biological sequences | |
CN110021366A (en) | A kind of system and its analysis method based on DNA encoding compound database | |
GB2400942A (en) | Processor type determination using reset vector | |
AU746309B2 (en) | Method of synthesizing a plurality of products | |
Knill et al. | Group testing problems in experimental molecular biology | |
US20080052666A1 (en) | Apparatus and method for making build-block component | |
Fralish et al. | Finding the most potent compounds using active learning on molecular pairs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20010502 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
17Q | First examination report despatched |
Effective date: 20010803 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: PHARMACOPEIA DRUG DISCOVERY, INC. |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20060926 |