WO2014094620A1 - 一种先导化合物的合成及筛选方法与试剂盒 - Google Patents

一种先导化合物的合成及筛选方法与试剂盒 Download PDF

Info

Publication number
WO2014094620A1
WO2014094620A1 PCT/CN2013/089873 CN2013089873W WO2014094620A1 WO 2014094620 A1 WO2014094620 A1 WO 2014094620A1 CN 2013089873 W CN2013089873 W CN 2013089873W WO 2014094620 A1 WO2014094620 A1 WO 2014094620A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
stranded dna
building block
synthetic
compound
Prior art date
Application number
PCT/CN2013/089873
Other languages
English (en)
French (fr)
Inventor
李进
江洋
陈湜
瞿庆喜
万金桥
钟丽娜
葛啸虎
黄奇
Original Assignee
成都先导药物开发有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 成都先导药物开发有限公司 filed Critical 成都先导药物开发有限公司
Priority to US14/653,729 priority Critical patent/US20150321164A1/en
Publication of WO2014094620A1 publication Critical patent/WO2014094620A1/zh

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J19/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J19/0046Sequential or parallel reactions, e.g. for the synthesis of polypeptides or polynucleotides; Apparatus and devices for combinatorial chemistry or for making molecular arrays
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1068Template (nucleic acid) mediated chemical library synthesis, e.g. chemical and enzymatical DNA-templated organic molecule synthesis, libraries prepared by non ribosomal polypeptide synthesis [NRPS], DNA/RNA-polymerase mediated polypeptide synthesis
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • C12Q1/6874Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/08Liquid phase synthesis, i.e. wherein all library building blocks are in liquid phase or in solution during library creation; Particular methods of cleavage from the liquid support
    • C40B50/10Liquid phase synthesis, i.e. wherein all library building blocks are in liquid phase or in solution during library creation; Particular methods of cleavage from the liquid support involving encoding steps
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B01PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
    • B01JCHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
    • B01J2219/00Chemical, physical or physico-chemical processes in general; Their relevant apparatus
    • B01J2219/00274Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
    • B01J2219/00718Type of compounds synthesised
    • B01J2219/0072Organic compounds
    • B01J2219/00722Nucleotides

Definitions

  • the invention relates to the field of chemistry, in particular to a method and a kit for synthesizing and screening a lead compound.
  • Combinatorial chemistry is a technique for synthesizing chemical libraries in which different series of synthetic blocks, the reaction components, are arranged in an orderly fashion to form a large series of diverse molecular entities.
  • Combinatorial chemical synthesis is often referred to as a digital game, that is, how to arrange a large number of synthetic blocks.
  • the total number of reaction products N in a combined synthesis is determined by two factors, the synthesis of each step.
  • the goal of combinatorial chemistry research is how to effectively obtain all the products of this reaction, N.
  • combinatorial chemistry has made breakthroughs in the synthesis methods.
  • Several commonly used synthetic methods include solid phase organic synthesis and liquid phase organic synthesis, solid phase organic synthesis including mixing.
  • Splitting and parallel synthesis, liquid phase organic synthesis includes multicomponent liquid phase synthesis and functional group transformation.
  • High-throughput screening (HTS) technology is based on experimental methods at the molecular and cellular levels, using microplates as an experimental tool carrier, and performing automated testing procedures to perform sensitive and rapid testing instruments. The experimental result data is collected, and the experimental data is analyzed and processed by the computer. The thousands of samples are detected at the same time, and the technical system of the whole system is supported by the corresponding database. High-throughput screening methods greatly increase the speed and efficiency of screening for small molecule compounds, and compounds that act on target molecules can be screened from combinatorial chemical libraries. However, after screening a compound from a chemical library using a conventional high-throughput screening method, it is very difficult to purify and determine the structure of the target compound, which takes a long time and is costly, and is more difficult to expand with the compound library.
  • the title of the invention discloses a method of synthesizing a compound in a compound at each stage of the synthesis (for example On the granules, the labeling is specifically performed to define the specific matters (usually referred to as the added chemical reagents) accompanying the synthesis of the compound on the carrier.
  • the labeling is accomplished using an identifier molecule that records the sequential events experienced by the carrier particles during synthesis, thereby providing a reaction history for the production of the compound on the carrier.
  • this application does not provide a technical solution to implement the method.
  • the synthesis unit is labeled, for example: Publication No.: EP 0643778, the name of the invention: "Quantified combinatorial chemical l ibrari es" discloses a method for labeling an amino acid or a polypeptide with a single-stranded oligonucleotide; Publication No.: US 7935658, name of the invention
  • the method of "method for synths is of encoded l ibrari es" discloses a method for labeling a synthetic building block with a double-stranded DNA fragment to form a compound library; Application No.: W0/2010/094036, Title: “MethodS OF CREATING
  • the patent application of AND SCREENING DNA-ENC0DED LIBRARIES discloses a method of labeling a compound with an oligonucleotide to form a library of compounds, the oligonucleotide of which is a double-stranded DNA of a hairpin structure.
  • the double-stranded DNA is easily cross-linked during the ligation extension to form a coiled tertiary structure.
  • sequencing it needs to be melted, and the operation is complicated, and the double-stranded DNA is labeled.
  • the linear combination reaction is more than three steps, the sequencing result of the double-stranded DNA is greatly errored, which results in the method can only stay in two dimensions, so the number of compounds in the library can only be increased by increasing the number of synthetic blocks per reaction.
  • the obtained compound library has poor diversity, and it is difficult to synthesize the target compound.
  • the present invention provides a kit and method for synthesizing and screening a lead compound, and a novel combinatorial chemical library.
  • Synthetic Bui lding Block also known as synthon, is a small molecule compound that must be used in the development of new drugs (Western medicines, pesticides) with various physical and chemical properties and specific biochemical properties.
  • a lead compound is referred to as a lead, and is a compound having a certain biological activity and chemical structure obtained by various routes and means for further structural modification and modification, and is a starting point of modern new drug research.
  • Reaction process It is the process that the reaction goes through.
  • Tandem linkage refers to the connection of several single-stranded DNA sequences at different ends, and there are no branches at the junction.
  • the method for synthesizing and screening a lead compound of the present invention comprises the following steps:
  • the DNA of the target compound obtained by the step (3) is sequenced to determine the structure of the target compound.
  • the starting sequence of step (1) comprises polyadenylation.
  • the polyadenylation is 12 to 20 adenosines.
  • the length of the label sequence in step (1) is not less than 6 bp.
  • the marker sequence is 9 bp in length.
  • the pH is 8 to 12 during the synthesis, and the temperature is (T3 (rC).
  • step (1) wherein the 3' end of the tag sequence is linked to a ribonucleotide.
  • the ribonucleotide is cytidine.
  • step (2) the method of connecting the start sequence to the initial synthesis block in step a is:
  • step (2) of the joining start sequence and the tag sequence, the tag sequence or the tag sequence and the terminal sequence are: using a polynucleoside kinase to make the 5 '-end of the single-stranded DNA acidified, using RNA ligase Connect, you can.
  • the polynucleoside kinase is a T4 polynucleoside kinase
  • the RNA ligase is a T4 RNA ligase.
  • the screening method in the step (3) is a screening method based on a receptor-ligand specific reaction.
  • a kit for synthesizing and screening a lead compound of the present invention which comprises the following components:
  • the start sequence comprises polyadenylation.
  • the polyadenylation is 12 to 20 adenosines.
  • the length of the marker sequence of the component 1) is not less than 6 bp.
  • the marker sequence is 9 bp in length.
  • step (1) wherein the 3' end of the tag sequence is linked to a ribonucleotide.
  • the ribonucleotide is cytidine.
  • the reagent for linking the start end sequence to the synthetic block comprises a reagent for aminating the single-stranded DNA and a reagent for the initial synthesis of the carboxyl group, the thiol group or the alkynyl group.
  • the single-stranded DNA fragment-ligating agent comprises a polynucleoside kinase and an RNA ligase.
  • the polynucleoside kinase is a T4 polynucleoside kinase
  • the RNA ligase is a T4 RNA ligase linkage
  • the combinatorial chemical library of the present invention is a combinatorial chemical library synthesized by a combinatorial chemical method using a synthetic building block, wherein each compound is labeled with a single-stranded DNA sequence, and the structure of the single-stranded DNA sequence is a starting sequence.
  • the length of the labeling sequence is not less than 6 bp.
  • the marker sequence is 9 bp in length.
  • the length of the marker sequence is 6, 4096 single-stranded DNA fragments with different sequences can be prepared.
  • the number of synthetic blocks encoded by DNA fragments for preparing combinatorial chemical libraries can meet the needs of most compounds for synthesis and screening.
  • the marker sequence is or 9
  • a single-stranded DNA fragment of 262144 different sequences can be prepared, which encodes up to 262144 synthetic blocks for the preparation of combinatorial chemical libraries, which are encoded by DNA fragments and used to prepare combinatorial chemical libraries. With millions of synthetic blocks, it can fully meet the needs of compound synthesis and screening. If the marker sequence is longer, the larger the number of synthetic blocks that can be encoded, the larger the combined chemical library prepared, but correspondingly, the higher the cost, considering the storage capacity and cost, and the length of the marker sequence is 9 is optimal.
  • the building blocks are labeled with single-stranded DNA, and the single-stranded DNA does not complement each other to form a double strand during the ligation process.
  • the structure is stable, and it is not easy to cross-link. It does not need to be melted during sequencing, and the operation is simple, rapid, and accurate. Therefore, the method of the present invention can comprise a multi-step linear combination reaction, and the synthesized compound library has good diversity, large storage capacity, easy synthesis of the target compound, determination of its synthesis block, reaction history and chemical structure, thereby rapidly synthesizing a large number of target compounds. It is an accurate, efficient, easy to operate, low cost lead compound library synthesis and screening method with good application prospects.
  • Figure 1 is a schematic diagram of the process of synthesizing a compound by a combinatorial chemistry method, wherein "H” represents a synthetic building block; “initial” indicates an initial sequence; “B” indicates a labeled sequence, and its specific marker is a synthetic building block, the numbers representing the two Corresponding relationship, for example, B1 specific marker HI; “End” indicates the end sequence; the left column indicates the reaction step, which is consistent with the reaction step of Example 1; the final product obtained, the synthetic block from right to left only indicates the addition of the synthetic block
  • the sequence, the initial sequence, the marker sequence and the terminal sequence represent the structure of the finally obtained single-stranded DNA sequence from left to right;
  • Figure 2 is an electropherogram of the chemical library of the present invention and the selected trypsin inhibitor
  • Figure 3 is a histogram of the sequencing results, the column corresponds to the compound, and its height is related to the binding force of the compound to the target;
  • Figure 5 is an IC50 map of the trypsin inhibitor of the present invention.
  • Polyadenosine can be linked to the start sequence for isolation and purification, and the cytidine can be linked to the marker sequence to improve the efficiency of subsequent single-strand DNA fragments.
  • Polynucleoside kinase is used to make the 5 '-end of the single-stranded DNA acidified; RNA ligase is linked;
  • the compound is synthesized by linear combination reaction.
  • each new synthetic building block is connected in series with the free end of the single-stranded DNA connected to the initial synthetic building block.
  • Synthesis (in addition to the following synthesis methods, other chemical synthesis methods can also be used): Take the synthetic blocks 5 ⁇ 6, place them in 2 micro-reaction vessels, and mix them with the mixture prepared in step b, according to the mixed splitting method. , parallel synthesis, multi-component liquid phase synthesis or functional group conversion synthesis;
  • a chromatographic separation screening method based on receptor-ligand specific reaction, which screens a library of DNA-labeled compounds with biological target molecules.
  • the chromatogram is eluted, and the DNA-labeled compound not bound to the biological target molecule is removed, and the DNA-labeled compound bound to the biological target molecule is isolated.
  • the DNA-labeled compound obtained by the step (3) is screened, and the DNA on the DNA-labeled compound is sequenced, and the synthetic building block and the reaction course of the compound can be determined based on the DNA sequence.
  • T4 PNK 500U NEB-M0201V
  • T4 RNA l igase 1 NEB-M0204S
  • Cartridges PCR purification kit (cat. no 28104, Nucleotides removal Kit cat. no 28306) purchased from Qiagen (Hi lden, Germany) .
  • dNTPs (0. 5 mM, NEB, cat. no89009).
  • the single-stranded DNA fragment shown in Table 1 was synthesized by Genscript and Biosune.
  • Cytosine can be linked to the following marker sequences to increase the efficiency of subsequent single-strand DNA fragments.
  • Preparing the initial synthetic block Select a synthetic block, connect one end of the starting sequence to the synthetic block, and connect the other end to the specific marking sequence of the synthetic block in series to obtain a mark with one end free.
  • the starting sequence is aminated, taken from the synthetic building block 1, and carboxylated; the activated synthetic building block 1 is reacted with the activated starting sequence to obtain the initial synthetic building block of the starting sequence.
  • the total volume of the reaction mixture is 150 ⁇ l, and the solvent is water and dimethyl sulfoxide in a volume ratio of 3:7, which contains a triethylamine hydrochloride buffer system (H10.O, 80 mM), wherein the concentration of the synthetic block 1 is 30 mM, activating reagent 1_ethyl_3_(3-dimethylaminopropyl)carbodiimide hydrochloride (EDCI) at a concentration of 4 mM, 2-sulfonic acid hydroxysuccinimide at 10 mM, The starting sequence concentration was 20 M and the reaction was carried out for 1 hour at room temperature.
  • H10.O triethylamine hydrochloride buffer system
  • Polynucleoside kinase is used to make the 5'-end of the single-stranded DNA acidified; RNA ligase is ligated.
  • the reaction mixture of step 1 and the labeled sequence of 1, 15 ul comprises a 225 pmol start sequence, 25 pmol of the tag sequence 1, 50 units of T4 RNA ligase, and a ligation reaction buffer; the mixture is incubated at 25 ° C. 5h, followed by heating at 70 ° C for 20 min, T4RNA ligase denaturation; addition of T4 polynucleotide kinase and InmATP, reaction for 10 cycles, followed by incubation at 75 ° C for 20 min to denature additional polynucleoside kinase;
  • the product was used in an equal volume of 2 X loading buffer containing 40 mM Tris-HCL (pH 7.6), 1 M NaCL and 1 mM EDTA;
  • the resulting mixture was purified by the following procedure: Qiagen Cartridge column on the reaction solution; suspended in IX loading buffer; lOOOrmp was centrifuged for 1 min, filtered with silica glass wool; IX loading buffer, 0.5 M NaCL solution and 80% ethanol were sequentially used. Rinse; elute with 20 ul of PE eluent; vacuum dry.
  • the compound is synthesized by a three-step linear combination reaction. During the synthesis process, each new synthetic building block is separated from the single-stranded DNA linked to the initial synthetic building block. The single-stranded DNA is gradually extended in tandem with the specific tag sequence added to the synthetic building block, and after the synthesis is completed, the terminal sequence is ligated in tandem at the free end of the single-stranded DNA to obtain a single-stranded DNA-labeled compound library;
  • the first batch of synthetic blocks namely the initial block (1): synthetic block 1;
  • the third batch of synthetic blocks (49) synthetic blocks 55;
  • the synthetic blocks 2 ⁇ 6 are placed in 5 micro-reaction vessels and mixed with the initial synthetic block prepared in step a, according to mixed splitting method, parallel synthesis method, multi-component liquid phase synthesis method or functional group transformation. Method synthesis.
  • the reaction condition is 150 ⁇ l of the reaction mixture, and the solvent is water and dimethyl sulfoxide in a volume ratio of 3:7, which contains a triethylamine hydrochloride buffer system (pH 9.0, 80 mM), and is synthesized.
  • the concentration of block 1 was 30 mM, the activation reagent 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDCI) 4 mM, 2-sulfonic acid hydroxysuccinimide 10 mM, The concentration of the building block 2 was 1.5 M, and the reaction was carried out at room temperature for 15 hours.
  • EDCI 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride
  • 2-sulfonic acid hydroxysuccinimide 10 mM The concentration of the building block 2 was 1.5 M, and the reaction was carried out at room temperature for 15 hours.
  • the synthetic blocks 75 are taken in 49 micro-reaction vessels and mixed with the initial synthetic blocks prepared in step a, according to the mixed splitting method, the parallel synthesis method, the multi-component liquid phase synthesis method or the functional group conversion method. synthesis.
  • the reaction condition is 150 ⁇ l of the reaction mixture, and the solvent is water and dimethyl sulfoxide in a volume ratio of 3:7, which contains a triethylamine hydrochloride buffer system (pH 9.0, 80 mM), and is synthesized.
  • the concentration of block 1 was 30 mM, the activation reagent 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDCI) 4 mM, 2-sulfonic acid hydroxysuccinimide 10 mM, The concentration of the building block 2 was 1.5 M, and the reaction was carried out at room temperature for 15 hours.
  • EDCI 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride
  • 2-sulfonic acid hydroxysuccinimide 10 mM The concentration of the building block 2 was 1.5 M, and the reaction was carried out at room temperature for 15 hours.
  • a chromatographic separation screening method based on receptor-ligand specific reaction, which screens a library of DNA-labeled compounds with biological target molecules.
  • Washing Solution 1 0.1 M acetic acid, 0.5 M NaCl, pH 4.0;
  • Washing Solution 2 0.1 M Tris-HCl, 0.5 M NaCl, pH 8.0.
  • Step (2) The library of the obtained single-stranded DNA-labeled compound was mixed with PBS buffer at a ratio of 1:15 by volume (17 uL: 255 uL);
  • pancreatic bovine trypsin/CNBr resin slurry (2.5, 0.5, 0.1, 0.02, 0.004) And Omg/mL);
  • Step 3) The obtained salmon sperm DNA solution is incubated with the pancreatic bovine trypsin/CNBr resin slurry obtained in the step 2) at 25 ° C for 1 hour;
  • the single-strand DNA-labeled compound library obtained in the step (2) is subjected to electrophoresis detection of the trypsin affinity sample obtained by the step (3).
  • test results are shown in Fig. 2.
  • a target band was obtained by screening with pancreatic bovine trypsin/CNBr resin slurry, and the negative control was a blank band, indicating that the purified trypsin ligand sample was screened by the present invention.
  • Step (3) is used to screen the obtained DNA-labeled compound, and the DNA on the DNA-labeled compound is sequenced, and the synthetic block and reaction course of the compound can be determined based on the DNA sequence:
  • the sample obtained by the step (3) is subjected to polymerase chain reaction (PCR), and the oligonucleotide code encoding the compound is subjected to PCR amplification (total volume 50 ⁇ l, 30 cycles, 94 ° C for 1 minute per cycle)
  • PCR polymerase chain reaction
  • total volume 50 ⁇ l, 30 cycles, 94 ° C for 1 minute per cycle The reaction was carried out at 55 ° C for 1 minute and at 72 ° C for 40 seconds), and 5 ⁇ L of trypsin 245 library (concentration 100 fM) was used as a template.
  • Hi-seq2000 specific sequencing linker (6 bases length) was ligated to the 5' and 3' ends of the sequencing sample using Illumina's chip-seq DNA sample kit, and then fixed to the chip-seq of the Hiseq2000 sequencer. Above the plate, perform the next bridge amplification;
  • the labeled dNTP of the Truseq SBS Kit v3-HS (200cycles), record the sequence and frequency of each base read from the sequencing link, and test the base of the nucleic acid sample;
  • the compound is synthesized and detected, and the compound is determined to be a trypsin inhibitor, and the inhibitory activity thereof is shown in Figs. 4 to 5, and the IC50 is 8. 1 ⁇ 2. 1 ⁇ , indicating that the compound obtained by the screening is indeed Trypsin ligand.
  • the experimental results show that the present invention constructs a chemical library containing 245 compounds, and screens to obtain a trypsin ligand having inhibitory activity of trypsin, indicating that the method of the present invention can efficiently synthesize and screen lead compounds.
  • composition of the kit of the invention (synthesis amount of one synthetic building block)
  • Sequence name Single-stranded DNA sequence 1. 5 M synthetic building block, 30 mM 5, -P03-AGATCTGATGGCGCGAGGGAAAAAAAAA start sequence
  • the present invention uses a single-stranded DNA marker to synthesize a building block, and the single-stranded DNA does not complement each other during the ligation process, is not easily cross-linked, and has a stable structure, single-stranded DNA.
  • PCR amplification and sequencing are more convenient and faster than double-stranded DNA. Therefore, the method of the invention can comprise a multi-step linear combination reaction, the synthesized compound library has good diversity, large storage capacity, easy synthesis of the target compound, and the synthesis block, reaction history and chemical structure can be determined by sequencing, accurate, efficient, and operation. Simple, low cost and good application prospects.
  • the kit of the invention can be used for rapid synthesis and screening of lead compounds, and the method of the invention can comprise a multi-step linear combination reaction, and the synthesized compound library has good diversity, large storage capacity, easy synthesis and target compound, and can be determined by sequencing. Block, reaction history and chemical structure, accurate and efficient, easy to operate, low cost. Has a good application prospect, suitable for industrial applications.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Analytical Chemistry (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Plant Pathology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Immunology (AREA)
  • Structural Engineering (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

一种先导化合物的合成及筛选的方法,包括如下步骤:(1)取原料:取i种合成砌块与(i+2)种单链DNA片段;(2)用组合化学方法合成化合物,得到单链DNA标记的化合物的文库;(3)筛选:对DNA标记的化合物的文库进行筛选;(4)测序:取步骤(3)筛选得到的DNA标记的化合物,对DNA标记的化合物上的DNA测序,根据DNA序列即可确定该化合物的合成砌块和反应历程。还公开了一种先导化合物的合成及筛选试剂盒,以及一种组合化学文库。

Description

说 明 书
一种先导化合物的合成及筛选方法与试剂盒 技术领域
本发明涉及化学领域, 特别是涉及一种先导化合物的合成及筛选方法与试剂盒。
背景技术
自 80年代末期,随着分子生物学研究的突破一高通量筛选技术的发展,新药开发所需要的新分子 实体的数目越来越多, 科学家们把注意力从寻找天然产物转入合成大数目的化合物群——化学库, 化 学库是由诸多不同属性的有机化合物组成的。组合化学方法是一种合成化学库的技术,运用这项技术, 不同系列的合成砌块——反应成分有序地排列起来以组成大系列的多样化分子实体群。 组合化学合成 法经常被人们称为数字游戏, 也就是如何排列众多的合成砌块的问题, 从理论上讲, 组合合成的总反 应产物数 N是由两个因素决定的, 每一步的合成砌块数目 b和合成的步骤 x, 例如, 对于一个三步的 线性组合反应,如果每步的反应物数目分别是 bl、b2、b3,那么理论上的总反应产物的数目是 N=blb2b3。 组合化学研究的目标就是怎样有效地得到这一反应的所有产物 N。 近年来, 从固相合成到快速液相平 行合成, 组合化学在合成方法上取得了突破性的进展, 常用的几种合成方法有固相有机合成和液相有 机合成, 固相有机合成包括混合裂分法和平行合成法, 液相有机合成包括多组分液相合成法和官能团 转化法。
组合化学合成技术建立的化学库中, 产物成千上万, 甚至上亿, 像经典有机合成那样一个一个的 纯化分离鉴定已不再可能。 高通量筛选(Hi gh throughput screening, HTS)技术是指以分子水平 和细胞水平的实验方法为基础,以微板形式作为实验工具载体,以自动化操作系统执行试验过程, 以灵敏快速的检测仪器采集实验结果数据, 以计算机对实验数据进行分析处理, 同一时间对数以 千、 万计的样品检测, 并以相应的数据库支持整个体系运转的技术体系。 高通量筛选方法极大地 提高了小分子化合物筛选的速度和效率, 可以从组合化学文库中筛选作用于靶标分子的化合物。 但是, 用传统的高通量筛选方法将化合物从化学文库中筛选出来后, 纯化并确定目标化合物的结 构非常困难, 需要的时间长, 成本高, 随着化合物文库的扩大, 难度更大。
为了解决该问题, 申请号: 95193518. 6, 发明名称: "用标记编码的复杂组合化学文库 " 的专利申请公开了一种方法, 即在合成的每个阶段, 在化合物进行合成的载体 (例如颗粒) 上, 专一地进行标记, 以定义随同该载体上化合物合成时伴随的特定事项 (通常指所加化学试剂)。 所述标记使用鉴定剂分子来完成, 该分子记录合成期间载体颗粒所经历的按顺序的事项, 由此提 供在该载体上生产化合物的反应历程。 但是该申请并未提供实现该方法的技术方案。
现有技术中有报道使用寡核苷酸标记化合物的合成单元, 根据生物领域的常识, 在常规条件 下,双链 DNA较单链 DNA稳定, 因此通常选择双链的寡核苷酸对化合物的合成单元进行标记,如: 公告号: EP 0643778 , 发明名称: " encoded combinatorial chemical l ibrari e s " 的专利 公开了用单链寡核苷酸标记氨基酸或多肽的方法;公告号: US 7935658 ,发明名称: " methods for synths i s of encoded l ibrari es " 的专利公开了一种用双链 DNA片段标记合成砌块, 形成化合 物文库的方法; 申请号: W0/2010/094036, 发明名称: " METHODS OF CREATING AND SCREENING DNA-ENC0DED LIBRARIES " 的专利申请公开了用寡核苷酸标记化合物, 形成化合物文库的方法, 其寡核苷酸为发卡结构的双链 DNA。
但是, 用双链 DNA标记合成砌块或者化合物时, 在连接延伸过程中, 双链 DNA容易交联, 形 成卷曲的三级结构, 测序时, 需要解链, 操作较复杂, 用双链 DNA标记三步以上的线性组合反应 时, 双链 DNA的测序结果误差较大, 导致该方法只能停留在两维, 因而只能通过增加每一步反应 的合成砌块数目来增加文库中化合物数量, 制得的化合物文库多样性差, 不易合成得到目标化合 物。
需要寻找新的, 操作简便, 结果更准确的标记方法。
发明内容
为了解决上述问题, 本发明提供了一种先导化合物的合成及筛选的试剂盒和方法, 以及一种新的 组合化学文库。
名词解释:
合成砌块 (Synthetic Bui lding Block), 又叫合成子, 是指具备各种理化性质以及特定生物化学 性质的、 在新药 (西药、 农药) 研发过程中必须使用的小分子化合物。 先导化合物(lead compound)简称先导物, 是通过各种途径和手段得到的具有某种生物活性和化 学结构的化合物, 用于进一步的结构改造和修饰, 是现代新药研究的出发点。
反应历程: 就是反应所经历的过程。
串联连接: 是指若干段单链 DNA序列之间依次两端点相连, 且连接点上没有分枝。
本发明先导化合物的合成及筛选的方法, 它包括如下步骤:
( 1 ) 取原料: i种合成砌块与 (i+2 ) 种单链 DNA片段, (i+2 ) 种单链 DNA片段包括 i种标记序 列、 1种始端序列和 1种末端序列, i种标记序列分别特异标记 i种合成砌块, 其中, i=l, 2, 3…! 1 ;
(2 ) 用组合化学方法合成化合物:
a、 制备初始合成砌块: 选择广 i种合成砌块, 将始端序列的一端连接在合成砌块上, 另一端与所 述合成砌块的特异标记序列串联连接, 得到 l~i种标记有一端游离的单链 DNA的初始合成砌块; b、 以步骤 a得到的初始合成砌块为基础, 用线性组合反应的方式合成化合物, 合成过程中, 每加 入新的合成砌块, 就在与初始合成砌块相连的单链 DNA游离端串联连接所加入合成砌块的特异标记序 列, 使所述单链 DNA逐渐延长, 合成结束后, 在所述单链 DNA游离端串联连接末端序列, 即得到单链 DNA标记的化合物文库;
( 3 ) 筛选: 对 DNA标记的化合物的文库进行筛选, 选出目标化合物;
(4 ) 测序: 将步骤 (3 ) 筛选得到的目标化合物的 DNA进行测序, 确定目标化合物的结构。 其中, 步骤 (1 ) 所述始端序列包括多聚腺苷。 优选地, 所述多聚腺苷为 12~20个腺苷。
其中, 步骤 (1 ) 所述标记序列的长度不低于 6bp。 优选地, 所述标记序列的长度为 9个 bp。 其中, 所述步骤 (2 ) 合成过程中, pH为 8~12, 温度为 (T3(rC。
其中, 步骤 (1 ) 所述标记序列的 3 ' 端连接一个核糖核苷酸。 所述核糖核苷酸为胞苷。
其中, 步骤 (2 ) 中, a步骤中始端序列与初始合成砌块连接的方法是:
将始端序列氨基化, 初始合成砌块羧基、 巯基或炔基化, 反应即得。
其中, 步骤(2 )所述连接始端序列与标记序列、 标记序列之间或者标记序列与末端序列的连接方 法是: 用多核苷激酶使得单链 DNA的 5 ' -端憐酸化, 用 RNA连接酶连接, 即可。 所述的多核苷激酶为 T4多核苷激酶, 所述 RNA连接酶为 T4 RNA连接酶。
其中, 所述步骤 (3 ) 中的筛选方法是基于受体-配体特异性反应的筛选方法。
本发明先导化合物的合成及筛选试剂盒, 它包括如下成分:
1 ) i种合成砌块与 (i+2 ) 种单链 DNA片段, 单链 DNA片段分为始端序列、 末端序列和 i种标记 序列, i种标记序列分别特异标记 i种合成砌块, 其中, i=l, 2, 3…! 1 ;
2 ) 始端序列与合成砌块连接用试剂、 组合化学方法用试剂和单链 DNA片段连接用试剂;
3 ) 化合物筛选用试剂;
4) DNA测序用试剂。
其中, 成分 1 ) 所述始端序列包括多聚腺苷。 优选地, 所述多聚腺苷为 12~20个腺苷。
其中, 成分 1 ) 所述标记序列的长度不低于 6bp。 优选地, 所述标记序列的长度为 9个 bp。 其中, 步骤 (1 ) 所述标记序列的 3 ' 端连接一个核糖核苷酸。 所述核糖核苷酸为胞苷。
其中, 成分 2 ) 所述始端序列与合成砌块连接用试剂包含氨基化单链 DNA的试剂以及羧基、 巯基 或炔基化初始合成砌块的试剂。
其中, 成分 2 ) 所述单链 DNA片段连接用试剂包括多核苷激酶和 RNA连接酶。
优选地, 所述的多核苷激酶为 T4多核苷激酶, 所述 RNA连接酶为 T4RNA连接酶连接。
本发明组合化学文库, 它是以合成砌块为原料, 用组合化学方法合成的组合化学文库, 其中, 每 个化合物标记了一段单链 DNA序列, 该单链 DNA序列的结构为始端序列—— i种标记序列——末端序 列, 所述 i种标记序列特异标记 i种组合化学合成过程中使用的合成砌块, 其排列顺序与组合化学合 成过程中合成砌块的加入顺序相同。
其中, 所述标记序列的长度不低于 6bp。 优选地, 所述标记序列的长度为 9个 bp。
标记序列的长度为 6时, 可制备 4096个不同序列的单链 DNA片段, DNA片段编码的、 用于制备组 合化学文库的合成砌块数以千计, 能满足大多数化合物合成及筛选的需要; 标记序列为或 9时, 可制 备 262144个不同序列的单链 DNA片段,其编码的、用于制备组合化学文库的合成砌块高达 262144个, DNA片段编码的、 用于制备组合化学文库的合成砌块数以百万计, 完全可以满足化合物合成及筛选的 需要。 若标记序列越长, 其可以编码的合成砌块数量越大, 制备的组合化学文库越大, 但是相应的, 成本越高, 综合考虑库容和成本, 标记序列的长度为 9时最优。
在本发明条件下, 用单链 DNA标记合成砌块, 连接过程中, 单链 DNA之间不会互补而形成双链, 结构稳定, 也不易交联, 测序时不需要解链, 操作简单、 快速、 结果准确。 因此, 本发明方法可以包 含多步线性组合反应, 合成的化合物文库多样性好, 库容大, 容易合成得到目标化合物并确定其合成 砌块、 反应历程及化学结构, 从而迅速合成得到大量目标化合物, 是一种准确高效、 操作简便、 成本 低廉的先导化合物文库合成和筛选方法, 应用前景良好。
以下通过实施例形式的具体实施方式, 对本发明的上述内容作进一步的详细说明。 但不应将此理 解为本发明上述主题的范围仅限于以下的实施例。 凡基于本发明上述内容所实现的技术均属于本发明 的范围。
附图说明
图 1 本发明用组合化学方法合成化合物的进程示意图, 其中, "H"表示合成砌块; "初"表示初 始序列; "B"表示标记序列, 其特异标记合成砌块, 数字代表二者的对应关系, 如, B1特异标记 HI; "末"表示末端序列; 左栏表示反应步骤, 与实施例 1反应步骤一致; 得到的终产物, 合成砌块从右 至左仅表示合成砌块的加入顺序, 初始序列、 标记序列和末端序列从左至右表示最终得到的单链 DNA 序列的结构;
图 2 本发明化学文库以及筛选得到的胰蛋白酶抑制剂的电泳图;
图 3 测序结果柱状图, 柱与化合物一一对应, 其高度与该化合物与靶标的结合力相关; .
图 4 本发明胰蛋白酶抑制剂的 IC50图谱;
图 5 本发明胰蛋白酶抑制剂的 IC50图谱。
具体实施方式
以下通过实施例的形式来阐述具体实施方式, 并对本发明的上述内容作进一步的详细说明。 但不 应将此理解为本发明上述主题的范围仅限于以下的实施例。 凡基于本发明上述内容所实现的技术均属 于本发明的范围。
实施例 1 本发明先导化合物的合成及筛选方法
1、 制备方法
( 1 )取合成砌块和单链 DNA片段:
i种合成砌块与(i+2)种单链 DNA片段, (i+2)种单链 DNA片段包括 i种标记序列、 一种始端序 列和一种末端序列, i种标记序列分别特异标记 i种合成砌块, 其中, i=l, 2, 3—n;
始端序列上可连接多聚腺苷, 以便分离纯化, 标记序列上均可以连接胞苷, 提高后续单链 DNA片 段的连接效率。
表 1 单链 DNA片段
Figure imgf000005_0001
3
替换页 (细则第 26条)
Figure imgf000006_0001
=2;
①初始合成砌块与始端序列连接:
取始端序列, 氨基化, 取合成砌块 1和 2, 羧基、 巯基或炔基化; 取活化的合成砌块 1和 2与活 化的始端序列反应, 得连接了始端序列的初始合成砌块;
4
替换页 (细则第 26条) ②分别在始端序列上连接合成砌块 1和 2的标记序列 (该连接方法除了可以用下述方法以外, 还 可以用其他单链 DNA的连接方法):
用多核苷激酶使得单链 DNA的 5 ' -端憐酸化; RNA连接酶连接, 即可;
③混合, 得初始合成砌块混合物。
b、 以步骤 a得到的初始合成砌块为基础, 用线性组合反应的方式合成化合物, 合成过程中, 每加 入新的合成砌块, 就在与初始合成砌块相连的单链 DNA游离端串联连接所加入合成砌块的特异标记序 列, 使所述单链 DNA逐渐延长, 合成结束后, 在所述单链 DNA游离端串联连接末端序列, 即得到单链 DNA标记的化合物文库; 比如, 三步线性组合反应。
I:
①合成(除了下述合成方法以外, 还可以用其他化学合成方法): 取合成砌块 3~4, 置于 2个微型 反应容器中, 分别与步骤 a制得的初始合成砌块混合物中混合, 按照混合裂分法、 平行合成法、 多组 分液相合成法或者官能团转化法合成;
②加标记序列: 同步骤 a步骤②。
③混合, 得混合物。
II:
①合成(除了下述合成方法以外, 还可以用其他化学合成方法): 取合成砌块 5~6, 置于 2个微型 反应容器中, 分别与步骤 b制得混合物混合, 按照混合裂分法、 平行合成法、 多组分液相合成法或者 官能团转化法合成;
②加标记序列: 同步骤 a步骤②。
③加末端序列: 同步骤 a步骤②。
④混合, 即得到单链 DNA标记的化合物的文库。
(3) 筛选: 对 DNA标记的化合物的文库进行筛选:
基于受体-配体特异性反应的色谱分离筛选方法,用生物靶分子对 DNA标记的化合物的文库进行筛 选。
对色谱进行洗脱, 将未与生物靶标分子结合的 DNA标记的化合物除去, 分离得到与生物靶分子结 合的 DNA标记的化合物。
(4) 测序:
取步骤(3 )筛选得到的 DNA标记的化合物, 对 DNA标记的化合物上的 DNA测序, 根据 DNA序列即 可确定该化合物的合成砌块和反应历程。
实施例 2 用本发明方法成和筛选胰蛋白酶配体
1、 材料和试剂
T4 PNK (500U NEB-M0201V) , T4 RNA l igase 1 (NEB- M0204S), Cartridges (PCR purification Kit (cat. no 28104, Nucleotides removal Kit cat. no 28306) purchased from Qiagen (Hi lden, Germany) . dNTPs (0. 5 mM, NEB, cat. no89009) .
表 1所示单链 DNA片段, 由 Genscript公司和 Biosune公司合成。
2、 制备方法
( 1 )制备单链 DNA片段:
本实施例一共用了多少个合成砌块, 请提供这写合成砌块及其编码序列。
54种合成砌块与 56种单链 DNA片段, 57种单链 DNA片段包括 55种标记序列、 1种始端序列和 1 种末端序列;
下述标记序列上均可以连接胞苷, 提高后续单链 DNA片段的连接效率。
表 2 单链 DNA片段
Figure imgf000007_0001
标记序列 1 TGCCCAAGGc
2 CGTCTCGATc
Figure imgf000008_0001
3 TGCGCCGAGc
4 ATGGATTTAc
5 CATGTTTACc
6 GTAACATTAc
7 GGAGTTCAAc
8 CTTTGTACTc
9 ACTACCGTGc
10 ATGAATAAGc
6
替换页 (细则第 26条)
Figure imgf000009_0001
C.8680/ClOZN3/X3d 0Z9t60/ 0Z OAV
Figure imgf000010_0001
6
Figure imgf000011_0001
C.8680/CTOZN3/X3d 0Z9t60/ 0Z OAV 01
Figure imgf000012_0001
C.8680/CT0ZN3/X3d 0Z9t60/ 0Z OAV 52 CGACTTAGCc
Figure imgf000013_0001
53 GTATGAAAAc
54 TTGGCAGGGc
55 TAGATATTGc
Figure imgf000013_0002
GGAGCTTGTGAATTCTGGc
末端序列
(2) 合成:
a、 制备初始合成砌块: 选择一种合成砌块, 将始端序列的一端连接在合成砌块上, 另一端与所述 合成砌块的特异标记序列串联连接, 得到一种标记有一端游离的单链 DNA的初始合成砌块;
①初始合成砌块与始端序列连接:
取始端序列, 氨基化, 取合成砌块 1, 羧基化; 取活化的合成砌块 1与活化的始端序列反应, 得 连接了始端序列的初始合成砌块。
反应混合物总体积为 150微升, 溶剂为水和二甲亚砜体积比为 3 : 7, 其中含三乙胺盐酸缓冲体系 ( H lO. O, 80mM), 其中, 合成砌块 1的浓度为 30 mM, 活化试剂 1_乙基 _3_ (3_二甲胺丙基)碳二亚胺 盐酸盐 (EDCI ) 浓度为 4 mM, 2-磺酸基 羟基琥珀酰亚胺浓度为 10 mM, 始端序列浓度为 20 M, 室温反应 1小时。
②在始端序列上连接合成砌块 1的标记序列 (该连接方法除了可以用下述方法以外, 还可以用其 他单链 DNA的连接方法):
用多核苷激酶使得单链 DNA的 5 ' -端憐酸化; RNA连接酶连接, 即可。
连接: 取步骤①处理后的始端序列与标记序列 1, 15 ul的反应混合物包括 225pmol始端序列、 25pmol标记序列 1、 50单位 T4RNA连接酶以及连接反应的缓冲液; 该混合物在 25°C孵育 1. 5h, 后再 70°C加热 20min, T4RNA连接酶变性; 加入 T4多核苷酸激酶和 InmATP, 反应 10个循环, 接着在 75°C 孵育 20min使额外的多核苷激酶变性;
纯化: 产物用等体积的 2 X上样缓冲液中, 该缓冲液包含 40mM Tris-HCL (pH7. 6)、 1M NaCL和 ImM EDTA;
得到的混合物通过如下步骤纯化: 反应液上 Qiagen Cartridge柱子; 用 I X上样缓冲液悬浮; lOOOrmp离心 lmin, 用硅化玻璃棉过滤; 用 I X上样缓冲液、 0. 5M NaCL溶液和 80%乙醇依次冲洗; 用 20 ul PE洗脱液洗脱; 真空干燥, 即可。
b、 以步骤 a得到的初始合成砌块为基础, 用三步线性组合反应的方式合成化合物, 合成过程中, 每加入新的合成砌块, 就在与初始合成砌块相连的单链 DNA游离端串联连接所加入合成砌块的特异标 记序列, 使所述单链 DNA逐渐延长, 合成结束后, 在所述单链 DNA游离端串联连接末端序列, 即得到 单链 DNA标记的化合物文库;
第一批合成砌块, 即初始砌块 (1个): 合成砌块 1 ; 第二批合成砌块 (5个): 合成砌块 2~6;
第三批合成砌块 (49个): 合成砌块 55;
I:
①合成
取合成砌块 2~6, 分别置于 5个微型反应容器中, 与步骤 a制得的初始合成砌块混合, 按照混合 裂分法、 平行合成法、 多组分液相合成法或者官能团转化法合成。
分别置于 5个微型反应容器中, 与步骤 a制得的初始合成砌块反应。 以合成砌块 2为例, 反应条 件为 150微升反应混合物, 溶剂为水和二甲亚砜体积比为 3:7, 其中含三乙胺盐酸缓冲体系 (pH9.0, 80mM),合成砌块 1的浓度为 30mM,活化试剂 1-乙基 -3- (3-二甲胺丙基)碳二亚胺盐酸盐(EDCI) 4mM, 2-磺酸基 羟基琥珀酰亚胺 10 mM, 合成砌块 2的浓度 1.5 M, 室温反应 15小时。
②分别加合成砌块 2~6的标记序列: 同步骤 a步骤②。
③混合, 得混合物。
II:
①合成
取合成砌块 7 5, 分别置于 49个微型反应容器中, 与步骤 a制得的初始合成砌块混合, 按照混 合裂分法、 平行合成法、 多组分液相合成法或者官能团转化法合成。
分别置于 49个微型反应容器中, 与步骤 a制得的初始合成砌块反应。 以合成砌块 2为例, 反应条 件为 150微升反应混合物, 溶剂为水和二甲亚砜体积比为 3:7, 其中含三乙胺盐酸缓冲体系 (pH9.0, 80mM),合成砌块 1的浓度为 30mM,活化试剂 1-乙基 -3- (3-二甲胺丙基)碳二亚胺盐酸盐(EDCI) 4mM, 2-磺酸基 羟基琥珀酰亚胺 10 mM, 合成砌块 2的浓度 1.5 M, 室温反应 15小时。
②加标记序列: 同步骤 a步骤②。
③加末端序列: 同步骤 a步骤②。
④混合, 即得到单链 DNA标记的化合物的文库。
(3) 筛选: 对 DNA标记的化合物的文库进行筛选:
基于受体-配体特异性反应的色谱分离筛选方法,用生物靶分子对 DNA标记的化合物的文库进行筛 选。
① CNBr树脂活化:
1 ) 0.1033克 CBNr活化 Sepharose 4B树脂, 并把它们分为 2支, 然后在 4ml的 ImM的氯化氢溶 液 (PH3.0) 里静止;
2) 用 ImM的盐酸 (pH值 3.0) 清洗液清洗 15分钟;
3) 将 4mg的胰蛋白酶溶解在 0.5毫升的偶联缓冲液 (0.1M碳酸氢钠, 0.5M氯化钠, pH值 8.3) 中;
4) 轻微上下震荡混合物 1小时, 在室温或 4°C过夜孵育;
5) 用 4ML耦合溶液洗去多余的蛋白质;
6) 将树脂转入 4mL0.1M的 Tris-HCl缓冲液 (pH8.0) 中, 孵育 2小时;
7) 用清洗缓冲液 1和 2清洗树脂三次; (清洗液 1: 0.1M acetic acid, 0.5M NaCl, pH4.0;, 清 洗液 2: 0.1M Tris-HCl, 0.5M NaCl, pH8.0. )
8) 离心分离树脂, 6000r/min, 10min。
②胰蛋白酶在活化 CNBr树脂上的固化
1) 称取 100毫克活化的 CNBr 树脂至于 4毫升 ImM的盐酸中孵育;
2) 用 8mL的 ImM盐酸 (1¾值 3.0) 清洗;
3)将 0.004mg/ml, 0.02mg/ml, 0. lmg/ml, 0.5mg/ml, 2.5mg/ml的胰蛋白酶溶液分别与五份活化 CNBr 树脂混合, 在 4°C下孵育 5小时;
4) 用 0.1M Tris盐酸, 0.5M氯化钠, (pH值 8.3) 溶液清洗树脂;
5) 用 0.1M醋酸钠, 0.5M氯化钠, (pH4.0) 溶液清洗树脂;
6) 重复 4, 5步骤, 交替清洗至少 3个循环。
7) 将胰蛋白酶固化的树脂在 4°C下保存在 PBS缓冲液中 (pH7.4);
③胰蛋白酶化合物库亲和筛选
1) 取的步骤 (2) 得到的单链 DNA标记的化合物的文库与 PBS缓冲液以 1:15体积比混合 (17uL: 255uL);
2) 分别将 50 的文库样品加入胰腺牛胰蛋白酶 /CNBr树脂浆 (2.5, 0.5, 0.1, 0.02, 0.004 和 Omg/mL ) ;
3 ) 用 PBS缓冲液配制 0. 3毫克 /毫升的鯡鱼精 DNA溶液;
4) 步骤 3 ) 得到的鯡鱼精 DNA溶液与步骤 2 ) 得到的胰腺牛胰蛋白酶 /CNBr树脂浆在 25 °C下孵育 1小时;
5 ) 将步骤 4 ) 的混合物转移到 2ml的 Spin柱子中, 除去上清液;
6) 用 200 μ L PBS缓冲液洗涤树脂, 重复 4次;
7 ) 在清洗后的浆液中加入 100 无菌水, 筛选得到与胰蛋白酶配体样品。
鉴定: 取步骤 (2 ) 得到的单链 DNA标记的化合物文库与步骤 (3 ) 筛选得到的胰蛋白酶亲和样品 电泳检测。
检测结果如图 2所示, 用胰腺牛胰蛋白酶 /CNBr树脂浆筛选均得到一目标条带, 阴性对照则为空 白条带, 说明本发明筛选得到了纯化的胰蛋白酶配体样品。
(4) 测序:
取步骤(3 )筛选得到的 DNA标记的化合物, 对 DNA标记的化合物上的 DNA测序, 根据 DNA序列即 可确定该化合物的合成砌块和反应历程:
取步骤(3 )筛选得到的样品进行聚合酶链接反应(PCR) , 将编码化合物的寡核苷酸代码进行 PCR 扩增(总体积 50微升, 30个循环,每个循环 94°C 1分钟, 55°C反应 1分钟, 72°C反应 40秒), 以 5 μ L 胰蛋白酶 245库 (浓度 100 fM) 为模板。
采用 Illumina Hi seq2000高通量测序平台, 测序流程如下:
1 ) PCR扩增后的筛选寡核苷酸文库,利用 Axygen公司的 MAG-PCR-CL-250试剂盒进行纯化及质量 检测报告;
2 ) 利用 Illumina公司的 Picogreen试剂盒进行核酸定量, 得出样品核酸浓度, 进行下一步测 序文库制备;
3 ) 利用 Illumina公司的 chip-seq DNA sample试剂盒将 Hi seq2000特定测序接头 (6个碱基长 度) 接在测序样本的 5 '端和 3 ' 端, 再固定到 Hiseq2000测序仪的芯片 chip-seq plate上面, 进行 下一步桥式扩增;
4 )利用 Truseq PE Cluster Kit v3- cBot- HS 试剂盒进行核酸样本桥式扩增,在每个 chip- seq lane 上面得到足够测序使用的核酸簇 (cluster ) ;
5 ) 利用 Hiseq 2000的 laser imaging系统, Truseq SBS Kit v3- HS (200cycles)的带标签 dNTP, 记录从测序接头开始读取的每个碱基出现顺序及频次, 测试核酸样本碱基;
6) 下机取出数据, 数据处理。
测序结果如图 3所示, 序列如 SEQ ID NO. 1所示: TCAGGCAGAGGCGATAGAGGCGATAGA, 结合表 2可以 确定筛选得到的胰蛋白酶配体的结构如下:
Figure imgf000015_0001
根据该结构式, 合成该化合物后进行检测, 检测确定该化合物为胰蛋白酶抑制剂, 其抑酶活性如 图 4~5所示, IC50为 8. 1 ± 2. 1ηΜ, 说明筛选得到的化合物确实为胰蛋白酶配体。
实验结果说明, 本发明构建了一个含有 245个化合物的化学文库, 并筛选得到了一个胰蛋白酶配 体, 其具有抑制胰蛋白酶的活性, 说明本发明方法可以有效合成并筛选先导化合物。
实施例 3 本发明先导化合物的合成和筛选试剂盒
1、 本发明试剂盒的组成 (Ν个合成砌块的合成用量)
1 ) i种合成砌块与 i+2种单链 DNA片段, 将单链 DNA片段分为始端序列、 末端序列和 i种标记序 列, i种标记序列分别特意标记 i种合成砌块, 其中, i=l, 2, 3…! 1 ;
表 3 合成砌块和不同序列的单链 DNA片段
序列名称 单链 DNA的序列, 1. 5 M 合成砌块, 30 mM 5, -P03-AGATCTGATGGCGCGAGGGAAAAAAAAA 始端序列
AAA- 3' -P04
标记序列 TCAGGCAGAc
AGCATTTCAc
CGACTTAGCc
GGAGTTCAAc
CTACGAGAAc
TAGGCGTTAc
CGTTCTAATc
GGGAACGCGc
TTGTAGATCc
14
替换页 (细则第 26条) : : :
n TCTATGGGTc
Figure imgf000017_0001
GGAGCTTGTGAATTCTGGc
末端序列
2 ) 始端序列与合成砌块连接用试剂、 组合化学方法合成用试剂和单链 DNA片段连接用试剂;
表 4 始端序列与合成砌块连接用试剂
Figure imgf000017_0003
表 6 DNA片段连接用试剂
Figure imgf000017_0004
Figure imgf000017_0005
4) DNA测序用试剂。
Figure imgf000017_0002
PCR纯化 MAG-PCR-CL-250 核酸定量 Picogreen试齐 !j盒 文库构建 chip-seq DNA sample kit 桥式扩增 Truseq PE Cluster Kit v3 - cBot - HS 上机测序 Truseq SBS Kit v3- HS (200cycles) 本发明试剂盒按照本发明实施例 1提供的方法使用, 可用于先导化合物的快速合成和筛选。 综上, 与现有技术用双链 DNA标记合成砌块相比, 本发明用单链 DNA标记合成砌块, 连接过程中 单链 DNA不会互补, 不易交联, 结构稳定, 单链 DNA的 PCR扩增以及测序较双链 DNA更为方便、快速。 因此, 本发明方法可以包含多步线性组合反应, 合成的化合物文库多样性好, 库容大, 容易合成得到 目标化合物, 通过测序即可确定其合成砌块、 反应历程及化学结构, 准确高效、 操作简便、 成本低廉, 具有良好的应用前景。
工业应用性
本发明试剂盒可用于先导化合物的快速合成和筛选, 本发明方法可以包含多步线性组合反应, 合 成的化合物文库多样性好, 库容大, 容易合成得到目标化合物, 通过测序即可确定其合成砌块、 反应 历程及化学结构, 准确高效、 操作简便、 成本低廉。 具有良好的应用前景, 适合产业化应用。

Claims

权 利 要 求 书
1、 一种先导化合物的合成及筛选方法, 其特征在于: 它包括如下步骤:
( 1 ) 取原料: i种合成砌块与 (i+2 ) 种单链 DNA片段, (i+2 ) 种单链 DNA片段包括 i种标记序 列、 一种始端序列和一种末端序列, i种标记序列分别特异标记 i种合成砌块, 其中, i=l, 2, 3…! 1;
(2 ) 用组合化学方法合成化合物文库:
a、 制备初始合成砌块: 选择广 i种合成砌块, 将始端序列的一端连接在合成砌块上, 另一端与所 述合成砌块的特异标记序列串联连接, 得到 l~i种标记有一端游离的单链 DNA的初始合成砌块; b、 用步骤 a得到的初始合成砌块, 与广 i种合成砌块, 用线性组合反应的方式合成化合物, 合成 过程中, 每加入新的合成砌块, 就在与初始合成砌块相连的单链 DNA游离端串联连接所加入合成砌块 的特异标记序列, 使所述单链 DNA逐渐延长, 合成结束后, 在所述单链 DNA游离端串联连接末端序列, 即得到单链 DNA标记的化合物文库;
( 3 ) 筛选: 对 DNA标记的化合物的文库进行筛选, 选出目标化合物;
(4 ) 测序: 将步骤 (3 ) 筛选得到的目标化合物的 DNA进行测序, 确定目标化合物的合成砌块和 反应历程。
2、 根据权利要求 1所述的方法, 其特征在于 步骤 (1 ) 所述始端序列包括多聚腺苷。
3、 根据权利要求 2所述的方法, 其特征在于 所述多聚腺苷为 12~20个腺苷。
4、 根据权利要求 1所述的方法, 其特征在于 步骤 (1 ) 所述标记序列的长度不低于 6bp。
5、 根据权利要求 4所述的方法, 其特征在于 所述标记序列的长度为 9bp。
6、 根据权利要求 1所述的方法, 其特征在于 步骤 (1 ) 所述标记序列的 3 ' 端连接一个核糖核 苷酸
7、 根据权利要求 6所述的方法, 其特征在于: 所述核糖核苷酸为胞苷。
8、 根据权利要求 1所述的方法, 其特征在于: 步骤 (2 ) 中, a步骤中始端序列与初始合成砌块 连接的方法是:
将始端序列氨基化, 初始合成砌块羧基、 巯基或炔基化, 反应即得。
9、根据权利要求 1所述的方法,其特征在于: 步骤(2 )中,合成过程 pH为 8~12,温度为 (T3(rC。
10、 根据权利要求 1所述的方法, 其特征在于: 步骤(2 ) 中, 始端序列与标记序列、 标记序列之 间或者标记序列与末端序列的连接方法是: 用多核苷激酶使得单链 DNA的 5 ' -端憐酸化, 用 RNA连接 酶连接, 即可。
11、 根据权利要求 10所述的方法, 其特征在于: 所述的多核苷激酶为 T4多核苷激酶, 所述 RNA 连接酶为 T4 RNA连接酶。
12、 根据权利要求 1所述的方法, 其特征在于: 步骤 (3 ) 所述筛选方法是基于受体 -配体特异性 反应的筛选方法。
13、 一种先导化合物的合成及筛选试剂盒, 其特征在于: 它包括如下成分:
1 ) i种合成砌块与 (i+2 ) 种单链 DNA片段, (i+2 ) 种单链 DNA片段包括 i种标记序列、 一种始 端序列和一种末端序列, i种标记序列分别特异标记 i种合成砌块, 其中, i=l, 2, 3…! 1;
2 ) 始端序列与初始合成砌块连接用试剂、 组合化学方法合成用试剂和单链 DNA片段连接用试剂;
3 ) 化合物筛选用试剂;
4) DNA测序用试剂。
14、 根据权利要求 13所述的试剂盒, 其特征在于 成分 1 ) 所述始端序列包括多聚腺苷。
15、 根据权利要求 14所述的试剂盒, 其特征在于 所述多聚腺苷为 12~20个腺苷。
16、 根据权利要求 13所述的试剂盒, 其特征在于 成分 1 ) 所述标记序列的长度不低于 6bp。
17、 根据权利要求 16所述的试剂盒, 其特征在于 所述标记序列的长度为 9 bp。
18、 根据权利要求 13所述的试剂盒, 其特征在于 成分 1 ) 所述标记序列的 3 ' 端连接一个核糖 核苷酸。
19、 根据权利要求 18所述的试剂盒 其特征在于: 所述核糖核苷酸为胞苷。
20、 根据权利要求 13所述的试剂盒 其特征在于: 成分 2 )所述始端序列与合成砌块连接用试剂 包含氨基化单链 DNA的试剂以及羧基、 巯基或炔基化合成砌块的试剂。
21、 根据权利要求 13所述的试剂盒, 其特征在于: 成分 2 )所述单链 DNA片段连接用试剂包括多 核苷激酶和 RNA连接酶。
22、根据权利要求 21所述的试剂盒, 其特征在于: 所述的多核苷激酶为 T4多核苷激酶,所述 RNA 连接酶为 T4 RNA连接酶连接。
23、 一种组合化学文库, 其特征在于: 它是以合成砌块为原料, 用组合化学方法合成的组合化学 文库, 其中, 每个化合物标记了一段单链 DNA序列, 该单链 DNA序列的结构为始端序列—— i种标记 序列——末端序列, 所述 i种标记序列特异标记 i种组合化学合成过程中使用的合成砌块, 其排列顺 序与组合化学合成过程中合成砌块的加入顺序相同。
24、 根据权利要求 23所述的组合化学文库, 其特征在于: 所述标记序列的长度不低于 6bp。
25、 根据权利要求 24所述的组合化学文库, 其特征在于: 所述标记序列的长度为 9个 bp。
PCT/CN2013/089873 2012-12-20 2013-12-18 一种先导化合物的合成及筛选方法与试剂盒 WO2014094620A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/653,729 US20150321164A1 (en) 2012-12-20 2013-12-18 Method for synthesizing and screening lead compound and reagent testing kit

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210555548.3 2012-12-20
CN201210555548.3A CN103882532B (zh) 2012-12-20 2012-12-20 一种先导化合物的合成及筛选方法与试剂盒

Publications (1)

Publication Number Publication Date
WO2014094620A1 true WO2014094620A1 (zh) 2014-06-26

Family

ID=50951641

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/089873 WO2014094620A1 (zh) 2012-12-20 2013-12-18 一种先导化合物的合成及筛选方法与试剂盒

Country Status (3)

Country Link
US (1) US20150321164A1 (zh)
CN (1) CN103882532B (zh)
WO (1) WO2014094620A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3284851B1 (en) 2015-04-14 2020-12-23 Hitgen Inc. Method for solid-phase synthesis of dna-encoded chemical library
US11186836B2 (en) 2016-06-16 2021-11-30 Haystack Sciences Corporation Oligonucleotide directed and recorded combinatorial synthesis of encoded probe molecules
EP3619340A4 (en) 2017-05-02 2021-01-20 Haystack Sciences Corporation MOLECULES FOR THE VERIFICATION OF OLIGONUCLEOTIDE DIRECTED COMBINATIONAL SYNTHESIS AND METHOD OF MANUFACTURING AND USING THEREOF
CN114411267B (zh) * 2020-10-28 2024-04-16 成都先导药物开发股份有限公司 一种On-DNA反应构建β-脂肪取代酮类化合物的方法
JP2023550813A (ja) 2020-11-27 2023-12-05 成都先導薬物開発股▲フン▼有限公司 Dnaコード化化合物ライブラリー及び化合物のスクリーニング方法

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1898257A (zh) * 2003-12-17 2007-01-17 普雷西斯药品公司 合成编码文库的方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6677160B1 (en) * 1999-09-29 2004-01-13 Pharmacia & Upjohn Company Methods for creating a compound library and identifying lead chemical templates and ligands for target molecules
DE60044030D1 (de) * 1999-10-08 2010-04-29 Robert C Leif Markerkomplexe aus konjugierten polymeren
WO2011056866A2 (en) * 2009-11-05 2011-05-12 Epicentre Technologies Corporation Methods and kits for 3'-end-tagging of rna
BR112014005205A2 (pt) * 2011-09-07 2017-03-21 X-Chem Inc métodos para etiquetar bibliotecas codificadas com dna

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1898257A (zh) * 2003-12-17 2007-01-17 普雷西斯药品公司 合成编码文库的方法

Also Published As

Publication number Publication date
CN103882532B (zh) 2015-10-21
US20150321164A1 (en) 2015-11-12
CN103882532A (zh) 2014-06-25

Similar Documents

Publication Publication Date Title
Lennon et al. A scalable, fully automated process for construction of sequence-ready barcoded libraries for 454
Oberley et al. High-throughput screening of chromatin immunoprecipitates using CpG-island microarrays
Lee et al. A microfluidic oligonucleotide synthesizer
US9783843B2 (en) Encoded self-assembling chemical libraries (esachel)
Buller et al. Design and synthesis of a novel DNA-encoded chemical library using Diels-Alder cycloadditions
Chan et al. Novel selection methods for DNA-encoded chemical libraries
Melkko et al. Lead discovery by DNA-encoded chemical libraries
WO2014094620A1 (zh) 一种先导化合物的合成及筛选方法与试剂盒
EP2859139B1 (en) Dna barcoding of designer mononucleosome and chromatin array libraries for the profiling of chromatin readers, writers, erasers, and modulators thereof
JP2003520050A (ja) タンパク質分析のための高感度多重化診断アッセイ法
WO2022021279A1 (zh) 多种核酸共标记支持物及其制作方法与应用
CN107002080B (zh) 一种基于多重pcr的目标区域富集方法和试剂
EP1625230A2 (en) Selection and evolution of chemical libraries
Adey Tagmentation-based single-cell genomics
WO2014094621A1 (zh) 一种先导化合物的合成及筛选方法与试剂盒
US20210207128A1 (en) Aptamer of nattokinase and method for screening the aptamer
Blackwell [41] Selection of protein binding sites from random nucleic acid sequences
Sternlieb et al. Efficient Generation of Genome-wide Libraries for Protein–ligand Screens Using Gibson Assembly
Cotten et al. mRNA display-based selections using synthetic peptide and natural protein libraries
WO2014086037A1 (zh) 构建核酸测序文库的方法及其应用
Khan et al. Different approaches for protein engineering in industrial biotechnology
CN104774923A (zh) 一种测定转录调控复合体的方法
EP3898968A1 (en) Nucleic acid encoded chemical libraries
Ha et al. Library construction for high-throughput mobile element identification and genotyping
US20100256012A1 (en) Flexible extraction method for the production of sequence-specific molecule libraries

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13865062

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 14653729

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13865062

Country of ref document: EP

Kind code of ref document: A1