WO2018059088A1 - Dna encoding molecular library and compound screening method - Google Patents

Dna encoding molecular library and compound screening method Download PDF

Info

Publication number
WO2018059088A1
WO2018059088A1 PCT/CN2017/093564 CN2017093564W WO2018059088A1 WO 2018059088 A1 WO2018059088 A1 WO 2018059088A1 CN 2017093564 W CN2017093564 W CN 2017093564W WO 2018059088 A1 WO2018059088 A1 WO 2018059088A1
Authority
WO
WIPO (PCT)
Prior art keywords
dna
protein target
compound
protein
molecular library
Prior art date
Application number
PCT/CN2017/093564
Other languages
French (fr)
Chinese (zh)
Inventor
李笑宇
Original Assignee
深圳劲宇生物科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳劲宇生物科技有限公司 filed Critical 深圳劲宇生物科技有限公司
Publication of WO2018059088A1 publication Critical patent/WO2018059088A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B30/00Methods of screening libraries
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B30/00Methods of screening libraries
    • C40B30/04Methods of screening libraries by measuring the ability to specifically bind a target molecule, e.g. antibody-antigen binding, receptor-ligand binding
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures

Definitions

  • the invention belongs to the technical field of biochemistry, and particularly relates to a DNA coding molecular library and a compound screening method.
  • the DNA-encoded molecular library enables high-throughput screening of millions or even billions of scales in a very small system.
  • the screening results can be decoded and analyzed by PCR amplification and DNA sequencing to obtain lead compounds for further drug development.
  • DNA coding molecular libraries have been widely recognized and applied in the field of new drug research and development, and become an important supporting technology in the development of new drugs.
  • the DNA-encoding molecular library is used for drug screening.
  • Most of the targets used are purified proteins. After the protein target is modified, it is immobilized on a solid phase such as magnetic beads, and then incubated with the molecular library. Small molecules that cannot bind to the target protein are eluted and separated from small molecules bound to the protein target. The bound small molecule is eluted, PCR amplified, and DNA sequenced under protein denaturation conditions to read the coding sequence to obtain the chemical structure of the small molecule bound to the target.
  • the use of purified, immobilized protein targets limits the range of applications of DNA-encoding molecular libraries.
  • the object of the present invention is to overcome the above-mentioned deficiencies of the prior art, and to provide a DNA coding molecular library and a compound screening method, aiming at solving the existing DNA coding molecular library for drug screening, only using purified and immobilized protein targets, thereby The technical problem of the application of the DNA-encoding molecular library is limited.
  • the invention provides a DNA encoding molecular library comprising a short chain of 8-12 bases DNA, one end of the short-stranded DNA is linked to a photocrosslinking group, and the short-stranded DNA is combined with a PCR primer region of a DNA-encoding molecule in the DNA-encoding molecule library.
  • Another aspect of the present invention provides a method for screening a compound encoding a DNA encoding molecule, comprising the steps of:
  • the above-mentioned DNA encoding molecular library is incubated with a protein target, followed by light treatment, and then added by exonuclease to decompose the DNA tag bound to the protein target;
  • the DNA tag is subjected to PCR amplification and DNA sequencing, and a compound that binds to the protein target is read.
  • PC-DNA A unique short-chain DNA (PC-DNA) is introduced into the DNA-encoding molecular library provided by the present invention, since all DNA coding has the same PCR primer region at both ends in the DNA-encoding molecular library, PC-DNA is capable of binding to the PCR primer region of all compounds. In this way, it is more advantageous to screen a compound that binds to a protein target from a library of DNA-encoding molecules.
  • the screening method of the compound of the DNA encoding molecular library completely eliminates the requirement for purification and immobilization of the protein target in the screening of the traditional DNA encoding molecular library.
  • the method no longer relies on physical elution to separate compounds that bind to the protein target and not to the protein target, but instead uses ligand-induced selective enzymatic degradation to remove compounds that do not bind to the protein target from the system. Therefore, in principle, the method can be applied to any protein target.
  • the method has been experimentally proven to be useful for the screening of DNA-encoding molecular libraries of various complex systems such as non-loaded proteins, protein complexes, living cell surface membrane proteins, and cell lysates.
  • 1 is a conventional DNA encoding molecular library compound screening method in which a protein target is purified and modified;
  • FIG. 2 is a schematic flow chart of a method for screening a DNA-encoding molecular library compound of the present invention
  • Example 3 is a schematic view showing the flow of DNA coding carried out by a small molecule capable of binding to a protein target in Example 1 of the present invention
  • Example 4a is a molecular library having 1027 different series synthesized in Example 2 of the present invention, which comprises a GLCBS, a CBS, and a CBM in addition to a large number of background sequences; wherein GLCBS is a protein target CA-II High binding force small molecule, CBS is medium combined with small molecules, while CBM is not combined with protein target CA-II, used as a negative control;
  • 4b is a data result of screening a molecular library compound screening method according to Example 2 of the present invention.
  • Figure 5a is a molecular library having 1027 different series synthesized in Example 3 of the present invention, the library comprising 4800 DNA-encoded macrocyclic polypeptide compounds, and a GLCBS for positive control;
  • Fig. 5b is a data result of screening the molecular library compound screening method according to Example 3 of the present invention.
  • the embodiments of the present invention provide a DNA encoding molecular library comprising a short-chain DNA (PC-DNA) having 8-12 bases, the structure of which is shown below, and one end of the short-chain DNA is linked A photocrosslinking group, and the short-stranded DNA is combined with a PCR primer region of a DNA-encoding molecule in a DNA-encoding molecule library.
  • PC-DNA short-chain DNA
  • a unique short-chain DNA (PC-DNA) is introduced, since in the molecular library, all DNA coding has the same PCR primer region at both ends, The PC-DNA is capable of binding to the PCR primer region of the DNA-encoding molecule of all compounds (as shown in Figure 2); thus, the use of a compound that binds to a protein target from a library of DNA-encoding molecules is utilized.
  • the photocrosslinking group includes at least one of phenyl azide, benzophenone, and propyl acridine.
  • the photocrosslinking group is capable of sending a cross-linking reaction with the protein target to protect the DNA tag of the small molecule capable of binding to the protein target; and if the small molecule does not bind to the protein target, the protein target is cross-linked with the PC-DNA Union cannot happen.
  • the photocrosslinking group of the present invention is not limited thereto.
  • the embodiment of the present invention further provides a screening method for a compound of a DNA encoding molecular library, and the flow thereof is as shown in FIG. 2, and includes the following steps:
  • S01 incubated the above-mentioned DNA-encoding molecular library with a protein target, and then irradiating the light, and then adding a DNA exonuclease to degrade a DNA tag that binds to the protein target;
  • the screening method of the above-mentioned DNA encoding molecular library completely eliminates the requirement for purification and immobilization of protein targets in the screening of traditional DNA encoding molecular libraries (as shown in FIG. 1).
  • the method no longer relies on physical elution to separate compounds that bind to the protein target and not to the protein target, but instead uses ligand-induced selective enzymatic degradation to remove compounds that do not bind to the protein target from the system. Therefore, in principle, the method can be applied to any protein target.
  • the method has been experimentally proven to be useful for the screening of DNA-encoding molecular libraries of various complex systems such as non-loaded proteins, protein complexes, living cell surface membrane proteins, and cell lysates.
  • the protein target may be a purified protein and/or a non-purified protein, the protein target may also be a modified protein or a non-modified protein, and the protein target may also be a immobilized protein and/or a non-solid protein.
  • Loading protein In principle, the method can be applied to any protein target. After incubating a DNA-encoding molecular library with PC-DNA with a non-loaded, unmodified protein target, a portion of the small molecule in the DNA-encoding library can bind to the protein target, causing light on the PC-DNA The cross-linking group is located near the protein target.
  • the photo-crosslinking group can send a cross-linking reaction with the protein target to protect the DNA tag of the small molecule that can bind to the protein target; if the small molecule does not When the protein target binds, cross-linking of the protein target with PC-DNA cannot occur.
  • the exonuclease I (exoI, exonuclease I) was added to the system. Among them, there is no molecular library compound cross-linked with a protein target, and the DNA tag carried by it will be degraded by exoI; while the molecular library compound cross-linked with the protein target, the DNA tag is protected by the protein target itself, Degraded by exoI. Therefore, after exoI treatment, the remaining DNA in the system is a DNA tag of a small molecule capable of binding to a protein target.
  • the protein target is CA-II
  • the compound bound to the protein target is GLCBS.
  • PCR expansion and DNA sequencing are performed, and the chemical structure of the selected small molecule can be read.
  • PCR amplification was performed by fluorescent quantitative PCR amplification.
  • Real-time quantitative PCR (qPCR) analysis can give C T values. The smaller the C T value, the more DNA is retained; the larger the C T value, the more the DNA has been hydrolyzed by the enzyme and difficult to be amplified by PCR.
  • This example selects a series of small molecule compounds whose structures are as shown above. They are ligated to the PC-DNA of the present invention. After these small molecule-DNA conjugates were incubated with their known protein targets, illumination, digestion, and real-time quantitative PCR (qPCR) analysis were performed according to the procedure in FIG. qPCR can give C T value, the smaller the C T value, the more DNA remains; the larger the C T value, the more the DNA has been hydrolyzed by the enzyme, and it is difficult to be amplified by PCR.
  • Table 1 shows the data obtained, wherein the ⁇ C T values: No C T value of the target protein - protein C T values are target; the larger the value, the stronger the protective effect of the protein target.
  • K d dissociation constant
  • K i inhibition constant, both are constants that quantitatively indicate the ability of small molecules to bind or inhibit proteins
  • nM means nanomolar, nanomolar.
  • a library of DNA-encoding molecules The library has 1027 different sequences. Except for a large number of background sequences In addition to the column, it contains a GLCBS, a CBS, and a CBM.
  • GLCBS is a high binding small molecule of protein target CA-II
  • CBS is moderately bound to small molecules
  • CBM is not combined with protein target CA-II and used as a negative control.
  • the PC-DNA specific to the present invention was introduced into the molecular library for synthesis, and the non-fixed and unmodified protein target CA-II was screened according to the method flow of FIG. 2, and the selected molecules were selected. Library members were subjected to PCR extension and DNA sequencing, and the data is shown in Figure 4b.
  • a library of DNA-encoding molecules contains 4,800 DNA-encoded macrocyclic polypeptide compounds, and a GLCBS for positive control, as shown in Figure 5a.

Abstract

Disclosed is a DNA encoding molecular library, comprising a short-strand DNA of 8-12 bases, wherein one end of the short-strand DNA is connected to an optical cross-linking group, and the short-strand DNA binds to the PCR primer region of the DNA encoding molecule in the DNA encoding molecule library. Also disclosed is a compound screening method of the DNA encoding molecule library: performing a light treatment after incubating the DNA encoding molecule library with a protein target point, then adding a DNA exonuclease for degradation, and obtaining a DNA tag binding to the protein target point; performing a PCR amplification and a DNA sequencing of the DNA tag, and reading the compound binding to the protein target point. The method removes the compound not binding to the protein target point from the system using the selective enzymic degradation method induced by a ligand, and thus can be applied to any protein target point.

Description

DNA编码分子库及化合物筛选方法DNA coding molecular library and compound screening method 技术领域Technical field
本发明属于生物化学技术领域,具体涉及一种DNA编码分子库及化合物筛选方法。The invention belongs to the technical field of biochemistry, and particularly relates to a DNA coding molecular library and a compound screening method.
背景技术Background technique
当代药物研发中,针对疾病的药物靶点,通过构建大型的候选药物分子库,进行高通量、大规模筛选是新药研发中不可或缺的手段。当今世界上主要的制药公司均拥有大型的分子库和大规模的筛选平台用于新药研发。然而,传统的分子库和筛选平台成本高昂、技术门槛高、管理运行复杂,严重制约高通量筛选的发展和应用。近5年来,DNA编码分子库技术逐渐发展起来,成为药物研发中的新兴筛选方法。在DNA编码分子库中,每一个化合物与一个特异性的DNA链相连接,成为一个特异的条形码,实现对化合物的特异性编码。DNA编码分子库能够在极小的体系中,实现千万乃至上亿级的高通量筛选。筛选结果可以通过PCR扩增和DNA测序进行解码分析,以获得先导化合物用于进一步药物研发。近年来,DNA编码分子库已经得到新药研发领域中的广泛认可和应用,成为新药研发中的一种重要支撑技术。In contemporary drug development, high-throughput, large-scale screening is an indispensable tool for the development of new drugs by constructing a large library of drug candidates for drug targets. The major pharmaceutical companies in the world today have large molecular libraries and large-scale screening platforms for new drug development. However, traditional molecular libraries and screening platforms are costly, have high technical thresholds, and are complicated to manage, seriously restricting the development and application of high-throughput screening. In the past five years, DNA-encoding molecular library technology has gradually developed and become an emerging screening method in drug development. In the library of DNA-encoding molecules, each compound is linked to a specific DNA strand to form a specific barcode that enables specific encoding of the compound. The DNA-encoded molecular library enables high-throughput screening of millions or even billions of scales in a very small system. The screening results can be decoded and analyzed by PCR amplification and DNA sequencing to obtain lead compounds for further drug development. In recent years, DNA coding molecular libraries have been widely recognized and applied in the field of new drug research and development, and become an important supporting technology in the development of new drugs.
使用DNA编码分子库进行药物筛选,所使用的靶点大多为纯化后的蛋白质,蛋白质靶点经修饰后,固载在磁珠之类的固相之上,再与分子库进行孵育。不能与靶点蛋白结合的小分子被洗脱,与结合在蛋白靶点上的小分子相分离。再在蛋白质变性条件下,对结合的小分子进行洗脱,PCR扩增,以及DNA测序,从而读出编码序列,获得与靶点结合的小分子的化学结构。然而,使用纯化、固载的蛋白靶点限制了DNA编码分子库的应用范围,很多其它类型的药物靶点,例如膜蛋白、蛋白质复合体、活细胞、病理组织等,由于较难或无法纯化和固载,并不能够用于DNA编码分子库的筛选,成为本领域中的一个瓶颈问题。The DNA-encoding molecular library is used for drug screening. Most of the targets used are purified proteins. After the protein target is modified, it is immobilized on a solid phase such as magnetic beads, and then incubated with the molecular library. Small molecules that cannot bind to the target protein are eluted and separated from small molecules bound to the protein target. The bound small molecule is eluted, PCR amplified, and DNA sequenced under protein denaturation conditions to read the coding sequence to obtain the chemical structure of the small molecule bound to the target. However, the use of purified, immobilized protein targets limits the range of applications of DNA-encoding molecular libraries. Many other types of drug targets, such as membrane proteins, protein complexes, living cells, pathological tissues, etc., are difficult or impossible to purify. And immobilization, which cannot be used for the screening of DNA-encoding molecular libraries, has become a bottleneck problem in the field.
技术问题technical problem
本发明的目的在于克服现有技术的上述不足,提供一种DNA编码分子库及化合物筛选方法,旨在解决现有DNA编码分子库进行药物筛选只能使用纯化、固载的蛋白靶点,从而限制了DNA编码分子库的应用的技术问题。The object of the present invention is to overcome the above-mentioned deficiencies of the prior art, and to provide a DNA coding molecular library and a compound screening method, aiming at solving the existing DNA coding molecular library for drug screening, only using purified and immobilized protein targets, thereby The technical problem of the application of the DNA-encoding molecular library is limited.
技术解决方案Technical solution
为实现上述发明目的,本发明采用的技术方案如下:In order to achieve the above object, the technical solution adopted by the present invention is as follows:
本发明一方面提供一种DNA编码分子库,包括一种具有8-12个碱基的短链 DNA,所述短链DNA的一端连接有光交联基团,且所述短链DNA与所述DNA编码分子库中DNA编码分子的PCR引物区相结合。In one aspect, the invention provides a DNA encoding molecular library comprising a short chain of 8-12 bases DNA, one end of the short-stranded DNA is linked to a photocrosslinking group, and the short-stranded DNA is combined with a PCR primer region of a DNA-encoding molecule in the DNA-encoding molecule library.
本发明另一方面提供一种DNA编码分子库的化合物筛选方法,包括如下步骤:Another aspect of the present invention provides a method for screening a compound encoding a DNA encoding molecule, comprising the steps of:
将上述DNA编码分子库与蛋白质靶点孵育后光照处理,然后加入DNA外切酶降解得与所述蛋白质靶点结合的DNA标签;The above-mentioned DNA encoding molecular library is incubated with a protein target, followed by light treatment, and then added by exonuclease to decompose the DNA tag bound to the protein target;
将所述DNA标签进行PCR扩增和DNA测序,读取与所述蛋白质靶点相结合的化合物。The DNA tag is subjected to PCR amplification and DNA sequencing, and a compound that binds to the protein target is read.
有益效果Beneficial effect
本发明提供的DNA编码分子库中,引入了一种特有的短链DNA(PC-DNA),由于在DNA编码分子库中,所有的DNA编码在两个末端具有相同的PCR引物区,所以该PC-DNA能够结合在所有化合物的PCR引物区。这样,更有利于从DNA编码分子库中筛选与蛋白质靶点结合的化合物。A unique short-chain DNA (PC-DNA) is introduced into the DNA-encoding molecular library provided by the present invention, since all DNA coding has the same PCR primer region at both ends in the DNA-encoding molecular library, PC-DNA is capable of binding to the PCR primer region of all compounds. In this way, it is more advantageous to screen a compound that binds to a protein target from a library of DNA-encoding molecules.
本发明提供的DNA编码分子库的化合物筛选方法,彻底摆脱了传统DNA编码分子库筛选中对蛋白质靶点纯化和固载的要求。本方法不再依赖于物理洗脱来分离结合蛋白质靶点和不结合蛋白质靶点的化合物,而是利用配体诱导的选择性酶降解的方法从体系中去除不与蛋白质靶点结合的化合物。因此从原理上讲,本方法能够应用于任何蛋白质靶点。本方法已经通过实验证明能够用于非固载蛋白质、蛋白质复合体、活细胞表面膜蛋白、细胞裂解液等多种复杂体系的DNA编码分子库的筛选。The screening method of the compound of the DNA encoding molecular library provided by the invention completely eliminates the requirement for purification and immobilization of the protein target in the screening of the traditional DNA encoding molecular library. The method no longer relies on physical elution to separate compounds that bind to the protein target and not to the protein target, but instead uses ligand-induced selective enzymatic degradation to remove compounds that do not bind to the protein target from the system. Therefore, in principle, the method can be applied to any protein target. The method has been experimentally proven to be useful for the screening of DNA-encoding molecular libraries of various complex systems such as non-loaded proteins, protein complexes, living cell surface membrane proteins, and cell lysates.
附图说明DRAWINGS
图1为传统DNA编码分子库化合物筛选方法,其中,蛋白质靶点进行了纯化修饰;1 is a conventional DNA encoding molecular library compound screening method in which a protein target is purified and modified;
图2为本发明DNA编码分子库化合物筛选方法的流程示意图;2 is a schematic flow chart of a method for screening a DNA-encoding molecular library compound of the present invention;
图3为本发明实施例1中验证能与蛋白质靶点结合的小分子所带的DNA编码不会被酶切水解的流程示意图;3 is a schematic view showing the flow of DNA coding carried out by a small molecule capable of binding to a protein target in Example 1 of the present invention;
图4a为本发明实施例2合成的具有1027个不同系列的分子库,该分子库除了大量的背景序列之外,包含了一个GLCBS、一个CBS、一个CBM;其中GLCBS为蛋白质靶点CA-II的高结合力小分子,CBS为中等结合了小分子,而CBM则不与蛋白质靶点CA-II相结合,用作于阴性对照;4a is a molecular library having 1027 different series synthesized in Example 2 of the present invention, which comprises a GLCBS, a CBS, and a CBM in addition to a large number of background sequences; wherein GLCBS is a protein target CA-II High binding force small molecule, CBS is medium combined with small molecules, while CBM is not combined with protein target CA-II, used as a negative control;
图4b为本发明实施例2的分子库化合物筛选方法进行筛选后的数据结果; 4b is a data result of screening a molecular library compound screening method according to Example 2 of the present invention;
图5a为本发明实施例3合成的具有1027个不同系列的分子库,该分子库包含4800个DNA编码的大环多肽化合物,以及一个GLCBS用于阳性控制;Figure 5a is a molecular library having 1027 different series synthesized in Example 3 of the present invention, the library comprising 4800 DNA-encoded macrocyclic polypeptide compounds, and a GLCBS for positive control;
图5b为本发明实施例3的分子库化合物筛选方法进行筛选后的数据结果。Fig. 5b is a data result of screening the molecular library compound screening method according to Example 3 of the present invention.
本发明的实施方式Embodiments of the invention
为了使本发明要解决的技术问题、技术方案及有益效果更加清楚明白,以下结合附图和实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
一方面,本发明实施例提供了一种DNA编码分子库,包括一种具有8-12个碱基的短链DNA(PC-DNA),其结构如下所示,该短链DNA的一端连接有光交联基团,且该短链DNA与DNA编码分子库中DNA编码分子的PCR引物区相结合。In one aspect, the embodiments of the present invention provide a DNA encoding molecular library comprising a short-chain DNA (PC-DNA) having 8-12 bases, the structure of which is shown below, and one end of the short-chain DNA is linked A photocrosslinking group, and the short-stranded DNA is combined with a PCR primer region of a DNA-encoding molecule in a DNA-encoding molecule library.
Figure PCTCN2017093564-appb-000001
Figure PCTCN2017093564-appb-000001
本发明实施例提供的上述DNA编码分子库中,引入了一种特有的短链DNA(PC-DNA),由于在分子库中,所有的DNA编码在两个末端具有相同的PCR引物区,所以该PC-DNA能够结合在所有化合物的DNA编码分子的PCR引物区(如图2所示);这样,有利用从DNA编码分子库中筛选与蛋白质靶点结合的化合物。In the above-mentioned DNA-encoding molecular library provided by the embodiment of the present invention, a unique short-chain DNA (PC-DNA) is introduced, since in the molecular library, all DNA coding has the same PCR primer region at both ends, The PC-DNA is capable of binding to the PCR primer region of the DNA-encoding molecule of all compounds (as shown in Figure 2); thus, the use of a compound that binds to a protein target from a library of DNA-encoding molecules is utilized.
进一步地,在本发明实施例的DNA编码分子库中,光交联基团包括苯基叠氮、二苯甲酮、丙基吖啶中的至少一种。光交联基团能够与蛋白质靶点发送交联反应以保护能与蛋白质靶点结合的小分子的DNA标签;而如果小分子不与蛋白质靶点结合,则蛋白质靶点与PC-DNA的交联不能发生。本发明的光交联基团不局限于此。Further, in the DNA-encoding molecular library of the embodiment of the present invention, the photocrosslinking group includes at least one of phenyl azide, benzophenone, and propyl acridine. The photocrosslinking group is capable of sending a cross-linking reaction with the protein target to protect the DNA tag of the small molecule capable of binding to the protein target; and if the small molecule does not bind to the protein target, the protein target is cross-linked with the PC-DNA Union cannot happen. The photocrosslinking group of the present invention is not limited thereto.
另一方面,本发明实施例还提供了一种DNA编码分子库的化合物筛选方法,其流程如图2所示,包括如下步骤:In another aspect, the embodiment of the present invention further provides a screening method for a compound of a DNA encoding molecular library, and the flow thereof is as shown in FIG. 2, and includes the following steps:
S01:将上述DNA编码分子库与蛋白质靶点孵育后光照处理,然后加入DNA外切酶降解得与该蛋白质靶点结合的DNA标签;S01: incubated the above-mentioned DNA-encoding molecular library with a protein target, and then irradiating the light, and then adding a DNA exonuclease to degrade a DNA tag that binds to the protein target;
S02:将上述DNA标签进行PCR扩增和DNA测序,读取与蛋白质靶点相 结合的化合物。S02: performing PCR amplification and DNA sequencing on the above DNA tag, and reading with a protein target Combined compounds.
本发明实施例提供的上述DNA编码分子库的化合物筛选方法,彻底摆脱了传统DNA编码分子库筛选中对蛋白质靶点纯化和固载的要求(如图1所示)。本方法不再依赖于物理洗脱来分离结合蛋白质靶点和不结合蛋白质靶点的化合物,而是利用配体诱导的选择性酶降解的方法从体系中去除不与蛋白质靶点结合的化合物。因此从原理上讲,本方法能够应用于任何蛋白质靶点。本方法已经通过实验证明能够用于非固载蛋白质、蛋白质复合体、活细胞表面膜蛋白、细胞裂解液等多种复杂体系的DNA编码分子库的筛选。The screening method of the above-mentioned DNA encoding molecular library provided by the embodiments of the present invention completely eliminates the requirement for purification and immobilization of protein targets in the screening of traditional DNA encoding molecular libraries (as shown in FIG. 1). The method no longer relies on physical elution to separate compounds that bind to the protein target and not to the protein target, but instead uses ligand-induced selective enzymatic degradation to remove compounds that do not bind to the protein target from the system. Therefore, in principle, the method can be applied to any protein target. The method has been experimentally proven to be useful for the screening of DNA-encoding molecular libraries of various complex systems such as non-loaded proteins, protein complexes, living cell surface membrane proteins, and cell lysates.
进一步地,在上述步骤S01中,蛋白质靶点可以为纯化蛋白质和/或非纯化蛋白质,蛋白质靶点还可以为修饰蛋白质或非修饰蛋白质,蛋白质靶点还可以为固载蛋白质和/或非固载蛋白质。从原理上讲,本方法能够应用于任何蛋白质靶点。将具有PC-DNA的DNA编码分子库与非固载、无修饰的蛋白质靶点孵育之后,该DNA编码分子库中的一部分小分子能够与蛋白质靶点结合,使得PC-DNA上所带的光交联基团处于蛋白质靶点附近,在光照条件下,光交联基团能够与蛋白质靶点发送交联反应以保护能与蛋白质靶点结合的小分子的DNA标签;而如果小分子不与蛋白质靶点结合,则蛋白质靶点与PC-DNA的交联不能发生。体系中加入DNA外切酶exonuclease I(exoI,外切核酸酶I)。其中,没有和蛋白质靶点交联的分子库化合物,其所带的DNA标签将被exoI所降解;而与蛋白质靶点交联的分子库化合物,其DNA标签被蛋白靶点本身所保护,不被exoI所降解。因此在exoI处理之后,体系中余下的DNA即为能够与蛋白靶点结合的小分子的DNA标签。Further, in the above step S01, the protein target may be a purified protein and/or a non-purified protein, the protein target may also be a modified protein or a non-modified protein, and the protein target may also be a immobilized protein and/or a non-solid protein. Loading protein. In principle, the method can be applied to any protein target. After incubating a DNA-encoding molecular library with PC-DNA with a non-loaded, unmodified protein target, a portion of the small molecule in the DNA-encoding library can bind to the protein target, causing light on the PC-DNA The cross-linking group is located near the protein target. Under light conditions, the photo-crosslinking group can send a cross-linking reaction with the protein target to protect the DNA tag of the small molecule that can bind to the protein target; if the small molecule does not When the protein target binds, cross-linking of the protein target with PC-DNA cannot occur. The exonuclease I (exoI, exonuclease I) was added to the system. Among them, there is no molecular library compound cross-linked with a protein target, and the DNA tag carried by it will be degraded by exoI; while the molecular library compound cross-linked with the protein target, the DNA tag is protected by the protein target itself, Degraded by exoI. Therefore, after exoI treatment, the remaining DNA in the system is a DNA tag of a small molecule capable of binding to a protein target.
进一步地,在上述步骤S01中,蛋白质靶点为CA-II,且与蛋白质靶点相结合的化合物为GLCBS。Further, in the above step S01, the protein target is CA-II, and the compound bound to the protein target is GLCBS.
进一步地,在上述步骤S02中,进行PCR扩展和DNA测序,即可读出被选择的小分子的化学结构。PCR扩增为荧光定量PCR扩增。实时定量PCR(qPCR)的分析能够给出CT值,CT值越小,表明存留DNA越多;CT值越大,表明DNA已经被酶切水解,难以被PCR扩增出来。Further, in the above step S02, PCR expansion and DNA sequencing are performed, and the chemical structure of the selected small molecule can be read. PCR amplification was performed by fluorescent quantitative PCR amplification. Real-time quantitative PCR (qPCR) analysis can give C T values. The smaller the C T value, the more DNA is retained; the larger the C T value, the more the DNA has been hydrolyzed by the enzyme and difficult to be amplified by PCR.
本发明先后进行过多次试验,现举一部分试验结果作为参考对发明进行进一步详细描述,下面结合具体实施例进行详细说明。The present invention has been subjected to a number of tests in succession, and a part of the test results are now described in further detail as a reference, and will be described in detail below in conjunction with specific embodiments.
实施例1:Example 1:
本实施例验证小分子和蛋白质靶点结合时,其所带的DNA编码不会被酶切 水解。This example demonstrates that when a small molecule and a protein target are combined, the DNA coding carried by the small molecule is not cleaved by the enzyme. hydrolysis.
Figure PCTCN2017093564-appb-000002
Figure PCTCN2017093564-appb-000002
本实施例选取了一系列的小分子化合物,其结构如上所示。将它们连接到本发明PC-DNA上。将这些小分子-DNA偶合物与它们的已知蛋白质靶点相孵育结合之后,再按照图3中的流程进行光照、酶切,以及实时定量PCR(qPCR)的分析。qPCR能够给出CT值,CT值越小,表明存留DNA越多;CT值越大,表明DNA已经被酶切水解,难以被PCR扩增出来。表1为所获得的数据,其中ΔCT值为:没有蛋白质靶点的CT值-有蛋白质靶点的CT值;该值越大,表明蛋白质靶点的保护作用越强。表1中:Kd:解离常数,Ki:抑制常数,两者均为定量表示小分子与蛋白质结合或抑制能力的常数;nM指nanomolar,纳摩尔的意思。This example selects a series of small molecule compounds whose structures are as shown above. They are ligated to the PC-DNA of the present invention. After these small molecule-DNA conjugates were incubated with their known protein targets, illumination, digestion, and real-time quantitative PCR (qPCR) analysis were performed according to the procedure in FIG. qPCR can give C T value, the smaller the C T value, the more DNA remains; the larger the C T value, the more the DNA has been hydrolyzed by the enzyme, and it is difficult to be amplified by PCR. Table 1 shows the data obtained, wherein the ΔC T values: No C T value of the target protein - protein C T values are target; the larger the value, the stronger the protective effect of the protein target. In Table 1: K d : dissociation constant, K i : inhibition constant, both are constants that quantitatively indicate the ability of small molecules to bind or inhibit proteins; nM means nanomolar, nanomolar.
表1Table 1
Figure PCTCN2017093564-appb-000003
Figure PCTCN2017093564-appb-000003
上述表1的数据表明,对于多种类型的小分子,其与蛋白质靶点的结合都能够有效的实现对DNA编码的保护。即便是作用力较弱的编号4和5,也能有很好的效果。本数据在原理上验证了本实施例所提出的方法的可行性。The data in Table 1 above shows that for many types of small molecules, their binding to protein targets can effectively protect DNA coding. Even the weaker numbers 4 and 5 have a good effect. This data verifies the feasibility of the method proposed in this embodiment in principle.
实施例2:Example 2:
一个DNA编码分子库:该分子库具有1027个不同序列。除了大量的背景序 列之外,包含了一个GLCBS、一个CBS、一个CBM。A library of DNA-encoding molecules: The library has 1027 different sequences. Except for a large number of background sequences In addition to the column, it contains a GLCBS, a CBS, and a CBM.
如图4a所示:其中GLCBS为蛋白质靶点CA-II的高结合力小分子,CBS为中等结合了小分子,而CBM则不与蛋白质靶点CA-II相结合,用作于阴性对照。本实施例将本发明特有的PC-DNA引入至该分子库进行合成后,按照图2的方法流程,进行了与非固载、无修饰的蛋白质靶点CA-II的筛选,筛选出的分子库成员进行了PCR扩展和DNA测序,数据显示在图4b之中。As shown in Figure 4a: GLCBS is a high binding small molecule of protein target CA-II, CBS is moderately bound to small molecules, and CBM is not combined with protein target CA-II and used as a negative control. In this example, the PC-DNA specific to the present invention was introduced into the molecular library for synthesis, and the non-fixed and unmodified protein target CA-II was screened according to the method flow of FIG. 2, and the selected molecules were selected. Library members were subjected to PCR extension and DNA sequencing, and the data is shown in Figure 4b.
从图4b的数据图可以看出:经过第一轮的筛选,强结合力的GLCBS被富集了38.4倍,中结合力的CBS被富集了7.2倍,而不结合CA-II靶点的CBM基本没有被富集。本筛选方法的一个特点是,第一轮筛选出来的分子库,能够直接进入下一轮进行二次筛选,实现进一步的富集。图4b中显示,经过第二轮的筛选,GLCBS和CBS得到了约200倍的富集。It can be seen from the data plot of Figure 4b that after the first round of screening, the strong binding force of GLCBS is enriched by 38.4 times, and the medium binding CBS is enriched by 7.2 times, without binding to the CA-II target. CBM is basically not enriched. One of the characteristics of this screening method is that the molecular library selected in the first round can directly enter the next round for secondary screening to achieve further enrichment. Figure 4b shows that after a second round of screening, GLCBS and CBS were approximately 200-fold enriched.
实施例3:Example 3:
一个DNA编码分子库:该分子库包含4800个DNA编码的大环多肽化合物,以及一个GLCBS用于阳性控制,结构图5a所示。A library of DNA-encoding molecules: This library contains 4,800 DNA-encoded macrocyclic polypeptide compounds, and a GLCBS for positive control, as shown in Figure 5a.
本实施例将本发明特有的PC-DNA引入至该分子库进行合成后,按照图2的方法流程,经过分子库合成、对非固载无修饰的蛋白质靶点CA-II的筛选之后,进行PCR扩增和DNA测序来解码,最终数据以散点图的方式显示在图5b之中。从图5b中可知:阳性控制GLCBS被富集了近100倍(图5b上);而分子库中的其它化合物,由于缺乏和CA-II相结合的化学结构,基本没有被富集(图5b下)。从图5b中的数据验证了本发明所提出来的分子库筛选方法针对于非固载、无修饰蛋白质靶点筛选的可行性。In this embodiment, after the PC-DNA specific to the present invention is introduced into the molecular library for synthesis, according to the method flow of FIG. 2, after molecular library synthesis and screening of the non-fixed unmodified protein target CA-II, PCR amplification and DNA sequencing were used to decode, and the final data was shown in Figure 5b as a scatter plot. It can be seen from Fig. 5b that the positive control GLCBS is enriched nearly 100 times (Fig. 5b); while other compounds in the molecular library are not substantially enriched due to the lack of chemical structure combined with CA-II (Fig. 5b) under). From the data in Figure 5b, the feasibility of the molecular library screening method proposed by the present invention for non-fixed, unmodified protein target screening was verified.
以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。 The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. Within the scope.

Claims (10)

  1. 一种DNA编码分子库,其特征在于,包括一种具有8-12个碱基的短链DNA,所述短链DNA的一端连接有光交联基团,且所述短链DNA与所述DNA编码分子库中DNA编码分子的PCR引物区相结合。A DNA-encoding molecular library comprising a short-stranded DNA having 8-12 bases, one end of the short-stranded DNA linked to a photo-crosslinking group, and the short-stranded DNA and the The PCR primer region of the DNA-encoding molecule in the DNA-encoding molecule library is combined.
  2. 如权利要求1所述的DNA编码分子库,其特征在于,所述光交联基团包括苯基叠氮、二苯甲酮、丙基吖啶中的至少一种。The DNA-encoding molecular library according to claim 1, wherein the photocrosslinking group comprises at least one of phenyl azide, benzophenone, and propyl acridine.
  3. 一种DNA编码分子库的化合物筛选方法,其特征在于,包括如下步骤:A method for screening a compound of a DNA encoding molecular library, comprising the steps of:
    将权利要求1所述的DNA编码分子库与蛋白质靶点孵育后光照处理,然后加入DNA外切酶降解,得与所述蛋白质靶点结合的DNA标签;The DNA encoding molecular library of claim 1 is incubated with a protein target, followed by light treatment, and then added by exonuclease degradation to obtain a DNA tag that binds to the protein target;
    将所述DNA标签进行PCR扩增和DNA测序,读取与所述蛋白质靶点相结合的化合物。The DNA tag is subjected to PCR amplification and DNA sequencing, and a compound that binds to the protein target is read.
  4. 如权利要求3所述的DNA编码分子库的化合物筛选方法,其特征在于,所述光交联基团包括苯基叠氮、二苯甲酮、丙基吖啶中的至少一种。The method for screening a compound of a DNA-encoding molecular library according to claim 3, wherein the photocrosslinking group comprises at least one of phenyl azide, benzophenone, and propyl acridine.
  5. 如权利要求3所述的DNA编码分子库的化合物筛选方法,其特征在于,所述蛋白质靶点为纯化蛋白质和/或非纯化蛋白质。The method for screening a compound of a DNA-encoding molecular library according to claim 3, wherein the protein target is a purified protein and/or a non-purified protein.
  6. 如权利要求3所述的DNA编码分子库的化合物筛选方法,其特征在于,所述蛋白质靶点为修饰蛋白质或非修饰蛋白质.The method for screening a compound of a DNA encoding molecular library according to claim 3, wherein the protein target is a modified protein or a non-modified protein.
  7. 如权利要求3所述的DNA编码分子库的化合物筛选方法,其特征在于,所述蛋白质靶点为固载蛋白质和/或非固载蛋白质。The method for screening a compound of a DNA-encoding molecular library according to claim 3, wherein the protein target is a immobilized protein and/or a non-loaded protein.
  8. 如权利要求3-7任一项所述的DNA编码分子库的化合物筛选方法,其特征在于,所述蛋白质靶点为CA-II,且与所述蛋白质靶点相结合的化合物为GLCBS。The method for screening a compound of a DNA-encoding molecular library according to any one of claims 3 to 7, wherein the protein target is CA-II, and the compound that binds to the protein target is GLCBS.
  9. 如权利要求3-7任一项所述的DNA编码分子库的化合物筛选方法,其特征在于,所述PCR扩增为荧光定量PCR扩增。The method for screening a compound of a DNA-encoding molecular library according to any one of claims 3 to 7, wherein the PCR amplification is fluorescence quantitative PCR amplification.
  10. 如权利要求3-7任一项所述的DNA编码分子库的化合物筛选方法,其特征在于,所述DNA外切酶为exonuclease I。 The method for screening a compound of a DNA-encoding molecular library according to any one of claims 3 to 7, wherein the exonuclease is exonuclease I.
PCT/CN2017/093564 2016-09-30 2017-07-19 Dna encoding molecular library and compound screening method WO2018059088A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610871861.6 2016-09-30
CN201610871861.6A CN107130299A (en) 2016-09-30 2016-09-30 A kind of DNA encoding library of molecules and method for screening compound

Publications (1)

Publication Number Publication Date
WO2018059088A1 true WO2018059088A1 (en) 2018-04-05

Family

ID=59720645

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/093564 WO2018059088A1 (en) 2016-09-30 2017-07-19 Dna encoding molecular library and compound screening method

Country Status (2)

Country Link
CN (1) CN107130299A (en)
WO (1) WO2018059088A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113403690A (en) * 2021-06-21 2021-09-17 吉林大学 DNA coding compound library drug molecule fishing method

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108178790B (en) * 2018-01-18 2021-09-07 上海药明康德新药开发有限公司 Method for separating cell membrane to screen transmembrane protein by using DNA coding compound library
WO2019149198A1 (en) * 2018-01-31 2019-08-08 成都先导药物开发股份有限公司 Dna-encoded compound library and compound screening method
WO2019157693A1 (en) * 2018-02-14 2019-08-22 深圳劲宇生物科技有限公司 Dna-coded molecular library and widely-applicable compound screening method
WO2019157692A1 (en) * 2018-02-14 2019-08-22 深圳劲宇生物科技有限公司 Dna-encoded molecular library and target restriction-free compound screening method
CN109023535B (en) * 2018-07-19 2020-11-17 上海药明康德新药开发有限公司 Method for screening DNA coding compound by using antibody to non-recognition marker protein or cell lysate
CN110105408B (en) * 2019-05-09 2022-11-08 上海药明康德新药开发有限公司 Synthetic method of On-DNA aryl azide in construction of DNA coding compound library
CN112143783A (en) * 2019-06-27 2020-12-29 成都先导药物开发股份有限公司 Method for identifying complex biological system by identifiable mixture
CN112430598A (en) * 2019-08-26 2021-03-02 成都先导药物开发股份有限公司 Method for screening living cell membrane protein of DNA coding compound library
CN111304753B (en) * 2020-03-24 2020-12-01 深圳奇点药物科技有限公司 Method and kit for screening DNA encoding molecular library
CN112760720B (en) * 2020-12-23 2022-10-21 重庆大学 Method for synthesizing DNA coding compound library through single-strand and double-strand controllable transformation of DNA

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008057555A2 (en) * 2006-11-06 2008-05-15 Beth Israel Deaconess Medical Center Identification and use of small molecules to modulate pdef transcription factor function and to treat pdef transcription factor associated diseases

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101565859A (en) * 2009-06-10 2009-10-28 吉林大学 Novel molecular library and construction method thereof

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008057555A2 (en) * 2006-11-06 2008-05-15 Beth Israel Deaconess Medical Center Identification and use of small molecules to modulate pdef transcription factor function and to treat pdef transcription factor associated diseases

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
CAO, C. ET AL.: "A DNA-templated synthesis of encoded small molecules by DNA self-assembly", CHEM. COMMUN., vol. 50, no. 75, 28 July 2014 (2014-07-28), pages 10997 - 10999, XP055473848 *
LI, G. ET AL.: "Design, preparation, and selection of DNA-encoded dynamic libraries", CHEMICAL SCIENCE, vol. 6, 9 November 2015 (2015-11-09), pages 7097 - 7104, XP055473781 *
LI, G. ET AL.: "Novel encoding methods for DNA-templated chemical libraries", CURRENT OPINION IN CHEMICAL BIOLOGY, vol. 26, 28 January 2015 (2015-01-28), pages 25 - 33, XP055473833 *
LIU, Y. ET AL.: "Photoaffinity labeling of transcription factors by DNA-templa-ted crosslinking", CHEMICAL SCIENCE, vol. 6, 1 October 2014 (2014-10-01), pages 745 - 751, XP055473829 *
ZHAO, P. ET AL.: "Selection of DNA-Encoded Small Molecule Libraries Against Unmodified and Non-Inmobilized Protein Targets", ANGEW. CHEM., vol. 53, 7 July 2014 (2014-07-07), pages 10056 - 10059, XP055473842 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113403690A (en) * 2021-06-21 2021-09-17 吉林大学 DNA coding compound library drug molecule fishing method
CN113403690B (en) * 2021-06-21 2022-07-19 吉林大学 DNA coding compound library drug molecule fishing method

Also Published As

Publication number Publication date
CN107130299A (en) 2017-09-05

Similar Documents

Publication Publication Date Title
WO2018059088A1 (en) Dna encoding molecular library and compound screening method
Wheeler et al. Advances and challenges in the detection of transcriptome‐wide protein–RNA interactions
Zhu et al. Comparative performance of the BGISEQ-500 and Illumina HiSeq4000 sequencing platforms for transcriptome analysis in plants
AU2018358057B2 (en) Kits for analysis using nucleic acid encoding and/or label
JP7253833B2 (en) Methods and Kits Using Nucleic Acid Encoding and/or Labeling
Gao et al. CIRI: an efficient and unbiased algorithm for de novo circular RNA identification
AU2017259794B2 (en) Macromolecule analysis employing nucleic acid encoding
JP6438126B2 (en) Method and reagent kit for constructing nucleic acid single-stranded circular library
KR102282863B1 (en) Methods of sequencing nucleic acids in mixtures and compositions related thereto
CN107109698B (en) RNA STITCH sequencing: assay for direct mapping RNA-RNA interaction in cells
JP2015521462A (en) Chromatin array library for profiling DNA barcodes of designer mononucleosomes and chromatin readers, writers, erasers and their modulators
CN113272441A (en) Methods and compositions for preparing nucleic acids that preserve spatially contiguous continuity information
Carapelli et al. The complete mitochondrial genome of the Antarctic springtail Cryptopygus antarcticus (Hexapoda: Collembola)
CN111304753B (en) Method and kit for screening DNA encoding molecular library
Parmakelis et al. The transcriptome of a “sleeping” invader: de novo assembly and annotation of the transcriptome of aestivating Cornu aspersum
JP7203276B2 (en) Methods and kits for constructing sequencing libraries based on target regions of methylated DNA
WO2021115319A1 (en) Screening method for dna encoded compound library simultaneously binding to multiple biological targets
WO2019157692A1 (en) Dna-encoded molecular library and target restriction-free compound screening method
WO2019157693A1 (en) Dna-coded molecular library and widely-applicable compound screening method
EP2762563B1 (en) Method for detecting protein that interacts with target substance
WO2021182587A1 (en) Target module for use in search for novel peptide aptamer utilizing photocrosslinkable base, linker for use in search for novel peptide aptamer, and method for searching for novel peptide aptamer using said target module or said linker
Zhang et al. DHX36 binding induces RNA structurome remodeling and regulates RNA abundance via m6A/YTHDF1
Su The impact of transposable elements on the epigenome: A look at the functional TE-ome
JP2023548857A (en) Hairpin oligonucleotide and its use
Park Investigating the extent and function of RNA editing in animal transcriptomes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17854542

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 12/07/2019)

122 Ep: pct application non-entry in european phase

Ref document number: 17854542

Country of ref document: EP

Kind code of ref document: A1