WO2022077258A1 - Free energy perturbation network design method based on machine learning - Google Patents
Free energy perturbation network design method based on machine learning Download PDFInfo
- Publication number
- WO2022077258A1 WO2022077258A1 PCT/CN2020/120845 CN2020120845W WO2022077258A1 WO 2022077258 A1 WO2022077258 A1 WO 2022077258A1 CN 2020120845 W CN2020120845 W CN 2020120845W WO 2022077258 A1 WO2022077258 A1 WO 2022077258A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- machine learning
- model
- std
- learning model
- free energy
- Prior art date
Links
- 238000010801 machine learning Methods 0.000 title claims abstract description 27
- 238000000034 method Methods 0.000 title claims abstract description 23
- 238000013461 design Methods 0.000 title claims abstract description 13
- 238000004364 calculation method Methods 0.000 claims abstract description 16
- 238000012360 testing method Methods 0.000 claims abstract description 16
- 238000012549 training Methods 0.000 claims abstract description 14
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 6
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 6
- 150000003384 small molecules Chemical class 0.000 claims description 41
- 238000000329 molecular dynamics simulation Methods 0.000 description 3
- 238000010219 correlation analysis Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000007637 random forest analysis Methods 0.000 description 2
- 229940126586 small molecule drug Drugs 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000003066 decision tree Methods 0.000 description 1
- 238000012938 design process Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C10/00—Computational theoretical chemistry, i.e. ICT specially adapted for theoretical aspects of quantum chemistry, molecular mechanics, molecular dynamics or the like
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/50—Molecular design, e.g. of drugs
Definitions
- the invention belongs to the technical field of molecular dynamics simulation, in particular to a method for designing a free energy perturbation network based on machine learning.
- the binding free energy ( ⁇ G) of small molecule drugs and target proteins plays a very important role in guiding the design of small molecule drugs.
- FEP free energy perturbation
- each node represents a small molecule
- each edge represents the difference ( ⁇ G) of binding free energy between two small molecules.
- the core problem is to judge whether two small molecules should be connected, so that the uncertainty (std) of ⁇ G calculated by this edge is minimized.
- the existing methods mainly have the following problems
- the purpose of the present invention is to provide a free energy perturbation network design method based on machine learning, using a large number of calculation results of ⁇ G, using the method of machine learning to train the model, and designing free energy perturbation more quickly. network to improve computational accuracy.
- the present invention provides the following technical solutions:
- the design method of free energy perturbation network based on machine learning includes the following steps:
- the two-dimensional structure feature descriptor of the small molecule includes molecular mass, topological connection information, and the number of flexible dihedral angles.
- this method can handle a large number of scenarios where the binding free energy of small molecules needs to be calculated and predicted, and can quickly design the required perturbation network;
- the number of molecules is gradually increased with the calculation. More data can be collected for model training, and the generalization ability and accuracy of the model can be improved.
- Fig. 1 is the flow chart of the present invention
- Fig. 2 is the correlation analysis result of embodiment Tanimoto similarity score and std
- Fig. 3 is the correlation analysis result between RFscore and std of the embodiment.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biophysics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Artificial Intelligence (AREA)
- Analytical Chemistry (AREA)
- Genetics & Genomics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- Medical Informatics (AREA)
- Medicinal Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Crystallography & Structural Chemistry (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
A free energy perturbation network design method based on machine learning. The method comprises the following steps: S1, preparing a micromolecule data set required for calculation; S2, preparing a micromolecule/protein input file; S3, calculating △△G and std between different micromolecule pairs by using FEP; S4, extracting feature descriptors of micromolecules; S5, preparing a training set and test set required for a machine learning model; S6, constructing the machine learning model; S7, training the machine learning model; and S8, compiling error statistics on the test set. By means of the method, a scenario in which binding free energy of a large number of micromolecules needs to be calculated and predicted can be processed, and a required perturbation network can be rapidly designed; and the correlation between an obtained result and std is higher, such that the calculation precision can be effectively improved. In addition, along with an increase in the number of calculated molecules, more data can be collected for model training, and the generalization capability and precision of a model are improved.
Description
本发明属于分子动力学模拟技术领域,具体涉及一种基于机器学习的自由能微扰网络设计方法。The invention belongs to the technical field of molecular dynamics simulation, in particular to a method for designing a free energy perturbation network based on machine learning.
小分子药物与靶点蛋白的结合自由能(△G),对于小分子药物的设计有着十分重要的指导作用。自由能微扰方法(free energy perturbation,FEP)作为一种基于分子动力学(molecular dynamics,MD)的计算方法,能够对于结合自由能进行预测。当预测任务涉及多个小分子时,自由能微扰网络的设计十分必要,能够有效的提高预测的精度。设计的自由能微扰网络图中,每个节点代表小分子,而每条边代表两个小分子之间结合自由能的差值(△△G)。在网络的设计过程中,核心问题是判断两个小分子是否应该连接,使得这条边计算得到的△△G不确定性(std)最小。现有设计方法大多按照下述原则进行判断,以确定两个小分子是否应该连接:The binding free energy (ΔG) of small molecule drugs and target proteins plays a very important role in guiding the design of small molecule drugs. As a calculation method based on molecular dynamics (MD), free energy perturbation (FEP) can predict binding free energy. When the prediction task involves multiple small molecules, the design of the free energy perturbation network is very necessary, which can effectively improve the prediction accuracy. In the designed free energy perturbation network diagram, each node represents a small molecule, and each edge represents the difference (△△G) of binding free energy between two small molecules. In the design process of the network, the core problem is to judge whether two small molecules should be connected, so that the uncertainty (std) of △△G calculated by this edge is minimized. Most of the existing design methods make judgments according to the following principles to determine whether two small molecules should be linked:
(1)基于经验的人工判断;(1) Manual judgment based on experience;
(2)基于谷本相似系数(Tanimoto similarity score)判断。(2) Judgment based on Tanimoto similarity score.
现有方法主要存在以下问题The existing methods mainly have the following problems
1、基于经验的人工判断:需要计算的小分子数目为n时,所有能够连接的边总数,即可以进行FEP计算的分子对总数为n(n-1)/2。随着小分子数目的增加,需要进行判断的边数会迅速增加。这种情况下几乎不可能通过人工的方法进行识别判断。1. Manual judgment based on experience: When the number of small molecules to be calculated is n, the total number of edges that can be connected, that is, the total number of molecule pairs that can be calculated by FEP is n(n-1)/2. As the number of small molecules increases, the number of edges that need to be judged increases rapidly. In this case, it is almost impossible to identify and judge by manual methods.
2、基于Tanimoto similarity score判断:使用这一指标时,通常尽量将相似的小分子(Tanimoto similarity score越接近1,两个小分子越相似)进行连接。相似系数是基于分子指纹进行计算,考虑的小分子的特征十分有限。同时,按照这种方法判断得到的相似的分子,并不能保证计算得到的△△G不确定性小。2. Judgment based on Tanimoto similarity score: When using this indicator, usually try to connect similar small molecules (the closer the Tanimoto similarity score is to 1, the more similar the two small molecules are). Similarity coefficients are calculated based on molecular fingerprints, considering very limited characteristics of small molecules. At the same time, similar molecules judged by this method cannot guarantee that the uncertainty of the calculated △△G is small.
发明内容SUMMARY OF THE INVENTION
针对上述技术问题,本发明的目的在于提供一种基于机器学习的自由能微扰网络设计方法,利用大量△△G的计算结果,使用机器学习的方法训练模型,更加快捷的设计自由能微扰网络,提高计算精度。In view of the above technical problems, the purpose of the present invention is to provide a free energy perturbation network design method based on machine learning, using a large number of calculation results of △△G, using the method of machine learning to train the model, and designing free energy perturbation more quickly. network to improve computational accuracy.
为实现上述目的,本发明提供如下技术方案:To achieve the above object, the present invention provides the following technical solutions:
基于机器学习的自由能微扰网络设计方法,包括以下步骤:The design method of free energy perturbation network based on machine learning includes the following steps:
S1、准备计算所需的小分子数据集;S1. Prepare the small molecule data set required for the calculation;
S2、准备小分子/蛋白质输入文件;S2. Prepare small molecule/protein input files;
S3、利用FEP计算不同小分子对之间的△△G及std;S3. Use FEP to calculate △△G and std between different small molecule pairs;
S4、提取小分子的特征描述符;S4. Extract feature descriptors of small molecules;
S5、准备训练集和测试集;S5, prepare training set and test set;
S6、构建机器学习模型;S6. Build a machine learning model;
S7、训练机器学习模型;S7. Train the machine learning model;
S8、测试集统计误差。S8, test set statistical error.
具体包括以下步骤:Specifically include the following steps:
S1、准备计算所需的小分子数据集:准备数据集时保证体系的多样性,以免出现模型对于部分体系的过拟合;S1. Prepare the small molecule data set required for the calculation: ensure the diversity of the system when preparing the data set, so as to avoid overfitting of the model to some systems;
S2、准备小分子/蛋白质输入文件:根据FEP计算的需求,生成用于FEP计算的初始文件;S2. Prepare small molecule/protein input files: According to the requirements of FEP calculation, generate an initial file for FEP calculation;
S3、利用FEP计算不同小分子对之间的△△G及std:设计小分子之间必须的分子对,利用FEP计算多次△△G结果,进而得到对应的std值;S3. Use FEP to calculate △△G and std between different small molecule pairs: design the necessary molecular pairs between small molecules, use FEP to calculate the results of △△G multiple times, and then obtain the corresponding std value;
S4、提取小分子的特征描述符:提取小分子的二维结构特征描述符;S4. Extract feature descriptors of small molecules: extract the two-dimensional structure feature descriptors of small molecules;
S5、准备训练集和测试集:收集FEP计算得到的分子对的std结果及对应小分子的二维特征描述符,并将收集到的数据按照一定比例划分为训练集和测试集;S5. Prepare training set and test set: collect std results of molecule pairs calculated by FEP and two-dimensional feature descriptors of corresponding small molecules, and divide the collected data into training set and test set according to a certain proportion;
S6、构建机器学习模型:将得到的小分子的二维描述符作为输入,分子对的std结果作为输出构建机器学习模型;S6. Build a machine learning model: use the obtained two-dimensional descriptor of the small molecule as input, and the std result of the molecule pair as output to build a machine learning model;
S7、训练机器学习模型:选取适当的参数对于模型进行训练,根据不同类型的机器学习模型设置不同的参数;S7. Train the machine learning model: select appropriate parameters to train the model, and set different parameters according to different types of machine learning models;
S8、测试集统计误差:训练完成后在测试集上统计误差,根据统计的误差对于模型参数进行优化,得到最佳的模型。S8. Statistical error of the test set: after the training is completed, the error is counted on the test set, and the model parameters are optimized according to the statistical error to obtain the best model.
其中,步骤S4中,所述的小分子的二维结构特征描述符,包括分子质量、拓扑连接信息、柔性二面角数量。Wherein, in step S4, the two-dimensional structure feature descriptor of the small molecule includes molecular mass, topological connection information, and the number of flexible dihedral angles.
与现有技术相比,本发明的有益效果是:Compared with the prior art, the beneficial effects of the present invention are:
1、自动化设计微扰网络1. Automatically design perturbation network
相对于人工设计微扰网络的方法,本方法能够处理大量小分子结合自由能需要计算预测的场景,能够快速的设计出需要的微扰网络;Compared with the method of manually designing the perturbation network, this method can handle a large number of scenarios where the binding free energy of small molecules needs to be calculated and predicted, and can quickly design the required perturbation network;
2、提高自由能微扰的计算精度2. Improve the calculation accuracy of free energy perturbation
相对于基于Tanimoto similarity score的方法,本方法得到的结果与std的相关性更高,进而能够有效的提高计算精度。Compared with the method based on Tanimoto similarity score, the results obtained by this method have higher correlation with std, which can effectively improve the calculation accuracy.
3、易于拓展3. Easy to expand
当计算过程确定之后,随着计算的分子数量逐渐增加。能够收集到更多的数据用于模型的训练,提高模型的泛化能力和精度。After the calculation process is determined, the number of molecules is gradually increased with the calculation. More data can be collected for model training, and the generalization ability and accuracy of the model can be improved.
图1为本发明的流程图;Fig. 1 is the flow chart of the present invention;
图2为实施例Tanimoto similarity score和std的相关性分析结果;Fig. 2 is the correlation analysis result of embodiment Tanimoto similarity score and std;
图3为实施例RFscore与std的相关性分析结果。Fig. 3 is the correlation analysis result between RFscore and std of the embodiment.
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
按照图1所示的流程图,本实施例选取8个激酶体系共200个小分子,设计300个分子对,计算5次△△G的std作为模型的输出。According to the flowchart shown in FIG. 1 , in this example, a total of 200 small molecules from 8 kinase systems were selected, 300 molecular pairs were designed, and the std of ΔΔG was calculated 5 times as the output of the model.
比较Tanimoto similarity score和std的相关性,如图2所示,可见,两者的相关性很弱,肯德尔相关系数(Kendall rank correlation coefficient)为-0.113。显然,通过这一标准构建的微扰网络将引进比较大的不确定性。Comparing the correlation between Tanimoto similarity score and std, as shown in Figure 2, it can be seen that the correlation between the two is very weak, and the Kendall rank correlation coefficient (Kendall rank correlation coefficient) is -0.113. Obviously, the perturbation network constructed by this standard will introduce relatively large uncertainty.
在本实施例中,提取各小分子的二维特征值,每个小分子有77个特征值。并通过按照7:3的比例划分训练集和测试集。选择随机森林作为本实例的机器学习模型。同时,对于最大特征数、决策树最大深度、内部节点在划分所需最小样本数、叶节点最小样本数等多个模型参数的不同组合,得到最佳的随机森林模型。利用该模型在训练集上得到误差为0.14,在测试集上得到的误差为0.31.同时,利用现有模型得到的RF score与前述Tanimoto similarity score进行同样的相关性结果分析,如图3所示。得到的肯德尔相关系数为0.41。In this embodiment, two-dimensional eigenvalues of each small molecule are extracted, and each small molecule has 77 eigenvalues. And by dividing the training set and the test set according to the ratio of 7:3. Random Forest is chosen as the machine learning model for this example. At the same time, for different combinations of multiple model parameters, such as the maximum number of features, the maximum depth of the decision tree, the minimum number of samples required for internal node division, and the minimum number of samples of leaf nodes, the best random forest model is obtained. Using this model, the error obtained on the training set is 0.14, and the error obtained on the test set is 0.31. At the same time, the RF score obtained by using the existing model and the aforementioned Tanimoto similarity score are analyzed for the same correlation results, as shown in Figure 3 . The resulting Kendall correlation coefficient was 0.41.
由此可见,该方法得到的结果能够对于大量小分子进行自由能微扰网络设计,同时相对于Tanimoto similarity score方法能够提高精度。It can be seen that the results obtained by this method can be used to design a free energy perturbation network for a large number of small molecules, and at the same time, the accuracy can be improved compared with the Tanimoto similarity score method.
尽管已经示出和描述了本发明的实施例,对于本领域的普通技术人员而言,可以理解在不脱离本发明的原理和精神的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由所附权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, and substitutions can be made in these embodiments without departing from the principle and spirit of the invention and modifications, the scope of the invention is defined by the appended claims and their equivalents.
Claims (3)
- 基于机器学习的自由能微扰网络设计方法,其特征在于,包括以下步骤:The design method of free energy perturbation network based on machine learning is characterized in that it includes the following steps:S1、准备计算所需的小分子数据集;S1. Prepare the small molecule data set required for the calculation;S2、准备小分子/蛋白质输入文件;S2. Prepare small molecule/protein input files;S3、利用FEP计算不同小分子对之间的△△G及std;S3. Use FEP to calculate △△G and std between different small molecule pairs;S4、提取小分子的特征描述符;S4. Extract feature descriptors of small molecules;S5、准备机器学习模型所需的训练集和测试集;S6、构建机器学习模型;S5, prepare the training set and test set required for the machine learning model; S6, build the machine learning model;S7、训练机器学习模型;S7. Train the machine learning model;S8、测试集统计误差。S8, test set statistical error.
- 根据权利要求1所述的基于机器学习的自由能微扰网络设计方法,其特征在于,具体包括以下步骤:The method for designing a free energy perturbation network based on machine learning according to claim 1, characterized in that it specifically comprises the following steps:S1、准备计算所需的小分子数据集:准备数据集时保证体系的多样性,以免出现模型对于部分体系的过拟合;S1. Prepare the small molecule data set required for the calculation: ensure the diversity of the system when preparing the data set, so as to avoid overfitting of the model to some systems;S2、准备小分子/蛋白质输入文件:根据FEP计算的需求,生成用于FEP计算的初始文件;S2. Prepare small molecule/protein input files: According to the requirements of FEP calculation, generate an initial file for FEP calculation;S3、利用FEP计算不同小分子对之间的△△G及std:设计小分子之间必须的分子对,利用FEP计算多次△△G结果,进而得到对应的std值;S3. Use FEP to calculate △△G and std between different small molecule pairs: design the necessary molecular pairs between small molecules, use FEP to calculate the results of △△G multiple times, and then obtain the corresponding std value;S4、提取小分子的特征描述符:提取小分子的二维结构特征描述符;S4. Extract feature descriptors of small molecules: extract the two-dimensional structure feature descriptors of small molecules;S5、准备机器学习模型所需的训练集和测试集:收集FEP计算得到的分子对的std结果及对应小分子的二维特征描述符,并将收集到的数据按照一定比例划分为训练集和测试集;S5. Prepare the training set and test set required for the machine learning model: collect the std results of the molecule pairs calculated by FEP and the two-dimensional feature descriptors of the corresponding small molecules, and divide the collected data into training sets and test set;S6、构建机器学习模型:将得到的小分子的二维描述符作为输入,分子对的std结果作为输出构建机器学习模型;S6. Build a machine learning model: use the obtained two-dimensional descriptors of small molecules as input, and the std results of the molecule pairs as output to build a machine learning model;S7、训练机器学习模型:选取适当的参数对于模型进行训练,根据不同类型的机器学习模型设置不同的参数;S7. Train the machine learning model: select appropriate parameters to train the model, and set different parameters according to different types of machine learning models;S8、测试集统计误差:训练完成后在测试集上统计误差,根据统计的误差对于模型参数进行优化,得到最佳的模型。S8. Statistical error of the test set: after the training is completed, the error is counted on the test set, and the model parameters are optimized according to the statistical error to obtain the best model.
- 根据权利要求2所述的基于机器学习的自由能微扰网络设计方法,其特征在于,步骤S4中,所述的小分子的二维结构特征描述符,包括分子质量、拓扑连接信息、柔性二面角数量。The method for designing a free energy perturbation network based on machine learning according to claim 2, wherein in step S4, the two-dimensional structure feature descriptor of the small molecule includes molecular mass, topological connection information, flexibility two Number of face angles.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/120845 WO2022077258A1 (en) | 2020-10-14 | 2020-10-14 | Free energy perturbation network design method based on machine learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/120845 WO2022077258A1 (en) | 2020-10-14 | 2020-10-14 | Free energy perturbation network design method based on machine learning |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022077258A1 true WO2022077258A1 (en) | 2022-04-21 |
Family
ID=81207574
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/120845 WO2022077258A1 (en) | 2020-10-14 | 2020-10-14 | Free energy perturbation network design method based on machine learning |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2022077258A1 (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101059520A (en) * | 2007-05-29 | 2007-10-24 | 南京大学 | Organic ER affinity quick screening and forecast method based on receptor binding mode |
WO2019040444A1 (en) * | 2017-08-22 | 2019-02-28 | Schrodinger, Inc. | Methods and systems for calculating free energy differences using an alchemical restraint potential |
CN110728047A (en) * | 2019-10-08 | 2020-01-24 | 中国工程物理研究院化工材料研究所 | Computer-aided design system for predicting energetic molecules based on machine learning performance |
EP3614388A1 (en) * | 2018-08-20 | 2020-02-26 | Fujitsu Limited | Method and device for calculating binding free energy, and program |
CN110910951A (en) * | 2019-11-19 | 2020-03-24 | 江苏理工学院 | Method for predicting protein and ligand binding free energy based on progressive neural network |
CN111279419A (en) * | 2017-10-17 | 2020-06-12 | 富士胶片株式会社 | Feature amount calculation method, feature amount calculation program, feature amount calculation device, screening method, screening program, screening device, compound creation method, compound creation program, and compound creation device |
-
2020
- 2020-10-14 WO PCT/CN2020/120845 patent/WO2022077258A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101059520A (en) * | 2007-05-29 | 2007-10-24 | 南京大学 | Organic ER affinity quick screening and forecast method based on receptor binding mode |
WO2019040444A1 (en) * | 2017-08-22 | 2019-02-28 | Schrodinger, Inc. | Methods and systems for calculating free energy differences using an alchemical restraint potential |
CN111279419A (en) * | 2017-10-17 | 2020-06-12 | 富士胶片株式会社 | Feature amount calculation method, feature amount calculation program, feature amount calculation device, screening method, screening program, screening device, compound creation method, compound creation program, and compound creation device |
EP3614388A1 (en) * | 2018-08-20 | 2020-02-26 | Fujitsu Limited | Method and device for calculating binding free energy, and program |
CN110728047A (en) * | 2019-10-08 | 2020-01-24 | 中国工程物理研究院化工材料研究所 | Computer-aided design system for predicting energetic molecules based on machine learning performance |
CN110910951A (en) * | 2019-11-19 | 2020-03-24 | 江苏理工学院 | Method for predicting protein and ligand binding free energy based on progressive neural network |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI705388B (en) | Feature selection method, device and equipment for constructing machine learning model | |
CN112364880B (en) | Omics data processing method, device, equipment and medium based on graph neural network | |
Xiao et al. | Towards confidence in the truth: A bootstrapping based truth discovery approach | |
CN102768659B (en) | Method and system for identifying repeated account | |
CN108806718B (en) | Audio identification method based on analysis of ENF phase spectrum and instantaneous frequency spectrum | |
CN104538035B (en) | A kind of method for distinguishing speek person and system based on Fisher super vectors | |
CN103761965B (en) | A kind of sorting technique of instrument signal | |
CN111429977B (en) | Novel molecular similarity search algorithm based on attention of graph structure | |
CN109656545A (en) | A kind of software development activity clustering method based on event log | |
Gupta et al. | Will I get in? modeling the graduate admission process for American universities | |
CN116561542B (en) | Model optimization training system, method and related device | |
CN106127244A (en) | A kind of parallelization K means improved method and system | |
CN110491443B (en) | lncRNA protein correlation prediction method based on projection neighborhood non-negative matrix decomposition | |
CN114783539A (en) | Traditional Chinese medicine component analysis method and system based on spectral clustering | |
CN114036276A (en) | Information extraction method, device, equipment and storage medium | |
WO2022077258A1 (en) | Free energy perturbation network design method based on machine learning | |
CN106874286B (en) | Method and device for screening user characteristics | |
CN106911512A (en) | Link Forecasting Methodology and system based on game in commutative figure | |
CN112330368B (en) | Data processing method, system, storage medium and terminal equipment | |
CN110825583A (en) | Energy efficiency qualitative assessment technology for multi-index fusion of cloud data center | |
CN112102889B (en) | Free energy perturbation network design method based on machine learning | |
CN105447158A (en) | Graph based automatic mining method for synonym set in patent search log | |
CN115438153A (en) | Training method and device for intention matching degree analysis model | |
CN115472181A (en) | Method, device and storage medium for singing recognition based on feature fusion and clustering | |
CN110298545B (en) | Patent evaluation method, system and medium based on neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20957039 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20957039 Country of ref document: EP Kind code of ref document: A1 |