WO2022077258A1 - Free energy perturbation network design method based on machine learning - Google Patents

Free energy perturbation network design method based on machine learning Download PDF

Info

Publication number
WO2022077258A1
WO2022077258A1 PCT/CN2020/120845 CN2020120845W WO2022077258A1 WO 2022077258 A1 WO2022077258 A1 WO 2022077258A1 CN 2020120845 W CN2020120845 W CN 2020120845W WO 2022077258 A1 WO2022077258 A1 WO 2022077258A1
Authority
WO
WIPO (PCT)
Prior art keywords
machine learning
model
std
learning model
free energy
Prior art date
Application number
PCT/CN2020/120845
Other languages
French (fr)
Chinese (zh)
Inventor
李治鹏
温书豪
杨明俊
林志雄
邹俊杰
马健
赖力鹏
Original Assignee
深圳晶泰科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳晶泰科技有限公司 filed Critical 深圳晶泰科技有限公司
Priority to PCT/CN2020/120845 priority Critical patent/WO2022077258A1/en
Publication of WO2022077258A1 publication Critical patent/WO2022077258A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C10/00Computational theoretical chemistry, i.e. ICT specially adapted for theoretical aspects of quantum chemistry, molecular mechanics, molecular dynamics or the like
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/50Molecular design, e.g. of drugs

Definitions

  • the invention belongs to the technical field of molecular dynamics simulation, in particular to a method for designing a free energy perturbation network based on machine learning.
  • the binding free energy ( ⁇ G) of small molecule drugs and target proteins plays a very important role in guiding the design of small molecule drugs.
  • FEP free energy perturbation
  • each node represents a small molecule
  • each edge represents the difference ( ⁇ G) of binding free energy between two small molecules.
  • the core problem is to judge whether two small molecules should be connected, so that the uncertainty (std) of ⁇ G calculated by this edge is minimized.
  • the existing methods mainly have the following problems
  • the purpose of the present invention is to provide a free energy perturbation network design method based on machine learning, using a large number of calculation results of ⁇ G, using the method of machine learning to train the model, and designing free energy perturbation more quickly. network to improve computational accuracy.
  • the present invention provides the following technical solutions:
  • the design method of free energy perturbation network based on machine learning includes the following steps:
  • the two-dimensional structure feature descriptor of the small molecule includes molecular mass, topological connection information, and the number of flexible dihedral angles.
  • this method can handle a large number of scenarios where the binding free energy of small molecules needs to be calculated and predicted, and can quickly design the required perturbation network;
  • the number of molecules is gradually increased with the calculation. More data can be collected for model training, and the generalization ability and accuracy of the model can be improved.
  • Fig. 1 is the flow chart of the present invention
  • Fig. 2 is the correlation analysis result of embodiment Tanimoto similarity score and std
  • Fig. 3 is the correlation analysis result between RFscore and std of the embodiment.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biophysics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Molecular Biology (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Medical Informatics (AREA)
  • Medicinal Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A free energy perturbation network design method based on machine learning. The method comprises the following steps: S1, preparing a micromolecule data set required for calculation; S2, preparing a micromolecule/protein input file; S3, calculating △△G and std between different micromolecule pairs by using FEP; S4, extracting feature descriptors of micromolecules; S5, preparing a training set and test set required for a machine learning model; S6, constructing the machine learning model; S7, training the machine learning model; and S8, compiling error statistics on the test set. By means of the method, a scenario in which binding free energy of a large number of micromolecules needs to be calculated and predicted can be processed, and a required perturbation network can be rapidly designed; and the correlation between an obtained result and std is higher, such that the calculation precision can be effectively improved. In addition, along with an increase in the number of calculated molecules, more data can be collected for model training, and the generalization capability and precision of a model are improved.

Description

基于机器学习的自由能微扰网络设计方法Design method of free energy perturbation network based on machine learning 技术领域technical field
本发明属于分子动力学模拟技术领域,具体涉及一种基于机器学习的自由能微扰网络设计方法。The invention belongs to the technical field of molecular dynamics simulation, in particular to a method for designing a free energy perturbation network based on machine learning.
背景技术Background technique
小分子药物与靶点蛋白的结合自由能(△G),对于小分子药物的设计有着十分重要的指导作用。自由能微扰方法(free energy perturbation,FEP)作为一种基于分子动力学(molecular dynamics,MD)的计算方法,能够对于结合自由能进行预测。当预测任务涉及多个小分子时,自由能微扰网络的设计十分必要,能够有效的提高预测的精度。设计的自由能微扰网络图中,每个节点代表小分子,而每条边代表两个小分子之间结合自由能的差值(△△G)。在网络的设计过程中,核心问题是判断两个小分子是否应该连接,使得这条边计算得到的△△G不确定性(std)最小。现有设计方法大多按照下述原则进行判断,以确定两个小分子是否应该连接:The binding free energy (ΔG) of small molecule drugs and target proteins plays a very important role in guiding the design of small molecule drugs. As a calculation method based on molecular dynamics (MD), free energy perturbation (FEP) can predict binding free energy. When the prediction task involves multiple small molecules, the design of the free energy perturbation network is very necessary, which can effectively improve the prediction accuracy. In the designed free energy perturbation network diagram, each node represents a small molecule, and each edge represents the difference (△△G) of binding free energy between two small molecules. In the design process of the network, the core problem is to judge whether two small molecules should be connected, so that the uncertainty (std) of △△G calculated by this edge is minimized. Most of the existing design methods make judgments according to the following principles to determine whether two small molecules should be linked:
(1)基于经验的人工判断;(1) Manual judgment based on experience;
(2)基于谷本相似系数(Tanimoto similarity score)判断。(2) Judgment based on Tanimoto similarity score.
现有方法主要存在以下问题The existing methods mainly have the following problems
1、基于经验的人工判断:需要计算的小分子数目为n时,所有能够连接的边总数,即可以进行FEP计算的分子对总数为n(n-1)/2。随着小分子数目的增加,需要进行判断的边数会迅速增加。这种情况下几乎不可能通过人工的方法进行识别判断。1. Manual judgment based on experience: When the number of small molecules to be calculated is n, the total number of edges that can be connected, that is, the total number of molecule pairs that can be calculated by FEP is n(n-1)/2. As the number of small molecules increases, the number of edges that need to be judged increases rapidly. In this case, it is almost impossible to identify and judge by manual methods.
2、基于Tanimoto similarity score判断:使用这一指标时,通常尽量将相似的小分子(Tanimoto similarity score越接近1,两个小分子越相似)进行连接。相似系数是基于分子指纹进行计算,考虑的小分子的特征十分有限。同时,按照这种方法判断得到的相似的分子,并不能保证计算得到的△△G不确定性小。2. Judgment based on Tanimoto similarity score: When using this indicator, usually try to connect similar small molecules (the closer the Tanimoto similarity score is to 1, the more similar the two small molecules are). Similarity coefficients are calculated based on molecular fingerprints, considering very limited characteristics of small molecules. At the same time, similar molecules judged by this method cannot guarantee that the uncertainty of the calculated △△G is small.
发明内容SUMMARY OF THE INVENTION
针对上述技术问题,本发明的目的在于提供一种基于机器学习的自由能微扰网络设计方法,利用大量△△G的计算结果,使用机器学习的方法训练模型,更加快捷的设计自由能微扰网络,提高计算精度。In view of the above technical problems, the purpose of the present invention is to provide a free energy perturbation network design method based on machine learning, using a large number of calculation results of △△G, using the method of machine learning to train the model, and designing free energy perturbation more quickly. network to improve computational accuracy.
为实现上述目的,本发明提供如下技术方案:To achieve the above object, the present invention provides the following technical solutions:
基于机器学习的自由能微扰网络设计方法,包括以下步骤:The design method of free energy perturbation network based on machine learning includes the following steps:
S1、准备计算所需的小分子数据集;S1. Prepare the small molecule data set required for the calculation;
S2、准备小分子/蛋白质输入文件;S2. Prepare small molecule/protein input files;
S3、利用FEP计算不同小分子对之间的△△G及std;S3. Use FEP to calculate △△G and std between different small molecule pairs;
S4、提取小分子的特征描述符;S4. Extract feature descriptors of small molecules;
S5、准备训练集和测试集;S5, prepare training set and test set;
S6、构建机器学习模型;S6. Build a machine learning model;
S7、训练机器学习模型;S7. Train the machine learning model;
S8、测试集统计误差。S8, test set statistical error.
具体包括以下步骤:Specifically include the following steps:
S1、准备计算所需的小分子数据集:准备数据集时保证体系的多样性,以免出现模型对于部分体系的过拟合;S1. Prepare the small molecule data set required for the calculation: ensure the diversity of the system when preparing the data set, so as to avoid overfitting of the model to some systems;
S2、准备小分子/蛋白质输入文件:根据FEP计算的需求,生成用于FEP计算的初始文件;S2. Prepare small molecule/protein input files: According to the requirements of FEP calculation, generate an initial file for FEP calculation;
S3、利用FEP计算不同小分子对之间的△△G及std:设计小分子之间必须的分子对,利用FEP计算多次△△G结果,进而得到对应的std值;S3. Use FEP to calculate △△G and std between different small molecule pairs: design the necessary molecular pairs between small molecules, use FEP to calculate the results of △△G multiple times, and then obtain the corresponding std value;
S4、提取小分子的特征描述符:提取小分子的二维结构特征描述符;S4. Extract feature descriptors of small molecules: extract the two-dimensional structure feature descriptors of small molecules;
S5、准备训练集和测试集:收集FEP计算得到的分子对的std结果及对应小分子的二维特征描述符,并将收集到的数据按照一定比例划分为训练集和测试集;S5. Prepare training set and test set: collect std results of molecule pairs calculated by FEP and two-dimensional feature descriptors of corresponding small molecules, and divide the collected data into training set and test set according to a certain proportion;
S6、构建机器学习模型:将得到的小分子的二维描述符作为输入,分子对的std结果作为输出构建机器学习模型;S6. Build a machine learning model: use the obtained two-dimensional descriptor of the small molecule as input, and the std result of the molecule pair as output to build a machine learning model;
S7、训练机器学习模型:选取适当的参数对于模型进行训练,根据不同类型的机器学习模型设置不同的参数;S7. Train the machine learning model: select appropriate parameters to train the model, and set different parameters according to different types of machine learning models;
S8、测试集统计误差:训练完成后在测试集上统计误差,根据统计的误差对于模型参数进行优化,得到最佳的模型。S8. Statistical error of the test set: after the training is completed, the error is counted on the test set, and the model parameters are optimized according to the statistical error to obtain the best model.
其中,步骤S4中,所述的小分子的二维结构特征描述符,包括分子质量、拓扑连接信息、柔性二面角数量。Wherein, in step S4, the two-dimensional structure feature descriptor of the small molecule includes molecular mass, topological connection information, and the number of flexible dihedral angles.
与现有技术相比,本发明的有益效果是:Compared with the prior art, the beneficial effects of the present invention are:
1、自动化设计微扰网络1. Automatically design perturbation network
相对于人工设计微扰网络的方法,本方法能够处理大量小分子结合自由能需要计算预测的场景,能够快速的设计出需要的微扰网络;Compared with the method of manually designing the perturbation network, this method can handle a large number of scenarios where the binding free energy of small molecules needs to be calculated and predicted, and can quickly design the required perturbation network;
2、提高自由能微扰的计算精度2. Improve the calculation accuracy of free energy perturbation
相对于基于Tanimoto similarity score的方法,本方法得到的结果与std的相关性更高,进而能够有效的提高计算精度。Compared with the method based on Tanimoto similarity score, the results obtained by this method have higher correlation with std, which can effectively improve the calculation accuracy.
3、易于拓展3. Easy to expand
当计算过程确定之后,随着计算的分子数量逐渐增加。能够收集到更多的数据用于模型的训练,提高模型的泛化能力和精度。After the calculation process is determined, the number of molecules is gradually increased with the calculation. More data can be collected for model training, and the generalization ability and accuracy of the model can be improved.
附图说明Description of drawings
图1为本发明的流程图;Fig. 1 is the flow chart of the present invention;
图2为实施例Tanimoto similarity score和std的相关性分析结果;Fig. 2 is the correlation analysis result of embodiment Tanimoto similarity score and std;
图3为实施例RFscore与std的相关性分析结果。Fig. 3 is the correlation analysis result between RFscore and std of the embodiment.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
按照图1所示的流程图,本实施例选取8个激酶体系共200个小分子,设计300个分子对,计算5次△△G的std作为模型的输出。According to the flowchart shown in FIG. 1 , in this example, a total of 200 small molecules from 8 kinase systems were selected, 300 molecular pairs were designed, and the std of ΔΔG was calculated 5 times as the output of the model.
比较Tanimoto similarity score和std的相关性,如图2所示,可见,两者的相关性很弱,肯德尔相关系数(Kendall rank correlation coefficient)为-0.113。显然,通过这一标准构建的微扰网络将引进比较大的不确定性。Comparing the correlation between Tanimoto similarity score and std, as shown in Figure 2, it can be seen that the correlation between the two is very weak, and the Kendall rank correlation coefficient (Kendall rank correlation coefficient) is -0.113. Obviously, the perturbation network constructed by this standard will introduce relatively large uncertainty.
在本实施例中,提取各小分子的二维特征值,每个小分子有77个特征值。并通过按照7:3的比例划分训练集和测试集。选择随机森林作为本实例的机器学习模型。同时,对于最大特征数、决策树最大深度、内部节点在划分所需最小样本数、叶节点最小样本数等多个模型参数的不同组合,得到最佳的随机森林模型。利用该模型在训练集上得到误差为0.14,在测试集上得到的误差为0.31.同时,利用现有模型得到的RF score与前述Tanimoto similarity score进行同样的相关性结果分析,如图3所示。得到的肯德尔相关系数为0.41。In this embodiment, two-dimensional eigenvalues of each small molecule are extracted, and each small molecule has 77 eigenvalues. And by dividing the training set and the test set according to the ratio of 7:3. Random Forest is chosen as the machine learning model for this example. At the same time, for different combinations of multiple model parameters, such as the maximum number of features, the maximum depth of the decision tree, the minimum number of samples required for internal node division, and the minimum number of samples of leaf nodes, the best random forest model is obtained. Using this model, the error obtained on the training set is 0.14, and the error obtained on the test set is 0.31. At the same time, the RF score obtained by using the existing model and the aforementioned Tanimoto similarity score are analyzed for the same correlation results, as shown in Figure 3 . The resulting Kendall correlation coefficient was 0.41.
由此可见,该方法得到的结果能够对于大量小分子进行自由能微扰网络设计,同时相对于Tanimoto similarity score方法能够提高精度。It can be seen that the results obtained by this method can be used to design a free energy perturbation network for a large number of small molecules, and at the same time, the accuracy can be improved compared with the Tanimoto similarity score method.
尽管已经示出和描述了本发明的实施例,对于本领域的普通技术人员而言,可以理解在不脱离本发明的原理和精神的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由所附权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, and substitutions can be made in these embodiments without departing from the principle and spirit of the invention and modifications, the scope of the invention is defined by the appended claims and their equivalents.

Claims (3)

  1. 基于机器学习的自由能微扰网络设计方法,其特征在于,包括以下步骤:The design method of free energy perturbation network based on machine learning is characterized in that it includes the following steps:
    S1、准备计算所需的小分子数据集;S1. Prepare the small molecule data set required for the calculation;
    S2、准备小分子/蛋白质输入文件;S2. Prepare small molecule/protein input files;
    S3、利用FEP计算不同小分子对之间的△△G及std;S3. Use FEP to calculate △△G and std between different small molecule pairs;
    S4、提取小分子的特征描述符;S4. Extract feature descriptors of small molecules;
    S5、准备机器学习模型所需的训练集和测试集;S6、构建机器学习模型;S5, prepare the training set and test set required for the machine learning model; S6, build the machine learning model;
    S7、训练机器学习模型;S7. Train the machine learning model;
    S8、测试集统计误差。S8, test set statistical error.
  2. 根据权利要求1所述的基于机器学习的自由能微扰网络设计方法,其特征在于,具体包括以下步骤:The method for designing a free energy perturbation network based on machine learning according to claim 1, characterized in that it specifically comprises the following steps:
    S1、准备计算所需的小分子数据集:准备数据集时保证体系的多样性,以免出现模型对于部分体系的过拟合;S1. Prepare the small molecule data set required for the calculation: ensure the diversity of the system when preparing the data set, so as to avoid overfitting of the model to some systems;
    S2、准备小分子/蛋白质输入文件:根据FEP计算的需求,生成用于FEP计算的初始文件;S2. Prepare small molecule/protein input files: According to the requirements of FEP calculation, generate an initial file for FEP calculation;
    S3、利用FEP计算不同小分子对之间的△△G及std:设计小分子之间必须的分子对,利用FEP计算多次△△G结果,进而得到对应的std值;S3. Use FEP to calculate △△G and std between different small molecule pairs: design the necessary molecular pairs between small molecules, use FEP to calculate the results of △△G multiple times, and then obtain the corresponding std value;
    S4、提取小分子的特征描述符:提取小分子的二维结构特征描述符;S4. Extract feature descriptors of small molecules: extract the two-dimensional structure feature descriptors of small molecules;
    S5、准备机器学习模型所需的训练集和测试集:收集FEP计算得到的分子对的std结果及对应小分子的二维特征描述符,并将收集到的数据按照一定比例划分为训练集和测试集;S5. Prepare the training set and test set required for the machine learning model: collect the std results of the molecule pairs calculated by FEP and the two-dimensional feature descriptors of the corresponding small molecules, and divide the collected data into training sets and test set;
    S6、构建机器学习模型:将得到的小分子的二维描述符作为输入,分子对的std结果作为输出构建机器学习模型;S6. Build a machine learning model: use the obtained two-dimensional descriptors of small molecules as input, and the std results of the molecule pairs as output to build a machine learning model;
    S7、训练机器学习模型:选取适当的参数对于模型进行训练,根据不同类型的机器学习模型设置不同的参数;S7. Train the machine learning model: select appropriate parameters to train the model, and set different parameters according to different types of machine learning models;
    S8、测试集统计误差:训练完成后在测试集上统计误差,根据统计的误差对于模型参数进行优化,得到最佳的模型。S8. Statistical error of the test set: after the training is completed, the error is counted on the test set, and the model parameters are optimized according to the statistical error to obtain the best model.
  3. 根据权利要求2所述的基于机器学习的自由能微扰网络设计方法,其特征在于,步骤S4中,所述的小分子的二维结构特征描述符,包括分子质量、拓扑连接信息、柔性二面角数量。The method for designing a free energy perturbation network based on machine learning according to claim 2, wherein in step S4, the two-dimensional structure feature descriptor of the small molecule includes molecular mass, topological connection information, flexibility two Number of face angles.
PCT/CN2020/120845 2020-10-14 2020-10-14 Free energy perturbation network design method based on machine learning WO2022077258A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/120845 WO2022077258A1 (en) 2020-10-14 2020-10-14 Free energy perturbation network design method based on machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/120845 WO2022077258A1 (en) 2020-10-14 2020-10-14 Free energy perturbation network design method based on machine learning

Publications (1)

Publication Number Publication Date
WO2022077258A1 true WO2022077258A1 (en) 2022-04-21

Family

ID=81207574

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/120845 WO2022077258A1 (en) 2020-10-14 2020-10-14 Free energy perturbation network design method based on machine learning

Country Status (1)

Country Link
WO (1) WO2022077258A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101059520A (en) * 2007-05-29 2007-10-24 南京大学 Organic ER affinity quick screening and forecast method based on receptor binding mode
WO2019040444A1 (en) * 2017-08-22 2019-02-28 Schrodinger, Inc. Methods and systems for calculating free energy differences using an alchemical restraint potential
CN110728047A (en) * 2019-10-08 2020-01-24 中国工程物理研究院化工材料研究所 Computer-aided design system for predicting energetic molecules based on machine learning performance
EP3614388A1 (en) * 2018-08-20 2020-02-26 Fujitsu Limited Method and device for calculating binding free energy, and program
CN110910951A (en) * 2019-11-19 2020-03-24 江苏理工学院 Method for predicting protein and ligand binding free energy based on progressive neural network
CN111279419A (en) * 2017-10-17 2020-06-12 富士胶片株式会社 Feature amount calculation method, feature amount calculation program, feature amount calculation device, screening method, screening program, screening device, compound creation method, compound creation program, and compound creation device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101059520A (en) * 2007-05-29 2007-10-24 南京大学 Organic ER affinity quick screening and forecast method based on receptor binding mode
WO2019040444A1 (en) * 2017-08-22 2019-02-28 Schrodinger, Inc. Methods and systems for calculating free energy differences using an alchemical restraint potential
CN111279419A (en) * 2017-10-17 2020-06-12 富士胶片株式会社 Feature amount calculation method, feature amount calculation program, feature amount calculation device, screening method, screening program, screening device, compound creation method, compound creation program, and compound creation device
EP3614388A1 (en) * 2018-08-20 2020-02-26 Fujitsu Limited Method and device for calculating binding free energy, and program
CN110728047A (en) * 2019-10-08 2020-01-24 中国工程物理研究院化工材料研究所 Computer-aided design system for predicting energetic molecules based on machine learning performance
CN110910951A (en) * 2019-11-19 2020-03-24 江苏理工学院 Method for predicting protein and ligand binding free energy based on progressive neural network

Similar Documents

Publication Publication Date Title
TWI705388B (en) Feature selection method, device and equipment for constructing machine learning model
CN112364880B (en) Omics data processing method, device, equipment and medium based on graph neural network
Xiao et al. Towards confidence in the truth: A bootstrapping based truth discovery approach
CN102768659B (en) Method and system for identifying repeated account
CN108806718B (en) Audio identification method based on analysis of ENF phase spectrum and instantaneous frequency spectrum
CN104538035B (en) A kind of method for distinguishing speek person and system based on Fisher super vectors
CN103761965B (en) A kind of sorting technique of instrument signal
CN111429977B (en) Novel molecular similarity search algorithm based on attention of graph structure
CN109656545A (en) A kind of software development activity clustering method based on event log
Gupta et al. Will I get in? modeling the graduate admission process for American universities
CN116561542B (en) Model optimization training system, method and related device
CN106127244A (en) A kind of parallelization K means improved method and system
CN110491443B (en) lncRNA protein correlation prediction method based on projection neighborhood non-negative matrix decomposition
CN114783539A (en) Traditional Chinese medicine component analysis method and system based on spectral clustering
CN114036276A (en) Information extraction method, device, equipment and storage medium
WO2022077258A1 (en) Free energy perturbation network design method based on machine learning
CN106874286B (en) Method and device for screening user characteristics
CN106911512A (en) Link Forecasting Methodology and system based on game in commutative figure
CN112330368B (en) Data processing method, system, storage medium and terminal equipment
CN110825583A (en) Energy efficiency qualitative assessment technology for multi-index fusion of cloud data center
CN112102889B (en) Free energy perturbation network design method based on machine learning
CN105447158A (en) Graph based automatic mining method for synonym set in patent search log
CN115438153A (en) Training method and device for intention matching degree analysis model
CN115472181A (en) Method, device and storage medium for singing recognition based on feature fusion and clustering
CN110298545B (en) Patent evaluation method, system and medium based on neural network

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20957039

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC

122 Ep: pct application non-entry in european phase

Ref document number: 20957039

Country of ref document: EP

Kind code of ref document: A1