CN111798934A - Molecular property prediction method based on graph neural network - Google Patents

Molecular property prediction method based on graph neural network

Info

Publication number
CN111798934A
Authority
CN
China
Prior art keywords
information
module
vector
graph
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010578320.0A
Other languages
Chinese (zh)
Other versions
CN111798934B (en)
Inventor
蔡翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Puyi Intelligent Medical Technology Co ltd
Original Assignee
Suzhou Puyi Intelligent Medical Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Puyi Intelligent Medical Technology Co ltd
Priority to CN202010578320.0A
Publication of CN111798934A
Application granted granted Critical
Publication of CN111798934B
Active legal-status: Current
Anticipated expiration legal-status

Classifications

    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16C COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C 20/00 Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C 20/50 Molecular design, e.g. of drugs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/08 Learning methods
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16C COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C 20/00 Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C 20/70 Machine learning, data mining or chemometrics
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16C COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C 60/00 Computational materials science, i.e. ICT specially adapted for investigating the physical or chemical properties of materials or phenomena associated with their design, synthesis, processing, characterisation or utilisation

Abstract

The invention provides a molecular property prediction method based on a graph neural network, comprising a training process and an application process based on the training process. A chemical molecular formula is converted into a directed-graph or undirected-graph structure for storage, and the connection information between all nodes is described and stored with a computer data structure. An embedding module turns the discretized data into continuous values; the graph structure and the node embedding vectors then enter an information-transfer module together to obtain fused information, which is sent to a readout module for further extraction, and the result is finally sent to a judgment module for prediction. The beneficial effects of the invention are: the graph neural network is applied to molecular property prediction, thereby enabling large-scale molecular screening.

Description

Molecular property prediction method based on graph neural network
Technical Field
The invention belongs to the technical field of computer application, and particularly relates to a molecular property prediction method based on a graph neural network.
Background
In the field of biomedical research and development, the molecular screening process is a major determinant of the cost of drug research and development.
Traditional drug-molecule screening is built on a complex framework of many manually designed, tedious steps, each of which requires considerable engineering effort and expert experience. Faced with complex molecular structure information and large-scale drug-molecule screening, traditional methods alone cannot achieve the desired effect.
In view of this, how to analyze complex molecular structure information, predict molecular properties, and thereby enable large-scale drug-molecule screening is a pressing problem.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a molecular property prediction method based on a graph neural network.
The purpose of the invention is realized by the following technical scheme:
a molecular property prediction method based on a graph neural network is applied to prediction of molecular properties.
Preferably, the molecular property prediction method based on the graph neural network comprises a training process and an application process based on the training process, wherein the training process comprises the following steps,
S1, converting the chemical molecular formula into an undirected-graph or directed-graph structure; the connection information between all nodes of the converted structure is described and stored using a computer data structure, and every piece of stored information is represented by an integer that serves as its ID (identifier);
S2, sending the information stored in S1 to an embedding-vector module, which converts each received ID into an embedding vector that the computer can operate on, i.e. the process of converting IDs into embedding vectors;
S3, passing the information stored after conversion in S1 and the embedding-vector information from S2 to a transfer module for fusion, the information-transfer module fusing the node information with the graph connection information to form intermediate hidden-layer information;
S4, sending the intermediate hidden-layer information formed by the fusion in S3 to a readout module, which further abstracts and fuses the information into a vector of fixed dimension;
S5, sending the vector obtained in S4 to a judgment module, which analyzes the vector to obtain a judgment result;
S6, combining the network output judged in S5 with the standard answer to form a loss function that measures the quality of the network output; after the loss function is computed, the parameters are updated with a gradient backpropagation algorithm; steps S1-S6 are repeated and the parameters are updated continuously until the loss function falls below a set threshold, indicating that network training is finished.
Preferably, the application process comprises the following step:
S7, converting any new molecular structure into an undirected-graph or directed-graph structure and passing it, in turn, through the embedding-vector expression module, the information-transfer module, the vector-readout module and the judgment module, finally obtaining the answer given by the network.
Preferably, the fusion performed by the transfer module in S3 includes word embedding, stacked multi-layer long short-term memory (LSTM) and/or convolutional layers, and fully connected nonlinear transformations; the node information and the structural connection information described in the graph structure are fused in turn by cyclically traversing these layers.
The neural network layers used in the transfer module may be any mixture of long short-term memory (LSTM) layers, convolutional layers and residual structures; of course, the specific composition of the neural network layers can be adjusted according to the actual application.
Preferably, the vector in S4 is a readout vector whose dimension can be set freely according to the actual situation.
Preferably, the processing steps of the readout module in S4 include matrix concatenation and addition.
Preferably, the judgment step in S6 includes multi-layer nonlinear transformations and a linear transformation.
Preferably, the loss function in S6 is chosen according to whether the task is a classification problem or a regression problem: a classification problem uses cross entropy, while a regression problem uses least squares (mean squared error) or mean absolute error.
The beneficial effects of the invention are: the chemical molecular formula is converted into an undirected-graph or directed-graph structure, features are extracted and analyzed with a neural network, and the graph neural network is thereby applied to molecular property prediction, enabling large-scale molecular screening.
Drawings
FIG. 1 is a schematic diagram of the framework of the molecular property prediction method based on a graph neural network according to the invention.
Detailed Description
The invention discloses a molecular property prediction method based on a graph neural network. With reference to FIG. 1, the method comprises a training process and an application process based on the training process; the training process comprises the following steps.
S1, converting the chemical molecular formula into an undirected-graph or directed-graph structure; the connection information between all nodes of the converted structure is described and stored using a computer data structure, and every piece of stored information is represented by an integer that serves as its ID (identifier);
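By way of illustration of step S1 only (the patent does not prescribe a toolkit, an input format or a node-ID scheme; the use of RDKit, SMILES input and atomic numbers as IDs below are assumptions of this sketch), a molecule can be stored as a list of integer node IDs plus an edge list:

    # Minimal sketch of step S1: parse a molecule and store it as integer node IDs
    # (here: atomic numbers) plus connection information (one index pair per bond).
    from rdkit import Chem

    def molecule_to_graph(smiles: str):
        mol = Chem.MolFromSmiles(smiles)
        # Every piece of stored information is represented by an integer ID.
        node_ids = [atom.GetAtomicNum() for atom in mol.GetAtoms()]
        # Connection information: one (source, target) pair per bond.
        edges = [(b.GetBeginAtomIdx(), b.GetEndAtomIdx()) for b in mol.GetBonds()]
        # For an undirected graph, store each bond in both directions.
        edges = edges + [(j, i) for (i, j) in edges]
        return node_ids, edges

    node_ids, edges = molecule_to_graph("CCO")  # ethanol: three heavy atoms, two bonds

For ethanol this sketch yields the node IDs [6, 6, 8] and the edge list [(0, 1), (1, 2), (1, 0), (2, 1)].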
S2, sending the information stored in S1 to an embedding-vector module, which converts each received ID into an embedding vector that the computer can operate on, i.e. the process of converting IDs into embedding vectors;
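A minimal sketch of the embedding module of step S2; PyTorch, a vocabulary of 119 IDs (the chemical elements plus a padding slot) and an embedding dimension of 64 are illustrative assumptions, not choices made by the patent:

    import torch
    import torch.nn as nn

    # The embedding module maps each discrete integer ID to a dense, continuous vector.
    embedding = nn.Embedding(num_embeddings=119, embedding_dim=64)
    node_ids = torch.tensor([6, 6, 8])      # IDs produced in step S1 (ethanol)
    node_vectors = embedding(node_ids)      # shape (3, 64): trainable continuous values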
S3, passing the information stored after conversion in S1 and the embedding-vector information from S2 to a transfer module for fusion, the information-transfer module fusing the node information with the graph connection information to form intermediate hidden-layer information. The fusion performed by the transfer module includes word embedding, stacked multi-layer long short-term memory (LSTM) and/or convolutional layers, and fully connected nonlinear transformations; the node information and the structural connection information described in the graph structure are fused in turn by cyclically traversing these layers. The main aim is to fuse the many peer pieces of node information and connection information at the level of their vector representations through repeated rounds of message passing. The advantage of multiple passes is that the network performs deep, continuous abstraction of the information, which gives the neural network strong generalization ability.
The neural network layers used in the transfer module may be any mixture of long short-term memory (LSTM) layers, convolutional layers and residual structures; of course, the specific composition of the neural network layers can be adjusted according to the actual application.
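The sketch below shows one possible form of the information-transfer (message-passing) module of step S3; a GRU-style update, a single fully connected message function and three passes are assumptions of this sketch, whereas the patent only requires that node and connection information be fused by cyclically traversing some mixture of LSTM, convolutional, residual and fully connected layers:

    import torch
    import torch.nn as nn

    class MessagePassing(nn.Module):
        # Illustrative information-transfer module: node vectors are fused with the
        # graph connection information over several cyclic passes.
        def __init__(self, dim: int = 64, num_passes: int = 3):
            super().__init__()
            self.num_passes = num_passes
            self.message_fn = nn.Sequential(nn.Linear(dim, dim), nn.ReLU())
            self.update_fn = nn.GRUCell(dim, dim)

        def forward(self, node_vectors: torch.Tensor, edges: list) -> torch.Tensor:
            h = node_vectors
            src = torch.tensor([i for i, _ in edges])
            dst = torch.tensor([j for _, j in edges])
            for _ in range(self.num_passes):        # cyclic traversal of the layers
                messages = self.message_fn(h[src])   # nonlinear transform of sender states
                agg = torch.zeros_like(h)
                agg.index_add_(0, dst, messages)     # fuse messages along the edges
                h = self.update_fn(agg, h)           # update node states
            return h                                 # intermediate hidden-layer information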
S4, sending the intermediate hidden-layer information formed by the fusion in S3 to a readout module, which further abstracts and fuses the information into a vector of fixed dimension. This vector is the readout vector, and its dimension can be set freely according to the actual situation. The processing steps of the readout module include matrix concatenation and addition.
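A sketch of the readout module of step S4; the sum and max pooling and the output dimension of 128 are assumptions of this sketch, since the patent only requires matrix concatenation and addition producing a vector of fixed, freely chosen dimension:

    import torch
    import torch.nn as nn

    class Readout(nn.Module):
        # Illustrative readout: node states are pooled by addition, concatenated and
        # projected to a readout vector of fixed dimension.
        def __init__(self, dim: int = 64, out_dim: int = 128):
            super().__init__()
            self.project = nn.Linear(2 * dim, out_dim)

        def forward(self, h: torch.Tensor) -> torch.Tensor:
            summed = h.sum(dim=0)                 # addition over all node vectors
            pooled = h.max(dim=0).values          # a second pooled view of the graph
            merged = torch.cat([summed, pooled])  # matrix (vector) concatenation
            return self.project(merged)           # fixed-dimension readout vector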
S5, sending the vector obtained in S4 to a judgment module, which analyzes the vector to obtain a judgment result; the data type of the result is determined by the specific application, and the module can serve classification as well as regression.
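A sketch of the judgment module of step S5 as multi-layer nonlinear transformations followed by a final linear transformation; the layer widths and the single regression output are illustrative assumptions (an n-class classification task would end in n outputs instead):

    import torch.nn as nn

    predictor = nn.Sequential(
        nn.Linear(128, 64), nn.ReLU(),   # nonlinear transformations
        nn.Linear(64, 32), nn.ReLU(),
        nn.Linear(32, 1),                # final linear transformation: the predicted property
    )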
S6, combining the network output judged in S5 with the standard answer to form a loss function that measures the quality of the network output; after the loss function is computed, the parameters are updated with a gradient backpropagation algorithm; steps S1-S6 are repeated and the parameters are updated continuously until the loss function falls below a set threshold, indicating that network training is finished. The judgment step includes multi-layer nonlinear transformations and a linear transformation. The loss function is chosen according to whether the task is a classification or a regression problem: a classification problem uses cross entropy, while a regression problem uses least squares (mean squared error) or mean absolute error.
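A sketch of the training loop of step S6, assuming PyTorch, an Adam optimizer, a mean-squared-error loss for a regression task (cross entropy would replace it for classification) and arbitrary values for the learning rate and the stopping threshold; model stands for the chained embedding, message-passing, readout and judgment modules, and graphs/labels for a prepared training set:

    import torch
    import torch.nn as nn

    def train(model, graphs, labels, threshold: float = 0.01, max_epochs: int = 100):
        loss_fn = nn.MSELoss()  # regression; nn.CrossEntropyLoss() for classification
        optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
        for epoch in range(max_epochs):
            total = 0.0
            for graph, target in zip(graphs, labels):
                prediction = model(graph)           # forward pass through steps S1-S5
                loss = loss_fn(prediction, target)  # compare output with the standard answer
                optimizer.zero_grad()
                loss.backward()                     # gradient backpropagation
                optimizer.step()                    # parameter update
                total += loss.item()
            if total / len(graphs) < threshold:     # loss below the set threshold:
                break                               # network training is finished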
The application process comprises the following step:
S7, converting any new molecular structure into an undirected-graph or directed-graph structure and passing it, in turn, through the embedding-vector expression module, the information-transfer module, the vector-readout module and the judgment module, finally obtaining the answer given by the network.
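A sketch of the application process of step S7, chaining the illustrative modules from the sketches above; the names molecule_to_graph, MessagePassing, Readout, embedding and predictor are this sketch's own and do not appear in the patent:

    import torch

    message_passing, readout = MessagePassing(), Readout()

    node_ids, edges = molecule_to_graph("c1ccccc1O")   # any new molecular structure (here phenol)
    h = embedding(torch.tensor(node_ids))              # embedding-vector expression module
    h = message_passing(h, edges)                      # information-transfer module
    vector = readout(h)                                # vector-readout module
    answer = predictor(vector)                         # judgment module: the answer given by the network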
The graph neural network is a new artificial intelligence technique: input data are described as a directed-graph or undirected-graph structure, and a deep neural network fits the complex data distribution so as to reach the final goal. Compared with traditional methods, deep neural networks offer big-data analysis, autonomous learning, parallel computation, low cost and similar advantages; introducing them to analyze complex molecular structure information and to predict molecular properties therefore makes large-scale drug-molecule screening achievable.
The above embodiments are intended only to illustrate the technical solution of the invention, not to limit it. Although the invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that the technical solutions described in the foregoing embodiments may still be modified, or some of their technical features equivalently replaced, and that such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the invention.

Claims (8)

1. A molecular property prediction method based on a graph neural network, characterized in that a graph neural network is applied to the prediction of molecular properties.
2. The method of claim 1, characterized in that it comprises a training process and an application process based on the training process, the training process comprising the following steps:
S1, converting the chemical molecular formula into an undirected-graph or directed-graph structure; the connection information between all nodes of the converted structure is described and stored using a computer data structure, and every piece of stored information is represented by an integer that serves as its ID (identifier);
S2, sending the information stored in S1 to an embedding-vector module, which converts each received ID into an embedding vector that the computer can operate on, i.e. the process of converting IDs into embedding vectors;
S3, passing the information stored after conversion in S1 and the embedding-vector information from S2 to a transfer module for fusion, the information-transfer module fusing the node information with the graph connection information to form intermediate hidden-layer information;
S4, sending the intermediate hidden-layer information formed by the fusion in S3 to a readout module, which further abstracts and fuses the information into a vector of fixed dimension;
S5, sending the vector obtained in S4 to a judgment module, which analyzes the vector to obtain a judgment result;
S6, combining the network output judged in S5 with the standard answer to form a loss function that measures the quality of the network output; after the loss function is computed, the parameters are updated with a gradient backpropagation algorithm; steps S1-S6 are repeated and the parameters are updated continuously until the loss function falls below a set threshold, indicating that network training is finished.
3. The method of claim 1, characterized in that the application process comprises the following step:
S7, converting any new molecular structure into an undirected-graph or directed-graph structure and passing it, in turn, through the embedding-vector expression module, the information-transfer module, the vector-readout module and the judgment module, finally obtaining the answer given by the network.
4. The method of claim 2, characterized in that the fusion performed by the transfer module in S3 includes word embedding, stacked multi-layer long short-term memory (LSTM) and/or convolutional layers, and fully connected nonlinear transformations, and that the node information and the structural connection information described in the graph structure are fused in turn by cyclically traversing these layers.
5. The method of claim 2, characterized in that the vector in S4 is a readout vector whose dimension can be set freely according to the actual situation.
6. The method of claim 2, characterized in that the processing steps of the readout module in S4 include matrix concatenation and addition.
7. The method of claim 2, characterized in that the judgment step in S6 includes multi-layer nonlinear transformations and a linear transformation.
8. The method of claim 2, characterized in that the loss function in S6 is chosen according to whether the task is a classification problem or a regression problem, the classification problem using cross entropy and the regression problem using least squares or mean absolute error.
CN202010578320.0A (priority date 2020-06-23, filing date 2020-06-23) Molecular property prediction method based on graph neural network; status: Active; granted as CN111798934B

Priority Applications (1)

Application Number: CN202010578320.0A (granted as CN111798934B); Priority Date: 2020-06-23; Filing Date: 2020-06-23; Title: Molecular property prediction method based on graph neural network

Applications Claiming Priority (1)

Application Number: CN202010578320.0A (granted as CN111798934B); Priority Date: 2020-06-23; Filing Date: 2020-06-23; Title: Molecular property prediction method based on graph neural network

Publications (2)

Publication Number Publication Date
CN111798934A 2020-10-20
CN111798934B (en) 2023-11-14

Family

ID=72804579

Family Applications (1)

Application Number: CN202010578320.0A (Active, granted as CN111798934B); Title: Molecular property prediction method based on graph neural network

Country Status (1)

Country Link
CN (1) CN111798934B (en)

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200027528A1 (en) * 2017-09-12 2020-01-23 Massachusetts Institute Of Technology Systems and methods for predicting chemical reactions
US20190272468A1 (en) * 2018-03-05 2019-09-05 The Board Of Trustees Of The Leland Stanford Junior University Systems and Methods for Spatial Graph Convolutions with Applications to Drug Discovery and Molecular Simulation
WO2020020088A1 (en) * 2018-07-23 2020-01-30 第四范式(北京)技术有限公司 Neural network model training method and system, and prediction method and system
CN110009093A (en) * 2018-12-07 2019-07-12 阿里巴巴集团控股有限公司 For analyzing the nerve network system and method for relational network figure
CN110348573A (en) * 2019-07-16 2019-10-18 腾讯科技(深圳)有限公司 The method of training figure neural network, figure neural network unit, medium
CN110782015A (en) * 2019-10-25 2020-02-11 腾讯科技(深圳)有限公司 Training method and device for network structure optimizer of neural network and storage medium
CN110993037A (en) * 2019-10-28 2020-04-10 浙江工业大学 Protein activity prediction device based on multi-view classification model
CN111223532A (en) * 2019-11-14 2020-06-02 腾讯科技(深圳)有限公司 Method, apparatus, device, medium for determining a reactant of a target compound
CN110909868A (en) * 2019-12-04 2020-03-24 支付宝(杭州)信息技术有限公司 Node representation method and device based on graph neural network model
CN110970099A (en) * 2019-12-10 2020-04-07 北京大学 Medicine molecule generation method based on regularization variational automatic encoder
CN111063398A (en) * 2019-12-20 2020-04-24 吉林大学 Molecular discovery method based on graph Bayesian optimization

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LI Shuyi (李淑怡): "Prediction of drug-drug interactions based on signed graph convolutional networks", Modern Computer (《现代计算机》), pages 12-15 *
ZHENG Ruigang (郑睿刚): "Research progress of graph convolution algorithms", Journal of Sun Yat-sen University (Natural Science Edition) (《中山大学学报(自然科学版)》), vol. 59, no. 2, pages 1-13 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022226940A1 (en) * 2021-04-29 2022-11-03 Huawei Cloud Computing Technologies Co., Ltd. Method and system for generating task-relevant structural embeddings from molecular graphs
CN113241130A (en) * 2021-06-08 2021-08-10 西南交通大学 Molecular structure prediction method based on graph convolution network
CN113241130B (en) * 2021-06-08 2022-04-22 西南交通大学 Molecular structure prediction method based on graph convolution network

Also Published As

Publication number Publication date
CN111798934B (en) 2023-11-14

Similar Documents

Publication Publication Date Title
CN108874914B (en) Information recommendation method based on graph convolution and neural collaborative filtering
CN113905391B (en) Integrated learning network traffic prediction method, system, equipment, terminal and medium
Fang et al. A new sequential image prediction method based on LSTM and DCGAN
CN111798934A (en) Molecular property prediction method based on graph neural network
CN113361559B (en) Multi-mode data knowledge information extraction method based on deep-width combined neural network
CN116628597B (en) Heterogeneous graph node classification method based on relationship path attention
CN112381179A (en) Heterogeneous graph classification method based on double-layer attention mechanism
CN112861976A (en) Sensitive image identification method based on twin graph convolution hash network
CN116049467A (en) Non-supervision image retrieval method and system based on label visual joint perception
CN113761250A (en) Model training method, merchant classification method and device
Dai et al. Hybrid deep model for human behavior understanding on industrial internet of video things
CN114020891A (en) Double-channel semantic positioning multi-granularity attention mutual enhancement video question-answering method and system
CN114067162A (en) Image reconstruction method and system based on multi-scale and multi-granularity feature decoupling
CN114817773A (en) Time sequence prediction system and method based on multi-stage decomposition and fusion
CN116485839A (en) Visual tracking method based on attention self-adaptive selection of transducer
CN111832637A (en) Distributed deep learning classification method based on alternative direction multiplier method ADMM
CN110020379B (en) Link prediction method based on deep dynamic network embedded representation model
CN112966706A (en) Multilayer network representation learning method based on graph attention machine mechanism
CN113918711B (en) Academic paper-oriented classification method based on multi-view multi-layer attention
CN115457657A (en) Method for identifying channel characteristic interaction time modeling behaviors based on BERT model
CN114386600A (en) Network characterization method based on self-adaptive structure and position coding
CN114943016A (en) Cross-granularity joint training-based graph comparison representation learning method and system
CN114529908A (en) Offline handwritten chemical reaction type image recognition technology
CN111798933B (en) Molecular docking discrimination method based on deep learning
CN112561174B (en) LSTM and MLP-based superimposed neural network prediction geothermal energy production method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant