CN114822718B

CN114822718B - Human oral bioavailability prediction method based on graph neural network

Info

Publication number: CN114822718B
Application number: CN202210306054.5A
Authority: CN
Inventors: 杨云; 于明浩
Original assignee: Yunnan University YNU
Current assignee: Yunnan University YNU
Priority date: 2022-03-25
Filing date: 2022-03-25
Publication date: 2024-04-09
Anticipated expiration: 2042-03-25
Also published as: CN114822718A

Abstract

The invention discloses a human oral bioavailability prediction method based on a graph neural network in the technical field of molecular chemistry prediction, which comprises an initial atom, a chemical bond characteristic extraction module and a graph neural network module; the image neural network needs to convert molecular structure information into a molecular graph, defines initial characteristics of atoms and chemical bonds for the image neural network, and constructs a topological structure of the atomic adjacency matrix representative molecule by utilizing the atomic structure information; the graph neural network module comprises two steps, namely message transmission and reading, wherein the message transmission needs to be carried out for many times to generate hidden representations of atoms and chemical bonds, the graph neural network is used for avoiding extraction of molecular descriptors, reducing workload and using a chemical bond message absorption mechanism, so that a chemical bond auxiliary model learns better molecular representations, and the interpretation of the graph neural network is improved.

Description

Human oral bioavailability prediction method based on graph neural network

Technical Field

The invention relates to the technical field of molecular chemistry prediction, in particular to a human oral bioavailability prediction method based on a graph neural network.

Background

Human oral bioavailability is one of the most important pharmacokinetic properties in human oral drug development. In the early stage of the discovery and research and development of oral medicines, candidate medicines with low oral bioavailability of human bodies are eliminated, so that the consumption of resources can be reduced. Currently, a molecular descriptor based on a specific calculation method or based on expert definition is often used for predicting the human oral bioavailability of a candidate drug in combination with a machine learning algorithm, and the predefined molecular descriptor not only increases the workload, but also does not bring new insight and new ideas to the development of oral drugs, and the traditional prediction of the human oral bioavailability uses the molecular descriptor in combination with the machine learning to develop a prediction model, but the molecular descriptor is often based on the experience of the development of the traditional drug, does not provide new insight to the development of the new drug, and has a certain unavoidable experience deviation. With the development of deep learning technology, the graph neural network has been widely applied to the task of molecular property prediction. The use of the graph neural network does not need to extract molecular descriptors, and can automatically learn the hidden representation of the molecules by defining simple atomic characteristics and chemical bond characteristics, thereby completing the prediction of molecular properties. Therefore, the graph neural network is utilized to construct a human oral bioavailability prediction model, the development of new drugs is assisted, and the application and development of artificial intelligence in the field of drug discovery are promoted.

Because the human oral bioavailability prediction has higher theoretical research and application value, the resource waste caused by the too low human oral bioavailability of the candidate drug can be obviously reduced, and a plurality of researchers at home and abroad always propose a new method for predicting the property. Falcdelta-Cano [1] et al use multiple machine learning model integration, extract 0D-2D multiple molecular descriptors to construct human oral bioavailability prediction model, experimental results show that the prediction method has certain advantage in the aspect of prediction accuracy, which is also representative of the traditional prediction method. The application of graph neural network to predict human oral bioavailability belongs to the field of molecular property prediction, gilmer [3] et al propose a message transfer graph neural network model, and the convolution operation of the graph neural network is constructed based on atomic message transfer, which has greatly exceeded the traditional method in the field of quantum chemistry property prediction;

the prior art has the following defects:

(1) Human oral bioavailability prediction model

The prior prediction model for predicting the oral bioavailability of a human body is expressed by taking a molecular descriptor as a molecule, wherein the molecular descriptor can be divided into a molecular descriptor based on a predefined method and a molecular descriptor based on a specific calculation method. The compound synthesized by human at present only occupies a small part of chemical space based on predefined molecular descriptors which are developed by pharmacologists through past drug development experience, and the problems of experience deviation, misjudgment and the like are unavoidable based on the past drug development experience. Whereas for descriptors based on a specific computational method, researchers are often unaware of the relevance of the descriptor to the task, which limits the performance of predictions for a particular property, such as human oral bioavailability predictions. The use of the graph neural network to automatically extract a molecular representation that is highly correlated to the human oral bioavailability or will help predict this property in a more accurate manner.

(2) Molecular property prediction model based on graph neural network

Currently, the forward propagation process of the graph neural network predicting molecular properties does not take into account the intrinsic nature of chemical bonds, which represent electron clouds around atom pairs. When the atomic state changes, the chemical bond state should also change. However, most models do not update chemical bonds during message passing, and even if the chemical bonds are updated, the interaction of atoms and chemical bonds is insufficient. Improving the interaction of atoms and chemical bonds, renewing chemical bonds in a manner consistent with chemical knowledge, or would help to improve the performance of the molecular property predictions of the graph neural network.

Based on the above, the present invention designs a human oral bioavailability prediction method based on a graph neural network to solve the above problems.

Disclosure of Invention

The invention aims to provide a human oral bioavailability prediction method based on a graph neural network, so as to solve the problems in the background technology.

1. In order to achieve the above purpose, the present invention provides the following technical solutions: the human oral bioavailability prediction method based on the graph neural network comprises an initial atom, a chemical bond characteristic extraction module and a graph neural network module;

the initial atomic and chemical bond characteristic extraction module is used for converting molecular structure information into a molecular graph, defining initial characteristics of atoms and chemical bonds for the graph neural network, and constructing an atomic adjacency matrix to represent a topological structure of molecules by utilizing the atomic structure information;

the forward propagation of the graph neural network comprises two steps, namely message transmission and reading, wherein the message transmission needs to be carried out for many times to generate hidden representations of atoms and chemical bonds, the hidden representations of the atoms and the chemical bonds are generated into hidden representations of molecules by reading operation, and then the hidden representations of the atoms and the chemical bonds are predicted by using a fully-connected network to obtain a prediction result;

s1: messaging, which includes three phases of atomic messaging, chemical bond message absorption, and scaling self-attention;

in the atomic messaging phase, each atom in the molecular diagram absorbs information about the atom and chemical bond to which it is attachedAccording to the following:

wherein,and->Are learning matrices, d ^t And c ^t In the t-th update, the dimensions of the atomic state vector and the chemical bond state vector are respectively; d, d ^t+1 Is the dimension of the atomic state vector in the t+1st update; sigma (·) is a ReLU nonlinear activation function; the process updates the information of the central atom i by using the information of the neighbor atoms around the central atom and the chemical bonds connected with the central atom;

in the chemical bond message absorption phase, the chemical bond absorbs information of two atoms connected with the chemical bond message for updating the chemical bond message, and the chemical bond message is based on the following steps:

wherein,and->All are learning matrices->Will be combined with e _ij Splicing the state vectors of the two connected atoms;

through atom information transmission and chemical bond information absorption, information of atoms flows to atoms and chemical bonds connected with the atoms, the chemical bonds absorb surrounding atom information, and after multiple updates, molecular information flows through all the atoms and the chemical bonds, so that the atoms and the chemical bonds have topology information of the neighborhood of the atoms and the chemical bonds;

in the scale self-attention phase, the model will focus on the atomic and chemical bond features, according to:

wherein V is ^t+1 And E is ^t+1 The state matrix of atoms and chemical bonds when completing the transfer of the atomic message and the absorption of the chemical bond message in the t-th update, and->All are learning matrices->Is Hadamard Product (Hadamard Product) of matrix, W _va1 For embedding information in an atomic state matrix into a high-dimensional space, W after activation _va2 Extracting information, converting the numerical value into attention weight through a SoftMax (·) function to obtain an atomic attention weight vector, and reducing the numerical value of all features, namely important features, the reduction amplitude and d when the attention weight vector and an atomic state matrix are directly used for Hadamard product ^t+1 Related to the size of d ^t ⁺¹ The larger the feature value is, the larger the degree of feature value shrinkage is, and the attention weight vector is amplified by d ^t+1 The average value of the attention weight vector is amplified to 1 and is not influenced by the length d of the feature vector ^t+1 The influence of (2) makes the model easier to train;

s2: readout, in the readout phase, atoms and chemical bonds are processed simultaneously using multiple readout functions to obtain a better molecular hidden representation, according to:

v _all ＝Set2Set(V ^T )||Mean(V ^T )||Max(V ^T ) (8)

e _all ＝Set2Set(E ^T )||Mean(E ^T )||Max(E ^T ) (9)

z＝v _all ||e _all (10)

wherein, mean (-) and Max (-) are global average pooling and global maximum pooling, respectively.

Preferably, the extracted atomic initiation features include atomic type, atomic number, aromaticity, and hybridization mode features as atomic representations; extracting chemical bond initiation features includes bond type, whether covalent bond, or stereoisomeric type features as chemical bond representations.

Preferably, in S1, the matrix is embeddedAnd->The method is used for embedding information of atoms and chemical bonds into a hidden space respectively, and the space dimension is h; dimension-reducing matrix->Dimension required for transforming information in hidden space into the underlying graphic neural network, ++>For collecting information about atom i itself.

Preferably, in S1, the matrix is embeddedIs used to embed the two atomic information into a hidden space with dimensions h,/and h->For collecting chemical bonds e _ij The information of the self is embedded into the hidden space as well, and the dimension-reducing matrix is +.>For converting the information in the hidden space into the dimensions required by the chemical bonds of the neural network of the next layer.

Preferably, in S1, the average value of the attention weight vectors is

Preferably, in S1, the chemical bond state vector matrix is processed in the same manner as described above.

Preferably, in S2, the results obtained by the various Readout functions are spliced so that the obtained atoms are denoted as v in their entirety _all And chemical bond overall represents e _all Will be more representative of its overall state.

Preferably, in S2, v _all And e _all And splicing to obtain a hidden representation z of the molecule, and predicting by using a full-connection layer f (·) to obtain a prediction result.

Compared with the prior art, the invention has the beneficial effects that:

1. according to the method, chemical bond message absorption is proposed, so that the graphic neural network can adaptively fuse important layer number characteristics according to molecular structure information, noise information is filtered, and molecular representation capacity is improved; providing a scaling self-attention mechanism, so that the model can pay attention to the characteristics which are strongly related to the oral bioavailability of a human body, simultaneously avoid the strongly related characteristics from being reduced too much, and improve the molecular representation capability; the method has strong interpretation, can analyze molecular substructures highly related to the oral bioavailability of human bodies, and provides new insights of artificial intelligence layers exceeding the human visual angle for new medicine research and development;

2. by using the graphic neural network, extraction of molecular descriptors can be avoided, workload is reduced, and a chemical bond message absorption mechanism is used, so that a better molecular representation is learned by a chemical bond auxiliary model, and the interpretation of the graphic neural network is improved.

Of course, it is not necessary for any one product to practice the invention to achieve all of the advantages set forth above at the same time.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed for the description of the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a flow chart of the method of the present invention;

FIG. 2 is a schematic diagram of the neural network module of the present invention;

fig. 3 is a schematic diagram of atomic messaging and chemical bond message absorption in accordance with the present invention.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

Example 1

Referring to fig. 1 to 3, the present invention provides a human oral bioavailability prediction method based on a graph neural network, which comprises the following steps: the human oral bioavailability prediction method based on the graph neural network comprises an initial atom, a chemical bond characteristic extraction module and a graph neural network module;

the initial atomic and chemical bond characteristic extraction module is used for converting molecular structure information into a molecular graph, defining initial characteristics of atoms and chemical bonds for the graph neural network, and extracting atomic initial characteristics including atomic types, atomic numbers, aromaticity and hybridization mode characteristics as atomic representations; extracting initial characteristics of chemical bonds including bond types, covalent bonds or stereoisomerism types as chemical bond representations, and constructing a topological structure of molecules represented by an atomic adjacency matrix by utilizing atomic structure information;

wherein,and->Are learning matrices, d ^t And c ^t In the t-th update, the dimensions of the atomic state vector and the chemical bond state vector are respectively; d, d ^t+1 Is the dimension of the atomic state vector in the t+1st update; sigma (·) is a ReLU nonlinear activation function; embedding matrix->And->The method is used for embedding information of atoms and chemical bonds into a hidden space respectively, and the space dimension is h; dimension-reducing matrix->Dimension required for transforming information in hidden space into the underlying graphic neural network, ++>For collecting atom i self information, this process updates the central atom i self information with information of its surrounding neighbor atoms and chemical bonds connected thereto, fig. 3 (a) shows the process of atomic messaging;

wherein,and->All are learning matrices->Will be combined with e _ij State vector concatenation of two connected atoms, embedding matrix +.>Is used to embed the two atomic information into a hidden space with dimensions h,/and h->For collecting chemical bonds e _ij The information of the self is embedded into the hidden space as well, and the dimension-reducing matrix is +.>The method is used for converting the information in the hidden space into the dimension required by the chemical bond of the neural network of the next layer of graph, and the chemical bond message absorption process is shown in the figure 3 (b);

wherein V is ^t+1 And E is ^t+1 The state matrix of atoms and chemical bonds when completing the transfer of the atomic message and the absorption of the chemical bond message in the t-th update, and->All are learning matrices->Is Hadamard Product (Hadamard Product) of matrix, W _va1 For embedding information in an atomic state matrix into a high-dimensional space, W after activation _va2 Extracting information, converting the numerical value into attention weight through a SoftMax (·) function to obtain an atomic attention weight vector, wherein the average value of the attention weight vector at the moment is +.>When the attention weight vector and the atomic state matrix are directly used for Hadamard product, the numerical value of all the features is reduced, even the important featuresThe reduction of the amplitude and d ^t+1 Related to the size of d ^t+1 The larger the feature value is, the larger the degree of feature value shrinkage is, and the attention weight vector is amplified by d ^t+1 The average value of the attention weight vector is amplified to 1 and is not influenced by the length d of the feature vector ^t+1 The influence of the characteristic value is avoided from being excessively reduced when the attention is used, so that the model is easier to train, and the processing mode of the chemical bond state vector matrix is the same as that described above;

v _all ＝Set2Set(V ^T )||Mean(V ^T )||Max(VT) (8)

e _all ＝Set2Set(E ^T )||Mean(E ^T )||Max(E ^T ) (9)

z＝v _all ||e _all (10)

wherein Mean (-) and Max (-) are global average pooling and global maximum pooling respectively, and the results obtained by the plurality of Readout functions are spliced to obtain an atomic integral representation v _all And chemical bond overall represents e _all Will be more representative of its overall state, will be v _all And e _all And splicing to obtain a hidden representation z of the molecule, and predicting by using a full-connection layer f (·) to obtain a prediction result.

According to the method, chemical bond message absorption is proposed, so that the graphic neural network can adaptively fuse important layer number characteristics according to molecular structure information, noise information is filtered, and molecular representation capacity is improved; providing a scaling self-attention mechanism, so that the model can pay attention to the characteristics which are strongly related to the oral bioavailability of a human body, simultaneously avoid the strongly related characteristics from being reduced too much, and improve the molecular representation capability; the method has strong interpretation, can analyze molecular substructures highly related to the oral bioavailability of human bodies, and provides new insights of artificial intelligence layers exceeding the human visual angle for new medicine research and development; by using the graphic neural network, extraction of molecular descriptors can be avoided, workload is reduced, and a chemical bond message absorption mechanism is used, so that a better molecular representation is learned by a chemical bond auxiliary model, and the interpretation of the graphic neural network is improved.

In the description of the present specification, the descriptions of the terms "one embodiment," "example," "specific example," and the like, mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiments or examples. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

The preferred embodiments of the invention disclosed above are intended only to assist in the explanation of the invention. The preferred embodiments are not exhaustive or to limit the invention to the precise form disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best understand and utilize the invention. The invention is limited only by the claims and the full scope and equivalents thereof.

Claims

1. The human oral bioavailability prediction method based on the graph neural network is characterized by comprising an initial atom, a chemical bond characteristic extraction module and a graph neural network module;

wherein V is ^t+1 And E is ^t+1 The state matrix of atoms and chemical bonds when completing the transfer of the atomic message and the absorption of the chemical bond message in the t-th update, and->All are learning matrices->Is Hadamard product (Hadamard product) of matrix, W _va1 For embedding information in an atomic state matrix into a high-dimensional space, W after activation _va2 Extracting information, converting the numerical value into attention weight through a SoftMax (·) function to obtain an atomic attention weight vector, and reducing the numerical value of all features, namely important features, the reduction amplitude and d when the attention weight vector and an atomic state matrix are directly used for Hadamard product ^t+1 Related to the size of d ^t+1 The larger the feature value is, the larger the degree of feature value shrinkage is, and the attention weight vector is amplified by d ^t+1 The average value of the attention weight vector is amplified to 1 and is not influenced by the length d of the feature vector ^t+1 The influence of (2) makes the model easier to train;

v _all ＝Set2Set(V ^T )||Mean(V ^T )||Max(V ^T ) (8)

e _all ＝Set2Set(E ^T )||Mean(E ^T )||Max(E ^T ) (9)

z＝v _all ||e _all (10)

2. The method for predicting human oral bioavailability based on the graph neural network of claim 1, wherein: the extracted atomic initial features include atomic type, atomic number, aromaticity, and hybrid mode features as atomic representations; extracting chemical bond initiation features includes bond type, whether covalent bond, or stereoisomeric type features as chemical bond representations.

3. The method for predicting human oral bioavailability based on the graph neural network of claim 1, wherein: in the S1, an embedding matrixAnd->The method is used for embedding information of atoms and chemical bonds into a hidden space respectively, and the space dimension is h; dimension-reducing matrix->Dimension required for transforming information in hidden space into the underlying graphic neural network, ++>For collecting information about atom i itself.

4. The method for predicting human oral bioavailability based on the graph neural network of claim 1, wherein: in the S1, an embedding matrixIs used to embed the two atomic information into a hidden space, the hidden space dimension being h,for collecting chemical bonds e _ij The information of the self is embedded into the hidden space as well, and the dimension-reducing matrix is +.>For converting the information in the hidden space into the dimensions required by the chemical bonds of the neural network of the next layer.

5. The method for predicting human oral bioavailability based on the graph neural network of claim 1, wherein: in S1, the average value of the attention weight vectors is

6. The method for predicting human oral bioavailability based on the graph neural network of claim 1, wherein: in S1, the processing manner of the chemical bond state vector matrix is the same as that described above.

7. The method for predicting human oral bioavailability based on the graph neural network of claim 1, wherein: in S2, splicing the results obtained by the plurality of Readout functions to obtain an integral representation v _all And chemical bond overall represents e _all Will be more representative of its overall state.

8. The method for predicting human oral bioavailability based on the graph neural network of claim 1, wherein: in S2, v is _all And e _all And splicing to obtain a hidden representation z of the molecule, and predicting by using a full-connection layer f (·) to obtain a prediction result.