CN114822718A

CN114822718A - Human oral bioavailability prediction method based on graph neural network

Info

Publication number: CN114822718A
Application number: CN202210306054.5A
Authority: CN
Inventors: 杨云; 于明浩
Original assignee: Yunnan University YNU
Current assignee: Yunnan University YNU
Priority date: 2022-03-25
Filing date: 2022-03-25
Publication date: 2022-07-29
Anticipated expiration: 2042-03-25
Also published as: CN114822718B

Abstract

The invention discloses a human oral bioavailability prediction method based on a graph neural network in the technical field of molecular chemical property prediction, which comprises an initial atom and chemical bond characteristic extraction module and a graph neural network module; the map neural network needs to convert molecular structure information into a molecular map, the initial characteristics of atoms and chemical bonds need to be defined for the map neural network to use, and the atomic structure information is utilized to construct an atomic adjacency matrix representing the topological structure of molecules; the graph neural network module and the forward propagation of the graph neural network comprise two steps including message transmission and reading, wherein the message transmission needs to be carried out for multiple times to generate a good hidden representation of atoms and chemical bonds, the extraction of molecular descriptors can be avoided by using the graph neural network, the workload is reduced, a chemical bond message absorption mechanism is used, a chemical bond auxiliary model is made to learn a better molecular representation, and the interpretability of the graph neural network is improved.

Description

Human oral bioavailability prediction method based on graph neural network

Technical Field

The invention relates to the technical field of molecular chemical property prediction, in particular to a human oral bioavailability prediction method based on a graph neural network.

Background

Oral bioavailability in humans is one of the most important pharmacokinetic properties in the development of oral drugs in humans. In the early stage of oral drug discovery and development, candidate drugs with low oral bioavailability in human bodies are excluded, and resource consumption can be reduced. At present, human oral bioavailability of candidate drugs is often predicted by combining a molecular descriptor based on a specific calculation method or on expert definition with a machine learning algorithm, the predefined molecular descriptor not only increases workload, but also does not bring new insights and new ideas for oral drug development, and the traditional prediction of human oral bioavailability uses the molecular descriptor to combine with machine learning to develop a prediction model, but the molecular descriptor is often based on previous drug development experiences, does not provide new insights for new drug development, and has certain unavoidable experience deviation. With the development of deep learning technology, the graph neural network has been widely applied to molecular property prediction tasks. By using the graph neural network, the molecular hidden representation can be automatically learned by only defining simple atomic characteristics and chemical bond characteristics without extracting a molecular descriptor, and the molecular property prediction is completed. Therefore, the method has great practical significance for constructing a human oral bioavailability prediction model by using the graph neural network, assisting in research and development of new drugs and promoting application and development of artificial intelligence in the field of drug discovery.

Because the prediction of the human oral bioavailability has higher theoretical research and application values, the resource waste caused by the too low human oral bioavailability of the candidate drug can be obviously reduced, and many researchers at home and abroad always propose a new method for predicting the property. Falc Lou n-Cano [1] and the like are integrated by using various machine learning models, and 0D-2D various molecular descriptors are extracted to construct a human oral bioavailability prediction model. The application of graph neural network to predict human oral bioavailability belongs to the field of molecular property prediction, Gilmer [3] et al propose a message transmission graph neural network model, construct the convolution operation of graph neural network based on atomic message transmission, and greatly exceed the traditional method in the field of quantum chemical property prediction;

the prior art has the following disadvantages:

(1) human oral bioavailability prediction model

The previous prediction models for predicting oral bioavailability in humans are represented by molecular descriptors, which can be classified into predefined molecular descriptors and specific calculation-based molecular descriptors. The molecular descriptors based on the pre-definition are developed by pharmacologists through previous drug development experiences, the compounds synthesized by human beings currently only occupy a small part of chemical space, and the problems of experience deviation, misjudgment and the like are inevitably generated based on the previous drug development experiences. For descriptors based on a particular computational method, the researcher is usually unaware of the relevance of the descriptor to the task, which limits the performance of predictions for certain properties, such as oral bioavailability in humans. The use of a graphical neural network to automatically extract a molecular representation that is highly correlated with human oral bioavailability or will help predict this property in a more accurate manner.

(2) Molecular property prediction model based on graph neural network

At present, the forward propagation process of predicting molecular properties by the neural network does not take the essential characteristics of chemical bonds, which represent electron clouds around atom pairs, into account. When the atomic state is changed, the chemical bond state should also be changed. However, most models do not update chemical bonds during message passing, and even if chemical bonds are updated, interaction of atoms and chemical bonds is not sufficient. Improving the interaction of atoms and chemical bonds, updating chemical bonds in a manner consistent with chemical knowledge, or will help improve the performance of molecular property predictions for graphical neural networks.

Based on the above, the invention designs a human oral bioavailability prediction method based on a graph neural network to solve the problems.

Disclosure of Invention

The present invention aims to provide a method for predicting human oral bioavailability based on a graph neural network, so as to solve the problems proposed in the background technology.

1. In order to achieve the purpose, the invention provides the following technical scheme: the human body oral bioavailability prediction method based on the graph neural network comprises an initial atom and chemical bond characteristic extraction module and a graph neural network module;

in the initial atom and chemical bond feature extraction module, the graph neural network needs to convert molecular structure information into a molecular graph, the initial features of atoms and chemical bonds need to be defined for the graph neural network to use, and an atom adjacency matrix is constructed to represent a topological structure of molecules by utilizing the atom structure information;

the forward propagation of the graph neural network comprises two steps, namely message transmission and reading, wherein the message transmission needs to be carried out for multiple times to generate good hidden representations of atoms and chemical bonds, the reading operation enables the hidden representations of the atoms and the chemical bonds to generate hidden representations of molecules, and then the prediction is carried out by using a full-connection network to obtain a prediction result;

s1: message transmission, wherein the message transmission comprises three stages of atomic message transmission, chemical bond message absorption and self-attention zooming;

during the atomic messaging phase, each atom in the molecular graph will absorb information about the atoms and chemical bonds to which it is attached

According to the following steps:

wherein the content of the first and second substances,

and

are all learning matrices, d ^t And c ^t The dimensionality of the atomic state vector and the chemical bond state vector in the t-th update respectively; d ^t+1 Is the dimension of the primitive state vector in the t +1 th update; σ (-) isA ReLU nonlinear activation function; in the process, the information of the central atom i is updated by the information of the peripheral neighbor atoms and the chemical bonds connected with the peripheral neighbor atoms;

in the chemical bond message absorption phase, the chemical bond will absorb the information of the two atoms connected to it for updating itself, according to:

wherein the content of the first and second substances,

and

all the learning matrixes are used as learning matrixes,

will be reacted with e _ij Splicing the state vectors of two connected atoms;

through atom message transmission and chemical bond message absorption, the information of atoms flows to atoms and chemical bonds connected with the atoms and the chemical bonds, the chemical bonds also absorb the information of surrounding atoms, and after multiple updates, the molecular information flows through all atoms and chemical bonds, so that the atoms and the chemical bonds have the topological information of the neighborhood;

in the zoom from attention stage, the model will focus on atomic and chemical bond features according to:

wherein, V ^t+1 And E ^t+1 The state matrices of atoms and chemical bonds at the time of completing the atomic message delivery and chemical bond message absorption, respectively, in the t-th update,

and

are all the learning matrixes,

is a Hadamard Product of a matrix (Hadamard Product), W _va1 For embedding information in an atomic state matrix into a high-dimensional space, after activation, W _va2 Extracting information, converting numerical values into attention weights through a SoftMax (·) function to obtain an atomic attention weight vector, and directly using the attention weight vector and an atomic state matrix to carry out Hadamard product to reduce the numerical values of all characteristics, namely important characteristics, the reduction amplitude and d ^t+1 Is related to the size of d ^t ⁺¹ The larger the eigenvalue is reduced, the larger the attention weight vector is enlarged by d ^t+1 Multiple, such that the average of the attention weight vector is scaled to 1 regardless of the feature vector length d ^t+1 The model is easier to train;

s2: reading, in the reading phase, simultaneously processing atoms and chemical bonds using a plurality of reading functions to obtain a better molecular hidden representation, according to:

v _all ＝Set2Set(V ^T )||Mean(V ^T )||Max(V ^T ) (8)

e _all ＝Set2Set(E ^T )||Mean(E ^T )||Max(E ^T ) (9)

z＝v _all ||e _all (10)

wherein Mean (-) and Max (-) are global average pooling and global maximum pooling, respectively.

Preferably, extracting the atomic initial features including atomic type, atomic number, aromaticity and hybridization mode features as atomic representation; extracting chemical bond initial characteristics including bond type, whether the chemical bond is a covalent bond or not and stereoisomerism type characteristics as chemical bond representation.

Preferably, in S1, the matrix is embedded

And

respectively embedding information of atoms and chemical bonds into a hidden space, wherein the dimension of the space is h; dimension reduction matrix

For translating information in hidden space into the dimension required by the neural network of the lower graph,

for collecting information about atom i itself.

Preferably, in S1, the matrix is embedded

Is used to embed the two atomic information into a hidden space, with a dimension h,

for collecting chemical bonds e _ij Embedding own information into hidden space, and reducing dimension matrix

The method is used for converting the information in the hidden space into the dimension required by the chemical bond of the neural network of the next layer diagram.

Preferably, in S1, the average value of the attention weight vector is

Preferably, in S1, the chemical bond state vector matrix is processed in the same manner as described above.

Preferably, in S2, the results obtained by the various Readout functions are concatenated so that the obtained atoms as a whole represent v _all And chemical bond as a whole represent e _all It will be more representative of its overall state.

Preferably, in S2, v is _all And e _all And (4) splicing to obtain a hidden representation z of the molecule, and then predicting by using the full-connection layer f (-) to obtain a prediction result.

Compared with the prior art, the invention has the beneficial effects that:

1. the method provides chemical bond message absorption, so that a graph neural network can adaptively fuse important layer number characteristics according to molecular structure information, and simultaneously filters noise information to improve molecular representation capability; a self-attention zooming mechanism is provided, so that the model can focus on the characteristics strongly related to the human oral bioavailability and simultaneously avoid the strong related characteristics from being excessively reduced, and the molecular representation capability is improved; the method has strong explanatory property, can analyze the molecular substructure highly related to the human oral bioavailability, and provides new insight of artificial intelligence level exceeding human visual angle for the research and development of new drugs;

2. by using the graph neural network, the extraction of molecular descriptors can be avoided, the workload is reduced, and a chemical bond message absorption mechanism is used, so that a chemical bond auxiliary model learns better molecular representation, and the explanatory performance of the graph neural network is improved.

Of course, it is not necessary for any product in which the invention is practiced to achieve all of the above-described advantages at the same time.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.

FIG. 1 is a flow chart of a method of the present invention;

FIG. 2 is a schematic diagram of a neural network module of the present invention;

FIG. 3 is a schematic diagram of atomic messaging and chemical bond message absorption in accordance with the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Example 1

Referring to fig. 1 to 3, the present invention provides a technical solution of a method for predicting human oral bioavailability based on a neural network: the human body oral bioavailability prediction method based on the graph neural network comprises an initial atom and chemical bond characteristic extraction module and a graph neural network module;

in the initial atom and chemical bond feature extraction module, a graph neural network needs to convert molecular structure information into a molecular graph, the initial features of atoms and chemical bonds need to be defined for the graph neural network to use, and the extracted atomic initial features comprise atomic type, atomic number, aromaticity and hybridization mode features as atomic representation; extracting chemical bond initial characteristics including bond types, whether the chemical bond initial characteristics are covalent bonds or not and stereoisomerism type characteristics as chemical bond representation, and constructing a topological structure of an atom adjacency matrix representative molecule by using atom structure information;

According to the following steps:

wherein the content of the first and second substances,

and

are all learning matrices, d ^t And c ^t The dimensionality of the atomic state vector and the chemical bond state vector in the t-th update respectively; d ^t+1 Is the dimension of the primitive state vector in the t +1 th update; σ (-) is the ReLU nonlinear activation function; embedded matrix

And

for embedding information of atoms and chemical bonds, respectivelyEntering a hidden space, wherein the dimension of the space is h; dimension reduction matrix

for collecting the atom i self information, this process updates the central atom i self information with the information of its surrounding neighbor atoms and chemical bonds connected to them, fig. 3(a) shows the process of atom messaging;

wherein the content of the first and second substances,

and

are all the learning matrixes,

will be reacted with e _ij State vector stitching, embedding matrix of two connected atoms

For hidingThe information in the space is converted into the dimension required by the chemical bond of the neural network of the next layer diagram, and the chemical bond message absorption process is shown in fig. 3 (b);

and

are all the learning matrixes,

hadamard Product (Hadamard Product), W, as a matrix _va1 For embedding information in an atomic state matrix into a high-dimensional space, after activation, W _va2 Extracting information, converting numerical values into attention weights through a SoftMax (·) function to obtain an atomic attention weight vector, wherein the average value of the attention weight vector at the moment is

When the attention weight vector and the atomic state matrix are directly used to perform the Hadamard product, the values of all the features are reduced, even the important features, the reduction range and d ^t+1 Is related to the size of d ^t+1 The larger the eigenvalue is reduced, the larger the attention weight vector is enlarged by d ^t+1 Multiple, such that the average of the attention weight vector is scaled to 1 regardless of the feature vector length d ^t+1 The influence of (3) avoids overlarge reduction of characteristic numerical values when attention is used, so that the model is easier to train, and the processing mode of the chemical bond state vector matrix is the same as that of the chemical bond state vector matrix;

v _all ＝Set2Set(V ^T )||Mean(V ^T )||Max(VT) (8)

e _all ＝Set2Set(E ^T )||Mean(E ^T )||Max(E ^T ) (9)

z＝v _all ||e _all (10)

wherein Mean (-) and Max (-) are respectively global average pooling and global maximum pooling, and results obtained by various Readout functions are spliced to enable obtained atoms to integrally express v _all And chemical bond as a whole represent e _all Will be more representative of its overall state, will v _all And e _all And (4) splicing to obtain a hidden representation z of the molecule, and then predicting by using the full-connection layer f (-) to obtain a prediction result.

The method provides chemical bond message absorption, so that a graph neural network can adaptively fuse important layer number characteristics according to molecular structure information, and simultaneously filters noise information to improve molecular representation capability; a self-attention zooming mechanism is provided, so that the model can focus on the characteristics strongly related to the human oral bioavailability and simultaneously avoid the strong related characteristics from being excessively reduced, and the molecular representation capability is improved; the method has strong explanatory property, can analyze the molecular substructure highly related to the human oral bioavailability, and provides new insight of artificial intelligence level exceeding human visual angle for the research and development of new drugs; by using the graph neural network, the extraction of molecular descriptors can be avoided, the workload is reduced, and a chemical bond message absorption mechanism is used, so that a chemical bond auxiliary model learns better molecular representation, and the explanatory performance of the graph neural network is improved.

In the description herein, references to the description of "one embodiment," "an example," "a specific example" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise embodiments disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.

Claims

1. The human body oral bioavailability prediction method based on the graph neural network is characterized by comprising an initial atom and chemical bond feature extraction module and a graph neural network module;

According to the following steps:

wherein the content of the first and second substances,

and

are all learning matrices, d ^t And c ^t The dimensionality of the atomic state vector and the chemical bond state vector in the t-th update respectively; d ^t+1 Is the dimension of the primitive state vector in the t +1 th update; σ (-) is the ReLU nonlinear activation function; in the process, the information of the central atom i is updated by the information of the peripheral neighbor atoms and the chemical bonds connected with the peripheral neighbor atoms;

wherein the content of the first and second substances,

and

are all the learning matrixes,

will be reacted with e _ij Splicing the state vectors of two connected atoms;

and

are all the learning matrixes,

is the Hadamard product (Hadamard product) of a matrix, W _va1 For embedding information in an atomic state matrix into a high-dimensional space, after activation, W _va2 Extracting information, converting numerical values into attention weights through a SoftMax (·) function to obtain an atomic attention weight vector, and directly using the attention weight vector and an atomic state matrix to carry out Hadamard product to reduce the numerical values of all characteristics, namely important characteristics, the reduction amplitude and d ^t+1 Is related to the size of d ^t+1 The larger the eigenvalue is reduced, the larger the attention weight vector is enlarged by d ^t+1 Multiple, such that the average of the attention weight vector is scaled to 1 regardless of the feature vector length d ^t+1 Make the model easier to trainRefining;

v _all ＝Set2Set(V ^T )||Mean(V ^T )||Max(V ^T ) (8)

e _all ＝Set2Set(E ^T )||Mean(E ^T )||Max(E ^T ) (9)

z＝v _all ||e _all (10)

2. The method of predicting human oral bioavailability based on neural networks of claim 1, wherein: the extracted atomic initial features comprise atomic type, atomic number, aromaticity and hybridization mode features as atomic representations; extracting chemical bond initial characteristics including bond type, whether the bond is a covalent bond or not and stereoisomeric type characteristics as chemical bond representation.

3. The method of predicting human oral bioavailability based on neural networks of claim 1, wherein: in the S1, a matrix is embedded

And

for collecting information about atom i itself.

4. The method of predicting human oral bioavailability based on neural networks of claim 1, wherein: in the S1, a matrix is embedded

5. The method of predicting human oral bioavailability based on neural networks of claim 1, wherein: in the step S1, the average value of the attention weight vector is

6. The method of predicting human oral bioavailability based on neural networks of claim 1, wherein: in S1, the chemical bond state vector matrix is processed in the same manner as described above.

7. The method for predicting human oral bioavailability based on neural networks of claim 1, whereinIs characterized in that: in the step S2, the results obtained by various Readout functions are spliced, so that the obtained atom integrally represents v _all And chemical bond as a whole represent e _all It will be more representative of its overall state.

8. The method of predicting human oral bioavailability based on neural networks of claim 1, wherein: in said S2, v is _all And e _all And (4) splicing to obtain a hidden representation z of the molecule, and then predicting by using the full-connection layer f (-) to obtain a prediction result.