CN117095767A - Unknown substance acute toxicity prediction method, system and equipment based on deep learning - Google Patents
Unknown substance acute toxicity prediction method, system and equipment based on deep learning Download PDFInfo
- Publication number
- CN117095767A CN117095767A CN202311034419.4A CN202311034419A CN117095767A CN 117095767 A CN117095767 A CN 117095767A CN 202311034419 A CN202311034419 A CN 202311034419A CN 117095767 A CN117095767 A CN 117095767A
- Authority
- CN
- China
- Prior art keywords
- acute toxicity
- attention
- graph
- layer
- improved
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 239000000126 substance Substances 0.000 title claims abstract description 113
- 230000007059 acute toxicity Effects 0.000 title claims abstract description 64
- 231100000403 acute toxicity Toxicity 0.000 title claims abstract description 64
- 238000000034 method Methods 0.000 title claims abstract description 50
- 238000013135 deep learning Methods 0.000 title claims abstract description 27
- 230000007246 mechanism Effects 0.000 claims abstract description 36
- 238000011176 pooling Methods 0.000 claims abstract description 35
- 238000000605 extraction Methods 0.000 claims abstract description 17
- 230000006870 function Effects 0.000 claims description 12
- 238000004590 computer program Methods 0.000 claims description 8
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 claims description 7
- 235000006008 Brassica napus var napus Nutrition 0.000 claims description 7
- 240000000385 Brassica napus var. napus Species 0.000 claims description 7
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 claims description 7
- 235000004977 Brassica sinapistrum Nutrition 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 3
- 238000012549 training Methods 0.000 claims description 3
- 239000013626 chemical specie Substances 0.000 claims description 2
- 230000001988 toxicity Effects 0.000 abstract description 10
- 231100000419 toxicity Toxicity 0.000 abstract description 10
- 238000001514 detection method Methods 0.000 abstract description 2
- 230000000694 effects Effects 0.000 description 3
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 231100000425 computational toxicology Toxicity 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- 206010008428 Chemical poisoning Diseases 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 231100000369 acute toxicity data Toxicity 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000009412 basement excavation Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000002574 poison Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 231100000048 toxicity data Toxicity 0.000 description 1
- 231100000027 toxicology Toxicity 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/30—Prediction of properties of chemical compounds, compositions or mixtures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/70—Machine learning, data mining or chemometrics
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Chemical & Material Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Crystallography & Structural Chemistry (AREA)
- Databases & Information Systems (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Medical Informatics (AREA)
- Mathematical Physics (AREA)
- Evolutionary Biology (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a method, a system and equipment for predicting the acute toxicity of an unknown substance based on deep learning, and relates to the technical field of substance toxicity detection, wherein the method comprises the steps of encoding a CANONICAL SMILES code of the chemical substance in an obtained target area into a corresponding DGL chart, and inputting the corresponding DGL chart into an acute toxicity prediction model of the unknown substance to obtain the types of the chemical substance in the target area and the acute toxicity degree of each chemical substance; wherein the model is determined from a modified schematic force network model; the improved graph meaning network model comprises an input layer, a feature extraction layer, an implicit layer, a pooling layer and a linear classification layer; the attention mechanism of the hidden layer is an attention mechanism of SuperGAT based on graph node-oriented classification tasks. The invention can predict the chemical substance in the target area and the acute toxicity degree of the chemical substance with high precision and high efficiency.
Description
Technical Field
The invention relates to the technical field of material toxicity detection, in particular to a method, a system and equipment for predicting the acute toxicity of unknown materials based on deep learning.
Background
The toxicity prediction of the source poison is an important reference basis for the emergency rescue scheme of sudden chemical poisoning accidents, and the clinical treatment of site casualties and the pollution treatment of site environments can be realized through the prediction of the toxic source substances at the accident site.
The computational toxicology technology is a method for constructing a computer model based on principles of computational chemistry, bioinformatics, systematic biology and the like, can realize the prediction of structures, characteristics, effects and the like of chemical substances in vitro according to the excavation and analysis of the existing data, and can avoid the problems of high cost, long time consumption, violation of animal ethics and the like when the traditional experimental means acquire related information. However, this method has drawbacks of low prediction efficiency and low prediction accuracy.
Disclosure of Invention
The invention aims to provide a method, a system and equipment for predicting the acute toxicity of unknown substances based on deep learning, which can predict the chemical substances in a target area and the acute toxicity degree of the chemical substances with high precision and high efficiency.
In order to achieve the above object, the present invention provides the following solutions:
in a first aspect, the present invention provides a method for predicting acute toxicity of an unknown substance based on deep learning, comprising:
acquiring a CANONICALSMILES code of the chemical substances in the target area, and encoding the CANONICALSMILES code of the chemical substances in the target area into a DGL chart of the chemical substances in the target area;
inputting the DGL graph of the chemical substances in the target area into an unknown substance acute toxicity prediction model to obtain the types of the chemical substances in the target area and the acute toxicity degree of each chemical substance;
the unknown substance acute toxicity prediction model is determined according to an improved graph annotation force network model; the improved graph meaning network model comprises an input layer, a feature extraction layer, an implicit layer, a pooling layer and a linear classification layer; the attention mechanism of the hidden layer is an attention mechanism of SuperGAT based on graph node-oriented classification tasks.
In a second aspect, the present invention provides a deep learning-based system for predicting acute toxicity of an unknown substance, comprising:
the target area parameter acquisition module is used for acquiring a CANONICALSMILES code of the chemical substances in the target area and encoding the CANONICAL SMILES code of the chemical substances in the target area into a DGL chart of the chemical substances in the target area;
the chemical substance acute toxicity prediction module is used for inputting the DGL graph of the chemical substances in the target area into the unknown substance acute toxicity prediction model to obtain the types of the chemical substances in the target area and the acute toxicity degree of each chemical substance;
the unknown substance acute toxicity prediction model is determined according to an improved graph annotation force network model; the improved graph meaning network model comprises an input layer, a feature extraction layer, an implicit layer, a pooling layer and a linear classification layer; the attention mechanism of the hidden layer is an attention mechanism of SuperGAT based on graph node-oriented classification tasks.
In a third aspect, the present invention provides an electronic device, comprising a memory for storing a computer program and a processor that runs the computer program to cause the electronic device to perform the deep learning based unknown substance acute toxicity prediction method according to the first aspect.
According to the specific embodiment provided by the invention, the invention discloses the following technical effects:
the invention combines the computational toxicology and the deep learning technology to research an alternative method for predicting the toxicity of unknown substances based on the determination of the toxicity end point index of the existing experiment, can continuously enrich and perfect the existing toxicity database, fills the blank of experimental data, and provides a certain reference and reference in the innovation of new technology and new method in the field of predictive toxicology.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions of the prior art, the drawings that are needed in the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic flow chart of a method for predicting acute toxicity of unknown substances based on deep learning according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of an unknown substance acute toxicity prediction system based on deep learning according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
In order that the above-recited objects, features and advantages of the present invention will become more readily apparent, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description.
Example 1
As shown in fig. 1, the embodiment provides a method for predicting acute toxicity of unknown substances based on deep learning, which includes:
step 101: a canonicalalsmles code of the chemical substance in the target area is acquired,
and encoding the canola codes for the chemicals within the target area into a DGL plot of the chemicals within the target area. Wherein, SMILES (simplified molecular linear input Specification), CANONICAL SMILES (normalized SMILES-the only SMILES of a compound).
Step 201: and inputting the DGL graph of the chemical substances in the target area into an unknown substance acute toxicity prediction model to obtain the types of the chemical substances in the target area and the acute toxicity degree of each chemical substance.
The unknown substance acute toxicity prediction model is determined according to an improved graph annotation force network model; the improved graph attention network model comprises an input layer, a feature extraction layer, an implicit layer, a pooling layer and a linear classification layer.
The feature extraction layer is a graph isomorphic network.
The attention mechanism of the hidden layer is an attention mechanism of SuperGAT based on the classification task of the graph-oriented nodes.
The pooling layer comprises a maximum pooling method and a weighted sum pooling method.
Further, the improved attention mechanism is an attention mechanism obtained by fusing the dot product attention function and the spliced attention function.
Still further, the hidden layer includes two layers of improved graph attention network; the improved attention network is obtained by modifying the attention mechanism of the original attention network into an improved attention mechanism on the basis of the original attention network; the artwork attention network is an artwork attention network with version GATv 2.
As a preferred embodiment, the determining process of the acute toxicity prediction model of the unknown substance is as follows:
(1) Constructing a sample data set; the sample data set includes a plurality of sample data; the sample data comprises input data and label data corresponding to the input data; the input data is a DGL map of the chemical substances in the sample area, and the label data is the types of the chemical substances in the sample area and the acute toxicity degree of each chemical substance.
(2) An improved graph-annotation network model is constructed.
(3) Training an improved graph attention network model by adopting sample data, and adjusting network parameters of the improved graph attention network model by adopting a back propagation iteration mode so that the loss value of the trained improved graph attention network model is smaller than a set threshold value to obtain a trained improved graph attention network model; the trained improved graph annotation force network model is an unknown substance acute toxicity prediction model.
The method for constructing the sample data set specifically comprises the following steps:
1) Acquiring the CANONICALSMILES codes of the chemical substances in the plurality of sample areas, and the corresponding chemical substance types and the acute toxicity degrees of the chemical substances; 2) Encoding the canola codes for the chemicals in each sample region into a DGL plot of the chemicals in the sample region; 3) A sample data set is constructed from DGL maps of chemicals within a plurality of sample regions, and corresponding chemical species and chemical acute toxicity levels.
The unknown substance acute toxicity prediction model of the present embodiment is further described below by way of an example.
Step 1: the chemical's canonicalalsmies code is encoded into a DGL map and the DGL map is used as an input to the improved graph-annotation network model.
Step 2: the original attention network is improved, namely the attention mechanism of the original attention network is modified into an improved attention mechanism, and the improved model is recorded as a GAT_MX model. Among the improved attentiveness mechanisms are: based on the attention mechanism of the SuperGAT facing the graph node classification task, namely the attention mechanism obtained by fusing the dot product attention function and the spliced attention function. The artwork attention network is an artwork attention network version GATv 2.
The improvement formula is as follows:
e ij =(a T LeakyReLU(W·[h i ||h j ]))·Sigmoid((Wh i ) T ×(Wh j ));
wherein alpha is T Is a neural network; e, e ij Is the attention coefficient; w is a weight matrix; h is a i ,h j Respectively node i and node jIs a feature vector of (1); leakyReLU is the activation function.
Step 3: comparing the average scores of the 10 randomly partitioned datasets of the gat_mx model for different hidden layer numbers, determining the hidden layer of the gat_mx model as a two-layer improved graph-annotation-force network.
Step 4: before the improved graph meaning network, a graph isomorphic network is introduced to extract the characteristics, and the aggregation formula of the graph isomorphic network to the characteristics is as follows:
wherein L is the layer number of the graph isomorphic network,features of the first layer node i and the node j respectively; />Is a feature of the node i of the layer 1; epsilon is a learnable scalar parameter.
Step 5: the pooling layer is introduced before the improved graph-meaning network, and the pooling layer utilizes a maximum pooling method and a weighted sum pooling method simultaneously, and corresponds to different frequency domain information, uses large-scale features and small-scale features together, combines high-frequency (i.e. small-scale) information and low-frequency (i.e. large-scale) information and sends the combined information and the combined information into the linear classification layer.
The expression of the max-pooling method is as follows:
wherein N is i All adjacent nodes of the node i;output values are maximized for pooling; k is the kth feature map.
The expression of weighted sum pooling method is as follows:
wherein,a weighted sum of all eigenvalues within k multiplied by corresponding weights.
In this embodiment, the expression after splicing the two pooling methods is as follows:
wherein R is the splicing result of the maximum pooling and weighted sum pooling output.
In summary, the best model finally obtained in this embodiment is a two-layer graph attention network with improved attention mechanism combined with a maximum pooling and weighted sum pooling model gat_mx_mws, i.e. an unknown acute toxicity prediction model.
The embodiment designs a single multi-task learning model, the same network is used for the feature extraction of the front stage, and the feature extraction of the front stage is only carried out on the linear output layers respectively, so that the prediction efficiency is improved, and the design is closer to the requirement of toxicity prediction tasks of accident sites.
In contrast to the single-task model performed by researchers in the prior art, some of the same molecular features may exist in different toxic tasks, and this part of the information may be discarded in the single-task model, which in turn results in information loss. In a real emergency use scenario, the user can determine the type of toxicity more quickly for the multitasking model. In addition, the basic graph annotation force network is improved, the problems of insufficient feature extraction and general generalization capability in the traditional graph neural network-based toxicity prediction method are solved, and the better performance of the graph annotation force network in a toxicity prediction task is shown.
The present embodiment chooses to encode the canonicalsmles code as a DGL map. The canola code is a linear symbol that represents a molecule using a single line of ASCII text, which possesses uniqueness, i.e., the canola code and structure of the molecule are synonymous. And when the DGL graph object is converted, compared with the information loss possibly caused by the molecular descriptor and the molecular fingerprint, the topology structure of the molecules is almost completely reserved when the DGL graph object is converted, and the method is more beneficial to the extraction of subsequent features.
The dot product attention is introduced on the mature GATv2 network to realize the aim of improving the attention mechanism, thereby providing a novel GAT_MX method. The method improves the capability of feature extraction of the acute toxicity data set, and has fewer training iteration times, more stable convergence and higher prediction precision when the optimal result is achieved.
In the graph-meaning network aggregation process, the front end of the graph already obtains structural information, and the introduction of the graph-isomorphic network brings a lead at the front end compared with the original GATv 2.
Previous researchers have recommended weighted sum pooling in molecular property prediction tasks, which is similar to average pooling, and aims at low frequency information of the whole molecule, but such pooling operation has the possibility of losing part of small-scale information, although good results are achieved. And in the embodiment, the molecular structure information is better processed through the realization of the pooling layer.
Example two
In order to execute the corresponding method of the above embodiment to achieve the corresponding functions and technical effects, a deep learning-based system for predicting acute toxicity of unknown substances is provided below.
As shown in fig. 2, the system for predicting acute toxicity of unknown substances based on deep learning provided in this embodiment includes:
the target area parameter obtaining module 201 is configured to obtain a canola code of a chemical in the target area, and encode the canola code of the chemical in the target area into a DGL chart of the chemical in the target area.
The chemical acute toxicity prediction module 202 is configured to input the DGL diagram of the chemical in the target area into the unknown acute toxicity prediction model, and obtain the type of the chemical in the target area and the acute toxicity degree of each chemical.
The unknown substance acute toxicity prediction model is determined according to an improved graph annotation force network model; the improved graph meaning network model comprises an input layer, a feature extraction layer, an implicit layer, a pooling layer and a linear classification layer; the attention mechanism of the hidden layer is an attention mechanism of SuperGAT based on graph node-oriented classification tasks.
Further, the feature extraction layer is a graph isomorphic network; the hidden layer comprises two layers of improved graph attention network; the improved attention network is obtained by modifying the attention mechanism of the original attention network into an improved attention mechanism on the basis of the original attention network; the original image attention network is an image attention network with a version of GATv 2; the improved attention mechanism is obtained by fusing the dot product attention function and the spliced attention function; the pooling layer comprises a maximum pooling method and a weighted sum pooling method.
Example III
The embodiment of the invention provides an electronic device which comprises a memory and a processor, wherein the memory is used for storing a computer program, and the processor runs the computer program to enable the electronic device to execute the unknown substance acute toxicity prediction method based on deep learning.
Alternatively, the electronic device may be a server.
In addition, an embodiment of the present invention further provides a computer readable storage medium storing a computer program, where the computer program when executed by a processor implements a method for predicting acute toxicity of an unknown substance based on deep learning according to the first embodiment.
The invention provides a method, a system and equipment for predicting the acute toxicity of unknown substances based on deep learning, which are based on an updated version GATv2 of a graph attention network, and provide a new thought for predicting the acute toxicity of molecules by improving the attention mechanism and the network composition structure of the graph attention network and improving the feature extraction capacity of the graph attention network, so that the prediction accuracy is improved on a toxicity data set.
In the present specification, each embodiment is described in a progressive manner, and each embodiment is mainly described in a different point from other embodiments, and identical and similar parts between the embodiments are all enough to refer to each other. For the system disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and the relevant points refer to the description of the method section.
The principles and embodiments of the present invention have been described herein with reference to specific examples, the description of which is intended only to assist in understanding the methods of the present invention and the core ideas thereof; also, it is within the scope of the present invention to be modified by those of ordinary skill in the art in light of the present teachings. In view of the foregoing, this description should not be construed as limiting the invention.
Claims (10)
1. A method for predicting acute toxicity of an unknown substance based on deep learning, comprising the steps of:
acquiring a CANONICAL SMILES code of the chemical substances in the target area, and encoding the CANONICAL SMILES code of the chemical substances in the target area into a DGL chart of the chemical substances in the target area;
inputting the DGL graph of the chemical substances in the target area into an unknown substance acute toxicity prediction model to obtain the types of the chemical substances in the target area and the acute toxicity degree of each chemical substance;
the unknown substance acute toxicity prediction model is determined according to an improved graph annotation force network model; the improved graph meaning network model comprises an input layer, a feature extraction layer, an implicit layer, a pooling layer and a linear classification layer; the attention mechanism of the hidden layer is an attention mechanism of SuperGAT based on graph node-oriented classification tasks.
2. The method for predicting the acute toxicity of the unknown substance based on deep learning according to claim 1, wherein the determining process of the model for predicting the acute toxicity of the unknown substance is as follows:
constructing a sample data set; the sample data set includes a plurality of sample data; the sample data comprises input data and label data corresponding to the input data; the input data is a DGL graph of chemical substances in the sample area, and the label data is the types of the chemical substances in the sample area and the acute toxicity degree of each chemical substance;
constructing an improved graph annotation force network model;
training an improved graph attention network model by adopting sample data, and adjusting network parameters of the improved graph attention network model by adopting a back propagation iteration mode so that the loss value of the trained improved graph attention network model is smaller than a set threshold value to obtain a trained improved graph attention network model; the trained improved graph annotation force network model is an unknown substance acute toxicity prediction model.
3. The method for predicting the acute toxicity of an unknown substance based on deep learning according to claim 2, wherein the constructing a sample data set specifically comprises:
acquiring CANONICAL SMILES codes of chemical substances in a plurality of sample areas, and corresponding chemical substance types and acute toxicity degrees of the chemical substances;
encoding the canola signature for each chemical in the sample area into a DGL plot for the chemical in the sample area;
a sample data set is constructed from DGL maps of chemicals within a plurality of sample regions, and corresponding chemical species and chemical acute toxicity levels.
4. The method for predicting the acute toxicity of an unknown substance based on deep learning according to claim 1 or 2, wherein the feature extraction layer is a graph isomorphic network.
5. A method for predicting acute toxicity of unknown substances based on deep learning as claimed in either of claims 1 or 2, wherein said hidden layer comprises two layers of improved graph attention network; the improved attention network is obtained by modifying the attention mechanism of the original attention network into an improved attention mechanism on the basis of the original attention network; the artwork attention network is an artwork attention network with version GATv 2.
6. The method for predicting acute toxicity of unknown substance based on deep learning of claim 5, wherein the improved attention mechanism is an attention mechanism obtained by fusing a dot product attention function and a spliced attention function.
7. A method for predicting acute toxicity of unknown substances based on deep learning as claimed in claim 1 or 2, wherein said pooling layer comprises a maximum pooling method and a weighted sum pooling method.
8. A deep learning-based unknown substance acute toxicity prediction system, comprising:
the target area parameter acquisition module is used for acquiring the CANONICAL SMILES codes of the chemical substances in the target area and encoding the CANONICAL SMILES codes of the chemical substances in the target area into a DGL chart of the chemical substances in the target area;
the chemical substance acute toxicity prediction module is used for inputting the DGL graph of the chemical substances in the target area into the unknown substance acute toxicity prediction model to obtain the types of the chemical substances in the target area and the acute toxicity degree of each chemical substance;
the unknown substance acute toxicity prediction model is determined according to an improved graph annotation force network model; the improved graph meaning network model comprises an input layer, a feature extraction layer, an implicit layer, a pooling layer and a linear classification layer; the attention mechanism of the hidden layer is an attention mechanism of SuperGAT based on graph node-oriented classification tasks.
9. The deep learning-based unknown substance acute toxicity prediction system according to claim 8, wherein the feature extraction layer is a graph isomorphic network;
the hidden layer comprises two layers of improved graph attention network; the improved attention network is obtained by modifying the attention mechanism of the original attention network into an improved attention mechanism on the basis of the original attention network; the original image attention network is an image attention network with a version of GATv 2; the improved attention mechanism is obtained by fusing the dot product attention function and the spliced attention function;
the pooling layer comprises a maximum pooling method and a weighted sum pooling method.
10. An electronic device comprising a memory for storing a computer program and a processor that runs the computer program to cause the electronic device to perform a deep learning based method of predicting acute toxicity of an unknown substance according to any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311034419.4A CN117095767B (en) | 2023-08-16 | 2023-08-16 | Unknown substance acute toxicity prediction method, system and equipment based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311034419.4A CN117095767B (en) | 2023-08-16 | 2023-08-16 | Unknown substance acute toxicity prediction method, system and equipment based on deep learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117095767A true CN117095767A (en) | 2023-11-21 |
CN117095767B CN117095767B (en) | 2024-07-19 |
Family
ID=88774556
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311034419.4A Active CN117095767B (en) | 2023-08-16 | 2023-08-16 | Unknown substance acute toxicity prediction method, system and equipment based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117095767B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113192571A (en) * | 2021-04-29 | 2021-07-30 | 南京邮电大学 | Small molecule drug hERG toxicity prediction method and device based on graph attention mechanism transfer learning |
CN114038517A (en) * | 2021-08-25 | 2022-02-11 | 暨南大学 | Self-supervision graph neural network pre-training method based on contrast learning |
CN115831260A (en) * | 2023-02-16 | 2023-03-21 | 天津大学 | Small sample molecular toxicity prediction method |
CN116403730A (en) * | 2023-04-04 | 2023-07-07 | 齐鲁工业大学(山东省科学院) | Medicine interaction prediction method and system based on graph neural network |
-
2023
- 2023-08-16 CN CN202311034419.4A patent/CN117095767B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113192571A (en) * | 2021-04-29 | 2021-07-30 | 南京邮电大学 | Small molecule drug hERG toxicity prediction method and device based on graph attention mechanism transfer learning |
CN114038517A (en) * | 2021-08-25 | 2022-02-11 | 暨南大学 | Self-supervision graph neural network pre-training method based on contrast learning |
CN115831260A (en) * | 2023-02-16 | 2023-03-21 | 天津大学 | Small sample molecular toxicity prediction method |
CN116403730A (en) * | 2023-04-04 | 2023-07-07 | 齐鲁工业大学(山东省科学院) | Medicine interaction prediction method and system based on graph neural network |
Non-Patent Citations (1)
Title |
---|
LEI XU ET AL.,: "Molecular Property Prediction by Combining LSTM and GAT", 《BIOMOLECULES》, vol. 13, no. 503, 6 March 2023 (2023-03-06), pages 1 - 15 * |
Also Published As
Publication number | Publication date |
---|---|
CN117095767B (en) | 2024-07-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kim et al. | Patent document clustering with deep embeddings | |
CN107622182B (en) | Method and system for predicting local structural features of protein | |
CN112364880B (en) | Omics data processing method, device, equipment and medium based on graph neural network | |
CN113705772A (en) | Model training method, device and equipment and readable storage medium | |
WO2022001333A1 (en) | Hyperbolic space representation and label text interaction-based fine-grained entity recognition method | |
Sinha et al. | Audio classification using braided convolutional neural networks | |
Pietsch | Big data | |
CN111640471A (en) | Method and system for predicting activity of drug micromolecules based on two-way long-short memory model | |
CN113571125A (en) | Drug target interaction prediction method based on multilayer network and graph coding | |
Chen et al. | Binarized neural architecture search for efficient object recognition | |
Soni et al. | Feature extraction through deepwalk on weighted graph | |
CN116386899A (en) | Graph learning-based medicine disease association relation prediction method and related equipment | |
CN117976035A (en) | Protein SNO site prediction method of feature fusion deep learning network | |
CN115995293A (en) | Circular RNA and disease association prediction method | |
CN118136112A (en) | Polypeptide hemolysis prediction method integrating multi-view feature deep ensemble learning | |
Azuaje et al. | Ontology-driven approaches to analyzing data in functional genomics | |
Al Nuaimi et al. | Online streaming feature selection with incremental feature grouping | |
CN115116557A (en) | Method and related device for predicting molecular label | |
CN116646001B (en) | Method for predicting drug target binding based on combined cross-domain attention model | |
Liu et al. | Protein–Protein Interaction Prediction via Structure‐Based Deep Learning | |
CN117095767B (en) | Unknown substance acute toxicity prediction method, system and equipment based on deep learning | |
CN115458046B (en) | Method for predicting drug target binding property based on parallel deep fine granularity model | |
CN117150041A (en) | Small sample knowledge graph completion method based on reinforcement learning | |
Wu et al. | Generating life course trajectory sequences with recurrent neural networks and application to early detection of social disadvantage | |
Cui et al. | Hyper‐Mol: Molecular Representation Learning via Fingerprint‐Based Hypergraph |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |