CN112466406A - Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation - Google Patents

Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation Download PDF

Info

Publication number
CN112466406A
CN112466406A CN202011325807.4A CN202011325807A CN112466406A CN 112466406 A CN112466406 A CN 112466406A CN 202011325807 A CN202011325807 A CN 202011325807A CN 112466406 A CN112466406 A CN 112466406A
Authority
CN
China
Prior art keywords
carcinogenicity
reactivity
index
predicting
cyclic organic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011325807.4A
Other languages
Chinese (zh)
Inventor
王晓华
边佳辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Botany of CAS
Beijing Forestry University
Original Assignee
Institute of Botany of CAS
Beijing Forestry University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Botany of CAS, Beijing Forestry University filed Critical Institute of Botany of CAS
Priority to CN202011325807.4A priority Critical patent/CN112466406A/en
Publication of CN112466406A publication Critical patent/CN112466406A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C10/00Computational theoretical chemistry, i.e. ICT specially adapted for theoretical aspects of quantum chemistry, molecular mechanics, molecular dynamics or the like
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/40Searching chemical structures or physicochemical data

Landscapes

  • Computing Systems (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for predicting the reactivity and carcinogenicity of a cyclic organic compound by quantum chemical calculation, which comprises the steps of firstly optimizing the energy and structure of various structures according to the molecular structures of several organic compounds in an IARC database, calculating the energy and wave functions of different electronic states on the basis, and establishing the quantitative index prediction of the reactivity and the carcinogenicity prediction of the cyclic organic compound by comparing the obtained parameters. The method is based on wave function analysis of a Concept Density Functional Theory (CDFT), and 5 optimal descriptors are screened out by calculating quantum chemical parameters such as a global index, a real space function, an atomic index and the like of a compound as prediction descriptors and combining an IARC database for classification. The model has definite application domain and good robustness and prediction capability. The prediction method can accurately and efficiently predict the toxicity and carcinogenicity of the compound, and provides an effective method for evaluating the health hazard of the organic compound.

Description

Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation
Technical Field
The invention relates to a method for predicting cyclic organic compounds, in particular to a method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemistry calculation.
Background
With the increasing progress of global development and the production and application of a large amount of chemicals, the kinds and amounts of chemicals discharged to the environment and food and drinking water are increasing, and safety evaluation and environmental evaluation of chemicals are becoming more and more important. Among them, the most important having the greatest impact on life and health are cyclic compounds and easily formed oxidized or nitrated polycyclic aromatic hydrocarbon derivatives, which tend to have greater toxicity and carcinogenicity. Therefore, the method has important theoretical and practical significance for risk assessment, management and control, life application and the like of dangerous substances by acquiring dangerous properties of cyclic organic substances and derivatives thereof, such as toxicity, DNA base binding, carcinogenicity and the like. The traditional toxicity risk assessment comprises the following four steps: hazard identification, dose-response assessment, exposure assessment, and risk characterization. However, in actual research analysis, these data are acquired experimentally, and the workload is enormous in the face of the large number of organic compounds that are already present and are about to be put into use. This results in inadequate measured toxicity data and inconsistent toxicity test receptors, which are not effective in evaluating toxicity and carcinogenicity.
Disclosure of Invention
The invention aims to provide a method for predicting the reactivity and the carcinogenicity of a cyclic organic compound by quantum chemical calculation, which determines the optimal prediction index parameters of the reactivity and the carcinogenicity of the cyclic compound by quantum chemical calculation and wave function analysis. The problems set forth in the background art described above can be solved by quantitative structure-reactivity-related studies of cyclic compounds and predicting their carcinogenicity.
In order to achieve the purpose, the invention provides the following technical scheme:
the method for predicting the reactivity and carcinogenicity of the cyclic organic compounds by quantum chemistry calculation comprises the following steps:
step 1: obtaining the data that the carcinogenicity of 6 cyclic organic matters is negative or positive through related toxicity tests or the existing database and literature;
step 2: constructing the molecular structure of the cyclic compound by using ChemDraw chemical software, and performing structure optimization on the cyclic compound by using a B3LYP method of not less than DFT and 6-311G basis group by using quantum chemical software Gaussian or ORCA;
and step 3: taking the optimized structure file, respectively manufacturing quantitative software Gaussian or ORCA input files corresponding to different charged states for neutral N, molecules with 1 electron N +1 and 3 states of losing one electron N-1;
and 4, step 4: calculating single-point energy of molecules at a calculation level not lower than B3LYP/6-311G to obtain quantum chemical parameters and corresponding wfn file containing wave function information;
and 5: by utilizing a CDFT module of wave function analysis software Multiwfn, various CDFT indexes are obtained by reading energy information and wave function information in wfn files and calculating Hirshfeld charges;
step 6: further calculating the FOWEL function isosurface of molecules in different charged states by a Fukui function calculation module of Multiwfn, and deriving corresponding isosurface maps of electrophilic reaction, nucleophilic reaction, free radical reaction and double descriptors;
and 7: through investigating the correlation between different carcinogenicity values and CDFT indexes and different descriptors, the optimal prediction index parameters of the reactivity and the carcinogenicity of the cyclic compound are determined and used for predicting the related reactivity and the carcinogenicity of the same type of organic matters which are not determined through experiments.
Further, in the step 2, a molecular mechanics method is adopted, the established geometric configuration is preliminarily optimized under an MM2 force field, or the structure is directly optimized through a semi-empirical PM6 quantum chemistry method, so that a stable configuration with the lowest energy is obtained.
Further, step 3, an optimized molecular structure is taken, an input file of quantum chemical software Gaussian or ORCA is constructed, and the structure of the cyclic compound is optimized by a B3LYP method not lower than DFT and 6-311G-base group, so that quantum chemical parameters and check point files are obtained.
Further, the CDFT index in step 5 includes a global index, a real space function, an atomic index, a fujing function, a dual descriptor, a relative electrophilic index, and a relative nucleophilic index.
Compared with the prior art, the invention has the beneficial effects that:
1) the method realizes the prediction of two aspects of reactivity and carcinogenicity of the cyclic compound, is also applicable to other organic compounds, and is more beneficial to popularization and application.
2) By applying quantum chemical calculation and wave function analysis and by considering the quantum chemical parameters such as the global index, the wave function and the Fujing function as prediction descriptors, the optimal prediction index is established, the prediction accuracy is improved, and the prediction capability is improved.
3) By inspecting the size distribution of the Fullwell function of each atom in the cyclic organic matter, the corresponding reactivity is predicted, and the electrophilic reaction, the nucleophilic reaction and the nucleophilic radical reaction of the cyclic organic matter and DNA base are more intuitively and clearly predicted through the coverage degree of the isosurface of the Fullwell function.
4) According to the real space function defined by the density functional theory, the characteristic values of the double descriptors simultaneously display the electrophilic reaction sites and the nucleophilic reaction sites, and different welfare function characteristic values do not need to be considered respectively. By deriving and directly examining the isosurface map of the dual descriptors, the addition reaction of the circular organic matter and DNA base and the gene mutation carcinogenesis caused by the addition reaction can be correctly predicted.
Drawings
FIG. 1 is an iso-surface diagram of the active site of electrophilic or nucleophilic reactions upon covalent addition of 6 cyclic compounds (benzopyrene A-B, benzidine C-D, ethidium bromide E-F, arecolin G-H, aristolochic acid I-J, caffeine K-L) to DNA bases for use in the present invention;
FIG. 2 is a two-descriptor iso-surface diagram of the covalent addition of 6 cyclic compounds (benzopyrene A, benzidine B, ethidium bromide C, arecolin D, aristolochic acid E, caffeine F) used in the present invention to DNA bases.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The method for predicting the reactivity and carcinogenicity of the cyclic organic compounds by quantum chemistry calculation comprises the following steps:
step 1: selecting 6 cyclic compounds with different structures as investigation objects, and modeling various molecules through ChemDraw or online modeling software;
step 2: the established geometric configuration is preliminarily optimized under an MM2 force field by adopting a molecular mechanics method, or the structure is directly optimized by a semi-empirical PM6 quantum chemical method so as to obtain a stable configuration with the lowest energy;
and step 3: taking the optimized molecular structure, constructing an input file of quantum chemical software Gaussian or ORCA, and performing structural optimization on the cyclic compound by using a B3LYP method not lower than DFT and 6-311G basis group to obtain quantum chemical parameters and check point files;
and 4, step 4: taking the optimized structure file, and respectively manufacturing input files of quantization software Gaussian or ORCA (object oriented language) corresponding to different charged states aiming at neutral N, molecules with 1 electron N +1 and 3 states of losing one electron N-1;
and 5: calculating single-point energy of molecules in different charged states under the calculation level not lower than B3LYP/6-311G to obtain quantum chemical parameters and corresponding wfn file containing wave function information;
step 6: by utilizing a CDFT module of wave function analysis software Multiwfn, obtaining quantitative indexes such as a global index, a real space function, an atomic index, a welfare function, a double descriptor, a relative electrophilic index, a relative nucleophilic index and the like by reading energy information and wave function information in wfn files and calculating Hirshfeld charges;
and 7: further, by means of a Fukui function calculation module of Multiwfn, the well function isosurface of the molecules in different charged states is calculated, and corresponding electrophilic, nucleophilic, radical reaction and double-descriptor isosurface maps are derived.
The various output quantities of the CDFT of different cyclic organic compounds are compared with the carcinogenicity reported by the International agency for research on cancer (IARC), and the prediction results are scored. Due to the fact that the actual situation is complex, the score is adjusted according to the reasonable degree of the prediction result. Table 1 gives the partial CDFT parameters and the score for the predicted carcinogenicity for 6 cyclic compounds of different carcinogenic degrees. As can be seen from table 1, the CDFT parameters obtained by partial calculation have a large correlation with the carcinogenicity of the literature sources, especially the polycyclic compounds have the highest hardness and nucleophilic index scores and the moderate softness and electrophilic index scores, which indicates that the first two descriptors can be used as two indicators for predicting carcinogenicity.
TABLE 1 comparison and scoring of CDFT quantification parameters and carcinogenicity prediction results for cyclic compounds of different carcinogenic degrees
Figure BDA0002794242530000051
The unit of the CDFT index is eV, except for softness.bThe scores (good, medium and bad) are adjusted according to the reasonable degree of the prediction result.
A more intuitive and convenient way to look at the higher scoring descriptors in (8) is to derive and observe their iso-surface maps. By means of a Fukui function calculation module of Multiwfn, the well function isosurface of molecules with different charge states is calculated, and corresponding electrophilic and nucleophilic reactions (figure 1) and isosurface maps of double descriptors (figure 2) are derived, wherein the green isosurface in figure 1 represents a high-activity reaction site, and consistent with the literature report, nucleophilic and electrophilic reaction sites are simultaneously displayed in a double-descriptor isosurface mode in figure 2 without the need of respectively inspecting the distribution states of 2 well functions, so that the method is more convenient. By calculating important concepts in the conceptual density functional theory framework, the well function and the dual descriptors, the sites where the circular organics react with DNA bases can be predicted.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art should be able to cover the technical solutions and the inventive concepts of the present invention within the technical scope of the present invention.

Claims (4)

1. The method for predicting the reactivity and carcinogenicity of the cyclic organic compounds by quantum chemical calculation is characterized by comprising the following steps of:
step 1: obtaining the data that the carcinogenicity of 6 cyclic organic matters is negative or positive through related toxicity tests or the existing database and literature;
step 2: constructing the molecular structure of the cyclic compound by using ChemDraw chemical software, and performing structure optimization on the cyclic compound by using a B3LYP method of not less than DFT and 6-311G basis group by using quantum chemical software Gaussian or ORCA;
and step 3: taking the optimized structure file, respectively manufacturing quantitative software Gaussian or ORCA input files corresponding to different charged states for neutral N, molecules with 1 electron N +1 and 3 states of losing one electron N-1;
and 4, step 4: calculating single-point energy of molecules at a calculation level not lower than B3LYP/6-311G to obtain quantum chemical parameters and corresponding wfn file containing wave function information;
and 5: by utilizing a CDFT module of wave function analysis software Multiwfn, various CDFT indexes are obtained by reading energy information and wave function information in wfn files and calculating Hirshfeld charges;
step 6: further calculating the FOWEL function isosurface of molecules in different charged states by a Fukui function calculation module of Multiwfn, and deriving corresponding isosurface maps of electrophilic reaction, nucleophilic reaction, free radical reaction and double descriptors;
and 7: through investigating the correlation between different carcinogenicity values and CDFT indexes and different descriptors, the optimal prediction index parameters of the reactivity and the carcinogenicity of the cyclic compound are determined and used for predicting the related reactivity and the carcinogenicity of the same type of organic matters which are not determined through experiments.
2. The method for predicting the reactivity and carcinogenicity of cyclic organic compounds through quantum chemistry calculation as claimed in claim 1, wherein in the step 2, a molecular mechanics method is adopted, and the established geometric configuration is preliminarily optimized under an MM2 force field, or the structure is directly optimized through a semi-empirical PM6 quantum chemistry method, so that a stable configuration with the lowest energy is obtained.
3. The method for predicting the reactivity and carcinogenicity of cyclic organic compounds by quantum chemistry calculation according to claim 1, wherein the optimized molecular structure is taken in step 3, an input file of quantum chemistry software Gaussian or ORCA is constructed, and the cyclic compound is structurally optimized by a B3LYP method not lower than DFT and a 6-311G basis group to obtain quantum chemistry parameters and a check point file.
4. The method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation according to claim 1, wherein the CDFT index in step 5 comprises a global index, a real space function, an atomic index, a foell function, a dual descriptor, a relative electrophilic index and a relative nucleophilic index.
CN202011325807.4A 2020-11-23 2020-11-23 Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation Pending CN112466406A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011325807.4A CN112466406A (en) 2020-11-23 2020-11-23 Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011325807.4A CN112466406A (en) 2020-11-23 2020-11-23 Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation

Publications (1)

Publication Number Publication Date
CN112466406A true CN112466406A (en) 2021-03-09

Family

ID=74798304

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011325807.4A Pending CN112466406A (en) 2020-11-23 2020-11-23 Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation

Country Status (1)

Country Link
CN (1) CN112466406A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646180A (en) * 2013-12-19 2014-03-19 山东大学 Method for forecasting acute toxicity of organic compounds by building quantitative structure-activity relationship model with quantum chemistry method
CN104995625A (en) * 2012-12-18 2015-10-21 弗·哈夫曼-拉罗切有限公司 Prediction of molecular bioactivation
CN105868540A (en) * 2016-03-25 2016-08-17 哈尔滨理工大学 A polycyclic aromatic hydrocarbon property/toxicity prediction method using an intelligent support vector machine
CN106198847A (en) * 2016-06-24 2016-12-07 重庆医科大学 Evaluation methodology about anabasine insecticide hydrolysis reaction activity
CN110321608A (en) * 2019-06-21 2019-10-11 三峡大学 A kind of method of insulating gas molecular chemistry stability under quantitative analysis external electric field

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104995625A (en) * 2012-12-18 2015-10-21 弗·哈夫曼-拉罗切有限公司 Prediction of molecular bioactivation
CN103646180A (en) * 2013-12-19 2014-03-19 山东大学 Method for forecasting acute toxicity of organic compounds by building quantitative structure-activity relationship model with quantum chemistry method
CN105868540A (en) * 2016-03-25 2016-08-17 哈尔滨理工大学 A polycyclic aromatic hydrocarbon property/toxicity prediction method using an intelligent support vector machine
CN106198847A (en) * 2016-06-24 2016-12-07 重庆医科大学 Evaluation methodology about anabasine insecticide hydrolysis reaction activity
CN110321608A (en) * 2019-06-21 2019-10-11 三峡大学 A kind of method of insulating gas molecular chemistry stability under quantitative analysis external electric field

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
曹静思 等: "预测亲核反应位点方法的比较", 《中国科学》, vol. 45, no. 12, 23 October 2015 (2015-10-23), pages 1281 - 1290 *

Similar Documents

Publication Publication Date Title
Lewis et al. Similarity measures for rational set selection and analysis of combinatorial libraries: the diverse property-derived (DPD) approach
Benfenati et al. Computational predictive programs (expert systems) in toxicology
Jurman et al. Canberra distance on ranked lists
Gramatica et al. QSARINS: A new software for the development, analysis, and validation of QSAR MLR models
Kazius et al. Substructure mining using elaborate chemical representation
Wong et al. Application of interval clustering approach to water quality evaluation
Carrera et al. The Solubility of Gases in Ionic Liquids: A Chemoinformatic Predictive and Interpretable Approach
Lepak et al. Where do qualitative assessments fit in an era of increasingly quantitative monitoring? Perspectives from Interpreting Indicators of Rangeland Health
CN112466406A (en) Method for predicting reactivity and carcinogenicity of cyclic organic compounds by quantum chemical calculation
Lombardo et al. Development of new QSAR models for water, sediment, and soil half-life
CN104573863A (en) Method for predicting organic compound and hydroxyl radical reaction rate constant in water phase
CN105844081A (en) Wastewater treatment effect quantification method and apparatus
Zhang et al. Bioavailability (BA)-based risk assessment of soil heavy metals in provinces of China through the predictive BA-models
Maleki et al. Comparison of QSAR models based on combinations of genetic algorithm, stepwise multiple linear regression, and artificial neural network methods to predict K d of some derivatives of aromatic sulfonamides as carbonic anhydrase II inhibitors
Jurs et al. Computer-assisted studies of molecular structure and carcinogenic activity
Moorthy et al. The min-max test: an objective method for discriminating mass spectra
CN111768813A (en) Method for predicting organic PDMS membrane-water distribution coefficient based on SW-SVM algorithm quantitative structure-activity relationship model
CN113611363B (en) Method for identifying cancer driving gene by using consensus prediction result
CN112184495A (en) Low-efficiency land stock monitoring system and analysis platform applying same
CN113779888B (en) Ground subsidence risk assessment method, device, equipment and storage medium
Ding et al. Use of resampling method to construct variance index and repeatability limit of damage characteristic curve
ShahrjooiHaghighi et al. Ensemble feature selection for biomarker discovery in mass spectrometry-based metabolomics
Yonchev et al. Computational assessment of chemical saturation of analogue series under varying conditions
Ojha et al. First report on exploring structural requirements of 1, 2, 3, 4-tetrahydroacridin-9 (10H)-one analogs as antimalarials using multiple QSAR approaches: descriptor-based QSAR, CoMFA-CoMSIA 3DQSAR, HQSAR and G-QSAR approaches
Mitrofanov et al. Simple automatized tool for exchange–correlation functional fitting

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination