WO2023102870A1 - Method and apparatus for computing free energy of molecule, and stability analysis method and apparatus, and device and storage medium - Google Patents

Method and apparatus for computing free energy of molecule, and stability analysis method and apparatus, and device and storage medium Download PDF

Info

Publication number
WO2023102870A1
WO2023102870A1 PCT/CN2021/136976 CN2021136976W WO2023102870A1 WO 2023102870 A1 WO2023102870 A1 WO 2023102870A1 CN 2021136976 W CN2021136976 W CN 2021136976W WO 2023102870 A1 WO2023102870 A1 WO 2023102870A1
Authority
WO
WIPO (PCT)
Prior art keywords
molecule
crystal form
energy
measured
free energy
Prior art date
Application number
PCT/CN2021/136976
Other languages
French (fr)
Chinese (zh)
Inventor
周天
彭春望
方利文
孙广旭
Original Assignee
上海智药科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海智药科技有限公司 filed Critical 上海智药科技有限公司
Priority to PCT/CN2021/136976 priority Critical patent/WO2023102870A1/en
Publication of WO2023102870A1 publication Critical patent/WO2023102870A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C10/00Computational theoretical chemistry, i.e. ICT specially adapted for theoretical aspects of quantum chemistry, molecular mechanics, molecular dynamics or the like

Definitions

  • the present application relates to the technical field of molecular dynamics, in particular to a molecular free energy calculation and stability analysis method, device, equipment and storage medium.
  • Crystal Structure Predict is a very popular technology at present.
  • the virtual crystal structure ranking displayed by CSP can only show the energy distribution of the crystal at 0K, while the experimental crystal structure screening is usually at room temperature. Therefore, it is necessary to calculate the relative free energy ranking among different crystal forms at room temperature.
  • some methods that can predict the absolute free energy of crystal molecules more accurately include calculation methods such as harmonic approximation method, quasi-harmonic approximation method, and pseudo-supercritical path method, but these methods generally have some problems.
  • calculation methods such as harmonic approximation method, quasi-harmonic approximation method, and pseudo-supercritical path method, but these methods generally have some problems.
  • the cost of calculating the free energy by quantum chemical methods is high, or even impossible to calculate, and the packing mode of the crystal and the conformation of the molecules will hardly be changed during the calculation process.
  • the research is unrealizable, and the simulated thermal expansion effect is isotropic, but for the usually anisotropic organic molecular crystals, the calculation results often have large errors.
  • the first aspect of the present application provides a molecular free energy calculation method, including:
  • the minimum free energy value in the free energy versus volume curve of the first crystal form of the molecule to be measured at each preset temperature is taken as the absolute free energy of the first crystal form of the molecule to be measured at the corresponding preset temperature. able.
  • the molecular dynamics method is used to minimize the energy of the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested, to obtain The structural information of the first crystal form of the molecule to be tested after energy minimization processing, including:
  • the energy minimization process is performed on the supercell to obtain the structural information of the supercell after the energy minimization process.
  • the molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, under the isothermal and isobaric ensemble calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured after energy minimization is in an equilibrium state at a preset temperature, and
  • the potential energy and volume of the first crystal form of the molecule to be measured include:
  • the isothermal and isobaric ensemble simulation is performed on the supercell after the energy minimization treatment, and the simulation trajectory of the supercell of the first crystal form of the molecule to be tested is obtained;
  • the supercell of the first crystal form of the measured molecule is subjected to energy minimization processing to obtain the molecular structure after energy minimization processing; wherein, the volume of the first crystal form of the molecule to be measured corresponding to the simulated trajectory frame is selected to be equal to the average volume
  • the simulated trajectory frame with the smallest error is the target simulated trajectory frame;
  • Mode analysis is performed on the molecular structure after the energy minimization treatment, and the vibration frequencies of all atoms in the first crystal form of the molecule to be measured are calculated in each coordinate direction.
  • the molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, under the isothermal and isobaric ensemble calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured after energy minimization is in an equilibrium state at a preset temperature, and
  • the potential energy and volume of the first crystal form of the molecule to be measured include:
  • the method further includes:
  • the method according to any one of claims 1-4 is used respectively to obtain the absolute free energy corresponding to the different crystal forms of the molecule to be tested at different preset temperatures.
  • the second aspect of the present application provides a molecular stability analysis method, the method comprising:
  • the most stable crystal form at a specific temperature is determined, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
  • the third aspect of the present application provides a molecular free energy calculation device, including:
  • the energy minimization module is used to perform energy minimization processing on the first crystal form of the molecule to be detected based on the force field data of the molecule to be detected by using a molecular dynamics method, so as to obtain the first crystal form of the molecule to be detected after energy minimization processing.
  • the balance simulation module is used to calculate the energy-minimized structure information of the first crystal form of the molecule to be tested by using molecular dynamics method under the isothermal and isobaric ensemble.
  • the curve calculation module is used to determine the first crystal form of the molecule to be tested based on the vibration frequency, the potential energy and the volume of the first crystal form of the molecule to be tested by using a preset free energy calculation formula. Set the curve of free energy versus volume at temperature;
  • the free energy calculation module is used to use the minimum free energy value in the free energy versus volume curve of the first crystal form of the molecule to be measured at each preset temperature as the value of the molecule to be measured at the corresponding preset temperature Absolute free energy of the first crystal form.
  • the fourth aspect of the present application provides a molecular stability analysis device, the device comprising:
  • the absolute free energy calculation unit is used in the method in the foregoing embodiment to obtain the absolute free energy of different crystal forms of the molecule to be measured at different preset temperatures;
  • the relative free energy calculation unit is used to use the absolute free energy of any crystal form at different preset temperatures as a reference, and subtract the reference from the absolute free energy of other crystal forms at the same preset temperature to obtain each crystal
  • the stable crystal form determination unit is used to determine the most stable crystal form at a specific temperature based on the relative free energy values of each crystal form at different preset temperatures, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
  • the fifth aspect of the present application provides an electronic device, including:
  • a memory on which executable codes are stored, which, when executed by the processor, cause the processor to perform the method as described above.
  • the sixth aspect of the present application provides a storage medium, on which executable code is stored, and when the executable code is executed by a processor of an electronic device, the processor is made to execute the above-mentioned method.
  • the technical solution provided by the application may include the following beneficial effects: the application obtains the force field data of the molecule to be tested, and performs energy minimization processing on the first crystal form of the molecule to be tested, and then calculates the When the first crystal form of the molecule to be measured is in an equilibrium state at a preset temperature, the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction, and the potential energy of the first crystal form of the molecule to be measured and volume, based on the vibration frequency, the potential energy and volume of the first crystalline form of the analyte molecule, using a preset free energy calculation formula to determine the first crystal form of the analyte molecule at multiple preset temperatures
  • the change curve of free energy with volume can accurately calculate the free energy of molecular crystals for molecular crystals with more atoms, more flexible angles and anisotropy, saving computing power and computing time.
  • Fig. 1 is a schematic flow chart of a molecular free energy calculation method shown in the embodiment of the present application
  • Fig. 2 is a schematic flow chart of a cell expansion method shown in the embodiment of the present application.
  • Fig. 3 is a schematic flow chart of a vibration frequency calculation method shown in an embodiment of the present application.
  • Fig. 4 is a schematic flow chart of a potential energy and volume calculation method shown in the embodiment of the present application.
  • FIG. 5 is a schematic flow diagram of a molecular stability analysis method shown in the embodiment of the present application.
  • Fig. 6 is a schematic diagram of a molecular structure provided by an embodiment of the present application.
  • Fig. 7 is a diagram of the root mean square difference of molecular displacement as a function of time provided by the embodiment of the present application.
  • Figure 8 is a diagram of the change in molecular volume with time provided by the embodiment of the present application.
  • Fig. 9 is a schematic diagram of an absolute free energy curve provided in an embodiment of the present application.
  • Figure 10 is a graph of the relative free energy of different crystal forms as a function of temperature provided by the examples of the present application.
  • Fig. 11 is a kind of relative free energy curve chart that adopts PCSP calculation provided by the embodiment of the present application.
  • Fig. 12 is a schematic structural diagram of a molecular free energy calculation device provided in an embodiment of the present application.
  • Figure 13 is a schematic diagram of a molecular stability analysis device adopted in the embodiment of the present application.
  • FIG. 14 is a schematic structural diagram of an electronic device shown in an embodiment of the present application.
  • first, second, third and so on may be used in this application to describe various information, such information should not be limited to these terms. These terms are only used to distinguish information of the same type from one another.
  • first information may also be called second information, and similarly, second information may also be called first information.
  • second information may also be called first information.
  • a feature defined as “first” and “second” may explicitly or implicitly include one or more of these features.
  • “plurality” means two or more, unless otherwise specifically defined.
  • HA Harmonic Approximation
  • QHA Quasi Harmonic Approximation
  • PSCP pseudo-supercritical path
  • these methods generally have some problems, such as for some organic molecular crystals with a large number of atoms, the cost of calculating free energy by quantum chemical methods is high, or even impossible to calculate, the calculation process
  • the packing mode and molecular conformation of the crystal will hardly be changed, but the study of polymorphism is impossible, and the simulated thermal expansion effect is isotropic, while the calculation of the usually anisotropic organic molecular crystal The results are often very inaccurate.
  • the embodiment of the present application provides a molecular free energy calculation method, by obtaining the force field data of the molecule to be tested, and performing energy minimization on the first crystal form of the molecule to be tested, and then under the isothermal and isobaric ensemble , calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured is in an equilibrium state at a preset temperature, and the first crystal form of the molecule to be measured.
  • the potential energy and volume of the type based on the vibration frequency, the potential energy and volume of the first crystal form of the molecule to be measured, using the preset free energy calculation formula to determine the first crystal form of the molecule to be measured
  • the free energy of molecular crystals can be accurately calculated for molecular crystals with more atoms, more flexible angles, and anisotropy, saving computing power and computing time.
  • Fig. 1 is a schematic flowchart of a method for calculating free energy of a molecular crystal shown in an embodiment of the present application.
  • the molecular crystal free energy calculation method includes:
  • Step S101 using a molecular dynamics method to perform energy minimization on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested, to obtain the energy-minimized first crystal form of the molecule to be tested. structural information.
  • the molecular dynamics method is a molecular simulation method, and the pre-set gromacs (molecular dynamics program package) program can be used.
  • the gromacs program is pre-set.
  • the gromacs program is generated through the force field data
  • the required input file, input the input file to the gromacs program can perform functions such as energy minimization processing and equilibrium state simulation of any crystal form of the molecule to be tested;
  • the molecule to be tested is a molecule that needs to calculate the free energy, and the molecule can have Multiple crystal forms, or only one crystal form; the free energy of the molecule in various crystal forms can be calculated by using the molecular free energy calculation method adopted in the embodiment of the present application.
  • the force field data refers to the data used to represent the force field parameters between each atom in the molecule to be measured; the force field data can be stored in the form of a force field file for subsequent access.
  • the initial coordinate data refers to the coordinate data of the structure of the molecule to be tested in a certain crystal form placed in a three-dimensional rectangular coordinate system.
  • the initial coordinate data is used to represent the coordinates of each atom in the molecule to be measured in a three-dimensional Cartesian coordinate system; the initial coordinate data can be stored in the form of a coordinate file, which is convenient for subsequent access.
  • the molecules to be measured are in different crystal type, the corresponding initial coordinate data are different.
  • the energy minimization process refers to repeated iterative adjustment of the position of atoms in the first crystal form of the molecule to be tested to reduce the total energy of the crystal form system, wherein, in the energy minimization process, preset There is an iteration stop criterion, and when the iteration stop criterion is reached during the iteration, the iteration is stopped, the total energy of the crystal system is reduced to an expected critical point, and the energy-minimized first crystal form of the molecule to be tested is obtained. structural information.
  • Step S102 using the molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, under the isothermal and isobaric ensemble, calculate the molecule to be tested after the energy minimization process
  • the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction, and the potential energy and volume.
  • the isothermal and isobaric ensemble is an ensemble with controllable temperature and controllable pressure.
  • energy and volume can be exchanged between systems, but The sum of energy and volume of each system in the ensemble is constant, and in the embodiment of the present application, the molecule to be measured can be used as a system in the isothermal and isobaric ensemble.
  • the molecules to be tested are simulated under the isothermal and isobaric ensemble, and the specified temperature is selected in the preset temperature range , at the specified temperature, simulate the molecule to be measured to an equilibrium state, determine the structure of the molecule to be measured in the equilibrium state, and calculate the vibration frequency of the molecule to be measured in each coordinate direction under the structure.
  • the vibration frequency of the molecule to be measured in each coordinate direction refers to the vibration frequency of each atom in the molecule to be measured in the three directions of x-axis, y-axis and z-axis respectively.
  • the potential energy and volume of the first crystal form of the molecule to be tested are calculated, wherein the potential energy refers to the first crystal form of the molecule to be tested
  • the molecular potential energy in the state of energy minimization can be calculated by molecular dynamics methods, such as gromacs; similarly, the volume of the first crystal form of the molecule to be tested can also be calculated by molecular dynamics methods, such as gromacs.
  • Step S103 based on the vibration frequency, the potential energy and volume of the first crystalline form of the analyte molecule, using a preset free energy calculation formula to determine the first crystal form of the analyte molecule at multiple preset temperatures The curve of free energy as a function of volume.
  • the calculated potential energy of the molecule to be measured is U
  • the volume is V
  • the vibration frequency is ⁇
  • the preset free energy formula can be as follows:
  • F(V,T) is the free energy of the molecule to be measured at the temperature T and volume V
  • U(V) is the potential energy of the molecule to be measured when the volume is V
  • N A is Avogadro's constant
  • P is the system Comprehensive pressure
  • k B is the Boltzmann constant
  • k( ⁇ ) is the frequency of atom k
  • G(T) is expressed as the final free energy at temperature T.
  • the above-mentioned formula can be used to solve the curve of the free energy of the molecule to be measured as a function of the volume at the temperature T.
  • Step S104 taking the minimum free energy value in the free energy versus volume curve of the first crystal form of the molecule to be tested at each preset temperature as the first crystal form of the molecule to be tested at the corresponding preset temperature of absolute free energy.
  • the minimum value of the free energy of the molecule to be measured at each temperature is selected, and these minimum values are the minimum values of the free energy of the molecule to be measured at the corresponding temperature.
  • the absolute free energy at different temperatures can be obtained by using this method.
  • the technical solution provided by the present application may include the following beneficial effects:
  • by acquiring the force field data of the molecule to be tested and the initial coordinate data of the first crystal form of the molecule to be tested, and performing energy analysis on the first crystal form of the molecule to be tested Minimize the processing, and then under the isothermal and isobaric ensemble, calculate the vibration of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured is in an equilibrium state at a preset temperature frequency, and the potential energy and volume of the first crystal form of the molecule to be measured, based on the vibration frequency, the potential energy and volume of the first crystal form of the molecule to be measured, using a preset free energy calculation formula to determine the
  • the curve of free energy versus volume of the first crystal form of molecules at multiple preset temperatures can accurately calculate the free energy of molecular crystals for molecular crystals with more atoms, more flexible angles and anisotropy.
  • the first crystal form of the molecule to be tested is analyzed based on the force field data of the molecule to be tested using a molecular dynamics method.
  • Energy minimization processing obtaining the structural information of the first crystal form of the molecule to be tested after energy minimization processing, including:
  • Step S201 based on the force field data of the molecule to be tested and the structure data of the first crystal form of the molecule to be tested, expand the unit cell of the first crystal form of the molecule to be tested to obtain a preset size supercell.
  • the structural data of the molecule to be measured is a file storing a specific molecular structure, such as a .cif file or a .res file, and the cell in the structure file is expanded to generate a super
  • the preset size of the supercell is a 3 ⁇ 2 ⁇ 5 supercell.
  • the specific size of the supercell can be set according to needs, which is not limited here.
  • Step S202 based on the force field data of the molecule to be tested, perform energy minimization processing on the supercell, and obtain structural information of the supercell after energy minimization processing.
  • the energy minimization process refers to repeated iterative adjustment of the position of atoms in the first crystal form of the molecule to be tested to reduce the total energy of the crystal form system, wherein, in the energy minimization process, preset There is an iteration stop criterion, when the iteration stop criterion is reached in the iteration, the iteration is stopped, the total energy of the crystal form system is reduced to an expected critical point, and the energy minimization treatment of the first crystal form of the molecule to be tested is obtained. Structural information of the supercell.
  • the pre-set gromacs program when the energy minimization process is performed on the first crystal form of the molecule to be tested, the pre-set gromacs program is used.
  • the gromacs program is pre-set, and the gromacs program is first generated from the force field data
  • the required input file input the input file to the gromacs program to perform energy minimization on the supercell of the first crystal form of the molecule to be tested, and obtain the supercell of the first crystal form of the molecule to be tested after energy minimization structural information.
  • the molecular structure is expanded and the energy minimization is performed on the expanded supercell to ensure the accuracy of the subsequent molecular free energy measurement.
  • molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, in Under the isothermal and isobaric ensemble, when the first crystal form of the molecule to be tested after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be tested at each coordinate
  • the vibration frequency of the direction, and the potential energy and volume of the first crystal form of the molecule to be measured include:
  • step S301 within a preset temperature range, an isothermal and isobaric ensemble simulation is performed on the energy-minimized supercell to obtain a simulated trajectory of the supercell of the first crystal form of the molecule to be tested.
  • the preset temperature range can be selected from a temperature range of 50K to 300K, and the temperature range It can be selected according to the actual situation, and this application does not make a limitation here.
  • NPT simulation is performed on the supercell.
  • the preset step size can be selected as 1 fs, and the simulation time is 7 ns each time.
  • the supercell is simulated with the above preset step size and simulation time, and the simulation result is obtained.
  • Step S302 selecting a plurality of temperature values in the preset temperature range, and selecting simulation trajectory frames respectively corresponding to the plurality of temperature values in the simulation trajectory of the supercell of the first crystal form of the molecule to be measured, performing isothermal and isobaric ensemble simulations on a plurality of the simulated trajectory frames respectively, to obtain volumes in equilibrium states of the first crystal form of the molecule to be measured at the plurality of temperature values respectively.
  • the trajectory frame refers to the frame used to represent the coordinates and speed of each atom in the molecule at a certain moment or temperature point in the simulated trajectory.
  • the selected temperature range multiple temperature values are selected, and in a specific implementation
  • the selected temperature values are 50K, 60K, 70K, 80K, etc.
  • the simulated trajectory frames corresponding to these temperature values can be obtained according to the preset step size, and then NPT simulation is performed on each simulated trajectory frame.
  • Select the preset step size as 1fs, and each simulation time as 4ns use the above preset step size and simulation time to simulate the supercell, and obtain the simulation results.
  • Step S303 calculating the average value of the volumes of the equilibrium state of the first crystal form of the molecule to be measured at the multiple temperature values, selecting a target simulation trajectory frame based on the average value, and calculating the target simulation trajectory frame
  • the supercell corresponding to the first crystal form of the molecule to be measured is subjected to energy minimization to obtain the molecular structure after the energy minimization process; wherein, the volume of the first crystal form of the molecule to be tested corresponding to the simulated trajectory frame is selected to match the The simulated trajectory frame with the smallest average volume error is the target simulated trajectory frame.
  • the average volume of the molecule to be measured is calculated for the volume of the molecule to be measured when it is stable at each temperature value, and the target simulation trajectory frame is obtained, wherein the target molecule to be measured corresponding to the simulation trajectory frame
  • the error between the volume of the molecule and the average volume is the smallest, and the average volume of the molecules to be measured can be accurately determined by using the average volume of the molecules to be measured. Simulation accuracy.
  • Step S304 performing mode analysis on the molecular structure after the energy minimization process, and calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction.
  • the first crystal form of the molecule to be tested is simulated under the isothermal and isobaric ensemble, and a specified temperature is selected in a preset temperature range. Under the temperature, simulate the first crystal form of the molecule to be measured to an equilibrium state, determine the structure of the first crystal form of the molecule to be measured in the equilibrium state, and calculate all the first crystal forms of the molecule to be measured under the structure The vibration frequency of the atom in each coordinate direction.
  • the vibration frequencies of the molecules to be measured in each coordinate direction refer to the vibration frequencies of each atom of the first crystal form of the molecules to be measured in the three directions of x-axis, y-axis and z-axis respectively.
  • molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, in Under the isothermal and isobaric ensemble, when the first crystal form of the molecule to be tested after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be tested at each coordinate
  • the vibration frequency of the direction, and the potential energy and volume of the first crystal form of the molecule to be measured include:
  • step S401 the energy minimization process is performed on the target simulated trajectory frame by using the steepest descent method and the conjugate gradient method in sequence.
  • the steepest descent method and the conjugate gradient method can be used in sequence, and the target simulation trajectory frame is used as input, and the steepest descent method and the conjugate gradient method are successively performed energy minimization.
  • Step S402 when the energy minimization process reaches the preset standard, calculate the potential energy and volume of the first crystalline form of the analyte molecule after the energy minimization process, and obtain the first crystalline form of the analyte molecule potential energy and volume.
  • the molecular dynamics method is adopted.
  • the energy minimization reaches the preset standard, such as gradient ⁇ F ⁇ 0.001
  • the method further includes:
  • the methods in the foregoing examples are used respectively to obtain the absolute free energy corresponding to different crystal forms of the molecule to be tested at different preset temperatures.
  • Step S402 select different preset temperatures, select the minimum value of free energy of the molecules to be measured, and obtain absolute free energy curves of the molecules to be measured at different preset temperatures.
  • the molecular free energy calculation method provided in any of the foregoing embodiments can be used to calculate the absolute free energy curves of the crystal forms at different temperatures.
  • the principle is the same as that of the foregoing The principles in the embodiments are similar and will not be repeated here.
  • the molecular free energy calculation method provided can aim at the free energy when the molecule to be tested is in different crystal forms. After obtaining the initial coordinate data of different crystal forms of the molecule to be tested, the method provided in the previous examples can be used The free energy calculation method of molecules is used to calculate the free energy variation curves with volume at different preset temperatures for different crystal forms of the molecules to be tested. The calculation principle is the same as that of the previous embodiment, and will not be repeated here.
  • the technical solution provided by the present application may include the following beneficial effects: the embodiment of the present application obtains the force field data of the molecule to be tested and the initial coordinate data of different crystal forms of the molecule to be tested, and performs energy minimization for the different crystal forms of the molecule to be tested respectively.
  • the embodiment of the present application provides a molecular stability analysis method, as shown in Figure 5, including:
  • Step S501 using the method in the foregoing embodiments, to obtain the absolute free energies of different crystal forms of the molecule to be measured at different preset temperatures.
  • the absolute free energies of different crystal forms of the molecule to be tested at different temperatures can be determined, and the principle will not be repeated here.
  • Step S502 taking the absolute free energy of any one of the crystal forms at different preset temperatures as a reference, and subtracting the reference from the absolute free energy of other crystal forms at the same preset temperature, to obtain the absolute free energy of each crystal form at different preset temperatures. Relative free energy values at temperature.
  • the absolute free energy of the crystal form at different preset temperatures is used as a reference, and the absolute free energy of other crystal forms at the temperature is subtracted With this reference, the relative free energy of each crystal form at each temperature can be obtained.
  • Step S503 based on the relative free energy values of each crystal form at different preset temperatures, determine the most stable crystal form at a specific temperature, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
  • the crystal form with the lowest relative free energy is the most stable crystal form of the molecule to be tested at that temperature.
  • Figure 6 is the structural formula of the new crown drug Favipiravir (Favipiravir) molecule, and obtain the molecular coordinate file of the molecule.
  • the file format can be It is a file in qbd format.
  • the yoda program Use the yoda program to capture the corresponding GAFF force field data, obtain a crystal structure cif file of a crystal form, expand the structure file to generate a 3x2x5 supercell, and read the force field data Generate the input file required by the gromacs (molecular dynamics method) program, call the gromacs program to minimize the energy of the supercell, and then call the gromacs program to perform NPT simulation on the energy-minimized supercell, and the temperature is raised from 50K to 300K , the step size is 1fs, the simulation is 7ns, and the simulation trajectory is obtained, and the trajectory frame of the time point corresponding to the specific temperature is extracted from the above simulation trajectory at an interval of 10K, and the NPT simulation is continued, the step size is 1fs, the simulation is 4ns, and finally all the temperatures are obtained
  • the equilibrium structure below, its RMSD (Root Mean Square Difference of Displacement) and the variation of its volume with the simulation time are shown in Figures
  • the volume frame is used as input, and the energy minimization of the steepest descent method and the conjugate gradient method are carried out successively to ensure that the optimization reaches the convergence standard (such as gradient ⁇ F ⁇ 0.001), and the potential energy (U) and the current volume (V) are calculated; call the gromacs program, Using the above energy-minimized structure as input, conduct mode analysis to calculate the vibration frequency ( ⁇ ); use the following formula to calculate the curve of free energy versus volume at different temperatures.
  • the convergence standard such as gradient ⁇ F ⁇ 0.001
  • U potential energy
  • V current volume
  • F(V,T) is the free energy of the molecule to be measured at the temperature T and volume V
  • U(V) is the potential energy of the molecule to be measured when the volume is V
  • N A is Avogadro's constant
  • P is the system Comprehensive pressure
  • k B is the Boltzmann constant
  • k( ⁇ ) is the frequency of atom k
  • G(T) is expressed as the final free energy at temperature T.
  • the absolute free energy ⁇ Gabs(T) at the temperature can be obtained, as shown in Figure 9.
  • the relationship between the relative free energy of different crystal forms with temperature can be calculated, as shown in Figure 10.
  • the relative free energy results calculated by PCSP are shown in Figure 11.
  • the relative free energy trends of different structures in the low temperature segment are basically the same, but due to the limitations of the PSCP method itself, the 50K-90K The calculation result is inaccurate (not shown in this application), and the force field QHA and PSCP methods are calculated at the same time.
  • the average core time ratio for each structure calculation is 663/2958, about 0.22. It shows that the molecular free energy calculation method provided in the examples of this application saves about 80% of the calculation cost while ensuring the correctness of the results.
  • the present application also provides a molecular free energy calculation device, electronic equipment, and corresponding embodiments.
  • Fig. 12 is a schematic structural diagram of a molecular free energy calculation device shown in an embodiment of the present application.
  • the molecular free energy calculation device 120 includes an energy minimization module 1210, a balance simulation module 1220, a curve calculation module 1230, and a free energy calculation module 1240, wherein:
  • the energy minimization module 1210 is configured to use a molecular dynamics method to perform energy minimization on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested, to obtain the energy minimization process of the molecule to be tested. Structural information of the first crystal form;
  • the balance simulation module 1220 is configured to use a molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, to calculate the energy minimization process under the isothermal and isobaric ensemble.
  • a molecular dynamics method based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, to calculate the energy minimization process under the isothermal and isobaric ensemble.
  • the curve calculation module 1230 is configured to determine the first crystal form of the molecule to be measured in multiple The curve of free energy versus volume at a preset temperature;
  • the free energy calculation module 1240 is configured to use the minimum free energy value in the curve of free energy versus volume of the first crystal form of the molecule to be measured at each preset temperature as the value of the molecule to be measured at the corresponding preset temperature The absolute free energy of the first crystal form of .
  • the energy minimization module 1210 performs energy minimization on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested by using a molecular dynamics method.
  • a molecular dynamics method When obtaining the structure information of the first crystal form of the molecule to be tested after energy minimization treatment, it can be used for:
  • the energy minimization process is performed on the supercell to obtain the structural information of the supercell after the energy minimization process.
  • the equilibrium simulation module 1220 adopts the molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, at an isothermal Under the isobaric ensemble, when the first crystal form of the molecule to be measured after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be measured in each coordinate direction
  • the vibration frequency, and the potential energy and volume of the first crystal form of the molecule to be tested can be used for:
  • the isothermal and isobaric ensemble simulation is performed on the supercell after the energy minimization treatment, and the simulation trajectory of the supercell of the first crystal form of the molecule to be tested is obtained;
  • the supercell of the first crystal form of the measured molecule is subjected to energy minimization processing to obtain the molecular structure after energy minimization processing; wherein, the volume of the first crystal form of the molecule to be measured corresponding to the simulated trajectory frame is selected to be equal to the average volume
  • the simulated trajectory frame with the smallest error is the target simulated trajectory frame;
  • Mode analysis is performed on the molecular structure after the energy minimization treatment, and the vibration frequencies of all atoms in the first crystal form of the molecule to be measured are calculated in each coordinate direction.
  • the equilibrium simulation module 1220 adopts the molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, at an isothermal Under the isobaric ensemble, when the first crystal form of the molecule to be measured after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be measured in each coordinate direction
  • the vibration frequency, and the potential energy and volume of the first crystal form of the molecule to be tested can be used for:
  • the molecular free energy calculation device can also be used for:
  • the above method is used respectively to obtain the absolute free energy corresponding to the different crystal forms of the molecule to be tested at different preset temperatures.
  • the technical solution provided by the present application may include the following beneficial effects:
  • the embodiment of the present application by acquiring the force field data of the molecule to be tested, and performing energy minimization on the first crystal form of the molecule to be tested, and then under the isothermal and isobaric ensemble, Calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured is in equilibrium at a preset temperature, and the first crystal form of the molecule to be measured potential energy and volume, based on the vibration frequency, the potential energy and volume of the first crystal form of the molecule to be measured, using the preset free energy calculation formula to determine the first crystal form of the molecule to be measured in multiple preset
  • the curve of free energy versus volume at temperature can accurately calculate the free energy of molecular crystals for molecular crystals with more atoms, more flexible angles, and anisotropy, saving computing power and computing time.
  • Fig. 13 is a schematic structural diagram of a molecular stability analysis device shown in an embodiment of the present application.
  • the molecular stability analysis device 120 includes an absolute free energy calculation unit 1310, a relative free energy calculation unit 1320, and a stable crystal form determination unit 1330, wherein:
  • the absolute free energy calculation unit 1310 is used to obtain the absolute free energy of different crystal forms of the molecule to be measured at different preset temperatures by using the above method;
  • the relative free energy calculation unit 1320 is used to use the absolute free energy of any crystal form at different preset temperatures as a reference, and subtract the reference from the absolute free energy of other crystal forms at the same preset temperature to obtain each The relative free energy value of the crystal form at different preset temperatures;
  • the stable crystal form determining unit 1330 is used to determine the most stable crystal form at a specific temperature based on the relative free energy values of each crystal form at different preset temperatures, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
  • the relative free energy of different crystal forms of the same molecule is generally calculated, and the crystal form with the lowest relative free energy at the same temperature is regarded as the most stable crystal form of the molecule at this temperature, so that the molecule can be accurately analyzed. steady state at different temperatures.
  • FIG. 14 is a schematic structural diagram of an electronic device shown in an embodiment of the present application.
  • an electronic device 1000 includes a memory 1010 and a processor 1020 .
  • the processor 1020 can be a central processing unit (Central Processing Unit, CPU), and can also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), on-site Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • a general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like.
  • the memory 1010 may include various types of storage units such as system memory, read only memory (ROM), and persistent storage.
  • the ROM may store static data or instructions required by the processor 1020 or other modules of the computer.
  • the persistent storage device may be a readable and writable storage device.
  • Persistent storage may be a non-volatile storage device that does not lose stored instructions and data even if the computer is powered off.
  • the permanent storage device adopts a mass storage device (such as a magnetic or optical disk, flash memory) as the permanent storage device.
  • the permanent storage device may be a removable storage device (such as a floppy disk, an optical drive).
  • System memory can be a readable and writable storage device or a volatile readable and writable storage device, such as dynamic random access memory.
  • System memory can store some or all of the instructions and data that the processor needs at runtime.
  • the memory 1010 may include any combination of computer-readable storage media, including various types of semiconductor memory chips (such as DRAM, SRAM, SDRAM, flash memory, programmable read-only memory), and magnetic disks and/or optical disks may also be used.
  • memory 1010 may include a readable and/or writable removable storage device, such as a compact disc (CD), a read-only digital versatile disc (e.g., DVD-ROM, dual-layer DVD-ROM), Read-only Blu-ray Disc, Super Density Disc, Flash memory card (such as SD card, min SD card, Micro-SD card, etc.), magnetic floppy disk, etc.
  • a readable and/or writable removable storage device such as a compact disc (CD), a read-only digital versatile disc (e.g., DVD-ROM, dual-layer DVD-ROM), Read-only Blu-ray Disc, Super Density Disc, Flash memory card (such as SD card, min SD card, Micro-SD card, etc.), magnetic floppy disk, etc.
  • Computer-readable storage media do not contain carrier waves and transient electronic signals transmitted by wireless or wire.
  • Executable codes are stored in the memory 1010 , and when the executable codes are processed by the processor 1020 , the processor 1020 may execute part or all of the methods mentioned above.
  • the method according to the present application can also be implemented as a computer program or computer program product, the computer program or computer program product including computer program code instructions for executing some or all of the steps in the above method of the present application.
  • the present application may also be implemented as a computer-readable storage medium (or a non-transitory machine-readable storage medium or a machine-readable storage medium), on which executable code (or computer program or computer instruction code) is stored,
  • executable code or computer program or computer instruction code
  • the processor of the electronic device or server, etc.
  • the processor is made to perform part or all of the steps of the above-mentioned method according to the present application.

Abstract

The present application relates to a method and apparatus for computing the free energy of a molecule, and a stability analysis method and apparatus, and an electronic device and a storage medium. The method for computing the free energy of a molecule comprises: performing an energy minimization treatment on a molecule under test; under an isothermal-isobaric ensemble, when said molecule is in a balanced state at a preset temperature, computing the vibration frequency of said molecule in each coordinate direction; computing the potential energy and volume of said molecule; and on the basis of the vibration frequencies, and the potential energy and volume of said molecule, determining a curve of the free energy of said molecule changing, at the preset temperature, along with the volume. By means of the embodiments of the present application, for anisotropic molecular crystals having more atoms and flexible angles, the free energy of the molecular crystals can be accurately computed, thereby saving on the computing power and the computing time.

Description

分子自由能计算、稳定性分析方法、装置、设备及存储介质Molecular free energy calculation, stability analysis method, device, equipment and storage medium 技术领域technical field
本申请涉及分子动力学技术领域,尤其涉及一种分子自由能计算、稳定性分析方法、装置、设备及存储介质。The present application relates to the technical field of molecular dynamics, in particular to a molecular free energy calculation and stability analysis method, device, equipment and storage medium.
背景技术Background technique
药物小分子晶型预测(Crystal Structure Predict,CSP)是当前非常热门的技术,CSP展示的虚拟晶型排位仅能表现晶体0K下的能量分布,而实验上的晶型筛选通常是在室温下进行的,因此计算室温下不同晶型之间的相对自由能排位十分必要。Crystal Structure Predict (CSP) is a very popular technology at present. The virtual crystal structure ranking displayed by CSP can only show the energy distribution of the crystal at 0K, while the experimental crystal structure screening is usually at room temperature. Therefore, it is necessary to calculate the relative free energy ranking among different crystal forms at room temperature.
在相关技术方案中,一些可以较为准确的预测晶体分子绝对自由能的方法中,包括简谐近似法、准简谐近似法、伪超临界路径法等计算方法,但是这些方法普遍存在一些问题,如对于一些原子数较多的有机分子晶体,通过量子化学方法计算自由能的成本较高,甚至无法计算,计算过程中几乎不会改变晶体的堆积模式和分子的构象,而对于多晶型的研究则是无法实现的,以及模拟热膨胀效应是各向同性的,而对于通常为各向异性的有机分子晶体其计算结果往往会有很大误差。In related technical solutions, some methods that can predict the absolute free energy of crystal molecules more accurately include calculation methods such as harmonic approximation method, quasi-harmonic approximation method, and pseudo-supercritical path method, but these methods generally have some problems. For example, for some organic molecular crystals with a large number of atoms, the cost of calculating the free energy by quantum chemical methods is high, or even impossible to calculate, and the packing mode of the crystal and the conformation of the molecules will hardly be changed during the calculation process. The research is unrealizable, and the simulated thermal expansion effect is isotropic, but for the usually anisotropic organic molecular crystals, the calculation results often have large errors.
技术问题technical problem
现有技术对药物小分子自由能计算的方法中,存在计算成本高或无法计算的情况,或者对各向异性的有机分子的自由能计算结果不准确。In the prior art methods for calculating the free energy of small drug molecules, the calculation cost is high or cannot be calculated, or the calculation results for the free energy of anisotropic organic molecules are inaccurate.
技术解决方案technical solution
本申请第一方面提供一种分子自由能计算方法,包括:The first aspect of the present application provides a molecular free energy calculation method, including:
采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息;Using a molecular dynamics method to perform energy minimization processing on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested, to obtain structural information of the first crystal form of the molecule to be tested after the energy minimization process;
采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一 晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积;Using a molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, under the isothermal and isobaric ensemble, calculate the first crystal form of the molecule to be tested after the energy minimization process When the crystal form is in an equilibrium state at a preset temperature, the vibration frequency of all atoms in the first crystal form of the molecule to be tested in each coordinate direction, and the potential energy and volume of the first crystal form of the molecule to be tested;
基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线;Based on the vibration frequency, the potential energy and the volume of the first crystalline form of the analyte molecule, using a preset free energy calculation formula to determine the free energy of the first crystalline form of the analyte molecule at multiple preset temperatures Variation curve with volume;
将所述待测分子的第一晶型在各个预设温度下的自由能随体积的变化曲线中的最小自由能值作为相应预设温度下所述待测分子的第一晶型的绝对自由能。The minimum free energy value in the free energy versus volume curve of the first crystal form of the molecule to be measured at each preset temperature is taken as the absolute free energy of the first crystal form of the molecule to be measured at the corresponding preset temperature. able.
作为本申请一种可能的实施方式,在该实施方式中,所述采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息,包括:As a possible implementation of the present application, in this implementation, the molecular dynamics method is used to minimize the energy of the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested, to obtain The structural information of the first crystal form of the molecule to be tested after energy minimization processing, including:
基于所述待测分子的力场数据和所述待测分子的第一晶型的结构数据,对所述待测分子的第一晶型的晶胞进行扩胞,得到预设尺寸的超胞;Expanding the unit cell of the first crystalline form of the analyte molecule based on the force field data of the analyte molecule and the structure data of the first crystalline form of the analyte molecule to obtain a supercell of a preset size ;
基于所述待测分子的力场数据,对所述超胞进行能量最小化处理,得到能量最小化处理后的超胞的结构信息。Based on the force field data of the molecules to be tested, the energy minimization process is performed on the supercell to obtain the structural information of the supercell after the energy minimization process.
作为本申请一种可能的实施方式,在该实施方式中,采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,包括:As a possible implementation of the present application, in this implementation, the molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, under the isothermal and isobaric ensemble calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured after energy minimization is in an equilibrium state at a preset temperature, and The potential energy and volume of the first crystal form of the molecule to be measured include:
在预设的温度区间内,对能量最小化处理后的超胞进行等温等压系综模拟,获得待测分子的第一晶型的超胞的模拟轨迹;In the preset temperature range, the isothermal and isobaric ensemble simulation is performed on the supercell after the energy minimization treatment, and the simulation trajectory of the supercell of the first crystal form of the molecule to be tested is obtained;
在所述预设的温度区间选取多个温度值,在所述待测分子的第一晶型的超胞的模拟轨迹中选取与所述多个温度值分别对应的模拟轨迹帧,分别对多个所述模拟轨迹帧进行等温等压系综模拟,得到所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积;Select a plurality of temperature values in the preset temperature interval, and select simulation trajectory frames corresponding to the plurality of temperature values in the simulation trajectory of the supercell of the first crystal form of the molecule to be measured, respectively for multiple performing an isothermal and isobaric ensemble simulation on each of the simulated trajectory frames to obtain the volumes of the equilibrium states of the first crystal form of the molecule to be measured at the multiple temperature values;
计算所述待测分子的第一晶型分别在所述多个温度值下的平衡状态 的体积的平均值,基于所述平均值选取目标模拟轨迹帧,对所述目标模拟轨迹帧对应的待测分子的第一晶型的超胞进行能量最小化处理,得到能量最小化处理后的分子结构;其中,选择模拟轨迹帧对应的待测分子的第一晶型的体积与所述平均体积的误差最小的模拟轨迹帧为目标模拟轨迹帧;Calculate the average value of the volume of the equilibrium state of the first crystal form of the molecule to be measured at the multiple temperature values, select a target simulation trajectory frame based on the average value, and select the target simulation trajectory frame corresponding to the target simulation trajectory frame. The supercell of the first crystal form of the measured molecule is subjected to energy minimization processing to obtain the molecular structure after energy minimization processing; wherein, the volume of the first crystal form of the molecule to be measured corresponding to the simulated trajectory frame is selected to be equal to the average volume The simulated trajectory frame with the smallest error is the target simulated trajectory frame;
对所述能量最小化处理后的分子结构进行模式分析,计算所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率。Mode analysis is performed on the molecular structure after the energy minimization treatment, and the vibration frequencies of all atoms in the first crystal form of the molecule to be measured are calculated in each coordinate direction.
作为本申请一种可能的实施方式,在该实施方式中,采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,包括:As a possible implementation of the present application, in this implementation, the molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, under the isothermal and isobaric ensemble calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured after energy minimization is in an equilibrium state at a preset temperature, and The potential energy and volume of the first crystal form of the molecule to be measured include:
依次采用最速下降法和共轭梯度法对所述目标模拟轨迹帧进行能量最小化处理;Using the steepest descent method and the conjugate gradient method in turn to perform energy minimization processing on the target simulation trajectory frame;
当所述能量最小化处理达到预设的标准时,计算进行能量最小化处理后的所述待测分子的第一晶型的势能和体积,得到所述待测分子的第一晶型的势能和体积。When the energy minimization process reaches the preset standard, calculate the potential energy and volume of the first crystal form of the molecule to be tested after the energy minimization process, and obtain the potential energy and volume of the first crystal form of the molecule to be tested volume.
作为本申请一种可能的实施方式,在该实施方式中,所述方法还包括:As a possible implementation of the present application, in this implementation, the method further includes:
针对所述待测分子的任一晶型,分别采用权利要求1-4任一所述的方法,得到在不同预设温度下,所述待测分子不同晶型对应的绝对自由能。For any crystal form of the molecule to be tested, the method according to any one of claims 1-4 is used respectively to obtain the absolute free energy corresponding to the different crystal forms of the molecule to be tested at different preset temperatures.
本申请第二方面提供了一种分子稳定性分析方法,该方法包括:The second aspect of the present application provides a molecular stability analysis method, the method comprising:
采用上一实施例中的方法,获得不同预设温度下所述待测分子的不同晶型的绝对自由能;Using the method in the previous embodiment, obtain the absolute free energy of different crystal forms of the molecule to be tested at different preset temperatures;
以其中任一晶型在不同预设温度下的绝对自由能为参比,将其他晶型在相同预设温度下的绝对自由能减去参比,得到各晶型在不同预设温度下的相对自由能值;Taking the absolute free energy of any crystal form at different preset temperatures as a reference, the absolute free energy of other crystal forms at the same preset temperature is subtracted from the reference to obtain the absolute free energy of each crystal form at different preset temperatures Relative free energy value;
基于各晶型在不同预设温度下的相对自由能值,确定特定温度下最稳定的晶型,以同一温度下相对自由能值最低对应的晶型为最稳定晶型。Based on the relative free energy values of each crystal form at different preset temperatures, the most stable crystal form at a specific temperature is determined, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
本申请第三方面提供一种分子自由能计算装置,包括:The third aspect of the present application provides a molecular free energy calculation device, including:
能量最小化模块,用于采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理 后的待测分子的第一晶型的结构信息;The energy minimization module is used to perform energy minimization processing on the first crystal form of the molecule to be detected based on the force field data of the molecule to be detected by using a molecular dynamics method, so as to obtain the first crystal form of the molecule to be detected after energy minimization processing. - Structural information of the crystal form;
平衡模拟模块,用于采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积;The balance simulation module is used to calculate the energy-minimized structure information of the first crystal form of the molecule to be tested by using molecular dynamics method under the isothermal and isobaric ensemble. When the first crystalline form of the analyte molecule is in equilibrium at a preset temperature, the vibration frequencies of all atoms in the first crystalline form of the analyte molecule in each coordinate direction, and the first crystalline form of the analyte molecule potential energy and volume;
曲线计算模块,用于基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线;The curve calculation module is used to determine the first crystal form of the molecule to be tested based on the vibration frequency, the potential energy and the volume of the first crystal form of the molecule to be tested by using a preset free energy calculation formula. Set the curve of free energy versus volume at temperature;
自由能计算模块,用于将所述待测分子的第一晶型在各个预设温度下的自由能随体积的变化曲线中的最小自由能值作为相应预设温度下所述待测分子的第一晶型的绝对自由能。The free energy calculation module is used to use the minimum free energy value in the free energy versus volume curve of the first crystal form of the molecule to be measured at each preset temperature as the value of the molecule to be measured at the corresponding preset temperature Absolute free energy of the first crystal form.
本申请第四方面提供了一种分子稳定性分析装置,该装置包括:The fourth aspect of the present application provides a molecular stability analysis device, the device comprising:
绝对自由能计算单元,用于前述实施例中的方法,获得不同预设温度下所述待测分子的不同晶型的绝对自由能;The absolute free energy calculation unit is used in the method in the foregoing embodiment to obtain the absolute free energy of different crystal forms of the molecule to be measured at different preset temperatures;
相对自由能计算单元,用于以其中任一晶型在不同预设温度下的绝对自由能为参比,将其他晶型在相同预设温度下的绝对自由能减去参比,得到各晶型在不同预设温度下的相对自由能值;The relative free energy calculation unit is used to use the absolute free energy of any crystal form at different preset temperatures as a reference, and subtract the reference from the absolute free energy of other crystal forms at the same preset temperature to obtain each crystal The relative free energy value of the type at different preset temperatures;
稳定晶型确定单元,用于基于各晶型在不同预设温度下的相对自由能值,确定特定温度下最稳定的晶型,以同一温度下相对自由能值最低对应的晶型为最稳定晶型。The stable crystal form determination unit is used to determine the most stable crystal form at a specific temperature based on the relative free energy values of each crystal form at different preset temperatures, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
本申请第五方面提供一种电子设备,包括:The fifth aspect of the present application provides an electronic device, including:
处理器;以及processor; and
存储器,其上存储有可执行代码,当所述可执行代码被所述处理器执行时,使所述处理器执行如上所述的方法。A memory, on which executable codes are stored, which, when executed by the processor, cause the processor to perform the method as described above.
本申请第六方面提供一种存储介质,其上存储有可执行代码,当所述可执行代码被电子设备的处理器执行时,使所述处理器执行如上所述的方法。The sixth aspect of the present application provides a storage medium, on which executable code is stored, and when the executable code is executed by a processor of an electronic device, the processor is made to execute the above-mentioned method.
有益效果Beneficial effect
本申请提供的技术方案可以包括以下有益效果:本申请通过获取待测分子的力场数据,并对待测分子的第一晶型进行能量最小化处理,然后在等温等压系综下,计算待测分子的第一晶型在预设温度处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线,在针对原子数较多、柔性角较多且各向异性的分子晶体,能够准确计算分子晶体的自由能,节约算力和计算时间。应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本申请。The technical solution provided by the application may include the following beneficial effects: the application obtains the force field data of the molecule to be tested, and performs energy minimization processing on the first crystal form of the molecule to be tested, and then calculates the When the first crystal form of the molecule to be measured is in an equilibrium state at a preset temperature, the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction, and the potential energy of the first crystal form of the molecule to be measured and volume, based on the vibration frequency, the potential energy and volume of the first crystalline form of the analyte molecule, using a preset free energy calculation formula to determine the first crystal form of the analyte molecule at multiple preset temperatures The change curve of free energy with volume can accurately calculate the free energy of molecular crystals for molecular crystals with more atoms, more flexible angles and anisotropy, saving computing power and computing time. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the application.
附图说明Description of drawings
通过结合附图对本申请示例性实施方式进行更详细的描述,本申请的上述以及其它目的、特征和优势将变得更加明显,其中,在本申请示例性实施方式中,相同的参考标号通常代表相同部件。The above and other objects, features and advantages of the present application will become more apparent by describing the exemplary embodiments of the present application in more detail with reference to the accompanying drawings, wherein, in the exemplary embodiments of the present application, the same reference numerals generally represent same parts.
图1是本申请实施例示出的一种分子自由能计算方法的流程示意图;Fig. 1 is a schematic flow chart of a molecular free energy calculation method shown in the embodiment of the present application;
图2是本申请实施例示出的一种扩胞方法的流程示意图;Fig. 2 is a schematic flow chart of a cell expansion method shown in the embodiment of the present application;
图3是本申请实施例示出的一种振动频率计算方法的流程示意图;Fig. 3 is a schematic flow chart of a vibration frequency calculation method shown in an embodiment of the present application;
图4是本申请实施例示出的一种势能、体积计算方法的流程示意图;Fig. 4 is a schematic flow chart of a potential energy and volume calculation method shown in the embodiment of the present application;
图5是本申请实施例示出的一种分子稳定性分析分析方法的流程示意图;5 is a schematic flow diagram of a molecular stability analysis method shown in the embodiment of the present application;
图6是本申请实施例提供的一种分子结构示意图;Fig. 6 is a schematic diagram of a molecular structure provided by an embodiment of the present application;
图7是本申请实施例提供的一种分子位移均方根差随时间变化图;Fig. 7 is a diagram of the root mean square difference of molecular displacement as a function of time provided by the embodiment of the present application;
图8是本申请实施例提供的一种分子体积随时间变化图;Figure 8 is a diagram of the change in molecular volume with time provided by the embodiment of the present application;
图9是本申请实施例提供的一种绝对自由能曲线示意图;Fig. 9 is a schematic diagram of an absolute free energy curve provided in an embodiment of the present application;
图10是本申请实施例提供的一种不同晶型相对自由能随温度变化的曲线图;Figure 10 is a graph of the relative free energy of different crystal forms as a function of temperature provided by the examples of the present application;
图11是本申请实施例提供的一种采用PCSP计算的相对自由能曲线图;Fig. 11 is a kind of relative free energy curve chart that adopts PCSP calculation provided by the embodiment of the present application;
图12是本申请实施例提供的一种分子自由能计算装置的结构示意图;Fig. 12 is a schematic structural diagram of a molecular free energy calculation device provided in an embodiment of the present application;
图13是本申请实施例通过的一种分子稳定性分析装置示意图;Figure 13 is a schematic diagram of a molecular stability analysis device adopted in the embodiment of the present application;
图14本申请实施例示出的一种电子设备的结构示意图。FIG. 14 is a schematic structural diagram of an electronic device shown in an embodiment of the present application.
本发明的实施方式Embodiments of the present invention
下面将参照附图更详细地描述本申请的实施方式。虽然附图中显示了本申请的实施方式,然而应该理解,可以以各种形式实现本申请而不应被这里阐述的实施方式所限制。相反,提供这些实施方式是为了使本申请更加透彻和完整,并且能够将本申请的范围完整地传达给本领域的技术人员。Embodiments of the present application will be described in more detail below with reference to the accompanying drawings. Although embodiments of the present application are shown in the drawings, it should be understood that the present application may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided so that this application will be thorough and complete, and will fully convey the scope of this application to those skilled in the art.
在本申请使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本申请。在本申请和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。还应当理解,本文中使用的术语“和/或”是指并包含一个或多个相关联的列出项目的任何或所有可能组合。The terminology used in this application is for the purpose of describing particular embodiments only, and is not intended to limit the application. As used in this application and the appended claims, the singular forms "a", "the", and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It should also be understood that the term "and/or" as used herein refers to and includes any and all possible combinations of one or more of the associated listed items.
应当理解,尽管在本申请可能采用术语“第一”、“第二”、“第三”等来描述各种信息,但这些信息不应限于这些术语。这些术语仅用来将同一类型的信息彼此区分开。例如,在不脱离本申请范围的情况下,第一信息也可以被称为第二信息,类似地,第二信息也可以被称为第一信息。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。在本申请的描述中,“多个”的含义是两个或两个以上,除非另有明确具体的限定。It should be understood that although the terms "first", "second", "third" and so on may be used in this application to describe various information, such information should not be limited to these terms. These terms are only used to distinguish information of the same type from one another. For example, without departing from the scope of the present application, first information may also be called second information, and similarly, second information may also be called first information. Thus, a feature defined as "first" and "second" may explicitly or implicitly include one or more of these features. In the description of the present application, "plurality" means two or more, unless otherwise specifically defined.
在相关技术方案中,一些可以较为准确的预测晶体分子绝对自由能的方法中,包括简谐近似法(Harmonic Approximation,HA)、准简谐近似法(Quasi Harmonic Approximation,QHA)、伪超临界路径(Pseudo-supercritical Path,PSCP)等计算方法,但是这些方法普遍存在一些问题,如对于一些原子数较多的有机分子晶体,通过量子化学方法计算自由能的成本较高,甚至无法计算,计算过程中几乎不会改变晶体的堆积模式和分子的构象,而对于多晶型的研究则是无法实现的,以及模拟热膨胀效应是各向同性的,而对于通常为各向异性的有机分子晶体其计算结果往往会有很大误差。In related technical solutions, some methods that can accurately predict the absolute free energy of crystal molecules include Harmonic Approximation (HA), Quasi Harmonic Approximation (QHA), pseudo-supercritical path (Pseudo-supercritical Path, PSCP) and other calculation methods, but these methods generally have some problems, such as for some organic molecular crystals with a large number of atoms, the cost of calculating free energy by quantum chemical methods is high, or even impossible to calculate, the calculation process The packing mode and molecular conformation of the crystal will hardly be changed, but the study of polymorphism is impossible, and the simulated thermal expansion effect is isotropic, while the calculation of the usually anisotropic organic molecular crystal The results are often very inaccurate.
针对上述问题,本申请实施例提供一种分子自由能计算方法,通过获取待测分子的力场数据,并对待测分子的第一晶型进行能量最小化处理, 然后在等温等压系综下,计算待测分子的第一晶型在预设温度处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线,在针对原子数较多、柔性角较多且各向异性的分子晶体,能够准确计算分子晶体的自由能,节约算力和计算时间。In view of the above problems, the embodiment of the present application provides a molecular free energy calculation method, by obtaining the force field data of the molecule to be tested, and performing energy minimization on the first crystal form of the molecule to be tested, and then under the isothermal and isobaric ensemble , calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured is in an equilibrium state at a preset temperature, and the first crystal form of the molecule to be measured The potential energy and volume of the type, based on the vibration frequency, the potential energy and volume of the first crystal form of the molecule to be measured, using the preset free energy calculation formula to determine the first crystal form of the molecule to be measured By setting the curve of free energy versus volume at temperature, the free energy of molecular crystals can be accurately calculated for molecular crystals with more atoms, more flexible angles, and anisotropy, saving computing power and computing time.
以下结合附图详细描述本申请实施例的技术方案。The technical solutions of the embodiments of the present application are described in detail below with reference to the accompanying drawings.
图1是本申请实施例示出的分子晶体自由能计算方法的流程示意图。Fig. 1 is a schematic flowchart of a method for calculating free energy of a molecular crystal shown in an embodiment of the present application.
参见图1,本申请实施例提供的分子晶体自由能计算方法包括:Referring to Figure 1, the molecular crystal free energy calculation method provided by the embodiment of the present application includes:
步骤S101,采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息。Step S101, using a molecular dynamics method to perform energy minimization on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested, to obtain the energy-minimized first crystal form of the molecule to be tested. structural information.
在本申请实施例中,分子动力学方法是一种分子模拟方法,可以采用预先设置的gromacs(分子动力学程序包)程序,该gromacs程序是预先设置好的,先通过力场数据生成gromacs程序需要的输入文件,将该输入文件输入至gromacs程序,可以对待测分子的任意晶型进行能量最小化处理、平衡状态的模拟等功能;待测分子是需要计算自由能的分子,该分子可以有多种晶型,也可以仅有一种晶型;采用本申请实施例通过的分子自由能计算方法能够计算该分子处于各种晶型下的自由能。力场数据是指用于表示待测分子中各个原子之间力场参数的数据;该力场数据可以力场文件的形式存储,便于后续取用。在本申请实施例中,还需要获取待测分子的某一晶型的初始坐标数据,初始坐标数据是指待测分子处于某一晶型下的结构放置在三维直角坐标系中的坐标数据,该初始坐标数据用于在三维直角坐标系中表示待测分子中各个原子的坐标;该初始坐标数据可以坐标文件的形式存储,便于后续取用在本申请实施例中,待测分子处于不同晶型时,其对应的初始坐标数据不同。In the embodiment of this application, the molecular dynamics method is a molecular simulation method, and the pre-set gromacs (molecular dynamics program package) program can be used. The gromacs program is pre-set. First, the gromacs program is generated through the force field data The required input file, input the input file to the gromacs program, can perform functions such as energy minimization processing and equilibrium state simulation of any crystal form of the molecule to be tested; the molecule to be tested is a molecule that needs to calculate the free energy, and the molecule can have Multiple crystal forms, or only one crystal form; the free energy of the molecule in various crystal forms can be calculated by using the molecular free energy calculation method adopted in the embodiment of the present application. The force field data refers to the data used to represent the force field parameters between each atom in the molecule to be measured; the force field data can be stored in the form of a force field file for subsequent access. In the embodiment of the present application, it is also necessary to obtain the initial coordinate data of a certain crystal form of the molecule to be tested. The initial coordinate data refers to the coordinate data of the structure of the molecule to be tested in a certain crystal form placed in a three-dimensional rectangular coordinate system. The initial coordinate data is used to represent the coordinates of each atom in the molecule to be measured in a three-dimensional Cartesian coordinate system; the initial coordinate data can be stored in the form of a coordinate file, which is convenient for subsequent access. In the embodiments of the present application, the molecules to be measured are in different crystal type, the corresponding initial coordinate data are different.
在本申请实施例中,能量最小化处理是指对待测分子的第一晶型中的原子位置进行反复的迭代调整,降低该晶型系统的总能量,其中,能量最小化处理中,预先设置有迭代停止标准,当迭代中达到该迭代停止标准时, 停止迭代,将该晶型系统的总能量降低到一个预期的临界点,并得到能量最小化处理后的待测分子的第一晶型的结构信息。In the embodiment of the present application, the energy minimization process refers to repeated iterative adjustment of the position of atoms in the first crystal form of the molecule to be tested to reduce the total energy of the crystal form system, wherein, in the energy minimization process, preset There is an iteration stop criterion, and when the iteration stop criterion is reached during the iteration, the iteration is stopped, the total energy of the crystal system is reduced to an expected critical point, and the energy-minimized first crystal form of the molecule to be tested is obtained. structural information.
步骤S102,采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积。Step S102, using the molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, under the isothermal and isobaric ensemble, calculate the molecule to be tested after the energy minimization process When the first crystal form of the molecule is in equilibrium at a preset temperature, the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction, and the potential energy and volume.
在本申请实施例中,等温等压系综(NPT系综)是一种具有可控温度和可控压强的系综,在该系综中,各体系之间可以进行能量和体积交换,但是该系综内各体系能量的总和以及体积的总和是固定不变的,在本申请实施例中,待测分子可以作为该等温等压系综中的一个体系。In the embodiment of this application, the isothermal and isobaric ensemble (NPT ensemble) is an ensemble with controllable temperature and controllable pressure. In this ensemble, energy and volume can be exchanged between systems, but The sum of energy and volume of each system in the ensemble is constant, and in the embodiment of the present application, the molecule to be measured can be used as a system in the isothermal and isobaric ensemble.
作为本申请一种可能的实施方式,在该实施方式中,在对待测分子进行能量最小化处理之后,在等温等压系综下,对待测分子进行模拟,在预设的温度区间选取指定温度,在该指定温度下,将该待测分子模拟至平衡状态,确定该平衡状态下待测分子的结构,并计算该结构下,待测分子在各个坐标方向的振动频率。在本申请实施例中,待测分子在各个坐标方向上的震动频率是指待测分子中的各个原子分别在x轴、y轴和z轴三个方向上的振动频率。As a possible implementation of the present application, in this embodiment, after the energy minimization process is performed on the molecules to be tested, the molecules to be tested are simulated under the isothermal and isobaric ensemble, and the specified temperature is selected in the preset temperature range , at the specified temperature, simulate the molecule to be measured to an equilibrium state, determine the structure of the molecule to be measured in the equilibrium state, and calculate the vibration frequency of the molecule to be measured in each coordinate direction under the structure. In the embodiment of the present application, the vibration frequency of the molecule to be measured in each coordinate direction refers to the vibration frequency of each atom in the molecule to be measured in the three directions of x-axis, y-axis and z-axis respectively.
在本申请实施例中,在对待测分子的第一晶型进行能量最小化处理后,计算待测分子的第一晶型的势能和体积,其中势能是指该待测分子的第一晶型处于能量最小化状态时的分子势能,可以采用分子动力学方法计算获得,例如gromacs;同理,该待测分子的第一晶型的体积也可以采用分子动力学方法计算获得,例如gromacs。In the embodiment of the present application, after the energy minimization process is performed on the first crystal form of the molecule to be tested, the potential energy and volume of the first crystal form of the molecule to be tested are calculated, wherein the potential energy refers to the first crystal form of the molecule to be tested The molecular potential energy in the state of energy minimization can be calculated by molecular dynamics methods, such as gromacs; similarly, the volume of the first crystal form of the molecule to be tested can also be calculated by molecular dynamics methods, such as gromacs.
步骤S103,基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线。Step S103, based on the vibration frequency, the potential energy and volume of the first crystalline form of the analyte molecule, using a preset free energy calculation formula to determine the first crystal form of the analyte molecule at multiple preset temperatures The curve of free energy as a function of volume.
在本申请实施例中,计算得到的待测分子的势能为U,体积为V,振动频率为ω,其中,预设的自由能公式可采用如下公式:In the embodiment of the present application, the calculated potential energy of the molecule to be measured is U, the volume is V, and the vibration frequency is ω, wherein the preset free energy formula can be as follows:
Figure PCTCN2021136976-appb-000001
Figure PCTCN2021136976-appb-000001
Figure PCTCN2021136976-appb-000002
Figure PCTCN2021136976-appb-000002
其中,F(V,T)为待测分子处于温度T体积V时的自由能,U(V)为待测分子体积为V时的势能,N A为阿伏伽德罗常数,P为系综压强,k B为玻尔兹曼常数,k(ω)为原子k的频率,
Figure PCTCN2021136976-appb-000003
为约化普朗克常数,G(T)表示为温度T时的最终自由能。
Among them, F(V,T) is the free energy of the molecule to be measured at the temperature T and volume V, U(V) is the potential energy of the molecule to be measured when the volume is V, N A is Avogadro's constant, and P is the system Comprehensive pressure, k B is the Boltzmann constant, k(ω) is the frequency of atom k,
Figure PCTCN2021136976-appb-000003
To reduce Planck's constant, G(T) is expressed as the final free energy at temperature T.
在本申请实施例中,采用上述公式可以求解出在温度T时,待测分子的自由能随体积变化的曲线。In the embodiment of the present application, the above-mentioned formula can be used to solve the curve of the free energy of the molecule to be measured as a function of the volume at the temperature T.
步骤S104,将所述待测分子的第一晶型在各个预设温度下的自由能随体积的变化曲线中的最小自由能值作为相应预设温度下所述待测分子的第一晶型的绝对自由能。Step S104, taking the minimum free energy value in the free energy versus volume curve of the first crystal form of the molecule to be tested at each preset temperature as the first crystal form of the molecule to be tested at the corresponding preset temperature of absolute free energy.
在本申请实施例中,根据待测分子在不同温度下自由能与体积的曲线关系,选取待测分子在每个温度下的自由能最小值,这些最小值即为该待测分子在对应温度下的绝对自由能,采用该方法可以获取待测分子处于不同温度下的绝对自由能。In the embodiment of the present application, according to the curve relationship between the free energy and the volume of the molecule to be measured at different temperatures, the minimum value of the free energy of the molecule to be measured at each temperature is selected, and these minimum values are the minimum values of the free energy of the molecule to be measured at the corresponding temperature. The absolute free energy at different temperatures can be obtained by using this method.
本申请提供的技术方案可以包括以下有益效果:本申请实施例通过获取待测分子的力场数据和待测分子的第一晶型的初始坐标数据,并对待测分子的第一晶型进行能量最小化处理,然后在等温等压系综下,计算待测分子的第一晶型在预设温度处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线,在针对原子数较多、柔性角较多且各向异性的分子晶体,能够准确计算分子晶体的自由能,节约算力和计算时间。The technical solution provided by the present application may include the following beneficial effects: In the embodiment of the present application, by acquiring the force field data of the molecule to be tested and the initial coordinate data of the first crystal form of the molecule to be tested, and performing energy analysis on the first crystal form of the molecule to be tested Minimize the processing, and then under the isothermal and isobaric ensemble, calculate the vibration of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured is in an equilibrium state at a preset temperature frequency, and the potential energy and volume of the first crystal form of the molecule to be measured, based on the vibration frequency, the potential energy and volume of the first crystal form of the molecule to be measured, using a preset free energy calculation formula to determine the The curve of free energy versus volume of the first crystal form of molecules at multiple preset temperatures can accurately calculate the free energy of molecular crystals for molecular crystals with more atoms, more flexible angles and anisotropy. Save computing power and computing time.
作为本申请一种可能的实施方式,在该实施方式中,如图2所示,所述采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息,包括:As a possible implementation of the present application, in this implementation, as shown in Figure 2, the first crystal form of the molecule to be tested is analyzed based on the force field data of the molecule to be tested using a molecular dynamics method. Energy minimization processing, obtaining the structural information of the first crystal form of the molecule to be tested after energy minimization processing, including:
步骤S201,基于所述待测分子的力场数据和所述待测分子的第一晶型的结构数据,对所述待测分子的第一晶型的晶胞进行扩胞,得到预设尺寸的超胞。Step S201, based on the force field data of the molecule to be tested and the structure data of the first crystal form of the molecule to be tested, expand the unit cell of the first crystal form of the molecule to be tested to obtain a preset size supercell.
在本申请实施例中,待测分子的结构数据为存储有具体分子结构的文 件,例如.cif文件或.res文件,对该结构文件中的晶胞进行扩胞处理,生成预设尺寸的超胞,可选的,该预设尺寸的超胞为3×2×5的超胞,可选的,该超胞的具体尺寸可以根据需要设定,此处不做限定。In the embodiment of the present application, the structural data of the molecule to be measured is a file storing a specific molecular structure, such as a .cif file or a .res file, and the cell in the structure file is expanded to generate a super Optionally, the preset size of the supercell is a 3×2×5 supercell. Optionally, the specific size of the supercell can be set according to needs, which is not limited here.
步骤S202,基于所述待测分子的力场数据,对所述超胞进行能量最小化处理,得到能量最小化处理后的超胞的结构信息。Step S202, based on the force field data of the molecule to be tested, perform energy minimization processing on the supercell, and obtain structural information of the supercell after energy minimization processing.
在本申请实施例中,能量最小化处理是指对待测分子的第一晶型中的原子位置进行反复的迭代调整,降低该晶型系统的总能量,其中,能量最小化处理中,预先设置有迭代停止标准,当迭代中达到该迭代停止标准时,停止迭代,将该晶型系统的总能量降低到一个预期的临界点,并得到能量最小化处理后的待测分子的第一晶型的超胞的结构信息。In the embodiment of the present application, the energy minimization process refers to repeated iterative adjustment of the position of atoms in the first crystal form of the molecule to be tested to reduce the total energy of the crystal form system, wherein, in the energy minimization process, preset There is an iteration stop criterion, when the iteration stop criterion is reached in the iteration, the iteration is stopped, the total energy of the crystal form system is reduced to an expected critical point, and the energy minimization treatment of the first crystal form of the molecule to be tested is obtained. Structural information of the supercell.
作为本申请一种可能的实施方式,在对待测分子的第一晶型进行能量最小化处理时,采用预先设置的gromacs程序,该gromacs程序是预先设置好的,先通过力场数据生成gromacs程序需要的输入文件,将该输入文件输入至gromacs程序对待测分子的第一晶型的超胞进行能量最小化处理,可以得到能量最小化处理后的待测分子的第一晶型的超胞的结构信息。As a possible implementation of the present application, when the energy minimization process is performed on the first crystal form of the molecule to be tested, the pre-set gromacs program is used. The gromacs program is pre-set, and the gromacs program is first generated from the force field data The required input file, input the input file to the gromacs program to perform energy minimization on the supercell of the first crystal form of the molecule to be tested, and obtain the supercell of the first crystal form of the molecule to be tested after energy minimization structural information.
本申请实施例通过对分子结构进行扩胞处理,并对扩胞得到的超胞进行能量最小化处理,保证后续分子自由能测量的准确性。In the embodiment of the present application, the molecular structure is expanded and the energy minimization is performed on the expanded supercell to ensure the accuracy of the subsequent molecular free energy measurement.
作为本申请一种可能的实施方式,在该实施方式中,如图3所示,采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,包括:As a possible implementation of the present application, in this implementation, as shown in Figure 3, molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, in Under the isothermal and isobaric ensemble, when the first crystal form of the molecule to be tested after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be tested at each coordinate The vibration frequency of the direction, and the potential energy and volume of the first crystal form of the molecule to be measured include:
步骤S301,在预设的温度区间内,对能量最小化处理后的超胞进行等温等压系综模拟,获得待测分子的第一晶型的超胞的模拟轨迹。In step S301 , within a preset temperature range, an isothermal and isobaric ensemble simulation is performed on the energy-minimized supercell to obtain a simulated trajectory of the supercell of the first crystal form of the molecule to be tested.
在本申请实施例中,在计算待测分子的第一晶型的所有原子的振动频率时,需要先对待测分子的第一晶型进行模拟,可选的,可以在等压等温系综下模拟,接前述实施例,对超胞进行能量最小化之后,调用gromacs程序,对能量最小化之后的超胞进行模拟,其中,预设的温度区间可以选择50K~300K的温度范围,该温度范围可以根据实际情况选择,本申请在 此不做限定。在该温度区间内,对超胞进行NPT模拟,其中,可以选取预设步长为1fs,每次模拟时间为7ns,采用上述预设步长和模拟时间对超胞进行模拟,得到模拟结果。In the embodiment of the present application, when calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be tested, it is necessary to simulate the first crystal form of the molecule to be tested first. Simulation, following the previous example, after the energy minimization of the supercell, call the gromacs program to simulate the supercell after energy minimization, wherein the preset temperature range can be selected from a temperature range of 50K to 300K, and the temperature range It can be selected according to the actual situation, and this application does not make a limitation here. In this temperature range, NPT simulation is performed on the supercell. The preset step size can be selected as 1 fs, and the simulation time is 7 ns each time. The supercell is simulated with the above preset step size and simulation time, and the simulation result is obtained.
步骤S302,在所述预设的温度区间选取多个温度值,在所述待测分子的第一晶型的超胞的模拟轨迹中选取与所述多个温度值分别对应的模拟轨迹帧,分别对多个所述模拟轨迹帧进行等温等压系综模拟,得到所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积。Step S302, selecting a plurality of temperature values in the preset temperature range, and selecting simulation trajectory frames respectively corresponding to the plurality of temperature values in the simulation trajectory of the supercell of the first crystal form of the molecule to be measured, performing isothermal and isobaric ensemble simulations on a plurality of the simulated trajectory frames respectively, to obtain volumes in equilibrium states of the first crystal form of the molecule to be measured at the plurality of temperature values respectively.
在本申请实施例中,轨迹帧是指模拟轨迹中某一时刻或者温度点用于表示分子中各个原子坐标和速度的帧,在选取的温度区间内,选择多个温度值,在一个具体实施例中,选取的温度值为50K、60K、70K、80K等,然后可按照预设步长,分别获取这些温度值对应的模拟轨迹帧,然后对每个模拟轨迹帧进行NPT模拟,其中,可以选取预设步长为1fs,每次模拟时间为4ns,采用上述预设步长和模拟时间对超胞进行模拟,得到模拟结果,当待测分子达到稳定平衡状态时,确定该待测分子分别在每个温度值下的稳定结构,并得到待测分子在每个温度值下稳定时的体积。In the embodiment of the present application, the trajectory frame refers to the frame used to represent the coordinates and speed of each atom in the molecule at a certain moment or temperature point in the simulated trajectory. In the selected temperature range, multiple temperature values are selected, and in a specific implementation In the example, the selected temperature values are 50K, 60K, 70K, 80K, etc., and then the simulated trajectory frames corresponding to these temperature values can be obtained according to the preset step size, and then NPT simulation is performed on each simulated trajectory frame. Among them, Select the preset step size as 1fs, and each simulation time as 4ns, use the above preset step size and simulation time to simulate the supercell, and obtain the simulation results. When the molecules to be measured reach a stable equilibrium state, determine the molecules to be measured respectively The stable structure at each temperature value, and obtain the volume of the analyte molecule at each temperature value when it is stable.
步骤S303,计算所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积的平均值,基于所述平均值选取目标模拟轨迹帧,对所述目标模拟轨迹帧对应的待测分子的第一晶型的超胞进行能量最小化处理,得到能量最小化处理后的分子结构;其中,选择模拟轨迹帧对应的待测分子的第一晶型的体积与所述平均体积的误差最小的模拟轨迹帧为目标模拟轨迹帧。Step S303, calculating the average value of the volumes of the equilibrium state of the first crystal form of the molecule to be measured at the multiple temperature values, selecting a target simulation trajectory frame based on the average value, and calculating the target simulation trajectory frame The supercell corresponding to the first crystal form of the molecule to be measured is subjected to energy minimization to obtain the molecular structure after the energy minimization process; wherein, the volume of the first crystal form of the molecule to be tested corresponding to the simulated trajectory frame is selected to match the The simulated trajectory frame with the smallest average volume error is the target simulated trajectory frame.
在本申请实施例中,针对待测分子在每个温度值下稳定时的体积,计算待测分子的平均体积,并获取目标模拟轨迹帧,其中,所述目标模拟轨迹帧对应的待测分子的体积与所述平均体积的误差最小,采用待测分子的平均体积能够准确的确定待测分子在不同温度下体积的平均水平,然后基于待测分子体积的平均水平确定目标模拟轨迹帧,保证模拟的准确性。In the embodiment of the present application, the average volume of the molecule to be measured is calculated for the volume of the molecule to be measured when it is stable at each temperature value, and the target simulation trajectory frame is obtained, wherein the target molecule to be measured corresponding to the simulation trajectory frame The error between the volume of the molecule and the average volume is the smallest, and the average volume of the molecules to be measured can be accurately determined by using the average volume of the molecules to be measured. Simulation accuracy.
步骤S304,对所述能量最小化处理后的分子结构进行模式分析,计算所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率。Step S304, performing mode analysis on the molecular structure after the energy minimization process, and calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction.
在本申请实施例中,在对待测分子进行能量最小化处理之后,在等温等压系综下,对待测分子的第一晶型进行模拟,在预设的温度区间选取指 定温度,在该指定温度下,将该待测分子的第一晶型模拟至平衡状态,确定该平衡状态下待测分子的第一晶型的结构,并计算该结构下,待测分子的第一晶型的所有原子在各个坐标方向的振动频率。在本申请实施例中,待测分子在各个坐标方向上的震动频率是指待测分子的第一晶型的各个原子分别在x轴、y轴和z轴三个方向上的振动频率。In the embodiment of the present application, after the energy minimization process is performed on the molecule to be tested, the first crystal form of the molecule to be tested is simulated under the isothermal and isobaric ensemble, and a specified temperature is selected in a preset temperature range. Under the temperature, simulate the first crystal form of the molecule to be measured to an equilibrium state, determine the structure of the first crystal form of the molecule to be measured in the equilibrium state, and calculate all the first crystal forms of the molecule to be measured under the structure The vibration frequency of the atom in each coordinate direction. In the embodiments of the present application, the vibration frequencies of the molecules to be measured in each coordinate direction refer to the vibration frequencies of each atom of the first crystal form of the molecules to be measured in the three directions of x-axis, y-axis and z-axis respectively.
在本申请实施例中,通过在不同温度下进行分子动力学采样,可以得到各向异性更好的膨胀体积,可以更好的处理各向同性膨胀和高耗时等问题。In the embodiment of the present application, by performing molecular dynamics sampling at different temperatures, an expansion volume with better anisotropy can be obtained, and problems such as isotropic expansion and high time consumption can be better dealt with.
作为本申请一种可能的实施方式,在该实施方式中,如图4所示,采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,包括:As a possible implementation of the present application, in this implementation, as shown in Figure 4, molecular dynamics method is used, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, in Under the isothermal and isobaric ensemble, when the first crystal form of the molecule to be tested after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be tested at each coordinate The vibration frequency of the direction, and the potential energy and volume of the first crystal form of the molecule to be measured include:
步骤S401,依次采用最速下降法和共轭梯度法对所述目标模拟轨迹帧进行能量最小化处理。In step S401, the energy minimization process is performed on the target simulated trajectory frame by using the steepest descent method and the conjugate gradient method in sequence.
在本申请实施例中,在对目标模拟轨迹帧进行能量最小化处理时,可以依次采用最速下降法和共轭梯度法,将目标模拟轨迹帧作为输入,先后进行最速下降法和共轭梯度法的能量最小化。In the embodiment of this application, when performing energy minimization processing on the target simulation trajectory frame, the steepest descent method and the conjugate gradient method can be used in sequence, and the target simulation trajectory frame is used as input, and the steepest descent method and the conjugate gradient method are successively performed energy minimization.
步骤S402,当所述能量最小化处理达到预设的标准时,计算进行能量最小化处理后的所述待测分子的第一晶型的势能和体积,得到所述待测分子的第一晶型的势能和体积。Step S402, when the energy minimization process reaches the preset standard, calculate the potential energy and volume of the first crystalline form of the analyte molecule after the energy minimization process, and obtain the first crystalline form of the analyte molecule potential energy and volume.
在本申请实施例中,采用分子动力学方法,当能量最小化达到预设的标准,如梯度ΔF<0.001时,表示能量最小化达到最小值,计算此时待测分子的势能和体积。In the embodiment of the present application, the molecular dynamics method is adopted. When the energy minimization reaches the preset standard, such as gradient ΔF<0.001, it means that the energy minimization reaches the minimum value, and the potential energy and volume of the molecule to be measured at this time are calculated.
作为本申请一种可能的实施方式,在该实施方式中,该方法还包括:As a possible implementation of the present application, in this implementation, the method further includes:
针对所述待测分子的任一晶型,分别采用前述实施例中的方法,得到在不同预设温度下,所述待测分子不同晶型对应的绝对自由能。For any crystal form of the molecule to be tested, the methods in the foregoing examples are used respectively to obtain the absolute free energy corresponding to different crystal forms of the molecule to be tested at different preset temperatures.
步骤S402,选取不同预设温度下,选取所述待测分子的自由能最小值,得到所述待测分子在不同预设温度下的绝对自由能曲线。Step S402, select different preset temperatures, select the minimum value of free energy of the molecules to be measured, and obtain absolute free energy curves of the molecules to be measured at different preset temperatures.
在本申请实施例中,针对待测分子不同的晶型,可以采用前述实施例中任意实施例提供的分子自由能计算方法计算该晶型在不同温度下的绝对自由能曲线,其原理与前述实施例中的原理类似,此处不再赘述。In the embodiment of the present application, for the different crystal forms of the molecules to be tested, the molecular free energy calculation method provided in any of the foregoing embodiments can be used to calculate the absolute free energy curves of the crystal forms at different temperatures. The principle is the same as that of the foregoing The principles in the embodiments are similar and will not be repeated here.
在本申请实施例中,提供的分子自由能计算方法能够针对待测分子处于不同晶型时的自由能,在获取待测分子的不同晶型的初始坐标数据后,可以采用前述实施例中提供的分子自由能计算方法,分别计算待测分子的不同晶型,在所述不同预设温度下的自由能随体积的变化曲线,其计算原理与前述实施例相同,此处不再赘述。In the embodiment of the present application, the molecular free energy calculation method provided can aim at the free energy when the molecule to be tested is in different crystal forms. After obtaining the initial coordinate data of different crystal forms of the molecule to be tested, the method provided in the previous examples can be used The free energy calculation method of molecules is used to calculate the free energy variation curves with volume at different preset temperatures for different crystal forms of the molecules to be tested. The calculation principle is the same as that of the previous embodiment, and will not be repeated here.
本申请提供的技术方案可以包括以下有益效果:本申请实施例通过获取待测分子的力场数据和待测分子的不同晶型的初始坐标数据,并对待测分子的不同晶型分别进行能量最小化处理,然后在等温等压系综下,分别计算所述待测分子的不同晶型在预设温度处于平衡状态时,所述待测分子的不同晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的不同晶型的势能和体积,然后基于所述待测分子的不同晶型的所有原子在各个坐标方向的振动频率、所述待测分子的不同晶型的势能和体积,采用预设的自由能计算公式分别确定所述待测分子的不同晶型在多个预设温度下的自由能随体积的变化曲线,在针对原子数较多、柔性角较多且各向异性的分子晶体,能够准确计算分子晶体的自由能,节约算力和计算时间。The technical solution provided by the present application may include the following beneficial effects: the embodiment of the present application obtains the force field data of the molecule to be tested and the initial coordinate data of different crystal forms of the molecule to be tested, and performs energy minimization for the different crystal forms of the molecule to be tested respectively. Then, under the isothermal and isobaric ensemble, respectively calculate the vibrations of all the atoms of the different crystal forms of the molecule to be tested in each coordinate direction when the different crystal forms of the molecule to be tested are in an equilibrium state at a preset temperature frequency, and the potential energy and volume of the different crystal forms of the molecule to be measured, and then based on the vibration frequencies of all atoms of the different crystal forms of the molecule to be measured in each coordinate direction, the potential energy of the different crystal forms of the molecule to be measured and volume, using the preset free energy calculation formula to determine the curves of the free energy versus volume of different crystal forms of the molecule to be tested at multiple preset temperatures. Anisotropic molecular crystals can accurately calculate the free energy of molecular crystals, saving computing power and computing time.
本申请实施例提供了一种分子稳定性分析方法,如图5所示,包括:The embodiment of the present application provides a molecular stability analysis method, as shown in Figure 5, including:
步骤S501,采用前述实施例中的方法,获得不同预设温度下所述待测分子的不同晶型的绝对自由能。Step S501, using the method in the foregoing embodiments, to obtain the absolute free energies of different crystal forms of the molecule to be measured at different preset temperatures.
在本申请实施例中,根据前述实施例,可以针对待测分子的不同晶型分别在不同温度下的绝对自由能,原理此处不再赘述。In the embodiment of the present application, according to the foregoing embodiments, the absolute free energies of different crystal forms of the molecule to be tested at different temperatures can be determined, and the principle will not be repeated here.
步骤S502,以其中任一晶型在不同预设温度下的绝对自由能为参比,将其他晶型在相同预设温度下的绝对自由能减去参比,得到各晶型在不同预设温度下的相对自由能值。Step S502, taking the absolute free energy of any one of the crystal forms at different preset temperatures as a reference, and subtracting the reference from the absolute free energy of other crystal forms at the same preset temperature, to obtain the absolute free energy of each crystal form at different preset temperatures. Relative free energy values at temperature.
在本申请实施例中,针对该待测分子的任一晶型,以该晶型在不同预设温度下的绝对自由能为参比,将其它晶型在该温度下的绝对自由能减去该参比,可得到各晶型在各温度下的相对自由能。In the examples of this application, for any crystal form of the molecule to be tested, the absolute free energy of the crystal form at different preset temperatures is used as a reference, and the absolute free energy of other crystal forms at the temperature is subtracted With this reference, the relative free energy of each crystal form at each temperature can be obtained.
步骤S503,基于各晶型在不同预设温度下的相对自由能值,确定特定 温度下最稳定的晶型,以同一温度下相对自由能值最低对应的晶型为最稳定晶型。Step S503, based on the relative free energy values of each crystal form at different preset temperatures, determine the most stable crystal form at a specific temperature, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
在本申请实施例中,针对不同晶型,在相同温度时,相对自由能最低的晶型即为该待测分子在该温度下最稳定的晶型。In the examples of the present application, for different crystal forms, at the same temperature, the crystal form with the lowest relative free energy is the most stable crystal form of the molecule to be tested at that temperature.
对于本申请实施例,为方便说明,以一个具体实施例为例,如图6所示,为新冠药物法匹拉韦(Favipiravir)分子的结构式,获取该分子的分子坐标文件,该文件格式可以是qbd格式的文件,使用yoda程序抓取去对应的GAFF力场数据,获取其一种晶型的晶体结构cif文件,对该结构文件扩胞,生成一个3x2x5的超胞,读取力场数据生成gromacs(分子动力学方法)程序所需的输入文件,调用gromacs程序,对超胞进行能量最小化,再调用gromacs程序,对能量最小化后的超胞进行NPT模拟,温度从50K升温至300K,步长1fs,模拟7ns,得到模拟轨迹,以10K为间隔分别从上述模拟轨迹中抽取特定温度对应的时间点的轨迹帧,并继续进行NPT模拟,步长1fs,模拟4ns,最终得到所有温度下的平衡结构,其RMSD(位移均方根差)和体积随模拟时间变化分别如图7和8所示,体系在每个温度下模拟均达到平衡;调用gromacs程序,选取上述NPT模拟中平均体积帧作为输入,先后进行最速下降法和共轭梯度法的能量最小化,保证优化达到收敛标准(如梯度ΔF<0.001),并计算势能(U)与当前体积(V);调用gromacs程序,以上述能量最小化后的结构作为输入,进行模式分析,计算振动频率(ω);采用以下公式,计算不同温度的自由能随体积的变化曲线。For the embodiment of the present application, for the convenience of description, take a specific embodiment as an example, as shown in Figure 6, which is the structural formula of the new crown drug Favipiravir (Favipiravir) molecule, and obtain the molecular coordinate file of the molecule. The file format can be It is a file in qbd format. Use the yoda program to capture the corresponding GAFF force field data, obtain a crystal structure cif file of a crystal form, expand the structure file to generate a 3x2x5 supercell, and read the force field data Generate the input file required by the gromacs (molecular dynamics method) program, call the gromacs program to minimize the energy of the supercell, and then call the gromacs program to perform NPT simulation on the energy-minimized supercell, and the temperature is raised from 50K to 300K , the step size is 1fs, the simulation is 7ns, and the simulation trajectory is obtained, and the trajectory frame of the time point corresponding to the specific temperature is extracted from the above simulation trajectory at an interval of 10K, and the NPT simulation is continued, the step size is 1fs, the simulation is 4ns, and finally all the temperatures are obtained The equilibrium structure below, its RMSD (Root Mean Square Difference of Displacement) and the variation of its volume with the simulation time are shown in Figures 7 and 8, respectively. The volume frame is used as input, and the energy minimization of the steepest descent method and the conjugate gradient method are carried out successively to ensure that the optimization reaches the convergence standard (such as gradient ΔF<0.001), and the potential energy (U) and the current volume (V) are calculated; call the gromacs program, Using the above energy-minimized structure as input, conduct mode analysis to calculate the vibration frequency (ω); use the following formula to calculate the curve of free energy versus volume at different temperatures.
Figure PCTCN2021136976-appb-000004
Figure PCTCN2021136976-appb-000004
Figure PCTCN2021136976-appb-000005
Figure PCTCN2021136976-appb-000005
其中,F(V,T)为待测分子处于温度T体积V时的自由能,U(V)为待测分子体积为V时的势能,N A为阿伏伽德罗常数,P为系综压强,k B为玻尔兹曼常数,k(ω)为原子k的频率,
Figure PCTCN2021136976-appb-000006
为约化普朗克常数,G(T)表示为温度T时的最终自由能。
Among them, F(V,T) is the free energy of the molecule to be measured at the temperature T and volume V, U(V) is the potential energy of the molecule to be measured when the volume is V, N A is Avogadro's constant, and P is the system Comprehensive pressure, k B is the Boltzmann constant, k(ω) is the frequency of atom k,
Figure PCTCN2021136976-appb-000006
To reduce Planck's constant, G(T) is expressed as the final free energy at temperature T.
根据不同温度(T)下自由能(F(V,T))随体积(V)变化曲线的最小值,即可求得该温度下的绝对自由能ΔGabs(T),如图9所示。将待 测分子的不同晶型的坐标文件作为输入重复上述步骤,即可计算出不同晶型之间的相对自由能随温度变化关系,如图10所示。同时,针对该分子的相同晶型,采用PCSP计算得到的相对自由能结果如图11所述,低温段不同结构的相对自由能趋势基本一致,但由于PSCP方法本身的局限性,50K-90K的计算结果是不准确的(本申请未示出),同时计算了力场QHA和PSCP方法平均每个结构计算所用核时比为663/2958,约0.22。说明本申请实施例提供的分子自由能计算方法在保证结果正确性的同时节省了约80%的计算成本。According to the minimum value of the curve of free energy (F(V, T)) versus volume (V) at different temperatures (T), the absolute free energy ΔGabs(T) at the temperature can be obtained, as shown in Figure 9. Using the coordinate files of different crystal forms of the molecule to be measured as input and repeating the above steps, the relationship between the relative free energy of different crystal forms with temperature can be calculated, as shown in Figure 10. At the same time, for the same crystal form of the molecule, the relative free energy results calculated by PCSP are shown in Figure 11. The relative free energy trends of different structures in the low temperature segment are basically the same, but due to the limitations of the PSCP method itself, the 50K-90K The calculation result is inaccurate (not shown in this application), and the force field QHA and PSCP methods are calculated at the same time. The average core time ratio for each structure calculation is 663/2958, about 0.22. It shows that the molecular free energy calculation method provided in the examples of this application saves about 80% of the calculation cost while ensuring the correctness of the results.
与前述应用功能实现方法实施例相对应,本申请还提供了一种分子自由能计算装置、电子设备及相应的实施例。Corresponding to the aforementioned embodiments of the method for realizing application functions, the present application also provides a molecular free energy calculation device, electronic equipment, and corresponding embodiments.
图12是本申请实施例示出的分子自由能计算装置的结构示意图。Fig. 12 is a schematic structural diagram of a molecular free energy calculation device shown in an embodiment of the present application.
参见图12,该分子自由能计算装置120包括能量最小化模块1210、平衡模拟模块1220、曲线计算模块1230、以及自由能计算模块1240,其中:Referring to Fig. 12, the molecular free energy calculation device 120 includes an energy minimization module 1210, a balance simulation module 1220, a curve calculation module 1230, and a free energy calculation module 1240, wherein:
能量最小化模块1210,用于采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息;The energy minimization module 1210 is configured to use a molecular dynamics method to perform energy minimization on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested, to obtain the energy minimization process of the molecule to be tested. Structural information of the first crystal form;
平衡模拟模块1220,用于采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积;The balance simulation module 1220 is configured to use a molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, to calculate the energy minimization process under the isothermal and isobaric ensemble. When the first crystalline form of the analyte molecule is in an equilibrium state at a preset temperature, the vibration frequencies of all atoms in the first crystalline form of the analyte molecule in each coordinate direction, and the first crystal form of the analyte molecule The potential energy and volume of the type;
曲线计算模块1230,用于基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线;The curve calculation module 1230 is configured to determine the first crystal form of the molecule to be measured in multiple The curve of free energy versus volume at a preset temperature;
自由能计算模块1240,用于将所述待测分子的第一晶型在各个预设温度下的自由能随体积的变化曲线中的最小自由能值作为相应预设温度下所述待测分子的第一晶型的绝对自由能。The free energy calculation module 1240 is configured to use the minimum free energy value in the curve of free energy versus volume of the first crystal form of the molecule to be measured at each preset temperature as the value of the molecule to be measured at the corresponding preset temperature The absolute free energy of the first crystal form of .
作为本申请一种可能的实施方式,在该实施例中,能量最小化模块1210在采用分子动力学方法,基于待测分子的力场数据对所述待测分子的 第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息时,可以用于:As a possible implementation manner of the present application, in this embodiment, the energy minimization module 1210 performs energy minimization on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested by using a molecular dynamics method. When obtaining the structure information of the first crystal form of the molecule to be tested after energy minimization treatment, it can be used for:
基于所述待测分子的力场数据和所述待测分子的第一晶型的结构数据,对所述待测分子的第一晶型的晶胞进行扩胞,得到预设尺寸的超胞;Expanding the unit cell of the first crystalline form of the analyte molecule based on the force field data of the analyte molecule and the structure data of the first crystalline form of the analyte molecule to obtain a supercell of a preset size ;
基于所述待测分子的力场数据,对所述超胞进行能量最小化处理,得到能量最小化处理后的超胞的结构信息。Based on the force field data of the molecules to be tested, the energy minimization process is performed on the supercell to obtain the structural information of the supercell after the energy minimization process.
作为本申请一种可能的实施方式,在该实施例中,平衡模拟模块1220在采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积时,可以用于:As a possible implementation of the present application, in this embodiment, the equilibrium simulation module 1220 adopts the molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, at an isothermal Under the isobaric ensemble, when the first crystal form of the molecule to be measured after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be measured in each coordinate direction The vibration frequency, and the potential energy and volume of the first crystal form of the molecule to be tested can be used for:
在预设的温度区间内,对能量最小化处理后的超胞进行等温等压系综模拟,获得待测分子的第一晶型的超胞的模拟轨迹;In the preset temperature range, the isothermal and isobaric ensemble simulation is performed on the supercell after the energy minimization treatment, and the simulation trajectory of the supercell of the first crystal form of the molecule to be tested is obtained;
在所述预设的温度区间选取多个温度值,在所述待测分子的第一晶型的超胞的模拟轨迹中选取与所述多个温度值分别对应的模拟轨迹帧,分别对多个所述模拟轨迹帧进行等温等压系综模拟,得到所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积;Select a plurality of temperature values in the preset temperature interval, and select simulation trajectory frames corresponding to the plurality of temperature values in the simulation trajectory of the supercell of the first crystal form of the molecule to be measured, respectively for multiple performing an isothermal and isobaric ensemble simulation on each of the simulated trajectory frames to obtain the volumes of the equilibrium states of the first crystal form of the molecule to be measured at the multiple temperature values;
计算所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积的平均值,基于所述平均值选取目标模拟轨迹帧,对所述目标模拟轨迹帧对应的待测分子的第一晶型的超胞进行能量最小化处理,得到能量最小化处理后的分子结构;其中,选择模拟轨迹帧对应的待测分子的第一晶型的体积与所述平均体积的误差最小的模拟轨迹帧为目标模拟轨迹帧;Calculate the average value of the volume of the equilibrium state of the first crystal form of the molecule to be measured at the multiple temperature values, select a target simulation trajectory frame based on the average value, and select the target simulation trajectory frame corresponding to the target simulation trajectory frame. The supercell of the first crystal form of the measured molecule is subjected to energy minimization processing to obtain the molecular structure after energy minimization processing; wherein, the volume of the first crystal form of the molecule to be measured corresponding to the simulated trajectory frame is selected to be equal to the average volume The simulated trajectory frame with the smallest error is the target simulated trajectory frame;
对所述能量最小化处理后的分子结构进行模式分析,计算所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率。Mode analysis is performed on the molecular structure after the energy minimization treatment, and the vibration frequencies of all atoms in the first crystal form of the molecule to be measured are calculated in each coordinate direction.
作为本申请一种可能的实施方式,在该实施例中,平衡模拟模块1220在采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能 和体积时,可以用于:As a possible implementation of the present application, in this embodiment, the equilibrium simulation module 1220 adopts the molecular dynamics method, based on the structural information of the first crystal form of the molecule to be tested after the energy minimization process, at an isothermal Under the isobaric ensemble, when the first crystal form of the molecule to be measured after the energy minimization process is in an equilibrium state at a preset temperature, all atoms of the first crystal form of the molecule to be measured in each coordinate direction The vibration frequency, and the potential energy and volume of the first crystal form of the molecule to be tested can be used for:
依次采用最速下降法和共轭梯度法对所述目标模拟轨迹帧进行能量最小化处理;Using the steepest descent method and the conjugate gradient method in turn to perform energy minimization processing on the target simulation trajectory frame;
当所述能量最小化处理达到预设的标准时,计算进行能量最小化处理后的所述待测分子的第一晶型的势能和体积,得到所述待测分子的第一晶型的势能和体积。When the energy minimization process reaches the preset standard, calculate the potential energy and volume of the first crystal form of the molecule to be tested after the energy minimization process, and obtain the potential energy and volume of the first crystal form of the molecule to be tested volume.
作为本申请一种可能的实施方式,在该实施例中,该分子自由能计算装置还可以用于:As a possible implementation of the present application, in this embodiment, the molecular free energy calculation device can also be used for:
针对所述待测分子的任一晶型,分别采用上述方法,得到在不同预设温度下,所述待测分子不同晶型对应的绝对自由能。For any crystal form of the molecule to be tested, the above method is used respectively to obtain the absolute free energy corresponding to the different crystal forms of the molecule to be tested at different preset temperatures.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不再做详细阐述说明。Regarding the apparatus in the above embodiments, the specific manner in which each module executes operations has been described in detail in the embodiments related to the method, and will not be described in detail here.
本申请提供的技术方案可以包括以下有益效果:本申请实施例通过获取待测分子的力场数据,并对待测分子的第一晶型进行能量最小化处理,然后在等温等压系综下,计算待测分子的第一晶型在预设温度处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线,在针对原子数较多、柔性角较多且各向异性的分子晶体,能够准确计算分子晶体的自由能,节约算力和计算时间。The technical solution provided by the present application may include the following beneficial effects: In the embodiment of the present application, by acquiring the force field data of the molecule to be tested, and performing energy minimization on the first crystal form of the molecule to be tested, and then under the isothermal and isobaric ensemble, Calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction when the first crystal form of the molecule to be measured is in equilibrium at a preset temperature, and the first crystal form of the molecule to be measured potential energy and volume, based on the vibration frequency, the potential energy and volume of the first crystal form of the molecule to be measured, using the preset free energy calculation formula to determine the first crystal form of the molecule to be measured in multiple preset The curve of free energy versus volume at temperature can accurately calculate the free energy of molecular crystals for molecular crystals with more atoms, more flexible angles, and anisotropy, saving computing power and computing time.
图13是本申请实施例示出的分子稳定性分析装置的结构示意图。Fig. 13 is a schematic structural diagram of a molecular stability analysis device shown in an embodiment of the present application.
参见图13,该分子稳定性分析装置120包括绝对自由能计算单元1310、相对自由能计算单元1320、以及稳定晶型确定单元1330,其中:Referring to FIG. 13, the molecular stability analysis device 120 includes an absolute free energy calculation unit 1310, a relative free energy calculation unit 1320, and a stable crystal form determination unit 1330, wherein:
绝对自由能计算单元1310,用于采用上述方法,获得不同预设温度下所述待测分子的不同晶型的绝对自由能;The absolute free energy calculation unit 1310 is used to obtain the absolute free energy of different crystal forms of the molecule to be measured at different preset temperatures by using the above method;
相对自由能计算单元1320,用于以其中任一晶型在不同预设温度下的绝对自由能为参比,将其他晶型在相同预设温度下的绝对自由能减去参比,得到各晶型在不同预设温度下的相对自由能值;The relative free energy calculation unit 1320 is used to use the absolute free energy of any crystal form at different preset temperatures as a reference, and subtract the reference from the absolute free energy of other crystal forms at the same preset temperature to obtain each The relative free energy value of the crystal form at different preset temperatures;
稳定晶型确定单元1330,用于基于各晶型在不同预设温度下的相对自 由能值,确定特定温度下最稳定的晶型,以同一温度下相对自由能值最低对应的晶型为最稳定晶型。The stable crystal form determining unit 1330 is used to determine the most stable crystal form at a specific temperature based on the relative free energy values of each crystal form at different preset temperatures, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不再做详细阐述说明。Regarding the apparatus in the above embodiments, the specific manner in which each module executes operations has been described in detail in the embodiments related to the method, and will not be described in detail here.
本申请实施例通对同一分子的不同晶型的相对自由能进行计算,将同一温度下,相对自由能最低的晶型作为该分子在该温度下最稳定的晶型,能够准确分析出分子在不同温度下的稳定状态。In the examples of the present application, the relative free energy of different crystal forms of the same molecule is generally calculated, and the crystal form with the lowest relative free energy at the same temperature is regarded as the most stable crystal form of the molecule at this temperature, so that the molecule can be accurately analyzed. steady state at different temperatures.
图14是本申请实施例示出的电子设备的结构示意图。FIG. 14 is a schematic structural diagram of an electronic device shown in an embodiment of the present application.
参见图14,电子设备1000包括存储器1010和处理器1020。Referring to FIG. 14 , an electronic device 1000 includes a memory 1010 and a processor 1020 .
处理器1020可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 1020 can be a central processing unit (Central Processing Unit, CPU), and can also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), on-site Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like.
存储器1010可以包括各种类型的存储单元,例如系统内存、只读存储器(ROM)和永久存储装置。其中,ROM可以存储处理器1020或者计算机的其他模块需要的静态数据或者指令。永久存储装置可以是可读写的存储装置。永久存储装置可以是即使计算机断电后也不会失去存储的指令和数据的非易失性存储设备。在一些实施方式中,永久性存储装置采用大容量存储装置(例如磁或光盘、闪存)作为永久存储装置。另外一些实施方式中,永久性存储装置可以是可移除的存储设备(例如软盘、光驱)。系统内存可以是可读写存储设备或者易失性可读写存储设备,例如动态随机访问内存。系统内存可以存储一些或者所有处理器在运行时需要的指令和数据。此外,存储器1010可以包括任意计算机可读存储媒介的组合,包括各种类型的半导体存储芯片(例如DRAM,SRAM,SDRAM,闪存,可编程只读存储器),磁盘和/或光盘也可以采用。在一些实施方式中,存储器1010可以包括可读和/或写的可移除的存储设备,例如激光唱片(CD)、只读数字多功能光盘(例如DVD-ROM,双层DVD-ROM)、只读蓝光光盘、超密度光盘、闪存卡(例如SD卡、min SD卡、Micro-SD卡等)、 磁性软盘等。计算机可读存储媒介不包含载波和通过无线或有线传输的瞬间电子信号。The memory 1010 may include various types of storage units such as system memory, read only memory (ROM), and persistent storage. Wherein, the ROM may store static data or instructions required by the processor 1020 or other modules of the computer. The persistent storage device may be a readable and writable storage device. Persistent storage may be a non-volatile storage device that does not lose stored instructions and data even if the computer is powered off. In some embodiments, the permanent storage device adopts a mass storage device (such as a magnetic or optical disk, flash memory) as the permanent storage device. In some other implementations, the permanent storage device may be a removable storage device (such as a floppy disk, an optical drive). System memory can be a readable and writable storage device or a volatile readable and writable storage device, such as dynamic random access memory. System memory can store some or all of the instructions and data that the processor needs at runtime. In addition, the memory 1010 may include any combination of computer-readable storage media, including various types of semiconductor memory chips (such as DRAM, SRAM, SDRAM, flash memory, programmable read-only memory), and magnetic disks and/or optical disks may also be used. In some embodiments, memory 1010 may include a readable and/or writable removable storage device, such as a compact disc (CD), a read-only digital versatile disc (e.g., DVD-ROM, dual-layer DVD-ROM), Read-only Blu-ray Disc, Super Density Disc, Flash memory card (such as SD card, min SD card, Micro-SD card, etc.), magnetic floppy disk, etc. Computer-readable storage media do not contain carrier waves and transient electronic signals transmitted by wireless or wire.
存储器1010上存储有可执行代码,当可执行代码被处理器1020处理时,可以使处理器1020执行上文述及的方法中的部分或全部。Executable codes are stored in the memory 1010 , and when the executable codes are processed by the processor 1020 , the processor 1020 may execute part or all of the methods mentioned above.
此外,根据本申请的方法还可以实现为一种计算机程序或计算机程序产品,该计算机程序或计算机程序产品包括用于执行本申请的上述方法中部分或全部步骤的计算机程序代码指令。In addition, the method according to the present application can also be implemented as a computer program or computer program product, the computer program or computer program product including computer program code instructions for executing some or all of the steps in the above method of the present application.
或者,本申请还可以实施为一种计算机可读存储介质(或非暂时性机器可读存储介质或机器可读存储介质),其上存储有可执行代码(或计算机程序或计算机指令代码),当可执行代码(或计算机程序或计算机指令代码)被电子设备(或服务器等)的处理器执行时,使处理器执行根据本申请的上述方法的各个步骤的部分或全部。Alternatively, the present application may also be implemented as a computer-readable storage medium (or a non-transitory machine-readable storage medium or a machine-readable storage medium), on which executable code (or computer program or computer instruction code) is stored, When the executable code (or computer program or computer instruction code) is executed by the processor of the electronic device (or server, etc.), the processor is made to perform part or all of the steps of the above-mentioned method according to the present application.
以上已经描述了本申请的各实施例,上述说明是示例性的,并非穷尽性的,并且也不限于所披露的各实施例。在不偏离所说明的各实施例的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。本文中所用术语的选择,旨在最好地解释各实施例的原理、实际应用或对市场中的技术的改进,或者使本技术领域的其它普通技术人员能理解本文披露的各实施例。Having described various embodiments of the present application above, the foregoing description is exemplary, not exhaustive, and is not limited to the disclosed embodiments. Many modifications and alterations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein is chosen to best explain the principle of each embodiment, practical application or improvement of technology in the market, or to enable other ordinary skilled in the art to understand each embodiment disclosed herein.

Claims (15)

  1. 一种分子自由能计算方法,其特征在于,所述方法包括:采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息;采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积;基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线;将所述待测分子的第一晶型在各个预设温度下的自由能随体积的变化曲线中的最小自由能值作为相应预设温度下所述待测分子的第一晶型的绝对自由能。A molecular free energy calculation method, characterized in that the method comprises: using a molecular dynamics method, based on the force field data of the molecule to be measured, performing energy minimization processing on the first crystal form of the molecule to be measured to obtain the energy Minimize the structural information of the first crystal form of the molecule to be measured after processing; using molecular dynamics methods, based on the structural information of the first crystal form of the molecule to be measured after the energy minimization process, in the isothermal and isobaric ensemble Next, calculate the vibration frequencies of all atoms in the first crystal form of the molecule to be tested in each coordinate direction when the first crystal form of the molecule to be tested after energy minimization treatment is in an equilibrium state at a preset temperature, and the potential energy and volume of the first crystal form of the molecule to be measured; based on the vibration frequency, the potential energy and volume of the first crystal form of the molecule to be measured, the preset free energy calculation formula is used to determine the The free energy of the first crystal form of the molecule varies with volume at a plurality of preset temperatures; The free energy value is taken as the absolute free energy of the first crystal form of the molecule to be measured at the corresponding preset temperature.
  2. 根据权利要求1所述的分子自由能计算方法,其特征在于,所述采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息,包括:基于所述待测分子的力场数据和所述待测分子的第一晶型的结构数据,对所述待测分子的第一晶型的晶胞进行扩胞,得到预设尺寸的超胞;基于所述待测分子的力场数据,对所述超胞进行能量最小化处理,得到能量最小化处理后的超胞的结构信息。The molecular free energy calculation method according to claim 1, wherein the molecular dynamics method is used to minimize the energy of the first crystal form of the molecule to be measured based on the force field data of the molecule to be measured, Obtaining the structural information of the first crystal form of the molecule to be tested after the energy minimization process includes: based on the force field data of the molecule to be tested and the structure data of the first crystal form of the molecule to be tested, expand the unit cell of the first crystal form of the molecule to obtain a supercell with a preset size; based on the force field data of the molecule to be measured, perform energy minimization on the supercell to obtain the energy-minimized The structural information of the supercell.
  3. 根据权利要求2所述的分子自由能计算方法,其特征在于,采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,包括:在预设的温度区间内,对能量最小化处理后的超胞进行等温等压系综模拟,获得待测分子的第一晶型的超胞的模拟轨迹;在所述预设的温度 区间选取多个温度值,在所述待测分子的第一晶型的超胞的模拟轨迹中选取与所述多个温度值分别对应的模拟轨迹帧,分别对多个所述模拟轨迹帧进行等温等压系综模拟,得到所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积;计算所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积的平均值,基于所述平均值选取目标模拟轨迹帧,对所述目标模拟轨迹帧对应的待测分子的第一晶型的超胞进行能量最小化处理,得到能量最小化处理后的分子结构;其中,选择模拟轨迹帧对应的待测分子的第一晶型的体积与所述平均体积的误差最小的模拟轨迹帧为目标模拟轨迹帧;对所述能量最小化处理后的分子结构进行模式分析,计算所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率。The molecular free energy calculation method according to claim 2, wherein molecular dynamics is used, based on the structural information of the first crystal form of the molecule to be measured after the energy minimization process, in the isothermal and isobaric ensemble Next, calculate the vibration frequencies of all atoms in the first crystal form of the molecule to be tested in each coordinate direction when the first crystal form of the molecule to be tested after energy minimization treatment is in an equilibrium state at a preset temperature, and the potential energy and volume of the first crystal form of the molecule to be measured, including: performing isothermal and isobaric ensemble simulation on the supercell after energy minimization within a preset temperature range to obtain the first crystal form of the molecule to be measured. The simulated trajectory of the supercell of the crystal form; multiple temperature values are selected in the preset temperature range, and the simulated trajectory of the supercell of the first crystal form of the molecule to be measured is selected separately from the multiple temperature values For the corresponding simulated trajectory frame, perform isothermal and isobaric ensemble simulation on a plurality of the simulated trajectory frames respectively, to obtain the volumes of the equilibrium state of the first crystal form of the molecule to be measured at the multiple temperature values respectively; calculate The average value of the volumes of the equilibrium state of the first crystal form of the molecule to be measured at the multiple temperature values, based on the average value, the target simulation trajectory frame is selected, and the target simulation trajectory frame corresponding to the target The supercell of the first crystal form of the molecule is subjected to energy minimization processing to obtain the molecular structure after energy minimization processing; wherein, the error between the volume of the first crystal form of the molecule to be tested corresponding to the simulated trajectory frame and the average volume is selected The smallest simulated trajectory frame is the target simulated trajectory frame; performing mode analysis on the energy-minimized molecular structure, and calculating the vibration frequencies of all atoms in the first crystal form of the molecule to be measured in each coordinate direction.
  4. 根据权利要求3所述的分子自由能计算方法,其特征在于,采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积,包括:依次采用最速下降法和共轭梯度法对所述目标模拟轨迹帧进行能量最小化处理;当所述能量最小化处理达到预设的标准时,计算进行能量最小化处理后的所述待测分子的第一晶型的势能和体积,得到所述待测分子的第一晶型的势能和体积。The molecular free energy calculation method according to claim 3, characterized in that, using molecular dynamics method, based on the structural information of the first crystal form of the molecule to be measured after the energy minimization process, in the isothermal and isobaric ensemble Next, calculate the vibration frequencies of all atoms in the first crystal form of the molecule to be tested in each coordinate direction when the first crystal form of the molecule to be tested after energy minimization treatment is in an equilibrium state at a preset temperature, and the potential energy and volume of the first crystal form of the molecule to be measured, comprising: sequentially adopting the steepest descent method and the conjugate gradient method to perform energy minimization processing on the target simulation trajectory frame; when the energy minimization processing reaches the predetermined When setting a standard, calculate the potential energy and volume of the first crystal form of the molecule to be detected after the energy minimization process, and obtain the potential energy and volume of the first crystal form of the molecule to be detected.
  5. 根据权利要求1至4任一项所述的分子自由能计算方法,其特征在于,所述方法还包括:针对所述待测分子的任一晶型,分别采用权利要求1-4任一所述的方法,得到在不同预设温度下,所述待测分子不同晶型对应的绝对自由能。The molecular free energy calculation method according to any one of claims 1 to 4, characterized in that the method further comprises: for any crystal form of the molecule to be tested, respectively adopting any of the crystal forms of claims 1-4 According to the method described above, the absolute free energy corresponding to the different crystal forms of the molecule to be tested is obtained at different preset temperatures.
  6. 一种分子稳定性分析方法,其特征在于,包括:采用权利要求5所述的方法,获得不同预设温度下所述待测分子的不同晶型的绝对自由能;以其中任一晶型在不同预设温度下的绝对自由能为参比,将其他晶型在相同预设温度下的绝对自由能减去参比,得到各晶型在不同预设温度下的相 对自由能值;基于各晶型在不同预设温度下的相对自由能值,确定特定温度下最稳定的晶型。A molecular stability analysis method, characterized in that, comprising: adopting the method according to claim 5 to obtain the absolute free energy of different crystal forms of the molecule to be measured at different preset temperatures; The absolute free energy at different preset temperatures is used as a reference, and the absolute free energy of other crystal forms at the same preset temperature is subtracted from the reference to obtain the relative free energy value of each crystal form at different preset temperatures; based on each The relative free energy values of crystal forms at different preset temperatures determine the most stable crystal form at a specific temperature.
  7. 根据权利要求6所述的分子稳定性分析方法,其特征在于,所述基于各晶型在不同预设温度下的相对自由能值,确定特定温度下最稳定的晶型,包括:The molecular stability analysis method according to claim 6, wherein the determination of the most stable crystal form at a specific temperature based on the relative free energy values of each crystal form at different preset temperatures includes:
    将同一温度下相对自由能值最低对应的晶型确定为最稳定晶型。The crystal form corresponding to the lowest relative free energy value at the same temperature was determined as the most stable crystal form.
  8. 一种分子自由能计算装置,其特征在于,所述装置包括:A molecular free energy calculation device, characterized in that the device comprises:
    能量最小化模块,用于采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息;The energy minimization module is used to perform energy minimization processing on the first crystal form of the molecule to be detected based on the force field data of the molecule to be detected by using a molecular dynamics method, so as to obtain the first crystal form of the molecule to be detected after energy minimization processing. - Structural information of the crystal form;
    平衡模拟模块,用于采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积;The balance simulation module is used to calculate the energy-minimized structure information of the first crystal form of the molecule to be tested by using molecular dynamics method under the isothermal and isobaric ensemble. When the first crystalline form of the analyte molecule is in equilibrium at a preset temperature, the vibration frequencies of all atoms in the first crystalline form of the analyte molecule in each coordinate direction, and the first crystalline form of the analyte molecule potential energy and volume;
    曲线计算模块,用于基于所述振动频率、所述待测分子的第一晶型的势能和体积,采用预设的自由能计算公式确定所述待测分子的第一晶型在多个预设温度下的自由能随体积的变化曲线;The curve calculation module is used to determine the first crystal form of the molecule to be tested based on the vibration frequency, the potential energy and the volume of the first crystal form of the molecule to be tested by using a preset free energy calculation formula. Set the curve of free energy versus volume at temperature;
    自由能计算模块,用于将所述待测分子的第一晶型在各个预设温度下的自由能随体积的变化曲线中的最小自由能值作为相应预设温度下所述待测分子的第一晶型的绝对自由能。The free energy calculation module is used to use the minimum free energy value in the free energy versus volume curve of the first crystal form of the molecule to be measured at each preset temperature as the value of the molecule to be measured at the corresponding preset temperature Absolute free energy of the first crystal form.
  9. 根据权利要求7所述的分子自由能计算装置,其特征在于,所述能量最小化模块在采用分子动力学方法,基于待测分子的力场数据对所述待测分子的第一晶型进行能量最小化处理,得到能量最小化处理后的待测分子的第一晶型的结构信息时,用于:基于所述待测分子的力场数据和所述待测分子的第一晶型的结构数据,对所述待测分子的第一晶型的晶胞进行扩胞,得到预设尺寸的超胞;基于所述待测分子的力场数据,对所述超 胞进行能量最小化处理,得到能量最小化处理后的超胞的结构信息。The molecular free energy calculation device according to claim 7, wherein the energy minimization module performs molecular dynamics analysis on the first crystal form of the molecule to be tested based on the force field data of the molecule to be tested. Energy minimization processing, when obtaining the structural information of the first crystal form of the molecule to be tested after the energy minimization process, is used to: based on the force field data of the molecule to be tested and the first crystal form of the molecule to be tested Structural data, expanding the unit cell of the first crystal form of the molecule to be tested to obtain a supercell of a preset size; based on the force field data of the molecule to be tested, performing energy minimization processing on the supercell , to obtain the structural information of the supercell after energy minimization.
  10. 根据权利要求7所述的分子自由能计算装置,其特征在于,所述平衡模拟模块在采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积时,用于:在预设的温度区间内,对能量最小化处理后的超胞进行等温等压系综模拟,获得待测分子的第一晶型的超胞的模拟轨迹;在所述预设的温度区间选取多个温度值,在所述待测分子的第一晶型的超胞的模拟轨迹中选取与所述多个温度值分别对应的模拟轨迹帧,分别对多个所述模拟轨迹帧进行等温等压系综模拟,得到所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积;计算所述待测分子的第一晶型分别在所述多个温度值下的平衡状态的体积的平均值,基于所述平均值选取目标模拟轨迹帧,对所述目标模拟轨迹帧对应的待测分子的第一晶型的超胞进行能量最小化处理,得到能量最小化处理后的分子结构;其中,选择模拟轨迹帧对应的待测分子的第一晶型的体积与所述平均体积的误差最小的模拟轨迹帧为目标模拟轨迹帧;对所述能量最小化处理后的分子结构进行模式分析,计算所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率。The molecular free energy calculation device according to claim 7, wherein the balance simulation module adopts a molecular dynamics method, based on the structural information of the first crystal form of the molecule to be measured after the energy minimization process, Under the isothermal and isobaric ensemble, when the first crystal form of the molecule to be measured after the energy minimization process is in an equilibrium state at a preset temperature, all the atoms of the first crystal form of the molecule to be measured are in each The vibration frequency in the coordinate direction, and the potential energy and volume of the first crystal form of the molecule to be measured are used for: performing an isothermal and isobaric ensemble simulation on the energy-minimized supercell within a preset temperature range , to obtain the simulated trajectory of the supercell of the first crystal form of the molecule to be measured; select a plurality of temperature values in the preset temperature range, and select from the simulated trajectory of the supercell of the first crystal form of the molecule to be measured The simulated trajectory frames corresponding to the plurality of temperature values respectively perform isothermal and isobaric ensemble simulation on the plurality of simulated trajectory frames to obtain the first crystal form of the molecule to be measured at the plurality of temperature values respectively The volume of the equilibrium state under the condition; calculate the average value of the volume of the equilibrium state of the first crystal form of the molecule to be measured at the multiple temperature values, and select the target simulation trajectory frame based on the average value, and the The supercell of the first crystal form of the molecule to be measured corresponding to the target simulation trajectory frame is subjected to energy minimization processing to obtain the molecular structure after energy minimization processing; wherein, the first crystal form of the molecule to be measured corresponding to the simulation trajectory frame is selected. The simulated trajectory frame with the smallest error between the volume and the average volume is the target simulated trajectory frame; the molecular structure after the energy minimization process is subjected to pattern analysis, and all atoms of the first crystal form of the molecule to be measured are calculated in each Vibration frequency in the coordinate direction.
  11. 根据权利要求7所述的分子自由能计算装置,其特征在于,所述平衡模拟模块在采用分子动力学方法,基于所述能量最小化处理后的待测分子的第一晶型的结构信息,在等温等压系综下,计算能量最小化处理后的所述待测分子的第一晶型在预设温度下处于平衡状态时,所述待测分子的第一晶型的所有原子在各个坐标方向的振动频率,和所述待测分子的第一晶型的势能和体积时,用于:依次采用最速下降法和共轭梯度法对所述目标模拟轨迹帧进行能量最小化处理;当所述能量最小化处理达到预设的标准时,计算进行能量最小化处理后的所述待测分子的第一晶型的势能和体积,得到所述待测分子的第一晶型的势能和体积。The molecular free energy calculation device according to claim 7, wherein the balance simulation module adopts a molecular dynamics method, based on the structural information of the first crystal form of the molecule to be measured after the energy minimization process, Under the isothermal and isobaric ensemble, when the first crystal form of the molecule to be measured after the energy minimization process is in an equilibrium state at a preset temperature, all the atoms of the first crystal form of the molecule to be measured are in each The vibration frequency in the coordinate direction, and the potential energy and volume of the first crystal form of the molecule to be measured are used to: sequentially adopt the steepest descent method and the conjugate gradient method to perform energy minimization processing on the target simulation trajectory frame; when When the energy minimization process reaches the preset standard, calculate the potential energy and volume of the first crystal form of the molecule to be tested after the energy minimization process, to obtain the potential energy and volume of the first crystal form of the molecule to be tested .
  12. 根据权利要求7所述的分子自由能计算装置,其特征在于,所述装置还用于:针对所述待测分子的任一晶型,分别采用权利要求1-4任一所述的方法,得到在不同预设温度下,所述待测分子不同晶型对应的绝对自由能。The molecular free energy calculation device according to claim 7, characterized in that the device is also used to: adopt the method described in any one of claims 1-4 for any crystal form of the molecule to be measured, Obtain the absolute free energy corresponding to the different crystal forms of the molecule to be tested at different preset temperatures.
  13. 一种分子稳定性分析装置,其特征在于,所述装置包括:A molecular stability analysis device, characterized in that the device comprises:
    绝对自由能计算单元,用于采用权利要求5所述的方法,获得不同预设温度下所述待测分子的不同晶型的绝对自由能;The absolute free energy calculation unit is used to obtain the absolute free energy of different crystal forms of the molecule to be measured at different preset temperatures by using the method described in claim 5;
    相对自由能计算单元,用于以其中任一晶型在不同预设温度下的绝对自由能为参比,将其他晶型在相同预设温度下的绝对自由能减去参比,得到各晶型在不同预设温度下的相对自由能值;The relative free energy calculation unit is used to use the absolute free energy of any crystal form at different preset temperatures as a reference, and subtract the reference from the absolute free energy of other crystal forms at the same preset temperature to obtain each crystal The relative free energy value of the type at different preset temperatures;
    稳定晶型确定单元,用于基于各晶型在不同预设温度下的相对自由能值,确定特定温度下最稳定的晶型,以同一温度下相对自由能值最低对应的晶型为最稳定晶型。The stable crystal form determination unit is used to determine the most stable crystal form at a specific temperature based on the relative free energy values of each crystal form at different preset temperatures, and the crystal form corresponding to the lowest relative free energy value at the same temperature is the most stable crystal form.
  14. 一种设备,其特征在于,包括:处理器;以及存储器,其上存储有可执行代码,当所述可执行代码被所述处理器执行时,使所述处理器执行如权利要求1-6中任一项所述的方法。A device, characterized by comprising: a processor; and a memory on which executable codes are stored, and when the executable codes are executed by the processor, the processor is made to execute the method according to claims 1-6. any one of the methods described.
  15. 一种存储介质,其上存储有可执行代码,当所述可执行代码被电子设备的处理器执行时,使所述处理器执行如权利要求1-6中任一项所述的方法。A storage medium, on which executable codes are stored, and when the executable codes are executed by a processor of an electronic device, the processor is made to execute the method according to any one of claims 1-6.
PCT/CN2021/136976 2021-12-10 2021-12-10 Method and apparatus for computing free energy of molecule, and stability analysis method and apparatus, and device and storage medium WO2023102870A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/136976 WO2023102870A1 (en) 2021-12-10 2021-12-10 Method and apparatus for computing free energy of molecule, and stability analysis method and apparatus, and device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/136976 WO2023102870A1 (en) 2021-12-10 2021-12-10 Method and apparatus for computing free energy of molecule, and stability analysis method and apparatus, and device and storage medium

Publications (1)

Publication Number Publication Date
WO2023102870A1 true WO2023102870A1 (en) 2023-06-15

Family

ID=86729338

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/136976 WO2023102870A1 (en) 2021-12-10 2021-12-10 Method and apparatus for computing free energy of molecule, and stability analysis method and apparatus, and device and storage medium

Country Status (1)

Country Link
WO (1) WO2023102870A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107004064A (en) * 2014-09-30 2017-08-01 国立大学法人大阪大学 Free energy computing device, method, program and record have the recording medium of the program
US20200134246A1 (en) * 2018-05-10 2020-04-30 Shenzhen Jingtai Technology Co., Ltd. Gromacs cloud computing process control method
CN112199909A (en) * 2020-10-22 2021-01-08 深圳晶泰科技有限公司 Method for accurately calculating absolute free energy of gas molecules
US20210193270A1 (en) * 2019-12-20 2021-06-24 International Business Machines Corporation Quantum computing thermodynamic observables of a chemical system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107004064A (en) * 2014-09-30 2017-08-01 国立大学法人大阪大学 Free energy computing device, method, program and record have the recording medium of the program
US20200134246A1 (en) * 2018-05-10 2020-04-30 Shenzhen Jingtai Technology Co., Ltd. Gromacs cloud computing process control method
US20210193270A1 (en) * 2019-12-20 2021-06-24 International Business Machines Corporation Quantum computing thermodynamic observables of a chemical system
CN112199909A (en) * 2020-10-22 2021-01-08 深圳晶泰科技有限公司 Method for accurately calculating absolute free energy of gas molecules

Similar Documents

Publication Publication Date Title
Naets et al. An online coupled state/input/parameter estimation approach for structural dynamics
Terwilliger Map-likelihood phasing
Byun et al. Towards optimal cosmological parameter recovery from compressed bispectrum statistics
Minami et al. Performance evaluation of nonlinear algorithms with line‐search for partitioned coupling techniques for fluid–structure interactions
Angélil et al. Homogeneous SPC/E water nucleation in large molecular dynamics simulations
TW200905166A (en) Auto-calibration of orientation sensing system
Dhabal et al. Excess entropy and crystallization in Stillinger-Weber and Lennard-Jones fluids
JP2009541866A (en) Die-unit temperature programming for thermally efficient integrated circuit (IC) operation
JP2016503495A (en) Estimation of gravity vector in world coordinate system using accelerometer in mobile device
Grønbech-Jensen et al. Constant pressure and temperature discrete-time Langevin molecular dynamics
CN115394365A (en) Crystal structure similarity calculation method, screening method, device and equipment
WO2023102870A1 (en) Method and apparatus for computing free energy of molecule, and stability analysis method and apparatus, and device and storage medium
US9684747B2 (en) Parameter modeling for semiconductor arrangements
Rutkai et al. Dynamic Monte Carlo simulation in mixtures
JP2011257397A (en) Automatic data collection algorithm for 3d magnetic field calibration having reduced memory requirement
JP5589665B2 (en) Analysis device, analysis program, and analysis method
CN114187971B (en) Molecular free energy calculation and stability analysis method, device, equipment and storage medium
Leong et al. A molecular dynamics investigation of the surface tension of water nanodroplets and a new technique for local pressure determination through density correlation
US20160356632A1 (en) Sensor degradation compensation
Chapwanya et al. A soft condensed matter approach towards mathematical modelling of mass transport and swelling in food grains
JP2021012078A (en) Temperature distribution estimation device and temperature distribution estimation method
US11415590B2 (en) Method and apparatus with posture estimation
CN109740216B (en) Modal separation degree judging method for structural model correction
WO2017013726A1 (en) Method and device for calculating acid dissociation constant, and program
Yang et al. Effect of polydispersity on the relative stability of hard-sphere crystals

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21966792

Country of ref document: EP

Kind code of ref document: A1