CN109635473B - Heuristic high-flux material simulation calculation optimization method - Google Patents

Heuristic high-flux material simulation calculation optimization method Download PDF

Info

Publication number
CN109635473B
CN109635473B CN201811556665.5A CN201811556665A CN109635473B CN 109635473 B CN109635473 B CN 109635473B CN 201811556665 A CN201811556665 A CN 201811556665A CN 109635473 B CN109635473 B CN 109635473B
Authority
CN
China
Prior art keywords
job
execution
unexecuted
execution mode
executed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811556665.5A
Other languages
Chinese (zh)
Other versions
CN109635473A (en
Inventor
都志辉
王崇愚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN201811556665.5A priority Critical patent/CN109635473B/en
Publication of CN109635473A publication Critical patent/CN109635473A/en
Application granted granted Critical
Publication of CN109635473B publication Critical patent/CN109635473B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Evolutionary Computation (AREA)
  • Geometry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a heuristic high-flux material simulation calculation optimization method, and belongs to the field of material science. The method comprises the steps of firstly, determining a shared execution mode of all operations in high-throughput material calculation simulation; then selecting one unexecuted operation from the model with the maximum number of unexecuted operations in the high-throughput material calculation simulation and executing the operation in an independent execution mode; and acquiring the next execution operation by utilizing the heuristic information until all operations in the high-flux material calculation simulation are executed. The invention uses the adjacent relation between different elements in the periodic table of elements and the atomic radius of different elements as heuristic information, which can greatly improve the efficiency of high-flux operation execution and greatly shorten the time of new material design.

Description

Heuristic high-flux material simulation calculation optimization method
Technical Field
The invention belongs to the field of material science, and particularly relates to a heuristic high-flux material simulation calculation optimization method.
Background
At present, the acquisition of new materials is shifted from a traditional mode of finding new materials through a large number of experiments to a high-throughput calculation mode of designing new materials through a large number of simulation calculations, so that the acquisition efficiency of new materials can be greatly improved.
The high-throughput material calculation is a design that a large batch of calculation tasks can be rapidly completed at one time by means of calculation resources with strong capacity, and candidate materials meeting requirements are screened out through analysis of calculation results. Such computing tasks, in their particular form, exist as jobs on high performance computing systems, and are referred to as high throughput material computing jobs. Because the calculation amount is generally very large, how to optimize and improve the performance becomes a challenging problem. Currently, mainstream performance optimization methods are performed for a single high-throughput material calculation operation. The method has the same optimization effect on other similar jobs as long as the performance of one job can be improved. However, such methods have the disadvantages that they are optimized only from the local information of a single job, the physical internal association between different jobs is not considered, and the more significant optimization potential possibly brought by the analysis from the macroscopic and overall perspectives is neglected, so the optimization effect is easily limited by various local factors, and the optimization performance of the whole job cannot be improved more comprehensively and greatly.
High-throughput material calculation represented by material genetic engineering is mainly carried out at present on the automatic connection of all stages of calculation, typical projects are international AFlow and MP projects, but no good solution exists on how to greatly reduce the execution time of operation, particularly the execution time of optimization calculation-intensive operation.
Disclosure of Invention
The invention aims to overcome the defect that the prior art cannot fully optimize the execution time of calculation-intensive operation, and provides a heuristic high-flux material simulation calculation optimization method. The invention combines the characteristics of the high-flux material calculation simulation task, opens up a brand-new path to realize performance optimization, is easy to implement and can obtain very obvious effect. The efficiency of high flux operation execution can be greatly improved, and the time for designing a new material is greatly shortened.
The invention provides a heuristic high-flux material simulation calculation optimization method which is characterized by comprising the following steps of:
1) determining a shared execution mode of all the jobs in the high-flux material calculation simulation; the method comprises the following specific steps:
1-1) selecting two operations in any model of high-flux material calculation simulation, wherein doping elements of the two operations are adjacent elements, and the two operations are respectively marked as X and Y;
1-2) executing X in an independent execution mode;
1-3) Y shares X execution by adopting different sharing execution modes, and selects the sharing execution mode with the shortest execution time as the sharing execution mode adopted by all the jobs;
2) selecting one unexecuted operation from a model containing the largest number of unexecuted operations in the high-throughput material calculation simulation and executing the operation in an independent execution mode; the specific selection method is as follows:
in the model containing the maximum number of the unexecuted jobs, counting the number of the shared jobs which can be adjacent to each unexecuted job, selecting the unexecuted job with the maximum number of the shared jobs which can be adjacent to each unexecuted job, and executing the unexecuted job in an independent execution mode; if a plurality of unexecuted jobs with the same number of adjacent sharable jobs exist, one of the unexecuted jobs is randomly selected and then executed in an independent execution mode; wherein, the shared operation which can be adjacent to each operation is the operation of the doping element adjacent to the operation doped element;
3) acquiring the next execution operation by utilizing heuristic information until all operations in the high-flux material calculation simulation are executed; the method comprises the following specific steps:
3-1) determining all sharable execution job pairs, wherein the sharable execution job pairs meet the following requirements: two jobs in each job pair belong to the same model, wherein one job is executed and the other job is not executed;
3-2) determining sharable execution job pairs:
if the shareable execution job pair exists, calculating the difference between the atomic radii of the two doping elements of each job pair, selecting one job pair with the smallest absolute value of the difference, executing the unexecuted jobs in the job pair in the shared execution mode determined in the step 1), and returning to the step 2); if no shareable execution job pair exists and no executed job exists in the high-throughput material calculation simulation, returning to the step 2) again; and ending the method until all the operations in the high-flux material calculation simulation are finished.
The invention has the characteristics and beneficial effects that:
(1) the method provided by the invention is optimized by analyzing the internal correlation among a plurality of simulation operations in the high-throughput material calculation, and is an optimization method for focusing on the whole operation. The method is completely different from the prior single-job performance optimization method, but the prior local optimization method aiming at the single job can be independently combined into the method, and the two methods are in a complementary and superposed relationship.
(2) The optimization method provided by the invention has very obvious overall effect, and although the possible effect of a few operations is not obvious, the performance of most operations is obviously improved. Can greatly improve the efficiency of high-flux operation execution and greatly shorten the time of new material design
(3) The method only needs very little well-known information, namely the adjacent relation between different elements in the periodic table of the elements and the atomic radius of the different elements, can analyze the similarity degree between different operations and make optimization decision on the basis, and the decision-making method is simple and efficient.
Detailed Description
The invention provides a heuristic high-flux material simulation calculation optimization method, which is further detailed below by combining specific embodiments.
The invention provides a heuristic High-Throughput material Simulation calculation optimization method, which assumes that a High-Throughput material calculation Simulation (HTCS) is composed of M models, and the expression is as follows:
Figure BDA0001912109720000031
where M represents the total number of high-throughput material calculation models, Model i Representing the ith model.
Wherein the ith model may be represented by N i A job is composed, so that the Model can be further modeled i Expressed as:
Figure BDA0001912109720000032
wherein S is i,j A j-th job representing an i-th model;
for different operations corresponding to the same model, the doping elements are different from each other, and other aspects (referring to the initial settings of all input files) are the same.
How the job is calculated and optimized is specifically described below based on widely used VASP software. Two implementations of high-throughput operation are first defined here: an independent execution mode and a shared execution mode.
The independent execution mode is a mode in which the job is executed on the parallel computing system based on input files of the INCAR, POSCAR, POTCAR, and KPOINT files independently generated for the job at the time of design, and has no relation to other jobs.
The shared execution mode refers to two jobs A and B, wherein A is a completed job, B is a job to be executed, and B optimizes the running efficiency of A by sharing the result of the execution of A. Specifically, three different shared execution modes are shared CONTCAR execution, shared CHGCAR execution, and shared CONTCAR and CHGCAR execution at the same time. The execution of the B-shared-A CONTCAR is that B replaces POSCAR generated in the original design stage of B with the CONTCAR obtained after A is executed. B-share a's CHGCAR implementation means that B uses the CHGCAR of a's output as an additional input file to get its charge distribution, which requires modifying its original INCAR file at the same time, setting its icharge flag to 1. B-shared a CONTCAR and CHGCAR execution means that B executes with a's CONTCAR as its POSCAR, but also with a's CHGCAR as an additional input to get its charge distribution, while setting its icharge flag in its INCAR file to 1. The effect of the three shared execution modes will be different due to different operations.
In the following we define the concept of adjacent elements, for two elements EA and EB in the periodic table, if they are in the same row of the periodic table and are directly adjacent, they are called row-adjacent elements each other. If they are in the same column of the periodic table of elements and are immediately adjacent, they are mutually referred to as column-adjacent elements. Two elements, whether row-adjacent or column-adjacent, are referred to as adjacent elements.
The concept of shareable jobs is further defined below. For operations based on the same model, since they are different only in doping element, operations adjacent to each other in doping element are defined as mutually adjacently sharable operations.
The invention provides a heuristic high-flux material simulation calculation optimization method, which comprises the following steps:
1) determining a shared execution mode of all the jobs in the high-flux material calculation simulation; the method comprises the following specific steps:
1-1) selecting two operations in any model of high-flux material calculation simulation, wherein doping elements of the two operations are adjacent elements, and the two operations are respectively marked as X and Y;
1-2) executing X in an independent execution mode, and storing all results obtained after execution in an output file of the operation, wherein the output result can describe some basic characteristics of a given material design.
1-3) Y shares X execution by using three different shared execution modes, and selects the shared execution mode with the shortest execution time as the shared execution mode adopted by all the jobs.
If the workload is very large, a plurality of groups of different X and Y jobs can be selected from different models to carry out advanced test, and the sharing execution mode is selected according to the average effect.
2) Selecting one unexecuted operation from a model containing the largest number of unexecuted operations in the high-throughput material calculation simulation and executing the operation in an independent execution mode; the specific selection method is as follows:
in the model containing the maximum number of the unexecuted jobs, counting the number of the shared jobs which can be adjacent to each unexecuted job, selecting the unexecuted job with the maximum number of the shared jobs which can be adjacent to each unexecuted job, and executing the unexecuted job in an independent execution mode; if there are a plurality of unexecuted jobs, the number of the shared jobs which can be adjacent is the same. Randomly selecting one of the unexecuted jobs, and then executing the job in an independent execution mode;
3) acquiring the next execution operation by utilizing heuristic information until all operations in the high-flux material calculation simulation are executed; the method comprises the following specific steps:
3-1) determining all sharable execution job pairs, wherein the sharable execution job pairs meet the following requirements: two jobs in each job pair belong to the same model, wherein one job is executed and the other job is not executed; all job pairs < a1, B1>, …, < Am, Bm > … that meet the above conditions will be recorded.
3-2) determining sharable execution job pairs:
if the shareable execution job pair exists, calculating the difference between the atomic radii of the two doping elements of each job pair, selecting one job pair with the smallest absolute value of the difference, executing the unexecuted jobs in the job pair in the shared execution mode determined in the step 1), and returning to the step 2); if no shareable execution job pair exists and no executed job exists in the high-throughput material calculation simulation, returning to the step 2) again; and ending the method until all the operations in the high-flux material calculation simulation are finished.

Claims (1)

1. A heuristic high-flux material simulation calculation optimization method is characterized by comprising the following steps:
1) determining a shared execution mode of all the jobs in the high-flux material calculation simulation; the method comprises the following specific steps:
1-1) selecting two operations in any model of high-flux material calculation simulation, wherein doping elements of the two operations are adjacent elements, and the two operations are respectively marked as X and Y;
1-2) executing X in an independent execution mode;
1-3) Y shares X execution by adopting different sharing execution modes, and selects the sharing execution mode with the shortest execution time as the sharing execution mode adopted by all the jobs;
the shared execution mode includes: sharing a CONTCAR execution mode, sharing a CHGCAR execution mode, and sharing both a CONTCAR and CHGCAR execution mode simultaneously; the shared CONTCAR execution mode refers to that the original POSCAR of the unexecuted operation is replaced by the CONTCAR obtained by the executed operation for execution; the shared CHGCAR execution mode is that an unexecuted job obtains corresponding charge distribution by using CHGCAR output by an executed job as an additional input file, and simultaneously modifies an original INCAR file of the unexecuted job and sets an ICHARG mark in the INCAR file to be 1; the shared CONTCAR and CHGCAR execution mode means that the non-executed job is executed by using a CONTCAR obtained by the executed job to replace an original POSCAR of the non-executed job, the non-executed job uses the CHGCAR output by the executed job as an additional input file to obtain a corresponding charge distribution, and simultaneously modifies an original INCAR file of the non-executed job, and an ICHARG mark in the non-executed job is set to be 1;
2) selecting one unexecuted operation from a model containing the largest number of unexecuted operations in the high-throughput material calculation simulation and executing the operation in an independent execution mode; the specific selection method is as follows:
in the model containing the maximum number of the unexecuted jobs, counting the number of the shared jobs which can be adjacent to each unexecuted job, selecting the unexecuted job with the maximum number of the shared jobs which can be adjacent to each unexecuted job, and executing the unexecuted job in an independent execution mode; if a plurality of unexecuted jobs with the same number of adjacent sharable jobs exist, one of the unexecuted jobs is randomly selected and then executed in an independent execution mode; wherein the adjacently sharable operation of each operation is an operation in which the doping element is adjacent to the operation doping element;
3) acquiring the next execution operation by utilizing heuristic information until all operations in the high-flux material calculation simulation are executed; the method comprises the following specific steps:
3-1) determining all sharable execution job pairs, wherein the sharable execution job pairs meet the following requirements: two jobs in each job pair belong to the same model, wherein one job is executed and the other job is not executed;
3-2) determining sharable execution job pairs:
if the shareable execution job pair exists, calculating the difference between the atomic radii of the two doping elements of each job pair, selecting one job pair with the smallest absolute value of the difference, executing the unexecuted jobs in the job pair in the shared execution mode determined in the step 1), and returning to the step 2); if no shareable execution job pair exists and no executed job exists in the high-throughput material calculation simulation, returning to the step 2) again; and ending the method until all the operations in the high-flux material calculation simulation are finished.
CN201811556665.5A 2018-12-19 2018-12-19 Heuristic high-flux material simulation calculation optimization method Active CN109635473B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811556665.5A CN109635473B (en) 2018-12-19 2018-12-19 Heuristic high-flux material simulation calculation optimization method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811556665.5A CN109635473B (en) 2018-12-19 2018-12-19 Heuristic high-flux material simulation calculation optimization method

Publications (2)

Publication Number Publication Date
CN109635473A CN109635473A (en) 2019-04-16
CN109635473B true CN109635473B (en) 2022-08-02

Family

ID=66075503

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811556665.5A Active CN109635473B (en) 2018-12-19 2018-12-19 Heuristic high-flux material simulation calculation optimization method

Country Status (1)

Country Link
CN (1) CN109635473B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334055B (en) * 2019-06-06 2023-04-11 苏州迈高材料基因技术有限公司 Method for acquiring material calculation data
CN113127973B (en) * 2021-04-16 2022-05-10 湖南大学 CAE simulation technology-based multi-material intelligent material selection method and system and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1319761A (en) * 2000-03-15 2001-10-31 清华大学 Apparatus and method for high-flux electric rotation detection
CN103631988A (en) * 2013-10-22 2014-03-12 芜湖大学科技园发展有限公司 Multi-user simulation data management platform for electric power system simulation
CN104992258A (en) * 2012-07-05 2015-10-21 爱利门图供应链管理(开曼)有限公司 Method and system for controlling supply chains
CN108123109A (en) * 2016-11-28 2018-06-05 华为技术有限公司 Lithium cobaltate cathode material and preparation method thereof and lithium rechargeable battery

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9342355B2 (en) * 2013-06-20 2016-05-17 International Business Machines Corporation Joint optimization of multiple phases in large data processing

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1319761A (en) * 2000-03-15 2001-10-31 清华大学 Apparatus and method for high-flux electric rotation detection
CN104992258A (en) * 2012-07-05 2015-10-21 爱利门图供应链管理(开曼)有限公司 Method and system for controlling supply chains
CN103631988A (en) * 2013-10-22 2014-03-12 芜湖大学科技园发展有限公司 Multi-user simulation data management platform for electric power system simulation
CN108123109A (en) * 2016-11-28 2018-06-05 华为技术有限公司 Lithium cobaltate cathode material and preparation method thereof and lithium rechargeable battery

Also Published As

Publication number Publication date
CN109635473A (en) 2019-04-16

Similar Documents

Publication Publication Date Title
US20210373888A1 (en) Multi-objective optimization method and system for master production plan of casting parallel workshops
CN105446979B (en) Data digging method and node
CN109815537B (en) High-flux material simulation calculation optimization method based on time prediction
Wang et al. Predator—An experience guided configuration optimizer for Hadoop MapReduce
CN107908536B (en) Performance evaluation method and system for GPU application in CPU-GPU heterogeneous environment
CN104750780B (en) A kind of Hadoop configuration parameter optimization methods based on statistical analysis
WO2016025924A1 (en) Systems and methods for auto-scaling a big data system
CN109522428B (en) External memory access method of graph computing system based on index positioning
CN106446134B (en) Local multi-query optimization method based on predicate specification and cost estimation
CN102799486A (en) Data sampling and partitioning method for MapReduce system
CN102855153B (en) Towards the stream compile optimization method of chip polycaryon processor
CN109635473B (en) Heuristic high-flux material simulation calculation optimization method
CN106371924B (en) A kind of method for scheduling task minimizing MapReduce cluster energy consumption
CN105468439A (en) Adaptive parallel algorithm for traversing neighbors in fixed radius under CPU-GPU (Central Processing Unit-Graphic Processing Unit) heterogeneous framework
JP6694447B2 (en) Big data calculation method and system, program, and recording medium
CN104408106A (en) Scheduling method for big data inquiry in distributed file system
Schlag et al. Scalable edge partitioning
Xu et al. Distributed maximal clique computation and management
CN106202092A (en) The method and system that data process
CN105654187A (en) Grid binary tree method of control system midpoint locating method
CN105205052A (en) Method and device for mining data
CN105867998A (en) Virtual machine cluster deployment algorithm
CN106502790A (en) A kind of task distribution optimization method based on data distribution
CN111553834A (en) Concurrent graph data preprocessing method based on FPGA
CN111078380A (en) Multi-target task scheduling method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant