CN114003385A - Parallelization method for improving post-processing performance - Google Patents
Parallelization method for improving post-processing performance
- Publication number
- CN114003385A (application CN202111287291.3A)
- Authority
- CN
- China
- Prior art keywords
- file
- mpi
- calling
- function
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/061—Interfaces specially adapted for storage systems: improving I/O performance
- G06F3/0643—Interfaces specially adapted for storage systems: management of files
- G06F9/5044—Allocation of resources, e.g. of the central processing unit [CPU], to service a request, the resource being a machine, considering hardware capabilities
- G06F9/5055—Allocation of resources, e.g. of the central processing unit [CPU], to service a request, the resource being a machine, considering software capabilities
- G06F9/544—Interprogram communication: buffers; shared memory; pipes
- G06F9/547—Interprogram communication: remote procedure calls [RPC]; web services
- G06F16/176—File systems: support for shared access to files; file sharing support
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Devices For Executing Special Programs (AREA)
Abstract
The invention discloses a parallelization method for improving post-processing performance, comprising the following steps: each process opens the file by calling the MPI file-open function; each process moves the globally shared file pointer to its position in the file, determined by its read-address offset, by calling the MPI file-seek function, all processes performing I/O through a single shared file pointer; each process reads its portion of the data from the file into a memory buffer by calling the MPI file-read function; each process closes the file and ends the I/O operation by calling the MPI file-close function; a data partitioning algorithm divides the data into several mutually independent sub-datasets; and the sub-datasets are written into the visualization object collectively in parallel, and the visualization object is sent to the master process for efficient parallel display.
Description
Technical Field
The method is applied to post-processing software in the field of simulation software, and is particularly intended for parallelizing the visualization and data-I/O stages of large-scale data.
Background
Physical and engineering simulation software performs simulation and numerical calculation in many fields using methods such as the finite element method, covering mechanics, fluid dynamics, electromagnetics, optics, acoustics, electrochemistry, chemical engineering, semiconductors, and other domains. The data produced by simulation software is often on the order of gigabytes to hundreds of gigabytes, and when such large results undergo visualization post-processing, reading, processing, and inspecting the data becomes inconvenient and the user often has to wait for a long time.
Current simulation-analysis software reads and displays results on the order of tens of gigabytes slowly, consuming considerable manpower and time. To improve post-processing efficiency for simulation users, a method is therefore needed that completes reading and display quickly and saves the user's time.
Disclosure of Invention
Aiming at the problem that post-processing of large-scale simulation result data takes too long, the invention provides a method for parallelizing post-processing on the multi-core CPUs of a high-performance computer, thereby greatly improving post-processing speed and efficiency.
The invention solves the technical problems through the following technical scheme:
the invention provides a parallelization method for improving post-processing performance, which is characterized by comprising the following steps of:
S1, in the I/O of parallel MPI (Message Passing Interface), each process opens the file by calling the MPI file-open function;
S2, each process moves the globally shared file pointer to its position in the file, determined by its read-address offset, by calling the MPI file-seek function, all processes performing I/O through this single shared file pointer;
S3, each process reads its portion of the data from the file into a memory buffer by calling the MPI file-read function;
S4, each process closes the file and ends its I/O operation by calling the MPI file-close function;
S5, dividing the data into a plurality of mutually independent sub-datasets by using a data partitioning algorithm;
S6, writing the sub-datasets into the visualization object collectively in parallel, and sending the visualization object to the master process for efficient parallel display.
Preferably, in step S1, the first parameter of the file-open function is the communicator, the second parameter is the path and name of the file to open, the third parameter is the file-access mode, the fourth parameter passes information to the I/O implementation as key-value pairs attached to an MPI info object, and the fifth parameter is the globally shared file handle returned by the operation and used by subsequent I/O operations; through the info object of the fourth parameter, hints such as file striping and internal buffer sizes can be passed to optimize MPI I/O.
On the basis of the common knowledge in the field, the above preferred conditions can be combined randomly to obtain the preferred embodiments of the invention.
The positive progress effects of the invention are as follows:
the method provided by the invention can improve the performance of the simulation calculation software from the parallelization point of view aiming at a large number of calculation results of the simulation calculation software, accelerate the post-processing speed of the simulation data from the aspect of combining software and hardware, and avoid that a software user consumes too much time when facing huge data in the post-processing stage, thereby improving the post-processing efficiency while ensuring the accuracy.
Drawings
FIG. 1 is a flow chart of a parallelization method for improving post-processing performance according to a preferred embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
As shown in fig. 1, the present embodiment provides a parallelization method for improving post-processing performance, which is characterized in that it includes the following steps:
in step 101, in the I/O of the parallel MPI (information transfer interface), each process opens a file by calling a file operation function of the MPI.
In step 101, a first parameter of the file operation function represents a communication domain, a second parameter represents a file path and a file name that need to be opened, a third parameter represents a file opening manner, a fourth parameter transmits information to an I/O implementation by attaching a pair of key values to an information object that declares MPI, a fifth parameter represents a global shared file pointer, is a handle returned by the I/O operation, and can be used for a subsequent I/O operation, wherein in the object of the fourth parameter, information such as file fragments and an internal buffer size can be transmitted for optimizing the I/O operation of the MPI.
In step 102, each process moves the globally shared file pointer to its position in the file, determined by its read-address offset, by calling the MPI file-seek function; all processes perform I/O through this single shared file pointer.
In step 103, each process reads its portion of the data from the file into a memory buffer by calling the MPI file-read function.
In step 104, each process closes the file and ends its I/O operation by calling the MPI file-close function.
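The shared-file-pointer routines of the MPI standard map directly onto steps 101 to 104. The following C sketch is illustrative only: the file name, datatype, element count, and the `cb_buffer_size` hint are assumptions rather than details from the patent, and `MPI_File_read_ordered` is used so that the single shared pointer hands each rank its block in rank order.

```c
#include <mpi.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, nprocs;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    /* Step 101: every process opens the same file collectively,
       passing optimization hints through an MPI info object. */
    MPI_Info info;
    MPI_Info_create(&info);
    MPI_Info_set(info, "cb_buffer_size", "16777216");  /* collective-buffering hint (assumed value) */
    MPI_File fh;
    MPI_File_open(MPI_COMM_WORLD, "result.dat", MPI_MODE_RDONLY, info, &fh);

    /* Step 102: collectively position the single shared file pointer. */
    MPI_File_seek_shared(fh, 0, MPI_SEEK_SET);

    /* Step 103: each rank reads its block; read_ordered advances the
       shared pointer so rank 0's block precedes rank 1's, and so on. */
    const int count = 1024;  /* elements per rank (placeholder) */
    double *buf = malloc(count * sizeof(double));
    MPI_Status status;
    MPI_File_read_ordered(fh, buf, count, MPI_DOUBLE, &status);

    /* Step 104: collective close ends the I/O operation. */
    MPI_File_close(&fh);
    MPI_Info_free(&info);
    free(buf);
    MPI_Finalize();
    return 0;
}
```

Run under an MPI launcher, e.g. `mpirun -np 4 ./reader`, against an existing `result.dat`.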
In step 105, a data partitioning algorithm divides the data into several mutually independent sub-datasets.
During partitioning, the load is balanced across processes so that delay caused by waiting is reduced. Each processor handles a different data subset with its own processing pipeline; in the visualization stage, once each process has obtained its result, the sub-results are sent to the master process for display.
In step 106, the sub-datasets are written into the visualization object collectively in parallel, and the visualization object is sent to the master process for efficient parallel display.
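The gather of per-process sub-results to the master process in step 106 corresponds to an MPI collective. This C sketch is an assumption about the mechanism, since the patent does not name the collective used; it handles sub-results of unequal length with `MPI_Gatherv`, and the sub-result sizes are placeholders:

```c
#include <mpi.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, nprocs;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    /* Each process produced a sub-result of (possibly) different length. */
    int my_count = 100 + rank;                 /* placeholder sub-result size */
    float *sub = malloc(my_count * sizeof(float));
    for (int i = 0; i < my_count; i++) sub[i] = (float)rank;

    /* Master learns every sub-result size, then gathers the
       variable-length blocks into one buffer for display. */
    int *counts = NULL, *displs = NULL;
    float *all = NULL;
    if (rank == 0) counts = malloc(nprocs * sizeof(int));
    MPI_Gather(&my_count, 1, MPI_INT, counts, 1, MPI_INT, 0, MPI_COMM_WORLD);
    if (rank == 0) {
        displs = malloc(nprocs * sizeof(int));
        int total = 0;
        for (int r = 0; r < nprocs; r++) { displs[r] = total; total += counts[r]; }
        all = malloc(total * sizeof(float));
    }
    MPI_Gatherv(sub, my_count, MPI_FLOAT,
                all, counts, displs, MPI_FLOAT, 0, MPI_COMM_WORLD);
    /* Rank 0 now holds the assembled result and can render it. */

    free(sub);
    if (rank == 0) { free(counts); free(displs); free(all); }
    MPI_Finalize();
    return 0;
}
```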
While specific embodiments of the invention have been described above, it will be appreciated by those skilled in the art that these are by way of example only, and that the scope of the invention is defined by the appended claims. Various changes and modifications to these embodiments may be made by those skilled in the art without departing from the spirit and scope of the invention, and these changes and modifications are within the scope of the invention.
Claims (2)
1. A parallelization method for improving post-processing performance, comprising the following steps:
S1, in parallel MPI I/O, each process opens the file by calling the MPI file-open function;
S2, each process moves the globally shared file pointer to its position in the file, determined by its read-address offset, by calling the MPI file-seek function, all processes performing I/O through this single shared file pointer;
S3, each process reads its portion of the data from the file into a memory buffer by calling the MPI file-read function;
S4, each process closes the file and ends its I/O operation by calling the MPI file-close function;
S5, dividing the data into a plurality of mutually independent sub-datasets by using a data partitioning algorithm;
S6, writing the sub-datasets into the visualization object collectively in parallel, and sending the visualization object to the master process for efficient parallel display.
2. The parallelization method for improving post-processing performance according to claim 1, wherein in step S1, the first parameter of the file-open function is the communicator, the second parameter is the path and name of the file to open, the third parameter is the file-access mode, the fourth parameter passes information to the I/O implementation as key-value pairs attached to an MPI info object, and the fifth parameter is the globally shared file handle returned by the operation and used by subsequent I/O operations; through the info object of the fourth parameter, hints such as file striping and internal buffer sizes can be passed to optimize MPI I/O.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111287291.3A CN114003385A (en) | 2021-11-02 | 2021-11-02 | Parallelization method for improving post-processing performance |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111287291.3A CN114003385A (en) | 2021-11-02 | 2021-11-02 | Parallelization method for improving post-processing performance |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114003385A true CN114003385A (en) | 2022-02-01 |
Family
ID=79926309
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111287291.3A Withdrawn CN114003385A (en) | 2021-11-02 | 2021-11-02 | Parallelization method for improving post-processing performance |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114003385A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116185662A (en) * | 2023-02-14 | 2023-05-30 | 国家海洋环境预报中心 | Asynchronous parallel I/O method based on NetCDF and non-blocking communication |
CN116185662B (en) * | 2023-02-14 | 2023-11-17 | 国家海洋环境预报中心 | Asynchronous parallel I/O method based on NetCDF and non-blocking communication |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11023206B2 (en) | Dot product calculators and methods of operating the same | |
EP3144805B1 (en) | Method and processing apparatus for performing arithmetic operation | |
Wang et al. | Workload analysis and efficient OpenCL-based implementation of SIFT algorithm on a smartphone | |
WO2017185393A1 (en) | Apparatus and method for executing inner product operation of vectors | |
KR102371844B1 (en) | Computing method applied to artificial intelligence chip, and artificial intelligence chip | |
WO2019019926A1 (en) | System parameter optimization method, apparatus and device, and readable medium | |
CN115880132A (en) | Graphics processor, matrix multiplication task processing method, device and storage medium | |
CN111651206A (en) | Device and method for executing vector outer product operation | |
CN114003385A (en) | Parallelization method for improving post-processing performance | |
CN114048816B (en) | Method, device, equipment and storage medium for sampling data of graph neural network | |
Xiao et al. | Image Sobel edge extraction algorithm accelerated by OpenCL | |
CN113538687B (en) | Finite element visualization method, system, device and storage medium based on VTK | |
US9032405B2 (en) | Systems and method for assigning executable functions to available processors in a multiprocessing environment | |
Tan et al. | Parallel particle swarm optimization algorithm based on graphic processing units | |
Sun et al. | Efficient knowledge graph embedding training framework with multiple gpus | |
CN106708499B (en) | Analysis method and analysis system of drawing processing program | |
He et al. | An optimal parallel implementation of Markov Clustering based on the coordination of CPU and GPU | |
CN116579914B (en) | Execution method and device of graphic processor engine, electronic equipment and storage medium | |
Kłopotek et al. | Solving systems of polynomial equations on a GPU | |
US11630667B2 (en) | Dedicated vector sub-processor system | |
Koprawi | Parallel Computation in Uncompressed Digital Images Using Computer Unified Device Architecture and Open Computing Language | |
CN112328960B (en) | Optimization method and device for data operation, electronic equipment and storage medium | |
CN117349190A (en) | Memory allocation method, accelerator, and storage medium | |
CN117556273A (en) | Method and device for calculating contrast loss through multiple graphic processors | |
CN118051264A (en) | Matrix processing method and device, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | ||
Application publication date: 20220201 |