CN109857560A

CN109857560A - A kind of collaboration parallelization mechanism based on CPU/GPU isomerous environment

Info

Publication number: CN109857560A
Application number: CN201910082779.9A
Authority: CN
Inventors: 张瑞聪; 张卫山; 房凯
Original assignee: China University of Petroleum East China
Current assignee: China University of Petroleum East China
Priority date: 2019-01-28
Filing date: 2019-01-28
Publication date: 2019-06-07

Abstract

The collaboration parallelization mechanism based on CPU/GPU isomerous environment that the invention proposes a kind of is realized and greatly improves its speed when big data calculates and handles.One side CPU provides data to GPU and receives the data that GPU is passed back, manages the work of GPU；Another aspect CPU and GPU collaboration are parallel to complete calculating task, in such a way that threshold value is set, compare this different task to request CPU and the different of GPU resource, number can be received under this task using CPU and using GPU by calculating separately out, select the processor that can accommodate most numbers.This mode that CPU or GPU is reasonably selected according to the loading condition of node, the calculating of current big data and processing speed are greatly improved.

Description

A kind of collaboration parallelization mechanism based on CPU/GPU isomerous environment

Technical field

The present invention relates to industrial equipment state data analysis fields, and in particular to is based on CPU/GPU isomerous environment to a kind of Collaboration parallelization mechanism.

Background technique

On the one hand collaboration parallelization mechanism based on CPU/GPU isomerous environment, CPU manage the work of GPU, on the other hand join It carries out CPU-GPU according to the loading condition of node with the calculating task of part and flexibly selects.It can accelerate to the maximum extent The speed of operation and processing data.

Have closest to technology of the invention:

(1), the cooperated computing mode based on CPU+GPU: CPU is merely responsible for the work of management GPU, provides data simultaneously for GPU The data that GPU is passed back are received, entire calculating task is undertaken by GPU.The CPU and CPU division of labor is clear, but wastes valuable CPU meter Calculate resource.

(2), the cooperated computing load equilibrium design based on CPU+GPU: CPU individually undertakes the calculating task of a part, But load balancing at this time is difficult to accomplish.

Due to the restriction of various history and practical reasons, Heterogeneous Computing still suffers from the problem of all various aspects, wherein most Distinct issues are program development difficulties, and this problem is more prominent when especially expanding to cluster scale rank.Main performance Scalability, load balancing, adaptivity, communication, in terms of.

Summary of the invention

To solve shortcoming and defect in the prior art, the invention proposes the collaborations based on CPU/GPU isomerous environment simultaneously Row mechanism substantially increases calculating and handles the speed of data.

The technical solution of the present invention is as follows:

A kind of collaboration parallelization mechanism based on CPU/GPU isomerous environment, CPU not only manage the work of GPU, also participation portion The calculating task divided suitably flexibly selects CPU or GPU according to the loading condition of node, comprising the following steps:

Step (1), algorithm count the computing resource of each calculate node in cloud environment before topological operation submission；

Step (2), when this topology submit after, obtain each worker of topology resource request, by this request with it is each The available resources of a cloud node compare, and when the available resources of certain cloud node are greater than the request of this worker, this worker is advised It draws and arrives this node；

Step (3) requests CPU and the different of GPU resource by comparing this worker, calculates separately out and is using CPU With use this worker in the case where GPU that can be received number, select the processor that can accommodate most numbers.

Beneficial effects of the present invention:

(1) management to be worked by CPU GPU, provides data for GPU and receives the data that GPU is passed back, by the fortune of GPU Efficiency is calculated to be greatly improved on original base；

(2) CPU and GPU cooperates with operation, and the computing capability of CPU is adequately used, cleverer to the processing of data It is living；

(3) loading condition for passing through each node when operation compares resource and requests the difference of CPU and GPU, selects energy The processor for accommodating most numbers, solves the problems, such as load imbalance.

Detailed description of the invention

In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with It obtains other drawings based on these drawings.

Fig. 1 is that the present invention is based on the collaboration parallel processing flow charts under CPU/GPU isomerous environment:

Specific embodiment

Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.

As shown in Figure 1, the core of the collaboration parallelization mechanism under the isomerous environment of the invention based on CPU/GPU is pair The reasonable selection of CPU and GPU, by the way that two threshold values: CPU usage α and RAM utilization rate β are arranged, according to the load feelings of node Condition carries out the flexible choice of CPU-GPU.

Below with reference to figure, the detailed process based on the collaboration parallelization mechanism under CPU/GPU isomerous environment is carried out detailed Illustrate:

It is of the invention based on the collaboration parallelization mechanism under CPU/GPU isomerous environment, one side CPU provides data to GPU And receive the data that GPU is passed back, another aspect CPU and the parallel completion calculating task of GPU collaboration, in such a way that threshold value is set, CPU or GPU is reasonably selected according to the loading condition of node, the calculating of current big data and processing speed are greatly improved.

The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Within mind and principle, any modification, equivalent replacement, improvement and so on be should all be included in the protection scope of the present invention.

Claims

1. a kind of collaboration parallelization mechanism based on CPU/GPU isomerous environment, which is characterized in that can be according to the load feelings of node The flexible choice of condition progress CPU-GPU, comprising the following steps:

Step (2), when this topology submit after, obtain each worker of topology resource request, by this request and each cloud The available resources of node compare, and when the available resources of certain cloud node are greater than the request of this worker, this worker is planned for This node；

Step (3) requests the different of CPU and GPU resource by comparing this worker, and calculating separately out using CPU and makes It can be received number with this worker in the case where GPU, select the processor that can accommodate most numbers.