CN109144944A - Program group bandwidth scheduling method for optimal concurrency performance - Google Patents

Program group bandwidth scheduling method for optimal concurrency performance

Info

Publication number
CN109144944A
CN109144944A (application number CN201810858682.8A)
Authority
CN
China
Prior art keywords
program
bandwidth
program segment
scheduling
current program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810858682.8A
Other languages
Chinese (zh)
Inventor
张彩霞
王向东
王新东
肖人苗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Foshan University
Original Assignee
Foshan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Foshan University filed Critical Foshan University
Priority to CN201810858682.8A priority Critical patent/CN109144944A/en
Publication of CN109144944A publication Critical patent/CN109144944A/en
Pending legal-status Critical Current


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • G06F15/163Interprocessor communication
    • G06F15/173Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
    • G06F15/17306Intercommunication techniques
    • G06F15/17318Parallel communications techniques, e.g. gather, scatter, reduce, broadcast, multicast, all to all

Abstract

The invention discloses a program group bandwidth scheduling method for optimal concurrency performance, comprising the steps of: binding a performance counter (PMU) to each program's execution process, dividing the execution process into several program segments according to the scheduling time slice length, and storing the PMU data of each program segment in a program segment execution information database in one-to-one correspondence with a specific identification code; performing program segment identification on each program in the current program group and reading the corresponding PMU data; estimating the bandwidth demand from the PMU data of the current program segment and performing bandwidth scheduling; selecting programs for concurrent execution while estimating the bandwidth demand of the next time slice from the PMU data of the next program segment; executing the current program segment and storing its execution information; and repeating these steps until the bandwidth scheduling and execution optimization of the current program group are complete. The method requires no user intervention, achieves continuous updating and self-optimization, estimates bandwidth demand accurately and efficiently, and yields good scheduling results.

Description

Program group bandwidth scheduling method for optimal concurrency performance
Technical field
The invention belongs to the technical field of computer systems, and more particularly relates to a program group bandwidth scheduling method for optimal concurrency performance.
Background technique
Constrained by power consumption, hardware and other factors, the once-straightforward approach of improving processor performance by raising chip frequency has become increasingly difficult, so multi-core and many-core architectures have become the development trend of computing systems. Such systems, however, integrate multiple processing cores on a single processor, and the cores share one memory access bus; system performance is therefore constrained by memory access bandwidth, and memory access performance becomes the main bottleneck limiting further improvement of system performance.
For this reason, improving memory access efficiency and reducing the system's dependence on memory access bandwidth can reduce memory access latency and further improve the operating efficiency of the system.
Summary of the invention
In view of the above deficiencies of the prior art, the present invention provides a program group bandwidth scheduling method for optimal concurrency performance, comprising the steps of:
(1) binding a performance counter (PMU) to each program's execution process, dividing the execution process into several program segments according to the scheduling time slice length, and storing the PMU data of each program segment in a program segment execution information database in one-to-one correspondence with a specific identification code;
(2) performing program segment identification on each program in the current program group and reading the corresponding PMU data;
(3) estimating the bandwidth demand from the PMU data of the current program segment and performing bandwidth scheduling;
(4) selecting programs for concurrent execution, while estimating the bandwidth demand of the next time slice from the PMU data of the next program segment;
(5) executing the current program segment, then updating the corresponding PMU data in the program segment execution information database and matching it with the corresponding identification code;
(6) performing bandwidth scheduling for the next program segment, and repeating steps (4) and (5) until the bandwidth scheduling and execution optimization of the current program group are complete.
Preferably, in step (4), the total bandwidth demand of the next time slice is not lower than the average bandwidth demand of the program group.
Preferably, the scheduling time slice length is 5-200 ms.
Preferably, the bandwidth demand is estimated from the program's exclusive (solo) execution performance; further, the exclusive execution performance is calculated from the frequency and efficiency of the Cache and main memory accesses occurring in the program segment.
Preferably, each program segment is provided with a corresponding program segment information table for recording the current and historical execution information of the program segment.
Beneficial effects of the present invention:
The bandwidth scheduling method of the invention performs bandwidth scheduling according to historical execution information. After a program segment finishes executing, its current execution information is automatically updated in the program segment execution information database and is loaded again in future executions of the program, so that continuous updating and self-optimization are achieved. The execution data are obtained by the PMU, so no user intervention is required, and accurate and efficient bandwidth demand estimation and scheduling are realized.
Detailed description of the invention
Fig. 1 is a flow diagram of the bandwidth scheduling method of the invention.
Specific embodiment
The present invention is described in detail below with reference to the accompanying drawing and a specific embodiment.
Referring to Fig. 1, the present invention provides a program group bandwidth scheduling method for optimal concurrency performance, comprising the steps of:
(1) A performance counter (PMU) is bound to each program's execution process, the execution process is divided into several program segments using a scheduling time slice length of 50 ms, and the PMU data of each program segment is stored in the program segment execution information database in one-to-one correspondence with a specific identification code.
(2) Program segment identification is performed on each program in the current program group, and the corresponding PMU data is read according to the identification code. To improve the validity of program segment identification, the identification can be realized with basic block vectors, i.e. by judging whether the Manhattan distance between the basic block vectors formed during program segment execution is less than the similarity threshold of the basic block vectors, as illustrated by the sketch below.
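The basic-block-vector comparison described above can be sketched as follows. This is a minimal illustration rather than code from the patent; the normalization, the function names and the value of SIMILARITY_THRESHOLD are assumptions introduced here for clarity.

```python
# Minimal sketch of basic-block-vector (BBV) based program segment identification.
# The BBV counts how often each basic block executes during a segment; two segments
# are treated as the same phase when the Manhattan (L1) distance between their
# normalized BBVs falls below a similarity threshold. Names and the threshold value
# are illustrative assumptions, not values prescribed by the patent.

SIMILARITY_THRESHOLD = 0.2  # assumed threshold on the normalized L1 distance

def normalize(bbv):
    """Scale a basic block vector so its entries sum to 1."""
    total = sum(bbv) or 1
    return [x / total for x in bbv]

def manhattan_distance(bbv_a, bbv_b):
    """L1 distance between two equal-length basic block vectors."""
    return sum(abs(a - b) for a, b in zip(bbv_a, bbv_b))

def identify_segment(current_bbv, known_segments):
    """Return the identification code of the closest known segment, or None.

    known_segments maps identification codes to previously recorded BBVs
    (as stored in the program segment execution information database).
    """
    current = normalize(current_bbv)
    best_code, best_dist = None, float("inf")
    for code, recorded_bbv in known_segments.items():
        dist = manhattan_distance(current, normalize(recorded_bbv))
        if dist < best_dist:
            best_code, best_dist = code, dist
    return best_code if best_dist < SIMILARITY_THRESHOLD else None
```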
(3) The bandwidth demand is estimated from the PMU data of the current program segment, and bandwidth scheduling is performed. Each program segment is provided with a corresponding program segment information table (stored in the program segment execution information database) for recording the current and historical execution information of the program segment, i.e. its PMU data; the estimate is taken as the average of this execution information.
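One possible in-memory layout of such a program segment information table is sketched below. The record fields, class name and averaging scheme are assumptions chosen to mirror the description, not structures defined by the patent.

```python
# Sketch of a program segment execution information database keyed by identification
# code. Each entry keeps the history of PMU samples for one segment; the bandwidth
# estimate used for scheduling is the mean over the recorded history, as in step (3).
# Field names are illustrative assumptions.

from collections import defaultdict
from statistics import mean

class SegmentInfoTable:
    def __init__(self):
        # identification code -> list of PMU samples, each a dict of counter values
        self.history = defaultdict(list)

    def record(self, code, pmu_sample):
        """Append the PMU data of a finished segment execution (step (5))."""
        self.history[code].append(pmu_sample)

    def estimate(self, code, counter):
        """Average of one PMU counter over the segment's recorded executions."""
        samples = self.history.get(code)
        if not samples:
            return None  # no history yet; the scheduler must fall back to a default
        return mean(s[counter] for s in samples)

# Example: record a sample and query the averaged main-memory access count.
table = SegmentInfoTable()
table.record("prog1_seg3", {"cache_accesses": 120_000, "mem_accesses": 8_000})
print(table.estimate("prog1_seg3", "mem_accesses"))
```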
(4) Programs are selected for concurrent execution, while the bandwidth demand of the next time slice is estimated from the PMU data of the next program segment. To keep performance stable during system operation, the total bandwidth demand of the next time slice is not lower than the average bandwidth demand of the program group. Specifically, the bandwidth demand is calculated from the frequency and efficiency of the Cache and main memory accesses occurring in the program segment.
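One way to read this calculation is sketched below: the per-segment demand is derived from the main-memory access count recorded by the PMU over the 50 ms time slice, and the scheduled total is clamped to at least the group's average demand. The cache-line size, the slice length and the formula itself are illustrative assumptions, not values fixed by the patent.

```python
# Sketch of bandwidth demand estimation for the next time slice (step (4)).
# Each main memory access is assumed to transfer one cache line; dividing the
# transferred bytes by the slice length gives an approximate bandwidth demand.
# CACHE_LINE_BYTES, TIME_SLICE_S and the clamping rule are illustrative assumptions.

CACHE_LINE_BYTES = 64
TIME_SLICE_S = 0.050  # 50 ms scheduling time slice, as in the embodiment

def segment_bandwidth_demand(mem_accesses):
    """Approximate bytes/s demanded by one segment during one time slice."""
    return mem_accesses * CACHE_LINE_BYTES / TIME_SLICE_S

def next_slice_total_demand(next_segment_mem_accesses, group_average_demand):
    """Total demand for the next slice, never below the group average (claim 2)."""
    total = sum(segment_bandwidth_demand(m) for m in next_segment_mem_accesses)
    return max(total, group_average_demand)

# Example: three co-scheduled segments with predicted main-memory access counts.
demand = next_slice_total_demand([8_000, 12_500, 3_000], group_average_demand=2.5e9)
print(f"scheduled bandwidth for next slice: {demand / 1e9:.2f} GB/s")
```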
(5) The current program segment is executed. After the program segment finishes executing, its execution information is updated in the corresponding information table and matched with the corresponding identification code, so that it can be loaded again when the program segment is executed in the future, improving the accuracy of the next bandwidth demand estimate.
(6) Bandwidth scheduling is performed for the next program segment, and steps (4) and (5) are repeated until the bandwidth scheduling and execution optimization of the current program group are complete.
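Putting steps (2)-(6) together, the outer control loop might look like the following self-contained sketch over a toy program group. Real PMU readout, segment identification and bandwidth allocation are replaced by trivial stand-ins; all names and numbers are illustrative assumptions rather than parts of the disclosed method.

```python
# Sketch of the outer scheduling loop (steps (2)-(6)) over a toy program group.
# Programs are modelled as lists of (identification code, predicted main-memory
# accesses) pairs; each loop iteration is one scheduling time slice.

TIME_SLICE_S = 0.050          # 50 ms slice, as in the embodiment
CACHE_LINE_BYTES = 64

def demand(mem_accesses):
    return mem_accesses * CACHE_LINE_BYTES / TIME_SLICE_S

def schedule_program_group(programs):
    history = {}                                   # execution information database
    group_avg = 0.0
    step = 0
    while any(programs.values()):                  # some program still has segments left
        # Steps (2)-(3): identify the current segment of each program and estimate its
        # demand from history if available, otherwise from the predicted access count.
        current = {p: segs[0] for p, segs in programs.items() if segs}
        demands = {p: history.get(code, demand(acc)) for p, (code, acc) in current.items()}

        # Step (4): total demand for the next slice is floored at the group average.
        total = max(sum(demands.values()), group_avg)
        group_avg = (group_avg * step + total) / (step + 1)
        step += 1

        # Steps (5)-(6): "execute" the slice, write the observed demand back under the
        # segment's identification code, and advance every program to its next segment.
        for p, (code, acc) in current.items():
            history[code] = demand(acc)            # updated execution information
            programs[p].pop(0)
        print(f"slice {step}: scheduled {total / 1e6:.1f} MB/s for {len(current)} programs")

# Toy group of two programs, each split into two segments.
schedule_program_group({
    "prog_a": [("a_seg0", 8_000), ("a_seg1", 12_000)],
    "prog_b": [("b_seg0", 3_000), ("b_seg1", 9_000)],
})
```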
To evaluate the scheduling effect of the above method, this embodiment randomly generates 20 program groups with a concurrency of 5 in the system and records the execution performance of each program group over 50 consecutive runs under the above bandwidth scheduling method. The results show that the maximum speedup of the program groups improves by up to 5.5% and the main memory access volume declines slightly, effectively improving the operating efficiency of the system.
The above embodiment merely illustrates the technical solution of the present invention and is not intended to limit it; any modification or equivalent replacement that does not depart from the spirit and scope of the present invention shall fall within the protection scope of the technical solution of the present invention.

Claims (6)

1. A program group bandwidth scheduling method for optimal concurrency performance, characterized by comprising the steps of:
(1) binding a performance counter (PMU) to each program's execution process, dividing the execution process into several program segments according to the scheduling time slice length, and storing the PMU data of each program segment in a program segment execution information database in one-to-one correspondence with a specific identification code;
(2) performing program segment identification on each program in the current program group and reading the corresponding PMU data;
(3) estimating the bandwidth demand from the PMU data of the current program segment and performing bandwidth scheduling;
(4) selecting programs for concurrent execution, while estimating the bandwidth demand of the next time slice from the PMU data of the next program segment;
(5) executing the current program segment, then updating the corresponding PMU data in the program segment execution information database and matching it with the corresponding identification code;
(6) performing bandwidth scheduling for the next program segment, and repeating steps (4) and (5) until the bandwidth scheduling and execution optimization of the current program group are complete.
2. The method according to claim 1, characterized in that, in step (4), the total bandwidth demand of the next time slice is not lower than the average bandwidth demand of the program group.
3. The method according to claim 1, characterized in that the scheduling time slice length is 5-200 ms.
4. The method according to claim 1, characterized in that the bandwidth demand is estimated from the program's exclusive execution performance.
5. The method according to claim 4, characterized in that the program's exclusive execution performance is calculated from the frequency and efficiency of the Cache and main memory accesses occurring in the program segment.
6. The method according to claim 1, characterized in that each program segment is provided with a corresponding program segment information table for recording the current and historical execution information of the program segment.
CN201810858682.8A 2018-07-31 2018-07-31 Program group bandwidth scheduling method for optimal concurrency performance Pending CN109144944A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810858682.8A CN109144944A (en) 2018-07-31 2018-07-31 Program group bandwidth scheduling method for optimal concurrency performance


Publications (1)

Publication Number Publication Date
CN109144944A true CN109144944A (en) 2019-01-04

Family

ID=64799018

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810858682.8A Pending CN109144944A (en) 2018-07-31 2018-07-31 A kind of program groups bandwidth scheduling method that concurrency performance is optimal

Country Status (1)

Country Link
CN (1) CN109144944A (en)


Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05241860A (en) * 1992-02-26 1993-09-21 Kobe Nippon Denki Software Kk Time slice optimization system
US6438747B1 (en) * 1999-08-20 2002-08-20 Hewlett-Packard Company Programmatic iteration scheduling for parallel processors
US20060053416A1 (en) * 2004-09-09 2006-03-09 Fujitsu Limited Program section layout method and layout program
CN101403978A (en) * 2007-10-01 2009-04-08 埃森哲环球服务有限公司 Infrastructure for parallel programming of clusters of machines
CN106598599A (en) * 2016-12-15 2017-04-26 王弘远 Program execution method and device
CN107196807A (en) * 2017-06-20 2017-09-22 清华大学深圳研究生院 Network intermediary device and its dispositions method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
徐地, et al.: "A cross-execution optimization method supporting memory-access-bandwidth-sensitive scheduling", 《计算机学报》 (Chinese Journal of Computers) *

Similar Documents

Publication Publication Date Title
US10261806B2 (en) Adaptive hardware configuration for data analytics
US8402223B2 (en) Cache eviction using memory entry value
US7401329B2 (en) Compiling computer programs to exploit parallelism without exceeding available processing resources
US9436512B2 (en) Energy efficient job scheduling in heterogeneous chip multiprocessors based on dynamic program behavior using prim model
CN111294234B (en) Parallel block chain fragmentation method based on intelligent contract optimization model
CN114895773A (en) Energy consumption optimization method, system and device of heterogeneous multi-core processor and storage medium
Kim et al. Minimizing GPU kernel launch overhead in deep learning inference on mobile GPUs
CN110414569B (en) Clustering implementation method and device
CN107704266B (en) Reduction method applied to solving particle simulation parallel data competition
CN115150471A (en) Data processing method, device, equipment, storage medium and program product
CN116112563A (en) Dual-strategy self-adaptive cache replacement method based on popularity prediction
CN105094949A (en) Method and system for simulation based on instruction calculation model and feedback compensation
Liang et al. Prediction method of energy consumption based on multiple energy-related features in data center
US20190197653A1 (en) Processing unit performance projection using dynamic hardware behaviors
CN109144944A (en) Program group bandwidth scheduling method for optimal concurrency performance
CN107145453B (en) A kind of prediction technique, device, readable medium and the equipment of cache invalidation rate
Li et al. GbA: A graph‐based thread partition approach in speculative multithreading
US20190197652A1 (en) On-the-fly scheduling of execution of dynamic hardware behaviors
CN115185804A (en) Server performance prediction method, system, terminal and storage medium
Strobel et al. Combined mpsoc task mapping and memory optimization for low-power
CN107577517B (en) NUMA memory architecture-oriented fine-grained vCPU scheduling method and system
CN112052087A (en) Deep learning training system and method for dynamic resource adjustment and migration
CN106203083B (en) The prediction partition method of device driver and its intelligent hardened system
US11836531B2 (en) Method, device, and program product for managing computing system
CN113204478B (en) Method, device and equipment for operating test unit and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20190104)