CN114756358A - DAG task scheduling method, device, equipment and storage medium


Info

Publication number: CN114756358A (application CN202210671115.8A)
Authority: CN (China)
Prior art keywords: task, dag, dag task, scheduling, network model
Legal status: Granted, Active
Other languages: Chinese (zh)
Other versions: CN114756358B
Inventors: 胡克坤, 鲁璐, 赵坤, 董刚, 赵雅倩, 李仁刚
Current and original assignee: Suzhou Inspur Intelligent Technology Co Ltd
PCT publication: WO2023241000A1 (PCT/CN2022/142437)

Classifications

    • G06F 9/4881: Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • G06F 9/505: Allocation of resources (e.g. of the CPU) to service a request, the resource being a machine (CPUs, servers, terminals), considering the load
    • G06N 3/045: Neural network architectures; combinations of networks
    • G06N 3/084: Learning methods; backpropagation, e.g. using gradient descent


Abstract

The application discloses a DAG task scheduling method, device, equipment and storage medium. The method comprises the following steps: constructing a network model according to the sequence of the directed graph neural network and the sequential decoder, and defining an objective function of the network model by taking the minimum task scheduling length as an objective; acquiring a DAG task data set, and generating a corresponding information matrix for each DAG task in the DAG task data set; training the network model by using the information matrix, and updating model parameters of the network model by using reinforcement learning according to the objective function to obtain a trained DAG task scheduling model; and determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model, and executing the DAG task to be executed by using the parallel computing system according to the scheduling sequence. The method can shorten the DAG task scheduling length, improve the parallel execution efficiency of the DAG task, and solve the problem that enough supervision labels are difficult to collect for the optimal priority distribution of the DAG task.

Description

DAG task scheduling method, device, equipment and storage medium
Technical Field
The present invention relates to the field of task scheduling technologies, and in particular, to a method, an apparatus, a device, and a storage medium for DAG task scheduling.
Background
Currently, driven by the demand for high performance and complex functionality, parallel computing systems are increasingly used to execute real-time applications, such as autonomous driving, which comprises complex functional components (perception, planning, and control) with extremely high performance and real-time requirements. A DAG (Directed Acyclic Graph) task is often used to represent the complex dependencies between the multiple task components (subtasks) of such real-time applications and to formally describe the fine-grained parallel task scheduling problem, i.e., the DAG task scheduling problem. Since a non-preemptive task model avoids task migration and switching overhead, priority-based non-preemptive scheduling of DAG tasks has received much attention. The problem is how to schedule a given DAG task non-preemptively onto a parallel computing system so that the processing time is minimized; it is a typical NP-complete (Non-deterministic Polynomial-time complete) problem. In the prior art, long-term parallel computing practice has accumulated a large number of excellent heuristic scheduling algorithms, such as list scheduling algorithms and clustering scheduling algorithms. However, owing to the nature of heuristic strategies, these algorithms cannot establish basic design principles for a DAG task scheduler, for example, how to assign a priority to each subtask using the DAG task execution times and the topological features of the DAG task graph under different DAG task scales and configurations, and their scheduling performance is not ideal.
Disclosure of Invention
In view of this, an object of the present invention is to provide a method, an apparatus, a device, and a medium for scheduling DAG tasks, which can shorten a length of the DAG task scheduling and improve parallel execution efficiency of the DAG tasks. The specific scheme is as follows:
in a first aspect, the present application discloses a DAG task scheduling method, including:
constructing a network model according to the sequence of the directed graph neural network and the sequential decoder, and defining an objective function of the network model by taking the minimum task scheduling length as an objective;
acquiring a DAG task data set, and generating a corresponding information matrix for each DAG task in the DAG task data set;
training the network model by using the information matrix, and updating model parameters of the network model by using reinforcement learning according to the objective function to obtain a trained DAG task scheduling model;
and determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model, and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence.
Optionally, before the network model is built according to the sequence of the directed graph neural network and the sequential decoder, the method further includes:
constructing a graph convolution layer for DAG task feature learning based on an aggregation function and a nonlinear activation function;
and constructing the directed graph neural network according to the sequence of the input layer, K graph convolution layers, and the output layer.
Optionally, before constructing the network model according to the sequence of the directed graph neural network and the sequential decoder, the method further includes:
the priority distribution state of subtasks in the DAG task is taken as a variable, and a vector expression of a context environment is defined for the DAG task;
constructing a sequential decoder for prioritization based on an attention mechanism and a vector expression of the context environment.
Optionally, the defining an objective function of the network model with the minimum task scheduling length as a target includes:
generating a deceleration evaluation index of the DAG task by taking the task scheduling length corresponding to the priority sequence of the DAG task at different time steps and the lower limit of the task scheduling length as independent variables; the lower limit of the task scheduling length is determined according to the path length of the key path of the DAG task;
constructing a reward function based on a strategy gradient algorithm and the deceleration evaluation index;
and constructing an objective function of the network model based on the reward function.
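A minimal sketch of the slowdown ("deceleration") index and the derived reward described above might look as follows. The exact functional forms are not given in the patent; in particular, the lower bound shown (the maximum of the critical path length and a perfectly balanced load, assuming unit-speed processing nodes) is an assumption:

```python
def schedule_lower_bound(critical_path_len, total_load, num_nodes):
    # Lower limit of the task scheduling length: no schedule can beat the
    # critical path, nor perfect load balancing across all processing
    # nodes (assumption: identical unit-speed nodes).
    return max(critical_path_len, total_load / num_nodes)

def slowdown(makespan, lower_bound):
    # "Deceleration" (slowdown) evaluation index: ratio of the achieved
    # task scheduling length to its lower limit; >= 1.0, lower is better.
    return makespan / lower_bound

def reward(makespan, lower_bound):
    # Policy-gradient reward: negate the slowdown so that shorter
    # schedules receive higher reward.
    return -slowdown(makespan, lower_bound)
```

With this shape, the objective function of the network model is the expected reward over the priority orders sampled from the decoder, maximized by a policy-gradient algorithm.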
Optionally, the acquiring the DAG task dataset includes:
configuring DAG task parameters; the DAG task parameters comprise the number of task layers, the number of sub-nodes of a target node, the generation probability of the sub-nodes of the target node, the adding probability of a connecting edge between two adjacent task layers, and the computation load of each subtask;
and generating a DAG task according to the DAG task parameters to obtain the DAG task data set.
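The layer-by-layer parameterised DAG generation described above can be sketched as follows. The concrete sampling distributions are assumptions, since the patent does not fix them:

```python
import random

def generate_layered_dag(num_layers, max_children, p_child, p_edge,
                         load_range=(1, 10), seed=0):
    # Layer-by-layer random DAG generator: each node in a layer spawns up
    # to max_children children with probability p_child each, and extra
    # connecting edges between adjacent layers are added with
    # probability p_edge.
    rng = random.Random(seed)
    layers, edges = [[0]], []
    loads = {0: rng.randint(*load_range)}
    nxt = 1
    for _ in range(num_layers - 1):
        new_layer = []
        for parent in layers[-1]:
            for _ in range(max_children):
                if rng.random() < p_child:
                    loads[nxt] = rng.randint(*load_range)
                    edges.append((parent, nxt))
                    new_layer.append(nxt)
                    nxt += 1
        if not new_layer:  # ensure every layer is non-empty and connected
            loads[nxt] = rng.randint(*load_range)
            edges.append((layers[-1][0], nxt))
            new_layer = [nxt]
            nxt += 1
        for u in layers[-1]:  # extra cross edges between adjacent layers
            for v in new_layer:
                if (u, v) not in edges and rng.random() < p_edge:
                    edges.append((u, v))
        layers.append(new_layer)
    return layers, edges, loads
```

Because node identifiers strictly increase from layer to layer, every edge goes forward and the result is acyclic by construction.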
Optionally, the generating a corresponding information matrix for each DAG task in the DAG task data set includes:
generating a node characteristic matrix according to the characteristics of each subtask in the DAG task data set;
generating an adjacency matrix according to the connection relation between different subtasks in the DAG task data set;
and obtaining an information matrix corresponding to the DAG task based on the node characteristic matrix and the adjacency matrix.
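Building the information matrix from a generated DAG can be sketched as follows; only three of the listed node features (computation load, in-degree, out-degree) are included for brevity:

```python
import numpy as np

def information_matrix(n, edges, loads):
    # Adjacency matrix A: A[i, j] = 1 iff there is a directed edge
    # from subtask t_i to subtask t_j.
    A = np.zeros((n, n))
    for u, v in edges:
        A[u, v] = 1.0
    # Node feature matrix X (a sketch): per-subtask computation load,
    # in-degree and out-degree. The patent additionally lists node level
    # and critical/non-critical path lengths, omitted here.
    X = np.stack([np.array([loads[i] for i in range(n)], dtype=float),
                  A.sum(axis=0),   # in-degree of each node
                  A.sum(axis=1)],  # out-degree of each node
                 axis=1)
    return X, A
```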
Optionally, the training the network model by using the information matrix, and updating the model parameters of the network model by using reinforcement learning according to the objective function includes:
inputting the information matrix into the network model, and outputting vector representation of each subtask by using the directed graph neural network according to the characteristics of the subtasks and the dependency relationship among the subtasks;
prioritizing, with the sequential decoder, the subtasks within the DAG task according to the vector representations of the subtasks, based on the attention mechanism and the context environment of the DAG task;
calculating the task scheduling length of the DAG task by utilizing a DAG task scheduling simulator according to the priority sequence;
and updating the model parameters of the network model by utilizing reinforcement learning according to the task scheduling length and the objective function until the network model is converged.
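The "DAG task scheduling simulator" used in the training loop above can be sketched as a non-preemptive list scheduler that, given a priority order, computes the task scheduling length (makespan). The sketch assumes m identical unit-speed processing nodes and ignores communication cost, simplifications not stated in the patent:

```python
def schedule_length(n, edges, loads, priority, m):
    # Non-preemptive list scheduling of n subtasks on m identical nodes.
    # `priority` is the decoder's output order (earlier = higher priority).
    preds = {i: set() for i in range(n)}
    succs = {i: [] for i in range(n)}
    for u, v in edges:
        preds[v].add(u)
        succs[u].append(v)
    rank = {t: r for r, t in enumerate(priority)}
    ready = [t for t in range(n) if not preds[t]]
    node_free = [0.0] * m  # time at which each processing node becomes idle
    finish = {}
    done = 0
    while done < n:
        ready.sort(key=rank.get)       # honour the priority order
        progressed = False
        for t in list(ready):
            j = min(range(m), key=lambda k: node_free[k])
            start = max(node_free[j],
                        max((finish[p] for p in preds[t]), default=0.0))
            finish[t] = start + loads[t]
            node_free[j] = finish[t]
            ready.remove(t)
            done += 1
            progressed = True
            for s in succs[t]:         # release successors once all
                if all(p in finish for p in preds[s]):  # preds scheduled
                    ready.append(s)
        if not progressed:             # guard against malformed (cyclic) input
            break
    return max(finish.values())
```

The training loop then samples a priority order from the decoder, calls a simulator of this shape to obtain the scheduling length, converts it into a reward, and applies a policy-gradient update to the model parameters.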
In a second aspect, the present application discloses a DAG task scheduling apparatus, including:
the network construction module is used for constructing a network model according to the sequence of the directed graph neural network and the sequential decoder, and defining an objective function of the network model by taking the minimum task scheduling length as an objective;
the data set acquisition module is used for acquiring a DAG task data set and generating a corresponding information matrix for each DAG task in the DAG task data set;
the training module is used for training the network model by using the information matrix and updating model parameters of the network model by using reinforcement learning according to the objective function so as to obtain a trained DAG task scheduling model;
and the scheduling sequence determining module is used for determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence.
Optionally, the DAG task scheduling device further includes:
the graph convolution layer construction unit is used for constructing a graph convolution layer for DAG task feature learning based on the aggregation function and the nonlinear activation function;
and the directed graph neural network construction unit is used for constructing the directed graph neural network according to the sequence of the input layer, K graph convolution layers, and the output layer.
Optionally, the DAG task scheduling device further includes:
the vector expression definition unit is used for defining a vector expression of a context environment for the DAG task by taking the priority distribution state of the subtasks in the DAG task as a variable;
a sequential decoder construction unit for constructing a sequential decoder for prioritization based on an attention mechanism and the vector expression of the context environment, so as to obtain the decoder.
Optionally, the network building module includes:
the deceleration evaluation index construction unit is used for generating a deceleration evaluation index of the DAG task by taking the task scheduling length and the lower limit of the task scheduling length corresponding to the priority sequence of the DAG task at different time steps as independent variables; the lower limit of the task scheduling length is determined according to the path length of the key path of the DAG task;
the reward function construction unit is used for constructing a reward function based on a strategy gradient algorithm and the deceleration evaluation index;
and the target function construction unit is used for constructing a target function of the network model based on the reward function.
Optionally, the data set obtaining module includes:
a task parameter configuration unit, configured to configure DAG task parameters; the DAG task parameters comprise the number of task layers, the number of sub-nodes of a target node, the generation probability of the sub-nodes of the target node, the adding probability of a connecting edge between two adjacent task layers and the calculation load of each sub-task;
and the task generation unit is used for generating the DAG task according to the DAG task parameters so as to obtain the DAG task data set.
Optionally, the data set obtaining module includes:
a node feature matrix generating unit, configured to generate a node feature matrix according to a feature of each subtask in the DAG task data set;
the adjacency matrix generation unit is used for generating an adjacency matrix according to the connection relation between different subtasks in the DAG task data set;
and the information matrix determining unit is used for obtaining an information matrix corresponding to the DAG task based on the node characteristic matrix and the adjacency matrix.
In a third aspect, the present application discloses an electronic device, comprising:
a memory for storing a computer program;
a processor configured to execute the computer program to implement the DAG task scheduling method described above.
In a fourth aspect, the present application discloses a computer readable storage medium for storing a computer program; wherein the computer program, when executed by a processor, implements the aforementioned DAG task scheduling method.
According to the method, a network model is constructed according to the sequence of a directed graph neural network and a sequential decoder, and an objective function of the network model is defined by taking the minimum task scheduling length as an objective; acquiring a DAG task data set, and generating a corresponding information matrix for each DAG task in the DAG task data set; training the network model by using the information matrix, and updating model parameters of the network model by using reinforcement learning according to the objective function to obtain a trained DAG task scheduling model; and determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model, and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence. According to the method, a DAG task scheduling model is obtained based on the directed graph neural network and reinforcement learning, the directed graph neural network can automatically identify rich characteristics related to subtasks in the DAG task, a sequence decoder can utilize the characteristics to perform task priority sequencing on the subtasks, meanwhile, a scheduling target of minimizing the DAG task scheduling length is achieved by utilizing the reinforcement learning optimization model, the DAG task scheduling length can be shortened, the DAG task parallel execution efficiency is improved, and the problem that enough supervision labels are difficult to collect for optimal priority distribution of the DAG task can be solved by utilizing the reinforcement learning.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.
Fig. 1 is a flowchart of a DAG task scheduling method provided in the present application;
fig. 2 is a structural diagram of a specific DAG task scheduling system provided in the present application;
FIG. 3 is a diagram of a specific directed graph neural network architecture provided herein;
FIG. 4 is a flowchart of a specific training method for a DAG task scheduling model according to the present disclosure;
fig. 5 is a schematic structural diagram of a DAG task scheduling apparatus according to the present disclosure;
fig. 6 is a block diagram of an electronic device provided in the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the prior art, long-term parallel computing practice has accumulated a large number of excellent heuristic scheduling algorithms, such as list scheduling algorithms and clustering scheduling algorithms. However, owing to the nature of heuristic strategies, these algorithms cannot establish basic design principles for a DAG task scheduler, for example, how to assign priorities to subtasks using DAG task execution times and the topological features of the DAG task graph under different DAG task scales and configurations, and their scheduling performance is not ideal. To overcome this technical problem, the present application provides a DAG task scheduling method that can shorten the DAG task scheduling length and improve the parallel execution efficiency of DAG tasks.
The embodiment of the application discloses a DAG task scheduling method, which can comprise the following steps as shown in FIG. 1:
step S11: and constructing a network model according to the sequence of the directed graph neural network and the sequential decoder, and defining an objective function of the network model by taking the minimum task scheduling length as an objective.
In this embodiment, a network model is first constructed according to the sequence of a Directed Graph Neural Network (DGNN) and a sequential decoder, and an objective function of the network model is defined with the minimum task scheduling length as the objective. The directed graph neural network is used to identify the task characteristics of the subtasks in the DAG task, including execution times and dependency relationships, and to output the embedded representation, i.e., the vector representation, corresponding to each subtask. The sequential decoder is used to order the priorities of all subtasks according to the embedded representations output by the directed graph neural network, and to output the priority ordering of the subtasks. The objective function guides the learning of the network model, so that for an input DAG task the network model can ultimately achieve the minimum task scheduling length. The network model also comprises a DAG task scheduling simulator, which is used to calculate the scheduling length of a DAG task on a given parallel computing system.
Before describing this embodiment in detail, the basic concepts involved are first explained. A parallel computing system can generally be described as a quadruple P = (M, L, S, B), where: M = {m_1, m_2, ..., m_m} is the set of processing nodes; L is the set of communication links between the processing nodes; S is the set of computation speeds of the processing nodes, where s_j represents the computation speed of node m_j; and B is the set of communication link bandwidths, where b_jk represents the bandwidth of the communication link l_jk between nodes m_j and m_k.
A DAG task refers to multiple subtasks with complex dependencies that can be executed in parallel on a parallel computing system. It is represented by a weighted directed acyclic graph, denoted G = (T, E, C), where:
T = {t_1, t_2, ..., t_n} is the set of nodes, each node representing a subtask, and n is the total number of subtasks;
E is the set of directed edges; a directed edge (t_i, t_j) represents the communication and data dependency from subtask t_i to subtask t_j, i.e., t_j can start only after receiving the computation result of t_i;
C is the set of computation loads; c_i represents the computation load of subtask t_i, and the sum of the computation loads of all subtasks is denoted C_total, so C_total = Σ_i c_i.
Let Pred(t_i) and Succ(t_i) be the set of direct predecessor subtasks and the set of direct successor subtasks of t_i, respectively. The sets of edges connecting t_i with the subtasks in Pred(t_i) and Succ(t_i) are called, respectively, the incident (incoming) edge set in(t_i) and the emergent (outgoing) edge set out(t_i) of t_i; the in-degree and out-degree of t_i are then deg_in(t_i) = |in(t_i)| and deg_out(t_i) = |out(t_i)|. If Pred(t_i) = ∅, t_i is called an entry subtask and is denoted t_entry; if Succ(t_i) = ∅, t_i is called an exit subtask and is denoted t_exit. A path λ = (t_λ1, t_λ2, ..., t_λk) is a finite sequence of subtask nodes satisfying (t_λi, t_λ(i+1)) ∈ E for 1 ≤ i < k. If a path contains both an entry subtask and an exit subtask, it is called a complete path. The path length Λ(λ) of λ is the sum of the computation loads of all subtasks on the path, i.e. Λ(λ) = Σ_{t ∈ λ} c_t.
The complete path with the longest path length is called the critical path. The original feature x_i of each subtask node t_i includes, besides the task load, in-degree, out-degree, and node level, the critical path length and the non-critical path length, where the non-critical path length is obtained by subtracting the critical path length from the total computation load of the DAG task. For example, Fig. 2 shows a DAG task scheduling system containing a DAG task with 9 nodes and a unique entry and exit; the character strings in the nodes represent subtask IDs and computation loads. In fact, a DAG task may have multiple entries and exits; it can be transformed into a DAG task with a unique entry and exit by adding a virtual entry subtask or exit subtask and the corresponding connecting edges. Unless otherwise stated, this embodiment refers to the latter type of DAG task, and the number of subtasks n is assumed to be far greater than the number m of nodes of the parallel computing system.
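The critical path length defined above can be computed in linear time by dynamic programming over a topological order. The sketch below assumes the DAG is given as an edge list with per-subtask computation loads:

```python
def critical_path_length(n, edges, loads):
    # Longest path, by total computation load, through the DAG -- the
    # "critical path", whose length lower-bounds any schedule length.
    preds = {i: [] for i in range(n)}
    succs = {i: [] for i in range(n)}
    indeg = [0] * n
    for u, v in edges:
        succs[u].append(v)
        preds[v].append(u)
        indeg[v] += 1
    # Topological order via Kahn's algorithm.
    stack = [i for i in range(n) if indeg[i] == 0]
    order = []
    while stack:
        u = stack.pop()
        order.append(u)
        for v in succs[u]:
            indeg[v] -= 1
            if indeg[v] == 0:
                stack.append(v)
    # longest[i]: heaviest load along any path ending at subtask t_i.
    longest = [0] * n
    for u in order:
        longest[u] = loads[u] + max((longest[p] for p in preds[u]), default=0)
    return max(longest)
```

The non-critical path length feature mentioned above then follows as the total computation load minus this value.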
In this embodiment, before constructing the network model according to the order of the directed graph neural network and the sequential decoder, the method may further include: constructing a graph convolution layer for DAG task feature learning based on an aggregation function and a nonlinear activation function; and constructing the directed graph neural network according to the sequence of the input layer, K graph convolution layers, and the output layer. The directed graph neural network is used to learn the vector representation h_i of each subtask in the DAG task.
As shown in Fig. 3, the designed directed graph neural network consists of an input layer, K graph convolution layers (graph conv layers), and an output layer. The input layer reads the node feature matrix X and the adjacency matrix A of the DAG task. The graph convolution operation of the k-th graph convolution layer is implemented by an aggregation function (aggregate function) and a nonlinear activation function, as follows:
a_i^(k) = aggregate({ h_j^(k-1) : t_j ∈ Pred(t_i) }),    (1)
h_i^(k) = update(a_i^(k), h_i^(k-1)),    (2)
where the aggregate function aggregates the messages from the direct predecessor subtask set Pred(t_i) of t_i, and the update function performs a nonlinear transformation on the aggregated messages. The aggregate function can be realized in many ways, such as taking the maximum or the average; this patent adopts an attention mechanism:
a_i^(k) = Σ_{t_j ∈ Pred(t_i)} α_ij h_j^(k-1),    (3)
where α_ij represents the attention coefficient of subtask t_j with respect to t_i, which is learned through training. The update function can be any nonlinear activation function; without loss of generality, this embodiment uses the ReLU function, i.e. h_i^(k) = ReLU(a_i^(k)). The output layer directly outputs the vertex embedded representations learned by the K-th graph convolution layer. It can be understood that a graph convolution layer constructed from an aggregation function and a nonlinear activation function adapts well to the directed characteristics of the DAG task graph: it extracts the dependency relationships among subtasks and identifies the features of each subtask and its relations to the other subtasks in the DAG task, so the embedded representations of subtask nodes can be learned more effectively, richer features can be provided for the subsequent prioritization of the subtasks, and the accuracy of the prioritization is further improved.
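The attention-based aggregation of equations (1)-(3) can be sketched in NumPy as follows. Here W and att_w stand in for the trainable parameters, whose exact shapes the patent does not specify; scoring each predecessor by a linear function of the concatenated embeddings is an assumption:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def graph_conv_layer(H, A, W, att_w):
    # One directed graph-convolution layer: each subtask t_i aggregates
    # the embeddings of its direct predecessors Pred(t_i) with attention
    # weights alpha_ij, then applies a ReLU update.
    n = H.shape[0]
    H_new = np.zeros((n, W.shape[1]))
    for i in range(n):
        preds = np.nonzero(A[:, i])[0]       # Pred(t_i): incoming edges
        if len(preds):
            scores = np.array([att_w @ np.concatenate([H[j], H[i]])
                               for j in preds])
            alpha = softmax(scores)          # attention coefficients alpha_ij
            agg = alpha @ H[preds]           # weighted message sum, eq. (3)
        else:
            agg = H[i]                       # entry task keeps its own state
        H_new[i] = np.maximum(0.0, agg @ W)  # ReLU update, eq. (2)
    return H_new
```

Stacking K such layers between an input layer and an output layer yields the directed graph neural network of Fig. 3.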
In this embodiment, before constructing the network model according to the order of the directed graph neural network and the sequential decoder, the method may further include: defining a vector expression of the context environment for the DAG task, with the priority assignment state of the subtasks in the DAG task as a variable; and constructing a sequential decoder for prioritization based on an attention mechanism and the vector expression of the context environment. It can be understood that the decoder is a sequential decoder used to order all subtasks of the DAG task, specifically the embedded representations of all n subtask nodes of the DAG task learned by the directed graph neural network.
The sequential decoder selects subtask nodes one by one to generate a priority sequence o = (o_1, o_2, ..., o_n) of size n, which corresponds to a priority-ordered arrangement of the subtasks and satisfies: priority(o_1) > priority(o_2) > ... > priority(o_n). The sequential decoder in this embodiment formally describes DAG task priority assignment as a probability distribution defined by the following equation:
p_θ(o | G) = Π_{τ=1}^{n} p_θ(o_τ | o_1, ..., o_(τ-1), G),    (4)
where θ denotes the network parameters to be optimized. The sequential decoder first samples the subtask node with the highest priority from the probability distribution p_θ(o_1 | G), then at the next time step samples the subtask node with the second highest priority, and so on until all subtask nodes have been selected by sampling.
At each time step t, the sequential decoder selects subtask node v_π(t) according to the following rule and assigns it the t-th highest priority:

π(t) = argmax_{v_i ∈ U_t} p(v_i | π(1), …, π(t−1), G; θ),    (5)
where the argmax function returns the argument that maximizes the conditional probability. The conditional probability distribution p(v_i | π(1), …, π(t−1), G; θ) is calculated according to the following formula:

p(v_i | π(1), …, π(t−1), G; θ) = softmax(Att(tanh(W_f h_i), cont)),  v_i ∈ U_t,    (6)

where softmax is the normalized exponential function; W_f is the feature transformation matrix to be trained; tanh is an activation function; Att is the attention function; h_i is the embedded representation of subtask node v_i; U_t is the set of subtasks whose priority has not yet been assigned at time step t, and the set of subtasks with assigned priority is denoted O_t, satisfying O_t ∪ U_t = V. During prioritization of a DAG task, the sets O_t and U_t are updated in real time according to the priority assignment status. cont is the context environment used by the sequential decoder when selecting a subtask in real time; it can be understood that in formula (6) the vector representation of each subtask is compared with the vector representation of the context environment, and the attention mechanism then assigns the subtask its weight. The vector representation of the context is calculated as follows:
cont = W[cont_O; cont_U] + b,    (7)

where W represents a linear transformation and "[;]" denotes the tensor concatenation operator; cont_O and cont_U are the embedded representations corresponding to O_t and U_t respectively, calculated by the following formulas:

cont_O = σ( (1/|O_t|) ∑_{v_i ∈ O_t} h_i ),    (8)

cont_U = σ( (1/|U_t|) ∑_{v_i ∈ U_t} h_i ),    (9)

where σ is a nonlinear activation function. Thus a sequential decoder is constructed based on the attention mechanism, and the problem of selecting subtask nodes is reduced to sampling subtask node indices from a conditional probability distribution, so that the priority ordering of the subtasks can be determined more accurately from the vector representations of the subtask nodes.
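To make the decoding rule of formulas (5)–(9) concrete, here is a minimal pure-Python sketch of one full greedy decoding pass. The dot-product attention standing in for Att, the tanh pooling standing in for σ, the element-wise sum standing in for the linear transform W of formula (7), and the toy embeddings are all illustrative assumptions.

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def pooled(nodes, H, dim):
    """Mean of the node embeddings passed through tanh (the role of the
    nonlinearity sigma in formulas (8)-(9)); zero vector for an empty set."""
    if not nodes:
        return [0.0] * dim
    return [math.tanh(sum(H[i][d] for i in nodes) / len(nodes)) for d in range(dim)]

def decode_step(H, assigned, unassigned):
    """One step of the sequential decoder: build the context from O_t
    (assigned) and U_t (unassigned), score each unassigned node with
    dot-product attention against the context (standing in for Att in
    formula (6)), and pick the argmax as in formula (5)."""
    dim = len(H[0])
    cont = [a + b for a, b in zip(pooled(assigned, H, dim),
                                  pooled(unassigned, H, dim))]
    scores = [sum(math.tanh(h) * c for h, c in zip(H[i], cont)) for i in unassigned]
    probs = softmax(scores)
    best = max(range(len(unassigned)), key=lambda k: probs[k])
    return unassigned[best], probs

H = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]]   # toy subtask embeddings
order, unassigned = [], [0, 1, 2]
while unassigned:                            # greedy full decoding pass
    pick, _ = decode_step(H, order, unassigned)
    order.append(pick)
    unassigned.remove(pick)
```

Note how the context is recomputed at every step as nodes move from U_t to O_t, so the score of each remaining subtask changes as the partial ordering grows.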
In this embodiment, the defining an objective function of the network model with the minimum task scheduling length as a target may include: generating a deceleration evaluation index of the DAG task by taking the task scheduling length corresponding to the priority sequence of the DAG task at different time steps and the lower limit of the task scheduling length as independent variables; the lower limit of the task scheduling length is determined according to the path length of the key path of the DAG task; constructing a reward function based on a strategy gradient algorithm and the deceleration evaluation index; and constructing an objective function of the network model based on the reward function. It is to be appreciated that the DAG task scheduling problem can be modeled as a Markov Decision Process (MDP); a basic MDP can be generally described by a five-tuple:
(S, A, Π, R, δ).

Here A = {a_1, a_2, …, a_n} is the set of actions of the sequential decoder over the n time steps; the action a_t is used to select subtask node v_π(t) from the DAG task and assign it its priority. S = {s_1, s_2, …, s_n} is the set of states over the n time steps; the state s_t at time step t comprises the embedded representations H of all subtask nodes of the DAG task and the set O_t of subtasks with assigned priorities, i.e. s_t = (H, O_t). R indicates the environment's immediate return value, used to evaluate the effect of a selection action of the sequential decoder. Since the goal of DAG task scheduling is to minimize the task scheduling length, in this embodiment a reward function r_t is designed based on the deceleration (slowdown) evaluation index:

r_t = L_{t−1}(π)/L^lb_{t−1}(π) − L_t(π)/L^lb_t(π),    (10)
where L_t(π) denotes the scheduling length, at time step t, of the schedule covering the subtasks up to v_π(t) under the DAG task priority ordering π, and L^lb_t(π) denotes the lower bound of that task scheduling length at time step t. On a parallel computing system consisting of m computing nodes, L^lb_t(π) can be calculated based on the length of the critical path, with the specific formula:

L^lb_t(π) = max( cp_t(π), W_t(π)/m ),    (11)

where cp_t(π) is the critical-path length up to subtask v_π(t) determined by the priority ordering π, and W_t(π) is the sum of the computation loads of all subtasks up to v_π(t) determined by π.
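The slowdown index and reward of formulas (10)–(11) can be sketched in a few lines of pure Python; the concrete numbers below are an illustrative example, not values from the embodiment.

```python
def schedule_lower_bound(cp_len, total_load, m):
    """Formula (11): no schedule can finish before the critical path,
    nor before the total load divided evenly over the m computing nodes."""
    return max(cp_len, total_load / m)

def slowdown(schedule_len, cp_len, total_load, m):
    """Deceleration (slowdown) index: actual schedule length over its
    lower bound; 1.0 marks a provably optimal schedule."""
    return schedule_len / schedule_lower_bound(cp_len, total_load, m)

def step_reward(prev_slowdown, cur_slowdown):
    """Formula (10) in sketch form: the decrease in slowdown between two
    consecutive time steps (positive when the partial schedule improves)."""
    return prev_slowdown - cur_slowdown

# example: total load 8 on m = 2 nodes, critical-path length 4
lb = schedule_lower_bound(4.0, 8.0, 2)   # max(4, 8/2) = 4
s = slowdown(6.0, 4.0, 8.0, 2)           # 6 / 4 = 1.5
```

Normalizing by the lower bound rather than rewarding raw length makes rewards comparable across DAG tasks of very different sizes, which matters when one model is trained on a whole data set.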
In the MDP, the state transition matrix Π is assumed to be deterministic: for a given state and action, the next state is not random, because a scheduling action does not execute the task; it only affects the scheduling policy by changing the ordering of the tasks. Finally, the discount factor δ is set to the constant 1. According to formulas (4) and (10), using a policy-gradient-based algorithm with the goal of maximizing the expectation of the accumulated reward corresponding to the DAG task priority ordering π, the objective function J of the network model is defined as:

J(θ) = E_{π ∼ p_θ(·|G)} [ ∑_{t=1}^{n} r_t ],    (12)

where π ∼ p_θ(·|G) indicates that the DAG task priority ordering is sampled from the learned policy.
Step S12: and acquiring a DAG task data set, and generating a corresponding information matrix for each DAG task in the DAG task data set.
In this embodiment, a DAG task data set for model training is obtained, and then an information matrix of each DAG task in the DAG task data set, including a node feature matrix and an adjacency matrix, is extracted. Specifically, in this embodiment, the generating a corresponding information matrix for each DAG task in the DAG task data set may include: generating a node characteristic matrix according to the characteristics of each subtask in the DAG task data set; the node feature matrix is a feature matrix for representing the calculation load and the normalization calculation load of the subtasks and the in-degree and out-degree of the subtasks; generating an adjacency matrix according to the connection relation between different subtasks in the DAG task data set; and obtaining an information matrix corresponding to the DAG task based on the node characteristic matrix and the adjacency matrix.
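A minimal sketch of this information-matrix extraction follows; the four features per node match the description above (computation load, normalized load, in-degree, out-degree), while the helper name and the toy diamond DAG are illustrative.

```python
def build_info_matrix(loads, edges):
    """Build the information matrix for one DAG task: a node feature
    matrix X holding, per subtask, its computation load, the load
    normalized by the total load, its in-degree and its out-degree,
    plus the adjacency matrix A of the dependency edges."""
    n = len(loads)
    A = [[0] * n for _ in range(n)]
    for u, v in edges:
        A[u][v] = 1                        # edge u -> v: u must finish before v
    total = sum(loads)
    X = []
    for i in range(n):
        indeg = sum(A[j][i] for j in range(n))
        outdeg = sum(A[i][j] for j in range(n))
        X.append([loads[i], loads[i] / total, indeg, outdeg])
    return X, A

# diamond DAG: 0 -> 1, 0 -> 2, 1 -> 3, 2 -> 3
X, A = build_info_matrix([2.0, 1.0, 1.0, 2.0], [(0, 1), (0, 2), (1, 3), (2, 3)])
```

The in-degree and out-degree columns let the network distinguish entry, join and exit subtasks even before any message passing, while the normalized load keeps features on a comparable scale across DAG tasks of different total size.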
In this embodiment, the acquiring of the DAG task data set may include: configuring DAG task parameters, where the DAG task parameters comprise the number of task layers, the number of child nodes of a target node, the generation probability of child nodes of the target node, the probability of adding a connecting edge between two adjacent task layers, and the computation load of each subtask; and generating DAG tasks according to the DAG task parameters to obtain the DAG task data set. Owing to the lack of a publicly available large-scale DAG task data set, in this embodiment DAG tasks are generated first; specifically, a DAG task can be obtained using a parallel task generation model based on the DAG task parameters, synthesizing each DAG task in the style of a nested fork-join task model. The model is controlled by four parameters: n_depth, n_child, p_fork and p_pert. Here, n_depth represents the number of DAG task layers, i.e. the depth; n_child governs the number of child nodes of a node; p_fork represents the probability that a node generates child nodes; and p_pert is the probability of randomly adding connecting edges between nodes in two adjacent layers. For each subtask node t_i in the k-th layer, its child nodes t_j and edges e_ij are generated with probability p_fork; the number of child nodes in the (k+1)-th layer is determined by a uniform distribution over n_child, i.e. the number of child nodes under the target node is drawn from a uniform distribution. The process starts with the entry subtask node and repeats n_depth times, thereby creating a DAG task with n_depth layers. Furthermore, with probability p_pert, connecting edges are randomly added between the k-th-layer and (k+1)-th-layer nodes of a DAG task; the larger the value of p_pert, the higher the parallelism of the generated DAG task.
Finally, edges from the last-layer nodes to the exit node are added, and a computation load is assigned to each subtask. The computation load of a subtask follows a normal distribution with parameters μ (μ > 0) and δ, where μ represents the average computation load of the subtasks and δ the standard deviation of the subtask computation loads; of course, other distributions may be assumed, as long as the computation load of each subtask is guaranteed to be a positive value, and the distribution is not limited here. For each constructed DAG task, the features of each subtask node are extracted to build the node feature matrix X, and the adjacency matrix A is built according to the interconnection relations among the nodes.
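The generation procedure described above can be sketched as follows. The connectivity fallback for layers where no node forks, the clipping that keeps sampled loads positive, and the fixed seed are illustrative assumptions added to make the sketch self-contained.

```python
import random

def generate_dag(n_depth, n_child, p_fork, p_pert, mu, delta, seed=0):
    """Sketch of the nested fork-join generator: layer by layer, each node
    forks children with probability p_fork (child count uniform in
    1..n_child), extra edges between adjacent layers are added with
    probability p_pert, and every subtask gets a positive computation
    load drawn from a normal distribution with mean mu and std delta."""
    rng = random.Random(seed)
    layers = [[0]]                # layer 0 holds the single entry subtask
    edges = []
    next_id = 1
    for _ in range(n_depth - 1):
        new_layer = []
        for node in layers[-1]:
            if rng.random() < p_fork:
                for _ in range(rng.randint(1, n_child)):
                    edges.append((node, next_id))
                    new_layer.append(next_id)
                    next_id += 1
        if not new_layer:                          # keep the DAG connected
            edges.append((layers[-1][0], next_id))
            new_layer.append(next_id)
            next_id += 1
        for u in layers[-1]:                       # perturbation edges
            for v in new_layer:
                if (u, v) not in edges and rng.random() < p_pert:
                    edges.append((u, v))
        layers.append(new_layer)
    exit_node = next_id                            # connect last layer to exit
    edges += [(u, exit_node) for u in layers[-1]]
    n = next_id + 1
    loads = [max(1e-3, rng.gauss(mu, delta)) for _ in range(n)]
    return n, edges, loads

n, edges, loads = generate_dag(n_depth=4, n_child=3, p_fork=0.8, p_pert=0.2,
                               mu=10.0, delta=2.0)
```

Because node identifiers are assigned layer by layer, every edge points from a lower id to a higher one, which guarantees the generated graph is acyclic by construction.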
Step S13: and training the network model by using the information matrix, and updating model parameters of the network model by using reinforcement learning according to the objective function so as to obtain a trained DAG task scheduling model.
In this embodiment, after the network model is constructed, the model parameters are initialized first: the parameters W of each layer of the directed graph neural network are initialized according to a specific strategy, such as random initialization from a normal distribution, Xavier initialization or He initialization, and the model parameters θ of the sequential decoder are initialized likewise.
Then the network model is trained with the information matrices corresponding to the DAG task data set. After the DAG task data set is obtained, it is divided into a training set and a test set; partitioning methods such as cross-validation, the hold-out method or the leave-one-out method may be adopted. The training set is used for training the network model, and the test set is used for testing the trained network model.
In this embodiment, the training the network model by using the information matrix and updating the model parameters of the network model by using reinforcement learning according to the objective function may include:
s130: inputting the information matrix into the network model, and outputting vector representation of each subtask by using the directed graph neural network according to the characteristics of the subtasks and the dependency relationship among the subtasks;
s131: prioritizing, with the sequential decoder, the subtasks within the DAG task according to the vector representation of the subtasks based on an attention mechanism and a context environment of the DAG task;
s132: calculating the task scheduling length of the DAG task by utilizing a DAG task scheduling simulator according to the priority sequence;
S133: and updating the model parameters of the network model by utilizing reinforcement learning according to the task scheduling length and the objective function until the network model is converged.
The node feature matrix and the adjacency matrix contained in the information matrix serve as the input of the network model for forward propagation: vector representations of all subtasks are obtained through the directed graph neural network, the sequential decoder outputs the priority ordering of the subtasks, the DAG task scheduling simulator schedules the subtasks for execution in that order and the corresponding scheduling length is calculated, and the model objective function value is then computed according to formula (12). According to a chosen strategy, such as stochastic gradient descent or the Adam algorithm, the network parameter values of each layer are corrected through back propagation. Thus, using a reinforcement learning algorithm aimed at minimizing the DAG task scheduling length, the network model is continuously optimized by rewarding DAG task priority orderings with shorter scheduling lengths; the resulting scheduling length is therefore shorter and the parallel computing efficiency higher. This also effectively avoids the difficulty of collecting enough supervision labels for optimal priority assignment of DAG tasks.
Specifically, training the network model involves the gradient of the objective function J defined by equation (12) with respect to the parameters θ:

∇_θ J(θ) = E_{π ∼ p_θ(·|G)} [ (∑_{t=1}^{n} r_t) ∇_θ log p_θ(π | G) ],    (13)

where ∇_θ is the gradient operator. The model gradient in equation (13) can be estimated with the Monte Carlo stochastic gradient descent method:

∇_θ J(θ) ≈ (1/|B|) ∑_{G ∈ B} (∑_{t=1}^{n} r_t) ∇_θ log p_θ(π | G),    (14)

where B represents the set of DAG tasks obtained by random sampling from the data set. The objective function is optimized with stochastic gradient descent or the Adam algorithm, and model training terminates when the objective function value no longer decreases or the maximum number of iterations is reached; the scheduling scheme obtained at that point is the optimal scheduling scheme. That is, the objective function gradient is estimated based on Monte Carlo stochastic gradient descent. In this way, deep reinforcement learning on DAG tasks is realized in this embodiment based on the directed graph neural network and the objective function. The DAG task scheduling length is obtained by having the DAG task scheduling simulator schedule all subtasks onto the parallel computing system in the given order, execute them in parallel, and record the completion time of the exit task.
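The policy-gradient update of formulas (13)–(14) can be sketched end to end on a toy policy. The per-node score parameterization, the toy reward that simply prefers node 0 first, the learning rate and the iteration count are illustrative assumptions, not the embodiment's model.

```python
import math
import random

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def sample_order(theta, rng):
    """Sample a priority ordering pi from the factorized policy of
    formula (4): repeatedly softmax the scores of the not-yet-chosen
    nodes and sample one, recording d log p / d theta at each step."""
    remaining = list(range(len(theta)))
    order, logp_grads = [], []
    while remaining:
        probs = softmax([theta[i] for i in remaining])
        k = rng.choices(range(len(remaining)), weights=probs)[0]
        # softmax log-gradient: 1 - p for the chosen node, -p otherwise
        grads = {i: ((1.0 if j == k else 0.0) - probs[j])
                 for j, i in enumerate(remaining)}
        logp_grads.append(grads)
        order.append(remaining.pop(k))
    return order, logp_grads

def reinforce_step(theta, reward_fn, rng, lr=0.1):
    """One Monte Carlo policy-gradient update (formulas (13)-(14)):
    grad J ~ R(pi) * grad log p_theta(pi)."""
    order, logp_grads = sample_order(theta, rng)
    R = reward_fn(order)
    for grads in logp_grads:
        for i, g in grads.items():
            theta[i] += lr * R * g
    return order, R

def reward_fn(order):           # toy reward: schedule node 0 first
    return 1.0 if order[0] == 0 else -1.0

rng = random.Random(0)
theta = [0.0, 0.0, 0.0]
for _ in range(300):
    reinforce_step(theta, reward_fn, rng)
```

After a few hundred updates the score of node 0 dominates, so the sampled orderings almost always place it first — the same mechanism that, in the embodiment, steers the decoder toward orderings with shorter scheduling lengths.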
Step S14: and determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model, and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence.
In this embodiment, after the DAG task scheduling model is obtained through training, the node feature matrix and the adjacency matrix of the DAG task to be executed are input to the model, the optimal DAG task scheduling order obtained by the model is output as a result, and the DAG task to be executed is executed by using the parallel computing system according to the scheduling order. Therefore, for the non-preemptive scheduling problem of the DAG task, the task priority ordering is carried out based on the deep reinforcement learning and the directed graph neural network, the scheduling sequence of the tasks is further determined, the execution time of the tasks is reduced, and the execution efficiency of the tasks is improved.
This embodiment also provides a DAG task scheduling system based on deep reinforcement learning and the directed graph neural network. As shown in fig. 2, the system is composed of an input module, a directed graph neural network, a sequential decoder, a scheduling length calculation module, and a model parameter update module. The input module is responsible for reading the node feature matrix X and the adjacency matrix A of the DAG task; the directed graph neural network takes X and A as input, identifies the execution times and dependency relationships of the DAG task, and learns the embedded representations of the subtasks; these embedded representations are decoded by the sequential decoder, which outputs the priority ordering of all subtasks; the scheduling length calculation module schedules the subtasks for execution on the parallel computing system in that order, takes the scheduling length as a feedback signal, and updates the model parameters with a reinforcement learning algorithm. Thus, the DAG task scheduling system based on deep reinforcement learning and the directed graph neural network takes a DAG task as input, generates an embedded representation for each of its subtasks through the directed graph neural network, generates the priority ordering of all subtasks with the sequential decoder, and calculates the task scheduling length, or completion time, corresponding to that ordering. The system targets minimizing the scheduling length of the DAG tasks, which is used as a reward signal to update the model through a reinforcement learning algorithm.
As can be seen from the above, in this embodiment, a network model is constructed according to the order of the directed graph neural network and the sequential decoder, and an objective function of the network model is defined with the minimum task scheduling length as an objective; acquiring a DAG task data set, and generating a corresponding information matrix for each DAG task in the DAG task data set; training the network model by using the information matrix, and updating model parameters of the network model by using reinforcement learning according to the objective function to obtain a trained DAG task scheduling model; and determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model, and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence. According to the method, a DAG task scheduling model is obtained based on a directed graph neural network and reinforcement learning, the directed graph neural network can automatically identify rich characteristics related to subtasks in the DAG task, a sequence decoder can use the characteristics to perform task priority sequencing on the subtasks, meanwhile, a scheduling target of minimizing the DAG task scheduling length is achieved by using a reinforcement learning optimization model, the DAG task scheduling length can be shortened, the parallel execution efficiency of the DAG task is improved, and the problem that enough supervision labels are difficultly collected for optimal priority distribution of the DAG task can be solved by using reinforcement learning.
Correspondingly, an embodiment of the present application further discloses a DAG task scheduling device, as shown in fig. 5, the device includes:
the network construction module 11 is configured to construct a network model according to the sequence of the directed graph neural network and the sequential decoder, and define an objective function of the network model with a minimum task scheduling length as an objective;
the data set acquisition module 12 is configured to acquire a DAG task data set, and generate a corresponding information matrix for each DAG task in the DAG task data set;
the training module 13 is configured to train the network model by using the information matrix, and update model parameters of the network model by using reinforcement learning according to the objective function to obtain a trained DAG task scheduling model;
and a scheduling order determining module 14, configured to determine a scheduling order of sub-tasks within the DAG task to be executed by using the DAG task scheduling model, and execute the DAG task to be executed by using the parallel computing system according to the scheduling order.
As can be seen from the above, in this embodiment, a network model is constructed according to the sequence of the directed graph neural network and the sequential decoder, and an objective function of the network model is defined with the minimum task scheduling length as an objective; acquiring a DAG task data set, and generating a corresponding information matrix for each DAG task in the DAG task data set; training the network model by using the information matrix, and updating model parameters of the network model by using reinforcement learning according to the objective function to obtain a trained DAG task scheduling model; and determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model, and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence. According to the method, a DAG task scheduling model is obtained based on the directed graph neural network and reinforcement learning, the directed graph neural network can automatically identify rich characteristics related to subtasks in the DAG task, a sequence decoder can utilize the characteristics to perform task priority sequencing on the subtasks, meanwhile, a scheduling target of minimizing the DAG task scheduling length is achieved by utilizing the reinforcement learning optimization model, the DAG task scheduling length can be shortened, the DAG task parallel execution efficiency is improved, and the problem that enough supervision labels are difficult to collect for optimal priority distribution of the DAG task can be solved by utilizing the reinforcement learning.
In some specific embodiments, the DAG task scheduling device may specifically include:
the graph convolution layer construction unit is used for constructing a graph convolution layer for DAG task feature learning based on the aggregation function and the nonlinear activation function;
and the directed graph neural network construction unit is used for constructing and obtaining the directed graph neural network according to the sequence of the input layer, the K-layer graph convolution layer and the output layer.
In some specific embodiments, the DAG task scheduling device may specifically include:
the vector expression definition unit is used for defining a vector expression of a context environment for the DAG task by taking the priority distribution state of the subtasks in the DAG task as a variable;
a sequential decoder construction unit for constructing a sequential decoder for priority ordering based on an attention mechanism and a vector expression of the context environment to obtain the decoder.
In some specific embodiments, the network building module 11 may specifically include:
the deceleration evaluation index construction unit is used for generating a deceleration evaluation index of the DAG task by taking the task scheduling length corresponding to the priority sequence of the DAG task at different time steps and the lower limit of the task scheduling length as arguments; the lower limit of the task scheduling length is determined according to the path length of the key path of the DAG task;
The reward function construction unit is used for constructing a reward function based on a strategy gradient algorithm and the deceleration evaluation index;
and the target function construction unit is used for constructing a target function of the network model based on the reward function.
In some embodiments, the data set obtaining module 12 may specifically include:
the task parameter configuration unit is used for configuring DAG task parameters; the DAG task parameters comprise the number of task layers, the number of sub-nodes of a target node, the generation probability of the sub-nodes of the target node, the adding probability of a connecting edge between two adjacent task layers and the calculation load of each sub-task;
and the task generation unit is used for generating a DAG task according to the DAG task parameters to obtain the DAG task data set.
In some embodiments, the data set obtaining module 12 may specifically include:
a node feature matrix generating unit, configured to generate a node feature matrix according to a feature of each subtask in the DAG task data set;
the adjacency matrix generation unit is used for generating an adjacency matrix according to the connection relation between different subtasks in the DAG task data set;
and the information matrix determining unit is used for obtaining an information matrix corresponding to the DAG task based on the node characteristic matrix and the adjacency matrix.
In some embodiments, the training module 13 may specifically include:
the vector representation determining unit is used for inputting the information matrix into the network model and outputting the vector representation of each subtask by utilizing the directed graph neural network according to the characteristics of the subtasks and the dependency relationship among the subtasks;
a prioritization determination unit to prioritize, with the order decoder, subtasks within the DAG task according to the vector representation of the subtasks based on an attention mechanism and a context environment of the DAG task;
a task scheduling length determining unit, configured to calculate a task scheduling length of the DAG task by using a DAG task scheduling simulator according to the priority order;
and the model optimization unit is used for updating the model parameters of the network model by utilizing reinforcement learning according to the task scheduling length and the objective function until the network model is converged.
Further, the embodiment of the present application also discloses an electronic device, which is shown in fig. 6, and the content in the drawing cannot be considered as any limitation to the application scope.
Fig. 6 is a schematic structural diagram of an electronic device 20 according to an embodiment of the present disclosure. The electronic device 20 may specifically include: at least one processor 21, at least one memory 22, a power supply 23, a communication interface 24, an input output interface 25, and a communication bus 26. Wherein the memory 22 is used for storing a computer program, which is loaded and executed by the processor 21 to implement the relevant steps in the DAG task scheduling method disclosed in any of the foregoing embodiments.
In this embodiment, the power supply 23 is configured to provide an operating voltage for each hardware device on the electronic device 20; the communication interface 24 can create a data transmission channel between the electronic device 20 and an external device, and a communication protocol followed by the communication interface is any communication protocol that can be applied to the technical solution of the present application, and is not specifically limited herein; the input/output interface 25 is configured to acquire external input data or output data to the outside, and a specific interface type thereof may be selected according to specific application requirements, which is not specifically limited herein.
In addition, the memory 22 can be a read-only memory, a random access memory, a magnetic disk or an optical disk as a carrier for storing resources, the resources stored thereon include an operating system 221, a computer program 222, and data 223 including DAG tasks, and the storage manner can be transient storage or permanent storage.
The operating system 221 is configured to manage and control each hardware device and the computer program 222 on the electronic device 20, so as to implement the operation and processing of the mass data 223 in the memory 22 by the processor 21, and may be Windows Server, Netware, Unix, Linux, or the like. The computer programs 222 may further include computer programs that can be used to perform other specific tasks in addition to the computer programs that can be used to perform the DAG task scheduling method performed by the electronic device 20 disclosed in any of the foregoing embodiments.
Further, an embodiment of the present application further discloses a computer storage medium, where computer-executable instructions are stored in the computer storage medium, and when the computer-executable instructions are loaded and executed by a processor, the steps of the DAG task scheduling method disclosed in any of the foregoing embodiments are implemented.
The embodiments are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same or similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
Finally, it should also be noted that, in this document, relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
The DAG task scheduling method, apparatus, device and medium provided by the present invention are described in detail above, and specific examples are applied herein to explain the principles and embodiments of the present invention, and the description of the above embodiments is only used to help understanding the method and its core idea of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (15)

1. A DAG task scheduling method, comprising:
constructing a network model according to the sequence of the directed graph neural network and the sequence decoder, and defining an objective function of the network model by taking the minimum task scheduling length as an objective;
acquiring a DAG task data set, and generating a corresponding information matrix for each DAG task in the DAG task data set;
training the network model by using the information matrix, and updating model parameters of the network model by using reinforcement learning according to the objective function to obtain a trained DAG task scheduling model;
and determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model, and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence.
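The final step of claim 1, executing the prioritized subtasks on a parallel computing system, corresponds to classic list scheduling. The sketch below is illustrative only: the function name, identical processors, and zero communication cost are assumptions, not part of the claim.

```python
from collections import defaultdict

def list_schedule(edges, loads, priority_order, num_procs):
    """Run the subtasks of one DAG on identical processors in the given
    priority order, respecting precedence constraints, and return the
    task scheduling length (makespan).  Communication costs are ignored,
    and priority_order is assumed to be a valid topological order."""
    preds = defaultdict(list)
    for u, v in edges:
        preds[v].append(u)
    finish = {}                       # subtask -> finish time
    proc_free = [0.0] * num_procs     # earliest idle time per processor
    for t in priority_order:
        ready = max((finish[p] for p in preds[t]), default=0.0)
        proc = min(range(num_procs), key=lambda k: max(proc_free[k], ready))
        start = max(proc_free[proc], ready)
        finish[t] = start + loads[t]
        proc_free[proc] = finish[t]
    return max(finish.values())
```

Given a diamond-shaped DAG `0 -> {1, 2} -> 3` with loads `[1, 2, 3, 1]` on two processors, the priority order `[0, 1, 2, 3]` yields a scheduling length of 5.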
2. The DAG task scheduling method of claim 1, wherein before constructing the network model according to the sequence of the directed graph neural network and the sequential decoder, the method further comprises:
constructing a graph convolution layer for DAG task feature learning based on an aggregation function and a nonlinear activation function;
and constructing the directed graph neural network according to the sequence of the input layer, the K-layer graph convolution layer and the output layer.
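As a rough illustration of claim 2's graph convolution layer, the sketch below aggregates predecessor features and applies a ReLU nonlinearity. The concrete aggregation function, weight shapes, and choice of activation are assumptions, since the claim does not fix them.

```python
import numpy as np

def directed_gcn_layer(H, A, W):
    """One graph convolution layer over a DAG: every subtask (node) sums
    the features of its predecessors (via the transposed adjacency
    matrix), concatenates them with its own features, and applies a ReLU
    nonlinear activation.  H: (n, d) node features, A: (n, n) adjacency
    with A[u, v] = 1 for edge u -> v, W: (2d, d') trainable weights."""
    aggregated = A.T @ H                        # aggregation function
    combined = np.concatenate([H, aggregated], axis=1)
    return np.maximum(combined @ W, 0.0)        # nonlinear activation
```

Stacking K such layers between an input layer and an output layer yields a directed graph neural network of the shape described in claim 2.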
3. The DAG task scheduling method of claim 1, wherein before constructing the network model according to the sequence of the directed graph neural network and the sequential decoder, the method further comprises:
taking the priority distribution state of the subtasks in the DAG task as a variable, and defining a vector expression of a context environment for the DAG task;
constructing a sequential decoder for prioritization based on an attention mechanism and a vector expression of the context environment.
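A minimal illustration of the claimed attention-based sequential decoder might look as follows. The dot-product scoring, mean-pooled context vector, and greedy selection are assumptions chosen for brevity, not the claimed construction itself.

```python
import numpy as np

def decode_priorities(node_emb, w_ctx):
    """Greedily decode a priority order: at each step, build a context
    vector from the already-prioritized subtasks (here, their mean
    embedding), score every subtask against it with dot-product
    attention, mask the assigned ones, and emit the best-scoring task."""
    n, d = node_emb.shape
    order, assigned = [], np.zeros(n, dtype=bool)
    for _ in range(n):
        context = node_emb[order].mean(axis=0) if order else np.zeros(d)
        scores = node_emb @ (w_ctx @ context)
        scores[assigned] = -np.inf            # never pick a task twice
        exp = np.exp(scores - scores.max())
        probs = exp / exp.sum()               # softmax over open subtasks
        nxt = int(np.argmax(probs))
        order.append(nxt)
        assigned[nxt] = True
    return order
```

During training the next subtask would be sampled from `probs` rather than taken greedily, so that the policy gradient of claim 4 has something to explore.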
4. The DAG task scheduling method of claim 1, wherein defining an objective function of the network model with a minimum task scheduling length as an objective comprises:
taking the task scheduling length corresponding to the priority sequence of the DAG task at different time steps and the lower limit of the task scheduling length as independent variables to generate a deceleration evaluation index of the DAG task; the lower limit of the task scheduling length is determined according to the path length of the key path of the DAG task;
constructing a reward function based on a strategy gradient algorithm and the deceleration evaluation index;
and constructing an objective function of the network model based on the reward function.
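The deceleration index of claim 4 can be illustrated as the ratio of the achieved scheduling length to its critical-path lower limit. The concrete formulas below are an assumed instantiation (subtasks numbered in topological order, computation loads only, no communication costs).

```python
def critical_path_length(edges, loads):
    """Lower limit of the task scheduling length: the heaviest
    computation-load path through the DAG.  Subtasks are assumed to be
    numbered in topological order, so sorting edges by source suffices."""
    longest = list(loads)             # heaviest path ending at each node
    for u, v in sorted(edges):
        longest[v] = max(longest[v], longest[u] + loads[v])
    return max(longest)

def slowdown(schedule_length, lower_limit):
    """Deceleration evaluation index: ratio of the achieved scheduling
    length to its lower limit; 1.0 means the schedule is optimal."""
    return schedule_length / lower_limit

def reward(schedule_length, lower_limit):
    """Reward for the policy gradient: the negative slowdown, so that
    maximizing reward minimizes the task scheduling length."""
    return -slowdown(schedule_length, lower_limit)
```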
5. The DAG task scheduling method of claim 1, wherein the obtaining a DAG task dataset comprises:
configuring DAG task parameters; the DAG task parameters comprise the number of task layers, the number of sub-nodes of a target node, the sub-node generation probability of the target node, the probability of adding a connecting edge between two adjacent task layers, and the calculation load of each subtask;
and generating a DAG task according to the DAG task parameters to obtain the DAG task data set.
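Claim 5's parameterized DAG generation can be sketched roughly as follows. The exact sampling scheme is an assumption, with the claimed parameters mapped onto hypothetical argument names.

```python
import random

def generate_layered_dag(num_layers, max_children, p_child, p_edge,
                         load_range=(1, 10), seed=0):
    """Sample one layered DAG task.  The arguments mirror the claimed
    parameters: number of task layers, sub-node count per target node,
    sub-node generation probability, probability of adding a connecting
    edge between two adjacent task layers, and per-subtask load."""
    rng = random.Random(seed)
    layers = [[0]]
    loads = [rng.randint(*load_range)]
    edges, nid = [], 1
    for _ in range(1, num_layers):
        layer = []
        for parent in layers[-1]:
            for _ in range(max_children):
                if rng.random() < p_child:
                    layer.append(nid)
                    loads.append(rng.randint(*load_range))
                    edges.append((parent, nid))
                    nid += 1
        if not layer:                 # keep every layer non-empty
            layer.append(nid)
            loads.append(rng.randint(*load_range))
            edges.append((layers[-1][0], nid))
            nid += 1
        for u in layers[-1]:          # extra edges between adjacent layers
            for v in layer:
                if (u, v) not in edges and rng.random() < p_edge:
                    edges.append((u, v))
        layers.append(layer)
    return layers, edges, loads
```

Sampling many such DAGs with varied parameters produces the DAG task data set used for training.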
6. The DAG task scheduling method of claim 1, wherein generating a corresponding information matrix for each DAG task in the DAG task data set comprises:
generating a node characteristic matrix according to the characteristics of each subtask in the DAG task data set;
generating an adjacency matrix according to the connection relation between different subtasks in the DAG task data set;
and obtaining an information matrix corresponding to the DAG task based on the node characteristic matrix and the adjacency matrix.
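The information matrix of claim 6 can be illustrated by concatenating a node characteristic matrix with the adjacency matrix. The chosen node characteristics (computation load, in-degree, out-degree) and the concatenation layout are assumptions for illustration.

```python
import numpy as np

def information_matrix(num_tasks, edges, loads):
    """Combine a node characteristic matrix with the adjacency matrix
    into one information matrix for a DAG task.  The characteristics
    used here -- computation load, in-degree, out-degree -- are
    illustrative stand-ins for the claimed subtask features."""
    A = np.zeros((num_tasks, num_tasks))
    for u, v in edges:
        A[u, v] = 1.0                        # connection relation u -> v
    features = np.stack([np.asarray(loads, dtype=float),
                         A.sum(axis=0),      # in-degree of each subtask
                         A.sum(axis=1)],     # out-degree of each subtask
                        axis=1)
    return np.concatenate([features, A], axis=1)   # shape (n, 3 + n)
```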
7. The DAG task scheduling method according to any one of claims 1 to 6, wherein training the network model by using the information matrix and updating the model parameters of the network model by using reinforcement learning according to the objective function comprises:
inputting the information matrix into the network model, and outputting a vector representation of each subtask by using the directed graph neural network according to the characteristics of the subtasks and the dependency relationship among the subtasks;
prioritizing, with the sequential decoder, the subtasks within the DAG task according to the vector representation of the subtasks based on an attention mechanism and a context environment of the DAG task;
calculating the task scheduling length of the DAG task by utilizing a DAG task scheduling simulator according to the priority sequence;
and updating the model parameters of the network model by using reinforcement learning according to the task scheduling length and the objective function until the network model converges.
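The reinforcement-learning update in claim 7 is in the spirit of REINFORCE. A single hypothetical parameter update with a baseline might look like this; the gradient of the log-probability of the sampled priority order is taken as given, since computing it belongs to the network model itself.

```python
import numpy as np

def reinforce_update(theta, grad_log_prob, schedule_length, lower_limit,
                     baseline, lr=0.01):
    """One REINFORCE step: move the model parameters along the gradient
    of the log-probability of the sampled priority order, scaled by how
    much the (negative) slowdown of that order beats a baseline.  The
    schedule_length would come from a DAG task scheduling simulator."""
    advantage = -schedule_length / lower_limit - baseline
    return theta + lr * advantage * grad_log_prob
```

Iterating this update over batches of DAG tasks until the objective stops improving gives the trained DAG task scheduling model.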
8. A DAG task scheduling apparatus, comprising:
the network construction module is used for constructing a network model according to the sequence of the directed graph neural network and the sequential decoder, and defining an objective function of the network model by taking the minimum task scheduling length as an objective;
the data set acquisition module is used for acquiring a DAG task data set and generating a corresponding information matrix for each DAG task in the DAG task data set;
the training module is used for training the network model by using the information matrix and updating model parameters of the network model by using reinforcement learning according to the objective function so as to obtain a trained DAG task scheduling model;
and the scheduling sequence determining module is used for determining the scheduling sequence of the subtasks in the DAG task to be executed by using the DAG task scheduling model and executing the DAG task to be executed by using a parallel computing system according to the scheduling sequence.
9. The DAG task scheduling device of claim 8, further comprising:
the graph convolution layer construction unit is used for constructing a graph convolution layer for DAG task feature learning based on the aggregation function and the nonlinear activation function;
and the directed graph neural network construction unit is used for constructing and obtaining the directed graph neural network according to the sequence of the input layer, the K-layer graph convolution layer and the output layer.
10. The DAG task scheduling device of claim 8, further comprising:
the vector expression definition unit is used for defining a vector expression of a context environment for the DAG task by taking the priority distribution state of the subtasks in the DAG task as a variable;
and the sequential decoder construction unit is used for constructing a sequential decoder for priority ordering based on an attention mechanism and the vector expression of the context environment.
11. The DAG task scheduler of claim 8, wherein the network construction module comprises:
the deceleration evaluation index construction unit is used for generating a deceleration evaluation index of the DAG task by taking the task scheduling length corresponding to the priority sequence of the DAG task at different time steps and the lower limit of the task scheduling length as independent variables; the lower limit of the task scheduling length is determined according to the path length of the key path of the DAG task;
the reward function construction unit is used for constructing a reward function based on a strategy gradient algorithm and the deceleration evaluation index;
and the target function construction unit is used for constructing a target function of the network model based on the reward function.
12. The DAG task scheduler of claim 8, wherein the dataset acquisition module comprises:
a task parameter configuration unit, configured to configure DAG task parameters; the DAG task parameters comprise the number of task layers, the number of sub-nodes of a target node, the generation probability of the sub-nodes of the target node, the adding probability of a connecting edge between two adjacent task layers and the calculation load of each sub-task;
and the task generation unit is used for generating the DAG task according to the DAG task parameters so as to obtain the DAG task data set.
13. The DAG task scheduler of claim 8, wherein the dataset acquisition module comprises:
a node characteristic matrix generating unit, configured to generate a node characteristic matrix according to the characteristics of each subtask in the DAG task data set;
the adjacency matrix generating unit is used for generating an adjacency matrix according to the connection relation between different subtasks in the DAG task data set;
and the information matrix determining unit is used for obtaining an information matrix corresponding to the DAG task based on the node characteristic matrix and the adjacency matrix.
14. An electronic device, comprising:
a memory for storing a computer program;
a processor for executing the computer program to implement the DAG task scheduling method of any of claims 1 to 7.
15. A computer-readable storage medium for storing a computer program; wherein the computer program, when executed by a processor, implements the DAG task scheduling method of any of claims 1 to 7.
CN202210671115.8A 2022-06-15 2022-06-15 DAG task scheduling method, device, equipment and storage medium Active CN114756358B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202210671115.8A CN114756358B (en) 2022-06-15 2022-06-15 DAG task scheduling method, device, equipment and storage medium
PCT/CN2022/142437 WO2023241000A1 (en) 2022-06-15 2022-12-27 Dag task scheduling method and apparatus, device, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210671115.8A CN114756358B (en) 2022-06-15 2022-06-15 DAG task scheduling method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN114756358A true CN114756358A (en) 2022-07-15
CN114756358B CN114756358B (en) 2022-11-04

Family

ID=82337171

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210671115.8A Active CN114756358B (en) 2022-06-15 2022-06-15 DAG task scheduling method, device, equipment and storage medium

Country Status (2)

Country Link
CN (1) CN114756358B (en)
WO (1) WO2023241000A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116151315A (en) * 2023-04-04 2023-05-23 之江实验室 Attention network scheduling optimization method and device for on-chip system
CN116739090A (en) * 2023-05-12 2023-09-12 北京大学 Deep neural network reasoning measurement method and device based on Web browser
CN116755397A (en) * 2023-05-26 2023-09-15 北京航空航天大学 Multi-machine collaborative task scheduling method based on graph convolution strategy gradient
CN116880994A (en) * 2023-09-07 2023-10-13 之江实验室 Multiprocessor task scheduling method, device and equipment based on dynamic DAG
CN116974729A (en) * 2023-09-22 2023-10-31 浪潮(北京)电子信息产业有限公司 Task scheduling method and device for big data job, electronic equipment and storage medium
WO2023241000A1 (en) * 2022-06-15 2023-12-21 苏州元脑智能科技有限公司 Dag task scheduling method and apparatus, device, and storage medium

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117555306B (en) * 2024-01-11 2024-04-05 天津斯巴克斯机电有限公司 Digital twinning-based multi-production-line task self-adaptive scheduling method and system
CN117648174B (en) * 2024-01-29 2024-04-05 华北电力大学 Cloud computing heterogeneous task scheduling and container management method based on artificial intelligence

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111756653A (en) * 2020-06-04 2020-10-09 北京理工大学 Multi-coflow scheduling method based on deep reinforcement learning of graph neural network
CN113127169A (en) * 2021-04-07 2021-07-16 中山大学 Efficient link scheduling method for dynamic workflow in data center network
CN114239711A (en) * 2021-12-06 2022-03-25 中国人民解放军国防科技大学 Node classification method based on heterogeneous information network small-sample learning
US20220100763A1 (en) * 2020-09-30 2022-03-31 Microsoft Technology Licensing, Llc Optimizing job runtimes via prediction-based token allocation
CN114327925A (en) * 2021-09-30 2022-04-12 国网山东省电力公司营销服务中心(计量中心) Power data real-time calculation scheduling optimization method and system
CN114422381A (en) * 2021-12-14 2022-04-29 西安电子科技大学 Communication network flow prediction method, system, storage medium and computer equipment

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106228314A (en) * 2016-08-11 2016-12-14 电子科技大学 The workflow schedule method of study is strengthened based on the degree of depth
US20200042856A1 (en) * 2018-07-31 2020-02-06 International Business Machines Corporation Scheduler for mapping neural networks onto an array of neural cores in an inference processing unit
CN114625517A (en) * 2022-04-13 2022-06-14 北京赛博云睿智能科技有限公司 DAG graph computation distributed big data workflow task scheduling platform
CN114756358B (en) * 2022-06-15 2022-11-04 苏州浪潮智能科技有限公司 DAG task scheduling method, device, equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Chengzi Zhazha (橙子渣渣): "[Paper Analysis] Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning" *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023241000A1 (en) * 2022-06-15 2023-12-21 苏州元脑智能科技有限公司 Dag task scheduling method and apparatus, device, and storage medium
CN116151315A (en) * 2023-04-04 2023-05-23 之江实验室 Attention network scheduling optimization method and device for on-chip system
CN116151315B (en) * 2023-04-04 2023-08-15 之江实验室 Attention network scheduling optimization method and device for on-chip system
CN116739090A (en) * 2023-05-12 2023-09-12 北京大学 Deep neural network reasoning measurement method and device based on Web browser
CN116739090B (en) * 2023-05-12 2023-11-28 北京大学 Deep neural network reasoning measurement method and device based on Web browser
CN116755397A (en) * 2023-05-26 2023-09-15 北京航空航天大学 Multi-machine collaborative task scheduling method based on graph convolution strategy gradient
CN116755397B (en) * 2023-05-26 2024-01-23 北京航空航天大学 Multi-machine collaborative task scheduling method based on graph convolution strategy gradient
CN116880994A (en) * 2023-09-07 2023-10-13 之江实验室 Multiprocessor task scheduling method, device and equipment based on dynamic DAG
CN116880994B (en) * 2023-09-07 2023-12-12 之江实验室 Multiprocessor task scheduling method, device and equipment based on dynamic DAG
CN116974729A (en) * 2023-09-22 2023-10-31 浪潮(北京)电子信息产业有限公司 Task scheduling method and device for big data job, electronic equipment and storage medium
CN116974729B (en) * 2023-09-22 2024-02-09 浪潮(北京)电子信息产业有限公司 Task scheduling method and device for big data job, electronic equipment and storage medium

Also Published As

Publication number Publication date
WO2023241000A1 (en) 2023-12-21
CN114756358B (en) 2022-11-04

Similar Documents

Publication Publication Date Title
CN114756358B (en) DAG task scheduling method, device, equipment and storage medium
Tong et al. QL-HEFT: a novel machine learning scheduling scheme base on cloud computing environment
CN109976909B (en) Learning-based low-delay task scheduling method in edge computing network
Alkhanak et al. A hyper-heuristic cost optimisation approach for scientific workflow scheduling in cloud computing
US10754709B2 (en) Scalable task scheduling systems and methods for cyclic interdependent tasks using semantic analysis
Zhang et al. DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling
CN113778646B (en) Task level scheduling method and device based on execution time prediction
Khajemohammadi et al. Efficient workflow scheduling for grid computing using a leveled multi-objective genetic algorithm
US7689978B2 (en) System and method for autonomic optimization by computer programs
Sinclair et al. Hindsight learning for mdps with exogenous inputs
CN114139730B (en) Dynamic pricing and deployment method for machine learning tasks in edge cloud network
Fan et al. Dras: Deep reinforcement learning for cluster scheduling in high performance computing
Vahidipour et al. Priority assignment in queuing systems with unknown characteristics using learning automata and adaptive stochastic Petri nets
CN116069473A (en) Deep reinforcement learning-based Yarn cluster workflow scheduling method
Tuli et al. Optimizing the performance of fog computing environments using ai and co-simulation
Durán et al. From static to dynamic analysis and allocation of resources for BPMN processes
CN113220437B (en) Workflow multi-target scheduling method and device
Femmam et al. Labelled evolutionary Petri nets/genetic algorithm based approach for workflow scheduling in cloud computing
CN114327925A (en) Power data real-time calculation scheduling optimization method and system
Franke et al. On advantages of scheduling using genetic fuzzy systems
Alkhanak et al. A hyper-heuristic approach using a prioritized selection strategy for workflow scheduling in cloud computing
Brueckner et al. Swarming Polyagents Executing Hierarchical Task Networks
Ge et al. Capability-based project scheduling with genetic algorithms
Hegde et al. COUNSEL: Cloud Resource Configuration Management using Deep Reinforcement Learning
Prasad et al. Adaptive smoothed functional algorithms for optimal staffing levels in service systems

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant