WO2019223283A1 - Combinatorial optimization scheduling method for predicting task execution time - Google Patents

Combinatorial optimization scheduling method for predicting task execution time Download PDF

Info

Publication number
WO2019223283A1
WO2019223283A1 PCT/CN2018/118871 CN2018118871W WO2019223283A1 WO 2019223283 A1 WO2019223283 A1 WO 2019223283A1 CN 2018118871 W CN2018118871 W CN 2018118871W WO 2019223283 A1 WO2019223283 A1 WO 2019223283A1
Authority
WO
WIPO (PCT)
Prior art keywords
execution
task
node
combination
completion time
Prior art date
Application number
PCT/CN2018/118871
Other languages
French (fr)
Chinese (zh)
Inventor
郭乃网
田英杰
苏运
陈睿
宋岩
沈泉江
庞天宇
方炯
杨洪山
Original Assignee
国网上海市电力公司
华东电力试验研究院有限公司
星环信息科技(上海)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 国网上海市电力公司, 华东电力试验研究院有限公司, 星环信息科技(上海)有限公司 filed Critical 国网上海市电力公司
Priority to JP2018566566A priority Critical patent/JP2020524313A/en
Publication of WO2019223283A1 publication Critical patent/WO2019223283A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/51Discovery or management thereof, e.g. service location protocol [SLP] or web services
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/147Network analysis or design for predicting network behaviour
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources

Definitions

  • the present application relates to an offline job scheduling method, for example, to a combined optimized scheduling method for predicting the execution time of a task.
  • Power supply and distribution big data applications need to realize massive data real-time processing, rely on related parallel processing technologies, and also emphasize the flexibility, reliability, manageability and economy of related computing and storage capabilities.
  • the running time of a MapReduce job is determined by the sum of the running time of the longest running Map task and the longest running Reduce task. Therefore, it is necessary to minimize the running time of the job. It is necessary to minimize the maximum running time of the Map task and the running time of the Reduce task. Therefore, how to minimize the running time of MapReduce jobs becomes a Min-Max optimization problem.
  • the related technology can basically minimize the running time of the job, but in the optimal case, it cannot effectively increase the performance gain, and it is not well applicable to the heterogeneity and dynamics of resources in a distributed computing environment.
  • the present application provides a combined optimal scheduling method for predicting the execution time of a task.
  • the bandwidth between task nodes is small, and the processing capabilities of task nodes are heterogeneous. Data locality needs to be considered when scheduling prediction execution tasks. Aiming at the heterogeneity and dynamics of resources in a distributed computing environment, a series of methods and systems for resource monitoring have been generated. At the same time, methods such as the execution time model of tasks in MapReduce jobs and the sampling operation of sample tasks to predict the processing capability of nodes on tasks have also achieved good application results. Therefore, fine-grained resource information such as bandwidth between nodes and the ability to process tasks in a distributed computing environment can be used to optimize predictive execution task scheduling.
  • the present application relates to a combined optimization scheduling method for predicting the execution time of an execution task.
  • the method includes the following steps S1 to S6.
  • step S1 a combination optimization-based prediction execution technology (Combination, Re-Execution, Speculative Technology, CREST) combination optimization prediction execution task scheduling model;
  • CREST includes: (i) the CREST combined optimized predictive execution task scheduling model and (ii) the CREST combined re-execution scheduling algorithm and its implementation. Because there is a direct network connection between each computing node, that is, communication between any two points does not need to be forwarded by a third node, therefore, a complete graph is used to represent the network topology structure diagram and task migration diagram of the nodes.
  • the CREST combined optimized predictive execution task scheduling model is a directed complete graph. Each edge represents a possible migration path, that is, the task running at the starting point of the edge is ended, and it is migrated to the end point of the edge and re-executed.
  • CREST A combined re-execution mechanism.
  • the combined re-execution scheme can be regarded as a loop-free path from a slow task node to an idle node. For each edge on the path, the task at its starting point is migrated to its end point for re-execution, and it is postponed in order to keep the slow task on the slow task node to continue execution.
  • there is no exchange job between nodes in the combined re-execution mechanism that is, there is no loop in the path, because the data of the job that has begun to run has been transmitted to the node local, and the load of the Map task is approximately the same. The requirements of data locality have been taken into consideration, so the interchange operation between nodes does not bring performance gains.
  • step S2 fine-grained resource information of bandwidth between nodes and processing capabilities of the nodes is introduced.
  • T u represents Map tasks running on nodes u
  • d (T u) represents the processed data T u
  • D represents ( T u )
  • PR v represents the progress rate at which a node completes this type of Map task
  • bw (u, v) represents the bandwidth of the network connection between nodes u and v.
  • step S3 obtaining the estimated completion time of the rescheduled task.
  • the Map task executed on a given node u is rescheduled for execution on node v, then its estimated completion time (ExpectedTimetoFinish, ETF) is represented by t ′ etf (u, v), which is defined as:
  • t c represents the current time.
  • the data transmission time t data_movement can be obtained by the formula:
  • d (T u ) can be transmitted from nodes other than u to v.
  • a copy optimization selection strategy is used to speed up the transmission.
  • step S4 obtaining a re-execution slow task combination scheme.
  • s represents a slow task running node
  • f represents an idle node
  • PATH (s, f) represents a path from s to f
  • PATH (s, f) cascades all tasks containing the starting node in the edge.
  • Migration to the termination node for execution is defined as a combination re-execution scheme of slow tasks along PATH (s, f).
  • Slow task combination re-execution estimated completion time is: given the slow task combination re-execution of a given edge, its estimated completion time is defined as the maximum estimated completion time of all migration jobs, and its expression is:
  • step S5 obtaining an optimal combination re-execution scheme for slow tasks.
  • the optimal slow task combination execution plan is defined as the combination re-execution plan with the smallest predicted completion time, which is expressed in CRES, that is, the estimated optimal completion time of the slow task optimal combination re-execution is:
  • t spec (CRES) min (t spec (PATH (s, f))) forallpathconnects, f ...
  • step S3 The objective equation of the combination and re-execution optimization can be obtained by integrating step S3 as:
  • step S6 the weight is adjusted to obtain a shortened running time of the predicted execution task.
  • This application uses the CREST technology to reduce the average execution time of the predicted execution task by more than 50%. In the best case, this performance gain can reach 70%. At the same time, when the CREST technology is used, there is a probability of more than 50%. The performance gain of more than 40%, with the increase of the number of copies factor, the performance improvement using CREST technology has further increased.
  • This application introduces fine-grained resource information of inter-node bandwidth and node processing capabilities to design a combination optimization mechanism that can meet the data locality requirements of predictive execution tasks and is well applicable to the heterogeneity of resources in distributed computing environments. And dynamic.
  • FIG. 1 is a flowchart of a combined optimal scheduling method for predicting execution time of an execution task according to an embodiment of the present application.
  • the present application relates to a combined optimization scheduling method for predicting the execution time of a task, including the following steps 110 to 160.
  • step 110 based on the CREST technology, a combined optimized predictive execution task scheduling model is combined.
  • step 120 fine-grained resource information of bandwidth between nodes and processing capabilities of the nodes is collected.
  • the fine-grained resource information of node processing capabilities mainly includes Map tasks running on the nodes, the data processed by the Map tasks, the size of the data processed by the Map tasks, the progress rate of the nodes to complete Map tasks of this type, and The bandwidth of the network connection between nodes.
  • step 130 the estimated completion time of the rescheduled task is obtained according to the fine-grained resource information of the inter-node bandwidth and the processing capability of the node.
  • step 140 a re-execution slow task combination scheme is designed, and an estimated completion time of the slow task combination re-execution is obtained.
  • step 150 an optimal combined re-execution plan for slow tasks is designed, and an objective equation of combined re-execution optimization is obtained according to the estimated completion time of the re-scheduled task and the estimated completion time of the combined re-execution of the slow tasks.
  • step 160 a weight is set, and the target execution equation of the combined re-execution optimization is used to obtain an effectively shortened prediction execution task running time.
  • the bandwidth between task nodes is small, and the processing capabilities of the task nodes are heterogeneous, and data locality needs to be considered when scheduling prediction execution tasks.
  • Aiming at the heterogeneity and dynamics of resources in a distributed computing environment a series of methods and systems for resource monitoring have been generated.
  • methods such as the execution time model of tasks in MapReduce jobs and the sampling operation of sample tasks to predict the processing capacity of nodes on the tasks have also achieved good application results. Therefore, fine-grained resource information such as bandwidth between nodes and the ability to process tasks in a distributed computing environment can be used to optimize predictive execution task scheduling.
  • CREST is an optimization technology that uses fine-grained resource information to meet the data locality requirements of predictive execution tasks through a combination of optimization mechanisms, eliminates the time overhead of data transmission, and significantly reduces the execution time of the entire job Map phase.
  • CREST technology includes two parts: CREST combined optimized predictive execution task scheduling model and CREST combined re-execution scheduling algorithm and its implementation. Because there is a direct network connection between each computing node, that is, communication between any two points does not need to be forwarded by a third node, we can use a complete graph to represent the network topology structure diagram and task migration diagram of the nodes.
  • the CREST combined optimized predictive execution task scheduling model is a directed complete graph.
  • Each edge represents a possible migration path, that is, the task running at the starting point of the edge is ended, and it is migrated to the end point of the edge and re-executed.
  • LATE the execution of tasks
  • CREST the mechanism for predicting the execution of tasks
  • CREST A combined re-execution mechanism.
  • the combined re-execution scheme can be regarded as an acyclic path from a slow task node to an idle node in the above figure (directed acyclic graph). For each edge on the path, the task at its starting point will be migrated to its end point and re-executed, which will be postponed in order, but the slow task on the slow task node will continue to execute.
  • there is no exchange job between nodes in the combined re-execution mechanism that is, there is no loop in the path, which is reasonable in reality, because the data of the job that has started running has been transmitted to the node local, and the Map The load of the tasks is roughly the same. The data locality requirements have been taken into account in the initial assignment of the jobs, so swapping jobs between nodes does not bring performance gains.
  • CREST Compared with LATE, CREST introduces fine-grained resource information between node bandwidth and node processing capabilities, and designs a combined optimization mechanism to meet the data locality requirements of predictive execution tasks.
  • the set T u represents Map tasks running on nodes u
  • d (T u) represents the processed data T u
  • PR v represents the progress rate (ProgressRate) of a node completing this type of Map task
  • bw (u, v) represents the bandwidth of the network connection between nodes u and v.
  • a Map task executed on a given node u is rescheduled for execution on node v, and its estimated completion time t ′ etf (u, v) is defined as follows:
  • the data transmission time t data_movement can be obtained by the formula:
  • t c represents the current time. It should be noted that d (T u ) can also be transmitted from nodes other than u to v. We use the copy optimization selection strategy in the algorithm implementation to speed up the transmission.
  • Slow task combination re-execution scheme given s represents a slow task running node, f represents an idle node, PATH (s, f) represents a path from s to f, and PATH (s, f) includes all along the cascade
  • the tasks of the starting node in the edge are migrated to the terminating node for execution, which is defined as a combination re-execution scheme of slow tasks along PATH (s, f).
  • Slow task combination re-execution estimated completion time Given the slow task combination re-execution of a given edge, its estimated completion time is defined as the maximum estimated completion time of all migration jobs as follows:
  • PATH (s, f) is a path from s to f
  • t ′ etf (u, v) is the estimated completion time of the rescheduled task
  • the meaning of the formula is: the estimated completion time of the slow task combination re-execution is defined as The maximum expected completion time of re-scheduled tasks for all directed edges in the path from s to f, that is, the maximum expected completion time of all migration jobs.
  • Slow task optimal combination re-execution scheme given s represents a slow task running node, f represents an idle node. PATH (s, f) is a path from s to f.
  • the optimal slow task combination execution plan is defined as the combination re-execution plan with the smallest predicted completion time, which is expressed in CRES, as follows:
  • t spec (CRES) min (t spec (PATH (s, f))) forallpathconnects, f
  • t spec (CRES) be the estimated completion time of the slow task prediction execution obtained along the optimal combination re-execution scheme.
  • forallpathconnects represents all path connection combinations.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Disclosed is a combinatorial optimization scheduling method for predicting task execution time, comprising: combinatorially optimizing a predictive execution task scheduling model on the basis of combination re-execution scheduling technology (CREST); on the basis of the predictive execution task scheduling model, collecting fine-grained resource information of inter-node bandwidth and node processing capacity; obtaining an expected completion time of a rescheduling task according to the fine-grained resource information of inter-node bandwidth and node processing capacity; designing a scheme for re-executing a slow task combination, and obtaining an expected completion time of slow task combination re-execution; designing a slow task optimal combination re-execution scheme, and obtaining a target equation for combination re-execution optimization according to the expected completion time of the re-scheduling task and the expected completion time of the slow task combination re-execution; and setting weights, and obtaining an effectively shortened predictive execution task running time using the target equation for combination re-execution optimization.

Description

预测执行任务执行时间的组合优化调度方法Combination optimal scheduling method for predicting execution time of execution tasks
本申请要求在2018年05月24日提交中国专利局、申请号为201810510271.X的中国专利申请的优先权,该申请的全部内容通过引用结合在本申请中。This application claims priority from a Chinese patent application filed with the Chinese Patent Office on May 24, 2018 with application number 201810510271.X, the entire contents of which are incorporated herein by reference.
技术领域Technical field
本申请涉及一种离线作业调度方法,例如涉及一种预测执行任务执行时间的组合优化调度方法。The present application relates to an offline job scheduling method, for example, to a combined optimized scheduling method for predicting the execution time of a task.
背景技术Background technique
供配电大数据应用需要实现海量数据实时处理,需要依赖相关并行处理技术,同时还强调相关计算与存储能力的灵活性、可靠性、可管理性和经济性。其中,在对类似MapReduce等离线作业进行调度时,MapReduce作业的运行时间由执行时间最长的Map任务和运行时间最长的Reduce任务的运行时间之和决定,因此,要使作业的运行时间最短,就要最小化Map任务的运行时间和Reduce任务的运行时间的最大值。因此,如何最小化MapReduce作业运行时间转化为一个Min-Max优化问题。相关技术可实现作业运行时间基本的最小化,但在最优情况下无法有效地增加性能增益,且不能良好地适用于分布计算环境中的资源的异构性和动态性。Power supply and distribution big data applications need to realize massive data real-time processing, rely on related parallel processing technologies, and also emphasize the flexibility, reliability, manageability and economy of related computing and storage capabilities. Among them, when scheduling offline jobs such as MapReduce, the running time of a MapReduce job is determined by the sum of the running time of the longest running Map task and the longest running Reduce task. Therefore, it is necessary to minimize the running time of the job. It is necessary to minimize the maximum running time of the Map task and the running time of the Reduce task. Therefore, how to minimize the running time of MapReduce jobs becomes a Min-Max optimization problem. The related technology can basically minimize the running time of the job, but in the optimal case, it cannot effectively increase the performance gain, and it is not well applicable to the heterogeneity and dynamics of resources in a distributed computing environment.
发明内容Summary of the Invention
本申请为了克服上述相关技术存在的缺陷而提供一种预测执行任务执行时间的组合优化调度方法。In order to overcome the shortcomings of the foregoing related technologies, the present application provides a combined optimal scheduling method for predicting the execution time of a task.
由于Reduce任务数目比较少,任务需要传输的数据集较小,其输入数据需要从所有Map任务获取,不存在数据本地性问题。因此,采用CREST技术可降低所有Map任务中慢任务的运行时间,因而问题可以进一步转化为如何最小化Map慢任务的预测执行任务的执行时间t specBecause the number of Reduce tasks is relatively small, the data set to be transmitted by the task is small, and its input data needs to be obtained from all Map tasks. There is no data locality problem. Therefore, the use of CREST technology can reduce the running time of slow tasks in all Map tasks, so the problem can be further transformed into how to minimize the execution time t spec of predicted execution tasks of Map slow tasks.
在一个支持离线调度的网络中,任务节点之间的带宽较小,任务节点的处理能力异构,需要在调度预测执行任务时考虑数据本地性需求。针对分布计算环境中的资源的异构性和动态性,产生了一系列资源监控的方法和系统。同时, 关于MapReduce作业中任务的执行时间模型和通过样本任务的抽样运行来预测节点对任务的处理能力等方法也取得了很好的应用效果。因此,分布计算环境中节点间的带宽和处理任务的能力等细粒度资源信息可以用于优化预测执行任务调度。In a network that supports offline scheduling, the bandwidth between task nodes is small, and the processing capabilities of task nodes are heterogeneous. Data locality needs to be considered when scheduling prediction execution tasks. Aiming at the heterogeneity and dynamics of resources in a distributed computing environment, a series of methods and systems for resource monitoring have been generated. At the same time, methods such as the execution time model of tasks in MapReduce jobs and the sampling operation of sample tasks to predict the processing capability of nodes on tasks have also achieved good application results. Therefore, fine-grained resource information such as bandwidth between nodes and the ability to process tasks in a distributed computing environment can be used to optimize predictive execution task scheduling.
基于上述内容,本申请涉及一种预测执行任务执行时间的组合优化调度方法,该方法包括以下步骤S1至步骤S6。Based on the foregoing, the present application relates to a combined optimization scheduling method for predicting the execution time of an execution task. The method includes the following steps S1 to S6.
在步骤S1中:基于组合优化的预测执行技术(Combination Re-Execution Speculative Technology,CREST)组合优化预测执行任务调度模型;In step S1: a combination optimization-based prediction execution technology (Combination, Re-Execution, Speculative Technology, CREST) combination optimization prediction execution task scheduling model;
CREST包含:(i)CREST组合优化预测执行任务调度模型和(ii)CREST组合重执行调度算法及其实现。由于每个计算节点之间都存在直接网络连接,即任意两点的通讯不需要第三个节点转发,因此,采用完全图表示节点的网络拓扑结构图和任务迁移图。CREST组合优化预测执行任务调度模型是一个有向完全图,每条边代表一个可能的迁移路径,即将该边始点上运行的任务结束,迁移到该边的终点上重新执行。CREST includes: (i) the CREST combined optimized predictive execution task scheduling model and (ii) the CREST combined re-execution scheduling algorithm and its implementation. Because there is a direct network connection between each computing node, that is, communication between any two points does not need to be forwarded by a third node, therefore, a complete graph is used to represent the network topology structure diagram and task migration diagram of the nodes. The CREST combined optimized predictive execution task scheduling model is a directed complete graph. Each edge represents a possible migration path, that is, the task running at the starting point of the edge is ended, and it is migrated to the end point of the edge and re-executed.
预测执行任务的调度通常存在LATE和CREST两种机制。There are two mechanisms for predictive execution task scheduling: LATE and CREST.
1)预测任务最长完成时间的调度机制(Longest Approximate Time to End,LATE):直接在空闲资源上运行预测执行任务。1) Schedule mechanism for predicting the longest completion time of a task (Longest, Approximate, Time, End): Run the predictive execution task directly on idle resources.
2)CREST:一种组合重执行机制,其组合重执行方案可视为从慢任务节点到空闲节点的一条无环路径。对该路径上的每条边,其始点上的任务被迁移到其终点上重新执行,依次顺延,保留慢任务节点上的慢任务继续执行。假设组合重执行机制中不存在节点之间互换作业的情况,即路径中不存在环路,因已经开始运行的作业其数据已经传输到节点本地,且Map任务的负载大致相当,初始分配作业时已经考虑到了数据本地性的要求,因此节点之间互换作业并不会带来性能收益。2) CREST: A combined re-execution mechanism. The combined re-execution scheme can be regarded as a loop-free path from a slow task node to an idle node. For each edge on the path, the task at its starting point is migrated to its end point for re-execution, and it is postponed in order to keep the slow task on the slow task node to continue execution. Assume that there is no exchange job between nodes in the combined re-execution mechanism, that is, there is no loop in the path, because the data of the job that has begun to run has been transmitted to the node local, and the load of the Map task is approximately the same. The requirements of data locality have been taken into consideration, so the interchange operation between nodes does not bring performance gains.
在步骤S2中:引入节点间带宽和节点处理能力的细粒度资源信息。In step S2: fine-grained resource information of bandwidth between nodes and processing capabilities of the nodes is introduced.
对有向完全图的边(u,v)而言,假设T u表示节点u上运行的Map任务,d(T u)表示T u所处理的数据,|d(T u)|表示d(T u)的大小,PR v表示节点完成该类型Map任务的进度速率(Progress Rate),bw(u,v)表示节点u,v之间网络连接的带宽。 For a directed edge complete graph (u, v), it is assumed that T u represents Map tasks running on nodes u, d (T u) represents the processed data T u, | d (T u) | D represents ( T u ), PR v represents the progress rate at which a node completes this type of Map task, and bw (u, v) represents the bandwidth of the network connection between nodes u and v.
在步骤S3中:获取重调度任务的预计完成时间。In step S3: obtaining the estimated completion time of the rescheduled task.
给定节点u上执行的Map任务重新调度到节点v上执行,则其预计完成时间(ExpectedTimetoFinish,ETF)用t′ etf(u,v)表示,定义为式: The Map task executed on a given node u is rescheduled for execution on node v, then its estimated completion time (ExpectedTimetoFinish, ETF) is represented by t ′ etf (u, v), which is defined as:
Figure PCTCN2018118871-appb-000001
Figure PCTCN2018118871-appb-000001
其中,t c表示当前时刻。数据传输时间t data_movement可由公式得到: Among them, t c represents the current time. The data transmission time t data_movement can be obtained by the formula:
需要指出的是,d(T u)可以从u以外的节点传输到v,在算法实现中使用副本优化选择策略来加速传输。 It should be pointed out that d (T u ) can be transmitted from nodes other than u to v. In the algorithm implementation, a copy optimization selection strategy is used to speed up the transmission.
在步骤S4中:获取重执行慢任务组合方案。In step S4: obtaining a re-execution slow task combination scheme.
给定s表示慢任务运行节点,f表示空闲节点,PATH(s,f)表示从s到f的一条路径,PATH(s,f)沿着级联地将所有包含边中起始节点的任务迁移到终止节点上执行,定义为沿PATH(s,f)的慢任务的组合重执行方案。Given s represents a slow task running node, f represents an idle node, PATH (s, f) represents a path from s to f, and PATH (s, f) cascades all tasks containing the starting node in the edge. Migration to the termination node for execution is defined as a combination re-execution scheme of slow tasks along PATH (s, f).
慢任务组合重执行预计完成时间为:给定沿的慢任务组合重执行,则其预计完成时间定义为所有迁移作业的预计完成时间的最大值,其表达式为:Slow task combination re-execution estimated completion time is: given the slow task combination re-execution of a given edge, its estimated completion time is defined as the maximum estimated completion time of all migration jobs, and its expression is:
Figure PCTCN2018118871-appb-000003
Figure PCTCN2018118871-appb-000003
在步骤S5中:获取慢任务最优组合重执行方案。In step S5: obtaining an optimal combination re-execution scheme for slow tasks.
最优慢任务组合执方案定义为预测完成时间最小的组合重新执行方案,用CRES表示,即慢任务最优组合重执行预计完成时间为:The optimal slow task combination execution plan is defined as the combination re-execution plan with the smallest predicted completion time, which is expressed in CRES, that is, the estimated optimal completion time of the slow task optimal combination re-execution is:
t spec(CRES)=min(t spec(PATH(s,f)))forallpathconnects,f... t spec (CRES) = min (t spec (PATH (s, f))) forallpathconnects, f ...
则沿最优组合重执行方案所取得的慢任务预测执行的预计完成时间为:Then the estimated completion time of the slow task prediction execution obtained along the optimal combination re-execution scheme is:
t cres=min(t spec(PATH(s,f)))forallpathconnects,f t cres = min (t spec (PATH (s, f))) forallpathconnects, f
综合步骤S3可获取组合重执行优化的目标方程为:The objective equation of the combination and re-execution optimization can be obtained by integrating step S3 as:
Figure PCTCN2018118871-appb-000004
Figure PCTCN2018118871-appb-000004
在步骤S6中:调节权重,获取缩短的预测执行任务运行时间。In step S6: the weight is adjusted to obtain a shortened running time of the predicted execution task.
将任务调度模型图中有向边(u,v)的权重设置为t′ etf(u,v),该权值大于零,则慢任务最优组合重执行方案在任务调度模型图中上体现为一条最优路径(CRES),该路径的权值不是所含边的权值算术和,而是所含边中权值的最大值。优化后即可获取有效缩短的预测执行任务运行时间。 Set the weight of the directed edge (u, v) in the task scheduling model graph to t ′ etf (u, v), and the weight is greater than zero, then the optimal combination of slow task re-execution scheme is reflected in the task scheduling model graph. Is an optimal path (CRES), and the weight of the path is not the arithmetic sum of the weights of the contained edges, but the maximum value of the weights of the contained edges. After optimization, you can effectively shorten the running time of the forecast execution task.
与相关技术相比,本申请具有以下优点:Compared with related technologies, this application has the following advantages:
(1)本申请采用CREST技术平均能够缩短50%以上的预测执行任务运行时间,在最优情况下,这一性能增益可达70%,同时,采用CREST技术时,有 超过50%的概率获得40%以上的性能收益,随着副本数因子增大,采用CREST技术的性能提升幅度进一步变大。(1) This application uses the CREST technology to reduce the average execution time of the predicted execution task by more than 50%. In the best case, this performance gain can reach 70%. At the same time, when the CREST technology is used, there is a probability of more than 50%. The performance gain of more than 40%, with the increase of the number of copies factor, the performance improvement using CREST technology has further increased.
(2)本申请引入了节点间带宽和节点处理能力的细粒度资源信息来设计组合优化机制,可满足预测执行任务的数据本地性需求,并良好地适用于分布计算环境中的资源的异构性和动态性。(2) This application introduces fine-grained resource information of inter-node bandwidth and node processing capabilities to design a combination optimization mechanism that can meet the data locality requirements of predictive execution tasks and is well applicable to the heterogeneity of resources in distributed computing environments. And dynamic.
附图概述Overview of the drawings
图1为本申请实施例提供的一种预测执行任务执行时间的组合优化调度方法的流程图。FIG. 1 is a flowchart of a combined optimal scheduling method for predicting execution time of an execution task according to an embodiment of the present application.
具体实施方式Detailed ways
下面结合附图和具体实施例对本申请进行详细说明。The following describes the present application in detail with reference to the drawings and specific embodiments.
实施例Examples
如图1所示,本申请涉及一种预测执行任务执行时间的组合优化调度方法,包括以下步骤110至步骤160。As shown in FIG. 1, the present application relates to a combined optimization scheduling method for predicting the execution time of a task, including the following steps 110 to 160.
在步骤110中,基于CREST技术,组合优化预测执行任务调度模型。In step 110, based on the CREST technology, a combined optimized predictive execution task scheduling model is combined.
在步骤120中,采集节点间带宽和节点处理能力的细粒度资源信息。In step 120, fine-grained resource information of bandwidth between nodes and processing capabilities of the nodes is collected.
其中,节点处理能力细粒度的资源信息,主要包括节点上运行的Map任务、所述Map任务所处理的数据、所述Map任务所处理的数据的大小、节点完成该类型Map任务的进度速率以及节点之间网络连接的带宽。The fine-grained resource information of node processing capabilities mainly includes Map tasks running on the nodes, the data processed by the Map tasks, the size of the data processed by the Map tasks, the progress rate of the nodes to complete Map tasks of this type, and The bandwidth of the network connection between nodes.
在步骤130中,根据节点间带宽和节点处理能力的细粒度资源信息获取重调度任务的预计完成时间。In step 130, the estimated completion time of the rescheduled task is obtained according to the fine-grained resource information of the inter-node bandwidth and the processing capability of the node.
在步骤140中,设计重执行慢任务组合方案,获取慢任务组合重执行预计完成时间。In step 140, a re-execution slow task combination scheme is designed, and an estimated completion time of the slow task combination re-execution is obtained.
在步骤150中,设计慢任务最优组合重执行方案,根据所述重调度任务的预计完成时间及所述慢任务组合重执行预计完成时间,获取组合重执行优化的目标方程。In step 150, an optimal combined re-execution plan for slow tasks is designed, and an objective equation of combined re-execution optimization is obtained according to the estimated completion time of the re-scheduled task and the estimated completion time of the combined re-execution of the slow tasks.
在步骤160中,设置权值,利用组合重执行优化的目标方程获取有效缩短的预测执行任务运行时间。In step 160, a weight is set, and the target execution equation of the combined re-execution optimization is used to obtain an effectively shortened prediction execution task running time.
在一个支持离线调度的网络中,任务节点之间的带宽较小,任务节点的处 理能力异构,需要在调度预测执行任务时考虑数据本地性需求。针对分布计算环境中的资源的异构性和动态性,产生了一系列资源监控的方法和系统。同时,关于MapReduce作业中任务的执行时间模型和通过样本任务的抽样运行来预测节点对任务的处理能力等方法也取得了很好的应用效果。因此,分布计算环境中节点间的带宽和处理任务的能力等细粒度资源信息可以用于优化预测执行任务调度。In a network that supports offline scheduling, the bandwidth between task nodes is small, and the processing capabilities of the task nodes are heterogeneous, and data locality needs to be considered when scheduling prediction execution tasks. Aiming at the heterogeneity and dynamics of resources in a distributed computing environment, a series of methods and systems for resource monitoring have been generated. At the same time, methods such as the execution time model of tasks in MapReduce jobs and the sampling operation of sample tasks to predict the processing capacity of nodes on the tasks have also achieved good application results. Therefore, fine-grained resource information such as bandwidth between nodes and the ability to process tasks in a distributed computing environment can be used to optimize predictive execution task scheduling.
CREST是一种借助细粒度资源信息,通过组合优化机制满足预测执行任务的数据本地性需求,消除数据传输的时间开销,进而大幅降低整个作业Map阶段执行时间的优化技术。CREST技术包含CREST组合优化预测执行任务调度模型和CREST组合重执行调度算法及其实现两部分。由于每个计算节点之间都存在直接网络连接,即任意两点的通讯不需要第三个节点转发,因此,我们可采用完全图表示节点的网络拓扑结构图和任务迁移图。CREST组合优化预测执行任务调度模型是一个有向完全图,每条边代表一个可能的迁移路径,即将该边始点上运行的任务结束,迁移到该边的终点上重新执行。通常,预测执行任务的调度,存在LATE和CREST两种机制。CREST is an optimization technology that uses fine-grained resource information to meet the data locality requirements of predictive execution tasks through a combination of optimization mechanisms, eliminates the time overhead of data transmission, and significantly reduces the execution time of the entire job Map phase. CREST technology includes two parts: CREST combined optimized predictive execution task scheduling model and CREST combined re-execution scheduling algorithm and its implementation. Because there is a direct network connection between each computing node, that is, communication between any two points does not need to be forwarded by a third node, we can use a complete graph to represent the network topology structure diagram and task migration diagram of the nodes. The CREST combined optimized predictive execution task scheduling model is a directed complete graph. Each edge represents a possible migration path, that is, the task running at the starting point of the edge is ended, and it is migrated to the end point of the edge and re-executed. Generally, there are two mechanisms for predicting the execution of tasks: LATE and CREST.
1)LATE:直接在空闲资源上运行预测执行任务。1) LATE: run the predictive execution task directly on the idle resource.
2)CREST:一种组合重执行机制,其组合重执行方案在上图(有向无环图)中可视为从慢任务节点到空闲节点的一条无环路径。对该路径上的每条边,其始点上的任务会被迁移到其终点上重新执行,依次顺延,但是保留慢任务节点上的慢任务继续执行。我们假设组合重执行机制中不存在节点之间互换作业的情况,即路径中不存在环路,这在现实中是合理的,因为已经开始运行的作业其数据已经传输到节点本地,并且Map任务的负载大致相当,初始分配作业时已经考虑到了数据本地性的要求,因此节点之间互换作业并不会带来性能收益。2) CREST: A combined re-execution mechanism. The combined re-execution scheme can be regarded as an acyclic path from a slow task node to an idle node in the above figure (directed acyclic graph). For each edge on the path, the task at its starting point will be migrated to its end point and re-executed, which will be postponed in order, but the slow task on the slow task node will continue to execute. We assume that there is no exchange job between nodes in the combined re-execution mechanism, that is, there is no loop in the path, which is reasonable in reality, because the data of the job that has started running has been transmitted to the node local, and the Map The load of the tasks is roughly the same. The data locality requirements have been taken into account in the initial assignment of the jobs, so swapping jobs between nodes does not bring performance gains.
相对于LATE,CREST引入了节点间带宽和节点处理能力的细粒度资源信息,并且设计了组合优化机制满足预测执行任务的数据本地性需求。对上图中的边(u,v)而言,设T u表示节点u上运行的Map任务,d(T u)表示T u所处理的数据,|d(T u)|表示d(T u)的大小,PR v表示节点完成该类型Map任务的进度速率(ProgressRate),bw(u,v)表示节点u,v之间网络连接的带宽。 Compared with LATE, CREST introduces fine-grained resource information between node bandwidth and node processing capabilities, and designs a combined optimization mechanism to meet the data locality requirements of predictive execution tasks. Side of the above figure (u, v), the set T u represents Map tasks running on nodes u, d (T u) represents the processed data T u, | d (T U) | represents d (T u ), PR v represents the progress rate (ProgressRate) of a node completing this type of Map task, and bw (u, v) represents the bandwidth of the network connection between nodes u and v.
重调度任务的预计完成时间:给定节点u上执行的Map任务重新调度到节点v上执行,则其预计完成时间t′ etf(u,v)的定义如下: Estimated completion time of rescheduled tasks: A Map task executed on a given node u is rescheduled for execution on node v, and its estimated completion time t ′ etf (u, v) is defined as follows:
Figure PCTCN2018118871-appb-000005
Figure PCTCN2018118871-appb-000005
其中数据传输时间t data_movement可由公式得到: The data transmission time t data_movement can be obtained by the formula:
Figure PCTCN2018118871-appb-000006
Figure PCTCN2018118871-appb-000006
其中t c表示当前时刻。需要指出的是,d(T u)也可以从u以外的节点传输到v,我们在算法实现中使用副本优化选择策略来加速传输。 Where t c represents the current time. It should be noted that d (T u ) can also be transmitted from nodes other than u to v. We use the copy optimization selection strategy in the algorithm implementation to speed up the transmission.
慢任务组合重执行方案:给定s表示慢任务运行节点,f表示空闲节点,PATH(s,f)表示从s到f的一条路径,PATH(s,f)沿着级联地将所有包含边中起始节点的任务迁移到终止节点上执行,定义为沿PATH(s,f)的慢任务的组合重执行方案。慢任务组合重执行预计完成时间:给定沿的慢任务组合重执行,则其预计完成时间定义为所有迁移作业的预计完成时间的最大值,如下式:Slow task combination re-execution scheme: given s represents a slow task running node, f represents an idle node, PATH (s, f) represents a path from s to f, and PATH (s, f) includes all along the cascade The tasks of the starting node in the edge are migrated to the terminating node for execution, which is defined as a combination re-execution scheme of slow tasks along PATH (s, f). Slow task combination re-execution estimated completion time: Given the slow task combination re-execution of a given edge, its estimated completion time is defined as the maximum estimated completion time of all migration jobs as follows:
Figure PCTCN2018118871-appb-000007
Figure PCTCN2018118871-appb-000007
其中,PATH(s,f)为从s到f的一条路径,t′ etf(u,v)为重调度任务的预计完成时间;公式的含义为:慢任务组合重执行的预计完成时间定义为所有属于s到f的路径中的有向边的重调度任务的预计完成时间的最大值,即所有迁移作业的预计完成时间的最大值。 Among them, PATH (s, f) is a path from s to f, and t ′ etf (u, v) is the estimated completion time of the rescheduled task; the meaning of the formula is: the estimated completion time of the slow task combination re-execution is defined as The maximum expected completion time of re-scheduled tasks for all directed edges in the path from s to f, that is, the maximum expected completion time of all migration jobs.
慢任务最优组合重执行方案:给定s表示慢任务运行节点,f表示空闲节点。PATH(s,f)从s到f的一条路径,最优慢任务组合执方案定义为预测完成时间最小的组合重新执行方案,用CRES表示,如下式:Slow task optimal combination re-execution scheme: given s represents a slow task running node, f represents an idle node. PATH (s, f) is a path from s to f. The optimal slow task combination execution plan is defined as the combination re-execution plan with the smallest predicted completion time, which is expressed in CRES, as follows:
t spec(CRES)=min(t spec(PATH(s,f)))forallpathconnects,f t spec (CRES) = min (t spec (PATH (s, f))) forallpathconnects, f
令t spec(CRES)表示沿最优组合重执行方案所取得的慢任务预测执行的预计完成时间,见式: Let t spec (CRES) be the estimated completion time of the slow task prediction execution obtained along the optimal combination re-execution scheme.
t cres=min(t spec(PATH(s,f)))forallpathconnects,f t cres = min (t spec (PATH (s, f))) forallpathconnects, f
则由公式可得组合重执行优化的目标方程:Then the formula can be used to obtain the objective equation of the re-execution optimization:
Figure PCTCN2018118871-appb-000008
Figure PCTCN2018118871-appb-000008
其中,forallpathconnects表示所有路径连接组合。Among them, forallpathconnects represents all path connection combinations.
将任务调度模型图中有向边(u,v)的权重设置为t′ etf(u,v),该权值大于零,则慢任务最优组合重执行方案在任务调度模型图中上体现为一条最优路径 (CRES),该路径的权值不是所含边的权值算术和,而是所含边中权值的最大值。优化后即可获取有效缩短的预测执行任务运行时间。 Set the weight of the directed edge (u, v) in the task scheduling model graph to t ′ etf (u, v), and the weight is greater than zero, then the optimal combination of slow task re-execution scheme is reflected in the task scheduling model graph. Is an optimal path (CRES), and the weight of the path is not the arithmetic sum of the weights of the contained edges, but the maximum value of the weights of the contained edges. After optimization, you can effectively shorten the running time of the forecast execution task.
经过大量的实验表明,使用CREST技术平均能够缩短50%以上的预测执行任务运行时间,最优情况下,这一性能增益可达70%。同时,采用CREST技术时,有超过50%的概率获得40%以上的性能收益。随着副本数因子增大,采用CREST技术的性能提升幅度会进一步变大。A large number of experiments have shown that the use of CREST technology can shorten the average execution task execution time by more than 50%, and in the best case, this performance gain can reach 70%. At the same time, when using CREST technology, there is a probability of more than 50% to obtain a performance gain of more than 40%. As the number of copies increases, the performance improvement of using CREST technology will further increase.

Claims (10)

  1. 一种预测执行任务执行时间的组合优化调度方法,包括:A combined optimal scheduling method for predicting the execution time of an execution task includes:
    基于组合优化的预测执行技术CREST,组合优化预测执行任务调度模型;Combined optimization-based predictive execution technology CREST, combined optimized predictive execution task scheduling model;
    基于所述预测执行任务调度模型,采集节点间带宽和节点处理能力的细粒度资源信息;Based on the predictive execution task scheduling model, collecting fine-grained resource information of bandwidth between nodes and processing capabilities of the nodes;
    根据所述节点间带宽和所述节点处理能力的细粒度资源信息获取重调度任务的预计完成时间;Obtaining an estimated completion time of a rescheduling task according to the fine-grained resource information of the inter-node bandwidth and the processing capability of the node;
    设计重执行慢任务组合方案,获取慢任务组合重执行的预计完成时间;Design a re-execution slow task combination scheme and obtain the estimated completion time of the slow task combination re-execution;
    设计慢任务最优组合重执行方案,根据所述重调度任务的预计完成时间及所述慢任务组合重执行的预计完成时间,获取组合重执行优化的目标方程;Designing an optimal combined re-execution plan for slow tasks, and obtaining the target equation for combined re-execution optimization based on the estimated completion time of the re-scheduled task and the estimated completion time of the combined re-execution of the slow task;
    设置权重,利用所述组合重执行优化的目标方程获取有效缩短的预测执行任务运行时间。Set weights, and use the combined target execution optimization to obtain an effectively shortened prediction execution task running time.
  2. 根据权利要求1所述的方法,其中,所述CREST包括:(i)CREST组合优化预测执行任务调度模型和(ii)CREST组合重执行调度算法及其实现,其中,所述CREST组合优化预测执行任务调度模型为一个有向完全图。The method according to claim 1, wherein the CREST comprises: (i) a CREST combined optimized prediction execution task scheduling model and (ii) a CREST combined re-execution scheduling algorithm and implementation thereof, wherein the CREST combined optimized prediction execution The task scheduling model is a directed complete graph.
  3. 根据权利要求2所述的方法,其中,基于CREST技术组合优化预测执行任务调度模型包括:The method according to claim 2, wherein the CREST technology-based optimized predictive execution task scheduling model comprises:
    采用完全图表示节点的网络拓扑结构图和任务迁移图,CREST组合优化预测执行任务调度模型的有向完全图的每条边代表一个可能的迁移路径,将所述每条边的始点上运行的任务结束,迁移到所述每条边的终点上重新执行。A complete graph is used to represent the network topology structure of the nodes and a task migration diagram. Each edge of the directed complete graph of the CREST combined optimization prediction task scheduling model represents a possible migration path. The starting point of each edge is The task ends, migrates to the end point of each edge and re-executes.
  4. 根据权利要求3所述的方法,其中,所述节点间带宽和节点处理能力的细粒度资源信息包括:节点上运行的Map任务、所述Map任务所处理的数据、所述Map任务所处理的数据的大小、节点完成该类型Map任务的进度速率以及节点之间网络连接的带宽。The method according to claim 3, wherein the fine-grained resource information of the inter-node bandwidth and node processing capability comprises: a Map task running on the node, data processed by the Map task, and data processed by the Map task. The size of the data, the rate at which nodes complete this type of Map task, and the bandwidth of the network connection between nodes.
  5. 根据权利要求4所述的方法,其中,所述根据所述节点间带宽和所述节点处理能力的的细粒度资源信息获取重调度任务的预计完成时间包括:根据重调度任务的预计完成时间的表达式获取重调度任务的预计完成时间,所述重调度任务的预计完成时间t′ etf(u,v)的表达式为: The method according to claim 4, wherein the obtaining an estimated completion time of a rescheduled task according to the fine-grained resource information of the inter-node bandwidth and the processing capability of the node comprises: The expression obtains the estimated completion time of the rescheduled task, and the expression of the estimated completion time t ′ etf (u, v) of the rescheduled task is:
    Figure PCTCN2018118871-appb-100001
    Figure PCTCN2018118871-appb-100001
    式中,t c为当前时刻,t data_movement为数据传输时间,(u,v)为有向完全图的边,u,v分别为两个节点,,PR v为节点完成该类型Map任务的进度速率。 Where t c is the current time, t data_movement is the data transmission time, (u, v) is the edge of the directed complete graph, u, v are two nodes, and PR v is the progress of the node in completing this type of Map task. rate.
  6. 根据权利要求5所述的方法,其中,所述设计重执行慢任务组合方案,获取慢任务组合重执行的预计完成时间包括:The method according to claim 5, wherein the designing a re-execution slow task combination scheme and obtaining an estimated completion time of the slow task combination re-execution comprises:
    假设s为慢任务运行节点,f为空闲节点,PATH(s,f)为从s到f的一条路径,PATH(s,f)沿着级联地将所有包含边中起始节点的任务迁移到终止节点上执行,定义为沿PATH(s,f)的慢任务的组合重执行方案;Suppose s is a slow task running node, f is an idle node, PATH (s, f) is a path from s to f, and PATH (s, f) migrates all tasks that include the starting node in the edge along a cascade. Execute to the termination node, defined as a combination of slow task re-execution schemes along PATH (s, f);
    慢任务组合重执行的预计完成时间为:给定沿的慢任务组合重执行,则其预计完成时间定义为所有迁移作业的预计完成时间的最大值,其表达式为:The estimated completion time of the slow task combination re-execution is: The slow task combination re-execution of a given edge is defined as the maximum estimated completion time of all migration jobs, and its expression is:
    Figure PCTCN2018118871-appb-100002
    Figure PCTCN2018118871-appb-100002
  7. 根据权利要求6所述的方法,其中,所述设计慢任务最优组合重执行方案,根据所述重调度任务的预计完成时间及所述慢任务组合重执行预计完成时间,获取组合重执行优化的目标方程,包括:The method according to claim 6, wherein said designing an optimal combined re-execution plan for a slow task obtains a combined re-execution optimization based on the estimated completion time of said re-scheduled task and said estimated completion time of said slow task combination re-execution The objective equations include:
    最优慢任务组合执方案定义为预测完成时间最小的组合重新执行方案,用CRES表示,即慢任务最优组合重执行预计完成时间为:The optimal slow task combination execution plan is defined as the combination re-execution plan with the smallest predicted completion time, which is expressed in CRES, that is, the estimated optimal completion time of the slow task optimal combination re-execution is:
    t spec(CRES)=min(t spec(PATH(s,f)))forallpathconnects,f... t spec (CRES) = min (t spec (PATH (s, f))) forallpathconnects, f ...
    则沿最优组合重执行方案所取得的慢任务预测执行的预计完成时间为:Then the estimated completion time of the slow task prediction execution obtained along the optimal combination re-execution scheme is:
    t cres=min(t spec(PATH(s,f)))forallpathconnects,f t cres = min (t spec (PATH (s, f))) forallpathconnects, f
    综合重调度任务的预计完成时间,获取组合重执行优化的目标方程为:The estimated completion time of the comprehensive rescheduling task, and the objective equation for obtaining the combined re-execution optimization is:
    Figure PCTCN2018118871-appb-100003
    Figure PCTCN2018118871-appb-100003
  8. 根据权利要求7所述的方法,其中,所述设置权重,利用所述组合重执行优化的目标方程获取有效缩短的预测执行任务运行时间:The method according to claim 7, wherein the setting weight is used to obtain an effectively shortened prediction execution task running time by using the combined weighted execution optimization objective equation:
    将所述预测执行任务调度模型图中有向边(u,v)的权重设置为t′ etf(u,v),根据组合重执行优化的目标方程获取有效缩短的预测执行任务运行时间。 The weight of the directed edge (u, v) in the predictive execution task scheduling model graph is set to t ′ etf (u, v), and the effectively shortened execution time of the predictive execution task is obtained according to the target equation of the combined reexecution optimization.
  9. 根据权利要求8所述的方法,其中,所述权重t′ etf(u,v)大于零。 The method according to claim 8, wherein the weight t ' etf (u, v) is greater than zero.
  10. 根据权利要求5所述的方法,其中,采用副本优化选择策略加速数据传输,所述数据传输时间的表达式为:The method according to claim 5, wherein a copy optimization selection strategy is used to accelerate data transmission, and an expression of the data transmission time is:
    Figure PCTCN2018118871-appb-100004
    Figure PCTCN2018118871-appb-100004
    其中,d(T u)表示从u以外的节点传输到v;T u为节点u上运行的Map任务,d(T u)为T u所处理的数据,|d(T u)|为d(T u)的大小;bw(u,v)为节点u,v之间网络连接的带宽。 Wherein, d (T u) represents the node transmits than u to v; T u is a Map task running on the node u, d (T u) data T u processed, | d (T u) | is d (T u ); bw (u, v) is the bandwidth of the network connection between nodes u and v.
PCT/CN2018/118871 2018-05-24 2018-12-03 Combinatorial optimization scheduling method for predicting task execution time WO2019223283A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2018566566A JP2020524313A (en) 2018-05-24 2018-12-03 Scheduling method for optimizing execution time of predictive execution tasks

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810510271.X 2018-05-24
CN201810510271.XA CN108769182A (en) 2018-05-24 2018-05-24 A kind of prediction executes the Combinatorial Optimization dispatching method of task execution time

Publications (1)

Publication Number Publication Date
WO2019223283A1 true WO2019223283A1 (en) 2019-11-28

Family

ID=64005528

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/118871 WO2019223283A1 (en) 2018-05-24 2018-12-03 Combinatorial optimization scheduling method for predicting task execution time

Country Status (3)

Country Link
JP (1) JP2020524313A (en)
CN (1) CN108769182A (en)
WO (1) WO2019223283A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111475298A (en) * 2020-04-03 2020-07-31 北京字节跳动网络技术有限公司 Task processing method, device, equipment and storage medium
CN114579268A (en) * 2022-01-30 2022-06-03 阿里巴巴(中国)有限公司 Resource scheduling method and device
CN115375249A (en) * 2022-10-26 2022-11-22 埃克斯工业有限公司 Material handling scheduling method, device and equipment and readable storage medium

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108769182A (en) * 2018-05-24 2018-11-06 国网上海市电力公司 A kind of prediction executes the Combinatorial Optimization dispatching method of task execution time
CN109960573B (en) * 2018-12-29 2021-01-08 天津南大通用数据技术股份有限公司 Cross-domain computing task scheduling method and system based on intelligent perception
CN110083672A (en) * 2019-04-29 2019-08-02 广东广量测绘信息技术有限公司 Geography information big data three-dimension visible sysem
CN111199316A (en) * 2019-12-31 2020-05-26 中国电力科学研究院有限公司 Cloud and mist collaborative computing power grid scheduling method based on execution time evaluation
CN111240844A (en) * 2020-01-13 2020-06-05 星环信息科技(上海)有限公司 Resource scheduling method, equipment and storage medium
CN112698911B (en) * 2020-12-28 2022-05-17 广东石油化工学院 Cloud job scheduling method based on deep reinforcement learning

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140215471A1 (en) * 2013-01-28 2014-07-31 Hewlett-Packard Development Company, L.P. Creating a model relating to execution of a job on platforms
CN104461748A (en) * 2015-01-04 2015-03-25 电子科技大学 Optimal localized task scheduling method based on MapReduce
CN104636204A (en) * 2014-12-04 2015-05-20 中国联合网络通信集团有限公司 Task scheduling method and device
CN108769182A (en) * 2018-05-24 2018-11-06 国网上海市电力公司 A kind of prediction executes the Combinatorial Optimization dispatching method of task execution time

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140215471A1 (en) * 2013-01-28 2014-07-31 Hewlett-Packard Development Company, L.P. Creating a model relating to execution of a job on platforms
CN104636204A (en) * 2014-12-04 2015-05-20 中国联合网络通信集团有限公司 Task scheduling method and device
CN104461748A (en) * 2015-01-04 2015-03-25 电子科技大学 Optimal localized task scheduling method based on MapReduce
CN108769182A (en) * 2018-05-24 2018-11-06 国网上海市电力公司 A kind of prediction executes the Combinatorial Optimization dispatching method of task execution time

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111475298A (en) * 2020-04-03 2020-07-31 北京字节跳动网络技术有限公司 Task processing method, device, equipment and storage medium
CN111475298B (en) * 2020-04-03 2023-05-02 北京字节跳动网络技术有限公司 Task processing method, device, equipment and storage medium
CN114579268A (en) * 2022-01-30 2022-06-03 阿里巴巴(中国)有限公司 Resource scheduling method and device
CN115375249A (en) * 2022-10-26 2022-11-22 埃克斯工业有限公司 Material handling scheduling method, device and equipment and readable storage medium

Also Published As

Publication number Publication date
CN108769182A (en) 2018-11-06
JP2020524313A (en) 2020-08-13

Similar Documents

Publication Publication Date Title
WO2019223283A1 (en) Combinatorial optimization scheduling method for predicting task execution time
Zhang et al. Dynamic cloud task scheduling based on a two-stage strategy
CN105117286B (en) The dispatching method of task and streamlined perform method in MapReduce
Singh et al. Workflow task clustering for best effort systems with Pegasus
CN104636204B (en) A kind of method for scheduling task and device
CN108108233B (en) Cluster job scheduling method and system for task multi-copy execution
CN111026519A (en) Distributed task priority scheduling method and system and storage medium
CN112214301A (en) Smart city-oriented dynamic calculation migration method and device based on user preference
Yun et al. An integrated approach to workflow mapping and task scheduling for delay minimization in distributed environments
Xue et al. BOLAS: bipartite-graph oriented locality-aware scheduling for MapReduce tasks
Khan et al. Classification of task partitioning and load balancing strategies in distributed parallel computing systems
CN111782627B (en) Task and data cooperative scheduling method for wide-area high-performance computing environment
CN112114951A (en) Bottom-up distributed scheduling system and method
CN109298919B (en) Multi-core scheduling method of soft real-time system for high-utilization-rate task set
Wang et al. On mapreduce scheduling in hadoop yarn on heterogeneous clusters
Balasangameshwara et al. A fault tolerance optimal neighbor load balancing algorithm for grid environment
Weissman Metascheduling: A scheduling model for metacomputing systems
CN110764886B (en) Batch job cooperative scheduling method and system supporting multi-partition processing
Chetto et al. Real-Time Scheduling of DAG Tasks in Self-Powered Sensors with Scavenged Energy
CN109828837B (en) Relevance task scheduling method based on longest path priority
Patil et al. Review on a comparative study of various task scheduling algorithm in cloud computing environment
Kadale et al. Survey of Task Scheduling Method for Mapreduce Framework in Hadoop
Hu et al. Scheduling Method Based on Backfill Strategy for Multiple DAGs in Cloud Computing
Wen et al. Fast DRL-based scheduler configuration tuning for reducing tail latency in edge-cloud jobs
Li et al. Dash: Scheduling deep learning workloads on multi-generational gpu-accelerated clusters

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2018566566

Country of ref document: JP

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18919973

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18919973

Country of ref document: EP

Kind code of ref document: A1