CN107015856A - Task scheduling approach generation method and device under cloud environment in scientific workflow - Google Patents

Task scheduling approach generation method and device under cloud environment in scientific workflow Download PDF

Info

Publication number
CN107015856A
CN107015856A CN201710202301.6A CN201710202301A CN107015856A CN 107015856 A CN107015856 A CN 107015856A CN 201710202301 A CN201710202301 A CN 201710202301A CN 107015856 A CN107015856 A CN 107015856A
Authority
CN
China
Prior art keywords
task
node
path
scheduled
scheduling approach
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201710202301.6A
Other languages
Chinese (zh)
Inventor
王晓英
牛成水
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qinghai University
Original Assignee
Qinghai University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qinghai University filed Critical Qinghai University
Priority to CN201710202301.6A priority Critical patent/CN107015856A/en
Publication of CN107015856A publication Critical patent/CN107015856A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5038Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses the task scheduling approach generation method in scientific workflow under a kind of cloud environment and device, this method comprises the following steps, step 1, and acquisition needs scheduled task;Step 2, describe to treat scheduled task by the directed acyclic graph of Weighted Coefficients;Step 3, in the directed acyclic graph, traveled through from start node to end node, obtain all paths;Step 4, calculate the time loss in each path, according to the descending order of the time loss by all path descending sorts, determine the priority in path;Step 5, task scheduling approach is generated according to the priority orders in path;The device includes task acquisition module, task description module, node traverses module, paths ordering module and schemes generation module.The present invention can make full use of the free time of processor, improve the concurrency of scientific workflow tasks carrying, improve processor utillization, shorten the deadline of scientific workflow task.

Description

Task scheduling approach generation method and device under cloud environment in scientific workflow
Technical field
The present invention relates to scientific workflow task scheduling technique field, specifically for, the present invention is section under cloud environment Learn the task scheduling approach generation method and device in workflow.
Background technology
Scientific workflow (Scientific Workflow, SWF) is a kind of general type of new application developed in recent years, Data, service and the Software tool of integrated scientific research personnel, construction and collaboration distributed delays are supported, and there is provided a scientific algorithm The management platform of Complicated Flow and automatic running.With the Business Stream of traditional Control-oriented stream (Business Workflow, BWF) Comparatively speaking, Data Flow Oriented is one of important feature of scientific workflow.
As science and technology is growing, SWF has evolved into data and the general type of computation-intensive application, its data Producing speed and scientific algorithm becomes increasingly complicated, and conventional computing environment is difficult to meet SWF to calculate demand.Therefore, cloud computing A kind of brand-new deployment and execution are provided by research-on-research stream application of the storage resource of high performance computing resource and magnanimity Mode.
Although cloud environment can provide infinite calculating and storage capacity in theory, user can rent it as desired Computing resource.But, computing resource provide scheme change may relate to example establishment, distribution and data move, it is necessary to The cost of correlation is paid, and the execution efficiency of scientific workflow and the expense paid may be influenceed.
So, it is all very important to user and administrative staff to formulate a rational original execution plan generation.Will Workflow task be reasonably mapped in computing resource be workflow execution basis, the also execution efficiency with workflow and execution Cost is closely related, and the process is referred to as:The executive plan generation of workflow, its key is task scheduling to suitable calculating In resource, in the prior art, the scientific workflow dispatching algorithm of main flow is based on heuristic mutation operations, and it mainly includes clustering algorithms With the dispatching algorithm based on genetic algorithm etc..It is that a cluster is mapped on virtual reality machine based on clustering algorithms, but exists Between cluster the problem of task duplication;Although Gene hepatitis B vaccine can pass through selection, the tight scheduling for intersecting, making a variation in dispatching algorithm Obtain preferably convergence time, but it in the selection process randomness it is very high, it is impossible to determine next schedule job in advance, and Intersect and generally require largely to calculate time overhead and computing cost during making a variation.
Therefore, in view of prior art exist the wasting of resources, the problems such as time overhead is big, how to generate suitable task and adjust Degree scheme becomes those skilled in the art's technical problem urgently to be resolved hurrily and the emphasis studied all the time.
The content of the invention
For overcome prior art exist the wasting of resources, time overhead is big the problems such as, the invention provides section under cloud environment The task scheduling approach generation method and device in workflow are learned, resource utilization is not only increased, and shorten science work Make the run time flowed.
To realize above-mentioned technical purpose, the invention discloses the life of the task scheduling approach in scientific workflow under cloud environment Into method, the task scheduling approach generation method comprises the following steps,
Step 1, obtain and need scheduled task;
Step 2, describe to treat scheduled task by the directed acyclic graph of Weighted Coefficients;In the directed acyclic graph, circle For task node, represent being scheduled for task, the leading to while for communication, between the task that representative is scheduled of two circles of connection Letter, the direction on the side represents the order of tasks carrying;
Step 3, in the directed acyclic graph, traveled through from start node to end node, obtain all paths;
Step 4, the time loss in each path is calculated, all paths are dropped according to the descending order of the time loss Sequence sequence, the priority for determining path;
Step 5, task scheduling approach is generated according to the priority orders in path.
By the above-mentioned path priority sequencing schemes comprising task node, by task node, according to priority order is adjusted Degree, the present invention is advanced by the earliest finish time of scientific workflow, the number of its required processor is reduced, it is achieved thereby that carrying High resource utilization, the purpose for shortening scientific workflow run time.
Further, in step 5, the task scheduling approach meets following schedulable condition:Treat that scheduled task does not have It was scheduled, and treats that the corresponding task node of scheduled task has not had predecessor node or the corresponding task of predecessor node It is scheduled.
By above-mentioned schedulable condition, repetitive schedule is avoided the occurrence of in the task scheduling approach that generation can be made, from And it is advanced by the earliest finish time of scientific workflow task.
Further, after current task is chosen, step 5 also includes judging the corresponding current task node of current task Predecessor node number the step of:
If predecessor node number=0, in task scheduling approach, current task is distributed to first in cloud environment Processor;
If the predecessor node number=1 and corresponding task of the predecessor node has been scheduled, in task scheduling approach In, current task is individually distributed to the second processing machine in cloud environment, and by where the corresponding task of the predecessor node All Task Duplications on processor are to the second processing machine where current task;
If the predecessor node number >=2 and corresponding task of all forerunner's nodes has been scheduled, in task dispatching party In case, all predecessor nodes of current task node are reached current according to from start node along each path by each predecessor node The total time of task node carries out descending sort, forms father's task scheduling queue, by current task and father's task merging of head of the queue, It is described to merge into:By head of the queue father's task all Task Duplications on treaters to processor where current task.
Can determine that by way of above-mentioned determination predecessor node number needs the number of processor in task scheduling approach, because This, the present invention can determine that needs how many processor before task scheduling.Meanwhile, it can be reduced by modes such as duplication, merging The quantity of processor, improves the utilization rate of processor.
Further, in step 5, if predecessor node number >=2, the father's task for meeting insertion condition is adjusted Spend on the processor where father's task node outside the head of the queue in queue is inserted into current task;The insertion condition is:Currently The free time of processor where task is more than father's task outside father's task node the calculating time overhead, and head of the queue outside head of the queue The earliest finish time of node the earliest start time earlier than current task node.
It is negligible due to being communicated between the business on same processor, therefore appointed some fathers by above-mentioned interleaved plan The free time that business is inserted on the processor where current task and made some father's tasks to manage machine in this place performs, after execution Inform that current task is performed, notifying time greatly shortens, almost nil, so as to reduce intertask communication expense, shorten whole The execution time of individual workflow task, and reduce the quantity of processor.
Further, in step 5, if current path be not present meet default schedulable condition treat scheduled appoint Business, then select next paths according to the priority orders in path, until the scheduling scheme for needing scheduled task is whole Generation.
Further, in step 5, until after all path processing terminate, generating task scheduling approach;If the task of generation There is the scheduling scheme without practical significance in scheduling scheme, then delete the scheduling scheme of no practical significance, retain remaining task tune Degree scheme.
Further, in step 2, in the directed acyclic graph, predecessor task node correspondence task the execution time earlier than The execution time of current task node correspondence task.
Further, in step 2, in the directed acyclic graph, including between task computation time overhead mark and task Call duration time expense is marked.
Further, in step 4, the time loss in each path is on the task computation time overhead on path and path Call duration time expense sum between task.
Another goal of the invention of the present invention is the provision of the task scheduling side under a kind of cloud environment in scientific workflow Case generating means, the task scheduling approach generating means include:
Task acquisition module, scheduled task is needed for obtaining;
Task description module, describes to treat scheduled task by the directed acyclic graph of Weighted Coefficients;The directed acyclic graph In, circle is task node, represents scheduled task, two circles of connection while for communication, represent being scheduled for task Between communication, the direction on the side represents the order of tasks carrying;
Node traverses module, in the directed acyclic graph, for being traveled through from start node to end node, is owned Path;
Paths ordering module, the time loss for calculating each path, according to the order that the time loss is descending By all path descending sorts, determine the priority in path;
Schemes generation module, task scheduling approach is generated according to the priority orders in path.
By the above-mentioned path priority sequencing schemes comprising task node, by task node, according to priority order is adjusted Degree, the present invention is advanced by the earliest finish time of scientific workflow, the number of its required processor is reduced, it is achieved thereby that carrying High resource utilization, the purpose for shortening scientific workflow run time.
Beneficial effects of the present invention are:The present invention can make full use of the free time of processor, improve scientific workflow The concurrency of tasks carrying, improves processor utillization, shortens the deadline of scientific workflow task, does sth. in advance scientific workflow Earliest finish time.
Brief description of the drawings
Fig. 1 is the task scheduling approach generation method schematic flow sheet in scientific workflow under cloud environment of the present invention.
Fig. 2 is the task scheduling approach generating means structural representation in scientific workflow under cloud environment of the present invention.
Fig. 3 is directed acyclic graph composition schematic diagram of the present invention using Weighted Coefficients.
Embodiment
Detailed explanation and illustration is carried out to the generation method and device of the present invention with reference to Figure of description.
As shown in Figure 1, 2, 3, the invention discloses the generation of the task scheduling approach in scientific workflow under a kind of cloud environment Method, the present invention is innovatively by list scheduling, Task Duplication, task Intercalation, and can be applicable to that processor number do not limit is same Task scheduling strategy is performed under structure environment.For convenience of the explanation of postorder, the present invention is defined as follows.
The executive plan generation of scientific workflow is reasonably to be mapped to workflow task in computing resource, and the present invention is related to And computing resource be processor;Mapping, T={ t are represented with Fi, tiIt is the task in workflow, i=1,2,3 ..., n, n is appoints Business number };V={ vj, vjIt is the processor in cloud environment, j=1,2,3 ..., m, m is processor number };
F (T, V)=<ti,vj>, ti∈ T, vj∈V};
Specifically, task scheduling approach generation method of the invention comprises the following steps.
Step 1, obtain and need scheduled task, which of scientific workflow task need to be read in advance to be needed to be adjusted Degree.
Step 2, describe to treat scheduled task by the directed acyclic graph of Weighted Coefficients, including each task composition and task it Between correspondence.In directed acyclic graph, circle is task node, represents scheduled task, connects the side of two circles For the communication between the scheduled task of communication side, representative, the direction on side represents the order of tasks carrying, and directed acyclic graph is also wrapped The call duration time expense included between task computation time overhead mark and task is marked, specific as follows.
There is the relation mutually restricted, such as, predecessor task node pair in the communication between the task of each in scientific workflow The execution time of task should be corresponded to earlier than current task node by answering the execution time of task, specifically, as shown in figure 3, of the invention Described using the directed acyclic graph (DAG) of Weighted Coefficients, as shown in figure 3, i.e. G (T, E, M, W) come represent the dependence between task close System.Wherein, T represents the set of task node in workflow, ti∈ T represent a task in workflow;E represents task node Between communicate side set, eij∈ E represent ti、tkBetween side, tiIt is tkForerunner's node;M represents the collection of intertask communication expense Close, mik∈ M represent task tiWith task tkBetween call duration time expense, call duration time expense is marked at by the side in DAG;W tables Show the set of task computation expense, ci∈ W represent tiCalculating time overhead, task computation time overhead is marked at the circle in DAG Circle is other.On the basis of above-mentioned directed acyclic graph, and it is defined as follows.
Utilize est (ti) and eft (ti) task t is represented respectivelyiEarliest start time and earliest finish time, start knot Earliest start time est (the t of pointi)=0, task tiEft (t on earliest finish timei)=est (ti)+ci, whole workflow Earliest finish time makespan=eft (tend), wherein, tendTo end task.
For task ti, parameter pre (ti) represent task tiPredecessor node.In processor (vm) on, by a upper task The free time that end of run runs beginning to next task is defined as Free, is expressed as Free (tpre(ti),i, vm)=est (ti, vm)-eft(pre(ti, vm))
Parameter pxThe path in DAG figures is represented, span represents the deadline in path, Span (px)=Σti∈Px(ci+ mij), reach (tpre(ti),i) represent task tiPredecessor task reach task tiTime, reach (tpre(ti),i) there are two kinds of feelings Condition.The first situation:Task tiAnd its predecessor task is on same virtual machine, now reach (tpre(ti),i)=Σti∈px (cpre(ti)+ci)+mpre(ti),i;Second of situation:Task tiAnd its predecessor task is not on a virtual machine, now, reach (tpre(ti),i) and deadline Span (px) identical.
Step 3, in directed acyclic graph, traveled through from start node to end node, all possible road in traversal DAG figures Footpath, and then obtain all paths.
Step 4, the time loss Span values in each path are calculated, according to the descending order of time loss by all paths Descending sort, the priority for determining path, such as, priority is represented by way of generating heat source order list, by when Between consume maximum path as critical path, preferentially judge whether the task in critical path is scheduled;Each path when Between consumption be the task computation time overhead on path and the call duration time expense sum between task, formula is expressed as:Span (px)=Σti∈Px(ci+mij)。
Step 5, task scheduling approach is generated according to the priority orders in path, circulating path, until at all path After reason terminates, task scheduling approach is generated, then when traffic control starts, the scheme generated according to the present invention is scheduled, and is pressed Task is assigned on processor by order;Specifically, during schemes generation, if current path is not present meets default Schedulable condition treats scheduled task, then according to the list of heat source order, it is next according to the priority orders selection in path Paths, until the scheduling scheme for needing scheduled task is all generated.
To realize scientific and rational scheduling, task scheduling approach of the invention meets following schedulable condition:Wait to be scheduled Task be not scheduled, and treat that the corresponding task node of scheduled task does not have predecessor node or predecessor node corresponding Task has been scheduled.Meet the task t of the two conditionsiIt can just be scheduled, then select other in current path meet The task of condition, this is met if since next paths, continuing selection if the task of condition is not met in current path The task of part.
To determine that the task in scientific workflow is needed in the quantity of processor, this step 5, in task tiAfter being chosen, Present invention additionally comprises the step of the predecessor node number for judging the corresponding current task node of current task:
If predecessor node number=0, in task scheduling approach, current task is distributed to first in cloud environment Processor.
If the predecessor node number=1 and corresponding task of the predecessor node has been scheduled, in task scheduling approach In, current task individually distributed to the second processing machine in cloud environment, and by the processing where the corresponding task of predecessor node All Task Duplications on machine are to the second processing machine where current task.
If the predecessor node number >=2 and corresponding task of all forerunner's nodes has been scheduled, in task dispatching party In case, all predecessor nodes of current task node are reached current according to from start node along each path by each predecessor node The total time of task node carries out descending sort, forms father's task scheduling queue, by current task and father's task merging of head of the queue, The merging is interpreted as:By head of the queue father's task all Task Duplications on treaters to processor where current task, And update the deadline of current task.The present invention combines the thought of list scheduling and Task Duplication, selects rational father's task Replicated, and then expense, the quantity of reduction processor between reduction task.Moreover, based between task in same processor The negligible principle of expense, during the free time that some tasks are sent to the processor of existing task and machine are managed in this place Between perform, if predecessor node number >=2, the father's task section outside head of the queue that will be met in father's task scheduling queue of insertion condition Point is inserted on the processor where current task, updates the deadline for the father's task for meeting insertion condition;Insertion condition is: The free time of processor where current task is more than the father outside father's task node the calculating time overhead, and head of the queue outside head of the queue The earliest finish time of task node the earliest start time earlier than current task node.The insertion condition is formulated For:Free(ti, vk)≥cpre(ti)And eft (pre (ti),vk)<est(ti, vk).Until after all path processing terminate, generation Task scheduling approach;There is the scheduling scheme without practical significance in task scheduling approach if generated, delete no practical significance Scheduling scheme, retain remaining task scheduling approach.
The free time piece of suitable father's task insertion process machine is judged that all fathers appoint outside all heads of the queue by the present invention successively Business node, untill current processor free time deficiency is unsatisfactory for father's task computation time overhead or no father's task.This The mode of kind can improve the utilization rate of processor, reduce the quantity of processor.Above-mentioned scheduling scheme can be regarded as the pre- of task Distribution, can be by duty mapping to suitable processor, and can determine that the quantity of processor.
It should be noted that " the first processor ", " second processing machine " that is related in the present invention is not to number or excellent The restriction of first level, simply plays the purpose for distinguishing nonidentical processor, and according to mission requirements, " second processing machine " can be many It is individual.
The generation method of the above-mentioned scheduling scheme of correspondence, as shown in Figures 2 and 3, the invention also discloses under a kind of cloud environment Task scheduling approach generating means in scientific workflow, task scheduling approach generating means include:
Task acquisition module, needs scheduled task for obtaining, which of scientific workflow need to be read in advance Task needs scheduled.
Task description module, describes to treat scheduled task by the directed acyclic graph of Weighted Coefficients;In directed acyclic graph, circle Enclose as task node, represent scheduled task, two circles of connection while for communication, represent between being scheduled for task Communication, the direction on side represents the order of tasks carrying;Communication between the task of each in scientific workflow has what is mutually restricted Relation, such as, the execution time that predecessor task node corresponds to task should correspond to the execution time of task earlier than current task node.
Node traverses module, in directed acyclic graph, for being traveled through from start node to end node, obtains all roads Footpath.
Paths ordering module, the time loss for calculating each path, according to the descending order of time loss by institute There is path descending sort, determine the priority in path;Such as, represent preferential by way of generating heat source order list Level, using the maximum path of time loss as critical path, preferentially judges whether the task in critical path is scheduled;Each road The time loss in footpath is the task computation time overhead on path and the call duration time expense sum between task.
Schemes generation module, task scheduling approach is generated according to the priority orders in path, circulating path, until all After path processing terminates, task scheduling approach is generated, then when traffic control starts, the scheme generated according to the present invention is adjusted Task, is assigned on processor by degree in order;Specifically, during schemes generation, met if current path is not present Default schedulable condition treats scheduled task, then is selected according to the list of heat source order, according to the priority orders in path Next paths are selected, until the scheduling scheme for needing scheduled task is all generated.
With reference to concrete example to the task scheduling approach generation method in scientific workflow under cloud environment of the present invention and Device carries out detailed explanation and illustration.With reference to Figure of description 3 and following path dispatch lists, the priority in path is suitable After sequence is determined, pre-scheduling is carried out to task.
Priority Task path px Deadline (Span) Scheduled task
1 t1, t2, t7, t9 26 t1、t2、t7
2 t1, t4, t8, t9 24 t4
3 t1, t3, t8, t9 23 t3、t8
4 t1, t2, t6, t9 22 t6、t9
5 t1, t7, t9 16 ——
6 t1, t5 13 t5
For task path p1, the task t in circulating path 11, t first1Without father's task and without forerunner's section in path 1 Point, then t1Schedulable condition is met, secondly t1There is no predecessor node in DAG figures, so by t1It is separately positioned at a processor On, then v11={ t1};Continuation task t2, the t in path 12Father's task be scheduled and t2It was not scheduled, and met scheduling Condition, the t in DAG figures2Only one of which predecessor node, by all tasks on the processor where the corresponding task of predecessor node The second processing machine where current task is copied to, then v12={ t1、t2};Continuation task t7, the t in path 17It is not scheduled Cross and t7Father's task be scheduled, meet schedulable condition, the t in DAG figures7There are two predecessor node t1、t2And before the two Drive node to be scheduled, t is calculated respectively1、t2Reach t7Time, reach (t1,7)=2+3=5, wherein " 2 " are t1Calculate Time overhead, " 3 " are t1To t7Call duration time expense;reach(t2,7)=2+3+6=11, wherein " 2 " are t1The calculating time opens Pin, " 3 " are t2Calculate time overhead, t2And t1It is placed on same processor, so, call duration time expense therebetween Almost nil, " 6 " are t2To t7Call duration time expense;Then by t7With t2Merge and update the deadline, then v13={ t1、t2、t7}; Continue the next task t in path9, the t in path 19Father's task be scheduled, but the t in DAG figures9Predecessor node t6、t8It is not scheduled, so jumping out path 1.
For task path p2, continue cycling through path 2.t1It has been be scheduled that, skip t1.Continuation task t4, t4Meet scheduling bar Part, and t4The predecessor node number of corresponding node is 1 and the corresponding task of predecessor node has been scheduled, by predecessor node correspondence Task where processor on all Task Duplications to the second processing machine where current task, then v14={ t1、t4}.After Continuous task t8, task t8Although not being scheduled, the task t in DAG figures8Predecessor node t3It is not scheduled, then jumps Cross task t8;Similarly, task t is skipped9
For task path p3, continue cycling through path 3, t1It has been scheduled, has skipped t1;As the task t being recycled in path 33 When, task t3It was not scheduled and the corresponding task of predecessor node has been scheduled, and met schedulable condition, task t3Forerunner saves Point number is 1, by predecessor node t1All Task Duplications on processor where corresponding task are to where current task Second processing machine, then v15={ t1、t3};As the task t being recycled in path 38, now t8Predecessor task t3、t4Adjusted Degree, calculates t respectively3、t4Reach t8Time.reach(t3,8)=2+3+5=10, reach (t4,8)=2+4+3=9.T3、t4 Sorted { t by arrival time3、t4, by t8With t3Merge and update deadline, now, processor v16={ t1、t3、t8, eft (t3)=5, est (t8)=9, Free (t are obtained according to formula 23,8, vk)=9-5=4 >=c4=4, and eft (t4)=6<est(t8) =9 meet insertion condition, by t4Insert reason machine task queue v16={ t1、t3、t8In, then the v after inserting16={ t1、t3、t4、 t8, continue next task t9, schedulable condition is not met, then jumps out path 3, continues path 4.
For task path p4, continue cycling through path 4, t1、t2It has been be scheduled that, then skipped, task t6Only one of which forerunner appoints Be engaged in t2, then by t2Institute is on treaters from t2Task Duplication before is arrived and t6On single processor together, v17={ t1、 t2、t6}.Continue, t9The same t of method8It is identical, t6Reach t9Time be 15, t7Reach t9Time be 14, t8Reach t9When Between be 16, then by t8And t9Merge, then v18={ t1、t3、t4、t8、t9, and t6、t7Insertion condition is not met.
For task path p5, path 5 is continued cycling through, without the node for meeting schedulable condition, path p is skipped5
For task path p6, path 6 is continued cycling through, the t in path 65Father's task be scheduled and t5It is not scheduled Cross, meet schedulable condition, the t in DAG figures5Only one of which predecessor node, so by the place where the corresponding task of predecessor node All Task Duplications on reason machine are to the second processing machine where current task, then v19={ t1、t5}。
It is by scheduling result obtained above, v11={ t1, v12={ t1、t2, v13={ t1、t2、t7, v14={ t1、 t4, v15={ t1、t3, v16={ t1、t3、t4、t8, v17={ t1、t2、t6, v18={ t1、t3、t4、t8、t9, v19={ t1、 t5};There is scheduling scheme without practical significance or result in above-mentioned scheduling result, only task is repeated, such as v11={ t1, v12={ t1、t2Etc., so screening out scheduling knot of its whole task in the carrying into execution a plan of other processors Really, then the end product for actual task scheduling scheme is:v1={ t1、t3、t4、t8、t9, v2={ t1、t2、t7, v3= {t1、t2、t6, v4={ t1、t5, and makespan=eft (t9)=16.
It should be noted that for convenience of present invention explanation and describe, in DAG figures, tiIt can be regarded as the task corresponding Node.Moreover, it relates to various " times " be relative time, can be by task t1At the beginning of between be interpreted as 0.
In the description of this specification, reference term " one embodiment ", " some embodiments ", " the present embodiment ", " specific The description of example " or " some examples " etc. mean to combine the specific features that the embodiment or example describe, structure, material or Feature is contained at least one embodiment of the present invention or example.In this manual, to the schematic representation of above-mentioned term Necessarily it is directed to identical embodiment or example.Moreover, specific features, structure, material or the feature of description can be Combined in an appropriate manner in any one or more embodiments or example.In addition, in the case of not conflicting, this area Technical staff can be carried out the feature of the not be the same as Example described in this specification or example and non-be the same as Example or example With reference to and combination.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Any modification, equivalent substitution and simple modifications for being made in content etc., should be included in the scope of the protection.

Claims (10)

1. the task scheduling approach generation method under cloud environment in scientific workflow, it is characterised in that:The task scheduling approach Generation method comprises the following steps,
Step 1, obtain and need scheduled task;
Step 2, describe to treat scheduled task by the directed acyclic graph of Weighted Coefficients;In the directed acyclic graph, circle is to appoint The scheduled task of business node, representative, communication while for communication, between the task that representative is scheduled of two circles of connection, The direction on the side represents the order of tasks carrying;
Step 3, in the directed acyclic graph, traveled through from start node to end node, obtain all paths;
Step 4, the time loss in each path is calculated, all path descendings are arranged according to the descending order of the time loss Sequence, the priority for determining path;
Step 5, task scheduling approach is generated according to the priority orders in path.
2. the task scheduling approach generation method under cloud environment according to claim 1 in scientific workflow, its feature exists In:In step 5, the task scheduling approach meets following schedulable condition:Treat that scheduled task was not scheduled, and treat The corresponding task node of scheduled task does not have predecessor node or the corresponding task of predecessor node to be scheduled.
3. the task scheduling approach generation method under cloud environment according to claim 2 in scientific workflow, its feature exists In:After current task is chosen, step 5 also includes the predecessor node number for judging the corresponding current task node of current task The step of:
If predecessor node number=0, in task scheduling approach, current task is distributed to the first processing in cloud environment Machine;
, will in task scheduling approach if the predecessor node number=1 and corresponding task of the predecessor node has been scheduled Current task individually distributes to the second processing machine in cloud environment, and by the processor where the corresponding task of the predecessor node On all Task Duplications to the second processing machine where current task;
If the predecessor node number >=2 and corresponding task of all forerunner's nodes has been scheduled, in task scheduling approach, All predecessor nodes of current task node are reached into current task according to from start node along each path by each predecessor node Total time of node carries out descending sort, forms father's task scheduling queue, by current task and father's task merging of head of the queue, described Merge into:By head of the queue father's task all Task Duplications on treaters to processor where current task.
4. the task scheduling approach generation method under cloud environment according to claim 3 in scientific workflow, its feature exists In:In step 5, if predecessor node number >=2, the head of the queue in father's task scheduling queue of insertion condition will be met Outer father's task node is inserted on the processor where current task;The insertion condition is:Processing where current task The free time of machine is more than the earliest completion that father's task node outside head of the queue calculates father's task node outside time overhead, and head of the queue Earliest start time of the time earlier than current task node.
5. the task scheduling approach under the cloud environment according to any claim in claim 2 to 4 in scientific workflow Generation method, it is characterised in that:In step 5, if current path be not present meet default schedulable condition wait be scheduled Task, then next paths are selected according to the priority orders in path, until needing the scheduling scheme of scheduled task All generations.
6. according to the task scheduling approach generation method in scientific workflow under claim 1 or 4 or described cloud environment, it is special Levy and be:In step 5, until after all path processing terminate, generating task scheduling approach;If generated in task scheduling approach In the presence of the scheduling scheme without practical significance, then delete the scheduling scheme of no practical significance, retain remaining task scheduling approach.
7. the task scheduling approach generation method under cloud environment according to claim 1 in scientific workflow, its feature exists In:In step 2, in the directed acyclic graph, the execution time of predecessor task node correspondence task is earlier than current task node pair Answer the execution time of task.
8. the task scheduling approach generation method under cloud environment according to claim 1 in scientific workflow, its feature exists In:In step 2, in the directed acyclic graph, including the call duration time expense mark between task computation time overhead mark and task Note.
9. the task scheduling approach generation method under cloud environment according to claim 8 in scientific workflow, its feature exists In:In step 4, when the time loss in each path is the communication between the task on task computation time overhead and path on path Between expense sum.
10. the task scheduling approach generating means under cloud environment in scientific workflow, it is characterised in that:The task scheduling approach Generating means include:
Task acquisition module, scheduled task is needed for obtaining;
Task description module, describes to treat scheduled task by the directed acyclic graph of Weighted Coefficients;In the directed acyclic graph, circle Enclose as task node, represent scheduled task, two circles of connection while for communication, represent between being scheduled for task Communication, the direction on the side represents the order of tasks carrying;
Node traverses module, in the directed acyclic graph, for being traveled through from start node to end node, obtains all roads Footpath;
Paths ordering module, the time loss for calculating each path, according to the descending order of the time loss by institute There is path descending sort, determine the priority in path;
Schemes generation module, task scheduling approach is generated according to the priority orders in path.
CN201710202301.6A 2017-03-30 2017-03-30 Task scheduling approach generation method and device under cloud environment in scientific workflow Withdrawn CN107015856A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710202301.6A CN107015856A (en) 2017-03-30 2017-03-30 Task scheduling approach generation method and device under cloud environment in scientific workflow

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710202301.6A CN107015856A (en) 2017-03-30 2017-03-30 Task scheduling approach generation method and device under cloud environment in scientific workflow

Publications (1)

Publication Number Publication Date
CN107015856A true CN107015856A (en) 2017-08-04

Family

ID=59445562

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710202301.6A Withdrawn CN107015856A (en) 2017-03-30 2017-03-30 Task scheduling approach generation method and device under cloud environment in scientific workflow

Country Status (1)

Country Link
CN (1) CN107015856A (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107943561A (en) * 2017-12-14 2018-04-20 长春工程学院 A kind of scientific workflow method for scheduling task towards cloud computing platform
CN108628672A (en) * 2018-05-04 2018-10-09 武汉轻工大学 Method for scheduling task, system, terminal device and storage medium
CN108647084A (en) * 2018-05-08 2018-10-12 武汉轻工大学 Efficiency cloud method for scheduling task
CN108733473A (en) * 2018-05-11 2018-11-02 北京航天发射技术研究所 A kind of positioning based on VxWorks aims at the control method of integration apparatus task
CN108762899A (en) * 2018-05-16 2018-11-06 武汉轻工大学 A kind of cloud task rescheduling method and apparatus
CN108897625A (en) * 2018-07-06 2018-11-27 陈霖 Method of Scheduling Parallel based on DAG model
CN109783213A (en) * 2018-12-28 2019-05-21 杭州电子科技大学 The workflow fault-tolerant scheduling method of reliability is directed under a kind of edge calculations environment
CN109918182A (en) * 2019-01-23 2019-06-21 中国人民解放军战略支援部队信息工程大学 More GPU task dispatching methods under virtualization technology
CN110019986A (en) * 2018-09-27 2019-07-16 福建福链科技有限公司 The method for sequencing and terminal of message in a kind of directed acyclic graph
CN110084507A (en) * 2019-04-24 2019-08-02 信雅达系统工程股份有限公司 The scientific workflow method for optimizing scheduling of perception is classified under cloud computing environment
CN110147270A (en) * 2019-04-15 2019-08-20 平安普惠企业管理有限公司 Database dispatching method, device, computer equipment and storage medium
CN110489214A (en) * 2019-06-19 2019-11-22 南京邮电大学 The dynamic task allocation of data-intensive workflow in cloud environment
CN111026534A (en) * 2019-12-10 2020-04-17 浙江工业大学 Workflow execution optimization method based on multi-population genetic algorithm in cloud computing environment
CN111798063A (en) * 2020-07-09 2020-10-20 北京理工大学 Research and development resource allocation method based on operation double-layer redundancy time reduction
CN111861412A (en) * 2020-07-27 2020-10-30 上海交通大学 Completion time optimization-oriented scientific workflow scheduling method and system
CN112162727A (en) * 2020-09-16 2021-01-01 深圳晶泰科技有限公司 Cloud high-performance scientific computing workflow design control system and user graphical interface
CN112363819A (en) * 2020-12-02 2021-02-12 深圳市房多多网络科技有限公司 Big data task dynamic scheduling method and device and computing equipment
CN112379998A (en) * 2020-10-20 2021-02-19 广东工业大学 Switching time delay optimization method based on task fusion and task scheduling
CN112633753A (en) * 2020-12-30 2021-04-09 广东赛诺科技股份有限公司 Dynamic work order sharing system
CN113742089A (en) * 2021-11-04 2021-12-03 苏州浪潮智能科技有限公司 Method, device and equipment for distributing neural network computing tasks in heterogeneous resources
CN117610320A (en) * 2024-01-23 2024-02-27 中国人民解放军国防科技大学 Directed acyclic graph workflow engine cyclic scheduling method, device and equipment

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107943561B (en) * 2017-12-14 2019-06-11 长春工程学院 A kind of scientific workflow method for scheduling task towards cloud computing platform
CN107943561A (en) * 2017-12-14 2018-04-20 长春工程学院 A kind of scientific workflow method for scheduling task towards cloud computing platform
CN108628672A (en) * 2018-05-04 2018-10-09 武汉轻工大学 Method for scheduling task, system, terminal device and storage medium
CN108647084A (en) * 2018-05-08 2018-10-12 武汉轻工大学 Efficiency cloud method for scheduling task
CN108733473A (en) * 2018-05-11 2018-11-02 北京航天发射技术研究所 A kind of positioning based on VxWorks aims at the control method of integration apparatus task
CN108762899B (en) * 2018-05-16 2020-05-15 武汉轻工大学 Cloud task rescheduling method and device
CN108762899A (en) * 2018-05-16 2018-11-06 武汉轻工大学 A kind of cloud task rescheduling method and apparatus
CN108897625A (en) * 2018-07-06 2018-11-27 陈霖 Method of Scheduling Parallel based on DAG model
CN110019986A (en) * 2018-09-27 2019-07-16 福建福链科技有限公司 The method for sequencing and terminal of message in a kind of directed acyclic graph
CN110019986B (en) * 2018-09-27 2020-12-29 福建福链科技有限公司 Method and terminal for sequencing messages in directed acyclic graph
CN109783213A (en) * 2018-12-28 2019-05-21 杭州电子科技大学 The workflow fault-tolerant scheduling method of reliability is directed under a kind of edge calculations environment
CN109783213B (en) * 2018-12-28 2021-02-19 杭州电子科技大学 Workflow fault tolerance scheduling method for reliability in edge computing environment
CN109918182B (en) * 2019-01-23 2020-12-15 中国人民解放军战略支援部队信息工程大学 Multi-GPU task scheduling method under virtualization technology
CN109918182A (en) * 2019-01-23 2019-06-21 中国人民解放军战略支援部队信息工程大学 More GPU task dispatching methods under virtualization technology
CN110147270A (en) * 2019-04-15 2019-08-20 平安普惠企业管理有限公司 Database dispatching method, device, computer equipment and storage medium
CN110084507A (en) * 2019-04-24 2019-08-02 信雅达系统工程股份有限公司 The scientific workflow method for optimizing scheduling of perception is classified under cloud computing environment
CN110489214A (en) * 2019-06-19 2019-11-22 南京邮电大学 The dynamic task allocation of data-intensive workflow in cloud environment
CN110489214B (en) * 2019-06-19 2022-09-20 南京邮电大学 Dynamic task allocation for data intensive workflows in a cloud environment
CN111026534A (en) * 2019-12-10 2020-04-17 浙江工业大学 Workflow execution optimization method based on multi-population genetic algorithm in cloud computing environment
CN111026534B (en) * 2019-12-10 2023-12-22 浙江工业大学 Workflow execution optimization method based on multiple group genetic algorithms in cloud computing environment
CN111798063A (en) * 2020-07-09 2020-10-20 北京理工大学 Research and development resource allocation method based on operation double-layer redundancy time reduction
CN111861412A (en) * 2020-07-27 2020-10-30 上海交通大学 Completion time optimization-oriented scientific workflow scheduling method and system
CN111861412B (en) * 2020-07-27 2024-03-15 上海交通大学 Completion time optimization-oriented scientific workflow scheduling method and system
CN112162727A (en) * 2020-09-16 2021-01-01 深圳晶泰科技有限公司 Cloud high-performance scientific computing workflow design control system and user graphical interface
CN112379998B (en) * 2020-10-20 2022-11-15 广东工业大学 Switching time delay optimization method based on task fusion and task scheduling
CN112379998A (en) * 2020-10-20 2021-02-19 广东工业大学 Switching time delay optimization method based on task fusion and task scheduling
CN112363819A (en) * 2020-12-02 2021-02-12 深圳市房多多网络科技有限公司 Big data task dynamic scheduling method and device and computing equipment
CN112633753A (en) * 2020-12-30 2021-04-09 广东赛诺科技股份有限公司 Dynamic work order sharing system
CN113742089A (en) * 2021-11-04 2021-12-03 苏州浪潮智能科技有限公司 Method, device and equipment for distributing neural network computing tasks in heterogeneous resources
CN117610320A (en) * 2024-01-23 2024-02-27 中国人民解放军国防科技大学 Directed acyclic graph workflow engine cyclic scheduling method, device and equipment
CN117610320B (en) * 2024-01-23 2024-04-02 中国人民解放军国防科技大学 Directed acyclic graph workflow engine cyclic scheduling method, device and equipment

Similar Documents

Publication Publication Date Title
CN107015856A (en) Task scheduling approach generation method and device under cloud environment in scientific workflow
Nie et al. A GEP-based reactive scheduling policies constructing approach for dynamic flexible job shop scheduling problem with job release dates
Gamache et al. A method for optimally solving the rostering problem
El-Rewini et al. Scheduling parallel program tasks onto arbitrary target machines
Roshani et al. A simulated annealing algorithm for multi-manned assembly line balancing problem
Fan et al. An improved genetic algorithm for flexible job shop scheduling problem considering reconfigurable machine tools with limited auxiliary modules
CN101237469B (en) Method for optimizing multi-QoS grid workflow based on ant group algorithm
CN109582448A (en) A kind of edge calculations method for scheduling task towards criticality and timeliness
Lequy et al. Assigning multiple activities to work shifts
CN103279818A (en) Method for cloud workflow scheduling based on heuristic genetic algorithm
CN106934539B (en) Workflow scheduling method with deadline and expense constraints
Lu et al. A resource investment problem based on project splitting with time windows for aircraft moving assembly line
CN107943561B (en) A kind of scientific workflow method for scheduling task towards cloud computing platform
Colak et al. Multi-mode resource-constrained project-scheduling problem with renewable resources: new solution approaches
Li et al. Single machine scheduling with two-agent for total weighted completion time objectives
WO2022198754A1 (en) Method for optimizing large-scale cloud service process
CN110084507A (en) The scientific workflow method for optimizing scheduling of perception is classified under cloud computing environment
Đurasević et al. Heuristic and metaheuristic methods for the unrelated machines scheduling problem: A survey
Gong et al. Energy-efficient flexible job shop scheduling problem considering discrete operation sequence flexibility
Furmans Models of heijunka-levelled kanban-systems
CN109783189A (en) A kind of quiescent operation stream scheduling method and device
CN109635328A (en) Integrated circuit layout method and distributed design approach
CN109034551A (en) Equilibrium of stock method for construction plan establishment
Li Improved NSGA-II for the job-shop multi-objective scheduling problem
Moin Hybrid genetic algorithms for vehicle routing problems with time windows

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20170804