CN106528275A - Processing method of data tasks and task scheduler - Google Patents
Processing method of data tasks and task scheduler
- Publication number
- CN106528275A CN201510573492.8A
- Authority
- CN
- China
- Prior art keywords
- task
- transferred
- processor
- submitted
- storage area
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a processing method of data tasks. The method comprises: acquiring a task to be executed from a task table, wherein multiple tasks and a dependency item of each task are recorded in the task table, and the tasks recorded in the task table can be acquired by multiple task schedulers; obtaining dependency information of the task according to the dependency item of the acquired task; and, when it is determined according to the dependency information that the current state meets the execution condition, submitting the acquired task to a task processor for data processing. Accordingly, the invention further discloses a task scheduler. With the embodiments, data tasks can be processed in parallel and working efficiency can be improved.
Description
Technical field
The present invention relates to the field of computer technology, and in particular to a processing method of data tasks and a task scheduler.
Background art
At present, Internet enterprises attach increasing importance to collecting big data, such as online sales data and user behavior data. Faced with ever-growing data volumes, most enterprises choose the relatively mature Hadoop+Hive combination as their distributed data processing framework.
With the Hadoop+Hive distributed data processing framework, an enterprise can readily process data on the order of hundreds of millions of records. A data processing workflow usually comprises several steps, each consisting of a variable number of data processing tasks; these tasks may depend on one another or be independent. Tasks that have dependencies must be executed strictly in order, otherwise the data results will be wrong. How to manage the daily data processing workflow so that data processing tasks run efficiently and in order is therefore a major difficulty and focus of big data applications.
Two solutions to the above problem currently exist: automated serial execution, and Oozie.
In the automated serial execution scheme, a data engineer first creates a script file and writes into it, in a fixed order, the data processing tasks of each step of the daily data processing workflow. The system then executes the data processing tasks one by one according to the script file, completing the daily data processing flow. However, data processing tasks may be independent of one another; for example, the calculations of index A and index B are unrelated, so the corresponding tasks could be executed in parallel. When there are very many data processing tasks, executing them serially greatly reduces efficiency. Moreover, if one data processing task fails, the nature of script execution means that the whole flow skips the error and continues; since dependencies may exist between data processing tasks, the data processing results may then be wrong.
The Oozie scheme introduces a workflow definition document, which defines the dependencies between tasks, the flow of tasks, and so on. From this document Oozie can determine which tasks can currently be executed safely at the same time, which improves reliability compared with the automated serial execution scheme. However, Oozie documents are overly complex. In actual production environments, workflows with hundreds or thousands of data processing tasks are extremely common; using Oozie in such cases brings high document maintenance costs and low working efficiency.
Summary of the invention
The embodiments of the present invention propose a processing method of data tasks and a task scheduler, which can process data tasks in parallel and improve working efficiency.
An embodiment of the present invention provides a processing method of data tasks, including:
retrieving a task to be executed from a task table, wherein the task table records multiple tasks and a dependency item of each task, and the tasks in the task table can be retrieved by multiple task schedulers;
obtaining the dependency information of the task according to the dependency item of the retrieved task; and
when it is determined according to the dependency information that the current state meets the execution condition, submitting the retrieved task to a task processor for data processing.
Further, retrieving a task to be executed from the task table specifically includes:
selecting a task from the task table in task queue order;
querying, according to the identifier of the task, the retrieval information and execution information of the task from a data storage area, wherein the data storage area records the retrieval information and execution information of each task in the task table, the retrieval information records whether the task has been retrieved by a task scheduler, and the execution information records whether the task has been submitted to the task processor and is being executed, and whether it was executed successfully;
when the selected task has not been retrieved, or has been retrieved but not submitted to the task processor for execution, or has been submitted to the task processor for execution but failed, retrieving the task and recording in the data storage area that the task has been retrieved; and
when the selected task has been retrieved and submitted to the task processor and is being executed, or has been retrieved and executed successfully, continuing to select the next task from the task table in task queue order.
Still further, the dependency item contains the dependency information, in which case obtaining the dependency information of the task according to the dependency item of the retrieved task is specifically:
obtaining the dependency information of the task directly from the dependency item of the retrieved task;
or, the dependency item contains an automatic dependency judgment flag, in which case obtaining the dependency information of the task according to the dependency item of the retrieved task specifically includes:
starting automatic dependency judgment according to the automatic dependency judgment flag in the dependency item of the retrieved task; and
scanning the specific execution content of the task and obtaining the dependency information of the task from it.
In one embodiment, the dependency information is the start time on which the task depends;
in which case, submitting the retrieved task to the task processor for data processing when it is determined according to the dependency information that the current state meets the execution condition specifically includes:
judging whether the start time on which the task depends is earlier than the current time;
if so, determining that the current state meets the execution condition, submitting the retrieved task to the task processor for data processing, and recording in the data storage area that the retrieved task has been submitted to the task processor and is being executed;
if not, determining that the current state does not meet the execution condition, and recording in the data storage area that the retrieved task has not been submitted to the task processor for execution.
In another embodiment, the dependency information is the identifier of the task on which the task depends;
in which case, submitting the retrieved task to the task processor for data processing when it is determined according to the dependency information that the current state meets the execution condition specifically includes:
querying from the data storage area, according to the identifier of the task on which the retrieved task depends, whether the depended-on task has been executed successfully by the task processor;
if so, determining that the current state meets the execution condition, submitting the retrieved task to the task processor for data processing, and recording in the data storage area that the retrieved task has been submitted to the task processor and is being executed;
if not, determining that the current state does not meet the execution condition, and recording in the data storage area that the retrieved task has not been submitted to the task processor for execution.
In yet another embodiment, the dependency information is both the start time on which the retrieved task depends and the identifier of the task on which it depends;
in which case, submitting the retrieved task to the task processor for data processing when it is determined according to the dependency information that the current state meets the execution condition specifically includes:
judging whether the start time on which the task depends is earlier than the current time;
when the start time is earlier than the current time, querying from the data storage area, according to the identifier of the task on which the task depends, whether the depended-on task has been executed successfully by the task processor;
when the depended-on task has been executed successfully, determining that the current state meets the execution condition, submitting the retrieved task to the task processor for data processing, and recording in the data storage area that the retrieved task has been submitted to the task processor and is being executed;
when the start time is later than the current time, or the start time is earlier than the current time but the depended-on task has not been executed successfully, determining that the current state does not meet the execution condition, and recording in the data storage area that the retrieved task has not been submitted to the task processor for execution.
Further, after the retrieved task is submitted to the task processor for data processing, the method also includes:
receiving the task execution result returned by the task processor;
storing the task execution result in the data storage area; and
continuing to retrieve the next task to be executed from the task table, until all tasks in the task table have been executed successfully.
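The receive, store, and continue steps above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions, not the patented implementation: the function name run_until_all_succeed, the status strings, the in-memory dictionary standing in for the data storage area, and the max_rounds bound (added so that the sketch always terminates) are all hypothetical.

```python
from typing import Callable, Dict, List

def run_until_all_succeed(
    table: List[str],
    execute: Callable[[str], bool],
    max_rounds: int = 10,
) -> Dict[str, str]:
    """Submit tasks, store each result, and repeat until every task has succeeded.

    `execute` stands in for the task processor and returns True on success.
    """
    results: Dict[str, str] = {}  # stands in for the data storage area
    for _ in range(max_rounds):   # bound is an assumption, not part of the source
        pending = [t for t in table if results.get(t) != "success"]
        if not pending:
            break
        for task_id in pending:
            # Store the execution result returned by the (simulated) task processor.
            results[task_id] = "success" if execute(task_id) else "failed"
    return results
```

In this sketch a failed task is simply picked up again in the next round, which mirrors the rule that a task whose execution failed may be retrieved again.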
Further, after the task execution result returned by the task processor is received, the method also includes:
judging, according to the task execution result, whether the task was executed successfully;
if so, recording in the data storage area that the task was executed successfully;
if not, recording in the data storage area that execution of the task failed.
Further, after it is recorded in the data storage area that execution of the task failed, the method also includes:
generating an alarm mail according to the task execution result, and sending the alarm mail to a preset receiving port;
or, incrementing the failure count of the task by one and, when the incremented count is greater than an alarm threshold, generating an alarm mail according to the task execution result and sending the alarm mail to a preset receiving port.
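The two alarm variants above can be sketched in Python as follows. This is a minimal sketch under assumptions: the function name on_failure, the address oncall@example.com, and the subject layout are hypothetical, and actually sending the mail is deliberately left out. A threshold of 0 corresponds to the first variant, in which every failure raises an alarm.

```python
from collections import Counter
from email.message import EmailMessage
from typing import Optional

failure_counts: Counter = Counter()  # per-task failure count (hypothetical store)

def on_failure(
    task_id: str,
    result: str,
    threshold: int = 0,
    to_addr: str = "oncall@example.com",  # placeholder recipient, an assumption
) -> Optional[EmailMessage]:
    """Increment the failure count; build an alarm mail once it exceeds the threshold."""
    failure_counts[task_id] += 1
    if failure_counts[task_id] <= threshold:
        return None  # count is not yet greater than the alarm threshold
    msg = EmailMessage()
    msg["To"] = to_addr
    msg["Subject"] = f"Task {task_id} failed ({failure_counts[task_id]} times)"
    msg.set_content(result)  # the alarm body is generated from the execution result
    return msg
```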
Correspondingly, an embodiment of the present invention also provides a task scheduler, including:
a task retrieval module, configured to retrieve a task to be executed from a task table, wherein the task table records multiple tasks and a dependency item of each task, and the tasks in the task table can be retrieved by multiple task schedulers;
a dependency acquisition module, configured to obtain the dependency information of the task according to the dependency item of the retrieved task; and
a task processing module, configured to submit the retrieved task to a task processor for data processing when it is determined according to the dependency information that the current state meets the execution condition.
Further, the task retrieval module specifically includes:
a task selection unit, configured to select a task from the task table in task queue order;
a first query unit, configured to query, according to the identifier of the task, the retrieval information and execution information of the task from a data storage area, wherein the data storage area records the retrieval information and execution information of each task in the task table, the retrieval information records whether the task has been retrieved by a task scheduler, and the execution information records whether the task has been submitted to the task processor and is being executed, and whether it was executed successfully; and
a task retrieval unit, configured to retrieve the task and record in the data storage area that the task has been retrieved when the selected task has not been retrieved, or has been retrieved but not submitted to the task processor for execution, or has been submitted to the task processor for execution but failed;
the task selection unit being further configured to continue selecting the next task from the task table in task queue order when the selected task has been retrieved and submitted to the task processor and is being executed, or has been retrieved and executed successfully.
Further, the dependency item contains the dependency information, in which case the dependency acquisition module is specifically configured to obtain the dependency information of the task directly from the dependency item of the retrieved task;
or, the dependency item contains an automatic dependency judgment flag, in which case the dependency acquisition module specifically includes:
an automatic judgment starting unit, configured to start automatic dependency judgment according to the automatic dependency judgment flag in the dependency item of the retrieved task; and
a scanning acquisition unit, configured to scan the specific execution content of the task and obtain the dependency information of the task from it.
In one embodiment, the dependency information is the start time on which the task depends, in which case the task processing module specifically includes:
a first judging unit, configured to judge whether the start time on which the task depends is earlier than the current time;
a first task submission unit, configured to, when the start time on which the task depends is earlier than the current time, determine that the current state meets the execution condition, submit the retrieved task to the task processor for data processing, and record in the data storage area that the retrieved task has been submitted to the task processor and is being executed; and
a first information recording unit, configured to, when the start time on which the task depends is later than the current time, determine that the current state does not meet the execution condition, and record in the data storage area that the retrieved task has not been submitted to the task processor for execution.
In another embodiment, the dependency information is the identifier of the task on which the task depends, in which case the task processing module specifically includes:
a second query unit, configured to query from the data storage area, according to the identifier of the task on which the retrieved task depends, whether the depended-on task has been executed successfully by the task processor;
a second task submission unit, configured to, when the depended-on task has been executed successfully by the task processor, determine that the current state meets the execution condition, submit the retrieved task to the task processor for data processing, and record in the data storage area that the retrieved task has been submitted to the task processor and is being executed; and
a second information recording unit, configured to, when the depended-on task has not been executed successfully by the task processor, determine that the current state does not meet the execution condition, and record in the data storage area that the retrieved task has not been submitted to the task processor for execution.
In yet another embodiment, the dependency information is both the start time on which the retrieved task depends and the identifier of the task on which it depends, in which case the task processing module specifically includes:
a second judging unit, configured to judge whether the start time on which the task depends is earlier than the current time;
a third query unit, configured to, when the start time is earlier than the current time, query from the data storage area, according to the identifier of the task on which the task depends, whether the depended-on task has been executed successfully by the task processor;
a third task submission unit, configured to, when the depended-on task has been executed successfully, determine that the current state meets the execution condition, submit the retrieved task to the task processor for data processing, and record in the data storage area that the retrieved task has been submitted to the task processor and is being executed; and
a third information recording unit, configured to, when the start time is later than the current time, or the start time is earlier than the current time but the depended-on task has not been executed successfully, determine that the current state does not meet the execution condition, and record in the data storage area that the retrieved task has not been submitted to the task processor for execution.
Further, the task scheduler also includes:
an information receiving module, configured to receive the task execution result returned by the task processor; and
an information storage module, configured to store the task execution result in the data storage area;
the task retrieval module being further configured to, after the task execution result is stored in the data storage area, continue to retrieve the next task to be executed from the task table, until all tasks in the task table have been executed successfully.
Further, the task scheduler also includes:
a second judgment module, configured to judge, after the task execution result returned by the task processor is received, whether the task was executed successfully according to the task execution result;
a fourth information recording module, configured to record in the data storage area that the task was executed successfully when execution of the task succeeded; and
a fifth information recording module, configured to record in the data storage area that execution of the task failed when execution of the task failed.
Further, the task scheduler also includes:
an alarm module, configured to, after it is recorded in the data storage area that execution of the task failed, generate an alarm mail according to the task execution result and send the alarm mail to a preset receiving port; or, after it is recorded in the data storage area that execution of the task failed, increment the failure count of the task by one and, when the incremented count is greater than an alarm threshold, generate an alarm mail according to the task execution result and send the alarm mail to a preset receiving port.
Implementing the embodiments of the present invention has the following beneficial effects:
The processing method of data tasks and the task scheduler provided by the embodiments of the present invention allow multiple task schedulers to retrieve tasks to be executed from the task table concurrently. The task table records multiple tasks and a dependency item of each task; adding the dependency item is simple, which improves working efficiency. The task scheduler can parse the dependencies of the retrieved task from its dependency item and, when it is determined according to the dependency information that the current state meets the execution condition, submit the retrieved task to the task processor for data processing. Tasks are thus executed in order, data processing is kept free of errors, and the efficiency with which multiple task schedulers process tasks in parallel is improved.
Description of the drawings
Fig. 1 is a schematic flowchart of one embodiment of the processing method of data tasks provided by the present invention;
Fig. 2 is a schematic flowchart of one embodiment of step S1 of the processing method of data tasks shown in Fig. 1;
Fig. 3 is a schematic structural diagram of one embodiment of the task scheduler provided by the present invention;
Fig. 4 is a schematic structural diagram of one embodiment of the task retrieval module of the task scheduler provided by the present invention;
Fig. 5 is a schematic structural diagram of one embodiment of the dependency acquisition module of the task scheduler provided by the present invention;
Fig. 6 is a schematic structural diagram of a first embodiment of the task processing module of the task scheduler provided by the present invention;
Fig. 7 is a schematic structural diagram of a second embodiment of the task processing module of the task scheduler provided by the present invention;
Fig. 8 is a schematic structural diagram of a third embodiment of the task processing module of the task scheduler provided by the present invention;
Fig. 9 is a structural diagram of one embodiment of a system for implementing data task processing provided by the present invention;
Fig. 10 is a schematic flowchart of another embodiment of the processing method of data tasks provided by the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
An embodiment of the present invention provides a processing method of data tasks that allows multiple task schedulers to retrieve tasks to be executed from the task table concurrently, parses the dependencies of each task from its dependency item in the task table, and executes the tasks in order according to the dependency information. This keeps data processing free of errors and improves the efficiency with which multiple task schedulers process tasks in parallel.
For ease of description, the following takes only one task scheduler as an example and describes in detail how it performs task scheduling. The other task schedulers perform task scheduling in the same way, which is not repeated here.
Fig. 1 is a schematic flowchart of one embodiment of the processing method of data tasks provided by the present invention. The method can be performed by a task scheduler and includes steps S1 to S3, as follows:
S1, retrieve a task to be executed from the task table, wherein the task table records multiple tasks and a dependency item of each task, and the tasks in the task table can be retrieved by multiple task schedulers.
In this embodiment, the engineer can flexibly set the number of task schedulers according to the number of tasks in the task table, so that multiple task schedulers can process the tasks in the task table concurrently. The task schedulers can be started on a timer and, once started, automatically retrieve the tasks in the task table for processing. The engineer can configure the number of task schedulers and the timed start time as appropriate.
The task table is typically an executable automated serial script file that records multiple task command lines and the dependency item of each task command line. To configure the dependencies of every task in the task table, the engineer only needs to append the dependency item, which contains the dependency information of the task, directly after each task command line. This is simple and improves working efficiency.
S2, obtain the dependency information of the task according to the dependency item of the retrieved task.
In this embodiment, the task scheduler uses regular-expression matching to match the dependency item of the task and obtain the dependency information from it. For example, with the regular expression "s*##@{ }", the specific implementation is as follows: the retrieved task is read in; when the dependency mark "s*##@" is read, the content of the "{ }" braces that follows the mark is the dependency information of the task, so the task scheduler can obtain the dependency information of the task from "{ }".
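The matching step in S2 can be sketched in Python as follows. This is a hedged sketch: it assumes that the leading "s*" of the pattern stands for optional whitespace (written \s* in Python regular-expression syntax) and that the dependency item sits at the end of the task command line. The function name split_task_line and the sample command are hypothetical.

```python
import re
from typing import Optional, Tuple

# Assumed reading of the dependency mark: optional whitespace, "##@", then a brace group.
DEPENDENCY_ITEM = re.compile(r"\s*##@\{(?P<info>.*)\}\s*$")

def split_task_line(line: str) -> Tuple[str, Optional[str]]:
    """Return (command, dependency_info_text); the second value is None if no mark is found."""
    match = DEPENDENCY_ITEM.search(line)
    if match is None:
        return line.strip(), None
    # Everything before the mark is the task command line itself.
    return line[: match.start()].strip(), match.group("info").strip()
```

A line without the mark is returned unchanged with None as its dependency information, so tasks that have no dependency item need no special handling by the caller.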
S3, when it is determined according to the dependency information that the current state meets the execution condition, submit the retrieved task to the task processor for data processing.
In this embodiment, the dependency information records the dependencies of the task. A dependency can be a dependency on time, that is, a specified start time for the task: only when the start time of the task is earlier than the current time can the task be submitted to the task processor for execution. A dependency can also be a dependency on a task, that is, execution of this task relies on another task: only when the depended-on task has been executed successfully can this task be submitted to the task processor for execution.
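The execution-condition check in S3 can be sketched in Python as follows. This is a minimal sketch under assumptions: the function name meets_execution_condition and its parameters are hypothetical, and the set succeeded stands in for a query against the data storage area. Either dependency may be absent, in which case it is treated as satisfied.

```python
from datetime import datetime
from typing import Optional, Set

def meets_execution_condition(
    now: datetime,
    succeeded: Set[str],
    start_time: Optional[datetime] = None,
    relied_task: Optional[str] = None,
) -> bool:
    """Return True when every dependency that is present is satisfied."""
    # Time dependency: the start time must be earlier than the current time.
    if start_time is not None and not (start_time < now):
        return False
    # Task dependency: the depended-on task must already have succeeded.
    if relied_task is not None and relied_task not in succeeded:
        return False
    return True
```

Checking the time dependency first mirrors the combined embodiment, in which the data storage area is queried only once the start time has passed.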
Further, as shown in Fig. 2, in step S1 above, retrieving a task to be executed from the task table specifically includes steps S101 to S104, as follows:
S101, select a task from the task table in task queue order.
Specifically, in one embodiment, the task queue order is the order in which tasks enter the queue one after another; that is, the task scheduler selects tasks one by one in order, starting from the head of the task queue. In another embodiment, each task in the task table also has a priority, and the task queue order is then from the highest priority to the lowest, with tasks of the same priority ordered by the order in which they entered the queue; that is, the task scheduler selects tasks one by one according to the priority of each task in the task table. In a specific implementation, the engineer can arrange the order of the tasks to be executed in the task table as appropriate.
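The two queue orders in S101 can be sketched in Python as follows. The Task type, the function name queue_order, and the convention that a larger number means a higher priority are assumptions made for illustration. Python's sorted is stable, so tasks with equal priority keep their original queue order.

```python
from typing import List, NamedTuple

class Task(NamedTuple):
    task_id: str
    priority: int = 0  # assumption: a larger value means a higher priority

def queue_order(tasks: List[Task], use_priority: bool = False) -> List[Task]:
    """Return tasks in selection order: plain queue order, or priority first."""
    if not use_priority:
        return list(tasks)  # first in, first selected
    # Stable sort: ties keep the order in which the tasks entered the queue.
    return sorted(tasks, key=lambda t: -t.priority)
```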
S102, query, according to the identifier of the task, the retrieval information and execution information of the task from the data storage area, wherein the data storage area records the retrieval information and execution information of each task in the task table, the retrieval information records whether the task has been retrieved by a task scheduler, and the execution information records whether the task has been submitted to the task processor and is being executed, and whether it was executed successfully.
In this embodiment, the engineer configures an identifier for each task; the identifier can be the specific command line of the task or a short name for the task. The retrieval information and execution information of each task recorded in the data storage area are stored in association with the identifier of the corresponding task. The task scheduler only needs to look up the identifier of the task in the data storage area to find the retrieval information and execution information of the task and thus learn its state. Each task may be in one of the following five states:
First, the task has not been retrieved by a task scheduler and has not been submitted to the task processor for execution;
Second, the task has been retrieved by a task scheduler but, for example because the execution condition was not met, has not been submitted to the task processor for execution;
Third, the task has been retrieved by a task scheduler, has been submitted to the task processor, and is being executed;
Fourth, the task has been retrieved by a task scheduler, has been submitted to the task processor for execution, and was executed successfully;
Fifth, the task has been retrieved by a task scheduler and has been submitted to the task processor for execution, but execution failed.
Correspondingly, in the retrieval information and execution information of the task recorded in the data storage area, specific retrieval flags and execution flags can be used to distinguish the five states above.
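One possible way to derive the five states from a retrieval flag and an execution flag is sketched below in Python. The enum names, the function task_state, and the flag values ("running", "success", "failed", or None when the task has not been submitted) are hypothetical; the source does not specify how the flags are encoded.

```python
from enum import Enum
from typing import Optional

class TaskState(Enum):
    NOT_RETRIEVED = 1
    RETRIEVED_NOT_SUBMITTED = 2
    RUNNING = 3
    SUCCEEDED = 4
    FAILED = 5

def task_state(retrieved: bool, execution: Optional[str]) -> TaskState:
    """Map the stored retrieval flag and execution flag to one of the five states."""
    if not retrieved:
        return TaskState.NOT_RETRIEVED
    if execution is None:  # retrieved, but never submitted to the task processor
        return TaskState.RETRIEVED_NOT_SUBMITTED
    return {
        "running": TaskState.RUNNING,
        "success": TaskState.SUCCEEDED,
        "failed": TaskState.FAILED,
    }[execution]
```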
In this embodiment, because the retrieval information and execution information of each task recorded in the data storage area are stored in association with the identifier of the corresponding task, when processing of the task table has been interrupted, or when the task scheduler is started again, the tasks that have already been executed successfully can be skipped according to the identifier of each task in the task table and the state information on whether it was executed successfully. The task scheduler only needs to start from the tasks in the task table that have not been executed successfully and continue selecting tasks in task queue order, without having to start selecting from the 1st task in the task table.
In addition, in a specific implementation the task table may be adjusted regularly, so that many different task tables are produced, and processing of the task table retrieved by the task scheduler may also be interrupted. So that the task scheduler can accurately retrieve the interrupted task table after restarting from the breakpoint, and to improve processing efficiency, the embodiment of the present invention can further introduce a "time identifier" to record the task table retrieved by the task scheduler.
Preferably, this embodiment can use the date on which the task scheduler starts as the time identifier, or obtain the time identifier by applying a specific algorithm to that date; alternatively, the engineer enters a date, and the time identifier is obtained by applying another specific algorithm to that date. For example, on the first day, multiple task schedulers start and prepare to process the tasks in a first task table. A first time identifier is generated from that day's date, associated with the identifiers of all tasks in the first task table, and recorded in the data storage area. On the second day, the multiple task schedulers start and prepare to process the tasks in a second task table. A second time identifier is generated from that day's date, associated with the identifiers of all tasks in the second task table, and recorded in the data storage area. Suppose that on the second day an interruption occurs after the first 100 tasks in the second task table have been executed successfully. When the task scheduler is started again, the second time identifier is generated from the date of the interruption as entered by the engineer, and the execution states of all tasks associated with the second time identifier are queried from the data storage area, which shows that the first 100 tasks in the second task table have been executed successfully. The task scheduler can then skip the first 100 tasks and only needs to start from the 101st task in the second task table, continuing to select tasks in task queue order, without having to start selecting from the 1st task in the second task table. The embodiment of the present invention can therefore not only retrieve the interrupted task table accurately after restarting from the breakpoint, but also improve processing efficiency.
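The breakpoint restart described above can be sketched in Python as follows. This is a sketch under assumptions: the time identifier is taken to be the date formatted as YYYYMMDD (the source leaves the algorithm open), the data storage area is modeled as a dictionary keyed by (time identifier, task identifier), and the names time_id and remaining_tasks are hypothetical.

```python
from datetime import date
from typing import Dict, List, Tuple

# (time identifier, task identifier) -> execution status; stands in for the data storage area
Store = Dict[Tuple[str, str], str]

def time_id(day: date) -> str:
    """Derive a time identifier from a date (assumed format: YYYYMMDD)."""
    return day.strftime("%Y%m%d")

def remaining_tasks(table: List[str], day: date, store: Store) -> List[str]:
    """Return, in queue order, the tasks not yet executed successfully for that day's table."""
    tid = time_id(day)
    return [task for task in table if store.get((tid, task)) != "success"]
```

Because the status is keyed by the time identifier as well as the task identifier, a task with the same identifier in another day's task table does not affect the result.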
S103: when the selected task has not been retrieved, or has been retrieved but not submitted to
the task processor for execution, or has been submitted to the task processor but failed to
execute, retrieve the task and record in the data storage area that the task has been retrieved.
S104: when the selected task has been retrieved and submitted to the task processor and is
currently executing, or has been retrieved and executed successfully, continue by selecting the
next task from the task table in task-queue order.
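The selection rule of steps S103 and S104 can be illustrated with a short sketch. The record layout (TaskRecord with the fields retrieved, submitted and state) and the function names are assumptions made for this example; the text only says that the data storage area holds retrieval information and execution information for each task.

```python
from dataclasses import dataclass


@dataclass
class TaskRecord:
    retrieved: bool = False   # retrieval information
    submitted: bool = False   # execution information: submitted to the task processor
    state: str = ""           # "", "running", "success" or "failed"


def should_retrieve(rec: TaskRecord) -> bool:
    # S103: not retrieved, or retrieved but not submitted, or submitted but failed.
    if not rec.retrieved:
        return True
    if not rec.submitted:
        return True
    return rec.state == "failed"


def select_task(task_ids, store):
    """Walk the task table in queue order and retrieve the first eligible task."""
    for task_id in task_ids:
        rec = store.setdefault(task_id, TaskRecord())
        if should_retrieve(rec):
            rec.retrieved = True   # S103: record that the task has been retrieved
            return task_id
        # S104: the task is running or already succeeded, so move to the next one
    return None


store = {
    "a": TaskRecord(True, True, "success"),
    "b": TaskRecord(True, True, "running"),
    "c": TaskRecord(True, True, "failed"),
}
picked = select_task(["a", "b", "c", "d"], store)
```

Here the scheduler passes over the successful task a and the running task b and retrieves the failed task c.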
In the present invention, the dependency information can be obtained in two ways, as follows:
In one embodiment, the dependency item itself contains the dependency information. Obtaining the
dependency information of the task according to the dependency item of the retrieved task then
specifically means: obtaining the dependency information of the task directly from the dependency
item of the retrieved task.
Specifically, the dependency item of the task is "s*##@{ }", where the braces "{ }" enclose the
dependency information. For example, when the dependency information is the start time on which
the task depends, {'time': ${dt}2015-8-16 08:00:00}, time is the key that identifies the start
time, and the time 2015-8-16 08:00:00 following ${dt} is the specific start time on which the
task depends. When the dependency information is the identifier of a task on which the task
depends, {'rely': 'taskA'}, rely is the key indicating that the task depends on the successful
execution of another task, and taskA is the specific identifier of the task depended on. When the
dependency information is a short name of the task, {'name': 'taskB'}, name is the key for the
short name configured for the task, and taskB is that short name; the short name of the task is
recorded in the data storage area in association with the identifier of the task. The keys
described above may be, but are not limited to, time, rely and name.
In another embodiment, the dependency item contains an automatic dependency judgment flag.
Obtaining the dependency information of the task according to the dependency item of the
retrieved task then specifically means: starting automatic dependency judgment according to the
automatic dependency judgment flag in the dependency item of the retrieved task; and scanning the
specific execution content of the task to obtain the dependency information of the task from it.
Specifically, the dependency item of the task is "s*##@{ }", where the braces "{ }" enclose the
automatic dependency judgment flag, which may be, but is not limited to, 'autorely': '1'. When
the task scheduler encounters the automatic dependency judgment flag of a task, it starts
automatic dependency judgment: using the identifier of the task, it jumps to the file holding the
task's specific execution content and scans that content. When it reaches a keyword that
indicates the task's dependency information, the dependency information can be obtained. The
keywords can be set by engineers according to the circumstances, provided that they follow a
unified standard. In this embodiment, the specific execution content of a task is written in SQL,
so the execution content already contains the task's dependency information, and that
information immediately follows FROM or JOIN. Thus, when the task scheduler reaches the keyword
FROM or JOIN in the specific execution content, it reads the information immediately following
FROM or JOIN to obtain the task's dependency information. In addition, when the task table
contains a large number of tasks, it is enough to copy and paste the dependency item containing
the automatic dependency judgment flag, "s*##@{'autorely': '1'}", after each task in the task
table to configure the dependency items. Configuration is therefore simple, which improves
working efficiency and reduces time and labor costs.
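As a rough illustration of the automatic dependency judgment, the sketch below scans SQL text for the name that directly follows FROM or JOIN. The regular expression, the function name scan_dependencies and the sample query are assumptions for this example; the text does not specify how the scan is implemented or how the scanned names map to task identifiers.

```python
import re

# Assumption: a dependency is the table name that directly follows FROM or JOIN.
_DEP_PATTERN = re.compile(r"\b(?:FROM|JOIN)\s+([A-Za-z_][\w.]*)", re.IGNORECASE)


def scan_dependencies(sql: str) -> list[str]:
    """Return the names found after FROM or JOIN, in order, without duplicates."""
    seen = []
    for name in _DEP_PATTERN.findall(sql):
        if name not in seen:
            seen.append(name)
    return seen


sql = """
INSERT OVERWRITE TABLE daily_sales
SELECT o.user_id, SUM(o.amount)
FROM orders o
LEFT JOIN users u ON o.user_id = u.id
GROUP BY o.user_id
"""
deps = scan_dependencies(sql)   # names the task reads from: orders, then users
```

This simple pattern ignores subqueries that open with a parenthesis after FROM; a production scanner would more likely rely on a real SQL parser.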
In the present invention, the dependency information may be a dependence on a start time, a
dependence on other tasks, or a combination of both. Each case is described separately below,
detailing how, in step S3 above, it is judged from the dependency information whether the current
state meets the execution condition, so that the retrieved task is submitted to the task
processor for data processing.
In one embodiment, the dependency information is the start time on which the task depends. In
step S3 above, submitting the retrieved task to the task processor for data processing when the
current state is judged, according to the dependency information, to meet the execution condition
then specifically includes:
Judging whether the start time on which the task depends is earlier than the current time;
If so, judging that the current state meets the execution condition, submitting the retrieved
task to the task processor for data processing, and recording in the data storage area that the
retrieved task has been submitted to the task processor and is executing;
If not, judging that the current state does not meet the execution condition, and recording in
the data storage area that the retrieved task has not been submitted to the task processor for
execution. For example, a specific character is written into the execution information recorded
for the task in the data storage area, or a particular position in the execution information is
left empty, to record that the task has not been submitted to the task processor for execution.
It should be noted that, when the current state is judged not to meet the execution condition,
the task scheduler may also skip the task and continue selecting the next task from the task
table in task-queue order; that is, it returns to step S1 and continues to retrieve the next
pending task from the task table, until all tasks in the task table have been executed
successfully.
In this embodiment, the start time on which the task depends is, for example,
{'time': ${dt}2015-8-16 08:00:00}, where 'time' is the key for the start time and the specific
start time is 2015-8-16 08:00:00. When the specific start time 2015-8-16 08:00:00 is earlier
than, or the same as, the current time, the current state is judged to meet the execution
condition, and the retrieved task is submitted to the task processor for data processing.
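The start-time condition can be written out as a small check. The helper names and the assumption that the value after ${dt} can be parsed as "year-month-day hour:minute:second" are illustrative only.

```python
from datetime import datetime


def parse_start_time(text: str) -> datetime:
    # Assumption: the value has the form shown in the text, e.g. "2015-8-16 08:00:00".
    return datetime.strptime(text, "%Y-%m-%d %H:%M:%S")


def start_time_reached(start: datetime, now: datetime) -> bool:
    # The condition is met when the start time is earlier than or equal to the current time.
    return start <= now


start = parse_start_time("2015-8-16 08:00:00")
ready_at_nine = start_time_reached(start, datetime(2015, 8, 16, 9, 0, 0))
ready_at_seven = start_time_reached(start, datetime(2015, 8, 16, 7, 0, 0))
```

With these inputs the task is eligible at 09:00 but not at 07:00 on the same day.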
In another embodiment, the dependency information is the identifier of a task on which the task
depends. In step S3 above, submitting the retrieved task to the task processor for data
processing when the current state is judged, according to the dependency information, to meet the
execution condition then specifically includes:
Querying the data storage area, according to the identifier of the task on which the retrieved
task depends, for whether the task depended on has been executed successfully by the task
processor;
If so, judging that the current state meets the execution condition, submitting the retrieved
task to the task processor for data processing, and recording in the data storage area that the
retrieved task has been submitted to the task processor and is executing;
If not, judging that the current state does not meet the execution condition, and recording in
the data storage area that the retrieved task has not been submitted to the task processor for
execution.
It should be noted that, when the current state is judged not to meet the execution condition,
the task scheduler may also skip the task and continue selecting the next task from the task
table in task-queue order; that is, it returns to step S1 and continues to retrieve the next
pending task from the task table, until all tasks in the task table have been executed
successfully.
In this embodiment, the identifier of the task on which task taskC depends may be
{'rely': 'taskA'}, where rely indicates that task taskC depends on the successful execution of
task taskA, and taskA is the specific identifier of the task on which taskC depends. Using the
specific identifier taskA, the task scheduler queries the data storage area for whether task
taskA has been executed successfully by the task processor; when taskA has been executed
successfully, task taskC is submitted to the task processor for data processing. A task may
depend on several tasks, in which case it can be submitted to the task processor for data
processing only when all of the tasks it depends on have been executed successfully.
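The rule for one or several depended-on tasks reduces to an "all succeeded" check. In this sketch the data storage area is a plain dictionary from task identifier to state, and the name dependencies_succeeded is hypothetical.

```python
def dependencies_succeeded(depends_on, store) -> bool:
    """True only if every depended-on task is recorded as executed successfully."""
    return all(store.get(dep) == "success" for dep in depends_on)


# Assumed contents of the data storage area: taskA succeeded, taskB is still running.
store = {"taskA": "success", "taskB": "running"}

only_a = dependencies_succeeded(["taskA"], store)
a_and_b = dependencies_succeeded(["taskA", "taskB"], store)
```

A task that depends only on taskA may be submitted, while a task that depends on both taskA and taskB must wait until taskB has also succeeded.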
In a further embodiment, the dependency information is both the start time on which the retrieved
task depends and the identifier of a task on which it depends. In step S3 above, submitting the
retrieved task to the task processor for data processing when the current state is judged,
according to the dependency information, to meet the execution condition then specifically
includes:
Judging whether the start time on which the task depends is earlier than the current time;
When the start time is earlier than the current time, querying the data storage area, according
to the identifier of the task on which the task depends, for whether the task depended on has
been executed successfully by the task processor;
When the task depended on has been executed successfully, judging that the current state meets
the execution condition, submitting the retrieved task to the task processor for data processing,
and recording in the data storage area that the retrieved task has been submitted to the task
processor and is executing;
When the start time is later than the current time, or the start time is earlier than the current
time but the task depended on has not been executed successfully, judging that the current state
does not meet the execution condition, and recording in the data storage area that the retrieved
task has not been submitted to the task processor for execution.
It should be noted that, when the start time is later than the current time, or the start time is
earlier than the current time but the task depended on has not been executed successfully, and
the current state is therefore judged not to meet the execution condition, the task scheduler may
also skip the task and continue selecting the next task from the task table in task-queue order;
that is, it returns to step S1 and continues to retrieve the next pending task from the task
table, until all tasks in the task table have been executed successfully.
In this embodiment, the start time and the identifier of the task on which task taskC depends may
be given as {'time': ${dt}2015-8-16 08:00:00} and {'rely': 'taskA'}, or as
{'time': ${dt}2015-8-16 08:00:00, 'rely': 'taskA'}. Task taskC can then be submitted to the task
processor for data processing only when the specific start time 08:00:00 is earlier than or equal
to the current time and the task taskA depended on has finished executing.
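The combined condition can be sketched as a single predicate. The dictionary layout of dep_info (a "time" entry holding a datetime and a "rely" entry holding one identifier or a list of identifiers) and the function name are assumptions for this example; the text does not define a parsed representation of the dependency item.

```python
from datetime import datetime
from typing import Optional


def meets_execution_condition(dep_info: dict, store: dict, now: datetime) -> bool:
    """Check the start time first, then the depended-on tasks."""
    start: Optional[datetime] = dep_info.get("time")
    if start is not None and start > now:
        return False                      # start time is later than the current time
    rely = dep_info.get("rely", [])
    if isinstance(rely, str):
        rely = [rely]                     # allow a single identifier such as 'taskA'
    return all(store.get(dep) == "success" for dep in rely)


dep_info = {"time": datetime(2015, 8, 16, 8, 0, 0), "rely": "taskA"}
now = datetime(2015, 8, 16, 8, 30, 0)

before_a = meets_execution_condition(dep_info, {"taskA": "running"}, now)
after_a = meets_execution_condition(dep_info, {"taskA": "success"}, now)
```

At 08:30 taskC is held back while taskA is still running and becomes eligible once taskA is recorded as successful.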
Further, after the retrieved task is submitted to the task processor for data processing, the
method also includes:
Receiving the task execution result returned by the task processor;
Storing the task execution result in the data storage area;
Continuing to retrieve the next pending task from the task table, until all of the tasks in the
task table have been executed successfully.
In this embodiment, the task scheduler retrieves the next pending task only after it has received
the task execution result, so as to avoid conflicts while the task scheduler is processing tasks.
In addition, the task execution result may include, but is not limited to, the start and end
times of the task's execution, the time the execution took, information on whether the task
succeeded or failed, and the reason for any failure. Storing the task execution result in the
data storage area facilitates later troubleshooting and optimization; for example, if an engineer
finds from the data storage area that a certain task consistently takes a long time to execute,
further optimization of that task can be considered.
Further, after the task execution result returned by the task processor is received, the method
also includes:
Judging, according to the task execution result, whether the task was executed successfully;
If so, recording in the data storage area that the task was executed successfully;
If not, recording in the data storage area that the task failed to execute.
In this embodiment, all information about a task that the task processor records in the data
storage area is stored in association with the identifier of the task, so that the information
about the task can be looked up quickly by that identifier.
Further, after the failure of the task is recorded in the data storage area, the method also
includes:
Generating an alarm email according to the task execution result, and sending the alarm email to
a preset receiving port;
Or, adding one to the number of times the task has failed and, when the incremented count is
greater than an alarm threshold, generating an alarm email according to the task execution result
and sending the alarm email to the preset receiving port.
In this embodiment, the second alarm method is preferred: one is added to the number of times the
task has failed and, when the incremented count is greater than the alarm threshold, an alarm
email is generated according to the task execution result and sent to the preset receiving port.
This can further improve working efficiency and the reliability of task execution.
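The threshold-based alarm can be illustrated as follows. Sending an email is replaced by a caller-supplied send_alarm callback, and the threshold value 2 is made up for the example; neither detail comes from the text.

```python
def record_failure(task_id, failure_counts, threshold, send_alarm) -> bool:
    """Add one to the task's failure count and raise an alarm once it exceeds the threshold."""
    failure_counts[task_id] = failure_counts.get(task_id, 0) + 1
    if failure_counts[task_id] > threshold:
        send_alarm(f"task {task_id} failed {failure_counts[task_id]} times")
        return True
    return False


sent = []     # stands in for the preset receiving port
counts = {}
# Three consecutive failures of taskC with an alarm threshold of 2.
results = [record_failure("taskC", counts, 2, sent.append) for _ in range(3)]
```

Only the third failure exceeds the threshold, so a single alarm message is produced.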
The data task processing method provided by the embodiments of the present invention allows
multiple task schedulers to retrieve pending tasks from the task table concurrently. The task
table records multiple tasks and a dependency item for each task; dependency items are simple to
add, which improves working efficiency. A task scheduler can parse the dependencies of a
retrieved task from the task's dependency item and, when the current state is judged, according
to the dependency information, to meet the execution condition, submit the retrieved task to the
task processor for data processing. Tasks are thus executed in an orderly manner, data processing
errors are avoided, and the efficiency with which multiple task schedulers process tasks in
parallel is improved.
Embodiments of the present invention also provide a task scheduler and a system for implementing
data task processing, which can carry out all the flows of the data task processing method
described above; they are described in detail below with reference to Fig. 3 to Fig. 10.
Referring to Fig. 3, a schematic structural diagram of one embodiment of the task scheduler
provided by the present invention is shown.
As shown in Fig. 3, an embodiment of the present invention also provides a task scheduler,
including a task retrieval module 10, a dependency acquisition module 20 and a task processing
module 30, wherein:
The task retrieval module 10 is configured to retrieve a pending task from the task table;
wherein the task table records multiple tasks and a dependency item for each task, and the tasks
in the task table can be retrieved by multiple task schedulers;
The dependency acquisition module 20 is configured to obtain the dependency information of the
task according to the dependency item of the retrieved task;
The task processing module 30 is configured to submit the retrieved task to the task processor
for data processing when the current state is judged, according to the dependency information, to
meet the execution condition.
Further, as shown in Fig. 4, which is a schematic structural diagram of one embodiment of the task
retrieval module of the task scheduler provided by the present invention, the task retrieval
module 10 specifically includes a task selection unit 11, a first query unit 12 and a task
retrieval unit 13, wherein:
The task selection unit 11 is configured to select a task from the task table in task-queue
order;
The first query unit 12 is configured to query the data storage area, according to the identifier
of the task, for the retrieval information and execution information of the task; wherein the
data storage area records the retrieval information and execution information of each task in
the task table, the retrieval information records whether the task has been retrieved by a task
scheduler, and the execution information records whether the task has been submitted to the task
processor and is executing, and whether it has been executed successfully;
The task retrieval unit 13 is configured to retrieve the task, and record in the data storage
area that the task has been retrieved, when the selected task has not been retrieved, or has been
retrieved but not submitted to the task processor for execution, or has been submitted to the
task processor but failed to execute;
The task selection unit 11 is further configured to continue selecting the next task from the
task table in task-queue order when the selected task has been retrieved and submitted to the
task processor and is executing, or has been retrieved and executed successfully.
Further, when the dependency item contains the dependency information, the dependency acquisition
module 20 is specifically configured to obtain the dependency information of the task directly
from the dependency item of the retrieved task;
Or, when the dependency item contains an automatic dependency judgment flag, then, as shown in
Fig. 5, which is a schematic structural diagram of one embodiment of the dependency acquisition
module of the task scheduler provided by the present invention, the dependency acquisition module
20 specifically includes a self-judgment starting unit 22 and a scan acquisition unit 23, wherein:
The self-judgment starting unit 22 is configured to start automatic dependency judgment according
to the automatic dependency judgment flag in the dependency item of the retrieved task;
The scan acquisition unit 23 is configured to scan the specific execution content of the task and
obtain the dependency information of the task from it.
In one embodiment, the dependency information is the start time on which the task depends. Then,
as shown in Fig. 6, which is a schematic structural diagram of embodiment one of the task
processing module of the task scheduler provided by the present invention, the task processing
module 30 specifically includes a first judging unit 311, a first task submission unit 312 and a
first information recording unit 313, wherein:
The first judging unit 311 is configured to judge whether the start time on which the task
depends is earlier than the current time;
The first task submission unit 312 is configured to, when the start time on which the task
depends is earlier than the current time, judge that the current state meets the execution
condition, submit the retrieved task to the task processor for data processing, and record in the
data storage area that the retrieved task has been submitted to the task processor and is
executing;
The first information recording unit 313 is configured to, when the start time on which the task
depends is later than the current time, judge that the current state does not meet the execution
condition, and record in the data storage area that the retrieved task has not been submitted to
the task processor for execution.
In another embodiment, the dependency information is the identifier of a task on which the task
depends. Then, as shown in Fig. 7, which is a schematic structural diagram of embodiment two of
the task processing module of the task scheduler provided by the present invention, the task
processing module 30 specifically includes a second query unit 321, a second task submission unit
322 and a second information recording unit 323, wherein:
The second query unit 321 is configured to query the data storage area, according to the
identifier of the task on which the retrieved task depends, for whether the task depended on has
been executed successfully by the task processor;
The second task submission unit 322 is configured to, when the task depended on has been executed
successfully by the task processor, judge that the current state meets the execution condition,
submit the retrieved task to the task processor for data processing, and record in the data
storage area that the retrieved task has been submitted to the task processor and is executing;
The second information recording unit 323 is configured to, when the task depended on has not
been executed successfully by the task processor, judge that the current state does not meet the
execution condition, and record in the data storage area that the retrieved task has not been
submitted to the task processor for execution.
In a further embodiment, the dependency information is both the start time on which the retrieved
task depends and the identifier of a task on which it depends. Then, as shown in Fig. 8, which is
a schematic structural diagram of embodiment three of the task processing module of the task
scheduler provided by the present invention, the task processing module 30 specifically includes
a second judging unit 331, a third query unit 332, a third task submission unit 333 and a third
information recording unit 334, wherein:
The second judging unit 331 is configured to judge whether the start time on which the task
depends is earlier than the current time;
The third query unit 332 is configured to, when the start time is earlier than the current time,
query the data storage area, according to the identifier of the task on which the task depends,
for whether the task depended on has been executed successfully by the task processor;
The third task submission unit 333 is configured to, when the task depended on has been executed
successfully, judge that the current state meets the execution condition, submit the retrieved
task to the task processor for data processing, and record in the data storage area that the
retrieved task has been submitted to the task processor and is executing;
The third information recording unit 334 is configured to, when the start time is later than the
current time, or the start time is earlier than the current time but the task depended on has not
been executed successfully, judge that the current state does not meet the execution condition,
and record in the data storage area that the retrieved task has not been submitted to the task
processor for execution.
Further, the task scheduler also includes an information receiving module 40 and an information
storage module 50, wherein:
The information receiving module 40 is configured to receive the task execution result returned by
the task processor;
The information storage module 50 is configured to store the task execution result in the data
storage area;
The task retrieval module 10 is further configured to, after the task execution result has been
stored in the data storage area, continue to retrieve the next pending task from the task table,
until all of the tasks in the task table have been executed successfully.
Further, the task scheduler also includes a second judging module 60, a fourth information
recording module 70 and a fifth information recording module 80, wherein:
The second judging module 60 is configured to judge, after the task execution result returned by
the task processor is received, whether the task was executed successfully according to the task
execution result;
The fourth information recording module 70 is configured to record in the data storage area that
the task was executed successfully when the task was executed successfully;
The fifth information recording module 80 is configured to record in the data storage area that
the task failed to execute when the task failed to execute.
Further, the task scheduler also includes:
An alarm module 90, configured to, after the failure of the task is recorded in the data storage
area, generate an alarm email according to the task execution result and send the alarm email to
a preset receiving port;
Or, after the failure of the task is recorded in the data storage area, add one to the number of
times the task has failed and, when the incremented count is greater than an alarm threshold,
generate an alarm email according to the task execution result and send the alarm email to the
preset receiving port.
The system and method for implementing data task processing are described below with reference to
Fig. 9 and Fig. 10.
Referring to Fig. 9, a structural diagram of one embodiment of the system for implementing data
task processing provided by the present invention is shown. Multiple task schedulers 1 start at
random and read the same task table, which records multiple tasks and a dependency item for each
task. The task schedulers 1 process the tasks of the task table in parallel, each working through
them in order. Each task scheduler 1 obtains the dependency information of the task it has
retrieved from that task's dependency item, and judges the current state of the retrieved task
according to that dependency information. When the dependency information of the retrieved task
contains the identifier of a task depended on, the task scheduler uses that identifier to query
the data storage area 2 for the execution state of the task depended on. When the current state
of the retrieved task meets the execution condition, the task scheduler submits the retrieved
task to the task processor 3 for data processing; otherwise, it skips the task and continues to
retrieve the next task. In addition, each task scheduler 1 stores the execution result returned
by the task processor 3 in the data storage area 2 in association with the identifier of the
retrieved task, and judges from the returned execution result whether the retrieved task was
executed successfully; if execution failed, it generates an alarm email and sends the alarm email
to the preset receiving port 4.
Referring to Fig. 10, a schematic flowchart of another embodiment of the data task processing
method provided by the present invention is shown. This embodiment provides a more specific data
task processing method, including steps S202 to S214, as follows:
Step S202: select a task T from the task table;
Step S203: judge whether task T can be retrieved; that is, query the data storage area for
whether task T has been retrieved by a task scheduler, whether it has been submitted to the task
processor and is executing, and whether it has been executed successfully;
If so, that is, when task T has not been retrieved by a task scheduler, or has been retrieved by
a task scheduler but not submitted to the task processor for execution, or has been retrieved by
a task scheduler but failed to execute, perform step S204;
If not, that is, when task T has been retrieved by a task scheduler and executed successfully, or
has been retrieved by a task scheduler and submitted to the task processor and is executing,
return to step S202 and continue selecting the next task T+1 from the task table in task-queue
order;
Step S204: judge whether the dependency item of the task contains an automatic judgment flag;
If so, perform step S205; if not, go directly to step S206;
Step S205: scan the specific execution content and obtain the dependency information;
Step S206: judge whether a start time is present;
If so, perform step S207; if not, perform step S208;
Step S207: judge whether the start time is earlier than the current time;
If so, perform step S208; if not, return to step S202 and continue selecting the next task T+1
from the task table in task-queue order;
Step S208: judge whether the identifier of a task depended on is present;
If so, perform step S209; if not, perform step S210;
Step S209: query whether the task depended on has been executed successfully;
If so, perform step S210; if not, return to step S202 and continue selecting the next task T+1
from the task table in task-queue order;
Step S210: submit task T to the task processor for data processing;
Step S211: receive the returned execution result;
Step S212: save the returned execution result;
Step S213: judge whether the task was executed successfully;
If so, return to step S202 and continue selecting the next task T+1 from the task table in
task-queue order; if not, perform step S214;
Step S214: alarm handling; after alarm handling, return to step S202 and continue selecting the
next task from the task table in task-queue order.
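The flow of steps S202 to S214 can be condensed into one pass over the task table, as sketched below. This is an illustration under several assumptions: the data storage area is a dictionary from task identifier to state, the task processor is a synchronous callable, each task is a dictionary whose dependency information has already been parsed (so steps S204 and S205 are omitted), and all names and dates are hypothetical.

```python
from datetime import datetime


def run_pass(tasks, store, processor, now, alarm):
    """One pass over the task table following steps S202 to S214."""
    for task in tasks:                                            # S202
        if store.get(task["id"]) in ("running", "success"):       # S203
            continue
        start = task.get("time")                                  # S206
        if start is not None and start > now:                     # S207
            continue
        rely = task.get("rely", [])                               # S208
        if any(store.get(dep) != "success" for dep in rely):      # S209
            continue
        store[task["id"]] = "running"                             # S210
        ok = processor(task["id"])                                # S211
        store[task["id"]] = "success" if ok else "failed"         # S212, S213
        if not ok:
            alarm(task["id"])                                     # S214


now = datetime(2015, 8, 16, 9, 0, 0)
tasks = [
    {"id": "taskC", "rely": ["taskA"]},
    {"id": "taskA", "time": datetime(2015, 8, 16, 8, 0, 0)},
    {"id": "taskD", "time": datetime(2015, 8, 17, 8, 0, 0)},
    {"id": "taskE", "rely": ["taskB"]},
    {"id": "taskB"},
]
store, alarms = {}, []
processor = lambda task_id: task_id != "taskB"   # hypothetical processor: taskB always fails

run_pass(tasks, store, processor, now, alarms.append)
first_pass = dict(store)
run_pass(tasks, store, processor, now, alarms.append)
```

In the first pass taskC is skipped because taskA has not yet succeeded; in the second pass it runs. taskD waits for its start time, and taskE keeps waiting because taskB fails on every attempt and raises an alarm each time.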
Implementing the embodiments of the present invention has the following beneficial effects:
The task scheduler provided by the embodiments of the present invention can retrieve pending tasks
from the task table concurrently with other task schedulers. The task table records multiple
tasks and a dependency item for each task. The task scheduler can parse the dependencies of a
retrieved task from the task's dependency item and, when the current state is judged, according
to the dependency information, to meet the execution condition, submit the retrieved task to the
task processor for data processing. Tasks are thus executed in an orderly manner, data processing
errors are avoided, and the efficiency with which multiple task schedulers process tasks in
parallel is improved.
A person of ordinary skill in the art will understand that all or part of the flows in the method
embodiments above can be completed by a computer program instructing the relevant hardware. The
program can be stored in a computer-readable storage medium and, when executed, may include the
flows of the method embodiments above. The storage medium may be a magnetic disk, an optical
disc, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory,
RAM), or the like.
The above are preferred embodiments of the present invention. It should be noted that a person of
ordinary skill in the art can make improvements and modifications without departing from the
principles of the present invention, and such improvements and modifications are also regarded as
falling within the protection scope of the present invention.
Claims (18)
1. a kind of processing method of data task, it is characterised in that include:
A pending task is transferred from task list;Wherein, the task list record multiple tasks and
The dependence item of each task, and the task described in the task list can be transferred by multiple tasks scheduler;
According to the dependence item of being transferred for task, the Dependency Specification of the task is obtained;
When judging that current state meets execution condition according to the Dependency Specification, being transferred for task is submitted to
Task processor performs data processing.
2. The processing method of data tasks according to claim 1, characterized in that fetching a task to be executed from the task table specifically comprises:
selecting a task from the task table in task queue order;
querying, according to the identifier of the task, fetch information and execution information of the task from a data storage area, wherein the data storage area records the fetch information and the execution information of each task in the task table, the fetch information records whether the task has been fetched by a task scheduler, and the execution information records whether the task has been submitted to a task processor and is being executed, and whether it has executed successfully;
when the selected task has not been fetched, or has been fetched but not submitted to a task processor for execution, or has been submitted to a task processor for execution but the execution failed, fetching the task and recording in the data storage area that the task has been fetched; and
when the selected task has been fetched and submitted to a task processor and is being executed, or has been fetched and has executed successfully, continuing to select the next task from the task table in task queue order.
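The selection rule of claim 2 can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the `TaskRecord` fields, the status strings, and the use of an in-memory dictionary as the data storage area are all hypothetical and are not specified by the claims.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class TaskRecord:
    """State kept per task in the data storage area (field names are hypothetical)."""
    fetched: bool = False            # fetch information
    status: str = "not_submitted"    # execution information: not_submitted, running, succeeded, failed


def should_fetch(rec: TaskRecord) -> bool:
    """True when the task was never fetched, was fetched but not submitted, or failed."""
    return (not rec.fetched) or rec.status in ("not_submitted", "failed")


def fetch_next(task_table: List[str], storage: Dict[str, TaskRecord]) -> Optional[str]:
    """Walk the task table in queue order and fetch the first eligible task."""
    for task_id in task_table:
        rec = storage.setdefault(task_id, TaskRecord())
        if should_fetch(rec):
            rec.fetched = True       # record in the storage area that the task was fetched
            return task_id
        # Running or already successful: move on to the next task in the queue.
    return None
```

With several schedulers sharing one storage area, the check and the update would have to be atomic (for example a conditional update in a database); the sketch leaves that out.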
3. The processing method of data tasks according to claim 1, characterized in that the dependency item contains the dependency information, and obtaining the dependency information of the task according to the dependency item of the fetched task is specifically:
obtaining the dependency information of the task directly from the dependency item of the fetched task;
or, the dependency item contains an automatic dependency judgment flag, and obtaining the dependency information of the task according to the dependency item of the fetched task is specifically:
starting automatic dependency judgment according to the automatic dependency judgment flag in the dependency item of the fetched task; and
scanning the specific execution content of the task and obtaining the dependency information of the task therefrom.
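The two alternatives of claim 3 can be sketched as follows. This is a rough illustration, not the patented scanning procedure: it assumes that the execution content is a SQL string, that the dependency item is a dictionary with hypothetical keys `auto` and `depends_on`, and that every table read after `FROM` or `JOIN` names a task that is depended on.

```python
import re
from typing import Any, Dict, List

# Assumption: tables read by the task appear after FROM or JOIN in its SQL text.
_TABLE_REF = re.compile(r"\b(?:from|join)\s+([A-Za-z_][\w.]*)", re.IGNORECASE)


def get_dependencies(dependency_item: Dict[str, Any], content: str) -> List[str]:
    """Return the dependency information of a fetched task."""
    if dependency_item.get("auto"):
        # Automatic dependency judgment: scan the execution content of the task.
        found: List[str] = []
        for name in _TABLE_REF.findall(content):
            if name not in found:
                found.append(name)
        return found
    # Otherwise the dependency item carries the dependency information directly.
    return list(dependency_item.get("depends_on", []))
```

A real scanner would need a proper SQL parser to handle subqueries, comments, and quoted identifiers; the regular expression is only meant to make the idea concrete.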
4. The processing method of data tasks according to claim 1, characterized in that the dependency information is the start time on which the task depends;
and, when it is determined according to the dependency information that the current state meets the execution condition, submitting the fetched task to a task processor for data processing specifically comprises:
determining whether the start time on which the task depends is earlier than the current time;
if so, determining that the current state meets the execution condition, submitting the fetched task to the task processor for data processing, and recording in the data storage area that the fetched task has been submitted to the task processor and is being executed;
if not, determining that the current state does not meet the execution condition, and recording in the data storage area that the fetched task has not been submitted to a task processor for execution.
5. The processing method of data tasks according to claim 1, characterized in that the dependency information is the identifier of the task on which the task depends;
and, when it is determined according to the dependency information that the current state meets the execution condition, submitting the fetched task to a task processor for data processing specifically comprises:
querying from the data storage area, according to the identifier of the task on which the fetched task depends, whether the depended-on task has been executed successfully by the task processor;
if so, determining that the current state meets the execution condition, submitting the fetched task to the task processor for data processing, and recording in the data storage area that the fetched task has been submitted to the task processor and is being executed;
if not, determining that the current state does not meet the execution condition, and recording in the data storage area that the fetched task has not been submitted to a task processor for execution.
6. The processing method of data tasks according to claim 1, characterized in that the dependency information is the start time on which the fetched task depends and the identifier of the task on which it depends;
and, when it is determined according to the dependency information that the current state meets the execution condition, submitting the fetched task to a task processor for data processing specifically comprises:
determining whether the start time on which the task depends is earlier than the current time;
when the start time is earlier than the current time, querying from the data storage area, according to the identifier of the task on which the task depends, whether the depended-on task has been executed successfully by the task processor;
when the depended-on task has executed successfully, determining that the current state meets the execution condition, submitting the fetched task to the task processor for data processing, and recording in the data storage area that the fetched task has been submitted to the task processor and is being executed;
when the start time is later than the current time, or the start time is earlier than the current time but the depended-on task has not executed successfully, determining that the current state does not meet the execution condition, and recording in the data storage area that the fetched task has not been submitted to a task processor for execution.
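The combined check of claim 6 (which reduces to claim 4 when only a start time is given and to claim 5 when only a depended-on task is given) might look like the sketch below. The function names, the status strings, and the dictionary standing in for the data storage area are assumptions made for illustration; `submit` is a placeholder for handing the task to a task processor.

```python
from datetime import datetime
from typing import Callable, Dict, Optional


def can_execute(start_time: Optional[datetime],
                depends_on: Optional[str],
                status_by_task: Dict[str, str],
                now: datetime) -> bool:
    """Check the start-time dependency first, then the task dependency."""
    if start_time is not None and not start_time < now:
        return False                 # the start time has not passed yet
    if depends_on is not None and status_by_task.get(depends_on) != "succeeded":
        return False                 # the depended-on task has not succeeded
    return True


def schedule(task_id: str,
             start_time: Optional[datetime],
             depends_on: Optional[str],
             status_by_task: Dict[str, str],
             now: datetime,
             submit: Callable[[str], None]) -> bool:
    """Submit the task if the execution condition holds and record the outcome."""
    if can_execute(start_time, depends_on, status_by_task, now):
        submit(task_id)
        status_by_task[task_id] = "running"        # submitted and being executed
        return True
    status_by_task[task_id] = "not_submitted"      # left for a later pass
    return False
```

Passing `now` explicitly rather than calling `datetime.now()` inside the function keeps the check deterministic and easy to test.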
7. The processing method of data tasks according to any one of claims 1 to 6, characterized in that, after the fetched task is submitted to the task processor for data processing, the method further comprises:
receiving the task execution result returned by the task processor;
storing the task execution result in the data storage area; and
continuing to fetch the next task to be executed from the task table until all the tasks in the task table have executed successfully.
8. The processing method of data tasks according to claim 7, characterized in that, after receiving the task execution result returned by the task processor, the method further comprises:
determining, according to the task execution result, whether the task executed successfully;
if so, recording in the data storage area that the task executed successfully;
if not, recording in the data storage area that the task execution failed.
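The loop described in claims 7 and 8 can be sketched as follows, under simplifying assumptions: `execute` stands in for the task processor and returns `True` on success, the data storage area is a plain dictionary, and `max_rounds` is a safety bound added here so that a permanently failing task cannot loop forever (the claims themselves state no such bound).

```python
from typing import Callable, Dict, List


def run_until_done(task_table: List[str],
                   execute: Callable[[str], bool],
                   storage: Dict[str, str],
                   max_rounds: int = 10) -> bool:
    """Repeat passes over the task table until every task has succeeded."""
    for _ in range(max_rounds):
        pending = [t for t in task_table if storage.get(t) != "succeeded"]
        if not pending:
            return True
        for task_id in pending:
            ok = execute(task_id)    # execution result returned by the processor
            # Store the result: success or failure is recorded per task.
            storage[task_id] = "succeeded" if ok else "failed"
    return all(storage.get(t) == "succeeded" for t in task_table)
```

Failed tasks stay eligible on the next pass, which matches the rule that a task whose execution failed may be fetched again.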
9. The processing method of data tasks according to claim 8, characterized in that, after recording in the data storage area that the task execution failed, the method further comprises:
generating an alarm email according to the task execution result, and sending the alarm email to a preset receiving port;
or, incrementing the number of execution failures of the task by one and, when the incremented number exceeds an alarm threshold, generating an alarm email according to the task execution result and sending the alarm email to a preset receiving port.
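The threshold-based alternative of claim 9 can be illustrated with a small counter. The class name, the message format, and the `send` callable (a stand-in for whatever actually delivers the alarm email to the preset receiving port) are hypothetical.

```python
from typing import Callable, Dict


class FailureAlarm:
    """Counts failures per task and raises an alarm once a threshold is exceeded."""

    def __init__(self, threshold: int, send: Callable[[str], None]) -> None:
        self.threshold = threshold
        self.send = send
        self.failures: Dict[str, int] = {}

    def record_failure(self, task_id: str, result: str) -> bool:
        """Add one to the failure count; return True if an alarm was sent."""
        count = self.failures.get(task_id, 0) + 1
        self.failures[task_id] = count
        if count > self.threshold:
            self.send(f"Task {task_id} failed {count} times: {result}")
            return True
        return False
```

Setting `threshold=0` reproduces the first alternative of the claim, in which every failure produces an alarm.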
10. A task scheduler, characterized by comprising:
a task fetching module, configured to fetch a task to be executed from a task table, wherein the task table records multiple tasks and a dependency item of each task, and the tasks recorded in the task table can be fetched by multiple task schedulers;
a dependency obtaining module, configured to obtain dependency information of the task according to the dependency item of the fetched task; and
a task processing module, configured to submit the fetched task to a task processor for data processing when it is determined according to the dependency information that the current state meets the execution condition.
11. The task scheduler according to claim 10, characterized in that the task fetching module specifically comprises:
a task selecting unit, configured to select a task from the task table in task queue order;
a first query unit, configured to query, according to the identifier of the task, fetch information and execution information of the task from a data storage area, wherein the data storage area records the fetch information and the execution information of each task in the task table, the fetch information records whether the task has been fetched by a task scheduler, and the execution information records whether the task has been submitted to a task processor and is being executed, and whether it has executed successfully;
a task fetching unit, configured to fetch the task and record in the data storage area that the task has been fetched when the selected task has not been fetched, or has been fetched but not submitted to a task processor for execution, or has been submitted to a task processor for execution but the execution failed;
the task selecting unit being further configured to continue to select the next task from the task table in task queue order when the selected task has been fetched and submitted to a task processor and is being executed, or has been fetched and has executed successfully.
12. The task scheduler according to claim 10, characterized in that the dependency item contains the dependency information, and the dependency obtaining module is specifically configured to obtain the dependency information of the task directly from the dependency item of the fetched task;
or, the dependency item contains an automatic dependency judgment flag, and the dependency obtaining module specifically comprises:
an automatic judgment starting unit, configured to start automatic dependency judgment according to the automatic dependency judgment flag in the dependency item of the fetched task; and
a scanning and obtaining unit, configured to scan the specific execution content of the task and obtain the dependency information of the task therefrom.
13. The task scheduler according to claim 10, characterized in that the dependency information is the start time on which the task depends;
and the task processing module specifically comprises:
a first judging unit, configured to determine whether the start time on which the task depends is earlier than the current time;
a first task submitting unit, configured to, when the start time on which the task depends is earlier than the current time, determine that the current state meets the execution condition, submit the fetched task to the task processor for data processing, and record in the data storage area that the fetched task has been submitted to the task processor and is being executed; and
a first information recording unit, configured to, when the start time on which the task depends is later than the current time, determine that the current state does not meet the execution condition, and record in the data storage area that the fetched task has not been submitted to a task processor for execution.
14. The task scheduler according to claim 10, characterized in that the dependency information is the identifier of the task on which the task depends;
and the task processing module specifically comprises:
a second query unit, configured to query from the data storage area, according to the identifier of the task on which the fetched task depends, whether the depended-on task has been executed successfully by the task processor;
a second task submitting unit, configured to, when the depended-on task has been executed successfully by the task processor, determine that the current state meets the execution condition, submit the fetched task to the task processor for data processing, and record in the data storage area that the fetched task has been submitted to the task processor and is being executed; and
a second information recording unit, configured to, when the depended-on task has not been executed successfully by the task processor, determine that the current state does not meet the execution condition, and record in the data storage area that the fetched task has not been submitted to a task processor for execution.
15. The task scheduler according to claim 10, characterized in that the dependency information is the start time on which the fetched task depends and the identifier of the task on which it depends;
and the task processing module specifically comprises:
a second judging unit, configured to determine whether the start time on which the task depends is earlier than the current time;
a third query unit, configured to, when the start time is earlier than the current time, query from the data storage area, according to the identifier of the task on which the task depends, whether the depended-on task has been executed successfully by the task processor;
a third task submitting unit, configured to, when the depended-on task has executed successfully, determine that the current state meets the execution condition, submit the fetched task to the task processor for data processing, and record in the data storage area that the fetched task has been submitted to the task processor and is being executed; and
a third information recording unit, configured to, when the start time is later than the current time, or the start time is earlier than the current time but the depended-on task has not executed successfully, determine that the current state does not meet the execution condition, and record in the data storage area that the fetched task has not been submitted to a task processor for execution.
16. The task scheduler according to any one of claims 10 to 15, characterized in that the task scheduler further comprises:
an information receiving module, configured to receive the task execution result returned by the task processor;
an information storing module, configured to store the task execution result in the data storage area;
the task fetching module being further configured to, after the task execution result is stored in the data storage area, continue to fetch the next task to be executed from the task table until all the tasks in the task table have executed successfully.
17. The task scheduler according to claim 16, characterized in that the task scheduler further comprises:
a second judging module, configured to determine, after the task execution result returned by the task processor is received, whether the task executed successfully according to the task execution result;
a fourth information recording module, configured to record in the data storage area that the task executed successfully when the task executed successfully; and
a fifth information recording module, configured to record in the data storage area that the task execution failed when the task execution failed.
18. The task scheduler according to claim 17, characterized in that the task scheduler further comprises:
an alarm module, configured to, after the task execution failure is recorded in the data storage area, generate an alarm email according to the task execution result and send the alarm email to a preset receiving port;
or, configured to, after the task execution failure is recorded in the data storage area, increment the number of execution failures of the task by one and, when the incremented number exceeds an alarm threshold, generate an alarm email according to the task execution result and send the alarm email to a preset receiving port.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510573492.8A CN106528275A (en) | 2015-09-10 | 2015-09-10 | Processing method of data tasks and task scheduler |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106528275A (en) | 2017-03-22 |
Family
ID=58346069
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510573492.8A Pending CN106528275A (en) | 2015-09-10 | 2015-09-10 | Processing method of data tasks and task scheduler |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106528275A (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107341061A (en) * | 2017-07-20 | 2017-11-10 | 郑州云海信息技术有限公司 | A kind of data dispatch processing method and processing device |
CN107402956A (en) * | 2017-06-07 | 2017-11-28 | 网易(杭州)网络有限公司 | Data processing method, equipment and the computer-readable recording medium of big task |
CN107423131A (en) * | 2017-03-29 | 2017-12-01 | 努比亚技术有限公司 | A kind of sharing method and server |
CN108108235A (en) * | 2017-12-07 | 2018-06-01 | 北京荣之联科技股份有限公司 | Task processing method and device |
CN108182111A (en) * | 2018-01-23 | 2018-06-19 | 百度在线网络技术(北京)有限公司 | Task scheduling system, method and apparatus |
CN108319499A (en) * | 2018-02-07 | 2018-07-24 | 麒麟合盛网络技术股份有限公司 | Method for scheduling task and device |
CN108694564A (en) * | 2018-06-07 | 2018-10-23 | 阿里巴巴集团控股有限公司 | A kind of task status control method and device |
CN108805723A (en) * | 2018-05-24 | 2018-11-13 | 中国平安人寿保险股份有限公司 | Data creation method, equipment, storage medium based on loss ratio system and device |
CN109271238A (en) * | 2017-07-12 | 2019-01-25 | 北京京东尚科信息技术有限公司 | Support the task scheduling apparatus and method of a variety of programming languages |
CN109375993A (en) * | 2018-08-21 | 2019-02-22 | 中国平安人寿保险股份有限公司 | Detection method, device, computer equipment and the storage medium of dispatch list |
CN109471705A (en) * | 2017-09-08 | 2019-03-15 | 杭州海康威视数字技术股份有限公司 | Method, equipment and system, the computer equipment of task schedule |
CN109933414A (en) * | 2017-12-18 | 2019-06-25 | 飞狐信息技术(天津)有限公司 | A kind of method for scheduling task, apparatus and system |
CN110119306A (en) * | 2019-05-22 | 2019-08-13 | 无锡华云数据技术服务有限公司 | Operation Automatic dispatching equalization methods, device, equipment and storage medium |
CN110673938A (en) * | 2019-09-23 | 2020-01-10 | 腾讯科技(深圳)有限公司 | Task processing method, system, server and storage medium |
CN110766341A (en) * | 2019-10-31 | 2020-02-07 | 北京东软望海科技有限公司 | Control method and device for optimizing job scheduling, computer equipment and storage medium |
CN110764883A (en) * | 2018-07-26 | 2020-02-07 | 浙江宇视科技有限公司 | Start-stop control method and device for service group |
CN110807048A (en) * | 2019-10-30 | 2020-02-18 | 泰康保险集团股份有限公司 | Automatic task processing method and device, computer storage medium and electronic equipment |
CN111190725A (en) * | 2019-08-09 | 2020-05-22 | 腾讯科技(深圳)有限公司 | Task processing method and device, storage medium and server |
WO2020186809A1 (en) * | 2019-03-19 | 2020-09-24 | 深圳壹账通智能科技有限公司 | Big data platform-based hive task scheduling method and apparatus, device, and storage medium |
CN112052575A (en) * | 2020-08-27 | 2020-12-08 | 浙江工业大学 | Automatic working principle screening method based on function execution sequence |
CN112069367A (en) * | 2020-09-07 | 2020-12-11 | 上海金仕达软件科技有限公司 | Task scheduling method and system |
CN112905321A (en) * | 2021-02-07 | 2021-06-04 | 北京红山信息科技研究院有限公司 | Event response type task triggering method and device, electronic equipment and storage medium |
CN113760471A (en) * | 2020-06-01 | 2021-12-07 | 阿里巴巴集团控股有限公司 | Task processing system and method, continuous integration system, execution machine and storage device |
CN114650287A (en) * | 2022-04-06 | 2022-06-21 | 中国工商银行股份有限公司 | Breakpoint continuous making method, device and system of data file |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101425024A (en) * | 2008-10-24 | 2009-05-06 | 中国移动通信集团山东有限公司 | Multitasking method and device |
CN101567013A (en) * | 2009-06-02 | 2009-10-28 | 阿里巴巴集团控股有限公司 | Method and apparatus for implementing ETL scheduling |
US20110131448A1 (en) * | 2009-11-30 | 2011-06-02 | Iron Mountain, Incorporated | Performing a workflow having a set of dependancy-related predefined activities on a plurality of task servers |
US20110321051A1 (en) * | 2010-06-25 | 2011-12-29 | Ebay Inc. | Task scheduling based on dependencies and resources |
CN103064745A (en) * | 2013-01-09 | 2013-04-24 | 苏州亿倍信息技术有限公司 | Method and system for distributing and processing tasks |
CN103294533A (en) * | 2012-10-30 | 2013-09-11 | 北京安天电子设备有限公司 | Task flow control method and task flow control system |
CN104536811A (en) * | 2014-12-26 | 2015-04-22 | 广州华多网络科技有限公司 | HIVE task based task scheduling method and device |
Cited By (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107423131A (en) * | 2017-03-29 | 2017-12-01 | 努比亚技术有限公司 | A kind of sharing method and server |
CN107423131B (en) * | 2017-03-29 | 2021-01-15 | 努比亚技术有限公司 | Sharing method and server |
CN107402956A (en) * | 2017-06-07 | 2017-11-28 | 网易(杭州)网络有限公司 | Data processing method, equipment and the computer-readable recording medium of big task |
CN107402956B (en) * | 2017-06-07 | 2020-02-21 | 网易有道信息技术(杭州)有限公司 | Data processing method and device for large task and computer readable storage medium |
CN109271238A (en) * | 2017-07-12 | 2019-01-25 | 北京京东尚科信息技术有限公司 | Support the task scheduling apparatus and method of a variety of programming languages |
CN107341061A (en) * | 2017-07-20 | 2017-11-10 | 郑州云海信息技术有限公司 | A kind of data dispatch processing method and processing device |
CN109471705A (en) * | 2017-09-08 | 2019-03-15 | 杭州海康威视数字技术股份有限公司 | Method, equipment and system, the computer equipment of task schedule |
CN108108235B (en) * | 2017-12-07 | 2021-10-29 | 哈勃智远(北京)科技有限公司 | Task processing method and device |
CN108108235A (en) * | 2017-12-07 | 2018-06-01 | 北京荣之联科技股份有限公司 | Task processing method and device |
CN109933414A (en) * | 2017-12-18 | 2019-06-25 | 飞狐信息技术(天津)有限公司 | A kind of method for scheduling task, apparatus and system |
CN108182111A (en) * | 2018-01-23 | 2018-06-19 | 百度在线网络技术(北京)有限公司 | Task scheduling system, method and apparatus |
CN108182111B (en) * | 2018-01-23 | 2022-07-15 | 百度在线网络技术(北京)有限公司 | Task scheduling system, method and device |
CN108319499A (en) * | 2018-02-07 | 2018-07-24 | 麒麟合盛网络技术股份有限公司 | Method for scheduling task and device |
CN108319499B (en) * | 2018-02-07 | 2021-06-04 | 麒麟合盛网络技术股份有限公司 | Task scheduling method and device |
CN108805723A (en) * | 2018-05-24 | 2018-11-13 | 中国平安人寿保险股份有限公司 | Data creation method, equipment, storage medium based on loss ratio system and device |
CN108694564A (en) * | 2018-06-07 | 2018-10-23 | 阿里巴巴集团控股有限公司 | A kind of task status control method and device |
CN110764883A (en) * | 2018-07-26 | 2020-02-07 | 浙江宇视科技有限公司 | Start-stop control method and device for service group |
CN110764883B (en) * | 2018-07-26 | 2022-08-23 | 浙江宇视科技有限公司 | Start-stop control method and device for service group |
CN109375993B (en) * | 2018-08-21 | 2024-04-05 | 中国平安人寿保险股份有限公司 | Method, device, computer equipment and storage medium for detecting schedule |
CN109375993A (en) * | 2018-08-21 | 2019-02-22 | 中国平安人寿保险股份有限公司 | Detection method, device, computer equipment and the storage medium of dispatch list |
WO2020186809A1 (en) * | 2019-03-19 | 2020-09-24 | 深圳壹账通智能科技有限公司 | Big data platform-based hive task scheduling method and apparatus, device, and storage medium |
CN110119306A (en) * | 2019-05-22 | 2019-08-13 | 无锡华云数据技术服务有限公司 | Operation Automatic dispatching equalization methods, device, equipment and storage medium |
CN111190725A (en) * | 2019-08-09 | 2020-05-22 | 腾讯科技(深圳)有限公司 | Task processing method and device, storage medium and server |
CN110673938B (en) * | 2019-09-23 | 2021-05-28 | 腾讯科技(深圳)有限公司 | Task processing method, system, server and storage medium |
CN110673938A (en) * | 2019-09-23 | 2020-01-10 | 腾讯科技(深圳)有限公司 | Task processing method, system, server and storage medium |
CN110807048A (en) * | 2019-10-30 | 2020-02-18 | 泰康保险集团股份有限公司 | Automatic task processing method and device, computer storage medium and electronic equipment |
CN110766341A (en) * | 2019-10-31 | 2020-02-07 | 北京东软望海科技有限公司 | Control method and device for optimizing job scheduling, computer equipment and storage medium |
CN113760471A (en) * | 2020-06-01 | 2021-12-07 | 阿里巴巴集团控股有限公司 | Task processing system and method, continuous integration system, execution machine and storage device |
CN112052575A (en) * | 2020-08-27 | 2020-12-08 | 浙江工业大学 | Automatic working principle screening method based on function execution sequence |
CN112069367A (en) * | 2020-09-07 | 2020-12-11 | 上海金仕达软件科技有限公司 | Task scheduling method and system |
CN112905321A (en) * | 2021-02-07 | 2021-06-04 | 北京红山信息科技研究院有限公司 | Event response type task triggering method and device, electronic equipment and storage medium |
CN114650287A (en) * | 2022-04-06 | 2022-06-21 | 中国工商银行股份有限公司 | Breakpoint continuous making method, device and system of data file |
CN114650287B (en) * | 2022-04-06 | 2024-04-30 | 中国工商银行股份有限公司 | Breakpoint continuous method, device and system for data file |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20170322 |