CN108205470A - A kind of distribution ad data calculating task management system and method - Google Patents

A kind of distribution ad data calculating task management system and method Download PDF

Info

Publication number
CN108205470A
CN108205470A CN201611188392.4A CN201611188392A CN108205470A CN 108205470 A CN108205470 A CN 108205470A CN 201611188392 A CN201611188392 A CN 201611188392A CN 108205470 A CN108205470 A CN 108205470A
Authority
CN
China
Prior art keywords
task
data calculating
calculating task
library
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611188392.4A
Other languages
Chinese (zh)
Inventor
王晓伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201611188392.4A priority Critical patent/CN108205470A/en
Publication of CN108205470A publication Critical patent/CN108205470A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5038Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • G06F9/5088Techniques for rebalancing the load in a distributed system involving task migration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0247Calculate past, present or future revenues
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5021Priority

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention discloses a kind of distributed ad data calculating task management system and methods.The system includes:Task creation server, task library, distributed server cluster and multiple client;Task creation server, suitable for ad data calculating task is saved in task library;Task library, suitable for storing the ad data calculating task that the task creation server is created;The ad data calculating task extracted suitable for extracting ad data calculating task from task library, and is sent to distributed server cluster by client;Distributed server cluster, the ad data calculating task sent suitable for operation client.As it can be seen that the drawbacks of the invention avoids systemic breakdown is caused during single client failure;Ensure that multiple tasks are performed simultaneously simultaneously, improve system running speed;And system resource is made to have obtained effective distribution and make full use of, and then realize system load balancing.

Description

A kind of distribution ad data calculating task management system and method
Technical field
The present invention relates to distributed proccessing fields, and in particular to a kind of distribution ad data calculating task management system System and method.
Background technology
With the WEB application more and more universal diversification with business, distributed task management system has obtained users Favor.But existing distributed management system has the following problems, cause system resource do not obtain effectively distribute and It makes full use of, and then makes system load out of balance.
Existing distributed management system there are the problem of it is as follows:(1) a centre management father node is only set to system Interior child node publication control instruction, causes synchronization that can only perform an instruction, is not had so as to the resource of system The utilization of effect, once and the centre management father node break down, whole system will be out of service.(2) distributed management system The task that part of nodes performs in system is more, and the task that part of nodes performs is few, leads to part of nodes in system at runtime, The problems such as his node in a dormant state, leads to system resource waste, load imbalance long-term existence.
Invention content
In view of the above problems, it is proposed that the present invention overcomes the above problem in order to provide one kind or solves at least partly State the distributed task management system and method for problem.
One side according to the present invention provides a kind of distributed ad data calculating task management system, wherein, it should Ad data calculating task management system includes:Task creation server, task library, distributed server cluster and multiple clients End;
Task creation server, suitable for ad data calculating task is saved in task library;
Task library, suitable for storing the ad data calculating task that the task creation server is created;
Client, suitable for extracting ad data calculating task, and the ad data calculating task that will be extracted from task library It is sent to distributed server cluster;
Distributed server cluster, the ad data calculating task sent suitable for operation client.
Optionally, the distributed server cluster includes:Distributed resource manager YARN, suitable for being sent to client Ad data calculating task be scheduled, run with being assigned in the respective server in cluster.
Optionally, in the distributed server cluster, YARN is using container DOCKER in the cluster on server for not Different running environment needed for same ad data calculating task structure operation.
Optionally, the distributed resource manager YARN, suitable for the ad data calculating task in cluster is run money Source is divided into multigroup, and one of which is preferential group, the correspondence markings ad data calculating task of assigned priority information;Work as reception During to the ad data calculating task that assigned priority information is marked, it is assigned to preferential group and is run;If in preferential group It is run temporarily without ad data calculating task, then the part ad data calculating task operation resource allocation in preferential group is arrived it He organizes help, and other organize operation ad data calculating tasks, once there is the operation of ad data calculating task in preferential group, recycling divides The operation resource of other groups is fitted on to preferential group.
Optionally, the task creation server, suitable for receiving newly-built ad data calculating task by front end page, Ad data calculating task will be received to be saved in task library;And it is further adapted for receiving modification ad data by front end page The instruction of calculating task modifies to the respective advertisement data calculating task in task library operation according to the instruction.
Optionally, client, suitable for from task library extract ad data calculating task when, first check task in task library Table, according to the high ad data calculating task of task list advantage distillation priority, and the ad data calculating task that will be extracted It is sent to distributed server cluster;
Wherein, ad data calculating task currently stored in task library and corresponding preservation are saved in task list The precedence information of each ad data calculating task.
Optionally, the multiple client includes one or more priority tasks processing clients;
Priority tasks handle client, suitable for extracting the ad data that assigned priority information is marked from task library Calculating task, and the ad data calculating task extracted is sent to distributed server cluster;
Other clients in addition to priority tasks handle client, it is unmarked specified excellent suitable for being extracted from task library The ad data calculating task of first grade information, and the ad data calculating task extracted is sent to distributed server collection Group.
Optionally, distributed server cluster, suitable for the operating status of each ad data calculating task run is deposited It stores up in the ad data calculating task operating status list in task library.
Optionally, client when being further adapted for extracting an ad data calculating task from task library, judges that this is wide Accuse whether data calculating task depends on other ad data calculating tasks;If the ad data calculating task is independent of it His ad data calculating task, then directly extract the ad data calculating task from task library and be sent to distributed server Cluster;If the ad data calculating task depends on other ad data calculating tasks, pass through the advertisement in query task library Data calculating task operating status list, judge the ad data calculating task rely on other ad data calculating tasks whether It is finished;The ad data calculating task is extracted from task library if being finished and is sent to distributed server collection Group;It carries out waiting until other ad data calculating tasks that the ad data calculating task relies on if being not carried out finishing It extracts the ad data calculating task when being finished from task library again and is sent to distributed server cluster.
Optionally, the task library is appointed suitable for extracting an ad data when the request for receiving a client and calculating During the request of business, lock token is added for the ad data calculating task, other clients has been avoided to ask to lift the advertisement again Data calculating task.
According to another aspect of the present invention, a kind of distributed ad data calculating task management method is provided, wherein,
Ad data calculating task is saved in task library by task creation server;
The ad data calculating task that task creation server described in task library storage is created;
Client extracts ad data calculating task from task library, and the ad data calculating task extracted is sent to Distributed server cluster;
The ad data calculating task that distributed server cluster operation client is sent.
Optionally, client is sent by the distributed resource manager YARN in the distributed server cluster wide It accuses data calculating task to be scheduled, be run with being assigned in the respective server in cluster.
Optionally, the distributed resource manager YARN is using being different on container DOCKER in the cluster server Different running environment needed for the structure operation of ad data calculating task.
Optionally, the ad data calculating task operation resource in cluster is divided by the distributed resource manager YARN Multigroup, one of which is preferential group, the correspondence markings ad data calculating task of assigned priority information;YARN, which works as, to be received When the ad data calculating task of assigned priority information is marked, it is assigned to preferential group and is run;If in preferential group temporarily When run without ad data calculating task, then YARN arrives the part ad data calculating task operation resource allocation in preferential group Other groups help other group operation ad data calculating tasks, once there are the operation of ad data calculating task, recycling in preferential group The operation resource of other groups is assigned to preferential group.
Optionally, the task creation server receives newly-built ad data calculating task by front end page, will connect The ad data calculating task received is saved in task library;
The task creation server also receives the instruction of modification ad data calculating task by front end page, according to this Instruct operation of modifying to the respective advertisement data calculating task in task library.
Optionally, client from task library extract ad data calculating task when, first check task list in task library, according to The high ad data calculating task of task list advantage distillation priority, and the ad data calculating task extracted is sent to point Cloth server cluster;
Wherein, ad data calculating task currently stored in task library and corresponding preservation are saved in task list The precedence informations of multiple ad data calculating tasks.
Optionally, the multiple client includes one or more priority tasks processing clients;
Priority tasks processing client extracts the ad data calculating that assigned priority information is marked from task library Task, and the ad data calculating task extracted is sent to distributed server cluster;
Other clients in addition to priority tasks handle client extract unmarked assigned priority from task library The ad data calculating task of information, and the ad data calculating task extracted is sent to distributed server cluster.
Optionally, distributed server cluster takes office the operating status storage of each ad data calculating task run It is engaged in the ad data calculating task operating status list in library.
Optionally, this method further comprises:
When client extracts an ad data calculating task from task library, whether the ad data calculating task is judged Dependent on other ad data calculating tasks;
If the ad data calculating task is directly carried independent of other ad data calculating tasks from task library It takes the ad data calculating task and is sent to distributed server cluster;
If the ad data calculating task depends on other ad data calculating tasks, by wide in query task library Data calculating task operating status list is accused, judges that other ad data calculating tasks that the ad data calculating task relies on are It is no to be finished;The ad data calculating task is extracted from task library and be sent to distributed server if being finished Cluster;It carries out waiting until that other ad datas calculating that the ad data calculating task relies on is appointed if being not carried out finishing Business extracts the ad data calculating task from task library and is sent to distributed server cluster again when being finished.
Optionally, the task library asking when the request one ad data calculating task of extraction for receiving a client When asking, lock token is added for the ad data calculating task, asks to lift the ad data meter again to avoid other clients Calculation task.
A kind of distributed task management system that the present invention is built includes:Task creation server, task library, distributed clothes Business device cluster and multiple client;Task creation server, suitable for ad data calculating task is saved in task library;Task Library, suitable for storing the ad data calculating task that the task creation server is created;Client, suitable for being extracted from task library Ad data calculating task, and the ad data calculating task extracted is sent to distributed server cluster;Distribution clothes Business device cluster, the ad data calculating task sent suitable for operation client.As it can be seen that the present invention by set multiple client from The drawbacks of ad data calculating task is extracted in task library, leads to systemic breakdown when avoiding single client failure, into One step ensure that the stable operation of system;It ensure that multiple tasks are performed simultaneously simultaneously, further improve the operation speed of system Degree;The ad data calculating task that client is sent is run by setting distributed server cluster, obtains system resource It effective distribution and makes full use of, and then realize system load balancing.
Above description is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can It is clearer and more comprehensible, below the special specific embodiment for lifting the present invention.
Description of the drawings
By reading the detailed description of hereafter preferred embodiment, it is various other the advantages of and benefit it is common for this field Technical staff will become clear.Attached drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.
In the accompanying drawings:
Fig. 1 shows a kind of showing for distributed ad data calculating task management system according to an embodiment of the invention It is intended to;
Fig. 2 shows a kind of showing for distributed ad data calculating task management method according to an embodiment of the invention It is intended to.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although the disclosure is shown in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Completely it is communicated to those skilled in the art.
Fig. 1 shows a kind of showing for distributed ad data calculating task management system according to an embodiment of the invention It is intended to.As shown in Figure 1, the system includes:Task creation server, task library, distributed server cluster and multiple client;
Task creation server, suitable for ad data calculating task is saved in task library;
Task library, suitable for storing the ad data calculating task that the task creation server is created;
Client, suitable for extracting ad data calculating task, and the ad data calculating task that will be extracted from task library It is sent to distributed server cluster;
Distributed server cluster, the ad data calculating task sent suitable for operation client.
To make the solution of the present invention clearer, illustrated with reference to a specific example.It is specific at one In example, first, if the ad data calculating task received includes, " the daily ad margins of calculating " wide pool " ", " calculating " is covered The daily ad margins of ox " " and " the daily ad margins of calculating " Erie " ", then the task creation server connects described The ad data calculating task received is saved in the task library.Then, the task library will " calculatings " wide pool " daily wide Announcement profit ", " the daily ad margins of calculating " Mongolia Ox " " and " the daily ad margins of calculating " Erie " " are stored;In addition, Client ad margins daily by " wide pool " is calculated ", " the daily ad margins of calculating " Mongolia Ox " " and " calculating " Erie " Daily ad margins " are extracted from the task library, and 3 ad data calculating tasks of the extraction are sent to The distributed server cluster.Finally, the distributed server cluster runs the client and sends " calculating " wide pool " Daily ad margins ", " the daily ad margins of calculating " Mongolia Ox " " and " the daily ad margins of calculating " Erie " ".
In one embodiment of the invention, the distributed server cluster includes:Distributed resource manager YARN, Ad data calculating task suitable for being sent to client is scheduled, and is run with being assigned in the respective server in cluster. Such as:The distributed server cluster includes " calculating receiving the ad data calculating task that the client sends " wide pool " daily ad margins ", " the daily ad margins of calculating " Mongolia Ox " " and " the daily ad margins of calculating " Erie " " Task after, by the distributed resource manager YARN in the distributed server cluster to above-mentioned ad data calculating task It is scheduled, and is run in the respective server being assigned in the distributed server cluster, realize effective distribution of resource With make full use of, and promotion system load balancing.
In one embodiment of the invention, in the distributed server cluster, YARN is being collected using container DOCKER It is the different running environment needed for different ad data calculating task structure operations on server in group.Such as:It is distributed The ad data calculating task that server cluster receives includes " the daily ad margins of calculating " wide pool " ", " calculating " Mongolia Ox " Daily ad margins " and " the daily ad margins of calculating " Erie " ", the distributed resource manager YARN utilize container The ad data calculating tasks of the DOCKER in the cluster on server to receive builds different needed for operation respectively Running environment realizes and different environment is isolated, to play the role of resource isolation, and then promote source effective distribution and It makes full use of, realizes system load balancing.
In one embodiment of the invention, the distributed resource manager YARN, suitable for by the advertisement number in cluster It is divided into according to calculating task operation resource multigroup, one of which is preferential group, the correspondence markings advertisement number of assigned priority information According to calculating task;When receiving the ad data calculating task that assigned priority information is marked, it is assigned to preferential group and carries out Operation;If run in preferential group temporarily without ad data calculating task, the part ad data in preferential group is calculated and is appointed Business operation resource allocation helps other group operation ad data calculating tasks to other groups, once there is ad data meter in preferential group Task run is calculated, recycling is assigned to the operation resource of other groups to preferential group.
Such as:Ad data calculating task operation resource in cluster is divided into more by the distributed resource manager YARN Group, and one of which is determined as preferential group, while the correspondence markings ad data calculating task of assigned priority information, only Receive the ad data calculating task that assigned priority information is marked, then first by the ad data calculating task It is assigned in this preferential group and runs.It should be noted that the operation resource preferentially organized is most, processing speed is most fast.Such as: The ad data calculating task that the YARN is received is including " the daily ad margins of calculating " wide pool " ", " calculating " Mongolia Ox " is daily Ad margins " and " the daily ad margins of calculating " Erie " ", the distributed resource manager YARN has found wherein " to calculate " Erie " daily ad margins " are the ad data calculating task that assigned priority information is marked, then the YARN will " the daily ad margins of calculating " Erie " ", which are assigned in described preferential group, to be run;If temporarily without advertisement number in described preferential group When being run according to calculating task, then the part ad data calculating task in described preferential group is run resource allocation by the YARN This two groups of operations are helped to " the daily ad margins of calculating " wide pool " " and " the daily ad margins of calculating " Mongolia Ox " " task groups Ad data calculating task, once there is the operation of ad data calculating task in described preferential group, recycling is assigned to described " calculate The operation resource of " wide pool " daily ad margins " and " the daily ad margins of calculating " Mongolia Ox " " task groups to it is described preferentially Group.
In one embodiment of the invention, the task creation server, suitable for receiving what is created by front end page Ad data calculating task will receive ad data calculating task and be saved in task library;And it is further adapted for through preceding end page Face receive modification ad data calculating task instruction, according to the instruction to the respective advertisement data calculating task in task library into Row modification operation.
Such as:Front end page is set, is convenient to the ad data that the task creation server real-time reception user creates Calculating task, the ad data calculating task that the user creates include " the daily ad margins of calculating " wide pool " ", " calculate " Mongolia Ox " daily ad margins " and " the daily ad margins of calculating " Erie " ", while the task creation server will connect The ad data calculating task that the user received creates is saved in real time in the task library.And user can also pass through Front end page modifies to the ad data calculating task, described in the task creation server is received by front end page User changes the instruction of ad data calculating task, according to the instruction to the respective advertisement data calculating task in the task library It modifies operation, greatly improves flexibility and the practicability of system.
In one embodiment of the invention, client, suitable for from task library extract ad data calculating task when, first look into See task list in task library, according to the high ad data calculating task of task list advantage distillation priority, and it is wide by what is extracted It accuses data calculating task and is sent to distributed server cluster;
Wherein, ad data calculating task currently stored in task library and corresponding preservation are saved in task list The precedence information of each ad data calculating task.
For example, the currently stored ad data calculating task list of user is equipped in the task library, and described wide It accuses each ad data calculating task in data calculating task list and distinguishes correspondence markings precedence information;It is if described wide Data calculating task list is accused to include:
Ad data calculating task 1:" the daily ad margins of calculating " wide pool " ", priority is minimum;
Ad data calculating task 2:" the daily ad margins of calculating " Mongolia Ox " ", highest priority;
Ad data calculating task 3:" the daily ad margins of calculating " Erie " ", during priority is;
So when client extracts ad data calculating task from the task library, first check described in the task library Ad data calculating task table is calculated according to the high ad data of the ad data calculating task table advantage distillation priority and is appointed Business 2, and the ad data calculating task 2 extracted is sent to distributed server cluster.
In one embodiment of the invention, the multiple client includes one or more priority tasks processing visitors Family end;
Priority tasks handle client, suitable for extracting the ad data that assigned priority information is marked from task library Calculating task, and the ad data calculating task extracted is sent to distributed server cluster;
Other clients in addition to priority tasks handle client, it is unmarked specified excellent suitable for being extracted from task library The ad data calculating task of first grade information, and the ad data calculating task extracted is sent to distributed server collection Group further ensures effectively distributing and make full use of, and then promotion system load balancing for system resource.
Such as:As shown in Figure 1, N number of client includes one or M priority tasks processing client, need Illustrate, if the ad data calculating task list of the task library includes:
Ad data calculating task 1:" the daily ad margins of calculating " wide pool " ", no priority;
Ad data calculating task 2:" the daily ad margins of calculating " Mongolia Ox " ", priority;
Client 1 handles client for the priority tasks, and client 2 handles client for no priority tasks;That Client 1 extracts ad data calculating task 2 from above-mentioned task, and ad data calculating task 2 is sent to described point Cloth server cluster;Client 2 extracts ad data calculating task 1 from above-mentioned task, and by ad data calculating task 1 It is sent to the distributed server cluster.
In one embodiment of the invention, distributed server cluster, suitable for each ad data run is calculated The operating status storage of task is in the task run status list in task library, convenient for the fault condition of real-time monitoring system, It ensure that the normal operation of system, further promote effectively distributing and make full use of for resource.
Such as the task run status list includes:
Ad data calculating task 1:" the daily ad margins of calculating " wide pool " ", operation is completed;
Ad data calculating task 2:" the daily ad margins of calculating " Mongolia Ox " " wait for operation;
Ad data calculating task 3:" the daily ad margins of calculating " Erie " ", operation troubles
Further, the client when being further adapted for extracting an ad data calculating task from task library, is sentenced Whether the ad data calculating task of breaking depends on other ad data calculating tasks;If the ad data calculating task is disobeyed Rely in other ad data calculating tasks, then the ad data calculating task is directly extracted from task library and be sent to distribution Server cluster;If the ad data calculating task depends on other ad data calculating tasks, by query task library Ad data calculating task operating status list, judge the ad data calculating task rely on other ad datas calculate appoint Whether business is finished;The ad data calculating task is extracted from task library if being finished and is sent to distributed clothes Business device cluster;It carries out waiting until other ad data meters that the ad data calculating task relies on if being not carried out finishing It calculates when tasks carrying finishes and extracts the ad data calculating task from task library again and be sent to distributed server cluster, into One step ensure that effectively distributing and make full use of, and then promotion system load balancing for system resource.
Such as:When client extracts an ad data calculating task including " calculating " wide pool " is every from the task library During it ad margins ", judge whether " the daily ad margins of calculating " wide pool " " depend on other tasks;
" if calculating " wide pool " daily ad margins " are independent of other ad data calculating tasks, then directly from " the daily ad margins of calculating " wide pool " " are extracted in the task library and are sent to distributed server cluster;
If " the daily ad margins of calculating " wide pool " " include dependent on other ad data calculating tasks, " calculating is " wide The daily advertising income in pool " " by the ad data calculating task operating status list in query task library, judges " to calculate Whether " wide pool " daily advertising income " is finished.If the ad data calculating task operating status list includes:
Ad data calculating task 1:" the daily advertising income of calculating " wide pool " ", operation is completed;
Ad data calculating task 2:" the daily advertising income of calculating " Mongolia Ox " " waits for operation;
Ad data calculating task 3:" the daily advertising income of calculating " Erie " ", operation troubles;
So described client extracts " calculating " wide pool " daily ad margins from the task library " and be sent to point Cloth server cluster;
If the ad data calculating task operating status list includes:
Ad data calculating task 1:" the daily advertising income of calculating " wide pool " ", is currently running;
Ad data calculating task 2:" the daily advertising income of calculating " Mongolia Ox " " waits for operation;
Ad data calculating task 3:" the daily advertising income of calculating " Erie " ", operation troubles;
So " the daily ad margins of calculating " wide pool " " are waited until " the daily ad margins of calculating " wide pool " " " calculating " wide pool " is daily for " the daily advertising income of calculating " wide the pool " " extraction in task library described in Shi Zaicong that is finished relied on Ad margins " task and be sent to distributed server cluster.
In one embodiment of the invention, the task library, suitable for when the request extraction one for receiving a client During the request of a ad data calculating task, lock token is added for the ad data calculating task, has avoided other clients It asks to lift the ad data calculating task again, avoids same task and be repeatedly executed, further ensure system resource Effectively distribute and make full use of, and then promotion system load balancing.
Such as:One client sends extraction " the daily ad margins of calculating " wide pool " " task to the task library please It asks, is that " calculating " wide pool " is daily when the task library receives " the daily ad margins of calculating " wide pool " " task requests Ad margins " task adds lock token, asks to lift the ad data calculating task again to avoid other clients, further Repeating for same task is avoided, effectively distributing and make full use of for system resource is further promoted, so as to fulfill being The load balancing of system.
Fig. 2 shows a kind of schematic diagrames of distributed task management method according to an embodiment of the invention.Such as Fig. 2 institutes Show, this method includes:
Ad data calculating task is saved in task library by step S200, task creation server.
Step S210, the ad data calculating task that task creation server described in task library storage is created.
Step S220, client extract ad data calculating task, and the ad data extracted is calculated from task library Task is sent to distributed server cluster.
Step S230, the ad data calculating task that distributed server cluster operation client is sent.
To make the solution of the present invention clearer, illustrated with reference to a specific example.It is specific at one In example, first, if the ad data calculating task received includes, " the daily ad margins of calculating " wide pool " ", " calculating " is covered The daily ad margins of ox " " and " the daily ad margins of calculating " Erie " ", then the task creation server connects described The ad data calculating task received is saved in the task library.Then, the task library will " calculatings " wide pool " daily wide Announcement profit ", " the daily ad margins of calculating " Mongolia Ox " " and " the daily ad margins of calculating " Erie " " are stored;In addition, Client ad margins daily by " wide pool " is calculated ", " the daily ad margins of calculating " Mongolia Ox " " and " calculating " Erie " Daily ad margins " are extracted from the task library, and 3 ad data calculating tasks of the extraction are sent to The distributed server cluster.Finally, the distributed server cluster runs the client and sends " calculating " wide pool " Daily ad margins ", " the daily ad margins of calculating " Mongolia Ox " " and " the daily ad margins of calculating " Erie " ".
In one embodiment of the invention, in step S230, by the distribution in the distributed server cluster Explorer YARN is scheduled the ad data calculating task that client is sent, to be assigned to the respective service in cluster It is run on device.Such as:The distributed server cluster is in the ad data calculating task for receiving the client and sending Including " calculating " wide pool " daily ad margins ", " the daily ad margins of calculatings " Mongolia Ox " " and " calculating " Erie " is daily Ad margins " calculate above-mentioned ad data by the distributed resource manager YARN in the distributed server cluster Task is scheduled, and is run in the respective server being assigned in the distributed server cluster, realizes the effective of resource It distributes and makes full use of, and promotion system load balancing.
Further, the distributed resource manager YARN is using being different on container DOCKER in the cluster server Ad data calculating task structure operation needed for different running environment.Such as:Distributed server cluster receives Ad data calculating task include " calculatings " wide damp " daily ad margins ", " the daily ad margins of calculating " Mongolia Ox " " " the daily ad margins of calculating " Erie " ", the distributed resource manager YARN is taken in the cluster using container DOCKER The ad data calculating task on business device to receive builds the different running environment needed for operation respectively, and realizing will Different environment are isolated, and to play the role of resource isolation, and then are promoted effectively distributing and make full use of for source, are realized system System load balancing.
In one embodiment of the invention, the distributed resource manager YARN calculates the ad data in cluster Task run resource is divided into multigroup, and one of which is preferential group, and the ad data of correspondence markings assigned priority information calculates Task;YARN is assigned to preferential group and is transported when receiving the ad data calculating task that assigned priority information is marked Row;If run in preferential group temporarily without ad data calculating task, YARN calculates the part ad data in preferential group Task run resource allocation helps other group operation ad data calculating tasks to other groups, once there is ad data in preferential group Calculating task is run, and recycling is assigned to the operation resource of other groups to preferential group, further ensures effectively dividing for system resource Match and make full use of, to play the role of promotion system load balancing.
Such as:The distributed resource manager YARN task run resource in cluster is divided into it is multigroup, and will wherein One group be determined as preferential group, while the correspondence markings ad data calculating task of assigned priority information, as long as receiving mark The ad data calculating task of assigned priority information is remembered, then first that the ad data distribution of computation tasks is excellent to this First run in group.It should be noted that the operation resource preferentially organized is most, processing speed is most fast.Such as:The YARN is received The ad data calculating task arrived includes " the daily ad margins of calculating " wide pool " ", " the daily ad margins of calculating " Mongolia Ox " " " the daily ad margins of calculating " Erie " ", the distributed resource manager YARN have found that wherein " calculating " Erie " is daily Ad margins " be that the ad data calculating task of assigned priority information is marked, then the YARN is by " calculating " Erie " Daily ad margins " are assigned in described preferential group and run;If temporarily without ad data calculating task in described preferential group During operation, then the YARN is by the partial task operation resource allocation in described preferential group to " calculatings " wide damp " daily wide Announcement profit " and " the daily ad margins of calculating " Mongolia Ox " " task groups help this two groups of operation ad data calculating tasks, once Have the operation of ad data calculating task in described preferential group, recycling be assigned to " the daily ad margins of calculatings " wide damp " " and The operation resource of " the daily ad margins of calculating " Mongolia Ox " " task groups is to described preferential group.
In one embodiment of the invention, the task creation server receives newly-built advertisement number by front end page According to calculating task, the ad data calculating task received is saved in task library;
The task creation server also receives the instruction of modification ad data calculating task by front end page, according to this Instruct operation of modifying to the respective advertisement data calculating task in task library.
Such as:Front end page is set, is convenient to the ad data that the task creation server real-time reception user creates Calculating task, the ad data calculating task that the user creates include " the daily ad margins of calculating " wide pool " ", " calculate " Mongolia Ox " daily ad margins " and " the daily ad margins of calculating " Erie " ", while the task creation server will connect The ad data calculating task that the user received creates is saved in real time in the task library.And user can also pass through Front end page modifies to the ad data calculating task, described in the task creation server is received by front end page User changes the instruction of ad data calculating task, according to the instruction to the respective advertisement data calculating task in the task library It modifies operation, greatly improves flexibility and the practicability of system.
In one embodiment of the invention, client from task library extract ad data calculating task when, first check appoint It is engaged in task list in library, according to the high ad data calculating task of task list advantage distillation priority, and the advertisement number that will be extracted Distributed server cluster is sent to according to calculating task;
Wherein, ad data calculating task currently stored in task library and corresponding preservation are saved in task list The precedence informations of multiple ad data calculating tasks further ensures effectively distributing and make full use of for system resource, And then promotion system load balancing.
For example, the currently stored ad data calculating task list of user is equipped in the task library, and described wide It accuses each ad data calculating task in data calculating task list and distinguishes correspondence markings precedence information;It is if described wide Data calculating task list is accused to include:
Ad data calculating task 1:" the daily ad margins of calculating " wide pool " ", priority is minimum;
Ad data calculating task 2:" the daily ad margins of calculating " Mongolia Ox " ", highest priority;
Ad data calculating task 3:" the daily ad margins of calculating " Erie " ", during priority is;
So when client extracts ad data calculating task from the task library, first check described in the task library Ad data calculating task table is calculated according to the high ad data of the ad data calculating task table advantage distillation priority and is appointed Business 2, and the ad data calculating task 2 extracted is sent to distributed server cluster.
In one embodiment of the invention, the multiple client includes one or more priority tasks processing visitors Family end;
Priority tasks processing client extracts the ad data calculating that assigned priority information is marked from task library Task, and the ad data calculating task extracted is sent to distributed server cluster;
Other clients in addition to priority tasks handle client extract unmarked assigned priority from task library The ad data calculating task of information, and the ad data calculating task extracted is sent to distributed server cluster, into One step ensure that effectively distributing and make full use of, and then promotion system load balancing for system resource.
Such as:As shown in Figure 1, N number of client includes one or M priority tasks processing client, need Illustrate, if the ad data calculating task list of the task library includes:
Ad data calculating task 1:" the daily ad margins of calculating " wide pool " ", no priority;
Ad data calculating task 2:" the daily ad margins of calculating " Mongolia Ox " ", priority;
Client 1 handles client for the priority tasks, and client 2 handles client for no priority tasks;That Client 1 extracts ad data calculating task 2 from above-mentioned task, and ad data calculating task 2 is sent to described point Cloth server cluster;Client 2 extracts ad data calculating task 1 from above-mentioned task, and by ad data calculating task 1 It is sent to the distributed server cluster.
In one embodiment of the invention, distributed server cluster is by each ad data calculating task run In operating status storage to the ad data calculating task operating status list in task library, convenient for the failure of real-time monitoring system Situation ensure that the normal operation of system, further promote effectively distributing and make full use of for resource.
Such as the task run status list includes:
Ad data calculating task 1:" the daily ad margins of calculating " wide pool " ", operation is completed;
Ad data calculating task 2:" the daily ad margins of calculating " Mongolia Ox " " wait for operation;
Ad data calculating task 3:" the daily ad margins of calculating " Erie " ", operation troubles
Further, this method further comprises:
When client extracts an ad data calculating task from task library, whether the ad data calculating task is judged Dependent on other ad data calculating tasks;
If the ad data calculating task is directly carried independent of other ad data calculating tasks from task library It takes the ad data calculating task and is sent to distributed server cluster;
If the ad data calculating task depends on other ad data calculating tasks, by wide in query task library Data calculating task operating status list is accused, judges that other ad data calculating tasks that the ad data calculating task relies on are It is no to be finished;The ad data calculating task is extracted from task library and be sent to distributed server if being finished Cluster;It carries out waiting until that other ad datas calculating that the ad data calculating task relies on is appointed if being not carried out finishing Business extracts the ad data calculating task from task library and is sent to distributed server cluster again when being finished, further It ensure that effectively distributing and make full use of, and then promotion system load balancing for system resource.
Such as:When client extracts an ad data calculating task including " calculating " wide pool " is every from the task library During it ad margins ", judge whether " the daily ad margins of calculating " wide pool " " depend on other tasks;
" if calculating " wide pool " daily ad margins " are independent of other ad data calculating tasks, then directly from " the daily ad margins of calculating " wide pool " " are extracted in the task library and are sent to distributed server cluster;
If " the daily ad margins of calculating " wide pool " " include dependent on other ad data calculating tasks, " calculating is " wide The daily advertising income in pool " " by the ad data calculating task operating status list in query task library, judges " to calculate Whether " wide pool " daily advertising income " is finished.If the ad data calculating task operating status list includes:
Ad data calculating task 1:" the daily advertising income of calculating " wide pool " ", operation is completed;
Ad data calculating task 2:" the daily advertising income of calculating " Mongolia Ox " " waits for operation;
Ad data calculating task 3:" the daily advertising income of calculating " Erie " ", operation troubles;
So described client extracts " calculating " wide pool " daily ad margins from the task library " and be sent to point Cloth server cluster;
If the ad data calculating task operating status list includes:
Ad data calculating task 1:" the daily advertising income of calculating " wide pool " ", is currently running;
Ad data calculating task 2:" the daily advertising income of calculating " Mongolia Ox " " waits for operation;
Ad data calculating task 3:" the daily advertising income of calculating " Erie " ", operation troubles;
So " the daily ad margins of calculating " wide pool " " are waited until " the daily ad margins of calculating " wide pool " " " calculating " wide pool " is daily for " the daily advertising income of calculating " wide the pool " " extraction in task library described in Shi Zaicong that is finished relied on Ad margins " task and be sent to distributed server cluster.
In one embodiment of the invention, the task library is when request one advertisement of extraction for receiving a client During the request of data calculating task, lock token is added for the ad data calculating task, is asked again to avoid other clients Lift the ad data calculating task, avoid same task and be repeatedly executed, further ensure effectively dividing for system resource Match and make full use of, and then promotion system load balancing.
Such as:One client sends extraction " the daily ad margins of calculating " wide pool " " task to the task library please It asks, is that " calculating " wide pool " is daily when the task library receives " the daily ad margins of calculating " wide pool " " task requests Ad margins " task adds lock token, asks to lift the ad data calculating task again to avoid other clients, further Repeating for same task is avoided, further promotes effectively distributing and make full use of for system resource;So as to fulfill being The load balancing of system.
All in all, a kind of distributed ad data calculating task management system that the present invention is built includes:Task creation Server, task library, distributed server cluster and multiple client;Task creation server, suitable for ad data is calculated Task is saved in task library;Task library, suitable for storing the ad data calculating task that the task creation server is created; Client suitable for extracting ad data calculating task from task library, and the ad data calculating task extracted is sent to point Cloth server cluster;Distributed server cluster, the ad data calculating task sent suitable for operation client.As it can be seen that this Invention avoids single client failure by the way that multiple client is set to extract ad data calculating task from task library When the drawbacks of leading to systemic breakdown, further ensure the stable operation of system;It ensure that multiple tasks are performed simultaneously simultaneously, into One step improves the speed of service of system;The ad data meter that client is sent is run by setting distributed server cluster Calculation task makes system resource obtain effective distribution and make full use of, and then realize system load balancing.
It should be noted that:
Algorithm and display be not inherently related to any certain computer, virtual bench or miscellaneous equipment provided herein. Various fexible units can also be used together with teaching based on this.As described above, required by constructing this kind of device Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It should be understood that it can utilize various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the specification provided in this place, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice without these specific details.In some instances, well known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of each inventive aspect, Above in the description of exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor Shield the present invention claims the more features of feature than being expressly recited in each claim.More precisely, as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim is in itself Separate embodiments all as the present invention.
Those skilled in the art, which are appreciated that, to carry out adaptively the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.It can be the module or list in embodiment Member or component be combined into a module or unit or component and can be divided into addition multiple submodule or subelement or Sub-component.Other than such feature and/or at least some of process or unit exclude each other, it may be used any Combination is disclosed to all features disclosed in this specification (including adjoint claim, abstract and attached drawing) and so to appoint Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification is (including adjoint power Profit requirement, abstract and attached drawing) disclosed in each feature can be by providing the alternative features of identical, equivalent or similar purpose come generation It replaces.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments means in of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed One of meaning mode can use in any combination.
The all parts embodiment of the present invention can be with hardware realization or to be run on one or more processor Software module realize or realized with combination thereof.It will be understood by those of skill in the art that it can use in practice Microprocessor or digital signal processor (DSP) realize distributed ad data calculating task according to embodiments of the present invention The some or all functions of some or all components in management system.The present invention is also implemented as performing here The some or all equipment or program of device of described method are (for example, computer program and computer program production Product).Such program for realizing the present invention can may be stored on the computer-readable medium or can have one or more The form of signal.Such signal can be downloaded from internet website to be obtained either providing or to appoint on carrier signal What other forms provides.
It should be noted that the present invention will be described rather than limits the invention, and ability for above-described embodiment Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference mark between bracket should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of several different elements and being come by means of properly programmed computer real It is existing.If in the unit claim for listing equipment for drying, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any sequence.These words can be explained and run after fame Claim.
The invention discloses A1, a kind of distributed ad data calculating task management system, wherein, which calculates Task management system includes:Task creation server, task library, distributed server cluster and multiple client;
Task creation server, suitable for ad data calculating task is saved in task library;
Task library, suitable for storing the ad data calculating task that the task creation server is created;
Client, suitable for extracting ad data calculating task, and the ad data calculating task that will be extracted from task library It is sent to distributed server cluster;
Distributed server cluster, the ad data calculating task sent suitable for operation client.
A2, the method as described in claim A1, wherein, the distributed server cluster includes:Distributed resource pipe Device YARN is managed, the ad data calculating task suitable for being sent to client is scheduled, to be assigned to the respective service in cluster It is run on device.
A3, the system as described in claim A2, wherein,
In the distributed server cluster, YARN is using being different advertisements on container DOCKER in the cluster server Different running environment needed for the structure operation of data calculating task.
A4, the system as described in claim A1, wherein,
The distributed resource manager YARN is more suitable for the ad data calculating task operation resource in cluster is divided into Group, one of which are preferential group, the correspondence markings ad data calculating task of assigned priority information;It is marked when receiving During the ad data calculating task of assigned priority information, it is assigned to preferential group and is run;If temporarily without wide in preferential group The operation of data calculating task is accused, then the part ad data calculating task operation resource allocation in preferential group is organized into help to other Other group operation ad data calculating tasks, once there is the operation of ad data calculating task in preferential group, recycling is assigned to other The operation resource of group is to preferential group.
A5, the system as described in claim A1, wherein,
The task creation server suitable for receiving newly-built ad data calculating task by front end page, will receive It is saved in task library to ad data calculating task;And it is further adapted for receiving modification ad data by front end page and calculating appointing The instruction of business modifies to the respective advertisement data calculating task in task library operation according to the instruction.
A6, the system as described in claim A1, wherein,
Client, suitable for from task library extract ad data calculating task when, first check task list in task library, according to appoint The high ad data calculating task of table advantage distillation priority of being engaged in, and the ad data calculating task extracted is sent to distribution Formula server cluster;
Wherein, ad data calculating task currently stored in task library and corresponding preservation are saved in task list The precedence information of each ad data calculating task.
A7, the system as described in claim A1, wherein, the multiple client includes one or more preferential in charge of a grade Business processing client;
Priority tasks handle client, suitable for extracting the ad data that assigned priority information is marked from task library Calculating task, and the ad data calculating task extracted is sent to distributed server cluster;
Other clients in addition to priority tasks handle client, it is unmarked specified excellent suitable for being extracted from task library The ad data calculating task of first grade information, and the ad data calculating task extracted is sent to distributed server collection Group.
A8, the system as described in claim A1, wherein,
Distributed server cluster, suitable for storing the operating status of each ad data calculating task run to task In ad data calculating task operating status list in library.
A9, the system as described in claim A8, wherein,
Client when being further adapted for extracting an ad data calculating task from task library, judges the ad data Whether calculating task depends on other ad data calculating tasks;If the ad data calculating task is independent of other advertisements Data calculating task then directly extracts the ad data calculating task from task library and is sent to distributed server cluster; If the ad data calculating task depends on other ad data calculating tasks, pass through the ad data meter in query task library Task run status list is calculated, judges whether other ad data calculating tasks that the ad data calculating task relies on have performed Finish;The ad data calculating task is extracted from task library and be sent to distributed server cluster if being finished;Such as Fruit is not carried out finishing, and carries out waiting until that other ad data calculating tasks that the ad data calculating task relies on have performed Bi Shizai extracts the ad data calculating task from task library and is sent to distributed server cluster.
A10, the system as described in claim A1, wherein,
The task library, suitable for when the request for the request one ad data calculating task of extraction for receiving a client When, lock token is added for the ad data calculating task, other clients has been avoided to ask to lift ad data calculating again Task.
The invention also discloses B11, a kind of distributed ad data calculating task management method, wherein,
Ad data calculating task is saved in task library by task creation server;
The ad data calculating task that task creation server described in task library storage is created;
Client extracts ad data calculating task from task library, and the ad data calculating task extracted is sent to Distributed server cluster;
The ad data calculating task that distributed server cluster operation client is sent.
B12, the method as described in claim B11, wherein,
The ad data sent by the distributed resource manager YARN in the distributed server cluster to client Calculating task is scheduled, and is run with being assigned in the respective server in cluster.
B13, the method as described in claim B12, wherein,
The distributed resource manager YARN is using being different advertisement numbers on container DOCKER in the cluster server According to the different running environment needed for calculating task structure operation.
B14, the method as described in claim B11, wherein,
The distributed resource manager YARN by cluster ad data calculating task operation resource be divided into it is multigroup, In one group be preferential group, the correspondence markings ad data calculating task of assigned priority information;YARN, which works as to receive, to be marked During the ad data calculating task of assigned priority information, it is assigned to preferential group and is run;If temporarily without wide in preferential group The operation of data calculating task is accused, then the part ad data calculating task in preferential group is run resource allocation to other groups by YARN Other group operation ad data calculating tasks are helped, once there is the operation of ad data calculating task in preferential group, recycling is assigned to The operation resource of other groups is to preferential group.
B15, the method as described in claim B11, wherein,
The task creation server receives newly-built ad data calculating task by front end page, wide by what is received Data calculating task is accused to be saved in task library;
The task creation server also receives the instruction of modification ad data calculating task by front end page, according to this Instruct operation of modifying to the respective advertisement data calculating task in task library.
B16, the method as described in claim B11, wherein,
Client from task library extract ad data calculating task when, task list in task library is first checked, according to task list The high ad data calculating task of advantage distillation priority, and the ad data calculating task extracted is sent to distributed clothes Business device cluster;
Wherein, ad data calculating task currently stored in task library and corresponding preservation are saved in task list The precedence informations of multiple ad data calculating tasks.
B17, the method as described in claim B11, wherein, the multiple client includes one or more priority Task handles client;
Priority tasks processing client extracts the ad data calculating that assigned priority information is marked from task library Task, and the ad data calculating task extracted is sent to distributed server cluster;
Other clients in addition to priority tasks handle client extract unmarked assigned priority from task library The ad data calculating task of information, and the ad data calculating task extracted is sent to distributed server cluster.
B18, the method as described in claim B11, wherein,
Distributed server cluster will be in the operating status storage to task library for each ad data calculating task that run Ad data calculating task operating status list in.
B19, the method as described in claim B18, wherein, this method further comprises:
When client extracts an ad data calculating task from task library, whether the ad data calculating task is judged Dependent on other ad data calculating tasks;
If the ad data calculating task is directly carried independent of other ad data calculating tasks from task library It takes the ad data calculating task and is sent to distributed server cluster;
If the ad data calculating task depends on other ad data calculating tasks, by wide in query task library Data calculating task operating status list is accused, judges that other ad data calculating tasks that the ad data calculating task relies on are It is no to be finished;The ad data calculating task is extracted from task library and be sent to distributed server if being finished Cluster;It carries out waiting until that other ad datas calculating that the ad data calculating task relies on is appointed if being not carried out finishing Business extracts the ad data calculating task from task library and is sent to distributed server cluster again when being finished.
B20, the method as described in claim B11, wherein,
The task library is when the request of ad data calculating task is extracted in the request for receiving client The ad data calculating task adds lock token, asks to lift the ad data calculating task again to avoid other clients.

Claims (10)

1. a kind of distribution ad data calculating task management system, wherein, ad data calculating task management system includes: Task creation server, task library, distributed server cluster and multiple client;
Task creation server, suitable for ad data calculating task is saved in task library;
Task library, suitable for storing the ad data calculating task that the task creation server is created;
Client suitable for extracting ad data calculating task from task library, and the ad data calculating task extracted is sent To distributed server cluster;
Distributed server cluster, the ad data calculating task sent suitable for operation client.
2. the method for claim 1, wherein the distributed server cluster includes:Distributed resource manager YARN, the ad data calculating task suitable for being sent to client are scheduled, to be assigned in the respective server in cluster Operation.
3. system as claimed in claim 2, wherein,
In the distributed server cluster, YARN is using being different ad datas on container DOCKER in the cluster server Different running environment needed for calculating task structure operation.
4. the system as claimed in claim 1, wherein,
The distributed resource manager YARN, it is multigroup suitable for the ad data calculating task operation resource in cluster is divided into, One of which is preferential group, the correspondence markings ad data calculating task of assigned priority information;Finger is marked when receiving When determining the ad data calculating task of precedence information, it is assigned to preferential group and is run;If temporarily without advertisement in preferential group Data calculating task is run, then the part ad data calculating task in preferential group is run resource allocation helps it to other groups He organizes operation ad data calculating task, once there is the operation of ad data calculating task in preferential group, recycling is assigned to other groups Operation resource to preferential group.
5. the system as claimed in claim 1, wherein,
The task creation server suitable for receiving newly-built ad data calculating task by front end page, will receive wide Data calculating task is accused to be saved in task library;And it is further adapted for receiving modification ad data calculating task by front end page Instruction modifies to the respective advertisement data calculating task in task library operation according to the instruction.
6. a kind of distribution ad data calculating task management method, wherein,
Ad data calculating task is saved in task library by task creation server;
The ad data calculating task that task creation server described in task library storage is created;
Client extracts ad data calculating task from task library, and the ad data calculating task extracted is sent to distribution Formula server cluster;
The ad data calculating task that distributed server cluster operation client is sent.
7. method as claimed in claim 6, wherein,
The ad data that client is sent is calculated by the distributed resource manager YARN in the distributed server cluster Task is scheduled, and is run with being assigned in the respective server in cluster.
8. the method for claim 7, wherein,
The distributed resource manager YARN is using being different ad data meters on container DOCKER in the cluster server Different running environment needed for the structure operation of calculation task.
9. method as claimed in claim 6, wherein,
The distributed resource manager YARN by cluster ad data calculating task operation resource be divided into it is multigroup, wherein one Group is preferential group, the correspondence markings ad data calculating task of assigned priority information;YARN when receive be marked it is specified During the ad data calculating task of precedence information, it is assigned to preferential group and is run;If temporarily without advertisement number in preferential group It is run according to calculating task, then the part ad data calculating task operation resource allocation in preferential group is organized help by YARN to other Other group operation ad data calculating tasks, once there is the operation of ad data calculating task in preferential group, recycling is assigned to other The operation resource of group is to preferential group.
10. method as claimed in claim 6, wherein,
The task creation server receives newly-built ad data calculating task, the advertisement number that will be received by front end page It is saved in task library according to calculating task;
The task creation server also receives the instruction of modification ad data calculating task by front end page, according to the instruction It modifies to the respective advertisement data calculating task in task library operation.
CN201611188392.4A 2016-12-20 2016-12-20 A kind of distribution ad data calculating task management system and method Pending CN108205470A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611188392.4A CN108205470A (en) 2016-12-20 2016-12-20 A kind of distribution ad data calculating task management system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611188392.4A CN108205470A (en) 2016-12-20 2016-12-20 A kind of distribution ad data calculating task management system and method

Publications (1)

Publication Number Publication Date
CN108205470A true CN108205470A (en) 2018-06-26

Family

ID=62603230

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611188392.4A Pending CN108205470A (en) 2016-12-20 2016-12-20 A kind of distribution ad data calculating task management system and method

Country Status (1)

Country Link
CN (1) CN108205470A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111338790A (en) * 2020-02-12 2020-06-26 中山大学 High-throughput computing task management method and system
CN117076555A (en) * 2023-05-08 2023-11-17 芜湖本初子午信息技术有限公司 Distributed task management system and method based on calculation

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915254A (en) * 2011-08-02 2013-02-06 中兴通讯股份有限公司 Task management method and device
KR20130074227A (en) * 2011-12-26 2013-07-04 텔코웨어 주식회사 Deistributed data management system and method thereof
CN104468638A (en) * 2013-09-12 2015-03-25 北大方正集团有限公司 Distributed data processing method and system
CN104657214A (en) * 2015-03-13 2015-05-27 华存数据信息技术有限公司 Multi-queue multi-priority big data task management system and method for achieving big data task management by utilizing system
CN104794003A (en) * 2015-02-04 2015-07-22 汉鼎信息科技股份有限公司 Large data analysis system integrating real-time mode and non-real-time mode

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915254A (en) * 2011-08-02 2013-02-06 中兴通讯股份有限公司 Task management method and device
KR20130074227A (en) * 2011-12-26 2013-07-04 텔코웨어 주식회사 Deistributed data management system and method thereof
CN104468638A (en) * 2013-09-12 2015-03-25 北大方正集团有限公司 Distributed data processing method and system
CN104794003A (en) * 2015-02-04 2015-07-22 汉鼎信息科技股份有限公司 Large data analysis system integrating real-time mode and non-real-time mode
CN104657214A (en) * 2015-03-13 2015-05-27 华存数据信息技术有限公司 Multi-queue multi-priority big data task management system and method for achieving big data task management by utilizing system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111338790A (en) * 2020-02-12 2020-06-26 中山大学 High-throughput computing task management method and system
CN111338790B (en) * 2020-02-12 2023-07-04 中山大学 High-throughput computing task management method and system
CN117076555A (en) * 2023-05-08 2023-11-17 芜湖本初子午信息技术有限公司 Distributed task management system and method based on calculation
CN117076555B (en) * 2023-05-08 2024-03-22 深圳市优友网络科技有限公司 Distributed task management system and method based on calculation

Similar Documents

Publication Publication Date Title
CN108182111A (en) Task scheduling system, method and apparatus
CN104834561B (en) A kind of data processing method and device
CN102426602B (en) Scoped database connections
CN104915285B (en) A kind of container process monitoring method, apparatus and system
CN106888254A (en) A kind of exchange method between container cloud framework based on Kubernetes and its each module
CN109729106B (en) Method, system and computer program product for processing computing tasks
CN110325968A (en) System upgrade management in distributed computing system
CN104461239B (en) A kind of information interacting method and device
CN106529682A (en) Method and apparatus for processing deep learning task in big-data cluster
CN107315627A (en) A kind of method and apparatus of automatic configuration data warehouse parallel task queue
US9836330B2 (en) Virtual resource management tool for cloud computing service
CN110532059B (en) Quota management method and device for K8s cluster management software
CN108829469A (en) A kind of application program page methods of exhibiting and device
CN106453501A (en) Method and apparatus for modifying configuration information of service
CN107370796A (en) A kind of intelligent learning system based on Hyper TF
CN109818810A (en) A kind of access server connection optimization method, access server and communication system
CN109343972A (en) Task processing method and terminal device
WO2016077146A1 (en) Application assignment reconciliation and license management
CN108153877A (en) Data dictionary methods of exhibiting, device, terminal device and storage medium
CN108112268A (en) Management and the relevant load balancer of automatic expanded set
CN108205470A (en) A kind of distribution ad data calculating task management system and method
CN109800078B (en) Task processing method, task distribution terminal and task execution terminal
CN107357640A (en) Request processing method and device, the electronic equipment in multi-thread data storehouse
CN110457559A (en) Distributed data crawls system, method and storage medium
CN105975329A (en) Creating method and device of virtual machine

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180626

RJ01 Rejection of invention patent application after publication