CN100473065C

CN100473065C - A network-oriented machine group working management system and realizing method thereof

Info

Publication number: CN100473065C
Application number: CNB2004100294783A
Authority: CN
Inventors: 赵玉萍; 柳书广; 肖利民
Original assignee: Lenovo Beijing Ltd
Current assignee: Lenovo Beijing Ltd
Priority date: 2004-03-19
Filing date: 2004-03-19
Publication date: 2009-03-25
Anticipated expiration: 2024-03-19
Also published as: CN1671135A

Abstract

This invention provides a grid faced cluster operation management system and method for realizing same, which contains the arbitrator maintaining the joint operation needed information and total resource information taken from every grid node server, the neachievediator responses large joint operation without user involving. Said invention fully utilizes various kinds of resources in grid, greatly raises resource availability and completes the operation can not be realized in single computer.

Description

A kind of cluster job management system and its implementation towards grid

Technical field

The present invention relates to grid and Network of Workstation technical field, be meant a kind of cluster job management system and its implementation especially towards grid.

Background technology

In a large enterprise, all there is relatively independent Network of Workstation in each department, the operation of existing single independent Network of Workstation and resource management system generally are made up of four parts, be respectively user command, comprise graphic user interface (GUI) or command line mode, client server (Server), job scheduler (Sched) and operation actuator (Execd).

Figure 1 shows that the operation and the resource management system of existing single Network of Workstation.Submit to node 101 to receive user command, this user command can be divided into three grades of domestic consumer, operator and keepers, all can move by order line or graphic user interface mode, be the interface that user and system manager carry out operation submission, monitoring operation and system management configuration.Client server 102 in the service node is in charge of operation in the group of planes and resource information, the reception various requests from client, is safeguarding the table of node, formation, operation and user right information in the group of planes.Job scheduler 103 in the scheduling node obtains load condition by collect operation and the resource information on all nodes the group of planes from Server from Execd, and according to scheduling strategy, decision moves on which XM.Three functions of operation actuator 104 main realizations in the XM: the one, according to the instruction of Server, running job is set up resource limit, the resource that monitoring results takies, notice client server job run end etc.; The 2nd, resource management function, the monitoring resource request of response Sched is as the running status of operation and free memory etc.; The 3rd, communicate by letter with Execd on other nodes of running job, guarantee the normal load and the end of concurrent job.

Though existing cluster job management system can be realized the resource in this group of planes is managed and dispatches, but development along with society, people find the solution the continuous expansion of problem domain, the problem that is run into becomes increasingly complex, in finding the solution the process of these new problems, local computational resource can't satisfy people's demand.And existing cluster job management system can't be realized job management and resource-sharing between a plurality of group of planes, promptly can't realize management and job scheduling to gridding resource, the management that fulfils assignment of multiple resource in can not joint trellis can not adapt to present grid system.

Having part relevant with submission with job management in the grid now, mainly is information service center (InfoService) and Resource Manager (Resource Manager).Information service center in the grid is mainly finished work such as the discovery of information in the grid computing environment, registration, inquiry, modifications, provides one of grid computing environment true, real-time dynamic reflection.If the various resources in the information spinner grid computing environment of its processing, as data resource and computational resource etc., the description of service and other main body (entity).Each resource in the grid is all managed by corresponding unique grid resource person, and Resource Manager is mainly realized use and the condition monitoring to institute's management resource.

At first need the resource request to information service center submit job needs before the grid user submit job, resource information tabulation of information server central service after receiving that the user asks, is searched the resource information tabulation, returns all available the Resources lists.According to these available the Resources lists, to select employed resource by the user, and selected Resource Manager execution is submitted in operation and a detailed resource requirement table, Resource Manager moves this user job on the resource of being managed.Resource Manager is monitored the resource behaviour in service of grid at any time, and constantly up-to-date resource operating position is notified to information service center, and upgrade in time resource information tabulation of information service center responds the resource request that the operation in the grid proposes at any time.

There are following defective in job management system in the above-mentioned grid and commit method: the using and dispatching user and opaque of resource, the user is consultation information service centre in advance, to obtain all resource informations that satisfy job requirements and to make their own selection, determine promptly which Resource Manager operation is submitted to.And the information resources that information service center is safeguarded tabulation always has the regular hour to lag behind and error with present resource behaviour in service.When this error is big, may cause job run unsuccessful.In addition, under this mode, the user need wait for the resource query result, and needs to participate in resource allocation process, has reduced user's operating efficiency.

How to make existing cluster job management system can support grid system, and in the job management procedure of supporting grid, do not need the user to participate in any scheduling of resource and management work, become one of present research contents.

Summary of the invention

In view of this, the object of the present invention is to provide a kind of cluster job management system and its implementation, make the user not need to participate in any scheduling of resource and management work, only need on grid, get final product by submit job towards grid.

For achieving the above object, technical scheme of the present invention is achieved in that

A kind of cluster job management system towards grid, at least comprise the more than one grid node that constitutes by cluster job management system independently, comprise client server Serverd and operation actuator in this grid node, should comprise also that moderator Arbitor, scheduler Negotiator and storage needed the memory of combined dispatching job queue towards the cluster job management system of grid, wherein

Described moderator, when local resource can not satisfy the job request resource requirement, reception is from the job request information of client server, and the node of the job request resource requirement that receives is satisfied in inquiry, and this job request information is sent to the interior client server of this grid node; When single gridding resource can not be finished this job request information, can not put into the memory that needs the combined dispatching job queue by the job request information that single gridding resource is finished; When scheduler is finished scheduling, and after scheduling result returned to moderator, this scheduling result returned to carry out this requesting users server; When between grid node during laod unbalance, the long client server of control operation formation by routing queue will be in queueing condition job transfer give the client server that is in the idle condition grid node;

Described scheduler obtains from moderator and treats job information, and the job scheduler in the grid node obtains the behaviour in service information of current resource, carries out the global resource scheduling according to predefined algorithm, and scheduling result is returned to moderator.

A kind of application process of system as mentioned above is characterized in that, this method may further comprise the steps:

After a, moderator receive user's job request information, judge whether global resource can satisfy the required resource of this operation that receives, if, execution in step b then, otherwise to the error message and the process ends of user's retrieval system inadequate resource;

B, judge that the resource that whether has in the current global resource in certain grid node can satisfy the required resource of the described operation of step a, if, then the described job request of step a is sent to the client server that can satisfy this operation resource requirement, and process ends; Otherwise the described job request of step a is put into the formation that needs the combined dispatching operation wait for scheduling, and process ends.Preferably, the described judgement of step b is to finish from the aggregate resource information that the client server in each grid node obtains according to moderator.

Preferably, this method further comprises: moderator is the scheduling result of self scheduling device in the future, be transmitted to the client server that can satisfy the operation resource requirement, after this client server receives job request information from Arbitor, the current behaviour in service information of change local resource, and notify this node that can satisfy the operation resource requirement to start and operation.

Preferably, the method that described scheduler carries out scheduling of resource is: scheduler from moderator obtain treat job information after, job scheduler Schedd in all grid nodes on the scheduling node sends the lock resource order, and, carry out scheduling of resource according to predefined algorithm from the user mode information that each Schedd obtains current resource; After scheduling is finished, discharge resource lock, notify each Schedd to change the behaviour in service information of current resource simultaneously, return scheduling result to moderator then according to scheduling result.

Preferably, if the user job solicited message that moderator receives comes from the grid node, then this method further comprises:

Before step a carries out, after the client server in the grid node receives user's job request information, judge at first whether local resource can satisfy the required resource of this operation, if, then the operation that this job request information is put into local maintenance is lined up, otherwise, execution in step a again; Step b is described the described job request of step a is sent to the client server that can satisfy this operation resource requirement after, the client server of this job request information of initiation is informed in the address that moderator will be carried out the server of this operation.

Preferably, this method further comprises: the long Serverd of moderator control operation formation by routing queue will be in queueing condition job transfer give the client server that is in the idle condition grid node.

Use the cluster job management system towards grid provided by the present invention, safeguard the job information need combined dispatching and the aggregate resource information that obtains from the Serverd of each grid node by moderator (Arbitor), be responsible for the combined dispatching of large-scale operation by scheduler (Negotiator), thereby not needing to have realized the user to participate in any scheduling of resource and management work, only need be on grid the purpose of submit job.Use the present invention, for the user, its occupation mode is in full accord with the mode of use job management system in a group of planes, local area network (LAN).The present invention has made full use of the various resources in the grid, has improved the utilance of resource greatly, and can also finish the operation that single computational resource can't be realized.

Description of drawings

Figure 1 shows that the operation and the resource management system of existing single Network of Workstation;

Figure 2 shows that the general structure frame schematic diagram of using the cluster job management system towards grid of the present invention;

Figure 3 shows that use of the present invention in certain grid node submit job give the process chart of moderator;

Figure 4 shows that and use the process chart that the interior direct submit job of client of non-grid node of the present invention is given moderator.

Embodiment

For making technical scheme of the present invention clearer, again the present invention is done detailed description further below in conjunction with accompanying drawing.

Thinking of the present invention is: safeguard the job information need combined dispatching and the aggregate resource information that obtains from the Serverd of each grid node by moderator (Arbitor), be responsible for the combined dispatching of large-scale operation by scheduler (Negotiator), thereby make the user not need to participate in any scheduling of resource and management work, only need get final product by submit job on grid, its occupation mode is in full accord with the mode of use job management system in a group of planes, local area network (LAN).Use the present invention, not only can make full use of the various computational resources in the grid, improve the utilance of resource greatly, and can also finish the operation that single computational resource can't be realized.

Figure 2 shows that the general structure frame schematic diagram of using the cluster job management system towards grid of the present invention.Arbitor 202 is used to safeguard the job information that needs combined dispatching and the aggregate resource information of obtaining from the Serverd of each grid node.After the job request information of moderator reception from client server, can satisfy the grid node of this operation resource requirement according to this job request information inquiry, and send this job request information in the grid node place grid that can satisfy this operation resource requirement Serverd.If the resource of all single grid nodes all can not satisfy the required resource of this job request in the system, then Arbitor puts into the memory 201 that storage needs the combined dispatching job queue with this job request, and promptly all need the operation of combined dispatching all to be placed into needing in the combined dispatching job queue of Arbitor maintenance.In addition, if the job queue in certain grid node is long, and some grid node is in idle condition, then the long Serverd of Arbitor control operation formation by routing queue will be in queueing condition job transfer give the Serverd of idle grid node, thereby between grid node, realize load balance.

If the Serverd in certain grid node is when receiving job request information from the local user, after then all node resources in it judges local grid 204 all can not satisfy the required resource of operation, just this job request information is sent to moderator.

Negotiator 203 is responsible for the combined dispatching of large-scale operation.Negotiator from Arbitor obtain treat job information after, send the lock resource order to the Schedd of each grid node, obtain the information of current resource behaviour in service from each Schedd, dispatch, promptly comprehensively distribute the idling-resource of each grid node according to the reservation algorithm.After scheduling is finished, discharge resource lock, notify each Schedd self current change resource behaviour in service simultaneously, and scheduling result returned Arbitor, scheduling result is passed to the Serverd that can satisfy in the grid of operation resource requirement node place by Arbitor then, the current resource behaviour in service information of this Serverd change self, and be responsible for notifying this can satisfy operation resource requirement node starting and move this operation.Because the physical restriction that Execd can cross over grid node, set up the concurrent job execution environment jointly with the Execd on the XM in involved each grid node of operation, the common concurrent job that loads, and the operation of process combined dispatching is owing to relate to the XM of a plurality of nodes in the grid, therefore, all XM in the grid all need the support of Execd.

Certainly, above-mentionedly in the cluster operation system of grid, also can comprise one or more client, when client has job request, directly job request information be sent to moderator.

Figure 3 shows that use of the present invention in certain grid node submit job give the process chart of moderator.

Step 301, the Serverd of certain grid node receives the job request information from the user in the local grid;

Serverd in the step 302, this grid node judges whether local resource can satisfy this job request, if then execution in step 303, otherwise execution in step 304;

Step 303 is put into the job queue that local Serverd safeguards with this job request, and process ends;

Step 304 is transmitted to Arbitor with this job request information;

Step 305, Arbitor judges according to the job request information that receives whether system's global resource can satisfy the required resource of this operation, if then execution in step 307, otherwise execution in step 306;

Step 306, Arbitor is by the error message of Serverd to user's retrieval system inadequate resource, and process ends;

Step 307, Arbitor judges whether have certain grid node resource can satisfy the required resource of this operation the global system according to the aggregate resource information that the Serverd from each grid node obtains, if then execution in step 309, otherwise execution in step 308;

Step 308 is put into the job queue wait scheduling that needs combined dispatching that Arbitor safeguards with this job request, and process ends;

Step 309 is transmitted to the Serverd that can satisfy in the grid of operation resource requirement node place with this job request, and the client server of this job request information of initiation, process ends are then informed in the address that will carry out the client server of this operation.

Step 401, Arbitor receives the job request information from client in the non-grid node;

Step 402, Arbitor judges according to the job request information that receives whether system's global resource can satisfy the required resource of this operation, if then execution in step 404, otherwise execution in step 403;

Step 403, Arbitor is to the error message of user's retrieval system inadequate resource, and process ends;

Step 404, Arbitor judges whether have certain grid node resource can satisfy the required resource of this operation the global system according to the aggregate resource information that the Serverd from each grid node obtains, if then execution in step 406, otherwise execution in step 405;

Step 405 is put into the job queue wait scheduling that needs combined dispatching that Arbitor safeguards with this job request, and process ends;

Step 406 is transmitted to the Serverd that can satisfy in the grid of operation resource requirement node place, process ends then with this job request.

The method of above-mentioned scheduling of resource is: Negotiator from Arbitor obtain treat job information after, send the lock resource order to the Schedd of all gridding scheduling intranodals, and, carry out scheduling of resource according to predefined algorithm from the user mode that each Schedd obtains current resource; After scheduling is finished, discharge resource lock, notify each Schedd to change the behaviour in service information of current resource simultaneously, return scheduling result to Arbitor then according to scheduling result.

After Arbitor receives scheduling result information from Negotiator, this scheduling result is passed to the client server that can satisfy in the grid of operation resource requirement node place, after this client server receives job request information from Arbitor, the current behaviour in service information of change local resource, and notify this node that can satisfy the operation resource requirement to start and operation.

In addition, if the job queue in certain grid node is long, and some grid node is in idle condition, then the long Serverd of Arbitor control operation formation by routing queue will be in queueing condition job transfer give the Serverd of idle grid node, thereby between grid node, realize load balance.

The above only is preferred embodiment of the present invention, and is in order to restriction the present invention, within the spirit and principles in the present invention not all, any modification of being done, is equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims

1, a kind of cluster job management system towards grid, at least comprise the more than one grid node that constitutes by cluster job management system independently, comprise client server Serverd and operation actuator in this grid node, it is characterized in that, should comprise also that moderator Arbitor, scheduler Negotiator and storage needed the memory of combined dispatching job queue towards the cluster job management system of grid, wherein

Described moderator, when local resource can not satisfy the job request resource requirement, reception is from the job request information of client server, and the node of the job request resource requirement that receives is satisfied in inquiry, and this job request information is sent to the interior client server of this grid node; When single gridding resource can not be finished this job request information, can not put into the memory that needs the combined dispatching job queue by the job request information that single gridding resource is finished; When scheduler is finished scheduling, and after scheduling result returned to moderator, this scheduling result is returned to the client server of carrying out this job request; When between grid node during laod unbalance, the long client server of control operation formation by routing queue will be in queueing condition job transfer give the client server that is in the idle condition grid node;

2, a kind of application process of system according to claim 1 is characterized in that, this method may further comprise the steps:

After a, moderator receive user's job request information, judge whether global resource can satisfy the required resource of this operation, if, execution in step b then, otherwise to the error message and the process ends of user's retrieval system inadequate resource;

B, judge that the resource that whether has in the current global resource in certain grid node can satisfy the required resource of the described operation of step a, if, then the described job request of step a is sent to the client server that can satisfy this operation resource requirement, and process ends; Otherwise the described job request of step a is put into the formation that needs the combined dispatching operation wait for scheduling, and process ends.

3, method according to claim 2 is characterized in that, the described judgement of step b is to finish from the aggregate resource information that the client server in each grid node obtains according to moderator.

4, method according to claim 2, it is characterized in that, this method further comprises: moderator is the scheduling result of self scheduling device in the future, be transmitted to the client server that can satisfy the operation resource requirement, after this client server receives job request information from Arbitor, the current behaviour in service information of change local resource, and notify this node that can satisfy the operation resource requirement to start and operation.

5, method according to claim 4, it is characterized in that, the method of described scheduling is: scheduler from moderator obtain treat job information after, job scheduler Schedd in all grid nodes on the scheduling node sends the lock resource order, and, carry out scheduling of resource according to predefined algorithm from the user mode information that each Schedd obtains current resource; After scheduling is finished, discharge resource lock, notify each Schedd to change the behaviour in service information of current resource simultaneously, return scheduling result to moderator then according to scheduling result.

6, method according to claim 2 is characterized in that, if the user job solicited message that moderator receives comes from the grid node, then this method further comprises:

Before step a carries out, after the client server in the grid node receives user's job request information, judge at first whether local resource can satisfy the required resource of this operation, if, then the operation that this job request information is put into local maintenance is lined up, otherwise, execution in step a again;

Step b is described the described job request of step a is sent to the client server that can satisfy this operation resource requirement after, the client server of this job request information of initiation is informed in the address that moderator will be carried out the client server of this operation.

7, according to claim 2 or 6 described methods, it is characterized in that this method further comprises: the long Serverd of moderator control operation formation by routing queue will be in queueing condition job transfer give the client server that is in the idle condition grid node.