CN1670707A - A method for managing cluster job - Google Patents

A method for managing cluster job Download PDF

Info

Publication number
CN1670707A
CN1670707A CN 200410029483 CN200410029483A CN1670707A CN 1670707 A CN1670707 A CN 1670707A CN 200410029483 CN200410029483 CN 200410029483 CN 200410029483 A CN200410029483 A CN 200410029483A CN 1670707 A CN1670707 A CN 1670707A
Authority
CN
China
Prior art keywords
formation
attribute
node
user
occupying mode
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200410029483
Other languages
Chinese (zh)
Other versions
CN1315047C (en
Inventor
赵玉萍
张喜青
柳书广
肖利民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd filed Critical Lenovo Beijing Ltd
Priority to CNB2004100294834A priority Critical patent/CN1315047C/en
Publication of CN1670707A publication Critical patent/CN1670707A/en
Application granted granted Critical
Publication of CN1315047C publication Critical patent/CN1315047C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Computer And Data Communications (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

This invention discloses machine group operation management method, which adds submit mode sequence property according to operation submit mode and sets the sequences with different submit mode sequence property. The method comprises the following steps: a, submitting the operation with mode sequence property to the relative sequence when the operation server receives the submitted operation; b, getting the operation from the relative sequence and allocating to the spot of the operation when operation dispatcher dispatching the operation.

Description

A kind of management method of cluster operation
Technical field
The present invention relates to cluster job management system, particularly a kind of in cluster job management system the management method of cluster operation.
Background technology
Development along with computing machine, cluster job management system has appearred in computing machine, cluster job management system is to be based upon in the Network of Workstation, system promptly between operating system in the Network of Workstation and the application program, that be used for unified management and scheduling Network of Workstation operation and resource.This system is according to user's demand, make full use of various software and hardware resources and CPU time in the Network of Workstation, the rational management operation, unified management and scheduling group of planes resource, guarantee the fairly and reasonably shared group of planes resource of operation that the user submits to, improve the utilization factor and the throughput of whole Network of Workstation, thereby improve user's the work efficiency and the work management ability of increase enterprise.
Cluster job management system is made up of several main parts such as user command, Job Server, job scheduler, operation actuators.
Wherein, user command is the bridge between user and the cluster job management system, and the user is by user command, and this user command can adopt graphic interface, to the cluster job management system submit job, the Job Server of cluster job management system responds this user after carrying out this user command.
Job Server is safeguarded a collection of queues of being made up of operation, by to the management realization of the formation management to user's submit job.
Job scheduler is by analyzing loading condition, the formation attribute of operation place formation and the attribute of operation self of the various software and hardware resources in the Network of Workstation, and the operation in the formation of indication Job Server is dynamically delivered on the corresponding node and handled.
The operation actuator is accepted the operation that sends and is indicated corresponding node to handle this operation according to the indication of job scheduler from the formation of Job Server.
In whole process, Job Server is safeguarded a set of queues, each formation in this group has different formation attributes, the formation attribute that formation has has: the formation attribute that allows operation is submitted to the user list of this formation, user list is preserved in i.e. this formation, and the operation of having only the user in the user list to submit to just can be put in this formation; Permission is submitted to operation the formation attribute of the priority of this formation, i.e. this formation is provided with priority level, and the operation that only has this priority level formation attribute just can be put in this formation; Allow the formation attribute of the node tabulation of this formation of visit, i.e. the node tabulation is preserved in this formation, and the operation in this formation can only be carried out on the node in the corresponding node tabulation; The formation attribute of the maximum queuing number that this formation allows and the formation attribute of maximum operation number etc.
When user's submit job, the formation attribute that Job Server has according to operation submits the job in the corresponding formation, job scheduler extracts operation according to the utilization of resources and the configuring condition of current cluster job management system from corresponding formation, and, indicate the operation actuator on this node, to carry out this operation according to the node that the formation attribute and the predefined strategy of the formation of operation place determine to carry out this operation.For example: predefined strategy is for allowing the light node of load carry out the operation of high priority earlier, then job scheduler extracts operation and determines the light node of load from the formation with high-priority queue attribute, and indication operation actuator is carried out this operation on this node.
Along with the segmentation of homework type and the growing tension of cluster job management system resource, the submission pattern of operation became present shared model, user's exclusive occupying mode and node exclusive occupying mode by former single shared model.Shared model is exactly that all cluster job management system resources are shared for All Jobs; User's exclusive occupying mode is a part of resource that this user's All Jobs need be monopolized cluster job management system, and this part resource of identical cluster job management system is used in the operation that does not allow other users; The cluster job management system resource that the node exclusive occupying mode need be monopolized currently used node for the operation of submitting to.
At present, owing to not be not set to the formation attribute of operation according to above-mentioned mode division formation and also not above-mentioned pattern, so when user's submit job, can not be submitted to according to the submission pattern of operation in the different formations, thereby the formation attribute that job scheduler is had according to formation under this operation carries out the node of this operation for this job assignment, and after can only from formation, extracting this operation, move this operation and obtain the submission pattern that this operation sets in advance, according to the submission pattern of this operation again to the corresponding node of this job assignment and indicate the operation actuator on this node, to carry out this operation.
Because scheduler is all wanted running job when extracting operation each time and is judged the submission pattern reallocation node that this operation has, thereby wasted the resource of whole Network of Workstation, prolong the time of whole Network of Workstation processing operation, reduced the resource utilization of Network of Workstation and the operational efficiency of operation.
Summary of the invention
In view of this, fundamental purpose of the present invention is to provide a kind of management method of cluster operation, and this method can be saved the resource of Network of Workstation, shortens the time of Network of Workstation processing operation, improves the resource utilization of Network of Workstation and the operational efficiency of operation.
According to above-mentioned purpose, technical scheme of the present invention is achieved in that
A kind of management method of cluster operation is that operation increases submission pattern formation attribute according to the submission pattern of operation, and the formation with different submission pattern formation attributes is set, and this method also comprises:
A, when the operation server receives the operation of submission, submit the job in the formation with corresponding submission pattern formation attribute according to the submission pattern formation attribute of this operation;
B, when this operation of operation scheduler schedules, obtain this operation the formation under this operation, and divide and to be used in the node of carrying out this operation.
Described submission pattern according to operation is that operation increase submission pattern formation attribute is:
When the submission pattern of operation is shared model, for operation increases shared model formation attribute;
When the submission pattern of operation is user's exclusive occupying mode, for operation increases user's exclusive occupying mode formation attribute;
When the submission pattern of operation is the node exclusive occupying mode, for operation increases node exclusive occupying mode formation attribute.
This method further is included in the step that user list is set in the formation with user's exclusive occupying mode formation attribute;
Steps A further comprises: whether Job Server judge submits to the user with the operation of user's exclusive occupying mode formation attribute in the user list that this formation with user's exclusive occupying mode formation attribute is provided with, if submit the job in this formation; Otherwise, do not submit this operation to.
The process that described setting has the formation of different submission pattern formation attributes is: formation with shared model formation attribute is set respectively, has the formation of user's exclusive occupying mode formation attribute and has the formation of node exclusive occupying mode formation attribute.
Described setting has the formation of different submission pattern formation attributes for the formation with shared model formation attribute was set before steps A, if the formation attribute of the operation of being submitted to is user's exclusive occupying mode formation attribute or node exclusive occupying mode formation attribute, the formation that has the formation of user's exclusive occupying mode formation attribute or have node exclusive occupying mode formation attribute is set further in steps A.
After described setting had the formation of user's exclusive occupying mode formation attribute or has the formation of node exclusive occupying mode formation attribute, this method also comprised:
After intact this operation of operation scheduler schedules, delete set formation or have the formation of node exclusive occupying mode formation attribute, or be shared model formation attribute set formation or submission pattern formation attribute changes with formation of node exclusive occupying mode formation attribute with user's exclusive occupying mode formation attribute with user's exclusive occupying mode formation attribute.
The present invention further comprises the corresponding relation of the formation of setting up different submission pattern formation attributes and different node tabulation, and the described branch of step B is used in the node of carrying out this operation and is: will carry out on the node in the node tabulation of this job assignment formation correspondence under this operation.
Node during described different node is tabulated is identical.
From such scheme as can be seen, the submission pattern of method operation provided by the invention is set to the attribute of formation, and according to different formation attributes different formations is set.When user's submit job, the formation attribute that this operation has is set, the formation attribute that Job Server has according to this operation again is submitted to operation in the corresponding formation to be handled.Like this, job scheduler is when obtaining operation from corresponding formation, do not need to move the submission pattern that the sets in advance reallocation node execution that this operation is obtained in this operation, thereby this method has been saved the resource of cluster management system, shorten the time of cluster management system processing operation, improved the resource utilization of cluster management system and the operational efficiency of operation.Further, the present invention will have the corresponding different node of formation of different submission pattern formation attributes, when the operation scheduler when operation is obtained in formation and give this job assignment node, can with this job assignment to the node of affiliated formation correspondence, manage thereby can effectively utilize the resource of cluster management system and be easy to.
Description of drawings
The method that Fig. 1 manages cluster operation in cluster job management system for the present invention.
Embodiment
In order to make the purpose, technical solutions and advantages of the present invention clearer, by the following examples and with reference to accompanying drawing, the present invention is further elaborated.
Method provided by the invention also is set to user's exclusive occupying mode, shared model and node exclusive occupying mode the formation attribute of formation, and three different formations are set according to these three kinds of formation attributes, when user's submit job, the submission pattern formation attribute that this operation has is set, just user's exclusive occupying mode formation attribute, shared model formation attribute or node exclusive occupying mode formation attribute, the submission pattern formation attribute that Job Server has according to this operation again is submitted to operation in the corresponding formation to be handled.
As shown in Figure 1, the method that Fig. 1 manages cluster operation in cluster job management system for the present invention, its concrete steps are:
Step 100, as user during to the Job Server submit job, Job Server judges whether this operation has submission pattern formation attribute, if, execution in step 101; Otherwise, execution in step 106;
Step 101, Job Server are shared model, user's exclusive occupying mode or node exclusive occupying mode according to the submission pattern of this operation of submission pattern formation determined property that user's submit job has, if shared model changes step 102 over to; If user's exclusive occupying mode changes step 103 over to; If the node exclusive occupying mode changes step 104 over to;
Step 102, Job Server are put into this operation in the formation with shared model formation attribute, change step 105 over to;
Step 103, Job Server are put into this operation in the formation with user's exclusive occupying mode formation attribute, change step 105 over to;
Step 104, Job Server are put into this operation in the formation with node exclusive occupying mode formation attribute, change step 105 over to;
Step 105, job scheduler according to the strategy that sets in advance from different formations, promptly have in the formation of different submission pattern formation attributes and extract operation, give this job assignment node and indicate the operation actuator on this node, to carry out this operation according to the formation attribute that this formation has.
Step 106, Job Server are put into the formation with shared model formation attribute with this operation, and job scheduler is handled this operation according to prior art.
Because the submission pattern formation attribute that the present invention can make job scheduler have according to the formation under this operation carries out the node of this operation for this job assignment, after not needing from formation, to extract this operation, move this operation and obtain the submission pattern that this operation is provided with, the more corresponding node of this job assignment is carried out this operation according to the submission pattern of this operation.So method provided by the invention has been saved the resource of Network of Workstation, shortened the time of Network of Workstation processing operation, improved the resource utilization of Network of Workstation and the operational efficiency of operation.
In the present invention, when user's submission has user's exclusive occupying mode formation attribute and/or has the operation of node exclusive occupying mode formation attribute, formation with user's exclusive occupying mode formation attribute and/or the formation with node exclusive occupying mode formation attribute can also be set temporarily, by the time operation is handled the formation that deletion again has the formation of user's exclusive occupying mode formation attribute and/or has node exclusive occupying mode formation attribute by job scheduler, perhaps user's exclusive occupying mode formation attribute and/or the node exclusive occupying mode formation attribute modification with formation is shared model formation attribute, thereby make Job Server that the formation of different queue attribute more reasonably is set, the operation of different queue attribute is submitted in the formation of different queue attribute the resource that the formation that reduces to distribute takies.
The present invention can also be provided with permission the user list of this formation is submitted in operation in the formation with user's exclusive occupying mode formation attribute, the operation with user's exclusive occupying mode formation attribute of having only the user in the user list to submit to could be used the resource of this formation.
The present invention can also make the corresponding different node tabulation of formation with different submission pattern formation attributes, and the node in these different nodes tabulations can be identical, also can be different.When the operation in the operation scheduler handle device processing queue, can determine the node of this formation correspondence according to corresponding relation, thereby the job assignment in this formation is carried out by the operation actuator to the node of correspondence.
When illustrating that for an embodiment the corresponding different node of formation with different submission pattern formation attributes is tabulated, job scheduler is handled the process of operation in the formation with submission pattern formation attribute: the node that the formation correspondence with user's exclusive occupying mode formation attribute is set is node 1~node 5, when operation a period of time of this formation of operation scheduler handle, because the corresponding node 1~node 5 of this formation, then operation one is assigned to node 1~node 5, monopolizes node 1~node 5 by the user's who submits this operation one to operation one; When the operation two of this formation of operation scheduler handle, at first move this operation two and judge that whether this operation two is that the user of submit job one submits to, if then operation one is assigned to node 1~node 5 and carries out these operations two; Otherwise timesharing utilizes node 1~node 5 to carry out operations two, wait for that promptly node 1~node 5 executes operation one after, again this operation two is assigned to node 1~node 5 and carries out these operations two.
Because present embodiment makes the corresponding different node tabulation of formation with different queue attribute, so job scheduler just can not only distribute the operation of each formation according to the node utilization factor in the current cluster job management system, for example: when same user has submitted operation one and operation two respectively, the submission pattern all is user's exclusive occupying mode.If the employing prior art, then job scheduler obtains operation one from formation, moves this operation one and finds that the submission pattern of this operation one is user's exclusive occupying mode, and then principles and requirements node 1~node 5 of carrying out according to the light node of load is carried out operation one; Then, job scheduler obtains operation two from formation, moves this operation two and finds that the submission pattern of this operation two is user's exclusive occupying mode, and then principles and requirements node 6~node 10 of carrying out according to the light node of load is carried out operation two.So, all nodes in this cluster job management system are all taken by operation one and the operation two that this user submits to, even the resource that each node takies only is 10%, the operation that other user submits to does not have node to handle yet, and must wait until after node is handled operation one and operation two and just can handle.If employing present embodiment, then the operation one and the operation two of this user's submission are assigned in the formation with user's exclusive occupying mode formation attribute by Job Server, when the operation scheduler obtains operation one and operation two from this formation, operation one and operation two can be assigned on the node of this formation correspondence, as node 1~node 5, do not monopolize and the node of all cluster job management systems all can be set to the user, make other operation not be set to the node execution that the user monopolizes, thereby reasonably disposed the resource of node, utilized the resource of node more fully.
Shared model, user's exclusive occupying mode and the operation exclusive occupying mode of the operation that the present invention proposes is set to the submission pattern formation attribute of operation, and the operation that the different queue with this submission pattern formation attribute is used to store different submission patterns is set, therefore, the present invention has not only satisfied the execution demand of the operation of different submission patterns, and improved the resource utilization of existing cluster job management system, increased the manageability of Job Server to operation.
The above only is preferred embodiment of the present invention, not in order to restriction the present invention, all any modifications of being made within the spirit and principles in the present invention, is equal to and replaces and improvement etc., all should be included within protection scope of the present invention.

Claims (8)

1, a kind of management method of cluster operation is characterized in that, is that operation increases submission pattern formation attribute according to the submission pattern of operation, and the formation with different submission pattern formation attributes is set, and this method also comprises:
A, when the operation server receives the operation of submission, submit the job in the formation with corresponding submission pattern formation attribute according to the submission pattern formation attribute of this operation;
B, when this operation of operation scheduler schedules, obtain this operation the formation under this operation, and divide and to be used in the node of carrying out this operation.
2, the method for claim 1 is characterized in that, described submission pattern according to operation is that operation increase submission pattern formation attribute is:
When the submission pattern of operation is shared model, for operation increases shared model formation attribute;
When the submission pattern of operation is user's exclusive occupying mode, for operation increases user's exclusive occupying mode formation attribute;
When the submission pattern of operation is the node exclusive occupying mode, for operation increases node exclusive occupying mode formation attribute.
3, method as claimed in claim 2 is characterized in that, this method further is included in the step that user list is set in the formation with user's exclusive occupying mode formation attribute;
Steps A further comprises: whether Job Server judge submits to the user with the operation of user's exclusive occupying mode formation attribute in the user list that this formation with user's exclusive occupying mode formation attribute is provided with, if submit the job in this formation; Otherwise, do not submit this operation to.
4, the method for claim 1, it is characterized in that the process that described setting has the formation of different submission pattern formation attributes is: formation with shared model formation attribute is set respectively, has the formation of user's exclusive occupying mode formation attribute and has the formation of node exclusive occupying mode formation attribute.
5, the method for claim 1, it is characterized in that, described setting has the formation of different submission pattern formation attributes for the formation with shared model formation attribute was set before steps A, if the formation attribute of the operation of being submitted to is user's exclusive occupying mode formation attribute or node exclusive occupying mode formation attribute, the formation that has the formation of user's exclusive occupying mode formation attribute or have node exclusive occupying mode formation attribute is set further in steps A.
6, method as claimed in claim 5 is characterized in that, after described setting had the formation of user's exclusive occupying mode formation attribute or has the formation of node exclusive occupying mode formation attribute, this method also comprised:
After intact this operation of operation scheduler schedules, delete set formation or have the formation of node exclusive occupying mode formation attribute, or be shared model formation attribute set formation or submission pattern formation attribute changes with formation of node exclusive occupying mode formation attribute with user's exclusive occupying mode formation attribute with user's exclusive occupying mode formation attribute.
7, the method for claim 1, it is characterized in that, the present invention further comprises the corresponding relation of the formation of setting up different submission pattern formation attributes and different node tabulation, and the described branch of step B is used in the node of carrying out this operation and is: will carry out on the node in the node tabulation of this job assignment formation correspondence under this operation.
8, method as claimed in claim 7 is characterized in that, the node during described different node is tabulated is identical.
CNB2004100294834A 2004-03-19 2004-03-19 A method for managing cluster job Expired - Fee Related CN1315047C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2004100294834A CN1315047C (en) 2004-03-19 2004-03-19 A method for managing cluster job

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2004100294834A CN1315047C (en) 2004-03-19 2004-03-19 A method for managing cluster job

Publications (2)

Publication Number Publication Date
CN1670707A true CN1670707A (en) 2005-09-21
CN1315047C CN1315047C (en) 2007-05-09

Family

ID=35041980

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100294834A Expired - Fee Related CN1315047C (en) 2004-03-19 2004-03-19 A method for managing cluster job

Country Status (1)

Country Link
CN (1) CN1315047C (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104765643A (en) * 2015-03-25 2015-07-08 华迪计算机集团有限公司 Method and system for achieving hybrid scheduling of cloud computing resources
WO2016061935A1 (en) * 2014-10-20 2016-04-28 中兴通讯股份有限公司 Resource scheduling method, device and computer storage medium
CN110515737A (en) * 2019-09-02 2019-11-29 北京明略软件系统有限公司 Data management task operation method and device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2743865B2 (en) * 1995-04-28 1998-04-22 日本電気株式会社 Job scheduling method
US6345287B1 (en) * 1997-11-26 2002-02-05 International Business Machines Corporation Gang scheduling for resource allocation in a cluster computing environment
EP1283466A1 (en) * 2001-08-06 2003-02-12 Hewlett-Packard Company (a Delaware corporation) Management system for a cluster

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016061935A1 (en) * 2014-10-20 2016-04-28 中兴通讯股份有限公司 Resource scheduling method, device and computer storage medium
CN105592110A (en) * 2014-10-20 2016-05-18 中兴通讯股份有限公司 Resource scheduling method and device
CN105592110B (en) * 2014-10-20 2020-06-30 中兴通讯股份有限公司 Resource scheduling method and device
CN104765643A (en) * 2015-03-25 2015-07-08 华迪计算机集团有限公司 Method and system for achieving hybrid scheduling of cloud computing resources
CN110515737A (en) * 2019-09-02 2019-11-29 北京明略软件系统有限公司 Data management task operation method and device

Also Published As

Publication number Publication date
CN1315047C (en) 2007-05-09

Similar Documents

Publication Publication Date Title
CN1266590C (en) Progress pole/linear procedure pole management method of construction member oriented backbone system internal core
CN114138486B (en) Method, system and medium for arranging containerized micro-services for cloud edge heterogeneous environment
US7689996B2 (en) Method to distribute programs using remote Java objects
US7373640B1 (en) Technique for dynamically restricting thread concurrency without rewriting thread code
CN1818875A (en) Grouped hard realtime task dispatching method of built-in operation system
TW200401529A (en) System and method for the allocation of grid computing workload to network workstations
KR100944912B1 (en) Disk I/O Scheduler for Server Virtualization Environment and Scheduling Method Thereof
CN102081554A (en) Cloud computing operating system as well as kernel control system and method thereof
CN1636191A (en) Apparatus and method of dynamically repartitioning a computer system in response to partition workloads
CN1577253A (en) EDF scheduling method
CN1845075A (en) Service oriented high-performance grid computing job scheduling method
CN112596904A (en) Quantum service resource calling optimization method based on quantum cloud platform
CN103503412A (en) Method and device for scheduling resources
CN109597674B (en) Shared virtual resource pool share scheduling method and system
Dong et al. A grid task scheduling algorithm based on QoS priority grouping
CN1315047C (en) A method for managing cluster job
CN101051302A (en) Method and system for loading programme on computer system
CN103677959A (en) Virtual machine cluster migration method and system based on multicast
CN1881895A (en) Apparatus operation method in network management system
US20230161620A1 (en) Pull mode and push mode combined resource management and job scheduling method and system, and medium
CN111966481A (en) Parallel computing management method and system suitable for multi-tenant scene
US7181491B2 (en) Intelligent data pool management engine
CN115878910A (en) Line query method, device and storage medium
Kravetz et al. Enhancing Linux scheduler scalability
CN115098220A (en) Large-scale network node simulation method based on container thread management technology

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20070509

Termination date: 20210319