CN1744047A - Method for realizing dynamic layout of high-performance server based on group structure - Google Patents

Method for realizing dynamic layout of high-performance server based on group structure Download PDF

Info

Publication number
CN1744047A
CN1744047A CN 200510044818 CN200510044818A CN1744047A CN 1744047 A CN1744047 A CN 1744047A CN 200510044818 CN200510044818 CN 200510044818 CN 200510044818 A CN200510044818 A CN 200510044818A CN 1744047 A CN1744047 A CN 1744047A
Authority
CN
China
Prior art keywords
node
resource
computational resource
reflection
planes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200510044818
Other languages
Chinese (zh)
Other versions
CN100451970C (en
Inventor
王恩东
李景山
魏健
王守昊
胡雷钧
董小社
伍卫国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Jiaotong University
Inspur Electronic Information Industry Co Ltd
Original Assignee
Langchao Electronic Information Industry Co Ltd
Xian Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Langchao Electronic Information Industry Co Ltd, Xian Jiaotong University filed Critical Langchao Electronic Information Industry Co Ltd
Priority to CNB2005100448184A priority Critical patent/CN100451970C/en
Publication of CN1744047A publication Critical patent/CN1744047A/en
Application granted granted Critical
Publication of CN100451970C publication Critical patent/CN100451970C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The method prepares following steps: separating computing resources from storing resources in computer cluster, and setting up different identifiers for them; then, binding computing resources and storing resources in different ID dynamically so as to build new computing nodes; changing computing performances of new computing nodes and adding the said new computing nodes to functional partitions in lack of resources in order to raise integral use ratio of server. The method changes computing performances of nodes quickly and easily, and adds modified computing performances of idle resources dynamically to functional partitions in heavy loads in order to raise use ratio of server.

Description

A kind of high-performance server dynamic deployment method of realizing based on group of planes structure
Technical field
The present invention relates to high-performance server architecture field, particularly a kind of high-performance server dynamic deployment method of realizing based on group of planes structure.
Background technology
Along with the network technology develop rapidly, 10 Gigabit Ethernets, 10Gb infiniband network are ripe in succession to be used, and makes that the high-speed interconnect between computational resource, the storage resources becomes possibility; Along with the transfer of operating system to networked distributed system, the procotol function has become the prerequisite function of modern operating system software.In the design of parts such as computing machine network interface card, increased support to network startup, like this, in system starting process, the carrier of computing environment can be selected in certain stage, and using network storage resource to set up computing environment as carrier becomes possibility.
A group of planes is the main fluid architecture of high-performance server at present, high-performance server based on group of planes structure is by many relatively independent servers, couple together by high speed internet and to constitute, each server is called a node, all move an independently operating system on each node, by the cooperation and the management of software and hardware, form the high performance computer system of a single reflection.
The operating system of guiding and operation generally derives from the hard disk of node this locality on each node of Network of Workstation.Computational resource of node (processor is the parts of core) and storage resources (local hard drive is the parts of core) are static bindings, that is to say, the estimated performance of node is to be determined by operating system and related software thereof that it is stored on the local hard drive.
In the applied environment of a group of planes, a group of planes usually is deployed supports multiple application, be divided into different function divisions, at a time, the demand of using resource in the different function divisions is unbalanced, be easy to occur node utilization factor height in some function division, and the low phenomenon of the utilization factor of some subregion node.We wish can the dynamic adjustment function subregion in the quantity of node, make that calculating the nodal point number amount in the function division is complementary with the calculation task of bearing.But, because the estimated performance of node is determined by the operating system and the system software of this locality, if the node in the difference in functionality subregion can not change the application of being supported, even from the low functional pool of utilization factor, join in the high function division of utilization factor, also not necessarily can share the task in the high function division of utilization factor.
As seen, need a kind of method that can change the node estimated performance fast,, adjust the quantity of node in the function division easily, improve the utilization factor of resource by the estimated performance of dynamic change node.
Summary of the invention
The objective of the invention is to a kind of dynamic binding computational resource and storage resources, change the holding property of calculating of node fast, improve the utilization factor of server, realize high-performance server dynamic deployment method based on group of planes structure.
Resource in the group of planes is divided into computational resource and storage resources, be respectively computational resource and different signs be set with storage resources, dynamic binding computational resource and storage resources make up new calculating node, then by changing the estimated performance that calculates node, calculating node dynamic state part after changing estimated performance is deployed in the heavy function division of operating load, to improve the overall utilization rate of server, dispositions method is divided into following steps:
The operating system that a, making can move and the reflection of related software, and write down the application characteristic that this reflection is supported; Utilize the Image Data that existing node is made to be needed, the Image Data that copy has been made, the base attribute in record making source in Image Data as CPU, network interface card, memory information, is used to check whether the computational resource of being bound is fit to this reflection operation.
B, computational resource and storage resources binding relationship are set;
The startup and the operation of c, control computational resource and storage resources, newly calculate the application program that node can be supported by the imaged features identification that is connected, automatically the hardware information of surveymeter operator resource contrasts the constraint information in the reflection resource, determines whether computational resource meets the service condition of reflection;
D, the new calculating node characteristic that makes up of basis and resource binding relation are sometime, such as the end of month of every month; Under certain condition, continue too high such as the cpu busy percentage in certain function division; Automatically the calculating node in the Free Partition being transferred to needs in this estimated performance and the operating load function division heavily, and the estimated performance of adjusting a group of planes automatically is with the continuous ruuning situation that changes of adaptation server;
In dynamic deployment method of the present invention, computational resource be Ethernet card be unique identification;
In the process that has the calculating node operation storage device image of not supporting remote boot storage network interface card, two stage bootup processs by operating system are finished, the driver of load store card in the phase one guiding, identify memory device, root file system is switched on the memory device in the subordinate phase guiding.
For the network components of not supporting remote boot, be by supporting remote boot network components pilot operationp system kernel and load the corresponding driving program, make it be connected equipment on the network in the identification of booting operating system stage.
Description of drawings
Fig. 1 is the structural representation of computational resource and storage resources;
Fig. 2 is a group of planes structural representation;
Fig. 3 is the start-up course synoptic diagram of computational resource on NFS type stores resource;
Fig. 4 is the start-up course synoptic diagram of computational resource on SAN type stores resource.
Embodiment
For making the purpose, technical solutions and advantages of the present invention clearer, below in conjunction with embodiment and accompanying drawing, the present invention is described in more detail.
The present invention has realized the dynamic combined of computational resource and storage resources in the Network of Workstation, dynamically changes the estimated performance of computational resource, and idle calculating node is added in the high function division of operating load quickly and easily, improves the utilization factor of resource.
Fig. 1 is the sign and the connection diagram of a computational resource and storage resources.Calculate node C, comprise parts such as processor, internal memory, Ethernet card, HBA (HCA) card, by unique this calculating node of sign of the MAC Address of Ethernet card.
Storage resources is divided into two classes, one class is the storage resources on the nfs server, the address of resource addresses storage resources is with the combination sign of the catalogue of nfs server IP address+NFS output, note is S, another kind of resource is the resource on the SAN memory device, combination sign with SAN storage switch IP address+LUN number, note is S ', Ethernet card is supported remote boot agreements such as PXE, HBA (HCA) card can not supported the remote boot agreement, the bootup process that calculating node C operates in the operating system S ' on the SAN is after guiding successfully by Ethernet card, to switch on the SAN.
Fig. 2, be the synoptic diagram of the high-performance server of a typical group of planes structure, the node C in the group of planes is because the requirement of using is dispensed in the different function divisions, node is connected on the nfs server by Ethernet, is connected on the memory device by storage networking.
Usually use the pattern of a group of planes to be, deposit on nfs server or SAN and use relevant data, node is by local hard disk startup and each node of operation operating system independently.Local hard drive is to install when a group of planes is initially installed or according to needs afterwards, and the installation of operating system is a job very consuming time on the node, and the operating system after installing can not be used by other node.
Method of the present invention be by operating system installation in nfs server of sharing or SAN storage, dynamic-configuration is calculated the startup of node and is videoed, and reaches the estimated performance of dynamic change node.Calculate the reflection that node needs by quick deployment, can change the clearing characteristic of node easily, node is joined need in the function division of this computational resource and go.
Embodiment one
As shown in Figure 2; A group of planes supports three classes to use, and the operation in function division 1,2,3 respectively of employed node suppose that the operating load in the current function division 1 is very heavy, and function division 2 operating loads is very light, and dynamically the step of disposing is as follows:
1. monitoring newly adds new computational resource in the group of planes, as standby computational resource;
2. the operating load of node in each function division in the monitoring group of planes is the lighter node of operating load
3. as standby computational resource;
4. can a certain calculating node in the function division 2 be extracted as C2;
5. be the reflection S1 that C2 dynamic binding function division 1 needs;
6.C2 guiding and execution reflection S1, the new calculating node C ' of structure support function subregion 1;
7. C ' is joined in the function division 1, increase the computing power of function division 1.
When realizing the prerequisite of this function, the Ethernet card network enabled of node starts, and the priority that is set to network startup is higher than the startup priority of local hard drive; Application support in the function division is dynamic to be added and the deletion node.
Fig. 3 and Fig. 4 further specify and make up the concrete steps of dynamically disposing node.
Embodiment two
Shown in Figure 3 is one dynamically dispose control desk, control the process that the reflection of node from nfs server that newly joins in the group of planes starts the reciprocal process of control desk, calculating node, nfs server.As can be seen, just utilize the reflection on the nfs server to make up new calculating node, just do not need SAN equipment, the investment of SAN equipment that like this can conserve expensive if calculate node.Just nfs server can bring IO visit bottleneck as the storage of concentrating, and utilizes the method for the structure node of the memory map on the nfs server to can be regarded as a solution cheapness, that low performance requires so calculate node.
The 1-14 step is removed the 4-10 step and to set up the process based on the non-disk workstation of NFS, PXE, tftp instrument identical among Fig. 3, the purpose in 4-10 step is to check whether the computational resource and the reflection resource of binding mate, and prevents that difference because of hardware from causing the new node of structure can not normal boot and operation.
Embodiment three
As shown in Figure 4, a usefulness is dynamically disposed the reflection start-up course of node from SAN that newly joins in the group of planes of control desk control, key is two stage bootup processs of utilizing operating system itself to provide, phase one, utilize the guiding function of Ethernet card, the driver of the kernel of pilot operationp system and load store card on Ethernet (12,13 step); The second chicken stage, the storage card that does not have the network startup function is after the os starting stage finishes, root file system identifies memory device before switching, and the SAN that root file system switches to identification gone up (14 step), though this node guides on the Ethernet of low speed, but but can utilize network at a high speed in the operational process, (as infiniband, light channel network) carries out the high-speed communication and the network storage.
By the above embodiments as seen, the dynamic deployment method of the high-performance server node of this realization group of planes structure of the present invention, can guarantee fast, conveniently to change the estimated performance of node, idling-resource dynamically changed in the function division of adding to behind the estimated performance in the operating load go, improve the utilization factor of server.
Embodiment four
1) computational resource in the group of planes is separated with storage resources, the address of computational resource and associated components thereof is a unique identification with the NIC address (MAC Address) of Ethernet, note is C, and the address of storage resources is with the combination sign of the output directory of the IP address+NFS of nfs server, and note is S; With IP address+LUN number combination sign of SAN storage switch, note is S ', and S and S ' are as the storage resources sign of concentrating;
2) dynamic deployment method is divided into four steps, detailed process comprises:
A, go up the operating system that making can move and the reflection of related software, and write down the application characteristic of this reflection support at S or S '
B, computational resource and storage resources binding are provided with, and the corresponding relation of C and S or S ' is set
Start c, the control reflection of C on S or S ', and with S (S ') as local storage use, constituted node in the virtual group of planes dynamically by C and S (S '), assert the application that this node can be supported by the imaged features that is connected
D, according to the characteristic of the new node that makes up this node is added in the function division of this estimated performance of needs and goes.
Wherein, the address of the link that belongs to computational resource that step 1) SAN storage switch is connected with computational resource is not as the unique identification of computational resource.
Step 2) further tells at certain hour (as the end of month of every month) according to the strategy of formulating in advance, (continue too high) under certain condition as the cpu busy percentage in certain function division, automatically perform b, c, the d process, node in the Free Partition is transferred in the heavy function division of operating load, adjusted the estimated performance of a group of planes automatically, be fit to the applicable cases that constantly changes.
Step a can further comprise the Image Data that utilize existing node (making source) to make needs; The Image Data that copy has been made; The base attribute in record making source in Image Data, as CPU, network interface card, memory information, be used for checking with the computational resource binding stage whether the computational resource bound is fit to this reflection and moves, on the storage resources S of assigned address (S '), set up Image Data.
Step b can for: search and collect in the group of planes initiate computational resource or the not high node of utilization factor is accepted as computational resource, set up the binding relationship between computational resource and the storage resources.
The startup method can further comprise the reflection of the described control computational resource of step c C on storage resources S or S ': when C starts and during operation, C directly communicates by letter with S by Ethernet card, makes up new node from S; When C starts and during operation from S ', because C can not directly communicate by letter with S ' by Ethernet card, because C is connected by HBA (Host Bus Adapter) card or HCA (Host ChannelAdapter) card connection with S's ', " initial RAM disk " (or initrd) that this method adopts the Linux type operating system to provide provides two stage bootup processs, in initrd, load the driving of HBA card or HCA card, make kernel and the initrd on the NFS Server of leaving in by the guiding of MAC network interface card, can discern S ', then root file system be switched on the S '.This method also comprises the operating system (as Windows, AIX etc.) of utilizing other types, computational resource is earlier from network components guiding that can netboot, loading then can not be after network components (as the HCA card) driving of netboot, identify the computational resource S ' that directly is not connected network components that can netboot, root file system is switched to method on the S '.
Step c can for: utilize the PXE function of Ethernet card,, realize guiding and the operation of C from the S by the configuration of DHCP, tftp service; Can realize that also C goes up pilot operationp system kernel and initrd from NFS Server by configuration, utilize the driver among the initrd, discern the S ' that preliminary SAN goes up connection, the root file is switched on the S ' by DHCP, tftp service.
Steps d can be the new node that makes up, and the information that goes up reflection according to S (S ') can be known the application that new node is supported, can add new node in the function corresponding subregion to and go.
By technical scheme of the present invention as seen, the present invention is by decouples computation resource and storage resources, dynamic binding computational resource and storage resources, realized the estimated performance of node in the quick change group of planes, make the estimated performance of computational resource not be subjected to the constraint of the memory map of local hard drive, can support multiple estimated performance, according to the needs of using, reconstruct group of planes part and whole estimated performances have improved the utilization factor of computational resource and storage resources.

Claims (4)

1, a kind of high-performance server dynamic deployment method of realizing based on group of planes structure, it is characterized in that the resource in the group of planes is divided into computational resource and storage resources, be respectively two kinds of resources different signs is set, two kinds of resource constructions of dynamic binding become the new node that calculates, change the estimated performance that calculates node then, and its dynamic state part is deployed in the heavy function division of operating load improves with the integral body that realizes server utilization, this dispositions method divides following steps to realize:
The operating system that a, making can move and the reflection of related software, and write down the application characteristic that this reflection is supported; Utilize the Image Data that existing node is made to be needed, the base attribute in record making source in the Image Data that copy is made as CPU, network interface card, memory information, is used to check whether the resource of being bound is fit to this reflection operation;
B, computational resource and storage resources binding relationship are set;
The startup and the operation of c, control computational resource and storage resources, by the new application program of calculating the node support of imaged features identification that is connected, automatically the hardware information of surveymeter operator resource contrasts the constraint information in the reflection resource, determines whether computational resource meets the service condition of reflection;
D, new calculating node characteristic and the binding relationship that makes up of basis are sometime, such as the end of month of every month; Under certain condition, continue too high CPU such as utilization factor in certain function division; Automatically the calculating node in the Free Partition being transferred to needs in this estimated performance and the operating load function division heavily, and the estimated performance of adjusting a group of planes automatically is with the continuous ruuning situation that changes of adaptation server.
2, dynamic deployment method as claimed in claim 1 is characterized in that, computational resource is that Ethernet card is a unique identification.
3, a kind of high-performance server dynamic deployment method of realizing based on group of planes structure, it is characterized in that, in the method that has the calculating node operation storage device image of not supporting remote boot storage network interface card, be to finish by two stage bootup processs of operating system, the driver of load store card in the phase one guiding, identify memory device, root file system is switched on the memory device in the subordinate phase guiding.
4, dynamic deployment method as claimed in claim 3, it is characterized in that, can be connected equipment on the network in the identification of booting operating system stage by the network components supporting remote boot network components pilot operationp system kernel and load the corresponding driving program, make not support remote boot.
CNB2005100448184A 2005-09-27 2005-09-27 Method for realizing dynamic layout of high-performance server based on group structure Active CN100451970C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2005100448184A CN100451970C (en) 2005-09-27 2005-09-27 Method for realizing dynamic layout of high-performance server based on group structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2005100448184A CN100451970C (en) 2005-09-27 2005-09-27 Method for realizing dynamic layout of high-performance server based on group structure

Publications (2)

Publication Number Publication Date
CN1744047A true CN1744047A (en) 2006-03-08
CN100451970C CN100451970C (en) 2009-01-14

Family

ID=36139434

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2005100448184A Active CN100451970C (en) 2005-09-27 2005-09-27 Method for realizing dynamic layout of high-performance server based on group structure

Country Status (1)

Country Link
CN (1) CN100451970C (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169448A (en) * 2011-03-18 2011-08-31 浪潮电子信息产业股份有限公司 Deployment method of cluster parallel computing environment
CN101820387B (en) * 2010-02-08 2012-12-12 北京航空航天大学 Method for rapidly deploying extensible cluster
CN102833096A (en) * 2012-08-06 2012-12-19 杭州华三通信技术有限公司 Method and device for implementation of low-cost high-availability system
CN103116569A (en) * 2012-10-31 2013-05-22 劲智数位科技股份有限公司 Cluster type computer system with operating system environment adjustment
CN105703911A (en) * 2014-11-25 2016-06-22 上海天脉聚源文化传媒有限公司 Image processing computer and formation method
CN107172208A (en) * 2017-06-30 2017-09-15 联想(北京)有限公司 The dispositions method and its system of server
CN111866188A (en) * 2020-04-30 2020-10-30 中科院计算所西部高等技术研究院 Computer group construction method with OODA fractal mechanism

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9692649B2 (en) 2014-02-26 2017-06-27 International Business Machines Corporation Role assignment for servers in a high performance computing system based on measured performance characteristics

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3377125B2 (en) * 1994-03-09 2003-02-17 日本電信電話株式会社 Network load smoothing method
US7155515B1 (en) * 2001-02-06 2006-12-26 Microsoft Corporation Distributed load balancing for single entry-point systems
CN1242338C (en) * 2002-06-05 2006-02-15 中国科学院计算技术研究所 A system architecture of concentration system
CN1251111C (en) * 2002-12-31 2006-04-12 联想(北京)有限公司 Load weighing method based on systematic grade diagnosis information
US7313795B2 (en) * 2003-05-27 2007-12-25 Sun Microsystems, Inc. Method and system for managing resource allocation in non-uniform resource access computer systems
CN1296850C (en) * 2003-12-10 2007-01-24 中国科学院计算技术研究所 Partition lease method for cluster system resource management
CN1315046C (en) * 2004-03-17 2007-05-09 联想(北京)有限公司 A method for allocating computation nodes in cluster job management system

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101820387B (en) * 2010-02-08 2012-12-12 北京航空航天大学 Method for rapidly deploying extensible cluster
CN102169448A (en) * 2011-03-18 2011-08-31 浪潮电子信息产业股份有限公司 Deployment method of cluster parallel computing environment
CN102169448B (en) * 2011-03-18 2013-10-23 浪潮电子信息产业股份有限公司 Deployment method of cluster parallel computing environment
CN102833096A (en) * 2012-08-06 2012-12-19 杭州华三通信技术有限公司 Method and device for implementation of low-cost high-availability system
CN102833096B (en) * 2012-08-06 2016-06-29 杭州华三通信技术有限公司 The high-availability system of a kind of low cost realizes method and device
CN103116569A (en) * 2012-10-31 2013-05-22 劲智数位科技股份有限公司 Cluster type computer system with operating system environment adjustment
CN105703911A (en) * 2014-11-25 2016-06-22 上海天脉聚源文化传媒有限公司 Image processing computer and formation method
CN107172208A (en) * 2017-06-30 2017-09-15 联想(北京)有限公司 The dispositions method and its system of server
CN107172208B (en) * 2017-06-30 2021-09-14 联想(北京)有限公司 Server deployment method and system
CN111866188A (en) * 2020-04-30 2020-10-30 中科院计算所西部高等技术研究院 Computer group construction method with OODA fractal mechanism
CN111866188B (en) * 2020-04-30 2022-05-17 中科院计算所西部高等技术研究院 Computer group construction method with OODA fractal mechanism

Also Published As

Publication number Publication date
CN100451970C (en) 2009-01-14

Similar Documents

Publication Publication Date Title
CN1744047A (en) Method for realizing dynamic layout of high-performance server based on group structure
CN1848787A (en) Automatic fast dispositioning method for aggregated server system node
EP2021939B1 (en) Converting machines to virtual machines
US7725559B2 (en) Virtual data center that allocates and manages system resources across multiple nodes
CN100345415C (en) Method and apparatus for perfoming boot, maintenance, or install operations on a storage area network
WO2014142473A1 (en) Key value-based data storage system and operation method thereof
US8612553B2 (en) Method and system for dynamically purposing a computing device
CN1700178A (en) System and method for computer cluster virtualization using dynamic boot images and virtual disk
US20070067366A1 (en) Scalable partition memory mapping system
CN100347672C (en) Long-distance guide chip of transparent computing equipment based on dragon chip rack and panel construction and method thereof
CN102521063A (en) Shared storage method suitable for migration and fault tolerance of virtual machine
CN104468734A (en) Virtual cluster expanding method based on cloning
CN1916861A (en) Method for modifying configuration information of computer
CN102693230B (en) For the file system of storage area network
CN1869933A (en) Computer processing system for implementing data update and data updating method
CN1968168A (en) Blade server positioning method and system
CN1367439A (en) Several customer terminals interdynamic load equalizing method and its system
US7668938B1 (en) Method and system for dynamically purposing a computing device
CN1293493C (en) Computer group file service system and its input output treatment method
CN116382585A (en) Temporary volume storage method, containerized cloud platform and computer readable medium
CN200944605Y (en) Domain name server and communication system
CN1295601C (en) Time-optimized replacement of software application
CN106657220A (en) Nginx based Cloud Foundry intranet deployment scheme
CN1279439C (en) System and method of streaming data to computer in a network
CN1315053C (en) Refresh method of network computer BIOS

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant