CN114422361A - Operation and maintenance management method, device, equipment and product of cluster server - Google Patents
Operation and maintenance management method, device, equipment and product of cluster server Download PDFInfo
- Publication number
- CN114422361A CN114422361A CN202111422292.4A CN202111422292A CN114422361A CN 114422361 A CN114422361 A CN 114422361A CN 202111422292 A CN202111422292 A CN 202111422292A CN 114422361 A CN114422361 A CN 114422361A
- Authority
- CN
- China
- Prior art keywords
- node
- configuration information
- expanded
- capacity expansion
- cluster server
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000007726 management method Methods 0.000 title claims abstract description 74
- 238000012423 maintenance Methods 0.000 title claims abstract description 70
- 238000000034 method Methods 0.000 claims abstract description 19
- 238000004590 computer program Methods 0.000 claims description 19
- 238000012937 correction Methods 0.000 claims description 9
- 239000000758 substrate Substances 0.000 claims description 6
- 238000012546 transfer Methods 0.000 claims description 4
- 238000004891 communication Methods 0.000 abstract description 6
- 238000011900 installation process Methods 0.000 abstract description 6
- 230000002596 correlated effect Effects 0.000 abstract description 4
- 238000009434 installation Methods 0.000 description 19
- 238000005516 engineering process Methods 0.000 description 9
- 238000010586 diagram Methods 0.000 description 5
- 230000002159 abnormal effect Effects 0.000 description 4
- 230000006399 behavior Effects 0.000 description 4
- 230000001276 controlling effect Effects 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- KKIMDKMETPPURN-UHFFFAOYSA-N 1-(3-(trifluoromethyl)phenyl)piperazine Chemical compound FC(F)(F)C1=CC=CC(N2CCNCC2)=C1 KKIMDKMETPPURN-UHFFFAOYSA-N 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0803—Configuration setting
- H04L41/0823—Configuration setting characterised by the purposes of a change of settings, e.g. optimising configuration for enhancing reliability
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/4401—Bootstrapping
- G06F9/4416—Network booting; Remote initial program loading [RIPL]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/06—Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention provides an operation and maintenance management method, device, equipment and product of a cluster server, relating to the technical field of communication, wherein the method comprises the following steps: acquiring a capacity expansion request of a node to be expanded; acquiring configuration information for capacity expansion according to the capacity expansion request; according to the configuration information for capacity expansion, starting the node to be expanded based on a pre-starting execution environment, and correcting the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name and an internet protocol address; and acquiring the node configuration information, and setting the node to be expanded to be started based on a hard disk. The invention can fix the relevant configuration of the nodes to be expanded, so that the names and IP addresses of the main nodes of the node operating system can be correlated with the BMC information, the problem of disorder in the installation process is avoided, and the nodes to be expanded can be accurately expanded in batch in a large-scale cluster server environment.
Description
Technical Field
The present invention relates to the field of communications technologies, and in particular, to an operation and maintenance management method, apparatus, device, and product for a cluster server.
Background
In the field of software application, installation and capacity expansion of a software cluster based on a physical server cluster are required to have large-scale deployment and capacity expansion capabilities. In the aspect of deployment and capacity expansion of a cluster server, a Preboot execution Environment (PXE) supports a workstation to download an image from a remote server through a network and thus supports starting an operating system through the network, in the starting process, a terminal requires the server to allocate an Internet Protocol (IP) address, then downloads a starting software package into a local memory for execution, and the starting software package completes the setting of basic software of the terminal (client), so as to guide the terminal operating system pre-installed in the server.
However, at present, due to uncertainty generated by PXE during batch installation, the installation order of cluster nodes is disordered, and for traditional software to be installed and expanded on the cluster nodes, a Baseboard Management Controller (BMC) of all cluster servers needs to be logged in many times, and operations such as configuring node startup items and controlling the node power to be turned on and off need to be performed many times. Therefore, the existing operation and maintenance management method for the cluster server causes that the installation and capacity expansion efficiency of the cluster server is very limited.
Disclosure of Invention
The invention provides an operation and maintenance management method, device, equipment and product of a cluster server, which are used for solving the problems of disorder and repeated operation generated when PXE is used for installing and expanding cluster server nodes in the prior art and realizing accurate batch expansion of nodes to be expanded in a large-scale cluster server environment.
The invention provides an operation and maintenance management method of a cluster server, which comprises the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
According to the operation and maintenance management method for the cluster server provided by the present invention, the obtaining of the configuration information for capacity expansion according to the capacity expansion request specifically includes:
acquiring first configuration information and second configuration information according to the capacity expansion request; the first configuration information and the second configuration information constitute configuration information for capacity expansion, the first configuration information is configuration information for capacity expansion of a master node in the cluster server, and the second configuration information is baseboard management controller information of other nodes in the cluster server.
According to the operation and maintenance management method for the cluster server provided by the present invention, the obtaining of the configuration information for capacity expansion according to the capacity expansion request specifically includes:
and sharing the second configuration information of each node in the cluster server based on a hypertext transfer protocol.
According to the operation and maintenance management method of the cluster server provided by the present invention, according to the configuration information for capacity expansion, the node to be expanded executes node capacity expansion based on a pre-boot execution environment, and corrects the node configuration information of the node to be expanded, specifically including the following steps:
generating starting configuration information of a pre-starting execution environment according to the first configuration information;
according to the starting configuration information, the node to be expanded executes node expansion based on a pre-starting execution environment;
after the node to be expanded is started based on the pre-starting execution environment, acquiring the second configuration information and the third configuration information shared by other nodes; the third configuration information is baseboard management controller information of the node to be expanded;
and correcting the node configuration information by the node to be expanded according to the second configuration information and the third configuration information.
According to the operation and maintenance management method of the cluster server provided by the invention, before the step of obtaining the capacity expansion request of the node to be expanded, the method further comprises the following steps:
and deploying the operation and maintenance management environment of the cluster server and initializing the pre-starting execution environment service.
According to the operation and maintenance management method of the cluster server provided by the invention, the obtaining of the node configuration information and the setting of the node to be expanded to be started based on the hard disk after the expansion are completed specifically include:
after the deployment node deploying the operation and maintenance management environment acquires that the internet interconnection protocol address of the node to be expanded is on-line, after the expansion is completed, the deployment node sets the node to be expanded to be started based on a hard disk.
The invention also provides an operation and maintenance management device of the cluster server, which comprises:
the first acquisition module is used for acquiring a capacity expansion request of a node to be expanded;
the second acquisition module is used for acquiring configuration information for capacity expansion according to the capacity expansion request;
the correction module is used for executing node capacity expansion based on a pre-starting execution environment by the node to be subjected to capacity expansion according to the configuration information for capacity expansion and correcting the node configuration information of the node to be subjected to capacity expansion; wherein the node configuration information comprises an internet protocol address;
and the protection module is used for acquiring the node configuration information and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
The invention further provides an electronic device, which includes a memory, a processor and a computer program stored on the memory and capable of running on the processor, and when the processor executes the program, the steps of the operation and maintenance management method of the cluster server are implemented.
The present invention also provides a non-transitory computer readable storage medium, on which a computer program is stored, which, when being executed by a processor, implements the steps of the operation and maintenance management method of the cluster server according to any one of the above.
The present invention also provides a computer program product, which includes a computer program, and when the computer program is executed by a processor, the computer program implements the steps of the operation and maintenance management method for the cluster server as described in any one of the above.
According to the operation and maintenance management method, device, equipment and product of the cluster server, the configuration information for capacity expansion is obtained, and based on the configuration information for capacity expansion, when a PXE technology is used, the node configuration information of the node to be expanded can be automatically corrected to be expected information, accurate capacity expansion and automatic error correction are achieved, the relevant configuration of the node to be expanded is fixed, the name and the IP address of the node operating system main node can be correlated with BMC information, the disorder problem generated in the batch unattended installation process is avoided, the node to be expanded can be accurately and massively expanded in a large-scale cluster server environment, the installation and capacity expansion efficiency of the cluster node can be greatly improved, the operation and maintenance management work of the cluster server is better reduced to the minute level from several hours or several working days originally.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
FIG. 1 is a schematic flow chart of an operation and maintenance management method for a cluster server provided in the present invention;
FIG. 2 is a second flowchart illustrating an operation and maintenance management method for a cluster server according to the present invention;
fig. 3 is a schematic flowchart of step S400 in the operation and maintenance management method for a cluster server provided in the present invention;
fig. 4 is a schematic structural diagram of an operation and maintenance management apparatus of a cluster server provided in the present invention;
fig. 5 is a second schematic structural diagram of an operation and maintenance management apparatus of a cluster server provided in the present invention;
fig. 6 is a schematic structural diagram of a correction module in the operation and maintenance management method for a cluster server provided by the present invention;
fig. 7 is a schematic structural diagram of an electronic device provided by the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the prior art, when a cluster server installs a system at an open office or expands a new node in a batch for an existing cluster, if a PXE technology and a Dynamic Host Configuration Protocol (DHCP) are used by the cluster server, since an IP address of a Host node of an operating system and BMC information cannot be associated with each other, uncertainty occurs when the PXE is installed in a batch, and a problem of a disordered installation sequence of the cluster nodes occurs, where the uncertainty is, for example, different hardware configurations of nodes and is assigned to different roles of the cluster server; and the IP addresses of the network planning and the actual cluster servers after installation are inconsistent.
Based on the above problems, the present invention aims to provide a solution that can perform precise batch capacity expansion in a large-scale cluster server environment, and can fix the relevant configuration of new nodes to be expanded according to capacity expansion planning information such as BMC, etc., so as to solve the problem of disordered node installation in a cluster, so that the installation and capacity expansion efficiency of cluster nodes can be greatly improved, and the operation and maintenance management work of the cluster server can be better performed by reducing the operation and maintenance efficiency to a minute level from several hours or several working days originally.
The operation and maintenance management method of the cluster server of the present invention is described below with reference to fig. 1, and the method includes the following steps:
s200, acquiring a capacity expansion request of a node to be expanded.
When the cluster server needs to add a new node to realize capacity expansion, the new node generates a corresponding capacity expansion request, and then the cluster server determines whether the new node is needed. The newly added node may be a computer. For example, when the storage space of all the nodes in the cluster server is saturated, a new node may be added to the cluster system to expand the cluster server.
It should be noted that the newly added node may be at least one node, that is, the operation and maintenance management method of the cluster server of the present invention may be applicable to batch capacity expansion of new nodes of the cluster server.
Specifically, in some possible embodiments, the cluster server may be one computer or a system composed of a plurality of computers.
Optionally, the cluster server in the embodiment of the present invention may further include a backup cluster server, which may replace the failed cluster server when the cluster server fails, and execute an operation of the cluster server, so as to further improve reliability of cluster management.
S300, according to the capacity expansion request of the node to be expanded, obtaining configuration information for capacity expansion.
Specifically, in this embodiment, in step S300, the first configuration information and the second configuration information are obtained according to the capacity expansion request of the node to be capacity expanded. In the method, first configuration information and second configuration information constitute configuration information for capacity expansion, and the first configuration information is configuration information for capacity expansion of a master node in a cluster server, for example, information such as a master node name, a master node master IP, a master node bmc _ IP, a master node bmc _ user, and a master node bmc _ pass; the second configuration information is BMC information of other nodes in the cluster server, that is, BMC _ info of other nodes.
S400, according to the configuration information for capacity expansion, the nodes to be expanded execute installation after capacity expansion is performed on the basis of the PXE execution nodes, and the node configuration information of the nodes to be expanded is corrected. In the method, the node configuration information includes a master node name, an IP address, and BMC information.
In some possible embodiments, the node configuration information further includes role configuration information of the node to be expanded, and the like.
In step S400, by correcting the node configuration information of the nodes to be expanded, when the nodes to be expanded are installed in batch by using the PXE technology, the configuration information of these newly added nodes can be fixed, so that the node operating system host node name and IP address can be associated with the BMC information, thereby avoiding the problem of out-of-order in the installation process.
S500, obtaining node configuration information, and setting the nodes to be expanded to be started based on the hard disk after expansion is completed so as to protect the nodes to be expanded and avoid abnormal node access and possibly repeated PXE behaviors.
In this embodiment, the protection measures for the node to be expanded in step S500 further include clearing the pxe configuration file associated with the existing mac address.
Based on the operation and maintenance management method of the cluster server, the PXE technology can be utilized to complete cluster installation and the existing cluster expansion task.
According to the operation and maintenance management method of the cluster server, the configuration information for capacity expansion is obtained, based on the configuration information for capacity expansion, when a PXE technology is used, the node configuration information of the node to be expanded can be automatically corrected to be expected information, accurate capacity expansion and automatic error correction are achieved, the relevant configuration of the node to be expanded is fixed, the name and the IP address of the node operating system main node can be correlated with BMC information, the disorder problem generated in the batch unattended installation process is avoided, the node to be expanded can be accurately and massively expanded in a large-scale cluster server environment, the installation and capacity expansion efficiency of the cluster node can be greatly improved, the operation and maintenance management work of the cluster server is better reduced to the minute level from the original hours or working days.
The operation and maintenance management method of the cluster server according to the present invention is described below with reference to fig. 2, where the method further includes the following steps before step S200:
s100, deploying an operation and maintenance management environment of the cluster server and initializing PXE service.
In the method, the nodes with the operation and maintenance management environment are deployment nodes.
In some possible embodiments, the operation and maintenance management environment uses DNSmasq, ipmitool, and the like, and completes system installation of the newly added node through the DNSmasq, ipmitool, and the like. The DNSmasq provides DHCP and simple File Transfer Protocol (TFTP) services required by PXE; the operating system image comprises an ipmitool tool and BMC information shared by other nodes acquired according to the ipmitool lan print, so that the correction of the host node name, the IP address, the role configuration information and the like of the node to be expanded is achieved.
Accordingly, in step S300, the deployment node manages and registers the first configuration information, and the first configuration information may be set by the operation and maintenance personnel. Therefore, in the method, after the operation and maintenance personnel input the first configuration information, namely all information of the main node, the subsequent operation can be automatically completed, and the manual complicated operation is avoided.
Correspondingly, in step S500, the deployment node waits for the management IP of the node to be expanded to be online, and after the deployment node obtains the IP of the node to be expanded to be online, the deployment node sets the node to be expanded to be started based on the hard disk after the expansion is completed.
The deployment node sets PXE starting of the nodes to be expanded through the ipmitool and manages the power state of the nodes to be expanded.
In this embodiment, in step S500, the deployment node also turns off the DNSmasq service, so as to protect the node to be expanded, and avoid abnormal node access and possibly repeated PXE behaviors.
The operation and maintenance management method of the cluster server of the present invention is described below with reference to fig. 3, where step S400 specifically includes the following steps:
s410, starting configuration information of the PXE is generated according to the first configuration information.
And S420, according to the starting configuration information, the to-be-expanded nodes are installed after being expanded based on the PXE execution nodes, and the computer is started.
In this embodiment, the first configuration information is configured as hosts files. Correspondingly, in step S400, the PXE service start script is generated according to the hosts file, HTTP shares BMC _ info, and the to-be-expanded node is set by ipmitool according to BMC information to start from the PXE and start up.
S430, after the node to be expanded is started based on the PXE, second configuration information shared by other nodes and third configuration information of the node to be expanded are acquired, and the third configuration information can be acquired through a built-in ipmitool lan print command. In the method, the third configuration information is BMC information of the node to be expanded.
In this embodiment, the second configuration information of each node in the cluster server is shared based on a hypertext Transfer Protocol (HTTP), and based on the shared second configuration information, the problems that the BMCs of all the cluster servers are repeatedly logged in when the node to be expanded is installed, and operations such as repeatedly configuring a node start item and controlling the node power to be turned on and off are required can be avoided.
And S440, according to the second configuration information and the third configuration information, based on the bmc _ info, automatically correcting the node configuration information of the node to be expanded.
The operation and maintenance management device of the cluster server provided by the present invention is described below, and the operation and maintenance management device of the cluster server described below and the operation and maintenance management method of the cluster server described above may be referred to correspondingly.
The operation and maintenance management device of the cluster server of the present invention is described below with reference to fig. 4, and the device includes:
the first obtaining module 200 is configured to obtain a capacity expansion request of a node to be subjected to capacity expansion.
When the cluster server needs to add a new node to realize capacity expansion, the new node generates a corresponding capacity expansion request, and then the cluster server determines whether the new node is needed. The newly added node may be a computer. For example, when the storage space of all the nodes in the cluster server is saturated, a new node may be added to the cluster system to expand the cluster server.
It should be noted that the newly added node may be at least one node, that is, the operation and maintenance management device of the cluster server of the present invention may be applicable to batch capacity expansion of new nodes of the cluster server.
Specifically, in some possible embodiments, the cluster server may be one computer or a system composed of a plurality of computers.
Optionally, the cluster server in the embodiment of the present invention may further include a backup cluster server, which may replace the failed cluster server when the cluster server fails, and execute an operation of the cluster server, so as to further improve reliability of cluster management.
The second obtaining module 300 is configured to obtain configuration information for capacity expansion according to a capacity expansion request of a node to be subjected to capacity expansion.
Specifically, in this embodiment, the second obtaining module 300 obtains the first configuration information and the second configuration information according to the capacity expansion request of the node to be capacity expanded. In the device, the first configuration information and the second configuration information constitute configuration information for capacity expansion, and the first configuration information is configuration information for capacity expansion of a master node in a cluster server, for example, information such as a master node name, a master node master IP, a master node bmc _ IP, a master node bmc _ user, and a master node bmc _ pass; the second configuration information is BMC information of other nodes in the cluster server, that is, BMC _ info of other nodes.
The correcting module 400 is configured to execute installation after the capacity expansion of the node to be expanded is performed based on the PXE execution node according to the configuration information for capacity expansion, and correct the node configuration information of the node to be expanded. In the apparatus, the node configuration information includes a master node name, an IP address, and BMC information.
In some possible embodiments, the node configuration information further includes role configuration information of the node to be expanded, and the like.
In the correction module 400, by correcting the node configuration information of the nodes to be expanded, when the nodes to be expanded are installed in batch by using the PXE technology, the configuration information of the newly added nodes can be fixed, so that the names and the IP addresses of the host nodes of the node operating system can be associated with the BMC information, and the problem of disorder in the installation process is avoided.
The protection module 500 is configured to obtain node configuration information, and set a node to be expanded to be started based on a hard disk, so as to protect the node to be expanded, and avoid abnormal node access and possibly repeated PXE behaviors.
In this embodiment, the protection measures for the node to be expanded in the protection module 500 further include clearing the pxe configuration file associated with the existing mac address.
Therefore, by using the operation and maintenance management device of the cluster server, the cluster installation and the existing cluster expansion task can be completed by using the PXE technology.
According to the operation and maintenance management device of the cluster server, the configuration information for capacity expansion is obtained, based on the configuration information for capacity expansion, when a PXE technology is used, the node configuration information of the node to be expanded can be automatically corrected to be expected information, accurate capacity expansion and automatic error correction are achieved, the relevant configuration of the node to be expanded is fixed, the name and the IP address of the node operating system main node can be correlated with BMC information, the disorder problem generated in the batch unattended installation process is avoided, the node to be expanded can be accurately and massively expanded in a large-scale cluster server environment, the installation and capacity expansion efficiency of the cluster node can be greatly improved, the operation and maintenance management work of the cluster server is better reduced to the level of minutes from the original hours or working days.
The operation and maintenance management device of the cluster server according to the present invention is described below with reference to fig. 5, and the device further includes the following modules:
the deployment module 100 is configured to deploy an operation and maintenance management environment of the cluster server and initialize the PXE service.
In the device, the node with the operation and maintenance management environment is a deployment node
In some possible embodiments, the operation and maintenance management environment uses DNSmasq, ipmitool, and the like, and completes system installation of the newly added node through the DNSmasq, ipmitool, and the like. The DNSmasq provides DHCP and TFTP services required by the PXE; the operating system image comprises an ipmitool tool and BMC information shared by other nodes acquired according to the ipmitool lan print, so that the correction of the host node name, the IP address, the role configuration information and the like of the node to be expanded is achieved.
Accordingly, the deployment node manages and registers the first configuration information, and the first configuration information can be set by operation and maintenance personnel. Therefore, in the device, after the operation and maintenance personnel input the first configuration information, namely each item of information of the main node, the subsequent operation can be automatically completed, and the manual complicated operation is avoided.
Correspondingly, the deployment node waits for the management IP of the node to be expanded to be on-line, and after the deployment node acquires the IP of the node to be expanded to be on-line, the deployment node sets the node to be expanded to be started based on the hard disk after the expansion is completed.
The deployment node sets PXE starting of the nodes to be expanded through the ipmitool and manages the power state of the nodes to be expanded.
In this embodiment, the deployment node also closes the DNSmasq service, which also aims to protect the node to be expanded and avoid abnormal node access and possibly repeated PXE behaviors.
The operation and maintenance management device of the cluster server of the present invention is described below with reference to fig. 6, where the calibration module specifically includes:
the configuration unit 410 is configured to generate the starting configuration information of the PXE according to the first configuration information.
The starting unit 420 is configured to execute installation and start up, according to the start configuration information, when the capacity expansion node performs capacity expansion based on the PXE execution node.
In this embodiment, the first configuration information is configured as hosts files. Correspondingly, a PXE service starting script is generated according to a hosts file, HTTP shares BMC _ info, the IPmitool is used for setting a node to be expanded according to BMC information to start from the PXE, and the device is started.
The obtaining unit 430 is configured to obtain, after the node to be expanded is started based on the PXE, second configuration information shared by other nodes and third configuration information of the node, where the third configuration information may be obtained through a built-in ipmitool lan print command. In the apparatus, the third configuration information is BMC information of a node to be expanded.
In this embodiment, the second configuration information of each node in the cluster server is shared based on HTTP, and based on the shared second configuration information, the problems that when the node to be expanded is installed, BMCs of all the cluster servers are repeatedly logged in, and operations such as repeatedly configuring a node start item and controlling the node power to be turned on and off are required can be avoided.
And a correcting unit 440, configured to automatically correct the node configuration information of the node to be expanded based on the bmc _ info according to the second configuration information and the third configuration information.
Fig. 7 illustrates a physical structure diagram of an electronic device, and as shown in fig. 7, the electronic device may include: a processor (processor)810, a communication Interface 820, a memory 830 and a communication bus 840, wherein the processor 810, the communication Interface 820 and the memory 830 communicate with each other via the communication bus 840. The processor 810 may call logic instructions in the memory 830 to perform a method for operation and maintenance management of cluster servers, the method comprising the steps of:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes node installation based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
In addition, the logic instructions in the memory 830 may be implemented in software functional units and stored in a computer readable storage medium when the logic instructions are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, the present invention further provides a computer program product, where the computer program product includes a computer program, the computer program may be stored on a non-transitory computer readable storage medium, and when the computer program is executed by a processor, a computer can execute the operation and maintenance management method for a cluster server provided by the above methods, where the method includes the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
In still another aspect, the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, the computer program, when being executed by a processor, implementing an operation and maintenance management method for a cluster server provided by the above methods, the method including the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
Claims (10)
1. An operation and maintenance management method for a cluster server is characterized by comprising the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
2. The operation and maintenance management method of the cluster server according to claim 1, wherein the obtaining of configuration information for capacity expansion according to the capacity expansion request specifically includes:
acquiring first configuration information and second configuration information according to the capacity expansion request; the first configuration information and the second configuration information constitute configuration information for capacity expansion, the first configuration information is configuration information for capacity expansion of a master node in the cluster server, and the second configuration information is baseboard management controller information of other nodes in the cluster server.
3. The operation and maintenance management method of the cluster server according to claim 2, wherein the obtaining of configuration information for capacity expansion according to the capacity expansion request specifically includes:
and sharing the second configuration information of each node in the cluster server based on a hypertext transfer protocol.
4. The operation and maintenance management method of the cluster server according to claim 2 or 3, wherein according to the configuration information for capacity expansion, the node to be expanded executes node capacity expansion based on a pre-boot execution environment, and corrects the node configuration information of the node to be expanded, specifically including the following steps:
generating starting configuration information of a pre-starting execution environment according to the first configuration information;
according to the starting configuration information, the node to be expanded executes node expansion based on a pre-starting execution environment;
after the node to be expanded is started based on the pre-starting execution environment, acquiring the second configuration information and the third configuration information shared by other nodes; the third configuration information is baseboard management controller information of the node to be expanded;
and correcting the node configuration information by the node to be expanded according to the second configuration information and the third configuration information.
5. The operation and maintenance management method of the cluster server according to claim 1, wherein before the step of obtaining the capacity expansion request of the node to be expanded, the method further comprises the following steps:
and deploying the operation and maintenance management environment of the cluster server and initializing the pre-starting execution environment service.
6. The operation and maintenance management method of the cluster server according to claim 1, wherein the obtaining of the node configuration information and the setting of the node to be expanded to be started based on a hard disk after the expansion are completed specifically include:
after the deployment node deploying the operation and maintenance management environment acquires that the internet interconnection protocol address of the node to be expanded is on-line, after the expansion is completed, the deployment node sets the node to be expanded to be started based on a hard disk.
7. An operation and maintenance management device for a cluster server, comprising:
the first acquisition module is used for acquiring a capacity expansion request of a node to be expanded;
the second acquisition module is used for acquiring configuration information for capacity expansion according to the capacity expansion request;
the correction module is used for executing node capacity expansion based on a pre-starting execution environment by the node to be subjected to capacity expansion according to the configuration information for capacity expansion and correcting the node configuration information of the node to be subjected to capacity expansion; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and the protection module is used for acquiring the node configuration information and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
8. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps of the operation and maintenance management method of the cluster server according to any one of claims 1 to 6 when executing the program.
9. A non-transitory computer readable storage medium, having a computer program stored thereon, wherein the computer program, when being executed by a processor, implements the steps of the operation and maintenance management method of a cluster server according to any one of claims 1 to 6.
10. A computer program product comprising a computer program, wherein the computer program when executed by a processor implements the steps of the operation and maintenance management method of a cluster server according to any of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111422292.4A CN114422361A (en) | 2021-11-26 | 2021-11-26 | Operation and maintenance management method, device, equipment and product of cluster server |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111422292.4A CN114422361A (en) | 2021-11-26 | 2021-11-26 | Operation and maintenance management method, device, equipment and product of cluster server |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114422361A true CN114422361A (en) | 2022-04-29 |
Family
ID=81265455
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111422292.4A Pending CN114422361A (en) | 2021-11-26 | 2021-11-26 | Operation and maintenance management method, device, equipment and product of cluster server |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114422361A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106657444A (en) * | 2017-02-24 | 2017-05-10 | 郑州云海信息技术有限公司 | Method and device for configuring IP address of BMC |
US20190281012A1 (en) * | 2018-03-09 | 2019-09-12 | Fujitsu Limited | Information processing apparatus and information processing apparatus management system |
CN110879712A (en) * | 2019-11-07 | 2020-03-13 | 北京浪潮数据技术有限公司 | Cloud data center physical host installation method and related device |
CN111427624A (en) * | 2020-03-20 | 2020-07-17 | 苏州浪潮智能科技有限公司 | Method, device and system for batch automatic deployment and configuration of servers |
CN112434278A (en) * | 2020-11-20 | 2021-03-02 | 北京浪潮数据技术有限公司 | Bare computer authentication method, apparatus, device and medium |
CN112866017A (en) * | 2021-01-08 | 2021-05-28 | 苏州浪潮智能科技有限公司 | Method, system, medium and device for configuring BMC IP address of bare metal server |
CN113268256A (en) * | 2021-06-09 | 2021-08-17 | 中国建设银行股份有限公司 | Batch installation method and device, server and computer storage medium |
-
2021
- 2021-11-26 CN CN202111422292.4A patent/CN114422361A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106657444A (en) * | 2017-02-24 | 2017-05-10 | 郑州云海信息技术有限公司 | Method and device for configuring IP address of BMC |
US20190281012A1 (en) * | 2018-03-09 | 2019-09-12 | Fujitsu Limited | Information processing apparatus and information processing apparatus management system |
CN110879712A (en) * | 2019-11-07 | 2020-03-13 | 北京浪潮数据技术有限公司 | Cloud data center physical host installation method and related device |
CN111427624A (en) * | 2020-03-20 | 2020-07-17 | 苏州浪潮智能科技有限公司 | Method, device and system for batch automatic deployment and configuration of servers |
CN112434278A (en) * | 2020-11-20 | 2021-03-02 | 北京浪潮数据技术有限公司 | Bare computer authentication method, apparatus, device and medium |
CN112866017A (en) * | 2021-01-08 | 2021-05-28 | 苏州浪潮智能科技有限公司 | Method, system, medium and device for configuring BMC IP address of bare metal server |
CN113268256A (en) * | 2021-06-09 | 2021-08-17 | 中国建设银行股份有限公司 | Batch installation method and device, server and computer storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
RU2438168C1 (en) | Method and system for deploying software, software deployment server and user server | |
US9465625B2 (en) | Provisioning of operating environments on a server in a networked environment | |
US6684327B1 (en) | Extensible, flexible, memory efficient technique for network boot without special DHCP/PXE hardware | |
US8332490B2 (en) | Method, apparatus and program product for provisioning a computer system | |
US20090077634A1 (en) | Firmware update method and system using the same | |
CN106572200A (en) | IP address configuration method and IP address configuration device for baseboard management controller BMC | |
CN103200271A (en) | Advanced Risc machine (ARM) server and method of automatic installation system thereof | |
CN105183529A (en) | Method for refreshing server firmware, target server, source server and system | |
CN111786810A (en) | Automatic deployment method and system for large-scale test bed nodes | |
CN105512026A (en) | Automatic batch testing method | |
CN111273924A (en) | Software updating method and device | |
CN111367735B (en) | Test method and system based on server to be tested and Wuban diagram operating system | |
CN102567050B (en) | The method and apparatus of B/S system remote deploying projects | |
CN114115917A (en) | Operating system installation method and device | |
CN113268254A (en) | Cluster system installation method and device, electronic equipment and storage medium | |
CN110187890B (en) | Project deployment method, electronic equipment and storage medium | |
US20170034120A1 (en) | Network device setting method and information processing device | |
CN110688130A (en) | Physical machine deployment method, physical machine deployment device, readable storage medium and electronic equipment | |
CN114422361A (en) | Operation and maintenance management method, device, equipment and product of cluster server | |
CN110633086B (en) | Blade server | |
CN113608932B (en) | Database drilling method, device, equipment and storage medium | |
CN112256289A (en) | Automatic deployment method, device and equipment | |
CN106506276A (en) | A kind of information detecting method for server | |
CN109254782B (en) | Operating system installation method and device | |
CN112363737A (en) | System installation method and related device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |