CN114422361A - Operation and maintenance management method, device, equipment and product of cluster server - Google Patents

Operation and maintenance management method, device, equipment and product of cluster server Download PDF

Info

Publication number
CN114422361A
CN114422361A CN202111422292.4A CN202111422292A CN114422361A CN 114422361 A CN114422361 A CN 114422361A CN 202111422292 A CN202111422292 A CN 202111422292A CN 114422361 A CN114422361 A CN 114422361A
Authority
CN
China
Prior art keywords
node
configuration information
expanded
capacity expansion
cluster server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111422292.4A
Other languages
Chinese (zh)
Inventor
刘庆
厉肃
郭晨曦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Communication Technology Co Ltd
Original Assignee
Inspur Communication Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Communication Technology Co Ltd filed Critical Inspur Communication Technology Co Ltd
Priority to CN202111422292.4A priority Critical patent/CN114422361A/en
Publication of CN114422361A publication Critical patent/CN114422361A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0803Configuration setting
    • H04L41/0823Configuration setting characterised by the purposes of a change of settings, e.g. optimising configuration for enhancing reliability
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/4401Bootstrapping
    • G06F9/4416Network booting; Remote initial program loading [RIPL]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/06Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides an operation and maintenance management method, device, equipment and product of a cluster server, relating to the technical field of communication, wherein the method comprises the following steps: acquiring a capacity expansion request of a node to be expanded; acquiring configuration information for capacity expansion according to the capacity expansion request; according to the configuration information for capacity expansion, starting the node to be expanded based on a pre-starting execution environment, and correcting the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name and an internet protocol address; and acquiring the node configuration information, and setting the node to be expanded to be started based on a hard disk. The invention can fix the relevant configuration of the nodes to be expanded, so that the names and IP addresses of the main nodes of the node operating system can be correlated with the BMC information, the problem of disorder in the installation process is avoided, and the nodes to be expanded can be accurately expanded in batch in a large-scale cluster server environment.

Description

Operation and maintenance management method, device, equipment and product of cluster server
Technical Field
The present invention relates to the field of communications technologies, and in particular, to an operation and maintenance management method, apparatus, device, and product for a cluster server.
Background
In the field of software application, installation and capacity expansion of a software cluster based on a physical server cluster are required to have large-scale deployment and capacity expansion capabilities. In the aspect of deployment and capacity expansion of a cluster server, a Preboot execution Environment (PXE) supports a workstation to download an image from a remote server through a network and thus supports starting an operating system through the network, in the starting process, a terminal requires the server to allocate an Internet Protocol (IP) address, then downloads a starting software package into a local memory for execution, and the starting software package completes the setting of basic software of the terminal (client), so as to guide the terminal operating system pre-installed in the server.
However, at present, due to uncertainty generated by PXE during batch installation, the installation order of cluster nodes is disordered, and for traditional software to be installed and expanded on the cluster nodes, a Baseboard Management Controller (BMC) of all cluster servers needs to be logged in many times, and operations such as configuring node startup items and controlling the node power to be turned on and off need to be performed many times. Therefore, the existing operation and maintenance management method for the cluster server causes that the installation and capacity expansion efficiency of the cluster server is very limited.
Disclosure of Invention
The invention provides an operation and maintenance management method, device, equipment and product of a cluster server, which are used for solving the problems of disorder and repeated operation generated when PXE is used for installing and expanding cluster server nodes in the prior art and realizing accurate batch expansion of nodes to be expanded in a large-scale cluster server environment.
The invention provides an operation and maintenance management method of a cluster server, which comprises the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
According to the operation and maintenance management method for the cluster server provided by the present invention, the obtaining of the configuration information for capacity expansion according to the capacity expansion request specifically includes:
acquiring first configuration information and second configuration information according to the capacity expansion request; the first configuration information and the second configuration information constitute configuration information for capacity expansion, the first configuration information is configuration information for capacity expansion of a master node in the cluster server, and the second configuration information is baseboard management controller information of other nodes in the cluster server.
According to the operation and maintenance management method for the cluster server provided by the present invention, the obtaining of the configuration information for capacity expansion according to the capacity expansion request specifically includes:
and sharing the second configuration information of each node in the cluster server based on a hypertext transfer protocol.
According to the operation and maintenance management method of the cluster server provided by the present invention, according to the configuration information for capacity expansion, the node to be expanded executes node capacity expansion based on a pre-boot execution environment, and corrects the node configuration information of the node to be expanded, specifically including the following steps:
generating starting configuration information of a pre-starting execution environment according to the first configuration information;
according to the starting configuration information, the node to be expanded executes node expansion based on a pre-starting execution environment;
after the node to be expanded is started based on the pre-starting execution environment, acquiring the second configuration information and the third configuration information shared by other nodes; the third configuration information is baseboard management controller information of the node to be expanded;
and correcting the node configuration information by the node to be expanded according to the second configuration information and the third configuration information.
According to the operation and maintenance management method of the cluster server provided by the invention, before the step of obtaining the capacity expansion request of the node to be expanded, the method further comprises the following steps:
and deploying the operation and maintenance management environment of the cluster server and initializing the pre-starting execution environment service.
According to the operation and maintenance management method of the cluster server provided by the invention, the obtaining of the node configuration information and the setting of the node to be expanded to be started based on the hard disk after the expansion are completed specifically include:
after the deployment node deploying the operation and maintenance management environment acquires that the internet interconnection protocol address of the node to be expanded is on-line, after the expansion is completed, the deployment node sets the node to be expanded to be started based on a hard disk.
The invention also provides an operation and maintenance management device of the cluster server, which comprises:
the first acquisition module is used for acquiring a capacity expansion request of a node to be expanded;
the second acquisition module is used for acquiring configuration information for capacity expansion according to the capacity expansion request;
the correction module is used for executing node capacity expansion based on a pre-starting execution environment by the node to be subjected to capacity expansion according to the configuration information for capacity expansion and correcting the node configuration information of the node to be subjected to capacity expansion; wherein the node configuration information comprises an internet protocol address;
and the protection module is used for acquiring the node configuration information and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
The invention further provides an electronic device, which includes a memory, a processor and a computer program stored on the memory and capable of running on the processor, and when the processor executes the program, the steps of the operation and maintenance management method of the cluster server are implemented.
The present invention also provides a non-transitory computer readable storage medium, on which a computer program is stored, which, when being executed by a processor, implements the steps of the operation and maintenance management method of the cluster server according to any one of the above.
The present invention also provides a computer program product, which includes a computer program, and when the computer program is executed by a processor, the computer program implements the steps of the operation and maintenance management method for the cluster server as described in any one of the above.
According to the operation and maintenance management method, device, equipment and product of the cluster server, the configuration information for capacity expansion is obtained, and based on the configuration information for capacity expansion, when a PXE technology is used, the node configuration information of the node to be expanded can be automatically corrected to be expected information, accurate capacity expansion and automatic error correction are achieved, the relevant configuration of the node to be expanded is fixed, the name and the IP address of the node operating system main node can be correlated with BMC information, the disorder problem generated in the batch unattended installation process is avoided, the node to be expanded can be accurately and massively expanded in a large-scale cluster server environment, the installation and capacity expansion efficiency of the cluster node can be greatly improved, the operation and maintenance management work of the cluster server is better reduced to the minute level from several hours or several working days originally.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
FIG. 1 is a schematic flow chart of an operation and maintenance management method for a cluster server provided in the present invention;
FIG. 2 is a second flowchart illustrating an operation and maintenance management method for a cluster server according to the present invention;
fig. 3 is a schematic flowchart of step S400 in the operation and maintenance management method for a cluster server provided in the present invention;
fig. 4 is a schematic structural diagram of an operation and maintenance management apparatus of a cluster server provided in the present invention;
fig. 5 is a second schematic structural diagram of an operation and maintenance management apparatus of a cluster server provided in the present invention;
fig. 6 is a schematic structural diagram of a correction module in the operation and maintenance management method for a cluster server provided by the present invention;
fig. 7 is a schematic structural diagram of an electronic device provided by the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the prior art, when a cluster server installs a system at an open office or expands a new node in a batch for an existing cluster, if a PXE technology and a Dynamic Host Configuration Protocol (DHCP) are used by the cluster server, since an IP address of a Host node of an operating system and BMC information cannot be associated with each other, uncertainty occurs when the PXE is installed in a batch, and a problem of a disordered installation sequence of the cluster nodes occurs, where the uncertainty is, for example, different hardware configurations of nodes and is assigned to different roles of the cluster server; and the IP addresses of the network planning and the actual cluster servers after installation are inconsistent.
Based on the above problems, the present invention aims to provide a solution that can perform precise batch capacity expansion in a large-scale cluster server environment, and can fix the relevant configuration of new nodes to be expanded according to capacity expansion planning information such as BMC, etc., so as to solve the problem of disordered node installation in a cluster, so that the installation and capacity expansion efficiency of cluster nodes can be greatly improved, and the operation and maintenance management work of the cluster server can be better performed by reducing the operation and maintenance efficiency to a minute level from several hours or several working days originally.
The operation and maintenance management method of the cluster server of the present invention is described below with reference to fig. 1, and the method includes the following steps:
s200, acquiring a capacity expansion request of a node to be expanded.
When the cluster server needs to add a new node to realize capacity expansion, the new node generates a corresponding capacity expansion request, and then the cluster server determines whether the new node is needed. The newly added node may be a computer. For example, when the storage space of all the nodes in the cluster server is saturated, a new node may be added to the cluster system to expand the cluster server.
It should be noted that the newly added node may be at least one node, that is, the operation and maintenance management method of the cluster server of the present invention may be applicable to batch capacity expansion of new nodes of the cluster server.
Specifically, in some possible embodiments, the cluster server may be one computer or a system composed of a plurality of computers.
Optionally, the cluster server in the embodiment of the present invention may further include a backup cluster server, which may replace the failed cluster server when the cluster server fails, and execute an operation of the cluster server, so as to further improve reliability of cluster management.
S300, according to the capacity expansion request of the node to be expanded, obtaining configuration information for capacity expansion.
Specifically, in this embodiment, in step S300, the first configuration information and the second configuration information are obtained according to the capacity expansion request of the node to be capacity expanded. In the method, first configuration information and second configuration information constitute configuration information for capacity expansion, and the first configuration information is configuration information for capacity expansion of a master node in a cluster server, for example, information such as a master node name, a master node master IP, a master node bmc _ IP, a master node bmc _ user, and a master node bmc _ pass; the second configuration information is BMC information of other nodes in the cluster server, that is, BMC _ info of other nodes.
S400, according to the configuration information for capacity expansion, the nodes to be expanded execute installation after capacity expansion is performed on the basis of the PXE execution nodes, and the node configuration information of the nodes to be expanded is corrected. In the method, the node configuration information includes a master node name, an IP address, and BMC information.
In some possible embodiments, the node configuration information further includes role configuration information of the node to be expanded, and the like.
In step S400, by correcting the node configuration information of the nodes to be expanded, when the nodes to be expanded are installed in batch by using the PXE technology, the configuration information of these newly added nodes can be fixed, so that the node operating system host node name and IP address can be associated with the BMC information, thereby avoiding the problem of out-of-order in the installation process.
S500, obtaining node configuration information, and setting the nodes to be expanded to be started based on the hard disk after expansion is completed so as to protect the nodes to be expanded and avoid abnormal node access and possibly repeated PXE behaviors.
In this embodiment, the protection measures for the node to be expanded in step S500 further include clearing the pxe configuration file associated with the existing mac address.
Based on the operation and maintenance management method of the cluster server, the PXE technology can be utilized to complete cluster installation and the existing cluster expansion task.
According to the operation and maintenance management method of the cluster server, the configuration information for capacity expansion is obtained, based on the configuration information for capacity expansion, when a PXE technology is used, the node configuration information of the node to be expanded can be automatically corrected to be expected information, accurate capacity expansion and automatic error correction are achieved, the relevant configuration of the node to be expanded is fixed, the name and the IP address of the node operating system main node can be correlated with BMC information, the disorder problem generated in the batch unattended installation process is avoided, the node to be expanded can be accurately and massively expanded in a large-scale cluster server environment, the installation and capacity expansion efficiency of the cluster node can be greatly improved, the operation and maintenance management work of the cluster server is better reduced to the minute level from the original hours or working days.
The operation and maintenance management method of the cluster server according to the present invention is described below with reference to fig. 2, where the method further includes the following steps before step S200:
s100, deploying an operation and maintenance management environment of the cluster server and initializing PXE service.
In the method, the nodes with the operation and maintenance management environment are deployment nodes.
In some possible embodiments, the operation and maintenance management environment uses DNSmasq, ipmitool, and the like, and completes system installation of the newly added node through the DNSmasq, ipmitool, and the like. The DNSmasq provides DHCP and simple File Transfer Protocol (TFTP) services required by PXE; the operating system image comprises an ipmitool tool and BMC information shared by other nodes acquired according to the ipmitool lan print, so that the correction of the host node name, the IP address, the role configuration information and the like of the node to be expanded is achieved.
Accordingly, in step S300, the deployment node manages and registers the first configuration information, and the first configuration information may be set by the operation and maintenance personnel. Therefore, in the method, after the operation and maintenance personnel input the first configuration information, namely all information of the main node, the subsequent operation can be automatically completed, and the manual complicated operation is avoided.
Correspondingly, in step S500, the deployment node waits for the management IP of the node to be expanded to be online, and after the deployment node obtains the IP of the node to be expanded to be online, the deployment node sets the node to be expanded to be started based on the hard disk after the expansion is completed.
The deployment node sets PXE starting of the nodes to be expanded through the ipmitool and manages the power state of the nodes to be expanded.
In this embodiment, in step S500, the deployment node also turns off the DNSmasq service, so as to protect the node to be expanded, and avoid abnormal node access and possibly repeated PXE behaviors.
The operation and maintenance management method of the cluster server of the present invention is described below with reference to fig. 3, where step S400 specifically includes the following steps:
s410, starting configuration information of the PXE is generated according to the first configuration information.
And S420, according to the starting configuration information, the to-be-expanded nodes are installed after being expanded based on the PXE execution nodes, and the computer is started.
In this embodiment, the first configuration information is configured as hosts files. Correspondingly, in step S400, the PXE service start script is generated according to the hosts file, HTTP shares BMC _ info, and the to-be-expanded node is set by ipmitool according to BMC information to start from the PXE and start up.
S430, after the node to be expanded is started based on the PXE, second configuration information shared by other nodes and third configuration information of the node to be expanded are acquired, and the third configuration information can be acquired through a built-in ipmitool lan print command. In the method, the third configuration information is BMC information of the node to be expanded.
In this embodiment, the second configuration information of each node in the cluster server is shared based on a hypertext Transfer Protocol (HTTP), and based on the shared second configuration information, the problems that the BMCs of all the cluster servers are repeatedly logged in when the node to be expanded is installed, and operations such as repeatedly configuring a node start item and controlling the node power to be turned on and off are required can be avoided.
And S440, according to the second configuration information and the third configuration information, based on the bmc _ info, automatically correcting the node configuration information of the node to be expanded.
The operation and maintenance management device of the cluster server provided by the present invention is described below, and the operation and maintenance management device of the cluster server described below and the operation and maintenance management method of the cluster server described above may be referred to correspondingly.
The operation and maintenance management device of the cluster server of the present invention is described below with reference to fig. 4, and the device includes:
the first obtaining module 200 is configured to obtain a capacity expansion request of a node to be subjected to capacity expansion.
When the cluster server needs to add a new node to realize capacity expansion, the new node generates a corresponding capacity expansion request, and then the cluster server determines whether the new node is needed. The newly added node may be a computer. For example, when the storage space of all the nodes in the cluster server is saturated, a new node may be added to the cluster system to expand the cluster server.
It should be noted that the newly added node may be at least one node, that is, the operation and maintenance management device of the cluster server of the present invention may be applicable to batch capacity expansion of new nodes of the cluster server.
Specifically, in some possible embodiments, the cluster server may be one computer or a system composed of a plurality of computers.
Optionally, the cluster server in the embodiment of the present invention may further include a backup cluster server, which may replace the failed cluster server when the cluster server fails, and execute an operation of the cluster server, so as to further improve reliability of cluster management.
The second obtaining module 300 is configured to obtain configuration information for capacity expansion according to a capacity expansion request of a node to be subjected to capacity expansion.
Specifically, in this embodiment, the second obtaining module 300 obtains the first configuration information and the second configuration information according to the capacity expansion request of the node to be capacity expanded. In the device, the first configuration information and the second configuration information constitute configuration information for capacity expansion, and the first configuration information is configuration information for capacity expansion of a master node in a cluster server, for example, information such as a master node name, a master node master IP, a master node bmc _ IP, a master node bmc _ user, and a master node bmc _ pass; the second configuration information is BMC information of other nodes in the cluster server, that is, BMC _ info of other nodes.
The correcting module 400 is configured to execute installation after the capacity expansion of the node to be expanded is performed based on the PXE execution node according to the configuration information for capacity expansion, and correct the node configuration information of the node to be expanded. In the apparatus, the node configuration information includes a master node name, an IP address, and BMC information.
In some possible embodiments, the node configuration information further includes role configuration information of the node to be expanded, and the like.
In the correction module 400, by correcting the node configuration information of the nodes to be expanded, when the nodes to be expanded are installed in batch by using the PXE technology, the configuration information of the newly added nodes can be fixed, so that the names and the IP addresses of the host nodes of the node operating system can be associated with the BMC information, and the problem of disorder in the installation process is avoided.
The protection module 500 is configured to obtain node configuration information, and set a node to be expanded to be started based on a hard disk, so as to protect the node to be expanded, and avoid abnormal node access and possibly repeated PXE behaviors.
In this embodiment, the protection measures for the node to be expanded in the protection module 500 further include clearing the pxe configuration file associated with the existing mac address.
Therefore, by using the operation and maintenance management device of the cluster server, the cluster installation and the existing cluster expansion task can be completed by using the PXE technology.
According to the operation and maintenance management device of the cluster server, the configuration information for capacity expansion is obtained, based on the configuration information for capacity expansion, when a PXE technology is used, the node configuration information of the node to be expanded can be automatically corrected to be expected information, accurate capacity expansion and automatic error correction are achieved, the relevant configuration of the node to be expanded is fixed, the name and the IP address of the node operating system main node can be correlated with BMC information, the disorder problem generated in the batch unattended installation process is avoided, the node to be expanded can be accurately and massively expanded in a large-scale cluster server environment, the installation and capacity expansion efficiency of the cluster node can be greatly improved, the operation and maintenance management work of the cluster server is better reduced to the level of minutes from the original hours or working days.
The operation and maintenance management device of the cluster server according to the present invention is described below with reference to fig. 5, and the device further includes the following modules:
the deployment module 100 is configured to deploy an operation and maintenance management environment of the cluster server and initialize the PXE service.
In the device, the node with the operation and maintenance management environment is a deployment node
In some possible embodiments, the operation and maintenance management environment uses DNSmasq, ipmitool, and the like, and completes system installation of the newly added node through the DNSmasq, ipmitool, and the like. The DNSmasq provides DHCP and TFTP services required by the PXE; the operating system image comprises an ipmitool tool and BMC information shared by other nodes acquired according to the ipmitool lan print, so that the correction of the host node name, the IP address, the role configuration information and the like of the node to be expanded is achieved.
Accordingly, the deployment node manages and registers the first configuration information, and the first configuration information can be set by operation and maintenance personnel. Therefore, in the device, after the operation and maintenance personnel input the first configuration information, namely each item of information of the main node, the subsequent operation can be automatically completed, and the manual complicated operation is avoided.
Correspondingly, the deployment node waits for the management IP of the node to be expanded to be on-line, and after the deployment node acquires the IP of the node to be expanded to be on-line, the deployment node sets the node to be expanded to be started based on the hard disk after the expansion is completed.
The deployment node sets PXE starting of the nodes to be expanded through the ipmitool and manages the power state of the nodes to be expanded.
In this embodiment, the deployment node also closes the DNSmasq service, which also aims to protect the node to be expanded and avoid abnormal node access and possibly repeated PXE behaviors.
The operation and maintenance management device of the cluster server of the present invention is described below with reference to fig. 6, where the calibration module specifically includes:
the configuration unit 410 is configured to generate the starting configuration information of the PXE according to the first configuration information.
The starting unit 420 is configured to execute installation and start up, according to the start configuration information, when the capacity expansion node performs capacity expansion based on the PXE execution node.
In this embodiment, the first configuration information is configured as hosts files. Correspondingly, a PXE service starting script is generated according to a hosts file, HTTP shares BMC _ info, the IPmitool is used for setting a node to be expanded according to BMC information to start from the PXE, and the device is started.
The obtaining unit 430 is configured to obtain, after the node to be expanded is started based on the PXE, second configuration information shared by other nodes and third configuration information of the node, where the third configuration information may be obtained through a built-in ipmitool lan print command. In the apparatus, the third configuration information is BMC information of a node to be expanded.
In this embodiment, the second configuration information of each node in the cluster server is shared based on HTTP, and based on the shared second configuration information, the problems that when the node to be expanded is installed, BMCs of all the cluster servers are repeatedly logged in, and operations such as repeatedly configuring a node start item and controlling the node power to be turned on and off are required can be avoided.
And a correcting unit 440, configured to automatically correct the node configuration information of the node to be expanded based on the bmc _ info according to the second configuration information and the third configuration information.
Fig. 7 illustrates a physical structure diagram of an electronic device, and as shown in fig. 7, the electronic device may include: a processor (processor)810, a communication Interface 820, a memory 830 and a communication bus 840, wherein the processor 810, the communication Interface 820 and the memory 830 communicate with each other via the communication bus 840. The processor 810 may call logic instructions in the memory 830 to perform a method for operation and maintenance management of cluster servers, the method comprising the steps of:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes node installation based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
In addition, the logic instructions in the memory 830 may be implemented in software functional units and stored in a computer readable storage medium when the logic instructions are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, the present invention further provides a computer program product, where the computer program product includes a computer program, the computer program may be stored on a non-transitory computer readable storage medium, and when the computer program is executed by a processor, a computer can execute the operation and maintenance management method for a cluster server provided by the above methods, where the method includes the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
In still another aspect, the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, the computer program, when being executed by a processor, implementing an operation and maintenance management method for a cluster server provided by the above methods, the method including the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. An operation and maintenance management method for a cluster server is characterized by comprising the following steps:
acquiring a capacity expansion request of a node to be expanded;
acquiring configuration information for capacity expansion according to the capacity expansion request;
according to the configuration information for capacity expansion, the node to be expanded executes the capacity expansion of the node based on a pre-starting execution environment, and corrects the node configuration information of the node to be expanded; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and acquiring the node configuration information, and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
2. The operation and maintenance management method of the cluster server according to claim 1, wherein the obtaining of configuration information for capacity expansion according to the capacity expansion request specifically includes:
acquiring first configuration information and second configuration information according to the capacity expansion request; the first configuration information and the second configuration information constitute configuration information for capacity expansion, the first configuration information is configuration information for capacity expansion of a master node in the cluster server, and the second configuration information is baseboard management controller information of other nodes in the cluster server.
3. The operation and maintenance management method of the cluster server according to claim 2, wherein the obtaining of configuration information for capacity expansion according to the capacity expansion request specifically includes:
and sharing the second configuration information of each node in the cluster server based on a hypertext transfer protocol.
4. The operation and maintenance management method of the cluster server according to claim 2 or 3, wherein according to the configuration information for capacity expansion, the node to be expanded executes node capacity expansion based on a pre-boot execution environment, and corrects the node configuration information of the node to be expanded, specifically including the following steps:
generating starting configuration information of a pre-starting execution environment according to the first configuration information;
according to the starting configuration information, the node to be expanded executes node expansion based on a pre-starting execution environment;
after the node to be expanded is started based on the pre-starting execution environment, acquiring the second configuration information and the third configuration information shared by other nodes; the third configuration information is baseboard management controller information of the node to be expanded;
and correcting the node configuration information by the node to be expanded according to the second configuration information and the third configuration information.
5. The operation and maintenance management method of the cluster server according to claim 1, wherein before the step of obtaining the capacity expansion request of the node to be expanded, the method further comprises the following steps:
and deploying the operation and maintenance management environment of the cluster server and initializing the pre-starting execution environment service.
6. The operation and maintenance management method of the cluster server according to claim 1, wherein the obtaining of the node configuration information and the setting of the node to be expanded to be started based on a hard disk after the expansion are completed specifically include:
after the deployment node deploying the operation and maintenance management environment acquires that the internet interconnection protocol address of the node to be expanded is on-line, after the expansion is completed, the deployment node sets the node to be expanded to be started based on a hard disk.
7. An operation and maintenance management device for a cluster server, comprising:
the first acquisition module is used for acquiring a capacity expansion request of a node to be expanded;
the second acquisition module is used for acquiring configuration information for capacity expansion according to the capacity expansion request;
the correction module is used for executing node capacity expansion based on a pre-starting execution environment by the node to be subjected to capacity expansion according to the configuration information for capacity expansion and correcting the node configuration information of the node to be subjected to capacity expansion; wherein the node configuration information comprises a master node name, an internet protocol address and substrate controller information;
and the protection module is used for acquiring the node configuration information and setting the node to be expanded to be started based on the hard disk after the expansion is completed.
8. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps of the operation and maintenance management method of the cluster server according to any one of claims 1 to 6 when executing the program.
9. A non-transitory computer readable storage medium, having a computer program stored thereon, wherein the computer program, when being executed by a processor, implements the steps of the operation and maintenance management method of a cluster server according to any one of claims 1 to 6.
10. A computer program product comprising a computer program, wherein the computer program when executed by a processor implements the steps of the operation and maintenance management method of a cluster server according to any of claims 1 to 6.
CN202111422292.4A 2021-11-26 2021-11-26 Operation and maintenance management method, device, equipment and product of cluster server Pending CN114422361A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111422292.4A CN114422361A (en) 2021-11-26 2021-11-26 Operation and maintenance management method, device, equipment and product of cluster server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111422292.4A CN114422361A (en) 2021-11-26 2021-11-26 Operation and maintenance management method, device, equipment and product of cluster server

Publications (1)

Publication Number Publication Date
CN114422361A true CN114422361A (en) 2022-04-29

Family

ID=81265455

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111422292.4A Pending CN114422361A (en) 2021-11-26 2021-11-26 Operation and maintenance management method, device, equipment and product of cluster server

Country Status (1)

Country Link
CN (1) CN114422361A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106657444A (en) * 2017-02-24 2017-05-10 郑州云海信息技术有限公司 Method and device for configuring IP address of BMC
US20190281012A1 (en) * 2018-03-09 2019-09-12 Fujitsu Limited Information processing apparatus and information processing apparatus management system
CN110879712A (en) * 2019-11-07 2020-03-13 北京浪潮数据技术有限公司 Cloud data center physical host installation method and related device
CN111427624A (en) * 2020-03-20 2020-07-17 苏州浪潮智能科技有限公司 Method, device and system for batch automatic deployment and configuration of servers
CN112434278A (en) * 2020-11-20 2021-03-02 北京浪潮数据技术有限公司 Bare computer authentication method, apparatus, device and medium
CN112866017A (en) * 2021-01-08 2021-05-28 苏州浪潮智能科技有限公司 Method, system, medium and device for configuring BMC IP address of bare metal server
CN113268256A (en) * 2021-06-09 2021-08-17 中国建设银行股份有限公司 Batch installation method and device, server and computer storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106657444A (en) * 2017-02-24 2017-05-10 郑州云海信息技术有限公司 Method and device for configuring IP address of BMC
US20190281012A1 (en) * 2018-03-09 2019-09-12 Fujitsu Limited Information processing apparatus and information processing apparatus management system
CN110879712A (en) * 2019-11-07 2020-03-13 北京浪潮数据技术有限公司 Cloud data center physical host installation method and related device
CN111427624A (en) * 2020-03-20 2020-07-17 苏州浪潮智能科技有限公司 Method, device and system for batch automatic deployment and configuration of servers
CN112434278A (en) * 2020-11-20 2021-03-02 北京浪潮数据技术有限公司 Bare computer authentication method, apparatus, device and medium
CN112866017A (en) * 2021-01-08 2021-05-28 苏州浪潮智能科技有限公司 Method, system, medium and device for configuring BMC IP address of bare metal server
CN113268256A (en) * 2021-06-09 2021-08-17 中国建设银行股份有限公司 Batch installation method and device, server and computer storage medium

Similar Documents

Publication Publication Date Title
RU2438168C1 (en) Method and system for deploying software, software deployment server and user server
US9465625B2 (en) Provisioning of operating environments on a server in a networked environment
US6684327B1 (en) Extensible, flexible, memory efficient technique for network boot without special DHCP/PXE hardware
US8332490B2 (en) Method, apparatus and program product for provisioning a computer system
US20090077634A1 (en) Firmware update method and system using the same
CN106572200A (en) IP address configuration method and IP address configuration device for baseboard management controller BMC
CN103200271A (en) Advanced Risc machine (ARM) server and method of automatic installation system thereof
CN105183529A (en) Method for refreshing server firmware, target server, source server and system
CN111786810A (en) Automatic deployment method and system for large-scale test bed nodes
CN105512026A (en) Automatic batch testing method
CN111273924A (en) Software updating method and device
CN111367735B (en) Test method and system based on server to be tested and Wuban diagram operating system
CN102567050B (en) The method and apparatus of B/S system remote deploying projects
CN114115917A (en) Operating system installation method and device
CN113268254A (en) Cluster system installation method and device, electronic equipment and storage medium
CN110187890B (en) Project deployment method, electronic equipment and storage medium
US20170034120A1 (en) Network device setting method and information processing device
CN110688130A (en) Physical machine deployment method, physical machine deployment device, readable storage medium and electronic equipment
CN114422361A (en) Operation and maintenance management method, device, equipment and product of cluster server
CN110633086B (en) Blade server
CN113608932B (en) Database drilling method, device, equipment and storage medium
CN112256289A (en) Automatic deployment method, device and equipment
CN106506276A (en) A kind of information detecting method for server
CN109254782B (en) Operating system installation method and device
CN112363737A (en) System installation method and related device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination