CN114443439B - ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium - Google Patents

ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium Download PDF

Info

Publication number
CN114443439B
CN114443439B CN202210112986.6A CN202210112986A CN114443439B CN 114443439 B CN114443439 B CN 114443439B CN 202210112986 A CN202210112986 A CN 202210112986A CN 114443439 B CN114443439 B CN 114443439B
Authority
CN
China
Prior art keywords
service
service board
arm server
bmc
board
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210112986.6A
Other languages
Chinese (zh)
Other versions
CN114443439A (en
Inventor
黄玉秀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202210112986.6A priority Critical patent/CN114443439B/en
Publication of CN114443439A publication Critical patent/CN114443439A/en
Application granted granted Critical
Publication of CN114443439B publication Critical patent/CN114443439B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The disclosure provides a control method, a control device and a control storage medium based on an ARM server, and relates to the technical field of computers, in particular to the technical field of cloud computing. The specific implementation scheme is as follows: responding to a service board processing instruction, determining a first ARM server in an ARM server cluster of an advanced reduced instruction set processor and a target service board in the first ARM server; and controlling a baseboard management controller BMC in the first ARM server to process the target service board. So as to improve the operation and maintenance management efficiency of the ARM server cluster.

Description

ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium
Technical Field
The disclosure relates to the technical field of computers, in particular to the technical field of cloud computing, and specifically relates to a control method, a control device, control equipment and a storage medium based on an ARM server.
Background
With the development of computer technology, the application range of the cluster hardware server (i.e. the ARM server cluster) based on the ARM architecture is wider and wider, and how to efficiently complete the operation and maintenance management of the ARM server cluster in the process of the operation of the ARM server cluster is a current problem to be solved.
Disclosure of Invention
The disclosure provides a control method, a control device and a control device based on an ARM server, and a storage medium.
According to an aspect of the present disclosure, there is provided an ARM server-based control method, including:
responding to a service board processing instruction, determining a first ARM server in an ARM server cluster of an advanced reduced instruction set processor and a target service board in the first ARM server;
and controlling the BMC in the first ARM server to process the target service board.
According to another aspect of the present disclosure, there is provided an electronic device including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein, the liquid crystal display device comprises a liquid crystal display device,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the ARM server-based control method of any of the embodiments of the present disclosure.
According to another aspect of the present disclosure, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to execute the ARM server-based control method of any of the embodiments of the present disclosure.
According to the scheme, the operation and maintenance management efficiency of the ARM server cluster can be improved.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the disclosure, nor is it intended to be used to limit the scope of the disclosure. Other features of the present disclosure will become apparent from the following specification.
Drawings
The drawings are for a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
FIG. 1 is a flow chart of an ARM server-based control method provided in accordance with an embodiment of the present disclosure;
FIG. 2 is a flow chart of an ARM server-based control method provided in accordance with an embodiment of the present disclosure;
FIG. 3 is a flow chart of an ARM server-based control method provided in accordance with an embodiment of the present disclosure;
FIG. 4 is a schematic diagram of a BMC interacting with a service board according to an embodiment of the present disclosure;
FIG. 5 is a flow chart of an ARM server-based control method provided in accordance with an embodiment of the present disclosure;
FIG. 6 is a diagram of an ARM server-based management architecture provided in accordance with an embodiment of the present disclosure;
fig. 7 is a schematic structural diagram of an ARM server-based control device according to an embodiment of the present disclosure;
Fig. 8 is a block diagram of an electronic device for implementing an ARM server-based control method of an embodiment of the present disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and should be considered as merely exemplary. Accordingly, one of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Before describing the embodiments of the present disclosure, a simple description will be given of the deployment situation of elements inside each ARM server in the ARM server cluster of the embodiments of the present disclosure. Each ARM server of the present embodiment includes a plurality of service boards (i.e., core boards for executing different service services) in addition to conventional server components, such as a central processing unit (central processing unit, CPU), memory, hard disk, fan, chassis, etc., and a BMC (Baseboard Management Controller ) disposed on a set of individual ARM chips of the ARM server. The BMC integrates an embedded operating system in the ARM chip, is connected to a switch of a working environment through a standard RJ45 interface, and allows a client to remotely manage through the network no matter what state the ARM server is in (such as startup, shutdown or restarting) as long as the ARM server is connected to the network and the ARM server is not disconnected from the power supply. Briefly, the BMC may be considered a high-level remote KVM (Keyboard Video Mouse, keyboard, video and mouse) system, may display ARM server display information locally, and control or operate the ARM server using the local keyboard and mouse, and may map and load local optical disk images as virtual drives into the ARM server. Specifically, the BMC comprises out-of-band management software and a hardware module, wherein the out-of-band management software and the hardware module interact through a REST API interface to complete the work of setting a bottom basic input output system (Basic Input Output System, BIOS), remote Console management, operating system installation, hardware state real-time monitoring, remote power-on and power-off of a server, service system reset, network and time zone setting, access control strategy, log and fault alarm, management software and firmware upgrading and the like, and realize the remote control and management of a service system after the completion of the operating system installation.
Fig. 1 is a flowchart of an ARM server-based control method provided according to an embodiment of the present disclosure. The embodiment of the disclosure is suitable for the situation of managing the service boards deployed in all ARM servers in the ARM server cluster. The method may be performed by an ARM server based control device, which may be implemented in software and/or hardware. Preferably, the apparatus may be a BMC console configured in an electronic device. As shown in fig. 1, the ARM server-based control method provided in this embodiment may include:
s101, responding to a service board processing instruction, and determining a first ARM server in an ARM server cluster and a target service board in the first ARM server.
The service board processing instruction may be an instruction for processing a service board in an ARM server, for example, an instruction for restarting, powering off, upgrading, detecting a state, querying information, and the like, which may be an instruction for processing a service board. Optionally, the service board processing instruction at least includes: related information of the service board to be processed, such as identification information of the service board to be processed, service associated with the service board to be processed, or ARM server identification information (such as ARM server serial number or Internet protocol (Internet Protocol Address, IP) address) to which the service board to be processed belongs, and the like.
Alternatively, the service board processing instructions may be generated based on user operation triggers on the BMC console. For example, if the user has a service board processing requirement (such as a requirement of power-off for a certain service board), the user can select a service board to be processed on the BMC console, input the processing requirement for the service board, and click the submit button to trigger generation of a service board processing instruction. The service board processing instruction can also be generated by the BMC console according to a default rule of the system. For example, the BMC console system defaults to perform state detection on each service board in each ARM server in the ARM server cluster once every preset time period (such as a week), and if the current time reaches the preset time period, the BMC console will automatically touch a service board processing instruction for performing state detection on all service boards in the ARM server cluster. Optionally, if the service board processing instruction is a service board state query instruction, the service board processing instruction may also be generated by the BMC console based on a service exception alert fed back by the user, specifically, according to the service exception alert fed back by the user, determining identification information of an ARM server and/or a service board associated with an abnormal service, and further generating the service board processing instruction according to the identification information of the ARM server and/or the service board associated with the abnormal service.
The first ARM server may be an ARM server associated with the service board processing instruction in the ARM server cluster, that is, an ARM server to which each service board to be processed corresponding to the service board processing instruction belongs is the first ARM server. The target service boards are the service boards which are corresponding to the service board processing instructions and need to be processed.
Optionally, in this embodiment, in response to the service board processing instruction, the BMC console determines that an implementation manner of the first ARM server in the ARM server cluster is: under the condition that the related information of the first ARM server is given in the service board processing instruction, the related information of the first ARM server can be directly obtained from the service board processing instruction, and then the first ARM server is positioned from the ARM server cluster based on the related information. For example, if the related information of the first ARM server is the identification information (such as the IP or serial number of the ARM server) of the ARM server, the first ARM server may be located from the ARM server cluster according to the identification information at this time; if the related information of the first ARM server is related information (such as IP or serial number of the service board) of the service board to be processed, the target service board to be processed may be located according to the related information of the service board, and then the ARM server where the target service board is located is used as the first ARM server.
Another embodiment is: the service board processing instruction does not give the related information of the first ARM server, and at this time, all ARM servers in the ARM server cluster can be used as the first ARM server; the first ARM server can be selected from the ARM server cluster according to ARM server screening rules corresponding to the business processing instructions. Alternatively, this embodiment may set different server screening rules for different service processing instructions. For example, when the service processing instruction is a service board status detection instruction, the server screening rule may be to take, as the first ARM server, an ARM server to which a service board that has failed for a period of time (for example, within one month) belongs.
Optionally, the manner of determining the target service board in the first ARM server in this embodiment may be similar to the manner of determining the first ARM server, and in particular, one embodiment is: under the condition that the relevant information of the target service board is given in the service board processing instruction, the relevant information of the target service board can be directly obtained from the service board processing instruction, and then the target service board is positioned from the first ARM server based on the relevant information. Another embodiment is: the service board processing instruction does not give the related information of the target service board, and at this time, all the service boards (i.e. usable service boards) which can be used in the first ARM server can be used as the target service board; and the target service board can be positioned from the first ARM server according to the service board screening rule corresponding to the service processing instruction. In this embodiment, different service board screening rules may be set for different service processing instructions.
It should be noted that, in this embodiment, the first ARM server may be determined from the ARM server cluster in other manners, and the target service board may be determined from each service board in the first ARM server. This is not limited.
It should be further noted that, the number of the first ARM servers determined in this embodiment is one or more, and for each first ARM server, the number of the target service boards included therein may also be one or more, and this embodiment is preferably a plurality.
S102, controlling the BMC in the first ARM server to process the target service board.
Optionally, after determining the first ARM server corresponding to the current service board processing request and the target service boards to be processed in the first ARM server, the BMC console may call the BMCs deployed in each first ARM server in parallel, and perform corresponding processing on each target service board in the first ARM server based on the service board processing instruction. Specifically, the BMC console may first generate a service board processing request including a service board processing requirement and target service board identification information according to a service board processing instruction, and send the service board processing request to the BMC in the first ARM server, so that the BMC responds to the service board processing request and processes each target service board in parallel. Such as restarting, powering off, upgrading, detecting state, or inquiring information.
Optionally, when the BMC console invokes the BMC deployed in each first ARM server, the IP address and the subnet mask of the BMC deployed in the first ARM server may be first obtained, and based on the IP address and the subnet mask of the BMC, a network connection is established with the BMC, and then based on the username, the password, and the name of the domain name system (Domain Name System, DNS) of the BMC firmware, a client corresponding to the network connection is logged in through a global wide area network (Web) browser, and then the BMC is remotely invoked through the client. The user name, password, DNS name, etc. of the BMC firmware may be located in the serial number tag of the first ARM server.
It should be noted that in this embodiment, an identification information correspondence is established between the BMC deployed in each ARM server and each service board in the ARM server, that is, the BMC can process each service board having a correspondence with the BMC. That is, the BMC deployed in each ARM server in this embodiment can perform parallel processing on multiple service boards in the ARM server where the BMC is located based on control of the BMC console, for example, performing processing such as service board detection, batch outage, batch upgrade, etc. on multiple service boards in the ARM server in parallel, so that service board processing efficiency is greatly improved.
According to the scheme of the embodiment of the disclosure, the target service board to be processed in the ARM server cluster and the first ARM server where the target service board is located are determined by responding to the service board processing instruction, so that the BMC deployed in the first ARM server is controlled to process the target service board in the first ARM server. According to the scheme, the BMC in the ARM servers in the ARM server cluster can be controlled in parallel according to the service board processing instruction through one BMC console, so that each service board in the ARM servers is processed, the processing efficiency of each service board in the ARM server cluster is greatly improved, and the operation and maintenance management efficiency of the ARM server cluster is further improved.
Optionally, in the case that the service board processing instruction is a status detection instruction for a service board, and no related information of the target service board is given in the service board processing instruction, the embodiment may preferably use an available service board deployed in the first ARM server as the target service board. The method has the advantages that the BMC console is used for controlling one BMC to perform state detection on all service boards on the ARM server in batches, so that the state detection efficiency is improved, and meanwhile, the comprehensiveness of service board state detection is improved.
Fig. 2 is a flowchart of an ARM server-based control method provided according to an embodiment of the present disclosure. Based on the foregoing embodiments, the embodiments of the present disclosure further explain in detail how to determine the target service board in the first ARM server, as shown in fig. 2, a control method based on the ARM server provided in this embodiment may include:
s201, responding to a service board processing instruction, and determining a first ARM server in an ARM server cluster.
S202, determining a service board to be processed in the first ARM server according to the service board processing requirements in the service board processing instructions.
The service boards to be processed can be all service boards which meet the service board processing requirements corresponding to the service processing instructions and need to be subjected to service processing in the ARM server cluster.
Specifically, in this embodiment, the service board to be processed in the first ARM server may be determined as the service board to be processed according to the service board processing requirement in the service board processing instruction by using the manner described in the foregoing embodiment. For example, according to the related information of the service boards given in the service board processing instruction, each service board corresponding to the related information is found in the first ARM server to be used as the service board to be processed. Or all available service boards in the first ARM server are used as service boards to be processed; and the service board to be processed and the like can be positioned from the first ARM server according to the service board screening rule corresponding to the service processing instruction.
S203, determining a target service board from the service boards to be processed according to the current running state and/or the historical operation and maintenance data of the service boards to be processed.
The current operating state may include an operating state, an idle state, and a fault state, among others. The working state can also comprise business service information of current operation, continuous operation duration and the like. Historical operational data may be data that records the relevant data that was processed on the business board, and may include, but is not limited to: processing time, processing events, processing results, etc. Optionally, the historical operation and maintenance data may be all historical operation and maintenance data corresponding to various requirements of the service board to be processed; the historical operation and maintenance data corresponding to the service requirement processing related to the current service board processing instruction executed by the service board to be processed can also be processed.
Optionally, according to the current operation state of the service board to be processed, the method for determining the target service board from the service board to be processed may be: and determining the target service board from the service boards to be processed according to the current running state of the service boards to be processed and the expected running state required by executing the service board processing instruction. Specifically, if the current running state of the service board to be processed is the same as the expected running state, the service board to be processed is taken as the target service board. When the service board is subjected to power-off or upgrading treatment, the expected running state is an idle state; when the service board is subjected to state monitoring processing, the expected running state is a working state. When the expected operation state is an operation state, the expected operation state may further include: desired operational business services and desired duration of continuous operation, etc.
According to the historical operation and maintenance data of the service board to be processed, the mode of determining the target service board from the service board to be processed can be as follows: and judging that each service processing is executed in a history mode according to the history operation and maintenance data of the service to be processed, or when the corresponding processing result is executed in the process of executing the requirement processing associated with the current service board processing instruction, further determining whether the service board to be processed needs to be processed based on the current service board processing instruction according to the processing result, and if so, taking the service board to be processed as a target service board.
For example, if the service board processing requirement of the current service board processing instruction is to upgrade the firmware version of the service board, judging whether the history of the service board to be processed has completed the upgrade of the firmware version according to the history operation data of the service board to be processed, and if not, taking the service board to be processed as the target service board. For another example, if the service board processing requirement of the current service board processing instruction is to perform state monitoring on the service board, judging whether the execution failure condition frequently occurs when the history of the service board to be processed executes each processing according to the history operation data of the service board to be processed, and if so, taking the service board as the target service board. Or, according to the historical operation data of the service board to be processed, judging whether the service board to be processed is determined to be the fault service board or not, and if so, not taking the service board to be processed as the target service board.
The method for determining the target service board from the service boards to be processed can combine the two modes to determine the target service board according to the current running state and the historical operation and maintenance data of the service boards to be processed, so that the determined target service board is more accurate.
Optionally, in this embodiment, for other service boards to be processed, which do not belong to the target service board due to the current running state not meeting the requirement, the current running state may be monitored, and when the current running state meets the desired running state, the service board is triggered to be used as the target service board, and the subsequent operation of S204 is performed on the service board.
S204, controlling the BMC in the first ARM server to process the target service board.
According to the scheme of the embodiment of the disclosure, the first ARM server in the ARM server cluster and the service boards to be processed in the first ARM server are determined through responding to the service board processing instruction, and then the target service boards are further screened out from the service boards to be processed according to the current running state and/or the historical operation and maintenance data of each service board to be processed, so that the BMC deployed in the first ARM server is controlled to process the target service boards in the first ARM server. According to the method and the device, the current running state and/or historical operation and maintenance data of the service boards are introduced to determine the target service boards which are finally required to be processed, so that the accuracy of determining the target service boards is improved, and the problem of processing the target service boards in parallel subsequently due to improper selection of the target service boards is avoided.
Fig. 3 is a flowchart of an ARM server-based control method provided according to an embodiment of the present disclosure. Based on the foregoing embodiments, the embodiments of the present disclosure further explain in detail how to control a BMC in a first ARM server to process a target service board, as shown in fig. 3, where the ARM server-based control method provided in this embodiment may include:
s301, responding to a service board processing instruction, and determining a first ARM server in an ARM server cluster and a target service board in the first ARM server.
S302, generating a service processing request according to the service board processing instruction and the identification information of the target service board.
The service board processing request may be generated by the BMC console, and is used to instruct the BMC to process a request for the target service board. The service board processing request at least comprises the following steps: the target service board identification information and the service board processing requirement can also comprise identification information of the BMC (or the first ARM server), request time, service board processing time and the like.
Optionally, the BMC console may analyze the service board processing requirement from the service board processing instruction, and acquire the identification information of the target service board determined in S301, so as to generate the service processing request according to the service board processing requirement and the identification information of the target service board, and according to the adapted data format of the BMC deployed in the first ARM server. Optionally, identification information of the BMC (or the first ARM server), the request time, the service board processing time, and the like can also be added in the service processing request.
It should be noted that, in this embodiment, the adaptation data formats of BMCs deployed in each ARM server in the ARM server cluster may be the same or different.
S303, sending a service processing request to the BMC in the first ARM server through the special management network, so that the BMC responds to the service processing request and interacts with the target service board through the service network to process the target service board.
The special management network is used for communication between the BMC in the ARM server and the BMC console, namely a special communication network between the ARM server and the external BMC console; alternatively, the private management networks between different BMCs and BMC consoles may be the same or different. The service network is used for communication between the BMC in the ARM server and each service board in the ARM server. I.e. a private network inside the ARM server for transmitting traffic data.
Optionally, in this embodiment, the BMC console may send a service processing request to the BMC in the first ARM server through the dedicated management network, and the BMC determines a target service board to be processed in response to the service processing request, generates a corresponding control instruction according to a service board processing requirement, and transmits the control instruction to the corresponding target service board through the service network, so that the target service board responds to the control instruction and executes the relevant service processing operation.
According to the scheme of the embodiment of the disclosure, a target service board to be processed in an ARM server cluster and a first ARM server where the target service board is located are determined by responding to a service board processing instruction, a service processing request containing service board processing requirements and identification information of the target service board is generated, the service processing request is sent to a BMC deployed in the first ARM server through a special management network, so that the BMC responds to the service processing request, interacts with the target service board through the service network, and processes the target service board. According to the scheme, different networks are set for communication between the BMC and the BMC console and communication between the BMC and the service board, namely, separation of the network of the internal communication and the external communication of the ARM server is ensured, and the safety and the high efficiency of communication can be improved.
Optionally, the service network of the embodiment includes a general service network and a special service network; further, the special service network is used for communication between the target service board and the universal network conversion component; the universal service network is used for communication between the universal network conversion component and the BMC. Specifically, the general network special conversion component may be a circuit configured in an ARM server for performing general logic conversion on special service networks of different specifications And the element can be configured between the BMC and each business board. As shown in fig. 4, the service network is composed of an asynchronous transceiver (Universal Asynchronous Receiver/Transmitter, UART) and a bus (I2C). BMC may first pass UART Universal use And I2C Universal use Transmitting the control command to the target service board to a universal network conversion component, wherein the universal network conversion component receives the control command and converts different special service network logics to pass through different UARTs i And I2C i And transmitting the converted control instruction to the corresponding service board i. Correspondingly, if each service board i needs to feed back the processing result to the BMC after executing the control instruction, each service board i will pass through its UART i And I2C i Transmitting the processing result to a universal network conversion component, and after the universal network conversion component carries out universal service network logic conversion on the processing result fed back by each service board, the processing result passes through a UART (universal asynchronous receiver/transmitter) Universal use And I2C Universal use It is transmitted to the BMC. According to the embodiment, different special service networks are set for different service boards, communication interference among the different service boards is avoided, and the safety and the high efficiency of communication of each service board are ensured.
Fig. 5 is a flowchart of an ARM server-based control method provided according to an embodiment of the present disclosure. Based on the foregoing embodiments, the embodiments of the present disclosure are further optimized, as shown in fig. 5, and the control method based on the ARM server provided in this embodiment may include:
S501, responding to a service board processing instruction, determining a first ARM server in an ARM server cluster and a target service board in the first ARM server.
S502, controlling the BMC in the first ARM server to process the target service board.
S503, determining the health degree of the target service board according to the service processing result fed back by the BMC in the first ARM server.
The service processing result may be a result of the BMC processing the target service board, and specifically, if the BMC performs power-off or upgrade processing on the target service board, the processing result may be whether the power-off or upgrade processing is successfully performed on the target service board; if the BMC performs information query processing on the target service board, the processing result can be related information of the queried target service board; if the BMC performs the operation state monitoring processing on the target service board, the processing result may be the current operation state of the target service board, such as a working state, an idle state or a fault state.
The health of the service board in this embodiment may be an indicator for measuring whether the service board can normally operate. The health may include at least two levels, e.g., a health level from high to low may include both healthy and unhealthy levels; three classes of health, sub-health and unhealthy may also be included.
Specifically, in this embodiment, according to the service processing result fed back by the BMC in the first ARM server, the manner of determining the health degree of the target service board may be to preset health degree judgment standards corresponding to different service processing results, and determine the health degree of the target service board based on the judgment standards and the service processing results fed back by the BMCs. For example, the following cases may be included, but are not limited to:
in the first case, if the processing result is whether the power-off or upgrade is successfully performed on the target service board, the health status of the target service board on which the power-off or upgrade is successfully performed is the highest health level (for example, health), and the health status of other target service boards can further determine the corresponding health level according to the reason of the failure in execution.
And in the second case, if the processing result can be the related information of the target service board queried by the BMC, the health degree of the target service board can be analyzed and determined according to the related information. For example, the health of the target service board is determined according to the temperature of the target service board, the intensity of environmental noise, and the like.
And thirdly, if the processing result can be the current running state of the target service board, the health degree grade of the target service board with the running state being the fault state is lower, and the health degree grade of the target service boards with other running states is higher.
S504, determining the service state of the service corresponding to the target service board according to the health degree.
The service state of the service is used for representing whether the service provided by the ARM server cluster is normal service or not, and specifically, the service state comprises normal service and abnormal service.
Optionally, in this embodiment, when determining the service state of the service corresponding to the target service board according to the health degree, the service state of the service corresponding to the target service board may be determined by combining requirements of different service services on the health degree of the service board. For example, if a certain business service requires that only the business board with the highest health level (such as health) can process the business, then only if the health level of all the target business boards corresponding to the business service is the highest level, the server state of the business server is normal service, otherwise, the server state is abnormal service. Specifically, if the service state of the service is abnormal, the service state may further include a cause of the abnormality.
Optionally, if a certain service requirement does not enable the service board with the lowest health level to execute the service, when the health of the target service board executing the service is at a middle level (such as sub-health), although the service state of the service is normal service, the target service board with the middle level may have an influence on the normal operation of the service, and at this time, the embodiment may add the number and/or the identifier of the target service boards with the non-highest health level as warning information in the service state.
S505, outputting the health degree of the target business board and/or the service state of the corresponding business of the target business board.
Specifically, the BMC console of this embodiment may output, on a display screen of the BMC console, the health degree of the target service board and/or the service state of the service corresponding to the target service board in a list, an icon, text, or voice format, etc.
Preferably, for the target service board with lower health degree or the service with abnormal service state, a notification message can be sent to the manager of the ARM server cluster so as to remind the manager of repairing the target service board with lower health degree or the target service board associated with the service with abnormal state as soon as possible.
According to the scheme of the embodiment of the disclosure, the target service board to be processed in the ARM server cluster and the first ARM server where the target service board is located are determined by responding to the service board processing instruction, so that the BMC deployed in the first ARM server is controlled to process the target service board in the first ARM server, the service processing result fed back by the BMC after the target service board is processed is obtained, and the health degree of the target service board and the service state of the corresponding service of the target service board are determined and output according to the service processing result. According to the scheme, the BMC can be controlled to process the target service board, and the health degree of the target service board and/or the service state of the corresponding service of the target service board can be displayed to the user according to the processing result, so that the functions of the BMC control console are further enriched.
Fig. 6 is a management architecture diagram based on an ARM server according to an embodiment of the present disclosure.
The system comprises a BMC console and an ARM server cluster, wherein the ARM server cluster comprises a plurality of ARM servers, and each ARM server is provided with a BMC and a plurality of service boards. Each BMC is used for responding to the control of the BMC console and managing a plurality of service boards deployed on the ARM server where the BMC is located. Optionally, the BMC console of the present embodiment may implement management of the BMC and the plurality of service boards deployed in the ARM server. Specifically, the process of managing the service board includes: the BMC console responds to the service board processing instruction and determines a first ARM server in the ARM server cluster and a target service board in the first ARM server; and controlling the BMC in the first ARM server to process the target service board, such as restarting, power-off, upgrading, state detection or information inquiry and the like.
The process of managing the BMC comprises the following steps: responding to the BMC processing instruction, and determining a second ARM server in the ARM server cluster; and processing the BMC in the second ARM server.
The second ARM server may be an ARM server associated with the BMC processing instruction in the ARM server cluster, that is, an ARM server to which each BMC that corresponds to the BMC processing instruction and needs to process belongs is the second ARM server. The manner of determining the second ARM server from the ARM server cluster in this embodiment may be similar to the manner of determining the first ARM server from the ARM server cluster described in the foregoing embodiment, and will not be described herein. Alternatively, the number of the second ARM servers determined in this embodiment may be one or more. After determining each second ARM server, the BMC control console can generate a BMC processing request corresponding to the BMC of each second ARM server according to the BMC processing instruction and the identification information of the BMC of each second ARM server, and send each BMC processing request to the BMC in the corresponding second ARM server through the special management network so as to enable the BMC in each second ARM server to perform corresponding processing, such as restarting, powering off, upgrading, state detection or information query and the like.
Illustratively, fig. 6 shows relevant functions of the BMC console of the present embodiment: 1. basic functions related to user account: such as account login, account setup, and account information overview. 2. BMC management function: such as new BMC, BMC detail query, BMC list presentation, and BMC handling (e.g., power down and reboot, etc.). 3. Service board management function: such as service board list presentation, service board processing (e.g., power down and reboot, etc.), and service board information querying (e.g., running status querying). 4. Monitoring and warning functions: such as a monitoring list (e.g., a monitored service board or BMC list) and sending a warning message. 5. Log audit function: the oplog list and oplog query. The operation log may include a log of the operation BMC and a log of the operation service board.
The BMC control console of the embodiment not only can realize parallel processing of BMCs deployed in ARM servers, but also can realize parallel management of a plurality of service boards. The operation and maintenance management efficiency and the richness of the management dimension of the BMC control console to the ARM server cluster are greatly improved.
Fig. 7 is a schematic structural diagram of an ARM server-based control device according to an embodiment of the present disclosure. The embodiment of the disclosure is suitable for the situation of managing the service boards deployed in all ARM servers in the ARM server cluster. The apparatus may be implemented in software and/or hardware, and preferably the apparatus may be a BMC console configured in an electronic device. The device can realize the ARM server-based control method of any embodiment of the disclosure. As shown in fig. 7, the ARM server-based control device 700 includes:
A first object determining module 701, configured to determine, in response to a service board processing instruction, a first ARM server in an ARM server cluster of an advanced reduced instruction set processor, and a target service board in the first ARM server;
the service board processing module 702 is configured to control the baseboard management controller BMC in the first ARM server to process the target service board.
According to the scheme of the embodiment of the disclosure, the target service board to be processed in the ARM server cluster and the first ARM server where the target service board is located are determined by responding to the service board processing instruction, so that the BMC deployed in the first ARM server is controlled to process the target service board in the first ARM server. According to the scheme, the BMC in the ARM servers in the ARM server cluster can be controlled in parallel according to the service board processing instruction through one BMC console, so that each service board in the ARM servers is processed, the processing efficiency of each service board in the ARM server cluster is greatly improved, and the operation and maintenance management efficiency of the ARM server cluster is further improved.
Further, the service board determining unit in the first object determining module 701 is configured to: and taking the available service board deployed in the first ARM server as a target service board.
Further, the service board determining unit in the first object determining module 701 is further configured to:
determining a service board to be processed in a first ARM server according to the service board processing requirement in the service board processing instruction;
and determining the target service board from the service boards to be processed according to the current running state and/or the historical operation and maintenance data of the service boards to be processed.
Further, the service board processing module 702 is specifically configured to:
generating a service processing request according to the service board processing instruction and the identification information of the target service board;
and sending a service processing request to the BMC in the first ARM server through the special management network, so that the BMC responds to the service processing request and interacts with the target service board through the service network to process the target service board.
Further, the service network includes a general service network and a special service network; the special service network is used for communication between the target service board and the general network conversion component; the universal service network is used for communication between the universal network conversion component and the BMC.
Further, the ARM server-based control device 700 further includes:
the health degree determining module is used for determining the health degree of the target service board according to the service processing result fed back by the BMC in the first ARM server;
The service state determining module is used for determining the service state of the service corresponding to the target service board according to the health degree;
and the output module is used for outputting the health degree of the target service board and/or the service state of the corresponding service of the target service board.
Further, the ARM server-based control device 700 further includes:
the second object determining module is used for responding to the BMC processing instruction and determining a second ARM server in the ARM server cluster;
and the BMC processing module is used for processing the BMC in the second ARM server.
The product can execute the method provided by any embodiment of the disclosure, and has the corresponding functional modules and beneficial effects of executing the method.
In the technical scheme of the disclosure, related data, such as identification information, service data, detail information and the like, of the BMC or the service board are acquired, stored, applied and the like, all conform to the regulations of related laws and regulations, and are not in violation of the public order.
According to embodiments of the present disclosure, the present disclosure also provides an electronic device, a readable storage medium and a computer program product.
Fig. 8 illustrates a schematic block diagram of an example electronic device 800 that may be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile apparatuses, such as personal digital assistants, cellular telephones, smartphones, wearable devices, and other similar computing apparatuses. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the disclosure described and/or claimed herein.
As shown in fig. 8, the apparatus 800 includes a computing unit 801 that can perform various appropriate actions and processes according to a computer program stored in a Read Only Memory (ROM) 802 or a computer program loaded from a storage unit 808 into a Random Access Memory (RAM) 803. In the RAM 803, various programs and data required for the operation of the device 800 can also be stored. The computing unit 801, the ROM 802, and the RAM 803 are connected to each other by a bus 804. An input/output (I/O) interface 805 is also connected to the bus 804.
Various components in device 800 are connected to I/O interface 805, including: an input unit 806 such as a keyboard, mouse, etc.; an output unit 807 such as various types of displays, speakers, and the like; a storage unit 808, such as a magnetic disk, optical disk, etc.; and a communication unit 809, such as a network card, modem, wireless communication transceiver, or the like. The communication unit 809 allows the device 800 to exchange information/data with other devices via a computer network such as the internet and/or various telecommunication networks.
The computing unit 801 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of computing unit 801 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, etc. The computing unit Y01 performs the respective methods and processes described above, for example, an ARM server-based control method. For example, in some embodiments, the ARM server-based control method may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as the storage unit 808. In some embodiments, part or all of the computer program may be loaded and/or installed onto device 800 via ROM 802 and/or communication unit 809. When a computer program is loaded into RAM 803 and executed by computing unit 801, one or more steps of the ARM server-based control method described above may be performed. Alternatively, in other embodiments, the computing unit 801 may be configured to perform ARM server-based control methods by any other suitable means (e.g., by means of firmware).
Various implementations of the systems and techniques described here above can be implemented in digital electronic circuitry, integrated circuit systems, field Programmable Gate Arrays (FPGAs), application Specific Integrated Circuits (ASICs), application Specific Standard Products (ASSPs), systems On Chip (SOCs), complex Programmable Logic Devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for carrying out methods of the present disclosure may be written in any combination of one or more programming languages. These program code may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowchart and/or block diagram to be implemented. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and pointing device (e.g., a mouse or trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), wide Area Networks (WANs), blockchain networks, and the internet.
The computer system may include a client and a server. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical hosts and VPS service are overcome. The server may also be a server of a distributed system or a server that incorporates a blockchain.
Artificial intelligence is the discipline of studying the process of making a computer mimic certain mental processes and intelligent behaviors (e.g., learning, reasoning, thinking, planning, etc.) of a person, both hardware-level and software-level techniques. Artificial intelligence hardware technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, and the like; the artificial intelligent software technology mainly comprises a computer vision technology, a voice recognition technology, a natural language processing technology, a machine learning/deep learning technology, a big data processing technology, a knowledge graph technology and the like.
Cloud computing (cloud computing) refers to a technical system that a shared physical or virtual resource pool which is elastically extensible is accessed through a network, resources can comprise servers, operating systems, networks, software, applications, storage devices and the like, and resources can be deployed and managed in an on-demand and self-service mode. Through cloud computing technology, high-efficiency and powerful data processing capability can be provided for technical application such as artificial intelligence and blockchain, and model training.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps recited in the present disclosure may be performed in parallel, sequentially, or in a different order, provided that the desired results of the disclosed aspects are achieved, and are not limited herein.
The above detailed description should not be taken as limiting the scope of the present disclosure. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present disclosure are intended to be included within the scope of the present disclosure.

Claims (8)

1. A control method based on ARM server includes:
in response to a service board processing instruction, determining a first ARM server in an advanced reduced instruction set processor ARM server cluster, and determining a target service board in the first ARM server, comprising: determining a service board to be processed in a first ARM server according to the service board processing requirements in the service board processing instructions; determining a target service board from the service boards to be processed according to the current running state and/or the historical operation and maintenance data of the service boards to be processed; the current operating state includes: working state, idle state and fault state;
generating a service processing request according to the service board processing instruction and the identification information of the target service board;
the service processing request is sent to the BMC in the first ARM server through a special management network, so that the BMC responds to the service processing request and interacts with the target service board through a service network to process the target service board; wherein the service network comprises a general service network and a special service network; the special service network is used for communication between the target service board and the universal network conversion component; the general service network is used for communication between the general network conversion component and the BMC;
Responding to the BMC processing instruction, and determining a second ARM server in the ARM server cluster;
and processing the BMC in the second ARM server.
2. The method of claim 1, wherein the determining the target service board in the first ARM server comprises:
and taking the available service board deployed in the first ARM server as a target service board.
3. The method of claim 1, further comprising:
determining the health degree of the target service board according to a service processing result fed back by the BMC in the first ARM server;
determining the service state of the service corresponding to the target service board according to the health degree;
and outputting the health degree of the target service board and/or the service state of the corresponding service of the target service board.
4. An ARM server-based control device, comprising:
the first object determining module is used for responding to the service board processing instruction and determining a first ARM server in the ARM server cluster of the advanced reduced instruction set processor, and comprises a service board determining unit used for: determining a service board to be processed in a first ARM server according to the service board processing requirements in the service board processing instructions; determining a target service board from the service boards to be processed according to the current running state and/or the historical operation and maintenance data of the service boards to be processed; the current operating state includes: working state, idle state and fault state;
The service board processing module is used for generating a service processing request according to the service board processing instruction and the identification information of the target service board;
the service processing request is sent to the BMC in the first ARM server through a special management network, so that the BMC responds to the service processing request and interacts with the target service board through a service network to process the target service board;
wherein the service network comprises a general service network and a special service network; the special service network is used for communication between the target service board and the universal network conversion component; the general service network is used for communication between the general network conversion component and the BMC;
the second object determining module is used for responding to the BMC processing instruction and determining a second ARM server in the ARM server cluster;
and the BMC processing module is used for processing the BMC in the second ARM server.
5. The apparatus of claim 4, wherein the service board determining unit in the first object determining module is configured to: and taking the available service board deployed in the first ARM server as a target service board.
6. The apparatus of claim 4, further comprising:
The health degree determining module is used for determining the health degree of the target service board according to the service processing result fed back by the BMC in the first ARM server;
the service state determining module is used for determining the service state of the service corresponding to the target service board according to the health degree;
and the output module is used for outputting the health degree of the target service board and/or the service state of the service corresponding to the target service board.
7. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein, the liquid crystal display device comprises a liquid crystal display device,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the ARM server-based control method of any one of claims 1-3.
8. A non-transitory computer-readable storage medium storing computer instructions for causing the computer to execute the ARM server-based control method according to any one of claims 1-3.
CN202210112986.6A 2022-01-29 2022-01-29 ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium Active CN114443439B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210112986.6A CN114443439B (en) 2022-01-29 2022-01-29 ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210112986.6A CN114443439B (en) 2022-01-29 2022-01-29 ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium

Publications (2)

Publication Number Publication Date
CN114443439A CN114443439A (en) 2022-05-06
CN114443439B true CN114443439B (en) 2023-06-30

Family

ID=81372169

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210112986.6A Active CN114443439B (en) 2022-01-29 2022-01-29 ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114443439B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113110970A (en) * 2021-04-08 2021-07-13 浪潮商用机器有限公司 Method, device, equipment and medium for monitoring components in server working mode
CN113867815A (en) * 2021-09-17 2021-12-31 杭州当虹科技股份有限公司 Server suspension monitoring and automatic restarting method and server applying same

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10860311B2 (en) * 2018-06-13 2020-12-08 Dell Products, L.P. Method and apparatus for drift management in clustered environments
CN111008379B (en) * 2019-11-22 2023-02-28 腾讯科技(深圳)有限公司 Firmware safety detection method of electronic equipment and related equipment
CN111694707A (en) * 2020-05-23 2020-09-22 苏州浪潮智能科技有限公司 Small server cluster management system and method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113110970A (en) * 2021-04-08 2021-07-13 浪潮商用机器有限公司 Method, device, equipment and medium for monitoring components in server working mode
CN113867815A (en) * 2021-09-17 2021-12-31 杭州当虹科技股份有限公司 Server suspension monitoring and automatic restarting method and server applying same

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
服务器监控技术综述及展望;王慧强;戴秀豪;吕宏武;林俊宇;;计算机科学(第08期);6-11+23 *

Also Published As

Publication number Publication date
CN114443439A (en) 2022-05-06

Similar Documents

Publication Publication Date Title
US10756990B1 (en) Monitoring and performance improvement of enterprise applications using correlated data associated with a plurality of service layers
CN111045901B (en) Container monitoring method and device, storage medium and electronic equipment
CN110489306A (en) A kind of alarm threshold value determines method, apparatus, computer equipment and storage medium
CN104360878B (en) A kind of method and device of application software deployment
US7426592B2 (en) Management function setting method for intelligent platform management interface
CN101771565B (en) Analogy method for realizing multitudinous or different baseboard management controllers by single server
US11662803B2 (en) Control method, apparatus, and electronic device
CN103748553A (en) Display of operating status information of client in remote desktop session
KR102176028B1 (en) System for Real-time integrated monitoring and method thereof
CN117280327B (en) Detecting data center large scale interruptions through near real time/offline data using machine learning models
CN114398354A (en) Data monitoring method and device, electronic equipment and storage medium
CN113608970A (en) Core board, server, failure recovery method, device and storage medium
CN114443439B (en) ARM server-based control method, ARM server-based control device, ARM server-based control equipment and storage medium
CN111176958B (en) Information monitoring method, system and storage medium
EP3926429A1 (en) Control system
CN106155859B (en) monitoring management system, information processing method and high-density server
CN115098294B (en) Abnormal event processing method, electronic equipment and management terminal
CN116723198A (en) Multi-node server host control method, device, equipment and storage medium
CN112445549A (en) Operation and maintenance method, operation and maintenance device, electronic equipment and medium
CN115687026A (en) Multi-node server fault early warning method, device, equipment and medium
CN114356699A (en) Embedded equipment alarm method, device, equipment and storage medium
CN113849367A (en) Server, management method, device and system thereof, electronic equipment and storage medium
CN112311560B (en) Remote wakeup method and remote wakeup system
CN114115657B (en) Display mode switching method, device, equipment and medium of cloud mobile phone
US20230259377A1 (en) User input predictive action system and method for user interfaces

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant