CN117421038A - System maintenance method, device, computer equipment and storage medium - Google Patents

System maintenance method, device, computer equipment and storage medium

Info

Publication number
CN117421038A
Authority
CN
China
Prior art keywords
server
server maintenance
maintenance
operation information
strategy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311457401.5A
Other languages
Chinese (zh)
Inventor
陈鹏
金立江
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Metabrain Intelligent Technology Co Ltd
Original Assignee
Suzhou Metabrain Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Metabrain Intelligent Technology Co Ltd filed Critical Suzhou Metabrain Intelligent Technology Co Ltd
Priority to CN202311457401.5A priority Critical patent/CN117421038A/en
Publication of CN117421038A publication Critical patent/CN117421038A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • G06F8/71Version control; Configuration management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3234Power saving characterised by the action undertaken
    • G06F1/324Power saving characterised by the action undertaken by lowering clock frequency
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/4401Bootstrapping
    • G06F9/4403Processor initialisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Security & Cryptography (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application relates to a system maintenance method, a system maintenance device, computer equipment and a storage medium. The method comprises the following steps: acquiring a server maintenance instruction sent by a user, wherein the server maintenance instruction carries a server maintenance strategy; triggering an interrupt according to the server maintenance instruction so as to enable the system to enter a system management mode SMM; and executing the server maintenance strategy through a resident function in the SMM so as to maintain the server. According to the scheme, the server maintenance instruction carrying the server maintenance strategy triggers the interrupt, so that the system enters the SMM, the server is maintained by executing the server maintenance strategy through the resident function under the SMM, and the system maintenance efficiency is improved.

Description

System maintenance method, device, computer equipment and storage medium
Technical Field
The present disclosure relates to the field of computer technologies, and in particular, to a system maintenance method, a system maintenance device, a computer device, and a storage medium.
Background
In current server application scenarios, once the BIOS (Basic Input Output System) has been initialized during startup, the information in the BIOS registers is fixed. If a user wants to modify the BIOS configuration as needed, the system must be restarted so that the BIOS can be initialized again. This not only affects the continuity of the user's services but also makes system maintenance inconvenient.
Disclosure of Invention
In view of the foregoing, it is desirable to provide a system maintenance method, apparatus, computer device, and storage medium that can improve system maintenance efficiency.
In one aspect, a system maintenance method is provided, the method comprising:
acquiring a server maintenance instruction sent by a user, wherein the server maintenance instruction carries a server maintenance strategy;
triggering an interrupt according to the server maintenance instruction so as to enable the system to enter a system management mode SMM;
and executing the server maintenance strategy through a resident function in the SMM so as to maintain the server.
In some embodiments, the step of triggering an interrupt according to the server maintenance instruction so as to enable the system to enter the system management mode (SMM) includes:
performing format packaging on the server maintenance strategy according to a preset format;
triggering the interrupt through the Advanced Configuration and Power Interface (ACPI), and checking and analyzing the format-packaged server maintenance strategy through the ACPI.
In some embodiments, the server maintenance instruction further carries an operation information acquisition policy, and after the step of triggering an interrupt according to the server maintenance instruction so as to enable the system to enter the system management mode (SMM), the method further includes:
executing the operation information acquisition strategy through the resident function to acquire the current operation information of the server;
and updating the server maintenance strategy according to the current running information of the server.
In some embodiments, the step of updating the server maintenance policy according to the current running information of the server includes:
acquiring an execution result of the server maintenance strategy;
and updating the server maintenance strategy according to the current operation information and the execution result.
In some embodiments, the step of obtaining the execution result of the server maintenance policy includes:
acquiring historical operation information before executing the server maintenance strategy;
and determining an execution result of the server maintenance strategy according to the historical operation information and the current operation information.
In some embodiments, the step of updating the server maintenance policy according to the operation information and the execution result further includes:
determining whether complete operation information and a complete execution result have been acquired;
and in response to complete operation information and a complete execution result having been acquired, updating the server maintenance strategy according to the operation information and the execution result.
In some embodiments, the step of updating the server maintenance policy according to the current running information of the server includes:
when the current running information indicates that a correctable error storm has occurred on the server, updating the server maintenance strategy to suppress the correctable error storm;
when the current running information indicates that the load is too high, updating the server maintenance strategy to apply frequency reduction, voltage reduction and current limiting to the CPU or the memory;
when the current running information indicates a device failure, updating the server maintenance strategy to disable the failed device;
when the current running information indicates abnormal heat dissipation, updating the server maintenance strategy to apply frequency reduction, voltage reduction and current limiting to the CPU or the memory;
and when the current running information indicates that the server is idle and the load is low, updating the server maintenance strategy to an energy-saving strategy.
In another aspect, a system maintenance device is provided, including:
the instruction sending module is used for obtaining a server maintenance instruction sent by a user, wherein the server maintenance instruction carries a server maintenance policy;
the interrupt module is used for triggering interrupt according to the server maintenance instruction so as to enable the system to enter a System Management Mode (SMM);
and the maintenance module is used for executing the server maintenance strategy by using a resident function in the SMM so as to maintain the server.
In one embodiment, the interrupt module includes:
the packaging submodule is used for carrying out format packaging on the server maintenance strategy according to a preset format;
and the analysis submodule is used for triggering the interrupt through the Advanced Configuration and Power Interface (ACPI), and checking and analyzing the format-packaged server maintenance strategy through the ACPI.
In an embodiment, the device further comprises:
the operation information acquisition module is used for executing the operation information acquisition strategy through the resident function so as to acquire the current operation information of the server;
and the updating module is used for updating the server maintenance strategy according to the current running information of the server.
In one embodiment, the update module includes:
the execution result acquisition sub-module is used for acquiring the execution result of the server maintenance strategy;
and the first updating sub-module is used for updating the server maintenance strategy according to the current running information and the execution result.
In an embodiment, the execution result obtaining submodule is configured to:
acquiring historical operation information before executing the server maintenance strategy;
and determining an execution result of the server maintenance strategy according to the historical operation information and the current operation information.
In an embodiment, the update module further comprises:
the execution result determining submodule is used for determining whether complete operation information and complete execution results are acquired or not;
and the second updating sub-module is used for updating the server maintenance strategy according to the running information and the execution result in response to the acquisition of the complete running information and the complete execution result.
In one embodiment, the update module includes:
a third updating sub-module, configured to update the server maintenance policy to suppress a correctable error storm when the current operation information indicates that a correctable error storm has occurred on the server;
a fourth updating sub-module, configured to update the server maintenance policy to apply frequency reduction, voltage reduction and current limiting to the CPU or the memory when the current operation information indicates that the load is too high;
a fifth updating sub-module, configured to update the server maintenance policy to disable the failed device when the current operation information indicates a device failure;
a sixth updating sub-module, configured to update the server maintenance policy to apply frequency reduction, voltage reduction and current limiting to the CPU or the memory when the current operation information indicates abnormal heat dissipation;
and a seventh updating sub-module, configured to update the server maintenance policy to an energy-saving policy when the current operation information indicates that the server is idle and the load is low.
In yet another aspect, a computer device is provided that includes a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor implementing the above system maintenance method when executing the computer program.
In yet another aspect, a computer readable storage medium is provided, on which a computer program is stored, which when executed by a processor, implements the above-described system maintenance method.
According to the system maintenance method, the device, the computer equipment and the storage medium, the server maintenance instruction carrying the server maintenance strategy triggers the interrupt, so that the system enters SMM, the server is maintained by executing the server maintenance strategy through the resident function in SMM, and the system maintenance efficiency is improved.
Drawings
FIG. 1 is a schematic flow chart of a system maintenance method according to an embodiment of the present invention;
FIG. 2 is another flow chart of a system maintenance method according to an embodiment of the present invention;
FIG. 3 is a block diagram of a system maintenance device according to an embodiment of the present invention;
FIG. 4 is an internal structure diagram of a computer device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application will be further described in detail with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the present application.
In the system maintenance method provided by the present application, a server maintenance instruction sent by a user is first obtained, the server maintenance instruction carrying a server maintenance strategy. An interrupt is then triggered according to the server maintenance instruction so that the system enters the system management mode (SMM). Finally, the server maintenance strategy is executed through a resident function in SMM so as to maintain the server.
In one embodiment, as shown in FIG. 1, a system maintenance method is provided. The method is described here as applied to a server, and includes the following steps:
Step S101, a server maintenance instruction sent by a user is obtained, wherein the server maintenance instruction carries a server maintenance strategy.
In a modern network environment, stable operation of the server is critical to service continuity, so the server must be maintained promptly when it fails. In the embodiment of the application, the server maintenance instruction is used for instructing that the server be maintained.
A user interface may be provided on the server. The interface may display the current actual operating state of the server and provide settings that allow a user to maintain the server online according to business needs. For example, a system maintenance button is provided that, when pressed by the user, triggers generation of the server maintenance instruction.
In one embodiment, the server maintenance instruction carries a server maintenance policy. The server maintenance policy encapsulates various ways of maintaining the server, including configuration optimization of power supplies and other hardware devices in the server, performance optimization, optimization of fault diagnosis policies, and the like. For example, frequency reduction, voltage reduction and current limiting are applied to the CPU (Central Processing Unit) or the memory, and failed devices are disabled.
When the system maintenance method of the embodiment of the application is executed, specific system maintenance options, such as reducing the CPU frequency or disabling a failed device, can be offered to the user on the user interface. After the user selects a system maintenance option, the corresponding server maintenance strategy is generated.
The server maintenance policy may also be given an initial value. For example, the server maintenance policy is set to null. The server then analyzes its current running information and the execution results of historical server maintenance policies to formulate a new server maintenance policy.
Step S102, triggering an interrupt according to the server maintenance instruction to enable the system to enter a system management mode SMM.
An SMI (System Management Interrupt) may be triggered when the OS (Operating System) Agent calls the DSM_method function of the ACPI (Advanced Configuration and Power Interface) interface. After the CPU (Central Processing Unit) receives the SMI, the system enters SMM (System Management Mode).
After the system enters SMM, the BIOS (Basic Input Output System) registers can be modified without restarting the system, so that the configuration information and state of hardware devices can be changed to meet service requirements. This avoids both the downtime and the service interruption that a restart would cause, and improves the availability and stability of the system.
Step S103, executing a server maintenance strategy through a resident function under the SMM to maintain the server.
The resident function may be set in SMM in advance. After the server maintenance policy sent by the user is received, the server maintenance policy is executed through the resident function to maintain the server.
The embodiment of the invention provides a system maintenance method, which triggers interruption through a server maintenance instruction carrying a server maintenance strategy to enable a system to enter SMM, and the server is maintained through executing the server maintenance strategy through a resident function under the SMM, so that the system maintenance efficiency is improved.
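The three steps S101 to S103 can be pictured with a small user-space simulation. This is only a sketch under stated assumptions: real SMM entry happens in firmware and cannot be reproduced in Python, and the names `MaintenanceInstruction`, `SimulatedSystem`, `trigger_smi`, `run_policy` and `maintain` are hypothetical and do not come from the application.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class MaintenanceInstruction:
    """A user request carrying the maintenance policy (step S101)."""
    policy: List[str]


@dataclass
class SimulatedSystem:
    """Toy stand-in for the platform; this is not real firmware."""
    in_smm: bool = False
    log: List[str] = field(default_factory=list)
    # Resident functions registered in advance, keyed by action name.
    resident: Dict[str, Callable[[], str]] = field(default_factory=dict)

    def trigger_smi(self) -> None:
        # Step S102: the interrupt moves the simulated system into SMM.
        self.in_smm = True

    def run_policy(self, policy: List[str]) -> None:
        # Step S103: resident functions only run while in SMM.
        if not self.in_smm:
            raise RuntimeError("policy may only run inside SMM")
        for action in policy:
            self.log.append(self.resident[action]())

    def exit_smm(self) -> None:
        self.in_smm = False


def maintain(system: SimulatedSystem, instruction: MaintenanceInstruction) -> List[str]:
    """Run S102 and S103 for one instruction, always leaving SMM afterwards."""
    system.trigger_smi()
    try:
        system.run_policy(instruction.policy)
    finally:
        system.exit_smm()
    return system.log
```

In this sketch the `try`/`finally` ensures the simulated system leaves SMM even if an action fails, which mirrors the intent of keeping the interruption to running services short.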
The embodiment of the invention also provides another system maintenance method, as shown in fig. 2, which comprises the following steps:
Step S201, a server maintenance instruction sent by a user is acquired, where the server maintenance instruction carries a server maintenance policy and an operation information acquisition policy.
In a modern network environment, stable operation of the server is critical to service continuity, so the server must be maintained promptly when it fails. In the embodiment of the application, the server maintenance instruction is used for instructing that the server be maintained.
A user interface may be provided on the server. The interface may display the current actual operating state of the server and provide settings that allow a user to maintain the server online according to business needs. For example, a system maintenance button is provided that, when pressed by the user, triggers generation of the server maintenance instruction. The user can thus learn the running condition of the server in time, discover and resolve potential faults or problems promptly, and reduce server downtime, while the response speed and processing capacity of the system and the safety of the server are also improved.
In one embodiment, the server maintenance instruction carries a server maintenance policy. The server maintenance policy encapsulates various ways of maintaining the server, including configuration optimization of power supplies and other hardware devices in the server, performance optimization, optimization of fault diagnosis policies, and the like. For example, frequency reduction, voltage reduction and current limiting are applied to the CPU (Central Processing Unit) or the memory, and failed devices are disabled.
When the system maintenance method of the embodiment of the application is executed, specific system maintenance options, such as reducing the CPU frequency or disabling a failed device, can be offered to the user on the user interface. After the user selects a system maintenance option, the corresponding server maintenance strategy is generated. The user can therefore maintain the server independently, which reduces dependence on external professionals and lowers maintenance cost and workload.
The server maintenance policy may also be given an initial value. For example, the server maintenance policy is set to null. The server then analyzes its current running information and the execution results of historical server maintenance policies to formulate a new server maintenance policy.
The operational state of the server is affected by a number of factors, such as load, memory, PCIE device health, network environment and hardware failures. These factors can therefore be actively monitored to acquire the operation information of the server and so determine its operational state.
Accordingly, in the embodiment of the application, the server maintenance instruction also carries an operation information acquisition policy. The operation information acquisition policy is used for collecting operation information of the devices in the server, such as load state, power state, fault state, overheat state and the operation state of the main components. A data collection module may be provided in the SMM service of the BIOS for collecting the running information of the server in real time.
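One way to picture an operation information acquisition policy is as a list of item names, each served by a probe. The sketch below is illustrative only: `AcquisitionPolicy`, `collect` and the probe names are hypothetical, and the probes return fixed values rather than reading hardware.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class AcquisitionPolicy:
    """Names of the items to collect (hypothetical representation)."""
    items: List[str]


def collect(policy: AcquisitionPolicy,
            probes: Dict[str, Callable[[], object]]) -> Dict[str, object]:
    """Run the probe for each requested item; items without a probe map to None."""
    return {name: probes[name]() if name in probes else None
            for name in policy.items}


# Example with stubbed probes (assumed values, not real measurements).
probes = {"load_state": lambda: 0.42, "overheat_state": lambda: False}
info = collect(AcquisitionPolicy(["load_state", "overheat_state", "power_state"]), probes)
```

Mapping a missing probe to `None` rather than raising keeps the gap visible to a later completeness check.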
Step S202, triggering an interrupt according to the server maintenance instruction to enable the system to enter a system management mode SMM.
After the server maintenance instruction is received, the OS Agent can call the ACPI interface to trigger the interrupt. ACPI defines a standardized set of methods and data structures for power management, device control and configuration. In one embodiment, an SMI (System Management Interrupt) is triggered by calling the DSM_method function of ACPI, and the system enters SMM (System Management Mode) after the CPU (Central Processing Unit) receives the SMI.
After the system enters SMM, the BIOS (Basic Input Output System) registers can be modified without restarting the system, so that the configuration information and state of hardware devices can be changed to meet service requirements. This avoids both the downtime and the service interruption that a restart would cause, and improves the availability and stability of the system.
Furthermore, when the service demand is continuously changed, the BIOS register can be modified in real time to adjust the performance parameters of the hardware equipment, or new hardware equipment is added into the system to perform configuration updating so as to meet the continuously changed service demand, and flexible configuration and expansion of the service demand are realized. By modifying the BIOS register in real time, the hardware device can be optimized and adjusted to improve the efficiency and performance of the system. For example, the frequency and power consumption management policy of the CPU may be adjusted, or the cache and mapping manner of the memory may be adjusted, so as to improve the operation speed and response capability of the system.
In an embodiment, the server maintenance policy may also be encapsulated in a predetermined format. And then checking and analyzing the server maintenance strategy after format encapsulation through ACPI.
For example, the server maintenance policy may be encapsulated according to the SMM Capsule format, to obtain an encapsulated server maintenance policy. And then checking the integrity of the packaged server maintenance strategy through ACPI, and analyzing the packaged server maintenance strategy to obtain the analyzed server maintenance strategy.
Similarly, the operation information acquisition strategy can be packaged in a format according to a preset format, and then the operation information acquisition strategy after the format packaging is checked and analyzed through ACPI.
For example, the operation information acquisition policy may be encapsulated according to the SMM Capsule format, to obtain an encapsulated operation information acquisition policy. And then checking the integrity of the packaged operation information acquisition strategy through ACPI, and analyzing the packaged operation information acquisition strategy to obtain the analyzed operation information acquisition strategy.
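The encapsulate, check and parse sequence can be sketched as follows. The application does not specify the layout of the SMM Capsule format, so the header used here (a 4-byte marker, a payload length and a CRC32) is purely an assumption for illustration, as are the names `MAGIC`, `encapsulate` and `check_and_parse`. The check is also done in ordinary Python rather than through ACPI.

```python
import json
import struct
import zlib

MAGIC = b"SMPC"                   # hypothetical 4-byte marker, not from the source
HEADER = struct.Struct("<4sII")   # marker, payload length, CRC32 of payload


def encapsulate(policy: dict) -> bytes:
    """Pack a policy into the assumed capsule layout."""
    payload = json.dumps(policy, sort_keys=True).encode("utf-8")
    return HEADER.pack(MAGIC, len(payload), zlib.crc32(payload)) + payload


def check_and_parse(capsule: bytes) -> dict:
    """Verify the capsule's integrity, then return the parsed policy."""
    if len(capsule) < HEADER.size:
        raise ValueError("capsule shorter than header")
    magic, length, crc = HEADER.unpack_from(capsule)
    payload = capsule[HEADER.size:]
    if magic != MAGIC:
        raise ValueError("unexpected marker")
    if len(payload) != length:
        raise ValueError("length mismatch")
    if zlib.crc32(payload) != crc:
        raise ValueError("checksum mismatch")
    return json.loads(payload.decode("utf-8"))
```

The same pair of functions would serve both the server maintenance policy and the operation information acquisition policy, since the passage treats them identically.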
Step S203, the server maintenance policy is executed through the resident function in SMM to maintain the server.
The resident function may be set in SMM in advance. After the server maintenance policy sent by the user is received, the server maintenance policy is executed through the resident function to maintain the server.
Step S204, executing the operation information acquisition strategy through the resident function to acquire the current operation information of the server.
Similarly, after the operation information acquisition policy sent by the user is received, the operation information acquisition policy is executed through the resident function to acquire the current operation information of the server.
It should be noted that the running condition of the system may have changed after the server maintenance policy has been executed. The server maintenance policy can therefore be updated according to the current running information so as to improve the accuracy of the server maintenance policy.
In an embodiment, when the current operation information indicates that a correctable error storm has occurred on the server, the server maintenance policy is set to suppress the correctable error storm. When the current operation information indicates that the load is too high, the server maintenance policy is set to apply frequency reduction, voltage reduction and current limiting to the CPU or the memory. When the current operation information indicates a device failure, the server maintenance policy is set to disable the failed device. When the current operation information indicates abnormal heat dissipation, the server maintenance policy is set to apply frequency reduction, voltage reduction and current limiting to the CPU or the memory. When the current operation information indicates that the server is idle and the load is low, the server maintenance policy is set to an energy-saving policy.
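The five rules above amount to a lookup from a detected condition to a policy. A minimal sketch follows, in which the enum `Condition`, the table `POLICY_FOR`, the policy labels and `update_policy` are all hypothetical names; how each condition is actually detected is outside the scope of the sketch.

```python
from enum import Enum
from typing import Optional


class Condition(Enum):
    CE_STORM = "correctable error storm"
    LOAD_TOO_HIGH = "load too high"
    DEVICE_FAILURE = "device failure"
    HEAT_ABNORMAL = "abnormal heat dissipation"
    IDLE_LOW_LOAD = "idle and low load"


THROTTLE = "reduce frequency, reduce voltage and limit current of CPU or memory"

# One entry per rule in the paragraph above; labels are illustrative only.
POLICY_FOR = {
    Condition.CE_STORM: "suppress correctable error storm",
    Condition.LOAD_TOO_HIGH: THROTTLE,
    Condition.DEVICE_FAILURE: "disable failed device",
    Condition.HEAT_ABNORMAL: THROTTLE,
    Condition.IDLE_LOW_LOAD: "energy saving",
}


def update_policy(current_policy: Optional[str],
                  condition: Optional[Condition]) -> Optional[str]:
    """Return the policy for the detected condition, or keep the current one."""
    if condition is None:
        return current_policy
    return POLICY_FOR[condition]
```

Note that excessive load and abnormal heat dissipation map to the same throttling policy, as in the text.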
Step S205, obtaining an execution result of the server maintenance policy.
In one embodiment, historical operating information prior to executing the server maintenance policy may be obtained. And then determining an execution result of the server maintenance strategy according to the historical operation information and the current operation information.
For example, when the server maintenance policy is to limit the CPU or the memory, assume that the historical operation information obtained before executing the server maintenance policy is as follows: the flow of the CPU in a preset time period is a first flow, and the flow of the memory in the preset time period is a second flow. The current operation information obtained after executing the server maintenance policy is as follows: the flow of the CPU in the preset time period is a third flow, and the flow of the memory in the preset time period is a fourth flow. When the first flow is greater than the third flow and the second flow is greater than the fourth flow, both the CPU and the memory have been limited, and the execution result may be set to optimal. When the first flow is greater than the third flow but the second flow is not greater than the fourth flow, or the first flow is not greater than the third flow but the second flow is greater than the fourth flow, only one of the CPU and the memory has been limited, and the execution result may be set to medium. When the first flow is not greater than the third flow and the second flow is not greater than the fourth flow, neither the CPU nor the memory has been limited, and the execution result may be set to bad.
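The grading in this example reduces to two comparisons. The sketch below assumes, as the conclusions about "both the CPU and the memory" imply, that the first and third flows belong to the CPU and the second and fourth to the memory; the function name `grade_execution` is hypothetical.

```python
def grade_execution(first: float, second: float,
                    third: float, fourth: float) -> str:
    """Grade a limiting policy by comparing flows before and after execution.

    first/third: CPU flow before/after; second/fourth: memory flow before/after
    (the memory assignment is an assumption, see the lead-in).
    """
    cpu_limited = first > third
    mem_limited = second > fourth
    if cpu_limited and mem_limited:
        return "optimal"
    if cpu_limited or mem_limited:
        return "medium"
    return "bad"
```

Because the comparisons are strict, an unchanged flow counts as "not limited", matching the "not greater than" wording of the text.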
It should be noted that, after the current running information and the execution result are obtained, SMI handling may be exited, and the BIOS may be notified to clean up and restore the saved context data.
And step S206, updating the server maintenance strategy according to the current operation information and the execution result.
In one embodiment, it may be determined whether complete operation information and a complete execution result have been obtained. Then, in response to complete operation information and a complete execution result having been obtained, the server maintenance strategy is updated according to the operation information and the execution result.
The server sends polling information to each device whose operation information is to be acquired in order to determine whether that device has sent its operation information; after receiving feedback of successful transmission from the device, the server confirms that the operation information of that device has been received. Once feedback of successful transmission has been received from all of the devices, it can be confirmed that complete operation information has been acquired. Similarly, query information asking whether execution of the server maintenance strategy has finished can be sent, and if feedback of successful execution is received, it is confirmed that a complete execution result has been obtained. The server maintenance strategy is then updated according to the current operation information and the execution result.
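The completeness gate described above can be sketched as a check over acknowledgements. The names `is_complete` and `maybe_update`, and the representation of device feedback as a dictionary of booleans, are assumptions made for illustration; the application does not describe the polling message format.

```python
from typing import Callable, Dict, Iterable, Optional


def is_complete(expected_devices: Iterable[str],
                transmit_acks: Dict[str, bool],
                execution_ack: bool) -> bool:
    """True only if every polled device confirmed transmission
    and the policy reported successful execution."""
    all_sent = all(transmit_acks.get(dev, False) for dev in expected_devices)
    return all_sent and execution_ack


def maybe_update(current_policy: Optional[str],
                 expected_devices: Iterable[str],
                 transmit_acks: Dict[str, bool],
                 execution_ack: bool,
                 formulate: Callable[[], str]) -> Optional[str]:
    """Update the policy only when information and result are complete."""
    if is_complete(expected_devices, transmit_acks, execution_ack):
        return formulate()
    return current_policy
```

A device that never answers the poll is treated the same as one that reports failure, so the policy is left unchanged until every expected device has confirmed.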
The embodiment of the invention provides a system maintenance method, which not only triggers interrupt through a server maintenance instruction carrying a server maintenance strategy to enable a system to enter SMM, but also maintains the server through executing the server maintenance strategy through a resident function under the SMM, thereby improving the system maintenance efficiency. After the operation information of the server is acquired through the operation information acquisition strategy carried by the server maintenance instruction, the server maintenance strategy is updated by combining the execution result of the server maintenance strategy, so that the accuracy of the server maintenance strategy is improved.
It should be understood that, although the steps in the flowcharts of FIGS. 1-2 are shown in the order indicated by the arrows, these steps are not necessarily performed in that order. Unless explicitly stated herein, the execution of these steps is not strictly limited to that order, and the steps may be executed in other orders. Moreover, at least some of the steps in FIGS. 1-2 may include multiple sub-steps or stages, which are not necessarily performed at the same moment but may be performed at different times; nor are these sub-steps or stages necessarily performed sequentially, as they may be performed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.
In one embodiment, as shown in FIG. 3, there is provided a system maintenance device 30 comprising: an instruction sending module 301, an interrupt module 302 and a maintenance module 303, wherein: the instruction sending module 301 is configured to obtain a server maintenance instruction sent by a user, where the server maintenance instruction carries a server maintenance policy; the interrupt module 302 is configured to trigger an interrupt according to the server maintenance instruction, so that the system enters the system management mode (SMM); the maintenance module 303 is configured to execute the server maintenance policy through a resident function in SMM to maintain the server.
In one embodiment, the interrupt module 302 includes an encapsulation submodule and an analysis submodule. The encapsulation submodule is configured to perform format packaging on the server maintenance strategy according to a preset format; the analysis submodule is configured to trigger the interrupt through the Advanced Configuration and Power Interface (ACPI), and to check and parse the format-packaged server maintenance strategy through the ACPI.
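The packaging and the subsequent check-and-parse step can be illustrated with a small sketch. The text does not specify the preset format, so the layout used here (a 4-byte tag, a payload length, a CRC32 checksum and a JSON payload) is purely an assumption, as are the names `MAGIC`, `package_strategy` and `check_and_parse`. The sketch also runs in ordinary Python rather than through ACPI.

```python
import json
import struct
import zlib

# Hypothetical preset format: 4-byte tag, payload length, CRC32, then JSON payload.
MAGIC = b"SMPL"
HEADER = struct.Struct("<4sII")  # little-endian: tag, length, checksum


def package_strategy(strategy: dict) -> bytes:
    """Format-package a maintenance strategy according to the assumed layout."""
    payload = json.dumps(strategy, sort_keys=True).encode("utf-8")
    return HEADER.pack(MAGIC, len(payload), zlib.crc32(payload)) + payload


def check_and_parse(packet: bytes) -> dict:
    """Check the packaged strategy, then parse it back into a dictionary."""
    if len(packet) < HEADER.size:
        raise ValueError("packet shorter than header")
    magic, length, checksum = HEADER.unpack_from(packet)
    payload = packet[HEADER.size:]
    if magic != MAGIC:
        raise ValueError("unexpected format tag")
    if len(payload) != length:
        raise ValueError("payload length mismatch")
    if zlib.crc32(payload) != checksum:
        raise ValueError("checksum mismatch")
    return json.loads(payload.decode("utf-8"))
```

A strategy that survives the round trip is returned unchanged, while a packet with a flipped byte is rejected before any parsing takes place.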
In one embodiment, the device 30 further comprises an operation information acquisition module and an updating module. The operation information acquisition module is configured to execute the operation information acquisition strategy through the resident function so as to acquire the current operation information of the server; the updating module is configured to update the server maintenance strategy according to the current operation information of the server.
In one embodiment, the update module includes an execution result acquisition sub-module and a first updating sub-module. The execution result acquisition sub-module is configured to acquire an execution result of the server maintenance strategy; the first updating sub-module is configured to update the server maintenance strategy according to the current operation information and the execution result.
In an embodiment, the execution result acquisition sub-module is configured to: acquire historical operation information from before the server maintenance strategy was executed; and determine an execution result of the server maintenance strategy according to the historical operation information and the current operation information.
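One way to derive an execution result from the two snapshots is to compare them metric by metric. The sketch below is only an illustration: the metric names, the `tolerance` parameter and the assumption that a lower value is better for every metric (error counts, temperatures, load) are not taken from the text.

```python
from typing import Dict


def determine_execution_result(
    historical: Dict[str, float],
    current: Dict[str, float],
    tolerance: float = 0.0,
) -> Dict[str, str]:
    """Label each metric present in both snapshots as improved, worsened or unchanged.

    Assumes, for illustration only, that a lower value is better for every metric.
    """
    result: Dict[str, str] = {}
    for name in historical.keys() & current.keys():
        delta = current[name] - historical[name]
        if delta < -tolerance:
            result[name] = "improved"
        elif delta > tolerance:
            result[name] = "worsened"
        else:
            result[name] = "unchanged"
    return result
```

Metrics that appear in only one snapshot are ignored here, since no comparison is possible for them.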
In an embodiment, the update module further comprises an execution result determining sub-module and a second updating sub-module. The execution result determining sub-module is configured to determine whether complete operation information and a complete execution result have been acquired; the second updating sub-module is configured to update the server maintenance strategy according to the operation information and the execution result in response to the complete operation information and the complete execution result having been acquired.
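The completeness gate described above can be sketched as a guard in front of the update. The text does not define what "complete" means, so the required field names below and the `updater` callback are hypothetical; the sketch keeps the existing strategy whenever either input is incomplete.

```python
from typing import Callable, Dict, Iterable, Optional

# Hypothetical fields that must be present for each record to count as complete.
REQUIRED_INFO_FIELDS = ("cpu_load", "cpu_temp_c", "correctable_errors")
REQUIRED_RESULT_FIELDS = ("status",)


def is_complete(record: Dict[str, Optional[object]], required: Iterable[str]) -> bool:
    """A record is complete when every required field is present and not None."""
    return all(record.get(key) is not None for key in required)


def maybe_update_strategy(
    current_strategy: str,
    operation_info: Dict[str, Optional[object]],
    execution_result: Dict[str, Optional[object]],
    updater: Callable[[dict, dict], str],
) -> str:
    """Update the strategy only when both inputs are complete; otherwise keep it."""
    if not is_complete(operation_info, REQUIRED_INFO_FIELDS):
        return current_strategy
    if not is_complete(execution_result, REQUIRED_RESULT_FIELDS):
        return current_strategy
    return updater(operation_info, execution_result)
```

Keeping the old strategy on incomplete data is a design choice of this sketch; the text only states that the update happens in response to complete data being acquired.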
In one embodiment, the update module includes a third updating sub-module, a fourth updating sub-module, a fifth updating sub-module, a sixth updating sub-module and a seventh updating sub-module. The third updating sub-module is configured to update the server maintenance strategy to suppress the correctable error storm when it is detected that the current operation information indicates a correctable error storm on the server; the fourth updating sub-module is configured to update the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory when it is detected that the current operation information indicates an excessive load; the fifth updating sub-module is configured to update the server maintenance strategy to disable the failed device when it is detected that the current operation information indicates a device failure; the sixth updating sub-module is configured to update the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory when it is detected that the current operation information indicates abnormal heat dissipation; and the seventh updating sub-module is configured to update the server maintenance strategy to an energy-saving strategy when it is detected that the current operation information indicates that the server is idle with a low load.
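The five update rules above amount to a lookup from a detected condition to a strategy, which can be sketched as a table. The enum members and strategy strings are hypothetical labels chosen for illustration; how each condition is actually detected is outside this sketch.

```python
from enum import Enum


class Condition(Enum):
    """Conditions named in the embodiment; the identifiers are illustrative."""
    CORRECTABLE_ERROR_STORM = "correctable_error_storm"
    EXCESSIVE_LOAD = "excessive_load"
    DEVICE_FAILURE = "device_failure"
    ABNORMAL_HEAT_DISSIPATION = "abnormal_heat_dissipation"
    IDLE_LOW_LOAD = "idle_low_load"


# Both excessive load and abnormal heat dissipation map to the same throttling strategy.
STRATEGY_FOR_CONDITION = {
    Condition.CORRECTABLE_ERROR_STORM: "suppress_correctable_error_storm",
    Condition.EXCESSIVE_LOAD: "throttle_cpu_or_memory",
    Condition.DEVICE_FAILURE: "disable_failed_device",
    Condition.ABNORMAL_HEAT_DISSIPATION: "throttle_cpu_or_memory",
    Condition.IDLE_LOW_LOAD: "energy_saving",
}


def update_strategy(condition: Condition, current_strategy: str) -> str:
    """Return the strategy for the detected condition, or keep the current one."""
    return STRATEGY_FOR_CONDITION.get(condition, current_strategy)
```

Here `throttle_cpu_or_memory` stands for reducing the frequency, lowering the voltage and limiting the current of the CPU or the memory.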
For specific limitations on the system maintenance device, reference may be made to the limitations on the system maintenance method above, which are not repeated here. The modules in the system maintenance device described above may be implemented in whole or in part by software, hardware, or a combination thereof. The above modules may be embedded in, or independent of, a processor in the computer device in the form of hardware, or may be stored in a memory in the computer device in the form of software, so that the processor can call them and execute the operations corresponding to each module.
In one embodiment, a computer device is provided, which may be a server, the internal structure of which may be as shown in FIG. 4. The computer device includes a processor, a memory, a network interface, and a database connected by a system bus. The processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, a computer program, and a database. The internal memory provides an environment for running the operating system and the computer program stored in the non-volatile storage medium. The database of the computer device is used for storing the execution results of strategies and the operation information of the server. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program, when executed by the processor, implements the system maintenance method.
Those skilled in the art will appreciate that the structure shown in FIG. 4 is only a block diagram of the parts relevant to the present solution and does not limit the computer device to which the present solution is applied; a particular computer device may include more or fewer components than those shown, may combine some of the components, or may have a different arrangement of components.
In one embodiment, a computer device is provided comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, the processor implementing the following steps when executing the computer program:
acquiring a server maintenance instruction sent by a user, wherein the server maintenance instruction carries a server maintenance strategy;
triggering an interrupt according to the server maintenance instruction so as to enable the system to enter a system management mode SMM;
and executing the server maintenance strategy through a resident function in the SMM so as to maintain the server.
In one embodiment, the processor when executing the computer program further performs the steps of:
performing format packaging on the server maintenance strategy according to a preset format; triggering the interrupt through the Advanced Configuration and Power Interface (ACPI), and checking and parsing the format-packaged server maintenance strategy through the ACPI.
In one embodiment, the processor when executing the computer program further performs the steps of:
executing the operation information acquisition strategy through the resident function to acquire the current operation information of the server; and updating the server maintenance strategy according to the current operation information of the server.
In one embodiment, the processor when executing the computer program further performs the steps of:
acquiring an execution result of the server maintenance strategy; and updating the server maintenance strategy according to the current operation information and the execution result.
In one embodiment, the processor when executing the computer program further performs the steps of:
acquiring historical operation information before executing the server maintenance strategy; and determining an execution result of the server maintenance strategy according to the historical operation information and the current operation information.
In one embodiment, the processor when executing the computer program further performs the steps of:
determining whether complete operation information and a complete execution result have been acquired; and in response to the complete operation information and the complete execution result having been acquired, updating the server maintenance strategy according to the operation information and the execution result.
In one embodiment, the processor when executing the computer program further performs the steps of:
when it is detected that the current operation information indicates a correctable error storm on the server, updating the server maintenance strategy to suppress the correctable error storm; when it is detected that the current operation information indicates an excessive load, updating the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory; when it is detected that the current operation information indicates a device failure, updating the server maintenance strategy to disable the failed device; when it is detected that the current operation information indicates abnormal heat dissipation, updating the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory; and when it is detected that the current operation information indicates that the server is idle with a low load, updating the server maintenance strategy to an energy-saving strategy.
In one embodiment, a computer readable storage medium is provided having a computer program stored thereon, the computer program, when executed by a processor, implementing the following steps:
acquiring a server maintenance instruction sent by a user, wherein the server maintenance instruction carries a server maintenance strategy;
triggering an interrupt according to the server maintenance instruction so as to enable the system to enter a system management mode SMM;
and executing the server maintenance strategy through a resident function in the SMM so as to maintain the server.
In one embodiment, the computer program when executed by the processor further performs the steps of:
performing format packaging on the server maintenance strategy according to a preset format; triggering the interrupt through the Advanced Configuration and Power Interface (ACPI), and checking and parsing the format-packaged server maintenance strategy through the ACPI.
In one embodiment, the computer program when executed by the processor further performs the steps of:
executing the operation information acquisition strategy through the resident function to acquire the current operation information of the server; and updating the server maintenance strategy according to the current operation information of the server.
In one embodiment, the computer program when executed by the processor further performs the steps of:
acquiring an execution result of the server maintenance strategy; and updating the server maintenance strategy according to the current operation information and the execution result.
In one embodiment, the computer program when executed by the processor further performs the steps of:
acquiring historical operation information before executing the server maintenance strategy; and determining an execution result of the server maintenance strategy according to the historical operation information and the current operation information.
In one embodiment, the computer program when executed by the processor further performs the steps of:
determining whether complete operation information and a complete execution result have been acquired; and in response to the complete operation information and the complete execution result having been acquired, updating the server maintenance strategy according to the operation information and the execution result.
In one embodiment, the computer program when executed by the processor further performs the steps of:
when it is detected that the current operation information indicates a correctable error storm on the server, updating the server maintenance strategy to suppress the correctable error storm; when it is detected that the current operation information indicates an excessive load, updating the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory; when it is detected that the current operation information indicates a device failure, updating the server maintenance strategy to disable the failed device; when it is detected that the current operation information indicates abnormal heat dissipation, updating the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory; and when it is detected that the current operation information indicates that the server is idle with a low load, updating the server maintenance strategy to an energy-saving strategy.
Those skilled in the art will appreciate that all or part of the above-described methods may be implemented by a computer program stored on a non-transitory computer readable storage medium; when executed, the computer program may perform the steps of the method embodiments described above. Any reference to memory, storage, database, or other medium used in the various embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include Read Only Memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDR SDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), Direct Rambus Dynamic RAM (DRDRAM), and Rambus Dynamic RAM (RDRAM), among others.
The technical features of the above embodiments may be combined arbitrarily. For brevity, not all possible combinations of the technical features in the above embodiments are described; however, as long as a combination of these technical features involves no contradiction, it should be considered to be within the scope of this description.
The above examples represent only a few embodiments of the present application; they are described in specific detail but are not to be construed as limiting the scope of the invention. It should be noted that those skilled in the art could make various modifications and improvements without departing from the spirit of the present application, all of which fall within the scope of protection of the present application. Accordingly, the scope of protection of the present application shall be determined by the appended claims.

Claims (10)

1. A system maintenance method, comprising:
acquiring a server maintenance instruction sent by a user, wherein the server maintenance instruction carries a server maintenance strategy;
triggering an interrupt according to the server maintenance instruction so as to enable the system to enter a system management mode SMM;
and executing the server maintenance strategy through a resident function in the SMM so as to maintain the server.
2. The system maintenance method according to claim 1, wherein the step of triggering an interrupt according to the server maintenance instruction to cause the system to enter a system management mode SMM includes:
performing format packaging on the server maintenance strategy according to a preset format;
triggering the interrupt through the Advanced Configuration and Power Interface (ACPI), and checking and parsing the format-packaged server maintenance strategy through the ACPI.
3. The system maintenance method according to claim 2, wherein the server maintenance instruction further carries an operation information acquisition policy, and the step of triggering an interrupt according to the server maintenance instruction to cause the system to enter SMM further comprises:
executing the operation information acquisition strategy through the resident function to acquire the current operation information of the server;
and updating the server maintenance strategy according to the current operation information of the server.
4. The system maintenance method according to claim 3, wherein the step of updating the server maintenance policy according to the current operation information of the server includes:
acquiring an execution result of the server maintenance strategy;
and updating the server maintenance strategy according to the current operation information and the execution result.
5. The system maintenance method according to claim 4, wherein the step of obtaining the execution result of the server maintenance policy includes:
acquiring historical operation information before executing the server maintenance strategy;
and determining an execution result of the server maintenance strategy according to the historical operation information and the current operation information.
6. The system maintenance method according to claim 4, wherein the step of updating the server maintenance policy according to the operation information and the execution result further comprises:
determining whether complete operation information and a complete execution result have been acquired;
and in response to the complete operation information and the complete execution result having been acquired, updating the server maintenance strategy according to the operation information and the execution result.
7. The system maintenance method according to claim 3, wherein the step of updating the server maintenance policy according to the current operation information of the server includes:
when it is detected that the current operation information indicates a correctable error storm on the server, updating the server maintenance strategy to suppress the correctable error storm;
when it is detected that the current operation information indicates an excessive load, updating the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory;
when it is detected that the current operation information indicates a device failure, updating the server maintenance strategy to disable the failed device;
when it is detected that the current operation information indicates abnormal heat dissipation, updating the server maintenance strategy to reduce the frequency, lower the voltage and limit the current of the CPU or the memory;
and when it is detected that the current operation information indicates that the server is idle with a low load, updating the server maintenance strategy to an energy-saving strategy.
8. A system maintenance device, the device comprising:
the instruction sending module is used for obtaining a server maintenance instruction sent by a user, wherein the server maintenance instruction carries a server maintenance policy;
the interrupt module is used for triggering interrupt according to the server maintenance instruction so as to enable the system to enter a System Management Mode (SMM);
and the maintenance module is used for executing the server maintenance strategy by using a resident function in the SMM so as to maintain the server.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the method according to any one of claims 1 to 7 when executing the computer program.
10. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 7.
CN202311457401.5A 2023-11-03 2023-11-03 System maintenance method, device, computer equipment and storage medium Pending CN117421038A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311457401.5A CN117421038A (en) 2023-11-03 2023-11-03 System maintenance method, device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311457401.5A CN117421038A (en) 2023-11-03 2023-11-03 System maintenance method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN117421038A true CN117421038A (en) 2024-01-19

Family

ID=89524573

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311457401.5A Pending CN117421038A (en) 2023-11-03 2023-11-03 System maintenance method, device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN117421038A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination