CN117033144A - Remote inspection method, equipment and medium for server - Google Patents

Remote inspection method, equipment and medium for server Download PDF

Info

Publication number
CN117033144A
CN117033144A CN202311296503.3A CN202311296503A CN117033144A CN 117033144 A CN117033144 A CN 117033144A CN 202311296503 A CN202311296503 A CN 202311296503A CN 117033144 A CN117033144 A CN 117033144A
Authority
CN
China
Prior art keywords
historical
inspection
server
management controller
target server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202311296503.3A
Other languages
Chinese (zh)
Other versions
CN117033144B (en
Inventor
周高登
陈立军
郭雄
屈行运
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Barda Technology Co ltd
Original Assignee
Wuhan Barda Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Barda Technology Co ltd filed Critical Wuhan Barda Technology Co ltd
Priority to CN202311296503.3A priority Critical patent/CN117033144B/en
Publication of CN117033144A publication Critical patent/CN117033144A/en
Application granted granted Critical
Publication of CN117033144B publication Critical patent/CN117033144B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/26Special purpose or proprietary protocols or architectures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application discloses a server remote inspection method, equipment and medium, wherein a client communicates with a baseboard management controller in a target server based on an IPMI protocol, and the method comprises the following steps: acquiring the data processing load capacity of the substrate management controller; determining a historical patrol score of the target server during the latest historical patrol; determining a target patrol period of a target server based on the data processing load capacity and the historical patrol score; and sending a remote inspection instruction to the baseboard management controller according to the target inspection period so that the baseboard management controller performs inspection processing on the target server. According to the embodiment of the application, the target inspection period is comprehensively determined through the data processing load capacity of the substrate management controller and the historical inspection score of the target server, so that the inspection period can be changed along with the load of the substrate management controller and the historical inspection score, the unified fixed inspection period is avoided, and the reliability and timeliness of remote inspection are improved.

Description

Remote inspection method, equipment and medium for server
Technical Field
The application relates to the technical field of server inspection, in particular to a server remote inspection method, equipment and medium.
Background
When remote inspection is performed on a server, inspection processing is generally performed according to a uniform fixed inspection cycle. However, due to the difference between different servers, a uniform fixed inspection cycle may not fit all servers, thereby reducing the reliability and timeliness of remote inspection.
Disclosure of Invention
The embodiment of the application provides a remote inspection method, equipment and medium for a server, aiming at improving the reliability and timeliness of remote inspection.
In one aspect, the present application provides a method for remote inspection of a server, where the method is applied to a client, and the client communicates with a baseboard management controller in a target server based on an IPMI protocol, and the method includes:
acquiring the data processing load capacity of the baseboard management controller;
acquiring a historical operation state parameter of the target server during the latest historical inspection;
based on the historical running state parameters, determining the historical inspection score of the target server in the last historical inspection;
determining a target patrol period of the target server based on the data processing load capacity and the historical patrol score;
and sending a remote inspection instruction to the baseboard management controller according to the target inspection period so that the baseboard management controller performs inspection processing on the target server.
In some embodiments, the obtaining the data processing load of the baseboard management controller includes:
acquiring the type of the controller to which the baseboard management controller belongs;
determining a maximum data throughput associated with the controller type;
acquiring historical data transmission quantity between the client and the baseboard management controller within a preset historical time length;
and determining the data processing load capacity of the baseboard management controller based on the historical data transmission capacity and the maximum data processing capacity.
In some embodiments, the determining the data processing load of the baseboard management controller based on the historical data transmission amount and the maximum data processing amount includes:
acquiring a temperature value of the baseboard management controller by using a temperature sensor arranged on the baseboard management controller;
determining a ratio of the historical data transfer amount to the maximum data processing amount;
and correcting the ratio based on the temperature value to obtain the data processing load capacity of the baseboard management controller.
In some embodiments, the correcting the ratio based on the temperature value to obtain the data processing load of the baseboard management controller includes:
acquiring a preset normal temperature range of the baseboard management controller;
and when the temperature value is smaller than or equal to the maximum value in the preset normal temperature range, not correcting the ratio, and taking the ratio as the data processing load of the baseboard management controller.
In some embodiments, the determining, based on the historical operating state parameter, a historical patrol score of the target server at a last historical patrol includes:
determining the type of the server to which the target server belongs;
acquiring expected running state parameters associated with the server types;
determining an absolute value of a difference between the historical operating state parameter and the expected operating state parameter;
and determining the historical inspection score of the target server in the last historical inspection based on the absolute value of the difference value.
In some embodiments, the determining the historical patrol score of the target server at the last historical patrol based on the absolute value of the difference comprises:
acquiring service functions provided by the target server;
determining an importance coefficient of the service function;
determining an operation state parameter score corresponding to the absolute value of the difference value;
and taking the product of the operation state parameter score and the importance coefficient as a historical inspection score of the target server during the latest historical inspection.
In some embodiments, the determining the target patrol period of the target server based on the data processing load, the historical patrol score includes:
acquiring a preset inspection period of the target server;
when the data processing load capacity is larger than a first preset threshold value, a first preset amplitude is obtained;
when the historical inspection score is larger than a second preset threshold value, a second preset amplitude is obtained;
determining a larger value of the first preset amplitude and the second preset amplitude;
and increasing the preset inspection period according to the larger value of the first preset amplitude and the second preset amplitude to obtain the target inspection period of the target server.
In some embodiments, the obtaining the preset polling period of the target server includes:
acquiring historical access quantity of the target server in a preset historical time;
and determining the preset inspection period corresponding to the historical access quantity, wherein the preset inspection period is inversely related to the historical access quantity.
In another aspect, the present application also provides a computer apparatus, including:
one or more processors;
a memory; and
one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the processor to implement the steps in any of the server remote patrol methods.
In another aspect, the present application also provides a computer readable storage medium having stored thereon a computer program, the computer program being loaded by a processor to perform the steps of any of the server remote inspection methods described herein.
The method, the equipment and the medium for remote inspection of the server, provided by the embodiment of the application, are applied to a client, and the client communicates with a baseboard management controller in a target server based on an IPMI protocol, and the method comprises the following steps: acquiring the data processing load capacity of the substrate management controller; acquiring a historical operation state parameter of a target server in the last historical inspection; based on the historical running state parameters, determining the historical inspection score of the target server in the last historical inspection; determining a target patrol period of a target server based on the data processing load capacity and the historical patrol score; and sending a remote inspection instruction to the baseboard management controller according to the target inspection period so that the baseboard management controller performs inspection processing on the target server. According to the embodiment of the application, the target inspection period is comprehensively determined through the data processing load capacity of the substrate management controller and the historical inspection score of the target server, so that the inspection period can be changed along with the load of the substrate management controller and the historical inspection score, the unified fixed inspection period is avoided, and the reliability and timeliness of remote inspection are improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of one embodiment of a server remote inspection method provided in an embodiment of the present application;
FIG. 2 is a flowchart of another embodiment of a remote inspection method for a server according to an embodiment of the present application;
FIG. 3 is a flow chart of a further embodiment of a server remote inspection method provided in an embodiment of the present application;
FIG. 4 is a schematic diagram of an embodiment of a computer device provided in an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to fall within the scope of the application.
In the description of the present application, the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more of the described features. In the description of the present application, the meaning of "a plurality" is two or more, unless explicitly defined otherwise.
In the present application, the term "exemplary" is used to mean "serving as an example, instance, or illustration. Any embodiment described as "exemplary" in this disclosure is not necessarily to be construed as preferred or advantageous over other embodiments. The following description is presented to enable any person skilled in the art to make and use the application. In the following description, details are set forth for purposes of explanation. It will be apparent to one of ordinary skill in the art that the present application may be practiced without these specific details. In other instances, well-known structures and processes have not been described in detail so as not to obscure the description of the application with unnecessary detail. Thus, the present application is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
The embodiment of the application provides a server remote inspection method, equipment and medium, and the method, the equipment and the medium are respectively described in detail below.
Referring to fig. 1, fig. 1 is a flow chart of an embodiment of a server remote inspection method provided in an embodiment of the present application, where the server remote inspection method is applied to a client, and the client communicates with a baseboard management controller (Board Management Controller, BMC) in a target server based on an IPMI (Intelligent Platform Management Interface ) protocol, so that functions of hardware monitoring, remote management, diagnosis, etc. of the target server are implemented by using the BMC, so as to help an administrator of the client discover and solve problems in the target server in time, and improve reliability and availability of a target server system.
IPMI (Intelligent Platform Management Interface ) is an industry standard adopted by peripheral devices of enterprise systems, and based on the IPMI protocol, the operation status of a large number of servers can be intelligently monitored, controlled and automatically reported back so as to reduce the cost of the server system.
The IPMI protocol is a BMC (Board Management Controller, baseboard management controller), typically an embedded device, such as a stand-alone board mounted on a server motherboard, which does not rely on the processor, BIOS or operating system of the server to work, and can provide hardware monitoring, remote management, diagnostics, etc. to help an administrator discover and solve problems in the server in time, improving reliability and availability of the server system. Due to the performance difference of BMCs in different servers, the unified fixed inspection period may cause overload of BMCs in part of servers, so that remote inspection errors are caused, and the reliability of remote inspection is reduced. Due to the running state difference of different servers, the unified fixed inspection period may cause the remote inspection of part of the servers to be not timely enough, and the timeliness of the remote inspection is reduced.
For this reason, the server remote inspection method provided by the embodiment of the application may include:
101. acquiring the data processing load capacity of the substrate management controller;
in this embodiment, the BMC may perform data transmission with the target server and feedback data to the client, so as to implement functions such as hardware monitoring, remote management, and diagnosis of the target server, so that the BMC has a load that needs to perform data processing, and thus the corresponding data processing load of the BMC may be calculated. The data processing load represents the load size of the BMC when data processing is performed.
102. Acquiring a historical operation state parameter of a target server in the last historical inspection;
in this embodiment, the last history inspection, that is, the last inspection process performed by the BMC on the target server. The historical operating state parameters may include at least one of historical CPU usage, memory usage, hard disk occupancy, number of processes, network bandwidth occupancy, middleware response time, temperature, voltage, current, fan operating state, power state, etc. of the target server. The historical running state parameter of the target server during the latest historical inspection can be obtained by the BMC through the latest inspection processing of the target server. And after the latest inspection processing of the target server is finished, the BMC feeds back the historical running state parameters to the client.
It should be noted that, in the server remote inspection method shown in the embodiment of the present application, the execution is started immediately after the execution of the last inspection processing of the target server by the BMC is completed, that is, the step 101 and the subsequent steps are executed after the execution of the last inspection processing of the target server by the BMC is completed.
103. Based on the historical running state parameters, determining the historical inspection score of the target server in the last historical inspection;
in this embodiment, a scoring rule for the historical running state parameter is preset in the client, and the historical running state parameter is scored according to the scoring rule, so that the historical inspection score of the target server in the last historical inspection can be obtained. The historical patrol score characterizes the well degree of the running state of the target server in the last historical patrol.
In some embodiments, determining the historical patrol score of the target server at the last historical patrol based on the historical operating state parameters may include: determining a server type to which the target server belongs, where the server type may specifically refer to a type of server architecture, or a type of server usage, where the type of server architecture may include, for example, CISC (Complex Instruction Set Computer, complex instruction system computer) servers, RISC (Reduced Instruction Set Computing, reduced instruction set) servers, VLIW (Very Long Instruction Word, very long instruction set architecture) servers, and the like, and the type of server usage may include, for example, general-purpose servers, special-purpose servers, and the like; the method comprises the steps of obtaining expected operation state parameters associated with server types, wherein the expected operation state parameters associated with different server types are preset, namely, the expected operation state parameters, namely, the optimal operation state parameters of corresponding servers in an ideal state, can be set manually based on actual requirements, and it is understood that the expected operation state parameters of different server architectures or servers with different server uses are different due to differences of server architectures, server uses and the like, so that the expected operation state parameters are more in accordance with expected conditions of the servers; determining absolute values of differences between the historical operating state parameters and the expected operating state parameters, wherein if a plurality of different historical operating state parameters exist at the same time, a plurality of corresponding expected operating state parameters exist, so that the absolute values of differences between each historical operating state parameter and the corresponding expected operating state parameter are respectively determined, and the absolute values of the differences represent the difference between the historical operating state and the expected operating state of the target server during the last historical inspection; based on the absolute value of the difference, a historical patrol score of the target server at the time of the last historical patrol is determined, e.g., the historical patrol score may be inversely related to the absolute value of the difference.
In a further embodiment, determining the historical patrol score of the target server at the time of the last historical patrol based on the absolute value of the difference may include: acquiring service functions provided by a target server, for example, a service system comprises a plurality of servers, wherein the plurality of servers can comprise the target server, and the service functions provided by different servers can be different, so that different service services are provided for users; determining importance coefficients of service functions, wherein the importance coefficients of different service functions are preset, the importance coefficients can be manually set based on actual requirements, and in general, the larger the importance coefficient is, the more important the corresponding service function is; determining an operating state parameter score corresponding to the absolute value of the difference value, e.g., the operating state parameter score may be inversely related to the absolute value of the difference value; the product of the running state parameter score and the importance coefficient is used as the historical inspection score of the target server in the latest historical inspection, so that the historical inspection scores of servers providing different service functions are more differentiated, and different inspection standards for different servers are more met, namely, the historical inspection scores are more accurate.
104. Determining a target patrol period of a target server based on the data processing load capacity and the historical patrol score;
in this embodiment, the target inspection period is comprehensively determined based on the data processing load capacity and the historical inspection score, so that the target inspection period comprehensively considers the load condition of the BMC and the last inspection condition of the target server. Because the target inspection period can be changed along with the load condition of the BMC, the problem that when the load of the BMC is high, the target server is inspected frequently still can be avoided, and the load of the BMC is further increased. Because the target inspection period can be changed along with the last inspection condition of the target server, frequent inspection of the target server can be avoided when the last inspection condition of the target server is better, and therefore the burden of the BMC can be reduced.
105. And sending a remote inspection instruction to the baseboard management controller according to the target inspection period so that the baseboard management controller performs inspection processing on the target server.
In this embodiment, according to the target inspection cycle, a remote inspection instruction is sent to the BMC, that is, when the time interval between the current time point and the time point of the last historical inspection of the target server by the BMC is equal to the target inspection cycle, the remote inspection instruction is sent to the BMC. Therefore, when the BMC receives the remote inspection instruction, the BMC can carry out inspection processing on the target server, and then the operation state parameter of the target server obtained through the inspection processing is used as the historical operation state parameter of the target server in the last historical inspection and is fed back to the client.
According to the server remote inspection method provided by the embodiment of the application, the target inspection period is comprehensively determined through the data processing load capacity of the substrate management controller and the historical inspection score of the target server, so that the inspection period can be changed along with the load of the substrate management controller and the historical running state of the target server in the last historical inspection, the unified fixed inspection period is avoided, the target inspection period of the target server is more reasonable, and the reliability and timeliness of remote inspection are improved.
As shown in fig. 2, on the basis of the embodiment shown in fig. 1, acquiring the data processing load of the baseboard management controller may include:
201. acquiring the type of a controller to which the baseboard management controller belongs;
in this embodiment, the controller type may specifically refer to a model of the BMC or a configuration rule of the BMC. It will be appreciated that the performance of BMCs of different models or configuration rules tend to be different.
202. Determining the maximum data processing amount associated with the controller type;
in this embodiment, the maximum data processing amounts of different controller types are preset, and the maximum data processing amounts represent the maximum data processing capacity of the BMC of the controller type. The association of the controller type with the maximum data throughput may be set manually based on actual demand.
203. Acquiring a historical data transmission amount between a client and a baseboard management controller within a preset historical time;
in this embodiment, the history duration is preset, that is, the interval duration between the last history inspection and the last second history inspection, so that the amount of historical data transmission between the client and the BMC within the preset history duration, that is, the amount of historical data transmission between the client and the BMC at the time point of the last history inspection and the time point of the last second history inspection. The historical data transmission amount may include a first historical data transmission amount sent by the BMC to the client and a second historical data transmission amount sent by the client to the BMC, where the first historical data transmission amount and the second historical data transmission amount may be calculated by the client instead of the BMC to avoid further increasing the load of the BMC.
204. And determining the data processing load of the baseboard management controller based on the historical data transmission amount and the maximum data processing amount.
In this embodiment, since the main function of the BMC is to provide the client with functions of hardware monitoring, remote management, diagnosis, and the like for the target server, the data processing load of the baseboard management controller can be measured based on the historical data transmission amount between the client and the BMC within the preset historical time.
In some embodiments, determining the data processing load of the baseboard management controller based on the historical data transmission amount and the maximum data processing amount may include: the temperature sensor arranged on the baseboard management controller is utilized to obtain the temperature value of the baseboard management controller, for example, the temperature sensor can be arranged on the surface of the BMC, the temperature sensor is used for obtaining the surface temperature of the BMC, the temperature sensor is connected with the BMC, the BMC can automatically take the surface temperature detected by the temperature sensor as the temperature value of the BMC through the arrangement of the IPMI firmware in the BMC, and the temperature value of the BMC can be fed back to the client, so that the client can obtain the temperature value of the BMC; determining the ratio of the historical data transmission quantity to the maximum data processing quantity; and correcting the ratio based on the temperature value to obtain the data processing load capacity of the substrate management controller. It will be appreciated that when the load of the BMC is high, the heating value of the BMC will generally increase, so that the temperature value of the BMC increases, and therefore the ratio can be corrected based on the temperature value, resulting in more accurate data processing capacity of the BMC.
In a further embodiment, correcting the ratio based on the temperature value to obtain the data processing load of the baseboard management controller may include: acquiring a preset normal temperature range of the baseboard management controller, wherein the preset normal temperature range of the BMC can be set manually based on actual conditions; when the temperature value is larger than the maximum value in the preset normal temperature range, the substrate management controller is judged to be excessively heated, and the BMC load is judged to be large, so that the ratio can be increased, and the data processing load capacity of the substrate management controller is obtained. The increasing amplitude of the ratio may be a preset fixed amplitude. When the temperature value is smaller than or equal to the maximum value in the preset normal temperature range, the substrate management controller is judged to heat normally, and the BMC load is judged not to be large, so that the ratio is not corrected, and the ratio is directly used as the data processing load capacity of the substrate management controller.
In the technical scheme disclosed in the embodiment, the maximum data processing capacity associated with the controller type is determined by acquiring the controller type to which the BMC belongs, and the data processing load capacity of the BMC is determined based on the historical data transmission capacity and the maximum data processing capacity between the client and the BMC, so that the measurement of the data processing load capacity of the BMC also considers the controller types to which different BMCs belong, and the obtained B data processing load capacity is more accurate.
As shown in fig. 3, on the basis of the embodiment shown in fig. 1 or fig. 2, determining the target patrol period of the target server based on the data processing load amount and the historical patrol score may include:
301. acquiring a preset inspection period of a target server;
in some embodiments, the preset polling period of the target server may be set manually in advance, and the preset polling period is the original fixed polling period.
In some embodiments, obtaining the preset inspection cycle of the target server may include: acquiring the historical access amount of a target server in a preset historical time, wherein the preset historical time is the preset historical time described in the embodiment shown in fig. 2; and determining a preset inspection period corresponding to the historical access quantity, wherein the preset inspection period is inversely related to the historical access quantity. It can be understood that, when the historical access amount of the target server within the preset historical time period is larger, if the service function provided by the target server fails, the influence is larger, so that more frequent inspection is required, and therefore, the preset inspection period is inversely related to the historical access amount, and the specific corresponding relation between the preset inspection period and the historical access amount can be set manually in advance, which is not limited herein.
302. When the data processing load capacity is larger than a first preset threshold value, a first preset amplitude is obtained;
in this embodiment, when the data processing load capacity is greater than the first preset threshold, it is determined that the load of the BMC is too large, so that the preset inspection period may be increased to reduce the inspection frequency, and at this time, the first preset amplitude may be obtained.
303. When the historical inspection score is larger than a second preset threshold value, a second preset amplitude is obtained;
in this embodiment, when the historical inspection score is greater than the first preset threshold, it is determined that the running state of the target server is relatively good during the last historical inspection, so that the preset inspection period can be increased to reduce the inspection frequency, and at this time, the second preset amplitude can be obtained.
304. Determining a larger value of the first preset amplitude and the second preset amplitude;
305. and increasing the preset inspection period according to the larger value of the first preset amplitude and the second preset amplitude to obtain the target inspection period of the target server.
In this embodiment, the preset inspection period is increased by determining a larger value of the first preset amplitude and the second preset amplitude and according to the larger value of the first preset amplitude and the second preset amplitude, but not according to a smaller value of the first preset amplitude and the second preset amplitude, so that the problem that the inspection period of the target server is overlong, the inspection times are too small, that is, the excessive adjustment of the inspection period of the target server is avoided.
In some embodiments, when the data processing load is greater than the first preset threshold, but the historical inspection score is less than or equal to the second preset threshold, the preset inspection period is increased only according to the first preset amplitude, so as to obtain the target inspection period of the target server.
In some embodiments, when the data processing load is less than or equal to the first preset threshold, but the historical inspection score is greater than the second preset threshold, the preset inspection period is increased only according to the second preset amplitude, so as to obtain the target inspection period of the target server.
In some embodiments, when the data processing load is less than or equal to the first preset threshold and the historical patrol score is less than or equal to the second preset threshold, the preset patrol cycle is not adjusted, but the preset patrol cycle is directly used as the target patrol cycle of the target server.
In the technical scheme disclosed in the embodiment, when the data processing load capacity is greater than a first preset threshold value, a first preset amplitude is obtained, when the history inspection score is greater than a second preset threshold value, a second preset amplitude is obtained, and according to the larger value of the first preset amplitude and the second preset amplitude, the preset inspection period is increased to obtain the target inspection period of the target server, so that the excessive adjustment of the inspection period of the target server is avoided, and the adjustment of the inspection period of the target server is more suitable.
In addition to the above-described remote inspection method for a server, an embodiment of the present application further provides a computer device, where the computer device includes:
one or more processors;
a memory; and
one or more applications, wherein the one or more applications are stored in memory and configured to be executed by the processor to perform the steps of any of the server remote inspection method embodiments described above.
As shown in fig. 4, a schematic structural diagram of a computer device according to an embodiment of the present application is shown, specifically:
the computer device may include one or more processors 401 of a processing core, a storage unit 402 of one or more computer readable storage media, a power supply 403, and an input unit 404, among other components. Those skilled in the art will appreciate that the computer device structure shown in FIG. 4 is not limiting of the computer device and may include more or fewer components than shown, or may be combined with certain components, or a different arrangement of components. Wherein:
the processor 401 is a control center of the computer device, connects respective portions of the entire computer device using various interfaces and lines, and performs various functions of the computer device and processes data by running or executing software programs and/or modules stored in the storage unit 402 and calling data stored in the storage unit 402, thereby performing overall monitoring of the computer device. Optionally, processor 401 may include one or more processing cores; preferably, the processor 401 may integrate an application processor and a modem processor, wherein the application processor mainly processes an operating system, a user interface, an application program, etc., and the modem processor mainly processes wireless communication. It will be appreciated that the modem processor described above may not be integrated into the processor 401.
The storage unit 402 may be used to store software programs and modules, and the processor 401 executes various functional applications and data processing by running the software programs and modules stored in the storage unit 402. The storage unit 402 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like; the storage data area may store data created according to the use of the computer device, etc. In addition, the storage unit 402 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device. Accordingly, the memory unit 402 may also include a memory controller to provide the processor 401 with access to the memory unit 402.
The computer device further comprises a power supply 403 for supplying power to the various components, preferably the power supply 403 may be logically connected to the processor 401 by a power management system, so that functions of charge, discharge, and power consumption management may be performed by the power management system. The power supply 403 may also include one or more of any of a direct current or alternating current power supply, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and the like.
The computer device may also include an input unit 404, which input unit 404 may be used to receive input numeric or character information and to generate keyboard, mouse, joystick, optical or trackball signal inputs related to user settings and function control.
Although not shown, the computer device may further include a display unit or the like, which is not described herein. In particular, in the embodiment of the present application, the processor 401 in the computer device loads executable files corresponding to the processes of one or more application programs into the storage unit 402 according to the following instructions, and the processor 401 executes the application programs stored in the storage unit 402, so as to implement various functions, as follows:
acquiring the data processing load capacity of the substrate management controller; acquiring a historical operation state parameter of a target server in the last historical inspection; based on the historical running state parameters, determining the historical inspection score of the target server in the last historical inspection; determining a target patrol period of a target server based on the data processing load capacity and the historical patrol score; and sending a remote inspection instruction to the baseboard management controller according to the target inspection period so that the baseboard management controller performs inspection processing on the target server.
According to the server remote inspection method provided by the embodiment of the application, the target inspection period is comprehensively determined through the data processing load capacity of the substrate management controller and the historical inspection score of the target server, so that the inspection period can be changed along with the load of the substrate management controller and the historical inspection score, the unified fixed inspection period is avoided, and the reliability and timeliness of remote inspection are improved.
To this end, embodiments of the present application provide a computer-readable storage medium, which may include: read Only Memory (ROM), random access Memory (RAM, random Access Memory), magnetic or optical disk, and the like. The computer readable storage medium stores a plurality of instructions that can be loaded by a processor to perform any of the steps in the server remote inspection method provided by the embodiments of the present application. For example, the instructions may perform the steps of:
acquiring the data processing load capacity of the substrate management controller; acquiring a historical operation state parameter of a target server in the last historical inspection; based on the historical running state parameters, determining the historical inspection score of the target server in the last historical inspection; determining a target patrol period of a target server based on the data processing load capacity and the historical patrol score; and sending a remote inspection instruction to the baseboard management controller according to the target inspection period so that the baseboard management controller performs inspection processing on the target server.
In the foregoing embodiments, the descriptions of the embodiments are emphasized, and for parts of one embodiment that are not described in detail, reference may be made to related descriptions of other embodiments.
The foregoing describes in detail a method, apparatus and medium for remote inspection of a server according to embodiments of the present application, and specific examples are applied to illustrate principles and implementations of the present application, where the foregoing examples are only for helping to understand the method and core idea of the present application; meanwhile, as those skilled in the art will have variations in the specific embodiments and application scope in light of the ideas of the present application, the present description should not be construed as limiting the present application.

Claims (10)

1. A method for remote inspection of a server, the method being applied to a client that communicates with a baseboard management controller in a target server based on an IPMI protocol, the method comprising:
acquiring the data processing load capacity of the baseboard management controller;
acquiring a historical operation state parameter of the target server during the latest historical inspection;
based on the historical running state parameters, determining the historical inspection score of the target server in the last historical inspection;
determining a target patrol period of the target server based on the data processing load capacity and the historical patrol score;
and sending a remote inspection instruction to the baseboard management controller according to the target inspection period so that the baseboard management controller performs inspection processing on the target server.
2. The server remote inspection method according to claim 1, wherein the obtaining the data processing load of the baseboard management controller comprises:
acquiring the type of the controller to which the baseboard management controller belongs;
determining a maximum data throughput associated with the controller type;
acquiring historical data transmission quantity between the client and the baseboard management controller within a preset historical time length;
and determining the data processing load capacity of the baseboard management controller based on the historical data transmission capacity and the maximum data processing capacity.
3. The server remote inspection method according to claim 2, wherein the determining the data processing load amount of the baseboard management controller based on the historical data transmission amount and the maximum data processing amount includes:
acquiring a temperature value of the baseboard management controller by using a temperature sensor arranged on the baseboard management controller;
determining a ratio of the historical data transfer amount to the maximum data processing amount;
and correcting the ratio based on the temperature value to obtain the data processing load capacity of the baseboard management controller.
4. The server remote inspection method according to claim 3, wherein the correcting the ratio based on the temperature value to obtain the data processing load of the baseboard management controller includes:
acquiring a preset normal temperature range of the baseboard management controller;
and when the temperature value is smaller than or equal to the maximum value in the preset normal temperature range, not correcting the ratio, and taking the ratio as the data processing load of the baseboard management controller.
5. The server remote patrol method according to claim 1, wherein the determining a historical patrol score of the target server at the time of the latest historical patrol based on the historical operating state parameter comprises:
determining the type of the server to which the target server belongs;
acquiring expected running state parameters associated with the server types;
determining an absolute value of a difference between the historical operating state parameter and the expected operating state parameter;
and determining the historical inspection score of the target server in the last historical inspection based on the absolute value of the difference value.
6. The server remote patrol method according to claim 5, wherein the determining a historical patrol score of the target server at the time of the latest historical patrol based on the absolute value of the difference value comprises:
acquiring service functions provided by the target server;
determining an importance coefficient of the service function;
determining an operation state parameter score corresponding to the absolute value of the difference value;
and taking the product of the operation state parameter score and the importance coefficient as a historical inspection score of the target server during the latest historical inspection.
7. The server remote patrol method according to claim 1, wherein the determining a target patrol cycle of the target server based on the data processing load amount, the historical patrol score comprises:
acquiring a preset inspection period of the target server;
when the data processing load capacity is larger than a first preset threshold value, a first preset amplitude is obtained;
when the historical inspection score is larger than a second preset threshold value, a second preset amplitude is obtained;
determining a larger value of the first preset amplitude and the second preset amplitude;
and increasing the preset inspection period according to the larger value of the first preset amplitude and the second preset amplitude to obtain the target inspection period of the target server.
8. The server remote inspection method according to claim 7, wherein the obtaining the preset inspection cycle of the target server includes:
acquiring historical access quantity of the target server in a preset historical time;
and determining the preset inspection period corresponding to the historical access quantity, wherein the preset inspection period is inversely related to the historical access quantity.
9. A computer device, the computer device comprising:
one or more processors;
a memory; and
one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the processor to implement the steps in the server remote patrol method of any one of claims 1 to 8.
10. A computer readable storage medium, having stored thereon a computer program, the computer program being loaded by a processor to perform the steps of the server remote patrol method of any one of claims 1 to 8.
CN202311296503.3A 2023-10-09 2023-10-09 Remote inspection method, equipment and medium for server Active CN117033144B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311296503.3A CN117033144B (en) 2023-10-09 2023-10-09 Remote inspection method, equipment and medium for server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311296503.3A CN117033144B (en) 2023-10-09 2023-10-09 Remote inspection method, equipment and medium for server

Publications (2)

Publication Number Publication Date
CN117033144A true CN117033144A (en) 2023-11-10
CN117033144B CN117033144B (en) 2024-01-19

Family

ID=88641613

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311296503.3A Active CN117033144B (en) 2023-10-09 2023-10-09 Remote inspection method, equipment and medium for server

Country Status (1)

Country Link
CN (1) CN117033144B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117670313A (en) * 2024-02-01 2024-03-08 西安咸林能源科技有限公司 Power plant inspection method, power plant inspection system and storable medium

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100306357A1 (en) * 2009-05-27 2010-12-02 Aten International Co., Ltd. Server, computer system, and method for monitoring computer system
TW201715331A (en) * 2015-10-16 2017-05-01 神雲科技股份有限公司 Server and method for auto repairing a baseboard management controller
US20170126795A1 (en) * 2015-10-29 2017-05-04 Capital One Services, Llc Automated server workload management using machine learning
CN111193904A (en) * 2019-12-17 2020-05-22 视联动力信息技术股份有限公司 Inspection control method and device and readable storage medium
CN112738238A (en) * 2020-12-29 2021-04-30 北京天融信网络安全技术有限公司 Method, device and system for health check in load balancing
CN112988433A (en) * 2019-12-12 2021-06-18 伊姆西Ip控股有限责任公司 Method, apparatus and computer program product for fault management
CN114595848A (en) * 2022-04-29 2022-06-07 武汉四通信息服务有限公司 Equipment supervision method and device
CN116016276A (en) * 2022-12-02 2023-04-25 京东科技信息技术有限公司 Method and device for detecting health state of server
CN116405408A (en) * 2023-04-28 2023-07-07 曙光信息产业股份有限公司 Automatic equipment inspection method, device, equipment and storage medium
CN116643908A (en) * 2023-07-19 2023-08-25 深圳市同泰怡信息技术有限公司 Automatic fault alarm method based on Feiteng multipath server

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100306357A1 (en) * 2009-05-27 2010-12-02 Aten International Co., Ltd. Server, computer system, and method for monitoring computer system
TW201715331A (en) * 2015-10-16 2017-05-01 神雲科技股份有限公司 Server and method for auto repairing a baseboard management controller
US20170126795A1 (en) * 2015-10-29 2017-05-04 Capital One Services, Llc Automated server workload management using machine learning
CN112988433A (en) * 2019-12-12 2021-06-18 伊姆西Ip控股有限责任公司 Method, apparatus and computer program product for fault management
CN111193904A (en) * 2019-12-17 2020-05-22 视联动力信息技术股份有限公司 Inspection control method and device and readable storage medium
CN112738238A (en) * 2020-12-29 2021-04-30 北京天融信网络安全技术有限公司 Method, device and system for health check in load balancing
CN114595848A (en) * 2022-04-29 2022-06-07 武汉四通信息服务有限公司 Equipment supervision method and device
CN116016276A (en) * 2022-12-02 2023-04-25 京东科技信息技术有限公司 Method and device for detecting health state of server
CN116405408A (en) * 2023-04-28 2023-07-07 曙光信息产业股份有限公司 Automatic equipment inspection method, device, equipment and storage medium
CN116643908A (en) * 2023-07-19 2023-08-25 深圳市同泰怡信息技术有限公司 Automatic fault alarm method based on Feiteng multipath server

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117670313A (en) * 2024-02-01 2024-03-08 西安咸林能源科技有限公司 Power plant inspection method, power plant inspection system and storable medium
CN117670313B (en) * 2024-02-01 2024-04-30 西安咸林能源科技有限公司 Power plant inspection method, power plant inspection system and storable medium

Also Published As

Publication number Publication date
CN117033144B (en) 2024-01-19

Similar Documents

Publication Publication Date Title
CN117033144B (en) Remote inspection method, equipment and medium for server
US10877533B2 (en) Energy efficient workload placement management using predetermined server efficiency data
US8677160B2 (en) Managing power consumption of a computer
CN102027430A (en) Managing power consumption of a computer
CN113590403B (en) Pressure testing method, device, system, electronic equipment, storage medium and product
US20110029798A1 (en) Power Consumption Monitor and Method Therefor
WO2021073130A1 (en) Frequency modulation method and apparatus for processor, and computing device
US9772920B2 (en) Dynamic service fault detection and recovery using peer services
CN110618889A (en) Service availability detection method and device, computer equipment and storage medium
JP2019102078A (en) System power source management method and computer system
US8850234B2 (en) Power management apparatus and method
WO2023178923A1 (en) Intelligent monitoring micro-adjustment method and apparatus, device, and storage medium
US20170324636A1 (en) Method of monitoring a server rack system, and the server rack system
CN115727487A (en) Air conditioner compressor frequency control method and device, air conditioner and storage medium
CN115190044B (en) Device connection state checking method, device and storage medium
CN115378841B (en) Method and device for detecting state of equipment accessing cloud platform, storage medium and terminal
CN116361703A (en) Energy-saving control method and device for data center, electronic equipment and readable medium
CN115643163A (en) Fault equipment positioning method, device, equipment and storage medium
TW201710895A (en) Method for monitoring server racks
CN115080412A (en) Software update quality evaluation method, device, equipment and computer storage medium
CN114326999A (en) Thermal imaging based heat dissipation regulation and control method, system, equipment and storage medium
CN114126168A (en) Lamp effect control method and device, intelligent equipment and storage medium
CN116594739B (en) Control method of virtual machine, computer equipment and storage medium
CN112181708B (en) Method and device for controlling load based on frequency converter error code rate
CN116991636B (en) Data incremental backup method, system and storage medium based on distributed storage

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant