CN114153689A - BMC, device monitoring method and device thereof and readable storage medium - Google Patents

BMC, device monitoring method and device thereof and readable storage medium Download PDF

Info

Publication number
CN114153689A
CN114153689A CN202111376534.0A CN202111376534A CN114153689A CN 114153689 A CN114153689 A CN 114153689A CN 202111376534 A CN202111376534 A CN 202111376534A CN 114153689 A CN114153689 A CN 114153689A
Authority
CN
China
Prior art keywords
monitoring
event
overrun
bmc
monitoring device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111376534.0A
Other languages
Chinese (zh)
Other versions
CN114153689B (en
Inventor
王鹏
朱英澍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202111376534.0A priority Critical patent/CN114153689B/en
Publication of CN114153689A publication Critical patent/CN114153689A/en
Application granted granted Critical
Publication of CN114153689B publication Critical patent/CN114153689B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a device monitoring method of BMC, which comprises the following steps: acquiring an overrun signal sent by a monitoring device; reading monitoring data in the monitoring device according to the overrun signal; determining an overrun event of the monitoring device according to the monitoring data and a monitoring threshold value of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event; according to the invention, by acquiring the overrun signal sent by the monitoring device and utilizing the advantages of hardware resources possessed by the monitoring device, the work of capturing the overrun event is realized by the monitoring device at the bottommost layer, so that the asynchronous IO replaces the conventional polling scheme, the monitoring device is accurately monitored in real time, the dispatching pressure of the BMC is reduced, and the response speed of the system process is ensured; the invention also discloses a device monitoring device of the BMC, the BMC and a computer readable storage medium, and the device monitoring device of the BMC has the beneficial effects.

Description

BMC, device monitoring method and device thereof and readable storage medium
Technical Field
The invention relates to the technical field of monitoring, in particular to a device monitoring method and device for BMC, the BMC and a computer readable storage medium.
Background
The BMC (Baseboard management Controller) is an essential functional component in network infrastructure such as a server and a switch, and has a main function of monitoring health information such as temperature, voltage, current, power consumption, and fan rotation speed in a hardware environment in real time. The information such as temperature, Voltage, current, power consumption, and fan speed is collected from different types of monitoring devices, such as a temp sensor, an ADC (analog to digital converter), and a VR (Voltage Regulator).
In the prior art, a monitoring scheme of the BMC for monitoring devices adopts a polling mode (such as single-thread polling or multi-thread polling), periodically polls each monitoring device in a system, reads real-time data, compares the real-time data with preset upper and lower limit values, and sends an alarm signal and performs log dump if the real-time data exceeds the preset upper and lower limit values. However, the polling method consumes a large amount of clocks, which causes the response rate of other service processes to be reduced, and affects the user experience; and single-thread polling can cause the situation that a single device exceeds the limit value cannot be captured in time, and multi-thread polling can cause the problem of high memory occupancy rate. Therefore, how to accurately monitor the monitoring devices in real time, reduce the scheduling pressure of the BMC, and ensure the response speed of the system process is a problem that needs to be solved urgently nowadays.
Disclosure of Invention
The invention aims to provide a device monitoring method and device of BMC, the BMC and a computer readable storage medium, so as to accurately monitor a monitoring device in real time, reduce the scheduling pressure of the BMC and ensure the response speed of a system process.
In order to solve the above technical problem, the present invention provides a device monitoring method for BMC, including:
acquiring an overrun signal sent by a monitoring device;
reading monitoring data in the monitoring device according to the overrun signal;
determining an overrun event of the monitoring device according to the monitoring data and a monitoring threshold value of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event.
Optionally, the acquiring the overrun signal sent by the monitoring device includes:
acquiring an overrun signal sent by an alarm pin of the monitoring device; the overrun signal is an interrupt signal output by the alarm pin after the monitoring device detects that the monitoring data exceeds the monitoring threshold.
Optionally, the reading the monitoring data in the monitoring device according to the overrun signal includes:
and determining the overrun event according to the monitoring data and the monitoring threshold value by using the monitoring thread corresponding to the monitoring device.
Optionally, the reading the monitoring data in the monitoring device according to the overrun signal includes:
sending a wake-up signal to the monitoring thread according to the acquired overrun signal by using a drive corresponding to the monitoring device, and waking up the dormant monitoring thread;
and reading the monitoring data in the monitoring device by utilizing the monitoring thread.
Optionally, after determining the overrun event of the monitoring device according to the monitoring data and the monitoring threshold of the monitoring device, the method further includes:
and controlling a controlled device corresponding to the event type to perform corresponding adjustment action according to the event type of the overrun event.
Optionally, when the event type of the over-limit event is an over-temperature event, the controlling, according to the event type of the over-limit event, the controlled device corresponding to the event type to perform a corresponding adjustment action includes:
and adjusting the rotating speed of the fan corresponding to the over-temperature event to be a preset over-temperature rotating speed, and lightening an alarm indicator lamp corresponding to the over-temperature event.
Optionally, the controlling, according to the event type of the overrun event, the controlled device corresponding to the event type to perform a corresponding adjustment action includes:
and controlling a controlled device corresponding to the event type to perform corresponding adjustment action by utilizing the monitoring main thread according to the event type of the overrun event.
The invention also provides a device monitoring device of BMC, comprising:
the acquisition module is used for acquiring the overrun signal sent by the monitoring device;
the reading module is used for reading the monitoring data in the monitoring device according to the overrun signal;
the determining module is used for determining an overrun event of the monitoring device according to the monitoring data and the monitoring threshold value of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event.
The invention also provides a BMC comprising:
a memory for storing a computer program;
a processor for implementing the steps of the BMC device monitoring method as described above when executing the computer program.
In addition, the present invention further provides a computer readable storage medium, wherein a computer program is stored on the computer readable storage medium, and when being executed by a processor, the computer program implements the steps of the device monitoring method for BMC as described above.
The invention provides a device monitoring method for BMC, which comprises the following steps: acquiring an overrun signal sent by a monitoring device; reading monitoring data in the monitoring device according to the overrun signal; determining an overrun event of the monitoring device according to the monitoring data and a monitoring threshold value of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event;
therefore, the invention realizes the capture of the events exceeding the limit value by the monitoring device at the bottom layer by acquiring the overrun signal sent by the monitoring device and utilizing the advantages of hardware resources of the monitoring device, thereby replacing the conventional polling scheme with asynchronous IO (input/output), carrying out real-time and accurate monitoring on the monitoring device, reducing the scheduling pressure of BMC (baseboard management controller) and ensuring the response speed of the system process. In addition, the invention also provides a device monitoring device of BMC, BMC and a computer readable storage medium, and the device monitoring device, the BMC and the computer readable storage medium also have the beneficial effects.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.
Fig. 1 is a flowchart of a device monitoring method for BMC according to an embodiment of the present invention;
fig. 2 is an interaction diagram of another device monitoring method for BMC according to an embodiment of the present invention;
fig. 3 is a block diagram of a device monitoring apparatus for BMC according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, fig. 1 is a flowchart illustrating a device monitoring method for BMC according to an embodiment of the present invention. The method can comprise the following steps:
step 101: and acquiring an overrun signal sent by the monitoring device.
The monitoring devices in this step may be devices capable of monitoring, such as a temperature sensor, an analog-to-digital converter (ADC), a Voltage Regulator (VR), and the like.
Specifically, the overrun signal in this step may be a signal for prompting that the monitored data of the monitoring class device exceeds the monitoring threshold, that is, a signal output by the monitoring class device to the BMC after detecting that the monitored data of the monitoring class device exceeds the monitoring threshold, for example, an interrupt signal (e.g., a low level signal) of an alarm pin (alert # pin). In this embodiment, a specific signal type of the overrun signal sent by the monitoring device is not limited, for example, the overrun signal may be an interrupt signal output by a preset pin (e.g., an alarm pin) of the monitoring device, that is, the interrupt signal output by the preset pin after the monitoring device detects that the monitoring data exceeds the monitoring threshold; that is, in this step, the BMC may obtain the overrun signal sent by the alarm pin of the monitoring device; the overrun signal is an interrupt signal output by an alarm pin after the monitoring device detects that the monitoring data exceeds a monitoring threshold. For example, in view of the initial design of the SMBus (two-wire serial bus) protocol, namely Critical parameters positioned as a monitoring system, when the monitoring device and the BMC communicate by using the SMBus protocol, the SMBus protocol adds an alert # pin definition on the basis of an I2C (serial communication protocol) protocol, and when a target event is generated inside the device, the alert # pin is pulled down; correspondingly, the registers in the monitoring devices can also comprise threshold registers for storing monitoring thresholds, such as a high limit register and/or a low limit register, besides data registers for storing monitoring data, so that a user can write the upper and lower limits of the monitoring data, such as temperature, voltage and the like, to be monitored into the devices according to the requirement of the user, when the actual monitoring data actually exceed the upper and lower limits, the logic of the devices can capture such target events, further pull down the alert # pin, and output a low-level signal (namely, an overrun signal) to the BMC through the alert # pin. The overrun signal may be other types of signals besides the interrupt signal, and the overrun signal is output to the BMC only when the monitoring device can detect that the monitoring data exceeds the monitoring threshold, which is not limited in this embodiment.
Step 102: and reading the monitoring data in the monitoring device according to the overrun signal.
It can be understood that, in this step, the BMC may read the monitoring data in the monitoring device that sends the overrun signal after receiving the overrun signal.
Specifically, the specific manner in which the BMC reads the monitoring data in the monitoring device according to the overrun signal in this step may be set by a designer according to a practical scene and a user requirement, for example, in this embodiment, the BMC may read the monitoring data in the monitoring device according to the overrun signal by using a monitoring thread corresponding to the monitoring device; for example, the BMC may pre-configure the monitoring threads corresponding to the monitoring devices in the monitoring process of the user space, for example, each monitoring thread corresponds to one monitoring device, and for example, the monitoring device of the temperature monitoring class and the monitoring device of the voltage monitoring class may respectively correspond to one different monitoring thread.
Correspondingly, the specific mode of reading the monitoring data in the monitoring device by using the monitoring thread corresponding to the monitoring device according to the overrun signal can be set by a designer, for example, the overrun signal sent by the monitoring device can be directly detected by the monitoring thread corresponding to the monitoring device, so that the monitoring data in the monitoring device can be read after the overrun signal is detected. The BMC may also detect the overrun signal sent by the monitoring device by using the driver corresponding to the monitoring device, so as to control the monitoring thread corresponding to the monitoring device and monitor the monitoring thread corresponding to the monitoring device through a corresponding signal (e.g., a wake-up signal) after the driver detects the overrun signal. That is, in this step, the BMC may send a wake-up signal to the monitoring thread corresponding to the monitoring device according to the obtained overrun signal by using the driver corresponding to the monitoring device, so as to wake up the dormant monitoring thread; and reading the monitoring data in the monitoring device by using the monitoring thread corresponding to the monitoring device.
Correspondingly, in the embodiment, an interrupt + asynchronous IO drive design may be adopted, in a drive corresponding to a monitoring device, a preset pin (e.g., an alarm pin) of the monitoring device is used as an external interrupt source, an interrupt handling routine is registered for the external interrupt source, when real-time monitoring data exceeds a monitoring threshold, the preset pin may be pulled down to trigger an interrupt, so that the interrupt routine is called, interrupt information may be recorded in the interrupt routine, a kill _ fasync () function (an application function in the drive) is called, and a SIGIO signal (a drive signal, e.g., a wake-up signal) is sent to a user space monitoring process, so that an event that the monitoring data exceeds the limit is notified; at this time, the user space is responsible for monitoring that the monitoring thread of the monitoring device is awakened by the signal, and in turn, the monitoring data in the data register of the monitoring device is read and compared with the monitoring threshold value in the threshold value register.
As shown in fig. 2, in a driver corresponding to a monitoring device (i.e., a monitoring-type device), in addition to an interrupt context that captures an interrupt of an alarm pin (alert # pin) and reports a target event to a user space through asynchronous IO, a process context that responds to an asynchronous IO request of a monitoring sub-thread (i.e., a monitoring thread) of the user space may be set, so that the monitoring sub-thread may read real-time monitoring data through the driver, for example, a fasync _ helper () function (a kernel function) of a kernel is used to implement a fasync () function (an asynchronous notification function) in a file _ operations (a data structure) function set, so as to receive the asynchronous IO request of the user space.
Step 103: determining an overrun event of the monitoring device according to the monitoring data and a monitoring threshold of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event.
It is understood that the monitoring threshold of the monitoring device in this step may be an upper limit value and/or a lower limit value of the monitoring data monitored by the monitoring device, which are set in advance, so as to determine the specific type of the overrun event through the comparison between the monitoring data and the monitoring threshold, such as an overrun event higher than the upper limit value and an overrun event lower than the lower limit value.
Specifically, the embodiment does not limit the specific content and the obtaining manner of the monitoring threshold of the monitoring device, for example, the BMC may read the monitoring threshold stored in the threshold register from the monitoring device, and for example, the BMC may read the monitoring threshold of the monitoring data in the monitoring device according to the overrun signal by using the monitoring thread corresponding to the monitoring device; and determining an overrun event according to the monitoring data and the monitoring threshold. The BMC stores the monitoring threshold of the monitoring device by itself, for example, the monitoring threshold of the monitoring device may be different from the upper limit value and/or the lower limit value in the threshold register of the monitoring device.
Further, after the step, the BMC may further control the controlled device corresponding to the event type to perform a corresponding adjustment action according to the event type of the overrun event. For example, a monitoring process of a user space in the BMC may adopt a frame of one main thread (i.e., a monitoring main thread) + a plurality of sub-threads (i.e., monitoring threads), and the BMC may control a controlled device corresponding to an event type to perform a corresponding adjustment action according to the event type of an overrun event by using the monitoring main thread; the BMC determines an overrun event of the monitoring device by using a monitoring thread corresponding to the monitoring device, and then notifies the monitoring main thread, so that the monitoring main thread can control a controlled device corresponding to the event type to perform corresponding adjustment action according to the event type of the overrun event.
Specifically, in this embodiment, each sub-thread (i.e., monitoring thread) may be configured to manage a class of monitoring devices, and is responsible for capturing device overrun events and collecting real-time data (i.e., monitoring data), and the specific work includes: packaging read-write functions for accessing real-time data and upper and lower line limit data (namely monitoring threshold values), wherein a programming node called in the functions is an equipment node which is driven and thrown to a user space, the read-write functions cannot be actively called by code main line logic and are only called in an SIGIO signal callback function realized by a sub-thread; and realizing and registering a SIGIO signal callback function, wherein the callback function is registered by a signal () function (a signal processing function) and is called when the SIGIO signal is sent by driving, a read-write function is called inside the function to acquire real-time data and upper limit data, a struct hwmon _ event structure is updated, then logical comparison is carried out, and whether a main thread needs to be awakened or not is judged.
Correspondingly, the main thread (i.e. the monitoring main thread) can be responsible for uniformly managing various monitoring events, a struct hwmon _ event structure body for describing specific target events is abstracted and delivered to each sub-thread for instantiation, and the structure body contains specific target event types, such as over-temperature, under-voltage and the like, and can also contain the generation times of the target events, the historical data of the last three times and the like; when the monitoring main thread is awakened by a certain monitoring sub-thread, the monitoring main thread can acquire an hmmon _ event instance updated by the sub-thread, the target event type and related data in the hmmon _ event instance are taken out, the target event type and the related data are substituted into a data reliability algorithm, the severity of the event is judged, corresponding follow-up actions are performed, for example, when the event type of the overrun event is the over-temperature event, the BMC can adjust the rotating speed of a fan corresponding to the over-temperature event (namely, the fan corresponding to a detection device which detects the over-temperature event) to be a preset over-temperature rotating speed by using the main thread, and an alarm indicator lamp corresponding to the over-temperature event is lightened; and may also log the over-temperature event to the system.
Specifically, as shown in fig. 2, after each monitoring thread (i.e., sub-thread) is created in this embodiment, an fcntl () function (a function in a computer) may be called first to initiate an asynchronous IO request, and the function may finally call a fasync () function implemented by a driver to complete asynchronous IO setting; after the asynchronous IO request is initiated, the whole user space monitoring process is initialized and is in a ready state.
When no monitoring device generates an event exceeding the limit value, the monitoring thread can be always in a dormant state (the monitoring main thread is also dormant on a specified condition variable after the initialization of the related software resource and the creation of each monitoring thread), and the processing resource of the BMC can be dispatched to other service processes for use, so that the dispatching pressure of the BMC is reduced, and the response speed of the system process is improved; when a monitoring device generates an event exceeding the limit value, the hardware logic of the monitoring device can sense the event and pull down the alert # pin, trigger the corresponding drive in the BMC to enter an interrupt context, and further send a SIGIO signal to the corresponding monitoring thread to wake up the monitoring thread.
The awakened monitoring thread can read the monitoring data and the upper and lower limit data (namely the monitoring threshold) of the monitoring device, and after comparison, if the monitoring data is found to exceed the preset monitoring threshold, the monitoring main thread in dormancy is awakened through a condition variable, and after the monitoring main thread is awakened, follow-up work, such as alarming and corresponding adjustment action, can be completed according to a specific reporting event type.
It should be noted that the method provided by this embodiment obtains a good effect in multiple projects based on OpenBMC (an open source BMC scheme), and can adopt yocoto (an open source community) management project engineering to manage the monitoring process of the user space by system (init software under a linux system); the commands to start and stop the monitoring process may be # sudo system start hwmon service and # sudo system stop hwmon service, respectively.
In this embodiment, the overrun signal sent by the monitoring device is acquired, and the monitoring device itself has the advantage of hardware resources, so that the monitoring device itself at the bottommost layer can capture the overrun event, and therefore asynchronous IO is used to replace the existing polling scheme, real-time and accurate monitoring is performed on the monitoring device, the scheduling pressure of the BMC is reduced, and the response speed of the system process is ensured.
Corresponding to the above method embodiment, an embodiment of the present invention further provides a device monitoring apparatus for BMC, and the device monitoring apparatus for BMC described below and the device monitoring method for BMC described above may be referred to correspondingly.
Referring to fig. 3, fig. 3 is a block diagram of a device monitoring apparatus for BMC according to an embodiment of the present invention. The apparatus may include:
the acquisition module 10 is used for acquiring an overrun signal sent by a monitoring device;
the reading module 20 is used for reading the monitoring data in the monitoring device according to the overrun signal;
the determining module 30 is configured to determine an overrun event of the monitoring device according to the monitoring data and a monitoring threshold of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event.
Optionally, the obtaining module 10 may be specifically configured to obtain an overrun signal sent by an alarm pin of the monitoring device; the overrun signal is an interrupt signal output by an alarm pin after the monitoring device detects that the monitoring data exceeds a monitoring threshold.
Optionally, the reading module 20 may be specifically configured to determine the overrun event according to the monitoring data and the monitoring threshold by using a monitoring thread corresponding to the monitoring device.
Alternatively, the reading module 20 may include:
the awakening sub-module is used for sending an awakening signal to the monitoring thread according to the acquired overrun signal by utilizing the corresponding drive of the monitoring device, and awakening the dormant monitoring thread;
and the reading submodule is used for reading the monitoring data in the monitoring device by utilizing the monitoring thread.
Optionally, the apparatus may further include:
and the adjusting module is used for controlling the controlled device corresponding to the event type to perform corresponding adjusting action according to the event type of the overrun event.
Optionally, when the event type of the overrun event is an over-temperature event, the adjusting module may include:
and the fan adjusting submodule is used for adjusting the rotating speed of the fan corresponding to the over-temperature event to be a preset over-temperature rotating speed and lightening an alarm indicator lamp corresponding to the over-temperature event.
Optionally, the adjusting module may be specifically configured to control, by using the monitoring main thread, the controlled device corresponding to the event type to perform a corresponding adjusting action according to the event type of the overrun event.
In this embodiment, the obtaining module 10 obtains the overrun signal sent by the monitoring device, and the monitoring device uses the advantage of hardware resources of the monitoring device to deliver the work of capturing the overrun event to the monitoring device at the bottom layer, so that the asynchronous IO is used to replace the existing polling scheme, the monitoring device is accurately monitored in real time, the scheduling pressure of the BMC is reduced, and the response speed of the system process is ensured.
Corresponding to the above method embodiment, an embodiment of the present invention further provides a BMC, and the following BMC and the above-described device monitoring method of the BMC may be referred to correspondingly.
The embodiment of the invention provides a BMC, which comprises:
a memory for storing a computer program;
a processor for implementing the steps of the device monitoring method for BMC as provided in the above embodiments when executing the computer program.
Corresponding to the above method embodiment, an embodiment of the present invention further provides a computer-readable storage medium, and a computer-readable storage medium described below and a device monitoring method for BMC described above may be referred to correspondingly.
An embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the device monitoring method for BMC provided in the foregoing method embodiment are implemented.
The embodiments are described in a progressive manner in the specification, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device, the BMC, and the computer-readable storage medium disclosed in the embodiments correspond to the method disclosed in the embodiments, so that the description is simple, and the relevant points can be referred to the description of the method.
The device monitoring method and device for BMC, BMC and computer readable storage medium provided by the present invention are described in detail above. The principles and embodiments of the present invention are explained herein using specific examples, which are presented only to assist in understanding the method and its core concepts. It should be noted that, for those skilled in the art, it is possible to make various improvements and modifications to the present invention without departing from the principle of the present invention, and those improvements and modifications also fall within the scope of the claims of the present invention.

Claims (10)

1. A device monitoring method for BMC is characterized by comprising the following steps:
acquiring an overrun signal sent by a monitoring device;
reading monitoring data in the monitoring device according to the overrun signal;
determining an overrun event of the monitoring device according to the monitoring data and a monitoring threshold value of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event.
2. The device monitoring method for the BMC of claim 1, wherein the acquiring the over-limit signal sent by the monitoring class device comprises:
acquiring an overrun signal sent by an alarm pin of the monitoring device; the overrun signal is an interrupt signal output by the alarm pin after the monitoring device detects that the monitoring data exceeds the monitoring threshold.
3. The device monitoring method for the BMC of claim 1, wherein the reading the monitoring data in the monitoring class device according to the over-limit signal comprises:
and determining the overrun event according to the monitoring data and the monitoring threshold value by using the monitoring thread corresponding to the monitoring device.
4. The device monitoring method for the BMC of claim 3, wherein the reading the monitoring data in the monitoring class device according to the over-limit signal comprises:
sending a wake-up signal to the monitoring thread according to the acquired overrun signal by using a drive corresponding to the monitoring device, and waking up the dormant monitoring thread;
and reading the monitoring data in the monitoring device by utilizing the monitoring thread.
5. The device monitoring method for the BMC of any one of claims 1 to 4, wherein after determining the overrun event of the monitoring class device according to the monitoring data and the monitoring threshold of the monitoring class device, further comprising:
and controlling a controlled device corresponding to the event type to perform corresponding adjustment action according to the event type of the overrun event.
6. The device monitoring method for the BMC of claim 5, wherein when the event type of the over-limit event is an over-temperature event, the controlling the controlled device corresponding to the event type to perform the corresponding adjustment action according to the event type of the over-limit event includes:
and adjusting the rotating speed of the fan corresponding to the over-temperature event to be a preset over-temperature rotating speed, and lightening an alarm indicator lamp corresponding to the over-temperature event.
7. The device monitoring method for the BMC of claim 5, wherein the controlling, according to the event type of the over-limit event, the controlled device corresponding to the event type to perform a corresponding adjustment action includes:
and controlling a controlled device corresponding to the event type to perform corresponding adjustment action by utilizing the monitoring main thread according to the event type of the overrun event.
8. A device monitoring apparatus for BMC, comprising:
the acquisition module is used for acquiring the overrun signal sent by the monitoring device;
the reading module is used for reading the monitoring data in the monitoring device according to the overrun signal;
the determining module is used for determining an overrun event of the monitoring device according to the monitoring data and the monitoring threshold value of the monitoring device; wherein the overrun event comprises an ultra-high event and/or an ultra-low event.
9. A BMC, comprising:
a memory for storing a computer program;
a processor for implementing the steps of the device monitoring method for BMC according to any of claims 1 to 7 when executing said computer program.
10. A computer-readable storage medium, characterized in that a computer program is stored thereon, which, when being executed by a processor, carries out the steps of the device monitoring method for BMC according to any of claims 1 to 7.
CN202111376534.0A 2021-11-19 2021-11-19 BMC and device monitoring method and device thereof and readable storage medium Active CN114153689B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111376534.0A CN114153689B (en) 2021-11-19 2021-11-19 BMC and device monitoring method and device thereof and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111376534.0A CN114153689B (en) 2021-11-19 2021-11-19 BMC and device monitoring method and device thereof and readable storage medium

Publications (2)

Publication Number Publication Date
CN114153689A true CN114153689A (en) 2022-03-08
CN114153689B CN114153689B (en) 2024-01-12

Family

ID=80456739

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111376534.0A Active CN114153689B (en) 2021-11-19 2021-11-19 BMC and device monitoring method and device thereof and readable storage medium

Country Status (1)

Country Link
CN (1) CN114153689B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104603716A (en) * 2012-08-27 2015-05-06 三星电子株式会社 An apparatus and method for waking up a processor
US20180107600A1 (en) * 2016-10-19 2018-04-19 International Business Machines Corporation Response times in asynchronous i/o-based software using thread pairing and co-execution
CN110058979A (en) * 2019-04-18 2019-07-26 苏州浪潮智能科技有限公司 A kind of temperature reads monitoring method, BMC and the storage medium of failure failure
CN113032216A (en) * 2021-03-26 2021-06-25 山东英信计算机技术有限公司 Monitoring method, device, equipment and medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104603716A (en) * 2012-08-27 2015-05-06 三星电子株式会社 An apparatus and method for waking up a processor
US20180107600A1 (en) * 2016-10-19 2018-04-19 International Business Machines Corporation Response times in asynchronous i/o-based software using thread pairing and co-execution
CN110058979A (en) * 2019-04-18 2019-07-26 苏州浪潮智能科技有限公司 A kind of temperature reads monitoring method, BMC and the storage medium of failure failure
CN113032216A (en) * 2021-03-26 2021-06-25 山东英信计算机技术有限公司 Monitoring method, device, equipment and medium

Also Published As

Publication number Publication date
CN114153689B (en) 2024-01-12

Similar Documents

Publication Publication Date Title
CN107851050B (en) Device with watchdog timer and method for operating watchdog timer
CN107278283B (en) Techniques for managing power of an embedded controller during a low power state
CN110618869B (en) Resource management method, device and equipment
KR20200104248A (en) Methods and apparatus to improve performance data collection of a high performance computing application
CN109582536B (en) Application program non-response reporting method and device and computer equipment
WO2021077260A1 (en) Power consumption information processing method, device, electronic apparatus and storage medium
CN114153689B (en) BMC and device monitoring method and device thereof and readable storage medium
CN114257474A (en) Power utilization control method and device for intelligent gateway, computer equipment and storage medium
CN111104289B (en) System and method for checking efficiency of GPU (graphics processing Unit) cluster
CN113360344B (en) Server monitoring method, device, equipment and computer readable storage medium
WO2023279829A1 (en) Message delay detection method and apparatus, electronic device, and storage medium
CN112631872B (en) Exception handling method and device for multi-core system
CN114301857A (en) Water meter communication method and device based on Internet of things
CN212809125U (en) Health management system of computer host
CN114624569A (en) Sensor circuit abnormality detection method, circuit, device, intelligent device, and medium
CN112910732A (en) Method and equipment for resetting edge computing server
CN114416326A (en) Big data control method, device, control system and readable storage medium
CN110691135A (en) Equipment data uploading method and system, Internet of things module and storage medium
CN103885371A (en) Controller, alarm system and alarm method
CN115601893A (en) Doorbell equipment control method and device, storage medium and electronic device
CN110672145B (en) Sensor data acquisition and processing method, device, equipment and storage medium
US20220229755A1 (en) Docking stations health
CN115794550B (en) Process management method, device, vehicle and storage medium
CN117555760B (en) Server monitoring method and device, substrate controller and embedded system
CN114385252A (en) Device control method, device, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant