CN114721859A - Method and device for monitoring MLCC (multilayer ceramic capacitor) short circuit risk and storage medium - Google Patents

Method and device for monitoring MLCC (multilayer ceramic capacitor) short circuit risk and storage medium Download PDF

Info

Publication number
CN114721859A
CN114721859A CN202210467903.5A CN202210467903A CN114721859A CN 114721859 A CN114721859 A CN 114721859A CN 202210467903 A CN202210467903 A CN 202210467903A CN 114721859 A CN114721859 A CN 114721859A
Authority
CN
China
Prior art keywords
mlcc
management controller
mlcc capacitor
risk
capacitor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210467903.5A
Other languages
Chinese (zh)
Other versions
CN114721859B (en
Inventor
王吉宏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202210467903.5A priority Critical patent/CN114721859B/en
Publication of CN114721859A publication Critical patent/CN114721859A/en
Application granted granted Critical
Publication of CN114721859B publication Critical patent/CN114721859B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention relates to a method and a device for monitoring the risk of MLCC (multi-level capacitor cell) short circuit and a storage medium. The invention relates to a baseboard management controller which collects abnormal information in a server and detects whether voltage drop abnormality exists in the abnormal information; if the voltage drop is detected to be abnormal, collecting the current and the voltage of each MLCC capacitor and all elements around the MLCC capacitor, and collecting the temperature measured by a temperature sensor arranged at each MLCC capacitor and elements around the MLCC capacitor; detecting whether a condition exceeding a preset threshold exists or not; if the situation of exceeding a preset threshold exists, the substrate management controller further analyzes according to a preset analysis strategy to determine the MLCC capacitor with the short circuit risk; and performing protection action aiming at the MLCC capacitor with the short circuit risk. According to the power supply circuit, when the MLCC capacitor has short-circuit risk, the power supply circuit where the MLCC capacitor is located is powered down in advance, and short circuit is avoided.

Description

Method and device for monitoring MLCC (multilayer ceramic capacitor) short circuit risk and storage medium
Technical Field
The invention relates to the technical field of power supply circuit monitoring management, in particular to a method and a device for monitoring the risk of a short circuit of an MLCC capacitor and a storage medium.
Background
The normal and stable power supply of the server power supply circuit is the premise of stable and normal work of the server.
MLCC (Multi-layer Ceramic Capacitors) chip type multilayer Ceramic Capacitors are formed by stacking Ceramic dielectric diaphragms with printed inner electrodes in a staggered mode, forming a Ceramic capacitor through one-time high-temperature sintering, and sealing outer electrodes at two ends of the Ceramic capacitor. The power supply circuit is widely applied to the power supply circuit, and when the MLCC capacitor in the power supply circuit is thoroughly short-circuited and damaged, the power supply circuit supplies power abnormally. In the existing power supply design of the server, the realization of short-circuit prevention design protection of a server mainboard mainly depends on a power supply IC to detect the current of a power supply circuit, and when the current exceeds a threshold value, efuse is used for preventing short-circuit protection of the power supply circuit. At present, the short-circuit prevention protection of a server power supply circuit does not extend into the MLCC capacitor level, and the situation that the MLCC capacitor is short-circuited to cause burning loss is often encountered in the actual use process of the server. Therefore, it is necessary to avoid burning loss caused by the short circuit of the MLCC capacitor based on monitoring the short circuit of the MLCC capacitor layer.
Disclosure of Invention
To solve the above technical problem or at least partially solve the above technical problem, the present invention provides a method, an apparatus and a storage medium for monitoring a short circuit risk of an MLCC capacitor.
In a first aspect, the present invention provides a method for monitoring a risk of a short circuit of an MLCC capacitor, including:
the substrate management controller collects abnormal information in the server and detects whether voltage drop abnormity exists in the abnormal information;
if the substrate management controller detects that the voltage drop of the power supply circuit is abnormal in the abnormal information, the substrate management controller collects the current and the voltage of each MLCC capacitor and all elements around the MLCC capacitor, and collects the temperature measured by the temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor;
detecting whether any MLCC capacitor or element current around the MLCC capacitor exceeds a first safety value range indicated by a first threshold value of a corresponding device, whether any MLCC capacitor or element voltage around the MLCC capacitor exceeds a second safety value range indicated by a second threshold value of the corresponding device, and whether a temperature measured by a temperature sensor arranged at any MLCC capacitor or element around the MLCC capacitor is greater than a third threshold value of the corresponding device;
if yes, the baseboard management controller further analyzes relevant electrical parameters of all devices involved in the yes item according to a preset analysis strategy to determine the MLCC capacitor with the short circuit risk;
and performing protection action aiming at the MLCC capacitor with short circuit risk.
Still further, the baseboard management controller further analyzing the related electrical parameters of all related devices in the yes item according to a preset analysis strategy to determine the MLCC capacitance with short circuit risk includes:
the baseboard management controller collects the current of the device and/or the voltage of the related device and/or the temperature measured by a temperature sensor arranged around the related device for multiple times aiming at the related device of the items and analyzes the current and/or the voltage;
for all involved device currents, if the substrate management controller compares that one involved device current collected for many times exceeds a first safety value range indicated by a first threshold value of a corresponding device, and the current of the involved device does not return to the corresponding first safety value range trend according to time sequence, judging that the MLCC capacitor related to the device has a short-circuit risk;
for all the voltages of the involved devices, if the voltage of one involved device collected for multiple times by the board management controller exceeds a second safety value range indicated by a second threshold value of the corresponding device and the voltage of the involved device does not return to the corresponding second safety value range trend according to time sequence, judging that the MLCC capacitor has a short-circuit risk;
and for all the involved device temperatures, if the board management controller compares that the involved device temperatures collected for multiple times are all larger than the third threshold value of the corresponding device, and the involved device has no descending condition according to the time sequence, judging that the MLCC capacitor has short circuit risk.
Further, the baseboard management controller collects the device current, voltage, and temperature measured by the temperature sensor provided at the device a plurality of times at preset time intervals at preset times.
Still further, the performing a protection action for the MLCC capacitor at risk of short circuit includes:
determining a power supply circuit of a circuit where the MLCC capacitor with the short circuit risk is located;
controlling the power supply circuit to be powered down;
and the baseboard management controller generates and sends alarm information of the MLCC capacitor with the short circuit risk.
Furthermore, the nodes powered by the circuits where the MLCC capacitors with the short circuit risks are located in the server are informed, and the nodes store the working data.
Furthermore, when the substrate management controller analyzes that any MLCC capacitor has a short-circuit risk, the substrate management controller records a plurality of current and voltage data of the MLCC capacitor and all elements around the MLCC capacitor related to the analyzed MLCC capacitor having the short-circuit risk and a plurality of temperature data measured by the temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor.
In a second aspect, the present application provides an apparatus for monitoring a risk of a short circuit of a MLCC capacitor, comprising: the system comprises a substrate management controller, at least one first detection unit, at least one second detection unit, at least one temperature sensor, at least one third detection unit and at least one third detection unit, wherein the substrate management controller is connected with the at least one first detection unit, the first detection unit detects the current of the MLCC and the peripheral elements thereof, the second detection unit detects the voltage of the MLCC and the peripheral elements thereof, the substrate management controller is connected with the at least one temperature sensor, the temperature sensors are distributed at the positions of the MLCC and the peripheral elements thereof, the substrate management controller is connected with the at least one third detection unit for detecting abnormal information of a server, and the at least one third detection unit is used for detecting the steep drop of the voltage;
the substrate management controller is connected with at least one power supply circuit to realize power-down control of the power supply circuit;
the baseboard management controller is provided with at least one memory, at least one instruction is stored in the memory, and the baseboard management controller reads and executes the instruction, so that the method for monitoring the MLCC capacitor short-circuit risk is realized.
Furthermore, the baseboard management controller is connected with an alarm configured on the server chassis, and when determining that the MLCC capacitor with short circuit risk exists, the baseboard management controller sends alarm information through the alarm.
Furthermore, the baseboard management controller is connected with the switch through the network port, the switch is connected with the management machine through the network, and when the baseboard management controller determines that the MLCC capacitor with the short circuit risk exists, the alarm information is sent to the management machine.
In a third aspect, the present application provides a storage medium, where the storage medium stores at least one instruction, and reads and executes the instruction to implement the method for monitoring the risk of the MLCC capacitance short circuit.
Compared with the prior art, the technical scheme provided by the embodiment of the invention has the following advantages:
the method utilizes the condition that the server generates the voltage steep drop abnormity before the MLCC capacitor is short-circuited, and monitors whether the voltage steep drop abnormity exists in the collected abnormal information through the substrate management controller; if the voltage drop is abnormal, the baseboard management controller collects the current and the voltage of each MLCC capacitor and all elements around the MLCC capacitor for many times, and detects the temperature measured by the temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor. The baseboard management controller analyzes a plurality of currents and voltages of the MLCC capacitor and all elements around the MLCC capacitor based on a preset analysis strategy, and analyzes a plurality of temperatures measured by temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor so as to determine the MLCC capacitor with short circuit risk. Once the MLCC capacitor with the short circuit risk is located, immediately executing protection action aiming at the MLCC capacitor with the short circuit risk, stopping loss in time and avoiding burning loss of the MLCC capacitor due to short circuit.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention.
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without inventive exercise.
Fig. 1 is a flowchart of a method for monitoring a short circuit risk of an MLCC capacitor according to an embodiment of the present invention;
fig. 2 is a flowchart of a baseboard management controller according to an embodiment of the present invention, further analyzing, according to a preset analysis strategy, all related electrical parameters of the component involved in the yes item to determine an MLCC capacitance at which a short circuit risk exists;
fig. 3 is a flowchart of a protection action performed on an MLCC capacitor with a short circuit risk according to an embodiment of the present invention;
fig. 4 is a schematic diagram of an apparatus for monitoring a risk of a short circuit of a capacitor of an MLCC according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
Example 1
Referring to fig. 1, an embodiment of the present invention provides a method for monitoring a risk of a short circuit of an MLCC capacitor, including:
s10, the bmc collects the abnormal information in the server, specifically, the bmc is a monitoring and managing device of the whole server board. The temperature and voltage detection device is mainly used for detecting the states of temperature, voltage and the like of each part (CPU, memory, hard disk, fan, machine frame and the like) of the server, and simultaneously, the rotating speed of the fan is adjusted in real time according to the conditions of each temperature acquisition point to ensure that the server does not generate over-temperature. The baseboard management controller monitors abnormal information including voltage steep drop abnormity, in the specific implementation process, a CPLD (complex programmable logic device) connected with the baseboard management controller in a communication mode is used for monitoring the voltage steep drop abnormity in a power supply circuit of the server, and the voltage steep drop abnormity is sent to the baseboard management controller.
S20, the bmc detects whether there is a voltage drop abnormality in the abnormality information. The baseboard management controller detects whether the voltage of the power supply circuit drops abnormally or not in all the received server abnormalities. If the board management controller detects that there is a voltage drop abnormality of the power supply circuit in the abnormality information, S30 is executed.
S30, the baseboard management controller collects the current and voltage of each MLCC capacitor and all elements around the MLCC capacitor, and collects the temperature measured by the temperature sensor arranged at each MLCC capacitor and elements around the MLCC capacitor.
S40, detecting whether any MLCC capacitor or its surrounding element current exceeds a first safety value range indicated by a first threshold value, whether any MLCC capacitor or its surrounding element voltage exceeds a second safety value range indicated by a second threshold value, and whether the temperature measured by a temperature sensor arranged at any MLCC capacitor or its surrounding element is greater than a third threshold value of a corresponding device. If yes, S50 is executed.
S50, the baseboard management controller further analyzes the related electrical parameters of all related devices in the yes item according to a preset analysis strategy to determine the MLCC capacitance with short circuit risk. In the specific implementation process, referring to fig. 2, the step S50 includes:
and S51, the baseboard management controller collects the current of the device or/and the voltage of the related device or/and the temperature measured by a temperature sensor arranged around the related device for multiple times of the device involved in the item of yes and analyzes the current or/and the voltage of the related device. Specifically, the baseboard management controller collects the device current, the device voltage and the temperature measured by the temperature sensor arranged at the device for a plurality of times at preset time intervals according to preset times.
S52, for all the involved device currents, the baseboard management controller compares whether the involved device currents collected for multiple times exceed a first safety value range indicated by a first threshold value of the corresponding device, and the situation that the currents of the involved devices do not return to the corresponding first safety value range trend according to time sequence exists, otherwise, S53 is executed, and S55 is executed if the currents of the involved devices do not return to the corresponding first safety value range trend.
S53, for all the involved device voltages, the board management controller compares whether the involved device voltages collected for multiple times exceed a second safe value range indicated by a second threshold value of the corresponding device, and the involved device voltages do not return to the corresponding second safe value range trend according to time sequence, otherwise, S54 is executed, and S55 is executed if the involved device voltages do not return to the corresponding second safe value range trend.
And S54, for all the involved device temperatures, the board management controller compares whether the involved device temperatures collected for multiple times are all larger than the third threshold of the corresponding device, and the involved device has no descending condition according to the time sequence, if so, S55 is executed, otherwise, S56 is executed.
And S55, judging that the MLCC capacitor related to the device has short circuit risk, namely judging that the device has short circuit risk when the device is the MLCC capacitor, and judging that the MLCC capacitor related to the device has short circuit risk when the device is a peripheral element of the MLCC capacitor.
S56, the MLCC capacitor has no short circuit risk.
And S60, executing protection action aiming at the MLCC capacitor with the short circuit risk.
In a specific implementation process, referring to fig. 3, the performing a protection action for the MLCC capacitor with the short circuit risk includes:
and S61, determining a power supply circuit of the circuit where the MLCC capacitor with the short circuit risk is located.
S62, the baseboard management controller generates and sends alarm information of the MLCC capacitance with the short circuit risk. Specifically, when analyzing that any MLCC capacitor has a short-circuit risk, the substrate management controller records and analyzes a plurality of current and voltage data of the MLCC capacitor and all elements around the MLCC capacitor related to the short-circuit risk of the MLCC capacitor and a plurality of temperature data measured by temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor. The alarm information comprises a data link or a storage address related to the MLCC capacitor with the short circuit risk, and an analyst acquires data through the data link or the storage data.
And S63, controlling the power supply circuit to power down. In a specific implementation process, a preferable scheme is that before power failure is implemented, a node powered by a circuit where an MLCC capacitor with a short circuit risk is located in a server is notified, and the node stores working data.
Example 2
Referring to fig. 4, an embodiment of the invention provides an apparatus for monitoring a short-circuit risk of an MLCC capacitor, including: the substrate management controller is connected with at least one first detection unit, the first detection unit has the function of current detection, and the first detection unit is used for detecting the MLCC capacitor and the current of the elements around the MLCC capacitor.
The substrate management controller is connected with at least one second detection unit, the second detection unit has the function of voltage detection, and the second detection unit detects the voltage of the MLCC capacitor and the voltage of the peripheral elements of the MLCC capacitor.
The baseboard management controller is connected with at least one temperature sensor, the temperature sensor is distributed at each MLCC capacitor and the surrounding elements, and the temperature sensor is used for measuring the temperature of each MLCC capacitor and the surrounding elements.
The substrate management controller is connected with at least one third detection unit for detecting abnormal information of the server, and the at least one third detection unit is used for detecting the steep drop of the voltage; the third detection unit for detecting the steep drop of the voltage in the specific implementation process comprises a plurality of voltage detection circuits connected with the power supply circuit and CPLDs connected with the voltage detection circuits, and the CPLDs are in communication connection with the substrate management controller.
The substrate management controller is connected with at least one power supply circuit to realize power-down control of the power supply circuit.
The baseboard management controller is provided with at least one memory, at least one instruction is stored in the memory, and the baseboard management controller reads and executes the instruction, so that the method for monitoring the MLCC capacitor short-circuit risk is realized.
The baseboard management controller is connected with the switch through the network port, the switch is connected with the management machine through the network, and when the baseboard management controller determines that the MLCC capacitor with the short circuit risk exists, the alarm information is sent to the management machine. The baseboard management controller reports the information of the MLCC capacitor short circuit risk to the management machine in time through any one or more of SNMP protocol, SMTP protocol or Redfish protocol, so that operation and maintenance personnel can process the information in time and the service is guaranteed to be lossless.
In a specific implementation process, the baseboard management controller is connected with an alarm configured on a server case, and when determining that the MLCC capacitor with short circuit risk exists, the baseboard management controller sends alarm information through the alarm. Specifically, a feasible alarm adopts the LED warning light, and when the substrate management controller determines that the short circuit risk exists, the substrate management controller controls the warning light to emit red light.
Example 3
The embodiment of the invention provides a storage medium, wherein the storage medium stores at least one instruction, reads and executes the instruction, and realizes the method for monitoring the risk of the MLCC short circuit.
The method utilizes the condition that the server generates the voltage steep drop abnormity before the MLCC capacitor is short-circuited, and monitors whether the voltage steep drop abnormity exists in the collected abnormal information through the substrate management controller; if the voltage drop is abnormal, the baseboard management controller collects the current and the voltage of each MLCC capacitor and all elements around the MLCC capacitor for many times, and detects the temperature measured by the temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor. The baseboard management controller analyzes a plurality of currents and voltages of the MLCC capacitor and all elements around the MLCC capacitor based on a preset analysis strategy, and analyzes a plurality of temperatures measured by temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor so as to determine the MLCC capacitor with short circuit risk. Once the MLCC capacitor with the short circuit risk is located, immediately executing protection action aiming at the MLCC capacitor with the short circuit risk, stopping loss in time and avoiding burning loss of the MLCC capacitor due to short circuit.
In the embodiments provided in the present invention, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit may be implemented in the form of hardware, or may also be implemented in the form of a software functional unit.
The foregoing are merely exemplary embodiments of the present invention, which enable those skilled in the art to understand or practice the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A method of monitoring MLCC capacitive short circuit risk, comprising:
the substrate management controller collects abnormal information in the server and detects whether voltage drop abnormity exists in the abnormal information;
if the substrate management controller detects that the voltage drop of the power supply circuit is abnormal in the abnormal information, the substrate management controller collects the current and the voltage of each MLCC capacitor and all elements around the MLCC capacitor, and collects the temperature measured by the temperature sensors arranged at the MLCC capacitor and the elements around the MLCC capacitor;
detecting whether any MLCC capacitor or element current around the MLCC capacitor exceeds a first safety value range indicated by a first threshold value of a corresponding device, whether any MLCC capacitor or element voltage around the MLCC capacitor exceeds a second safety value range indicated by a second threshold value of the corresponding device, and whether a temperature measured by a temperature sensor arranged at any MLCC capacitor or element around the MLCC capacitor is greater than a third threshold value of the corresponding device;
if yes, the baseboard management controller further analyzes related electrical parameters related to the device in all yes items according to a preset analysis strategy to determine the MLCC capacitance with the short circuit risk;
and performing protection action aiming at the MLCC capacitor with the short circuit risk.
2. The method of monitoring the risk of short circuit in MLCC capacitors according to claim 1, wherein the baseboard management controller further analyzes the related electrical parameters relating to the devices in all yes terms according to a preset analysis strategy to determine the risk of short circuit in MLCC capacitors comprises:
the baseboard management controller collects the current of the device and/or the voltage of the related device and/or the temperature measured by a temperature sensor arranged around the related device for multiple times aiming at the related device of the items and analyzes the current and/or the voltage;
for all involved device currents, if the substrate management controller compares that one involved device current collected for many times exceeds a first safety value range indicated by a first threshold value of a corresponding device, and the current of the involved device does not return to the corresponding first safety value range trend according to time sequence, judging that the MLCC capacitor related to the device has a short-circuit risk;
for all involved device voltages, if the voltage of a device involved in comparison of multiple times collected by the board management controller exceeds a second safety value range indicated by a second threshold value of a corresponding device, and the voltage of the device involved in comparison does not return to the corresponding second safety value range trend according to time sequence, judging that the MLCC capacitor related to the device has a short-circuit risk;
and for all the involved device temperatures, if the board management controller compares that the involved device temperatures collected for multiple times are all larger than the third threshold of the corresponding device, and the involved device has no descending condition according to the time sequence, judging that the MLCC capacitor related to the device has a short-circuit risk.
3. The method of monitoring the risk of MLCC short circuit according to claim 2, wherein the baseboard management controller collects the device current, voltage, and temperature measured by the temperature sensor provided at the device a plurality of times at preset time intervals and at preset times.
4. The method of monitoring the risk of short circuit of the MLCC capacitor according to claim 1, wherein the performing a protection action for the MLCC capacitor at risk of short circuit comprises:
determining a power supply circuit of a circuit where the MLCC capacitor with the short circuit risk is located;
controlling the power supply circuit to implement power failure;
and the baseboard management controller generates and sends alarm information of the MLCC capacitor with short circuit risk.
5. The method for monitoring the risk of short circuit of MLCC capacitor according to claim 4, wherein the nodes powered by the circuit where the MLCC capacitor with the risk of short circuit is located in the server are notified, and the nodes store the working data.
6. The method as claimed in claim 1, wherein the baseboard management controller records a plurality of current and voltage data of the MLCC capacitor and all its surrounding components involved in the analyzed short-circuit risk of the MLCC capacitor and a plurality of temperature data measured by the temperature sensors disposed at the MLCC capacitor and its surrounding components when analyzing that any MLCC capacitor has the short-circuit risk.
7. An apparatus for monitoring MLCC capacitance short risk, comprising: the system comprises a substrate management controller, at least one first detection unit, at least one second detection unit, at least one temperature sensor, at least one third detection unit and at least one third detection unit, wherein the substrate management controller is connected with the at least one first detection unit, the first detection unit is used for detecting the current of the MLCC and the current of elements around the MLCC, the substrate management controller is connected with the at least one second detection unit, the second detection unit is used for detecting the voltage of the MLCC and the voltage of elements around the MLCC, the substrate management controller is connected with the at least one temperature sensor, the temperature sensor is scattered at each MLCC and the elements around the MLCC, the substrate management controller is connected with the at least one third detection unit used for detecting abnormal information of a server, and the at least one third detection unit is used for detecting the steep drop of the voltage;
the substrate management controller is connected with at least one power supply circuit to realize power-down control of the power supply circuit;
the baseboard management controller is provided with at least one memory, at least one instruction is stored in the memory, and the baseboard management controller reads and executes the instruction to realize the method for monitoring the MLCC short circuit risk according to any one of claims 1 to 6.
8. The device for monitoring the short-circuit risk of the MLCC capacitor as claimed in claim 7, wherein the baseboard management controller is connected to an alarm configured in the server chassis, and when the baseboard management controller determines that the MLCC capacitor at short-circuit risk exists, the alarm sends an alarm message.
9. The apparatus for monitoring MLCC capacitor short circuit risk according to claim 7, wherein the baseboard management controller is connected to the switch through a network port, the switch is connected to the management machine through a network, and the baseboard management controller sends alarm information to the management machine when determining that there is MLCC capacitor short circuit risk.
10. A storage medium storing at least one instruction, which is read and executed to implement the method of monitoring the risk of a short circuit in a MLCC capacitor as claimed in any one of claims 1-6.
CN202210467903.5A 2022-04-29 2022-04-29 Method, device and storage medium for monitoring MLCC capacitor short circuit risk Active CN114721859B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210467903.5A CN114721859B (en) 2022-04-29 2022-04-29 Method, device and storage medium for monitoring MLCC capacitor short circuit risk

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210467903.5A CN114721859B (en) 2022-04-29 2022-04-29 Method, device and storage medium for monitoring MLCC capacitor short circuit risk

Publications (2)

Publication Number Publication Date
CN114721859A true CN114721859A (en) 2022-07-08
CN114721859B CN114721859B (en) 2024-05-24

Family

ID=82246105

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210467903.5A Active CN114721859B (en) 2022-04-29 2022-04-29 Method, device and storage medium for monitoring MLCC capacitor short circuit risk

Country Status (1)

Country Link
CN (1) CN114721859B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101824718B1 (en) * 2017-06-28 2018-02-02 (주)대신피아이씨 Intelligent Automatic Control Panel Equipped with Self Diagnosis and Safety Functions
CN108736440A (en) * 2018-07-13 2018-11-02 深圳市华宝新能源股份有限公司 A kind of runaway electric circuit protection circuit and electronic device
CN208767775U (en) * 2018-07-13 2019-04-19 深圳市华宝新能源股份有限公司 A kind of runaway electric circuit protection circuit and electronic device
CN110058979A (en) * 2019-04-18 2019-07-26 苏州浪潮智能科技有限公司 A kind of temperature reads monitoring method, BMC and the storage medium of failure failure
WO2019174653A2 (en) * 2019-06-17 2019-09-19 广东恒翼能科技有限公司 Lithium battery thermal runaway early warning protection system and method
CN113708986A (en) * 2020-05-21 2021-11-26 鸿富锦精密电子(天津)有限公司 Server monitoring apparatus, method and computer-readable storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101824718B1 (en) * 2017-06-28 2018-02-02 (주)대신피아이씨 Intelligent Automatic Control Panel Equipped with Self Diagnosis and Safety Functions
CN108736440A (en) * 2018-07-13 2018-11-02 深圳市华宝新能源股份有限公司 A kind of runaway electric circuit protection circuit and electronic device
CN208767775U (en) * 2018-07-13 2019-04-19 深圳市华宝新能源股份有限公司 A kind of runaway electric circuit protection circuit and electronic device
CN110058979A (en) * 2019-04-18 2019-07-26 苏州浪潮智能科技有限公司 A kind of temperature reads monitoring method, BMC and the storage medium of failure failure
WO2019174653A2 (en) * 2019-06-17 2019-09-19 广东恒翼能科技有限公司 Lithium battery thermal runaway early warning protection system and method
CN113708986A (en) * 2020-05-21 2021-11-26 鸿富锦精密电子(天津)有限公司 Server monitoring apparatus, method and computer-readable storage medium

Also Published As

Publication number Publication date
CN114721859B (en) 2024-05-24

Similar Documents

Publication Publication Date Title
US20030158697A1 (en) Method and system for monitoring and profiling an integrated circuit die temperature
CN101387691B (en) Health monitoring for power converter components
CN107145428A (en) A kind of server and server monitoring method
EP3734310A1 (en) Power monitoring method, system and power supply
US20180120914A1 (en) Unified power device management and analyzer
US7003409B2 (en) Predictive failure analysis and failure isolation using current sensing
CN108959025A (en) A kind of server alarm method, device and server
US20240126639A1 (en) Server board card apparatus, detection method therefor, and detection device thereof, and storage medium
US20030115397A1 (en) Computer system with dedicated system management buses
CN114721859B (en) Method, device and storage medium for monitoring MLCC capacitor short circuit risk
CN107943654A (en) A kind of method of quick determining server environmental temperature monitoring abnormal cause
CN111475292B (en) Server system and frequency control device of internal processor thereof
WO2024103745A1 (en) Mainboard protection system and method
CN116792407A (en) Magnetic suspension bearing controller, power supply control method and device thereof and storage medium
CN115080360A (en) Method, device and equipment for detecting liquid cooling server and readable medium
CN113407418B (en) Computer early warning monitoring circuit and monitoring method thereof
KR102363987B1 (en) Smart server rack with high safety
CN113448422A (en) Server BIOS flash memory chip protection circuit and server
CN109284218A (en) A kind of method and device thereof of detection service device operation troubles
CN110927611A (en) Automatic fault reporting power supply, system and fault detection method
CN105245409A (en) Abnormality detection system and applicable router
CN1980096B (en) Fault sensor for a laser driver circuit
KR20210081046A (en) Preservation method of server rack
CN216595329U (en) Fan early warning device and electronic equipment
CN113606176B (en) Fan running state detection method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant