CN106681886A - Method and system for judging server fan damage - Google Patents
Method and system for judging server fan damage Download PDFInfo
- Publication number
- CN106681886A CN106681886A CN201611219197.3A CN201611219197A CN106681886A CN 106681886 A CN106681886 A CN 106681886A CN 201611219197 A CN201611219197 A CN 201611219197A CN 106681886 A CN106681886 A CN 106681886A
- Authority
- CN
- China
- Prior art keywords
- fan
- rotation speed
- damage
- zero
- damaged
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3058—Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3452—Performance evaluation by statistical analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Hardware Design (AREA)
- Probability & Statistics with Applications (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- Control Of Positive-Displacement Air Blowers (AREA)
Abstract
The invention discloses a method and system for judging server fan damage and belongs to the technical field of server heat dissipation. The method for judging server fan damage comprises the steps that synchronous real-time monitoring is conducted on the duty ratio and rotation speed of a fan based on a BMC chip function, monitored information is sent to a fan damage judging module in a BMC, and two fan damage judging algorithms are integrated in the fan damage judging module and are respectively used for judging whether the rotation speed of the fan is zero or not, a fan rotation speed free signal corresponding to zero fan rotation speed is transmitted back to the BMC, it is indicated that the fan is completed damaged, an alarm is started, and a fan adjustment and control mechanism during fan damage is called; when the fan rotation speed is not zero, the fan is normal or slightly damaged, and the rotation speed is reduced. The method for judging server fan damage conducts synchronous real-time monitoring on the duty ratio and rotation speed of the fan, fan faults are confirmed according to the monitored information, and the method has a very good popularization and application value.
Description
Technical field
The present invention relates to server radiating technical field, specifically provides a kind of method of determining server fan damage and is
System.
Background technology
Server compared with traditional computer with more preferable autgmentability, ease for use and ease of manageability, by each big
The extensive application of type enterprise.With economic further development, performance requirements more and more higher of the user to server, phase
Each hardware capability for the server answered is optimized.With the raising of server performance, server is produced in running
The more heats of life, in order to server can normally run, it is the most important thing that the heat of generation is discharged in time.Existing service
Radiated using many fans in device system, when fan breaks down, whole fan delivery can be less than actual demand air quantity, cause
System radiating goes wrong, and then affects the normal work of server.Can operationally there is the impact of various faults situation in fan
Systematic air flow, mainly has following two:First, fan pcb board is destroyed causes fan to work;2nd, fan works long hours
Aft-fan rotating speed gradually lowers.Existing server fan damage monitoring mechanism only may determine that the first situation, i.e. fan
Situation about cannot work is damaged completely, for second situation but cannot be monitored.Fan failure monitoring can not cause service comprehensively
There is potential radiating risk in device, it is impossible to ensure the normal work of server, be further improved.
The content of the invention
The technical assignment of the present invention is for above-mentioned problem, there is provided a kind of by the way that to fan duty, when fan turns
Speed synchronizes real-time monitoring, and the method damaged according to the determining server fan of the validation of information fan failure for monitoring.
Further technical assignment of the invention is to provide what a kind of determining server fan that can realize said method was damaged
System.
For achieving the above object, the invention provides following technical scheme:
A kind of method that determining server fan is damaged, based on BMC chip function, to fan duty, when rotation speed of the fan is carried out together
Step real-time monitoring, and the information for monitoring is sent in BMC in fan damage determination module, fan is damaged in determination module and collected
Into two kinds of fan failure decision algorithms, it is respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum:Rotation speed of the fan is zero pair
Fan-free tach signal is answered to return to BMC, then fan is damaged completely, fan regulation and controlling machine when startup is reported to the police and calls fan to damage
System;Rotation speed of the fan is not zero, then fan is normal or slight damage rotating speed declines, further corresponding with dutycycle according to rotation speed of the fan
Confirming that fan is normal or slight damage, the formula is formula:RPM<PWM*X+Y, when formula is less than into rotation speed of the fan immediately
During duty ratio corresponding correspondence rotation speed of the fan, confirm that fan slight damage rotating speed declines, start fan failure alarm mechanism, and call
Fan regulation and controlling mechanism when fan is damaged, when formula is false, confirms that fan is normal.
PWM is the abbreviation of Pulse Width Modulation, and this patent expression is fan dutycycle.RPM is
The abbreviation of Revolutions Per Minute, this patent expression is corresponding rotation speed of the fan under different fan dutycycles.
The method that the determining server fan of the present invention is damaged, for the irrational problem of fan failure detection mode, leads to
Cross to fan duty when rotation speed of the fan synchronous detecting, actual rotation speed of the fan and theoretical value corresponding relation are contrasted by algorithm optimization
Judge that fan damages problem with the presence or absence of slow;And different decision mechanism is called for different fan failure situations, ensureing system
Optimize fan damage monitoring mechanism under system heat dispersal situations, can at utmost ensure the normal work of server.
The fan model of specific service type number collocation is fixed, and can define rotation speed of the fan according to fan specifications book reasonable
Value, clearly corresponds to rotation speed of the fan lower limit during difference fan dutycycle, it is possible to be set as that correspondence computing formula is integrated in fan
In damaging decision algorithm, concrete formula is:RPM<PWM*X+Y, enters using whether the formula can occur slight damage to fan
Row confirms.When formula is set up, i.e., when rotation speed of the fan is less than duty ratio corresponding correspondence rotation speed of the fan, confirm fan slight damage rotating speed
Decline, so as to start fan failure alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage in time, it is ensured that fan can be just
Normal work.When formula is false, confirm that fan is normal.
Preferably, when rotation speed of the fan is zero, if rotating speed is always zero in setting 3s, confirming that fan is damaged, start report
Fan regulation and controlling mechanism when warning and call fan to damage;If rotating speed is not always zero in setting 3s, fan cannot be judged whether
Damage, be again started up fan damage determination module and judged.
Fan duty cycle signals presence mutation is possible during practical application, and it is predetermined that rotation speed of the fan needs certain response time to reach
Rotating speed, to avoid fan from damaging false alarm, for two kinds of fan failures different warning durations is set.For rotation speed of the fan is zero
Situation, in setting 3s rotating speed is always zero just to can confirm that fan is damaged completely, it is to avoid fan damages false alarm situation
Occur.
Preferably, working as formula RPM<When PWM*X+Y sets up, if setting more than the 15s formula are set up, fan is confirmed
Slight damage rotating speed declines, and starts fan failure alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage;Setting 15s
Above the formula is false, then cannot judge whether fan damages, and is again started up fan damage determination module and is judged.
The system that a kind of determining server fan is damaged, including:
Fan control module:For monitoring the duty cycle signals of fan and the tach signal of fan;
Fan damages determination module:For judging whether fan damages according to fan duty cycle signals and fan rotating speed signals;
The fan control module and fan damage determination module and communicate with system fan respectively, in being located at BMC chip.
Preferably, the fan damages integrated two kinds of fan failure decision algorithms in determination module, it is respectively used to judge
The situation that rotation speed of the fan is not zero for zero-sum, rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, then fan is complete
Damage;Rotation speed of the fan is not zero, then fan is normal or slight damage rotating speed declines.
Preferably, when fan is normal or slight damage rotating speed declines, using rotation speed of the fan formula corresponding with dutycycle come
Confirm that fan is normal or slight damage, the formula is:RPM<PWM*X+Y, when formula is accounted for into rotation speed of the fan immediately less than correspondence
When sky is than corresponding rotation speed of the fan, confirm that fan slight damage rotating speed declines, start fan failure alarm mechanism, and call fan to damage
The fan regulation and controlling mechanism of bad when, when formula is false, confirms that fan is normal.
Fan duty cycle signals presence mutation is possible during practical application, and it is predetermined that rotation speed of the fan needs certain response time to reach
Rotating speed, to avoid fan from damaging false alarm, for two kinds of fan failures different warning durations is set.For rotation speed of the fan is zero
Situation, rotating speed is always zero just to can confirm that fan is damaged completely in setting 3s.As formula RPM<When PWM*X+Y sets up, if
Determine more than the 15s formula to set up, then confirm that fan slight damage rotating speed declines, setting more than the 15s formula are false, then without
Method judges whether fan damages, and is again started up fan damage determination module and is judged.
Compared with prior art, the method that determining server fan of the invention is damaged has beneficial effect following prominent
Really:
(One)The method that the determining server fan of the present invention is damaged, by fan duty when rotation speed of the fan synchronous monitoring,
Actual rotation speed of the fan is contrasted by algorithm optimization and judges that fan damages problem with the presence or absence of slow with theoretical value corresponding relation;And pin
Different decision mechanism is called to different fan failure situations, fan damage monitoring mechanism is optimized in the case of system radiating is ensured,
The normal work of server can at utmost be ensured;
(Two)Fan duty cycle signals presence mutation is possible during practical application, and it is predetermined that rotation speed of the fan needs certain response time to reach
Rotating speed, to avoid fan from damaging false alarm, for two kinds of fan failures different warning durations is set, and can effectively avoid sending out
Raw fan false alarm problem.
Description of the drawings
Fig. 1 is the flow chart of the method that determining server fan of the present invention is damaged;
Fig. 2 is the topological diagram of the system that determining server fan of the present invention is damaged.
Specific embodiment
Below in conjunction with drawings and Examples, the method and system that the determining server fan of the present invention is damaged are made into one
Step is described in detail.
Embodiment 1
As shown in figure 1, the method that the determining server fan of the present invention is damaged, based on BMC chip function, to fan duty when
Rotation speed of the fan synchronizes real-time monitoring, and the information for monitoring is sent in BMC in fan damage determination module, and fan is damaged
Integrated two kinds of fan failure decision algorithms, are respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum in bad determination module.
Rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, and rotating speed is always zero in setting 3s, confirms that fan is damaged, and is opened
Move fan regulation and controlling mechanism when reporting to the police and calling fan to damage;Rotating speed is not always zero in setting 3s, then cannot judge that fan is
No damage, is again started up fan damage determination module and is judged.Rotation speed of the fan is not zero, then fan is normal or slight damage turns
Speed declines, further according to rotation speed of the fan formula corresponding with dutycycle confirming fan normally or slight damage.The formula is:
RPM<PWM*X+Y, when formula rotation speed of the fan corresponding less than duty ratio corresponding into rotation speed of the fan immediately, sets more than the 15s public affairs
Formula is set up, then confirm that fan slight damage rotating speed declines, and starts fan failure alarm mechanism, and wind when calling fan to damage
Fan regulatory mechanism.Setting more than the 15s formula are false, then cannot judge whether fan damages, and are again started up fan and damage to sentence
Cover half block is judged.
Embodiment 2
As shown in Fig. 2 the system that the determining server fan of the present invention is damaged, including:
Fan control module:For monitoring the duty cycle signals of fan and the tach signal of fan.
Fan damages determination module:For judging whether fan damages according to fan duty cycle signals and fan rotating speed signals
It is bad.Fan damages integrated two kinds of fan failure decision algorithms in determination module, be respectively used to judge rotation speed of the fan as zero-sum not as
Zero situation.Rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, if rotating speed is always zero in setting 3s, is confirmed
Fan is damaged, fan regulation and controlling mechanism when startup is reported to the police and calls fan to damage;If rotating speed is not always zero in setting 3s, nothing
Method judges whether fan damages, and is again started up fan damage determination module and is judged.Rotation speed of the fan is not zero, then fan is normal
Or slight damage rotating speed declines, further according to rotation speed of the fan formula corresponding with dutycycle confirming that fan is normal or slight damage
It is bad.The formula is:RPM<PWM*X+Y, when formula rotation speed of the fan corresponding less than duty ratio corresponding into rotation speed of the fan immediately, if
Determine more than the 15s formula to set up, then confirm that fan slight damage rotating speed declines, start fan failure alarm mechanism, and call
Fan regulation and controlling mechanism when fan is damaged.Setting more than the 15s formula are false, then cannot judge whether fan damages, again
Start fan damage determination module to be judged.
Fan control module and damage determination module are communicated respectively with system fan, in being located at BMC chip.
Embodiment described above, the simply present invention more preferably specific embodiment, those skilled in the art is at this
The usual variations and alternatives carried out in the range of inventive technique scheme all should be comprising within the scope of the present invention.
Claims (6)
1. a kind of method that determining server fan is damaged, it is characterised in that:Based on BMC chip function, to fan duty when
Rotation speed of the fan synchronizes real-time monitoring, and the information for monitoring is sent in BMC in fan damage determination module, and fan is damaged
Integrated two kinds of fan failure decision algorithms, are respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum in bad determination module:
Rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, then fan is damaged completely, and startup is reported to the police and calls fan to damage
When fan regulation and controlling mechanism;Rotation speed of the fan is not zero, then fan is normal or slight damage rotating speed declines, and is further turned according to fan
Confirming fan normally or slight damage, the formula is corresponding with the dutycycle formula of speed:RPM<PWM*X+Y, when formula is set up
When i.e. rotation speed of the fan is less than duty ratio corresponding correspondence rotation speed of the fan, confirm that fan slight damage rotating speed declines, start fan failure
Alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage, when formula is false, confirm that fan is normal.
2. the method that determining server fan according to claim 1 is damaged, it is characterised in that:When rotation speed of the fan is zero,
If rotating speed is always zero in setting 3s, the fan regulation and controlling mechanism that fan is damaged, when startup is reported to the police and calls fan to damage is confirmed;
If rotating speed is not always zero in setting 3s, cannot judge whether fan damages, being again started up fan damage determination module is carried out
Judge.
3. the method that determining server fan according to claim 1 and 2 is damaged, it is characterised in that:As formula RPM<
When PWM*X+Y sets up, if setting more than the 15s formula are set up, confirm that fan slight damage rotating speed declines, start fan therefore
Barrier alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage;Setting more than the 15s formula are false, then cannot be judged
Whether fan damages, and is again started up fan damage determination module and is judged.
4. the system that a kind of determining server fan is damaged, it is characterised in that:Including:
Fan control module:For monitoring the duty cycle signals of fan and the tach signal of fan;
Fan damages determination module:For judging whether fan damages according to fan duty cycle signals and fan rotating speed signals;
The fan control module and fan damage determination module and communicate with system fan respectively, in being located at BMC chip.
5. the system that determining server fan according to claim 4 is damaged, it is characterised in that:The fan is damaged and judged
Integrated two kinds of fan failure decision algorithms in module, are respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum, fan turns
Speed is that zero corresponding fan-free tach signal returns to BMC, then fan is damaged completely;Rotation speed of the fan is not zero, then fan it is normal or
Slight damage rotating speed declines.
6. the system that determining server fan according to claim 5 is damaged, it is characterised in that:Fan is normal or slightly damages
When bad rotating speed declines, fan is confirmed using rotation speed of the fan formula corresponding with dutycycle normally or slight damage, the formula is:
RPM<PWM*X+Y, when formula rotation speed of the fan corresponding less than duty ratio corresponding into rotation speed of the fan immediately, confirms fan slight damage
Rotating speed declines, and starts fan failure alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage, when formula is false,
Confirm that fan is normal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611219197.3A CN106681886A (en) | 2016-12-26 | 2016-12-26 | Method and system for judging server fan damage |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611219197.3A CN106681886A (en) | 2016-12-26 | 2016-12-26 | Method and system for judging server fan damage |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106681886A true CN106681886A (en) | 2017-05-17 |
Family
ID=58870821
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611219197.3A Pending CN106681886A (en) | 2016-12-26 | 2016-12-26 | Method and system for judging server fan damage |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106681886A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107023504A (en) * | 2017-06-02 | 2017-08-08 | 郑州云海信息技术有限公司 | A kind of fan control system and control method based on BMC |
CN108459942A (en) * | 2018-03-15 | 2018-08-28 | 联想(北京)有限公司 | A kind of data processing method, device and storage medium |
CN109763992A (en) * | 2019-03-29 | 2019-05-17 | 新华三技术有限公司 | The service life method for early warning and device of equipment fan |
CN110630552A (en) * | 2019-09-21 | 2019-12-31 | 苏州浪潮智能科技有限公司 | System, method and device for detecting fan link fault |
CN111367251A (en) * | 2018-12-26 | 2020-07-03 | 技嘉科技股份有限公司 | Method and system for testing fan control signal of mainboard |
CN111750918A (en) * | 2020-05-28 | 2020-10-09 | 苏州浪潮智能科技有限公司 | Dust inlet monitoring method and system for edge computing server |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080104453A1 (en) * | 2004-07-06 | 2008-05-01 | Udayan Mukherjee | System and Method to Detect Errors and Predict Potential Failures |
CN102193855A (en) * | 2010-03-12 | 2011-09-21 | 鸿富锦精密工业(深圳)有限公司 | Abnormity alarm circuit for fan |
CN102419625A (en) * | 2011-12-31 | 2012-04-18 | 曙光信息产业股份有限公司 | Heat dissipation system and fan controlling device |
CN103186207A (en) * | 2011-12-28 | 2013-07-03 | 鸿富锦精密工业(深圳)有限公司 | Radiation system and control method applied to same |
CN104763665A (en) * | 2015-03-04 | 2015-07-08 | 杭州华三通信技术有限公司 | Fan fault detection method and device of network device |
-
2016
- 2016-12-26 CN CN201611219197.3A patent/CN106681886A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080104453A1 (en) * | 2004-07-06 | 2008-05-01 | Udayan Mukherjee | System and Method to Detect Errors and Predict Potential Failures |
CN102193855A (en) * | 2010-03-12 | 2011-09-21 | 鸿富锦精密工业(深圳)有限公司 | Abnormity alarm circuit for fan |
CN103186207A (en) * | 2011-12-28 | 2013-07-03 | 鸿富锦精密工业(深圳)有限公司 | Radiation system and control method applied to same |
CN102419625A (en) * | 2011-12-31 | 2012-04-18 | 曙光信息产业股份有限公司 | Heat dissipation system and fan controlling device |
CN104763665A (en) * | 2015-03-04 | 2015-07-08 | 杭州华三通信技术有限公司 | Fan fault detection method and device of network device |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107023504A (en) * | 2017-06-02 | 2017-08-08 | 郑州云海信息技术有限公司 | A kind of fan control system and control method based on BMC |
CN108459942A (en) * | 2018-03-15 | 2018-08-28 | 联想(北京)有限公司 | A kind of data processing method, device and storage medium |
CN111367251A (en) * | 2018-12-26 | 2020-07-03 | 技嘉科技股份有限公司 | Method and system for testing fan control signal of mainboard |
CN109763992A (en) * | 2019-03-29 | 2019-05-17 | 新华三技术有限公司 | The service life method for early warning and device of equipment fan |
CN109763992B (en) * | 2019-03-29 | 2020-12-29 | 新华三技术有限公司 | Service life early warning method and device for equipment fan |
CN110630552A (en) * | 2019-09-21 | 2019-12-31 | 苏州浪潮智能科技有限公司 | System, method and device for detecting fan link fault |
CN111750918A (en) * | 2020-05-28 | 2020-10-09 | 苏州浪潮智能科技有限公司 | Dust inlet monitoring method and system for edge computing server |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106681886A (en) | Method and system for judging server fan damage | |
US7612508B2 (en) | System and method for communication with an information handling system cooling fan | |
CN105677500A (en) | Method for diagnosing fault of server in real time | |
CN104850485A (en) | BMC based method and system for remote diagnosis of server startup failure | |
CN107612748A (en) | A kind of multi node server power consumption management system | |
CN105045689A (en) | Method for using RAID card to perform hard disk batch detection, monitoring and alerting | |
CN107632907B (en) | BMC chip hosting system and control method thereof | |
CN103780432A (en) | Parking lot operation and maintenance method and system, lane controller, server and mobile terminal | |
CN106933710A (en) | The method of testing that DC is restarted is carried out to server based on WOL functions | |
CN103607314A (en) | System for monitoring and managing server by using SNMP (Simple Network Management Protocol) | |
CN103577298A (en) | Baseboard management controller monitoring system and method | |
CN108280016A (en) | A kind of fan detection method, device, equipment and computer readable storage medium | |
CN109766694A (en) | Program protocol white list linkage method and device of industrial control host | |
CN103580941B (en) | Network watchdog and its implementation | |
CN104699589A (en) | Fan error detection system and method | |
CN108983922A (en) | Working frequency adjusting method, device and server | |
CN102915265A (en) | BMC (baseboard management controller) loop test method and system | |
CN104780062A (en) | Method for quickly acquiring IP address of BMC management network interface | |
CN107026759A (en) | The firmware and its development approach of a kind of remote management BBU modules based on BMC | |
CN110134546B (en) | Batch restarting windows system method, electronic device and storage medium | |
JP6583942B1 (en) | BMC, determination method and BMC firmware | |
CN104714866A (en) | Fan testing system and method | |
CN109960638A (en) | BMC starts reason recording method, system, device and readable storage medium storing program for executing | |
CN104699585A (en) | Server early warning value setting system and method and server | |
CN102882698B (en) | virtual machine management method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170517 |
|
RJ01 | Rejection of invention patent application after publication |