CN106681886A - Method and system for judging server fan damage - Google Patents

Method and system for judging server fan damage Download PDF

Info

Publication number
CN106681886A
CN106681886A CN201611219197.3A CN201611219197A CN106681886A CN 106681886 A CN106681886 A CN 106681886A CN 201611219197 A CN201611219197 A CN 201611219197A CN 106681886 A CN106681886 A CN 106681886A
Authority
CN
China
Prior art keywords
fan
rotation speed
damage
zero
damaged
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611219197.3A
Other languages
Chinese (zh)
Inventor
于光义
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201611219197.3A priority Critical patent/CN106681886A/en
Publication of CN106681886A publication Critical patent/CN106681886A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3452Performance evaluation by statistical analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Hardware Design (AREA)
  • Probability & Statistics with Applications (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • Control Of Positive-Displacement Air Blowers (AREA)

Abstract

The invention discloses a method and system for judging server fan damage and belongs to the technical field of server heat dissipation. The method for judging server fan damage comprises the steps that synchronous real-time monitoring is conducted on the duty ratio and rotation speed of a fan based on a BMC chip function, monitored information is sent to a fan damage judging module in a BMC, and two fan damage judging algorithms are integrated in the fan damage judging module and are respectively used for judging whether the rotation speed of the fan is zero or not, a fan rotation speed free signal corresponding to zero fan rotation speed is transmitted back to the BMC, it is indicated that the fan is completed damaged, an alarm is started, and a fan adjustment and control mechanism during fan damage is called; when the fan rotation speed is not zero, the fan is normal or slightly damaged, and the rotation speed is reduced. The method for judging server fan damage conducts synchronous real-time monitoring on the duty ratio and rotation speed of the fan, fan faults are confirmed according to the monitored information, and the method has a very good popularization and application value.

Description

The method and system that a kind of determining server fan is damaged
Technical field
The present invention relates to server radiating technical field, specifically provides a kind of method of determining server fan damage and is System.
Background technology
Server compared with traditional computer with more preferable autgmentability, ease for use and ease of manageability, by each big The extensive application of type enterprise.With economic further development, performance requirements more and more higher of the user to server, phase Each hardware capability for the server answered is optimized.With the raising of server performance, server is produced in running The more heats of life, in order to server can normally run, it is the most important thing that the heat of generation is discharged in time.Existing service Radiated using many fans in device system, when fan breaks down, whole fan delivery can be less than actual demand air quantity, cause System radiating goes wrong, and then affects the normal work of server.Can operationally there is the impact of various faults situation in fan Systematic air flow, mainly has following two:First, fan pcb board is destroyed causes fan to work;2nd, fan works long hours Aft-fan rotating speed gradually lowers.Existing server fan damage monitoring mechanism only may determine that the first situation, i.e. fan Situation about cannot work is damaged completely, for second situation but cannot be monitored.Fan failure monitoring can not cause service comprehensively There is potential radiating risk in device, it is impossible to ensure the normal work of server, be further improved.
The content of the invention
The technical assignment of the present invention is for above-mentioned problem, there is provided a kind of by the way that to fan duty, when fan turns Speed synchronizes real-time monitoring, and the method damaged according to the determining server fan of the validation of information fan failure for monitoring.
Further technical assignment of the invention is to provide what a kind of determining server fan that can realize said method was damaged System.
For achieving the above object, the invention provides following technical scheme:
A kind of method that determining server fan is damaged, based on BMC chip function, to fan duty, when rotation speed of the fan is carried out together Step real-time monitoring, and the information for monitoring is sent in BMC in fan damage determination module, fan is damaged in determination module and collected Into two kinds of fan failure decision algorithms, it is respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum:Rotation speed of the fan is zero pair Fan-free tach signal is answered to return to BMC, then fan is damaged completely, fan regulation and controlling machine when startup is reported to the police and calls fan to damage System;Rotation speed of the fan is not zero, then fan is normal or slight damage rotating speed declines, further corresponding with dutycycle according to rotation speed of the fan Confirming that fan is normal or slight damage, the formula is formula:RPM<PWM*X+Y, when formula is less than into rotation speed of the fan immediately During duty ratio corresponding correspondence rotation speed of the fan, confirm that fan slight damage rotating speed declines, start fan failure alarm mechanism, and call Fan regulation and controlling mechanism when fan is damaged, when formula is false, confirms that fan is normal.
PWM is the abbreviation of Pulse Width Modulation, and this patent expression is fan dutycycle.RPM is The abbreviation of Revolutions Per Minute, this patent expression is corresponding rotation speed of the fan under different fan dutycycles.
The method that the determining server fan of the present invention is damaged, for the irrational problem of fan failure detection mode, leads to Cross to fan duty when rotation speed of the fan synchronous detecting, actual rotation speed of the fan and theoretical value corresponding relation are contrasted by algorithm optimization Judge that fan damages problem with the presence or absence of slow;And different decision mechanism is called for different fan failure situations, ensureing system Optimize fan damage monitoring mechanism under system heat dispersal situations, can at utmost ensure the normal work of server.
The fan model of specific service type number collocation is fixed, and can define rotation speed of the fan according to fan specifications book reasonable Value, clearly corresponds to rotation speed of the fan lower limit during difference fan dutycycle, it is possible to be set as that correspondence computing formula is integrated in fan In damaging decision algorithm, concrete formula is:RPM<PWM*X+Y, enters using whether the formula can occur slight damage to fan Row confirms.When formula is set up, i.e., when rotation speed of the fan is less than duty ratio corresponding correspondence rotation speed of the fan, confirm fan slight damage rotating speed Decline, so as to start fan failure alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage in time, it is ensured that fan can be just Normal work.When formula is false, confirm that fan is normal.
Preferably, when rotation speed of the fan is zero, if rotating speed is always zero in setting 3s, confirming that fan is damaged, start report Fan regulation and controlling mechanism when warning and call fan to damage;If rotating speed is not always zero in setting 3s, fan cannot be judged whether Damage, be again started up fan damage determination module and judged.
Fan duty cycle signals presence mutation is possible during practical application, and it is predetermined that rotation speed of the fan needs certain response time to reach Rotating speed, to avoid fan from damaging false alarm, for two kinds of fan failures different warning durations is set.For rotation speed of the fan is zero Situation, in setting 3s rotating speed is always zero just to can confirm that fan is damaged completely, it is to avoid fan damages false alarm situation Occur.
Preferably, working as formula RPM<When PWM*X+Y sets up, if setting more than the 15s formula are set up, fan is confirmed Slight damage rotating speed declines, and starts fan failure alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage;Setting 15s Above the formula is false, then cannot judge whether fan damages, and is again started up fan damage determination module and is judged.
The system that a kind of determining server fan is damaged, including:
Fan control module:For monitoring the duty cycle signals of fan and the tach signal of fan;
Fan damages determination module:For judging whether fan damages according to fan duty cycle signals and fan rotating speed signals;
The fan control module and fan damage determination module and communicate with system fan respectively, in being located at BMC chip.
Preferably, the fan damages integrated two kinds of fan failure decision algorithms in determination module, it is respectively used to judge The situation that rotation speed of the fan is not zero for zero-sum, rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, then fan is complete Damage;Rotation speed of the fan is not zero, then fan is normal or slight damage rotating speed declines.
Preferably, when fan is normal or slight damage rotating speed declines, using rotation speed of the fan formula corresponding with dutycycle come Confirm that fan is normal or slight damage, the formula is:RPM<PWM*X+Y, when formula is accounted for into rotation speed of the fan immediately less than correspondence When sky is than corresponding rotation speed of the fan, confirm that fan slight damage rotating speed declines, start fan failure alarm mechanism, and call fan to damage The fan regulation and controlling mechanism of bad when, when formula is false, confirms that fan is normal.
Fan duty cycle signals presence mutation is possible during practical application, and it is predetermined that rotation speed of the fan needs certain response time to reach Rotating speed, to avoid fan from damaging false alarm, for two kinds of fan failures different warning durations is set.For rotation speed of the fan is zero Situation, rotating speed is always zero just to can confirm that fan is damaged completely in setting 3s.As formula RPM<When PWM*X+Y sets up, if Determine more than the 15s formula to set up, then confirm that fan slight damage rotating speed declines, setting more than the 15s formula are false, then without Method judges whether fan damages, and is again started up fan damage determination module and is judged.
Compared with prior art, the method that determining server fan of the invention is damaged has beneficial effect following prominent Really:
(One)The method that the determining server fan of the present invention is damaged, by fan duty when rotation speed of the fan synchronous monitoring, Actual rotation speed of the fan is contrasted by algorithm optimization and judges that fan damages problem with the presence or absence of slow with theoretical value corresponding relation;And pin Different decision mechanism is called to different fan failure situations, fan damage monitoring mechanism is optimized in the case of system radiating is ensured, The normal work of server can at utmost be ensured;
(Two)Fan duty cycle signals presence mutation is possible during practical application, and it is predetermined that rotation speed of the fan needs certain response time to reach Rotating speed, to avoid fan from damaging false alarm, for two kinds of fan failures different warning durations is set, and can effectively avoid sending out Raw fan false alarm problem.
Description of the drawings
Fig. 1 is the flow chart of the method that determining server fan of the present invention is damaged;
Fig. 2 is the topological diagram of the system that determining server fan of the present invention is damaged.
Specific embodiment
Below in conjunction with drawings and Examples, the method and system that the determining server fan of the present invention is damaged are made into one Step is described in detail.
Embodiment 1
As shown in figure 1, the method that the determining server fan of the present invention is damaged, based on BMC chip function, to fan duty when Rotation speed of the fan synchronizes real-time monitoring, and the information for monitoring is sent in BMC in fan damage determination module, and fan is damaged Integrated two kinds of fan failure decision algorithms, are respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum in bad determination module. Rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, and rotating speed is always zero in setting 3s, confirms that fan is damaged, and is opened Move fan regulation and controlling mechanism when reporting to the police and calling fan to damage;Rotating speed is not always zero in setting 3s, then cannot judge that fan is No damage, is again started up fan damage determination module and is judged.Rotation speed of the fan is not zero, then fan is normal or slight damage turns Speed declines, further according to rotation speed of the fan formula corresponding with dutycycle confirming fan normally or slight damage.The formula is: RPM<PWM*X+Y, when formula rotation speed of the fan corresponding less than duty ratio corresponding into rotation speed of the fan immediately, sets more than the 15s public affairs Formula is set up, then confirm that fan slight damage rotating speed declines, and starts fan failure alarm mechanism, and wind when calling fan to damage Fan regulatory mechanism.Setting more than the 15s formula are false, then cannot judge whether fan damages, and are again started up fan and damage to sentence Cover half block is judged.
Embodiment 2
As shown in Fig. 2 the system that the determining server fan of the present invention is damaged, including:
Fan control module:For monitoring the duty cycle signals of fan and the tach signal of fan.
Fan damages determination module:For judging whether fan damages according to fan duty cycle signals and fan rotating speed signals It is bad.Fan damages integrated two kinds of fan failure decision algorithms in determination module, be respectively used to judge rotation speed of the fan as zero-sum not as Zero situation.Rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, if rotating speed is always zero in setting 3s, is confirmed Fan is damaged, fan regulation and controlling mechanism when startup is reported to the police and calls fan to damage;If rotating speed is not always zero in setting 3s, nothing Method judges whether fan damages, and is again started up fan damage determination module and is judged.Rotation speed of the fan is not zero, then fan is normal Or slight damage rotating speed declines, further according to rotation speed of the fan formula corresponding with dutycycle confirming that fan is normal or slight damage It is bad.The formula is:RPM<PWM*X+Y, when formula rotation speed of the fan corresponding less than duty ratio corresponding into rotation speed of the fan immediately, if Determine more than the 15s formula to set up, then confirm that fan slight damage rotating speed declines, start fan failure alarm mechanism, and call Fan regulation and controlling mechanism when fan is damaged.Setting more than the 15s formula are false, then cannot judge whether fan damages, again Start fan damage determination module to be judged.
Fan control module and damage determination module are communicated respectively with system fan, in being located at BMC chip.
Embodiment described above, the simply present invention more preferably specific embodiment, those skilled in the art is at this The usual variations and alternatives carried out in the range of inventive technique scheme all should be comprising within the scope of the present invention.

Claims (6)

1. a kind of method that determining server fan is damaged, it is characterised in that:Based on BMC chip function, to fan duty when Rotation speed of the fan synchronizes real-time monitoring, and the information for monitoring is sent in BMC in fan damage determination module, and fan is damaged Integrated two kinds of fan failure decision algorithms, are respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum in bad determination module: Rotation speed of the fan is that zero corresponding fan-free tach signal returns to BMC, then fan is damaged completely, and startup is reported to the police and calls fan to damage When fan regulation and controlling mechanism;Rotation speed of the fan is not zero, then fan is normal or slight damage rotating speed declines, and is further turned according to fan Confirming fan normally or slight damage, the formula is corresponding with the dutycycle formula of speed:RPM<PWM*X+Y, when formula is set up When i.e. rotation speed of the fan is less than duty ratio corresponding correspondence rotation speed of the fan, confirm that fan slight damage rotating speed declines, start fan failure Alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage, when formula is false, confirm that fan is normal.
2. the method that determining server fan according to claim 1 is damaged, it is characterised in that:When rotation speed of the fan is zero, If rotating speed is always zero in setting 3s, the fan regulation and controlling mechanism that fan is damaged, when startup is reported to the police and calls fan to damage is confirmed; If rotating speed is not always zero in setting 3s, cannot judge whether fan damages, being again started up fan damage determination module is carried out Judge.
3. the method that determining server fan according to claim 1 and 2 is damaged, it is characterised in that:As formula RPM< When PWM*X+Y sets up, if setting more than the 15s formula are set up, confirm that fan slight damage rotating speed declines, start fan therefore Barrier alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage;Setting more than the 15s formula are false, then cannot be judged Whether fan damages, and is again started up fan damage determination module and is judged.
4. the system that a kind of determining server fan is damaged, it is characterised in that:Including:
Fan control module:For monitoring the duty cycle signals of fan and the tach signal of fan;
Fan damages determination module:For judging whether fan damages according to fan duty cycle signals and fan rotating speed signals;
The fan control module and fan damage determination module and communicate with system fan respectively, in being located at BMC chip.
5. the system that determining server fan according to claim 4 is damaged, it is characterised in that:The fan is damaged and judged Integrated two kinds of fan failure decision algorithms in module, are respectively used to judge the situation that rotation speed of the fan is not zero as zero-sum, fan turns Speed is that zero corresponding fan-free tach signal returns to BMC, then fan is damaged completely;Rotation speed of the fan is not zero, then fan it is normal or Slight damage rotating speed declines.
6. the system that determining server fan according to claim 5 is damaged, it is characterised in that:Fan is normal or slightly damages When bad rotating speed declines, fan is confirmed using rotation speed of the fan formula corresponding with dutycycle normally or slight damage, the formula is: RPM<PWM*X+Y, when formula rotation speed of the fan corresponding less than duty ratio corresponding into rotation speed of the fan immediately, confirms fan slight damage Rotating speed declines, and starts fan failure alarm mechanism, and fan regulation and controlling mechanism when calling fan to damage, when formula is false, Confirm that fan is normal.
CN201611219197.3A 2016-12-26 2016-12-26 Method and system for judging server fan damage Pending CN106681886A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611219197.3A CN106681886A (en) 2016-12-26 2016-12-26 Method and system for judging server fan damage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611219197.3A CN106681886A (en) 2016-12-26 2016-12-26 Method and system for judging server fan damage

Publications (1)

Publication Number Publication Date
CN106681886A true CN106681886A (en) 2017-05-17

Family

ID=58870821

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611219197.3A Pending CN106681886A (en) 2016-12-26 2016-12-26 Method and system for judging server fan damage

Country Status (1)

Country Link
CN (1) CN106681886A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107023504A (en) * 2017-06-02 2017-08-08 郑州云海信息技术有限公司 A kind of fan control system and control method based on BMC
CN108459942A (en) * 2018-03-15 2018-08-28 联想(北京)有限公司 A kind of data processing method, device and storage medium
CN109763992A (en) * 2019-03-29 2019-05-17 新华三技术有限公司 The service life method for early warning and device of equipment fan
CN110630552A (en) * 2019-09-21 2019-12-31 苏州浪潮智能科技有限公司 System, method and device for detecting fan link fault
CN111367251A (en) * 2018-12-26 2020-07-03 技嘉科技股份有限公司 Method and system for testing fan control signal of mainboard
CN111750918A (en) * 2020-05-28 2020-10-09 苏州浪潮智能科技有限公司 Dust inlet monitoring method and system for edge computing server

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080104453A1 (en) * 2004-07-06 2008-05-01 Udayan Mukherjee System and Method to Detect Errors and Predict Potential Failures
CN102193855A (en) * 2010-03-12 2011-09-21 鸿富锦精密工业(深圳)有限公司 Abnormity alarm circuit for fan
CN102419625A (en) * 2011-12-31 2012-04-18 曙光信息产业股份有限公司 Heat dissipation system and fan controlling device
CN103186207A (en) * 2011-12-28 2013-07-03 鸿富锦精密工业(深圳)有限公司 Radiation system and control method applied to same
CN104763665A (en) * 2015-03-04 2015-07-08 杭州华三通信技术有限公司 Fan fault detection method and device of network device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080104453A1 (en) * 2004-07-06 2008-05-01 Udayan Mukherjee System and Method to Detect Errors and Predict Potential Failures
CN102193855A (en) * 2010-03-12 2011-09-21 鸿富锦精密工业(深圳)有限公司 Abnormity alarm circuit for fan
CN103186207A (en) * 2011-12-28 2013-07-03 鸿富锦精密工业(深圳)有限公司 Radiation system and control method applied to same
CN102419625A (en) * 2011-12-31 2012-04-18 曙光信息产业股份有限公司 Heat dissipation system and fan controlling device
CN104763665A (en) * 2015-03-04 2015-07-08 杭州华三通信技术有限公司 Fan fault detection method and device of network device

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107023504A (en) * 2017-06-02 2017-08-08 郑州云海信息技术有限公司 A kind of fan control system and control method based on BMC
CN108459942A (en) * 2018-03-15 2018-08-28 联想(北京)有限公司 A kind of data processing method, device and storage medium
CN111367251A (en) * 2018-12-26 2020-07-03 技嘉科技股份有限公司 Method and system for testing fan control signal of mainboard
CN109763992A (en) * 2019-03-29 2019-05-17 新华三技术有限公司 The service life method for early warning and device of equipment fan
CN109763992B (en) * 2019-03-29 2020-12-29 新华三技术有限公司 Service life early warning method and device for equipment fan
CN110630552A (en) * 2019-09-21 2019-12-31 苏州浪潮智能科技有限公司 System, method and device for detecting fan link fault
CN111750918A (en) * 2020-05-28 2020-10-09 苏州浪潮智能科技有限公司 Dust inlet monitoring method and system for edge computing server

Similar Documents

Publication Publication Date Title
CN106681886A (en) Method and system for judging server fan damage
US7612508B2 (en) System and method for communication with an information handling system cooling fan
CN105677500A (en) Method for diagnosing fault of server in real time
CN104850485A (en) BMC based method and system for remote diagnosis of server startup failure
CN107612748A (en) A kind of multi node server power consumption management system
CN105045689A (en) Method for using RAID card to perform hard disk batch detection, monitoring and alerting
CN107632907B (en) BMC chip hosting system and control method thereof
CN103780432A (en) Parking lot operation and maintenance method and system, lane controller, server and mobile terminal
CN106933710A (en) The method of testing that DC is restarted is carried out to server based on WOL functions
CN103607314A (en) System for monitoring and managing server by using SNMP (Simple Network Management Protocol)
CN103577298A (en) Baseboard management controller monitoring system and method
CN108280016A (en) A kind of fan detection method, device, equipment and computer readable storage medium
CN109766694A (en) Program protocol white list linkage method and device of industrial control host
CN103580941B (en) Network watchdog and its implementation
CN104699589A (en) Fan error detection system and method
CN108983922A (en) Working frequency adjusting method, device and server
CN102915265A (en) BMC (baseboard management controller) loop test method and system
CN104780062A (en) Method for quickly acquiring IP address of BMC management network interface
CN107026759A (en) The firmware and its development approach of a kind of remote management BBU modules based on BMC
CN110134546B (en) Batch restarting windows system method, electronic device and storage medium
JP6583942B1 (en) BMC, determination method and BMC firmware
CN104714866A (en) Fan testing system and method
CN109960638A (en) BMC starts reason recording method, system, device and readable storage medium storing program for executing
CN104699585A (en) Server early warning value setting system and method and server
CN102882698B (en) virtual machine management method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170517

RJ01 Rejection of invention patent application after publication