WO2016202084A1 - 单板过温处理方法及装置 - Google Patents

单板过温处理方法及装置 Download PDF

Info

Publication number
WO2016202084A1
WO2016202084A1 PCT/CN2016/079666 CN2016079666W WO2016202084A1 WO 2016202084 A1 WO2016202084 A1 WO 2016202084A1 CN 2016079666 W CN2016079666 W CN 2016079666W WO 2016202084 A1 WO2016202084 A1 WO 2016202084A1
Authority
WO
WIPO (PCT)
Prior art keywords
board
temperature
over
module
temperature protection
Prior art date
Application number
PCT/CN2016/079666
Other languages
English (en)
French (fr)
Inventor
李可
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2016202084A1 publication Critical patent/WO2016202084A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D23/00Control of temperature
    • G05D23/19Control of temperature characterised by the use of electric means
    • G05D23/30Automatic controllers with an auxiliary heating device affecting the sensing element, e.g. for anticipating change of temperature

Definitions

  • the present invention relates to the field of communications, and in particular to a method and device for processing a single board over temperature.
  • the fan cooling technology is basically adopted.
  • the fan on the device fails, or the heat-dissipating air inlet and the air outlet are blocked by foreign objects, the fan cannot effectively remove the heat generated by the board, thereby increasing the chip on the board and being burnt due to excessive temperature. risks of.
  • the main control board periodically detects the temperature of the main control board, the temperature of each peripheral board, and the field-programmable gate array (FPGA) junction temperature on each board.
  • FPGA field-programmable gate array
  • the main control board controls the board to be powered off.
  • the present invention provides a method and a device for over-temperature processing of a single board, so as to solve at least the possible problems in the related art that only the temperature of the fixed chip on the single board is detected, causing other chips to burn out.
  • a method for processing a single board over temperature includes: obtaining temperature detection on a single board a temperature detected by the point, wherein the board is provided with two or more temperature detecting points; determining whether the board has a first temperature detecting point, wherein the temperature detected by the first temperature detecting point exceeds Determining a power-down threshold temperature corresponding to the first temperature detection point; if the determination result is YES, processing the board according to an over-temperature protection action corresponding to the board, wherein the over-temperature protection action This includes: board reset and / or board power off.
  • the method further includes: determining whether the temperature detected by the temperature detection point on the board is less than the temperature corresponding to the temperature monitoring point. The electrical threshold temperature; if the determination result is yes, the board is controlled.
  • the method further includes: determining whether the number of times of resetting the board exceeds a predetermined threshold within a predetermined time; and if the determination result is yes, controlling The board is powered off.
  • the method further includes: marking the board as a non-re-powerable board.
  • the method further includes: when the determination result is yes, improving the working efficiency of the heat dissipation device for dissipating the board.
  • a single-board over-temperature processing apparatus including: an acquisition module configured to acquire a temperature detected by a temperature detection point on a board, wherein the board is provided with two The first temperature detecting point is configured to determine whether the first temperature detecting point exists in the board, wherein the temperature detected by the first temperature detecting point exceeds the power lowering of the first temperature detecting point a threshold temperature; the processing module is configured to process the board according to an over-temperature protection action corresponding to the board, wherein the over-temperature is performed when the determination result of the first determining module is YES Protection actions include: board reset and / or board power off.
  • the device further includes: a second determining module, configured to determine whether the temperature detected by the temperature detecting point on the board is less than a power-on threshold temperature corresponding to the temperature monitoring point; The module is configured to control the power on the board when the determination result of the second determining module is yes.
  • the device further includes: a third determining module, configured to determine whether the number of times the board is reset exceeds a predetermined threshold in a predetermined time; and the second control module is configured to determine at the third determining module If the result is YES, the board is controlled to be powered off.
  • a third determining module configured to determine whether the number of times the board is reset exceeds a predetermined threshold in a predetermined time
  • the device further includes: a marking module, configured to mark the board as a non-re-powerable single board.
  • the device further includes: an improvement module, configured to improve the working efficiency of the heat dissipation device for dissipating heat from the board when the determination result of the first determination module is YES.
  • an improvement module configured to improve the working efficiency of the heat dissipation device for dissipating heat from the board when the determination result of the first determination module is YES.
  • Another embodiment of the present invention provides a computer storage medium storing execution instructions for performing the method in the above embodiments.
  • the temperature detected by the temperature detection point on the board is obtained, wherein the board is provided with two or more temperature detection points; determining whether the board has the first temperature detection point, wherein The temperature detected by the first temperature detection point exceeds the power-down threshold temperature corresponding to the first temperature detection point; if the determination result is YES, the over-temperature protection action corresponding to the board is The board performs processing, wherein the over-temperature protection action includes: a single board Resetting and/or powering off the board solves the problem that the related art can only detect the temperature of the fixed chip on the board and cause other chips to burn out, thereby achieving detection of multiple chips on the board and avoiding chips. The problem of burning the problem.
  • FIG. 1 is a flowchart of a method for processing a single board over temperature according to an embodiment of the present invention
  • FIG. 2 is a block diagram showing the structure of a single board over-temperature processing apparatus according to an embodiment of the present invention
  • FIG. 3 is a block diagram 1 of a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention
  • FIG. 4 is a block diagram 2 of a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention
  • FIG. 5 is a block diagram 3 of a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention
  • FIG. 6 is a block diagram 4 of a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention
  • FIG. 7 is a flowchart of a method for over-temperature protection of a chassis of a chassis according to an embodiment of the invention.
  • FIG. 1 is a flowchart of a method for processing an over-temperature of a single board according to an embodiment of the present invention. As shown in FIG. 1 , the flow includes the following steps:
  • Step S102 Acquire a temperature detected by a temperature detection point on the board, where two or more temperature detection points are disposed on the board;
  • Step S104 determining whether the first temperature detection point exists in the board, wherein the temperature detected by the first temperature detection point exceeds a power-down threshold temperature corresponding to the first temperature detection point;
  • step S106 if the determination result is yes, the board is processed according to the over-temperature protection action corresponding to the board, wherein the over-temperature protection action includes: board resetting and/or board power-off.
  • the temperature detection points may be respectively set on different chips of the single board. When there are multiple chips on the board, temperature detection points may be respectively set on each chip, so that the temperature of each chip can be monitored at all times. After the temperature detection point of the board is detected, the over-temperature protection action corresponding to the board is performed on the board. The over-temperature protection action corresponding to each board may be based on the board. The performance is determined. If the temperature of the board is reduced to the normal range after the board is no longer loaded with the traffic, the board can be reset once, and the temperature protection can be achieved. Compared with the operations in the related art that can only perform power-off of the board, the time spent by the board can be effectively reduced, and the performance of the communication system is also improved to some extent. Through the above steps, the possible problems in the related art that can only detect the temperature of the fixed chip on the single board and cause the other chips to burn out can be effectively solved, thereby achieving detection of multiple chips on the board and avoiding the problem of chip burning. .
  • the method further includes: determining whether the temperature detected by the temperature detection point on the board is less than the temperature corresponding to the temperature monitoring point. The electric threshold temperature; if the judgment result is yes, the board is controlled.
  • determining whether the board can be powered on again it is determined according to the temperature of the board.
  • the related technology determines whether the board can be powered on again. It is determined according to the time when the board is powered off. That is, when the board is powered off for a certain period of time, the board is allowed to be powered on again, but the related art is used. In this case, the temperature of the board that is powered off is still high after a certain period of time.
  • the board to be powered can cause the chip on the board to burn out. Therefore, it can be seen that the judgment condition of the duration as the board is powered on again is unreasonable.
  • the time for the temperature of different boards to return to normal after power-off is different. It is not possible to judge whether the board can be powered on by time. In this embodiment, the problem can be solved well.
  • the board power can be controlled according to the problem of the board to ensure that the temperature of the board is low enough, and the chips on the board are not burned.
  • the method further includes: determining whether the number of times of resetting the board exceeds a predetermined threshold within a predetermined time; if the judgment result is yes, Control the board to power off. That is, if the number of times that the board is continuously reset in a short period of time is too large, it indicates that some boards on the board or the board are aging or other faults occur. In order to avoid larger problems, the board should be stopped. The board can adopt the solution in this embodiment to effectively avoid the problem that the faulty board continues to work. This ensures the safety of the system.
  • the board is powered off after the number of times of the board resetting exceeds a predetermined threshold in a predetermined time. To ensure the security of the system, the board can be stopped. Therefore, After the board is powered off, you can also mark the board as a non-re-able board.
  • the method further includes: when the determination result is yes, improving the working efficiency of the heat dissipation device for dissipating heat from the board.
  • the common heat dissipation device is a fan. When the temperature of the board is high, the temperature of the board can be accelerated. That is, the speed of the fan can be increased.
  • the method according to the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware, but in many cases, the former is A better implementation.
  • the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
  • the optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
  • a single-plate over-temperature processing device is provided, which is used to implement the above-mentioned embodiments and preferred embodiments, and has not been described again.
  • the term "module” may implement a combination of software and/or hardware of a predetermined function.
  • the apparatus described in the following embodiments is preferably implemented in software, hardware, or soft The implementation of a combination of hardware and hardware is also possible and conceived.
  • FIG. 2 is a block diagram showing the structure of a single board over-temperature processing apparatus according to an embodiment of the present invention. As shown in FIG. 2, the apparatus includes an acquisition module 22, a first determination module 24, and a processing module 26. The apparatus will be described below.
  • the obtaining module 22 is configured to obtain the temperature detected by the temperature detecting point on the board, wherein the board is provided with two or more temperature detecting points; the first determining module 24 is connected to the obtaining module 22, and is set to determine Whether the temperature detected by the first temperature detecting point exceeds the power-off threshold temperature corresponding to the first temperature detecting point; the processing module 26 is connected to the first determining module 24, and is set to If the result of the determination by the first determining module 24 is YES, the board is processed according to the over-temperature protection action corresponding to the board, wherein the over-temperature protection action includes: resetting the board and/or powering off the board. .
  • FIG. 3 is a block diagram of a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention. As shown in FIG. 3, the apparatus includes a second judging module 32 and a first Control module 34, which will be described below.
  • the second determining module 32 is connected to the processing module 26, and is configured to determine whether the temperature detected by the temperature detecting point on the board is less than the powering of the temperature monitoring point after the board is powered off.
  • the threshold temperature; the first control module 34 is connected to the second determining module 32, and is configured to control the power on the board when the determination result of the second determining module 32 is YES.
  • FIG. 4 is a block diagram of a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention. As shown in FIG. 4, the apparatus includes a third judging module 42 and a second, in addition to all the modules shown in FIG. Control module 44, the device will be described below.
  • the third determining module 42 is connected to the processing module 26, and is configured to determine whether the number of times the board is reset exceeds a predetermined threshold within a predetermined time after performing a board resetting process on the board; the second control module 44 is connected to The third determining module 42 is configured to control the board to be powered off when the determination result of the third determining module 42 is YES.
  • FIG. 5 is a block diagram 3 of a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention. As shown in FIG. 5, the apparatus includes a marking module 52 in addition to all the modules shown in FIG. The device is described.
  • the marking module 52 is connected to the second control module 44 and is configured to mark the board as a non-re-powerable board.
  • FIG. 6 is a block diagram showing a preferred structure of a single-board over-temperature processing apparatus according to an embodiment of the present invention. As shown in FIG. 6, the apparatus includes an improvement module 62 in addition to all the modules shown in FIG. The device is described.
  • the improvement module 62 is connected to the first determination module 24, and is configured to improve the working efficiency of the heat dissipation device for dissipating heat from the board when the determination result of the first determination module 24 is YES.
  • the device and method for over-temperature protection of the chassis of the chassis are provided in the embodiment of the present invention.
  • the device controls the single-board to implement different types of temperature protection actions according to the configured performance parameters and the command configuration mode, so as to solve the related technologies.
  • the problem that the temperature of the board is too high due to the failure of the heat dissipation system can easily use the over-temperature protection function of the board and increase the single
  • the accuracy of the board temperature protection protects the board chip from being burned and extends the life of the board.
  • the embodiment of the invention provides a device and a method for over-temperature protection of a chassis of a chassis, and the implementation scheme thereof is:
  • the information about the over-temperature protection threshold of the board is saved in the performance parameter file.
  • the information saved in the performance parameter file is only an example. You can also use other methods to save the board over-temperature protection threshold information. The following is described in the performance parameter file.
  • the system command line configuration interface can be configured with the protection action type of the board over temperature protection.
  • the main control board periodically polls the peripheral board temperature.
  • the main control board When the temperature of the peripheral board exceeds the threshold of the over-temperature protection of the board (corresponding to the power-off threshold temperature), the main control board performs the corresponding over-temperature protection action on the peripheral board according to the type of over-temperature protection action.
  • the device for over-temperature protection of the chassis can also be configured with the over-temperature protection function switch: the user can determine whether to enable the over-temperature protection function through the command line configuration. When the over-temperature protection function is enabled, the system can protect the temperature of the board. When the over-temperature protection function is disabled, the system will not protect the temperature of the board.
  • the protection action type of the over-temperature protection may include: resetting the board and powering down the board.
  • the user determines the type of protection action for the over-temperature protection function by means of the command line configuration. If the protection action type is set to the operation board, the main control board resets the chip of the peripheral board when the temperature of the board exceeds the over-temperature protection threshold. If the protection action type is set to the operation board If the temperature of the board exceeds the threshold of the over-temperature protection of the board, the main board will cut off the power of the board and the board will be powered off.
  • the above-mentioned board over-temperature protection threshold information may be defined in the form of performance parameters.
  • the over-temperature protection action type of the board is set to the operation board reset, if the over-temperature protection function is triggered, if the number of times the board is continuously reset exceeds a certain number of times, the chip of the board is aging or other unpredictable. To ensure the stability of the board and the system, you need to force the board to be powered off. At the same time, mark the board as a non-re-able board.
  • the system When the temperature of any peripheral board exceeds the over-temperature protection threshold, the system will determine the running status of the fan. If the fan speed does not reach full speed, the system will set all fan groups on the system to full-speed operation.
  • the temperature of the board When the temperature of the board is queried, the temperature of the board that has been powered off is also checked. If the temperature of the power-off board is lower than the power-on threshold of the board and the board is not powered off, the main board will perform power-on operation on the board. The overall performance of the system.
  • the embodiment of the present invention further provides a computer readable storage medium, where program instructions are stored, and the board is over temperature
  • the protection function switch and the configuration data of the board over-temperature protection action type can be saved in the readable storage medium.
  • the above two configurations can be loaded again from the readable storage medium, and the above method is implemented.
  • the device and method for over-temperature protection of the chassis of the chassis provided by the embodiment of the present invention can set the temperature protection threshold of the board through the performance parameter, and determine the over-temperature protection function by using the command line configuration.
  • the action type can not only conveniently manage the board information that participates in over-temperature protection, but also select different protection types, which achieves the purpose of convenient and accurate use of the board over-temperature protection function.
  • the board over-temperature protection threshold information may be stored in the file of the performance parameter.
  • the system command line configuration interface configures the protection action type of the board over temperature protection.
  • the main control board periodically polls the peripheral board temperature. When the temperature of the peripheral board exceeds the over-temperature protection threshold of the board, the main control board will reset or power off the peripheral board according to the type of over-temperature protection action.
  • the above-mentioned board over-temperature protection threshold information is defined in the form of performance parameters. Store the over-temperature protection threshold information of the board in the performance parameter file and configure the over-temperature protection information of multiple types of boards in the performance parameter file. Each type of board can be configured with an overtemperature protection threshold for one or more temperature detection points.
  • the over-temperature protection threshold information may include a board type, a temperature monitoring point address, and a temperature monitoring point over-temperature protection upper limit value.
  • the following situations may be met:
  • the system does not determine whether the temperature of the board is over-temperature, and the temperature protection function of the board is not triggered.
  • the system searches for the temperature point information of the board type according to the configured board type in the temperature information of all the boards. And compare whether the actual temperature of the temperature measurement point exceeds the over temperature protection threshold at that point.
  • the configuration information of the board in the performance parameter file includes the over-temperature protection threshold of the temperature measurement points.
  • the system searches for the board type according to the board type in the configuration. Temperature point information, and compared with all over-temperature protection thresholds configured by the over-temperature protection points in the performance parameters. If the temperature of any one of the over-temperature protection points exceeds the over-temperature protection threshold at that temperature point, the above will be followed. Method to trigger over temperature protection.
  • the action type of the over-temperature protection function includes: resetting the board and powering off the board.
  • the user can determine the type of protection action for the over-temperature protection function by means of the command line configuration. If the over-temperature protection action type is set to reset the board, the master board resets the chip of the peripheral board when the temperature of the peripheral board exceeds the over-temperature protection threshold. If the protection action is set to the power-off state of the board, the power-off board of the board is powered off and the board is powered off.
  • the reset board can achieve the purpose of temperature protection.
  • the reset operation can protect the board from being burned out and restore the load flow of the board in a short time.
  • Power off operation is to cut off the power The way to protect the board, but if you want to restore the load function of the board, you can only power it up again, and the power-on time is slow, which will have a certain impact on the system load.
  • the power-off operation is to directly cut off the power of the single board, so that the chip can no longer generate heat and achieve the purpose of temperature protection.
  • the power-off operation can also be used in special cases: the temperature of the chip is kept over-temperature, and after the board is reset multiple times, the temperature of the chip cannot be lowered. At this time, it can be considered that the heat sink of the single-board chip has fallen off, or There are other abnormalities. To ensure that the board is not damaged and the stability of the system is considered, the board can be powered off.
  • the method for over-temperature protection of the chassis of the chassis further includes a progressive operation of the temperature protection action:
  • the over-temperature protection action type of the board is set to reset the board, after the over-temperature protection function is triggered, if the number of times the board is continuously reset exceeds a certain number of times, the chip of the board has aging or other unpredictable problems. To ensure the stability of the board and the system, you need to force the board to be powered off. At the same time, the board is marked as a non-powerable board.
  • the method for over-temperature protection of the chassis is as follows:
  • the temperature of the board When the temperature of the board is queried, the temperature of the board that has been powered off is also checked. If the temperature of the power-off board is lower than the power-on threshold of the board and the board is not powered off, the board will perform the power-on operation and the board will run again. Improve the overall performance of the system.
  • the method for over-temperature protection of the chassis of the chassis further includes the synchronous operation of the fan group:
  • the system fan When the temperature of any peripheral board exceeds the over-temperature protection threshold, the system fan does not reach the ideal gear position. At this time, the system will judge the running status of the fan. If the fan speed does not reach full speed, the system will all the system. The fan group is set to run at full speed. It ensures that the fan can provide the maximum heat dissipation capacity to remove the heat generated by the board in time to ensure that the temperature of the board is reduced to the desired range in the shortest time.
  • the method for providing over-temperature protection of the board includes the switch setting of the over-temperature protection function of the board.
  • the user can determine whether to enable the over-temperature protection function through the command line configuration.
  • the over-temperature protection function is set to be on, the system can protect the temperature of the board by the above method.
  • the over-temperature protection function is set to off, the system will not protect the temperature of the board.
  • FIG. 7 is a flowchart of a method for over-temperature protection of a chassis of a chassis according to an embodiment of the present invention. As shown in FIG. 7, the process includes the following steps:
  • Step S01 The command line configures the over temperature protection switch and the protection action type.
  • the user selects whether to enable the over-temperature protection function by means of the command line configuration.
  • the over-temperature protection function is set to on, the system can protect the temperature of the board.
  • the user can also select the type of protection action for the over-temperature protection function by means of command line configuration.
  • the protection action type of the over-temperature protection function includes: resetting the board and powering off the board.
  • the reset board can achieve the purpose of temperature protection.
  • the power-off operation is to directly cut off the power of the single board, so that the chip can no longer generate heat and achieve the purpose of temperature protection.
  • Step S02 Acquire command line configuration parameters.
  • the over-temperature protection switch and the protection action type configured by the user in step S01 are queried and recorded in the local memory for use in the following steps.
  • Step S03 Obtain performance parameters.
  • the performance parameter file stores the over-temperature protection threshold information of the board.
  • each type of board can be configured with one or more temperature detection points.
  • Over temperature protection threshold The system reads the over-temperature protection information in the performance parameter file and records it in the local memory for use in the following steps.
  • Step S04 It is judged whether the over-temperature protection switch is turned on, if not, step S11 is performed, and if it is turned on, step S05 is performed.
  • Step S05 determining whether the temperature of the board is greater than an over-temperature protection threshold.
  • the possibility of judgment can be as follows:
  • step S11 is performed.
  • the configuration information of the board in the performance parameter file only contains the over-temperature protection threshold of the temperature measurement point.
  • the system searches for the board type according to the board type in the configuration.
  • the temperature point information is compared with whether the actual temperature of the temperature measurement point exceeds the over temperature protection threshold of the point. If yes, step S06 is performed, and if not, step S11 is performed.
  • the configuration information of the board in the performance parameter file includes the over-temperature protection threshold of the temperature measurement points.
  • the system searches for the board type according to the board type in the configuration. Temperature point information, and compared with the over-temperature protection threshold of all over-temperature protection points, if the temperature of any one of the over-temperature protection points exceeds the over-temperature protection threshold of the temperature point, step S06 is performed, if all over-temperature If the temperature of the protection point does not exceed the temperature point over-temperature protection threshold, step S11 is performed.
  • Step S06 Determine whether the type of the over-temperature protection action of the board is a reset operation. If it is a reset operation, step S07 is performed, otherwise step S09 is performed.
  • Step S07 Determine whether the number of times of board reset exceeds the maximum limit. If the number of board resets exceeds the maximum limit of the board reset set by the system, go to step S10 and mark the board as a forced power-off board. If the number of board resets does not exceed the maximum board reset set by the system. If it is limited, step S08 is performed.
  • Step S08 Reset the board.
  • the master board resets the chips of the peripheral board. After resetting the board, increase the number of resets of the board by 1.
  • Step S09 Determine whether the type of the over-temperature protection action of the board is a power-off operation. If it is a power-off operation, step S10 is performed, otherwise step S11 is performed.
  • Step S10 Power off the board.
  • the main control board cuts off the power of the peripheral board and powers off the board.
  • Step S11 Query the temperature information of the board.
  • the main control board periodically queries the temperature information of all the boards in the chassis, including the boards that have been powered off.
  • Step S12 Perform power-on processing on the powered-off board. If the temperature of the board is lower than the power-on threshold of the board, and the board is not forced to be powered off, the main board is operated on the board. Step S04 is executed again, and the above steps are cycled.
  • the method for over-temperature protection of the chassis of the chassis further includes the synchronous operation of the fan group:
  • the system fan When the temperature of the peripheral board exceeds the over-temperature protection threshold, the system fan does not reach the ideal gear position. At this time, the operating status of the system fan group will be judged. If the status of the fan group is not full, the system will set the fan group to the full-speed state. This ensures that the fan group can provide the maximum heat dissipation capacity to remove the heat generated by the board in time to ensure that the board chip will be in the shortest time. The temperature drops to the desired range.
  • each of the above modules may be implemented by software or hardware.
  • the foregoing may be implemented by, but not limited to, the foregoing modules are all located in the same processor; or, the modules are located in multiple In the processor.
  • Embodiments of the present invention also provide a storage medium.
  • the foregoing storage medium may be configured to store program code for performing the following steps:
  • the temperature detected by the temperature detection point on the board is obtained, where two or more temperature detection points are disposed on the board;
  • the board is processed according to the over-temperature protection action corresponding to the board, wherein the over-temperature protection action includes: board resetting and/or board power-off.
  • the foregoing storage medium may include, but is not limited to, a USB flash drive, a Read-Only Memory (ROM), and a Random Access Memory (RAM).
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • modules or steps of the present invention described above can be implemented by a general-purpose computing device that can be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein.
  • the steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated as a single integrated circuit module.
  • the invention is not limited to any specific combination of hardware and software.
  • the single-board over-temperature processing method and apparatus provided by the embodiments of the present invention have the following beneficial effects: solving the possible problems in the related art that only the temperature of the fixed chip on the single board is detected, causing other chips to burn out. In turn, multiple chips on the board are detected to avoid the problem of chip burning.

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Measuring Temperature Or Quantity Of Heat (AREA)
  • Cooling Or The Like Of Electrical Apparatus (AREA)

Abstract

一种单板过温处理方法及装置,该方法包括:获取单板上的温度检测点检测到的温度,其中,单板上设置有两个以上温度检测点(S102);判断单板是否存在第一温度检测点,其中,第一温度检测点检测到的温度超过第一温度检测点对应的下电阈值温度(S104);在判断结果为是的情况下,按照与单板对应的过温保护动作对单板进行处理,其中,过温保护动作包括:单板复位和/或单板下电(S106)。解决了现有技术中存在的只能检测单板上固定芯片的温度导致其他芯片有烧毁可能的问题。

Description

单板过温处理方法及装置 技术领域
本发明涉及通信领域,具体而言,涉及一种单板过温处理方法及装置。
背景技术
在通信领域不断快速发展的背景下,承载通信能力的大型通信设备也随之更新换代。通信能力越大,对通信设备的硬件要求就越高,同时,设备中单板产生的热量就越多。因此,如何有效地散热就成了通信设备运行稳定的关键因素之一。
在一般的大型通信设备系统中,为了更好地解决对设备单板的散热问题,基本上都会采用风扇制冷的技术。但是,当设备上的风扇发生故障,或者散热的进风和出风口有异物阻挡时,风扇就无法有效地带走单板产生的热量,从而会增加单板上的芯片由于温度过高而被烧毁的风险。
针对以上问题,目前所使用的单板温度保护的方法是:
(1)主控板周期检测主控板温度、各外围单板温度和各单板上的现场可编程门阵列(Field-Programmable Gate Array,简称为FPGA)结温。
(2)当所述主控板温度超过预设下电温度时,主控板控制所有单板下电。
(3)当一个单板的温度或者单板上的FPGA结温超过预设下电温度时,主控板控制该单板下电。
(4)在满足下电时长的条件下,将单板重新上电。
以上单板温度保护的方法主要存在以下缺陷:
只能检测单板上固定芯片的温度,而对于大型的通信单板来说,会存在多个测温点,每个测温点都会有相应的温度越限门限值。采用相关技术中的方案会有可能导致其他芯片烧毁。温度保护的动作类型只是单纯地将单板下电。
针对相关技术中存在的只能检测单板上固定芯片的温度导致其他芯片有烧毁的可能的问题,目前尚未提出有效的解决方案。
发明内容
本发明提供了一种单板过温处理方法及装置,以至少解决相关技术中存在的只能检测单板上固定芯片的温度导致其他芯片有烧毁的可能的问题。
根据本发明的一个方面,提供了一种单板过温处理方法,包括:获取单板上的温度检测 点检测到的温度,其中,所述单板上设置有两个以上温度检测点;判断所述单板是否存在第一温度检测点,其中,所述第一温度检测点检测到的温度超过所述第一温度检测点对应的下电阈值温度;在判断结果为是的情况下,按照与所述单板对应的过温保护动作对所述单板进行处理,其中,所述过温保护动作包括:单板复位和/或单板下电。
可选地,当采用单板下电的方式对所述单板进行处理后,还包括:判断所述单板上的温度检测点检测到的温度是否均小于与所述温度监测点对应的上电阈值温度;在判断结果为是的情况下,控制所述单板上电。
可选地,当采用单板复位的方式对所述单板进行处理后,还包括:判断预定时间内,所述单板复位的次数是否超过预定阈值;在判断结果为是的情况下,控制所述单板下电。
可选地,在控制所述单板下电之后,还包括:将所述单板标记为不可再上电单板。
可选地,在判断所述单板是否存在第一温度检测点之后,还包括:当判断结果为是的情况下,提高用于对所述单板进行散热的散热设备的工作效率。
根据本发明的另一方面,提供了一种单板过温处理装置,包括:获取模块,设置为获取单板上的温度检测点检测到的温度,其中,所述单板上设置有两个以上温度检测点;第一判断模块,设置为判断所述单板是否存在第一温度检测点,其中,所述第一温度检测点检测到的温度超过所述第一温度检测点对应的下电阈值温度;处理模块,设置为在所述第一判断模块的判断结果为是的情况下,按照与所述单板对应的过温保护动作对所述单板进行处理,其中,所述过温保护动作包括:单板复位和/或单板下电。
可选地,所述装置还包括:第二判断模块,设置为判断所述单板上的温度检测点检测到的温度是否均小于与所述温度监测点对应的上电阈值温度;第一控制模块,设置为在所述第二判断模块的判断结果为是的情况下,控制所述单板上电。
可选地,所述装置还包括:第三判断模块,设置为判断预定时间内,所述单板复位的次数是否超过预定阈值;第二控制模块,设置为在所述第三判断模块的判断结果为是的情况下,控制所述单板下电。
可选地,所述装置还包括:标记模块,设置为将所述单板标记为不可再上电单板。
可选地,所述装置还包括:提高模块,设置为在所述第一判断模块的判断结果为是的情况下,提高用于对所述单板进行散热的散热设备的工作效率。
本发明另一实施例提供了一种计算机存储介质,所述计算机存储介质存储有执行指令,所述执行指令用于执行上述实施例中的方法。
通过本发明,采用获取单板上的温度检测点检测到的温度,其中,所述单板上设置有两个以上温度检测点;判断所述单板是否存在第一温度检测点,其中,所述第一温度检测点检测到的温度超过所述第一温度检测点对应的下电阈值温度;在判断结果为是的情况下,按照与所述单板对应的过温保护动作对所述单板进行处理,其中,所述过温保护动作包括:单板 复位和/或单板下电,解决了相关技术中存在的只能检测单板上固定芯片的温度导致其他芯片有烧毁的可能的问题,进而达到了检测单板上多个芯片,避免出现芯片烧毁的问题效果。
附图说明
此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:
图1是根据本发明实施例的单板过温处理方法的流程图;
图2是根据本发明实施例的单板过温处理装置的结构框图;
图3是根据本发明实施例的单板过温处理装置的优选结构框图一;
图4是根据本发明实施例的单板过温处理装置的优选结构框图二;
图5是根据本发明实施例的单板过温处理装置的优选结构框图三;
图6是根据本发明实施例的单板过温处理装置的优选结构框图四;
图7是根据本发明实施例的机框单板过温保护的方法流程图。
具体实施方式
下文中将参考附图并结合实施例来详细说明本发明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。
在本实施例中提供了一种单板过温处理方法,图1是根据本发明实施例的单板过温处理方法的流程图,如图1所示,该流程包括如下步骤:
步骤S102,获取单板上的温度检测点检测到的温度,其中,该单板上设置有两个以上温度检测点;
步骤S104,判断单板是否存在第一温度检测点,其中,该第一温度检测点检测到的温度超过第一温度检测点对应的下电阈值温度;
步骤S106,在判断结果为是的情况下,按照与单板对应的过温保护动作对单板进行处理,其中,该过温保护动作包括:单板复位和/或单板下电。
其中,上述的温度检测点可以分别设置在单板的不同芯片上,当单板上存在多个芯片时,可以分别在每个芯片上均设置温度检测点,从而可以时刻监视各个芯片的温度。通过上述步骤,当单板上的任意一个温度检测点检测到超过对应温度检测点的温度后,便会对该单板执行与该单板对应的过温保护动作。其中,每个单板对应的过温保护动作都可以是根据该单板 的性能确定的,如果单板不再负载流量后,温度会很快降低到正常范围的话,可以将单板做一次复位操作,也就可以达到温度保护的目的。相对于相关技术中存在的只能执行单板下电的操作,能够有效的降低耗费的时间,同时也会在一定程度上提高通信系统的性能。通过上述步骤,可以有效解决相关技术中存在的只能检测单板上固定芯片的温度导致其他芯片有烧毁的可能的问题,进而达到了检测单板上多个芯片,避免出现芯片烧毁的问题效果。
在一个可选的实施例中,当采用单板下电的方式对单板进行处理后,还包括:判断该单板上的温度检测点检测到的温度是否均小于与温度监测点对应的上电阈值温度;在判断结果为是的情况下,控制该单板上电。在该实施例中,在判断单板是否可以重新上电时,是根据单板的温度确定的。而相关技术中在判断单板是否可以重新上电时,是根据单板下电的时间确定的,即,单板下电超过一定的时间就允许单板重新上电,但是采用相关技术中的该种方法,会导致下电的单板在超过一定的时间后,温度仍然很高,在该种情况下,允许单板上电的话会导致单板上的芯片烧毁的问题。由此可知,时长作为单板再次上电的判断条件不合理。不同单板的温度在下电后恢复到正常情况的时间是不同的,不能以时间来判断单板是否可以上电。而本实施例可以很好的解决这个问题,根据单板的问题来控制单板上电可以有效的保证单板温度足够低,单板上的芯片不会烧毁的。
在一个可选的实施例中,当采用单板复位的方式对单板进行处理后,还包括:判断预定时间内,该单板复位的次数是否超过预定阈值;在判断结果为是的情况下,控制单板下电。也就是说,若单板短时间内连续被复位的次数过多,说明该单板或者该单板上的一些芯片有老化或者出现了其他的故障,为了避免出现更大的问题,应该停止使用该单板,采用本实施例中的方案可以有效避免故障单板继续工作的问题。从而保证了系统的安全。
在一个可选的实施例中,当在预定时间内,单板复位的次数是否超过预定阈值之后会控制该单板下电,为了保证系统的安全性,可以停止使用该单板,因此,在控制单板下电之后,还可以将该单板标记为不可再上电单板。
在一个可选的实施例中,在判断单板是否存在第一温度检测点之后,还包括:当判断结果为是的情况下,提高用于对单板进行散热的散热设备的工作效率。其中,常见的散热设备是风扇,当单板的温度偏高后,可以加快对该单板的降温,即,可以提高风扇的转速。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到根据上述实施例的方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。
在本实施例中还提供了一种单板过温处理装置,该装置用于实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软 件和硬件的组合的实现也是可能并被构想的。
图2是根据本发明实施例的单板过温处理装置的结构框图,如图2所示,该装置包括获取模块22、第一判断模块24和处理模块26,下面对该装置进行说明。
获取模块22,设置为获取单板上的温度检测点检测到的温度,其中,该单板上设置有两个以上温度检测点;第一判断模块24,连接至上述获取模块22,设置为判断单板是否存在第一温度检测点,其中,该第一温度检测点检测到的温度超过第一温度检测点对应的下电阈值温度;处理模块26,连接至上述第一判断模块24,设置为在第一判断模块24的判断结果为是的情况下,按照与单板对应的过温保护动作对单板进行处理,其中,该过温保护动作包括:单板复位和/或单板下电。
图3是根据本发明实施例的单板过温处理装置的优选结构框图一,如图3所示,该装置除包括图2所示的所有模块外,还包括第二判断模块32和第一控制模块34,下面对该装置进行说明。
第二判断模块32,连接至上述处理模块26,设置为在对单板进行单板下电处理后,判断单板上的温度检测点检测到的温度是否均小于与温度监测点对应的上电阈值温度;第一控制模块34,连接至上述第二判断模块32,设置为在第二判断模块32的判断结果为是的情况下,控制单板上电。
图4是根据本发明实施例的单板过温处理装置的优选结构框图二,如图4所示,该装置除包括图2所示的所有模块外,还包括第三判断模块42和第二控制模块44,下面对该装置进行说明。
第三判断模块42,连接至上述处理模块26,设置为在对单板进行单板复位处理后,判断预定时间内,该单板复位的次数是否超过预定阈值;第二控制模块44,连接至上述第三判断模块42,设置为在第三判断模块42的判断结果为是的情况下,控制单板下电。
图5是根据本发明实施例的单板过温处理装置的优选结构框图三,如图5所示,该装置除包括图4所示的所有模块外,还包括标记模块52,下面对该装置进行说明。
标记模块52,连接至上述第二控制模块44,设置为将单板标记为不可再上电单板。
图6是根据本发明实施例的单板过温处理装置的优选结构框图四,如图6所示,该装置除包括图2所示的所有模块外,还包括提高模块62,下面对该装置进行说明。
提高模块62,连接至上述第一判断模块24,设置为在第一判断模块24的判断结果为是的情况下,提高用于对单板进行散热的散热设备的工作效率。
下面对机框单板过温保护进行说明。
本发明实施例中还提供一种机框单板过温保护的装置、方法,其根据配置的性能参数和命令配置模式来控制单板实现不同类型的温度保护动作,以解决相关技术中存在的由于散热系统失效导致的单板温度过高的问题,既可以方便地使用单板过温保护功能,又可以增加单 板温度保护的准确性,保护单板芯片不被烧毁,延长单板使用寿命。
为了解决上述问题,本发明实施例提供了一种机框单板过温保护的装置、方法,其实现方案是:
单板过温保护阈值信息保存在性能参数文件中,当然,保存在性能参数文件中仅是一种示例,也可以利用其他的方式保存单板过温保护阈值信息。下面以保存在性能参数文件进行说明。
系统命令行配置界面可以配置单板过温保护的保护动作类型。
主控单板定时循环查询外围单板温度。
当外围单板温度超过单板过温保护阈值(对应于上述的下电阈值温度)时,主控单板将按照过温保护动作类型对外围单板执行相应的过温保护动作。
其中,上述机框单板过温保护的装置中还可以设置单板过温保护功能开关:用户通过命令行配置的方式,确定是否开启过温保护功能。当过温保护功能打开后,系统可以实现对单板的温度保护;当过温保护功能关闭时,系统将不再对单板进行温度保护。
上述的过温保护的保护动作类型可以包括:操作单板复位和操作单板下电。
用户通过命令行配置的方式,确定过温保护功能的保护动作类型。若保护动作类型被设置为操作单板复位,则当单板温度超过单板过温保护阈值时,主控单板将外围单板的芯片进行复位;若保护动作类型被设置为操作单板下电,则当单板温度超过单板过温保护阈值时,主控单板切断外围单板的电源,单板下电。
可选地,上述的单板过温保护阈值信息可以是以性能参数的形式来定义的。
在性能参数的文件中存放单板过温保护阈值信息,在该性能参数文件中可配置多个类型单板的过温保护信息。其中,每个类型的单板可以配置一个或者多个温度检测点的过温保护阈值。
当上述单板过温保护动作类型被设置为操作单板复位时,过温保护功能触发后,若单板连续被复位的次数超过一定次数,说明该单板的芯片有老化或者其他不可预知的问题,为了保障单板和系统的稳定,则需将该单板做强制下电操作,同时,将单板标记为不可再上电单板。
当任何外围单板的温度超过过温保护阈值后,系统将会判断风扇的运行状态,若风扇转速没有达到全速,则系统会将系统上的所有风扇组设置为全速运行状态。
主控单板在定时循环查询单板温度时,也会查询已下电的单板的温度。当已下电单板的温度低于单板的上电阈值温度,并且不是不可上电单板,主控单板会将已下电单板执行上电操作,使单板再次运行,提高了系统的整体性能。
可选地,本发明实施例中还提供一种计算机可读存储介质,存储有程序指令,单板过温 保护功能开关和单板过温保护动作类型的配置数据,可以保存在可读存储介质中,当系统重启后可以再次从可读存储介质中加载以上两项配置,并实现以上的方法。
由此可知,本发明实施例中提供的机框单板过温保护的装置、方法可以通过性能参数的形式来设置单板的温度保护阈值,并通过命令行配置的方式来确定过温保护功能的动作类型,既能方便的管理参与过温保护的单板信息,又可以选择不同的保护类型,达到了方便、精确地使用单板过温保护功能的目的。
在本发明实施例中,可以在性能参数的文件中存放单板过温保护阈值信息。系统命令行配置界面配置单板过温保护的保护动作类型。主控单板定时循环查询外围单板温度。当外围单板温度超过单板过温保护阈值时,主控单板将按照过温保护动作类型将外围单板进行复位操作或者下电操作。
其中,上述的单板过温保护阈值信息是以性能参数的形式来定义的。在性能参数的文件中存放单板过温保护阈值信息,在性能参数文件中配置多个类型单板的过温保护信息。每个类型的单板可以配置一个或者多个温度检测点的过温保护阈值。在的过温保护阈值信息中可以包括单板类型、温度监测点地址和温度监测点过温保护上限值。
可选地,确定外围单板温度是否超过单板过温保护阈值,可以有以下的情况:
(1)当性能参数文件中的配置,没有该单板的配置信息,则系统不会判断该单板的温度是否有过温的情况,进而也不会触发对该单板的温度保护功能。
(2)性能参数文件中只包含单板的一个测温点的过温保护阈值,则系统会在所有单板的温度点信息中,按照配置的单板类型查找该单板类型的温度点信息,并比较该测温点的实际温度是否超过该点的过温保护阈值。
(3)性能参数文件中该单板的配置信息包含多个测温点的过温保护阈值,系统会在所有单板的温度点信息中,按照配置中的单板类型查找该单板类型的温度点信息,并与所有由性能参数中配置的过温保护点的过温保护阈值比较,如果其中任何一个过温保护点的温度超过了该温度点的过温保护阈值,则会按照以上的方法来触发过温保护功能。
其中,上述的过温保护功能的动作类型包括:操作单板复位和操作单板下电。
用户通过命令行配置的方式,可以确定过温保护功能的保护动作类型。若过温保护动作类型被设置为操作单板复位,则当外围单板温度超过单板过温保护阈值时,主控单板将外围单板的芯片进行复位。若保护动作类型被设置为操作单板下电,则当单板温度超过单板过温保护阈值时,主控单板将外围单板的电源切断,单板下电。
上述的两种过温保护功能的保护动作类型,主要有以下区别:
某些单板在大的通信流量的情况下,芯片的温度比较高,但是如果芯片不再负载流量,那么芯片的温度会很快降低到正常范围,因此复位单板可以达到温度保护的目的。复位操作可以保护单板不被烧坏,而且短时间内可以恢复单板的负载流量。下电操作则是以切断电源 的方式来保护单板的,但是,如果想恢复单板的负载功能,只能将其再次上电,而且,上电时间比较慢,对系统的负载会产生一定的影响。
下电操作是直接切断单板电源,这样芯片就无法再产生热量,达到温度保护的目的。
下电操作还可以在特殊的情况下使用:芯片温度一直保持超温的状态,并且将单板复位多次后,仍然无法降低芯片温度,此时,可以认为单板芯片散热器有脱落,或者存在其他的异常情况,为保证单板不被损坏,并且考虑系统的稳定性,可以将该单板执行下电操作。
用户可以根据单板或者系统的实际情况选择不同的保护动作类型。
可选地,上述的机框单板过温保护方法还包括温度保护动作的递进操作:
当单板过温保护动作类型被设置为操作单板复位时,过温保护功能触发后,若单板连续被复位的次数超过一定次数,说明该单板的芯片有老化或者其他不可预知的问题,为了保障单板和系统的稳定,则需将该单板做强制下电操作,同时,将单板标记为不可上电单板,不允许再上电已强制下电的单板。
可选地,上述的机框单板过温保护的方法还包括对下电单板的上电处理:
主控单板在定时循环查询单板温度时,也会查询已下电的单板的温度。当已下电单板的温度低于单板的上电阈值温度,并且不是被强制下电的单板,主控单板会将已下电单板执行上电操作,使单板再次运行,提高了系统的整体性能。
可选地,上述的机框单板过温保护方法还包括风扇组的同步操作:
当任何外围单板的温度超过过温保护阈值时,说明系统风扇没有达到理想的档位,这时系统将会判断风扇的运行状态,若风扇转速没有达到全速,则系统会将系统上的所有风扇组设置为全速运行状态,已确保风扇能提供最大的散热能力,来及时地带走单板产生的热量,保障单板芯片在最短时间内将温度降到理想范围。
可选地,在上述的机框单板过温保护方法中还包括提供了单板过温保护功能的开关设置。用户通过命令行配置的方式,确定是否开启过温保护功能。当过温保护功能被设置为打开时,系统可以通过以上方法对单板的温度进行保护;而当过温保护功能被设置为关闭时,系统将不再对单板进行温度保护。
图7是根据本发明实施例的机框单板过温保护的方法流程图,如图7所示,该流程包括如下步骤:
步骤S01:命令行配置过温保护开关和保护动作类型。
用户通过命令行配置的方式,选择是否开启过温保护功能。当过温保护功能被设置为打开时,系统才可以对单板进行温度保护。
用户还可以通过命令行配置的方式,选择过温保护功能的保护动作类型。过温保护功能的保护动作类型包括:操作单板复位和操作单板下电。
某些单板在大的通信流量的情况下,芯片的温度比较高,但是如果芯片不再负载流量,那么芯片的温度会很快降低到正常范围,因此复位单板可以达到温度保护的目的。
下电操作是直接切断单板电源,这样芯片就无法再产生热量,达到温度保护的目的。
步骤S02:获取命令行配置参数。查询步骤S01中用户配置的过温保护开关和保护动作类型,并记录到本地内存,用以在以下步骤中使用。
步骤S03:获取性能参数。性能参数的文件中存放单板过温保护阈值信息,在性能参数文件中可配置多个类型单板的过温保护信息,而且,每个类型的单板可以配置一个或者多个温度检测点的过温保护阈值。系统将性能参数文件中的过温保护信息读取后记录到本地内存,用以在以下步骤中使用。
步骤S04:判断过温保护开关是否开启,如果未开启则执行步骤S11,如果开启则执行步骤S05。
步骤S05:判断单板温度是否大于过温保护阈值。判断的可能性可以有以下的情况:
(1)当性能参数文件中的配置,没有该单板的配置信息,则系统不会判断该单板的温度是否有过温的情况,则执行步骤S11。
(2)性能参数文件中该单板的配置信息只包含一个测温点的过温保护阈值,系统会在所有单板的温度点信息中,按照配置中的单板类型查找该单板类型的温度点信息,并比较该测温点的实际温度是否超过该点的过温保护阈值,如果超过则执行步骤S06,如果没有超过则执行步骤S11。
(3)性能参数文件中该单板的配置信息包含多个测温点的过温保护阈值,系统会在所有单板的温度点信息中,按照配置中的单板类型查找该单板类型的温度点信息,并与所有的过温保护点的过温保护阈值比较,如果其中任何一个过温保护点的温度超过了该温度点的过温保护阈值,则执行步骤S06,如果所有的过温保护点的温度都没有超过这些温度点过温保护阈值,则执行步骤S11。
步骤S06:判断单板过温保护动作类型是否是复位操作。如果是复位操作,则执行步骤S07,否则执行步骤S09。
步骤S07:判断单板复位次数是否超过最大限制。如果单板复位次数已经超过了系统设置的单板复位最大限制,则执行步骤S10,同时,将该单板标记为强制下电单板;如果单板复位次数没有超过系统设置的单板复位最大限制,则执行步骤S08。
步骤S08:复位单板。主控单板将外围单板的芯片进行复位。在复位单板后,将该单板的复位次数加1。
步骤S09:判断单板过温保护动作类型是否是下电操作。如果是下电操作,则执行步骤S10,否则执行步骤S11。
步骤S10:下电单板。主控单板将外围单板的电源切断,单板下电。
步骤S11:查询单板温度信息。主控板定时查询机框上所有单板的温度信息,包括已下电的单板。
步骤S12:对已下电单板做上电处理。如果单板温度低于单板的上电阈值温度,并且单板不是的被强制下电的单板,主控单板操作单板上电。再次执行步骤S04,循环以上的步骤。
可选地,上述的机框单板过温保护方法还包括风扇组的同步操作:
外围单板的温度超过过温保护阈值时,说明系统风扇没有达到理想的档位,这时将会判断系统风扇组的运行状态。如果风扇组的状态不是全速,则系统会将风扇组设置为全速运行状态,已确保风扇组能提供最大的散热能力,来及时地带走单板产生的热量,保障单板芯片在最短时间内将温度降到理想范围。
需要说明的是,上述各个模块是可以通过软件或硬件来实现的,对于后者,可以通过以下方式实现,但不限于此:上述模块均位于同一处理器中;或者,上述模块分别位于多个处理器中。
本发明的实施例还提供了一种存储介质。可选地,在本实施例中,上述存储介质可以被设置为存储用于执行以下步骤的程序代码:
S1,获取单板上的温度检测点检测到的温度,其中,该单板上设置有两个以上温度检测点;
S2,判断单板是否存在第一温度检测点,其中,该第一温度检测点检测到的温度超过第一温度检测点对应的下电阈值温度;
S3,在判断结果为是的情况下,按照与单板对应的过温保护动作对单板进行处理,其中,该过温保护动作包括:单板复位和/或单板下电。
可选地,在本实施例中,上述存储介质可以包括但不限于:U盘、只读存储器(Read-Only Memory,简称为ROM)、随机存取存储器(Random Access Memory,简称为RAM)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。
可选地,本实施例中的具体示例可以参考上述实施例及可选实施方式中所描述的示例,本实施例在此不再赘述。
显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。
以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。
工业实用性
如上所述,本发明实施例提供的一种单板过温处理方法及装置具有以下有益效果:解决相关技术中存在的只能检测单板上固定芯片的温度导致其他芯片有烧毁的可能的问题,进而达到了检测单板上多个芯片,避免出现芯片烧毁的问题效果。

Claims (10)

  1. 一种单板过温处理方法,包括:
    获取单板上的温度检测点检测到的温度,其中,所述单板上设置有两个以上温度检测点;
    判断所述单板是否存在第一温度检测点,其中,所述第一温度检测点检测到的温度超过所述第一温度检测点对应的下电阈值温度;
    在判断结果为是的情况下,按照与所述单板对应的过温保护动作对所述单板进行处理,其中,所述过温保护动作包括:单板复位和/或单板下电。
  2. 根据权利要求1所述的方法,其中,当采用单板下电的方式对所述单板进行处理后,还包括:
    判断所述单板上的温度检测点检测到的温度是否均小于与所述温度监测点对应的上电阈值温度;
    在判断结果为是的情况下,控制所述单板上电。
  3. 根据权利要求1所述的方法,其中,当采用单板复位的方式对所述单板进行处理后,还包括:
    判断预定时间内,所述单板复位的次数是否超过预定阈值;
    在判断结果为是的情况下,控制所述单板下电。
  4. 根据权利要求3所述的方法,其中,在控制所述单板下电之后,还包括:
    将所述单板标记为不可再上电单板。
  5. 根据权利要求1所述的方法,其中,在判断所述单板是否存在第一温度检测点之后,还包括:
    当判断结果为是的情况下,提高用于对所述单板进行散热的散热设备的工作效率。
  6. 一种单板过温处理装置,包括:
    获取模块,设置为获取单板上的温度检测点检测到的温度,其中,所述单板上设置有两个以上温度检测点;
    第一判断模块,设置为判断所述单板是否存在第一温度检测点,其中,所述第一温度检测点检测到的温度超过所述第一温度检测点对应的下电阈值温度;
    处理模块,设置为在所述第一判断模块的判断结果为是的情况下,按照与所述单板对应的过温保护动作对所述单板进行处理,其中,所述过温保护动作包括:单板复位和/或单板下电。
  7. 根据权利要求6所述的装置,其中,还包括:
    第二判断模块,设置为判断所述单板上的温度检测点检测到的温度是否均小于与所述温度监测点对应的上电阈值温度;
    第一控制模块,设置为在所述第二判断模块的判断结果为是的情况下,控制所述单板上电。
  8. 根据权利要求6所述的装置,其中,还包括:
    第三判断模块,设置为判断预定时间内,所述单板复位的次数是否超过预定阈值;
    第二控制模块,设置为在所述第三判断模块的判断结果为是的情况下,控制所述单板下电。
  9. 根据权利要求8所述的装置,其中,还包括:
    标记模块,设置为将所述单板标记为不可再上电单板。
  10. 根据权利要求6所述的装置,其中,还包括:
    提高模块,设置为在所述第一判断模块的判断结果为是的情况下,提高用于对所述单板进行散热的散热设备的工作效率。
PCT/CN2016/079666 2015-06-18 2016-04-19 单板过温处理方法及装置 WO2016202084A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510342060.6 2015-06-18
CN201510342060.6A CN106257372A (zh) 2015-06-18 2015-06-18 单板过温处理方法及装置

Publications (1)

Publication Number Publication Date
WO2016202084A1 true WO2016202084A1 (zh) 2016-12-22

Family

ID=57544938

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/079666 WO2016202084A1 (zh) 2015-06-18 2016-04-19 单板过温处理方法及装置

Country Status (2)

Country Link
CN (1) CN106257372A (zh)
WO (1) WO2016202084A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143619A (zh) * 2021-11-25 2022-03-04 新华三技术有限公司成都分公司 一种基站过温保护方法、装置及电子设备

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108627715B (zh) * 2017-03-21 2022-09-02 中兴通讯股份有限公司 单板状态的检测方法及装置
CN109756791A (zh) * 2017-11-07 2019-05-14 中兴通讯股份有限公司 一种网络设备保护方法、保护设备及计算机可读存储介质
CN109348310A (zh) * 2018-10-23 2019-02-15 京信通信系统(中国)有限公司 一种射频拉远单元过温保护的方法及装置
CN114777958B (zh) * 2022-06-20 2022-10-28 深圳比特微电子科技有限公司 芯片散热状况检测方法、装置、电子设备及存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN2659032Y (zh) * 2003-09-29 2004-11-24 光宝科技股份有限公司 无线通信装置的散热结构
CN101086670A (zh) * 2006-06-09 2007-12-12 技嘉科技股份有限公司 电子元件的温度控制方法及其系统
TW200921345A (en) * 2007-11-15 2009-05-16 Acer Inc Heat dissipation fan control method of electric device and the apparatus thereof
CN102609014A (zh) * 2011-01-25 2012-07-25 鼎桥通信技术有限公司 一种单板温度的控制方法
CN102620843A (zh) * 2012-04-18 2012-08-01 上海中科高等研究院 芯片过温监测器
GB2513841A (en) * 2013-04-24 2014-11-12 Isis Technology Ltd Temperature-controlled computing device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN2569190Y (zh) * 2002-09-23 2003-08-27 华为技术有限公司 电子电路系统的风扇转速智能控制装置
CN1780348B (zh) * 2004-11-17 2010-08-25 华为技术有限公司 一种通信系统电源控制方法和系统
FR2896319B1 (fr) * 2006-01-17 2010-09-03 Vernet Thermostat de regulation d'un fluide et procede de fabrication d'un tel thermostat
US8092080B2 (en) * 2008-08-15 2012-01-10 Siemens Energy, Inc. Wireless telemetry circuit structure for measuring temperature in high temperature environments
CN102541665A (zh) * 2010-12-27 2012-07-04 深圳市恒扬科技有限公司 一种降低电子芯片温度的方法和装置
CN104636221B (zh) * 2013-11-12 2020-03-17 研祥智能科技股份有限公司 一种计算机系统故障处理方法和装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN2659032Y (zh) * 2003-09-29 2004-11-24 光宝科技股份有限公司 无线通信装置的散热结构
CN101086670A (zh) * 2006-06-09 2007-12-12 技嘉科技股份有限公司 电子元件的温度控制方法及其系统
TW200921345A (en) * 2007-11-15 2009-05-16 Acer Inc Heat dissipation fan control method of electric device and the apparatus thereof
CN102609014A (zh) * 2011-01-25 2012-07-25 鼎桥通信技术有限公司 一种单板温度的控制方法
CN102620843A (zh) * 2012-04-18 2012-08-01 上海中科高等研究院 芯片过温监测器
GB2513841A (en) * 2013-04-24 2014-11-12 Isis Technology Ltd Temperature-controlled computing device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143619A (zh) * 2021-11-25 2022-03-04 新华三技术有限公司成都分公司 一种基站过温保护方法、装置及电子设备

Also Published As

Publication number Publication date
CN106257372A (zh) 2016-12-28

Similar Documents

Publication Publication Date Title
WO2016202084A1 (zh) 单板过温处理方法及装置
CN103020545B (zh) 一种基于龙芯处理器的过温保护方法
CN106817243A (zh) 服务器资源的管理系统及其管理方法
CN106371540B (zh) 系统电源管理方法、芯片及电子设备
TW201426271A (zh) 目標裝置的過熱保護方法、過熱保護裝置、及其資訊處理系統
US8959379B2 (en) Thermal protection method for computer system and device thereof
CN102609014B (zh) 一种单板温度的控制方法
WO2024164843A1 (zh) 一种调整短路保护值的方法、电动工具及计算机可读介质
US20180011521A1 (en) Information Handling System Having Regional Cooling
CN113377188B (zh) 存储服务器温度控制方法、装置及设备
CN110011279B (zh) 一种开关电源保护电路及开关电源保护方法
WO2015058593A1 (zh) 除湿机的控制方法、控制装置及除湿机
CN102662820A (zh) 数据保护方法和装置
CN103486067B (zh) 风扇调速处理方法及装置
CN104680087B (zh) 一种刀片服务器热插拔的控制电路和装置
CN101661422B (zh) 硬盘过温保护装置及硬盘过温保护方法
CN115543042A (zh) 散热控制方法、控制器、组件、加速卡及电子设备
CN111780381B (zh) 风机的控制方法及装置、空调外机
US20190138074A1 (en) Power monitoring and reduction
WO2017016359A1 (zh) 运算资源的控制方法、装置和计算机存储介质
JP6853044B2 (ja) 電力制御監視装置とこれを用いたコンピュータ及び電力制御監視方法とプログラム
CN103902416A (zh) 一种检测散热器安装情况的方法及数据处理设备
TWI700642B (zh) 安全溫控方法與伺服器
CN103645784A (zh) 一种笔记本电脑外置式散热器
CN113884888B (zh) 一种电池控制方法及装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16810825

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16810825

Country of ref document: EP

Kind code of ref document: A1