TWI761134B - Smart nic with fpga chip overheating monitoring function - Google Patents

Smart nic with fpga chip overheating monitoring function Download PDF

Info

Publication number
TWI761134B
TWI761134B TW110109336A TW110109336A TWI761134B TW I761134 B TWI761134 B TW I761134B TW 110109336 A TW110109336 A TW 110109336A TW 110109336 A TW110109336 A TW 110109336A TW I761134 B TWI761134 B TW I761134B
Authority
TW
Taiwan
Prior art keywords
clock
frequency
signal
clock frequency
clock signal
Prior art date
Application number
TW110109336A
Other languages
Chinese (zh)
Other versions
TW202238385A (en
Inventor
劉葉
Original Assignee
英業達股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 英業達股份有限公司 filed Critical 英業達股份有限公司
Priority to TW110109336A priority Critical patent/TWI761134B/en
Application granted granted Critical
Publication of TWI761134B publication Critical patent/TWI761134B/en
Publication of TW202238385A publication Critical patent/TW202238385A/en

Links

Images

Abstract

A smart NIC with FPGA chip overheating monitoring function includes a FPGA module and a CPLD module. The FPGA module includes an FPGA chip, a thermal detection unit, and a clock signal generation unit. The thermal detection unit detects a temperature of the FPGA chip to generate a chip temperature value. The clock signal generating unit is built-in with a first temperature warning value and a second temperature warning value greater than the first temperature warning value, and the clock signal generating unit generates a clock signal at different clock frequencies according to the chip temperature values. The CPLD module has a built-in counter clock that oscillates with a reference frequency, so that the clock frequency of the clock signal can be analyzed according by the counter clock, so that generates the clock frequencies of the clock signal, and then sent a cooling signal or an emergency cooling signal to a host.

Description

具有FPGA晶片過熱監控功能之智能網卡Smart NIC with FPGA chip thermal monitoring function

本發明係關於一種智能網卡,尤其是指一種具有FPGA晶片過熱監控功能之智能網卡。The present invention relates to an intelligent network card, in particular to an intelligent network card with the function of overheating monitoring of FPGA chips.

一般來說,智能網卡的FPGA晶片過熱監測與保護都是由CPLD來執行,而具體方式主要是利用hot信號通道與thermtrip信號通道等兩個通道分別連接FPGA與CPLD;其中,當CPLD接收到FPGA所傳送之hot信號時,會通知主機來做降頻的動作,以使FPGA晶片的溫度可以降低。另外,當CPLD接收到FPGA所傳送之thermtrip信號時,則會通知主機關閉智能網卡的電源或直接關機來避免FPGA晶片的溫度繼續升高。Generally speaking, the overheating monitoring and protection of the FPGA chip of the smart network card is performed by the CPLD, and the specific method is mainly to use the hot signal channel and the thermtrip signal channel to connect the FPGA and the CPLD respectively; among them, when the CPLD receives the FPGA When the transmitted hot signal is sent, the host will be notified to reduce the frequency so that the temperature of the FPGA chip can be lowered. In addition, when the CPLD receives the thermtrip signal sent by the FPGA, it will notify the host to turn off the power of the smart network card or directly shut down to prevent the temperature of the FPGA chip from continuing to rise.

承上所述,現有的FPGA晶片過熱保護機制雖然可以有效的保護FPGA晶片,但卻需要佔用到FPGA與CPLD的兩個引腳,導致FPGA與CPLD所能控制的元件受到了限制。As mentioned above, although the existing FPGA chip overheating protection mechanism can effectively protect the FPGA chip, it needs to occupy two pins of the FPGA and the CPLD, which limits the components that the FPGA and the CPLD can control.

有鑒於在先前技術中,現有的智能網卡的FPGA晶片主要是透過hot信號通道與thermtrip信號通道來通知CPLD其偵測到的溫度值是否超過預設的安全值,但也因此同時限制了FPGA與CPLD的引腳數;緣此,本發明的主要目的在於提供一種具有FPGA晶片過熱監控功能之智能網卡,可以使FPGA僅利用一條通道來通知CPLD其晶片的兩種過熱程度,使得FPGA與CPLD皆能釋放出一個引腳來連接其他元件進行控制。In view of the fact that in the prior art, the FPGA chip of the existing smart network card mainly informs the CPLD whether the detected temperature value exceeds the preset safe value through the hot signal channel and the thermtrip signal channel. The number of pins of the CPLD; therefore, the main purpose of the present invention is to provide a smart network card with the function of monitoring the overheating of the FPGA chip, so that the FPGA can only use one channel to notify the CPLD of two overheating degrees of its chip, so that both the FPGA and the CPLD are overheated. A pin can be released to connect other components for control.

本發明為解決先前技術之問題,所採用的必要技術手段是提供一種具有FPGA晶片過熱監控功能之智能網卡,係電性連結於一主機,且智能網卡包含一FPGA模組以及一CPLD模組。FPGA模組包含一FPGA晶片、一熱偵測單元以及一時脈訊號產生單元。In order to solve the problems of the prior art, the present invention adopts the necessary technical means to provide an intelligent network card with an FPGA chip overheating monitoring function, which is electrically connected to a host, and the intelligent network card includes an FPGA module and a CPLD module. The FPGA module includes an FPGA chip, a thermal detection unit and a clock signal generation unit.

熱偵測單元係用以偵測FPGA晶片之溫度而產生一晶片溫度值,並據以發出一溫度感測訊號。時脈訊號產生單元係電性連結於熱偵測單元,用以接收溫度感測訊號,內建有一第一溫度警戒值與一大於第一溫度警戒值之第二溫度警戒值,且時脈訊號產生單元係依據晶片溫度值以不同之時脈頻率發出一時脈訊號,當晶片溫度值介於第一溫度警戒值與第二溫度警戒值之間時以一第一時脈頻率發送出時脈訊號,當晶片溫度值大於等於第二溫度警戒值時以一小於第一時脈頻率之第二時脈頻率發送出時脈訊號。The thermal detection unit is used for detecting the temperature of the FPGA chip to generate a chip temperature value, and accordingly send out a temperature sensing signal. The clock signal generating unit is electrically connected to the thermal detection unit for receiving the temperature sensing signal, and has a built-in first temperature warning value and a second temperature warning value greater than the first temperature warning value, and the clock signal The generating unit sends a clock signal at different clock frequencies according to the chip temperature value, and sends the clock signal at a first clock frequency when the chip temperature value is between the first temperature warning value and the second temperature warning value , when the chip temperature value is greater than or equal to the second temperature warning value, a clock signal is sent out at a second clock frequency less than the first clock frequency.

CPLD模組係電性連結於時脈訊號產生單元,內建有一以一基準計次頻率振盪之計次時脈,藉以依據計次時脈之振盪次數解析出時脈訊號之時脈頻率,在解析出時脈訊號之時脈頻率為第一時脈頻率時,將一降溫訊號發送至主機,並在解析出時脈訊號之時脈頻率為第二時脈頻率時,將一緊急降溫訊號發送至主機。The CPLD module is electrically connected to the clock signal generating unit, and has a built-in clock clock that oscillates with a reference clock frequency, so as to analyze the clock frequency of the clock signal according to the oscillation number of the clock clock. When the clock frequency of the analyzed clock signal is the first clock frequency, send a cooling signal to the host, and when the clock frequency of the analyzed clock signal is the second clock frequency, send an emergency cooling signal to the host.

在上述必要技術手段所衍生之一附屬技術手段中,CPLD模組更包含一計次單元、一時脈解析單元以及一過熱監控單元。In an auxiliary technical means derived from the above-mentioned necessary technical means, the CPLD module further includes a count unit, a clock analysis unit and an overheat monitoring unit.

計次單元係電性連結於時脈訊號產生單元,內建有計次時脈,藉以在時脈訊號之振盪週期間,計算以基準計次頻率進行振盪之次數而產生一即時振盪累積次數。The timing unit is electrically connected to the clock signal generating unit, and has a built-in timing clock, so as to calculate the number of oscillations at the reference timing frequency during the oscillation period of the clock signal to generate a real-time cumulative number of oscillations.

時脈解析單元係電性連結於計次單元,並內建有一第一振盪判斷值與一大於第一振盪判斷值之第二振盪判斷值,當計次時脈之振盪滿足時脈訊號之振盪週期,且即時振盪累積次數小於等於第一振盪判斷值時,判斷時脈訊號之時脈頻率為第一時脈頻率,當計次時脈之即時振盪累積次數大於第二振盪判斷值時,判斷時脈訊號之時脈頻率為第二時脈頻率。The clock analysis unit is electrically connected to the timing unit, and has a built-in first oscillation judgment value and a second oscillation judgment value greater than the first oscillation judgment value. When the oscillation of the timing clock meets the oscillation of the clock signal period, and the cumulative number of real-time oscillations is less than or equal to the first oscillation judgment value, the clock frequency of the clock signal is judged to be the first clock frequency; The clock frequency of the clock signal is the second clock frequency.

過熱監控單元係電性連結於時脈解析單元,用以在時脈解析單元判斷時脈訊號之時脈頻率為第一時脈頻率時,發送降溫訊號至主機,並在時脈解析單元判斷時脈訊號之時脈頻率為第二時脈頻率時,發送緊急降溫訊號至主機。The overheat monitoring unit is electrically connected to the clock analysis unit, and is used for sending a cooling signal to the host when the clock analysis unit determines that the clock frequency of the clock signal is the first clock frequency, and when the clock analysis unit determines When the clock frequency of the pulse signal is the second clock frequency, an emergency cooling signal is sent to the host.

較佳者,基準計次頻率為第一時脈頻率之10倍,且第一振盪判斷值係為10。此外,基準計次頻率為第二時脈頻率之100倍,且第二振盪判斷值係大於10且小於100;更進一步地,第二振盪判斷值係為30。Preferably, the reference count frequency is 10 times the first clock frequency, and the first oscillation judgment value is 10. In addition, the reference count frequency is 100 times the second clock frequency, and the second oscillation judgment value is greater than 10 and less than 100; further, the second oscillation judgment value is 30.

如上所述,由於本發明之具有FPGA晶片過熱監控功能之智能網卡主要是FPGA模組利用時脈訊號產生單元依據不同的晶片溫度值以不同之時脈頻率發出時脈訊號至CPLD模組,而CPLD模組再透過基準計次頻率振盪之振盪解析出時脈訊號之時脈頻率,進而判斷出FPGA晶片的溫度值為何,以進一步通知主機作降頻或關機等動作。As mentioned above, because the smart network card with FPGA chip overheating monitoring function of the present invention mainly uses the clock signal generation unit of the FPGA module to send clock signals to the CPLD module with different clock frequencies according to different chip temperature values, and The CPLD module then analyzes the clock frequency of the clock signal through the oscillation of the reference count frequency oscillation, and then determines the temperature value of the FPGA chip, so as to further notify the host to perform actions such as frequency reduction or shutdown.

本發明所採用的具體實施例,將藉由以下之實施例及圖式作進一步之說明。The specific embodiments adopted by the present invention will be further described by the following embodiments and drawings.

請參閱第一圖,第一圖係顯示本發明較佳實施例所提供之具有FPGA晶片過熱監控功能之智能網卡之系統方塊圖。如第一圖所示,一種具有FPGA晶片過熱監控功能之智能網卡(以下簡稱智能網卡)100,係電性連結於一主機200,且主機200包含一過熱處理單元201、一處理器202與一電力供應單元203。Please refer to the first figure. The first figure is a system block diagram of an intelligent network card with an overheat monitoring function of an FPGA chip provided by a preferred embodiment of the present invention. As shown in the first figure, a smart network card (hereinafter referred to as a smart network card) 100 with an FPGA chip overheating monitoring function is electrically connected to a host 200, and the host 200 includes an overheating unit 201, a processor 202 and a Power supply unit 203 .

智能網卡100包含一FPGA模組1以及一CPLD模組2。FPGA模組1包含一FPGA晶片11、一熱偵測單元12以及一時脈訊號產生單元13。熱偵測單元12係用以偵測FPGA晶片11之溫度而產生一晶片溫度值,並據以發出一溫度感測訊號12a。The smart network card 100 includes an FPGA module 1 and a CPLD module 2 . The FPGA module 1 includes an FPGA chip 11 , a thermal detection unit 12 and a clock signal generation unit 13 . The thermal detection unit 12 is used for detecting the temperature of the FPGA chip 11 to generate a chip temperature value, and accordingly send a temperature sensing signal 12a.

時脈訊號產生單元13係電性連結於熱偵測單元12,用以接收溫度感測訊號12a,並內建有一第一溫度警戒值與一大於第一溫度警戒值之第二溫度警戒值,且時脈訊號產生單元13係依據晶片溫度值以不同之時脈頻率發出一時脈訊號13a;其中,當晶片溫度值介於第一溫度警戒值與第二溫度警戒值之間時是以一第一時脈頻率發送出時脈訊號13a,當晶片溫度值大於等於第二溫度警戒值時以一小於第一時脈頻率之第二時脈頻率發送出時脈訊號13a。The clock signal generating unit 13 is electrically connected to the thermal detection unit 12 for receiving the temperature sensing signal 12a, and has a first temperature warning value and a second temperature warning value greater than the first temperature warning value built in. And the clock signal generating unit 13 sends out a clock signal 13a with different clock frequencies according to the chip temperature value; wherein, when the chip temperature value is between the first temperature warning value and the second temperature warning value, a first temperature warning value is used. The clock signal 13a is sent out at a clock frequency, and the clock signal 13a is sent out at a second clock frequency less than the first clock frequency when the chip temperature value is greater than or equal to the second temperature warning value.

CPLD模組2包含一計次單元21、一時脈解析單元22以及一過熱監控單元23。計次單元21係電性連結於時脈訊號產生單元13,內建有一計次時脈211,計次時脈211是以一基準計次頻率進行振盪,藉以在時脈訊號13a之振盪週期間,計算以基準計次頻率進行振盪之次數而產生一即時振盪累積次數。The CPLD module 2 includes a count unit 21 , a clock analysis unit 22 and an overheat monitoring unit 23 . The counting unit 21 is electrically connected to the clock signal generating unit 13, and has a built-in counting clock 211. The counting clock 211 oscillates at a reference counting frequency, so that during the oscillation period of the clock signal 13a, the counting clock 211 oscillates. , calculate the number of oscillations at the reference frequency to generate a cumulative number of real-time oscillations.

時脈解析單元22係電性連結於計次單元21,並內建有一第一振盪判斷值221與一大於第一振盪判斷值221之第二振盪判斷值222;其中,當計次時脈211之振盪滿足時脈訊號之振盪週期,且即時振盪累積次數小於等於第一振盪判斷值221時,判斷時脈訊號13a之時脈頻率為第一時脈頻率,然而當計次時脈之即時振盪累積次數大於第二振盪判斷值時,判斷時脈訊號13a之時脈頻率為第二時脈頻率。The clock analysis unit 22 is electrically connected to the counting unit 21, and has a built-in first oscillation judgment value 221 and a second oscillation judgment value 222 greater than the first oscillation judgment value 221; When the oscillation meets the oscillation period of the clock signal, and the cumulative number of real-time oscillations is less than or equal to the first oscillation judgment value 221, it is determined that the clock frequency of the clock signal 13a is the first clock frequency. When the accumulated number of times is greater than the second oscillation determination value, it is determined that the clock frequency of the clock signal 13a is the second clock frequency.

過熱監控單元23係電性連結於時脈解析單元22,用以在時脈解析單元22判斷時脈訊號13a之時脈頻率為第一時脈頻率時,發送一降溫訊號23a至主機200之過熱處理單元201,使過熱處理單元201依據降溫訊號23a發送一降頻訊號201a至處理器202,藉以使處理器202透過降頻運作來減輕FPGA晶片11之負擔,進而降低FPGA晶片11之溫度;此外,當時脈解析單元22判斷時脈訊號13a之時脈頻率為第二時脈頻率時,過熱監控單元23會發送一緊急降溫訊號23b至主機200之過熱處理單元201,使過熱處理單元201依據緊急降溫訊號23b發送一關機訊號201b至電力供應單元203,藉以使電力供應單元203透過停止電力供應來關閉智能網卡100之運作,進而避免FPGA晶片11因為持續運作所產生之高溫而毀損。The overheat monitoring unit 23 is electrically connected to the clock analysis unit 22 for sending a cooling signal 23a to the overheating of the host 200 when the clock analysis unit 22 determines that the clock frequency of the clock signal 13a is the first clock frequency The processing unit 201 enables the overheating unit 201 to send a frequency reduction signal 201a to the processor 202 according to the temperature reduction signal 23a, so that the processor 202 can reduce the burden of the FPGA chip 11 through frequency reduction operation, thereby reducing the temperature of the FPGA chip 11; , when the clock analysis unit 22 determines that the clock frequency of the clock signal 13a is the second clock frequency, the overheat monitoring unit 23 sends an emergency cooling signal 23b to the overheating unit 201 of the host 200, so that the overheating unit 201 responds to the emergency The cooling signal 23b sends a shutdown signal 201b to the power supply unit 203, so that the power supply unit 203 shuts down the operation of the smart network card 100 by stopping the power supply, thereby preventing the FPGA chip 11 from being damaged due to the high temperature generated by the continuous operation.

請繼續參閱第一圖至第三圖,第二圖係顯示計次時脈之基準計次頻率與第一時脈頻率之比較示意圖;第三圖係顯示計次時脈之基準計次頻率與第二時脈頻率之比較示意圖。如第一圖至第二圖所示,在本實施例中,基準計次頻率例如為10KHz,第一時脈頻率例如為1KHz,第二時脈頻率例如為100Hz,藉此,當時脈訊號13a之時脈頻率為第一時脈頻率1KHz時,在時脈訊號13a之振盪週期P間,計次時脈211以基準計次頻率10KHz進行振盪所產生之即時振盪累積次數會是10。更詳細的說,時脈訊號13a之振盪週期P更分為前半週期P1與後半週期P2,在時脈訊號13a以第一時脈頻率1KHz作為時脈頻率振盪至前半週期P1之時間點T1時,基準計次頻率10KHz之即時振盪累積次數為5,而當時脈訊號13a以第一時脈頻率1KHz作為時脈頻率自時間點T1振盪至後半週期P2之時間點T2時,基準計次頻率10KHz之即時振盪累積次數會累積到10。Please continue to refer to the first figure to the third figure, the second figure is a schematic diagram showing the comparison between the reference clock frequency of the clock clock and the first clock frequency; the third figure shows the reference clock frequency of the clock clock and A schematic diagram of the comparison of the second clock frequency. As shown in the first to second diagrams, in this embodiment, the reference count frequency is, for example, 10KHz, the first clock frequency is, for example, 1KHz, and the second clock frequency is, for example, 100Hz, whereby the clock signal 13a is When the clock frequency is the first clock frequency of 1KHz, during the oscillation period P of the clock signal 13a, the cumulative number of real-time oscillations generated by the timing clock 211 oscillating at the reference timing frequency of 10KHz will be 10. More specifically, the oscillation period P of the clock signal 13a is further divided into the first half period P1 and the second half period P2. When the clock signal 13a oscillates with the first clock frequency of 1KHz as the clock frequency to the time point T1 of the first half period P1 , the cumulative number of real-time oscillations at the reference frequency of 10KHz is 5, and the clock signal 13a uses the first clock frequency of 1KHz as the clock frequency to oscillate from the time point T1 to the time point T2 of the second half cycle P2, and the reference count frequency is 10KHz. The number of real-time oscillation accumulation will be accumulated to 10.

此外,當時脈訊號13a之時脈頻率為第二時脈頻率100Hz時,在時脈訊號13a之振盪週期P'間,計次時脈211以基準計次頻率10KHz進行振盪所產生之即時振盪累積次數會是100。更詳細的說,時脈訊號13a之振盪週期P'同樣更分為前半週期P1'與後半週期P2',在時脈訊號13a以第二時脈頻率100Hz作為時脈頻率振盪至前半週期P1'之時間點T1'時,基準計次頻率10KHz之即時振盪累積次數為50,而當時脈訊號13a以第二時脈頻率100Hz作為時脈頻率自時間點T1'振盪至後半週期P2'之時間點T2'時,基準計次頻率10KHz之即時振盪累積次數會累積到100。In addition, when the clock frequency of the clock signal 13a is the second clock frequency of 100Hz, during the oscillation period P' of the clock signal 13a, the timing clock 211 oscillates at the reference clock frequency of 10KHz, and the real-time oscillation accumulated The number of times will be 100. More specifically, the oscillation period P' of the clock signal 13a is also further divided into the first half period P1' and the second half period P2'. At the time point T1', the cumulative number of real-time oscillations at the reference frequency of 10KHz is 50, and the clock signal 13a uses the second clock frequency of 100Hz as the clock frequency from the time point T1' to the time point of the second half cycle P2'. At T2', the real-time oscillation accumulation times of the reference frequency of 10KHz will accumulate to 100.

承上所述,在本實施例中,當熱偵測單元12偵測到FPGA晶片11之晶片溫度值介於第一溫度警戒值與第二溫度警戒值之間時,時脈訊號產生單元13會以第一時脈頻率發送出時脈訊號13a至CPLD模組2,此時透過計次時脈211計算,在時脈訊號13a之振盪週期內以基準計次頻率進行振盪所產生的即時振盪累積次數,而由於基準計次頻率10KHz為第一時脈頻率1KHz的10倍,因此當計次時脈211以基準計次頻率進行振盪至滿足時脈訊號13a之振盪週期P時,其計次時脈211之即時振盪累積次數會是10;藉此,時脈解析單元22會因為即時振盪累積次數小於等於第一振盪判斷值221(本實施例為10),進而判斷時脈訊號13a之時脈頻率為第一時脈頻率,使過熱監控單元23可以依據時脈訊號13a之時脈頻率為第一時脈頻率來判斷FPGA晶片11之晶片溫度值介於第一溫度警戒值與第二溫度警戒值之間。As mentioned above, in this embodiment, when the thermal detection unit 12 detects that the chip temperature value of the FPGA chip 11 is between the first temperature warning value and the second temperature warning value, the clock signal generating unit 13 The clock signal 13a is sent to the CPLD module 2 at the first clock frequency. At this time, the clock signal 13a is calculated by the clock clock 211 to oscillate at the reference clock frequency during the oscillation period of the clock signal 13a. Accumulated times, and since the reference counting frequency 10KHz is 10 times the first clock frequency 1KHz, when the counting clock 211 oscillates at the reference counting frequency to satisfy the oscillation period P of the clock signal 13a, its counting The cumulative number of real-time oscillations of the clock 211 will be 10; thus, the clock analysis unit 22 will determine the time of the clock signal 13 a because the cumulative number of real-time oscillations is less than or equal to the first oscillation judgment value 221 (10 in this embodiment). The pulse frequency is the first clock frequency, so that the overheat monitoring unit 23 can determine that the chip temperature value of the FPGA chip 11 is between the first temperature warning value and the second temperature according to the clock frequency of the clock signal 13a being the first clock frequency between the warning values.

此外,當熱偵測單元12偵測到FPGA晶片11之晶片溫度值大於等於第二溫度警戒值時,時脈訊號產生單元13會以第二時脈頻率發送出時脈訊號13a至CPLD模組2,此時透過計次時脈211計算,在時脈訊號13a之振盪週期P'內以基準計次頻率進行振盪所產生的即時振盪累積次數,而由於基準計次頻率10KHz為第二時脈頻率100Hz的100倍,因此當計次時脈211以基準計次頻率進行振盪至滿足時脈訊號13a之振盪週期P'時,其計次時脈211之即時振盪累積次數會是100;然而,由於本實施例之第二振盪判斷值為30,因此雖然計次時脈211以基準計次頻率進行振盪至即時振盪累積次數累積到大於30時,時脈解析單元22便會判斷時脈訊號13a之時脈頻率為第二時脈頻率,使過熱監控單元23可以依據時脈訊號13a之時脈頻率為第二時脈頻率來判斷FPGA晶片11之晶片溫度值大於等於該第二溫度警戒值。In addition, when the thermal detection unit 12 detects that the chip temperature value of the FPGA chip 11 is greater than or equal to the second temperature warning value, the clock signal generating unit 13 will send the clock signal 13a to the CPLD module at the second clock frequency 2. At this time, through the calculation of the timing clock 211, the cumulative number of real-time oscillations generated by oscillating at the reference timing frequency within the oscillation period P' of the clock signal 13a, and since the reference timing frequency 10KHz is the second clock The frequency is 100 times of 100Hz, so when the timing clock 211 oscillates at the reference timing frequency to satisfy the oscillation period P' of the clock signal 13a, the cumulative number of real-time oscillations of the timing clock 211 will be 100; however, Since the second oscillation determination value of this embodiment is 30, although the timing clock 211 oscillates at the reference timing frequency until the accumulated number of real-time oscillations exceeds 30, the clock analysis unit 22 will determine the clock signal 13a The clock frequency is the second clock frequency, so that the overheat monitoring unit 23 can determine that the chip temperature value of the FPGA chip 11 is greater than or equal to the second temperature warning value according to the clock frequency of the clock signal 13a being the second clock frequency.

需特別說明的是,雖然在本實施例中,計次時脈211以基準計次頻率進行振盪至滿足時脈訊號13a之振盪週期P'時,其計次時脈211之即時振盪累積次數會是100,但由於計次時脈211之即時振盪累積次數超過第一振盪判斷值時,FPGA晶片11之晶片溫度值便已經是大於等於第二溫度警戒值,因此將第二振盪判斷值設為30可以提早判斷出FPGA晶片11之晶片溫度值大於等於第二溫度警戒值,換句話說,第二振盪判斷值設為11至100都能反應出FPGA晶片11之晶片溫度值大於等於第二溫度警戒值,只是第二振盪判斷值略超過第一振盪判斷值的10,可以避免產生誤判,而不到最高值100的一半,則可以提高判斷的效率。It should be noted that, although in this embodiment, the timing clock 211 oscillates at the reference timing frequency until the oscillation period P' of the clock signal 13a is satisfied, the accumulated number of real-time oscillations of the timing clock 211 will be is 100, but since the cumulative number of real-time oscillations of the timing clock 211 exceeds the first oscillation judgment value, the chip temperature value of the FPGA chip 11 is already greater than or equal to the second temperature warning value, so the second oscillation judgment value is set as 30 can determine in advance that the chip temperature value of the FPGA chip 11 is greater than or equal to the second temperature warning value. In other words, setting the second oscillation judgment value to 11 to 100 can reflect that the chip temperature value of the FPGA chip 11 is greater than or equal to the second temperature For the warning value, only the second oscillation judgment value is slightly more than 10 of the first oscillation judgment value, which can avoid misjudgment, and is less than half of the maximum value of 100, which can improve the judgment efficiency.

綜上所述,相較於先前技術的智能網卡是江FPGA晶片之溫度透過hot信號通道與thermtrip信號通道通知CPLD,進而限制了FPGA與CPLD的引腳數,本發明之具有FPGA晶片過熱監控功能之智能網卡主要是FPGA模組利用時脈訊號產生單元依據不同的晶片溫度值以不同之時脈頻率發出時脈訊號至CPLD模組,而CPLD模組再透過基準計次頻率振盪之振盪解析出時脈訊號之時脈頻率,進而判斷出FPGA晶片的溫度值為何,以進一步通知主機作降頻或關機等動作,藉此,FPGA模組與CPLD模組都能因為少了一條信號通道而各釋放出一個引腳,進而可以用於連結其他元件加以控制。To sum up, compared with the smart network card of the prior art, the temperature of the FPGA chip is notified to the CPLD through the hot signal channel and the thermtrip signal channel, thereby limiting the number of pins of the FPGA and the CPLD. The present invention has the function of monitoring the overheating of the FPGA chip. The smart network card is mainly that the FPGA module uses the clock signal generation unit to send clock signals to the CPLD module at different clock frequencies according to different chip temperature values, and the CPLD module then analyzes the oscillation of the reference frequency oscillation. The clock frequency of the clock signal, and then determine the temperature value of the FPGA chip, so as to further notify the host to perform actions such as frequency reduction or shutdown. In this way, the FPGA module and the CPLD module can be separated from each other because one signal channel is missing. A pin is released, which can then be used to connect other components for control.

藉由以上較佳具體實施例之詳述,係希望能更加清楚描述本發明之特徵與精神,而並非以上述所揭露的較佳具體實施例來對本發明之範疇加以限制。相反地,其目的是希望能涵蓋各種改變及具相等性的安排於本發明所欲申請之專利範圍的範疇內。Through the detailed description of the preferred embodiments above, it is hoped that the features and spirit of the present invention can be described more clearly, and the scope of the present invention is not limited by the preferred embodiments disclosed above. On the contrary, the intention is to cover various modifications and equivalent arrangements within the scope of the claimed scope of the present invention.

100:具有FPGA晶片過熱監控功能之智能網卡 1:FPGA模組 11:FPGA晶片 12:熱偵測單元 13:時脈訊號產生單元 2:CPLD模組 21:計次單元 211:計次時脈 22:時脈解析單元 221:第一振盪判斷值 222:第二振盪判斷值 23:過熱監控單元 200:主機 201:過熱處理單元 202:處理器 203:電力供應單元 12a:溫度感測訊號 13a:時脈訊號 23a:降溫訊號 23b:緊急降溫訊號 201a:降頻訊號 201b:關機訊號 P,P':振盪週期 P1,P1':前半週期 P2,P2':後半週期 T1,T1',T2,T2':時間點100: Smart NIC with FPGA chip overheating monitoring function 1: FPGA module 11:FPGA chip 12: Thermal detection unit 13: Clock signal generation unit 2: CPLD module 21: Counting unit 211: count clock 22: Clock analysis unit 221: The first oscillation judgment value 222: The second oscillation judgment value 23: Overheating monitoring unit 200: host 201: Overheating unit 202: Processor 203: Power Supply Unit 12a: Temperature sensing signal 13a: Clock signal 23a: cooling signal 23b: Emergency cooling signal 201a: Down-frequency signal 201b: Shutdown signal P,P': oscillation period P1,P1': the first half cycle P2,P2': the second half cycle T1,T1',T2,T2': time points

第一圖係顯示本發明較佳實施例所提供之具有FPGA晶片過熱監控功能之智能網卡之系統方塊圖; 第二圖係顯示計次時脈之基準計次頻率與第一時脈頻率之比較示意圖;以及 第三圖係顯示計次時脈之基準計次頻率與第二時脈頻率之比較示意圖。 The first figure shows the system block diagram of the smart network card with FPGA chip overheating monitoring function provided by the preferred embodiment of the present invention; The second figure is a schematic diagram showing the comparison between the reference clock frequency of the clock clock and the first clock frequency; and The third figure is a schematic diagram showing the comparison between the reference clock frequency of the clock clock and the second clock frequency.

100:具有FPGA晶片過熱監控功能之智能網卡 100: Smart NIC with FPGA chip overheating monitoring function

1:FPGA模組 1: FPGA module

11:FPGA晶片 11:FPGA chip

12:熱偵測單元 12: Thermal detection unit

13:時脈訊號產生單元 13: Clock signal generation unit

2:CPLD模組 2: CPLD module

21:計次單元 21: Counting unit

211:計次時脈 211: count clock

22:時脈解析單元 22: Clock analysis unit

221:第一振盪判斷值 221: The first oscillation judgment value

222:第二振盪判斷值 222: The second oscillation judgment value

23:過熱監控單元 23: Overheating monitoring unit

200:主機 200: host

201:過熱處理單元 201: Overheating unit

202:處理器 202: Processor

203:電力供應單元 203: Power Supply Unit

12a:溫度感測訊號 12a: Temperature sensing signal

13a:時脈訊號 13a: Clock signal

23a:降溫訊號 23a: cooling signal

23b:緊急降溫訊號 23b: Emergency cooling signal

201a:降頻訊號 201a: Down-frequency signal

201b:關機訊號 201b: Shutdown signal

Claims (5)

一種具有FPGA晶片過熱監控功能之智能網卡,係電性連結於一主機,且該智能網卡包含: 一FPGA模組,包含: 一FPGA晶片; 一熱偵測單元,係用以偵測該FPGA晶片之溫度而產生一晶片溫度值,並據以發出一溫度感測訊號;以及 一時脈訊號產生單元,係電性連結於該熱偵測單元,用以接收該溫度感測訊號,內建有一第一溫度警戒值與一大於該第一溫度警戒值之第二溫度警戒值,且該時脈訊號產生單元係依據該晶片溫度值以不同之時脈頻率發出一時脈訊號,當該晶片溫度值介於該第一溫度警戒值與該第二溫度警戒值之間時以一第一時脈頻率發送出該時脈訊號,當該晶片溫度值大於等於該第二溫度警戒值時以一小於該第一時脈頻率之第二時脈頻率發送出該時脈訊號;以及 一CPLD模組,係電性連結於該時脈訊號產生單元,內建有一以一基準計次頻率振盪之計次時脈,且該基準計次頻率係大於該第一時脈頻率或該第二時脈頻率,藉以依據該計次時脈之振盪次數解析出該時脈訊號之時脈頻率,在解析出該時脈訊號之時脈頻率為該第一時脈頻率時,將一降溫訊號發送至該主機,並在解析出該時脈訊號之時脈頻率為該第二時脈頻率時,將一緊急降溫訊號發送至該主機。 An intelligent network card with an FPGA chip overheating monitoring function is electrically connected to a host, and the intelligent network card comprises: An FPGA module, including: an FPGA chip; a thermal detection unit for detecting the temperature of the FPGA chip to generate a chip temperature value, and to send a temperature sensing signal accordingly; and A clock signal generating unit is electrically connected to the thermal detection unit for receiving the temperature sensing signal, and has a built-in first temperature warning value and a second temperature warning value greater than the first temperature warning value, And the clock signal generating unit sends out a clock signal at different clock frequencies according to the chip temperature value, when the chip temperature value is between the first temperature warning value and the second temperature warning value, a first temperature warning value is used. The clock signal is sent at a clock frequency, and the clock signal is sent at a second clock frequency less than the first clock frequency when the chip temperature value is greater than or equal to the second temperature warning value; and A CPLD module, which is electrically connected to the clock signal generating unit, has a built-in clock clock oscillating at a reference clock frequency, and the reference clock frequency is greater than the first clock frequency or the first clock frequency. Two clock frequencies, so as to analyze the clock frequency of the clock signal according to the number of oscillations of the timing clock, when the clock frequency of the clock signal is analyzed to be the first clock frequency, a cooling signal Send to the host, and send an emergency cooling signal to the host when the clock frequency of the clock signal is analyzed to be the second clock frequency. 如請求項1所述之具有FPGA晶片過熱監控功能之智能網卡,其中,該CPLD模組更包含: 一計次單元,係電性連結於該時脈訊號產生單元,內建有該計次時脈,藉以在該時脈訊號之振盪週期間,計算以該基準計次頻率進行振盪之次數而產生一即時振盪累積次數; 一時脈解析單元,係電性連結於該計次單元,並內建有一第一振盪判斷值與一大於該第一振盪判斷值之第二振盪判斷值,當該計次時脈之振盪滿足該時脈訊號之振盪週期,且該即時振盪累積次數小於等於該第一振盪判斷值時,判斷該時脈訊號之時脈頻率為該第一時脈頻率,當該計次時脈之該即時振盪累積次數大於該第二振盪判斷值時,判斷該時脈訊號之時脈頻率為該第二時脈頻率;以及 一過熱監控單元,係電性連結於該時脈解析單元,用以在該時脈解析單元判斷該時脈訊號之時脈頻率為該第一時脈頻率時,發送該降溫訊號至該主機,並在該時脈解析單元判斷該時脈訊號之時脈頻率為該第二時脈頻率時,發送該緊急降溫訊號至該主機。 The smart network card with FPGA chip overheating monitoring function as described in claim 1, wherein the CPLD module further comprises: A counting unit, which is electrically connected to the clock signal generating unit, has the counting clock built in, and is generated by calculating the number of oscillations at the reference counting frequency during the oscillation period of the clock signal. 1. The cumulative number of real-time oscillations; A clock analysis unit is electrically connected to the count unit, and has a built-in first oscillation judgment value and a second oscillation judgment value greater than the first oscillation judgment value. When the oscillation of the count clock meets the The oscillation period of the clock signal, and the cumulative number of real-time oscillations is less than or equal to the first oscillation judgment value, determine that the clock frequency of the clock signal is the first clock frequency, when the real-time oscillation of the timing clock When the accumulated number of times is greater than the second oscillation determination value, determine that the clock frequency of the clock signal is the second clock frequency; and an overheat monitoring unit electrically connected to the clock analysis unit for sending the cooling signal to the host when the clock analysis unit determines that the clock frequency of the clock signal is the first clock frequency, And when the clock analysis unit determines that the clock frequency of the clock signal is the second clock frequency, the emergency cooling signal is sent to the host. 如請求項2所述之具有FPGA晶片過熱監控功能之智能網卡,其中,該基準計次頻率為該第一時脈頻率之10倍,且該第一振盪判斷值係為10。The smart network card with FPGA chip overheating monitoring function as described in claim 2, wherein the reference count frequency is 10 times the first clock frequency, and the first oscillation judgment value is 10. 如請求項3所述之具有FPGA晶片過熱監控功能之智能網卡,其中,該基準計次頻率為該第二時脈頻率之100倍,且該第二振盪判斷值係大於10且小於100。The smart network card with FPGA chip overheat monitoring function as claimed in claim 3, wherein the reference count frequency is 100 times the second clock frequency, and the second oscillation judgment value is greater than 10 and less than 100. 如請求項4所述之具有FPGA晶片過熱監控功能之智能網卡,其中,該第二振盪判斷值係為30。The smart network card with FPGA chip overheating monitoring function according to claim 4, wherein the second oscillation judgment value is 30.
TW110109336A 2021-03-16 2021-03-16 Smart nic with fpga chip overheating monitoring function TWI761134B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW110109336A TWI761134B (en) 2021-03-16 2021-03-16 Smart nic with fpga chip overheating monitoring function

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW110109336A TWI761134B (en) 2021-03-16 2021-03-16 Smart nic with fpga chip overheating monitoring function

Publications (2)

Publication Number Publication Date
TWI761134B true TWI761134B (en) 2022-04-11
TW202238385A TW202238385A (en) 2022-10-01

Family

ID=82199152

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110109336A TWI761134B (en) 2021-03-16 2021-03-16 Smart nic with fpga chip overheating monitoring function

Country Status (1)

Country Link
TW (1) TWI761134B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201039104A (en) * 2009-04-21 2010-11-01 Univ Nat Sun Yat Sen System and method for controlling chip temperature
US20190226922A1 (en) * 2018-01-24 2019-07-25 Samsung Electronics Co., Ltd. Temperature sensing device and temperature-voltage converter
CN110515437A (en) * 2019-08-16 2019-11-29 苏州浪潮智能科技有限公司 A kind of high-temperature protection method and device of FPGA accelerator card
US20200089292A1 (en) * 2018-09-18 2020-03-19 Dell Products L.P. Apparatus And Method To Improve Thermal Management For One Or More Heat Generating Components Within An Information Handling System
TW202106150A (en) * 2019-07-17 2021-02-01 中華電信股份有限公司 Smart chassis cooling system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201039104A (en) * 2009-04-21 2010-11-01 Univ Nat Sun Yat Sen System and method for controlling chip temperature
US20190226922A1 (en) * 2018-01-24 2019-07-25 Samsung Electronics Co., Ltd. Temperature sensing device and temperature-voltage converter
US20200089292A1 (en) * 2018-09-18 2020-03-19 Dell Products L.P. Apparatus And Method To Improve Thermal Management For One Or More Heat Generating Components Within An Information Handling System
TW202106150A (en) * 2019-07-17 2021-02-01 中華電信股份有限公司 Smart chassis cooling system
CN110515437A (en) * 2019-08-16 2019-11-29 苏州浪潮智能科技有限公司 A kind of high-temperature protection method and device of FPGA accelerator card

Also Published As

Publication number Publication date
TW202238385A (en) 2022-10-01

Similar Documents

Publication Publication Date Title
US20070124609A1 (en) System and method for controlling cpu overclocking
CN102724846B (en) Cooling control method and controller for heat sink
TW201312966A (en) Network wakeup system with protection function
CN107482590B (en) A kind of driving of IGBT module and fault secure circuit
TWI761134B (en) Smart nic with fpga chip overheating monitoring function
CN104834581A (en) Oscillation stop monitoring circuit of crystal oscillator
CN105743068B (en) A kind of reacance generator protects system
EP1936392B1 (en) Automatic frequency monitoring circuit, electronic device, automatic frequency monitoring method and automatic frequency monitoring program
KR102644538B1 (en) System monitoring computational procedure of pwm duty for controlling motor
US20020130695A1 (en) System and method for dynamically controlling an integrated circuit's clock
CN106066817A (en) clock monitoring circuit and method thereof
US8638154B2 (en) System clock monitoring apparatus and motor control system
CN107962280A (en) Electric welding machine thermel protection device
CN100461065C (en) Frequency regulation method and device for clock/data restoring circuit
CN112764999A (en) Intelligent network card with FPGA chip overheating monitoring function
CN108089631B (en) Clock detection circuit for micro-controller chip
WO2021190093A1 (en) Server system, and frequency control device for processor therein
CN110701084A (en) Fan control method in electronic system
CN207867388U (en) A kind of clock circuit for detecting for microcontroller chip
CN107179911A (en) A kind of method and apparatus for restarting management engine
CN101377755A (en) Mainboard, computer system and multiple monitoring method thereof
CN112578181A (en) Oscillator abnormal state detection circuit
JP3166552B2 (en) CPU monitoring method and CPU monitoring device
CN217692087U (en) Semiconductor double-water-flow protection device
TWI782841B (en) Event detection control device and method for circuit system controlled by pulse wave modulation signal