JP2009287883A

JP2009287883A - Temperature anomaly cause portion determining device and temperature anomaly cause portion determining method

Info

Publication number: JP2009287883A
Application number: JP2008143121A
Authority: JP
Inventors: Noriaki Ishihara; 礼暁石原
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2008-05-30
Filing date: 2008-05-30
Publication date: 2009-12-10

Abstract

<P>PROBLEM TO BE SOLVED: To provide a temperature anomaly cause portion determining device and method, efficiently and automatically determining a temperature anomaly cause portion in a passage of cooling air supplied to an equipment group. <P>SOLUTION: The temperature anomaly cause portion detecting device 1 determining a temperature anomaly cause portion in the passage of cooling air supplied from one or a plurality of cooling sources, is equipped with: a temperature anomaly detection result acquiring part 11 acquiring a detection result of temperature anomaly of the equipment; and a determining part 12 determining the temperature anomaly cause portion in the passage of cooling air on the basis of passage information of the cooling air supplied to the equipment group and the detection result of temperature anomaly of the equipment. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、機器群に対して供給される冷却空気の流路における温度異常原因箇所を判定する温度異常原因箇所判定装置及び温度異常原因箇所判定方法に関する。 The present invention relates to a temperature abnormality cause location determination apparatus and a temperature abnormality cause location determination method for determining a temperature abnormality cause location in a flow path of cooling air supplied to a device group.

空調機の高圧側圧力の計測結果に基づいて、故障した空調機があるか否かを判定し、該判定結果に基づいて、他の空調機の冷却能力を増加させるように補完制御する空調機監視システムが提案されている（例えば、特許文献１参照）。
特開２００６−６４２５４号公報 Based on the measurement result of the high-pressure side pressure of the air conditioner, it is determined whether there is a malfunctioning air conditioner, and based on the determination result, the air conditioner that performs complementary control to increase the cooling capacity of other air conditioners A monitoring system has been proposed (see, for example, Patent Document 1).
JP 2006-64254 A

サーバ装置等の機器は、稼働可能な温度範囲が決まっており、例えば、多数のサーバ装置が高密度に実装されるデータセンターのような空間では、冷却機能が重要となる。図２２に示すように、データセンターにおいて、冷却源である空調機２が冷却空気をラック３に搭載されたサーバ群（複数のサーバ装置３１）に対して送って、該サーバ装置３１を冷却するシステムを想定する。冷却空気は、図中の矢印で示す流路に従って、空調機２からラック３内のサーバ装置３１に送られ、サーバ装置３１から排気される。図中のＦがサーバ装置３１への吸気であり、Ｇがサーバ装置３１からの排気である。なお、以下ではサーバ装置３１の冷却について議論するため、吸気に着目する。サーバ装置は、ネットワーク４を通じて監視装置１０に接続されている。各サーバ装置３１は、温度センサを備える。何らかのミスで障害物（例えばケーブルの束）等が空調機２からサーバ装置３１に至るまでの流路内に設置され、サーバ装置３１の冷却が十分に行われず、その結果、サーバ装置３１への吸気の温度が異常値まで上昇すると、サーバ装置３１が備える温度センサが、該吸気の温度が異常値まで上昇したこと（温度異常）を検知する。そして、サーバ装置３１は、該温度異常の検知結果を監視装置１０に通知する。監視装置１０は、サーバ装置３１から通知された温度異常の検知結果に基づいて、例えば、どのサーバが温度異常を検知したかを判断する。 Equipment such as a server device has a determined temperature range in which the device can be operated. For example, in a space such as a data center where a large number of server devices are mounted at high density, a cooling function is important. As shown in FIG. 22, in the data center, the air conditioner 2 as a cooling source sends cooling air to the server group (a plurality of server devices 31) mounted on the rack 3 to cool the server device 31. Assume a system. The cooling air is sent from the air conditioner 2 to the server device 31 in the rack 3 and exhausted from the server device 31 according to the flow path indicated by the arrow in the figure. F in the drawing is intake air to the server device 31, and G is exhaust air from the server device 31. Hereinafter, in order to discuss cooling of the server device 31, attention is paid to intake air. The server device is connected to the monitoring device 10 through the network 4. Each server device 31 includes a temperature sensor. An obstacle (for example, a bundle of cables) or the like is installed in the flow path from the air conditioner 2 to the server device 31 due to some mistake, and the server device 31 is not sufficiently cooled. When the temperature of the intake air rises to an abnormal value, a temperature sensor provided in the server device 31 detects that the temperature of the intake air has risen to an abnormal value (temperature abnormality). Then, the server device 31 notifies the monitoring device 10 of the detection result of the temperature abnormality. The monitoring device 10 determines, for example, which server has detected the temperature abnormality based on the detection result of the temperature abnormality notified from the server device 31.

しかし、図２２に示すシステムによっては、どのサーバ装置が温度異常を検知したかを判断することはできるが、例えば、障害物が設置された場所等の、サーバ装置の温度異常の原因となる箇所（温度異常原因箇所）を自動で判定することはできない。また、例えば、空調機２が床下等にある場合には、温度異常原因箇所を監視者が目視で特定することは困難である。更に、複数のサーバ装置３１が温度異常を検知した場合には、温度異常原因箇所を判定することは困難である。監視カメラや障害物を探知するセンサを設置して、温度異常原因箇所を特定することも考えられるが、監視カメラ等の設置にコストがかかるという問題がある。 However, depending on the system shown in FIG. 22, it is possible to determine which server device has detected a temperature abnormality. For example, a location that causes a temperature abnormality of the server device, such as a place where an obstacle is installed. (Temperature abnormality cause location) cannot be determined automatically. For example, when the air conditioner 2 is under the floor or the like, it is difficult for the monitor to visually identify the cause of the temperature abnormality. Furthermore, when a plurality of server devices 31 detect temperature abnormality, it is difficult to determine the cause of temperature abnormality. Although it is conceivable to install a monitoring camera or a sensor for detecting an obstacle to identify the cause of the temperature abnormality, there is a problem in that the installation of the monitoring camera or the like is costly.

本発明は、機器群に対して供給される冷却空気の流路における温度異常原因箇所を効率良く自動で判定する温度異常原因箇所判定装置の提供を目的とする。 An object of the present invention is to provide a temperature abnormality cause point determination device that efficiently and automatically determines a temperature abnormality cause point in a flow path of cooling air supplied to a device group.

また、本発明は、機器群に対して供給される冷却空気の流路における温度異常原因箇所を効率良く自動で判定する温度異常原因箇所判定方法の提供を目的とする。 It is another object of the present invention to provide a temperature abnormality cause location determination method for efficiently and automatically determining a temperature abnormality cause location in a flow path of cooling air supplied to a device group.

本温度異常原因箇所判定装置は、複数の機器からなる機器群に対して１又は複数の冷却源から供給される冷却空気の流路における温度異常原因箇所を判定する温度異常原因箇所判定装置であって、前記機器の温度異常の検知結果を取得する温度異常検知結果取得手段と、前記機器群に対して供給される冷却空気の流路情報と前記機器の温度異常の検知結果とに基づいて、該冷却空気の流路における温度異常原因箇所を判定する判定手段を備える。 This temperature abnormality cause location determination device is a temperature abnormality cause location determination device that determines a temperature abnormality cause location in a flow path of cooling air supplied from one or more cooling sources to a device group composed of a plurality of devices. Based on the temperature abnormality detection result acquisition means for acquiring the temperature abnormality detection result of the device, the flow path information of the cooling air supplied to the device group, and the temperature abnormality detection result of the device, Judgment means for judging the location of the temperature abnormality in the cooling air flow path is provided.

また、本温度異常原因箇所判定方法は、複数の機器からなる機器群に対して１又は複数の冷却源から供給される冷却空気の流路における温度異常原因箇所を判定する温度異常原因箇所判定方法であって、前記機器の温度異常の検知結果を取得し、前記機器群に対して供給される冷却空気の流路情報と前記機器の温度異常の検知結果とに基づいて、該冷却空気の流路における温度異常原因箇所を判定する。 The temperature abnormality cause location determination method is a temperature abnormality cause location determination method for determining a temperature abnormality cause location in a flow path of cooling air supplied from one or a plurality of cooling sources to a device group including a plurality of devices. A temperature abnormality detection result of the device is obtained, and the flow of the cooling air is determined based on flow path information of the cooling air supplied to the device group and the temperature abnormality detection result of the device. Determine the cause of temperature abnormality on the road.

本温度異常原因箇所判定装置及び本温度異常原因箇所判定方法によれば、複数の機器からなる機器群に対して１又は複数の冷却源から供給される冷却空気の流路における温度異常原因箇所を、機器の温度異常の検知結果と冷却空気の流路情報とに基づいて自動で特定することが可能となる。 According to this temperature abnormality cause location determination apparatus and this temperature abnormality cause location determination method, a temperature abnormality cause location in a flow path of cooling air supplied from one or a plurality of cooling sources to a device group consisting of a plurality of devices is determined. It becomes possible to automatically specify the temperature abnormality detection result of the device and the flow path information of the cooling air.

図１は、本実施形態の温度異常原因箇所判定装置の構成例を示す図である。温度異常原因箇所判定装置１は、複数の機器からなる機器群に対して１又は複数の冷却源から供給される冷却空気の流路における温度異常の原因となる箇所（温度異常原因箇所）を判定する処理装置である。 FIG. 1 is a diagram illustrating a configuration example of a temperature abnormality cause location determination apparatus according to the present embodiment. The temperature abnormality cause location determination apparatus 1 determines a location (temperature abnormality cause location) that causes a temperature abnormality in a flow path of cooling air supplied from one or a plurality of cooling sources to a device group including a plurality of devices. Is a processing device.

図１では、機器としてサーバ装置を例にとって説明するが、本実施形態の温度異常原因箇所判定装置１は、ネットワーク４を通じて温度異常原因箇所判定装置１に対して温度異常の検知結果を通知する機能を備える任意の機器（例えば、ルーター、ストレージ装置等）に適用可能である。また、図１に示す空調機２、ラック３、該ラック３に格納される複数のサーバ装置３１、ネットワーク４は、それぞれ、図２２を参照して前述した空調機２、ラック３、サーバ装置３１、ネットワーク４と同様である。空調機２とサーバ装置３１とがデータセンターを構成し、温度異常原因箇所判定装置１は、該データセンターを監視対象として温度異常原因箇所を判定する。また、図中のＦはサーバ装置３１への吸気であり、Ｇはサーバ装置３１からの排気である。なお、図１中には複数の空調機２が示されているが、空調機２が１台でもよい。空調機２が冷却空気を送出すると、該冷却空気が図中の矢印で示す流路に従ってラック３内のサーバ装置３１に送られ、サーバ装置３１から排気される。また、サーバ装置３１は、吸気の温度が異常値まで上昇したこと（温度異常）を検知する温度センサを備えており、サーバ装置３１は、該温度異常の検知結果をネットワーク４を通じて温度異常原因箇所判定装置１に通知する。 In FIG. 1, a server apparatus is described as an example of the device. However, the temperature abnormality cause location determination device 1 according to the present embodiment notifies the temperature abnormality cause location determination device 1 of the temperature abnormality cause determination device 1 through the network 4. The present invention can be applied to any device (for example, a router, a storage device, etc.) including Further, the air conditioner 2, the rack 3, the plurality of server devices 31 stored in the rack 3, and the network 4 shown in FIG. 1 are respectively the air conditioner 2, the rack 3, and the server device 31 described above with reference to FIG. This is the same as the network 4. The air conditioner 2 and the server device 31 constitute a data center, and the temperature abnormality cause location determination apparatus 1 determines the temperature abnormality cause location using the data center as a monitoring target. Further, F in the figure is intake air to the server device 31, and G is exhaust air from the server device 31. In addition, although the some air conditioner 2 is shown in FIG. 1, the one air conditioner 2 may be sufficient. When the air conditioner 2 sends out the cooling air, the cooling air is sent to the server device 31 in the rack 3 according to the flow path indicated by the arrow in the figure, and is exhausted from the server device 31. In addition, the server device 31 includes a temperature sensor that detects that the temperature of the intake air has risen to an abnormal value (temperature abnormality). The determination device 1 is notified.

温度異常原因箇所判定装置１は、温度異常検知結果取得部１１、判定部１２、通知部１３、流路情報データベース（ＤＢ）１４を備える。温度異常検知結果取得部１１は、ネットワーク４を通じて、サーバ装置３１から温度異常の検知結果（温度異常検知結果）を取得する。判定部１２は流路情報ＤＢ１４に記憶された流路情報と、温度異常検知結果取得部１１によって取得されたサーバ装置３１の温度異常検知結果とに基づいて、冷却空気の流路における温度異常原因箇所を判定する。流路情報は、空調機２からサーバ群（複数のサーバ装置３１）に対して供給される冷却空気の流路の情報である。本実施形態では、空調機２から始まりサーバ装置３１に至るまでの、冷却空気が流れる経路（ルート）を冷却空気の流路とする。すなわち、判定部１２は、該冷却空気の流路内のどの箇所が原因となってサーバ装置３１において温度異常が検知されたかを判定する。通知部１３は、判定部１２によって判定された温度異常原因箇所を、例えば表示して、温度異常原因箇所判定装置１のユーザに通知する。流路情報ＤＢ１４には流路情報が記憶される。 The temperature abnormality cause location determination apparatus 1 includes a temperature abnormality detection result acquisition unit 11, a determination unit 12, a notification unit 13, and a flow path information database (DB) 14. The temperature abnormality detection result acquisition unit 11 acquires a temperature abnormality detection result (temperature abnormality detection result) from the server device 31 through the network 4. Based on the flow path information stored in the flow path information DB 14 and the temperature abnormality detection result of the server device 31 acquired by the temperature abnormality detection result acquisition unit 11, the determination unit 12 causes the temperature abnormality in the cooling air flow path. Determine the location. The flow path information is information on the flow path of the cooling air supplied from the air conditioner 2 to the server group (a plurality of server apparatuses 31). In the present embodiment, a path (route) through which cooling air flows from the air conditioner 2 to the server device 31 is defined as a cooling air flow path. That is, the determination unit 12 determines which part of the cooling air flow path causes a temperature abnormality in the server device 31. The notification unit 13 displays, for example, the temperature abnormality cause location determined by the determination unit 12 and notifies the user of the temperature abnormality cause location determination apparatus 1. The flow path information DB 14 stores flow path information.

なお、本実施形態の温度異常原因箇所判定装置１の機能は、ＣＰＵとその上で実行されるプログラムにより実現される。該プログラムは、コンピュータが読み取り可能な記録媒体、例えば半導体メモリ、ハードディスク、ＣＤ−ＲＯＭ、ＤＶＤ等に格納することができ、これらの記録媒体に記録して提供され、又は、通信インタフェースを介してネットワークを利用した送受信により提供される。 In addition, the function of the temperature abnormality cause location determination apparatus 1 of this embodiment is implement | achieved by CPU and the program run on it. The program can be stored in a computer-readable recording medium, such as a semiconductor memory, a hard disk, a CD-ROM, a DVD, or the like, provided by being recorded in these recording media, or a network via a communication interface. Provided by sending and receiving using.

以下に、本実施形態の温度異常原因箇所判定装置１による温度異常原因箇所の判定処理の概要について説明する。図２は、本実施形態の温度異常原因箇所判定装置１が温度異常原因箇所判定処理の対象（監視対象）とするデータセンターのモデル図である。データセンター内のサーバ装置を効率よく冷却するためには、空調機は単に近傍の空気を冷やせばよいのではなく、空気を循環させる必要がある。また、各サーバ装置は、動作温度条件が決まっており、環境温度が動作温度条件の範囲外になってはならない。また、空調機の冷却能力が不足するとサーバ装置が十分に冷却されなくなる。このような要素は、データセンター内の増設やレイアウトを変更する時に考慮されるべきものである。従って、本実施形態において、冷却空気の流れについて、予め設計されているものとする。データセンターのサーバ装置は常時稼働するため、冷却空気の流れは定常的であり、冷却空気の流れを、図２に示すモデル図のように、方向付きの経路として表現することができる。 Below, the outline | summary of the determination process of the temperature abnormality cause location by the temperature abnormality cause location determination apparatus 1 of this embodiment is demonstrated. FIG. 2 is a model diagram of a data center that is a target (monitoring target) of temperature abnormality cause location determination processing by the temperature abnormality cause location determination apparatus 1 of the present embodiment. In order to efficiently cool the server apparatus in the data center, the air conditioner does not simply cool the air in the vicinity but needs to circulate the air. In addition, each server device has an operating temperature condition, and the environmental temperature should not be outside the range of the operating temperature condition. Further, if the cooling capacity of the air conditioner is insufficient, the server device is not sufficiently cooled. Such factors should be considered when adding or changing layouts in the data center. Therefore, in this embodiment, it is assumed that the flow of the cooling air is designed in advance. Since the server device in the data center operates constantly, the flow of cooling air is steady, and the flow of cooling air can be expressed as a route with a direction as shown in the model diagram shown in FIG.

図２中の矢印は冷却空気の流れの向きを示す。図２に示すモデル図は２次元の図であるが、温度異常原因箇所判定処理の対象とするデータセンターのモデル図が３次元であってもよい。図２中の矢印は、冷却空気が流れる方向を示している。すなわち、空調機Ｃ１からの冷却空気は、サーバ装置Ａ１、Ａ２、Ａ３の各々に対して送出され、サーバ装置Ａ１、Ａ２、Ａ３から排出される。また、空調機Ｃ２から送出された冷却空気は、サーバ装置Ａ４、Ａ５、Ａ６の各々に対して送出され、サーバ装置Ａ４、Ａ５、Ａ６から排出される。該モデル図が示す冷却空気の流路の情報（流路情報）は流路情報ＤＢ１４に予め記憶されている。 The arrows in FIG. 2 indicate the direction of the cooling air flow. The model diagram shown in FIG. 2 is a two-dimensional diagram, but the model diagram of the data center that is the target of the temperature abnormality cause location determination process may be three-dimensional. The arrows in FIG. 2 indicate the direction in which the cooling air flows. That is, the cooling air from the air conditioner C1 is sent to each of the server devices A1, A2, and A3, and is discharged from the server devices A1, A2, and A3. In addition, the cooling air sent from the air conditioner C2 is sent to each of the server devices A4, A5, and A6 and discharged from the server devices A4, A5, and A6. The flow path information (flow path information) of the cooling air indicated by the model diagram is stored in advance in the flow path information DB 14.

ここで、例えば、サーバ装置Ａ５とサーバ装置Ａ６とが温度異常を検知したと想定する。図３に示すように、サーバ装置Ａ５とサーバ装置Ａ６とが温度異常検知結果を温度異常原因箇所判定装置１に通知し（図３の＃１を参照）、温度異常原因箇所判定装置１の温度異常検知結果取得部１１が、該温度異常検知結果を取得する。温度異常原因箇所判定装置１の判定部１２が、流路情報ＤＢ１４から図２に示すモデル図が示す冷却空気の流路情報を抽出し、抽出された流路情報と、上記取得された温度異常抽出結果とに基づいて、以下のようにして、冷却空気の流路における温度異常原因箇所を判定する。 Here, for example, it is assumed that the server device A5 and the server device A6 have detected a temperature abnormality. As shown in FIG. 3, the server apparatus A5 and the server apparatus A6 notify the temperature abnormality cause location determination device 1 of the temperature abnormality detection result (see # 1 in FIG. 3), and the temperature of the temperature abnormality cause location determination device 1 The abnormality detection result acquisition unit 11 acquires the temperature abnormality detection result. The determination unit 12 of the temperature abnormality cause location determination device 1 extracts the cooling air flow path information shown in the model diagram shown in FIG. 2 from the flow path information DB 14, and the extracted flow path information and the acquired temperature abnormality Based on the extraction result, the cause of temperature abnormality in the cooling air flow path is determined as follows.

まず、空調機２から正常なサーバ装置３１（温度異常を検知していないサーバ装置３１）に至るルート内には、温度異常原因箇所はないはずである。そこで、判定部１２が、図４中の太線で示す、空調機２から正常なサーバ装置３１に至るルート（第１ルート）を求める。 First, in the route from the air conditioner 2 to a normal server device 31 (a server device 31 that has not detected a temperature abnormality), there should be no cause for temperature abnormality. Therefore, the determination unit 12 obtains a route (first route) from the air conditioner 2 to the normal server device 31 indicated by a thick line in FIG.

次に、温度異常原因箇所は、空調機２から温度異常を検知したサーバ装置３１に至るルート内にあるはずである。そこで、判定部１２が、図５中の太線で示す、空調機Ｃ２から温度異常を検知したサーバ装置３１に至るルート（第２ルート）を求める。 Next, the temperature abnormality cause location should be in the route from the air conditioner 2 to the server device 31 that detected the temperature abnormality. Therefore, the determination unit 12 obtains a route (second route) from the air conditioner C2 to the server device 31 that detects the temperature abnormality, which is indicated by a thick line in FIG.

判定部１２は、上記求めた第２ルートに含まれ、かつ、第１ルートに含まれない部分（図６中の太線で示す部分）を、温度異常原因箇所として判定する。判定部１２が、第２ルートに含まれ、かつ、第１ルートに含まれない部分を温度異常原因箇所として判定することによって、監視対象とするデータセンターにおける温度異常原因箇所を効率良く自動で求めることができる。 The determination unit 12 determines a portion (a portion indicated by a thick line in FIG. 6) included in the obtained second route and not included in the first route as a temperature abnormality cause location. The determination unit 12 determines a portion that is included in the second route and is not included in the first route as a temperature abnormality cause location, thereby efficiently and automatically obtaining the temperature abnormality cause location in the data center to be monitored. be able to.

判定部１２が、温度異常が検知されたサーバ装置３１が複数ある場合に、各々の温度異常が検知されたサーバ装置３１から空調機２に至るまでの辺の集合の論理積を上記第２のルートとするようにしてもよい。判定部１２が、各々の温度異常が検知されたサーバ装置３１から空調機２に至るまでの辺の集合の論理積を上記第２ルートとすることによって、監視対象とするデータセンターにおける温度異常原因箇所を精度良く求めることができる。 When there are a plurality of server devices 31 in which the temperature abnormality is detected, the determination unit 12 calculates the logical product of the set of edges from the server device 31 in which each temperature abnormality is detected to the air conditioner 2. A route may be used. The determination unit 12 uses the logical product of the set of sides from the server device 31 where each temperature abnormality is detected to the air conditioner 2 as the second route, thereby causing the temperature abnormality in the data center to be monitored. The location can be obtained with high accuracy.

以下に、流路情報ＤＢ１４に記憶される流路情報の例について説明する。例えば、前述した図２に示すようなデータセンターのモデル図が示す、空調機Ｃ１、Ｃ２から各サーバ装置に至るまでの冷却空気の流れは、図７に示すような有向グラフで表現することができる。図７に示す有向グラフ中の矢印は冷却空気の流れの向きを示す。図７に示す有向グラフは、ノード（Ｃｎ，Ｎｎ，Ａｎ）と、隣り合うノード間を結んだルート（以下、辺）とからなる。具体的には、図７中のＣ１、Ｃ２は、それぞれ、図２中の空調機Ｃ１を示すノード、Ｃ２を示すノードである。また、Ａ１、Ａ２、Ａ３、Ａ４、Ａ５、Ａ６は、それぞれ、図２中のサーバ装置Ａ１を示すノード、サーバ装置Ａ２を示すノード、サーバ装置Ａ３を示すノード、サーバ装置Ａ４を示すノード、サーバ装置Ａ５を示すノード、サーバ装置Ａ６を示すノードである。Ｎ１乃至Ｎ６は、中間ノードである。中間ノードは、冷却空気が分岐又は結合するポイントを示すノードである。図７に示す有向グラフの例では、Ｎ１乃至Ｎ６は冷却空気が分岐するポイントを示す中間ノードを示す。 Below, the example of the flow-path information memorize | stored in flow-path information DB14 is demonstrated. For example, the flow of cooling air from the air conditioners C1 and C2 to each server device shown in the model diagram of the data center as shown in FIG. 2 can be expressed as a directed graph as shown in FIG. . The arrows in the directed graph shown in FIG. 7 indicate the direction of the cooling air flow. The directed graph shown in FIG. 7 includes nodes (Cn, Nn, An) and a route (hereinafter referred to as an edge) connecting adjacent nodes. Specifically, C1 and C2 in FIG. 7 are a node indicating the air conditioner C1 and a node indicating C2 in FIG. 2, respectively. A1, A2, A3, A4, A5, and A6 are respectively a node indicating the server device A1, a node indicating the server device A2, a node indicating the server device A3, a node indicating the server device A4, and a server in FIG. A node indicating the device A5 and a node indicating the server device A6. N1 to N6 are intermediate nodes. The intermediate node is a node indicating a point at which the cooling air branches or joins. In the example of the directed graph shown in FIG. 7, N1 to N6 indicate intermediate nodes indicating points where the cooling air branches.

すなわち、図７中の有向グラフが示す情報は、図８に示すようにノードの集合と辺の集合とを含んでいる。図８に示す情報が、図２に示すモデル図が示す冷却空気の流路情報である。図８中、辺は、「始点ノード→終点ノード」の形式で表現される。例えば、Ｃ１→Ｎ１は、ノードＣ１とノードＮ１との間の辺を示し、Ｎ１→Ａ１は、ノードＮ１とノードＡ１との間の辺を示す。流路情報ＤＢ１４には、例えば図８に示すような冷却空気の流路情報が、以下に説明するようなプログラム形式で記憶される。 That is, the information indicated by the directed graph in FIG. 7 includes a set of nodes and a set of edges as shown in FIG. The information shown in FIG. 8 is the flow path information of the cooling air shown in the model diagram shown in FIG. In FIG. 8, the side is expressed in the format of “start node → end node”. For example, C1 → N1 indicates a side between the node C1 and the node N1, and N1 → A1 indicates a side between the node N1 and the node A1. In the flow path information DB 14, for example, flow path information of cooling air as shown in FIG. 8 is stored in a program format as described below.

例えば、流路情報ＤＢ１４には、ノードの集合（リスト）が、プログラム形式で予め記憶されている。ノードのプログラム表現例として、Ｃ言語の構造体を以下に示す。
typedef struct node ｛
string *indentfier; // 識別子Ｃ１，Ｎ１など
string *location; // 位置情報柱Ａ３など
｝ node ＿t;
上記ノードのプログラム表現例において、"indentfier"は、例えば、”Ｃ１”、”Ｎ１”等の、ノードを一意に特定する、ノードの識別子である。識別子はユニークであることに加え、サーバの場合はホスト名やＩＰアドレスなど実機情報と対応付けられている。"location"は、例えば柱番号等のノードの位置情報である。ノードの位置を特定できる情報であれば、位置情報として任意の表現を用いることができる。 For example, a set (list) of nodes is stored in advance in a program format in the flow path information DB 14. As an example of a program representation of a node, a C language structure is shown below.
typedef struct node {
string * indentfier; // identifier C1, N1, etc.
string * location; // location information pillar A3 etc.} node _t;
In the example of the program representation of the node, “indentfier” is a node identifier that uniquely identifies the node, such as “C1” or “N1”. In addition to being unique, in the case of a server, the identifier is associated with actual machine information such as a host name and an IP address. “location” is node position information such as a column number, for example. Any expression can be used as the position information as long as the information can identify the position of the node.

ノードのリストは、例えば、以下に示すように、配列で表現できる。
node＿t nodes[];
すなわち、流路情報ＤＢ１４に記憶されている流路情報に含まれるノードのリストは、例えば図９に示すようなデータ構造を有する。図９中、ｉｄはノードの識別子、ｌｏｃはノードの位置情報である。 The list of nodes can be expressed by an array as shown below, for example.
node_t nodes [];
That is, the list of nodes included in the flow path information stored in the flow path information DB 14 has, for example, a data structure as shown in FIG. In FIG. 9, id is a node identifier, and loc is node position information.

辺は、始点ノードと終点ノードのペアで表現できるので、例えばＣ言語で以下のように表現できる。src 、dst は、それぞれ、辺の始点ノード、終点ノードを意味する。
struct edge ｛
node＿t src; //始点
node＿t dst; //終点
｝;
また、辺の集合（リスト）は、配列を用いて、
struct edge edges[];
と表現できる。 Since an edge can be expressed by a pair of a start point node and an end point node, for example, it can be expressed as follows in C language. src and dst mean the start point node and end point node of the side, respectively.
struct edge {
node_t src; // starting point
node_t dst; // end point};
In addition, the set of edges (list) is an array,
struct edge edges [];
Can be expressed as

すなわち、流路情報ＤＢ１４に記憶されている流路情報に含まれる辺のリストは、例えば図１０に示すようなデータ構造を有する。 That is, the list of sides included in the channel information stored in the channel information DB 14 has a data structure as shown in FIG. 10, for example.

各サーバ装置３１は、自装置が備える温度センサによる温度異常の検知結果を有している。すなわち、各サーバ装置３１は、温度異常（ＮＧ）か正常（ＯＫ）かのステータスを示す情報を持つ。従って、サーバ装置３１の集合（リスト）の情報を以下のようなプログラムを用いて表現することができる。なお、温度異常原因箇所判定装置１の温度異常検知結果取得部１１は、定期的又は任意のタイミングでサーバ装置３１から取得した温度異常の検知結果に基づいて、サーバ装置３１のステータスに関する情報（ステータス情報）をサーバ装置３１のリストの情報中に設定し、該サーバ装置３１のリストの情報を流路情報ＤＢ１４に記憶する。
struct server ｛
node＿t node; //どのノードかを示す
int status; //ステータス（ OK or NG ）
｝；
struct server servers[];
上記プログラム表現例において、statusは、サーバ装置３１から取得された温度異常検知結果を示す。statusはＯＫ又はＮＧのうちいずれかである。ＯＫは、サーバ装置３１において温度異常が検知されていないことを示し、ＮＧは、サーバ装置３１において温度異常が検知されていることを示す。また、nodeは、サーバ装置３１を示すノードである。 Each server device 31 has a detection result of a temperature abnormality by a temperature sensor provided in the own device. That is, each server device 31 has information indicating a status of temperature abnormality (NG) or normal (OK). Therefore, the information of the set (list) of server devices 31 can be expressed using the following program. Note that the temperature abnormality detection result acquisition unit 11 of the temperature abnormality cause location determination apparatus 1 is configured to obtain information on the status of the server device 31 (status) based on the temperature abnormality detection result acquired from the server device 31 periodically or at an arbitrary timing. Information) is set in the list information of the server device 31, and the list information of the server device 31 is stored in the flow path information DB.
struct server {
node_t node; // indicate which node
int status; // Status (OK or NG)
};
struct server servers [];
In the program expression example, status indicates a temperature abnormality detection result acquired from the server device 31. status is either OK or NG. OK indicates that a temperature abnormality is not detected in the server device 31, and NG indicates that a temperature abnormality is detected in the server device 31. Node is a node indicating the server device 31.

すなわち、流路情報ＤＢ１４に記憶される流路情報に含まれる、サーバ装置３１のリストは、例えば図１１に示すようなデータ構造を有する。図１１中のｎｄはサーバ装置３１を示すノード、ｓｔｓはサーバ装置３１のステータスを示す。 That is, the list of server devices 31 included in the flow path information stored in the flow path information DB 14 has a data structure as shown in FIG. 11, for example. In FIG. 11, nd indicates a node indicating the server device 31, and sts indicates a status of the server device 31.

判定部１２は、例えば、該温度異常原因箇所が存在する辺の始点のノードと終点のノードの情報を温度異常原因箇所として出力する。従って、出力される温度異常原因箇所の情報は、例えば、図１２に示すようなデータ構造を有する。 For example, the determination unit 12 outputs information on the start point node and the end point node of the side where the temperature abnormality cause location exists as the temperature abnormality cause location. Accordingly, the output information on the location of the cause of temperature abnormality has a data structure as shown in FIG. 12, for example.

図１３は、本実施形態の温度異常原因箇所判定装置による温度異常原因箇所の判定処理フローの一例を示す図である。まず、温度異常原因箇所判定装置１の温度異常検知結果取得部１１が、各サーバ装置３１から温度異常検知結果を、例えば温度情報とともに取得する（ステップＳ１）。具体的には、温度異常検知結果取得部１１が、監視対象のデータセンター内のサーバ装置３１から定期的又は温度異常の検知時に送信される温度異常検知結果を取得する。 FIG. 13 is a diagram illustrating an example of a determination process flow of a temperature abnormality cause location by the temperature abnormality cause location determination apparatus of the present embodiment. First, the temperature abnormality detection result acquisition part 11 of the temperature abnormality cause location determination apparatus 1 acquires the temperature abnormality detection result from each server device 31, for example, with temperature information (step S1). Specifically, the temperature abnormality detection result acquisition unit 11 acquires a temperature abnormality detection result transmitted from the server device 31 in the monitoring target data center periodically or when a temperature abnormality is detected.

判定部１２が、取得された温度異常検知結果に基づいて、温度異常があるかを判断する（ステップＳ２）。なお、判定部１２が、各サーバ装置３１から取得された温度情報に基づいて、該温度情報が示す温度が所定の閾値を越えるかを判断し、該温度が閾値を超えた場合に、温度異常があると判断するようにしてもよい。 The determination unit 12 determines whether there is a temperature abnormality based on the acquired temperature abnormality detection result (step S2). Note that the determination unit 12 determines whether the temperature indicated by the temperature information exceeds a predetermined threshold based on the temperature information acquired from each server device 31, and if the temperature exceeds the threshold, the temperature abnormality You may make it judge that there exists.

判定部１２が、温度異常がないと判断した場合は処理を終了する。判定部１２が、温度異常があると判断した場合は、監視対象のデータセンター内の全てのサーバ装置３１から同時刻の温度異常検知結果を取得するまで待つ（ステップＳ３）。ステップＳ３においては、更に、温度異常検知結果取得部１１が、各々のサーバ装置３１から取得した温度異常検知結果に基づいて、各々のサーバ装置３１のステータスに関する情報をサーバ装置３１のリストの情報中に設定し、サーバ装置３１のリストの情報を流路情報の一部として流路情報ＤＢ１４に記憶する。次に、判定部１２が、温度異常原因箇所（被疑箇所）を絞り込んで決定する（ステップＳ４）。判定部１２は、例えば辺を被疑箇所として決定する。 If the determination unit 12 determines that there is no temperature abnormality, the process ends. When the determination unit 12 determines that there is a temperature abnormality, the determination unit 12 waits until the temperature abnormality detection result at the same time is acquired from all the server devices 31 in the monitored data center (step S3). In step S <b> 3, the temperature abnormality detection result acquisition unit 11 further displays information regarding the status of each server device 31 in the information of the server device 31 based on the temperature abnormality detection result acquired from each server device 31. Is stored in the flow path information DB 14 as part of the flow path information. Next, the determination part 12 narrows down and determines the temperature abnormality cause location (suspected location) (step S4). For example, the determination unit 12 determines the side as the suspected place.

判定部１２が、被疑箇所が見つかったかを判断する（ステップＳ５）。すなわち、判定部１２は、被疑箇所の絞り込みに成功したか失敗したかを判断する。判定部１２が、被疑箇所が見つからなかったと判断した場合は、処理を終了する。判定部１２が、被疑箇所が見つかったと判断した場合は、通知部１３が、被疑箇所を通知する（ステップＳ６）。通知部１３は、例えば、被疑箇所である辺の情報をディスプレイ上に表示したり、電子メール等で通知したりする。通知部１３が、グラフィック形式で被疑箇所を表示するようにしてもよいし、被疑箇所である辺の始点ノード、終点ノードの位置情報（柱番号等）を表示するようにしてもよい。 The determination unit 12 determines whether a suspected place has been found (step S5). That is, the determination unit 12 determines whether the suspected part has been successfully narrowed down or failed. If the determination unit 12 determines that the suspected place has not been found, the process ends. When the determination unit 12 determines that a suspected place has been found, the notification unit 13 notifies the suspected place (step S6). The notification unit 13 displays, for example, information on a side that is a suspected place on a display, or notifies by e-mail or the like. The notification unit 13 may display the suspected place in a graphic format, or may display the position information (post number etc.) of the start point node and end point node of the side that is the suspected place.

図１４は、図１３のステップＳ４における被疑箇所の絞り込み処理の詳細を説明する図である。まず、判定部１２が、温度異常が検知されていないサーバ装置３１から空調機２に至るまでの辺の集合Ｓ１（第１の集合）を求める（ステップＳ４１）。次に、判定部１２が、温度異常が検知されたサーバ装置３１から空調機２に至るまでの辺の集合Ｓ２（第２の集合）を求める（ステップＳ４２）。そして、Ｓ２に含まれ、かつ、Ｓ１に含まれない辺を被疑箇所として判定する（ステップＳ４３）。 FIG. 14 is a diagram illustrating details of the suspected place narrowing-down process in step S4 of FIG. First, the determination unit 12 obtains a set S1 (first set) of sides from the server device 31 from which no temperature abnormality is detected to the air conditioner 2 (step S41). Next, the determination part 12 calculates | requires the set S2 (2nd set) of the edge | side from the server apparatus 31 from which the temperature abnormality was detected to the air conditioner 2 (step S42). Then, the side included in S2 and not included in S1 is determined as the suspected place (step S43).

図１５は、図１４のステップＳ４１における辺の集合Ｓ１を求める処理の詳細を説明する図である。まず、判定部１２が、流路情報ＤＢ１４から流路情報を抽出し、該流路情報に含まれるサーバ装置３１のリスト（図１１を参照）から、サーバ装置３１を示すノードを順番に取り出す（ステップＳ１０１）。次に、判定部１２が、取り出されたノードのステータスがＯＫであるかを判断する（ステップＳ１０２）。判定部１２が、取り出されたノードのステータスがＯＫでないと判断した場合は、ステップＳ１０５に進む。判定部１２が、取り出されたノードのステータスがＯＫであると判断した場合は、判定部１２は、該ノードに繋がる辺を全て求め（ステップＳ１０３）、求められた辺の集合をＡとする。判定部１２が、ステップＳ１０３で求められる辺の集合ＡをＯＲ演算し、ＯＲ演算結果をＳ１として求める（ステップＳ１０４）。具体的には、Ｓ１とＡとをＯＲ演算した結果をＳ１とする。判定部１２が、サーバ装置３１のリストに次ノードがあるかを判断する（ステップＳ１０５）。判定部１２が、サーバ装置３１のリストに次ノードがあると判断した場合は、ステップＳ１０１に戻る。判定部１２が、サーバ装置３１のリストに次ノードがないと判断した場合は、処理を終了する。 FIG. 15 is a diagram for explaining the details of the processing for obtaining the edge set S1 in step S41 of FIG. First, the determination unit 12 extracts the flow path information from the flow path information DB 14, and sequentially extracts the nodes indicating the server apparatuses 31 from the list of server apparatuses 31 included in the flow path information (see FIG. 11) ( Step S101). Next, the determination unit 12 determines whether the status of the extracted node is OK (step S102). If the determination unit 12 determines that the status of the extracted node is not OK, the process proceeds to step S105. If the determination unit 12 determines that the status of the extracted node is OK, the determination unit 12 determines all sides connected to the node (step S103), and sets the determined set of sides as A. The determination unit 12 performs an OR operation on the set A of edges obtained in step S103, and obtains an OR operation result as S1 (step S104). Specifically, the result of ORing S1 and A is S1. The determination unit 12 determines whether there is a next node in the list of the server device 31 (step S105). If the determination unit 12 determines that there is a next node in the list of the server device 31, the process returns to step S101. If the determination unit 12 determines that there is no next node in the list of the server device 31, the process ends.

図１６は、図１４のステップＳ４２における辺の集合Ｓ２を求める処理の詳細を説明する図である。まず、判定部１２が、流路情報ＤＢ１４から流路情報を抽出し、該流路情報に含まれるサーバ装置３１のリスト（図１１を参照）から、サーバ装置３１を示すノードを順番に取り出す（ステップＳ２０１）。次に、判定部１２が、取り出されたノードのステータスがＮＧであるかを判断する（ステップＳ２０２）。判定部１２が、取り出されたノードのステータスがＮＧでないと判断した場合は、ステップＳ２０５に進む。判定部１２が、取り出されたノードのステータスがＮＧであると判断した場合は、判定部１２は、該ノードに繋がる辺を全て求め、求められた辺の集合をＢとする（ステップＳ２０３）。判定部１２が、ステップＳ２０３で求められる辺の集合ＢをＡＮＤ演算し、ＡＮＤ演算結果をＳ２として求める（ステップＳ２０４）。具体的には、Ｓ２とＢとの論理積の結果をＳ２とする。求められたＳ２に含まれる辺は、ステータスがＮＧである各々のサーバ装置３１から空調機２に至るまでのルートの共通部分に該当する辺である。判定部１２が、サーバ装置３１のリストに次ノードがあるかを判断する（ステップＳ２０５）。判定部１２が、サーバ装置３１のリストに次ノードがあると判断した場合は、ステップＳ２０１に戻る。判定部１２が、サーバ装置３１のリストに次ノードがないと判断した場合は、処理を終了する。 FIG. 16 is a diagram for explaining the details of the processing for obtaining the edge set S2 in step S42 of FIG. First, the determination unit 12 extracts the flow path information from the flow path information DB 14, and sequentially extracts the nodes indicating the server apparatuses 31 from the list of server apparatuses 31 included in the flow path information (see FIG. 11) ( Step S201). Next, the determination unit 12 determines whether the status of the extracted node is NG (step S202). If the determination unit 12 determines that the status of the extracted node is not NG, the process proceeds to step S205. If the determination unit 12 determines that the status of the extracted node is NG, the determination unit 12 determines all sides connected to the node, and sets the obtained set of sides as B (step S203). The determination unit 12 performs an AND operation on the edge set B obtained in step S203, and obtains an AND operation result as S2 (step S204). Specifically, the result of the logical product of S2 and B is S2. The sides included in the obtained S2 are sides corresponding to the common part of the route from each server device 31 whose status is NG to the air conditioner 2. The determination unit 12 determines whether there is a next node in the list of the server device 31 (step S205). If the determination unit 12 determines that there is a next node in the list of the server device 31, the process returns to step S201. If the determination unit 12 determines that there is no next node in the list of the server device 31, the process ends.

図１７は、図１５のＳ１０３における、辺の集合Ａを求める処理の詳細を説明する図である。図１６のＳ２０３における、辺の集合Ｂを求める処理は、図１７に示す辺の集合Ａを求める処理と同様の手順に従って行われるので、辺の集合Ｂを求める処理の詳細については説明を省略する。 FIG. 17 is a diagram for explaining the details of the processing for obtaining the edge set A in S103 of FIG. The process for obtaining the edge set B in S203 of FIG. 16 is performed according to the same procedure as the process for obtaining the edge set A shown in FIG. 17, and thus the description of the details of the process for obtaining the edge set B is omitted. .

まず、判定部１２が、図１５のステップＳ１０１において抽出された流路情報に含まれる辺のリスト（図１０を参照）から辺ｅを取り出す（ステップＳ３０１）。判定部１２が、辺ｅの終点ノードがサーバ装置であるかを判断する（ステップＳ３０２）。判定部１２が、辺ｅの終点ノードがサーバ装置でない（終点ノードが中間ノードである）と判断した場合、判定部１２が、その中間ノードを始点ノードとする辺を探索して求め（ステップＳ３０３）、ステップＳ３０２に戻る。具体的には、求まった辺を辺ｅとして、上記Ｓ３０２における処理の対象とする。 First, the determination unit 12 extracts the side e from the list of sides (see FIG. 10) included in the flow path information extracted in step S101 of FIG. 15 (step S301). The determination unit 12 determines whether the end point node of the side e is a server device (step S302). When the determination unit 12 determines that the end point node of the edge e is not a server device (the end point node is an intermediate node), the determination unit 12 searches and obtains an edge having the intermediate node as the start point node (step S303). ), The process returns to step S302. Specifically, the obtained edge is set as the edge e, and the process is performed in S302.

判定部１２が、辺ｅの終点ノードがサーバ装置であると判断した場合、判定部１２が、終点ノードが処理対象のサーバ装置であるかを判断する（ステップＳ３０４）。処理対象のサーバ装置は、上記図１５のステップＳ１０３において、繋がる辺を求める対象となるノードが示すサーバ装置である。判定部１２が、終点ノードが処理対象のサーバ装置でないと判断した場合は、ステップＳ３０６に進む。判定部１２が、終点ノードが処理対象のサーバ装置であると判断した場合は、辺ｅを辺の集合Ａに加える（ステップＳ３０５）。ステップＳ３０５の処理を繰り返すことによって、辺の集合Ａが求められる。次に、判定部１２が、辺のリストに次の辺があるかを判断する（ステップＳ３０６）。判定部１２が、辺のリストに次の辺があると判断した場合は、ステップＳ３０１に戻る。判定部１２が、辺のリストに次の辺がないと判断した場合は、処理を終了する。 When the determination unit 12 determines that the end node of the side e is a server device, the determination unit 12 determines whether the end node is a server device to be processed (step S304). The server device to be processed is the server device indicated by the node that is the target for obtaining the connected side in step S103 of FIG. If the determination unit 12 determines that the end node is not the server server to be processed, the process proceeds to step S306. When the determination unit 12 determines that the end node is a server device to be processed, the side e is added to the side set A (step S305). By repeating the processing in step S305, a set A of edges is obtained. Next, the determination unit 12 determines whether there is a next side in the side list (step S306). If the determination unit 12 determines that there is a next side in the side list, the process returns to step S301. If the determination unit 12 determines that there is no next side in the side list, the process ends.

図１８は、本実施形態の温度異常原因箇所判定装置による温度異常原因箇所の判定処理の第１の適用例を説明する図である。この例では、図１８に示すように、監視対象のデータセンターには、１台の空調機Ｃ１と１台のサーバ装置Ａ１とが含まれる。また、サーバ装置Ａ１のステータスはＮＧである。温度異常が検知されていないサーバ装置から空調機に至るまでの辺はないので、温度異常原因箇所判定装置１が求める辺の集合Ｓ１は空集合（Ｓ１＝φ）となる。また、温度異常が検知されたサーバ装置から空調機に至るまでの辺は、ｅ１であるので、温度異常原因箇所判定装置１が求める辺の集合Ｓ２に含まれる辺は、ｅ１である（Ｓ２＝｛ｅ１｝）。Ｓ２に含まれ、かつＳ１に含まれない辺はｅ１である。従って、温度異常原因箇所判定装置１はｅ１を温度異常原因箇所として決定する。 FIG. 18 is a diagram illustrating a first application example of the determination process of the temperature abnormality cause location by the temperature abnormality cause location determination apparatus of the present embodiment. In this example, as shown in FIG. 18, the data center to be monitored includes one air conditioner C1 and one server device A1. Further, the status of the server device A1 is NG. Since there is no side from the server device in which no temperature abnormality is detected to the air conditioner, the side set S1 obtained by the temperature abnormality cause location determination device 1 is an empty set (S1 = φ). Moreover, since the edge from the server apparatus in which the temperature abnormality is detected to the air conditioner is e1, the edge included in the edge set S2 obtained by the temperature abnormality cause location determination apparatus 1 is e1 (S2 = {E1}). The side included in S2 and not included in S1 is e1. Therefore, the temperature abnormality cause location determination apparatus 1 determines e1 as the temperature abnormality cause location.

図１９は、本実施形態の温度異常原因箇所判定装置による温度異常原因箇所の判定処理の第２の適用例を説明する図である。この例では、図１９に示すように、監視対象のデータセンターには、１台の空調機Ｃ１と２台のサーバ装置Ａ１、Ａ２とが含まれる。また、サーバ装置Ａ１のステータスはＮＧ、サーバ装置Ａ２のステータスはＯＫである。温度異常が検知されていないサーバ装置から空調機に至るまでの辺は、ｅ１とｅ３であるので、Ｓ１＝｛ｅ１，ｅ３｝となる。また、温度異常が検知されたサーバ装置から空調機に至るまでの辺は、ｅ１とｅ２であるので、Ｓ２＝｛ｅ１，ｅ２｝である。Ｓ２に含まれ、かつＳ１に含まれない辺はｅ２である。従って、温度異常原因箇所判定装置１はｅ２を温度異常原因箇所として決定する。 FIG. 19 is a diagram illustrating a second application example of the temperature abnormality cause location determination process performed by the temperature abnormality cause location determination apparatus according to the present embodiment. In this example, as shown in FIG. 19, the data center to be monitored includes one air conditioner C1 and two server apparatuses A1 and A2. Further, the status of the server apparatus A1 is NG, and the status of the server apparatus A2 is OK. Since the sides from the server device where no temperature abnormality is detected to the air conditioner are e1 and e3, S1 = {e1, e3}. Further, since the sides from the server device where the temperature abnormality is detected to the air conditioner are e1 and e2, S2 = {e1, e2}. The side included in S2 and not included in S1 is e2. Therefore, the temperature abnormality cause location determination apparatus 1 determines e2 as the temperature abnormality cause location.

図２０は、本実施形態の温度異常原因箇所判定装置による温度異常原因箇所の判定処理の第３の適用例を説明する図である。この例では、図２０に示すように、監視対象のデータセンターには、２台の空調機Ｃ１、Ｃ２と２台のサーバ装置Ａ１、Ａ２とが含まれる。また、サーバ装置Ａ１のステータスはＮＧ、サーバ装置Ａ２のステータスはＯＫである。温度異常が検知されていないサーバ装置から空調機に至るまでの辺は、ｅ１，ｅ２，ｅ３，ｅ５であるので、Ｓ１＝｛ｅ１，ｅ２，ｅ３，ｅ５｝となる。また、温度異常が検知されたサーバ装置から空調機に至るまでの辺は、ｅ１，ｅ２，ｅ３，ｅ４であるので、Ｓ２＝｛ｅ１，ｅ２，ｅ３，ｅ４｝である。Ｓ２に含まれ、かつＳ１に含まれない辺はｅ４である。従って、温度異常原因箇所判定装置１は、ｅ４を温度異常原因箇所として決定する。 FIG. 20 is a diagram for explaining a third application example of the temperature abnormality cause location determination process performed by the temperature abnormality cause location determination apparatus according to the present embodiment. In this example, as shown in FIG. 20, the data center to be monitored includes two air conditioners C1 and C2 and two server apparatuses A1 and A2. Further, the status of the server apparatus A1 is NG, and the status of the server apparatus A2 is OK. Since the sides from the server device where no temperature abnormality is detected to the air conditioner are e1, e2, e3, e5, S1 = {e1, e2, e3, e5}. Further, since the sides from the server device where the temperature abnormality is detected to the air conditioner are e1, e2, e3, e4, S2 = {e1, e2, e3, e4}. The side included in S2 and not included in S1 is e4. Therefore, the temperature abnormality cause location determination apparatus 1 determines e4 as the temperature abnormality cause location.

図２１は、本実施形態の温度異常原因箇所判定装置による温度異常原因箇所の判定処理の第４の適用例を説明する図である。この例では、温度異常が検知されていないサーバ装置から空調機に至るまでの辺はないので、Ｓ１＝φとなる。温度異常が検知されたサーバ装置Ａ１から空調機に至るまでの辺の集合は、｛ｅ１，ｅ２，ｅ３，ｅ４｝である。温度異常が検知されたサーバ装置Ａ２から空調機に至るまでの辺の集合は、｛ｅ１，ｅ２，ｅ３，ｅ５｝である。従って、辺の集合Ｓ２は、辺の集合｛ｅ１，ｅ２，ｅ３，ｅ４｝と辺の集合｛ｅ１，ｅ２，ｅ３，ｅ５｝との論理積の結果得られる、｛ｅ１，ｅ２，ｅ３｝となる。Ｓ２に含まれ、かつＳ１に含まれない辺はｅ１とｅ２とｅ３である。従って、温度異常原因箇所判定装置１は、ｅ１とｅ２とｅ３とを温度異常原因箇所として決定する。 FIG. 21 is a diagram illustrating a fourth application example of the temperature abnormality cause location determination process performed by the temperature abnormality cause location determination apparatus according to the present embodiment. In this example, since there is no side from the server apparatus in which no temperature abnormality is detected to the air conditioner, S1 = φ. A set of sides from the server device A1 where the temperature abnormality is detected to the air conditioner is {e1, e2, e3, e4}. A set of sides from the server device A2 where the temperature abnormality is detected to the air conditioner is {e1, e2, e3, e5}. Therefore, the edge set S2 is obtained as a result of a logical product of the edge set {e1, e2, e3, e4} and the edge set {e1, e2, e3, e5}, and {e1, e2, e3} Become. The sides included in S2 and not included in S1 are e1, e2, and e3. Therefore, the temperature abnormality cause location determination apparatus 1 determines e1, e2, and e3 as temperature abnormality cause locations.

本実施形態の温度異常原因箇所判定装置の構成例を示す図である。It is a figure which shows the structural example of the temperature abnormality cause location determination apparatus of this embodiment. 温度異常原因箇所判定処理の対象とするデータセンターのモデル図である。It is a model figure of the data center made into the object of temperature abnormality cause location determination processing. 温度異常原因箇所の判定処理の概要を説明する図である。It is a figure explaining the outline | summary of the determination process of a temperature abnormality cause location. 温度異常原因箇所の判定処理の概要を説明する図である。It is a figure explaining the outline | summary of the determination process of a temperature abnormality cause location. 温度異常原因箇所の判定処理の概要を説明する図である。It is a figure explaining the outline | summary of the determination process of a temperature abnormality cause location. 温度異常原因箇所の判定処理の概要を説明する図である。It is a figure explaining the outline | summary of the determination process of a temperature abnormality cause location. 空調機からサーバ装置に至るまでの冷却空気の流れを表現する有向グラフを示す図である。It is a figure which shows the directed graph expressing the flow of the cooling air from an air conditioner to a server apparatus. 有向グラフが示す情報の例である。It is an example of the information which a directed graph shows. 流路情報に含まれるノードのリストを示す図である。It is a figure which shows the list | wrist of the node contained in flow path information. 流路情報に含まれる辺のリストを示す図である。It is a figure which shows the list | wrist of the edge contained in flow path information. 流路情報に含まれる、サーバ装置のリストを示す図である。It is a figure which shows the list | wrist of the server apparatus contained in flow path information. 温度異常原因箇所の情報のデータ構造例を示す図である。It is a figure which shows the example of a data structure of the information of a temperature abnormality cause location. 温度異常原因箇所の判定処理フローの一例を示す図である。It is a figure which shows an example of the determination processing flow of a temperature abnormality cause location. 被疑箇所の絞り込み処理の詳細を説明する図である。It is a figure explaining the detail of the narrowing-down process of a suspected place. 辺の集合Ｓ１を求める処理の詳細を説明する図である。It is a figure explaining the detail of the process which calculates | requires edge set S1. 辺の集合Ｓ２を求める処理の詳細を説明する図である。It is a figure explaining the detail of the process which calculates | requires edge set S2. 辺の集合Ａを求める処理の詳細を説明する図である。It is a figure explaining the detail of the process which calculates | requires edge set A. FIG. 温度異常原因箇所の判定処理の第１の適用例を説明する図である。It is a figure explaining the 1st application example of the determination process of a temperature abnormality cause location. 温度異常原因箇所の判定処理の第２の適用例を説明する図である。It is a figure explaining the 2nd application example of the determination process of a temperature abnormality cause location. 温度異常原因箇所の判定処理の第３の適用例を説明する図である。It is a figure explaining the 3rd application example of the determination process of a temperature abnormality cause location. 温度異常原因箇所の判定処理の第４の適用例を説明する図である。It is a figure explaining the 4th example of application of judgment processing of a temperature abnormal cause part. データセンターにおいてサーバ装置を冷却するシステムの例を示す図である。It is a figure which shows the example of the system which cools a server apparatus in a data center.

Explanation of symbols

１温度異常原因箇所判定装置
２空調機
３ラック
４ネットワーク
１０監視装置
１１温度異常検知結果取得部
１２判定部
１３通知部
１４流路情報ＤＢ
３１サーバ装置 DESCRIPTION OF SYMBOLS 1 Temperature abnormality cause location determination apparatus 2 Air conditioner 3 Rack 4 Network 10 Monitoring apparatus 11 Temperature abnormality detection result acquisition part 12 Judgment part 13 Notification part 14 Flow path information DB
31 Server device

Claims

A temperature abnormality cause location determination device that determines a temperature abnormality cause location in a flow path of cooling air supplied from one or a plurality of cooling sources to a device group composed of a plurality of devices,
A temperature abnormality detection result acquisition means for acquiring a temperature abnormality detection result of the device;
And a determination unit that determines a cause of temperature abnormality in the flow path of the cooling air based on flow path information of the cooling air supplied to the device group and a detection result of the temperature abnormality of the device. Temperature abnormality cause location determination device.

The determination means reaches the cooling source from a device in which no temperature abnormality is detected, with the cooling source, the point where the cooling air branches or joins, and the route connecting the adjacent nodes as a side. Is obtained as a first set, and a set of sides from the device in which the temperature abnormality is detected to the cooling source is obtained as a second set, and is included in the second set, and The temperature abnormality cause location determination apparatus according to claim 1, wherein a side that is not included in the set of 1 is determined as a temperature abnormality cause location in the flow path of the cooling air.

When there are a plurality of devices in which a temperature abnormality is detected, the determination means uses the logical product of a set of edges from the device in which each temperature abnormality is detected to the cooling source as the second set. The temperature abnormality cause location determination apparatus according to claim 2.

A temperature abnormality cause location determination method for determining a temperature abnormality cause location in a flow path of cooling air supplied from one or more cooling sources to a device group consisting of a plurality of devices,
Obtain the temperature abnormality detection result of the device,
Based on flow path information of cooling air supplied to the device group and a detection result of temperature abnormality of the device, a temperature abnormality cause location in the flow path of the cooling air is determined. Cause location determination method.

The cooling source, the point at which cooling air branches or joins, the set of sides from the device in which no temperature abnormality is detected to the cooling source, with the device as a node and the route connecting adjacent nodes as the side Is obtained as a first set, and a set of sides from the device in which the temperature abnormality is detected to the cooling source is obtained as a second set, and is included in the second set and included in the first set. The temperature abnormality cause location determination method according to claim 4, wherein a side that is not present is determined as a temperature abnormality cause location in the cooling air flow path.

When there are a plurality of devices in which a temperature abnormality is detected, a logical product of a set of edges from the device in which each temperature abnormality is detected to the cooling source is defined as the second set. Item 6. The method for determining the cause of temperature abnormality according to Item 5.