WO2012073407A1 - 表示処理システム、表示処理方法、およびプログラム - Google Patents
表示処理システム、表示処理方法、およびプログラム Download PDFInfo
- Publication number
- WO2012073407A1 WO2012073407A1 PCT/JP2011/005089 JP2011005089W WO2012073407A1 WO 2012073407 A1 WO2012073407 A1 WO 2012073407A1 JP 2011005089 W JP2011005089 W JP 2011005089W WO 2012073407 A1 WO2012073407 A1 WO 2012073407A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- server
- predetermined period
- display processing
- failure
- service
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0751—Error or fault detection not based on redundancy
- G06F11/0754—Error or fault detection not based on redundancy by exceeding limits
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/16—Constructional details or arrangements
- G06F1/20—Cooling means
- G06F1/206—Cooling means comprising thermal management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3058—Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3089—Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/325—Display of status information by lamps or LED's
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Definitions
- the present invention relates to a display processing system, a display processing method, and a program, and more particularly to a display processing system, a display processing method, and a program for displaying management information of the system.
- Non-Patent Document 1 describes an example of a system that manages services provided in an integrated manner.
- the current heat accumulation abnormality information of the data center is displayed in a list or the like, and a floor map indicating a heat accumulation occurrence location at the time of abnormality is further displayed.
- the supervisor can check the place where the heat accumulation has occurred on the floor map, and can quickly take measures.
- Patent Document 1 describes a configuration for acquiring and accumulating operation data including power consumption of an air conditioner and visualizing the analyzed data.
- Patent Document 2 discloses a remote real-time asset management system for air-conditioning equipment, which realizes a remote monitoring function of the air-conditioning equipment by monitoring the operating state of the controlled air-conditioning equipment in real time, thereby saving air-conditioning costs. It is described that it is illustrated.
- the present invention has been made in view of the above circumstances, and an object of the present invention is to provide a display processing system, a display processing method, and a program for improving service reliability.
- the display processing system of the present invention includes: The occurrence status information of the failure that has occurred due to the accumulation of heat of the server that provides the service for a predetermined period is displayed separately for the impact level on the service due to the failure, and is associated with the occurrence status information of the failure for the predetermined period. Display means for displaying energy efficiency information of the server is provided.
- the display processing method of the present invention includes: The processing device connected to the display device The occurrence status information of a failure that has occurred due to the accumulation of heat of a server that provides a service for a predetermined period is displayed on the display device in accordance with the impact level on the service due to the failure, and is associated with the occurrence status information of the failure Energy efficiency information of the server for the predetermined period is displayed on the display device.
- the program of the present invention To the computer connected to the display device, The occurrence status information of a failure that has occurred due to the accumulation of heat of a server that provides a service for a predetermined period is displayed on the display device in accordance with the impact level on the service due to the failure, and is associated with the occurrence status information of the failure This is for executing a procedure for displaying the energy efficiency information of the server for the predetermined period on the display device.
- a plurality of components are formed as a single member, and a single component is formed of a plurality of members. It may be that a certain component is a part of another component, a part of a certain component overlaps with a part of another component, or the like.
- the plurality of procedures of the method and computer program of the present invention are not limited to being executed at different timings. For this reason, another procedure may occur during the execution of a certain procedure, or some or all of the execution timing of a certain procedure and the execution timing of another procedure may overlap.
- a display processing system a display processing method, and a program that improve service reliability are provided.
- FIG. 1 is a block diagram showing a configuration of a display processing system 1 according to an embodiment of the present invention.
- the display processing system 1 according to the embodiment of the present invention shows the operation management status of a large number of IT devices such as a large number of servers 7 and services of a data center (DC) 5 connected via a network 3, for example. Information is presented to an administrator who manages the system of the data center 5.
- DC data center
- the display processing system 1 of the present invention presents information that can be grasped at a glance as to whether or not the countermeasure is properly performed.
- SLA Service Level Agreement
- This SLA is a system in which a service provider providing a service using the server 7 of the data center 5 guarantees the service user the quality level of the service provided to the service user.
- the guaranteed quality level of the service is determined for various items according to the rank. For example, it is assumed that the maximum value of response time, the minimum communication speed, the upper limit of service unavailable time, etc. are determined for each rank. In the present embodiment, it is assumed that four ranks S, A, B, and C are determined in descending order of quality.
- the display processing system 1 includes a computer 10 including, for example, a CPU 12, a memory 14, a storage device 16 such as a hard disk, and a communication device (a network connection interface (“I / F” in the figure) 18)).
- the CPU 12 is connected to each element of the computer 10 via the bus 20 and controls the entire computer 10 together with each element.
- a computer 10 of the display processing system 1 is a server computer or a personal computer connected to an input device 40 such as a keyboard or a mouse, a display device 30 such as a display, an output device (not shown) such as a printer, or an equivalent device. Can be realized.
- the CPU 12 reads out the program stored in the storage device 16 to the memory 14 and executes it, thereby realizing each function of each unit described below.
- it has abbreviate
- FIG. 2 is a functional block diagram showing a configuration of the display processing device 100 realized by the computer 10 of the display processing system 1 according to the embodiment of the present invention.
- the display processing system 1 according to the present embodiment generates the occurrence of a failure caused by the accumulation of heat for a predetermined period along with the information on the state of occurrence of the accumulation of heat of the server 7 (FIG. 1) that provides the service for a predetermined period.
- a display unit 114 is provided that displays status information divided into service impact levels due to failures.
- Each component of the display processing apparatus 100 is stored in the CPU 12, the memory 14, the program that realizes the components shown in FIG. 1 loaded in the memory 14, and the hard disk that stores the program.
- the present invention is realized by any combination of hardware and software of the computer 10 having the device 16 and the network connection interface 18. It will be understood by those skilled in the art that there are various modifications to the implementation method and apparatus. Each figure described below shows functional unit blocks, not hardware unit configurations.
- the computer program according to the present embodiment has a predetermined period of time together with information on the occurrence status of the heat accumulation of the server 7 (FIG. 1) that provides a predetermined period of service to the computer 10 (FIG. 1) for realizing the display processing apparatus 100. It is described to execute a procedure for displaying the occurrence status information of the failure caused by the heat accumulation on the display device 30 (FIG. 1) by dividing the information into the service impact level due to the failure.
- the computer program of this embodiment may be recorded on a computer-readable recording medium.
- the recording medium is not particularly limited, and various forms can be considered.
- the program may be loaded from a recording medium into a computer memory, or downloaded to a computer through a network and loaded into the memory.
- the display processing apparatus 100 includes a heat accumulation information acquisition unit 102, a failure information acquisition unit 104, a service table storage unit (shown as “service table” in the figure) 106, a determination unit 108, and a totalization.
- Unit 110 determination unit 112, and display unit 114.
- the hot pool information acquisition unit 102 acquires hot pool occurrence status information of the server 7 (FIG. 1) that provides the service via the network 3.
- the heat accumulation information acquisition unit 102 further includes a detection unit 120, a temperature sensor arrangement table storage unit (shown as “temperature sensor arrangement table” in the drawing) 122, and a specification unit 124. May be included.
- the detection unit 120 receives temperature information measured by a temperature sensor (not shown) arranged around the server 7 that provides a service from the management device (not shown) of the data center 5 (FIG. 1) via the network 3. Based on the received temperature information, the occurrence of a heat pool is detected.
- the detection unit 120 detects the occurrence of a heat pool based on the position where the temperature information of the temperature sensor is equal to or higher than a preset threshold and the period thereof.
- the temperature sensor arrangement table storage unit 122 stores a correspondence table of the arrangement of the server 7 and the temperature sensor.
- the temperature sensor arrangement table storage unit 122 is included in the memory 14 or the storage device 16.
- the identifying unit 124 refers to the temperature sensor arrangement table storage unit 122, identifies at least one server 7 in which the detected heat pool has occurred, and outputs information on the occurrence status of the heat pool in the identified server 7. It should be noted that the generation of heat accumulation occurrence state information by the above configuration may be performed by the management device of the data center 5 and transmitted to the display processing device 100.
- the failure information acquisition unit 104 acquires the occurrence status information of the failure that has occurred in the service or the server 7 (FIG. 1) from the data center 5 via the network 3.
- the heat accumulation occurrence information and the failure occurrence information collected from the server 7 may be stored in the memory 14 or the storage device 16 periodically or as needed. Further, the heat storage information acquisition unit 102 and the failure information acquisition unit 104 of the display processing apparatus 100 may use values stored in the memory 14 or the storage device 16. Alternatively, the display processing apparatus 100 can read and acquire each piece of information about the target computer collected from each server 7 or management apparatus from a recording medium that has been recorded in advance for a certain period. In this way, each piece of information can be acquired at various timings and in various methods.
- the service table storage unit 106 stores a server table in which SLA ranks of services provided by the server 7 are associated.
- the service table storage unit 106 is included in the memory 14 or the storage device 16.
- the server table may include two tables, a correspondence table between the server 7 and the service, and a correspondence table between the service and the SLA rank.
- Various tables stored in the service table storage unit 106 can be confirmed and changed by an operator from a setting screen (not shown).
- the service table storage unit 106 stores a failure level table 150 illustrated in FIG. 6A and a server rank table 152 illustrated in FIG.
- the server rank table 152 stores an SLA rank determined in advance for each server 7 of the data center 5 in association with it. Note that the service provided by the server 7 may be changed along with resource allocation. When changed, the server rank table 152 is rewritten as needed.
- the failure level table 150 stores a reference value set for each monitoring item for each SLA rank.
- the monitoring items include, for example, response time, minimum communication speed, upper limit of unavailable time, and the like.
- two different threshold values for example, a first threshold value as1 and a second threshold value as2 are set as reference values.
- the first threshold value as1 may be a value that violates SLA, for example.
- the second threshold value as2 can be set as a reference value for issuing a warning before an SLA violation occurs.
- two threshold values are provided for each monitoring item of each rank, but the present invention is not limited to this.
- the number of threshold values may be one or more, but is preferably two or more.
- the discriminating unit 112 is aggregated based on different judgment criteria (threshold values) corresponding to the service impact levels (SLA violations and warnings) provided for each service quality assurance level (SLA rank).
- the failure occurrence status information for a predetermined period is discriminated and classified.
- the SLA rank of the service provided by the server 7 in which the failure has occurred can be specified, and the influence level due to the failure can be determined based on a criterion (threshold value) corresponding to the rank.
- first threshold value is provided for each rank for a certain monitoring item, and a value for calculating a reference value (second threshold value) for issuing a warning is set for all ranks. Also good.
- first threshold value is provided in common for a certain monitoring item, and a value for calculating a reference value (second threshold value) for issuing a warning is set for each rank. Also good.
- first threshold value is a value set in advance for each rank in the SLA, it is desirable to adopt a value defined in the SLA.
- a threshold other than the first threshold can be arbitrarily set by the administrator.
- the display processing apparatus 100 of this embodiment can be provided with a user interface for accepting operations such as a setting screen (not shown). These values may be set in advance on the setting screen by an operator's operation, or may be set or changed at any time.
- the determination unit 108 determines that the failure in the server 7 and the service has occurred. Judged as having occurred due to heat accumulation.
- a failure due to heat accumulation is detected by a management device (not shown) of the server 7 in the data center 5. Then, the display processing device 100 may acquire failure information due to heat accumulation in the server 7 from the management device.
- the totaling unit 110 totals the acquired state information of the heat accumulation of the server 7 and the state information of the failure determined to have occurred due to the heat accumulation for a predetermined period. In the present embodiment, the totaling unit 110 counts the number of occurrences of heat accumulation within a predetermined period.
- the predetermined period for performing the aggregation is specified by the operator in advance, such as one day, one week, one month, three months, six months, one year, multiple years, or an arbitrarily specified period. It can be changed to at least one specified period.
- the predetermined period may be selected in advance by an operator through a setting screen (not shown), or may be selected at any time.
- the operator can also specify the definition of the number of heat pools in advance. Various designation methods are conceivable as exemplified below. (1) Whether the heat pool generated in the same place for one day is counted as one case, or is it counted for several days. (2) The size of the heat pool is large, and a plurality of servers 7 are related. Whether to count the number of related servers 7 or to count for each related service. The above items can be individually and arbitrarily designated.
- the determining unit 112 refers to the service table storage unit 106 and identifies the SLA rank of the service provided by the server 7 determined to have failed based on the server rank table 152. Further, the determination unit 112 determines the influence level of the failure on the service based on the specified reference value of the rank based on the failure level table 150, and classifies the warning level or the SLA violation level, for example.
- the influence level can be divided by the determination result based on the first threshold value and the second threshold value set for each monitoring item described above.
- the service impact level is classified as a warning level. If the response time exceeds the first threshold, the service impact level is classified as SLA violation.
- the display unit 114 displays the occurrence status information of the failure caused by the accumulation of heat during the predetermined period together with the accumulated occurrence status information of the predetermined period for each service impact level classified by the determination unit 112. Displayed on the device 30 (FIG. 1).
- FIG. 3 is a diagram illustrating an example of the screen 130 of the display processing system 1 according to the present embodiment.
- the screen 130 displayed on the display device 30 (FIG. 1) by the display unit 114 (FIG. 2) includes a hot pool number display unit 132, a warning number display unit 134, and an SLA violation number display unit 136. And including.
- a period selection list 142 can be provided on the screen 130.
- the period selection list 142 can be used to accept an operation of selecting the total period for displaying the number of cases to be displayed on the hot-lump number display unit 132, the warning number display unit 134, and the SLA violation number display unit 136.
- the counting period can be selected from a predetermined period. In FIG. 3, the tabulated results for the past month are displayed.
- the present invention is not limited to this. You may make it possible to refer back to the past.
- the heat accumulation number display section 132 the number of heat accumulation accumulated for a predetermined period selected from the period selection list 142 by the aggregation section 110 is displayed.
- the number of warnings displayed for the predetermined period selected by the totaling unit 110 in the period selection list 142 is displayed in the warning number display unit 134.
- the SLA violation number display unit 136 displays the number of SLA violations totaled for a predetermined period selected from the period selection list 142 by the totaling unit 110.
- the number display part is highlighted 138 and an icon for notifying the administrator that an SLA violation has occurred.
- 140 is displayed in the vicinity of the SLA violation number display unit 136.
- various methods such as making the color red, making the character thicker, blinking, changing the background color, and highlighting can be considered.
- the icon 140 is assumed to be marked with a cross in a red circle, but is not limited to this.
- the color of the icon 140 may be yellow, the shape may be other shapes, the image inside may be another character or symbol such as “NG”, or another image. An image that allows an operator to visually recognize the occurrence of an SLA violation visually at a glance is preferable.
- the administrator who has seen the screen 130 can easily recognize a warning such as the occurrence of an SLA violation visually and intuitively.
- the number of heat accumulation number display unit 132 and the number of warning display unit 134 are similarly displayed by highlighting the number display part or displaying an icon in the vicinity of the predetermined number or more. It can make it easier to recognize warnings visually and intuitively.
- the predetermined number of highlights and icons to be displayed may be set in advance or changed at any time by an operator operating a setting screen (not shown).
- FIG. 5 is a flowchart showing an example of the operation of the display processing system 1 of the present embodiment. This will be described below with reference to FIGS.
- the display processing method according to the present embodiment converts the occurrence status information of the failure caused by the accumulation of heat in the predetermined period together with the occurrence status information of the accumulation of heat of the server 7 (FIG. 1) providing the service for the predetermined period to the service due to the failure. Are displayed on the display device 30 (FIG. 1) (step S113 in FIG. 5).
- the heat accumulation information acquisition unit 102 (FIG. 2) of the display processing device 100 generates the heat accumulation of the server 7 that provides the service. Information is acquired via the network 3 (FIG. 2) (step S101). Then, the failure information acquisition unit 104 (FIG. 2) of the display processing device 100 acquires the occurrence status information of the failure that occurred in the service or the server 7 from the data center 5 (FIG. 1) via the network 3 (step S103). .
- the determination unit 108 (FIG. 2) of the display processing apparatus 100 includes the server 7 where the heat accumulation acquired by the heat accumulation information acquisition unit 102 and the server 7 where the failure acquired by the failure information acquisition unit 104 occurs. It is determined whether or not they match (step S105). When they match (YES in step S105), it is determined that a failure in the server 7 and the service has occurred due to heat accumulation (step S107), and the process proceeds to step S109. If they do not match (NO in step S105), the failure is not caused by a heat accumulation, so step S107 is bypassed and the process proceeds to step S109. Then, the totaling unit 110 (FIG. 2) of the display processing apparatus 100 totals the acquired heat generation status information of the server 7 and the fault generation status information determined to have occurred due to the heat storage in a predetermined period. (Step S109).
- step S101 to step S107 can always be repeatedly executed at a predetermined cycle, and step S109 can be executed every predetermined period, but the description of the repeated processing is omitted in this figure.
- the determination unit 112 (FIG. 2) of the display processing device 100 refers to the service table storage unit 106 (FIG. 2) of the display processing device 100 and based on the server rank table 152 (FIG. 6B) The SLA rank of the service provided by the server 7 determined to have occurred is specified. Then, the determination unit 112 of the display processing device 100 determines the influence level of the failure on the service from the specified reference value of the rank based on the failure level table 150 (FIG. 6A). Or SLA violation level is classified (step S111).
- the display unit 114 (FIG. 2) of the display processing device 100 displays the occurrence status information of the failure generated by the accumulation of heat for a predetermined period together with the accumulated occurrence status information of the predetermined period of time for the display processing device 100.
- the service is classified into the service impact levels classified by the determination unit 112 and displayed (step S113).
- the number of heat accumulation that occurred in the month, the number of warnings that occurred due to heat accumulation, and the number of SLA violations are displayed on the screen 130 in FIG.
- the icon 140 and the highlight display 138 that an SLA violation has occurred due to a failure due to heat accumulation.
- the counting process in step S109 does not necessarily have to be executed every predetermined period.
- the aggregation process in step S109 can be performed at a timing at which the aggregation value for each predetermined period can be calculated.
- the display update process in step S113 may be performed only when the operator instructs display of the screen 130 using an operation button (not shown) or the like.
- the display unit 114 when there is a change in the information to be displayed on the heat accumulation number display unit 132, the warning number display unit 134, and the SLA violation number display unit 136, When a warning or SLA violation occurs, it is preferable to update the display immediately. However, whether to update the display automatically or manually using an operation button (not shown), etc., and the automatic update period, etc., is set in advance by an operator operating on a setting screen (not shown), or changed as needed It may be possible.
- the display processing system 1 of the present embodiment it is possible to visualize the failure occurrence state due to the accumulation of heat for a predetermined period and the service impact level in association with each other.
- the display processing system 1 of the present invention is not intended to present detailed information, but to give an overview of how much the service provided is being affected by the accumulation of heat. is there.
- the information regarding the maintenance of the quality of the service is presented in an easy-to-understand manner by presenting the state of occurrence of the heat accumulation in the predetermined period and the number of cases resulting in the failure and the influence level given to the quality of the service. be able to. Furthermore, according to the present invention, the quality maintenance status of the provided service can be easily grasped, so that when a problem occurs, it can be quickly noticed and measures can be taken. As a result, service reliability is improved.
- FIG. 7 is a functional block diagram showing the configuration of the display processing device 200 realized by the computer of the display processing system 1 according to the embodiment of the present invention.
- the display processing system 1 of the present embodiment is different from the above-described embodiment in that the energy efficiency is displayed together with the influence on the service due to the heat pool.
- the display processing system 1 of the present embodiment presents information that can be grasped at a glance as to whether or not the countermeasure is properly performed.
- the display processing system 1 of the present embodiment presents information that can be grasped at a glance whether or not power consumption is wasted. Furthermore, the display processing system 1 according to the present embodiment provides information that allows the stability of the service to be grasped at a glance together with the above information so that the occurrence state of the failure due to the heat accumulation does not impair the quality of the provided service. Present.
- the display processing apparatus 200 of the present embodiment displays the occurrence status information of the failure that has occurred due to the accumulation of heat of the server 7 (FIG. 1) that provides the service for a predetermined period, divided into service impact levels due to the failure, A display unit 208 that displays the energy efficiency information of the server 7 for a predetermined period in association with the failure occurrence status information is provided.
- Each component of the display processing device 200 is stored in the CPU 12, the memory 14, the program that realizes the components shown in FIG. 1 loaded in the memory 14, and the hard disk that stores the program.
- the present invention is realized by any combination of hardware and software of the computer 10 having the device 16 and the network connection interface 18. It will be understood by those skilled in the art that there are various modifications to the implementation method and apparatus. Each figure described below shows functional unit blocks, not hardware unit configurations.
- the computer program according to the present embodiment provides information on the occurrence status of a failure caused by the accumulation of heat in the server 7 that provides the service for a predetermined period to the computer 10 (FIG. 1) for realizing the display processing device 200.
- the procedure for displaying the energy efficiency information of the server 7 for a predetermined period on the display device 30 (FIG. 1) in association with the failure occurrence status information is executed. It is described to let you.
- the computer program of this embodiment may be recorded on a computer-readable recording medium.
- the recording medium is not particularly limited, and various forms can be considered.
- the program may be loaded from a recording medium into a computer memory, or downloaded to a computer through a network and loaded into the memory.
- the display processing device 200 of the present embodiment includes a temperature acquisition unit 202, a calculation unit 204, a totaling unit 206, and a display unit 208.
- information on the occurrence status of a failure caused by the accumulation of heat of the server 7 that provides the service for a predetermined period described in the display processing device 100 in FIG. 2 is acquired from another device or the like via the network 3.
- Shall. About a specific example of a configuration in which the configuration of the display processing device 100 in FIG. 2 that obtains information on the occurrence status of a failure caused by the accumulation of heat of the server 7 that provides a service for a predetermined period and the configuration of the display processing device 200 of the present embodiment are combined. Will be described in detail in an embodiment described later.
- the temperature acquisition unit 202 acquires temperature information measured by a temperature sensor (not shown) arranged around the server 7 via the network 3.
- the calculation unit 204 obtains energy efficiency information of the server 7 for a predetermined period based on the temperature information acquired by the temperature acquisition unit 202.
- the calculation unit 204 obtains the total time of supercooling when the temperature acquired by the temperature acquisition unit 202 is equal to or lower than a predetermined temperature in a predetermined period.
- the predetermined temperature is a temperature that can be recognized as supercooling in system operation, and may be set in advance or changed at any time by an operator operating a setting screen (not shown).
- the calculation unit 204 can further include a power consumption amount acquisition unit (not shown) that acquires the power consumption amount (kwh) of the server 7 via the network 3. And the calculation part 204 may calculate the sum total of the predetermined period of the power consumption (kwh) of the server 7 which the power consumption acquisition part acquired. Further, instead of the power consumption amount acquired by the power consumption amount acquisition unit, an electricity bill for one month may be acquired via the network 3, and the electricity bill (yen) for a predetermined period may be totaled.
- the calculation unit 204 further obtains an increasing / decreasing tendency of the total power consumption for the predetermined period calculated above. Specifically, for example, a difference (kwh) between the average value of the total amount of power consumption in the past certain period and the total amount of power consumption in the predetermined period, or an average ratio such as a change rate (%) is obtained. .
- the calculated average ratio may be a difference or a change amount from a standard value set in advance based on past power consumption.
- the aggregation unit 206 acquires and aggregates information on the occurrence status of failures that have occurred due to the accumulation of heat of the server 7 that provides services for a predetermined period via the network 3.
- the display unit 208 displays the occurrence status information of the failure that has occurred due to the accumulation of heat of the server 7 that provides the service for a predetermined period, which is aggregated by the aggregation unit 206, while displaying the information on the service impact level due to the failure.
- the energy efficiency information of the server 7 for a predetermined period is displayed on the display device 30 (FIG. 1) in association with the failure occurrence status information.
- FIG. 8 is a diagram illustrating an example of the screen 230 of the display processing system 1 according to the present embodiment.
- the screen 230 displayed on the display device 30 (FIG. 1) by the display unit 208 (FIG. 7) displays heat accumulation occurrence status information similar to the screen 130 of FIG. 3 of the above embodiment.
- the supercooling information display field 230a is further provided.
- the period selection list 142 can be provided on the screen 230 as in FIG.
- the period selection list 142 can be used to accept an operation for selecting the total number of periods displayed on each display unit. As described above, the counting period can be selected from a predetermined period. In FIG. 8, the tabulated results for the past month are displayed.
- the present invention is not limited to this. You may make it possible to refer back to the past.
- looking at this screen 230 and investigating the cause of the failure that has occurred is not a preferential purpose, and at least the current situation is preferably displayed. This is because, in the present invention, it can be confirmed that there is no influence on the quality of service due to the failure and that no overcooling has occurred, the case where there is an influence on the quality, the overcooling can be recognized quickly, or the influence on the quality. This is because it is a priority to be able to recognize the degree of seriousness and the degree of wasteful energy consumption.
- the temperature acquired by the temperature acquisition unit 202 (FIG. 7) aggregated for a predetermined period selected in the period selection list 142 by the calculation unit 204 (FIG. 7) is equal to or lower than the predetermined temperature.
- the total cooling time is displayed.
- the power consumption amount display unit 234 displays the sum of the power consumption amount (kwh) of the server 7 (FIG. 1) totaled for a predetermined period selected by the calculation unit 204 from the period selection list 142.
- the average ratio display unit 236 displays the average ratio of the total sum of the power consumption amounts calculated for a predetermined period selected by the calculation unit 204 from the period selection list 142.
- the average ratio display portion is highlighted 238 and the average ratio tends to increase to the administrator. Is displayed in the vicinity of the average ratio display unit 236.
- the highlight display 238 for example, various methods such as making the color red, making the character thicker, blinking, changing the background color to highlight, and the like can be considered.
- the icon 240 is, for example, a red upward arrow in the present embodiment, but is not limited thereto.
- the icon 240 may be another symbol or character indicating an unfavorable tendency such as “x” or “NG”, or another image, for example. It is preferable that the operator can visually and intuitively recognize the increasing tendency of the average ratio.
- An icon 240 may be displayed. It is preferable that the operator can visually and intuitively recognize the decreasing tendency of the average ratio.
- another image icon 240 may be displayed. It is preferable that the operator can visually and intuitively recognize that the average ratio is equal to the standard value.
- the display of the emphasis display 238 and the icon 240 makes it easy for an administrator who has viewed the screen 230 to visually recognize a state such as an increase in power consumption due to overcooling.
- the display processing device 200 determines whether or not the difference or change rate calculated by the calculation unit 204 is within an allowable range with respect to the standard value (not shown). And a determination notification unit (screen 230 in FIG. 8) for notifying the user of the determination result of the determination unit.
- the supercooling time display unit 232 when the supercooling time display unit 232 is outside the allowable range with respect to the standard value such as a predetermined time or more, or the power consumption display unit 234.
- the numerical display portion When the value is out of the allowable range with respect to the standard value such as a predetermined power consumption amount or more, the numerical display portion may be highlighted or an icon may be displayed in the vicinity thereof. By doing so, it is possible to easily recognize the state visually and intuitively.
- the permissible range for displaying the emphasis display or the icon may be set in advance or changed at any time by an operator operating a setting screen (not shown).
- FIG. 9 is a flowchart showing an example of the operation of the display processing system 1 of the present embodiment. This will be described below with reference to FIGS. 1 and 7 to 9.
- the display processing device 200 converts the occurrence status information of the failure generated by the heat accumulation of the server 7 (FIG. 1) providing the service for a predetermined period to the service due to the failure.
- the impact level is displayed on the display device 30 (FIG. 1), and the energy efficiency information of the server 7 for a predetermined period is displayed on the display device 30 (FIG. 1) in association with the failure occurrence status information (step S207).
- the temperature acquisition unit 202 (FIG. 7) of the display processing device 200 is a temperature sensor (not shown) arranged around the server 7 (FIG. 1).
- the temperature information measured in (1) is acquired via the network 3 (FIG. 1) (step S201).
- the calculation part 204 (FIG. 7) of the display processing apparatus 200 calculates
- the display unit 208 (FIG. 7) of the display processing device 200 converts the occurrence status information of the failure generated by the heat accumulation of the server 7 providing the service for a predetermined period, which is aggregated by the aggregation unit 206, to the service due to the failure.
- the display is divided into impact levels, and the calculated energy efficiency information of the server 7 for a predetermined period is displayed in association with the failure occurrence status information (step S207).
- the screen 230 in FIG. 8 shows that the number of heat pools that occurred during the month, the number of warnings that occurred due to heat pools, and the number of violations of the SLA have fallen below the specified temperature (for example, 17 ° C.) due to overcooling in this month
- the increase / decrease value of the power consumption when compared with the standard time is displayed.
- the icon 140 and the highlighting 138 indicate that the SLA violation has occurred due to a failure due to heat accumulation
- the icon 240 and the highlighting 238 increases the power consumption by 30% from the standard value due to overcooling. You can see at a glance what happened.
- step S201 and step S203 can always be repeatedly executed at a predetermined cycle.
- Step S205 does not necessarily have to be executed every predetermined period, and can be performed at a timing at which the total value for each predetermined period can be calculated.
- the display update process in step S207 may be performed only when the operator instructs display of the screen 230 by an operation button (not shown) or the like.
- the information changes to the information to be displayed on the heat accumulation number display unit 132, the warning number display unit 134, and the SLA violation number display unit 136.
- whether to update the display automatically or manually using an operation button (not shown), etc., and the automatic update period, etc. is set in advance by an operator operating on a setting screen (not shown), or changed as needed It may be possible.
- the same effects as those of the above-described embodiment can be obtained, and the relationship between the influence of the heat accumulation on the service and the energy efficiency can be visualized. As a result, service reliability is improved.
- FIG. 10 is a functional block diagram showing the configuration of the display processing device 300 realized by the computer of the display processing system 1 according to the embodiment of the present invention.
- the display processing device 300 of the display processing system 1 according to the present embodiment has a function of combining the display processing devices of the display processing system 1 according to the above-described embodiments shown in FIG. 2, FIG. 4 and FIG.
- the occurrence status information, the occurrence status information of the failure due to the heat accumulation, and the energy efficiency are obtained, and the information and the influence on the service due to the heat accumulation are associated and presented.
- the display processing device 300 of the display processing system 1 of the present embodiment includes a failure information acquisition unit 104, a service table storage unit 106, and a determination unit similar to the display processing device 100 of the above-described embodiment of FIG. 108, a totaling unit 110, a determination unit 112, a temperature sensor arrangement table storage unit 122 similar to FIG. 4, a temperature acquisition unit 202 similar to the display processing device 200 of the above embodiment of FIG. 7, and a calculation unit 204. And a detection unit 302, a specifying unit 304, and a display unit 308.
- the detection unit 302 detects the occurrence of a heat pool based on the temperature information acquired by the temperature acquisition unit 202.
- the detection unit 302 detects the occurrence of a heat pool based on the position of the temperature information of the temperature sensor that is equal to or higher than a preset threshold and the period thereof.
- the specifying unit 304 refers to the temperature sensor arrangement table storage unit 122, specifies at least one server 7 (FIG. 1) in which the detected heat pool has occurred, and outputs information on the occurrence status of the heat pool of the specified server 7 To do.
- the display unit 308 displays the occurrence status information of the failure that has occurred due to the accumulation of heat of the server 7 that provides the service for a predetermined period, which is aggregated by the aggregation unit 110, by dividing the information into the service impact level caused by the failure, and the calculation unit
- the energy efficiency information of the server for a predetermined period calculated by 204 is displayed in association with the failure occurrence status information.
- the display unit 308 displays the above-described screen 230 of FIG. 8 on the display device 30 (FIG. 1).
- FIG. 11 is a flowchart showing an example of the operation of the display processing system 1 of the present embodiment. This will be described below with reference to FIGS. 1 and 6 to 11.
- the display processing apparatus 300 of the present embodiment executes the processing of steps S201 to S207 similar to the flowchart of FIG. 9 of the display processing apparatus 200 of the above embodiment, and then the flowchart of FIG. It is assumed that the following process is executed.
- the description of the processing of the flowchart of FIG. 9 is omitted.
- the detection unit 302 Based on the temperature information acquired by the temperature acquisition unit 202 (FIG. 10) of the display processing device 300 from the management device (not shown) of the data center 5 (FIG. 1) via the network 3 (FIG. 10), the display processing device 300.
- the detection unit 302 detects the occurrence of a heat pool (step S301).
- the specifying unit 304 (FIG. 10) of the display processing device 300 refers to the temperature sensor arrangement table storage unit 122 and specifies and specifies at least one server 7 (FIG. 1) in which the detected heat pool has occurred. Information on the state of occurrence of heat accumulation in the server 7 is output (step S303).
- the failure information acquisition unit 104 (FIG. 10) of the display processing device 300 acquires the occurrence status information of the failure that has occurred in the service or the server 7 (FIG. 1) from the data center 5 (FIG. 1) via the network 3. (Step S304).
- the determination unit 108 (FIG. 10) of the display processing device 300 determines whether the server 7 with the heat accumulation identified by the identification unit 304 matches the server 7 with the failure acquired by the failure information acquisition unit 104. It is determined whether or not (step S305). When they match (YES in step S305), it is determined that a failure in the server 7 and the service has occurred due to heat accumulation (step S307), and the process proceeds to step S309. If they do not match (NO in step S305), the failure is not caused by a heat pool, so step S307 is bypassed and the process proceeds to step S309.
- the totaling unit 110 (FIG. 10) of the display processing apparatus 300 totals the acquired heat generation status information of the server 7 and the fault generation status information determined to have occurred due to the heat storage in a predetermined period. (Step S309).
- step S301 to step S307 can always be repeatedly executed at a predetermined cycle, and step S309 can be executed every predetermined period, but the description of the repeated processing is omitted in this figure.
- the determination unit 112 (FIG. 10) of the display processing device 300 refers to the service table storage unit 106 (FIG. 10) of the display processing device 300, and based on the server rank table 152 (FIG. 6 (b))
- the SLA rank of the service provided by the server 7 determined to have occurred is specified.
- the determination unit 112 of the display processing apparatus 300 determines the influence level of the failure on the service from the specified reference value of the rank based on the failure level table 150 (FIG. 6A), for example, a warning level. Or SLA violation level is classified (step S311).
- the display unit 308 (FIG. 10) of the display processing device 300 displays the occurrence status information of the failure caused by the accumulation of heat for a predetermined period together with the accumulated occurrence status information for the predetermined period of time.
- the server energy efficiency information for a predetermined period is displayed in association with the failure occurrence status information (step S313).
- the same effects as those of the above-described embodiment can be obtained, the occurrence status of the heat accumulation and the occurrence status of the failure due to the heat accumulation are grasped, and the information is included. Visualize the relationship between the effect of heat accumulation on service and energy efficiency.
- FIG. 12 is a diagram illustrating an example of the screen 330 of the display processing system 1 according to the present embodiment.
- the display processing system 1 according to the present embodiment is different from the above-described embodiment in that the degree of influence on the operation of the service due to heat accumulation during a predetermined period is calculated and presented in association with energy efficiency.
- the display processing device further includes an influence calculation unit (not shown) in addition to the configuration of the display processing device 300 (FIG. 10).
- the influence degree calculation unit is configured to determine whether the server 7 (FIG. 1) has a heat accumulation occurrence state information, a failure occurrence state information due to the heat accumulation, or a service influence level due to the heat accumulation for a predetermined period. Calculate the degree of influence of the pool on service operation.
- the display unit 308 (FIG. 10) displays the energy efficiency information of the server for a predetermined period and the degree of influence on service operation in association with each other.
- the degree of influence can be calculated by multiplying the number of occurrences of heat accumulation, the number of occurrences of warnings, and the number of violations of SLA, respectively, by a weighting factor in accordance with prescribed rules.
- the following calculation formula (1) of the degree of influence is an example, and the present invention is not limited to this. Further, the calculation formula and each coefficient may be set in advance or changed at any time by an operator operating a setting screen (not shown).
- a, b, and c are weighting coefficients.
- the degree of influence is calculated as 14 by the following equation (2).
- 19 ⁇ 0 + 3 ⁇ 3 + 1 ⁇ 5 14 Equation (2)
- the degree of influence calculated in this way is used when screen display is made in association with energy efficiency information described later and the degree of influence on the service.
- step S311 The operation of the display processing apparatus of the present embodiment configured as described above has steps S301 to S311 similar to those in the flowchart of FIG. 11 showing the operation of the display processing apparatus 300 of the above embodiment, and further continues to step S311. Instead of step S313, the following steps S413 and S415 (not shown) are included. That is, the influence calculation unit of the display processing device is based on the occurrence status information of the heat pool of the server 7 (FIG. 1), the occurrence status information of the failure due to the heat accumulation, or the level of influence on the service due to the heat accumulation during a predetermined period. Then, the degree of influence on the service operation due to the accumulation of heat for a predetermined period is calculated (step S413).
- the display unit 308 (FIG. 10) of the display processing apparatus 300 displays the energy efficiency information of the server for a predetermined period and the degree of influence on the service operation in association with each other (step S415). Further, the display unit 308 classifies the occurrence status information of the failure caused by the accumulation of heat during the predetermined period together with the accumulated occurrence status information of the predetermined period by the determination unit 112 (FIG. 10) of the display processing device 300. You may further display according to the impact level to the service which was made.
- the screen 330 displayed on the display device 30 by the display unit 308 displays the calculated server energy efficiency information (power consumption) for a predetermined period and the operation of the calculated service for the predetermined period.
- power consumption the calculated server energy efficiency information
- FIG. 12A shows a graph for February
- FIG. 12B and FIG. 12C show a graph for April.
- a user interface for accepting operations such as a selection list (not shown) is provided so that the operator can select the date, week, month, year, period, etc. of the graph to be displayed. It is also preferable that a plurality of graphs designated by the operator can be displayed side by side.
- a first reference value P0 of energy efficiency information (power consumption) and a predetermined reference value L0 of influence (service level) are respectively extended on the two-dimensional matrix perpendicular to the axis of the two-dimensional matrix.
- An auxiliary line 343 and a second auxiliary line 342 are shown, and the plot area of the two-dimensional matrix is divided into four.
- Each reference value may be set in advance or changed at any time by an operator operating a setting screen (not shown). For example, when the reference value is set to a value far from the entity, it can be reviewed as appropriate.
- the screen 330 includes a first area 351 (one-dot broken line in FIG. 12C), a second area 352 (two-dot broken line in FIG. 12C), and a third area 353 (in FIG. 12C). A dotted line) and a fourth region 354 (solid line in FIG. 12C).
- the first area 351 is a plot area when the calculated energy efficiency information (power consumption) is less than the reference value P0 and the calculated influence (service level) is less than the reference value L0.
- the second area 352 includes the first area 351 (dotted line in FIG. 12C), the calculated energy efficiency information (power consumption) is less than the reference value P0, and the calculated influence level (service level) is the reference.
- the third area 353 is a plot area when the calculated energy efficiency information (power consumption) is equal to or greater than the reference value P0 and the calculated influence (service level) is less than the reference value L0.
- the fourth area 354 is a plot area in the case where the calculated energy efficiency information (power consumption) is the reference value P0 or more and the calculated influence (service level) is the reference value L0 or more.
- the display processing apparatus 300 further includes a reference determination unit (not shown) that determines whether or not the predetermined threshold value and the influence degree of the energy efficiency information satisfy predetermined reference values. Good.
- the energy efficiency information and the influence level are both smaller than the reference value, which indicates a normal state with no problem.
- the numerical value decreases in the direction of the arrow for each axis, indicating a normal state.
- the background color of each area is changed and displayed based on the determination result of the reference determination unit. For example, as shown in FIG. 12C, when the energy efficiency information (power consumption) and the influence level (service level) exceed the reference values, the background colors of the second area 352 and the third area 353 are respectively set. Make pink. In addition, when the energy efficiency information (power consumption) and the influence level (service level) do not exceed the reference values, the background colors of the second area 352 and the third area 353 are each yellowish green.
- the background color of the fourth area 354 can be pink.
- the background color of the fourth region 354 can be yellow.
- the background color of the fourth region 354 can be yellowish green.
- the first auxiliary line 343 and the second auxiliary line 342 respectively subtracted from the reference value P0 of the energy efficiency information (power consumption) and the reference value L0 of the degree of influence (service level).
- a first auxiliary broken line 332 and a second auxiliary broken line 333 drawn vertically from the plot 331 can be shown.
- the calculated energy efficiency information (power consumption) and the calculated influence are displayed in the second region 352, the third region 353, and the fourth region 354 based on the determination result of the reference determination unit.
- Images indicating the degree (service level) determination results are displayed.
- these images may display an NG mark (red circle) icon when the energy efficiency and the influence degree do not satisfy the standards. Good.
- an icon of a caution mark (triangle yellow!) May be displayed, or an icon of an OK mark (green circle) may be displayed. By doing so, the user can be notified of the situation intuitively and easily.
- the background color of the second region 352 is yellowish green, and further, the second region 352
- a first evaluation icon 382 with an OK mark indicating that the influence level (service level) satisfies the reference value L0 is displayed.
- a first auxiliary broken line 372 drawn vertically from the plot 371 can be shown with respect to the first auxiliary line 343 drawn from the reference value P0 of the energy efficiency information (power consumption). .
- the background color of the third region 353 is pink, and further, the energy efficiency (power consumption) has the reference value P0 in the third region 353.
- a second evaluation icon 383 with an NG mark indicating that it is not satisfied is displayed.
- the background color of the fourth area 354 is yellow, and a third evaluation icon 384 of a yellow triangular caution mark is displayed in the fourth area 354.
- the same effects as the above-described embodiment can be obtained, and the relationship between energy efficiency and service level can be recognized at a glance.
- a recording unit (not shown) that records the degree of influence of the period on service operation as a history can be further provided.
- FIG. 13 is a diagram illustrating an example of a transition graph screen in the display processing system 1 of the present embodiment.
- the screen 430 includes a bar graph 442 of the degree of influence of the heat pool on the service indicated by the influence degree legend 432 and a line graph 444 of the power consumption due to supercooling indicated by the power consumption legend 434 on the time axis.
- the data is displayed in time series for each month.
- a selection list or the like may be provided on the screen 430 so that the operator can select the time axis 440 in units of weeks, years, etc. in addition to monthly units and switch the screen display.
- the energy consumption by supercooling is graphed as energy efficiency, but other energy efficiency information calculated by the calculation unit 204 (FIG. 7 or 10) may be displayed. Further, a selection list or the like may be provided so that the operator can select and switch the screen display. The types of influence and energy efficiency graphs (bar graph, line graph, etc.) can be appropriately selected by the operator. In addition to the graph, a list may be used.
- a reception unit (not shown) that receives a time when a measure is taken against the heat accumulation, a graph (a bar graph 452 on the screen 450 in FIG. 13B, a line graph). 454) or a presentation unit (icon 460 in FIG. 13B) shown in a list.
- a presentation unit icon 460 in FIG. 13B
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Human Computer Interaction (AREA)
- Computing Systems (AREA)
- Debugging And Monitoring (AREA)
- Air Conditioning Control Device (AREA)
Abstract
Description
所定期間のサービスを提供するサーバの熱だまりにより発生した障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて表示するとともに、前記障害の発生状況情報に関連付けて前記所定期間の前記サーバのエネルギ効率情報を表示する表示手段を備える。
表示装置に接続された処理装置が、
所定期間のサービスを提供するサーバの熱だまりにより発生した障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて前記表示装置に表示するとともに、前記障害の発生状況情報に関連付けて前記所定期間の前記サーバのエネルギ効率情報を前記表示装置に表示する。
表示装置に接続されたコンピュータに、
所定期間のサービスを提供するサーバの熱だまりにより発生した障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて前記表示装置に表示するとともに、前記障害の発生状況情報に関連付けて前記所定期間の前記サーバのエネルギ効率情報を前記表示装置に表示する手順を実行させるためのものである。
図1は、本発明の実施の形態に係る表示処理システム1の構成を示すブロック図である。
本発明の実施の形態に係る表示処理システム1は、たとえば、ネットワーク3を介して接続されるデータセンタ(DC)5の多数のIT機器、たとえば、多数のサーバ7やサービスの運用管理状況を示す情報をデータセンタ5のシステムを管理する管理者に提示する。
なお、各図において、本発明の本質に関わらない部分の構成については省略してあり、図示されていない。
図2に示すように、本実施形態の表示処理システム1は、所定期間のサービスを提供するサーバ7(図1)の熱だまりの発生状況情報とともに、所定期間の熱だまりにより発生した障害の発生状況情報を、障害によるサービスへの影響レベルに分けて表示する表示部114を備える。
検出部120は、ネットワーク3を介してデータセンタ5(図1)の管理装置(不図示)からサービスを提供するサーバ7周辺に配置された温度センサ(不図示)で測定した温度情報を受信し、受信した温度情報に基づいて、熱だまりの発生を検出する。検出部120は、温度センサの温度情報が、予め設定されたしきい値以上のもの位置およびその期間に基づいて、熱だまりの発生を検出する。
なお、上記構成による熱だまりの発生状況情報の生成は、データセンタ5の管理装置で行なわれ、表示処理装置100に送信されてもよい。
(1)数日間、同じ場所で発生した熱だまりを1件とするか、日数分カウントするか
(2)熱だまりの規模が大きく、複数のサーバ7が関連する場合も熱だまりを1件とするか、関連するサーバ7の台数分とするか、あるいは、関連するサービス毎にカウントするか
上記項目は、個別にかつ任意に指定することができるものとする。
図3に示すように、表示部114(図2)が表示装置30(図1)に表示する画面130は、熱だまり件数表示部132と、警告件数表示部134と、SLA違反件数表示部136と、を含む。さらに、画面130には、期間選択リスト142が設けることができる。期間選択リスト142を用いて、熱だまり件数表示部132、警告件数表示部134、およびSLA違反件数表示部136に表示する件数の集計期間を選択する操作を受け付けることができる。集計期間は、上述したように、所定期間から選択することができる。図3では、過去1ヶ月間の集計結果が表示されている。
図5は、本実施形態の表示処理システム1の動作の一例を示すフローチャートである。以下、図1乃至図6を用いて説明する。
本実施形態の表示処理方法は、所定期間のサービスを提供するサーバ7(図1)の熱だまりの発生状況情報とともに、所定期間の熱だまりにより発生した障害の発生状況情報を、障害によるサービスへの影響レベルに分けて表示装置30(図1)に表示する(図5のステップS113)。
そして、表示処理装置100の集計部110(図2)が、取得したサーバ7の熱だまりの発生状況情報、および熱だまりにより発生したものと判定された障害の発生状況情報を、所定期間で集計する(ステップS109)。
熱だまりを検出して対策を講じることは、当然必要なことであり、そのためには、詳細な障害情報を確認する必要がある。しかし、本発明の表示処理システム1では、詳細な情報の提示ではなく、熱だまりが発生したことによって、提供しているサービスにどの程度の影響を及ぼしているのかを、概略知ることが目的である。
図7は、本発明の実施の形態に係る表示処理システム1のコンピュータが実現する表示処理装置200の構成を示す機能ブロック図である。本実施形態の表示処理システム1は、上記実施の形態とは、熱だまりによるサービスへの影響とともに、エネルギ効率をあわせて表示する点で相違する。
さらに、熱だまりによる障害の発生状況が、提供しているサービスの品質を損なうことのないように、本実施形態の表示処理システム1は、サービスの安定性も上記情報とともに一見して把握できる情報を提示する。
算出部204は、温度取得部202が取得した温度情報に基づき、所定期間のサーバ7のエネルギ効率情報を求める。
図8に示すように、表示部208(図7)が表示装置30(図1)に表示する画面230は、上記実施形態の図3の画面130と同様な熱だまりの発生状況情報を表示する熱だまり障害情報表示欄130aに加え、さらに、過冷却情報表示欄230aを有する。
平均比表示部236には、算出部204により期間選択リスト142で選択された所定期間分集計された消費電力量の総和の平均比が表示される。
図9は、本実施形態の表示処理システム1の動作の一例を示すフローチャートである。以下、図1、図7乃至図9を用いて説明する。
本実施形態の表示処理方法は、表示処理装置200(図7)が、所定期間のサービスを提供するサーバ7(図1)の熱だまりにより発生した障害の発生状況情報を、障害によるサービスへの影響レベルに分けて表示装置30(図1)に表示するとともに、障害の発生状況情報に関連付けて所定期間のサーバ7のエネルギ効率情報を表示装置30(図1)に表示する(ステップS207)。
そして、表示処理装置200の集計部206(図7)が、所定期間のサービスを提供するサーバ7の熱だまりにより発生した障害の発生状況情報をネットワーク3を介して取得して集計する(ステップS205)。
図10は、本発明の実施の形態に係る表示処理システム1のコンピュータが実現する表示処理装置300の構成を示す機能ブロック図である。本実施形態の表示処理システム1の表示処理装置300は、図2、図4および図7の上記実施の形態の表示処理システム1の表示処理装置を組み合わせた機能を有し、サーバの熱だまりの発生状況情報と、熱だまりによる障害の発生状況情報と、エネルギ効率とを求め、これらの情報と熱だまりによるサービスへの影響とを関連付けて提示する。
特定部304は、温度センサ配置テーブル記憶部122を参照し、検出された熱だまりが発生したサーバ7(図1)を少なくとも1つ特定し、特定したサーバ7の熱だまりの発生状況情報を出力する。
図11は、本実施形態の表示処理システム1の動作の一例を示すフローチャートである。以下、図1、図6乃至図11を用いて説明する。
なお、これに限定されないが、本実施形態の表示処理装置300は、上記実施形態の表示処理装置200の図9のフローチャートと同様なステップS201~ステップS207の処理を実行した後、図11のフローチャートの処理を実行するものとする。ここでは、図9のフローチャートの処理の説明は省略する。
図12は、本実施形態の表示処理システム1の画面330の一例を示す図である。
本実施形態の表示処理システム1は、上記実施形態とは、所定期間の熱だまりによるサービスの運用への影響度を算出し、エネルギ効率と関連付けて提示する点で相違する。
そして、表示部308(図10)は、所定期間のサーバのエネルギ効率情報とサービスの運用への影響度を関連付けて表示する。
19×0+3×3+1×5=14 ・・・式(2)
このようにして算出された影響度は、後述するエネルギ効率情報とサービスへの影響度を関連付けて画面表示する際、使用される。
すなわち、表示処理装置の影響度算出部が、所定期間における、サーバ7(図1)の熱だまりの発生状況情報、熱だまりによる障害の発生状況情報、または熱だまりによるサービスへの影響レベルに基づいて、所定期間の熱だまりによるサービスの運用への影響度を算出する(ステップS413)。
第1領域351は、算出されたエネルギ効率情報(消費電力)が基準値P0未満、かつ算出された影響度(サービスレベル)が基準値L0未満の場合のプロット領域となる。
第2領域352は、第1領域351(図12(c)の一点破線)と、算出されたエネルギ効率情報(消費電力)が基準値P0未満、かつ算出された影響度(サービスレベル)が基準値L0以上の場合のプロット領域となる。
第3領域353は、算出されたエネルギ効率情報(消費電力)が基準値P0以上、かつ算出された影響度(サービスレベル)が基準値L0未満の場合のプロット領域となる。
第4領域354は、算出されたエネルギ効率情報(消費電力)が基準値P0以上、かつ算出された影響度(サービスレベル)が基準値L0以上の場合のプロット領域となる。
たとえば、これらの画像(第1評価アイコン、第2評価アイコン、および第3評価アイコン)は、エネルギ効率と影響度ともに基準を満たしていない場合は、NGマーク(赤丸)のアイコンを表示してもよい。あるいは、いずれか一方が基準を満たしている場合は、注意マーク(三角黄色!)のアイコンを表示したり、OKマーク(緑丸)のアイコンを表示してもよい。このようにすることで、直感的に分かりやすく利用者に状況を通知できる。
たとえば、上記実施形態の表示処理装置300(図10)において、算出部204(図10)で算出された所定期間のサーバ7(図1)のエネルギ効率情報および影響度算出部で算出された所定期間のサービスの運用への影響度を履歴として記録する記録部(不図示)をさらに備えることができる。そして、表示部308(図10)は、記録された所定期間ごとの履歴を、所定期間のサーバ7のエネルギ効率情報および所定期間のサービスの運用への影響度の推移を時系列で表示することができる。
図13に示すように、画面430には、影響度凡例432で示す熱だまりのサービスへの影響度の棒グラフ442と、消費電力凡例434で示す過冷却による消費電力の線グラフ444が、時間軸440に沿って、月毎に時系列に表示される。
画面430に、選択リストなどを設けて、時間軸440を月単位以外に、週単位、年単位などをオペレータが選択して画面表示切り替え可能としてもよい。
影響度とエネルギ効率のグラフの種類(棒グラフ、線グラフなど)は、オペレータにより適宜選択可能とすることができる。また、グラフ以外に、一覧表などでもよい。
この構成によれば、対策を講じたことによる効果も視覚的に確認でき、もし効果がない場合には、さらなる対策をとるなどのアクションをとることが可能になる。
Claims (15)
- 所定期間のサービスを提供するサーバの熱だまりにより発生した障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて表示するとともに、前記障害の発生状況情報に関連付けて前記所定期間の前記サーバのエネルギ効率情報を表示する表示手段を備える表示処理システム。
- 請求項1に記載の表示処理システムにおいて、
前記サーバの周辺に配置された温度センサで測定した温度情報を取得する温度取得手段と、
取得した前記温度情報に基づき、前記所定期間の前記サーバの前記エネルギ効率情報を求める効率算出手段と、
前記所定期間の前記サービスを提供する前記サーバの熱だまりにより発生した前記障害の発生状況情報を取得して集計する集計手段と、
前記表示手段は、集計された前記所定期間の前記サービスを提供する前記サーバの熱だまりにより発生した前記障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて表示するとともに、算出された前記所定期間の前記サーバのエネルギ効率情報を前記障害の発生状況情報に関連付けて表示する表示処理システム。 - 請求項2に記載の表示処理システムにおいて、
前記効率算出手段は、前記表示手段に表示する前記サーバの前記エネルギ効率情報として、取得した前記温度が前記所定期間で既定温度以下となった過冷却ののべ時間を求める表示処理システム。 - 請求項2または3に記載の表示処理システムにおいて、
前記サーバの消費電力量を取得する消費電力量取得手段をさらに備え、
前記効率算出手段は、前記表示手段に表示する前記サーバの前記エネルギ効率情報として、取得した前記サーバの前記消費電力量を前記所定期間の総和を算出する表示処理システム。 - 請求項4に記載の表示処理システムにおいて、
前記効率算出手段は、前記表示手段に表示する前記サーバの前記エネルギ効率情報として、算出された前記所定期間の前記消費電力量の総和の増減傾向を求め、
前記表示手段は、前記所定期間の前記消費電力量の総和の増減傾向を提示する表示処理システム。 - 請求項5に記載の表示処理システムにおいて、
前記効率算出手段は、前記所定期間の前記消費電力量の総和の増減傾向として、前記所定期間の前記消費電力量の総和の、所定の標準値に対する差分または変化率を求める表示処理システム。 - 請求項6に記載の表示処理システムにおいて、
前記効率算出手段が算出した前記差分または前記変化率が前記標準値に対して、許容範囲内か否かを判定する判定手段と、
前記判定手段の判定結果を利用者に通知する判定通知手段と、をさらに備える表示処理システム。 - 請求項1乃至7いずれかに記載の表示処理システムにおいて、
前記所定期間における、前記サーバの熱だまりの前記発生状況情報、前記熱だまりによる前記障害の前記発生状況情報、または前記熱だまりによる前記サービスへの前記影響レベルに基づいて、前記所定期間の前記熱だまりによる前記サービスの運用への影響度を算出する影響度算出手段をさらに備え、
前記表示手段は、前記所定期間の前記サーバのエネルギ効率情報と前記サービスの運用への前記影響度を関連付けて表示する表示処理システム。 - 請求項8に記載の表示処理システムにおいて、
前記表示手段は、算出された前記所定期間の前記サーバのエネルギ効率情報および算出された前記所定期間の前記サービスの運用への前記影響度を二次元マトリクスの軸にそれぞれ割り当てプロットして表示する表示処理システム。 - 請求項9に記載の表示処理システムにおいて、
前記エネルギ効率情報の所定のしきい値および前記影響度がそれぞれ予め決められた基準値を満たすか否かを判定する基準判定手段をさらに備え、
前記表示手段は、
前記エネルギ効率情報の前記基準値および前記影響度の前記基準値をそれぞれ前記二次元マトリクスの軸に垂直に前記二次元マトリクス上に延ばした線で示し、前記二次元マトリクスのプロット領域を4分割して、
算出された前記エネルギ効率情報が前記基準値未満、かつ算出された前記影響度が前記基準値未満の場合のプロット領域となる第1領域と、算出された前記エネルギ効率情報が前記基準値未満、かつ算出された前記影響度が前記基準値以上の場合のプロット領域となる第2領域と、算出された前記エネルギ効率情報が前記基準値以上、かつ算出された前記影響度が前記基準値未満の場合のプロット領域となる第3領域と、算出された前記エネルギ効率情報が前記基準値以上、かつ算出された前記影響度が前記基準値以上の場合のプロット領域となる第4領域と、を作成し、
前記基準判定手段の判定結果に基づいて、各領域の背景色を色変えして表示する表示処理システム。 - 請求項10に記載の表示処理システムにおいて、
前記表示手段は、前記基準判定手段の判定結果に基づいて、算出された前記エネルギ効率情報または算出された前記影響度の前記判定結果を示す画像を前記第2領域、前記第3領域、および前記第4領域にそれぞれ表示する表示処理システム。 - 請求項8乃至11いずれかに記載の表示処理システムにおいて、
前記所定期間の前記サーバのエネルギ効率情報および前記所定期間の前記サービスの運用への前記影響度を履歴として記録する記録手段をさらに備え、
前記表示手段は、記録された前記所定期間ごとの前記履歴を、前記所定期間の前記サーバのエネルギ効率情報および前記所定期間の前記サービスの運用への前記影響度の推移を時系列で表示する表示処理システム。 - 請求項2に記載の表示処理システムにおいて、
取得した前記温度情報に基づいて、前記熱だまりの発生を検出する検出手段と、
検出された前記熱だまりが発生した前記サーバを特定する特定手段と、
前記特定手段が特定した前記サーバと、前記集計手段が取得した前記障害の発生状況情報に基づき前記障害が発生した前記サーバとが一致したとき、前記サーバおよび前記サービスにおける前記障害が前記熱だまりにより発生したものと判定する熱だまり判定手段と、
前記熱だまりにより発生したものと判定された前記障害の前記発生状況情報を、前記障害が発生したと判定された前記サーバが提供する前記サービスへの影響レベルに応じて判別して分類する判別手段と、をさらに備え、
前記集計手段は、取得した前記サーバの前記熱だまりの前記発生状況情報を前記所定期間で集計し、
前記表示手段は、集計された前記所定期間の前記熱だまりの発生状況情報とともに、集計された前記所定期間の前記サービスを提供する前記サーバの熱だまりにより発生した前記障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて表示するとともに、算出された前記所定期間の前記サーバのエネルギ効率情報を前記障害の発生状況情報に関連付けて表示する表示処理システム。 - 表示装置に接続された処理装置が、
所定期間のサービスを提供するサーバの熱だまりにより発生した障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて前記表示装置に表示するとともに、前記障害の発生状況情報に関連付けて前記所定期間の前記サーバのエネルギ効率情報を前記表示装置に表示する表示処理方法。 - 表示装置に接続されたコンピュータに、
所定期間のサービスを提供するサーバの熱だまりにより発生した障害の発生状況情報を、前記障害による前記サービスへの影響レベルに分けて前記表示装置に表示するとともに、前記障害の発生状況情報に関連付けて前記所定期間の前記サーバのエネルギ効率情報を前記表示装置に表示する手順を実行させるためのプログラム。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012546667A JP5761201B2 (ja) | 2010-11-29 | 2011-09-09 | 表示処理システム、表示処理方法、およびプログラム |
CN201180053042.6A CN103189848B (zh) | 2010-11-29 | 2011-09-09 | 显示处理系统、显示处理方法和程序 |
US13/823,816 US9195532B2 (en) | 2010-11-29 | 2011-09-09 | Display processing system, display processing method, and program |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2010265496 | 2010-11-29 | ||
JP2010-265496 | 2010-11-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2012073407A1 true WO2012073407A1 (ja) | 2012-06-07 |
Family
ID=46171385
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2011/005089 WO2012073407A1 (ja) | 2010-11-29 | 2011-09-09 | 表示処理システム、表示処理方法、およびプログラム |
Country Status (4)
Country | Link |
---|---|
US (1) | US9195532B2 (ja) |
JP (1) | JP5761201B2 (ja) |
CN (1) | CN103189848B (ja) |
WO (1) | WO2012073407A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2015141601A (ja) * | 2014-01-29 | 2015-08-03 | 日本電気株式会社 | ジョブ性能分析システム、ジョブ性能分析方法及びプログラム |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012073407A1 (ja) * | 2010-11-29 | 2012-06-07 | 日本電気株式会社 | 表示処理システム、表示処理方法、およびプログラム |
JP7385826B2 (ja) * | 2019-11-08 | 2023-11-24 | オムロン株式会社 | 動作分析装置、動作分析方法及び動作分析プログラム |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005092886A (ja) * | 2003-09-18 | 2005-04-07 | Internatl Business Mach Corp <Ibm> | ユーティリティ・コンピューティング環境の、自動アラートおよび率先リベートを伴うリアルタイム・サービス・レベル・パフォーマンス、違反、および保証された均一性を表示する方法。 |
JP2009252056A (ja) * | 2008-04-09 | 2009-10-29 | Hitachi Ltd | 情報処理システムの運用管理方法および運用管理装置 |
JP2009265904A (ja) * | 2008-04-24 | 2009-11-12 | Hitachi Ltd | 管理装置、ストレージ装置、及び、情報処理システム |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6145098A (en) * | 1997-05-13 | 2000-11-07 | Micron Electronics, Inc. | System for displaying system status |
US6871300B1 (en) * | 2001-05-04 | 2005-03-22 | Rlx Technologies, Inc. | Embedded server chassis hardware master system and method |
JP2004086278A (ja) * | 2002-08-23 | 2004-03-18 | Hitachi Kokusai Electric Inc | 装置障害監視方法および装置障害監視システム |
US6889908B2 (en) * | 2003-06-30 | 2005-05-10 | International Business Machines Corporation | Thermal analysis in a data processing system |
US7493193B2 (en) * | 2005-08-04 | 2009-02-17 | International Business Machines Corporation | Monitoring and real-time heat load control based upon server and environmental parameters |
CN1870750B (zh) | 2006-04-29 | 2011-11-30 | 北京北大方正电子有限公司 | 一种报警处理系统及方法 |
JP4151727B2 (ja) | 2006-12-22 | 2008-09-17 | ダイキン工業株式会社 | 空調管理装置 |
JP4930051B2 (ja) | 2006-12-28 | 2012-05-09 | ブラザー工業株式会社 | ステータスモニタプログラム |
US7894944B2 (en) * | 2007-07-06 | 2011-02-22 | Microsoft Corporation | Environmental monitoring in data facilities |
TW200931330A (en) | 2008-01-15 | 2009-07-16 | Chunghwa Telecom Co Ltd | Remote real-time asset management system for air conditioning equipments |
CA2882794C (en) * | 2008-02-15 | 2017-08-22 | The Pnc Financial Services Group, Inc. | Systems and methods for computer equipment management |
JP5790662B2 (ja) * | 2010-11-29 | 2015-10-07 | 日本電気株式会社 | 表示処理システム、表示処理方法、およびプログラム |
WO2012073407A1 (ja) * | 2010-11-29 | 2012-06-07 | 日本電気株式会社 | 表示処理システム、表示処理方法、およびプログラム |
US8676397B2 (en) * | 2010-12-20 | 2014-03-18 | International Business Machines Corporation | Regulating the temperature of a datacenter |
-
2011
- 2011-09-09 WO PCT/JP2011/005089 patent/WO2012073407A1/ja active Application Filing
- 2011-09-09 US US13/823,816 patent/US9195532B2/en active Active
- 2011-09-09 CN CN201180053042.6A patent/CN103189848B/zh active Active
- 2011-09-09 JP JP2012546667A patent/JP5761201B2/ja active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005092886A (ja) * | 2003-09-18 | 2005-04-07 | Internatl Business Mach Corp <Ibm> | ユーティリティ・コンピューティング環境の、自動アラートおよび率先リベートを伴うリアルタイム・サービス・レベル・パフォーマンス、違反、および保証された均一性を表示する方法。 |
JP2009252056A (ja) * | 2008-04-09 | 2009-10-29 | Hitachi Ltd | 情報処理システムの運用管理方法および運用管理装置 |
JP2009265904A (ja) * | 2008-04-24 | 2009-11-12 | Hitachi Ltd | 管理装置、ストレージ装置、及び、情報処理システム |
Non-Patent Citations (2)
Title |
---|
HITOSHI TAKAGI ET AL.: "NEC no Green IT eno Torikumi 'REAL IT COOL PROJECT'", NEC TECHNICAL JOURNAL, vol. 62, no. 3, 25 September 2009 (2009-09-25), pages 24 - 27 * |
KUNIO FUKUMOTO ET AL.: "Optimization of IT load and facility energy in data centers", FUJITSU, vol. 61, no. 3, 10 May 2010 (2010-05-10), pages 255 - 260 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2015141601A (ja) * | 2014-01-29 | 2015-08-03 | 日本電気株式会社 | ジョブ性能分析システム、ジョブ性能分析方法及びプログラム |
Also Published As
Publication number | Publication date |
---|---|
JP5761201B2 (ja) | 2015-08-12 |
JPWO2012073407A1 (ja) | 2014-05-19 |
US20130198576A1 (en) | 2013-08-01 |
CN103189848B (zh) | 2015-12-16 |
US9195532B2 (en) | 2015-11-24 |
CN103189848A (zh) | 2013-07-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5790662B2 (ja) | 表示処理システム、表示処理方法、およびプログラム | |
JP6364800B2 (ja) | 監視装置及び監視方法 | |
JP5267736B2 (ja) | 障害検出装置、障害検出方法およびプログラム記録媒体 | |
CN106776214B (zh) | 一种服务器健康度评估方法 | |
JP5368388B2 (ja) | 災害危機管理装置、被害レベル計算方法、およびプログラム | |
JP5761201B2 (ja) | 表示処理システム、表示処理方法、およびプログラム | |
US9575488B2 (en) | Method and system for monitoring an industrial system involving an eye tracking system | |
JP2008134039A (ja) | 空調設備の監視装置および監視方法 | |
JP5790658B2 (ja) | 表示処理システム、表示処理方法、およびプログラム | |
US11113364B2 (en) | Time series data analysis control method and analysis control device | |
JP5884735B2 (ja) | 運用状況可視化装置、運用状況可視化方法およびプログラム | |
WO2012070475A1 (ja) | 情報処理装置、情報処理方法、並びに情報処理プログラム | |
CN110059906B (zh) | 策略效能分析方法、装置、服务器及存储介质 | |
JP2015014823A (ja) | エネルギー使用量評価システム、エネルギー使用量評価演算装置及びプログラム | |
JP2005071136A (ja) | 納期管理支援システム、そのプログラム、そのプログラムを記録した記録媒体および製品の納期管理方法 | |
JP6147205B2 (ja) | メータデータ確認装置及びメータデータ確認方法 | |
JP7020835B2 (ja) | 監視制御装置、監視制御システム及び表示方法 | |
JP5932887B2 (ja) | 業務処理システム、業務処理能力の監視システムおよび監視方法 | |
JP2019191662A (ja) | 設備遠隔監視システムおよび設備遠隔監視方法 | |
JP6177022B2 (ja) | 情報処理装置及び情報処理方法及びプログラム | |
JPWO2022113569A5 (ja) | ||
JP2014049045A (ja) | ジョブ管理システムにおける障害対応システム及びそのプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11844412 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13823816 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: 2012546667 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 11844412 Country of ref document: EP Kind code of ref document: A1 |