JP4430989B2 - Operation management support system and performance information display method - Google Patents

Info

Publication number
JP4430989B2
Authority
JP
Japan
Prior art keywords
performance information
performance
selected
type
types
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2004189267A
Other languages
Japanese (ja)
Other versions
JP2006011902A (en
JP2006011902A5 (en
Inventor
陽子 志賀
健史 石▲崎▼
公徳 菅内
Original Assignee
株式会社日立製作所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社日立製作所 filed Critical 株式会社日立製作所
Priority to JP2004189267A priority Critical patent/JP4430989B2/en
Publication of JP2006011902A publication Critical patent/JP2006011902A/en
Publication of JP2006011902A5 publication Critical patent/JP2006011902A5/ja
Application granted granted Critical
Publication of JP4430989B2 publication Critical patent/JP4430989B2/en
Application status is Active legal-status Critical
Anticipated expiration legal-status Critical

Description

  The present invention relates to an operation management support system that collects performance information from a managed system.

  Enterprise information systems, which were conventionally built as dedicated systems, are now implemented with general-purpose components such as servers for Internet services (for example, Web servers) and database servers. Because such an enterprise information system is composed of a plurality of independent elements, it is difficult to identify the cause of a system performance degradation.

  Especially in relatively large-scale systems, performance monitoring and cause analysis of performance degradation require that a large amount of management information be managed in an integrated manner. Operation management support systems that centrally manage system configuration information and the response times of the services provided by the system have therefore been developed. Such an operation management support system collects performance information such as service response performance, component CPU usage, and memory usage, detects anomalies by threshold monitoring, and analyzes the cause of the detected anomalies (see Patent Document 1).

  Next, a conventional method by which an operation management support system analyzes the cause of performance degradation is described. First, the performance information of the system components is grouped for each service. When the response performance of a service deteriorates, the performance information that may have affected the response performance is narrowed down from the group, and the problematic component is identified. Correlation analysis between performance information is often used for this narrowing down. Correlation analysis indicates the correlation between a variable X and a variable Y. If Y increases as X increases, there is a positive correlation between X and Y. Conversely, if Y decreases as X increases, there is a negative correlation between X and Y. The correlation is expressed quantitatively by a correlation coefficient, which can be calculated by a formula. When the correlation coefficient with the variable Y is obtained with the variable X as the reference, X is called the objective variable and Y is called the explanatory variable.
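  The text does not say which formula is used for the correlation coefficient. As a minimal sketch, assuming the common Pearson product-moment coefficient, the calculation could look like the following. The function name and the sample values are hypothetical and are not taken from the patent.

```python
import math

def correlation_coefficient(xs, ys):
    """Pearson correlation coefficient between two equal-length series.

    Assumption: the patent only says "a formula"; Pearson's is used here
    because it is the usual choice for this kind of analysis.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical samples: X is the objective variable, Y the explanatory variable.
response_time = [2.0, 4.0, 6.0, 8.0]
cpu_usage = [10.0, 20.0, 30.0, 40.0]
r_positive = correlation_coefficient(response_time, cpu_usage)               # Y rises with X
r_negative = correlation_coefficient(response_time, [40.0, 30.0, 20.0, 10.0])  # Y falls as X rises
```

  A value near +1 corresponds to the positive correlation described above, and a value near -1 to the negative correlation.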

  As described above, the cause analysis function of the operation management support system makes it possible to identify factors that may have affected the response performance of a service, which is useful for finding the root cause. An operation management support system having such an analysis function is already on the market (see Non-Patent Document 1).

  Many servers and network devices hold performance information such as the CPU usage rate and the line operation rate as a MIB (Management Information Base). The MIB defines the structure of the management information of a distributed system, and each item of management information can be specified by a unique identifier called an object ID. SNMP (Simple Network Management Protocol) is a protocol that enables the MIB values of servers and network devices to be acquired from the outside via a network. The operation management support system generally collects performance information in MIB format using SNMP. Some systems also have a function to collect performance information by logging in remotely to the server directly.

Patent Document 1: JP 2002-342182 A
Non-Patent Document 1: "Nikkei Business", March 18, 2002 issue

  Using the above-described conventional technology, the system administrator can monitor the performance information of the system, detect the performance degradation early, and analyze the cause of the performance degradation. However, the prior art has the following problems.

  Depending on the component, performance information may not be provided, or the means of providing it may be proprietary, so the operation management support server may be unable to collect it. In the conventional technology, when the performance information that is the cause of a component's performance degradation is not collected, the cause cannot be identified.

  The present invention is a performance information display technique for a system that has a managed system, in which at least one computer operates, processes received requests, and returns responses, and an operation management support system that collects performance information indicating the operating state and processing performance of the managed system. The managed system has a plurality of performance information types, each indicating a type of performance information. The operation management support system periodically collects performance information from the managed system for the selected performance information types and records a performance information history. When one of the performance information types that is not selected is specified, the operation management support system searches dependency information, which expresses the dependency between performance information types as a relationship between the performance information type that exerts an influence and the performance information type that is influenced. It then acquires the selected performance information types that are influenced by the specified performance information type and displays the performance information history for the acquired performance information types.

  According to the operation management support system of the present invention, performance degradation for a given performance information type can be judged from changes in the performance information of the other performance information types that it affects. Performance degradation related to a performance information type can therefore be judged even when its performance information is not collected, which is effective for identifying the cause of performance degradation of the entire managed system.

  FIG. 1 is a configuration diagram of a system according to an embodiment of the present invention. The system according to this embodiment includes one operation management support server 100 and a management target system 200 that includes a plurality of management target servers 222. The operation management support server 100 is connected to the management target servers 222 via the IP network 120.

  The operation management support server 100 is a computer including a CPU, a memory, an external storage device, and the like. In addition to a GUI (Graphical User Interface), the operation management support server 100 stores and executes on its CPU the programs of a configuration information / collection condition registration unit 104, an analysis / search instruction unit 103, a performance information collection unit 105, a threshold value monitoring unit 102, a correlation analysis unit 107, and a search unit 101. The configuration information / collection condition registration unit 104 receives the configuration definition information 600, which indicates the configuration of the management target system 200, and the performance information collection condition 112, and registers them in the database 106. The analysis / search instruction unit 103 accepts an instruction to analyze the cause of performance degradation. The performance information collection unit 105 collects performance information from the management target server 222. The threshold value monitoring unit 102 determines whether the collected performance information exceeds a predetermined threshold value and, if it does, displays a pop-up window to notify the administrator. The correlation analysis unit 107 calculates the strength of association between performance information types by correlation analysis. The search unit 101 searches for related performance information affected by a given item of performance information. The database 106, stored on the external storage device, holds the configuration information 500 about the management target system 200, the performance information collection conditions 112, and the collected performance information 113.

  The configuration information 500 includes a configuration information template 110 and configuration definition information 600. The configuration information template 110 is information serving as a template for configuration information, and includes an object definition 300 and a metric correlation 400. The configuration definition information 600 is information input by the administrator, and is created based on the configuration information template 110.

  The management target server 222 is a computer including a CPU, a memory, an external storage device, and the like. The managed server 222 stores each program of the managed server program 221 and the agent 201 in its memory and executes it by its CPU.

  The agent 201 is a program that operates on the management target server 222 on which the management target server program 221 operates and that collects performance information, such as the CPU usage rate, the memory usage rate, and the number of waiting requests of the management target server, by means such as interprocess communication. In this embodiment, the agent 201 is a program independent of the managed server program 221. However, if the managed server program 221 has an interface for providing performance information and the performance information collection unit 105 can acquire the performance information directly from the managed server program 221, the agent 201 is unnecessary.

  The performance information collection unit 105 transmits a pseudo request to the management target server program 221 and collects the response time until a response is obtained. The performance information collection unit 105 also transmits a performance information collection request to the agent 201 by SNMP. The agent 201 receives the performance information collection request from the performance information collection unit 105 and sends back performance information such as the CPU usage rate of the managed server 222 on which the agent operates and the number of waiting requests of the managed server program.

  FIG. 2 is a diagram illustrating a configuration of the management target system 200 according to the embodiment. The management target system 200 includes a plurality of management target servers 222, a storage apparatus 1 (207), and a storage apparatus 2 (208). The managed server programs 221 running on the managed servers 222 are a Web server 1 (202), a Web server 2 (203), an application server 1 (hereinafter referred to as AP server 1) (204), an application server 2 (hereinafter referred to as AP server 2) (205), and a database server (hereinafter referred to as DB server) 206.

  Web server 1 (202), Web server 2 (203), AP server 1 (204), AP server 2 (205), and DB server 206 may operate on the same managed server 222 or on different managed servers 222 connected by a network. In the present embodiment, it is assumed that each server program operates on a different managed server 222 and that the managed servers 222 are connected by the IP network 210. The agent 201 also runs on each managed server 222 on which the Web server 1 (202), the Web server 2 (203), the AP server 1 (204), or the AP server 2 (205) is running. It extracts the performance information of the managed server 222 itself, such as the CPU usage rate and the memory usage rate, and the performance information of the server program, such as the number of received requests of the Web server, and transmits it to the operation management support server 100.

  The storage device 1 (207) is connected to the DB server 206 and the storage device 2 (208) via an FC (Fibre Channel) network 220. The storage apparatus 1 (207) has a virtual storage function, so the DB server 206 can use the storage area on the storage apparatus 2 (208) in the same manner as the storage area on the storage apparatus 1 (207) itself.

  This managed system 200 realizes a membership shopping site provided by Company A. The services provided by the Company A shopping site are the member information management service and the online shopping service. In the managed system 200, the member information management service and the online store are monitored as separate services. Web server 1 (202) and AP server 1 (204) implement the member information management service, while Web server 2 (203) and AP server 2 (205) implement online shopping. The DB server 206 manages the member information accessed in common by these two services. The storage device 1 (207) and the storage device 2 (208) record the member information managed by the DB server 206, and the DB server 206 registers, updates, deletes, and references the member information recorded on the storage device 1 (207) and the storage device 2 (208).

  Next, a processing example of the shopping site realized by the management target system 200 will be described. It is assumed that the user computer 211 of this shopping site is connected to the management target system 200 via the IP network 120, and that the user changes the member information using the member information change Web page and then purchases a product on the online store Web page.

  The Web server 1 (202) has a member information change page. It receives an HTTP request issued by the computer 211 of a user of the Company A shopping site, extracts the member information change request from the HTTP request, and transmits it to the AP server 1 (204). The AP server 1 (204) receives the member information change request, creates an SQL request for changing the member information, and transmits it to the DB server 206. The DB server 206 receives the SQL request and changes the member information. The DB server 206 then transmits a response indicating that the change operation completed normally to the AP server 1 (204). The AP server 1 (204) receives this response and forwards it to the Web server 1 (202). The Web server 1 (202) receives this response and returns it as an HTTP response to the user's computer 211.

  On the other hand, the Web server 2 (203) has an online shopping page. The Web server 2 (203) receives an HTTP request issued by the user's computer 211, extracts member information, selected product information, and the like from the HTTP request, and transmits them to the AP server 2 (205). The AP server 2 (205) receives the member information and the selected product information and, in order to confirm the member information, creates an SQL request for acquiring member information and transmits it to the DB server 206. The DB server 206 receives the SQL request for acquiring the member information and transmits the member information to the AP server 2 (205) as a response. The AP server 2 (205) checks this member information. When it determines that the member information is correct, it creates an SQL request for registering a purchase record and transmits it to the DB server 206 in the same manner. The DB server 206 receives this SQL request and stores the purchase record.

  FIG. 3 is a diagram illustrating a data configuration of the object definition 300 included in the configuration information template 110 according to an embodiment of the present invention. The object definition 300 models a three-tier system and defines the related performance parameters. In this specification, information indicating the type of performance information is called a performance parameter. The three-tier system is a configuration used for realizing a Web application and includes a Web server, an AP server, and a DB server. Company A's shopping site is a three-tier system. Here, an object means a management target object of the management target system 200. The object definition defines the types of objects and, for each object type, the performance parameters that can be collected. The objects of the three-tier system include an HTTP service, a host, a Web server, an AP server, and a DB server. Here, the HTTP service object treats the entire service as a management target. The host means the management target server 222 as hardware. The object definition 300 according to the present embodiment includes an HTTP service object definition 310, a host object definition 320, a Web server object definition 330, an AP server object definition 340, and a DB server object definition 350.

  The performance parameters of the HTTP service include a total response time 311, which is the time required from the preparation for sending an HTTP request to the reception of the HTTP response; a setup time 312, which is the time required for DNS name resolution and TCP connection establishment; and an HTTP time 313, which is the time required from sending the HTTP request to receiving the HTTP response. The total response time is obtained by adding the setup time to the HTTP time. The host performance parameters include a CPU usage rate 321, a memory usage rate 322, and a line usage rate 323.

  The performance parameters of the Web server consist of the number of received requests 331, which is the number of requests received by the Web server per unit time; the average response time 332; and the request processing thread ratio 333, which is the ratio of the number of threads currently processing requests to the number of threads allocated to the Web server. The average response time is the average time from when the Web server receives a request until it returns a response. In general, there is a one-to-one correspondence between requests and the threads in use, so the number of threads in use increases as the number of requests increases.

  The performance parameters of the AP server include a JVM memory usage rate 341, which is the ratio of the memory in use to the memory allocated to the JVM (Java (registered trademark) Virtual Machine); a request processing thread ratio 342, which is the ratio of the number of threads currently processing requests to the number of threads allocated to the AP server; an execution waiting request count 343; and a DB average transaction time 344. In general, an AP server issues a plurality of transactions to a DB server for one request. The DB average transaction time is the average of the transaction time from when the AP server transmits a transaction to the DB server until it receives a response.

  The performance parameters of the DB server include the number of transactions 351, which is the number of transactions processed per unit time, and a cache hit rate 352. The cache hit rate 352 is the rate at which the DB server 206 finds the requested database data in the memory of the management target server 222.
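  To make the shape of the object definition 300 concrete, the following is a minimal sketch that holds it as a mapping from object type to performance parameters. The parameter names follow the description of FIG. 3. The dictionary layout, the constant name, and the helper functions are assumptions made for illustration, not the data format used by the patent.

```python
# Object types and their performance parameters, named after the
# description of FIG. 3. The dictionary layout itself is an assumption.
OBJECT_DEFINITION = {
    "HTTP service": ["total response time", "setup time", "HTTP time"],
    "host": ["CPU usage rate", "memory usage rate", "line usage rate"],
    "Web server": [
        "number of received requests",
        "average response time",
        "request processing thread ratio",
    ],
    "AP server": [
        "JVM memory usage rate",
        "request processing thread ratio",
        "number of requests waiting for execution",
        "DB average transaction time",
    ],
    "DB server": ["number of transactions", "cache hit rate"],
}

def performance_parameters(object_type):
    """Return a copy of the performance parameters defined for an object type."""
    return list(OBJECT_DEFINITION[object_type])

def total_response_time(setup_time, http_time):
    """Total response time is the setup time plus the HTTP time."""
    return setup_time + http_time

db_parameters = performance_parameters("DB server")
```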

  FIG. 4 is a diagram illustrating a data configuration example of the metric correlation 400 included in the configuration information template 110 according to the embodiment of this invention. The metric correlation 400 represents the dependency relationships between the performance parameters of the objects related to the performance of the managed system 200. In the figure, the source of an arrow is the performance parameter that exerts an influence, and the tip of the arrow is the performance parameter that is influenced.

  The number of received requests 331 of the Web server affects the request processing thread ratio 333 of the Web server and the execution waiting request count 343 of the AP server. The number of transactions 351 of the DB server affects the DB average transaction time 344 of the AP server, and the DB average transaction time 344 affects the execution waiting request count 343. The cache hit rate 352 of the DB server, in turn, affects the number of transactions 351. In this embodiment, since the AP server consists of the AP server 1 and the AP server 2, the number of transactions 351 of the DB server affects the DB average transaction time 344 and the execution waiting request count 343 of both AP servers. Other performance parameters also have metric correlations, but their dependencies are not simple, so only performance parameters with simple dependencies are shown here as examples.
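  The dependencies listed above form a small directed graph. The following is a minimal sketch, assuming an adjacency-list representation keyed by (object type, performance parameter) pairs. The edges are the ones named in the text; the representation and the function name are hypothetical.

```python
from collections import deque

# Edges run from the parameter that exerts an influence to the parameters it
# influences, as described for FIG. 4. The data layout is an assumption.
METRIC_CORRELATION = {
    ("Web server", "number of received requests"): [
        ("Web server", "request processing thread ratio"),
        ("AP server", "number of requests waiting for execution"),
    ],
    ("DB server", "cache hit rate"): [("DB server", "number of transactions")],
    ("DB server", "number of transactions"): [
        ("AP server", "DB average transaction time"),
    ],
    ("AP server", "DB average transaction time"): [
        ("AP server", "number of requests waiting for execution"),
    ],
}

def affected_parameters(source):
    """Return every parameter reachable from `source`, in breadth-first order."""
    found = []
    queue = deque([source])
    while queue:
        current = queue.popleft()
        for target in METRIC_CORRELATION.get(current, []):
            if target != source and target not in found:
                found.append(target)
                queue.append(target)
    return found

affected_by_transactions = affected_parameters(("DB server", "number of transactions"))
```

  Following the arrows transitively is a design choice in this sketch: it makes the number of transactions reach the execution waiting request count by way of the DB average transaction time.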

  FIG. 5 is a diagram showing a data configuration of the configuration information 500 according to an embodiment of the present invention. The configuration information 500 includes one or more items of system information 510. The system information 510 includes a management target system name 511 and one or more items of service information 520 for the services realized by the management target system. The service information 520 includes a service name 521 and one or more items of object information 530 for the objects included in the service. The object information 530 has an object name 531, an object type 532, object specific information 533, one or more performance parameters 535 possessed by the object, and a metric correlation 534, which is the dependency relationship between the performance parameters. The set of performance parameters 535 differs depending on the object type 532 and follows the object definition 300. The metric correlation 534 is the dependency indicated by the metric correlation 400 and is represented by link information that connects the performance parameter 535 that exerts an influence with the performance parameter 535 that is influenced.

  When the system configuration is large, the number of objects becomes very large, and inputting the object information 530 places a heavy burden on the administrator. The operation management support server 100 according to the present embodiment therefore provides templates for the performance parameters 535 and the metric correlation 534 as the configuration information template 110. The configuration information template 110 defines the object definition 300 and the metric correlation 400 corresponding to a three-tier system. Therefore, if the management target is a three-tier system, the administrator can use the configuration information template 110, which eliminates the need to input the metric correlation 534 and the performance parameters 535. The only parts of the configuration information 500 that the administrator inputs are the management target system name 511, the service name 521, the object name 531, the object type 532, and the object specific information 533. The information input by the administrator is collectively referred to as the configuration definition information 600.
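  As an illustration of how a template can spare the administrator this input, the following sketch expands administrator-entered fields into object information by copying the performance parameters from a template. The class name, field names, template excerpt, host name, and port numbers are all assumptions made for this example. The metric correlation links would be copied from the template in the same way and are omitted here for brevity.

```python
from dataclasses import dataclass, field

# Assumed excerpt of the configuration information template:
# object type -> performance parameters.
TEMPLATE = {
    "Web server": [
        "number of received requests",
        "average response time",
        "request processing thread ratio",
    ],
    "DB server": ["number of transactions", "cache hit rate"],
}

@dataclass
class ObjectInformation:
    object_name: str
    object_type: str
    specific_information: dict
    performance_parameters: list = field(default_factory=list)

def expand(definition):
    """Build object information from administrator input plus the template."""
    object_type = definition["object type"]
    if object_type not in TEMPLATE:
        raise ValueError(f"no template for object type {object_type!r}")
    return ObjectInformation(
        object_name=definition["object name"],
        object_type=object_type,
        specific_information=dict(definition["object specific information"]),
        performance_parameters=list(TEMPLATE[object_type]),
    )

# Hypothetical administrator input; the host name and ports are made up.
web_server_1 = expand({
    "object name": "Web server 1",
    "object type": "Web server",
    "object specific information": {
        "host name": "web1.example.com",
        "port number": 80,
        "management port number": 161,
    },
})
```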

  FIG. 6 is a diagram showing an example of configuration definition information 600 according to an embodiment of the present invention. The configuration definition information 600 is for defining the management target system 200, and the system name 511 of this example is “Company A shopping system”. The two service names 521 of the “Company A shopping system” are “member information management” and “online store”, respectively. Each service information 520 includes object information 530 for realizing a service.

  The object information 530 includes HTTP service information, Web server information, AP server information, and DB server information. In this example, there is one server of each kind, but a plurality of servers may be used. The HTTP service information includes an object name 531, an object type 532, and a URL as the object specific information 533. This URL is the destination address of the request. The Web server information, the AP server information, and the DB server information each include an object name 531, an object type 532, and a host name, port number, and management port number as the object specific information 533. The host name is the address of the managed server 222, the port number is the service port of the managed server program 221, and the management port number is the port number of the agent 201.

  The configuration information / collection condition registration unit 104 receives the configuration definition information 600 input by the administrator and registers it in the configuration information 500.

  FIG. 7 is a diagram showing the collection condition 112 according to an embodiment of the present invention. The collection condition 112 determines the performance information collection timing 710 and the performance parameters 720 to be collected. The collection timing 710 includes a measurement interval 711, which is the interval at which performance information is collected, and a timeout 712 for one measurement process. The timeout 712 is the time after which a measurement is aborted when there is no response. The collection target performance parameter 720 includes a definition 721, which indicates whether each performance parameter of each object is a performance information collection target, and a threshold value 722, which is used when threshold monitoring is performed on the performance information. The threshold value 722 is input only when the performance parameter is subject to threshold monitoring. The configuration information / collection condition registration unit 104 receives the collection condition 112 input by the administrator and registers it in the database 106.

  In this example, the total response time is collected for the member information management service and is monitored against a threshold of 20 seconds. The performance information collection targets of the Web server 1 are the number of received requests and the average response time. The performance information collection targets of the AP server are the JVM memory usage rate, the request processing thread ratio, the number of requests waiting for execution, and the DB average transaction time. Since the DB server 206 does not have an interface for providing performance information to the outside, its performance information cannot be collected in this embodiment. Since the host computers are not defined in the configuration definition information 600, they are excluded from performance information collection. The collection target performance parameter 720 can be defined in the same way for the online store service.
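  The collection condition can be pictured as a small record that holds the timing values and a per-parameter collect flag with an optional threshold. The following is a minimal sketch under that assumption. The class and field names are hypothetical, the timeout value is made up because the text gives none, and the 10-minute interval is the one used in the FIG. 8 example.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class CollectionTarget:
    collect: bool                      # definition 721: is this parameter collected?
    threshold: Optional[float] = None  # threshold value 722, only if monitored

@dataclass
class CollectionCondition:
    measurement_interval_s: int  # measurement interval 711
    timeout_s: int               # timeout 712
    targets: dict                # (object name, parameter) -> CollectionTarget

    def collected(self):
        """Return the (object name, parameter) keys selected for collection."""
        return [key for key, target in self.targets.items() if target.collect]

condition = CollectionCondition(
    measurement_interval_s=600,  # 10-minute interval
    timeout_s=60,                # assumed value, not stated in the text
    targets={
        ("member information management", "total response time"): CollectionTarget(True, 20.0),
        ("Web server 1", "number of received requests"): CollectionTarget(True),
        ("Web server 1", "average response time"): CollectionTarget(True),
        ("DB server", "number of transactions"): CollectionTarget(False),
        ("DB server", "cache hit rate"): CollectionTarget(False),
    },
)
```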

  FIG. 8 is a diagram showing an example of the collected performance information 113 according to an embodiment of the present invention. This performance information is the result of the operation management support server 100 collecting performance information for the member information management service at 10-minute intervals based on the collection condition 112.

  To measure the time required to change the member information, the performance information collection unit 105 transmits an HTTP request, which is a pseudo member information change request, to the Web server 1 (202), measures the time until a response is obtained, and creates the service performance information history 810. The performance information collection unit 105 also collects the number of received requests and the average response time from the agent 201 of the Web server 1 (202) and creates a Web server performance information history 820. Further, the performance information collection unit 105 collects the JVM memory usage rate, the number of requests waiting for execution, the request processing thread ratio, and the DB average transaction time from the agent 201 of the AP server 1 (204) and creates an AP server performance information history 830. The performance information collection unit 105 creates the same performance histories for the online store service.
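  The measurement of a pseudo request can be sketched as timing a blocking call and appending the result to a history. In the sketch below, the request is passed in as a callable so that no particular HTTP client is assumed. The function names and the history layout of (collection time, response time) pairs are hypothetical.

```python
import time

def measure_response_time(send_request, clock=time.perf_counter):
    """Return the seconds elapsed between sending a pseudo request and its response."""
    start = clock()
    send_request()  # assumed to block until the response arrives
    return clock() - start

def record_measurement(history, collected_at, send_request, clock=time.perf_counter):
    """Append one (collection time, response time) entry to a performance history."""
    history.append((collected_at, measure_response_time(send_request, clock)))
    return history

# Usage with a fake request and a fake clock so that the result is deterministic.
fake_ticks = iter([100.0, 102.5])
service_history = record_measurement(
    [], "10:00", lambda: None, clock=lambda: next(fake_ticks)
)
```

  In a real collector, `send_request` would issue the pseudo member information change request and the call would be repeated at the measurement interval.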

  Next, the procedure from the discovery of performance degradation to cause analysis will be described. The threshold value monitoring unit 102 of the operation management support server 100 monitors whether the total response time in the service performance information history 810 exceeds the threshold value 722 specified by the collection condition 112. Here, when the total response time 311 for changing the member information is 25 seconds and thus exceeds the 20-second threshold value 722 specified in the collection condition 112, the threshold value monitoring unit 102 displays a pop-up window to notify the administrator that the threshold has been exceeded.
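  The threshold check itself is a simple comparison over the history. The following is a minimal sketch using the 20-second threshold and the 25-second measurement from the example. The timestamps, the earlier measurements, and the function name are made up for illustration, and the pop-up notification is left out.

```python
def threshold_violations(history, threshold):
    """Return the (time, value) entries whose value exceeds the threshold."""
    return [(collected_at, value) for collected_at, value in history if value > threshold]

# Hypothetical total response time history in seconds.
total_response_history = [("10:00", 12.0), ("10:10", 15.0), ("10:20", 25.0)]
violations = threshold_violations(total_response_history, 20.0)
```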

  When the administrator finds that the total response time exceeds the threshold, the administrator instructs the analysis / search instruction unit 103 to investigate the cause. Upon receiving the instruction from the analysis / search instruction unit 103, the correlation analysis unit 107 performs correlation analysis using the total response time as the objective variable and the other performance parameters as explanatory variables. The correlation analysis unit 107 searches for other performance parameters that have a high correlation coefficient with the total response time. When a performance parameter having a high correlation coefficient is found by correlation analysis, other performance parameters that affect this performance parameter are adjusted. For example, if the performance parameter in question is the JVM memory usage rate, the amount of memory allocated to the JVM is increased.
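  The search for highly correlated performance parameters can be sketched as computing a coefficient for each candidate and keeping those above a cutoff. The sketch below assumes the Pearson coefficient and a cutoff of 0.8. Neither is specified in the text, and the sample histories and function names are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient; a constant series is treated as 0.0 (assumption)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)

def rank_explanatory_variables(objective, candidates, min_abs_r=0.8):
    """Return (name, r) pairs with |r| >= min_abs_r, strongest first."""
    scored = [(name, pearson(objective, series)) for name, series in candidates.items()]
    strong = [(name, r) for name, r in scored if abs(r) >= min_abs_r]
    return sorted(strong, key=lambda item: abs(item[1]), reverse=True)

# Hypothetical histories sampled at the same four collection times.
total_response_time = [10.0, 15.0, 20.0, 25.0]  # objective variable
candidate_histories = {
    "JVM memory usage rate": [50.0, 60.0, 70.0, 80.0],
    "number of received requests": [100.0, 90.0, 90.0, 100.0],
}
ranked = rank_explanatory_variables(total_response_time, candidate_histories)
```

  An empty result corresponds to the case where no performance parameter with a high correlation coefficient is found.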

  On the other hand, if no performance parameter having a high correlation coefficient is found by correlation analysis, the possibility that a performance parameter for which performance information has not been collected had an effect is examined. In this case, the administrator selects a performance parameter of the DB server 206 that is not a collection target under the collection condition 112 but may have had an influence, and instructs the analysis / search instruction unit 103 to search for related information. FIG. 9 is a diagram illustrating an example of a search instruction screen 900 displayed by the analysis / search instruction unit 103. The search instruction screen 900 includes a system configuration display unit 910, an execution button 920, and a cancel button 930. The system configuration display unit 910 displays the services that constitute the system, the components that constitute each service, and the performance parameters of the components in a tree format. This tree also displays performance parameters that have not been collected. In this example, to simplify the drawing, only the performance parameters of the DB server are displayed and the performance parameters of the other objects are omitted.

  The administrator selects, for example, the number of transactions 351, which is a performance parameter of the DB server 206, on the search instruction screen 900 and presses the execution button 920. The search unit 101 then refers to the metric correlation 400, searches for performance parameters that may be affected by the number of transactions 351, and displays time-series graphs of their performance information on the search result screen 1010 shown in FIG. 10. Since the performance parameters that may be affected by the number of transactions 351 are the DB average transaction time 344 and the number of pending requests 343, a graph is displayed for each of them.
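The lookup performed by the search unit 101 might be sketched as below. The dictionary encoding of the metric correlation 400, the identifier strings, and the function name `search_affected` are assumptions made for illustration; the source does not specify how the dependency information is stored.

```python
# Hypothetical encoding of the metric correlation 400: each key is an
# affecting performance parameter, each value lists the parameters it affects.
METRIC_CORRELATION = {
    "db.number_of_transactions": [
        "ap.db_average_transaction_time",
        "ap.number_of_pending_requests",
    ],
}

def search_affected(selected, metric_correlation, collection_targets):
    """Return the parameters affected by `selected` that are being collected.

    `selected` itself may be a parameter for which no history exists;
    only its collected dependents are returned for display.
    """
    affected = metric_correlation.get(selected, [])
    return [name for name in affected if name in collection_targets]

# Parameters assumed to be collection targets under the collection condition.
collection_targets = {
    "service.total_response_time",
    "ap.db_average_transaction_time",
    "ap.number_of_pending_requests",
}

graphs_to_show = search_affected(
    "db.number_of_transactions", METRIC_CORRELATION, collection_targets
)
```

Here `graphs_to_show` holds the two AP server parameters named in the text, whose histories would then be drawn as time-series graphs.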

  FIG. 10 is a diagram illustrating an example of a search result screen 1010 displayed by the search unit 101. In the result display unit 1011, the search result screen 1010 displays, in a line graph format, the changes in the performance information of the total response time 311, which currently exceeds the threshold, and of the performance parameters affected by the selected performance parameter, the number of transactions 351: the number of pending requests 343 of the AP server 1 (204) and the AP server 2 (205), and the DB average transaction number 344 of the AP server 1 (204) and the AP server 2 (205).

  As shown in the result display unit 1011, when the related performance information affected by the number of transactions 351 shows a similar change tendency, the administrator can infer that the number of transactions 351 has deteriorated in the same manner as the related performance parameters, and can therefore determine that there is a problem in the DB server 206. The administrator can make the same determination when the change over time in the performance information of the DB average transaction count 344 is displayed.

  On the other hand, as shown in the result display unit 1012, when the related performance parameters do not show a similar change tendency, the administrator can determine that the number of transactions 351 has not deteriorated and that the DB server 206 has no problem.

  The operation management support server 100 according to the present embodiment includes the configuration information template 110, which contains the object definition 300 and the metric correlation 400 defined in advance. However, the administrator may define the object definition 300 and the metric correlation 400 independently, in which case a GUI for inputting this information may be provided.

  In this embodiment, the administrator instructs a search for related performance parameters when the total response time exceeds the threshold. However, the operation management support server 100 may automatically execute the search for related performance parameters, triggered when the performance information of a specific parameter exceeds its threshold.
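The automatic variant could look roughly like the following sketch. The function name `check_thresholds`, the callback interface, and the sample values are hypothetical; the source states only that a threshold violation may trigger the search, not how the search is parameterized.

```python
def check_thresholds(latest_values, thresholds, run_search):
    """Call `run_search(name)` for every parameter whose value exceeds its threshold.

    Returns the names that triggered a search, in threshold-table order.
    """
    triggered = []
    for name, limit in thresholds.items():
        value = latest_values.get(name)
        # Parameters with no collected value are skipped, not treated as violations.
        if value is not None and value > limit:
            run_search(name)
            triggered.append(name)
    return triggered

# Hypothetical latest measurements and thresholds.
searches_started = []
triggered = check_thresholds(
    {"total_response_time": 3.5, "cpu_usage_rate": 30.0},
    {"total_response_time": 3.0, "cpu_usage_rate": 90.0},
    searches_started.append,  # stand-in for the related-parameter search
)
```

Passing the search as a callback keeps the threshold check independent of how the related performance parameters are actually looked up.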

  Further, the metric correlation 400 of the present embodiment represents the dependency relationships between the performance parameters. However, the metric correlation 400 may also hold information on the strength of each relationship in addition to the dependency itself, and when the search unit 101 obtains many types of related performance parameters, the display order may be determined according to the strength of the relationship.
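Ordering the results by relationship strength can be sketched as follows. The numeric strengths, the nested-dictionary layout, and the function name `ranked_affected` are assumptions for illustration; the source does not say how strength is represented.

```python
# Hypothetical metric correlation extended with a strength per dependency
# (larger means a stronger influence); the values are invented.
WEIGHTED_CORRELATION = {
    "db.number_of_transactions": {
        "ap.number_of_pending_requests": 0.6,
        "ap.db_average_transaction_time": 0.9,
    },
}

def ranked_affected(selected, weighted_correlation, collection_targets):
    """Return collected parameters affected by `selected`, strongest first."""
    affected = weighted_correlation.get(selected, {})
    candidates = [
        (name, strength)
        for name, strength in affected.items()
        if name in collection_targets
    ]
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in candidates]

display_order = ranked_affected(
    "db.number_of_transactions",
    WEIGHTED_CORRELATION,
    {"ap.number_of_pending_requests", "ap.db_average_transaction_time"},
)
```

With these invented strengths, the graph for the DB average transaction time would be shown before the graph for the number of pending requests.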

  As described above, according to the present embodiment, it is possible to determine the change tendency of the performance information related to the performance information type for which performance information is not collected. Therefore, by using the processing procedure of this embodiment in a complementary manner to the correlation analysis, it becomes possible to identify the cause of the performance deterioration at an early stage.

FIG. 1 is a block diagram of the system of the embodiment.
FIG. 2 is a block diagram of the managed system of the embodiment.
FIG. 3 is a diagram showing the data structure of the object definition of the embodiment.
FIG. 4 is a diagram showing an example of the data structure of the metric correlation.
FIG. 5 is a diagram showing the data structure of the configuration information of the embodiment.
FIG. 6 is a diagram showing an example of the configuration definition information.
FIG. 7 is a diagram showing the collection conditions of the embodiment.
FIG. 8 is a diagram showing an example of the collected performance information.
FIG. 9 is a diagram showing an example of the search instruction screen.
FIG. 10 is a diagram showing an example of the search result screen.

Explanation of symbols

100 ... operation management support server, 101 ... search unit, 102 ... threshold monitoring unit, 103 ... analysis / search instruction unit, 104 ... configuration information / collection condition registration unit, 105 ... performance information collection unit, 106 ... database, 107 ... correlation analysis unit, 110 ... configuration information template, 112 ... collection condition, 113 ... performance information, 200 ... managed system, 201 ... agent, 221 ... managed server program, 222 ... managed server, 300 ... object definition, 400 ... metric correlation, 500 ... configuration information, 600 ... configuration definition information

Claims (12)

  1. A performance information display method performed by an operation management support system in a system that includes a managed system, in which at least one computer operates and which processes a received request and returns a response, and the operation management support system, which collects performance information indicating the operating state and processing performance of the managed system, wherein
    the operation management support system:
    periodically collects performance information from the managed system for each performance information type selected as a collection target among a plurality of performance information types indicating the types of performance information of the managed system, and records a performance information history;
    when one of the performance information types not selected as a collection target is specified, searches dependency relationship information, which indicates the dependency relationship between performance information types as a relationship between an affecting performance information type and an affected performance information type, and acquires the performance information types that are affected by the specified performance information type and are selected as collection targets; and
    displays the performance information history for the acquired performance information types.
  2. The performance information display method according to claim 1, wherein a threshold value is set for one of the performance information types selected as a collection target, and when the performance information for the performance information type for which the threshold value is set deviates from the threshold value, an operation specifying a performance information type not selected as a collection target is accepted.
  3. The performance information display method according to claim 1, wherein the dependency relationship information includes information indicating the presence or absence of a dependency relationship between two or more performance information types and the strength of the dependency relationship, and the operation management support system ranks the selected performance information types that are affected by the specified performance information type according to the strength of the dependency relationship in the dependency relationship information.
  4. The performance information display method according to claim 1, wherein the managed system includes a plurality of Web servers, a plurality of AP servers, and one database server shared by the plurality of AP servers, and the non-selected performance information type is a performance information type related to the database server.
  5. A program executed by an operation management support computer in a system that includes a managed system, in which at least one computer operates and which processes a received request and returns a response, and the operation management support computer, which collects performance information indicating the operating state and processing performance of the managed system, the program causing the operation management support computer to realize:
    a function of periodically collecting performance information from the managed system and recording a performance information history for each performance information type selected as a collection target among a plurality of performance information types indicating the types of performance information of the managed system;
    a function of, when one of the performance information types not selected as a collection target is specified, searching dependency relationship information, which indicates the dependency relationship between performance information types as a relationship between an affecting performance information type and an affected performance information type, and acquiring the performance information types that are affected by the specified performance information type and are selected as collection targets; and
    a function of displaying the performance information history for the acquired performance information types.
  6. The program according to claim 5, further causing the operation management support computer to realize a function of setting a threshold for one of the performance information types selected as a collection target and, when the performance information for the performance information type for which the threshold is set deviates from the threshold, accepting an operation specifying a performance information type not selected as a collection target.
  7. The program according to claim 5, wherein the managed system includes a plurality of Web servers, a plurality of AP servers, and one database server shared by the plurality of AP servers, and the non-selected performance information type is a performance information type related to the database server.
  8. An operation management support system in a system that includes a managed system, in which at least one computer operates and which processes a received request and returns a response, and the operation management support system, which collects performance information indicating the operating state and processing performance of the managed system, the operation management support system comprising:
    a storage device that stores performance information types indicating the types of performance information of the managed system, a history of collected performance information, and dependency relationship information indicating the dependency relationship between performance information types as a relationship between an affecting performance information type and an affected performance information type;
    a performance information collection unit that periodically collects performance information from the managed system and records a performance information history for each performance information type selected as a collection target among the performance information types; and
    a search unit that, when one of the performance information types not selected as a collection target is specified, searches the dependency relationship information, acquires the performance information types that are affected by the specified performance information type and are selected as collection targets, and displays the performance information history for the acquired performance information types.
  9. The operation management support system according to claim 8, further comprising an analysis / search instruction unit that, when a threshold value is set for one of the performance information types selected as a collection target and the performance information for the performance information type for which the threshold value is set deviates from the threshold value, accepts an operation specifying a performance information type not selected as a collection target.
  10. The operation management support system according to claim 8, wherein the dependency relationship information includes information indicating the presence or absence of a dependency relationship between two or more performance information types and the strength of the dependency relationship, and the search unit ranks the selected performance information types that are affected by the specified performance information type according to the strength of the dependency relationship in the dependency relationship information.
  11. The operation management support system according to claim 8, wherein the managed system includes a plurality of Web servers, a plurality of AP servers, and one database server shared by the plurality of AP servers, and the non-selected performance information type is a performance information type related to the database server.
  12. The program according to claim 5, wherein the dependency relationship information includes information indicating the presence or absence of a dependency relationship between two or more performance information types and the strength of the dependency relationship, the program further causing the operation management support computer to realize a function of ranking the selected performance information types that are affected by the specified performance information type according to the strength of the dependency relationship in the dependency relationship information.
JP2004189267A 2004-06-28 2004-06-28 Operation management support system and performance information display method Active JP4430989B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2004189267A JP4430989B2 (en) 2004-06-28 2004-06-28 Operation management support system and performance information display method

Publications (3)

Publication Number Publication Date
JP2006011902A JP2006011902A (en) 2006-01-12
JP2006011902A5 JP2006011902A5 (en) 2006-01-12
JP4430989B2 true JP4430989B2 (en) 2010-03-10

Family

ID=35779089

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2004189267A Active JP4430989B2 (en) 2004-06-28 2004-06-28 Operation management support system and performance information display method

Country Status (1)

Country Link
JP (1) JP4430989B2 (en)

Legal Events

Date Code Title Description
RD02 Notification of acceptance of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7422

Effective date: 20070222

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20070222

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20070222

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20090519

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20090526

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20090716

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20091201

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20091218

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20121225

Year of fee payment: 3

R150 Certificate of patent or registration of utility model

Ref document number: 4430989

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20131225

Year of fee payment: 4