JP4453823B2 - Performance bottleneck analysis system and performance bottleneck analysis method

Info

Publication number: JP4453823B2
Application number: JP2004160755A
Authority: JP
Inventors: 育大網代
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2004-05-31
Filing date: 2004-05-31
Publication date: 2010-04-21
Anticipated expiration: 2024-05-31
Also published as: JP2005339437A

Description

The present invention relates to an information processing system and an information processing method, and more particularly to a performance bottleneck analysis system and a performance bottleneck analysis method for a computer system that identify the cause when the performance of the computer system degrades or fails to improve as expected.

An example of a conventional performance bottleneck analysis system for a computer system is described in Patent Document 1. As shown in FIG. 20, this conventional performance bottleneck analysis system 10 includes an input device 12, a processing device 14, and a storage device 16. The processing device 14 includes measurement target setting means 18, semaphore wait state start detection means 20, semaphore wait state end detection means 22, semaphore information search means 24, and semaphore wait accumulated time calculation means 26. The storage device 16 includes semaphore wait information list storage means 28.

The conventional performance bottleneck analysis system 10 configured as above operates as follows. First, the user specifies, through the input device 12, a list of identifiers of the semaphores to be measured, and the list is sent to the measurement target setting means 18. The measurement target setting means 18 attaches to each semaphore identifier a semaphore wait accumulated time, which is the cumulative time spent waiting on that semaphore, and stores the result in the semaphore wait information list storage means 28. Each semaphore wait accumulated time is initialized to 0.

The semaphore wait state start detection means 20 detects that a process (task) running on the computer system 30 has entered a wait state on some semaphore, and obtains the time of detection (the wait start time). It then passes the identifier of the detected semaphore and the wait start time to the semaphore information search means 24.

The semaphore information search means 24 looks up the semaphore identifier received from the semaphore wait state start detection means 20 in the semaphore wait information list stored in the semaphore wait information list storage means 28 of the storage device 16, and checks whether that semaphore is a measurement target. If it is, the search means attaches the wait start time to the semaphore identifier and updates the semaphore wait information list held in the storage means 28.

The semaphore wait state end detection means 22 detects that a process has left the wait state on some semaphore and started other processing, and obtains the time of detection (the wait end time). It then passes the identifier of the detected semaphore and the wait end time to the semaphore information search means 24.

The semaphore information search means 24 looks up the semaphore identifier received from the semaphore wait state end detection means 22 and checks whether that semaphore is a measurement target. If it is, the search means reads the wait start time attached to the identifier and the wait accumulated time at that point, and passes the wait start time, the wait end time, and the wait accumulated time to the semaphore wait accumulated time calculation means 26.

The semaphore wait accumulated time calculation means 26 computes the current wait time from the received wait start time and wait end time, and adds it to the wait accumulated time received with them. It then passes the updated wait accumulated time to the semaphore information search means 24.

The semaphore information search means 24 writes the new wait accumulated time, together with the semaphore identifier, to the semaphore wait information list storage means 28 of the storage device 16, thereby updating the semaphore wait information list. At this point the wait start time attached to the semaphore identifier is deleted.

After the program finishes, the semaphore with the largest semaphore wait accumulated time is identified as the bottleneck.
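The conventional bookkeeping described above can be sketched as follows. This is a minimal illustration and not code from Patent Document 1: the class and method names are hypothetical, times are plain floating-point seconds supplied by the caller, and, as in the description, only one wait start time is held per semaphore identifier.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass
class SemaphoreWaitEntry:
    accumulated: float = 0.0            # wait accumulated time, initialized to 0
    wait_start: Optional[float] = None  # present only while a wait is in progress


class SemaphoreWaitList:
    """Hypothetical stand-in for the semaphore wait information list (28)."""

    def __init__(self, measured_ids: Iterable[str]) -> None:
        self.entries: Dict[str, SemaphoreWaitEntry] = {
            sem_id: SemaphoreWaitEntry() for sem_id in measured_ids
        }

    def on_wait_start(self, sem_id: str, now: float) -> None:
        entry = self.entries.get(sem_id)
        if entry is not None:           # ignore semaphores that are not measured
            entry.wait_start = now

    def on_wait_end(self, sem_id: str, now: float) -> None:
        entry = self.entries.get(sem_id)
        if entry is not None and entry.wait_start is not None:
            entry.accumulated += now - entry.wait_start
            entry.wait_start = None     # the wait start time is deleted

    def bottleneck(self) -> str:
        # The semaphore with the largest accumulated wait time.
        return max(self.entries, key=lambda s: self.entries[s].accumulated)
```

Note that this scheme needs hooks at every wait start and wait end, which is the dependence on a special execution environment that the description goes on to criticize.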

Patent Document 1: JP-A-10-63516 (Japanese Unexamined Patent Application Publication No. H10-63516)

However, in a given computer system, a semaphore with a long wait time is not necessarily a bottleneck. A long wait may be intentional, and as long as the other related processes make effective use of hardware resources such as the CPU, the wait does not reduce the performance efficiency of the system.

In addition, when several processes wait on a particular semaphore, the true bottleneck may go undetected. Because the wait accumulated time is computed separately for each process, a semaphore on which many processes wait, each for a relatively short accumulated time, is not identified as a bottleneck.

Furthermore, because the times at which process states change must be measured, the program execution environment needs a special mechanism to detect when a process enters a wait state and when that wait state ends.

Moreover, because the user must specify the semaphores to observe, the user needs to know the internals of the program under analysis.

The present invention has been made in view of the above problems of conventional performance bottleneck analysis systems for computer systems. An object of the present invention is to provide a new and improved performance bottleneck analysis system and performance bottleneck analysis method for a computer system that can detect, using a general-purpose execution environment and tools, performance degradation caused by waiting on software resources, such as waiting for semaphores or file locks or waiting for data to arrive over a network, even for a computer system whose software internals are not well understood.

Another object is to provide a new and improved performance bottleneck analysis system and performance bottleneck analysis method for a computer system that can estimate the amount of hardware resources, such as CPUs, best suited to running the software.

To solve the above problems, according to one aspect of the present invention, there is provided a performance bottleneck analysis system for a computer system in which a plurality of processes use at least one hardware resource (hereinafter, HW resource), the system comprising: process state measuring means for measuring, for each of the plurality of processes, a process state that indicates whether the process is in an HW resource use state, in which it is using the HW resource, or in an SW resource wait state, in which it is waiting to acquire one of a plurality of software resources (hereinafter, SW resources) that are used mutually exclusively, and that indicates the type of SW resource awaited by a process in the SW resource wait state; underutilization detecting means for detecting an HW resource underutilization state in which the number of processes in the HW resource use state is smaller than the number of HW resources; counting means for counting, when the HW resource underutilization state is detected, the number of processes in the SW resource wait state (hereinafter, the number of waiting processes) for each of the plurality of SW resources; and output means for outputting the number of waiting processes obtained for each SW resource as a performance bottleneck analysis result.

With this configuration, the process state measuring means measures the state of each process running on the external target system; from the measurement result, the underutilization detecting means detects the HW underutilization state, in which fewer processes are using HW resources than there are HW resources; and when that state is detected, the counting means calculates the number of processes waiting on each software resource during the HW underutilization state. In other words, the system can detect the HW underutilization state, in which the number of processes using HW (hardware) resources falls below the number of HW resources, and can calculate the number of processes waiting on each SW resource in that state. By regarding an SW resource with many waiting processes as a bottleneck, the system can identify the SW (software) resource that is the performance bottleneck.

Here, the process state measuring means may measure the process state a plurality of times, and the system may further comprise average value calculating means for calculating, from the process counts obtained by the counting means, an average process count to be used as the number of waiting processes.

With this configuration, the average number of processes waiting on each SW resource is calculated from multiple measurements, so the SW resource that is the performance bottleneck can be identified more accurately.
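As a rough illustration of averaging over repeated measurements, the sketch below takes one per-resource count dictionary per measurement and returns the mean count per SW resource. The function name and the dictionary representation are assumptions made for this example; a resource absent from a given measurement is treated as having zero waiting processes in it.

```python
from collections import Counter
from typing import Dict, Iterable, Mapping


def average_wait_counts(samples: Iterable[Mapping[str, int]]) -> Dict[str, float]:
    """Average the number of waiting processes per SW resource over all samples."""
    totals: Counter = Counter()
    n_samples = 0
    for sample in samples:
        totals.update(sample)   # adds this sample's counts to the running totals
        n_samples += 1
    if n_samples == 0:
        return {}
    return {resource: total / n_samples for resource, total in totals.items()}
```

For example, `average_wait_counts([{"sem_a": 2, "lock_b": 1}, {"sem_a": 4}])` gives an average of 3.0 for `sem_a` and 0.5 for `lock_b`.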

The system may further comprise measurement time acquiring means for calculating the time the process state measuring means took to perform the measurement, and HW bottleneck determining means for comparing that time with a predetermined threshold to determine whether the bottleneck lies on the HW resource side. If the measurement time exceeds the threshold, the bottleneck is determined to lie on the HW resource side, and the output means outputs that the HW resource is the bottleneck. If the measurement time is within the threshold, the bottleneck is determined not to lie on the HW resource side, and the counting means counts the number of waiting processes.

With this configuration, because measuring the process state itself uses HW resources, the time the measurement takes can be calculated and used to determine whether the performance bottleneck lies on the HW resource side.
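The idea of using the measurement's own duration as a probe for HW contention might look like the sketch below. The function names, the injectable `clock` parameter, and any concrete threshold value are assumptions for illustration; the text only states that the time is compared with a predetermined threshold.

```python
import time
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def timed_measurement(
    measure: Callable[[], T],
    clock: Callable[[], float] = time.monotonic,
) -> Tuple[T, float]:
    """Run one process state measurement and return (result, elapsed seconds)."""
    start = clock()
    result = measure()
    return result, clock() - start


def hw_is_bottleneck(elapsed: float, threshold: float) -> bool:
    """True when the measurement took longer than the threshold."""
    return elapsed > threshold
```

A caller would skip the per-resource counting and report an HW bottleneck when `hw_is_bottleneck` returns `True`, and proceed to count waiting processes otherwise.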

The system may further comprise HW configuration changing means for changing the number of HW resources of the computer system when the HW bottleneck determining means determines that the bottleneck lies on the HW resource side.

With this configuration, the measurement time acquisition unit calculates the time the process state measuring means took to measure the state of each process running on the external system, and the HW bottleneck determination unit determines from that result whether the HW resource is the bottleneck. If it is, the number of HW resources is increased by a fixed amount; if it is not, the number of HW resources is decreased by a fixed amount. Because the number of HW resources is raised or lowered according to where the performance bottleneck lies, it is adjusted automatically, and the system can be operated with an optimal number of HW resources.
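One step of this adjustment rule can be sketched as a pure function. The name `adjust_hw_count`, the default step of 1, and the lower and upper bounds are assumptions added for the example; the text specifies only that the count goes up by a fixed amount when HW is the bottleneck and down by a fixed amount when it is not.

```python
from typing import Optional


def adjust_hw_count(
    current: int,
    hw_bottleneck: bool,
    step: int = 1,
    minimum: int = 1,
    maximum: Optional[int] = None,
) -> int:
    """Return the HW resource count to use for the next measurement round."""
    if hw_bottleneck:
        proposed = current + step
        if maximum is not None:
            proposed = min(proposed, maximum)   # never exceed what is installed
        return proposed
    return max(current - step, minimum)          # keep at least `minimum` resources
```

Applied repeatedly, the count would be expected to hover around the point where HW stops being the bottleneck, which is one way to read the estimate of a suitable resource amount.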

The system may also comprise filtering means for sorting and/or extracting the numbers of waiting processes according to a predetermined criterion.

With this configuration, SW resources with many waiting processes can be extracted, so the SW resource that is the performance bottleneck can be identified more quickly.

As described above, according to the present invention, performance bottlenecks caused by waiting on software resources, such as waiting for semaphores or file locks or waiting for data to arrive over a network, can be detected using a general-purpose execution environment and tools, even for a computer system whose software internals are not well understood.

In addition, by automatically adjusting the number of HW resources, the amount of hardware resources, such as CPUs, best suited to running the software can be estimated.

Preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings. In this specification and the drawings, components having substantially the same functional configuration are given the same reference numerals, and redundant description of them is omitted.

(First Embodiment)
First, the configuration of a performance bottleneck analysis system for a computer system according to the first embodiment of the present invention is described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of the performance bottleneck analysis system for a computer system according to the first embodiment of the present invention.

As shown in FIG. 1, the performance bottleneck analysis system 100 of this embodiment comprises a processing device 102, a storage device 104, and an output device 106.

The processing device 102 includes process state measuring means 108, an HW (hardware) resource use process count comparison unit 110, an SW (software) resource wait process count unit 112, and an output unit 114. The storage device 104 includes an HW resource count storage unit 116, which stores the number of HW resources of the external system 120, and an SW resource queue length storage unit 118.

Next, the operation of each of the above components of the performance bottleneck analysis system 100 of this embodiment is outlined below.

The process state measuring means 108 of the processing device 102 acquires information on the execution state of the processes running on the external system 120, and passes parts of that information to the HW resource use process count comparison unit 110 and the SW resource wait process count unit 112 in the processing device 102.

The HW resource use process count comparison unit 110 receives data on the processes using HW resources from the process state measuring means 108, receives the number of HW resources of the external system 120 from the HW resource count storage unit 116 of the storage device 104, and compares the number of processes using HW resources with the number of HW resources. It then reports the resulting magnitude relationship between the two numbers to the SW resource wait process count unit 112.

The SW resource wait process count unit 112 receives data on the processes in the SW resource wait state from the process state measuring means 108, and receives the magnitude relationship between the number of processes using HW resources and the number of HW resources from the HW resource use process count comparison unit 110. If the number of processes using HW resources is smaller than the number of HW resources, the unit extracts the type of SW resource each waiting process is waiting for, counts how many processes are waiting on each SW resource, and stores the resulting number of waiting processes for each SW resource in the SW resource queue length storage unit 118.

The output unit 114 reads the number of waiting processes for each SW resource from the SW resource queue length storage unit 118 and outputs it to the output device 106, such as a display device, a file, or a disk.

Next, the operation of the performance bottleneck analysis system 100 of this embodiment is described in detail with reference to FIGS. 1 and 2. FIG. 2 is a flowchart showing the operation of the performance bottleneck analysis system 100 for a computer system according to this embodiment.

First, as advance preparation, the number of HW (hardware) resources of the external system 120 is set in the HW resource count storage unit 116 (step A11). Here, an HW resource is a physical resource, such as a CPU (central processing unit), that multiple users (processes) share under a scheduling policy such as time sharing. A computer system can generally have multiple CPUs. The number of CPUs can be found from the system specifications, from firmware such as the BIOS (basic input/output system), or from commands and functions provided by the OS.
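As one example of an OS-provided function for step A11, Python's standard library exposes the logical CPU count through `os.cpu_count()`. The sketch below assumes the analysis runs on the same machine as the target system; the helper name and the fallback value are hypothetical.

```python
import os


def detect_hw_resource_count(default: int = 1) -> int:
    """Return the number of logical CPUs, or `default` if it cannot be determined."""
    count = os.cpu_count()   # may return None on platforms where it is unknown
    return count if count else default
```

For a remote target system, the value would instead have to come from that system's specifications or from a command run on it.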

Next, the number of waiting processes QSi for each software resource Si, stored in the SW resource queue length storage unit 118, is initialized to 0 for all resources (step A12). Steps A11 and A12 may be performed in the reverse order.

Next, the process state measuring means 108 measures the current execution state of the processes running on the external system 120 (step A13). The execution state of a process comprises two kinds of information. One is whether the process is currently using an HW resource (runnable state) or waiting for a particular SW (software) resource to become available (sleeping state). The other is the type of SW resource a sleeping process is waiting for.

SW resources include semaphores and locks used for mutual exclusion, as well as flags indicating that data transfer to or from a network or disk has completed. On a UNIX (registered trademark; the same applies hereinafter) family OS such as Linux, this process state information can be obtained, for example, by issuing the ps command on the external system 120.
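A minimal sketch of such a measurement on Linux is shown below, assuming a procps-style `ps` that accepts `-e -o pid,stat,wchan`. Treating state `R` as "using an HW resource", states `S` and `D` as "sleeping", and the `WCHAN` column as a rough proxy for the awaited SW resource are interpretations made for this example, not details given in the text; the type and function names are hypothetical.

```python
import subprocess
from typing import List, NamedTuple, Optional


class ProcState(NamedTuple):
    pid: int
    runnable: bool                 # True: using (or ready to use) an HW resource
    wait_resource: Optional[str]   # what a sleeping process waits on, else None


def parse_ps_output(text: str) -> List[ProcState]:
    """Parse the output of `ps -e -o pid,stat,wchan` into process states."""
    states: List[ProcState] = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 3 or not fields[0].isdigit():
            continue               # skip the header and malformed lines
        pid, stat, wchan = int(fields[0]), fields[1], fields[2]
        if stat.startswith("R"):
            states.append(ProcState(pid, True, None))
        elif stat[0] in "SD":      # interruptible or uninterruptible sleep
            states.append(ProcState(pid, False, wchan))
    return states


def measure_process_states() -> List[ProcState]:
    """Take one snapshot of all processes by running ps on the local system."""
    completed = subprocess.run(
        ["ps", "-e", "-o", "pid,stat,wchan"],
        capture_output=True, text=True, check=True,
    )
    return parse_ps_output(completed.stdout)
```

Separating parsing from the `ps` invocation lets the same parser handle output captured on a remote target system.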

The performance bottleneck analysis system 100 of this embodiment may also run on the same system as the external system 120.

Next, the HW resource use process count comparison unit 110 counts the number R of processes using HW resources (the number of runnable processes) as measured by the process state measuring means 108 (step A14), refers to the HW resource count storage unit 116, and compares R with the number of HW resources (step A15).

The situation to be detected here is one in which the number of runnable processes R falls below the number of HW resources (the HW underutilization state). If this occurs frequently, performance is likely degraded because the HW resources of the system 120 are not fully used. In most cases the cause is processes waiting on SW resources.

In general, a process enters many wait states in the course of computation. Besides waiting to acquire a semaphore that prevents multiple processes from executing a critical section simultaneously, or a file lock that prevents simultaneous updates to a file or entry, wait states include waiting for network transmission or reception and waiting for disk I/O.

Consider a system that is a server returning results in response to client requests. As long as the requests are uniform and the HW resource utilization is constant, the number of requests the system can process in a given time is roughly constant. This holds regardless of the request arrival intervals, because the amount of computation needed to process a given number of requests is constant unless the processing triggered by a request does wasteful computation such as an empty loop.

Accordingly, while the number of arriving requests stays within the processing capacity, the request response time also stays within a certain bound; once the number of arriving requests exceeds the limit of the system's processing capacity, the response time starts to deteriorate sharply. This is a vicious circle: when requests waiting to be processed begin to pile up in the system, response times get worse, and as a result even more requests pile up.

In other words, request response time normally does not deteriorate much until HW resource utilization reaches its limit. Conventional performance bottleneck analysis therefore monitors HW resource utilization, such as CPU load, and identifies highly utilized hardware as the bottleneck when utilization becomes very high. However, a process waiting on an SW resource consumes no HW resources, so the cause of the degradation is difficult to identify from HW resource utilization figures such as CPU load, which are the usual targets of analysis. The present invention aims to detect performance degradation caused by SW resource waits and to identify the SW resource responsible.

The response time of the system can be expressed as the sum of the HW resource use time and the SW resource wait time. Here, "using HW" means that the computing entities share the HW resource, each using it a little at a time; strictly speaking, the HW use time is the sum of the time spent physically using the HW resource and the HW resource wait time.
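The decomposition just described can be written compactly. The symbols are introduced here only for illustration and do not appear in the original text: T_resp is the response time, T_HWuse the HW resource use time, T_SWwait the SW resource wait time, T_HWphys the time spent physically using the HW resource, and T_HWwait the HW resource wait time.

```latex
\begin{align}
T_{\mathrm{resp}}  &= T_{\mathrm{HWuse}} + T_{\mathrm{SWwait}} \\
T_{\mathrm{HWuse}} &= T_{\mathrm{HWphys}} + T_{\mathrm{HWwait}}
\end{align}
```

Substituting the second line into the first gives T_resp = T_HWphys + T_HWwait + T_SWwait, which separates the three contributions to the response time.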

In the HW underutilization state, a process can occupy an HW resource almost exclusively, so the HW resource wait time is relatively small and the HW resource use time can be regarded as roughly constant. For an SW resource that is a bottleneck, however, acquiring the resource takes a long time and processes waiting on it accumulate. It is therefore reasonable to examine which SW resources the sleeping processes are waiting for and to regard an SW resource with many waiting processes as the bottleneck.

As the next operation, when the HW underutilization state is observed, the SW resource wait process count unit 112 examines, from the sleeping process information measured by the process state measuring means 108, the SW resource each process is waiting for, counts one by one how many sleeping processes are waiting on each SW resource, and stores the counts in the SW resource queue length storage unit 118 (steps A16 and A17).

When all sleeping processes have been counted, the output unit 114 outputs the number of waiting processes for each SW resource to the output device 106, such as a display device, a file, or a disk. The result may also be sent to an external system or device. Furthermore, if the HW resource use process count comparison unit 110 did not observe the HW underutilization state, the output unit can output a message to that effect.
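Steps A14 to A17 for a single measurement can be sketched as below. The function name and the representation of each process as a `(runnable, awaited_resource)` pair are assumptions for this example; returning `None` stands in for the case in which the HW underutilization state was not observed.

```python
from collections import Counter
from typing import Dict, Iterable, Optional, Tuple

ProcessState = Tuple[bool, Optional[str]]   # (runnable?, awaited SW resource)


def count_sw_waiters(
    states: Iterable[ProcessState], hw_count: int
) -> Optional[Dict[str, int]]:
    """Return waiting-process counts per SW resource, or None if HW is fully used."""
    snapshot = list(states)
    runnable = sum(1 for is_runnable, _ in snapshot if is_runnable)   # step A14
    if runnable >= hw_count:                                          # step A15
        return None                      # HW underutilization was not observed
    queue_lengths = Counter(                                          # steps A16, A17
        resource
        for is_runnable, resource in snapshot
        if not is_runnable and resource is not None
    )
    return dict(queue_lengths)
```

With two CPUs, one runnable process, two processes sleeping on `sem_a`, and one on `file_lock`, the function returns `{"sem_a": 2, "file_lock": 1}`, pointing at `sem_a` as the likelier bottleneck.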

As detailed above, this embodiment detects the situation in which the number of processes using HW resources is below the number of HW resources and calculates the number of processes waiting on each SW resource in that situation, so the SW resource that is the performance bottleneck can be identified.

(Second Embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to the second embodiment of the present invention is described with reference to the drawings. FIG. 3 is a block diagram showing the configuration of the performance bottleneck analysis system 200 for a computer system according to the second embodiment of the present invention.

As shown in FIG. 3, the performance bottleneck analysis system 200 according to the second embodiment of the present invention differs from the performance bottleneck analysis system 100 according to the first embodiment shown in FIG. 1 in that the processing device 202 includes a filtering unit 220.

The filtering unit 220 receives the number of waiting processes for each SW resource from the SW resource queue length storage unit 218 and passes the SW resources, extracted or sorted according to a given condition, to the output unit 214.

Next, the overall operation of the performance bottleneck analysis system 200 of this embodiment will be described in detail with reference to FIGS. 3 and 4. FIG. 4 is a flowchart showing the operation of the performance bottleneck analysis system 200 for a computer system according to this embodiment.

The operations of the process state measuring means 108, the HW resource utilization process number comparison unit 110, and the SW resource waiting process count unit 112 in this embodiment, shown as steps A21 to A27 in FIG. 4, are the same as those of the corresponding parts in the first embodiment, so their description is omitted.

In the first embodiment, processing ends once the number of waiting processes for each SW resource has been obtained. In this embodiment, the filtering unit 220 passes the SW resources, extracted or sorted by the number of waiting processes, to the output unit 214 (step A28 in FIG. 4). Possible extraction or sorting methods include: extracting only the SW resources whose number of waiting processes exceeds a certain value; sorting the SW resources in descending or ascending order of the number of waiting processes; or sorting the SW resources by the number of waiting processes and then extracting, in sorted order, only those whose number of waiting processes exceeds a certain value. The output unit 214 may be given the number of waiting processes together with the extracted or sorted SW resources.
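The three extraction and sorting options listed above can be covered by one small helper. This is a sketch under assumptions: the function name `filter_and_sort`, its parameters, and the dictionary input are illustrative and do not appear in the patent.

```python
def filter_and_sort(queue_lengths, threshold=None, descending=True):
    """Extract and/or sort SW resources by number of waiting processes.

    queue_lengths: dict {sw_resource: waiting process count}.
    threshold: if given, keep only resources whose count exceeds it.
    descending: sort from most waiters to fewest when True.
    Returns a list of (sw_resource, count) pairs, so the counts travel
    together with the resources to the output stage.
    """
    items = list(queue_lengths.items())
    if threshold is not None:
        items = [(res, n) for res, n in items if n > threshold]
    return sorted(items, key=lambda item: item[1], reverse=descending)

# Hypothetical counts for three SW resources.
ranked = filter_and_sort({"lock_a": 3, "lock_b": 1, "db_pool": 5}, threshold=2)
```

Passing `threshold=None` gives the pure sort, and the returned pairs keep the waiting process counts alongside the resource names, matching the option of outputting both.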

As described above, this embodiment is configured so that SW resources with many waiting processes can be extracted, and the SW resource that is the performance bottleneck can therefore be identified more quickly.

(Third Embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to a third embodiment of the present invention will be described with reference to the drawings. FIG. 5 is a block diagram showing the configuration of a performance bottleneck analysis system 300 for a computer system according to the third embodiment of the present invention.

As shown in FIG. 5, the performance bottleneck analysis system 300 according to the third embodiment of the present invention differs from the performance bottleneck analysis system 100 according to the first embodiment shown in FIG. 1 in that the processing device 302 includes an average value calculation unit 320 and the storage device 304 includes an unutilized event occurrence count storage unit 322.

The average value calculation unit 320 refers to the one or more waiting process counts stored for each SW resource in the SW resource queue length storage unit 318, calculates the average number of waiting processes for each SW resource, and provides it to the output unit 314.

Next, the overall operation of the performance bottleneck analysis system 300 of this embodiment will be described in detail with reference to FIGS. 5 and 6. FIG. 6 is a flowchart showing the operation of the performance bottleneck analysis system 300 for a computer system according to this embodiment.

The operations of the process state measuring means 108, the HW resource utilization process number comparison unit 110, and the SW resource waiting process count unit 112 in this embodiment, shown as steps A31 to A37 in FIG. 6, are the same as those of the corresponding parts in the first embodiment, so their description is omitted.

In the first embodiment, the process state is measured only once, whereas in this embodiment it is measured multiple times. For this purpose, after step A32, a variable C for counting the number of occurrences of the HW underutilization state is initialized to 0 and stored in the unutilized event occurrence count storage unit 322 (step B31 in FIG. 6). Step B31 may instead be performed before step A32 or before step A31.

Next, it is determined whether to continue the measurement (step B32). The measurement ends, for example, when the number of measurements reaches a predetermined count, when a termination command arrives from outside, or when a certain time has elapsed.

When the measurement continues and no HW underutilization state is observed in step A35, execution is repeated from step B32. When an HW underutilization state is observed in step A35, the HW resource utilization process number comparison unit 110 increments by 1 the variable C stored in the unutilized event occurrence count storage unit 322 (step B33).

Then, through the loop of steps A36 and A37, the number of waiting processes measured this time is added to the waiting process count QSi for each SW resource Si, and the SW resource queue length storage unit 318 is updated. Once the waiting process counts have been added, execution is again repeated from step B32.

When the measurement ends in step B32, the average value calculation unit 320 first compares the variable C stored in the unutilized event occurrence count storage unit 322 with 0 to determine whether an HW underutilization state has occurred (step B34).

If an HW underutilization state has occurred (C ≠ 0), the waiting process count QSi for each SW resource Si holds the total over one or more measurements. Each QSi is therefore divided by the number of occurrences C of the HW underutilization state to obtain the average number of waiting processes for each SW resource Si (step B35).

If no HW underutilization state has occurred, the processing ends. The fact that none occurred may also be passed to the output unit 314 and output to the output device 106.
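The accumulation of QSi and C over repeated measurements, followed by the division in step B35, can be sketched as below. The input format (a list of `(processes_using_hw, waiters_by_resource)` pairs) and the function name are assumptions made for illustration; the patent does not prescribe a data layout.

```python
from collections import Counter

def average_queue_lengths(samples, hw_resource_count):
    """Average waiting process counts over underutilized measurements only.

    samples: iterable of (processes_using_hw, {sw_resource: waiters}) pairs,
             one pair per measurement (hypothetical layout).
    Returns {sw_resource: average waiters}, or None when the HW
    underutilization state never occurred (C == 0).
    """
    totals = Counter()  # plays the role of QSi for each SW resource Si
    c = 0               # number of HW underutilization occurrences (C)
    for using_hw, waiters in samples:
        if using_hw >= hw_resource_count:
            continue            # HW fully used: skip this measurement
        c += 1
        totals.update(waiters)  # add this measurement's counts to QSi
    if c == 0:
        return None             # avoids dividing by zero
    return {res: total / c for res, total in totals.items()}

# Three hypothetical measurements on a 4-CPU system; the second one
# is not underutilized and is therefore ignored.
samples = [
    (1, {"lock_a": 2}),
    (4, {"lock_a": 10}),
    (2, {"lock_a": 4, "lock_b": 1}),
]
averages = average_queue_lengths(samples, hw_resource_count=4)
```

Note that a resource absent from one measurement still has its total divided by C, which matches dividing every QSi by the same occurrence count.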

Finally, the output unit 314 outputs the average number of waiting processes for each SW resource Si to the output device 106. At this point, as the filtering unit of the second embodiment (220 in FIG. 3) does, the result of extraction or sorting based on the average number of waiting processes may be output instead.

As described above, this embodiment is configured to perform multiple measurements and calculate the average number of waiting processes for each SW resource. This reduces the influence of exceptional states that happened to occur only at the instant of measurement, so the SW resource that is the performance bottleneck can be identified more accurately.

(Fourth Embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to a fourth embodiment of the present invention will be described with reference to the drawings. FIG. 7 is a block diagram showing the configuration of a performance bottleneck analysis system 400 for a computer system according to the fourth embodiment of the present invention.

As shown in FIG. 7, the performance bottleneck analysis system 400 according to the fourth embodiment of the present invention differs from the performance bottleneck analysis system 100 according to the first embodiment shown in FIG. 1 in that the processing device 402 includes a measurement required time acquisition unit 422 and an HW bottleneck determination unit 420.

The measurement required time acquisition unit 422 acquires the start time and end time of the measurement performed by the process state measuring means 108, calculates the time required for the measurement, and provides it to the HW bottleneck determination unit 420.

The HW bottleneck determination unit 420 receives the measurement required time from the measurement required time acquisition unit 422 and determines whether the HW resources are a bottleneck by comparing the measurement required time with a threshold. If the HW resources are not a bottleneck, it receives the number of waiting processes for each SW resource from the SW resource queue length storage unit 418 and provides it to the output unit 414.

Next, the overall operation of the performance bottleneck analysis system 400 of this embodiment will be described in detail with reference to FIGS. 7 and 8. FIG. 8 is a flowchart showing the operation of the performance bottleneck analysis system 400 for a computer system according to this embodiment.

The operations of the process state measuring means 108, the HW resource utilization process number comparison unit 110, and the SW resource waiting process count unit 112 in this embodiment, shown as steps A41 to A47 in FIG. 8, are the same as those of the corresponding parts in the first embodiment, so their description is omitted.

In the first embodiment, the number of processes using HW resources is counted (step A14) immediately after the process state is measured (step A13). In this embodiment, the measurement required time acquisition unit 422 works together with the measurement operation of the process state measuring means 108 to calculate the time the measurement took.

Next, the HW bottleneck determination unit 420 determines whether the measurement required time calculated by the measurement required time acquisition unit 422 falls within a certain range (step C41 in FIG. 8).

If the measurement was completed within the given time, it is determined that the bottleneck is not on the HW resource side (step C42). The threshold used here is supplied from outside, based on data from normal operation or test operation of the system. The subsequent operations of each unit in steps A44 to A47 are the same as in the first embodiment.

If the measurement took the given time or longer, the HW bottleneck determination unit 420 determines that the bottleneck is on the HW resource side (step C43).

This determination is possible because measuring the process state itself uses HW resources such as the CPU. For example, when a large number of processes are using HW resources on the external system 120, issuing a ps command to measure the process state does not help: the ps command is not scheduled promptly and its response is greatly delayed, so the process state cannot be measured immediately. In such a case, the HW resources can be judged to be the bottleneck.
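The timing-based judgment in steps C41 to C43 can be sketched as follows. The helper names, the injectable `clock` parameter, and the single upper threshold are assumptions for illustration; the patent only states that the threshold is supplied externally. The `measure` callable stands in for whatever actually collects the process state (for example, running a ps command), which is not shown here.

```python
import time

def timed_measurement(measure, clock=time.perf_counter):
    """Run one process-state measurement and return (result, elapsed seconds).

    measure: zero-argument callable that performs the measurement.
    clock:   time source; injectable so the sketch can be tested
             without real delays.
    """
    start = clock()
    result = measure()
    end = clock()
    return result, end - start

def hw_is_bottleneck(elapsed_seconds, threshold_seconds):
    """Judge the HW side to be the bottleneck when the measurement itself
    took the threshold time or longer."""
    return elapsed_seconds >= threshold_seconds

# Usage with a fake clock that advances 0.5 s during the measurement.
ticks = iter([10.0, 10.5])
state, elapsed = timed_measurement(lambda: "snapshot", clock=lambda: next(ticks))
slow = hw_is_bottleneck(elapsed, threshold_seconds=2.0)
```

Using a monotonic clock such as `time.perf_counter` avoids misjudgments caused by wall-clock adjustments during the measurement.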

Finally, if the HW resources are the bottleneck, the output unit 414 outputs that fact to the output device 106. If they are not, then, as in the first embodiment, it outputs the number of waiting processes for each SW resource to the output device 106, depending on whether an HW underutilization state was observed.

As described above, this embodiment relies on the fact that measuring the process state itself uses HW resources and is configured to calculate the time required for that measurement. It can therefore determine that the performance bottleneck is on the HW resource side.

(Fifth Embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to a fifth embodiment of the present invention will be described with reference to the drawings. FIG. 9 is a block diagram showing the configuration of a performance bottleneck analysis system 500 for a computer system according to the fifth embodiment of the present invention.

As shown in FIG. 9, the fifth embodiment of the present invention differs from the performance bottleneck analysis system 300 according to the third embodiment shown in FIG. 5 in that the processing device 502 includes a measurement required time acquisition unit 522 and an HW bottleneck determination unit 524, and the storage device 504 includes a measurement required time and measurement count storage unit 526. The measurement required time and measurement count storage unit 526 stores the time the process state measuring means 108 required to measure the process state and the number of measurements.

Next, the overall operation of the performance bottleneck analysis system 500 of this embodiment will be described in detail with reference to FIGS. 9 and 10. FIG. 10 is a flowchart showing the operation of the performance bottleneck analysis system 500 for a computer system according to this embodiment.

The operations of the process state measuring means 108, the HW resource utilization process number comparison unit 110, the SW resource waiting process count unit 112, and the average value calculation unit 320 in the performance bottleneck analysis system 500 of this embodiment, shown as steps A51 to A57 and B51 to B55 in FIG. 10, are the same as those of the corresponding parts in the third embodiment, so their description is omitted.

In the third embodiment, the time required to measure the process state is not acquired. In this embodiment, the time required for measurement is stored one or more times, and whether the bottleneck is on the HW resource side is determined from the average of those times.

First, after step B51, the measurement count N and the total measurement required time TT are both initialized to 0 (step D51 in FIG. 10). Step D51 may instead be performed before step B51, step A52, or step A51.

Next, when the measurement continues in step B52, the measurement count N is incremented by 1 after the process state is measured in step A53 (step D52). Step D52 may instead be performed before step A53.

Next, the time T required for the current measurement is added to the total measurement required time TT (step D53). The subsequent processing (step A54 onward) is the same as that from step A53 onward of the third embodiment.

When the measurement ends in step B52, the average measurement required time TT/N is calculated (step D54). If it is possible that no measurement was performed at all, N may be checked to be nonzero before the average is calculated, so that division by zero does not occur.

Next, it is determined whether the average measurement required time TT/N falls within a certain range (step D55).

If it falls within the range, it is determined that the bottleneck is not on the HW resource side (step C52). The subsequent operations in steps B54 and B55 are the same as steps B34 and B35 of the third embodiment.

If the average measurement required time TT/N does not fall within the range, it is determined that the bottleneck is on the HW resource side (step C53).
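Steps D51 to D55 and the resulting judgment can be sketched as follows. The function name and the interpretation of "within a certain range" as "below a single threshold" are assumptions for this example; the variable names `n` and `tt` mirror N and TT in the text.

```python
def judge_by_average_time(durations, threshold_seconds):
    """Decide whether HW is the bottleneck from the mean measurement time.

    durations: iterable of per-measurement times T, in seconds.
    Returns True (bottleneck on the HW side), False (not on the HW side),
    or None when no measurement was made at all.
    """
    n = 0     # measurement count N (step D51)
    tt = 0.0  # total measurement required time TT (step D51)
    for t in durations:
        n += 1   # step D52
        tt += t  # step D53
    if n == 0:
        return None  # guard against division by zero before step D54
    average = tt / n  # step D54
    # Step D55: the average counts as "within range" when it is
    # below the threshold (assumed interpretation).
    return average >= threshold_seconds

verdict_fast = judge_by_average_time([0.25, 0.75], threshold_seconds=1.0)
verdict_slow = judge_by_average_time([2.0, 4.0], threshold_seconds=1.0)
```

Averaging means a single slow measurement among many fast ones does not by itself trigger an HW bottleneck verdict.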

Finally, if the HW resources are the bottleneck, the output unit 514 outputs that fact to the output device 106. If they are not, then, as in the first embodiment, it outputs the number of waiting processes for each SW resource to the output device 106, depending on whether an HW underutilization state was observed.

As described above, this embodiment is configured to measure the time required for process state measurement multiple times and to calculate the average, so it can determine more accurately that the performance bottleneck is on the HW resource side.

(Sixth Embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to a sixth embodiment of the present invention will be described with reference to the drawings. FIG. 11 is a block diagram showing the configuration of a performance bottleneck analysis system 600 for a computer system according to the sixth embodiment of the present invention.

Referring to FIG. 11, the performance bottleneck analysis system 600 according to the sixth embodiment of the present invention differs from the performance bottleneck analysis system 400 according to the fourth embodiment shown in FIG. 7 in that it includes an HW configuration changing means 640.

The HW configuration changing means 640 receives, from the HW bottleneck determination unit 620 of the processing device 602, an indication of whether the bottleneck is in the HW resources or the SW resources, changes the number of HW resources of the external system 120, and reflects the change in the HW resource number storage unit 116 of the storage device 604.

Next, the overall operation of the performance bottleneck analysis system 600 of this embodiment will be described in detail with reference to FIGS. 11 and 12. FIG. 12 is a flowchart showing the operation of the performance bottleneck analysis system 600 for a computer system according to this embodiment.

The operations of the process state measuring means 108, the HW resource utilization process number comparison unit 110, the SW resource waiting process count unit 112, the measurement required time acquisition unit 422, and the HW bottleneck determination unit 420 in the performance bottleneck analysis system 600 of this embodiment, shown as steps A61 to A67 and steps C61 to C63 in FIG. 12, are the same as those of the corresponding parts in the fourth embodiment, so their description is omitted.

In the fourth embodiment, processing ends when it is determined in step C43 that the bottleneck is on the HW resource side. In this embodiment, a change of the HW configuration is attempted instead (step E61 in FIG. 12).

Specifically, when the bottleneck is on the HW resource side, the HW resources are considered insufficient and their number is increased. Such an addition of HW resources can be realized, for example, by changing OS or BIOS parameters on the external system 120 and then restarting it. In a system that consists of a set of computation nodes and can dynamically change the number of nodes in use, it is sufficient to issue a command to add computation nodes.

Whether the HW configuration can be changed is decided externally: the result of that decision is input from outside according to, for example, the status of the external system 120. If the HW configuration has been changed, the number of HW resources is set again (step A61) and the processing from step A63 onward is repeated. If the HW configuration is not changed, the processing ends.
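The retry loop of step E61 can be sketched as below. Everything named here is hypothetical: `is_hw_bottleneck` stands in for a full measurement and judgment pass, `may_change` stands in for the externally supplied decision, adding one resource per round is an assumed policy, and `max_rounds` is a safety limit that the patent does not mention.

```python
def tune_hw_resources(hw_count, is_hw_bottleneck, may_change, max_rounds=10):
    """Add HW resources while the bottleneck stays on the HW side.

    hw_count:         current number of HW resources.
    is_hw_bottleneck: callable(hw_count) -> bool; one analysis pass.
    may_change:       callable(hw_count) -> bool; external permission to
                      change the HW configuration.
    Returns the final number of HW resources.
    """
    for _ in range(max_rounds):
        if not is_hw_bottleneck(hw_count):
            break          # bottleneck is not on the HW side: stop
        if not may_change(hw_count):
            break          # configuration cannot be changed: stop
        hw_count += 1      # change the HW configuration, then re-analyze
    return hw_count

# Hypothetical system that stops being HW-bound at 4 resources.
final_count = tune_hw_resources(
    2,
    is_hw_bottleneck=lambda n: n < 4,
    may_change=lambda n: True,
)
```

In a real deployment the increment would be replaced by whatever mechanism actually adds resources, such as a reconfiguration followed by a restart or a node addition command.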

Although not shown in the flowchart of FIG. 12, when it is determined that the bottleneck is not on the HW resource side, the HW resources in the system may be considered to be in surplus, and HW resources may be reduced in any step from step C62 onward.

Finally, the output unit 614 outputs to the output device 106 the location of the bottleneck and, if the HW configuration has been changed by the HW configuration changing means 640, the result of that change. If the bottleneck is not on the HW resource side, it outputs the number of waiting processes for each SW resource to the output device 106, depending on whether an HW underutilization state was observed.

As described above, this embodiment is configured so that the number of HW resources can be increased or decreased according to the location of the bottleneck, and the system can therefore be operated with an optimal number of HW resources.

(Seventh Embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to a seventh embodiment of the present invention will be described with reference to the drawings. FIG. 13 is a block diagram showing the configuration of a performance bottleneck analysis system 700 for a computer system according to the seventh embodiment of the present invention.

Referring to FIG. 13, the performance bottleneck analysis system 700 according to the seventh embodiment of the present invention differs from the performance bottleneck analysis system 500 according to the fifth embodiment shown in FIG. 9 in that it includes an HW configuration changing means 740 and the processing device 702 includes a storage initialization unit 730.

Like the HW configuration changing means 640 in the sixth embodiment, the HW configuration changing means 740 receives, from the HW bottleneck determination unit 724 of the processing device 702, an indication of whether the bottleneck is in the HW resources or the SW resources, changes the number of HW resources of the external system 120, and reflects the change in the HW resource number storage unit 116 of the storage device 704.

The storage initialization unit 730 resets to their initial state the contents of the components of the storage device 704: the measurement required time and measurement count storage unit 726, the HW resource number storage unit 116, the unutilized event occurrence count storage unit 722, and the SW resource queue length storage unit 718.

Next, the overall operation of the performance bottleneck analysis system 700 of this embodiment will be described in detail with reference to FIGS. 13 and 14. FIG. 14 is a flowchart showing the operation of the performance bottleneck analysis system 700 for a computer system according to this embodiment.

The operations of the process state measuring means 108, the HW resource utilization process number comparison unit 110, the SW resource waiting process count unit 112, the average value calculation unit 320, the measurement required time acquisition unit 522, and the HW bottleneck determination unit 724 in the performance bottleneck analysis system 700 of this embodiment, shown as steps A71 to A77, B71 to B75, C72 to C73, and D71 to D75 in FIG. 14, are the same as those of the corresponding parts in the fifth embodiment, so their description is omitted.

In the fifth embodiment, processing ends when it is determined in step C53 that the bottleneck is on the HW resource side. In this embodiment, as in the sixth embodiment, a change of the HW configuration is attempted instead (step E71 in FIG. 14).

When the HW configuration has been changed, the analysis is restarted from the beginning. To that end, the storage initialization unit 730 of the processing device 702 re-executes the initialization processing of steps A71, A72, B71, and D71, resetting the contents of the HW resource number storage unit 116, the SW resource queue length storage unit 718, the unutilized event occurrence count storage unit 722, and the measurement required time and measurement count storage unit 726 of the storage device 704. After the initialization, the processing from step B72 onward is repeated.

Although not shown in the flowchart of FIG. 14, if step C72 determines that the bottleneck is not on the HW resource side, the HW resources in the system can be regarded as being in surplus, and the HW resources may be reduced in any of the steps following step C72.

Finally, the output unit 714 outputs to the output device 106 the location of the bottleneck and, if the HW configuration has been changed by the HW configuration changing means 740, the result of that change. If the bottleneck is not on the HW resource side, it outputs the number of waiting processes for each SW resource to the output device 106, depending on whether an HW underutilization situation has been observed.

As described above, this embodiment is configured so that the number of HW resources can be increased or decreased according to the location of the bottleneck, so the system can be operated with the optimal number of HW resources. In addition, because the time required to measure the process state is measured a plurality of times and its average value is calculated, when the HW resources are the bottleneck, it can be determined more accurately that the performance bottleneck is on the HW resource side.

(Eighth embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to an eighth embodiment of the present invention will be described with reference to the drawings. FIG. 15 is a block diagram showing the configuration of a performance bottleneck analysis system 800 for a computer system according to the eighth embodiment of the present invention.

図１５に示すように、本実施形態の性能ボトルネック解析システム８００は、処理装置８０２、記憶装置８０４、及び出力装置１０６を具備する。 As shown in FIG. 15, the performance bottleneck analysis system 800 of this embodiment includes a processing device 802, a storage device 804, and an output device 106.

The processing device 802 in this embodiment includes an SW resource identifier acquisition unit 806, an SW resource wait time measurement unit 808, and an output unit 810. The storage device 804 includes an SW resource identifier storage unit 812 and an SW resource wait time storage unit 814.

上記のＳＷ資源識別子取得手段８０６は、外部システム１２０の上で動作するプロセスが利用している各ＳＷ資源の識別子情報を取得して、かかる識別子情報をＳＷ資源識別子記憶部８１２に記憶する。 The SW resource identifier acquisition unit 806 acquires identifier information of each SW resource used by a process operating on the external system 120, and stores the identifier information in the SW resource identifier storage unit 812.

The SW resource wait time measurement unit 808 reads the SW resource identifiers from the SW resource identifier storage unit 812, newly submits to the external system 120 a process that waits to acquire each SW resource, measures the time required to acquire each SW resource, and stores it in the SW resource wait time storage unit 814 provided in the storage device 804.

出力部８１０は、記憶装置８０４に備わるＳＷ資源待ち時間記憶部８１４から各ＳＷ資源の取得に要した時間を入力し、これを出力装置１０６に出力する。 The output unit 810 inputs the time required to acquire each SW resource from the SW resource wait time storage unit 814 provided in the storage device 804, and outputs this to the output device 106.

次に、本実施形態の性能ボトルネック解析システム８００の全体の動作について、図１５及び図１６を参照しながら詳細に説明する。図１６は、本実施形態によるコンピュータシステムの性能ボトルネック解析システム８００の動作を示すフローチャートである。 Next, the overall operation of the performance bottleneck analysis system 800 of this embodiment will be described in detail with reference to FIGS. 15 and 16. FIG. 16 is a flowchart showing the operation of the performance bottleneck analysis system 800 of the computer system according to this embodiment.

まず、ＳＷ資源識別子取得手段８０６は、外部システム１２０の上で動作するプロセスがＯＳに対して発行しているシステムコールを調べて、システムコールの引数として指定されたＳＷ資源の識別子情報を取得し、記憶装置８０４に備わるＳＷ資源識別子記憶部８１２に記憶させる（図１６のステップＦ８１）。 First, the SW resource identifier acquisition unit 806 checks a system call issued to the OS by a process operating on the external system 120, and acquires identifier information of the SW resource specified as an argument of the system call. Then, the data is stored in the SW resource identifier storage unit 812 included in the storage device 804 (step F81 in FIG. 16).

The argument information of the above system calls can be obtained from the program's source code when the source code is available and the identifiers are written directly in the program. In addition, for a process running on a UNIX-like OS such as Linux, the system calls issued by a running process and their arguments at the time of the call can be obtained in real time with a command such as strace or a system call such as ptrace. When the time available for measurement is limited, only the SW resource identifiers obtained within a fixed time are stored in the SW resource identifier storage unit 812.

Next, the SW resource wait time measurement unit 808 reads one or more SW resource identifiers from the SW resource identifier storage unit 812. For each SW resource identifier, it either creates a program that waits for the SW resource or passes the identifier as an argument to a program that has already been created, executes the program on the external system 120, and measures the time taken to actually acquire the SW resource.

The program executed at this point first obtains the current time before waiting for the SW resource, then acquires the SW resource using the SW resource identifier, obtains the current time again after the acquisition, and calculates the time required for the acquisition from the times before and after it. The acquired SW resource is released as soon as possible.
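The measurement program described above can be sketched as follows. This is only an illustrative sketch: a `threading.Lock` stands in for the SW resource, and the function name `measure_acquire_time` is an assumption introduced here, not a name used in the embodiment.

```python
import threading
import time


def measure_acquire_time(resource):
    """Return the seconds spent waiting to acquire a lock-like `resource`."""
    start = time.monotonic()            # time just before waiting for the resource
    resource.acquire()                  # blocks until the resource is obtained
    elapsed = time.monotonic() - start  # time just after acquisition, minus start
    resource.release()                  # release as soon as possible
    return elapsed


if __name__ == "__main__":
    lock = threading.Lock()             # hypothetical stand-in for a semaphore
    lock.acquire()                      # simulate another process holding it
    threading.Timer(0.2, lock.release).start()
    print(f"waited {measure_acquire_time(lock):.3f} s")
```

A monotonic clock is used here so that the difference is not disturbed by wall-clock adjustments; the embodiment itself only requires the current time before and after the acquisition.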

ＳＷ資源待ち時間の計測を、ＳＷ資源識別子取得手段８０６が取得した全てのＳＷ資源識別子に対して行ない（ステップＦ８２及びステップＦ８３）、計測した所要時間をＳＷ資源待ち時間記憶部８１４に記憶する。 SW resource waiting time is measured for all SW resource identifiers acquired by the SW resource identifier acquiring unit 806 (step F82 and step F83), and the measured required time is stored in the SW resource waiting time storage unit 814.

最後に、出力部８１０は、ＳＷ資源待ち時間記憶部８１４から、各ＳＷ資源識別子に対して計測したＳＷ資源待ち時間を入力し、待ち時間を出力装置１０６に出力する。このとき、待ち時間とともに、ＳＷ資源の種類やＳＷ資源識別子を出力してもよい。また、このとき、待ち時間を用いてＳＷ資源識別子をソートまたは一定以上の待ち時間のＳＷ資源識別子だけを抽出して出力してもよい。 Finally, the output unit 810 inputs the SW resource wait time measured for each SW resource identifier from the SW resource wait time storage unit 814, and outputs the wait time to the output device 106. At this time, the SW resource type and SW resource identifier may be output together with the waiting time. At this time, the SW resource identifiers may be sorted using the waiting time, or only the SW resource identifiers having a certain waiting time or more may be extracted and output.
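The optional sorting and threshold extraction of the output step might look like the sketch below. The function name `rank_wait_times`, the parameter `min_wait`, and the sample identifiers and times are all hypothetical values chosen for illustration.

```python
def rank_wait_times(wait_times, min_wait=0.0):
    """Sort SW resource identifiers by wait time, longest first.

    wait_times: dict mapping SW resource identifier -> measured wait in seconds.
    min_wait:   identifiers that waited less than this are dropped.
    """
    selected = [(rid, t) for rid, t in wait_times.items() if t >= min_wait]
    return sorted(selected, key=lambda item: item[1], reverse=True)


# Hypothetical measurements for three semaphore IDs.
measured = {8: 0.42, 11: 0.03, 15: 1.27}
for resource_id, seconds in rank_wait_times(measured, min_wait=0.1):
    print(f"semaphore {resource_id}: {seconds:.2f} s")
```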

As described above, this embodiment is configured so that the time required to acquire the SW resource can be measured for each SW resource identifier, so the SW resources that are performance bottlenecks, and their identifiers, can be identified.

(Ninth embodiment)
Next, the configuration of a performance bottleneck analysis system for a computer system according to a ninth embodiment of the present invention will be described with reference to the drawings. FIG. 17 is a block diagram showing the configuration of a performance bottleneck analysis system 900 for a computer system according to the ninth embodiment of the present invention.

As shown in FIG. 17, the performance bottleneck analysis system 900 of this embodiment differs from the performance bottleneck analysis system 800 according to the eighth embodiment shown in FIG. 15 in that the processing device 902 includes a process state measurement unit 916 and an HW resource utilization process number comparison unit 918, and in that the storage device 904 includes an HW resource number storage unit 920.

プロセス状態計測手段９１６と、ＨＷ資源利用プロセス数比較部９１８の動作は、第１実施形態で示した対応する手段及び処理部と同様である。 The operations of the process state measurement unit 916 and the HW resource utilization process number comparison unit 918 are the same as the corresponding units and processing units described in the first embodiment.

次に、本実施形態の性能ボトルネック解析システム９００の全体の動作について、図１７及び図１８を参照しながら詳細に説明する。図１８は、本実施形態によるコンピュータシステムの性能ボトルネック解析システム９００の動作を示すフローチャートである。 Next, the overall operation of the performance bottleneck analysis system 900 of this embodiment will be described in detail with reference to FIGS. 17 and 18. FIG. 18 is a flowchart showing the operation of the performance bottleneck analysis system 900 of the computer system according to this embodiment.

図１８のステップＡ９１及びステップＡ９３〜Ａ９５で示される本実施形態におけるプロセス状態計測手段９１６、ＨＷ資源利用プロセス数比較部９１８の動作は、第１実施形態の対応する部分の動作と同一のため、説明を省略する。 The operations of the process state measuring unit 916 and the HW resource utilization process number comparison unit 918 in the present embodiment shown in Step A91 and Steps A93 to A95 in FIG. 18 are the same as the operations of the corresponding parts in the first embodiment. Description is omitted.

In the first embodiment, when an HW underutilization situation was observed in step A15, the number of dormant processes waiting for each SW resource was counted. In this embodiment, the SW resource identifier acquisition unit 906 instead examines, for each process in the SW resource waiting state, the system call it is currently issuing, acquires the SW resource identifier information given as an argument, and stores it in the SW resource identifier storage unit 812 (step A96 and step G91 in FIG. 18).

The system call and arguments issued when a process transitions to the SW resource waiting state can be obtained, for example on a UNIX-like OS such as Linux, with a command such as strace or a system call such as ptrace.

全てのＳＷ資源待ち状態のプロセスに対して、ＳＷ資源識別子情報を取得し終わると、次にＳＷ資源待ち時間計測部８０８は、第８実施形態における対応する処理部と同様、ＳＷ資源識別子記憶部８１２から１個以上のＳＷ資源識別子を入力し、各ＳＷ資源識別子に関して、ＳＷ資源待ち時間の計測を行ない、計測した待ち時間をＳＷ資源待ち時間記憶部８１４に記憶する（ステップＦ９２及びステップＦ９３）。 When the SW resource identifier information has been acquired for all the SW resource waiting processes, the SW resource waiting time measurement unit 808 then performs the SW resource identifier storage unit as in the corresponding processing unit in the eighth embodiment. One or more SW resource identifiers are input from 812, SW resource waiting time is measured for each SW resource identifier, and the measured waiting time is stored in the SW resource waiting time storage unit 814 (step F92 and step F93). .

最後に、出力部８１０は、第８実施形態と同様に、ＳＷ資源待ち時間記憶部８１４から、各ＳＷ資源識別子に対して計測したＳＷ資源待ち時間を入力し、出力装置１０６に出力する。このとき、待ち時間とともに、ＳＷ資源の種類やＳＷ資源識別子を出力してもよい。 Finally, as in the eighth embodiment, the output unit 810 inputs the SW resource wait time measured for each SW resource identifier from the SW resource wait time storage unit 814 and outputs the SW resource wait time to the output device 106. At this time, the SW resource type and SW resource identifier may be output together with the waiting time.

このように、本実施形態では、ＳＷ資源待ち状態のプロセスに対してのみＳＷ資源識別子情報を採集し、各ＳＷ資源を取得するのに要する時間を計測できるように構成されているため、性能上のボトルネックとなっているＳＷ資源とその識別子をより早く特定することができる。 As described above, the present embodiment is configured to collect the SW resource identifier information only for the process waiting for the SW resource and measure the time required to acquire each SW resource. It is possible to identify the SW resource that is the bottleneck and the identifier thereof earlier.

次に、以下に記載した具体的な各実施例を用いて本発明を実施するための最良の形態の動作について説明する。 Next, the operation of the best mode for carrying out the present invention will be described using specific embodiments described below.

実施例１は、上述した本発明の第１及び第２実施形態に対応するものである。 Example 1 corresponds to the first and second embodiments of the present invention described above.

ここでは、ＨＷ資源をＣＰＵとし、外部システム１２０の備えるＨＷ資源の数は４つ、つまり４つのＣＰＵを備えているものとする。また、外部システム１２０には、ＵＮＩＸ系ＯＳであるＬｉｎｕｘが搭載されているとする。 Here, it is assumed that the HW resource is a CPU, and the number of HW resources included in the external system 120 is four, that is, four CPUs. Further, it is assumed that Linux, which is a UNIX OS, is installed in the external system 120.

まず、メモリ等の記憶装置１０４内に、外部のシステム１２０の備えるＨＷ資源数である整数値４が設定される。 First, an integer value 4 that is the number of HW resources provided in the external system 120 is set in the storage device 104 such as a memory.

次に、プロセス状態計測手段１０８は、外部システム１２０上でｐｓコマンドを発行し、プロセスの実行状態に関する情報を取得する。ここで、ｐｓコマンドの出力例を図１９に示す。図１９は、ｐｓコマンドをlオプションつきで発行した出力例を示す図である。 Next, the process state measuring unit 108 issues a ps command on the external system 120 and acquires information on the execution state of the process. Here, an output example of the ps command is shown in FIG. FIG. 19 is a diagram illustrating an output example in which the ps command is issued with the l option.

As shown in FIG. 19, each row from the third row onward corresponds to one process. The PID column shows the process ID (identifier), and the COMMAND column shows the name of the process. The STAT column summarizes the process state: R indicates that the process is using the CPU, and S indicates that it is dormant, waiting for an SW resource. The WCHAN column shows the function inside the OS that a dormant process is executing while it waits to acquire an SW resource. For example, it shows semop when the process is waiting to acquire a semaphore, msgrcv when it is waiting for a message to arrive in inter-process communication, and wait4 when it is waiting for a child process to terminate. In other words, the WCHAN column reveals the type of SW resource that a dormant process is waiting for.

Next, from the output of the ps command, the HW resource utilization process number comparison unit 110 counts the processes that are using the CPU, that is, the processes whose STAT is R. All processes could be counted, but the external system 120 generally also runs processes that are not directly related to the processes whose performance is to be analyzed, so here only the performance of a series of processes named abc is considered.

The external system 120 has four CPUs, but only three abc processes are using a CPU. The HW resource utilization process number comparison unit 110 therefore detects that the external system 120 is in an HW underutilization situation, in which there is a CPU that is not being used. At this point, the detection may be passed to the output unit 114 and output to the output device 106.
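The comparison made here can be illustrated with a short sketch. The process list below is a hypothetical, already-parsed stand-in for the ps output of FIG. 19 (which is not reproduced in this text), and the function names are assumptions introduced for illustration.

```python
def count_running(processes, command):
    """Count processes named `command` whose STAT marks them as using a CPU (R)."""
    return sum(
        1
        for p in processes
        if p["command"] == command and p["stat"].startswith("R")
    )


def is_hw_underutilized(processes, command, hw_count):
    """True when fewer target processes are using a CPU than there are CPUs."""
    return count_running(processes, command) < hw_count


# Hypothetical snapshot: 3 running and 6 dormant "abc" processes,
# plus one unrelated running process that must not be counted.
snapshot = (
    [{"command": "abc", "stat": "R"}] * 3
    + [{"command": "abc", "stat": "S"}] * 6
    + [{"command": "sshd", "stat": "R"}]
)
print(is_hw_underutilized(snapshot, "abc", hw_count=4))  # True: 3 < 4
```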

Next, on detection of the HW underutilization situation, the SW resource waiting process count unit 112 counts how many dormant processes are waiting for each SW resource. In the example of FIG. 19, the counts are one process for wait4 (waiting for a child process to terminate), three for semop (waiting for a semaphore), and two for msgrcv (waiting for a message).

最後に出力部１１４は、ディスプレイ等の出力装置１０６に算出結果を出力する。待ちプロセスの多いＳＷ資源が、より重要なボトルネックである。このとき、第２実施形態におけるフィルタリング部２２０を用いて、待ちプロセス数の多い順番でソートを行ない、ｓｅｍｏｐ:３、ｍｓｇｒｃｖ:２、ｗａｉｔ４:１等のように出力することもできる。また、待ちプロセスの数が一定数以上のＳＷ資源のみを出力するようにしてもよい。 Finally, the output unit 114 outputs the calculation result to the output device 106 such as a display. SW resources with many waiting processes are a more important bottleneck. At this time, the filtering unit 220 in the second embodiment may be used to sort in the order of the number of waiting processes and output such as semop: 3, msgrcv: 2, wait4: 1. Alternatively, only SW resources having a certain number of waiting processes may be output.
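The counting, sorting, and threshold filtering described for Example 1 can be sketched as below. The `(STAT, WCHAN)` pairs are hypothetical sample data chosen to reproduce the counts quoted in the text, and `count_waiting` and `min_count` are illustrative names, not part of the embodiment.

```python
from collections import Counter


def count_waiting(processes, min_count=1):
    """Count dormant processes per WCHAN value, most-waited resource first.

    processes: iterable of (stat, wchan) pairs taken from ps output.
    min_count: resources with fewer waiting processes are omitted.
    """
    waiting = Counter(wchan for stat, wchan in processes if stat.startswith("S"))
    return [(res, n) for res, n in waiting.most_common() if n >= min_count]


# Hypothetical sample: 3 running processes and 6 dormant ones.
sample = (
    [("R", "-")] * 3
    + [("S", "wait4")]
    + [("S", "semop")] * 3
    + [("S", "msgrcv")] * 2
)
print(count_waiting(sample))               # [('semop', 3), ('msgrcv', 2), ('wait4', 1)]
print(count_waiting(sample, min_count=2))  # [('semop', 3), ('msgrcv', 2)]
```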

次に、実施例２を用いて、本発明を実施するための最良の形態の動作を説明する。かかる実施例２は、本発明の第３実施形態に対応するものである。 Next, the operation of the best mode for carrying out the present invention will be described using the second embodiment. Example 2 corresponds to the third embodiment of the present invention.

実施例１と同様、ＨＷ未活用状況下での各ＳＷ資源を待つ休眠プロセスがいくつあるかを数えるが、本実施例では、複数回の計測を行なって休眠プロセス数の平均値を求める。 Similar to the first embodiment, the number of dormant processes waiting for each SW resource in the HW unused state is counted. In this embodiment, the average value of the number of dormant processes is obtained by performing a plurality of measurements.

For example, suppose that the measurement is performed five times and that an HW underutilization situation is detected in three of them. At the first detection, wait4: 1, semop: 3, msgrcv: 2 is observed, that is, one process waiting for a child process to terminate, three processes waiting for a semaphore, and two processes waiting for a message. At the second and third detections, wait4: 1, semop: 3, msgrcv: 3 and wait4: 1, semop: 4, msgrcv: 1 are observed, respectively.

The average value calculation unit 320 divides each of the totals of waiting processes for the SW resources stored in the SW resource queue length storage unit 318, that is, wait4: 3, semop: 10, msgrcv: 6, by the value 3 stored in the unutilized event occurrence count storage unit 322, and obtains the average numbers of waiting processes wait4: 1.00, semop: 3.33, msgrcv: 2.00. The average number of waiting processes may be a fractional value. The output unit 314 then outputs the average number of waiting processes obtained for each SW resource to the output device 106.
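The arithmetic of this example can be checked with a small sketch. It accumulates the three observations given in the text and divides by the number of underutilization events; the function name `average_wait_counts` is an assumption made for illustration.

```python
from collections import Counter


def average_wait_counts(observations):
    """Average the per-resource waiting-process counts over all observations."""
    totals = Counter()
    for observed in observations:
        totals.update(observed)   # running total per SW resource
    events = len(observations)    # number of HW underutilization events
    return {resource: totals[resource] / events for resource in totals}


# The three observations quoted in Example 2.
observations = [
    {"wait4": 1, "semop": 3, "msgrcv": 2},
    {"wait4": 1, "semop": 3, "msgrcv": 3},
    {"wait4": 1, "semop": 4, "msgrcv": 1},
]
for resource, average in average_wait_counts(observations).items():
    print(f"{resource}: {average:.2f}")
```

With these inputs the totals are wait4: 3, semop: 10, msgrcv: 6, which gives the averages 1.00, 3.33, and 2.00 stated above.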

次に、実施例３を用いて、本発明を実施するための最良の形態の動作を説明する。かかる実施例３は、本発明の第４及び第５実施形態に対応するものである。 Next, the operation of the best mode for carrying out the present invention will be described using the third embodiment. Example 3 corresponds to the fourth and fifth embodiments of the present invention.

Using the same concrete example as in Example 1, in this example the measurement required time acquisition unit 422 measures the time from when the process state measuring means 108 issues the ps command until it obtains the result of the ps command.

For this measurement, one method is to use a system call that obtains the current time, such as gettimeofday, to obtain the times immediately before and after the ps command is executed and calculate the response time of the command; another is to use a command that reports the response time of a command, such as time.

次に、ＨＷボトルネック判定部４２０は、計測所要時間が一定の範囲内に収まっているかどうかを調べる。計測所要時間が２.３秒であり、許容値が１.５秒であった場合、計測所要時間が許容値を越えているため、ＨＷボトルネック判定部４２０は、ボトルネックがＨＷ資源側、すなわちＣＰＵにあると判断する。この場合、出力部４１４は、ボトルネックがＣＰＵであることを出力装置１０６に出力する。 Next, the HW bottleneck determination unit 420 checks whether the required measurement time is within a certain range. When the measurement required time is 2.3 seconds and the allowable value is 1.5 seconds, the measurement required time exceeds the allowable value, so the HW bottleneck determination unit 420 determines that the bottleneck is on the HW resource side. That is, it is determined that it is in the CPU. In this case, the output unit 414 outputs to the output device 106 that the bottleneck is a CPU.

計測所要時間が許容値を越えていなかった場合は、実施例１と同様、ＨＷ未活用状況の検出を行ない、各ＳＷ資源に対する平均待ちプロセス数を出力する。 If the required measurement time does not exceed the allowable value, the HW unused status is detected as in the first embodiment, and the average number of waiting processes for each SW resource is output.

In the fifth embodiment, the HW bottleneck is determined from the average of the measurement required times over a plurality of measurements. For example, suppose that the process state is measured 100 times at one-minute intervals and the average required time is 1.3 seconds. If the allowable value is 1.5 seconds, it is determined that no bottleneck existed on the HW resource side during this period.
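The threshold test used in Example 3 can be sketched as follows, covering both the single-measurement case of the fourth embodiment and the averaged case of the fifth. The function names are assumptions, and `timed` accepts any callable as a stand-in for running the ps command.

```python
import time
from statistics import mean


def timed(measure):
    """Return the seconds taken by one call of `measure` (e.g. one ps run)."""
    start = time.monotonic()
    measure()
    return time.monotonic() - start


def is_hw_bottleneck(durations, allowable):
    """True when the average measurement time exceeds the allowable value."""
    return mean(durations) > allowable


# Single measurement of 2.3 s against an allowable value of 1.5 s.
print(is_hw_bottleneck([2.3], allowable=1.5))        # True
# 100 measurements averaging 1.3 s against the same allowable value.
print(is_hw_bottleneck([1.3] * 100, allowable=1.5))  # False
```

The reasoning behind the test is that a measurement command which itself takes unusually long to complete suggests that the HW resources are saturated.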

次に、実施例４を用いて、本発明を実施するための最良の形態の動作を説明する。かかる実施例４は、本発明の第６及び第７実施形態に対応するものである。 Next, the operation of the best mode for carrying out the present invention will be described using a fourth embodiment. Example 4 corresponds to the sixth and seventh embodiments of the present invention.

Using the same concrete example as in Examples 1 and 3, in this example, when the HW bottleneck determination unit 620 determines that the bottleneck is in the CPU, the CPUs of the external system are augmented.

For example, when the external system 120 has unused CPUs or CPUs assigned to other services, the number of CPUs assigned to the processes under performance analysis can be increased by means such as rewriting OS parameters or the BIOS and restarting. Alternatively, the external system 120 may be replaced with, or switched to, a higher-performance computer system. For example, a computer system connected to a network can be switched automatically to another computer system by means such as dynamically changing an IP address.

外部システム１２０の切替えや、構成の変更を行なった場合、出力部６１４は、その旨を出力装置１０６に出力する。 When the external system 120 is switched or the configuration is changed, the output unit 614 outputs a message to that effect to the output device 106.

さらに処理を続ける場合は、記憶装置６０４に保持されているＨＷ資源数を、変更後の構成に再設定して、プロセス状態の計測以降の処理を繰り返す。 When the processing is further continued, the number of HW resources held in the storage device 604 is reset to the changed configuration, and the processing after the measurement of the process state is repeated.

In the seventh embodiment, whether the bottleneck is on the HW resource side is determined from the average of the measurement results, as in the fifth embodiment described in Example 3.

次に、実施例５を用いて、本発明を実施するための最良の形態の動作を説明する。かかる実施例５は、本発明の第８実施形態に対応するものである。 Next, the operation of the best mode for carrying out the present invention will be described using a fifth embodiment. Example 5 corresponds to the eighth embodiment of the present invention.

まず、ＳＷ資源識別子取得手段８０６が、外部システム１２０の上で動作するプロセスが利用している各ＳＷ資源の識別子情報を取得する。ＳＷ資源識別子とは、例えばセマフォの場合であれば、各セマフォを識別するためのセマフォＩＤである。ＵＮＩＸ系ＯＳ等の場合、セマフォＩＤは、単なる整数値である。 First, the SW resource identifier acquisition unit 806 acquires identifier information of each SW resource used by a process operating on the external system 120. For example, in the case of a semaphore, the SW resource identifier is a semaphore ID for identifying each semaphore. In the case of a UNIX OS or the like, the semaphore ID is a simple integer value.

In some cases the SW resource identifier can be obtained directly from the program's source code; it can also be obtained for a running process by using the strace command. For example, for a process waiting on a semaphore whose process ID is, say, 246, issuing the command strace -p 246 yields the output semop(8, ...), which shows that the semaphore identifier is 8.
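Extracting the identifier from such a trace line could be sketched as below. The input string is the output quoted in the text; the regular expression and the function name `semaphore_id_from_trace` are assumptions, and real strace output may contain further arguments and formatting not shown here.

```python
import re

# Matches the first (integer) argument of a traced semop call.
_SEMOP_CALL = re.compile(r"\bsemop\(\s*(\d+)\s*,")


def semaphore_id_from_trace(line):
    """Return the semaphore ID passed to semop in `line`, or None if absent."""
    match = _SEMOP_CALL.search(line)
    return int(match.group(1)) if match else None


print(semaphore_id_from_trace("semop(8, ...)"))  # 8
```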

Next, the SW resource wait time measurement unit 808 creates a program that actually acquires the semaphore, using the acquired semaphore ID and semop, the function that acquires a semaphore, and executes it on the external system 120.

取得にかかる時間の計測には、システムコールｇｅｔｔｉｍｅｏｆｄａｙ等を使用して、関数ｓｅｍｏｐの呼出しの直前と直後の時刻から算出する方法や、ｔｉｍｅコマンドを使用する方法等がある。 The measurement of the time taken for acquisition includes a method of calculating from the time immediately before and immediately after calling the function semop using a system call gettimeofday or the like, and a method of using a time command.

異なるセマフォ識別子や、セマフォ以外のＳＷ資源に対しても、同様に計測を行なう。 The same measurement is performed for different semaphore identifiers and SW resources other than semaphores.

最後に出力部８１０は、ディスプレイ等の出力装置１０６に計測結果を出力する。 Finally, the output unit 810 outputs the measurement result to the output device 106 such as a display.

次に、実施例６を用いて、本発明を実施するための最良の形態の動作を説明する。かかる実施例６は、本発明の第９実施形態に対応するものである。 Next, the operation of the best mode for carrying out the present invention will be described using Example 6. Example 6 corresponds to the ninth embodiment of the present invention.

Using the same concrete example as in Examples 1 and 5, in this example the SW resource identifier acquisition unit 906 uses a command such as strace on the dormant processes that the process state measurement unit 916 obtained from the output of the ps command, and acquires the identifiers of the SW resources that those dormant processes are waiting for.

ＳＷ資源待ち時間計測部８０８は、ＨＷ資源利用プロセス数比較部９１８によってＨＷ未活用状況が検出された場合に、実施例５と同様、ＳＷ資源識別子を用いて、ＳＷ資源に対する実際の待ち時間を計測する。 The SW resource wait time measurement unit 808 uses the SW resource identifier to determine the actual wait time for the SW resource when the HW resource utilization process number comparison unit 918 detects the HW unused state, as in the fifth embodiment. measure.

Preferred embodiments of the present invention have been described above with reference to the accompanying drawings, but it goes without saying that the present invention is not limited to these examples. It is clear that a person skilled in the art could conceive of various changes or modifications within the scope of the claims, and these are naturally understood to belong to the technical scope of the present invention.

The present invention is applicable to performance bottleneck analysis systems for computer systems, and in particular to uses such as a performance evaluation system that monitors the state of a server and analyzes the cause of performance degradation, or a development tool for identifying performance bottlenecks during system development. In addition, in a system that provides a plurality of computing nodes, such as a parallel computer or a cluster-type server, and in which the number of computing nodes used by a given user can be adjusted, it is also applicable to uses such as optimizing the number of computing nodes used by the user, for reasons such as usage efficiency or usage-based charging.

FIG. 1 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the first embodiment of the present invention.
FIG. 2 is a flowchart showing the operation of the performance bottleneck analysis system according to the first embodiment.
FIG. 3 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the second embodiment of the present invention.
FIG. 4 is a flowchart showing the operation of the performance bottleneck analysis system according to the second embodiment.
FIG. 5 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the third embodiment of the present invention.
FIG. 6 is a flowchart showing the operation of the performance bottleneck analysis system according to the third embodiment.
FIG. 7 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the fourth embodiment of the present invention.
FIG. 8 is a flowchart showing the operation of the performance bottleneck analysis system according to the fourth embodiment.
FIG. 9 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the fifth embodiment of the present invention.
FIG. 10 is a flowchart showing the operation of the performance bottleneck analysis system according to the fifth embodiment.
FIG. 11 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the sixth embodiment of the present invention.
FIG. 12 is a flowchart showing the operation of the performance bottleneck analysis system according to the sixth embodiment.
FIG. 13 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the seventh embodiment of the present invention.
FIG. 14 is a flowchart showing the operation of the performance bottleneck analysis system according to the seventh embodiment.
FIG. 15 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the eighth embodiment of the present invention.
FIG. 16 is a flowchart showing the operation of the performance bottleneck analysis system according to the eighth embodiment.
FIG. 17 is a block diagram showing the configuration of a performance bottleneck analysis system for a computer system according to the ninth embodiment of the present invention.
FIG. 18 is a flowchart showing the operation of the performance bottleneck analysis system according to the ninth embodiment.
FIG. 19 is a diagram showing an example of the output of the ps command.
FIG. 20 is a block diagram showing the configuration of a conventional performance bottleneck analysis system for a computer system.

Explanation of symbols

100 Performance bottleneck analysis system
102 Processing device
104 Storage device
106 Output device
108 Process state measurement means
110 HW resource utilization process number comparison unit
112 SW resource waiting process count unit
114 Output unit
116 HW resource number storage unit
118 SW resource queue length storage unit
120 System
220 Filtering unit
320 Average value calculation unit
322 Unutilized event occurrence count storage unit
420 Bottleneck determination unit
422 Measurement required time acquisition unit
526 Measurement required time and measurement count storage unit
640 HW configuration change means
730 Storage initialization unit
806 SW resource identifier acquisition means
808 SW resource waiting time measurement unit
812 SW resource identifier storage unit
814 SW resource waiting time storage unit

Claims

1. A performance bottleneck analysis system for a computer system in which a plurality of processes use at least one hardware resource (hereinafter referred to as an HW resource), the system comprising:
process state measuring means for measuring, for each of the plurality of processes, a process state indicating whether the process is in an HW resource use state, in which it is using the HW resource, or in an SW resource waiting state, in which it is waiting to acquire one of a plurality of mutually exclusive software resources (hereinafter referred to as SW resources), and, for each process in the SW resource waiting state, the type of SW resource being waited for;
unutilized state detecting means for detecting an HW resource unutilized state, in which the number of processes in the HW resource use state is less than the number of HW resources;
counting means for counting, for each of the plurality of SW resources, the number of processes waiting for that SW resource (hereinafter referred to as the number of waiting processes) when the HW resource unutilized state is detected; and
output means for outputting the number of waiting processes obtained for each SW resource as a performance bottleneck analysis result.
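
The detection and counting recited in this claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the `ProcessState` record, the function name `count_waiting_processes`, and the resource names `lock_A` and `lock_B` are hypothetical, and the measured process states are supplied as plain data rather than read from a running system.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ProcessState:
    """One measured process (hypothetical record layout)."""
    pid: int
    uses_hw: bool               # True if the process is in the HW resource use state
    waiting_on: Optional[str]   # name of the SW resource waited for, or None


def count_waiting_processes(states, hw_resource_count):
    """Count waiting processes per SW resource, only in the unutilized state.

    Returns None when at least as many processes use the HW resource as
    there are HW resources, because no unutilized state is detected then.
    """
    hw_users = sum(1 for s in states if s.uses_hw)
    if hw_users >= hw_resource_count:
        return None
    return Counter(s.waiting_on for s in states if s.waiting_on is not None)


# Assumed snapshot: two HW resources, one in use, three processes blocked.
snapshot = [
    ProcessState(1, True, None),
    ProcessState(2, False, "lock_A"),
    ProcessState(3, False, "lock_A"),
    ProcessState(4, False, "lock_B"),
]
result = count_waiting_processes(snapshot, hw_resource_count=2)
print(dict(result))  # {'lock_A': 2, 'lock_B': 1}
```

In this sketch the SW resource with the largest count is the most likely bottleneck candidate, since processes wait on it while an HW resource sits idle.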

2. The performance bottleneck analysis system according to claim 1, wherein the process state measuring means measures the process state a plurality of times,
the system further comprising average value calculating means for calculating, from the numbers of processes counted by the counting means, an average number of processes as the number of waiting processes.
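
As a rough illustration of the averaging in this claim, the sketch below averages per-SW-resource counts over several measurements. The divisor is an assumption: here it is the number of measurements in which the unutilized state was detected, which the claim text does not spell out. The function name and sample data are hypothetical.

```python
from collections import defaultdict


def average_waiting_processes(per_measurement_counts):
    """Average per-SW-resource waiting counts over several measurements.

    per_measurement_counts holds one dict (SW resource name -> count) per
    measurement in which the HW resource unutilized state was detected.
    A resource missing from a dict counts as zero for that measurement.
    """
    if not per_measurement_counts:
        return {}
    totals = defaultdict(int)
    for counts in per_measurement_counts:
        for resource, n in counts.items():
            totals[resource] += n
    k = len(per_measurement_counts)  # assumed divisor, see lead-in
    return {resource: total / k for resource, total in totals.items()}


samples = [{"lock_A": 2, "lock_B": 1}, {"lock_A": 4}, {"lock_A": 3, "lock_B": 2}]
averages = average_waiting_processes(samples)
print(averages)  # {'lock_A': 3.0, 'lock_B': 1.0}
```

Averaging over repeated measurements smooths out a single snapshot that happens to catch an unrepresentative moment.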

3. The performance bottleneck analysis system according to claim 1, further comprising:
measurement required time obtaining means for calculating the time that the process state measuring means required for the measurement; and
HW bottleneck determination means for comparing the time required for the measurement with a predetermined threshold to determine whether the bottleneck is on the HW resource side,
wherein, if the time required for the measurement exceeds the predetermined threshold, the HW bottleneck determination means determines that the bottleneck is on the HW resource side, and in that case the output means outputs that the HW resource is the bottleneck, and
if the time required for the measurement is within the predetermined threshold, the HW bottleneck determination means determines that the bottleneck is not on the HW resource side, and in that case the counting means counts the number of waiting processes.
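
The threshold decision in this claim might be sketched as follows. This is an assumption-laden simplification: the measurement time is passed in as a number instead of being timed, the unutilized state check of claim 1 is omitted, and the function name, dictionary keys, and threshold values are all hypothetical.

```python
from collections import Counter


def analyze(waiting_resources, measurement_seconds, threshold_seconds):
    """Decide between an HW-side bottleneck and SW resource counting.

    waiting_resources lists one SW resource name per waiting process.
    If the measurement took longer than the threshold, the HW resource is
    reported as the bottleneck and no counting is done; otherwise the
    waiting processes are counted per SW resource.
    """
    if measurement_seconds > threshold_seconds:
        return {"hw_bottleneck": True, "waiting_counts": None}
    return {
        "hw_bottleneck": False,
        "waiting_counts": dict(Counter(waiting_resources)),
    }


slow = analyze(["lock_A"], measurement_seconds=0.8, threshold_seconds=0.5)
fast = analyze(["lock_A", "lock_A", "lock_B"],
               measurement_seconds=0.1, threshold_seconds=0.5)
print(slow["hw_bottleneck"], fast["waiting_counts"])
```

A measurement time exactly equal to the threshold is treated here as "within" the threshold, matching the claim's wording of exceeding versus being within.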

4. The performance bottleneck analysis system according to claim 1, further comprising:
measurement required time obtaining means for calculating the measurement required time for a plurality of measurements performed by the process state measuring means;
first average value calculating means for calculating an average value of the measurement required time over the plurality of measurements;
second average value calculating means for calculating, from the numbers of processes counted by the counting means, an average number of processes as the number of waiting processes; and
HW bottleneck determination means for comparing the average measurement required time with a predetermined threshold to determine whether the bottleneck is on the HW resource side,
wherein, if the average measurement required time exceeds the predetermined threshold, the HW bottleneck determination means determines that the bottleneck is on the HW resource side, and in that case the output means outputs that the HW resource is the bottleneck, and
if the average measurement required time is within the predetermined threshold, the HW bottleneck determination means determines that the bottleneck is not on the HW resource side, and in that case the second average value calculating means calculates the average number of processes as the number of waiting processes.

5. The performance bottleneck analysis system, further comprising HW configuration change means for changing the number of HW resources of the computer system when the HW bottleneck determination means determines that the bottleneck is on the HW resource side.

6. The performance bottleneck analysis system according to claim 1, further comprising a filtering unit that sorts and/or extracts the numbers of waiting processes based on a predetermined criterion.
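
One possible reading of the filtering in this claim is sketched below. The claim leaves the "predetermined criterion" open, so both criteria used here (a minimum count and a top-N cutoff) are assumptions, as are the function name and parameters.

```python
def filter_waiting_counts(counts, min_count=0.0, top_n=None):
    """Sort SW resources by waiting count and extract the significant ones.

    counts maps SW resource name -> number of waiting processes.
    Resources are sorted in descending order of count, entries below
    min_count are dropped, and at most top_n entries are kept.
    """
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    kept = [(name, n) for name, n in ranked if n >= min_count]
    return kept if top_n is None else kept[:top_n]


report = filter_waiting_counts(
    {"lock_B": 1.0, "lock_A": 3.0, "lock_C": 0.2}, min_count=1.0
)
print(report)  # [('lock_A', 3.0), ('lock_B', 1.0)]
```

Sorting puts the most contended SW resource first, so the output reads as a ranked list of bottleneck candidates.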

7. A performance bottleneck analysis method for a computer system in which a plurality of processes use at least one hardware resource (hereinafter referred to as an HW resource), the method comprising:
a process state measurement step of measuring, for each of the plurality of processes, a process state indicating whether the process is in an HW resource use state, in which it is using the HW resource, or in an SW resource waiting state, in which it is waiting to acquire one of a plurality of mutually exclusive software resources (hereinafter referred to as SW resources), and, for each process in the SW resource waiting state, the type of SW resource being waited for;
an unutilized state detection step of detecting an HW resource unutilized state, in which the number of processes in the HW resource use state is less than the number of HW resources;
a counting step of counting, for each of the plurality of SW resources, the number of processes waiting for that SW resource (hereinafter referred to as the number of waiting processes) when the HW resource unutilized state is detected; and
an output step of outputting the number of waiting processes obtained for each SW resource as a performance bottleneck analysis result.

8. The performance bottleneck analysis method according to claim 7, wherein the process state measurement step measures the process state a plurality of times,
the method further comprising an average value calculation step of calculating, from the numbers of processes counted in the counting step, an average number of processes as the number of waiting processes.

9. The performance bottleneck analysis method according to claim 7, further comprising:
a measurement required time obtaining step of calculating the time required for the measurement of the process state; and
an HW bottleneck determination step of comparing the time required for the measurement with a predetermined threshold to determine whether the bottleneck is on the HW resource side,
wherein, if the time required for the measurement exceeds the predetermined threshold, the HW bottleneck determination step determines that the bottleneck is on the HW resource side, and in that case the output step outputs that the HW resource is the bottleneck, and
if the time required for the measurement is within the predetermined threshold, the HW bottleneck determination step determines that the bottleneck is not on the HW resource side, and in that case the counting step counts the number of waiting processes.

10. The performance bottleneck analysis method according to claim 7, further comprising:
a measurement required time obtaining step of calculating the measurement required time for a plurality of measurements performed in the process state measurement step;
a first average value calculating step of calculating an average value of the measurement required time over the plurality of measurements;
a second average value calculating step of calculating, from the numbers of processes counted in the counting step, an average number of processes as the number of waiting processes; and
an HW bottleneck determination step of comparing the average measurement required time with a predetermined threshold to determine whether the bottleneck is on the HW resource side,
wherein, if the average measurement required time exceeds the predetermined threshold, the HW bottleneck determination step determines that the bottleneck is on the HW resource side, and in that case the output step outputs that the HW resource is the bottleneck, and
if the average measurement required time is within the predetermined threshold, the HW bottleneck determination step determines that the bottleneck is not on the HW resource side, and in that case the second average value calculating step calculates the average number of processes as the number of waiting processes.

11. The performance bottleneck analysis method, further comprising an HW configuration change step of changing the number of HW resources of the computer system when the HW bottleneck determination step determines that the bottleneck is on the HW resource side.

12. The performance bottleneck analysis method according to any one of claims 7 to 11, further comprising a filtering step of sorting and/or extracting the numbers of waiting processes according to a predetermined criterion.