JP4983636B2 - Transaction apparatus, delay fault analysis apparatus, delay fault analysis method and program

Info

Publication number: JP4983636B2
Application number: JP2008030519A
Authority: JP
Inventors: 誠立花
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2008-02-12
Filing date: 2008-02-12
Publication date: 2012-07-25
Anticipated expiration: 2028-02-12
Also published as: JP2009193128A

Description

The present invention relates to a transaction apparatus, a delay fault analysis apparatus, a delay fault analysis method, and a program for analyzing the cause of a transaction delay fault.

The following method is known for detecting a delay fault in a transaction, which consists of a plurality of mutually inseparable processes. A delay fault is a fault in which a transaction takes longer than it normally does.

In this detection method, a threshold time is set for each process constituting a transaction, and when a process that exceeds its threshold is detected, the transaction containing that process is judged to be delayed.

Here, the threshold has been set manually by the administrator of the transaction apparatus. As a result, the transaction apparatus sometimes could not detect the occurrence of a delay fault accurately, or could not detect it at all. For example, if the threshold is not set to an appropriate value, a delay fault cannot be detected accurately, and if the administrator forgets to set the threshold, a delay fault cannot be detected at all.

Patent Document 1 describes a transaction apparatus that can detect the occurrence of a delay fault without requiring the administrator to set a threshold for each process constituting a transaction.

This transaction apparatus first measures the execution time of each transaction. Based on those execution times, it then calculates the average execution time of transactions executed before the most recent unit time (the past average execution time) and the average execution time of transactions executed within the most recent predetermined time (the recent average execution time). The transaction apparatus then compares the two averages and, if the recent average execution time exceeds the past average execution time, judges that a delay fault has occurred in the transaction.

For this reason, the transaction apparatus described in Patent Document 1 can detect the occurrence of a delay fault without the administrator setting a threshold for each process constituting a transaction. This makes it possible to detect the occurrence of a delay fault accurately.

Patent Document 1: JP 2007-249663 A
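
The comparison attributed above to the apparatus of Patent Document 1 can be sketched as follows. This is a minimal illustration and not the cited implementation: the function name `is_delayed`, its parameters, and the rule that no verdict is given when either window has no samples are all assumptions.

```python
def is_delayed(past_total, past_count, recent_total, recent_count):
    """Return True when the recent mean execution time exceeds the past mean.

    past_total / past_count: sum and number of execution times measured
    before the most recent unit time (hypothetical parameter names).
    recent_total / recent_count: the same for the most recent window.
    """
    if past_count == 0 or recent_count == 0:
        # Assumption: without samples on both sides, report no delay.
        return False
    past_mean = past_total / past_count
    recent_mean = recent_total / recent_count
    return recent_mean > past_mean
```

For example, a past mean of 10 time units against a recent mean of 20 would be reported as a delay, while a recent mean of 5 would not.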

Separately from the detection of transaction delay faults, techniques for analyzing the cause of a delay fault that has occurred have been proposed or put into practical use.

The causes of a transaction delay fault include an increase in transaction execution time and stagnation of transaction execution. Stagnation of transaction execution may arise because the number of transaction processing requests exceeds the executable number, that is, the number of transactions that can be executed simultaneously. It may also arise because a permanent fault has occurred in the transaction apparatus and a transaction being executed has stalled. These causes may also occur at the same time.

The transaction apparatus executes a plurality of transactions simultaneously by executing one transaction per processing unit, such as a thread or a process. The number of processing units is the executable number. When a transaction being executed stalls, the stalled transaction occupies its processing unit, and the processing capacity of the transaction apparatus decreases. Specifically, the effective number of transactions that can be executed simultaneously decreases. Consequently, transaction execution can stagnate even when the number of processing requests is smaller than the executable number.
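
The effect of stalled transactions on capacity can be illustrated with a small calculation. The sketch below is only an illustration of the reasoning in the preceding paragraph; the function names and the idea of counting stalled units explicitly are assumptions.

```python
def effective_capacity(executable_count, stalled_count):
    """Processing units still able to take new transactions."""
    return max(executable_count - stalled_count, 0)


def execution_stagnates(pending_requests, executable_count, stalled_count):
    """True when requests outnumber the units that can actually run them."""
    return pending_requests > effective_capacity(executable_count, stalled_count)
```

With four processing units of which three are occupied by stalled transactions, two pending requests are fewer than the executable number (four) yet more than the single unit still available, so execution stagnates.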

The detection method that sets a threshold time for each process constituting a transaction, and the invention described in Patent Document 1, can only detect a delay fault caused by an increase in transaction execution time. Consequently, when a delay fault occurs, there is a problem that it is not possible to analyze whether its cause is an increase in transaction execution time or stagnation of transaction execution.

An object of the present invention is to provide a transaction apparatus, a delay fault analysis apparatus, a delay fault analysis method, and a program that solve the above problem, namely that the cause of a transaction delay fault cannot be analyzed.

A transaction apparatus according to the present invention includes: receiving means for receiving a transaction processing request; holding means for holding the processing request received by the receiving means in a queue; execution means for executing, when it becomes possible to newly execute a transaction, the transaction corresponding to a processing request held in the queue of the holding means; execution control means for measuring an execution start time at which the execution means started executing the transaction; request management means for measuring the number of processing requests held in the queue of the holding means (the queued count); information storage means for storing the execution start time measured by the execution control means and the queued count measured by the request management means; and analysis means for analyzing, based on the information stored in the information storage means, whether the cause of a delay fault of the transaction is an increase in the execution time of the transaction or stagnation of the execution of the transaction.

A delay fault analysis apparatus according to the present invention analyzes the cause of a delay fault of a transaction in a transaction apparatus that receives a transaction processing request, holds the processing request in a queue, and, when it becomes possible to newly execute a transaction, executes the transaction corresponding to a processing request held in the queue. The delay fault analysis apparatus includes: execution control means for measuring an execution start time at which the transaction apparatus started executing the transaction; request management means for measuring the number of processing requests held in the queue of the transaction apparatus (the queued count); information storage means for storing the execution start time measured by the execution control means and the queued count measured by the request management means; and analysis means for analyzing, based on the information stored in the information storage means, whether the cause of the delay fault of the transaction is an increase in the execution time of the transaction or stagnation of the execution of the transaction.

A delay fault analysis method according to the present invention includes: receiving a transaction processing request; holding the received processing request in a queue; executing, when it becomes possible to newly execute a transaction, the transaction corresponding to a processing request held in the queue; measuring an execution start time at which execution of the transaction started; measuring the number of processing requests held in the queue (the queued count); storing the measured execution start time and queued count; and analyzing, based on the stored information, whether the cause of a delay fault of the transaction is an increase in the execution time of the transaction or stagnation of the execution of the transaction.

A program according to the present invention causes a computer to analyze a transaction delay fault. The program causes the computer to execute: a procedure for receiving a transaction processing request; a procedure for holding the received processing request in a queue; a procedure for executing, when it becomes possible to newly execute a transaction, the transaction corresponding to a processing request held in the queue; a procedure for measuring an execution start time at which execution of the transaction started; a procedure for measuring the number of processing requests held in the queue (the queued count); a procedure for storing the measured execution start time and queued count; and a procedure for analyzing, based on the stored information, whether the cause of a delay fault of the transaction is an increase in the execution time of the transaction or stagnation of the execution of the transaction.

According to the present invention, it is possible to analyze the cause of a delay fault.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 is a block diagram showing a transaction system according to a first embodiment of the present invention. In FIG. 1, the transaction system includes a server computer 1 (transaction apparatus), a plurality of client terminals 2, and an operation management terminal 3. The server computer 1 is connected to each of the client terminals 2 and to the operation management terminal 3 via a network N.

The server computer 1 receives a transaction processing request from a client terminal 2, executes the transaction corresponding to that request, and sends the execution result to the client terminal 2 as response information for the request. The server computer 1 also exchanges arbitrary information with the operation management terminal 3.

The transaction system is not limited to the form shown in FIG. 1. It suffices that the server computer 1 receives a processing request and executes the transaction corresponding to that request.

The client terminal 2 sends a transaction processing request to the server computer 1 and receives the corresponding response information from the server computer 1. The client terminal 2 is assumed to perform this processing by having its CPU execute a client application program.

The operation management terminal 3 is used by the administrator of the server computer 1 to control and manage the server computer 1.

The server computer 1 is described in detail below.

In the server computer 1, the CPU executes transactions in accordance with a server application program for executing transactions (hereinafter, a server application). There may be one server application or several. The types of transaction that each server application can execute are defined for that server application.

Each server application has one or more processing units, and the CPU executes transactions independently in each processing unit. A processing unit is, for example, a thread or a process. In this embodiment, there are a plurality of server applications, each server application has a plurality of processing units, and the processing units are threads.

FIG. 2 is a block diagram showing a configuration example of the server computer 1. In FIG. 2, the server computer 1 includes a storage device 1A and an arithmetic device 1B.

The storage device 1A is, for example, a hard disk, and includes a management table storage unit 11 that stores management tables, which are databases used to analyze the cause of a transaction delay fault.

The arithmetic device 1B is a CPU. It reads a program from a recording medium readable by the CPU and executes it, thereby implementing a request reception unit 12, a queue control unit 13, a server application unit 14, a transaction control unit 15, an information management unit 16, a fault analysis unit 17, and a fault handling unit 18. The recording medium may be the storage device 1A or another medium.

The management table storage unit 11, the transaction control unit 15, the information management unit 16, the fault analysis unit 17, and the fault handling unit 18 together constitute a delay fault analysis apparatus that analyzes transaction delay faults.

In addition to the function of analyzing the cause of a transaction delay fault, the arithmetic device 1B has the functions of a general server computer. Because those functions are not directly related to the present invention, their description is omitted. Likewise, in addition to the management tables, the storage device 1A stores information that the arithmetic device 1B needs to implement the functions of a general server computer. Because that information is not directly related to the present invention, its description is omitted.

Each unit of the storage device 1A and the arithmetic device 1B is described in detail below.

The management table storage unit 11 of the storage device 1A is an example of the information storage means. As management tables, it stores a transaction execution time management table, a request reception information management table, and a server application state management table.

In the transaction execution time management table (hereinafter, the execution time management table), the number of times a transaction has been executed and the total of its execution times are associated with each transaction identifier, which identifies a transaction type.

The execution counts are the recent execution count, which is the number of times the transaction was executed within the most recent unit time, and the overall execution count, which is the number of times the transaction was executed before the most recent unit time.

The total execution times are the recent total time, which is the sum of the transaction's execution times within the most recent unit time, and the overall total time, which is the sum of the transaction's execution times before the most recent unit time. The unit time may be fixed or variable.

FIG. 3 is an explanatory diagram showing an example of the execution time management table. In FIG. 3, the execution time management table 100 includes a transaction identifier 101, a transaction execution count (overall) 102, a total transaction execution time (overall) 103, a transaction execution count (recent) 104, and a total transaction execution time (recent) 105.

The transaction identifier 101 is information for identifying the transaction type. The transaction execution count (overall) 102 is an example of the overall execution count. The total transaction execution time (overall) 103 is an example of the overall total time. The transaction execution count (recent) 104 is an example of the recent execution count. The total transaction execution time (recent) 105 is an example of the recent total time.
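
One possible in-memory shape for the execution time management table 100 is sketched below. The field names, the dictionary keyed by transaction identifier, and in particular `close_unit_time`, which folds the recent values into the overall values at a unit-time boundary, are assumptions made for illustration; the text above does not say how the overall columns are updated.

```python
from dataclasses import dataclass


@dataclass
class ExecTimeRow:
    overall_count: int = 0     # transaction execution count (overall) 102
    overall_time: float = 0.0  # total transaction execution time (overall) 103
    recent_count: int = 0      # transaction execution count (recent) 104
    recent_time: float = 0.0   # total transaction execution time (recent) 105


def record_execution(table, transaction_id, elapsed):
    """Add one finished execution to the recent columns of its row."""
    row = table.setdefault(transaction_id, ExecTimeRow())
    row.recent_count += 1
    row.recent_time += elapsed
    return row


def close_unit_time(table):
    """Assumed roll-over: move the recent values into the overall values."""
    for row in table.values():
        row.overall_count += row.recent_count
        row.overall_time += row.recent_time
        row.recent_count = 0
        row.recent_time = 0.0
```

Keeping counts and totals rather than averages lets either average be derived on demand as total divided by count.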

Returning to FIG. 2, in the request reception information management table (hereinafter, the reception information management table), the number of times processing requests have been received and the queued count are associated with each server application identifier, which identifies the server application that can execute the transactions corresponding to those requests.

The reception counts are the recent reception count, which is the number of processing requests the server computer 1 received within the most recent predetermined time, and the average reception count, which is the per-unit-time average of the number of processing requests the server computer 1 received before the most recent predetermined time. The queued count is the number of processing requests held in the queue of the queue control unit 13, described later.

The predetermined time may be the same as or different from the unit time. In this embodiment the two are the same, and the predetermined time is referred to as the unit time.

FIG. 4 is an explanatory diagram showing an example of the reception information management table. In FIG. 4, the reception information management table 200 includes a server application identifier 201, an average request reception count per unit time (overall) 202, a request reception count (recent) 203, and a queued request count 204.

The average request reception count per unit time (overall) 202 is an example of the average reception count. The request reception count (recent) 203 is an example of the recent reception count. The queued request count 204 is an example of the queued count.

Returning to FIG. 2, in the server application state management table (hereinafter, the state management table), a transaction identifier and the execution start time at which the server computer 1 started executing the transaction are associated with each processing identifier, which identifies a processing unit that can execute the transaction. The execution start time of a transaction whose execution has finished is cleared.

A processing identifier consists of the server application identifier of a server application that can execute the transaction and a processing unit identifier that identifies a processing unit of that server application. The processing unit identifier need only be unique within each server application.

FIG. 5 is an explanatory diagram showing an example of the state management table. In FIG. 5, the state management table 300 includes a server application identifier 301, a processing unit identifier 302, a transaction identifier 303, and a transaction start time 304.

The server application identifier 301 identifies a server application. The processing unit identifier 302 identifies a processing unit of the server application identified by the server application identifier 301. The transaction identifier 303 identifies the transaction type. The transaction start time 304 is an example of the execution start time of a transaction.

Next, each unit of the arithmetic device 1B is described in detail.

The request reception unit 12 is an example of the receiving means. The request reception unit 12 receives transaction processing requests from the client terminals 2. A transaction processing request includes a transaction identifier that identifies the type of transaction whose execution is requested.

The queue control unit 13 is an example of the holding means. It places each processing request received by the request reception unit 12 in a queue that the queue control unit 13 itself maintains. The queue control unit 13 maintains one queue per transaction type and places each processing request in the queue for the transaction type that the request specifies.
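
The per-type queues kept by the queue control unit 13 might look like the following sketch. The class name `QueueController`, its method names, and the representation of a request as a dictionary with a `transaction_id` key are hypothetical.

```python
from collections import defaultdict, deque


class QueueController:
    """Holds pending requests in one FIFO queue per transaction type."""

    def __init__(self):
        self._queues = defaultdict(deque)

    def enqueue(self, request):
        # The request is filed under the transaction type it asks for.
        self._queues[request["transaction_id"]].append(request)

    def dequeue(self, transaction_id):
        # Returns the oldest pending request of that type, or None.
        queue = self._queues.get(transaction_id)
        return queue.popleft() if queue else None

    def pending(self, transaction_id):
        # Number of requests currently held for that type.
        return len(self._queues.get(transaction_id, ()))
```

A first-in, first-out order is assumed here; the text only states that requests are held in a queue per transaction type.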

The server application unit 14 is an example of the execution means and is implemented by the server applications.

The server application unit 14 executes transactions and sends response information corresponding to the execution results to the client terminal 2. For each transaction type, the server application unit 14 can execute as many transactions simultaneously as there are processing units in the server application that can execute that type.

The transaction control unit 15 is an example of the execution control means. The transaction control unit 15 controls the execution of transactions by the server application unit 14.

Specifically, the transaction control unit 15 determines whether the server application unit 14 has any processing unit that is not executing a transaction. Hereinafter, a processing unit that is not executing a transaction is called a free thread.

When there is a free thread, the transaction control unit 15 determines that the server application unit 14 can newly execute a transaction. The transaction control unit 15 then takes a processing request from the queue for a transaction type that the free thread can execute and passes it to the server application unit 14. On receiving the processing request, the server application unit 14 executes the transaction corresponding to it.

In this way, when it becomes possible to newly execute a transaction, the server application unit 14 executes the transaction corresponding to a processing request held in the queue of the queue control unit 13.

The transaction control unit 15 also measures, as the execution start time, the time at which the server application unit 14 started executing the transaction.
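
The dispatch behaviour described for the transaction control unit 15 can be sketched as below. The function `dispatch`, the dictionary layout used for threads and queues, and the injectable `clock` are all assumptions chosen to keep the example self-contained; actual execution of the transaction is left out.

```python
import time
from collections import deque


def dispatch(threads, queues, clock=time.time):
    """Hand queued requests to free threads and note when each one starts.

    threads: {thread_id: {"types": [transaction_id, ...], "current": request or None}}
    queues:  {transaction_id: deque of pending requests}
    Returns a list of (thread_id, request, execution_start_time).
    """
    started = []
    for thread_id, thread in threads.items():
        if thread["current"] is not None:
            continue  # busy: not a free thread
        for transaction_id in thread["types"]:
            queue = queues.get(transaction_id)
            if queue:
                request = queue.popleft()
                thread["current"] = request
                # The start time is measured at hand-over.
                started.append((thread_id, request, clock()))
                break
    return started
```

Passing the clock as a parameter is a design choice for testability; a fixed clock makes the recorded start times predictable.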

The information management unit 16 manages the information in the management table storage unit 11.

FIG. 6 is a block diagram showing a configuration example of the information management unit 16. In FIG. 6, the information management unit 16 includes a request management unit 161, an execution time management unit 162, and a state management unit 163.

The request management unit 161 manages the reception information management table in the management table storage unit 11. Specifically, the request management unit 161 performs the following processing.

The request management unit 161 holds correspondence information that indicates the correspondence between a server application identifier, which identifies a server application, and the transaction identifiers that identify the transaction types the server application can execute.

From the correspondence information, the request management unit 161 determines the server application identifier that corresponds to the transaction identifier in a processing request received by the request reception unit 12. For each server application identifier so determined, it measures the number of times the request reception unit 12 has received processing requests and the number of processing requests held in the queue of the queue control unit 13. As reception counts, it measures the recent reception count and the average reception count.

The request management unit 161 stores the measured reception counts and queued count in the reception information management table in the management table storage unit 11 in association with the server application identifier.
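
A minimal sketch of this bookkeeping follows. The class `RequestManager`, its method names, and the assumption that every received request is queued immediately are illustrative only. The average reception count is kept as a field but not updated, because the text above does not specify how it is computed.

```python
from dataclasses import dataclass


@dataclass
class ReceptionRow:
    average_received: float = 0.0  # average request reception count per unit time (overall) 202
    recent_received: int = 0       # request reception count (recent) 203
    queued_requests: int = 0       # queued request count 204


class RequestManager:
    def __init__(self, transaction_to_app):
        # Correspondence information: transaction identifier -> server application identifier.
        self.transaction_to_app = dict(transaction_to_app)
        self.table = {}

    def _row(self, transaction_id):
        app_id = self.transaction_to_app[transaction_id]
        return self.table.setdefault(app_id, ReceptionRow())

    def on_received(self, transaction_id):
        # Assumption: a received request enters the queue at once.
        row = self._row(transaction_id)
        row.recent_received += 1
        row.queued_requests += 1

    def on_dequeued(self, transaction_id):
        # Called when a request leaves the queue to be executed.
        self._row(transaction_id).queued_requests -= 1
```

Rows are keyed by server application identifier, so requests for different transaction types served by the same application accumulate in one row.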

実行時間管理部１６２は、管理表記憶部１１内の実行時間管理表を管理する。具体的には、実行時間管理部１６２は、以下の処理を行う。 The execution time management unit 162 manages the execution time management table in the management table storage unit 11. Specifically, the execution time management unit 162 performs the following processing.

実行時間管理部１６２は、サーバアプリケーション部１４によるトランザクションの実行時間の合計時間と、そのトランザクションの実行回数とを、トランザクション識別子ごとに計測する。ここで、実行時間管理部１６２は、実行時間の合計時間として、全合計時間および直近合計時間を計測し、実行回数として、全実行回数および直近実行回数を計測する。 The execution time management unit 162 measures the total execution time of transactions by the server application unit 14 and the number of executions of the transaction for each transaction identifier. Here, the execution time management unit 162 measures the total total time and the latest total time as the total time of the execution time, and measures the total execution count and the latest execution count as the execution count.

実行時間管理部１６２は、計測した実行時間の合計時間および実行回数を、そのトランザクション識別子と対応付けて管理表記憶部１１内の実行時間管理表に記憶する。 The execution time management unit 162 stores the total measured execution time and the number of executions in the execution time management table in the management table storage unit 11 in association with the transaction identifier.

状態管理部１６３は、管理表記憶部１１内の実行時間管理表を管理する。具体的には、状態管理部１６３は、以下の処理を行う。 The state management unit 163 manages the execution time management table in the management table storage unit 11. Specifically, the state management unit 163 performs the following processing.

状態管理部１６３は、サーバアプリケーション部１４がトランザクションの実行を開始すると、トランザクション制御部１５が計測した実行開始時刻と、そのトランザクションの種別を特定するトランザクション識別子とを、そのトランザクションを実行した処理単位を特定する処理識別子に対応付けて、管理表記憶部１１内の状態管理表に記憶する。 When the server application unit 14 starts executing a transaction, the state management unit 163 stores, in the state management table in the management table storage unit 11, the execution start time measured by the transaction control unit 15 and the transaction identifier that identifies the type of the transaction, in association with the process identifier that identifies the processing unit that executed the transaction.

また、状態管理部１６３は、サーバアプリケーション部１４がトランザクションの実行を終了すると、管理表記憶部１１に記憶された状態管理表から、トランザクションの実行を終了した処理単位を特定する処理識別子に対応するトランザクション識別子および実行開始時刻をクリアする。 In addition, when the server application unit 14 finishes executing a transaction, the state management unit 163 clears, from the state management table stored in the management table storage unit 11, the transaction identifier and the execution start time corresponding to the process identifier that identifies the processing unit that finished executing the transaction.
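
The record-on-start and clear-on-end behaviour of the state management table can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `StateRow` and `StateTable`, the in-memory dictionary, and the use of `None` to represent a cleared entry are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional, Tuple

# Assumption: a process identifier is modelled as
# (server application identifier, processing unit identifier).
ProcessId = Tuple[str, int]


@dataclass
class StateRow:
    transaction_id: Optional[str] = None  # None stands for "cleared"
    start_time: Optional[float] = None    # None stands for "cleared"


class StateTable:
    """Hypothetical in-memory stand-in for the state management table."""

    def __init__(self, process_ids: Iterable[ProcessId]) -> None:
        # One row per processing unit, initially cleared (idle).
        self.rows: Dict[ProcessId, StateRow] = {pid: StateRow() for pid in process_ids}

    def on_start(self, pid: ProcessId, transaction_id: str, start_time: float) -> None:
        # Store the start time and transaction identifier against the process identifier.
        self.rows[pid] = StateRow(transaction_id, start_time)

    def on_end(self, pid: ProcessId) -> StateRow:
        # Return the stored values, then clear the row for that processing unit.
        finished = self.rows[pid]
        self.rows[pid] = StateRow()
        return finished


table = StateTable([("app1", 1), ("app1", 2)])
table.on_start(("app1", 1), "T1", 100.0)
finished = table.on_end(("app1", 1))
```

Returning the stored row from `on_end` is a convenience chosen for this sketch so that a caller can compute the execution time before the entry disappears.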

また、情報管理部１６は、運用管理者から管理表記憶部１１に記憶された各管理表をリセットする旨のリセット要求を受け付けると、管理表記憶部１１内の管理表をリセット（削除）する。ここで、情報管理部１６において、リクエスト管理部１６１が受付情報管理表をリセットし、実行時間管理部１６２が実行時間管理表をリセットし、状態管理部１６３が状態管理表をリセットする。なお、実行時間管理部１６２は、実行時間管理表の全体をリセットしてもよいし、運用管理者にて指定されたトランザクション識別子の行のみをリセットしてもよい。 In addition, upon receiving from the operation manager a reset request for resetting the management tables stored in the management table storage unit 11, the information management unit 16 resets (deletes) the management tables in the management table storage unit 11. Within the information management unit 16, the request management unit 161 resets the reception information management table, the execution time management unit 162 resets the execution time management table, and the state management unit 163 resets the state management table. Note that the execution time management unit 162 may reset the entire execution time management table, or may reset only the rows of the transaction identifiers specified by the operation manager.

また、管理者は、運用管理端末３を用いて情報管理部１６にリセット要求を入力してもよいし、サーバコンピュータ１に直接接続されたキーボードなどの入力装置（図示せず）を用いて情報管理部１６にリセット要求を入力してもよい。なお、運用管理者は、例えば、サーバコンピュータ１の稼動環境に変化があった場合に、リセット要求を入力する。これにより、新しい環境に即した遅延障害の原因の分析を行うことが可能になる。 In addition, the administrator may input the reset request to the information management unit 16 using the operation management terminal 3, or using an input device (not shown), such as a keyboard, directly connected to the server computer 1. The operation manager inputs a reset request when, for example, the operating environment of the server computer 1 has changed. This makes it possible to analyze the cause of a delay failure in a way that suits the new environment.

障害分析部１７は、分析手段の一例である。障害分析部１７は、管理表記憶部１１に記憶された各管理表に基づいて、トランザクションの遅延障害の原因を分析する。具体的には、障害分析部１７は、トランザクションの遅延障害の原因が、トランザクションの実行時間の増加か、処理要求の増加によって生じたトランザクションの実行の滞留か、それとも、トランザクションのストールによって生じたトランザクションの実行の滞留かを分析する。 The failure analysis unit 17 is an example of analysis means. The failure analysis unit 17 analyzes the cause of a transaction delay failure based on the management tables stored in the management table storage unit 11. Specifically, the failure analysis unit 17 analyzes whether the cause of the transaction delay failure is an increase in transaction execution time, retention of transaction execution caused by an increase in processing requests, or retention of transaction execution caused by a transaction stall.

ここで、障害分析部１７は、運用管理者からトランザクションの遅延障害の原因を分析する旨の分析要求を受け付けたときや、サーバコンピュータ１などに備わったトランザクションの遅延障害を検出する検出部（図示せず）にて遅延障害が検出され、その検出部から分析要求を受け付けたときなど、任意のタイミングで遅延障害を分析する。 Here, the failure analysis unit 17 analyzes the delay failure at an arbitrary timing, for example when it receives from the operation manager an analysis request to analyze the cause of a transaction delay failure, or when a detection unit (not shown) provided in the server computer 1 or elsewhere detects a transaction delay failure and the failure analysis unit 17 receives an analysis request from that detection unit.

なお、分析要求は、遅延障害の分析対象となるサーバアプリケーションを特定するサーバアプリケーション識別子を含む。以下、遅延障害の分析対象となるサーバアプリケーションを対象サーバアプリケーションと称し、対象サーバアプリケーションを特定するサーバアプリケーション識別子を対象サーバアプリケーション識別子と称することもある。また、対象サーバアプリケーションにて実行可能なトランザクションの種別を特定するトランザクション識別子を対象トランザクション識別子と称することもある。 Note that the analysis request includes a server application identifier that identifies a server application to be analyzed for a delay fault. Hereinafter, a server application to be analyzed for a delay fault may be referred to as a target server application, and a server application identifier that identifies the target server application may be referred to as a target server application identifier. In addition, a transaction identifier that specifies the type of transaction that can be executed by the target server application may be referred to as a target transaction identifier.

また、障害分析部１７は、対応情報を保持し、その対応情報から、分析要求内の対象サーバアプリケーションに対応する対象トランザクション識別子を割り出すことができる。 Further, the failure analysis unit 17 holds correspondence information, and can determine the target transaction identifier corresponding to the target server application in the analysis request from the correspondence information.

障害分析部１７は、具体的には、以下のように遅延障害の原因を分析する。 Specifically, the failure analysis unit 17 analyzes the cause of the delay failure as follows.

先ず、障害分析部１７は、管理表記憶部１１内のリクエスト管理表において、分析要求内の対象サーバアプリケーション識別子に対応する滞留数が０以下か否かを確認する。 First, the failure analysis unit 17 checks whether or not the staying number corresponding to the target server application identifier in the analysis request is 0 or less in the request management table in the management table storage unit 11.

障害分析部１７は、滞留数が０以下の場合、対象サーバアプリケーション識別子にて特定されるアプリケーションに空きスレッドがあるか否かを確認する。 When the staying number is 0 or less, the failure analysis unit 17 checks whether there is an empty thread in the application specified by the target server application identifier.

障害分析部１７は、空きスレッドがあると、トランザクションの遅延障害の原因が、トランザクションの実行時間の増加であると判断する。なお、障害分析部１７は、空きスレッドの確認を行わず、滞留数が０以下であると、トランザクションの遅延障害の原因が、トランザクションの実行時間の増加であると判断してもよい。 If there is an empty thread, the failure analysis unit 17 determines that the cause of the transaction delay failure is an increase in the execution time of the transaction. The failure analysis unit 17 may determine that the cause of the transaction delay failure is an increase in the execution time of the transaction if the number of stays is 0 or less without checking for a free thread.

このとき、情報管理部１６が、例えば、トランザクションの開始時および終了時に、管理表記憶部１１内の実行時間管理表にＣＰＵ使用時間などの追加情報が付加するなどしておき、障害分析部１７が、トランザクションの実行時間の増加が、ＣＰＵ使用時間の増加によって生じたものか、それとも、処理待ち時間の増加によって生じたものなのかなど、遅延障害の原因を絞り込んでもよい。 At this time, the information management unit 16 may, for example, add additional information such as CPU usage time to the execution time management table in the management table storage unit 11 at the start and end of each transaction, so that the failure analysis unit 17 can narrow down the cause of the delay failure, for example by determining whether the increase in transaction execution time was caused by an increase in CPU usage time or by an increase in processing wait time.

一方、障害分析部１７は、滞留数が１以上の場合、または、滞留数が０以下であり、かつ、空きスレッドがない場合、以下の処理を行う。 On the other hand, when the staying number is 1 or more, or when the staying number is 0 or less and there is no empty thread, the failure analysis unit 17 performs the following processing.

具体的には、障害分析部１７は、管理表記憶部１１内の状態管理表において、対象サーバアプリケーション識別子に対応する実行開始時刻の中で、現在時刻までの時間が基準時間以上である実行開始時刻があるか否かを判断する。障害分析部１７は、現在時刻までの時間が基準時間以上である実行開始時刻があると、トランザクションの遅延障害の原因が、その実行開始時刻に対応する処理識別子にて特定される処理単位におけるトランザクションのストールによって生じた、トランザクションの実行の滞留であると判断する。 Specifically, the failure analysis unit 17 determines whether, among the execution start times corresponding to the target server application identifier in the state management table in the management table storage unit 11, there is an execution start time for which the time elapsed up to the current time is equal to or greater than a reference time. If there is such an execution start time, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by a transaction stall in the processing unit identified by the process identifier corresponding to that execution start time.

ここで、基準時間は、運用管理者にて設定されてもよいし、障害分析部１７が実行時間管理表に基づいて算出してもよい。例えば、障害分析部１７は、全合計時間を全実行回数で除算して実行時間の平均値を算出し、その平均値を基準時間として算出する。 Here, the reference time may be set by the operation manager, or may be calculated by the failure analysis unit 17 based on the execution time management table. For example, the failure analysis unit 17 calculates the average value of the execution times by dividing the total total time by the total number of executions, and calculates the average value as the reference time.
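
The stall check and the averaged reference time can be illustrated with a short sketch. The function names, the dictionary layout (process identifier mapped to an execution start time, with `None` for a cleared entry), and the handling of an empty history are assumptions made for the example, not details given in the description.

```python
from typing import Dict, List, Optional, Tuple

ProcessId = Tuple[str, int]  # assumed: (server application id, processing unit id)


def reference_time(total_time: float, total_count: int) -> Optional[float]:
    """Average execution time, used here as the reference time."""
    if total_count == 0:
        # Assumption: with no recorded executions, no reference time can be derived.
        return None
    return total_time / total_count


def find_stalled(start_times: Dict[ProcessId, Optional[float]],
                 now: float,
                 reference: float) -> List[ProcessId]:
    """Return the processing units whose running transaction has been
    executing for at least `reference` time units."""
    return [pid for pid, start in start_times.items()
            if start is not None and now - start >= reference]


ref = reference_time(total_time=100.0, total_count=50)  # 2.0
running = {("app1", 1): 10.0, ("app1", 2): None, ("app1", 3): 14.5}
stalled = find_stalled(running, now=15.0, reference=ref)  # [("app1", 1)]
```

In practice an operator would probably use a multiple of the average rather than the average itself, since roughly half of all healthy transactions run longer than the mean; the description leaves that choice to the operation manager.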

情報管理部１６が、例えば、トランザクションの開始時および終了時に、状態管理表および実行時間管理表にＣＰＵ使用時間などの追加情報などを付加するなどしておき、障害分析部１７が、トランザクションがストール状態である原因が、ＣＰＵループによって生じたものか、それとも、それ以外のバックエンドの処理待ち等によるものなのかなど、遅延障害の原因を絞り込んでもよい。 The information management unit 16 may, for example, add additional information such as CPU usage time to the state management table and the execution time management table at the start and end of each transaction, so that the failure analysis unit 17 can narrow down the cause of the delay failure, for example by determining whether the transaction is stalled because of a CPU loop or because of something else, such as waiting for back-end processing.

また、障害分析部１７は、管理表記憶部１１内の実行時間管理表において、対象トランザクション識別子に対応する直近合計時間を直近実行回数で除算して、サーバアプリケーション部１４が直近の単位時間内に実行したトランザクションの実行時間の平均値である直近平均実行時間を算出する。また、障害分析部１７は、対象トランザクション識別子に対応する全合計時間を全実行回数で除算して、サーバアプリケーション部１４が直近の単位時間以前に実行したトランザクションの実行時間の平均値である全平均実行時間を算出する。 In addition, the failure analysis unit 17 divides the latest total time corresponding to the target transaction identifier in the execution time management table in the management table storage unit 11 by the latest execution count, thereby calculating the latest average execution time, which is the average execution time of the transactions that the server application unit 14 executed within the latest unit time. The failure analysis unit 17 also divides the total total time corresponding to the target transaction identifier by the total execution count, thereby calculating the total average execution time, which is the average execution time of the transactions that the server application unit 14 executed before the latest unit time.

障害分析部１７は、直近平均実行時間の全平均実行時間に対する増加度合を求め、その増加度合が設定値以上か否かを判断する。ここで、増加度合は、直近平均実行時間および全平均実行時間の差分または比率である。 The failure analysis unit 17 obtains an increase degree of the latest average execution time with respect to the total average execution time, and determines whether the increase degree is equal to or greater than a set value. Here, the degree of increase is the difference or ratio between the latest average execution time and the total average execution time.

障害分析部１７は、その増加度合が設定値以上であると、トランザクション装置の遅延障害の原因が、トランザクションの実行時間の増加であると判断する。また、障害分析部１７は、その結果、トランザクションの実行の滞留も発生していると判断する。 The failure analysis unit 17 determines that the cause of the delay failure of the transaction apparatus is an increase in the execution time of the transaction if the increase degree is equal to or greater than the set value. As a result, the failure analysis unit 17 determines that the stay of transaction execution has also occurred.
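
As a worked illustration of the increase-degree test, the sketch below computes both averages and compares their difference or ratio with a set value. The function names, the `mode` parameter, and the treatment of a zero total average are assumptions introduced for this example.

```python
def average(total: float, count: int) -> float:
    """Average execution time; 0.0 when nothing has been executed (assumption)."""
    return total / count if count else 0.0


def increase_degree(latest_avg: float, total_avg: float, mode: str = "ratio") -> float:
    """Degree of increase of the latest average over the total average,
    expressed as a difference or as a ratio."""
    if mode == "difference":
        return latest_avg - total_avg
    if mode == "ratio":
        if total_avg == 0:
            # Assumption: with no history, any non-zero recent average
            # counts as an unbounded increase.
            return float("inf") if latest_avg > 0 else 0.0
        return latest_avg / total_avg
    raise ValueError(f"unknown mode: {mode}")


def execution_time_increased(latest_total: float, latest_count: int,
                             total_total: float, total_count: int,
                             set_value: float, mode: str = "ratio") -> bool:
    latest_avg = average(latest_total, latest_count)
    total_avg = average(total_total, total_count)
    return increase_degree(latest_avg, total_avg, mode) >= set_value


# Latest unit time: 10 executions taking 30.0 in total (average 3.0).
# Earlier history: 100 executions taking 200.0 in total (average 2.0).
flag = execution_time_increased(30.0, 10, 200.0, 100, set_value=1.5)
```

With these numbers the ratio is 3.0 / 2.0 = 1.5, which meets a set value of 1.5, so the execution-time increase would be reported.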

情報管理部１６が、例えば、トランザクションの開始時および終了時に、実行時間管理表にＣＰＵ使用時間などの追加情報などを付加するなどしておき、障害分析部１７が、トランザクションがストール状態である原因が、ＣＰＵループによって生じたものか、それとも、処理待ち時間の増加なのかなど、遅延障害の原因を絞り込んでもよい。 The information management unit 16 may, for example, add additional information such as CPU usage time to the execution time management table at the start and end of each transaction, so that the failure analysis unit 17 can narrow down the cause of the delay failure, for example by determining whether the transaction is stalled because of a CPU loop or because of an increase in processing wait time.

また、障害分析部１７は、管理表記憶部１１内の実行時間管理表において、対象トランザクション識別子に対応する直近受付回数の、対象トランザクション識別子に対応する平均受付回数に対する増加度合を算出する。ここで、増加度合は、直近受付回数および平均受付回数の差分または比率である。 Also, the failure analysis unit 17 calculates the degree of increase of the latest reception count corresponding to the target transaction identifier with respect to the average reception count corresponding to the target transaction identifier in the execution time management table in the management table storage unit 11. Here, the degree of increase is the difference or ratio between the latest reception count and the average reception count.

障害分析部１７は、その増加度合が基準値以上か否かを判断する。なお、基準値は、運用管理者にて設定されてもよいし、障害分析部１７は、直近受付回数および全受付回数から求めてもよい。 The failure analysis unit 17 determines whether or not the increase degree is equal to or higher than a reference value. The reference value may be set by the operation manager, or the failure analysis unit 17 may obtain the reference value from the latest reception count and the total reception count.

障害分析部１７は、その増加度合が基準値以上であると、トランザクションの遅延障害の原因が、処理要求の増加によって生じたトランザクションの実行の滞留であると判断する。 If the increase degree is equal to or greater than the reference value, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by an increase in processing requests.
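
The analysis described so far can be summarized as one decision routine. The sketch below is an interpretation under stated assumptions: the cause labels, the function signature, and the choice to evaluate all three checks (rather than stop at the first match) are not specified in this passage and are introduced only for illustration.

```python
from typing import List

# Hypothetical labels for the three causes named in the description.
EXEC_TIME_INCREASE = "execution_time_increase"
STALL = "retention_due_to_stall"
REQUEST_INCREASE = "retention_due_to_request_increase"


def analyze(staying_number: int,
            has_empty_thread: bool,
            stalled_unit_found: bool,
            exec_time_increase: float,
            set_value: float,
            request_increase: float,
            reference_value: float) -> List[str]:
    """Return the causes suggested by the management tables."""
    # Nothing is queued and a thread is idle: requests are not waiting,
    # so the delay is attributed to longer execution times.
    if staying_number <= 0 and has_empty_thread:
        return [EXEC_TIME_INCREASE]

    causes: List[str] = []
    if stalled_unit_found:
        causes.append(STALL)
    if exec_time_increase >= set_value:
        causes.append(EXEC_TIME_INCREASE)
    if request_increase >= reference_value:
        causes.append(REQUEST_INCREASE)
    return causes


# Three requests are queued, no unit is stalled, execution time is steady,
# but the reception count has doubled against a reference value of 1.5.
result = analyze(3, False, False, 1.0, 1.5, 2.0, 1.5)
```

Returning a list keeps the sketch neutral about whether several causes may be reported at once; an implementation that wants a single cause would need an explicit priority order.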

障害対応部１８は、障害分析部１７の分析結果に応じた、トランザクションの遅延障害に対する対処を行う。 The failure handling unit 18 copes with a transaction delay failure according to the analysis result of the failure analysis unit 17.

例えば、障害対応部１８は、障害分析部１７にてトランザクションの遅延障害の原因がトランザクションの実行時間の増加であると判断されると、運用管理者への通知や、サーバアプリケーションの処理能力増強、追加の情報採取などを行う。 For example, when the failure analysis unit 17 determines that the cause of the transaction delay failure is an increase in transaction execution time, the failure handling unit 18 notifies the operation manager, increases the processing capacity of the server application, collects additional information, and so on.

また、障害対応部１８は、障害分析部１７にてトランザクションの遅延障害の原因がトランザクションのストールによって生じたトランザクションの実行の滞留であると判断されると、サーバアプリケーションの同時処理可能数の追加、追加の情報採取およびストール状態のトランザクションの強制終了などを行なう。 In addition, when the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by a transaction stall, the failure handling unit 18 increases the number of transactions the server application can process simultaneously, collects additional information, forcibly terminates the stalled transaction, and so on.

また、障害対応部１８は、障害分析部１７にてトランザクションの遅延障害の原因が処理要求の増加によって生じたトランザクションの実行の滞留であると判断すると、サーバアプリケーションの同時処理可能数の追加、追加の情報採取を行う。また、障害対応部１８は、待ち行列理論を利用して、最適なサーバアプリケーションの多重度（処理単位の数の上限および下限）を設定するなどしてもよい。 In addition, when the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by an increase in processing requests, the failure handling unit 18 increases the number of transactions the server application can process simultaneously and collects additional information. The failure handling unit 18 may also use queuing theory to set an optimal multiplicity for the server application (upper and lower limits on the number of processing units).

次に動作を説明する。 Next, the operation will be described.

先ず、トランザクションの処理要求を受け付けた際のサーバコンピュータ１の動作を説明する。図７は、この動作を説明するためのシーケンス図である。 First, the operation of the server computer 1 when a transaction processing request is received will be described. FIG. 7 is a sequence diagram for explaining this operation.

リクエスト受付部１２は、クライアント端末２からトランザクションの処理要求を受け付けると、ステップＳ１を実行する。 When receiving a transaction processing request from the client terminal 2, the request receiving unit 12 executes Step S1.

ステップＳ１では、リクエスト受付部１２は、処理要求内のトランザクション識別子と、処理要求を受け付けた旨の受付情報とを情報管理部１６のリクエスト管理部１６１に出力する。 In step S 1, the request reception unit 12 outputs the transaction identifier in the processing request and the reception information indicating that the processing request has been received to the request management unit 161 of the information management unit 16.

リクエスト管理部１６１は、トランザクション識別子および受付情報を受け付けると、保持している対応情報から、そのトランザクション識別子に対応するサーバアプリケーション識別子を割り出し、そのサーバアプリケーション識別子を管理表記憶部１１内の受付情報管理表から検索する。 When the request management unit 161 receives the transaction identifier and the reception information, the request management unit 161 calculates a server application identifier corresponding to the transaction identifier from the held correspondence information, and manages the server application identifier in the reception information management in the management table storage unit 11. Search from the table.

リクエスト管理部１６１は、そのサーバアプリケーション識別子を見つけると、そのサーバアプリケーション識別子に対応する直近受付回数をインクリメントする。また、リクエスト管理部１６１は、そのサーバアプリケーション識別子が見つからないと、そのサーバアプリケーションを有する受付情報管理表の行を生成する。このとき、リクエスト管理部１６１は、その行内の平均受付回数を０とし、直近受付回数を１とし、滞留数を０とする。 When the request management unit 161 finds the server application identifier, it increments the latest reception count corresponding to that server application identifier. If the server application identifier is not found, the request management unit 161 creates a row for that server application identifier in the reception information management table. In that row, the request management unit 161 sets the average reception count to 0, the latest reception count to 1, and the staying number to 0.

また、リクエスト管理部１６１は、単位時間ごとに、受付情報管理表の平均受付回数を更新する。例えば、リクエスト管理部１６１は、直近受付回数を平均受付回数に加算し、その加算結果を２で除算する。リクエスト管理部１６１は、その除算結果を新しい平均受付回数として計測し、受付情報管理表の平均受付回数を、その新しい平均受付回数に更新する。なお、リクエスト管理部１６１は、この処理をサーバアプリケーション識別子ごとに行う。 In addition, the request management unit 161 updates the average number of receptions in the reception information management table every unit time. For example, the request management unit 161 adds the latest reception count to the average reception count and divides the addition result by 2. The request management unit 161 measures the division result as a new average reception count, and updates the average reception count in the reception information management table to the new average reception count. Note that the request management unit 161 performs this process for each server application identifier.
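
The bookkeeping for the reception information management table can be sketched as follows. The class and field names are hypothetical, and resetting the latest reception count to 0 after each unit time is an assumption made for this example; the passage above does not state what happens to the latest count once it has been folded into the average.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class ReceptionRow:
    average_count: float = 0.0  # average reception count
    latest_count: int = 0       # latest reception count
    staying_number: int = 0     # processing requests waiting in the queue


class ReceptionTable:
    """Hypothetical stand-in for the reception information management table."""

    def __init__(self) -> None:
        self.rows: Dict[str, ReceptionRow] = {}

    def on_request(self, app_id: str) -> None:
        row = self.rows.get(app_id)
        if row is None:
            # New row: average 0, latest 1, staying number 0.
            self.rows[app_id] = ReceptionRow(0.0, 1, 0)
        else:
            row.latest_count += 1

    def on_unit_time(self) -> None:
        for row in self.rows.values():
            # New average = (old average + latest count) / 2.
            row.average_count = (row.average_count + row.latest_count) / 2
            row.latest_count = 0  # assumption: start a fresh measurement window


table = ReceptionTable()
for _ in range(3):
    table.on_request("app1")
table.on_unit_time()   # average becomes (0 + 3) / 2 = 1.5
table.on_request("app1")
table.on_unit_time()   # average becomes (1.5 + 1) / 2 = 1.25
```

Halving the sum each unit time weights recent windows more heavily than older ones, so the stored average behaves like an exponentially decaying average rather than a plain arithmetic mean over all windows.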

また、リクエスト受付部１２は、トランザクション識別子および受付情報を出力すると、ステップＳ２を実行する。 Further, when the request reception unit 12 outputs the transaction identifier and the reception information, the request reception unit 12 executes Step S2.

ステップＳ２では、リクエスト受付部１２は、処理要求をキュー制御部１３に出力する。 In step S 2, the request reception unit 12 outputs a processing request to the queue control unit 13.

キュー制御部１３は、処理要求を受け付けると、その処理要求を、処理要求内のトランザクション識別子にて特定されるトランザクションの種別の待ち行列に滞留する。キュー制御部１３は、処理要求を滞留すると、ステップＳ３を実行する。 Upon receiving the processing request, the queue control unit 13 places it in the queue for the transaction type identified by the transaction identifier in the processing request. After queuing the processing request, the queue control unit 13 executes step S3.

ステップＳ３では、キュー制御部１３は、処理要求内のトランザクション識別子と、処理要求を滞留した旨の滞留情報とを情報管理部１６のリクエスト管理部１６１に出力する。 In step S 3, the queue control unit 13 outputs the transaction identifier in the processing request and the staying information indicating that the processing request has stayed to the request management unit 161 of the information management unit 16.

リクエスト管理部１６１は、トランザクション識別子および滞留情報を受け付けると、保持している対応情報から、そのトランザクション識別子に対応するサーバアプリケーション識別子を割り出し、そのサーバアプリケーション識別子を管理表記憶部１１内の受付情報管理表から検索する。リクエスト管理部１６１は、その検索されたサーバアプリケーション識別子に対応する滞留数をインクリメントして、ステップＳ３を終了する。 When the request management unit 161 receives the transaction identifier and the staying information, the request management unit 161 determines the server application identifier corresponding to the transaction identifier from the held correspondence information, and manages the server application identifier in the reception information management in the management table storage unit 11. Search from the table. The request management unit 161 increments the staying number corresponding to the retrieved server application identifier, and ends Step S3.

また、トランザクション制御部１５は、定期的にステップＳ４を実行する。 In addition, the transaction control unit 15 periodically executes step S4.

ステップＳ４では、トランザクション制御部１５は、アプリケーション識別子ごとに、空きスレッドを確認する旨の確認要求を情報管理部１６内の状態管理部１６３に出力する。確認要求は、そのアプリケーション識別子を含む。 In step S4, the transaction control unit 15 outputs a confirmation request for confirming an empty thread to the state management unit 163 in the information management unit 16 for each application identifier. The confirmation request includes the application identifier.

状態管理部１６３は、確認要求を受け付けると、その確認要求内のアプリケーション識別子を管理表記憶部１１内の状態管理表から全て検索する。状態管理部１６３は、その検索されたアプリケーション識別子に対応付けられた実行開始時刻の中で、クリアされた実行開始時刻があるか否かを確認する。状態管理部１６３は、クリアされた実行開始時刻があると、空きスレッドがあることを示す確認結果を生成し、クリアされた実行開始時刻がないと、空きスレッドがないことを示す確認結果を生成する。確認結果は、確認要求内のサーバアプリケーションを含む。 Upon receiving the confirmation request, the state management unit 163 searches the state management table in the management table storage unit 11 for all entries with the application identifier in the confirmation request. The state management unit 163 checks whether any of the execution start times associated with the retrieved application identifier has been cleared. If there is a cleared execution start time, the state management unit 163 generates a confirmation result indicating that there is an empty thread; if there is none, it generates a confirmation result indicating that there is no empty thread. The confirmation result includes the server application identifier from the confirmation request.

状態管理部１６３は、その確認結果をトランザクション制御部１５に出力する。 The state management unit 163 outputs the confirmation result to the transaction control unit 15.

トランザクション制御部１５は、確認結果を受け付けると、その確認結果を確認する。トランザクション制御部１５は、その確認結果が空きスレッドがあることを示すと、サーバアプリケーション部１４にてトランザクションが新たに実行可能であると判断し、ステップＳ５を実行する。一方、トランザクション制御部１５は、その確認結果が空きスレッドがないことを示すと、サーバアプリケーション部１４にてトランザクションが新たに実行可能ではないと判断し、処理を終了する。 When the transaction control unit 15 receives the confirmation result, the transaction control unit 15 confirms the confirmation result. When the confirmation result indicates that there is an empty thread, the transaction control unit 15 determines that the transaction can be newly executed by the server application unit 14, and executes step S5. On the other hand, when the confirmation result indicates that there is no empty thread, the transaction control unit 15 determines that the server application unit 14 cannot newly execute a transaction, and ends the process.
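
The empty-thread check of step S4 reduces to looking for a cleared execution start time among the rows of one server application. A minimal sketch, assuming the state management table is held as a dictionary from (server application identifier, processing unit identifier) to an execution start time, with `None` standing for a cleared entry:

```python
from typing import Dict, Optional, Tuple

ProcessId = Tuple[str, int]  # assumed: (server application id, processing unit id)


def has_empty_thread(start_times: Dict[ProcessId, Optional[float]], app_id: str) -> bool:
    """True if at least one processing unit of `app_id` has a cleared start time."""
    return any(start is None
               for (row_app, _unit), start in start_times.items()
               if row_app == app_id)


state = {
    ("app1", 1): 10.0,   # busy since t=10.0
    ("app1", 2): None,   # cleared, so this thread is idle
    ("app2", 1): 12.0,   # busy
}
app1_can_run = has_empty_thread(state, "app1")
app2_can_run = has_empty_thread(state, "app2")
```

Here `app1` can accept a new transaction and `app2` cannot, which corresponds to proceeding to step S5 in the first case and ending the process in the second.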

ステップＳ５では、トランザクション制御部１５は、確認結果内のサーバアプリケーション識別子にて特定されるサーバアプリケーションが実行可能な種別のトランザクションに応じた処理要求を、キュー制御部１３の待ち行列から取得する。 In step S5, the transaction control unit 15 acquires, from the queue of the queue control unit 13, a processing request for a type of transaction that can be executed by the server application identified by the server application identifier in the confirmation result.

例えば、トランザクション制御部１５は、対応情報を保持し、対応情報から、確認結果内のサーバアプリケーション識別子に対応するトランザクション識別子を割り出す。そして、トランザクション制御部１５は、そのトランザクション識別子にて特定されるトランザクションの種別の待ち行列から、その待ち行列内の処理要求を一つ取得する。 For example, the transaction control unit 15 holds correspondence information, and determines a transaction identifier corresponding to the server application identifier in the confirmation result from the correspondence information. Then, the transaction control unit 15 acquires one processing request in the queue from the queue of the transaction type specified by the transaction identifier.

トランザクション制御部１５は、処理要求を取得すると、ステップＳ６を実行する。 When acquiring the processing request, the transaction control unit 15 executes Step S6.

ステップＳ６では、トランザクション制御部１５は、その処理要求内のトランザクション識別子と、滞留数が減少した旨の減少情報とを情報管理部１６のリクエスト管理部１６１に出力する。 In step S6, the transaction control unit 15 outputs the transaction identifier in the processing request and the decrease information indicating that the staying number has decreased to the request management unit 161 of the information management unit 16.

リクエスト管理部１６１は、トランザクション識別子および減少情報を受け付けると、保持している対応情報から、そのトランザクション識別子に対応するサーバアプリケーション識別子を割り出し、そのサーバアプリケーション識別子を管理表記憶部１１内の受付情報管理表から検索する。リクエスト管理部１６１は、その検索されたサーバアプリケーション識別子に対応する滞留数をディクリメントする。 Upon receiving the transaction identifier and the decrease information, the request management unit 161 determines the server application identifier corresponding to the transaction identifier from the correspondence information it holds, and searches the reception information management table in the management table storage unit 11 for that server application identifier. The request management unit 161 then decrements the staying number corresponding to the retrieved server application identifier.

また、トランザクション制御部１５は、トランザクション識別子および減少情報を出力すると、ステップＳ７を実行する。 Further, when the transaction control unit 15 outputs the transaction identifier and the decrease information, the transaction control unit 15 executes Step S7.

ステップＳ７では、トランザクション制御部１５は、処理要求をサーバアプリケーション部１４に出力する。 In step S 7, the transaction control unit 15 outputs a processing request to the server application unit 14.

サーバアプリケーション部１４は、処理要求を受け付けると、その処理要求内のトランザクション識別子にて特定されるトランザクションを、そのトランザクションに対応するサーバアプリケーションの空きスレッドで実行する。 When the server application unit 14 receives the processing request, the server application unit 14 executes the transaction specified by the transaction identifier in the processing request with an empty thread of the server application corresponding to the transaction.

そして、サーバアプリケーション部１４は、トランザクションの実行を開始した旨の開始情報をトランザクション制御部１５に出力する。なお、開始情報は、実行を開始したトランザクションの種別を特定するトランザクション識別子と、そのトランザクションを実行したスレッドを特定する処理識別子（サーバアプリケーション識別子および処理単位識別子）とを含む。 Then, the server application unit 14 outputs start information indicating that the execution of the transaction has started to the transaction control unit 15. The start information includes a transaction identifier that identifies the type of transaction that has started execution, and a process identifier (server application identifier and process unit identifier) that identifies the thread that executed the transaction.

トランザクション制御部１５は、開始情報を受け付けると、ステップＳ８を実行する。 When the transaction control unit 15 receives the start information, the transaction control unit 15 executes Step S8.

ステップＳ８では、トランザクション制御部１５は、現在の時刻を実行開始時刻として計測し、その実行開始時刻と、開始情報内のトランザクション識別子および処理識別子を情報管理部１６内の状態管理部１６３に出力する。 In step S8, the transaction control unit 15 measures the current time as the execution start time, and outputs the execution start time, the transaction identifier and the process identifier in the start information to the state management unit 163 in the information management unit 16. .

状態管理部１６３は、実行開始時刻、トランザクション識別子および処理識別子を受け付けると、その処理識別子を管理表記憶部１１内の状態管理表から検索する。状態管理部１６３は、その処理識別子を見つけると、その処理識別子に、その実行開始時刻およびトランザクション識別子を対応付けて、管理表記憶部１１に記憶して、ステップＳ８を終了する。 Upon receiving the execution start time, the transaction identifier, and the process identifier, the state management unit 163 searches the state management table in the management table storage unit 11 for that process identifier. When it finds the process identifier, the state management unit 163 associates the execution start time and the transaction identifier with it, stores them in the management table storage unit 11, and ends step S8.

また、サーバアプリケーション部１４は、ステップＳ７で実行を開始したトランザクションが終了すると、ステップＳ９を実行する。 In addition, when the transaction whose execution was started in step S7 ends, the server application unit 14 executes step S9.

ステップＳ９では、サーバアプリケーション部１４は、トランザクションの実行を終了した旨の終了情報と、そのトランザクションに応じた応答情報をトランザクション制御部１５に出力する。なお、終了情報は、トランザクションを実行していた処理単位を特定する処理識別子を含む。トランザクション制御部１５は、終了情報および応答情報を受け付けると、ステップＳ１０を実行する。 In step S 9, the server application unit 14 outputs to the transaction control unit 15 end information indicating that the execution of the transaction has ended and response information corresponding to the transaction. Note that the end information includes a processing identifier that identifies a processing unit that has executed the transaction. When receiving the end information and the response information, the transaction control unit 15 executes Step S10.

ステップＳ１０では、トランザクション制御部１５は、現在の時刻を実行終了時刻として計測し、その実行終了時刻と、終了情報内の処理識別子を情報管理部１６内の実行時間管理部１６２に出力する。 In step S 10, the transaction control unit 15 measures the current time as the execution end time, and outputs the execution end time and the processing identifier in the end information to the execution time management unit 162 in the information management unit 16.

実行時間管理部１６２は、実行終了時刻および処理識別子を受け付けると、実行開始時刻を取得する旨の取得要求を状態管理部１６３に出力する。なお、取得要求は、その処理識別子を含む。 When the execution time management unit 162 receives the execution end time and the process identifier, the execution time management unit 162 outputs an acquisition request to the state management unit 163 to acquire the execution start time. The acquisition request includes the processing identifier.

状態管理部１６３は、取得要求を受け付けると、その取得要求内の処理識別子に対応するトランザクション識別子および実行開始時刻を、管理表記憶部１１内の状態管理表から取得する。状態管理部１６３は、その取得したトランザクション識別子および実行開始時刻を実行時間管理部１６２に出力する。状態管理部１６３は、トランザクション識別子および実行開始時刻を出力すると、管理表記憶部１１内の状態管理表から、その取得要求内の処理識別子に対応するトランザクション識別子および実行開始時刻をクリアする。 When receiving the acquisition request, the state management unit 163 acquires a transaction identifier and an execution start time corresponding to the processing identifier in the acquisition request from the state management table in the management table storage unit 11. The state management unit 163 outputs the acquired transaction identifier and execution start time to the execution time management unit 162. When outputting the transaction identifier and the execution start time, the state management unit 163 clears the transaction identifier and the execution start time corresponding to the processing identifier in the acquisition request from the state management table in the management table storage unit 11.

実行時間管理部１６２は、トランザクション識別子および実行開始時刻を受け付けると、その実行開始時刻から実行終了時刻までの時間を実行時間として計測する。 When receiving the transaction identifier and the execution start time, the execution time management unit 162 measures the time from the execution start time to the execution end time as the execution time.

実行時間管理部１６２は、そのトランザクション識別子を管理表記憶部１１内の実行時間管理表から検索する。 The execution time management unit 162 retrieves the transaction identifier from the execution time management table in the management table storage unit 11.

実行時間管理部１６２は、そのトランザクション識別子が見つかると、そのトランザクション識別子に対応する直近合計時間にその計測した実行時間を加算し、かつ、そのトランザクション識別子に対応する直近実行回数をインクリメントする。また、実行時間管理部１６２は、そのトランザクション識別子が見つからないと、そのトランザクション識別子を有する実行時間管理表の行を生成する。このとき、実行時間管理部１６２は、その行内の、全実行回数を０とし、全合計時間を０とし、直近実行回数を１とし、直近合計時間を、計測した実行時間とする。 When the transaction identifier is found, the execution time management unit 162 adds the measured execution time to the latest total time corresponding to the transaction identifier, and increments the number of latest executions corresponding to the transaction identifier. If the transaction identifier is not found, the execution time management unit 162 generates a row of the execution time management table having the transaction identifier. At this time, the execution time management unit 162 sets the total number of executions to 0, sets the total total time to 0, sets the latest number of executions to 1, and sets the latest total time as the measured execution time.

また、実行時間管理部１６２は、単位時間ごとに、実行時間管理表の全合計時間および全実行回数を更新する。例えば、実行時間管理部１６２は、直近合計時間を全合計時間に加算し、直近実行回数を全実行回数に加算する。なお、実行時間管理部１６２は、この処理をトランザクション識別子ごとに行う。 In addition, the execution time management unit 162 updates the total total time and the total number of executions in the execution time management table for each unit time. For example, the execution time management unit 162 adds the latest total time to the total total time, and adds the latest execution count to the total execution count. Note that the execution time management unit 162 performs this process for each transaction identifier.
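
The updates to the execution time management table in step S10 and at each unit time can be sketched as follows. The class and field names are hypothetical. Resetting the latest values after they are added to the totals is an assumption made for this example; it keeps the totals from counting the same executions twice, but the passage above does not state it.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class ExecRow:
    total_count: int = 0      # total execution count
    total_time: float = 0.0   # total total time
    latest_count: int = 0     # latest execution count
    latest_time: float = 0.0  # latest total time


class ExecTimeTable:
    """Hypothetical stand-in for the execution time management table."""

    def __init__(self) -> None:
        self.rows: Dict[str, ExecRow] = {}

    def on_end(self, transaction_id: str, start_time: float, end_time: float) -> None:
        elapsed = end_time - start_time
        row = self.rows.get(transaction_id)
        if row is None:
            # New row: totals 0, latest count 1, latest total = measured time.
            self.rows[transaction_id] = ExecRow(0, 0.0, 1, elapsed)
        else:
            row.latest_time += elapsed
            row.latest_count += 1

    def on_unit_time(self) -> None:
        for row in self.rows.values():
            row.total_time += row.latest_time
            row.total_count += row.latest_count
            # Assumption: begin a fresh "latest" window after the roll-up.
            row.latest_time = 0.0
            row.latest_count = 0


table = ExecTimeTable()
table.on_end("T1", start_time=10.0, end_time=12.0)  # 2.0 elapsed
table.on_end("T1", start_time=20.0, end_time=23.0)  # 3.0 elapsed
table.on_unit_time()  # totals now hold 2 executions and 5.0 time units
```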

また、トランザクション制御部１５は、実行終了時刻および処理識別子を情報管理部１６内の実行時間管理部１６２に出力すると、ステップＳ１１を実行する。 When the transaction control unit 15 outputs the execution end time and the process identifier to the execution time management unit 162 in the information management unit 16, the transaction control unit 15 executes Step S11.

In step S11, the transaction control unit 15 outputs the response information received in step S9 to the request reception unit 12 via the queue control unit 13. On receiving the response information, the request reception unit 12 executes step S12.

In step S12, the request reception unit 12 transmits the response information to the client terminal 2 and ends the process.

次に、トランザクションの遅延障害の原因を分析する際のサーバコンピュータ１の動作について説明する。図８は、この動作を説明するためのシーケンス図である。 Next, the operation of the server computer 1 when analyzing the cause of a transaction delay failure will be described. FIG. 8 is a sequence diagram for explaining this operation.

ステップＳ１０１では、障害分析部１７は、運用管理者などから分析要求を受け付ける。その後、ステップＳ１０２が実行される。 In step S101, the failure analysis unit 17 receives an analysis request from an operation manager or the like. Thereafter, step S102 is executed.

In step S102, the failure analysis unit 17 searches the request management table in the management table storage unit 11 for the server application identifier contained in the analysis request. The failure analysis unit 17 checks whether the staying number (the number of queued processing requests) corresponding to the retrieved server application identifier is 0 or less. If the staying number is 0 or less, the failure analysis unit 17 executes step S103; if the staying number is 1 or more, it executes step S105.

In step S103, the failure analysis unit 17 retrieves every entry for the target server application identifier from the state management table in the management table storage unit 11 and checks whether any of the corresponding execution start times has been cleared. If a cleared execution start time exists, the failure analysis unit 17 determines that there is a free thread; if no cleared execution start time exists, it determines that there is no free thread.

障害分析部１７は、空きスレッドがあると、ステップＳ１０４を実行し、空きスレッドがないと、ステップＳ１０５を実行する。なお、ステップＳ１０３は、スキップされてもよい。 The failure analysis unit 17 executes step S104 when there is an empty thread, and executes step S105 when there is no empty thread. Note that step S103 may be skipped.

ステップＳ１０４では、障害分析部１７は、トランザクションの遅延障害の原因が、トランザクションの実行時間の増加であると判断し、その判断結果を遅延障害の原因の分析結果として障害対応部１８に出力する。障害対応部１８は、分析結果を受け付けると、ステップＳ１１１を実行する。なお、分析結果は、対象サーバアプリケーション識別子を含む。 In step S104, the failure analysis unit 17 determines that the cause of the transaction delay failure is an increase in the execution time of the transaction, and outputs the determination result to the failure handling unit 18 as the analysis result of the cause of the delay failure. When the failure handling unit 18 receives the analysis result, the failure handling unit 18 executes Step S111. The analysis result includes the target server application identifier.

また、ステップＳ１０５では、障害分析部１７は、管理表記憶部１１内の状態管理表から、対象サーバアプリケーション識別子に対応する実行開始時刻を全て取得する。障害分析部１７は、実行開始時刻ごとに、現在時刻から実行開始時刻を減算して実行中時間を算出する。障害分析部１７は、その実行中時間の中で、基準時間以上の実行中時間があるか否かを判断する。障害分析部１７は、基準時間以上の実行中時間があると、ステップＳ１０６を実行し、基準時間以上の実行中時間がないと、ステップＳ１０７を実行する。 In step S105, the failure analysis unit 17 acquires all execution start times corresponding to the target server application identifiers from the state management table in the management table storage unit 11. The failure analysis unit 17 calculates the running time by subtracting the execution start time from the current time for each execution start time. The failure analysis unit 17 determines whether there is an execution time that is equal to or greater than the reference time in the execution time. The failure analysis unit 17 executes Step S106 when there is an execution time that is equal to or greater than the reference time, and executes Step S107 when there is no execution time that is equal to or greater than the reference time.
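
The check in step S105 can be illustrated with a short sketch. The function name `find_stalled`, the dictionary layout, and the use of `None` to represent a cleared execution start time are assumptions made for illustration only.

```python
def find_stalled(start_times, now, reference_time):
    """Return {process identifier: running time} for entries at or over the reference time.

    start_times maps a process identifier to its execution start time,
    or to None when the start time has been cleared (assumed representation).
    """
    stalled = {}
    for process_id, start in start_times.items():
        if start is None:
            continue  # nothing is running in this processing unit
        running = now - start  # running time = current time - execution start time
        if running >= reference_time:
            stalled[process_id] = running
    return stalled
```

For example, with start times `{"p1": 100.0, "p2": None, "p3": 158.0}`, a current time of `160.0` and a reference time of `30.0`, only `p1` (running for 60.0) is reported, so the flow would proceed to step S106.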

In step S106, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by a transaction stall, and outputs this determination to the failure handling unit 18 as the analysis result of the cause of the delay failure. On receiving the analysis result, the failure handling unit 18 executes step S111.

In step S107, the failure analysis unit 17 uses the correspondence information it holds to identify the target transaction identifier corresponding to the target server application identifier, and acquires the total total time, total number of executions, latest total time, and latest number of executions corresponding to that target transaction identifier from the execution time management table in the management table storage unit 11.

障害分析部１７は、全合計時間を全実行回数で除算して全平均実行時間を算出し、また、直近合計時間を直近実行回数で除算して直近平均実行時間を算出する。障害分析部１７は、直近平均実行時間の全平均実行時間に対する増加度合を算出する。例えば、障害分析部１７は、直近平均実行時間および全平均実行時間の比率または差分を増加度合として算出する。 The failure analysis unit 17 calculates the total average execution time by dividing the total total time by the total number of executions, and calculates the latest average execution time by dividing the most recent total time by the most recent execution number. The failure analysis unit 17 calculates the degree of increase of the latest average execution time with respect to the total average execution time. For example, the failure analysis unit 17 calculates a ratio or difference between the latest average execution time and the total average execution time as the increase degree.

そして、障害分析部１７は、増加度合が設定値以上か否かを判断する。障害分析部１７は、増加度合が設定値以上であると、ステップＳ１０８を実行し、増加度合が設定値未満であると、ステップＳ１０９を実行する。 Then, the failure analysis unit 17 determines whether or not the degree of increase is greater than or equal to a set value. The failure analysis unit 17 executes Step S108 when the increase degree is equal to or greater than the set value, and executes Step S109 when the increase degree is less than the set value.
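
The calculation in step S107 can be sketched as below. The helper names `average` and `increase_degree` and the `mode` parameter are hypothetical. Returning `None` when a count is zero is an assumption: the text does not say how a zero execution count (for example, a newly created row whose total number of executions is still 0) is handled.

```python
def average(total_time, count):
    """Average execution time, or None when nothing has been executed yet."""
    return total_time / count if count > 0 else None


def increase_degree(recent_avg, overall_avg, mode="ratio"):
    """Degree of increase of the latest average over the total average.

    mode="ratio" uses latest / total; mode="difference" uses latest - total.
    """
    if recent_avg is None or overall_avg is None:
        return None  # not enough data to judge (assumed behaviour)
    if mode == "ratio":
        return recent_avg / overall_avg if overall_avg > 0 else None
    if mode == "difference":
        return recent_avg - overall_avg
    raise ValueError(f"unknown mode: {mode}")
```

With a total of 500.0 over 100 executions and a latest total of 120.0 over 10 executions, the averages are 5.0 and 12.0, giving a ratio of 2.4 or a difference of 7.0. Whether that counts as an increase depends on the set value, and a ratio and a difference need different set values.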

ステップＳ１０８では、障害分析部１７は、トランザクションの遅延障害の原因が、トランザクションの実行時間の増加であり、その結果、トランザクションの実行の滞留も発生していると判断する。障害分析部１７は、その判断結果を遅延障害の原因の分析結果として障害対応部１８に出力する。障害対応部１８は、分析結果を受け付けると、ステップＳ１１１を実行する。 In step S108, the failure analysis unit 17 determines that the cause of the transaction delay failure is an increase in the execution time of the transaction, and as a result, the retention of the execution of the transaction also occurs. The failure analysis unit 17 outputs the determination result to the failure handling unit 18 as the analysis result of the cause of the delay failure. When the failure handling unit 18 receives the analysis result, the failure handling unit 18 executes Step S111.

ステップＳ１０９では、障害分析部１７は、管理表記憶部１１から対象サーバアプリケーション識別子に対応する直近受付回数および平均受付回数を取得し、その直近受付回数の平均受付回数に対する増加度合を算出する。例えば、障害分析部１７は、直近受付回数および平均受付回数の比率または差分を増加度合として算出する。 In step S109, the failure analysis unit 17 acquires the latest reception count and the average reception count corresponding to the target server application identifier from the management table storage unit 11, and calculates the degree of increase of the latest reception count with respect to the average reception count. For example, the failure analysis unit 17 calculates the ratio or difference between the latest reception count and the average reception count as the increase degree.

障害分析部１７は、その増加度合が基準値以上か否かを判断する。障害分析部１７は、その増加度合が基準値以上であると、ステップＳ１１０を実行し、その増加度合が基準値未満であると、処理を終了する。 The failure analysis unit 17 determines whether or not the increase degree is equal to or higher than a reference value. The failure analysis unit 17 executes Step S110 when the increase degree is equal to or greater than the reference value, and ends the process when the increase degree is less than the reference value.

In step S110, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by an increase in processing requests, and outputs this determination to the failure handling unit 18 as the analysis result of the cause of the delay failure. On receiving the analysis result, the failure handling unit 18 executes step S111.

In step S111, the failure handling unit 18 takes action against the transaction delay failure according to the analysis result of the failure analysis unit 17.
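
Taken together, steps S102 to S110 form a decision flow that can be sketched as follows. The enum `Cause`, the function `analyze`, and its parameter names are hypothetical. The `UNDETERMINED` result is a label chosen here for the case in which the description simply ends the process.

```python
from enum import Enum


class Cause(Enum):
    EXEC_TIME_INCREASE = "execution time increase"                            # step S104
    STALL = "retention caused by a transaction stall"                         # step S106
    EXEC_TIME_INCREASE_WITH_RETENTION = "execution time increase and retention"  # step S108
    REQUEST_INCREASE = "retention caused by an increase in requests"          # step S110
    UNDETERMINED = "no cause identified"                                      # process ends


def analyze(staying_number, has_free_thread, running_times, reference_time,
            exec_increase, set_value, request_increase, reference_value):
    # Steps S102/S103: empty queue and a free thread -> execution time increase.
    if staying_number <= 0 and has_free_thread:
        return Cause.EXEC_TIME_INCREASE
    # Step S105: any running time at or over the reference time -> stall.
    if any(t >= reference_time for t in running_times):
        return Cause.STALL
    # Step S107: latest average execution time has grown enough.
    if exec_increase is not None and exec_increase >= set_value:
        return Cause.EXEC_TIME_INCREASE_WITH_RETENTION
    # Step S109: latest reception count has grown enough.
    if request_increase is not None and request_increase >= reference_value:
        return Cause.REQUEST_INCREASE
    return Cause.UNDETERMINED
```

The order of the checks matters: a stall is tested before the execution time increase, and the request increase is considered only after both have been ruled out.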

例えば、障害対応部１８は、分析結果がトランザクションの実行時間の増加を示すと、運用管理者への通知や、サーバアプリケーションの処理能力増強、追加の情報採取などを行う。 For example, when the analysis result indicates an increase in the execution time of the transaction, the failure handling unit 18 performs notification to the operation manager, enhancement of the processing capacity of the server application, collection of additional information, and the like.

If the analysis result indicates retention of transaction execution caused by a transaction stall, the failure handling unit 18 increases the number of requests the server application can process simultaneously, collects additional information, forcibly terminates the stalled transaction, and so on.

また、障害対応部１８は、分析結果が処理要求の増加によって生じたトランザクションの実行の滞留を示すと、サーバアプリケーションの同時処理可能数の追加、追加の情報採取を行う。また、障害対応部１８は、待ち行列理論を利用して、最適なサーバアプリケーションの多重度（処理単位の数の上限および下限）を設定するなどしてもよい。 Further, the failure handling unit 18 adds the number of server applications that can be processed simultaneously and collects additional information when the analysis result indicates that the execution of the transaction caused by the increase in the processing requests has occurred. Further, the failure handling unit 18 may set an optimal multiplicity of server applications (upper limit and lower limit of the number of processing units) using queuing theory.
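
The description does not say how queuing theory is applied. As one hedged illustration, a lower limit on the multiplicity can be derived from the standard stability condition that the number of processing units must exceed the offered load (arrival rate multiplied by mean execution time). The function name `min_multiplicity` and its parameters are hypothetical; a real setting would add headroom and might use a fuller model.

```python
import math


def min_multiplicity(arrival_rate, mean_service_time):
    """Smallest number of processing units strictly above the offered load.

    arrival_rate: processing requests per unit time
    mean_service_time: average execution time of one transaction
    """
    if arrival_rate < 0 or mean_service_time < 0:
        raise ValueError("rates and times must be non-negative")
    offered_load = arrival_rate * mean_service_time
    # A queue only drains when capacity strictly exceeds the offered load.
    return math.floor(offered_load) + 1
```

For instance, 12 requests per second with a mean execution time of 0.5 seconds gives an offered load of 6, so at least 7 processing units would be needed under this model.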

なお、本実施形態において、サーバコンピュータ１は、複数のコンピュータにて構成されていてもよい。例えば、障害分析部１７がサーバコンピュータ１に接続された他のコンピュータにて構成され、障害分析装置として動作し、遅延障害の分析結果をサーバコンピュータ１の障害対応部１８に通知してもよい。同様に、他の各部（障害対応部１８など）も他のコンピュータにて構成されていてもよい。 In the present embodiment, the server computer 1 may be composed of a plurality of computers. For example, the failure analysis unit 17 may be configured by another computer connected to the server computer 1, operate as a failure analysis device, and notify the failure handling unit 18 of the server computer 1 of the analysis result of the delay failure. Similarly, other units (such as the failure handling unit 18) may be configured by other computers.

次に効果を説明する。 Next, the effect will be described.

本実施形態によれば、トランザクション制御部１５は、サーバアプリケーション部１４がトランザクションの実行を開始した実行開始時刻を計測する。リクエスト管理部１６１は、キュー制御部１３の待ち行列に滞留されている処理要求の滞留数を計測する。管理表記憶部１１は、実行開始時刻および滞留数を記憶する。障害分析部１７は、管理表記憶部１１に記憶された情報に基づいて、トランザクションの遅延障害の原因が、トランザクションの実行時間の増加か、それとも、トランザクションの実行の滞留かを分析する。 According to the present embodiment, the transaction control unit 15 measures the execution start time when the server application unit 14 starts executing the transaction. The request management unit 161 measures the staying number of processing requests staying in the queue of the queue control unit 13. The management table storage unit 11 stores the execution start time and the number of stays. Based on the information stored in the management table storage unit 11, the failure analysis unit 17 analyzes whether the cause of the transaction delay failure is an increase in the execution time of the transaction or a retention of execution of the transaction.

この場合、トランザクションの遅延障害の原因が、実行開始時刻および滞留数に基づいて、トランザクションの実行時間の増加か、それとも、トランザクションの実行の滞留かが分析される。 In this case, it is analyzed whether the cause of the transaction delay failure is an increase in the execution time of the transaction or a retention of the execution of the transaction based on the execution start time and the number of stays.

このため、遅延障害の原因を分析することが可能になる。 For this reason, it becomes possible to analyze the cause of the delay failure.

In the present embodiment, the state management unit 163 clears the execution start time stored in the management table storage unit 11 when the server application unit 14 finishes executing the transaction. When the staying number is 1 or more and the management table storage unit 11 contains an execution start time from which the time elapsed until the current time is equal to or greater than the reference time, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by a transaction stall.

この場合、トランザクションの実行の滞留の原因を分析することが可能になり、より詳細な遅延障害の分析が可能になる。 In this case, it becomes possible to analyze the cause of the stay of the execution of the transaction, and it becomes possible to analyze the delay failure in more detail.

In the present embodiment, when the staying number is 1 or more and the management table storage unit 11 contains an execution start time from which the time elapsed until the current time is equal to or greater than the reference time, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by a transaction stall in the processing unit identified by the process identifier corresponding to that execution start time.

この場合、トランザクションのストールが発生した処理単位まで、トランザクションの遅延障害の原因を分析することが可能になり、より詳細な遅延障害の分析が可能になる。 In this case, it is possible to analyze the cause of the delay failure of the transaction up to the processing unit in which the transaction stall occurs, and it becomes possible to analyze the delay failure in more detail.

In the present embodiment, the request management unit 161 measures the latest reception count and the average reception count. When the staying number is 1 or more and the degree of increase of the latest reception count relative to the average reception count is equal to or greater than the reference value, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution caused by an increase in processing requests.

The execution time management unit 162 measures the latest total time, the total total time, the latest number of executions, and the total number of executions. The failure analysis unit 17 divides the latest total time by the latest number of executions to calculate the latest average execution time, and divides the total total time by the total number of executions to calculate the total average execution time. When the staying number is 1 or more and the degree of increase of the latest average execution time relative to the total average execution time is equal to or greater than the set value, the failure analysis unit 17 determines that the cause of the transaction delay failure is an increase in execution time and that, as a result, retention of transaction execution has also occurred.

この場合、トランザクションの実行時間の増加と、その結果発生するトランザクションの実行の滞留も分析することが可能になり、より詳細な遅延障害の分析が可能になる。 In this case, it is possible to analyze the increase in the execution time of the transaction and the retention of the execution of the resulting transaction, and it becomes possible to analyze the delay failure in more detail.

In the present embodiment, when the staying number is 0, the failure analysis unit 17 determines that the cause of the transaction delay failure is an increase in execution time.

In this case, it is possible to determine that the cause of the transaction delay failure is an increase in execution time without setting a threshold for each process constituting the transaction.

また、本実施形態では、障害分析部１７は、リセット要求を受け付けると、管理表記憶部１１に記憶された情報をリセットする。 In the present embodiment, when the failure analysis unit 17 receives a reset request, the failure analysis unit 17 resets the information stored in the management table storage unit 11.

この場合、例えば、運用管理者がサーバコンピュータ１の稼動環境に変化があった場合にリセット要求を入力すれば、新しい環境に即した遅延障害の原因の分析を行うことが可能になる。 In this case, for example, if the operation manager inputs a reset request when there is a change in the operating environment of the server computer 1, it becomes possible to analyze the cause of the delay failure in accordance with the new environment.

In the present embodiment, the failure analysis unit 17 analyzes the cause of the transaction delay failure when it receives an analysis request.

この場合、任意のタイミングでトランザクションの遅延障害の原因を分析することが可能になる。 In this case, it is possible to analyze the cause of the transaction delay failure at an arbitrary timing.

また、本実施形態では、障害対応部１８は、障害分析部１７の分析結果に応じて、遅延障害に対する対処を行う。 In the present embodiment, the failure handling unit 18 copes with the delay failure according to the analysis result of the failure analysis unit 17.

この場合、遅延障害の運用管理者への通知や、遅延障害の復旧などを行うことが可能になる。 In this case, it becomes possible to notify the operation manager of the delay fault or to recover from the delay fault.

Next, a second embodiment will be described. This embodiment describes the simplest configuration example of the server computer 1 (transaction apparatus) according to the present invention. The server computer 1 is assumed to have only one server application and only one transaction type.

FIG. 9 is a block diagram showing the configuration of the server computer 1 of this embodiment. In FIG. 9, components having the same functions as those in FIG. 2 are given the same reference numerals.

In FIG. 9, the server computer 1 includes a management table storage unit 11, a request reception unit 12, a queue control unit 13, a server application unit 14, a transaction control unit 15, a request management unit 161, and a failure analysis unit 17.

リクエスト受付部１２は、トランザクションの処理要求を受け付ける。 The request receiving unit 12 receives a transaction processing request.

キュー制御部１３は、リクエスト受付部１２が受け付けた処理要求を待ち行列に保持する。 The queue control unit 13 holds the processing request received by the request receiving unit 12 in a queue.

サーバアプリケーション部１４は、トランザクションを新たに実行することが可能になると、キュー制御部１３の待ち行列に保持された処理要求に応じたトランザクションを実行する。 When the server application unit 14 can newly execute a transaction, the server application unit 14 executes the transaction according to the processing request held in the queue of the queue control unit 13.

トランザクション制御部１５は、サーバアプリケーション部１４がトランザクションの実行を開始した時刻を実行開始時刻として計測し、その実行開始時刻を管理表記憶部１１に記憶する。 The transaction control unit 15 measures the time when the server application unit 14 starts executing the transaction as the execution start time, and stores the execution start time in the management table storage unit 11.

リクエスト管理部１６１は、キュー制御部１３の待ち行列が保持している処理要求の滞留数を計測し、その滞留数を管理表記憶部１１に記憶する。 The request management unit 161 measures the staying number of processing requests held in the queue of the queue control unit 13 and stores the staying number in the management table storage unit 11.

Based on the execution start time and the staying number stored in the management table storage unit 11, the failure analysis unit 17 analyzes whether the cause of the transaction delay failure is an increase in the execution time of the transaction or retention of transaction execution.

For example, when the staying number is 0, the failure analysis unit 17 determines that the cause of the transaction delay failure is an increase in the execution time of the transaction. When the staying number is 1 or more and the management table storage unit 11 stores an execution start time from which the time elapsed until the current time is equal to or greater than the reference time, the failure analysis unit 17 determines that the cause of the transaction delay failure is retention of transaction execution.

次に効果を説明する。 Next, the effect will be described.

本実施形態でも、トランザクション制御部１５は、サーバアプリケーション部１４がトランザクションの実行を開始した実行開始時刻を計測する。リクエスト管理部１６１は、キュー制御部１３の待ち行列に滞留されている処理要求の滞留数を計測する。管理表記憶部１１は、実行開始時刻および滞留数を記憶する。障害分析部１７は、管理表記憶部１１に記憶された情報に基づいて、トランザクションの遅延障害の原因が、トランザクションの実行時間の増加か、それとも、トランザクションの実行の滞留かを分析する。 Also in this embodiment, the transaction control unit 15 measures the execution start time when the server application unit 14 starts executing the transaction. The request management unit 161 measures the staying number of processing requests staying in the queue of the queue control unit 13. The management table storage unit 11 stores the execution start time and the number of stays. Based on the information stored in the management table storage unit 11, the failure analysis unit 17 analyzes whether the cause of the transaction delay failure is an increase in the execution time of the transaction or a retention of execution of the transaction.

したがって、本実施形態でも、遅延障害の原因を分析することが可能になる。 Therefore, also in this embodiment, it becomes possible to analyze the cause of the delay failure.

以上説明した各実施形態において、図示した構成は単なる一例であって、本発明はその構成に限定されるものではない。 In each embodiment described above, the illustrated configuration is merely an example, and the present invention is not limited to the configuration.

FIG. 1 is a block diagram showing the transaction system of the first embodiment.
FIG. 2 is a block diagram showing the configuration of the server computer of the first embodiment.
FIG. 3 is an explanatory diagram showing an example of the transaction execution time management table.
FIG. 4 is an explanatory diagram showing an example of the request reception information management table.
FIG. 5 is an explanatory diagram showing an example of the server application state management table.
FIG. 6 is a block diagram showing a configuration example of the information management unit.
FIG. 7 is a sequence diagram for explaining the operation of the server computer 1 when a transaction processing request is received.
FIG. 8 is a sequence diagram for explaining the operation of the server computer 1 when analyzing the cause of a transaction delay failure.
FIG. 9 is a block diagram showing the configuration of the server computer of the second embodiment.

Explanation of symbols

1 Server computer (transaction apparatus)
2 Client terminal
3 Operation management terminal
11 Management table storage unit
12 Request reception unit
13 Queue control unit
14 Server application unit
15 Transaction control unit
16 Information management unit
17 Failure analysis unit
18 Failure handling unit

Claims

A transaction apparatus comprising:
receiving means for receiving a transaction processing request;
staying means for holding the processing request received by the receiving means in a queue;
execution means for executing, when it becomes possible to newly execute a transaction, a transaction according to the processing request held in the queue of the staying means;
execution control means for measuring an execution start time at which the execution means starts executing the transaction;
request management means for measuring the staying number of processing requests held in the queue of the staying means, a latest reception count, which is the number of times the receiving means received processing requests within the most recent predetermined time, and an average value per predetermined time of the total number of times the receiving means received processing requests before the most recent predetermined time;
information storage means for storing the execution start time measured by the execution control means, and the staying number, the latest reception count, and the average value measured by the request management means; and
analysis means for analyzing, based on the information stored in the information storage means, whether the cause of a transaction delay failure is an increase in the execution time of the transaction, an increase in processing requests, or a stall of the transaction,
wherein the analysis means determines that the cause of the transaction delay failure is an increase in processing requests when the staying number is 1 or more and the degree of increase of the latest reception count relative to the average value is equal to or greater than a reference value.

A transaction apparatus comprising:
receiving means for receiving a transaction processing request;
staying means for holding the processing request received by the receiving means in a queue;
execution means for executing, when it becomes possible to newly execute a transaction, a transaction according to the processing request held in the queue of the staying means;
execution control means for measuring an execution start time at which the execution means starts executing the transaction;
request management means for measuring the staying number of processing requests held in the queue of the staying means;
execution management means for measuring a latest total time, which is the total of the transaction execution times of the execution means within the latest unit time, a total total time, which is the total of the transaction execution times of the execution means before the latest unit time, a latest execution count, which is the number of times the execution means executed the transaction within the latest unit time, and a total execution count, which is the number of times the execution means executed the transaction before the latest unit time;
information storage means for storing the execution start time measured by the execution control means, the staying number measured by the request management means, and the latest total time, total total time, latest execution count, and total execution count measured by the execution management means; and
analysis means for analyzing, based on the information stored in the information storage means, whether the cause of a transaction delay failure is an increase in the execution time of the transaction, an increase in processing requests, or a stall of the transaction,
wherein the analysis means divides the latest total time by the latest execution count to calculate a latest average execution time, divides the total total time by the total execution count to calculate a total average execution time, and determines that the cause of the transaction delay failure is an increase in the execution time when the staying number is 1 or more and the degree of increase of the latest average execution time relative to the total average execution time is equal to or greater than a set value.

The transaction apparatus according to claim 1 or 2,
further comprising state management means for clearing the execution start time stored in the information storage means when the execution means finishes executing the transaction,
wherein the analysis means determines that the cause of the transaction delay failure is a stall of the transaction when the staying number is 1 or more and the information storage means stores an execution start time from which the time elapsed until the current time is equal to or greater than a reference time.

The transaction apparatus according to claim 3, wherein
the execution means has a plurality of processing units, each of which executes the transaction,
the state management means stores the execution start time of the transaction measured by the execution control means in the information storage means in association with a process identifier that identifies the processing unit that started executing the transaction, and
when there is an execution start time from which the time elapsed until the current time is equal to or greater than the reference time, the analysis means determines that the cause of the transaction delay failure is a stall of the transaction in the processing unit identified by the process identifier corresponding to that execution start time.

The transaction apparatus according to any one of claims 1 to 4,
wherein the analysis means determines that the cause of the transaction delay failure is an increase in the execution time when the staying number is 0.

The transaction apparatus according to any one of claims 1 to 5,
wherein the analysis means resets the information stored in the information storage means when it receives a reset request for the information storage means.

The transaction apparatus according to any one of claims 1 to 6,
wherein the analysis means analyzes the cause of the transaction delay failure when it receives an analysis request for analyzing the cause of the delay failure.

The transaction apparatus according to any one of claims 1 to 7,
further comprising failure handling means for taking action against the delay failure according to the analysis result of the analysis means.

A delay failure analysis apparatus for analyzing the cause of a delay failure of a transaction in a transaction apparatus that accepts a processing request for the transaction, holds the accepted processing request in a queue, and, when the transaction becomes newly executable, executes the transaction according to the processing request held in the queue, the delay failure analysis apparatus comprising:
Execution control means for measuring an execution start time at which the transaction apparatus starts executing the transaction;
Request management means for measuring a staying number of processing requests held in the queue of the transaction apparatus, a latest reception count, which is the number of times the transaction apparatus accepted the processing request within the most recent predetermined time, and an average value per predetermined time of a total reception count, which is the number of times the transaction apparatus accepted the processing request before the most recent predetermined time;
Information storage means for storing the execution start time measured by the execution control means, and the staying number, the latest reception count, and the average value measured by the request management means; and
Analysis means for analyzing, based on the information stored in the information storage means, whether the cause of the delay failure of the transaction is an increase in the execution time of the transaction, an increase in the processing requests, or a stall of the transaction,
Wherein, when the staying number is 1 or more and the degree of increase of the latest reception count with respect to the average value is equal to or greater than a reference value, the analysis means determines that the cause of the delay failure of the transaction is an increase in the processing requests.
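The request-increase determination in this claim can be sketched in Python as follows. The claim does not say how the "degree of increase" is computed. This sketch assumes it is the ratio of the latest reception count to the average value, and it assumes that a zero average with at least one recent request counts as an increase. The function and parameter names are hypothetical.

```python
def is_request_increase(staying_number, latest_count, average_count, reference_value):
    """Return True when the delay is attributed to an increase in processing requests."""
    if staying_number < 1:
        # The claim requires at least one request held in the queue.
        return False
    if average_count <= 0:
        # Assumption of this sketch: with no earlier receptions,
        # any recent reception is treated as an increase.
        return latest_count > 0
    # Assumption of this sketch: "degree of increase" is a simple ratio.
    return latest_count / average_count >= reference_value


print(is_request_increase(5, 300, 100.0, 2.0))  # True: 300 / 100 = 3.0 >= 2.0
print(is_request_increase(0, 300, 100.0, 2.0))  # False: the queue is empty
```

A difference-based measure (latest count minus average) would fit the claim wording equally well. The ratio is used here only because it is independent of the absolute request volume.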

A delay failure analysis apparatus for analyzing the cause of a delay failure of a transaction in a transaction apparatus that accepts a processing request for the transaction, holds the accepted processing request in a queue, and, when the transaction becomes newly executable, executes the transaction according to the processing request held in the queue, the delay failure analysis apparatus comprising:
Execution control means for measuring an execution start time at which the transaction apparatus starts executing the transaction;
Request management means for measuring a staying number of processing requests held in the queue of the transaction apparatus;
Execution management means for measuring a latest total time, which is the total of the execution times of the transactions executed by the transaction apparatus within the most recent unit time, a total total time, which is the total of the execution times of the transactions executed by the transaction apparatus before the most recent unit time, a latest execution count, which is the number of times the transaction apparatus executed the transaction within the most recent unit time, and a total execution count, which is the number of times the transaction apparatus executed the transaction before the most recent unit time;
Information storage means for storing the execution start time measured by the execution control means, the staying number measured by the request management means, and the latest total time, the total total time, the latest execution count, and the total execution count measured by the execution management means; and
Analysis means for analyzing, based on the information stored in the information storage means, whether the cause of the delay failure of the transaction is an increase in the execution time of the transaction, an increase in the processing requests, or a stall of the transaction,
Wherein the analysis means divides the latest total time by the latest execution count to calculate a latest average execution time, divides the total total time by the total execution count to calculate a total average execution time, and, when the staying number is 1 or more and the degree of increase of the latest average execution time with respect to the total average execution time is equal to or greater than a set value, determines that the cause of the delay failure of the transaction is an increase in the execution time.
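The two divisions and the comparison in this claim can be sketched in Python as follows. The sketch assumes that the "degree of increase" is the ratio of the latest average execution time to the total average execution time, and that the determination is simply not made when either execution count is zero. The claim specifies neither point, and all names are hypothetical.

```python
def is_execution_time_increase(staying_number, latest_total_time, latest_exec_count,
                               total_total_time, total_exec_count, set_value):
    """Return True when the delay is attributed to an increase in execution time."""
    if staying_number < 1:
        return False
    if latest_exec_count == 0 or total_exec_count == 0:
        # Assumption of this sketch: no average can be formed, so no determination.
        return False
    latest_average = latest_total_time / latest_exec_count
    total_average = total_total_time / total_exec_count
    if total_average == 0:
        return False
    # Assumption of this sketch: "degree of increase" is a simple ratio.
    return latest_average / total_average >= set_value


# Latest average 50.0 / 10 = 5.0, total average 200.0 / 100 = 2.0, ratio 2.5.
print(is_execution_time_increase(3, 50.0, 10, 200.0, 100, 2.0))  # True
```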

The delay failure analysis apparatus according to claim 10,
Further comprising state management means for clearing the execution start time stored in the information storage means when the transaction apparatus finishes executing the transaction,
Wherein, when the staying number is 1 or more and the information storage means stores an execution start time for which the time elapsed up to the current time is equal to or greater than a reference time, the analysis means determines that the cause of the delay failure of the transaction is a stall of the transaction.

A delay failure analysis method comprising:
Accepting a processing request for a transaction;
Holding the accepted processing request in a queue;
Executing, when the transaction becomes newly executable, the transaction according to the processing request held in the queue;
Measuring an execution start time at which execution of the transaction was started;
Measuring a staying number of processing requests held in the queue;
Measuring a latest reception count, which is the number of times the processing request was accepted within the most recent predetermined time;
Measuring an average value per predetermined time of a total reception count, which is the number of times the processing request was accepted before the most recent predetermined time;
Storing the measured execution start time, staying number, latest reception count, and average value; and
Analyzing, based on the stored information, whether the cause of the delay failure of the transaction is an increase in the execution time of the transaction, an increase in the processing requests, or a stall of the transaction,
Wherein, in the analyzing, when the staying number is 1 or more and the degree of increase of the latest reception count with respect to the average value is equal to or greater than a reference value, the cause of the delay failure of the transaction is determined to be an increase in the processing requests.
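The two reception measurements in this method (the latest reception count and the average value per predetermined time) can be sketched in Python as follows. The sketch assumes that reception timestamps are available as a list of seconds, that measurement began at a known `start` time, and that the average is the earlier reception count divided by the number of predetermined-time windows elapsed before the most recent one. None of these details is given in the claim, and all names are hypothetical.

```python
def reception_stats(timestamps, start, now, window):
    """Return (latest reception count, average receptions per window before the latest window)."""
    boundary = now - window  # start of the most recent predetermined time
    latest = sum(1 for t in timestamps if boundary < t <= now)
    earlier = sum(1 for t in timestamps if start <= t <= boundary)
    # Number of whole or fractional windows that elapsed before the latest one.
    elapsed_windows = (boundary - start) / window
    average = earlier / elapsed_windows if elapsed_windows > 0 else 0.0
    return latest, average


# Hypothetical data: one request per 10 s window at first, then four in the last window.
timestamps = [5, 15, 25, 35, 41, 42, 43, 44]
latest, average = reception_stats(timestamps, start=0, now=50, window=10)
print(latest, average)  # 4 1.0
```

With these values the latest reception count is four times the average, which would satisfy the determination condition for any reference value up to 4 under a ratio-based reading of "degree of increase".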

A delay failure analysis method comprising:
Accepting a processing request for a transaction;
Holding the accepted processing request in a queue;
Executing, when the transaction becomes newly executable, the transaction according to the processing request held in the queue;
Measuring an execution start time at which execution of the transaction was started;
Measuring a staying number of processing requests held in the queue;
Measuring a latest total time, which is the total of the execution times of the transactions executed within the most recent unit time;
Measuring a total total time, which is the total of the execution times of the transactions executed before the most recent unit time;
Measuring a latest execution count, which is the number of times the transaction was executed within the most recent unit time;
Measuring a total execution count, which is the number of times the transaction was executed before the most recent unit time;
Storing the measured execution start time, staying number, latest total time, total total time, latest execution count, and total execution count; and
Analyzing, based on the stored information, whether the cause of the delay failure of the transaction is an increase in the execution time of the transaction, an increase in the processing requests, or a stall of the transaction,
Wherein, in the analyzing, the latest total time is divided by the latest execution count to calculate a latest average execution time, the total total time is divided by the total execution count to calculate a total average execution time, and, when the staying number is 1 or more and the degree of increase of the latest average execution time with respect to the total average execution time is equal to or greater than a set value, the cause of the delay failure of the transaction is determined to be an increase in the execution time.

The delay failure analysis method according to claim 13, further comprising:
Clearing the stored execution start time when the execution of the transaction is finished; and
Determining that the cause of the delay failure of the transaction is a stall of the transaction when the staying number is 1 or more and an execution start time for which the time elapsed up to the current time is equal to or greater than a reference time is stored.

A program for causing a computer to analyze a delay failure of a transaction, the program causing the computer to execute:
A procedure for accepting a processing request for a transaction;
A procedure for holding the accepted processing request in a queue;
A procedure for executing, when the transaction becomes newly executable, the transaction according to the processing request held in the queue;
A procedure for measuring an execution start time at which execution of the transaction was started;
A procedure for measuring a staying number of processing requests held in the queue;
A procedure for measuring a latest reception count, which is the number of times the processing request was accepted within the most recent predetermined time;
A procedure for measuring an average value per predetermined time of a total reception count, which is the number of times the processing request was accepted before the most recent predetermined time;
A procedure for storing the measured execution start time, staying number, latest reception count, and average value; and
A procedure for analyzing, based on the stored information, whether the cause of the delay failure of the transaction is an increase in the execution time of the transaction, an increase in the processing requests, or a stall of the transaction,
Wherein, in the analyzing procedure, when the staying number is 1 or more and the degree of increase of the latest reception count with respect to the average value is equal to or greater than a reference value, the cause of the delay failure of the transaction is determined to be an increase in the processing requests.

A program for causing a computer to analyze a delay failure of a transaction, the program causing the computer to execute:
A procedure for accepting a processing request for a transaction;
A procedure for holding the accepted processing request in a queue;
A procedure for executing, when the transaction becomes newly executable, the transaction according to the processing request held in the queue;
A procedure for measuring an execution start time at which execution of the transaction was started;
A procedure for measuring a staying number of processing requests held in the queue;
A procedure for measuring a latest total time, which is the total of the execution times of the transactions executed within the most recent unit time;
A procedure for measuring a total total time, which is the total of the execution times of the transactions executed before the most recent unit time;
A procedure for measuring a latest execution count, which is the number of times the transaction was executed within the most recent unit time;
A procedure for measuring a total execution count, which is the number of times the transaction was executed before the most recent unit time;
A procedure for storing the measured execution start time, staying number, latest total time, total total time, latest execution count, and total execution count; and
A procedure for analyzing, based on the stored information, whether the cause of the delay failure of the transaction is an increase in the execution time of the transaction, an increase in the processing requests, or a stall of the transaction,
Wherein, in the analyzing procedure, the latest total time is divided by the latest execution count to calculate a latest average execution time, the total total time is divided by the total execution count to calculate a total average execution time, and, when the staying number is 1 or more and the degree of increase of the latest average execution time with respect to the total average execution time is equal to or greater than a set value, the cause of the delay failure of the transaction is determined to be an increase in the execution time.

The program according to claim 16, wherein
The program further causes the computer to execute a procedure for clearing the stored execution start time when the execution of the transaction is finished, and
In the analyzing procedure, when the staying number is 1 or more and an execution start time for which the time elapsed up to the current time is equal to or greater than a reference time is stored, the cause of the delay failure of the transaction is determined to be a stall of the transaction.