JP2006331135A

JP2006331135A - Performance prediction device, performance prediction method and performance prediction program for cluster system

Info

Publication number: JP2006331135A
Application number: JP2005154751A
Authority: JP
Inventors: Morio Sasaki; 盛朗佐々木; Atsuhiro Tanaka; 淳裕田中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2005-05-26
Filing date: 2005-05-26
Publication date: 2006-12-07
Anticipated expiration: 2025-05-26
Also published as: JP4392665B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a cluster system performance prediction device capable of predicting clustering overhead generated in a cluster system sharing data present on a main storage device of each node via a communication line. <P>SOLUTION: In this performance prediction device for the cluster system sharing the data on the main storage device of the plurality of nodes connected via the communication line between the nodes, a frequency of a block copy per unit time performed between the nodes with a memory block inside the main storage device as a unit is calculated, and the clustering overhead generated in the cluster system is calculated on the basis of information about a CPU load per block copy and the block copy frequency. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、クラスタシステムの性能予測装置に関し、特に、各ノードの主記憶装置上にあるデータをネットワーク経由で共有するクラスタシステムで発生するクラスタリング・オーバーヘッドを予測することのできるクラスタシステムの性能予測装置、性能予測方法及び性能予測プログラムに関する。 The present invention relates to a performance prediction device for a cluster system, and more particularly to a performance prediction device for a cluster system capable of predicting a clustering overhead generated in a cluster system that shares data on the main storage device of each node via a network. The invention relates to a performance prediction method and a performance prediction program.

近年、Webシステムの普及に伴い、サーバ用途で用いられる計算機システムが増加している。このような計算機システムでは、Webシステムの処理能力を向上させることを目的として、複数の計算機を使用したクラスタシステムが使用される。このため、クラスタシステムの性能設計の重要性が増している。 In recent years, with the spread of Web systems, computer systems used for server applications are increasing. In such a computer system, a cluster system using a plurality of computers is used for the purpose of improving the processing capacity of the Web system. For this reason, the importance of the performance design of the cluster system is increasing.

クラスタシステムは単純に考えると計算機の数に見合った性能向上が実現されるように思われるが、実際には単一ノードのシステムでは発生しないクラスタリング・オーバーヘッドが生じるため、クラスタシステムの性能はクラスタサイズに比例して向上するものではない。ここで、クラスタリング・オーバーヘッドは、ノードを複数にしたことが原因となって発生するオーバーヘッドを表す。以下、スケーラビリティの用語によりクラスタサイズに対する性能増加率を表すこととする。クラスタサイズは、クラスタシステムを構成するノードの数を表す。 Although the cluster system seems to achieve a performance improvement commensurate with the number of computers when considered simply, there is actually a clustering overhead that does not occur in a single-node system. It does not improve in proportion to Here, the clustering overhead represents an overhead generated due to a plurality of nodes. Hereinafter, the performance increase rate with respect to the cluster size will be expressed in terms of scalability. The cluster size represents the number of nodes constituting the cluster system.

クラスタシステムのスケーラビリティが低い場合には、その原因がクラスタリング・オーバーヘッドによるのか又は他の要因によるのかを判定したい、という要求が生じる。なぜなら、クラスタリング・オーバーヘッドが主原因であれば、スケールアウトによる性能向上を狙うよりも、ＣＰＵの高速化等ハードウェア単体の性能強化によるスケールアップを狙うべきであると判断できるからである。このように、クラスタリング・オーバーヘッドの予測は、クラスタシステムの性能設計を行う上で重要となる。 When the scalability of the cluster system is low, there is a demand to determine whether the cause is due to clustering overhead or other factors. This is because if the main cause is clustering overhead, it can be determined that rather than aiming to improve performance by scaling out, it should be aimed to scale up by enhancing the performance of a single hardware such as increasing the CPU speed. Thus, the prediction of clustering overhead is important in designing the performance of a cluster system.

ノードの主記憶上にあるデータをネットワーク経由で共有するクラスタシステムにおいては、データ共有のためのブロックコピーがクラスタリング・オーバーヘッドの主要な原因となっていることがわかっている。 In a cluster system that shares data on the main memory of a node via a network, it is known that block copying for data sharing is a major cause of clustering overhead.

従来の計算機システムでの性能予測の方法として、次のような方法が提案されている。 The following methods have been proposed as performance prediction methods in conventional computer systems.

特許文献１に開示される方法は、計算機システムの性能予測プログラムに関するものである。この性能予測プログラムでは、部品プログラムの性能とこれらの結合情報を入力パラメータとして、部品プログラムの実行時間を追跡するシミュレーションを行うことで計算機システムの性能を予測する。性能予測プログラムは、単一ノードからなるシステム上で、特定のアプリケーションとワークロードを対象とした性能予測を行うものである。 The method disclosed in Patent Document 1 relates to a computer system performance prediction program. In this performance prediction program, the performance of the computer system is predicted by performing a simulation for tracking the execution time of the component program using the performance of the component program and the combination information thereof as input parameters. The performance prediction program performs performance prediction for a specific application and workload on a system composed of a single node.

特許文献２の方法は、Webシステムの性能予測方法に関するものである。この性能予測方法では、Webシステムの構成に関する情報であるモデルパターン情報、Webシステムの特性に関する情報、及びWebシステムが提供する機能に関する情報を予め登録した上で、Webシステムが提供する機能名（例えば文書管理）、優先項目（価格、性能）、利用者数に関する情報を入力すると、構成、機能、サーバ機種、価格、性能、サーバ負荷を、性能の異なるサーバ機種毎に出力する。なお、性能予測は、過去の性能情報に基づき行われる。したがって、この方法は過去の性能情報が入手可能なクラスタシステムの性能予測に適用することができる。 The method of Patent Document 2 relates to a Web system performance prediction method. In this performance prediction method, model pattern information that is information related to the configuration of the Web system, information related to the characteristics of the Web system, and information related to functions provided by the Web system are registered in advance, and the function name provided by the Web system (for example, When information on document management), priority items (price, performance), and the number of users is input, the configuration, function, server model, price, performance, and server load are output for each server model with different performance. Note that the performance prediction is performed based on past performance information. Therefore, this method can be applied to the performance prediction of a cluster system for which past performance information is available.

特許文献３の方法は、システム・コンポーネント接続モデルとデータの経路を示すワークフローを分割することにより、１つのシステム・コンポーネント接続モデルで多様なワークフローを実現する方法に関するものである。特許文献３の方法では、最初に対象とするシステムのシステム・コンポーネント、ワークロード及びワークフローを入力する。システム・コンポーネントは、マスタ・コンポーネントとスレーブ・コンポーネントにより構成され、マスタ・コンポーネントから発生するデータ量がワークロードとして入力される。 The method of Patent Document 3 relates to a method for realizing various workflows with one system component connection model by dividing a system component connection model and a workflow indicating a data path. In the method of Patent Document 3, first, system components, a workload, and a workflow of a target system are input. The system component includes a master component and a slave component, and the amount of data generated from the master component is input as a workload.

特許文献４の方法は、並列化のための負荷分散量を数値で確認できると共に、並列実行プログラムの並列化チューニングの支援を行うことのできる並列実行プログラムの実行回数解析方法に関するものである。この実行回数解析方法では、解析対象プログラムへプログラムの実行回数を計数するカウント命令を挿入する。カウント命令により測定されるプログラムの実行文の総実行回数を計算し、並列度数を変化させた場合の性能向上率を計算する。次に、これらの結果を用いて、並列度数の変化に対する特定ループの実行文実行回数、ループの平均回転数及び性能向上率の見積り値を表示する。 The method of Patent Document 4 relates to a method for analyzing the number of execution times of a parallel execution program that can confirm the load distribution amount for parallelization numerically and can support parallelization tuning of the parallel execution program. In this execution frequency analysis method, a count instruction for counting the number of program executions is inserted into the analysis target program. Calculate the total number of executions of the executable statement of the program measured by the count instruction, and calculate the performance improvement rate when the parallel degree is changed. Next, using these results, the execution number of execution statements of the specific loop, the average rotation speed of the loop, and the estimated value of the performance improvement rate with respect to the change in the parallel degree are displayed.

非特許文献１には、従来のクラスタシステムの性能モデルの一例、LogPモデルが記載されている。LogPモデルは、クラスタシステムのボトルネックの発見や並列アルゴリズムの解析に用いられ、通信遅延の上限(Latency)、１つのメッセージの送信または受信によって消費されるプロセッサ時間(overhead)、メッセージの最小送信（受信）間隔(gap)、プロセッサ数(Processors)の４つを入力パラメータとして性能を予測する。LogPモデルは、例えば高速フーリエ変換アルゴリズムを実行したときのボトルネックの発見や、より効率的な高速フーリエ変換アルゴリズムの設計に用いられる。 Non-Patent Document 1 describes an example of a performance model of a conventional cluster system, a LogP model. The LogP model is used for finding bottlenecks in cluster systems and analyzing parallel algorithms. The upper limit of communication delay (Latency), the processor time consumed by sending or receiving one message (overhead), and the minimum message sending ( The performance is predicted using four parameters, ie, (reception) interval (gap) and the number of processors (Processors). The LogP model is used, for example, for finding a bottleneck when a fast Fourier transform algorithm is executed and for designing a more efficient fast Fourier transform algorithm.

非特許文献２には、非特許文献１を拡張したモデルの一例、LogGPSモデルが記載されている。LogGPSモデルは、LogPモデルで考慮している入力パラメータに加え、長いメッセージの送信とその送信中に発生する同期に関する入力パラメータを含む。このように、入力パラメータを追加することにより、非特許文献１よりも高い精度で性能を予測できる。 Non-Patent Document 2 describes an example of a model obtained by extending Non-Patent Document 1, a LogGPS model. In addition to the input parameters considered in the LogP model, the LogGPS model includes input parameters related to the transmission of long messages and the synchronization that occurs during the transmission. Thus, by adding input parameters, performance can be predicted with higher accuracy than in Non-Patent Document 1.

非特許文献１と非特許文献２に記載の技術は、アルゴリズムから計算量と通信量が算出できるアプリケーションが動作するクラスタシステムに対して適用できる。
特開２００４−２７２５８２号公報特開２００４−４６７３４号公報特開２００４−１４９３７号公報特開２００４−１０２５９４号公報 David Culler, Richard Karp, David Patterson, Abhijit Sahay, Klaus Erik Schauser, Eunice Santos, Ramesh Subramonian, and Thorsten von Eicken. LogP: Towards a Realistic Model of Parallel Computation. In Proceedings of the Fourth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 1--12, May, 1993. 伊野文彦、藤本典幸、萩原兼一。LogGPS：メッセージ通信プロトコルの切替えを考慮した高水準通信ライブラリ向けの並列計算モデル。情報処理学会論文誌：ハイパフォーマンスコンピューティングシステム、Vol.42, No.SIG09。 The techniques described in Non-Patent Document 1 and Non-Patent Document 2 can be applied to a cluster system in which an application that can calculate the calculation amount and the communication amount from an algorithm operates.
JP 2004-272582 A JP 2004-46734 A JP 2004-14937 A JP 2004-102594 A David Culler, Richard Karp, David Patterson, Abhijit Sahay, Klaus Erik Schauser, Eunice Santos, Ramesh Subramonian, and Thorsten von Eicken.LogP: Towards a Realistic Model of Parallel Computation.In Proceedings of the Fourth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 1--12, May, 1993. Fumihiko Ino, Noriyuki Fujimoto, Kenichi Sugawara. LogGPS: A parallel computing model for high-level communication libraries that considers switching message communication protocols. IPSJ Transactions: High Performance Computing System, Vol.42, No.SIG09.

上述した従来の技術は、いずれも以下に述べるような問題点があった。 All of the conventional techniques described above have the following problems.

特許文献１の方法における性能予測プログラムは、単一ノードからなるシステムに関するものであり、ここで対象とする複数ノードからなるクラスタシステムへ適用できるものではない。 The performance prediction program in the method of Patent Document 1 relates to a system composed of a single node, and is not applicable to a cluster system composed of a plurality of nodes.

特許文献２の方法は、過去の性能情報が入手可能なクラスタシステムの性能予測に適用できる。しかしながら、本発明の対象とするクラスタリング・オーバーヘッドの予測はできない。 The method of Patent Document 2 can be applied to performance prediction of a cluster system for which past performance information is available. However, the clustering overhead targeted by the present invention cannot be predicted.

特許文献３の方法は、１つのシステム・コンポーネント接続モデルで多様なワークフローを実現する方法であって、本発明の対象とするクラスタリング・オーバーヘッドを予測する方法ではない。 The method of Patent Document 3 is a method for realizing various workflows with one system component connection model, and is not a method for predicting the clustering overhead targeted by the present invention.

特許文献４の方法は、並列化のための負荷分散量を数値で確認する方法であるが、本発明の対象とするクラスタリング・オーバーヘッドを予測する方法ではない。 The method of Patent Document 4 is a method of confirming the load distribution amount for parallelization with a numerical value, but is not a method of predicting the clustering overhead targeted by the present invention.

非特許文献１と非特許文献２に記載の技術は、アルゴリズムから計算量と通信量を算出できるアプリケーションが動作するクラスタシステムの性能を予測できる。しかしながら、アルゴリズムから計算量と通信量を算出することのできない、ビジネス向けアプリケーションが動作するクラスタシステムの場合にはクラスタリング・オーバーヘッドを予測することはできない。 The technologies described in Non-Patent Document 1 and Non-Patent Document 2 can predict the performance of a cluster system in which an application that can calculate the calculation amount and the communication amount from an algorithm operates. However, the clustering overhead cannot be predicted in the case of a cluster system in which a business application operates and the calculation amount and communication amount cannot be calculated from the algorithm.

本発明の目的は、上記従来技術の欠点を解決し、各ノードの主記憶上にあるデータを通信回線経由で共有するクラスタシステムで発生するクラスタリング・オーバーヘッドを予測することのできるクラスタシステムの性能予測装置、性能予測方法及び性能予測プログラムを提供することにある。 An object of the present invention is to solve the above-described drawbacks of the prior art and to predict the performance of a cluster system that can predict the clustering overhead that occurs in a cluster system that shares data on the main memory of each node via a communication line. An apparatus, a performance prediction method, and a performance prediction program are provided.

上記目的を達成するための本発明は、通信回線を介して接続される複数のノードの主記憶装置上のデータを前記ノード間で互いに共有するクラスタシステムの性能予測装置であって、前記主記憶装置内のメモリーブロックを単位として前記ノード間でなされるブロックコピーの回数を算出し、単位時間当たりの前記ブロックコピー回数と、前記ブロックコピー当たりのＣＰＵ負荷の情報に基づき、前記ブロックコピーが原因で生じる、前記クラスタシステムのクラスタリング・オーバーヘッドを予測することを特徴とする。 In order to achieve the above object, the present invention provides a performance prediction device for a cluster system in which data on a main storage device of a plurality of nodes connected via a communication line is shared between the nodes. Calculate the number of block copies made between the nodes in units of memory blocks in the device, and based on the block copy number per unit time and the CPU load information per block copy, due to the block copy The resulting clustering overhead of the cluster system is predicted.

本発明では、主記憶上のデータをノード間で互いに共有するクラスタシステムにおいては、データ共有のためのブロックコピーがクラスタリング・オーバーヘッドの主要な原因となっている点に着目した。 The present invention focuses on the fact that block copying for data sharing is a major cause of clustering overhead in a cluster system in which data on the main memory is shared among nodes.

本発明では、最初に、コピーコスト情報、ワークロード特性情報及び負荷分散特性情報をクラスタシステム性能予測装置に対して入力する。 In the present invention, first, copy cost information, workload characteristic information, and load distribution characteristic information are input to the cluster system performance prediction apparatus.

クラスタシステム性能予測装置では、これらの情報からコピーコスト、ワークロード特性テーブル、負荷分散特性テーブルを生成する。 The cluster system performance prediction apparatus generates a copy cost, a workload characteristic table, and a load distribution characteristic table from these pieces of information.

コピーコスト情報は前記ブロックコピーが１回なされる際のＣＰＵ負荷の情報である。 Copy cost information is CPU load information when the block copy is performed once.

ワークロード特性テーブルは各メモリーブロック集合における各ウィンドウサイズの出現回数の情報を含むテーブルである。ここで、ウィンドウサイズは、特定の種類のアクセスがメモリーブロック集合に対してなされてから次に同種のアクセスがなされる迄の一連のアクセスの集合であるウィンドウにおける、一連のアクセスの出現回数である。 The workload characteristic table is a table including information on the number of appearances of each window size in each memory block set. Here, the window size is the number of occurrences of a series of accesses in a window which is a series of accesses from the time when a specific type of access is made to the memory block set until the next type of access is made. .

また、負荷分散特性テーブルは、各ノードの部分ブロック集合へのアクセスの到着確率の情報を含むテーブルである。 The load distribution characteristic table is a table including information on the arrival probability of access to the partial block set of each node.

次に、クラスタシステム性能予測装置は、クラスタシステムのノード数情報とウィンドウサイズ情報と負荷分散特性テーブル情報に基づき、ウィンドウサイズを与えた場合に、１つのウィンドウ中にアクセスが到着するノードの数の期待値である到着ノード数を算出する。 Next, the cluster system performance prediction device determines the number of nodes that access arrives in one window when the window size is given based on the node number information, window size information, and load distribution characteristic table information of the cluster system. The number of arrival nodes that is the expected value is calculated.

次に、ウィンドウ内のアクセスが書込みのアクセスの後、読出しのアクセスが複数回続くことを想定し、到着ノード数からウィンドウサイズを与えた場合のブロックコピー回数を算出する。 Next, assuming that the access in the window is a write access followed by a read access a plurality of times, the number of block copies when the window size is given from the number of arrival nodes is calculated.

次に、ワークロード特性テーブルとウィンドウサイズを与えた場合のブロックコピー回数から、クラスタシステム全体のブロックコピー回数を算出する。 Next, the block copy count of the entire cluster system is calculated from the workload copy table and the block copy count given the window size.

最後に、クラスタシステム全体のブロックコピー回数とコピーコストから、クラスタリング・オーバーヘッドを算出する。 Finally, the clustering overhead is calculated from the block copy count and copy cost of the entire cluster system.

本発明によれば、クラスタシステムで発生するクラスタリング・オーバーヘッドを予測することができる。 According to the present invention, the clustering overhead generated in the cluster system can be predicted.

その理由は、クラスタシステムのワークロード特性、負荷分散特性及びコピーコストに基づき、クラスタリング・オーバーヘッドの主要因であるブロックコピーにより発生するオーバーヘッドを算出するためである。 The reason is to calculate the overhead generated by block copy, which is the main factor of clustering overhead, based on the workload characteristics, load distribution characteristics and copy cost of the cluster system.

以下、本発明を実施するための最良の形態について説明する。 Hereinafter, the best mode for carrying out the present invention will be described.

（第１の実施の形態）
以下、本発明の第１の実施の形態について図面を参照して説明する。 (First embodiment)
Hereinafter, a first embodiment of the present invention will be described with reference to the drawings.

図１は、本実施の形態によるクラスタシステムの性能予測装置１０の構成を示すブロック図である。 FIG. 1 is a block diagram illustrating a configuration of a performance prediction apparatus 10 of a cluster system according to the present embodiment.

図１を参照すると、本実施の形態によるクラスタシステムの性能予測装置１０は、入力手段１１０と、コピーコスト生成手段１２０と、ワークロード特性生成手段１３０と、負荷分散特性生成手段１４０と、到着ノード数算出手段１５０と、オーバーヘッド予測手段１６０と、出力手段１７０を備える。 Referring to FIG. 1, the cluster system performance prediction apparatus 10 according to the present exemplary embodiment includes an input unit 110, a copy cost generation unit 120, a workload characteristic generation unit 130, a load distribution characteristic generation unit 140, an arrival node. Number calculation means 150, overhead prediction means 160, and output means 170 are provided.

クラスタシステムの性能予測装置１０の各手段を説明する前に、本実施の形態の対象とするクラスタシステム４０について説明する。 Before describing each means of the performance prediction device 10 of the cluster system, the cluster system 40 that is a target of the present embodiment will be described.

図２は、本実施の形態の対象とするクラスタシステム４０の一例を示す図である。 FIG. 2 is a diagram illustrating an example of the cluster system 40 that is a target of the present embodiment.

図２を参照すると、本実施の形態の対象とするクラスタシステム４０は、通信回線を介して接続された２つのノードにより構成される。各ノードは、それぞれＣＰＵ（Central Processing Unit）及び主記憶装置を備える。 Referring to FIG. 2, a cluster system 40 that is a target of the present embodiment includes two nodes connected via a communication line. Each node includes a CPU (Central Processing Unit) and a main storage device.

クラスタシステム４０には、通信回線を介してクライアントが接続されている。クライアントは、クラスタシステム４０に対して情報を書き込むための「write」アクセス（書込みのアクセス）あるいは読み出すための「read」アクセス（読出しのアクセス）を行う。クラスタシステム４０は、空いているノードにクライアントを接続する。 A client is connected to the cluster system 40 via a communication line. The client performs “write” access (write access) for writing information to the cluster system 40 or “read” access (read access) for reading. The cluster system 40 connects a client to an available node.

クライアントから「read」アクセスがあり、クライアントの読み出す主記憶装置内の部分ブロック集合が最新でないことをアクセス先のノードで検知すると、別のノードから最新の部分ブロック集合のデータをコピーするブロックコピーが発生する。なお、アクセス先のノードでクライアントの読み出すブロックが最新であることを検知した場合にはブロックコピーは起こらない。部分ブロック集合はクライアントのアクセスするブロックであるが、詳細は後述する。 When there is a “read” access from the client and the access destination node detects that the partial block set in the main storage device read by the client is not the latest, a block copy that copies the data of the latest partial block set from another node appear. If it is detected that the block read by the client is the latest in the access destination node, block copy does not occur. The partial block set is a block accessed by the client, and details will be described later.

このように、ノード間では、主記憶装置間で部分ブロック集合単位のブロックコピーがなされることによりＣＰＵが消費され、クラスタリング・オーバーヘッドが発生する。クラスタリング・オーバーヘッドは複数の原因により発生するが、主要な原因はデータ共有のためのブロックコピーにより発生するＣＰＵの負荷であることがわかっている。なお、ノード数は３個以上であっても同様である。 As described above, between nodes, the CPU is consumed by performing block copy in units of partial block sets between main storage devices, and clustering overhead occurs. Although the clustering overhead is caused by a plurality of causes, it is known that the main cause is a CPU load caused by block copy for data sharing. The same applies even if the number of nodes is three or more.

次に、クラスタシステムの性能予測装置１０の各手段を説明する。 Next, each unit of the cluster system performance prediction apparatus 10 will be described.

入力手段１１０は、性能予測の対象とするクラスタシステム４０の情報を入力する手段であって、クラスタシステムの性能予測装置１０にオーバーヘッド基礎情報２０を入力する機能を有する。オーバーヘッド基礎情報２０には、コピーコスト情報２１０、ワークロード特性情報２２０及び負荷分散特性情報２３０が含まれる。 The input unit 110 is a unit for inputting information of the cluster system 40 that is a target of performance prediction, and has a function of inputting the overhead basic information 20 to the performance prediction apparatus 10 of the cluster system. The overhead basic information 20 includes copy cost information 210, workload characteristic information 220, and load distribution characteristic information 230.

コピーコスト生成手段１２０は、コピーコスト情報２１０をオーバーヘッド基礎情報２０から読出し、コピーコストテーブル３１を生成する機能を有する。 The copy cost generation unit 120 has a function of reading the copy cost information 210 from the overhead basic information 20 and generating the copy cost table 31.

ここで、コピーコストはブロックコピーが１回発生したときの１つのＣＰＵ負荷である。ＣＰＵ負荷は、例えば単位時間あたりのＣＰＵ実行時間で表す。 Here, the copy cost is one CPU load when a block copy occurs once. The CPU load is represented by CPU execution time per unit time, for example.

ワークロード特性生成手段１３０は、オーバーヘッド基礎情報２０に含まれるワークロード特性情報２２０を読出し、ワークロード特性テーブル３２を生成する機能を有する。 The workload characteristic generation unit 130 has a function of reading the workload characteristic information 220 included in the overhead basic information 20 and generating the workload characteristic table 32.

負荷分散特性生成手段１４０は、オーバーヘッド基礎情報２０に含まれる負荷分散特性情報２３０を読出し、負荷分散特性テーブル３３を生成する機能を有する。 The load distribution characteristic generation unit 140 has a function of reading the load distribution characteristic information 230 included in the overhead basic information 20 and generating the load distribution characteristic table 33.

到着ノード数算出手段１５０は、負荷分散特性とワークロード特性から、部分ブロック集合へのアクセスが到着するノード数を算出する機能を有する。 The arrival node number calculating means 150 has a function of calculating the number of nodes where access to the partial block set arrives from the load distribution characteristic and the workload characteristic.

ここで、部分ブロック集合とは、１つ以上のメモリーブロックを含むメモリーブロックの集合である。各ノードの主記憶装置には同じＩＤ（識別番号）を持つ部分ブロック集合が配置され、ノード間で共有される。 Here, the partial block set is a set of memory blocks including one or more memory blocks. A partial block set having the same ID (identification number) is arranged in the main storage device of each node and is shared among the nodes.

部分ブロック集合は各ノードの主記憶装置のキャッシュメモリーに設定され、ノード間で同じ情報を保つようにコントロールされる。但し、同じＩＤの部分ブロック集合であっても、クライアントからのアクセスがないなどの理由により、全てのノード間でデータが一致していない場合もある。 The partial block set is set in the cache memory of the main storage device of each node, and is controlled so as to maintain the same information between the nodes. However, even in a partial block set with the same ID, there is a case where data does not match between all nodes due to reasons such as no access from the client.

オーバーヘッド予測手段１６０は、クラスタシステム４０のクラスタリング・オーバーヘッドを計算する手段であり、到着ノード数からクラスタリング・オーバーヘッドとスケーラビリティを算出する機能を有する。 The overhead prediction unit 160 is a unit that calculates the clustering overhead of the cluster system 40 and has a function of calculating the clustering overhead and scalability from the number of arrival nodes.

出力手段１７０は、結果を出力する手段であり、予測したクラスタリング・オーバーヘッドとスケーラビリティを出力する。 The output unit 170 is a unit that outputs a result, and outputs the predicted clustering overhead and scalability.

クラスタシステムの性能予測装置１０は、データ処理装置、コンピュータなどにより構成することができる。この場合、入力手段１１０としては、キーボード、マウス等を用いることができる。また、出力手段１７０としては、プリンタ、ディスプレイなどを用いることができる。 The cluster system performance prediction device 10 can be configured by a data processing device, a computer, or the like. In this case, a keyboard, a mouse, or the like can be used as the input unit 110. As the output unit 170, a printer, a display, or the like can be used.

次に、本実施の形態によるクラスタシステムの性能予測装置１０の利用形態を説明する。 Next, a usage form of the performance prediction apparatus 10 of the cluster system according to the present embodiment will be described.

図２を参照すると、本実施の形態によるクラスタシステムの性能予測装置１０は、通信回線を介してクラスタシステム４０のノードに接続されている。クラスタシステムの性能予測装置１０は、ノードからオーバーヘッド基礎情報２０を取得して、クラスタシステム４０の性能予測を行う。 Referring to FIG. 2, the performance prediction apparatus 10 of the cluster system according to this embodiment is connected to a node of the cluster system 40 via a communication line. The performance prediction device 10 of the cluster system acquires the overhead basic information 20 from the node and performs performance prediction of the cluster system 40.

なお、クラスタシステムの性能予測装置１０は必ずしもクラスタシステム４０のノードに接続されている必要はない。クラスタシステムの性能予測装置１０がクラスタシステム４０と接続されていない場合、クラスタシステム４０のクラスタリング・オーバーヘッド基礎情報２０は人手、あるいは記憶媒体によりクラスタシステムの性能予測装置１０に入力する。 Note that the cluster system performance prediction apparatus 10 does not necessarily have to be connected to a node of the cluster system 40. When the cluster system performance prediction apparatus 10 is not connected to the cluster system 40, the clustering overhead basic information 20 of the cluster system 40 is input to the cluster system performance prediction apparatus 10 manually or by a storage medium.

また、クラスタシステムの性能予測装置１０はノード内に実装されていてもよい。このような場合、オーバーヘッド基礎情報２０はノードからクラスタシステムの性能予測装置１０へ入力される。 Further, the performance prediction device 10 of the cluster system may be mounted in the node. In such a case, the overhead basic information 20 is input from the node to the performance prediction apparatus 10 of the cluster system.

本発明の特徴である、ワークロード特性情報２１０と負荷分散特性情報２２０とコピーコスト情報２３０に基づき、クラスタリング・オーバーヘッドを算出する機能については、コンピュータ装置の内部にそのような機能を実現するプログラムを組み込んだ回路部品を実装して実現することも可能である。しかしながら、本発明の特徴的な機能を実現するためのプログラム（アプリケーション）を記憶媒体に記憶させ、コンピュータ装置で当該プログラムを実行することで、クラスタシステムの性能予測装置１０として機能させることもできる。 As for the function of calculating the clustering overhead based on the workload characteristic information 210, the load distribution characteristic information 220, and the copy cost information 230, which is a feature of the present invention, a program for realizing such a function is provided in the computer apparatus. It can also be realized by mounting the incorporated circuit components. However, it is also possible to store the program (application) for realizing the characteristic function of the present invention in a storage medium and execute the program on a computer device, thereby functioning as the performance prediction device 10 of the cluster system.

図３は、本実施の形態による特性テーブル３０の内容の構成を示すブロック図である。 FIG. 3 is a block diagram showing a configuration of the contents of the characteristic table 30 according to the present embodiment.

図３を参照すると、特性テーブル３０は、コピーコストテーブル３１と、ワークロード特性テーブル３２と、負荷分散特性テーブル３３による構成となっている。 Referring to FIG. 3, the characteristic table 30 includes a copy cost table 31, a workload characteristic table 32, and a load distribution characteristic table 33.

コピーコストテーブル３１にはコピーコストが書き込まれる。コピーコストテーブル３１に記入される情報は、コピーコスト１つのみである。 A copy cost is written in the copy cost table 31. The information entered in the copy cost table 31 is only one copy cost.

ワークロード特性テーブル３２は、各部分ブロック集合におけるウィンドウの出現回数のテーブルであり、ウィンドウの出現回数はウィンドウサイズ毎に書き込まれる。ここで、ウィンドウとは、クライアントから特定の種類のアクセスがメモリーブロック集合に対してなされてから次に同種のアクセスがなされる迄のアクセスの集合（全てのアクセス）を指す。また、ウィンドウサイズとは、ウィンドウを構成するアクセスの数を表す。ウィンドウサイズの詳細については、図６で説明する。 The workload characteristic table 32 is a table of the number of appearances of windows in each partial block set, and the number of appearances of windows is written for each window size. Here, the window refers to a set of accesses (all accesses) from when a specific type of access is made to the memory block set by the client until the next same type of access is made. The window size represents the number of accesses constituting the window. Details of the window size will be described with reference to FIG.

負荷分散特性テーブル３３は、ノードの各部分ブロック集合へのアクセスの到着確率のテーブルである。 The load distribution characteristic table 33 is a table of the arrival probability of access to each partial block set of a node.

ワークロード特性テーブル３２、負荷分散特性テーブル３３については、図５、図６でそれぞれ詳細に説明する。 The workload characteristic table 32 and the load distribution characteristic table 33 will be described in detail with reference to FIGS.

次に、本実施の形態の動作について詳細に説明する。 Next, the operation of the present embodiment will be described in detail.

図４は、本実施の形態によるクラスタシステムの性能予測装置１０の動作を説明するためのフローチャートである。以下の説明では、必要に応じて図１の主要な部分を参照する。 FIG. 4 is a flowchart for explaining the operation of the performance prediction apparatus 10 of the cluster system according to this embodiment. In the following description, reference is made to the main part of FIG. 1 as necessary.

図４を参照すると、最初に、入力手段１１０はオーバーヘッド基礎情報２０を入力する（ステップＡ１）。 Referring to FIG. 4, first, the input means 110 inputs the overhead basic information 20 (step A1).

オーバーヘッド基礎情報２０は、人手により入力する場合と、クラスタシステム４０のノードから通信回線経由でクラスタシステムの性能予測装置１０に入力する場合がある。 The overhead basic information 20 may be input manually or may be input from a node of the cluster system 40 to the performance prediction apparatus 10 of the cluster system via a communication line.

次に、コピーコスト生成手段１２０がオーバーヘッド基礎情報２０に含まれるコピーコスト情報２１０から、コピーコストテーブル３１を生成する（ステップＡ２）。 Next, the copy cost generation means 120 generates the copy cost table 31 from the copy cost information 210 included in the overhead basic information 20 (step A2).

本実施の形態ではブロックコピー１回で、１つのＣＰＵの何パーセントを消費するかがコピーコスト情報２１０として与えられているとする。したがって、コピーコスト生成手段１２０はコピーコスト情報２１０をそのままコピーコストテーブル３１に書き込む。 In this embodiment, it is assumed that what percentage of one CPU is consumed in one block copy is given as copy cost information 210. Therefore, the copy cost generation unit 120 writes the copy cost information 210 into the copy cost table 31 as it is.

ワークロード特性生成手段１３０は、オーバーヘッド基礎情報２０に含まれるワークロード特性情報２２０からワークロード特性テーブル３２を生成する（ステップＡ３）。 The workload characteristic generation unit 130 generates the workload characteristic table 32 from the workload characteristic information 220 included in the overhead basic information 20 (step A3).

本実施の形態では、ワークロード特性情報２２０がワークロード特性テーブル３２として与えられているので、ワークロード特性生成手段１３０はワークロード特性情報２２をそのままワークロード特性テーブル３２に書き込む。 In this embodiment, since the workload characteristic information 220 is given as the workload characteristic table 32, the workload characteristic generation unit 130 writes the workload characteristic information 22 into the workload characteristic table 32 as it is.

負荷分散特性取得手段１４０は、オーバーヘッド基礎情報２０に含まれる負荷分散特性情報２３０から負荷分散特性テーブル３３を生成する（ステップＡ４）。 The load distribution characteristic acquisition unit 140 generates the load distribution characteristic table 33 from the load distribution characteristic information 230 included in the overhead basic information 20 (step A4).

本実施の形態では、負荷分散特性情報２３０が負荷分散特性テーブル３３として与えられているので、負荷分散特性取得手段１４０は負荷分散特性情報２３０をそのまま負荷分散特性テーブル３３に書き込む。 In this embodiment, since the load distribution characteristic information 230 is given as the load distribution characteristic table 33, the load distribution characteristic acquisition unit 140 writes the load distribution characteristic information 230 into the load distribution characteristic table 33 as it is.

上述した、ステップＡ１からステップＡ４のステップの内、ステップＡ２、ステップＡ３、ステップＡ４の順序は入れ替えることができる。 Among the steps A1 to A4 described above, the order of steps A2, A3, and A4 can be changed.

図５は、本実施の形態によるワークロード特性テーブル例３２Ａを示す図である。 FIG. 5 is a diagram showing a workload characteristic table example 32A according to the present embodiment.

図５を参照すると、ワークロード特性テーブル例３２Ａでは、各部分ブロック集合でのウィンドウの出現回数をウィンドウサイズ毎に示している。なお、部分ブロック集合は全てのノードに配置されている。 Referring to FIG. 5, in the workload characteristic table example 32A, the number of appearances of windows in each partial block set is shown for each window size. Note that the partial block set is arranged in all nodes.

各行に示したＡ、Ｂ、Ｃの文字は各部分ブロック集合のＩＤであり、各列に示した３、４、５、７の数字は各ウィンドウサイズを表す。 The characters A, B, and C shown in each row are the IDs of each partial block set, and the numbers 3, 4, 5, and 7 shown in each column represent the window sizes.

図５のワークロード特性テーブル例３２Ａから、例えば、部分ブロック集合Ｂに対するアクセスではサイズ５のウィンドウが８００回出現することがわかる。 From the workload characteristic table example 32A in FIG. 5, for example, it is understood that a window of size 5 appears 800 times when accessing the partial block set B.

図６は、本実施の形態による負荷分散特性テーブル例３３Ａを示す図である。 FIG. 6 is a diagram showing a load distribution characteristic table example 33A according to the present embodiment.

図６を参照すると、負荷分散特性テーブル例３３Ａでは、Ａ、Ｂ、Ｃのそれぞれの部分ブロック集合毎に、１、２、３、４の各ノードへアクセスが到着する確率が示されている。負荷分散特性テーブル例３３Ａからは、部分ブロック集合Ａへのアクセスは、ノード１に対しては０．１、ノード２に対しては０．３、ノード３に対しては０．２、ノード４に対しては０．４の確率で到着することがわかる。 Referring to FIG. 6, the load distribution characteristic table example 33 </ b> A shows the probability of access reaching each of the nodes 1, 2, 3, and 4 for each of the partial block sets of A, B, and C. From the load distribution characteristic table example 33A, the access to the partial block set A is 0.1 for the node 1, 0.3 for the node 2, 0.2 for the node 3, and the node 4 It can be seen that there is a probability of 0.4.

本実施の形態では、クライアントからノードに対してブロックの内容を書き換える「write」アクセスと最新のブロックの内容を読み出す「read」アクセスの２種類のアクセスがなされることを前提とする。また、ワークロード特性テーブルは、「write」アクセスにより書き込まれたデータが「read」アクセスにより読み出されるシーケンス動作による負荷を表したものとなる。例えば、ウィンドウサイズ４であると、「write, read, read, read, (write) 」というアクセスシーケンスが出現することを意味する。ここで、アクセスシーケンスの末尾の(write)は「write」アクセスであるが、最後の「(write)」アクセスで内容が書き換えられるため、括弧内で示してある。ウィンドウサイズは、「write」アクセスから、次の「write」アクセスの直前の「read」アクセス迄のアクセス回数で表される。 In the present embodiment, it is assumed that two types of access are performed: a “write” access that rewrites the contents of a block from a client to a node and a “read” access that reads the contents of the latest block. The workload characteristic table represents a load caused by a sequence operation in which data written by the “write” access is read by the “read” access. For example, when the window size is 4, it means that an access sequence “write, read, read, read, (write)” appears. Here, (write) at the end of the access sequence is “write” access, but the content is rewritten by the last “(write)” access, and therefore, it is shown in parentheses. The window size is represented by the number of accesses from “write” access to “read” access immediately before the next “write” access.

到着ノード数算出手段１５０は、与えられた負荷分散特性テーブル３３に関して、到着ノード数がウィンドウサイズの増加に伴い広義の意味で単調増加するように到着ノード数を算出する（ステップＡ５）。 The arrival node number calculation means 150 calculates the arrival node number so that the arrival node number monotonously increases in a broad sense as the window size increases with respect to the given load distribution characteristic table 33 (step A5).

なお、到着ノード数は各部分ブロック集合毎に算出される。 The number of arrival nodes is calculated for each partial block set.

ここで、広義の意味で単調増加するとは、ウィンドウサイズが増加した場合に、到着ノード数が増加すること又は不変であることを表す。また、到着ノード数は、１つのウィンドウ内でアクセスが到着するノードの数の期待値を表す。 Here, monotonically increasing in a broad sense means that the number of arrival nodes increases or remains unchanged when the window size increases. In addition, the number of arrival nodes represents an expected value of the number of nodes to which access arrives in one window.

到着ノード数を、ウィンドウサイズに対して広義の意味で単調増加するように算出するのは、ウィンドウサイズが増すとノードへのアクセス回数が増加するため、到着ノード数が増加すること又は不変であることはあっても、到着ノード数が減少することはないという理由によるものである。 The reason why the number of arriving nodes is calculated so as to increase monotonically in a broad sense with respect to the window size is that the number of times of accessing the node increases as the window size increases, so that the number of arriving nodes increases or does not change. This is because the number of arrival nodes does not decrease.

例えば、ある負荷分散特性テーブル３３の下で、部分ブロック集合Ａへのアクセスがウィンドウサイズ５で発生したときを考える。到着ノード数算出手段１５０が、この５回のアクセスは３つのノードに到着する（すなわち到着ノード数は３）と算出したら、到着ノード数算出手段１５０はウィンドウサイズが６のときには到着ノード数は３以上と算出する。 For example, consider a case where an access to a partial block set A occurs with a window size of 5 under a certain load distribution characteristic table 33. If the arrival node number calculation means 150 calculates that these five accesses arrive at three nodes (that is, the arrival node number is 3), the arrival node number calculation means 150 determines that the arrival node number is 3 when the window size is 6. Calculated as above.

クラスタサイズ n とウィンドウサイズ w が与えられたときの、到着ノード数の算出方法の例を説明する。各ノードＩＤ（ノードの識別番号）を、１から n の数で表す。上述したように、到着ノード数は部分ブロック集合毎に算出される。 An example of a method for calculating the number of arrival nodes when a cluster size n and a window size w are given will be described. Each node ID (node identification number) is represented by a number from 1 to n. As described above, the number of arrival nodes is calculated for each partial block set.

ノード i へのアクセスが到着する確率を p_i （Σ p_i ＝１）とする。p_i は負荷分散特性テーブル３３から取得できる。 The probability of access to the node i arrives and _{_{p i (Σ p i = 1}} ). p _i can be acquired from the load distribution characteristic table 33.

また、ある１つのウィンドウ内でノード i へ到着したアクセスの数を d_iとする。ここで

ｄ＝（ｄ_１，ｄ_２，... ，ｄ_ｎ）・・・（式１）

ｐ＝（ｐ_１，ｐ_２，... ，ｐ_ｎ）・・・（式２）

と書く。 Also, let d _i be the number of accesses that have arrived at node i within a window. here

d = (d ₁ , d ₂ ,..., d _n ) (Formula 1)

p = (p ₁ , p ₂ ,..., p _n ) (Expression 2)

Write.

ノードＩＤが１, ２, … n であるそれぞれのノードへアクセスが到着する n 個の事象が出現する確率は、上述のようにそれぞれp₁, p₂,… p_n で与えられる。ウィンドウサイズ w 回のアクセスがなされた際、当該 n 個の事象がそれぞれd₁, d₂,… d_n 回起こる事象の出現確率 m(w, d, p) は、多項分布を用いることにより

で与えられる。式３でΠは、積を計算する記号である。また、d₁, d₂,… d_nは

ｄ_１＋ｄ_２＋・・・ｄ_ｎ＝ｗ・・・（式４）

を満たす自然数である。ここで、自然数は０を含む。 Node ID is 1, 2, the probability that the n-number of events ... arrives access to a is each node n appears, p _1, p _2, respectively, as described above, is given by ... p _n. When the window size is accessed w times, the occurrence probability m (w, d, p) of the event in which the _n events occur d ₁ , d ₂ , ... d _n times is obtained by using the multinomial distribution.

Given in. In Equation 3, Π is a symbol for calculating the product. D ₁ , d ₂ , ... d _n are

_{_{_{d 1 + d 2 + ··· d}}} n = w ··· ( Equation 4)

It is a natural number that satisfies Here, the natural number includes 0.

次に、到着ノード数の算出を説明する。ここで、到着ノード数は、到着ノード数の期待値を表す。 Next, calculation of the number of arrival nodes will be described. Here, the number of arrival nodes represents an expected value of the number of arrival nodes.

まずアクセスが到着するノード数（以下ではアクセスが到着するノード数をアクセス到着ノード数と略す）を固定して、到着ノード数の期待値を求め、次に全てのアクセス到着ノード数について到着ノード数の期待値を合計する。以下、アクセス到着ノード数を固定した場合の到着ノード数の期待値を、到着ノード数の部分期待値と記述する。 First, the number of nodes arriving at the access (hereinafter, the number of nodes arriving at the access is abbreviated as the number of access arrival nodes) is fixed, the expected value of the number of arrival nodes is obtained, and then the number of arrival nodes for all the number of access arrival nodes Sum the expected values of. Hereinafter, the expected value of the number of arrival nodes when the number of access arrival nodes is fixed is described as a partial expected value of the number of arrival nodes.

式４を満たす d_i のうち、k 個が１以上の自然数である d の集合を D(n, w, k) と記述する。ここで、k はアクセス到着ノード数を表す。 Among d _i satisfying Equation 4, a set of d whose k is a natural number of 1 or more is described as D (n, w, k). Here, k represents the number of access arrival nodes.

一例であるが、クラスタサイズ n が５の場合、ノード数は５であるが、k が３であると、５個のノードの内、３個のノードにはアクセスが到着するが、２個のノードにはアクセスは到着しない。 As an example, if the cluster size n is 5, the number of nodes is 5, but if k is 3, access will arrive at 3 of the 5 nodes, but 2 Access does not arrive at the node.

まず、到着ノード数の部分期待値を求めるが、この部分期待値は、式３の確率にk を乗算した値を D(n, w, k) の集合について合計することにより算出することができる。 First, the partial expected value of the number of arriving nodes is obtained. This partial expected value can be calculated by adding the value obtained by multiplying the probability of Equation 3 by k for the set of D (n, w, k). .

次に全ての k について、到着ノード数の部分期待値を加えるが、その前に k の最大値を決定する。 Next, for all k, the partial expected value of the number of arrival nodes is added, but before that, the maximum value of k is determined.

まず、ウィンドウサイズ w がクラスタサイズ n よりも大きい場合、アクセス到着ノード数 k はクラスタサイズ n （＝ノード数）を越えることはない。また、ウィンドウサイズ w がクラスタサイズ n よりも小さい場合、アクセス到着ノード数 k は、ウィンドウサイズ w （＝アクセス数）を越えることはできない。ウィンドウサイズ w とクラスタサイズ n が等しい場合も同様である。したがって、k の最大値は、min(n, w)であることがわかる。 First, when the window size w is larger than the cluster size n, the access arrival node count k does not exceed the cluster size n (= node count). Further, when the window size w is smaller than the cluster size n, the access arrival node count k cannot exceed the window size w (= access count). The same applies when the window size w is equal to the cluster size n. Therefore, it can be seen that the maximum value of k is min (n, w).

以上より、次の式で到着ノード数を算出する。

a(n, w)は、１ウィンドウ中にアクセスが到着するノードの数の期待値になっている。なお、a(n, w) の値や m(w, d, p) の値は、予め計算して表として持っていてもよい。 From the above, the number of arrival nodes is calculated by the following equation.

a (n, w) is an expected value of the number of nodes to which access arrives in one window. The value of a (n, w) and the value of m (w, d, p) may be calculated in advance and stored as a table.

式５以外の到着ノード数 a(n, w)の算出方法の例としては、

ａ(ｎ,ｗ)＝αｎ^θ＋βｗ^φ＋γ ・・・（式６）

ａ(ｎ,ｗ)＝αlogｎ＋βlogｗ＋γ ・・・（式７）

などがある。ここで、α、β、γ、θ、φは実数である。なお、これらの式においても到着ノード数は、ウィンドウサイズの増加に伴い広義の意味で単調増加するように算出される。 As an example of the calculation method of the arrival node number a (n, w) other than Equation 5,

a (n, w) = αn θ + βw φ + γ ··· ( Equation 6)

a (n, w) = αlogn + βlogw + γ (Expression 7)

and so on. Here, α, β, γ, θ, and φ are real numbers. In these formulas as well, the number of arrival nodes is calculated so as to increase monotonically in a broad sense as the window size increases.

また、a(n, w) を多項分布から算出する代わりに、多次元正規分布を用いる方法がある。 There is a method of using a multidimensional normal distribution instead of calculating a (n, w) from a multinomial distribution.

上述のようにして、クラスタサイズ n とウィンドウサイズ w が与えられたときの、到着ノード数が算出される。 As described above, the number of arrival nodes when the cluster size n and the window size w are given is calculated.

オーバーヘッド予測手段１６０は、到着ノード数からブロックコピー回数を算出する（ステップＡ６）。 The overhead prediction unit 160 calculates the number of block copies from the number of arrival nodes (step A6).

なお、ブロックコピー回数は単位時間当たりのブロックコピー回数である。また、ステップＡ６で算出されるブロックコピー回数はウィンドウサイズ w を固定した場合のブロックコピー回数である。以下、ウィンドウサイズ w を固定した場合のブロックコピー回数を部分ブロックコピー回数と記述する。 The block copy count is the block copy count per unit time. The block copy count calculated in step A6 is the block copy count when the window size w is fixed. Hereinafter, the block copy count when the window size w is fixed is described as the partial block copy count.

負荷分散特性テーブル３３とクラスタサイズとウィンドウサイズを指定した場合の部分ブロックコピー回数 b(n, w) の算出方法の例としては、

ｂ(ｎ,ｗ)＝ａ(ｎ,ｗ)−１・・・（式８）

がある。b(n, w) は、「read」アクセスだけが到着したノード数に等しい。式８は、部分ブロック集合が「write」アクセスにより更新された後に、「write」アクセスが到着しなかったノードに初めて「read」アクセスが到着したときにブロックコピーが起こることを想定した算出方法である。 As an example of a method of calculating the partial block copy count b (n, w) when the load distribution characteristic table 33, cluster size, and window size are specified,

b (n, w) = a (n, w) −1 (Expression 8)

There is. b (n, w) is equal to the number of nodes where only “read” access has arrived. Equation 8 is a calculation method that assumes that after the partial block set is updated by “write” access, block copy occurs when “read” access arrives for the first time at a node to which “write” access has not arrived. is there.

本実施の形態の最初に述べたように、「write」アクセスではクライアントによる書込みがなされるため、書込みのなされた部分ブロック集合の内容は最新のものではなくなる。そのため、初めて「read」アクセスが到着したときには、ノードでクライアントの読みだす主記憶装置内の部分ブロック集合が最新でないことが検知され、別のノードから最新の部分ブロック集合のデータをコピーするブロックコピーが発生する。 As described at the beginning of the present embodiment, since “write” access is performed by the client, the contents of the partial block set that has been written are not the latest. Therefore, when the “read” access arrives for the first time, it is detected that the partial block set in the main storage device read by the client is not the latest, and the block copy that copies the latest partial block set data from another node Will occur.

式８は、例えば、ノード数が５個あり、ウィンドウサイズが３個であった場合、ウィンドウを構成するアクセスは、「write、read、read」となるが、最初の「write」アクセスが到着しなかった４個のノードの内、１つのノードに「read」アクセスが到着し、別のノードに「read」アクセスが到着した場合、a(n, w)は３であるが、b(n, w)は２となることを表している。 For example, if the number of nodes is 5 and the window size is 3, the access constituting the window is “write, read, read”, but the first “write” access arrives. If the “read” access arrives at one node and the “read” access arrives at another node among the four nodes that did not exist, a (n, w) is 3, but b (n, w) represents 2;

また、この例で、２個の「read」アクセスが到着するノードが同じノードの場合、最初の「read」アクセスでは部分ブロック集合を最新のものとするためのブロックコピーが起こるが、２個めの「read」アクセスではブロックコピーは発生しない。この場合、a(n, w)は２となり、b(n, w)は１となる。 Also, in this example, if two “read” accesses arrive at the same node, the first “read” access causes a block copy to make the partial block set up-to-date. Block read does not occur in "read" access. In this case, a (n, w) is 2 and b (n, w) is 1.

次に、全てのウィンドウサイズの場合について部分ブロックコピー回数を合計することにより、部分ブロック集合に対して発生するブロックコピー回数を算出する。 Next, the total number of partial block copies for all window sizes is calculated to calculate the number of block copies generated for the partial block set.

W を出現回数が０でないウィンドウサイズの集合、f (w) をウィンドウサイズ w の出現回数として、

とする。b(n, W) は、部分ブロック集合に対して発生するブロックコピー回数を表している。 Let W be the set of non-zero window sizes, f (w) be the number of occurrences of window size w,

And b (n, W) represents the number of block copies generated for the partial block set.

次に、各部分ブロック集合に対するブロックコピー回数を合計して、全体のブロックコピー回数を算出する。 Next, the total number of block copies is calculated by summing up the number of block copies for each partial block set.

特性テーブル３０に出現する各部分ブロック集合をS_iとして

ｂ(Ｓ_ｉ)＝ｂ(ｎ,Ｗ) ・・・（式１０）

とする。また、S_i∈Sとする。ここで、S は各部分ブロック集合S_iの集合である。ここで、ワークロード特性テーブル３２の全てのデータを取得するために、ウィンドウ情報を測定した期間をperiodと記述する。そして、ブロックコピー回数 B(S) を、

と算出する。ここで、ブロックコピー回数は単位時間当たりのブロックコピー回数である。 Each partial block set appearing in the characteristic table 30 is defined as S _i.

b (S _i ) = b (n, W) (Equation 10)

And Also, let S _i ∈S. Here, S is a set of each partial block set S _i . Here, in order to acquire all the data of the workload characteristic table 32, the period during which the window information is measured is described as period. And the block copy count B (S) is

And calculate. Here, the number of block copies is the number of block copies per unit time.

オーバーヘッド予測手段１６０は、コピーコストをcopy 、クラスタリング・オーバーヘッドをoverhead 、スケーラビリティをscalability と記述すると、

により、ブロックコピー回数からクラスタリング・オーバーヘッドとスケーラビリティを算出する（ステップＡ７）。 The overhead predictor 160 describes the copy cost as copy, the clustering overhead as overhead, and the scalability as scalability.

Thus, the clustering overhead and scalability are calculated from the number of block copies (step A7).

最後に、出力手段１７０はクラスタリング・オーバーヘッドとスケーラビリティを出力する（ステップＡ８）。 Finally, the output unit 170 outputs the clustering overhead and scalability (step A8).

次に、クラスタシステムの性能予測装置１０のハードウェア構成を説明する。 Next, the hardware configuration of the performance prediction device 10 of the cluster system will be described.

図７は、本実施の形態によるクラスタシステムの性能予測装置１０のハードウェア構成を示すブロック図である。 FIG. 7 is a block diagram showing a hardware configuration of the performance prediction apparatus 10 of the cluster system according to this embodiment.

図７を参照すると、クラスタシステム性能予測装置１０は、ＣＰＵ１０１、ＲＡＭ及びＲＯＭで構成される主記憶部１０２、通信回線を介してデータ通信を行う通信部１０３、プリンタ、ディスプレイ等の出力部１０４、キーボード、マウス等の入力部１０５及びハードディスク装置等の補助記憶部１０６を備えている。 Referring to FIG. 7, a cluster system performance prediction apparatus 10 includes a CPU 101, a main storage unit 102 including a RAM and a ROM, a communication unit 103 that performs data communication via a communication line, an output unit 104 such as a printer and a display, An input unit 105 such as a keyboard and a mouse and an auxiliary storage unit 106 such as a hard disk device are provided.

以上述べたように、本実施例によるクラスタシステムの性能予測装置１０は、
通信回線を介して接続される複数のノードの主記憶装置上のデータをノード間で互いに共有するクラスタシステム４０の性能を予測する装置であって、主記憶装置内のメモリーブロックを単位としてノード間でなされる単位時間当たりのブロックコピーの回数を算出し、ブロックコピー回数の情報と、ブロックコピーが１回なされる際のＣＰＵ負荷の情報に基づき、クラスタシステムで発生するクラスタリング・オーバーヘッドを算出するものである。 As described above, the performance prediction device 10 of the cluster system according to the present embodiment is
A device for predicting the performance of a cluster system 40 that shares data on main storage devices of a plurality of nodes connected to each other via a communication line between the nodes. Calculates the number of block copies per unit time made in step 1, and calculates the clustering overhead that occurs in the cluster system based on the information on the number of block copies and the information on the CPU load when one block copy is made It is.

（第１の実施の形態の効果）
以上説明した実施の形態例によれば、通信回線経由でデータを共有するクラスタシステム４０で発生するクラスタリング・オーバーヘッドを予測することができる。 (Effects of the first embodiment)
According to the embodiment described above, the clustering overhead generated in the cluster system 40 that shares data via the communication line can be predicted.

その理由は、クラスタシステム４０のワークロード特性と負荷分散特性とコピーコストから、クラスタリング・オーバーヘッドの主要因であるブロックコピーにより発生するオーバーヘッドを算出するためである。 This is because the overhead generated by block copy, which is the main factor of clustering overhead, is calculated from the workload characteristics, load distribution characteristics, and copy cost of the cluster system 40.

また、上述したクラスタシステム４０で発生するスケーラビリティを予測することができる。 In addition, the scalability that occurs in the cluster system 40 described above can be predicted.

その理由は、クラスタシステムのスケーラビリティの決定要因がクラスタリング・オーバーヘッドであることを仮定して、クラスタリング・オーバーヘッドからスケーラビリティを算出するためである。 This is because the scalability is calculated from the clustering overhead on the assumption that the determining factor of the scalability of the cluster system is the clustering overhead.

（第２の実施の形態）
以下、本発明の第２の実施の形態について図面を参照して説明する。 (Second Embodiment)
The second embodiment of the present invention will be described below with reference to the drawings.

図８は、本実施の形態によるクラスタシステムの性能予測装置１１の構成を示すブロック図である。 FIG. 8 is a block diagram showing a configuration of the performance prediction apparatus 11 of the cluster system according to the present embodiment.

図８を参照すると、本実施の形態によるクラスタシステムの性能予測装置１１の構成は、図１に示した第１の実施の形態によるクラスタシステムの性能予測装置１０の構成と同様である。しかしながら、負荷分散特性生成手段１４１の機能、到着ノード数算出手段１５１の機能は、第１の実施の形態による負荷分散特性生成手段１４０の機能、到着ノード数算出手段１５０の機能とそれぞれ異なるものとなっている。また、オーバーヘッド基礎情報２１に含まれる負荷分散特性情報２３１も、第１の実施の形態によるオーバーヘッド基礎情報２０に含まれる負荷分散特性情報２３０と異なる。 Referring to FIG. 8, the configuration of the cluster system performance prediction apparatus 11 according to the present embodiment is the same as the configuration of the cluster system performance prediction apparatus 10 according to the first embodiment shown in FIG. However, the function of the load distribution characteristic generation unit 141 and the function of the arrival node number calculation unit 151 are different from the function of the load distribution characteristic generation unit 140 and the function of the arrival node number calculation unit 150 according to the first embodiment, respectively. It has become. Further, the load distribution characteristic information 231 included in the overhead basic information 21 is also different from the load distribution characteristic information 230 included in the overhead basic information 20 according to the first embodiment.

負荷分散特性生成手段１４１の機能及び到着ノード数算出手段１５１の機能及び負荷分散特性情報２３１については、図１１で説明する。 The function of the load distribution characteristic generation unit 141, the function of the arrival node number calculation unit 151, and the load distribution characteristic information 231 will be described with reference to FIG.

図９は、本実施の形態による遷移特性テーブル集合３６の一例を示す図である。本実施の形態では、第１の実施の形態で説明した、図３の負荷分散特性テーブル３３の代わりに、遷移特性テーブル集合３６を使用する。 FIG. 9 is a diagram showing an example of the transition characteristic table set 36 according to the present embodiment. In this embodiment, a transition characteristic table set 36 is used instead of the load distribution characteristic table 33 of FIG. 3 described in the first embodiment.

遷移特性テーブル集合３６は、遷移特性テーブル３６Ａ、３６Ｂ、…、からなり、部分ブロック集合Ａ、Ｂ、…に関するアクセスの情報は、それぞれ遷移特性テーブル３６Ａ、３６Ｂ、…に書き込まれている。 The transition characteristic table set 36 includes transition characteristic tables 36A, 36B,..., And access information related to the partial block sets A, B,... Is written in the transition characteristic tables 36A, 36B,.

図９を参照すると、遷移特性テーブルは便宜上２次元のテーブルで表されている。しかし、一般には遷移特性テーブル（図示しない）は k 次元のノードＩＤ（識別番号）の表である。第１の実施の形態では、k はアクセス到着ノード数を表したが、本実施の形態では、k はノードＩＤの表の次元を表す自然数である。 Referring to FIG. 9, the transition characteristic table is represented as a two-dimensional table for convenience. However, in general, a transition characteristic table (not shown) is a table of k-dimensional node IDs (identification numbers). In the first embodiment, k represents the number of access arrival nodes, but in the present embodiment, k is a natural number representing the dimension of the node ID table.

最初に、ノードへのアクセスシーケンスが出現する確率を算出する式を説明する。 First, an expression for calculating the probability of occurrence of an access sequence to a node will be described.

アクセスシーケンス(v₁, v₂, … , v_k)の出現確率を q(v₁, v₂, … v_k)で表す。ここで、アクセスシーケンス(v₁, v₂, … , v_k)は、前回アクセスしたノードＩＤがv₁であり、前々回アクセスしたノードＩＤがv₂であり、k 回前にアクセスしたノードＩＤがv_kであるアクセスシーケンスを表す。 Access sequence _{_{(v 1, v 2, ...}} , v k) the occurrence probability q of _{_{(v 1, v 2, ...}} v k) represented by. Here, in the access sequence (v ₁ , v ₂ ,..., V _k ), the node ID accessed last time is v ₁ , the node ID accessed last time is v ₂ , and the node ID accessed k times before is v represents an access sequence that is _k .

また、v_iをｉ回前にアクセスしたノードＩＤとし、i はk −１以下の自然数であるとする。 Also, suppose that v _i is a node ID accessed i times before, and i is a natural number of k −1 or less.

アクセスシーケンス(v₁, v₂, … , v_i)の出現確率は

により算出できる。 The appearance probability of the access sequence (v ₁ , v ₂ ,…, v _i ) is

Can be calculated.

式１４は、前回からｉ回前迄のノードへのアクセスシーケンスの出現確率を、前回からｉ＋１回以上前迄のノードへのアクセスシーケンスの出現確率から算出する式である。ｉが k −１の場合、式１４は、前回からk 回前迄のノードへのアクセスシーケンスの出現確率から、前回からk −１回前迄のアクセスシーケンスの出現確率を算出する。 Expression 14 is an expression for calculating the appearance probability of the access sequence to the node from the previous time to i times before from the appearance probability of the access sequence to the node from i to the previous time i + 1 times or more before. When i is k-1, Equation 14 calculates the appearance probability of the access sequence from the previous time to k-1 times from the previous probability of the access sequence to the node from the previous time to k times.

図１０は、本実施の形態によるクラスタシステムの性能予測装置１１における２次元の遷移特性テーブル例３７を示す図である。 FIG. 10 is a diagram showing a two-dimensional transition characteristic table example 37 in the performance prediction device 11 of the cluster system according to the present embodiment.

遷移特性テーブル例３７では、クラスタシステム４０は、４ノードからなることを想定している。各行の左端の１から４のノードＩＤは前回アクセスの発生したノードのＩＤ、また各列の上端の１から４のノードＩＤは今回アクセスの発生するノードのＩＤを表す。 In the transition characteristic table example 37, it is assumed that the cluster system 40 includes four nodes. The node IDs 1 to 4 at the left end of each row represent the IDs of the nodes where the previous access occurred, and the node IDs 1 to 4 at the upper end of each column represent the IDs of the nodes where the current access occurred.

以下では、アクセスシーケンス(n)で、今回のノードＩＤが n であるアクセスシーケンスの確率を表すことにする。また、アクセスシーケンス(m、n)で、今回のノードＩＤが m で前回のノードＩＤが n であるアクセスシーケンスの出現確率を表すことにする。 In the following, the access sequence (n) represents the probability of an access sequence in which the current node ID is n. Also, the access sequence (m, n) represents the appearance probability of an access sequence in which the current node ID is m and the previous node ID is n.

図１０を参照すると、アクセスシーケンス（１）が確率０．２５で、アクセスシーケンス（２）が確率０．１で発生することがわかる。例えば、アクセスシーケンス（１）は、前回のノードＩＤが１である確率であるため、遷移特性テーブル例３７で行の左端が１である行の数値を横に加えることにより算出する。 Referring to FIG. 10, it can be seen that access sequence (1) occurs with probability 0.25 and access sequence (2) occurs with probability 0.1. For example, since the access sequence (1) is the probability that the previous node ID is 1, the value is calculated by adding the numerical value of the row where the left end of the row is 1 in the example of the transition characteristic table 37 to the side.

また、２つのアクセスからなるシーケンス（４、３）は確率０．１で発生することがわかる。 It can also be seen that the sequence (4, 3) consisting of two accesses occurs with a probability of 0.1.

２次元の遷移特性テーブルでは、例えば、前回アクセスが到着したノードから今回アクセスが到着するノードがどのノードになるかの確率を求めることのできる負荷分散方法や、ラウンドロビンのような静的な負荷分散方法を記述できる。 In the two-dimensional transition characteristic table, for example, a load distribution method that can determine the probability of which node the current access arrives from the node that the previous access arrived, or a static load such as round robin Describe the distribution method.

また、k 次元の遷移特性テーブルでは、 ( k −1 ) 回前までのアクセスが到着したノードから今回アクセスが到着するノードがどのノードになるかの確率を求めることのできる負荷分散方法や、ウェイテッドラウンドロビンのような、前回のアクセス到着ノードだけでなく、それ以前のアクセス到着にも依存して静的に次回アクセス到着ノードを決定するような静的な負荷分散方法も記述できる。 In the k-dimensional transition characteristics table, a load balancing method that can determine the probability of which node the current access arrives from the node that has reached the access up to (k −1) times before, and the way It is also possible to describe a static load distribution method such as Ted Round Robin that statically determines the next access arrival node depending on not only the previous access arrival node but also the previous access arrival.

本発明の特徴である、ブロックコピーの発生するノードへのアクセスシーケンスを指定した遷移特性テーブル３６で負荷分散特性を記述してブロックコピー回数を求める機能については、コンピュータ装置の内部にそのような機能を実現するプログラムを組み込んだ回路部品を実装して実現することも可能である。しかしながら、本発明の特徴的な機能を実現するためのプログラム（アプリケーション）を記憶媒体に記憶させ、コンピュータ装置で当該プログラムを実行することで、クラスタシステムの性能予測装置１１として機能させることもできる。 The function that describes the load distribution characteristics in the transition characteristic table 36 that specifies the access sequence to the node where the block copy occurs, which is a feature of the present invention, and obtains the number of times of block copying is provided inside the computer device. It is also possible to implement by mounting a circuit component incorporating a program for realizing the above. However, it is also possible to store the program (application) for realizing the characteristic functions of the present invention in a storage medium and execute the program on a computer device, thereby functioning as the performance prediction device 11 of the cluster system.

次に、本実施の形態の動作を説明する。 Next, the operation of the present embodiment will be described.

図１１は、本実施の形態によるクラスタシステムの性能予測装置１１の動作を説明するためのフローチャートである。 FIG. 11 is a flowchart for explaining the operation of the performance prediction apparatus 11 of the cluster system according to this embodiment.

本実施の形態の動作は、Ｂ１からＢ３のステップ及びＢ６からＢ８のステップは、図４で説明した第１の実施の形態のＡ１からＡ３のステップ及びＡ６からＡ８のステップと同じである。しかし、Ｂ４及びＢ５のステップは、Ａ４及びＡ５のステップと異なる。以下、Ｂ４及びＢ５のステップについて説明する。 In the operation of this embodiment, the steps B1 to B3 and the steps B6 to B8 are the same as the steps A1 to A3 and the steps A6 to A8 of the first embodiment described in FIG. However, the steps B4 and B5 are different from the steps A4 and A5. Hereinafter, steps B4 and B5 will be described.

負荷分散特性作成手段１４１は、遷移特性テーブル集合３６を生成する（ステップＢ４）。 The load distribution characteristic creating unit 141 generates a transition characteristic table set 36 (step B4).

ただし、本実施の形態では、負荷分散特性情報２３１として遷移特性テーブル集合３６が与えられているので、負荷分散特性作成手段１４１は負荷分散特性情報２３１をそのまま遷移特性テーブル集合３６に書き込む。 However, in this embodiment, since the transition characteristic table set 36 is given as the load distribution characteristic information 231, the load distribution characteristic creation unit 141 writes the load distribution characteristic information 231 as it is in the transition characteristic table set 36.

到着ノード数算出手段１５１はアクセスシーケンスの発生確率から到着ノード数を算出する（ステップＢ５）。 The arrival node number calculating means 151 calculates the number of arrival nodes from the occurrence probability of the access sequence (step B5).

ノードＩＤの w 次元空間をD^wとすると、w 個のアクセスシーケンスはベクトル v ∈ D^w で表すことができる。ここでベクトル v は w 次元のベクトルであり、アクセスシーケンスを表す。v の i 番目の成分を v_i とし、v_iをｉ回前にアクセスが発生したノードＩＤであるとする。また、

ｑ(ｖ)＝ｑ(ｖ_１，ｖ_２, ... ，ｖ_ｗ) ・・・（式１５）

とする。 When the w-dimensional space of the node ID is D ^w , w access sequences can be represented by a vector v ∈ D ^w . Here, the vector v is a w-dimensional vector and represents an access sequence. Let the i-th component of v be v _i, and let v _{i be} the node ID that has been accessed i times before. Also,

q (v) = q (v ₁ , v ₂ ,..., v _w ) (Equation 15)

And

ただし、遷移特性テーブルの次元数 k が w よりも小さい場合には、

ｑ(ｖ)＝ｑ(ｖ₁, ... ,ｖ_k)×ｑ(ｖ_k+1, ... ,ｖ_2k)×ｑ(ｖ_hk+1, ... ,ｖ_ｗ) ・・・ (式１６）

となる。ここで h は正又は０の整数である。 However, if the dimension k of the transition characteristic table is smaller than w,

q (v) = q (v ₁ ,..., v _k ) × q (v _{k + 1} ,..., v _2k ) × q (v _{hk + 1} ,..., v _w ). (Formula 16)

It becomes. Here, h is a positive or 0 integer.

このとき、到着ノード数算出手段１５０は各アクセスシーケンス v とウィンドウサイズ w に対して到着ノード数 a(w) を算出する。ただし、到着ノード数は、w の増加に伴い広義の意味で単調増加することに注意する。 At this time, the arrival node number calculating means 150 calculates the arrival node number a (w) for each access sequence v and window size w. However, note that the number of arrival nodes increases monotonically in a broad sense as w increases.

到着ノード数の算出方法の例を以下の式に示す。以下の式で算出された到着ノード数は、ウィンドウサイズが w の時に１ウィンドウ中にアクセスが到着するノードの数の期待値になっている。

a(w) は、第１の実施の形態における a(n, w) に相当する。Card(v)はアクセスシーケンス vにおける、ノードＩＤの数を表す。ノードＩＤの数は、v の成分に同じノードが出現した場合には１であると計数し、２つのノードが出現した場合には２であると計数する。 An example of a method for calculating the number of arrival nodes is shown in the following equation. The number of arrival nodes calculated by the following formula is an expected value of the number of nodes to which access arrives in one window when the window size is w.

a (w) corresponds to a (n, w) in the first embodiment. Card (v) represents the number of node IDs in the access sequence v. The number of node IDs is counted as 1 when the same node appears in the component of v, and is counted as 2 when two nodes appear.

（第２実施の形態の効果）
以上説明した実施の形態によれば、第１の実施の形態よりもブロックコピー回数を正確に算出できる。 (Effects of the second embodiment)
According to the embodiment described above, the block copy count can be calculated more accurately than in the first embodiment.

その理由は、アクセスの発生するノードへのアクセスシーケンスを指定した遷移特性テーブル３６で負荷分散特性を記述するため、多くの負荷分散方法をより正確にかつ詳細に記述できるためである。 The reason is that many load distribution methods can be described more accurately and in detail because the load distribution characteristics are described in the transition characteristic table 36 that specifies the access sequence to the node where the access occurs.

（第３の実施の形態）
以下、本発明の第３の実施の形態について図面を参照して説明する。 (Third embodiment)
The third embodiment of the present invention will be described below with reference to the drawings.

図１２は、本実施の形態によるクラスタシステムの性能予測装置１２の構成を示すブロック図である。 FIG. 12 is a block diagram showing a configuration of the performance prediction device 12 of the cluster system according to the present embodiment.

図１２を参照すると、本実施の形態によるクラスタシステムの性能予測装置１２の構成は、図１に示した第１の実施の形態によるクラスタシステムの性能予測装置１０の構成と同様である。しかしながら、ワークロード特性生成手段１３１の機能、到着ノード数算出手段１５１の機能は、第１の実施の形態によるワークロード特性生成手段１３０、到着ノード数算出手段１５０の機能とそれぞれ異なるものとなっている。また、オーバーヘッド基礎情報２２に含まれるワークロード特性情報２２１は、第１の実施の形態によるオーバーヘッド基礎情報２０に含まれるワークロード特性情報２２０と異なる。 Referring to FIG. 12, the configuration of the cluster system performance prediction apparatus 12 according to the present embodiment is the same as the configuration of the cluster system performance prediction apparatus 10 according to the first embodiment shown in FIG. However, the function of the workload characteristic generation unit 131 and the function of the arrival node number calculation unit 151 are different from the functions of the workload characteristic generation unit 130 and the arrival node number calculation unit 150 according to the first embodiment, respectively. Yes. Further, the workload characteristic information 221 included in the overhead basic information 22 is different from the workload characteristic information 220 included in the overhead basic information 20 according to the first embodiment.

本実施の形態では、ワークロード特性生成手段１３１は、ワークロード特性情報２２１をワークロード特性テーブル３２へ変換する機能を有する。また、到着ノード数算出手段１５１は、ウィンドウサイズが自然数でない場合でも到着ノード数を計算する機能を有する。 In the present embodiment, the workload characteristic generation unit 131 has a function of converting the workload characteristic information 221 into the workload characteristic table 32. The arrival node number calculation means 151 has a function of calculating the arrival node number even when the window size is not a natural number.

第１の実施の形態においては、ワークロード特性情報２２０がワークロード特性テーブルで与えられていた。本実施の形態では、ワークロード特性情報２２１は、ワークロード特性テーブル３２を生成するための情報として与えられる。具体的には、ワークロード特性情報２２１は、部分ブロック集合に到着したアクセス種毎のアクセス回数、つまり「read」アクセスの数と「write」アクセスの数で与えられる。ワークロード特性情報２２１のワークロード特性テーブル３２への変換は、ワークロード特性生成手段１３１によりなされる。 In the first embodiment, the workload characteristic information 220 is given in the workload characteristic table. In the present embodiment, the workload characteristic information 221 is given as information for generating the workload characteristic table 32. Specifically, the workload characteristic information 221 is given by the number of accesses for each access type arriving at the partial block set, that is, the number of “read” accesses and the number of “write” accesses. The workload characteristic information 221 is converted into the workload characteristic table 32 by the workload characteristic generation unit 131.

図１３は、本実施の形態によるクラスタシステムの性能予測装置１２の動作を説明するためのフローチャートである。 FIG. 13 is a flowchart for explaining the operation of the performance prediction device 12 of the cluster system according to this embodiment.

本実施の形態の動作は、Ｃ１、Ｃ２、Ｃ４及びＣ６からＣ８のステップは、図３で説明した第１の実施の形態のＡ１、Ａ２、Ａ４及びＡ６からＡ８のステップと同じである。しかし、Ｃ３及びＣ５のステップは、Ａ３及びＡ５のステップと異なる。以下、Ｃ３及びＣ５のステップについて説明する。 In the operation of this embodiment, the steps C1, C2, C4, and C6 to C8 are the same as the steps A1, A2, A4, and A6 to A8 of the first embodiment described in FIG. However, the steps C3 and C5 are different from the steps A3 and A5. Hereinafter, steps C3 and C5 will be described.

第１の実施の形態では、ステップＡ３におけるワークロード特性生成手段１３の動作は、ワークロード特性情報２２０をワークロード特性テーブル３２に書き込むだけであった。 In the first embodiment, the operation of the workload characteristic generation unit 13 in step A3 only writes the workload characteristic information 220 to the workload characteristic table 32.

また、ステップＡ５においては、到着ノード数算出手段１５０は自然数のウィンドウサイズからのみ到着ノード数を算出していた。 In step A5, the arrival node number calculating means 150 calculates the number of arrival nodes only from the natural window size.

本実施の形態では、ステップＣ３において、ワークロード特性生成手段１３１は「read」アクセスの数と「write」アクセスの数からウィンドウサイズとその出現頻度を算出し、ワークロード特性テーブル３２を生成する（ステップＣ３）。 In this embodiment, in step C3, the workload characteristic generation unit 131 calculates the window size and the appearance frequency from the number of “read” accesses and the number of “write” accesses, and generates the workload characteristic table 32 ( Step C3).

ステップＣ３においては、ワークロード特性生成手段１３１は、「read」アクセスの数と「write」アクセス数との和を「write」アクセスの数で割った値をウィンドウサイズとし、当該ウィンドウサイズの出現頻度を「write」アクセスの数として、ワークロード特性テーブル３２に書き込む。この算出方法は、全ての「write」アクセスが等間隔に到着することを仮定した場合に特に正確に算出できる。 In step C3, the workload characteristic generation unit 131 sets the window size to a value obtained by dividing the sum of the number of “read” accesses and the number of “write” accesses by the number of “write” accesses, and the appearance frequency of the window size. Is written in the workload characteristic table 32 as the number of “write” accesses. This calculation method can be calculated particularly accurately when it is assumed that all “write” accesses arrive at equal intervals.

また、ステップＣ５においては、自然数でない正の実数のウィンドウサイズ w から到着ノード数を算出できるようにするために、到着ノード数算出手段１６０はウィンドウサイズが、

及び

であるときの到着ノード数から w に対する到着ノード数を算出する（ステップＣ５）。 In step C5, in order to be able to calculate the number of arrival nodes from a positive real window size w which is not a natural number, the arrival node number calculation means 160 has a window size of

as well as

The number of arriving nodes for w is calculated from the number of arriving nodes at step S5 (step C5).

ここで、

は、それぞれ w 以下の最大の整数、w 以上の最小の整数を表す記号である。 here,

Are symbols representing the largest integer less than or equal to w and the smallest integer greater than or equal to w.

ステップＣ５においては、例えば、到着ノード数算出手段１５１は次の式で正の実数のウィンドウサイズに対して到着ノード数を算出する。

上記の式は、

と

の間で線形補間を行っているが、他の一般に知られた補間方法を用いてもよい。なお、式１８及び式１９で h の記号が使用されているが、第２の実施の形態における式１６に記載された h とは、別の量を表している。 In step C5, for example, the arrival node number calculating means 151 calculates the number of arrival nodes for a positive real window size by the following equation.

The above formula is

When

However, other generally known interpolation methods may be used. Note that the symbol h is used in Equation 18 and Equation 19, but h described in Equation 16 in the second embodiment represents a different amount.

次のＣ６のステップ以降は、第１の実施の形態のＡ６以降と同様であり、線形補間により算出した到着ノード数を用いて、クラスタリング・オーバーヘッド及びスケーラビリティを算出できる。 The subsequent steps after C6 are the same as those after A6 in the first embodiment, and the clustering overhead and scalability can be calculated using the number of arrival nodes calculated by linear interpolation.

本発明の特徴である、「read」アクセスの数と「write」アクセスの数から求めたウィンドウサイズ w が整数でない場合でも、ウィンドウサイズが w 以下の最大の整数であるときの到着ノード数と、w 以上の最小の整数であるときの到着ノード数から、 w に対する到着ノード数を補間により算出する機能については、コンピュータ装置の内部にそのような機能を実現するプログラムを組み込んだ回路部品を実装して実現することも可能である。しかしながら、本発明の特徴的な機能を実現するためのプログラム（アプリケーション）を記憶媒体に記憶させ、コンピュータ装置で当該プログラムを実行することで、クラスタシステムの性能予測装置１２として機能させることもできる。 Even if the window size w obtained from the number of “read” accesses and the number of “write” accesses, which is a feature of the present invention, is not an integer, the number of arrival nodes when the window size is the largest integer equal to or less than w; For the function to calculate the number of arrival nodes for w by interpolation from the number of arrival nodes when the minimum integer is greater than or equal to w, a circuit component incorporating a program that implements such a function is installed inside the computer device. It can also be realized. However, it is also possible to store the program (application) for realizing the characteristic function of the present invention in a storage medium and execute the program on the computer device, thereby functioning as the performance prediction device 12 of the cluster system.

（第３の実施の形態の効果）
以上説明した実施の形態によれば、ワークロード特性情報２２１が「read」アクセスの数と「write」アクセスの数で与えられる場合でも、クラスタシステム４０のクラスタリング・オーバーヘッド及びスケーラビリティを予測できる。 (Effect of the third embodiment)
According to the embodiment described above, the clustering overhead and scalability of the cluster system 40 can be predicted even when the workload characteristic information 221 is given by the number of “read” accesses and the number of “write” accesses.

その理由は、「read」アクセスの数と「write」アクセスの数とから求めたウィンドウサイズ w が整数でない場合でも、ウィンドウサイズが w 以下の最大の整数であるときの到着ノード数とw 以上の最小の整数であるときの到着ノード数から、 w に対する到着ノード数を算出するためである。 The reason is that even if the window size w obtained from the number of “read” accesses and the number of “write” accesses is not an integer, the number of nodes that arrive when the window size is the largest integer less than or equal to w This is because the number of arrival nodes for w is calculated from the number of arrival nodes when the number is the smallest integer.

（第４の実施の形態）
以下、本発明の第４の実施の形態について図面を参照して説明する。 (Fourth embodiment)
The fourth embodiment of the present invention will be described below with reference to the drawings.

上述した第１の実施の形態、第２の実施の形態及び第３の実施の形態においては、クライアントからのアクセス種は「read」と「write」の２種類であった。しかし、本実施の形態においては、アクセス種として「read」と「write」と「get」の３種類がある場合の、クラスタリング・オーバーヘッド及びスケーラビリティの予測について説明する。 In the first embodiment, the second embodiment, and the third embodiment described above, there are two types of access from the client, “read” and “write”. However, in the present embodiment, clustering overhead and scalability prediction when there are three types of access, “read”, “write”, and “get” will be described.

本実施の形態で説明する「get」アクセス（取得のアクセス）では、「read」アクセスと同様にブロックの読出しを行い、最新のブロックを持たないブロックに到着したときは、ブロックコピーを起こす。「write」アクセスしたノード又は「read」アクセスしたノードへクライアントからアクセスした場合と異なり、「get」アクセスしたノードへアクセスしても、ノードでは現在の部分ブロック集合が最新であることの情報は得られない。 In the “get” access (acquisition access) described in the present embodiment, a block is read in the same manner as in the “read” access, and when a block that does not have the latest block is reached, a block copy is generated. Unlike the case where the client accesses the node that accessed “write” or the node that accessed “read”, even if the node that accessed “get” is accessed, the node can obtain information that the current partial block set is the latest. I can't.

このため、過去に「get」アクセスのみが到着したノードに「get」アクセスが到着した場合に、ブロックコピーが発生する。このようになる理由は、「get」アクセスしたノードでは最新であるかどうかの情報が得られないため、「get」アクセスでは最新のブロックを持たないブロックに到着したと判定し、ブロックコピーを起こすためである。一方、過去に「write」又は「read」アクセスが発生したノードへ「get」アクセスが到着しても、ブロックコピーは生じない。上述の説明で、過去の記述は、ブロックコピーが発生するかどうかを判定しようとするノードについて、判定時刻の含まれる１つの「write」アクセスのウィンドウ内で時間を遡ることを意味する。 Therefore, block copy occurs when “get” access arrives at a node where only “get” access has arrived in the past. The reason why this happens is that the node that accessed "get" cannot obtain information about whether it is the latest, so in "get" access, it is determined that a block that does not have the latest block has arrived, and a block copy occurs. Because. On the other hand, even if “get” access arrives at a node where “write” or “read” access has occurred in the past, block copy does not occur. In the above description, the past description means that the time is traced back within one “write” access window including the determination time for a node that is to determine whether or not block copy occurs.

図１４は、本実施の形態によるクラスタシステムの性能予測装置１３の構成を示すブロック図である。 FIG. 14 is a block diagram illustrating a configuration of the performance prediction apparatus 13 of the cluster system according to the present embodiment.

図１４を参照すると、本実施の形態によるクラスタシステムの性能予測装置１３の構成は、図１に示した第１の実施の形態によるクラスタシステムの性能予測装置１０の構成と同様である。しかしながら、コピーコスト生成手段１２１の機能、ワークロード特性生成手段１３２の機能は、第１の実施の形態によるコピーコスト生成手段１２０の機能、ワークロード特性生成手段１３０の機能とそれぞれ異なるものとなっている。また、オーバーヘッド基礎情報２３に含まれるコピーコスト情報２１１、ワークロード特性情報２２２は、第１の実施の形態によるオーバーヘッド基礎情報２０に含まれるコピーコスト情報２１０、ワークロード特性情報２２０とそれぞれ異なる。 Referring to FIG. 14, the configuration of the cluster system performance prediction apparatus 13 according to the present embodiment is the same as the configuration of the cluster system performance prediction apparatus 10 according to the first embodiment shown in FIG. However, the functions of the copy cost generation unit 121 and the workload characteristic generation unit 132 are different from the functions of the copy cost generation unit 120 and the workload characteristic generation unit 130 according to the first embodiment. Yes. The copy cost information 211 and the workload characteristic information 222 included in the overhead basic information 23 are different from the copy cost information 210 and the workload characteristic information 220 included in the overhead basic information 20 according to the first embodiment.

コピーコスト生成手段１２１の機能、ワークロード特性生成手段１３２の機能、コピーコスト情報２１１及びワークロード特性情報２２２については、図１６で説明する。 The function of the copy cost generation unit 121, the function of the workload characteristic generation unit 132, the copy cost information 211, and the workload characteristic information 222 will be described with reference to FIG.

図１５は、本実施の形態による特性テーブル３５の内容の構成を示すブロック図である。 FIG. 15 is a block diagram showing a configuration of contents of the characteristic table 35 according to the present embodiment.

図１５を参照すると、本実施の形態による特性テーブル３５は、コピーコストテーブル３１とワークロード特性テーブル３２と負荷分散特性テーブル３３とコピーコストテーブル（ＧＥＴ）３８とワークロード特性テーブル（ＧＥＴ）３９による構成となっている。なお、ワークロード特性テーブル３２及び負荷分散特性テーブル３３については構成の詳細を図示していないが、これらのテーブルの構成は、図３に示した構成と同様である。 Referring to FIG. 15, the characteristic table 35 according to the present embodiment is based on a copy cost table 31, a workload characteristic table 32, a load distribution characteristic table 33, a copy cost table (GET) 38, and a workload characteristic table (GET) 39. It has a configuration. Details of the configuration of the workload characteristic table 32 and the load distribution characteristic table 33 are not shown, but the configuration of these tables is the same as that shown in FIG.

本実施の形態による特性テーブル３５では、第１の実施の形態の特性テーブル３０に対して、コピーコストテーブル（ＧＥＴ）３８及びワークロード特性テーブル（ＧＥＴ）３９が追加されている。 In the characteristic table 35 according to the present embodiment, a copy cost table (GET) 38 and a workload characteristic table (GET) 39 are added to the characteristic table 30 of the first embodiment.

コピーコストテーブル（ＧＥＴ）３８に書き込まれるコピーコスト（ＧＥＴ）は、「get」アクセスによるブロックコピーが１回発生したときに１つのＣＰＵが消費される割合を表す。ＣＰＵが消費される割合は、第１の実施の形態と同じく、例えば単位時間あたりのＣＰＵ実行時間で表す。 The copy cost (GET) written in the copy cost table (GET) 38 represents a rate at which one CPU is consumed when one block copy by “get” access occurs. The ratio of CPU consumption is represented by CPU execution time per unit time, for example, as in the first embodiment.

ワークロード特性テーブル（ＧＥＴ）３９は、ワークロード特性テーブル３２と同様、部分ブロック集合ＩＤを各行に、「read」アクセスに関する「write」アクセスのウィンドウサイズを各列に配置したテーブルである。 Similar to the workload characteristics table 32, the workload characteristics table (GET) 39 is a table in which the partial block set ID is arranged in each row and the window size of “write” access related to “read” access is arranged in each column.

本実施の形態では、「read」アクセスに関する「write」アクセスのウィンドウサイズとは、例えば「write, read, read, read, (write) 」のようなアクセスシーケンスのウィンドウサイズを表すこととする。なお、アクセスシーケンスの間に「get」アクセスを挟むこともあるが、その場合は当該ウィンドウサイズには「get」アクセスの数を含まないものとする。 In the present embodiment, the window size of “write” access related to “read” access represents the window size of an access sequence such as “write, read, read, read, (write)”. Note that “get” access may be sandwiched between access sequences, but in this case, the window size does not include the number of “get” accesses.

ワークロード特性テーブル（ＧＥＴ）３９のウィンドウサイズ（write & read）と書かれた区画（レコード）には、「get」アクセスに関する、「write」アクセス又は「read」アクセスのウィンドウサイズが記録される。ここで、「get」アクセスに関する、「write」アクセス又は「read」アクセスのウィンドウサイズとは、アクセスシーケンスの間に「get」アクセスを挟む「read」アクセスに関する「write」アクセスのウィンドウにおいて、「get」アクセスに関する「write」アクセスのウィンドウサイズ又は「get」アクセスに関する「read」アクセスのウィンドウサイズを表す。ここで、「get」アクセスに関する「write」アクセスのウィンドウサイズとは、例えば「write, get, get,(read) 」のようなアクセスシーケンスのウィンドウサイズを表すこととする。 In the section (record) written as window size (write & read) in the workload characteristic table (GET) 39, the window size of “write” access or “read” access related to “get” access is recorded. Here, the window size of “write” access or “read” access related to “get” access means “get” access window for “read” access that sandwiches “get” access between access sequences. The window size of “write” access related to “access” or the window size of “read” access related to “get” access. Here, the window size of “write” access related to “get” access represents the window size of an access sequence such as “write, get, get, (read)”.

例えば、「write, get, get, read, get, get, read, get, get, read, get, get, (write) 」というアクセスシーケンスでは、「read」アクセスに関する「write」アクセスのウィンドウサイズが４であり、「get」アクセスに関する、「write」アクセス又は「read」アクセスのウィンドウサイズが３である。 For example, in the access sequence “write, get, get, read, get, get, read, get, get, read, get, get, (write)”, the window size of “write” access related to “read” access is 4 The window size of “write” access or “read” access relating to “get” access is 3.

上述のシーケンスが発生した部分、ワークロード特性テーブル（ＧＥＴ）３９では、該当する部分ブロック集合ＩＤの行とウィンドウサイズ（write）が４と書かれた列との交差する区画には３と書かれる。 In the workload characteristic table (GET) 39 where the above-described sequence occurs, 3 is written in a section where the row of the corresponding partial block set ID and the column where the window size (write) is written as 4. .

上記のアクセスシーケンスでは、「get」アクセスに関する「write」アクセスのウィンドウサイズと、「read」アクセスのウィンドウサイズは共に３で一致している。しかしながら、「get」アクセスに関するウィンドウサイズが、「write」アクセスのウィンドウと「read」アクセスのウィンドウで一致しない場合には、なんらかの１つの代表値を「get」アクセスに関する、「write」アクセス及び「read」アクセスのウィンドウサイズとして使用する。後述する式２０で計算する w'は、このような代表値の例である。 In the above access sequence, the window size of “write” access related to “get” access and the window size of “read” access are both 3. However, if the window size for “get” access does not match between the window for “write” access and the window for “read” access, one representative value is assigned to “write” access and “read” for “get” access. Use as window size for access. W ′ calculated by Expression 20 described later is an example of such a representative value.

本発明の特徴である、「read」アクセスのブロックコピー回数及び「get」アクセスのブロックコピー回数からクラスタリング・オーバーヘッドを計算する機能については、コンピュータ装置の内部にそのような機能を実現するプログラムを組み込んだ回路部品を実装して実現することも可能である。しかしながら、本発明の特徴的な機能を実現するためのプログラム（アプリケーション）を記憶媒体に記憶させ、コンピュータ装置で当該プログラムを実行することで、クラスタシステムの性能予測装置１３として機能させることもできる。 Regarding the function of calculating the clustering overhead from the number of block copies of “read” access and the number of block copies of “get” access, which is a feature of the present invention, a program for realizing such a function is incorporated in the computer device. It can also be realized by mounting circuit components. However, it is also possible to store the program (application) for realizing the characteristic function of the present invention in a storage medium and execute the program on the computer device, thereby functioning as the performance prediction device 13 of the cluster system.

図１６は、本実施の形態によるクラスタシステムの性能予測装置１３の動作を説明するためのフローチャートである。以下の説明では、必要に応じて図１４の主要な部分を参照する。 FIG. 16 is a flowchart for explaining the operation of the performance prediction device 13 of the cluster system according to this embodiment. In the following description, reference is made to the main part of FIG. 14 as necessary.

図１６を参照すると、本実施の形態のフローチャートは、図３に示した第１の実施の形態のフローチャートに、ステップＤ３、Ｄ５、Ｄ８、Ｄ１０、Ｄ１２を加えたものである。これらのステップは、「get」アクセスによるブロックコピー回数を算出するためのステップである。 Referring to FIG. 16, the flowchart of the present embodiment is obtained by adding steps D3, D5, D8, D10, and D12 to the flowchart of the first embodiment shown in FIG. These steps are steps for calculating the number of block copies by “get” access.

最初に、入力手段１１０はオーバーヘッド基礎情報２３を入力する（ステップＤ１）。 First, the input means 110 inputs the overhead basic information 23 (step D1).

本実施の形態による、オーバーヘッド基礎情報２３のコピーコスト情報２１１には、「read」アクセスによるブロックコピーによって生じるコピーコストと「get」アクセスによるブロックコピーによって生じるコピーコストに関する情報が含まれる。 According to the present embodiment, the copy cost information 211 of the overhead basic information 23 includes information on the copy cost caused by block copy by “read” access and the copy cost caused by block copy by “get” access.

また、ワークロード特性情報２２２には、「read」アクセスに関するワークロード特性に加えて、部分ブロック集合毎の「get」アクセスの数と、「write」アクセスと「read」アクセスの数の和が与えられているとする。以下では、部分ブロック集合毎の「get」アクセスの数を g で表し、また「write」アクセスと「read」アクセスの数の和を wr で表す。 In addition to the workload characteristics related to “read” access, the workload characteristics information 222 includes the number of “get” accesses for each partial block set and the sum of the numbers of “write” accesses and “read” accesses. Suppose that Below, the number of “get” accesses for each partial block set is represented by g, and the sum of the numbers of “write” accesses and “read” accesses is represented by wr.

次に、コピーコスト生成手段１２１がオーバーヘッド基礎情報２３に含まれるコピーコスト情報２１１から、コピーコストテーブル３１を生成する（ステップＤ２）。ステップＤ２は、図３のステップＡ２に対応する。 Next, the copy cost generation unit 121 generates the copy cost table 31 from the copy cost information 211 included in the overhead basic information 23 (step D2). Step D2 corresponds to step A2 in FIG.

次に、コピーコスト生成手段１２１がオーバーヘッド基礎情報２３に含まれるコピーコスト情報２１１から、コピーコストテーブル（ＧＥＴ）３８を生成する（ステップＤ３）。 Next, the copy cost generation unit 121 generates a copy cost table (GET) 38 from the copy cost information 211 included in the overhead basic information 23 (step D3).

本実施の形態では、「get」アクセスのブロックコピー１回で発生するコピーコストが、コピーコスト情報２１１にコピーコスト（ＧＥＴ）として含まれている。したがって、コピーコスト生成手段１２１はコピーコスト情報２１１から、コピーコスト（ＧＥＴ）をコピーコストテーブル（ＧＥＴ）３８に書き込む。 In the present embodiment, the copy cost that occurs in one block copy of “get” access is included in the copy cost information 211 as a copy cost (GET). Therefore, the copy cost generation unit 121 writes the copy cost (GET) from the copy cost information 211 into the copy cost table (GET) 38.

次に、ワークロード特性生成手段１３２は、オーバーヘッド基礎情報２３に含まれるワークロード特性情報２２２から、ワークロード特性テーブル３２を生成する（ステップＤ４）。 Next, the workload characteristic generation unit 132 generates the workload characteristic table 32 from the workload characteristic information 222 included in the overhead basic information 23 (step D4).

ステップＤ４は、図３のステップＡ３に対応する。 Step D4 corresponds to step A3 in FIG.

次に、ワークロード特性生成手段１３２は、部分ブロック集合毎に、「get」アクセスに関する、「write」アクセス及び「read」アクセスに関するウィンドウサイズ w'

ｗ′＝（ｇ＋ｗｒ）／ｗｒ・・・（式２０）

をワークロード特性テーブル（ＧＥＴ）３９の対象とする部分ブロック集合の行の全てのレコードに書き込む（ステップＤ５）。 Next, the workload characteristic generation unit 132 sets, for each partial block set, a window size w ′ related to “write” access and “read” access related to “get” access.

w ′ = (g + wr) / wr (Equation 20)

Are written in all the records in the row of the partial block set targeted by the workload characteristic table (GET) 39 (step D5).

負荷分散特性生成手段１４０は、オーバーヘッド基礎情報２３に含まれる負荷分散特性情報２３０から負荷分散特性テーブル３３を生成する（ステップＤ６）。 The load distribution characteristic generation unit 140 generates the load distribution characteristic table 33 from the load distribution characteristic information 230 included in the overhead basic information 23 (step D6).

ステップＤ６は、図３のステップＡ４に対応する。 Step D6 corresponds to step A4 in FIG.

到着ノード数算出手段１５０は、与えられた負荷分散特性テーブル３３に関して、到着ノード数がウィンドウサイズに対して（広義の意味で）単調増加するように「read」アクセスの到着ノード数を算出する（ステップＤ７）。 The arrival node number calculation means 150 calculates the number of arrival nodes of “read” access so that the number of arrival nodes monotonically increases (in a broad sense) with respect to the window size, for the given load distribution characteristic table 33 ( Step D7).

ステップＤ７は、図３のステップＡ５に対応する。 Step D7 corresponds to step A5 in FIG.

到着ノード数算出手段１５０は、ワークロード特性テーブル２２とワークロード特性テーブル（ＧＥＴ）３９から、「read」アクセスに関する「write」アクセスのウィンドウ内における、「get」アクセスの到着ノード数を算出する。但し、「read」アクセスの到着ノード数の場合とは異なり、「get」アクセスの到着ノード数は、１つのウィンドウ内で、過去に一度も「write」アクセスまたは「read」アクセスが到着したことがないノードへ到着した「get」アクセスの数であるとして到着ノード数を算出する（ステップＤ８）。 The arrival node number calculating means 150 calculates the number of arrival nodes of “get” access in the “write” access window related to “read” access from the workload characteristic table 22 and the workload characteristic table (GET) 39. However, unlike the case of the number of arrival nodes for “read” access, the number of arrival nodes for “get” access indicates that “write” access or “read” access has arrived in the past in one window. The number of arrival nodes is calculated assuming that the number of “get” accesses that have arrived at a node that does not exist (step D8).

「get」アクセスの到着ノード数は、クラスタサイズ n と、「read」アクセスに関する「write」アクセスのウィンドウサイズ w 、「get」アクセスに関する「write」アクセス又は「read」アクセスのウィンドウサイズ g(w) から算出する。以下では、「get」アクセスに関する「write」アクセス又は「read」アクセスのウィンドウサイズを g(w) で表す。 The number of arrival nodes for “get” access is the cluster size n, the window size w for “write” access for “read” access, and the window size for write access or “read” access for “get” access g (w) Calculate from In the following, the window size of “write” access or “read” access related to “get” access is represented by g (w).

g(w) は、ワークロード特性テーブル（ＧＥＴ）３９から求められる。そして、与えられた負荷分散特性テーブル３３に対して、「get」アクセスの到着ノード数 a'(n, w) を w, g(w) に関して単調に増加するように算出する。 g (w) is obtained from the workload characteristic table (GET) 39. Then, for the given load distribution characteristic table 33, the number of arrival nodes a ′ (n, w) of “get” access is calculated so as to increase monotonously with respect to w and g (w).

「write」アクセスのウィンドウサイズ w が与えられたときの、「get」アクセスの到着ノード数 a'(n, w) の算出方法の例を以下の式に示す。

オーバーヘッド予測手段１６０は、「read」アクセスの到着ノード数からブロックコピー回数を算出する（ステップＤ９）。 An example of a calculation method of the number of arrival nodes a ′ (n, w) of “get” access when the window size w of “write” access is given is shown in the following formula.

The overhead prediction unit 160 calculates the number of block copies from the number of arrival nodes of “read” access (step D9).

ステップＤ９は図３のステップＡ６に対応する。 Step D9 corresponds to step A6 in FIG.

オーバーヘッド予測手段１６０は、「get」アクセスの到着ノード数からブロックコピー回数（ＧＥＴ）を算出する（ステップＤ１０）。 The overhead prediction unit 160 calculates the number of block copies (GET) from the number of arrival nodes of “get” access (step D10).

W を出現回数が０でない「write」アクセスのウィンドウサイズの集合、f (w) を「write」アクセスのウィンドウサイズ w の出現回数として、

とする。B'(n,W) は、部分ブロック集合に対して発生する「get」アクセスによるブロックコピー回数を表している。 Let W be the set of window sizes for “write” access with non-zero occurrences, and f (w) be the number of occurrences of window size w for “write” access.

And B ′ (n, W) represents the number of times of block copy by “get” access generated for the partial block set.

ここで、特性テーブル３５に出現する各部分ブロック集合をS_i として

ｂ′(Ｓ_ｉ）＝ｂ′(Ｅ，Ｗ）・・・（式２３）

とする。また、S_i ∈ S とする。そして、「get」アクセスによって発生するブロックコピー回数 B'(S) を、

と算出する。ここで、ブロックコピー回数 B'(S)は単位時間当たりのブロックコピー回数である。 Here, each partial block set appearing in the characteristic table 35 is defined as S _i.

b ′ (S _i ) = b ′ (E, W) (Equation 23)

And Also, let S _i ∈ S. Then, the block copy count B '(S) generated by the "get" access is

And calculate. Here, the number of block copies B ′ (S) is the number of block copies per unit time.

次に、オーバーヘッド予測手段１６０は、「read」アクセスによって発生するブロックコピー回数からクラスタリング・オーバーヘッドを算出する（ステップＤ１１）。 Next, the overhead prediction unit 160 calculates a clustering overhead from the number of block copies generated by the “read” access (step D11).

ステップＤ１１は、図３のステップＡ７に対応する。 Step D11 corresponds to step A7 in FIG.

「get」アクセスによるコピーコスト（ＧＥＴ）を copy(get) 、「get」アクセスによるクラスタリング・オーバーヘッドをoverhead(get) 、スケーラビリティをscalability と表すことにすると、オーバーヘッド予測手段１６０は、

を用いて、「get」アクセスによるクラスタリング・オーバーヘッドと、スケーラビリティを算出する（ステップＤ１２）。 Assuming that the copy cost (GET) by “get” access is copy (get), the clustering overhead by “get” access is overhead (get), and the scalability is scalability, the overhead predicting means 160

Is used to calculate the clustering overhead and scalability due to “get” access (step D12).

最後に、出力手段１７０は、「read」アクセスによるクラスタリング・オーバーヘッド、「get」アクセスによるクラスタリング・オーバーヘッド及びスケーラビリティを出力する（ステップＤ１３）。 Finally, the output unit 170 outputs the clustering overhead by the “read” access, the clustering overhead by the “get” access, and the scalability (step D13).

（第４の実施の形態の効果）
以上説明した実施の形態によれば、「write」アクセス及び「read」アクセスに加えて「get」アクセスがあるクラスタシステム４０の場合に対しても、クラスタリング・オーバーヘッド及びスケーラビリティを計算できる。 (Effect of the fourth embodiment)
According to the embodiment described above, the clustering overhead and scalability can be calculated even in the case of the cluster system 40 having “get” access in addition to “write” access and “read” access.

その理由は、「read」アクセスのブロックコピー回数及び「get」アクセスのブロックコピー回数からクラスタリング・オーバーヘッドを計算するためである。 This is because the clustering overhead is calculated from the number of block copies for “read” access and the number of block copies for “get” access.

本発明のクラスタシステムの性能予測装置１０、１１、１２、１３は、その動作をハードウェア的に実現することは勿論として、上記した各手段を実行する性能予測プログラム（アプリケーション）３００をコンピュータ処理装置であるクラスタシステムの性能予測装置１０、１１、１２、１３で実行することにより、ソフトウェア的に実現することができる。この性能予測プログラム３００は、磁気ディスク、半導体メモリその他の記録媒体に格納され、その記録媒体からクラスタシステムの性能予測装置１０、１１、１２、１３にロードされ、その動作を制御することにより、上述した各機能を実現する。 The performance prediction apparatuses 10, 11, 12, and 13 of the cluster system of the present invention not only realize the operation in hardware, but also execute a performance prediction program (application) 300 for executing each of the above-described means as a computer processing apparatus. It can be realized by software by being executed by the performance prediction apparatuses 10, 11, 12, and 13 of the cluster system. The performance prediction program 300 is stored in a magnetic disk, a semiconductor memory, or other recording medium, loaded from the recording medium to the performance prediction apparatuses 10, 11, 12, and 13 of the cluster system, and controls the operation thereof. Each function is realized.

以上好ましい複数の実施の形態をあげて本発明を説明したが、本発明は必ずしも、上記実施の形態に限定されるものでなく、その技術的思想の範囲内において様々に変形して実施することができる。 Although the present invention has been described above with a plurality of preferred embodiments, the present invention is not necessarily limited to the above-described embodiments, and various modifications may be made within the scope of the technical idea. Can do.

本発明の第１の実施の形態によるクラスタシステムの性能予測装置の構成を示すブロック図である。It is a block diagram which shows the structure of the performance prediction apparatus of the cluster system by the 1st Embodiment of this invention. 本発明の第１の実施の形態の対象とするクラスタシステムの一例を示す図である。It is a figure which shows an example of the cluster system made into the object of the 1st Embodiment of this invention. 本発明の第１の実施の形態による特性テーブルの内容の構成を示すブロック図である。It is a block diagram which shows the structure of the content of the characteristic table by the 1st Embodiment of this invention. 本発明の第１の実施の形態によるクラスタシステムの性能予測装置の動作を説明するためのフローチャートである。It is a flowchart for demonstrating operation | movement of the performance prediction apparatus of the cluster system by the 1st Embodiment of this invention. 本発明の第１の実施の形態によるワークロード特性テーブル例を示す図である。It is a figure which shows the example of a workload characteristic table by the 1st Embodiment of this invention. 本発明の第１の実施の形態による負荷分散特性テーブル例を示す図である。It is a figure which shows the example of a load distribution characteristic table by the 1st Embodiment of this invention. 本発明の第１の実施の形態によるクラスタシステムの性能予測装置のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of the performance prediction apparatus of the cluster system by the 1st Embodiment of this invention. 本発明の第２の実施の形態によるクラスタシステムの性能予測装置の構成を示すブロック図である。It is a block diagram which shows the structure of the performance prediction apparatus of the cluster system by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による遷移特性テーブル集合の一例を示す図である。It is a figure which shows an example of the transition characteristic table set by the 2nd Embodiment of this invention. 本発明の第２の実施の形態によるクラスタシステムの性能予測装置における２次元の遷移特性テーブル例を示す図である。It is a figure which shows the example of a two-dimensional transition characteristic table in the performance prediction apparatus of the cluster system by the 2nd Embodiment of this invention. 本発明の第２の実施の形態による性能予測システムの動作を説明するためのフローチャートである。It is a flowchart for demonstrating operation | movement of the performance prediction system by the 2nd Embodiment of this invention. 本発明の第３の実施の形態によるクラスタシステムの性能予測装置の構成を示すブロック図である。It is a block diagram which shows the structure of the performance prediction apparatus of the cluster system by the 3rd Embodiment of this invention. 本発明の第３の実施の形態によるクラスタシステムの性能予測装置の動作を説明するためのフローチャートである。It is a flowchart for demonstrating operation | movement of the performance prediction apparatus of the cluster system by the 3rd Embodiment of this invention. 本発明の第４の実施の形態によるクラスタシステムの性能予測装置の構成を示すブロック図である。It is a block diagram which shows the structure of the performance prediction apparatus of the cluster system by the 4th Embodiment of this invention. 本発明の第４の実施の形態による特性テーブルの内容の構成を示すブロック図である。It is a block diagram which shows the structure of the content of the characteristic table by the 4th Embodiment of this invention. 本発明の第４の実施の形態によるクラスタシステムの性能予測装置の動作を説明するためのフローチャートである。It is a flowchart for demonstrating operation | movement of the performance prediction apparatus of the cluster system by the 4th Embodiment of this invention.

Explanation of symbols

１０、１１、１２、１３：クラスタシステムの性能予測装置
２０、２１、２２、２３：オーバーヘッド基礎情報
３０、３５：特性テーブル
３１：コピーコストテーブル
３２：ワークロード特性テーブル
３２Ａ：ワークロード特性テーブル例
３３：負荷分散特性テーブル
３３Ａ：負荷分散特性テーブル例
３６：遷移特性テーブル集合
３６Ａ：遷移特性テーブル
３６Ｂ：遷移特性テーブル
３７：遷移特性テーブル例
３８：コピーコストテーブル（ＧＥＴ）
３９：ワークロード特性テーブル（ＧＥＴ）
４０：クラスタシステム
１０１：ＣＰＵ
１０２：主記憶部
１０３：通信部
１０４：出力部
１０５：入力部
１０６：補助記憶部
１１０：入力手段
１２０、１２１：コピーコスト生成手段
１３０、１３１、１３２：ワークロード特性生成手段
１４０、１４１：負荷分散特性生成手段
１５０、１５１：到着ノード数算出手段
１６０：オーバーヘッド予測手段
１７０：出力手段
２１０、２１１：コピーコスト情報
２２０、２２１、２２２：ワークロード特性情報
２３０、２３１：負荷分散特性情報
３００：性能予測プログラム 10, 11, 12, 13: Cluster system performance prediction device 20, 21, 22, 23: Overhead basic information 30, 35: Characteristic table 31: Copy cost table 32: Workload characteristic table 32A: Workload characteristic table example 33 : Load distribution characteristic table 33A: Load distribution characteristic table example 36: Transition characteristic table set 36A: Transition characteristic table 36B: Transition characteristic table 37: Transition characteristic table example 38: Copy cost table (GET)
39: Workload characteristics table (GET)
40: Cluster system 101: CPU
102: main storage unit 103: communication unit 104: output unit 105: input unit 106: auxiliary storage unit 110: input unit 120, 121: copy cost generation unit 130, 131, 132: workload characteristic generation unit 140, 141: load Distribution characteristic generation means 150, 151: Arrival node number calculation means 160: Overhead prediction means 170: Output means 210, 211: Copy cost information 220, 221, 222: Workload characteristic information 230, 231: Load distribution characteristic information 300: Performance Prediction program

Claims

A performance prediction device for a cluster system that shares data on main storage devices of a plurality of nodes connected via a communication line between the nodes,
Calculating the number of block copies per unit time made between the nodes in units of memory blocks in the main storage device;
A cluster system performance prediction apparatus, wherein clustering overhead generated in the cluster system is calculated based on the block copy count and CPU load information per block copy.

The cluster system performance prediction apparatus according to claim 1, wherein scalability of the cluster system is calculated based on the calculated clustering overhead and the number of nodes.

The block copy count is calculated for each window size, which is the number of accesses from the time when a specific type of access is made to the memory block set until the next access of the same type, and the block copy count for each window size. The cluster system performance prediction apparatus according to claim 1, wherein the number of block copies per unit time performed between the nodes is calculated based on the number of partial block copies.

4. The performance prediction apparatus for a cluster system according to claim 3, wherein the specific type of access is a write access, and access other than the specific type of access is a read access.

5. The performance of the cluster system according to claim 3, wherein the partial block copy count is calculated from an arrival node count that is an expected value of the number of nodes arriving at the window size count. Prediction device.

Calculating the probability of occurrence of an event in which access to the plurality of nodes is made a natural number of times under the condition that the sum of the number of accesses to all nodes is equal to the window size, and the occurrence of the calculated event The cluster system performance prediction apparatus according to claim 5, wherein the number of arrival nodes is calculated based on a probability of performing the operation.

The cluster system performance prediction apparatus according to claim 6, wherein the probability of occurrence of the event is calculated using a multinomial distribution or a multidimensional normal distribution.

6. The cluster system according to claim 5, wherein the number of arrival nodes is calculated using a polynomial, an exponential function expression, or a series expression in which the number of arrival nodes monotonously increases as the window size increases. Performance prediction device.

The number of arriving nodes is calculated based on a probability of appearance of an access sequence to the node realized by the number of accesses of the window size and a number of nodes receiving access by the access sequence. Item 6. The cluster system performance prediction device according to Item 5.

The probability of the appearance of an access sequence to a node from the previous several times to the natural number of times is calculated from the appearance probability of the access sequence from the previous time to the number of times obtained by adding 1 to the natural number of times, and based on the calculated result, The cluster system performance prediction apparatus according to claim 9, wherein a probability of appearance of an access sequence to the node realized by accessing the window size times is calculated.

The window size, which is the number of accesses from the time when a particular type of access is made to a set of memory blocks until the next type of access is made, is the number of read accesses and the number of write accesses included in the access. The cluster system performance prediction apparatus according to claim 1, wherein the sum is calculated by dividing the sum by the number of write accesses.

When the window size is an integer equal to or larger than the window size and when the window size is an integer equal to or smaller than the window size, the access to the plurality of nodes is performed under the condition that the total number of accesses is equal to the window size. Calculating the probability of occurrence of an event that is made any number of times, and calculating the number of arrival nodes, which is an expected value of the number of nodes arriving at the number of accesses of the window size, based on the calculated result. The performance prediction device for a cluster system according to claim 11.

The specific type of access is a write access, the access other than the specific type of access is a read access and an acquisition access, and the acquisition access is an access made only by the acquisition access in the past. 4. The cluster system performance prediction apparatus according to claim 3, wherein a block is read by a node that has been read and a block that has been previously accessed for write or read is not read by a block. .

The block copy is a block copy by the read access and a block copy by the acquisition access, and the CPU load is a CPU load by the read access block copy and a CPU load by the acquisition access block copy. The cluster system performance prediction apparatus according to claim 13.

A performance prediction method for a cluster system in which data on a main storage device of a plurality of nodes connected via a communication line is shared between the nodes,
Calculating the number of block copies per unit time made between the nodes in units of memory blocks in the main storage device;
Based on the block copy count and CPU load information per block copy, the clustering overhead generated in the cluster system is calculated,
A performance prediction method comprising: calculating scalability of the cluster system based on the calculated clustering overhead and the number of nodes.

The block copy count is calculated for each window size, which is the number of accesses from the time when a specific type of access is made to the memory block set until the next access of the same type, and the block copy count for each window size. The performance prediction method according to claim 15, wherein the number of block copies per unit time made between the nodes is calculated based on the number of partial block copies.

17. The performance prediction method according to claim 16, wherein the specific type of access is a write access, and an access excluding the specific type of access is a read access.

18. The performance prediction method according to claim 16, wherein the partial block copy count is calculated from an arrival node count that is an expected value of the number of nodes arriving at the window size count.

The probability of the occurrence of an event in which the access to the plurality of nodes is made a natural number of times is calculated under the condition that the sum of the number of accesses in all the nodes is equal to the window size, and the calculated occurrence of the event 19. The performance prediction method according to claim 18, wherein the number of arrival nodes is calculated based on a probability.

A performance prediction program for predicting the performance of a cluster system that is executed on a computer processing device and that shares data on main storage devices of a plurality of nodes connected via a communication line between the nodes,
Calculating the number of block copies per unit time made between the nodes in units of memory blocks in the main storage device;
Based on the block copy count and CPU load information per block copy, the clustering overhead generated in the cluster system is calculated,
A performance prediction program comprising a function of calculating scalability of the cluster system based on the calculated clustering overhead and the number of nodes.

The block copy count is calculated for each window size, which is the number of accesses from the time when a specific type of access is made to the memory block set until the next access of the same type, and the block copy count for each window size. 21. The performance prediction program according to claim 20, further comprising a function of calculating the number of block copies per unit time made between the nodes based on the number of partial block copies.

The performance prediction program according to claim 21, wherein the specific type of access is a write access, and access other than the specific type of access is a read access.

23. The performance according to claim 21, further comprising a function of calculating the number of partial block copies from an arrival node number that is an expected value of the number of nodes arriving at the number of accesses of the window size. Prediction program.

The probability of the occurrence of an event in which the access to the plurality of nodes is made a natural number of times is calculated under the condition that the sum of the number of accesses in all the nodes is equal to the window size, and the calculated occurrence of the event The performance prediction program according to claim 23, having a function of calculating the number of arrival nodes based on a probability.

The performance prediction program according to claim 24, having a function of calculating the probability of occurrence of the event using a multinomial distribution or a multidimensional normal distribution.

It has a function of calculating the number of arrival nodes based on the probability of appearance of an access sequence to the node realized by the number of accesses of the window size and the number of nodes that are accessed by the access sequence. The performance prediction program according to claim 23.

The window size, which is the number of accesses from the time when a particular type of access is made to a set of memory blocks until the next type of access is made, is the number of read accesses and the number of write accesses included in the access. 21. The performance prediction program according to claim 20, further comprising: a function of calculating the sum of the above and the number of write accesses.

The specific type of access is a write access, the access other than the specific type of access is a read access and an acquisition access, and the acquisition access is an access made only by the acquisition access in the past. 23. The performance prediction program according to claim 21, wherein said node has a function of reading a block at a node and not reading a block at a node to which said write access or said read access has been made in the past. .