JP6796994B2

JP6796994B2 - Information processing system

Info

Publication number: JP6796994B2
Application number: JP2016208345A
Authority: JP
Inventors: 芳樹松浦; 辰彦宮田; 衣津美水谷; 哲朗安部
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2016-10-25
Filing date: 2016-10-25
Publication date: 2020-12-09
Anticipated expiration: 2036-10-25
Also published as: JP2018072907A; WO2018079162A1

Description

本発明は、複数のサーバを含む情報処理システム及びその制御方法に関するものである。 The present invention relates to an information processing system including a plurality of servers and a control method thereof.

近年、人工知能や機械学習などのように、大量のデータを網羅的に繰返し分析して、人が想定し得ない結果を導き出す分析アプリケーションが注目を集めている。このようなアプリケーションは、結果を導き出すまでに長時間掛かるため、繰返し処理部分に並列コンピューティングシステムを適用して、実行時間を短縮することが望まれている。しかし、アプリケーションの分析対象となるデータ量や分析パラメータ（たとえば、データの分割粒度など）により、実行時間が異なるため、アプリケーションを利用する分析者は、希望する実行時間以内に処理を完了するために、どれだけの計算機リソースを準備しておけばよいか決定することが困難である。 In recent years, analysis applications such as artificial intelligence and machine learning that comprehensively and repeatedly analyze a large amount of data to derive results that cannot be expected by humans have been attracting attention. Since it takes a long time to derive a result in such an application, it is desired to apply a parallel computing system to the iterative processing part to shorten the execution time. However, the execution time differs depending on the amount of data to be analyzed by the application and the analysis parameters (for example, data division granularity), so the analyst using the application can complete the process within the desired execution time. , It is difficult to determine how much computer resources should be prepared.

このような分野の背景技術として、特許文献１では、クラウドを活用して、アプリケーションに対する処理の需要を予測して、クラウドのリソースを自動で拡張及び縮小するアプリケーション・リソース・マネージャを提供している。 As a background technology in such a field, Patent Document 1 provides an application resource manager that uses the cloud to predict the processing demand for an application and automatically expands and contracts the resources of the cloud. ..

特表２０１４−５２７２２１号公報Special Table 2014-527221

特許文献１に記載されたアプリケーション・リソース・マネージャを用いれば、アプリケーションの負荷状況を予測して、指定されたポリシーに基づき迅速に計算機リソースを確保すると共に、イメージを高速配備（プロビジョニング）、もしくは使用されてないイメージをスタッシュして、アプリケーションの処理負荷を動的に変更することができる。これにより、アプリケーション利用者は、事前に計算機リソース量を決定しなくても、ポリシーに基づいた計算機リソースを利用することができる。 By using the application resource manager described in Patent Document 1, the load status of the application is predicted, computer resources are quickly secured based on the specified policy, and the image is rapidly deployed (provisioned) or used. You can stash unprovisioned images to dynamically change the processing load of your application. As a result, the application user can use the computer resource based on the policy without determining the computer resource amount in advance.

しかしながら、アプリケーション・リソース・マネージャで想定されているポリシーは、継続的にアプリケーションが実行されているときの負荷変動に対して、一定に保つように計算機リソースを確保する方法であり、アプリケーション利用者が、実行要求毎に利用形態やコスト等を鑑みて、計算機リソース量を決定するようなケースは想定されていない。 However, the policy envisioned by the application resource manager is a method of securing computer resources so that the load fluctuations when the application is continuously executed are kept constant, and the application user can use it. , It is not assumed that the amount of computer resources is determined for each execution request in consideration of the usage pattern and cost.

たとえば、アプリケーション利用者が、分析パラメータを試行錯誤しながら調整する利用形態を想定した場合、最初は分析粒度を粗く検証するために短実行時間であまり計算機リソースを使わず、すなわち計算機リソースにコストを掛けずに分析を行い、詳細分析をする際に、分析粒度を細かくするように分析パラメータを設定して、実行時間を短縮させるために、コストを掛けて計算機リソースを多めに利用したい、などのニーズが考えられる。このような利用形態の場合、アプリケーション利用者ごとに必要とする計算機リソース量が異なるために、特許文献１に記載の技術のように、ポリシーに基づく計算機リソースの確保を行う方法では、対応が困難である。 For example, if an application user assumes a usage pattern in which analysis parameters are adjusted by trial and error, at first, in order to roughly verify the analysis granularity, a short execution time and not much computer resources are used, that is, the cost to the computer resources is increased. When performing analysis without multiplication and performing detailed analysis, you want to set analysis parameters so that the analysis granularity is fine, and use more computer resources at a cost to shorten the execution time. Needs can be considered. In the case of such a usage pattern, since the amount of computer resources required for each application user is different, it is difficult to deal with the method of securing computer resources based on the policy as in the technique described in Patent Document 1. Is.

上記目的を達成するために、本発明の一実施形態に係る情報処理システムは、管理サーバと、アプリケーションプログラムを実行するための１以上のプロセッサを備えた処理サーバを複数有する。管理サーバは、ユーザからアプリケーションプログラムの並列度を受領すると、複数の処理サーバの有する使用可能な計算機リソースの中から、受領した並列度でアプリケーションプログラムを実行するために必要な計算機リソースを確保し、確保された計算機リソースを有する処理サーバに、アプリケーションプログラムを配置し、アプリケーションプログラムを並列実行させる。 In order to achieve the above object, the information processing system according to the embodiment of the present invention has a management server and a plurality of processing servers including one or more processors for executing an application program. When the management server receives the degree of parallelism of the application program from the user, it secures the computer resources necessary to execute the application program at the received degree of parallelism from the available computer resources of the multiple processing servers. The application program is placed on the processing server having the secured computer resources, and the application program is executed in parallel.

本発明によれば、アプリケーション利用者が、アプリケーションの実行要求毎に必要な計算機リソース量を、アプリケーション利用者の処理要求に合わせて柔軟に決定して、決定した計算機リソース量で迅速に並列コンピューティングシステムを構築することが可能となる。 According to the present invention, an application user flexibly determines the amount of computer resources required for each application execution request according to the processing request of the application user, and quickly performs parallel computing with the determined amount of computer resources. It becomes possible to build a system.

情報処理システムの全体構成の例を示す図である。It is a figure which shows the example of the whole structure of an information processing system. 各種サーバの物理的な構成の例を示す図である。It is a figure which shows the example of the physical configuration of various servers. 処理サーバの機能の概要を示す図である。It is a figure which shows the outline of the function of the processing server. アプリ管理記憶部のテーブルの例を示す図である。It is a figure which shows the example of the table of the application management storage part. ノード-クラスタ管理情報記憶部のテーブルの例を示す図である。It is a figure which shows the example of the table of the node-cluster management information storage part. アプリ実行計算部の動作フローの例を示す図である。It is a figure which shows the example of the operation flow of the application execution calculation part. クラスタ生成部の動作フローの例を示す図である。It is a figure which shows the example of the operation flow of the cluster generation part. クラスタ破棄部の動作フローの例を示す図である。It is a figure which shows the example of the operation flow of the cluster destruction part. アプリの実行依頼前に並列度を設定する動作シーケンスの例を示す図である。It is a figure which shows the example of the operation sequence which sets the degree of parallelism before the execution request of an application. アプリの実行依頼から処理サーバで並列処理を実行する動作シーケンスの例を示す図である。It is a figure which shows the example of the operation sequence which executes the parallel processing in the processing server from the execution request of the application. アプリ実行完了後にクラスタ破棄する動作シーケンスの例を示す図である。It is a figure which shows the example of the operation sequence which discards a cluster after the completion of application execution. 計算機リソース量の設定画面の例を示す図である。It is a figure which shows the example of the setting screen of the computer resource amount. 計算機リソース量の設定画面の別の例を示す図である。It is a figure which shows another example of the setting screen of the computer resource amount.

以下、各実施例における実施形態について図面を参照して説明する。なお、以下の実施例に用いる図において、同一の符号を付した部分は同一物を表し、それらの構造および動作は同じである。 Hereinafter, embodiments in each embodiment will be described with reference to the drawings. In the figures used in the following examples, the parts with the same reference numerals represent the same objects, and their structures and operations are the same.

図１は、実施例１に係る情報処理システムの全体構成の例である。実施例１に係る情報システムは、クライアント端末１０１と、クライアント端末１０１とネットワーク１０２を介して接続されるリクエスト受付サーバ１０３、そしてネットワーク１０５を介してリクエスト受付サーバ１０３と接続されるデータ管理サーバ１０４、アプリ管理サーバ１１０、クラスタ管理サーバ１２０、複数の処理サーバ１３０を有する。図１ではクライアント端末１０１とそれ以外のサーバ（リクエスト受付サーバ１０３、データ管理サーバ１０４、アプリ管理サーバ１１０、クラスタ管理サーバ１２０、処理サーバ１３０）が異なるネットワーク（１０２，１０５）に接続されているが、クライアント端末１０１とそれ以外のサーバが同一ネットワークに接続されるように、情報処理システムが構成されていてもよい。 FIG. 1 is an example of the overall configuration of the information processing system according to the first embodiment. The information system according to the first embodiment includes a client terminal 101, a request reception server 103 connected to the client terminal 101 via the network 102, and a data management server 104 connected to the request reception server 103 via the network 105. It has an application management server 110, a cluster management server 120, and a plurality of processing servers 130. In FIG. 1, the client terminal 101 and other servers (request reception server 103, data management server 104, application management server 110, cluster management server 120, processing server 130) are connected to different networks (102, 105). The information processing system may be configured so that the client terminal 101 and other servers are connected to the same network.

クライアント端末１０１は、アプリケーション利用者が使用する端末であり、アプリケーション利用者が、アプリケーションプログラム（以下、「アプリケーション」と略記する）に処理させるための入力データを作成して、リクエスト受付サーバ１０３にアプリケーションの処理要求を入力データとともに送信するために用いられる。クライアント端末１０１はたとえば、会社や工場内のパーソナルコンピュータやサーバである。あるいはクライアント端末１０１は、スマートフォンやタブレット端末などの、通信機能を有する通信デバイスであってもよい。 The client terminal 101 is a terminal used by the application user, and the application user creates input data for the application program (hereinafter, abbreviated as "application") to process the application on the request reception server 103. It is used to send the processing request of. The client terminal 101 is, for example, a personal computer or a server in a company or factory. Alternatively, the client terminal 101 may be a communication device having a communication function, such as a smartphone or a tablet terminal.

ネットワーク１０２は、通信キャリアなどによって提供される無線ネットワークまたは有線ネットワークである。ネットワーク１０２は、個別の会社などが所有するネットワークを、ネットワーク１０２の一部に含んでもよく、複数種類のプロトコルを通過させるネットワークであってもよい。 The network 102 is a wireless network or a wired network provided by a communication carrier or the like. The network 102 may include a network owned by an individual company or the like as a part of the network 102, or may be a network through which a plurality of types of protocols are passed.

リクエスト受付サーバ１０３は、クライアント端末１０１からアプリ実行要求などの処理要求を受け付け、受け付けた処理要求に基づき、データ管理サーバ１０４、アプリ管理サーバ１１０、クラスタ管理サーバ１２０、処理サーバ１３０に処理依頼を行い、処理結果をクライアント端末１０１に返信する処理を実行するサーバである。 The request reception server 103 receives a processing request such as an application execution request from the client terminal 101, and makes a processing request to the data management server 104, the application management server 110, the cluster management server 120, and the processing server 130 based on the received processing request. , A server that executes a process of returning a process result to the client terminal 101.

データ管理サーバ１０４は、アプリケーションの実行時に処理対象となるデータ（入力データ）を格納するサーバであり、入力データがファイルの場合は共有ファイルサーバ、レコードとして格納しておく場合は構造データベースサーバ、ｊｓｏｎなどの形式で格納しておく場合はキーバリューストアなどの非構造データベースなどのデータを格納するサーバである。 The data management server 104 is a server that stores data (input data) to be processed when the application is executed. If the input data is a file, it is a shared file server, and if it is stored as a record, it is a structural database server, json. When storing in a format such as, it is a server that stores data such as unstructured databases such as key value stores.

アプリ管理サーバ１１０は、処理サーバ１３０で実行されるアプリケーションの情報を管理するとともに、入力データや計算機リソースを設定することでアプリケーションの実行処理時間の見積もり値を計算するサーバである。アプリ管理サーバ１１０は、アプリケーションの情報を管理するアプリ管理記憶部１１１と、入力データと計算機リソース量に基づきアプリケーションの実行時間を事前に計算するアプリ実行時間計算部１１２と、を有する。詳細は、図４および図６で説明する。 The application management server 110 is a server that manages information on an application executed by the processing server 130 and calculates an estimated value of an application execution processing time by setting input data and computer resources. The application management server 110 includes an application management storage unit 111 that manages application information, and an application execution time calculation unit 112 that calculates an application execution time in advance based on input data and the amount of computer resources. Details will be described with reference to FIGS. 4 and 6.

クラスタ管理サーバ１２０は、各処理サーバ１３０の利用状態を管理して、クラスタの生成/破棄を動的に行うサーバであり、ノード-クラスタ管理情報記憶部１２１とクラスタ生成部１２２、クラスタ破棄部１２３、を有する。本実施例では、１つのアプリケーションを実行する際に使用される計算機リソースの集合（あるいはこの計算機リソースを有する処理サーバ１３０の集合）を「クラスタ」と呼ぶ。詳細は、図５および図７、図８で説明する。 The cluster management server 120 is a server that manages the usage status of each processing server 130 and dynamically creates / destroys a cluster, and is a node-cluster management information storage unit 121, a cluster generation unit 122, and a cluster destruction unit 123. Have. In this embodiment, a set of computer resources (or a set of processing servers 130 having this computer resource) used when executing one application is referred to as a "cluster". Details will be described with reference to FIG. 5, FIG. 7, and FIG.

処理サーバ１３０は、アプリ管理サーバ１１０が管理しているアプリケーションを実行するためのサーバであり、アプリケーションの実行コードを記憶するアプリケーション管理部１３１と、アプリケーションの並列処理を実現する並列処理管理部１３２と、を有する。アプリケーション管理部１３１には、複数のアプリケーションが登録されてもよい。複数のアプリケーションが登録されている場合、クラスタはアプリケーションの処理要求ごとに生成されるため、処理サーバ１３０は複数のクラスタに属していることとなり、それぞれのクラスタ内の処理サーバ１３０からアプリケーションの処理を割り振られることとなる。詳細は図３で説明する。 The processing server 130 is a server for executing an application managed by the application management server 110, and includes an application management unit 131 that stores the execution code of the application and a parallel processing management unit 132 that realizes parallel processing of the application. Have. A plurality of applications may be registered in the application management unit 131. When a plurality of applications are registered, the cluster is generated for each application processing request, so that the processing server 130 belongs to a plurality of clusters, and the processing server 130 in each cluster processes the application. It will be allocated. Details will be described with reference to FIG.

本実施例では、これらのサーバがそれぞれ物理的に異なる計算機である例を説明する。ただし必ずしもこれらのサーバが、異なる計算機である必要はなく、上で述べたいくつかのサーバが有する機能部が、単一の計算機上に実装されていてもよい。たとえば情報処理システム内に、上で述べたリクエスト受付サーバ１０３、データ管理サーバ１０４、アプリ管理サーバ１１０、クラスタ管理サーバ１２０に代えて、１台の計算機（仮に「管理サーバ」と呼ぶ）を設け、上で述べたリクエスト受付サーバ１０３、データ管理サーバ１０４、アプリ管理サーバ１１０、クラスタ管理サーバ１２０が有する機能部を、その管理サーバ上に設けてもよい。あるいは、処理サーバの１つ（または複数）が、管理サーバとして用いられてもよい。 In this embodiment, an example in which these servers are physically different computers will be described. However, these servers do not necessarily have to be different computers, and the functional parts of some of the servers described above may be implemented on a single computer. For example, in the information processing system, one computer (tentatively called a "management server") is provided in place of the request reception server 103, the data management server 104, the application management server 110, and the cluster management server 120 described above. The functional unit of the request reception server 103, the data management server 104, the application management server 110, and the cluster management server 120 described above may be provided on the management server. Alternatively, one (or more) of the processing servers may be used as the management server.

さらに別の実施形態として、情報処理システム内に設けられた１台または複数台の計算機上で、いわゆる仮想計算機を提供するためのソフトウェア（一般的にハイパーバイザと呼ばれる）を実行させ、計算機上に、リクエスト受付サーバの役割を果たす仮想計算機、データ管理サーバの役割を果たす仮想計算機、アプリ管理サーバの役割を果たす仮想計算機、クラスタ管理サーバの役割を果たす仮想計算機を定義することで、情報処理システムが構成されてもよい。 As yet another embodiment, software for providing a so-called virtual computer (generally called a hypervisor) is executed on one or a plurality of computers provided in the information processing system, and the computer is used. By defining a virtual computer that plays the role of a request reception server, a virtual computer that plays the role of a data management server, a virtual computer that plays the role of an application management server, and a virtual computer that plays the role of a cluster management server, the information processing system can be created. It may be configured.

図２は、図１で示したリクエスト受付サーバ１０３、データ管理サーバ１０４、アプリ管理サーバ１１０、クラスタ管理サーバ１２０、処理サーバ１３０、クライアント端末１０１の物理的な構成を示す図である。本実施例ではこれらのサーバ（またはクライアント端末）には、プロセッサ（ＣＰＵ）２０１、メモリ２０２、補助記憶装置２０３及び通信インターフェース（通信Ｉ／Ｆ）２０４を有する計算機２００が用いられる。この計算機は一例として、パーソナルコンピュータ（ＰＣ）等の汎用的な計算機でよい。 FIG. 2 is a diagram showing the physical configurations of the request reception server 103, the data management server 104, the application management server 110, the cluster management server 120, the processing server 130, and the client terminal 101 shown in FIG. In this embodiment, as these servers (or client terminals), a computer 200 having a processor (CPU) 201, a memory 202, an auxiliary storage device 203, and a communication interface (communication I / F) 204 is used. As an example, this computer may be a general-purpose computer such as a personal computer (PC).

プロセッサ２０１は、メモリ２０２に格納されたプログラムを実行する。プロセッサ２０１の数は１とは限らない。計算機２００は複数のプロセッサ２０１を有していてもよい。またプロセッサ２０１は複数のプロセッサコアを有する、いわゆるマルチコアプロセッサであってもよい。メモリ２０２は、不揮発性の記憶素子であるＲＯＭ及び揮発性の記憶素子であるＲＡＭを含む。ＲＯＭは、不変のプログラム（例えば、ＢＩＯＳ）などを格納する。ＲＡＭは、ＤＲＡＭ（Dynamic Random Access Memory）のような高速かつ揮発性の記憶素子であり、プロセッサ２０１が実行するプログラム及びプログラムの実行時に使用されるデータを一時的に格納する。 The processor 201 executes a program stored in the memory 202. The number of processors 201 is not limited to one. The computer 200 may have a plurality of processors 201. Further, the processor 201 may be a so-called multi-core processor having a plurality of processor cores. The memory 202 includes a ROM which is a non-volatile storage element and a RAM which is a volatile storage element. The ROM stores an invariant program (for example, BIOS) and the like. The RAM is a high-speed and volatile storage element such as a DRAM (Dynamic Random Access Memory), and temporarily stores a program executed by the processor 201 and data used when the program is executed.

補助記憶装置２０３は、例えば、磁気記憶装置（ＨＤＤ）、フラッシュメモリ（ＳＳＤ）等の大容量かつ不揮発性の記憶装置であり、プロセッサ２０１が実行するプログラム及びプログラムの実行時に使用されるデータを格納する。すなわち、プログラムは、補助記憶装置２０３から読み出されて、メモリ２０２にロードされて、プロセッサ２０１によって実行される。 The auxiliary storage device 203 is, for example, a large-capacity and non-volatile storage device such as a magnetic storage device (HDD) or a flash memory (SSD), and stores a program executed by the processor 201 and data used when executing the program. To do. That is, the program is read from the auxiliary storage device 203, loaded into the memory 202, and executed by the processor 201.

通信インターフェース２０４は、所定のプロトコルに従って、他の装置との通信を制御するネットワークインターフェース装置である。 The communication interface 204 is a network interface device that controls communication with another device according to a predetermined protocol.

計算機２００はまた、入力インターフェース（入力Ｉ／Ｆ）２０５及び出力インターフェース（出力Ｉ／Ｆ）２０８を有してもよい。入力インターフェース２０５は、キーボード２０６やマウス２０７などが接続され、オペレータからの入力を受けるインターフェースである。出力インターフェース２０８は、ディスプレイ装置２０９やプリンタなどが接続され、プログラムの実行結果をオペレータが視認可能な形式で出力するインターフェースである。 The computer 200 may also have an input interface (input I / F) 205 and an output interface (output I / F) 208. The input interface 205 is an interface to which a keyboard 206, a mouse 207, or the like is connected and receives input from an operator. The output interface 208 is an interface to which a display device 209, a printer, or the like is connected, and outputs a program execution result in a format that can be visually recognized by an operator.

なお、本実施例では、アプリ管理サーバ１１０、クラスタ管理サーバ１２０、処理サーバ１３０の有する各機能部は、ソフトウェア（プログラム）によって実装されるものとする。たとえばアプリ管理サーバ１１０では、アプリ管理サーバ１１０をアプリ管理記憶部１１１とアプリ実行時間計算部１１２として機能させるためのプログラムが、アプリ管理サーバ１１０（計算機２００）のメモリ２０２上にロードされ、プロセッサ２０１により実行される。これによりアプリ管理サーバ１１０は、アプリ管理記憶部１１１とアプリ実行時間計算部１１２を有する装置として動作する。 In this embodiment, it is assumed that each functional unit of the application management server 110, the cluster management server 120, and the processing server 130 is implemented by software (program). For example, in the application management server 110, a program for causing the application management server 110 to function as the application management storage unit 111 and the application execution time calculation unit 112 is loaded on the memory 202 of the application management server 110 (computer 200), and the processor 201 Is executed by. As a result, the application management server 110 operates as a device having the application management storage unit 111 and the application execution time calculation unit 112.

クラスタ管理サーバ１２０や処理サーバ１３０でも同様に、計算機２００（クラスタ管理サーバ１２０や処理サーバ１３０）のプロセッサ２０１で、上で述べた各機能部を実現するためのプログラムが実行される。これによってクラスタ管理サーバ１２０や処理サーバ１３０は、上で述べた各機能部を有する装置として動作する。以下では、アプリ管理サーバ１１０やクラスタ管理サーバ１２０、あるいは処理サーバ１３０等で実行される処理を説明する際に、アプリ実行時間計算部１１２やクラスタ生成部１２２等の機能部を主語とした説明を行うことがあるが、それは実際には、機能部を有する計算機２００のプロセッサ２０１が処理を行うことを意味する。 Similarly, in the cluster management server 120 and the processing server 130, the processor 201 of the computer 200 (cluster management server 120 and the processing server 130) executes a program for realizing each functional unit described above. As a result, the cluster management server 120 and the processing server 130 operate as a device having each of the above-mentioned functional units. In the following, when explaining the processing executed by the application management server 110, the cluster management server 120, the processing server 130, etc., the explanation will be given with functional units such as the application execution time calculation unit 112 and the cluster generation unit 122 as the main subjects. Although it may be done, it actually means that the processor 201 of the computer 200 having the functional part performs the processing.

また、プロセッサ２０１が実行するプログラムは、計算機が読み取り可能な記憶メディア又はネットワークを介して計算機２００に提供され、非一時的記憶媒体である補助記憶装置２０３に格納される。計算機が読み取り可能な記憶メディアとは、非一時的なコンピュータ可読媒体で、たとえばＣＤ−ＲＯＭやフラッシュメモリなどの、不揮発性のリムーバブルメディアである。このため計算機２００は、リムーバブルメディアからデータを読み込むインターフェースを有するとよい。 Further, the program executed by the processor 201 is provided to the computer 200 via a storage medium or network that can be read by the computer, and is stored in the auxiliary storage device 203, which is a non-temporary storage medium. A computer-readable storage medium is a non-transitory computer-readable medium, such as a non-volatile removable medium such as a CD-ROM or flash memory. Therefore, the computer 200 may have an interface for reading data from removable media.

また、別の実施形態として、各機能部の一部またはすべては、ＦＰＧＡやＡＳＩＣなどのハードウェアを用いて実装されていてもよい。 Further, as another embodiment, a part or all of each functional unit may be implemented by using hardware such as FPGA or ASIC.

図３は、処理サーバ１３０でアプリケーションが実行される時の仕組みを概説する図である。 FIG. 3 is a diagram illustrating a mechanism when an application is executed on the processing server 130.

処理サーバ１３０は先に述べたとおり、アプリケーションが配置されるアプリケーション管理部１３１と、同一クラスタ内の処理サーバ１３０を管理して、処理を各処理サーバ１３０に割り振りながらアプリケーションを並列実行することを管理する並列処理管理部１３２と、を有する。 As described above, the processing server 130 manages the application management unit 131 in which the application is arranged and the processing server 130 in the same cluster, and manages to execute the application in parallel while allocating the processing to each processing server 130. It has a parallel processing management unit 132 and a parallel processing management unit 132.

アプリケーション管理部１３１はアプリケーションプログラムを格納する機能部で、メモリ２０２や補助記憶装置２０３の記憶領域を用いてアプリケーションプログラムを保持する。 The application management unit 131 is a functional unit that stores the application program, and holds the application program by using the storage area of the memory 202 or the auxiliary storage device 203.

並列処理管理部１３２は、アプリケーションを並列実行させるために必要な、各種機能を提供する。並列処理管理部１３２の説明の前に、処理サーバ１３０でアプリケーションがどのようにして並列実行されるか、概説する。 The parallel processing management unit 132 provides various functions necessary for executing applications in parallel. Before the explanation of the parallel processing management unit 132, how the application is executed in parallel on the processing server 130 will be outlined.

本実施例では一例として、アプリケーションがデータの分析を行うためのプログラムである例を説明する。アプリケーションは、１以上の処理を実行するためのプログラムコード（実行コード）を含む。図４の４１０は、アプリケーション（ＡｐｐＡ）の構成例を示している。図４の４１０に示されているように、ＡｐｐＡは複数の処理Ａａ，Ａｂ，Ａｃを含んでおり、ＡｐｐＡが処理サーバで実行される時、処理Ａａ，Ａｂ，Ａｃの順に実行される。ここで、たとえば処理Ａａは入力データの正規化を行う処理、処理Ａｂは正規化されたデータの分析を行う処理、そして処理Ａｃは処理Ａｂにて分析されたデータの統計処理である。 In this embodiment, as an example, an example in which the application is a program for analyzing data will be described. The application includes a program code (execution code) for executing one or more processes. 410 in FIG. 4 shows a configuration example of an application (App A). As shown in 410 of FIG. 4, the application A includes a plurality of processes Aa, Ab, and Ac, and when the application A is executed on the processing server, the processes Aa, Ab, and Ac are executed in this order. .. Here, for example, the process Aa is a process of normalizing the input data, the process Ab is a process of analyzing the normalized data, and the process Ac is a statistical process of the data analyzed by the process Ab.

各処理の中には、複数の処理サーバ１３０（あるいは複数のプロセッサ２０１）で並列処理されてもよいものもある。本実施例では処理Ａａ，Ａｂが、並列実行可能な処理である例を説明する。 Some of the processes may be processed in parallel by a plurality of processing servers 130 (or a plurality of processors 201). In this embodiment, an example in which the processes Aa and Ab are processes that can be executed in parallel will be described.

アプリケーションは、これらの各処理（Ａａ，Ａｂ，Ａｃ）をプロセッサ２０１に実行させるための実行コードと、各処理の実行を各処理サーバ１３０に依頼する（振り分ける）処理をプロセッサ２０１に行わせる実行コードとを有し、前者の実行コードのことを「実行部」（図３の３１２）と呼び、後者の実行コードのことを「振分部」（図３の３１１）と呼ぶ。本実施例では、振分部３１１が各処理サーバ１３０に、実行部の処理を依頼するために送信される情報のことを「メッセージ」と呼ぶ。また、図３または図４に示されたＡｐｐＡのように、複数の処理（Ａａ，Ａｂ，Ａｃ）が実行されるアプリケーションでは、実行部３１２には処理Ａａを行う実行コード，処理Ａｂを行う実行コード，処理Ａｃを行う実行コードが含まれる。以下では処理Ａａ，Ａｂ，Ａｃを行う実行コードをそれぞれ、「コードＡａ」，「コードＡｂ」，「コードＡｃ」と呼ぶ。 The application has an execution code for causing the processor 201 to execute each of these processes (Aa, Ab, Ac), and an execution code for causing the processor 201 to perform (distribute) the execution of each process to each processing server 130. The former execution code is called an "execution unit" (312 in FIG. 3), and the latter execution code is called a "distribution unit" (311 in FIG. 3). In this embodiment, the information transmitted by the distribution unit 311 to each processing server 130 to request the processing of the execution unit is referred to as a "message". Further, in an application in which a plurality of processes (Aa, Ab, Ac) are executed as in App A shown in FIG. 3 or 4, the execution unit 312 executes an execution code and a process Ab that perform the process Aa. Execution code and execution code that performs processing Ac are included. Hereinafter, the execution code that performs the processes Aa, Ab, and Ac will be referred to as "code Aa", "code Ab", and "code Ac", respectively.

処理サーバ１３０の並列処理管理部１３２は、振分部３１１と実行部３１２の形で分離設計定義されたアプリケーションの並列実行の管理を行う。並列処理管理部１３２は、リクエスト受付サーバ１０３などの外部からアプリケーションの実行依頼を受け付けて、アプリケーションの振分部３１１の実行を開始するリクエスト受付部３２１と、振分部３１１が生成したメッセージを、処理サーバ（実行）１３０に送信するメッセージ振分部３２２、処理サーバ（振分）１３０から受信したメッセージを解析して、対象の実行部３１２に含まれる実行コード（コードＡａ，Ａｂ，Ａｃ）を呼び出すメッセージ受付部３２３により、アプリケーションの並列実行を行う。 The parallel processing management unit 132 of the processing server 130 manages the parallel execution of the application defined separately in the form of the distribution unit 311 and the execution unit 312. The parallel processing management unit 132 receives the application execution request from the outside such as the request reception server 103, and starts the execution of the application distribution unit 311. The request reception unit 321 and the message generated by the distribution unit 311 are displayed. The message distribution unit 322 to be transmitted to the processing server (execution) 130 and the message received from the processing server (distribution) 130 are analyzed, and the execution code (codes Aa, Ab, Ac) included in the target execution unit 312 is analyzed. The message receiving unit 323 to be called executes the application in parallel.

また並列処理管理部１３２は、クラスタ管理サーバ１２０などからアプリケーションのデプロイまたはアンデプロイ依頼を受けとり、アプリケーション管理部１３１にアプリケーションの配置と削除を行うアプリ・デプロイ/アンデプロイ受付部３２４と、アプリケーション管理部１３１に配置されているアプリケーションが所属しているクラスタについてのクラスタ情報を管理するクラスタ情報記憶部３２５の機能も提供する。クラスタ情報については後述する。 Further, the parallel processing management unit 132 receives an application deployment or undeployment request from the cluster management server 120 or the like, and arranges and deletes the application in the application management unit 131. The application deployment / undeploy reception unit 324 and the application management unit. It also provides the function of the cluster information storage unit 325 that manages the cluster information about the cluster to which the application located in 131 belongs. The cluster information will be described later.

並列処理管理部１３２は、このメッセージの送受信や、受信したメッセージに基づいて実行部に処理を実行させる等の処理を行う。以下では図３を参照しながら、ＡｐｐＡ３１０が実行される時の処理の流れを概説する。 The parallel processing management unit 132 performs processing such as sending and receiving this message and causing the execution unit to execute processing based on the received message. In the following, with reference to FIG. 3, the flow of processing when the Ap A310 is executed will be outlined.

以下では、処理ＡａがＮ個のプロセッサ２０１で並列処理され、処理ＡｂがＭ個のプロセッサ２０１で並列処理される例を説明する（Ｎ，Ｍはいずれも１以上の整数で、ＮとＭは等しい場合もある）。なお、アプリケーションが処理サーバ１３０で実行される前に、アプリケーションを実行するクラスタ内の各処理サーバ１３０にはアプリケーションが配布され、クラスタ内の各処理サーバ１３０のアプリケーション管理部１３１にはアプリケーションが格納された状態にある。この処理は後述する。 In the following, an example in which processing Aa is processed in parallel by N processors 201 and processing Ab is processed in parallel by M processors 201 will be described (N and M are both integers of 1 or more, and N and M are integers of 1 or more. May be equal). Before the application is executed on the processing server 130, the application is distributed to each processing server 130 in the cluster that executes the application, and the application is stored in the application management unit 131 of each processing server 130 in the cluster. Is in a state of being. This process will be described later.

ここで、処理サーバ１３０のうち、メッセージを生成して振り分ける振分部３１１を担当する処理サーバ１３０を処理サーバ（振分）１３０、メッセージを受け取り、処理を実行する実行部３１２を担当する処理サーバ１３０を処理サーバ（実行）１３０と呼ぶ。処理サーバ（実行）１３０と処理サーバ（振分）１３０は同一サーバであってもよい。 Here, among the processing servers 130, the processing server 130 in charge of the distribution unit 311 that generates and distributes messages is the processing server (distribution) 130, and the processing server in charge of the execution unit 312 that receives the message and executes the processing. 130 is called a processing server (execution) 130. The processing server (execution) 130 and the processing server (distribution) 130 may be the same server.

アプリケーションＡｐｐＡ３１０の実行が開始されると、処理サーバ（振分）１３０の振分部３１１はまずメッセージＡａをＮ個生成して、並列処理管理部１３２のメッセージ振分部３２２を介して、クラスタ内の各処理サーバ１３０にメッセージＡａを送信する。メッセージＡａの送信先となる処理サーバ１３０は、振分部３１１により決定される。メッセージＡａを送信された処理サーバ１３０（実行）では、メッセージ受付部３２３がメッセージＡａに対応した実行部３１２内の処理Ａａを実行するコードを呼び出して、処理Ａａを実行させる。処理Ａａの実行後、メッセージ受付部３２３は処理サーバ（振分）１３０に、処理結果を返信する。 When the execution of the application AppA310 is started, the distribution unit 311 of the processing server (distribution) 130 first generates N messages Aa, and in the cluster via the message distribution unit 322 of the parallel processing management unit 132. Message Aa is transmitted to each processing server 130 of. The processing server 130 to which the message Aa is transmitted is determined by the distribution unit 311. In the processing server 130 (execution) to which the message Aa is transmitted, the message receiving unit 323 calls the code for executing the processing Aa in the execution unit 312 corresponding to the message Aa to execute the processing Aa. After executing the process Aa, the message reception unit 323 returns the process result to the process server (distribution) 130.

処理サーバ（振分）１３０の振分部３１１は、メッセージＡａに対応する処理結果の返信をＮ個分受け取ると、次の処理としてメッセージＡｂをＭ個生成して、同様に並列処理管理部１３２のメッセージ振分部３２２を介して、メッセージＡｂをクラスタ内の処理サーバ（実行）１３０に送信する。振分部３１１は、各処理（Ａａ，Ａｂ，Ａｃ）について、メッセージの送信及び結果の受信を行い、メッセージＡｃに対応する結果を受信し終えると、アプリケーションは終了する。つまり、アプリケーションを、処理依頼となるメッセージを生成する振分部３１１と、メッセージを受け取る実行部３１２に分けて設計定義しておくことで、繰り返し処理部分を並列に処理させることができる。 When the distribution unit 311 of the processing server (distribution) 130 receives N replies of the processing results corresponding to the message Aa, it generates M messages Ab as the next processing, and similarly, the parallel processing management unit 132. Message Ab is transmitted to the processing server (execution) 130 in the cluster via the message distribution unit 322 of the above. The distribution unit 311 transmits a message and receives a result for each process (Aa, Ab, Ac), and when the result corresponding to the message Ac is received, the application ends. That is, by designing and defining the application separately for the distribution unit 311 that generates the message to be the processing request and the execution unit 312 that receives the message, the iterative processing portion can be processed in parallel.

並列処理管理部１３２により、処理サーバ１３０にアプリケーションを配置するだけで、クラスタ内の処理サーバ１３０のどれか１つに対して実行依頼を送信することで、処理サーバ１３０が自動で処理サーバ（振分）１３０と処理サーバ（実行）１３０に分かれて、アプリケーションの処理を処理サーバ（実行）１３０に振り分けながら並列に処理を実行することができる。これらの処理の流れについては、後で図９から図１１のシーケンス図を用いて説明する。 The parallel processing management unit 132 simply places the application on the processing server 130, and the processing server 130 automatically sends an execution request to any one of the processing servers 130 in the cluster. Minutes) 130 and processing server (execution) 130 can be divided, and processing can be executed in parallel while distributing application processing to processing server (execution) 130. The flow of these processes will be described later with reference to the sequence diagrams of FIGS. 9 to 11.

図４は、アプリ管理サーバ１１０内に保持されているアプリ管理記憶部１１１のテーブルの例を示す図である。 FIG. 4 is a diagram showing an example of a table of the application management storage unit 111 held in the application management server 110.

アプリ管理記憶部１１１は、アプリケーションとして配置する実行コードや、アプリケーションの処理時間を計算するための処理フローの情報や、処理ごとの実行時間を計算するための計算ロジックの情報を格納する機能部で、これらの情報を格納するために、メモリ２０２や補助記憶装置２０３の記憶領域を用いる。アプリ管理記憶部１１１は一例として、公知のファイルシステムプログラムまたはデータベース管理システム（ＤＢＭＳ）のようなプログラムを用いて実装されて良い。本実施例ではアプリ管理記憶部１１１は、アプリケーションの実行コードや処理フローや計算ロジックの情報を、メモリ２０２や補助記憶装置２０３の記憶領域上に形成されたテーブルに記憶させる例を説明する。 The application management storage unit 111 is a functional unit that stores the execution code to be arranged as an application, the processing flow information for calculating the processing time of the application, and the calculation logic information for calculating the execution time for each processing. , The storage area of the memory 202 or the auxiliary storage device 203 is used to store such information. As an example, the application management storage unit 111 may be implemented by using a known file system program or a program such as a database management system (DBMS). In this embodiment, the application management storage unit 111 describes an example in which the application execution code, the processing flow, and the information of the calculation logic are stored in the table formed on the storage area of the memory 202 or the auxiliary storage device 203.

アプリ管理記憶部１１１が有するテーブル４００は、図４に示されるように６つのカラムを有する。以下、各カラムに格納される情報について説明する。アプリ名４０１には、アプリケーションの名称が格納される。アプリケーションの名称とは、アプリケーション利用者が、アプリケーションの実行を依頼する際に、アプリケーションを特定するために用いる名称である。実行コード４０２には、アプリ名４０１に対応したアプリケーションの実行コード（のファイル）が格納される。 The table 400 included in the application management storage unit 111 has six columns as shown in FIG. The information stored in each column will be described below. The name of the application is stored in the application name 401. The name of the application is a name used by the application user to specify the application when requesting the execution of the application. The execution code 402 stores the execution code (file) of the application corresponding to the application name 401.

並列度計算ロジック４０３には、入力データ量に応じてアプリケーションの各処理の繰り返し回数を算出するためのロジックが記述されたファイルが格納される。本実施例では、各処理の繰り返し回数を算出するためのロジックを「並列度計算ロジック」と呼ぶ。処理フロー４０４には、アプリケーションの処理実行手順が記録される。並列性４０５には、処理フロー４０４に記述されている各処理が、並列実行が可能か否かを表す情報が格納される。計算ロジック４０６には、処理フロー４０４内の各処理の１回の実行時間を算出するための計算ロジック（これを「実行時間計算ロジック」と呼ぶ）が記述されたファイルが格納される。 The parallel degree calculation logic 403 stores a file in which the logic for calculating the number of repetitions of each process of the application according to the amount of input data is described. In this embodiment, the logic for calculating the number of repetitions of each process is called "parallelism calculation logic". The processing execution procedure of the application is recorded in the processing flow 404. In the parallelism 405, information indicating whether or not each process described in the process flow 404 can be executed in parallel is stored. The calculation logic 406 stores a file in which a calculation logic (referred to as "execution time calculation logic") for calculating one execution time of each process in the process flow 404 is described.

たとえば図４を参照しながら、各カラムに格納される情報の具体例を説明する。図４のテーブルの先頭行に格納されているアプリケーション（ＡｐｐＡと呼ぶ）が、図４の４１０に記述されているように、処理Ａａ、処理Ａｂ、処理Ａｃの３つの処理を含み、処理Ａａ、処理Ａｂ、処理Ａｃの順で処理を行うものとする。また処理Ａａ及び処理Ａｂは並列実行可能で、与えられた入力データの量に応じて、繰り返し実行される回数が変動する処理とする。 For example, a specific example of the information stored in each column will be described with reference to FIG. The application (referred to as AppA) stored in the first row of the table of FIG. 4 includes three processes of process Aa, process Ab, and process Ac, as described in 410 of FIG. 4, and the process Aa, It is assumed that the processing is performed in the order of processing Ab and processing Ac. Further, the processes Aa and Ab can be executed in parallel, and the number of times they are repeatedly executed varies depending on the amount of input data given.

この時並列度計算ロジック４０３には、入力データの量から処理Ａａ及び処理Ａｂの繰り返し実行回数を算出するためのロジックが記述されたファイルのファイル名（図４の例では“AppA_message.py”）が記述される。また処理フロー４０４には“処理Ａａ，処理Ａｂ，処理Ａｃ”が記述される。以下では、処理フロー４０４のカラムに“処理Ａａ”が格納された行を“行４０７”，“処理Ａｂ”が格納された行を“行４０８”，“処理Ａｃ” が格納された行を“行４０９”と呼ぶ。 At this time, the parallel degree calculation logic 403 is the file name of the file in which the logic for calculating the number of repeated executions of the processing Aa and the processing Ab is described from the amount of input data (“AppA_message.py” in the example of FIG. 4). Is described. Further, "processing Aa, processing Ab, processing Ac" is described in the processing flow 404. In the following, the row in which "processing Aa" is stored is "row 407", the row in which "processing Ab" is stored is "row 408", and the row in which "processing Ac" is stored is "row 407" in the column of the processing flow 404. Called line 409 ".

並列性４０５の欄には、行４０７及び行４０８には“○”が格納され、処理Ａａ及び処理Ａｂは並列実行可能であることを表す。一方、行４０９には“×”、つまり処理Ａｃは並列実行可能でないことを表す情報が格納される。 In the column of parallelism 405, “◯” is stored in rows 407 and 408, indicating that the processes Aa and Ab can be executed in parallel. On the other hand, line 409 stores "x", that is, information indicating that the processing Ac cannot be executed in parallel.

そして、処理Ａａの実行時間計算ロジックが記述されたファイルが“AppA_calcAa.py”、処理Ａｂの実行時間計算ロジックが記述されたファイルが“AppA_calcAb.py”、処理Ａｃの実行時間計算ロジックが記述されたファイルが“AppA_calcAc.py”の場合、計算ロジック４０６の欄には、行４０７に“AppA_calcAa.py”、行４０８に“AppA_calcAb.py”、行４０９に“AppA_calcAc.py”が格納される。 Then, the file in which the execution time calculation logic of the process Aa is described is "AppA_calcAa.py", the file in which the execution time calculation logic of the process Ab is described is "AppA_calcAb.py", and the execution time calculation logic of the process Ac is described. When the file is "AppA_calcAc.py", "AppA_calcAa.py" is stored in line 407, "AppA_calcAb.py" is stored in line 408, and "AppA_calcAc.py" is stored in line 409 in the calculation logic 406 column.

アプリ管理記憶部１１１のテーブルに格納されるこれらの情報は、あらかじめ情報処理システムの管理者、またはアプリケーションの利用者によって、アプリ管理記憶部１１１に登録される。また並列度計算ロジックや実行時間計算ロジックは、あらかじめアプリケーションの開発者によって作成されたものである。 These information stored in the table of the application management storage unit 111 are registered in the application management storage unit 111 in advance by the administrator of the information processing system or the user of the application. The parallelism calculation logic and the execution time calculation logic are created in advance by the application developer.

ただし別の実施形態として、実行時間計算ロジックを情報処理システムが自動作成する手段を備えていてもよい。たとえばデータ量と実行時間の因果関係を考慮して、入力データを統計的に処理して自動で計算ロジックを作成する機能，またデータ量以外に実行時間との因果関係のある項目を分析して自動で計算ロジックの予測モデルを構築する機能を、情報処理システムが備えており、アプリケーションがアプリ管理サーバ１１０に登録された時に、情報処理システムが実行時間計算ロジックを生成して、アプリ管理記憶部１１１に登録してもよい。 However, as another embodiment, the information processing system may automatically create the execution time calculation logic. For example, a function that statistically processes input data to automatically create calculation logic in consideration of the causal relationship between the amount of data and execution time, and analyzes items that have a causal relationship with execution time other than the amount of data. The information processing system has a function to automatically build a prediction model of calculation logic, and when an application is registered in the application management server 110, the information processing system generates execution time calculation logic and the application management storage unit. It may be registered in 111.

なお、図４では説明を分かりやすくするために、実行コード４０２、並列度計算ロジック４０３、計算ロジック４０６のカラムには、実行コードや計算ロジックのファイル名（AppA.appなど）のみが記載されているが、これらのカラムにファイルの実体も格納される。あるいは別の実施形態として、実行コードや計算ロジックのファイルの実体は、アプリ管理記憶部１１１（を構成する補助記憶装置２０３の記憶領域）に格納され、実行コード４０２、並列度計算ロジック４０３、計算ロジック４０６のカラムには、各ファイルのパス名が格納される形態であってもよい。 In FIG. 4, for the sake of clarity, only the execution code and the file name of the calculation logic (AppA.app, etc.) are described in the columns of the execution code 402, the parallel degree calculation logic 403, and the calculation logic 406. However, the actual file is also stored in these columns. Alternatively, as another embodiment, the actual file of the execution code and the calculation logic is stored in the application management storage unit 111 (the storage area of the auxiliary storage device 203 constituting the storage area), and the execution code 402, the parallel degree calculation logic 403, and the calculation. The path name of each file may be stored in the column of logic 406.

図５は、クラスタ管理サーバ１２０内に保持されているノード-クラスタ管理情報記憶部１２１のテーブルの例を示す図である。本実施例ではノード-クラスタ管理情報記憶部１２１はアプリ管理記憶部１１１と同様に、メモリ２０２や補助記憶装置２０３の記憶領域上に形成されたテーブルに、各種情報を記憶させる例を説明する。 FIG. 5 is a diagram showing an example of a table of the node-cluster management information storage unit 121 held in the cluster management server 120. In this embodiment, an example will be described in which the node-cluster management information storage unit 121 stores various information in a table formed on the storage area of the memory 202 or the auxiliary storage device 203, similarly to the application management storage unit 111.

ノード-クラスタ管理情報記憶部１２１は、アプリケーションを配置可能な全ての処理サーバ１３０の情報を管理しており、またこれらの処理サーバ１３０のうち、同一アプリケーションが配置されて、クラスタを形成している処理サーバ１３０の情報もテーブル５００に格納して管理している。 The node-cluster management information storage unit 121 manages the information of all the processing servers 130 to which the applications can be arranged, and among these processing servers 130, the same application is arranged to form a cluster. The information of the processing server 130 is also stored and managed in the table 500.

ノード-クラスタ管理情報記憶部１２１が有するテーブル５００の各行（レコード）は、図５に示す、６つのカラムを有し、各レコードには情報処理システム内の処理サーバ１３０についての情報が格納される。ノード名５０１には、処理サーバ１３０の名称を格納するための欄である。各処理サーバ１３０は情報処理システム内で一意な名称を有しており、本実施例ではその名称を「ノード名」と呼ぶ。IPアドレス５０２には、ノード名５０１で特定される処理サーバ１３０のＩＰアドレスが格納される。ＣＰＵＣｏｒｅ数５０３には、処理サーバ１３０の有するプロセッサコア（ＣＰＵＣｏｒｅ）の数が格納される。 Each row (record) of the table 500 included in the node-cluster management information storage unit 121 has six columns as shown in FIG. 5, and each record stores information about the processing server 130 in the information processing system. .. The node name 501 is a column for storing the name of the processing server 130. Each processing server 130 has a unique name in the information processing system, and the name is referred to as a "node name" in this embodiment. The IP address 502 stores the IP address of the processing server 130 specified by the node name 501. The number of CPU Cores 503 stores the number of processor cores (CPU Cores) included in the processing server 130.

クラスタ名５０４には、処理サーバ１３０がクラスタに属している場合、所属しているクラスタの名称が格納され、割り当てＣＰＵＣｏｒｅ数５０５には、クラスタに割り当てられているプロセッサコア数が格納される。そのため、ＣＰＵＣｏｒｅ数５０３と割り当てＣＰＵＣｏｒｅ数５０５の差を算出することで、まだいずれのクラスタにも割り当てられていないプロセッサコア（「未使用コア」と呼ぶ）の数が求められる。またアプリ名５０６には、処理サーバ１３０に配置されているアプリケーションのアプリ名が格納される。 When the processing server 130 belongs to the cluster, the name of the cluster to which the processing server 130 belongs is stored in the cluster name 504, and the number of processor cores assigned to the cluster is stored in the allocated CPU Core number 505. Therefore, by calculating the difference between the number of CPU cores 503 and the number of allocated CPU cores 505, the number of processor cores (referred to as "unused cores") that have not yet been allocated to any cluster can be obtained. Further, the application name 506 stores the application name of the application arranged on the processing server 130.

なお本実施例では、処理サーバ１３０がいわゆるマルチコアプロセッサを有する前提で説明しているが、処理サーバ１３０の有するプロセッサがシングルコアプロセッサの場合、ＣＰＵＣｏｒｅ数５０３や割り当てＣＰＵＣｏｒｅ数５０５には、プロセッサコア数に代えてプロセッサ数が格納される。 In this embodiment, the description is made on the premise that the processing server 130 has a so-called multi-core processor. However, when the processor of the processing server 130 is a single-core processor, the CPU Core number 503 and the allocated CPU Core number 505 include processors. The number of processors is stored instead of the number of cores.

また本実施例では、ノード-クラスタ管理情報記憶部１２１が有するテーブル５００の各レコードのうち、クラスタ名５０４が同じレコードの集合に含まれる情報、特にこれらのレコードのカラム５０４〜５０６の情報を、「クラスタ情報」と呼ぶ。図５において、行５１０−１と行５１０−２のカラム５０４〜５０６がそれぞれ、クラスタ“User1-AppB-1”のクラスタ情報、クラスタ“User2-AppA-5”のクラスタ情報である。クラスタ情報を参照することで、クラスタに所属している処理サーバ１３０、ＣＰＵＣｏｒｅ数を知ることができる。 Further, in this embodiment, among the records in the table 500 held by the node-cluster management information storage unit 121, the information in which the cluster name 504 is included in the same set of records, particularly the information in columns 504 to 506 of these records is displayed. Called "cluster information". In FIG. 5, columns 504 to 506 in rows 510-1 and 510-2 are cluster information of the cluster “User1-AppB-1” and cluster information of the cluster “User2-AppA-5”, respectively. By referring to the cluster information, the number of processing servers 130 and CPU cores belonging to the cluster can be known.

後述するクラスタ管理サーバ１２０のクラスタ生成部１２２がクラスタを生成（定義）するとき、クラスタに所属させる処理サーバ１３０をテーブル５００の中から選択する。そしてクラスタ生成部１２２は、選択された処理サーバ１３０に対応するレコードのカラム５０４〜５０６に、クラスタ名や使用するＣＰＵＣｏｒｅ数などの情報を格納する。本実施例ではクラスタ生成部１２２が、カラム５０４〜５０６に、クラスタ名等の情報を格納する処理を「クラスタ情報を作成する」処理と呼ぶ。クラスタ情報が作成されることにより、アプリケーションの実行に使用される計算機リソースが実質的に確保（予約）されることを意味する。また、クラスタ情報が作成されると、処理サーバ１３０のクラスタ情報記憶部３２５にも作成されたクラスタ情報が配置される。 When the cluster generation unit 122 of the cluster management server 120, which will be described later, creates (defines) a cluster, the processing server 130 to belong to the cluster is selected from the table 500. Then, the cluster generation unit 122 stores information such as the cluster name and the number of CPU cores to be used in the columns 504 to 506 of the record corresponding to the selected processing server 130. In this embodiment, the process in which the cluster generation unit 122 stores information such as the cluster name in columns 504 to 506 is called a process of "creating cluster information". By creating the cluster information, it means that the computer resources used for executing the application are substantially reserved (reserved). When the cluster information is created, the created cluster information is also arranged in the cluster information storage unit 325 of the processing server 130.

逆に定義されたクラスタにおけるアプリケーションの実行が終了すると、クラスタ破棄部１２３がカラム５０４〜５０６からクラスタ名等の情報を削除する。この処理は「クラスタ情報を削除する」処理と呼ばれる。クラスタ情報の削除により、アプリケーションの実行のために確保されていた計算機リソースが実質的に解放され、解放された計算機リソースを他の用途に使用することができるようになる。 On the contrary, when the execution of the application in the defined cluster is completed, the cluster destruction unit 123 deletes the information such as the cluster name from the columns 504 to 506. This process is called the "delete cluster information" process. By deleting the cluster information, the computer resources reserved for executing the application are substantially released, and the released computer resources can be used for other purposes.

ここで、処理サーバ１３０としてクラウド（非図示）上の計算機リソースを使う場合、つまりクラスタ生成の要求ごとにクラウド上の計算機リソースを確保して使用する場合は、計算機リソースが確保されるたびにノード-クラスタ管理情報記憶部１２１のテーブルにレコードが追加され、アプリケーションの実行が終了してクラスタを削除すると、そのレコードが削除される。 Here, when the computer resource on the cloud (not shown) is used as the processing server 130, that is, when the computer resource on the cloud is secured and used for each request for cluster generation, the node is used every time the computer resource is secured. -A record is added to the table of the cluster management information storage unit 121, and when the execution of the application ends and the cluster is deleted, the record is deleted.

また、処理サーバ１３０が複数のＣＰＵＣｏｒｅを保持しており、アプリケーションの並列度が、処理サーバ１３０の有するＣＰＵＣｏｒｅ数より少ない場合は、１つの処理サーバ１３０に複数のアプリケーションが配置されることもあり得る。その場合は、処理サーバ１３０は複数のクラスタに所属することになる。 Further, when the processing server 130 holds a plurality of CPU cores and the degree of parallelism of applications is less than the number of CPU cores possessed by the processing server 130, a plurality of applications may be arranged in one processing server 130. possible. In that case, the processing server 130 belongs to a plurality of clusters.

また、本実施例では、処理サーバ１３０がｎ個のＣＰＵＣｏｒｅを有している場合、アプリケーションの実行コードをｎ個並列実行可能という前提で、計算機リソースの確保が行われる。そのため、アプリケーションの並列度が４の場合（アプリケーション利用者がアプリケーションを４並列実行させたい場合）、クラスタ管理サーバ１２０のクラスタ生成部１２２（後述）は、未使用コアを有する処理サーバ１３０を１または複数選択する。その際クラスタ生成部１２２は、選択された処理サーバ１３０が有する未使用コアの数が４つ（以上）になるように、処理サーバ１３０を選択する。 Further, in this embodiment, when the processing server 130 has n CPU cores, computer resources are secured on the premise that n application execution codes can be executed in parallel. Therefore, when the degree of parallelism of the application is 4 (when the application user wants to execute the application in parallel of 4), the cluster generation unit 122 (described later) of the cluster management server 120 sets the processing server 130 having an unused core to 1 or Select multiple. At that time, the cluster generation unit 122 selects the processing server 130 so that the number of unused cores of the selected processing server 130 is four (or more).

たとえば情報処理システム内に、図５のテーブル５００に示されているように、Ｎｏｄｅ１〜Ｎｏｄｅ８の処理サーバ１３０が存在し、Ｎｏｄｅ１〜Ｎｏｄｅ５のＣＰＵＣｏｒｅが既に何らかのアプリケーションに割り当てられている場合、未使用コアを２以上有する処理サーバ１３０としてＮｏｄｅ５，Ｎｏｄｅ６が選択されるとよい。そしてこの場合、クラスタ生成部１２２はＮｏｄｅ５とＮｏｄｅ６の割り当てＣＰＵＣｏｒｅ５０５に２を加算することで、計算機リソース（ＣＰＵＣｏｒｅ）を確保するとよい。 For example, in the information processing system, as shown in Table 500 of FIG. 5, when the processing server 130 of Node1 to Node8 exists and the CPU Core of Node1 to Node5 is already assigned to some application, it is not used. Node 5 and Node 6 may be selected as the processing server 130 having two or more cores. In this case, the cluster generation unit 122 may secure the computer resource (CPU Core) by adding 2 to the allocated CPU Core505 of Node5 and Node6.

ただし、アプリケーションの特性によっては、ＣＰＵＣｏｒｅの数以外に、メモリ量やＣＰＵの処理性能を考慮して、１または複数の処理サーバ１３０が選択されてもよい。 However, depending on the characteristics of the application, one or a plurality of processing servers 130 may be selected in consideration of the amount of memory and the processing performance of the CPU in addition to the number of CPU Cores.

図６は、アプリ管理サーバ１１０のアプリ実行時間計算部１１２の動作フローの例である。まず、アプリ実行時間計算部１１２は要求発行元から、アプリ名、入力データ、並列度を引数として指定した、アプリ実行時間計算依頼を受け付ける（ステップ６０１）。本実施例ではアプリ実行時間計算依頼の要求発行元は、リクエスト受付サーバ１０３とする。また並列度は、アプリケーションを構成する処理毎に指定されてもよい。たとえばアプリケーションが図４の４１０のように処理Ａａ，Ａｂ，Ａｃから構成されており、処理Ａａ，Ａｂがそれぞれ並列実行可能な処理の場合、要求発行元は処理Ａａの並列度と処理Ａｂの並列度を引数として指定したアプリ実行時間計算依頼を、アプリ実行時間計算部１１２に発行してもよい。ただし以下の説明では、特に断りのない限り、並列度が１つだけ指定される例（並列実行可能な各処理がいずれも、同じ並列度で実行される例）を説明し、またここで指定される並列度をｎとする。 FIG. 6 is an example of the operation flow of the application execution time calculation unit 112 of the application management server 110. First, the application execution time calculation unit 112 receives an application execution time calculation request from the request issuer, in which the application name, input data, and degree of parallelism are specified as arguments (step 601). In this embodiment, the request issuing source of the application execution time calculation request is the request reception server 103. Further, the degree of parallelism may be specified for each process constituting the application. For example, when an application is composed of processes Aa, Ab, and Ac as shown in 410 of FIG. 4, and the processes Aa and Ab are processes that can be executed in parallel, the request issuing source is the degree of parallelism of the process Aa and the parallelism of the process Ab. The application execution time calculation request in which the degree is specified as an argument may be issued to the application execution time calculation unit 112. However, in the following explanation, unless otherwise specified, an example in which only one degree of parallelism is specified (an example in which each process that can be executed in parallel is executed at the same degree of parallelism) will be described and specified here. Let n be the degree of parallelism.

次にアプリ実行時間計算部１１２は、アプリ管理記憶部１１１から、アプリ名に対応した並列度計算ロジック４０３と処理フロー４０４内の各処理に対応する計算ロジック４０６を取得する（ステップ６０２）。そしてアプリ実行時間計算部１１２は、並列度計算ロジック４０３を利用して、入力データ量からアプリケーションの各処理の繰り返し数を算出し（ステップ６０３）、次に各処理の計算ロジック４０６を利用して、各処理が入力データに対応した処理を１回実行する時の実行時間を算出する（ステップ６０４）。 Next, the application execution time calculation unit 112 acquires the parallel degree calculation logic 403 corresponding to the application name and the calculation logic 406 corresponding to each process in the processing flow 404 from the application management storage unit 111 (step 602). Then, the application execution time calculation unit 112 uses the parallel degree calculation logic 403 to calculate the number of repetitions of each process of the application from the amount of input data (step 603), and then uses the calculation logic 406 of each process. , Calculate the execution time when each process executes the process corresponding to the input data once (step 604).

次にアプリ実行時間計算部１１２は、ステップ６０３で求められた各処理の繰り返し数と、ステップ６０４で求められた各処理の１回の実行時間を用いて、アプリケーションの実行時間（並列処理を行わない場合の実行時間）を計算し（ステップ６０５）、さらに並列実行可能な処理群が並列実行された場合の、各処理の繰り返し回数、各処理の実行時間、アプリケーションの合計実行時間を算出し、実行結果を要求発行元に返信する（ステップ６０６）。各処理が並列実行される場合の、繰り返し回数や実行時間は、ステップ６０３で求められた各処理の繰り返し数と、ステップ６０４で求められた各処理の１回の実行時間をそれぞれ、並列度（ｎ）で除算することにより求められる。 Next, the application execution time calculation unit 112 uses the number of repetitions of each process obtained in step 603 and the execution time of each process obtained in step 604 to execute the application (parallel processing is performed). (Execution time when there is no) is calculated (step 605), and when the processes that can be executed in parallel are executed in parallel, the number of repetitions of each process, the execution time of each process, and the total execution time of the application are calculated. The execution result is returned to the request issuer (step 606). When each process is executed in parallel, the number of repetitions and the execution time are the number of repetitions of each process obtained in step 603 and the execution time of each process obtained in step 604, respectively. It is obtained by dividing by n).

アプリ実行時間計算部１１２は上に述べたフローを実行することで、アプリケーションの実行時間を入力データと並列度から瞬時に計算して、アプリケーション利用者に対して計算時間に対する情報を提示する。これによりアプリケーション利用者は、許容可能な実行時間に対する並列度を試行錯誤しながら決定することができる。 By executing the flow described above, the application execution time calculation unit 112 instantly calculates the execution time of the application from the input data and the degree of parallelism, and presents the information on the calculation time to the application user. This allows the application user to determine the degree of parallelism with respect to the allowable execution time by trial and error.

図７は、クラスタ管理サーバ１２０のクラスタ生成部１２２の動作フローの例である。まず、クラスタ生成部１２２は要求発行元から発行されたクラスタ生成依頼を受け付ける（ステップ７０１）。本実施例では、クラスタ生成依頼の要求発行元は、リクエスト受付サーバ１０３とする。またクラスタ生成依頼には、アプリ名と並列度が引数として含まれている。 FIG. 7 is an example of the operation flow of the cluster generation unit 122 of the cluster management server 120. First, the cluster generation unit 122 receives the cluster generation request issued by the request issuer (step 701). In this embodiment, the request issuing source of the cluster generation request is the request receiving server 103. In addition, the cluster generation request includes the application name and the degree of parallelism as arguments.

次にクラスタ生成部１２２は、ノード-クラスタ管理情報記憶部１２１を見て、まだノード-クラスタ管理情報記憶部１２１に記録されていない名称のクラスタ名を生成することで、今回生成されるクラスタに一意な名称を付す（ステップ７０２）。そしてクラスタ生成部１２２はノード-クラスタ管理情報記憶部１２１を参照することで、まだどのクラスタにも割り当てられていないプロセッサコアを有する処理サーバ１３０を１または複数選択して（ステップ７０３）、ノード-クラスタ管理情報記憶部１２１にクラスタ情報を作成する（ステップ７０４）。ステップ７０３における処理サーバ１３０の選択方法は、図５の説明で述べたため、ここでの説明は略す。 Next, the cluster generation unit 122 looks at the node-cluster management information storage unit 121 and generates a cluster name having a name that has not yet been recorded in the node-cluster management information storage unit 121, so that the cluster generated this time is created. Give it a unique name (step 702). Then, by referring to the node-cluster management information storage unit 121, the cluster generation unit 122 selects one or more processing servers 130 having processor cores that have not yet been assigned to any cluster (step 703), and the node- Create cluster information in the cluster management information storage unit 121 (step 704). Since the method of selecting the processing server 130 in step 703 has been described in the description of FIG. 5, the description here will be omitted.

次にクラスタ生成部１２２は、選定した処理サーバ１３０にアプリケーションを配置するために、アプリ管理サーバ１１０からアプリ名に対応するアプリケーションの実行コード４０２を取得して、各処理サーバにアプリケーションの配置を依頼する（ステップ７０５、７０６）。アプリケーションの配置を依頼された処理サーバ１３０で行われる処理については、後で説明する。 Next, the cluster generation unit 122 acquires the application execution code 402 corresponding to the application name from the application management server 110 and requests each processing server to arrange the application in order to arrange the application on the selected processing server 130. (Steps 705 and 706). The processing performed by the processing server 130 requested to deploy the application will be described later.

続いてクラスタ生成部１２２は、アプリケーションの実行コード４０２を配置した処理サーバ１３０の中から、処理サーバ（振分）１３０となる処理サーバ１３０を選択して（ステップ７０７）、クラスタ名と処理サーバ（振分）１３０へのアクセスＵＲＬ（ＵｎｉｆｏｒｍＲｅｓｏｕｒｃｅＬｏｃａｔｏｒ）を、要求発行元に返信する（ステップ７０８）。 Subsequently, the cluster generation unit 122 selects the processing server 130 to be the processing server (distribution) 130 from the processing server 130 in which the application execution code 402 is arranged (step 707), and selects the cluster name and the processing server (step 707). The access URL (Uniform Resource Locator) to the distribution) 130 is returned to the request issuer (step 708).

図８は、クラスタ管理サーバ１２０のクラスタ破棄部１２３の動作フローの例である。まず、クラスタ破棄部１２３は要求発行元から、クラスタ名が引数に指定されたクラスタ破棄依頼を受け付ける（ステップ８０１）。ここでも要求発行元はリクエスト受付サーバ１０３とする。次にクラスタ破棄部１２３は、ノード-クラスタ管理情報記憶部１２１からクラスタ内の処理サーバ１３０の情報を取得して（ステップ８０２）、各処理サーバ１３０にアプリケーションを削除させる（ステップ８０３）。削除が完了すると、クラスタ破棄部１２３はノード-クラスタ管理情報記憶部１２１のクラスタ情報を削除して（ステップ８０４）、完了通知を要求発行元に返信する（ステップ８０５）。 FIG. 8 is an example of the operation flow of the cluster destruction unit 123 of the cluster management server 120. First, the cluster destruction unit 123 receives a cluster destruction request in which the cluster name is specified as an argument from the request issuer (step 801). Here, too, the request issuing source is the request receiving server 103. Next, the cluster discard unit 123 acquires the information of the processing servers 130 in the cluster from the node-cluster management information storage unit 121 (step 802), and causes each processing server 130 to delete the application (step 803). When the deletion is completed, the cluster destruction unit 123 deletes the cluster information of the node-cluster management information storage unit 121 (step 804), and returns a completion notification to the request issuer (step 805).

図９は、アプリケーション利用者が、本実施例に係る情報処理システムを用いてアプリケーションの実行を要求した時に、情報処理システム内の各サーバで行われる処理の流れを表したシーケンス図である。図９では、クライアント端末１０１がリクエスト受付サーバ１０３に要求を発行し、アプリケーションを実行するクラスタが生成されるまでの処理の流れが記述されている。 FIG. 9 is a sequence diagram showing a flow of processing performed by each server in the information processing system when the application user requests execution of the application using the information processing system according to the present embodiment. FIG. 9 describes a processing flow from the client terminal 101 issuing a request to the request reception server 103 to the generation of a cluster for executing the application.

まずクライアント端末１０１はアプリケーション利用者から、アプリケーション利用者が利用するアプリケーションのアプリ名と入力データを受け付けると、リクエスト受付サーバ１０３にアプリケーションの登録依頼を送信する（９０１）。このアプリケーションの登録依頼には、アプリケーション名（たとえば“ＡｐｐＡ”など）と入力データが含まれる。リクエスト受付サーバ１０３はこの登録依頼に応じて、まず入力データをデータ管理サーバ１０４に登録する（９０２、９０３）。データ管理サーバ１０４は入力データを受領すると、入力データへのアクセス方法であるアクセスＵＲＬ（９０４）をリクエスト受付サーバ１０３に返送する。リクエスト受付サーバ１０３はアクセスＵＲＬ（９０４）を受け取ると、クライアント端末１０１にＯＫ（９０５）を返信する。この時、リクエスト受付サーバ１０３は、入力データへのＵＲＬとアプリ名とを対応付けて保持する。 First, when the client terminal 101 receives the application name and input data of the application used by the application user from the application user, the client terminal 101 transmits an application registration request to the request reception server 103 (901). The application registration request includes an application name (eg, "AppA") and input data. In response to this registration request, the request reception server 103 first registers the input data in the data management server 104 (902, 903). When the data management server 104 receives the input data, it returns the access URL (904), which is an access method for the input data, to the request reception server 103. When the request reception server 103 receives the access URL (904), it returns OK (905) to the client terminal 101. At this time, the request reception server 103 holds the URL to the input data and the application name in association with each other.

次にアプリケーション利用者は、クライアント端末１０１を用いて並列度（９０６）を指定する。リクエスト受付サーバ１０３は並列度を受け取ると、アプリ管理サーバ１１０のアプリ実行時間計算部１１２に、繰り返し数と各処理の実行時間を計算させて（９０７、９０８、９０９）、その結果をクライアント端末１０１に返信する（９１０）。９０７、９０８、９０９でアプリ管理サーバ１１０で行われる処理は、図６の処理に相当する。 Next, the application user specifies the degree of parallelism (906) using the client terminal 101. When the request reception server 103 receives the degree of parallelism, the application execution time calculation unit 112 of the application management server 110 is made to calculate the number of repetitions and the execution time of each process (907, 908, 909), and the result is obtained by the client terminal 101. Reply to (910). The processing performed by the application management server 110 at 907, 908, and 909 corresponds to the processing of FIG.

アプリケーション利用者は、アプリ実行時間計算部１１２によって算出されるアプリケーションの実行時間が、アプリケーション利用者の希望する時間に収まるようになるまで、並列度を変更しながら、９０６〜９１０の処理を繰り返す。たとえばある並列度（ｎとする）が指定された時に算出されたアプリケーションの実行時間が、アプリケーション利用者の希望する実行時間より長かった場合には、アプリケーション利用者は、ｎよりも高い並列度（たとえば（ｎ＋１）等）を指定して、アプリ実行時間計算部１１２にアプリケーションの実行時間を算出させるとよい。逆に算出されたアプリケーションの実行時間が、アプリケーション利用者の希望する時間よりも短かった場合、アプリケーション利用者は最初に指定した並列度（ｎ）よりも低い並列度（たとえば（ｎ−１）等）を指定して、アプリ実行時間計算部１１２にアプリケーションの実行時間を算出させてもよい。 The application user repeats the processes of 906 to 910 while changing the degree of parallelism until the application execution time calculated by the application execution time calculation unit 112 falls within the time desired by the application user. For example, if the execution time of the application calculated when a certain degree of parallelism (let's say n) is specified is longer than the execution time desired by the application user, the application user has a degree of parallelism higher than n (assuming n). For example, (n + 1) and the like) may be specified to cause the application execution time calculation unit 112 to calculate the execution time of the application. On the contrary, when the calculated execution time of the application is shorter than the time desired by the application user, the application user has a parallel degree lower than the initially specified parallel degree (n) (for example, (n-1), etc.). ) May be specified to cause the application execution time calculation unit 112 to calculate the execution time of the application.

アプリケーション利用者は上に述べた９０６〜９１０の処理を繰り返すことで、実際にアプリケーションを実行する時の並列度を決定する（以下では、ここでアプリケーション利用者が決定した並列度を「実行時並列度」と呼び、図９の９０６でアプリケーション利用者が指定する並列度とを区別する）。実行時並列度が決定されると、アプリケーション利用者はクライアント端末１０１から、実行時並列度とアプリケーション名を指定したクラスタ生成依頼をリクエスト受付サーバ１０３経由でクラスタ管理サーバ１２０に送信する（９１１，９１２）。ここの処理でアプリケーション利用者が並列度等を指定するための具体的な方法については、後で図１２（または図１３）を用いて説明する。 The application user determines the degree of parallelism when actually executing the application by repeating the processes of 906 to 910 described above (hereinafter, the degree of parallelism determined by the application user here is referred to as "runtime parallelism". It is called "degree" and is distinguished from the degree of parallelism specified by the application user in 906 of FIG. 9). When the run-time parallelism is determined, the application user sends a cluster generation request specifying the run-time parallelism and the application name from the client terminal 101 to the cluster management server 120 via the request reception server 103 (911, 912). ). A specific method for the application user to specify the degree of parallelism and the like in this process will be described later with reference to FIG. 12 (or FIG. 13).

クラスタ管理サーバ１２０はクラスタ生成依頼（９１２）を受け取ると、クラスタ生成部１２２により、クラスタ名を作成し（９１３）、実行時並列度に応じた処理サーバ１３０の計算機リソース（ＣＰＵＣｏｒｅ）の確保を行い（９１４）、ノード-クラスタ管理情報記憶部１２１にクラスタ情報を作成する（９１５）。９１２〜９１５の処理はそれぞれ、図７のステップ７０１〜７０４に相当する処理である。 When the cluster management server 120 receives the cluster generation request (912), the cluster generation unit 122 creates a cluster name (913) and secures the computer resource (CPU Core) of the processing server 130 according to the degree of parallelism at runtime. Perform (914) and create cluster information in the node-cluster management information storage unit 121 (915). The processes of 912 to 915 are the processes corresponding to steps 701 to 704 of FIG. 7, respectively.

続いてクラスタ生成部１２２は、アプリ管理サーバ１１０からアプリケーションの実行コード（９１６）を取得して（９１７）、各処理サーバ１３０にアプリケーションの配置を依頼する（９１８）。９１７〜９１８の処理はそれぞれ、図７のステップ７０５〜７０６に相当する処理である。クラスタ生成部１２２が処理サーバ１３０にアプリケーションの配置を依頼する際、アプリケーションの実行コード、そしてクラスタ情報を処理サーバ１３０に送信する。 Subsequently, the cluster generation unit 122 acquires the application execution code (916) from the application management server 110 (917), and requests each processing server 130 to arrange the application (918). The processes of 917 to 918 are the processes corresponding to steps 705 to 706 in FIG. 7, respectively. When the cluster generation unit 122 requests the processing server 130 to deploy the application, the execution code of the application and the cluster information are transmitted to the processing server 130.

アプリケーションの配置を依頼された処理サーバ１３０は、アプリケーションをインストールするとともに（９１９）、並列処理管理部１３２のクラスタ情報記憶部３２５にクラスタ情報を作成する（９２０）。クラスタに属する各処理サーバ１３０へのアプリケーションの配置が完了すると（９２１）、クラスタ管理サーバ１２０はクラスタに属する各処理サーバ１３０の中から、処理サーバ（振分）１３０となる処理サーバ１３０を１台選定して、リクエスト受付サーバ１０３にクラスタ名とともに、処理サーバ（振分）１３０へのアクセスＵＲＬを返信する（９２３）。 The processing server 130 requested to arrange the application installs the application (919) and creates cluster information in the cluster information storage unit 325 of the parallel processing management unit 132 (920). When the placement of the application on each processing server 130 belonging to the cluster is completed (921), the cluster management server 120 includes one processing server 130 as the processing server (distribution) 130 from each processing server 130 belonging to the cluster. It is selected and the access URL to the processing server (distribution) 130 is returned to the request reception server 103 together with the cluster name (923).

リクエスト受付サーバ１０３はクライアント端末１０１にＯＫ（９２４）を返信して、処理が完了する。 The request reception server 103 returns OK (924) to the client terminal 101, and the process is completed.

図１０は、図９の処理の続きで、図９の処理によって決定された処理サーバ１３０群を利用して、アプリケーションの処理を並列に実行する動作シーケンスの例である。 FIG. 10 is an example of an operation sequence in which application processing is executed in parallel by using the processing server 130 group determined by the processing of FIG. 9 following the processing of FIG.

まず、アプリケーション利用者がクライアント端末１０１を用いてアプリケーション実行要求（１００１）をリクエスト受付サーバ１０３に発行すると、リクエスト受付サーバ１０３は、処理サーバ（振分）１３０へのアクセスＵＲＬに対して、入力データのアクセスＵＲＬと合わせて実行依頼を送信する（１００２）。 First, when the application user issues an application execution request (1001) to the request reception server 103 using the client terminal 101, the request reception server 103 receives input data for the access URL to the processing server (distribution) 130. The execution request is transmitted together with the access URL of (1002).

なお、図９（及び図１０）のシーケンス図には、リクエスト受付サーバ１０３はクライアント端末１０１にＯＫを返送し（９２４）、その後アプリケーション利用者がアプリケーション実行要求（１００１）を発行したことを契機に、処理サーバ（振分）１３０に実行依頼を送信（１００２）する例を示している。ただし別の実施形態として、リクエスト受付サーバ１０３がクラスタ管理サーバ１２０から処理サーバ（振分）１３０へのアクセスＵＲＬを受領（９２３）した後、リクエスト受付サーバ１０３はクライアント端末１０１に返信（９２４）を行うことなく、処理サーバ（振分）１３０にアプリケーションの実行依頼を送信（１００２）してもよい。 In the sequence diagram of FIG. 9 (and FIG. 10), the request reception server 103 returns OK to the client terminal 101 (924), and then the application user issues an application execution request (1001). , An example of sending an execution request (1002) to the processing server (distribution) 130 is shown. However, as another embodiment, after the request reception server 103 receives the access URL from the cluster management server 120 to the processing server (distribution) 130 (923), the request reception server 103 returns to the client terminal 101 (924). The application execution request may be transmitted (1002) to the processing server (distribution) 130 without performing the application.

処理サーバ（振分）１３０では、アプリケーションの振分部３１１が、９１１で指定された並列度（実行時並列度）と同数のメッセージＡａを生成して（１００４）、メッセージＡａ（１００５）を各処理サーバ（実行）１３０に送信する。メッセージＡａを生成する際に、入力データを利用する場合は、処理サーバ（振分）１３０はデータ管理サーバ１０４から入力データを取得する（１００３）。 In the processing server (distribution) 130, the application distribution unit 311 generates the same number of messages Aa as the parallel degree (runtime parallel degree) specified in 911 (1004), and sends each message Aa (1005). It is transmitted to the processing server (execution) 130. When the input data is used when generating the message Aa, the processing server (distribution) 130 acquires the input data from the data management server 104 (1003).

処理サーバ（実行）１３０はメッセージＡａを受け取ると、データ管理サーバ１０４に格納されている入力データの中から処理Ａａに必要な対象データ（１００６）を取得して、実行部３１２の処理Ａａを実行して（１００７）、処理結果（１００８）をデータ管理サーバ１０４に書き込むとともに、処理の完了通知（１００９）を処理サーバ（振分）１３０に返信する。 When the processing server (execution) 130 receives the message Aa, it acquires the target data (1006) required for the processing Aa from the input data stored in the data management server 104, and executes the processing Aa of the execution unit 312. Then (1007), the processing result (1008) is written to the data management server 104, and the processing completion notification (1009) is returned to the processing server (distribution) 130.

処理サーバ（振分）１３０は、メッセージを送信した全ての処理サーバ（実行）１３０から完了通知を受領すると（１００９）、次のメッセージ（図１０の例では“メッセージＡｂ”）を生成して、各処理サーバ（実行）１３０に振り分ける。処理サーバ（振分）１３０はこのように、メッセージを生成して各処理サーバ（実行）１３０にメッセージを振り分け、各処理サーバ（実行）１３０から処理の完了通知を受領する、という処理を繰り返す。そして処理サーバ（振分）１３０は、最後のメッセージ（図１０の例では“メッセージＡｃ”）に対する処理の完了通知を処理サーバ（実行）１３９から受け取ると、最終結果をデータ管理サーバ１０４から取得し（１０２２）、アプリケーションとしての実行結果を生成して（１０２３）、リクエスト受付サーバ１０３経由で実行結果（１０２４，１０２５）をクライアント端末１０１に返信する。 When the processing server (distribution) 130 receives the completion notification from all the processing servers (execution) 130 that have sent the message (1009), the processing server (distribution) 130 generates the next message (“Message Ab” in the example of FIG. 10). Allocate to each processing server (execution) 130. In this way, the processing server (distribution) 130 repeats the process of generating a message, distributing the message to each processing server (execution) 130, and receiving the processing completion notification from each processing server (execution) 130. Then, when the processing server (distribution) 130 receives the processing completion notification for the last message (“message Ac” in the example of FIG. 10) from the processing server (execution) 139, the processing server (distribution) 130 acquires the final result from the data management server 104. (1022), the execution result as an application is generated (1023), and the execution result (1024, 1025) is returned to the client terminal 101 via the request reception server 103.

図１１は、図１０の後に行われる処理、つまりアプリケーションの実行が終わってから、クラスタを破棄するまでの処理の例である。 FIG. 11 is an example of the process performed after FIG. 10, that is, the process from the end of the execution of the application to the destruction of the cluster.

まず、クライアント端末１０１からアプリケーションの実行完了通知（１１０１）をリクエスト受付サーバ１０３が受け取ると、リクエスト受付サーバ１０３は、クラスタ管理サーバ１２０に対して、クラスタ破棄依頼（１１０２）を送信し、クラスタ破棄部１２３はこのクラスタ破棄依頼を受け付ける。この処理は図８のステップ８０１に相当する処理である。先に述べたとおり、クラスタ破棄依頼には、破棄対象のクラスタ名が含まれている。 First, when the request reception server 103 receives the application execution completion notification (1101) from the client terminal 101, the request reception server 103 sends a cluster destruction request (1102) to the cluster management server 120, and the cluster destruction unit. 123 accepts this cluster destruction request. This process corresponds to step 801 in FIG. As mentioned earlier, the cluster destruction request includes the name of the cluster to be destroyed.

クラスタ破棄依頼を受け取ったクラスタ管理サーバ１２０では、クラスタ破棄部１２３がノード-クラスタ管理情報記憶部１２１を参照することで、クラスタ内の処理サーバ１３０とアプリ名を特定する（１１０３）。この処理はステップ８０２に相当する処理である。そしてクラスタ破棄部１２３は、特定された各処理サーバ１３０にアプリケーション破棄依頼（１１０４）を送信する（ステップ８０３に相当する処理である）。 In the cluster management server 120 that has received the cluster destruction request, the cluster destruction unit 123 identifies the processing server 130 and the application name in the cluster by referring to the node-cluster management information storage unit 121 (1103). This process corresponds to step 802. Then, the cluster destruction unit 123 transmits an application destruction request (1104) to each of the specified processing servers 130 (a process corresponding to step 803).

アプリケーション破棄依頼を受領した各処理サーバ１３０は、アプリケーションのアンインストール（１１０５）、クラスタ情報記憶部３２５に記録されていたクラスタ情報の破棄（１１０６）を実施した後、完了通知をクラスタ管理サーバ１２０に返送する。クラスタ破棄部１２３が各処理サーバ１３０から完了通知（１１０７）を受け取ると、ノード-クラスタ管理情報記憶部１２１のクラスタ情報を削除して（１１０８）、リクエスト受付サーバ１０３経由でクライアント端末１０１に完了通知（１１０９，１１０１）を返信する。 Each processing server 130 that has received the application destruction request notifies the cluster management server 120 of the completion after uninstalling the application (1105) and destroying the cluster information recorded in the cluster information storage unit 325 (1106). I will send it back. When the cluster destruction unit 123 receives the completion notification (1107) from each processing server 130, the cluster information of the node-cluster management information storage unit 121 is deleted (1108), and the completion notification is sent to the client terminal 101 via the request reception server 103. (1109, 1101) is returned.

図１２は、アプリケーション利用者が実行要求毎に計算機リソース量を決定するための計算機リソース量設定画面イメージの例である。本実施例では、リクエスト受付サーバ１０３がこの設定画面１２００を作成してクライアント端末１０１に提供する（クライアント端末１０１のディスプレイ装置２０９に表示させる）例を説明する。ただし、リクエスト受付サーバ１０３以外の計算機が、この設定画面１２００を作成してもよい。 FIG. 12 is an example of a computer resource amount setting screen image for the application user to determine the computer resource amount for each execution request. In this embodiment, an example in which the request reception server 103 creates the setting screen 1200 and provides it to the client terminal 101 (displayed on the display device 209 of the client terminal 101) will be described. However, a computer other than the request reception server 103 may create this setting screen 1200.

図１２において、１２０１はアプリ名入力ボックス、１２０２はデータ名入力ボックス、１２０６は並列度設定欄である。アプリケーション利用者がアプリ名入力ボックス１２０１とデータ名入力ボックス１２０２のそれぞれに、アプリケーションの名称及び入力データの名称（ファイル名）を入力することで、リクエスト受付サーバ１０３は図９の９０１〜９０５を実行する。 In FIG. 12, 1201 is an application name input box, 1202 is a data name input box, and 1206 is a parallel degree setting field. When the application user inputs the application name and the input data name (file name) in the application name input box 1201 and the data name input box 1202, respectively, the request reception server 103 executes 901 to 905 in FIG. To do.

その後リクエスト受付サーバ１０３は、アプリケーション利用者がアプリ名入力ボックス１２０１とデータ名入力ボックス１２０２に入力したアプリ名と登録した入力データを基に、まず並列処理を行わない場合の処理フロー内の各処理の繰り返し数、各処理の処理時間の予想値、各処理のトータルの実行時間の算出をアプリ実行時間計算部１１２に行わせる（図６のステップ６０５までの処理が行われる）。そしてリクエスト受付サーバ１０３は、算出されたこれらの情報（１２０４）をアプリケーションの処理フロー（１２０３）と対応付けて表示する画面を作成し、この画面をクライアント端末１０１のディスプレイ装置２０９に出力させる。 After that, the request reception server 103 first performs each process in the processing flow when parallel processing is not performed based on the application name input by the application user in the application name input box 1201 and the data name input box 1202 and the registered input data. The application execution time calculation unit 112 calculates the number of repetitions, the estimated value of the processing time of each processing, and the total execution time of each processing (processing up to step 605 in FIG. 6 is performed). Then, the request reception server 103 creates a screen for displaying the calculated information (1204) in association with the processing flow (1203) of the application, and outputs this screen to the display device 209 of the client terminal 101.

アプリケーション利用者が、この表示された情報を基に、並列度設定欄１２０６に並列度を入力すると、入力された並列度はアプリ管理サーバ１１０に送信される。先に図６や図９を用いて説明したとおり、アプリ管理サーバ１１０は、渡された並列度等を用いて並列処理を行った場合の、各処理の繰り返し数と処理時間の予想値およびアプリケーションの合計実行時間を求め、その結果を表示領域（１２０５）に表示した画面を作成し、クライアント端末１０１に表示させる。そのためアプリケーション利用者は、表示領域（１２０５）に表示されるアプリケーションの合計実行時間が、アプリケーション利用者の希望する実行時間以内になるまで、並列度設定欄１２０６に入力する並列度を少しずつ増やすことを繰り返すとよい。 When the application user inputs the parallel degree in the parallel degree setting field 1206 based on this displayed information, the input parallel degree is transmitted to the application management server 110. As described above with reference to FIGS. 6 and 9, when the application management server 110 performs parallel processing using the passed degree of parallelism or the like, the expected value of the number of repetitions of each processing and the processing time and the application The total execution time of the above is calculated, the result is displayed in the display area (1205), a screen is created, and the result is displayed on the client terminal 101. Therefore, the application user gradually increases the degree of parallelism input to the parallel degree setting field 1206 until the total execution time of the application displayed in the display area (1205) is within the execution time desired by the application user. It is good to repeat.

また、使用する計算機リソースの量と計算機リソースの使用時間に応じて、アプリケーション利用者が情報処理システムの管理者（または所有者）に使用料金を支払うように、情報処理システムが運営されている場合、計算機リソース量の設定画面１２００にコスト表示欄（１２０８）を設け、リクエスト受付サーバ１０３（またはアプリ管理サーバ１１０）はアプリケーションの並列度とアプリケーションの実行時間（アプリケーションが並列実行される場合の実行時間）に応じたコスト（情報処理システムの使用料金）を算出し、算出されたコストの情報をアプリケーション利用者に対して提供してもよい。これによりアプリケーション利用者は、アプリケーションを完了させたい実行時間と並列度に応じて掛かるコストのバランスを見ながら、今回の実行要求を満たす並列度（実行時並列度）を決定することができる。 In addition, when the information processing system is operated so that the application user pays the usage fee to the administrator (or owner) of the information processing system according to the amount of computer resources used and the usage time of the computer resources. , A cost display column (1208) is provided on the computer resource amount setting screen 1200, and the request reception server 103 (or application management server 110) has the degree of parallelism of the application and the execution time of the application (execution time when the application is executed in parallel). ) May be calculated, and the calculated cost information may be provided to the application user. As a result, the application user can determine the degree of parallelism (runtime parallelism) that satisfies the current execution request while observing the balance between the execution time for completing the application and the cost incurred according to the degree of parallelism.

アプリケーション利用者が実行時並列度を決定した後確定ボタン（１２０７）を押すと、図９の９１１，９１２の処理が行われる。つまりリクエスト受付サーバ１０３はクライアント端末１０１から、アプリケーション利用者がアプリ名入力ボックス１２０１と並列度設定欄１２０６に設定したアプリケーション名称と並列度（実行時並列度）とを受け取る。そしてリクエスト受付サーバ１０３はクラスタ管理サーバ１２０に対して、実行時並列度とアプリケーション名を指定したクラスタ生成依頼を送信する（図９の９１１，９１２の処理が行われる）。クラスタの生成が完了し、リクエスト受付サーバ１０３がクラスタ管理サーバ１２０からの返答を受領すると（図９９２３）、リクエスト受付サーバ１０３は処理サーバ（振分）１３０にアプリケーションの実行依頼を送信する（図１０１００２）。 When the application user presses the confirm button (1207) after determining the run-time parallel degree, the processes of 911 and 912 in FIG. 9 are performed. That is, the request reception server 103 receives from the client terminal 101 the application name set by the application user in the application name input box 1201 and the parallel degree setting field 1206, and the degree of parallelism (runtime parallel degree). Then, the request reception server 103 transmits a cluster generation request specifying the run-time parallelism and the application name to the cluster management server 120 (the processing of 911 and 912 in FIG. 9 is performed). When the cluster generation is completed and the request reception server 103 receives the response from the cluster management server 120 (FIG. 9 923), the request reception server 103 sends an application execution request to the processing server (distribution) 130 (FIG. 9 923). 10 1002).

本実施例に係る情報処理システムは、上で述べた機能を備えることにより、実行要求を満たす並列コンピューティングシステムの実行環境を実行要求毎に生成し、アプリケーションを並列実行させることができる。 By providing the functions described above, the information processing system according to the present embodiment can generate an execution environment of a parallel computing system that satisfies the execution request for each execution request, and execute the application in parallel.

実施例２では、アプリケーションの処理ごとに並列度を設定できる情報処理システムの例を説明する。実施例２に係る情報処理システムの構成は実施例１で説明したものと同じなので、構成の説明は略し、実施例１で説明した内容と異なる点についてのみ説明する。 In the second embodiment, an example of an information processing system capable of setting the degree of parallelism for each processing of the application will be described. Since the configuration of the information processing system according to the second embodiment is the same as that described in the first embodiment, the description of the configuration is omitted, and only the points different from the contents described in the first embodiment will be described.

図１３は、実施例２に係る計算機リソース量の設定画面１２００’の例を示している。図１３の設定画面１２００’と図１２で説明した設定画面１２００との違いは、図１３の設定画面１２００’では並列実行可能な処理毎に、並列度設定欄が設けられており（図１３１２０６’及び１２０６’’）、アプリケーション利用者は処理毎に並列度を設定可能である。また、アプリ管理サーバ１１０がアプリケーションの実行時間を計算する際には、設定画面１２００’’で処理毎に設定された並列度に基づいて計算を行う。 FIG. 13 shows an example of the computer resource amount setting screen 1200 ′ according to the second embodiment. The difference between the setting screen 1200'of FIG. 13 and the setting screen 1200 described with reference to FIG. 12 is that the setting screen 1200'of FIG. 13 is provided with a parallel degree setting column for each process that can be executed in parallel (FIG. 13 1206). 'And 1206''), the application user can set the degree of parallelism for each process. Further, when the application management server 110 calculates the execution time of the application, the calculation is performed based on the degree of parallelism set for each process on the setting screen 1200 ″.

実施例２に係る情報処理システムでは、アプリケーションの処理ごとに並列度を設定できることで、各処理の１回あたりの処理時間が異なる場合に、アプリケーション利用者は処理時間がより大きい処理の並列度の設定を大きくするなどして、トータルの実行時間の短縮の効果が大きく、コストをできるだけ小さくするような施策を選択することができるようになる。 In the information processing system according to the second embodiment, the degree of parallelism can be set for each processing of the application, so that when the processing time for each processing is different, the application user can set the degree of parallelism of the processing in which the processing time is larger. The effect of shortening the total execution time is great by increasing the setting, and it becomes possible to select measures that reduce the cost as much as possible.

このように、各処理に並列度を設定させることで、たとえば、並列後のトータルの実行時間を指定することで、各処理の並列度を算出する、といった並列度の設定方法や、コストを設定して、それに応じてもっとも実行時間が短くなるような各処理の並列度の設定を算出する、といった並列度の設定方法も考えられる。 In this way, by setting the degree of parallelism for each process, for example, by specifying the total execution time after parallelism, the degree of parallelism for each process is calculated, and the cost is set. Then, a parallel degree setting method such as calculating the parallel degree setting of each process that shortens the execution time accordingly is also conceivable.

以上により、アプリケーション利用者がアプリケーションの実行要求毎に、トータルの実行時間やコストなどの観点から、アプリケーション利用者の希望する計算機リソース量を決定でき、決定した計算機リソースを自動で確保して、アプリケーション利用者がすぐにアプリケーションを並列実行させる並列コンピューティングシステムの実行環境を提供することができる。 From the above, the application user can determine the amount of computer resources desired by the application user from the viewpoint of total execution time and cost for each application execution request, and the determined computer resources are automatically secured to apply. It is possible to provide an execution environment for a parallel computing system in which a user can immediately execute an application in parallel.

なお、上で説明した実施例では、アプリケーションの実行要求の際にクライアント端末１０１から入力データが指定される方法を説明したが、事前にデータ管理サーバ１０４にデータを登録しておき、実行要求の際に、アプリケーション利用者がデータ管理サーバ１０４に蓄積されているデータを入力データとして指定することで、入力データを処理してもよい。 In the embodiment described above, the method in which the input data is specified from the client terminal 101 at the time of the application execution request has been described, but the data is registered in the data management server 104 in advance and the execution request is made. At that time, the application user may process the input data by designating the data stored in the data management server 104 as the input data.

１０１：クライアント端末、１０２：ネットワーク、１０３：リクエスト受付サーバ、１０４：データ管理サーバ、１１０：アプリ管理サーバ、１２０：クラスタ管理サーバ、１３０：処理サーバ 101: Client terminal, 102: Network, 103: Request reception server, 104: Data management server, 110: App management server, 120: Cluster management server, 130: Processing server

Claims

An information processing system that has a management server and multiple processing servers.
Each of the processing servers has one or more processors for executing application programs.
The management server manages the usage status of computer resources possessed by each of the processing servers.
When the management server receives the degree of parallelism of the application program from the user,
From the available computer resources of the plurality of processing servers, secure the computer resources required to execute the application program at the degree of parallelism.
The application program is placed on the processing server having the reserved computer resources, and the application program is executed in parallel .
When the management server executes the application program in parallel,
Select one of the processing servers having the reserved computer resources,
Request the selected processing server to execute the application program,
The application program
An execution unit that is a program code that causes the processor to execute processing related to input data.
It has a distribution unit, which is a program code for instructing a plurality of the processors to execute the execution unit.
The processor of the selected processing server executes the distribution unit to issue a message for causing the plurality of processing servers to execute the execution unit.
Each of the processing servers that received the message executes the execution unit to execute the processing related to the input data in parallel.
An information processing system characterized by this.

The application program is a program for causing the processor to process input data received from the user.
When the management server receives the input data and the degree of parallelism (n) from the user, the execution time when one processor executes the process related to the input data by executing the application program, and n The execution time when the processes related to the input data are executed in parallel by the processors are calculated and presented to the user.
The information processing system according to claim 1, wherein the information processing system is characterized by the above.

The management server holds, for each application program, a calculation logic for calculating an execution time when the processor executes a process related to the input data by executing the application program.
The information processing system according to claim 2 , wherein the information processing system is characterized by the above.

The process related to the input data includes a first process and a second process.
The calculation logic includes a first calculation logic for calculating the execution time of the first process and a second calculation logic for calculating the execution time of the second process.
The information processing system according to claim 3 , wherein the information processing system is characterized by the above.

The management server is configured to be able to accept the degree of parallelism of the first process and the degree of parallelism of the second process.
When the management server receives the parallel degree (N) of the first process and the parallel degree (M) of the second process, the management server uses the calculation logic to use the calculation logic.
The first execution time when the first process is executed by the N processors, the second execution time when the second process is executed by the M processors, and the first execution time. The total of the second execution time and the total are calculated and presented to the user.
The information processing system according to claim 4 , wherein the information processing system is characterized by the above.

The management server calculates the usage fee of the information processing system from the degree of parallelism and the execution time when the processing related to the input data is executed in parallel, and presents it to the user.
The information processing system according to any one of claims 2 to 5 , wherein the information processing system is characterized by the above.

The management server has a management information storage unit for managing the usage status of the computer resource.
The management server holds, in the management information storage unit, the number of processors possessed by each of the processing servers and the number of processors among the processors used in executing the application program.
When the management server receives the degree of parallelism of the application program,
By referring to the management information storage unit, the number of the processing servers having the unused processor from the plurality of the processing servers is secured as many as necessary to satisfy the degree of parallelism, and the management information storage is performed. Information about the number of the processing server and the processor used in the execution of the application program is recorded in the unit in association with the name of the application program.
The reserved processing server is requested to execute the application program.
The information processing system according to any one of claims 1 to 6 , wherein the information processing system is characterized by the above.

When the management server receives from the processing server that the execution of the application program has been completed,
Have each processing server that was executing the application program execute the uninstallation of the application program.
Information about the number of the processing server and the processor used to execute the application program is deleted from the management information storage unit.
The information processing system according to claim 7 , wherein the information processing system is characterized by the above.

A control method for an information processing system having a management server and a plurality of processing servers including one or more processors for executing application programs.
a) from the management server user, the input data, a step of receiving the parallelism of the application program (n),
b) The execution time when the management server executes the process related to the input data by executing the application program by one processor, and the execution time when the application program is executed by n of the processors. A process of calculating and presenting the parallel processing execution time, which is the execution time when the processing related to the input data is executed in parallel, to the user .
c ) The process of receiving the run-time parallel degree, which is the degree of parallelism when the management server executes the application program, from the user.
d ) A step in which the management server secures the computer resources necessary for executing the application program at the run-time parallel degree from the available computer resources of the plurality of processing servers.
e ) A process in which the management server arranges the application program on the processing server having the reserved computer resources.
f ) A process in which the management server causes the processing server to execute the application program in parallel.
The execution,
In step f),
The process of selecting one of the processing servers having the reserved computer resources, and
The process in which the management server requests the selected processing server to execute the application program, and
Including
The application program
An execution unit that is a program code that causes the processor to execute processing related to input data.
It has a distribution unit, which is a program code for instructing a plurality of the processors to execute the execution unit.
further,
g) A step of issuing a message for causing the plurality of processing servers to execute the execution unit by executing the distribution unit by the processor of the selected processing server.
h) A step of executing the process related to the input data in parallel by executing the execution unit by each of the processing servers receiving the message.
A control method for an information processing system, which is characterized by executing .

The method further
i ) When the execution of the application program on the processing server is completed, the step of causing each processing server executing the application program to uninstall the application program.
The control method of the information processing system according to claim 9 , further comprising.