JP5470148B2

JP5470148B2 - Node device and computer program

Info

Publication number: JP5470148B2
Application number: JP2010096986A
Authority: JP
Inventors: 金子　　豊; ▲ミン▼錫黄; 真也竹内; 吉則和泉
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2010-04-20
Filing date: 2010-04-20
Publication date: 2014-04-16
Anticipated expiration: 2030-04-20
Also published as: JP2011227712A

Description

本発明は、ネットワーク上に分散されたノード間におけるファイルの分散管理に関し、特にＰ２Ｐ（Peer-to-Peer）型の分散ファイルシステムのノード装置及びコンピュータプログラムに関する。 The present invention relates to file distribution management among nodes distributed on a network, and more particularly to a node device and a computer program of a P2P (Peer-to-Peer) type distributed file system.

コンピュータではユーザが保存するデータをファイルと呼ばれる一連のデータの塊として取り扱う。このファイルはハードディスクドライブ等の蓄積装置に記録される。ファイルのどこのデータを蓄積装置のどの場所に記録したかを管理している部分をファイルシステムと呼び、ＯＳ（オペレーティングシステム）の一部として動作している。
分散ファイルシステムは、コンピュータネットワークで接続された複数のストレージサーバーを見かけ上１台の蓄積装置とみなして、ファイルの読み出しや書き込みなどのファイル操作を利用者に提供するシステムである。 A computer handles data stored by a user as a series of data chunks called files. This file is recorded in a storage device such as a hard disk drive. A portion that manages where data in a file is recorded in which location of the storage device is called a file system and operates as a part of an OS (operating system).
The distributed file system is a system that provides a user with file operations such as reading and writing of a file, apparently considering a plurality of storage servers connected via a computer network as one storage device.

分散ファイルシステムでは、保存したデータがどのストレージサーバーに保管されているかを管理する必要がある。管理方法は、大きく集中型と分散型に分けることができる。ここでは、分散型のファイル管理方法、特に、Ｐ２Ｐ（Peer-to-Peer）型のファイル管理方法について説明する。Ｐ２Ｐ型のファイル管理方法では、ファイル名とその保管先アドレスを一元管理しているサーバーが存在しないため、障害に強い特長を持つ。Ｐ２Ｐ型のファイル管理方法は、非構造型と構造型に分類されるが、以下に、ＤＨＴ（Distributed Hash Table）を使った構造型Ｐ２Ｐによるファイル管理技術を説明する。 In the distributed file system, it is necessary to manage to which storage server the stored data is stored. Management methods can be broadly divided into a centralized type and a distributed type. Here, a distributed file management method, particularly a P2P (Peer-to-Peer) type file management method will be described. The P2P type file management method has a strong feature against failure because there is no server that centrally manages the file name and its storage address. The P2P type file management method is classified into a non-structural type and a structural type. A file management technique based on the structural type P2P using DHT (Distributed Hash Table) will be described below.

ＤＨＴを使ったＰ２Ｐでは、複数のノードによりオーバーレイネットワークを構成する。ここで、ノードとは、オーバーレイネットワークに参加するサーバーなどの総称である。図２０は、ＤＨＴを用いたＰ２Ｐオーバーレイネットワークの原理を示す図である。同図に示す例では、円形のオーバーレイネットワークを形成している。オーバーレイネットワークに参加するノードは、例えば、ＳＨＡ（Secure Hash Algorithm）−１などのある決められたハッシュ関数によりノードＩＤを生成し、オーバーレイネットワーク内では、このノードＩＤにより各ノードを識別する。円形のオーバーレイネットワーク（図２０における実線の円）は、ノードＩＤをＩＤの数値が小さい順に時計周りに並べた状態を示している。また、各ノードが保有するファイルにはファイルＩＤを付与する。ファイルＩＤは、例えばファイル名を元にハッシュ関数を使って生成する。オーバーレイネットワーク内では、ファイルＩＤの値に時計回りに近いノードＩＤのノードがそのファイルを管理する。ここで、ファイルの管理とは、ファイルＩＤとそのファイルを保有するノードのアドレスを保存することである。 In P2P using DHT, an overlay network is configured by a plurality of nodes. Here, the node is a generic term for servers that participate in the overlay network. FIG. 20 is a diagram illustrating the principle of a P2P overlay network using DHT. In the example shown in the figure, a circular overlay network is formed. A node participating in the overlay network generates a node ID by using a predetermined hash function such as SHA (Secure Hash Algorithm) -1, and identifies each node by this node ID in the overlay network. A circular overlay network (solid-line circle in FIG. 20) shows a state in which node IDs are arranged clockwise in order of increasing numerical values of IDs. Further, a file ID is assigned to a file held by each node. The file ID is generated using a hash function based on the file name, for example. In the overlay network, a node having a node ID close to the file ID value in the clockwise direction manages the file. Here, the file management is to store the file ID and the address of the node that holds the file.

ファイルを保有するノードは、ファイルＩＤと自身のノードアドレスとの対応付けを、このファイルＩＤを管理すべきノードに登録する。ファイルの利用者は、利用したいファイルのファイルＩＤから、そのファイルＩＤに近いノードＩＤのノードをオーバーレイネットワーク上で発見し、その利用したいファイルを保有するノードのアドレスを知ることによって、ファイルにアクセスすることができる。 The node that owns the file registers the association between the file ID and its own node address in the node that should manage this file ID. The user of the file accesses the file by finding a node having a node ID close to the file ID from the file ID of the file to be used on the overlay network and knowing the address of the node that holds the file to be used. be able to.

なお、ここではファイルＩＤをキーとし、ノードアドレスをそのキーに対応した値として、ＤＨＴによるＰ２ＰをファイルＩＤからファイルを保管するノードアドレスを取得するデータベースとして利用している。このＤＨＴによるＰ２Ｐは、キーからその値を検索する汎用的なKey-Value型データベースとして利用できる。 Here, the file ID is a key, the node address is a value corresponding to the key, and P2P by DHT is used as a database for acquiring a node address for storing a file from the file ID. This P2P by DHT can be used as a general-purpose key-value type database for retrieving a value from a key.

ファイルＩＤからそのファイルＩＤを管理するノード（ファイルを記憶しているノードではないことに注意）を検索するには、各ノードが、オーバーレイネットワークに参加しているノードを検索できることが必要である。そのため、各ノードは、ノードＩＤとノードアドレスとの対応付けを登録したノードテーブルを保持し、所望のファイルＩＤに近いノードを検索する。保持しているノードテーブルから所望のファイルＩＤの管理ノードを見つけられない場合には、他のノードに転送（ルーティング）することによって、最終的に参加ノードの中からファイルＩＤに近いノードを検索する。ＤＨＴを使ったＰ２Ｐでは、ノードテーブルの保持方法および、検索方法（検索要求のルーティング方法）により様々な方式が提案されている（例えば、非特許文献１参照）。ここでは、参加している全ノードを含むノードテーブルを、全ノードが保管する方式であるOneHopについて説明する（例えば、非特許文献２参照）。 In order to search a node that manages the file ID from the file ID (note that it is not a node that stores the file), each node needs to be able to search for a node participating in the overlay network. Therefore, each node holds a node table in which a correspondence between a node ID and a node address is registered, and searches for a node close to a desired file ID. When a management node having a desired file ID cannot be found from the held node table, a node close to the file ID is finally searched from participating nodes by transferring (routing) it to another node. . In P2P using DHT, various methods have been proposed according to a node table holding method and a search method (search request routing method) (for example, see Non-Patent Document 1). Here, OneHop, which is a method in which all nodes store a node table including all participating nodes, will be described (for example, see Non-Patent Document 2).

OneHopでは、全ノードが全参加ノードの完全なテーブルを保持するため、各ノードが持つノードテーブルのサイズが大きくなる欠点があるが、検索時のルーティングを必要としないため高速な検索が実現できる。また、ノードテーブルを分散して管理するＰ２Ｐでは、オーバーレイネットワークに参加している全ノードを把握することが困難であるが、OneHopでは全参加ノードを容易に把握できるため、ノードの状態管理が必要な用途では有効な方式である。 In OneHop, all nodes hold a complete table of all participating nodes, so there is a drawback that the size of the node table of each node becomes large. However, since routing at the time of search is not required, high-speed search can be realized. In addition, with P2P, which manages the node table in a distributed manner, it is difficult to grasp all the nodes participating in the overlay network. However, since OneHop can easily grasp all the participating nodes, node state management is required. This is an effective method for various applications.

OneHopでは、ノードの参加や離脱を、オーバーレイネットワーク内の全ノードへ通知することによって、全ノードが完全なノードテーブルを保持する。そのため、図２１に示すようにオーバーレイネットワークを均等にｋ個に分割し、各領域においてスライス・リーダー（Slice Leader）を決める。さらに、ｋ個に分割された領域内を、均等にｓ個に分割し、その分割したそれぞれにおいてユニット・リーダー（Unit Leader）を決める。ノードの参加や離脱を検知したノードは、まず、自領域のスライス・リーダーに通知する（ステップＳ１）。通知を受けたスライス・リーダーは、他の領域のスライス・リーダーに通知する（ステップＳ２）。通知を受けた各領域のスライス・リーダーは、自領域内のユニット・リーダーに通知する。このように、OneHopでは、階層的構造により、ノードの参加や離脱の情報をオーバーレイネットワーク内の全ノードに通知する。 In OneHop, all nodes hold a complete node table by notifying all nodes in the overlay network of node joins and leaves. Therefore, as shown in FIG. 21, the overlay network is equally divided into k pieces, and a slice leader is determined in each region. Further, the area divided into k pieces is equally divided into s pieces, and a unit leader is determined for each of the divided areas. The node that detects the joining or leaving of the node first notifies the slice leader of its own region (step S1). The slice leader that has received the notification notifies the slice leader in another area (step S2). The slice leader of each area that has received the notification notifies the unit leader in its own area. As described above, OneHop notifies all nodes in the overlay network of node joining and leaving information by a hierarchical structure.

以上説明したように、OneHopに限らずＤＨＴを使ったＰ２Ｐ型ファイル管理技術では、ファイルＩＤの検索にノードテーブルを利用するため、ノードの参加や離脱が発生した場合にも、各ノードが保持するノードテーブルを正しい状態にしておくことが必要である。しかし、ノードの参加や離脱の通知が正しく到達しないなどが原因で、ノードテーブルに不整合が生じる場合がある。その対応として、一般に各ノードはStabilization（安定化）と呼ばれる処理を行う。Stabilizationでは、各ノードが自身の保持するノードテーブルに記録されている各ノードに対して、それら各ノードが存在するかどうかを検出するための生存確認用のパケットを定期的に送信する。これによって、ノードの参加や離脱を検知し、各ノードが保持するノードテーブルのメンテナンスを行なう。 As described above, in the P2P type file management technology using DHT as well as OneHop, the node table is used for searching the file ID. Therefore, even when a node joins or leaves, each node holds it. It is necessary to keep the node table in the correct state. However, inconsistencies may occur in the node table due to the failure of node join / leave notifications to arrive correctly. In response, each node generally performs a process called stabilization. In Stabilization, each node periodically transmits a survival confirmation packet for detecting whether or not each node exists to each node recorded in the node table held by the node. As a result, the joining or leaving of the node is detected, and the node table held by each node is maintained.

また、オーバーレイネットワークは、物理的なネットワークとは無関係にノードＩＤによって決められる仮想的なネットワークである。そのため、オーバーレイネットワーク上では隣接ノードであっても、物理的には遠距離に配置されたノードである場合もあるため、結果として物理ネットワーク上に多くのパケットが流れることになる。遠距離のノード間の通信量を減らす方法としてOneHop拡張方式が提案されている（例えば、特許文献１、非特許文献３参照）。 The overlay network is a virtual network determined by the node ID regardless of the physical network. Therefore, even if it is an adjacent node on the overlay network, it may be a node physically located at a long distance. As a result, many packets flow on the physical network. OneHop expansion method has been proposed as a method for reducing the amount of communication between nodes at long distances (see, for example, Patent Document 1 and Non-Patent Document 3).

OneHop拡張方式では、これまで説明したような、ファイルＩＤから当該ファイルＩＤを管理するノードを検索するためのノードテーブルである検索用テーブルとは別に、物理ネットワーク上のノードの位置（ローカリティ）を考慮してノードを並べたノードテーブルである管理用テーブルを保持する。そして、その管理用テーブルを使ってノードの参加や離脱の通知、Stabilizationの通信を行うことで、遠距離のノード間の通信量を減少させる。 In the OneHop expansion method, the position (locality) of the node on the physical network is considered separately from the search table that is a node table for searching the node that manages the file ID from the file ID as described above. Thus, a management table which is a node table in which nodes are arranged is held. Then, by using the management table, notification of node joining / leaving and Stabilization communication are performed, thereby reducing the amount of communication between nodes at a long distance.

特開２００９−２３０６８６号公報JP 2009-230686 A

首藤一幸, “スケールアウトの技術”, 情報処理, pp.1080-1085, Vol.50, No.11, 2009Kazuyuki Shudo, “Technology for Scale Out”, Information Processing, pp.1080-1085, Vol.50, No.11, 2009 A. Gupta, B. 他, “One Hop Lookups for Peer-to-Peer Overlays,” 9th Workshop on Hot Topics in Operating Systems (HotOS-IX), 2003.A. Gupta, B. et al., “One Hop Lookups for Peer-to-Peer Overlays,” 9th Workshop on Hot Topics in Operating Systems (HotOS-IX), 2003. 金子、他, “ノードの局所性と管理の公平性を考慮したOneHop-Ｐ２Ｐ拡張方式”, RL-006, FIT2008, 2008Kaneko et al., “OneHop-P2P extension method considering node locality and management fairness”, RL-006, FIT2008, 2008

分散ファイルシステムでは、ファイルの保管場所の管理だけでなく、利用者が分散ファイルシステムに接続するためのアクセス機能、ファイルを保持（ストレージ）するノードである各ストレージノードの使用容量を均等化する容量負荷分散機能、オーバーレイネットワークに参加しているノードの状況を把握するための管理機能など、様々な機能の追加が求められる。その場合、利用者の増減に応じてアクセス負荷分散を行うためにアクセス機能を持ったノードを増減させるなど、状況に応じて機能の追加、削除が必要になる。 In the distributed file system, in addition to managing the storage location of the file, the access function for users to connect to the distributed file system, the capacity to equalize the used capacity of each storage node that holds the file (storage) Various functions such as a load balancing function and a management function for grasping the status of nodes participating in the overlay network are required. In that case, it is necessary to add or delete functions depending on the situation, such as increasing or decreasing the number of nodes having access functions in order to distribute the access load according to the increase or decrease of users.

一方、上記において説明したように、ＤＨＴを用いたＰ２Ｐ型のファイル管理技術では、ノードの参加や離脱が発生すると、参加や離脱したノードの隣接ノードが管理するファイルＩＤの範囲が変わるため、各ノードが持つノードテーブルが古い状態（不整合な状態）の間は、所望のファイルＩＤを管理するノードを発見できないという問題がある。また、ファイルＩＤの管理ノードが変更になった場合、新しい管理ノードにファイルＩＤに対するノードの情報（ファイルの保管先のサーバーアドレスなど）が正しく登録されるまでの間もファイルＩＤを発見できないという問題がある。 On the other hand, as described above, in the P2P type file management technology using DHT, when a node joins or leaves, the range of file IDs managed by adjacent nodes of the joined or detached node changes. There is a problem that a node managing a desired file ID cannot be found while the node table of the node is old (inconsistent state). Also, when the file ID management node is changed, the file ID cannot be found until the node information (such as the server address of the file storage destination) is correctly registered in the new management node. There is.

ノードテーブルの不整合はStabilizationにより自動的に修正することができるが、修正されるまでには時間がかかるため、修正が行なわれている間はファイル（ファイルＩＤ）の検索に障害が生じてしまう。
つまり、ストレージノードと、ファイルストレージ以外の個別の機能を持った機能ノードとからなる分散ファイルシステムにおいては、必要に応じて機能ノードを頻繁にオーバーレイネットワークに参加、離脱させると、ファイルＩＤを管理するノードが頻繁に変更になってしまい、ファイルを記憶しているストレージノード自体には参加や離脱が発生しなくても、ファイルＩＤを検索できない期間が増大するという問題がある。 Inconsistencies in the node table can be corrected automatically by stabilization, but it takes time until the correction is made, so that the search for the file (file ID) will be hindered while the correction is being made. .
In other words, in a distributed file system composed of storage nodes and functional nodes having individual functions other than file storage, file IDs are managed when functional nodes are frequently joined and removed from the overlay network as necessary. There is a problem that the period when the file ID cannot be searched increases even if the node is frequently changed and the storage node itself storing the file does not participate or leave.

ノードの参加、離脱による検索障害に対応するために、ファイルＩＤの登録を管理ノードだけでなく近隣のノードにも登録しておく方法や、ノードの参加、離脱時に隣接ノードとの間でノードテーブルをコピーするなどの方法が提案されているが、いずれもその処理のためにノードに処理負荷や、ノード間の通信負荷が発生する。 In order to cope with a search failure due to node join / leave, file ID registration not only to the management node but also to neighboring nodes, and the node table between neighboring nodes at the time of node join / leave Have been proposed, but in any case, a processing load and a communication load between the nodes are generated for the processing.

本発明は、このような事情を考慮してなされたもので、その目的は、ファイルストレージ以外の機能を持ったノードが、他のノードやネットワークに負荷をかけずに、かつ、ファイルの検索に影響を及ぼすことなく、参加や離脱を行なうことが可能な分散ファイルシステムのノード装置及びコンピュータプログラムを提供することにある。 The present invention has been made in view of such circumstances, and its purpose is to allow a node having a function other than file storage to search for a file without imposing a load on other nodes and the network. It is an object of the present invention to provide a distributed file system node device and a computer program that can participate and leave without affecting.

［１］本発明の一態様は、ファイルを分散して管理する分散ファイルシステムにネットワークを介して参加するノード装置であって、ファイルを分散して記憶するストレージノード装置または付加機能を提供する機能ノード装置であり、前記分散ファイルシステムを構成するノード装置のノードＩＤとアドレスとを対応付けする管理用ノードテーブルと、前記ストレージノード装置のノードＩＤとアドレスとを対応付けする検索用ノードテーブルとを記憶する記憶部と、前記分散ファイルシステムへの各ノード装置の参加または離脱の通知情報を受信し、前記管理用ノードテーブルに記憶されている前記ノードＩＤに基づいて選択した前記アドレスをあて先として前記通知情報を送信する通知情報処理部と、ストレージノード装置が参加したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記ストレージノード装置のノードＩＤとアドレスとを前記管理用ノードテーブルと前記検索用ノードテーブルとに書き込み、機能ノード装置が参加したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記機能ノード装置のノードＩＤとアドレスとを前記管理用ノードテーブルのみに書き込み、ストレージノード装置が離脱したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記ストレージノード装置のノードＩＤと対応するアドレスとを前記管理用ノードテーブルと前記検索用ノードテーブルとから削除し、機能ノード装置が離脱したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記機能ノード装置のノードＩＤと対応するアドレスとを前記管理用ノードテーブルから削除するテーブル処理部と、指定されたファイルＩＤと、前記検索用ノードテーブルに記憶されている前記ノードＩＤとに基づいて、前記ファイルＩＤを管理する前記ストレージノード装置のアドレスを取得するテーブル検索部と、を備えることを特徴とするノード装置である。
この発明によれば、ネットワークを介して接続される複数のノード装置から構成される分散ファイルシステムにおいて、各ノード装置は、ファイル検索用のノードテーブルと、ノード装置管理用のノードテーブルを保持し、ファイルを保管するストレージノード装置については、検索用のノードテーブル及び管理用のノードテーブルにノード情報を登録し、付加機能を有するノード装置については、管理用のノードテーブルにのみノード情報を登録する。
これにより、分散ファイルシステムの稼働中であっても、ファイル管理に影響を及ぼすことなく、ファイル記憶以外の個別の機能を有するノードの追加や削除が可能となる。また、検索用ノードテーブルを同期させる必要がないため、ネットワークやノード装置に負荷がかからない。 [1] One aspect of the present invention is a node device that participates via a network in a distributed file system that distributes and manages files, and a storage node device that distributes and stores files or a function that provides an additional function A management node table that associates node IDs and addresses of node devices that constitute the distributed file system, and a search node table that associates node IDs and addresses of the storage node devices. A storage unit for storing, and notification information of participation or withdrawal of each node device in the distributed file system, and the address selected based on the node ID stored in the management node table as the destination The notification information processing unit that sends the notification information and the storage node device participated When the notification information processing unit receives the notification information indicating that, the node ID and address of the storage node device are written to the management node table and the search node table based on the notification information, When the notification information processing unit receives notification information indicating that a functional node device has participated, the node ID and address of the functional node device are written only in the management node table based on the notification information, When the notification information processing unit receives notification information indicating that the storage node device has left, the management node table and the address corresponding to the node ID of the storage node device based on the notification information The notification information indicating that the functional node device has been removed is deleted from the search node table. When the information processing unit receives, the table processing unit for deleting the address corresponding to the node ID of the functional node device based on the notification information from the management node table, the specified file ID, And a table search unit that acquires an address of the storage node device that manages the file ID based on the node ID stored in the search node table.
According to the present invention, in a distributed file system composed of a plurality of node devices connected via a network, each node device holds a node table for file search and a node table for node device management, For storage node devices that store files, node information is registered in the search node table and management node table, and for node devices having additional functions, node information is registered only in the management node table.
As a result, even when the distributed file system is in operation, it is possible to add or delete nodes having individual functions other than file storage without affecting file management. Further, since there is no need to synchronize the search node table, no load is applied to the network or the node device.

［２］本発明の一態様は、上述するノード装置であって、前記管理用ノードテーブルに登録されている前記アドレスの前記ノード装置が離脱していないかを確認する安定化部をさらに備え、前記テーブル処理部は、前記管理用ノードテーブルに登録されている前記アドレスの前記ノード装置が離脱したことを前記安定化部が検出した場合、前記管理用ノードテーブルと前記検索用ノードテーブルとから離脱したノード装置のノードＩＤとアドレスとを削除し、前記通知情報処理部は、前記管理用ノードテーブルに記憶されている前記ノードＩＤに基づいて選択した前記アドレスをあて先として、前記安定化部が検出した前記ノード装置が離脱したことを表す通知情報を送信する、ことを特徴とする。
この発明によれば、分散ファイルシステムから離脱したノード装置を検出し、検索用のノードテーブルと管理用のノードテーブルからこの離脱したノード装置の情報を削除するとともに、他のノード装置へも当該ノード装置の離脱を通知する。
これにより、ノード装置が分散ファイルシステムから離脱するときに、他のノード装置へ離脱を通知しなくとも、検索用のノードテーブルと管理用のノードテーブルを更新することができる。 [2] One aspect of the present invention is the above-described node device, further including a stabilization unit that confirms whether the node device at the address registered in the management node table has left, The table processing unit leaves the management node table and the search node table when the stabilization unit detects that the node device at the address registered in the management node table has left. And the notification information processing unit detects the address selected based on the node ID stored in the management node table as the destination, and the stabilization unit detects The notification information indicating that the node device has left is transmitted.
According to the present invention, a node device that has left the distributed file system is detected, the information of the node device that has left is deleted from the search node table and the management node table, and the node device is also transferred to other node devices. Notify device disconnection.
As a result, when the node device leaves the distributed file system, the search node table and the management node table can be updated without notifying other node devices of the departure.

［３］本発明の一態様は、上述するノード装置であって、ファイルを記憶するファイル蓄積部と、ファイルＩＤと、ファイルを記憶している前記ストレージノード装置のアドレスとを対応付けて記憶するキーテーブル蓄積部と、指定されたファイルＩＤに対応した前記ストレージノード装置のアドレスを前記キーテーブル蓄積部から読み出して出力するキーテーブル操作部と、ファイル名を指定した操作指示を受信し、前記ファイル名により特定される前記ファイル蓄積部内の前記ファイルに対する操作を行うファイル操作部と、をさらに備えることを特徴とする。
この発明によれば、ノード装置は、分散ファイルシステムにおける管理対象のファイルを記憶するとともに、ファイルが記憶されているストレージノード装置のアドレスを管理する。そして、ファイルＩＤを受信すると、該ファイルＩＤのファイルを記憶しているストレージノード装置のアドレスを返送する。また、自身が記憶しているファイルに対して、他のノード装置から指示されたファイル操作を行なう。
これにより、ノード装置は、ストレージノード装置として機能し、分散ファイルシステムにおける管理対象のファイルの所在を管理するとともに、他のノード装置から自身が記憶しているファイルに対するファイル操作を可能とする。 [3] One aspect of the present invention is the above-described node device, which stores a file storage unit that stores a file, a file ID, and an address of the storage node device that stores the file in association with each other. A key table storage unit; a key table operation unit that reads out and outputs the address of the storage node device corresponding to the specified file ID from the key table storage unit; and an operation instruction that specifies a file name; And a file operation unit for performing an operation on the file in the file storage unit specified by a name.
According to this invention, the node device stores the file to be managed in the distributed file system and manages the address of the storage node device in which the file is stored. When the file ID is received, the address of the storage node apparatus storing the file with the file ID is returned. In addition, a file operation instructed by another node device is performed on the file stored in itself.
As a result, the node device functions as a storage node device, manages the location of the file to be managed in the distributed file system, and enables file operations on files stored in itself from other node devices.

［４］本発明の一態様は、上述するノード装置であって、前記キーテーブル蓄積部は、ステータスＩＤとステータス情報とを対応付けて記憶し、自ノード装置のステータス情報を特定するステータスＩＤを前記テーブル検索部に渡し、前記ステータスＩＤに対応して前記テーブル検索部が取得したアドレスをあて先として前記ステータスＩＤ及び前記ステータス情報の登録要求を送信するステータス情報登録部をさらに備え、前記キーテーブル操作部は、ステータスＩＤ及びステータス情報の登録要求を受信した場合、登録が要求された前記ステータスＩＤと前記ステータス情報とを対応付けて前記キーテーブル蓄積部に書き込み、ステータスＩＤを指定したステータス情報の要求を受信した場合、受信した前記ステータスＩＤに対応したステータス情報を前記キーテーブル蓄積部から読み出して出力し、前記テーブル検索部は、ファイルＩＤの代わりに前記ステータスＩＤを用いて前記ストレージノード装置のアドレスを取得する、ことを特徴とする。
この発明によれば、ノード装置は、各ノード装置のステータス情報を分散管理し、問合せがあった場合には、管理しているステータス情報を出力する。
これにより、ストレージノード装置として機能するノード装置において、分散ファイルシステム内の各ストレージノード装置のステータス情報を分散し管理し、ストレージノード装置の使用ディスク容量を均等化する容量負荷分散ノード装置として機能するノード装置や、各ノード装置のステータス情報をユーザに提供する管理ノード装置として機能するノード装置にステータス情報を通知することができる。 [4] One aspect of the present invention is the node device described above, wherein the key table storage unit stores a status ID and status information in association with each other, and stores a status ID for specifying the status information of the own node device. A status information registration unit that sends the status ID and a request for registration of the status information to an address acquired by the table search unit corresponding to the status ID as a destination, the key table operation When receiving a status ID and status information registration request, the unit writes the status ID requested for registration and the status information in association with each other and writes them in the key table storage unit, and requests status information specifying the status ID. Received, the scan corresponding to the received status ID is received. The status information is read from the key table storage unit and output, and the table search unit acquires the address of the storage node device using the status ID instead of the file ID.
According to the present invention, the node device distributes and manages the status information of each node device, and outputs the managed status information when there is an inquiry.
As a result, the node device functioning as a storage node device functions as a capacity load distribution node device that distributes and manages the status information of each storage node device in the distributed file system and equalizes the used disk capacity of the storage node device. The status information can be notified to the node device and the node device functioning as a management node device that provides the status information of each node device to the user.

［５］本発明の一態様は、上述するノード装置であって、ファイル名を指定したファイル操作指示を受信し、受信したファイル名から生成されるファイルＩＤを前記テーブル検索部に渡し、前記ファイルＩＤに対応して前記テーブル検索部が取得したアドレスをあて先として前記ファイルＩＤを送信し、送信した前記ファイルＩＤに対応してファイルを記憶しているストレージノード装置のアドレスを受信し、受信したアドレスをあて先として前記ファイル名を指定したファイル操作指示を送信するファイル処理部をさらに備える、ことを特徴とする。
この発明によれば、クライアント装置からファイル操作指示を受信し、検索用ノードテーブルを利用してファイル操作対象のファイルの所在を検索し、検索の結果得られたストレージノード装置にファイル操作指示を出力する。
これにより、ノード装置はアクセスノード装置として機能し、クライアント装置に対して、分散ファイルシステムで管理しているファイルへのファイル操作を可能とする。 [5] One aspect of the present invention is the node device described above, which receives a file operation instruction specifying a file name, passes a file ID generated from the received file name to the table search unit, and The file ID is transmitted to the address acquired by the table search unit corresponding to the ID, the address of the storage node device storing the file corresponding to the transmitted file ID is received, and the received address And a file processing unit for transmitting a file operation instruction designating the file name as a destination.
According to the present invention, a file operation instruction is received from a client device, the location of the file operation target file is searched using the search node table, and the file operation instruction is output to the storage node device obtained as a result of the search To do.
As a result, the node device functions as an access node device and enables the client device to perform file operations on files managed by the distributed file system.

［６］本発明の一態様は、上述するノード装置であって、分散ファイルシステム内の前記ストレージノード装置それぞれのステータス情報を特定するステータスＩＤを前記テーブル検索部に渡し、前記ステータスＩＤに対応して前記テーブル検索部が取得したアドレスをあて先として前記ステータスＩＤを送信し、ステータス情報を要求するステータス情報取得部と、前記ステータス情報取得部の要求に応じて取得した前記ストレージノード装置のステータス情報に基づいて、ファイルの移動元及び移動先の前記ストレージノード装置と、移動容量とを決定する容量均等化計算部と、前記容量均等化計算部によって決定されたファイル移動元の前記ストレージノード装置からファイル移動先の前記ストレージノード装置へ前記移動容量に基づいてファイルを移動するファイル移動部と、をさらに備え、前記テーブル検索部は、ファイルＩＤの代わりに前記ステータスＩＤを用いて前記ストレージノード装置のアドレスを取得する、ことを特徴とする。
この発明によれば、分散ファイルシステム内の各ストレージノード装置のステータス情報を取得し、取得したステータス情報に基づいて、各ストレージノード装置における使用ディスク容量が均等に近づくよう、ストレージノード装置間でファイルを移動させる。
これにより、容量負荷分散ノード装置として機能するノード装置は、ユーザによる操作を必要とせず、分散ファイルシステム内の各ストレージノード装置のディスク使用率が均等に近づくように調整することができる。例えば、ファイル未登録のストレージノード装置を分散ファイルシステムに参加させた場合、管理者が明示的にファイル移動を指示しなくとも、ディスク使用率が高いストレージノード装置から新たに参加したストレージノード装置へファイルを移動させることができる。 [6] One aspect of the present invention is the above-described node device, which passes a status ID for specifying status information of each of the storage node devices in the distributed file system to the table search unit, and corresponds to the status ID. The status ID is transmitted to the address acquired by the table search unit as a destination, the status information acquisition unit requesting status information, and the status information of the storage node device acquired in response to the request of the status information acquisition unit Based on the storage node device of the migration source and destination of the file, a capacity equalization calculation unit for determining the migration capacity, and the file from the storage node device of the file migration source determined by the capacity equalization calculation unit Based on the migration capacity to the destination storage node device And a file moving unit that moves the file, wherein the table search unit obtains the address of the storage node device using the status ID instead of the file ID.
According to the present invention, the status information of each storage node device in the distributed file system is acquired, and based on the acquired status information, the file capacity between the storage node devices is set so that the used disk capacity in each storage node device approaches evenly. Move.
As a result, the node device functioning as a capacity load distribution node device does not require any user operation, and can be adjusted so that the disk usage rates of the storage node devices in the distributed file system approach evenly. For example, if a storage node device that has not been registered with a file is added to the distributed file system, the storage node device with a high disk usage rate can be changed to a newly joined storage node device without the administrator explicitly instructing to move the file. You can move files.

［７］本発明の一態様は、上述するノード装置であって、前記ノード装置のステータス情報を特定するステータスＩＤを前記テーブル検索部に渡し、前記ステータスＩＤに対応して前記テーブル検索部が取得したアドレスをあて先として前記ステータスＩＤを送信し、ステータス情報を取得するステータス情報取得部と、前記ステータス情報取得部が取得した前記ステータス情報を出力する情報提示部と、をさらに備え、前記テーブル検索部は、ファイルＩＤの代わりに前記ステータスＩＤを用いて前記ストレージノード装置のアドレスを取得する、ことを特徴とする。
この発明によれば、検索用ノードテーブルを用いて、各ストレージノード装置のステータス情報を記憶しているストレージノード装置を検索し、この検索の結果得られたストレージノード装置から読み出したステータス情報を提示する。
これにより、ノード装置は管理ノード装置として機能し、利用者に指示されたストレージノード装置のステータス情報を提示することができる。 [7] One aspect of the present invention is the node device described above, wherein a status ID that specifies status information of the node device is passed to the table search unit, and the table search unit acquires the status ID corresponding to the status ID. The table search unit further includes: a status information acquisition unit that transmits the status ID with the address as a destination and acquires status information; and an information presentation unit that outputs the status information acquired by the status information acquisition unit Uses the status ID instead of the file ID to obtain the address of the storage node device.
According to the present invention, the search node table is used to search for the storage node device storing the status information of each storage node device, and the status information read from the storage node device obtained as a result of this search is presented. To do.
Thereby, the node device functions as a management node device, and can present status information of the storage node device instructed by the user.

［８］本発明の一態様は、上述するノード装置であって、前記管理用ノードテーブルにノードＩＤの追加または削除が行なわれたことを検出した場合に、ノード装置の参加または離脱を出力する情報提示部をさらに備える、ことを特徴とする。
この発明によれば、ノード装置は、管理用ノードテーブルにノードＩＤが登録された場合はノード装置の参加を、ノードＩＤが削除された場合はノード装置の離脱を出力する。
これにより、ノード装置は管理ノード装置として機能し、分散ファイルシステムにノード装置が参加または離脱したことを利用者に提示することができる。 [8] One aspect of the present invention is the node device described above, and outputs join or leave of a node device when it is detected that a node ID has been added to or deleted from the management node table. An information presentation unit is further provided.
According to this invention, the node device outputs the participation of the node device when the node ID is registered in the management node table, and outputs the detachment of the node device when the node ID is deleted.
Accordingly, the node device functions as a management node device, and can present to the user that the node device has joined or left the distributed file system.

［９］本発明の一態様は、ファイルを分散して管理する分散ファイルシステムにネットワークを介して参加するノード装置として用いられるコンピュータを、ファイルを分散して記憶するストレージノード装置または付加機能を提供する機能ノード装置であり、前記分散ファイルシステムを構成するノード装置のノードＩＤとアドレスとを対応付けする管理用ノードテーブルと、前記ストレージノード装置のノードＩＤとアドレスとを対応付けする検索用ノードテーブルとを記憶する記憶部と、前記分散ファイルシステムへの各ノード装置の参加または離脱の通知情報を受信し、前記管理用ノードテーブルに記憶されている前記ノードＩＤに基づいて選択した前記アドレスをあて先として前記通知情報を送信する通知情報処理部と、ストレージノード装置が参加したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記ストレージノード装置のノードＩＤとアドレスとを前記管理用ノードテーブルと前記検索用ノードテーブルとに書き込み、機能ノード装置が参加したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記機能ノード装置のノードＩＤとアドレスとを前記管理用ノードテーブルのみに書き込み、ストレージノード装置が離脱したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記ストレージノード装置のノードＩＤと対応するアドレスとを前記管理用ノードテーブルと前記検索用ノードテーブルとから削除し、機能ノード装置が離脱したことを表す通知情報を前記通知情報処理部が受信した場合には、前記通知情報に基づいて前記機能ノード装置のノードＩＤと対応するアドレスとを前記管理用ノードテーブルから削除するテーブル処理部と、指定されたファイルＩＤと、前記検索用ノードテーブルに記憶されている前記ノードＩＤとに基づいて、前記ファイルＩＤを管理する前記ストレージノード装置のアドレスを取得するテーブル検索部、として機能させることを特徴とするコンピュータプログラムである。 [9] One aspect of the present invention provides a storage node apparatus or an additional function for distributing and storing a computer used as a node apparatus that participates in a distributed file system that distributes and manages files via a network. A management node table that associates node IDs and addresses of node devices that constitute the distributed file system, and a search node table that associates node IDs and addresses of the storage node devices And a storage unit that stores the notification information of participation or withdrawal of each node device from the distributed file system, and the address selected based on the node ID stored in the management node table A notification information processing unit for transmitting the notification information as When the notification information processing unit receives notification information indicating that the node device has joined, the management node table and the search node are used to determine the node ID and address of the storage node device based on the notification information. And when the notification information processing unit receives notification information indicating that the functional node device has participated, the node ID and address of the functional node device are assigned to the management node based on the notification information. When the notification information processing unit receives notification information that is written only in the table and indicates that the storage node device is detached, the node ID of the storage node device and the corresponding address are managed based on the notification information. Deleted from the node table for search and the node table for search and A table processing unit that deletes the node ID of the functional node device and the corresponding address from the management node table based on the notification information when the notification information processing unit receives And a table search unit that acquires an address of the storage node device that manages the file ID based on the file ID and the node ID stored in the search node table. It is a computer program.

本発明によれば、ネットワークを介して接続される複数のノード装置によって構成される分散ファイルシステムにおいて、各ノード装置がファイル検索用の検索用ノードテーブルと、ノード装置管理用の管理用ノードテーブルとを保持し、ファイルを保管するノード装置については、検索用ノードテーブル及び管理用のノードテーブルにノード情報を登録し、ファイルを保管せず個別の機能を有するノード装置については、管理用ノードテーブルにのみノード情報を登録することにより、分散ファイルシステムの稼働中であっても、ファイル管理に影響を及ぼすことなく、個別の機能を有するノード装置の追加や削除が可能となる。また、検索用ノードテーブルを同期させる必要がないため、ネットワークやノード装置へ高い負荷をかけることがない。 According to the present invention, in a distributed file system composed of a plurality of node devices connected via a network, each node device has a search node table for file search, and a management node table for node device management. For node devices that store files and store files, register node information in the search node table and management node table, and for node devices that do not store files and have individual functions, in the management node table By registering only node information, node devices having individual functions can be added or deleted without affecting file management even when the distributed file system is in operation. In addition, since it is not necessary to synchronize the search node table, a high load is not applied to the network and the node device.

本発明の一実施形態による分散ファイルシステムの構成図である。1 is a configuration diagram of a distributed file system according to an embodiment of the present invention. FIG. 同実施形態によるノード装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the node apparatus by the embodiment. 同実施形態によるノードテーブル管理部の詳細な構成を示すブロック図である。It is a block diagram which shows the detailed structure of the node table management part by the embodiment. 同実施形態による管理用ノードテーブルおよび検索用ノードテーブルの例を説明する図である。It is a figure explaining the example of the management node table and search node table by the embodiment. 同実施形態による要求パケット及び応答パケットの構造を示す図である。It is a figure which shows the structure of the request packet and response packet by the embodiment. 同実施形態による要求パケット及び応答パケットの固有データの内容を示す図である。It is a figure which shows the content of the specific data of the request packet and response packet by the embodiment. 同実施形態によるＮＯＴＩＦＹ受信処理の流れ図である。5 is a flowchart of NOTIFY reception processing according to the embodiment. 同実施形態によるストレージノード装置の固有機能処理部の構成を示すブロック図である。3 is a block diagram illustrating a configuration of a unique function processing unit of the storage node device according to the embodiment. FIG. 同実施形態によるキーテーブルの例を示すブロック図である。It is a block diagram which shows the example of the key table by the embodiment. 同実施形態による低レベルインタフェース部において提供されるインタフェース関数の一覧である。It is a list of interface functions provided in the low-level interface unit according to the embodiment. 同実施形態によるファイル情報登録部の動作の流れ図である。It is a flowchart of operation | movement of the file information registration part by the embodiment. 同実施形態によるステータス情報登録部の動作の流れ図である。It is a flowchart of operation | movement of the status information registration part by the embodiment. 同実施形態によるアクセスノード装置の固有機能処理部の構成を示すブロック図である。It is a block diagram which shows the structure of the specific function process part of the access node apparatus by the embodiment. 同実施形態による高レベルインタフェース部において提供されるインタフェース関数の一覧である。It is a list of interface functions provided in the high-level interface unit according to the embodiment. 同実施形態による容量負荷分散ノード装置の固有機能処理部の構成を示すブロック図である。It is a block diagram which shows the structure of the intrinsic | native function process part of the capacity load distribution node apparatus by the embodiment. 同実施形態によるステータス情報取得部の動作の流れ図である。It is a flowchart of operation | movement of the status information acquisition part by the embodiment. 同実施形態による容量均等化計算部の動作の流れ図である。It is a flowchart of operation | movement of the capacity equalization calculation part by the embodiment. 同実施形態による容量均等化計算部の動作の流れ図である。It is a flowchart of operation | movement of the capacity equalization calculation part by the embodiment. 同実施形態による管理ノード装置の固有機能処理部の構成を示すブロック図である。It is a block diagram which shows the structure of the intrinsic | native function process part of the management node apparatus by the embodiment. Ｐ２Ｐ方式におけるコンテンツの分散管理を説明する図である。It is a figure explaining the distribution management of the content in a P2P system. OneHopにおける情報の通知方法を説明する図である。It is a figure explaining the notification method of the information in OneHop.

以下、図面を参照しながら本発明の実施形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

[１．全体説明]
図１は、本発明の一実施の形態による分散ファイルシステムの構成図である。同図において、分散ファイルシステムは、ストレージノード装置１００、アクセスノード装置２００、容量負荷分散ノード装置３００、管理ノード装置４００を、通信ネットワーク１を介して接続することによって構成される。通信ネットワーク１は、ルータなどの通信装置やケーブルなどの通信線等の各種ネットワーク装置から構成されるＩＰ（Internet Protocol）通信網である。同図に示す分散ファイルシステムにおいては、複数のストレージノード装置１００として、ストレージノード装置１００ａ、１００ｂ、１００ｃが示されている。なお、アクセスノード装置２００、容量負荷分散ノード装置３００、管理ノード装置４００が複数存在してもよい。 [1. Overall description]
FIG. 1 is a configuration diagram of a distributed file system according to an embodiment of the present invention. In the figure, the distributed file system is configured by connecting a storage node device 100, an access node device 200, a capacity load distribution node device 300, and a management node device 400 via a communication network 1. The communication network 1 is an IP (Internet Protocol) communication network composed of various network devices such as communication devices such as routers and communication lines such as cables. In the distributed file system shown in the figure, storage node devices 100a, 100b, and 100c are shown as a plurality of storage node devices 100. There may be a plurality of access node devices 200, capacity load distribution node devices 300, and management node devices 400.

ストレージノード装置１００は、分散ファイルシステムにおいて管理対象となるファイルを記憶する。アクセスノード装置２００、容量負荷分散ノード装置３００、管理ノード装置４００はそれぞれ、ファイルストレージ以外の個別の機能を提供する。以下、ストレージノード装置１００、アクセスノード装置２００、容量負荷分散ノード装置３００、及び、管理ノード装置４００を総称して「ノード装置」と記載し、アクセスノード装置２００、容量負荷分散ノード装置３００、及び、管理ノード装置４００を総称して「機能ノード装置」と記載する。これらの機能ノード装置が提供する機能を付加機能という。付加機能は、分散ファイルシステムへ提供される、ファイルを記憶する機能以外の付加的な機能である。 The storage node device 100 stores files to be managed in the distributed file system. Each of the access node device 200, the capacity load distribution node device 300, and the management node device 400 provides individual functions other than file storage. Hereinafter, the storage node device 100, the access node device 200, the capacity load distribution node device 300, and the management node device 400 are collectively referred to as “node devices”, and the access node device 200, the capacity load distribution node device 300, and The management node device 400 is collectively referred to as “function node device”. The functions provided by these function node devices are called additional functions. The additional function is an additional function other than the function of storing a file provided to the distributed file system.

本実施形態のノード装置はノードを実行する装置であり、ノードとはコンピュータ内のプロセスに相当する。１台のコンピュータ装置が１つのノード装置のみを備える場合もあり、１台のコンピュータ装置が２以上のノード装置を備えることにより、１台のコンピュータ装置において機能の異なる複数のノードが実行される場合もある。 The node device of the present embodiment is a device that executes a node, and the node corresponds to a process in a computer. In some cases, one computer device includes only one node device, and when one computer device includes two or more node devices, a plurality of nodes having different functions are executed in one computer device. There is also.

利用者は、アクセスノード装置２００に接続されたクライアント装置２を介して、分散ファイルシステム内のファイルを操作することができる。１台のコンピュータ装置がクライアント装置２のみを備える場合もあり、１台のコンピュータ装置において、アクセスノード装置２００などのノード装置とクライアント装置２を備える場合もある。 A user can operate a file in the distributed file system via the client device 2 connected to the access node device 200. One computer apparatus may include only the client apparatus 2, and one computer apparatus may include a node apparatus such as the access node apparatus 200 and the client apparatus 2.

本実施形態による分散ファイルシステムは、通信ネットワーク１を介して接続されるノード装置によりオーバーレイネットワークを実現する。オーバーレイネットワークに参加する各ノード装置は、ノードテーブルとして、ファイル検索に用いる検索用ノードテーブルと、オーバーレイネットワークに参加している全てのノード装置の管理に用いる管理用ノードテーブルとを記憶する。そして、ストレージノード装置１００については、検索用ノードテーブル及び管理用ノードテーブルの両方に登録し、機能ノード装置については管理用ノードテーブルのみに登録する。ノード装置は、検索用ノードテーブルのみを用いて、ＤＨＴ（Distributed Hash Table）を使ったＰ２Ｐと同様に、ファイルＩＤを管理するノード装置の検索を行なう。これにより、機能ノード装置の参加や離脱が生じた場合にも、ファイルＩＤの検索が滞るなどの影響を及ぼさないようにすることが可能となる。 The distributed file system according to the present embodiment implements an overlay network with node devices connected via the communication network 1. Each node device participating in the overlay network stores, as node tables, a search node table used for file search and a management node table used for managing all node devices participating in the overlay network. The storage node device 100 is registered in both the search node table and the management node table, and the function node device is registered only in the management node table. The node device uses only the search node table and searches for the node device that manages the file ID as in P2P using DHT (Distributed Hash Table). As a result, even when the function node device joins or leaves, it is possible to prevent the file ID search from being delayed.

[２．ノード装置の構成]
図２は、ノード装置の構成を示すブロック図であり、本発明と関係する機能ブロックのみ抽出して示している。同図に示すように、ストレージノード装置１００、アクセスノード装置２００、容量負荷分散ノード装置３００、及び、管理ノード装置４００の各ノード装置は、ノードテーブル管理部１０、固有機能処理部２０、ネットワークインタフェース部３０、記憶部４０を備えて構成される。全てのノード装置は、固有機能処理部２０を除き、共通な構成である。 [2. Node device configuration]
FIG. 2 is a block diagram showing the configuration of the node device, and shows only the functional blocks related to the present invention. As shown in the figure, each node device of a storage node device 100, an access node device 200, a capacity load distribution node device 300, and a management node device 400 includes a node table management unit 10, a unique function processing unit 20, a network interface. The unit 30 and the storage unit 40 are provided. All the node devices have a common configuration except for the unique function processing unit 20.

記憶部４０は、ハードディスク装置や半導体メモリなどで実現され、管理用ノードテーブル４１と検索用ノードテーブル４２を記憶する。管理用ノードテーブル４１は、オーバーレイネットワークに参加している全ノード装置のノードＩＤ及びノードアドレスの対応付けを示す。検索用ノードテーブル４２は、ストレージノード装置１００のノードＩＤ及びノードアドレスの対応付けを示す。 The storage unit 40 is realized by a hard disk device, a semiconductor memory, or the like, and stores a management node table 41 and a search node table 42. The management node table 41 shows correspondence between node IDs and node addresses of all node devices participating in the overlay network. The search node table 42 indicates a correspondence between the node ID and the node address of the storage node device 100.

ネットワークインタフェース部３０は、通信ネットワーク１を介したデータの送受信インタフェースを提供する。ネットワークインタフェース部３０は、通信ネットワーク１から受信したデータを各部へ出力したり、各部から指示に従って通信ネットワーク１を介してデータを送信したりする。ノードテーブル管理部１０は、他のノード装置との間で、ノード装置の参加及び離脱の通知メッセージを交換し、記憶部４０内の管理用ノードテーブル４１、検索用ノードテーブル４２を更新する。固有機能処理部２０は、記憶部４０内の管理用ノードテーブル４１、検索用ノードテーブル４２を利用して、ストレージノード装置１００、アクセスノード装置２００、容量負荷分散ノード装置３００、管理ノード装置４００などのノード装置の種類に応じた機能を実行する。 The network interface unit 30 provides a data transmission / reception interface via the communication network 1. The network interface unit 30 outputs data received from the communication network 1 to each unit, and transmits data via the communication network 1 according to instructions from each unit. The node table management unit 10 exchanges node device participation and withdrawal notification messages with other node devices, and updates the management node table 41 and the search node table 42 in the storage unit 40. The unique function processing unit 20 uses the management node table 41 and the search node table 42 in the storage unit 40 to store the storage node device 100, the access node device 200, the capacity load distribution node device 300, the management node device 400, and the like. The function corresponding to the type of the node device is executed.

[２．１ノードテーブル管理部１０の内部構成]
図３は、図２に示すノード装置のノードテーブル管理部１０の内部構成を示すブロック図であり、本発明と関係する機能ブロックのみ抽出して示している。同図に示すように、ノードテーブル管理部１０は、通知処理部１２（通知情報処理部）、安定化（Stabilization）処理部１３、テーブル処理部１４を備えて構成される。
通知処理部１２は、ネットワークインタフェース部３０を介して他のノードの通知処理部１２との間で、ノード装置の参加および離脱を表す通知情報として通知メッセージを交換する。安定化処理部１３は、記憶部４０内の管理用ノードテーブル４１及び検索用ノードテーブル４２を正しく維持するために、ネットワークインタフェース部３０を介して他のノード装置との間で存在確認を行う。テーブル処理部１４は、通知処理部１２または安定化処理部１３からノード装置の参加、離脱の通知を受け、記憶部４０内の管理用ノードテーブル４１および検索用ノードテーブル４２を書き換える。 [2.1 Internal configuration of node table management unit 10]
FIG. 3 is a block diagram showing the internal configuration of the node table management unit 10 of the node device shown in FIG. 2, and shows only functional blocks related to the present invention. As shown in the figure, the node table management unit 10 includes a notification processing unit 12 (notification information processing unit), a stabilization processing unit 13, and a table processing unit 14.
The notification processing unit 12 exchanges notification messages with the notification processing unit 12 of other nodes via the network interface unit 30 as notification information indicating participation and withdrawal of the node device. In order to correctly maintain the management node table 41 and the search node table 42 in the storage unit 40, the stabilization processing unit 13 performs existence confirmation with other node devices via the network interface unit 30. The table processing unit 14 receives a notification of joining or leaving the node device from the notification processing unit 12 or the stabilization processing unit 13 and rewrites the management node table 41 and the search node table 42 in the storage unit 40.

[２．２テーブル構成]
図４は、記憶部４０に記憶される管理用ノードテーブル４１及び検索用ノードテーブル４２の設定例を示す図である。同図に示すように、管理用ノードテーブル４１には、オーバーレイネットワークに参加している全てのノード装置のノードＩＤを示すキー値と、そのノードＩＤにより特定されるノード装置のノードアドレスを示すデータとが対応付けて登録されている。 [2.2 Table structure]
FIG. 4 is a diagram illustrating a setting example of the management node table 41 and the search node table 42 stored in the storage unit 40. As shown in the figure, the management node table 41 includes a key value indicating the node IDs of all the node devices participating in the overlay network, and data indicating the node addresses of the node devices specified by the node IDs. Are registered in association with each other.

一方、検索用ノードテーブル４２には、オーバーレイネットワークに参加しているストレージノード装置１００のノードＩＤを示すキー値と、そのノードＩＤにより特定されるストレージノード装置１００のノードアドレスを示すデータとが対応付けて登録されている。
管理用ノードテーブル４１、検索用ノードテーブル４２においては、ノードＩＤが小さい順に並べられている。 On the other hand, in the search node table 42, a key value indicating the node ID of the storage node device 100 participating in the overlay network corresponds to data indicating the node address of the storage node device 100 specified by the node ID. It is registered with it.
In the management node table 41 and the search node table 42, the node IDs are arranged in ascending order.

[２．３ノード装置間の通信パケット]
図５は、通知処理部１２および安定化処理部１３が他のノード装置と交換する通信パケット（通知情報）の構造を示す図である。同図に示すように、通信パケットは、コマンド種別または応答種別と、送信先アドレス、送信元アドレス、及び、固有データとから構成される。本実施形態の通信パケットには、要求パケットと、要求パケットに対する応答である応答パケットとがある。要求パケットのコマンドの種類には「ＪＯＩＮ」、「ＰＩＮＧ」、「ＮＯＴＩＦＹ」があり、応答パケットの応答種別には、「ＯＫ」、「ＮＧ」がある。 [2.3 Communication packets between node devices]
FIG. 5 is a diagram illustrating a structure of a communication packet (notification information) exchanged between the notification processing unit 12 and the stabilization processing unit 13 with another node device. As shown in the figure, the communication packet includes a command type or a response type, a transmission destination address, a transmission source address, and unique data. The communication packet of this embodiment includes a request packet and a response packet that is a response to the request packet. Request packet command types include “JOIN”, “PING”, and “NOTIFY”, and response packet response types include “OK” and “NG”.

図６は、図５に示す通信パケットの固有データの内容を示す図である。同図に示すように、固有データの内容は、コマンドの種別により異なる。
ＪＯＩＮコマンド及びＰＩＮＧコマンドの場合、ノードＩＤ、機能種別、及び、ノードアドレスが固有データに設定され、ＮＯＴＩＦＹコマンドの場合、ホストＩＤ、機能種別、ノードアドレス、動作種別、及び、通知モードが固有データに設定される。また、ＪＯＩＮコマンドの応答の場合、管理用ノードテーブルが固有データに設定される。ＰＩＮＧコマンド、ＮＯＴＩＦＹコマンドの応答には、固有データは設定されない。
機能種別は、例えば、「ストレージノード」、「アクセスノード」、「容量負荷分散ノード」、「管理ノード」などを示し、動作種別は、「参加（ＪＯＩＮ）」、または、「離脱（ＬＥＡＶＥ）」のいずれかを示す。
以下に、各コマンドについて説明する。 FIG. 6 is a diagram showing the contents of the unique data of the communication packet shown in FIG. As shown in the figure, the content of the unique data varies depending on the type of command.
In the case of the JOIN command and the PING command, the node ID, the function type, and the node address are set to the unique data. In the case of the NOTIFY command, the host ID, the function type, the node address, the operation type, and the notification mode are set to the unique data. Is set. In the case of a response to the JOIN command, the management node table is set as unique data. No unique data is set in the response to the PING command and NOTIFY command.
The function type indicates, for example, “storage node”, “access node”, “capacity load distribution node”, “management node”, etc., and the operation type is “join (JOIN)” or “leave (LEAVE)”. Indicates one of the following.
Each command will be described below.

[２．３．１ＪＯＩＮコマンド]
ＪＯＩＮコマンドは、ノード装置がオーバーレイネットワークに参加する場合に用いる。ＪＯＩＮコマンドの要求パケット（以下、「要求パケット（ＪＯＩＮ）」と記載する。）には、オーバーレイネットワークに参加したノード装置のノードＩＤ、機能種別、ノードアドレスが固有データとして格納される。また、ノードＩＤは、あらかじめ決められたハッシュ関数により生成される。例えば、ＳＨＡ−１のハッシュ関数を使ってノードアドレス（ノード装置のＩＰドレスとポート番号の組み合わせ）のハッシュ値を算出し、その算出したハッシュ値をノードＩＤとする。
ＪＯＩＮコマンドが成功すると、要求パケット（ＪＯＩＮ）を受信したノード装置において記憶されている管理用ノードテーブル４１が、ＪＯＩＮコマンドに対する応答種別「ＯＫ」の応答パケット（以下、「応答パケット（ＯＫ）」と記載する。）の固有データに格納され、要求パケット（ＪＯＩＮ）の送信元ノード装置に返送される。 [2.3.1 JOIN command]
The JOIN command is used when the node device participates in the overlay network. In the JOIN command request packet (hereinafter referred to as “request packet (JOIN)”), the node ID, function type, and node address of the node device participating in the overlay network are stored as unique data. Further, the node ID is generated by a predetermined hash function. For example, the hash value of the node address (combination of the IP address and port number of the node device) is calculated using the hash function of SHA-1, and the calculated hash value is used as the node ID.
When the JOIN command is successful, the management node table 41 stored in the node device that has received the request packet (JOIN) has a response type “OK” response packet (hereinafter referred to as “response packet (OK)”) to the JOIN command. The data is stored in the unique data and sent back to the source node device of the request packet (JOIN).

[２．３．２ＰＩＮＧコマンド]
ＰＩＮＧコマンドは、ノード装置の生存確認を行うために用いる。ＰＩＮＧコマンドの要求パケット（以下、「要求パケット（ＰＩＮＧ）」と記載する。）には、ＰＩＮＧコマンド送信元のノード装置のノードＩＤ、機能種別、ノードアドレスが固有データとして格納される。ノード装置が、相手先のノード装置へ要求パケット（ＰＩＮＧ）を送信し、その応答パケット（ＯＫ）を受信することによって、生存確認対象のノード装置の存在を確認する。また、要求パケット（ＰＩＮＧ）を受信したノード装置は、要求パケット（ＰＩＮＧ）の送信元のノード装置が自身の保持するノードテーブルに存在しない場合、ノードテーブルに送信元のノード装置のノードＩＤとノードアドレスを追加する。 [2.3.2 PING command]
The PING command is used to check the existence of the node device. In the PING command request packet (hereinafter referred to as “request packet (PING)”), the node ID, function type, and node address of the node device that is the source of the PING command are stored as unique data. The node device transmits the request packet (PING) to the destination node device and receives the response packet (OK), thereby confirming the existence of the node device subject to the existence confirmation. In addition, when the node device that has transmitted the request packet (PING) does not exist in the node table held by the node device that has received the request packet (PING), the node ID and the node of the source node device are stored in the node table. Add an address.

[２．３．２ＮＯＴＩＦＹコマンド]
ＮＯＴＩＦＹコマンドは、ノード装置の参加、離脱をオーバーレイネットワーク上の全ノード装置に通知するために用いる。ＮＯＴＩＦＹコマンドの要求パケット（以下、「要求パケット（ＮＯＴＩＦＹ）」と記載する。）には、参加または離脱したノード装置のノードＩＤ、機能種別及びノードアドレスと、動作種別、通知モードが格納される。通知モードは、OneHopにおける通知の段階を示すものであり、図２１において説明した各ステップに相当する。すなわち、通知モード「１」は、通報元からスライス・リーダー（Slice Leader）への通知（図２１のステップＳ１）を、通知モード「２」は、スライス・リーダーから他のスライス・リーダーへの通知（図２１のステップＳ２）を、通知モード「３」は、スライス・リーダーからユニット・リーダー（Unit Leader）への通知（図２１のステップＳ３）を表す。通知モード「４」は、ユニット・リーダーからその配下のノードへの通知を表す。 [2.3.2 NOTIFY command]
The NOTIFY command is used to notify all node devices on the overlay network that the node device has joined or left. The NOTIFY command request packet (hereinafter referred to as “request packet (NOTIFY)”) stores the node ID, function type and node address, operation type, and notification mode of the node device that has joined or withdrawn. The notification mode indicates a notification stage in OneHop and corresponds to each step described in FIG. That is, the notification mode “1” is a notification from the reporting source to the slice leader (Slice Leader) (step S1 in FIG. 21), and the notification mode “2” is a notification from the slice leader to another slice leader. (Step S2 in FIG. 21), the notification mode “3” represents notification from the slice leader to the unit leader (Step S3 in FIG. 21). The notification mode “4” represents notification from the unit leader to its subordinate nodes.

[２．４ノードテーブル管理部１０の動作]
続いて、ノードテーブル管理部１０の各部の動作について説明する。 [2.4 Operations of Node Table Management Unit 10]
Next, the operation of each unit of the node table management unit 10 will be described.

[２．４．１通知処理部１２の動作]
通知処理部１２は、ＪＯＩＮコマンドおよびＮＯＴＩＦＹコマンドの送信および受信処理を行う。 [2.4.1 Operation of notification processing unit 12]
The notification processing unit 12 performs transmission and reception processing of the JOIN command and NOTIFY command.

［２．４．１．１ＪＯＩＮコマンド送信処理］
ノード装置が新規にオーバーレイネットワークに参加する場合、該ノード装置の通知処理部１２は、すでにオーバーレイネットワークに参加している他のノード装置に要求パケット（ＪＯＩＮ）を送信する。参加通知先の他のノード装置のノードアドレスを取得する方法は任意であり、例えば、利用者が入力してもよく、通信ネットワーク１上に検索パケットを送信し、すでにオーバーレイネットワークに参加しているノード装置から検索パケットに対応した応答を受信することにより取得してもよい。通知処理部１２は、送信した要求パケット（ＪＯＩＮ）の応答パケット（ＯＫ）を受信すると、受信した応答パケット（ＯＫ）の固有データに格納された管理用ノードテーブルをテーブル処理部１４に出力し、ノードテーブルの更新を指示する。 [2.4.1.1 JOIN command transmission processing]
When a node device newly joins an overlay network, the notification processing unit 12 of the node device transmits a request packet (JOIN) to another node device that has already joined the overlay network. The method for acquiring the node address of the other node device to which the participation is notified is arbitrary. For example, the user may input it, and the search packet is transmitted on the communication network 1 and is already participating in the overlay network. You may acquire by receiving the response corresponding to a search packet from a node apparatus. Upon receiving the response packet (OK) of the transmitted request packet (JOIN), the notification processing unit 12 outputs the management node table stored in the unique data of the received response packet (OK) to the table processing unit 14. Instructs the node table to be updated.

［２．４．１．２ＪＯＩＮコマンド受信処理］
通知処理部１２は、オーバーレイネットワークに新規に参加した他のノード装置から要求パケット（ＪＯＩＮ）を受信すると、受信した要求パケット（ＪＯＩＮ）の固有データをノード情報としてテーブル処理部１４に出力し、ノードテーブルの更新を指示する。さらに、通知処理部１２は、記憶部４０から管理用ノードテーブル４１を読み出すと、ＪＯＩＮコマンドの応答パケット（ＯＫ）の固有データにこの読み出した管理用ノードテーブル４１を格納し、要求パケット（ＪＯＩＮ）の送信元ノード装置に返送する。さらに、通知処理部１２は、動作種別に「参加（ＪＯＩＮ）」を、通知モードに「１（通報元からスライス・リーダーへの通知）」をセットした要求パケット（ＮＯＴＩＦＹ）を生成し、スライス・リーダーとなるノード装置へ送信する。
上記処理によって送信された要求パケット（ＮＯＴＩＦＹ）を受信したノード装置の通知処理部１２は、受信したパケットの通知モードに応じて、さらにＮＯＴＩＦＹパケットを送信する。 [2.4.1.2 JOIN command reception processing]
When the notification processing unit 12 receives a request packet (JOIN) from another node device newly participating in the overlay network, the notification processing unit 12 outputs the unique data of the received request packet (JOIN) to the table processing unit 14 as node information, Instruct to update the table. Further, when the notification processing unit 12 reads the management node table 41 from the storage unit 40, the notification processing unit 12 stores the read management node table 41 in the unique data of the response packet (OK) of the JOIN command, and requests packet (JOIN) To the transmission source node device. Further, the notification processing unit 12 generates a request packet (NOTIFY) in which “participation (JOIN)” is set as the operation type and “1 (notification from the report source to the slice leader)” is set as the notification mode. Transmit to the leader node device.
The notification processing unit 12 of the node device that has received the request packet (NOTIFY) transmitted by the above process further transmits a NOTIFY packet according to the notification mode of the received packet.

［２．４．１．３ＮＯＴＩＦＹコマンド受信処理］
図７は、通知処理部１２におけるＮＯＴＩＦＹコマンドの受信処理の流れ図を示す。
通知処理部１２は、他のノード装置から要求パケット（ＮＯＴＩＦＹ）を受信すると、受信した要求パケット（ＮＯＴＩＦＹ）の固有データをノード情報としてテーブル処理部１４に出力し、ノードテーブルの更新を指示する（ステップＳ１２−１）。このときのテーブル処理部１４の動作の詳細は後述する。 [2.4.1.3 NOTIFY command reception processing]
FIG. 7 is a flowchart of the NOTIFY command reception process in the notification processing unit 12.
When receiving the request packet (NOTIFY) from another node device, the notification processing unit 12 outputs the unique data of the received request packet (NOTIFY) to the table processing unit 14 as node information, and instructs to update the node table ( Step S12-1). Details of the operation of the table processing unit 14 at this time will be described later.

通知処理部１２は、受信した要求パケット（ＮＯＴＩＦＹ）のモードが「１（通報元からスライス・リーダーへの通知）」であると判断した場合（ステップＳ１２−２：ＹＥＳ）、当該要求パケット（ＮＯＴＩＦＹ）の固有データを読み出して通知モードを「２（スライス・リーダーから他のスライス・リーダーへの通知）」に書き換え、この書き換えた固有データを設定した要求パケット（ＮＯＴＩＦＹ）を全スライス・リーダーに送信する（ステップＳ１２−３）。 When the notification processing unit 12 determines that the mode of the received request packet (NOTIFY) is “1 (notification from the report source to the slice leader)” (step S12-2: YES), the request packet (NOTIFY) ) And the notification mode is rewritten to “2 (notification from slice reader to other slice reader)”, and a request packet (NOTIFY) in which the rewritten unique data is set is transmitted to all slice readers. (Step S12-3).

通知処理部１２は、受信した要求パケット（ＮＯＴＩＦＹ）のモードが「２」であると判断した場合（ステップＳ１２−２：ＮＯ，Ｓ１２−４：ＹＥＳ）、当該要求パケット（ＮＯＴＩＦＹ）から固有データを読み出して通知モードを「３（スライス・リーダーからユニット・リーダーへの通知）」に書き換え、この書き換えた固有データを設定した要求パケット（ＮＯＴＩＦＹ）を自ノード装置が所属するユニット（領域）のユニット・リーダーに送信する（ステップＳ１２−５）。 When the notification processing unit 12 determines that the mode of the received request packet (NOTIFY) is “2” (step S12-2: NO, S12-4: YES), the notification processing unit 12 obtains unique data from the request packet (NOTIFY). Read and rewrite the notification mode to “3 (notification from the slice leader to the unit leader)”, and change the request packet (NOTIFY) in which the rewritten unique data is set to the unit (area) to which the node device belongs. Transmit to the reader (step S12-5).

通知処理部１２は、受信した要求パケット（ＮＯＴＩＦＹ）のモードが「３」であると判断した場合（ステップＳ１２−２：ＮＯ，Ｓ１２−４：ＮＯ、ステップＳ１２−６：ＹＥＳ）、当該要求パケット（ＮＯＴＩＦＹ）から固有データを読み出して通知モードを「４」に書き換え、この書き換えた固有データを設定した要求パケット（ＮＯＴＩＦＹ）を自ノード装置が所属するユニット（領域）内の全ノード装置に送信する（ステップＳ１２−７）。 When the notification processing unit 12 determines that the mode of the received request packet (NOTIFY) is “3” (step S12-2: NO, S12-4: NO, step S12-6: YES), the request packet The unique data is read from (NOTIFY), the notification mode is rewritten to “4”, and the request packet (NOTIFY) in which the rewritten unique data is set is transmitted to all the node devices in the unit (area) to which the own node device belongs. (Step S12-7).

［２．４．１．４スライス・リーダー、ユニット・リーダーの決定方法］
通知処理部１２は、管理用ノードテーブル４１に登録されているノード装置によって構成されるオーバーレイネットワークをｋ個の領域に分割し、それぞれの分割領域内のノード装置の１台をスライス・リーダーとして選択する。通知処理部１２は、このｋ個に分割された領域をさらにｓ個に分割し、その領域内のノード装置の１台をユニット・リーダーとする。各領域内でのスライス・リーダー、ユニット・リーダーの選択方法は任意であるが、例えば、各領域内においてノードＩＤが最小のノード装置をスライス・リーダー、ユニット・リーダーとして選択する方法などがある。例えば、ｋ＝２５６（２^８）、ｓ＝１６（２^４）で分割する場合、ノードＩＤの上位８ビットをスライス番号、上位９ビット目から１２ビット目までの４ビットをユニット番号として領域を分割することができるので、それぞれの領域における最小値のノードＩＤを持つノードをスライス・リーダー、ユニット・リーダーと決めるなどの方法がある。 [2.4.1.4 Determination method of slice leader and unit leader]
The notification processing unit 12 divides the overlay network configured by the node devices registered in the management node table 41 into k regions, and selects one of the node devices in each divided region as a slice leader. To do. The notification processing unit 12 further divides the k divided area into s pieces, and sets one of the node devices in the area as a unit leader. The selection method of the slice leader and unit leader in each area is arbitrary. For example, there is a method of selecting a node device having the smallest node ID in each area as a slice leader or unit leader. For example, when dividing with k = 256 (2 ⁸ ) and s = 16 (2 ⁴ ), the upper 8 bits of the node ID are the slice number, and the 4 bits from the upper 9 bits to the 12th bit are the unit numbers. Since it can be divided, there is a method of determining a node having the minimum node ID in each area as a slice leader or a unit leader.

[２．４．２安定化処理部１３の動作]
安定化処理部１３は、ノード装置の生存確認を行うことで、オーバーレイネットワークの構造を維持する。 [2.4.2 Operation of stabilization processing unit 13]
The stabilization processing unit 13 maintains the overlay network structure by confirming the existence of the node device.

[２．４．２．１ＰＩＮＧコマンドの送信処理］
安定化処理部１３は、記憶部４０に記憶されている管理用ノードテーブル４１を参照し、管理用ノードテーブル４１に登録されているノード装置に対し、順にＰＩＮＧコマンドの要求パケット（ＰＩＮＧ）を定期的に送信する。 [2.4.2.1 PING command transmission processing]
The stabilization processing unit 13 refers to the management node table 41 stored in the storage unit 40 and periodically sends a PING command request packet (PING) to the node devices registered in the management node table 41 in order. To send.

具体的には、所定の時間毎に生存確認処理が起動されると、安定化処理部１３は、管理用ノードテーブル４１から登録されているノード装置のノードアドレスを順に読み出し、この読み出したノードアドレスを送信先アドレスとして要求パケット（ＰＩＮＧ）を送信する。送信した要求パケット（ＰＩＮＧ）に対する応答を受信した場合、安定化処理部１３は、次の生存確認対象のノード装置へ要求パケット（ＰＩＮＧ）を送信する。 Specifically, when the survival confirmation process is started every predetermined time, the stabilization processing unit 13 sequentially reads out the node addresses of the node devices registered from the management node table 41, and the read node address A request packet (PING) is transmitted using as a transmission destination address. When the response to the transmitted request packet (PING) is received, the stabilization processing unit 13 transmits the request packet (PING) to the next node device to be checked for survival.

一方、送信した要求パケット（ＰＩＮＧ）に対する応答がなかった場合、安定化処理部１３は、要求パケット（ＰＩＮＧ）の送信先のノード装置が離脱したものとして、動作種別に「離脱（ＬＥＡＶＥ）」を設定し、通知モードに「１（通知元からスライス・リーダー）」を設定した要求パケット（ＮＯＴＩＦＹ）をスライス・リーダーへ送信する。これによって、オーバーレイネットワークを構成する全ノード装置にノード装置の離脱を通知する。その後、次の生存確認対象のノード装置へ要求パケット（ＰＩＮＧ）を送信する。 On the other hand, when there is no response to the transmitted request packet (PING), the stabilization processing unit 13 sets “Leave” as the operation type on the assumption that the node device that is the transmission destination of the request packet (PING) has left. Then, a request packet (NOTIFY) in which “1 (notification source to slice reader)” is set as the notification mode is transmitted to the slice reader. As a result, the node devices are notified of the detachment of all the node devices constituting the overlay network. Thereafter, a request packet (PING) is transmitted to the next node device to be checked for survival.

[２．４．２．２ＰＩＮＧコマンドの受信処理］
安定化処理部１３は、他のノード装置から要求パケット（ＰＩＮＧ）を受信した場合、受信した要求パケット（ＰＩＮＧ）内の固有データに格納されている情報をノード情報としてテーブル処理部１４に出力し、ノードテーブルの更新を指示する。その後、安定化処理部１３は、要求パケット（ＰＩＮＧ）の送信元へ応答パケット（ＯＫ）を返送する。 [2.4.2.2 PING command reception processing]
When the stabilization processing unit 13 receives a request packet (PING) from another node device, the stabilization processing unit 13 outputs the information stored in the specific data in the received request packet (PING) to the table processing unit 14 as node information. The node table is instructed to be updated. Thereafter, the stabilization processing unit 13 returns a response packet (OK) to the transmission source of the request packet (PING).

[２．４．３テーブル処理部１４の動作]
テーブル処理部１４は、記憶部４０に記憶された管理用ノードテーブル４１と検索用ノードテーブル４２の変更を行う。
本実施形態では、管理用ノードテーブルにはオーバーレイネットワークに参加している全ノード装置を登録し、検索用ノードテーブルにはストレージノード装置１００のみを登録することを特徴としている。これを実現するため、テーブル処理部１４は、通知処理部１２あるいは安定化処理部１３から動作種別が「参加（ＪＯＩＮ）」のノード情報を受信した場合、このノード情報に含まれている機能種別が「ストレージノード」である場合には、ノード情報に含まれているノードＩＤ及びノードアドレスの組みを管理用ノードテーブル４１および検索用ノードテーブル４２に書き込み、機能種別が「アクセスノード」、「容量負荷分散ノード」、「管理ノード」などの機能ノードである場合には、ノード情報に含まれているノードＩＤ及びノードアドレスの組みを管理用ノードテーブル４１のみに書き込む。このとき、テーブル処理部１４は、テーブル内でノードＩＤの値が小さい順となるように書き込みを行なう。 [2.4.3 Operation of table processing unit 14]
The table processing unit 14 changes the management node table 41 and the search node table 42 stored in the storage unit 40.
This embodiment is characterized in that all node devices participating in the overlay network are registered in the management node table, and only the storage node device 100 is registered in the search node table. In order to achieve this, when the table processing unit 14 receives node information whose operation type is “join (JOIN)” from the notification processing unit 12 or the stabilization processing unit 13, the function type included in the node information Is a “storage node”, a combination of the node ID and the node address included in the node information is written in the management node table 41 and the search node table 42, and the function type is “access node”, “capacity” In the case of a functional node such as “load distribution node” or “management node”, the combination of the node ID and the node address included in the node information is written only in the management node table 41. At this time, the table processing unit 14 performs writing so that the node ID values are in ascending order in the table.

また、テーブル処理部１４は、通知処理部１２あるいは安定化処理部１３から動作種別が「離脱（ＬＥＡＶＥ）」のノード情報を受信した場合、このノード情報に設定されているノードＩＤ及びノードアドレスの組を、管理用ノードテーブル４１および検索用ノードテーブル４２から削除する。ただし、ノード情報に含まれている機能種別が機能ノードを示している場合、検索用ノードテーブル４２に削除対象のノードＩＤ及びノードアドレスの組みは登録されていないため、実際には削除は行なわれない。
また、テーブル処理部１４は、通知処理部１２から管理用ノードテーブルを受信した場合、この受信した管理用ノードテーブルを記憶部４０に書き込む。 Further, when the table processing unit 14 receives node information whose operation type is “LEAVE” from the notification processing unit 12 or the stabilization processing unit 13, the table processing unit 14 sets the node ID and node address set in the node information. The set is deleted from the management node table 41 and the search node table 42. However, when the function type included in the node information indicates a function node, since the combination of the node ID and the node address to be deleted is not registered in the search node table 42, the deletion is actually performed. Absent.
When the table processing unit 14 receives the management node table from the notification processing unit 12, the table processing unit 14 writes the received management node table in the storage unit 40.

[２．５本実施形態のノード装置の効果]
以上の動作により、オーバーレイネットワークに頻繁に機能ノード装置の参加、離脱が生じたとしても、検索用ノードテーブル４２には変化がないため、ファイルＩＤの検索動作には影響が生じない。また、検索用ノードテーブル４２は、管理用ノードテーブルを更新するための情報に基づいて生成可能であるため、ノード間の通信量は従来のOneHopと変わらない。 [2.5 Effects of the node device of this embodiment]
With the above operation, even if the function node device frequently participates or leaves the overlay network, the search node table 42 does not change, and thus the file ID search operation is not affected. Further, since the search node table 42 can be generated based on information for updating the management node table, the communication amount between the nodes is the same as that of the conventional OneHop.

なお、本実施形態では、管理用ノードテーブル４１のノードの並びはノードＩＤの小さい順で並べることで説明を行ったがこれに限定されるものではない。例えば、非特許文献３に記載されているように、ノードの物理的な位置の情報が含まれるローカルノードＩＤを使うことによって、物理的に離れたノード間のパケット量を減少させることが可能である。また、非特許文献３に記載されているように、ストレージノード装置の性能により、仮想ノードＩＤを生成して検索用ノードテーブル４２に登録することで、処理負荷をストレージノード装置の性能に比例させることが可能である。 In the present embodiment, the description has been given by arranging the nodes in the management node table 41 in ascending order of the node IDs, but the present invention is not limited to this. For example, as described in Non-Patent Document 3, it is possible to reduce the amount of packets between physically separated nodes by using a local node ID including information on the physical position of the node. is there. Further, as described in Non-Patent Document 3, by generating a virtual node ID and registering it in the search node table 42 based on the performance of the storage node device, the processing load is proportional to the performance of the storage node device. It is possible.

[３．各ノード装置の固有機能]
次に、ストレージノード装置１００、アクセスノード装置２００、容量負荷分散ノード装置３００、管理ノード装置４００の固有機能の動作について説明する。各ノード装置の固有機能の動作は、図２に示す固有機能処理部２０の動作によって決定される。 [3. Unique function of each node device]
Next, operations of the unique functions of the storage node device 100, the access node device 200, the capacity load distribution node device 300, and the management node device 400 will be described. The operation of the unique function of each node device is determined by the operation of the unique function processing unit 20 shown in FIG.

[３．１ストレージノード装置１００の固有機能]
[３．１．１固有機能処理部の構成]
図８は、ストレージノード装置１００の固有機能処理部１２０の構成を示す図であり、図２に示す固有機能処理部２０に相当する。同図に示すストレージノード装置１００の固有機能処理部１２０は、低レベルインタフェース部１０１、キーテーブル操作部１０２、ファイル操作部１０３、ファイル情報登録部１０４、ステータス情報登録部１０５、テーブル検索部１０６、キーテーブル蓄積部１０７、及び、ファイル蓄積部１０８を備えて構成される。キーテーブル蓄積部１０７及びファイル蓄積部１０８は、ハードディスク装置や半導体メモリなどで実現される。 [3.1 Unique Function of Storage Node Device 100]
[3.1.1 Configuration of Unique Function Processing Unit]
FIG. 8 is a diagram showing a configuration of the unique function processing unit 120 of the storage node device 100, and corresponds to the unique function processing unit 20 shown in FIG. The unique function processing unit 120 of the storage node device 100 shown in the figure includes a low-level interface unit 101, a key table operation unit 102, a file operation unit 103, a file information registration unit 104, a status information registration unit 105, a table search unit 106, A key table storage unit 107 and a file storage unit 108 are provided. The key table storage unit 107 and the file storage unit 108 are realized by a hard disk device, a semiconductor memory, or the like.

キーテーブル蓄積部１０７は、key（キー）とvalue（値）との対応付けを示すキーテーブルを記憶する。キーテーブルには、ＤＨＴ（分散ハッシュテーブル、Distributed Hash Table）によるＰ２Ｐ（Peer-to-Peer）が構成するKey-Value型データベースのうち、自ストレージノード装置１００が管理する範囲のKey-Value値を記憶する。本実施形態の分散ファイルシステムでは、Ｐ２ＰによるKey-Value型データベースをファイルＩＤの管理に用いるため、キーテーブルには、keyとしてファイルを特定するファイルＩＤが設定され、valueとしてファイルを保存しているストレージノード装置１００のノードアドレスが設定される。また、キーテーブルには、各ノード装置の各種ステータス情報もまた設定される。この場合、keyとしてストレージノード装置１００及びステータス情報の種類を特定するステータスＩＤが設定され、valueとしてテータス情報が設定される。ステータス情報は、例えば、ファイル蓄積装置１０８の最大蓄積容量であるディスク容量、ファイル蓄積装置１０８の使用ディスク容量、ストレージノード装置１００のＣＰＵ負荷、ファイル蓄積部１０８に記憶されているファイル数など、ストレージノード装置１００に関する様々な情報である。
ファイル蓄積部１０８は、分散ファイルシステムにおいて操作対象となるファイルを記憶する。 The key table storage unit 107 stores a key table indicating the association between keys and values. In the key table, key-value values in a range managed by the storage node device 100 in a key-value type database configured by P2P (Peer-to-Peer) based on DHT (Distributed Hash Table) are stored. Remember. In the distributed file system of this embodiment, since a key-value type database based on P2P is used for file ID management, a file ID for specifying a file as a key is set in the key table, and the file is stored as a value. The node address of the storage node device 100 is set. Various status information of each node device is also set in the key table. In this case, the storage node device 100 and a status ID that identifies the type of status information are set as a key, and status information is set as a value. The status information includes storage capacity such as a disk capacity that is the maximum storage capacity of the file storage device 108, a used disk capacity of the file storage device 108, a CPU load of the storage node device 100, and the number of files stored in the file storage unit 108. Various pieces of information related to the node device 100.
The file storage unit 108 stores a file to be operated in the distributed file system.

低レベルインタフェース部１０１は、ネットワークインタフェース部３０を介して、自ストレージノード装置１００のキーテーブル蓄積部１０７に記憶されているキーテーブル、および、自ストレージノード装置１００のファイル蓄積部１０８に記録されているファイルを操作するためのインタフェースを提供する。キーテーブル操作部１０２は、低レベルインタフェース部１０１が提供するキーテーブル操作用のインタフェースに対応した処理を実行し、キーテーブル蓄積部１０７に記憶されているキーテーブルに対するキーの登録や検索を行う。ファイル操作部１０３は、低レベルインタフェース部１０１が提供するファイル操作用のインタフェースに対応した処理を実行し、ファイル蓄積部１０８内に記録されたファイルの読み出し、書き込み、削除や、ファイルのリストの取得を行なう。 The low level interface unit 101 is recorded in the key table stored in the key table storage unit 107 of the own storage node device 100 and the file storage unit 108 of the own storage node device 100 via the network interface unit 30. Provides an interface for manipulating existing files. The key table operation unit 102 executes processing corresponding to the key table operation interface provided by the low-level interface unit 101, and performs registration and search of keys with respect to the key table stored in the key table storage unit 107. The file operation unit 103 executes processing corresponding to the file operation interface provided by the low-level interface unit 101, and reads, writes, and deletes files recorded in the file storage unit 108, and obtains a list of files. To do.

ファイル情報登録部１０４は、ファイル蓄積部１０８内に記憶されているファイルのファイルＩＤと、自ストレージノード装置１００のノードアドレスを、このファイルＩＤを管理するストレージノード装置１００が保持しているキーテーブルに登録する。ステータス情報登録部１０５は、自ストレージノード装置１００のステータス情報を定期的に、このステータス情報を管理するストレージノード装置１００が保持しているキーテーブルに登録する。テーブル検索部１０６は、記憶部４０に保存されている検索用ノードテーブル４２を参照して、指定されたＩＤに時計回りに近いノードＩＤを特定し、その特定したノードＩＤに対応付けられているノードアドレスを読み出して出力する。 The file information registration unit 104 stores the file ID of the file stored in the file storage unit 108 and the node address of the own storage node device 100, and a key table held by the storage node device 100 that manages this file ID. Register with. The status information registration unit 105 periodically registers the status information of the own storage node device 100 in a key table held by the storage node device 100 that manages the status information. The table search unit 106 refers to the search node table 42 stored in the storage unit 40, specifies a node ID close to the specified ID in the clockwise direction, and associates it with the specified node ID. Read and output the node address.

図９は、キーテーブル蓄積部１０７に記憶されるキーテーブルを示す図である。同図に示すように、キーテーブルは、キー（key）と、値（value）とを対応付けたレコードからなる。ここで、キーは、ファイルを一意に特定するファイルＩＤ、または、ステータス情報名とノード装置の組み合わせを特定するステータスＩＤである。
ファイルＩＤは、例えば、所定のハッシュ関数を用いてファイル名から生成したハッシュ値を用いることができる。キー（key）がファイルＩＤの場合、当該ファイルＩＤにより特定されるファイルを記憶しているストレージノード装置１００のノードアドレスが対応する値（value）として設定される。
一方、ステータスＩＤは、例えば、所定のハッシュ関数を用いて、ステータス情報名とノードアドレスとを結合したデータから生成したハッシュ値を用いることができる。ステータス情報名は、例えば、ディスク容量、使用ディスク容量、ＣＰＵ負荷などのステータス情報の種類を表す名前である。キー（key）がステータスＩＤの場合、ステータスＩＤにより特定されるノード装置及びステータス情報の種類に対応したステータス情報が値（value）として設定される。 FIG. 9 is a diagram illustrating a key table stored in the key table storage unit 107. As shown in the figure, the key table is composed of records in which a key and a value are associated with each other. Here, the key is a file ID that uniquely identifies a file, or a status ID that identifies a combination of a status information name and a node device.
As the file ID, for example, a hash value generated from a file name using a predetermined hash function can be used. When the key is a file ID, the node address of the storage node device 100 that stores the file specified by the file ID is set as a corresponding value.
On the other hand, for the status ID, for example, a hash value generated from data obtained by combining the status information name and the node address using a predetermined hash function can be used. The status information name is a name indicating the type of status information such as disk capacity, used disk capacity, CPU load, and the like. When the key is a status ID, status information corresponding to the type of the node device and status information specified by the status ID is set as a value.

図１０は、低レベルインタフェース部１０１が提供するインタフェースを示す。同図に示すように、低レベルインタフェース部１０１は、キーテーブル操作用としてgetおよびput、ファイル操作用としてlow_read、low_write、low_delete、ファイルリスト取得用としてget_listの各インタフェースを提供する。引数のinおよびoutはデータの通信方向を示しており、inはインタフェースの呼び出し側が設定する値、outは呼び出し側へ返答される値を示している。 FIG. 10 shows an interface provided by the low-level interface unit 101. As shown in the figure, the low-level interface unit 101 provides get and put interfaces for key table operations, low_read, low_write, low_delete for file operations, and get_list interfaces for file list acquisition. The arguments “in” and “out” indicate the data communication direction, “in” indicates a value set by the caller of the interface, and “out” indicates a value returned to the caller.

getは、キーテーブル蓄積部１０７に記憶されているキーテーブルから引数keyに対応した値（value）を読み出し、呼び出し元に引数valueとして出力するインタフェースである。
putは、キーテーブル蓄積部１０７に記憶されているキーテーブルに、引数key及び引数valueの組を書き込むインタフェースである。 get is an interface that reads a value (value) corresponding to the argument key from the key table stored in the key table storage unit 107 and outputs it to the caller as an argument value.
put is an interface for writing a set of an argument key and an argument value to the key table stored in the key table storage unit 107.

low_readは、ファイル蓄積部１０８に記憶されているファイルからデータを読み出すインタフェースである。引数fnameは、データ読み出し対象ファイルのファイル名を示す。引数offsetは、データの読み出しを行なう先頭位置を示し、ファイルの先頭からのバイト数が設定される。引数sizeは、読み出すデータのバイトサイズを示す。また、引数dataは、読み出されたデータを示す。
low_writeは、ファイル蓄積部１０８に記憶されているファイルへデータを書き込むインタフェースである。引数fnameは、データ書き込み対象ファイルのファイル名を示す。引数offsetは、データの書き込みを行なう先頭位置を示し、ファイルの先頭からのバイト数が設定される。引数sizeは、書き込むデータのバイトサイズを示し、引数dataは、ファイルに書き込むデータを示す。 low_read is an interface that reads data from a file stored in the file storage unit 108. The argument fname indicates the file name of the data read target file. The argument offset indicates the head position from which data is read, and the number of bytes from the head of the file is set. The argument size indicates the byte size of the data to be read. The argument data indicates the read data.
low_write is an interface for writing data to a file stored in the file storage unit 108. The argument fname indicates the file name of the data write target file. The argument offset indicates the head position where data is written, and the number of bytes from the head of the file is set. The argument size indicates the byte size of data to be written, and the argument data indicates data to be written to the file.

low_deleteは、ファイル蓄積部１０８に記憶されているファイルを削除するインタフェースである。引数fnameは、削除対象ファイルのファイル名を示す。
get_listは、ファイル蓄積部１０８に記憶されているファイルのリストを提供するインタフェースである。引数flistは、ファイル蓄積部１０８に記憶されているファイルのファイル名とサイズとを関連付けたリストを示す。 low_delete is an interface for deleting a file stored in the file storage unit 108. The argument fname indicates the file name of the file to be deleted.
get_list is an interface that provides a list of files stored in the file storage unit 108. The argument flist indicates a list in which file names and sizes of files stored in the file storage unit 108 are associated with each other.

[３．１．２キーテーブル操作部１０２の動作]
キーテーブル操作部１０２は、低レベルインタフェース部１０１のgetおよびputインタフェースに対する処理を行う。キーテーブル操作部１０２は、低レベルインタフェース部１０１のputインタフェースが呼び出された場合、引数key及び引数valueとして指定された（key, value）値の組をキーテーブル蓄積部１０７に記憶されているキーテーブルに登録する。一方、低レベルインタフェース部１０１のgetインタフェースが呼び出された場合、キーテーブル操作部１０２は、引数keyに対する値（value）をキーテーブル蓄積部１０７に記憶されているキーテーブルから検索して読み出し、getインタフェースの呼び出し元のノード装置に返送する。 [3.1.2 Operation of Key Table Operation Unit 102]
The key table operation unit 102 performs processing for the get and put interfaces of the low level interface unit 101. When the put interface of the low-level interface unit 101 is called, the key table operation unit 102 stores a key pair stored in the key table storage unit 107 as a key (value) specified as an argument key and an argument value. Register in the table. On the other hand, when the get interface of the low-level interface unit 101 is called, the key table operation unit 102 retrieves the value (value) for the argument key from the key table stored in the key table storage unit 107 and reads it, and get Return to the node device that called the interface.

[３．１．３ファイル操作部１０３の動作]
ファイル操作部１０３は、低レベルインタフェース部１０１のlow_read、low_write、low_delete、get_listの各インタフェースに対する処理を行う。ファイル操作部１０３は、低レベルインタフェース部１０１のlow_writeインタフェースが呼び出された場合、ファイル蓄積部１０８に記憶されている引数fnameのファイル名のファイルに、ファイルの先頭から引数offsetで示されるバイト数だけ進んだ位置を開始位置として、引数sizeで示されるバイト数のデータを書き込む。指定されたファイルが存在しない場合は、新たにファイルを作成し、データを書き込む。 [3.1.3 Operation of file operation unit 103]
The file operation unit 103 performs processing for the low_read, low_write, low_delete, and get_list interfaces of the low-level interface unit 101. When the low_write interface of the low-level interface unit 101 is called, the file operation unit 103 adds the number of bytes indicated by the argument offset from the beginning of the file to the file with the file name of the argument fname stored in the file storage unit 108. Write the data of the number of bytes indicated by the argument size, starting from the advanced position. If the specified file does not exist, create a new file and write the data.

一方、低レベルインタフェース部１０１のlow_readインタフェースが呼び出された場合、ファイル操作部１０３は、ファイル蓄積部１０８に記憶されている引数fnameがファイル名のファイルの先頭から、引数offsetで示されるバイト数だけすすんだ位置を開始位置として、引数sizeで示されるバイトのデータを読み出して引数dataに設定し、low_readインタフェースの呼び出し元のノード装置に返送する。 On the other hand, when the low_read interface of the low-level interface unit 101 is called, the file operation unit 103 sets the number of bytes indicated by the argument offset from the beginning of the file whose argument fname is stored in the file storage unit 108. Using the proceeding position as the start position, the byte data indicated by the argument size is read and set as the argument data, and returned to the node device that called the low_read interface.

また、低レベルインタフェース部１０１のlow_deleteインタフェースが呼び出された場合、ファイル操作部１０３は、引数fnameがファイル名であるファイルをファイル蓄積部１０８から削除する。
また、低レベルインタフェース部１０１のget_listインタフェースが呼び出された場合、ファイル操作部１０３は、ファイル蓄積部１０８に記憶されている全ファイルのファイル名とサイズを読み出し、読み出したファイル名とサイズのリストを呼び出し元のノード装置に返送する。 When the low_delete interface of the low-level interface unit 101 is called, the file operation unit 103 deletes the file whose argument fname is the file name from the file storage unit 108.
When the get_list interface of the low-level interface unit 101 is called, the file operation unit 103 reads the file names and sizes of all the files stored in the file storage unit 108, and displays a list of the read file names and sizes. Return to the calling node device.

[３．１．４ファイル情報登録部１０４の動作]
ファイル情報登録部１０４は、ファイル蓄積部１０８内に記憶されているファイルのファイルＩＤをキー（key）とし、自ストレージノード装置１００のノードアドレスを値（value）とした（key, value）値の組を、Ｐ２Ｐが構成するKey-Value型データベースへ登録する。 [3.1.4 Operation of file information registration unit 104]
The file information registration unit 104 uses the file ID of the file stored in the file storage unit 108 as a key, and sets the node address of the storage node device 100 as a value (value). The set is registered in the key-value type database configured by P2P.

図１１は、ファイル情報登録部の１０４の動作の流れ図を示す。同図に示す流れ図は、ファイル蓄積部１０８に蓄積された１つのファイルに対する登録動作であり、ファイル情報登録部１０４は、ファイル蓄積部１０８に蓄積されたファイル数だけ繰り返しこの動作を行う。なお、ファイル情報登録部１０４は、定期的に、あるいは、ファイル蓄積部１０８に新たなファイルが書き込まれたときに起動される。 FIG. 11 shows a flowchart of the operation of the file information registration unit 104. The flowchart shown in the figure is a registration operation for one file stored in the file storage unit 108, and the file information registration unit 104 repeatedly performs this operation for the number of files stored in the file storage unit 108. The file information registration unit 104 is activated periodically or when a new file is written in the file storage unit 108.

まず、ファイル情報登録部１０４は、ファイル蓄積部１０８に記憶されているファイルのファイル名からファイルＩＤを生成する（ステップＳ１０４−１）。ファイルＩＤの生成方法には、例えばファイル名からＳＨＡ−１ハッシュ関数により生成する方法がある。次に、生成したファイルＩＤから、そのファイルＩＤを登録すべきノード装置を検索する（ステップＳ１０４−２）。検索は、テーブル検索部１０６が行う。テーブル検索部１０６は、記憶部４０に保存されている検索用ノードテーブル４２から、指定されたファイルＩＤに時計回りに近いノードＩＤを検索し、そのノードアドレスを返す。次に、ファイル情報登録部１０４は、テーブル検索部１０６から検索結果として出力されたノードアドレスにより特定される登録先のノード装置に対してputインタフェースの実行を要求する（ステップＳ１０４−３）。putインタフェースの引数keyには生成したファイルＩＤが設定され、引数valueには自ストレージノード装置１００のノードアドレスが設定される。これにより、（ファイルＩＤ，ノードアドレス）の組を、生成したファイルＩＤの管理ノードであるストレージノード装置１００に登録する。 First, the file information registration unit 104 generates a file ID from the file name of the file stored in the file storage unit 108 (step S104-1). As a file ID generation method, for example, there is a method of generating a file ID from a file name using a SHA-1 hash function. Next, the node device to which the file ID is to be registered is searched from the generated file ID (step S104-2). The table search unit 106 performs the search. The table search unit 106 searches the search node table 42 stored in the storage unit 40 for a node ID close to the designated file ID in the clockwise direction, and returns the node address. Next, the file information registration unit 104 requests the registration destination node device specified by the node address output as the search result from the table search unit 106 to execute the put interface (step S104-3). The generated file ID is set in the argument key of the put interface, and the node address of the own storage node device 100 is set in the argument value. As a result, a set of (file ID, node address) is registered in the storage node device 100 which is the management node of the generated file ID.

[３．１．５ステータス情報登録部１０５の動作]
ステータス情報登録部１０５は、ストレージノード装置１００のステータス情報を定期的にＰ２ＰによるKey-Value型データベースに登録する。ステータス情報登録部１０５が、あらかじめ決められたキーを使ってこれらの情報を登録しておくことで、他のノード装置からストレージノード装置１００の様々なステータス情報を取得できるようにする。 [3.1.5 Operation of Status Information Registration Unit 105]
The status information registration unit 105 periodically registers the status information of the storage node device 100 in a key-value type database based on P2P. The status information registration unit 105 registers these pieces of information using a predetermined key so that various status information of the storage node device 100 can be acquired from other node devices.

図１２は、ステータス情報登録部１０５の動作の流れ図を示す。
ステータス情報登録部１０５が起動される際に、ステータス情報の種類を示すステータス情報名が、ステータス情報と共に入力される。ディスク容量、ディスクの使用容量、ＣＰＵ負荷などのステータス情報は、例えば、ノード装置を実装するストレージノード装置１００が有するリソース管理機能部によって取得することができる。ステータス情報登録部１０５は、はじめにキーを生成する(Ｓ１０５−１)。ステータスＩＤは、ステータス情報の種別を表す予め決められたステータス情報名と、自ストレージノード装置１００のノードアドレスとの組み合わせから、ＳＨＡ−１等のハッシュ関数により生成したハッシュ値とする。例えば、ディスク容量であれば、「DISKCAPA_ノードアドレス」、ＣＰＵの負荷であれば「CPU_ノードアドレス」などをハッシュ関数の入力とし、ハッシュ値を算出する。 FIG. 12 shows a flowchart of the operation of the status information registration unit 105.
When the status information registration unit 105 is activated, a status information name indicating the type of status information is input together with the status information. Status information such as disk capacity, disk usage capacity, and CPU load can be acquired by, for example, a resource management function unit included in the storage node device 100 in which the node device is mounted. The status information registration unit 105 first generates a key (S105-1). The status ID is a hash value generated by a hash function such as SHA-1 from a combination of a predetermined status information name indicating the type of status information and the node address of the own storage node device 100. For example, “DISKCAPA_node address” for the disk capacity and “CPU_node address” for the CPU load are input as the hash function, and the hash value is calculated.

ステータス情報登録部１０５は、ハッシュ関数により作られたステータスＩＤを使ってステータス情報をＰ２ＰのKey-Value型データベースに登録するため、テーブル検索部１０６により当該ステータスＩＤを管理するノード装置を検索する（ステップＳ１０５−２）。すなわち、テーブル検索部１０６は、記憶部４０に保存されている検索用ノードテーブル４２から、指定されたステータスＩＤに時計回りに近いノードＩＤを検索し、そのノードアドレスを返す。ステータス情報登録部１０５は、テーブル検索部１０６から検索結果として出力されたノードアドレスの登録先ストレージノード装置１００に対してputインタフェースの実行を要求し、ステータス情報を登録する（ステップＳ１０５−３）。putインタフェースの引数keyには生成したステータスＩＤが設定され、引数valueにはステータス情報が設定される。 Since the status information registration unit 105 registers the status information in the P2P key-value database using the status ID created by the hash function, the table search unit 106 searches for a node device that manages the status ID ( Step S105-2). That is, the table search unit 106 searches for a node ID close to the designated status ID in the clockwise direction from the search node table 42 stored in the storage unit 40, and returns the node address. The status information registration unit 105 requests the registration destination storage node device 100 of the node address output as the search result from the table search unit 106 to execute the put interface, and registers the status information (step S105-3). The generated status ID is set in the argument key of the put interface, and status information is set in the argument value.

［３．１．６ストレージノード装置１００の効果］
以上説明したように、固有機能処理部１２０を有するストレージノード装置１００により、ＤＨＴを使ったＰ２Ｐによる分散ファイルシステムを構築することができる。 [3.1.6 Effects of Storage Node Device 100]
As described above, a P2P distributed file system using DHT can be constructed by the storage node device 100 having the unique function processing unit 120.

[３．２アクセスノード装置２００の固有機能]
[３．２．１固有機能処理部の構成]
図１３は、アクセスノード装置２００の固有機能処理部２２０の構成を示す図であり、図２に示す固有機能処理部２０に相当する。
アクセスノード装置２００の固有機能処理部２２０は、高レベルインタフェース部２０１、ファイル処理部２０２、テーブル検索部２０３を備えて構成される。 [3.2 Unique Function of Access Node Device 200]
[3.2.1 Configuration of Unique Function Processing Unit]
FIG. 13 is a diagram illustrating a configuration of the unique function processing unit 220 of the access node device 200, and corresponds to the unique function processing unit 20 illustrated in FIG.
The unique function processing unit 220 of the access node apparatus 200 includes a high-level interface unit 201, a file processing unit 202, and a table search unit 203.

高レベルインタフェース部２０１は、分散ファイルシステム内のストレージノード装置１００に保管されているファイルを操作するためのインタフェースを提供する。クライアント装置２は、通信ネットワーク１を介してこのインタフェースを使用することで、分散ファイルシステム内のファイルを操作することができる。ファイル処理部２０２は、高レベルインタフェース部２０１が受信したインタフェースに従って、操作対象ファイルの操作を行なう。テーブル検索部２０３は、ストレージノード装置１００のテーブル検索部１０６と同様の機能を有し、記憶部４０に保存されている検索用ノードテーブル４２を参照して、指定されたファイルＩＤに時計回りに近いノードＩＤを特定し、その特定したノードＩＤに対応付けられているノードアドレスを読み出して出力する。 The high-level interface unit 201 provides an interface for manipulating files stored in the storage node device 100 in the distributed file system. The client apparatus 2 can operate files in the distributed file system by using this interface via the communication network 1. The file processing unit 202 operates the operation target file according to the interface received by the high level interface unit 201. The table search unit 203 has the same function as the table search unit 106 of the storage node device 100, and refers to the search node table 42 stored in the storage unit 40, and rotates the specified file ID clockwise. A near node ID is specified, and a node address associated with the specified node ID is read and output.

図１４は、高レベルインタフェース部２０１が提供するインタフェースを示す。同図に示すように、高レベルインタフェース部２０１は、hi_read、及び、hi_writeインタフェースを提供する。
hi_readは、指定されたファイル名のファイルからデータを読み出すインタフェースである。引数fnameは、データ読み出し対象ファイルのファイル名を示す。引数offsetは、データを読み出す先頭位置を示し、ファイルの先頭からのバイト数が設定される。引数sizeは、読み出すデータのバイトサイズを示す。また、引数dataは、読み出されたデータを示す。
hi_writeは、指定されたファイル名のファイルへデータを書き込むインタフェースである。引数fnameは、データ書き込み対象ファイルのファイル名を示す。引数offsetは、データを書き込む先頭位置を示し、ファイルの先頭からのバイト数が設定される。引数sizeは、書き込むデータのバイトサイズを示し、引数dataは、ファイルに書き込むデータを示す。 FIG. 14 shows an interface provided by the high-level interface unit 201. As shown in the figure, the high-level interface unit 201 provides hi_read and hi_write interfaces.
hi_read is an interface for reading data from a file having a specified file name. The argument fname indicates the file name of the data read target file. The argument offset indicates the head position from which data is read, and the number of bytes from the head of the file is set. The argument size indicates the byte size of the data to be read. The argument data indicates the read data.
hi_write is an interface for writing data to a file having a specified file name. The argument fname indicates the file name of the data write target file. The argument offset indicates the start position for writing data, and the number of bytes from the start of the file is set. The argument size indicates the byte size of data to be written, and the argument data indicates data to be written to the file.

[３．２．２固有機能処理部２２０の動作]
アクセスノード装置２００において、クライアント装置２から高レベルインタフェース部２０１のhi_readインタフェースまたはhi_writeインタフェースの実行が要求されると、ファイル処理部２０２は、高レベルインタフェース部２０１に要求されたインタフェースについての実際の処理を行う。 [3.2.2 Operation of Unique Function Processing Unit 220]
In the access node device 200, when the client device 2 requests execution of the hi_read interface or the hi_write interface of the high level interface unit 201, the file processing unit 202 performs actual processing on the interface requested by the high level interface unit 201. I do.

クライアント装置２によって高レベルインタフェース部２０１のhi_readインタフェースが呼び出された場合、ファイル処理部２０２は、引数fnameとして指定されたファイル名fnameから、ファイルＩＤを生成する。ファイルＩＤの生成には、例えば、ＳＨＡ−１などのあらかじめ決められたハッシュ関数を用いる。次に、ファイル処理部２０２は、テーブル検索部２０３にファイルＩＤを出力し、ノードアドレスを要求する。テーブル検索部２０３は、受信したファイルＩＤをテーブル検索部２０３に出力し、当該ファイルＩＤを管理するストレージノード装置１００のノードアドレスを検索する。テーブル検索部２０３は、受信したファイルＩＤによって記憶部４０に記憶されている検索用ノードテーブル４２を検索してノードアドレスを取得すると、ファイル処理部２０２に出力する。 When the hi_read interface of the high-level interface unit 201 is called by the client device 2, the file processing unit 202 generates a file ID from the file name fname specified as the argument fname. For example, a predetermined hash function such as SHA-1 is used for generating the file ID. Next, the file processing unit 202 outputs a file ID to the table search unit 203 and requests a node address. The table search unit 203 outputs the received file ID to the table search unit 203, and searches for the node address of the storage node device 100 that manages the file ID. When the table search unit 203 searches the search node table 42 stored in the storage unit 40 by the received file ID and acquires the node address, the table search unit 203 outputs the node address to the file processing unit 202.

ファイル処理部２０２は、テーブル検索部２０３が検索の結果取得したノードアドレスを用いて、ファイルＩＤを管理しているストレージノード装置１００の低レベルインタフェース部１０１にgetインタフェースの実行を要求する。getインタフェースの引数keyには、ファイルＩＤが設定される。getインタフェースの実行が要求されたストレージノード装置１００は、キーテーブル蓄積部１０７に記憶されているキーテーブルから引数keyに対応した値を読み出し、引数valueとして返送する。この引数valueは、ファイルＩＤにより特定されるファイルを記憶しているストレージノード装置１００のノードアドレスを示す。 The file processing unit 202 requests the low-level interface unit 101 of the storage node apparatus 100 that manages the file ID to execute the get interface, using the node address acquired by the table search unit 203 as a result of the search. A file ID is set in the argument key of the get interface. The storage node device 100 requested to execute the get interface reads a value corresponding to the argument key from the key table stored in the key table storage unit 107 and returns it as an argument value. The argument value indicates the node address of the storage node device 100 that stores the file specified by the file ID.

ファイル処理部２０２は、getインタフェースの引数valueにより示されるノードアドレスを用いて、操作対象のファイルを記憶しているストレージノード装置１００の低レベルインタフェース部１０１に、low_readインタフェースの実行を要求し、ファイルの読み出し処理を行う。low_readインタフェースの引数fname, offset, sizeには、hi_readインタフェースの引数fname, offset, sizeが設定される。low_readインタフェースの実行を受信したストレージノード装置１００は、実行結果の引数dataを返送する。アクセスノード装置２００のファイル処理部２０２は、ストレージノード装置１００から返送された引数dataをhi_readインタフェースの実行要求元のクライアント装置２へ出力する。 Using the node address indicated by the argument value of the get interface, the file processing unit 202 requests the low-level interface unit 101 of the storage node device 100 storing the operation target file to execute the low_read interface, and Is read out. The arguments fname, offset, and size of the hi_read interface are set in the arguments fname, offset, and size of the low_read interface. The storage node device 100 that has received the execution of the low_read interface returns an argument data of the execution result. The file processing unit 202 of the access node apparatus 200 outputs the argument data returned from the storage node apparatus 100 to the client apparatus 2 that is the execution request source of the hi_read interface.

また、高レベルインタフェース部２０１のhi_writeインタフェースが呼び出された場合も、hi_readインタフェースが呼び出された場合と同様の処理を行ない、ファイルを記憶しているストレージノード装置１００の低レベルインタフェース部１０１に対して、low_writeインタフェースの実行を指示し、ファイルの書き込み処理を行う。low_writeインタフェースの引数fname, offset, size, dataには、hi_writeインタフェースの引数fname, offset, size, dataが設定される。low_writeインタフェースの実行が要求されたストレージノード装置１００は、low_writeインタフェースを実行する。 Also, when the hi_write interface of the high level interface unit 201 is called, the same processing as when the hi_read interface is called is performed, and the low level interface unit 101 of the storage node device 100 storing the file is processed. , Instructs execution of the low_write interface and performs file write processing. The arguments fname, offset, size, and data of the hi_write interface are set in the arguments fname, offset, size, and data of the low_write interface. The storage node device 100 requested to execute the low_write interface executes the low_write interface.

[３．２．３アクセスノード装置２００の効果]
以上説明した固有機能処理部２２０を有するアクセスノード装置２００により、クライアント装置２から、本実施形態の分散ファイルシステムに保存されているファイルの操作が可能となる。アクセスノード装置２００は、高レベルインタフェース部２０１を介して、複数のクライアント装置２に対して、分散ファイルシステムへのアクセスを提供することができる。 [3.2.3 Effects of access node apparatus 200]
The access node device 200 having the unique function processing unit 220 described above allows the client device 2 to operate files stored in the distributed file system of the present embodiment. The access node device 200 can provide access to the distributed file system to the plurality of client devices 2 via the high-level interface unit 201.

本実施形態によるアクセスノード装置２００を用いることにより、クライアント装置２が増加した場合、アクセスノード装置２００を分散ファイルシステムに追加し、分散ファイルシステムへのアクセスの負荷を分散することが可能となる。また、クライアント装置２の数が減少した場合には、アクセスノード装置２００を離脱させ、無駄なアクセスノードを削除することができる。本実施形態では、アクセスノード装置２００の追加、離脱は、分散ファイルシステムのファイル管理を滞らせることがない。従って、アクセスノード装置２００の参加及び離脱は、稼働中の分散ファイルシステムにおいてどのタイミングにおいて行っても問題ない。また、アクセスノード装置２００とクライアント装置２を同一のコンピュータ上で稼動させることも可能であり、その場合には、他のクライアント装置２からのアクセスが無い、高速なファイル操作が可能となる。 By using the access node device 200 according to the present embodiment, when the number of client devices 2 increases, it becomes possible to add the access node device 200 to the distributed file system and distribute the access load to the distributed file system. Further, when the number of client devices 2 decreases, the access node device 200 can be detached and a useless access node can be deleted. In the present embodiment, the addition and removal of the access node device 200 does not delay the file management of the distributed file system. Therefore, there is no problem in joining and leaving the access node device 200 at any timing in the distributed file system in operation. Further, the access node device 200 and the client device 2 can be operated on the same computer. In this case, high-speed file operation without access from other client devices 2 is possible.

なお、本実施形態では、アクセスノード装置２００が提供する高レベルインタフェース部２０１はhi_read、hi_writeインタフェースのような基本的なインタフェースのみについて述べたが、ファイル処理部２０２に、金子他, “構造型Ｐ２Ｐを使った分散ファイルシステムにおける分散ディレクトリ管理手法”, L-033, FIT2009, 2009（参考文献１）にあるように、ディレクトリファイルによる分散管理方法を用いることで、本実施形態の分散ファイルシステムを、ディレクトリ構造をもったファイルシステムとして提供することも可能である。例えば、分散ファイルシステム内で一意なファイル名が付与されたｉノードファイル、ディレクトリファイル、データファイルを分散してストレージノード装置１００のファイル蓄積装置１０８に記憶しておく。ディレクトリファイルは、配下のディレクトリ名とディレクトリファイルのファイル名との対応関係、および、配下のユーザ利用ファイル名とデータファイルのファイル名との対応関係に関する管理情報を含む。ｉノードファイルは、ディレクトリファイルまたはデータファイルのファイル名を含む。データファイルは、ファイル本体である。アクセスノード装置２００は、クライアント装置２からディレクトリ名及びユーザ利用ファイル名によるファイル操作要求を受信する。アクセスノード装置２００は、ルートディレクトリに対応したｉノードファイルから順に、ｉノードファイル、及び、ディレクトリファイルをストレージノード装置１００より読み出して、ディレクトリ名及びユーザ利用ファイル名に対応したデータファイル名を取得すると、このデータファイル名のデータファイル本体を保持するストレージノード装置１００に対して、クライアント装置２から要求されたファイル操作を要求する。なお、ｉノードファイル、ディレクトリファイルの読み出しや、データファイルのファイル操作は、ｉノードファイル、ディレクトリファイル、データファイルのファイル名から生成したファイルＩＤに基づいて、上述した実施形態と同様に行なう。 In the present embodiment, the high-level interface unit 201 provided by the access node apparatus 200 has been described only for basic interfaces such as the hi_read and hi_write interfaces. However, the file processing unit 202 includes Kaneko et al., “Structural P2P”. As described in “Distributed Directory Management Method in Distributed File System Using”, L-033, FIT2009, 2009 (Reference Document 1), the distributed file system of this embodiment can be obtained by using the distributed management method using directory files. It is also possible to provide a file system having a directory structure. For example, i-node files, directory files, and data files with unique file names in the distributed file system are distributed and stored in the file storage device 108 of the storage node device 100. The directory file includes management information regarding the correspondence between the subordinate directory name and the file name of the directory file, and the correspondence between the subordinate user use file name and the data file name. The i-node file includes a file name of a directory file or a data file. The data file is the file body. The access node device 200 receives a file operation request based on the directory name and the user use file name from the client device 2. When the access node device 200 reads the i-node file and the directory file from the storage node device 100 in order from the i-node file corresponding to the root directory, the access node device 200 acquires the data file name corresponding to the directory name and the user-used file name. The file operation requested by the client apparatus 2 is requested to the storage node apparatus 100 that holds the data file body having the data file name. Note that the reading of the i-node file and the directory file and the file operation of the data file are performed in the same manner as in the above-described embodiment based on the file ID generated from the file names of the i-node file, the directory file, and the data file.

[３．３容量負荷分散ノード装置３００の固有機能]
[３．３．１固有機能処理部の構成]
容量負荷分散ノード装置３００は、分散ファイルシステム内のストレージノード装置１００が備えるファイル蓄積部１０８の使用率を均等に近づける機能ノード装置である。
図１５は、容量負荷分散ノード装置３００の固有機能処理部３２０の構成を示す図であり、図２に示す固有機能処理部２０に相当する。容量負荷分散ノード装置３００の固有機能処理部３２０は、ステータス情報取得部３０１、容量均等化計算部３０２、ファイル移動部３０３、テーブル検索部３０４を備えて構成される。 [3.3 Unique Function of Capacity Load Balancing Node Device 300]
[3.3.1 Configuration of unique function processor]
The capacity load distribution node device 300 is a functional node device that makes the usage rate of the file storage unit 108 included in the storage node device 100 in the distributed file system evenly close.
FIG. 15 is a diagram illustrating a configuration of the unique function processing unit 320 of the capacity load balancing node apparatus 300, and corresponds to the unique function processing unit 20 illustrated in FIG. The unique function processing unit 320 of the capacity load distribution node device 300 includes a status information acquisition unit 301, a capacity equalization calculation unit 302, a file movement unit 303, and a table search unit 304.

ステータス情報取得部３０１は、分散ファイルシステム内の各ストレージノード装置１００のステータス情報を取得する。容量均等化計算部３０２は、ステータス情報取得部３０１が取得したステータス情報に基づいて、各ストレージノード装置１００のファイル蓄積部１０８の使用率を均等にするために、ファイル移動を行なう対象のストレージノード装置１００と、移動するファイル容量を決定する。ファイル移動部３０３は、容量均等化計算部３０２が決定したファイルの移動元及び移動先ストレージノード装置１００と、移動するファイル容量に従って、ファイルの移動を指示する。テーブル検索部３０４は、記憶部４０に保存されている検索用ノードテーブル４２から、ストレージノード装置１００のノードアドレスを順に読み出してステータス情報取得部３０１へ出力する。また、テーブル検索部３０４は、ストレージノード装置１００のテーブル検索部１０６と同様の機能を有し、検索用ノードテーブル４２を参照して、ステータス情報取得部３０１またはファイル移動部３０３から指定されたＩＤに時計回りに近いノードＩＤを特定し、その特定したノードＩＤに対応付けられているノードアドレスを読み出して出力する。 The status information acquisition unit 301 acquires status information of each storage node device 100 in the distributed file system. The capacity equalization calculation unit 302 is based on the status information acquired by the status information acquisition unit 301, and in order to equalize the usage rate of the file storage unit 108 of each storage node device 100, the storage node that is the target of file migration The apparatus 100 and the file capacity to be moved are determined. The file migration unit 303 instructs file migration according to the file migration source and destination storage node devices 100 determined by the capacity equalization calculation unit 302 and the file capacity to be migrated. The table search unit 304 sequentially reads the node address of the storage node device 100 from the search node table 42 stored in the storage unit 40 and outputs the node address to the status information acquisition unit 301. The table search unit 304 has a function similar to that of the table search unit 106 of the storage node device 100. The ID specified from the status information acquisition unit 301 or the file migration unit 303 with reference to the search node table 42 is provided. A node ID close to clockwise is identified, and a node address associated with the identified node ID is read and output.

[３．３．２ステータス情報取得部３０１の動作]
図１６は、ステータス情報取得部３０１の動作の流れ図を示す。容量負荷分散ノード装置３００は、所定の時間に定期的に、あるいは、利用者に指示された時間に、使用率の均等化処理を起動する。これにより、図１６の処理が起動される。 [3.3.2 Operation of status information acquisition unit 301]
FIG. 16 shows a flowchart of the operation of the status information acquisition unit 301. The capacity load distribution node device 300 activates the usage rate equalization processing periodically at a predetermined time or at a time instructed by the user. Thereby, the process of FIG. 16 is started.

同図において、ステータス情報取得部３０１は、ｎに初期値として０を設定すると、テーブル検索部３０４により、検索用ノードテーブル４２にｎ番目に登録されているストレージノード装置１００（以下、「ストレージノード装置（ｎ）」と記載する。）のノードアドレスを取得する（ステップＳ３０１−１）。次に、ステータス情報取得部３０１は、ディスク容量及び使用ディスク容量のそれぞれについて、ステータス情報名と、テーブル検索部３０４が取得したノードアドレスとからステータスＩＤを生成する（ステップＳ３０１−２）。ステータスＩＤの生成方法は、ストレージノード装置１００のステータス情報登録部１０５の動作において説明した方法と同一であり、例えば、ディスク容量の場合であれば、「DISKCAPA_ノードアドレス」のハッシュ値をハッシュ関数によって生成する。次に、生成したステータスＩＤをテーブル検索部３０４に出力し、当該ステータスＩＤを管理しているノード装置のノードアドレスを要求する。テーブル検索部３０４は、ステータスＩＤにより記憶部４０に記憶されている検索用ノードテーブル４２を検索してノードアドレスを取得し、ステータス情報取得部３０１は、テーブル検索部３０４から検索結果のノードアドレスを取得する（ステップＳ３０１−３）。 In the figure, when the status information acquisition unit 301 sets n to 0 as an initial value, the storage node device 100 registered in the search node table 42 by the table search unit 304 (hereinafter referred to as “storage node”). The node address of “device (n)” is acquired (step S301-1). Next, the status information acquisition unit 301 generates a status ID for each of the disk capacity and the used disk capacity from the status information name and the node address acquired by the table search unit 304 (step S301-2). The method of generating the status ID is the same as the method described in the operation of the status information registration unit 105 of the storage node device 100. For example, in the case of disk capacity, the hash value of “DISKCAPA_node address” is used as the hash function. Generate by. Next, the generated status ID is output to the table search unit 304, and the node address of the node device managing the status ID is requested. The table search unit 304 searches the search node table 42 stored in the storage unit 40 by the status ID to acquire a node address, and the status information acquisition unit 301 obtains the node address of the search result from the table search unit 304. Obtain (step S301-3).

ステータス情報取得部３０１は、ステップＳ３０１−３において取得したノードアドレスを用い、ストレージノード装置１００の低レベルインタフェース部１０１に対して、getインタフェースの実行を要求し、ストレージノード装置（ｎ）のディスク容量及び使用ディスク容量それぞれのステータス情報を取得する（ステップＳ３０１−４）。getインタフェースの引数keyには、ステップＳ３０１−２において生成したステータスＩＤが設定される。ストレージノード装置（ｎ）が、検索用ノードテーブル４２に登録されている最後のノード装置ではない場合（ステップＳ３０１−５：ＮＯ）、ｎに１を加算した後（ステップＳ３０１−６）、ステップＳ３０１−１からの動作を繰り返す。そして、ステップＳ３０１−５において、ストレージノード装置（ｎ）が検索用ノードテーブル４２の最後のノード装置であった場合（ステップＳ３０１−５：ＹＥＳ）、ステータス情報取得部３０１の動作を終了し、容量均等化計算部３０２の動作の実行を開始する（ステップＳ３０１−７）。 The status information acquisition unit 301 uses the node address acquired in step S301-3 to request the low-level interface unit 101 of the storage node device 100 to execute the get interface, and the disk capacity of the storage node device (n). And status information of each used disk capacity is acquired (step S301-4). The status ID generated in step S301-2 is set in the argument key of the get interface. If the storage node device (n) is not the last node device registered in the search node table 42 (step S301-5: NO), 1 is added to n (step S301-6), and then step S301. Repeat the operation from -1. In step S301-5, if the storage node device (n) is the last node device in the search node table 42 (step S301-5: YES), the operation of the status information acquisition unit 301 is terminated, and the capacity Execution of the operation of the equalization calculation unit 302 is started (step S301-7).

[３．３．３容量均等化計算部３０２の動作]
容量均等化計算部３０２は、ステータス情報取得部３０１が取得したストレージノードの最大蓄積容量と使用中の蓄積容量から、全ストレージノード装置１００の使用率が一定となるように、ストレージノード装置１００間におけるファイルの受け渡し容量を計算する。 [3.3.3 Operations of Capacity Equalization Calculation Unit 302]
The capacity equalization calculation unit 302 determines whether the usage rate of all the storage node devices 100 is constant from the maximum storage capacity of the storage nodes acquired by the status information acquisition unit 301 and the storage capacity in use. Calculate the file transfer capacity at.

[３．３．３．１容量均等化アルゴリズム]
まず、本実施形態による容量均等化の計算方法について説明する。
ノード数Ｎをストレージノード装置１００の数（Ｎは１以上の整数）、ディスク容量Ｃ_ｉをストレージノード装置（ｉ）のディスク容量、使用ディスク容量Ｍ_ｉをストレージノード装置（ｉ）の使用ディスク容量とすると（ｉは０以上（Ｎ−１）以下の整数）、全ストレージノード装置１００におけるストレージの平均使用率ｒ_ａｖｇは下記の式（１）により算出される。 [3.3.3.1 Capacity equalization algorithm]
First, the capacity equalization calculation method according to the present embodiment will be described.
The number of nodes N is the number of storage node devices 100 (N is an integer of 1 or more), the disk capacity C _i is the disk capacity of the storage node device (i), and the used disk capacity M _i is the used disk capacity of the storage node device (i). Then (i is an integer of 0 or more and (N−1) or less), the average storage usage rate r _avg in all the storage node devices 100 is calculated by the following equation (1).

Ｍ^’ _ｉを容量均等化後のストレージノード装置（ｉ）の使用ディスク容量とすると、以下の式（２）が成り立つ。 When M ^′ _i is the used disk capacity of the storage node device (i) after capacity equalization, the following expression (2) is established.

ストレージノード装置（ｉ）の現在のディスクの使用容量Ｍ_ｉから、ストレージノード装置（ｉ）の容量均等化後のディスクの使用容量Ｍ^’ _ｉへの変換行列をＡとすると、以下の式（３）が成り立つ。 Assuming that the conversion matrix from the current disk usage capacity M _i of the storage node device (i) to the disk usage capacity M ^′ _i after capacity equalization of the storage node device (i) is A, the following equation (3) ) Holds.

変換行列Ａの成分ａ_ｉｊ（ｉ、ｊは０以上（Ｎ−１）以下の整数）は、現在のストレージノード装置（ｊ）の使用ディスク容量に対する、該ストレージノード装置（ｊ）からストレージノード装置（ｉ）に移動するディスク容量の割合を示す。したがって、容量均等化とは、下記の式（４）が示す条件の下で変換行列Ａを求めることである。 The component a _ij (i, j is an integer not _smaller than 0 and not larger than (N−1)) of the transformation matrix A is _calculated from the storage node device (j) to the storage node device with respect to the currently used disk capacity of the storage node device (j). (I) shows the ratio of the disk capacity to be moved. Therefore, capacity equalization is to obtain the transformation matrix A under the condition indicated by the following equation (4).

これは、ｉ列の成分は、ストレージノード装置（ｉ）の使用ディスク容量をどのように分配するかの割合を示しており、分配する割合を合計した列成分の和は１となるためである。 This is because the i column component indicates the proportion of how the used disk capacity of the storage node device (i) is distributed, and the sum of the column components obtained by adding up the distribution proportions is 1. .

変換行列Ａは１つに定まらないため、本実施形態では、下記の（手順１）〜（手順３）により変換行列Ａを求める。 Since the transformation matrix A is not fixed to one, in this embodiment, the transformation matrix A is obtained by the following (procedure 1) to (procedure 3).

（手順１）対角成分の決定：
変換行列Ａの対角成分ａ_ｉｉ、および、関連する他の成分を、下記にしたがって決定する。 (Procedure 1) Determination of diagonal component:
The diagonal component a _ii of the transformation matrix A and other related components are determined according to:

（ａ）は、現在の使用ディスク容量が、目標の使用容量以上の場合であり、他のストレージノード装置１００からファイルを受け取る必要がないためである。また、（ｂ）は、現在の使用ディスク容量が、目標の使用容量より少ない場合であり、他のストレージノード装置１００へファイルを移動させる必要がなく、また、列成分の和が１となるためである。 (A) is a case where the current used disk capacity is equal to or greater than the target used capacity, and it is not necessary to receive a file from another storage node device 100. Further, (b) is a case where the current used disk capacity is smaller than the target used capacity, and it is not necessary to move the file to another storage node apparatus 100, and the sum of the column components is 1. It is.

ここで、下記の表１に示す状態の５台のストレージノード装置（０）〜（４）を例にして説明を行う。 Here, description will be given by taking five storage node devices (0) to (4) in the state shown in Table 1 below as an example.

式（５）に示すように、この例での平均使用率ｒ_ａｖｇは以下である。 As shown in Formula (5), the average usage rate r _{avg in} this example is as follows.

このとから、Ｍ’＝ＡＭとして下記の式（６）が求められる。つまり、（手順１）においてノードの番号ｉ＝０、１、３のストレージノード装置（ｉ）については（ａ）が適用され、ノードの番号ｉ＝２、４のストレージノード装置（ｉ）については（ｂ）が適用される。なお、「−」は未設定成分である。 From this, the following equation (6) is obtained as M ′ = AM. That is, in (procedure 1), (a) is applied to the storage node device (i) of the node number i = 0, 1, 3 and the storage node device (i) of the node number i = 2, 4 is applied. (B) applies. "-" Is an unset component.

（手順２）未設定成分の決定：
処理の対象とする未設定成分をａ_ＩＪとする。このとき、ファイル移動後のストレージノード１００（Ｉ）の容量は以下の（７）のように表すことができる。 (Procedure 2) Determination of unset components:
An unset component to be processed is defined as a _IJ . At this time, the capacity of the storage node 100 (I) after moving the file can be expressed as (7) below.

そして、（７）に示すファイル移動後のストレージノード装置（Ｉ）の容量が最大となるように、下記の条件下で成分ａ_ＩＪを算出する。 Then, the component a _IJ is calculated under the following conditions so that the capacity of the storage node device (I) after the file movement shown in (7) is maximized.

条件１は、ストレージノード装置（Ｊ）から移動するディスク容量の割合の合計が１以下となることを示している。また、条件２は、ファイル移動後のストレージノード装置（Ｉ）の使用ディスク容量が、容量均等化後の使用ディスク容量Ｍ^’ _Ｉ以下となることを示している。
さらに、上記による成分ａ_ＩＪの決定により、以下の条件を満たした場合に他の成分も決定する。 Condition 1 indicates that the total ratio of disk capacities moved from the storage node device (J) is 1 or less. The condition 2 is the amount of used disk space storage node device after file migration (I) have shown to be a less used disk space M ^_'I after the capacity equalization.
Furthermore, by determining the component a _IJ as described above, other components are also determined when the following conditions are satisfied.

以上を全未設定成分について順に求めていく。 The above is obtained sequentially for all unset components.

前記の式（６）の例を用いて未設定成分を計算すると、下記のようになる。
まず、処理対象を第０列の未設定成分ａ_２０とする。条件１及び条件２から、成分ａ_２０＝０．５１１が算出され、（ｃ）により成分ａ_４０＝０となる。従って、式（６）は、以下の式（８）となる。 When an unset component is calculated using the example of the above equation (6), the result is as follows.
First, the processing target is an unset component a ₂₀ in the 0th column. From condition 1 and condition 2, component a ₂₀ = 0.511 is calculated, and component a ₄₀ = 0 is obtained by (c). Therefore, Expression (6) becomes the following Expression (8).

次に、処理対象を第１列の未設定成分ａ_２１とする。条件１及び条件２から、成分ａ_２１＝０．４５が算出され、（ｃ）により成分ａ_４１＝０となる。従って、式（８）は、以下の式（９）となる。 Next, the processing target is an unset component a ₂₁ in the first column. From condition 1 and condition 2, component a ₂₁ = 0.45 is calculated, and component a ₄₁ = 0 is obtained by (c). Therefore, Expression (8) becomes the following Expression (9).

次に、処理対象を第３列の未設定成分ａ_２３とする。条件１及び条件２から、成分ａ_２３＝０．０９５が算出され、以下の式（１０）となる。 Next, the processing target is an unset component a ₂₃ in the third column. From condition 1 and condition 2, component a ₂₃ = 0.095 is calculated, and the following equation (10) is obtained.

次に、処理対象を第３列の未設定成分ａ_４３とする。条件１及び条件２から、成分ａ_４３＝０．３５５が算出され、以下の式（１１）となる。 Next, the processing target is an unset component a ₄₃ in the third column. From condition 1 and condition 2, component a ₄₃ = 0.355 is calculated, and the following expression (11) is obtained.

（手順３）移動元、移動先及び移動容量の決定：
以上の（手順２）により求められた変換行列Ａの対角成分以外で、かつ、０以外の成分がディスク容量の移動を表す部分である。すなわち、成分ａ_ｉｊに対し、移動元のノード装置がストレージノード装置（ｊ）、移動先のノード装置がストレージノード装置（ｉ）、移動容量がａ_ｉｊ×Ｍ_ｊである。
前記の例の場合、表２のようにストレージノード装置１００間の移動量を求めることができる。 (Procedure 3) Determination of movement source, movement destination and movement capacity:
A component other than the diagonal component of the transformation matrix A obtained by the above (procedure 2) and a component other than 0 is a portion representing the movement of the disk capacity. That is, for the component a _ij , the source node device is the storage node device (j), the destination node device is the storage node device (i), and the migration capacity is a _ij × M _j .
In the case of the above example, the movement amount between the storage node devices 100 can be obtained as shown in Table 2.

[３．３．３．２処理フロー]
図１７及び図１８は、容量均等化計算部３０２の流れ図を示す。
図１７において、まず、容量均等化計算部３０２は、Ｎ行Ｎ列の変換行列Ａの全成分ａ_ｉｊ（ｉ、ｊは０以上（Ｎ−１）以下の整数）を初期化すると（ステップＳ３０２−１）、ステータス情報取得部３０１が取得したＮ個の全ストレージノード装置（ｉ）のディスク容量Ｃ_ｉ及び使用ディスク容量Ｍ_ｉを用い、式（２）により全ストレージノード装置１００の平均使用率ｒ_ａｖｇを算出する（ステップＳ３０２−２）。続いて、容量均等化計算部３０２は、ステップＳ３０２−２において算出した平均使用率ｒ_ａｖｇとストレージノード装置１００（ｉ）の使用ディスク容量Ｍ_ｉとを用い、式（３）により全てのストレージノード装置（ｉ）について、容量均等化後のストレージノード装置（ｉ）の使用ディスク容量Ｍ^’ _ｉを算出する（ステップＳ３０２−３）。 [3.3.3.2 Processing flow]
17 and 18 show a flowchart of the capacity equalization calculation unit 302.
In FIG. 17, first, the capacity equalization calculation unit 302 initializes all components a _ij (i, j are integers of 0 or more and (N−1) or less) of the N-row N-column conversion matrix A (step S302). -1) Using the disk capacity C _i and the used disk capacity M _i of all the N storage node devices (i) acquired by the status information acquisition unit 301, the average usage rate of all the storage node devices 100 according to equation (2) r _avg is calculated (step S302-2). Subsequently, the capacity equalization calculation unit 302 uses the average usage rate r _avg calculated in step S302-2 and the used disk capacity M _{i of the} storage node device 100 (i), and calculates all the storage nodes according to Expression (3). For the device (i), the used disk capacity M ^′ _i of the storage node device (i) after capacity equalization is calculated (step S302-3).

続いて、容量均等化計算部３０２は、ｉに０を代入して初期化すると（ステップＳ３０２−４）、ストレージノード装置（ｉ）の現在の使用ディスク容量Ｍ_ｉは容量均等化後の使用ディスク容量Ｍ^' _ｉ以上であるかを判断する（ステップＳ３０２−５）。ステップＳ３０２−５においてＹＥＳと判断した場合、容量均等化アルゴリズムの（手順１）の（ａ）の処理を行なう。すなわち、容量均等化計算部３０２は、対角成分ａ_ｉｉに（Ｍ^' _ｉ／Ｍ_ｉ）を設定し（ステップＳ３０２−６）、他の全てのストレージノード装置（ｊ）からストレージノード装置（ｉ）への移動に対応した成分ａ_ｉｊに０を設定する（ステップＳ３０２−７）。一方、ステップＳ３０２−５においてＮＯと判断した場合、容量均等化計算部３０２は、対角成分ａ_ｉｉに１を設定し（ステップＳ３０２−８）、ストレージノード装置（ｉ）から他の全てのストレージノード装置（ｊ）への移動に対応した成分ａ_ｊｉに０を設定する（ステップＳ３０２−９）。 Subsequently, when the capacity equalization calculation unit 302 initializes by substituting 0 for i (step S302-4), the current used disk capacity M _i of the storage node device (i) is the used disk after capacity equalization. It is determined whether the capacity is equal to or greater than the capacity M ^′ _i (step S302-5). If YES is determined in the step S302-5, the process (a) of the (procedure 1) of the capacity equalization algorithm is performed. That is, the capacity equalization calculation unit 302 sets (M ^′ _i / M _i ) in the diagonal component a _ii (step S302-6), and sets the storage node device (i) from all other storage node devices (j). ) Is set to 0 for the component a _ij corresponding to the movement to () (step S302-7). On the other hand, if NO is determined in step S302-5, the capacity equalization calculation unit 302 sets 1 to the diagonal component a _ii (step S302-8), and all other storages from the storage node device (i). The component a _ji corresponding to the movement to the node device (j) is set to 0 (step S302-9).

ステップＳ３０２−７またはステップＳ３０２−９の処理の後、容量均等化計算部３０２は、現在のｉの値に１を加算すると（ステップＳ３０２−１０）、ｉが（Ｎ−１）よりも大きいかを判断する（ステップＳ３０２−１１）。ステップＳ３０２−１１でＮＯと判断した場合、容量均等化計算部３０２は、ステップＳ３０２−５からの処理を繰り返し、ステップＳ３０２−１１でＹＥＳと判断した場合、図１８の処理を行なう。 After the process of step S302-7 or step S302-9, the capacity equalization calculation unit 302 adds 1 to the current value of i (step S302-10), and is i greater than (N-1)? Is determined (step S302-11). When it is determined NO in step S302-11, the capacity equalization calculation unit 302 repeats the process from step S302-5, and when it is determined YES in step S302-11, the process of FIG. 18 is performed.

図１８において、容量均等化計算部３０２は、容量均等化アルゴリズムの（手順２）の処理を行なう。すなわち、容量均等化計算部３０２は、現在値が設定されていない成分ａ_ＩＪを選択する（ステップＳ３０２−１２）。現在値が設定されていない成分の中から任意に成分ａ_ＩＪを選択することができるが、ここでは、Ｉ、Ｊが最も小さい成分を選択する。 In FIG. 18, the capacity equalization calculation unit 302 performs the process (procedure 2) of the capacity equalization algorithm. That is, the capacity equalization calculation unit 302 selects the component a _IJ for which the current value is not set (step S302-12). The component a _IJ can be arbitrarily selected from the components for which the current value is not set. Here, the component having the smallest I and J is selected.

容量均等化計算部３０２は、容量均等化アルゴリズムの（手順２）における（条件１）及び（条件２）を満たすように、成分ａ_ＩＪの値を算出すると（ステップＳ３０２−１３）、容量均等化アルゴリズムの（手順２）の（ｃ）または（ｄ）の条件を満たすかを判断する。 When the capacity equalization calculation unit 302 calculates the value of the component a _IJ so as to satisfy (condition 1) and (condition 2) in (procedure 2) of the capacity equalization algorithm (step S302-13), the capacity equalization It is determined whether the condition (c) or (d) of (Procedure 2) of the algorithm is satisfied.

容量均等化計算部３０２は、ストレージノード装置（Ｊ）から移動するディスク容量の割合の合計が１であり、（ｃ）の条件を満たすと判断した場合（ステップＳ３０２−１４：ＹＥＳ）、まだ値が設定されていない成分ａ_ｋＪに０を設定する（ステップＳ３０２−１５）。また、容量均等化計算部３０２は、ステップＳ３０２−１４でＮＯと判断した場合、ファイル移動後のストレージノード装置（Ｉ）の使用ディスク容量が、容量均等化後の使用ディスク容量Ｍ^' _ｉに等しいかを判断する（ステップＳ３０２−１６）。容量均等化計算部３０２は、ステップＳ３０２−１６がＹＥＳであり、（ｄ）の条件を満たすと判断した場合、まだ値が設定されていない成分ａ_Ｉｋに０を設定する（ステップＳ３０２−１７）。 When the capacity equalization calculation unit 302 determines that the total ratio of the disk capacity moved from the storage node device (J) is 1, and satisfies the condition of (c) (step S302-14: YES), the value is still There is set to 0 component _{a kJ} not set (step S302-15). If the capacity equalization calculation unit 302 determines NO in step S302-14, the used disk capacity of the storage node device (I) after the file movement is equal to the used disk capacity M ^′ _i after the capacity equalization. Is determined (step S302-16). When the capacity equalization calculation unit 302 determines that step S302-16 is YES and satisfies the condition of (d), the capacity equalization calculation unit 302 sets 0 to the component a _Ik that has not yet been set (step S302-17). .

ステップＳ３０２−１５、ステップＳ３０２−１７の処理の後、あるいは、ステップＳ３０２−１６においてＮＯと判断した場合、容量均等化計算部３０２は、値が設定されていない成分があるか否かを判断する（ステップＳ３０２−１８）。値が設定されていない成分があると判断した場合（ステップＳ３０２−１８：ＹＥＳ）、ステップＳ３０２−１２からの処理を繰り返す。 After the processing of step S302-15 and step S302-17, or when NO is determined in step S302-16, the capacity equalization calculation unit 302 determines whether there is a component for which no value is set. (Step S302-18). When it is determined that there is a component for which no value is set (step S302-18: YES), the processing from step S302-12 is repeated.

値が設定されていない成分がないと判断した場合（ステップＳ３０２−１８：ＮＯ）、容量均等化計算部３０２は、容量均等化アルゴリズムの（手順３）を実行する。すなわち、容量均等化計算部３０２は、０以外の成分ａ_ｉｊ（ｉ≠ｊ、かつ、ｉ及びｊは０以上（Ｎ−１）以下の整数）を用いて、ストレージノード装置（ｉ）からストレージノード装置（ｊ）への移動容量ａ_ｉｊ×Ｍ_ｊを算出し（ステップＳ３０２−１９）、ファイル移動部３０３にファイルの移動を指示する（ステップＳ３０２−２０）。 When it is determined that there is no component for which no value is set (step S302-18: NO), the capacity equalization calculation unit 302 executes (procedure 3) of the capacity equalization algorithm. That is, the capacity equalization calculation unit 302 uses the non-zero component a _ij (i ≠ j, and i and j are integers of 0 or more and (N−1) or less) from the storage node device (i). The migration capacity a _ij × M _j to the node device (j) is calculated (step S302-19), and the file migration unit 303 is instructed to migrate the file (step S302-20).

［３．３．３ファイル移動部３０３の動作］
ファイル移動部３０３は、容量均等化計算部３０２により得られたストレージノード装置１００間のファイルの移動処理を行う。しかし、容量均等化計算部３０２において得られたストレージノード装置１００間のファイルの移動処理を全て行なうと、通信ネットワーク１に負荷がかかるだけでなく、多くのストレージノード装置１００についても負荷が増大してしまう。また、移動の間はファイルへのアクセスが制限される場合もある。そこで、ファイル移動処理は、例えば、移動容量が所定以上である、ファイル移動元のストレージノード装置１００のディスク使用率が所定以上であるなど、所定の条件に合致するストレージノード装置１００間の移動のみを対象とする。以下では、最も移動容量が多いストレージノード装置１００間のファイル移動のみを行うものとして説明する。また、ファイル移動をトラヒックが少ない時間に行うことによって、通信ネットワーク１への負荷を抑える。 [3.3.3 Operation of file moving unit 303]
The file moving unit 303 performs a file moving process between the storage node devices 100 obtained by the capacity equalization calculating unit 302. However, if all the file movement processing between the storage node devices 100 obtained in the capacity equalization calculation unit 302 is performed, not only the load is applied to the communication network 1, but also the load increases for many storage node devices 100. End up. Also, access to the file may be restricted during the move. In view of this, the file migration process is performed only for migration between storage node devices 100 that meet a predetermined condition, for example, the migration capacity is greater than or equal to a predetermined value, or the disk usage rate of the storage node device 100 that is the file migration source is greater than or equal to a predetermined value. Is targeted. In the following description, it is assumed that only file movement between storage node devices 100 having the largest migration capacity is performed. Further, the load on the communication network 1 is suppressed by performing the file movement at a time when the traffic is low.

まず、ファイル移動部３０３は、容量均等化計算部３０２が算出した移動容量に基づいて、最も移動容量が多い、ストレージノード装置（ｉ）からストレージノード装置（ｊ）へのファイル移動を行うと判断する。
ファイル移動部３０３は、テーブル検索部３０４にストレージノード装置（ｉ）のノードＩＤを出力してノードアドレスを要求し、テーブル検索部３０４は、記憶部４０に記憶されている検索用ノードテーブル４２から、ノードＩＤに対応したノードアドレスを読み出す。ファイル移動部３０３は、テーブル検索部３０４が読み出したノードアドレスをあて先として、get_listの実行を要求する。 First, the file migration unit 303 determines that the file migration from the storage node device (i) to the storage node device (j) having the largest migration capacity is performed based on the migration capacity calculated by the capacity equalization calculation unit 302. To do.
The file moving unit 303 outputs the node ID of the storage node device (i) to the table searching unit 304 and requests a node address. The table searching unit 304 uses the search node table 42 stored in the storage unit 40. The node address corresponding to the node ID is read out. The file moving unit 303 requests execution of get_list using the node address read by the table searching unit 304 as a destination.

ストレージノード装置１００は、ファイル蓄積部１０８に記憶されているファイルのファイル名とサイズのリストを読み出し、引数listに設定して容量負荷分散ノード装置３００へ返送する。容量負荷分散ノード装置３００のファイル移動部３０３は、ストレージノード装置１００から取得したリストに基づいて、サイズの合計が移動容量に近くなるように移動対象のファイルを選択する。例えば、サイズの大きいファイルから順に選択していき、選択したファイルのサイズ移動容量を最初に超えたとき、あるいは、超える直前までに選択したファイルを移動対象として決定したり、ファイルの全ての組み合わせを生成し、合計のファイルサイズが最も移動容量が近いファイルの組み合わせを選択し、移動対象として決定したりすることができる。 The storage node device 100 reads a list of file names and sizes of files stored in the file storage unit 108, sets the argument list, and returns the list to the capacity load balancing node device 300. The file migration unit 303 of the capacity load distribution node device 300 selects a migration target file based on the list acquired from the storage node device 100 so that the total size is close to the migration capacity. For example, select files in order from the largest file, and when the size movement capacity of the selected file is exceeded for the first time, or before the time when it exceeds, the selected file is determined to be moved, or all combinations of files are selected. It is possible to generate and select a combination of files whose total file size is the closest to the transfer capacity, and determine a transfer target.

ファイル移動部３０３は、移動対象として決定したファイルを読み出すため、ストレージノード装置（ｉ）に移動対象の各ファイルのlow_readの実行を出力する。low_readの引数fnameには、移動対象のファイルのファイル名を、引数offsetに０を、引数sizeに移動対象のファイルのサイズが設定される。ストレージノード装置（ｉ）は、ファイル蓄積部１０８から指定されたファイル名のファイル本体を読み出し、引数dataに設定して容量負荷分散ノード装置３００へ返送する。 The file moving unit 303 outputs execution of low_read of each file to be moved to the storage node device (i) in order to read the file determined as the target to be moved. The file name of the file to be moved is set in the argument fname of low_read, 0 is set in the argument offset, and the size of the file to be moved is set in the argument size. The storage node device (i) reads the file body of the specified file name from the file storage unit 108, sets it as an argument data, and returns it to the capacity load distribution node device 300.

ファイル移動部３０３は、移動対象のファイルをストレージノード装置（ｉ）から読み出すと、読み出した各ファイルを移動先に書き込むため、ストレージノード装置（ｊ）に移動対象の各ファイルのlow_writeの実行を出力する。low_readの引数fnameには、移動対象のファイルのファイル名が、引数offsetには０が、引数sizeには移動対象のファイルのサイズが、引数dataにはファイル本体が設定される。ストレージノード装置（ｊ）は、ファイル蓄積部１０８へ移動対象のファイルを書き込む。ストレージノード装置（ｊ）は、ファイルを書き込むと図１１に示す処理を行なう。 When the file migration unit 303 reads the file to be migrated from the storage node device (i), the file migration unit 303 outputs execution of low_write of each file to be migrated to the storage node device (j) in order to write each read file to the migration destination. To do. The file name of the file to be moved is set in the argument fname of low_read, 0 is set in the argument offset, the size of the file to be moved is set in the argument size, and the file body is set in the argument data. The storage node device (j) writes the file to be moved to the file storage unit 108. When the storage node device (j) writes the file, it performs the processing shown in FIG.

続いてファイル移動部３０３は、ストレージノード装置（ｉ）に記憶されている各移動対象のファイルのlow_deleteの実行を出力する。low_deleteの引数fnameには、移動対象のファイルのファイル名が設定される。ストレージノード装置（ｉ）は、ファイル蓄積部１０８から引数fnameによって指定されたファイル名のファイルを削除する。 Subsequently, the file moving unit 303 outputs execution of low_delete for each file to be moved stored in the storage node device (i). The file name of the file to be moved is set in the argument fname of low_delete. The storage node device (i) deletes the file having the file name specified by the argument fname from the file storage unit 108.

［３．３．４容量負荷分散ノード装置３００の効果］
以上、説明したように、容量負荷分散ノード装置３００により、本実施形態による分散ファイルシステム内のストレージノード装置の使用容量を均等化する機能を追加することができる。そして、１日１回など定期的に、移動容量が多いストレージノード装置１００間についてのみファイル移動を行なうことによって、分散ファイルシステム全体に負荷をかけないよう、ゆるやかに使用容量を均等に近づけることができる。
また、容量負荷分散ノード装置３００の分散ファイルシステムの追加や削除は、分散ファイルシステムの検索処理を滞らせることがないため、分散ファイルシステムの稼働中のどのタイミングでも容量負荷分散機能の追加、削除が行え、容量負荷分散ノード装置３００の数も限定されない。 [3.3.4 Effect of Capacity Load Balancing Node Device 300]
As described above, the capacity load distribution node device 300 can add a function for equalizing the used capacity of the storage node device in the distributed file system according to the present embodiment. Then, by regularly transferring files only between the storage node devices 100 having a large migration capacity, such as once a day, the used capacity can be gradually approached evenly so as not to place a load on the entire distributed file system. it can.
In addition, since the addition and deletion of the distributed file system of the capacity load distribution node device 300 does not delay the search processing of the distributed file system, the addition and deletion of the capacity load distribution function is performed at any time during the operation of the distributed file system. The number of capacity load distribution node devices 300 is not limited.

本実施形態では、分散ファイルシステム内の全ストレージノード装置１００の容量を均等化する場合の説明を行ったが、これに限定されるものではない。例えば、非特許文献３で述べられている、ストレージノード装置の物理的な位置を考慮したローカルノードＩＤを使って管理用ノードテーブル４１を作成した場合、容量負荷分散ノード装置３００の処理を、ローカルＩＤで限定される範囲のストレージノード装置１００に限定して容量負荷分散を行うことも可能である。 In the present embodiment, the description has been given of the case where the capacities of all the storage node devices 100 in the distributed file system are equalized. However, the present invention is not limited to this. For example, when the management node table 41 is created using the local node ID considering the physical position of the storage node device described in Non-Patent Document 3, the processing of the capacity load balancing node device 300 is performed locally. It is also possible to perform capacity load distribution by limiting to the storage node device 100 in a range limited by the ID.

また、容量負荷分散ノード装置３００のファイル移動部３０３は、各ストレージノード装置１００からファイル蓄積部１０８に記憶されているファイルのファイル名と更新日時のリストを取得し、所定の日時以前の更新日時のファイルを上述したストレージノード装置１００間のファイル移動手順に従ってアーカイブ専用のストレージノード装置１００に移動させることも可能である。これによって、アーカイブ専用のストレージノード装置１００にのみ、古いファイルを移動させるような方法も可能となる。
さらに、容量負荷分散機能の処理を、分散ファイルシステムのアクセスが少ない深夜などの特定の時間のみに稼動させるなども可能である。 In addition, the file moving unit 303 of the capacity load balancing node device 300 acquires a list of file names and update dates / times of files stored in the file storage unit 108 from each storage node device 100, and updates / dates before a predetermined date / time. It is also possible to move these files to the storage node device 100 dedicated to archiving in accordance with the file movement procedure between the storage node devices 100 described above. As a result, it is possible to move the old file only to the storage node device 100 dedicated to the archive.
Furthermore, the processing of the capacity load distribution function can be operated only at a specific time such as midnight when there are few accesses of the distributed file system.

[３．４管理ノード装置４００の固有機能]
[３．４．１固有機能処理部の構成]
管理ノード装置４００は、分散ファイルシステムの各ノード装置の状態を表示するための機能ノードである。 [3.4 Unique Function of Management Node Device 400]
[3.4.1 Configuration of unique function processor]
The management node device 400 is a functional node for displaying the status of each node device in the distributed file system.

図１９は、管理ノード装置４００の固有機能処理部４２０の構成を示す図であり、図２に示す固有機能処理部２０に相当する。管理ノード装置４００の固有機能処理部４２０は、ステータス選択部４０１、ステータス情報取得部４０２、情報提示部４０３、ノード通知部４０４、テーブル検索部４０５を備えて構成される。 FIG. 19 is a diagram illustrating a configuration of the unique function processing unit 420 of the management node device 400, and corresponds to the unique function processing unit 20 illustrated in FIG. The unique function processing unit 420 of the management node device 400 includes a status selection unit 401, a status information acquisition unit 402, an information presentation unit 403, a node notification unit 404, and a table search unit 405.

ステータス選択部４０１は、ステータス情報の確認対象を受け付ける。ステータス情報取得部４０２は、ステータス選択部４０１が受け付けた確認対象であるノード装置のステータス情報を管理しているストレージノード装置１００から、ネットワークインタフェース部３０を介してステータス情報を取得し、情報提示部４０３へ通知する。ノード通知部４０４は、記憶部４０の管理用ノードテーブル４１を監視し、ノード装置の参加や離脱が生じた場合に、その参加あるいは離脱したノード装置の情報を情報提示部４０３に通知する。情報提示部４０３は、ステータス情報取得部４０２から通知されたステータス情報、あるいは、ノード通知部４０４から通知されたノード装置の参加や離脱をディスプレイに表示するなどして利用者に提示する。テーブル検索部４０５は、ストレージノード装置１００のテーブル検索部１０６と同様の機能を有し、記憶部４０に保存されている検索用ノードテーブル４２から、指定されたステータスＩＤに時計回りに近いノードＩＤを検索し、そのノードアドレスを返す。 The status selection unit 401 accepts a status information confirmation target. The status information acquisition unit 402 acquires status information via the network interface unit 30 from the storage node device 100 that manages the status information of the node device that is the confirmation target received by the status selection unit 401, and the information presentation unit 403 is notified. The node notification unit 404 monitors the management node table 41 of the storage unit 40 and notifies the information presentation unit 403 of information on the node device that has joined or left when the node device joins or leaves. The information presentation unit 403 presents the status information notified from the status information acquisition unit 402 or the participation or withdrawal of the node device notified from the node notification unit 404 to the user by displaying on the display. The table search unit 405 has a function similar to that of the table search unit 106 of the storage node device 100, and the node ID close to the specified status ID in the clockwise direction from the search node table 42 stored in the storage unit 40. And return its node address.

[３．４．２管理ノード装置４００の動作]
まず、ステータス選択部４０１は、管理ノード装置４００を利用する利用者から、ステータス情報の確認対象となるノード装置の選択、確認したいステータス情報の種別の入力を受け付け、ステータス情報取得部４０２へそれらの情報を通知する。 [3.4.2 Operation of management node apparatus 400]
First, the status selection unit 401 receives, from a user who uses the management node device 400, the selection of a node device that is a confirmation target of status information and the input of the type of status information to be confirmed, and the status information acquisition unit 402 receives these information. Notify information.

ステータス情報取得部４０２は、ステータス選択部４０１から通知されたノード装置のノードアドレス及びステータス情報の種別から、ステータス情報登録部１０５と同様の方法によりキーとなるステータスＩＤを生成する。ステータス情報取得部４０２は、ステータスＩＤをテーブル検索部４０５へ出力し、ステータスＩＤを管理するストレージノード装置１００のノードアドレスを取得する。ステータス情報取得部４０２は、取得したノードアドレスのストレージノード装置１００に対してgetインタフェースの実行を要求し、ステータス情報を取得する。getインタフェースの引数keyには、生成したステータスＩＤが設定される。ステータス情報取得部４０２は、ストレージノード装置１００から取得したステータス情報を情報提示部４０３に通知し、情報提示部４０３は、ステータス情報取得部４０２から通知されたステータス情報を提示する。情報提示部４０３は、確認対象として指定されたノード装置やステータス情報の種類を併せて提示してもよい。情報提示部４０３は、例えば、テキストによるデータの表示、ノード毎にグラフ化した表示など、任意の提示方法を用いることができる。 The status information acquisition unit 402 generates a status ID as a key from the node address of the node device notified from the status selection unit 401 and the type of status information by the same method as the status information registration unit 105. The status information acquisition unit 402 outputs the status ID to the table search unit 405, and acquires the node address of the storage node device 100 that manages the status ID. The status information acquisition unit 402 requests the storage node device 100 of the acquired node address to execute a get interface, and acquires status information. The generated status ID is set in the argument key of the get interface. The status information acquisition unit 402 notifies the information presentation unit 403 of the status information acquired from the storage node device 100, and the information presentation unit 403 presents the status information notified from the status information acquisition unit 402. The information presentation unit 403 may also present the node device designated as the confirmation target and the type of status information. The information presenting unit 403 can use any presenting method such as, for example, displaying data by text or displaying graphs for each node.

一方、ノード通知部４０４は、記憶部４０の管理用ノードテーブル４１を監視し、テーブル処理部１４によって管理用ノードテーブル４１にノードＩＤ及びノードアドレスが追加された場合、そのノードＩＤ及びノードアドレスが追加されるトリガとなったノード情報を情報提示部４０３に通知する。情報提示部４０３は、ノード通知部４０４から通知されたノード情報と、参加である旨を提示する。
同様に、ノード通知部４０４は、テーブル処理部１４によって管理用ノードテーブル４１からノードＩＤ及びノードアドレスが削除された場合、その削除のトリガとなったノード情報を情報提示部４０３に通知する。情報提示部４０３は、ノード通知部４０４から通知されたノード情報と、離脱である旨を提示する。 On the other hand, the node notification unit 404 monitors the management node table 41 of the storage unit 40, and when the node ID and node address are added to the management node table 41 by the table processing unit 14, the node ID and node address are The information presentation unit 403 is notified of the node information that has been added as a trigger. The information presenting unit 403 presents the node information notified from the node notifying unit 404 and the participation.
Similarly, when the node ID and the node address are deleted from the management node table 41 by the table processing unit 14, the node notification unit 404 notifies the information presentation unit 403 of the node information that triggered the deletion. The information presenting unit 403 presents the node information notified from the node notifying unit 404 and the fact that it is a detachment.

[３．４．３管理ノード装置４００の効果]
以上説明したように、管理ノード装置４００により、本実施形態の分散ファイルシステム内に参加している全ノード装置の参加、離脱の状態や、各ノード装置のステータス情報を利用者に提供することができる。本実施形態による管理ノード装置４００の参加及び離脱は、分散ファイルシステムのファイル管理を滞らせることがないため、分散ファイルシステムの稼働中にいつでも、参加または離脱することができ、管理ノード装置４００の数に制限もない。
なお、本実施形態ではノード情報の提示を分散ファイルシステムに参加する全ノード装置を対象として説明したが、これに限定されるものではない。例えば、非特許文献３に提案されているように、ノード装置の物理的な位置の情報を含んだローカルノードＩＤを本実施形態の管理用ノードテーブルに用いることにより、管理ノード装置４００を特定のローカルＩＤ、すなわち特定の物理的な範囲のノード装置に限定して、そのステータス情報を表示することも可能である。 [3.4.3 Effects of management node device 400]
As described above, the management node device 400 can provide the user with the participation / leaving status of all the node devices participating in the distributed file system of this embodiment and the status information of each node device. it can. Since the participation and withdrawal of the management node device 400 according to the present embodiment does not delay the file management of the distributed file system, the management node device 400 can join or leave at any time during the operation of the distributed file system. There is no limit to the number.
In the present embodiment, the presentation of node information has been described for all node devices participating in the distributed file system, but the present invention is not limited to this. For example, as proposed in Non-Patent Document 3, a local node ID including information on the physical position of a node device is used in the management node table of this embodiment, so that the management node device 400 is specified. It is also possible to display the status information limited to the local ID, that is, the node device within a specific physical range.

[４．本実施形態の効果]
以上のように本実施形態によれば、ＤＨＴを用いたＰ２Ｐ型の分散ファイルシステムにおいて、オーバーレイネットワークに参加する各ノード装置は、ファイル検索用のノードテーブルである検索用ノードテーブルと、ノード装置管理用のノードテーブルである管理用ノードテーブルを保持し、ファイルを保管するストレージノード装置のノードＩＤ及びノードアドレスは検索用ノードテーブル及び管理用ノードテーブルに登録し、ファイルを保管せず、各種の付加機能を提供する機能ノード装置については管理用ノードテーブルのみに登録する。従って、機能ノード装置の参加や離脱が生じても検索用ノードテーブルには変更がないため、ファイルＩＤの検索が滞ることがなく、操作対象のファイルの検索に影響を与えない。また、検索用ノードテーブルを同期させる必要もない。 [4. Effects of this embodiment]
As described above, according to the present embodiment, in a P2P type distributed file system using DHT, each node device participating in the overlay network has a search node table that is a file search node table, and node device management. The node ID and node address of the storage node device that holds the management node table that stores the file and stores the file are registered in the search node table and the management node table. The function node device that provides the function is registered only in the management node table. Therefore, the search node table is not changed even if the function node device joins or leaves, so that the search for the file ID is not delayed and the search for the operation target file is not affected. Further, it is not necessary to synchronize the search node table.

このように、Ｐ２Ｐ型分散ファイルシステムにおいて、ファイルストレージ以外の様々な機能を持った機能ノード装置の追加や離脱を、ファイルの検索に影響を与えずに行なうことができるため、分散ファイルシステムを停止することなく機能を拡張することが可能となる。
また、分散ファイルシステムの利用者の増減に対応して、本実施形態のアクセスノード装置の参加や離脱を行なうことで、アクセス負荷分散が可能となる。
また、本実施形態の容量負荷分散ノード装置により、ストレージノード装置がファイルの記憶に要するディスクの使用量を均等化したり、あるいは、管理者が意図した使用量に調整したりすることができる。
また、本実施形態の管理ノード装置により、オーバーレイネットワークに参加しているノード装置の状態を把握し、利用者へ提示することが可能となる。 In this way, in the P2P type distributed file system, the addition and removal of functional node devices having various functions other than file storage can be performed without affecting the file search, so the distributed file system is stopped. It is possible to extend the functions without having to do so.
In addition, the access load can be distributed by joining or leaving the access node device according to the present embodiment in response to the increase or decrease of users of the distributed file system.
In addition, the capacity load distribution node device according to the present embodiment can equalize the usage amount of the disk required for the storage node device to store the file, or adjust the usage amount intended by the administrator.
In addition, the management node device according to the present embodiment can grasp the state of the node device participating in the overlay network and present it to the user.

［５．その他］
上述したストレージノード装置１００、アクセスノード装置２００、容量負荷分散ノード装置３００、管理ノード装置４００、及び、クライアント装置２を実装する装置は、内部にコンピュータシステムを有している。そして、ストレージノード装置１００、アクセスノード装置２００、容量負荷分散ノード装置３００、管理ノード装置４００、クライアント装置２の動作の過程は、プログラムの形式でコンピュータ読み取り可能な記録媒体に記憶されており、このプログラムをコンピュータシステムが読み出して実行することによって、上記処理が行われる。ここでいうコンピュータシステムとは、ＣＰＵ及び各種メモリやＯＳ、周辺機器等のハードウェアを含むものである。 [5. Others]
The storage node device 100, the access node device 200, the capacity load distribution node device 300, the management node device 400, and the device that implements the client device 2 described above have a computer system therein. The operation processes of the storage node device 100, the access node device 200, the capacity load distribution node device 300, the management node device 400, and the client device 2 are stored in a computer-readable recording medium in the form of a program. The above processing is performed by the computer system reading and executing the program. The computer system here includes a CPU, various memories, an OS, and hardware such as peripheral devices.

また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。
また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶部のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバーやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含むものとする。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 Further, the “computer system” includes a homepage providing environment (or display environment) if a WWW system is used.
The “computer-readable recording medium” refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, and a CD-ROM, and a storage unit such as a hard disk built in the computer system. Furthermore, the “computer-readable recording medium” dynamically holds a program for a short time like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory in a computer system serving as a server or a client in that case is also used to hold a program for a certain period of time. The program may be a program for realizing a part of the functions described above, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system.

１…通信ネットワーク
２…クライアント装置
１０…ノードテーブル管理部
１２…通知処理部（通知情報処理部）
１３…安定化処理部
１４…テーブル処理部
２０、１２０、２２０、３２０、４２０…固有機能処理部
３０…ネットワークインタフェース部
４０…記憶部
４１…管理用ノードテーブル
４２…検索用ノードテーブル
１００…ストレージノード装置
１０１…低レベルインタフェース部
１０２…キーテーブル操作部
１０３…ファイル操作部
１０４…ファイル情報登録部
１０５…ステータス情報登録部
１０６…テーブル検索部
１０７…キーテーブル蓄積部
１０８…ファイル蓄積部
２００…アクセスノード装置
２０１…高レベルインタフェース部
２０２…ファイル処理部
２０３…テーブル検索部
３００…容量負荷分散ノード装置
３０１…ステータス情報取得部
３０２…容量均等化計算部
３０３…ファイル移動部
３０４…テーブル検索部
４００…管理ノード装置
４０１…ステータス選択部
４０２…ステータス情報取得部
４０３…情報提示部
４０４…ノード通知部
４０５…テーブル検索部 DESCRIPTION OF SYMBOLS 1 ... Communication network 2 ... Client apparatus 10 ... Node table management part 12 ... Notification processing part (notification information processing part)
13 ... Stabilization processing unit 14 ... Table processing units 20, 120, 220, 320, 420 ... Unique function processing unit 30 ... Network interface unit 40 ... Storage unit 41 ... Management node table 42 ... Search node table 100 ... Storage node Device 101 ... Low-level interface unit 102 ... Key table operation unit 103 ... File operation unit 104 ... File information registration unit 105 ... Status information registration unit 106 ... Table search unit 107 ... Key table storage unit 108 ... File storage unit 200 ... Access node Device 201 ... High-level interface unit 202 ... File processing unit 203 ... Table search unit 300 ... Capacity load distribution node device 301 ... Status information acquisition unit 302 ... Capacity equalization calculation unit 303 ... File transfer unit 304 ... Table search unit 400 ... Management node Location 401 ... status selection section 402 ... status information acquisition unit 403 ... information presentation unit 404 ... node notification unit 405 ... table search unit

Claims

A node device that participates via a network in a distributed file system that distributes and manages files,
A storage node device that distributes and stores files, or a function node device that provides additional functions, a management node table that associates node IDs and addresses of node devices that constitute the distributed file system, and the storage node A storage unit that stores a search node table that associates node IDs and addresses of devices;
Notification of receiving notification information of participation or withdrawal of each node device from the distributed file system, and transmitting the notification information to the address selected based on the node ID stored in the management node table An information processing unit;
When the notification information processing unit receives notification information indicating that a storage node device has participated, the management node table and the search node are used to determine the node ID and address of the storage node device based on the notification information. Write to node table,
When the notification information processing unit receives notification information indicating that a functional node device has participated, the node ID and address of the functional node device are written only in the management node table based on the notification information,
When the notification information processing unit receives notification information indicating that the storage node device has left, the management node table and the address corresponding to the node ID of the storage node device based on the notification information Delete from the search node table,
If the notification information processing unit receives notification information indicating that the functional node device has left, the node ID of the functional node device and the corresponding address are deleted from the management node table based on the notification information A table processing unit to
A table search unit that acquires an address of the storage node device that manages the file ID based on the specified file ID and the node ID stored in the search node table;
A node device comprising:

A stabilizing unit for confirming whether or not the node device at the address registered in the management node table is disconnected;
The table processing unit leaves the management node table and the search node table when the stabilization unit detects that the node device at the address registered in the management node table has left. Delete the node ID and address of the node device
The notification information processing unit, with the address selected based on the node ID stored in the management node table as a destination, notification information indicating that the node device detected by the stabilization unit has detached Send,
The node device according to claim 1, wherein:

A file storage unit for storing files;
A key table storage unit that stores a file ID and an address of the storage node device storing the file in association with each other;
A key table operation unit that reads out and outputs the address of the storage node device corresponding to the specified file ID from the key table storage unit;
A file operation unit that receives an operation instruction specifying a file name and performs an operation on the file in the file storage unit specified by the file name;
The node device according to claim 1, further comprising:

The key table storage unit stores a status ID and status information in association with each other,
A status ID for specifying the status information of the own node device is passed to the table search unit, and the registration request for the status ID and the status information is transmitted to the address acquired by the table search unit corresponding to the status ID. It further includes a status information registration unit,
When receiving a registration request for status ID and status information, the key table operation unit writes the status ID requested for registration and the status information in association with each other in the key table storage unit, and designates the status ID. When a request for status information is received, the status information corresponding to the received status ID is read from the key table storage unit and output,
The table search unit acquires the address of the storage node device using the status ID instead of the file ID.
The node device according to claim 3.

A file operation instruction designating a file name is received, a file ID generated from the received file name is passed to the table search unit, and the file obtained by the table search unit corresponding to the file ID is used as the destination. A file processing unit that transmits an ID, receives an address of a storage node device storing a file corresponding to the transmitted file ID, and transmits a file operation instruction specifying the file name with the received address as a destination Further comprising
The node device according to claim 1, wherein the node device is a node device.

A status ID for identifying status information of each storage node device in the distributed file system is passed to the table search unit, and the status ID is transmitted to the address acquired by the table search unit corresponding to the status ID. A status information acquisition unit that requests status information;
Based on the status information of the storage node device acquired in response to the request of the status information acquisition unit, a capacity equalization calculation unit that determines the storage node device of the migration source and migration destination of the file, and the migration capacity;
A file mover for moving a file based on the move capacity from the storage node device of the file move source determined by the capacity equalization calculation unit to the storage node device of the file move destination,
The table search unit acquires the address of the storage node device using the status ID instead of the file ID.
The node device according to claim 1, wherein the node device is a node device.

A status for acquiring status information by passing a status ID for identifying status information of the node device to the table search unit, transmitting the status ID to the address acquired by the table search unit corresponding to the status ID An information acquisition unit;
An information presentation unit that outputs the status information acquired by the status information acquisition unit,
The table search unit acquires the address of the storage node device using the status ID instead of the file ID.
The node device according to claim 1, wherein the node device is a node device.

An information presenting unit that outputs participation or withdrawal of a node device when it is detected that a node ID has been added or deleted from the management node table;
The node device according to claim 1, wherein the node device is a node device.

A computer used as a node device that participates via a network in a distributed file system that distributes and manages files.
A storage node device that distributes and stores files, or a function node device that provides additional functions, a management node table that associates node IDs and addresses of node devices that constitute the distributed file system, and the storage node A storage unit that stores a search node table that associates node IDs and addresses of devices;
Notification of receiving notification information of participation or withdrawal of each node device from the distributed file system, and transmitting the notification information to the address selected based on the node ID stored in the management node table An information processing unit;
When the notification information processing unit receives notification information indicating that a storage node device has participated, the management node table and the search node are used to determine the node ID and address of the storage node device based on the notification information. Write to node table,
When the notification information processing unit receives notification information indicating that a functional node device has participated, the node ID and address of the functional node device are written only in the management node table based on the notification information,
When the notification information processing unit receives notification information indicating that the storage node device has left, the management node table and the address corresponding to the node ID of the storage node device based on the notification information Delete from the search node table,
If the notification information processing unit receives notification information indicating that the functional node device has left, the node ID of the functional node device and the corresponding address are deleted from the management node table based on the notification information A table processing unit to
A table search unit that acquires an address of the storage node device that manages the file ID based on the specified file ID and the node ID stored in the search node table;
A computer program that functions as a computer program.