JP2007265314A

JP2007265314A - Distributed storage system and distributed storage program

Info

Publication number: JP2007265314A
Application number: JP2006092704A
Authority: JP
Inventors: Yasuo Ishikawa; 康雄石川
Original assignee: Mitsubishi Electric Information Systems Corp; Mitsubishi Electric Information Technology Corp
Current assignee: Mitsubishi Electric Information Systems Corp; Mitsubishi Electric Information Technology Corp
Priority date: 2006-03-30
Filing date: 2006-03-30
Publication date: 2007-10-11

Abstract

<P>PROBLEM TO BE SOLVED: To obtain a distributed storage system with higher safety. <P>SOLUTION: In the distributed storage system, a super node 104 stores a file in a plurality of distributed disks 101 and reproduces the file by using elimination correction coding technology. Since there are a plurality of super nodes 104, they can distribute a load and since the super nodes 104 do not store management information, they are safe. A management terminal 102 stores the management information and is safe since general users can not access it. It is safe since the distributed disks are restored by using the management information from the management terminal 102 even if they are broken. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、分散ストレージシステム及び分散ストレージプログラムに関する。 The present invention relates to a distributed storage system and a distributed storage program.

図１４に、従来の専用ストレージシステムの代表的な構成であるＳＡＮ（ＳｔｏｒａｇｅＡｒｅａＮｅｔｗｏｒｋ）の構成図を示す。ＳＡＮは、データを保管する専用ストレージデバイス１４０１、ファイルサーバ１４０３、ユーザ端末１４０５、専用ストレージデバイス１４０１とファイルサーバ１４０３とを高速に接続するＦＣＳＷ（ＦｉｂｒｅＣｈａｎｎｅｌＳｗｉｔｃｈ）１４０２、及びファイルサーバ１４０３とユーザ端末１４０５を接続するインターネット１４０４から構成される。ユーザが、ユーザ端末１４０５からファイルサーバ１４０３に対して、ファイルの読出し又は書込み等の依頼を行うと、自動的に専用ストレージデバイス１４０１に読出し又は書込み等の処理が行なわれる。 FIG. 14 shows a configuration diagram of a SAN (Storage Area Network) which is a typical configuration of a conventional dedicated storage system. The SAN includes a dedicated storage device 1401 that stores data, a file server 1403, a user terminal 1405, an FC SW (Fibre Channel Switch) 1402 that connects the dedicated storage device 1401 and the file server 1403 at high speed, and a file server 1403 and a user terminal 1405. The Internet 1404 is connected. When the user requests the file server 1403 from the user terminal 1405 to read or write a file, the dedicated storage device 1401 is automatically read or written.

図１５に特開２００３−３４８０６５号公報に示されるデータ分散保管システムの構成を示す。前記データ分散保管システムは、複数の分散保管サーバ１５０１、ユーザーパソコン１５０３、及び複数の分散保管サーバ１５０１とユーザーパソコン１５０３とを接続するインターネット１５０２から構成される。 FIG. 15 shows a configuration of a data distributed storage system disclosed in Japanese Patent Application Laid-Open No. 2003-348065. The data distributed storage system includes a plurality of distributed storage servers 1501, user personal computers 1503, and the Internet 1502 connecting the plurality of distributed storage servers 1501 and user personal computers 1503.

次に、図１６のフローチャートを用いて、図１５のデータ分散保管システムにおけるデータ保管の処理の流れを説明する。ユーザーパソコン１５０３は、ユーザからデータ保管の依頼を受け付けると、保管対象の元データファイルを暗号化し、誤り訂正符号化し、五つの冗長化ファイルを生成する（ステップＳ１６０１）。次に、暗号鍵をシェア鍵に符号化する（ステップＳ１６０２）。次に、冗長化ファイルとシェア鍵を五つの分散保管ペアとして、インターネットを介して五つの分散保管サーバへ送信する（ステップＳ１６０３）。 Next, the flow of data storage processing in the data distributed storage system of FIG. 15 will be described using the flowchart of FIG. Upon receiving a data storage request from the user, the user personal computer 1503 encrypts the original data file to be stored, performs error correction encoding, and generates five redundant files (step S1601). Next, the encryption key is encoded into a shared key (step S1602). Next, the redundant file and the share key are transmitted as five distributed storage pairs to the five distributed storage servers via the Internet (step S1603).

次に、図１７のフローチャートを用いて、図１５のデータ分散保管システムにおけるデータ再生の処理の流れを説明する。ユーザーパソコン１５０３は、ユーザからデータ再生の依頼を受け付けると、インターネットを介して分散保管サーバにアクセスして冗長化ファイルとシェア鍵のペアを取得する（ステップＳ１７０１）。なお、２個のシェア鍵があれば、暗号鍵を復元でき、３つの冗長化ファイルがあれば、誤り訂正復号により暗号化ファイルを得ることができる。次に、誤り訂正復号により暗号化ファイルを復元する（ステップＳ１７０２）。次に、暗号化ファイルを暗号鍵で復号して、元データファイルを得る（ステップＳ１７０３）。
特開２００３−３４８０６５号公報 Next, the flow of data reproduction processing in the data distributed storage system of FIG. 15 will be described using the flowchart of FIG. Upon receiving a data reproduction request from the user, the user personal computer 1503 accesses the distributed storage server via the Internet and acquires a redundant file and share key pair (step S1701). If there are two shared keys, the encryption key can be restored, and if there are three redundant files, an encrypted file can be obtained by error correction decoding. Next, the encrypted file is restored by error correction decoding (step S1702). Next, the encrypted file is decrypted with the encryption key to obtain the original data file (step S1703).
JP 2003-348065 A

従来の専用ストレージシステムの代表的な構成であるＳＡＮでは、専用ストレージデバイスが非常に高価であること、専用ストレージデバイスに負荷が集中すること、データが分散されていないため災害時に専用ストレージデバイスが破損した場合、保管したデータを全て紛失する可能性があること、等の問題点があった。 In SAN, which is a typical configuration of a conventional dedicated storage system, the dedicated storage device is very expensive, the load is concentrated on the dedicated storage device, and the data is not distributed, so the dedicated storage device is damaged during a disaster. In such a case, there is a problem that all stored data may be lost.

また、特開２００３−３４８０６５号公報に示されるデータ分散保管システムでは、ユーザーパソコンが全ての処理を行うため、ユーザーパソコンに負荷が集中すること、分散保管サーバが故障するなどした場合に保管したデータの復元手段がなく、予め想定した個数以上の分散保管サーバが破損した場合には、保管したデータを復号できなくなること等の問題点があった。また、前述のようなデータ分散保管システムでは、ファイル管理をユーザーパソコンにおいて行っており、管理情報の安全性に問題があった。また、複数のユーザーパソコンから分散保管サーバを使用する場合に、あるデータの保管を依頼したユーザーパソコンとは異なるユーザーパソコンから、同じデータの再生を依頼するなどの柔軟性に欠けていた。 Further, in the data distributed storage system disclosed in Japanese Patent Application Laid-Open No. 2003-348065, the data stored when the load is concentrated on the user personal computer or the distributed storage server fails because the user personal computer performs all processing. However, there is a problem that stored data cannot be decrypted when the number of distributed storage servers more than a predetermined number is damaged. Further, in the data distributed storage system as described above, file management is performed on the user personal computer, and there is a problem in the safety of management information. In addition, when using a distributed storage server from a plurality of user computers, there was a lack of flexibility such as requesting the reproduction of the same data from a user computer different from the user computer that requested the storage of certain data.

本発明は、前述のような問題点を解決するためのものであり、消失訂正符号化技術を用いた分散ストレージシステムにおいて、ユーザからファイルの保管依頼及び読込み依頼を受け付けて、実際の保管処理及び読込み処理を行うコンピュータであるスーパーノードにおいて、分散保管の対象となる元ファイル、元ファイルを分割して符号化した符号化ファイル、及び符号化ファイルを保管する分散ディスク等に係る名称、ＩＤなどの管理情報を、保管処理中及び読込み処理中以外は、記憶しないようにして、情報秘匿性の高い分散ストレージシステムを得ること、また、複数の分散ディスクの内、任意の１台が故障又は非正常稼動となった場合に、該当の分散ディスク上に保管されていた符号化ファイル全てを、修理又は交換後の分散ディスク上に復元できるようにして、信頼性・保守性の高い分散ストレージシステムを得ること、などにより、情報隠匿性・信頼性・保守性などの面で、より安全性の高い分散ストレージシステムを得ることが目的である。 The present invention is for solving the above-described problems, and in a distributed storage system using erasure correction coding technology, a file storage request and a read request are received from a user, and an actual storage process and In a super node that is a computer that performs read processing, an original file to be distributed and stored, an encoded file obtained by dividing and encoding the original file, a name, an ID, and the like related to a distributed disk that stores the encoded file, etc. Management information is not stored except during storage processing and reading processing to obtain a distributed storage system with high information confidentiality, and any one of a plurality of distributed disks has failed or is abnormal. In the case of operation, all the encoded files stored on the corresponding distributed disk are all repaired or replaced on the distributed disk. To obtain a distributed storage system with higher security in terms of information concealment, reliability, maintainability, etc. Is the purpose.

本発明に係る分散ストレージシステムは、
複数の分散ディスク及びひとつ又は複数のユーザ端末にネットワークを介して接続され、ファイルの保管依頼及び読込み依頼を受け付けるスーパーノードを有する分散ストレージシステムであり、
前記スーパーノードは、
前記ユーザ端末から、分散保管の対象となる元ファイルと、前記元ファイルの保管依頼とを受け付けると、前記元ファイルを分割して符号化した複数の符号化ファイルを生成し、前記符号化ファイル名を、前記元ファイルとの対応を特定するための文字列を含んだ名称とする分割符号化部と、
前記複数の符号化ファイル各々を、異なる複数の分散ディスクに保管依頼する配信部と、
前記複数の符号化ファイルを分散ディスクの保管後、前記元ファイル、前記複数の符号化ファイル、前記元ファイル又は前記複数の符号化ファイルを特定することができる情報、前記複数の符号化ファイルを保管した複数の分散ファイルを特定することができる情報を削除するファイル管理部と、
前記ユーザ端末から、元ファイル名と、前記元ファイルの読込み依頼とを受け付けると、分散ディスクに対して、前記元ファイルから生成された符号化ファイルを保管しているかどうかの確認を行い、前記元ファイルと符号化ファイル名とを比べることにより、保管を確認できた前記分散ディスクから所定の個数の前記符号化ファイルを取得して、復号することにより、前記元ファイルを再生する再生部と
を備えた事を特徴とする。 The distributed storage system according to the present invention includes:
A distributed storage system having a super node connected to a plurality of distributed disks and one or a plurality of user terminals via a network and receiving a file storage request and a read request,
The super node is
When receiving the original file to be distributed and the storage request for the original file from the user terminal, a plurality of encoded files obtained by dividing and encoding the original file are generated, and the encoded file names A division encoding unit having a name including a character string for specifying the correspondence with the original file,
A distribution unit that requests storage of each of the plurality of encoded files on a plurality of different distributed disks;
After the plurality of encoded files are stored on a distributed disk, the original file, the plurality of encoded files, information that can specify the original file or the plurality of encoded files, and the plurality of encoded files are stored. A file management unit that deletes information that can identify a plurality of distributed files,
When receiving the original file name and the read request of the original file from the user terminal, it is confirmed whether or not the encoded file generated from the original file is stored in the distributed disk, A reproduction unit that reproduces the original file by obtaining and decoding a predetermined number of the encoded files from the distributed disk whose storage has been confirmed by comparing the file and the encoded file name; It is characterized by that.

前記分散ストレージシステムは、さらに前記スーパーノードとネットワークを介して接続された管理端末を有し、
前記ファイル管理部は、さらに、元ファイルと符号化ファイルと分散ディスクとの対応を表すファイル管理情報を生成して、前記管理端末に送信し、送信完了した前記ファイル管理情報を削除し、
前記管理端末は、
前記ファイル管理部から受信した前記ファイル管理情報を保管するファイル管理情報記憶部と、
任意の分散ディスクの復元依頼要求に基づき、前記ファイル管理情報記憶部に保管された前記ファイル管理情報から、前記分散ディスクに保管された符号化ファイルと、前記符号化ファイルに対応する元ファイルとを特定し、前記元ファイルを復号するために必要な前記分散ディスク以外の分散ディスクに保管されるその他の符号化ファイルを特定して取得し、前記その他の符号化ファイルから前記元ファイルを復号し、復号された前記元ファイルを分割し、符号化することにより、前記符号化ファイルを再生成し、前記分散ディスクに、再生成された前記符号化ファイルを保管依頼する分散ディスク復元部と
を備えたことを特徴とする。 The distributed storage system further includes a management terminal connected to the super node via a network,
The file management unit further generates file management information indicating correspondence between the original file, the encoded file, and the distributed disk, transmits the file management information to the management terminal, deletes the file management information that has been transmitted,
The management terminal
A file management information storage unit for storing the file management information received from the file management unit;
Based on the request for restoration of any distributed disk, the encoded file stored in the distributed disk and the original file corresponding to the encoded file are obtained from the file management information stored in the file management information storage unit. Identifying and obtaining other encoded files stored on a distributed disk other than the distributed disk necessary for decoding the original file, decoding the original file from the other encoded files, A distributed disk restoration unit that regenerates the encoded file by dividing and encoding the decoded original file, and requests the storage of the regenerated encoded file in the distributed disk; It is characterized by that.

前記管理端末は、さらに、
複数の分散ディスクに対して正常稼動しているかどうかの確認を行い、正常稼動を確認できなかった分散ディスクについてはアラームを出力し、その後、前記正常稼動を確認できなかった分散ディスクから正常稼動の通知を受信した場合、前記分散ディスク復元部に前記正常稼動を確認できなかった分散ディスクの復元を指示する分散ディスク状態確認部
を備えたことを特徴とする。 The management terminal further includes:
Checks whether multiple distributed disks are operating normally, outputs an alarm for distributed disks that could not be confirmed to operate normally, and then started normal operation from the distributed disks that could not be confirmed to operate normally. When the notification is received, the distributed disk restoration unit includes a distributed disk state confirmation unit that instructs the distributed disk restoration unit to restore the distributed disk whose normal operation could not be confirmed.

前記分散ストレージシステムは、さらに複数の前記スーパーノードと、前記複数のスーパーノードとネットワークを介して接続されたユーザ端末とを有し、
前記ユーザ端末は、
複数の前記スーパーノードから、正常稼動中の任意のひとつのスーパーノードを選択して、ファイルの保管依頼及び読込み依頼を行うこと
を特徴とする。 The distributed storage system further includes a plurality of the super nodes and user terminals connected to the plurality of super nodes via a network,
The user terminal is
One arbitrary super node in normal operation is selected from the plurality of super nodes, and a file storage request and a read request are made.

また、本発明に係る分散ストレージシステムは、
複数の分散ディスク及びひとつ又は複数のユーザ端末にネットワークを介して接続され、ファイルの保管依頼、読込み依頼、及び分散ディスクの復元依頼を受け付けるスーパーノードを有する分散ストレージシステムであり、
前記スーパーノードは、
前記ユーザ端末から、分散保管の対象となる元ファイルと、前記元ファイルの保管依頼とを受け付けると、前記元ファイルを分割して符号化した複数の符号化ファイルを生成する分割符号化部と、
前記複数の符号化ファイル各々を、異なる分散ディスクに保管依頼する配信部と、
元ファイルと符号化ファイルと分散ディスクとの対応を表すファイル管理情報を生成するファイル管理部と、
前記ファイル管理情報を保管するファイル管理情報記憶部と、
前記ユーザ端末から、元ファイル名と、前記元ファイルの読込み依頼とを受け付けると、前記ファイル管理情報により、前記元ファイルから生成した符号化ファイルを保管する分散ディスクを特定し、前記分散ディスクに対して、正常稼動の確認を行い、確認できた前記分散ディスクから所定の個数の符号化ファイルを取得して、復号することにより、前記元ファイルを再生する再生部と
任意の分散ディスクの復元依頼要求に基づき、前記ファイル管理情報記憶部に保管された前記ファイル管理情報から、前記分散ディスクに保管された符号化ファイルと、前記符号化ファイルに対応する元ファイルとを特定し、前記元ファイルを復号するために必要な前記分散ディスク以外の分散ディスクに保管されるその他の符号化ファイルを特定して取得し、前記その他の符号化ファイルから前記元ファイルを復号し、復号された前記元ファイルを分割し、符号化することにより、前記符号化ファイルを再生成し、前記分散ディスクに、再生成された前記符号化ファイルを保管依頼する分散ディスク復元部と
を備えたことを特徴とする。 Further, the distributed storage system according to the present invention includes:
A distributed storage system having a super node connected to a plurality of distributed disks and one or a plurality of user terminals via a network and receiving a file storage request, a read request, and a distributed disk restoration request,
The super node is
When receiving an original file to be distributed and a storage request for the original file from the user terminal, a division encoding unit that generates a plurality of encoded files obtained by dividing and encoding the original file;
A distribution unit that requests storage of each of the plurality of encoded files on different distributed disks;
A file management unit that generates file management information indicating correspondence between the original file, the encoded file, and the distributed disk;
A file management information storage unit for storing the file management information;
When receiving the original file name and the read request of the original file from the user terminal, the file management information identifies a distributed disk that stores the encoded file generated from the original file, and Then, a normal operation is confirmed, a predetermined number of encoded files are obtained from the confirmed distributed disk, and the playback unit that reproduces the original file and decoding request for any distributed disk are obtained by decoding. Based on the file management information, the encoded file stored in the distributed disk and the original file corresponding to the encoded file are identified from the file management information stored in the file management information storage unit, and the original file is decoded. Identify and capture other encoded files stored on a distributed disk other than the distributed disk Then, the original file is decoded from the other encoded files, the decoded original file is divided and encoded, and the encoded file is regenerated and regenerated on the distributed disk. And a distributed disk restoration unit for requesting storage of the encoded file.

また、本発明に係る分散ストレージプログラムは、
ネットワークを介して接続されたユーザ端末から、分散保管の対象となる元ファイルと、前記元ファイルの保管依頼とを受け付けると、前記元ファイルを分割して符号化した複数の符号化ファイルを生成し、前記符号化ファイル名を、前記元ファイルとの対応を特定するための文字列を含んだ名称とする分割符号化処理と、
前記複数の符号化ファイル各々を、ネットワークを介して接続された、異なる複数の分散ディスクに保管依頼する配信処理と、
前記複数の符号化ファイルを分散ディスクの保管後、前記元ファイル、前記複数の符号化ファイル、前記元ファイル又は前記複数の符号化ファイルを特定することができる情報、前記複数の符号化ファイルを保管した複数の分散ファイルを特定することができる情報を削除するファイル管理処理と、
ネットワークを介して接続されたユーザ端末から、元ファイル名と、前記元ファイルの読込み依頼とを受け付けると、分散ディスクに対して、前記元ファイルから生成された符号化ファイルを保管しているかどうかの確認を行い、前記元ファイルと符号化ファイル名とを比べることにより、保管を確認できた前記分散ディスクから所定の個数の前記符号化ファイルを取得して、復号することにより、前記元ファイルを再生する再生処理と
をコンピュータに実行させることを特徴とする。 The distributed storage program according to the present invention is
When receiving an original file to be distributed and a storage request for the original file from a user terminal connected via a network, a plurality of encoded files obtained by dividing and encoding the original file are generated. , A division encoding process in which the encoded file name is a name including a character string for specifying correspondence with the original file;
A delivery process for requesting storage of each of the plurality of encoded files to a plurality of different distributed disks connected via a network;
After the plurality of encoded files are stored on a distributed disk, the original file, the plurality of encoded files, information that can specify the original file or the plurality of encoded files, and the plurality of encoded files are stored. File management process to delete information that can identify multiple distributed files,
When an original file name and a request to read the original file are received from a user terminal connected via a network, whether or not the encoded file generated from the original file is stored in a distributed disk Confirming and comparing the original file with the encoded file name, obtains a predetermined number of the encoded files from the distributed disk whose storage has been confirmed, and reproduces the original file by decoding. And a playback process to be executed by a computer.

また、本発明に係る分散ストレージプログラムは、
ネットワークを介して接続されたユーザ端末から、分散保管の対象となる元ファイルと、前記元ファイルの保管依頼とを受け付けると、前記元ファイルを分割して符号化した複数の符号化ファイルを生成する分割符号化処理と、
前記複数の符号化ファイル各々を、ネットワークを介して接続された、異なる分散ディスクに保管依頼する配信処理と、
元ファイルと符号化ファイルと分散ディスクとの対応を表すファイル管理情報を生成するファイル管理処理と、
前記ファイル管理情報を保管するファイル管理情報記憶処理と、
ネットワークを介して接続されたユーザ端末から、元ファイル名と、前記元ファイルの読込み依頼とを受け付けると、前記ファイル管理情報により、前記元ファイルから生成した符号化ファイルを保管する分散ディスクを特定し、前記分散ディスクに対して、正常稼動の確認を行い、確認できた前記分散ディスクから所定の個数の符号化ファイルを取得して、復号することにより、前記元ファイルを再生する再生処理と
任意の分散ディスクの復元依頼要求に基づき、前記ファイル管理情報記憶処理に保管された前記ファイル管理情報から、前記分散ディスクに保管された符号化ファイルと、前記符号化ファイルに対応する元ファイルとを特定し、前記元ファイルを復号するために必要な前記分散ディスク以外の分散ディスクに保管されるその他の符号化ファイルを特定して取得し、前記その他の符号化ファイルから前記元ファイルを復号し、復号された前記元ファイルを分割し、符号化することにより、前記符号化ファイルを再生成し、前記分散ディスクに、再生成された前記符号化ファイルを保管依頼する分散ディスク復元処理と
をコンピュータに実行させることを特徴とする。 The distributed storage program according to the present invention is
When an original file to be distributed and a storage request for the original file are received from a user terminal connected via a network, a plurality of encoded files obtained by dividing and encoding the original file are generated. Division coding processing,
A delivery process for requesting storage of each of the plurality of encoded files to different distributed disks connected via a network;
A file management process for generating file management information indicating the correspondence between the original file, the encoded file, and the distributed disk;
A file management information storage process for storing the file management information;
When an original file name and a request to read the original file are received from a user terminal connected via a network, a distributed disk that stores an encoded file generated from the original file is specified by the file management information. The normal operation of the distributed disk is confirmed, a predetermined number of encoded files are obtained from the confirmed distributed disk, and decoding is performed to reproduce the original file. Based on the request to restore the distributed disk, the encoded file stored in the distributed disk and the original file corresponding to the encoded file are identified from the file management information stored in the file management information storage process. Stored on a distributed disk other than the distributed disk required to decrypt the original file Identifying and obtaining the encoded file, decoding the original file from the other encoded files, dividing the decoded original file, and encoding to regenerate the encoded file; A distributed disk restoration process for requesting storage of the regenerated encoded file to the distributed disk is executed by a computer.

本発明によれば、より安全性の高い分散ストレージシステムを得ることができる。 According to the present invention, a more secure distributed storage system can be obtained.

実施の形態１．
以下、図１を用いて、本実施の形態における分散ストレージシステム１００の構成を説明する。本実施の形態における分散ストレージシステム１００は、データを保管する複数の分散ディスク１０１と、分散ディスク１０１に保管されたデータのファイル管理情報を扱う管理端末１０２と、分散保管のための分割、及び符号化を行なう複数のスーパーノード１０４と、ユーザからのデータ保管の依頼、及びデータ読込みの依頼を受け付ける複数のユーザ端末１０６と、分散ディスク１０１、管理端末１０２、及びスーパーノード１０４を接続する管理者ネットワーク１０３と、スーパーノード１０４、及びユーザ端末１０６を接続するＷＡＮ１０５（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）とから構成される。ユーザは任意のユーザ端末１０６から、任意のスーパーノード１０４を選択して、元ファイルの保管を依頼し、依頼を受け付けたスーパーノード１０４は元ファイルを分割及び符号化して、分散ディスク１０１に保管し、元ファイルをどの分散ディスク１０１に保管したのかに係るファイル管理情報を管理端末１０２に保管する。また、ユーザは任意のユーザ端末１０６から、任意のスーパーノード１０４を選択して、元ファイルの読込みを依頼し、依頼を受け付けたスーパーノード１０４は分散ディスク１０１に保管された元ファイルに係るデータを取得して復号し、ユーザ端末１０６に送信する。また、管理端末１０２は、前述のように保管されたファイル管理情報により、任意の分散ディスク１０１のデータを復元する。なお、管理端末１０２は、管理者ネットワーク１０３に接続されており、ＷＡＮ１０５には接続されていないため、管理者を除く一般のユーザはアクセスできない。管理者ネットワーク１０３は、ファイアウォールで保護されたＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）であることが望ましい。ＷＡＮ１０５は、スーパーノード１０４と広範囲に配置されたユーザ端末１０６とを接続することを想定しているが、ＬＡＮ等、狭い範囲で構築されていても構わない。 Embodiment 1 FIG.
Hereinafter, the configuration of the distributed storage system 100 according to the present embodiment will be described with reference to FIG. The distributed storage system 100 according to the present embodiment includes a plurality of distributed disks 101 that store data, a management terminal 102 that handles file management information of data stored in the distributed disk 101, a division and code for distributed storage Network connecting a plurality of supernodes 104, a plurality of user terminals 106 receiving data storage requests and data reading requests, a distributed disk 101, a management terminal 102, and a supernode 104 103, a super node 104, and a WAN 105 (Wide Area Network) connecting the user terminal 106. The user selects an arbitrary super node 104 from an arbitrary user terminal 106 and requests storage of the original file. Upon receiving the request, the super node 104 divides and encodes the original file and stores it in the distributed disk 101. The file management information relating to which distributed disk 101 the original file is stored in is stored in the management terminal 102. In addition, the user selects an arbitrary super node 104 from an arbitrary user terminal 106 and requests reading of the original file. The super node 104 that has received the request receives data related to the original file stored in the distributed disk 101. Obtaining, decoding, and transmitting to the user terminal 106. In addition, the management terminal 102 restores data on an arbitrary distributed disk 101 based on the file management information stored as described above. Since the management terminal 102 is connected to the administrator network 103 and not connected to the WAN 105, general users other than the administrator cannot access it. The administrator network 103 is preferably a LAN (Local Area Network) protected by a firewall. The WAN 105 is assumed to connect the super node 104 and the user terminals 106 arranged over a wide range, but may be constructed in a narrow range such as a LAN.

次に、図２を用いて、本実施の形態における分散ストレージシステム１００の詳細な構成を説明する。分散ディスク１０１は、分割されて符号化されたデータを保管する符号化データ記憶部２０１を有する。分割されて符号化されたデータについては、図４において後述する。また、分割されて符号化されたデータの名称については、図５において後述する。 Next, a detailed configuration of the distributed storage system 100 according to the present embodiment will be described with reference to FIG. The distribution disk 101 has an encoded data storage unit 201 that stores the divided and encoded data. The divided and encoded data will be described later with reference to FIG. The names of the divided and encoded data will be described later with reference to FIG.

管理端末１０２は、分散ディスク記憶部１（２０２）と、ファイル管理情報記憶部２０３と、分散ディスク状態確認部２０４と、分散ディスク復元部２０５とを有する。分散ディスク記憶部１（２０２）は、複数の分散ディスク１０１のノード名、及びＩＰアドレスなどを対応付けて保管する。ファイル管理情報記憶部２０３は、保管対象である元ファイルを特定するＩＤ（元ファイル名）と、元ファイルを分割して符号化した符号化ファイルを特定するＩＤ（符号化ファイルＮｏ．）と、保管した分散ディスク１０１を特定するＩＤ（分散ディスクノード名）とを対応付けたファイル管理情報を保管する。分散ディスク状態確認部２０４は、分散ディスク１０１が正常稼動しているかどうかの確認、及び復元の指示を行う。分散ディスク復元部２０５は、ファイル管理情報記憶部２０３に保管されたファイル管理情報により、任意の分散ディスク１０１の復元を行う。ファイル管理情報記憶部２０３に保管されたファイル管理情報については、図６において後述する。 The management terminal 102 includes a distributed disk storage unit 1 (202), a file management information storage unit 203, a distributed disk state confirmation unit 204, and a distributed disk restoration unit 205. The distributed disk storage unit 1 (202) stores node names, IP addresses, and the like of the plurality of distributed disks 101 in association with each other. The file management information storage unit 203 includes an ID (original file name) that identifies an original file that is a storage target, an ID (encoded file No.) that identifies an encoded file obtained by dividing and encoding the original file, The file management information in which the ID (distributed disk node name) specifying the stored distributed disk 101 is associated is stored. The distributed disk status confirmation unit 204 confirms whether or not the distributed disk 101 is operating normally, and instructs restoration. The distributed disk restoration unit 205 restores an arbitrary distributed disk 101 based on the file management information stored in the file management information storage unit 203. The file management information stored in the file management information storage unit 203 will be described later with reference to FIG.

スーパーノード１０４は、分散ディスク記憶部２（２０６）と、分割符号化部２０７と、配信部２０８と、ファイル管理部２０９と、再生部２１０とを有する。分散ディスク記憶部２（２０６）は、複数の分散ディスクのノード名、及びＩＰアドレスなどを対応付けて保管する。分割符号化部２０７は、保管対象である元ファイルを分割して符号化して、符号化ファイルを作成する。配信部２０８は、前述の複数の符号化ファイルを、複数の分散ディスク１０１に配信する。ファイル管理部２０９は、保管対象である元ファイルを特定するＩＤ（元ファイル名）と、元ファイルを分割して符号化した符号化ファイルを特定するＩＤ（符号化ファイルＮｏ．）と、保管した分散ディスク１０１を特定するＩＤ（分散ディスクノード名）とを対応付けたファイル管理情報を作成し、管理端末１０２に送信する。なお、ファイル管理情報については、図５において後述する。再生部２１０は、元ファイルの再生の依頼をユーザ端末１０６から受け付けると、分散ディスク１０１から、依頼を受け付けた元ファイルに係るデータを、復元に必要な個数だけ取得し、元ファイルの再生を行う。 The super node 104 includes a distributed disk storage unit 2 (206), a division encoding unit 207, a distribution unit 208, a file management unit 209, and a reproduction unit 210. The distributed disk storage unit 2 (206) stores node names and IP addresses of a plurality of distributed disks in association with each other. The division encoding unit 207 generates an encoded file by dividing and encoding the original file to be stored. The distribution unit 208 distributes the plurality of encoded files described above to the plurality of distributed disks 101. The file management unit 209 stores an ID (original file name) for specifying the original file to be stored, an ID (encoded file No.) for specifying the encoded file obtained by dividing the original file, and the encoded file. File management information in association with an ID (distributed disk node name) for specifying the distributed disk 101 is created and transmitted to the management terminal 102. The file management information will be described later with reference to FIG. When the playback unit 210 receives a request for playback of the original file from the user terminal 106, the playback unit 210 acquires the number of data related to the source file for which the request has been received from the distributed disk 101, and plays back the original file. .

ユーザ端末１０６は、スーパーノード記憶部２１１と、保管依頼部２１２と、読込み依頼部２１３とを有する。スーパーノード記憶部２１１は、複数のスーパーノード１０４のノード名、及びＩＰアドレスなどを対応付けて保管する。保管依頼部２１２は、任意のスーパーノード１０４に対して、元ファイルの保管を依頼する。読込み依頼部２１３は、任意のスーパーノード１０４に対して、元ファイルの復元を依頼する。 The user terminal 106 includes a super node storage unit 211, a storage request unit 212, and a read request unit 213. The super node storage unit 211 stores node names, IP addresses, and the like of the plurality of super nodes 104 in association with each other. The storage request unit 212 requests an arbitrary super node 104 to store the original file. The read request unit 213 requests an arbitrary super node 104 to restore the original file.

図３は、分散ストレージシステム１００のハードウェア資源の一例を示す図である。分散ストレージシステム１００を構成する複数の分散ディスク１０１、管理端末１０２、複数のスーパーノード１０４、及び複数のユーザ端末１０６の各々は、例えば図３に示すようなパーソナルコンピュータにより構成されている。ただし、分散ディスク１０１については、ストレージ専用のデバイスであってもかまわない。 FIG. 3 is a diagram illustrating an example of hardware resources of the distributed storage system 100. Each of the plurality of distributed disks 101, the management terminal 102, the plurality of super nodes 104, and the plurality of user terminals 106 constituting the distributed storage system 100 is configured by a personal computer as shown in FIG. 3, for example. However, the distributed disk 101 may be a storage-dedicated device.

図３において、分散ストレージシステム１００を構成する複数の分散ディスク１０１、管理端末１０２、複数のスーパーノード１０４、及び複数のユーザ端末１０６の各々は、不図示のシステムユニット、ＣＲＴ（ＣａｔｈｏｄｅＲａｙＴｕｂｅ）やＬＣＤ（液晶ディスプレイ）の表示画面を有する表示装置３０１、キーボード３０２（Ｋ／Ｂ）、マウス３０３、ＦＤＤ３０４（ＦｌｅｘｉｂｌｅＤｉｓｋＤｒｉｖｅ）、ＣＤＤ３０５（コンパクトディスク装置）等のハードウェア資源を備え、これらはケーブルや信号線で接続されている。システムユニットは、パーソナルコンピュータであり、例えばＬＡＮ（ローカルエリアネットワーク）、ゲートウェイを介してＷＡＮに接続されている。 In FIG. 3, each of a plurality of distributed disks 101, a management terminal 102, a plurality of super nodes 104, and a plurality of user terminals 106 constituting the distributed storage system 100 includes a system unit (not shown), a CRT (Cathode Ray Tube), Hardware resources such as a display device 301 having an LCD (liquid crystal display) display screen, a keyboard 302 (K / B), a mouse 303, an FDD 304 (Flexible Disk Drive), and a CDD 305 (compact disk device) are provided. Connected with signal lines. The system unit is a personal computer, and is connected to the WAN via, for example, a LAN (local area network) or a gateway.

また、分散ストレージシステム１００を構成する複数の分散ディスク１０１、管理端末１０２、複数のスーパーノード１０４、及び複数のユーザ端末１０６の各々は、プログラムを実行するＣＰＵ３１１（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ、中央処理装置、演算装置ともいう）を備えている。ＣＰＵ３１１は、バス３１２を介してＲＯＭ３１３（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ３１４（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、通信ボード３１５、表示装置３０１、キーボード３０２、マウス３０３、ＦＤＤ３０４、ＣＤＤ３０５、磁気ディスク装置３２０と接続され、これらのハードウェアデバイスを制御する。磁気ディスク装置３２０の代わりに、光ディスク装置、メモリカードリーダライタ等の記録装置が用いられてもよい。 In addition, each of the plurality of distributed disks 101, the management terminal 102, the plurality of super nodes 104, and the plurality of user terminals 106 constituting the distributed storage system 100 includes a CPU 311 (Central Processing Unit, a central processing unit, an arithmetic unit that executes a program) Device). The CPU 311 is connected to a ROM 313 (Read Only Memory), a RAM 314 (Random Access Memory), a communication board 315, a display device 301, a keyboard 302, a mouse 303, an FDD 304, a CDD 305, and a magnetic disk device 320 via a bus 312. Control hardware devices. Instead of the magnetic disk device 320, a recording device such as an optical disk device or a memory card reader / writer may be used.

ＲＡＭ３１４は、揮発性メモリの一例である。ＲＯＭ３１３、ＦＤＤ３０４、ＣＤＤ３０５、磁気ディスク装置３２０の記憶媒体は、不揮発性メモリの一例である。 The RAM 314 is an example of a volatile memory. The storage media of the ROM 313, the FDD 304, the CDD 305, and the magnetic disk device 320 are an example of a nonvolatile memory.

通信ボード３１５は、ＬＡＮ、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）等に接続されている。ＷＡＮにはゲートウェイを介して接続されていてもよい。 The communication board 315 is connected to a LAN, a WAN (Wide Area Network), or the like. The WAN may be connected via a gateway.

磁気ディスク装置３２０には、オペレーティングシステム３２１（ＯＳ）、ウィンドウシステム３２２、プログラム群３２３、ファイル群３２４が記憶されている。プログラム群３２３のプログラムは、ＣＰＵ３１１、オペレーティングシステム３２１、ウィンドウシステム３２２により実行される。プログラム群３２３には、本実施の形態の説明において「〜部」、「〜手段」、「〜処理」として説明する機能を実行するプログラムが記憶されている。プログラムは、ＣＰＵ３１１により読み出され実行される。又はファイル群３２４には、本実施の形態の説明において、「〜データ」、「〜情報」、「〜結果」として説明するデータや信号値や変数値やパラメータが、「〜ファイル」や「〜データベース」や「〜テーブル」の各項目として記憶されている。 The magnetic disk device 320 stores an operating system 321 (OS), a window system 322, a program group 323, and a file group 324. The programs in the program group 323 are executed by the CPU 311, the operating system 321, and the window system 322. The program group 323 stores programs that execute functions described as “˜unit”, “˜means”, and “˜processing” in the description of the present embodiment. The program is read and executed by the CPU 311. Alternatively, the file group 324 includes data, signal values, variable values, and parameters described as “˜data”, “˜information”, and “˜result” in the description of the present embodiment. It is stored as each item of “database” and “˜table”.

また、本実施の形態の説明において説明するブロック図やフローチャートの矢印の部分は主としてデータや信号の入出力を示し、データや信号値は、ＲＡＭ３１４等のメモリ、ＦＤＤ３０４のフレキシブルディスク（ＦＤ）、ＣＤＤ３０５のコンパクトディスク（ＣＤ）、磁気ディスク装置３２０の磁気ディスク、その他光ディスク、ミニディスク（ＭＤ）、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）等の記録媒体に記録される。また、データや信号は、バス３１２や信号線やケーブルその他の伝送媒体によりオンライン伝送される。 In addition, the arrows in the block diagrams and flowcharts described in the description of this embodiment mainly indicate input / output of data and signals. Data and signal values are stored in a memory such as a RAM 314, a flexible disk (FD) of an FDD 304, and a CDD 305. Recording disks such as a compact disk (CD), a magnetic disk of the magnetic disk device 320, other optical disks, a mini disk (MD), and a DVD (Digital Versatile Disk). Data and signals are transmitted online via the bus 312, signal lines, cables and other transmission media.

また、本実施の形態の説明において「〜部」、「〜手段」、「〜処理」として説明するものは、ＲＯＭ３１３に記憶されたファームウェアで実現されていても構わない。あるいは、ソフトウェアのみ、あるいは、ハードウェアのみ、あるいは、ソフトウェアとハードウェアとの組み合わせ、さらには、ファームウェアとの組み合わせで実現されていても構わない。ファームウェアとソフトウェアは、プログラムとして、磁気ディスク、フレキシブルディスク、光ディスク、コンパクトディスク、ミニディスク、ＤＶＤ等の記録媒体に記憶される。このプログラムはＣＰＵ３１１により読み出され、ＣＰＵ３１１により実行される。即ち、プログラムは、本実施の形態の説明で述べる「〜部」、「〜手段」、「〜処理」としてコンピュータを機能させるものである。あるいは、本実施の形態の説明で述べる「〜部」、「〜手段」、「〜処理」の手順をコンピュータに実行させるものである。 In addition, what is described as “˜unit”, “˜means”, and “˜processing” in the description of the present embodiment may be realized by firmware stored in the ROM 313. Alternatively, it may be realized by software alone, hardware alone, a combination of software and hardware, or a combination of firmware. Firmware and software are stored as programs in a recording medium such as a magnetic disk, a flexible disk, an optical disk, a compact disk, a mini disk, and a DVD. This program is read by the CPU 311 and executed by the CPU 311. In other words, the program causes the computer to function as “˜unit”, “˜means”, and “˜processing” described in the description of the present embodiment. Alternatively, the procedure of “˜unit”, “˜means”, and “˜processing” described in the description of the present embodiment is executed by a computer.

図４は、消失訂正符号化技術を利用した、分割符号化部２０７における分割と符号化の手順と、再生部２１０における復号化の手順を説明する模式図である。まず、符号化について説明する。分割符号化部２０７は、保管対象である元ファイル４０１を１００個の情報パケット４０２に分割し、さらに、１００個の情報パケット４０２を符号化（冗長化）して、１５０個の符号化ファイル４０３を生成する。次に、復号化について説明する。再生部２１０は、前述の１５０個の符号化ファイル４０３の内、任意の一定数（例えば１０５個）の符号化ファイルを集めて復号化すれば、元ファイル４０１と同じ内容の復号ファイル４０５を再生できる。 FIG. 4 is a schematic diagram illustrating a division and encoding procedure in the division encoding unit 207 and a decoding procedure in the reproduction unit 210 using the erasure correction encoding technique. First, encoding will be described. The division encoding unit 207 divides the original file 401 to be stored into 100 information packets 402, further encodes (redundant) 100 information packets 402, and generates 150 encoded files 403. Is generated. Next, decoding will be described. The reproduction unit 210 reproduces a decoded file 405 having the same contents as the original file 401 by collecting and decoding an arbitrary fixed number (for example, 105) of the 150 encoded files 403 described above. it can.

すなわち、消失訂正符号化技術を利用すると、データの一部が消失しても、元データが再生できるという特長をもつ。なお、消失訂正符号化技術自体については周知であるので、詳細な説明は省略する。また、本実施の形態では、元ファイル４０１を１００個の情報パケット４０２に分割し、１５０個の符号化ファイルを作成し、任意の一定数（例えば１０５個）の符号化ファイル４０３から復号ファイル４０５を再生しているが、前述の１００個、１５０個、１０５個の各数値は、消失訂正符号化技術のアルゴリズムの変更により、変更が可能である。 In other words, the use of the erasure correction coding technique has a feature that the original data can be reproduced even if a part of the data is lost. Since the erasure correction coding technique itself is well known, detailed description thereof is omitted. Further, in the present embodiment, the original file 401 is divided into 100 information packets 402, 150 encoded files are created, and an arbitrary fixed number (for example, 105) of encoded files 403 are decoded files 405. However, the numerical values of 100, 150, and 105 can be changed by changing the algorithm of the erasure correction coding technique.

さらに本実施の形態では、前述のようにして作成した１５０個の符号化ファイルを、複数の分散ディスク１０１に分散させて保管するので、任意の一部の分散ディスク１０１が故障、電源断、メンテナンス、及びネットワーク回線の断線等の要因で正常稼動していなくても、正常稼動している分散ディスク１０１から任意の一定数の符号化ファイルを取得し、復号化し、元ファイル４０１と同じ内容の復号ファイル４０５を再生することができるため、高い信頼性、連続可動性を得ることができる。 Furthermore, in the present embodiment, 150 encoded files created as described above are distributed and stored in a plurality of distributed disks 101, so that any part of the distributed disks 101 may fail, power off, or be maintained. Even if the network line is not operating normally due to disconnection or the like, an arbitrary fixed number of encoded files are obtained from the normally operating distributed disk 101 and decoded, and the same contents as the original file 401 are decoded. Since the file 405 can be reproduced, high reliability and continuous mobility can be obtained.

次に、図５を用いて、ファイル管理情報、元ファイルの名称、及び符号化ファイルの名称について説明する。図５は、ファイル管理部２０９において生成するファイル管理情報のデータ例の図である。ファイル管理情報とは、保管対象である元ファイルの名称である元ファイル名と、元ファイルを分割して符号化した符号化ファイルを特定するための符号化ファイルＮｏ．と、符号化ファイルを保管した分散ディスク１０１を特定するための分散ディスクノード名とを対応付けた情報である。なお、本実施の形態において、元ファイル名はユニークであることを前提としている。また、ひとつの元ファイル名に対応する符号化ファイルＮｏ．は、ユニークである。 Next, the file management information, the name of the original file, and the name of the encoded file will be described with reference to FIG. FIG. 5 is a diagram illustrating a data example of file management information generated by the file management unit 209. The file management information includes an original file name that is the name of the original file to be stored, and an encoded file No. for specifying an encoded file obtained by dividing and encoding the original file. And the distributed disk node name for identifying the distributed disk 101 that stores the encoded file. In the present embodiment, it is assumed that the original file name is unique. The encoded file No. corresponding to one original file name is also displayed. Is unique.

また、本実施の形態において、符号化ファイルの名称は、元ファイル名と＿（アンダーバー）と符号化ファイルＮｏ．とを結合したものとする。例えば、元ファイル名が「Ｄｉｇｉｔａｌｍｏｖｉｅ１」であり、符号化ファイルＮｏ．が「００１」であれば、対応する符号化ファイルの名称は「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００１」である。すなわち、符号化ファイルの名称は、元ファイルの名称を含んでいる。 In the present embodiment, the name of the encoded file includes the original file name, _ (underbar), encoded file No. Are combined. For example, the original file name is “Digitalmovie1”, and the encoded file No. Is “001”, the name of the corresponding encoded file is “Digitalmovie1_001”. That is, the name of the encoded file includes the name of the original file.

次に、図６を用いて、管理端末１０２のファイル管理情報記憶部２０３に保管するファイル管理情報について説明する。図５では、ファイル管理部２０９において生成する、ひとつの元ファイルに係るファイル管理情報について説明したが、図６において説明するファイル管理情報記憶部２０３に保管するファイル管理情報とは、全ての分散ディスク１０１に保管している全ての元ファイルに係るファイル管理情報である。図６の例では、元ファイルＤｉｇｉｔａｌｍｏｖｉｅ１、Ｄｉｇｉｔａｌｍｏｖｉｅ２、Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１について、元ファイル名と、元ファイルを分割して符号化した符号化ファイルを特定するための符号化ファイルＮｏ．と、符号化ファイルを保管した分散ディスク１０１を特定するための分散ディスクノード名とを対応付けたファイル管理情報を、ファイル管理部２０９に保管している。 Next, file management information stored in the file management information storage unit 203 of the management terminal 102 will be described with reference to FIG. In FIG. 5, the file management information relating to one original file generated by the file management unit 209 has been described. However, the file management information stored in the file management information storage unit 203 described in FIG. This is file management information related to all original files stored in 101. In the example of FIG. 6, for the original files Digitalmovie1, Digitalmovie2, and Digitalpicture1, the original file name and the encoded file No. for specifying the encoded file obtained by dividing the original file and encoding it. And file management information in which the distributed disk node name for specifying the distributed disk 101 storing the encoded file is associated is stored in the file management unit 209.

次に図７、図８、及び図９のフローチャートを用いて、分散ストレージシステム１００におけるデータ保管の処理の流れについて説明する。分散ストレージシステム１００におけるデータ保管とは、図４において説明したように、消失訂正符号化技術を利用して、保管対象の元ファイルを分割し符号化した、複数の符号化ファイルを作成して、これらの符号化ファイル各々を、異なる分散ディスク１０１に保管することである。 Next, the flow of data storage processing in the distributed storage system 100 will be described using the flowcharts of FIGS. 7, 8, and 9. As described in FIG. 4, data storage in the distributed storage system 100 creates a plurality of encoded files obtained by dividing and encoding the original file to be stored using the erasure correction encoding technology, Each of these encoded files is stored on a different distributed disk 101.

図７において、ユーザ端末１０６の保管依頼部２１２は、ユーザからデータ保管の依頼を受け付けると、スーパーノード記憶部２１１から、任意の１台のスーパーノード１０４を選択する（ステップＳ７０１）。なお、ユーザからは複数のスーパーノード１０４がひとつの仮想スーパーノードに見える。ユーザ端末１０６の保管依頼部２１２は、ランダムに任意の１台のスーパーノード１０４を選択し、該当のスーパーノード１０４が正常稼動していなければ、正常稼動しているスーパーノード１０４が選択されるまで、同様にランダムに任意の１台のスーパーノード１０４を選択する。又は、各スーパーノード１０４の負荷状況を確認して、負荷の低いスーパーノード１０４を選択する。又は、その他の方法で任意の１台のスーパーノード１０４を選択してもよい。次に、ユーザ端末１０６の保管依頼部２１２は、選択したスーパーノード１０４に、保管の対象となる元ファイルと保管指示を送信する（ステップＳ７０２）。 In FIG. 7, when the storage request unit 212 of the user terminal 106 receives a data storage request from the user, the storage request unit 212 selects any one super node 104 from the super node storage unit 211 (step S <b> 701). Note that a plurality of super nodes 104 appear to the user as one virtual super node. The storage request unit 212 of the user terminal 106 randomly selects an arbitrary super node 104. If the corresponding super node 104 is not operating normally, until the super node 104 operating normally is selected. Similarly, any one super node 104 is randomly selected. Alternatively, the load status of each super node 104 is confirmed, and the super node 104 with a low load is selected. Alternatively, any one super node 104 may be selected by other methods. Next, the storage request unit 212 of the user terminal 106 transmits the original file to be stored and a storage instruction to the selected super node 104 (step S702).

次に、保管指示を受信したスーパーノード１０４の分割符号化部２０７は、受信した元ファイルを１００個に分割し、符号化し、１５０個の符号化ファイルを生成する（ステップＳ７０３）。ここで、分割と符号化にあたっては、図４において説明したように、消失訂正符号化技術を利用する。また、図５において説明したように、符号化ファイルの名称は、元ファイルの名称を含んでいる。 Next, the division encoding unit 207 of the super node 104 that has received the storage instruction divides the received original file into 100, encodes, and generates 150 encoded files (step S703). Here, in the division and encoding, the erasure correction encoding technique is used as described in FIG. Further, as described in FIG. 5, the name of the encoded file includes the name of the original file.

次に、スーパーノード１０４の配信部２０８は、分割符号化部２０７において生成した１５０個の符号化ファイルと保管指示を、分散ディスク記憶部２（２０６）に記憶された３００個の分散ディスク１０１の内、任意の１５０台の分散ディスク１０１へ各々送信する（ステップＳ７０４）。なお、ステップＳ７０４の詳細については、図８において後述する。 Next, the distribution unit 208 of the super node 104 stores the 150 encoded files generated in the division encoding unit 207 and the storage instruction in the 300 distributed disks 101 stored in the distributed disk storage unit 2 (206). Among them, each of them is transmitted to any 150 distributed disks 101 (step S704). Details of step S704 will be described later with reference to FIG.

次に、スーパーノード１０４のファイル管理部２０９は、元ファイルと分割・符号化した符号化ファイルとの対応に係る情報を分割符号化部２０７から取得し、符号化ファイルと保管先の分散ディスク１０１との対応に係る情報を配信部２０８から取得し、元ファイルと符号化ファイルと分散ディスク１０１との対応を表すファイル管理情報を生成して、管理端末１０２のファイル管理情報記憶部２０３に送信する（ステップＳ７０５）。ここで、ファイル管理情報については、図５において説明した通りである。なお、ステップＳ７０５の詳細については、図９において後述する。 Next, the file management unit 209 of the super node 104 obtains information related to the correspondence between the original file and the encoded file that has been divided and encoded from the divided encoding unit 207, and the encoded file and the storage destination distributed disk 101. Is acquired from the distribution unit 208, file management information indicating the correspondence between the original file, the encoded file, and the distributed disk 101 is generated and transmitted to the file management information storage unit 203 of the management terminal 102. (Step S705). Here, the file management information is as described in FIG. Details of step S705 will be described later with reference to FIG.

次に、図８において図７のステップＳ７０４の詳細な処理の流れについて説明する。スーパーノード１０４の配信部２０８は、全ての分散ディスク１０１に対して、正常稼動中かどうかの確認要求を行う（ステップＳ８０１）。ステップＳ８０１における確認要求に対して、正常稼動中の分散ディスク１０１が応答する（ステップＳ８０２）。 Next, the detailed processing flow of step S704 in FIG. 7 will be described with reference to FIG. The distribution unit 208 of the super node 104 makes a confirmation request as to whether all the distributed disks 101 are operating normally (step S801). In response to the confirmation request in step S801, the normally operating distributed disk 101 responds (step S802).

一方、スーパーノード１０４の配信部２０８は、一定時間、分散ディスク１０１からの応答を待機する（ステップＳ８０３）。そして、一定時間経過後、応答のあった分散ディスク１０１から１５０台を任意に選択する（ステップＳ８０４）。スーパーノード１０４の配信部２０８は、符号化ファイルを保管可能な程度にディスク容量の空きがある、任意の１５０台の分散ディスク１０１をランダムに選択する。又は、各分散ディスク１０１のディスク容量の空きを確認し、空き容量の大きい分散ディスク１０１から順に１５０台選択する。又は、その他の方法で任意の１５０台の分散ディスク１０１を選択してもよい。次に、１５０台の分散ディスク１０１へ、１５０個の符号化ファイルを各々送信する（ステップＳ８０５）。次に、分散ディスク１０１の符号化データ記憶部２０１は、ステップＳ８０５において送信された符号化ファイルを受信して、保管する（ステップＳ８０６）。なお、ステップＳ８０６において正常に保管することができない分散ディスク１０１があった場合は、別の分散ディスク１０１に送信して、保管する。 On the other hand, the distribution unit 208 of the super node 104 waits for a response from the distributed disk 101 for a certain time (step S803). Then, after a certain period of time, 150 units are arbitrarily selected from the distributed disks 101 that have responded (step S804). The distribution unit 208 of the super node 104 randomly selects any 150 distributed disks 101 that have enough disk capacity to store the encoded file. Alternatively, the disk capacity of each distributed disk 101 is confirmed to be free, and 150 units are selected in order from the distributed disk 101 having the largest available capacity. Alternatively, any 150 distributed disks 101 may be selected by other methods. Next, 150 encoded files are transmitted to 150 distributed disks 101, respectively (step S805). Next, the encoded data storage unit 201 of the distributed disk 101 receives and stores the encoded file transmitted in step S805 (step S806). If there is a distributed disk 101 that cannot be normally stored in step S806, it is transmitted to another distributed disk 101 and stored.

次に、図９において図７のステップＳ７０５の詳細な処理の流れについて説明する。スーパーノード１０４のファイル管理部２０９は、ファイル管理情報を作成する（ステップＳ９０１）。ファイル管理情報は、図５において説明したように、保管対象である元ファイルの名称である元ファイル名と、元ファイルを分割して符号化した符号化ファイルを特定するための符号化ファイルＮｏ．と、符号化ファイルを保管した分散ディスク１０１を特定するための分散ディスクノード名とを対応付けた情報である。なお、元ファイル名と符号化ファイルＮｏ．との対応に係る情報は、分割符号化部２０７から取得し、符号化ファイルＮｏ．と分散ディスクノード名との対応に係る情報は、配信部２０８から取得する。次に、スーパーノード１０４のファイル管理部２０９は、ファイル管理情報を管理端末１０２のファイル管理情報記憶部２０３に送信する（ステップＳ９０２）。 Next, the detailed processing flow of step S705 in FIG. 7 will be described with reference to FIG. The file management unit 209 of the super node 104 creates file management information (step S901). As described with reference to FIG. 5, the file management information includes the original file name that is the name of the original file to be stored and the encoded file No. for specifying the encoded file obtained by dividing and encoding the original file. And the distributed disk node name for identifying the distributed disk 101 that stores the encoded file. Note that the original file name and the encoded file No. Is obtained from the division encoding unit 207, and the encoded file No. And the information on the correspondence between the distributed disk node names are acquired from the distribution unit 208. Next, the file management unit 209 of the super node 104 transmits file management information to the file management information storage unit 203 of the management terminal 102 (step S902).

次に、管理端末１０２のファイル管理情報記憶部２０３は、受信したファイル管理情報を保管する（ステップＳ９０３）。次に、正常受信した旨を、スーパーノード１０４のファイル管理部２０９に応答する（ステップＳ９０４）。次に、スーパーノード１０４のファイル管理部２０９は、ファイル管理情報等（図７、図８、及び図９のフローチャートを用いて説明したデータ保管に係る処理で扱った各種情報であり、ファイル管理情報、元ファイル、符号化ファイル、元ファイル又は前記複数の符号化ファイルを特定することができる情報、前記複数の符号化ファイルを保管した複数の分散ファイルを特定することができる情報等）を削除する（ステップＳ９０５）。図７、図８、及び図９のフローチャートを用いて説明したデータ保管の処理の流れにおいて、スーパーノード１０４の分割符号化部２０７、配信部２０８、ファイル管理部２０９は、前述のファイル管理情報等を、一切、不揮発性メモリ（磁気ディスク装置３２０等）に記憶せず、揮発性メモリ（ＲＡＭ３１４等）に一時的に記憶された前述の情報についても、ステップＳ９０５において、ファイル管理部２０９により明示的に削除される。又は、揮発性メモリ（ＲＡＭ３１４）に一時的に記憶された前述の情報については、ＯＳ３２１により、プログラム終了時にメモリの開放と共に、自動的に削除されるものであってもよい。又は、スーパーノード１０４の分割符号化部２０７、配信部２０８、ファイル管理部２０９は、前述のファイル管理情報等の全部又は一部を、不揮発性メモリ（磁気ディスク装置３２０等）に記憶してもよいが、その場合には、ステップＳ９０５において、ファイル管理部２０９により明示的に削除される。 Next, the file management information storage unit 203 of the management terminal 102 stores the received file management information (step S903). Next, the fact that it has been normally received is responded to the file management unit 209 of the super node 104 (step S904). Next, the file management unit 209 of the super node 104 includes file management information and the like (various information handled in the data storage processing described with reference to the flowcharts of FIGS. 7, 8, and 9). , The original file, the encoded file, the information that can specify the original file or the plurality of encoded files, the information that can specify the plurality of distributed files that store the plurality of encoded files, and the like are deleted. (Step S905). In the data storage processing flow described with reference to the flowcharts of FIGS. 7, 8, and 9, the division encoding unit 207, the distribution unit 208, and the file management unit 209 of the super node 104 include the file management information and the like described above. Are not stored in the non-volatile memory (such as the magnetic disk device 320), and the above information temporarily stored in the volatile memory (such as the RAM 314) is also explicitly displayed by the file management unit 209 in step S905. Deleted. Alternatively, the above-described information temporarily stored in the volatile memory (RAM 314) may be automatically deleted by the OS 321 when the program is terminated and the memory is released. Alternatively, the division encoding unit 207, the distribution unit 208, and the file management unit 209 of the super node 104 may store all or a part of the above-described file management information in a nonvolatile memory (such as the magnetic disk device 320). In this case, the file management unit 209 explicitly deletes the file in step S905.

次に図１０のフローチャートを用いて、分散ストレージシステム１００におけるデータ読込みの処理の流れについて説明する。分散ストレージシステム１００におけるデータ読込みとは、複数の分散ディスク１０１に分散保管された符号化ファイルを集めて、図４において説明したように、消失訂正符号化技術を利用して、ひとつの元ファイルを再生することである。 Next, the flow of data read processing in the distributed storage system 100 will be described using the flowchart of FIG. Data reading in the distributed storage system 100 refers to collecting encoded files distributed and stored in a plurality of distributed disks 101, and using an erasure correction encoding technique as described in FIG. To play.

図１０において、ユーザ端末１０６の読込み依頼部２１３は、ユーザからデータ読込みの依頼を受け付けると、スーパーノード記憶部２１１から、任意の１台のスーパーノード１０４を選択する（ステップＳ１００１）。なお、ユーザからは複数のスーパーノード１０４がひとつの仮想スーパーノードに見える。選択するスーパーノード１０４は、保管時に指定したスーパーノード１０４である必要はない。ユーザ端末１０６の読込み依頼部２１３は、ランダムに任意の１台のスーパーノード１０４を選択し、該当のスーパーノード１０４が正常稼動していなければ、正常稼動しているスーパーノード１０４が選択されるまで、同様にランダムに任意の１台のスーパーノード１０４を選択する。又は、各スーパーノード１０４の負荷状況を確認して、負荷の低いスーパーノード１０４を選択する。又は、その他の方法で任意の１台のスーパーノード１０４を選択してもよい。次に、ユーザ端末１０６の読込み依頼部２１３は、選択したスーパーノード１０４に、読込みの対象となる元ファイル名と読込み指示を送信する（ステップＳ１００２）。 In FIG. 10, the read request unit 213 of the user terminal 106 selects an arbitrary super node 104 from the super node storage unit 211 when receiving a data read request from the user (step S1001). Note that a plurality of super nodes 104 appear to the user as one virtual super node. The super node 104 to be selected does not need to be the super node 104 designated at the time of storage. The read request unit 213 of the user terminal 106 randomly selects an arbitrary super node 104, and if the corresponding super node 104 is not operating normally, until the super node 104 operating normally is selected. Similarly, any one super node 104 is randomly selected. Alternatively, the load status of each super node 104 is confirmed, and the super node 104 with a low load is selected. Alternatively, any one super node 104 may be selected by other methods. Next, the read request unit 213 of the user terminal 106 transmits the original file name to be read and the read instruction to the selected super node 104 (step S1002).

次に、読込み指示を受信したスーパーノード１０４の再生部２１０は、分散ディスク記憶部２（２０６）に記憶された全分散ディスク１０１に対して、元ファイルから生成した符号化ファイルを保管しているかどうかの確認要求を行う（ステップＳ１００３）。ここで、確認要求と共に、元ファイル名を送信する。 Next, the reproducing unit 210 of the super node 104 that has received the read instruction stores the encoded file generated from the original file in all the distributed disks 101 stored in the distributed disk storage unit 2 (206). A confirmation request for whether or not is made (step S1003). Here, the original file name is transmitted together with the confirmation request.

次に、分散ディスク１０１の符号化データ記憶部２０１は、ステップＳ１００３で受信した元ファイル名をキーとして検索し、元ファイルから生成した符号化ファイルを保管している場合に、応答を返す（ステップＳ１００４）。なお、図５において説明したように、符号化ファイルの名称は、元ファイルの名称を含んでいる。従って、分散ディスク１０１の符号化データ記憶部２０１は、スーパーノード１０４の再生部２１０から受信した元ファイル名と符号化ファイル名とを比べて、元ファイル名と＿（アンダーバー）と数値とを結合した名称を有する符号化ファイルが存在すれば、元ファイルから生成した符号化ファイルを保管しているとして、応答を返す。 Next, the encoded data storage unit 201 of the distributed disk 101 searches using the original file name received in Step S1003 as a key, and returns a response when the encoded file generated from the original file is stored (Step S1003). S1004). As described with reference to FIG. 5, the name of the encoded file includes the name of the original file. Therefore, the encoded data storage unit 201 of the distributed disk 101 compares the original file name received from the playback unit 210 of the super node 104 with the encoded file name, and combines the original file name, _ (underbar), and a numerical value. If there is an encoded file having the specified name, a response is returned assuming that the encoded file generated from the original file is stored.

なお、ステップＳ１００３において、読込み指示を受信したスーパーノード１０４の再生部２１０は、元ファイル名と＿（アンダーバー）と所定の数値（本実施の形態では「００１」〜「１５０」）とを結合した名称を作成して、全分散ディスク１０１に送信し、次に、ステップＳ１００４では、分散ディスク１０１の符号化データ記憶部２０１は、ステップＳ１００３で受信した名称の符号化ファイルを検索し、存在すれば、元ファイルから生成した符号化ファイルを保管しているとして、応答を返してもよい。 In step S1003, the reproduction unit 210 of the super node 104 that has received the read instruction combines the original file name, _ (underscore), and a predetermined numerical value ("001" to "150" in the present embodiment). A name is created and transmitted to all the distributed disks 101. Next, in step S1004, the encoded data storage unit 201 of the distributed disk 101 searches for the encoded file received in step S1003, and if it exists. The response may be returned assuming that the encoded file generated from the original file is stored.

次に、応答を受信したスーパーノード１０４の再生部２１０は、応答のあった分散ディスク１０１から、１０５個の符号化データを取得する（ステップＳ１００５）。次に、符号化データを復号し、元ファイルと同じ内容の復号ファイルを再生する（ステップＳ１００６）。ここで、復号化にあたっては、図４において説明したように、消失訂正符号化技術を利用する。次に、復号ファイルをユーザ端末１０６へ送信する（ステップＳ１００７）。次に、ユーザ端末１０６の読込み依頼部２１３は、復号ファイルを受信する（ステップＳ１００８）。なお、ユーザ端末１０６は、表示装置３０１等を介して、ユーザに復号ファイルの内容を表示してもよい。 Next, the reproducing unit 210 of the super node 104 that has received the response acquires 105 encoded data from the distributed disk 101 that has responded (step S1005). Next, the encoded data is decoded, and a decoded file having the same contents as the original file is reproduced (step S1006). Here, in decoding, the erasure correction coding technique is used as described in FIG. Next, the decrypted file is transmitted to the user terminal 106 (step S1007). Next, the read request unit 213 of the user terminal 106 receives the decrypted file (step S1008). Note that the user terminal 106 may display the contents of the decrypted file to the user via the display device 301 or the like.

また、ステップＳ１００３では、全分散ディスク１０１に対して符号化ファイルを保管しているかどうかの確認要求を行い、ステップＳ１００４では該当する分散ディスク１０１が応答し、ステップＳ１００５では、応答のあった分散ディスク１０１から１０５個の符号化データを取得しているが、ひとつの全分散ディスク１０１に対して符号化ファイルを保管しているかどうかの確認要求を行い、応答があれば符号化データを取得することを、１０５回繰り返すものであってもよい。 In step S1003, a confirmation request is made to determine whether or not the encoded file is stored in all the distributed disks 101. In step S1004, the corresponding distributed disk 101 responds, and in step S1005, the distributed disk that has responded. 101 to 105 pieces of encoded data have been acquired, but a confirmation request is made as to whether or not the encoded file is stored for one all distributed disk 101, and if there is a response, the encoded data is acquired. May be repeated 105 times.

次に図１１、及び図１２のフローチャートを用いて、分散ストレージシステム１００における分散ディスク復元の処理の流れについて説明する。分散ストレージシステム１００における分散ディスク復元とは、任意の１台の分散ディスク１０１に保管された符号化ファイル全てを、まとめて復元することである。 Next, the flow of distributed disk restoration processing in the distributed storage system 100 will be described using the flowcharts of FIGS. 11 and 12. The distributed disk restoration in the distributed storage system 100 is to collectively restore all the encoded files stored in one arbitrary distributed disk 101.

図１１において、管理端末１０２の分散ディスク状態確認部２０４から、全分散ディスク１０１に対して、正常稼動しているかどうかの確認要求を行う（ステップＳ１１０１）。なお、確認要求は、予め定められた時間に定期的に行うようになっていてもよい。 In FIG. 11, the distributed disk status confirmation unit 204 of the management terminal 102 makes a confirmation request as to whether all the distributed disks 101 are operating normally (step S1101). The confirmation request may be made periodically at a predetermined time.

次に、確認要求を受信した分散ディスク１０１は、自身が正常稼動中であれば、応答する（ステップＳ１１０２）。次に、管理端末１０２の分散ディスク状態確認部２０４は、一定期間、正常稼動の旨の応答がない分散ディスク１０１に対して、故障と判断し、アラームを上げる（ステップＳ１１０３）。次に、人手により、故障中の分散ディスク１０１の交換、又は修理を行う（ステップＳ１１０４）。ただし、この時、ＩＰアドレスは交換、修理前と同じものを引き継ぐ。次に、分散ディスク１０１は、自身が正常稼動したこと及び復元依頼情報を、そのＩＰアドレスと共に、管理端末１０２に通知する（ステップＳ１１０５）。次に、管理端末１０２の分散ディスク復元部２０５は、故障後正常稼動した分散ディスク１０１について、自動でデータ復元処理を行う（ステップＳ１１０６）。なお、データ復元処理については、図１２において後述する。また、データ復元処理を行う前に、ファイル管理情報記憶部２０３に保管されたファイル管理情報と、故障後正常稼動した分散ディスク１０１の符号化データ記憶部２０１に保管する符号化ファイルとを比較することにより、ファイル管理情報には記録されているが、符号化データ記憶部２０１からは失われた符号化ファイルがあるかどうかを判断し、失われた符号化ファイルがあれば、自動でデータ復元処理を行うようにしてもよい。 Next, the distributed disk 101 that has received the confirmation request responds if it is operating normally (step S1102). Next, the distributed disk status confirmation unit 204 of the management terminal 102 determines that the distributed disk 101 that has not responded for normal operation for a certain period of time is a failure and raises an alarm (step S1103). Next, the failed distribution disk 101 is replaced or repaired manually (step S1104). However, at this time, the same IP address as before the replacement or repair is taken over. Next, the distributed disk 101 notifies the management terminal 102 of the normal operation of itself and the restoration request information together with its IP address (step S1105). Next, the distributed disk restoration unit 205 of the management terminal 102 automatically performs data restoration processing on the distributed disk 101 that has been operating normally after the failure (step S1106). The data restoration process will be described later with reference to FIG. Further, before performing the data restoration process, the file management information stored in the file management information storage unit 203 is compared with the encoded file stored in the encoded data storage unit 201 of the distributed disk 101 that has normally operated after the failure. Thus, it is determined whether there is a lost encoded file from the encoded data storage unit 201 that is recorded in the file management information, and if there is a lost encoded file, the data is automatically restored. Processing may be performed.

次に、図１２において図１１のステップＳ１１０６の詳細な処理の流れについて説明する。なお、説明を補足するために、図６のファイル管理情報の例を用いる。管理端末１０２の分散ディスク復元部２０５は、分散ディスク記憶部１（２０２）を参照し、受け付けたＩＰアドレスが分散ディスクとして登録されていることと、ＩＰアドレスに対応する分散ディスクノード名を検索して、復元依頼を受け付けた分散ディスク１０１の分散ディスクノード名を認識する（ステップＳ１２０１）。ここで、例えば、分散ディスクノード名が「ＰＣ００５」であったとする。 Next, the detailed processing flow of step S1106 of FIG. 11 will be described with reference to FIG. In order to supplement the description, the example of the file management information in FIG. 6 is used. The distributed disk restoration unit 205 of the management terminal 102 refers to the distributed disk storage unit 1 (202), and searches the distributed disk node name corresponding to the IP address that the accepted IP address is registered as the distributed disk. Then, the distributed disk node name of the distributed disk 101 that received the restoration request is recognized (step S1201). Here, for example, it is assumed that the distributed disk node name is “PC005”.

次に、分散ディスク復元部２０５は、ファイル管理情報記憶部２０３を参照し、ファイル管理情報から、認識した分散ディスクノード名に対応する元ファイル名と符号化ファイルＮｏ．とを全て検索する（ステップＳ１２０２）。例えば、ファイル管理情報記憶部２０３は図６に示すファイル管理情報を有していた場合、認識した分散ディスクノード名「ＰＣ０００５」に対応する元ファイル名と符号化ファイルＮｏ．は、二組有り、一組は元ファイル名が「Ｄｉｇｉｔａｌｍｏｖｉｅ１」であり、符号化ファイルＮｏ．が「００２」である。また、別のもう一組は、元ファイル名が「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」であり、符号化ファイルＮｏ．が「００１」である。すなわち、分散ディスク「ＰＣ００５」上の符号化ファイルは、「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００２」と「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿００１」である。 Next, the distributed disk restoration unit 205 refers to the file management information storage unit 203, and from the file management information, the original file name corresponding to the recognized distributed disk node name and the encoded file number. Are searched (step S1202). For example, if the file management information storage unit 203 has the file management information shown in FIG. 6, the original file name corresponding to the recognized distributed disk node name “PC0005” and the encoded file No. , There are two sets, one set has the original file name “Digitalmovie1”, and the encoded file No. Is “002”. In another set, the original file name is “Digitalpicture1”, and the encoded file No. Is “001”. That is, the encoded files on the distributed disk “PC005” are “Digitalmovie1_002” and “Digitalpicture1_001”.

次に、分散ディスク復元部２０５は、検索した元ファイル名ごとに、ステップＳ１２０３からステップＳ１２０８までの処理を繰り返す。例えば、元ファイル「Ｄｉｇｉｔａｌｍｏｖｉｅ１」について、ステップＳ１２０３からステップＳ１２０９までの処理を行い、次に、元ファイル名「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」について、ステップＳ１２０３からステップＳ１２０９までの処理を行う。以下、まず、元ファイル「Ｄｉｇｉｔａｌｍｏｖｉｅ１」について、ステップＳ１２０３からステップＳ１２０９までの処理を行う。 Next, the distributed disk restoration unit 205 repeats the processing from step S1203 to step S1208 for each retrieved original file name. For example, the process from step S1203 to step S1209 is performed for the original file “Digitalmovie1”, and then the process from step S1203 to step S1209 is performed for the original file name “Digitalpicture1”. Hereinafter, first, the process from step S1203 to step S1209 is performed for the original file “Digitalmovie1”.

分散ディスク復元部２０５は、符号化ファイルＮｏ．から、前記ファイルを復号するために必要な他の符号化ファイルＮｏ．と、前記他の符号化ファイルＮｏ．に対応する稼動中の分散ディスクノード名とを確定する（ステップＳ１２０３）。例えば、本実施の形態では、符号化ファイルは１５０個存在することになっているので、元ファイル名「Ｄｉｇｉｔａｌｍｏｖｉｅ１」を復号するために用いることができる他の符号化ファイルＮｏ．は、「００２」を除いたものであり、すなわち「００１」と、「００３」〜「１５０」とである。 The distributed disk restoration unit 205 stores the encoded file number. To another encoded file No. required for decoding the file. And the other encoded file No. The active distributed disk node name corresponding to is determined (step S1203). For example, since there are 150 encoded files in the present embodiment, another encoded file No. that can be used to decode the original file name “Digitalmovie 1” is used. Is obtained by removing “002”, that is, “001” and “003” to “150”.

次に、分散ディスク復元部２０５は、稼動中の分散ディスク１０１から、復元に必要な符号化ファイルＮｏ．に対応する符号化ファイルを取得する（ステップＳ１２０４）。例えば、本実施の形態では、復号化に必要な符号化ファイルは１０５個となっているので、元ファイル名「Ｄｉｇｉｔａｌｍｏｖｉｅ１」と＿（アンダーバー）と符号化ファイルＮｏ．を組合わせた符号化ファイル名であり、かつ符号化ファイルＮｏ．は、「００１」、または「００３」〜「１５０」であるような符号化ファイル名を有する符号化ファイルを、１０５個、正常稼動中の分散ディスク１０１から取得する。すなわち、符号化ファイル「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００１」、「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００３」〜「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿１５０」の内の１０５個のファイルを、正常稼動中の分散ディスク１０１から取得する。 Next, the distributed disk restoration unit 205 reads the encoded file No. required for restoration from the active distributed disk 101. An encoded file corresponding to is acquired (step S1204). For example, in the present embodiment, since there are 105 encoded files necessary for decoding, the original file name “Digitalmovie1”, _ (underbar), encoded file No. Are encoded file names, and the encoded file No. Obtains 105 encoded files having encoded file names such as “001” or “003” to “150” from the distributed disk 101 operating normally. That is, 105 files of the encoded files “Digitalmovie1_001”, “Digitalmovie1_003” to “Digitalmovie1_150” are acquired from the distributed disk 101 that is operating normally.

次に、分散ディスク復元部２０５は、取得した符号化ファイルから復号ファイルを再生する（ステップＳ１２０５）。例えば、「Ｄｉｇｉｔａｌｍｏｖｉｅ１」を再生する。 Next, the distributed disk restoration unit 205 reproduces a decoded file from the acquired encoded file (step S1205). For example, “Digitalmovie1” is reproduced.

次に、分散ディスク復元部２０５は、復号ファイルを分割し復号化し、復元すべき符号化ファイルを生成する（ステップＳ１２０６）。ここで、分割と符号化にあたっては、ステップＳ７０３と同様の手順で行うこととする。例えば、「Ｄｉｇｉｔａｌｍｏｖｉｅ１」を分割し符号化することにより、「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００１」、「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００２」、・・・「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿１５０」を生成し、その中から復元すべき符号化ファイル「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００２」を残して、他の符号化ファイルは削除する。 Next, the distributed disk restoration unit 205 divides and decodes the decoded file, and generates an encoded file to be restored (step S1206). Here, division and encoding are performed in the same procedure as in step S703. For example, by dividing and encoding “Digitalmovie1”, “Digitalmovie1 — 001”, “Digitalmovie1 — 002”,... “Digitalmovie1 — 150” is generated, and the encoded file “Digitalmov2 and others” to be restored is left out. Delete the encoded file.

次に、分散ディスク復元部２０５は、復元依頼を受け付けた分散ディスク１０１へ、符号化ファイルを送信する（ステップＳ１２０７）。例えば、分散ディスク「ＰＣ０００５」に「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００２」を送信する。なお、送信が正常に完了した後に、管理端末１０２上の符号化ファイル「Ｄｉｇｉｔａｌｍｏｖｉｅ１＿００２」は削除する。 Next, the distributed disk restoration unit 205 transmits the encoded file to the distributed disk 101 that has received the restoration request (step S1207). For example, “Digitalmovie1 — 002” is transmitted to the distributed disk “PC0005”. Note that after the transmission is completed normally, the encoded file “Digitalmovie1 — 002” on the management terminal 102 is deleted.

次に、分散ディスク復元部２０５は、ステップＳ１２０２で検索した全ての元ファイルについて処理を完了したか確認し、完了していれば、ステップＳ１１０６を終了し、完了していなければ、ステップＳ１２０３に戻り、ステップＳ１２０８までの処理を繰り返す（ステップＳ１２０８）。この例では、元ファイル名「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」についての処理が完了していないので、ステップＳ１２０３に戻り、元ファイル名「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」について、ステップＳ１２０３からステップＳ１２０８までの処理を行う。 Next, the distributed disk restoration unit 205 checks whether the processing has been completed for all the original files searched in step S1202, and if completed, ends step S1106, and if not completed, returns to step S1203. The process up to step S1208 is repeated (step S1208). In this example, since the process for the original file name “Digitalpicture 1” has not been completed, the process returns to step S1203, and the process from step S1203 to step S1208 is performed for the original file name “Digital picture 1”.

そして、次に、元ファイル名「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」を復号するために用いることができる他の符号化ファイルＮｏ．は、「００１」を除いたものであり、「００２」〜「１５０」である（ステップＳ１２０３）。次に、元ファイル名「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」と＿（アンダーバー）と符号化ファイルＮｏ．を組合わせた符号化ファイル名であり、かつ符号化ファイルＮｏ．は、「００２」〜「１５０」であるような符号化ファイル名を有する符号化ファイルを、１０５個、正常稼動中の分散ディスク１０１から取得する（ステップＳ１２０４）。すなわち、符号化ファイル「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿００２」〜「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿１５０」の内の１０５個のファイルを、正常稼動中の分散ディスク１０１から取得する。次に、「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」を再生する（ステップＳ１２０５）。次に、「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」を分割し符号化することにより、「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿００１」、「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿００２」、・・・「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿１５０」を生成し、その中から復元すべき符号化ファイル「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿００１」を残して、他の符号化ファイルは削除する（ステップＳ１２０６）。なお、送信が正常に完了した後に、管理端末１０２上の符号化ファイル「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿００１」は削除する。次に、分散ディスク「ＰＣ０００５」に「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１＿００１」を送信する（ステップＳ１２０７）。次に、ステップＳ１２０２で検索した全ての元ファイル「Ｄｉｇｉｔａｌｍｏｖｉｅ１」と「Ｄｉｇｉｔａｌｐｉｃｔｕｒｅ１」とについて処理を完了したので、ステップＳ１１０６を終了する（ステップＳ１２０８）。このようにして、分散ディスク１０１上には、元々あった符号化ファイル全てが復元される。 Next, another encoded file No. that can be used to decode the original file name “Digitalpicture1” is displayed. Is obtained by removing “001” and is “002” to “150” (step S1203). Next, the original file name “Digitalpicture 1”, _ (underscore), and the encoded file number. Are encoded file names, and the encoded file No. Acquires 105 encoded files having encoded file names such as “002” to “150” from the normally operating distributed disk 101 (step S1204). That is, 105 files of the encoded files “Digitalpicture1_002” to “Digitalpicture1_150” are acquired from the distributed disk 101 operating normally. Next, “Digitalpicture 1” is reproduced (step S1205). Next, by dividing and encoding “Digitalpicture1”, “Digitalpicture1_001”, “Digitalpicture1_002”,... “Digitalpicture1_150” is generated, and the encoded file “Digital_picture1” and the like to be restored are generated. The encoded file is deleted (step S1206). Note that after the transmission is completed normally, the encoded file “Digitalpicture1_001” on the management terminal 102 is deleted. Next, “Digitalpicture1_001” is transmitted to the distributed disk “PC0005” (step S1207). Next, since processing has been completed for all the original files “Digitalmovie1” and “Digitalpicture1” searched in step S1202, step S1106 is terminated (step S1208). In this way, all the original encoded files are restored on the distributed disk 101.

なお、本実施の形態において、分散ディスク１０１の特定にＩＰアドレスを使用し、ＩＰアドレスと分散ディスクノード名との対応付けに、分散ディスク記憶部１（２０２）と分散ディスク記憶部２（２０６）とを使用しているが、分散ディスク１０１の特定には分散ディスクノード名を使用し、ＩＰアドレスと分散ディスクノード名との対応付けには、ｈｏｓｔｓファイルを利用してもよい。なお、ここでｈｏｓｔｓファイルとは、周知の技術であるＴＣＰ／ＩＰネットワーク上のＩＰアドレスとホスト名との対応を記述するテキストファイルである。この場合、分散ディスクノード名は、前述のホスト名と同じ名称を設定することとし、分散ディスク記憶部１（２０２）及び分散ディスク記憶部２（２０６）には、分散ディスク１０１として使用されるパーソナルコンピュータ又はストレージ専用のデバイス等のホスト名が記憶されれば十分である。 In this embodiment, an IP address is used to specify the distributed disk 101, and the distributed disk storage unit 1 (202) and the distributed disk storage unit 2 (206) are used to associate the IP address with the distributed disk node name. However, a distributed disk node name may be used to identify the distributed disk 101, and a hosts file may be used to associate an IP address with the distributed disk node name. Here, the hosts file is a text file that describes correspondence between an IP address and a host name on a TCP / IP network, which is a well-known technique. In this case, the distributed disk node name is set to the same name as the above-described host name, and the distributed disk storage unit 1 (202) and the distributed disk storage unit 2 (206) are personally used as the distributed disk 101. It is sufficient if a host name such as a computer or a storage dedicated device is stored.

そして、図１１と図１２とを用いて説明した分散ディスク復元の処理の流れにおいて、ステップＳ１１０４では、人手により、故障中の分散ディスク１０１の交換、又は修理を行うが、ただし、この時、分散ディスクノード名（ホスト名）は交換、修理前と同じものを引き継ぐ。又、分散ディスクノード名（ホスト名）に対するＩＰアドレスが変更されている場合は、合わせて、分散ディスク１０１、管理端末１０２、及びスーパーノード１０４のｈｏｓｔｓファイルに保管されている分散ディスクノード名（ホスト名）の該当するＩＰアドレスを修正する。次に、ステップＳ１１０５では、分散ディスク１０１は、自身が正常稼動したこと及び復元依頼情報を、その分散ディスクノード名（ホスト名）と共に、管理端末１０２に通知する。また、ステップＳ１２０１では、管理端末１０２の分散ディスク復元部２０５は、分散ディスク記憶部１（２０２）を参照し、受け付けた分散ディスクノード名（ホスト名）が分散ディスクとして登録されていることを確認する。 In the distributed disk restoration processing flow described with reference to FIGS. 11 and 12, in step S1104, the failed distributed disk 101 is replaced or repaired manually. The disk node name (host name) will be the same as before replacement and repair. If the IP address for the distributed disk node name (host name) has been changed, the distributed disk node name (host) stored in the hosts file of the distributed disk 101, management terminal 102, and super node 104 is also added. Correct the corresponding IP address. In step S1105, the distributed disk 101 notifies the management terminal 102 of the normal operation of itself and the restoration request information together with the distributed disk node name (host name). In step S1201, the distributed disk restoration unit 205 of the management terminal 102 refers to the distributed disk storage unit 1 (202) and confirms that the received distributed disk node name (host name) is registered as a distributed disk. To do.

なお、本実施の形態において、図１１と図１２とを用いて説明したように、分散ディスク１０１の正常稼動と復元依頼とが管理端末１０２に通知されると（ステップＳ１１０５）、分散ディスク復元部２０５は、通知元の分散ディスク１０１を復元する（ステップＳ１１０６）構成であってもよいが、管理端末１０２の分散ディスク復元部２０５が、管理端末１０２のキーボード３０２、マウス３０３などの入力装置を介して、管理者から任意の分散ディスク１０１の復元依頼を受け付けて、前記の分散ディスク１０１の復元を行うものであってもよい。後者の場合も、管理者から受け付けた任意の分散ディスク１０１に対して、図１２のステップＳ１２０１からステップＳ１２０８までの処理を、前述の説明と同様に行う。 In this embodiment, as described with reference to FIGS. 11 and 12, when the management terminal 102 is notified of the normal operation of the distributed disk 101 and the restoration request (step S1105), the distributed disk restoration unit. 205 may be configured to restore the notification-source distributed disk 101 (step S1106), but the distributed disk restoration unit 205 of the management terminal 102 is configured via an input device such as the keyboard 302 and the mouse 303 of the management terminal 102. The administrator may accept a restoration request for an arbitrary distributed disk 101 from an administrator and restore the distributed disk 101. Also in the latter case, the processing from step S1201 to step S1208 of FIG. 12 is performed in the same manner as described above for an arbitrary distributed disk 101 received from the administrator.

なお、本実施の形態において、管理端末１０２を有しない構成であってもよい。この場合には、図１１、及び図１２において説明した分散ディスク復元は行わない。また、図７、図８、及び図９において説明したデータ保管については、図７のステップＳ７０５及び図９を行わない。すなわち、ファイル管理情報を作成し、管理端末１０２に送信する処理を行わない。また、図１０において説明したデータ読込みについては、管理端末１０２は係らない処理であるので、前述の通り動作する。 In the present embodiment, the management terminal 102 may not be provided. In this case, the distributed disk restoration described with reference to FIGS. 11 and 12 is not performed. Further, for the data storage described in FIGS. 7, 8, and 9, steps S705 and FIG. 9 in FIG. 7 are not performed. That is, the file management information is not created and transmitted to the management terminal 102. Further, since the data reading described with reference to FIG. 10 is a process that does not involve the management terminal 102, it operates as described above.

なお、本実施の形態において、前述したように、元ファイル名はユニークであることを前提としている。ただし、ユーザ端末１０６において名称が重複していた元ファイルを登録可能とするために、ユーザ端末１０６の保管依頼部２１２において、元ファイル名と他の情報（例えば、ユーザＩＤ、端末ＩＤ、時刻等）とを組合わせて、新たなユニークな名称に作成して、元ファイル名を前述のユニークな名称に変更して、スーパーノード１０４に送信するようにしてもよい。又は、スーパーノード１０４の分割符号化部２０７において、保管依頼を受けた元ファイル名が、管理端末１０２のファイル管理情報記憶部２０３に記憶されているかどうかの確認を行い、既に格納されている場合には、ユーザ端末１０６の保管依頼部２１２にエラーを返すようにしてもよい。又は、スーパーノード１０４の分割符号化部２０７において、保管依頼を受けた元ファイル名を含む名称を有する符号化ファイルが、分散ディスク１０１の符号化データ記憶部２０１に記憶されているかどうかの確認を行い、既に格納されている場合には、ユーザ端末１０６の保管依頼部２１２にエラーを返すようにしてもよい。また、ユーザ端末１０６の読込み依頼部２１３は、保管依頼部２１２が元ファイル名の変更を行っている場合には、同様の方法で、読込み依頼を受けた元ファイル名を変更する。 In the present embodiment, as described above, it is assumed that the original file name is unique. However, the original file name and other information (for example, user ID, terminal ID, time, etc.) are stored in the storage request unit 212 of the user terminal 106 so that the original file whose name has been duplicated in the user terminal 106 can be registered. ) To create a new unique name, the original file name may be changed to the above-mentioned unique name, and transmitted to the super node 104. Alternatively, the division encoding unit 207 of the super node 104 checks whether or not the original file name that received the storage request is stored in the file management information storage unit 203 of the management terminal 102 and has already been stored. Alternatively, an error may be returned to the storage request unit 212 of the user terminal 106. Alternatively, the division encoding unit 207 of the super node 104 checks whether or not an encoded file having a name including the original file name that has received the storage request is stored in the encoded data storage unit 201 of the distributed disk 101. If it is already stored, an error may be returned to the storage request unit 212 of the user terminal 106. In addition, when the storage request unit 212 changes the original file name, the read request unit 213 of the user terminal 106 changes the original file name that has received the read request in the same manner.

以上のように、本実施の形態では、ストレージデバイスとして、近年、ＣＰＵ及びＬＡＮの高速化、大容量ディスク化、及び低価格が進んでいるパーソナルコンピュータを利用することにより、従来、大変高価であった専用ストレージシステムと同等な機能と性能とを有する分散ストレージシステム１００を安価に構築できる。また、消失訂正符号化技術を利用することにより、１台の分散ディスク１０１を参照しても、元ファイルを復元することはできず、従来の専用ストレージシステムと同等の情報秘匿性を確保できる。また、スーパーノード１０４は、元ファイル、元ファイル名、元ファイルＮｏ．、保管先の分散ディスクノード名などの管理情報を記憶しないため、さらに、情報秘匿性を確保できる。 As described above, according to the present embodiment, a storage device has recently been very expensive by using a personal computer, which has recently been increasing in speed and CPU, LAN, and a low price as a storage device. In addition, the distributed storage system 100 having the same functions and performance as the dedicated storage system can be constructed at low cost. In addition, by using the erasure correction coding technique, the original file cannot be restored even if the single distributed disk 101 is referred to, and the same information confidentiality as that of the conventional dedicated storage system can be ensured. In addition, the super node 104 includes an original file, an original file name, an original file No. Further, since management information such as the storage destination distributed disk node name is not stored, information confidentiality can be further ensured.

また、本実施の形態では、管理端末１０２のみがファイル管理情報を保管し、ファイル管理情報を使用して任意の１台の分散ディスク１０１上の符号化ファイル全ての復元が可能であるため、ファイアウォールや各種認証機能等を用いて管理端末１０２上のファイル管理情報を安全に管理することができ、また、分散ディスク１０１を、故障、その他の諸事情による修理・交換することが容易になり、分散ストレージシステム１００の信頼性・保守性を向上することができる。従来のデータ分散保管システムにおいても、消失訂正符号化技術を利用しているため、一部の分散保管サーバが故障又は非正常稼動となっていても、一定数の符号化ファイルを集めることができれば、システム全体の動作上何ら問題がない。しかし、故障又は非正常稼動な分散保管サーバが増えすぎると、正常稼動中の分散保管サーバに負荷が集中し全体のパフォーマンスが低下する、総ストレージ容量が低下する、復号が可能となる一定数の符号化ファイルが確保できなくなるといった問題があったが、本実施の形態では、故障又は非正常稼動となった任意の１台の分散ディスク１０１上の符号化ファイル全てを復元することができるために、信頼性・保守性を向上することができる。 In this embodiment, only the management terminal 102 stores file management information, and the file management information can be used to restore all the encoded files on any one distributed disk 101. File management information on the management terminal 102 can be safely managed by using various authentication functions, etc., and the distributed disk 101 can be easily repaired or replaced due to failure or other circumstances. The reliability and maintainability of the storage system 100 can be improved. Since the conventional data distributed storage system also uses the erasure correction encoding technology, if a certain number of encoded files can be collected even if some of the distributed storage servers are out of order or operating abnormally. There is no problem in the operation of the entire system. However, if there are too many failed or abnormally distributed distributed storage servers, the load will be concentrated on the distributed storage servers that are operating normally, resulting in a decrease in overall performance, a decrease in total storage capacity, and a certain number of data that can be decrypted. There is a problem that the encoded file cannot be secured, but in this embodiment, all the encoded files on any one distributed disk 101 that has failed or malfunctioned can be restored. Reliability and maintainability can be improved.

また、本実施の形態では、複数の分散ディスク１０１に対して正常稼動しているかどうかの確認を行い、正常稼動を確認できなかった分散ディスク１０１について、その情報を管理者に通知し、その後正常稼動確認後、直ちに復元を行うことができる。 Also, in this embodiment, it is confirmed whether or not the plurality of distributed disks 101 are operating normally, and the administrator is notified of the information about the distributed disks 101 whose normal operation has not been confirmed. After the operation is confirmed, restoration can be performed immediately.

また、本実施の形態では、ユーザが複数のスーパーノード１０４があることを意識せず、ひとつの仮想スーパーノードとして操作しつつ、実際には複数のスーパーノード１０４を有して、負荷を分散することができる。また、前述のように、スーパーノード１０４は、元ファイル、元ファイル名、元ファイルＮｏ．、保管先の分散ディスクノード名などの管理情報を記憶しないため、スーパーノード１０４の追加・削除などの保守が容易である。 In this embodiment, the user is not aware that there are a plurality of super nodes 104 and operates as one virtual super node, but actually has a plurality of super nodes 104 and distributes the load. be able to. Further, as described above, the super node 104 includes the original file, the original file name, the original file No. Since management information such as the storage destination distributed disk node name is not stored, maintenance such as addition / deletion of the super node 104 is easy.

このように、本実施の形態では、情報隠匿性・信頼性・保守性などの面で、より安全性の高い分散ストレージシステム１００を得ることができる。 As described above, in the present embodiment, it is possible to obtain the distributed storage system 100 with higher security in terms of information concealment, reliability, maintainability, and the like.

実施の形態２．
本実施の形態では、実施の形態１で説明した管理端末１０２とスーパーノード１０４を１台のスーパーノード１０４にまとめた実施例について説明する。 Embodiment 2. FIG.
In the present embodiment, an example in which the management terminal 102 and the super node 104 described in the first embodiment are combined into one super node 104 will be described.

以下、図１３を用いて、本実施の形態における分散ストレージシステム１００の詳細な構成を説明する。分散ディスク１０１は、符号化データ記憶部２０１を有する。スーパーノード１０４は、１台のみの構成であり、分散ディスク記憶部２（２０６）と、分割符号化部２０７と、配信部２０８と、ファイル管理部２０９と、再生部２１０とを有する。さらに、スーパーノード１０４は、ファイル管理情報記憶部２０３と、分散ディスク状態確認部２０４と、分散ディスク復元部２０５とを有する。ユーザ端末１０６は、スーパーノード記憶部２１１と、保管依頼部２１２と、読込み依頼部２１３とを有する。なお、実施の形態１における分散ディスク記憶部１（２０２）は、分散ディスク記憶部２（２０６）と同じ情報を保管するので、本実施の形態においては不要となる。 Hereinafter, the detailed configuration of the distributed storage system 100 according to the present embodiment will be described with reference to FIG. The distribution disk 101 has an encoded data storage unit 201. The super node 104 has only one configuration, and includes a distributed disk storage unit 2 (206), a division encoding unit 207, a distribution unit 208, a file management unit 209, and a reproduction unit 210. Furthermore, the super node 104 includes a file management information storage unit 203, a distributed disk state confirmation unit 204, and a distributed disk restoration unit 205. The user terminal 106 includes a super node storage unit 211, a storage request unit 212, and a read request unit 213. Since the distributed disk storage unit 1 (202) in the first embodiment stores the same information as the distributed disk storage unit 2 (206), it is not necessary in the present embodiment.

実施の形態１において、図７から図１２までを用いて説明した、データ保管、データ読込み、及び分散ディスク復元の処理の流れは、次の点を除いて実施の形態２においても同様である。本実施の形態では、スーパーノード１０４は１台のみの構成ため、複数のスーパーノードから任意の１台のスーパーノードを選択する処理は省略する。また、実施の形態１における管理端末１０２は、本実施の形態ではスーパーノード１０４と読み替える。また、実施の形態１における分散ディスク記憶部１（２０２）は、分散ディスク記憶部２（２０６）と読み替える。 In the first embodiment, the processing flow of data storage, data reading, and distributed disk restoration described with reference to FIGS. 7 to 12 is the same as in the second embodiment except for the following points. In this embodiment, since there is only one super node 104, the process of selecting any one super node from a plurality of super nodes is omitted. Further, the management terminal 102 in the first embodiment is read as the super node 104 in the present embodiment. Further, the distributed disk storage unit 1 (202) in the first embodiment is read as the distributed disk storage unit 2 (206).

以下、図７、図８、及び図９を用いて、本実施の形態におけるデータ保管処理について説明する。図７において、本実施の形態では、ステップＳ７０１は省略する（ステップＳ７０１）。次に、ユーザ端末１０６の保管依頼部２１２は、スーパーノード１０４に、保管の対象となる元ファイルと保管指示を送信する（ステップＳ７０１）。次に、実施の形態１と同様に、元ファイルの分割し、符号化し、符号化ファイルを作成する（ステップＳ７０３）。次に、実施の形態１と同様に、符号化ファイルを分散ディスクに送信する（ステップＳ７０４）。次に、スーパーノード１０４のファイル管理部２０９は、ファイル管理情報を生成し、スーパーノード１０４のファイル管理情報記憶部２０３に送信する（ステップＳ７０５）。図８は、前述のステップＳ７０４の詳細な動作を説明したものであるが、本実施の形態においても、実施の形態１と同様であり、スーパーノード１０４の配信部２０８が稼動中の分散ディスク１０１から１５０台を任意に選択して符号化ファイルを送信し、分散ディスク１０１の符号化データ記憶部２０１は符号化ファイルを保管する。 Hereinafter, data storage processing according to the present embodiment will be described with reference to FIGS. 7, 8, and 9. In FIG. 7, step S701 is omitted in the present embodiment (step S701). Next, the storage request unit 212 of the user terminal 106 transmits an original file to be stored and a storage instruction to the super node 104 (step S701). Next, as in the first embodiment, the original file is divided and encoded to create an encoded file (step S703). Next, as in the first embodiment, the encoded file is transmitted to the distributed disk (step S704). Next, the file management unit 209 of the super node 104 generates file management information and transmits it to the file management information storage unit 203 of the super node 104 (step S705). FIG. 8 explains the detailed operation of the above-described step S704, but this embodiment is the same as in the first embodiment, and the distributed disk 101 in which the distribution unit 208 of the super node 104 is operating. 150 are arbitrarily selected from the above, and the encoded file is transmitted, and the encoded data storage unit 201 of the distributed disk 101 stores the encoded file.

図９は、前述のステップＳ７０４の詳細な動作を説明したものである。本実施の形態では、実施の形態１と同様に、ファイル管理情報を作成する（ステップＳ９０１）。次に、スーパーノード１０４のファイル管理部２０９は、ファイル管理情報をスーパーノード１０４のファイル管理情報記憶部２０３に送信する（ステップＳ９０２）。次に、スーパーノード１０４のファイル管理情報記憶部２０３は、受信したファイル管理情報を保管する（ステップＳ９０３）。次に、スーパーノード１０４のファイル管理情報記憶部２０３は、正常受信した旨を、スーパーノード１０４のファイル管理部２０９に応答する（ステップＳ９０４）。 FIG. 9 explains the detailed operation of step S704 described above. In the present embodiment, file management information is created as in the first embodiment (step S901). Next, the file management unit 209 of the super node 104 transmits file management information to the file management information storage unit 203 of the super node 104 (step S902). Next, the file management information storage unit 203 of the super node 104 stores the received file management information (step S903). Next, the file management information storage unit 203 of the super node 104 responds to the file management unit 209 of the super node 104 that it has been normally received (step S904).

次に、図１０を用いて、本実施の形態におけるデータ読込み処理について説明する。本実施の形態では、ステップＳ１００１は省略する（ステップＳ１００１）。次にユーザ端末１０６の読込み依頼部２１３は、スーパーノード１０４に、読込みの対象となる元ファイル名と読込み指示を送信する（ステップＳ１００２）。次に、実施の形態１と同様に、読込み指示を受信したスーパーノード１０４の再生部２１０は、分散ディスク記憶部２（２０６）に記憶された全分散ディスク１０１に対して、元ファイルから生成した符号化ファイルを保管しているかどうかの確認要求を行う（ステップＳ１００３）。 Next, the data reading process in the present embodiment will be described with reference to FIG. In this embodiment, step S1001 is omitted (step S1001). Next, the read request unit 213 of the user terminal 106 transmits an original file name to be read and a read instruction to the super node 104 (step S1002). Next, as in the first embodiment, the playback unit 210 of the super node 104 that has received the read instruction generates the source file from all the distributed disks 101 stored in the distributed disk storage unit 2 (206). A confirmation request is made as to whether or not the encoded file is stored (step S1003).

次に、実施の形態１と同様に、分散ディスク１０１の符号化データ記憶部２０１は、指定された符号化ファイルを保管している場合に、その応答を返す（ステップＳ１００４）。次に、ステップＳ１００４からステップ１００８までは、実施の形態１と同様に、応答を受信したスーパーノード１０４の再生部２１０は、応答のあった分散ディスク１０１から、１０５個の符号化データを取得する（ステップＳ１００５）。次に、符号化データを復号し、元ファイルと同じ内容の復号ファイルを再生する（ステップＳ１００６）。次に、復号ファイルをユーザ端末１０６へ送信する（ステップＳ１００７）。次に、ユーザ端末１０６の読込み依頼部２１３は、復号ファイルを受信する（ステップＳ１００８）。 Next, as in the first embodiment, the encoded data storage unit 201 of the distributed disk 101 returns a response when the specified encoded file is stored (step S1004). Next, from step S1004 to step 1008, as in the first embodiment, the reproducing unit 210 of the super node 104 that has received the response acquires 105 encoded data from the distributed disk 101 that has responded. (Step S1005). Next, the encoded data is decoded, and a decoded file having the same contents as the original file is reproduced (step S1006). Next, the decrypted file is transmitted to the user terminal 106 (step S1007). Next, the read request unit 213 of the user terminal 106 receives the decrypted file (step S1008).

次に、図１１及び図１２を用いて、本実施の形態における分散ディスク復元処理について説明する。図１１及び図１２においても、実施の形態１における管理端末１０２をスーパーノード１０４と読み替える点以外の詳細な動作は、実施の形態１と同様である。 Next, the distributed disk restoration process according to the present embodiment will be described with reference to FIGS. 11 and 12, the detailed operation is the same as that of the first embodiment except that the management terminal 102 in the first embodiment is replaced with the super node 104.

図１１において、まず、スーパーノード１０４の分散ディスク状態確認部２０４から、全分散ディスク１０１に対して、正常稼動しているかどうかの確認要求を行う（ステップＳ１１０１）。次に、確認要求を受信した分散ディスク１０１は、自身が正常稼動中であれば、応答する（ステップＳ１１０２）。次に、スーパーノード１０４の分散ディスク状態確認部２０４は、一定期間、正常稼動の旨の応答がない分散ディスク１０１に対して、故障と判断し、アラームを上げる（ステップＳ１１０３）。次に、人手により、故障中の分散ディスク１０１の交換、又は修理を行う（ステップＳ１１０４）。次に、分散ディスク１０１は、自身が正常稼動したこと及び復元依頼を、スーパーノード１０４に通知する（ステップＳ１１０５）。次に、スーパーノード１０４の分散ディスク復元部２０５は、故障後正常稼動した分散ディスク１０１について、自動でデータ復元処理を行う（ステップＳ１１０６）。 In FIG. 11, first, the distributed disk status confirmation unit 204 of the super node 104 issues a confirmation request to the all distributed disks 101 as to whether it is operating normally (step S1101). Next, the distributed disk 101 that has received the confirmation request responds if it is operating normally (step S1102). Next, the distributed disk status confirmation unit 204 of the super node 104 determines that the distributed disk 101 that has not responded for normal operation for a certain period of time is a failure and raises an alarm (step S1103). Next, the failed distribution disk 101 is replaced or repaired manually (step S1104). Next, the distributed disk 101 notifies the super node 104 of the normal operation of itself and the restoration request (step S1105). Next, the distributed disk restoration unit 205 of the super node 104 automatically performs data restoration processing on the distributed disk 101 that has been operating normally after the failure (step S1106).

次に、図１２において図１１のステップＳ１１０６の詳細な処理の流れについて説明する。スーパーノード１０４の分散ディスク復元部２０５は、分散ディスク記憶部２（２０６）を参照し、復元依頼を受け付けた分散ディスク１０１の分散ディスクノード名を認識する。（ステップＳ１２０１）次に、分散ディスク復元部２０５は、ファイル管理情報記憶部２０３を参照し、ファイル管理情報から、認識した分散ディスクノード名に対応する元ファイル名と符号化ファイルＮｏ．とを全て検索する（ステップＳ１２０２）。次に、分散ディスク復元部２０５は、検索した元ファイル名ごとに、ステップＳ１２０３からステップＳ１２０８までの処理を繰り返す。 Next, the detailed processing flow of step S1106 of FIG. 11 will be described with reference to FIG. The distributed disk restoration unit 205 of the super node 104 refers to the distributed disk storage unit 2 (206) and recognizes the distributed disk node name of the distributed disk 101 that has received the restoration request. (Step S1201) Next, the distributed disk restoration unit 205 refers to the file management information storage unit 203, and from the file management information, the original file name corresponding to the recognized distributed disk node name and the encoded file number. Are searched (step S1202). Next, the distributed disk restoration unit 205 repeats the processing from step S1203 to step S1208 for each retrieved original file name.

分散ディスク復元部２０５は、符号化ファイルＮｏ．から、前記元ファイルを復号するために必要な他の符号化ファイルＮｏ．と、前記他の符号化ファイルＮｏ．に対応する稼動中の分散ディスクノード名とを確定する（ステップＳ１２０３）。次に、分散ディスク復元部２０５は、稼動中の分散ディスク１０１から、復元に必要な符号化ファイルＮｏ．に対応する符号化ファイルを取得する（ステップＳ１２０４）。次に、分散ディスク復元部２０５は、取得した符号化ファイルから復号ファイルを再生する（ステップＳ１２０５）。 The distributed disk restoration unit 205 stores the encoded file number. To the other encoded file No. required for decoding the original file. And the other encoded file No. The active distributed disk node name corresponding to is determined (step S1203). Next, the distributed disk restoration unit 205 reads the encoded file No. required for restoration from the active distributed disk 101. An encoded file corresponding to is acquired (step S1204). Next, the distributed disk restoration unit 205 reproduces a decoded file from the acquired encoded file (step S1205).

次に、分散ディスク復元部２０５は、復号ファイルを分割し復号化し、復元すべき符号化ファイルを生成する（ステップＳ１２０６）。次に、分散ディスク復元部２０５は、復元依頼を受け付けた分散ディスク１０１へ、符号化ファイルを送信する（ステップＳ１２０７）。次に、分散ディスク復元部２０５は、ステップＳ１２０２で検索した全ての元ファイルについて処理を完了したか確認し、完了していれば、ステップＳ１１０６を終了し、完了していなければ、ステップＳ１２０３に戻り、ステップＳ１２０８までの処理を繰り返す（ステップＳ１２０８）。 Next, the distributed disk restoration unit 205 divides and decodes the decoded file, and generates an encoded file to be restored (step S1206). Next, the distributed disk restoration unit 205 transmits the encoded file to the distributed disk 101 that has received the restoration request (step S1207). Next, the distributed disk restoration unit 205 checks whether the processing has been completed for all the original files searched in step S1202, and if completed, ends step S1106, and if not completed, returns to step S1203. The process up to step S1208 is repeated (step S1208).

なお、実施の形態１、及び本実施の形態では、図５において説明したように、符号化ファイルの名称は、元ファイル名と＿（アンダーバー）と符号化ファイルＮｏ．とを結合したものとする。また、図５と図６において説明したように、ファイル管理情報とは、保管対象である元ファイルの名称である元ファイル名と、元ファイルを分割して符号化した符号化ファイルを特定するための符号化ファイルＮｏ．と、符号化ファイルを保管した分散ディスク１０１を特定するための分散ディスクノード名とを対応付けた情報である。 In the first embodiment and the present embodiment, as described in FIG. 5, the name of the encoded file includes the original file name, _ (underscore), encoded file No. Are combined. Further, as described in FIGS. 5 and 6, the file management information is used to specify the original file name that is the name of the original file to be stored and the encoded file obtained by dividing and encoding the original file. Encoded file No. And the distributed disk node name for identifying the distributed disk 101 that stores the encoded file.

そして、このようなデータを前提として、データ保管処理において、図７のステップＳ７０３では、符号化ファイルを生成する際に、符号化ファイルの名称は、元ファイル名と＿（アンダーバー）と符号化ファイルＮｏ．とを結合したものとする。そして、データ読込み処理において、図１０のステップＳ１００３では、再生部２１０は、全分散ディスク１０１に対して、元ファイルから生成した符号化ファイルを保管しているかどうかの確認要求を行うが、ここで、確認要求と共に、元ファイル名を送信する。次に、ステップＳ１００４では、符号化データ記憶部２０１は、ステップＳ１００３で受信した元ファイル名をキーとして検索し、元ファイルから生成した符号化ファイルを保管している場合に、応答を返す。 Based on such data, in the data storage process, in step S703 of FIG. 7, when the encoded file is generated, the name of the encoded file is the original file name, _ (underscore), and the encoded file. No. Are combined. In the data reading process, in step S1003 of FIG. 10, the playback unit 210 requests the all distributed disks 101 to check whether or not the encoded file generated from the original file is stored. The original file name is transmitted together with the confirmation request. In step S1004, the encoded data storage unit 201 searches using the original file name received in step S1003 as a key, and returns a response when the encoded file generated from the original file is stored.

ただし、本実施の形態においては、符号化ファイルの名称に前述のような制約を設けず、単に符号化データ記憶部２０１においてユニークな符号化ファイル名を付してもよい。この場合、図５と図６において説明したファイル管理情報は、保管対象である元ファイルの名称である元ファイル名と、元ファイルを分割して符号化した符号化ファイルを特定するための「符号化ファイル名」と、符号化ファイルを保管した分散ディスク１０１を特定するための分散ディスクノード名とを対応付けた情報である。 However, in the present embodiment, the encoded file name may be simply given a unique encoded file name in the encoded data storage unit 201 without providing the above-described restrictions on the encoded file name. In this case, the file management information described in FIG. 5 and FIG. 6 includes the original file name that is the name of the original file to be stored, and the “code” for specifying the encoded file obtained by dividing and encoding the original file. This is information in which the “encoded file name” is associated with the distributed disk node name for specifying the distributed disk 101 storing the encoded file.

そして、前述のようなデータ（符号化ファイル名及びファイル管理情報）を前提として、データ保管処理において、図７のステップＳ７０３では、スーパーノード１０４の分割符号化部２０７は、受信した元ファイルを１００個に分割し、符号化し、１５０個の符号化ファイルを生成するが、符号化ファイルの名称は、符号化データ記憶部２０１においてユニークな名称を付す。また、図９のステップＳ９０１では、スーパーノード１０４のファイル管理部２０９は、ファイル管理情報を作成するが、ファイル管理情報は、保管対象である元ファイルの名称である元ファイル名と、元ファイルを分割して符号化した符号化ファイルを特定するための「符号化ファイル名」と、符号化ファイルを保管した分散ディスク１０１を特定するための分散ディスクノード名とを対応付けた情報である。なお、元ファイル名と「符号化ファイル名」との対応に係る情報は、分割符号化部２０７から取得し、「符号化ファイル名」と分散ディスクノード名との対応に係る情報は、配信部２０８から取得する。 Then, on the premise of the data (encoded file name and file management information) as described above, in step S703 in FIG. 7, in the data storage process, the division encoding unit 207 of the super node 104 sets the received original file as 100. The encoded file is divided into pieces and encoded to generate 150 encoded files. The encoded data storage unit 201 assigns a unique name to the encoded file name. In step S901 in FIG. 9, the file management unit 209 of the super node 104 creates file management information. The file management information includes the original file name that is the name of the original file to be stored, and the original file. This is information in which an “encoded file name” for specifying an encoded file divided and encoded is associated with a distributed disk node name for specifying the distributed disk 101 storing the encoded file. Information relating to the correspondence between the original file name and the “encoded file name” is acquired from the division encoding unit 207, and information relating to the correspondence between the “encoded file name” and the distributed disk node name is obtained from the distribution unit. Obtained from 208.

そして、前述のようなデータ（符号化ファイル名及びファイル管理情報）を前提として、データ読込み処理において、図１０のステップＳ１００３では、ファイル管理情報記憶部２０３に保管されたファイル管理情報を参照することにより、元ファイル名に対応する符号化ファイル名と、前述の符号化ファイル各々に対応する分散ディスクノード名を認識することが可能であるので、スーパーノード１０４の再生部２１０は、ファイル管理情報により、元ファイルから生成した符号化ファイルを保管する分散ディスク１０１を特定し、該当の分散ディスク１０１に対して、正常稼動の確認要求を行う。次に、ステップＳ１００４では、符号化データ記憶部２０１は、正常稼動していれば、応答を返す。 In the data reading process on the premise of the data (encoded file name and file management information) as described above, the file management information stored in the file management information storage unit 203 is referred to in step S1003 of FIG. Thus, it is possible to recognize the encoded file name corresponding to the original file name and the distributed disk node name corresponding to each of the aforementioned encoded files, so that the reproduction unit 210 of the super node 104 uses the file management information. Then, the distributed disk 101 that stores the encoded file generated from the original file is specified, and a normal operation confirmation request is made to the corresponding distributed disk 101. In step S1004, the encoded data storage unit 201 returns a response if it is operating normally.

そして、前述のようなデータ（符号化ファイル名及びファイル管理情報）を前提として、データ復元処理において、図１２の処理の流れは以下のようになる。スーパーノード１０４の分散ディスク復元部２０５は、分散ディスク記憶部２（２０６）を参照し、復元依頼を受け付けた分散ディスク１０１の分散ディスクノード名を認識する。（ステップＳ１２０１）次に、分散ディスク復元部２０５は、ファイル管理情報記憶部２０３を参照し、ファイル管理情報から、認識した分散ディスクノード名に対応する元ファイル名と符号化ファイル名とを全て検索する（ステップＳ１２０２）。次に、分散ディスク復元部２０５は、検索した元ファイル名ごとに、ステップＳ１２０３からステップＳ１２０８までの処理を繰り返す。 Then, assuming the data (encoded file name and file management information) as described above, the flow of the process in FIG. 12 is as follows in the data restoration process. The distributed disk restoration unit 205 of the super node 104 refers to the distributed disk storage unit 2 (206) and recognizes the distributed disk node name of the distributed disk 101 that has received the restoration request. (Step S1201) Next, the distributed disk restoration unit 205 refers to the file management information storage unit 203 and searches all the original file names and encoded file names corresponding to the recognized distributed disk node names from the file management information. (Step S1202). Next, the distributed disk restoration unit 205 repeats the processing from step S1203 to step S1208 for each retrieved original file name.

分散ディスク復元部２０５は、「ステップＳ１２０２で検索した符号化ファイル名」から、前記元ファイルを復号するために必要な「他の符号化ファイル名」と、前記「他の符号化ファイル名」に対応する稼動中の分散ディスクノード名とを確定する（ステップＳ１２０３）。すなわち、分散ディスク復元部２０５は、ファイル管理情報記憶部２０３を参照し、まず、「ステップＳ１２０２で検索した符号化ファイル名」に対応する元ファイル名を特定する。次に、ファイル管理情報記憶部２０３を参照し、特定した元ファイル名各々について、この元ファイルを分割し符号化することにより作成されたものである符号化ファイルの符号化ファイル名を特定する。次に、特定された全ての符号化ファイル名から、「ステップＳ１２０２で検索した符号化ファイル名」を省いたものが、前述の「他の符号化ファイル名」である。次に、ファイル管理情報記憶部２０３を参照し、前記「他の符号化ファイル名」に対応する稼動中の分散ディスクノード名とを確定する。 The distributed disk restoration unit 205 converts the “encoded file name searched in step S1202” into “another encoded file name” necessary for decoding the original file and the “other encoded file name”. A corresponding active distributed disk node name is determined (step S1203). That is, the distributed disk restoration unit 205 refers to the file management information storage unit 203 and first specifies the original file name corresponding to the “encoded file name searched in step S1202”. Next, referring to the file management information storage unit 203, for each identified original file name, the encoded file name of the encoded file created by dividing and encoding the original file is specified. Next, the aforementioned “other encoded file names” are obtained by omitting “the encoded file names searched in step S1202” from all the specified encoded file names. Next, the file management information storage unit 203 is referred to and an active distributed disk node name corresponding to the “other encoded file name” is determined.

次に、分散ディスク復元部２０５は、稼動中の分散ディスク１０１から、復元に必要な「他の符号化ファイル名」を持つ符号化ファイルを取得する（ステップＳ１２０４）。次に、分散ディスク復元部２０５は、取得した符号化ファイルから復号ファイルを再生する（ステップＳ１２０５）。 Next, the distributed disk restoration unit 205 acquires an encoded file having “another encoded file name” necessary for restoration from the active distributed disk 101 (step S1204). Next, the distributed disk restoration unit 205 reproduces a decoded file from the acquired encoded file (step S1205).

次に、分散ディスク復元部２０５は、復号ファイルを分割し符号化し、復元すべき符号化ファイル（「ステップＳ１２０２で検索した符号化ファイル名」を持つ符号化ファイル）を生成する（ステップＳ１２０６）。ここで、分割と符号化にあたっては、前述のようなデータ（符号化ファイル名及びファイル管理情報）を前提としたステップＳ７０３と同様の手順で行うこととする。復号ファイルを分割し符号化し、１５０個の符号化ファイルを作成し、この中から、「ステップＳ１２０２で検索した符号化ファイル名」を持つ符号化ファイルのみを残して、他を削除する。次に、分散ディスク復元部２０５は、復元依頼を受け付けた分散ディスク１０１へ、ステップＳ１２０６で作成した符号化ファイルを送信する（ステップＳ１２０７）。次に、分散ディスク復元部２０５は、ステップＳ１２０２で検索した全ての元ファイルについて処理を完了したか確認し、完了していれば、ステップＳ１１０６を終了し、完了していなければ、ステップＳ１２０３に戻り、ステップＳ１２０８までの処理を繰り返す（ステップＳ１２０８）。 Next, the distributed disk restoration unit 205 divides and encodes the decoded file, and generates an encoded file to be restored (an encoded file having the “encoded file name searched in step S1202”) (step S1206). Here, division and encoding are performed in the same procedure as step S703 on the premise of the data (encoded file name and file management information) as described above. The decoded file is divided and encoded to create 150 encoded files. Among them, only the encoded file having the “encoded file name searched in step S1202” is left and the others are deleted. Next, the distributed disk restoration unit 205 transmits the encoded file created in step S1206 to the distributed disk 101 that has received the restoration request (step S1207). Next, the distributed disk restoration unit 205 confirms whether or not the processing has been completed for all the original files searched in step S1202, and if completed, ends step S1106, and if not completed, returns to step S1203. The process up to step S1208 is repeated (step S1208).

以上のように、本実施の形態では、スーパーノードを１台のみ有し、別途管理端末を有さない構成であっても、スーパーノード１０４がファイル管理情報を保管し、ファイル管理情報を使用して任意の１台の分散ディスク１０１上の符号化ファイル全ての復元が可能であるため、分散ディスク１０１を、故障、その他の諸事情による修理・交換することが容易になり、実施の形態１と同様に、分散ストレージシステム１００の保守性を向上することができる。 As described above, in this embodiment, even if the configuration has only one super node and no separate management terminal, the super node 104 stores the file management information and uses the file management information. Thus, it is possible to restore all of the encoded files on any one distributed disk 101, so that it becomes easy to repair and replace the distributed disk 101 due to a failure or other circumstances. Similarly, maintainability of the distributed storage system 100 can be improved.

この発明の実施の形態１における分散ストレージシステムの構成を示す図である。It is a figure which shows the structure of the distributed storage system in Embodiment 1 of this invention. この発明の実施の形態１における分散ストレージシステムの詳細な構成を示すブロック図である。It is a block diagram which shows the detailed structure of the distributed storage system in Embodiment 1 of this invention. この発明の実施の形態１におけるハードウェア資源の一例を示す図である。It is a figure which shows an example of the hardware resource in Embodiment 1 of this invention. この発明の実施の形態１におけるファイルの符号化と復号化の一例を示す模式図である。It is a schematic diagram which shows an example of the encoding and decoding of the file in Embodiment 1 of this invention. この発明の実施の形態１における管理情報のデータ例の図である。It is a figure of the example of data of the management information in Embodiment 1 of this invention. この発明の実施の形態１におけるファイル管理情報記憶部のデータ例の図である。It is a figure of the example of data of the file management information storage part in Embodiment 1 of this invention. この発明の実施の形態１におけるデータ保管に係る分散ストレージシステムの動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of operation | movement of the distributed storage system which concerns on the data storage in Embodiment 1 of this invention. この発明の実施の形態１におけるデータ保管に係る分散ストレージシステムの動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of operation | movement of the distributed storage system which concerns on the data storage in Embodiment 1 of this invention. この発明の実施の形態１におけるデータ保管に係る分散ストレージシステムの動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of operation | movement of the distributed storage system which concerns on the data storage in Embodiment 1 of this invention. この発明の実施の形態１におけるデータ読込みに係る分散ストレージシステムの動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of operation | movement of the distributed storage system which concerns on the data reading in Embodiment 1 of this invention. この発明の実施の形態１における分散ディスク復元に係る分散ストレージシステムの動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of operation | movement of the distributed storage system which concerns on the distribution disk restoration in Embodiment 1 of this invention. この発明の実施の形態１における分散ディスク復元に係る分散ストレージシステムの動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of operation | movement of the distributed storage system which concerns on the distribution disk restoration in Embodiment 1 of this invention. この発明の実施の形態２における分散ストレージシステムの詳細な構成を示すブロック図である。It is a block diagram which shows the detailed structure of the distributed storage system in Embodiment 2 of this invention. 従来の専用ストレージシステムの一例の構成を示す図である。It is a figure which shows the structure of an example of the conventional exclusive storage system. 従来のデータ分散保管システムの構成を示す図である。It is a figure which shows the structure of the conventional data distributed storage system. 従来のデータ分散保管システムのデータ保管にかかわる動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of the operation | movement regarding the data storage of the conventional data distributed storage system. 従来のデータ分散保管システムのデータ再生にかかわる動作の詳細を示すフローチャートである。It is a flowchart which shows the detail of the operation | movement in connection with the data reproduction | regeneration of the conventional data distributed storage system.

Explanation of symbols

１００分散ストレージシステム、１０１分散ディスク、１０２管理端末、１０３管理者ネットワーク、１０４スーパーノード、１０５ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｏｗｏｒｋ）、１０６ユーザ端末、２０１符号化データ記憶部、２０２分散ディスク記憶部１、２０３ファイル管理情報記憶部、２０４分散ディスク状態確認部、２０５分散ディスク復元部、２０６分散ディスク記憶部２、２０７分割符号化部、２０８配信部、２０９ファイル管理部、２１０再生部、２１１スーパーノード記憶部、２１２保管依頼部、２１３読込み依頼部、３０１表示装置、３０２Ｋ／Ｂ、３０３マウス、３０４ＦＤＤ、３０５ＣＤＤ、３１１ＣＰＵ、３１２バス、３１３ＲＯＭ、３１４ＲＡＭ、３１５通信ボード、３２０磁気ディスク装置、３２１ＯＳ、３２２ウィンドウシステム、３２３プログラム群、３２４ファイル群。 100 distributed storage system, 101 distributed disk, 102 management terminal, 103 administrator network, 104 super node, 105 WAN (Wide Area Network), 106 user terminal, 201 encoded data storage unit, 202 distributed disk storage unit 1, 203 file Management information storage unit, 204 Distributed disk status confirmation unit, 205 Distributed disk restoration unit, 206 Distributed disk storage unit 2, 207 Division encoding unit, 208 Distribution unit, 209 File management unit, 210 Playback unit, 211 Super node storage unit, 212 storage request unit, 213 read request unit, 301 display device, 302 K / B, 303 mouse, 304 FDD, 305 CDD, 311 CPU, 312 bus, 313 ROM, 314 RAM, 315 communication box 320, magnetic disk drive, 321 OS, 322 window system, 323 program group, 324 file group.

Claims

A distributed storage system having a super node connected to a plurality of distributed disks and one or a plurality of user terminals via a network and receiving a file storage request and a read request,
The super node is
When receiving the original file to be distributed and the storage request for the original file from the user terminal, a plurality of encoded files obtained by dividing and encoding the original file are generated, and the encoded file names A division encoding unit having a name including a character string for specifying the correspondence with the original file,
A distribution unit that requests storage of each of the plurality of encoded files on a plurality of different distributed disks;
After the plurality of encoded files are stored on a distributed disk, the original file, the plurality of encoded files, information that can specify the original file or the plurality of encoded files, and the plurality of encoded files are stored. A file management unit that deletes information that can identify a plurality of distributed files,
When receiving the original file name and the read request of the original file from the user terminal, it is confirmed whether or not the encoded file generated from the original file is stored in the distributed disk, A reproduction unit that reproduces the original file by obtaining and decoding a predetermined number of the encoded files from the distributed disk whose storage has been confirmed by comparing the file and the encoded file name; Distributed storage system characterized by

The distributed storage system further includes a management terminal connected to the super node via a network,
The file management unit further generates file management information indicating correspondence between the original file, the encoded file, and the distributed disk, transmits the file management information to the management terminal, deletes the file management information that has been transmitted,
The management terminal
A file management information storage unit for storing the file management information received from the file management unit;
Based on the request for restoration of any distributed disk, the encoded file stored in the distributed disk and the original file corresponding to the encoded file are obtained from the file management information stored in the file management information storage unit. Identifying and obtaining other encoded files stored on a distributed disk other than the distributed disk necessary for decoding the original file, decoding the original file from the other encoded files, A distributed disk restoration unit that regenerates the encoded file by dividing and encoding the decoded original file, and requests the storage of the regenerated encoded file in the distributed disk; The distributed storage system according to claim 1.

The management terminal further includes:
Checks whether multiple distributed disks are operating normally, outputs an alarm for distributed disks that could not be confirmed to operate normally, and then started normal operation from the distributed disks that could not be confirmed to operate normally. 3. The distributed storage system according to claim 2, further comprising a distributed disk state confirmation unit that instructs the distributed disk restoration unit to restore the distributed disk that could not confirm the normal operation when the notification is received.

The distributed storage system further includes a plurality of the super nodes and user terminals connected to the plurality of super nodes via a network,
The user terminal is
4. The distributed storage system according to claim 1, wherein an arbitrary one super node that is operating normally is selected from the plurality of super nodes, and a file storage request and a read request are made. .

A distributed storage system having a super node connected to a plurality of distributed disks and one or a plurality of user terminals via a network and receiving a file storage request, a read request, and a distributed disk restoration request,
The super node is
When receiving an original file to be distributed and a storage request for the original file from the user terminal, a division encoding unit that generates a plurality of encoded files obtained by dividing and encoding the original file;
A distribution unit that requests storage of each of the plurality of encoded files on different distributed disks;
A file management unit that generates file management information indicating correspondence between the original file, the encoded file, and the distributed disk;
A file management information storage unit for storing the file management information;
When receiving the original file name and the read request of the original file from the user terminal, the file management information identifies a distributed disk that stores the encoded file generated from the original file, and Then, a normal operation is confirmed, a predetermined number of encoded files are obtained from the confirmed distributed disk, and the playback unit that reproduces the original file and decoding request for any distributed disk are obtained by decoding. Based on the file management information, the encoded file stored in the distributed disk and the original file corresponding to the encoded file are identified from the file management information stored in the file management information storage unit, and the original file is decoded. Identify and capture other encoded files stored on a distributed disk other than the distributed disk Then, the original file is decoded from the other encoded files, the decoded original file is divided and encoded, and the encoded file is regenerated and regenerated on the distributed disk. A distributed storage system comprising: a distributed disk restoration unit that requests storage of the encoded file.

When receiving an original file to be distributed and a storage request for the original file from a user terminal connected via a network, a plurality of encoded files obtained by dividing and encoding the original file are generated. , A division encoding process in which the encoded file name is a name including a character string for specifying correspondence with the original file;
A delivery process for requesting storage of each of the plurality of encoded files to a plurality of different distributed disks connected via a network;
After the plurality of encoded files are stored on a distributed disk, the original file, the plurality of encoded files, information that can specify the original file or the plurality of encoded files, and the plurality of encoded files are stored. File management process to delete information that can identify multiple distributed files,
When an original file name and a request to read the original file are received from a user terminal connected via a network, whether or not the encoded file generated from the original file is stored in a distributed disk Confirming and comparing the original file with the encoded file name, obtains a predetermined number of the encoded files from the distributed disk whose storage has been confirmed, and reproduces the original file by decoding. A distributed storage program characterized by causing a computer to execute a reproduction process.

When an original file to be distributed and a storage request for the original file are received from a user terminal connected via a network, a plurality of encoded files obtained by dividing and encoding the original file are generated. Division coding processing,
A delivery process for requesting storage of each of the plurality of encoded files to different distributed disks connected via a network;
A file management process for generating file management information indicating the correspondence between the original file, the encoded file, and the distributed disk;
A file management information storage process for storing the file management information;
When an original file name and a request to read the original file are received from a user terminal connected via a network, a distributed disk that stores an encoded file generated from the original file is specified by the file management information. The normal operation of the distributed disk is confirmed, a predetermined number of encoded files are obtained from the confirmed distributed disk, and decoding is performed to reproduce the original file. Based on the request to restore the distributed disk, the encoded file stored in the distributed disk and the original file corresponding to the encoded file are identified from the file management information stored in the file management information storage process. Stored on a distributed disk other than the distributed disk required to decrypt the original file Identifying and obtaining the encoded file, decoding the original file from the other encoded files, dividing the decoded original file, and encoding to regenerate the encoded file; A distributed storage program which causes a computer to execute a distributed disk restoration process for requesting storage of the regenerated encoded file in the distributed disk.