JP2014085883A

JP2014085883A - Device, method, and program for data matching

Info

Publication number: JP2014085883A
Application number: JP2012234965A
Authority: JP
Inventors: Yoshimi Ichiyanagi; 淑美一柳; Koji Sato; 孝治佐藤; Hiroyuki Uchiyama; 寛之内山; Koichi Washisaka; 光一鷲坂
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-10-24
Filing date: 2012-10-24
Publication date: 2014-05-12
Anticipated expiration: 2032-10-24
Also published as: JP5475085B1

Abstract

PROBLEM TO BE SOLVED: To match redundant data sets without deteriorating performance of a host application.SOLUTION: A data matching device of the present invention comprises: a comparison unit; and a unification unit. The comparison unit compares data length of redundant data sets, which are copies of a same data set and are stored in a respective plurality of storage units, when a write process by a host application terminates abnormally. When the comparison unit determines that the redundant data sets do not have an identical data length, the unification unit adjusts the data length of all the redundant data sets to a data length of the shortest redundant data set of all the redundant data sets stored in the plurality of storage units.

Description

本発明は、冗長化されたデータ間の整合性を維持する技術に関する。 The present invention relates to a technique for maintaining consistency between redundant data.

従来、データを格納する装置が故障した場合に、当該装置に格納されたデータが破損または消失することを防止するため、複数の装置に同じデータを格納する冗長化が行われている。たとえば、同じデータのレプリカを複数のサーバに分散させて管理する分散型ファイルシステムが実現されている。 Conventionally, when a device that stores data fails, data stored in the device is prevented from being damaged or lost, and redundancy for storing the same data in a plurality of devices has been performed. For example, a distributed file system that manages replicas of the same data distributed to a plurality of servers has been realized.

たとえば、非特許文献１は、分散型ファイルシステムの一例を開示している。このシステムは、各クラスタを１つのマスタと複数のサーバとで構成する。複数のクライアントがクラスタにアクセスする。クラスタにファイルを格納するときは、ファイルを固定長の複数のブロックに分割し、ブロック１つ１つの複製をサーバに格納する。このように、ファイルを複製して冗長化し、複数サーバ上に分散して保持することにより、耐故障性を向上させ、複数サーバにおける並列処理を可能にして処理効率の向上を図ることができる。 For example, Non-Patent Document 1 discloses an example of a distributed file system. In this system, each cluster is composed of one master and a plurality of servers. Multiple clients access the cluster. When storing a file in a cluster, the file is divided into a plurality of fixed-length blocks, and a copy of each block is stored in the server. Thus, by duplicating files to make them redundant and holding them distributed on a plurality of servers, fault tolerance can be improved, parallel processing on a plurality of servers can be performed, and processing efficiency can be improved.

Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung著、「The Google File System」、19th ACM Symposium on Operating Systems Principles、２００３年１０月、p. 20-43Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, “The Google File System”, 19th ACM Symposium on Operating Systems Principles, October 2003, p. 20-43 Vijayan Prabhakaran, Lakshmi N. Bairavasundaram, Nitin Agrawal, Haryadi S. Gunawi, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau著、「IRON File Systems」、SOSP ’05 Proceedings of the twentieth ACM symposium on Operating systems principles、２００５年１０月、p. 206-220Vijayan Prabhakaran, Lakshmi N. Bairavasundaram, Nitin Agrawal, Haryadi S. Gunawi, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau, "IRON File Systems", SOSP '05 Proceedings of the twentieth ACM symposium on Operating systems principles, October 2005, p. 206-220

しかしながら、上記の技術においては、冗長化した複数のデータ（以下、「冗長データ」とも呼ぶ。冗長データは、複製元のデータおよび複製先のデータを含む。）間での整合性を維持する際に、処理性能が低下するという問題があった。 However, in the above technique, when maintaining consistency between a plurality of redundant data (hereinafter also referred to as “redundant data”. The redundant data includes data at the copy source and data at the copy destination). In addition, there is a problem that the processing performance deteriorates.

たとえば、ファイルシステムのひとつのサーバへの書込処理が途中で異常終了した場合、当該サーバに格納されたデータと、他のサーバに格納されたデータとの間でデータが不一致となる。かかる問題に対処する手法として、トランザクション処理やチェックサムを利用して不一致状態を検知し自動的に冗長データを相互に一致させる手法や、上位アプリケーションから書込処理を再実行して冗長データを相互に一致させる手法が存在する。 For example, when the writing process to one server of the file system ends abnormally in the middle, the data stored in the server does not match the data stored in the other server. To deal with this problem, transaction processing and checksums are used to detect inconsistencies and automatically match redundant data with each other. Redundant data is re-executed from a higher-level application to exchange redundant data. There is a method to match.

しかしながら、トランザクション処理を利用した不一致状態の検知手法では、冗長データの同一性は維持されるが、ファイルシステム上のファイルにデータを書き込む際の処理手順が多くなり、上位アプリケーションの書込性能が低下する。 However, with the mismatch detection method using transaction processing, the identity of redundant data is maintained, but the processing procedure for writing data to a file on the file system increases, and the writing performance of the higher-level application decreases. To do.

これに対して、たとえば、非特許文献２が開示する、チェックサムを利用することで不一致状態を検知する手法では、まず、冗長データのチェックサムを取得しておく。そして、定期的または冗長データへのアクセスの際または書込処理が失敗した際などに、複数の冗長データ間でチェックサムを比較して、チェックサムが異なる場合には、冗長データが同一になるように冗長データの書き換えを行う。 On the other hand, for example, in the technique disclosed in Non-Patent Document 2 for detecting a mismatch state using a checksum, first, a checksum of redundant data is acquired. When the checksums are compared between the plurality of redundant data, for example, periodically or when accessing the redundant data or when the writing process fails, the redundant data becomes the same. Redundant data is rewritten as described above.

しかし、チェックサムを利用した不一致状態の検知手法では、上位アプリケーションからファイルシステム上のファイルにアクセスする処理と、ファイルシステム内部のチェックサムを比較する処理が競合する。このため、やはり、ファイルシステムの書込処理および読出処理の性能が低下する。 However, in the mismatch detection method using the checksum, the process of accessing the file on the file system from the upper application and the process of comparing the checksum in the file system conflict. For this reason, the performance of the write processing and read processing of the file system also deteriorates.

上位アプリケーションから書込処理を再実行する手法では、書込みが失敗した場合、上位アプリケーションが再度、失敗した書込の対象データを書き込む。この手法によれば、冗長データの整合性維持にかかる負荷を上位アプリケーションが制御することができ、自動的に冗長データの整合性を維持する手法に比べて上位アプリケーションの書込性能低下を抑制することができる。 In the method of re-execution of the writing process from the upper application, when the writing fails, the upper application writes the target data of the failed writing again. According to this method, the load applied to maintaining the consistency of redundant data can be controlled by the upper application, and the deterioration of the writing performance of the upper application is suppressed compared to the method of automatically maintaining the consistency of redundant data. be able to.

しかし、上位アプリケーションから書込処理を再実行する手法では、書込処理を行っている上位アプリケーションが異常終了した場合、再度書込処理を行うためには、上位アプリケーション自身がローカルファイルなどのデータ永続化装置に、処理結果を書き込んでおく必要がある。このため、上位アプリケーションの書込処理時間が長くなり、上位アプリケーションの書込性能が低下してしまう。 However, in the method of re-execution of the writing process from the upper application, if the upper application that is performing the writing process terminates abnormally, the upper application itself has to make a permanent file such as a local file in order to perform the writing process again. The processing result needs to be written in the processing apparatus. For this reason, the write processing time of the upper application becomes longer, and the write performance of the upper application is degraded.

本発明は、上記に鑑みてなされたものであって、上位アプリケーションの性能を低下させることなく、冗長データを整合させることができるデータ整合装置、データ整合方法およびデータ整合プログラムを提供することを目的とする。 The present invention has been made in view of the above, and an object of the present invention is to provide a data matching apparatus, a data matching method, and a data matching program capable of matching redundant data without degrading the performance of a higher-level application. And

上述した課題を解決し、目的を達成するために、本発明は、複数の記憶部それぞれに格納される、同一のデータの複製である冗長データのデータ長を、上位アプリケーションからの書込処理が異常終了した場合に比較し、前記冗長データのデータ長が同一ではないと判定した場合に、前記複数の記憶部に格納された前記冗長データのうち最も短い冗長データのデータ長に他の冗長データのデータ長を揃えることを特徴とする。 In order to solve the above-described problems and achieve the object, the present invention provides a data length of redundant data, which is a duplicate of the same data stored in each of a plurality of storage units, by a write process from a higher-level application. When it is determined that the data length of the redundant data is not the same as in the case of abnormal termination, other redundant data is added to the data length of the shortest redundant data among the redundant data stored in the plurality of storage units. It is characterized by the same data length.

本発明にかかるデータ整合装置、データ整合方法およびデータ整合プログラムは、上位アプリケーションの性能を低下させることなく、冗長データを整合させることができるという効果を奏する。 The data matching apparatus, the data matching method, and the data matching program according to the present invention have an effect that redundant data can be matched without lowering the performance of the host application.

図１は、本発明の第１の実施形態に係るデータ整合装置の概要を示すブロック図である。FIG. 1 is a block diagram showing an outline of a data matching apparatus according to the first embodiment of the present invention. 図２は、第１の実施形態に係るデータ整合装置のデータ整合処理の流れの一例を示すフローチャートである。FIG. 2 is a flowchart showing an example of the flow of data matching processing of the data matching apparatus according to the first embodiment. 図３は、本発明の第２の実施形態に係るデータ整合システムの概要を示すブロック図である。FIG. 3 is a block diagram showing an outline of a data matching system according to the second embodiment of the present invention. 図４−１は、第２の実施形態に係る冗長データ位置テーブルに格納する情報の一例を説明するための図である。FIG. 4A is a diagram for explaining an example of information stored in the redundant data position table according to the second embodiment. 図４−２は、第２の実施形態に係る冗長データ位置テーブルに格納する情報とデータを格納するディスクとの対応を説明するための図である。FIG. 4B is a diagram for explaining the correspondence between the information stored in the redundant data position table according to the second embodiment and the disk storing the data. 図５は、第２の実施形態に係る処理対象リストに格納するデータの一例を示す図である。FIG. 5 is a diagram illustrating an example of data stored in the processing target list according to the second embodiment. 図６は、第２の実施形態に係るデータ配置制御部による、書込処理時のデータ配置制御処理の流れの一例を示すフローチャートである。FIG. 6 is a flowchart illustrating an example of the flow of the data arrangement control process during the writing process by the data arrangement control unit according to the second embodiment. 図７は、第２の実施形態に係るデータ配置制御部による、書込制御処理の流れの一例を示すフローチャートである。FIG. 7 is a flowchart illustrating an example of a flow of write control processing by the data arrangement control unit according to the second embodiment. 図８は、第２の実施形態に係る書込部による書込処理の流れの一例を示すフローチャートである。FIG. 8 is a flowchart illustrating an example of a flow of a writing process by the writing unit according to the second embodiment. 図９は、第２の実施形態に係るデータアクセス部における書込実行前処理の流れの一例を示すフローチャートである。FIG. 9 is a flowchart illustrating an example of a flow of pre-write execution processing in the data access unit according to the second embodiment. 図１０は、第２の実施形態に係るデータアクセス部における書込実行処理の流れの一例を示すフローチャートである。FIG. 10 is a flowchart illustrating an example of the flow of the write execution process in the data access unit according to the second embodiment. 図１１は、第２の実施形態に係るデータ整合システムにおける正常時の処理の流れの一例を概括的に示す図である。FIG. 11 is a diagram schematically showing an example of a normal processing flow in the data matching system according to the second embodiment. 図１２は、第２の実施形態に係るデータ配置制御部による、読出処理時のデータ配置制御処理の流れの一例を示すフローチャートである。FIG. 12 is a flowchart illustrating an example of the flow of the data arrangement control process during the reading process by the data arrangement control unit according to the second embodiment. 図１３は、第２の実施形態に係る読出部による読出処理の流れの一例を示すフローチャートである。FIG. 13 is a flowchart illustrating an example of a flow of reading processing by the reading unit according to the second embodiment. 図１４は、第２の実施形態に係るデータアクセス部における読出実行処理の流れの一例を示すフローチャートである。FIG. 14 is a flowchart illustrating an example of the flow of read execution processing in the data access unit according to the second embodiment. 図１５は、第２の実施形態に係るデータアクセス部が、書込処理中に異常終了した場合の冗長データの状態を説明するための図である。FIG. 15 is a diagram for explaining the state of redundant data when the data access unit according to the second embodiment abnormally ends during the writing process. 図１６は、第２の実施形態に係るデータ整合装置におけるデータ整合処理の流れの一例を示すフローチャートである。FIG. 16 is a flowchart illustrating an example of the flow of data matching processing in the data matching apparatus according to the second embodiment. 図１７は、第２の実施形態に係るデータ整合装置における同一化処理の流れの一例を示すフローチャートである。FIG. 17 is a flowchart illustrating an example of the flow of the identification process in the data matching apparatus according to the second embodiment. 図１８は、データ整合システムによる一連の処理を実行するプログラムであるデータ整合プログラムによる情報処理が、コンピュータを用いて具体的に実現されることを示す図である。FIG. 18 is a diagram showing that the information processing by the data matching program, which is a program for executing a series of processes by the data matching system, is specifically realized using a computer.

以下に、本発明にかかるデータ整合装置、データ整合方法およびデータ整合プログラムの実施形態を図面に基づいて詳細に説明する。なお、この実施形態によりこの発明が限定されるものではない。 Hereinafter, embodiments of a data matching apparatus, a data matching method, and a data matching program according to the present invention will be described in detail with reference to the drawings. In addition, this invention is not limited by this embodiment.

［第１の実施形態］
［データ整合装置の構成の一例］
図１は、本発明の第１の実施形態に係るデータ整合装置１０の概要を示すブロック図である。図１を参照して、第１の実施形態に係るデータ整合装置１０について説明する。第１の実施形態に係るデータ整合装置１０は、同一のデータを複製して作成した複数の冗長データを異なる記憶装置に記憶させる場合に生じた冗長データの不整合を検出して、冗長データ同士のデータ長を一致させる。 [First Embodiment]
[Example of configuration of data matching device]
FIG. 1 is a block diagram showing an overview of a data matching apparatus 10 according to the first embodiment of the present invention. A data matching apparatus 10 according to the first embodiment will be described with reference to FIG. The data matching apparatus 10 according to the first embodiment detects an inconsistency of redundant data that occurs when a plurality of redundant data created by duplicating the same data is stored in different storage devices. Match the data length of the.

図１に示すように、データ整合装置１０は、比較部１１と同一化部１２とを備える。比較部１１は、データの複製を作成して冗長化した冗長データをそれぞれ格納する複数の記憶部（図示せず）に対して、冗長データのデータ長を比較する比較処理を実行する。同一化部１２は、比較処理の結果、比較部１１が冗長データのデータ長が同一ではないと判定した場合に、当該冗長データのうち、最も短いデータ長の冗長データに他の冗長データのデータ長を揃える。 As shown in FIG. 1, the data matching apparatus 10 includes a comparison unit 11 and an identification unit 12. The comparison unit 11 performs a comparison process for comparing the data lengths of the redundant data with respect to a plurality of storage units (not shown) that respectively store redundant data created by duplicating data. If the comparison unit 11 determines that the data length of the redundant data is not the same as a result of the comparison process, the identification unit 12 adds the data of the other redundant data to the redundant data having the shortest data length among the redundant data. Align the length.

データ整合装置１０は、たとえば、分散型ファイルシステムのマスタ装置であってよい。また、たとえば、データ整合装置１０の機能は、クライアントライブラリによって実現されるように構成してもよい。ただし、分散型ファイルシステムに限らず、複数の独立した記憶部を有し、記憶部各々に同一の複製データを格納することでシステムの冗長性を確保するシステムに適用することができる。たとえば、外部からのアクセスによって各記憶部に対する並列処理が実行され、複数の外部からの命令を調整して処理するシステムに適用することができる。 The data matching device 10 may be, for example, a master device of a distributed file system. Further, for example, the function of the data matching apparatus 10 may be configured to be realized by a client library. However, the present invention is not limited to a distributed file system, and can be applied to a system that has a plurality of independent storage units and stores the same replication data in each storage unit to ensure system redundancy. For example, the present invention can be applied to a system in which parallel processing for each storage unit is executed by external access, and a plurality of external instructions are adjusted and processed.

また、図１には図示しないが、データ整合装置１０は、冗長データを格納する複数の記憶部や、記憶部に対するデータの読出要求や書込要求を送信する情報処理装置等とネットワーク等を介して接続される。複数の記憶部は例えば、物理サーバと当該サーバに接続されたディスク装置や、データセンタ内に仮想的に構築された仮想サーバと仮想ストレージ等であってよい。また、情報処理装置は、物理サーバやデータセンタを利用するクライアント装置であってよい。 Although not shown in FIG. 1, the data matching device 10 is connected to a plurality of storage units for storing redundant data, an information processing device for transmitting data read requests and write requests to the storage units, and the like via a network. Connected. The plurality of storage units may be, for example, a physical server and a disk device connected to the server, a virtual server and a virtual storage that are virtually constructed in the data center, and the like. Further, the information processing apparatus may be a client apparatus that uses a physical server or a data center.

また、ネットワークとして、ワイドエリアネットワーク（ＷＡＮ）、ローカルエリアネットワーク（ＬＡＮ）等任意のネットワークを使用できる。 Further, any network such as a wide area network (WAN) or a local area network (LAN) can be used as the network.

［データ整合処理の流れの一例］
図２は、第１の実施形態に係るデータ整合装置１０のデータ整合処理の流れの一例を示すフローチャートである。図２を参照して、第１の実施形態に係るデータ整合装置１０によるデータ整合処理の流れの一例を説明する。 [Example of data consistency processing flow]
FIG. 2 is a flowchart illustrating an example of the flow of data matching processing of the data matching apparatus 10 according to the first embodiment. With reference to FIG. 2, an example of the flow of data matching processing by the data matching apparatus 10 according to the first embodiment will be described.

図２に示すように、データ整合装置１０はまず、整合要求を受ける（ステップＳ２１）。整合要求はたとえば、データ整合装置１０に接続され、記憶部に格納されたデータを利用するクライアントから発行される。クライアントはたとえば、記憶部にデータを書き込む処理が異常終了した場合等に、データ整合装置１０に対して整合要求を発行する。ただし、整合要求は、クライアントにおいて記憶部への書込処理が異常終了したことを検出した場合に、自動的にデータ整合装置１０に発行されるものとしてもよい。 As shown in FIG. 2, the data matching apparatus 10 first receives a matching request (step S21). For example, the matching request is issued from a client connected to the data matching apparatus 10 and using data stored in the storage unit. For example, the client issues a matching request to the data matching apparatus 10 when, for example, the process of writing data in the storage unit ends abnormally. However, the matching request may be automatically issued to the data matching apparatus 10 when it is detected that the writing process to the storage unit has ended abnormally in the client.

整合要求は、比較部１１に渡される。比較部１１は、整合要求によって特定される冗長データを格納する複数の記憶部にアクセスし、格納された冗長データのデータ長を抽出する（ステップＳ２２）。そして比較部１１は、抽出したデータ長を相互に比較する（ステップＳ２３）。比較部１１がデータ長は同一であると判定した場合（ステップＳ２４、肯定）、データ整合処理は終了する。他方、比較部１１がデータ長は同一ではないと判定した場合（ステップＳ２４、否定）、比較部１１は処理を同一化部１２に渡す。同一化部１２は、比較部１１がデータ長を比較した冗長データのデータ長を揃える（ステップＳ２５）。すなわち、同一化部１２は、複数の冗長データのうち、最も短いデータ長の冗長データに他の冗長データのデータ長を揃える。これによってデータ整合処理は終了する。 The matching request is passed to the comparison unit 11. The comparison unit 11 accesses a plurality of storage units that store redundant data specified by the matching request, and extracts the data length of the stored redundant data (step S22). Then, the comparison unit 11 compares the extracted data lengths with each other (step S23). When the comparison unit 11 determines that the data lengths are the same (Yes at Step S24), the data matching process ends. On the other hand, when the comparison unit 11 determines that the data lengths are not the same (No at Step S24), the comparison unit 11 passes the processing to the identification unit 12. The identification unit 12 aligns the data lengths of the redundant data compared by the comparison unit 11 (step S25). That is, the identifying unit 12 aligns the data length of other redundant data with the redundant data having the shortest data length among the plurality of redundant data. This completes the data matching process.

［第１の実施形態の効果］
このように、第１の実施形態に係るデータ整合装置１０は、複数の記憶部それぞれに格納される、同一のデータの複製である冗長データのデータ長を比較する比較部１１と、比較部１１が冗長データのデータ長が同一ではないと判定した場合に、複数の記憶部に格納された冗長データのうち最も短い冗長データのデータ長に他の冗長データのデータ長を揃える同一化部１２と、を備える。このため、第１の実施形態に係るデータ整合装置１０は、複数の記憶部が格納する冗長データ間で整合性を確保することができる。 [Effect of the first embodiment]
As described above, the data matching apparatus 10 according to the first embodiment includes the comparison unit 11 that compares the data length of redundant data that is a duplicate of the same data stored in each of the plurality of storage units, and the comparison unit 11. When the data lengths of the redundant data are not the same, the identification unit 12 aligns the data length of the other redundant data to the data length of the shortest redundant data among the redundant data stored in the plurality of storage units. . For this reason, the data matching apparatus 10 according to the first embodiment can ensure consistency among redundant data stored in a plurality of storage units.

また、データ整合装置１０は、整合要求を受けたときにデータ整合処理を実行し、特に必要がない場合は冗長データ間の不整合に対処しないように構成してもよい。このように構成することで、クライアント側が利用するアプリケーションのニーズに合わせて、冗長データの整合性制御を実行することができる。 Further, the data matching apparatus 10 may be configured to perform data matching processing when receiving a matching request, and not to deal with inconsistencies between redundant data unless particularly necessary. With this configuration, it is possible to execute redundancy data consistency control in accordance with the needs of applications used on the client side.

たとえば、クライアント側が利用するアプリケーションが多少の冗長データの不整合があっても問題なく動作できるものであれば、クライアントは整合要求を発行せずに処理を継続する。そして、クライアントまたはデータ整合装置１０にかかる処理負荷の低いときに整合要求を発行してデータ整合処理を実行させることができる。また、クライアント側が利用するアプリケーションが冗長データの高い整合性を要求するものである場合は、書込動作が予期せず終了した場合等に、クライアントが直ちに整合要求を発行することで、冗長データの整合性を確保することができる。 For example, if an application used on the client side can operate without any problem even if there is some inconsistency in redundant data, the client continues processing without issuing an alignment request. Then, when the processing load on the client or the data matching apparatus 10 is low, a matching request can be issued to execute the data matching process. In addition, if the application used on the client side requires high consistency of redundant data, the client immediately issues a consistency request when the write operation is terminated unexpectedly. Consistency can be ensured.

また、データ整合装置１０は、書込処理の異常終了が発生した場合は常にデータ整合処理を実行するように設定してもよい。また、データ整合装置１０は、予め定められたアプリケーションからの書込要求に対応した書込処理が異常終了した場合はデータ整合処理を実行し、他のアプリケーションからの書込要求に対応した書込処理が異常終了した場合はデータ整合処理を実行しないように設定してもよい。 Further, the data matching apparatus 10 may be set to always execute the data matching process when an abnormal end of the writing process occurs. In addition, the data matching apparatus 10 executes the data matching process when the writing process corresponding to the writing request from the predetermined application is abnormally terminated, and the writing corresponding to the writing request from the other application is performed. It may be set so that the data matching process is not executed when the process ends abnormally.

このように、第１の実施形態のデータ整合装置１０は、上位アプリケーションに係る負荷を抑制して処理性能の低下を防止しつつ、複数の記憶部に格納された冗長データ間の整合性を確保することができる。 As described above, the data matching apparatus 10 according to the first embodiment ensures consistency between redundant data stored in a plurality of storage units while suppressing a load related to a higher-level application and preventing a decrease in processing performance. can do.

［第２の実施形態］
図３は、本発明の第２の実施形態に係るデータ整合システム１の概要を示すブロック図である。図３を参照して、第２の実施形態に係るデータ整合システム１の構成の概要について説明する。 [Second Embodiment]
FIG. 3 is a block diagram showing an outline of the data matching system 1 according to the second embodiment of the present invention. With reference to FIG. 3, the outline of the configuration of the data matching system 1 according to the second embodiment will be described.

［データ整合システム］
図３に示すように、データ整合システム１は、マスタ１００と、サーバ２００Ａ，２００Ｂ，２００Ｃと、ディスク３００Ａ，３００Ｂ，３００Ｃと、クライアント４００Ａ，４００Ｂ，４００Ｃと、を備える。マスタ１００は、ネットワークを介して、サーバ２００Ａ，２００Ｂ，２００Ｃおよびクライアント４００Ａ，４００Ｂ，４００Ｃと接続され、相互に情報の送受信を行う。また、クライアント４００Ａ，４００Ｂ，４００Ｃは、マスタ１００から送信された情報に基づき、サーバ２００Ａ，２００Ｂ，２００Ｃに対して適宜、読出命令、書込命令、またはデータ整合命令を送信する。マスタ１００は、クライアント４００Ａ，４００Ｂ，４００Ｃからデータの読み出しまたは書込の要求を受信し、クライアント４００Ａ，４００Ｂ，４００Ｃに対して、読み出しまたは書込みまたはデータ整合化のための情報を送信する。サーバ２００Ａ，２００Ｂ，２００Ｃは、クライアント４００Ａ，４００Ｂ，４００Ｃからの命令に基づき、ディスク３００Ａ，３００Ｂ，３００Ｃに対して読出処理または書込処理またはデータ整合処理を実行する。 [Data alignment system]
As shown in FIG. 3, the data matching system 1 includes a master 100, servers 200A, 200B, and 200C, disks 300A, 300B, and 300C, and clients 400A, 400B, and 400C. The master 100 is connected to the servers 200A, 200B, and 200C and the clients 400A, 400B, and 400C via the network, and transmits and receives information to and from each other. Further, the clients 400A, 400B, and 400C appropriately transmit a read command, a write command, or a data matching command to the servers 200A, 200B, and 200C based on the information transmitted from the master 100. The master 100 receives a request for reading or writing data from the clients 400A, 400B, and 400C, and transmits information for reading or writing or data alignment to the clients 400A, 400B, and 400C. Servers 200A, 200B, and 200C execute read processing, write processing, or data matching processing on disks 300A, 300B, and 300C based on instructions from clients 400A, 400B, and 400C.

マスタ１００はたとえば、分散型ファイルシステムのマスタである。サーバ２００Ａ，２００Ｂ，２００Ｃはたとえば、分散型ファイルシステムのチャンクサーバである。クライアント４００Ａ，４００Ｂ，４００Ｃはたとえば、分散型ファイルシステムを利用するユーザである。 The master 100 is, for example, a distributed file system master. Servers 200A, 200B, and 200C are, for example, chunk servers of a distributed file system. Clients 400A, 400B, and 400C are, for example, users who use a distributed file system.

また、マスタ１００は、インタークラウドシステムのインタークラウドサーバとして構成してもよい。その場合、サーバ２００Ａ，２００Ｂ，２００Ｃおよびディスク３００Ａ，３００Ｂ，３００Ｃは、クラウドシステムによって仮想的に構成してもよい。また、マスタ１００、サーバ２００Ａ，２００Ｂ，２００Ｃ、ディスク３００Ａ，３００Ｂ，３００Ｃは、１つのデータセンタ内に構築したものであってもよいし、複数のデータセンタに分散して構築したものでもよい。 The master 100 may be configured as an intercloud server of an intercloud system. In that case, the servers 200A, 200B, and 200C and the disks 300A, 300B, and 300C may be configured virtually by a cloud system. Further, the master 100, the servers 200A, 200B, and 200C and the disks 300A, 300B, and 300C may be constructed in one data center or may be constructed in a distributed manner in a plurality of data centers.

サーバ２００Ａ，２００Ｂ，２００Ｃは、マスタ１００およびクライアント４００Ａ，４００Ｂ，４００Ｃからの命令に応じて、各々の配下のディスク３００Ａ，３００Ｂ，３００Ｃにアクセスし、データの読み出しや書込みを行う。サーバ２００Ａ，２００Ｂ，２００Ｃはそれぞれ、配下のディスク３００Ａ，３００Ｂ，３００Ｃへのアクセスを制御するためのデータアクセス部２０１Ａ，２０１Ｂ，２０１Ｃを備える。 The servers 200A, 200B, and 200C access the subordinate disks 300A, 300B, and 300C in accordance with commands from the master 100 and the clients 400A, 400B, and 400C, and read and write data. Each of the servers 200A, 200B, and 200C includes data access units 201A, 201B, and 201C for controlling access to the subordinate disks 300A, 300B, and 300C.

サーバ２００Ａ，２００Ｂ，２００Ｃは、いずれかのサーバに障害が発生した場合でもデータが失われないように、それぞれ同一のデータを冗長データとして格納する。クライアント４００Ａ，４００Ｂ，４００Ｃは、サーバ２００Ａ，２００Ｂ，２００Ｃのいずれかにアクセスすることで同一のデータに対して並列処理を実行することができる。なお、図３には、３つのサーバ２００Ａ，２００Ｂ，２００Ｃを示すが、サーバの数は３に限定されない。また、冗長データを格納するディスクおよびサーバの数は、第２の実施形態では３として説明するが、たとえば１０のサーバを備えるデータ整合システムの場合、同じデータの冗長データは１０のサーバのうち適宜選択した３つのサーバの配下のディスクに格納する等としてよい。どのサーバのディスクに冗長データを格納するかは、マスタ１００が決定制御してクライアント４００Ａ，４００Ｂ，４００Ｃに指示する。ただし、クライアント４００Ａ，４００Ｂ，４００Ｃが冗長データを格納するサーバを指定するように構成してもよい。 Each of the servers 200A, 200B, and 200C stores the same data as redundant data so that the data is not lost even if a failure occurs in any of the servers. The clients 400A, 400B, and 400C can execute parallel processing on the same data by accessing any of the servers 200A, 200B, and 200C. 3 shows three servers 200A, 200B, and 200C, the number of servers is not limited to three. In the second embodiment, the number of disks and servers that store redundant data will be described as 3. In the case of a data matching system including 10 servers, for example, redundant data of the same data is appropriately selected from 10 servers. It may be stored in a disk under the selected three servers. The master 100 determines and controls which server's disk stores redundant data, and instructs the clients 400A, 400B, and 400C. However, the clients 400A, 400B, and 400C may be configured to designate a server that stores redundant data.

ディスク３００Ａ，３００Ｂ，３００Ｃは、データを記憶するデータベース等の記憶装置である。ディスク３００Ａ，３００Ｂ，３００Ｃへのアクセスはそれぞれサーバ２００Ａ，２００Ｂ，２００Ｃにより制御される。 The disks 300A, 300B, and 300C are storage devices such as a database that stores data. Access to the disks 300A, 300B, and 300C is controlled by the servers 200A, 200B, and 200C, respectively.

クライアント４００Ａ，４００Ｂ，４００Ｃは、ディスク３００Ａ，３００Ｂ，３００Ｃに格納されたデータを利用する。たとえば、クライアント４００Ａ，４００Ｂ，４００Ｃは、インタークラウドシステムを利用するユーザ端末である。クライアント４００Ａ，４００Ｂ，４００Ｃはそれぞれ、アプリケーション実行部４０１Ａ，４０１Ｂ，４０１Ｃ、書込部４０２Ａ，４０２Ｂ，４０２Ｃ、読出部４０３Ａ，４０３Ｂ，４０３Ｃ、データ整合装置４１０Ａ，４１０Ｂ，４１０Ｃを備える。アプリケーション実行部４０１Ａ，４０１Ｂ，４０１Ｃはそれぞれ、クライアントのアプリケーションを実行する機能部である。書込部４０２Ａ，４０２Ｂ，４０２Ｃ、読出部４０３Ａ，４０３Ｂ，４０３Ｃおよびデータ整合装置４１０Ａ，４１０Ｂ，４１０Ｃはそれぞれ、アプリケーション実行部４０１Ａ，４０１Ｂ，４０１Ｃの制御の下、ディスク３００Ａ，３００Ｂ，３００Ｃへの書込命令、読出命令、データ整合命令を送信する。 Clients 400A, 400B, and 400C use data stored in disks 300A, 300B, and 300C. For example, the clients 400A, 400B, and 400C are user terminals that use an intercloud system. Each of the clients 400A, 400B, and 400C includes application execution units 401A, 401B, and 401C, writing units 402A, 402B, and 402C, reading units 403A, 403B, and 403C, and data matching devices 410A, 410B, and 410C. Each of the application execution units 401A, 401B, and 401C is a functional unit that executes a client application. The writing units 402A, 402B, and 402C, the reading units 403A, 403B, and 403C and the data matching devices 410A, 410B, and 410C are respectively written to the disks 300A, 300B, and 300C under the control of the application execution units 401A, 401B, and 401C. Send command, read command, data match command.

［データ整合装置の構成の一例］
クライアント４００Ａ，４００Ｂ，４００Ｃはそれぞれ、データ整合装置４１０Ａ，４１０Ｂ，４１０Ｃを備える。データ整合装置４１０Ａ，４１０Ｂ，４１０Ｃにつき、以下に更に説明する。データ整合装置４１０（以下、「データ整合装置４１０」は、データ整合装置４１０Ａ，４１０Ｂ，４１０Ｃのいずれか１つを示す）は、比較部４１２（以下、「比較部４１２」は、比較部４１２Ａ，４１２Ｂ，４１２Ｃのいずれか１つを示す）と、同一化部４１３（以下、「同一化部４１３」は、同一化部４１３Ａ，４１３Ｂ，４１３Ｃのいずれか１つを示す）と、を備える。 [Example of configuration of data matching device]
The clients 400A, 400B, and 400C include data matching devices 410A, 410B, and 410C, respectively. The data matching devices 410A, 410B, and 410C will be further described below. The data matching device 410 (hereinafter, “data matching device 410” indicates any one of the data matching devices 410A, 410B, 410C) is compared with the comparison unit 412 (hereinafter, “comparison unit 412” is the comparison unit 412A, 412B and 412C) and an identifying unit 413 (hereinafter, “identifying unit 413” represents any one of the identifying units 413A, 413B, and 413C).

比較部４１２は、クライアントからの整合要求に応じて比較処理を実行する。比較部４１２はたとえば、クライアントが要求した書込処理が正常に終了しなかった場合に比較処理を実行する。第２の実施形態のマスタ１００では、データの冗長性を確保するため、サーバ２００Ａ，２００Ｂ，２００Ｃにそれぞれ同一のデータを格納させる。しかし、書込処理が異常終了した場合等、サーバ２００Ａに格納されたデータとサーバ２００Ｂに格納されたデータに不一致が生じていることがある。また、書込が異常終了した場合、いずれかのサーバが外部からアクセスできない状態となり、他のサーバについてもデータが適切に書込完了したか否かを外部（クライアント）から知ることができない場合がある。そこで、比較部４１２は、異常終了した書込処理の書込対象データを、アクセス可能なサーバから検出して、それぞれのデータ長を比較する比較処理を実行する。 The comparison unit 412 performs comparison processing in response to a matching request from the client. For example, the comparison unit 412 executes the comparison process when the writing process requested by the client is not normally completed. In the master 100 of the second embodiment, the same data is stored in the servers 200A, 200B, and 200C in order to ensure data redundancy. However, there may be a discrepancy between the data stored in the server 200A and the data stored in the server 200B, such as when the writing process ends abnormally. In addition, when the writing is abnormally terminated, one of the servers cannot be accessed from the outside, and it may not be possible to know from the outside (client) whether or not the data has been properly written to other servers. is there. Therefore, the comparison unit 412 detects the write target data of the abnormally terminated write process from the accessible server, and executes a comparison process for comparing the respective data lengths.

同一化部４１３は、比較部４１２が複数のサーバに格納された冗長データを比較した結果、データ長に不一致が検出された場合に、それらのデータのうち、最もデータ長が短いデータのデータ長に他の冗長データの長さを揃えるデータ同一化処理を実行する。なお、比較部４１２および同一化部４１３による処理についてはさらに詳細に後述する。 When the comparison unit 412 compares the redundant data stored in a plurality of servers and the mismatch is detected in the data length, the identification unit 413 compares the data length of the data with the shortest data length. In addition, data equalization processing is performed to align the lengths of other redundant data. Note that the processing by the comparison unit 412 and the identification unit 413 will be described in more detail later.

［マスタの構成の一例］
マスタ１００の構成の一例につき説明する。図３に示すようにマスタ１００は、データ配置制御部１１０と、記憶部１２０と、入出力部１３０と、を備える。データ配置制御部１１０は、マスタ１００における各部の機能および動作を制御する。記憶部１２０は、マスタ１００における処理に使用する情報および処理の結果として生成される情報を適宜格納する。入出力部１３０は、外部とマスタ１００との間での情報のやりとりを行う。入出力部１３０が受信した情報は適宜、データ配置制御部１１０および記憶部１２０に渡される。 [Example of master configuration]
An example of the configuration of the master 100 will be described. As shown in FIG. 3, the master 100 includes a data arrangement control unit 110, a storage unit 120, and an input / output unit 130. The data arrangement control unit 110 controls the function and operation of each unit in the master 100. The storage unit 120 appropriately stores information used for processing in the master 100 and information generated as a result of the processing. The input / output unit 130 exchanges information between the outside and the master 100. Information received by the input / output unit 130 is appropriately transferred to the data arrangement control unit 110 and the storage unit 120.

データ配置制御部１１０は、データの書込みおよび読み出しを行う場所を判定し、冗長データを識別するための情報や冗長データの所在を示す情報を記憶部１２０に記憶する処理を実行する。また、サーバ２００Ａ，２００Ｂ，２００Ｃにおける読出処理および書込処理およびデータ整合処理の結果に応じて、記憶部１２０に格納された情報を更新する。また、データ配置制御部１１０は、クライアント４００Ａ，４００Ｂ，４００Ｃからの要求に応じて、データの書込みおよび読み出しおよびデータ整合化に必要な情報をクライアント４００Ａ，４００Ｂ，４００Ｃに送信する。 The data arrangement control unit 110 determines a place where data is written and read, and executes processing for storing in the storage unit 120 information for identifying redundant data and information indicating the location of redundant data. Further, the information stored in the storage unit 120 is updated according to the results of the read process, the write process, and the data matching process in the servers 200A, 200B, and 200C. Further, the data arrangement control unit 110 transmits information necessary for writing and reading data and data alignment to the clients 400A, 400B, and 400C in response to requests from the clients 400A, 400B, and 400C.

［記憶部の構成および記憶する情報の一例］
次に、記憶部１２０の構成および記憶部１２０に記憶する情報の一例につき説明する。図３に示すように、記憶部１２０は、冗長データ位置テーブル１２１と、処理対象リスト（リースリスト）１２２と、を備える。 [Example of configuration of storage unit and stored information]
Next, the configuration of the storage unit 120 and an example of information stored in the storage unit 120 will be described. As illustrated in FIG. 3, the storage unit 120 includes a redundant data position table 121 and a processing target list (lease list) 122.

冗長データ位置テーブル１２１は、各サーバ２００Ａ，２００Ｂ，２００Ｃに格納したデータの識別子（ＩＤ：Identifier）と、当該データを格納したサーバの情報とを記憶する。図４−１は、冗長データ位置テーブル１２１に格納する情報の一例を説明するための図である。図４−２は、冗長データ位置テーブル１２１に格納する情報とデータを格納するディスクとの対応を説明するための図である。 The redundant data position table 121 stores an identifier (ID: Identifier) of data stored in each of the servers 200A, 200B, and 200C and information of the server that stores the data. FIG. 4A is a diagram for explaining an example of information stored in the redundant data position table 121. FIG. 4B is a diagram for explaining the correspondence between the information stored in the redundant data position table 121 and the disk storing the data.

たとえば、クライアント４００Ａからの書込要求に応じてファイル名、すなわち、ファイルＩＤ「Ａ」のデータをディスク３００Ａ，３００Ｂ，３００Ｃに格納するとする。この場合、ファイルＩＤ「Ａ」のデータはまず、固定長に分割されて複数のデータブロックとなる（図４−１参照）。そして、各ブロックにハンドル名とバージョン番号が付与される。ハンドル名とバージョン番号とを組み合わせたものが、当該ブロックのデータＩＤとなる。たとえば、図４−１の例では、ファイルＩＤ「Ａ」のデータを３つの固定長のブロックに分割している。そして、最初のブロックにハンドル名「ａ１」とバージョン番号「２」を付与する。したがって、ファイルＩＤ「Ａ」のデータの最初のブロックのデータＩＤは、「ａ１２」となり、当該データＩＤがファイルＩＤ「Ａ」に対応づけて、冗長データ位置テーブル１２１に格納される。同様に、２番目のデータブロックに、ハンドル名「ａ２」とバージョン番号「３」が付与され、冗長データ位置テーブル１２１に格納される。また、３番目のデータブロックに、ハンドル名「ａ３」とバージョン番号「３」が付与され、冗長データ位置テーブル１２１に格納される。 For example, it is assumed that the file name, that is, the data of the file ID “A” is stored in the disks 300A, 300B, and 300C in response to a write request from the client 400A. In this case, the data of the file ID “A” is first divided into fixed lengths to form a plurality of data blocks (see FIG. 4A). A handle name and a version number are assigned to each block. A combination of the handle name and the version number is the data ID of the block. For example, in the example of FIG. 4A, the data of the file ID “A” is divided into three fixed-length blocks. Then, the handle name “a1” and the version number “2” are assigned to the first block. Therefore, the data ID of the first block of the data of the file ID “A” is “a12”, and the data ID is stored in the redundant data position table 121 in association with the file ID “A”. Similarly, the handle name “a2” and the version number “3” are assigned to the second data block and stored in the redundant data position table 121. In addition, the handle name “a3” and the version number “3” are assigned to the third data block and stored in the redundant data position table 121.

なお、各データブロックに付与するデータＩＤのうち、ハンドル名については、各データブロックを一意に識別できるよう、各ブロックに異なるハンドル名を付与する。バージョン番号については、複数のデータブロックに同一のバージョン番号が付与されている場合がある。また、データブロックを一意に特定できるのであれば、他の形式のＩＤを付与してもよい。 Of the data IDs assigned to each data block, for the handle name, a different handle name is assigned to each block so that each data block can be uniquely identified. Regarding the version number, the same version number may be assigned to a plurality of data blocks. Moreover, as long as a data block can be specified uniquely, ID of another format may be given.

本実施形態では、各データブロックの冗長データを３つのディスク３００Ａ，３００Ｂ，３００Ｃに格納する。そこで、冗長データ位置テーブル１２１には、たとえば、ファイルＩＤ「Ａ」のデータの各ブロックについて、冗長データが格納されている場所を示す位置情報が記憶される。図４−２の例では、ファイルＩＤ「Ａ」のデータ中、データＩＤ「ａ３３」のデータブロックの冗長データが、サーバＩＤ「２００Ａ」，「２００Ｂ」，「２００Ｃ」のサーバが制御するディスクに格納されていることが示されている。すなわち、ここでは位置情報は当該データブロックのデータを管理するデータアクセス部が動作しているサーバを特定する。 In this embodiment, redundant data of each data block is stored in three disks 300A, 300B, and 300C. Therefore, the redundant data position table 121 stores, for example, position information indicating the location where the redundant data is stored for each block of the data with the file ID “A”. In the example of FIG. 4B, the redundant data of the data block with the data ID “a33” in the data with the file ID “A” is transferred to the disk controlled by the servers with the server IDs “200A”, “200B”, and “200C”. It is shown that it is stored. That is, here, the location information specifies the server on which the data access unit that manages the data of the data block is operating.

なお、冗長データが格納されている場所を示す位置情報としては、たとえば、サーバのＩＰアドレスやホスト名、サーバ上で動作するデータアクセス部のＩＰアドレスやホスト名などを用いることができる。 As position information indicating the location where redundant data is stored, for example, the IP address and host name of the server, the IP address and host name of the data access unit operating on the server, and the like can be used.

処理対象リスト１２２は、その時点で、書込処理を実行しているサーバ又はデータアクセス部と、処理対象データとを特定するリストである。図５は、処理対象リスト１２２に格納するデータの一例を示す図である。 The processing target list 122 is a list that identifies the server or data access unit that is executing the writing process and the processing target data at that time. FIG. 5 is a diagram illustrating an example of data stored in the processing target list 122.

処理対象リスト１２２は、その時点で書込処理を実行しているデータブロックを特定する情報と、当該データブロックへの書込処理を制御している機能部を特定する情報とを記憶する。たとえば、データＩＤ「ａ１２」のデータブロックに対する書込処理を、データアクセス部「２０１Ａ」が制御している場合、処理対象リスト１２２には、当該データのハンドル名である「ａ１」と、データアクセス部「２０１Ａ」の位置情報である「２０１Ａ」とが対応づけて格納される。データアクセス部「２０１Ａ」による、ハンドル名「ａ１」のデータブロックに対する書込処理が終了すると、所定の時間が経過した後、データ配置制御部１１０が、処理対象リスト１２２からハンドル名「ａ１」とデータアクセス部「２０１Ａ」の情報を削除する。なお、ここでは処理対象リスト１２２は、処理対象データのハンドル名のみを格納し、バージョン番号は格納しないものとする。 The processing target list 122 stores information for specifying the data block that is executing the writing process at that time, and information for specifying the functional unit that controls the writing process to the data block. For example, when the data access unit “201A” controls the writing process for the data block with the data ID “a12”, the processing target list 122 includes “a1”, which is the handle name of the data, and the data access The position information “201A” of the part “201A” is stored in association with each other. When the data access unit “201A” finishes writing the data block with the handle name “a1”, after a predetermined time has elapsed, the data placement control unit 110 changes the handle name “a1” from the processing target list 122. The information of the data access unit “201A” is deleted. Here, it is assumed that the processing target list 122 stores only the handle name of the processing target data and does not store the version number.

［書込処理時のデータ配置制御処理の流れの一例］
図６は、第２の実施形態に係るマスタ１００による、データ配置制御処理の流れの一例を示すフローチャートである。図６を参照して、マスタ１００による、データ配置制御処理の流れを説明する。 [Example of data placement control process flow during write process]
FIG. 6 is a flowchart illustrating an example of the flow of data arrangement control processing by the master 100 according to the second embodiment. With reference to FIG. 6, the flow of data arrangement control processing by the master 100 will be described.

まず、たとえばクライアント４００Ａが、ファイルＩＤ「Ａ」のデータの書込みを要求する書込要求をマスタ１００に対して送信する。書込要求は、データを書き込む対象であるファイルのファイルＩＤと、ファイル中の書込データの位置を示すオフセット「Ｚ」とを含む。なお、書込要求は後述する同一化部４１３によっても送信される。この場合の処理も以下と同様である。 First, for example, the client 400A transmits a write request for requesting writing of data of the file ID “A” to the master 100. The write request includes a file ID of a file to which data is to be written and an offset “Z” indicating the position of the write data in the file. The write request is also transmitted by the identifying unit 413 described later. The processing in this case is similar to the following.

データ配置制御部１１０は、書込要求を受信すると（ステップＳ６０１）、書込要求に含まれた情報に基づき、データを書き込む位置を特定する（ステップＳ６０２）。たとえば、書込要求に含まれたオフセット「Ｚ」が正の数である場合、データ配置制御部１１０は、オフセット「Ｚ」と予め定められたブロックサイズ（固定長）とを加算して、ブロックサイズで除算する。データ配置制御部１１０は、その結果得られた整商に応じた場所にあるブロックを、データを書き込むブロックであると判断する。また、書込要求に含まれたオフセット「Ｚ」が負の数である場合、データ配置制御部１１０は、当該ファイルの終端ブロックがデータを書き込むブロックであると判断する。 When receiving a write request (step S601), the data arrangement control unit 110 specifies a position to write data based on information included in the write request (step S602). For example, when the offset “Z” included in the write request is a positive number, the data arrangement control unit 110 adds the offset “Z” and a predetermined block size (fixed length) to obtain a block Divide by size. The data arrangement control unit 110 determines that the block located at the place corresponding to the quotient obtained as a result is a block to which data is written. When the offset “Z” included in the write request is a negative number, the data arrangement control unit 110 determines that the end block of the file is a block in which data is written.

次に、データ配置制御部１１０は、データを書き込むブロックのデータＩＤの取得を試みる（ステップＳ６０３）。すなわち、データ配置制御部１１０は、冗長データ位置テーブル１２１を参照して、該当するデータＩＤのデータについての情報が格納されているか否かを判定する。情報が格納されていない場合（ステップＳ６０３、否定）、データ配置制御部１１０は、対応するデータがまだいずれのディスクにも格納されていないと判断して、新しいハンドル名を作成する（ステップＳ６０４）。そして、データ配置制御部１１０は、当該データの冗長データを格納するディスク（すなわち、対応するサーバまたはデータアクセス部）を決定する（ステップＳ６０５）。たとえば、データ配置制御部１１０は、予め作成すると定められている冗長データの数が３である場合、「３−１」個以上のスイッチ配下で動作するサーバを、作成する冗長データの数「３」だけ選択する。 Next, the data arrangement control unit 110 tries to acquire the data ID of the block to which data is written (step S603). That is, the data arrangement control unit 110 refers to the redundant data position table 121 and determines whether or not information about the data of the corresponding data ID is stored. If the information is not stored (No at Step S603), the data arrangement control unit 110 determines that the corresponding data is not yet stored on any disk, and creates a new handle name (Step S604). . Then, the data arrangement control unit 110 determines a disk (that is, a corresponding server or data access unit) that stores redundant data of the data (step S605). For example, when the number of redundant data determined to be created in advance is 3, the data placement control unit 110 creates a server that operates under “3-1” or more switches as the number of redundant data created “3”. "Only.

そして、データ配置制御部１１０は、ステップＳ６０４で作成したハンドル名にもとづくデータＩＤと、ステップＳ６０５で選択したサーバの位置情報と、を対応づけて冗長データ位置テーブル１２１に登録する。冗長データ位置テーブル１２１への登録が失敗した場合（ステップＳ６０６、否定）、データ配置制御部１１０は、書込処理が失敗したことを、書込要求を出したクライアント４００Ａ（要求元）に送信する（ステップＳ６０７）。そして、書込処理を終える。冗長データ位置テーブル１２１への登録が成功した場合（ステップＳ６０６、肯定）は、ステップＳ６０３において、データ書込ブロックのデータＩＤを冗長データ位置テーブル１２１から取得し、かつ、ステップＳ６０８において冗長データ位置テーブル１２１から位置情報を取得した場合と同じ処理となる（後述）。 Then, the data placement control unit 110 registers the data ID based on the handle name created in step S604 and the server location information selected in step S605 in the redundant data location table 121 in association with each other. When the registration to the redundant data position table 121 has failed (No at Step S606), the data arrangement control unit 110 transmits that the writing process has failed to the client 400A (request source) that has issued the write request. (Step S607). Then, the writing process is finished. If the registration to the redundant data position table 121 is successful (Yes in step S606), the data ID of the data writing block is acquired from the redundant data position table 121 in step S603, and the redundant data position table is stored in step S608. This is the same processing as when position information is acquired from 121 (described later).

ステップＳ６０３に戻り、データ配置制御部１１０は、データ書込ブロックのデータＩＤを冗長データ位置テーブル１２１から取得した場合（ステップＳ６０３、肯定）、次に、当該データＩＤに対応する冗長データが格納される位置情報を取得する（ステップＳ６０８）。すなわち、冗長データ位置テーブル１２１を参照して、当該データＩＤに対応づけて格納されている位置情報を取得する。位置情報を取得できなかった場合（ステップＳ６０８、否定）、ステップＳ６０７に進み、書込処理失敗をクライアント４００Ａに通知する。そして、処理を終える。 Returning to step S603, when the data placement control unit 110 acquires the data ID of the data writing block from the redundant data position table 121 (Yes in step S603), next, redundant data corresponding to the data ID is stored. Position information is acquired (step S608). That is, with reference to the redundant data position table 121, the position information stored in association with the data ID is acquired. When the position information cannot be acquired (No at Step S608), the process proceeds to Step S607, and the writing process failure is notified to the client 400A. Then, the process ends.

位置情報を取得できた場合（ステップＳ６０８、肯定）、データ配置制御部１１０は、位置情報によって示されるサーバまたはデータアクセス部に対する書込制御処理を実行する（ステップＳ６０９）。図７は、第２の実施形態に係るマスタ１００による、書込制御処理の流れの一例を示すフローチャートである。図７を参照し、データ配置制御部１１０の書込制御処理の流れの一例を説明する。 If the position information can be acquired (Yes at Step S608), the data arrangement control unit 110 executes a write control process for the server or the data access unit indicated by the position information (Step S609). FIG. 7 is a flowchart illustrating an example of a flow of a write control process by the master 100 according to the second embodiment. With reference to FIG. 7, an example of the flow of the write control process of the data arrangement control unit 110 will be described.

データ配置制御部１１０は、クライアント４００Ａから受信した書込要求に対応するデータＩＤと書込場所を示す位置情報とを取得すると（図６、ステップＳ６０８、肯定）、取得したデータＩＤのハンドル名が、処理対象リスト１２２に登録されているか否かを判定する（ステップＳ７０１）。処理対象リスト１２２に当該データＩＤのハンドル名が登録されている場合、当該データＩＤに対応するデータブロックの書込処理が実行中であることを意味する。したがって、処理対象リスト１２２に当該データＩＤのハンドル名が登録されている場合（ステップＳ７０１、肯定）、データ配置制御部１１０は書込制御処理をそのまま終了する。 When the data placement control unit 110 acquires the data ID corresponding to the write request received from the client 400A and the position information indicating the write location (FIG. 6, step S608, positive), the handle name of the acquired data ID is Then, it is determined whether or not it is registered in the processing target list 122 (step S701). When the handle name of the data ID is registered in the processing target list 122, it means that the writing process of the data block corresponding to the data ID is being executed. Therefore, when the handle name of the data ID is registered in the processing target list 122 (Yes in step S701), the data arrangement control unit 110 ends the write control process as it is.

処理対象リスト１２２に当該データＩＤが登録されていない場合（ステップＳ７０１、否定）、当該データＩＤに対応するデータブロックに対する書込処理は実行中ではないことを意味する。したがって、データ配置制御部１１０は、次の処理に進み、ステップＳ６０８において取得した位置情報に含まれるデータアクセス部（または対応するサーバ若しくはディスク）を１つ選択する（ステップＳ７０２）。たとえば、データ配置制御部１１０は、データアクセス部をランダムに選択する。また、たとえば、クライアント４００Ａから経由するスイッチ数が最も少ないデータアクセス部を選択してもよい。そして、データ配置制御部１１０は、選択した位置情報に対応するデータアクセス部に対して、書込制御命令を送信する（ステップＳ７０３）。書込制御命令は、書込対象であるデータブロックを特定する情報を含む。書込制御命令は、書込対象のデータハンドルについて書込制御を行うことを命令するものである。 If the data ID is not registered in the processing target list 122 (No at Step S701), it means that the writing process for the data block corresponding to the data ID is not being executed. Therefore, the data arrangement control unit 110 proceeds to the next process, and selects one data access unit (or corresponding server or disk) included in the position information acquired in step S608 (step S702). For example, the data arrangement control unit 110 randomly selects a data access unit. Further, for example, a data access unit having the smallest number of switches via the client 400A may be selected. Then, the data arrangement control unit 110 transmits a write control command to the data access unit corresponding to the selected position information (step S703). The write control instruction includes information for specifying a data block to be written. The write control command is a command to perform write control for the data handle to be written.

そして、データ配置制御部１１０は、書込制御命令の送信先であるデータアクセス部から、命令受信応答を受信したか否かを判定する（ステップＳ７０４）。なお、命令受信応答は、書込制御命令に応じて、書込制御処理および書込処理が完了したことを意味するものではなく、書込制御命令を受信したことを意味するものである。 Then, the data arrangement control unit 110 determines whether or not a command reception response has been received from the data access unit that is the transmission destination of the write control command (step S704). The command reception response does not mean that the write control process and the write process have been completed in response to the write control command, but means that the write control command has been received.

データ配置制御部１１０は、命令受信応答を受信していないと判定した場合（ステップＳ７０４、否定）、再び、ステップＳ７０２に戻って、データアクセス部を選択しなおし（ステップＳ７０２）、書込制御命令を送信する（ステップＳ７０３）。命令受信応答を受信しない場合としては、たとえば、タイムアウトやコネクションが切れた場合等が考えられる。他方、データ配置制御部１１０は、命令受信応答を受信したと判定すると（ステップＳ７０４、肯定）、冗長データ位置テーブル１２１に、当該データＩＤに対応付けて格納されている位置情報の中から、ステップＳ７０２において選択したデータアクセス部に対応する位置情報を抽出し、当該位置情報が、リストの先頭にくるように、冗長データ位置テーブル１２１を更新する（ステップＳ７０５）。そして、データ配置制御部１１０は、処理対象リスト１２２に、書込制御命令の対象である書込対象データのハンドル名と、書込制御命令の送信先であるデータアクセス部の位置情報と、を登録する（ステップＳ７０６）。そして、データ配置制御部１１０は、冗長データ位置テーブル１２１に登録した、書込対象データのデータＩＤのうち、バージョン番号を更新する（ステップＳ７０７）。データ配置制御部１１０は、更新したバージョン番号を、当該データＩＤに対応づけて格納されている位置情報に含まれるデータアクセス部に通知する（ステップＳ７０８）。これによって、データ配置制御部１１０による書込制御処理が終了する。 If the data placement control unit 110 determines that the command reception response has not been received (No at Step S704), the data placement control unit 110 returns to Step S702 again to reselect the data access unit (Step S702), and the write control command. Is transmitted (step S703). As a case where the command reception response is not received, for example, a time-out or a case where the connection is cut off can be considered. On the other hand, if the data arrangement control unit 110 determines that the command reception response has been received (Yes in step S704), the data arrangement control unit 110 executes step from the position information stored in the redundant data position table 121 in association with the data ID. The position information corresponding to the data access unit selected in S702 is extracted, and the redundant data position table 121 is updated so that the position information comes to the top of the list (step S705). Then, the data placement control unit 110 adds, to the processing target list 122, the handle name of the write target data that is the target of the write control command and the position information of the data access unit that is the transmission destination of the write control command. Registration is performed (step S706). Then, the data arrangement control unit 110 updates the version number among the data IDs of the write target data registered in the redundant data position table 121 (step S707). The data arrangement control unit 110 notifies the updated version number to the data access unit included in the position information stored in association with the data ID (step S708). Thereby, the write control process by the data arrangement control unit 110 is completed.

再び図６に戻り、書込処理を説明する。ステップＳ６０９におけるデータ配置制御部１１０による書込制御処理が完了すると、データ配置制御部１１０は、書込要求の送信元であるクライアント４００Ａに、冗長データ位置テーブル１２１に登録した情報を送信する（ステップＳ６１０）。すなわち、データ配置制御部１１０は、図６の書込制御処理を通じて、冗長データ位置テーブル１２１に格納されている、書込要求の対象であるデータブロックのデータＩＤと当該データＩＤに対応付けられている位置情報とを、クライアント４００Ａに送信する。 Returning to FIG. 6 again, the writing process will be described. When the write control process by the data arrangement control unit 110 in step S609 is completed, the data arrangement control unit 110 transmits the information registered in the redundant data position table 121 to the client 400A that is the transmission source of the write request (step S609). S610). That is, the data placement control unit 110 associates the data ID of the data block that is the target of the write request stored in the redundant data position table 121 with the data ID through the write control process of FIG. Position information is transmitted to the client 400A.

そして、データ配置制御部１１０は、クライアント４００Ａからの情報を受信した旨の受信応答の受信有無によって、要求元への送信処理が成功したか否かを判定する（ステップＳ６１１）。送信処理が成功したと判定した場合（ステップＳ６１１、肯定）、データ配置制御部１１０は、データ配置制御処理を終了する。また、送信処理が失敗したと判定した場合（ステップＳ６１１、否定）、データ配置制御部１１０は、送信処理を行った回数がＮ回（Ｎは１以上の自然数）以下であるか否かを判定する（ステップＳ６１２）。Ｎ回以下であると判定した場合（ステップＳ６１２、肯定）、データ配置制御部１１０は、ステップＳ６１０に戻って再び情報を送信する。また、Ｎ回以下ではないと判定した場合（ステップＳ６１２、否定）、データ配置制御部１１０は、それ以上情報を送信することなく、データ配置制御処理を終了する。これによって、書込処理時の、データ配置制御部１１０によるデータ配置制御処理が終了する。 Then, the data arrangement control unit 110 determines whether or not the transmission process to the request source has been successful based on whether or not a reception response indicating that the information from the client 400A has been received has been received (step S611). When it is determined that the transmission process is successful (Yes in step S611), the data arrangement control unit 110 ends the data arrangement control process. If it is determined that the transmission process has failed (No in step S611), the data arrangement control unit 110 determines whether or not the number of times the transmission process has been performed is N or less (N is a natural number of 1 or more). (Step S612). If it is determined that the number is N or less (step S612, affirmative), the data arrangement control unit 110 returns to step S610 and transmits information again. If it is determined that it is not less than N times (No at Step S612), the data arrangement control unit 110 ends the data arrangement control process without transmitting any more information. Thereby, the data arrangement control process by the data arrangement control unit 110 at the time of the writing process is completed.

［書込部による書込処理の流れの一例］
図６及び図７に示すように、データ配置制御部１１０がデータ配置制御処理および書込制御処理を実行すると、要求元（クライアント）に対して、処理を実行するために必要なデータが送信される。クライアントは受信したデータを使用して、データアクセス部にアクセスし、データの書込みおよび読み出しを実行させる。次に、クライアント側からの書込処理の流れの一例を説明する。 [Example of flow of writing process by writing unit]
As shown in FIGS. 6 and 7, when the data arrangement control unit 110 executes the data arrangement control process and the write control process, the data necessary for executing the process is transmitted to the request source (client). The The client uses the received data to access the data access unit and execute data writing and reading. Next, an example of the flow of writing processing from the client side will be described.

図８は、第２の実施形態に係る書込部による書込処理の流れの一例を示すフローチャートである。図８は、たとえばクライアント４００Ａが備える書込部４０２Ａによる書込処理の流れの一例を示す図である。図８を参照し、クライアント４００Ａによる書込処理の流れの一例を説明する。 FIG. 8 is a flowchart illustrating an example of a flow of a writing process by the writing unit according to the second embodiment. FIG. 8 is a diagram illustrating an example of a flow of writing processing by the writing unit 402A included in the client 400A, for example. With reference to FIG. 8, an example of the flow of the writing process by the client 400A will be described.

まず、クライアント４００Ａがデータの書込処理を実行しようとする場合、アプリケーション実行部４０１Ａは、書込要求を生成する。書込要求は、書き込むデータのファイルＩＤと、ファイル終端の位置を示すオフセットと、を含む。書込要求に含まれるファイルＩＤ、オフセットは、書き込む対象であるデータとともに、アプリケーション実行部４０１Ａから書込部４０２Ａに送られる（ステップＳ８０１）。また、書込部４０２Ａが受けた書込要求は、書込部４０２Ａからマスタ１００に送信される（図６のＳ６０１参照）。 First, when the client 400A intends to execute a data write process, the application execution unit 401A generates a write request. The write request includes a file ID of data to be written and an offset indicating the position of the end of the file. The file ID and offset included in the write request are sent from the application execution unit 401A to the writing unit 402A together with the data to be written (step S801). The write request received by the writing unit 402A is transmitted from the writing unit 402A to the master 100 (see S601 in FIG. 6).

また、書込要求を受信したマスタ１００のデータ配置制御部１１０は、書込要求に応じてデータを書き込む対象となるデータブロックのデータＩＤおよび当該データブロックの位置情報を、冗長データ位置テーブル１２１を参照して取得し、書込部４０２Ａに送信する。これによって書込部４０２Ａは、書き込む対象となるデータブロックのデータＩＤと当該データブロックの位置情報を取得する（ステップＳ８０２）。 In addition, the data arrangement control unit 110 of the master 100 that has received the write request stores the data ID of the data block to which data is to be written in response to the write request and the position information of the data block in the redundant data position table 121. The information is acquired by referring to the writing unit 402A. Thereby, the writing unit 402A acquires the data ID of the data block to be written and the position information of the data block (step S802).

次に、書込部４０２Ａは、取得した位置情報から、書込対象データブロックが格納されるディスクを配下にもつサーバまたは書込対象データブロックへの書込アクセスを制御するデータアクセス部の情報を抽出する。そして、抽出したサーバまたはデータアクセス部のうち、書込部４０２Ａから最も近いサーバまたはデータアクセス部を１つ選択する（ステップＳ８０３）。ここで、「最も近い」とは、たとえば、書込部４０２Ａから当該サーバまたはデータアクセス部までの間に介装されているスイッチの数が最も少ないこと、さらに、経由するスイッチの数が最も少ないデータアクセス部が複数存在する場合は、「ｌｏｇ（「自身のＩＰアドレス」ＥＸＯＲ「相手のＩＰアドレス」）」の値が最も小さい値、さらに、この値が同一となるデータアクセス部が複数存在する場合は、同一となるデータアクセス部のうち取得した位置情報の中で先に取得したデータアクセス部を選択する。 Next, the writing unit 402A obtains, from the acquired position information, information on a data access unit that controls write access to a server or a write target data block under the disk in which the write target data block is stored. Extract. Then, one server or data access unit closest to the writing unit 402A is selected from the extracted servers or data access units (step S803). Here, “closest” means, for example, that the number of switches interposed between the writing unit 402A and the server or the data access unit is the smallest, and further, the number of switches passing therethrough is the smallest. When there are a plurality of data access units, the value of “log (“ own IP address ”EXOR“ partner IP address ”)” is the smallest value, and there are a plurality of data access units with the same value. In this case, the previously acquired data access unit is selected from the acquired position information among the same data access units.

書込部４０２Ａは、データアクセス部、たとえば、データアクセス部２０１Ａを選択する。そして、書込部４０２Ａは、選択したデータアクセス部２０１Ａに対して、書込処理のための情報を送信する（ステップＳ８０４）。たとえば、書込部４０２Ａは、書込対象ブロックのデータＩＤと、書込対象ブロックのオフセットと、実際に書き込むデータと（ステップＳ８０１で取得）、位置情報（ステップＳ８０２で取得）と、メッセージＩＤと、を送信する。メッセージＩＤとは、書込部４０２Ａが、データ整合システム１において一意となるように生成して付与する識別子である。 Write unit 402A selects a data access unit, for example, data access unit 201A. Then, writing unit 402A transmits information for writing processing to selected data access unit 201A (step S804). For example, the writing unit 402A has a data ID of the writing target block, an offset of the writing target block, data to be actually written (obtained in step S801), position information (obtained in step S802), message ID, , Send. The message ID is an identifier generated and assigned by the writing unit 402A so as to be unique in the data matching system 1.

そして、データアクセス部２０１Ａに書込処理のための情報を送信した後、書込部４０２Ａは、メッセージＩＤに対応づけられた書込命令を、位置情報の中でリストの先頭のサーバ（すなわち、書込制御を行うデータアクセス部）に送信する（ステップＳ８０５）。書込命令は、メッセージＩＤに対応付けられたデータの書込を命令する。 Then, after transmitting the information for the writing process to the data access unit 201A, the writing unit 402A sends the write command associated with the message ID to the first server in the list (ie, the position information) The data is transmitted to a data access unit that performs write control (step S805). The write command instructs writing of data associated with the message ID.

そして、書込部４０２Ａは、書込命令の送信先のデータアクセス部から命令受信応答を受信したか否かを判定する（ステップＳ８０６）。たとえば、予め所定の時間を設定しておき、当該所定の時間内に命令受信応答を受信した場合は受信したと判定し、当該所定の時間内に命令受信応答を受信していない場合は受信していないと判定すればよい。そして、命令受信応答を受信した場合（ステップＳ８０６、肯定）、書込部４０２Ａは、アプリケーション実行部４０１Ａに、データアクセス部２０１Ａからの命令受信応答を受信した旨を通知できるか否かを判定する（ステップＳ８０７）。書込部４０２Ａは、通知できると判定した場合（ステップＳ８０７、肯定）、アプリケーション実行部４０１Ａに通知する（ステップＳ８０８）。そして、書込処理を正常終了する（ステップＳ８０９）。 Then, writing unit 402A determines whether or not a command reception response has been received from the data access unit that is the transmission destination of the write command (step S806). For example, a predetermined time is set in advance, and if a command reception response is received within the predetermined time, it is determined that it has been received. If a command reception response is not received within the predetermined time, it is received. What is necessary is just to determine that it is not. If a command reception response is received (Yes at step S806), the writing unit 402A determines whether or not the application execution unit 401A can be notified that the command reception response from the data access unit 201A has been received. (Step S807). If the writing unit 402A determines that notification can be made (Yes in step S807), the writing unit 402A notifies the application execution unit 401A (step S808). Then, the writing process ends normally (step S809).

これに対し、ステップＳ８０７において、書込部４０２Ａは命令受信応答を受信した旨を通知できないと判定した場合（ステップＳ８０７、否定）、通知を行わずに、書込処理を異常終了する（ステップＳ８１０）。 On the other hand, when it is determined in step S807 that the writing unit 402A cannot notify that the instruction reception response has been received (No in step S807), the writing process is abnormally terminated without performing notification (step S810). ).

また、ステップＳ８０６において、データアクセス部２０１Ａから命令受信応答を受信していないと判定した場合（ステップＳ８０６、否定）、書込部４０２Ａは、アプリケーション実行部４０１Ａに書込命令が失敗した旨を通知できるか否かを判定する（ステップＳ８１１）。そして、通知できると判定した場合（ステップＳ８１１、肯定）、書込部４０２Ａは、アプリケーション実行部４０１Ａに、書込命令が失敗した旨を通知する（ステップＳ８１２）。そして、書込処理を異常終了する（ステップＳ８１３）。これに対して、通知できないと判定した場合（ステップＳ８１１、否定）、通知は行わずに、書込処理を異常終了する（ステップＳ８１０）。これによってクライアント４００Ａにおける書込処理が終了する。 In Step S806, when it is determined that the command reception response has not been received from the data access unit 201A (No in Step S806), the writing unit 402A notifies the application execution unit 401A that the writing command has failed. It is determined whether or not it can be performed (step S811). If it is determined that notification can be made (Yes at Step S811), the writing unit 402A notifies the application execution unit 401A that the write command has failed (Step S812). Then, the writing process ends abnormally (step S813). On the other hand, if it is determined that notification is not possible (No in step S811), the writing process is abnormally terminated without performing notification (step S810). This completes the writing process in the client 400A.

［データアクセス部における書込実行前処理の流れの一例］
図９は、第２の実施形態に係るデータアクセス部における書込実行前処理の流れの一例を示すフローチャートである。図９を参照し、データアクセス部における書込処理の流れの一例を説明する。 [Example of the flow of pre-write execution processing in the data access unit]
FIG. 9 is a flowchart illustrating an example of a flow of pre-write execution processing in the data access unit according to the second embodiment. With reference to FIG. 9, an example of the flow of the writing process in the data access unit will be described.

まず、たとえばデータアクセス部２０１Ａは、データの書込処理を実行するための情報を、書込部４０２Ａまたは他のデータアクセス部から受信する（ステップＳ９０１）。受信する情報は、たとえば図８のステップＳ８０４において、書込部４０２Ａが送信した情報である。本実施例では、書込のための情報は、書込部４０２Ａから１つのデータアクセス部に送られ、当該データアクセス部から他のデータアクセス部に送信される。 First, for example, the data access unit 201A receives information for executing data writing processing from the writing unit 402A or another data access unit (step S901). The information to be received is, for example, information transmitted by the writing unit 402A in step S804 in FIG. In the present embodiment, information for writing is transmitted from the writing unit 402A to one data access unit, and transmitted from the data access unit to another data access unit.

データアクセス部２０１Ａは、受信した情報のうち、位置情報を抽出し、その中に、自身以外のデータアクセス部が含まれているか否かを判定する（ステップＳ９０２）。他のデータアクセス部が位置情報に含まれていない場合（ステップＳ９０２、否定）、データアクセス部２０１Ａは、書込要求の送信元であるクライアント４００Ａに情報受信に成功した旨の通知を送信する（ステップＳ９０３）。 The data access unit 201A extracts position information from the received information, and determines whether or not a data access unit other than itself is included therein (step S902). If no other data access unit is included in the position information (No at Step S902), the data access unit 201A transmits a notification to the effect that the information has been successfully received to the client 400A that is the transmission source of the write request ( Step S903).

これに対して、他のデータアクセス部が位置情報に含まれている場合（ステップＳ９０２、肯定）、データアクセス部２０１Ａは、他のデータアクセス部を１つ選択する（ステップＳ９０４）。このとき、データアクセス部２０１Ａは、図８のステップＳ８０３と同様の基準でデータアクセス部を選択すればよい。そして、データアクセス部２０１Ａは、自身の位置情報を除いた位置情報を作成する。 On the other hand, when another data access unit is included in the position information (Yes at step S902), the data access unit 201A selects one other data access unit (step S904). At this time, the data access unit 201A may select the data access unit based on the same criteria as in step S803 in FIG. Then, the data access unit 201A creates position information excluding its own position information.

次に、データアクセス部２０１Ａは、受信した書込処理のためのデータのうち、位置情報をステップＳ９０４において作成した位置情報と置き換えて、ステップＳ９０４で選択したデータアクセス部に送信する（ステップＳ９０５）。そして、データアクセス部２０１Ａは、情報を受信した旨の応答が送信されるのを待つ。情報を受信した旨の応答を受信した場合（ステップＳ９０６、肯定）、データアクセス部２０１Ａは、ステップＳ９０３に進み、送信元のクライアント４００Ａに、情報受信が成功した旨の通知を送信する。これに対して、情報を受信した旨の応答を受信しない場合（ステップＳ９０６、否定）、データアクセス部２０１Ａは、送信元のクライアント４００Ａに、情報受信に失敗した旨の通知を送信する（ステップＳ９０７）。これによってデータアクセス部２０１Ａでの処理が終了する。 Next, the data access unit 201A replaces the position information in the received data for writing processing with the position information created in step S904, and transmits it to the data access unit selected in step S904 (step S905). . Then, the data access unit 201A waits for a response indicating that the information has been received. If a response indicating that the information has been received is received (Yes at Step S906), the data access unit 201A proceeds to Step S903, and transmits a notification that the information reception has been successful to the transmission source client 400A. On the other hand, when the response indicating that the information has been received is not received (No at Step S906), the data access unit 201A transmits a notification that the information reception has failed to the transmission source client 400A (Step S907). ). As a result, the processing in the data access unit 201A ends.

［データアクセス部における書込実行処理の流れの一例］
次に、図１０を参照して、データアクセス部における書込実行処理の流れを説明する。図１０は、第２の実施形態に係るデータアクセス部における書込実行処理の流れの一例を示すフローチャートである。なお、本実施例では、複数のサーバに冗長データを書き込む場合、まず、書込処理のための情報を１つのサーバに送信し、当該サーバから他の該当するサーバに順次情報を送信する。そして、書込を実行する際には、書込部が予め定められた処理順序に従い、最初に書込を行うサーバに書込命令を送信する。他のサーバに対しては、書込部ではなく、最初に書込を行うサーバから書込命令を送信する。図１０は、最初に書込を行うサーバ（すなわち、書込制御を行うデータアクセス部）での書込実行処理を示す。 [Example of flow of write execution processing in data access unit]
Next, the flow of write execution processing in the data access unit will be described with reference to FIG. FIG. 10 is a flowchart illustrating an example of the flow of the write execution process in the data access unit according to the second embodiment. In this embodiment, when writing redundant data to a plurality of servers, first, information for writing processing is transmitted to one server, and information is sequentially transmitted from the server to other corresponding servers. Then, when executing the writing, the writing unit transmits a writing command to the server that performs the writing first in accordance with a predetermined processing order. For other servers, a writing command is transmitted from the server that performs the writing first, not the writing unit. FIG. 10 shows a write execution process in a server that performs writing first (that is, a data access unit that performs write control).

まず、たとえばデータアクセス部２０１Ａは、書込部４０２Ａから書込命令を受信する（ステップＳ１００１）。データアクセス部２０１Ａは、それまでに例えば、図９に示すステップＳ９０１の処理によって、書込処理を実行するためのデータを受信している。そして、データアクセス部２０１Ａは、書込命令に応じて、書込対象データを指定された位置に書き込む処理を実行する（ステップＳ１００２）。このとき、データアクセス部２０１Ａは、書込部４０２Ａから受信した書込命令に含まれるメッセージＩＤによって、書込対象データを特定する。次に、データアクセス部２０１Ａは、ステップＳ７０３（図７）において受信したデータに含まれる位置情報の中の他のデータアクセス部に対して、書込命令を送信する（ステップＳ１００３）。そして、データアクセス部２０１Ａは、命令を送信できたか否かを判定する（ステップＳ１００４）。命令を送信できなかったと判定した場合（ステップＳ１００４、否定）、データアクセス部２０１Ａは、それ以上処理を行わず、異常終了する（ステップＳ１００８）。 First, for example, the data access unit 201A receives a write command from the writing unit 402A (step S1001). Until then, the data access unit 201A has received data for executing the writing process, for example, by the process of step S901 shown in FIG. Then, in response to the write command, the data access unit 201A executes a process of writing the write target data at the designated position (step S1002). At this time, the data access unit 201A specifies the data to be written based on the message ID included in the write command received from the writing unit 402A. Next, the data access unit 201A transmits a write command to other data access units in the position information included in the data received in step S703 (FIG. 7) (step S1003). Then, the data access unit 201A determines whether or not the command has been transmitted (step S1004). If it is determined that the command could not be transmitted (No at Step S1004), the data access unit 201A does not perform any more processing and ends abnormally (Step S1008).

これに対して、命令を送信できたと判定した場合（ステップＳ１００４、肯定）、データアクセス部２０１Ａは、書込命令の送信先であるデータアクセス部から命令成功応答があったか否かを判定する（ステップＳ１００５）。全てのデータアクセス部から命令成功応答があったと判定した場合（ステップＳ１００５、肯定）、データアクセス部２０１Ａは、送信元のクライアント４００Ａに、書込が完了した旨の通知を送信する（ステップＳ１００６）。そして、データアクセス部２０１Ａは、書込実行処理を正常終了する（ステップＳ１００７）。これに対して、全てのデータアクセス部からの命令成功応答を受信していないと判定した場合（ステップＳ１００５、否定）、データアクセス部２０１Ａは、それ以上処理を行わず、異常終了する（ステップＳ１００８）。これによって書込実行処理が終了する。 On the other hand, when it is determined that the command has been transmitted (Yes at step S1004), the data access unit 201A determines whether there has been a command success response from the data access unit that is the transmission destination of the write command (step S1004). S1005). If it is determined that there has been a command success response from all the data access units (Yes at step S1005), the data access unit 201A transmits a notification to the effect that the writing has been completed to the transmission source client 400A (step S1006). . Then, the data access unit 201A normally ends the write execution process (step S1007). On the other hand, if it is determined that instruction success responses from all the data access units have not been received (No at step S1005), the data access unit 201A does not perform any further processing and ends abnormally (step S1008). ). This completes the write execution process.

［第２の実施形態に係るデータ整合システムにおける処理の流れ］
図１１は、第２の実施形態に係るデータ整合システム１における正常時の処理の流れの一例を概括的に示す図である。図１１を参照し、第２の実施形態に係るデータ整合システム１における正常時の処理の流れの一例について説明する。 [Processing Flow in Data Matching System According to Second Embodiment]
FIG. 11 is a diagram schematically showing an example of a normal processing flow in the data matching system 1 according to the second embodiment. With reference to FIG. 11, an example of a normal processing flow in the data matching system 1 according to the second embodiment will be described.

図１１に示すように、クライアント４００（以下、「クライアント４００」は、クライアント４００Ａ，４００Ｂ，４００Ｃのいずれか１つを示す）が、マスタ１００に対して書込要求を送信する（図１１の（１））。書込要求は、ファイルＩＤとオフセットを含む。書込要求を受信したマスタ１００は、ファイルＩＤとオフセットに基づき、冗長データ位置テーブル１２１を参照して、要求された書込処理の対象データのデータＩＤと位置情報とを抽出する。対応するデータＩＤが冗長データ位置テーブル１２１に登録されていない場合は、新たなハンドル名を作成して登録する（図１１の（２））。 As shown in FIG. 11, the client 400 (hereinafter, “client 400” indicates one of the clients 400A, 400B, and 400C) transmits a write request to the master 100 ((( 1)). The write request includes a file ID and an offset. The master 100 that has received the write request refers to the redundant data position table 121 based on the file ID and the offset, and extracts the data ID and position information of the requested target data for the writing process. If the corresponding data ID is not registered in the redundant data position table 121, a new handle name is created and registered ((2) in FIG. 11).

マスタ１００はさらに、処理対象リスト１２２を参照して（図１１の（３））、書込対象データのハンドル名が登録されていない場合、書込処理制御を行うデータアクセス部を１つ選択する（図１１の（４））。そして、選択したデータアクセス部に書込制御命令を送信する（図１１の（５））。データアクセス部から応答を受信する（図１１の（６））と、マスタ１００は、冗長データ位置テーブル１２１と処理対象リスト１２２とを更新し（図１１の（７））、更新後のバージョン番号をデータアクセス部に通知する（図１１の（８））。その後、マスタ１００は、クライアント４００に書込処理の対象データのデータＩＤと、対応する位置情報とを送信する（図１１の（９））。 Further, the master 100 refers to the processing target list 122 ((3) in FIG. 11), and if the handle name of the write target data is not registered, the master 100 selects one data access unit that performs the write processing control. ((4) in FIG. 11). Then, a write control command is transmitted to the selected data access unit ((5) in FIG. 11). When receiving a response from the data access unit ((6) in FIG. 11), the master 100 updates the redundant data position table 121 and the processing target list 122 ((7) in FIG. 11), and the updated version number Is notified to the data access unit ((8) in FIG. 11). Thereafter, the master 100 transmits to the client 400 the data ID of the target data to be written and the corresponding position information ((9) in FIG. 11).

クライアント４００は、書込要求に含めたファイルＩＤおよびオフセット（図１１の（１）参照）と、書込むデータと、マスタ１００から受信したデータＩＤおよび位置情報（図１１の（９））と、クライアント４００において一意となるよう付与したメッセージＩＤと、を図８のステップＳ８０３と同様の基準で選択したデータアクセス部に送信する（図１１の（１０））。情報を受信したデータアクセス部は、位置情報を参照して、他のデータアクセス部が位置情報に含まれていれば、自身の情報を位置情報から削除した上で、当該他のデータアクセス部に受信した情報を送信する（図１１の（１１））。情報を受信した他のデータアクセス部も同様の処理を実行する（図１１の（１２））。 The client 400 includes the file ID and offset included in the write request (see (1) in FIG. 11), the data to be written, the data ID and position information received from the master 100 ((9) in FIG. 11), The message ID assigned to be unique in the client 400 is transmitted to the data access unit selected based on the same criteria as in step S803 in FIG. 8 ((10) in FIG. 11). The data access unit that has received the information refers to the location information, and if the other data access unit is included in the location information, deletes its own information from the location information, and then sends it to the other data access unit. The received information is transmitted ((11) in FIG. 11). Other data access units that have received the information also execute the same processing ((12) in FIG. 11).

その後、クライアント４００は、書込制御を行うデータアクセス部（図１１の（４）参照）に書込実行命令を送信する（図１１の（１３））。書込実行命令を受信したデータアクセス部は、データを配下のディスクに書き込んだ上で、書込実行命令を位置情報に含まれていた他のデータアクセス部（複数あれば全てに）送信する（図１１の（１４））。データが該当する全てのディスクに書き込まれると、書込制御を行うデータアクセス部に他のデータアクセス部から完了応答が送信される（図１１の（１５））。書込制御を行うデータアクセス部は、全ての完了応答を受信すると、クライアントに完了応答を送信する（図１１の（１６））。正常時の書込処理はこのようにして完了する。 Thereafter, the client 400 transmits a write execution command to the data access unit (see (4) in FIG. 11) that performs write control ((13) in FIG. 11). The data access unit that has received the write execution command writes the data to the subordinate disk, and then transmits the write execution command to other data access units (if there are a plurality of data access units) included in the position information ( (14) of FIG. When the data is written to all the corresponding disks, a completion response is transmitted from the other data access unit to the data access unit that performs write control ((15) in FIG. 11). When the data access unit that performs the write control receives all the completion responses, it transmits the completion response to the client ((16) in FIG. 11). The writing process at the normal time is completed in this way.

［読出処理時のデータ配置制御処理の流れの一例］
図１２は、第２の実施形態に係るデータ配置制御部１１０による、読出処理時のデータ配置制御処理の流れの一例を示すフローチャートである。図１２を参照し、第２の実施形態に係るマスタ１００における、読出処理時のデータ配置制御処理の流れの一例につき説明する。 [Example of flow of data arrangement control processing during read processing]
FIG. 12 is a flowchart illustrating an example of the flow of the data arrangement control process during the reading process by the data arrangement control unit 110 according to the second embodiment. With reference to FIG. 12, an example of the flow of the data arrangement control process during the reading process in the master 100 according to the second embodiment will be described.

まず、たとえばクライアント４００Ａがマスタ１００に読出要求を送信する。なお、読出要求は後述する比較部４１２によっても送信される。この場合の処理も以下と同様である。読出要求は、読み出す対象ファイルのファイルＩＤと、読み出す対象ファイルの終端位置を示すオフセット「Ｚ」と、を含む。マスタ１００のデータ配置制御部１１０は、読出要求を受信する（ステップＳ１２０１）と、読出要求に含まれるファイルＩＤとオフセット「Ｚ」とを抽出する。そして、データ配置制御部１１０は、抽出した情報に基づき、当該ファイルの何番目のブロックからの読み出しを行うのかを特定する（ステップＳ１２０２）。この処理は図６のステップＳ６０２と同様である。 First, for example, the client 400A transmits a read request to the master 100. The read request is also transmitted by the comparison unit 412 described later. The processing in this case is similar to the following. The read request includes the file ID of the target file to be read and the offset “Z” indicating the end position of the target file to be read. Upon receiving the read request (step S1201), the data arrangement control unit 110 of the master 100 extracts the file ID and the offset “Z” included in the read request. Then, based on the extracted information, the data arrangement control unit 110 specifies from which block of the file the reading is to be performed (step S1202). This process is the same as step S602 in FIG.

次に、データ配置制御部１１０は、冗長データ位置テーブル１２１から、特定したブロックに対応するブロックデータのデータＩＤを取得することを試みる（ステップＳ１２０３）。データＩＤを取得できなかった場合（ステップＳ１２０３、否定）、データ配置制御部１１０は、読出要求を送信したクライアント４００Ａ（要求元）に対して、読出処理に失敗した旨を通知して（ステップＳ１２０４）、処理を終える。他方、データＩＤを取得できた場合（ステップＳ１２０３、肯定）、データ配置制御部１１０は、次に、当該データＩＤに対応する位置情報を、冗長データ位置テーブル１２１から取得することを試みる（ステップＳ１２０５）。位置情報を取得できなかった場合（ステップＳ１２０５、否定）は、データＩＤを取得できなかった場合（ステップＳ１２０３、否定）と同様に、クライアント４００Ａに読出処理に失敗した旨を通知して（ステップＳ１２０４）、処理を終える。 Next, the data arrangement control unit 110 tries to acquire the data ID of the block data corresponding to the identified block from the redundant data position table 121 (step S1203). When the data ID cannot be acquired (No at Step S1203), the data arrangement control unit 110 notifies the client 400A (request source) that has transmitted the read request that the read process has failed (Step S1204). ) Finish the process. On the other hand, if the data ID can be acquired (Yes at step S1203), the data arrangement control unit 110 then tries to acquire the position information corresponding to the data ID from the redundant data position table 121 (step S1205). ). When the position information cannot be acquired (No at Step S1205), the client 400A is notified that the reading process has failed (Step S1204), similarly to the case where the data ID cannot be acquired (No at Step S1203). ) Finish the process.

位置情報を取得できた場合（ステップＳ１２０５、肯定）は、データ配置制御部１１０は、冗長データ位置テーブル１２１から取得したデータＩＤ（ステップＳ１２０３）および位置情報（ステップＳ１２０５）を、クライアント４００Ａ（要求元）に送信する（ステップＳ１２０６）。そして、データ配置制御部１１０は、送信処理が成功したか否かを判定する（ステップＳ１２０７）。たとえば、クライアント４００Ａから受信応答を受け取ることによって、成功したか否かを判定すればよい。 If the position information can be acquired (Yes at Step S1205), the data arrangement control unit 110 uses the data ID (Step S1203) and the position information (Step S1205) acquired from the redundant data position table 121 as the client 400A (request source). ) (Step S1206). Then, the data arrangement control unit 110 determines whether or not the transmission process is successful (step S1207). For example, it may be determined whether or not it has succeeded by receiving a reception response from the client 400A.

送信処理が成功したと判定した場合（ステップＳ１２０７、肯定）、データ配置制御部１１０は、処理を終える。他方、送信処理が失敗したと判定した場合（ステップＳ１２０７、否定）、データ配置制御部１１０は、送信処理の失敗回数がＮ回（Ｎは１以上の自然数）以下であるか否かを判定する（ステップＳ１２０８）。Ｎ回以下であると判定した場合（ステップＳ１２０８、肯定）、データ配置制御部１１０は、ステップＳ１２０６に戻って再び情報を要求元に送信する。他方、Ｎ回以下ではないと判定した場合（ステップＳ１２０８、否定）、データ配置制御部１１０は、そのまま処理を終える。ここで、Ｎはリトライ数の上限値として予め定めておけばよい。 If it is determined that the transmission process is successful (Yes at step S1207), the data arrangement control unit 110 ends the process. On the other hand, if it is determined that the transmission process has failed (No in step S1207), the data arrangement control unit 110 determines whether the number of transmission process failures is N or less (N is a natural number equal to or greater than 1). (Step S1208). If it is determined that the number is N times or less (Yes at Step S1208), the data arrangement control unit 110 returns to Step S1206 and transmits information to the request source again. On the other hand, if it is determined that it is not less than N times (No in step S1208), the data arrangement control unit 110 ends the process as it is. Here, N may be determined in advance as the upper limit value of the number of retries.

［読出部による読出処理の流れの一例］
図１３は、第２の実施形態に係る読出部による読出処理の流れの一例を示すフローチャートである。図１３を参照し、読出部（たとえば４０３Ａ）による読出処理の流れの一例を説明する。 [Example of the flow of read processing by the read unit]
FIG. 13 is a flowchart illustrating an example of a flow of reading processing by the reading unit according to the second embodiment. With reference to FIG. 13, an example of the flow of reading processing by the reading unit (for example, 403A) will be described.

まず、読出部４０３Ａは、アプリケーション実行部４０１Ａからデータを読み出すファイルＩＤを取得する（ステップＳ１３０１）。次に、読出部４０３Ａは、アプリケーション実行部４０１Ａから読み出すデータのオフセットと、データ長を取得する（ステップＳ１３０２）。次に、読出部４０３Ａは、データ配置制御部１１０から、取得したファイルＩＤ、オフセットおよびデータ長に基づき、読み出すデータを保持するブロックのデータＩＤと、当該データＩＤに対応付けられた位置情報を取得する（ステップＳ１３０３）。データＩＤと、当該データＩＤに対応付けられた位置情報と、を取得する際のデータ配置制御部１１０の処理が、図１２の処理にあたる。 First, the reading unit 403A obtains a file ID for reading data from the application execution unit 401A (step S1301). Next, the reading unit 403A acquires the offset and data length of data read from the application execution unit 401A (step S1302). Next, the reading unit 403A acquires, from the data arrangement control unit 110, the data ID of the block holding the data to be read and the position information associated with the data ID based on the acquired file ID, offset, and data length. (Step S1303). The process of the data arrangement control unit 110 when acquiring the data ID and the position information associated with the data ID corresponds to the process of FIG.

次に、読出部４０３Ａは、データ配置制御部１１０から取得した位置情報のうち最も近いデータアクセス部を１つ選択する（ステップＳ１３０４）。このとき、読出部４０３Ａは、図８のステップＳ８０３と同様の基準でデータアクセス部を選択すればよい。 Next, reading unit 403A selects one of the closest data access units from the position information acquired from data arrangement control unit 110 (step S1304). At this time, the reading unit 403A may select the data access unit based on the same criteria as in step S803 in FIG.

そして、読出部４０３Ａは、選択したデータアクセス部に読出命令を送信してデータを読み出す（ステップＳ１３０５）。読出命令は、当該処理でアプリケーション実行部４０１Ａが指定したデータを読み出す命令である。 Then, reading unit 403A transmits a read command to the selected data access unit to read data (step S1305). The read command is a command for reading data designated by the application execution unit 401A in the processing.

読出部４０３Ａは、読み出したデータをアプリケーション実行部４０１Ａに送信する（ステップＳ１３０６）。これによって、読出処理が完了する。 The reading unit 403A transmits the read data to the application execution unit 401A (step S1306). This completes the reading process.

［データアクセス部における読出実行処理の流れの一例］
図１４は、第２の実施形態に係るデータアクセス部における読出実行処理の流れの一例を示すフローチャートである。図１４を参照し、図１３のように読出部４０３Ａから読出命令があった場合のデータアクセス部における処理の流れの一例を説明する。 [Example of flow of read execution processing in data access unit]
FIG. 14 is a flowchart illustrating an example of the flow of read execution processing in the data access unit according to the second embodiment. With reference to FIG. 14, an example of the flow of processing in the data access unit when there is a read command from the reading unit 403A as shown in FIG. 13 will be described.

まず、データアクセス部、たとえばデータアクセス部２０１Ａは、読出部４０３Ａから読出命令を受信する（ステップＳ１４０１）。データアクセス部２０１Ａは、読出命令に応じて、配下のディスクから該当するデータを読み出す（ステップＳ１４０２）。そして、データアクセス部２０１Ａは、読み出したデータを読出部４０３Ａに送信する（ステップＳ１４０３）。これによってデータアクセス部における読出実行処理が終了する。 First, the data access unit, for example, the data access unit 201A receives a read command from the read unit 403A (step S1401). In response to the read command, the data access unit 201A reads the corresponding data from the subordinate disk (step S1402). The data access unit 201A transmits the read data to the reading unit 403A (step S1403). This completes the read execution process in the data access unit.

［異常終了時の冗長データの状態］
図１５は、データアクセス部が、書込処理中に異常終了した場合の冗長データの状態を説明するための図である。図１５を参照して、書込処理が異常終了した場合に発生しうる冗長データの状態について説明する。 [Redundant data status at abnormal termination]
FIG. 15 is a diagram for explaining the state of redundant data when the data access unit abnormally terminates during the writing process. With reference to FIG. 15, the state of redundant data that may occur when the writing process ends abnormally will be described.

たとえば、図１０のステップＳ１００５においてデータアクセス部２０１Ａが他のデータアクセス部からの命令成功応答を受信しなかった場合、他のデータアクセス部において冗長データの正常な書込が完了していない場合が考えられる。また、図１０の処理を実行する書込制御を実行するデータアクセス部が、他のデータアクセス部に書込命令を送信（ステップＳ１００３、図１０）した後に何らかの理由で停止した場合、当該データアクセス部は、外部からアクセス不可能な状態となる。また、他のデータアクセス部は書込が成功したか否かの応答を当該データアクセス部に送信することができないため、これらのデータアクセス部において冗長データの書込が成功したか否かを、外部から知ることができない。 For example, if the data access unit 201A does not receive an instruction success response from another data access unit in step S1005 of FIG. 10, normal writing of redundant data may not be completed in the other data access unit. Conceivable. If the data access unit that executes the write control for executing the processing of FIG. 10 stops for some reason after transmitting a write command to another data access unit (step S1003, FIG. 10), the data access The part becomes inaccessible from the outside. In addition, since the other data access units cannot send a response indicating whether or not the writing is successful to the data access unit, whether or not the redundant data writing is successful in these data access units, I can't know from outside.

図１５に示すように、ファイル「Ａ」のデータを３つのブロックに分割して、データＩＤ「ａ３３」の冗長データを３つのディスクに格納しようとした場合を考える。この場合、最初に書込を実行しようとしたサーバが停止すると、当該サーバに対応するディスクに冗長データが書き込まれたのか否かは不明であり、また当該ディスクにはデータアクセスすることができなくなる。この状態を図１５の（１）に示す。このようにサーバが停止して書込の成否が不明になった場合、冗長データ位置テーブル１２１においては、データＩＤに対応して格納されていたサーバの位置情報が削除される。 As shown in FIG. 15, consider a case where the data of the file “A” is divided into three blocks and the redundant data with the data ID “a33” is to be stored on three disks. In this case, if the server that attempted to execute writing first stops, it is unclear whether redundant data has been written to the disk corresponding to the server, and data cannot be accessed on the disk. . This state is shown in FIG. Thus, when the server stops and the success or failure of writing becomes unknown, the server location information stored corresponding to the data ID is deleted in the redundant data location table 121.

また、停止したサーバのデータアクセス部に対して書込の成否を通知するデータアクセス部については、書込が成功したのか否かを知ることができない。したがって、これらのデータアクセス部が制御する書き込みデータの状態は、図１５の（２），（３）に示すようになる。 Further, the data access unit that notifies the data access unit of the stopped server of the success or failure of writing cannot know whether or not the writing is successful. Therefore, the states of the write data controlled by these data access units are as shown in (2) and (3) of FIG.

［データ整合処理の流れの一例］
図１６は、第２の実施形態に係るデータ整合装置４１０におけるデータ整合処理の流れの一例を示すフローチャートである。図１６を参照し、第２の実施形態に係るデータ整合装置４１０におけるデータ整合処理の流れの一例を説明する。 [Example of data consistency processing flow]
FIG. 16 is a flowchart illustrating an example of the flow of data matching processing in the data matching apparatus 410 according to the second embodiment. An example of the flow of data matching processing in the data matching apparatus 410 according to the second embodiment will be described with reference to FIG.

まず、クライアント４００がデータ整合処理を実行しようとする場合、整合要求を生成する。データ整合装置４１０（比較部４１２）は、生成された整合要求を受け取る（ステップＳ１６０１）。整合要求は、たとえば、データアクセス部２０１Ａ，２０１Ｂ，２０１Ｃのいずれかにおいて、データの書込処理が正常に終了せず、書込要求の要求元であるクライアント４００Ａ，４００Ｂ，４００Ｃのいずれかに、書込完了の通知が送信されなかった場合等に、クライアント４００Ａ，４００Ｂ，４００Ｃが生成する。クライアント４００Ａ，４００Ｂ，４００Ｃではアプリケーション実行部４０１Ａ，４０１Ｂ，４０１Ｃが、書込対象データの重要性や処理の内容に応じて、整合要求を送るか否かを判断すればよい。整合要求は、整合の対象となるファイルＩＤを含む。整合要求はまた、指定するファイルＩＤの末尾ブロックのデータ長を揃える要求である。 First, when the client 400 intends to execute the data matching process, a matching request is generated. The data matching apparatus 410 (comparison unit 412) receives the generated matching request (step S1601). For example, in the data access unit 201A, 201B, or 201C, the data write process does not end normally and any of the clients 400A, 400B, and 400C that is the request source of the write request Generated by the clients 400A, 400B, and 400C, for example, when a write completion notification is not transmitted. In the clients 400A, 400B, and 400C, the application execution units 401A, 401B, and 401C may determine whether to send a consistency request according to the importance of the write target data and the contents of the processing. The matching request includes a file ID to be matched. The matching request is also a request for aligning the data length of the last block of the specified file ID.

整合要求を受けたデータ整合装置４１０は、冗長データ位置テーブル１２１から、整合要求に含まれるファイルＩＤで書き込まれているデータブロックのうち、末尾のブロックに対応するデータＩＤと、当該データＩＤに対応付けられている位置情報の取得を試みる（ステップＳ１６０２）。データ整合装置４１０は、データＩＤと位置情報を取得できなかった場合（ステップＳ１６０２、否定）は、そのままデータ整合処理を終了する。なお、データＩＤと位置情報の取得は図１３に示す手順と同様に行えばよい。 The data matching apparatus 410 that has received the matching request corresponds to the data ID corresponding to the last block of the data blocks written with the file ID included in the matching request from the redundant data position table 121 and the data ID. An attempt is made to acquire attached position information (step S1602). If the data matching apparatus 410 cannot acquire the data ID and the position information (No at Step S1602), the data matching process is ended as it is. The acquisition of the data ID and position information may be performed in the same manner as the procedure shown in FIG.

他方、データ整合装置４１０は、データＩＤと位置情報を取得できた場合（ステップＳ１６０２、肯定）、次に、取得した位置情報の中に複数のデータアクセス部が含まれているか否かを判定する（ステップＳ１６０３）。 On the other hand, if the data matching apparatus 410 can acquire the data ID and the position information (Yes in step S1602), it next determines whether or not a plurality of data access units are included in the acquired position information. (Step S1603).

そして、複数のデータアクセス部が含まれていない（ステップＳ１６０３、否定）と判定した場合、そのままデータ整合処理を終了する。他方、複数のデータアクセス部が含まれている（ステップＳ１６０３、肯定）と判定した場合、比較部４１２は、各データアクセス部が制御するディスクに格納された、整合要求（具体的には、ステップＳ１０６２）によって特定されたデータＩＤに対応するデータブロックのデータ長を取得する（ステップＳ１６０４）。データ長を取得できなかった場合（ステップＳ１６０５、否定）は、再びステップＳ１６０２に戻る。これに対して、データ長を取得できた場合（ステップＳ１６０５、肯定）は、次に比較部４１２は、複数のデータブロックのデータ長がすべて一致するか否かを判定する（ステップＳ１６０６）。比較部４１２は、データ長が一致したと判定した場合（ステップＳ１６０６、肯定）、データ整合処理を終了する。これに対して、データ長が一致しないと判定した場合（ステップＳ１６０６、否定）、比較部４１２は、処理を同一化部４１３に渡し、同一化部４１３が同一化処理を実行する（ステップＳ１６０７）。同一化処理については、詳しく後述する。同一化処理が完了すると、データ整合処理は終了する。 If it is determined that a plurality of data access units are not included (No in step S1603), the data matching process is terminated as it is. On the other hand, if it is determined that a plurality of data access units are included (Yes in step S1603), the comparison unit 412 stores the matching request (specifically, the step stored in the disk controlled by each data access unit). The data length of the data block corresponding to the data ID specified in S1062) is acquired (step S1604). If the data length could not be acquired (No at step S1605), the process returns to step S1602. On the other hand, if the data length can be acquired (Yes in step S1605), the comparison unit 412 next determines whether or not the data lengths of the plurality of data blocks all match (step S1606). If the comparison unit 412 determines that the data lengths match (Yes in step S1606), the comparison unit 412 ends the data matching process. On the other hand, when it is determined that the data lengths do not match (No at Step S1606), the comparison unit 412 passes the process to the identification unit 413, and the identification unit 413 executes the identification process (Step S1607). . The identification process will be described later in detail. When the identification process is completed, the data matching process ends.

［同一化処理の流れの一例］
図１７は、第２の実施形態に係るデータ整合装置４１０における同一化処理の流れの一例を示す図である。図１７を参照し、同一化部４１３による同一化処理の流れの一例を説明する。なお、図１７の例では、比較部４１２が比較した冗長データの数は２であるとする。 [Example of identification process flow]
FIG. 17 is a diagram illustrating an example of the flow of the identification process in the data matching apparatus 410 according to the second embodiment. With reference to FIG. 17, an example of the flow of the identification process by the identification unit 413 will be described. In the example of FIG. 17, it is assumed that the number of redundant data compared by the comparison unit 412 is two.

まず、比較部４１２が、複数（２つ）のデータのデータ長が一致しないと判定した場合（図１６のステップＳ１６０６、否定）、同一化部４１３は、比較部４１２が比較した２つのデータブロックのデータ長のうち、短い方のデータ長を選択する（ステップＳ１７０１）。ここでは、比較した２つのデータブロックをそれぞれ「データＡ」、「データＢ」と呼ぶ。そして、同一化部４１３は、選択したデータ長がデータＡのものであるか否かを判定する（ステップＳ１７０２）。選択したデータ長がデータＢのものであると判定した場合（ステップＳ１７０２、否定）、同一化部４１３は、データＡのデータ長をデータＢに合わせるように、データＡを切り詰める（ステップＳ１７０３）。他方、選択したデータ長がデータＡのものであると判定した場合（ステップＳ１７０２、肯定）、同一化部４１３は、データＢのデータ長をデータＡに合わせるように、データＢの長さを切り詰める（ステップＳ１７０４）。そして、同一化部４１３は、同一化処理が完了した旨を、整合要求の要求元に通知する（ステップＳ１７０５）。 First, when the comparison unit 412 determines that the data lengths of a plurality (two) of data do not match (No in step S1606 in FIG. 16), the identification unit 413 determines that the two data blocks compared by the comparison unit 412 The shorter one of the data lengths is selected (step S1701). Here, the two data blocks compared are referred to as “data A” and “data B”, respectively. The identifying unit 413 determines whether the selected data length is that of the data A (step S1702). If it is determined that the selected data length is that of data B (No at step S1702), the identifying unit 413 truncates the data A so that the data length of the data A matches the data B (step S1703). On the other hand, when it is determined that the selected data length is that of data A (Yes in step S1702), the identifying unit 413 truncates the length of data B so that the data length of data B matches the data A. (Step S1704). The identification unit 413 notifies the requester of the matching request that the identification process has been completed (step S1705).

同一化部４１３が、同一化処理を行う際は、図６に示したデータ配置制御処理と同様の処理が実行される。すなわち、同一化部４１３は、データ配置制御部１１０に対して、データ長を揃える対象データブロックを取得するため、ファイルＩＤと、負数を指定したオフセットと、を含む命令を送信する。データは位置制御部１１０では、オフセットに負数を指定した場合は、ファイルの末尾ブロックが指定されたことと同意であると判定する。そのため、命令に応じてデータ配置制御部１１０が、指定したファイルＩＤのファイル末尾のデータＩＤと位置情報を同一化部４１３に送信するとともに、データアクセス部に制御命令を送る。その後の処理は、図６に例示したものと同様である。 When the identification unit 413 performs the identification process, a process similar to the data arrangement control process illustrated in FIG. 6 is executed. That is, the identifying unit 413 transmits a command including a file ID and an offset designating a negative number to the data arrangement control unit 110 in order to acquire a target data block whose data length is uniform. If the position controller 110 designates a negative number for the offset, the data is determined to be in agreement with the designation of the last block of the file. Therefore, in response to the command, the data placement control unit 110 transmits the data ID and position information at the end of the specified file ID to the identification unit 413 and sends a control command to the data access unit. The subsequent processing is the same as that illustrated in FIG.

なお、データ整合処理を行った場合の読出処理も図１３、図１４に示した処理と同様に実行される。ただし、データアクセス部のいずれかの異常終了等により、ディスクがアクセスできない状態になっている場合は、冗長データ位置テーブル１２１の位置情報からは当該データアクセス部の情報が削除されているため、読出部が取得できる位置情報の内容が通常とは変更されている。また、データ整合処理の結果、通常の書込処理時と同様に冗長データ位置テーブル１２１や処理対象リスト１２２が更新されている。 Note that the reading process when the data matching process is performed is executed in the same manner as the processes shown in FIGS. However, if the disk is in an inaccessible state due to any abnormal termination or the like of the data access unit, the information of the data access unit is deleted from the position information in the redundant data position table 121. The content of the position information that can be acquired by the department has been changed from normal. As a result of the data matching process, the redundant data position table 121 and the processing target list 122 are updated as in the normal writing process.

［第２の実施形態の効果］
このように第２の実施形態に係るデータ整合システム１は、１以上のクライアントとネットワークを介して接続されるシステムであって、同一のデータの複製である冗長データをそれぞれ格納する複数の記憶装置と、複数の記憶装置に格納される冗長データへのアクセスを制御し、それぞれ複数の記憶装置の１つに接続される複数のサーバと、クライアントからの要求に応じて、複数の記憶装置それぞれに格納される、冗長データのデータ長を比較する比較部４１２と、比較部４１２が冗長データのデータ長が同一ではないと判定した場合に、複数の記憶装置に格納された冗長データのうち最も短い冗長データのデータ長に他の冗長データのデータ長を揃える同一化部４１３と、を備える。このため、書込処理が異常終了した場合であっても、上位アプリケーション側の選択に応じて、冗長データのデータ長を揃えることができる。したがって、異なる記憶装置に格納される冗長データ相互間で不整合が生じることが防止されるとともに、データの信頼性を向上させることができる。また、上位アプリケーションは、異常終了した書込処理の対象データの特徴に応じて、データ整合処理の要否を判定した上で、整合要求を発行することができ、異常終了時に自動的に再書き込みを行う等の対処法に比べて、上位アプリケーションにかかる負荷を減じて、処理性能の低下を抑制することができる。 [Effects of Second Embodiment]
As described above, the data matching system 1 according to the second embodiment is a system connected to one or more clients via a network, and stores a plurality of storage devices each storing redundant data that is a duplicate of the same data. And controlling access to redundant data stored in a plurality of storage devices, each of a plurality of servers connected to one of the plurality of storage devices, and each of the plurality of storage devices in response to a request from a client The comparison unit 412 that compares the data lengths of the redundant data to be stored and the comparison unit 412 determine that the data lengths of the redundant data are not the same, the shortest of the redundant data stored in the plurality of storage devices And an identification unit 413 that aligns the data length of the redundant data with the data length of the redundant data. For this reason, even if the writing process ends abnormally, the data lengths of the redundant data can be made uniform according to the selection on the higher application side. Therefore, it is possible to prevent inconsistency between redundant data stored in different storage devices and improve data reliability. In addition, the upper application can issue an alignment request after determining whether data alignment processing is necessary according to the characteristics of the target data of the abnormally terminated write processing, and automatically rewrites upon abnormal termination Compared with a coping method such as performing the above, it is possible to reduce the load on the host application and suppress the degradation of the processing performance.

また、第２の実施形態に係るデータ整合システム１においては、複数の記憶部はそれぞれ、同一のファイルデータを固定長のブロックに分割し同一のデータブロックを複製して得た前記冗長データを格納し、比較部４１２は、データブロックの識別子と、当該データブロックの格納先として指定される複数の記憶部の中から選択された２以上の記憶部の位置情報と、を対応付けて記憶する冗長データ位置テーブル１２１を参照して、同一のデータブロックに対応づけて記憶される２以上の記憶部の冗長データのうち末尾のデータブロックのデータ長を比較する。このため、ファイル末尾のブロックの冗長データ間でデータ長が一致している場合は、データ長を揃える処理を実行しない。したがって、チェックサム等を用いて冗長データ間の不一致を検出する場合と比較して、短い処理時間で上位アプリケーションに応答を返すことができる。また、上位アプリケーションは、データ整合処理を実行するか否かを選択するのみで冗長データの整合性を維持することができ、処理負荷が低減される。また、書込処理が異常終了した場合には自動的にデータ整合処理を実行するものとしておけば、アプリケーション実行部自体が停止した場合でも、冗長データ間の整合性を維持することができる。 In the data matching system 1 according to the second embodiment, each of the plurality of storage units stores the redundant data obtained by dividing the same file data into fixed-length blocks and duplicating the same data block. The comparison unit 412 stores the identifier of the data block and the positional information of two or more storage units selected from the plurality of storage units specified as the storage destination of the data block in association with each other. With reference to the data position table 121, the data length of the last data block is compared among redundant data of two or more storage units stored in association with the same data block. For this reason, when the data lengths match between the redundant data of the block at the end of the file, the process of aligning the data lengths is not executed. Therefore, a response can be returned to the upper application in a shorter processing time compared to the case where a mismatch between redundant data is detected using a checksum or the like. Further, the host application can maintain the consistency of redundant data only by selecting whether or not to execute the data matching process, and the processing load is reduced. If the data matching process is automatically executed when the writing process is abnormally terminated, the consistency between redundant data can be maintained even when the application execution unit itself stops.

［第３の実施形態］
これまで本発明の実施形態について説明したが、本発明は上述した実施形態以外にも、その他の実施形態にて実施されてもよい。以下に、その他の実施形態を説明する。 [Third Embodiment]
Although the embodiments of the present invention have been described so far, the present invention may be implemented in other embodiments besides the above-described embodiments. Other embodiments will be described below.

［システム構成］
上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部又は一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部又は一部を公知の方法で自動的に行うこともできる。この他、上述文書中や図面中に示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。 [System configuration]
Of the processes described in the above embodiment, all or part of the processes described as being performed automatically can be performed manually, or all of the processes described as being performed manually or A part can be automatically performed by a known method. In addition, the processing procedures, control procedures, specific names, and information including various data and parameters shown in the above-described document and drawings can be arbitrarily changed unless otherwise specified.

また、上記実施形態の説明においては、クライアント４００Ａがアプリケーション実行部４０１Ａ、書込部４０２Ａ、読出部４０３Ａ、データ整合装置４１０Ａを備えるものとして説明したが、これらの機能の一部をマスタ１００に組み入れてもよい。たとえば、書込命令、読出命令、またはデータ整合命令はマスタ１００のデータ配置制御部１１０がデータアクセス部２０１Ａ，２０１Ｂ，２０１Ｃに対して送信するように構成してもよい。また、書込部４０２Ａ，４０２Ｂ，４０２Ｃ、読出部４０３Ａ，４０３Ｂ，４０３Ｃ、またはデータ整合装置４１０Ａ，４１０Ｂ，４１０Ｃの機能の一部をサーバ２００Ａ，２００Ｂ，２００Ｃのデータアクセス部２０１Ａ，２０１Ｂ，２０１Ｃが実行してもよい。 In the description of the above embodiment, the client 400A has been described as including the application execution unit 401A, the writing unit 402A, the reading unit 403A, and the data matching device 410A. However, some of these functions are incorporated into the master 100. May be. For example, the data placement control unit 110 of the master 100 may be configured to transmit a write command, a read command, or a data matching command to the data access units 201A, 201B, and 201C. Also, some of the functions of the writing units 402A, 402B, 402C, the reading units 403A, 403B, 403C, or the data matching devices 410A, 410B, 410C are performed by the data access units 201A, 201B, 201C of the servers 200A, 200B, 200C. May be executed.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部又は一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的又は物理的に分散・統合して構成することができる。例えば、図３に示す例では、書込部４０２Ａ、かつ、読出部４０３Ａ、かつ、データ整合装置４１０Ａを、クライアント４００Ａの一部として図示したが、書込部４０２Ａ、読出部４０３Ａ、および、データ整合装置４１０Ａの機能の一部を、マスタ１００に組み込んでもよい。たとえば、クライアント４００Ａの動作または機能として説明した動作または機能は、マスタ１００に組み入れることができる。 Further, each component of each illustrated apparatus is functionally conceptual, and does not necessarily need to be physically configured as illustrated. That is, the specific form of distribution / integration of each device is not limited to the one shown in the figure, and all or a part of the distribution / integration may be functionally or physically distributed in arbitrary units according to various loads or usage conditions. Can be integrated and configured. For example, in the example illustrated in FIG. 3, the writing unit 402A, the reading unit 403A, and the data matching device 410A are illustrated as a part of the client 400A, but the writing unit 402A, the reading unit 403A, and the data A part of the function of the matching device 410A may be incorporated in the master 100. For example, the operations or functions described as the operations or functions of the client 400 </ b> A can be incorporated in the master 100.

［プログラム］
図１８は、データ整合システム１による一連の処理を実行するプログラムであるデータ整合プログラムによる情報処理が、コンピュータを用いて具体的に実現されることを示す図である。図１８に例示するように、コンピュータ３０００は、例えば、メモリ３０１０と、ＣＰＵ（Central Processing Unit）３０２０と、ハードディスクドライブ３０８０と、ネットワークインタフェース３０７０とを有する。コンピュータ３０００の各部はバス３１００によって接続される。 [program]
FIG. 18 is a diagram showing that the information processing by the data matching program that is a program for executing a series of processes by the data matching system 1 is specifically realized by using a computer. As illustrated in FIG. 18, the computer 3000 includes, for example, a memory 3010, a CPU (Central Processing Unit) 3020, a hard disk drive 3080, and a network interface 3070. Each part of the computer 3000 is connected by a bus 3100.

メモリ３０１０は、図１８に例示するように、ＲＯＭ３０１１及びＲＡＭ３０１２を含む。ＲＯＭ３０１１は、例えば、ＢＩＯＳ（Basic Input Output System）等のブートプログラムを記憶する。 The memory 3010 includes a ROM 3011 and a RAM 3012 as illustrated in FIG. The ROM 3011 stores a boot program such as BIOS (Basic Input Output System).

ここで、図１８に例示するように、ハードディスクドライブ３０８０は、例えば、ＯＳ３０８１、アプリケーションプログラム３０８２、プログラムモジュール３０８３、プログラムデータ３０８４を記憶する。すなわち、開示の技術に係るデータ整合プログラムは、コンピュータによって実行される指令が記述されたプログラムモジュール３０８３として、例えばハードディスクドライブ３０８０に記憶される。例えば、データ配置制御部１１０、データ整合装置４１０、および、データアクセス部における処理と同様の情報処理を実行する手順各々が記述されたプログラムモジュール３０８３が、ハードディスクドライブ３０８０に記憶される。 Here, as illustrated in FIG. 18, the hard disk drive 3080 stores, for example, an OS 3081, an application program 3082, a program module 3083, and program data 3084. In other words, the data matching program according to the disclosed technique is stored in, for example, the hard disk drive 3080 as the program module 3083 in which instructions executed by the computer are described. For example, the data placement control unit 110, the data matching device 410, and a program module 3083 in which procedures for executing information processing similar to the processing in the data access unit are described are stored in the hard disk drive 3080.

また、記憶部１２０に記憶されるデータのように、データ整合プログラムによる情報処理に用いられるデータは、プログラムデータ３０８４として、例えばハードディスクドライブ３０８０に記憶される。そして、ＣＰＵ３０２０が、ハードディスクドライブ３０８０に記憶されたプログラムモジュール３０８３やプログラムデータ３０８４を必要に応じてＲＡＭ３０１２に読み出し、各種の手順を実行する。 Further, data used for information processing by the data matching program, such as data stored in the storage unit 120, is stored as program data 3084 in, for example, the hard disk drive 3080. The CPU 3020 reads the program module 3083 and program data 3084 stored in the hard disk drive 3080 to the RAM 3012 as necessary, and executes various procedures.

なお、データ整合プログラムに係るプログラムモジュール３０８３やプログラムデータ３０８４は、ハードディスクドライブ３０８０に記憶される場合に限られない。例えば、プログラムモジュール３０８３やプログラムデータ３０８４は、着脱可能な記憶媒体に記憶されてもよい。この場合、ＣＰＵ３０２０は、ディスクドライブなどの着脱可能な記憶媒体を介してデータを読み出す。また、同様に、更新プログラムに係るプログラムモジュール３０８３やプログラムデータ３０８４は、ネットワーク（ＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）等）を介して接続された他のコンピュータに記憶されてもよい。この場合、ＣＰＵ３０２０は、ネットワークインタフェース３０７０を介して他のコンピュータにアクセスすることで各種データを読み出す。 Note that the program module 3083 and the program data 3084 related to the data matching program are not limited to being stored in the hard disk drive 3080. For example, the program module 3083 and the program data 3084 may be stored in a removable storage medium. In this case, the CPU 3020 reads data via a removable storage medium such as a disk drive. Similarly, the program module 3083 and the program data 3084 related to the update program may be stored in another computer connected via a network (LAN (Local Area Network), WAN (Wide Area Network), etc.). . In this case, the CPU 3020 reads various data by accessing another computer via the network interface 3070.

［その他］
なお、本実施例で説明したデータ整合プログラムは、インターネット等のネットワークを介して配布することができる。また、データ整合プログラムは、ハードディスク、フレキシブルディスク（ＦＤ）、ＣＤ−ＲＯＭ（Compact Disk Read Only Memory）、ＭＯ（Magnetooptic Disc）、ＤＶＤ（Digital Versatile Disc）などのコンピュータで読取可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行することもできる。 [Others]
The data matching program described in the present embodiment can be distributed via a network such as the Internet. The data alignment program is recorded on a computer-readable recording medium such as a hard disk, flexible disk (FD), CD-ROM (Compact Disk Read Only Memory), MO (Magnetooptic Disc), DVD (Digital Versatile Disc). It can also be executed by being read from a recording medium by a computer.

１データ整合システム
１０，４１０Ａ，４１０Ｂ，４１０Ｃデータ整合装置
１１比較部
１２同一化部
１００マスタ
１１０データ配置制御部
１２０記憶部
１２１冗長データ位置テーブル
１２２処理対象リスト
１３０入出力部
２００Ａ，２００Ｂ，２００Ｃサーバ
２０１Ａ，２０１Ｂ，２０１Ｃデータアクセス部
３００Ａ，３００Ｂ，３００Ｃディスク
４００Ａ，４００Ｂ，４００Ｃクライアント
４０１Ａ，４０１Ｂ，４０１Ｃアプリケーション実行部
４０２Ａ，４０２Ｂ，４０２Ｃ書込部
４０３Ａ，４０３Ｂ，４０３Ｃ読出部
４１２Ａ，４１２Ｂ，４１２Ｃ比較部
４１３Ａ，４１３Ｂ，４１３Ｃ同一化部
３０００コンピュータ
３０１０メモリ
３０１１ＲＯＭ
３０１２ＲＡＭ
３０２０ＣＰＵ
３０７０ネットワークインタフェース
３０８０ハードディスクドライブ
３０８１ＯＳ
３０８２アプリケーションプログラム
３０８３プログラムモジュール
３０８４プログラムデータ
３１００バス DESCRIPTION OF SYMBOLS 1 Data alignment system 10,410A, 410B, 410C Data alignment apparatus 11 Comparison part 12 Identity part 100 Master 110 Data arrangement control part 120 Storage part 121 Redundant data position table 122 Processing object list 130 Input / output part 200A, 200B, 200C Server 201A, 201B, 201C Data access unit 300A, 300B, 300C Disk 400A, 400B, 400C Client 401A, 401B, 401C Application execution unit 402A, 402B, 402C Writing unit 403A, 403B, 403C Reading unit 412A, 412B, 412C Comparison unit 413A, 413B, 413C Identification unit 3000 Computer 3010 Memory 3011 ROM
3012 RAM
3020 CPU
3070 Network interface 3080 Hard disk drive 3081 OS
3082 Application program 3083 Program module 3084 Program data 3100 Bus

Claims

A comparison unit that compares the data length of redundant data that is a copy of the same data stored in each of the plurality of storage units when the writing process from the upper application ends abnormally;
When the comparison unit determines that the data length of the redundant data is not the same, the data length of the other redundant data is set to the data length of the shortest redundant data among the redundant data stored in the plurality of storage units. An identification unit to be aligned,
A data matching apparatus comprising:

Each of the plurality of storage units stores the redundant data obtained by dividing the same file data into fixed-length blocks and replicating the same data block,
The comparing unit stores the identifier of the data block and the positional information of two or more storage units selected from the plurality of storage units specified as the storage destination of the data block in association with each other. 2. The data length of a last data block of the redundant data in the two or more storage units stored in association with the same data block is compared by referring to a data position table. The data alignment device described.

A comparison step of comparing the data length of redundant data, which is a copy of the same data, stored in each of the plurality of storage units, when the writing process from the upper application ends abnormally;
When it is determined in the comparison step that the data length of the redundant data is not the same, the data length of the other redundant data is set to the data length of the shortest redundant data among the redundant data stored in the plurality of storage units. The same identification process,
A data alignment method comprising:

The plurality of storage units each store the redundant data obtained by dividing the same file data into fixed-length blocks and replicating the same data block,
The comparison step includes redundantly storing the identifier of the data block and the positional information of two or more storage units selected from the plurality of storage units specified as the storage destination of the data block in association with each other Referencing the data position table, comparing the data length of the last block among the redundant data of the two or more storage units stored in association with the same data block in the redundant data position table. The data matching method according to claim 3, wherein:

A comparison procedure for comparing the data length of redundant data, which is a copy of the same data stored in each of the plurality of storage units, when the writing process from the upper application ends abnormally,
When it is determined in the comparison procedure that the data length of the redundant data is not the same, the data length of the other redundant data is set to the data length of the shortest redundant data among the redundant data stored in the plurality of storage units. The same identification procedure,
A data alignment program for causing a computer to execute

The plurality of storage units each store the redundant data obtained by dividing the same file data into fixed-length blocks and replicating the same data block,
The comparison procedure is a redundancy in which an identifier of the data block and position information of two or more storage units selected from the plurality of storage units specified as storage destinations of the data block are stored in association with each other. Referencing the data position table, comparing the data length of the last block among the redundant data of the two or more storage units stored in association with the same data block in the redundant data position table. 6. The data matching program according to claim 5, wherein