JP2011170667A

JP2011170667A - File-synchronizing system, file synchronization method, and file synchronization program

Info

Publication number: JP2011170667A
Application number: JP2010034574A
Authority: JP
Inventors: Satoshi Mizuno; 佐年士水野
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2010-02-19
Filing date: 2010-02-19
Publication date: 2011-09-01

Abstract

<P>PROBLEM TO BE SOLVED: To improve the efficiency of file synchronization. <P>SOLUTION: A transfer device 10 includes an actual data splitting unit 11 for splitting data in a file 141 to be transferred into a plurality of blocks; a hash-value creation unit 12 for creating a hash value out of the data for each block; and a transmission unit 13 for transmitting the hash value to the transfer device 20. The transfer device 20 includes an actual data splitting unit 21 for splitting the data in the file 241 into a block of the same size as the blocks split by the actual data splitting unit 11; a hash value creation unit 22 for creating a hash value out of the data for each block; and a transmission unit 23 for comparing for each block number the hash values generated by the hash value creation units 12, 22, and transmitting the block numbers of the unmatched hash values to the transfer device 10. The transmission unit 13 transmits the actual data corresponding to the block number received from the transfer device 20 to the transfer device 20. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、ファイル同期システム、ファイル同期方法およびファイル同期プログラムに関する。 The present invention relates to a file synchronization system, a file synchronization method, and a file synchronization program.

同じファイルを複数のシステムで取り扱う場合に、ファイル同期を行ってそれぞれのファイルの内容が同じ内容になるように管理することがある。ファイル同期を行う方法としては、例えば、対象ファイル全体を転送して同期をとる方法や、対象ファイルの更新前後における差分情報のみを転送して同期をとる方法等がある。差分情報のみを転送する方法を採用することで、同期処理のターンアラウンドタイムを短縮することが可能となる。 When the same file is handled by a plurality of systems, file synchronization may be performed so that the contents of each file are the same. As a method of performing file synchronization, for example, there are a method of transferring and synchronizing the entire target file, and a method of transferring and synchronizing only difference information before and after updating of the target file. By adopting a method of transferring only the difference information, it becomes possible to shorten the turnaround time of the synchronization process.

下記特許文献１には、差分情報のみを転送する方法でファイル同期を行うシステムが開示されている。このシステムでは、ファイルの転送元装置が更新前後のファイルをそれぞれ管理し、更新前後のファイルから抽出した差分データを転送先装置に送信している。そして、転送先装置が差分データと更新前のファイルとを用いて更新後のファイルを生成することで、ファイル同期を行っている。 Patent Document 1 below discloses a system that performs file synchronization by transferring only difference information. In this system, the file transfer source device manages the files before and after the update, and transmits the difference data extracted from the files before and after the update to the transfer destination device. Then, the transfer destination apparatus performs file synchronization by generating the updated file using the difference data and the file before update.

特開２００３−３３７７２３号公報JP 2003-337723 A

上記特許文献1に記載のシステムでは、更新後のファイルに加え、更新前のファイルも管理している。したがって、記憶領域の増大やファイルメンテナンス作業の増加等を招来し、コストが増大する要因になっている。 In the system described in Patent Document 1, in addition to the updated file, the file before the update is managed. Therefore, an increase in storage area and an increase in file maintenance work, etc., are a factor that increases costs.

本発明は、上述した課題を解決するためになされたものであり、ファイル同期の効率を向上させることができるファイル同期システム、ファイル同期方法およびファイル同期プログラムを提供することを目的とする。 SUMMARY An advantage of some aspects of the invention is that it provides a file synchronization system, a file synchronization method, and a file synchronization program capable of improving the efficiency of file synchronization.

本発明のファイル同期システムは、ファイル同期の対象となるファイルをそれぞれ有する転送元装置と転送先装置とを備えるファイル同期システムであって、前記転送元装置は、ファイル同期の対象となる転送元の前記ファイルのデータを複数のブロックに分割する第１の分割部と、前記第１の分割部によって分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第１の生成部と、前記第１の生成部によって生成されたハッシュ値を前記転送先装置に送信する第１の送信部と、を有し、前記転送先装置は、ファイル同期の対象となる転送先の前記ファイルのデータを、前記第１の分割部によって分割された各ブロックと同サイズのブロックに分割する第２の分割部と、前記第２の分割部によって分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第２の生成部と、前記第１の生成部によって生成されたハッシュ値と前記第２の生成部によって生成されたハッシュ値とを、各ハッシュ値の生成元データを含むブロックを識別するブロック番号ごとにそれぞれ比較し、不一致となったハッシュ値に対応する前記ブロック番号を前記転送元装置に送信する第２の送信部と、を有し、前記転送元装置の前記第１の送信部は、前記転送先装置から受信した前記ブロック番号に対応する前記ブロックに含まれるデータを前記転送先装置に送信する、を備える。 The file synchronization system of the present invention is a file synchronization system comprising a transfer source device and a transfer destination device each having a file to be synchronized with the file, wherein the transfer source device is a transfer source that is to be synchronized with the file. A first dividing unit that divides the data of the file into a plurality of blocks; a first generating unit that generates a hash value from data included in each block divided by the first dividing unit; A first transmission unit that transmits the hash value generated by one generation unit to the transfer destination device, and the transfer destination device stores the data of the file at the transfer destination that is the target of file synchronization, A second dividing unit that divides the blocks divided by the first dividing unit into blocks of the same size, and a block that is divided by the second dividing unit. A second generation unit that generates a hash value from each piece of data, a hash value generated by the first generation unit, and a hash value generated by the second generation unit, A second transmission unit that compares each block number that identifies a block including the original data and transmits the block number corresponding to the hash value that does not match to the transfer source device, and the transfer source The first transmission unit of the apparatus includes: transmitting data included in the block corresponding to the block number received from the transfer destination apparatus to the transfer destination apparatus.

本発明のファイル同期方法は、ファイル同期の対象となるファイルをそれぞれ有する転送元装置と転送先装置とを備えるファイル同期システムで実行するファイル同期方法であって、前記転送元装置が、ファイル同期の対象となる転送元の前記ファイルのデータを複数のブロックに分割する第１の分割ステップと、前記第１の分割ステップにおいて分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第１の生成ステップと、前記第１の生成ステップにおいて生成されたハッシュ値を前記転送先装置に送信する第１の送信ステップと、を実行し、前記転送先装置が、ファイル同期の対象となる転送先の前記ファイルのデータを、前記第１の分割ステップにおいて分割された各ブロックと同サイズのブロックに分割する第２の分割ステップと、前記第２の分割ステップにおいて分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第２の生成ステップと、前記第１の生成ステップにおいて生成されたハッシュ値と前記第２の生成ステップにおいて生成されたハッシュ値とを、各ハッシュ値の生成元データを含むブロックを識別するブロック番号ごとにそれぞれ比較し、不一致となったハッシュ値に対応する前記ブロック番号を前記転送元装置に送信する第２の送信ステップと、を実行し、前記転送元装置の前記第１の送信ステップが、前記転送先装置から受信した前記ブロック番号に対応する前記ブロックに含まれるデータを前記転送先装置に送信する。 The file synchronization method of the present invention is a file synchronization method executed by a file synchronization system including a transfer source device and a transfer destination device each having a file to be synchronized with the file synchronization method, wherein the transfer source device performs file synchronization. A first division step of dividing the data of the transfer source file as a target into a plurality of blocks, and a first value for generating a hash value from the data included in each block divided in the first division step Performing a generation step and a first transmission step of transmitting the hash value generated in the first generation step to the transfer destination device, wherein the transfer destination device is a destination of a file synchronization target Dividing the file data into blocks of the same size as the blocks divided in the first dividing step; A second generation step for generating a hash value from data included in each block divided in the second division step, the hash value generated in the first generation step, and the second The hash value generated in the generating step 2 is compared for each block number that identifies the block including the generation data of each hash value, and the block number corresponding to the hash value that does not match is compared with the transfer source. A second transmission step of transmitting to the device, wherein the first transmission step of the transfer source device transfers the data included in the block corresponding to the block number received from the transfer destination device. Send to the destination device.

本発明のファイル同期プログラムは、上記ファイル同期方法に含まれる各ステップをコンピュータに実行させる。 The file synchronization program of the present invention causes a computer to execute each step included in the file synchronization method.

本発明によれば、ファイル同期の効率を向上させることができる。 According to the present invention, the efficiency of file synchronization can be improved.

実施形態におけるファイル同期システムの構成を例示するブロック図である。It is a block diagram which illustrates the composition of the file synchronization system in an embodiment. 転送元装置におけるハッシュ値生成レートテーブルのデータ構成を例示する図である。It is a figure which illustrates the data structure of the hash value production | generation rate table in a transfer origin apparatus. 転送元装置における転送レートテーブルのデータ構成を例示する図である。It is a figure which illustrates the data structure of the transfer rate table in a transfer source apparatus. 転送元装置におけるヒット率テーブルのデータ構成を例示する図である。It is a figure which illustrates the data structure of the hit rate table in a transfer origin apparatus. 転送先装置におけるハッシュ値生成レートテーブルのデータ構成を例示する図である。It is a figure which illustrates the data structure of the hash value generation rate table in a transfer destination apparatus. ファイル同期システムにおけるファイル転送の処理手順を説明するためのフローチャートである。It is a flowchart for demonstrating the process sequence of the file transfer in a file synchronous system. 図６の処理手順に含まれる差分転送の処理手順を説明するためのフローチャートである。It is a flowchart for demonstrating the process sequence of the difference transfer contained in the process sequence of FIG.

以下、添付図面を参照して、本発明に係るファイル同期システム、ファイル同期方法およびファイル同期プログラムの好適な実施形態について説明する。 DESCRIPTION OF EXEMPLARY EMBODIMENTS Hereinafter, preferred embodiments of a file synchronization system, a file synchronization method, and a file synchronization program according to the present invention will be described with reference to the accompanying drawings.

まず、実施形態におけるファイル同期システムの構成について説明する。図１は、実施形態におけるファイル同期システムの構成を例示するブロック図である。図１に示すように、ファイル同期システム１は、転送元装置１０と、転送先装置２０とを有する。転送元装置１０および転送先装置２０は、ネットワーク３０を介して接続される。 First, the configuration of the file synchronization system in the embodiment will be described. FIG. 1 is a block diagram illustrating a configuration of a file synchronization system in the embodiment. As shown in FIG. 1, the file synchronization system 1 includes a transfer source device 10 and a transfer destination device 20. The transfer source device 10 and the transfer destination device 20 are connected via a network 30.

転送元装置１０および転送先装置２０は、ファイル同期の対象となるファイル１４１および２４１をそれぞれ備えている。本実施形態では、説明の便宜のために、転送元装置１０のファイル１４１が更新された場合に、この更新された情報を、転送先装置２０のファイル２４１に反映させる場合について説明する。なお、転送先装置２０のファイル２４１が更新され、この更新された情報を転送元装置１０のファイル１４１に反映させる場合には、以下に説明する転送元装置１０の各機能と転送先装置２０の各機能とを入れ替えることで実現することができる。転送元装置１０および転送先装置２０は、それぞれの各機能を併せて備えることができる。 The transfer source device 10 and the transfer destination device 20 respectively include files 141 and 241 that are subject to file synchronization. In the present embodiment, for convenience of explanation, a case where the updated information is reflected in the file 241 of the transfer destination apparatus 20 when the file 141 of the transfer source apparatus 10 is updated will be described. When the file 241 of the transfer destination apparatus 20 is updated and the updated information is reflected in the file 141 of the transfer source apparatus 10, each function of the transfer source apparatus 10 described below and the transfer destination apparatus 20 This can be realized by replacing each function. The transfer source device 10 and the transfer destination device 20 can be provided with respective functions.

ここで、転送元装置１０および転送先装置２０は、物理的には、例えば、ＣＰＵ（Central Processing Unit）と、記憶装置と、入出力インターフェースとを含んで構成される。記憶装置には、例えば、ＣＰＵで処理されるプログラムやデータを記憶するＲＯＭ（Read Only Memory）やＨＤＤ（Hard Disk Drive）、主として制御処理のための各種作業領域として使用されるＲＡＭ（Random Access Memory）等が含まれる。転送元装置１０および転送先装置２０のＨＤＤには、ファイル同期の対象となるファイル１４１および２４１がそれぞれ格納される。ＣＰＵが、ＲＯＭに記憶されたプログラムを実行し、ＲＡＭに展開されるデータを処理することで、後述する転送元装置１０および転送先装置２０の各機能が実現する。なお、ファイル同期の対象となるファイルは、一組であることには限定されず、複数組設定することができる。 Here, the transfer source device 10 and the transfer destination device 20 physically include, for example, a CPU (Central Processing Unit), a storage device, and an input / output interface. The storage device includes, for example, a ROM (Read Only Memory) and HDD (Hard Disk Drive) that store programs and data processed by the CPU, and a RAM (Random Access Memory) mainly used as various work areas for control processing. ) Etc. are included. The HDDs of the transfer source device 10 and the transfer destination device 20 store files 141 and 241 that are subject to file synchronization, respectively. Each function of the transfer source device 10 and the transfer destination device 20 described later is realized by the CPU executing the program stored in the ROM and processing the data expanded in the RAM. Note that the file synchronization target file is not limited to one set, and a plurality of sets can be set.

転送元装置１０は、実データ分割部１１と、ハッシュ値生成部１２と、送信部１３と、記憶部１４と、転送選択部１５と、を有する。 The transfer source device 10 includes a real data dividing unit 11, a hash value generation unit 12, a transmission unit 13, a storage unit 14, and a transfer selection unit 15.

実データ分割部１１は、ファイル同期の対象となる転送元のファイル１４１の実データを複数のブロックに分割する。分割するブロック数は、分割ブロック数Ｎとして予め設定し、記憶部１４に記憶させる。Ｎ個に分割した各ブロックには、ブロックを識別するブロック番号を付与する。実データ分割部１１は、各ブロックのサイズを、分割ブロックサイズＤとして記憶部１４に一時的に記憶させる。 The actual data dividing unit 11 divides the actual data of the transfer source file 141 to be subjected to file synchronization into a plurality of blocks. The number of blocks to be divided is preset as the number of divided blocks N and is stored in the storage unit 14. Each block divided into N blocks is assigned a block number for identifying the block. The actual data dividing unit 11 temporarily stores the size of each block in the storage unit 14 as a divided block size D.

ハッシュ値生成部１２は、実データ分割部１１によってＮ個に分割された各ブロックに含まれる実データからそれぞれハッシュ値を生成する。ハッシュ値は、所定のハッシュ関数に各実データを入力することで生成する。ハッシュ値生成部１２は、ハッシュ値の生成に要した時間を積算してハッシュ値生成時間を算出する。ハッシュ値生成部１２は、全てのブロックの実データからハッシュ値を生成した後に、ファイルサイズＡおよびハッシュ値生成時間を用いて、図２に示す転送元側のハッシュ値生成レートテーブルに新規レコードを追加する。 The hash value generation unit 12 generates a hash value from the actual data included in each block divided into N blocks by the actual data dividing unit 11. The hash value is generated by inputting each real data to a predetermined hash function. The hash value generation unit 12 calculates the hash value generation time by accumulating the time required for generating the hash value. The hash value generation unit 12 generates a new value in the hash value generation rate table on the transfer source side illustrated in FIG. 2 using the file size A and the hash value generation time after generating the hash value from the actual data of all blocks. to add.

図２に示す転送元側のハッシュ値生成レートテーブルは、データ項目として、例えば、転送処理名項目、ファイルサイズ項目、ハッシュ値生成時間項目および重み付け項目を有する。転送処理名項目は、転送処理を識別するための情報を格納する。ファイルサイズ項目は、転送元のファイルのファイルサイズを格納する。積算時間項目は、ハッシュ値生成部１２でハッシュ値を生成する際に要した時間の積算値を格納する。重み付け項目は、各レコードを用いてハッシュ値生成レートを算出する際に、各レコードに付与する重み付けを格納する。重み付けは、新しく生成されたレコード程、大きくなるように設定する。これにより、直近の転送状況を反映した生成レートを算出することが可能となる。 The hash value generation rate table on the transfer source side illustrated in FIG. 2 includes, for example, a transfer process name item, a file size item, a hash value generation time item, and a weighting item as data items. The transfer process name item stores information for identifying the transfer process. The file size item stores the file size of the transfer source file. The accumulated time item stores an accumulated value of time required when the hash value generating unit 12 generates the hash value. The weighting item stores the weight given to each record when the hash value generation rate is calculated using each record. The weighting is set so that the newly generated record becomes larger. This makes it possible to calculate a generation rate that reflects the latest transfer status.

送信部１３は、ハッシュ値取得要求を転送先装置２０に送信する。ハッシュ値取得要求には、例えば、ハッシュ値生成部１２によって生成されたハッシュ値、転送元ファイル１４１のファイル名、ファイルサイズＡ、分割ブロック数Ｎ、分割ブロックサイズＤおよびハッシュ値サイズＳが含まれる。ファイル名、ファイルサイズＡおよびハッシュ値サイズＳは、予め設定し、記憶部１４に記憶させる。 The transmission unit 13 transmits a hash value acquisition request to the transfer destination device 20. The hash value acquisition request includes, for example, the hash value generated by the hash value generation unit 12, the file name of the transfer source file 141, the file size A, the number of divided blocks N, the divided block size D, and the hash value size S. . The file name, file size A, and hash value size S are set in advance and stored in the storage unit 14.

送信部１３は、転送先装置２０から受信したブロック番号に対応するブロックに含まれる実データを転送先装置２０に送信する。送信部１３は、送信した実データのサイズを積算して転送データサイズを算出するとともに、実データの送信に要した時間を積算して転送時間を算出する。送信部１３は、転送対象となる全ての実データを送信した後に、算出した転送データサイズおよび転送時間を用いて、図３に示す転送レートテーブルに新規レコードを追加する。 The transmission unit 13 transmits the actual data included in the block corresponding to the block number received from the transfer destination device 20 to the transfer destination device 20. The transmitter 13 calculates the transfer data size by adding up the sizes of the transmitted actual data, and calculates the transfer time by adding up the time required for transmitting the actual data. After transmitting all the actual data to be transferred, the transmitter 13 adds a new record to the transfer rate table shown in FIG. 3 using the calculated transfer data size and transfer time.

図３に示す転送レートテーブルは、データ項目として、例えば、転送処理名項目、転送データサイズ項目、転送時間項目および重み付け項目を有する。転送処理名項目は、転送処理を識別するための情報を格納する。転送データサイズ項目は、送信部１３で送信した実データのサイズの積算値を格納する。転送時間項目は、送信部１３で実データを送信する際に要した時間の積算値を格納する。重み付け項目は、各レコードを用いて転送レートを算出する際に、各レコードに付与する重み付けを格納する。重み付けは、新しく生成されたレコード程、大きくなるように設定する。これにより、直近の転送状況を反映した生成レートを算出することが可能となる。 The transfer rate table shown in FIG. 3 includes, for example, a transfer process name item, a transfer data size item, a transfer time item, and a weighting item as data items. The transfer process name item stores information for identifying the transfer process. The transfer data size item stores an integrated value of the size of actual data transmitted by the transmission unit 13. The transfer time item stores an integrated value of the time required when the transmission unit 13 transmits actual data. The weighting item stores the weight given to each record when the transfer rate is calculated using each record. The weighting is set so that the newly generated record becomes larger. This makes it possible to calculate a generation rate that reflects the latest transfer status.

送信部１３は、送信対象の全ての実データを送信した後に、ファイルサイズＡおよび転送データサイズを用いて、図４に示すヒット率レートテーブルに新規レコードを追加する。 After transmitting all the actual data to be transmitted, the transmitting unit 13 adds a new record to the hit rate rate table shown in FIG. 4 using the file size A and the transfer data size.

図４に示すヒット率テーブルは、データ項目として、例えば、転送ファイル名項目、ファイルサイズ項目およびヒットサイズ項目を有する。転送ファイル名項目は、転送元のファイルの名称を格納する。ファイルサイズ項目は、転送元のファイルのファイルサイズを格納する。ヒットサイズ項目は、転送元ファイルのファイルサイズから、送信部１３で送信した転送データサイズを減算した値、つまり、転送元ファイルの実データのうち未送信の実データサイズを格納する。 The hit rate table shown in FIG. 4 includes, for example, a transfer file name item, a file size item, and a hit size item as data items. The transfer file name item stores the name of the transfer source file. The file size item stores the file size of the transfer source file. The hit size item stores a value obtained by subtracting the transfer data size transmitted by the transmission unit 13 from the file size of the transfer source file, that is, the actual data size that has not been transmitted among the actual data of the transfer source file.

転送選択部１５については、後述する。 The transfer selection unit 15 will be described later.

転送先装置２０は、実データ分割部２１と、ハッシュ値生成部２２と、送信部２３と、記憶部２４と、を有する。 The transfer destination device 20 includes an actual data dividing unit 21, a hash value generating unit 22, a transmitting unit 23, and a storage unit 24.

実データ分割部２１は、ファイル同期の対象となる転送先のファイル２４１の実データを、実データ分割部１１によって分割された各ブロックと同サイズのブロックに分割する。実データ分割部１１によって分割された各ブロックのサイズは、転送元装置１０から受信した分割ブロックサイズＤを参照することで得られる。 The actual data dividing unit 21 divides the actual data of the transfer destination file 241 to be subjected to file synchronization into blocks having the same size as the blocks divided by the actual data dividing unit 11. The size of each block divided by the actual data dividing unit 11 is obtained by referring to the divided block size D received from the transfer source device 10.

ハッシュ値生成部２２は、実データ分割部２１によって分割された各ブロックに含まれる実データからそれぞれハッシュ値を生成する。ハッシュ値は、所定のハッシュ関数に各実データを入力することで生成する。ハッシュ値生成部１２は、ハッシュ値の生成に要した時間を積算してハッシュ値生成時間を算出する。ハッシュ値生成部２２は、全てのブロックの実データからハッシュ値を生成した後に、ファイルサイズおよびハッシュ値生成時間を用いて、図５に示す転送先側のハッシュ値生成レートテーブルに新規レコードを追加する。 The hash value generation unit 22 generates a hash value from the actual data included in each block divided by the actual data dividing unit 21. The hash value is generated by inputting each real data to a predetermined hash function. The hash value generation unit 12 calculates the hash value generation time by accumulating the time required for generating the hash value. The hash value generation unit 22 adds a new record to the transfer destination side hash value generation rate table shown in FIG. 5 using the file size and the hash value generation time after generating hash values from the actual data of all blocks. To do.

図５に示す転送先側のハッシュ値生成レートテーブルは、データ項目として、例えば、転送処理名項目、ファイルサイズ項目、ハッシュ値生成時間項目および重み付け項目を有する。転送処理名項目は、転送処理を識別するための情報を格納する。ファイルサイズ項目は、転送元のファイルのファイルサイズおよび転送先のファイルのファイルサイズのうち、小さい方のファイルサイズを格納する。ハッシュ値生成時間項目は、ハッシュ値生成部２２でハッシュ値を生成する際に要した時間の積算値を格納する。重み付け項目は、各レコードを用いてハッシュ値生成レートを算出する際に、各レコードに付与する重み付けを格納する。重み付けは、新しく生成されたレコード程、大きくなるように設定する。これにより、直近の転送状況を反映した生成レートを算出することが可能となる。 The hash value generation rate table on the transfer destination side illustrated in FIG. 5 includes, for example, a transfer process name item, a file size item, a hash value generation time item, and a weighting item as data items. The transfer process name item stores information for identifying the transfer process. The file size item stores the smaller file size of the file size of the transfer source file and the file size of the transfer destination file. The hash value generation time item stores an integrated value of time required when the hash value generation unit 22 generates a hash value. The weighting item stores the weight given to each record when the hash value generation rate is calculated using each record. The weighting is set so that the newly generated record becomes larger. This makes it possible to calculate a generation rate that reflects the latest transfer status.

送信部２３は、ハッシュ値生成部１２によって生成されたハッシュ値とハッシュ値生成部２２によって生成されたハッシュ値とを、同じブロック番号ごとにそれぞれ比較し、不一致となったハッシュ値に対応するブロック番号を含む応答メッセージを転送元装置１０に送信する。 The transmission unit 23 compares the hash value generated by the hash value generation unit 12 and the hash value generated by the hash value generation unit 22 for each of the same block numbers, and the block corresponding to the hash value that does not match A response message including the number is transmitted to the transfer source device 10.

転送元装置１０の転送選択部１５は、ファイル同期時の転送方法として、転送元のファイル１４１を、通常転送で送信するのか、差分転送で送信するのかを選択する。通常転送は、転送元ファイルの全ての実データを転送先装置に送信する転送処理である。差分転送は、データ更新されたブロックに含まれる実データを転送先装置に送信する転送処理である。 The transfer selection unit 15 of the transfer source device 10 selects whether the transfer source file 141 is transmitted by normal transfer or differential transfer as a transfer method at the time of file synchronization. Normal transfer is a transfer process in which all actual data of a transfer source file is transmitted to a transfer destination device. Differential transfer is a transfer process in which actual data included in a data-updated block is transmitted to a transfer destination device.

転送選択部１５は、転送元のファイル１４１のファイルサイズＡが、分割ブロック数Ｎとハッシュ値サイズＳとを乗算した値よりも大きい場合に、差分転送を選択する。転送選択部１５は、転送元のファイル１４１のファイルサイズＡが、分割ブロック数Ｎとハッシュ値サイズＳとを乗算した値以下である場合に、通常転送を選択する。総実データサイズが総ハッシュ値サイズよりも大きい場合に、差分転送を選択することで、転送効率を向上させることができる。一方、総実データサイズが総ハッシュ値サイズ以下である場合に、通常転送を選択することで、差分転送により転送効率が低下する事態を防止することができる。 The transfer selection unit 15 selects differential transfer when the file size A of the transfer source file 141 is larger than the value obtained by multiplying the number of divided blocks N and the hash value size S. The transfer selection unit 15 selects normal transfer when the file size A of the transfer source file 141 is equal to or smaller than the value obtained by multiplying the number of divided blocks N and the hash value size S. When the total real data size is larger than the total hash value size, transfer efficiency can be improved by selecting differential transfer. On the other hand, when the total actual data size is equal to or smaller than the total hash value size, it is possible to prevent the transfer efficiency from being lowered due to the differential transfer by selecting the normal transfer.

転送選択部１５は、転送元のファイル１４１を通常転送したときの推定総処理時間（以下、「通常転送処理時間」という。）が、転送元のファイル１４１を差分転送したときの推定総処理時間（以下、「差分転送処理時間」という。）よりも長い場合に、差分転送を選択し、通常転送処理時間が差分転送処理時間以下である場合に、通常転送を選択する。以下に、通常転送処理時間および差分転送処理時間を算出する方法について、転送元のファイル１４１を転送元装置１０から転送先装置２０に転送する場合を例にして説明する。 The transfer selection unit 15 estimates the estimated total processing time when the transfer source file 141 is normally transferred (hereinafter referred to as “normal transfer processing time”), and the estimated total processing time when the transfer source file 141 is differentially transferred. (Hereinafter referred to as “differential transfer processing time”), the differential transfer is selected. When the normal transfer processing time is equal to or shorter than the differential transfer processing time, the normal transfer is selected. Hereinafter, a method for calculating the normal transfer processing time and the differential transfer processing time will be described by taking as an example the case of transferring the transfer source file 141 from the transfer source device 10 to the transfer destination device 20.

通常転送処理時間は、例えば、下記式１を用いて算出することができる。 The normal transfer processing time can be calculated using the following equation 1, for example.

通常転送処理時間Ｔ１＝ファイルサイズＡ／転送レートＲ３ … 式１ Normal transfer processing time T1 = file size A / transfer rate R3 Equation 1

上記式１のファイルサイズＡは、転送元のファイル１４１のファイルサイズである。上記式１の転送レートＲ３は、例えば、図３に示す転送レートテーブルのデータを下記式２に代入して算出することができる。 The file size A in the above equation 1 is the file size of the transfer source file 141. The transfer rate R3 in Equation 1 can be calculated, for example, by substituting the data in the transfer rate table shown in FIG.

転送レートＲ３＝ Σ（送信データサイズ／転送時間 × 重み付け）／ Σ（重み付け） … 式２ Transfer rate R3 = Σ (transmission data size / transfer time × weighting) / Σ (weighting) Equation 2

差分転送処理時間は、例えば、下記式３を用いて算出することができる。 The difference transfer processing time can be calculated using, for example, the following Equation 3.

差分転送処理時間Ｔ２＝｛通常転送処理時間Ｔ１ × （１ − ヒット率Ｈ）｝＋ハッシュ値生成時間Ｔ３＋ハッシュ値転送時間Ｔ４＋ハッシュ値生成時間Ｔ５＋メッセージ送信時間Ｔ６ … 式３ Differential transfer processing time T2 = {Normal transfer processing time T1 × (1−Hit rate H)} + Hash value generation time T3 + Hash value transfer time T4 + Hash value generation time T5 + Message transmission time T6

上記式３のヒット率Ｈは、転送元のファイル１４１と転送先のファイル２４１とに同じデータが存在する確率であり、例えば、図４に示すヒット率テーブルのデータを下記式４に代入して算出することができる。 The hit rate H in Equation 3 above is the probability that the same data exists in the transfer source file 141 and the transfer destination file 241. For example, the hit rate table data shown in FIG. Can be calculated.

ヒット率Ｈ＝ファイル１４１のヒットサイズ／ファイルサイズＡ … 式４ Hit ratio H = file 141 hit size / file size A Equation 4

上記式３のハッシュ値生成時間Ｔ３は、転送元装置１０でハッシュ値を生成するのに要する時間であり、例えば、下記式５を用いて算出することができる。 The hash value generation time T3 in Expression 3 is a time required for generating the hash value in the transfer source device 10, and can be calculated using Expression 5 below, for example.

ハッシュ値生成時間Ｔ３＝ファイルサイズＡ／ハッシュ値生成レートＲ１ … 式５ Hash value generation time T3 = file size A / hash value generation rate R1 Equation 5

上記式５のハッシュ値生成レートＲ１は、例えば、図２に示すハッシュ値生成レートテーブルのデータを下記式６に代入して算出することができる。 The hash value generation rate R1 of the above formula 5 can be calculated by substituting the data of the hash value generation rate table shown in FIG.

ハッシュ値生成レートＲ１＝ Σ（ファイルサイズ／ハッシュ値生成時間 × 重み付け）／ Σ（重み付け） … 式６ Hash value generation rate R1 = Σ (file size / hash value generation time × weighting) / Σ (weighting) Equation 6

上記式３のハッシュ値転送時間Ｔ４は、転送元装置１０から転送先装置２０にハッシュ値を転送するのに要する時間であり、例えば、下記式７を用いて算出することができる。 The hash value transfer time T4 in Equation 3 above is the time required to transfer the hash value from the transfer source device 10 to the transfer destination device 20, and can be calculated using the following Equation 7, for example.

ハッシュ値転送時間Ｔ４＝分割ブロック数Ｎ × ハッシュ値サイズＳ／転送レートＲ３ … 式７ Hash value transfer time T4 = number of divided blocks N × hash value size S / transfer rate R3 Expression 7

上記式３のハッシュ値生成時間Ｔ５は、転送先装置２０でハッシュ値を生成するのに要する時間であり、例えば、下記式８を用いて算出することができる。 The hash value generation time T5 of the above formula 3 is a time required for generating the hash value in the transfer destination device 20, and can be calculated using the following formula 8, for example.

ハッシュ値生成時間Ｔ５＝ファイルサイズＢ／ハッシュ値生成レートＲ２ … 式８ Hash value generation time T5 = file size B / hash value generation rate R2 Equation 8

上記式８のファイルサイズＢは、転送先のファイル２４１のファイルサイズである。上記式８のハッシュ値生成レートＲ２は、図５に示すハッシュ値生成レートテーブルのデータを下記式９に代入して算出することができる。 The file size B in Expression 8 is the file size of the transfer destination file 241. The hash value generation rate R2 of Equation 8 can be calculated by substituting the data of the hash value generation rate table shown in FIG.

ハッシュ値生成レートＲ２＝ Σ（ファイルサイズ／ハッシュ値生成時間 × 重み付け）／ Σ（重み付け） … 式９ Hash value generation rate R2 = Σ (file size / hash value generation time × weighting) / Σ (weighting) Equation 9

上記式３のメッセージ送信時間Ｔ６は、転送処理の際に転送元装置１０と転送先装置２０との間でやり取りされる各種メッセージの送信に要する時間であり、例えば、下記式１０を用いて算出することができる。 The message transmission time T6 in the above equation 3 is the time required for transmitting various messages exchanged between the transfer source device 10 and the transfer destination device 20 during the transfer process, and is calculated using, for example, the following equation 10. can do.

メッセージ送信時間Ｔ６＝メッセージのデータサイズＣ／転送レートＲ３ … 式１０ Message transmission time T6 = message data size C / transfer rate R3 Equation 10

各種メッセージとしては、例えば、ハッシュ値取得要求や、不一致となったハッシュ値に対応するブロック番号を含む応答メッセージ等が該当する。 Examples of the various messages include a hash value acquisition request, a response message including a block number corresponding to a hash value that does not match.

次に、実施形態におけるファイル同期システムの動作について説明する。図６は、ファイル同期システムにおけるファイル転送の処理手順を説明するためのフローチャートである。 Next, the operation of the file synchronization system in the embodiment will be described. FIG. 6 is a flowchart for explaining a file transfer processing procedure in the file synchronization system.

最初に、転送元装置１０の転送選択部１５は、ファイル同期の対象となる転送元ファイル１４１のファイルサイズＡを取得し（ステップＳ１０１）、ファイルサイズＡが、分割ブロック数Ｎとハッシュ値サイズＳとを乗算した値よりも大きいか否かを判定する（ステップＳ１０２）。この判定がＮＯである場合（ステップＳ１０２；ＮＯ）に、転送元装置１０の転送選択部１５は、通常転送を選択して本処理手順を終了する。 First, the transfer selection unit 15 of the transfer source device 10 acquires the file size A of the transfer source file 141 that is the target of file synchronization (step S101), and the file size A includes the number of divided blocks N and the hash value size S. It is determined whether or not the value is greater than the value obtained by multiplying (step S102). When this determination is NO (step S102; NO), the transfer selection unit 15 of the transfer source device 10 selects normal transfer and ends this processing procedure.

一方、上記ステップＳ１０２の判定でファイルサイズＡが分割ブロック数Ｎとハッシュ値サイズＳとを乗算した値よりも大きいと判定した場合（ステップＳ１０２；ＹＥＳ）に、転送元装置１０の転送選択部１５は、転送元ファイル１４１のファイル名を転送先装置２０に送信する（ステップＳ１０３）。 On the other hand, when it is determined in step S102 that the file size A is larger than the value obtained by multiplying the divided block number N and the hash value size S (step S102; YES), the transfer selection unit 15 of the transfer source device 10 is determined. Transmits the file name of the transfer source file 141 to the transfer destination device 20 (step S103).

続いて、転送先装置２０は、転送元ファイル１４１のファイル名に基づいて、転送元ファイル１４１と同期をとるファイルに設定されている転送先ファイル２４１を特定し（ステップＳ１０４）、転送先ファイル２４１のファイルサイズＢを転送元装置１０に送信する（ステップＳ１０５）。 Subsequently, based on the file name of the transfer source file 141, the transfer destination device 20 identifies the transfer destination file 241 set as a file synchronized with the transfer source file 141 (step S104), and the transfer destination file 241. File size B is transmitted to the transfer source device 10 (step S105).

続いて、転送元装置１０の転送選択部１５は、通常転送処理時間が差分転送処理時間よりも長いか否かを判定する（ステップＳ１０６）。この判定がＮＯである場合（ステップＳ１０６；ＮＯ）に、転送元装置１０の転送選択部１５は、通常転送を選択して本処理手順を終了する。 Subsequently, the transfer selection unit 15 of the transfer source device 10 determines whether or not the normal transfer processing time is longer than the differential transfer processing time (step S106). If this determination is NO (step S106; NO), the transfer selection unit 15 of the transfer source device 10 selects normal transfer and ends this processing procedure.

一方、上記ステップＳ１０６の判定で通常転送処理時間が差分転送処理時間よりも長いと判定した場合（ステップＳ１０６；ＹＥＳ）に、転送元装置１０の転送選択部１５は、差分転送を選択し、転送元装置１０および転送先装置２０は、差分転送を実行する（ステップＳ１０７）。差分転送の処理手順について、以下に説明する。 On the other hand, if it is determined in step S106 that the normal transfer processing time is longer than the difference transfer processing time (step S106; YES), the transfer selection unit 15 of the transfer source device 10 selects the difference transfer and transfers the transfer. The source device 10 and the transfer destination device 20 execute differential transfer (step S107). The difference transfer processing procedure will be described below.

図７は、図６のステップＳ１０７で実行される差分転送の処理手順を詳細に説明するためのフローチャートである。 FIG. 7 is a flowchart for explaining in detail the difference transfer processing procedure executed in step S107 of FIG.

最初に、転送元装置１０の実データ分割部１１は、ファイル同期の対象となる転送元のファイル１４１の実データを分割ブロック数Ｎ分のブロックに分割する（ステップＳ２０１）。 First, the actual data dividing unit 11 of the transfer source device 10 divides the actual data of the transfer source file 141 to be subjected to file synchronization into blocks corresponding to the number of divided blocks N (step S201).

続いて、転送元装置１０のハッシュ値生成部１２は、上記ステップＳ２０１でＮ個に分割された各ブロックに含まれる実データをそれぞれハッシュ関数に入力してハッシュ値を生成し、ハッシュ値の生成に要した時間を積算する（ステップＳ２０２）。 Subsequently, the hash value generation unit 12 of the transfer source device 10 generates the hash value by inputting the actual data included in each block divided into N pieces in step S201 to the hash function, and generates the hash value. Is accumulated (step S202).

続いて、転送元装置１０の送信部１３は、上記ステップＳ２０２で生成された各ハッシュ値、転送元ファイル１４１のファイル名、ファイルサイズＡ、分割ブロック数Ｎ、分割ブロックサイズＤおよびハッシュ値サイズＳを含むハッシュ値取得要求を転送先装置２０に送信する（ステップＳ２０３）。 Subsequently, the transmission unit 13 of the transfer source device 10 transmits each hash value generated in step S202, the file name of the transfer source file 141, the file size A, the number of divided blocks N, the divided block size D, and the hash value size S. Is sent to the transfer destination device 20 (step S203).

続いて、転送先装置２０の実データ分割部２１は、ハッシュ値取得要求に基づいて、転送先のファイル２４１の実データを、分割ブロックサイズＤと同サイズのブロックに分割する（ステップＳ２０４）。 Subsequently, the actual data dividing unit 21 of the transfer destination device 20 divides the actual data of the transfer destination file 241 into blocks having the same size as the divided block size D based on the hash value acquisition request (step S204).

続いて、転送先装置２０のハッシュ値生成部２２は、上記ステップＳ２０４で分割された各ブロックに含まれる実データをそれぞれハッシュ関数に入力してハッシュ値を生成し、ハッシュ値の生成に要した時間を積算する（ステップＳ２０５）。 Subsequently, the hash value generation unit 22 of the transfer destination device 20 generates the hash value by inputting the actual data included in each block divided in step S204 to the hash function, and is required for generation of the hash value. The time is integrated (step S205).

続いて、転送先装置２０の送信部２３は、上記ステップＳ２０２で生成されたハッシュ値と上記ステップＳ２０５で生成されたハッシュ値とを、同じブロック番号ごとにそれぞれ比較し（ステップＳ２０６）、比較対象のハッシュ値が不一致となるブロック番号を特定する（ステップＳ２０７）。送信部２３は、不一致であると特定したブロック番号を含む応答メッセージを転送元装置１０に送信する（ステップＳ２０８）。 Subsequently, the transmission unit 23 of the transfer destination apparatus 20 compares the hash value generated in step S202 and the hash value generated in step S205 for each same block number (step S206), and the comparison target The block number that does not match the hash value is identified (step S207). The transmission unit 23 transmits a response message including the block number identified as inconsistent to the transfer source device 10 (step S208).

続いて、転送元装置１０の送信部１３は、転送先装置２０から受信したブロック番号に対応するブロックに含まれる実データを転送先装置２０に送信し（ステップＳ２０９）、実データの転送時間を積算する（ステップＳ２１０）。 Subsequently, the transmission unit 13 of the transfer source device 10 transmits the actual data included in the block corresponding to the block number received from the transfer destination device 20 to the transfer destination device 20 (step S209), and sets the transfer time of the actual data. Integration is performed (step S210).

上述したように、実施形態におけるファイル同期システム１によれば、転送元装置１０は、更新後ファイルを管理することで差分転送を行うことができる。これにより、更新前ファイルや差分情報を管理するための記憶領域を削減することができ、更新前ファイルや差分情報のメンテナンス作業を省くことができる。それゆえに、ファイル同期の効率を向上させることができる。 As described above, according to the file synchronization system 1 in the embodiment, the transfer source device 10 can perform differential transfer by managing the updated file. Thereby, the storage area for managing the pre-update file and the difference information can be reduced, and the maintenance work for the pre-update file and the difference information can be omitted. Therefore, the efficiency of file synchronization can be improved.

このような効果は、ファイル同期の対象となるファイルが、固定長のレコードを格納するファイルやデータ追記型のファイルである場合に、特に有効となる。例えば、固定長のレコードを格納するファイルで特定のレコードが更新された場合には、更新されたレコードを含むブロックの実データのみを差分転送すればよい。データ追記型のファイルで実データが追記された場合には、追記された実データを含むブロックの実データのみを差分転送すればよい。これにより、同期処理のターンアラウンドタイムを大幅に短縮することができる。このような効果は、ファイルの容量が大きくなるほど増大する。 Such an effect is particularly effective when the file to be subjected to file synchronization is a file that stores a fixed-length record or a data write-once file. For example, when a specific record is updated in a file that stores a fixed-length record, only the actual data of the block including the updated record may be differentially transferred. When actual data is appended in a data appending type file, only the actual data of the block including the appended actual data need be differentially transferred. Thereby, the turnaround time of the synchronization process can be greatly shortened. Such an effect increases as the file capacity increases.

なお、上述した各実施形態は、単なる例示に過ぎず、実施形態に明示していない種々の変形や技術の適用を排除するものではない。すなわち、本発明は、その趣旨を逸脱しない範囲で様々な形態に変形して実施することができる。 Note that each of the above-described embodiments is merely an example, and does not exclude various modifications and technical applications that are not explicitly described in the embodiment. That is, the present invention can be implemented by being modified into various forms without departing from the spirit of the present invention.

例えば、上述した実施形態では、ハッシュ値生成レートや転送レートを算出する際に、過去の複数の転送処理に対して重み付けを付与しているが、例えば、最新の転送処理にのみ重み付けを付与して、他の転送処理には重み付けを付与しないこととしてもよい。これにより、ＣＰＵの使用状況やネットワークの使用状況を機敏に反映させたレートを算出することが可能となる。また、例えば、過去の複数の転送処理に対して均等に重み付けを付与することとしてもよい。これにより、ＣＰＵの使用状況やネットワークの使用状況が急激に変化するような不安定な環境下では、最新の転送処理の状況にのみ左右されることなく、過去の複数の転送処理の状況を同等に考慮したレートを算出することが可能となる。 For example, in the above-described embodiment, when calculating the hash value generation rate and the transfer rate, weights are assigned to a plurality of past transfer processes. For example, only the latest transfer processes are weighted. Thus, no weighting may be given to other transfer processes. As a result, it is possible to calculate a rate that quickly reflects the usage status of the CPU and the usage status of the network. Further, for example, weighting may be equally applied to a plurality of past transfer processes. As a result, in an unstable environment where CPU usage and network usage change suddenly, the status of multiple past transfer processes is the same, regardless of only the latest transfer process status. It is possible to calculate a rate in consideration of

上記の実施形態の一部または全部は、以下の付記のようにも記載され得るが、本発明を以下に限定するものではない。 A part or all of the above embodiments can be described as in the following supplementary notes, but the present invention is not limited to the following.

（付記１）ファイル同期の対象となるファイルをそれぞれ有する転送元装置と転送先装置とを備えるファイル同期システムであって、前記転送元装置は、ファイル同期の対象となる転送元の前記ファイルのデータを複数のブロックに分割する第１の分割部と、前記第１の分割部によって分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第１の生成部と、前記第１の生成部によって生成されたハッシュ値を前記転送先装置に送信する第１の送信部と、を有し、前記転送先装置は、ファイル同期の対象となる転送先の前記ファイルのデータを、前記第１の分割部によって分割された各ブロックと同サイズのブロックに分割する第２の分割部と、前記第２の分割部によって分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第２の生成部と、前記第１の生成部によって生成されたハッシュ値と前記第２の生成部によって生成されたハッシュ値とを、各ハッシュ値の生成元データを含むブロックを識別するブロック番号ごとにそれぞれ比較し、不一致となったハッシュ値に対応する前記ブロック番号を前記転送元装置に送信する第２の送信部と、を有し、前記転送元装置の前記第１の送信部は、前記転送先装置から受信した前記ブロック番号に対応する前記ブロックに含まれるデータを前記転送先装置に送信する、ことを特徴とするファイル同期システム。 (Supplementary note 1) A file synchronization system including a transfer source device and a transfer destination device each having a file to be synchronized with the file, wherein the transfer source device has data of the file at the transfer source to be synchronized with the file A first division unit that divides the block into a plurality of blocks, a first generation unit that generates a hash value from data included in each block divided by the first division unit, and the first generation unit A first transmission unit that transmits the hash value generated by the transfer destination device to the transfer destination device, wherein the transfer destination device stores the data of the file at the transfer destination that is a target of file synchronization as the first transmission unit. A second dividing unit that divides the blocks divided by the dividing unit into blocks of the same size, and data included in each block divided by the second dividing unit; A second generation unit that generates a hash value, a hash value generated by the first generation unit, and a hash value generated by the second generation unit, including generation data of each hash value A second transmission unit that compares each block number for identifying a block and transmits the block number corresponding to the hash value that does not match to the transfer source device, and the second number of the transfer source device 1. The file synchronization system according to claim 1, wherein the transmission unit 1 transmits data included in the block corresponding to the block number received from the transfer destination device to the transfer destination device.

（付記２）前記ファイルは、固定長のレコードを格納するファイルであることを特徴とする付記１記載のファイル同期システム。 (Supplementary note 2) The file synchronization system according to supplementary note 1, wherein the file is a file storing a fixed-length record.

（付記３）前記ファイルは、追記型のファイルであることを特徴とする付記１または２記載のファイル同期システム。 (Supplementary Note 3) The file synchronization system according to Supplementary Note 1 or 2, wherein the file is a write-once file.

（付記４）ファイル同期の対象となるファイルをそれぞれ有する転送元装置と転送先装置とを備えるファイル同期システムで実行するファイル同期方法であって、前記転送元装置が、ファイル同期の対象となる転送元の前記ファイルのデータを複数のブロックに分割する第１の分割ステップと、前記第１の分割ステップにおいて分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第１の生成ステップと、前記第１の生成ステップにおいて生成されたハッシュ値を前記転送先装置に送信する第１の送信ステップと、を実行し、前記転送先装置が、ファイル同期の対象となる転送先の前記ファイルのデータを、前記第１の分割ステップにおいて分割された各ブロックと同サイズのブロックに分割する第２の分割ステップと、前記第２の分割ステップにおいて分割された各ブロックに含まれるデータからそれぞれハッシュ値を生成する第２の生成ステップと、前記第１の生成ステップにおいて生成されたハッシュ値と前記第２の生成ステップにおいて生成されたハッシュ値とを、各ハッシュ値の生成元データを含むブロックを識別するブロック番号ごとにそれぞれ比較し、不一致となったハッシュ値に対応する前記ブロック番号を前記転送元装置に送信する第２の送信ステップと、を実行し、前記転送元装置の前記第１の送信ステップが、前記転送先装置から受信した前記ブロック番号に対応する前記ブロックに含まれるデータを前記転送先装置に送信する、ことを特徴とするファイル同期方法。 (Supplementary note 4) A file synchronization method executed in a file synchronization system including a transfer source device and a transfer destination device each having a file to be synchronized with the file, wherein the transfer source device is a file synchronization target A first dividing step of dividing the original data of the file into a plurality of blocks; a first generating step of generating a hash value from the data included in each block divided in the first dividing step; A first transmission step of transmitting the hash value generated in the first generation step to the transfer destination device, wherein the transfer destination device is the data of the file at the transfer destination that is the target of file synchronization. A second dividing step of dividing the block into blocks of the same size as the blocks divided in the first dividing step; In the second generation step of generating a hash value from the data included in each block divided in the second division step, the hash value generated in the first generation step, and the second generation step, The generated hash value is compared for each block number identifying the block including the generation source data of each hash value, and the block number corresponding to the hash value that does not match is transmitted to the transfer source device. 2, and the first transmission step of the transfer source device transmits data included in the block corresponding to the block number received from the transfer destination device to the transfer destination device. A file synchronization method characterized by that.

（付記５）付記４に記載の各ステップをコンピュータに実行させるためのファイル同期プログラム。 (Supplementary Note 5) A file synchronization program for causing a computer to execute each step described in Supplementary Note 4.

１…ファイル同期システム、１０…転送元装置、１１…実データ分割部、１２…ハッシュ値生成部、１３…送信部、１４…記憶部、１５…転送選択部、２０…転送先装置、２１…実データ分割部、２２…ハッシュ値生成部、２３…送信部、２４…記憶部、３０…ネットワーク、１４１、２４１…ファイル。 DESCRIPTION OF SYMBOLS 1 ... File synchronization system, 10 ... Transfer origin apparatus, 11 ... Real data division part, 12 ... Hash value generation part, 13 ... Transmission part, 14 ... Memory | storage part, 15 ... Transfer selection part, 20 ... Transfer destination apparatus, 21 ... Real data division unit, 22 ... hash value generation unit, 23 ... transmission unit, 24 ... storage unit, 30 ... network, 141, 241 ... file.

Claims

A file synchronization system comprising a transfer source device and a transfer destination device each having a file to be synchronized with a file,
The transfer source device
A first dividing unit that divides the data of the file that is the target of file synchronization into a plurality of blocks;
A first generator that generates a hash value from data included in each block divided by the first divider;
A first transmission unit that transmits the hash value generated by the first generation unit to the transfer destination device,
The transfer destination device is:
A second division unit that divides the data of the transfer destination file to be subjected to file synchronization into blocks of the same size as the blocks divided by the first division unit;
A second generation unit that generates a hash value from data included in each block divided by the second division unit;
The hash value generated by the first generation unit and the hash value generated by the second generation unit are respectively compared for each block number that identifies the block including the generation source data of each hash value. A second transmitter that transmits the block number corresponding to the hash value to the transfer source device,
The first transmission unit of the transfer source device transmits data included in the block corresponding to the block number received from the transfer destination device to the transfer destination device;
A file synchronization system characterized by that.

The file synchronization system according to claim 1, wherein the file is a file storing a fixed-length record.

3. The file synchronization system according to claim 1, wherein the file is a write-once file.

A file synchronization method executed by a file synchronization system including a transfer source device and a transfer destination device each having a file to be synchronized with a file,
The transfer source device is
A first dividing step of dividing the data of the file of the transfer source subject to file synchronization into a plurality of blocks;
A first generation step of generating a hash value from data included in each block divided in the first division step;
Performing a first transmission step of transmitting the hash value generated in the first generation step to the transfer destination device;
The destination device is
A second division step of dividing the data of the file of the transfer destination to be subjected to file synchronization into blocks having the same size as the blocks divided in the first division step;
A second generation step of generating a hash value from data included in each block divided in the second division step;
The hash value generated in the first generation step and the hash value generated in the second generation step are respectively compared for each block number that identifies the block including the generation source data of each hash value. A second transmission step of transmitting the block number corresponding to the hash value to the transfer source device,
The first transmission step of the transfer source device transmits data included in the block corresponding to the block number received from the transfer destination device to the transfer destination device;
A file synchronization method characterized by the above.

A file synchronization program for causing a computer to execute each step according to claim 4.