JP5966184B2

JP5966184B2 - Management system

Info

Publication number: JP5966184B2
Application number: JP2012200627A
Authority: JP
Inventors: 克典宮原; 真依岩野; 和也櫛山
Original assignee: Nippon Telegraph and Telephone West Corp
Current assignee: Nippon Telegraph and Telephone West Corp
Priority date: 2012-09-12
Filing date: 2012-09-12
Publication date: 2016-08-10
Anticipated expiration: 2032-09-12
Also published as: JP2014056407A

Description

本発明は、ファイルを複数のデータに分割して管理する技術に関する。 The present invention relates to a technique for managing a file by dividing it into a plurality of data.

近年、データのバックアップ方法の１つとして、重複排除技術が提案されている（特許文献１参照）。重複排除技術とは、ストレージ内にあるブロックデータの重複を検出し、重複しているブロックデータの一方を排除することでストレージ容量を効率的に利用できる技術である。
このような技術を用いることによって、データセンタは、ストレージ内に同一の情報を格納しているブロックデータが蓄積されることを無くすことを可能にする。さらに、データセンタは、ストレージ内に記憶するデータ量を削減できる。 In recent years, a deduplication technique has been proposed as one of data backup methods (see Patent Document 1). The deduplication technology is a technology that can efficiently use storage capacity by detecting duplication of block data in the storage and eliminating one of the duplicated block data.
By using such a technique, the data center can eliminate the accumulation of block data storing the same information in the storage. Furthermore, the data center can reduce the amount of data stored in the storage.

データセンタは、ネットワークを介して他のデータセンタと連携することで、データセンタ間でのブロックデータの重複排除を行う（以下、「グローバル重複排除処理」という。）。複数のデータセンタは、グローバル重複排除処理を行うことによって、データセンタ間でブロックデータの重複を排除し、データセンタ全体でストレージを効率的に利用することを可能にする。
このようなデータセンタが実現されている場合、ユーザはリストアを行うことによって所望のデータを取得する。具体的には、ユーザは、所望のファイルを復元するのに必要となるブロックデータを各データセンタから取得し、取得されたブロックデータに対してリストア処理を行うことによって所望のファイルを復元する。 Data centers perform deduplication of block data between data centers by cooperating with other data centers via a network (hereinafter referred to as “global deduplication processing”). A plurality of data centers perform global deduplication processing, thereby eliminating duplication of block data between data centers and making it possible to efficiently use storage throughout the data centers.
When such a data center is realized, the user acquires desired data by performing restoration. Specifically, the user obtains block data necessary for restoring a desired file from each data center, and restores the desired file by performing a restoration process on the obtained block data.

特開２００９−２０５２０１号公報JP 2009-205201 A

しかしながら、グローバル重複排除処理を行う場合、重複排除されたブロックデータは、データセンタ全体で唯一のブロックデータとなってしまう。ユーザは、リストアを行う際に、ファイルをリストアするのに必要なブロックデータを保持しているデータセンタに、必要なブロックデータを取得しに行かなければならない。そのため、リストア時のブロックデータ転送にかかる遅延の影響によってリストア時間が大幅にかかってしまうという問題があった。 However, when global deduplication processing is performed, the deduplicated block data becomes the only block data in the entire data center. When performing the restore, the user must go to the data center that holds the block data necessary for restoring the file to obtain the necessary block data. For this reason, there is a problem that the restoration time is significantly increased due to the influence of the delay in the block data transfer at the time of restoration.

上記事情に鑑み、本発明は、リストア時間を短縮させる技術の提供を目的としている。 In view of the above circumstances, an object of the present invention is to provide a technique for shortening the restoration time.

本発明の一態様は、複数のブロックデータで構成される１つのデータを、要求に応じて前記複数のブロックデータを送信することによって送信先において前記１つのデータを復元可能であり、前記ブロックデータは、他のブロックデータの一部である部分ブロックデータを複数組み合わせることによって生成可能であり、複数の地点にそれぞれ設置され１以上の前記ブロックデータを記憶する複数のデータセンタから、ユーザが要求するデータを生成するために必要となる前記ブロックデータ又は前記部分ブロックデータを送信する通信システムにおける管理システムであって、前記ブロックデータ毎に、前記ブロックデータを生成可能な前記複数の部分ブロックデータの組み合わせを記憶するブロックリスト記憶部と、前記ブロックデータ毎に、前記ブロックデータを前記複数の部分ブロックデータを組み合わせることによって生成する際に要する処理時間を、前記ブロックリスト記憶部に記憶される前記組み合わせ毎に算出する処理時間算出部と、前記データ毎に、前記データを構成する前記ブロックデータを前記データセンタから受信するのに要する処理時間と、前記処理時間算出部によって算出された処理時間と、に基づいて、前記ブロックデータ毎の取得方法を決定する決定部と、を備える管理システムである。 According to one embodiment of the present invention, one piece of data composed of a plurality of pieces of block data can be restored at a transmission destination by transmitting the plurality of pieces of block data in response to a request. Can be generated by combining a plurality of partial block data, which is a part of other block data, and is requested by a user from a plurality of data centers respectively installed at a plurality of points and storing one or more block data. A management system in a communication system for transmitting the block data or the partial block data necessary for generating data, and a combination of the plurality of partial block data capable of generating the block data for each block data A block list storage unit for storing the block data and the block data In addition, a processing time calculation unit that calculates a processing time required for generating the block data by combining the plurality of partial block data for each combination stored in the block list storage unit, and for each data The acquisition method for each block data is determined based on the processing time required to receive the block data constituting the data from the data center and the processing time calculated by the processing time calculation unit. And a determination unit.

本発明の一態様は、上記の管理システムであって、前記処理時間算出部は、前記ブロックデータを生成するために必要となる複数の前記部分ブロックデータを受信するのに要する通信時間と、前記複数の部分ブロックデータを組み合わせることによって前記ブロックデータを生成するのに要する生成処理時間と、の和に基づいて前記処理時間を算出する。
One aspect of the present invention is the above-described management system, the processing time calculation unit, a communication time required to receive a plurality of the partial block data required to generate the block data, the The processing time is calculated based on the sum of the generation processing time required to generate the block data by combining a plurality of partial block data.

本発明の一態様は、上記の管理システムであって、前記決定部は、前記ブロックデータ毎に、前記ブロックデータを前記データセンタから受信するのに要する処理時間と、前記処理時間算出部によって算出された各処理時間と、を比較して最も短い処理時間における取得方法を候補として選択する候補選択部と、前記データを構成する複数のブロックデータの各候補の前記処理時間のうち最長の処理時間を最長時間として決定し、前記最長時間の候補を最長候補として決定する最長時間決定部と、前記最長候補以外の候補毎に、前記処理時間が前記最長時間を超えない範囲で、前記生成処理時間が最短となる前記取得方法を最終候補として決定する最終候補決定部と、前記最長候補及び前記最終候補の組み合わせを、前記データを取得する際の処理方法として出力する出力部と、を備える。 One aspect of the present invention is the management system described above, wherein the determination unit calculates, for each block data, a processing time required to receive the block data from the data center and the processing time calculation unit. A candidate selection unit that compares the obtained processing times with each other and selects an acquisition method in the shortest processing time as a candidate, and the longest processing time among the processing times of the candidates of the plurality of block data constituting the data A longest time determination unit for determining the longest time candidate as the longest candidate, and for each candidate other than the longest candidate, the generation processing time within a range not exceeding the longest time A final candidate determination unit that determines the acquisition method that is the shortest as a final candidate, and a combination of the longest candidate and the final candidate when acquiring the data And an output unit for outputting as the processing method, the.

本発明により、リストア時間を短縮させることが可能となる。 According to the present invention, the restoration time can be shortened.

ファイルを構成するデータの関係性を表した図である。It is a figure showing the relationship of the data which comprise a file. 本実施形態における管理システムのシステム構成を示す図である。It is a figure which shows the system configuration | structure of the management system in this embodiment. データセンタ１０及び中央管理装置２０の機能構成を表す概略ブロック図である。2 is a schematic block diagram showing functional configurations of a data center 10 and a central management device 20. FIG. 部分ブロックリスト及びブロック配置リストの具体例を示す図である。It is a figure which shows the specific example of a partial block list and a block arrangement list. ブロックデータ“Ａ”に関するブロックデータリストの構成図である。It is a block diagram of the block data list regarding the block data “A”. ファイル１を構成するブロックデータ毎のブロックデータリストを表す図である。FIG. 4 is a diagram illustrating a block data list for each block data constituting the file 1. ファイル１を構成するブロックデータ毎のブロックデータリストを表す図である。FIG. 4 is a diagram illustrating a block data list for each block data constituting the file 1. 本実施形態のインデックスリスト作成の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of index list creation of this embodiment. 本実施形態のインデックスリスト作成の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of index list creation of this embodiment. 本実施形態のリストアの処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a restore process of this embodiment.

以下、本発明の一実施形態を、図面を参照しながら説明する。
図１は、ファイルを構成するデータの関係性を表した図である。図１（Ａ）は、ユーザからバックアップされたファイル１の構成例を表す図である。ファイル１は、複数のブロックデータで構成される。ブロックデータとは、ファイルをリストアするのに必要となるデータである。図１（Ａ）では、例えば、ファイル１を構成するブロックデータは、“Ａ”、“Ｂ”、“Ｃ”である。
図１（Ｂ）は、ブロックデータの構成例を表す図である。ブロックデータは、複数の部分ブロックデータを用いて生成することができる。部分ブロックデータとは、ブロックデータに対して分割処理が行われることによって生成されるデータである。言い換えれば、部分ブロックデータとは、ブロックデータの一部分を切り出して構成されるデータである。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.
FIG. 1 is a diagram showing the relationship of data constituting a file. FIG. 1A is a diagram illustrating a configuration example of a file 1 backed up by a user. File 1 is composed of a plurality of block data. Block data is data required to restore a file. In FIG. 1A, for example, the block data constituting the file 1 are “A”, “B”, and “C”.
FIG. 1B is a diagram illustrating a configuration example of block data. Block data can be generated using a plurality of partial block data. Partial block data is data generated by performing a division process on block data. In other words, the partial block data is data configured by cutting out a part of the block data.

図１（Ｂ）の例では、ブロックデータ“Ｅ”は、“１１０１１１”の２進数で表される。また、ブロックデータ“Ｆ”は、“００１１０１”の２進数で表される。ブロックデータ“Ｅ”は、分割処理が行われることによって部分ブロックデータ“Ｅ１”、部分ブロックデータ“Ｅ２”に分割される。部分ブロックデータ“Ｅ１”は、“１１０”の２進数で表される。部分ブロックデータ“Ｅ２”は、“１１１”の２進数で表される。
ブロックデータ“Ｆ”は、分割処理が行われることによって部分ブロックデータ“Ｆ１”、部分ブロックデータ“Ｆ２”に分割される。部分ブロックデータ“Ｆ１”は、“００１”の２進数で表される。部分ブロックデータ“Ｆ２”は、“１０１”の２進数で表される。 In the example of FIG. 1B, the block data “E” is represented by a binary number “110111”. The block data “F” is represented by a binary number “001101”. The block data “E” is divided into partial block data “E1” and partial block data “E2” by performing a division process. The partial block data “E1” is represented by a binary number “110”. The partial block data “E2” is represented by a binary number “111”.
The block data “F” is divided into partial block data “F1” and partial block data “F2” by performing a division process. The partial block data “F1” is represented by a binary number “001”. The partial block data “F2” is represented by a binary number “101”.

ブロックデータ“Ａ”は、“１１０１０１”の２進数で表される。ブロックデータ“Ａ”は、分割処理が行われることによって部分ブロックデータ“Ａ１”、部分ブロックデータ“Ａ２”に分割される。部分ブロックデータ“Ａ１”は、“１１０”の２進数で表される。部分ブロックデータ“Ａ２”は、“１０１”の２進数で表される。部分ブロックデータ“Ａ１”と部分ブロックデータ“Ｅ１”とは、同じ２進数で表される。また、部分ブロックデータ“Ａ２”と部分ブロックデータ“Ｆ２”とは、同じ２進数で表される。すなわち、図１（Ｂ）に示すように、ブロックデータ“Ａ”は、部分ブロックデータ“Ｅ１”と部分ブロックデータ“Ｆ２”との組み合わせにより生成することが可能である。 The block data “A” is represented by a binary number “110101”. The block data “A” is divided into partial block data “A1” and partial block data “A2” by performing the division process. The partial block data “A1” is represented by a binary number “110”. The partial block data “A2” is represented by a binary number “101”. The partial block data “A1” and the partial block data “E1” are represented by the same binary number. Further, the partial block data “A2” and the partial block data “F2” are represented by the same binary number. That is, as shown in FIG. 1B, the block data “A” can be generated by a combination of the partial block data “E1” and the partial block data “F2”.

図２は、本実施形態における管理システムのシステム構成を示す図である。本実施形態の管理システムは、複数のデータセンタ１０、中央管理装置２０、複数のＰＣ３０及びネットワーク４０を備える。
データセンタ１０は、ネットワーク４０を介してＰＣ３０からバックアップされたファイル（以下、「バックアップファイル」という。）を記憶している。
中央管理装置２０は、情報処理装置を用いて構成される。中央管理装置２０は、各データセンタ１０から転送されるブロックデータを管理する。 FIG. 2 is a diagram showing a system configuration of the management system in the present embodiment. The management system of this embodiment includes a plurality of data centers 10, a central management device 20, a plurality of PCs 30, and a network 40.
The data center 10 stores a file backed up from the PC 30 via the network 40 (hereinafter referred to as “backup file”).
The central management device 20 is configured using an information processing device. The central management device 20 manages block data transferred from each data center 10.

ＰＣ３０は、ネットワーク４０を介してデータセンタ１０にファイルをバックアップする。また、ＰＣ３０は、ネットワーク４０を介してデータセンタ１０からデータを受信し、ユーザが所望するファイルをリストアする。
ネットワーク４０は、どのように構成されたネットワークでも良い。例えば、インターネットを用いて構成されてもよい。 The PC 30 backs up files to the data center 10 via the network 40. Further, the PC 30 receives data from the data center 10 via the network 40 and restores a file desired by the user.
The network 40 may be a network configured in any way. For example, it may be configured using the Internet.

図３は、データセンタ１０及び中央管理装置２０の機能構成を表す概略ブロック図である。まず、データセンタ１０の機能構成を説明する。
データセンタ１０は、バスで接続されたＣＰＵ（Central Processing Unit）やメモリや補助記憶装置などを備え、データセンタプログラムを実行する。データセンタプログラムの実行によって、データセンタ１０は、通信部１０１、構成リスト作成部１０２、リストアデータ記憶部１０３、重複排除部１０４、ブロックデータ記憶部１０５、ブロックリスト記憶部１０６、処理時間算出部１０７、決定部１０８を備える装置として機能する。また、決定部１０８は、候補選択部１０９、最長時間決定部１１０、最終候補決定部１１１、出力部１１２、インデックスリスト記憶部１１３、部分ブロックデータ生成部１１４として機能する。なお、データセンタ１０の各機能の全て又は一部は、ＡＳＩＣ（Application Specific Integrated Circuit）やＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等のハードウェアを用いて実現されても良い。また、データセンタプログラムは、コンピュータ読み取り可能な記録媒体に記録されても良い。コンピュータ読み取り可能な記録媒体とは、例えばフレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置である。また、データセンタプログラムは、電気通信回線を介して送受信されても良い。 FIG. 3 is a schematic block diagram showing the functional configuration of the data center 10 and the central management device 20. First, the functional configuration of the data center 10 will be described.
The data center 10 includes a CPU (Central Processing Unit), a memory, an auxiliary storage device, and the like connected by a bus, and executes a data center program. By executing the data center program, the data center 10 has a communication unit 101, a configuration list creation unit 102, a restore data storage unit 103, a deduplication unit 104, a block data storage unit 105, a block list storage unit 106, and a processing time calculation unit 107. , Functions as a device including the determination unit 108. The determination unit 108 functions as a candidate selection unit 109, a longest time determination unit 110, a final candidate determination unit 111, an output unit 112, an index list storage unit 113, and a partial block data generation unit 114. All or some of the functions of the data center 10 may be realized using hardware such as an application specific integrated circuit (ASIC), a programmable logic device (PLD), and a field programmable gate array (FPGA). The data center program may be recorded on a computer-readable recording medium. The computer-readable recording medium is, for example, a portable medium such as a flexible disk, a magneto-optical disk, a ROM, a CD-ROM, or a storage device such as a hard disk built in the computer system. The data center program may be transmitted / received via a telecommunication line.

通信部１０１は、他のデータセンタ１０、中央管理装置２０及びＰＣ３０との間でデータの送受信を行う。
構成リスト作成部１０２は、リストアデータ構成リストを作成する。リストアデータ構成リストは、各バックアップファイルについて、復元するために必要となるブロックデータの情報が定義されているリストである。
リストアデータ記憶部１０３は、磁気ハードディスク装置や半導体記憶装置などの記憶装置を用いて構成される。例えば、半導体記憶装置は、ＳＳＤ（Solid State Drive）である。リストアデータ記憶部１０３は、構成リスト作成部１０２で作成されたリストアデータ構成リストを記憶している。 The communication unit 101 transmits / receives data to / from other data centers 10, the central management device 20, and the PC 30.
The configuration list creation unit 102 creates a restore data configuration list. The restore data configuration list is a list in which block data information necessary for restoration is defined for each backup file.
The restore data storage unit 103 is configured using a storage device such as a magnetic hard disk device or a semiconductor storage device. For example, the semiconductor memory device is an SSD (Solid State Drive). The restore data storage unit 103 stores the restore data configuration list created by the configuration list creation unit 102.

重複排除部１０４は、複数のデータセンタ１０間でグローバル重複排除処理を行う。重複排除部１０４は、グローバル重複排除処理を行うことによって他のデータセンタ１０に存在しないブロックデータを選択する。
ブロックデータ記憶部１０５は、磁気ハードディスク装置や半導体記憶装置などの記憶装置を用いて構成される。ブロックデータ記憶部１０５は、他のデータセンタ１０に存在しないブロックデータを記憶している。 The deduplication unit 104 performs global deduplication processing between the plurality of data centers 10. The deduplication unit 104 selects block data that does not exist in other data centers 10 by performing global deduplication processing.
The block data storage unit 105 is configured using a storage device such as a magnetic hard disk device or a semiconductor storage device. The block data storage unit 105 stores block data that does not exist in other data centers 10.

ブロックリスト記憶部１０６は、磁気ハードディスク装置や半導体記憶装置などの記憶装置を用いて構成される。例えば、半導体記憶装置は、ＳＳＤである。ブロックリスト記憶部１０６は、部分ブロックリスト及びブロック配置リストを記憶している。部分ブロックリストは、ブロックデータ毎にブロックデータを生成可能な部分ブロックデータの組み合わせ（以下、「部分ブロックグループ」という。）が記述されているリストである。なお、部分ブロックグループには、ブロックデータをデータセンタ１０から直接取得する部分ブロックグループが含まれてもよい。ブロック配置リストは、各部分ブロックデータに関する情報が記述されているリストである。例えば、ブロック配置リストには、部分ブロックデータのブロックサイズ、部分ブロックデータの配置情報（部分ブロックデータを記憶しているデータセンタ１０の所在地）が記述されても良い。 The block list storage unit 106 is configured using a storage device such as a magnetic hard disk device or a semiconductor storage device. For example, the semiconductor memory device is an SSD. The block list storage unit 106 stores a partial block list and a block arrangement list. The partial block list is a list in which combinations of partial block data capable of generating block data for each block data (hereinafter referred to as “partial block group”) are described. The partial block group may include a partial block group that directly acquires block data from the data center 10. The block arrangement list is a list in which information on each partial block data is described. For example, the block arrangement list may describe the block size of the partial block data and the arrangement information of the partial block data (location of the data center 10 storing the partial block data).

処理時間算出部１０７は、ブロックリスト記憶部１０６に記憶されている部分ブロックリスト及びブロック配置リストに基づいて、総リストア時間（処理時間）を算出する。総リストア時間は、ファイルのリストアを行うのに要する処理時間の合計時間である。処理時間算出部１０７は、部分ブロックリスト及びブロック配置リストを用いて、部分ブロックグループ毎に総リストア時間を算出する。 The processing time calculation unit 107 calculates the total restoration time (processing time) based on the partial block list and the block arrangement list stored in the block list storage unit 106. The total restore time is the total processing time required to restore a file. The processing time calculation unit 107 calculates the total restoration time for each partial block group using the partial block list and the block arrangement list.

決定部１０８は、ファイルを構成するブロックデータ毎に１つの部分ブロックグループを決定する。以下、決定部１０８の具体的な構成について説明する。
候補選択部１０９は、ファイルを構成するブロックデータ毎に候補部分ブロックグループを選択する。候補部分ブロックグループは、部分ブロックグループの中で総リストア時間が最も短い部分ブロックグループである。
最長時間決定部１１０は、候補選択部１０９が選択した候補部分ブロックグループの中から、総リストア時間が最長となる部分ブロックグループ（以下、「最長部分ブロックグループ」という。）を決定する。最長時間決定部１１０は、最長部分ブロックグループの総リストア時間を最長リストア時間として取得する。 The determination unit 108 determines one partial block group for each block data constituting the file. Hereinafter, a specific configuration of the determination unit 108 will be described.
The candidate selection unit 109 selects a candidate partial block group for each block data constituting the file. The candidate partial block group is a partial block group having the shortest total restoration time among the partial block groups.
The longest time determination unit 110 determines a partial block group (hereinafter referred to as “longest partial block group”) having the longest total restoration time from the candidate partial block groups selected by the candidate selection unit 109. The longest time determination unit 110 acquires the total restore time of the longest partial block group as the longest restore time.

最終候補決定部１１１は、まず、候補部分ブロックグループ毎に、ブロックデータをデータセンタ１０から直接取得する場合に要する総リストア時間（以下、「直接リストア時間」という。）を取得する。そして、最終候補決定部１１１は、最長部分ブロックグループ以外の候補部分ブロックグループ毎に、最長リストア時間と、直接リストア時間と、を比較する。 The final candidate determination unit 111 first acquires a total restore time (hereinafter referred to as “direct restore time”) required for acquiring block data directly from the data center 10 for each candidate partial block group. Then, the final candidate determination unit 111 compares the longest restore time with the direct restore time for each candidate partial block group other than the longest partial block group.

最長リストア時間より直接リストア時間の方が短い場合、最終候補決定部１１１は、ブロックデータをデータセンタ１０から直接取得する部分ブロックグループ（以下、「直接リストアグループ」という。）を、最終部分ブロックグループに決定する。
一方、最長リストア時間より直接リストア時間の方が長い場合、最終候補決定部１１１は、候補部分ブロックグループを最終部分ブロックグループに決定する。最終候補決定部１１１は、このような処理をブロックデータ毎に行うことによって、ブロックデータ毎の最終部分ブロックグループを決定する。 When the direct restore time is shorter than the longest restore time, the final candidate determination unit 111 sets a partial block group (hereinafter referred to as “direct restore group”) that directly acquires block data from the data center 10 as a final partial block group. To decide.
On the other hand, when the direct restore time is longer than the longest restore time, the final candidate determining unit 111 determines the candidate partial block group as the final partial block group. The final candidate determination unit 111 determines the final partial block group for each block data by performing such processing for each block data.

出力部１１２は、最長部分ブロックグループ及び最終部分ブロックグループに基づいて、ファイル毎にインデックスリストを生成する。インデックスリストは、総リストア時間が最短となる部分ブロックデータの組み合わせが記述されたリストである。出力部１１２は、生成したインデックスリストをインデックスリスト記憶部１１３に出力する。出力部１１２は、インデックスリストを生成する際に各部分ブロックデータの先頭ポインタ情報及びバイト数をインデックスリストに記録する。 The output unit 112 generates an index list for each file based on the longest partial block group and the final partial block group. The index list is a list in which combinations of partial block data that minimize the total restoration time are described. The output unit 112 outputs the generated index list to the index list storage unit 113. The output unit 112 records the head pointer information and the number of bytes of each partial block data in the index list when generating the index list.

インデックスリスト記憶部１１３は、磁気ハードディスク装置や半導体記憶装置などの記憶装置を用いて構成される。例えば、半導体記憶装置は、ＳＳＤである。インデックスリスト記憶部１１３は、出力部１１２が出力したインデックスリストを記憶する。インデックスリスト記憶部１１３は、出力部１１２がインデックスリストを出力する度に、インデックスリストを追加して記憶する。
部分ブロックデータ生成部１１４は、インデックスリスト記憶部１１３が記憶しているインデックスリストを参照することによって、ブロックデータの分割処理を行い、部分ブロックデータを生成する。例えば、部分ブロックデータ生成部１１４は、先頭ポインタ情報が指し示すアドレスから、インデックスリストに記録されているバイト数分のデータを、部分ブロックデータとして取得する。 The index list storage unit 113 is configured using a storage device such as a magnetic hard disk device or a semiconductor storage device. For example, the semiconductor memory device is an SSD. The index list storage unit 113 stores the index list output by the output unit 112. The index list storage unit 113 adds and stores an index list each time the output unit 112 outputs the index list.
The partial block data generation unit 114 performs block data division processing by referring to the index list stored in the index list storage unit 113, and generates partial block data. For example, the partial block data generation unit 114 acquires data corresponding to the number of bytes recorded in the index list as partial block data from the address indicated by the head pointer information.

次に、中央管理装置２０の機能構成を説明する。中央管理装置２０は、バスで接続されたＣＰＵやメモリや補助記憶装置などを備え、管理プログラムを実行する。管理プログラムの実行によって、中央管理装置２０は、通信部２０１、ブロックデータ記憶部２０２、分割処理部２０３、検出部２０４、更新部２０５を備える装置として機能する。なお、中央管理装置２０の各機能の全て又は一部は、ＡＳＩＣやＰＬＤやＦＰＧＡ等のハードウェアを用いて実現されても良い。また、管理プログラムは、コンピュータ読み取り可能な記録媒体に記録されても良い。コンピュータ読み取り可能な記録媒体とは、例えばフレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置である。また、管理プログラムは、電気通信回線を介して送受信されても良い。 Next, the functional configuration of the central management device 20 will be described. The central management device 20 includes a CPU, a memory, an auxiliary storage device, and the like connected by a bus, and executes a management program. By executing the management program, the central management device 20 functions as a device including a communication unit 201, a block data storage unit 202, a division processing unit 203, a detection unit 204, and an update unit 205. Note that all or some of the functions of the central management device 20 may be realized using hardware such as an ASIC, PLD, or FPGA. The management program may be recorded on a computer-readable recording medium. The computer-readable recording medium is, for example, a portable medium such as a flexible disk, a magneto-optical disk, a ROM, a CD-ROM, or a storage device such as a hard disk built in the computer system. Further, the management program may be transmitted / received via a telecommunication line.

通信部２０１は、複数のデータセンタ１０との間でのデータの送受信を行う。
ブロックデータ記憶部２０２は、磁気ハードディスク装置や半導体記憶装置などの記憶装置を用いて構成される。例えば、半導体記憶装置は、ＳＳＤである。ブロックデータ記憶部２０２は、通信部２０１が複数のデータセンタ１０から受信したブロックデータを記憶している。
分割処理部２０３は、ブロックデータ記憶部２０２に記憶されているブロックデータの分割処理を行う。分割処理は、１つのブロックデータを、複数の部分ブロックデータに細分化する処理である。分割処理部２０３は、ブロックデータを分割処理することによって部分ブロックデータを生成する。
検出部２０４は、生成された部分ブロックデータの重複を検出する。
更新部２０５は、検出部２０４による検出結果に応じて、部分ブロックリスト及びブロック配置リストを更新する。 The communication unit 201 transmits / receives data to / from a plurality of data centers 10.
The block data storage unit 202 is configured using a storage device such as a magnetic hard disk device or a semiconductor storage device. For example, the semiconductor memory device is an SSD. The block data storage unit 202 stores block data received from the plurality of data centers 10 by the communication unit 201.
The division processing unit 203 performs a division process on the block data stored in the block data storage unit 202. The division process is a process of subdividing one block data into a plurality of partial block data. The division processing unit 203 generates partial block data by dividing the block data.
The detection unit 204 detects duplication of the generated partial block data.
The update unit 205 updates the partial block list and the block arrangement list according to the detection result by the detection unit 204.

図４は、部分ブロックリスト及びブロック配置リストの具体例を示す図である。
図４（Ａ）は、部分ブロックリストの具体例を表す図である。部分ブロックリストは、レコード５０を複数有する。レコード５０は、ブロックデータ及び部分ブロックグループの値を有する。
ブロックデータの値は、ブロックデータの識別情報を表す。 FIG. 4 is a diagram illustrating specific examples of the partial block list and the block arrangement list.
FIG. 4A is a diagram illustrating a specific example of the partial block list. The partial block list has a plurality of records 50. The record 50 has block data and partial block group values.
The value of the block data represents identification information of the block data.

部分ブロックグループの値は、ブロックデータを生成可能な部分ブロックデータの識別情報を表す。なお、図４では部分ブロックグループに含まれる部分ブロックデータの数が２つであるが、１つであっても良いし３つ以上であっても良い。
図４（Ａ）では、例えば、ブロックデータ“Ａ”は、部分ブロックデータ“Ｅ１”と部分ブロックデータ“Ｆ２”との組み合わせによって生成可能である。また、ブロックデータ“Ａ”は、部分ブロックデータ“Ｄ１”と部分ブロックデータ“Ｊ２”との組み合わせによって生成可能である。 The value of the partial block group represents identification information of partial block data that can generate block data. In FIG. 4, the number of partial block data included in the partial block group is two, but may be one or three or more.
In FIG. 4A, for example, block data “A” can be generated by a combination of partial block data “E1” and partial block data “F2”. The block data “A” can be generated by a combination of the partial block data “D1” and the partial block data “J2”.

図４（Ｂ）は、ブロック配置リストの具体例を表す図である。ブロック配置リストは、レコード５１を複数有する。レコード５１は、部分ブロックデータ、ブロックサイズ、配置先の各値を有する。
部分ブロックデータの値は、部分ブロックデータの識別情報を表す。ブロックサイズは、部分ブロックデータ容量の大きさを表す。ブロックサイズは、例えば、単位Ｂ（バイト）で表されても良い。配置先は、部分ブロックデータの配置情報（部分ブロックデータを記憶しているデータセンタ１０の所在地）を表す。
図４（Ｂ）では、例えば、部分ブロックデータ“Ｅ１”は、大きさが１００ＭＢ（メガバイト）であり、“愛媛”のデータセンタ１０に記憶されている。 FIG. 4B is a diagram illustrating a specific example of the block arrangement list. The block arrangement list has a plurality of records 51. The record 51 has partial block data, block size, and placement destination values.
The value of the partial block data represents identification information of the partial block data. The block size represents the size of the partial block data capacity. The block size may be expressed in units of B (bytes), for example. The placement destination represents placement information of partial block data (location of the data center 10 storing the partial block data).
In FIG. 4B, for example, the partial block data “E1” has a size of 100 MB (megabytes) and is stored in the data center 10 of “Ehime”.

図５〜図７を用いて、インデックスリストを作成する具体的な方法について説明する。以下の説明では簡単化のために、ファイル１のインデックスリストを作成する場合について説明する。ファイル１は、例えば、ブロックデータ“Ａ”、ブロックデータ“Ｂ”及びブロックデータ“Ｃ”で構成される。 A specific method for creating an index list will be described with reference to FIGS. In the following description, a case where an index list of file 1 is created will be described for the sake of simplicity. The file 1 is composed of, for example, block data “A”, block data “B”, and block data “C”.

図５は、ブロックデータ“Ａ”に関するブロックデータリストの構成図である。
ブロックデータリストは、リストア情報を表すレコード５２を複数有する。レコード５２は、ブロックデータ部分ブロックグループ、ＣＰＵ処理時間（生成処理時間）、転送時間（通信時間）、総リストア時間の各値を有する。
ブロックデータの値は、ブロックデータの識別情報を表す。
部分ブロックグループの値は、各部分ブロックグループに含まれる部分ブロックデータの識別情報を表す。 FIG. 5 is a configuration diagram of a block data list regarding the block data “A”.
The block data list has a plurality of records 52 representing restore information. The record 52 has values of a block data partial block group, a CPU processing time (generation processing time), a transfer time (communication time), and a total restore time.
The value of the block data represents identification information of the block data.
The value of the partial block group represents identification information of partial block data included in each partial block group.

ＣＰＵ処理時間の値は、１つのブロックデータを生成する際に要する生成処理時間を表す。なお、直接リストアグループを表すレコード５２（図５最上段のレコード５２）のＣＰＵ処理時間は、部分ブロックデータを組み合わせる必要がないため“０”である。ＣＰＵ処理時間は、部分ブロックデータの数毎にテーブルとして保持されていても良い。この場合、処理時間算出部１０７は、テーブルを参照することによって、部分ブロックグループに含まれる部分ブロックデータの数に応じたＣＰＵ処理時間を算出する。 The value of the CPU processing time represents the generation processing time required for generating one block data. Note that the CPU processing time of the record 52 representing the direct restore group (the uppermost record 52 in FIG. 5) is “0” because it is not necessary to combine partial block data. The CPU processing time may be held as a table for each number of partial block data. In this case, the processing time calculation unit 107 calculates the CPU processing time according to the number of partial block data included in the partial block group by referring to the table.

転送時間の値は、リストアに必要なブロックデータ及び部分ブロックデータを、各データセンタ１０からリストア要求元（ＰＣ３０）に転送する際に要する時間を表す。転送時間は、例えばブロック配置リストに基づいてリストアを行うのに必要なブロックデータ及び部分ブロックデータのブロックサイズを、ブロックデータ及び部分ブロックデータをデータセンタ１０から受信するのに要する遅延時間で除算することで算出される。
総リストア時間の値は、各部分ブロックグループの総リストア時間を表す。具体的には、レコード５２毎にＣＰＵ処理時間と転送時間との和によって算出される合計時間である。 The transfer time value represents the time required to transfer the block data and partial block data necessary for restoration from each data center 10 to the restoration request source (PC 30). For example, the transfer time is obtained by dividing the block size of block data and partial block data required for restoration based on the block arrangement list by the delay time required to receive the block data and partial block data from the data center 10. It is calculated by.
The value of the total restore time represents the total restore time of each partial block group. Specifically, this is the total time calculated by the sum of the CPU processing time and the transfer time for each record 52.

図６は、ファイル１を構成するブロックデータ毎のブロックデータリストを表す図である。
図６（Ａ）は、ブロックデータ“Ａ”に関するブロックデータリストの具体例を表す図である。図６（Ｂ）は、ブロックデータ“Ｂ”に関するブロックデータリストの具体例を表す図である。図６（Ｃ）は、ブロックデータ“Ｃ”に関するブロックデータリストの具体例を表す図である。 FIG. 6 is a diagram showing a block data list for each block data constituting the file 1.
FIG. 6A is a diagram illustrating a specific example of a block data list regarding the block data “A”. FIG. 6B is a diagram illustrating a specific example of a block data list regarding the block data “B”. FIG. 6C is a diagram illustrating a specific example of a block data list regarding the block data “C”.

候補選択部１０９は、ファイルを構成するブロックデータ毎に候補部分ブロックグループを選択する。図６（Ａ）では、部分ブロックデータ“Ｅ１”及び“Ｆ２”で構成される部分ブロックグループの総リストア時間が“１０”であり、ブロックデータ“Ａ”に関する部分ブロックグループの中で最も総リストア時間が短い。したがって、候補選択部１０９は、部分ブロックデータ“Ｅ１”及び“Ｆ２”で構成される部分ブロックグループをブロックデータ“Ａ”の候補部分ブロックグループに選択する（図６（Ａ）の太枠）。 The candidate selection unit 109 selects a candidate partial block group for each block data constituting the file. In FIG. 6A, the total restoration time of the partial block group composed of the partial block data “E1” and “F2” is “10”, and is the most total restoration among the partial block groups related to the block data “A”. The time is short. Therefore, the candidate selection unit 109 selects a partial block group composed of the partial block data “E1” and “F2” as a candidate partial block group of the block data “A” (thick frame in FIG. 6A).

図６（Ｂ）では、ブロックデータ“Ｂ”のみで構成される直接リストアグループの総リストア時間（直接リストア時間」が“１４”であり、ブロックデータ“Ｂ”に関する部分ブロックグループの中で最も総リストア時間が短い。したがって、候補選択部１０９は、ブロックデータ“Ｂ”のみで構成される直接リストアグループをブロックデータ“Ｂ”の候補部分ブロックグループに選択する（図６（Ｂ）の太枠）。 In FIG. 6B, the total restore time (direct restore time) of the direct restore group composed only of the block data “B” is “14”, which is the most total among the partial block groups related to the block data “B”. Therefore, the restore time is short, so that the candidate selection unit 109 selects a direct restore group including only block data “B” as a candidate partial block group of block data “B” (thick frame in FIG. 6B). .

図６（Ｃ）では、部分ブロックデータ“Ｐ２”及び“Ｏ２”で構成される部分ブロックグループの総リストア時間が“１１”であり、ブロックデータ“Ｃ”に関する部分ブロックグループの中で最も総リストア時間が短い。したがって、候補選択部１０９は、部分ブロックデータ“Ｐ２”及び“Ｏ２”で構成される部分ブロックグループをブロックデータ“Ｃ”の候補部分ブロックグループに選択する（図６（Ｃ）の太枠）。 In FIG. 6C, the total restoration time of the partial block group composed of the partial block data “P2” and “O2” is “11”, which is the most total restoration among the partial block groups related to the block data “C”. The time is short. Therefore, the candidate selection unit 109 selects a partial block group composed of the partial block data “P2” and “O2” as a candidate partial block group of the block data “C” (thick frame in FIG. 6C).

次に、最長時間決定部１１０は、ブロックデータリスト毎の候補部分ブロックグループの中から、総リストア時間が最長となる部分ブロックグループを決定する。図６では、ブロックデータ“Ｂ”の候補部分ブロックグループの総リストア時間が最も長い。したがって、最長時間決定部１１０は、ブロックデータ“Ｂ”の総リストア時間をファイル１の最長リストア時間に決定する。また、最長時間決定部１１０は、最長リストア時間の部分ブロックグループを最長部分ブロックグループに決定する。 Next, the longest time determination unit 110 determines a partial block group having the longest total restore time from candidate partial block groups for each block data list. In FIG. 6, the total restoration time of the candidate partial block group of the block data “B” is the longest. Therefore, the longest time determination unit 110 determines the total restore time of the block data “B” as the longest restore time of the file 1. Further, the longest time determination unit 110 determines the partial block group having the longest restore time as the longest partial block group.

図７は、ファイル１を構成するブロックデータ毎のブロックデータリストを表す図である。
図７において、最終候補決定部１１１は、最長部分ブロックグループ以外の候補部分ブロックグループ毎に、最終部分ブロックグループを決定する。最終候補決定部１１１は、候補部分ブロックグループ毎に、最長リストア時間と直接リストア時間とを比較する。直接リストア時間が最長リストア時間よりも短い場合、最終候補決定部１１１は、直接リストアグループを最終部分ブロックグループに決定する。一方、直接リストア時間が最長リストア時間より長い場合、最終候補決定部１１１は、候補部分ブロックグループを最終部分ブロックグループに決定する。 FIG. 7 is a diagram illustrating a block data list for each block data constituting the file 1.
In FIG. 7, the final candidate determination unit 111 determines a final partial block group for each candidate partial block group other than the longest partial block group. The final candidate determination unit 111 compares the longest restore time with the direct restore time for each candidate partial block group. When the direct restore time is shorter than the longest restore time, the final candidate determining unit 111 determines the direct restore group as the final partial block group. On the other hand, when the direct restore time is longer than the longest restore time, the final candidate determination unit 111 determines the candidate partial block group as the final partial block group.

図７（Ａ）において、最終候補決定部１１１は、ブロックデータ“Ａ”の直接リストア時間と、最長リストア時間とを比較する。ブロックデータ“Ａ”の直接リストア時間は“１２”であり、最長リストア時間は“１４”である。すなわち、ブロックデータ“Ａ”の直接リストア時間は最長リストア時間より短い。そのため、最終候補決定部１１１は、直接リストアグループを、ブロックデータ“Ａ”の最終部分ブロックグループに決定する。 In FIG. 7A, the final candidate determination unit 111 compares the direct restore time of the block data “A” with the longest restore time. The direct restore time of the block data “A” is “12”, and the longest restore time is “14”. That is, the direct restore time of the block data “A” is shorter than the longest restore time. Therefore, the final candidate determination unit 111 determines the direct restore group as the final partial block group of the block data “A”.

図７（Ｃ）において、最終候補決定部１１１は、ブロックデータ“Ｃ”の直接リストア時間と、最長リストア時間とを比較する。ブロックデータ“Ｃ”の直接リストア時間は“１５”であり、最長リストア時間は“１４”である。すなわち、ブロックデータ“Ｃ”の直接リストア時間は最長リストア時間より長い。そのため、最終候補決定部１１１は、候補部分ブロックグループを、ブロックデータ“Ｃ”の最終部分ブロックグループに決定する。
出力部１１２は、最長部分ブロックグループと最終部分ブロックグループとに基づいて、ファイル１のインデックスリストを生成する。 In FIG. 7C, the final candidate determination unit 111 compares the direct restore time of the block data “C” with the longest restore time. The direct restore time of the block data “C” is “15”, and the longest restore time is “14”. That is, the direct restore time of the block data “C” is longer than the longest restore time. Therefore, final candidate determination section 111 determines the candidate partial block group as the final partial block group of block data “C”.
The output unit 112 generates an index list of the file 1 based on the longest partial block group and the final partial block group.

図８及び図９は、本実施形態のインデックスリスト作成の処理の流れを示すフローチャートである。
通信部１０１は、複数のＰＣ３０からバックアップファイルを受信する（ステップＳ１０１）。構成リスト作成部１０２は、リストアデータ構成リストを作成する。構成リスト作成部１０２は、複数のＰＣ３０からバックアップファイルをリストアデータ構成リストにファイル単位で記述する（ステップＳ１０２）。重複排除部１０４は重複排除処理を行うことによって、データセンタ１０間でブロックデータの重複を排除する（ステップＳ１０３）。各データセンタ１０は、自装置（データセンタ１０）が記憶しているブロックデータを中央管理装置２０に転送する（ステップＳ１０４）。 8 and 9 are flowcharts showing the flow of processing for creating an index list according to this embodiment.
The communication unit 101 receives backup files from the plurality of PCs 30 (step S101). The configuration list creation unit 102 creates a restore data configuration list. The configuration list creation unit 102 describes backup files from a plurality of PCs 30 in the restore data configuration list in units of files (step S102). The deduplication unit 104 eliminates duplication of block data between the data centers 10 by performing deduplication processing (step S103). Each data center 10 transfers the block data stored in its own device (data center 10) to the central management device 20 (step S104).

中央管理装置２０の通信部２０１は、各データセンタ１０から転送されたブロックデータを受信する。分割処理部２０３は、受信したブロックデータの分割処理を行う。分割処理部２０３はブロックデータを生成可能な部分ブロックデータを生成する。検出部２０４は、分割処理部２０３が生成した部分ブロックデータの重複を検出する（ステップＳ１０５）。 The communication unit 201 of the central management device 20 receives the block data transferred from each data center 10. The division processing unit 203 performs a division process on the received block data. The division processing unit 203 generates partial block data that can generate block data. The detection unit 204 detects the overlap of the partial block data generated by the division processing unit 203 (step S105).

更新部２０５は、検出部２０４による検出結果に応じて、部分ブロックリストを更新する（ステップＳ１０６）。更新部２０５は、検出部２０４による検出結果に応じて、ブロック配置リストを更新する（ステップＳ１０７）。通信部２０１は、部分ブロックリスト及びブロック配置リストを、各データセンタ１０に転送する（ステップＳ１０８）。 The update unit 205 updates the partial block list according to the detection result by the detection unit 204 (step S106). The update unit 205 updates the block arrangement list according to the detection result by the detection unit 204 (step S107). The communication unit 201 transfers the partial block list and the block arrangement list to each data center 10 (step S108).

データセンタ１０の通信部１０１は、中央管理装置２０から転送された部分ブロックリスト及びブロック配置リストを受信する。通信部１０１は、受信した部分ブロックリスト及びブロック配置リストをブロックリスト記憶部１０６に記録する。処理時間算出部１０７は、ブロックリスト記憶部１０６に記憶されている部分ブロックリスト及びブロック配置リストに基づいて、ファイルを構成する部分ブロックデータ毎に部分ブロックグループの総リストア時間を算出する（ステップＳ１０９）。 The communication unit 101 of the data center 10 receives the partial block list and the block arrangement list transferred from the central management device 20. The communication unit 101 records the received partial block list and block arrangement list in the block list storage unit 106. The processing time calculation unit 107 calculates the total restoration time of the partial block group for each partial block data constituting the file based on the partial block list and the block arrangement list stored in the block list storage unit 106 (step S109). ).

候補選択部１０９は、ファイルを構成するブロックデータ毎に部分ブロックグループの中から、総リストア時間が最も短い部分ブロックグループ（候補部分ブロックグループ）を選択する（ステップＳ１１０）。
最長時間決定部１１０は、候補部分ブロックグループの中から、総リストア時間が最長となる最長部分ブロックグループを決定する。最長時間決定部１１０は、最長部分ブロックグループの総リストア時間を最長リストア時間として取得する（ステップＳ１１１）。 The candidate selection unit 109 selects a partial block group (candidate partial block group) having the shortest total restoration time from the partial block groups for each block data constituting the file (step S110).
The longest time determination unit 110 determines the longest partial block group having the longest total restoration time from the candidate partial block groups. The longest time determination unit 110 acquires the total restore time of the longest partial block group as the longest restore time (step S111).

最終候補決定部１１１は、最長部分ブロックグループ以外の候補部分ブロックグループ毎に、直接リストア時間と直接リストア時間とを比較する（ステップＳ１１２）。直接リストア時間が最長リストア時間よりも長い場合（ステップＳ１１２−ＹＥＳ）、最終候補決定部１１１は、候補部分ブロックグループを最終部分ブロックグループに決定する（ステップＳ１１３）。 The final candidate determination unit 111 compares the direct restore time with the direct restore time for each candidate partial block group other than the longest partial block group (step S112). If the direct restore time is longer than the longest restore time (step S112—YES), the final candidate determining unit 111 determines the candidate partial block group as the final partial block group (step S113).

一方、ステップＳ１１２の処理において、直接リストア時間が最長リストア時間よりも短い場合（ステップＳ１１２−ＮＯ）、最終候補決定部１１１は、直接リストアグループを最終部分ブロックグループに決定する（ステップＳ１１４）。
出力部１１２は、最終部分ブロックグループと最長部分ブロックグループとに基づいて、インデックスリストを生成する（ステップＳ１１５）。出力部１１２は、生成したインデックスリストをインデックスリスト記憶部１１３に出力する。インデックスリスト記憶部１１３は、出力部１１２がインデックスリストを出力する度に、インデックスリストを追加し記憶する（ステップＳ１１６）。 On the other hand, when the direct restore time is shorter than the longest restore time in the process of step S112 (step S112—NO), the final candidate determination unit 111 determines the direct restore group as the final partial block group (step S114).
The output unit 112 generates an index list based on the final partial block group and the longest partial block group (step S115). The output unit 112 outputs the generated index list to the index list storage unit 113. The index list storage unit 113 adds and stores an index list every time the output unit 112 outputs the index list (step S116).

図１０は、本実施形態のリストアの処理の流れを示すフローチャートである。
データセンタ１０の通信部１０１は、ネットワーク４０を介してＰＣ３０から送信されたユーザのリストア要求を受信する（ステップＳ２０１）。通信部１０１は、受信したリストア要求に応じてインデックスリストを参照することによって、リストア要求されたファイルを構成するブロックデータ及び部分ブロックデータとその配置先を判定する（ステップＳ２０２）。通信部１０１は、リストア要求されたファイルを構成するブロックデータ及び部分ブロックデータのうち、自装置（データセンタ１０）に記憶されているデータについては、ブロックデータ記憶部１０５からブロックデータを読み出す。通信部１０１は、インデックスリストを参照することによって、リストア要求されたファイルを構成するブロックデータを要求元（ＰＣ３０）に送信する。 FIG. 10 is a flowchart showing the flow of restoration processing according to this embodiment.
The communication unit 101 of the data center 10 receives the user restore request transmitted from the PC 30 via the network 40 (step S201). The communication unit 101 refers to the index list according to the received restore request, thereby determining the block data and partial block data constituting the file requested to be restored and the arrangement destination thereof (step S202). The communication unit 101 reads block data from the block data storage unit 105 for data stored in its own device (data center 10) among the block data and partial block data constituting the file requested to be restored. The communication unit 101 refers to the index list to transmit block data constituting the file requested to be restored to the request source (PC 30).

また、部分ブロックデータ生成部１１４は、インデックスリストに記憶されている先頭ポインタ情報及びバイト数を参照する。部分ブロックデータ生成部１１４は、先頭ポインタ情報及びバイト数に基づいて、ブロックデータの分割処理を行い、部分ブロックデータを生成する。通信部１０１は、生成された部分ブロックデータを要求元（ＰＣ３０）に送信する。通信部１０１は、リストア要求されたファイルを構成するブロックデータ及び部分ブロックデータのうち、自装置（データセンタ１０）に記憶されていないデータについては、その配置先のデータセンタ１０に対して、要求元（ＰＣ３０）へブロックデータ又は部分ブロックデータを転送することを要求する（ステップＳ２０３）。 Further, the partial block data generation unit 114 refers to the head pointer information and the number of bytes stored in the index list. The partial block data generation unit 114 performs block data division processing based on the head pointer information and the number of bytes, and generates partial block data. The communication unit 101 transmits the generated partial block data to the request source (PC 30). Of the block data and partial block data constituting the file requested to be restored, the communication unit 101 makes a request to the data center 10 of the arrangement destination for data not stored in its own device (data center 10). A request is made to transfer block data or partial block data to the source (PC 30) (step S203).

以上のように構成された管理システムによれば、各データセンタ１０は、自装置と各データセンタ１０との間のデータ転送時間に基づいて、ファイル毎にリストアに要する時間を最短にするためのインデックスリストを作成する。そのため、ユーザによってリストアが要求されているファイルについて、リストアに要する時間を低減する事が可能となる。 According to the management system configured as described above, each data center 10 uses the data transfer time between its own device and each data center 10 to minimize the time required for restoration for each file. Create an index list. For this reason, it is possible to reduce the time required for restoration of a file requested to be restored by the user.

また、最長リストア時間よりも短い時間でリストア可能なブロックデータについては、部分ブロックデータの結合を行うことなくブロックデータを直接転送するように、インデックスリストが生成される。一般的に、部分ブロックデータを結合してブロックデータを生成する処理はユーザの装置（ＰＣ３０）で行われる。そのため、結合を行う回数を減らすことによって、ユーザの装置（ＰＣ３０）におけるＣＰＵの処理負荷を軽減することができる。 For block data that can be restored in a time shorter than the longest restoration time, an index list is generated so that the block data is directly transferred without combining partial block data. In general, processing for combining partial block data to generate block data is performed by a user device (PC 30). Therefore, the processing load of the CPU in the user device (PC 30) can be reduced by reducing the number of times of combining.

＜変形例＞
ブロック配置リストのレコード５１は、ブロックサイズ及び配置先以外の情報を有しても良い。
上述したステップＳ１０９〜ステップＳ１１５の処理は、データセンタ１０以外で行われても良い。例えば、中央管理装置２０がインデックスリストを作成して、各データセンタ１０にインデックスリストを転送しても良い。各データセンタ１０の通信部１０１は、転送されたインデックスリストをインデックスリスト記憶部１１３に記録する。
複数のデータセンタ１０の中の１つのデータセンタ１０が、中央管理装置２０として機能するように構成されても良い。
ユーザによって操作される装置は、上述したＰＣ３０に限定される必要は無い。例えば、ＰＣ（パーソナルコンピュータ）に変えて、スマートフォン、携帯電話機、ゲーム装置、テレビ受像機、映像記録装置などの通信端末装置であればどのような装置であっても良い。 <Modification>
The block arrangement list record 51 may have information other than the block size and the arrangement destination.
The processes in steps S109 to S115 described above may be performed in a place other than the data center 10. For example, the central management device 20 may create an index list and transfer the index list to each data center 10. The communication unit 101 of each data center 10 records the transferred index list in the index list storage unit 113.
One data center 10 among the plurality of data centers 10 may be configured to function as the central management device 20.
The device operated by the user need not be limited to the PC 30 described above. For example, instead of a PC (personal computer), any device may be used as long as it is a communication terminal device such as a smartphone, a mobile phone, a game device, a television receiver, and a video recording device.

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes designs and the like that do not depart from the gist of the present invention.

１０…データセンタ，２０…中央管理装置，３０…ＰＣ，４０…ネットワーク，１０１…通信部，１０２…構成リスト作成部，１０３…リストアデータ記憶部，１０４…重複排除部，１０５…ブロックデータ記憶部，１０６…ブロックリスト記憶部，１０７…処理時間算出部，１０８…決定部，１０９…候補選択部，１１０…最長時間決定部，１１１…最終候補決定部，１１２…出力部，１１３…インデックスリスト記憶部，１１４…部分ブロックデータ生成部，２０１…通信部，２０２…ブロックデータ記憶部，２０３…分割処理部，２０４…検出部，２０５…更新部 DESCRIPTION OF SYMBOLS 10 ... Data center, 20 ... Central management apparatus, 30 ... PC, 40 ... Network, 101 ... Communication part, 102 ... Configuration list creation part, 103 ... Restore data storage part, 104 ... Deduplication part, 105 ... Block data storage part 106: Block list storage unit, 107: Processing time calculation unit, 108 ... Determination unit, 109 ... Candidate selection unit, 110 ... Longest time determination unit, 111 ... Final candidate determination unit, 112 ... Output unit, 113 ... Index list storage , 114 ... Partial block data generation unit, 201 ... Communication unit, 202 ... Block data storage unit, 203 ... Division processing unit, 204 ... Detection unit, 205 ... Update unit

Claims

One data composed of a plurality of block data can be restored at a transmission destination by transmitting the plurality of block data in response to a request, and the block data includes other block data It can be generated by combining a plurality of partial block data, which is a part, and is necessary for generating data requested by a user from a plurality of data centers installed at a plurality of points and storing one or more of the block data. A management system in a communication system for transmitting the block data or the partial block data,
For each block data, a block list storage unit that stores a combination of the plurality of partial block data capable of generating the block data;
For each block data, a processing time calculation unit that calculates a processing time required for generating the block data by combining the plurality of partial block data for each combination stored in the block list storage unit;
Acquisition for each block data based on the processing time required to receive the block data constituting the data from the data center and the processing time calculated by the processing time calculation unit for each data A determination unit for determining a method;
A management system comprising:

The processing time calculation unit, a communication time required to receive a plurality of the partial block data required to generate the block data, to generate the block data by combining said plurality of partial block data The management system according to claim 1, wherein the processing time is calculated based on a sum of a generation processing time required for the processing.

The determination unit
For each block data, the processing time required to receive the block data from the data center is compared with each processing time calculated by the processing time calculation unit, and an acquisition method in the shortest processing time is selected. A candidate selector to select as,
A longest time determination unit that determines the longest processing time among the processing times of each candidate of a plurality of block data constituting the data as the longest time, and determines the longest time candidate as the longest candidate;
For each candidate other than the longest candidate, a final candidate determining unit that determines, as a final candidate, the acquisition method in which the generation processing time is the shortest in a range in which the processing time does not exceed the longest time;
An output unit that outputs a combination of the longest candidate and the final candidate as a processing method when acquiring the data;
The management system according to claim 2, comprising: