JP5660617B2

JP5660617B2 - Storage device

Info

Publication number: JP5660617B2
Application number: JP2011072791A
Authority: JP
Inventors: 正貴的場; 充洋松尾; 弘崇中野
Original assignee: NEC Corp; NEC Solutions Innovators Ltd
Current assignee: NEC Corp; NEC Solutions Innovators Ltd
Priority date: 2011-03-29
Filing date: 2011-03-29
Publication date: 2015-01-28
Anticipated expiration: 2031-03-29
Also published as: JP2012208650A

Description

The present invention relates to a storage apparatus, and more particularly to a storage apparatus that eliminates redundant storage of data having the same content.

In recent years, with the development and spread of computers, various kinds of information have been converted into digital data. Storage devices such as magnetic tapes and magnetic disks are used to store such digital data. Because the amount of data to be stored grows daily and becomes enormous, large-capacity storage systems are required. Reliability is also required while the cost spent on storage devices is reduced, and the data must be easy to retrieve later. Consequently, there is demand for a storage system that can automatically increase its storage capacity and performance, that reduces storage cost by eliminating duplicate storage, and that offers high redundancy.

In response to this situation, content address storage systems have been developed in recent years, as shown in Patent Document 1. A content address storage system distributes data across a plurality of storage devices and identifies the storage location of the data by a unique content address determined according to the content of the data. Specifically, the content address storage system divides given data into a plurality of fragments, adds further fragments that serve as redundant data, and stores these fragments in a plurality of storage devices, respectively.

Later, by designating a content address, the data (that is, the fragments) stored at the storage location identified by that content address can be read, and the given data as it was before division can be restored from the fragments.

The content address is, for example, a hash value of the data, generated so as to be unique to the content of the data. For duplicate data, therefore, data of the same content can be obtained by referring to the data at the same storage location. Accordingly, duplicate data need not be stored separately, so duplicate recording is eliminated and the data volume is reduced.

Patent Document 1: JP 2005-235171 A

Here, the data deduplication determination described above is generally performed by dividing the data of a file into pieces of a certain length (fixed or variable) and comparing those pieces. Consequently, if differences occur within a file at intervals shorter than the length of the pieces, the pieces are not determined to have the same content. That is, even when two pieces contain portions with identical content, a slight difference causes both pieces to be stored, so the data to be stored cannot be deduplicated efficiently.

For example, backup software inserts its own management information into the backup file in addition to the data being backed up. If this management information differs from backup to backup, and it is inserted at intervals shorter than the interval at which data is divided for duplicate determination, the stored data cannot be deduplicated, as described above. As a result, data storage efficiency decreases.
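
The effect described above can be illustrated with a minimal sketch. The block length (64 bytes rather than 64 KB), the stamp format, the use of SHA-256, and the function names are all assumptions made for illustration and do not appear in the patent.

```python
import hashlib

BLOCK = 64  # hypothetical block length in bytes, scaled down for illustration


def block_hashes(data: bytes, block: int = BLOCK) -> list[str]:
    """Return the hash of each fixed-length block of data."""
    return [hashlib.sha256(data[i:i + block]).hexdigest()
            for i in range(0, len(data), block)]


def duplicate_blocks(first: bytes, second: bytes) -> int:
    """Count blocks of `second` whose hash already occurs among the blocks of `first`."""
    known = set(block_hashes(first))
    return sum(h in known for h in block_hashes(second))


def make_backup(user: bytes, stamp: bytes, interval: int) -> bytes:
    """Insert `stamp` before every `interval` bytes of user data (hypothetical format)."""
    out = bytearray()
    for i in range(0, len(user), interval):
        out += stamp + user[i:i + interval]
    return bytes(out)


user = bytes(range(256)) * 4                  # 1024 bytes of unchanged user data
backup1 = make_backup(user, b"RUN-0001", 32)  # stamp interval (32) is shorter than BLOCK (64)
backup2 = make_backup(user, b"RUN-0002", 32)

print(duplicate_blocks(backup1, backup2))  # raw second backup: blocks already stored
print(duplicate_blocks(user, user))        # user data alone: blocks already stored
```

Under these assumptions, none of the 20 blocks of the second raw backup matches a stored block, because every block contains at least one stamp byte that differs, whereas all 16 blocks of the user data alone are duplicates.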

Accordingly, an object of the present invention is to solve the problem described above, namely the decrease in data storage efficiency.

To achieve the above object, a storage system according to one aspect of the present invention includes:
a data writing unit that divides data to be written into pieces of a predetermined capacity and stores them in a storage device, and that, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causes the already stored data to be referenced as the other data; and
a data separation unit that separates the data to be written into a user data portion and a management data portion classified according to a preset criterion,
wherein the data writing unit generates a file storing the user data portion, stores the management data portion in the file as metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

A program according to another aspect of the present invention is a program for an information processing apparatus including a data writing unit that divides data to be written into pieces of a predetermined capacity and stores them in a storage device, and that, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causes the already stored data to be referenced as the other data,
the program causing the information processing apparatus to realize a data separation unit that separates the data to be written into a user data portion and a management data portion classified according to a preset criterion,
wherein the data writing unit generates a file storing the user data portion, stores the management data portion in the file as metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

A data storage method according to another aspect of the present invention is a data storage method including a data writing step of dividing data to be written into pieces of a predetermined capacity and storing them in a storage device, and, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causing the already stored data to be referenced as the other data,
the method including, before the data writing step, a data separation step of separating the data to be written into a user data portion and a management data portion classified according to a preset criterion,
wherein the data writing step generates a file storing the user data portion, stores the management data portion in the file as metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

Configured as described above, the present invention can improve data storage efficiency.

FIG. 1 is a block diagram showing the configuration of the entire system including the storage system in Embodiment 1 of the present invention.
FIG. 2 is a block diagram showing an outline of the configuration of the storage system in Embodiment 1 of the present invention.
FIG. 3 is a functional block diagram showing the configuration of the storage system in Embodiment 1 of the present invention.
FIG. 4 is an explanatory diagram illustrating data write processing in the storage system disclosed in FIG. 3.
FIG. 5 is an explanatory diagram illustrating data write processing in the storage system disclosed in FIG. 3.
FIG. 6 is an explanatory diagram illustrating data write processing in the storage system disclosed in FIG. 3.
FIG. 7 is a diagram showing an example of the data structure of the index data disclosed in FIG. 6.
FIG. 8 is a flowchart showing the operation of data write processing in the storage system disclosed in FIG. 3.
FIG. 9 is a flowchart showing the operation of data read processing in the storage system disclosed in FIG. 3.
FIG. 10 is a functional block diagram showing the configuration of the storage system in Supplementary Note 1.

<Embodiment 1>
A first embodiment of the present invention will be described with reference to FIGS. 1 to 9. FIG. 1 is a block diagram showing the configuration of the entire system. FIG. 2 is a block diagram showing an outline of the storage system, and FIG. 3 is a functional block diagram showing the configuration of the storage system. FIGS. 4 to 6 are explanatory diagrams illustrating the data write processing in the storage system. FIG. 7 is a diagram showing the data structure of index data stored in the storage device. FIGS. 8 and 9 are flowcharts showing the operation of the storage system.

Here, this embodiment shows a specific example of the storage apparatus and the like described in the supplementary notes below. The following describes a case in which the storage system is configured by connecting a plurality of server computers. However, the storage system of the present invention is not limited to a configuration of a plurality of computers and may be configured as a single computer.

[Configuration]
As shown in FIG. 1, the storage system 1 of the present invention is connected via a network N to a backup system 4 that controls backup processing. The backup system 4 acquires backup target data (data to be written) stored in a backup target device 5 connected via the network N, and requests the storage system 1 to store it. The storage system 1 thereby stores the backup target data for backup purposes.

As shown in FIG. 2, the storage system 1 in this embodiment has a configuration in which a plurality of server computers are connected. Specifically, the storage system 1 includes an accelerator node 2, which is a server computer that controls the storage and reproduction operations of the storage system 1 itself, and a storage node 3, which is a server computer equipped with a storage device that stores data. The numbers of accelerator nodes 2 and storage nodes 3 are not limited to those shown in FIG. 2; more of each node 2, 3 may be connected.

Furthermore, the storage system 1 in this embodiment is a content address storage system that divides data and makes it redundant, distributes it across a plurality of storage devices, and identifies the storage location of the data by a unique content address set according to the content of the stored data. This content address storage system is described in detail later.

In the following, the storage system 1 is treated as a single system, and the configuration and functions it provides are described. That is, the configuration and functions of the storage system 1 described below may be provided in either the accelerator node 2 or the storage node 3. The storage system 1 is not necessarily limited to including the accelerator node 2 and the storage node 3 as shown in FIG. 2; it may have any configuration and may, for example, be configured as a single computer. Furthermore, the storage system 1 is not limited to being a content address storage system; any storage system having a deduplication function will suffice.

FIG. 3 shows the configuration of the storage system 1 in this embodiment. The storage system 1 shown in FIG. 3 includes a data separation/combination device 10 and a CAS device 20 that are virtually constructed within the accelerator node 2 and the storage node 3 described above. The data separation/combination device 10 is connected to data writing/reading software 30 installed in the backup target device 5 and the backup system 4 described above, and stores and reads files in response to requests from the software 30.

The data separation/combination device 10 includes a separated data writing unit 11, a data separation unit 12, a separated data association unit 13, and a separated data reading unit 14, which are constructed by installing a program in the arithmetic device provided. A user data storage unit 22 and a management data and index data storage unit 23 are formed in a file 21 of the CAS device 20.

In practice, the configuration of the storage system 1 described above is implemented by arithmetic devices such as CPUs (Central Processing Units) and storage devices such as hard disk drives provided in the accelerator node 2 and the storage node 3 shown in FIG. 2.

As described above, the storage system 1 in this embodiment is a content address storage system. The separated data writing unit 11 (data writing unit) and the CAS device 20 therefore have a function of storing data in the storage device 25 using content addresses: as described below, they divide and distribute the data, identify the storage location by a content address, and store the data. The data write processing using content addresses in the storage system 1 is described below with reference to FIGS. 4 and 5.

First, as indicated by arrow Y1 in FIGS. 4 and 5, when a file A requested to be written is input from the data writing/reading software 30 via the separated data writing unit 11, the file A is divided into block data D of a predetermined capacity (for example, 64 KB), as indicated by arrow Y2 in FIGS. 4 and 5. The file A is, for example, the file 21 containing the user data portion 22 separated by the data separation unit 12, as described later. As shown in FIG. 3, this file 21 includes extended attributes and/or alternate data streams 23 associated with the user data 22. The extended attributes and/or alternate data streams 23 store the management data portion and the index data portion separated by the data separation unit 12.

Then, based on the data content of each block data D into which the file A has been divided, a unique hash value H representing that data content is calculated (arrow Y3 in FIG. 5). For example, the hash value H is calculated from the data content of the block data D using a preset hash function.

Next, the hash value H of the block data D of the file A is used to check whether the block data D has already been stored. Specifically, for block data D that has already been stored, its hash value H and a content address CA representing its storage location are registered in association with each other in an MFI (Main Fragment Index) file. Therefore, if the hash value H of the block data D calculated before storage exists in the MFI file, it can be determined that block data D with the same content has already been stored (arrow Y4 in FIG. 5). In this case, the content address CA associated with the hash value H in the MFI file that matches the hash value H of the block data D to be stored is acquired from the MFI file. This content address CA is then returned as the content address CA of the block data D requested to be written. The already stored data referenced by this content address CA is thus used as the block data D requested to be written, and the block data D of the write request need not be stored.
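
The block division, hashing, and duplicate check described above can be sketched as follows. This is a minimal illustration, not the patented implementation: SHA-256 is assumed as the hash function, a dictionary stands in for the MFI file, and an integer list position stands in for the content address CA. The names `BlockStore`, `write_block`, and `write_file` are hypothetical.

```python
import hashlib

BLOCK_SIZE = 64 * 1024  # the 64 KB example capacity from the description


class BlockStore:
    """Toy deduplicating store; `index` is a stand-in for the MFI file."""

    def __init__(self) -> None:
        self.index: dict[bytes, int] = {}  # hash value H -> content address
        self.blocks: list[bytes] = []      # stored blocks; list position is the address

    def write_block(self, block: bytes) -> int:
        """Store a block unless an identical one exists; return its address."""
        h = hashlib.sha256(block).digest()
        if h in self.index:                # same content already stored
            return self.index[h]           # reuse the existing address
        self.blocks.append(block)
        address = len(self.blocks) - 1
        self.index[h] = address
        return address

    def write_file(self, data: bytes) -> list[int]:
        """Divide data into fixed-size blocks and write each one."""
        return [self.write_block(data[i:i + BLOCK_SIZE])
                for i in range(0, len(data), BLOCK_SIZE)]


store = BlockStore()
addresses = store.write_file(b"x" * (2 * BLOCK_SIZE) + b"y" * 10)
print(addresses, len(store.blocks))
```

In this usage, the two identical 64 KB blocks share one address, so only two blocks are physically stored for a three-block file.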

If it is determined that the block data D of the write request has not yet been stored, the block data D is compressed and divided into a plurality of pieces of fragment data of a predetermined capacity, as indicated by arrow Y5 in FIG. 5. For example, it is divided into nine pieces of fragment data (divided data 41), as indicated by reference signs D1 to D9 in FIG. 4. In addition, redundant data is generated so that the original block data can be restored even if some of the divided fragment data is lost, and is added to the divided fragment data 41. For example, three pieces of fragment data (redundant data 42) are added, as indicated by reference signs D10 to D12 in FIG. 4. This produces a data set 40 of twelve pieces of fragment data, consisting of the nine pieces of divided data 41 and the three pieces of redundant data 42.
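
The compression, division, and redundancy steps can be sketched under simplifying assumptions. The description does not name the redundancy code, so this sketch swaps in a single XOR parity fragment in place of the three redundant fragments D10 to D12; it therefore recovers from the loss of only one fragment. `zlib` compression, zero padding, and the function names are likewise assumptions.

```python
import zlib
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Bytewise XOR of two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def make_fragments(block: bytes, n: int = 9):
    """Compress a block, split it into n equal fragments, and add one XOR parity fragment."""
    compressed = zlib.compress(block)
    size = -(-len(compressed) // n)              # ceiling division
    padded = compressed.ljust(size * n, b"\0")   # pad so all fragments have equal length
    data = [padded[i * size:(i + 1) * size] for i in range(n)]
    parity = reduce(xor_bytes, data)
    return data, parity, len(compressed)


def restore_block(data, parity: bytes, compressed_len: int) -> bytes:
    """Rebuild the block; at most one entry of `data` may be None (lost)."""
    missing = [i for i, frag in enumerate(data) if frag is None]
    if len(missing) > 1:
        raise ValueError("a single parity fragment can recover only one lost fragment")
    data = list(data)
    if missing:
        present = [frag for frag in data if frag is not None]
        # XOR of the parity with all surviving fragments yields the lost one
        data[missing[0]] = reduce(xor_bytes, present, parity)
    return zlib.decompress(b"".join(data)[:compressed_len])


block = bytes(range(256)) * 256                  # one 64 KB block
fragments, parity, length = make_fragments(block)
fragments[4] = None                              # simulate a lost fragment
print(restore_block(fragments, parity, length) == block)
```

A code that tolerates the loss of several fragments, as the three redundant fragments in the description suggest, would need an erasure code such as Reed-Solomon, which is beyond this sketch.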

Next, the pieces of fragment data constituting the data set generated as described above are distributed and stored in respective storage areas formed in the storage devices. For example, when twelve pieces of fragment data D1 to D12 are generated as shown in FIG. 4, the pieces of fragment data D1 to D12 are stored one each in data storage files formed in a plurality of storage devices (see arrow Y6 in FIG. 5).

Next, the storage system 1 generates and manages a content address CA that represents the storage location of the fragment data D1 to D12 stored as described above, that is, the storage location of the block data D restored from the fragment data D1 to D12. Specifically, the content address CA is generated by combining a part of the hash value H calculated from the content of the stored block data D (a short hash; for example, the first 8 bytes of the hash value H) with information representing the logical storage location. This content address CA is then returned to the file system in the storage system 1 (arrow Y7 in FIG. 5). The storage system 1 then manages, in the file system, identification information such as the file name of the backup target data in association with the content address CA.
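
As a minimal sketch of how such a content address might be formed, the following combines the first 8 bytes of a hash with a logical location. SHA-256, the integer location, and the names `ContentAddress` and `make_content_address` are assumptions; the description specifies only that a short hash and logical storage location information are combined.

```python
import hashlib
from typing import NamedTuple


class ContentAddress(NamedTuple):
    short_hash: bytes  # first 8 bytes of the block's hash value H
    location: int      # hypothetical logical storage location


def make_content_address(block: bytes, location: int) -> ContentAddress:
    """Combine the short hash of a block with its logical storage location."""
    h = hashlib.sha256(block).digest()
    return ContentAddress(short_hash=h[:8], location=location)


# The file system associates a file's identification information with its addresses.
file_table = {"fileA": [make_content_address(b"block-0", 0),
                        make_content_address(b"block-1", 1)]}
print(file_table["fileA"][0].short_hash.hex(), file_table["fileA"][0].location)
```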

In addition, each storage node 3 manages the content address CA of the block data D in association with the hash value H of that block data D in the MFI file. In this way, the content address CA is stored in the storage devices of the accelerator node 2 and the storage node 3 in association with information identifying the file, the hash value H, and the like.

The storage system 1 also controls reading of files stored as described above. For example, when a read request designating a specific file is issued to the storage system 1, the content address CA corresponding to the requested file, consisting of the short hash (a part of the hash value) and the logical position information, is first designated based on the file system. It is then checked whether the content address CA is registered in the MFI file. If it is not registered, the requested data is not stored, so an error is returned.

If, on the other hand, the content address CA of the read request is registered, the storage location designated by the content address CA is identified, and the pieces of fragment data stored at the identified storage location are read as the data requested to be read. At this time, if the data storage files in which the fragments are stored and the storage position of one piece of fragment data within its data storage file are known, the storage positions of the other pieces of fragment data can be identified from that same storage position.

The block data D is then restored from the pieces of fragment data read in response to the read request. Further, a plurality of pieces of restored block data D are concatenated, restored into a group of data such as the file A, and returned.
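
The read path can be sketched in simplified form. In this sketch, `file_table` stands in for the file system's mapping from file names to content addresses, and `block_table` stands in for both the MFI registration check and the restoration of each block from its fragments, which is omitted. All names are hypothetical.

```python
def read_file(name: str, file_table: dict, block_table: dict) -> bytes:
    """Rebuild a file by concatenating the blocks its content addresses refer to."""
    blocks = []
    for address in file_table[name]:
        if address not in block_table:   # address not registered: nothing is stored
            raise LookupError(f"content address {address!r} is not registered")
        blocks.append(block_table[address])
    return b"".join(blocks)


block_table = {("ab", 0): b"hello ", ("cd", 1): b"world"}
file_table = {"fileA": [("ab", 0), ("cd", 1), ("ab", 0)]}
print(read_file("fileA", file_table, block_table))
```

Because the first and third addresses are the same, the deduplicated block is stored once but appears twice in the restored file.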

In this way, the storage system 1 in this embodiment eliminates duplicate data, and it is further configured to increase data storage efficiency. This configuration is described in detail below.

First, as shown in FIG. 6, the data separation unit 12 of the data separation/combination device 10 separates pre-separation data 30 constituting a given file into a user data portion 31 and a management data portion 32. The user data portion 31 consists of a group of "user data," which is the actual data of the file and whose values do not change with the time the file was generated, the number of updates, and the like. The management data portion 32 consists of a group of "management data," which contains management information about the file itself, such as time stamps and serial numbers, and whose values change with the time, the number of updates, and the like. In other words, the user data portion 31 may have the same content as other user data, so a deduplication effect can be expected from it. The management data portion 32, by contrast, is unlikely to have the same content as other management data, so a deduplication effect is hard to expect from it. The separation into the user data portion 31 and the management data portion 32 is performed based on reference information for distinguishing the data portions, which is set in the data separation unit 12 in advance.

The data separation unit 12 is not necessarily limited to separating the data to be written into a "user data portion" and a "management data portion." It may separate the data into one or more other kinds of data portions classified according to preset criteria.

As shown in FIG. 6, the data separation unit 12 also generates, based on the separation result, an index data portion 33 consisting of a group of index data containing information that identifies the position of each piece of user data and management data within the pre-separation data. For example, as shown in FIG. 7, the index data contains various kinds of management information such as the file type, the offset within the pre-separation file, the offset within each post-separation file, and the post-separation data length. Specifically, the "file type" is file information that identifies the type of file, or the file itself, in which the corresponding user data or management data is stored. The "offset within the pre-separation file" is the position of the separated user data or management data within the file before separation. The "offset within each post-separation file" is the position of the separated user data or management data within the user data file or management data file in which it is stored after separation. The "post-separation data length" is the length of the separated user data or management data itself. The information stored in the index data is not limited to the above.
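
The separation and index generation can be sketched as follows. The bracketed `[MGMT:...]` record format used as the preset criterion is purely hypothetical, as are the names `IndexEntry`, `separate`, and `combine`. The four fields of each index entry follow the items listed for FIG. 7.

```python
import re
from dataclasses import dataclass

# Hypothetical preset criterion: management records look like b"[MGMT:...]".
MGMT_PATTERN = re.compile(rb"\[MGMT:[^\]]*\]")


@dataclass
class IndexEntry:
    kind: str        # file type: "user" or "mgmt"
    src_offset: int  # offset within the pre-separation file
    dst_offset: int  # offset within the post-separation portion
    length: int      # post-separation data length


def separate(data: bytes):
    """Split data into user and management portions and build the index."""
    user, mgmt, index = bytearray(), bytearray(), []

    def emit(kind: str, chunk: bytes, src: int) -> None:
        target = user if kind == "user" else mgmt
        if chunk:
            index.append(IndexEntry(kind, src, len(target), len(chunk)))
            target.extend(chunk)

    pos = 0
    for match in MGMT_PATTERN.finditer(data):
        emit("user", data[pos:match.start()], pos)
        emit("mgmt", match.group(), match.start())
        pos = match.end()
    emit("user", data[pos:], pos)
    return bytes(user), bytes(mgmt), index


def combine(user: bytes, mgmt: bytes, index) -> bytes:
    """Rebuild the pre-separation data from the two portions and the index."""
    out = bytearray()
    for entry in sorted(index, key=lambda e: e.src_offset):
        source = user if entry.kind == "user" else mgmt
        out += source[entry.dst_offset:entry.dst_offset + entry.length]
    return bytes(out)


original = b"AAAA[MGMT:ts=1]BBBB[MGMT:ts=2]CC"
user, mgmt, index = separate(original)
print(user, mgmt, combine(user, mgmt, index) == original)
```

In this sketch the index alone is enough to interleave the two portions back into the original byte order, which corresponds to the role the index data plays when the separated data is read back.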

The separated data writing unit 11 (data writing unit) stores the data separated by the data separation unit 12 in the storage device 25. In doing so, it stores the separated user data portion 31 and management data portion 32 and the generated index data portion 33, shown in FIG. 6, in the same file 21. Specifically, as shown for the file 21 in FIG. 3, the separated data writing unit 11 stores only the user data portion 31 in the file 21 as user data 22, and stores the management data portion 32 and the index data portion 33 in extended attributes or alternate data streams 23, which are metadata associated with the user data 22. For example, the file 21 includes metadata in the form of an extended attribute and an alternate data stream; the management data portion 32 is stored in the extended attribute, and the index data portion 33 is stored in the alternate data stream. However, the management data portion and the index data portion may be stored in any data, provided that it is metadata associated with the file 21.

Here, the "extended attribute" of a file is data that extends the attribute data of the file 21, and is searchable list-structured data whose elements are combinations of a search key and data of arbitrary content. This list structure resembles the user data 22, from which data is retrieved using an offset in the file 21 as the key, and can be handled in the same way as the user data 22.

The "alternative data stream" mentioned above is data that is managed within the file 21 separately from the user data 22, and each alternative data stream is assigned an identifier for accessing its data. A single file 21 can have multiple alternative data streams. When the identifier of an alternative data stream is supplied together with the file 21 and the file is referenced or updated, the alternative data stream corresponding to the identifier is referenced or updated instead of the user data 22.

The storage system guarantees that the existence of the file 21, its extended attribute, and its alternative data stream remains consistent.

As described above, the extended attribute and the alternative data stream of the file 21 have the effect of giving a single file 21 multiple pieces of sub-data. By treating the extended attribute and the alternative data stream of the file 21 as sub-data streams of the user data 22, the user data, the management data, and the index data can all be written to a single file.
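The single-file layout described above can be illustrated with a minimal in-memory sketch. It is not the implementation of the storage system; the class `FileRecord`, the helper `write_separated`, the attribute key `"mgmt"`, and the stream identifier `"index"` are all hypothetical names chosen for illustration. The sketch only shows how one file can carry user data as its main content while the management data sits in an extended attribute and the index data in an alternative data stream.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class FileRecord:
    """Hypothetical in-memory stand-in for file 21."""
    name: str
    user_data: bytes = b""                                     # user data 22
    xattrs: Dict[str, bytes] = field(default_factory=dict)     # key -> value list
    streams: Dict[str, bytes] = field(default_factory=dict)    # stream id -> data

    def read(self, stream_id: Optional[str] = None) -> bytes:
        # Without an identifier the main user data is returned; with one,
        # the matching alternative data stream is returned instead.
        return self.user_data if stream_id is None else self.streams[stream_id]

    def write(self, data: bytes, stream_id: Optional[str] = None) -> None:
        if stream_id is None:
            self.user_data = data
        else:
            self.streams[stream_id] = data


def write_separated(name: str, user: bytes, mgmt: bytes, index: bytes) -> FileRecord:
    """Place the three separated parts into one file (assumed layout)."""
    f = FileRecord(name)
    f.write(user)                      # user data portion -> main data
    f.xattrs["mgmt"] = mgmt            # management data -> extended attribute
    f.write(index, stream_id="index")  # index data -> alternative data stream
    return f
```

Because all three parts hang off one `FileRecord`, deleting or losing the file removes them together, which mirrors the point that no separate existence check for three files is needed.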

The data to be written, once separated and then placed in a single file 21 as described above, is written to the CAS device 20 by the separated data writing unit 11, with the file itself treated as the data to be written. The subsequent write operation for the file 21 is the same as a write operation in an ordinary content address storage system: as described with reference to FIGS. 4 and 5, the file 21 is further divided, made redundant, and then distributed and stored across a plurality of storage devices.

At this time, the separated data writing unit 11 divides the management data and the index data written to the extended attribute and/or alternative data stream of the file 21 into variable-length blocks and stores them in the storage device 25 as a block set 26 and a block set 27. For example, the management data is stored in the block set 26 and the index data in the block set 27. As a result, the block sets 26 and 27 contain only management data and index data, for which deduplication cannot be expected. The separated data writing unit 11 also divides the user data 22 written to the file 21 into variable-length blocks and stores them in the storage device 25 as a block set 28. The block set 28 therefore contains only user data, for which deduplication can be expected. As described above, each of the block sets 26, 27, and 28 is set up in advance in the storage device 25 as an area for storing a specific kind of data.
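The following sketch illustrates why keeping user data in its own block set helps deduplication. It is a toy model under stated assumptions: the boundary rule in `split_variable` (cut after a byte whose value is a multiple of 8, within minimum and maximum lengths) and the use of SHA-256 as the content address are illustrative choices, not details taken from this description, and `BlockSet`, `store`, and `load` are hypothetical names.

```python
import hashlib
from typing import Dict, List


def split_variable(data: bytes, min_len: int = 4, max_len: int = 16) -> List[bytes]:
    """Toy content-defined chunking into variable-length blocks."""
    blocks, start = [], 0
    for i, b in enumerate(data):
        length = i - start + 1
        # Cut when the content suggests a boundary, or when the block is full.
        if (length >= min_len and b % 8 == 0) or length >= max_len:
            blocks.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        blocks.append(data[start:])  # trailing partial block
    return blocks


class BlockSet:
    """Hypothetical stand-in for one of the block sets 26, 27, 28."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.blocks: Dict[str, bytes] = {}  # content address -> block

    def put(self, block: bytes) -> str:
        addr = hashlib.sha256(block).hexdigest()
        # An identical block is stored once; later writes only reference it.
        self.blocks.setdefault(addr, block)
        return addr

    def get(self, addr: str) -> bytes:
        return self.blocks[addr]


def store(block_set: BlockSet, data: bytes) -> List[str]:
    """Split data and store each block, returning its content addresses."""
    return [block_set.put(b) for b in split_variable(data)]


def load(block_set: BlockSet, addrs: List[str]) -> bytes:
    """Reassemble data from its content addresses."""
    return b"".join(block_set.get(a) for a in addrs)
```

Storing the same user data twice in one `BlockSet` yields the same address list and adds no new blocks, whereas management data that changes on every backup would produce new blocks each time; that difference is the motivation for separate sets.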

The separated data reading unit 14 (data reading unit) of the data separation/combination device 10 acquires the file 21 from the storage device 25 based on the file name for which the data writing/reading software 30 requested a read. The separated data association unit 13 then reads the user data and the management data based on the information stored in the index data held in the extended attribute and/or alternative data stream 23 associated with the acquired file 21. Following the information stored in the index data, it identifies the positions that the user data and the management data occupied in the pre-separation data and combines them to regenerate the pre-separation data.

The separated data reading unit 14 then returns the regenerated pre-separation data to the data writing/reading software 30. The data writing/reading software 30 can therefore reference the file without being aware that the data has been separated for storage.

As described above, the storage system 1 manages the extended attribute, the alternative data stream, and the user data, which are stored in the storage device 25 as the separate block sets 26, 27, and 28, in association with a single user data file 21. The following processing is therefore also possible. On receiving an extended attribute update request, the storage system 1 updates the extended attribute portion of the file 21, divides only that portion into variable-length blocks, and stores them in the storage device 25 as the block set 26. On receiving an extended attribute reference request, it reconstructs the extended attribute from the block set 26 already stored in the storage device 25 and returns it. Likewise, on receiving an alternative data stream update request, it updates the alternative data stream portion of the file 21, divides only the alternative data stream into variable-length blocks, and stores them in the storage device 25 as the block set 27. On receiving an alternative data stream reference request, it reconstructs the alternative data stream from the block set 27 already stored in the storage device 25 and returns it. Finally, on receiving a user data update request, it updates the user data portion of the file 21, divides only the user data into variable-length blocks, and stores them in the storage device 25 as the block set 28. On receiving a user data reference request, it reconstructs the user data from the block set 28 already stored in the storage device 25 and returns it.

The degree of freedom in combining the separated data (user data, management data, index data) with the write destinations (the user data 22, extended attribute 23, and alternative data stream 23 of the file 21) depends on the specifications of the CAS device 20 that serves as the data storage destination. Deduplication performance can be ensured by storing the data in a CAS device 20 that deduplicates extended attributes and alternative data streams as well as user data. However, the data may also be stored in a storage device that does not deduplicate extended attributes or alternative data streams. Even in that case, deduplication performance can be achieved by writing the management data and the index data, for which deduplication cannot be expected, to the extended attribute or alternative data stream, while writing the user data portion to the user data 22 so that it is written to the CAS device 20 with its deduplication function.

[Operation]
Next, the operation of the storage system 1 described above, that is, of the data separation/combination device 10, will be described with reference to FIG. 3 and the flowcharts of FIGS. 8 and 9.

First, the operation when writing data to a file will be described. The data writing/reading software 30 issues a data write request to the data separation/combination device 10 (step S1). The separated data writing unit 11 passes the received data to the data separation unit 12. The data separation unit 12 separates the received data into user data and management data (step S2), generates index data (step S3), and returns all three to the separated data writing unit 11.

The separated data writing unit 11 writes the user data to the CAS device 20 as the user data 22 of the file 21. It writes the management data and the index data as the extended attribute and/or alternative data stream of the file 21 (step S4). The management data and the index data written to the extended attribute or alternative data stream are divided into variable-length blocks and stored in the storage device 25 as the block set 26 and the block set 27. As a result, the block sets 26 and 27 contain only management data and index data, for which deduplication cannot be expected. The user data 22 written to the file 21 is likewise divided into variable-length blocks and stored as the block set 28 in the storage device 25, which is equipped with a data deduplication mechanism. As a result, the block set 28 contains only user data, for which deduplication can be expected.

Next, the operation when reading will be described. The data writing/reading software 30 issues a data read request to the data separation/combination device 10 (step S11). The separated data reading unit 14 acquires, from the block sets 26, 27, and 28, the management data, index data, and user data stored in the form of the extended attribute and/or alternative data stream 23 and the user data 22 of the file 21 (step S12). Using the separated data association unit 13, it combines the blocks in the correct order based on the index data and reconstructs the original data (step S13). It then returns the reconstructed data to the data writing/reading software 30.
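The separation in steps S2 and S3 and the recombination in step S13 can be sketched as a round trip. This description states only that the index data holds position information for the separated parts within the pre-separation data, so the concrete format below is an assumption: `IndexEntry`, `separate`, `combine`, and the `mgmt_ranges` argument (the output of some preset classification criterion, given as sorted, non-overlapping offset and length pairs) are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class IndexEntry:
    kind: str    # "user" or "mgmt"
    offset: int  # position of the piece in the pre-separation data
    length: int  # length of the piece in bytes


def separate(data: bytes, mgmt_ranges: List[Tuple[int, int]]):
    """Split data into user data, management data, and index data."""
    user, mgmt, index = bytearray(), bytearray(), []
    pos = 0
    for off, length in mgmt_ranges:
        if off > pos:  # bytes before this management range are user data
            user += data[pos:off]
            index.append(IndexEntry("user", pos, off - pos))
        mgmt += data[off:off + length]
        index.append(IndexEntry("mgmt", off, length))
        pos = off + length
    if pos < len(data):  # trailing user data
        user += data[pos:]
        index.append(IndexEntry("user", pos, len(data) - pos))
    return bytes(user), bytes(mgmt), index


def combine(user: bytes, mgmt: bytes, index: List[IndexEntry]) -> bytes:
    """Rebuild the pre-separation data by following the index in offset order."""
    sources = {"user": user, "mgmt": mgmt}
    cursors = {"user": 0, "mgmt": 0}
    out = bytearray()
    for entry in sorted(index, key=lambda e: e.offset):
        start = cursors[entry.kind]
        out += sources[entry.kind][start:start + entry.length]
        cursors[entry.kind] = start + entry.length
    return bytes(out)
```

For example, with `data = b"HDR1userdataAAAAHDR2userdataBBBB"` and `mgmt_ranges = [(0, 4), (16, 4)]`, `separate` yields the user data `b"userdataAAAAuserdataBBBB"` and the management data `b"HDR1HDR2"`, and `combine` restores the original bytes, so the requesting software never sees the split.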

As described above, in the present invention, the user data portion 31, the management data portion 32, and the index data portion 33 generated after data separation, as shown in FIG. 6, are not written to the storage device as individual files. Consequently, there is no need to check that three files exist when reconstructing a file, and the data cannot become inaccessible because one of the files has been lost. In other words, although the data is separated to improve deduplication efficiency, the separated data is handled as a single file, which guarantees that all of the separated data is always present. In addition, by atomically controlling the order in which the three pieces of separated data are flushed, inconsistencies among them can be suppressed. As a result, recovery from failures can be simplified while highly efficient deduplication is achieved, and the fault tolerance of the storage system can be improved.

In general, a storage device has an upper limit on the number of files it can store. As described above, although three pieces of data exist after separation, they are managed as a single file, so the number of backup files that can be stored increases. Compared with handling the three pieces of separated data as three files, three times as many backup files can be handled.

In the above description, the data separation/combination device 10 is a device separate from the CAS device 20 equipped with the data deduplication mechanism. However, the data separation/combination device 10 may instead be implemented inside the CAS device 20 equipped with the data deduplication mechanism.

Also, in the above description, the data separation/combination device 10 separates the backup file data into three pieces of data. It may instead separate the data into two: user data, for which deduplication can be expected, and data for which it cannot (management information data and index data). In this case, the invention can also be realized in combination with storage equipped with a data deduplication mechanism that provides only a single extended attribute or a single alternative data stream.

Furthermore, the above description assumes a storage device in which a file's extended attributes and alternative data streams are deduplicated in the same way as user data. However, an effect equivalent to that described above can also be obtained in combination with a storage device that does not deduplicate extended attributes or alternative data streams.

<Appendix>
Part or all of the above embodiment can also be described as in the following appendices. An outline of the configuration of the storage system 110 according to the present invention is described below with reference to FIG. 10, followed by the configurations of the program and the data storage method according to the present invention. However, the present invention is not limited to the following configurations.

(Appendix 1)
A storage system 110 comprising:
a data writing unit 111 that divides data to be written into pieces of a predetermined size and stores them in a storage device 120, and that, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causes the data already stored in the storage device to be referenced as the other data; and
a data separation unit 112 that separates the data to be written into a user data portion and a management data portion classified according to a preset criterion,
wherein the data writing unit 111 generates a file storing the user data portion, stores the management data portion in the file as metadata associated with the user data portion stored in the file, and stores the file in the storage device 120 as data to be written.

(Appendix 2)
The storage system according to Appendix 1, wherein
the data separation unit generates index data including position information of the separated user data portion and management data portion within the data before separation, and
the data writing unit stores the index data in the file as the metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

(Appendix 3)
The storage system according to Appendix 1 or 2, wherein
the data writing unit uses an extended attribute and/or alternative data stream of the user data portion stored in the file as the metadata.

(Appendix 4)
The storage system according to any one of Appendices 1 to 3, wherein
the data writing unit stores divided data of the user data portion, formed by dividing the file, in a predetermined area in the storage device.

(Appendix 5)
The storage system according to Appendix 2, wherein
the data writing unit stores divided data of the user data portion, the extended attribute portion, and the alternative data stream portion, formed by dividing the file, in respective predetermined areas in the storage device corresponding to each portion.

(Appendix 6)
A program for causing an information processing apparatus, which includes a data writing unit that divides data to be written into pieces of a predetermined size and stores them in a storage device, and that, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causes the data already stored in the storage device to be referenced as the other data,
to realize a data separation unit that separates the data to be written into a user data portion and a management data portion classified according to a preset criterion,
and to realize that the data writing unit generates a file storing the user data portion, stores the management data portion in the file as metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

(Appendix 7)
The program according to Appendix 6, wherein
the data separation unit generates index data including position information of the separated user data portion and management data portion within the data before separation, and
the data writing unit stores the index data in the file as the metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

(Appendix 8)
A data storage method including a data writing step of dividing data to be written into pieces of a predetermined size and storing them in a storage device, and, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causing the data already stored in the storage device to be referenced as the other data,
the method comprising, before the data writing step, a data separation step of separating the data to be written into a user data portion and a management data portion classified according to a preset criterion,
wherein the data writing step generates a file storing the user data portion, stores the management data portion in the file as metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

(Appendix 9)
The data storage method according to Appendix 8, wherein
the data separation step generates index data including position information of the separated user data portion and management data portion within the data before separation, and
the data writing step stores the index data in the file as the metadata associated with the user data portion stored in the file, and stores the file in the storage device as data to be written.

DESCRIPTION OF SYMBOLS
1 Storage system
2 Accelerator node
3 Storage node
4 Backup system
5 Backup target device
10 Data separation/combination device
11 Separated data writing unit
12 Data separation unit
13 Separated data association unit
14 Separated data reading unit
20 CAS device
21 File
22 User data
23 Extended attribute, alternative data stream
25 Storage device
30 Pre-separation data
31 User data portion
32 Management data portion
33 Index data portion
110 Storage system
111 Data writing unit
112 Data separation unit
120 Storage device

Claims

A storage system comprising:
a data writing unit that divides data to be written into pieces of a predetermined size and stores them in a storage device, and that, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causes the data already stored in the storage device to be referenced as the other data; and
a data separation unit that separates the data to be written into a user data portion and a management data portion classified according to a preset criterion, and generates index data including position information of the separated user data portion and management data portion within the data before separation,
wherein the data writing unit generates a file storing the user data portion, stores the management data portion and the index data in the file as metadata associated with the user data portion stored in the file, thereby storing the user data portion, the management data portion, and the index data in the same file, and stores the file in the storage device as data to be written.

The storage system according to claim 1, wherein
the data writing unit uses an extended attribute and/or alternative data stream of the user data portion stored in the file as the metadata.

The storage system according to claim 1 or 2, wherein
the data writing unit stores divided data of the user data portion, formed by dividing the file, in a predetermined area in the storage device.

The storage system according to claim 2, wherein
the data writing unit stores divided data of the user data portion, the extended attribute portion, and the alternative data stream portion, formed by dividing the file, in respective predetermined areas in the storage device corresponding to each portion.

A program for causing an information processing apparatus, which includes a data writing unit that divides data to be written into pieces of a predetermined size and stores them in a storage device, and that, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causes the data already stored in the storage device to be referenced as the other data,
to realize a data separation unit that separates the data to be written into a user data portion and a management data portion classified according to a preset criterion, and generates index data including position information of the separated user data portion and management data portion within the data before separation,
and to realize that the data writing unit generates a file storing the user data portion, stores the management data portion and the index data in the file as metadata associated with the user data portion stored in the file, thereby storing the user data portion, the management data portion, and the index data in the same file, and stores the file in the storage device as data to be written.

A data storage method including a data writing step of dividing data to be written into pieces of a predetermined size and storing them in a storage device, and, when other data having the same content as data already stored in the storage device is to be stored in the storage device, causing the data already stored in the storage device to be referenced as the other data,
the method comprising, before the data writing step, a data separation step of separating the data to be written into a user data portion and a management data portion classified according to a preset criterion, and generating index data including position information of the separated user data portion and management data portion within the data before separation,
wherein the data writing step generates a file storing the user data portion, stores the management data portion and the index data in the file as metadata associated with the user data portion stored in the file, thereby storing the user data portion, the management data portion, and the index data in the same file, and stores the file in the storage device as data to be written.