JP2017123070A

JP2017123070A - Control device, storage device, data management program, and data management method

Info

Publication number: JP2017123070A
Application number: JP2016001967A
Authority: JP
Inventors: 年弘小沢; Toshihiro Ozawa
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2016-01-07
Filing date: 2016-01-07
Publication date: 2017-07-13
Anticipated expiration: 2036-01-07
Also published as: JP6572775B2

Abstract

PROBLEM TO BE SOLVED: To efficiently store information on redundancy removal in a memory.SOLUTION: A control device 1a comprises: a first determination part 13 for determining whether or not first data are redundantly written in both first and second virtual volumes; a second determination part 16 for, when it is determined that the first data are not redundantly written by the first determination part 13, determining whether or not the first data are written in the first or second virtual volume by using individual data information indicating the data written in the first or the second virtual volume; and an individual data registration part 17 for, when it is determined that the first data are not written in any of the first and second virtual volumes by the second determination part 16, registering information on the first data in the individual data information.SELECTED DRAWING: Figure 1

Description

本発明は、制御装置、ストレージ装置、データ管理プログラム及びデータ管理方法に関する。 The present invention relates to a control device, a storage device, a data management program, and a data management method.

ストレージ装置の機能として、同一の内容のデータ（「ブロック」と称されてもよい。）が複数含まれている場合には、複数の同一の内容のデータを１つにまとめて記憶する重複除去技術が知られている。 When a plurality of pieces of data having the same contents (may be referred to as “blocks”) are included as a function of the storage device, a plurality of pieces of data having the same contents are collectively stored as one. Technology is known.

重複除去技術においては、例示的に、どのようなブロックが書き込まれたかを記憶するため、ハッシュ値管理テーブルが使用される。ハッシュ値管理テーブルは、書き込まれたブロックの値から算出されるハッシュ値と、書き込まれたブロックを実際に書き込んだ物理的な場所を示すインデックス（「書き込み物理アドレス」と称されてもよい。）とを含んでよい。 In the deduplication technique, for example, a hash value management table is used to store what block has been written. The hash value management table includes a hash value calculated from the value of the written block and an index indicating a physical location where the written block is actually written (may be referred to as a “write physical address”). And may include.

ハッシュ値管理テーブルは、ブロックの書き込み時に毎回参照される。そこで、ハッシュ値管理テーブルに関する情報は、高速に動作可能なメモリに格納されてよい。 The hash value management table is referenced every time a block is written. Therefore, information regarding the hash value management table may be stored in a memory that can operate at high speed.

しかしながら、新たなブロックが書き込まれるとハッシュ値管理テーブルのエントリ数が増加していくため、メモリの容量が不足するおそれがある。 However, when a new block is written, the number of entries in the hash value management table increases, which may cause a memory capacity shortage.

そこで、ハッシュ値管理テーブルに関する情報の大きさをメモリに格納可能な大きさに抑えつつ、重複しているブロックをできるだけ検出することが考えられる。別言すれば、全てのハッシュ値を記憶するのではなく、例えば、重複しないブロックのハッシュ値と物理アドレスとの関係をメモリに格納しないことが考えられる。 Therefore, it is conceivable to detect as many overlapping blocks as possible while limiting the size of the information related to the hash value management table to a size that can be stored in the memory. In other words, it is possible not to store all hash values, but to store, for example, the relationship between hash values and physical addresses of non-overlapping blocks in the memory.

ハッシュ値管理テーブルの大きさを抑える技術のひとつとして、ハッシュ値管理テーブルをＬＲＵ（Least Recently Used）を用いて管理する技術が知られている。ＬＲＵを用いた技術では、ブロックの書き込み時にハッシュ値管理テーブルを参照した際、ヒットしたエントリの優先度を最高に設定してよい。そして、新たなハッシュ値を有するブロックが書き込まれる際には、優先度が最低のエントリの内容を、新たなハッシュ値の内容に書き換えてよい。 As one technique for reducing the size of the hash value management table, a technique for managing the hash value management table using LRU (Least Recently Used) is known. In the technique using LRU, when the hash value management table is referred to when writing a block, the priority of the entry that has been hit may be set to the highest. Then, when a block having a new hash value is written, the content of the entry with the lowest priority may be rewritten with the content of the new hash value.

特開２０１０−７９３９１号公報JP 2010-79391 A 特開２０１０−１８２３０２号公報JP 2010-182302 A

しかしながら、ＬＲＵを用いてハッシュ値管理テーブルを用いて管理する技術においては、一時的に新たなハッシュ値を有するブロックが大量に書き込まれた場合に、長期間に亘って使用されるブロックのエントリが消去されてしまうおそれがある。また、ハッシュ値管理テーブルのエントリが特定のアプリケーションで特異的に使用されるブロックによって占有されるおそれがある。例えば、動画を編集するアプリケーションでは、編集された動画のブロックが大量に書き込まれることによって、ハッシュ値管理テーブルが動画のブロックで占有されるおそれがある。 However, in the technique of managing using the hash value management table using LRU, when a large number of blocks having new hash values are temporarily written, entries of blocks that are used for a long period of time are stored. There is a risk of being erased. Further, there is a possibility that the entry of the hash value management table is occupied by a block that is specifically used by a specific application. For example, in an application for editing a moving image, there is a possibility that the hash value management table is occupied by a block of the moving image because a large number of edited moving image blocks are written.

１つの側面では、本発明は、重複除去に関する情報をメモリに効率的に格納することを目的とする。 In one aspect, the present invention aims to efficiently store information related to deduplication in a memory.

このため、この制御装置は、第１の仮想ボリュームと第２の仮想ボリュームとを展開するストレージ装置に備えられる制御装置であって、第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているかを判定する第１判定部と、前記第１判定部によって前記第１のデータが重複して書き込まれていないと判定された場合に、前記第１又は第２の仮想ボリュームに書き込まれているデータを示す個別データ情報を用いて、前記第１のデータが前記第１又は第２の仮想ボリュームに書き込まれているかを判定する第２判定部と、前記第２判定部によって前記第１のデータが前記第１及び第２の仮想ボリュームのいずれにも書き込まれていないと判定された場合に、前記個別データ情報に前記第１のデータについての情報を登録する個別データ登録部と、を備える。 Therefore, the control device is a control device provided in a storage device that expands the first virtual volume and the second virtual volume, and the first data spans the first and second virtual volumes. A first determination unit that determines whether the first data has been written in duplicate, and the first determination unit when the first determination unit determines that the first data has not been written in duplicate A second determination unit configured to determine whether the first data is written to the first or second virtual volume using individual data information indicating data written to the virtual volume; and the second determination. Information about the first data in the individual data information when it is determined that the first data has not been written in any of the first and second virtual volumes. Comprising the individual data registration unit for registering, the.

開示の制御装置によれば、重複除去に関する情報をメモリに効率的に格納することができる。 According to the disclosed control device, it is possible to efficiently store information related to deduplication in the memory.

実施形態の一例としてのストレージシステムの構成を模式的に示す図である。1 is a diagram schematically illustrating a configuration of a storage system as an example of an embodiment. 重複除去技術を適用する前のオリジナルファイルを模式的に例示する図である。It is a figure which illustrates typically the original file before applying a duplication removal technique. 重複除去技術を適用した後のファイルを模式的に例示する図である。It is a figure which illustrates typically the file after applying a duplication removal technique. ハッシュ値管理テーブルの第１の例を示す図である。It is a figure which shows the 1st example of a hash value management table. 仮想アドレス空間テーブルの第１の例を示す図である。It is a figure which shows the 1st example of a virtual address space table. 管理テーブルの第１の例を示す図である。It is a figure which shows the 1st example of a management table. ハッシュ値管理テーブルの第２の例を示す図である。It is a figure which shows the 2nd example of a hash value management table. 仮想アドレス空間テーブルの第２の例を示す図である。It is a figure which shows the 2nd example of a virtual address space table. 管理テーブルの第２の例を示す図である。It is a figure which shows the 2nd example of a management table. 実施形態の一例としてのストレージシステムの機能を模式的に示す図である。It is a figure which shows typically the function of the storage system as an example of embodiment. 実施形態の一例としての第１段の重複除去部の機能を模式的に示す図である。It is a figure which shows typically the function of the duplication removal part of the 1st step as an example of embodiment. 実施形態の一例としての第２段の重複除去部の機能を模式的に示す図である。It is a figure which shows typically the function of the duplication removal part of the 2nd step as an example of embodiment. 実施形態の一例としての第１段の重複除去動作を説明するフローチャートである。It is a flowchart explaining the duplication removal operation | movement of the 1st step as an example of embodiment. 実施形態の一例としての第２段の重複除去動作を説明するフローチャートである。It is a flowchart explaining the duplication removal operation | movement of the 2nd step as an example of embodiment. 実施形態の一例としての第１ハッシュ値管理テーブルの作成動作の第１の例を説明するフローチャートである。It is a flowchart explaining the 1st example of the production | generation operation | movement of the 1st hash value management table as an example of embodiment. 実施形態の一例としての第１ハッシュ値管理テーブルの作成動作の第２の例を説明するフローチャートである。It is a flowchart explaining the 2nd example of the production | generation operation | movement of the 1st hash value management table as an example of embodiment. 関連技術としてのハッシュ値管理テーブルの使用例を説明する図である。It is a figure explaining the usage example of the hash value management table as a related technique. 実施形態の一例としてのハッシュ値管理テーブルの使用例を説明する図である。It is a figure explaining the usage example of the hash value management table as an example of embodiment.

以下、図面を参照して一実施の形態を説明する。ただし、以下に示す実施形態はあくまでも例示に過ぎず、実施形態で明示しない種々の変形例や技術の適用を排除する意図はない。すなわち、本実施形態を、その趣旨を逸脱しない範囲で種々変形して実施することができる。 Hereinafter, an embodiment will be described with reference to the drawings. However, the embodiment described below is merely an example, and there is no intention to exclude application of various modifications and techniques not explicitly described in the embodiment. That is, the present embodiment can be implemented with various modifications without departing from the spirit of the present embodiment.

また、各図は、図中に示す構成要素のみを備えるという趣旨ではなく、他の機能等を含むことができる。 Each figure is not intended to include only the components shown in the figure, and may include other functions.

以下、図中において、同一の各符号は同様の部分を示しているので、その説明は省略する。 Hereinafter, in the drawings, the same reference numerals indicate the same parts, and the description thereof is omitted.

〔Ａ〕実施形態の一例
〔Ａ−１〕システム構成
図１は、実施形態の一例としてのストレージシステム１００の構成を模式的に示す図である。 [A] Example of Embodiment [A-1] System Configuration FIG. 1 is a diagram schematically illustrating a configuration of a storage system 100 as an example of an embodiment.

ストレージシステム１００は、図１に例示するように、ストレージ装置１及び複数（図示する例では３つ）のサーバ装置４を備える。 As illustrated in FIG. 1, the storage system 100 includes a storage device 1 and a plurality (three in the illustrated example) of server devices 4.

サーバ装置４は、サーバ機能を備えたコンピュータ（「情報処理装置」と称されてもよい。）である。 The server apparatus 4 is a computer having a server function (may be referred to as “information processing apparatus”).

ストレージ装置１は、後述する記憶装置３０を搭載し、サーバ装置４に対して記憶領域を提供する装置であり、例えばＲＡＩＤ（Redundant Arrays of Inexpensive Disks）を用いて記憶装置３０にデータを分散又は冗長化した状態で保存する。 The storage device 1 is a device that includes a storage device 30 to be described later and provides a storage area to the server device 4. For example, data is distributed or redundantly distributed to the storage device 30 using RAID (Redundant Arrays of Inexpensive Disks). Save it in a state.

ストレージ装置１は、重複除去技術を利用して、ファイルに含まれる同じ内容のデータの重複を除去した状態で、ファイルを記憶装置３０に格納してよい。また、ストレージ装置１は、図１０等を用いて後述するように、複数の仮想ボリューム３０ａを展開して、展開した仮想ボリューム３０ａにファイルを格納してよい。 The storage device 1 may store the file in the storage device 30 in a state in which duplication of data having the same content included in the file is removed by using a deduplication technique. Further, as will be described later with reference to FIG. 10 and the like, the storage apparatus 1 may expand a plurality of virtual volumes 30a and store files in the expanded virtual volumes 30a.

本明細書において、「ファイル」とは、複数の「データ」の集合を指す。また、「データ」は、「ブロック」と称されてもよい。 In this specification, “file” refers to a set of a plurality of “data”. “Data” may also be referred to as “block”.

ストレージ装置１は、図１に例示するように、ＣＰＵ（Central Processing Unit）１０，メモリ２０及び記憶装置３０を備える。 As illustrated in FIG. 1, the storage device 1 includes a CPU (Central Processing Unit) 10, a memory 20, and a storage device 30.

ＣＰＵ１０及びメモリ２０は、例示的に、ＣＭ（Controller Module）１ａに備えられる。「ＣＭ」は、「制御装置」と称されてもよい。 For example, the CPU 10 and the memory 20 are provided in a CM (Controller Module) 1a. “CM” may be referred to as a “control device”.

ＣＭ１ａは、種々の制御を行なう制御装置であり、サーバ装置４からのストレージアクセス要求（「アクセス制御信号」や「ホストＩ／Ｏ」と称されてもよい。）に従って、各種制御を行なう。 The CM 1a is a control device that performs various controls, and performs various controls in accordance with storage access requests from the server device 4 (may also be referred to as “access control signals” or “host I / O”).

なお、図１に示す例においストレージ装置１は１つのＣＭ１ａを備えることとしたが、これに限定されるものではない。ストレージ装置１に備えられるＣＭ１ａの数は、種々変更することができる。 In the example shown in FIG. 1, the storage apparatus 1 includes one CM 1a. However, the present invention is not limited to this. The number of CMs 1a provided in the storage device 1 can be variously changed.

記憶装置３０は、例示的に、データを読み書き可能に記憶する装置であり、例えば、ＨＤＤ（Hard Disk Drive）やＳＳＤ（Solid State Drive）が用いられてよい。記憶装置３０には、図１１及び図１２を用いて後述する第１ハッシュ値管理テーブル，第２ハッシュ値管理テーブル，仮想アドレス空間テーブル及び管理テーブルに関する情報が格納されてよい。 The storage device 30 is, for example, a device that stores data in a readable / writable manner, and for example, an HDD (Hard Disk Drive) or an SSD (Solid State Drive) may be used. The storage device 30 may store information related to a first hash value management table, a second hash value management table, a virtual address space table, and a management table, which will be described later with reference to FIGS. 11 and 12.

なお、図１に示す例においてストレージ装置１には便宜上１つの記憶装置３０を示しているが、これに限定されるものではない。ストレージ装置１に備えられる記憶装置３０の数は、種々変更することができる。 In the example shown in FIG. 1, the storage apparatus 1 shows one storage device 30 for convenience, but the present invention is not limited to this. The number of storage devices 30 provided in the storage device 1 can be variously changed.

実施形態の一例としてのストレージ装置１は、例示的に、大容量の記憶装置３０（「ボリューム」と称されてもよい。）を用途毎等に分割して管理して使用する。ボリュームが分割される単位をＬＵ（Logical Unit）という。 The storage device 1 as an example of the embodiment illustratively uses a large-capacity storage device 30 (may be referred to as a “volume”) by dividing it for each use. The unit into which the volume is divided is called LU (Logical Unit).

メモリ２０は、例示的に、ＲＯＭ（Read Only Memory）及びＲＡＭ（Random Access Memory）を含む記憶装置である。メモリ２０のＲＯＭには、ＢＩＯＳ（Basic Input/Output System）等のプログラムが書き込まれてよい。メモリ２０のソフトウェアプログラムは、ＣＰＵ１０に適宜に読み込まれて実行されてよい。また、メモリ２０のＲＡＭは、一次記録メモリあるいはワーキングメモリとして利用されてよい。 The memory 20 is, for example, a storage device including a ROM (Read Only Memory) and a RAM (Random Access Memory). A program such as BIOS (Basic Input / Output System) may be written in the ROM of the memory 20. The software program in the memory 20 may be appropriately read and executed by the CPU 10. The RAM of the memory 20 may be used as a primary recording memory or a working memory.

ＣＰＵ１０は、例示的に、種々の制御や演算を行なう処理装置であり、メモリ２０のそれぞれに格納されたＯＳ（Operating System）やプログラムを実行することにより、種々の機能を実現する。すなわち、ＣＰＵ１０は、共通データ登録部１１，第１データ入力部１２，第１判定部１３，通信部１４，第２データ入力部１５，第２判定部１６，個別データ登録部１７，書き込み部１８として機能してよい。 The CPU 10 is, for example, a processing device that performs various controls and operations, and implements various functions by executing an OS (Operating System) and a program stored in each of the memories 20. That is, the CPU 10 includes the common data registration unit 11, the first data input unit 12, the first determination unit 13, the communication unit 14, the second data input unit 15, the second determination unit 16, the individual data registration unit 17, and the writing unit 18. May function as.

なお、これらの共通データ登録部１１，第１データ入力部１２，第１判定部１３，通信部１４，第２データ入力部１５，第２判定部１６，個別データ登録部１７，書き込み部１８としての機能を実現するためのプログラムは、例えばフレキシブルディスク、ＣＤ（ＣＤ−ＲＯＭ、ＣＤ−Ｒ、ＣＤ−ＲＷ等）、ＤＶＤ（ＤＶＤ−ＲＯＭ、ＤＶＤ−ＲＡＭ、ＤＶＤ−Ｒ、ＤＶＤ＋Ｒ、ＤＶＤ−ＲＷ、ＤＶＤ＋ＲＷ、ＨＤＤＶＤ等）、ブルーレイディスク、磁気ディスク、光ディスク、光磁気ディスク等の、コンピュータ読取可能な記録媒体に記録された形態で提供されてよい。そして、コンピュータ（本実施形態ではＣＰＵ１０）は上述した記録媒体から図示しない読取装置を介してプログラムを読み取って内部記録装置または外部記録装置に転送し格納して用いてよい。また、プログラムを、例えば磁気ディスク，光ディスク，光磁気ディスク等の記憶装置（記録媒体）に記録しておき、記憶装置から通信経路を介してコンピュータに提供してもよい。 The common data registration unit 11, the first data input unit 12, the first determination unit 13, the communication unit 14, the second data input unit 15, the second determination unit 16, the individual data registration unit 17, and the writing unit 18 are used. Examples of the program for realizing the functions are flexible disk, CD (CD-ROM, CD-R, CD-RW, etc.), DVD (DVD-ROM, DVD-RAM, DVD-R, DVD + R, DVD-RW, DVD + RW, HD DVD, etc.), Blu-ray disc, magnetic disc, optical disc, magneto-optical disc, etc. may be provided in a form recorded on a computer-readable recording medium. The computer (CPU 10 in this embodiment) may read the program from the above-described recording medium via a reading device (not shown), transfer the program to an internal recording device or an external recording device, and use it. Further, the program may be recorded in a storage device (recording medium) such as a magnetic disk, an optical disk, or a magneto-optical disk, and provided to the computer via a communication path from the storage device.

共通データ登録部１１，第１データ入力部１２，第１判定部１３，通信部１４，第２データ入力部１５，第２判定部１６，個別データ登録部１７，書き込み部１８としての機能を実現する際には、内部記憶装置（本実施形態ではメモリ２０）に格納されたプログラムがコンピュータ（本実施形態ではＣＰＵ１０）によって実行されてよい。また、記録媒体に記録されたプログラムをコンピュータが読み取って実行してもよい。 Functions as the common data registration unit 11, the first data input unit 12, the first determination unit 13, the communication unit 14, the second data input unit 15, the second determination unit 16, the individual data registration unit 17, and the writing unit 18 are realized. In doing so, a program stored in the internal storage device (memory 20 in this embodiment) may be executed by a computer (CPU 10 in this embodiment). Further, the computer may read and execute the program recorded on the recording medium.

共通データ登録部１１，第１データ入力部１２，第１判定部１３，通信部１４，第２データ入力部１５，第２判定部１６，個別データ登録部１７，書き込み部１８の機能の詳細について説明する前に、図２〜図９を参照して、重複除去技術について説明する。 Details of functions of the common data registration unit 11, the first data input unit 12, the first determination unit 13, the communication unit 14, the second data input unit 15, the second determination unit 16, the individual data registration unit 17, and the writing unit 18 Prior to the description, the deduplication technique will be described with reference to FIGS.

図２〜図９を参照して説明される重複除去技術は、実施形態の一例としてのストレージシステム１００によって実現されてもよい。 The deduplication technique described with reference to FIGS. 2 to 9 may be realized by the storage system 100 as an example of the embodiment.

図２は重複除去技術を適用する前のオリジナルファイルを模式的に例示する図であり、図３は重複除去技術を適用した後のファイルを模式的に例示する図である。 FIG. 2 is a diagram schematically illustrating an original file before applying the deduplication technique, and FIG. 3 is a diagram schematically illustrating a file after the deduplication technique is applied.

重複除去技術は、複数のファイルにおいて、同一の内容のブロックが複数含まれている場合に、同一の内容のブロックを１つにまとめる技術である。 The deduplication technique is a technique for combining blocks having the same contents into one when a plurality of files having the same contents are included in a plurality of files.

図２に示すように、オリジナルファイルとしてのファイルＡ及びファイルＢが記憶装置に書き込まれる例を考える。ファイルＡには青，赤，紫，黄及び緑の画像データのブロックが含まれ、ファイルＢには緑，茶，赤，紫及び黄の画像データのブロックが含まれる。また、各ブロックのデータサイズは４ＫＢ（キロバイト）であり、ファイルＡ及びファイルＢの合計データサイズは４０ＫＢである。 As shown in FIG. 2, consider an example in which files A and B as original files are written in a storage device. File A contains blue, red, purple, yellow and green image data blocks, and file B contains green, brown, red, purple and yellow image data blocks. The data size of each block is 4 KB (kilobytes), and the total data size of file A and file B is 40 KB.

ファイルＡ及びファイルＢに重複除去を適用する場合には、各ブロックにインデックスが付与される。 When deduplication is applied to file A and file B, an index is assigned to each block.

図３に示す例において、ファイルＡの青，赤，紫，黄及び緑の画像データのブロックには、インデックス０１，０２，０３，０４及び０５がそれぞれ付与される。また、ファイルＢの緑，茶，赤，紫及び黄の画像データのブロックには、インデックス０５，０６，０２，０３及び０４がそれぞれ付与される。別言すれば、ファイルＡ及びファイルＢにおいて、同一の色の画像データのブロックには、同一のインデックスが付与される。 In the example shown in FIG. 3, indexes 01, 02, 03, 04, and 05 are assigned to the blocks of blue, red, purple, yellow, and green image data in file A, respectively. In addition, indexes 05, 06, 02, 03, and 04 are assigned to the green, brown, red, purple, and yellow image data blocks of file B, respectively. In other words, in the file A and the file B, the same index is assigned to the blocks of the same color image data.

ストレージ装置１は、重複除去したファイルを管理するために、図３に示すような管理テーブルに関する情報を記憶する。管理テーブルには、インデックスとポインタとが対応付けられている。 The storage device 1 stores information on a management table as shown in FIG. 3 in order to manage the files from which duplicates have been removed. In the management table, an index and a pointer are associated with each other.

ストレージ装置１は、管理テーブルにおいて、インデックスを参照することにより、ポインタを介してファイルに含まれるブロックを参照することができる。図３に示す例においては、インデックス０１，０２，０３，０４，０５及び０６に対して、青，赤，紫，黄，緑及び茶の各画像データのブロックを示すポインタがそれぞれ対応づけられている。 The storage apparatus 1 can refer to the block included in the file via the pointer by referring to the index in the management table. In the example shown in FIG. 3, pointers indicating blocks of image data of blue, red, purple, yellow, green, and brown are associated with indexes 01, 02, 03, 04, 05, and 06, respectively. Yes.

図４はハッシュ値管理テーブルの第１の例を示す図であり、図５は仮想アドレス空間テーブルの第１の例を示す図であり、図６は管理テーブルの第１の例を示す図である。 4 is a diagram showing a first example of a hash value management table, FIG. 5 is a diagram showing a first example of a virtual address space table, and FIG. 6 is a diagram showing a first example of a management table. is there.

図１に示したファイルＡを記憶装置３０の仮想アドレス０ｘ００００から書き込んだ場合のハッシュ値管理テーブルと仮想アドレス空間テーブルと管理テーブルとは、図４〜図６にそれぞれ示すようになる。 The hash value management table, virtual address space table, and management table when the file A shown in FIG. 1 is written from the virtual address 0x0000 of the storage device 30 are as shown in FIGS.

ハッシュ値管理テーブルには、ハッシュ値と書き込み物理アドレス（「ページ」と称されてもよい。）とが関連付けられている。ハッシュ値は、ブロックの内容に基づいて作成され、ブロックを一意に特定するための情報である。物理アドレスは、ブロックが記憶装置３０において実際に書き込まれている位置を示す。 In the hash value management table, a hash value and a write physical address (may be referred to as “page”) are associated. The hash value is information that is created based on the contents of the block and uniquely identifies the block. The physical address indicates the position where the block is actually written in the storage device 30.

図４に示す例において、ハッシュ値管理テーブルには、ファイルＡの青，赤，紫，黄及び緑の画像データのブロックに対して、ハッシュ値ｂｌｕｅ，ｒｅｄ，ｐｕｒｐｌｅ，ｙｅｌｌｏｗ及びｇｒｅｅｎがそれぞれ関連付けられている。また、図４に示す例において、ハッシュ値管理テーブルには、ファイルＡの青，赤，紫，黄及び緑の画像データのブロックに対して、書き込み物理アドレス０１，０２，０３，０４及び０５がそれぞれ関連付けられている。 In the example shown in FIG. 4, in the hash value management table, hash values blue, red, purple, yellow, and green are associated with blocks of blue, red, purple, yellow, and green image data of file A, respectively. ing. In the example shown in FIG. 4, the hash value management table includes write physical addresses 01, 02, 03, 04, and 05 for blocks of blue, red, purple, yellow, and green image data of file A. Each is associated.

仮想アドレス空間テーブルは、仮想アドレスと物理アドレスとが関連付けられている。仮想アドレスは、ストレージ装置１が展開する仮想ボリューム３０ａ（図１０等を用いて後述）においてブロックが書き込まれる位置を示す。 In the virtual address space table, a virtual address and a physical address are associated with each other. The virtual address indicates a position where a block is written in a virtual volume 30a (described later with reference to FIG. 10) developed by the storage device 1.

図５に示す例において、仮想アドレス０ｘ００００，０ｘ１０００，０ｘ２０００，０ｘ３０００及び０ｘ４０００に対して、書き込み物理アドレス０１，０２，０３，０４及び０５がそれぞれ関連付けられている。 In the example shown in FIG. 5, the write physical addresses 01, 02, 03, 04, and 05 are associated with the virtual addresses 0x0000, 0x1000, 0x2000, 0x3000, and 0x4000, respectively.

図７はハッシュ値管理テーブルの第２の例を示す図であり、図８は仮想アドレス空間テーブルの第２の例を示す図であり、図９は管理テーブルの第２の例を示す図である。 FIG. 7 is a diagram showing a second example of the hash value management table, FIG. 8 is a diagram showing a second example of the virtual address space table, and FIG. 9 is a diagram showing a second example of the management table. is there.

図４〜図６に示した状態から、更に、図２に示したファイルＢを記憶装置３０の仮想アドレス０ｘ６０００から書き込んだ後のハッシュ値管理テーブルと仮想アドレス空間テーブル管理テーブルとは、図７〜図９にそれぞれ示すようになる。 The hash value management table and the virtual address space table management table after the file B shown in FIG. 2 is written from the virtual address 0x6000 of the storage device 30 from the state shown in FIGS. As shown in FIG.

そこで、ハッシュ値管理テーブルに関する情報の大きさをメモリに格納可能な大きさに抑えつつ、重複しているブロックをできるだけ検出することが考えられる。別言すれば、全てのハッシュ値を記憶するのではなく、例えば、重複しないブロックのハッシュ値と書き込み物理アドレスとの関係をメモリに格納しないことが考えられる。 Therefore, it is conceivable to detect as many overlapping blocks as possible while limiting the size of the information related to the hash value management table to a size that can be stored in the memory. In other words, instead of storing all hash values, for example, it is conceivable that the relationship between the hash value of a non-overlapping block and the write physical address is not stored in the memory.

ハッシュ値管理テーブルの大きさを抑える技術のひとつとして、ハッシュ値管理テーブルをＬＲＵを用いて管理する技術が知られている。ＬＲＵを用いた技術では、ブロックの書き込み時にハッシュ値管理テーブルを参照した際、ヒットしたエントリの優先度を最高に設定してよい。そして、新たなハッシュ値を有するブロックが書き込まれる際には、優先度が最低のエントリの内容を、新たなハッシュ値の内容に書き換えてよい。 As one technique for reducing the size of a hash value management table, a technique for managing a hash value management table using an LRU is known. In the technique using LRU, when the hash value management table is referred to when writing a block, the priority of the entry that has been hit may be set to the highest. Then, when a block having a new hash value is written, the content of the entry with the lowest priority may be rewritten with the content of the new hash value.

しかしながら、ＬＲＵを用いてハッシュ値管理テーブルを用いて管理する技術においては、一時的に新たなハッシュ値を有するブロックが大量に書き込まれた場合に、長期間に亘って使用されるブロックのエントリが消去されてしまうおそれがある。また、ハッシュ値管理テーブルのエントリが特定のアプリケーションで特異的に使用されるブロックによって占有されるおそれがある。 However, in the technique of managing using the hash value management table using LRU, when a large number of blocks having new hash values are temporarily written, entries of blocks that are used for a long period of time are stored. There is a risk of being erased. Further, there is a possibility that the entry of the hash value management table is occupied by a block that is specifically used by a specific application.

図１０は、実施形態の一例としてのストレージシステム１００の機能を模式的に示す図である。 FIG. 10 is a diagram schematically illustrating functions of the storage system 100 as an example of the embodiment.

ストレージ装置１は、図１０に例示するように、複数（図示する例ではｎ＋１個）の仮想ボリューム３０ａ（ＬＵＮ０〜ｎ）を展開してよい。なお、「ＬＵＮ」は、Logical Unit Numberの略称である。複数の仮想ボリューム３０ａは、記憶装置３０の記憶領域を使用することによって展開されてよい。 As illustrated in FIG. 10, the storage apparatus 1 may expand a plurality (n + 1 in the illustrated example) of virtual volumes 30a (LUN0 to LUNn). “LUN” is an abbreviation for Logical Unit Number. The plurality of virtual volumes 30 a may be expanded by using the storage area of the storage device 30.

ボリュームに書き込まれるデータは、ＬＵ毎に特徴がある場合がある。例えば、文字ドキュメントを扱うアプリケーションＡ用にＬＵＮ０を割り当て、画像を扱うアプリケーションＢ用にＬＵＮ１を割り当てた場合を想定する。この場合には、ＬＵＮ０には文字ドキュメントデータが多く記録され、ＬＵＮ１には画像データが多く記録されることになり、文字ドキュメントデータと画像データとの間では重複が起きにくい。 Data written to the volume may have a characteristic for each LU. For example, it is assumed that LUN0 is assigned for application A that handles character documents and LUN1 is assigned for application B that handles images. In this case, a large amount of character document data is recorded in LUN0, and a large amount of image data is recorded in LUN1, and duplication is unlikely to occur between character document data and image data.

しかしながら、アプリケーションによらずに記録されるデータも存在する。例えば、ファイルシステムやＯＳが書き込むデータは、複数のアプリケーション間で共通に使用され、各アプリケーションに対応する複数の仮想ボリューム３０ａに同じように格納される場合がある。 However, some data is recorded regardless of the application. For example, data written by a file system or OS may be used in common among a plurality of applications and stored in the same manner in a plurality of virtual volumes 30a corresponding to each application.

つまり、ストレージ装置１が記憶するデータには、ＬＵＮ間に亘って重複するデータと、ＬＵＮに固有のデータとが存在する。実施形態の一例としてのストレージシステム１００は、ＬＵＮ間に亘って重複するデータと、ＬＵＮに固有のデータとの重複管理のための機能を分離することで、ハッシュ値管理テーブルの大きさを抑えつつ、高い重複除去率を達成する。 That is, the data stored in the storage device 1 includes data that overlaps between LUNs and data that is unique to the LUNs. The storage system 100 as an example of the embodiment reduces the size of the hash value management table by separating functions for duplication management between data overlapping between LUNs and data unique to the LUNs. Achieve a high deduplication rate.

そこで、実施形態の一例としてのストレージ装置１は、図１０に例示するように、第１段の重複除去部１０ａ及び複数（図示する例ではｎ＋１個）の第２段の重複除去部１０ｂとして機能する。複数の第２段の重複除去部１０ｂは、複数の仮想ボリューム３０ａのそれぞれに対応して備えられてよい。 Therefore, the storage apparatus 1 as an example of the embodiment functions as a first-stage deduplication unit 10a and a plurality of (n + 1 in the illustrated example) second-stage deduplication unit 10b as illustrated in FIG. To do. A plurality of second-stage deduplication units 10b may be provided corresponding to each of the plurality of virtual volumes 30a.

第１段の重複除去部１０ａは、例示的に、ＬＵＮ間に亘った重複除去を行なう。第１段の重複除去部１０ａは、図１に示した共通データ登録部１１，第１データ入力部１２，第１判定部１３，通信部１４として機能してよい。 The first-stage deduplication unit 10a illustratively performs deduplication across LUNs. The first-stage deduplication unit 10a may function as the common data registration unit 11, the first data input unit 12, the first determination unit 13, and the communication unit 14 illustrated in FIG.

第２段の重複除去部１０ｂは、例示的に、ＬＵＮ毎に重複除去を行なう。第２段の重複除去部１０ｂは、図１に示した第２データ入力部１５，第２判定部１６，個別データ登録部１７及び書き込み部１８として機能してよい。 For example, the second-stage deduplication unit 10b performs deduplication for each LUN. The second-stage duplicate elimination unit 10b may function as the second data input unit 15, the second determination unit 16, the individual data registration unit 17, and the writing unit 18 illustrated in FIG.

図１１は、実施形態の一例としての第１段の重複除去部１０ａの機能を模式的に示す図である。 FIG. 11 is a diagram schematically illustrating the function of the first-stage deduplication unit 10a as an example of the embodiment.

図１１に示す例においては、第１段の重複除去部１０ａが備える機能のうち、一部の機能が示されている。 In the example illustrated in FIG. 11, some of the functions included in the first-stage duplicate removal unit 10 a are illustrated.

共通データ登録部１１（図１１には不図示）は、例示的に、第１ハッシュ値管理テーブルを作成する。共通データ登録部１１は、ストレージ装置１の運用前に、予め第１ハッシュ値管理テーブルを作成してよい。また、共通データ登録部１１は、作成した第１ハッシュ値管理テーブルに関する情報を例えば記憶装置３０の所定の領域に格納してよい。 The common data registration unit 11 (not shown in FIG. 11) illustratively creates a first hash value management table. The common data registration unit 11 may create a first hash value management table in advance before the storage apparatus 1 is operated. Further, the common data registration unit 11 may store information on the created first hash value management table in a predetermined area of the storage device 30, for example.

第１ハッシュ値管理テーブルは、例示的に、複数の仮想ボリューム３０ａに亘って重複して書き込まれるデータに関する情報を示す。別言すれば、第１ハッシュ値管理テーブルは、ストレージ装置１の全体の動作を制御するＯＳによって使用されるデータのハッシュ値を示してよい。更に別言すれば、第１ハッシュ値管理テーブルは、ストレージ装置１によって実行されるアプリケーションによらずに重複して使用されるデータのハッシュ値を示してよい。 The first hash value management table exemplarily shows information related to data that is written redundantly across a plurality of virtual volumes 30a. In other words, the first hash value management table may indicate hash values of data used by the OS that controls the overall operation of the storage apparatus 1. In other words, the first hash value management table may indicate the hash value of data that is used redundantly regardless of the application executed by the storage device 1.

第１ハッシュ値管理テーブルには、ハッシュ値と書き込み物理アドレスとが関連付けられてよい。図１１に示す例において、ハッシュ値ｂｌｕｅ及びｒｅｄには、書き込み物理アドレスｌｕｎ０−０１及びｌｕｎ２−０５が関連付けられている。 A hash value and a write physical address may be associated with the first hash value management table. In the example illustrated in FIG. 11, write physical addresses luna0-01 and luna2-05 are associated with the hash values blue and red.

共通データ登録部１１は、ストレージシステム１００を事前に所定時間動作させて、ＬＵＮ間に亘って書き込まれたデータに関する情報を収集してよい。そして、共通データ登録部１１は、収集した情報に基づいて、書き込み回数が上位のデータに関する情報を所定の個数（例えば、１０万個）登録した第１ハッシュ値管理テーブルを作成してよい。 The common data registration unit 11 may operate the storage system 100 for a predetermined time in advance to collect information about data written between LUNs. Then, based on the collected information, the common data registration unit 11 may create a first hash value management table in which a predetermined number (for example, 100,000) of information related to data with a higher write count is registered.

共通データ登録部１１は、アプリケーションを起動させずにストレージシステム１００を動作させた状態で、重複して使用されたデータを第１ハッシュ値管理テーブルに収集してよい。共通データ登録部１１は、例えば、ＯＳやファイルシステム、ストレージシステム１００の運用環境デフォルトで実行されているプロセスによって書き込まれるデータを、第１ハッシュ値管理テーブルに登録してよい。 The common data registration unit 11 may collect duplicately used data in the first hash value management table in a state where the storage system 100 is operated without starting an application. For example, the common data registration unit 11 may register data written by a process executed as an operating environment default of the OS, file system, and storage system 100 in the first hash value management table.

別言すれば、共通データ登録部１１は、ストレージ装置１によって実行させるアプリケーションが起動していない状態において、重複して書き込まれるデータについての情報を第１ハッシュ値管理テーブルに登録してよい。これにより、共通データ登録部１１は、第１ハッシュ値管理テーブルを生成してよい。 In other words, the common data registration unit 11 may register information on data to be written redundantly in the first hash value management table in a state where an application to be executed by the storage device 1 is not activated. Thereby, the common data registration unit 11 may generate the first hash value management table.

また、共通データ登録部１１は、アプリケーションを起動させてストレージシステム１００を動作させた状態で、重複して使用されたデータを第１ハッシュ値管理テーブルに収集してよい。共通データ登録部１１は、例えば、ＯＳ等によって書き込まれるデータに加えて、複数のアプリケーション間で共通に使用されるデータを、第１ハッシュ値管理テーブルに登録してよい。共通データ登録部１１は、複数の第２段の重複除去部１０ｂに対して各仮想ボリューム３０ａに書き込まれているデータを問い合わせることにより、複数のアプリケーション間で共通に使用されるデータを特定してよい。ここで、「ＯＳ等によって書き込まれるデータ」とは、例えば、ＯＳやファイルシステム、ストレージシステム１００の運用環境デフォルトで実行されているプロセスによって書き込まれるデータである。 Further, the common data registration unit 11 may collect data used redundantly in the first hash value management table in a state where the application is started and the storage system 100 is operated. For example, the common data registration unit 11 may register data used in common among a plurality of applications in the first hash value management table in addition to data written by the OS or the like. The common data registration unit 11 specifies the data used in common among the plurality of applications by inquiring of the data written in each virtual volume 30a to the plurality of second-stage deduplication units 10b. Good. Here, “data written by the OS or the like” is, for example, data written by a process executed by the operating environment default of the OS, file system, or storage system 100.

別言すれば、共通データ登録部１１は、ストレージ装置１によって実行されるアプリケーションが動作している状態において、複数の仮想ボリューム３０ａに亘って重複して書き込まれるデータについての情報を第１ハッシュ値管理テーブルに登録してよい。これにより、共通データ登録部１１は、第１ハッシュ値管理テーブルを生成してよい。 In other words, the common data registration unit 11 uses the first hash value to store information about data that is written redundantly across the plurality of virtual volumes 30a in a state where the application executed by the storage device 1 is operating. It may be registered in the management table. Thereby, the common data registration unit 11 may generate the first hash value management table.

第１データ入力部１２は、例示的に、サーバ装置４から送信された入力データ（別言すれば、「書き込みデータ」）を受信する。別言すれば、第１データ入力部１２は、サーバ装置４から送信された書き込みアドレス（別言すれば、「ＬＵＮ及びアドレス」）とデータとの入力を受け付けてよい。 The first data input unit 12 exemplarily receives input data (in other words, “write data”) transmitted from the server device 4. In other words, the first data input unit 12 may accept input of a write address (in other words, “LUN and address”) and data transmitted from the server device 4.

また、第１データ入力部１２は、入力データを所定サイズのブロック（以下、「入力ブロック」と称する場合がある。）に分割して、それぞれの入力ブロックと入力ブロックの書き込みアドレスとを計算してよい。そして、第１データ入力部１２は、第１判定部１３に対して、入力ブロックと書き込みアドレスとの組を入力してよい。 The first data input unit 12 divides the input data into blocks of a predetermined size (hereinafter sometimes referred to as “input blocks”), and calculates each input block and the write address of the input block. It's okay. Then, the first data input unit 12 may input a set of an input block and a write address to the first determination unit 13.

第１判定部１３は、例示的に、入力ブロックが複数の仮想ボリューム３０ａに亘って重複して書き込まれているかを判定する。別言すれば、第１判定部１３は、入力ブロックが、ＯＳに使用されるブロックであるのか、個別のアプリケーションによって使用されるブロックであるのかを判定してよい。更に別言すれば、第１判定部１３は、入力ブロックのハッシュ値を計算し、第１ハッシュ値管理テーブルを検索して、同じハッシュ値のエントリが登録されているかを調べてよい。 For example, the first determination unit 13 determines whether or not the input block is written over the plurality of virtual volumes 30a. In other words, the first determination unit 13 may determine whether the input block is a block used by the OS or a block used by an individual application. In other words, the first determination unit 13 may calculate a hash value of the input block, search the first hash value management table, and check whether or not an entry with the same hash value is registered.

第１判定部１３は、第１ハッシュ値管理テーブルを用いて、入力ブロックが複数の仮想ボリューム３０ａに亘って重複して書き込まれているかを判定してよい。第１判定部１３は、第１ハッシュ値管理テーブルに登録されているブロックについては、複数の仮想ボリューム３０ａに亘って重複して書き込まれていると判定してよい。また、第１判定部１３は、第１ハッシュ値管理テーブルに登録されていないブロックについては、複数の仮想ボリューム３０ａに亘って重複して書き込まれていないと判定してよい。 The first determination unit 13 may determine whether or not the input block is written over the plurality of virtual volumes 30a using the first hash value management table. The first determination unit 13 may determine that the block registered in the first hash value management table has been written over a plurality of virtual volumes 30a. In addition, the first determination unit 13 may determine that a block that is not registered in the first hash value management table has not been written over a plurality of virtual volumes 30a.

通信部１４は、例示的に、第１判定部１３による判定結果に基づき、第２段の重複除去部１０ｂに対して、入力ブロックに関する情報を入力する。 For example, the communication unit 14 inputs information related to the input block to the second-stage deduplication unit 10b based on the determination result by the first determination unit 13.

具体的には、通信部１４は、入力ブロックに関する情報が第１ハッシュ値管理テーブルに登録されている場合には、入力ブロックについてのエントリの書き込み物理アドレスを読み出してよい。そして、通信部１４は、入力ブロックのヘッダ等の情報から調べた仮想アドレスのＬＵＮに対応づけられた第２段の重複除去部１０ｂに対して、入力ブロックの仮想アドレスと書き込み物理アドレスとの組を通知してよい。 Specifically, the communication unit 14 may read the write physical address of the entry for the input block when information about the input block is registered in the first hash value management table. Then, the communication unit 14 sets the combination of the virtual address of the input block and the write physical address to the second-stage deduplication unit 10b associated with the LUN of the virtual address determined from the information such as the header of the input block. May be notified.

また、通信部１４は、入力ブロックに関する情報が第１ハッシュ値管理テーブルに登録されていない場合には、入力ブロックのヘッダ等の情報から仮想アドレスを調べて、入力ブロックを書き込むべきＬＵＮを特定してよい。そして、通信部１４は、特定したＬＵＮに対応づけられた第２段の重複除去部１０ｂに対して、入力ブロックの仮想アドレスと内容とを通知してよい。 In addition, when the information related to the input block is not registered in the first hash value management table, the communication unit 14 checks the virtual address from the information such as the header of the input block and specifies the LUN in which the input block is to be written. It's okay. And the communication part 14 may notify the virtual address and content of an input block with respect to the duplicate removal part 10b of the 2nd step | paragraph matched with the specified LUN.

図１２は、実施形態の一例としての第２段の重複除去部１０ｂの機能を模式的に示す図である。 FIG. 12 is a diagram schematically illustrating the function of the second-stage deduplication unit 10b as an example of the embodiment.

図１２に示す例においては、第２段の重複除去部１０ｂが備える機能のうち、一部の機能が示されている。 In the example illustrated in FIG. 12, some of the functions included in the second-stage duplicate removal unit 10 b are illustrated.

図１２に示される第２ハッシュ値管理テーブルに関する情報は、例示的に、複数の第２段の重複除去部１０ｂ毎に、例えば記憶装置３０の所定の領域に格納される。第２ハッシュ値管理テーブルは、第２段の重複除去部１０ｂに対応するＬＵＮに閉じてデータに重複を管理するためのテーブルであり、ＬＲＵによって管理される所定数（例えば１０万個）のエントリを含んでよい。 The information related to the second hash value management table shown in FIG. 12 is illustratively stored, for example, in a predetermined area of the storage device 30 for each of the plurality of second-stage duplicate removal units 10b. The second hash value management table is a table for managing duplication of data by closing to the LUN corresponding to the second-stage duplicate removal unit 10b, and a predetermined number (for example, 100,000) entries managed by the LRU. May be included.

第２ハッシュ値管理テーブルには、ハッシュ値と書き込み物理アドレスとが関連付けられてよい。図１２に示す例において、ハッシュ値ｂｒｏｗｎには、書き込み物理アドレス０６が関連付けられている。 A hash value and a write physical address may be associated with the second hash value management table. In the example shown in FIG. 12, a write physical address 06 is associated with the hash value brown.

第２データ入力部１５は、例示的に、第１段の重複除去部１０ａから入力された入力ブロックに関する情報を受け取る。 The second data input unit 15 exemplarily receives information related to the input block input from the first-stage deduplication unit 10a.

具体的には、第２データ入力部１５は、入力ブロックが複数の仮想ボリューム３０ａに亘って重複して書き込まれていない場合には、第１段の重複除去部１０ａから、入力ブロックの仮想アドレスと内容との組を受け取ってよい。また、第２データ入力部１５は、入力ブロックが複数の仮想ボリューム３０ａに亘って重複して書き込まれている場合には、第１段の重複除去部１０ａから、入力ブロックの仮想アドレスと書き込み物理アドレスとの組を受け取ってよい。 Specifically, when the input block is not written redundantly across the plurality of virtual volumes 30a, the second data input unit 15 sends the virtual address of the input block from the first-stage deduplication unit 10a. And a set of contents may be received. Further, when the input block is written redundantly across the plurality of virtual volumes 30a, the second data input unit 15 receives the virtual address and write physical of the input block from the first-stage duplicate removal unit 10a. You may receive a pair with an address.

第２データ入力部１５は、入力ブロックの仮想アドレスと内容との組を受け取った場合には、第２判定部１６に対して、受け取った入力ブロックの仮想アドレスと内容との組を入力してよい。また、第２データ入力部１５は、入力ブロックの仮想アドレスと書き込み物理アドレスとの組を受け取った場合には、書き込み部１８に対して、受け取った入力ブロックの仮想アドレスと書き込み物理アドレスとの組を入力してよい。 When the second data input unit 15 receives the set of the virtual address and the content of the input block, the second data input unit 15 inputs the set of the received virtual address and the content of the input block to the second determination unit 16. Good. In addition, when the second data input unit 15 receives the set of the virtual address and the write physical address of the input block, the second data input unit 15 sends the set of the virtual address and the write physical address of the received input block to the write unit 18. May be entered.

第２判定部１６は、例示的に、第２ハッシュ値管理テーブルを用いて、対応する仮想ボリューム３０ａにおける入力ブロックの重複を判定する。 For example, the second determination unit 16 determines duplication of input blocks in the corresponding virtual volume 30a using the second hash value management table.

具体的には、第２判定部１６は、入力ブロックに関する情報が既に第２ハッシュ値管理テーブルに登録されている場合には、第２ハッシュ値管理テーブルから、書き込み物理アドレスを読み出してよい。そして、第２判定部１６は、書き込み部１８に対して、入力ブロックの仮想アドレスと、読み出した書き込み物理アドレスとの組を入力してよい。 Specifically, the second determination unit 16 may read the write physical address from the second hash value management table when information about the input block is already registered in the second hash value management table. Then, the second determination unit 16 may input a set of the virtual address of the input block and the read write physical address to the writing unit 18.

第２判定部１６は、入力ブロックに関する情報が既に第２ハッシュ値管理テーブルに登録されていない場合には、個別データ登録部１７に対して、入力ブロックの仮想アドレスと内容との組を入力してよい。 When the information regarding the input block is not already registered in the second hash value management table, the second determination unit 16 inputs the set of the virtual address and content of the input block to the individual data registration unit 17. It's okay.

個別データ登録部１７（図１２には不図示）は、例示的に、第２ハッシュ値管理テーブルに入力ブロックについての情報を登録する。 The individual data registration unit 17 (not shown in FIG. 12) exemplarily registers information about the input block in the second hash value management table.

具体的には、個別データ登録部１７は、入力ブロックに関する情報が第２ハッシュ値管理テーブルに登録されていない場合には、第２ハッシュ値管理テーブルの複数のエントリのうち、最もＬＲＵの優先度が低いエントリを選択してよい。個別データ登録部１７は、選択したエントリに入力ブロックのハッシュ値を書き込み（別言すれば、上書きし）、書き込み部１８に対して、入力ブロックの仮想アドレスと内容との組を入力してよい。 Specifically, when the information about the input block is not registered in the second hash value management table, the individual data registration unit 17 has the highest LRU priority among the plurality of entries in the second hash value management table. An entry with a low may be selected. The individual data registration unit 17 may write the hash value of the input block to the selected entry (in other words, overwrite), and input the set of the virtual address and content of the input block to the writing unit 18. .

書き込み部１８は、例示的に、対応する仮想ボリューム３０ａに対して、入力ブロックを書き込む。 For example, the writing unit 18 writes the input block to the corresponding virtual volume 30a.

具体的には、書き込み部１８は、入力ブロックの仮想アドレスと書き込み物理アドレスとの組を受け取った場合には、仮想アドレス空間テーブルの対応する仮想アドレスに対して、入力ブロックの書き込み物理アドレスを関連付けて登録してよい。これにより、入力ブロックの仮想アドレスの値は、書き込み物理アドレスに記録されているブロックの内容となる。 Specifically, when the writing unit 18 receives a set of the virtual address and the write physical address of the input block, the writing unit 18 associates the write physical address of the input block with the corresponding virtual address in the virtual address space table. You can register. Thereby, the value of the virtual address of the input block becomes the content of the block recorded at the write physical address.

書き込み部１８は、入力ブロックの仮想アドレスと内容との組を受け取った場合には、受け取った入力ブロックの内容を記憶装置３０に書き込んでよい。また、書き込み部１８は、管理テーブルに新たな１エントリを追加して、追加したエントリに入力ブロックについての書き込み物理アドレスを登録してよい。更に、書き込み部１８は、仮想アドレス空間テーブルの対応する仮想アドレスに対して、書き込み物理アドレスを関連付けて登録してよい。 When the writing unit 18 receives a set of the virtual address and content of the input block, the writing unit 18 may write the content of the received input block into the storage device 30. The writing unit 18 may add one new entry to the management table and register the write physical address for the input block in the added entry. Further, the writing unit 18 may register the writing physical address in association with the corresponding virtual address in the virtual address space table.

個別データ登録部１７（図１２には不図示）は、書き込み部１８によって管理テーブルに登録された書き込み物理アドレスを、第２ハッシュ値管理テーブルの入力ブロックについてのエントリの書き込み物理アドレスのカラムに登録してよい。 The individual data registration unit 17 (not shown in FIG. 12) registers the write physical address registered in the management table by the write unit 18 in the write physical address column of the entry for the input block of the second hash value management table. You can do it.

入力ブロックの仮想アドレスと内容との組を受け取った場合の、書き込み部１８及び個別データ登録部１７の機能により、入力ブロックの値は、書き込み物理アドレスに記録されているブロックの内容となる。 When the set of the virtual address and content of the input block is received, the value of the input block becomes the content of the block recorded at the write physical address by the functions of the writing unit 18 and the individual data registration unit 17.

〔Ａ−２〕動作
上述の如く構成された実施形態の一例としての第１段の重複除去動作（ステップＳ１〜Ｓ５）を、図１３に示すフローチャートに従って説明する。 [A-2] Operation The first-stage deduplication operation (steps S1 to S5) as an example of the embodiment configured as described above will be described with reference to the flowchart shown in FIG.

第１データ入力部１２は、サーバ装置４から送信された書き込みデータを受け取る（ステップＳ１）。 The first data input unit 12 receives the write data transmitted from the server device 4 (step S1).

第１判定部１３は、書き込みデータのハッシュ値を算出する（ステップＳ２）。 The first determination unit 13 calculates a hash value of the write data (Step S2).

第１判定部１３は、書き込みデータのハッシュ値が第１ハッシュ値管理テーブルに登録されているかを判定する（ステップＳ３）。 The first determination unit 13 determines whether the hash value of the write data is registered in the first hash value management table (step S3).

書き込みデータのハッシュ値が第１ハッシュ値管理テーブルに登録されていない場合には（ステップＳ３のＮｏルート参照）、通信部１４は、書き込みデータの仮想アドレスから書き込むべき仮想ボリューム３０ａを特定する。そして、通信部１４は、特定した仮想ボリューム３０ａに対応する第２段の重複除去部１０ｂに対して、書き込みデータの仮想アドレスと内容とを通知し（ステップＳ４）、処理は終了する。 When the hash value of the write data is not registered in the first hash value management table (see No route in step S3), the communication unit 14 specifies the virtual volume 30a to be written from the virtual address of the write data. Then, the communication unit 14 notifies the second-stage duplicate removal unit 10b corresponding to the identified virtual volume 30a of the virtual address and content of the write data (step S4), and the process ends.

一方、書き込みデータのハッシュ値が第１ハッシュ値管理テーブルに登録されている場合には（ステップＳ３のＹｅｓルート参照）、通信部１４は、対応する第２段の重複除去部１０ｂに対して、書き込みデータの仮想アドレスと書き込み物理アドレスとの組を通知する（ステップＳ５）。そして、処理は終了する。 On the other hand, when the hash value of the write data is registered in the first hash value management table (see the Yes route in step S3), the communication unit 14 performs the corresponding second-stage deduplication unit 10b. A set of the virtual address and write physical address of the write data is notified (step S5). Then, the process ends.

次に、実施形態の一例としての第２段の重複除去動作（ステップＳ１１〜Ｓ１６）を、図１４に示すフローチャートに従って説明する。 Next, the second-stage deduplication operation (steps S11 to S16) as an example of the embodiment will be described with reference to the flowchart shown in FIG.

第２データ入力部１５は、第１段の重複除去部１０ａから入力された書き込みデータを受け取る（ステップＳ１１）。 The second data input unit 15 receives the write data input from the first-stage deduplication unit 10a (step S11).

第２判定部１６は、書き込みデータのハッシュ値を算出する（ステップＳ１２）。 The second determination unit 16 calculates a hash value of the write data (Step S12).

第２判定部１６は、書き込みデータのハッシュ値が第２ハッシュ値管理テーブルに登録されているかを判定する（ステップＳ１３）。 The second determination unit 16 determines whether the hash value of the write data is registered in the second hash value management table (step S13).

書き込みデータのハッシュ値が第２ハッシュ値管理テーブルに登録されている場合には（ステップＳ１３のＹｅｓルート参照）、処理は終了する。 If the hash value of the write data is registered in the second hash value management table (see the Yes route in step S13), the process ends.

一方、書き込みデータのハッシュ値が第２ハッシュ値管理テーブルに登録されていない場合には（ステップＳ１３のＮｏルート参照）、個別データ登録部１７は、第２ハッシュ値管理テーブルを更新する。具体的には、個別データ登録部１７は、第２ハッシュ値管理テーブルにおいて最も使われていないエントリの内容を、書き込みデータのハッシュ値で書き換える（ステップＳ１４）。 On the other hand, when the hash value of the write data is not registered in the second hash value management table (see No route in step S13), the individual data registration unit 17 updates the second hash value management table. Specifically, the individual data registration unit 17 rewrites the content of the entry that is least used in the second hash value management table with the hash value of the write data (step S14).

書き込み部１８は、書き込みデータを記憶装置３０に書き込む。また、書き込み部１８は、書き込みデータの仮想アドレスを管理テーブルに登録し、書き込みデータのインデックスを作成する（ステップＳ１５）。 The writing unit 18 writes write data into the storage device 30. The writing unit 18 registers the virtual address of the write data in the management table, and creates an index of the write data (step S15).

個別データ登録部１７は、第２ハッシュ値管理テーブルの対応するエントリにおける書き込み物理アドレスの内容を、作成されたインデックスに書き換える（ステップＳ１６）。そして、処理は終了する。 The individual data registration unit 17 rewrites the contents of the write physical address in the corresponding entry of the second hash value management table with the created index (step S16). Then, the process ends.

次に、実施形態の一例としての第１ハッシュ値管理テーブルの作成動作（ステップＳ２１〜Ｓ２９）の第１の例を、図１５に示すフローチャートに従って説明する。図１５においては、アプリケーションを起動させずにストレージシステム１００を動作させた状態での第１ハッシュ値管理テーブルの作成動作を説明する。 Next, a first example of the first hash value management table creation operation (steps S21 to S29) as an example of the embodiment will be described with reference to the flowchart shown in FIG. In FIG. 15, the first hash value management table creation operation in a state where the storage system 100 is operated without activating an application will be described.

共通データ登録部１１は、第１ハッシュ値管理テーブルと同じエントリ数を有する作業用ハッシュ値管理テーブルを作成する（ステップＳ２１）。ステップＳ２１の状態において、第１ハッシュ値管理テーブル及び作成された作業用ハッシュ値管理テーブルの各エントリは空である。 The common data registration unit 11 creates a working hash value management table having the same number of entries as the first hash value management table (step S21). In the state of step S21, each entry in the first hash value management table and the created work hash value management table is empty.

共通データ登録部１１は、アプリケーションを起動させずにストレージシステム１００を動作させる（ステップＳ２２）。 The common data registration unit 11 operates the storage system 100 without starting the application (step S22).

第１データ入力部１２は、サーバ装置４から送信された書き込みデータを受け取る（ステップＳ２３）。 The first data input unit 12 receives the write data transmitted from the server device 4 (step S23).

第１判定部１３は、書き込みデータのハッシュ値を算出する（ステップＳ２４）。 The first determination unit 13 calculates a hash value of the write data (step S24).

第１判定部１３は、書き込みデータのハッシュ値が第１ハッシュ値管理テーブルに登録されているかを判定する（ステップＳ２５）。 The first determination unit 13 determines whether the hash value of the write data is registered in the first hash value management table (step S25).

書き込みデータのハッシュ値が第１ハッシュ値管理テーブルに登録されていない場合には（ステップＳ２５のＮｏルート参照）、共通データ登録部１１は、第１ハッシュ値管理テーブルを更新する。具体的には、共通データ登録部１１は、第１ハッシュ値管理テーブルの最も使われていないエントリを、書き込みデータのハッシュ値で書き換える（ステップＳ２６）。そして、処理はステップＳ２３へ戻る。 If the hash value of the write data is not registered in the first hash value management table (see No route in step S25), the common data registration unit 11 updates the first hash value management table. Specifically, the common data registration unit 11 rewrites the least used entry in the first hash value management table with the hash value of the write data (step S26). Then, the process returns to step S23.

一方、書き込みデータのハッシュ値が第１ハッシュ値管理テーブルに登録されている場合には（ステップＳ２５のＹｅｓルート参照）、共通データ登録部１１は、作業用ハッシュ値管理テーブルを参照する。そして、共通データ登録部１１は、書き込みデータのハッシュ値が作業用ハッシュ値管理テーブルに未だ登録されていなければ、書き込みデータのハッシュ値を作業用ハッシュ値管理テーブルに登録する（ステップＳ２７）。 On the other hand, when the hash value of the write data is registered in the first hash value management table (see the Yes route in step S25), the common data registration unit 11 refers to the work hash value management table. If the hash value of the write data is not yet registered in the work hash value management table, the common data registration unit 11 registers the hash value of the write data in the work hash value management table (step S27).

共通データ登録部１１は、作業用ハッシュ値管理テーブルの全てのエントリにハッシュ値が登録されたかを判定する（ステップＳ２８）。 The common data registration unit 11 determines whether or not hash values are registered in all entries of the work hash value management table (step S28).

全てのエントリにハッシュ値が登録されていない場合には（ステップＳ２８のＮｏルート参照）、処理はステップＳ２３へ戻る。 If hash values are not registered in all entries (see No route in step S28), the process returns to step S23.

一方、全てのエントリにハッシュ値が登録されている場合には（ステップＳ２８のＹｅｓルート参照）、共通データ登録部１１は、作業用ハッシュ値管理テーブルの内容を第１ハッシュ値管理テーブルにコピー（別言すれば、上書き）する。また、共通データ登録部１１は、作業用ハッシュ値管理テーブルを破棄する（ステップＳ２９）。そして、処理は終了する。 On the other hand, when hash values are registered in all entries (see the Yes route in step S28), the common data registration unit 11 copies the contents of the working hash value management table to the first hash value management table ( In other words, overwrite). The common data registration unit 11 discards the work hash value management table (step S29). Then, the process ends.

次に、実施形態の一例としての第１ハッシュ値管理テーブルの作成動作（ステップＳ３１〜Ｓ３９）の第２の例を、図１６に示すフローチャートに従って説明する。図１６においては、アプリケーションを起動させてストレージシステム１００を動作させた状態での第１ハッシュ値管理テーブルの作成動作を説明する。 Next, a second example of the first hash value management table creation operation (steps S31 to S39) as an example of the embodiment will be described with reference to the flowchart shown in FIG. In FIG. 16, the creation operation of the first hash value management table in a state where the application is started and the storage system 100 is operated will be described.

共通データ登録部１１は、第１ハッシュ値管理テーブルと同じエントリ数を有する作業用ハッシュ値管理テーブルを作成する（ステップＳ３１）。ステップＳ３１の状態において、第１ハッシュ値管理テーブル及び作成された作業用ハッシュ値管理テーブルの各エントリは空である。 The common data registration unit 11 creates a working hash value management table having the same number of entries as the first hash value management table (step S31). In the state of step S31, each entry in the first hash value management table and the created work hash value management table is empty.

共通データ登録部１１は、アプリケーションを起動させてストレージシステム１００を動作させる（ステップＳ３２）。 The common data registration unit 11 activates the application and operates the storage system 100 (step S32).

第１データ入力部１２は、サーバ装置４から送信された書き込みデータを受け取る（ステップＳ３３）。 The first data input unit 12 receives the write data transmitted from the server device 4 (step S33).

第１判定部１３は、書き込みデータのハッシュ値を算出する（ステップＳ３４）。 The first determination unit 13 calculates a hash value of the write data (step S34).

通信部１４は、書き込みデータの仮想アドレスから、書き込みデータを書き込むべき仮想ボリューム３０ａを特定する。そして、通信部１４は、特定した仮想ボリューム３０ａに対応する第２段の重複除去部１０ｂに対して、書き込みデータの仮想アドレスとブロックの内容とを通知する（ステップＳ３５）。 The communication unit 14 specifies the virtual volume 30a in which the write data is to be written, from the virtual address of the write data. Then, the communication unit 14 notifies the second-stage deduplication unit 10b corresponding to the identified virtual volume 30a of the virtual address of the write data and the contents of the block (step S35).

第１判定部１３は、複数の第２段の重複除去部１０ｂの第２判定部１６による判定結果により、書き込みデータのハッシュ値について、複数の第２ハッシュ値管理テーブルに共通のエントリがあるかを判定する（ステップＳ３６）。 Whether the first determination unit 13 has a common entry in the plurality of second hash value management tables for the hash value of the write data based on the determination result by the second determination unit 16 of the plurality of second-stage deduplication units 10b. Is determined (step S36).

書き込みデータのハッシュ値について複数の第２ハッシュ値管理テーブルに共通のエントリがない場合には（ステップＳ３６のＮｏルート参照）、処理はステップＳ３３へ戻る。 If there is no common entry for the hash value of the write data in the plurality of second hash value management tables (see No route in step S36), the process returns to step S33.

一方、書き込みデータのハッシュ値について、複数の第２ハッシュ値管理テーブルに共通のエントリがある場合には（ステップＳ３６のＹｅｓルート参照）、共通データ登録部１１は、作業用ハッシュ値管理テーブルを参照する。そして、共通データ登録部１１は、書き込みデータのハッシュ値が作業用ハッシュ値管理テーブルに未だ登録されていなければ、書き込みデータのハッシュ値を作業用ハッシュ値管理テーブルに登録する（ステップＳ３７）。 On the other hand, if there is a common entry in the plurality of second hash value management tables for the hash value of the write data (see the Yes route in step S36), the common data registration unit 11 refers to the work hash value management table. To do. If the hash value of the write data is not yet registered in the work hash value management table, the common data registration unit 11 registers the hash value of the write data in the work hash value management table (step S37).

共通データ登録部１１は、作業用ハッシュ値管理テーブルの全てのエントリにハッシュ値が登録されたかを判定する（ステップＳ３８）。 The common data registration unit 11 determines whether or not hash values are registered in all entries of the work hash value management table (step S38).

全てのエントリにハッシュ値が登録されていない場合には（ステップＳ３８のＮｏルート参照）、処理はステップＳ３３へ戻る。 If hash values are not registered in all entries (see No route in step S38), the process returns to step S33.

一方、全てのエントリにハッシュ値が登録されている場合には（ステップＳ３８のＹｅｓルート参照）、共通データ登録部１１は、作業用ハッシュ値管理テーブルの内容を第１ハッシュ値管理テーブルにコピー（別言すれば、上書き）する。また、共通データ登録部１１は、作業用ハッシュ値管理テーブルを破棄する（ステップＳ３９）。そして、処理は終了する。 On the other hand, when hash values are registered in all entries (see the Yes route in step S38), the common data registration unit 11 copies the contents of the work hash value management table to the first hash value management table ( In other words, overwrite). Further, the common data registration unit 11 discards the work hash value management table (step S39). Then, the process ends.

以下、図１７及び図１８を参照しながら、関連技術としてのハッシュ値管理テーブルの使用例と、実施形態の一例としてのハッシュ値管理テーブルの使用例とを比較する。 Hereinafter, a usage example of the hash value management table as the related technique will be compared with a usage example of the hash value management table as an example of the embodiment with reference to FIGS. 17 and 18.

図１７は関連技術としてのハッシュ値管理テーブルの使用例を説明する図であり、図１８は実施形態の一例としてのハッシュ値管理テーブルの使用例を説明する図である。 FIG. 17 is a diagram for explaining a usage example of a hash value management table as a related technique, and FIG. 18 is a diagram for explaining a usage example of a hash value management table as an example of an embodiment.

ストレージ装置１において、２つのアプリケーションＡ及びアプリケーションＢが実行され、アプリケーションＡ及びアプリケーションＢがＬＵＮ０及びＬＵＮ１をそれぞれ使用する場合を考える。 Consider a case in which two applications A and B are executed in the storage apparatus 1 and applications A and B use LUN0 and LUN1, respectively.

アプリケーションＡによって書き込まれるデータが時刻順に、Ｃ−０，Ａ−０，Ａ−１，Ｃ−１，Ａ−０，Ｃ−１，・・・であると仮定する。また、アプリケーションＢによって書き込まれるデータが時刻順に、Ｃ−１，Ｂ−０，Ｂ−１，Ｂ−２，Ｂ−３，Ｃ−０，Ｂ−３，Ｂ−４，Ｃ−１，・・・であると仮定する。なお、Ａで始まるデータはアプリケーションＡでのみ使用されるデータであり、Ｂで始まるデータはアプリケーションＢでのみ使用されるデータであり、Ｃで始まるデータはアプリケーションＡ及びアプリケーションＢの両方で使用されるデータとする。 Assume that data written by the application A is C-0, A-0, A-1, C-1, A-0, C-1,. Further, data written by the application B is in the order of time C-1, B-0, B-1, B-2, B-3, C-0, B-3, B-4, C-1,. Suppose that Note that data starting with A is data used only by application A, data starting with B is data used only by application B, and data starting with C is used by both application A and application B. Data.

そして、アプリケーションＡ及びアプリケーションＢによって書き込まれるデータが混ざって、図１７及び図１８に示すように、以下の順番で記憶装置３０に書き込まれると仮定する。 It is assumed that data written by the application A and the application B are mixed and written to the storage device 30 in the following order as shown in FIGS.

（１）Ｃ−０，（２）Ｃ−１，（３）Ａ−０，（４）Ａ−１，（５）Ｂ−０，（６）Ｂ−１，（７）Ｂ−２，（８）Ｂ−３，（９）Ｃ−１，（１０）Ａ−０，（１１）Ｃ−０，（１２）Ｂ−３，（１３）Ｃ−１，（１４）Ｂ−４，（１５）Ｃ−１，・・・ (1) C-0, (2) C-1, (3) A-0, (4) A-1, (5) B-0, (6) B-1, (7) B-2, ( 8) B-3, (9) C-1, (10) A-0, (11) C-0, (12) B-3, (13) C-1, (14) B-4, (15 ) C-1, ...

図１７に示す例においては、関連技術において使用する１つのハッシュ値管理テーブルが示されている。図１７に示すハッシュ値管理テーブルは、６つのエントリを有しており、初期状態では全てのエントリが空である（符号Ａ１）。 In the example shown in FIG. 17, one hash value management table used in the related art is shown. The hash value management table shown in FIG. 17 has six entries, and all entries are empty in the initial state (reference numeral A1).

なお、実際のストレージ装置１においてはハッシュ値管理テーブルは例えば数十万個以上のエントリを有してよいが、本例においては説明のためにハッシュ値管理テーブルのエントリ数を少なくしている。 In the actual storage apparatus 1, the hash value management table may have hundreds of thousands or more entries, but in this example, the number of entries in the hash value management table is reduced for the sake of explanation.

図１７に示す関連技術におけるハッシュ値管理テーブルは、実施形態の一例としてのストレージ装置１によって使用されてもよい。 The hash value management table in the related technology shown in FIG. 17 may be used by the storage apparatus 1 as an example of the embodiment.

以下、図１７を参照して、上記の（１）〜（１０）のデータが順番に記憶装置３０に書き込まれる際の、ハッシュ値管理テーブルの内容の変化をシミュレートする。記憶装置３０に新たなデータが書き込まれた際に書き込まれた新たなデータがハッシュ値管理テーブルに登録されていない場合には、最も使用されていないエントリの内容が破棄されて、内容が破棄されたエントリに新たなデータが登録されてよい。 Hereinafter, with reference to FIG. 17, a change in the contents of the hash value management table when the data (1) to (10) is sequentially written in the storage device 30 is simulated. If new data written when new data is written to the storage device 30 is not registered in the hash value management table, the content of the least used entry is discarded and the content is discarded. New data may be registered in the new entry.

まず、上記（１）〜（６）のデータが順番に、ハッシュ値管理テーブルの６つのエントリに書き込まれる（符号Ａ２）。別言すれば、（１）Ｃ−０，（２）Ｃ−１，（３）Ａ−０，（４）Ａ−１，（５）Ｂ−０及び（６）Ｂ−１が順番に、ハッシュ値管理テーブルの６つのエントリに書き込まれる。 First, the data of (1) to (6) are written in the six entries of the hash value management table in order (reference A2). In other words, (1) C-0, (2) C-1, (3) A-0, (4) A-1, (5) B-0 and (6) B-1 are in order, It is written in six entries of the hash value management table.

（７）及び（８）のデータの書き込みがされると、ハッシュ値管理テーブルにおいて最も使われていないデータＣ−０及びＣ−１は、データＢ−２及びＢ−３でそれぞれ上書きされる（符号Ａ３）。 When the data (7) and (8) are written, the data C-0 and C-1 that are least used in the hash value management table are overwritten with the data B-2 and B-3, respectively ( Reference A3).

（９）のデータの書き込みがされると、ハッシュ値管理テーブルにおいて最も使われていないデータＡ−０が破棄され、データＣ−１が登録される（符号Ａ４）。 When the data (9) is written, the data A-0 that is least used in the hash value management table is discarded, and the data C-1 is registered (reference A4).

（１０）のデータの書き込みがされると、ハッシュ値管理テーブルにおいて最も使われていないデータＡ−１が破棄され、データＡ−０が登録される（符号Ａ５）。 When the data of (10) is written, the data A-1 that is least used in the hash value management table is discarded and the data A-0 is registered (reference A5).

このように、関連技術におけるハッシュ値管理テーブルの使用例においては、上記（１）〜（１０）のデータの書き込みで、データＣ−１及びＡ−１がそれぞれ２回書き込まれたにもかかわらず、データの重複関係が見つけられていない。 As described above, in the usage example of the hash value management table in the related technology, the data C-1 and A-1 are written twice in the data writing of the above (1) to (10). , Data duplication is not found.

図１８に示す例においては、実施形態の一例において使用する１つの第１ハッシュ値管理テーブル及び２つの第２ハッシュ値管理テーブルが示されている。また、２つの第２ハッシュ値管理テーブルのうち、一方はＬＵＮ０に対応し、他方はＬＵＮ１に対応する。以下、ＬＵＮ０に対応する第２ハッシュ値管理テーブルを「ＬＵＮ０用第２ハッシュ値管理テーブル」といい、ＬＵＮ１に対応する第２ハッシュ値管理テーブルを「ＬＵＮ１用第２ハッシュ値管理テーブル」という場合がある。図１８に示す第１ハッシュ値管理テーブル及び第２ハッシュ値管理テーブルは、いずれも２つのエントリを有している。 In the example illustrated in FIG. 18, one first hash value management table and two second hash value management tables used in the example of the embodiment are illustrated. Of the two second hash value management tables, one corresponds to LUN0 and the other corresponds to LUN1. Hereinafter, the second hash value management table corresponding to LUN0 may be referred to as “LUN0 second hash value management table”, and the second hash value management table corresponding to LUN1 may be referred to as “LUN1 second hash value management table”. is there. Each of the first hash value management table and the second hash value management table shown in FIG. 18 has two entries.

なお、実際のストレージ装置１においては第１ハッシュ値管理テーブル及び第２ハッシュ値管理テーブルは例えば数十万個以上のエントリを有してよいが、本例においては説明のためにハッシュ値管理テーブルのエントリ数を少なくしている。 In the actual storage device 1, the first hash value management table and the second hash value management table may have, for example, hundreds of thousands or more entries, but in this example, the hash value management table is used for explanation. The number of entries is reduced.

以下、図１８を参照して、上記の（１）〜（１０）のデータが順番に記憶装置３０に書き込まれる際の、ハッシュ値管理テーブルの内容の変化をシミュレートする。記憶装置３０に新たなデータが書き込まれた際に書き込まれた新たなデータが第１又は第２ハッシュ値管理テーブルに登録されていない場合には、最も使用されていないエントリの内容が破棄されて、内容が破棄されたエントリに新たなデータが登録されてよい。 Hereinafter, with reference to FIG. 18, a change in the contents of the hash value management table when the data (1) to (10) is sequentially written in the storage device 30 is simulated. If new data written when new data is written to the storage device 30 is not registered in the first or second hash value management table, the content of the least used entry is discarded. New data may be registered in the entry whose contents are discarded.

まず、初期状態において、第１ハッシュ値管理テーブルにはデータＣ−０及びＣ−１が登録されており、２つの第２ハッシュ値管理テーブルは空である（符号Ｂ１）。 First, in the initial state, data C-0 and C-1 are registered in the first hash value management table, and the two second hash value management tables are empty (reference numeral B1).

（１）及び（２）のデータが書き込まれても、データＣ−０及びＣ−１は第１ハッシュ値管理テーブルに登録されているため、いずれの第２ハッシュ値管理テーブルの内容も符号Ｂ１の状態から変化しない。 Even if the data of (1) and (2) are written, since the data C-0 and C-1 are registered in the first hash value management table, the contents of any of the second hash value management tables are represented by the code B1. No change from the state.

（３）及び（４）のデータが書き込まれると、第１ハッシュ値管理テーブルが参照されるが、データＡ−０及びＡ−１は登録されていない。そこで、ＬＵＮ０用第２ハッシュ値管理テーブルが参照されるが、データＡ−０及びＡ−１は登録されていないため、ＬＵＮ０用第２ハッシュ値管理テーブルにデータＡ−０及びＡ−１が登録される（符号Ｂ２）。 When the data of (3) and (4) is written, the first hash value management table is referred to, but the data A-0 and A-1 are not registered. Therefore, the second hash value management table for LUN0 is referred to, but since data A-0 and A-1 are not registered, data A-0 and A-1 are registered in the second hash value management table for LUN0. (Reference B2).

（５）及び（６）のデータが書き込まれると、第１ハッシュ値管理テーブルが参照されるが、データＢ−０及びＢ−１は登録されていない。そこで、ＬＵＮ１用第２ハッシュ値管理テーブルが参照されるが、データＢ−０及びＢ−１は登録されていないため、ＬＵＮ１用第２ハッシュ値管理テーブルにデータＢ−０及びＢ−１が登録される（符号Ｂ３）。 When the data of (5) and (6) are written, the first hash value management table is referred to, but the data B-0 and B-1 are not registered. Therefore, the second hash value management table for LUN1 is referred to, but since data B-0 and B-1 are not registered, data B-0 and B-1 are registered in the second hash value management table for LUN1. (Reference B3).

（７）及び（８）のデータが書き込まれると、第１ハッシュ値管理テーブルが参照されるが、データＢ−２及びＢ−３は登録されていない。そこで、ＬＵＮ１用第２ハッシュ値管理テーブルが参照されるが、データＢ−２及びＢ−３は登録されていないため、データＢ−０及びＢ−１が破棄されて、ＬＵＮ１用第２ハッシュ値管理テーブルにデータＢ−２及びＢ−３が登録される（符号Ｂ４）。 When the data of (7) and (8) is written, the first hash value management table is referenced, but the data B-2 and B-3 are not registered. Therefore, the LUN1 second hash value management table is referred to, but since the data B-2 and B-3 are not registered, the data B-0 and B-1 are discarded, and the LUN1 second hash value is stored. Data B-2 and B-3 are registered in the management table (reference B4).

（９）のデータが書き込まれても、データＣ−１は第１ハッシュ値管理テーブルに登録されている（別言すれば、第１ハッシュ値管理テーブルで重複を発見できた）ため、いずれの第２ハッシュ値管理テーブルの内容も符号Ｂ４の状態から変化しない。 Even if the data of (9) is written, the data C-1 is registered in the first hash value management table (in other words, duplication has been found in the first hash value management table). The content of the second hash value management table does not change from the state of the code B4.

（１０）のデータが書き込まれると、第１ハッシュ値管理テーブルが参照されるが、データＡ−０は登録されていない。そこで、ＬＵＮ０用第２ハッシュ値管理テーブルが参照され、データＡ−０が登録されている（別言すれば、第２ハッシュ値管理テーブルの重複が発見できた）ため、いずれの第２ハッシュ値管理テーブルの内容も符号Ｂ４の状態から変化しない。 When the data (10) is written, the first hash value management table is referred to, but the data A-0 is not registered. Therefore, since the second hash value management table for LUN0 is referred to and data A-0 is registered (in other words, duplication of the second hash value management table can be found), any second hash value is stored. The contents of the management table also do not change from the state of B4.

このように、実施形態の一例におけるハッシュ値管理テーブルの使用例においては、上記（１）〜（１０）のデータの書き込みで、Ｃ−１及びＡ−１がそれぞれ２回書き込まれて、重複関係を発見できた。 As described above, in the usage example of the hash value management table in the example of the embodiment, the data of (1) to (10) is written, and C-1 and A-1 are written twice, respectively. I was able to discover.

例えば、動画や画像を書き込むアプリケーションでは、上記のアプリケーションＢのように重複しない大量のデータを書き込むため、図１８の符号Ｂ１〜Ｂ４に示した状況が頻繁に発生する可能性がある。 For example, in an application for writing a moving image or an image, a large amount of non-overlapping data is written as in the above application B, and therefore the situation shown by reference numerals B1 to B4 in FIG. 18 may frequently occur.

図１７に示した関連技術としてのハッシュ値管理テーブルの使用例と、図１８に示した実施形態の一例としてのハッシュ値管理テーブルの使用例との間で、上述のような重複データの発見に相違が発生した原因の一例としては、次のようなことが考えられる。 For the discovery of duplicate data as described above between the usage example of the hash value management table as the related technique shown in FIG. 17 and the usage example of the hash value management table as an example of the embodiment shown in FIG. As an example of the cause of the difference, the following may be considered.

第一に、アプリケーションＢが重複しないデータを連続して大量に書き込んだため、関連技術としてのハッシュ値管理テーブルでは、それまでにハッシュ値管理テーブルに登録されていたエントリが書き換えられてしまったということである。 First, because application B has written a large amount of data that does not overlap, the hash value management table as a related technology has rewritten the entries registered in the hash value management table so far. That is.

第二に、関連技術としてのハッシュ値管理テーブルでは、全てのアプリケーションの書き込みを１つのハッシュ値管理テーブルで管理する。そのため、アプリケーションＢの書き込みによって、アプリケーションＡの書き込みにより登録されたエントリが影響を受けて消去されてしまったということである。 Second, in a hash value management table as a related technique, writing of all applications is managed by one hash value management table. Therefore, the entry registered by the writing of the application A is erased due to the writing of the application B.

例えば、第一の原因は関連技術としてのハッシュ値管理テーブルでデータＣ−１の重複が発見できなかった原因であり、第二の原因は関連技術としてのハッシュ値管理テーブルでデータＡ−１の重複が発見できなかった原因である。 For example, the first cause is that the duplication of the data C-1 cannot be found in the hash value management table as the related technology, and the second cause is the data of the data A-1 in the hash value management table as the related technology. This is the reason why duplicates could not be found.

実施形態の一例としての第１及び第２ハッシュ値管理テーブルを使用することにより、ＬＵＮ間に亘って使われるデータのエントリがハッシュ値管理テーブルから消去されることを防止できる。また、或るＬＵＮで使われるデータのハッシュ値管理テーブルにおけるエントリが、他のＬＵＮで使われるデータの影響で消去されることを防止できる。 By using the first and second hash value management tables as an example of the embodiment, it is possible to prevent data entries used between LUNs from being deleted from the hash value management table. Further, it is possible to prevent an entry in the hash value management table of data used in a certain LUN from being erased due to the influence of data used in another LUN.

〔Ａ−３〕まとめ
上述した実施形態の一例としてのストレージ装置１によれば、例えば、以下の効果を奏することができる。 [A-3] Summary According to the storage device 1 as an example of the embodiment described above, for example, the following effects can be obtained.

第１判定部１３は、入力データが複数の仮想ボリューム３０ａに亘って重複して書き込まれているかを判定する。また、第２判定部１６は、第１判定部１３によって入力データが重複して書き込まれていないと判定された場合に、第２ハッシュ値管理テーブルを用いて、入力データが対応する仮想ボリューム３０ａに書き込まれているかを判定する。そして、個別データ登録部１７は、第２判定部１６によって入力データが対応する仮想ボリューム３０ａに書き込まれていないと判定された場合に、第２ハッシュ値管理テーブルに入力データについての情報を登録する。 The first determination unit 13 determines whether input data is written redundantly over a plurality of virtual volumes 30a. In addition, when the first determination unit 13 determines that the input data has not been written in duplicate, the second determination unit 16 uses the second hash value management table to correspond to the virtual volume 30a to which the input data corresponds. Is written in The individual data registration unit 17 registers information about the input data in the second hash value management table when the second determination unit 16 determines that the input data has not been written in the corresponding virtual volume 30a. .

これにより、重複除去に関する情報をメモリに効率的に格納することができる。 As a result, information related to deduplication can be efficiently stored in the memory.

書き込み部１８は、個別データ登録部１７によって登録された情報に係る入力データを、対応する仮想ボリューム３０ａに書き込む。 The writing unit 18 writes the input data related to the information registered by the individual data registration unit 17 to the corresponding virtual volume 30a.

書き込み部１８は、入力データが複数の仮想ボリューム３０ａに亘って重複して書き込まれている場合には、第２判定部１６による判定と個別データ登録部１７による登録とを行なわずに、入力データを、対応する仮想ボリューム３０ａに書き込む。 The writing unit 18 does not perform the determination by the second determination unit 16 and the registration by the individual data registration unit 17 when the input data is written redundantly across the plurality of virtual volumes 30a. Are written into the corresponding virtual volume 30a.

これらにより、重複除去を行ないつつ、必要なデータを記憶させることができるため、ストレージ装置１の効率を向上させることができる。 As a result, necessary data can be stored while deduplication is being performed, so that the efficiency of the storage apparatus 1 can be improved.

第１判定部１３は、第１ハッシュ値管理テーブルを用いて判定を行なう。 The first determination unit 13 performs determination using the first hash value management table.

これにより、第１ハッシュ値管理テーブルと第２ハッシュ値管理テーブルとの２つのハッシュ値管理テーブルを使用することで、重複するブロックの情報をブロックの特性に合わせて効率的に管理することができる。そして、所定の領域サイズにおいて高い重複除去率を達成することができる。 Thereby, by using the two hash value management tables of the first hash value management table and the second hash value management table, it is possible to efficiently manage the information of overlapping blocks according to the characteristics of the blocks. . A high deduplication rate can be achieved in a predetermined area size.

複数の仮想ボリューム３０ａに亘って重複して書き込まれるデータは、ＯＳによって使用されるデータである。また、複数の仮想ボリューム３０ａのいずれかに書き込まれるデータは、個別のアプリケーションによって使用されるデータである。 Data that is written redundantly across a plurality of virtual volumes 30a is data used by the OS. The data written to any of the plurality of virtual volumes 30a is data used by individual applications.

これにより、データの使用目的に合わせた、効率的な重複除去を行なうことができる。 Thereby, efficient deduplication can be performed according to the purpose of use of data.

共通データ登録部１１は、アプリケーションが起動していない状態において、重複して書き込まれるデータについての情報を第１ハッシュ値管理テーブルに登録することによって、第１ハッシュ値管理テーブルを生成する。 The common data registration unit 11 generates a first hash value management table by registering information about data to be written redundantly in the first hash value management table when the application is not activated.

共通データ登録部１１は、アプリケーションが起動している状態において、複数の仮想ボリューム３０ａに亘って重複して書き込まれているデータについての情報を第１ハッシュ値管理テーブルに登録することによって、第１ハッシュ値管理テーブルを生成する。 The common data registration unit 11 registers the information about the data that is written over the plurality of virtual volumes 30a in the first hash value management table in a state where the application is activated, thereby Generate a hash value management table.

これらにより、ストレージシステム１００で共通に使用されるデータについての情報を、適切に第１ハッシュ値管理テーブルに登録することができる。 As a result, information about data commonly used in the storage system 100 can be appropriately registered in the first hash value management table.

〔Ｂ〕その他
開示の技術は上述した実施形態に限定されるものではなく、本実施形態の趣旨を逸脱しない範囲で種々変形して実施することができる。本実施形態の各構成及び各処理は、必要に応じて取捨選択することができ、あるいは適宜組み合わせてもよい。 [B] Others The disclosed technology is not limited to the above-described embodiment, and various modifications can be made without departing from the spirit of the present embodiment. Each structure and each process of this embodiment can be selected as needed, or may be combined suitably.

〔Ｃ〕付記
（付記１）
第１の仮想ボリュームと第２の仮想ボリュームとを展開するストレージ装置に備えられる制御装置であって、
第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているかを判定する第１判定部と、
前記第１判定部によって前記第１のデータが重複して書き込まれていないと判定された場合に、前記第１又は第２の仮想ボリュームに書き込まれているデータを示す個別データ情報を用いて、前記第１のデータが前記第１又は第２の仮想ボリュームに書き込まれているかを判定する第２判定部と、
前記第２判定部によって前記第１のデータが前記第１及び第２の仮想ボリュームのいずれにも書き込まれていないと判定された場合に、前記個別データ情報に前記第１のデータについての情報を登録する個別データ登録部と、
を備える、制御装置。 [C] Appendix (Appendix 1)
A control device provided in a storage device for expanding a first virtual volume and a second virtual volume,
A first determination unit that determines whether or not first data is written redundantly across the first and second virtual volumes;
When it is determined by the first determination unit that the first data has not been written in duplicate, using the individual data information indicating the data written in the first or second virtual volume, A second determination unit that determines whether the first data is written to the first or second virtual volume;
When the second determination unit determines that the first data has not been written in any of the first and second virtual volumes, information about the first data is added to the individual data information. An individual data registration unit to be registered;
A control device comprising:

（付記２）
前記個別データ登録部によって登録された前記情報に係る前記第１のデータを、前記第１又は第２仮想ボリュームに書き込む書き込み部を更に備える、
付記１に記載の制御装置。 (Appendix 2)
A writing unit for writing the first data related to the information registered by the individual data registration unit to the first or second virtual volume;
The control device according to attachment 1.

（付記３）
前記ストレージ装置は、前記第１及び第２の仮想ボリュームとは異なる第３の仮想ボリュームを更に展開し、
前記第１判定部によって前記第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれていると判定された場合には、前記第２判定部による前記判定と前記個別データ登録部による前記登録とを行なわずに、前記第１のデータを、前記第１のデータの格納先である前記第３の仮想ボリュームに書き込む書き込み部を更に備える、
付記１に記載の制御装置。 (Appendix 3)
The storage apparatus further expands a third virtual volume different from the first and second virtual volumes,
When it is determined by the first determination unit that the first data has been written over the first and second virtual volumes, the determination by the second determination unit and the individual A writing unit that writes the first data to the third virtual volume that is the storage destination of the first data without performing the registration by the data registration unit;
The control device according to attachment 1.

（付記４）
前記第１判定部は、前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータを示す共通データ情報を用いて、前記判定を行なう、
付記１〜３のいずれか１項に記載の制御装置。 (Appendix 4)
The first determination unit performs the determination using common data information indicating data that is written redundantly across the first and second virtual volumes.
The control device according to any one of appendices 1 to 3.

（付記５）
前記第１及び第２の仮想ボリュームに亘って重複して書き込まれるデータは、前記ストレージ装置の全体の動作を制御するオペレーティングシステムによって使用されるデータであり、
前記第１及び第２の仮想ボリュームのいずれか一方に書き込まれるデータは、前記ストレージ装置において動作する個別のアプリケーションによって使用されるデータである、
付記４に記載の制御装置。 (Appendix 5)
The data written redundantly across the first and second virtual volumes is data used by an operating system that controls the overall operation of the storage device,
The data written to one of the first and second virtual volumes is data used by an individual application operating in the storage device.
The control device according to appendix 4.

（付記６）
前記アプリケーションが起動していない状態において、重複して書き込まれるデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する共通データ登録部を更に備える、
付記５に記載の制御装置。 (Appendix 6)
A common data registration unit that generates the common data information by registering, in the common data information, information about data that is written in duplicate in a state where the application is not activated;
The control device according to appendix 5.

（付記７）
前記アプリケーションが起動している状態において、前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する共通データ登録部を更に備える、
付記５に記載の制御装置。 (Appendix 7)
The common data information is generated by registering, in the common data information, information about data that has been written over the first and second virtual volumes while the application is running. A common data registration unit
The control device according to appendix 5.

（付記８）
第１の仮想ボリュームと第２の仮想ボリュームとを展開するストレージ装置であって、
第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているかを判定する第１判定部と、
前記第１判定部によって前記第１のデータが重複して書き込まれていないと判定された場合に、前記第１又は第２の仮想ボリュームに書き込まれているデータを示す個別データ情報を用いて、前記第１のデータが前記第１又は第２の仮想ボリュームに書き込まれているかを判定する第２判定部と、
前記第２判定部によって前記第１のデータが前記第１及び第２の仮想ボリュームのいずれにも書き込まれていないと判定された場合に、前記個別データ情報に前記第１のデータについての情報を登録する個別データ登録部と、
を備える、ストレージ装置。 (Appendix 8)
A storage device that expands a first virtual volume and a second virtual volume,
A first determination unit that determines whether or not first data is written redundantly across the first and second virtual volumes;
When it is determined by the first determination unit that the first data has not been written in duplicate, using the individual data information indicating the data written in the first or second virtual volume, A second determination unit that determines whether the first data is written to the first or second virtual volume;
When the second determination unit determines that the first data has not been written in any of the first and second virtual volumes, information about the first data is added to the individual data information. An individual data registration unit to be registered;
A storage apparatus.

（付記９）
前記個別データ登録部によって登録された前記情報に係る前記第１のデータを、前記第１又は第２仮想ボリュームに書き込む書き込み部を更に備える、
付記８に記載のストレージ装置。 (Appendix 9)
A writing unit for writing the first data related to the information registered by the individual data registration unit to the first or second virtual volume;
The storage apparatus according to appendix 8.

（付記１０）
当該ストレージ装置は、前記第１及び第２の仮想ボリュームとは異なる第３の仮想ボリュームを更に展開し、
前記第１判定部によって前記第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれていると判定された場合には、前記第２判定部による前記判定と前記個別データ登録部による前記登録とを行なわずに、前記第１のデータを、前記第１のデータの格納先である前記第３の仮想ボリュームに書き込む書き込み部を更に備える、
付記８に記載のストレージ装置。 (Appendix 10)
The storage apparatus further expands a third virtual volume different from the first and second virtual volumes,
When it is determined by the first determination unit that the first data has been written over the first and second virtual volumes, the determination by the second determination unit and the individual A writing unit that writes the first data to the third virtual volume that is the storage destination of the first data without performing the registration by the data registration unit;
The storage apparatus according to appendix 8.

（付記１１）
前記第１判定部は、前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータを示す共通データ情報を用いて、前記判定を行なう、
付記８〜１０のいずれか１項に記載のストレージ装置。 (Appendix 11)
The first determination unit performs the determination using common data information indicating data that is written redundantly across the first and second virtual volumes.
The storage device according to any one of appendices 8 to 10.

（付記１２）
前記第１及び第２の仮想ボリュームに亘って重複して書き込まれるデータは、当該ストレージ装置の全体の動作を制御するオペレーティングシステムによって使用されるデータであり、
前記第１及び第２の仮想ボリュームのいずれか一方に書き込まれるデータは、当該ストレージ装置において動作する個別のアプリケーションによって使用されるデータである、
付記１１に記載のストレージ装置。 (Appendix 12)
The data that is written over the first and second virtual volumes is data used by an operating system that controls the overall operation of the storage device,
The data written to one of the first and second virtual volumes is data used by an individual application operating in the storage device.
The storage device according to attachment 11.

（付記１３）
前記アプリケーションが起動していない状態において、重複して書き込まれるデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する共通データ登録部を更に備える、
付記１２に記載のストレージ装置。 (Appendix 13)
A common data registration unit that generates the common data information by registering, in the common data information, information about data that is written in duplicate in a state where the application is not activated;
The storage device according to attachment 12.

（付記１４）
前記アプリケーションが起動している状態において、前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する共通データ登録部を更に備える、
付記１２に記載のストレージ装置。 (Appendix 14)
The common data information is generated by registering, in the common data information, information about data that has been written over the first and second virtual volumes while the application is running. A common data registration unit
The storage device according to attachment 12.

（付記１５）
第１の仮想ボリュームと第２の仮想ボリュームとを展開するストレージ装置に備えられるコンピュータに、
第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているかを判定し、
前記第１のデータが重複して書き込まれていないと判定された場合に、前記第１又は第２の仮想ボリュームに書き込まれているデータを示す個別データ情報を用いて、前記第１のデータが前記第１又は第２の仮想ボリュームに書き込まれているかを判定し、
前記第１のデータが前記第１及び第２の仮想ボリュームのいずれにも書き込まれていないと判定された場合に、前記個別データ情報に前記第１のデータについての情報を登録する、
処理を実行させる、データ管理プログラム。 (Appendix 15)
In a computer provided in a storage device that expands the first virtual volume and the second virtual volume,
Determining whether first data is written redundantly across the first and second virtual volumes;
When it is determined that the first data is not written redundantly, the first data is stored using the individual data information indicating the data written to the first or second virtual volume. Determine whether it is written to the first or second virtual volume;
Registering information about the first data in the individual data information when it is determined that the first data has not been written to any of the first and second virtual volumes;
Data management program that executes processing.

（付記１６）
前記登録された前記情報に係る前記第１のデータを、前記第１又は第２仮想ボリュームに書き込む、
処理を前記コンピュータに実行させる、付記１５に記載のデータ管理プログラム。 (Appendix 16)
Writing the first data relating to the registered information to the first or second virtual volume;
The data management program according to appendix 15, which causes the computer to execute processing.

（付記１７）
前記ストレージ装置は、前記第１及び第２の仮想ボリュームとは異なる第３の仮想ボリュームを更に展開し、
前記第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれている場合には、前記個別データ情報を用いた前記判定と前記個別データ情報への前記登録とを行なわずに、前記第１のデータを、前記第１のデータの格納先である前記第３の仮想ボリュームに書き込む、
処理を前記コンピュータに実行させる、付記１５に記載のデータ管理プログラム。 (Appendix 17)
The storage apparatus further expands a third virtual volume different from the first and second virtual volumes,
If the first data has been written over the first and second virtual volumes, the determination using the individual data information and the registration to the individual data information are performed. Without writing the first data to the third virtual volume that is the storage destination of the first data,
The data management program according to appendix 15, which causes the computer to execute processing.

（付記１８）
前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータを示す共通データ情報を用いて、前記判定を行なう、
処理を前記コンピュータに実行させる、付記１５〜１７のいずれか１項に記載のデータ管理プログラム。 (Appendix 18)
The determination is performed using common data information indicating data that has been written over the first and second virtual volumes.
The data management program according to any one of appendices 15 to 17, which causes the computer to execute processing.

（付記１９）
前記第１及び第２の仮想ボリュームに亘って重複して書き込まれるデータは、前記ストレージ装置の全体の動作を制御するオペレーティングシステムによって使用されるデータであり、
前記第１及び第２の仮想ボリュームのいずれか一方に書き込まれるデータは、前記ストレージ装置において動作する個別のアプリケーションによって使用されるデータである、
付記１８に記載のデータ管理プログラム。 (Appendix 19)
The data written redundantly across the first and second virtual volumes is data used by an operating system that controls the overall operation of the storage device,
The data written to one of the first and second virtual volumes is data used by an individual application operating in the storage device.
The data management program according to appendix 18.

（付記２０）
前記アプリケーションが起動していない状態において、重複して書き込まれるデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する、
処理を前記コンピュータに実行させる、付記１９に記載のデータ管理プログラム。 (Appendix 20)
In the state where the application is not activated, the common data information is generated by registering in the common data information information about data to be written redundantly.
The data management program according to appendix 19, which causes the computer to execute processing.

（付記２１）
前記アプリケーションが起動している状態において、前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する、
処理を前記コンピュータに実行させる、付記１９に記載のデータ管理プログラム。 (Appendix 21)
The common data information is generated by registering, in the common data information, information about data that has been written over the first and second virtual volumes while the application is running. To
The data management program according to appendix 19, which causes the computer to execute processing.

（付記２２）
第１の仮想ボリュームと第２の仮想ボリュームとを展開するストレージ装置において、
第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているかを判定し、
前記第１のデータが重複して書き込まれていないと判定された場合に、前記第１又は第２の仮想ボリュームに書き込まれているデータを示す個別データ情報を用いて、前記第１のデータが前記第１又は第２の仮想ボリュームに書き込まれているかを判定し、
前記第１のデータが前記第１及び第２の仮想ボリュームのいずれにも書き込まれていないと判定された場合に、前記個別データ情報に前記第１のデータについての情報を登録する、
データ管理方法。 (Appendix 22)
In the storage device that expands the first virtual volume and the second virtual volume,
Determining whether first data is written redundantly across the first and second virtual volumes;
When it is determined that the first data is not written redundantly, the first data is stored using the individual data information indicating the data written to the first or second virtual volume. Determine whether it is written to the first or second virtual volume;
Registering information about the first data in the individual data information when it is determined that the first data has not been written to any of the first and second virtual volumes;
Data management method.

（付記２３）
前記登録された前記情報に係る前記第１のデータを、前記第１又は第２仮想ボリュームに書き込む、
付記２２に記載のデータ管理方法。 (Appendix 23)
Writing the first data relating to the registered information to the first or second virtual volume;
The data management method according to attachment 22.

（付記２４）
前記ストレージ装置は、前記第１及び第２の仮想ボリュームとは異なる第３の仮想ボリュームを更に展開し、
前記第１のデータが前記第１及び第２の仮想ボリュームに亘って重複して書き込まれている場合には、前記個別データ情報を用いた前記判定と前記個別データ情報への前記登録とを行なわずに、前記第１のデータを、前記第１のデータの格納先である前記第３の仮想ボリュームに書き込む、
付記２２に記載のデータ管理方法。 (Appendix 24)
The storage apparatus further expands a third virtual volume different from the first and second virtual volumes,
If the first data has been written over the first and second virtual volumes, the determination using the individual data information and the registration to the individual data information are performed. Without writing the first data to the third virtual volume that is the storage destination of the first data,
The data management method according to attachment 22.

（付記２５）
前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータを示す共通データ情報を用いて、前記判定を行なう、
付記２２〜２４のいずれか１項に記載のデータ管理方法。 (Appendix 25)
The determination is performed using common data information indicating data that has been written over the first and second virtual volumes.
The data management method according to any one of appendices 22 to 24.

（付記２６）
前記第１及び第２の仮想ボリュームに亘って重複して書き込まれるデータは、前記ストレージ装置の全体の動作を制御するオペレーティングシステムによって使用されるデータであり、
前記第１及び第２の仮想ボリュームのいずれか一方に書き込まれるデータは、前記ストレージ装置において動作する個別のアプリケーションによって使用されるデータである、
付記２５に記載のデータ管理方法。 (Appendix 26)
The data written redundantly across the first and second virtual volumes is data used by an operating system that controls the overall operation of the storage device,
The data written to one of the first and second virtual volumes is data used by an individual application operating in the storage device.
The data management method according to attachment 25.

（付記２７）
前記アプリケーションが起動していない状態において、重複して書き込まれるデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する、
付記２６に記載のデータ管理方法。 (Appendix 27)
In the state where the application is not activated, the common data information is generated by registering in the common data information information about data to be written redundantly.
The data management method according to attachment 26.

（付記２８）
前記アプリケーションが起動している状態において、前記第１及び第２の仮想ボリュームに亘って重複して書き込まれているデータについての情報を前記共通データ情報に登録することによって、前記共通データ情報を生成する、
付記２６に記載のデータ管理方法。 (Appendix 28)
The common data information is generated by registering, in the common data information, information about data that has been written over the first and second virtual volumes while the application is running. To
The data management method according to attachment 26.

１００ストレージシステム
１ストレージ装置
１ａ制御装置
１０ＣＰＵ
１０ａ第１段の重複除去部
１０ｂ第２段の重複除去部
１１共通データ登録部
１２第１データ入力部
１３第１判定部
１４通信部
１５第２データ入力部
１６第２判定部
１７個別データ登録部
１８書き込み部
２０メモリ
３０記憶装置
３０ａ仮想ボリューム
４サーバ装置 DESCRIPTION OF SYMBOLS 100 Storage system 1 Storage apparatus 1a Control apparatus 10 CPU
DESCRIPTION OF SYMBOLS 10a 1st stage duplication removal part 10b 2nd stage duplication removal part 11 Common data registration part 12 1st data input part 13 1st determination part 14 Communication part 15 2nd data input part 16 2nd determination part 17 Individual data registration Unit 18 writing unit 20 memory 30 storage device 30a virtual volume 4 server device

Claims

A control device provided in a storage device for expanding a first virtual volume and a second virtual volume,
A first determination unit that determines whether or not first data is written redundantly across the first and second virtual volumes;
When it is determined by the first determination unit that the first data has not been written in duplicate, using the individual data information indicating the data written in the first or second virtual volume, A second determination unit that determines whether the first data is written to the first or second virtual volume;
When the second determination unit determines that the first data has not been written in any of the first and second virtual volumes, information about the first data is added to the individual data information. An individual data registration unit to be registered;
A control device comprising:

A writing unit for writing the first data related to the information registered by the individual data registration unit to the first or second virtual volume;
The control device according to claim 1.

The storage apparatus further expands a third virtual volume different from the first and second virtual volumes,
When it is determined by the first determination unit that the first data has been written over the first and second virtual volumes, the determination by the second determination unit and the individual A writing unit that writes the first data to the third virtual volume that is the storage destination of the first data without performing the registration by the data registration unit;
The control device according to claim 1.

The first determination unit performs the determination using common data information indicating data that is written redundantly across the first and second virtual volumes.
The control device according to claim 1.

The data written redundantly across the first and second virtual volumes is data used by an operating system that controls the overall operation of the storage device,
The data written to one of the first and second virtual volumes is data used by an individual application operating in the storage device.
The control device according to claim 4.

A common data registration unit that generates the common data information by registering, in the common data information, information about data that is written in duplicate in a state where the application is not activated;
The control device according to claim 5.

The common data information is generated by registering, in the common data information, information about data that has been written over the first and second virtual volumes while the application is running. A common data registration unit
The control device according to claim 5.

A storage device that expands a first virtual volume and a second virtual volume,
A first determination unit that determines whether or not first data is written redundantly across the first and second virtual volumes;
When it is determined by the first determination unit that the first data has not been written in duplicate, using the individual data information indicating the data written in the first or second virtual volume, A second determination unit that determines whether the first data is written to the first or second virtual volume;
When the second determination unit determines that the first data has not been written in any of the first and second virtual volumes, information about the first data is added to the individual data information. An individual data registration unit to be registered;
A storage apparatus.

In a computer provided in a storage device that expands the first virtual volume and the second virtual volume,
Determining whether first data is written redundantly across the first and second virtual volumes;
When it is determined that the first data is not written redundantly, the first data is stored using the individual data information indicating the data written to the first or second virtual volume. Determine whether it is written to the first or second virtual volume;
Registering information about the first data in the individual data information when it is determined that the first data has not been written to any of the first and second virtual volumes;
Data management program that executes processing.

In the storage device that expands the first virtual volume and the second virtual volume,
Determining whether first data is written redundantly across the first and second virtual volumes;
When it is determined that the first data is not written redundantly, the first data is stored using the individual data information indicating the data written to the first or second virtual volume. Determine whether it is written to the first or second virtual volume;
Registering information about the first data in the individual data information when it is determined that the first data has not been written to any of the first and second virtual volumes;
Data management method.