JP5832592B1

JP5832592B1 - Data management device

Info

Publication number: JP5832592B1
Application number: JP2014114060A
Authority: JP
Inventors: 翔一朗櫻井
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2014-06-02
Filing date: 2014-06-02
Publication date: 2015-12-16
Anticipated expiration: 2034-06-02
Also published as: JP2015228165A

Abstract

【課題】キーバリューストア方式のデータ領域を用いて、ロールバック処理や排他制御を実現することを目的とする。【解決手段】インメモリデータベース１２０は、キーに対応する間接レコードを記憶する間接レコード記憶部１２２と、間接レコードに対応するマスタレコードを記憶するマスタレコード記憶部１２３と、マスタレコードに対応するリーフレコードを記憶するリーフレコード記憶部１２４と、リーフレコードに対応する実データを記憶する実データ記憶部１２５とを備える。間接レコードは、対応するマスタレコードが記憶された位置と、関連する他の間接レコードが記憶された位置とを示すポインタとを含み、マスタレコードは、対応するリーフレコードが記憶された位置を示すポインタを含み、リーフレコードは、対応する実データが記憶された位置と、次の改訂版の実データに対応するリーフレコードが記憶された位置とを示すポインタを含む。【選択図】図７An object of the present invention is to realize rollback processing and exclusive control using a key-value store data area. An in-memory database 120 includes an indirect record storage unit 122 that stores an indirect record corresponding to a key, a master record storage unit 123 that stores a master record corresponding to the indirect record, and a leaf record corresponding to the master record. Are stored, and a real data storage unit 125 stores actual data corresponding to the leaf records. The indirect record includes a pointer indicating a position where the corresponding master record is stored and a position where another related indirect record is stored, and the master record is a pointer indicating a position where the corresponding leaf record is stored. The leaf record includes a pointer indicating the position where the corresponding actual data is stored and the position where the leaf record corresponding to the next revised version of the actual data is stored. [Selection] Figure 7

Description

この発明は、キーに対応させてデータを記憶するキーバリューストア方式（ＫＶＳ方式）によるデータ管理技術に関する。 The present invention relates to a data management technique based on a key-value store system (KVS system) that stores data corresponding to a key.

識別情報であるキーと、データ（バリュー）とをペアにして記憶するＫＶＳ方式というデータ管理方式がある。ＫＶＳ方式では、キーを指定することにより、対応するデータを取得することができる。 There is a data management method called a KVS method in which a key that is identification information and data (value) are stored in pairs. In the KVS system, corresponding data can be acquired by designating a key.

分けることのできない一連の処理（以下、サービス）をトランザクションという。
トランザクションの途中のサービスでエラーが発生した場合、それまでに実行されたサービスで更新されたデータを元の状態に戻すロールバック処理が実行される。また、あるトランザクションで更新されたデータが、そのトランザクションの処理中に他のトランザクションによって更新されないように、排他制御が行われる。 A series of processes (hereinafter referred to as services) that cannot be divided is called a transaction.
When an error occurs in a service in the middle of a transaction, a rollback process is performed to return data updated by the service executed so far to the original state. Further, exclusive control is performed so that data updated in a certain transaction is not updated by another transaction during the processing of the transaction.

インメモリデータベースという、データをコンピュータのメインメモリに展開し動作するデータベースがある。インメモリデータベースを用いると、データをハードディスク上に展開するディスク型データベースと比較して、高速に処理ができる。 There is an in-memory database that operates by expanding data in the main memory of a computer. If an in-memory database is used, data can be processed at a higher speed than a disk-type database in which data is developed on a hard disk.

特開２０１３−３３３４５号公報JP 2013-33345 A

ＫＶＳ方式のデータ領域を用いてトランザクションを処理する場合、ＫＶＳ方式のデータ領域以外に、キュー構造を有するメモリ領域を利用しなければ、ロールバック処理や排他制御を実現することができなかった。
この発明は、ＫＶＳ方式のデータ管理を行う場合に、キュー構造を有するメモリ領域を利用することなく、ロールバック処理や排他制御を実現可能とすることを目的とする。 When a transaction is processed using a KVS data area, rollback processing and exclusive control cannot be realized unless a memory area having a queue structure is used in addition to the KVS data area.
An object of the present invention is to enable rollback processing and exclusive control without using a memory area having a queue structure when performing KVS data management.

この発明に係るデータ管理装置は、
キーに対応させて実データを記憶するキーバリューストア方式によるデータ管理を行うデータ管理装置であり、
キーに対応する間接レコードを記憶する間接レコード記憶部と、
間接レコードに対応するリーフレコードを記憶するリーフレコード記憶部とを備え、
前記間接レコードは、対応するリーフレコードが記憶された位置を示すマスタポインタと、関連する他の間接レコードが記憶された位置を示す間接ポインタとを含み、
前記リーフレコードは、対応する実データの情報と、次の改訂版の実データに対応するリーフレコードが記憶された位置を示すリーフポインタとを含む
ことを特徴とする。 The data management device according to the present invention is:
A data management device that performs data management by a key-value store method that stores actual data corresponding to a key,
An indirect record storage unit for storing an indirect record corresponding to the key;
A leaf record storage unit for storing leaf records corresponding to indirect records,
The indirect record includes a master pointer indicating a position where a corresponding leaf record is stored, and an indirect pointer indicating a position where another related indirect record is stored;
The leaf record includes information of corresponding actual data and a leaf pointer indicating a position where a leaf record corresponding to the next revised version of actual data is stored.

この発明に係るデータ管理装置は、データ構造を工夫することにより、キュー構造を有するメモリ領域を利用することなく、ロールバック処理や排他制御を実現可能である。 The data management apparatus according to the present invention can realize rollback processing and exclusive control without using a memory area having a queue structure by devising a data structure.

実施の形態１に係る分散システム１０の構成図。1 is a configuration diagram of a distributed system 10 according to Embodiment 1. FIG. 実施の形態１に係るデータ管理装置１００の構成図。1 is a configuration diagram of a data management apparatus 100 according to Embodiment 1. FIG. 分散システム１０によるトランザクションの処理の説明図。FIG. 3 is an explanatory diagram of transaction processing by the distributed system 10. 図３に示すトランザクションの処理におけるデータ状態推移を示す図。The figure which shows the data state transition in the process of the transaction shown in FIG. ロールバック処理におけるデータ状態推移を示す図。The figure which shows the data state transition in a rollback process. インメモリデータベース１２０の基本構造図。The basic structure figure of the in-memory database 120. FIG. インメモリデータベース１２０の構成図。The block diagram of the in-memory database 120. FIG. コミット処理及びロールバック処理におけるリーフレコードの状態推移を示す図。The figure which shows the state transition of the leaf record in a commit process and a rollback process. チャンクポインタ及びデータコピーの説明図。Explanatory drawing of a chunk pointer and data copy. 実施の形態１に示したデータ管理装置１００のハードウェア構成の例を示す図。FIG. 3 is a diagram illustrating an example of a hardware configuration of the data management apparatus 100 illustrated in the first embodiment.

実施の形態１．
最初に本実施の形態において用いる用語について、説明する。
スラブとは、インメモリデータベース内のデータの固まりを意味し、チャンクとは、スラブ内のデータの固まりを意味する。チャンクポインタとは、インメモリデータベースにおいて予め定められた基準アドレスを基準とした相対的な位置を示す論理アドレス情報である。 Embodiment 1 FIG.
First, terms used in the present embodiment will be described.
A slab means a chunk of data in the in-memory database, and a chunk means a chunk of data in the slab. The chunk pointer is logical address information indicating a relative position with reference to a predetermined reference address in the in-memory database.

図１は、実施の形態１に係る分散システム１０の構成図である。
分散システム１０は、複数のデータ管理装置１００を備える。各データ管理装置１００は、サービスバス２０によって接続されている。
分散システム１０は、外部からトランザクションの実行命令を受信する。トランザクションには、実行順序の定められた複数のサービス（アプリケーション）が含まれている。各サービスは、どのデータ管理装置１００によって実行されるかと、入出力のデータ構造とが定められている。分散システム１０は、実行命令によって指定されたトランザクションの各サービスを、定められた実行順序に従い、定められたデータ管理装置１００により実行する。 FIG. 1 is a configuration diagram of a distributed system 10 according to the first embodiment.
The distributed system 10 includes a plurality of data management devices 100. Each data management apparatus 100 is connected by a service bus 20.
The distributed system 10 receives a transaction execution command from the outside. A transaction includes a plurality of services (applications) whose execution order is determined. Each service is determined by which data management device 100 and an input / output data structure. The distributed system 10 executes each service of the transaction designated by the execution instruction by the prescribed data management apparatus 100 according to the prescribed execution order.

図２は、実施の形態１に係るデータ管理装置１００の構成図である。
データ管理装置１００は、ＴＰＭ（ＴｒａｎｓａｃｔｉｏｎＰｒｏｃｅｓｓＭｏｎｉｔｏｒ）１１０、インメモリデータベース１２０を備える。
ＴＰＭ１１０は、インメモリデータベース１２０に記憶されたデータを用いて、トランザクションの処理を行う。ＴＰＭ１１０は、実行部１１１、データ管理部１１２、データ送信部１１３を備える。
インメモリデータベース１２０は、ＫＶＳ方式によりデータを管理する。インメモリデータベース１２０は、ハッシュテーブル１２１、間接レコード記憶部１２２、マスタレコード記憶部１２３、リーフレコード記憶部１２４、実データ記憶部１２５を備える。 FIG. 2 is a configuration diagram of the data management apparatus 100 according to the first embodiment.
The data management apparatus 100 includes a TPM (Transaction Process Monitor) 110 and an in-memory database 120.
The TPM 110 performs transaction processing using the data stored in the in-memory database 120. The TPM 110 includes an execution unit 111, a data management unit 112, and a data transmission unit 113.
The in-memory database 120 manages data by the KVS method. The in-memory database 120 includes a hash table 121, an indirect record storage unit 122, a master record storage unit 123, a leaf record storage unit 124, and an actual data storage unit 125.

図３は、分散システム１０によるトランザクションの処理の説明図である。
図３では、分散システム１０は、データ管理装置１００Ａ、データ管理装置１００Ｂ、データ管理装置１００Ｃを備える。データ管理装置１００Ａとデータ管理装置１００Ｂとが稼働系であり、データ管理装置１００Ｃがデータ管理装置１００Ｂのバックアップとなる待機系である。
図３では、分散システム１０は、サービスＡ、サービスＢ、サービスＣという実行順序が定められたトランザクションの実行命令を受信する。サービスＡは、データ管理装置１００Ａで実行され、サービスＢ，Ｃはデータ管理装置１００Ｂで実行されることがトランザクションに定められている。 FIG. 3 is an explanatory diagram of transaction processing by the distributed system 10.
In FIG. 3, the distributed system 10 includes a data management device 100A, a data management device 100B, and a data management device 100C. The data management device 100A and the data management device 100B are active systems, and the data management device 100C is a standby system that serves as a backup for the data management device 100B.
In FIG. 3, the distributed system 10 receives a transaction execution instruction in which the execution order of service A, service B, and service C is determined. It is defined in the transaction that the service A is executed by the data management apparatus 100A and the services B and C are executed by the data management apparatus 100B.

図４は、図３に示すトランザクションの処理におけるデータ状態推移を示す図である。
図４では、各データ管理装置１００のインメモリデータベース１２０に記憶されたデータと、トランザクション処理中に他のデータ管理装置１００へのデータ渡し用に一時記憶されるトランザクションデータとを示す。トランザクションデータは、後述する主記憶装置９０３に記憶される。
図４では、コミット前の更新状態のデータを破線で示し、コミット後の確定状態のデータを実線で示す。 FIG. 4 is a diagram showing data state transition in the transaction processing shown in FIG.
FIG. 4 shows data stored in the in-memory database 120 of each data management device 100 and transaction data temporarily stored for data transfer to another data management device 100 during transaction processing. Transaction data is stored in a main storage device 903 to be described later.
In FIG. 4, the update state data before committing is indicated by a broken line, and the confirmed state data after committing is indicated by a solid line.

（１）分散システム１０がトランザクションの実行命令を受信する。正確には、分散システム１０の各データ管理装置１００がサービスバス２０を介して実行命令を受信する。すると、各データ管理装置１００は、実行命令を参照して、実行順序が１番のサービスＡを実行することになっているか否かを確認する。そして、実行順序が１番のサービスＡを実行することになっているデータ管理装置１００ＡのＴＰＭ１１０が動作を開始し、他のデータ管理装置１００Ｂ及びデータ管理装置１００ＣのＴＰＭ１１０は待機する。 (1) The distributed system 10 receives a transaction execution instruction. Precisely, each data management device 100 of the distributed system 10 receives an execution command via the service bus 20. Then, each data management apparatus 100 refers to the execution instruction and confirms whether or not the service A having the first execution order is to be executed. Then, the TPM 110 of the data management apparatus 100A that is to execute the service A having the first execution order starts operation, and the other data management apparatuses 100B and the TPM 110 of the data management apparatus 100C stand by.

（２）データ管理装置１００Ａの実行部１１１は、トランザクションデータを生成する。そして、実行部１１１は、サービスＡを実行し、データＸを生成する。
（３）データ管理装置１００Ａのデータ管理部１１２は、データＸを更新状態としてデータ管理装置１００Ａのインメモリデータベース１２０に格納する。また、データ管理部１１２は、データＸと、データＸが記憶されたインメモリデータベース１２０における位置を示すチャンクポインタと、データＸがサービスＡによって生成されたことを示す生成元情報とを、トランザクションデータに格納する。
（４）データ管理装置１００Ａのデータ送信部１１３は、トランザクションデータを、実行順序が２番のサービスＢを実行することになっているデータ管理装置１００Ｂへ送信する。 (2) The execution unit 111 of the data management apparatus 100A generates transaction data. Then, the execution unit 111 executes the service A and generates data X.
(3) The data management unit 112 of the data management apparatus 100A stores the data X as an updated state in the in-memory database 120 of the data management apparatus 100A. In addition, the data management unit 112 converts the data X, the chunk pointer indicating the position in the in-memory database 120 in which the data X is stored, and the generation source information indicating that the data X is generated by the service A into transaction data. To store.
(4) The data transmission unit 113 of the data management apparatus 100A transmits the transaction data to the data management apparatus 100B that is to execute the service B having the second execution order.

（５）トランザクションデータを受信すると、データ管理装置１００ＢのＴＰＭ１１０が動作を開始する。データ管理装置１００Ｂの実行部１１１は、サービスＢを実行し、データＹを生成する。
（６）データ管理装置１００Ｂのデータ管理部１１２は、トランザクションデータに含まれるデータＸを更新状態として、チャンクポインタに従いデータ管理装置１００Ｂのインメモリデータベース１２０に格納する。また、データ管理部１１２は、データＹを更新状態としてデータ管理装置１００Ｂのインメモリデータベース１２０に格納する。また、データ管理部１１２は、データＹと、データＹが記憶されたインメモリデータベース１２０における位置を示すチャンクポインタと、データＹがサービスＢによって生成されたことを示す生成元情報とを、トランザクションデータに格納する。
（７）続いて、データ管理装置１００Ｂの実行部１１１は、サービスＣを実行し、データＺを生成する。そして、データ管理装置１００Ｂのデータ管理部１１２は、データＺを更新状態としてデータ管理装置１００Ｂのインメモリデータベース１２０に格納する。また、データ管理部１１２は、データＺと、データＺが記憶されたインメモリデータベース１２０における位置を示すチャンクポインタと、データＺがサービスＣによって生成されたことを示す生成元情報とを、トランザクションデータに格納する。 (5) When transaction data is received, the TPM 110 of the data management apparatus 100B starts operating. The execution unit 111 of the data management apparatus 100B executes the service B and generates data Y.
(6) The data management unit 112 of the data management device 100B sets the data X included in the transaction data as an update state and stores it in the in-memory database 120 of the data management device 100B according to the chunk pointer. Further, the data management unit 112 stores the data Y in the in-memory database 120 of the data management apparatus 100B as an updated state. In addition, the data management unit 112 converts the data Y, the chunk pointer indicating the position in the in-memory database 120 in which the data Y is stored, and the generation source information indicating that the data Y is generated by the service B into transaction data. To store.
(7) Subsequently, the execution unit 111 of the data management apparatus 100B executes the service C and generates data Z. Then, the data management unit 112 of the data management device 100B stores the data Z as an updated state in the in-memory database 120 of the data management device 100B. In addition, the data management unit 112 displays the data Z, the chunk pointer indicating the position in the in-memory database 120 in which the data Z is stored, and the generation source information indicating that the data Z is generated by the service C as transaction data. To store.

（８）データ管理装置１００Ｂの実行部１１１は、サービスＣが完了すると、その旨を実行順序が１番のサービスＡを実行したデータ管理装置１００Ａに通知する。すると、データ管理装置１００Ａのデータ管理部１１２は、コミット処理を行い、インメモリデータベース１２０に記憶されたデータＸを更新状態から確定状態に変更する。
（９）データ管理装置１００Ａのデータ管理部１１２は、コミット処理が完了すると、その旨を実行順序が２番のサービスＢを実行したデータ管理装置１００Ｂに通知する。すると、データ管理装置１００Ｂのデータ管理部１１２は、コミット処理を行い、インメモリデータベース１２０に記憶されたデータＸ，Ｙ，Ｚを更新状態から確定状態に変更する。 (8) When the service C is completed, the execution unit 111 of the data management apparatus 100B notifies the data management apparatus 100A that executed the service A having the first execution order. Then, the data management unit 112 of the data management apparatus 100A performs a commit process, and changes the data X stored in the in-memory database 120 from the updated state to the confirmed state.
(9) When the commit process is completed, the data management unit 112 of the data management apparatus 100A notifies the data management apparatus 100B that has executed the service B having the second execution order. Then, the data management unit 112 of the data management device 100B performs a commit process, and changes the data X, Y, and Z stored in the in-memory database 120 from the updated state to the confirmed state.

（１０）データ管理装置１００Ｃのデータ管理部１１２は、データ管理装置１００Ｂのインメモリデータベース１２０に確定状態のデータＸ，Ｙ，Ｚが新たに記憶されたため、同期処理を行い、データ管理装置１００Ｃのインメモリデータベース１２０に確定状態としてデータＸ，Ｙ，Ｚを格納する。 (10) The data management unit 112 of the data management device 100C performs synchronization processing since the finalized data X, Y, and Z are newly stored in the in-memory database 120 of the data management device 100B. Data X, Y, and Z are stored in the in-memory database 120 as a confirmed state.

以上のように、トランザクションで規定された順にサービスが実行され、全てのサービスが完了した後に、トランザクションで規定された順にコミット処理が実行される。 As described above, services are executed in the order specified in the transaction, and after all services are completed, commit processing is executed in the order specified in the transaction.

図５は、ロールバック処理におけるデータ状態推移を示す図である。
図５では、図４の（７）に示すように、データ管理装置１００Ａのインメモリデータベース１２０にデータＸが格納され、データ管理装置１００Ｂのインメモリデータベース１２０にデータＸ，Ｙ，Ｚが格納された後に、サービスＣでエラーが発生した場合を想定している。そこで、図５では、図４の（７）と、図４の（７）の後の（８’）〜（９’）を示す。 FIG. 5 is a diagram showing data state transition in the rollback processing.
In FIG. 5, as shown in (7) of FIG. 4, data X is stored in the in-memory database 120 of the data management apparatus 100A, and data X, Y, Z is stored in the in-memory database 120 of the data management apparatus 100B. After that, it is assumed that an error occurs in service C. FIG. 5 shows (7) in FIG. 4 and (8 ′) to (9 ′) after (7) in FIG.

（８’）サービスＣでエラーが発生すると、データ管理装置１００Ｂの実行部１１１は、その旨を実行順序が１番のサービスＡを実行したデータ管理装置１００Ａに通知する。すると、データ管理装置１００Ａのデータ管理部１１２は、ロールバック処理を行い、インメモリデータベース１２０に記憶されたデータＸを削除する。
（９’）データ管理装置１００Ａのデータ管理部１１２は、ロールバック処理が完了すると、その旨を実行順序が２番のサービスＢを実行したデータ管理装置１００Ｂに通知する。すると、データ管理装置１００Ｂのデータ管理部１１２は、ロールバック処理を行い、インメモリデータベース１２０に記憶されたデータＸ，Ｙ，Ｚを削除する。その後、データ管理装置１００Ｂのデータ管理部１１２は、トランザクションデータを削除する。 (8 ′) When an error occurs in the service C, the execution unit 111 of the data management apparatus 100B notifies the data management apparatus 100A that executed the service A having the first execution order. Then, the data management unit 112 of the data management apparatus 100A performs rollback processing and deletes the data X stored in the in-memory database 120.
(9 ′) When the rollback process is completed, the data management unit 112 of the data management apparatus 100A notifies the data management apparatus 100B that has executed the service B having the second execution order. Then, the data management unit 112 of the data management device 100B performs rollback processing and deletes the data X, Y, and Z stored in the in-memory database 120. Thereafter, the data management unit 112 of the data management apparatus 100B deletes the transaction data.

以上のように、エラーが発生すると、トランザクションで規定された順にロールバック処理が実行され、インメモリデータベース１２０に格納されたデータが削除される。 As described above, when an error occurs, rollback processing is executed in the order specified by the transaction, and the data stored in the in-memory database 120 is deleted.

図６は、インメモリデータベース１２０の基本構造図である。
インメモリデータベース１２０は、複数のスラブ１３０から構成される。
各スラブ１３０は、複数のチャンク１３１から構成される。 FIG. 6 is a basic structure diagram of the in-memory database 120.
The in-memory database 120 is composed of a plurality of slabs 130.
Each slab 130 is composed of a plurality of chunks 131.

図７は、インメモリデータベース１２０の構成図である。
上述した通り、インメモリデータベース１２０は、ハッシュテーブル１２１、間接レコード記憶部１２２、マスタレコード記憶部１２３、リーフレコード記憶部１２４、実データ記憶部１２５を備える。
ハッシュテーブル１２１、間接レコード記憶部１２２、マスタレコード記憶部１２３、リーフレコード記憶部１２４、実データ記憶部１２５は、それぞれ、インメモリデータベース１２０を構成する一部のチャンク１３１によって構成される。例えば、ハッシュテーブル１２１、間接レコード記憶部１２２、マスタレコード記憶部１２３、リーフレコード記憶部１２４、実データ記憶部１２５は、それぞれ異なるスラブ１３０によって構成される。 FIG. 7 is a configuration diagram of the in-memory database 120.
As described above, the in-memory database 120 includes the hash table 121, the indirect record storage unit 122, the master record storage unit 123, the leaf record storage unit 124, and the actual data storage unit 125.
The hash table 121, the indirect record storage unit 122, the master record storage unit 123, the leaf record storage unit 124, and the actual data storage unit 125 are each configured by a part of the chunk 131 that configures the in-memory database 120. For example, the hash table 121, the indirect record storage unit 122, the master record storage unit 123, the leaf record storage unit 124, and the actual data storage unit 125 are configured by different slabs 130, respectively.

ハッシュテーブル１２１は、一般的なハッシュテーブルであり、キーに対応する実データを高速に検索するための仕組みである。ハッシュテーブル１２１は、ＫＶＳ方式におけるキーに対応して、間接レコード記憶部１２２が記憶された位置を示す間接ポインタ１３２を記憶する。 The hash table 121 is a general hash table and is a mechanism for searching for real data corresponding to a key at high speed. The hash table 121 stores an indirect pointer 132 indicating the position where the indirect record storage unit 122 is stored, corresponding to the key in the KVS system.

間接レコード記憶部１２２は、対応するマスタレコードが記憶された位置を示すマスタポインタ１３３と、関連する他の間接レコードが記憶された位置を示す間接ポインタ１３２とを含む間接レコードを記憶する。
つまり、間接レコードは、イテレータ構造（チャンクポインタが示す実データを順番にアクセスできる構造）になっており、複数の間接レコードが間接ポインタ１３２によって関連付けられている。そのため、ＲＤＢ（ＲｅｌａｔｉｏｎａｌＤａｔａＢａｓｅ）におけるレコードのように、複数のデータ項目を１つのキーに対して対応付けることが可能となる。つまり、ＫＶＳ方式により、ＲＤＢと同様のデータ管理を実現することが可能となる。 The indirect record storage unit 122 stores an indirect record including a master pointer 133 indicating a position where a corresponding master record is stored and an indirect pointer 132 indicating a position where another related indirect record is stored.
That is, the indirect record has an iterator structure (a structure in which the actual data indicated by the chunk pointer can be accessed in order), and a plurality of indirect records are associated by the indirect pointer 132. Therefore, a plurality of data items can be associated with one key like a record in RDB (Relational DataBase). That is, data management similar to RDB can be realized by the KVS method.

マスタレコード記憶部１２３は、対応するリーフレコードが記憶された位置を示すリーフポインタ１３４と、最新の実データの改訂版を示すバージョン情報とを含むマスタレコードを記憶する。 The master record storage unit 123 stores a master record including a leaf pointer 134 indicating the position where the corresponding leaf record is stored and version information indicating a revised version of the latest actual data.

リーフレコード記憶部１２４は、対応する実データが記憶された位置を示すデータポインタ１３５（実データの情報の一例）と、次の改訂版の実データに対応するリーフレコードが記憶された位置を示すリーフポインタ１３４とを含むリーフレコードを記憶する。
つまり、リーフレコードは、イテレータ構造になっており、複数のバージョンの実データに対応するリーフレコードがリーフポインタ１３４によって関連付けられている。そのため、新しいバージョンの実データが格納された場合に、古いバージョンの実データを削除することなく、キーに対応付けたままにすることが可能となる。 The leaf record storage unit 124 indicates a data pointer 135 (an example of actual data information) indicating the position where the corresponding actual data is stored, and a position where the leaf record corresponding to the next revised version of the actual data is stored. A leaf record including the leaf pointer 134 is stored.
That is, the leaf record has an iterator structure, and leaf records corresponding to a plurality of versions of actual data are associated by the leaf pointer 134. Therefore, when a new version of actual data is stored, the old version of actual data can be kept associated with the key without being deleted.

実データ記憶部１２５は、実データ１３６を記憶する。 The actual data storage unit 125 stores actual data 136.

間接レコード記憶部１２２に記憶された間接レコードは、アプリケーションからアクセスされる項目名を有する。つまりアプリケーション側から参照されるのは、間接レコードのみで、間接レコードから関連付けられるその他の記憶部の情報はアプリケーション側からは参照されない。間接レコードを経由しないでアプリケーションがマスタレコードへアクセスした場合、アプリケーションのアクセスにより、マスタレコード、リーフレコード、実データが破壊される可能性がある。間接レコード記憶部１２２がマスタレコード記憶部１２３と同じ構成を有し２重構成にしているのは、そのためであり、万が一、間接レコード記憶部１２２や間接レコードが破壊された場合は、マスタレコード記憶部１２３やマスタレコードから復元可能となる。
マスタレコード記憶部１２３に記憶されたマスタレコードは、実データに対して１つ存在するデータであり、バージョンに関わらない管理単位である。
リーフレコード記憶部１２４に記憶されたリーフレコードは、マスタレコードに対して複数存在し、バージョン毎に管理される。 The indirect record stored in the indirect record storage unit 122 has an item name accessed from the application. That is, only the indirect record is referred to from the application side, and the information of other storage units associated with the indirect record is not referred to from the application side. When the application accesses the master record without going through the indirect record, there is a possibility that the master record, the leaf record, and the actual data are destroyed due to the access of the application. This is why the indirect record storage unit 122 has the same configuration as the master record storage unit 123 and has a double configuration. If the indirect record storage unit 122 or the indirect record is destroyed, the master record storage is performed. It is possible to restore from the section 123 and the master record.
The master record stored in the master record storage unit 123 is one piece of data for the actual data, and is a management unit that is not related to the version.
A plurality of leaf records stored in the leaf record storage unit 124 exist for the master record, and are managed for each version.

間接ポインタ１３２、マスタポインタ１３３、リーフポインタ１３４、データポインタ１３５を総称してチャンクポインタと呼ぶ。
アプリケーション側から参照される間接レコードの項目名は、これらチャンクポインタによって実データ記憶部１２５に記憶された実データと関連づけられる。なおマスタレコード記憶部１２３の構成を省略し、間接レコード記憶部１２２のマスタポインタ１３３がリーフポインタ１３４と同等の動作を行うことで、間接レコードからリーフレコードが関連づけられるように構成することができる。この場合、間接レコードはマスタレコードの代わりにマスタ管理機能を有することになる。
また実データ記憶部１２５および実データポインタ１３５の構成を省略し、リーフレコード記憶部１２４内に実データ１３６（実データの情報の一例）を保持することにより、図７に示す構成による処理と同様の処理を行うことができる。 The indirect pointer 132, the master pointer 133, the leaf pointer 134, and the data pointer 135 are collectively referred to as a chunk pointer.
The item name of the indirect record referred from the application side is associated with the actual data stored in the actual data storage unit 125 by these chunk pointers. The configuration of the master record storage unit 123 is omitted, and the master pointer 133 of the indirect record storage unit 122 performs the same operation as the leaf pointer 134, so that the leaf record can be associated with the indirect record. In this case, the indirect record has a master management function instead of the master record.
Further, the configuration of the actual data storage unit 125 and the actual data pointer 135 is omitted, and the actual data 136 (an example of information on the actual data) is held in the leaf record storage unit 124, thereby performing the same processing as the configuration shown in FIG. Can be processed.

図８は、コミット処理及びロールバック処理におけるリーフレコードの状態推移を示す図である。
上述した通り、トランザクションによって生成されたデータは更新状態として一旦格納され、トランザクションの完了後にコミット処理が実行され確定状態に変更される。一方、コミット処理ではなく、ロールバック処理が実行された場合、更新状態のデータが削除される。
データが更新状態又は確定状態として管理されると説明したが、具体的には、リーフレコードが更新状態又は確定状態として管理される。 FIG. 8 is a diagram showing the state transition of leaf records in the commit process and rollback process.
As described above, data generated by a transaction is temporarily stored as an updated state, and after the completion of the transaction, commit processing is executed and changed to a confirmed state. On the other hand, when the rollback process is executed instead of the commit process, the updated data is deleted.
Although it has been described that data is managed as an updated state or a confirmed state, specifically, leaf records are managed as an updated state or a confirmed state.

図８では、あるキーに対応して、バージョン１とバージョン２との２つのリーフレコードが確定状態になっているときに、実データが更新された場合を想定している。
実データが更新されると、データ管理部１１２は、既に記憶されている実データに上書きすることなく、更新された実データを新たに実データ記憶部１２５に格納する。また、データ管理部１１２は、格納した実データが記憶された位置を示すデータポインタ１３５を含むリーフレコードを更新状態としてリーフレコード記憶部１２４に格納する。さらに、データ管理部１１２は、格納したリーフレコードが記憶された位置を示すリーフポインタ１３４を、確定状態のリーフレコードのうち最新バージョンのリーフレコード（ここでは、バージョン２のリーフレコード）に含める。 In FIG. 8, it is assumed that actual data is updated when two leaf records of version 1 and version 2 are in a definite state corresponding to a certain key.
When the actual data is updated, the data management unit 112 newly stores the updated actual data in the actual data storage unit 125 without overwriting the already stored actual data. Further, the data management unit 112 stores the leaf record including the data pointer 135 indicating the position where the stored actual data is stored in the leaf record storage unit 124 as an updated state. Further, the data management unit 112 includes the leaf pointer 134 indicating the position where the stored leaf record is stored in the latest version of the confirmed leaf records (here, the version 2 leaf record).

その後、コミット処理が実行されると、データ管理部１１２は、更新状態のリーフレコードを確定状態に変更する。また、データ管理部１１２は、マスタレコードに含まれるバージョン情報を最新の版に（ここでは、バージョン２からバージョン３に）更新する。 Thereafter, when the commit process is executed, the data management unit 112 changes the updated leaf record to the confirmed state. Further, the data management unit 112 updates the version information included in the master record to the latest version (here, from version 2 to version 3).

一方、コミット処理ではなく、ロールバック処理が実行されると、データ管理部１１２は、更新状態のリーフレコードと、そのリーフレコードに対応する実データとを削除する。これにより、インメモリデータベース１２０が、トランザクションの実行前の状態に戻る。 On the other hand, when the rollback process is executed instead of the commit process, the data management unit 112 deletes the updated leaf record and the actual data corresponding to the leaf record. As a result, the in-memory database 120 returns to the state before the execution of the transaction.

なお、データ管理部１１２は、更新状態のリーフレコードに対しては、更新元のトランザクション以外からのアクセスを拒絶する。一方、データ管理部１１２は、確定状態のリーフレコードに対しては、どのトランザクションからのアクセスも受理する。但し、データ管理部１１２は、あるキーに対応する更新状態のリーフレコードが存在する場合には、他のトランザクションからそのキーに対応する実データへの参照のみ許可し、更新は拒絶する。これにより、排他制御が実現される。 Note that the data management unit 112 rejects access from other than the update source transaction to the updated leaf record. On the other hand, the data management unit 112 accepts access from any transaction for a leaf record in a confirmed state. However, when there is an updated leaf record corresponding to a certain key, the data management unit 112 permits only reference to actual data corresponding to the key from another transaction, and rejects the update. Thereby, exclusive control is realized.

また、ここでは、データが更新される場合について説明した。しかし、データが新規に格納される場合についても、同様にロールバック処理や排他制御を実現することができる。
但し、データが新規に格納される場合には、実データが実データ記憶部１２５に格納される際、リーフレコード記憶部１２４だけでなく、ハッシュテーブル１２１と間接レコード記憶部１２２とマスタレコード記憶部１２３とにも対応するレコードが格納される。そして、ロールバック処理では、更新状態のリーフレコードと、そのリーフレコードに対応する実データとともに、ハッシュテーブル１２１と間接レコード記憶部１２２とマスタレコード記憶部１２３とに格納された、対応するレコードも削除される。 Here, the case where data is updated has been described. However, even when data is newly stored, rollback processing and exclusive control can be similarly realized.
However, when data is newly stored, when the actual data is stored in the actual data storage unit 125, not only the leaf record storage unit 124 but also the hash table 121, the indirect record storage unit 122, and the master record storage unit 123 also stores a corresponding record. In the rollback processing, the corresponding record stored in the hash table 121, the indirect record storage unit 122, and the master record storage unit 123 is deleted together with the updated leaf record and the actual data corresponding to the leaf record. Is done.

図９は、チャンクポインタ及びデータコピーの説明図である。
チャンクポインタは、上述したとおり、インメモリデータベース１２０において予め定められた基準アドレスを基準とした相対的な位置を示す論理アドレス情報である。
図９では、データ管理装置１００Ａにおいては、インメモリデータベース１２０の“０ｘ００１０”が基準アドレスとされている。ここでは、スラブ１３０とチャンク１３１との番号は、左上から右下へ１から順に数えられるとする。
データＸについてのチャンクポインタは、基準アドレスを基準として、６番目のスラブ１３０における、６番目のチャンク１３１を示している。また、データＹについてのチャンクポインタは、基準アドレスを基準として、１１番目のスラブ１３０における、５番目のチャンク１３１を示している。 FIG. 9 is an explanatory diagram of chunk pointers and data copying.
As described above, the chunk pointer is logical address information indicating a relative position based on a reference address that is predetermined in the in-memory database 120.
In FIG. 9, in the data management apparatus 100A, “0x0010” of the in-memory database 120 is set as the reference address. Here, it is assumed that the numbers of the slab 130 and the chunk 131 are counted sequentially from 1 from the upper left to the lower right.
The chunk pointer for the data X indicates the sixth chunk 131 in the sixth slab 130 with reference to the reference address. The chunk pointer for data Y indicates the fifth chunk 131 in the eleventh slab 130 with reference to the reference address.

図９では、データ管理装置１００Ｂにおいては、インメモリデータベース１２０の“０ｘ０１００”が基準アドレスとされている。この場合に、例えば、データＸ，Ｙがデータ管理装置１００Ａからデータ管理装置１００Ｂにコピーされたとする。すると、データ管理部１１２は、データ管理装置１００Ｂのインメモリデータベース１２０の基準アドレスを基準とする、６番目のスラブ１３０における、６番目のチャンク１３１に、データＸを格納する。同様に、データ管理部１１２は、データ管理装置１００Ｂのインメモリデータベース１２０の基準アドレスを基準とする、１１番目のスラブ１３０における、５番目のチャンク１３１に、データＹを格納する。
つまり、データのコピーが行われる場合、コピー元のデータのチャンクポインタが示す位置にデータが格納される。 In FIG. 9, in the data management apparatus 100B, “0x0100” of the in-memory database 120 is set as the reference address. In this case, for example, it is assumed that the data X and Y are copied from the data management apparatus 100A to the data management apparatus 100B. Then, the data management unit 112 stores the data X in the sixth chunk 131 in the sixth slab 130 based on the reference address of the in-memory database 120 of the data management apparatus 100B. Similarly, the data management unit 112 stores the data Y in the fifth chunk 131 in the eleventh slab 130 based on the reference address of the in-memory database 120 of the data management device 100B.
That is, when data is copied, the data is stored at the position indicated by the chunk pointer of the copy source data.

これにより、データ管理装置１００間のデータコピーの際に、チャンクポインタの再計算等が不要であり、データコピーの処理を高速に行うことが可能である。また、サービスがデータを参照する際も、チャンクポインタを介してデータを参照すれば、動作するデータ管理装置１００を意識せずデータ参照を行うことが可能である。 As a result, when copying data between the data management apparatuses 100, it is not necessary to recalculate the chunk pointer, and the data copy process can be performed at high speed. In addition, when the service refers to data, it is possible to refer to the data without being aware of the operating data management apparatus 100 by referring to the data via the chunk pointer.

以上のように、実施の形態１に係る分散システム１０では、インメモリデータベース１２０のデータ構造を工夫した。これにより、キュー構造を有するメモリ領域を利用することなく、ロールバック処理及び排他制御を実現することが可能である。また、ＲＤＢのように、１つのキーに対して複数のデータ項目を管理することが可能である。 As described above, in the distributed system 10 according to Embodiment 1, the data structure of the in-memory database 120 is devised. As a result, rollback processing and exclusive control can be realized without using a memory area having a queue structure. In addition, like RDB, it is possible to manage a plurality of data items for one key.

また、実施の形態１に係る分散システム１０では、データ管理装置１００間でトランザクションデータを受け渡して、トランザクションを処理し、コミット処理及びロールバック処理を行う。これにより、複数のデータ管理装置１００でトランザクションを処理する場合に、コミット処理及びロールバック処理を実現することが可能である。 In the distributed system 10 according to the first embodiment, the transaction data is transferred between the data management apparatuses 100, the transaction is processed, and the commit process and the rollback process are performed. Thereby, when a transaction is processed by a plurality of data management apparatuses 100, it is possible to realize a commit process and a rollback process.

また、実施の形態１に係る分散システム１０では、サービスの実行順序や、サービスを実行するデータ管理装置１００が実行命令において規定されている。そのため、サービスの実行順序や実行するデータ管理装置１００を容易に変更可能である。
なお、サービスの実行順序や、サービスを実行するデータ管理装置１００は、実行命令内に規定せず、外部の記憶装置等に記憶しておき、実行部１１１に参照させるようにしてもよい。 In the distributed system 10 according to the first embodiment, the execution order of services and the data management apparatus 100 that executes the services are defined in the execution instructions. Therefore, the execution order of services and the data management apparatus 100 to be executed can be easily changed.
It should be noted that the service execution order and the data management apparatus 100 that executes the service may be stored in an external storage device or the like without being defined in the execution command and referred to by the execution unit 111.

また、実施の形態１に係る分散システム１０では、データ管理装置１００間のデータコピーの際に、コピー元のデータのチャンクポインタが示す位置にデータを格納する。これにより、データコピーの際にチャンクポインタの再計算が不要であり、データコピーを高速に行うことが可能である。 In the distributed system 10 according to the first embodiment, when data is copied between the data management apparatuses 100, the data is stored at the position indicated by the chunk pointer of the copy source data. As a result, it is not necessary to recalculate the chunk pointer during data copying, and data copying can be performed at high speed.

図１０は、実施の形態１に示したデータ管理装置１００のハードウェア構成の例を示す図である。
データ管理装置１００は、コンピュータである。データ管理装置１００の各要素をプログラムで実現することができる。
データ管理装置１００のハードウェア構成としては、バスに、演算装置９０１、外部記憶装置９０２、主記憶装置９０３、通信装置９０４、入出力装置９０５が接続されている。 FIG. 10 is a diagram illustrating an example of a hardware configuration of the data management apparatus 100 illustrated in the first embodiment.
The data management device 100 is a computer. Each element of the data management apparatus 100 can be realized by a program.
As a hardware configuration of the data management apparatus 100, an arithmetic device 901, an external storage device 902, a main storage device 903, a communication device 904, and an input / output device 905 are connected to the bus.

演算装置９０１は、プログラムを実行するＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等である。外部記憶装置９０２は、例えばＲＯＭ（ＲｅａｄＯｎｌｙ
Ｍｅｍｏｒｙ）やフラッシュメモリ、ハードディスク装置等である。主記憶装置９０３は、例えばＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等である。通信装置９０４は、例えば通信ボード等である。入出力装置９０５は、例えばマウス、キーボード、ディスプレイ装置等である。
インメモリデータベース１２０は、外部記憶装置９０２や主記憶装置９０３によって構成される。 The arithmetic device 901 is a CPU (Central Processing Unit) that executes a program. For example, the external storage device 902 is a ROM (Read Only).
Memory), flash memory, hard disk device, and the like. The main storage device 903 is, for example, a RAM (Random Access Memory). The communication device 904 is, for example, a communication board. The input / output device 905 is, for example, a mouse, a keyboard, a display device, or the like.
The in-memory database 120 includes an external storage device 902 and a main storage device 903.

プログラムは、通常は外部記憶装置９０２に記憶されており、主記憶装置９０３にロードされた状態で、順次演算装置９０１に読み込まれ、実行される。
プログラムは、実行部１１１、データ管理部１１２、データ送信部１１３として説明している機能を実現するプログラムである。
更に、外部記憶装置９０２にはオペレーティングシステム（ＯＳ）も記憶されており、ＯＳの少なくとも一部が主記憶装置９０３にロードされ、演算装置９０１はＯＳを実行しながら、上記プログラムを実行する。
また、実施の形態１の説明において、インメモリデータベース１２０が記憶するデータ等が主記憶装置９０３にファイルとして記憶されている。 The program is normally stored in the external storage device 902, and is loaded into the main storage device 903 and sequentially read into the arithmetic device 901 and executed.
The program is a program that realizes the functions described as the execution unit 111, the data management unit 112, and the data transmission unit 113.
Furthermore, an operating system (OS) is also stored in the external storage device 902. At least a part of the OS is loaded into the main storage device 903, and the arithmetic device 901 executes the above program while executing the OS.
In the description of the first embodiment, data stored in the in-memory database 120 is stored in the main storage device 903 as a file.

なお、図１０の構成は、あくまでもデータ管理装置１００のハードウェア構成の一例を示すものであり、データ管理装置１００のハードウェア構成は図１０に記載の構成に限らず、他の構成であってもよい。 10 is merely an example of the hardware configuration of the data management apparatus 100, and the hardware configuration of the data management apparatus 100 is not limited to the configuration illustrated in FIG. Also good.

１０分散システム、１００データ管理装置、１１０ＴＰＭ、１１１実行部、１１２データ管理部、１１３データ送信部、１２０インメモリデータベース、１２１ハッシュテーブル、１２２間接レコード記憶部、１２３マスタレコード記憶部、１２４リーフレコード記憶部、１２５実データ記憶部、１３０スラブ、１３１チャンク、１３２間接ポインタ、１３３マスタポインタ、１３４リーフポインタ、１３５データポインタ、１３６実データ。 10 distributed system, 100 data management device, 110 TPM, 111 execution unit, 112 data management unit, 113 data transmission unit, 120 in-memory database, 121 hash table, 122 indirect record storage unit, 123 master record storage unit, 124 leaf record Storage unit, 125 actual data storage unit, 130 slab, 131 chunk, 132 indirect pointer, 133 master pointer, 134 leaf pointer, 135 data pointer, 136 actual data.

Claims

A data management device that performs data management by a key-value store method that stores actual data corresponding to a key,
An indirect record storage unit for storing an indirect record corresponding to the key;
A leaf record storage unit for storing leaf records corresponding to indirect records,
The indirect record includes a master pointer indicating a position where a corresponding leaf record is stored, and an indirect pointer indicating a position where another related indirect record is stored;
The said leaf record contains the information of the corresponding real data, and the leaf pointer which shows the position where the leaf record corresponding to the next revision actual data was memorize | stored.

A data management device that performs data management by a key-value store method that stores actual data corresponding to a key,
An indirect record storage unit for storing an indirect record corresponding to the key;
A master record storage unit for storing a master record corresponding to the indirect record;
A leaf record storage unit for storing a leaf record corresponding to the master record,
The indirect record includes a master pointer indicating a position where a corresponding master record is stored, and an indirect pointer indicating a position where another related indirect record is stored;
The master record includes a leaf pointer indicating a position where a corresponding leaf record is stored;
The said leaf record contains the information of the corresponding real data, and the leaf pointer which shows the position where the leaf record corresponding to the next revision actual data was memorize | stored.

The data management apparatus according to claim 2, wherein the master record further includes version information indicating a revised version of the latest actual data.

The data management device further includes:
A real data storage unit for storing real data corresponding to leaf records is provided.
The data management apparatus according to any one of claims 1 to 3, wherein the leaf record includes a data pointer indicating a position where the corresponding actual data is stored as information of the actual data.

The data management device further includes:
When the actual data is updated, the updated actual data is stored in the actual data storage unit without overwriting the actual data stored in the actual data storage unit, and a leaf record corresponding to the stored actual data is stored. 5. The data management unit according to claim 4, further comprising: a data management unit that stores the leaf pointer in the leaf record corresponding to the actual data before the update, which is stored in the leaf record storage unit and indicates a position where the stored leaf record is stored. Data management device.

When the transaction is executed and the actual data is updated, the data management unit stores the updated actual data in the actual data storage unit, and stores the leaf record corresponding to the updated actual data in the other data An update state that does not allow reference from a transaction is stored in the leaf record storage unit, and when the transaction ends normally, the leaf record is changed to a confirmed state that allows reference from another transaction. The data management device according to claim 5.

The data management unit deletes the updated actual data from the actual data storage unit when the transaction ends abnormally, and deletes the leaf record corresponding to the updated actual data from the leaf record storage unit. The data management device according to claim 6.

8. The data management apparatus according to claim 6, wherein the data management unit rejects updating of actual data corresponding to the key when there is an updated leaf record corresponding to the key.

The data management device further includes:
An execution unit that executes the processing included in the transaction;
When actual data is updated by the process executed by the execution unit, the updated actual data, a data pointer indicating a position where the updated actual data is stored, and a leaf record corresponding to the updated actual data are A data transmission unit that transmits transaction data including a leaf pointer indicating a stored position to another data management device;
When the data management unit receives transaction data from another data management device, the data management unit stores the actual data included in the transaction data at a position indicated by the data pointer included in the transaction data, and includes the data pointer included in the transaction data. 9. The leaf record is stored at a position indicated by a leaf pointer included in the transaction data, and the leaf pointer included in the transaction data is included in the leaf record corresponding to the actual data before the update. The data management apparatus in any one of.

10. The data management apparatus according to claim 1, wherein the master pointer, the indirect pointer, and the leaf pointer indicate relative positions based on a memory reference address.