JP2017027349A

JP2017027349A - Replication program

Info

Publication number: JP2017027349A
Application number: JP2015144903A
Authority: JP
Inventors: 克己松浦; Katsumi Matsuura
Original assignee: Exa Corp
Current assignee: Exa Corp
Priority date: 2015-07-22
Filing date: 2015-07-22
Publication date: 2017-02-02

Abstract

PROBLEM TO BE SOLVED: To suppress data mismatch due to difference of handling of a specific character before and after replicating of a database.SOLUTION: A replication program of the invention is configured to convert zero byte length characters string and a null value on a replication source database, according to handling of the character strings on the replication source database.SELECTED DRAWING: Figure 1

Description

本発明は、データベースのレプリケーションに関する。 The present invention relates to database replication.

データベースのレプリケーションは、あるデータベースと全く同じ内容の複製データベースを作成する処理であり、いわゆるバックアップとは区別される。レプリケーションを利用することにより、例えばマスタデータベースの内容をスレーブデータベースに反映することができる。この手法はデータベースを冗長構成にするとき用いられる。 Database replication is a process of creating a duplicate database having exactly the same contents as a certain database, and is distinguished from so-called backup. By using replication, for example, the contents of the master database can be reflected in the slave database. This technique is used when making a database redundant.

レプリケーションを実施する毎にデータベースの全内容を複製すると、処理負荷や通信負荷の観点から多大なリソースが必要となる。そこで現実的な手法として、マスタデータベースに対する更新部分のみを抽出してこれをスレーブデータベースに反映することが考えられる。 If the entire contents of the database are replicated every time replication is performed, a large amount of resources are required from the viewpoint of processing load and communication load. Therefore, as a practical method, it is conceivable to extract only the updated portion of the master database and reflect this in the slave database.

下記特許文献１は、データベースシステムのレプリケーションに関する技術を記載している。同文献においては、トランザクションレプリケーションのためのデータ変更を追跡するアルゴリズムとして、ログベースのものとトリガベースのものが例示されている（同文献の０００７〜０００８参照）。 The following Patent Document 1 describes a technique related to database system replication. In this document, a log-based algorithm and a trigger-based algorithm are exemplified as an algorithm for tracking data changes for transaction replication (see 0007 to 0008 in the same document).

特開２００５−３０１３２９号公報JP 2005-301329 A

データベースレプリケーションは、異なる機種のホストコンピュータ間で実施される場合がある。ホストコンピュータの機種によっては、データベース内で使用する特定文字の取り扱い方が異なる場合がある。さらには同機種のホストコンピュータであっても、データベース管理システム（ＤＢＭＳ）によって特定文字の取り扱い方が異なる場合もあり得る。レプリケート先データベースにおいて特定文字が正しく取り扱われなかった場合、レコードの内容が正しく認識されず、したがってレプリケート前後に係る整合性がとれなくなる可能性がある。 Database replication may be performed between different types of host computers. Depending on the model of the host computer, the handling of specific characters used in the database may differ. Furthermore, even a host computer of the same model may have different handling of specific characters depending on the database management system (DBMS). If the specific character is not handled correctly in the replication destination database, the contents of the record are not correctly recognized, and therefore there is a possibility that consistency before and after the replication cannot be achieved.

上記特許文献１記載の技術においては、トランザクションの整合性をレプリケート後においても維持することに着目しており、異機種間（または異なるＤＢＭＳ間）でデータベースをレプリケートすることに起因する問題については考慮されていない。しかし実際の運用場面においては、例えばオープンプラットフォーム（Ｗｉｎｄｏｗｓ（登録商標）など）上のＤＢＭＳから汎用機上のＤＢＭＳへレプリケーションを実施する場合があり、したがってレプリケート前後それぞれのプラットフォームにおける特定文字の取り扱いが異なる可能性がある。かかる場合においてもデータベースの整合性を維持することのできる技術が求められる。 The technique described in Patent Document 1 focuses on maintaining transaction consistency even after replication, and considers problems caused by replicating databases between different models (or between different DBMSs). It has not been. However, in actual operation situations, for example, replication may be performed from a DBMS on an open platform (Windows (registered trademark), etc.) to a DBMS on a general-purpose machine, and therefore the handling of specific characters on each platform before and after replication is different there is a possibility. Even in such a case, a technique capable of maintaining the consistency of the database is required.

本発明は、上記のような課題に鑑みてなされたものであり、データベースのレプリケート前後に係る特定文字の取り扱いが異なることに起因するデータ不整合を抑制することを目的とする。 The present invention has been made in view of the above-described problems, and an object of the present invention is to suppress data mismatch caused by different handling of specific characters before and after database replication.

本発明に係るレプリケーションプログラムは、レプリケート元データベースにおけるゼロバイト長文字列およびｎｕｌｌ値を、レプリケート先データベースにおけるこれら文字列の取り扱いに応じて変換する。 The replication program according to the present invention converts a zero-byte length character string and a null value in the replication source database according to the handling of these character strings in the replication destination database.

本発明に係るレプリケーションプログラムによれば、データベース管理システム毎のゼロバイト長文字列およびｎｕｌｌ値の取り扱いの違いを吸収してレプリケーションを正しく実行することができる。 According to the replication program of the present invention, it is possible to correctly perform replication by absorbing differences in handling of a zero-byte length character string and a null value for each database management system.

実施形態１に係るレプリケーションプログラム１２０を実行するレプリケーションコンピュータ１００およびその周辺構成を示す図である。1 is a diagram showing a replication computer 100 that executes a replication program 120 according to Embodiment 1 and its peripheral configuration. FIG. 変換テーブル１４０の構成とデータ例を示す図である。It is a figure which shows the structure and data example of a conversion table. 履歴テーブル２３０の構成とデータ例を示す図である。It is a figure which shows the structure and data example of the log | history table 230. FIG. レプリケーションプログラム１２０の動作を説明する図である。FIG. 6 is a diagram for explaining the operation of a replication program 120. 実施形態２における変換テーブル１４０の構成とデータ例を示す図である。It is a figure which shows the structure and example of a data of the conversion table 140 in Embodiment 2. FIG. 実施形態３における変換テーブル１４０の構成とデータ例を示す図である。It is a figure which shows the structure and data example of the conversion table 140 in Embodiment 3. FIG.

＜実施の形態１＞
図１は、本発明の実施形態１に係るレプリケーションプログラム１２０を実行するレプリケーションコンピュータ１００およびその周辺構成を示す図である。レプリケーションコンピュータ１００は、第１ＤＢ（ＤａｔａＢａｓｅ）２２０を第２ＤＢ３２０へレプリケートするコンピュータであり、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）１１０、レプリケーションプログラム１２０、記憶装置１３０を備える。 <Embodiment 1>
FIG. 1 is a diagram showing a replication computer 100 that executes a replication program 120 according to the first embodiment of the present invention and its peripheral configuration. The replication computer 100 is a computer that replicates a first DB (DataBase) 220 to a second DB 320, and includes a CPU (Central Processing Unit) 110, a replication program 120, and a storage device 130.

ＣＰＵ１１０は、レプリケーションプログラム１２０を実行する。レプリケーションプログラム１２０は、データベースをレプリケートする処理を実装したプログラムである。以下では記載の便宜上、レプリケーションプログラム１２０を動作主体として説明する場合があるが、実際にこれを実行するのはＣＰＵ１１０であることを付言しておく。 The CPU 110 executes the replication program 120. The replication program 120 is a program that implements a process of replicating a database. In the following, for convenience of description, the replication program 120 may be described as an operation subject. However, it is added that the CPU 110 actually executes this.

記憶装置１３０は、変換テーブル１４０を格納している。変換テーブル１４０は、レプリケーションプログラム１２０がレプリケーションを実施する際に、レプリケート元データベース上における特定文字をレプリケート先データベース上における書式に変換するルールを定義している。変換テーブル１４０の具体例については後述する。 The storage device 130 stores a conversion table 140. The conversion table 140 defines a rule for converting a specific character on the replication source database into a format on the replication destination database when the replication program 120 performs replication. A specific example of the conversion table 140 will be described later.

第１ＤＢコンピュータ２１０は、第１ＤＢ２２０を実装した第１データベース管理システム（図示せず）を実行するコンピュータである。第１ＤＢコンピュータ２１０は、第１ＤＢ２２０に対する更新操作を履歴テーブル２３０に格納する。履歴テーブル２３０の具体例については後述する。第１ＤＢコンピュータ２１０はさらに、更新操作を履歴テーブル２３０へ格納する際に、採番データ２４０を用いる。採番データ２４０については後述する。 The first DB computer 210 is a computer that executes a first database management system (not shown) that implements the first DB 220. The first DB computer 210 stores an update operation for the first DB 220 in the history table 230. A specific example of the history table 230 will be described later. The first DB computer 210 further uses the numbering data 240 when storing the update operation in the history table 230. The numbering data 240 will be described later.

第２ＤＢコンピュータ３１０は、第１データベース管理システムとは異なる第２データベース管理システムを実行するコンピュータである。第２データベース管理システムは、第２ＤＢ３２０を実装している。 The second DB computer 310 is a computer that executes a second database management system different from the first database management system. The second database management system has a second DB 320 mounted.

＜実施の形態１：データベース管理システムについて＞
データベース管理システムについて以下に補足する。データベースは単なるデータファイルの集合体ではなく、例えば（ａ）トランザクションを受け付けて実行する、（ｂ）トランザクションのログを記録する、（ｃ）所定のイベントが発生するとあらかじめ定めておいた処理を実行する（トリガ機能）、などの機能を有する。これら機能を実装したプログラムのことを一般にデータベース管理システムと呼ぶ。これら機能をどのように実装するかは個々のデータベース管理システムに依拠する。したがってデータベース管理システム毎に機能仕様が異なる場合がある。 <Embodiment 1: About database management system>
The database management system is supplemented below. The database is not simply a collection of data files. For example, (a) a transaction is accepted and executed, (b) a transaction log is recorded, and (c) a predetermined process is executed when a predetermined event occurs. (Trigger function). A program that implements these functions is generally called a database management system. How these functions are implemented depends on the individual database management system. Therefore, the function specifications may be different for each database management system.

データベースが保持するレコードは、それぞれデータ型を有している。例えば文字列レコードを保持するレコードまたはカラムは固定長文字列型／可変長文字列型などの文字列型を有し、数値レコードを保持するレコードまたはカラムは整数型／浮動小数点型などの数値型を有する。 Each record held in the database has a data type. For example, a record or column that holds a character string record has a character string type such as a fixed-length character string type / variable-length character string type, and a record or column that holds a numeric record is a numeric type such as an integer type or a floating-point type Have

データベース管理システムによっては、データ型の取り扱い方が異なる場合がある。例えば可変長文字列型のカラムにゼロバイト長文字列“”を格納した場合、あるデータベース管理システムにおいてはこれをその定義通りゼロバイト長文字列“”として格納するのに対し、別のデータベース管理システムにおいてはこれをｎｕｌｌ値（データなしを示す特殊なビット値）として格納することが、実際に発生している。 Some database management systems may handle data types differently. For example, when a zero-byte length character string “” is stored in a variable-length character string type column, it is stored as a zero-byte length character string “” as defined in one database management system, whereas another database management. In the system, storing this as a null value (a special bit value indicating no data) has actually occurred.

一般にゼロバイト長文字列とｎｕｌｌ値は、アプリケーション上における取り扱いが全く異なる。例えばゼロバイト長文字列は文字列として取り扱われるのに対し、ｎｕｌｌ値は無効なデータ値として取り扱われる場合がある。そうすると、例えば第１ＤＢ２２０が格納しているレコードおよびそのレコードを読み書きするアプリケーションを第２ＤＢコンピュータ３１０へ移行（第１ＤＢ２２０については第２ＤＢ３２０へレプリケート）した場合、アプリケーションが正しく動作しない可能性がある。例えば当該アプリケーションが第１ＤＢ２２０上のゼロバイト長文字列を文字列変数として処理していた場合、当該アプリケーションは第２ＤＢ３２０上の同じレコードを文字列として処理することができない可能性がある。具体的には、ゼロバイト長文字列に対して文字列を追加する処理が、無効なデータ値に対する処理として正常に動作しない可能性がある。 In general, a zero-byte length character string and a null value are completely different on the application. For example, a zero-byte character string may be handled as a character string, while a null value may be handled as an invalid data value. Then, for example, when the record stored in the first DB 220 and the application that reads and writes the record are migrated to the second DB computer 310 (the first DB 220 is replicated to the second DB 320), the application may not operate correctly. For example, when the application is processing a zero byte length character string on the first DB 220 as a character string variable, the application may not be able to process the same record on the second DB 320 as a character string. Specifically, the process of adding a character string to a zero-byte length character string may not operate normally as a process for an invalid data value.

そこで本発明においては、異なるデータベース管理システム間のレプリケーションにともなう上記のような不具合を解消するため、取り扱い方がデータベース管理システム間で異なるデータについてかかる不具合が生じないような処理を施すこととした。 Therefore, in the present invention, in order to eliminate the above-mentioned problems associated with replication between different database management systems, processing is performed so that such problems do not occur for data that is handled differently between database management systems.

＜実施の形態１：テーブル構成＞
図２は、変換テーブル１４０の構成とデータ例を示す図である。変換テーブル１４０はレプリケート前後におけるデータ変換ルールを定義するデータテーブルである。変換テーブル１４０は、テーブル名フィールド１４１、カラム名フィールド１４２、ゼロバイト長文字列変換ルールフィールド１４３、ｎｕｌｌ値変換ルールフィールド１４４を有する。 <Embodiment 1: Table configuration>
FIG. 2 is a diagram illustrating the configuration of the conversion table 140 and data examples. The conversion table 140 is a data table that defines data conversion rules before and after replication. The conversion table 140 includes a table name field 141, a column name field 142, a zero-byte length character string conversion rule field 143, and a null value conversion rule field 144.

テーブル名フィールド１４１とカラム名フィールド１４２は、レプリケート元データベースのテーブル名とカラム名を一意に識別する文字列である。レプリケート元データベースが複数のデータベースを構築している場合は、さらにデータベース名を指定することもできる。ゼロバイト長文字列変換ルールフィールド１４３は、レプリケート元データベース上においてゼロバイト長文字列のカラム値が格納されている場合、その値をレプリケート先データベースにおいてどのような値として格納するかを指定する書式を示す。ｎｕｌｌ値変換ルールフィールド１４４は、レプリケート元データベース上においてｎｕｌｌ値のカラム値が格納されている場合、その値をレプリケート先データベースにおいてどのような値として格納するかを指定する書式を示す。 The table name field 141 and the column name field 142 are character strings that uniquely identify the table name and column name of the replication source database. If the source database has multiple databases, you can also specify the database name. The zero byte length character string conversion rule field 143 is a format for specifying what value is stored in the replication destination database when the column value of the zero byte length character string is stored in the replication source database. Indicates. The null value conversion rule field 144 indicates a format for designating what value is stored in the replicate destination database when the null value column value is stored in the replicate source database.

図２に示す１行目のデータ例においては、レプリケート元データベースの“ｔａｂｌｅ１”テーブルの“ｃｏｌｕｍｎ１”カラムがゼロバイト長文字列を格納している場合、レプリケート先においてはこれをスペース文字に置き換えるべき旨を定義している。同様に同カラムがｎｕｌｌ値を格納している場合、レプリケート先においてもこれをｎｕｌｌ値として格納すべき旨を定義している。変換ルールが定義されていないカラムについては格別変換を実施せずそのままレプリケートすればよい。 In the data example of the first row shown in FIG. 2, when the “column1” column of the “table1” table of the replication source database stores a zero-byte length character string, it should be replaced with a space character at the replication destination. Is defined. Similarly, when a null value is stored in the same column, it is defined that this should be stored as a null value at the replicate destination. Columns for which no conversion rule is defined may be replicated as they are without performing special conversion.

図３は、履歴テーブル２３０の構成とデータ例を示す図である。履歴テーブル２３０は、第１ＤＢ２２０に対する更新操作を記録するデータテーブルである。履歴テーブル２３０は、順序フィールド２３１、状態フィールド２３２、命令文フィールド２３３を有する。 FIG. 3 is a diagram illustrating a configuration of the history table 230 and an example of data. The history table 230 is a data table that records update operations for the first DB 220. The history table 230 has an order field 231, a status field 232, and a command statement field 233.

順序フィールド２３１は、第１ＤＢ２２０に対して更新処理が実施された通番を保持する。状態フィールド２３２は、当該レコードを第２ＤＢ３２０に対して反映済みか否かを示す。命令文フィールド２３３は、レプリケート元データベースに対して実施された更新操作の命令文を示す。ここでいう更新操作は、例えばＳＱＬにおけるＵＰＤＡＴＥ命令に限られず、ＩＮＳＥＲＴ命令やＤＥＬＥＴＥ命令を含む、データベースに対する全ての書き込み操作を含むものである。 The order field 231 holds a serial number for which update processing has been performed on the first DB 220. The status field 232 indicates whether or not the record has been reflected on the second DB 320. The command statement field 233 indicates the command statement of the update operation performed on the replication source database. The update operation here is not limited to the UPDATE instruction in SQL, for example, but includes all write operations to the database including the INSERT instruction and the DELETE instruction.

履歴テーブル２３０はさらに、記録した更新操作における各カラムに対する指定値を保持するためのフィールドを有する。例えばＩＮＳＥＲＴ命令であれば各カラムに対して格納した値を保持するためのフィールドを有し、ＤＥＬＥＴＥ命令であれば削除するレコードを特定するための条件文におけるカラム値を保持するためのフィールドを有する。各テーブルが有するフィールドはテーブル毎に異なるので、レプリケート元データベースが有するテーブル毎に履歴テーブル２３０を設ける。例えば第１ＤＢ２２０のあるテーブルが２つのカラムを有する場合、図３に示すようにその２つのカラムに対する操作を記録するフィールドを設けることができる。 The history table 230 further has a field for holding a specified value for each column in the recorded update operation. For example, an INSERT instruction has a field for holding a value stored for each column, and a DELETE instruction has a field for holding a column value in a conditional statement for specifying a record to be deleted. . Since the fields of each table are different for each table, a history table 230 is provided for each table of the replication source database. For example, when a table in the first DB 220 has two columns, a field for recording operations for the two columns can be provided as shown in FIG.

データベースに対する更新操作は、複数のテーブルにまたがって実施される場合がしばしばある。またあるテーブルに対する更新操作は別のテーブルのレコードを参照しながら実施される場合がある。テーブル間の整合性を維持しつつこれら複数のテーブルにまたがって更新操作を実施するためには、更新操作を実施する順序が重要になる。そこで本実施形態１においては、レプリケート元データベース（ここでは第１ＤＢ２２０）に対して更新操作を実施した順序を、順序フィールド２３１に記録することとした。 Update operations on the database are often performed across multiple tables. An update operation for a certain table may be performed with reference to a record in another table. In order to perform the update operation across the plurality of tables while maintaining the consistency between the tables, the order in which the update operations are performed becomes important. Therefore, in the first embodiment, the order in which the update operation is performed on the replication source database (here, the first DB 220) is recorded in the order field 231.

レプリケート元データベースに対する更新操作が発生したことを検出する手段としては例えば更新操作を契機として起動されるトリガをレプリケート元データベースにおいて実装しておけばよい。多くのデータベース管理システムはトリガ機構を備えているので、これを実装するのは容易である。 As a means for detecting the occurrence of an update operation for the replication source database, for example, a trigger that is activated when the update operation is performed may be implemented in the replication source database. Many database management systems are equipped with a trigger mechanism, which is easy to implement.

採番データ２４０は、順序フィールド２３１の現在値を保持する。各トリガの開始部分において採番データ２４０を参照して現在値を取得し、更新操作を履歴テーブル２３０に対して記録した後に採番データ２４０の現在値を１つインクリメントすることにより、各更新操作の実施順序を記録することができる。また複数のテーブルにまたがった更新操作であっても、各更新操作を記録する際に参照する現在値を採番データ２４０に集約することにより、テーブルをまたがる更新処理を実施すべき順序を保存してレプリケート後においても整合性を維持することができる。 The numbering data 240 holds the current value of the order field 231. Each update operation is obtained by referring to the numbering data 240 at the start of each trigger, acquiring the current value, recording the update operation in the history table 230, and then incrementing the current value of the numbering data 240 by one. Can be recorded. In addition, even in the case of an update operation extending over a plurality of tables, the current value to be referred to when recording each update operation is aggregated into the numbering data 240, so that the order in which the update process across the tables should be performed is saved. Thus, consistency can be maintained even after replication.

図４は、レプリケーションプログラム１２０の動作を説明する図である。以下図４の各ステップについて説明する。 FIG. 4 is a diagram for explaining the operation of the replication program 120. Hereinafter, each step of FIG. 4 will be described.

（図４：ステップ（１）：未転送の履歴レコードを抽出）
レプリケーションプログラム１２０は、履歴テーブル２３０が格納しているレコードのうち、状態フィールド２３２が“未転送”であるものを抽出する。複数の履歴テーブル２３０が存在する場合（すなわちレプリケート元テーブルが複数存在する場合）は、全ての履歴テーブル２３０からレコードを抽出する。抽出したレコードを順序フィールド２３１にしたがって本ステップ内でソートしてもよいし、後のステップにおいてソートしてもよい。ここでは本ステップにおいてソートするものとする。 (Figure 4: Step (1): Extract untransferred history records)
The replication program 120 extracts records whose status field 232 is “untransferred” from the records stored in the history table 230. When there are a plurality of history tables 230 (that is, when there are a plurality of replication source tables), records are extracted from all the history tables 230. The extracted records may be sorted in this step according to the order field 231 or may be sorted in a later step. Here, it is assumed that sorting is performed in this step.

（図４：ステップ（２）：変換ルールを適用）
レプリケーションプログラム１２０は、ステップ（１）において抽出したレコードに対して変換テーブル１４０が定義している変換ルールを適用することにより、ゼロバイト長文字列とｎｕｌｌ値をそれぞれレプリケート先データベース（ここでは第２ＤＢ３２０）上における書式に変換する。例えば図３の３行目のレコードはカラム１をゼロバイト長文字列“”に更新する旨を指定しているので、これを第２ＤＢ３２０へ反映する際には、図２の１行目の変換ルールにしたがって、当該レコードの“”を“ ”（スペース文字）に変換する。 (Figure 4: Step (2): Applying conversion rules)
The replication program 120 applies the conversion rule defined by the conversion table 140 to the record extracted in step (1), thereby each of the zero byte length character string and the null value is replicated to the destination database (here, the second DB 320). ) Convert to the above format. For example, since the record in the third line in FIG. 3 specifies that column 1 is updated to the zero-byte character string “”, when this is reflected in the second DB 320, the conversion in the first line in FIG. According to the rules, “” of the record is converted to “” (space character).

（図４：ステップ（３）：第２ＤＢを更新：その１）
レプリケーションプログラム１２０は、ステップ（２）においてゼロバイト長文字列とｎｕｌｌ値を変換した後の各レコードを用いて、第２ＤＢ３２０を更新する命令を生成する。第２ＤＢ３２０に対して実施すべき更新命令の種別は、命令文フィールド２３３に記載されている。当該更新命令において指定すべき各カラム値は、各履歴テーブル２３０それぞれのカラム値部分（変換を実施した場合はその変換後の値）に記載されている。したがってレプリケーションプログラム１２０は、これらフィールドを参照することにより、第２ＤＢ３２０に対して実施すべき更新命令を生成することができる。 (FIG. 4: Step (3): Update the second DB: Part 1)
The replication program 120 generates an instruction to update the second DB 320 using each record after converting the zero-byte length character string and the null value in step (2). The type of update command to be executed for the second DB 320 is described in the command statement field 233. Each column value to be specified in the update command is described in a column value portion of each history table 230 (or a value after conversion when conversion is performed). Therefore, the replication program 120 can generate an update command to be executed on the second DB 320 by referring to these fields.

（図４：ステップ（３）：第２ＤＢを更新：その２）
レプリケーションプログラム１２０は、生成した更新命令を順序フィールド２３１が指定する順番で実行することにより、第２ＤＢ３２０に対して履歴テーブル２３０の内容を反映する。レプリケーションプログラム１２０は、反映したレコードについては、状態フィールド２３２を“転送済”に変更する。 (FIG. 4: Step (3): Update the second DB: Part 2)
The replication program 120 reflects the contents of the history table 230 on the second DB 320 by executing the generated update instructions in the order specified by the order field 231. The replication program 120 changes the status field 232 to “transferred” for the reflected record.

（図４：ステップ（２）〜（３）：補足）
これらステップを実施する順序は入れ替えてもよい。すなわち、第２ＤＢ３２０に対する更新命令を生成した後に変換ルールを適用してもよい。 (FIG. 4: Steps (2) to (3): Supplement)
The order in which these steps are performed may be switched. That is, the conversion rule may be applied after generating an update command for the second DB 320.

＜実施の形態１：まとめ＞
本実施形態１に係るレプリケーションプログラム１２０は、第１ＤＢ２２０におけるゼロバイト長文字列とｎｕｌｌ値を第２ＤＢ３２０における書式にそれぞれ変換した上で、第１ＤＢ２２０のレコードを第２ＤＢ３２０へレプリケートする。これにより、レプリケート後においてもデータ不整合を生じることなくアプリケーションを稼働させることができる。 <Embodiment 1: Summary>
The replication program 120 according to the first embodiment converts the zero-byte length character string and the null value in the first DB 220 into the format in the second DB 320, and then replicates the record in the first DB 220 to the second DB 320. As a result, the application can be operated without data inconsistency even after replication.

本実施形態１に係るレプリケーションプログラム１２０は、第２ＤＢ３２０におけるゼロバイト長文字列の書式とｎｕｌｌ値の書式を、カラム毎に変換する。これにより、変換を実施すべきカラムについてのみ変換を実施し、レプリケーションの処理効率を高めることができる。またテーブルのカラム構造に依拠することなく上記変換を実施することができる。 The replication program 120 according to the first embodiment converts the format of the zero byte length character string and the format of the null value in the second DB 320 for each column. As a result, the conversion can be performed only for the column to be converted, and the replication processing efficiency can be improved. Further, the above conversion can be performed without depending on the column structure of the table.

＜実施の形態２＞
図５は、本発明の実施形態２における変換テーブル１４０の構成とデータ例を示す図である。本実施形態２において、変換テーブル１４０は実施形態１で説明した構成に加えて新たに抽出フォーマットフィールド１４５を有する。その他構成は実施形態１と同様である。 <Embodiment 2>
FIG. 5 is a diagram illustrating a configuration and data example of the conversion table 140 according to the second embodiment of the present invention. In the second embodiment, the conversion table 140 has a new extraction format field 145 in addition to the configuration described in the first embodiment. Other configurations are the same as those of the first embodiment.

抽出フォーマットフィールド１４５は、レプリケート元データベースのレコードを抽出する際に当該レコードをどのようなデータ型として取り扱うかを指定する。例えば“ＴＥＸＴ”は元レコードを文字列型として取り扱うことを示し、“ＦＬＯＡＴ３．２”は整数部３桁かつ小数部２ケタの浮動小数点型として取り扱うことを示す。 The extraction format field 145 specifies what data type the record is handled when extracting the record of the replication source database. For example, “TEXT” indicates that the original record is handled as a character string type, and “FLOAT3.2” indicates that it is handled as a floating-point type having a 3-digit integer part and 2-digit decimal part.

レプリケーションプログラム１２０は、レプリケート元レコードを抽出する際に、抽出フォーマットフィールド１４５が指定するデータ型に合致しない部分については適宜切り捨てなどの措置を施す。例えば“ＦＬＯＡＴ３．２”が指定されている抽出元カラムの小数部が３桁以上ある場合、３桁目以降は切り捨てまたは四捨五入する。あるいは“ＦＬＯＡＴ３．２”が指定されている抽出元カラムが数値でない場合、抽出後のカラム値は例えば“０．０”とする。 When the replication program 120 extracts the replication source record, the replication program 120 takes measures such as truncating the portion that does not match the data type specified by the extraction format field 145 as appropriate. For example, if the extraction source column for which “FLOAT3.2” is specified has three or more decimal parts, the third and subsequent digits are rounded down or rounded off. Alternatively, when the extraction source column for which “FLOAT3.2” is designated is not a numerical value, the column value after extraction is set to “0.0”, for example.

＜実施の形態３＞
実施形態１においては、第１ＤＢ２２０から第２ＤＢ３２０へレプリケートを実施することを前提としたが、第２ＤＢ３２０から第１ＤＢ２２０へレプリケートを実施する際にも本発明を用いることができる。本発明の実施形態３ではその１例について説明する。 <Embodiment 3>
In the first embodiment, it is assumed that replication is performed from the first DB 220 to the second DB 320. However, the present invention can also be used when performing replication from the second DB 320 to the first DB 220. In the third embodiment of the present invention, an example will be described.

第２ＤＢ３２０から第１ＤＢ２２０へレプリケートを実施する際にも、処理対象とするデータベースが入れ替わったのみであるから、実施形態１と同様の処理手順を用いることができる。ただし第２ＤＢ３２０がゼロバイト長文字列をｎｕｌｌ値として取り扱うことに鑑み、レプリケーションプログラム１２０またはマニュアル作業によってゼロバイト長文字列の代わりにスペースを代入する措置があらかじめ取られ、レプリケート元たる第２ＤＢ３２０がかかるレコードを格納している場合がある。そこで本実施形態３においては、レプリケート元レコードがスペース文字列であった場合にこれをレプリケート先においてどのように取り扱うかをあらかじめ定義することとした。 Even when the replication is performed from the second DB 320 to the first DB 220, the processing procedure similar to that of the first embodiment can be used because only the database to be processed is replaced. However, in view of the fact that the second DB 320 handles the zero byte length character string as a null value, a measure for substituting a space instead of the zero byte length character string is taken in advance by the replication program 120 or manual operation, and the second DB 320 that is the replication source is applied. May contain records. Therefore, in the third embodiment, when the replication source record is a space character string, how to handle it at the replication destination is defined in advance.

図６は、本実施形態３における変換テーブル１４０の構成とデータ例を示す図である。本実施形態３において、変換テーブル１４０は実施形態１で説明した構成に加えて新たにスペース変換ルールフィールド１４６を有する。その他構成は実施形態１と同様である。 FIG. 6 is a diagram illustrating a configuration and data example of the conversion table 140 according to the third embodiment. In the third embodiment, the conversion table 140 has a space conversion rule field 146 in addition to the configuration described in the first embodiment. Other configurations are the same as those of the first embodiment.

スペース変換ルールフィールド１４６は、レプリケート元データベース上においてスペース文字列のカラム値が格納されている場合、その値をレプリケート先データベースにおいてどのような値として格納するかを指定する書式を示す。上記想定によればスペース文字列はレプリケート先データベースにおいてゼロバイト長文字列として格納すべきであるから、図６の１行目のレコードはその旨を指定している。ただしカラムの性質によってはスペースを格納することが初めから意図されている場合もあるので、そのようなカラムについては本フィールドを指定せずレプリケート元レコードをそのままレプリケートするようにしてもよい。図６の２行目のレコードはその旨を示す。 The space conversion rule field 146 indicates a format for designating what value is stored in the replication destination database when the column value of the space character string is stored in the replication source database. According to the above assumption, the space character string should be stored as a zero-byte length character string in the replication destination database, so the record on the first line in FIG. 6 specifies that fact. However, there is a case where it is originally intended to store a space depending on the property of the column. For such a column, this field may not be specified and the replication source record may be replicated as it is. The record on the second line in FIG. 6 indicates that.

＜本発明の変形例について＞
本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換える事が可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について他の構成の追加・削除・置換をすることができる。 <Modification of the present invention>
The present invention is not limited to the above-described embodiments, and includes various modifications. For example, the above-described embodiments have been described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described. Further, a part of the configuration of a certain embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to the configuration of a certain embodiment. In addition, it is possible to add, delete, and replace other configurations for a part of the configuration of each embodiment.

以上の実施形態において、レプリケーションプログラム１２０は、レプリケート前後の各ＤＢがそれぞれ異なる文字コードを用いる場合は、レプリケート元レコードをレプリケート先データベースが使用する文字コードへ変換してもよい。 In the above embodiment, the replication program 120 may convert the replication source record into the character code used by the replication destination database when the DBs before and after the replication use different character codes.

以上の実施形態において、ゼロバイト長文字列をスペース文字列に変換する例を示したが、ゼロバイト長文字列がｎｕｌｌ値として取り扱われる不具合を回避するためには何らかの文字列が格納されていればよいので、スペース以外の適当な文字列を用いてもよい。ただし処理の便宜に鑑みて、何らかの１バイト長以上の文字または文字列であることが望ましい。 In the above embodiment, an example in which a zero-byte length character string is converted to a space character string has been shown. However, in order to avoid a problem that a zero-byte length character string is handled as a null value, any character string may be stored. Any suitable character string other than a space may be used. However, in view of the convenience of processing, it is desirable that the character or character string has some length of 1 byte or more.

以上の実施形態においては、説明の容易のためＲＤＢＭＳ（リレーショナルデータベース管理システム）におけるＳＱＬ命令文を例示したが、実施すべき命令はデータベース管理システムの仕様に応じたものを適宜用いればよく、必ずしもＳＱＬ命令文に限られるものではない。 In the above embodiment, the SQL statement in the RDBMS (Relational Database Management System) is illustrated for ease of explanation, but the command to be implemented may be appropriately used according to the specification of the database management system, and is not necessarily SQL. It is not limited to imperative sentences.

以上の実施形態においては、履歴テーブル２３０にレコードを格納する手段としてデータベーストリガを用いているが、その他適当な手段によりレプリケート元データベースに対する更新操作を記録してもよい。 In the above embodiment, the database trigger is used as means for storing records in the history table 230, but the update operation for the replication source database may be recorded by other appropriate means.

以上の実施形態においては、第１ＤＢコンピュータ２１０上のデータベースを第２ＤＢコンピュータ３１０上のデータベースへレプリケートしているが、同一コンピュータ上の異なるデータベース管理システム間でレプリケーションを実施する際に本発明を用いることもできる。 In the above embodiment, the database on the first DB computer 210 is replicated to the database on the second DB computer 310, but the present invention is used when performing replication between different database management systems on the same computer. You can also.

１００：レプリケーションコンピュータ、１１０：ＣＰＵ、１２０：レプリケーションプログラム、１３０：記憶装置、１４０：変換テーブル、２１０：第１ＤＢコンピュータ、２２０：第１ＤＢ、２３０：履歴テーブル、２４０：採番データ、３１０：第２ＤＢコンピュータ、３２０：第２ＤＢ。 100: Replication computer, 110: CPU, 120: Replication program, 130: Storage device, 140: Conversion table, 210: First DB computer, 220: First DB, 230: History table, 240: Numbering data, 310: Second DB Computer, 320: second DB.

Claims

A replication program that causes a computer to execute a process of replicating a record held by a first database as a record held by a second database implemented using a database management system different from the first database, In the computer,
Obtaining history data describing an execution history of a first instruction for updating the first database;
Reading a conversion table defining a format of a zero byte length string on the second database and a format of null values on the second database;
Generating a second instruction for executing a process for performing the same record operation as the first instruction on the second database;
The part using the zero byte length character string of the second instruction is converted into the format on the second database according to the definition of the conversion table, and the part using the null value of the second instruction is converted into the conversion table. Converting to a format on the second database according to the definition;
Updating the second database by executing the second instruction converted by the converting step;
A replication program characterized by executing

The first database and the second database are relational databases,
The conversion table defines the format of the zero byte length character string and the format of the null value for each column of the table of the first database,
In the conversion step, the computer
2. The replication program according to claim 1, wherein the conversion is performed for each portion corresponding to a column of a table included in the first database in the second instruction according to the definition of the conversion table. 3.

The replication program further on the computer,
The replication program according to claim 1, wherein the step of converting the character code of the second instruction from the character code used by the first database to the character code used by the second database is executed.

The conversion table defines a conversion rule that specifies conversion of a zero byte length character string on the first database into an alternative character string of 1 byte length or more on the second database;
The replication program causes the computer to convert a portion using a zero-byte length character string in the second instruction to the substitute character string in the conversion step according to the definition of the conversion rule. 1. The replication program according to 1.

The conversion table defines a conversion rule that specifies conversion of the substitute character string on the first database into a zero-byte length character string on the second database;
The replication program, in the conversion step, causes the computer to convert a portion of the second instruction that uses the substitute character string into a zero-byte length character string according to the definition of the conversion rule. 4. The replication program according to 4.