JP2018136775A

JP2018136775A - Server system and replication method

Info

Publication number: JP2018136775A
Application number: JP2017031187A
Authority: JP
Inventors: 駿杉本; Shun Sugimoto; 史郎青島; Shiro Aoshima
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-02-22
Filing date: 2017-02-22
Publication date: 2018-08-30

Abstract

PROBLEM TO BE SOLVED: To quickly ensure the redundancy of important data while reducing the impact on service provision when collective replication is performed.SOLUTION: ACT-based DB server 10A of DB server system 100 performs control in such a manner that data having been set as data of high importance, among data stored in DB 15A of the ACT-based DB server 10A, is preferentially processed in replication from ACT-based DB server 10A to SBY-based DB server 10B compared to data having been set as data of low importance. Then, when the processing load of the ACT-based DB server 10A exceeds a threshold 2 during the replication for the data of low importance, the ACT-based DB server 10A suspends the replication.SELECTED DRAWING: Figure 1

Description

本発明は、サーバシステムおよびレプリケーション方法に関する。 The present invention relates to a server system and a replication method.

従来、複数のデータベース（ＤＢ）を用いてＤＢを冗長化したＤＢサーバシステムが知られている（特許文献１参照）。そして、このようなＤＢサーバシステムでは、待機系ＤＢサーバ（以下、ＳＢＹ系ＤＢサーバと記載）が組み込まれた場合には、現用系ＤＢサーバ（以下、ＡＣＴ系ＤＢサーバと記載）のＤＢに記憶されたデータの複製（レプリカ）を別ＳＢＹ系ＤＢサーバのＤＢ上に作成し、ＡＣＴ系ＤＢサーバのデータとＳＢＹ系ＤＢサーバとを同期させる一括レプリケーションを行う。 Conventionally, a DB server system in which a DB is made redundant by using a plurality of databases (DB) is known (see Patent Document 1). In such a DB server system, when a standby DB server (hereinafter referred to as an SBY DB server) is incorporated, it is stored in the DB of an active DB server (hereinafter referred to as an ACT DB server). A replica (replica) of the data is created on the DB of another SBY DB server, and batch replication is performed to synchronize the data of the ACT DB server and the SBY DB server.

特開２００７−２８６９５２号公報JP 2007-286952 A

しかしながら、従来の技術では、一括レプリケーションを行う場合に、該一括レプリケーションにかかる処理負荷が大きいため、サービス提供への影響が大きくなってしまうという課題があった。つまり、例えば、一括レプリケーション中にサービスノードからの要求が集中した場合にはサーバの処理が高負荷となり、サービスが輻輳規制される可能性がある。 However, the conventional technology has a problem in that when batch replication is performed, the processing load for the batch replication is large, so that the influence on service provision increases. In other words, for example, when requests from service nodes are concentrated during batch replication, the server processing becomes heavy and there is a possibility that the service will be restricted.

また、輻輳規制を回避するためにサービス提供装置からの要求を優先に処理することも考えられるが、この場合には、重要度の高いデータのレプリケーションについても後回しとなってしまい、重要なデータの冗長性を迅速に確保できず、障害発生時の影響が大きくなるという課題があった。 In addition, in order to avoid congestion regulation, it may be possible to preferentially process requests from service providing devices, but in this case, replication of highly important data will be postponed, and important data will be postponed. There was a problem that redundancy could not be secured quickly, and the impact at the time of failure occurred was increased.

上述した課題を解決し、目的を達成するために、本発明のサーバシステムは、現用系サーバと待機系サーバとを有するサーバシステムであって、前記現用系サーバに記憶されたデータのうち、重要度が高いデータとして設定されたデータを重要度が低いデータとして設定されたデータよりも優先的に前記現用系サーバから前記待機系サーバへレプリケーションするように制御するレプリケーション制御部と、前記レプリケーション制御部によって重要度が低いデータのレプリケーションが行われている際に、前記現用系サーバの処理負荷が第一の閾値を超えた場合には、レプリケーションを中断する中断部と、を備えることを特徴とする。 In order to solve the above-described problems and achieve the object, the server system of the present invention is a server system having an active server and a standby server, and among important data stored in the active server A replication control unit for controlling data set as high-priority data to be replicated from the active server to the standby server preferentially over data set as low-priority data; and the replication control unit When replication of data with low importance is performed by the above, when the processing load of the active server exceeds a first threshold, an interruption unit that interrupts replication is provided. .

また、本発明のレプリケーション方法は、現用系サーバと待機系サーバとを有するサーバシステムによって実行されるレプリケーション方法であって、前記現用系サーバに記憶されたデータのうち、重要度が高いデータとして設定されたデータを重要度が低いデータとして設定されたデータよりも優先的に前記現用系サーバから前記待機系サーバへレプリケーションするように制御するレプリケーション制御工程と、前記制御工程によって重要度が低いデータのレプリケーションが行われている際に、前記現用系サーバの処理負荷が第一の閾値を超えた場合には、レプリケーションを中断する中断工程と、を含んだことを特徴とする。 The replication method of the present invention is a replication method executed by a server system having an active server and a standby server, and is set as highly important data among data stored in the active server Replication control process for controlling the active data to be replicated from the active server to the standby server preferentially over the data set as low importance data, and for the data with low importance by the control process When replication is being performed, if the processing load of the active server exceeds a first threshold value, an interruption step of interrupting replication is included.

本発明によれば、一括レプリケーションを行う際のサービス提供への影響を低減しつつ、重要なデータの冗長性を迅速に確保することができるという効果を奏する。 According to the present invention, it is possible to quickly ensure the redundancy of important data while reducing the influence on service provision when performing batch replication.

図１は、第一の実施の形態に係るＤＢサーバシステムの構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of a configuration of a DB server system according to the first embodiment. 図２は、第一の実施の形態に係るＡＣＴ系ＤＢサーバの構成を示すブロック図である。FIG. 2 is a block diagram showing the configuration of the ACT DB server according to the first embodiment. 図３は、ＡＣＴ系ＤＢサーバが有するＤＢのデータ構成を例示する図である。FIG. 3 is a diagram illustrating a data configuration of a DB included in the ACT DB server. 図４は、重要度の高いグループからレプリケーションを行う処理について説明する図である。FIG. 4 is a diagram for explaining processing for performing replication from a group with high importance. 図５は、通常時の一括レプリケーションの動作を説明する図である。FIG. 5 is a diagram for explaining the normal batch replication operation. 図６は、高負荷時の一括レプリケーションの動作を説明する図である。FIG. 6 is a diagram for explaining the operation of batch replication at high load. 図７は、第一の実施の形態に係るＤＢサーバシステムのレプリケーション処理の流れを示すフローチャートである。FIG. 7 is a flowchart showing a flow of replication processing of the DB server system according to the first embodiment. 図８は、従来のＤＢサーバシステムにおける課題を説明する図である。FIG. 8 is a diagram for explaining a problem in the conventional DB server system. 図９は、レプリケーションプログラムを実行するコンピュータを示す図である。FIG. 9 is a diagram illustrating a computer that executes a replication program.

以下に、本願に係るサーバシステムおよびレプリケーション方法の実施の形態を図面に基づいて詳細に説明する。なお、この実施の形態により本願に係るサーバシステムおよびレプリケーション方法が限定されるものではない。 Hereinafter, embodiments of a server system and a replication method according to the present application will be described in detail with reference to the drawings. The server system and the replication method according to the present application are not limited by this embodiment.

［第一の実施の形態］
以下の実施の形態では、第一の実施の形態に係るＤＢサーバシステムの構成、ＤＢサーバの構成、ＤＢサーバの処理の流れを順に説明し、最後に第一の実施の形態による効果を説明する。 [First embodiment]
In the following embodiments, the configuration of the DB server system according to the first embodiment, the configuration of the DB server, and the flow of processing of the DB server will be described in order, and finally the effects of the first embodiment will be described. .

［ＤＢサーバシステムの構成］
図１は、第一の実施の形態に係るＤＢサーバシステムの構成の一例を示す図である。第一の実施の形態に係るＤＢサーバシステム１００は、ＡＣＴ系ＤＢサーバ１０ＡとＳＢＹ系ＤＢサーバ１０Ｂとサービス提供装置２０とを有する。なお、図１に示す各装置の数は、あくまで一例であり、これに限られるものではない。また、ＡＣＴ系ＤＢサーバ１０ＡとＳＢＹ系ＤＢサーバ１０Ｂについて、両者が有する機能構成は同じであるものとする。 [Configuration of DB server system]
FIG. 1 is a diagram illustrating an example of a configuration of a DB server system according to the first embodiment. The DB server system 100 according to the first embodiment includes an ACT DB server 10A, an SBY DB server 10B, and a service providing device 20. In addition, the number of each apparatus shown in FIG. 1 is an example to the last, and is not restricted to this. Further, it is assumed that the functional configurations of both the ACT DB server 10A and the SBY DB server 10B are the same.

ＡＣＴ系ＤＢサーバ１０Ａは、現用系ＤＢサーバであって、サービス提供に必要なデータを記憶するＤＢ１５Ａを有する。ＳＢＹ系ＤＢサーバ１０Ｂは、待機系ＤＢサーバであって、ＡＣＴ系ＤＢサーバ１０ＡのＤＢ１５Ａの複製データを記憶するＤＢ１５Ｂを有する。サービス提供装置２０は、ユーザにサービスを提供する。 The ACT DB server 10A is an active DB server, and includes a DB 15A that stores data necessary for service provision. The SBY DB server 10B is a standby DB server and includes a DB 15B that stores duplicate data of the DB 15A of the ACT DB server 10A. The service providing device 20 provides a service to the user.

このような構成のＤＢサーバシステム１００において、サービス提供装置２０は、平常時に、ユーザの要求に応じて、あるいは予め、ＡＣＴ系ＤＢサーバ１０ＡのＤＢ１５Ａにアクセスしてデータを参照し、参照したデータをキャッシュメモリに記憶するデータキャッシュを行う。そして、サービス提供装置２０は、このデータを用いてユーザにサービスを提供する。また、サービス提供装置２０は、ネットワーク障害等によりＡＣＴ系ＤＢサーバ１０Ａと通信できない場合に、ＳＢＹ系ＤＢサーバ１０ＢのＤＢ１５Ｂにアクセスしてデータを参照し、データキャッシュを行って、このデータを用いてユーザにサービスを提供する。 In the DB server system 100 having such a configuration, the service providing device 20 refers to the data by accessing the DB 15A of the ACT DB server 10A in advance in response to the user's request or in advance. The data cache stored in the cache memory is performed. And the service provision apparatus 20 provides a service to a user using this data. Further, when the service providing apparatus 20 cannot communicate with the ACT DB server 10A due to a network failure or the like, the service providing apparatus 20 accesses the DB 15B of the SBY DB server 10B, refers to the data, performs data cache, and uses this data. Provide services to users.

ＡＣＴ系ＤＢサーバ１０Ａは、ＳＢＹ系ＤＢサーバ１０ＢがＤＢサーバシステム１００に組み込まれた場合に、ＤＢ１５Ａに記憶されたデータをＳＢＹ系ＤＢサーバ１０ＢのＤＢ１５Ｂに通知して、ＡＣＴ系ＤＢサーバ１０ＡのＤＢ１５ＡとＳＢＹ系ＤＢサーバ１０ＢのＤＢ１５Ｂとを同期させる一括レプリケーションを行う。 When the SBY DB server 10B is incorporated in the DB server system 100, the ACT DB server 10A notifies the data stored in the DB 15A to the DB 15B of the SBY DB server 10B, and the DB 15A of the ACT DB server 10A. And collective replication to synchronize the DB 15B of the SBY DB server 10B.

第一の実施の形態に係るＤＢサーバシステム１００では、ＡＣＴ系ＤＢサーバ１０Ａが、ＤＢ１５Ａに記憶されたデータのうち、重要度が高いデータとして設定されたデータを重要度が低いデータとして設定されたデータよりも優先的にＡＣＴ系ＤＢサーバ１０Ａから、ＳＢＹ系ＤＢサーバ１０Ｂへレプリケーションするように制御する。 In the DB server system 100 according to the first embodiment, the ACT DB server 10A sets data set as data with high importance among data stored in the DB 15A as data with low importance. Control is performed so that replication is performed from the ACT DB server 10A to the SBY DB server 10B in preference to the data.

そして、ＡＣＴ系ＤＢサーバ１０Ａが、重要度が低いデータのレプリケーションが行われている際に、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が後述する「閾値２」を超えた場合には、レプリケーションを中断する。 When the ACT DB server 10A is replicating data with low importance, if the processing load of the ACT DB server 10A exceeds a “threshold value 2” described later, the replication is interrupted. .

このように、第一の実施の形態に係るＤＢサーバシステム１００では、重要度の高いグループからレプリケーションを行い、重要度が低いデータのレプリケーションが行われている際に、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が高くなると、レプリケーションを中断する。このため、一括レプリケーションを行う際のサービス提供への影響を低減しつつ、重要なデータの冗長性を迅速に確保することができる。 As described above, in the DB server system 100 according to the first embodiment, replication is performed from a group with high importance, and processing of the ACT DB server 10A is performed when replication of data with low importance is performed. When the load becomes high, the replication is interrupted. Therefore, it is possible to quickly ensure the redundancy of important data while reducing the influence on service provision when performing batch replication.

［ＡＣＴ系ＤＢサーバの構成］
次に、図２を用いて、図１に示したＡＣＴ系ＤＢサーバ１０Ａの構成を説明する。図２は、第一の実施の形態に係るＡＣＴ系ＤＢサーバ１０Ａの構成を示すブロック図である。図２に示すように、このＡＣＴ系ＤＢサーバ１０Ａは、通信処理部１１、入力部１２、出力部１３、制御部１４およびＤＢ１５Ａを有する。以下にこれらの各部の処理を説明する。 [Configuration of ACT DB server]
Next, the configuration of the ACT DB server 10A shown in FIG. 1 will be described using FIG. FIG. 2 is a block diagram showing a configuration of the ACT DB server 10A according to the first embodiment. As shown in FIG. 2, the ACT DB server 10A includes a communication processing unit 11, an input unit 12, an output unit 13, a control unit 14, and a DB 15A. The processing of each of these units will be described below.

通信処理部１１は、ＮＩＣ等で実現され、ＬＡＮやインターネットなどの電気通信回線を介したＳＢＹ系ＤＢサーバ１０Ｂやサービス提供装置２０等の外部の装置との通信を制御する。 The communication processing unit 11 is realized by a NIC or the like, and controls communication with an external device such as the SBY DB server 10B or the service providing device 20 via an electric communication line such as a LAN or the Internet.

入力部１２は、キーボードやマウスなどの入力デバイスであり、ユーザから各種情報の入力操作を受付ける。例えば、入力部１２は、ＤＢ１５Ａに記憶されているデータの重要度の設定指示を受け付けたり、後述する閾値１、閾値２および閾値３の設定指示を受け付けたりする。出力部１３は、ディスプレイなどの出力デバイスであり、各種情報を出力する。 The input unit 12 is an input device such as a keyboard or a mouse, and receives input operations of various information from the user. For example, the input unit 12 accepts an instruction to set the importance of data stored in the DB 15A, or accepts instructions to set thresholds 1, 2, and 3 described later. The output unit 13 is an output device such as a display, and outputs various information.

ＤＢ１５Ａは、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。 The DB 15A is realized by a semiconductor memory device such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk.

ＤＢ１５Ａは、サービス提供装置２０が参照してサービス提供に用いるサービス情報を記憶する。ここで、図３の例を用いて、ＡＣＴ系ＤＢサーバ１０Ａが有するＤＢ１５Ａのデータ構成を説明する。図３は、ＡＣＴ系ＤＢサーバが有するＤＢのデータ構成を例示する図である。サービス情報は、図３に例示するように、各レコードを識別するレコードＩＤと、情報内容を示すデータと、設定されたデータの重要度とが対応付けて記憶されている。図３には、例えば、レコードＩＤが１であるレコードのデータがＡＡＡであり、重要度が「高」であることが例示されている。 The DB 15A stores service information used by the service providing apparatus 20 to provide services. Here, the data configuration of the DB 15A of the ACT DB server 10A will be described using the example of FIG. FIG. 3 is a diagram illustrating a data configuration of a DB included in the ACT DB server. As illustrated in FIG. 3, the service information stores a record ID for identifying each record, data indicating information contents, and importance of set data in association with each other. FIG. 3 illustrates, for example, that the data of the record whose record ID is 1 is AAA and the importance is “high”.

制御部１４は、各種の処理手順などを規定したプログラムおよび所要データを格納するための内部メモリを有し、これらによって種々の処理を実行するが、特に本発明に密接に関連するものとしては、レプリケーション制御部１４ａ、計測部１４ｂ、中断部１４ｃおよび再開部１４ｄを有する。 The control unit 14 has an internal memory for storing a program that defines various processing procedures and the necessary data, and performs various processes by using these programs. Particularly, as closely related to the present invention, It has a replication control unit 14a, a measurement unit 14b, an interruption unit 14c, and a resumption unit 14d.

レプリケーション制御部１４ａは、ＡＣＴ系ＤＢサーバ１０Ａに記憶されたデータのうち、重要度が高いデータとして設定されたデータを重要度が低いデータとして設定されたデータよりも優先的にＡＣＴ系ＤＢサーバ１０ＡからＳＢＹ系ＤＢサーバ１０Ｂへレプリケーションするように制御する。 The replication control unit 14a preferentially sets the data set as data with high importance among the data stored in the ACT DB server 10A over data set as data with low importance. To replicate to the SBY DB server 10B.

具体的には、レプリケーション制御部１４ａは、ＳＢＹ系ＤＢサーバ１０ＢがＤＢサーバシステム１００に組み込まれた場合に、ＤＢ１５Ａに記憶されたデータのうち重要度の高いデータから順にＳＢＹ系ＤＢサーバ１０ＢのＤＢ１５Ｂに通知して、ＡＣＴ系ＤＢサーバ１０ＡのＤＢ１５ＡとＳＢＹ系ＤＢサーバ１０ＢのＤＢ１５Ｂとを同期させる一括レプリケーションを行う。 Specifically, when the SBY DB server 10B is incorporated in the DB server system 100, the replication control unit 14a performs the DB 15B of the SBY DB server 10B in descending order of importance from the data stored in the DB 15A. And performing batch replication to synchronize the DB 15A of the ACT DB server 10A and the DB 15B of the SBY DB server 10B.

例えば、図４に例示するように、レプリケーション制御部１４ａは、ＡＣＴ系ＤＢサーバ１０ＡのＤＢ１５Ａに記憶されたデータを重要度ごとにグルーピングし、一括レプリケーション中は、重要度の高いデータから順にＳＢＹ系ＤＢサーバ１０ＢのＤＢ１５Ｂにレプリケーションを実施する。図４は、重要度の高いグループからレプリケーションを行う処理について説明する図である。 For example, as illustrated in FIG. 4, the replication control unit 14 a groups the data stored in the DB 15 </ b> A of the ACT DB server 10 </ b> A according to importance, and during batch replication, the SBY system sequentially from the data with the highest importance. Replication is performed on the DB 15B of the DB server 10B. FIG. 4 is a diagram for explaining processing for performing replication from a group with high importance.

計測部１４ｂは、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷として、ＡＣＴ系ＤＢサーバ１０ＡのＣＰＵ使用率を計測する。具体的には、計測部１４ｂは、ＡＣＴ系ＤＢサーバ１０ＡのＣＰＵ使用率を定期的に計測し、中断部１４ｃおよび再開部１４ｄにＣＰＵ使用率を通知する。 The measurement unit 14b measures the CPU usage rate of the ACT DB server 10A as the processing load of the ACT DB server 10A. Specifically, the measurement unit 14b periodically measures the CPU usage rate of the ACT DB server 10A and notifies the interrupting unit 14c and the restarting unit 14d of the CPU usage rate.

なお、計測部１４ｂは、ログ情報の取得タイミングとして、所定の時間間隔でログ情報を取得してもよいし、所定のイベント等をトリガとして取得してもよいし、ユーザの指示に従って取得するようにしてもよい。また、計測部１４ｂは、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷として、ＣＰＵ使用率に代えて、メモリの使用量や通信量等を計測するようにしてもよい。 The measurement unit 14b may acquire log information at predetermined time intervals as log information acquisition timing, may acquire a predetermined event or the like as a trigger, or may acquire according to a user instruction. It may be. Further, the measurement unit 14b may measure a memory usage amount, a communication amount, or the like instead of the CPU usage rate as the processing load of the ACT DB server 10A.

中断部１４ｃは、レプリケーション制御部１４ａによって重要度が低いデータのレプリケーションが行われている際に、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が閾値２を超えた場合には、レプリケーションを中断する。 The interrupting unit 14c interrupts replication when the processing load of the ACT DB server 10A exceeds the threshold 2 when the replication control unit 14a is replicating data with low importance.

また、中断部１４ｃは、ＡＣＴ系ＤＢサーバ１０Ａによって重要度が高いデータのレプリケーションが行われている際に、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が閾値２よりも高い値である閾値３を超えた場合には、レプリケーションを中断する。 Further, the interrupting unit 14c has exceeded the threshold 3 that the processing load of the ACT DB server 10A is higher than the threshold 2 when the ACT DB server 10A is replicating highly important data. If so, suspend replication.

再開部１４ｄは、中断部１４ｃによってレプリケーションが中断された後、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が閾値２および閾値３よりも低い値である閾値１未満となった場合には、レプリケーションを再開する。 The resuming unit 14d resumes the replication when the processing load of the ACT DB server 10A becomes less than the threshold 1 which is lower than the threshold 2 and the threshold 3 after the interruption is interrupted by the interruption unit 14c. .

上述したように、ＣＰＵ使用率に対する閾値として、閾値１〜閾値３が設定されている。例えば、閾値１として「４５％」、閾値２として「６０％」、閾値３として「７５％」が設定される。これらの閾値は、適宜変更可能なパラメータであるが、概ね、優先度として、重要度の高いデータのレプリケーション＞サービス要求に対する処理＞重要度の低いデータのレプリケーションとなるように、閾値が設定されるものとする。 As described above, thresholds 1 to 3 are set as thresholds for the CPU usage rate. For example, “45%” is set as the threshold 1, “60%” is set as the threshold 2, and “75%” is set as the threshold 3. These thresholds are parameters that can be changed as appropriate. In general, the thresholds are set so that the priority is replication of highly important data> service request processing> replication of less important data. Shall.

ここで、図５および図６の例を用いて、一括レプリケーション中の動作について説明する。図５は、通常時の一括レプリケーションの動作を説明する図である。図６は、高負荷時の一括レプリケーションの動作を説明する図である。 Here, the operation during batch replication will be described with reference to the examples of FIGS. FIG. 5 is a diagram for explaining the normal batch replication operation. FIG. 6 is a diagram for explaining the operation of batch replication at high load.

まず、図５を用いて、通常時の一括レプリケーションの動作を説明する。図５に示すように、ＡＣＴ系ＤＢサーバ１０ＡのＣＰＵ使用率が閾値２を超えないような場合、すなわち、処理負荷が高くない通常時の場合には、一括レプリケーションの処理が継続して行われる。この場合には、ＡＣＴ系ＤＢサーバ１０Ａは、重要度の高いデータから先にレプリケーションを実施し、重要度の高いデータのレプリケーションが終了した後に、重要度の低いデータのレプリケーションを実施する。 First, the normal batch replication operation will be described with reference to FIG. As shown in FIG. 5, when the CPU usage rate of the ACT DB server 10A does not exceed the threshold 2, that is, in the normal time when the processing load is not high, the batch replication processing is continuously performed. . In this case, the ACT DB server 10A performs replication first from data with high importance, and performs replication of data with low importance after replication of data with high importance is completed.

続いて、図６を用いて、高負荷時の一括レプリケーションの動作を説明する。図６に示すように、ＡＣＴ系ＤＢサーバ１０Ａは、重要度の高いデータから先にレプリケーションを実施している間に、ＡＣＴ系ＤＢサーバ１０ＡのＣＰＵ使用率が閾値２を超えた場合には、レプリケーションを継続する。そして、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値３を超えた場合には、重要度の高いデータのレプリケーションを中断する。なお、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値３を超えた場合には、サービス提供装置２０からの要求に対して輻輳規制をかけてもよい。 Next, the batch replication operation at a high load will be described with reference to FIG. As shown in FIG. 6, when the ACT DB server 10A exceeds the threshold 2 when the CPU usage rate of the ACT DB server 10A exceeds the threshold value 2 while performing replication from the highly important data first, Continue replication. Then, when the CPU usage rate exceeds the threshold 3, the ACT DB server 10A interrupts replication of highly important data. Note that, when the CPU usage rate exceeds the threshold 3, the ACT DB server 10A may apply congestion restriction to the request from the service providing apparatus 20.

その後、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値１未満となった場合に、重要度の高いデータのレプリケーションを再開する。そして、ＡＣＴ系ＤＢサーバ１０Ａは、重要度の高いデータのレプリケーションが終了した後に、重要度の低いデータのレプリケーションを実施する。 Thereafter, when the CPU usage rate becomes less than the threshold value 1, the ACT DB server 10A resumes replication of highly important data. Then, the ACT DB server 10A performs replication of data with low importance after replication of data with high importance is completed.

その後、ＡＣＴ系ＤＢサーバ１０Ａは、重要度の低いデータのレプリケーションを実施している間に、ＣＰＵ使用率が閾値２を超えた場合には、レプリケーションを中断する。その後、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値１未満となった場合に、重要度の低いデータのレプリケーションを再開する。 Thereafter, the ACT DB server 10A interrupts the replication if the CPU usage rate exceeds the threshold value 2 while performing the replication of the data with low importance. Thereafter, when the CPU usage rate becomes less than the threshold value 1, the ACT DB server 10A resumes replication of data with low importance.

このように、ＡＣＴ系ＤＢサーバ１０Ａは、データの重要度順にレプリケーションを行い、処理負荷とレプリケーション中のデータの重要度に応じて、レプリケーションを中断する。このため、重要度の高いデータの冗長性を確保しつつ、サービスへの影響を最低限に抑えることが可能である。 As described above, the ACT DB server 10A performs replication in the order of importance of data, and interrupts replication according to the processing load and the importance of data being replicated. For this reason, it is possible to minimize the influence on the service while ensuring redundancy of highly important data.

［ＤＢサーバシステムの処理の流れ］
次に、図７を用いて、第一の実施の形態に係るＤＢサーバシステム１００の処理の流れを説明する。図７は、第一の実施の形態に係るＤＢサーバシステムのレプリケーション処理の流れを示すフローチャートである。 [Processing flow of DB server system]
Next, a processing flow of the DB server system 100 according to the first embodiment will be described with reference to FIG. FIG. 7 is a flowchart showing a flow of replication processing of the DB server system according to the first embodiment.

図７に示すように、ＤＢサーバシステム１００のＡＣＴ系ＤＢサーバ１０Ａは、一括レプリケーション中である場合には（ステップＳ１０１肯定）、ＣＰＵ使用率を計測する（ステップＳ１０２）。そして、ＡＣＴ系ＤＢサーバ１０Ａは、重要度の高いデータをレプリケーションしているか否かを判定する（ステップＳ１０３）。 As illustrated in FIG. 7, the ACT DB server 10A of the DB server system 100 measures the CPU usage rate when the batch replication is being performed (Yes in Step S101) (Step S102). Then, the ACT DB server 10A determines whether or not highly important data is being replicated (step S103).

この結果、ＡＣＴ系ＤＢサーバ１０Ａは、重要度の高いデータをレプリケーションしている場合には（ステップＳ１０３肯定）、ＣＰＵ使用率が閾値３を超えたか否かを判定する（ステップＳ１０４）。この結果、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値３を超えている場合には（ステップＳ１０４肯定）、ステップＳ１０６の処理に進む。また、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値３を超えていない場合には（ステップＳ１０４否定）、この処理フローを終了する。 As a result, when the highly important data is replicated (Yes at Step S103), the ACT DB server 10A determines whether or not the CPU usage rate exceeds the threshold 3 (Step S104). As a result, if the CPU usage rate exceeds the threshold 3 (Yes at Step S104), the ACT DB server 10A proceeds to the process at Step S106. In addition, when the CPU usage rate does not exceed the threshold 3 (No at Step S104), the ACT DB server 10A ends this processing flow.

また、ＡＣＴ系ＤＢサーバ１０Ａは、重要度の高いデータをレプリケーションしていない場合（ステップＳ１０３否定）、すなわち、重要度の低いデータをレプリケーションしている場合には、ＣＰＵ使用率が閾値２を超えたか否かを判定する（ステップＳ１０５）。この結果、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値２を超えている場合には（ステップＳ１０５肯定）、ステップＳ１０６の処理に進む。また、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値２を超えていない場合には（ステップＳ１０５否定）、この処理フローを終了する。 In addition, when the ACT DB server 10A does not replicate the data with high importance (No at Step S103), that is, when the data with low importance is replicated, the CPU usage rate exceeds the threshold value 2. It is determined whether or not (step S105). As a result, when the CPU usage rate exceeds the threshold 2 (Yes at Step S105), the ACT DB server 10A proceeds to the process at Step S106. In addition, when the CPU usage rate does not exceed the threshold 2 (No at Step S105), the ACT DB server 10A ends this processing flow.

ステップＳ１０６では、ＡＣＴ系ＤＢサーバ１０Ａは、レプリケーションを中断する。そして、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率を計測し（ステップＳ１０７）、ＣＰＵ使用率が閾値１未満であるか判定する（ステップＳ１０８）。この結果、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値１未満でない場合には（ステップＳ１０８否定）、ステップＳ１０７およびステップＳ１０８の処理を繰り返し行う。また、ＡＣＴ系ＤＢサーバ１０Ａは、ＣＰＵ使用率が閾値１未満である場合には（ステップＳ１０８肯定）、レプリケーションを再開し（ステップＳ１０９）、この処理フローを終了する。 In step S106, the ACT DB server 10A interrupts replication. Then, the ACT DB server 10A measures the CPU usage rate (step S107) and determines whether the CPU usage rate is less than the threshold value 1 (step S108). As a result, if the CPU usage rate is not less than the threshold value 1 (No at Step S108), the ACT DB server 10A repeats the processes at Step S107 and Step S108. If the CPU usage rate is less than the threshold 1 (Yes at Step S108), the ACT DB server 10A resumes replication (Step S109) and ends this processing flow.

［第一の実施の形態の効果］
このように、第一の実施の形態に係るＤＢサーバシステム１００のＡＣＴ系ＤＢサーバ１０Ａは、ＡＣＴ系ＤＢサーバ１０ＡのＤＢ１５Ａに記憶されたデータのうち、重要度が高いデータとして設定されたデータを重要度が低いデータとして設定されたデータよりも優先的にＡＣＴ系ＤＢサーバ１０ＡからＳＢＹ系ＤＢサーバ１０Ｂへレプリケーションするように制御する。そして、ＡＣＴ系ＤＢサーバ１０Ａは、重要度が低いデータのレプリケーションが行われている際に、ＡＣＴ系ＤＢサーバの処理負荷が閾値２を超えた場合には、レプリケーションを中断する。このため、一括レプリケーションを行う際のサービス提供への影響を低減しつつ、重要なデータの冗長性を迅速に確保することが可能である。 [Effect of the first embodiment]
As described above, the ACT DB server 10A of the DB server system 100 according to the first embodiment uses the data set as data having high importance among the data stored in the DB 15A of the ACT DB server 10A. Control is performed so that replication is performed from the ACT DB server 10A to the SBY DB server 10B preferentially over data set as data of low importance. Then, the ACT DB server 10A interrupts the replication when the processing load of the ACT DB server exceeds the threshold 2 while the replication of the data with low importance is performed. For this reason, it is possible to quickly ensure the redundancy of important data while reducing the influence on service provision when performing batch replication.

つまり、従来の技術では、図８に例示するように、一括レプリケーションとサービス要求処理の集中による高負荷状態となった場合には、サービス要求に対する輻輳規制がかかるため、サービスに影響が発生する。このため、第一の実施の形態に係るＤＢサーバシステム１００のＡＣＴ系ＤＢサーバ１０Ａは、データの重要度順にレプリケーションを行い、処理負荷とレプリケーション中のデータの重要度に応じて、レプリケーションを中断することで、重要度の高いデータの冗長性を確保しつつ、サービスへの影響を最低限に抑えることが可能である。 That is, in the conventional technology, as illustrated in FIG. 8, in a high load state due to concentration of collective replication and service request processing, congestion is restricted for the service request, which affects the service. For this reason, the ACT DB server 10A of the DB server system 100 according to the first embodiment performs replication in the order of the importance of the data, and interrupts the replication according to the processing load and the importance of the data being replicated. As a result, it is possible to minimize the influence on the service while ensuring redundancy of highly important data.

また、第一の実施の形態に係るＤＢサーバシステム１００のＡＣＴ系ＤＢサーバ１０Ａは、重要度が高いデータのレプリケーションが行われている際に、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が閾値２よりも高い値である閾値３を超えた場合には、レプリケーションを中断する。これにより、ＤＢサーバシステム１００は、例えば、サービス要求に対する輻輳規制がかかるほどにＡＣＴ系ＤＢサーバ１０Ａが高負荷状態となった場合には、重要度の低いデータだけでなく、重要度が高いデータについてもレプリケーションを中断するので、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷を一旦抑えることが可能である。 Further, the ACT DB server 10A of the DB server system 100 according to the first embodiment is configured such that the processing load of the ACT DB server 10A is higher than the threshold 2 when replication of highly important data is performed. When the threshold value 3, which is a high value, is exceeded, replication is interrupted. Thereby, for example, when the ACT DB server 10A is in a high load state to the extent that the congestion restriction on the service request is applied, the DB server system 100 is not only data with low importance but also data with high importance. Since the replication is interrupted as well, the processing load on the ACT DB server 10A can be temporarily suppressed.

また、第一の実施の形態に係るＤＢサーバシステム１００のＡＣＴ系ＤＢサーバ１０Ａは、レプリケーションが中断された後、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が閾値２および閾値３よりも低い値である閾値１未満となった場合には、レプリケーションを再開する。このため、ＤＢサーバシステム１００は、レプリケーションが中断した後であっても、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷が軽減された場合には、レプリケーションを自動的に再開することが可能である。 Further, the ACT DB server 10A of the DB server system 100 according to the first embodiment has a threshold that the processing load of the ACT DB server 10A is lower than the threshold 2 and the threshold 3 after the replication is interrupted. When it becomes less than 1, replication is resumed. For this reason, even after the replication is interrupted, the DB server system 100 can automatically resume the replication if the processing load on the ACT DB server 10A is reduced.

また、第一の実施の形態に係るＤＢサーバシステム１００のＡＣＴ系ＤＢサーバ１０Ａは、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷として、ＡＣＴ系ＤＢサーバ１０ＡのＣＰＵ使用率を計測する。このように、ＡＣＴ系ＤＢサーバ１０Ａは、ＡＣＴ系ＤＢサーバ１０Ａの処理負荷として、ＣＰＵ使用率を計測するので、レプリケーションの処理とサービス提供装置２０からの要求に対する処理とにより、ＡＣＴ系ＤＢサーバ１０Ａにどの程度の負荷が生じているかを適切に把握することが可能である。 The ACT DB server 10A of the DB server system 100 according to the first embodiment measures the CPU usage rate of the ACT DB server 10A as the processing load of the ACT DB server 10A. Thus, since the ACT DB server 10A measures the CPU usage rate as the processing load of the ACT DB server 10A, the ACT DB server 10A is processed by the replication process and the process for the request from the service providing apparatus 20. Thus, it is possible to appropriately grasp how much load is generated.

［システム構成等］
また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。さらに、各装置にて行なわれる各処理機能は、その全部または任意の一部が、ＣＰＵおよび当該ＣＰＵにて解析実行されるプログラムにて実現され、あるいは、ワイヤードロジックによるハードウェアとして実現され得る。 [System configuration, etc.]
Further, each component of each illustrated apparatus is functionally conceptual, and does not necessarily need to be physically configured as illustrated. In other words, the specific form of distribution / integration of each device is not limited to that shown in the figure, and all or a part thereof may be functionally or physically distributed or arbitrarily distributed in arbitrary units according to various loads or usage conditions. Can be integrated and configured. Further, all or any part of each processing function performed in each device may be realized by a CPU and a program analyzed and executed by the CPU, or may be realized as hardware by wired logic.

また、本実施の形態において説明した各処理のうち、自動的におこなわれるものとして説明した処理の全部または一部を手動的におこなうこともでき、あるいは、手動的におこなわれるものとして説明した処理の全部または一部を公知の方法で自動的におこなうこともできる。この他、上記文書中や図面中で示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。 In addition, among the processes described in this embodiment, all or part of the processes described as being automatically performed can be manually performed, or the processes described as being manually performed All or a part of the above can be automatically performed by a known method. In addition, the processing procedure, control procedure, specific name, and information including various data and parameters shown in the above-described document and drawings can be arbitrarily changed unless otherwise specified.

［プログラム］
図９は、レプリケーションプログラムを実行するコンピュータを示す図である。コンピュータ１０００は、例えば、メモリ１０１０、ＣＰＵ（Central Processing Unit）１０２０を有する。また、コンピュータ１０００は、ハードディスクドライブインタフェース１０３０、ディスクドライブインタフェース１０４０、シリアルポートインタフェース１０５０、ビデオアダプタ１０６０、ネットワークインタフェース１０７０を有する。これらの各部は、バス１０８０によって接続される。 [program]
FIG. 9 is a diagram illustrating a computer that executes a replication program. The computer 1000 includes, for example, a memory 1010 and a CPU (Central Processing Unit) 1020. The computer 1000 also includes a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. These units are connected by a bus 1080.

メモリ１０１０は、ＲＯＭ（Read Only Memory）１０１１及びＲＡＭ（Random Access Memory）１０１２を含む。ＲＯＭ１０１１は、例えば、ＢＩＯＳ（Basic Input Output System）等のブートプログラムを記憶する。ハードディスクドライブインタフェース１０３０は、ハードディスクドライブ１０９０に接続される。ディスクドライブインタフェース１０４０は、ディスクドライブ１１００に接続される。例えば磁気ディスクや光ディスク等の着脱可能な記憶媒体が、ディスクドライブ１１００に挿入される。シリアルポートインタフェース１０５０は、例えばマウス１０５１、キーボード１０５２に接続される。ビデオアダプタ１０６０は、例えばディスプレイ１０６１に接続される。 The memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM (Random Access Memory) 1012. The ROM 1011 stores a boot program such as BIOS (Basic Input Output System). The hard disk drive interface 1030 is connected to the hard disk drive 1090. The disk drive interface 1040 is connected to the disk drive 1100. For example, a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1100. The serial port interface 1050 is connected to a mouse 1051 and a keyboard 1052, for example. The video adapter 1060 is connected to the display 1061, for example.

ハードディスクドライブ１０９０は、例えば、ＯＳ１０９１、アプリケーションプログラム１０９２、プログラムモジュール１０９３、プログラムデータ１０９４を記憶する。すなわち、ＡＣＴ系ＤＢサーバ１０Ａの各処理を規定するプログラムは、コンピュータにより実行可能なコードが記述されたプログラムモジュール１０９３として実装される。プログラムモジュール１０９３は、例えばハードディスクドライブ１０９０に記憶される。例えば、装置における機能構成と同様の処理を実行するためのプログラムモジュール１０９３が、ハードディスクドライブ１０９０に記憶される。なお、ハードディスクドライブ１０９０は、ＳＳＤ（Solid State Drive）により代替されてもよい。 The hard disk drive 1090 stores, for example, an OS 1091, an application program 1092, a program module 1093, and program data 1094. That is, a program that defines each process of the ACT DB server 10A is implemented as a program module 1093 in which a code executable by a computer is described. The program module 1093 is stored in the hard disk drive 1090, for example. For example, a program module 1093 for executing processing similar to the functional configuration in the apparatus is stored in the hard disk drive 1090. The hard disk drive 1090 may be replaced by an SSD (Solid State Drive).

また、上述した実施の形態の処理で用いられる設定データは、プログラムデータ１０９４として、例えばメモリ１０１０やハードディスクドライブ１０９０に記憶される。そして、ＣＰＵ１０２０が、メモリ１０１０やハードディスクドライブ１０９０に記憶されたプログラムモジュール１０９３やプログラムデータ１０９４を必要に応じてＲＡＭ１０１２に読み出して実行する。 The setting data used in the processing of the above-described embodiment is stored as program data 1094 in, for example, the memory 1010 or the hard disk drive 1090. Then, the CPU 1020 reads the program module 1093 and the program data 1094 stored in the memory 1010 and the hard disk drive 1090 to the RAM 1012 and executes them as necessary.

なお、プログラムモジュール１０９３やプログラムデータ１０９４は、ハードディスクドライブ１０９０に記憶される場合に限らず、例えば着脱可能な記憶媒体に記憶され、ディスクドライブ１１００等を介してＣＰＵ１０２０によって読み出されてもよい。あるいは、プログラムモジュール１０９３及びプログラムデータ１０９４は、ネットワーク（ＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）等）を介して接続された他のコンピュータに記憶されてもよい。そして、プログラムモジュール１０９３及びプログラムデータ１０９４は、他のコンピュータから、ネットワークインタフェース１０７０を介してＣＰＵ１０２０によって読み出されてもよい。 The program module 1093 and the program data 1094 are not limited to being stored in the hard disk drive 1090, but may be stored in, for example, a removable storage medium and read out by the CPU 1020 via the disk drive 1100 or the like. Alternatively, the program module 1093 and the program data 1094 may be stored in another computer connected via a network (LAN (Local Area Network), WAN (Wide Area Network), etc.). Then, the program module 1093 and the program data 1094 may be read by the CPU 1020 from another computer via the network interface 1070.

１０ＡＡＣＴ系ＤＢサーバ
１０ＢＳＢＹ系ＤＢサーバ
１１通信処理部
１２入力部
１３出力部
１４制御部
１４ａレプリケーション制御部
１４ｂ計測部
１４ｃ中断部
１４ｄ再開部
１５Ａ、１５ＢＤＢ
２０サービス提供装置
１００ＤＢサーバシステム 10A ACT DB server 10B SBY DB server 11 Communication processing unit 12 Input unit 13 Output unit 14 Control unit 14a Replication control unit 14b Measurement unit 14c Interruption unit 14d Resume unit 15A, 15B DB
20 Service providing apparatus 100 DB server system

Claims

A server system having an active server and a standby server,
Of the data stored in the active server, data set as data with high importance is replicated from the active server to the standby server preferentially over data set as data with low importance A replication control unit to control
When replication of data with low importance is performed by the replication control unit, if the processing load of the active server exceeds a first threshold, an interruption unit that interrupts replication,
A server system comprising:

The interruption unit exceeds a second threshold value when the processing load of the active server is higher than the first threshold value when replication of highly important data is performed by the replication control unit. The server system according to claim 1, wherein replication is interrupted in the event of a failure.

After replication is interrupted by the interrupting unit, if the processing load of the active server becomes less than a third threshold that is lower than the first threshold and the second threshold, replication is performed. The server system according to claim 2, further comprising a resuming unit for resuming.

The server system according to claim 1, further comprising a measuring unit that measures a CPU usage rate of the active server as a processing load of the active server.

A replication method executed by a server system having an active server and a standby server,
Of the data stored in the active server, data set as data with high importance is replicated from the active server to the standby server preferentially over data set as data with low importance A replication control process to control,
When replication of data with low importance is performed by the replication control step, if the processing load of the active server exceeds a first threshold, an interruption step of interrupting replication,
The replication method characterized by including.