JP6200387B2

JP6200387B2 - Redundant system and replication method

Info

Publication number: JP6200387B2
Application number: JP2014164431A
Authority: JP
Inventors: 信一河村; 秋山　悟; 悟秋山
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2014-08-12
Filing date: 2014-08-12
Publication date: 2017-09-20
Anticipated expiration: 2034-08-12
Also published as: JP2016040675A

Description

本発明は、マスタ‐スレーブの冗長化システム、およびレプリケーション方法に関する。 The present invention relates to a master-slave redundancy system and a replication method.

マスタ/スレーブの２重化の冗長構成のＤＢ（データベース）システムにおいて、２重故障対策を実施する場合、３重化（マスタ/スレーブ/スレーブ）の冗長構成とする場合と、Ｎ＋１（予備機）構成とすることが考えられる。 In a DB / database system with a redundant configuration of master / slave, when double countermeasures are implemented, a redundant configuration of triple (master / slave / slave) and N + 1 (spare unit) It is conceivable to have a configuration.

また、特許文献１には、マスタ/スレーブの２重化の冗長構成において、ＤＢの差分情報を記憶するために必要な記憶領域の肥大化を防ぐ技術が開示されている。 Patent Document 1 discloses a technique for preventing an enlargement of a storage area necessary for storing DB difference information in a master / slave redundant configuration.

特開2013-171483号公報JP 2013-171483 A

３重化（マスタ/スレーブ/スレーブ）の冗長構成とする場合、図８に示すように、ファミリー毎に分散されたＤＢシステムにおいては、ファミリー毎にスレーブが必要となるため、スレーブの構築費用が増大するという問題がある。 In the case of a triple (master / slave / slave) redundant configuration, as shown in FIG. 8, in the DB system distributed for each family, a slave is required for each family. There is a problem of increasing.

また、Ｎ＋１構成とする場合、図９に示すように、ファミリー＃１のスレーブが故障して、故障発生後に予備機をスレーブとしてファミリー＃１に組み込むための作業が発生し、時間と操作負荷を要するという問題がある。なお、予備機をファミリー＃１に組み込むため作業としては、マスタのデータのレプリケーション作業、ＩＰアドレスなどの所データの設定作業などがある。 Further, in the case of the N + 1 configuration, as shown in FIG. 9, the slave of the family # 1 fails, and after the failure occurs, work for incorporating the spare machine into the family # 1 as a slave occurs. There is a problem that it takes. The work for incorporating the spare machine into the family # 1 includes a master data replication work, a place data setting work such as an IP address, and the like.

本発明は、上記の事情に鑑みなされたものであって、本発明の目的は、冗長化システムの構築費用を削減するとともに、２重故障時でも処理を止めることなく運用することにある。 The present invention has been made in view of the above circumstances, and an object of the present invention is to reduce the construction cost of a redundant system and to operate without stopping processing even in the case of a double failure.

上記の課題を解決するために、本発明は、冗長化システムであって、複数の冗長化サブシステムと、共用ＤＢサーバとを備え、各冗長化サブシステムは、マスタＤＢサーバと、第１スレーブＤＢサーバとを備え、前記共用ＤＢサーバは、前記複数の冗長化サブシステムで共用する第２スレーブＤＢサーバであり、各冗長化サブシステムのマスタＤＢサーバは、当該マスタＤＢサーバが備えるマスタＤＢへの更新が発生した場合、第１スレーブＤＢサーバおよび前記共用ＤＢサーバへのレプリケーションを行い、前記冗長化サブシステムのマスタＤＢサーバおよび第１スレーブＤＢサーバが故障した場合、前記共用ＤＢサーバが当該冗長化サブシステムのマスタＤＢサーバとして動作するとともに、他の冗長化サブシステムからの当該共用ＤＢサーバへのレプリケーションを停止する。 In order to solve the above problems, the present invention is a redundant system comprising a plurality of redundant subsystems and a shared DB server, wherein each redundant subsystem includes a master DB server and a first slave. The shared DB server is a second slave DB server shared by the plurality of redundant subsystems, and the master DB server of each redundant subsystem is transferred to the master DB included in the master DB server. If the update has occurred, have rows replication to the first slave DB server and the shared DB server, if the master DB server and the first slave DB server of the redundant sub-system fails, the shared DB server the Operates as a master DB server of a redundant subsystem and the shared DB from other redundant subsystems To stop the replication of the over server.

本発明は、複数の冗長化サブシステムと、共用ＤＢサーバとを備える冗長化システムが行うレプリケーション方法であって、各冗長化サブシステムは、マスタＤＢサーバと、第１スレーブＤＢサーバとを備え、前記共用ＤＢサーバは、各冗長化サブシステムで共用する第２スレーブＤＢサーバであり、各冗長化サブシステムのマスタＤＢサーバは、前記マスタＤＢサーバへの更新を行う更新ステップと、第１スレーブＤＢサーバおよび前記共用ＤＢサーバへのレプリケーションを行うレプリケーションステップと、を行い、前記冗長化サブシステムのマスタＤＢサーバおよび第１スレーブＤＢサーバが故障した場合、前記共用ＤＢサーバは、当該冗長化サブシステムをマスタＤＢサーバとして切り替える切替ステップと、他の冗長化サブシステムからの当該共用ＤＢサーバへのレプリケーションを停止する停止ステップと、を行う。 The present invention is a replication method performed by a redundant system comprising a plurality of redundant subsystems and a shared DB server, each redundant subsystem comprising a master DB server and a first slave DB server, The shared DB server is a second slave DB server shared by each redundant subsystem, and the master DB server of each redundant subsystem includes an update step for updating to the master DB server , and a first slave DB There lines and replication step, the performing replication to the server and the shared DB server, if the master DB server and the first slave DB server of the redundant sub-system fails, the shared DB server, the redundant subsystem Switching step to switch as a master DB server and other redundant subsystems A stop step of stopping the replication to the shared DB server from beam, is carried out.

本発明によれば、冗長化システムの構築費用を削減することができるとともに、２重故障時でも処理を止めることなく運用することができる。 According to the present invention, it is possible to reduce the construction cost of a redundant system and to operate without stopping processing even in the case of a double failure.

本発明の実施形態の冗長化システムの全体構成図である。1 is an overall configuration diagram of a redundant system according to an embodiment of the present invention. マスタＤＢサーバ構成を示すブロック図である。It is a block diagram which shows a master DB server structure. 第１スレーブＤＢサーバの構成を示すブロック図である。It is a block diagram which shows the structure of a 1st slave DB server. 共用ＤＢサーバの構成を示すブロック図である。It is a block diagram which shows the structure of a shared DB server. 通常時の冗長化システムの動作を説明するための図である。It is a figure for demonstrating operation | movement of the redundancy system at the time of normal. 片系故障時の冗長化システムの動作を説明するための図である。It is a figure for demonstrating operation | movement of the redundancy system at the time of a one-system failure. ２重故障時の冗長化システムの動作を説明するための図である。It is a figure for demonstrating operation | movement of the redundancy system at the time of a double failure. ３重化（マスタ/スレーブ/スレーブ）の冗長構成を示す図である。It is a figure which shows the redundant structure of triple (master / slave / slave). Ｎ＋１の冗長構成を示す図である。It is a figure which shows the redundant structure of N + 1.

以下、本発明の実施の形態を、図面を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図１は、本実施形態のＤＢサーバシステム（冗長化システム）の全体構成図である。 FIG. 1 is an overall configuration diagram of a DB server system (redundant system) of this embodiment.

図示するＤＢサーバシステムは、複数のファミリー＃１、＃２、＃３（複数の冗長化サブシステム）１、２、３と、共用ＤＢサーバ４とを備え、ファミリー毎にＤＢが分散された分散ＤＢサーバシステムである。 The illustrated DB server system includes a plurality of families # 1, # 2, and # 3 (a plurality of redundant subsystems) 1, 2, and 3, and a shared DB server 4, and the DB is distributed for each family. This is a DB server system.

各ファミリー１、２、３は、それぞれ対応するＳＩＰサーバ５、６、７（アプリケーションサーバ）の冗長構成のＤＢとして動作し、マスタＤＢサーバと、第１スレーブサーバとを備える。図示する例では、ファミリー＃１＿１は、ＳＩＰサーバ５のＤＢサブシステムであって、マスタＤＢサーバ１１と第１スレーブＤＢサーバ１２とを備える。ファミリー＃２＿２は、ＳＩＰサーバ６のＤＢサブシステムであって、マスタＤＢサーバ２１と第１スレーブＤＢサーバ２２とを備える。ファミリー＃３＿３は、ＳＩＰサーバ７のＤＢサブシステムであって、マスタＤＢサーバ３１と第１スレーブＤＢサーバ３２とを備える。 Each family 1, 2, and 3 operates as a redundant DB of the corresponding SIP servers 5, 6, and 7 (application servers), and includes a master DB server and a first slave server. In the illustrated example, the family # 1_1 is a DB subsystem of the SIP server 5 and includes a master DB server 11 and a first slave DB server 12. Family # 2_2 is a DB subsystem of the SIP server 6 and includes a master DB server 21 and a first slave DB server 22. Family # 3_3 is a DB subsystem of the SIP server 7 and includes a master DB server 31 and a first slave DB server 32.

なお、本実施形態では、アプリケーションサーバとして、呼処理を行うＳＩＰ（Session Initiation Protocol）サーバ５、６、７を用いることとするが、ＳＩＰサーバ５、６、７に限定されるものではない。図示する例では、電話番号に応じてＳＩＰサーバが振り分けられ、各ファミリー１、２、３には対応する電話番号の電話番号情報（加入者情報など）が記憶されているものとする。 In the present embodiment, SIP (Session Initiation Protocol) servers 5, 6, and 7 that perform call processing are used as application servers. However, the application server is not limited to the SIP servers 5, 6, and 7. In the illustrated example, it is assumed that SIP servers are allocated according to telephone numbers, and telephone numbers information (subscriber information, etc.) of corresponding telephone numbers are stored in the families 1, 2, and 3.

共用ＤＢサーバ４は、複数のファミリー１、２、３の第２スレーブＤＢサーバとして、ネットワーク上に構築される。これにより、各ファミリー１、２、３は、マスタＤＢサーバと、第１スレーブＤＢサーバと、第２スレーブＤＢサーバ（共用ＤＢサーバ４）の３重化の冗長構成となり、２重故障対策を実現することができる。 The shared DB server 4 is constructed on the network as a second slave DB server of a plurality of families 1, 2, and 3. As a result, each family 1, 2, 3 has a triple redundant configuration of the master DB server, the first slave DB server, and the second slave DB server (shared DB server 4), and implements countermeasures against double failures. can do.

図２は、ファミリー＃１＿１のマスタＤＢサーバ１１の構成を示すブロック図である。図示するマスタＤＢサーバ１１は、宛先解決部１１０と、更新部１１１と、レプリケーション部１１２と、マスタＤＢ１１３とを備える。 FIG. 2 is a block diagram illustrating a configuration of the master DB server 11 of the family # 1_1. The illustrated master DB server 11 includes a destination resolution unit 110, an update unit 111, a replication unit 112, and a master DB 113.

宛先解決部１１０は、ＳＩＰサーバ５から、電話番号を含む宛先解決要求（データ参照要求）を受信すると、要求された電話番号の電話番号情報をマスタＤＢ１１３から読み出し、ＳＩＰサーバ５に送信する。更新部１１１は、ＳＩＰサーバ５から、データ更新要求を受信すると、マスタＤＢ１１３の要求されたデータを更新する。 Upon receiving a destination resolution request (data reference request) including a telephone number from the SIP server 5, the destination resolution unit 110 reads the telephone number information of the requested telephone number from the master DB 113 and transmits it to the SIP server 5. When receiving the data update request from the SIP server 5, the update unit 111 updates the requested data in the master DB 113.

レプリケーション部１１２は、更新部１１１によるマスタＤＢ１１３のデータ更新処理が発生した場合、当該更新内容を第１スレーブＤＢサーバ１２および共用ＤＢサーバ４に送信し、マスタＤＢ１１３への更新内容を第１および第２スレーブＤＢに反映させるレプリケーション（同期処理）を行う。これにより、マスタＤＢ１１３と、第１および第２スレーブＤＢとが常に同期した状態になる。マスタＤＢ１１３は、元となるオリジナルデータが格納されるＤＢである。 When the data update process of the master DB 113 by the update unit 111 occurs, the replication unit 112 transmits the update content to the first slave DB server 12 and the shared DB server 4, and updates the update content to the master DB 113 to the first and first 2. Perform replication (synchronization processing) to be reflected in the slave DB. Thereby, the master DB 113 and the first and second slave DBs are always synchronized. The master DB 113 is a DB that stores original original data.

なお、ファミリー＃２＿２およびファミリー＃３のマスタＤＢサーバ２１、３１は、図２に示すマスタＤＢサーバ１１と同様であるため、ここでは説明を省略する。 The master DB servers 21 and 31 of the family # 2_2 and the family # 3 are the same as the master DB server 11 shown in FIG.

図３は、ファミリー＃１＿１の第１スレーブＤＢサーバ１２の構成を示すブロック図である。図示する第１スレーブＤＢサーバ１２は、宛先解決部１２０と、更新部１２１と、レプリケーション部１２２と、第１スレーブＤＢ１２３とを備える。 FIG. 3 is a block diagram illustrating a configuration of the first slave DB server 12 of the family # 1_1. The illustrated first slave DB server 12 includes a destination resolution unit 120, an update unit 121, a replication unit 122, and a first slave DB 123.

宛先解決部１２０および更新部１２１は、マスタＤＢサーバ１１の宛先解決部１１０および更新部１１１と同様である。なお、宛先解決部１２０および更新部１２１は、通常時においては動作しない。第１スレーブＤＢサーバ１２は、マスタＤＢサーバ１１が故障し、自身がマスタＤＢサーバとして動作する場合に、宛先解決部１２０および更新部１２１を起動する。 The destination resolution unit 120 and the update unit 121 are the same as the destination resolution unit 110 and the update unit 111 of the master DB server 11. Note that the destination resolution unit 120 and the update unit 121 do not operate during normal times. The first slave DB server 12 activates the destination resolution unit 120 and the update unit 121 when the master DB server 11 fails and operates as a master DB server.

レプリケーション部１２２は、マスタＤＢサーバ１１のレプリケーション部１１２から送信される更新データを受信し、第１スレーブＤＢ１２３をマスタＤＢ１１３と同期させるように、第１スレーブＤＢ１２３を更新する。第１スレーブＤＢ１２３は、マスタＤＢ１１３のレプリカ（複製データ）が格納されるＤＢである。 The replication unit 122 receives the update data transmitted from the replication unit 112 of the master DB server 11 and updates the first slave DB 123 so that the first slave DB 123 is synchronized with the master DB 113. The first slave DB 123 is a DB in which a replica (replicated data) of the master DB 113 is stored.

なお、ファミリー＃２＿２およびファミリー＃３＿３の第１スレーブＤＢサーバ２２、３２は、図３に示す第１スレーブＤＢサーバ１２と同様であるため、ここでは説明を省略する。 Note that the first slave DB servers 22 and 32 of the family # 2_2 and family # 3_3 are the same as the first slave DB server 12 shown in FIG.

図４は、共用ＤＢサーバ４の構成を示すブロック図である。図示する共用ＤＢサーバ４は、宛先解決部４０と、更新部４１と、レプリケーション部４２と、レプリケーション要求部４３と、マスタ切替部４４と、共用ＤＢ４５とを備える。 FIG. 4 is a block diagram showing the configuration of the shared DB server 4. The shared DB server 4 shown includes a destination resolution unit 40, an update unit 41, a replication unit 42, a replication request unit 43, a master switching unit 44, and a shared DB 45.

宛先解決部４０および更新部４１は、マスタＤＢサーバ１１の宛先解決部１１０と、更新部１１１と同様である。なお、１台のサーバで複数ファミリーの共用第２スレーブとして使用するため、リソースの枯渇を回避する必要があることから、宛先解決部４０および更新部４１は、通常時においては動作しない。共用ＤＢサーバ４は、マスタＤＢサーバ１１および第１スレーブＤＢサーバ１２が故障し、自身がマスタＤＢサーバとして動作する場合に、宛先解決部４０および更新部４１を起動する。 The destination resolution unit 40 and the update unit 41 are the same as the destination resolution unit 110 and the update unit 111 of the master DB server 11. Since one server is used as a shared second slave of a plurality of families, it is necessary to avoid resource depletion. Therefore, the destination resolving unit 40 and the updating unit 41 do not operate during normal times. The shared DB server 4 activates the destination resolution unit 40 and the update unit 41 when the master DB server 11 and the first slave DB server 12 fail and operate as a master DB server.

レプリケーション部１２２は、マスタＤＢサーバ１１のレプリケーション部１１２から送信される更新データを受信し、共用ＤＢ４５のファミリー＃１用データをマスタＤＢ１１３と同期させるように、共用ＤＢ４５のファミリー＃１用データを更新する。 The replication unit 122 receives the update data transmitted from the replication unit 112 of the master DB server 11 and updates the family # 1 data of the shared DB 45 so as to synchronize the family # 1 data of the shared DB 45 with the master DB 113. To do.

レプリケーション要求部４３は、所定の時間、マスタＤＢサーバからのレプリケーションがない場合に、当該マスタＤＢサーバにレプリケーション要求を送信する。マスタ切替部４４は、あるファミリーにおいて、マスタＤＢサーバおよび第１スレーブＤＢサーバの２重故障が発生した場合、自身が当該ファミリーのマスタＤＢサーバとして動作するよう切り替える。 The replication request unit 43 transmits a replication request to the master DB server when there is no replication from the master DB server for a predetermined time. When a double failure occurs in the master DB server and the first slave DB server in a certain family, the master switching unit 44 switches itself so that it operates as a master DB server of the family.

共用ＤＢ４５は、全てのファミリーが共用する第２スレーブＤＢであって、図示する共用ＤＢ４５には、ファミリー＃１のマスタＤＢ１１３のレプリカ（複製データ）であるファミリー＃１用データと、ファミリー＃２のマスタＤＢのレプリカであるファミリー＃２用データと、ファミリー＃３のマスタＤＢのレプリカであるファミリー＃３用データとが格納される。 The shared DB 45 is a second slave DB shared by all families. The shared DB 45 shown in the figure includes data for the family # 1, which is a replica (replicated data) of the master DB 113 of the family # 1, and the family # 2. Data for family # 2, which is a replica of the master DB, and data for family # 3, which is a replica of the master DB of family # 3, are stored.

上記説明した、マスタＤＢサーバ１１、２１、３１、第１スレーブＤＢサーバ１２、２２、３２および共用ＤＢサーバ４は、例えば、ＣＰＵと、メモリと、ハードディスク等の外部記憶装置と、入力装置と、出力装置とを備えた汎用的なコンピュータシステムを用いることができる。このコンピュータシステムにおいて、ＣＰＵがメモリ上にロードされた所定のプログラムを実行することにより、各部の各機能が実現される。例えば、マスタＤＢサーバ、第１スレーブＤＢサーバおよび共用ＤＢサーバの各機能は、マスタＤＢサーバ用のプログラムの場合はマスタＤＢサーバのＣＰＵが、第１スレーブＤＢサーバ用のプログラムの場合は第１スレーブＤＢサーバのＣＰＵが、そして、共用ＤＢサーバ用のプログラムの場合は共用ＤＢサーバのＣＰＵがそれぞれ実行することにより実現される。 The master DB servers 11, 21, 31, the first slave DB servers 12, 22, 32 and the shared DB server 4 described above include, for example, a CPU, a memory, an external storage device such as a hard disk, an input device, A general-purpose computer system including an output device can be used. In this computer system, each function of each unit is realized by the CPU executing a predetermined program loaded on the memory. For example, the functions of the master DB server, the first slave DB server, and the shared DB server are as follows: the master DB server CPU in the case of the master DB server program, and the first slave in the case of the first slave DB server program. This is realized by executing the CPU of the DB server and, in the case of a program for the shared DB server, the CPU of the shared DB server.

また、マスタＤＢサーバ用のプログラム、第１スレーブＤＢサーバ用のプログラムおよび共用ＤＢサーバ用のプログラムは、ハードディスク、フレキシブルディスク、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤ−ＲＯＭなどのコンピュータ読取り可能な記録媒体に記憶することも、ネットワークを介して配信することもできる。 The master DB server program, the first slave DB server program, and the shared DB server program are stored in a computer-readable recording medium such as a hard disk, a flexible disk, a CD-ROM, an MO, and a DVD-ROM. Or distributed over a network.

次に、本実施形態の動作について説明する。 Next, the operation of this embodiment will be described.

図５は、本実施形態の通常時の動作を説明するための図である。図示する例では、ファミリー＃１において、データ更新処理が発生した場合を例に説明する。 FIG. 5 is a diagram for explaining the normal operation of the present embodiment. In the illustrated example, a case where data update processing occurs in the family # 1 will be described as an example.

マスタＤＢサーバ１１の更新部１１１は、ＳＩＰサーバ５からのデータ更新要求を受信すると、マスタＤＢ１１３を更新する（Ｓ１１）。そして、レプリケーション部１１２は、第１スレーブＤＢサーバ１２へのレプリケーションを行うとともに（Ｓ１２）、共用ＤＢサーバ４へのレプリケーションを行う（Ｓ１３）。すなわち、マスタＤＢ１１３への更新内容を第１スレーブＤＢ１２３および共用ＤＢ４５（ファミリー＃１用の第２スレーブＤＢ）に反映させるために、更新内容を含むレプリケーション指示を送信する。 When receiving the data update request from the SIP server 5, the update unit 111 of the master DB server 11 updates the master DB 113 (S11). The replication unit 112 performs replication to the first slave DB server 12 (S12) and replication to the shared DB server 4 (S13). That is, in order to reflect the update contents to the master DB 113 in the first slave DB 123 and the shared DB 45 (second slave DB for family # 1), a replication instruction including the update contents is transmitted.

これにより、第１スレーブＤＢサーバ１２のレプリケーション部１２２は、マスタＤＢサーバ１１から送信された更新内容に基づいて、第１スレーブＤＢ１２３を更新する。また、共用ＤＢサーバ４のレプリケーション部４２は、マスタＤＢサーバ１１から送信された更新内容に基づいて、共用ＤＢ４５を更新する。このような処理により、マスタＤＢ１１３と、第１スレーブＤＢ１２３と、第２スレーブＤＢである共用ＤＢ４５とが同期される。 Thereby, the replication unit 122 of the first slave DB server 12 updates the first slave DB 123 based on the update content transmitted from the master DB server 11. Further, the replication unit 42 of the shared DB server 4 updates the shared DB 45 based on the update content transmitted from the master DB server 11. By such processing, the master DB 113, the first slave DB 123, and the shared DB 45 that is the second slave DB are synchronized.

また、共用ＤＢサーバ４のレプリケーション要求部４３は、所定の時間、マスタＤＢサーバ１１からのレプリケーション指示を受信しない場合、自律的にマスタＤＢサーバ１１にレプリケーション要求を送信する（Ｓ１４）。 Further, when the replication request unit 43 of the shared DB server 4 does not receive a replication instruction from the master DB server 11 for a predetermined time, it autonomously transmits a replication request to the master DB server 11 (S14).

これにより、マスタＤＢサーバ１１のレプリケーション部１１２は、未送信のレプリケーションがある場合は、レプリケーション指示を共用ＤＢサーバ４に送信する。一方、マスタＤＢサーバ１１からの応答がない場合、共用ＤＢサーバ４のレプリケーション要求部４３は、マスタＤＢサーバ１１が故障したこと（障害発生）を検知する。故障検知後の処理については、後述する。 Thereby, the replication unit 112 of the master DB server 11 transmits a replication instruction to the shared DB server 4 when there is unsent replication. On the other hand, when there is no response from the master DB server 11, the replication request unit 43 of the shared DB server 4 detects that the master DB server 11 has failed (failure occurs). Processing after failure detection will be described later.

図６は、本実施形態の片系故障時（第１スレーブＤＢサーバ故障時）の動作を説明するための図である。図示する例では、ファミリー＃１の第１スレーブＤＢサーバ１２が故障した場合を例に説明する。 FIG. 6 is a diagram for explaining the operation when a single system failure occurs (when the first slave DB server fails) according to this embodiment. In the illustrated example, a case where the first slave DB server 12 of the family # 1 fails will be described as an example.

第１スレーブＤＢサーバ１２が故障すると（Ｓ２１）、マスタＤＢサーバ１１は、当該故障を検知し、その後は、マスタＤＢ１１３および共用ＤＢ４５（第２スレーブＤＢ）の２重化の冗長構成として動作する。 When the first slave DB server 12 fails (S21), the master DB server 11 detects the failure, and thereafter operates as a redundant redundant configuration of the master DB 113 and the shared DB 45 (second slave DB).

すなわち、その後、マスタＤＢサーバ１１の更新部１１１は、ＳＩＰサーバ５からのデータ更新要求を受信すると、マスタＤＢ１１３を更新する（Ｓ２２）。そして、レプリケーション部１１２は、共用ＤＢサーバ４へのレプリケーションを行う（Ｓ２３）。すなわち、マスタＤＢ１１３への更新内容を共用ＤＢ４５（ファミリー＃１用の第２スレーブＤＢ）に反映させるために、更新内容を含むレプリケーション指示を送信する。 That is, after that, when receiving the data update request from the SIP server 5, the update unit 111 of the master DB server 11 updates the master DB 113 (S22). Then, the replication unit 112 performs replication to the shared DB server 4 (S23). That is, in order to reflect the update contents to the master DB 113 in the shared DB 45 (second slave DB for family # 1), a replication instruction including the update contents is transmitted.

これにより、共用ＤＢサーバ４のレプリケーション部４２は、マスタＤＢサーバ１１から送信された更新内容に基づいて、共用ＤＢ４５を更新する。このような処理により、マスタＤＢ１１３と、第２スレーブＤＢである共用ＤＢ４５とが同期される。 Thereby, the replication unit 42 of the shared DB server 4 updates the shared DB 45 based on the update content transmitted from the master DB server 11. By such processing, the master DB 113 and the shared DB 45 that is the second slave DB are synchronized.

図７は、本実施形態の２重故障時（両系故障時）の動作を説明するための図である。図示する例では、ファミリー＃１のマスタＤＢサーバ１１および第１スレーブＤＢサーバ１２が故障した場合を例に説明する。 FIG. 7 is a diagram for explaining the operation at the time of double failure (during both system failures) according to the present embodiment. In the illustrated example, a case where the master DB server 11 and the first slave DB server 12 of the family # 1 have failed will be described as an example.

マスタＤＢサーバ１１および第１スレーブＤＢサーバ１２がともに故障する（Ｓ３１、Ｓ３２）。これにより、共用ＤＢサーバ４には、所定の時間、マスタＤＢサーバ１１からのレプリケーション指示が送信されない状態となるため、共用ＤＢサーバ４のレプリケーション要求部４３は、マスタＤＢサーバ１１にレプリケーション要求を送信する（Ｓ３３）。 Both the master DB server 11 and the first slave DB server 12 fail (S31, S32). As a result, since the replication instruction from the master DB server 11 is not transmitted to the shared DB server 4 for a predetermined time, the replication request unit 43 of the shared DB server 4 transmits a replication request to the master DB server 11. (S33).

この場合、マスタＤＢサーバ１１および第１スレーブＤＢサーバ１２が故障しているため、Ｓ３３のレプリケーション要求に対する応答が無いため、レプリケーション要求部４３は、マスタＤＢサーバ１１および第１スレーブＤＢサーバ１２の両系が故障中であることを検知する。 In this case, since the master DB server 11 and the first slave DB server 12 are out of order and there is no response to the replication request in S33, the replication request unit 43 is connected to both the master DB server 11 and the first slave DB server 12. Detect that the system is in failure.

故障を検知した場合、マスタ切替部４４は、当該共用ＤＢサーバ４をファミリー＃１のマスタＤＢサーバ１１として動作するように切り替える（Ｓ３４）。すなわち、マスタＤＢサーバへ格上げするために、宛先解決部４０および更新部４１を起動する。 When a failure is detected, the master switching unit 44 switches the shared DB server 4 to operate as the master DB server 11 of the family # 1 (S34). That is, in order to upgrade to the master DB server, the destination resolution unit 40 and the update unit 41 are activated.

また、マスタ切替部４４は、他のファミリー＃２、＃３のマスタＤＢサーバ２１、３１に、当該共用ＤＢサーバ４がファミリー＃１のマスタＤＢサーバとして動作することを通知し、他のファミリー＃２、＃３から共用ＤＢサーバ４に対するアクセスを禁止する（Ｓ３５）。他のファミリー＃２、＃３のマスタＤＢサーバ２１、３１は、共用ＤＢサーバ４からの当該通知を受信すると、共用ＤＢサーバ４（第２スレーブＤＢ）へのレプリケーションを停止する。これにより、共用ＤＢサーバ４が、マスタＤＢサーバとして動作することによるリソースの枯渇を回避することができる。 In addition, the master switching unit 44 notifies the master DB servers 21 and 31 of the other families # 2 and # 3 that the shared DB server 4 operates as the master DB server of the family # 1. 2, access to the shared DB server 4 from # 3 is prohibited (S35). When the master DB servers 21 and 31 of the other families # 2 and # 3 receive the notification from the shared DB server 4, the replication to the shared DB server 4 (second slave DB) is stopped. Thereby, it is possible to avoid resource depletion due to the shared DB server 4 operating as a master DB server.

また、マスタ切替部４４は、対応するＳＩＰサーバ５に当該共用ＤＢサーバ４がファミリー＃１のマスタＤＢサーバとして動作することを通知する。これにより、ＳＩＰサーバ５は、宛先解決要求およびデータ更新要求を共用ＤＢサーバ４に送信する。宛先解決部４０は、ＳＩＰサーバ５からの宛先解決要求を受信し、共用ＤＢ４５に格納されたファミリー＃１用のデータを参照し、当該宛先解決要求で指定された電話番号に関する情報を読み出し、ＳＩＰサーバ５に送信する。また、更新部４１は、ＳＩＰサーバ５からのデータ更新要求を受信すると、共用ＤＢ４５のファミリー＃１用のデータを更新する（Ｓ３６）。 In addition, the master switching unit 44 notifies the corresponding SIP server 5 that the shared DB server 4 operates as a master DB server of family # 1. As a result, the SIP server 5 transmits a destination resolution request and a data update request to the shared DB server 4. The destination resolution unit 40 receives the destination resolution request from the SIP server 5, refers to the data for family # 1 stored in the shared DB 45, reads out information related to the telephone number specified in the destination resolution request, Send to server 5. Further, when receiving the data update request from the SIP server 5, the updating unit 41 updates the data for family # 1 in the shared DB 45 (S36).

なお、２重故障の復旧後は、共用ＤＢサーバ４のマスタ切替部４４は、自身をマスタＤＢサーバから共用ＤＢサーバに切り替え、宛先解決部４０および更新部４１を停止する。また、マスタ切替部４４は、アクセスを禁止していた他のファミリー＃２、＃３に、自身が共用ＤＢサーバ４としての動作を再開したこと（アクセスの再開）を通知する。これにより、他のファミリー＃２、＃３のマスタＤＢサーバ２１、３１は、アクセスが禁止されていた期間のレプリケーションを共用ＤＢサーバ４に対して行い、マスタＤＢと共用ＤＢ４５（第２スレーブＤＢ）とを同期させる。 After the double failure is recovered, the master switching unit 44 of the shared DB server 4 switches itself from the master DB server to the shared DB server, and stops the destination resolution unit 40 and the update unit 41. Further, the master switching unit 44 notifies the other families # 2 and # 3 that have been prohibited from accessing that the operation as the shared DB server 4 has been resumed (resumption of access). As a result, the master DB servers 21 and 31 of the other families # 2 and # 3 perform replication during the period during which access is prohibited to the shared DB server 4, and the master DB and the shared DB 45 (second slave DB). Synchronize with.

以上説明した本実施形態では、１台のＤＢサーバ４を複数ファミリーの共用第２スレーブサーバとして利用させることにより、２重化（マスタ／スレーブ）の冗長構成をとっているＤＢサーバシステムにおいて、冗長構成の効率化を実現することができる。すなわち、冗長化システムの構築費用を削減することができるとともに、マスタ/スレーブの２重故障が発生した場合であっても、処理を停止させることなく運用することができる。 In the present embodiment described above, redundancy is achieved in a DB server system having a redundant configuration (master / slave) by using one DB server 4 as a shared second slave server of a plurality of families. The efficiency of the configuration can be realized. That is, the construction cost of the redundant system can be reduced, and even if a master / slave double failure occurs, it can be operated without stopping the processing.

また、本実施形態では、常時、各ファミリーのマスタＤＢサーバは、共用ＤＢサーバ４へのレプリケーションを行い、マスタＤＢと共用ＤＢとを同期させている。このため、本実施形態では、故障発生後に、マスタＤＢのデータを共用ＤＢにレプリケーションする処理が不要であり、迅速に、共用ＤＢサーバ４をマスタＤＢサーバに切り替えることができる。すなわち、常時行われているマスタＤＢサーバからのレプリケーションによりバックアップを取得しているため、故障時の組み込み設定が不要である。 Moreover, in this embodiment, the master DB server of each family always performs replication to the shared DB server 4 to synchronize the master DB and the shared DB. For this reason, in this embodiment, the process of replicating the data in the master DB to the shared DB is not required after a failure occurs, and the shared DB server 4 can be quickly switched to the master DB server. In other words, since the backup is acquired by the replication from the master DB server that is always performed, there is no need for the built-in setting at the time of failure.

また、本実施形態では、マスタ／スレーブの２重故障発生時においても、共用ＤＢサーバ４をマスタＤＢサーバに格上げすることにより、アプリケーション処理を中断させることなく安定した運用を継続することができる。 Further, in the present embodiment, even when a master / slave double failure occurs, stable operation can be continued without interrupting application processing by upgrading the shared DB server 4 to a master DB server.

なお、本発明は上記実施形態に限定されるものではなく、特許請求の範囲内において、種々変更・応用が可能である。 In addition, this invention is not limited to the said embodiment, A various change and application are possible within a claim.

１１、２１、３１：マスタＤＢサーバ
１２、２２、３２：第１スレーブＤＢサーバ
１１０、１２０：宛先解決部
１１１、１２１：更新部
１１２、１２２：レプリケーション部
１１３：マスタＤＢ
１２３：第１スレーブＤＢ
４：共用ＤＢサーバ
４０：宛先解決部
４１：更新部
４２：レプリケーション部
４３：レプリケーション要求部
４４：マスタ切替部
４５：共用ＤＢ 11, 21, 31: Master DB server 12, 22, 32: First slave DB server 110, 120: Destination resolution unit 111, 121: Update unit 112, 122: Replication unit 113: Master DB
123: First slave DB
4: Shared DB server 40: Destination resolution unit 41: Update unit 42: Replication unit 43: Replication request unit 44: Master switching unit 45: Shared DB

Claims

A redundant system,
A plurality of redundant subsystems and a shared DB server;
Each redundant subsystem includes a master DB server and a first slave DB server,
The shared DB server is a second slave DB server shared by the plurality of redundant subsystems;
Master DB server of each redundant subsystem, if the update to the master DB of the master DB server comprises occurs, have rows replication to the first slave DB server and the shared DB server,
When the master DB server and the first slave DB server of the redundant subsystem fail, the shared DB server operates as the master DB server of the redundant subsystem and the shared DB from other redundant subsystems. A redundant system characterized by stopping replication to the server .

The redundant system according to claim 1 , wherein
When the first slave DB server of the redundant subsystem fails, the master DB server of the redundant subsystem replicates only to the shared DB server.

The redundancy system according to claim 1 or 2 ,
The redundant system, wherein the shared DB server transmits a replication request to the master DB server when there is no replication from the master DB server for a predetermined time.

A replication method performed by a redundant system including a plurality of redundant subsystems and a shared DB server,
Each redundant subsystem includes a master DB server and a first slave DB server,
The shared DB server is a second slave DB server shared by each redundant subsystem,
The master DB server of each redundant subsystem is
An update step for updating the master DB server ;
And replication step of replicating the first slave DB server and the shared DB server, gastric row,
When the master DB server and the first slave DB server of the redundant subsystem fail, the shared DB server
A switching step of switching the redundant subsystem as a master DB server;
And a stopping step of stopping replication from the other redundant subsystem to the shared DB server .

The replication method according to claim 4 , wherein
In the replication method, when the first slave DB server fails, replication is performed only on the shared DB server.

The replication method according to claim 4 or 5 , wherein
The shared DB server further includes a request step of transmitting a replication request to the master DB server when there is no replication from the master DB server for a predetermined time.