JP2020194217A

JP2020194217A - Redundant system and data synchronization method

Info

Publication number: JP2020194217A
Application number: JP2019097811A
Authority: JP
Inventors: 庸平新海; Yohei Shinkai
Original assignee: Azbil Corp
Current assignee: Azbil Corp
Priority date: 2019-05-24
Filing date: 2019-05-24
Publication date: 2020-12-03
Anticipated expiration: 2039-05-24
Also published as: JP7422492B2

Abstract

To maintain data integrity while avoiding a problem caused by synchronization of abnormal data among a plurality of controllers when recovering from split brain.SOLUTION: A redundant system 1 is composed of two controllers 10A and 10B. The controller 10A or 10B includes: a determination unit 31A or 31B for determining whether abnormality has occurred in a database 25A or 25B when its own device is an active system; a detection unit 32A or 32B for detecting whether two or more controllers are active systems when its own device is an active system; a starting unit 33A or 33B for rebooting the own device and making it a standby system when the detection unit detects that two or more controllers are active systems and the determination unit determines that abnormality has occurred in the database; and a synchronization unit 35A or 35B that synchronizes data of the own device with a standby controller when the detection unit detects that two or more controllers are not active systems and the determination unit determines that abnormality has not occurred in the database.SELECTED DRAWING: Figure 3

Description

この発明は、冗長システムを構成する複数のコントローラ間でデータを同期させる技術に関する。 The present invention relates to a technique for synchronizing data between a plurality of controllers constituting a redundant system.

従来、プラントなどのプロセス制御に用いるプロセス制御システムにおいて、高信頼性を要求される場合には、複数のコントローラを設ける冗長構造をもつ構成が用いられている。このように、コントローラを多重化してシステムを冗長させることで、システムはより安定した運用を行える。このような冗長構造をもつ従来のプラント制御システムは、例えば、現用系として稼働する第１のコントローラと、待機系として稼働する第２のコントローラを備え、第１のコントローラに異常が発生すると、待機系だった第２のコントローラを現用系に切り替える。 Conventionally, in a process control system used for process control of a plant or the like, when high reliability is required, a configuration having a redundant structure in which a plurality of controllers are provided has been used. In this way, by multiplexing the controllers and making the system redundant, the system can operate more stably. A conventional plant control system having such a redundant structure includes, for example, a first controller that operates as an active system and a second controller that operates as a standby system, and when an abnormality occurs in the first controller, it stands by. Switch the second controller, which was a system, to the current system.

ところで、このような冗長システムにおいて、現用系のノードが複数存在する状態を「スプリットブレイン」という。一般的に、冗長システムでは、現用系のノードと待機系のノードとを繋ぐネットワークが障害等により遮断され、現用系のノードと待機系のノードとが独立して動作する場合にスプリットブレインとなる。 By the way, in such a redundant system, a state in which a plurality of active nodes exist is called "split brain". Generally, in a redundant system, a split brain occurs when the network connecting the active node and the standby node is interrupted due to a failure or the like, and the active node and the standby node operate independently. ..

スプリットブレインでは、現用系のノードと待機系のノードとが独立して動作するため、それぞれのノードで異なるデータを保持することとなる。したがって、この状態でシステムがスプリットブレインから復旧すると、現用系のノードと待機系のノードとの間でデータの整合性が取れない状態となる。そこで、現用系のノードと待機系のノードとの間でデータの差異が生じないように、システムをスプリットブレインから復旧させる技術が提案されている。 In split brain, the active node and the standby node operate independently, so that each node holds different data. Therefore, if the system recovers from split brain in this state, the data will not be consistent between the active node and the standby node. Therefore, a technique for recovering the system from split brain has been proposed so that there is no data difference between the active node and the standby node.

例えば、特許文献１に記載された遠隔クラスタシステムでは、スプリットブレインから復旧する際、現用系のノード（プライマリサーバ）が、通信遮断中に自サーバで更新された差分データを待機系のノード（セカンダリサーバ）に送信する。セカンダリサーバは、プライマリサーバから受信した差分データと、通信遮断中に自サーバで更新された差分データとをマージして、プライマリサーバに送信する送信データと、自サーバに反映させる反映データとを作成する。そして、セカンダリサーバは、反映データを自サーバに反映させるとともに、送信データをプライマリサーバに送信する。プライマリサーバは、セカンダリサーバから送られた送信データを自サーバに反映させる。この遠隔クラスタシステムは、このようにして、双方のサーバで保持するデータに差異が生じないようにシステムをスプリットブレインから復旧させる。 For example, in the remote cluster system described in Patent Document 1, when recovering from split brain, the active node (primary server) receives the difference data updated by the local server during communication interruption to the standby node (secondary). Send to server). The secondary server merges the difference data received from the primary server with the difference data updated on the local server during communication interruption, and creates the transmission data to be sent to the primary server and the reflection data to be reflected on the local server. To do. Then, the secondary server reflects the reflected data on the own server and transmits the transmitted data to the primary server. The primary server reflects the transmission data sent from the secondary server on its own server. This remote cluster system thus recovers the system from split brain so that there is no difference in the data held by both servers.

特開２００６−１４６２９９号公報Japanese Unexamined Patent Publication No. 2006-146299

ところで、スプリットブレインは、ビル等の施設内に設置されている複数の監視対象機器を監視する施設監視システムにおいても発生することがある。例えば、施設監視システムは、一般的に現用系と待機系の２台のコントローラによる二重化構成とされるが、双方のコントローラを繋ぐネットワークが障害等により遮断するとスプリットブレインとなる。この場合、現用系のコントローラと待機系のコントローラとは、独立して動作するため、それぞれのコントローラで異なるデータを保持することとなる。ところが、従来の施設監視システムでは、スプリットブレインから復旧するとき、両コントローラの間でデータの整合性を取るための処理は行われておらず、改善が求められていた。 By the way, split brain may also occur in a facility monitoring system that monitors a plurality of monitored devices installed in a facility such as a building. For example, a facility monitoring system generally has a dual configuration consisting of two controllers, an active system and a standby system, but if the network connecting both controllers is interrupted due to a failure or the like, a split brain occurs. In this case, since the active controller and the standby controller operate independently, each controller holds different data. However, in the conventional facility monitoring system, when recovering from split brain, processing for ensuring data consistency between both controllers is not performed, and improvement is required.

また、特許文献１に記載の遠隔クラスタシステムでは、プライマリサーバとセカンダリサーバのそれぞれが保持するデータに異常があるかどうかの確認は行っていない。したがって、仮にプライマリサーバの保持するデータに何らかの異常があった場合、当該異常のあるデータがセカンダリサーバに同期されて、システムがダウンするなどの不具合が発生するおそれがあった。 Further, in the remote cluster system described in Patent Document 1, it is not confirmed whether or not there is an abnormality in the data held by each of the primary server and the secondary server. Therefore, if there is any abnormality in the data held by the primary server, the abnormal data may be synchronized with the secondary server, causing a problem such as the system going down.

この発明は、上記のような課題を解決するためになされたもので、スプリットブレインからの復旧時に、複数のコントローラ間で異常なデータが同期されることによる不具合の発生を回避しつつデータの整合性を保つことを目的としている。 The present invention has been made to solve the above-mentioned problems, and data matching while avoiding the occurrence of a problem due to abnormal data synchronization between a plurality of controllers when recovering from split brain. The purpose is to maintain sex.

この発明に係る冗長システムは、複数のコントローラを含んで構成され、各コントローラは、自機が現用系の場合に、データベースに異常が発生しているか否かを判断する判断部と、自機が現用系の場合に、２台以上のコントローラが現用系となったか否かを検知する検知部と、検知部が、２台以上のコントローラが現用系となったことを検知し、かつ、判断部が、データベースに異常が発生していると判断した場合に、自機を再起動して待機系とする起動部と、検知部が、２台以上のコントローラが現用系となっていないことを検知し、かつ、判断部が、データベースに異常が発生していないと判断した場合に、自機のデータを待機系のコントローラに同期させる同期部と、を備えたことを特徴とする。 The redundant system according to the present invention is configured to include a plurality of controllers, and each controller has a determination unit for determining whether or not an abnormality has occurred in the database when the own machine is an active system, and the own machine. In the case of an active system, a detection unit that detects whether or not two or more controllers have become active systems, and a detection unit that detects that two or more controllers have become active systems and a judgment unit. However, when it is determined that an error has occurred in the database, the startup unit that restarts the machine and makes it a standby system and the detection unit detect that two or more controllers are not in the active system. In addition, the determination unit is provided with a synchronization unit that synchronizes the data of the own machine with the controller of the standby system when it is determined that no abnormality has occurred in the database.

この発明によれば、上記のように構成したので、スプリットブレインからの復旧時に、複数のコントローラ間で異常なデータが同期されることによる不具合の発生を回避しつつデータの整合性を保つことができる。 According to the present invention, since it is configured as described above, it is possible to maintain data consistency while avoiding the occurrence of a problem due to abnormal data synchronization between a plurality of controllers when recovering from split brain. it can.

実施の形態１に係る冗長システム（二重化システム）の構成例を示す図である。It is a figure which shows the configuration example of the redundant system (redundant system) which concerns on Embodiment 1. FIG. 実施の形態１におけるコントローラのＳＳＤに記憶されるデータのイメージを示す図である。It is a figure which shows the image of the data stored in the SSD of the controller in Embodiment 1. FIG. 図３Ａは、実施の形態１における現用系のコントローラの構成例を示す図であり、図３Ｂは、実施の形態１における待機系のコントローラの構成例を示す図である。FIG. 3A is a diagram showing a configuration example of the active system controller according to the first embodiment, and FIG. 3B is a diagram showing a configuration example of the standby system controller according to the first embodiment. 実施の形態１に係る二重化システムにおいて、スプリットブレインが発生した場合の動作例を示すフローチャートである。It is a flowchart which shows the operation example when the split brain occurs in the duplex system which concerns on Embodiment 1. 実施の形態１に係る二重化システムにおいて、スプリットブレインが発生した場合の動作例を示すタイミングチャートである。It is a timing chart which shows the operation example when the split brain occurs in the duplex system which concerns on Embodiment 1. FIG. 実施の形態１におけるコントローラが自機のデータベースに異常が発生しているか否かを判断する処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process which the controller in Embodiment 1 determines whether or not an abnormality has occurred in the database of own machine.

以下、この発明の実施の形態について図面を参照しながら詳細に説明する。
実施の形態１．
図１は、この発明の実施の形態１に係る冗長システムの構成例を示している。冗長システム１は、複数（ここでは２台）のコントローラ１０（１０Ａ、１０Ｂ）と、複数の監視ポイント２００と、不図示の監視装置とを含んで構成されている。コントローラ１０Ａ、１０Ｂと、各監視ポイント２００と、監視装置とは、システムバス１５０を介して相互に通信可能に接続されている。ここでは、コントローラが２台で構成されているため、冗長システム１を二重化システム１として説明する。しかしながら、コントローラの台数は必ずしも２台でなくともよく、いずれか１台のコントローラが現用系（アクティブ）として稼働できる構成であれば、３台以上であってもよい。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
Embodiment 1.
FIG. 1 shows a configuration example of a redundant system according to the first embodiment of the present invention. The redundant system 1 is configured to include a plurality of (here, two) controllers 10 (10A, 10B), a plurality of monitoring points 200, and a monitoring device (not shown). The controllers 10A and 10B, each monitoring point 200, and the monitoring device are connected to each other via the system bus 150 so as to be able to communicate with each other. Here, since the controller is composed of two controllers, the redundant system 1 will be described as the redundant system 1. However, the number of controllers does not necessarily have to be two, and may be three or more as long as any one of the controllers can operate as an active system (active).

二重化システム１は、二重化機能を備えている。二重化機能は、２台のコントローラ１０Ａ、１０Ｂで監視ポイント２００の監視、制御を行う機能である。すなわち、二重化システム１は、通常時はコントローラ１０Ａを現用系、コントローラ１０Ｂを待機系（スタンバイ）として稼働させ、現用系のコントローラ１０Ａが監視ポイント２００の監視、制御、及びデータの収集等の処理を行う。コントローラ１０Ａは、データベース２５Ａが構築されたＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）２０Ａを備えており、コントローラ１０Ａは、監視ポイント２００から収集したデータを、ＳＳＤ２０Ａ内のデータベース２５Ａに格納する。 The duplication system 1 has a duplication function. The duplication function is a function of monitoring and controlling the monitoring point 200 with two controllers 10A and 10B. That is, in the duplex system 1, the controller 10A is normally operated as an active system and the controller 10B is operated as a standby system (standby), and the active controller 10A performs processing such as monitoring, control, and data collection of the monitoring point 200. Do. The controller 10A includes an SSD (Solid State Drive) 20A in which the database 25A is constructed, and the controller 10A stores the data collected from the monitoring point 200 in the database 25A in the SSD 20A.

また、二重化機能は、データベースの同期も管理する。例えば、二重化システム１では、現用系のコントローラ１０Ａが起動した状態で、待機系のコントローラ１０Ｂが起動すると、待機系のコントローラ１０Ｂは、現用系のコントローラ１０Ａに対し、データベース２５Ａのコピーを要求する。この要求を受け付けたコントローラ１０Ａは、コントローラ１０Ｂに対し、データベース２５Ａのコピーを送信する。コントローラ１０Ｂは、コントローラ１０Ａからデータベース２５Ａのコピーを受信すると、当該コピーを用いて、自機が備えるＳＳＤ２０Ｂ内にデータベース２５Ｂを構築する。 The duplication feature also manages database synchronization. For example, in the duplex system 1, when the standby system controller 10B is activated while the active system controller 10A is activated, the standby system controller 10B requests the active system controller 10A to copy the database 25A. The controller 10A that has received this request transmits a copy of the database 25A to the controller 10B. When the controller 10B receives a copy of the database 25A from the controller 10A, the controller 10B constructs the database 25B in the SSD 20B provided in the own machine by using the copy.

その後、コントローラ１０Ａは、データベース２５Ａの更新を完了する度に、当該データベース２５Ａを更新したデータのコピーをコントローラ１０Ｂに送信する。コントローラ１０Ｂは、コントローラ１０Ａから受信した更新データのコピーをデータベース２５Ｂに書き込み、データベース２５Ｂの更新を完了する。このようにして、二重化システム１は、データベース２５Ａ、２５Ｂの同期を行う。 After that, each time the controller 10A completes the update of the database 25A, the controller 10A transmits a copy of the updated data of the database 25A to the controller 10B. The controller 10B writes a copy of the update data received from the controller 10A to the database 25B, and completes the update of the database 25B. In this way, the duplication system 1 synchronizes the databases 25A and 25B.

また、二重化システム１は、現用系のコントローラ１０Ａに障害などの問題が発生した場合、待機系のコントローラ１０Ｂを現用系に切り替える。現用系に切り替わったコントローラ１０Ｂは、監視ポイント２００の監視、制御、及びデータの収集等の処理を継続し、監視ポイント２００から収集したデータを、ＳＳＤ２０Ｂ内のデータベース２５Ｂに格納する。 Further, the duplex system 1 switches the standby system controller 10B to the active system when a problem such as a failure occurs in the active system controller 10A. The controller 10B switched to the active system continues processing such as monitoring, control, and data collection of the monitoring point 200, and stores the data collected from the monitoring point 200 in the database 25B in the SSD 20B.

なお、実施の形態１では、コントローラ１０Ａ、１０Ｂは、図２に示すように、データベース２５Ａ、２５Ｂに格納されるデータのほかに、データベース２５Ａ、２５Ｂには格納されないが他方のコントローラとの間で同期するデータ２６Ａ、２６Ｂを、ＳＳＤ２０Ａ、ＳＳＤ２０Ｂに記憶している。 In the first embodiment, as shown in FIG. 2, the controllers 10A and 10B are not stored in the databases 25A and 25B but are connected to the other controller in addition to the data stored in the databases 25A and 25B. The data 26A and 26B to be synchronized are stored in the SSD20A and SSD20B.

＜コントローラ１０Ａ、１０Ｂの構成例＞
次に、コントローラ１０Ａ、１０Ｂの構成例について、図３を参照して説明する。図３Ａは、コントローラ１０Ａの構成例を示し、図３Ｂは、コントローラ１０Ｂの構成例を示している。 <Configuration example of controllers 10A and 10B>
Next, a configuration example of the controllers 10A and 10B will be described with reference to FIG. FIG. 3A shows a configuration example of the controller 10A, and FIG. 3B shows a configuration example of the controller 10B.

図３Ａに示すように、コントローラ１０Ａは、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）２０Ａ、及びＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）３０Ａを備えている。ＳＳＤ２０Ａには、前述したデータベース２５Ａが構築されている。 As shown in FIG. 3A, the controller 10A includes an SSD (Solid State Drive) 20A and a CPU (Central Processing Unit) 30A. The database 25A described above is constructed in the SSD 20A.

ＣＰＵ３０Ａは、予めＳＳＤ２０Ａに記憶されている所定のプログラムを実行することにより、判断部３１Ａ、検知部３２Ａ、起動部３３Ａ、通知部３４Ａ、及び同期部３５Ａとして機能する。 The CPU 30A functions as a determination unit 31A, a detection unit 32A, an activation unit 33A, a notification unit 34A, and a synchronization unit 35A by executing a predetermined program stored in the SSD 20A in advance.

判断部３１Ａは、自機（コントローラ１０Ａ）が現用系の場合に、自機のデータベース２５Ａに異常が発生しているか否かを判断する。例えば、判断部３１Ａは、自機のデータベース２５Ａに対して所定の信号（例えばＰｉｎｇ）を送信し、所定時間（例えば１分）以内に応答があればデータベース２５Ａに接続できた（正常）と判断する。一方、判断部３１Ａは、所定時間以内に応答がなければ、データベース２５Ａに異常が発生したと判断する。判断部３１Ａは、少なくともコントローラ１０Ａが現用系として起動するときに、データベース２５ＡにＰｉｎｇを送信してデータベース２５Ａの異常判断を実行する。 The determination unit 31A determines whether or not an abnormality has occurred in the database 25A of the own machine when the own machine (controller 10A) is an active system. For example, the determination unit 31A transmits a predetermined signal (for example, Ping) to the database 25A of its own machine, and if there is a response within a predetermined time (for example, 1 minute), it is determined that the database 25A can be connected (normal). To do. On the other hand, if there is no response within the predetermined time, the determination unit 31A determines that an abnormality has occurred in the database 25A. The determination unit 31A transmits a ping to the database 25A and executes an abnormality determination of the database 25A at least when the controller 10A is started as an active system.

検知部３２Ａは、自機（コントローラ１０Ａ）が現用系の場合に、二重化システム１を構成する２台のコントローラの双方が現用系となったか否か、すなわちスプリットブレインとなったか否かを検知する。例えば、検知部３２Ａは、コントローラ１０Ｂと信号を送受信し合うことで互いの稼働状態を随時監視し、自機（コントローラ１０Ａ）が現用系として稼働している際に、コントローラ１０Ｂが待機系から現用系に切り替わった場合に、スプリットブレインを検知する。検知部３２Ａがスプリットブレインを検知する具体例については後述する。 When the own machine (controller 10A) is a working system, the detection unit 32A detects whether or not both of the two controllers constituting the duplex system 1 are working systems, that is, whether or not they are split brains. .. For example, the detection unit 32A monitors each other's operating status at any time by transmitting and receiving signals to and from the controller 10B, and when the own machine (controller 10A) is operating as the active system, the controller 10B is in use from the standby system. Detects split brain when switching to the system. A specific example in which the detection unit 32A detects split brain will be described later.

起動部３３Ａは、前述の検知部３２Ａが、２台のコントローラの双方が現用系となったこと（スプリットブレイン）を検知し、かつ、前述の判断部３１Ａが自機（コントローラ１０Ａ）のデータベース２５Ａに異常が発生していると判断した場合に、自機を再起動して現用系から待機系に切り替える。 In the activation unit 33A, the above-mentioned detection unit 32A detects that both of the two controllers have become active systems (split brain), and the above-mentioned determination unit 31A is the database 25A of the own machine (controller 10A). If it is determined that an abnormality has occurred in the system, restart the own machine and switch from the active system to the standby system.

また、起動部３３Ａは、検知部３２Ａにより、２台のコントローラの双方が現用系となったことを検知し、かつ、判断部３１Ａにより、データベース２５Ａに異常が発生していないと判断した場合であっても、コントローラ１０Ｂの後述する通知部３４Ｂによる通知を受信し、当該通知元のコントローラ１０Ｂとの間で自機の優先度が低い場合には、自機を再起動して待機系とする。 Further, the activation unit 33A detects that both of the two controllers have become active systems by the detection unit 32A, and determines that no abnormality has occurred in the database 25A by the determination unit 31A. Even if there is, if the notification by the notification unit 34B described later of the controller 10B is received and the priority of the own machine is low with the controller 10B of the notification source, the own machine is restarted to be a standby system. ..

なお、優先度とは、どのコントローラを現用系として優先的に稼働させるかを示す度合いを示すものであり、優先度が他のコントローラより低い場合、そのコントローラは現用系から待機系に切り替わる。優先度は、ユーザ（管理者）が各コントローラに予め設定しておけばよい。 The priority indicates the degree of which controller is preferentially operated as the active system, and when the priority is lower than that of other controllers, the controller is switched from the active system to the standby system. The priority may be set in advance by the user (administrator) in each controller.

通知部３４Ａは、判断部３１Ａによりデータベース２５Ａに異常が発生していないと判断した場合に、異常が発生していない旨を他のコントローラ（ここではコントローラ１０Ｂ）に通知する。 When the determination unit 31A determines that an abnormality has not occurred in the database 25A, the notification unit 34A notifies another controller (here, the controller 10B) that no abnormality has occurred.

同期部３５Ａは、判断部３１Ａにより、自機のデータベース２５Ａに異常が発生していないと判断し、かつ、検知部３２Ａにより、２台以上のコントローラが現用系となっていないことを検知した場合に、自機が保持するデータを待機系のコントローラに同期させる。 When the synchronization unit 35A determines that an abnormality has not occurred in the database 25A of its own machine by the determination unit 31A, and detects that two or more controllers are not in the active system by the detection unit 32A. In addition, the data held by the own machine is synchronized with the standby controller.

次に、コントローラ１０Ｂの構成例について、図３Ｂを参照して説明する。図３Ｂに示すように、コントローラ１０Ｂは、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）２０Ｂ、及びＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）３０Ｂを備えている。ＳＳＤ２０Ｂには、前述したデータベース２５Ｂが構築されている。 Next, a configuration example of the controller 10B will be described with reference to FIG. 3B. As shown in FIG. 3B, the controller 10B includes an SSD (Solid State Drive) 20B and a CPU (Central Processing Unit) 30B. The database 25B described above is constructed in the SSD 20B.

ＣＰＵ３０Ｂは、予めＳＳＤ２０Ｂに記憶されている所定のプログラムを実行することにより、判断部３１Ｂ、検知部３２Ｂ、起動部３３Ｂ、通知部３４Ｂ、及び同期部３５Ｂとして機能する。なお、判断部３１Ｂ、検知部３２Ｂ、起動部３３Ｂ、通知部３４Ｂ、及び同期部３５Ｂの各部の機能は、前述した判断部３１Ａ、検知部３２Ａ、起動部３３Ａ、通知部３４Ａ、及び同期部３５Ａと基本的に同じであるため、ここではその説明を省略する。 The CPU 30B functions as a determination unit 31B, a detection unit 32B, an activation unit 33B, a notification unit 34B, and a synchronization unit 35B by executing a predetermined program stored in the SSD 20B in advance. The functions of the determination unit 31B, the detection unit 32B, the activation unit 33B, the notification unit 34B, and the synchronization unit 35B are the above-mentioned determination unit 31A, detection unit 32A, activation unit 33A, notification unit 34A, and synchronization unit 35A. Since it is basically the same as, the description thereof is omitted here.

＜二重化システム１の動作例＞
次に、実施の形態１に係る二重化システム１において、スプリットブレインが発生した場合の動作例について、図４のフローチャート、図５のタイミングチャートに基づき説明する。ここでは、具体例として、コントローラ１０Ａの電源がＯＦＦであり、コントローラ１０Ｂが現用系として稼働している状態からの動作例を説明する。また、ここでは、現用系として稼働しているコントローラ１０Ｂのデータベース２５Ｂは、判断部３１Ｂにより既に正常であると判断されているものとする。 <Operation example of duplex system 1>
Next, in the duplex system 1 according to the first embodiment, an operation example when a split brain occurs will be described with reference to the flowchart of FIG. 4 and the timing chart of FIG. Here, as a specific example, an operation example will be described from a state in which the power supply of the controller 10A is OFF and the controller 10B is operating as an active system. Further, here, it is assumed that the database 25B of the controller 10B operating as the active system has already been determined to be normal by the determination unit 31B.

また、ここでは、図４において、コントローラ１０Ａを単に「Ａ系」と記載し、コントローラ１０Ｂを単に「Ｂ系」と記載する。また、図５において、かっこ付きの数字は図４のフローチャートにおけるステップ番号に対応し、「Ａ」はコントローラが現用系であることを示し、「Ｓ」はコントローラが待機系であることを示す。 Further, here, in FIG. 4, the controller 10A is simply referred to as “A system”, and the controller 10B is simply referred to as “B system”. Further, in FIG. 5, the numbers in parentheses correspond to the step numbers in the flowchart of FIG. 4, “A” indicates that the controller is the active system, and “S” indicates that the controller is the standby system.

まず、二重化システム１は、コントローラ１０Ａを待機系として起動する（ステップＳＴ１１）。 First, the duplex system 1 starts the controller 10A as a standby system (step ST11).

次に、コントローラ１０Ａは、現用系として稼働しているコントローラ１０Ｂに対し、コントローラ１０Ｂが保持しているデータ（データベース２５Ｂ、及び同期するデータ２６Ｂ（不図示））のコピーを要求する。この要求を受け付けたコントローラ１０Ｂは、自機の保持するデータのコピーをコントローラ１０Ａに送信し、コントローラ１０Ｂの保持するデータをコントローラ１０Ａのデータに同期する（ステップＳＴ１２）。 Next, the controller 10A requests the controller 10B operating as an active system to copy the data (database 25B and the synchronized data 26B (not shown)) held by the controller 10B. Upon receiving this request, the controller 10B transmits a copy of the data held by the own machine to the controller 10A, and synchronizes the data held by the controller 10B with the data held by the controller 10A (step ST12).

ここで、この同期中に、コントローラ１０Ａとコントローラ１０Ｂとを接続するネットワークが障害等により切断したとする。すると、二重化システム１は、同期処理が中断され、コントローラ１０Ａのデータベース２５Ａには、同期途中の不完全なデータが残される（ステップＳＴ１３）。なお、図５において、データベース２５Ａの表示を一部破線としているのは、データベース２５Ａに不完全なデータが残されていることを示している。 Here, it is assumed that the network connecting the controller 10A and the controller 10B is disconnected due to a failure or the like during this synchronization. Then, the synchronization process is interrupted in the duplex system 1, and incomplete data in the middle of synchronization is left in the database 25A of the controller 10A (step ST13). In addition, in FIG. 5, the fact that the display of the database 25A is partially broken lines indicates that incomplete data is left in the database 25A.

次に、二重化システム１は、ネットワークが切断したことに伴い、コントローラ１０Ａを独立して動作させるべく、コントローラ１０Ａの電源を一旦ＯＦＦにし、コントローラ１０Ａを現用系として起動する（ステップＳＴ１４）。このとき、コントローラ１０Ａは、自機のデータベース２５Ａに異常が発生しているか否かを判断する。この場合の処理について、図６のフローチャートを参照して説明する。 Next, the duplex system 1 temporarily turns off the power of the controller 10A and starts the controller 10A as an active system in order to operate the controller 10A independently when the network is disconnected (step ST14). At this time, the controller 10A determines whether or not an abnormality has occurred in the database 25A of the own machine. The processing in this case will be described with reference to the flowchart of FIG.

図６は、コントローラ１０Ａが自機のデータベース２５Ａに異常が発生しているか否かを判断する処理の流れを示すフローチャートである。まず、コントローラ１０Ａは、起動を開始すると（ステップＳＴ４１）、データベース２５Ａの立ち上げを開始する（ステップＳＴ４２）。そして、コントローラ１０Ａは、判断部３１Ａにより、データベース２５Ａに対してＰｉｎｇを送信する（ステップＳＴ４３）。 FIG. 6 is a flowchart showing a flow of processing in which the controller 10A determines whether or not an abnormality has occurred in the database 25A of the own machine. First, when the controller 10A starts starting (step ST41), the controller 10A starts starting the database 25A (step ST42). Then, the controller 10A transmits a ping to the database 25A by the determination unit 31A (step ST43).

次に、判断部３１Ａは、データベース２５Ａから所定時間（例えば１分）以内に応答があるか否かを確認する（ステップＳＴ４４）。その結果、データベース２５Ａから所定時間以内に応答があれば（ステップＳＴ４４；ＹＥＳ）、判断部３１Ａは、データベース２５Ａが正常に立ち上がったと判断する（ステップＳＴ４５）。そして、コントローラ１０Ａは起動を正常に完了する（ステップＳＴ４６）。 Next, the determination unit 31A confirms whether or not there is a response from the database 25A within a predetermined time (for example, 1 minute) (step ST44). As a result, if there is a response from the database 25A within a predetermined time (step ST44; YES), the determination unit 31A determines that the database 25A has started up normally (step ST45). Then, the controller 10A normally completes the startup (step ST46).

一方、データベース２５Ａから所定時間以内に応答がなければ（ステップＳＴ４４；ＮＯ）、判断部３１Ａは、データベース２５Ａに異常が発生していると判断する（ステップＳＴ４７）。この場合、コントローラ１０Ａはエラー状態となって起動を停止する（ステップＳＴ４８）。 On the other hand, if there is no response from the database 25A within a predetermined time (step ST44; NO), the determination unit 31A determines that an abnormality has occurred in the database 25A (step ST47). In this case, the controller 10A becomes an error state and stops starting (step ST48).

図４、図５の例では、二重化システム１は、同期処理の途中でネットワークが切断したため、コントローラ１０Ａのデータベース２５Ａには、同期途中の不完全なデータが残されている。したがって、判断部３１Ａは、送信したＰｉｎｇに対するデータベース２５Ａからの応答を得ることができず、データベース２５Ａに異常が発生していると判断する。その結果、コントローラ１０Ａは、エラー状態となって起動を停止する（ステップＳＴ１５）。 In the examples of FIGS. 4 and 5, since the network of the duplex system 1 was disconnected during the synchronization process, incomplete data during the synchronization is left in the database 25A of the controller 10A. Therefore, the determination unit 31A cannot obtain a response from the database 25A to the transmitted Ping, and determines that an abnormality has occurred in the database 25A. As a result, the controller 10A becomes an error state and stops starting (step ST15).

その後、二重化システム１では、切断していたネットワークが復旧する（ステップＳＴ１６）。これにより、二重化システム１は、コントローラ１０Ａとコントローラ１０Ｂとの間の通信が再開する。 After that, in the duplex system 1, the disconnected network is restored (step ST16). As a result, the duplex system 1 resumes communication between the controller 10A and the controller 10B.

その後、二重化システム１では、コントローラ１０Ａの検知部３２Ａがスプリットブレインを検知する。同様に、コントローラ１０Ｂの検知部３２Ｂもスプリットブレインを検知する（ステップＳＴ１７）。 After that, in the duplex system 1, the detection unit 32A of the controller 10A detects the split brain. Similarly, the detection unit 32B of the controller 10B also detects split brain (step ST17).

次に、コントローラ１０Ａは、コントローラ１０Ｂに対し、通信により、自機のデータベース２５Ａに異常が発生していることを通知する。これにより、コントローラ１０Ｂは、コントローラ１０Ａのデータベース２５Ａに異常が発生していることを検知する（ステップＳＴ１８）。なお、このステップＳＴ１８の処理は、必ずしも実行されなくともよいが、コントローラ１０Ｂが確実にデータベース２５Ａの異常を検知するために実行されるのが望ましい。 Next, the controller 10A notifies the controller 10B that an abnormality has occurred in the database 25A of its own machine by communication. As a result, the controller 10B detects that an abnormality has occurred in the database 25A of the controller 10A (step ST18). The process of step ST18 does not necessarily have to be executed, but it is desirable that the controller 10B be executed in order to reliably detect an abnormality in the database 25A.

その後、コントローラ１０Ａは、起動部３３Ａにより、自機を待機系として再起動する（ステップＳＴ１９）。 After that, the controller 10A is restarted by the activation unit 33A with its own machine as a standby system (step ST19).

次いで、コントローラ１０Ｂは、コントローラ１０Ａが待機系となったため、検知部３２Ｂにより、２台のコントローラが現用系となっていないこと、すなわちスプリットブレインではないことを検知する（ステップＳＴ２０）。そして、コントローラ１０Ｂは、同期部３５Ｂにより、コントローラ１０Ｂの保持するデータ（データベース２５Ｂ、及び同期するデータ２６Ｂ）をコントローラ１０Ａの保持するデータ（データベース２５Ａ、及び同期するデータ２６Ａ）に上書きすることで、コントローラ１０Ｂの保持するデータを
コントローラ１０Ａに同期させる（ステップＳＴ２１）。これにより、二重化システム１は、コントローラ１０Ａとコントローラ１０Ｂとの間でデータの整合性が保たれ、正常に二重化された状態に復旧する。 Next, the controller 10B detects that the two controllers are not in the active system, that is, it is not a split brain, because the controller 10A has become a standby system (step ST20). Then, the controller 10B overwrites the data held by the controller 10B (database 25B and synchronized data 26B) with the data held by the controller 10A (database 25A and synchronized data 26A) by the synchronization unit 35B. The data held by the controller 10B is synchronized with the controller 10A (step ST21). As a result, the duplication system 1 maintains data integrity between the controller 10A and the controller 10B, and restores the duplication system to a normal duplication state.

なお、上記の例では、二重化システム１は、ステップＳＴ１４において、判断部３１Ａがデータベース２５Ａに異常が発生していると判断した後に、ステップＳＴ１７において、検知部３２Ａ、３２Ｂがスプリットブレインを検知した。しかしながら、二重化システム１は、検知部３２Ａ、３２Ｂによるスプリットブレインの検知を実行した後に、判断部３１Ａがデータベース２５Ａに異常が発生していると判断してもよく、あるいはこれらの処理を並行して実行してもよい。 In the above example, in step ST14, after the determination unit 31A determines that an abnormality has occurred in the database 25A in step ST14, the detection units 32A and 32B detect split brain in step ST17. However, in the duplex system 1, after the detection units 32A and 32B detect the split brain, the determination unit 31A may determine that an abnormality has occurred in the database 25A, or perform these processes in parallel. You may do it.

また、上記の例では、コントローラ１０Ａの電源がＯＦＦとなっており、コントローラ１０Ｂが現用系として稼働している状態からの動作例を説明した。しかしながら、これとは逆に、コントローラ１０Ａが現用系として稼働し、コントローラ１０Ｂの電源がＯＦＦである場合でも、各コントローラは上記と同様の流れでスプリットブレインからの復旧及びデータの同期を行えばよい。 Further, in the above example, an operation example from a state in which the power supply of the controller 10A is turned off and the controller 10B is operating as an active system has been described. However, on the contrary, even when the controller 10A operates as an active system and the power supply of the controller 10B is OFF, each controller may recover from the split brain and synchronize data in the same flow as described above. ..

このように、二重化システム１では、スプリットブレインからシステムを復旧する際、コントローラ１０Ａ、１０Ｂのデータベース２５Ａ、２５Ｂに異常が発生しているか否かの判断を行い、データベースに異常が発生していると判断された場合、そのコントローラ（ここではコントローラ１０Ａ）を再起動して現用系から待機系に切り替える。そして、二重化システム１は、待機系に切り替わったコントローラ１０Ａの保持しているデータを破棄し、現用系として稼働を継続しているコントローラ１０Ｂのデータを、待機系に切り替わったコントローラ１０Ａに同期するように構成した。 In this way, in the duplex system 1, when recovering the system from the split brain, it is determined whether or not an abnormality has occurred in the databases 25A and 25B of the controllers 10A and 10B, and it is determined that an abnormality has occurred in the database. If it is determined, the controller (controller 10A in this case) is restarted to switch from the active system to the standby system. Then, the duplex system 1 discards the data held by the controller 10A that has been switched to the standby system, and synchronizes the data of the controller 10B that is continuing to operate as the active system with the controller 10A that has been switched to the standby system. It was configured in.

特に、従来は、スプリットブレインからの復旧時に、現用系のコントローラと待機系のコントローラのそれぞれが保持するデータに異常があるかどうかの確認は行っていなかった。そのため、仮に同期するデータの提供元となるコントローラのデータに何らかの異常があった場合、当該異常のあるデータが同期されてシステムがダウンするなどの不具合が発生するおそれがあった。 In particular, conventionally, when recovering from split brain, it has not been confirmed whether or not there is an abnormality in the data held by each of the active controller and the standby controller. Therefore, if there is any abnormality in the data of the controller that is the source of the data to be synchronized, there is a possibility that the abnormal data may be synchronized and the system may go down.

これに対し、二重化システム１は、スプリットブレインからの復旧に際し、まず現用系のコントローラと待機系のコントローラのそれぞれのデータベースに異常が発生しているか否かを判断し、データベースに異常が発生していると判断されたコントローラを現用系から待機系に切り替える。そして、待機系に切り替わったコントローラの保持しているデータを破棄するとともに、現用系として稼働を継続するコントローラの保持するデータを、待機系に切り替わったコントローラに同期させる。これにより、二重化システム１は、スプリットブレインからの復旧時に、コントローラ１０Ａとコントローラ１０Ｂとの間で異常なデータが同期されることによりシステムに不具合が発生することを回避しつつ、データの整合性を保つことができる。 On the other hand, when recovering from split brain, the duplex system 1 first determines whether or not an error has occurred in each of the databases of the active controller and the standby controller, and the database has an error. Switch the controller determined to be present from the active system to the standby system. Then, the data held by the controller switched to the standby system is discarded, and the data held by the controller that continues to operate as the active system is synchronized with the controller switched to the standby system. As a result, the duplex system 1 ensures data integrity while avoiding system malfunction due to abnormal data synchronization between the controllers 10A and the controller 10B when recovering from split brain. Can be kept.

なお、図４、図５では、コントローラ１０Ａのデータベース２５Ａに異常が発生したと判断された例を説明したが、例えばコントローラ１０Ａのデータベース２５Ａに異常がなく、コントローラ１０Ｂのデータベース２５Ｂにも異常がなかった場合も想定される。その場合、両コントローラ１０Ａ、１０Ｂは、通知部３４Ａ、３４Ｂにより、自機のデータベースに異常がないと判断された旨を相互に通知し合う。 In addition, in FIGS. 4 and 5, an example in which it was determined that an abnormality occurred in the database 25A of the controller 10A was described. For example, there is no abnormality in the database 25A of the controller 10A, and there is no abnormality in the database 25B of the controller 10B. It is also assumed that this is the case. In that case, the two controllers 10A and 10B mutually notify each other by the notification units 34A and 34B that it is determined that there is no abnormality in the database of their own machine.

そして、コントローラ１０Ａ、１０Ｂは、検知部３２Ａ、３２Ｂが、２台以上のコントローラが現用系となったことを検知し、かつ、判断部３１Ａ、３１Ｂが、データベース２５Ａ、２５Ｂに異常が発生していないと判断した場合であっても、通知部３４Ａ、３４Ｂによる通知を受信し、当該通知元のコントローラとの間で自機の優先度が低い場合には、起動部３３Ａ、３３Ｂにより自機を再起動して待機系とする。 Then, the controllers 10A and 10B detect that the detection units 32A and 32B have become the active system of two or more controllers, and the determination units 31A and 31B have an abnormality in the databases 25A and 25B. Even if it is determined that there is no notification, if the notification by the notification units 34A and 34B is received and the priority of the own unit is low with the controller of the notification source, the activation unit 33A and 33B will use the activation unit 33A and 33B to control the own unit. Reboot and make it a standby system.

例えば、図４のステップＳＴ１５において、コントローラ１０Ａは判断部３１Ａにより、データベース２５Ａに異常がないと判断したとする。この場合、通知部３４Ａは、データベース２５Ａに異常がないと判断した旨をコントローラ１０Ｂに通知する。一方、コントローラ１０Ｂでも、判断部３１Ｂにより、データベース２５Ｂに異常がないと判断されているので、通知部３４Ｂは、データベース２５Ｂに異常がないと判断された旨をコントローラ１０Ａに通知する。 For example, in step ST15 of FIG. 4, it is assumed that the controller 10A determines by the determination unit 31A that there is no abnormality in the database 25A. In this case, the notification unit 34A notifies the controller 10B that there is no abnormality in the database 25A. On the other hand, also in the controller 10B, since the determination unit 31B determines that there is no abnormality in the database 25B, the notification unit 34B notifies the controller 10A that the database 25B is determined to have no abnormality.

このとき、コントローラ１０Ａは、自機のデータベース２５Ａに異常が発生していないと判断した場合であっても、自機の優先度がコントローラ１０Ｂの優先度よりも低ければ、起動部３３Ａにより自機を再起動して待機系とする。そして、コントローラ１０Ｂは、判断部３１Ｂにより、データベース２５Ｂに異常が発生していないと判断し、かつ、検知部３２Ｂにより、２台のコントローラが現用系となっていないことを検知すると、同期部３５Ｂにより、自機のデータを待機系のコントローラ１０Ａに同期させる。 At this time, even if the controller 10A determines that no abnormality has occurred in the database 25A of the own machine, if the priority of the own machine is lower than the priority of the controller 10B, the activation unit 33A causes the own machine. Is restarted to make it a standby system. Then, when the controller 10B determines that no abnormality has occurred in the database 25B by the determination unit 31B and the detection unit 32B detects that the two controllers are not in the active system, the synchronization unit 35B This synchronizes the data of the own machine with the standby controller 10A.

なお、これとは逆に、例えばコントローラ１０Ｂの優先度がコントローラ１０Ａの優先度よりも低い場合も、上記と同様の流れで、コントローラ１０Ｂが待機系に切り替わり、コントローラ１０Ａのデータをコントローラ１０Ｂに同期させればよい。 On the contrary, for example, when the priority of the controller 10B is lower than the priority of the controller 10A, the controller 10B is switched to the standby system and the data of the controller 10A is synchronized with the controller 10B in the same flow as described above. Just let me do it.

また、冗長システムが例えば３台以上のコントローラで構成され、データベースに異常がないと判断されたコントローラが２台以上あった場合も、上記と同様の流れで待機系に切り替わるコントローラを決定すればよい。例えば、冗長システムが３台のコントローラで構成され、３台ともデータベースに異常がないと判断された場合、各コントローラは、通知部により、自機のデータベースに異常がないと判断された旨を相互に通知し合う。この場合、各コントローラは、自機と当該通知元のコントローラとの間で優先度を比較し、自機の優先度が低い場合には、起動部により自機を再起動して待機系とする。そして、最も優先度の高いコントローラ（すなわち現用系を維持するコントローラ）が、同期部により自機のデータを待機系のコントローラに同期させればよい。 Further, even if the redundant system is composed of, for example, three or more controllers and there are two or more controllers that are judged to have no abnormality in the database, the controller that switches to the standby system may be determined in the same flow as above. .. For example, if the redundant system consists of three controllers and it is determined that there is no abnormality in the database of all three controllers, each controller mutually indicates that the database of its own machine is determined to be normal by the notification unit. Notify each other. In this case, each controller compares the priority between the own machine and the controller of the notification source, and if the priority of the own machine is low, the starter restarts the own machine to make it a standby system. .. Then, the controller having the highest priority (that is, the controller that maintains the active system) may synchronize the data of its own machine with the controller of the standby system by the synchronization unit.

なお、待機系に切り替わるコントローラを優先度に基づいて決定する方法はあくまで一例であり、その他の方法で待機系に切り替わるコントローラを決定してもよい。例えば、各コントローラのデータベースに格納されているデータ数を比較し、データ数が最も多いコントローラ以外のコントローラを待機系に切り替えるようにしてもよい。 The method of determining the controller to switch to the standby system based on the priority is only an example, and the controller to switch to the standby system may be determined by another method. For example, the number of data stored in the database of each controller may be compared, and the controller other than the controller having the largest number of data may be switched to the standby system.

以上のように、実施の形態１によれば、冗長システム１は、複数のコントローラ１０Ａ、１０Ｂを含んで構成され、コントローラ１０Ａ、１０Ｂは、自機が現用系の場合に、データベース２５Ａ、２５Ｂに異常が発生しているか否かを判断する判断部３１Ａ、３１Ｂと、自機が現用系の場合に、２台以上のコントローラが現用系となったか否かを検知する検知部３２Ａ、３２Ｂと、判断部３１Ａ、３１Ｂが、データベース２５Ａ、２５Ｂに異常が発生していると判断し、かつ、検知部３２Ａ、３２Ｂが、２台以上のコントローラが現用系となったことを検知した場合に、自機を再起動して待機系とする起動部３３Ａ、３３Ｂと、検知部３２Ａ、３２Ｂが、２台以上のコントローラが現用系となっていないことを検知し、かつ、判断部３１Ａ、３１Ｂが、データベース２５Ａ、２５Ｂに異常が発生していないと判断した場合に、自機のデータを待機系のコントローラに同期させる同期部３５Ａ、３５Ｂと、を備える。これにより、冗長システム１は、スプリットブレインからの復旧時に、複数のコントローラ間で異常なデータが同期されることによりシステムに不具合が発生することを回避しつつ、データの整合性を保つことができる。 As described above, according to the first embodiment, the redundant system 1 is configured to include a plurality of controllers 10A and 10B, and the controllers 10A and 10B are stored in the databases 25A and 25B when the own machine is an active system. Judgment units 31A and 31B for determining whether or not an abnormality has occurred, and detection units 32A and 32B for detecting whether or not two or more controllers have become active systems when the own machine is an active system. When the determination units 31A and 31B determine that an abnormality has occurred in the databases 25A and 25B, and the detection units 32A and 32B detect that two or more controllers have become active systems, they themselves. The activation units 33A and 33B and the detection units 32A and 32B, which restart the machine and use it as a standby system, detect that two or more controllers are not in the active system, and the judgment units 31A and 31B When it is determined that no abnormality has occurred in the databases 25A and 25B, the synchronization units 35A and 35B for synchronizing the data of the own machine with the controller of the standby system are provided. As a result, the redundant system 1 can maintain data integrity while avoiding a system malfunction due to abnormal data synchronization between a plurality of controllers when recovering from split brain. ..

また、実施の形態１によれば、冗長システム１は、判断部３１Ａ、３１Ｂがデータベース２５Ａ、２５Ｂに異常が発生していないと判断した場合に、異常が発生していない旨を他のコントローラに通知する通知部３４Ａ、３４Ｂを備え、起動部３３Ａ、３３Ｂは、検知部３２Ａ、３２Ｂが、２台以上のコントローラが現用系となったことを検知し、かつ、判断部３１Ａ、３１Ｂが、データベース２５Ａ、２５Ｂに異常が発生していないと判断した場合であっても、通知部３４Ａ、３４Ｂによる通知を受信し、当該通知元のコントローラとの間で自機の優先度が低い場合には、自機を再起動して待機系とする。これにより、冗長システム１は、同期するデータの提供元となり得るコントローラが複数あった場合でも、優先度が他より低いコントローラが待機系に切り替わるため、コントローラ間でデータの整合性を速やかに保つことができる。 Further, according to the first embodiment, when the determination units 31A and 31B determine that no abnormality has occurred in the databases 25A and 25B, the redundant system 1 informs another controller that no abnormality has occurred. Notifying units 34A and 34B are provided, the activation units 33A and 33B detect that two or more controllers have become active systems, and the determination units 31A and 31B have databases. Even if it is determined that no abnormality has occurred in 25A or 25B, if the notification by the notification units 34A and 34B is received and the priority of the own machine is low with the controller of the notification source, Restart your own machine to make it a standby system. As a result, in the redundant system 1, even if there are a plurality of controllers that can provide data to be synchronized, the controller having a lower priority than the others is switched to the standby system, so that the data consistency between the controllers can be maintained promptly. Can be done.

また、実施の形態１によれば、判断部は、データベースに対して所定の信号を送信し、当該信号に対して所定時間以内に応答がない場合に、当該データベースに異常が発生していると判断する。これにより、冗長システム１は、データベースに異常が発生しているか否かを容易に判断することができる。 Further, according to the first embodiment, the determination unit transmits a predetermined signal to the database, and when there is no response to the signal within the predetermined time, the database is said to have an abnormality. to decide. As a result, the redundant system 1 can easily determine whether or not an abnormality has occurred in the database.

なお、本願発明はその発明の範囲内において、実施の形態の任意の構成要素の変形、もしくは実施の形態において任意の構成要素の省略が可能である。 In the present invention, within the scope of the invention, it is possible to modify any component of the embodiment or omit any component in the embodiment.

１二重化システム（冗長システム）
１０コントローラ
１０Ａ現用系のコントローラ
１０Ｂ待機系のコントローラ
２０Ａ、２０ＢＳＳＤ
２５Ａ、２５Ｂデータベース
２６Ａ、２６Ｂ同期するデータ
３０Ａ、３０ＢＣＰＵ
３１Ａ、３１Ｂ判断部
３２Ａ、３２Ｂ検知部
３３Ａ、３３Ｂ起動部
３４Ａ、３４Ｂ通知部
３５Ａ、３５Ｂ同期部
１５０システムバス
２００監視ポイント 1 Redundant system (redundant system)
10 Controller 10A Active controller 10B Standby controller 20A, 20B SSD
25A, 25B Database 26A, 26B Synchronized data 30A, 30B CPU
31A, 31B Judgment unit 32A, 32B Detection unit 33A, 33B Activation unit 34A, 34B Notification unit 35A, 35B Synchronization unit 150 System bus 200 Monitoring point

Claims

A redundant system that includes multiple controllers
Each of the above controllers
A judgment unit that determines whether or not an error has occurred in the database when the own machine is a working system,
When the own machine is a working system, a detector that detects whether or not two or more controllers have become a working system, and
When the detection unit detects that two or more controllers have become active systems and the judgment unit determines that an abnormality has occurred in the database, the own machine is restarted. The startup part that is the standby system and
When the detection unit detects that two or more controllers are not in the active system and the judgment unit determines that no abnormality has occurred in the database, it waits for the data of its own machine. A synchronization unit that synchronizes with the system controller,
A redundant system characterized by being equipped with.

When the determination unit determines that an abnormality has not occurred in the database, it is provided with a notification unit for notifying another controller that no abnormality has occurred.
The starter
Even if the detection unit detects that two or more controllers have become active systems and the determination unit determines that no abnormality has occurred in the database, the notification unit will be used. The redundant system according to claim 1, wherein when a notification is received and the priority of the own machine is low with the controller of the notification source, the own machine is restarted to be a standby system.

A claim characterized in that the determination unit transmits a predetermined signal to the database, and determines that an abnormality has occurred in the database when there is no response to the signal within a predetermined time. The redundant system according to claim 1 or 2.

A data synchronization method using multiple controllers that make up a redundant system.
Each of the above controllers
The step that the judgment unit determines whether or not an error has occurred in the database when the own machine is a working system,
The step that the detection unit detects that two or more controllers have become the active system when the own machine is the active system,
When the detection unit detects that two or more controllers have become active systems and the judgment unit determines that an abnormality has occurred in the database, the activation unit activates the own machine. The step of restarting and making it a standby system,
When the detection unit detects that two or more controllers are not in the active system and the judgment unit determines that an abnormality has not occurred in the database, the synchronization unit causes the own machine. Steps to synchronize the data of the standby system with the controller of the standby system,
A data synchronization method characterized by being equipped with.