JP2018018328A

JP2018018328A - Cluster system and server control program

Info

Publication number: JP2018018328A
Application number: JP2016148525A
Authority: JP
Inventors: 厚大堀; Atsushi Ohori
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2016-07-28
Filing date: 2016-07-28
Publication date: 2018-02-01
Anticipated expiration: 2036-07-28
Also published as: JP6702060B2

Abstract

PROBLEM TO BE SOLVED: To provide a cluster system which performs file update of an OS while suppressing influences upon services to a minimum and can be easily recovered when any problem occurs.SOLUTION: A cluster system is characterized in that, when performing file update of an OS of each server, a first server or a second server of a standby system at present uses an OS switching control part to switch a first OS or a second OS that is an OS of a stop system in the previous generation, to an active system, uses a file update control part to perform the file update of the first OS or the second OS that is switched to the active system by a previously acquired update file and, after the file update of the first OS or the second OS, uses a server switching control part to perform control for switching an operation system and the standby system of the first server and the second server. The other first server or the other second server switched to the standby system performs the file update of the OS in accordance with similar procedures.SELECTED DRAWING: Figure 1

Description

本発明は、クラスタシステム及びサーバ制御プログラムに関し、例えば、例えば、２重化冗長構成（運用系（ＡＣＴ系）／待機系（ＳＢＹ系））のサーバを有するクラスタシステムに適用し得るものである。 The present invention relates to a cluster system and a server control program, and can be applied to, for example, a cluster system having a server with a dual redundant configuration (active system (ACT system) / standby system (SBY system)).

近年、継続的にサービスを提供するためにクラスタシステムが広く普及している。クラスタシステムの方式としては種々の方式があるが、複数のサーバを使用して冗長化し、システムの停止時間を最小限に抑え、業務の可用性を向上させる方式としてＨＡ（ＨｉｇｈＡｖａｉｌａｂｉｌｉｔｙ）クラスタシステムが存在する。 In recent years, cluster systems have become widespread in order to provide services continuously. There are various cluster system methods, but HA (High Availability) cluster system exists as a method to improve redundancy by using multiple servers to minimize system downtime and improve business availability. To do.

このＨＡクラスタシステムは、サービス提供中（以下、「ＡＣＴ」系と呼ぶ）サーバで障害を検知すると、サービス未提供サーバ（以下、「ＳＢＹ」系と呼ぶ）に切り替え、業務のダウンタイムを短くすることでサーバの信頼性を向上させるシステムである。 When this HA cluster system detects a failure in a service-providing server (hereinafter referred to as an “ACT” system), it switches to a service non-providing server (hereinafter referred to as an “SBY” system) and shortens the downtime of work. This is a system that improves the reliability of the server.

ところで、ＨＡクラスタシステムのサーバは、サービスの提供中に、サーバ上のプログラムに問題（不具合等）が発生した場合は、サーバ上にインストールされているソフトウェアの更新（以下、「ファイル更新」と呼ぶ）が必要となる（例えば、特許文献１参照）。 By the way, the server of the HA cluster system updates the software installed on the server (hereinafter referred to as “file update”) when a problem (problem or the like) occurs in the program on the server while providing the service. ) Is required (see, for example, Patent Document 1).

また、通信機器等のサーバは、提供中のサービスを停止することなく、ファイル更新を行う必要がある。通常、ＨＡクラスタシステムでファイル更新を行う場合は、まずＳＢＹ系のサーバをファイル更新した後に、ＡＣＴ系のサーバとＳＢＹ系のサーバの切り替えを行い、切り替えた後にＳＢＹ系（元はＡＣＴ系）のサーバのファイル更新を再度行う。このような手順を取ることで、サービスの停止時間を最小限に抑えることが可能となる。 Further, a server such as a communication device needs to update a file without stopping a service being provided. Normally, when updating a file in an HA cluster system, first update the file of the SBY server, then switch between the ACT server and the SBY server, and then switch to the SBY system (originally the ACT system). Update the server file again. By taking such a procedure, the service stop time can be minimized.

また、ファイル更新を行うサーバは、更新前に更新対象のファイルをバックアップしておくことで、もしファイル更新後に、更新したファイルのどこかで不具合が発生してしまい、ファイル更新前の状態に戻したい場合に、このバックアップしたファイルを使用して、ファイル更新前の状態に戻すことが可能である。 In addition, the server that updates the file backs up the file to be updated before the update, so if a problem occurs somewhere in the updated file after the file update, the server returns to the state before the file update. If you want, you can use this backed up file to return to the state before the file was updated.

特開２００８−２４２６７９号公報JP 2008-242679 A

ところで、サーバ上のＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）やカーネル関連のファイルについては、通常はＲＰＭ等のファイル形式でパッケージ管理されていることが多い。 By the way, the OS (Operating System) and kernel related files on the server are usually package-managed in a file format such as RPM.

しかしながら、ＲＰＭパッケージは、パッケージ単位で他のパッケージとの依存関係を持っているため、インストールは容易に行えても、アンインストールするときに、対象のパッケージが他のパッケージと依存関係にある場合には、アンインストール出来ない制限がある。この場合、サーバは、他の依存関係にあるパッケージを全てアンインストールした後に、対象のパッケージをアンインストールする必要が生じる。 However, RPM packages have dependencies with other packages on a package-by-package basis, so if the target package has dependencies with other packages when uninstalling, it can be installed easily. There is a restriction that cannot be uninstalled. In this case, the server needs to uninstall the target package after uninstalling all the other packages having the dependency relationship.

そのため、サーバは、ＲＰＭパッケージをファイル更新した後に、ファイル更新前の状態に戻す場合には、ＲＰＭパッケージの依存関係のチェックが必要となる。よって、パッケージの管理が必要となり、更新するＲＰＭパッケージ量が膨大となった場合には、管理が非常に煩雑となってしまう懸念がある。また、サーバは、誤ったＲＰＭファイルをインストールしてしまった場合など、場合によっては対象装置のＯＳの再インストールが必要になってしまう可能性もあり、容易に元の状態に戻すことが難しいという課題がある。 Therefore, when the server updates the RPM package and returns it to the state before the file update, it is necessary to check the dependency relationship of the RPM package. Therefore, when package management is required and the amount of RPM packages to be updated becomes enormous, there is a concern that management becomes very complicated. In addition, the server may need to reinstall the OS of the target device in some cases, such as when an incorrect RPM file is installed, and it is difficult to easily return to the original state. There are challenges.

そのため、サービスへの影響を最小限に抑えてＯＳのファイル更新を行い、更新したファイルに問題が生じた場合には容易に復旧作業を行うことができるクラスタシステム及びサーバ制御プログラムが望まれている。 Therefore, there is a demand for a cluster system and a server control program that can update the OS file while minimizing the impact on the service, and can easily perform recovery work when a problem occurs in the updated file. .

第１の本発明は、第１のサーバ及び第２のサーバを有し、前記第１のサーバ及び第２のサーバの内、いずれか一方のサーバが運用系として動作し、他方のサーバが待機系として待機するクラスタシステムにおいて、前記各サーバは、（１）運用系と待機系を切り替える制御を行うサーバ切り替え制御部と、（２）第１のＯＳ及び第２のＯＳを別々の記憶領域に保持する記憶部と、（３）前記第１のＯＳ及び第２のＯＳの内、いずれか一方のＯＳをサーバとしての基本的な機能を発揮するための起動系のＯＳとして起動し、もう一方のＯＳを停止系のＯＳとして停止させる制御を行うＯＳ切り替え制御部と、（４）前記第１のＯＳ及び第２のＯＳの内、いずれか一方のＯＳのみを最新の状態にファイル更新し、他方のＯＳを一世代前の状態のまま保持するファイル更新制御部とを備え、（５）前記各サーバのＯＳのファイル更新を行う場合に、現在待機系の第１のサーバ又は第２のサーバは、前記ＯＳ切り替え制御部を用いて、一世代前の停止系のＯＳである前記第１のＯＳ又は第２のＯＳを、起動系に切り替え、前記ファイル更新制御部を用いて、予め取得した更新ファイルにより、起動系に切り替えた前記第１のＯＳ又は第２のＯＳのファイル更新を行い、前記第１のＯＳ又は第２のＯＳのファイル更新後、前記サーバ切り替え制御部を用いて、前記第１のサーバ及び第２のサーバの運用系と待機系を切り替える制御を行い、待機系に切り替わったもう一方の第１のサーバ又は第２のサーバは、切り替え前に待機系であった第１のサーバ又は第２のサーバと同様の手順により、ＯＳのファイル更新を行うことを特徴とする。 The first aspect of the present invention includes a first server and a second server, and one of the first server and the second server operates as an active system, and the other server waits. In the cluster system that stands by as a system, each of the servers includes (1) a server switching control unit that controls switching between the active system and the standby system, and (2) the first OS and the second OS in separate storage areas. (3) one of the first OS and the second OS is booted as a booting OS for performing basic functions as a server, and the other An OS switching control unit that performs control to stop the OS of the OS as a stop OS, and (4) update only one of the first OS and the second OS to the latest state, Leave the other OS the previous generation And (5) when updating the OS file of each server, the first server or the second server currently in the standby system uses the OS switching control unit, The first OS or the second OS, which is the OS of the stop system one generation before, is switched to the boot system, and the file update control unit is used to switch to the boot system using the update file acquired in advance. Update the file of the first OS or the second OS, update the file of the first OS or the second OS, and then operate the first server and the second server using the server switching control unit The other first server or second server that has switched to the standby system is controlled in the same manner as the first server or the second server that was the standby system before switching. O And performing a file update.

第２の本発明のサーバ制御プログラムは、第１のＯＳ及び第２のＯＳを別々の記憶領域に保持する記憶部を備える第１のサーバ及び第２のサーバを有し、前記第１のサーバ及び第２のサーバの内、いずれか一方のサーバが運用系として動作し、他方のサーバが待機系として待機するクラスタシステムを構成する各サーバに搭載されるコンピュータを、（１）運用系と待機系を切り替える制御を行うサーバ切り替え制御部と、（２）前記第１のＯＳ及び第２のＯＳの内、いずれか一方のＯＳをサーバとしての基本的な機能を発揮するための起動系のＯＳとして起動し、もう一方のＯＳを停止系のＯＳとして停止させる制御を行うＯＳ切り替え制御部と、（３）前記第１のＯＳ及び第２のＯＳの内、いずれか一方のＯＳのみを最新の状態にファイル更新し、他方のＯＳを一世代前の状態のまま保持するファイル更新制御部として機能させ、（４）前記各サーバのＯＳのファイル更新を行う場合に、現在待機系の第１のサーバ又は第２のサーバは、前記ＯＳ切り替え制御部を用いて、一世代前の停止系のＯＳである前記第１のＯＳ又は第２のＯＳを、起動系に切り替え、前記ファイル更新制御部を用いて、予め取得した更新ファイルにより、起動系に切り替えた前記第１のＯＳ又は第２のＯＳのファイル更新を行い、前記第１のＯＳ又は第２のＯＳのファイル更新後、前記サーバ切り替え制御部を用いて、前記第１のサーバ及び第２のサーバの運用系と待機系を切り替える制御を行い、待機系に切り替わったもう一方の第１のサーバ又は第２のサーバは、切り替え前に待機系であった第１のサーバ又は第２のサーバと同様の手順により、ＯＳのファイル更新を行うことを特徴とする。 A server control program according to a second aspect of the present invention includes a first server and a second server each having a storage unit that holds the first OS and the second OS in separate storage areas, and the first server And the second server, one of the servers operates as an active system, and the other server stands by as a standby system. A server switching control unit that performs switching control of the system; and (2) a booting-system OS for demonstrating the basic function of either one of the first OS and the second OS as a server. And an OS switching control unit that performs control to stop the other OS as a stop OS, and (3) only one of the first OS and the second OS is the latest Phi to state Update and function as a file update control unit that keeps the other OS in the state of the previous generation, and (4) when updating the file of the OS of each server, The server of 2 uses the OS switching control unit to switch the first OS or the second OS, which is the stop system OS one generation before, to the startup system, and uses the file update control unit. Update the file of the first OS or the second OS switched to the active system using the update file acquired in advance, and use the server switching control unit after updating the file of the first OS or the second OS. Thus, control is performed to switch the active system and the standby system of the first server and the second server, and the other first server or second server that has switched to the standby system is a standby system before switching. First By a procedure similar to the server or the second server, and performs the file update of the OS.

本発明によれば、サービスへの影響を最小限に抑えてＯＳのファイル更新を行い、更新したファイルに問題が生じた場合には容易に復旧作業を行うことができる。 According to the present invention, it is possible to update an OS file while minimizing the influence on the service, and to easily perform a recovery operation when a problem occurs in the updated file.

実施形態の記憶部の詳細構成を示すブロック図である。It is a block diagram which shows the detailed structure of the memory | storage part of embodiment. 実施形態のクラスタシステムの構成及びサーバの内部構成を示すブロック図である。It is a block diagram which shows the structure of the cluster system of embodiment, and the internal structure of a server. 実施形態のサーバの制御系の機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the control system of the server of embodiment. 実施形態のクラスタシステムを構成する各サーバのＯＳ関連ファイルの更新動作を示す説明図である。It is explanatory drawing which shows the update operation | movement of the OS related file of each server which comprises the cluster system of embodiment. 実施形態のクラスタシステムを構成する各サーバのＯＳ関連ファイルの戻し動作を示す説明図であるIt is explanatory drawing which shows return operation | movement of the OS related file of each server which comprises the cluster system of embodiment. 実施形態のクラスタシステムの各サーバのＯＳの版（バージョン）が、ファイル更新（アップデート）、及びファイル戻し（ダウングレード）を実行する毎に変化する様子をイメージ化した説明図である。It is explanatory drawing which imaged a mode that the version (version) of each server of the cluster system of embodiment changed whenever file update (update) and file return (downgrade) were performed.

（Ａ）主たる実施形態
以下、本発明によるクラスタシステム及びサーバ制御プログラムの実施形態を、図面を参照しながら詳述する。 (A) Main Embodiments Hereinafter, embodiments of a cluster system and a server control program according to the present invention will be described in detail with reference to the drawings.

（Ａ−１）実施形態の構成
（Ａ−１−１）全体構成
図２は、この実施形態のクラスタシステムの構成及びサーバの内部構成を示すブロック図である。 (A-1) Configuration of Embodiment (A-1-1) Overall Configuration FIG. 2 is a block diagram showing the configuration of the cluster system and the internal configuration of the server according to this embodiment.

図２において、クラスタシステム１は、２台のサーバ１０（サーバ１０−１及びサーバ１０−２）を有して構成されるものである。以下、サーバ１０−１を、単に「第１サーバ」と呼ぶときもある。同様に、サーバ１０−２を、「第２サーバ」と呼ぶときもある。 In FIG. 2, the cluster system 1 includes two servers 10 (server 10-1 and server 10-2). Hereinafter, the server 10-1 may be simply referred to as a “first server”. Similarly, the server 10-2 is sometimes referred to as a “second server”.

第１サーバ及び第２サーバは、様々なサービスを提供するものであり、例えばＩＰ電話サービスの管理、制御を行うＳＩＰサーバ等の種々の業務サービスを提供するサーバが該当する。また、第１サーバ及び第２サーバのハードウェア構成は、一般的な情報処理装置と同じ構成を有しており、ソフトウェア構成としては、例えば、Ｌｉｎｕｘ（登録商標）、ＵＮＩＸ（登録商標）、Ｗｉｎｄｏｗｓ（登録商標）等をＯＳとするものが該当する。 The first server and the second server provide various services. For example, a server that provides various business services such as an SIP server that manages and controls an IP telephone service is applicable. The hardware configuration of the first server and the second server has the same configuration as that of a general information processing apparatus. Examples of software configurations include Linux (registered trademark), UNIX (registered trademark), and Windows. Applicable to those using (registered trademark) as an OS.

さらに、第１サーバ及び第２サーバは、ＨＡクラスタシステムの構成サーバであるから、双方とも同じ機能を備えるものであり、相互に死活監視を行っている。運用系（ＡＣＴ系）サーバに障害が生じた場合には、待機系（ＳＢＹ系）サーバへの切り替え処理が行われる。 Furthermore, since the first server and the second server are constituent servers of the HA cluster system, both have the same function, and perform alive monitoring with each other. When a failure occurs in the active system (ACT system) server, a switching process to the standby system (SBY system) server is performed.

（Ａ−１−２）サーバの詳細な構成
図２に示すように、本実施形態に係るサーバ１０（１０−１、１０−２）は、ＣＰＵ１１（１１−１、１１−２）、メモリ１２（１２−１、１２−２）、記憶部１３（１３−１、１３−２）、及び通信部１４（１４−１、１４−２）を少なくとも備える。なお、以下では、２台のサーバ１０（１０−１、１０−２）の共通する処理を説明する場合においては、符号の枝番を省略して説明する。 (A-1-2) Detailed Configuration of Server As shown in FIG. 2, the server 10 (10-1, 10-2) according to the present embodiment includes a CPU 11 (11-1, 11-2), a memory 12 (12-1, 12-2), storage unit 13 (13-1, 13-2), and communication unit 14 (14-1, 14-2). In the following description, in the case of explaining a process common to the two servers 10 (10-1, 10-2), the branch number of the code is omitted.

ＣＰＵ１１は、サーバ１０の演算処理装置である。 The CPU 11 is an arithmetic processing device of the server 10.

メモリ１２は、サーバ１０のメインメモリ（揮発性メモリ）である。メモリ１２は、記憶部１３から読み込んだプログラムが展開され、ＣＰＵ１１により実行される領域である。 The memory 12 is a main memory (volatile memory) of the server 10. The memory 12 is an area where the program read from the storage unit 13 is expanded and executed by the CPU 11.

通信部１４は、例えば、図示しないＩＰネットワークに接続するインタフェースであり、パケットの送受信をして通信する部分である。 The communication unit 14 is, for example, an interface connected to an IP network (not shown), and is a part that communicates by transmitting and receiving packets.

記憶部１３は、ＣＰＵ１１により実行される処理プログラムや、情報処理に必要な各種情報等を記憶するものである。記憶部１３は、例えば、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）、及びフラッシュメモリ等の不揮発性の記憶装置を適用することができる。 The storage unit 13 stores a processing program executed by the CPU 11, various information necessary for information processing, and the like. As the storage unit 13, for example, a nonvolatile storage device such as an HDD (Hard Disk Drive), an SSD (Solid State Drive), or a flash memory can be applied.

図１は、記憶部の詳細構成を示すブロック図である。図１において、記憶部１３は、５つのパーティションに分割されており、第１のパーティションから順に、ブートローダ１３１、第１のサーバＯＳ１３２、第２のサーバＯＳ１３３、アプリケーション１３４、及びその他１３５が格納されている。なお、図１の構成例は、一例であって、記憶部１３内のパーティション分割方法、及び各パーティションに格納されるプログラムについてはこれに限定されるものではない。 FIG. 1 is a block diagram illustrating a detailed configuration of the storage unit. In FIG. 1, the storage unit 13 is divided into five partitions, and a boot loader 131, a first server OS 132, a second server OS 133, an application 134, and others 135 are stored in order from the first partition. Yes. The configuration example of FIG. 1 is an example, and the partition division method in the storage unit 13 and the program stored in each partition are not limited thereto.

ブートローダ１３１は、サーバ１０の電源投入時（再起動時を含む）に最初に起動されるものであり、第１のサーバＯＳ１３２、又は第２のサーバＯＳ１３３のいずれかを選択してＯＳを起動させる。例えば、ブートローダ１３１は、ｇｒｕｂ等に相当するプログラムである。 The boot loader 131 is activated first when the server 10 is powered on (including when it is restarted), and selects either the first server OS 132 or the second server OS 133 to activate the OS. . For example, the boot loader 131 is a program corresponding to grub or the like.

第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３は、サーバ本来としての基本的な機能を発揮させるためのプログラムである。第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３は、Ｌｉｎｕｘ、ＵＮＩＸ、及びＷｉｎｄｏｗｓ等が該当する。なお、この実施形態では、ブートローダ１３１により選択されて起動しているサーバ１０のＯＳ（第１のサーバＯＳ１３２、又は第２のサーバＯＳ１３３のいずれか）を「起動系」と呼び、もう一方の起動していないＯＳを「停止系」と呼ぶものとする。また、ＯＳ関連のファイルは、ＲＰＭ等のパッケージ形式で管理されていても良いし、個別に管理されていても良い（この実施形態では、ＯＳ関連のファイルは、パッケージ形式で管理されているものとする）。 The first server OS 132 and the second server OS 133 are programs for demonstrating the basic functions inherent to the server. The first server OS 132 and the second server OS 133 correspond to Linux, UNIX, Windows, and the like. In this embodiment, the OS (either the first server OS 132 or the second server OS 133) of the server 10 that is selected and started by the boot loader 131 is referred to as a “boot system”, and the other boot An OS that has not been executed is referred to as a “stop system”. Further, the OS-related file may be managed in a package format such as RPM, or may be managed individually (in this embodiment, the OS-related file is managed in the package format). And).

アプリケーション１３４は、第１のサーバＯＳ１３２、又は第２のサーバＯＳ１３３が共通に使用するアプリケーションプログラムである。例えば、アプリケーション１３４は、サーバ１０が、ＳＩＰサーバとして機能する場合には、ＳＩＰサーバとして機能させるプログラムが該当する。また、サーバ１０が、データベースサーバとして機能する場合には、データベースサーバとして機能させるプログラムが該当する。 The application 134 is an application program that is commonly used by the first server OS 132 or the second server OS 133. For example, the application 134 corresponds to a program that causes the server 10 to function as a SIP server when the server 10 functions as a SIP server. Further, when the server 10 functions as a database server, a program for causing the server 10 to function as a database server is applicable.

その他１３５は、第１のサーバＯＳ１３２、又は第２のサーバＯＳ１３３が共通に使用する領域であって、例えば、第１のサーバＯＳ１３２、及び第２のサーバＯＳ１３３のログを記憶させる領域である。例えば、第１のサーバＯＳ１３２、及び第２のサーバＯＳ１３３が、Ｌｉｎｕｘであれば、ｔｍｐディレクトリが該当する。 The other 135 is an area used in common by the first server OS 132 or the second server OS 133, and is an area for storing, for example, logs of the first server OS 132 and the second server OS 133. For example, if the first server OS 132 and the second server OS 133 are Linux, the tmp directory is applicable.

図３は、サーバ１０の制御系の機能的構成を示すブロック図である。 FIG. 3 is a block diagram illustrating a functional configuration of the control system of the server 10.

図３において、サーバ１０は、制御部１００を有する。この実施形態では、制御部１００は、記憶部１３に記憶されたＯＳを含む種々のプログラムをメモリ１２に展開し、ＣＰＵ１１により各プログラムを実行することにより実現されるが、実現方法はこれに限らず、サーバ１０のソフトウェア構成部の一部をハードウェアで実現しても良い（又はその逆でも良い）。いずれの手法で実現したとしても、サーバ１０の制御系の構成は、図３で示すものである。 In FIG. 3, the server 10 includes a control unit 100. In this embodiment, the control unit 100 is realized by developing various programs including the OS stored in the storage unit 13 in the memory 12 and executing each program by the CPU 11, but the implementation method is not limited thereto. Instead, a part of the software configuration unit of the server 10 may be realized by hardware (or vice versa). Regardless of which method is used, the configuration of the control system of the server 10 is as shown in FIG.

制御部１００は、サーバ１０の機能を制御するものであり、本実施形態の特徴部分として、サーバ切り替え制御部１０１、ＯＳ切り替え制御部１０２、及びファイル更新制御部１０３を備える。 The control unit 100 controls the function of the server 10, and includes a server switching control unit 101, an OS switching control unit 102, and a file update control unit 103 as characteristic parts of the present embodiment.

サーバ切り替え制御部１０１は、サーバ１０（１０−１、１０−２）をＳＢＹ系からＡＣＴ系（又はその逆）に切り替える制御を行う。例えば、サーバ１０−２（ＳＢＹ系）のサーバ切り替え制御部１０１は、サーバ１０−１（ＡＣＴ系）の障害を検知した場合には、サーバ１０−２をＳＢＹ系からＡＣＴ系に切り替える制御を行う。また、サーバ切り替え制御部１０１は、第１のサーバＯＳ１３２、又は第２のサーバＯＳ１３３のファイル更新を行う場合にも実行される（詳しい動作は、後述する）。 The server switching control unit 101 performs control to switch the server 10 (10-1, 10-2) from the SBY system to the ACT system (or vice versa). For example, when the server switching control unit 101 of the server 10-2 (SBY system) detects a failure of the server 10-1 (ACT system), the server 10-2 performs control to switch the server 10-2 from the SBY system to the ACT system. . The server switching control unit 101 is also executed when updating the file of the first server OS 132 or the second server OS 133 (detailed operation will be described later).

ＯＳ切り替え制御部１０２は、サーバ１０のＯＳを、第１のサーバＯＳ１３２、又は第２のサーバＯＳ１３３に切り替える制御を行うもの（先述のブートローダ１３１に相当するもの）である。この実施形態では、ＯＳ切り替え制御部１０２は、第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３のファイル更新（アップデート）に伴い実行される（詳しい動作は、後述する）。 The OS switching control unit 102 performs control to switch the OS of the server 10 to the first server OS 132 or the second server OS 133 (corresponding to the boot loader 131 described above). In this embodiment, the OS switching control unit 102 is executed along with file update (update) of the first server OS 132 and the second server OS 133 (detailed operations will be described later).

ファイル更新制御部１０３は、ＯＳ（第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３）及びアプリケーションのファイル更新（アップデート）の制御を行うものである。例えば、ファイル更新制御部１０３は、通信部１４を介して、アプリケーション１３４のファイル更新の要求を外部から受信した場合には、図示しないファイル配信サーバから更新ファイルをダウンロードし、アプリケーション１３４のアップデートを行う。なお、更新ファイルは、例えば、ＵＳＢを介してサーバ１０へ提供されても良い。 The file update control unit 103 controls file update (update) of the OS (first server OS 132 and second server OS 133) and applications. For example, when a file update request for the application 134 is received from the outside via the communication unit 14, the file update control unit 103 downloads an update file from a file distribution server (not shown) and updates the application 134. . The update file may be provided to the server 10 via USB, for example.

次に、本実施形態の特徴部分であるＯＳのファイル更新手順の説明を行う。 Next, the OS file update procedure, which is a characteristic part of the present embodiment, will be described.

ＯＳのファイル更新は、クラスタシステム１の２台のサーバ１０（第１サーバ、第２サーバ）の内、ＳＢＹ系のサーバ１０から実行される。例えば、第１サーバがＡＣＴ系で、第２サーバがＳＢＹ系の場合には、第２サーバからＯＳのファイル更新が実行される。 The OS file update is executed from the SBY server 10 out of the two servers 10 (first server and second server) of the cluster system 1. For example, when the first server is an ACT system and the second server is an SBY system, OS file update is executed from the second server.

また、この実施形態では、第１サーバ及び第２サーバは、各々、二つのＯＳ（第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３）を備える構成であり、いずれかのＯＳが起動系として動作し、もう一方のＯＳが停止系の状態となっている。ＳＢＹ系の第２サーバは、停止系のＯＳ（停止系のＯＳを起動系に切り替えて）からファイル更新を行う。例えば、第１のサーバＯＳ１３２が起動系で、第２のサーバＯＳ１３３が停止系であるものとする。この場合、ＳＢＹ系の第２サーバのＯＳ切り替え制御部１０２は、第２のサーバＯＳ１３３を起動系（第１のサーバＯＳ１３２を停止系）に切り替える。切り替え後、ファイル更新制御部１０３は、通信部を介して、図示しない配信サーバから予め取得したＯＳ関連のファイルを用いて、第２のサーバＯＳ１３３（起動系）のアップデートを行う。 In this embodiment, each of the first server and the second server is configured to include two OSs (a first server OS 132 and a second server OS 133), and one of the OSs operates as a startup system. The other OS is in a stopped state. The second SBY server updates the file from the stop OS (switching the stop OS to the start OS). For example, it is assumed that the first server OS 132 is a start system and the second server OS 133 is a stop system. In this case, the OS switching control unit 102 of the SBY second server switches the second server OS 133 to the start system (the first server OS 132 is the stop system). After the switching, the file update control unit 103 updates the second server OS 133 (boot system) using an OS-related file acquired in advance from a distribution server (not shown) via the communication unit.

ここまでの処理により、ＳＢＹ系の第２サーバの第２のサーバＯＳ１３３（起動系）は、最新のバージョンとなる。一方、第１のサーバＯＳ１３２は、最新から一世代前のバージョンのままである。このように、サーバ１０は、ＯＳのファイル更新が実行されるごとに、第１のサーバＯＳ１３２又は第２のサーバＯＳ１３３のいずれかを更新することになるために、言い換えれば、異なるＯＳのバージョンを常に保持することになる。一般的に、最新のバージョンは、不具合を伴う可能性も高い。よって、これ以前に起動系として動作していた１世代前のＯＳを保持（バックアップ）することは、不具合を治癒する手段として有効である。この実施形態では、最新のバージョンのＯＳに不具合が見つかった場合には、１世代前のＯＳである停止系のＯＳ（第１のサーバＯＳ１３２又は第２のサーバＯＳ１３３のいずれか）を、ＯＳ切り替え制御部１０２の切り替え制御により起動系に切り替えるだけで、不具合の存在しないＯＳ（１世代前のＯＳ）に戻すことが可能である。 By the processing so far, the second server OS 133 (boot system) of the SBY second server becomes the latest version. On the other hand, the first server OS 132 remains the version one generation before the latest. As described above, the server 10 updates either the first server OS 132 or the second server OS 133 every time the OS file update is executed. Will always hold. In general, the latest version is likely to be defective. Therefore, maintaining (backup) the previous generation OS that had been operating as the boot system before this is effective as a means for healing the problem. In this embodiment, when a problem is found in the latest version of the OS, the OS of the stopped system (either the first server OS 132 or the second server OS 133) that is the previous generation OS is switched to OS. It is possible to return to an OS that does not have a problem (an OS before one generation) by simply switching to the startup system by switching control of the control unit 102.

ＳＢＹ系の第２サーバのＯＳファイル更新が終了した後、サーバ切り替え制御部１０１の制御により、第２サーバをＡＣＴ系として、第１サーバをＳＢＹ系とする。 After the OS file update of the second SBY server is completed, the second server is set as the ACT system and the first server is set as the SBY system under the control of the server switching control unit 101.

そして、ＡＣＴ系からＳＢＹ系となった第１サーバについても、上記第２サーバで示した同様の手順により、第１のサーバＯＳ１３２又は第２のサーバＯＳ１３３のいずれかのファイル更新を行う。 Then, for the first server that is changed from the ACT system to the SBY system, the file update of either the first server OS 132 or the second server OS 133 is performed according to the same procedure shown for the second server.

（Ａ−２）実施形態の動作
次に、以上のような構成を有する実施形態のクラスタシステム１の動作を説明する。 (A-2) Operation | movement of embodiment Next, operation | movement of the cluster system 1 of embodiment which has the above structures is demonstrated.

（Ａ−２−１）ＯＳファイルの更新動作（アップデート）
図４は、クラスタシステムを構成する各サーバのＯＳ関連ファイルの更新動作を示す説明図である。図４において、左半分のステップＳ１、Ｓ４、Ｓ６、Ｓ７は、第１サーバ（サーバ１０−１）の状態を示している。また、図４において、右半分のステップＳ２、Ｓ３、Ｓ５、Ｓ８は、第２サーバ（サーバ１０−２）の状態を示している。なお、ステップＳ１及びＳ２は、第１サーバ及び第２サーバの初期状態を示しており、実際のファイル更新の動作はステップＳ３から開始される。なお、この実施形態では、ＯＳをアップデートする更新ファイルは、予め配信サーバからダウンロードし、記憶部１３内（例えば、その他１３５−１、１３５−２）に保持されているものとする。 (A-2-1) OS file update operation (update)
FIG. 4 is an explanatory diagram showing the update operation of the OS related file of each server constituting the cluster system. In FIG. 4, steps S1, S4, S6, and S7 on the left half indicate the state of the first server (server 10-1). In FIG. 4, steps S2, S3, S5, and S8 in the right half indicate the state of the second server (server 10-2). Steps S1 and S2 indicate the initial states of the first server and the second server, and the actual file update operation starts from step S3. In this embodiment, it is assumed that an update file for updating the OS is downloaded in advance from the distribution server and held in the storage unit 13 (for example, the other 135-1, 135-2).

クラスタシステム１において、第１サーバがＡＣＴ系であって、第２サーバがＳＢＹ系である（Ｓ１、Ｓ２）。また、第１サーバは、第１のサーバＯＳ１３２−１が起動系として起動している。同様に、第２サーバは、第１のサーバＯＳ１３２−２が起動系として起動している。 In the cluster system 1, the first server is an ACT system and the second server is an SBY system (S1, S2). In addition, the first server is activated by the first server OS 132-1 as the activation system. Similarly, in the second server, the first server OS 132-2 is activated as the activation system.

第２サーバ（ＳＢＹ系）のＯＳ切り替え制御部１０２は、ＯＳを第１のサーバＯＳ１３２−２から第２のサーバＯＳ１３３−２へ切り替える制御（リブート）を行う（Ｓ３）。切り替え後、第２サーバ（ＳＢＹ系）のファイル更新制御部１０３は、予め所得した更新ファイルを用いて、ＯＳのアップデートを行う。 The OS switching control unit 102 of the second server (SBY system) performs control (reboot) to switch the OS from the first server OS 132-2 to the second server OS 133-2 (S3). After the switching, the file update control unit 103 of the second server (SBY system) updates the OS using the update file that has been obtained in advance.

一方、第２サーバ（ＳＢＹ系）がＯＳをアップデータしている間、第１サーバ（ＡＣＴ系）は、運用サーバとして稼働している（Ｓ４）。 On the other hand, while the second server (SBY system) updates the OS, the first server (ACT system) operates as an operation server (S4).

先述のステップＳ３の処理の後（第２のサーバＯＳ１３３−２のファイル更新後）、第２サーバ（ＳＢＹ系）のサーバ切り替え制御部１０１は、第２サーバをＡＣＴ系として、第１サーバをＳＢＹ系とするサーバの切り替え制御を行う（Ｓ５、Ｓ６）。 After the processing in step S3 described above (after updating the file of the second server OS 133-2), the server switching control unit 101 of the second server (SBY system) sets the second server as the ACT system and the first server as the SBY. Switching control of servers to be used is performed (S5, S6).

ＡＣＴ系からＳＢＹ系に切り替わった第１サーバは、先述のステップＳ３と同様の手順によりファイル更新を行う（Ｓ７）。具体的には、第１サーバ（ＳＢＹ系）のＯＳ切り替え制御部１０２は、ＯＳを第１のサーバＯＳ１３２−１から第２のサーバＯＳ１３３−１へ切り替える制御（リブート）を行う。切り替え後、第１サーバ（ＳＢＹ系）のファイル更新制御部１０３は、予め所得した更新ファイルを用いて、ＯＳのアップデートを行う。 The first server switched from the ACT system to the SBY system updates the file by the same procedure as in step S3 described above (S7). Specifically, the OS switching control unit 102 of the first server (SBY system) performs control (reboot) to switch the OS from the first server OS 132-1 to the second server OS 133-1. After the switching, the file update control unit 103 of the first server (SBY system) updates the OS using the update file that has been obtained in advance.

一方、第１サーバ（ＳＢＹ系）がＯＳをアップデータしている間、第２サーバ（ＡＣＴ系）は、運用サーバとして稼働している（Ｓ８）。 On the other hand, while the first server (SBY system) updates the OS, the second server (ACT system) operates as an operation server (S8).

（Ａ−２−２）ＯＳファイルの戻し動作（ダウングレード）
次に、クラスタシステム１の各サーバ（第１サーバ、第２サーバ）ＯＳ更新後、更新したファイルの不具合（バグ）により、各サーバのＯＳを更新前の状態に復旧させる処理について説明する。 (A-2-2) OS file return operation (downgrade)
Next, a process for restoring the OS of each server to the pre-update state due to a defect (bug) of the updated file after updating each server (first server, second server) OS of the cluster system 1 will be described.

図５は、クラスタシステムを構成する各サーバのＯＳ関連ファイルの戻し動作を示す説明図である。図５において、左半分のステップＳ９、Ｓ１１、Ｓ１４は、第１サーバ（サーバ１０−１）の状態を示している。また、図５において、右半分のステップＳ１０、Ｓ１２、Ｓ１３は、第２サーバ（サーバ１０−２）の状態を示している。なお、図５のステップＳ９からの処理は、先述のステップＳ８から続く処理である。すなわち、Ｓ９の直前のクラスタシステムは、第１サーバがＳＢＹ系であって、第２サーバがＡＣＴ系である。また、第１サーバは、ファイル更新済み（最新版）の第２のサーバＯＳ１３３−１が、起動系として動作している。同様に、第２サーバは、ファイル更新済み（最新版）の第２のサーバＯＳ１３３−２が起動系として動作している。なお、第１サーバの停止系の第１のサーバＯＳ１３２−１は、不具合の最新版から１世代前のＯＳ（言い換えれば、不具合の無い安定した版）である。第２サーバの停止系の第１のサーバＯＳ１３２−２も同様である。 FIG. 5 is an explanatory diagram showing the return operation of the OS-related file of each server constituting the cluster system. In FIG. 5, steps S9, S11, and S14 on the left half indicate the state of the first server (server 10-1). In FIG. 5, steps S10, S12, and S13 in the right half indicate the state of the second server (server 10-2). The process from step S9 in FIG. 5 is a process that continues from step S8 described above. That is, in the cluster system immediately before S9, the first server is the SBY system and the second server is the ACT system. In the first server, the file updated (latest version) second server OS 133-1 is operating as a startup system. Similarly, in the second server, the file updated (latest version) second server OS 133-2 operates as a startup system. The first server OS 132-1 of the stop system of the first server is an OS one generation before the latest version of the problem (in other words, a stable version without a problem). The same applies to the first server OS 132-2 in the stop system of the second server.

第１サーバ（ＳＢＹ系）のＯＳ切り替え制御部１０２は、ＯＳを第２のサーバＯＳ１３３−１から、第１のサーバＯＳ１３２−１へ切り替える制御（リブート）を行う（Ｓ９）。このＯＳ切り替え処理により、第２サーバ（ＳＢＹ系）のＯＳは、安定的な一世代前のＯＳ（ダウングレードしたＯＳ）となる。 The OS switching control unit 102 of the first server (SBY system) performs control (reboot) to switch the OS from the second server OS 133-1 to the first server OS 132-1 (S9). By this OS switching process, the OS of the second server (SBY system) becomes a stable previous generation OS (downgraded OS).

一方、第１サーバ（ＳＢＹ系）がＯＳをダウングレードしている間、第２サーバ（ＡＣＴ系）は、運用サーバとして稼働している（Ｓ１０）。 On the other hand, while the first server (SBY system) is downgrading the OS, the second server (ACT system) is operating as an operation server (S10).

先述のステップＳ９の処理の後、第１サーバ（ＳＢＹ系）のサーバ切り替え制御部１０１は、第１サーバをＡＣＴ系として、第２サーバをＳＢＹ系とするサーバの切り替え制御を行う（Ｓ１１、Ｓ１２）。 After the processing in step S9 described above, the server switching control unit 101 of the first server (SBY system) performs switching control of the server in which the first server is the ACT system and the second server is the SBY system (S11, S12). ).

ＡＣＴ系からＳＢＹ系に切り替わった第２サーバは、先述のステップＳ９と同様の手順によりダウングレードを行う（Ｓ１３）。具体的には、第２サーバ（ＳＢＹ系）のＯＳ切り替え制御部１０２は、ＯＳを第２のサーバＯＳ１３３−２から、第１のサーバＯＳ１３２−２へ切り替える制御（リブート）を行う。 The second server that has been switched from the ACT system to the SBY system performs the downgrade by the same procedure as in step S9 described above (S13). Specifically, the OS switching control unit 102 of the second server (SBY system) performs control (reboot) to switch the OS from the second server OS 133-2 to the first server OS 132-2.

一方、第２サーバ（ＳＢＹ系）がＯＳをダウングレードしている間、第１サーバ（ＡＣＴ系）は、運用サーバとして稼働している（Ｓ１４）。 On the other hand, while the second server (SBY system) downgrades the OS, the first server (ACT system) operates as an operation server (S14).

以上により、クラスタシステム１の第１サーバ及び第２サーバは、１世代前の安定的なＯＳで起動している状態となる。 As described above, the first server and the second server of the cluster system 1 are in a state of being started up by a stable OS one generation before.

なお、復旧完了後、バグの存在する第２のサーバＯＳ１３３（ＯＳ１３３−１、ＯＳ１３３−２）について、第１サーバ及び第２サーバは、第１のサーバＯＳ１３２（ＯＳ１３２−１、ＯＳ１３２−２）内容に書き換える処理（例えば、ディスクコピー）を行っても良い。 In addition, about the 2nd server OS133 (OS133-1, OS133-2) where a bug exists after the restoration is completed, the first server and the second server are the contents of the first server OS132 (OS132-1, OS132-2). Rewriting (for example, disk copy) may be performed.

（Ａ−３）実施形態の効果
この実施形態によれば、以下のような効果を奏することができる。 (A-3) Effects of Embodiment According to this embodiment, the following effects can be achieved.

クラスタシステム１の各サーバ１０は、記憶部１３内の各パーティションに２個のＯＳ（第１のサーバＯＳ１３２、第２のサーバＯＳ１３３）を設けることにより、一方のＯＳをファイル更新により最新版とし、他方のＯＳを１世代前の状態で保持することが可能となった。 Each server 10 of the cluster system 1 is provided with two OSs (first server OS 132 and second server OS 133) in each partition in the storage unit 13, so that one OS is updated to the latest version by file update, It became possible to hold the other OS in a state one generation before.

これにより、例えば、最新版としたＯＳのファイルに不具合が生じた場合には、各サーバ１０は、ＯＳ切り替え制御部１０２により、１世代前のＯＳに切り替える（リブート）ことにより、容易に復旧できる。つまり、サーバ１０は、ＯＳのファイルをパッケージ単位で管理していた場合においても、不具合のあるパッケージ（ファイル）と他のパッケージとの依存関係を考慮する必要無く（また、再インストールする必要も無く）容易に復旧することができる。以下、図６を挙げて説明する。 Thereby, for example, when a problem occurs in the latest OS file, each server 10 can be easily recovered by switching (rebooting) the OS to the previous generation by the OS switching control unit 102. . That is, even when the OS 10 manages OS files in units of packages, the server 10 does not need to consider the dependency between a defective package (file) and another package (and does not need to be reinstalled). ) Can be recovered easily. Hereinafter, a description will be given with reference to FIG.

図６は、クラスタシステムの各サーバのＯＳの版（バージョン）が、ファイル更新（アップデート）、及びファイル戻し（ダウングレード）を実行する毎に変化する様子をイメージ化した説明図である。なお、図６では、サーバ１０の記憶部１３内の第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３のみ図示している。 FIG. 6 is an explanatory diagram showing an image of how the OS version of each server in the cluster system changes each time file update (update) and file return (downgrade) are executed. In FIG. 6, only the first server OS 132 and the second server OS 133 in the storage unit 13 of the server 10 are illustrated.

図６（Ａ）では、初期状態であるため、各サーバ１０の第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３のＯＳのバージョンは、「Ｖ１」で同一である。 In FIG. 6A, since it is an initial state, the OS versions of the first server OS 132 and the second server OS 133 of each server 10 are the same as “V1”.

図６（Ｂ）では、第２のサーバＯＳ１３３を更新したために、起動している第２のサーバＯＳ１３３のＯＳのバージョンは、「Ｖ２」となる。一方、停止した第１のサーバＯＳ１３２のＯＳのバージョンは、「Ｖ１」のままである。 In FIG. 6B, since the second server OS 133 is updated, the OS version of the activated second server OS 133 is “V2”. On the other hand, the OS version of the stopped first server OS 132 remains “V1”.

図６（Ｃ）では、第１のサーバＯＳ１３２を更新したために、起動している第１のサーバＯＳ１３２のＯＳのバージョンは、「Ｖ３」となる。一方、停止した第２のサーバＯＳ１３３のＯＳのバージョンは、「Ｖ２」のままである。 In FIG. 6C, since the first server OS 132 is updated, the OS version of the activated first server OS 132 is “V3”. On the other hand, the OS version of the stopped second server OS 133 remains “V2”.

ここで、第１のサーバＯＳ１３２のＯＳ（Ｖ３）に不具合の混入が認められると、起動しているＯＳを切り替えることになる。そうすると、図６（Ｄ）では、起動している第２のサーバＯＳ１３３のＯＳのバージョンは、「Ｖ２」である。一方、停止した第１のサーバＯＳ１３２のＯＳのバージョンは、不具合のある「Ｖ３」のままである。 Here, when a trouble is recognized in the OS (V3) of the first server OS 132, the operating OS is switched. Then, in FIG. 6D, the OS version of the activated second server OS 133 is “V2”. On the other hand, the OS version of the stopped first server OS 132 remains “V3” having a problem.

図６（Ｅ）では、不具合のあるＶ３を除去するために、停止している第１のサーバＯＳ１３２のＯＳのバージョンを、「Ｖ３→Ｖ２」とする（Ｖ２の第２のサーバＯＳ１３３をコピーする）。これにより、サーバ１０は、通常通りアップデータすることが可能となる。 In FIG. 6E, in order to remove the defective V3, the OS version of the stopped first server OS 132 is changed to “V3 → V2” (the second server OS 133 of V2 is copied). ). As a result, the server 10 can update data as usual.

図６（Ｆ）では、第１のサーバＯＳ１３２を更新したために、起動している第１のサーバＯＳ１３２のＯＳのバージョンは、「Ｖ４」となる。一方、停止した第２のサーバＯＳ１３３のＯＳのバージョンは、「Ｖ２」のままである。 In FIG. 6F, since the first server OS 132 is updated, the OS version of the activated first server OS 132 is “V4”. On the other hand, the OS version of the stopped second server OS 133 remains “V2”.

以上、本実施形態のクラスタシステム１は、更新したＯＳに不具合が生じたとしても、容易に不具合の存在しない状態に短時間に復旧することができ、サービスに影響を与えること無くシステムを運用することができる。 As described above, the cluster system 1 according to the present embodiment can easily recover to a state in which a defect does not exist even if a problem occurs in the updated OS, and operates the system without affecting the service. be able to.

（Ｂ）他の実施形態
本発明は、上記実施形態に限定されるものではなく、以下に例示するような変形実施形態も挙げることができる。 (B) Other Embodiments The present invention is not limited to the above-described embodiments, and may include modified embodiments as exemplified below.

（Ｂ−１）上記実施形態では、ファイル更新制御部１０３によりＯＳのファイル更新を行う際、各サーバ１０は、ＯＳ切り替え制御部１０２を用いて、現在停止系のＯＳ（第１のサーバＯＳ１３２又は第２のサーバＯＳ１３３）を起動系に切り替えた後、ファイル更新する処理を行っていた（図４のステップＳ３、Ｓ７）。変形例として、ファイル更新制御部１０３が、現在起動しているＯＳだけでは無く、現在停止しているＯＳのファイル更新も実行できる場合には、各サーバ１０は、停止系のＯＳ（第１のサーバＯＳ１３２又は第２のサーバＯＳ１３３）のファイル更新を実行した後、ＯＳ切り替え制御部１０２を用いて、現在停止系のＯＳ（第１のサーバＯＳ１３２又は第２のサーバＯＳ１３３）を起動系に切り替ても良い。 (B-1) In the above embodiment, when OS file update is performed by the file update control unit 103, each server 10 uses the OS switching control unit 102 to use the currently stopped OS (first server OS 132 or After switching the second server OS 133) to the active system, a file update process was performed (steps S3 and S7 in FIG. 4). As a modified example, when the file update control unit 103 can execute not only the currently activated OS but also the file update of the currently stopped OS, each server 10 is configured as a stopped OS (first After executing the file update of the server OS 132 or the second server OS 133), the OS switching control unit 102 is used to switch the currently stopped OS (the first server OS 132 or the second server OS 133) to the active system. Also good.

（Ｂ−２）上記実施形態では、各サーバ１０は、２個のＯＳ（第１のサーバＯＳ１３２及び第２のサーバＯＳ１３３）を用いてファイル更新を行う例を示した。変形例として、各サーバ１０は、３個以上のＯＳを用いてファイル更新を行っても良い。例えば、３個のＯＳを所持している場合には、ファイル更新を行うと、起動系のＯＳと、第１の停止系ＯＳ、及び第２の停止系ＯＳについて、それぞれ異なるバージョンのＯＳを所持することになる。つまり、起動系のＯＳに不具合が生じた場合には、最新の不具合の存在するバージョンから１世代前だけでなく、２世代前のＯＳに切り替えることができる。 (B-2) In the above-described embodiment, each server 10 performs the file update using two OSs (the first server OS 132 and the second server OS 133). As a modification, each server 10 may perform file update using three or more OSs. For example, if you have three OSs, when you update a file, you have different versions of the operating system, the first operating system, and the second operating system. Will do. That is, when a problem occurs in the booting OS, it is possible to switch from the version in which the latest defect exists to the OS two generations before as well as one generation before.

１…クラスタシステム、１０（１０−１、１０−２）…サーバ、１１（１１−１、１１−２）…ＣＰＵ、１２（１２−１、１２−２）…メモリ、１３（１３−１、１３−２）…記憶部、１４（１４−１、１４−２）…通信部、１００…制御部、１０１…サーバ切り替え制御部、１０２…ＯＳ切り替え制御部、１０３…ファイル更新制御部、１３１（１３１−１、１３１−２）…ブートローダ、１３２（１３２−１、１３２−２）…第１のサーバＯＳ、１３３（１３３−１、１３３−２）…第２のサーバＯＳ、１３４（１３４−１、１３４−２）…アプリケーション、１３５（１３５−１、１３５−２）…その他。
DESCRIPTION OF SYMBOLS 1 ... Cluster system, 10 (10-1, 10-2) ... Server, 11 (11-1, 11-2) ... CPU, 12 (12-1, 12-2) ... Memory, 13 (13-1, 13-2) ... storage unit, 14 (14-1, 14-2) ... communication unit, 100 ... control unit, 101 ... server switching control unit, 102 ... OS switching control unit, 103 ... file update control unit, 131 ( 131-1, 131-2) ... boot loader, 132 (132-1, 132-2) ... first server OS, 133 (133-1, 133-2) ... second server OS, 134 (134-1) , 134-2)... Application, 135 (135-1, 135-2).

Claims

In a cluster system having a first server and a second server, one of the first server and the second server operating as an active system and the other server standing by as a standby system ,
Each of the servers
A server switching control unit that performs control to switch between the active system and the standby system;
A storage unit for holding the first OS and the second OS in separate storage areas;
Either one of the first OS and the second OS is started as a booting OS for performing a basic function as a server, and the other OS is stopped as a stopping OS. An OS switching control unit for performing control,
A file update control unit that updates the file of only one of the first OS and the second OS to the latest state and holds the other OS in the state of the previous generation;
When updating the OS file of each server,
The first server or the second server of the current standby system
Using the OS switching control unit, the first OS or the second OS, which is a stop system OS one generation before, is switched to a boot system,
The file update control unit is used to update the file of the first OS or the second OS that has been switched to the boot system using an update file acquired in advance.
After updating the file of the first OS or the second OS, the server switching control unit is used to perform control to switch between the active system and the standby system of the first server and the second server,
The other first server or second server that has switched to the standby system updates the OS file in the same procedure as the first server or second server that was the standby system before switching. Feature cluster system.

When updating the OS file of each server,
The first server or the second server of the current standby system
Using the file update control unit, update the file of the first OS or the second OS, which is the OS of the previous stop system, with the update file acquired in advance,
2. The cluster system according to claim 1, wherein the OS switching control unit is used to perform control for switching the first OS or the second OS that has been updated to a boot system. 3.

The cluster system according to claim 1, wherein each of the servers includes a communication unit that acquires an update file of the first OS or the second OS via a network.

If a problem is found in the boot OS of each server,
The first server or the second server of the current standby system
Using the OS switching control unit, the first OS or the second OS, which is a stop system OS one generation before, is switched to a boot system,
Using the server switching control unit, control to switch between the active system and the standby system of the first server and the second server,
The other first server or the second server switched to the standby system,
The first OS or the second OS, which is a stop system OS one generation before, is switched to a boot system using the OS switching control unit. Cluster system.

1st server and 2nd server provided with the memory | storage part which hold | maintains 1st OS and 2nd OS in a separate storage area, and either one of said 1st server and 2nd server The computer installed in each server that constitutes the cluster system in which the server of the server operates as the active system and the other server stands by as the standby system,
A server switching control unit that performs control to switch between the active system and the standby system;
Either one of the first OS and the second OS is started as a booting OS for performing a basic function as a server, and the other OS is stopped as a stopping OS. An OS switching control unit for performing control,
Updating one of the first OS and the second OS to the latest state, and causing the other OS to function as a file update control unit that retains the previous generation state;
When updating the OS file of each server,
The first server or the second server of the current standby system
Using the OS switching control unit, the first OS or the second OS, which is a stop system OS one generation before, is switched to a boot system,
The file update control unit is used to update the file of the first OS or the second OS that has been switched to the boot system using an update file acquired in advance.
After updating the file of the first OS or the second OS, the server switching control unit is used to perform control to switch between the active system and the standby system of the first server and the second server,
The other first server or second server that has switched to the standby system updates the OS file in the same procedure as the first server or second server that was the standby system before switching. A server control program characterized.