JP2008276320A

JP2008276320A - Virtual system control method and computer system

Info

Publication number: JP2008276320A
Application number: JP2007116029A
Authority: JP
Inventors: Tadashige Kadoi; 忠茂門井
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2007-04-25
Filing date: 2007-04-25
Publication date: 2008-11-13

Abstract

<P>PROBLEM TO BE SOLVED: To determine an optimum destination in consideration of blade features and redundancy of power supply and fan when migrating a virtual system between a plurality of CPU blades. <P>SOLUTION: A computer system provided with a plurality of CPU blades and configured to migrate a virtual system running on one CPU blade to another CPU blade includes a storage space for storing a management table registering information about each CPU blade including information about a corresponding power supply and/or cooling fan, and a control means for, when a virtual system on any CPU blade must be migrated to another CPU blade, selecting a CPU blade satisfying predetermined conditions in comparison with the source CPU blade and setting it as the destination of the virtual system by reference to the management table. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は仮想システム管理方法およびシステムに関し、特に、ブレードシステムのように複数のコンピューティングノードが同一筐体に混在するシステムにおいて、複数のＣＰＵブレード間で仮想システムのマイグレーションを管理制御する方法およびシステムに関する。 The present invention relates to a virtual system management method and system, and more particularly to a method and system for managing and controlling migration of a virtual system between a plurality of CPU blades in a system in which a plurality of computing nodes are mixed in the same chassis, such as a blade system. About.

近年、同一筐体に複数のＣＰＵブレードが収納されたブレードシステムなどのコンピュータシステムにおいて、システム障害などによりあるＣＰＵブレードから別のＣＰＵブレードへ仮想システムをマイグレーションする技術がある（例えば、特許文献１）。このマイグレーションは一般に、ＶＭ（仮想マシン）による仮想コンピュータシステムにおいて、予め指定したサーバを予備として待機させておき、あるサーバにて過負荷となったり軽障害が頻発した場合に、仮想化ソフトにより仮想システムのライブマイグレーション機能を用いて予備サーバへ仮想システムをマイグレーションさせる。これにより、負荷の平準化や障害によるシステムダウンを回避し、スムーズなシステム運用が実現されるようにしている。
特開２００６−２４４４８１号公報 In recent years, in a computer system such as a blade system in which a plurality of CPU blades are housed in the same casing, there is a technique for migrating a virtual system from one CPU blade to another due to a system failure or the like (for example, Patent Document 1). . In general, in a virtual computer system based on a VM (virtual machine), this migration is performed by using a virtualization software that allows a designated server to stand by as a backup, and if a certain server is overloaded or a light failure occurs frequently. The virtual system is migrated to the spare server using the system live migration function. As a result, load leveling and system failure due to failure are avoided, and smooth system operation is realized.
JP 2006-244481 A

しかしながら、従来のシステムでは、仮想システムの移動先が予め一意に定められているため、予備サーバを用いた負荷分散／障害回避の場合に限らず、運用中のサーバ間で仮想システムの移動を行う際、その時々で適切なサーバを移動先に選択することができない不都合がある。特に、ＣＰＵブレードは用途や実施環境に応じて、性能だけでなく、高可用性を訴求するもの、高信頼性を訴求するもの、大容量メモリ搭載を訴求するものなど、システムの付加価値を提供するバリエーションが増えているが、予め用意された予備サーバが必ずしも移動元のＣＰＵブレードと同等のスペックとは限らないため、仮想システムを移動させた後に性能や安全性が低下してしまう可能性があった。このように、従来のシステムはより安全且つ堅牢なサーバをダイナミックに選択できず、各ブレードの特徴を活かしたシステムリソースの有効活用ができるものではなかった。 However, in the conventional system, since the movement destination of the virtual system is uniquely determined in advance, not only in the case of load distribution / failure avoidance using a spare server, the virtual system is moved between servers in operation. However, there is an inconvenience that an appropriate server cannot be selected as a destination at that time. In particular, CPU blades provide not only performance, but also high-availability, high-reliability, high-capacity memory, and other system-added values depending on applications and implementation environments. Although the number of variations has increased, the spare server prepared in advance does not necessarily have the same specifications as the source CPU blade, so there is a possibility that performance and safety will be reduced after moving the virtual system. It was. As described above, the conventional system cannot dynamically select a safer and more robust server, and cannot effectively utilize system resources utilizing the features of each blade.

本発明は上記のような問題に鑑み、複数のＣＰＵブレードを有するコンピュータシステムで、あるブレード上の仮想システムを別のブレードにマイグレーションするにあたり、ブレードの特徴やファンや電源などの動作環境、障害発生状況なども考慮して最適のブレードを選択するようにして、高可用且つダイナミックなリソース配分を実現する方法およびシステムを提供することを目的とする。 In view of the above problems, the present invention is a computer system having a plurality of CPU blades. When a virtual system on one blade is migrated to another blade, the characteristics of the blade, the operating environment such as the fan and power supply, and the occurrence of a failure It is an object of the present invention to provide a method and system for realizing highly available and dynamic resource allocation by selecting an optimum blade in consideration of the situation and the like.

上記目的を達成するために、本発明は、複数のＣＰＵブレードを具えて構成され１のＣＰＵブレード上で動作する仮想システムを別のＣＰＵブレードにマイグレーションするコンピュータシステムにおいて、
少なくとも各ＣＰＵブレードに関する情報が登録される管理テーブルを格納する記憶領域と、いずれかのＣＰＵブレード上の仮想システムを別のＣＰＵブレードにマイグレーションする必要が生じた場合に、前記管理テーブルを参照して移動元のＣＰＵブレードとの比較において所定の条件を満たす別のＣＰＵブレードを選択して前記仮想システムの移動先に設定する制御手段とを具えることを最も主要な特徴とする。 To achieve the above object, the present invention provides a computer system configured to migrate a virtual system configured with a plurality of CPU blades and operating on one CPU blade to another CPU blade.
Refer to the management table when there is a need to migrate a storage area that stores at least a management table in which information related to each CPU blade is registered, and a virtual system on any CPU blade to another CPU blade. The main feature is that it comprises control means for selecting another CPU blade that satisfies a predetermined condition in comparison with the migration source CPU blade and setting it as the migration destination of the virtual system.

このシステムにおいて、前記管理テーブルはさらに、各ＣＰＵブレードについて、対応する電源および/または冷却ファンの情報が登録されており、この情報が前記制御手段による設定時に考慮されることを特徴とする。 In this system, the management table further stores information on the corresponding power supply and / or cooling fan for each CPU blade, and this information is taken into account when setting by the control means.

また、前記制御手段は、予め所定の条件により、あるいは定期的に前記管理テーブルを参照し、各ＣＰＵブレードについて各登録項目に所定の重み付け演算を行って得たブレード毎のシステム運用環境指標を管理しており、移動元のＣＰＵブレードと同等以上の指標を有する別のＣＰＵブレードを移動先として選択することを特徴とする。 The control means manages the system operating environment index for each blade obtained by performing a predetermined weighting operation on each registered item for each CPU blade by referring to the management table in advance or periodically according to a predetermined condition. And another CPU blade having an index equal to or higher than that of the source CPU blade is selected as the destination.

また、前記管理テーブルがさらに、オペレータの入力により各ＣＰＵブレードをグループ分けして管理する欄を有するとともに、前記制御手段は、前記移動元のＣＰＵブレードと同じグループから移動先のＣＰＵブレードを選択することが好ましい。 In addition, the management table further includes a column for managing the CPU blades by grouping them according to an operator input, and the control unit selects a destination CPU blade from the same group as the source CPU blade. It is preferable.

本発明のコンピュータシステムによれば、利用可能なＣＰＵブレードに関する情報を管理テーブルで管理しており、いずれかのＣＰＵブレードで動作する仮想システムの移動が必要となった場合に、各ＣＰＵブレードの情報を比較してその時点で最も適切なＣＰＵブレードを移動先としてダイナミックに選定するため、予備のブレードを用意するコストや手間が省けるとともに、移動先ブレードの性能や信頼性が異なることによる不具合を防止することができる。 According to the computer system of the present invention, information on available CPU blades is managed by the management table, and information on each CPU blade is required when a virtual system that operates on any of the CPU blades needs to be moved. By comparing these, the most appropriate CPU blade at that time is dynamically selected as the destination, so the cost and labor of preparing a spare blade can be saved, and problems due to different performance and reliability of the destination blade can be prevented. can do.

また、ＣＰＵブレードの性能のみならず、電源やファンの冗長性、省電力性まで移行判定条件に組み入れているため、電源やファンを要素とする信頼性を維持した状態で好適に仮想システムを移動させることができる。 In addition, not only the performance of the CPU blade, but also redundancy and power saving of the power supply and fan are incorporated in the migration judgment conditions, so the virtual system can be moved appropriately while maintaining reliability with the power supply and fan as elements. Can be made.

また、予め各ＣＰＵブレードについてのシステム運用環境指標を算出しておき、障害発生時には同等以上の指標を有するＣＰＵブレードを移動先として決定する構成としたため、障害が発生してから各ＣＰＵブレードの数値を取得して移動先を選択するような構成に比して、障害発生から移行完了までに要する時間が飛躍的に短くなる。 In addition, since the system operating environment index for each CPU blade is calculated in advance and a CPU blade having an index equal to or higher than that is determined as a destination when a failure occurs, the numerical value of each CPU blade after the failure has occurred. The time required from the occurrence of a failure to the completion of the transition is drastically shortened as compared with the configuration in which the destination is selected by acquiring the destination.

さらに、オペレータの設定により各ＣＰＵブレードをグループ管理し、仮想システムの移動をグループ内で制限する構成により、よりユーザの意向に沿った仮想システム制御が可能となり、ユーザの意図する運用目的に応じた適材適所のリソース活用が可能となる。 In addition, each CPU blade is managed in groups according to operator settings, and the virtual system movement is restricted within the group, enabling virtual system control in line with the user's intentions, according to the user's intended operational purpose. It is possible to utilize resources in the right place for the right person

本発明を実施するための最良の実施形態について、図面を参照しながら以下に詳細に説明する。 The best mode for carrying out the present invention will be described in detail below with reference to the drawings.

図１は、本発明の一実施例を示す概略図である。本図に示すように、本発明の実施例は複数（２つ）のブレードシステム１００、２００にそれぞれ複数のＣＰＵブレード、筐体マネジメントプロセッサ、複数の電源ユニット（ＰＳＵ）および冷却ファンが搭載されている。各ＣＰＵブレードには仮想化ソフト（ＶＭ）が展開され、複数のシステムがＶＭ上で動作する構成をとっている。ここで、例えば冷却ファンはブレード毎に設けられている必要はなく、１つのファンで複数のブレードをカバーする構成であってもよい。 FIG. 1 is a schematic view showing an embodiment of the present invention. As shown in the figure, in the embodiment of the present invention, a plurality (two) of blade systems 100, 200 are each mounted with a plurality of CPU blades, a chassis management processor, a plurality of power supply units (PSU), and a cooling fan. Yes. Each CPU blade is deployed with virtualization software (VM), and a plurality of systems operate on the VM. Here, for example, the cooling fan does not need to be provided for each blade, and may be configured to cover a plurality of blades with one fan.

各ＣＰＵブレードはＢＭＣ（ＢａｓｅｂｏａｒｄＭａｎａｇｅｍｅｎｔＣｏｎｔｒｏｌｌｅｒ）とＦＲＵ情報（ＦｉｅｌｄＲｅｐｌａｃｅｍｅｎｔＵｎｉｔＩｎｆｏｒｍａｔｉｏｎ）を格納する不揮発性メモリを有し（図示せず）、このメモリには予め製造工程において当該ＣＰＵブレードの種類、性能（例えば、ＣＰＵのコア数、周波数、キャッシュサイズなど）、ベンダーにより設定されたハードウェア（ＨＷ）耐障害性が登録されている。ここでＨＷ耐障害性は、ＨＷの信頼性予測・実績やサポートされている高可用性機能のレベルなどに応じてブレード間で比較表現できる指標であり、ブレードの出荷時に予めベンダーにより設定されるものとする。メモリの登録内容は、各ブレードの電源オン時に読み出されＢＭＣから筐体マネジメントプロセッサ１０に伝送される。 Each CPU blade has a non-volatile memory (not shown) for storing BMC (Baseboard Management Controller) and FRU information (Field Replacement Unit Information). In this memory, the CPU blade type and performance ( For example, the number of CPU cores, frequency, cache size, etc.) and hardware (HW) fault tolerance set by the vendor are registered. Here, HW fault tolerance is an index that can be compared and expressed between blades according to HW reliability prediction / results and the level of supported high availability functions, etc., and is preset by the vendor at the time of blade shipment And The registered contents of the memory are read when each blade is powered on and transmitted from the BMC to the chassis management processor 10.

筐体マネジメントプロセッサ１０は、ＢＭＣからＣＰＵブレードの相対的に比較可能な上記の情報を受け取る。また、各ＣＰＵブレードの管理下のＰＳＵおよび/または冷却ファン（ＦＡＮ）のＦＲＵ情報、各ＰＳＵ／ＦＡＮの冗長性や省電力機能の有無、ＶＭの種類の情報などを取得して、これらの情報を記憶領域（図示せず）の管理テーブルに登録する。この管理テーブルの例を図２に示す。 The case management processor 10 receives the above-described relatively comparable information of the CPU blade from the BMC. Also, FRU information of PSU and / or cooling fan (FAN) under management of each CPU blade, redundancy of each PSU / FAN, presence / absence of power saving function, VM type information, etc. are acquired, and these information are acquired. Are registered in the management table of the storage area (not shown). An example of this management table is shown in FIG.

図２に示すように、管理テーブルには各筐体のＣＰＵブレードごとに、可用性情報として、ベンダーが設定したＨＷ耐障害性、ＶＭの種類、冷却ファンの冗長性、電源の冗長性の指標値が登録される。また性能情報として、ＣＰＵ性能、メモリ容量、ＩＯ、ＶＭ種類の指標値が登録される。さらに、ＣＰＵ・ファン・電源の省電力性の指標値、グループ番号、最後にグループ内システム運用指標値が登録される。 As shown in FIG. 2, in the management table, for each CPU blade of each chassis, as the availability information, HW fault tolerance, VM type, cooling fan redundancy, power supply redundancy index values set by the vendor Is registered. Also, as performance information, CPU performance, memory capacity, IO, and VM type index values are registered. Further, an index value for power saving of the CPU, fan, and power source, a group number, and finally an in-group system operation index value are registered.

本実施例において、ＨＷ耐障害性の指標は「１」または「２」といった値をとり、図４のテーブルでは、例えば筐体１００ではＣＰＵブレード１２，１ｎがＦＴブレードのような可用性の高い高信頼ブレードであることを示すレベル「２」の指標が登録されている。ＨＷ障害履歴は、例えば初期値にレベル５が設定され、所定期間で所定回数の障害が発生した場合に１レベル下がるよう設定される。この場合に障害発生の事象および回数の情報は各ブレードで管理され、上記のような条件を満たす場合にレベル変更通知がマネジメントプロセッサ１０に通知されるようにすることができる。また、電源／ファンの冗長性としては、ＰＳＵ／ＦＡＮのＦＲＵ情報を読み出し、例えばＮ＋１冗長である場合に「１」、２Ｎの冗長性を有する場合に「２」、ＵＰＳ（無停電電源装置）を具える場合に「３」の指標を登録する。同様に性能情報は、例えばＣＰＵのコア数、周波数、キャッシュサイズなどの数値範囲の組合せを性能指標に対応させた変換テーブルを別途有し（図示せず）、実際の数値をこのテーブルにマッピングして性能指標を導出する構成とすることができる。また、ＣＰＵ・ファン・電源等が省電力電源またはファンであるか否かをＦＲＵ情報から判定し、例えば省電力でない場合に「０」が登録されるようにする。 In this embodiment, the HW fault tolerance index takes a value such as “1” or “2”. In the table of FIG. 4, for example, in the case 100, the CPU blades 12 and 1n have high availability such as FT blades. An index of level “2” indicating that the blade is a trusted blade is registered. In the HW failure history, for example, level 5 is set as an initial value, and is set to be lowered by 1 level when a predetermined number of failures occur in a predetermined period. In this case, information on the event of occurrence of failure and information on the number of times can be managed by each blade, and a level change notification can be notified to the management processor 10 when the above conditions are satisfied. Also, as power source / fan redundancy, PRU / FAN FRU information is read, for example, “1” when N + 1 redundancy is present, “2” when redundancy is 2N, UPS (uninterruptible power supply) The index of “3” is registered. Similarly, the performance information has a separate conversion table (not shown) in which a combination of numerical ranges such as the number of CPU cores, frequency, and cache size is associated with the performance index (not shown), and actual values are mapped to this table. Thus, the performance index can be derived. Further, it is determined from the FRU information whether the CPU, fan, power source, etc. are power saving power sources or fans. For example, when power saving is not performed, “0” is registered.

筐体マネジメントプロセッサ１０はまた、図４に示す管理テーブルに各ＣＰＵブレードのグループ番号を登録することにより、これらのＣＰＵブレードをグループ管理している。このグループ番号は、筐体マネジメントプロセッサ１０のコンソール（図示せず）を介してオペレータが直接設定できるように構成することができる。このグループ分けは、仮想システムをマイグレーションする際の移動先を特定のグループに制限するのに利用される。 The chassis management processor 10 also manages these CPU blades in groups by registering the group numbers of the CPU blades in the management table shown in FIG. This group number can be configured so that an operator can directly set it via a console (not shown) of the housing management processor 10. This grouping is used to limit the migration destination when migrating a virtual system to a specific group.

筐体マネジメントプロセッサ１０はさらに、電源オン時に上記の指標データが揃ったら、あるいは定期的に、各ＣＰＵブレードの各指標データからシステム運用環境指標を算出し管理テーブルに登録するようにしてもよい。このシステム運用環境指標は、上述した管理テーブルの各項目を必要に応じてさらに個別に重み付け演算して導出することができる。本実施例の場合、例えば得られるシステム運用環境指標がレベル１−５となるよう各項目の重み付け値（係数）が予め定められており、これに従って重み付け演算を行っている。各項目の係数は実施環境や各種条件を勘案して、当業者であれば容易に定めることができる。 The case management processor 10 may further calculate a system operating environment index from each index data of each CPU blade and register it in the management table when the above index data is prepared when the power is turned on or periodically. The system operating environment index can be derived by further individually calculating each item of the management table as necessary. In the case of the present embodiment, for example, the weighting values (coefficients) of the respective items are determined in advance so that the obtained system operation environment index becomes level 1-5, and the weighting calculation is performed according to this. The coefficient of each item can be easily determined by those skilled in the art in consideration of the implementation environment and various conditions.

筐体マネジメントプロセッサ１０はさらに、異なるブレード筐体２００の筐体マネジメントプロセッサ２０と情報交換し、利用可能なＣＰＵブレードの管理テーブルを完成し、管理する。また、システム運用中に新規なＣＰＵブレードが追加された場合にも、その挿入を筐体マネジメントプロセッサ１０が検出し、ＣＰＵブレードの情報をそのＢＭＣから即座に入手して管理テーブルに登録するとともに、システム運用環境指標を再算出して管理する。 The case management processor 10 further exchanges information with the case management processor 20 of a different blade case 200 to complete and manage a management table of available CPU blades. In addition, when a new CPU blade is added during system operation, the chassis management processor 10 detects the insertion of the CPU blade, immediately obtains the CPU blade information from the BMC, and registers it in the management table. Recalculate and manage system operating environment indicators.

そして、筐体マネジメントプロセッサ１０は、例えば軽障害が頻発した場合など所定の条件によりいずれかのＣＰＵブレードの仮想システムを別のブレードにマイグレーションする必要が生じた場合は、管理テーブルのシステム運用環境指標を参照し、同一グループ内で移動元のＣＰＵブレードと同等またはそれ以上のシステム運用環境指標を有するブレードから起動先を選択し、当該ＣＰＵブレードのＢＭＣに通知する。この場合、筐体マネジメントプロセッサ１０は継続的に各ＣＰＵブレードの負荷を監視しており、さらに負荷の少ないＣＰＵブレードを選択するようにしてもよい。また、システム運用環境指標が移動元と同等以上のレベルの別のＣＰＵブレードが存在しない場合、他のＣＰＵブレードのうち指標が高いものから選択するように構成してもよく、また指標が所定値以上異なる場合はマイグレーションを実行せずに継続運用するように構成してもよい。 The case management processor 10 determines the system operation environment index of the management table when it is necessary to migrate the virtual system of any CPU blade to another blade under a predetermined condition such as when frequent frequent failures occur. , An activation destination is selected from blades having a system operating environment index equal to or higher than that of the movement source CPU blade in the same group, and notified to the BMC of the CPU blade. In this case, the chassis management processor 10 continuously monitors the load of each CPU blade, and may select a CPU blade having a smaller load. In addition, when there is no other CPU blade whose system operating environment index is equal to or higher than that of the movement source, it may be configured to select another CPU blade having a higher index, and the index is a predetermined value. If different from the above, the system may be configured to continue operation without performing migration.

各ＣＰＵブレードのＶＭは、筐体マネジメントプロセッサ１０から移動先情報を入手すると、ＶＭ間制御による仮想システムの移動を実行する。これにより、例えばシステム運用指標が同等以上のＣＰＵブレードに移動した場合は、仮想システムが動作するシステム環境レベルを落とさずに移動前と同等もしくは改善する移動が可能となり、ユーザの運用目的に応じた最適なシステムリソース配分と堅牢なシステム運用が実現する。 When the VM of each CPU blade obtains the movement destination information from the chassis management processor 10, it executes the movement of the virtual system by inter-VM control. As a result, for example, if the system operation index moves to a CPU blade with the same or higher, it is possible to move to the same or improved level before the movement without lowering the system environment level in which the virtual system operates. Realizes optimal system resource allocation and robust system operation.

このシステムのマイグレーション動作例について以下に説明する。筐体マネジメントプロセッサ１０は、ＡＣオン時に筐体内のＣＰＵブレード１１−１ｎのＢＭＣより、各ＣＰＵブレードのＣＰＵ／メモリの詳細情報、ＨＷ障害履歴情報、ＦＲＵ内のＨＷ耐障害性指標情報を採取し、図２の管理テーブルに各個別指標に反映する。また、直接配下のＰＳＵ１１−１３、ＦＡＮ１１−１３のＦＲＵ情報を読み出し、電源／ファンの冗長性や省電力指標を管理テーブルに登録する。さらに、ＣＰＵブレード１１−１ｎのそれぞれがＤＣオンされたときに、ＶＭの種類を示す情報をＢＭＣが入手しておき、筐体マネジメントプロセッサ１０がその情報をＢＭＣから入手し管理テーブルに反映する。そして、これらの情報からシステム運用環境指標を算出し、管理テーブルに登録する。また、システムのオペレータは必要に応じて各ＣＰＵブレードのグループ分けをコンソール（図示せず）から入力し、これが管理テーブルに登録されている。 An example of the migration operation of this system will be described below. The chassis management processor 10 collects detailed CPU / memory information, HW failure history information, and HW fault tolerance index information in the FRU from the BMC of the CPU blade 11-1n in the chassis when AC is on. This is reflected in each individual index in the management table of FIG. Further, the FRU information of the directly subordinate PSUs 11-13 and FAN11-13 is read, and the power supply / fan redundancy and the power saving index are registered in the management table. Further, when each of the CPU blades 11-1n is DC-on, the BMC obtains information indicating the type of the VM, and the housing management processor 10 obtains the information from the BMC and reflects it in the management table. Then, a system operation environment index is calculated from these pieces of information and registered in the management table. Further, the system operator inputs the grouping of each CPU blade from a console (not shown) as necessary, and this is registered in the management table.

なお、ここでＶＭの種類とは、例えばある時点での複数のＶＭの測定結果やベンダにおける位置づけなどの情報に基づいて任意に定められる格付け情報であり、例えば「高−中−低」や「１〜５」といった指数で表すことができる。すなわち、ＶＭは数社から多種の製品がリリースされており、各製品はそれぞれ可用性、拡張性、性能等に差があり、また同製品でもエディションやヴァージョン間でこれらの指標に違いがある。これらの違いを考慮してベンダ側がＶＭの種類を任意に決定し、製造時にこれを登録するものとする。 Here, the type of VM is rating information arbitrarily determined based on information such as measurement results of a plurality of VMs at a certain point in time and positioning in a vendor, for example, “high-medium-low”, “ It can be expressed by an index such as “1-5”. That is, various products of VM are released from several companies, and each product has a difference in availability, scalability, performance, etc., and even in the same product, there are differences in these indexes between editions and versions. In consideration of these differences, the vendor side arbitrarily determines the type of VM and registers it at the time of manufacture.

一方、筐体マネジメントプロセッサ２０においても、ＡＣオン時に筐体内のＣＰＵブレード２１−２ｍのＢＭＣから同種の情報を入手し、同様の管理テーブルを作成してシステム運用環境指数を算出する。さらに、筐体間で通信を行い、これらの管理テーブルの情報を交換して、図２に示すような総ての情報を管理する。 On the other hand, the case management processor 20 also obtains the same type of information from the BMC of the CPU blade 21-2m in the case when AC is on, creates a similar management table, and calculates the system operating environment index. Further, communication is performed between the cases, information in these management tables is exchanged, and all information as shown in FIG. 2 is managed.

このような状況において、図１に示すＣＰＵブレード１２にて例えば１ビットエラーなどの訂正可能エラーのようなＨＷ軽障害が頻発したり、ブレードが過負荷状態になったことが検出された場合、ＣＰＵブレード１２はＢＭＣに障害履歴情報を登録する。筐体マネジメントプロセッサ１０はＢＭＣからの報告に基づき、所定のアルゴリズムに従って管理テーブルにおけるＣＰＵブレード１２の障害履歴指数を変更する。これは、例えば所定期間内に所定回数の軽障害が報告された場合や、所定期間以上過負荷状態が継続した場合に指数を１ランク下げるなど設定することができる。 In such a situation, when it is detected that a HW minor failure such as a correctable error such as a 1-bit error frequently occurs or the blade is overloaded in the CPU blade 12 shown in FIG. The CPU blade 12 registers failure history information in the BMC. The case management processor 10 changes the failure history index of the CPU blade 12 in the management table according to a predetermined algorithm based on the report from the BMC. This can be set, for example, by lowering the index by one rank when a predetermined number of minor faults are reported within a predetermined period or when an overload state continues for a predetermined period or longer.

このようにして、例えばＣＰＵブレード１２の障害履歴がレベル５からレベル３まで下がるなど、所定の条件が整った場合に、筐体マネジメントプロセッサ１０はこのＣＰＵブレード１２の仮想システムをマイグレーションさせる必要ありと判定する。本実施例ではこの条件を例えば障害履歴指数が２ランク下がった場合としているが、これは本例に限らず他の基準を適用してもよい。 In this way, when a predetermined condition is met, for example, when the failure history of the CPU blade 12 decreases from level 5 to level 3, the case management processor 10 needs to migrate the virtual system of the CPU blade 12. judge. In this embodiment, this condition is set, for example, when the failure history index is lowered by two ranks, but this is not limited to this example, and other criteria may be applied.

このとき筐体マネジメントプロセッサ１０は、図２の管理テーブルを参照し、ＣＰＵブレード１２と同一グループ＃１に設定されているＣＰＵブレード（筐体１００のブレード１ｎ、筐体２００のブレード２２、２ｍ）の中から、システム運用環境指標が同じ高レベル５で、且つ負荷の少ないＣＰＵブレードを移動先ＣＰＵブレードとして選択する。この際に、筐体マネジメントプロセッサ１０は筐体２００の筐体マネジメントプロセッサ２０と通信を行い、各ブレードの負荷状況などの情報を取得して選定する。例えば図３には、移動元のＣＰＵブレード１２に仮想システムが２つある場合に、システム運用環境指標が同じレベル５でグループ＃１に属するＣＰＵブレードのうち、負荷の少ない筐体２００のＣＰＵブレード２２、２ｍを移動先として選択する実施例が示されている。この情報が筐体マネジメントプロセッサ１０、２０間で通信され、ＣＰＵブレード１２のＢＭＣにその移動先の上方が通知される。これを受けたＣＰＵブレード１２のＶＭは、測材に当該ＢＭＣから移動先情報を入手し、図３に示すようなＶＭ−ＶＭ間制御により２つの仮想システムの移動を実行する。 At this time, the chassis management processor 10 refers to the management table of FIG. 2 and the CPU blades are set to the same group # 1 as the CPU blade 12 (blade 1n of the chassis 100, blades 22 and 2m of the chassis 200). Among them, a CPU blade having the same system operating environment index as high level 5 and having a low load is selected as a destination CPU blade. At this time, the chassis management processor 10 communicates with the chassis management processor 20 of the chassis 200 to acquire and select information such as the load status of each blade. For example, in FIG. 3, when there are two virtual systems in the migration source CPU blade 12, among the CPU blades belonging to group # 1 with the same system operating environment index of level 5, the CPU blade of the chassis 200 with a low load An embodiment is shown in which 22, 2m is selected as the movement destination. This information is communicated between the case management processors 10 and 20, and the BMC of the CPU blade 12 is notified of the upper side of the movement destination. Receiving this, the VM of the CPU blade 12 obtains the movement destination information from the BMC for the material measurement, and executes the movement of the two virtual systems by the VM-VM control as shown in FIG.

以上のステップにより、本発明は、各ＣＰＵブレードの性能や電源／ファンの特徴までをも考慮して、その時点で最適な仮想システムの移動先ブレードをダイナミックに選択できるようにしたため、専用のハードウェアを追加することなく、従来のシステム構成にて最適なシステムリソース配分と堅牢なシステム運用が可能となる。また、仮想システムが動作するシステム環境レベルを落とさずに移動先をダイナミックに定義可能としており、障害時や過負荷時に備えて予備の待機サーバを用意する必要がなくなるため、利用者のＴＣＯ（ＴｏｔａｌＣｏｓｔｏｆＯｗｎｅｒｓｈｉｐ）を削減することができる。また、高性能、高可用性、高信頼性などＨＷの特徴をふまえた条件で仮想システムの展開を可能としており、さらにユーザが定義できるグループ内での移動に制限できるため、ユーザの意図する運用目的に応じた適材適所のリソース活用が可能となる。 Through the above steps, the present invention can dynamically select the optimal destination blade of the virtual system at that point in consideration of the performance of each CPU blade and the characteristics of the power supply / fan. It is possible to allocate the optimal system resources and operate the system robustly with the conventional system configuration without adding hardware. In addition, the destination can be dynamically defined without lowering the system environment level in which the virtual system operates, and there is no need to prepare a spare standby server in the event of a failure or overload, so the user's TCO (Total Cost of Ownership) can be reduced. In addition, the virtual system can be deployed on the basis of HW characteristics such as high performance, high availability, and high reliability, and can be restricted to movement within a group that can be defined by the user. It is possible to utilize resources in the right place for the right material.

次に、本発明の他の実施例について説明する。上記実施例では、各ＣＰＵブレードをグループ分けして、仮想システムの移動先を同一グループ内に制限するようにしたが、このグループ分けは必須の構成ではなく、例えば図１または図４に示すように、特にグループを考慮することなく、システム運用環境指標が同等以上であるＣＰＵブレードから負荷が少ないものを選択するようにしてもよい。また、上記実施例では筐体マネジメントプロセッサが移動先を選定しているが、例えば各ＣＰＵブレードのシステム運用環境指標がＶＭの立ち上げ時や稼働中にＢＭＣからＶＭに通知され、各ＣＰＵブレード上のＶＭがこれらの情報から仮想システム移動時の移動先選定を行うよう構成されていてもよい。 Next, another embodiment of the present invention will be described. In the above embodiment, the CPU blades are grouped to limit the migration destination of the virtual system within the same group. However, this grouping is not an essential configuration, for example, as shown in FIG. 1 or FIG. In addition, a CPU blade with a low system load may be selected from CPU blades having a system operation environment index equal to or higher than that without considering a group. In the above embodiment, the chassis management processor selects the destination. For example, the system operating environment index of each CPU blade is notified from the BMC to the VM when the VM is started up or in operation, and the CPU blade The VM may be configured to select a destination when moving the virtual system from these pieces of information.

さらに別の実施例を、図５を参照しながら説明する。図５の実施例では、筐体１００、２００を接続するネットワーク上に管理サーバ３００が設けられており、この管理サーバ３００が図２に示す管理テーブルを管理し、仮想システムの移動先の制御を実行する。本実施例では、各筐体マネジメントプロセッサは、管理サーバ３００で動作する管理ソフトの指示に従い、ＣＰＵブレードに関する情報を各ＢＭＣに取り合わせて入手し、電源／ファンに関する情報は直接入手し、管理サーバ３００に通知する。 Yet another embodiment will be described with reference to FIG. In the embodiment of FIG. 5, a management server 300 is provided on the network connecting the casings 100 and 200. The management server 300 manages the management table shown in FIG. 2 and controls the migration destination of the virtual system. Execute. In this embodiment, each chassis management processor obtains information on CPU blades in combination with each BMC in accordance with instructions of management software running on the management server 300, and directly obtains information on power supplies / fans. Notify

各筐体の筐体マネジメントプロセッサ１０、２０は、ＡＣオン時に各筐体内のＣＰＵブレード１１−１ｎ、２１−２ｍのＢＭＣより、各ＣＰＵブレードのＣＰＵ／メモリの詳細情報、ＨＷ障害履歴情報、ＦＲＵ内のＨＷ耐障害性指標情報を採取し、管理サーバ３００に転送する。また、電源／ファンのＦＲＵ情報から冗長性や省電力性の情報を取得して管理サーバ３００に転送する。管理サーバ３００で動作する管理ソフト（図示せず）は、図２に示す管理テーブルにこれらの情報を登録し、上記説明と同様にして氏ｓつえむうんよう環境指標を算出する。いずれかの筐体で新規にＣＰＵブレードが追加された場合にも、筐体マネジメントプロセッサがＣＰＵブレード挿入を検出し、管理サーバ３００へ情報を送信する。管理ソフトは情報を受けると即座にシステム運用環境指標の再算出を行いその情報を管理する。 The chassis management processors 10 and 20 of each chassis receive detailed information on CPU / memory of each CPU blade, HW failure history information, FRU from the BMC of the CPU blades 11-1n and 21-2m in each chassis when AC is on. The HW fault tolerance index information is collected and transferred to the management server 300. Also, redundancy and power saving information is acquired from the power supply / fan FRU information and transferred to the management server 300. Management software (not shown) running on the management server 300 registers these pieces of information in the management table shown in FIG. 2, and calculates the environmental index in the same manner as described above. Even when a CPU blade is newly added in any case, the case management processor detects the insertion of the CPU blade and transmits information to the management server 300. When the management software receives the information, it immediately recalculates the system operating environment index and manages the information.

例えばＣＰＵブレード１２に軽障害が頻発したり過負荷となった場合には、これがＣＰＵブレード１２のＢＭＣから筐体マネジメントプロセッサ１０を介して管理サーバ３００に通知され、管理テーブルが更新される。管理ソフトは、同一グループ＃１に背手値されているＣＰＵブレードのうち、システム運用環境指標が同じレベル５で、且つ負荷の少ないＣＰＵブレード２２、２ｍを仮想システムの移動先として選択肢、ＣＰＵブレード１２のＶＭに対し、２つの仮想システムの移動を筐体マネジメントプロセッサ１０を介さずに直接指示する。このようにして、複数の筐体にまたがる仮想システムの移動を円滑に行うことができる。 For example, when a minor failure frequently occurs or an overload occurs in the CPU blade 12, this is notified from the BMC of the CPU blade 12 to the management server 300 via the chassis management processor 10, and the management table is updated. The management software selects the CPU blades 22 and 2m having the same system operating environment index level 5 and a low load among the CPU blades valued in the same group # 1 as the migration destination of the virtual system. The 12 VMs are directly instructed to move the two virtual systems without going through the chassis management processor 10. In this way, the virtual system can be moved smoothly across a plurality of housings.

本発明に係る仮想システム制御方法およびコンピュータシステムは、ネットワーク産業を含む情報処理産業で好適に利用することができる。 The virtual system control method and computer system according to the present invention can be suitably used in the information processing industry including the network industry.

本発明にかかるコンピュータシステムの実施例の構成を示す図である。It is a figure which shows the structure of the Example of the computer system concerning this invention. 管理テーブルの一例を示す図である。It is a figure which shows an example of a management table. グループ化により移動先を限定した仮想システムの移動例を示す図である。It is a figure which shows the example of a movement of the virtual system which limited the movement destination by grouping. システム運用環境指標に基づいた仮想システムの移動例を示す図である。It is a figure which shows the example of a movement of the virtual system based on a system operation environment parameter | index. 本発明にかかるコンピュータシステムの別の実施例の構成を示す図である。It is a figure which shows the structure of another Example of the computer system concerning this invention.

Explanation of symbols

１００、２００ブレード筐体
１０、２０筐体マネジメントプロセッサ
１１−１ｎ、２１−２ｍＣＰＵブレード
ＰＳＵ電源ユニット
ＦＡＮ冷却ファン
３００管理サーバ
100, 200 Blade enclosure 10, 20 Enclosure management processor 11-1n, 21-2m CPU blade PSU Power supply unit FAN Cooling fan 300 Management server

Claims

In a computer system configured to migrate a virtual system configured with a plurality of CPU blades and operating on one CPU blade to another CPU blade,
Refer to the management table when there is a need to migrate a storage area that stores at least a management table in which information related to each CPU blade is registered, and a virtual system on any CPU blade to another CPU blade. A computer system comprising: a control unit that selects another CPU blade that satisfies a predetermined condition in comparison with a migration source CPU blade and sets it as a migration destination of the virtual system.

2. The system according to claim 1, wherein the management table further stores information on a corresponding power supply and / or cooling fan for each CPU blade, and this information is taken into account when setting by the control means. A featured computer system.

3. The system according to claim 1, wherein the control unit is obtained by performing a predetermined weighting operation on each registered item for each CPU blade by referring to the management table in advance according to a predetermined condition or periodically. A computer system which manages a system operation environment index for each blade, and selects another CPU blade having an index equivalent to or higher than that of a source CPU blade as a destination.

4. The system according to claim 1, wherein the management table further includes a column for managing the CPU blades by grouping according to an operator input, and the control unit includes the movement source. A computer system, wherein a destination CPU blade is selected from the same group as the CPU blade.

In a virtual system control method for migrating a virtual system operating on one CPU blade to another CPU blade in a computer system comprising a plurality of CPU blades, at least CPU and memory information and fault history for each CPU blade When the information, the managed power source and / or the cooling fan information are registered in the management table, and the virtual system on any CPU blade needs to be migrated to another CPU blade, the control means can control the management table. And selecting another CPU blade that satisfies a predetermined condition related to the source CPU blade and setting it as the destination of the virtual system.

6. The method according to claim 5, wherein the management table further stores information on a corresponding power supply and / or cooling fan for each CPU blade, and this information is stored in the CPU blade of the destination by the control means. A virtual system control method which is considered at the time of selection.

7. The method according to claim 5, wherein the control means obtains a predetermined weighting operation for each registered item for each CPU blade by referring to the management table in advance or under a predetermined condition in advance. A virtual system control method that manages a system operating environment index for each blade, and selects another CPU blade having an index equal to or higher than that of a source CPU blade as a destination.

The method according to any one of claims 5 to 7, wherein the management table further includes a column for grouping and managing the CPU blades according to an input from an operator, and the control means includes the source of the movement. A virtual system control method comprising selecting a destination CPU blade from the same group as a CPU blade.