JP2006252336A

JP2006252336A - Inter-device data transfer apparatus, inter-device data transfer method and program

Info

Publication number: JP2006252336A
Application number: JP2005069941A
Authority: JP
Inventors: Hidehiro Shimizu; 英弘清水; Mitsunori Kori; 光則郡
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2005-03-11
Filing date: 2005-03-11
Publication date: 2006-09-21

Abstract

<P>PROBLEM TO BE SOLVED: To perform a degenerate operation without stopping a system even when there is a fault in a storage device or a data processing device. <P>SOLUTION: An inter-device data transfer apparatus is provided with a table storing data transfer propriety between a plurality of magnetic disk devices and a plurality of I/O processors and fault flags indicating the presence of the fault of the respective devices and a bus connecting device. In the fault of the I/O processor, the magnetic disk device or the bus connecting device, by determining the ON/OFF of the fault flags for the respective devices and determining the transfer possibility of the magnetic disk device and the I/O processor constituting a logical volume, access to a fault device is limited without updating the entire management table. Thus, the system can be shifted to the degenerate operation without stopping the system. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

この発明は、バスに接続された複数のデバイス間のデータ転送におけるボリューム管理モジュールに関するものである。 The present invention relates to a volume management module in data transfer between a plurality of devices connected to a bus.

図１５はデバイス間データ転送装置のシステム構成の一例である。 FIG. 15 shows an example of the system configuration of the inter-device data transfer apparatus.

１０は中央処理装置（ＣＰＵ）でシステム全体の各装置の制御を担当している。９は主記憶装置であり、この主記憶装置９に格納されているオペレーティングシステム９０によりシステム全体のソフトウェアが制御されている。１は中央処理装置１０と主記憶装置９と各デバイスとを接続しているバス（プライマリバス）であり、複数のバスを接続するためのバス結合装置３ａ、３ｂによってバス（セカンダリバス）２ａ、２ｂに接続されている。バス結合装置３ａ、３ｂは、例えば、ブリッジである。磁気ディスク６０ａ、６１ａは、ディスクコントローラ（アダプタ）４ａによってバス２ａに接続され、１つの記憶デバイス７ａを構成している。同様に、磁気ディスク６０ｂ、６１ｂは、ディスクコントローラ４ｂによってバス２ｂに接続され、１つの記憶デバイス７ｂを構成している。磁気ディスク６０ａ、６１ａ、６０ｂ、６１ｂは、以下、物理ディスクともいう。デバイス５０ａはバス２ａに接続され、デバイス５０ｂは２ｂに接続されており、例えば内部に記憶装置を含むＩ／Ｏプロセッサである。 A central processing unit (CPU) 10 is in charge of controlling each device of the entire system. Reference numeral 9 denotes a main storage device, and software of the entire system is controlled by an operating system 90 stored in the main storage device 9. Reference numeral 1 denotes a bus (primary bus) that connects the central processing unit 10, the main storage device 9, and each device, and a bus (secondary bus) 2a by a bus coupling device 3a, 3b for connecting a plurality of buses. 2b. The bus coupling devices 3a and 3b are, for example, bridges. The magnetic disks 60a and 61a are connected to the bus 2a by a disk controller (adapter) 4a to constitute one storage device 7a. Similarly, the magnetic disks 60b and 61b are connected to the bus 2b by the disk controller 4b and constitute one storage device 7b. The magnetic disks 60a, 61a, 60b, 61b are hereinafter also referred to as physical disks. The device 50a is connected to the bus 2a, and the device 50b is connected to 2b. For example, the device 50a is an I / O processor including a storage device therein.

通常デバイス７ａ、７ｂおよびデバイス５０ａ、５０ｂは１システムの中に複数接続されており、デバイス７ａ、７ｂとデバイス５０ａ、５０ｂは相互に直接データ転送を行うものである。デバイス７ａ、７ｂに蓄えられてあるデータは、通常は主記憶装置９に転送され中央処理装置１０で処理されるが、中央処理装置の負荷やバスのデータ転送の負荷を軽減するために、主記憶装置を介さないデバイス間直接データ転送（ｐｅｅｒ−ｔｏ−ｐｅｅｒデータ転送）が行われることがある。つまり、デバイス７ａ、７ｂに蓄えられてあるデータは主記憶装置を経由すること無くデバイス５０ａ、５０ｂの内部にある記憶装置に転送され、５０ａ、５０ｂ内部にあるＩ／Ｏプロセッサで処理されることがある。 Usually, a plurality of devices 7a and 7b and devices 50a and 50b are connected in one system, and the devices 7a and 7b and the devices 50a and 50b directly transfer data to each other. The data stored in the devices 7a and 7b is normally transferred to the main storage device 9 and processed by the central processing unit 10, but in order to reduce the load on the central processing unit and the data transfer on the bus, In some cases, direct data transfer (peer-to-peer data transfer) between devices without using a storage device is performed. That is, the data stored in the devices 7a and 7b is transferred to the storage device inside the devices 50a and 50b without going through the main storage device and processed by the I / O processor inside the devices 50a and 50b. There is.

図１６は、特許文献１で開示されているシステム構成情報を保持するためのボリューム管理テーブルの内部構成図である。図１７は特許文献２で開示されている論理ボリュームの冗長構成情報を保持するための構成情報管理テーブルの内部構成図である。 FIG. 16 is an internal configuration diagram of a volume management table for holding system configuration information disclosed in Patent Document 1. FIG. 17 is an internal configuration diagram of a configuration information management table for holding redundant configuration information of logical volumes disclosed in Patent Document 2.

従来のデバイス間データ転送方法は、システムの構成情報を静的なテーブルに保持し、接続されるバスの階層をたどりバスの転送可能性を参照することによって、デバイス間の転送可能性の判断を行っていた（特許文献１）。また、デバイス間データ転送装置において、論理ボリューム単位で同一のデータを保持することで冗長構成とし、論理ボリュームの故障時に代替する方法も提案されている（特許文献２）。
特許第２９７８８８２号「デバイス間データ転送装置及びデバイス間データ転送方法」特開２００３−３０８１７４号公報「デバイス間データ転送装置、デバイス間データ転送方法及びプログラム」 In the conventional inter-device data transfer method, the system configuration information is held in a static table, and the transfer possibility between devices is determined by referring to the bus transfer possibility by following the hierarchy of the connected bus. (Patent Document 1). Also, a method has been proposed in which an inter-device data transfer apparatus has a redundant configuration by holding the same data in units of logical volumes and is substituted when a logical volume fails (Patent Document 2).
Japanese Patent No. 2978882 “Inter-device data transfer apparatus and inter-device data transfer method” JP 2003-308174 A "Inter-device data transfer apparatus, inter-device data transfer method and program"

従来のデバイス間データ転送方法は、システム構成情報を静的な状態をボリューム管理テーブルや冗長性管理テーブルとして保持するので、Ｉ／Ｏプロセッサや磁気ディスク装置等の各デバイスやバス結合装置が故障した場合、故障した装置を使用しないで縮退運転を可能とするためには、ボリューム管理テーブルや冗長性管理テーブルを再生成する必要がある。 In the conventional inter-device data transfer method, the system configuration information is held in a static state as a volume management table or a redundancy management table, so that each device such as an I / O processor or a magnetic disk device or a bus coupling device has failed. In this case, it is necessary to regenerate the volume management table and the redundancy management table in order to enable the degenerate operation without using the failed device.

そのために、オペレーティングシステムが装置の故障を認識したのちに、故障装置がシステムから切り離された状態でシステム情報を再度採取するか、もしくはボリューム管理テーブルや冗長性管理テーブルから故障装置の情報や論理ボリュームの情報を削除することにより故障装置へのアクセスを制限する必要があった。その際、オペレーティングシステムの再起動やボリューム管理テーブルおよび冗長性管理テーブルの生成の間、しばらくの間システムを停止しなければならないと言う問題があった。 Therefore, after the operating system recognizes the failure of the device, collect the system information again with the failed device disconnected from the system, or from the volume management table or redundancy management table, the failed device information or logical volume It was necessary to restrict access to the failed device by deleting the information. At that time, there is a problem that the system must be stopped for a while during the restart of the operating system and the generation of the volume management table and the redundancy management table.

また、故障装置がシステムから切り離されていない状態でオペレーティングシステムに対してシステム情報を採取しようとすると、エラーもしくはタイムアウトまで待ちが発生することがあるという問題があった。 Further, there is a problem in that waiting for an error or timeout may occur when attempting to collect system information from the operating system in a state where the failed device is not disconnected from the system.

また、故障時には完全にボリューム管理テーブルや冗長性管理テーブルから情報を削除してしまうため、故障した装置が復旧した場合に、再度ボリューム管理テーブルや冗長性管理テーブルを生成するしばらくの間、システムを停止しなければならないと言う問題があった。 In addition, since the information is completely deleted from the volume management table and redundancy management table in the event of a failure, when the failed device is restored, the system is There was a problem that I had to stop.

従来のデバイス間データ転送方法では、ボリューム管理テーブルのバス情報の転送可能性や冗長性管理テーブルの論理ボリューム情報の優先度によって転送可能性のレベルを判断していたため、Ｉ／Ｏプロセッサや磁気ディスク装置等のデバイスやバス結合装置が、必ずしも完全に使用不可能な状態とはならずに間欠的に故障の状態となるような場合に、できるだけエラーとならないような転送可能性の判断をすることができないという問題があった。 In the conventional device-to-device data transfer method, the transferability level is determined based on the transferability of the bus information in the volume management table and the priority of the logical volume information in the redundancy management table. Therefore, the I / O processor or magnetic disk When a device such as a device or a bus coupling device does not necessarily become completely unusable but intermittently becomes a failure state, the transfer possibility is determined so as not to cause an error as much as possible. There was a problem that could not.

また、論理ボリューム故障時の転送可能性の判断は、同一のデータを保持し冗長構成となっている論理ボリューム間でのみ優先度を判断していたため、論理ボリュームを構成する磁気ディスク装置の故障に応じた転送可能性を判断することができないと言う問題があった。 In addition, the determination of transferability when a logical volume fails is because the priority is determined only between logical volumes that have the same data and have a redundant configuration. There was a problem that it was not possible to judge the transfer possibility.

また、バスについてのみ転送可能性の情報を保持していたため、同じバスに接続されている個々のＩ／Ｏプロセッサや論理ボリュームを構成する磁気ディスク装置の故障状態に応じて、各装置の使用時の転送可能性を判断することができないと言う問題があった。 In addition, since transferability information was held only for the bus, when each device was used depending on the failure status of the individual I / O processors and logical disks that make up the logical volume connected to the same bus There was a problem that it was not possible to judge the transfer possibility.

また、従来のデバイス間データ転送方法は、バス同士を繋ぐそれぞれのバス結合装置の階層の数のみ転送可能性の判断に使用していたため、バス結合装置個別の故障状態やバスの階層を重ねることによる転送可能性の変化を識別した転送可能性の判断ができないと言う問題があった。 In addition, since the conventional inter-device data transfer method uses only the number of levels of each bus coupling device that connects the buses to determine the transfer possibility, the failure status of each bus coupling device and the bus hierarchy are overlapped. There was a problem that the transferability could not be determined by identifying the change in transferability due to.

また、従来のデバイス間データ転送方法は、Ｉ／Ｏプロセッサや論理ボリュームを構成する磁気ディスク装置からたどる最上位のバスの転送可能性か、もしくはＩ／Ｏプロセッサが直接接続しているバスの転送可能性についてのみ判断していたため、それ以外で接続関係にあるバスでの転送可能性を判断することができないと言う問題があった。 In addition, the conventional inter-device data transfer method is based on the possibility of transfer of the highest level bus traced from the magnetic disk device constituting the I / O processor or logical volume, or transfer of the bus directly connected to the I / O processor. Since only the possibility was determined, there was a problem that it was not possible to determine the transfer possibility on a bus having a connection relationship other than that.

従来のデバイス間データ転送方法では、故障によるエラーを未然に防ぐことは不可能であった。 In the conventional inter-device data transfer method, it is impossible to prevent an error due to a failure.

この発明は上記のような問題点を解決するためになされたもので、Ｉ／Ｏプロセッサや磁気ディスク装置やバス結合装置が故障した場合、システムの停止期間を最小限とすることを目的とする。また、Ｉ／Ｏプロセッサや磁気ディスク装置やバス結合装置が故障した場合、各装置の故障の状態に応じてデバイス間データ転送の転送可能性を判断することを目的とする。また、故障の予想を立てることでエラーを未然に防ぐことを目的とする。 The present invention has been made to solve the above-described problems, and it is an object of the present invention to minimize a system stop period when an I / O processor, a magnetic disk device, or a bus coupling device fails. . It is another object of the present invention to determine transferability of inter-device data transfer according to the failure state of each device when an I / O processor, magnetic disk device, or bus coupling device fails. Another object of the present invention is to prevent errors in advance by making a failure prediction.

本発明に係るデバイス間データ転送装置は、
複数の記憶デバイスと複数のデータ処理デバイスとが複数の転送経路と少なくとも一つの転送経路結合装置を介して接合され、記憶デバイスとデータ処理デバイスとの間で転送経路と転送経路結合装置を介してデータを転送するデバイス間データ転送装置において、
記憶デバイスの故障に関する情報と、データ処理デバイスの故障に関する情報と、転送経路結合装置の故障に関する情報を示すテーブルを記憶するテーブル記憶部と、
テーブルに示されている記憶デバイスの故障に関する情報と、データ処理デバイスの故障に関する情報と、転送経路結合装置の故障に関する情報に基づいて、記憶デバイスとデータ処理デバイスとの間でのデータ転送の可否を識別するデータ転送制御部とを備えたことを特徴とする。 An inter-device data transfer apparatus according to the present invention is as follows.
A plurality of storage devices and a plurality of data processing devices are joined via a plurality of transfer paths and at least one transfer path coupling device, and between the storage device and the data processing device via a transfer path and a transfer path coupling device. In a device-to-device data transfer device that transfers data,
A table storage unit that stores information about a failure of the storage device, information about a failure of the data processing device, and a table showing information about the failure of the transfer path coupling device;
Whether data transfer between the storage device and the data processing device is possible based on the storage device failure information, the data processing device failure information, and the transfer path coupling device failure information shown in the table. And a data transfer control unit for identifying the device.

本発明によれば、テーブルに記憶されている記憶デバイスの故障に関する情報と、データ処理デバイスの故障に関する情報と、転送経路結合装置の故障に関する情報に基づいて、記憶デバイスとデータ処理デバイスとの間でのデータ転送の可否を識別するため、いずれかの装置において故障があっても、テーブルを再生成する必要がなく、システムを停止させる必要がない。 According to the present invention, between the storage device and the data processing device based on the information on the failure of the storage device stored in the table, the information on the failure of the data processing device, and the information on the failure of the transfer path coupling device. Therefore, even if there is a failure in any of the devices, the table need not be regenerated and the system need not be stopped.

実施の形態１．
図１は、本実施の形態のデバイス間データ転送方法を実現するためのデバイス間データ転送装置のシステム構成の一例である。 Embodiment 1 FIG.
FIG. 1 is an example of a system configuration of an inter-device data transfer apparatus for realizing the inter-device data transfer method of the present embodiment.

１０は中央処理装置（ＣＰＵ）でシステム全体の各装置の制御を担当している。９は主記憶装置であり、この主記憶装置９に格納されているオペレーティングシステム９０によりシステム全体のソフトウェアが制御されている。１は中央処理装置１０と主記憶装置９と各デバイスとを接続しているバスであり、複数のバスを接続するためのバス結合装置３ａ、３ｂによってバス２ａ、２ｂに接続されている。バス結合装置３ａ、３ｂは例えばブリッジである。また、バス結合装置３ａ、３ｂは、転送経路結合装置の例である。磁気ディスク装置６０ａ、６１ａはディスクコントローラ４ａによってバス２ａに接続され一つの記憶デバイス７ａを構成している。同様に、磁気ディスク装置６０ｂ、６１ｂはディスクコントローラ４ｂによってバス２ｂに接続され一つの記憶デバイス７ｂを構成している。同様に、磁気ディスク装置６０ｃ、６１ｃはディスクコントローラ４ｃによってバス２ｂに接続され一つの記憶デバイス７ｃを構成している。デバイス５０ａ、５１ｂはバス２ａに接続され、デバイス５０ｂは２ｂに接続されており、例えば内部に記憶装置を含むＩ／Ｏプロセッサである。 A central processing unit (CPU) 10 is in charge of controlling each device of the entire system. Reference numeral 9 denotes a main storage device, and software of the entire system is controlled by an operating system 90 stored in the main storage device 9. Reference numeral 1 denotes a bus that connects the central processing unit 10, the main storage device 9, and each device, and is connected to the buses 2a and 2b by bus coupling devices 3a and 3b for connecting a plurality of buses. The bus coupling devices 3a and 3b are bridges, for example. The bus coupling devices 3a and 3b are examples of transfer path coupling devices. The magnetic disk devices 60a and 61a are connected to the bus 2a by the disk controller 4a to constitute one storage device 7a. Similarly, the magnetic disk devices 60b and 61b are connected to the bus 2b by the disk controller 4b to constitute one storage device 7b. Similarly, the magnetic disk devices 60c and 61c are connected to the bus 2b by the disk controller 4c to constitute one storage device 7c. The devices 50a and 51b are connected to the bus 2a, and the device 50b is connected to 2b. For example, the devices 50a and 51b are I / O processors including a storage device inside.

通常デバイス７ａ、７ｂ、７ｃおよびデバイス５０ａ、５１ａ、５０ｂは１システムの中に複数接続されており、本実施の形態に係るデバイス間データ転送装置はデバイス７ａ、７ｂ．７ｃとデバイス５０ａ、５１ａ、５０ｂの間で相互に直接データ転送を行うものである。 Usually, a plurality of devices 7a, 7b, 7c and devices 50a, 51a, 50b are connected in one system, and the inter-device data transfer apparatus according to the present embodiment includes devices 7a, 7b. 7c and devices 50a, 51a, 50b directly transfer data to each other.

デバイス７ａ、７ｂ、７ｃに蓄えられてあるデータは、通常は主記憶装置９に転送され中央処理装置１０で処理されるが、中央処理装置の負荷やバスのデータ転送の負荷を軽減するために、主記憶装置を介さないデバイス間直接データ転送（ｐｅｅｒ−ｔｏ−ｐｅｅｒデータ転送）が行われることがある。つまり、本実施の形態に係るデバイス間データ転送装置では、デバイス７ａ、７ｂ、７ｃに蓄えられてあるデータは主記憶装置を経由すること無くデバイス５０ａ、５１ａ、５０ｂの内部にある記憶装置に転送され、５０ａ、５１ａ、５０ｂ内部にあるＩ／Ｏプロセッサで処理する。 The data stored in the devices 7a, 7b, and 7c is normally transferred to the main memory 9 and processed by the central processing unit 10, but in order to reduce the load on the central processing unit and the data transfer on the bus In some cases, direct data transfer between devices (peer-to-peer data transfer) is performed without using the main storage device. That is, in the inter-device data transfer apparatus according to the present embodiment, data stored in the devices 7a, 7b, and 7c is transferred to the storage devices inside the devices 50a, 51a, and 50b without passing through the main storage device. It is processed by an I / O processor inside 50a, 51a, 50b.

本実施の形態では、記憶デバイス７ａ、７ｂ、７ｃは、それぞれ論理ボリュームとして設定されるものとする。 In the present embodiment, it is assumed that the storage devices 7a, 7b, and 7c are set as logical volumes.

ここで、論理ボリュームとは、複数のパーティションを組み合わせて論理的な１つの記憶装置として扱えるようにしたものである。例えば、いわゆるソフトウェアＲＡＩＤ（ＲｅｄｕｎｄａｎｔＡｒｒａｙｓｏｆＩｎｅｘｐｅｎｓｉｖｅＤｉｓｋ）の全てのレベル（０〜５）及び「ｓｐａｎｎｅｄｖｏｌｕｍｅ」も含んでいる。そして、パーティションは、物理ディスク全体であっても、その一部であってもよい。 Here, the logical volume is a combination of a plurality of partitions so that it can be handled as one logical storage device. For example, all levels (0 to 5) and “spanned volume” of so-called software RAID (Redundant Arrays of Inexpensive Disk) are also included. The partition may be the entire physical disk or a part thereof.

本実施の形態では、Ｉ／Ｏプロセッサ５０ａやバス結合装置３ｂや磁気ディスク装置６１ｂが故障した場合に、デバイス間データ転送の転送可能性を判断する手順を示す。 In the present embodiment, a procedure for determining transferability of inter-device data transfer when the I / O processor 50a, the bus coupling device 3b, or the magnetic disk device 61b has failed will be described.

本実施の形態では、この発明によるボリューム管理法およびデータ転送制御法を主記憶装置９に格納されるソフトウェアとしてボリューム管理モジュール９２（以下、ボリューム管理ともいう）およびデータ転送制御モジュール９３（以下、データ転送制御ともいう）で実現している。このボリューム管理９２およびデータ転送制御９３はオペレーティングシステム９０とは分離されたソフトウェアとして実装され、オペレーティングシステム９０に対してシステムの情報を問い合わせることができる。 In the present embodiment, the volume management method and the data transfer control method according to the present invention are software stored in the main storage device 9 as software stored in a volume management module 92 (hereinafter also referred to as volume management) and a data transfer control module 93 (hereinafter referred to as data). (Also called transfer control). The volume management 92 and the data transfer control 93 are implemented as software separated from the operating system 90, and can query the operating system 90 for system information.

なお、ボリューム管理モジュール９２及びデータ転送制御モジュール９３は、ファームウェアとして実現されてもよいし、ハードウェアとして実現されてもよいし、ソフトウェアとファームウェアとハードウェアの組み合わせて実現されてもよい。 Note that the volume management module 92 and the data transfer control module 93 may be realized as firmware, hardware, or a combination of software, firmware, and hardware.

ボリューム管理モジュール９２及びデータ転送制御モジュール９３は、その機能がプログラムで実現される場合には、プログラムをＣＰＵにより実行する実行回路のことである。つまり、ボリューム管理モジュール９２は、ボリューム管理プログラムをＣＰＵに実行させるボリューム管理部の例である。又、データ転送制御モジュール９３は、データ転送制御プログラムをＣＰＵに実行させるデータ転送制御部の例である。 The volume management module 92 and the data transfer control module 93 are execution circuits that execute a program by the CPU when the function is realized by the program. That is, the volume management module 92 is an example of a volume management unit that causes a CPU to execute a volume management program. The data transfer control module 93 is an example of a data transfer control unit that causes a CPU to execute a data transfer control program.

又、オペレーティングシステムはオペレーティングシステムプログラムをＣＰＵにより実行する実行回路のことであり、オペレーティングシステムとは、オペレーティングシステムプログラムをＣＰＵにより実行するオペレーティングシステム実行部のことである。 The operating system is an execution circuit that executes an operating system program by the CPU, and the operating system is an operating system execution unit that executes the operating system program by the CPU.

図２は本実施の形態におけるオペレーティングシステムとボリューム管理およびデータ転送制御の関係を示した構成図である。 FIG. 2 is a configuration diagram showing the relationship between the operating system, volume management, and data transfer control in this embodiment.

図２における矢印は、問い合わせ、参照、処理要求、生成を行う側から受ける側を指している。 The arrow in FIG. 2 indicates the side that receives from the side that performs the inquiry, reference, processing request, and generation.

オペレーティングシステム９０は初期時もしくはユーザによるツールの使用時に、デバイスドライバ９６に情報採取の要求を出し、ボリューム構成テーブル９１およびバス情報９５を生成する。ボリューム構成テーブル９１にはオペレーティングシステム９０が管理するボリュームの構成情報が格納されている。 The operating system 90 issues a request for collecting information to the device driver 96 at the initial time or when the user uses the tool, and generates the volume configuration table 91 and the bus information 95. The volume configuration table 91 stores volume configuration information managed by the operating system 90.

ボリューム管理９２はオペレーティングシステム初期化後に、ボリューム構成テーブル９１を直接参照するかもしくはオペレーティングシステム９０に対してボリューム構成情報やバス情報や物理ディスク・パーティション情報やバス接続情報やＩ／Ｏプロセッサ情報等の情報採取の要求を出し、ボリューム管理テーブル９４を生成する。ボリューム管理テーブル９４は、例えば、主記憶装置９において記憶される。この場合、主記憶装置９はテーブル記憶部の例となる。 The volume management 92 directly refers to the volume configuration table 91 after initialization of the operating system, or sends volume configuration information, bus information, physical disk / partition information, bus connection information, I / O processor information, etc. to the operating system 90. An information collection request is issued, and a volume management table 94 is generated. For example, the volume management table 94 is stored in the main storage device 9. In this case, the main storage device 9 is an example of a table storage unit.

データ転送制御９３はデータ転送前にボリューム管理９２に情報採取の要求を出し転送経路の転送可能性もしくは転送効率を取得する。この時ボリューム管理９２はボリューム管理テーブル９４を参照する。 The data transfer control 93 issues a request for collecting information to the volume management 92 before data transfer to acquire transferability or transfer efficiency of the transfer path. At this time, the volume management 92 refers to the volume management table 94.

データ転送制御９３は上記転送経路の情報をもとにオペレーティングシステム９０に対しデータ転送の要求を出す。オペレーティングシステム９０はデータ転送制御の要求に従い、デバイスドライバ９６にデータ転送の要求を渡す。 The data transfer control 93 issues a data transfer request to the operating system 90 based on the transfer path information. The operating system 90 passes the data transfer request to the device driver 96 in accordance with the data transfer control request.

特に、従来技術である特許第２９７８８８２号「デバイス間データ転送装置及びデバイス間データ転送方法」に比べ、本実施の形態ではデータ転送制御９３のデータ転送方法とボリューム管理テーブル９４が異なる。 In particular, the data transfer method of the data transfer control 93 and the volume management table 94 are different in this embodiment compared to Japanese Patent No. 2978882 “Inter-device data transfer apparatus and inter-device data transfer method”.

図３は図２におけるボリューム管理テーブル９４の内部構成を示したものである。ボリューム管理テーブルは主に、バス転送情報１００、バス結合装置情報１１０、物理ディスク情報１２０、論理ボリューム情報１３０、Ｉ／Ｏプロセッサ情報１４０から成る。 FIG. 3 shows the internal structure of the volume management table 94 in FIG. The volume management table mainly includes bus transfer information 100, bus coupling device information 110, physical disk information 120, logical volume information 130, and I / O processor information 140.

バス転送情報１００には主に、バス番号１０１とバスの転送可能性１０２を格納する。転送可能性とは、ｐｅｅｒ−ｔｏ−ｐｅｅｒデータ転送の不可能を含む転送効率である。これらの組がバスの数だけ格納される。 The bus transfer information 100 mainly stores a bus number 101 and a bus transfer possibility 102. The transfer possibility is transfer efficiency including the impossible of peer-to-peer data transfer. These sets are stored for the number of buses.

バス結合装置情報１１０には主に、バス結合装置の装置番号１１１と接続しているバスの上位バス番号１１２と下位バス番号１１３とバス結合装置の故障を示す故障フラグ１１４を格納する。これらの番号の組がバス結合装置の数だけ格納される。 The bus coupling device information 110 mainly stores an upper bus number 112 and a lower bus number 113 of a bus connected to the device number 111 of the bus coupling device, and a failure flag 114 indicating a failure of the bus coupling device. These number sets are stored as many as the number of bus coupling devices.

物理ディスク情報１２０には主に、物理ディスク番号１２１と接続されているバス番号１２２と物理ディスク内のパーティション情報１２３と物理ディスクの故障を示す故障フラグ１２４を格納する。これらの番号および情報の組が物理ディスクの数だけ格納される。 The physical disk information 120 mainly stores a bus number 122 connected to the physical disk number 121, partition information 123 in the physical disk, and a failure flag 124 indicating a physical disk failure. A set of these numbers and information is stored as many as the number of physical disks.

パーティション情報にはオペレーティングシステムが識別するパーティション番号やパーティション・オフセットなどが格納される。 The partition information stores a partition number or partition offset identified by the operating system.

論理ボリューム情報１３０には主に、論理ボリューム名１３１と論理ボリュームを構成する物理ディスクのパーティション情報１３２を格納する。これらの番号および情報の組が論理ボリュームの数だけ格納される。 The logical volume information 130 mainly stores the logical volume name 131 and the partition information 132 of the physical disk constituting the logical volume. A set of these numbers and information is stored for the number of logical volumes.

Ｉ／Ｏプロセッサ情報１４０には主に、Ｉ／Ｏプロセッサ番号１４１と接続バス番号１４２とＩ／Ｏプロセッサの故障を示す故障フラグ１４３を格納する。これら番号の組がＩ／Ｏプロセッサの数だけ格納される。 The I / O processor information 140 mainly stores an I / O processor number 141, a connection bus number 142, and a failure flag 143 indicating an I / O processor failure. A set of these numbers is stored as many as the number of I / O processors.

各装置の故障の状態を示す故障フラグ１１４、１２４、１４３をボリューム管理テーブルに有することが本実施の形態の特徴の一つとなる。故障フラグは、ユーザや外部プログラムにより値を変更することができる。 One of the features of the present embodiment is that the volume management table has failure flags 114, 124, and 143 indicating the failure state of each device. The value of the failure flag can be changed by a user or an external program.

図４と図５は、本実施の形態に係るデータ転送制御がＩ／Ｏプロセッサやバス結合装置や磁気ディスク装置が故障した場合に、データ転送時に転送経路の転送可能性を識別する手順を示したフローチャートである。 4 and 5 show a procedure for identifying transferability of a transfer path at the time of data transfer when the data transfer control according to the present embodiment fails in an I / O processor, a bus coupling device, or a magnetic disk device. It is a flowchart.

まず図４において転送制御９３がボリューム管理９２を通じてＩ／Ｏプロセッサ情報１４０を参照し、転送先のＩ／Ｏプロセッサ番号に対応する接続バス番号１４２を得る（ステップＳ１１）。 First, in FIG. 4, the transfer control 93 refers to the I / O processor information 140 through the volume management 92, and obtains a connection bus number 142 corresponding to the transfer destination I / O processor number (step S11).

バス転送情報１００の転送可能性１０２を参照し対象のバスが転送可能か否かを調べる（ステップＳ１２）。 It is checked whether or not the target bus can be transferred with reference to the transfer possibility 102 of the bus transfer information 100 (step S12).

ｐｅｅｒ−ｔｏ−ｐｅｅｒ転送不可能なバスである場合は、ステップＳ１５に進む。バス結合装置情報１１０の下位バス番号１１３から上位バス番号１１２を得る（ステップＳ１３）。 If the bus is not capable of peer-to-peer transfer, the process proceeds to step S15. The upper bus number 112 is obtained from the lower bus number 113 of the bus coupler information 110 (step S13).

上位バス番号が存在する場合は、上位バス番号を対象のバス番号としてステップＳ１２に戻る（ステップＳ１３）。 If the upper bus number exists, the process returns to step S12 with the upper bus number as the target bus number (step S13).

ステップＳ１２からステップＳ１４までのループが終了後、最後のバス番号を対象Ｉ／Ｏプロセッサの最上位番号として保存する（ステップＳ１５）。 After the loop from step S12 to step S14 is completed, the last bus number is stored as the highest number of the target I / O processor (step S15).

次に図５において、ボリューム管理テーブル９４のＩ／Ｏプロセッサ情報を参照し、故障フラグがＯＦＦでない場合、対象の論理ボリュームとＩ／Ｏプロセッサは転送不可能である（ステップＳ２１）。 Next, referring to the I / O processor information in the volume management table 94 in FIG. 5, if the failure flag is not OFF, the target logical volume and the I / O processor cannot be transferred (step S21).

故障フラグがＯＦＦである場合、論理ボリューム情報１３０を参照して、論理ボリュームを構成するパーティション情報１３２を列挙する（ステップＳ２２）。 If the failure flag is OFF, the partition information 132 constituting the logical volume is listed with reference to the logical volume information 130 (step S22).

列挙されたパーティション情報に対応する物理ディスク情報１２０の故障フラグを参照し、ＯＦＦでない場合は、対象の論理ボリュームとＩ／Ｏプロセッサは転送不可能である（ステップＳ２３）。 If the failure flag of the physical disk information 120 corresponding to the listed partition information is referred to and the flag is not OFF, the target logical volume and the I / O processor cannot be transferred (step S23).

故障フラグがＯＦＦの場合は、列挙されたパーティション情報に対応する物理ディスク情報１２０のパーティション情報１２３に接続される接続バス番号１２２を得る（ステップＳ２４）。 If the failure flag is OFF, the connection bus number 122 connected to the partition information 123 of the physical disk information 120 corresponding to the listed partition information is obtained (step S24).

バス転送情報１００の転送可能性１０２を参照し対象のバスが転送可能か否かを調べる（ステップＳ２５）。 It is checked whether or not the target bus can be transferred with reference to the transfer possibility 102 of the bus transfer information 100 (step S25).

ｐｅｅｒ−ｔｏ−ｐｅｅｒ転送不可能なバスである場合は、ステップＳ２６に進む。バス結合装置情報１１０の下位バス番号がバス番号に対応するバス結合装置情報の故障フラグを参照し、ＯＦＦでない場合は、バス階層をたどるループを抜けてＳ２９に進む。故障フラグがＯＦＦの場合は、バス結合装置情報１１０の下位バス番号１１３から上位バス番号１１２を得る（ステップＳ２７）。 If the bus is not capable of peer-to-peer transfer, the process proceeds to step S26. If the lower bus number of the bus coupling device information 110 refers to the failure flag of the bus coupling device information corresponding to the bus number and is not OFF, the process goes through a loop following the bus hierarchy and proceeds to S29. If the failure flag is OFF, the upper bus number 112 is obtained from the lower bus number 113 of the bus coupler information 110 (step S27).

上位バス番号が存在する場合は、上位バス番号を対象のバス番号としてステップＳ２５に戻る。ステップＳ２５からステップＳ２９までのループが終了後、最後のバス番号を対象パーティションの最上位バス番号とし、図４におけるステップＳ１５で保存したＩ／Ｏプロセッサの最上位バス番号と比較する（ステップＳ２９）。 If the upper bus number exists, the process returns to step S25 with the upper bus number as the target bus number. After the loop from step S25 to step S29 is completed, the last bus number is set as the highest bus number of the target partition and compared with the highest bus number of the I / O processor saved in step S15 in FIG. 4 (step S29). .

最上位バス番号が一致しなければ対象の論理ボリュームとＩ／Ｏプロセッサは転送不可能である。最上位バス番号が一致した場合、次のパーティションが存在するならばステップＳ２３に戻る（ステップＳ３０）。 If the highest bus numbers do not match, the target logical volume and the I / O processor cannot be transferred. If the highest bus numbers match, the process returns to step S23 if the next partition exists (step S30).

すべてのパーティション情報について最上位バス番号がＩ／Ｏプロセッサの最上位バス番号と一致した場合、対象の論理ボリュームとＩ／Ｏプロセッサはｐｅｅｒ−ｔｏ−ｐｅｅｒ転送可能である。 When the highest bus number for all partition information matches the highest bus number of the I / O processor, the target logical volume and the I / O processor can perform peer-to-peer transfer.

データ転送制御は上記それぞれの手順（図４、図５）に従い、Ｉ／Ｏプロセッサやバス結合装置や磁気ディスク装置が故障しているシステムにおいて、データが格納されている論理ボリュームからｐｅｅｒ−ｔｏ−ｐｅｅｒ転送可能なＩ／Ｏプロセッサを探し出すことができる。 Data transfer control is performed in accordance with each of the above procedures (FIGS. 4 and 5) in a system in which an I / O processor, bus coupling device, or magnetic disk device has failed, from a logical volume storing data to peer-to- An I / O processor capable of peer transfer can be found.

データ転送可能なＩ／Ｏプロセッサが存在しない場合、ユーザに対して警告を発生しても良いし、主記憶装置に転送して中央処理装置（ＣＰＵ）で処理させても良い。 If there is no I / O processor that can transfer data, a warning may be issued to the user, or it may be transferred to the main memory and processed by a central processing unit (CPU).

本実施の形態では、論理ボリュームを構成する記憶デバイスとして「磁気ディスク」を例としたが、磁気テープや半導体ディスクなどいかなる記憶媒体であっても良い。また、ネットワークを介したリモートドライブであっても良い。 In this embodiment, a “magnetic disk” is used as an example of a storage device constituting a logical volume, but any storage medium such as a magnetic tape or a semiconductor disk may be used. Further, it may be a remote drive via a network.

このように、本実施の形態に係るデバイス間データ転送装置及びデバイス間データ転送方法は、複数の記憶デバイスと複数のデータ処理デバイス間のデータ転送可否を記憶するテーブルを有し、Ｉ／Ｏプロセッサ、磁気ディスク装置およびバス結合装置が故障の際に、それぞれの装置について故障フラグのＯＮ／ＯＦＦを設定し、論理ボリュームを構成する磁気ディスク装置とＩ／Ｏプロセッサとの転送可能性の判断を行うことで、管理テーブル全体を更新すること無しに、故障デバイスへのアクセスを制限することを特徴とする。 As described above, the inter-device data transfer apparatus and the inter-device data transfer method according to the present embodiment have a table for storing data transfer availability between a plurality of storage devices and a plurality of data processing devices, and an I / O processor. When a failure occurs in the magnetic disk device and the bus coupling device, the failure flag is set ON / OFF for each device, and the transfer possibility between the magnetic disk device and the I / O processor constituting the logical volume is determined. Thus, access to the failed device is limited without updating the entire management table.

以上のように、Ｉ／Ｏデバイス、磁気ディスク装置、バス結合装置それぞれの装置の故障の有無を判断するようにしているので、各装置の故障状態を識別し縮退運転を判断することができる。 As described above, since the presence / absence of a failure in each of the I / O device, the magnetic disk device, and the bus coupling device is determined, it is possible to identify the failure state of each device and determine the degenerate operation.

つまり、従来技術のようにボリューム管理テーブルに故障フラグが存在しない場合は、それぞれの装置についての転送可能性を確認してから故障の有無が判明するので、故障のある装置を切り離すためにボリューム管理テーブルを新たに作り直さなければならなかったが、本実施の形態では、故障フラグによりＩ／Ｏデバイス、磁気ディスク装置、バス結合装置のそれぞれの装置の故障の有無を判断し、故障であると判断した装置はｐｅｅｒ−ｔｏ−ｐｅｅｒ転送不可と判断し、転送対象とみなさないため、故障している装置を切り離した状態と同様の状態になり、ボリューム管理テーブルを作り直す必要がない。このため、システムを停止させることなく、縮退運転を開始することができる。 In other words, if there is no failure flag in the volume management table as in the prior art, the presence or absence of a failure can be determined after confirming the transferability of each device. Although the table had to be newly recreated, in this embodiment, the presence / absence of a failure in each of the I / O device, magnetic disk device, and bus coupling device is determined based on the failure flag, and the failure is determined. The determined device determines that peer-to-peer transfer is not possible and does not consider it as a transfer target. Therefore, the device is in the same state as when the failed device is disconnected, and there is no need to recreate the volume management table. For this reason, the degenerate operation can be started without stopping the system.

また、転送可能性判断のバス階層探索のループにおいて、故障装置がある場合に中断するようにしているため、全ての接続状態の装置をチェックしなくても、転送不可能を判断することができる。また、システム構成を保持するテーブルの一部の値のみ変更するようにしているので、システムの状態の変化に応じて速やかに構成情報を変更することでき、システムの停止期間を最小限にすることができる。 In addition, in the bus hierarchy search loop for determining transferability, if there is a faulty device, it is interrupted, so it is possible to determine whether transfer is impossible without checking all connected devices. . In addition, since only some values in the table that holds the system configuration are changed, the configuration information can be changed promptly according to changes in the system status, and the system downtime can be minimized. Can do.

また、故障中の装置の接続関係等の情報を残したまま使用不可能である判断を行うため、故障装置が復旧した際に、速やかにシステム構成を変更でき、システムの停止期間を最小限にすることができる。 In addition, since it is determined that the device cannot be used while retaining information such as the connection relationship of the failed device, the system configuration can be changed quickly when the failed device is restored, minimizing the system outage period. can do.

これにより、故障前の状態に応じた復旧の重要性／必要性をユーザや他のプログラムに知らせることができる。 As a result, it is possible to notify the user and other programs of the importance / necessity of recovery according to the state before the failure.

また、磁気ディスク装置上のデータを故障していないＩ／Ｏプロセッサにデータを流すことができる。また、すべての論理ボリュームが、少なくとも１つ以上のＩ／Ｏプロセッサで転送可能である場合、システムを縮退運転が可能であると判断することができる。 Further, the data on the magnetic disk device can be sent to an I / O processor that has not failed. Further, if all logical volumes can be transferred by at least one or more I / O processors, it can be determined that the system can perform degenerate operation.

さらに、論理ボリュームのデータを処理するＩ／Ｏプロセッサについて、論理ボリュームあたりの転送可能なＩ／Ｏプロセッサ台数を求めて、特定の台数を満たさない場合に警告を発生することで、Ｉ／Ｏプロセッサ故障においてデータの処理性能を一定以上に保つようユーザや他のプログラムに知らせることが可能となる。 Further, with respect to the I / O processor that processes the data of the logical volume, the number of I / O processors that can be transferred per logical volume is obtained, and a warning is issued when the specific number is not satisfied. It is possible to notify the user and other programs to keep the data processing performance above a certain level in the event of a failure.

また、論理ボリュームのデータを転送可能なＩ／Ｏプロセッサの数が特定の台数を満たさない場合に警告を発生することで、Ｉ／Ｏプロセッサの故障において一定の信頼性を確保するようユーザや他のプログラムに知らせることが可能となる。 In addition, by issuing a warning when the number of I / O processors that can transfer logical volume data does not satisfy a specific number, users and others can ensure a certain level of reliability in the event of an I / O processor failure. It is possible to inform the program.

実施の形態２．
以上の実施の形態１では、Ｉ／Ｏプロセッサやバス結合装置や磁気ディスク装置の故障を識別するようにしたものであるが、次にＩ／Ｏプロセッサやバス結合装置や磁気ディスク装置が必ずしも完全に使用不可能ではないが、ある確率でエラーとなるような間欠的に故障の状態となる場合を識別する実施の形態を示す。 Embodiment 2. FIG.
In the first embodiment described above, the failure of the I / O processor, the bus coupling device, and the magnetic disk device is identified. Next, the I / O processor, the bus coupling device, and the magnetic disk device are not necessarily complete. An embodiment for identifying a case in which a failure occurs intermittently such that an error occurs with a certain probability, although it is not usable.

図６はこのような場合の動作を説明するためのシステム構成の一例である。 FIG. 6 is an example of a system configuration for explaining the operation in such a case.

１０は中央処理装置（ＣＰＵ）でシステム全体の各装置の制御を担当している。９は主記憶装置であり、この主記憶装置９に格納されているオペレーティングシステム９０によりシステム全体のソフトウェアが制御されている。１は中央処理装置１０と主記憶装置９と各デバイスとを接続しているバスであり、複数のバスを接続するためのバス結合装置３ｄによってバス２ｄに接続されている。さらに、バス２ｄはバス結合装置３ｅによってバス２ｅに接続される。バス結合装置３ｄ、３ｅは例えばブリッジである。磁気ディスク装置６０ｅ、６１ｅはディスクコントローラ４０ｅによってバス２ｅに接続され一つの記憶デバイスであり論理ボリューム８０ｅを構成している。同様に、磁気ディスク装置６２ｅ、６３ｅはディスクコントローラ４１ｅによってバス２ｅに接続され一つの記憶デバイスであり論理ボリューム８１ｅを構成している。デバイス５ｆはバス１に接続され、デバイス５ｄはバス２ｄに接続され、デバイス５ｅは２ｅに接続されており、例えば内部に記憶装置を含むＩ／Ｏプロセッサである。 A central processing unit (CPU) 10 is in charge of controlling each device of the entire system. Reference numeral 9 denotes a main storage device, and software of the entire system is controlled by an operating system 90 stored in the main storage device 9. Reference numeral 1 denotes a bus connecting the central processing unit 10, the main storage device 9, and each device, and is connected to the bus 2d by a bus coupling device 3d for connecting a plurality of buses. Further, the bus 2d is connected to the bus 2e by a bus coupling device 3e. The bus coupling devices 3d and 3e are, for example, bridges. The magnetic disk devices 60e and 61e are connected to the bus 2e by the disk controller 40e and are one storage device and constitute a logical volume 80e. Similarly, the magnetic disk devices 62e and 63e are connected to the bus 2e by the disk controller 41e and are one storage device and constitute a logical volume 81e. The device 5f is connected to the bus 1, the device 5d is connected to the bus 2d, and the device 5e is connected to 2e. For example, the device 5f is an I / O processor including a storage device.

通常、論理ボリューム８０ｅ、８１ｅおよびデバイス５ｄ、５ｅ、５ｆは１システムの中に複数接続されており、本実施の形態に係るデバイス間データ転送装置は論理ボリューム８０ｅ、８１ｅとデバイス５ｄ、５ｅ、５ｆの間で相互に直接データ転送を行うものである。 Normally, a plurality of logical volumes 80e and 81e and devices 5d, 5e, and 5f are connected in one system, and the inter-device data transfer apparatus according to the present embodiment is logical volumes 80e and 81e and devices 5d, 5e, and 5f. Data is directly transferred between each other.

論理ボリューム８０ｅ、８１ｅに蓄えられてあるデータは、通常は主記憶装置９に転送され中央処理装置１０で処理されるが、本実施の形態係るデバイス間データ転送装置では、論理ボリューム８０ｅ、８１ｅに蓄えられてあるデータは主記憶装置を経由すること無くデバイス５ｄ、５ｅ、５ｆの内部にある記憶装置に転送され、５ｄ、５ｅ、５ｆ内部にあるＩ／Ｏプロセッサで処理するものである。 Data stored in the logical volumes 80e and 81e is normally transferred to the main storage device 9 and processed by the central processing unit 10, but in the inter-device data transfer apparatus according to the present embodiment, the data is stored in the logical volumes 80e and 81e. The stored data is transferred to the storage device inside the devices 5d, 5e, 5f without going through the main storage device, and is processed by the I / O processor inside the 5d, 5e, 5f.

本実施の形態では、物理デバイスの単位で論理ボリュームが設定されている例を示しているが、複数の記憶デバイスにまたがる論理ボリュームを構成することも可能である。 In the present embodiment, an example is shown in which logical volumes are set in units of physical devices, but it is also possible to configure logical volumes that span a plurality of storage devices.

本実施の形態では、Ｉ／Ｏプロセッサ５ｄ、５ｅやバス結合装置３ｄや磁気ディスク装置６３ｅが完全には使用不可能ではないがある確率でエラーとなるような間欠的に故障の状態となる場合に、デバイス間データ転送の転送可能性を識別する手順を示す。 In the present embodiment, when the I / O processors 5d and 5e, the bus coupling device 3d, and the magnetic disk device 63e are intermittently in a state of failure such that an error occurs with a probability that they are not completely unusable. Shows a procedure for identifying the transferability of data transfer between devices.

例えば、段階的な故障の状態である故障レベルを不具合の発生する発生率とした場合、Ｉ／Ｏプロセッサ５ｄの故障レベルを２０％、Ｉ／Ｏプロセッサ５ｅの故障レベルを５０％、バス結合装置３ｄの故障レベルを１０％、磁気ディスク装置６３ｅの故障レベルを３０％のように表すことができる。なお、この故障レベルは、故障発生頻度値の例である。 For example, when the failure level, which is a staged failure state, is the occurrence rate of failure, the failure level of the I / O processor 5d is 20%, the failure level of the I / O processor 5e is 50%, and the bus coupling device The failure level of 3d can be expressed as 10%, and the failure level of the magnetic disk device 63e can be expressed as 30%. This failure level is an example of a failure occurrence frequency value.

本実施の形態におけるオペレーティングシステムとボリューム管理およびデータ転送制御の関係は実施の形態１と同様図２で示される。 The relationship between the operating system, volume management, and data transfer control in this embodiment is shown in FIG. 2 as in the first embodiment.

本実施の形態では、データ転送制御９３のデータ転送方法とボリューム管理テーブル９４が実施の形態１と異なる。 In the present embodiment, the data transfer method of the data transfer control 93 and the volume management table 94 are different from those in the first embodiment.

図７はボリューム管理テーブル９４の内部構成を示したものである。 FIG. 7 shows the internal structure of the volume management table 94.

ボリューム管理テーブルは主に、バス転送情報１００、バス結合装置情報１１０、物理ディスク情報１２０、論理ボリューム情報１３０、Ｉ／Ｏプロセッサ情報１４０から成る。 The volume management table mainly includes bus transfer information 100, bus coupling device information 110, physical disk information 120, logical volume information 130, and I / O processor information 140.

バス結合装置情報１１０には主に、バス結合装置の装置番号１１１と接続しているバスの上位バス番号１１２と下位バス番号１１３とバス結合装置の故障の程度を示す故障レベル１１５を格納する。これらの番号の組がバス結合装置の数だけ格納される。 The bus coupling device information 110 mainly stores an upper bus number 112 and a lower bus number 113 of a bus connected to the device number 111 of the bus coupling device, and a failure level 115 indicating the degree of failure of the bus coupling device. These number sets are stored as many as the number of bus coupling devices.

物理ディスク情報１２０には主に、物理ディスク番号１２１と接続されているバス番号１２２と物理ディスク内のパーティション情報１２３と物理ディスクの故障の程度を示す故障レベル１２５を格納する。これらの番号および情報の組が物理ディスクの数だけ格納される。 The physical disk information 120 mainly stores a bus number 122 connected to the physical disk number 121, partition information 123 in the physical disk, and a failure level 125 indicating the degree of failure of the physical disk. A set of these numbers and information is stored as many as the number of physical disks.

Ｉ／Ｏプロセッサ情報１４０には主に、Ｉ／Ｏプロセッサ番号１４１と接続バス番号１４２とＩ／Ｏプロセッサの故障の程度を示す故障レベル１４４を格納する。これら番号の組がＩ／Ｏプロセッサの数だけ格納される。 The I / O processor information 140 mainly stores an I / O processor number 141, a connection bus number 142, and a failure level 144 indicating the degree of failure of the I / O processor. A set of these numbers is stored as many as the number of I / O processors.

各装置の故障の程度を示す故障レベル１１５、１２５、１４４をボリューム管理テーブルに有することが特徴となる。故障レベルは、ユーザや外部プログラムにより値を変更することができる。 The volume management table has failure levels 115, 125, and 144 indicating the degree of failure of each device. The failure level can be changed by a user or an external program.

図８と図１０は、本実施の形態に係るデータ転送制御がＩ／Ｏプロセッサやバス結合装置や磁気ディスク装置が間欠的に故障の状態にある場合に、データ転送時に転送経路の転送可能性を判断する手順を示したフローチャートである。 FIGS. 8 and 10 show the transfer possibility of the transfer path during data transfer when the data transfer control according to this embodiment is intermittently in the state of failure of the I / O processor, the bus coupling device, or the magnetic disk device. It is the flowchart which showed the procedure which judges.

まず図８において転送制御９３がボリューム管理９２を通じてＩ／Ｏプロセッサ情報１４０を参照し、故障レベル１４４の値を記憶する（ステップＳ３１）。 First, in FIG. 8, the transfer control 93 refers to the I / O processor information 140 through the volume management 92 and stores the value of the failure level 144 (step S31).

続いて、転送先のＩ／Ｏプロセッサ番号に対応する接続バス番号１４２を得る（ステップＳ３２）。 Subsequently, the connection bus number 142 corresponding to the transfer destination I / O processor number is obtained (step S32).

バス転送情報１００の転送可能性１０２を参照し対象のバスが転送可能か否かを調べる（ステップＳ３３）。 It is checked whether or not the target bus can be transferred with reference to the transfer possibility 102 of the bus transfer information 100 (step S33).

ｐｅｅｒ−ｔｏ−ｐｅｅｒ転送不可能なバスである場合は、ステップＳ３７に進む。Ｓ３３でＹＥＳの場合は、バス結合装置情報１１０の下位バス番号１１３が等しいものに関する故障レベル１１５を、記憶している故障レベルに累積計算してＩ／Ｏプロセッサの接続バスリストに加える。（ステップＳ３４）
図９はＩ／Ｏプロセッサの接続バスリストの内容を示した図である。 If the bus is not capable of peer-to-peer transfer, the process proceeds to step S37. In the case of YES in S33, the failure level 115 relating to the bus bus device information 110 having the same lower bus number 113 is cumulatively calculated in the stored failure level and added to the connection bus list of the I / O processor. (Step S34)
FIG. 9 is a diagram showing the contents of the connection bus list of the I / O processor.

バス番号１５１ごとに故障レベルの累計値１５２を記憶する。累積計算する方法は、例えば既に記憶している故障レベルの値と対象としている故障レベルの内、値の大きいほうを記憶に残すＭＡＸ演算でも良いし、単純に足し算を行うのでも良いし、故障レベルが不具合発生率とした場合は（１−（１−記録故障レベル／１００）＊（１−対象故障レベル／１００））＊１００としても良い。 The cumulative value 152 of the failure level is stored for each bus number 151. The cumulative calculation method may be, for example, a MAX operation in which the larger one of the already stored failure level value and the target failure level is stored in memory, simple addition may be performed, When the level is a failure occurrence rate, it may be (1- (1-recording failure level / 100) * (1-target failure level / 100)) * 100.

続いて、バス結合装置情報１１０の上位バス番号１１２を得る（ステップＳ３５）。 Subsequently, the upper bus number 112 of the bus coupler information 110 is obtained (step S35).

上位バス番号が存在する場合は、上位バス番号を対象のバス番号としてステップＳ３３に戻る（ステップＳ３６）。 If the upper bus number exists, the process returns to step S33 with the upper bus number as the target bus number (step S36).

ステップＳ３３からステップＳ３６までのループが終了後、対象Ｉ／Ｏプロセッサの接続バスリストを保存する（ステップＳ３７）。 After the loop from step S33 to step S36 is completed, the connection bus list of the target I / O processor is saved (step S37).

図１０はＩ／Ｏプロセッサと論理ボリュームとの間でデバイス間データ転送の転送可能性を求める手順を示したフローチャートである。 FIG. 10 is a flowchart showing a procedure for obtaining transferability of data transfer between devices between the I / O processor and the logical volume.

ボリューム管理テーブル９４の論理ボリューム情報１３０を参照して、論理ボリューム構成するパーティション情報１３２を列挙する（ステップＳ４１）。 With reference to the logical volume information 130 of the volume management table 94, the partition information 132 constituting the logical volume is listed (step S41).

物理ディスク情報１２０のパーティション情報１２３に対応する接続バス番号１２２を得る（ステップＳ４２）。 A connection bus number 122 corresponding to the partition information 123 of the physical disk information 120 is obtained (step S42).

物理ディスク情報１２０のパーティション情報１２３に対応する故障レベルを記憶する（ステップＳ４３）。 The failure level corresponding to the partition information 123 of the physical disk information 120 is stored (step S43).

バス転送情報１００の転送可能性１０２を参照し対象のバスが転送可能か否かを調べる（ステップＳ４４）。 It is checked whether or not the target bus can be transferred with reference to the transfer possibility 102 of the bus transfer information 100 (step S44).

ｐｅｅｒ−ｔｏ−ｐｅｅｒ転送不可能なバスである場合は、対象の論理ボリュームとＩ／Ｏプロセッサは転送不可能である。バス結合装置情報１１０の下位バス番号１１３に対応する故障レベル１１５を取得し、記憶している故障レベルに累積計算する（ステップＳ４５）。 If the bus is not capable of peer-to-peer transfer, the target logical volume and the I / O processor cannot be transferred. The failure level 115 corresponding to the lower bus number 113 of the bus coupling device information 110 is acquired and cumulatively calculated to the stored failure level (step S45).

累積計算する方法は、例えば既に記憶している故障レベルの値と対象としている故障レベルの内、値の大きいほうを記憶に残すＭＡＸ演算でも良いし、単純に足し算を行うのでも良いし、故障レベルが不具合発生率とした場合は（１−（１−記録故障レベル／１００）＊（１−対象故障レベル／１００））＊１００としても良い。バス番号をＩ／Ｏプロセッサの接続バスリスト１５０上のバス番号１５１と比較し、存在しなければＳ４９に進む（ステップＳ４６）。 The cumulative calculation method may be, for example, a MAX operation in which the larger one of the already stored failure level value and the target failure level is stored in memory, simple addition may be performed, When the level is a failure occurrence rate, it may be (1- (1-recording failure level / 100) * (1-target failure level / 100)) * 100. The bus number is compared with the bus number 151 on the connection bus list 150 of the I / O processor, and if it does not exist, the process proceeds to S49 (step S46).

存在しない場合、バス結合装置情報１１０の下位バス番号１１３から上位バス番号１１２を得る（ステップＳ４７）。上位バス番号が存在する場合は、上位バス番号を対象のバス番号としてステップＳ４４に戻る（ステップＳ４８）。 If not, the higher bus number 112 is obtained from the lower bus number 113 of the bus coupler information 110 (step S47). If the upper bus number exists, the process returns to step S44 with the upper bus number as the target bus number (step S48).

上位バス番号が存在しない場合は、記憶している故障レベル対して、Ｉ／Ｏプロセッサの接続バスリスト１５０上の対応するバス番号１５１の故障レベル１５２を累積計算する（ステップＳ４９）。 If there is no higher bus number, the failure level 152 of the corresponding bus number 151 on the connection bus list 150 of the I / O processor is cumulatively calculated for the stored failure level (step S49).

累積計算する方法はＳ４５と同様とする。記憶している故障レベルが既定値に超えている場合、転送不可能である（ステップＳ５０）。次のパーティションが存在するならばステップＳ４４に戻る（ステップＳ５１）。 The cumulative calculation method is the same as in S45. If the stored failure level exceeds the predetermined value, transfer is impossible (step S50). If the next partition exists, the process returns to step S44 (step S51).

すべてのパーティション情報について、Ｉ／Ｏプロセッサの接続バス番号と一致するバス番号がバス階層の一部に存在し、かつ累計した故障レベルが規定の故障レベル以内である場合、対象の論理ボリュームとＩ／Ｏプロセッサはｐｅｅｒ−ｔｏ−ｐｅｅｒ転送可能である。 For all partition information, if the bus number that matches the I / O processor connection bus number exists in a part of the bus hierarchy and the accumulated failure level is within the prescribed failure level, the target logical volume and I The / O processor can perform peer-to-peer transfer.

図１１は、図６の構成図において図８と図１０の手順に従って転送可能性を求めた例である。
磁気ディスク装置６０ｅとＩ／Ｏプロセッサ５ｄの間は、既定の故障レベル４０未満であるためｐｅｅｒ−ｔｏ−ｐｅｅｒ転送可能である。同様に、磁気ディスク装置６３ｅとＩ／Ｏプロセッサ５ｆの間は、既定の故障レベル４０未満（ＭＡＸ演算の場合）であるためｐｅｅｒ−ｔｏ−ｐｅｅｒ転送可能である。 FIG. 11 is an example in which transferability is obtained according to the procedure of FIGS. 8 and 10 in the configuration diagram of FIG.
Peer-to-peer transfer is possible between the magnetic disk device 60e and the I / O processor 5d because the failure level is less than the predetermined failure level 40. Similarly, peer-to-peer transfer is possible between the magnetic disk device 63e and the I / O processor 5f because it is less than the predetermined failure level 40 (in the case of MAX operation).

データ転送制御は上記それぞれの手順（図８、図１０）に従い、Ｉ／Ｏプロセッサやバス結合装置や磁気ディスク装置の故障の程度を設定したシステムにおいて、データが格納されている論理ボリュームからｐｅｅｒ−ｔｏ−ｐｅｅｒ転送可能なＩ／Ｏプロセッサを探し出すことができる。データ転送可能なＩ／Ｏプロセッサが存在しない場合、ユーザに対して警告を発生しても良いし、主記憶装置に転送して中央処理装置（ＣＰＵ）で処理させても良い。 Data transfer control is performed in accordance with each of the above procedures (FIGS. 8 and 10) in a system in which the degree of failure of the I / O processor, bus coupling device, and magnetic disk device is set, from the logical volume storing the data to the peer- An I / O processor capable of to-peer transfer can be found. If there is no I / O processor that can transfer data, a warning may be issued to the user, or it may be transferred to the main memory and processed by a central processing unit (CPU).

このように、本実施の形態に係るデバイス間データ転送装置及びデバイス間データ転送方法は、複数の記憶デバイスと複数のデータ処理デバイス間のデータ転送可否を記憶するテーブルを有し、Ｉ／Ｏプロセッサ、磁気ディスク装置およびバス結合装置が故障の際に、それぞれの装置の故障の重大度に応じて故障レベル（エラー発生頻度）を設定し、Ｉ／Ｏプロセッサ、磁気ディスク装置およびバス結合装置のエラー発生頻度を累積して、論理ボリュームを構成する磁気ディスク装置とＩ／Ｏプロセッサとの転送可能性の判断を行うことで、管理テーブル全体を更新すること無しに、エラー発生頻度の高いＩ／Ｏプロセッサ、磁気ディスク装置やバス結合装置へのアクセスを制限することを特徴とする。 As described above, the inter-device data transfer apparatus and the inter-device data transfer method according to the present embodiment have a table for storing data transfer availability between a plurality of storage devices and a plurality of data processing devices, and an I / O processor. When a failure occurs in the magnetic disk device and the bus coupling device, a failure level (error occurrence frequency) is set according to the severity of the failure of each device, and an error occurs in the I / O processor, magnetic disk device, and bus coupling device. By accumulating the occurrence frequency and determining the transferability between the magnetic disk device constituting the logical volume and the I / O processor, the I / O having a high error occurrence frequency is performed without updating the entire management table. Access to a processor, a magnetic disk device, and a bus coupling device is restricted.

以上のように、Ｉ／Ｏデバイス、磁気ディスク装置、バス結合装置それぞれの装置の故障レベルを判定するようにしているので、各装置の故障の程度を識別し縮退運転を判断することができる。 As described above, the failure level of each device of the I / O device, the magnetic disk device, and the bus coupling device is determined, so that the degree of failure of each device can be identified and the degenerate operation can be determined.

また、システム構成を保持するテーブルの一部の値のみ変更するようにしているので、システムの状態の変化に応じて速やかに構成情報を変更することでき、システムの停止期間を最小限にすることができる。 In addition, since only some values in the table that holds the system configuration are changed, the configuration information can be changed promptly according to changes in the system status, and the system downtime can be minimized. Can do.

また、故障中の装置の接続関係等の情報を残したまま転送可能の程度を判断しているため、故障の程度（不具合の発生頻度）が変更された際に、速やかにシステム構成を変更でき、システムの停止期間を最小限にすることができる。 In addition, because the degree of transfer is determined while leaving information such as the connection relationship of the device under failure, the system configuration can be changed quickly when the degree of failure (fault occurrence frequency) is changed. The system downtime can be minimized.

さらに、バス結合装置の故障レベルを累積計算するので、バスの階層を重ねることで故障装置が複数のデータ転送に関与する場合も、故障のレベルに応じた縮退を判断することができる。 Further, since the failure level of the bus coupling device is cumulatively calculated, degeneration corresponding to the failure level can be determined even when the failure device is involved in a plurality of data transfers by overlapping the bus hierarchy.

また、各装置の不具合発生頻度を元にして故障レベルを設定することにより、頻繁にエラーが発生する装置から順に使用不可能とすることができる。 In addition, by setting the failure level based on the frequency of occurrence of failures in each device, it is possible to disable the devices in order from the devices that frequently generate errors.

また、磁気ディスク装置上のデータを故障レベルの低いＩ／Ｏプロセッサにデータを流すことができる。また、すべての論理ボリュームが、少なくとも１つ以上のＩ／Ｏプロセッサで転送可能である場合、システムを縮退運転が可能であると判断することができる。 Further, data on the magnetic disk device can be sent to an I / O processor with a low failure level. Further, if all logical volumes can be transferred by at least one or more I / O processors, it can be determined that the system can perform degenerate operation.

また、論理ボリュームのデータを転送可能なＩ／Ｏプロセッサの数が特定の台数を満たさない場合に警告を発生することで、Ｉ／Ｏプロセッサ故障において一定の信頼性を確保するようユーザや他のプログラムに知らせることが可能となる。 In addition, by issuing a warning when the number of I / O processors that can transfer logical volume data does not satisfy a specific number, it is possible to ensure a certain level of reliability in the event of an I / O processor failure. It is possible to inform the program.

実施の形態３．
以上の実施の形態１および実施の形態２では、Ｉ／Ｏプロセッサやバス結合装置や磁気ディスク装置の故障や故障の程度を識別して転送可能性を判定するようにしたものであるが、故障そのものを検出して、設定すべき故障のレベルの値を決めるのは、ユーザもしくは外部プログラムやオペレーティングシステムが行うものである。 Embodiment 3 FIG.
In the first and second embodiments described above, the failure and the degree of failure of the I / O processor, the bus coupling device, and the magnetic disk device are identified to determine the transfer possibility. It is a user, an external program, or an operating system that detects this and determines the value of the failure level to be set.

本実施の形態では、既に設定されている故障レベルの値によって、バス結合装置の故障レベルを変更することによって、故障の可能性を予見して転送可能性の判断に使用するものである。 In the present embodiment, the failure level of the bus coupling device is changed according to the failure level value that has already been set, so that the possibility of failure can be predicted and used for determining transferability.

図１２はこのような場合の動作を説明するためのシステム構成の一例である。 FIG. 12 shows an example of a system configuration for explaining the operation in such a case.

１０は中央処理装置（ＣＰＵ）でシステム全体の各装置の制御を担当している。９は主記憶装置であり、この主記憶装置９に格納されているオペレーティングシステム９０によりシステム全体のソフトウェアが制御されている。１は中央処理装置１０と主記憶装置９と各デバイスとを接続しているバスであり、複数のバスを接続するためのバス結合装置３０ｇ、３１ｇによってそれぞれバス２０ｇ、２１ｇに接続されている。さらに、バス２０ｇ、２１ｇはバス結合装置３０ｈ、３１ｈによってバス２ｈに接続される。磁気ディスク装置６０ｈ、６１ｈはディスクコントローラ４ｈによってバス２ｈに接続され一つの記憶デバイスであり論理ボリューム８ｈを構成している。同様に、磁気ディスク装置６０ｉ、６１ｉはディスクコントローラ４ｉによってバス２１ｇに接続され一つの記憶デバイスであり論理ボリューム８ｉを構成している。デバイス５ｊはバス１に接続され、デバイス５０ｇはバス２０ｇに接続され、デバイス５１ｇは２１ｇに接続されている。 A central processing unit (CPU) 10 is in charge of controlling each device of the entire system. Reference numeral 9 denotes a main storage device, and software of the entire system is controlled by an operating system 90 stored in the main storage device 9. Reference numeral 1 denotes a bus that connects the central processing unit 10, the main storage device 9, and each device, and is connected to the buses 20g and 21g by bus coupling devices 30g and 31g for connecting a plurality of buses, respectively. Furthermore, the buses 20g and 21g are connected to the bus 2h by bus coupling devices 30h and 31h. The magnetic disk devices 60h and 61h are connected to the bus 2h by the disk controller 4h and are one storage device and constitute a logical volume 8h. Similarly, the magnetic disk devices 60i and 61i are connected to the bus 21g by the disk controller 4i and are one storage device and constitute a logical volume 8i. The device 5j is connected to the bus 1, the device 50g is connected to the bus 20g, and the device 51g is connected to 21g.

通常、論理ボリューム８ｈ、８ｉおよびデバイス５０ｇ、５１ｇ、５ｊは１システムの中に複数接続されており、本実施の形態に係るデバイス間データ転送装置は論理ボリューム８ｈ、８ｉとデバイス５０ｇ、５１ｇ、５ｊの間で相互に直接データ転送を行うものである。 Normally, a plurality of logical volumes 8h, 8i and devices 50g, 51g, 5j are connected in one system, and the inter-device data transfer apparatus according to the present embodiment is logical volumes 8h, 8i and devices 50g, 51g, 5j. Data is directly transferred between each other.

論理ボリューム８ｈ、８ｉに蓄えられてあるデータは、通常は主記憶装置９に転送され中央処理装置１０で処理されるが、本実施の形態に係るデバイス間データ転送装置では、論理ボリューム８ｈ、８ｉに蓄えられてあるデータは主記憶装置を経由すること無くデバイス５０ｇ、５１ｇ、５ｊの内部にある記憶装置に転送され、５０ｇ、５１ｇ、５ｊ内部にあるＩ／Ｏプロセッサで処理するものである。 The data stored in the logical volumes 8h, 8i is normally transferred to the main storage device 9 and processed by the central processing unit 10, but in the inter-device data transfer apparatus according to the present embodiment, the logical volumes 8h, 8i The data stored in is transferred to a storage device inside the devices 50g, 51g, 5j without going through the main storage device, and is processed by an I / O processor inside the 50g, 51g, 5j.

本実施の形態では、物理記憶デバイスの単位で論理ボリュームが設定されている例を示しているが、複数の物理記憶デバイスにまたがる論理ボリュームを構成することも可能である。 In this embodiment, an example in which a logical volume is set in units of physical storage devices is shown, but it is also possible to configure a logical volume that spans a plurality of physical storage devices.

本実施の形態では、Ｉ／Ｏプロセッサ５１ｇやバス結合装置３０ｇや磁気ディスク装置６１ｈ、６１ｉが完全には使用不可能ではないがある確率でエラーとなるような間欠的に故障の状態にある。例えば、間欠的に故障の状態を示す故障レベルを不具合の発生する発生率とした場合、Ｉ／Ｏプロセッサ５１ｇの故障レベルを３０％、バス結合装置３０ｇの故障レベルを３０％、磁気ディスク装置６１ｈの故障レベルを２０％、磁気ディスク装置６１ｉの故障レベルを３０％であるものとする。なお、この故障レベルは、故障発生頻度値の例である。 In the present embodiment, the I / O processor 51g, the bus coupling device 30g, and the magnetic disk devices 61h and 61i are in an intermittent failure state that causes an error with a probability that they are not completely unusable. For example, when the failure level indicating the state of failure intermittently is a failure occurrence rate, the failure level of the I / O processor 51g is 30%, the failure level of the bus coupling device 30g is 30%, and the magnetic disk device 61h. The failure level is 20%, and the failure level of the magnetic disk device 61i is 30%. This failure level is an example of a failure occurrence frequency value.

図１３、図１４は、本実施の形態に関わる故障予見の方法を示すフローチャートである。 13 and 14 are flowcharts showing a failure prediction method according to this embodiment.

図１３は実施の形態２における図８とほぼ同じである。図１４は実施の形態２にける図１０とほぼ同じである。ただし、図１３においては、図８のＳ３４の累積計算の方法に最大値演算ＭＡＸを使用することとし、さらに演算結果を接続バスリストと参照したバス結合装置情報１１０の故障レベル１１５それぞれに記憶する（Ｓ３８）。つまり、Ｉ／Ｏプロセッサ接続バスリスト１５０の故障レベル累積１５２の値とバス結合装置情報１１０の故障レベル１１５の値とをＭＡＸ演算して、大きい値を新たな故障レベルの値とし、この新たな故障レベルの値をＩ／Ｏプロセッサ接続バスリスト１５０の故障レベル累積１５２とバス結合装置情報１１０の故障レベル１１５の双方に記憶する。このため、Ｉ／Ｏプロセッサ接続バスリスト１５０の故障レベル累積１５２の値が、バス結合装置情報１１０の故障レベル１１５の値よりも大きかった場合は、バス結合装置情報１１０の故障レベル１１５の値がＩ／Ｏプロセッサ接続バスリスト１５０の故障レベル累積１５２の値に書き換わり、この結果、バス結合装置情報１１０の故障レベル１１５の値が増加することになる。 FIG. 13 is substantially the same as FIG. 8 in the second embodiment. FIG. 14 is almost the same as FIG. 10 in the second embodiment. However, in FIG. 13, the maximum value operation MAX is used for the cumulative calculation method in S34 of FIG. 8, and the operation result is stored in each failure level 115 of the bus coupler information 110 referred to as the connected bus list. (S38). That is, the value of the failure level accumulation 152 of the I / O processor connection bus list 150 and the value of the failure level 115 of the bus coupler information 110 are subjected to MAX operation, and a larger value is set as a new failure level value. The value of the failure level is stored in both the failure level accumulation 152 of the I / O processor connection bus list 150 and the failure level 115 of the bus coupler information 110. Therefore, if the value of the failure level accumulation 152 in the I / O processor connection bus list 150 is larger than the value of the failure level 115 in the bus coupler information 110, the value of the fault level 115 in the bus coupler information 110 is The value of the failure level accumulation 152 in the I / O processor connection bus list 150 is rewritten, and as a result, the value of the failure level 115 in the bus coupler information 110 is increased.

同様に、図１４においては、図１０のＳ３５の累積計算の方法に最大値演算ＭＡＸを使用することとし、さらに演算結果を参照したバス結合装置情報１１０の故障レベル１１５に記憶する（Ｓ５２）。 Similarly, in FIG. 14, the maximum value operation MAX is used in the cumulative calculation method of S35 of FIG. 10, and the operation result is stored in the failure level 115 of the bus coupler information 110 referring to the operation result (S52).

図１２の構成および故障レベルの値の場合、実施の形態２の図８、図１０の方法では、論理ボリューム８ｈとＩ／Ｏプロセッサ５ｊの間は、規定の故障レベル未満であるため転送可能と判断されるが、本実施の形態の図１３、図１４のように既に設定されている故障レベルの値を元に、新たに故障レベルを設定することによって、バス結合装置３０ｈ、３１ｈに関わるバス結合装置情報１１０の故障レベル１１５に（図１４のＳ５２により）値２０が設定され、バス結合装置３１ｇに関わるバス結合装置情報１１０の故障レベル１１５に値３０が設定される。これにより論理ボリューム８ｈとＩ／Ｏプロセッサ５ｊの間は、規定の故障レベルを超えるため転送不可能と判断される。 In the case of the configuration of FIG. 12 and the value of the failure level, in the method of FIGS. 8 and 10 of the second embodiment, transfer is possible between the logical volume 8h and the I / O processor 5j because it is less than the prescribed failure level. Although determined, the buses related to the bus coupling devices 30h and 31h are set by newly setting the failure level based on the value of the failure level already set as shown in FIGS. 13 and 14 of the present embodiment. The value 20 is set to the failure level 115 of the coupling device information 110 (by S52 in FIG. 14), and the value 30 is set to the failure level 115 of the bus coupling device information 110 related to the bus coupling device 31g. As a result, it is determined that transfer between the logical volume 8h and the I / O processor 5j is impossible because the specified failure level is exceeded.

このように、本実施の形態に係るデバイス間データ転送装置及びデバイス間データ転送方法は、故障レベルを有するボリューム管理テーブルにおいて、下位のバスのデバイス及びバス結合装置の故障レベルの値により、上位バス結合装置の故障レベルの値を予め設定することにより、故障の度合いが高いブロック全体にアクセスすることを制限し、故障時のエラーを未然に回避することを特徴とする。 As described above, the inter-device data transfer apparatus and the inter-device data transfer method according to the present embodiment, in the volume management table having a failure level, use the upper bus according to the failure level value of the lower bus device and the bus coupling device. By setting the value of the failure level of the coupling device in advance, access to the entire block having a high degree of failure is restricted, and an error at the time of failure is avoided in advance.

以上のように、Ｉ／Ｏデバイス、磁気ディスク装置、バス結合装置それぞれの装置の故障レベルを元に、新たにバス結合装置の故障レベルを設定するようにしているので、バス結合装置を経由する装置全体に関して故障のレベルを予め設定しエラーを未然に回避することができる。 As described above, the failure level of the bus coupling device is newly set based on the failure level of each of the I / O device, the magnetic disk device, and the bus coupling device. An error level can be avoided in advance by setting a failure level for the entire apparatus.

実施の形態１に係るシステム構成例を示す図。FIG. 3 is a diagram illustrating an example of a system configuration according to the first embodiment. 実施の形態１に係るオペレーティングシステムとボリューム管理およびデータ転送制御の関係を示した構成図。FIG. 3 is a configuration diagram showing the relationship between the operating system, volume management, and data transfer control according to the first embodiment. 実施の形態１に係るボリューム管理テーブルの例を示す図。FIG. 4 is a diagram showing an example of a volume management table according to the first embodiment. 実施の形態１に係る転送経路の転送可能性を識別する手順を示したフローチャート図。FIG. 3 is a flowchart showing a procedure for identifying transferability of a transfer path according to the first embodiment. 実施の形態１に係る転送経路の転送可能性を識別する手順を示したフローチャート図。FIG. 3 is a flowchart showing a procedure for identifying transferability of a transfer path according to the first embodiment. 実施の形態２に係るシステム構成例を示す図。FIG. 4 is a diagram illustrating an example of a system configuration according to a second embodiment. 実施の形態２に係るボリューム管理テーブルの例を示す図。FIG. 10 is a diagram showing an example of a volume management table according to the second embodiment. 実施の形態２に係る転送経路の転送可能性を識別する手順を示したフローチャート図。The flowchart figure which showed the procedure which identifies the transfer possibility of the transfer path | route which concerns on Embodiment 2. FIG. 実施の形態２に係るＩ／Ｏプロセッサ接続バスリストの例を示す図。FIG. 10 is a diagram showing an example of an I / O processor connection bus list according to the second embodiment. 実施の形態２に係る転送経路の転送可能性を識別する手順を示したフローチャート図。The flowchart figure which showed the procedure which identifies the transfer possibility of the transfer path | route which concerns on Embodiment 2. FIG. 実施の形態２に係るシステム構成例における転送可能性を示す図。The figure which shows the transfer possibility in the system configuration example concerning Embodiment 2. FIG. 実施の形態３に係るシステム構成例を示す図。FIG. 10 is a diagram showing a system configuration example according to the third embodiment. 実施の形態３に係る転送経路の転送可能性を識別する手順を示したフローチャート図。FIG. 10 is a flowchart showing a procedure for identifying transferability of a transfer path according to the third embodiment. 実施の形態３に係る転送経路の転送可能性を識別する手順を示したフローチャート図。FIG. 10 is a flowchart showing a procedure for identifying transferability of a transfer path according to the third embodiment. 従来技術に係るシステム構成例を示す図。The figure which shows the system structural example which concerns on a prior art. 従来技術に係るボリューム管理テーブルを示す図。The figure which shows the volume management table which concerns on a prior art. 従来技術に係る冗長性管理テーブルを示す図。The figure which shows the redundancy management table which concerns on a prior art.

Explanation of symbols

１バス、２バス、３バス結合装置、４ディスクコントローラ、５Ｉ／Ｏプロセッサ、６物理ディスク、７物理記憶デバイス、９主記憶装置、１０中央処理装置、５０Ｉ／Ｏプロセッサ、５１Ｉ／Ｏプロセッサ、９０オペレーティングシステム、９１ボリューム構成テーブル、９２ボリューム管理、９３データ転送制御、９４ボリューム管理テーブル、９５バス情報、９６デバイスドライバ。 1 bus, 2 bus, 3 bus coupling device, 4 disk controller, 5 I / O processor, 6 physical disk, 7 physical storage device, 9 main storage device, 10 central processing unit, 50 I / O processor, 51 I / O Processor, 90 operating system, 91 volume configuration table, 92 volume management, 93 data transfer control, 94 volume management table, 95 bus information, 96 device driver.

Claims

A plurality of storage devices and a plurality of data processing devices are joined via a plurality of transfer paths and at least one transfer path coupling device, and between the storage device and the data processing device via a transfer path and a transfer path coupling device. In a device-to-device data transfer device that transfers data,
A table storage unit that stores information about a failure of the storage device, information about a failure of the data processing device, and a table showing information about the failure of the transfer path coupling device;
Whether data transfer between the storage device and the data processing device is possible based on the storage device failure information, the data processing device failure information, and the transfer path coupling device failure information shown in the table. An inter-device data transfer apparatus comprising: a data transfer control unit for identifying the device.

The table further includes
The physical location of the storage device, the physical location of the data processing device and the data transfer efficiency of the transfer path;
The data transfer control unit
Information on storage device failure, information on data processing device failure, information on transfer path coupling device failure, physical location of storage device and data processing device stored in table 2. The inter-device data transfer apparatus according to claim 1, wherein whether or not data transfer is possible between the storage device and the data processing device is identified based on the position and the data transfer efficiency of the transfer path.

The table further includes
The physical location of the storage device, the physical location of the data processing device and the physical location of the transfer path combiner;
The data transfer control unit
Based on the physical location of the storage device, the physical location of the data processing device, and the physical location of the transfer path coupling device shown in the table, the storage device and the data processing device to be identified for data transfer availability Extracting a transfer path coupling device used for data transfer between, information relating to the failure of the storage device that is subject to identification of whether data transfer is possible, information relating to the failure of the data processing device that is subject to identification of whether data transfer is possible, The inter-device data transfer apparatus according to claim 1, wherein whether or not data transfer is possible between the storage device and the data processing device is identified based on the extracted information relating to the failure of the transfer path coupling apparatus.

The table above
Information relating to the failure of the storage device indicates a failure flag indicating the presence or absence of a failure of the storage device, information relating to the failure of the data processing device indicates a failure flag indicating the presence or absence of a failure of the data processing device, Indicates a failure flag indicating whether or not there is a failure in the transfer path coupling device,
The data transfer control unit
At least one of the failure flag of the storage device subject to identification of data transfer availability, the failure flag of the data processing device subject to identification of data transfer availability, and the failure flag of the extracted transfer path coupling device The inter-device data transfer apparatus according to claim 3, wherein it is determined that data transfer between the storage device and the data processing device is impossible.

The table above
A failure occurrence frequency value indicating the occurrence frequency of the failure of the storage device is indicated as information relating to the failure of the storage device, a failure occurrence frequency value indicating the occurrence frequency of the failure of the data processing device is indicated as information relating to the failure of the data processing device, and the transfer path As the information about the failure of the coupling device, a failure frequency value indicating the frequency of occurrence of the failure of the transfer path coupling device is indicated,
The data transfer control unit
Failure occurrence frequency value of the storage device subject to identification of data transfer availability, failure occurrence frequency value of the data processing device subject to identification of data transfer availability, failure occurrence frequency value of the extracted transfer path coupling device 4. The inter-device data transfer apparatus according to claim 3, wherein when the accumulated calculated value of the data exceeds a predetermined threshold, it is determined that data transfer between the storage device and the data processing device is impossible. .

The table above
A failure occurrence frequency value indicating the occurrence frequency of the failure of the storage device is indicated as information relating to the failure of the storage device, a failure occurrence frequency value indicating the occurrence frequency of the failure of the data processing device is indicated as information relating to the failure of the data processing device, and the transfer path As the information about the failure of the coupling device, a failure frequency value indicating the frequency of occurrence of the failure of the transfer path coupling device is indicated,
The data transfer control unit
Based on the physical location of the storage device, the physical location of the data processing device, and the physical location of the transfer path coupling device shown in the table, the storage device and the data processing device to be identified for data transfer availability A plurality of transfer path coupling devices used for data transfer between them are sequentially extracted, and for each of the extracted transfer path coupling devices, a failure frequency value of a storage device that is a target of data transfer availability identification and data transfer availability identification A new failure occurrence frequency value is set using at least one of the failure occurrence frequency value of the data processing device subject to the transfer and the failure occurrence frequency value of the transfer path coupling device previously extracted, and the object of data transfer availability identification The failure frequency value of the storage device to be extracted, the failure frequency value of the data processing device that is the object of identification of data transfer availability, and the extracted If the cumulative calculated value of each of the plurality of transfer path coupling devices and the new failure frequency value exceeds a predetermined threshold value, it is determined that data transfer between the storage device and the data processing device is impossible. The inter-device data transfer apparatus according to claim 3.

About a plurality of storage devices and a plurality of data processing devices joined via a plurality of transfer paths and at least one transfer path coupling device, between the storage devices and the data processing devices via the transfer path and the transfer path coupling device In a device-to-device data transfer method for transferring data,
Information relating to storage device failure, information relating to data processing device failure, information relating to failure of transfer path coupling device, information relating to storage device failure, information relating to failure of data processing device, and transfer A failure information acquisition step for acquiring information on a failure of the path coupling device;
Data transfer between the storage device and the data processing device based on the information on the failure of the storage device acquired by the failure information acquisition step, the information on the failure of the data processing device, and the information on the failure of the transfer path coupling device A data transfer method comprising: a data transfer control step for identifying whether or not data transfer is possible.

A plurality of storage devices and a plurality of data processing devices joined via a plurality of transfer paths and at least one transfer path coupling device, between the storage device and the data processing device via the transfer path and the transfer path coupling device In a program that causes a computer to execute data transfer,
Information relating to storage device failure, information relating to data processing device failure, information relating to failure of transfer path coupling device, information relating to storage device failure, information relating to failure of data processing device, and transfer Failure information acquisition processing for acquiring information related to the failure of the path coupling device;
Data transfer between the storage device and the data processing device based on the information on the failure of the storage device acquired by the failure information acquisition processing, the information on the failure of the data processing device, and the information on the failure of the transfer path coupling device A program for causing a computer to execute a data transfer control process for identifying whether or not the data can be transferred.