JP5733515B2

JP5733515B2 - Embedded equipment with RAS function

Info

Publication number: JP5733515B2
Application number: JP2011098181A
Authority: JP
Inventors: 正太植原; 大野　毅; 毅大野
Original assignee: Yokogawa Electric Corp
Current assignee: Yokogawa Electric Corp
Priority date: 2011-04-26
Filing date: 2011-04-26
Publication date: 2015-06-10
Anticipated expiration: 2031-04-26
Also published as: JP2012230533A

Description

The present invention relates to an embedded device with a RAS function, in which a plurality of applications use a plurality of physical devices constituting the hardware via device drivers of the system software, and in which a RAS function is implemented for the applications and the hardware.

RAS (Reliability, Availability, Serviceability) is a concept that combines three functions with independent meanings: system reliability, availability, and serviceability. Implementing RAS in embedded devices is a well-known technique. Patent Document 1 discloses a technique in which an interface with a RAS function, connected to a computer system by a bus, monitors abnormalities of devices connected to the computer.

The RAS function refers to functions that provide system monitoring, early detection of abnormalities, determination of the failure state, and rapid repair and recovery, with the aim of preventing the computer system from stopping due to a failure as far as possible. The RAS function provides the following functions.

(1) System abnormality monitoring.
(a) Monitoring the operation of the hardware required to run applications.
(i) Monitoring hardware used directly by applications, for example the operation of the CPU, SDRAM, CF card, and serial communication.
(ii) Monitoring the environment that can adversely affect hardware operation, for example the power supply battery voltage, current, ambient temperature, dust, and degree of corrosion.
(b) Monitoring software operation with a watchdog timer (hereinafter WDT).
(2) Recording the operating state of the system when an abnormality occurs.
(3) Handling processing when a system abnormality occurs, for example restarting an I/O device or stopping the system.

FIG. 10 is a functional block diagram showing a configuration example of a conventional embedded device with a RAS function. Its basic components are hardware 100, system software 200, and application software 300.

The hardware 100 includes a physical device group 101 made up of a plurality of physical devices, including a CPU. The system software 200 includes a boot loader 210 and an OS 220. The application software 300 includes applications 301 to 30N that use the physical devices.

The initialization diagnosis program 213 is implemented in the device driver A 211 of the boot loader 210. It is started by the device initialization means 212 and executes diagnosis processing for the physical devices when the hardware is initialized.

The initialization diagnosis program 213 performs diagnosis processing specific to each physical device that requires diagnosis at initialization. When an abnormality is detected, the abnormality handling program A 214 executes the abnormality handling processing corresponding to the initialization diagnosis.

The in-use abnormality detection program 223, implemented in the device driver B 221 of the OS 220, is started by the access means 222 when a physical device is used in response to a request from the applications 301 to 30N, and executes abnormality detection processing at the time the application uses the physical device. When an abnormality is detected, the abnormality handling program B 224 executes the abnormality handling processing corresponding to the in-use diagnosis.

The fixed-cycle diagnosis program 231, implemented in the device driver B 221 of the OS 220, is started at a fixed cycle by the timer 230 of the OS 220. For physical devices that applications do not use directly or use only infrequently, it executes diagnosis processing specific to each physical device that requires fixed-cycle diagnosis. When an abnormality is detected, the abnormality handling program C 232 executes the abnormality handling processing.

The device driver B 221 includes a WDT driver 240 for accessing the WDT 102 mounted on the hardware 100.

The physical WDT 102 mounted on the hardware 100 is provided to monitor the monitored application 310 via the WDT driver 240. The physical WDT 102 is reset at a fixed cycle, via the WDT driver 240, by the WDT reset means 310A implemented in the monitored application 310 itself.

Next, an outline of the operation is described. The RAS function diagnoses and monitors the operating state of the hardware and applications that make up the embedded device, and thereby improves the reliability and dependability of the system.

When diagnosis or monitoring detects an abnormality, individually implemented handling processing is performed according to the physical device or application in which the abnormality occurred and the severity of the abnormality. The RAS function operates at the following four timings (1) to (4).

(1) Hardware diagnosis at physical device initialization during system startup covers time-consuming diagnoses and diagnoses that would adversely affect the system if run while it is online (for example, zero-clearing the entire memory area).

The initialization diagnosis processing is executed by the initialization diagnosis program 213 implemented in the device driver of the boot loader 210. For devices that require diagnosis at startup, the boot loader 210 performs the initialization diagnosis processing implemented individually for each physical device. When an abnormality is detected, the abnormality handling program A 214 performs handling processing individually, according to the location and severity of the detected abnormality.

(2) Hardware diagnosis executed when an application uses a physical device is limited to diagnoses that do not adversely affect the system, because the system is online.

The in-use abnormality detection processing is executed by the in-use abnormality detection program 223 implemented in the device driver B 221 built into the OS 220. When an application accesses a physical device, the device driver B 221 performs in-use abnormality detection processing individually for each physical device the application accesses. When an abnormality is detected, the abnormality handling program B 224 performs handling processing individually, according to the location and severity of the detected abnormality.

(3) Fixed-cycle diagnosis of the hardware operating environment checks, at a fixed cycle, devices that applications use infrequently and ambient conditions that can adversely affect the system.

In the fixed-cycle diagnosis processing, the timer 230 of the OS 220 calls, at a fixed cycle, the fixed-cycle diagnosis program 231 implemented in the device driver B 221, which diagnoses each physical device and environmental sensor (such as a temperature sensor). When an abnormality is detected, the abnormality handling program C 232 performs handling processing individually, according to the location and severity of the detected abnormality.

(4) Software operation monitoring uses the physical WDT 102 mounted on the hardware 100 to monitor whether an application is operating normally. On the OS side, access to the WDT is provided by the WDT driver 240. The WDT reset means 310A, which realizes the WDT-based monitoring, is implemented in the monitored application 310 itself.

The application 310 resets the counter of the physical WDT 102 at a fixed cycle via the WDT driver 240. If an abnormality occurs in the application 310 and the physical WDT 102 is no longer reset, the physical WDT 102 times out and hardware-level handling, such as a forced hardware reset, is performed.
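
The conventional watchdog behaviour described above can be illustrated with a small simulation. This is only a sketch under assumptions: the class name `PhysicalWdt`, the tick-based clock, and the `on_timeout` callback are hypothetical stand-ins for hardware behaviour and do not appear in the patent.

```python
class PhysicalWdt:
    """Toy model of a hardware watchdog counter (hypothetical, for illustration)."""

    def __init__(self, timeout_ticks, on_timeout):
        self.timeout_ticks = timeout_ticks
        self.on_timeout = on_timeout  # stands in for a forced hardware reset
        self.counter = 0
        self.expired = False

    def kick(self):
        """Reset the counter; called by the monitored application via the WDT driver."""
        self.counter = 0

    def tick(self):
        """Advance the counter by one tick and fire the timeout action once on expiry."""
        if self.expired:
            return
        self.counter += 1
        if self.counter >= self.timeout_ticks:
            self.expired = True
            self.on_timeout()


def run(app_alive_ticks, total_ticks, timeout_ticks=3):
    """Simulate an application that kicks the WDT every tick until it hangs."""
    events = []
    wdt = PhysicalWdt(timeout_ticks, lambda: events.append("hardware reset"))
    for t in range(total_ticks):
        if t < app_alive_ticks:
            wdt.kick()
        wdt.tick()
    return events
```

Calling `run(10, 10)` models a healthy application and records no event, while `run(2, 10)` models an application that hangs after two ticks and records a single hardware reset. The only reaction available in this model is the hardware-level one, which is the limitation the text goes on to discuss.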

Patent Document 1: JP 2009-015472 A

Embedded devices with a RAS function of the conventional configuration have the following problems.
(1) The RAS function is implemented in scattered locations across the system. Even for diagnosis of the same physical device, the processing differs depending on the required diagnosis timing and content.

For this reason, the system software modules that call the RAS function, such as the boot loader and the OS, each need to implement their own RAS function. As a result, RAS functions are developed separately even where functionality partly overlaps between modules, and are implemented scattered across the modules.

(2) The RAS function is implemented per device, as a function dedicated to the embedded device on which it is installed.
Requirements for the RAS function differ according to the intended use and operation policy of each device. Specifically, if the physical devices mounted on the device differ, the physical devices to be monitored and the abnormality detection and handling processing also differ.

The handling processing when an abnormality is detected also differs by operation policy: for example, isolating the abnormal part and continuing system operation (operation that prioritizes availability), or stopping immediately when even a minor abnormality is detected (operation that prioritizes safety).

The implementation of the RAS function depends on the model (the hardware specifications of the physical devices). Even when the procedures for abnormality detection and handling are the same, the access procedure differs for each physical device, so the implemented processing differs.

(3) In application monitoring using a WDT, the monitoring target and the handling processing are restricted. A single WDT provided by the hardware can monitor only one application process. When the WDT detects an abnormality in the monitored application, the handling is limited to hardware-level handling such as a CPU reset.

(4) The handling processing performed when an abnormality is detected is confined to the location where it was detected. Because abnormality detection processing is implemented individually for each physical device, abnormalities that occur in combination cannot be detected.

Because abnormality handling processing corresponds to the abnormality detection processing of each physical device and is implemented individually depending on the hardware specifications, integrated abnormality handling across multiple hardware and software components is not possible.

The first object of the present invention is to solve problem (1) by realizing a configuration in which the boot loader, the OS, and the applications depend on the RAS function as little as possible, and to improve the reliability and dependability of the RAS function by implementing it in one place.

The second object of the present invention is to solve problem (2) by satisfying the requirements for the RAS function with a common framework that does not depend on the model of the embedded device on which the RAS function is installed.

The third object of the present invention is to solve problem (3) by enabling monitoring of a plurality of applications and by allowing the handling processing on abnormality detection to be specified individually for each application.

The fourth object of the present invention is to solve problem (4) by realizing an integrated abnormality handling function that handles abnormalities occurring in combination.

To achieve these objects, the present invention is configured as follows.
(1) An embedded device with a RAS function, in which a plurality of applications use a plurality of physical devices constituting hardware via device drivers of system software, and in which a RAS function is implemented for the applications and the hardware, wherein
a logical RAS function unit that integrates and centrally manages the RAS functions of the plurality of applications and the hardware is implemented in a hypervisor layer interposed between the system software and the hardware,
the logical RAS function unit includes logical device abnormality detection logic for logical devices into which the plurality of physical devices are classified in a predetermined number, and
the logical device abnormality detection logic is started when the physical devices are initialized from the boot loader of the system software and when they are accessed from a device driver of the OS of the system software, and executes abnormality detection diagnosis for the logical devices.

(2) The embedded device with a RAS function according to (1), comprising:
abnormality detection processing means that acquires abnormality detection information from the logical device abnormality detection logic and detects abnormalities of the physical devices of the hardware;
integrated abnormality handling logic that integrates the abnormality detection information from the logical device abnormality detection logic with abnormality detection information of the applications and executes abnormality processing; and
abnormality handling processing means that executes abnormality handling processing for the physical devices of the hardware based on abnormality handling information from the integrated abnormality handling logic.

(3) The embedded device with a RAS function according to (2), wherein
the logical RAS function unit includes a logical watchdog timer group corresponding to the plurality of applications and software processing abnormality detection logic,
the logical watchdog timer group monitors the plurality of applications by being reset by reset requests from fixed-cycle reset request means implemented in the plurality of applications, and outputs information on any logical watchdog timer that has timed out to the software processing abnormality detection logic, and
the software processing abnormality detection logic requests abnormality handling processing from the integrated abnormality handling logic.

(4) The embedded device with a RAS function according to (3), wherein the integrated abnormality handling logic, on acquiring abnormality information from the software processing abnormality detection logic, notifies the application management means of the system software and requests handling processing.

(5) The embedded device with a RAS function according to any one of (2) to (4), wherein the integrated abnormality handling logic, based on abnormality levels and abnormality handling processes for abnormality locations, each defined in a plurality of types, sets the association between abnormality level and abnormality handling process for each abnormality location, and also sets the dependencies among devices and software and the composite abnormality handling processing.

(6) The embedded device with a RAS function according to any one of (1) to (5), wherein the logical RAS function unit includes a device fixed-cycle diagnosis timer for the logical device abnormality detection logic and physical watchdog timer reset means for the watchdog timer provided in the hardware.

(7) The embedded device with a RAS function according to (3) or (4), comprising:
a first database that provides logical RAS function setting information to the logical device abnormality detection logic and the logical watchdog timers; and
a second database that provides past abnormality history information to the logical device abnormality detection logic and the software processing abnormality detection logic.

(8) The embedded device with a RAS function according to (7), wherein the content of the abnormality processing performed by the integrated abnormality handling logic is recorded in the abnormality history of the second database.

According to the present invention, the following effects can be expected.
(1) Providing the RAS function in an integrated manner through the logical RAS function unit achieves centralized management of the RAS function. Processing that was conventionally scattered across individual applications is consolidated into the logical RAS function unit of the hypervisor layer, which reduces modifications to the upper-layer applications that use physical devices and provides an environment in which the RAS function is easy to implement.

(2) Abstracting physical devices into functionally classified logical devices absorbs the differences between models. The physical devices that require a RAS function are abstracted into the five types of logical device shown in Table 1 of FIG. 2, and the RAS function is provided for the logical functions of those logical devices, independent of differences between physical devices. This removes the need to implement the RAS function specifically for each model.

Separating the RAS function for each physical device into the logical-device logic and the processing specific to the physical device makes it possible to share the diagnosis logic and to consolidate the physical-device-specific processing.
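
The separation between shared logical-device logic and physical-device-specific processing can be sketched as follows. This is an illustration under assumptions, not the patent's implementation: Table 1 of FIG. 2 is not reproduced in this text, so the logical device name "memory", the status codes, and all class names are hypothetical.

```python
from abc import ABC, abstractmethod


class PhysicalProbe(ABC):
    """Physical-device-specific part: how to read a raw status from one device."""

    @abstractmethod
    def read_status(self) -> int:
        ...


class LogicalDeviceDiagnosis:
    """Shared logical part: one diagnosis routine reused with any probe."""

    def __init__(self, name, probe, ok_values):
        self.name = name
        self.probe = probe
        self.ok_values = set(ok_values)

    def diagnose(self):
        """Return (logical device name, True if an abnormality was detected)."""
        status = self.probe.read_status()
        return self.name, status not in self.ok_values


class FakeMemoryProbe(PhysicalProbe):
    """Stand-in for a model-specific driver; returns a fixed status code."""

    def __init__(self, status):
        self._status = status

    def read_status(self) -> int:
        return self._status
```

With this split, supporting a different model would mean writing another `PhysicalProbe` subclass while `LogicalDeviceDiagnosis` stays unchanged, which is the kind of sharing the paragraph above describes.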

Because the RAS function is simplified to one that covers five abstracted types of logical device, it can be realized in the hypervisor layer without requiring mechanisms such as an OS or a ROM monitor.

(3) The logical WDT group makes it possible to monitor a plurality of application processes. Because the logical RAS function unit provides a group of logical WDTs, each of several application processes can be monitored.

In combination with the integrated abnormality handling logic, fine-grained abnormality handling beyond a hardware reset becomes possible. In addition, realizing the logical WDT group in the hypervisor layer confines the software that affects the reliability of the software timers to a small portion.
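
A group of logical watchdog timers, one per monitored application, can be sketched as below. This is a minimal model under assumptions: the class `LogicalWdtGroup`, the tick-based timing, and the application names are hypothetical, and the returned list merely stands in for the timeout information passed on to the software processing abnormality detection logic.

```python
class LogicalWdtGroup:
    """One software watchdog counter per monitored application (illustrative)."""

    def __init__(self, timeouts):
        # timeouts: mapping of application name -> timeout in ticks
        self.timeouts = dict(timeouts)
        self.counters = {app: 0 for app in self.timeouts}
        self.expired = set()

    def reset(self, app):
        """Handle a periodic reset request from an application."""
        if app not in self.expired:
            self.counters[app] = 0

    def tick(self):
        """Advance all timers; return the applications that timed out on this tick."""
        newly_expired = []
        for app in self.timeouts:
            if app in self.expired:
                continue
            self.counters[app] += 1
            if self.counters[app] >= self.timeouts[app]:
                self.expired.add(app)
                newly_expired.append(app)
        return newly_expired


def simulate(timeouts, alive_until, total_ticks):
    """Each app resets its timer every tick until alive_until[app]; collect timeouts."""
    group = LogicalWdtGroup(timeouts)
    report = []
    for t in range(total_ticks):
        for app, limit in alive_until.items():
            if t < limit:
                group.reset(app)
        for app in group.tick():
            report.append((t, app))
    return report
```

For example, `simulate({"app401": 2, "app402": 3}, {"app401": 100, "app402": 1}, 6)` reports only `app402` as timed out, at tick 2. Unlike a single hardware WDT, a timeout here identifies which application failed, so the reaction can be chosen per application rather than being a hardware reset.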

(4) The integrated abnormality handling logic makes composite abnormality handling possible. The abnormality level for a location where an abnormality occurs is defined as one of the three levels shown in Table 3 of FIG. 2, and the abnormality handling items associated with those levels are unified into the four types shown in Table 4 of FIG. 2. This makes it possible to share the abnormality handling logic and to consolidate the physical-device-specific processing.

Combining abnormality levels and abnormality handling items for each abnormality location allows fine-grained abnormality handling. Limiting the abnormality levels and the types of handling processing also simplifies the processing, so that the RAS function can be realized in the hypervisor layer. Furthermore, integrating the abnormality handling allows composite handling for a plurality of devices and applications that have dependencies on one another.
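
The combination of abnormality location, abnormality level, and handling item can be pictured as a lookup table plus a dependency map. The sketch below is hypothetical throughout: Tables 3 and 4 of FIG. 2 are not reproduced in this text, so the level names, handling item names, locations, policy entries, and the rule for propagating to dependents are all assumptions made for illustration.

```python
# Hypothetical names: the patent defines three levels and four handling items
# in Tables 3 and 4 of FIG. 2, which are not reproduced in this text.
LEVELS = ("minor", "moderate", "severe")
ACTIONS = ("log_only", "restart_device", "isolate", "stop_system")

# Assumed per-location policy: (location, level) -> handling item.
POLICY = {
    ("serial", "minor"): "log_only",
    ("serial", "moderate"): "restart_device",
    ("serial", "severe"): "isolate",
    ("storage", "severe"): "stop_system",
}

# Assumed dependency map: location -> components that depend on it.
DEPENDENTS = {"serial": ["app401"]}

DEFAULT_ACTION = "log_only"


def plan_handling(location, level):
    """Return a list of (target, action) pairs for one detected abnormality."""
    if level not in LEVELS:
        raise ValueError(f"unknown abnormality level: {level}")
    action = POLICY.get((location, level), DEFAULT_ACTION)
    plan = [(location, action)]
    # Assumption: dependents are affected only when the faulty location
    # is taken out of service.
    if action in ("isolate", "stop_system"):
        plan += [(dep, action) for dep in DEPENDENTS.get(location, [])]
    return plan
```

Because the policy is data rather than code inside each driver, changing an operation policy (for example from availability-oriented to safety-oriented) would only require editing the table in this model.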

FIG. 1 is a functional block diagram showing an embodiment of an embedded device with a RAS function to which the present invention is applied.
FIG. 2 shows Tables 1 to 4 of the logical RAS setting information.
FIG. 3 shows Tables 5 to 8 of the logical RAS setting information, Table 9 with an example of the past abnormality history, and Table 10 with an example of the system state at the time of an abnormality.
FIG. 4 is a flowchart showing the transition to the abnormality detection logic in the physical device initialization processing.
FIG. 5 is a flowchart showing the transition from access to a physical device to the abnormality detection logic.
FIG. 6 is a flowchart showing the operation of the abnormality detection logic for a logical device.
FIG. 7 is a flowchart showing the operation of software processing monitoring and the abnormality detection logic.
FIG. 8 is a flowchart showing the operation of the integrated abnormality handling logic.
FIG. 9 is a flowchart showing the operation of the abnormality handling item search.
FIG. 10 is a functional block diagram showing a configuration example of a conventional embedded device with a RAS function.

The present invention is described in detail below with reference to the drawings. FIG. 1 is a functional block diagram showing an embodiment of an embedded device with a RAS function to which the present invention is applied. Elements identical to those of the conventional configuration described with reference to FIG. 10 are given the same reference numerals, and their description is omitted.

In FIG. 1, the applications 401, 402, 403, and 404 of the application software 400 use the physical device group 101 of the hardware 100 via the device drivers of the system software 500.

The structural feature of the present invention is that a hypervisor layer 600 is interposed between the hardware 100 and the boot loader 510 and OS 520 of the system software 500, and that the logical RAS function unit 620 implemented in this hypervisor layer provides a mechanism for centrally managing the RAS functions of the application software and the hardware.

The configuration and operation of the elements related to the logical RAS function of the present invention are described below. The physical device relay interface 610 in the hypervisor layer 600 is a functional block that monitors access to the physical device group 101 via the system software 500 and causes the logical RAS function to be executed at the required timing. It provides the following two interfaces, (1) and (2), to the upper-layer software that uses the physical devices.

(1) Physical device initialization interface 611:
This interface is used when initializing a physical device. If an abnormality occurs during initialization of a physical device, execution transitions to the logical RAS function. FIG. 4 is a flowchart showing the transition to the abnormality detection logic in the physical device initialization processing.

When processing starts in step S1, the physical device initialization interface 611 receives the signal c from the device initialization driver 512 in the various device drivers 511 of the boot loader 510 of the system software 500, and in step S2 outputs the initialization signal k to the physical device group 101 of the hardware 100.

If the check in step S3 finds that an abnormality occurred during initialization, the signal j is sent to the logical device abnormality detection logic 621 (described later), the abnormality detection logic is executed in step S4, and the physical device initialization processing ends in step S5. If the check in step S3 finds no abnormality during initialization, processing skips to step S5 and the physical device initialization processing ends.
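
Steps S1 to S5 above can be traced with a short sketch. It is a reading of the description, not the patent's code: the function name and the two callables, which stand in for the initialization signal k and the notification signal j, are hypothetical.

```python
def initialize_physical_device(init_device, run_abnormality_detection):
    """Trace steps S1-S5 of the initialization flow as read from the description.

    init_device: callable returning True on success (stands in for signal k).
    run_abnormality_detection: callable invoked on failure (stands in for signal j).
    """
    trace = ["S1:start"]
    ok = init_device()                 # S2: initialize the physical device
    trace.append("S2:init")
    trace.append("S3:check")           # S3: did initialization fail?
    if not ok:
        run_abnormality_detection()    # S4: run the abnormality detection logic
        trace.append("S4:detect")
    trace.append("S5:end")             # S5: end of initialization processing
    return trace
```

On success the trace skips S4; on failure the detection callable runs exactly once before S5. The caller in the boot loader only sees the initialization interface, so the RAS reaction stays inside the hypervisor layer in this model.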

(2) Physical device access interface 612:
This interface is used when an application accesses a physical device. When a physical device use request a from the applications 401, 402, and 403 is accepted by the various device drivers 511 of the OS 520 in the system software 500, the device access means 521 sends a use request signal e to the physical device access interface 612, and execution transitions to the logical RAS function unit 620 at a timing determined for each device.

FIG. 5 is a flowchart showing the transition from access to a physical device to the abnormality detection logic. In step S1, the request signal e is sent to the physical device access interface 612, and access to the physical device starts in step S2. If the check in step S3 determines that abnormality detection is required before the access, the signal h is output to the logical device abnormality detection logic 621 and abnormality detection is executed in step S4; the signal l is then output to the physical device group 101, and the processing requested of the physical device is executed in step S5.

If the check in step S3 determines that abnormality detection is not required, step S4 is skipped, the signal l is output to the physical device group 101, and the processing requested of the physical device is executed in step S5.

Further, if the check in step S6 determines that abnormality detection is required after the access, the signal h is output to the logical device abnormality detection logic 621 and abnormality detection is executed, after which the signal l is output to the physical device group 101 and the processing requested of the physical device is executed in step S7; the process ends in step S8. If the check in step S6 determines that abnormality detection is not required, step S7 is skipped and the process ends in step S8.
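The per-device timing of abnormality detection around an access (FIG. 5) can be sketched as follows. The `AccessPolicy` structure, the function name, and the callbacks are hypothetical; the sketch also simplifies the post-access branch to a detection call only, so it illustrates the before/after checks rather than reproducing every step of the flowchart.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class AccessPolicy:
    """Per-device timing of abnormality detection (hypothetical structure)."""
    detect_before_access: bool
    detect_after_access: bool


def access_physical_device(
    policy: AccessPolicy,
    detect_abnormality: Callable[[], None],
    perform_request: Callable[[], None],
) -> List[str]:
    """Run one device access and return a trace of what was done."""
    trace: List[str] = []
    if policy.detect_before_access:      # S3: detection needed before access?
        detect_abnormality()             # S4: signal h to logic 621
        trace.append("detect-before")
    perform_request()                    # S5: requested processing (signal l)
    trace.append("access")
    if policy.detect_after_access:       # S6: detection needed after access?
        detect_abnormality()             # signal h to logic 621
        trace.append("detect-after")
    return trace                         # S8: end
```

Keeping the timing in a per-device policy object mirrors the statement that the transition to the logical RAS function unit 620 happens "at a timing determined for each device".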

Next, the logical RAS function unit 620, which constitutes the main part of the present invention, will be described. The logical RAS function unit 620 is a functional block that provides logical RAS functions in the hypervisor layer 600.

The logical RAS function unit 620 comprises abnormality detection logic 621 for logical devices, which are obtained by classifying the physical device group 101 into a plurality of logical device types, a timer management unit 622, software processing abnormality detection logic 626, and integrated abnormality response processing logic 627.

The logical device abnormality detection logic 621 classifies each physical device that requests execution of the logical RAS function into one of five types, namely logical CPU, logical memory, logical file, logical I/O, and logical sensor, as shown in Table 1 of FIG. 2, and executes the diagnostics and inspections defined for each logical device.

A logical device is a device abstracted as having the logical functions shown in Table 2 of FIG. 2, and diagnosis and inspection items for verifying normal operation are defined for each logical function. When an abnormality is detected, one of the three abnormality levels shown in Table 3 of FIG. 2 (level 1, level 2, or level 3) is determined from the state of the abnormality and the past abnormality history, and the abnormality detection information is conveyed to the integrated abnormality response processing logic 627 by the signal p1.

The timer management unit 622 implements a logical WDT group 623, a device fixed-period diagnosis timer 624 for fixed-period diagnosis, and a physical WDT reset timer 625. The logical WDT group 623 realizes logical WDT functions in the hypervisor layer 600 with a plurality of software timers and provides them as a monitoring function for software processing. When an abnormality is detected, it is conveyed to the software processing abnormality detection logic 626 by the software abnormality information signal n.

The device fixed-period diagnosis timer 624 is the timer that triggers fixed-period diagnosis of the hardware; at a fixed period it sends the logical device abnormality detection logic 621 a signal m requesting diagnosis of the physical devices.

Because the timer management unit 622 is the functional block at the core of software monitoring, it monitors itself for abnormalities using the physical WDT 102. This is achieved by having the physical WDT reset timer 625 output, at a fixed period, a signal q requesting a reset of the physical WDT 102. The signal q is passed to the physical WDT reset unit 633 in the physical device specific processing unit 630 described later, and the signal u from the physical WDT reset unit 633 resets the physical WDT 102 of the hardware 100.

The software processing abnormality detection logic 626 determines one of the three abnormality levels shown in Table 3 of FIG. 2 (level 1, level 2, or level 3) from the software abnormality information signal n raised by the logical WDT group 623 and the past abnormality history data d5 retrieved from the second database 629, and conveys the abnormality detection information to the integrated abnormality response processing logic 627 by the signal p2.

The integrated abnormality response processing logic 627 receives information on the abnormality detection location and abnormality level from the logical device abnormality detection logic 621 and the software processing abnormality detection logic 626 via the signals p1 and p2, and executes abnormality response processing.

From the received abnormality detection information p1 and p2 and the logical RAS function setting data d3 obtained by accessing the first database 628, the integrated abnormality response processing logic 627 determines the abnormality location, the devices and software that depend on that location, and whether abnormalities have occurred simultaneously, and executes the abnormality response processing in a combined manner. Four types of abnormality response processing, 1 to 4, are provided, as shown in Table 4 of FIG. 2.

The first database 628 is a storage area that holds the setting information for realizing the logical RAS function. It holds the following setting information: the mapping between physical devices and logical devices shown in Table 1 of FIG. 2; the mapping between each logical device and the physical device specific processing shown in Table 2 of FIG. 2; the device and software dependencies shown in Table 5 of FIG. 3; the mapping between the abnormality level at an abnormality location and the abnormality response processing items shown in Table 6 of FIG. 3; the mapping of compound abnormality response processing items shown in Table 7 of FIG. 3; and the time-up times of the logical WDT timers 623 shown in Table 8 of FIG. 3.

The second database 629 is a storage area that records the history of previously detected abnormalities shown in Table 9 of FIG. 3 and the system state at the time of an abnormality shown in Table 10 of FIG. 3. The abnormality detection history is used to determine the abnormality level when an abnormality is detected. The system state at the time of an abnormality is used by maintenance personnel and others to review the state the system was in when the abnormality occurred.

The physical device specific processing unit 630 is a functional block that implements an abnormality detection processing unit 631 and an abnormality response processing unit 632 for each physical device. The abnormality detection processing unit 631 executes the physical device specific processing corresponding to the signal o from the logical device abnormality detection logic 621. The abnormality response processing unit 632 executes the physical device specific processing corresponding to the abnormality response information conveyed by the signal r from the integrated abnormality response processing logic 627.

Next, the operation of the embedded device of the present invention will be described in items (1) to (4) below.
(1) Interface for executing the logical RAS function from software:
The logical RAS function is executed when processing in the application software 400 accesses the physical device relay interface 610 of the hypervisor layer 600.

The physical device relay interface 610 has two kinds of interface, the physical device initialization interface 611 and the physical device access interface 612, and the logical RAS function is executed according to the operation flows of FIG. 4 and FIG. 5, respectively, as already described.

(2) Abnormality detection logic for logical devices:
FIG. 6 is a flowchart showing the operation of the abnormality detection logic for logical devices. When processing starts in step S1, the logical device abnormality detection logic 621 receives the signals h, j, and m in step S2.

In step S3, the logical device abnormality detection logic 621 refers to the data d1 obtained by accessing the first database 628 and, using the mapping recorded in the logical RAS function setting information, classifies the physical device requesting execution of the logical RAS function into one of the five logical device types.

Next, in step S4, it refers to the data d1, looks up the diagnosis and inspection items corresponding to the logical functions of each logical device, and selects the items that need to be executed. This selection is needed because the abnormality detection logic to be executed differs depending on the source of the logical RAS function request, that is, on which of the two interfaces provided by the physical device relay interface 610 is used.

This mapping is recorded in the logical RAS function setting information in the form shown in the example of Table 2 of FIG. 2. For the diagnosis and inspection items that need to be executed, the signal o is output, the corresponding physical device specific abnormality detection processing unit 631 is called in step S5, and the abnormality detection processing for the physical device group 101 is executed by the signal s.

If the check in step S6 detects an abnormality, the abnormality level is determined in step S7 from the severity of the abnormality and the past abnormality detection history, by referring to the data d4 obtained by accessing the second database 629. The abnormality history is updated with the data d4 in step S8, the abnormality detection location and abnormality level are conveyed to the integrated abnormality response processing logic 627 by the signal p1 in step S9, and the processing of the logical device abnormality detection logic ends in step S10. If the check in step S6 detects no abnormality, the process skips to step S10 and ends.

As a specific example, the operation of the abnormality detection logic when the device/software item AIO (Analog Input Output) is accessed is described. When the physical device access interface 612 for the AIO is accessed, an execution request is conveyed to the logical RAS function unit 620. The logical device abnormality detection logic 621 classifies the AIO as logical I/O according to Table 1 of FIG. 2, selects the AIO diagnosis and inspection items according to Table 2 of FIG. 2, and calls the AIO specific processing to perform the diagnosis.

If a fatal abnormality is detected here, the past abnormality detection history recorded in Table 9 of FIG. 3 is consulted; because no abnormality has been detected in the past, the abnormality level is determined to be level 2 (fatal, not reproducible), the abnormality detection history is updated, and the signal p1 is conveyed to the integrated abnormality response processing logic 627.
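The level determination in this example can be sketched as follows. The text states that level 2 is "fatal, not reproducible" and level 3 is "fatal, reproducible"; the meaning of level 1 is not given in this passage, so treating it as "not fatal" is an assumption, as are the function name and the dictionary used in place of Table 9.

```python
from typing import Dict, List


def judge_abnormality_level(
    fatal: bool,
    history: Dict[str, List[int]],
    location: str,
) -> int:
    """Judge the abnormality level and record it in the history.

    history is a stand-in for the abnormality detection history of
    Table 9, keyed by abnormality location (hypothetical layout).
    """
    seen_before = bool(history.get(location))
    if not fatal:
        level = 1          # assumption: level 1 means a non-fatal abnormality
    elif seen_before:
        level = 3          # fatal, reproducible
    else:
        level = 2          # fatal, not reproducible
    history.setdefault(location, []).append(level)   # update the history
    return level
```

With an empty history, a fatal AIO abnormality is judged level 2, matching the example above; a second fatal abnormality at the same location would be judged level 3.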

(3) Software processing monitoring and abnormality detection logic:
FIG. 7 is a flowchart showing the operation of software processing monitoring and the abnormality detection logic. The timer management unit 622 of the logical RAS function unit 620 implements the logical WDT group 623, composed of a plurality of software timers, and provides it to higher-level software processing, thereby realizing monitoring of a plurality of software processes.

As for the start and reset functions of the logical WDT group 623, the monitored applications 402, 403, and 404 themselves implement fixed-period WDT reset request units 402A, 403A, and 404A, and the timer management unit 622 calls these fixed-period WDT reset request units.

When monitoring of software processing starts in step S1, the logical WDT group 623 is started in step S2 by the signal f1 from the WDT reset unit 522 in the OS 520 or the signal f2 from the WDT reset unit 513 in the boot loader 510.

In step S3, the logical WDT group 623 obtains the configured time-up time from the logical RAS function setting information, using the data d2 read from the first database 628, and in step S4 starts counting the timers of the logical WDT group.

While operating normally, the applications 402, 403, and 404 use the fixed-period WDT reset request units 402A, 403A, and 404A to clear the counters at a period short enough that the counters never time up.

If the check in step S5 finds that any timer of the logical WDT group 623 has timed up because of a software abnormality, the software abnormality information is conveyed to the software processing abnormality detection logic 626 by the signal n in step S6. Steps S1 to S6 constitute the processing range (A) of the logical WDT group 623.

In step S7, the software processing abnormality detection logic 626 determines the abnormality level by consulting the past abnormality detection history in the data d5 read from the second database 629.

Further, in step S8 the abnormality detection history in the second database 629 is updated with the data d5, in step S9 the abnormality detection location and abnormality level are conveyed to the integrated abnormality response processing logic 627 by the signal p2, and in step S10 monitoring of the software processing ends. Steps S7 to S9 constitute the processing range (B) of the software processing abnormality detection logic 626.

As a specific example, the monitoring of the HTTP server defined in Table 5 of FIG. 3, a device/software item installed as an application (not shown), is described.

The HTTP server itself implements the start and reset processing for the logical WDT group 623. When the HTTP server starts the logical WDT group 623, the logical WDT group 623 obtains a time-up time of 60 seconds from the time-up time settings, as in the example of Table 8 of FIG. 3, and starts counting.

In normal operation, the HTTP server resets the logical WDT group 623 at a period not exceeding 60 seconds. If the HTTP server malfunctions, the logical WDT group 623 times up and conveys the software abnormality information to the software processing abnormality detection logic 626 by the signal n.

The software processing abnormality detection logic 626 consults the abnormality detection history of Table 9 by accessing the second database 629 with the data d5; because abnormalities have also occurred in the past, it determines the abnormality level to be level 3 (fatal, reproducible), updates the abnormality detection history, and conveys the information to the integrated abnormality response processing logic 627 by the signal p2.
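A single timer of the logical WDT group can be sketched as a software timer that the monitored application must reset before it times up. The class and method names are hypothetical, and the current time is passed in explicitly (an assumption made so the behavior is deterministic); the callback stands in for the signal n.

```python
from typing import Callable, Optional


class LogicalWatchdog:
    """One software watchdog timer (hypothetical sketch of a member of group 623)."""

    def __init__(self, name: str, timeout_s: float,
                 on_time_up: Callable[[str], None]) -> None:
        self.name = name
        self.timeout_s = timeout_s          # time-up time, e.g. from Table 8
        self.on_time_up = on_time_up        # stands in for signal n
        self._last_reset: Optional[float] = None
        self._fired = False

    def start(self, now: float) -> None:
        """Start counting from the given time."""
        self._last_reset = now
        self._fired = False

    def reset(self, now: float) -> None:
        """Clear the counter; called periodically by a healthy application."""
        self._last_reset = now

    def poll(self, now: float) -> bool:
        """Return True once the timer has timed up; notify only the first time."""
        if self._last_reset is None or self._fired:
            return self._fired
        if now - self._last_reset >= self.timeout_s:
            self._fired = True
            self.on_time_up(self.name)
        return self._fired
```

For the HTTP server example, a timer created with a 60-second time-up time stays quiet as long as `reset` is called at least once every 60 seconds, and reports the server's name through the callback once that period is exceeded.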

(4) Integrated abnormality response processing logic 627:
FIG. 8 is a flowchart showing the operation of the integrated abnormality response logic. When the abnormality response processing logic starts in step S1, the integrated abnormality response processing logic 627 receives, in step S2, the abnormality detection location and abnormality level via the abnormality information signal p1 from the logical device abnormality detection logic 621 or the abnormality information signal p2 from the software processing abnormality detection logic 626.

In step S3, the abnormality response processing corresponding to the abnormality detection location and abnormality level is searched for. The search first accesses the first database 628 with the data d3, looks up in the logical RAS function setting information the response processing for the abnormality location according to the abnormality level, and selects the items (the operation flow of the abnormality response processing item search is described with FIG. 9).

If the check in step S4 finds abnormality processing for a physical device, the signal r is output, the abnormality response processing unit 632 in the physical device specific processing unit 630 is started in step S5, and the abnormality response processing specific to the relevant physical device is executed on the physical device group 101 by the signal t.

Next, if the check in step S6 finds an abnormality in software processing, the signal g is sent to the application management unit 523 in the OS 520, and in step S7 abnormality response processing is requested of the software management function.

In step S8, the data d6 is sent to the second database 629, and the past abnormality history and the system state at the time of the abnormality are recorded in Table 9 and Table 10; the abnormality response processing logic ends in step S9.

FIG. 9 is a flowchart showing the abnormality response processing item search performed by the integrated abnormality response processing logic 627. When the search starts in step S1, the integrated abnormality response processing logic 627 accesses the first database 628 with the data d3 in step S2 and, referring to Table 6, searches for the abnormality response processing items that correspond to the abnormality level at the abnormality location.

In step S3, Table 5 is consulted using the data d3 read from the first database 628 to find the devices and software that depend on the abnormality location. If the check in step S4 finds dependent devices or software, the process proceeds to the check in step S5; if no abnormality has been detected in the dependent part, the first database 628 is accessed with the data d3 in step S6 to search for integrated abnormality response processing for the dependent part.

Next, if the check in step S7 finds that integrated abnormality processing is needed, the abnormality response for the dependent part is added to the processing items in step S8, and the abnormality response processing search ends in step S9.

If the check in step S5 finds that an abnormality has been detected in the dependent part, the process skips to step S8 and the abnormality response for the dependent part is added to the processing items. If the check in step S4 finds no dependent devices or software, the process skips to step S9 and the abnormality response processing search ends.

As is clear from the processing flows of FIG. 8 and FIG. 9, the integrated abnormality response processing logic 627 searches the logical RAS function setting information for the devices and software that depend on the abnormality location. For a dependent part, response processing is added as an item if an abnormality has been detected in that part or if compound abnormality response processing is defined in the logical RAS function setting information.

Among the processing items found, those for a physical device are executed by calling the corresponding device-specific abnormality response processing unit 632. For those directed at software, abnormality response processing is requested of the application management unit 523, which is the software management function of the OS. Finally, the system state at the time of the abnormality is recorded in the past abnormality history.

As a specific example, the abnormality response processing when an AIO abnormality (abnormality level 2) is detected on its own is described. On receiving the abnormality information signal p1, the integrated abnormality response processing logic 627 refers to the abnormality response mapping shown in the example of Table 6 and selects abnormality response processing 1 (restart of the AIO) for abnormality level 2 of the AIO as a processing item.

Next, from the dependency information shown in the example of Table 5, it determines that the control application is a dependent. If an abnormality had also been detected in the control application, that abnormality's response processing would be added to the processing items.

In this example the abnormality is in the AIO alone, so the process moves on to the search for compound abnormality response processing shown in the example of Table 7. Because abnormality response processing and conditions are defined there for the dependent control application, abnormality response processing 1 (restart of the control application) is added to the processing items.

The abnormality response processing items have now been enumerated, so the AIO specific abnormality response processing is executed and a restart of the control application is requested in accordance with the processing items. Finally, the system state at the time of the abnormality is recorded, as in the example shown in Table 10.
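The item search of FIG. 9, applied to this AIO example, can be sketched as follows. The three dictionaries are hypothetical stand-ins for Tables 5, 6, and 7 and contain only the entries named in the text; the real tables, their keys, and the conditions attached to Table 7 are not reproduced here.

```python
from typing import Dict, List, Tuple

# Hypothetical stand-ins for the setting tables in the first database 628.
TABLE5_DEPENDENTS: Dict[str, List[str]] = {"AIO": ["control application"]}
TABLE6_RESPONSES: Dict[Tuple[str, int], str] = {("AIO", 2): "restart AIO"}
TABLE7_COMPOUND: Dict[Tuple[str, str], str] = {
    ("AIO", "control application"): "restart control application",
}


def search_response_items(location: str, level: int,
                          detected: Dict[str, int]) -> List[str]:
    """Collect response items for an abnormality at `location`.

    detected maps other locations where an abnormality has also been
    detected to their abnormality levels.
    """
    items: List[str] = []
    primary = TABLE6_RESPONSES.get((location, level))        # S2: Table 6 lookup
    if primary is not None:
        items.append(primary)
    for dependent in TABLE5_DEPENDENTS.get(location, []):    # S3, S4: dependents
        if dependent in detected:                            # S5: also abnormal
            found = TABLE6_RESPONSES.get((dependent, detected[dependent]))
        else:                                                # S6, S7: compound response
            found = TABLE7_COMPOUND.get((location, dependent))
        if found is not None:
            items.append(found)                              # S8: add the item
    return items
```

For a lone level 2 AIO abnormality, the sketch yields the AIO restart followed by the control application restart, which is the pair of processing items enumerated in the example above.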

The RAS function of the present invention described above can be extended and applied as follows.
(1) Application to a fault-tolerant function device that can be attached to and detached from an embedded device:
To improve reliability, a logical fault-tolerant function is implemented in the logical RAS function in addition to the RAS function.

Higher-level software such as the boot loader and OS does not need to be aware of whether the hardware is in a redundant or single configuration. Accordingly, the redundancy control function realized in the hypervisor layer controls the logical multiplexed CPU boards and, when a physical CPU is accessed, maps the operation to hardware-dependent functions.

(2) The abstracted and simplified logical RAS function can be implemented in an FPGA. Implementing the RAS function in a programmable IC makes it possible to provide a more reliable RAS function to an embedded device in a detachable form.

(3) When a multi-core CPU is used as the CPU, the logical RAS function unit 620, the physical device relay interface 610, and the physical device specific processing unit 630 implemented in the hypervisor layer 600 can be assigned to a dedicated CPU core, speeding up the signal processing related to the RAS function.

100 Hardware
101 Physical device group
102 Physical watchdog timer (WDT)
400 Application software
401-404 Applications
402A-404A Fixed-period WDT reset request units
500 System software
510 Boot loader
511 Device drivers
512 Device initialization driver
513 WDT reset unit
520 OS
521 Device access unit
522 WDT reset unit
523 Application management unit
600 Hypervisor layer
610 Physical device relay interface
611 Physical device initialization interface
612 Physical device access interface
620 Logical RAS function unit
621 Logical device abnormality detection logic
622 Timer management unit
623 Logical WDT group
624 Device fixed-period diagnosis timer
625 Physical WDT reset timer
626 Software processing abnormality detection logic
627 Integrated abnormality response processing logic
628 First database
629 Second database
630 Physical device specific processing unit
631 Abnormality detection processing unit
632 Abnormality response processing unit
633 Physical WDT reset unit

Claims

1. An embedded device having a RAS function, in which a plurality of applications use a plurality of physical devices constituting hardware via a device driver of system software, and in which a RAS function for the applications and the hardware is implemented, wherein
a logical RAS function unit that integrates and centrally manages the RAS functions of the plurality of applications and the hardware is implemented in a hypervisor layer interposed between the system software and the hardware,
the logical RAS function unit includes logical device abnormality detection logic for logical devices obtained by classifying the plurality of physical devices into a predetermined number of groups, and
the logical device abnormality detection logic is activated when the physical devices are initialized from the boot loader of the system software and when they are accessed from the device driver of the OS of the system software, and performs abnormality detection diagnosis on the logical devices.
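
As an illustration only, the trigger structure of claim 1 can be sketched in Python. The logical classes, the mapping table, and every name below are hypothetical assumptions; the claim specifies only that the physical devices are classified into a predetermined number of logical devices and that diagnosis runs at initialization and at access.

```python
from enum import Enum
from typing import Callable, Dict, List, Tuple


class LogicalDevice(Enum):
    # Hypothetical logical classes; the claim only says "a predetermined number".
    MEMORY = "memory"
    STORAGE = "storage"
    COMMUNICATION = "communication"


# Hypothetical classification of physical devices into logical devices.
PHYSICAL_TO_LOGICAL: Dict[str, LogicalDevice] = {
    "SDRAM": LogicalDevice.MEMORY,
    "CF card": LogicalDevice.STORAGE,
    "serial port": LogicalDevice.COMMUNICATION,
}


class LogicalDeviceAbnormalityDetection:
    """Runs one diagnostic per logical device class, on two triggers."""

    def __init__(self, diagnostics: Dict[LogicalDevice, Callable[[str], bool]]):
        self.diagnostics = diagnostics
        self.detected: List[Tuple[str, str, LogicalDevice]] = []

    def _diagnose(self, trigger: str, physical_device: str) -> bool:
        logical = PHYSICAL_TO_LOGICAL[physical_device]
        healthy = self.diagnostics[logical](physical_device)
        if not healthy:
            # Keep the abnormality detection information for later handling.
            self.detected.append((trigger, physical_device, logical))
        return healthy

    def on_initialize(self, physical_device: str) -> bool:
        # Stands in for initialization requested by the boot loader.
        return self._diagnose("init", physical_device)

    def on_access(self, physical_device: str) -> bool:
        # Stands in for an access from the OS device driver.
        return self._diagnose("access", physical_device)
```

Because the diagnostics are keyed by logical class rather than by physical device, adding a new physical device in this sketch only requires a new entry in the mapping table.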

2. The embedded device having the RAS function according to claim 1, further comprising:
abnormality detection processing means for acquiring abnormality detection information from the logical device abnormality detection logic and detecting an abnormality of a physical device of the hardware;
integrated abnormality handling processing logic that integrates the abnormality detection information from the logical device abnormality detection logic with abnormality detection information of the applications and executes abnormality processing; and
abnormality handling processing means for executing abnormality handling processing for the physical device of the hardware based on abnormality handling information from the integrated abnormality handling processing logic.

3. The embedded device having the RAS function according to claim 2, wherein
the logical RAS function unit includes a logical watchdog timer group corresponding to the plurality of applications and software processing abnormality detection logic,
the logical watchdog timer group monitors the plurality of applications by being reset by reset requests from fixed-period reset request means implemented in the plurality of applications, and outputs time-up information of a logical watchdog timer to the software processing abnormality detection logic, and
the software processing abnormality detection logic requests the integrated abnormality handling processing logic to perform abnormality handling processing.
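
The logical watchdog timer group of claim 3 can be pictured with the following minimal Python sketch. It assumes one software timer per application, explicit timestamps passed in by the caller, and a callback that stands in for the software processing abnormality detection logic; the class name, method names, and application identifiers are all hypothetical.

```python
from typing import Callable, Dict, Set


class LogicalWatchdogGroup:
    """One logical watchdog timer per application (hypothetical sketch)."""

    def __init__(self, timeouts: Dict[str, float],
                 on_time_up: Callable[[str, float], None]):
        self.timeouts = dict(timeouts)          # application id -> timeout in seconds
        self.on_time_up = on_time_up            # receives time-up information
        self.last_reset = {app: 0.0 for app in self.timeouts}
        self.expired: Set[str] = set()

    def reset(self, app: str, now: float) -> None:
        # Called by the application's fixed-period reset request.
        self.last_reset[app] = now
        self.expired.discard(app)

    def tick(self, now: float) -> None:
        # Called periodically; reports each time-up once until the next reset.
        for app, timeout in self.timeouts.items():
            if app not in self.expired and now - self.last_reset[app] > timeout:
                self.expired.add(app)
                self.on_time_up(app, now)


# The callback collects handling requests, standing in for the request that the
# software processing abnormality detection logic sends on to the integrated logic.
handling_requests = []
wdt_group = LogicalWatchdogGroup(
    {"app402": 1.0, "app403": 1.0},
    lambda app, now: handling_requests.append(app),
)
wdt_group.reset("app402", now=0.9)   # app402 keeps resetting its timer
wdt_group.tick(now=1.5)              # app403 has not reset since 0.0
```

Keeping the timers in software means that a single physical watchdog timer can remain reserved for the system as a whole while each application is still monitored individually.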

4. The embedded device having the RAS function described above, wherein the integrated abnormality handling processing logic, upon acquiring abnormality information from the software processing abnormality detection logic, notifies an application management means of the system software and requests handling processing.

5. The embedded device having the RAS function according to any one of claims 2 to 4, wherein the integrated abnormality handling processing logic sets, for each of a plurality of types of abnormality occurrence locations, the correspondence between abnormality levels and abnormality handling processes, and in which device-software dependency relations and compound abnormality handling processes are also set.
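
One way to read the settings described in claim 5 is as a lookup table keyed by abnormality location and abnormality level, plus a dependency table between devices and software. The Python sketch below is a hypothetical illustration only: the locations, levels, handling names, and the rule that a major device abnormality also restarts dependent applications are assumptions, not taken from the patent.

```python
from typing import Dict, List, Tuple

# Hypothetical table: (abnormality location, abnormality level) -> handling process.
HANDLING_TABLE: Dict[Tuple[str, str], str] = {
    ("storage", "minor"): "log_only",
    ("storage", "major"): "reinitialize_device",
    ("application", "minor"): "notify_application_management",
    ("application", "major"): "restart_application",
}

# Hypothetical device-software dependency: device -> applications that use it.
DEPENDENCIES: Dict[str, List[str]] = {"storage": ["app_logger"]}


def plan_handling(location: str, level: str) -> List[Tuple[str, str]]:
    """Return the (target, handling process) pairs for one abnormality."""
    actions = [(location, HANDLING_TABLE[(location, level)])]
    if level == "major":
        # Assumed compound handling: dependent applications are restarted too.
        for app in DEPENDENCIES.get(location, []):
            actions.append((app, "restart_application"))
    return actions
```

In this sketch, `plan_handling("storage", "major")` yields the device handling followed by one entry per dependent application, so the dependency setting changes the plan without touching the per-location table.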

6. The embedded device having the RAS function according to any one of the above claims, wherein the logical RAS function unit includes a device fixed-period diagnosis timer for the logical device abnormality detection logic and a physical watchdog timer reset unit for a watchdog timer provided in the hardware.

7. The embedded device having the RAS function according to claim 3 or 4, further comprising:
a first database that provides logical RAS function setting information to the logical device abnormality detection logic and the logical watchdog timer group; and
a second database that provides past abnormality history information to the logical device abnormality detection logic and the software processing abnormality detection logic.

8. The embedded device having the RAS function according to claim 7, wherein the content of the abnormality processing performed by the integrated abnormality handling processing logic is recorded in the abnormality history of the second database.