JPH10143387A - Computer system with fault diagnosis function - Google Patents

Computer system with fault diagnosis function

Info

Publication number
JPH10143387A
Authority
JP
Japan
Prior art keywords
fault
computer system
bus
signal
circuit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP9297446A
Other languages
English (en)
Japanese (ja)
Other versions
JPH10143387A5
Inventor
Paul R Culley
ポール・アール・カリー
Joseph P Miller
ジョセフ・ピー・ミラー
Daniel S Hull
ダニエル・エス・ハル
Siamak Tavallaei
シアマック・タヴァラエイ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Compaq Computer Corp
Original Assignee
Compaq Computer Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Compaq Computer Corp
Publication of JPH10143387A
Publication of JPH10143387A5

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0745 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in an input/output transactions management context
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0748 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a remote unit communicating with a single-box computer node experiencing an error/fault
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079 Root cause analysis, i.e. error or fault diagnosis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Test And Diagnosis Of Digital Computers (AREA)
  • Bus Control (AREA)
JP9297446A 1996-10-29 1997-10-29 Computer system with fault diagnosis function Pending JPH10143387A (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US739687 1996-10-29
US08/739,687 US6000040A (en) 1996-10-29 1996-10-29 Method and apparatus for diagnosing fault states in a computer system

Publications (2)

Publication Number Publication Date
JPH10143387A (ja) 1998-05-29
JPH10143387A5 2005-04-07

Family

ID=24973376

Family Applications (1)

Application Number Title Priority Date Filing Date
JP9297446A Pending JPH10143387A (ja) 1996-10-29 1997-10-29 Computer system with fault diagnosis function

Country Status (4)

Country Link
US (1) US6000040A
EP (1) EP0840226B1
JP (1) JPH10143387A
DE (1) DE69726693T2

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002526860A (ja) * 1998-10-01 2002-08-20 Phoenix Technologies Ltd Apparatus and method for emulating input/output instructions for the correct processor and servicing software SMIs in a multiprocessor environment
EP1703401A2 (en) 2005-03-17 2006-09-20 Fujitsu Limited Information processing apparatus and control method therefor
JP2013041438A (ja) * 2011-08-17 2013-02-28 Nec Fielding Ltd Hardware fault suspect identification device, hardware fault suspect identification method, and program

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2213966C (en) * 1995-12-27 2004-10-26 Koken Co., Ltd. Monitoring control apparatus
DE19723079C1 (de) * 1997-06-02 1998-11-19 Bosch Gmbh Robert Fault diagnosis device and method
US6496869B1 (en) * 1998-03-26 2002-12-17 National Semiconductor Corporation Receiving data on a networked computer in a reduced power state
US6463550B1 (en) * 1998-06-04 2002-10-08 Compaq Information Technologies Group, L.P. Computer system implementing fault detection and isolation using unique identification codes stored in non-volatile memory
US6438711B2 (en) * 1998-07-15 2002-08-20 Intel Corporation Method and apparatus for performing field diagnostics on a computer system
US6219626B1 (en) * 1998-09-08 2001-04-17 Lockheed Corp Automated diagnostic system
US6175927B1 (en) * 1998-10-06 2001-01-16 International Business Machine Corporation Alert mechanism for service interruption from power loss
US6449729B1 (en) * 1999-02-12 2002-09-10 Compaq Information Technologies Group, L.P. Computer system for dynamically scaling busses during operation
US6359565B1 (en) * 1999-06-03 2002-03-19 Fujitsu Network Communications, Inc. Method and system for monitoring the thermal status of a card shelf
US6898654B1 (en) 1999-07-29 2005-05-24 Microsoft Corporation Method and system for managing bandwidth on a master-slave bus
JP3715475B2 (ja) * 1999-09-13 2005-11-09 Fujitsu Ltd Temperature control circuit for electronic equipment and temperature control method for electronic equipment
US6550019B1 (en) * 1999-11-04 2003-04-15 International Business Machines Corporation Method and apparatus for problem identification during initial program load in a multiprocessor system
US6543002B1 (en) * 1999-11-04 2003-04-01 International Business Machines Corporation Recovery from hang condition in a microprocessor
US6543003B1 (en) * 1999-11-08 2003-04-01 International Business Machines Corporation Method and apparatus for multi-stage hang recovery in an out-of-order microprocessor
US6643802B1 (en) * 2000-04-27 2003-11-04 Ncr Corporation Coordinated multinode dump collection in response to a fault
US6735720B1 (en) * 2000-05-31 2004-05-11 Microsoft Corporation Method and system for recovering a failed device on a master-slave bus
US6658510B1 (en) * 2000-10-18 2003-12-02 International Business Machines Corporation Software method to retry access to peripherals that can cause bus timeouts during momentary busy periods
GB2373607B (en) * 2001-03-23 2003-02-12 Sun Microsystems Inc A computer system
GB2373606B (en) * 2001-03-23 2003-06-04 Sun Microsystems Inc A computer system
US6845469B2 (en) * 2001-03-29 2005-01-18 International Business Machines Corporation Method for managing an uncorrectable, unrecoverable data error (UE) as the UE passes through a plurality of devices in a central electronics complex
US6829729B2 (en) * 2001-03-29 2004-12-07 International Business Machines Corporation Method and system for fault isolation methodology for I/O unrecoverable, uncorrectable error
US6766401B2 (en) 2001-04-27 2004-07-20 International Business Machines Corporation Increasing control information from a single general purpose input/output (GPIO) mechanism
US20040225783A1 (en) * 2001-07-30 2004-11-11 Erickson Michael John Bus to multiple jtag bus bridge
US7000141B1 (en) * 2001-11-14 2006-02-14 Hewlett-Packard Development Company, L.P. Data placement for fault tolerance
US7047462B2 (en) * 2002-01-04 2006-05-16 Hewlett-Packard Development Company, Lp. Method and apparatus for providing JTAG functionality in a remote server management controller
US7093168B2 (en) * 2002-01-22 2006-08-15 Honeywell International, Inc. Signal validation and arbitration system and method
US7447975B2 (en) * 2002-09-12 2008-11-04 Hewlett-Packard Development Company, L.P. Supporting cyclic redundancy checking for PCI-X
US20040166905A1 (en) * 2003-02-07 2004-08-26 Hewlett-Packard Development Company, L.P. Radio frequency linked computer architecture
US7318171B2 (en) * 2003-03-12 2008-01-08 Intel Corporation Policy-based response to system errors occurring during OS runtime
US20040249773A1 (en) * 2003-06-03 2004-12-09 Ge Medical Systems Global Technology Company, Llc Diagnostic multilevel polymorphic state machine technical field
US20040267483A1 (en) * 2003-06-26 2004-12-30 Percer Benjamin Thomas Methods and systems for masking faults in a margin testing environment
US7400996B2 (en) * 2003-06-26 2008-07-15 Benjamin Thomas Percer Use of I2C-based potentiometers to enable voltage rail variation under BMC control
US7437258B2 (en) * 2003-06-26 2008-10-14 Hewlett-Packard Development Company, L.P. Use of I2C programmable clock generator to enable frequency variation under BMC control
US7493226B2 (en) * 2003-06-26 2009-02-17 Hewlett-Packard Development Company, L.P. Method and construct for enabling programmable, integrated system margin testing
US7673177B2 (en) * 2003-07-01 2010-03-02 Samsung Electronics Co., Ltd. Circuit and method for providing PCB power-on self test capability for peripheral devices
DE10361364B4 (de) * 2003-12-29 2010-07-01 Advanced Micro Devices, Inc., Sunnyvale Device for handling interrupt events, with which level-sensitive interrupt requests are converted into edge-triggered interrupt messages
US20050193246A1 (en) * 2004-02-19 2005-09-01 Marconi Communications, Inc. Method, apparatus and software for preventing switch failures in the presence of faults
US7228457B2 (en) * 2004-03-16 2007-06-05 Arm Limited Performing diagnostic operations upon a data processing apparatus with power down support
US7089341B2 (en) * 2004-03-31 2006-08-08 International Business Machines Corporation Method and apparatus for supporting interrupt devices configured for a particular architecture on a different platform
US7337368B2 (en) * 2004-06-07 2008-02-26 Dell Products L.P. System and method for shutdown memory testing
US7451064B2 (en) * 2004-10-06 2008-11-11 Hewlett-Packard Development Company, L.P. System and method for logging hardware usage data, and uses for such logged hardware usage data
US20060184770A1 (en) * 2005-02-12 2006-08-17 International Business Machines Corporation Method of implementing precise, localized hardware-error workarounds under centralized control
US7689748B2 (en) * 2006-05-05 2010-03-30 Ati Technologies, Inc. Event handler for context-switchable and non-context-switchable processing tasks
JP2008226083A (ja) * 2007-03-15 2008-09-25 Nec Electronics Corp On-chip debug emulator, debugging method, and microcomputer
US8667336B2 (en) * 2007-06-14 2014-03-04 Intel Corporation Flash memory-hosted local and remote out-of-service platform manageability
TWI411920B (zh) * 2007-09-29 2013-10-11 Tpk Touch Solutions Inc The interrupt sequence of the interrupt request signal
EP2042998B1 (en) * 2007-09-29 2016-05-18 TPK Touch Solutions Inc. Logic gateway circuit for bus that supports multiple interrupt request signals
US7743193B2 (en) 2007-10-31 2010-06-22 Tpk Touch Solutions Inc. Logic gateway circuit for bus that supports multiple interrupt request signals
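CN102891762B (zh) * 2011-07-20 2016-05-04 Scienbizip Consulting (Shenzhen) Co., Ltd. System and method for continuously processing network data
CN102955718B (zh) * 2011-08-17 2016-02-24 Scienbizip Consulting (Shenzhen) Co., Ltd. Server protection system
CN103135518B (zh) * 2011-12-02 2019-11-12 Fisher Controls International LLC Program flow control monitoring routines, related methods, and systems
KR20140113175A (ko) * 2013-03-15 2014-09-24 Samsung Electronics Co., Ltd. Bus protocol checker, system on chip including the same, and bus protocol checking method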
US8943373B1 (en) 2013-09-25 2015-01-27 Lenovo Enterprise Solutions (Singapore) Pte. Ltd. Keyboard, video and mouse switch identifying and displaying nodes experiencing a problem
US20190179721A1 (en) * 2016-01-26 2019-06-13 Hewlett Packard Enterprise Development Lp Utilizing non-volatile phase change memory in offline status and error debugging methodologies
US9940235B2 (en) 2016-06-29 2018-04-10 Oracle International Corporation Method and system for valid memory module configuration and verification
US10379927B2 (en) * 2016-11-01 2019-08-13 Xilinx, Inc. Programmable clock monitor
US11126492B1 (en) * 2019-11-05 2021-09-21 Express Scripts Strategic Development, Inc. Systems and methods for anomaly analysis and outage avoidance in enterprise computing systems
EP4069456A4 (en) * 2021-01-27 2024-01-03 Apex Brands, Inc. SPINDLE AND SPINDLE SYSTEM WITH LOGIC SUPPLY BUS FAULT DIAGNOSIS

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE25779T1 (de) * 1981-10-01 1987-03-15 Stratus Computer Inc Digital data processing system with reliability bus protocol.
ATE46403T1 (de) * 1984-12-19 1989-09-15 Siemens Ag Decentralized monitoring system for ventilation in a data processing system.
US5267246A (en) * 1988-06-30 1993-11-30 International Business Machines Corporation Apparatus and method for simultaneously presenting error interrupt and error data to a support processor
US4965717A (en) * 1988-12-09 1990-10-23 Tandem Computers Incorporated Multiple processor system having shared memory with private-write capability
JP2804125B2 (ja) * 1989-11-08 1998-09-24 Hitachi Ltd Fault monitoring device and control method for an information processing system
US5216672A (en) * 1992-04-24 1993-06-01 Digital Equipment Corporation Parallel diagnostic mode for testing computer memory
EP0636976B1 (en) * 1993-07-28 1998-12-30 Koninklijke Philips Electronics N.V. Microcontroller provided with hardware for supporting debugging as based on boundary scan standard-type extensions
SE502852C2 (sv) * 1994-04-08 1996-01-29 Ellemtel Utvecklings Ab Method and system for distributed monitoring of hardware
JP2886093B2 (ja) * 1994-07-28 1999-04-26 Hitachi Ltd Fault handling method and information processing system
US5701409A (en) * 1995-02-22 1997-12-23 Adaptec, Inc. Error generation circuit for testing a digital bus
US5570375A (en) * 1995-05-10 1996-10-29 National Science Council Of R.O.C. IEEE Std. 1149.1 boundary scan circuit capable of built-in self-testing
US5708773A (en) * 1995-07-20 1998-01-13 Unisys Corporation JTAG interface system for communicating with compliant and non-compliant JTAG devices
KR0171385B1 (ko) * 1995-08-05 1999-03-30 Yang Seung-taek Fault diagnosis method for an electronic switching system
US5706297A (en) * 1995-08-24 1998-01-06 Unisys Corporation System for adapting maintenance operations to JTAG and non-JTAG modules
US5742753A (en) * 1996-06-06 1998-04-21 The Boeing Company Mesh interconnected array in a fault-tolerant computer system
US5640404A (en) * 1996-08-05 1997-06-17 Vlsi Technology, Inc. Limited probes device testing for high pin count digital devices

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002526860A (ja) * 1998-10-01 2002-08-20 Phoenix Technologies Ltd Apparatus and method for emulating input/output instructions for the correct processor and servicing software SMIs in a multiprocessor environment
EP1703401A2 (en) 2005-03-17 2006-09-20 Fujitsu Limited Information processing apparatus and control method therefor
US7802138B2 (en) 2005-03-17 2010-09-21 Fujitsu Limited Control method for information processing apparatus, information processing apparatus, control program for information processing system and redundant comprisal control apparatus
JP2013041438A (ja) * 2011-08-17 2013-02-28 Nec Fielding Ltd Hardware fault suspect identification device, hardware fault suspect identification method, and program

Also Published As

Publication number Publication date
US6000040A (en) 1999-12-07
EP0840226B1 (en) 2003-12-10
DE69726693T2 (de) 2004-10-07
EP0840226A1 (en) 1998-05-06
DE69726693D1 (de) 2004-01-22

Similar Documents

Publication Publication Date Title
JPH10143387A (ja) Computer system with fault diagnosis function
US5864653A (en) PCI hot spare capability for failed components
US6081865A (en) Isolation of PCI and EISA masters by masking control and interrupt lines
US6070253A (en) Computer diagnostic board that provides system monitoring and permits remote terminal access
US5907689A (en) Master-target based arbitration priority
US6311296B1 (en) Bus management card for use in a system for bus monitoring
US11609874B2 (en) System-on-chips and methods of controlling reset of system-on-chips
US6742139B1 (en) Service processor reset/reload
US7594144B2 (en) Handling fatal computer hardware errors
EP0817055B1 (en) Computer system host switching
US7669084B2 (en) Method for self-diagnosing remote I/O enclosures with enhanced FRU callouts
EP3349118B1 (en) Bus hang detection and find out
JP3943998B2 (ja) Method for testing a logical partitioning implementation, computer-readable recording medium storing a program for causing a computer to execute the method, and logical partitioning test system
US8700835B2 (en) Computer system and abnormality detection circuit
CA2160500C (en) Pci/isa bridge having an arrangement for responding to pci bridge address parity errors for internal pci slaves in the pci/isa bridge
US6035355A (en) PCI system and adapter requirements following reset
EP0979451A1 (en) Digital data processing methods and apparatus for fault isolation
WO1997046941A9 (en) Digital data processing methods and apparatus for fault isolation
JPH11161625A (ja) Computer system
US7877643B2 (en) Method, system, and product for providing extended error handling capability in host bridges
US6985980B1 (en) Diagnostic scheme for programmable logic in a system on a chip
US5951661A (en) Bus protocol violation monitor systems and methods
US6732298B1 (en) Nonmaskable interrupt workaround for a single exception interrupt handler processor
US7290180B2 (en) Method to use an alternate I/O debug path
JP3838992B2 (ja) Fault detection method and information processing system

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20040430

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20040430

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20060413

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20060418

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20060915