JP4717079B2 - コンピュータ・システムにおける障害の診断および保守のための方法およびシステム(疑わしいコンポーネントの履歴ベースの優先順位付け) - Google Patents

コンピュータ・システムにおける障害の診断および保守のための方法およびシステム(疑わしいコンポーネントの履歴ベースの優先順位付け) Download PDF

Info

Publication number
JP4717079B2
JP4717079B2 JP2007550793A JP2007550793A JP4717079B2 JP 4717079 B2 JP4717079 B2 JP 4717079B2 JP 2007550793 A JP2007550793 A JP 2007550793A JP 2007550793 A JP2007550793 A JP 2007550793A JP 4717079 B2 JP4717079 B2 JP 4717079B2
Authority
JP
Japan
Prior art keywords
list
previous
computerized system
corrective action
priority
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2007550793A
Other languages
English (en)
Japanese (ja)
Other versions
JP2008527554A (ja
JP2008527554A5 (enExample
Inventor
ニッサン−メッシング、オリット
ズロトニック、アヴィアド
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of JP2008527554A publication Critical patent/JP2008527554A/ja
Publication of JP2008527554A5 publication Critical patent/JP2008527554A5/ja
Application granted granted Critical
Publication of JP4717079B2 publication Critical patent/JP4717079B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2294Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing by remote test
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079Root cause analysis, i.e. error or fault diagnosis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Debugging And Monitoring (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Test And Diagnosis Of Digital Computers (AREA)
JP2007550793A 2005-01-18 2006-01-12 コンピュータ・システムにおける障害の診断および保守のための方法およびシステム(疑わしいコンポーネントの履歴ベースの優先順位付け) Expired - Fee Related JP4717079B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US11/037,513 2005-01-18
US11/037,513 US7409595B2 (en) 2005-01-18 2005-01-18 History-based prioritizing of suspected components
PCT/EP2006/050178 WO2006077193A1 (en) 2005-01-18 2006-01-12 History-based prioritizing of suspected components

Publications (3)

Publication Number Publication Date
JP2008527554A JP2008527554A (ja) 2008-07-24
JP2008527554A5 JP2008527554A5 (enExample) 2008-12-04
JP4717079B2 true JP4717079B2 (ja) 2011-07-06

Family

ID=35914581

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2007550793A Expired - Fee Related JP4717079B2 (ja) 2005-01-18 2006-01-12 コンピュータ・システムにおける障害の診断および保守のための方法およびシステム(疑わしいコンポーネントの履歴ベースの優先順位付け)

Country Status (5)

Country Link
US (1) US7409595B2 (enExample)
EP (1) EP1851634A1 (enExample)
JP (1) JP4717079B2 (enExample)
CN (1) CN100559350C (enExample)
WO (1) WO2006077193A1 (enExample)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK1688842T3 (da) * 2005-01-26 2008-06-16 Oce Tech Bv Automatiseret ydelsesanalyse og fejludbedring
US7689873B1 (en) * 2005-09-19 2010-03-30 Google Inc. Systems and methods for prioritizing error notification
US7613949B1 (en) * 2006-06-30 2009-11-03 Boone Lewis A Fault isolation system and method
EP1993014B1 (de) * 2007-05-16 2011-06-29 Siemens Aktiengesellschaft Verfahren zum Lokalisieren von defekten Hardwarekomponenten und/oder Systemfehlern innerhalb einer Produktionsanlage
US8010325B2 (en) * 2008-04-25 2011-08-30 Microsoft Corporation Failure simulation and availability report on same
US20090292956A1 (en) * 2008-05-23 2009-11-26 Microsoft Corporation Trend based test failure prioritization
US8756576B2 (en) * 2008-08-20 2014-06-17 International Business Machines Corporation Ranking peer submitted code snippets using execution feedback
US8266594B2 (en) 2008-08-20 2012-09-11 International Business Machines Corporation System, method and program product for correcting semantic errors in code using peer submitted code snippets
US8713534B2 (en) 2008-08-20 2014-04-29 International Business Machines Corporation System, method and program product for guiding correction of semantic errors in code using collaboration records
JP5439775B2 (ja) * 2008-09-17 2014-03-12 富士通株式会社 障害対応プログラム、障害対応装置、及び障害対応システム
US7949900B2 (en) * 2008-09-19 2011-05-24 International Business Machines Corporation Autonomously configuring information systems to support mission objectives
US8185781B2 (en) * 2009-04-09 2012-05-22 Nec Laboratories America, Inc. Invariants-based learning method and system for failure diagnosis in large scale computing systems
US8024609B2 (en) * 2009-06-03 2011-09-20 International Business Machines Corporation Failure analysis based on time-varying failure rates
US8600556B2 (en) 2009-06-22 2013-12-03 Johnson Controls Technology Company Smart building manager
US11269303B2 (en) 2009-06-22 2022-03-08 Johnson Controls Technology Company Systems and methods for detecting changes in energy usage in a building
US20110020122A1 (en) * 2009-07-24 2011-01-27 Honeywell International Inc. Integrated condition based maintenance system for wind turbines
US20110314331A1 (en) * 2009-10-29 2011-12-22 Cybernet Systems Corporation Automated test and repair method and apparatus applicable to complex, distributed systems
US8156377B2 (en) * 2010-07-02 2012-04-10 Oracle International Corporation Method and apparatus for determining ranked causal paths for faults in a complex multi-host system with probabilistic inference in a time series
US8069370B1 (en) 2010-07-02 2011-11-29 Oracle International Corporation Fault identification of multi-host complex systems with timesliding window analysis in a time series
US8230262B2 (en) 2010-07-02 2012-07-24 Oracle International Corporation Method and apparatus for dealing with accumulative behavior of some system observations in a time series for Bayesian inference with a static Bayesian network model
US8291263B2 (en) 2010-07-02 2012-10-16 Oracle International Corporation Methods and apparatus for cross-host diagnosis of complex multi-host systems in a time series with probabilistic inference
US8234523B2 (en) * 2010-07-28 2012-07-31 Honeywell International Inc. Automatic determination of success of using a computerized decision support system
FR2989499B1 (fr) 2012-04-12 2014-05-16 Airbus Operations Sas Procede, dispositifs et programme d'ordinateur d'aide au diagnostic preventif d'un systeme d'un aeronef, utilisant des graphes d'evenements redoutes
CN104956373A (zh) 2012-12-04 2015-09-30 惠普发展公司,有限责任合伙企业 确定异常网络行为的可疑根本原因
US20140259167A1 (en) * 2013-03-11 2014-09-11 Samsung Electronics Co. Ltd. Behavior based application blacklisting
US10628246B1 (en) * 2013-05-20 2020-04-21 The Boeing Company Methods and systems for prioritizing corrective actions in a troubleshooting chart
EP3126980A4 (en) * 2014-04-02 2017-10-11 Sikorsky Aircraft Corporation System and method for improved health management and maintenance decision support
US9424063B2 (en) * 2014-04-29 2016-08-23 Vmware, Inc. Method and system for generating remediation options within a cluster of host computers that run virtual machines
US9389900B2 (en) 2014-04-29 2016-07-12 Vmware, Inc. Method and system for supporting a change in state within a cluster of host computers that run virtual machines
US9747152B2 (en) * 2015-04-27 2017-08-29 Splunk Inc. Tracking incomplete transactions in correlation with application errors
US10474519B2 (en) * 2015-09-17 2019-11-12 Netapp, Inc. Server fault analysis system using event logs
US10180869B2 (en) 2016-02-16 2019-01-15 Microsoft Technology Licensing, Llc Automated ordering of computer system repair
CN106469098A (zh) * 2016-09-19 2017-03-01 广州日滨科技发展有限公司 一种设备的故障处理方法和装置
US10379929B2 (en) * 2016-12-19 2019-08-13 Microsoft Technology Licensing, Llc Enhanced diagnostic and remediation system
US11188416B2 (en) * 2018-07-12 2021-11-30 Micron Technology, Inc. Enhanced block management for a memory sub-system
BR112021002673A2 (pt) * 2018-08-20 2021-05-11 Skf Ai, Ltd. fornecimento de recomendações de solução corretiva para uma falha de máquina industrial
US10936246B2 (en) 2018-10-10 2021-03-02 Micron Technology, Inc. Dynamic background scan optimization in a memory sub-system
US11144038B2 (en) 2019-09-27 2021-10-12 Rockwell Automation Technologies, Inc. System and method for industrial automation troubleshooting
US20230032571A1 (en) * 2019-12-23 2023-02-02 Embraer S.A. Systems and methods for an agnostic system functional status determination and automatic management of failures

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4633467A (en) * 1984-07-26 1986-12-30 At&T Bell Laboratories Computer system fault recovery based on historical analysis
US5214653A (en) * 1990-10-22 1993-05-25 Harris Corporation Fault finder expert system
US5253184A (en) * 1991-06-19 1993-10-12 Storage Technology Corporation Failure and performance tracking system
US5293556A (en) * 1991-07-29 1994-03-08 Storage Technology Corporation Knowledge based field replaceable unit management
JP3675851B2 (ja) * 1994-03-15 2005-07-27 富士通株式会社 計算機監視方式
US5561760A (en) * 1994-09-22 1996-10-01 International Business Machines Corporation System for localizing field replaceable unit failures employing automated isolation procedures and weighted fault probability encoding
US6633782B1 (en) * 1999-02-22 2003-10-14 Fisher-Rosemount Systems, Inc. Diagnostic expert in a process control system
US6625745B1 (en) 1999-03-17 2003-09-23 Hewlett-Packard Development Co.Lp Network component failure identification with minimal testing
US6622264B1 (en) * 1999-10-28 2003-09-16 General Electric Company Process and system for analyzing fault log data from a machine so as to identify faults predictive of machine failures
US6415395B1 (en) * 1999-04-02 2002-07-02 General Electric Company Method and system for processing repair data and fault log data to facilitate diagnostics
US6487677B1 (en) * 1999-09-30 2002-11-26 Lsi Logic Corporation Methods and systems for dynamic selection of error recovery procedures in a managed device
US7113988B2 (en) * 2000-06-29 2006-09-26 International Business Machines Corporation Proactive on-line diagnostics in a manageable network
US6574537B2 (en) * 2001-02-05 2003-06-03 The Boeing Company Diagnostic system and method
JP2003091314A (ja) * 2001-09-17 2003-03-28 Toshiba Corp 監視制御システム
JP4796251B2 (ja) * 2001-09-21 2011-10-19 株式会社日立製作所 ネットワークストレージシステム及びその制御方法
US6895533B2 (en) * 2002-03-21 2005-05-17 Hewlett-Packard Development Company, L.P. Method and system for assessing availability of complex electronic systems, including computer systems
US6892159B2 (en) 2002-05-17 2005-05-10 Sun Microsystems, Inc. Method and system for storing field replaceable unit operational history information
US7050941B2 (en) * 2002-06-07 2006-05-23 Arkray, Inc. Trouble countermeasure support system and terminal device connected to the same
GB2391132B (en) * 2002-07-19 2005-09-21 Hewlett Packard Co Fault diagnosis in a network
US7194445B2 (en) * 2002-09-20 2007-03-20 Lenovo (Singapore) Pte. Ltd. Adaptive problem determination and recovery in a computer system
JP2004355424A (ja) * 2003-05-30 2004-12-16 Hitachi Ltd 情報処理装置の障害管理方式
US20050091356A1 (en) * 2003-10-24 2005-04-28 Matthew Izzo Method and machine-readable medium for using matrices to automatically analyze network events and objects
US7206771B2 (en) * 2003-11-11 2007-04-17 International Business Machines Corporation Automated knowledge system for equipment repair based on component failure history

Also Published As

Publication number Publication date
WO2006077193A1 (en) 2006-07-27
EP1851634A1 (en) 2007-11-07
JP2008527554A (ja) 2008-07-24
CN101107594A (zh) 2008-01-16
US20060161819A1 (en) 2006-07-20
CN100559350C (zh) 2009-11-11
US7409595B2 (en) 2008-08-05

Similar Documents

Publication Publication Date Title
JP4717079B2 (ja) コンピュータ・システムにおける障害の診断および保守のための方法およびシステム(疑わしいコンポーネントの履歴ベースの優先順位付け)
US20090300430A1 (en) History-based prioritizing of suspected components
EP0333620B1 (en) On-line problem management for data-processing systems
US4922491A (en) Input/output device service alert function
US5293556A (en) Knowledge based field replaceable unit management
EP0570505B1 (en) Knowledge based machine initiated maintenance system and method
US5253184A (en) Failure and performance tracking system
EP0570513B1 (en) Maintenance apparatus and method initiated by a hierarchical distributed knowledge based machine
EP0471635B1 (en) Automated enrolment of a computer system into a service network of computer systems
US8108724B2 (en) Field replaceable unit failure determination
US20160378583A1 (en) Management computer and method for evaluating performance threshold value
US9081656B2 (en) Methods and systems for predicting a fault
WO1992020026A1 (en) Knowledge based resource management
US8032789B2 (en) Apparatus maintenance system and method
JP5696492B2 (ja) 故障検出装置、故障検出方法、及び、故障検出プログラム
EP0471636B1 (en) Flexible service network for computer systems
EP0471638B1 (en) Problem prevention on a computer system in a service network of computer systems
EP0471637B1 (en) Tracking the resolution of a problem on a computer system in a service network of computer systems
WO2023047806A1 (ja) 情報処理装置、および自動分析システム
WO2018168606A1 (ja) 情報処理装置、情報処理方法およびプログラム記録媒体
JPS61208547A (ja) 故障診断支援装置

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20081014

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20081014

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20100813

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20100817

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20101108

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20110322

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20110329

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140408

Year of fee payment: 3

LAPS Cancellation because of no payment of annual fees