CN100559350C - 基于历史对可疑组件排优先级 - Google Patents

基于历史对可疑组件排优先级 Download PDF

Info

Publication number
CN100559350C
CN100559350C CNB2006800024654A CN200680002465A CN100559350C CN 100559350 C CN100559350 C CN 100559350C CN B2006800024654 A CNB2006800024654 A CN B2006800024654A CN 200680002465 A CN200680002465 A CN 200680002465A CN 100559350 C CN100559350 C CN 100559350C
Authority
CN
China
Prior art keywords
list
previous
computerized system
time
response
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2006800024654A
Other languages
English (en)
Chinese (zh)
Other versions
CN101107594A (zh
Inventor
O·尼桑-梅辛
A·兹洛特尼克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN101107594A publication Critical patent/CN101107594A/zh
Application granted granted Critical
Publication of CN100559350C publication Critical patent/CN100559350C/zh
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2294Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing by remote test
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079Root cause analysis, i.e. error or fault diagnosis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Debugging And Monitoring (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Test And Diagnosis Of Digital Computers (AREA)
CNB2006800024654A 2005-01-18 2006-01-12 基于历史对可疑组件排优先级 Expired - Fee Related CN100559350C (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/037,513 2005-01-18
US11/037,513 US7409595B2 (en) 2005-01-18 2005-01-18 History-based prioritizing of suspected components

Publications (2)

Publication Number Publication Date
CN101107594A CN101107594A (zh) 2008-01-16
CN100559350C true CN100559350C (zh) 2009-11-11

Family

ID=35914581

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006800024654A Expired - Fee Related CN100559350C (zh) 2005-01-18 2006-01-12 基于历史对可疑组件排优先级

Country Status (5)

Country Link
US (1) US7409595B2 (enExample)
EP (1) EP1851634A1 (enExample)
JP (1) JP4717079B2 (enExample)
CN (1) CN100559350C (enExample)
WO (1) WO2006077193A1 (enExample)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2302133T3 (es) * 2005-01-26 2008-07-01 Oce-Technologies B.V. Analisis automatico de comportamiento y solucion de fallos.
US7689873B1 (en) * 2005-09-19 2010-03-30 Google Inc. Systems and methods for prioritizing error notification
US7613949B1 (en) * 2006-06-30 2009-11-03 Boone Lewis A Fault isolation system and method
EP1993014B1 (de) * 2007-05-16 2011-06-29 Siemens Aktiengesellschaft Verfahren zum Lokalisieren von defekten Hardwarekomponenten und/oder Systemfehlern innerhalb einer Produktionsanlage
US8010325B2 (en) * 2008-04-25 2011-08-30 Microsoft Corporation Failure simulation and availability report on same
US20090292956A1 (en) * 2008-05-23 2009-11-26 Microsoft Corporation Trend based test failure prioritization
US8266594B2 (en) 2008-08-20 2012-09-11 International Business Machines Corporation System, method and program product for correcting semantic errors in code using peer submitted code snippets
US8756576B2 (en) * 2008-08-20 2014-06-17 International Business Machines Corporation Ranking peer submitted code snippets using execution feedback
US8713534B2 (en) * 2008-08-20 2014-04-29 International Business Machines Corporation System, method and program product for guiding correction of semantic errors in code using collaboration records
JP5439775B2 (ja) * 2008-09-17 2014-03-12 富士通株式会社 障害対応プログラム、障害対応装置、及び障害対応システム
US7949900B2 (en) * 2008-09-19 2011-05-24 International Business Machines Corporation Autonomously configuring information systems to support mission objectives
US8185781B2 (en) * 2009-04-09 2012-05-22 Nec Laboratories America, Inc. Invariants-based learning method and system for failure diagnosis in large scale computing systems
US8024609B2 (en) * 2009-06-03 2011-09-20 International Business Machines Corporation Failure analysis based on time-varying failure rates
US11269303B2 (en) 2009-06-22 2022-03-08 Johnson Controls Technology Company Systems and methods for detecting changes in energy usage in a building
US8600556B2 (en) 2009-06-22 2013-12-03 Johnson Controls Technology Company Smart building manager
US20110020122A1 (en) * 2009-07-24 2011-01-27 Honeywell International Inc. Integrated condition based maintenance system for wind turbines
US20110314331A1 (en) * 2009-10-29 2011-12-22 Cybernet Systems Corporation Automated test and repair method and apparatus applicable to complex, distributed systems
US8156377B2 (en) * 2010-07-02 2012-04-10 Oracle International Corporation Method and apparatus for determining ranked causal paths for faults in a complex multi-host system with probabilistic inference in a time series
US8069370B1 (en) 2010-07-02 2011-11-29 Oracle International Corporation Fault identification of multi-host complex systems with timesliding window analysis in a time series
US8291263B2 (en) 2010-07-02 2012-10-16 Oracle International Corporation Methods and apparatus for cross-host diagnosis of complex multi-host systems in a time series with probabilistic inference
US8230262B2 (en) 2010-07-02 2012-07-24 Oracle International Corporation Method and apparatus for dealing with accumulative behavior of some system observations in a time series for Bayesian inference with a static Bayesian network model
US8234523B2 (en) * 2010-07-28 2012-07-31 Honeywell International Inc. Automatic determination of success of using a computerized decision support system
FR2989499B1 (fr) 2012-04-12 2014-05-16 Airbus Operations Sas Procede, dispositifs et programme d'ordinateur d'aide au diagnostic preventif d'un systeme d'un aeronef, utilisant des graphes d'evenements redoutes
CN104956373A (zh) 2012-12-04 2015-09-30 惠普发展公司,有限责任合伙企业 确定异常网络行为的可疑根本原因
US20140259167A1 (en) * 2013-03-11 2014-09-11 Samsung Electronics Co. Ltd. Behavior based application blacklisting
US10628246B1 (en) * 2013-05-20 2020-04-21 The Boeing Company Methods and systems for prioritizing corrective actions in a troubleshooting chart
US10388087B2 (en) * 2014-04-02 2019-08-20 Sikorsky Aircraft Corporation System and method for improved health management and maintenance decision support
US9424063B2 (en) * 2014-04-29 2016-08-23 Vmware, Inc. Method and system for generating remediation options within a cluster of host computers that run virtual machines
US9389900B2 (en) 2014-04-29 2016-07-12 Vmware, Inc. Method and system for supporting a change in state within a cluster of host computers that run virtual machines
US9747152B2 (en) * 2015-04-27 2017-08-29 Splunk Inc. Tracking incomplete transactions in correlation with application errors
US10474519B2 (en) * 2015-09-17 2019-11-12 Netapp, Inc. Server fault analysis system using event logs
US10180869B2 (en) 2016-02-16 2019-01-15 Microsoft Technology Licensing, Llc Automated ordering of computer system repair
CN106469098A (zh) * 2016-09-19 2017-03-01 广州日滨科技发展有限公司 一种设备的故障处理方法和装置
US10379929B2 (en) * 2016-12-19 2019-08-13 Microsoft Technology Licensing, Llc Enhanced diagnostic and remediation system
US11188416B2 (en) * 2018-07-12 2021-11-30 Micron Technology, Inc. Enhanced block management for a memory sub-system
WO2020041020A1 (en) * 2018-08-20 2020-02-27 Presenso, Ltd. Providing corrective solution recommendations for an industrial machine failure
US10936246B2 (en) 2018-10-10 2021-03-02 Micron Technology, Inc. Dynamic background scan optimization in a memory sub-system
US11144038B2 (en) 2019-09-27 2021-10-12 Rockwell Automation Technologies, Inc. System and method for industrial automation troubleshooting
EP4081872A4 (en) * 2019-12-23 2023-12-27 Embraer S.A. SYSTEMS AND METHODS FOR DETERMINING THE FUNCTIONAL STATE OF AN AGNOSTIC SYSTEM AND AUTOMATIC MANAGEMENT OF FAILURES

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6487677B1 (en) * 1999-09-30 2002-11-26 Lsi Logic Corporation Methods and systems for dynamic selection of error recovery procedures in a managed device
US6625745B1 (en) * 1999-03-17 2003-09-23 Hewlett-Packard Development Co.Lp Network component failure identification with minimal testing
US20030216881A1 (en) * 2002-05-17 2003-11-20 Sun Microsystems, Inc. Method and system for storing field replaceable unit operational history information
WO2003105039A1 (ja) * 2002-06-07 2003-12-18 アークレイ株式会社 トラブル対処支援システムおよびこれに接続される端末装置

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4633467A (en) * 1984-07-26 1986-12-30 At&T Bell Laboratories Computer system fault recovery based on historical analysis
US5214653A (en) * 1990-10-22 1993-05-25 Harris Corporation Fault finder expert system
US5253184A (en) * 1991-06-19 1993-10-12 Storage Technology Corporation Failure and performance tracking system
US5293556A (en) * 1991-07-29 1994-03-08 Storage Technology Corporation Knowledge based field replaceable unit management
JP3675851B2 (ja) * 1994-03-15 2005-07-27 富士通株式会社 計算機監視方式
US5561760A (en) * 1994-09-22 1996-10-01 International Business Machines Corporation System for localizing field replaceable unit failures employing automated isolation procedures and weighted fault probability encoding
US6633782B1 (en) * 1999-02-22 2003-10-14 Fisher-Rosemount Systems, Inc. Diagnostic expert in a process control system
US6622264B1 (en) * 1999-10-28 2003-09-16 General Electric Company Process and system for analyzing fault log data from a machine so as to identify faults predictive of machine failures
US6415395B1 (en) * 1999-04-02 2002-07-02 General Electric Company Method and system for processing repair data and fault log data to facilitate diagnostics
US7113988B2 (en) * 2000-06-29 2006-09-26 International Business Machines Corporation Proactive on-line diagnostics in a manageable network
US6574537B2 (en) * 2001-02-05 2003-06-03 The Boeing Company Diagnostic system and method
JP2003091314A (ja) * 2001-09-17 2003-03-28 Toshiba Corp 監視制御システム
JP4796251B2 (ja) * 2001-09-21 2011-10-19 株式会社日立製作所 ネットワークストレージシステム及びその制御方法
US6895533B2 (en) * 2002-03-21 2005-05-17 Hewlett-Packard Development Company, L.P. Method and system for assessing availability of complex electronic systems, including computer systems
GB2391132B (en) * 2002-07-19 2005-09-21 Hewlett Packard Co Fault diagnosis in a network
US7194445B2 (en) * 2002-09-20 2007-03-20 Lenovo (Singapore) Pte. Ltd. Adaptive problem determination and recovery in a computer system
JP2004355424A (ja) * 2003-05-30 2004-12-16 Hitachi Ltd 情報処理装置の障害管理方式
US20050091356A1 (en) * 2003-10-24 2005-04-28 Matthew Izzo Method and machine-readable medium for using matrices to automatically analyze network events and objects
US7206771B2 (en) * 2003-11-11 2007-04-17 International Business Machines Corporation Automated knowledge system for equipment repair based on component failure history

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6625745B1 (en) * 1999-03-17 2003-09-23 Hewlett-Packard Development Co.Lp Network component failure identification with minimal testing
US6487677B1 (en) * 1999-09-30 2002-11-26 Lsi Logic Corporation Methods and systems for dynamic selection of error recovery procedures in a managed device
US20030216881A1 (en) * 2002-05-17 2003-11-20 Sun Microsystems, Inc. Method and system for storing field replaceable unit operational history information
WO2003105039A1 (ja) * 2002-06-07 2003-12-18 アークレイ株式会社 トラブル対処支援システムおよびこれに接続される端末装置

Also Published As

Publication number Publication date
EP1851634A1 (en) 2007-11-07
JP2008527554A (ja) 2008-07-24
WO2006077193A1 (en) 2006-07-27
US20060161819A1 (en) 2006-07-20
CN101107594A (zh) 2008-01-16
US7409595B2 (en) 2008-08-05
JP4717079B2 (ja) 2011-07-06

Similar Documents

Publication Publication Date Title
CN100559350C (zh) 基于历史对可疑组件排优先级
US20090300430A1 (en) History-based prioritizing of suspected components
US5253184A (en) Failure and performance tracking system
US7328376B2 (en) Error reporting to diagnostic engines based on their diagnostic capabilities
US5469463A (en) Expert system for identifying likely failure points in a digital data processing system
US4922491A (en) Input/output device service alert function
EP0570505B1 (en) Knowledge based machine initiated maintenance system and method
Zheng et al. Co-analysis of RAS log and job log on Blue Gene/P
CN112732477B (zh) 一种带外自检故障隔离的方法
US20160378583A1 (en) Management computer and method for evaluating performance threshold value
CN107870832B (zh) 基于多维度健康诊断方法的多路径存储设备
US20160019131A1 (en) Methods and Arrangements to Collect Data
JPH01243135A (ja) コンピュータ・システムにおける問題解決方法
US8347142B2 (en) Non-disruptive I/O adapter diagnostic testing
WO2023071039A1 (zh) 一种故障诊断方法、装置、设备及可读存储介质
CN111835566A (zh) 一种系统故障管理方法、装置及系统
JP7595823B1 (ja) 情報処理装置、情報処理方法及び情報処理プログラム
CN111190781A (zh) 服务器系统的测试自检方法
EP0471636B1 (en) Flexible service network for computer systems
CN111427328A (zh) 一种降低家居系统故障的方法
JP7590300B2 (ja) 情報処理装置、および自動分析システム
EP0471638B1 (en) Problem prevention on a computer system in a service network of computer systems
EP0471637A2 (en) Tracking the resolution of a problem on a computer system in a service network of computer systems
CN115913895A (zh) 一种服务器故障诊断告警的方法、装置、设备及介质
CN118519805A (zh) 一种故障线索检测方法、装置、设备及可读存储介质

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20091111

Termination date: 20190112