CN106844078A - A kind for the treatment of method and apparatus of PCIE failures - Google Patents

A kind for the treatment of method and apparatus of PCIE failures Download PDF

Info

Publication number
CN106844078A
CN106844078A CN201611230230.2A CN201611230230A CN106844078A CN 106844078 A CN106844078 A CN 106844078A CN 201611230230 A CN201611230230 A CN 201611230230A CN 106844078 A CN106844078 A CN 106844078A
Authority
CN
China
Prior art keywords
pcie
kernel
fault messages
failures
user space
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611230230.2A
Other languages
Chinese (zh)
Inventor
常现超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201611230230.2A priority Critical patent/CN106844078A/en
Publication of CN106844078A publication Critical patent/CN106844078A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0766Error or fault reporting or storing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079Root cause analysis, i.e. error or fault diagnosis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Debugging And Monitoring (AREA)

Abstract

This application discloses a kind for the treatment of method and apparatus of PCIE failures, the method gathers PCIE fault messages in being included in kernel;The PCIE fault messages are transferred to User space from kernel;The PCIE fault messages collected are analyzed in User space;According to the result of analysis, the PCIE failures are repaired or isolated.The device includes collecting unit, for gathering PCIE fault messages in kernel;Transmission unit, for the PCIE fault messages to be transferred into User space from kernel;Analytic unit, for being analyzed to the PCIE fault messages collected in User space;Repair and isolated location, for the result according to analysis, the PCIE failures are repaired or isolated.The above method and device bothersome laborious go to repair failure without artificial, it is possible to increase the efficiency and quality of fault restoration.

Description

A kind for the treatment of method and apparatus of PCIE failures
Technical field
The invention belongs to Computer Applied Technology field, more particularly to a kind for the treatment of method and apparatus of PCIE failures.
Background technology
With developing rapidly for computer technology and integrated circuit technique, no matter being obtained for from software or hardware winged Speed lifting.Because many peripheral hardwares of computer are all that PCIE (Peripheral Component Interface Express) sets Standby, with being continuously increased for number of devices, the probability that PCIE device breaks down is also increasing, brings very big to keeper Challenge, this is accomplished by the health status that keeper often pays close attention to PCIE device, nonetheless, it is also difficult to find failure in time. , it is necessary to keeper checks substantial amounts of system journal and analyzes during PCIE generation failures, take a long time reparation and break down Equipment, and data volumes of some services are huge, and the cluster of server is also big, and maintenance gets up to waste time and energy, and may be tight The service impacting quality of weight.
The content of the invention
To solve the above problems, the invention provides a kind for the treatment of method and apparatus of PCIE failures, without artificial bothersome Laborious goes reparation failure, it is possible to increase the efficiency and quality of fault restoration.
A kind of processing method of PCIE failures that the present invention is provided, including:
PCIE fault messages are gathered in kernel;
The PCIE fault messages are transferred to User space from kernel;
The PCIE fault messages collected are analyzed in User space;
According to the result of analysis, the PCIE failures are repaired or isolated.
Preferably, in the processing method of above-mentioned PCIE failures, the PCIE failures are repaired or is isolated described Afterwards, also include:
The PCIE fault messages are notified into keeper.
Preferably, in the processing method of above-mentioned PCIE failures, the PCIE fault messages are notified into keeper described Afterwards, also include:
Alarmed for the PCIE fault messages.
Preferably, in the processing method of above-mentioned PCIE failures, the PCIE fault messages that gathered in kernel are:
To kernel patch is squeezed into system, kernel code is changed, PCIE fault messages are gathered in kernel.
Preferably, it is described that the PCIE fault messages are transferred to from kernel in the processing method of above-mentioned PCIE failures User space is:
The PCIE fault messages are transferred to by User space from kernel with the communication mode of netlink.
A kind of processing unit of PCIE failures that the present invention is provided, including:
Collecting unit, for gathering PCIE fault messages in kernel;
Transmission unit, for the PCIE fault messages to be transferred into User space from kernel;
Analytic unit, for being analyzed to the PCIE fault messages collected in User space;
Repair and isolated location, for the result according to analysis, the PCIE failures are repaired or isolated.
Preferably, in the processing unit of above-mentioned PCIE failures,
Also include:
Notification unit, for the PCIE fault messages to be notified into keeper.
Preferably, in the processing unit of above-mentioned PCIE failures, also include:
Alarm unit, for being alarmed for the PCIE fault messages.
Preferably, in the processing unit of above-mentioned PCIE failures, the collecting unit in system specifically in squeezing into Core patch, changes kernel code, and PCIE fault messages are gathered in kernel.
Preferably, in the processing unit of above-mentioned PCIE failures, the transmission unit is specifically for the communication of netlink The PCIE fault messages are transferred to User space by mode from kernel.
The treating method and apparatus of the above-mentioned PCIE failures provided by foregoing description, the present invention, due to the method It is included in collection PCIE fault messages in kernel;The PCIE fault messages are transferred to User space from kernel;In User space pair The PCIE fault messages collected are analyzed;According to the result of analysis, the PCIE failures are repaired or isolated, because This bothersome laborious goes to repair failure without artificial, it is possible to increase the efficiency and quality of fault restoration.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing The accompanying drawing to be used needed for having technology description is briefly described, it should be apparent that, drawings in the following description are only this Inventive embodiment, for those of ordinary skill in the art, on the premise of not paying creative work, can also basis The accompanying drawing of offer obtains other accompanying drawings.
The schematic diagram of the processing method of the first PCIE failure that Fig. 1 is provided for the embodiment of the present application;
The schematic diagram of the processing unit of the first PCIE failure that Fig. 2 is provided for the embodiment of the present application;
The 4th kind of schematic diagram of the processing unit of PCIE failures that Fig. 3 is provided for the embodiment of the present application.
Specific embodiment
Core concept of the invention is to provide a kind for the treatment of method and apparatus of PCIE failures, without artificial bothersome laborious Go repair failure, it is possible to increase the efficiency and quality of fault restoration.
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation is described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.It is based on Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under the premise of creative work is not made Embodiment, belongs to the scope of protection of the invention.
The processing method of the first PCIE failure that the embodiment of the present application is provided is as shown in figure 1, Fig. 1 is the embodiment of the present application The schematic diagram of the processing method of the first the PCIE failure for providing, the method comprises the following steps:
S1:PCIE fault messages are gathered in kernel;
It should be noted that failure benefit can be squeezed into the operating system nucleus of computer, using KPatch instruments Fourth, for collecting fault message, wherein fault message can include but is not limited to position and the failure cause of failure generation, and It is packaged and is transmitted.Furthermore it is possible to squeeze into patch module during operating system, without going to compile again in Core, and usually said patch, are got to patch inside kernel source code when kernel is compiled, and are then compiled, specifically, Can pass through/proc files, code is directly changed inside kernel, the collection of fault message can also be so realized, herein It is not intended to limit concrete implementation mode.
S2:The PCIE fault messages are transferred to User space from kernel;
It should be noted that due to collecting the position of fault message in kernel, and follow-up processing procedure occurs in user State, it is therefore desirable to which PCIE fault messages are transferred to User space from kernel, and specific transmission means includes but is not limited to utilize Netlink passages.
S3:The PCIE fault messages collected are analyzed in User space;
Specifically, statistic of classification can be carried out to the PCIE fault messages, the result analyzed.
S4:According to the result of analysis, the PCIE failures are repaired or isolated.
It should be noted that in this step, after the completion of analysis, it is possible to attempt it is automatic repair failure, if can not repair Work(, such as EMS memory error, it is possible to the internal memory of failure is done and is isolated, it is to avoid failure memory again by using causing system unstable, Avoid the failure that serious influence is caused to system or key service, produce serious consequence, this mode to make up people It is monitoring PCIE device health status, the inefficiency of manual administration failure and analysis Trouble cause and can not be timely and effective The deficiency for processing and causing machine to be unable to stable operation.
By foregoing description, the processing method of above-mentioned the first PCIE failure that the embodiment of the present application is provided is included in PCIE fault messages are gathered in kernel;The PCIE fault messages are transferred to User space from kernel;In User space to collection The PCIE fault messages are analyzed;According to the result of analysis, the PCIE failures are repaired or isolated, therefore need not It is artificial bothersome laborious to go to repair failure, it is possible to increase the efficiency and quality of fault restoration.
Second processing method of PCIE failures that the embodiment of the present application is provided, is at the place of above-mentioned the first PCIE failure On the basis of reason method, also including following technical characteristic:
It is described the PCIE failures are repaired or isolated after, also include:
The PCIE fault messages are notified into keeper.
Specifically, the result and detailed information of failure are sent to keeper, can be with short message or the side of mail Formula is notified, to ensure troubleshooting rationally, specific form includes but is not limited to make chart or curve, with Added Management Member more intuitively observes failure.
The processing method of the third PCIE failure that the embodiment of the present application is provided, is at the place of above-mentioned second PCIE failures On the basis of reason method, also including following technical characteristic:
It is described the PCIE fault messages are notified into keeper after, also include:
Alarmed for the PCIE fault messages.
It should be noted that some fault messages are more serious, therefore information is allowed to allow keeper to understand simultaneously with prestissimo Treatment is very important, such as when certain hardware damage cannot be repaired, in order to not influence the normal of system to use, must just enter Row isolation, by taking CPU as an example, CPU has 24 cores on a machine, if one of core has been damaged and cannot repaired, must just use up Fast isolation, it is impossible to reuse, other 23 can also use, but performance has just declined, must now notify keeper and When more exchange device, the mode of this alarm can show that state of affairs urgency level so that the problem of keeper's priority treatment equipment.
The 4th kind of processing method of PCIE failures that the embodiment of the present application is provided, is at the place of above-mentioned the third PCIE failure On the basis of reason method, also including following technical characteristic:
It is described in kernel gather PCIE fault messages be:
To kernel patch is squeezed into system, kernel code is changed, PCIE fault messages are gathered in kernel.
It should be noted that by the way of kernel patch is squeezed into, can be loaded directly into the case where kernel is not compiled Patch module, obtains failure, in hgher efficiency.
The 5th kind of processing method of PCIE failures that the embodiment of the present application is provided, be it is above-mentioned the first to the 4th kind of PCIE In the processing method of failure on the basis of any one, also including following technical characteristic:
It is described the PCIE fault messages are transferred to User space from kernel to be:
The PCIE fault messages are transferred to by User space from kernel with the communication mode of netlink.
It should be noted that Netlink is kernel state and the mode of User space communication in linux system, when PCIE is produced Patch module will be collected into dependent failure information after failure, then place this information in the passage of netlink, be sent to User space.
The processing unit of the first PCIE failure that the embodiment of the present application is provided is as shown in Fig. 2 Fig. 2 is the embodiment of the present application The schematic diagram of the processing unit of the first the PCIE failure for providing, the device includes:
Collecting unit 201, for gathering PCIE fault messages in kernel, it is necessary to illustrate, can be in computer In operating system nucleus, using KPatch instruments, failure patch is squeezed into, for collecting fault message, wherein fault message can be with Position and failure cause that including but not limited to failure occurs, and be packaged and transmitted.Furthermore it is possible to be in operation Patch module is squeezed into during system operation, without going to compile kernel again, and usually said patch, it is when kernel is compiled Wait and patch is got into kernel source code the inside, then compile, specifically, can pass through/proc files, directly repaiied inside kernel Change code, can also so realize the collection of fault message, concrete implementation mode is not intended to limit herein;
Transmission unit 202, for the PCIE fault messages to be transferred into User space from kernel, it is necessary to illustrate, by In the position of collection fault message in kernel, and follow-up processing procedure occurs in User space, it is therefore desirable to by PCIE failures letter Breath is transferred to User space from kernel, and specific transmission means is included but is not limited to using netlink passages;
Analytic unit 203, for being analyzed to the PCIE fault messages collected in User space, specifically, can be with Statistic of classification is carried out to the PCIE fault messages, the result analyzed;
Repair and isolated location 204, for the result according to analysis, the PCIE failures are repaired or isolated, need It is noted that after the completion of analysis, it is possible to attempt automatic reparation failure, if reparation is unsuccessful, such as EMS memory error, it is possible to will The internal memory of failure does isolates, it is to avoid failure memory is again by using causing system unstable, it is to avoid the failure is to system or pass Key service causes serious influence, produces serious consequence, this mode can make up artificial monitoring PCIE device health status, Manual administration failure and analysis Trouble cause inefficiency and can not it is timely and effective treatment and cause machine can not stablize The deficiency of operation.
Second processing unit of PCIE failures that the embodiment of the present application is provided, is at the place of above-mentioned the first PCIE failure On the basis of reason device, also including following technical characteristic:
Notification unit, for the PCIE fault messages to be notified into keeper.
Specifically, the result and detailed information of failure are sent to keeper, and to ensure reasonable handling failure, tool The form of body is included but is not limited to make chart or curve, and failure is more intuitively observed with Added Management person, and with short message or postal The mode of part is notified.
The processing unit of the third PCIE failure that the embodiment of the present application is provided, is at the place of above-mentioned second PCIE failures On the basis of reason device, also including following technical characteristic:
Alarm unit, for being alarmed for the PCIE fault messages.
It should be noted that some fault messages are more serious, therefore information is allowed to allow keeper to understand simultaneously with prestissimo Treatment is extremely important, such as when certain hardware damage cannot be repaired, in order to not influence the normal of system to use, just must go to every From, by taking CPU as an example, CPU has 24 cores on a machine, if one of core has been damaged and cannot repaired, just must as early as possible every From that can not reuse, other 23 also can be to use, but performance has just declined, and now must send out warning notice management Member's more exchange device in time, the mode of this alarm can show that state of affairs urgency level so that keeper's priority treatment equipment Problem.
The 4th kind of processing unit of PCIE failures that the embodiment of the present application is provided, is at the place of above-mentioned the third PCIE failure On the basis of reason device, also including following technical characteristic:
The collecting unit is gathered specifically for kernel patch is squeezed into system, changing kernel code in kernel PCIE fault messages.
Specifically, the 4th kind of signal of the processing unit of PCIE failures provided for the embodiment of the present application with reference to Fig. 3, Fig. 3 Figure, the device includes the kernel 402 being connected with PCIE device 401, kernel patch 403 is squeezed into kernel 402, the kernel patch 403 are transferred to analytic unit 405 after collection PCIE fault messages in kernel 402 using transmission unit 404, further according to analysis As a result, using repairing and isolated location 406 is repaired or isolated, it is necessary to illustrate, using the side for squeezing into kernel patch Formula, can be loaded directly into patch module in the case where kernel is not compiled, and obtain failure, and treatment effeciency is higher.
The 5th kind of processing unit of PCIE failures that the embodiment of the present application is provided, be it is above-mentioned the first to the 4th kind of PCIE In the processing unit of failure on the basis of any one, also including following technical characteristic:
Specifically for from kernel be transferred to the PCIE fault messages with the communication mode of netlink by the transmission unit User space.
It should be noted that netlink is kernel state and the mode of User space communication in linux system, when PCIE is produced Patch module will be collected into dependent failure information after failure, then place this information in the passage of netlink, be sent to User space.
In sum, the embodiment of the present application is provided the above method and device, can reduce the work of fault management, realize The automation of fault management, can timely and effectively find and solve failure, it is ensured that the safe and reliable fortune of system and key service OK.
The foregoing description of the disclosed embodiments, enables professional and technical personnel in the field to realize or uses the present invention. Various modifications to these embodiments will be apparent for those skilled in the art, as defined herein General Principle can be realized in other embodiments without departing from the spirit or scope of the present invention.Therefore, the present invention The embodiments shown herein is not intended to be limited to, and is to fit to and principles disclosed herein and features of novelty phase one The scope most wide for causing.

Claims (10)

1. a kind of processing method of PCIE failures, it is characterised in that including:
PCIE fault messages are gathered in kernel;
The PCIE fault messages are transferred to User space from kernel;
The PCIE fault messages collected are analyzed in User space;
According to the result of analysis, the PCIE failures are repaired or isolated.
2. the processing method of PCIE failures according to claim 1, it is characterised in that
It is described the PCIE failures are repaired or isolated after, also include:
The PCIE fault messages are notified into keeper.
3. the processing method of PCIE failures according to claim 2, it is characterised in that
It is described the PCIE fault messages are notified into keeper after, also include:
Alarmed for the PCIE fault messages.
4. the processing method of PCIE failures according to claim 3, it is characterised in that
It is described in kernel gather PCIE fault messages be:
To kernel patch is squeezed into system, kernel code is changed, PCIE fault messages are gathered in kernel.
5. the processing method of the PCIE failures according to claim any one of 1-4, it is characterised in that
It is described the PCIE fault messages are transferred to User space from kernel to be:
The PCIE fault messages are transferred to by User space from kernel with the communication mode of netlink.
6. a kind of processing unit of PCIE failures, it is characterised in that including:
Collecting unit, for gathering PCIE fault messages in kernel;
Transmission unit, for the PCIE fault messages to be transferred into User space from kernel;
Analytic unit, for being analyzed to the PCIE fault messages collected in User space;
Repair and isolated location, for the result according to analysis, the PCIE failures are repaired or isolated.
7. the processing unit of PCIE failures according to claim 6, it is characterised in that
Also include:
Notification unit, for the PCIE fault messages to be notified into keeper.
8. the processing unit of PCIE failures according to claim 7, it is characterised in that
Also include:
Alarm unit, for being alarmed for the PCIE fault messages.
9. the processing unit of PCIE failures according to claim 8, it is characterised in that
The collecting unit gathers PCIE events specifically for kernel patch is squeezed into system, changing kernel code in kernel Barrier information.
10. the processing unit of the PCIE failures according to claim any one of 6-9, it is characterised in that
The transmission unit from kernel by the PCIE fault messages with the communication mode of netlink specifically for being transferred to user State.
CN201611230230.2A 2016-12-27 2016-12-27 A kind for the treatment of method and apparatus of PCIE failures Pending CN106844078A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611230230.2A CN106844078A (en) 2016-12-27 2016-12-27 A kind for the treatment of method and apparatus of PCIE failures

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611230230.2A CN106844078A (en) 2016-12-27 2016-12-27 A kind for the treatment of method and apparatus of PCIE failures

Publications (1)

Publication Number Publication Date
CN106844078A true CN106844078A (en) 2017-06-13

Family

ID=59113310

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611230230.2A Pending CN106844078A (en) 2016-12-27 2016-12-27 A kind for the treatment of method and apparatus of PCIE failures

Country Status (1)

Country Link
CN (1) CN106844078A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108228374A (en) * 2017-12-28 2018-06-29 华为技术有限公司 A kind of fault handling method of equipment, apparatus and system
CN109815043A (en) * 2019-01-25 2019-05-28 华为技术有限公司 Fault handling method, relevant device and computer storage medium
CN112732477A (en) * 2021-04-01 2021-04-30 四川华鲲振宇智能科技有限责任公司 Method for fault isolation by out-of-band self-checking
US11994940B2 (en) 2019-01-25 2024-05-28 Huawei Cloud Computing Technologies Co., Ltd. Fault processing method, related device, and computer storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101296135A (en) * 2008-06-27 2008-10-29 中兴通讯股份有限公司 Fault information processing method and device
CN102063344A (en) * 2009-11-18 2011-05-18 中兴通讯股份有限公司 Method and system for system fault information dump
CN102222194A (en) * 2011-07-14 2011-10-19 哈尔滨工业大学 Module and method for LINUX host computing environment safety protection
CN103593189A (en) * 2013-11-14 2014-02-19 昆明理工大学 Method for implementing user mode drive program in embedded Linux
CN105354103A (en) * 2014-12-19 2016-02-24 汉柏科技有限公司 Method for managing watchdog in user mode
CN105630620A (en) * 2015-12-23 2016-06-01 浪潮集团有限公司 Machine fault automated processing method
CN106254139A (en) * 2016-08-30 2016-12-21 四川长虹网络科技有限责任公司 A kind of fault collection processes exchange method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101296135A (en) * 2008-06-27 2008-10-29 中兴通讯股份有限公司 Fault information processing method and device
CN102063344A (en) * 2009-11-18 2011-05-18 中兴通讯股份有限公司 Method and system for system fault information dump
CN102222194A (en) * 2011-07-14 2011-10-19 哈尔滨工业大学 Module and method for LINUX host computing environment safety protection
CN103593189A (en) * 2013-11-14 2014-02-19 昆明理工大学 Method for implementing user mode drive program in embedded Linux
CN105354103A (en) * 2014-12-19 2016-02-24 汉柏科技有限公司 Method for managing watchdog in user mode
CN105630620A (en) * 2015-12-23 2016-06-01 浪潮集团有限公司 Machine fault automated processing method
CN106254139A (en) * 2016-08-30 2016-12-21 四川长虹网络科技有限责任公司 A kind of fault collection processes exchange method

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108228374A (en) * 2017-12-28 2018-06-29 华为技术有限公司 A kind of fault handling method of equipment, apparatus and system
WO2019129022A1 (en) * 2017-12-28 2019-07-04 华为技术有限公司 Error processing method, apparatus and system for device
US11144416B2 (en) 2017-12-28 2021-10-12 Huawei Technologies Co., Ltd. Device fault processing method, apparatus, and system
CN109815043A (en) * 2019-01-25 2019-05-28 华为技术有限公司 Fault handling method, relevant device and computer storage medium
CN109815043B (en) * 2019-01-25 2022-04-05 华为云计算技术有限公司 Fault processing method, related equipment and computer storage medium
US11994940B2 (en) 2019-01-25 2024-05-28 Huawei Cloud Computing Technologies Co., Ltd. Fault processing method, related device, and computer storage medium
CN112732477A (en) * 2021-04-01 2021-04-30 四川华鲲振宇智能科技有限责任公司 Method for fault isolation by out-of-band self-checking

Similar Documents

Publication Publication Date Title
CN107171293B (en) The system and method for relay protection O&M information multidimensional publication is realized in smart grid
CN101227329B (en) System, apparatus and method for managing network device
CN104796273B (en) A kind of method and apparatus of network fault root diagnosis
CN105529831B (en) A kind of secondary equipment of intelligent converting station failure Computer Aided Analysis System
CN103688489A (en) Method for strategy processing and network equipment
CN102195813A (en) Method and device for intelligently creating operation and maintenance worksheet
CN106844078A (en) A kind for the treatment of method and apparatus of PCIE failures
CN103138988B (en) Positioning treatment method and positioning treatment device of network faults
CN107332722A (en) The method for removing and system of a kind of fault message
CN106972626A (en) The running status inspection method of power equipment, apparatus and system
CN101296135A (en) Fault information processing method and device
WO2011106971A1 (en) Method and system for diagnosing network management system faults
CN104574219A (en) System and method for monitoring and early warning of operation conditions of power grid service information system
CN105630620A (en) Machine fault automated processing method
CN112596975A (en) Method, system, equipment and storage medium for monitoring network equipment
CN107943670A (en) A kind of ups power equipment monitoring system
JP2013130901A (en) Monitoring server and network device recovery system using the same
KR100846835B1 (en) Method and apparatus for Security Event Correlation Analysis based on Context Language
CN101841838B (en) Method and device for processing logical link alarm
CN103995759B (en) High-availability computer system failure handling method and device based on core internal-external synergy
CN109818808A (en) Method for diagnosing faults, device and electronic equipment
CN107528705A (en) Fault handling method and device
CN105045100A (en) Intelligent operation monitoring platform for management by use of mass data
CN116126772A (en) UART serial port management system and method applied to ARM server
CN111681006B (en) Payment security guarantee method of hospital informatization system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170613