CN104539449A - Handling method and related device for fault information - Google Patents


Info

Publication number
CN104539449A
Authority
CN
China
Prior art keywords
information
managed object
data center
moment point
state information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410784311.1A
Other languages
Chinese (zh)
Other versions
CN104539449B (en
Inventor
和江涛
王波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Priority to CN201910059252.4A (patent CN109921920A)
Priority to CN201410784311.1A (patent CN104539449B)
Publication of CN104539449A
Priority to PCT/CN2015/096567 (patent WO2016095716A1)
Application granted
Publication of CN104539449B
Legal status: Active
Anticipated expiration

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Debugging And Monitoring (AREA)
  • Computer And Data Communications (AREA)

Abstract

The embodiment of the invention discloses a method for handling fault information, used to optimize fault location in a data center. The method comprises the following steps: state management information of the data center is obtained at multiple time points; according to the state management information, state information of N managed objects of the data center is determined, where the state information represents the safety states of the managed objects; and the multiple time points and the state information of the N managed objects corresponding to each time point are recorded. The embodiment of the invention further provides a related device for handling fault information.

Description

A fault information processing method and related apparatus
Technical field
The present invention relates to the information field, and in particular to a fault information processing method and a related apparatus.
Background
A data center is a complete set of complex facilities. It includes not only computer systems and the equipment that supports them, but also data communication connections, environmental control equipment, monitoring equipment, and various security devices. As data center technology matures, more and more enterprises are building their own data centers and migrating their services onto data center platforms.
A real data center has a complex IT system environment. When the data center fails, fault location has to be carried out manually from the data center's massive volume of state management information. This state management information represents the running state of the data center and includes the data center's system configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information.
However, when a data center fails, the first task is to restore service. After service is restored, the state management information of the data center has changed compared with the moment of the fault, so a considerable amount of time must be spent manually searching historical state management information before the location of the fault can be analyzed. Even then, much of the state management information from the moment of the fault can no longer be queried, so the fault cannot be located accurately. The fault information processing methods of the prior art are therefore time-consuming, complicated to operate, and not very reliable.
Summary of the invention
Embodiments of the present invention provide a fault information processing method for optimizing fault location.
A first aspect of the embodiments of the present invention provides a fault information processing method applicable to a data center that includes managed objects. The method includes:
obtaining state management information of the data center at multiple time points, where the state management information describes the running state of the data center;
determining, according to the state management information, state information of N managed objects of the data center, where the state information represents the operating state of the managed objects; and
recording the multiple time points and the state information of the N managed objects corresponding to each time point.
With reference to the first aspect, in a first implementation of the first aspect, before the recording of the multiple time points and the state information of the N managed objects corresponding to each time point, the method further includes:
determining the association relationships between the N managed objects;
and the recording of the multiple time points and the state information of the N managed objects corresponding to each time point includes:
recording the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships between the N managed objects corresponding to each time point.
With reference to the first implementation of the first aspect, in a second implementation of the first aspect, the state management information of the data center includes:
system configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information.
With reference to the first or second implementation of the first aspect, in a third implementation of the first aspect, the determining of the state information of the N managed objects of the data center according to the state management information includes:
dividing the state management information into the state information of the N managed objects according to attributes of the N managed objects of the data center, where the attributes of a managed object include: the device name of the managed object and/or the IP address of the managed object and/or the device code of the managed object and/or the user name of the managed object.
With reference to the first or second implementation of the first aspect, in a fourth implementation of the first aspect, the method further includes:
receiving a fault lookup instruction sent by a client, where the fault lookup instruction includes the fault occurrence time;
looking up, among the recorded time points, the state information of the N managed objects corresponding to each time point, and the association relationships between the N managed objects corresponding to each time point, the state information of the N managed objects and the association relationships between the N managed objects that correspond to the fault occurrence time; and
feeding back to the client the state information of the N managed objects and the association relationships between the N managed objects that correspond to the fault occurrence time.
A second aspect of the embodiments of the present invention provides a fault information processing apparatus applicable to a data center that includes managed objects. The apparatus includes:
an information acquisition module, configured to obtain state management information of the data center at multiple time points, where the state management information describes the running state of the data center;
a safety determination module, configured to determine, according to the state management information, state information of N managed objects of the data center, where the state information represents the operating state of the managed objects; and
an information recording module, configured to record the multiple time points and the state information of the N managed objects corresponding to each time point.
With reference to the second aspect, a first implementation of the second aspect further includes:
an association determination module, configured to determine the association relationships between the N managed objects before the information recording module records the multiple time points and the state information of the N managed objects corresponding to each time point;
where the information recording module is specifically configured to:
record the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships between the N managed objects corresponding to each time point.
With reference to the first implementation of the second aspect, in a second implementation of the second aspect, the state management information of the data center includes:
system configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information.
With reference to the first or second implementation of the second aspect, in a third implementation of the second aspect, the safety determination module is specifically configured to:
divide the state management information into the state information of the N managed objects according to attributes of the N managed objects of the data center, where the attributes of a managed object include: the device name of the managed object and/or the IP address of the managed object and/or the device code of the managed object and/or the user name of the managed object.
With reference to the first or second implementation of the second aspect, a fourth implementation of the second aspect further includes:
an instruction receiving module, configured to receive a fault lookup instruction sent by a client, where the fault lookup instruction includes the fault occurrence time;
a fault lookup module, configured to look up, among the recorded time points and the state information of the N managed objects corresponding to each time point, the state information of the N managed objects corresponding to the fault occurrence time; and
a fault feedback module, configured to feed back to the client the state information of the N managed objects corresponding to the fault occurrence time.
In the method provided by the embodiments of the present invention, state management information of the data center is obtained at multiple time points; state information of N managed objects of the data center is determined according to the state management information, where the state information represents the safety state of the managed objects; and the multiple time points and the state information of the N managed objects corresponding to each time point are recorded. The method classifies the data center's state management information by managed object and saves it. When locating a fault, the user can go directly to the fault occurrence time in the saved information and locate the fault accurately from the safety state of each managed object at that time, without manually searching through massive state management information or manually analyzing it. The method therefore shortens fault location, simplifies its operation, and improves its reliability.
Brief description of the drawings
Fig. 1 is a flow chart of an embodiment of the fault information processing method in the embodiments of the present invention;
Fig. 2 is a flow chart of another embodiment of the fault information processing method in the embodiments of the present invention;
Fig. 3 is a diagram of an embodiment of the fault information processing apparatus in the embodiments of the present invention;
Fig. 4 is a diagram of another embodiment of the fault information processing apparatus in the embodiments of the present invention;
Fig. 5 is a diagram of another embodiment of the fault information processing apparatus in the embodiments of the present invention;
Fig. 6 is a diagram of another embodiment of the fault information processing apparatus in the embodiments of the present invention.
Detailed description of the embodiments
Embodiments of the present invention provide a fault information processing method that shortens fault location, simplifies its operation, and improves its reliability. Embodiments of the present invention also provide a related fault information processing apparatus. Both are described below.
The basic flow of the fault information processing method provided by the embodiments of the present invention is shown in Fig. 1 and mainly includes:
101. Obtain state management information of the data center at multiple time points.
The fault information processing apparatus obtains state management information of the data center at multiple time points. The state management information describes the running state of the data center.
The multiple time points may be set manually or may be a default setting of the fault information processing apparatus, for example one time point every 15 min by default. The time points may also be determined in other ways, which is not limited here.
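The default schedule described above can be sketched as follows. This is only an illustration: the function name `time_points`, the inclusive end bound, and the calendar date are assumptions, and only the 15 min default interval comes from the text.

```python
from datetime import datetime, timedelta

def time_points(start, end, interval=timedelta(minutes=15)):
    """Return every collection time point from start up to and including end."""
    if interval <= timedelta(0):
        raise ValueError("interval must be positive")
    points = []
    current = start
    while current <= end:
        points.append(current)
        current += interval
    return points

# Hypothetical date; the 15-minute spacing is the default mentioned in the text.
points = time_points(datetime(2014, 12, 16, 10, 0), datetime(2014, 12, 16, 10, 30))
```

A manually configured schedule would simply pass a different `interval` or supply its own list of time points.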
There are many ways for the fault information processing apparatus to obtain the state management information of the data center. They are described in detail in a later embodiment and are not limited here.
102. Determine, according to the state management information, the state information of N managed objects of the data center.
The data center includes at least one managed object and manages these managed objects. A managed object may be a physical object such as a physical device, or a software object such as an operating system, a database, or middleware, which is not limited in this embodiment.
The fault information processing apparatus determines the state information of the N managed objects of the data center according to the state management information. The state information represents the operating state of the managed objects.
103. Record the multiple time points and the state information of the N managed objects corresponding to each time point.
The fault information processing apparatus records the multiple time points and the state information of the N managed objects corresponding to each time point. When locating a fault, the user can then look up, in the saved time points and their corresponding state information, the safety state of each managed object at the fault occurrence time, and so determine accurately which managed object has failed.
This embodiment provides a fault information processing method in which the fault information processing apparatus obtains state management information of the data center at multiple time points; determines, according to the state management information, the state information of N managed objects of the data center, where the state information represents the safety state of the managed objects; and records the multiple time points and the state information of the N managed objects corresponding to each time point. The method classifies the data center's state management information by managed object and saves it. When locating a fault, the user can directly look up the information saved around the fault occurrence time and locate the fault accurately from the safety state of each managed object around that time, without manually searching through massive state management information or manually analyzing it. The method of this embodiment therefore shortens fault location, simplifies its operation, and improves its reliability.
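Steps 101 to 103 can be sketched as three small functions. This is a minimal illustration under stated assumptions: the record layout, the object names `server-1` and `db-1`, the stubbed acquisition function, and the in-memory dictionary used as the store are all hypothetical and are not specified by the source.

```python
from datetime import datetime

def acquire_state_management_info():
    """Step 101 (stub): return raw records describing the whole data center."""
    return [
        {"object": "server-1", "kind": "alarm", "text": "fan speed high"},
        {"object": "db-1", "kind": "log", "text": "slow query"},
        {"object": "server-1", "kind": "log", "text": "service restarted"},
    ]

def determine_state_info(records):
    """Step 102: group raw records into per-managed-object state information."""
    state = {}
    for record in records:
        state.setdefault(record["object"], []).append((record["kind"], record["text"]))
    return state

def record_snapshot(store, time_point, state):
    """Step 103: record the time point together with the state of each object."""
    store[time_point] = state

store = {}
t = datetime(2014, 12, 16, 10, 0)  # hypothetical time point
record_snapshot(store, t, determine_state_info(acquire_state_management_info()))
```

Keying the store by time point is what lets a later lookup jump straight to the snapshots around a fault instead of replaying raw logs.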
The embodiment shown in Fig. 1 gives the basic flow of the fault information processing method provided by the embodiments of the present invention. A more detailed embodiment, which provides more accurate fault location, is given below. Referring to Fig. 2, its basic flow includes:
201. Obtain state management information of the data center at multiple time points.
The fault information processing apparatus obtains state management information of the data center at multiple time points. The state management information describes the running state of the data center.
The multiple time points may be set manually or may be a default setting of the fault information processing apparatus, for example one time point every 15 min by default. The time points may also be determined in other ways, which is not limited here.
There are many ways for the fault information processing apparatus to obtain the state management information of the data center. For example, the data center may include one or more of the following systems: a configuration management database (CMDB), a network management system, a log system, a complaint and fault-report system, a configuration change system, and a work order system. The fault information processing apparatus may actively obtain the state management information of the data center from these systems, or passively receive the state management information that these systems send. The apparatus may also obtain the state management information in other ways, which is not limited here.
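The two acquisition paths, active obtaining and passive receiving, can be sketched with one collector class. The class `StateCollector`, its method names, and the sample messages are assumptions made for illustration; the source does not define any interface to the data center's systems.

```python
class StateCollector:
    """Collects state management information by polling or by accepting pushes."""

    def __init__(self):
        self.sources = {}  # system name -> zero-argument callable returning records
        self.pending = []  # records pushed by systems since the last collection

    def register_source(self, name, fetch):
        """Register a system that the apparatus polls actively."""
        self.sources[name] = fetch

    def receive(self, name, record):
        """Passive path: a system pushes one record to the apparatus."""
        self.pending.append((name, record))

    def collect(self):
        """Active path: poll every registered system, then drain pushed records."""
        collected = [(name, r) for name, fetch in self.sources.items() for r in fetch()]
        collected.extend(self.pending)
        self.pending = []
        return collected

collector = StateCollector()
collector.register_source("log_system", lambda: ["disk 80% full"])
collector.receive("network_management", "link down on port 3")
batch = collector.collect()
```

Calling `collect()` once per time point yields one batch per snapshot, regardless of which path each record arrived by.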
Optionally, corresponding to the systems of the data center, the state management information of the data center may include system configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information. It may also include other information, which is not limited here.
202. Determine, according to the state management information, the state information of N managed objects of the data center.
The data center includes at least one managed object and manages these managed objects. A managed object may be a physical object such as a physical device, or a software object such as an operating system, which is not limited in this embodiment.
The fault information processing apparatus determines the state information of the N managed objects of the data center according to the state management information. The state information represents the operating state of the managed objects.
Optionally, the fault information processing apparatus may divide the state management information obtained in step 201 into the state information of the N managed objects according to attributes of the N managed objects of the data center. The attributes of a managed object may include one or more of its device name, IP address, device code, and user name, or may be other attributes. For example, according to the IP addresses of the managed objects, the apparatus may divide the alarm information and/or performance monitoring information and/or log information of the data center into the alarm information and/or performance monitoring information and/or log information of each managed object. Alternatively, according to the asset codes of the managed objects, it may divide the configuration change information and/or work order information of the data center into the configuration change information and/or work order information of each managed object. Alternatively, according to the device names of the managed objects, it may divide the system configuration information and/or complaint and fault-report information of the data center into the configuration information and/or complaint and fault-report information of each managed object. The state management information obtained in step 201 may also be divided into the state information of the N managed objects by other methods, which is not limited here.
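The attribute-based division can be sketched as a lookup from (attribute, value) to managed object. The mapping of information kinds to attributes follows the three examples in the text; the field names, the sample addresses and codes, and the handling of unmatched records are assumptions.

```python
# Which attribute identifies the managed object for each kind of information.
# The pairing follows the examples in the text; the key names are hypothetical.
KEY_ATTRIBUTE = {
    "alarm": "ip", "performance": "ip", "log": "ip",
    "config_change": "asset_code", "work_order": "asset_code",
    "system_config": "device_name", "complaint": "device_name",
}

def partition(records, managed_objects):
    """Assign each record to the managed object whose attribute value matches it."""
    index = {}
    for obj_id, attrs in managed_objects.items():
        for attr, value in attrs.items():
            index[(attr, value)] = obj_id
    state = {obj_id: [] for obj_id in managed_objects}
    unmatched = []  # records that belong to no known managed object
    for record in records:
        attr = KEY_ATTRIBUTE.get(record["kind"])
        obj_id = index.get((attr, record.get(attr)))
        if obj_id is None:
            unmatched.append(record)
        else:
            state[obj_id].append(record)
    return state, unmatched

managed_objects = {
    "A": {"ip": "10.0.0.1", "asset_code": "AST-001", "device_name": "switch-a"},
    "B": {"ip": "10.0.0.2", "asset_code": "AST-002", "device_name": "storage-b"},
}
records = [
    {"kind": "alarm", "ip": "10.0.0.1", "text": "power down"},
    {"kind": "work_order", "asset_code": "AST-002", "text": "replace disk"},
    {"kind": "log", "ip": "10.0.0.9", "text": "unknown host"},
]
state, unmatched = partition(records, managed_objects)
```

Returning unmatched records separately, rather than discarding them, is a design choice of this sketch so that information about unknown devices is not silently lost.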
Optionally, after dividing the state management information obtained in step 201 into the state information of the N managed objects according to the attributes of the N managed objects of the data center, the fault information processing apparatus may process the state information further in order to reduce the amount of data to be recorded, for example by deleting invalid or duplicate data (such as info-level entries in logs). This is not limited here.
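One possible form of this reduction step is shown below. The entry layout with `level` and `message` fields and the choice to treat exact (level, message) repeats as duplicates are assumptions; the source only names invalid data, duplicate data, and info-level log entries as candidates for deletion.

```python
def reduce_state_info(entries, dropped_levels=("INFO",)):
    """Drop low-value log levels and exact duplicates, keeping first-seen order."""
    seen = set()
    kept = []
    for entry in entries:
        if entry["level"] in dropped_levels:
            continue  # e.g. info-level log entries
        key = (entry["level"], entry["message"])
        if key in seen:
            continue  # duplicate of an entry already kept
        seen.add(key)
        kept.append(entry)
    return kept

entries = [
    {"level": "INFO", "message": "heartbeat ok"},
    {"level": "ERROR", "message": "disk failure"},
    {"level": "ERROR", "message": "disk failure"},
    {"level": "WARN", "message": "high temperature"},
]
reduced = reduce_state_info(entries)
```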
203. Determine the association relationships between the N managed objects.
At the multiple time points described in step 201, the fault information processing apparatus determines the association relationships between the N managed objects. An association relationship links those of the N managed objects that exchange information with each other.
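An association relationship of this kind can be represented as a set of unordered pairs. In the sketch below, the list of observed (source, target) interactions is a hypothetical input; the source does not say how information interaction between managed objects is detected.

```python
def determine_associations(interactions, managed_objects):
    """Return the set of unordered pairs of managed objects that exchange information."""
    known = set(managed_objects)
    associations = set()
    for source, target in interactions:
        if source in known and target in known and source != target:
            associations.add(frozenset((source, target)))
    return associations

def neighbours(obj, associations):
    """Return the managed objects associated with obj, in sorted order."""
    return sorted(other for pair in associations if obj in pair
                  for other in pair if other != obj)

# Hypothetical observations; "X" is not a managed object and is ignored.
interactions = [("A", "B"), ("B", "A"), ("B", "C"), ("C", "X")]
associations = determine_associations(interactions, ["A", "B", "C"])
```

Using `frozenset` pairs makes the relationship symmetric, so an A to B interaction and a B to A interaction collapse into one association.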
204. Record the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships between the N managed objects corresponding to each time point.
The fault information processing apparatus records the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships between the N managed objects corresponding to each time point. When locating a fault, the user can then look up, in the saved time points and their corresponding state information, the safety state of each managed object at the fault occurrence time, and so determine accurately which managed object has failed. In particular, a data center fault is sometimes not a failure of a managed object itself but a failure of the information interaction channel between two or more managed objects. When locating a fault, the user can therefore also analyze the association relationships between the N managed objects at the moment of the fault to judge whether the failure lies in a managed object itself or in an information interaction channel between managed objects.
This embodiment provides a fault information processing method in which the fault information processing apparatus obtains state management information of the data center at multiple time points; determines, according to the state management information, the state information of N managed objects of the data center, where the state information represents the safety state of the managed objects; determines the association relationships between the N managed objects; and records the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships between the N managed objects. The method classifies the data center's state management information by managed object and saves it. When locating a fault, the user can directly look up the information saved around the fault occurrence time and locate the fault accurately from the safety state of each managed object around that time, without manually searching through massive state management information or manually analyzing it. The method of this embodiment therefore shortens fault location, simplifies its operation, and improves its reliability. In addition, this embodiment records the association relationships between the N managed objects at the multiple time points, which gives the user a further reference and allows more accurate fault location.
When locating a fault, the user may use a client to look up, in the fault information processing apparatus, the information corresponding to the moment of the fault. Optionally, therefore, as another embodiment of the present invention, after step 204 the fault information processing apparatus may also receive a fault lookup instruction sent by the client, where the instruction includes the fault occurrence time. From the recorded time points, the state information of the N managed objects corresponding to each time point, and the association relationships between the N managed objects corresponding to each time point, the apparatus looks up the state information of the N managed objects and the association relationships between the N managed objects that correspond to the fault occurrence time, and feeds them back to the client, so that the user obtains the lookup result through the client. The state information of the N managed objects corresponding to the fault occurrence time may be the state information saved by the apparatus within a preset time period around the fault occurrence time, for example from 30 minutes before the fault occurrence time to 20 minutes after it.
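The preset-window lookup can be sketched with a binary search over the sorted list of recorded time points. The function name `lookup` and the calendar date are assumptions; the window of 30 minutes before and 20 minutes after is the example given in the text.

```python
from bisect import bisect_left, bisect_right
from datetime import datetime, timedelta

def lookup(time_points, fault_time,
           before=timedelta(minutes=30), after=timedelta(minutes=20)):
    """Return recorded time points within [fault_time - before, fault_time + after].

    time_points must be sorted in ascending order.
    """
    lo = bisect_left(time_points, fault_time - before)
    hi = bisect_right(time_points, fault_time + after)
    return time_points[lo:hi]

# Snapshots every 15 minutes from 09:30 to 11:00 on a hypothetical date.
recorded = [datetime(2014, 12, 16, 9, 30) + k * timedelta(minutes=15) for k in range(7)]
hits = lookup(recorded, datetime(2014, 12, 16, 10, 22))
```

With a fault at 10:22 the window runs from 09:52 to 10:42, so the snapshots at 10:00, 10:15 and 10:30 are selected.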
To make the above embodiments easier to understand, a specific application scenario is described below.
Every 15 min, the fault information processing apparatus obtains the alarm information of the data center from the data center's network management system, the log information from its log system, the configuration change information from its configuration change system, and the work order information from its work order system.
The data center includes three managed objects: network device A, storage device B, and computing device C. The fault information processing apparatus divides the obtained alarm information and log information of the data center by the IP addresses of devices A, B and C into the alarm information and log information of device A, of device B, and of device C. It divides the obtained configuration change information and work order information of the data center by the asset codes of devices A, B and C into the configuration change information and work order information of device A, of device B, and of device C.
The fault information processing apparatus determines the association relationships of devices A, B and C: device A exchanges information with another device, and device B exchanges information with device C.
The fault information processing apparatus records these time points together with, for each time point, the alarm information, log information, configuration change information and work order information of devices A, B and C and the association relationships of devices A, B and C.
The user uses a client to look up, in the fault information processing apparatus, the information corresponding to the moment of the fault. The apparatus receives the fault lookup instruction sent by the user's client, which gives the fault occurrence time as 10:22 am. From the recorded information the apparatus finds the alarm information, log information, configuration change information and work order information of devices A, B and C and the association relationships of devices A, B and C at 10:00 am, 10:15 am and 10:30 am, and feeds the lookup result back to the client. The lookup result shows that at 10:15 am the alarm information of device A reports that device A lost power. From this alarm information, the user identifies device A as the managed object that failed.
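The scenario can be replayed end to end on toy data. Everything concrete below is assumed for illustration: the calendar date, the record layout, the alarm text, and in particular the A to B pairing in the association set, since the text does not name the device that A exchanges information with.

```python
from datetime import datetime, timedelta

DAY = (2014, 12, 16)  # hypothetical date

def at(hour, minute):
    return datetime(*DAY, hour, minute)

# The B-C association is from the scenario; the A-B pairing is an assumption.
ASSOCIATIONS = {("A", "B"), ("B", "C")}

def snapshot(a_entries=()):
    return {"state": {"A": list(a_entries), "B": [], "C": []},
            "associations": ASSOCIATIONS}

records = {
    at(9, 45): snapshot(),
    at(10, 0): snapshot(),
    at(10, 15): snapshot(["alarm: power down"]),
    at(10, 30): snapshot(),
    at(10, 45): snapshot(),
}

def handle_fault_lookup(records, fault_time,
                        before=timedelta(minutes=30), after=timedelta(minutes=20)):
    """Return the snapshots recorded within the preset window around fault_time."""
    start, end = fault_time - before, fault_time + after
    return {t: snap for t, snap in sorted(records.items()) if start <= t <= end}

def objects_with_alarms(result):
    """List managed objects that have at least one alarm entry in the result."""
    return sorted({obj for snap in result.values()
                   for obj, entries in snap["state"].items()
                   if any(e.startswith("alarm") for e in entries)})

result = handle_fault_lookup(records, at(10, 22))
suspects = objects_with_alarms(result)
```

Here `result` holds the 10:00, 10:15 and 10:30 snapshots and `suspects` contains only device A, matching the conclusion the user reaches in the scenario.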
The embodiments above provide a fault information processing method. The following embodiments provide a fault information processing apparatus for implementing that method. Its basic structure is shown in Fig. 3 and includes:
an information acquisition module 301, configured to obtain state management information of the data center at multiple time points, where the state management information describes the running state of the data center;
a safety determination module 302, configured to determine, according to the state management information, the state information of N managed objects of the data center, where the state information represents the operating state of the managed objects; and
an information recording module 303, configured to record the multiple time points and the state information of the N managed objects corresponding to each time point.
This embodiment provides a fault information processing apparatus in which the information acquisition module 301 obtains state management information of the data center at multiple time points; the safety determination module 302 determines, according to the state management information, the state information of N managed objects of the data center, where the state information represents the safety state of the managed objects; and the information recording module 303 records the multiple time points and the state information of the N managed objects corresponding to each time point. The apparatus classifies the data center's state management information by managed object and saves it. When locating a fault, the user can directly look up the information saved around the fault occurrence time and locate the fault accurately from the safety state of each managed object around that time, without manually searching through massive state management information or manually analyzing it. The apparatus of this embodiment therefore shortens fault location, simplifies its operation, and improves its reliability.
The embodiment shown in Fig. 3 gives the basic structure of the fault information processing device provided by the embodiments of the present invention. A more refined embodiment, which supports more accurate fault location, is described below. Referring to Fig. 4, its basic structure comprises:
A data obtaining module 401, configured to obtain state management information of a data center at multiple time points, where the state management information describes the running state of the data center;
A safe determination module 402, configured to determine, according to the state management information, state information of N managed objects of the data center, where the state information represents the operating state of each managed object;
An association determination module 403, configured to determine the association relationships among the N managed objects before the information logging module records the multiple time points and the state information of the N managed objects corresponding to each time point;
An information logging module 404, configured to record the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships among the N managed objects corresponding to each time point.
Optionally, the state management information of the data center may include: system configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information.
Optionally, the safe determination module may be specifically configured to: divide, according to attributes of the N managed objects of the data center, the state management information into the state information of the N managed objects, where the attributes of a managed object include: the device name of the managed object and/or the IP address of the managed object and/or the device code of the managed object and/or the user name of the managed object.
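Dividing the state management information by attribute can be sketched as follows. This is a minimal sketch under assumptions: records are plain dictionaries, the field names in `ATTRIBUTE_FIELDS` are hypothetical stand-ins for the four attributes named above, and a record is assigned to the first managed object whose attribute value it matches.

```python
from typing import Dict, Iterable, List, Mapping, Tuple

# Hypothetical field names for the attributes named in the text.
ATTRIBUTE_FIELDS = ("device_name", "ip_address", "device_code", "user_name")


def build_index(objects: Mapping[str, Mapping[str, str]]) -> Dict[Tuple[str, str], str]:
    """Map (attribute field, attribute value) to the owning managed object id."""
    index: Dict[Tuple[str, str], str] = {}
    for object_id, attrs in objects.items():
        for name in ATTRIBUTE_FIELDS:
            if name in attrs:
                index[(name, attrs[name])] = object_id
    return index


def partition(records: Iterable[Mapping[str, str]],
              objects: Mapping[str, Mapping[str, str]]):
    """Split records into per-object lists; return (per_object, unmatched)."""
    index = build_index(objects)
    per_object: Dict[str, List[Mapping[str, str]]] = {oid: [] for oid in objects}
    unmatched: List[Mapping[str, str]] = []
    for record in records:
        for name in ATTRIBUTE_FIELDS:
            owner = index.get((name, record.get(name)))
            if owner is not None:
                per_object[owner].append(record)
                break
        else:
            # No attribute of this record matched any managed object.
            unmatched.append(record)
    return per_object, unmatched
```

Keeping the unmatched records separate, rather than dropping them, is a design choice of this sketch; the source does not say how such records are handled.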
This embodiment provides a fault information processing device in which the data obtaining module 401 obtains state management information of the data center at multiple time points; the safe determination module 402 determines, according to the state management information, state information of the N managed objects of the data center, where the state information represents the safety state of each managed object; the association determination module 403 determines the association relationships among the N managed objects; and the information logging module 404 records the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships among the N managed objects. The device provided in this embodiment classifies the state management information of the data center by managed object and stores it. When locating a fault, a user can therefore directly look up the information stored for the time points before and after the fault occurred and locate the fault accurately from the safety state of each managed object at those time points, without manually searching through massive amounts of state management information and without manually analyzing it. The device provided in this embodiment can therefore shorten the time needed for fault location, simplify the fault location operation, and improve the reliability of fault location. In addition, in this embodiment the information logging module 404 also records the association relationships among the N managed objects at the multiple time points, which gives the user a further reference for fault location and allows the user to locate faults more accurately.
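Recording association relationships alongside the per-object state can be sketched as below. The class name `AssociatedLog` and the choice to model each relationship as an unordered pair of managed object identifiers are assumptions made for illustration; the source does not specify how relationships are represented.

```python
from datetime import datetime
from typing import Dict, FrozenSet, Iterable, List, Set, Tuple


class AssociatedLog:
    """Stores, per time point, object state plus pairwise associations."""

    def __init__(self) -> None:
        self._entries: Dict[datetime,
                            Tuple[Dict[str, List[str]], Set[FrozenSet[str]]]] = {}

    def record(self, time_point: datetime,
               state: Dict[str, List[str]],
               related_pairs: Iterable[Tuple[str, str]]) -> None:
        # Each pair is stored as a frozenset so (x, y) and (y, x) are the same.
        relations = {frozenset(pair) for pair in related_pairs}
        self._entries[time_point] = (dict(state), relations)

    def state_at(self, time_point: datetime) -> Dict[str, List[str]]:
        return self._entries[time_point][0]

    def neighbours(self, time_point: datetime, object_id: str) -> Set[str]:
        """Managed objects associated with object_id at the given time point."""
        _, relations = self._entries[time_point]
        return {other
                for pair in relations if object_id in pair
                for other in pair if other != object_id}
```

When a managed object shows a problem at some time point, `neighbours` gives the objects worth checking next, which is one way the recorded relationships could serve as a further reference during fault location.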
The embodiment shown in Fig. 4 gives the basic structure of a more refined fault information processing device provided by the embodiments of the present invention. A further refined fault information processing device, which can exchange information with a client, is described below. Referring to Fig. 5, its basic structure comprises:
A data obtaining module 501, configured to obtain state management information of a data center at multiple time points, where the state management information describes the running state of the data center;
A safe determination module 502, configured to determine, according to the state management information, state information of N managed objects of the data center, where the state information represents the operating state of each managed object;
An association determination module 503, configured to determine the association relationships among the N managed objects before the information logging module records the multiple time points and the state information of the N managed objects corresponding to each time point;
An information logging module 504, configured to record the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships among the N managed objects corresponding to each time point;
A command reception module 505, configured to receive a troubleshooting instruction sent by a client, where the troubleshooting instruction includes the time at which the fault occurred;
A troubleshooting module 506, configured to look up, from the recorded multiple time points and the state information of the N managed objects corresponding to each time point, the state information of the N managed objects corresponding to the time at which the fault occurred;
A fault feedback module 507, configured to feed back to the client the state information of the N managed objects corresponding to the time at which the fault occurred.
This embodiment provides a fault information processing device in which the data obtaining module 501 obtains state management information of the data center at multiple time points; the safe determination module 502 determines, according to the state management information, state information of the N managed objects of the data center, where the state information represents the safety state of each managed object; the association determination module 503 determines the association relationships among the N managed objects; and the information logging module 504 records the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships among the N managed objects. The device provided in this embodiment classifies the state management information of the data center by managed object and stores it. When locating a fault, a user can therefore search the stored information directly for the time points before and after the fault occurred and locate the fault accurately from the safety state of each managed object at those time points, without manually searching through massive amounts of state management information and without manually analyzing it. The device provided in this embodiment can therefore shorten the time needed for fault location, simplify the fault location operation, and improve the reliability of fault location. In addition, in this embodiment the information logging module 504 also records the association relationships among the N managed objects at the multiple time points, which gives the user a further reference for fault location and allows the user to locate faults more accurately. Meanwhile, the command reception module 505 can receive the troubleshooting instruction sent by the client; the troubleshooting module 506 looks up, from the recorded multiple time points and the state information of the N managed objects corresponding to each time point, the state information of the N managed objects corresponding to the time at which the fault occurred; and the fault feedback module 507 feeds that state information back to the client, so that the user obtains the lookup result of the fault information processing device through the client.
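The receive, look up, feed back sequence can be sketched as a single function. This is a sketch under assumptions: the instruction is modelled as a dictionary with a hypothetical `fault_time` key, and the record "corresponding to" the fault time is taken to be the latest recorded time point at or before it, which the source does not specify.

```python
import bisect
from datetime import datetime
from typing import Any, Dict, List


def handle_lookup(instruction: Dict[str, datetime],
                  recorded: Dict[datetime, Dict[str, List[str]]]) -> Dict[str, Any]:
    """Return the state recorded at the latest time point not after the fault time."""
    fault_time = instruction["fault_time"]
    time_points = sorted(recorded)
    # bisect_right gives the index of the first time point after fault_time.
    pos = bisect.bisect_right(time_points, fault_time)
    if pos == 0:
        # Nothing was recorded at or before the fault time.
        return {"fault_time": fault_time, "time_point": None, "state": {}}
    chosen = time_points[pos - 1]
    return {"fault_time": fault_time, "time_point": chosen, "state": recorded[chosen]}
```

The returned dictionary stands in for the feedback sent to the client; a real device would serialize it over whatever protocol the client uses.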
To make the above embodiments easier to understand, a specific application scenario is described below.
Every 15 minutes, the data obtaining module 501 obtains alarm information of the data center from the network management system of the data center, log information of the data center from the log system of the data center, configuration change information of the data center from the configuration change system of the data center, and work order information of the data center from the work order system of the data center.
The data center comprises three managed objects: network device A, storage device B, and computing device C. The safe determination module 502 divides the obtained alarm information and log information of the data center, according to the IP addresses of devices A, B, and C, into the alarm information and log information of device A, of device B, and of device C. It divides the obtained configuration change information and work order information of the data center, according to the asset codes of devices A, B, and C, into the configuration change information and work order information of device A, of device B, and of device C.
The association determination module 503 determines the association relationships among devices A, B, and C: device A exchanges information with another of the devices, and device B exchanges information with device C.
The information logging module 504 records these time points together with, for each time point, the alarm information, log information, configuration change information, and work order information of devices A, B, and C and the association relationships among devices A, B, and C.
The user uses a client to look up, in the fault information processing device, the information corresponding to the fault time. The command reception module 505 receives the troubleshooting instruction sent by the user's client; the instruction indicates that the fault occurred at 10:22 am. The troubleshooting module 506 finds, in the recorded information, the alarm information, log information, configuration change information, and work order information of devices A, B, and C and the association relationships among devices A, B, and C at 10:00 am, 10:15 am, and 10:30 am. The fault feedback module 507 feeds the lookup result back to the client. The lookup result shows that at 10:15 am the alarm information of device A indicates that device A lost power. Based on this alarm information, the user identifies device A as the managed object that failed.
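The scenario can be replayed in a few lines of Python. The source does not state how the three time points are chosen; the sketch below assumes the lookup returns the two recorded time points at or before the fault time and the one after it, which reproduces 10:00, 10:15, and 10:30 for a fault at 10:22. The function names and the `before` and `after` parameters are hypothetical.

```python
import bisect
from datetime import datetime
from typing import Dict, Iterable, List, Tuple


def surrounding_points(time_points: Iterable[datetime], fault_time: datetime,
                       before: int = 2, after: int = 1) -> List[datetime]:
    """Recorded time points around fault_time (assumed selection rule)."""
    ordered = sorted(time_points)
    pos = bisect.bisect_right(ordered, fault_time)
    return ordered[max(0, pos - before): pos + after]


def alarms_in(records: Dict[datetime, Dict[str, List[str]]],
              points: Iterable[datetime]) -> List[Tuple[datetime, str, str]]:
    """List (time point, device, alarm) for every alarm at the given points."""
    found = []
    for point in points:
        for device, alarms in sorted(records.get(point, {}).items()):
            for alarm in alarms:
                found.append((point, device, alarm))
    return found
```

With time points recorded every 15 minutes and a single "power down" alarm stored for device A at 10:15, `alarms_in(records, surrounding_points(records, fault_time))` would return just that alarm, pointing the user at device A.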
The fault information processing device in the embodiments of the present invention has been described above from the perspective of modular functional entities; it is described below from the perspective of hardware processing. Referring to Fig. 6, another embodiment of the fault information processing device 600 in the embodiments of the present invention comprises:
An input device 601, an output device 602, a processor 603, and a memory 604 (the fault information processing device 600 may contain one or more processors 603; one processor 603 is taken as an example in Fig. 6). In some embodiments of the present invention, the input device 601, the output device 602, the processor 603, and the memory 604 are connected by a bus or in another manner; connection by a bus is taken as an example in Fig. 6.
By invoking the operation instructions stored in the memory 604, the processor 603 is configured to perform the following steps:
Obtaining, at multiple time points, state management information of a data center, where the state management information describes the running state of the data center;
Determining, according to the state management information, state information of N managed objects of the data center, where the state information represents the operating state of each managed object;
Recording the multiple time points and the state information of the N managed objects corresponding to each time point.
In some embodiments of the present invention, the processor 603 further performs the following steps:
Before recording the multiple time points and the state information of the N managed objects corresponding to each time point, determining the association relationships among the N managed objects;
Recording the multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships among the N managed objects corresponding to each time point.
In some embodiments of the present invention, the state management information of the data center includes:
System configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information.
In some embodiments of the present invention, the processor 603 further performs the following step:
Dividing, according to attributes of the N managed objects of the data center, the state management information into the state information of the N managed objects, where the attributes of a managed object include: the device name of the managed object and/or the IP address of the managed object and/or the device code of the managed object and/or the user name of the managed object.
In some embodiments of the present invention, the processor 603 further performs the following steps:
Receiving a troubleshooting instruction sent by a client, where the troubleshooting instruction includes the time at which the fault occurred;
Looking up, from the recorded multiple time points, the state information of the N managed objects corresponding to each time point, and the association relationships among the N managed objects corresponding to each time point, the state information of the N managed objects and the association relationships among the N managed objects corresponding to the time at which the fault occurred;
Feeding back to the client the state information of the N managed objects and the association relationships among the N managed objects corresponding to the time at which the fault occurred.
A person skilled in the art can clearly understand that, for convenience and brevity of description, for the specific working processes of the system, device, and units described above, reference may be made to the corresponding processes in the foregoing method embodiments; details are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is merely a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be implemented through some interfaces, and the indirect couplings or communication connections between devices or units may be electrical, mechanical, or in another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network elements. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and is sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM, Read-Only Memory), a random access memory (RAM, Random Access Memory), a magnetic disk, or an optical disc.
The above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some of their technical features may be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A fault information processing method, applicable to a data center, characterized in that the data center comprises managed objects and the method comprises:
obtaining, at multiple time points, state management information of the data center, wherein the state management information describes the running state of the data center;
determining, according to the state management information, state information of N managed objects of the data center, wherein the state information represents the operating state of the managed objects; and
recording the multiple time points and the state information of the N managed objects corresponding to each of the time points.
2. The fault information processing method according to claim 1, characterized in that, before the recording of the multiple time points and the state information of the N managed objects corresponding to each of the time points, the method further comprises:
determining the association relationships among the N managed objects;
and the recording of the multiple time points and the state information of the N managed objects corresponding to each of the time points comprises:
recording the multiple time points, the state information of the N managed objects corresponding to each of the time points, and the association relationships among the N managed objects corresponding to each of the time points.
3. The fault information processing method according to claim 2, characterized in that the state management information of the data center comprises:
system configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information.
4. The fault information processing method according to claim 2 or 3, characterized in that the determining, according to the state management information, of the state information of the N managed objects of the data center comprises:
dividing, according to attributes of the N managed objects of the data center, the state management information into the state information of the N managed objects, wherein the attributes of a managed object comprise: the device name of the managed object and/or the IP address of the managed object and/or the device code of the managed object and/or the user name of the managed object.
5. The fault information processing method according to claim 2 or 3, characterized in that the method further comprises:
receiving a troubleshooting instruction sent by a client, wherein the troubleshooting instruction comprises the time at which the fault occurred;
looking up, from the recorded multiple time points, the state information of the N managed objects corresponding to each of the time points, and the association relationships among the N managed objects corresponding to each of the time points, the state information of the N managed objects and the association relationships among the N managed objects corresponding to the time at which the fault occurred; and
feeding back to the client the state information of the N managed objects and the association relationships among the N managed objects corresponding to the time at which the fault occurred.
6. A fault information processing device, applicable to a data center, characterized in that the data center comprises managed objects and the device comprises:
a data obtaining module, configured to obtain, at multiple time points, state management information of the data center, wherein the state management information describes the running state of the data center;
a safe determination module, configured to determine, according to the state management information, state information of N managed objects of the data center, wherein the state information represents the operating state of the managed objects; and
an information logging module, configured to record the multiple time points and the state information of the N managed objects corresponding to each of the time points.
7. The fault information processing device according to claim 6, characterized in that the device further comprises:
an association determination module, configured to determine the association relationships among the N managed objects before the information logging module records the multiple time points and the state information of the N managed objects corresponding to each of the time points;
wherein the information logging module is specifically configured to:
record the multiple time points, the state information of the N managed objects corresponding to each of the time points, and the association relationships among the N managed objects corresponding to each of the time points.
8. The fault information processing device according to claim 7, characterized in that the state management information of the data center comprises:
system configuration information and/or alarm information and/or performance monitoring information and/or log information and/or complaint and fault-report information and/or configuration change information and/or work order information.
9. The fault information processing device according to claim 7 or 8, characterized in that the safe determination module is specifically configured to:
divide, according to attributes of the N managed objects of the data center, the state management information into the state information of the N managed objects, wherein the attributes of a managed object comprise: the device name of the managed object and/or the IP address of the managed object and/or the device code of the managed object and/or the user name of the managed object.
10. The fault information processing device according to claim 7 or 8, characterized in that the device further comprises:
a command reception module, configured to receive a troubleshooting instruction sent by a client, wherein the troubleshooting instruction comprises the time at which the fault occurred;
a troubleshooting module, configured to look up, from the recorded multiple time points and the state information of the N managed objects corresponding to each of the time points, the state information of the N managed objects corresponding to the time at which the fault occurred; and
a fault feedback module, configured to feed back to the client the state information of the N managed objects corresponding to the time at which the fault occurred.
CN201410784311.1A 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus Active CN104539449B (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201910059252.4A CN109921920A (en) 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus
CN201410784311.1A CN104539449B (en) 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus
PCT/CN2015/096567 WO2016095716A1 (en) 2014-12-16 2015-12-07 Fault information processing method and related device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410784311.1A CN104539449B (en) 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN201910059252.4A Division CN109921920A (en) 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus

Publications (2)

Publication Number Publication Date
CN104539449A true CN104539449A (en) 2015-04-22
CN104539449B CN104539449B (en) 2019-02-19

Family

ID=52854918

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201910059252.4A Pending CN109921920A (en) 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus
CN201410784311.1A Active CN104539449B (en) 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN201910059252.4A Pending CN109921920A (en) 2014-12-16 2014-12-16 A kind of failure information processing method and relevant apparatus

Country Status (2)

Country Link
CN (2) CN109921920A (en)
WO (1) WO2016095716A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016095716A1 (en) * 2014-12-16 2016-06-23 华为技术有限公司 Fault information processing method and related device
CN106909550A (en) * 2015-12-22 2017-06-30 中国移动通信集团吉林有限公司 A kind of data handling system and method

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111401577A (en) * 2020-02-14 2020-07-10 上海电气分布式能源科技有限公司 Device management method, device and storage medium
CN111782437B (en) * 2020-07-10 2023-08-11 中国工商银行股份有限公司 Fault positioning method, device, computing equipment and medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090150718A1 (en) * 2007-12-11 2009-06-11 Choon-Seo Park Large-scale cluster monitoring system, and method of automatically building/restoring the same
CN102546274A (en) * 2010-12-20 2012-07-04 中国移动通信集团广西有限公司 Alarm monitoring method and alarm monitoring equipment in communication service

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0272742A (en) * 1988-09-07 1990-03-13 Nec Corp Data error location detecting system
CN101304340B (en) * 2007-05-09 2011-09-21 华为技术有限公司 Method and apparatus for monitoring resource condition as well as communication network
CN102739415A (en) * 2011-03-31 2012-10-17 华为技术有限公司 Method and device for determining network failure data and recording network instantaneous state data
US9071535B2 (en) * 2013-01-03 2015-06-30 Microsoft Technology Licensing, Llc Comparing node states to detect anomalies
CN104184826A (en) * 2014-09-05 2014-12-03 浪潮(北京)电子信息产业有限公司 Multi-data-center storage environment managing method and system
CN109921920A (en) * 2014-12-16 2019-06-21 华为技术有限公司 A kind of failure information processing method and relevant apparatus

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090150718A1 (en) * 2007-12-11 2009-06-11 Choon-Seo Park Large-scale cluster monitoring system, and method of automatically building/restoring the same
CN102546274A (en) * 2010-12-20 2012-07-04 中国移动通信集团广西有限公司 Alarm monitoring method and alarm monitoring equipment in communication service

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
宛蕊华等: "充分利用N2000网管,做好电信网络综合监控", 《西安邮电学院学报》 *
张艳玲等: "数字微波监控系统中故障管理模块的设计", 《微计算机信息》 *

Also Published As

Publication number Publication date
CN109921920A (en) 2019-06-21
CN104539449B (en) 2019-02-19
WO2016095716A1 (en) 2016-06-23

Legal Events

Date Code Title Description
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant