CN106330501A - Fault correlation method and device - Google Patents


Info

Publication number
CN106330501A
Authority
CN
China
Prior art keywords
fault
resource
source
trouble
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510364079.0A
Other languages
Chinese (zh)
Inventor
施政法
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp
Priority to CN201510364079.0A
Priority to PCT/CN2016/073759 (WO2016206386A1)
Publication of CN106330501A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/40 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks using virtualisation of network functions or resources, e.g. SDN or NFV entities

Abstract

The invention provides a fault correlation method and device, and belongs to the field of communication. The method comprises the following steps: receiving source fault information reported by a fault source, wherein the source fault information comprises position information of the fault source; obtaining a generation time of the fault source; finding a corresponding resource topology tree according to the generation time; and obtaining a correlated fault of the fault source according to the position information and the resource topology tree. Compared with the prior art, the corresponding topology tree is found according to the generation time of the fault, so that the correct topological relation is located. Faults are therefore correlated correctly in fault management, the analysis errors caused by resource topology changes are resolved, and the efficiency of fault correlation for users in a virtualized environment is greatly improved. The fault correlation method and device provided by the invention also do not depend on specific products or interfaces of the virtualization layers and are therefore broadly applicable, which makes them well suited to virtualized environments at their current stage and strengthens the core competitiveness of products.

Description

Fault correlation method and device
Technical field
The present invention relates to the field of communications, and in particular to a fault correlation method and device.
Background
With the development of technologies such as IT virtualization and cloud computing, and under the impact of Internet service providers on telecom operators, the telecommunications field has put forward the requirement of network function virtualization. The goal is to use virtualization technology to replace the dedicated hardware used in the traditional telecommunications field with inexpensive general-purpose hardware, reducing investment and operating costs while providing more flexible and convenient service deployment.
According to the framework in the network function virtualization specification of ETSI (European Telecommunications Standards Institute), the whole telecommunications virtualization system is divided into the following three layers: the virtual application layer, the virtual resource layer and the physical resource layer.
When a virtual application fails, the cause may lie not only in the virtual application itself but also in a fault of the virtual resource layer or the physical resource layer. For example, if a physical port on a physical server is unavailable, the virtual ports on that physical port become unavailable, and the network links of the virtual applications using those virtual ports are broken. Therefore, to locate the cause of a virtual application fault quickly, the fault must be correlated with faults of the layers below it (virtual resource layer, physical resource layer).
In most scenarios, the three layers of a telecommunications virtualization system use products from different vendors. Because the relevant specifications are immature, the fault information of each vendor usually covers only its own layer and carries no information about the other layers. The correlation between faults therefore cannot be located directly from the original fault information of each layer, and an effective fault correlation and location mechanism is needed. Conventional fault correlation techniques analyze the topological relationship between faulty nodes and maintain only the current topological relationship, i.e. the current topology tree, because in a non-virtualized environment changes to the topology tree are stable and controllable. In a virtualized environment, however, the relationships between the resources of the layers change dynamically with deployment strategies and hardware states, and these changes are generally invisible to upper-layer applications. If the resource topology has changed between the time a fault occurs and the time it is reported and analyzed, the analysis result will be wrong.
Summary of the invention
The main technical problem to be solved by the present invention is to provide a fault correlation method and device that solve the prior-art problem of erroneous analysis results when a topology tree is used for fault correlation.
To solve the above problem, the present invention provides a fault correlation method, comprising:
receiving source fault information reported by a fault source, the source fault information including position information of the fault source;
obtaining the generation time of the fault source;
finding the corresponding resource topology tree according to the generation time;
obtaining the correlated fault of the fault source according to the position information and the resource topology tree.
In an embodiment of the present invention, the resource topology tree includes each resource node within a preset time period and the effective time corresponding to each resource node.
In an embodiment of the present invention, before the corresponding resource topology tree is found according to the generation time, the method further comprises: detecting changes of the resource nodes and updating the resource topology tree.
In an embodiment of the present invention, the changes of the resource nodes include adding a resource node and/or deleting a resource node, and updating the resource topology tree comprises:
when an added resource node is detected, adding the resource node to the resource topology tree and taking the time at which the addition is reported as the start time of the effective time of the resource node;
when a deleted resource node is detected, taking the time at which the deletion is reported as the end time of the effective time of the resource node.
In an embodiment of the present invention, obtaining the generation time of the fault source comprises: obtaining the reception time at which the fault information is received and the delay time of the report, and obtaining the generation time of the fault source from the reception time and the delay time.
In an embodiment of the present invention, obtaining the correlated fault of the fault source according to the position information and the resource topology tree comprises: finding the resource node of the fault source in the resource topology tree according to the position information, and identifying the correlated fault of the fault source from the fault information upstream of the resource node according to preset correlation rules and the fault type of the fault source.
In an embodiment of the present invention, after the resource node of the fault source in the resource topology tree is found according to the position information, the method further comprises: associating the fault with the resource object corresponding to the resource node according to the position information.
In an embodiment of the present invention, identifying the correlated fault of the fault source from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source comprises:
when the fault source occurs at the physical resource layer, performing no fault correlation identification;
when the fault source occurs at the virtual resource layer, searching, according to the resource node to which it belongs, for the physical resource layer resource nodes connected to that resource node, obtaining the fault information on these nodes, and filtering out the correlated faults of the physical resource layer according to the fault type and the preset correlation rules;
when the fault source occurs at the virtual application layer, searching, according to the virtual network function unit to which it belongs, for the virtual machines connected to that virtual network function unit, obtaining the virtual resource layer resource nodes connected to those virtual machines, obtaining the fault information on these nodes, and filtering out the correlated faults of the virtual resource layer according to the fault type and the preset correlation rules.
In an embodiment of the present invention, before the correlated fault of the fault source is identified from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source, the method further comprises: dynamically updating the preset rules.
To solve the above problem, the present invention further provides a fault correlation device, comprising a receiving module, an obtaining module, a topology module and a correlation module:
the receiving module is configured to receive source fault information reported by a fault source, the source fault information including position information of the fault source;
the obtaining module is configured to obtain the generation time of the fault source;
the topology module is configured to find the corresponding resource topology tree according to the generation time;
the correlation module is configured to obtain the correlated fault of the fault source according to the position information and the resource topology tree.
In an embodiment of the present invention, the device further comprises an update module configured to detect changes of the resource nodes and update the resource topology tree before the corresponding resource topology tree is found according to the generation time.
In an embodiment of the present invention, the changes of the resource nodes include adding a resource node and/or deleting a resource node, and the update module is further configured to:
when an added resource node is detected, add the resource node to the resource topology tree and take the time at which the addition is reported as the start time of the effective time of the resource node;
when a deleted resource node is detected, take the time at which the deletion is reported as the end time of the effective time of the resource node.
In an embodiment of the present invention, the obtaining module is further configured to obtain the reception time at which the fault information is received and the delay time of the report, and to obtain the generation time of the fault source from the reception time and the delay time.
In an embodiment of the present invention, the correlation module is further configured to: find the resource node of the fault source in the resource topology tree according to the position information, and identify the correlated fault of the fault source from the fault information upstream of the resource node according to preset correlation rules and the fault type of the fault source.
In an embodiment of the present invention, the correlation module is further configured to dynamically update the preset rules before the correlated fault of the fault source is identified from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source.
The beneficial effects of the present invention are as follows:
The fault correlation method and device provided by the present invention receive source fault information reported by a fault source, the source fault information including position information of the fault source; obtain the generation time of the fault source; find the corresponding resource topology tree according to the generation time; and obtain the correlated fault of the fault source according to the position information and the resource topology tree. Compared with the prior art, the corresponding topology tree is found according to the generation time of the fault, so the correct topological relationship is located. Faults are therefore correlated correctly in fault management, the analysis errors caused by resource topology changes are resolved, and the efficiency of fault correlation for users in a virtualized environment is greatly improved. The present invention also does not depend on the specific products or interface implementations of the virtualization layers and is therefore broadly applicable, which makes it well suited to virtualized environments at their current stage and strengthens the core competitiveness of products.
Brief description of the drawings
Fig. 1-1 is a schematic flowchart of the fault correlation method provided by embodiment one of the present invention;
Fig. 1-2 is a first schematic diagram of the resource topology tree in the fault correlation method provided by embodiment one of the present invention;
Fig. 1-3 is a second schematic diagram of the resource topology tree in the fault correlation method provided by embodiment one of the present invention;
Fig. 1-4 is a schematic diagram of resource node modeling in the fault correlation method provided by embodiment one of the present invention;
Fig. 2 is a schematic flowchart of the fault correlation method provided by embodiment two of the present invention;
Fig. 3 is a schematic flowchart of the fault correlation method provided by embodiment three of the present invention;
Fig. 4 is a schematic flowchart of the fault correlation method provided by embodiment four of the present invention;
Fig. 5 is a schematic flowchart of the fault correlation method provided by embodiment five of the present invention;
Fig. 6 is a schematic flowchart of the fault correlation method provided by embodiment six of the present invention;
Fig. 7 is a first schematic structural diagram of the fault correlation device provided by embodiment six of the present invention;
Fig. 8 is a second schematic structural diagram of the fault correlation device provided by embodiment six of the present invention.
Detailed description
To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are evidently only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments without creative work fall within the scope of protection of the present invention.
Embodiment one
The fault correlation method of this embodiment, as shown in Fig. 1-1, comprises the following steps:
Step S101: receive the source fault information reported by the fault source, the source fault information including position information of the fault source;
In this step, the fault source reports information about its own fault, i.e. the source fault information, and the management end receives it. The position information of the fault source indicates where the fault occurred: specifically, at which layer, on which resource node, and so on.
Step S102: obtain the generation time of the fault source;
In this step, the generation time is the time at which the fault was produced.
Step S103: find the corresponding resource topology tree according to the generation time;
In this step, note that in a virtualized environment the relationships between the resources of the layers change dynamically with deployment strategies and hardware states, i.e. the resource topology tree keeps changing. To find the correlated fault accurately, the correct resource topology tree must be found. The corresponding resource topology tree here is the one that was in effect at the time the fault source produced the fault.
Step S104: obtain the correlated fault of the fault source according to the position information and the resource topology tree.
In this step, once the resource topology tree is determined, the correlated fault can be found from the position information of the fault source. As an example, Fig. 1-2 shows the resource topology tree at the moment the virtual network function manager (VNFM), i.e. the application layer management node, receives the source fault information reported by the fault source. In this figure, virtual application 1 produced fault B at 00:01 but, because of a delay, reported it to the VNFM only at 00:04. At 00:02, virtual machine 1, on which virtual application 1 runs, migrated from host 1 to host 2, so at reporting time the upstream physical resource layer node of virtual machine 1 is host 2. Fault C was produced on host 2 and fault A on host 1. Fig. 1-3 shows the resource topology tree at the time the fault was produced: virtual machine 1 of virtual application 1 was then connected to host 1, i.e. the upstream physical resource layer node of virtual machine 1 was host 1, on which fault A was produced. The correlated fault of fault B produced by virtual application 1 should therefore be fault A on host 1, not fault C on host 2.
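The host 1 / host 2 example above can be sketched in a few lines of Python. This is a minimal illustration under assumptions, not the patented implementation: the `Link` record, the `upstream_at` helper and the use of minutes since midnight as timestamps are hypothetical names and choices made for this sketch.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Link:
    """One child-to-parent edge of the topology tree with its validity window (hypothetical model)."""
    child: str
    parent: str
    start: int                 # minutes since midnight, inclusive
    end: Optional[int] = None  # exclusive; None means the edge is still valid

def upstream_at(links, child, when):
    """Return the parent of `child` that was valid at time `when`, or None."""
    for link in links:
        if link.child == child and link.start <= when and (link.end is None or when < link.end):
            return link.parent
    return None

# Virtual machine 1 ran on host 1 until 00:02, then migrated to host 2.
links = [
    Link("vm1", "host1", start=0, end=2),
    Link("vm1", "host2", start=2),
]
faults_on = {"host1": ["A"], "host2": ["C"]}

generated_at, reported_at = 1, 4   # fault B: produced at 00:01, reported at 00:04
host_then = upstream_at(links, "vm1", generated_at)   # topology at generation time
host_now = upstream_at(links, "vm1", reported_at)     # topology at reporting time
print("at generation time:", host_then, faults_on[host_then])
print("at reporting time:", host_now, faults_on[host_now])
```

Looking up the topology at the generation time selects host 1 and fault A; looking it up at the reporting time would wrongly select host 2 and fault C.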
Specifically, to account for resource node changes and delayed fault reports, and to make correlated faults easier to process accurately, the resource topology tree includes each resource node within a preset time period and the effective time corresponding to each resource node. That is, the resource topology tree stores all topology information within a period and records the effective time range of each resource node; when a reported fault is analyzed, it is associated with the corresponding resource node according to the time at which the fault was produced. It is worth noting that the period can be set as required, and is preferably one day.
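One way to keep only a bounded period of topology history is to prune records whose effective time ended before the retention window. The sketch below is an assumption about how such pruning might look; the record layout (a dict with an `end` field) and the `prune` function are hypothetical, and only the one-day default comes from the text.

```python
from datetime import datetime, timedelta

def prune(records, now, retention=timedelta(days=1)):
    """Keep records that are still valid (end is None) or that ended within the retention window."""
    cutoff = now - retention
    return [r for r in records if r["end"] is None or r["end"] >= cutoff]

now = datetime(2015, 6, 26, 12, 0)
records = [
    {"node": "vm1", "end": None},                      # still exists
    {"node": "vm2", "end": now - timedelta(hours=2)},  # deleted two hours ago
    {"node": "vm3", "end": now - timedelta(days=2)},   # deleted two days ago
]
kept = prune(records, now)
print([r["node"] for r in kept])
```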
Specifically, before the corresponding resource topology tree is found according to the generation time, the method further comprises: detecting changes of the resource nodes and updating the resource topology tree. That is, resource node changes are collected in real time, the resource topology tree is updated in real time, and historical resource topology trees are preserved so that the resource topology tree can later be queried by time. After a resource node changes, the resource topology tree for a recent period is kept and the effective time range of the affected resource nodes is recorded. When a fault is reported late because of network delay or for other reasons, the correct resource topology tree can be located from the generation time of the fault, so that the fault can later be correlated accurately.
Specifically, the changes of the resource nodes include adding a resource node and/or deleting a resource node, and updating the resource topology tree comprises: when an added resource node is detected, adding the resource node to the resource topology tree and taking the time at which the addition is reported as the start time of the effective time of the resource node; when a deleted resource node is detected, taking the time at which the deletion is reported as the end time of the effective time of the resource node.
To learn the generation time of the fault source, the reception time at which the fault information is received and the delay time of the report can be obtained, and the generation time of the fault source is derived from the reception time and the delay time. It should be understood that any other way of learning the time at which the fault source produced the fault is also included.
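The text does not spell out how the two values are combined. A natural reading, assumed here, is that the generation time is the reception time minus the reporting delay; the function name `generation_time` is hypothetical.

```python
from datetime import datetime, timedelta

def generation_time(received_at: datetime, report_delay: timedelta) -> datetime:
    """Assumed rule: generation time = reception time - reporting delay."""
    return received_at - report_delay

# Fault received at 00:04 with a 3-minute reporting delay.
received = datetime(2015, 6, 26, 0, 4)
produced = generation_time(received, timedelta(minutes=3))
print(produced.strftime("%H:%M"))
```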
Specifically, in step S104 above, obtaining the correlated fault of the fault source according to the position information and the resource topology tree may comprise finding the resource node of the fault source in the resource topology tree according to the position information, and identifying the correlated fault of the fault source from the fault information upstream of the resource node according to preset correlation rules and the fault type of the fault source. This identification proceeds as follows. When the fault source occurs at the physical resource layer, no fault correlation identification is performed. When the fault source occurs at the virtual resource layer, the physical resource layer resource nodes connected to the resource node to which the fault belongs are searched for, the fault information on these nodes is obtained, and the correlated faults of the physical resource layer are filtered out according to the fault type and the preset correlation rules. When the fault source occurs at the virtual application layer, the virtual machines connected to the virtual network function unit to which the fault belongs are searched for, the virtual resource layer resource nodes connected to those virtual machines are obtained, the fault information on these nodes is obtained, and the correlated faults of the virtual resource layer are filtered out according to the fault type and the preset correlation rules.
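The per-layer logic above can be sketched as a small dispatch function. Everything below is an illustrative assumption: the dictionary-based topology, the rule set of (downstream type, upstream type) pairs, and the names `upstream_nodes` and `correlate` are hypothetical. Including the virtual machine itself among the candidate nodes for an application-layer fault is also a choice made for this sketch, since the text does not state it explicitly.

```python
PHYSICAL, VIRTUAL_RESOURCE, VIRTUAL_APP = "physical", "virtual_resource", "virtual_app"

def upstream_nodes(layer, node, topology):
    """Return the nodes whose faults may be correlated with a fault on `node`."""
    if layer == PHYSICAL:
        return []                                       # no correlation for physical faults
    if layer == VIRTUAL_RESOURCE:
        return list(topology["hosted_on"].get(node, []))
    if layer == VIRTUAL_APP:
        found = []
        for vm in topology["vms_of"].get(node, []):     # VMs connected to the VNF
            found.append(vm)                            # assumption: the VM itself counts
            found.extend(topology["resources_of"].get(vm, []))
        return found
    raise ValueError(f"unknown layer: {layer}")

def correlate(fault, topology, faults_by_node, rules):
    """Filter upstream faults by the preset (downstream_type, upstream_type) rules."""
    result = []
    for node in upstream_nodes(fault["layer"], fault["node"], topology):
        for candidate in faults_by_node.get(node, []):
            if (fault["type"], candidate["type"]) in rules:
                result.append(candidate["id"])
    return result

topology = {
    "hosted_on": {"vm1": ["host1"], "vport1": ["pport1"]},
    "vms_of": {"vnf1": ["vm1"]},
    "resources_of": {"vm1": ["vport1", "disk1"]},
}
faults_by_node = {
    "pport1": [{"id": "F1", "type": "network"}],
    "vport1": [{"id": "F2", "type": "network"}],
    "disk1": [{"id": "F3", "type": "storage"}],
}
rules = {("network", "network"), ("storage", "storage")}

app_result = correlate({"layer": VIRTUAL_APP, "node": "vnf1", "type": "network"},
                       topology, faults_by_node, rules)
res_result = correlate({"layer": VIRTUAL_RESOURCE, "node": "vport1", "type": "network"},
                       topology, faults_by_node, rules)
phy_result = correlate({"layer": PHYSICAL, "node": "pport1", "type": "network"},
                       topology, faults_by_node, rules)
print(app_result, res_result, phy_result)
```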
To support better fault correlation analysis, after the resource node of the fault source in the resource topology tree is found according to the position information, the method further comprises: associating the fault with the resource object corresponding to the resource node according to the position information.
Because the products of each layer in a virtualized environment may come from multiple vendors, new vendor products may be introduced continually, for example when a different virtualization platform is connected. The fault information of these vendors differs, so the correlations between faults need to be updated dynamically. Specifically, before the correlated fault of the fault source is identified from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source, the method further comprises: dynamically updating the preset rules. For example, when the products of a virtualization layer are updated, such as when a virtualized application or a new virtualization platform is added, a new correlation rule table is loaded manually or automatically.
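A rule table that can be extended at run time might look like the following. This is a sketch under assumptions: the text does not define the format of a correlation rule table, so the (downstream type, upstream type, weight) entries and the `RuleTable` class are hypothetical.

```python
class RuleTable:
    """Hypothetical in-memory correlation rule table that can be updated at run time."""

    def __init__(self):
        # (downstream_fault_type, upstream_fault_type) -> weight
        self._rules = {}

    def load(self, entries):
        """Merge a vendor's rule entries; later loads override earlier weights."""
        for downstream, upstream, weight in entries:
            self._rules[(downstream, upstream)] = weight

    def weight(self, downstream, upstream):
        """Return the weight of a rule, or None if the pair is not correlated."""
        return self._rules.get((downstream, upstream))

table = RuleTable()
table.load([("vm_link_down", "host_port_down", 10)])    # initial rules
table.load([("vm_link_down", "switch_port_down", 5),    # new vendor product added
            ("vm_link_down", "host_port_down", 8)])     # updated weight
print(table.weight("vm_link_down", "host_port_down"))
print(table.weight("vm_link_down", "disk_failure"))
```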
Further, to process the resource nodes better and make fault correlation faster, the resource nodes in the resource topology tree can be modeled as shown in Fig. 1-4. Faults are divided by resource type into three classes: compute, storage and network. The fault collection module adapts the fault information of each layer and extracts the type information, so that faults are classified as compute faults, storage faults or network faults. Specifically:
Physical resource layer:
Network: router, switch, physical port
Compute: host
Storage: host, storage
Virtual resource layer:
Network: virtual port
Compute: virtual machine
Storage: cloud disk
Virtual application layer:
Faults of the virtual application layer are divided by alarm code into different types: network, compute and storage.
Faults of the virtual application layer are attached to the corresponding virtual network function (Virtualised Network Function, VNF) unit; during subsequent correlation, associations with virtual machines, virtual ports, cloud disks and so on are established according to the position information.
It is worth noting that, with the corresponding model established, fault relationships can be displayed by virtualization layer: the correlated fault information under virtualization can be shown across the three layers (virtual application layer, virtual resource layer and physical resource layer), as shown in Fig. 1-4. When a fault is correlated with multiple possible upstream faults, the display order is determined by weight from high to low, helping the user prioritize fault location. The specific weights can be set as required.
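The layer and type model and the weight-based ordering can be sketched as follows. The `MODEL` table mirrors the lists above; the identifier spellings, the `kinds_for` and `order_by_weight` helpers and the example weights are assumptions made for illustration.

```python
# Resource kinds per layer and fault type, following the lists in the text.
MODEL = {
    "physical": {
        "network": ["router", "switch", "physical_port"],
        "compute": ["host"],
        "storage": ["host", "storage"],
    },
    "virtual_resource": {
        "network": ["virtual_port"],
        "compute": ["virtual_machine"],
        "storage": ["cloud_disk"],
    },
}

def kinds_for(layer, fault_type):
    """Resource kinds that can carry a fault of `fault_type` at `layer`."""
    return MODEL[layer][fault_type]

def order_by_weight(candidates):
    """Order candidate upstream faults for display, highest weight first."""
    return sorted(candidates, key=lambda c: c["weight"], reverse=True)

candidates = [
    {"id": "A", "weight": 3},
    {"id": "C", "weight": 7},
    {"id": "D", "weight": 5},
]
ordered_ids = [c["id"] for c in order_by_weight(candidates)]
print(kinds_for("physical", "storage"), ordered_ids)
```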
Embodiment two
The fault correlation method of this embodiment, as shown in Fig. 2, comprises the following steps:
Step S201: the fault source reports fault information, i.e. the source fault information;
Step S202: the application layer management node (VNFM) receives the fault information and extracts the position and type information through adaptation;
Step S203: the application layer management node associates the fault with the corresponding resource object according to the position information;
Step S204: the application layer management node obtains the correlated fault according to the resource topology relationship (i.e. the resource topology tree);
Step S205: the application layer management node inserts the correlated fault into the fault information and stores it in the database.
Embodiment three
The fault correlation method of this embodiment mainly takes receiving the source fault information reported by a fault source and performing fault correlation as an example. As shown in Fig. 3, it comprises the following steps:
Step S301: the fault source detects an alarm and reports the fault via the Simple Network Management Protocol (SNMP) or by other means;
Step S302: the application layer management node receives the fault and extracts the fault type and position information through adaptation processing;
Step S303: the application layer management node associates the fault with the corresponding resource object according to the fault position information and the time at which the fault was produced;
In this step, specifically: a physical resource layer fault can be associated, according to the position information, with a router, switch, host, physical port or disk array; a virtual resource layer fault can be associated, according to the position information, with a virtual machine, cloud disk or virtual port; a virtual application layer fault is associated, according to the position information, with the corresponding VNF.
Step S304: the fault collection module of the application layer management node notifies the fault correlation module and requests analysis of the correlated faults;
In this step, the correlated fault analysis logic is as follows. The resource topology tree information for the relevant time range is retrieved according to the time at which the fault was produced. Physical resource layer fault: not analyzed. Virtual resource layer fault: according to the resource node to which the fault belongs, the connected physical resource layer resource nodes are searched for, the fault information on these nodes is obtained and filtered according to the fault type and the (predefined) fault correlation rules, and if there are multiple candidates, one is selected by weight. Virtual application layer fault: according to the VNF to which the fault belongs, the connected virtual machines are found first; if the fault is a network fault, the virtual machine port connected to the virtual machine is then looked up by IP address; if the fault is a disk fault, the cloud disks connected to the virtual machine are looked up. For the faults present on these nodes, the correlated virtual resource layer faults are filtered out according to type and correlation rules.
Step S305: the application layer management node updates the root fault information of the fault according to the correlated faults obtained.
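The type-dependent lookup for a virtual application layer fault can be sketched as below. The dictionary layout (`vnf_vms`, `vm_ports` keyed by IP address, `vm_disks`) and the function `candidate_nodes` are hypothetical; fault types other than network and disk are left without candidates because the text does not describe them.

```python
def candidate_nodes(vnf, fault_type, topo, ip=None):
    """Return the virtual resource nodes to inspect for a fault on `vnf`.

    Network faults select the VM port matching the fault's IP address;
    disk faults select the cloud disks attached to each VM of the VNF.
    """
    nodes = []
    for vm in topo["vnf_vms"].get(vnf, []):
        if fault_type == "network":
            port = topo["vm_ports"].get(vm, {}).get(ip)
            if port is not None:
                nodes.append(port)
        elif fault_type == "disk":
            nodes.extend(topo["vm_disks"].get(vm, []))
    return nodes

topo = {
    "vnf_vms": {"vnf1": ["vm1", "vm2"]},
    "vm_ports": {"vm1": {"10.0.0.5": "vport1"}, "vm2": {"10.0.0.6": "vport2"}},
    "vm_disks": {"vm1": ["disk1"], "vm2": ["disk2"]},
}
network_nodes = candidate_nodes("vnf1", "network", topo, ip="10.0.0.5")
disk_nodes = candidate_nodes("vnf1", "disk", topo)
print(network_nodes, disk_nodes)
```

Faults found on the returned nodes would then be filtered by type and correlation rules as described above.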
Embodiment four
The fault correlation method of this embodiment mainly takes the processing performed when a fault is cleared after correlation as an example. As shown in Fig. 4, it comprises the following steps:
Step S401: the fault source reports a fault clearance event;
Step S402: the application layer management node receives the fault clearance event and obtains the position information of the fault from the previously saved information;
Step S403: the application layer management node deletes the fault from the resource node with which it is associated;
Step S404: the application layer management node completes the subsequent fault clearance flow.
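Steps S402 and S403 amount to looking up the saved position of the fault and detaching it from that node. The following sketch assumes a simple in-memory store; the `FaultStore` class and its method names are hypothetical.

```python
class FaultStore:
    """Hypothetical store that remembers where each reported fault was attached."""

    def __init__(self):
        self.location_of = {}   # fault id -> node id, saved when the fault is reported
        self.faults_on = {}     # node id -> set of active fault ids

    def report(self, fault_id, node_id):
        self.location_of[fault_id] = node_id
        self.faults_on.setdefault(node_id, set()).add(fault_id)

    def clear(self, fault_id):
        """Detach a cleared fault from its node; return False if it is unknown."""
        node_id = self.location_of.pop(fault_id, None)   # S402: saved position
        if node_id is None:
            return False
        self.faults_on[node_id].discard(fault_id)        # S403: delete from node
        return True

store = FaultStore()
store.report("A", "host1")
store.report("B", "vnf1")
cleared = store.clear("A")
print(cleared, store.faults_on["host1"], store.faults_on["vnf1"])
```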
Embodiment five
The fault correlation method of this embodiment mainly illustrates the dynamic updating of the preset correlation rules. As shown in Fig. 5, it comprises the following steps:
Step S501: the portal sends a request to the application layer management node;
Step S502: the application layer management node imports the fault correlation rule table and returns a response;
Step S503: the portal displays the import result;
Step S504: where the vendor supports an automatic notification interface, the vendor product, once connected, sends a notification to the application layer management node (VNFM);
Step S505: the application layer management node receives the notification, completes the import of the fault correlation rules, and returns a response.
It should be noted that when a new vendor product is introduced into the virtualized environment, the vendor provides a fault correlation rule table for the product that describes the correlation between faults of this product and faults of upstream products. Where the vendor does not support the automatic notification interface, a manual import is initiated through the VNFM portal.
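The import performed in steps S502 and S505 can be pictured as merging a vendor-supplied table into the rule table already held by the management node. The sketch below is illustrative only; the row fields (`fault_type`, `upstream_fault_type`, `weight`), the default weight of 1 and the response format are assumptions, since the original text does not define the layout of the rule table.

```python
def import_rules(rule_table, vendor_rows):
    """Merge vendor-supplied correlation rules into `rule_table` in place.

    `rule_table` maps (fault type, upstream fault type) -> weight.
    Each row in `vendor_rows` is a dict with hypothetical keys
    "fault_type", "upstream_fault_type" and an optional "weight".
    Returns a small response that a portal could display.
    """
    imported = 0
    for row in vendor_rows:
        key = (row["fault_type"], row["upstream_fault_type"])
        rule_table[key] = int(row.get("weight", 1))  # later imports overwrite earlier ones
        imported += 1
    return {"imported": imported, "total": len(rule_table)}
```

The same function could serve both the manual path (portal request) and the automatic path (vendor notification), since both end in an import of the rule table.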
Embodiment six
The fault correlation method of this embodiment mainly illustrates detecting changes to resource nodes and updating the resource topology tree accordingly. As shown in Fig. 6, it comprises the following steps:
Step S601: a change to a resource node is detected, and a change notification is sent to the application layer management node;
Step S602: the fault resource topology tree is updated according to the change type.
In this step, the specific update rules are as follows. If a resource node is added, a new resource node is added to the fault topology and its effective start time is set to the reporting time. If a resource node is modified and the modification changes its position in the topology, the effective time on the node at the old position is revised (its end time is set to the reporting time), a new node is added at the new position, and the effective start time of the new node is set to the reporting time. If a resource node is deleted, the effective end time of that resource node is set to the reporting time.
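The three update rules of step S602 can be sketched with one validity interval per node position. This is a minimal illustration under stated assumptions: the `NodeRecord` and `TopologyHistory` names, the flat record list, the use of a parent id to represent topology position, and the half-open interval (start inclusive, end exclusive) are all choices made for the example rather than details given in the original text.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class NodeRecord:
    node_id: str
    parent_id: Optional[str]          # position in the topology tree
    valid_from: float                 # effective start time (reporting time of the add)
    valid_to: Optional[float] = None  # effective end time; None means still valid

    def valid_at(self, t):
        return self.valid_from <= t and (self.valid_to is None or t < self.valid_to)

class TopologyHistory:
    def __init__(self):
        self.records = []

    def _current(self, node_id):
        for r in self.records:
            if r.node_id == node_id and r.valid_to is None:
                return r
        return None

    def add(self, node_id, parent_id, reported_at):
        # New node: effective start time is the reporting time.
        self.records.append(NodeRecord(node_id, parent_id, reported_at))

    def move(self, node_id, new_parent_id, reported_at):
        # Position change: close the old record, open a new one at the new position.
        old = self._current(node_id)
        if old is not None:
            old.valid_to = reported_at
        self.add(node_id, new_parent_id, reported_at)

    def delete(self, node_id, reported_at):
        # Deleted node: effective end time is the reporting time.
        old = self._current(node_id)
        if old is not None:
            old.valid_to = reported_at

    def parent_at(self, node_id, t):
        """Return the node's parent as it was at time t, or None if not valid then."""
        for r in self.records:
            if r.node_id == node_id and r.valid_at(t):
                return r.parent_id
        return None
```

Keeping closed records instead of overwriting them is what allows a fault that occurred in the past to be correlated against the topology that was in effect at that time.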
Embodiment seven
This embodiment provides a fault correlation device 700 which, as shown in Fig. 7, comprises a receiving module 701, an acquisition module 702, a topology module 703 and a correlation module 704:
The receiving module 701 is configured to receive source fault information reported by a fault source, the source fault information including location information of the fault source;
The acquisition module 702 is configured to obtain the occurrence time of the fault source;
The topology module 703 is configured to look up the corresponding resource topology tree according to the occurrence time;
The correlation module 704 is configured to obtain the associated fault of the fault source according to the location information and the resource topology tree.
This embodiment further provides a fault correlation device 700 which, as shown in Fig. 8, additionally comprises an update module 705. The update module 705 is configured to detect changes to the resource nodes and update the resource topology tree before the corresponding resource topology tree is looked up according to the occurrence time.
Further, the changes to the resource nodes include adding a resource node and/or deleting a resource node. The update module is further configured to: when an added resource node is detected, add the resource node to the resource topology tree and take the time at which the addition was reported as the start time of the effective time of the resource node; when a deleted resource node is detected, take the time at which the deletion was reported as the end time of the effective time of the resource node.
Further, the acquisition module 702 is also configured to obtain the reception time at which the fault information was received and the reporting delay time, and to derive the occurrence time of the fault source from the reception time and the delay time.
Further, the correlation module 704 is also configured to find the resource node of the fault source in the resource topology tree according to the location information, and to identify the associated fault of the fault source from the fault information upstream of that resource node according to the preset correlation rules and the fault type of the fault source.
Further, the correlation module 704 is also configured to dynamically update the preset rules before identifying the associated fault of the fault source from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source.
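The way the modules hand data to one another can be sketched as below. The text states only that the occurrence time is derived from the reception time and the delay time; computing it as reception time minus reporting delay is an assumption of this example, as are the class name, the callable parameters and the dictionary form of the source fault information.

```python
from datetime import datetime, timedelta

def occurrence_time(received_at: datetime, report_delay: timedelta) -> datetime:
    """Acquisition module: estimate when the fault occurred (assumed: reception minus delay)."""
    return received_at - report_delay

class FaultCorrelationDevice:
    """Hypothetical wiring of the receiving, acquisition, topology and correlation modules."""

    def __init__(self, find_topology, correlate):
        self.find_topology = find_topology  # topology module: occurrence time -> topology
        self.correlate = correlate          # correlation module: (location, topology) -> fault

    def handle(self, source_fault, received_at, report_delay):
        # Receiving module: source_fault carries the location of the fault source.
        occurred_at = occurrence_time(received_at, report_delay)
        topology = self.find_topology(occurred_at)
        return self.correlate(source_fault["location"], topology)
```

Passing the topology lookup and the correlation logic in as callables keeps the sketch independent of how the resource topology tree and the rule table are actually stored.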
It is worth noting that in this example the fault correlation device can be implemented on an application layer management node (NFVO or VNFM) of the virtualization management system; implementation on the VNFM is recommended. First, the VNFM can connect to the physical resource layer, virtual resource layer and virtual application layer systems at the same time, which makes correlation possible. Second, according to the ETSI NFV definition and the positioning generally adopted in the industry, the VNFM sits downstream of the NFVO, so once fault correlation is implemented on the VNFM network element, the NFVO can reuse the result directly.
Those of ordinary skill in the art will appreciate that all or some of the steps of the above method may be carried out by a program instructing the relevant hardware, and that the program may be stored in a computer-readable storage medium such as a read-only memory, a magnetic disk or an optical disc. Alternatively, all or some of the steps of the above embodiments may be implemented with one or more integrated circuits. Accordingly, each module/unit in the above embodiments may be implemented in the form of hardware or in the form of a software functional module. The present invention is not limited to any particular combination of hardware and software.
The above embodiments are intended only to illustrate, not to limit, the technical solution of the present invention, which has been described in detail with reference to preferred embodiments only. Those of ordinary skill in the art should understand that modifications or equivalent substitutions may be made to the technical solution of the present invention without departing from its spirit and scope, and all such modifications and substitutions shall fall within the scope of the claims of the present invention.

Claims (15)

1. A fault correlation method, characterized by comprising:
receiving source fault information reported by a fault source, the source fault information including location information of the fault source;
obtaining the occurrence time of the fault source;
looking up the corresponding resource topology tree according to the occurrence time;
obtaining the associated fault of the fault source according to the location information and the resource topology tree.
2. The fault correlation method as claimed in claim 1, characterized in that the resource topology tree includes each resource node within a preset time period and the effective time corresponding to each resource node.
3. The fault correlation method as claimed in claim 2, characterized in that, before the corresponding resource topology tree is looked up according to the occurrence time, the method further comprises: detecting changes to the resource nodes and updating the resource topology tree.
4. The fault correlation method as claimed in claim 3, characterized in that the changes to the resource nodes include adding a resource node and/or deleting a resource node, and updating the resource topology tree comprises:
when an added resource node is detected, adding the resource node to the resource topology tree and taking the time at which the addition of the resource node was reported as the start time of the effective time of the resource node;
when a deleted resource node is detected, taking the time at which the deletion of the resource node was reported as the end time of the effective time of the resource node.
5. The fault correlation method as claimed in any one of claims 1-4, characterized in that obtaining the occurrence time of the fault source comprises: obtaining the reception time at which the fault information was received and the reporting delay time, and deriving the occurrence time of the fault source from the reception time and the delay time.
6. The fault correlation method as claimed in any one of claims 1-4, characterized in that obtaining the associated fault of the fault source according to the location information and the resource topology tree comprises: finding the resource node of the fault source in the resource topology tree according to the location information, and identifying the associated fault of the fault source from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source.
7. The fault correlation method as claimed in claim 6, characterized in that, after the resource node of the fault source is found in the resource topology tree according to the location information, the method further comprises: associating the fault source with the corresponding resource object of the resource node according to the location information.
8. The fault correlation method as claimed in claim 6, characterized in that identifying the associated fault of the fault source from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source comprises:
when the fault source occurs at the physical resource layer, performing no fault correlation identification;
when the fault source occurs at the virtual resource layer, looking up, from the resource node to which the fault belongs, the physical resource layer resource nodes connected to that resource node, obtaining the fault information on those nodes, and screening out the associated physical resource layer fault according to the fault type and the preset correlation rules;
when the fault source occurs at the virtual application layer, looking up, from the virtual network function unit to which the fault belongs, the virtual machines connected to the virtual network function unit, obtaining the virtual resource layer resource nodes connected to the virtual machines, obtaining the fault information on those nodes, and screening out the associated virtual resource layer fault according to the fault type and the preset correlation rules.
9. The fault correlation method as claimed in claim 6, characterized in that, before the associated fault of the fault source is identified from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source, the method further comprises: dynamically updating the preset rules.
10. A fault correlation device, characterized by comprising a receiving module, an acquisition module, a topology module and a correlation module:
the receiving module is configured to receive source fault information reported by a fault source, the source fault information including location information of the fault source;
the acquisition module is configured to obtain the occurrence time of the fault source;
the topology module is configured to look up the corresponding resource topology tree according to the occurrence time;
the correlation module is configured to obtain the associated fault of the fault source according to the location information and the resource topology tree.
11. The fault correlation device as claimed in claim 10, characterized by further comprising an update module, the update module being configured to detect changes to the resource nodes and update the resource topology tree before the corresponding resource topology tree is looked up according to the occurrence time.
12. The fault correlation device as claimed in claim 11, characterized in that the changes to the resource nodes include adding a resource node and/or deleting a resource node, and the update module is further configured to:
when an added resource node is detected, add the resource node to the resource topology tree and take the time at which the addition of the resource node was reported as the start time of the effective time of the resource node;
when a deleted resource node is detected, take the time at which the deletion of the resource node was reported as the end time of the effective time of the resource node.
13. The fault correlation device as claimed in any one of claims 10-12, characterized in that the acquisition module is further configured to obtain the reception time at which the fault information was received and the reporting delay time, and to derive the occurrence time of the fault source from the reception time and the delay time.
14. The fault correlation device as claimed in any one of claims 10-12, characterized in that the correlation module is further configured to: find the resource node of the fault source in the resource topology tree according to the location information, and identify the associated fault of the fault source from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source.
15. The fault correlation device as claimed in claim 14, characterized in that the correlation module is further configured to: dynamically update the preset rules before identifying the associated fault of the fault source from the fault information upstream of the resource node according to the preset correlation rules and the fault type of the fault source.
CN201510364079.0A 2015-06-26 2015-06-26 Fault correlation method and device Pending CN106330501A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201510364079.0A CN106330501A (en) 2015-06-26 2015-06-26 Fault correlation method and device
PCT/CN2016/073759 WO2016206386A1 (en) 2015-06-26 2016-02-14 Fault correlation method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510364079.0A CN106330501A (en) 2015-06-26 2015-06-26 Fault correlation method and device

Publications (1)

Publication Number Publication Date
CN106330501A true CN106330501A (en) 2017-01-11

Family

ID=57584611

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510364079.0A Pending CN106330501A (en) 2015-06-26 2015-06-26 Fault correlation method and device

Country Status (2)

Country Link
CN (1) CN106330501A (en)
WO (1) WO2016206386A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108768949B (en) * 2018-04-28 2020-04-14 广东电网有限责任公司 Random geometric data anomaly positioning method based on Markov random field theory
CN111600746B (en) * 2020-04-15 2022-12-09 新浪网技术(中国)有限公司 Network fault positioning method, device and equipment
US20230239206A1 (en) * 2022-01-24 2023-07-27 Rakuten Mobile, Inc. Topology Alarm Correlation
CN114780283B (en) * 2022-06-20 2022-11-01 新华三信息技术有限公司 Fault processing method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101431448A (en) * 2008-10-22 2009-05-13 华为技术有限公司 Method, equipment and system for positioning fault of IP bearing network
CN102045192A (en) * 2009-10-20 2011-05-04 株式会社日立制作所 Apparatus and system for estimating network configuration
CN103001811A (en) * 2012-12-31 2013-03-27 北京启明星辰信息技术股份有限公司 Method and device for fault locating
WO2015042937A1 (en) * 2013-09-30 2015-04-02 华为技术有限公司 Fault management method, entity and system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102082690B (en) * 2011-01-10 2013-04-03 北京邮电大学 Passive finding equipment and method of network topology

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107623596A (en) * 2017-09-15 2018-01-23 郑州云海信息技术有限公司 Start the method for testing network element positioning investigation failure in a kind of NFV platforms
CN108156037A (en) * 2017-12-29 2018-06-12 中国移动通信集团江苏有限公司 Alarm correlation analysis method, device, equipment and medium
CN108156037B (en) * 2017-12-29 2020-12-15 中国移动通信集团江苏有限公司 Alarm correlation analysis method, device, equipment and medium
CN110493025A (en) * 2018-05-15 2019-11-22 中国移动通信集团浙江有限公司 It is a kind of based on the failure root of multilayer digraph because of the method and device of diagnosis
CN110493025B (en) * 2018-05-15 2022-06-14 中国移动通信集团浙江有限公司 Fault root cause diagnosis method and device based on multilayer digraphs
CN110428060A (en) * 2019-06-12 2019-11-08 南京博泰测控技术有限公司 A kind of fault information managing method, device and system
CN111193605A (en) * 2019-08-28 2020-05-22 腾讯科技(深圳)有限公司 Fault positioning method and device and storage medium
CN110868355A (en) * 2019-11-19 2020-03-06 广州丰石科技有限公司 NFV network-based topology automatic discovery and fault delimitation method
CN110868355B (en) * 2019-11-19 2022-05-13 广州丰石科技有限公司 Topology automatic discovery and fault delimitation method based on NFV network
CN111026098A (en) * 2019-12-30 2020-04-17 臻驱科技(上海)有限公司 Fault diagnosis method and device for vehicle motor controller and electronic equipment
CN114448774A (en) * 2021-12-16 2022-05-06 武汉光迅科技股份有限公司 Alarm processing method, device and storage medium
CN114448774B (en) * 2021-12-16 2023-12-05 武汉光迅科技股份有限公司 Alarm processing method, device and storage medium

Also Published As

Publication number Publication date
WO2016206386A1 (en) 2016-12-29

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20170111