WO2006021702A1 - Systeme de gestion de reseau de communication pour reparation automatique de pannes - Google Patents

Systeme de gestion de reseau de communication pour reparation automatique de pannes Download PDF

Info

Publication number
WO2006021702A1
WO2006021702A1 PCT/FR2005/050530 FR2005050530W WO2006021702A1 WO 2006021702 A1 WO2006021702 A1 WO 2006021702A1 FR 2005050530 W FR2005050530 W FR 2005050530W WO 2006021702 A1 WO2006021702 A1 WO 2006021702A1
Authority
WO
WIPO (PCT)
Prior art keywords
diagnosis
module
communication network
management system
failure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/FR2005/050530
Other languages
English (en)
French (fr)
Inventor
Emmanuel Marilly
Olivier Martinot
Mohamed Adel Saidi
Sylvain Squedin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alcatel Lucent SAS
Nokia Inc
Original Assignee
Alcatel SA
Alcatel Lucent SAS
Nokia Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alcatel SA, Alcatel Lucent SAS, Nokia Inc filed Critical Alcatel SA
Priority to US11/572,904 priority Critical patent/US8149718B2/en
Priority to JP2007523126A priority patent/JP2008508760A/ja
Priority to EP05787400A priority patent/EP1774705A1/fr
Publication of WO2006021702A1 publication Critical patent/WO2006021702A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/16Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks using machine learning or artificial intelligence
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements

Definitions

  • the present invention relates to communication networks and applies in particular but not exclusively to packet-oriented communication networks. It relates more particularly to the management of communication networks. It is known to associate with communication networks, network management systems (or NMSs for "Network Management System”) whose purpose is to collect alarms from the network equipment, to make a synthesis, including the means of correlation methods, and display this summary or its alarms to an operator so that it can implement a corrective action in case of failure of one or more of these equipment.
  • the notion of "failure” must be understood in a very general sense meaning any type of malfunction, whether hardware or software. Thus, a bad configuration of a network element can be likened to a failure (software), as well as an error in a routing table of an IP router or a port inadvertently closed.
  • Network management systems can also configure network equipment.
  • the operator can enter new parameters through a human-machine interface and the network management system passes these new parameters on the equipment. In this way, the operator can correct a network failure in response to the display of alarms.
  • Such centralized analysis relies on the collection of a large number of information and alarm data from the many elements of the communication network.
  • These elements can be devices such as routers within the framework of a network of Internet Protocol (IP) communication, but they can also be Voice, Optical, Private Branch Exchange (PABX), Media Gateway (Media Gateway) switches in an NGN (Next Generation Network) network, and so on.
  • a network element can also be a part of such equipment, such as a map for example.
  • a failure on a card of an equipment can generate an alarm from all the cards of other equipment, connected to one of the ports of this card, as well as from the cards of the equipment in which the breakdown occurred.
  • diagnostic systems which, based on this large number of alarms, determine the most likely cause (s). These diagnostic systems are based for example on sets of rules, such as those based on the product llogRules TM from the company Ilog. They can also rely on neural networks, Bayesian networks, expert systems or other techniques, including artificial intelligence.
  • the object of the invention is to automate the correction of failures of the managed communication network, while ensuring that the problems detected are corrected or, at least, that the general situation of the telecommunication network is improved.
  • the invention firstly relates to a management system of a communication network comprising a diagnostic module capable of determining a diagnosis from information provided by elements of the communication network.
  • This diagnosis identifies a failure within the communication network.
  • This management system is characterized in that it further comprises a repair module arranged to determine one or more corrective actions from at least the diagnostic received from the diagnostic module and to transmit one or more commands corresponding to the action or actions corrective, to one or more elements of the communication network, to correct the fault, to transmit a diagnostic request to the diagnostic module to trigger the determination of a new diagnosis by the diagnostic module.
  • the management system can comprise the following characteristics, individually or in combination: an iteration counter is incremented each time a new diagnosis is triggered, and the management system stops the modules of diagnosis and repair when the iteration counter has reached a predetermined threshold.
  • the command or commands are transmitted via a mediation module capable of transforming the command or commands formulated in a generic language, into specific commands conforming to control languages specific to the different equipment of the communication network.
  • the repair module is arranged to communicate with a traffic optimization module if corrective actions can not be satisfactorily determined to correct the fault, in order to eliminate or limit the influence of this fault on the behavior of the network of communication.
  • the diagnostic module implements a Bayesian network to determine the diagnosis.
  • the diagnosis contains at least the designation of the fault, the location of the fault, and possibly an estimate of the probability associated with the diagnosis.
  • the diagnostic module communicates the diagnostics to the repair module using an SNMP compliant message.
  • the repair module has a repair rule base.
  • the rules may depend on the iteration counter.
  • the repair module has a history of the diagnostics and corrective actions determined at each iteration and is able to return to the state of the communication network for a given iteration, if when the threshold is reached, the diagnosis is worse than what he was for this given iteration.
  • the invention also has as a second object a method for managing a communication network comprising a first step of determining a diagnosis based on information provided by elements of the communication network.
  • This method is characterized in that it further comprises a second determination step, if the diagnosis identifies a failure, one or more corrective actions from at least the diagnosis and transmission of one or more commands ( C) corresponding to this or these corrective actions, to one or more elements of the communication network, in order to correct the failure; the first step being automatically triggered at the end of the second step, to form a treatment cycle
  • the management method may comprise the following characteristics, individually or in combination: an iteration counter is incremented each time a new diagnosis is triggered, the treatment cycle being interrupted when this iteration counter has reached a predetermined threshold.
  • the command or commands are transmitted via a mediation module capable of transforming the command or commands formulated in a generic language, into specific commands conforming to control languages specific to the different equipment of the communication network.
  • a command can be transmitted to a traffic optimization module if corrective actions can not be satisfactorily determined to correct the fault, in order to eliminate or limit the influence of this fault on the behavior of the communication network.
  • the first step implements a Bayesian network to determine the diagnosis.
  • the diagnosis contains at least the designation of the fault, the location of the fault, and possibly an estimate of the probability associated with the diagnosis.
  • a history of the diagnoses and corrective actions determined at each iteration is constituted and if when the threshold is reached the diagnosis is less good than it was for a previous iteration, it returns to the state of the communication network for this iteration earlier.
  • the repair module makes it possible to automatically repair (or semi-automatically, after validation of the operator) the failures of the communication network and / or to undertake actions corrective measures limiting the consequences of failures, and in particular their impact on the Service Level Agreements (SLAs) deployed on the communication network.
  • SLAs Service Level Agreements
  • a new diagnosis is requested from the diagnostic module to ensure that this repair has not caused a new failure, and if so, to correct them in turn.
  • a treatment cycle can thus be implemented which must converge towards the repair of the diagnosed breakdown.
  • a threshold can be set for the number of iterations of the loop.
  • FIG 1 shows schematically the management system according to the invention.
  • FIG. 2 represents an internal architecture of the repair module according to one implementation of the invention.
  • FIG. 3 illustrates the sequence of steps of the method according to the invention.
  • the network management system comprises two main modules which are on the one hand a diagnostic module MD and on the other hand a repair module MR.
  • the diagnostics module receives inbound complaints from the CC clients and network A information. Customer complaints may be caused by a decrease in the performance of the network or the services implemented on the network. Upon receipt of such a complaint, the MD diagnostic system may trigger a diagnostic search to determine the technical cause that led to the complaint. It can then take advantage of information from network elements. A diagnostic search can also be triggered by this information from the elements of the communication network themselves (that is, independently of the presence of a customer complaint). The information A resulting from the elements of the network can be mainly of two natures:
  • They can be alarms, ie asynchronous notifications transmitted by the elements of the network to the network management system. These alarms are transmitted when an anomaly is detected by the element, when a measured value reaches a predetermined threshold or, more generally, during any type of event that has been planned, by configuration. , that the appearance should trigger an alarm. These alarms can therefore indicate a failure, whether it is software or hardware.
  • MIB Management Informat ion
  • the diagnostic module MD can determine that these represent a probable failure or failure. As a result of such a determination, it triggers the determination of the diagnosis, that is to say, the search for the failure. As mentioned earlier, when a failure occurs, it can cause alarms not only from the failed network element, but also from "neighboring" network elements. The determination of the diagnosis is precisely to identify the failed network element (it is the location of the failure), as well as the type of failure (the title of the failure), from this set of alarms from a multitude of network elements. Generally, the diagnosis can not be certain, and the diagnosis module MD will then provide the diagnosis D most probable by associating a probability value. It can also provide a list of several diagnostics, each associated with a probability value, and each identifying a failure within the communication network N.
  • the diagnostic module can be a known module of the state of the art. As previously described, the technologies used can be diverse: neural networks, expert systems, rule-based systems, and so on. These may be the aforementioned commercial products. However, the diagnostic module MD must preferably respect two criteria:
  • the diagnostic search can be initiated other than by receiving alarms A from the communications network.
  • this search must be triggered by an external event that can be either a customer complaint CC or a diagnostic request (CD), transmitted by the repair module MR and will be explained later.
  • an external event can be either a customer complaint CC or a diagnostic request (CD), transmitted by the repair module MR and will be explained later.
  • this diagnostic module implements Bayesian networks and is similar to the "5530 - Network Analyzer” product from Alcatel.
  • the operation of an implementation of this module is therefore described in the technical document of the product "5530".
  • the Bayesian network technology and its application to the diagnosis determination are for example described in F. Jensen's "An introduction to Bayesian Networks", LJCL Press, 1996 or in the article “IP VPN Network Diagnosis: Technologies and Perspectives "by incident Délègue, Stéphane Betgé-Brezetz, Emmanuel Marilly and Olivier Martinot, 3rd International Conference on Networking, March 2004.
  • Bayesian networks uses probability theory which defines a rule to refine a hypothesis by taking into account additional evidence and background information and thus lead to a number representing the degree of probability that the hypothesis be true.
  • a Bayesian diagram therefore defines all the test actions to be undertaken, associated with a probability and a cost, the cost serving for example to define which tests require more time than the others.
  • Figure 3 illustrates an extremely simplified example of a Bayesian diagram. It shows that the "LossPacket" entity depends on several other entities, "HighCPUUtilization”, “Interface IN Status”, “InterfaceOutStatus” (Status of the outgoing interface).
  • This model therefore allows the system to represent the causal chains and thus to indicate the tests to be performed, and to calculate a probability value for each possible diagnosis and to determine a most probable diagnosis.
  • a diagnosis D determined it is transmitted to the repair module MR, in the form of one or more messages.
  • These messages are, for example, compliant with the Simple Network Management Protocol (SNMP) defined by the Internet Engineering Task Force (IETF) in RFCs 1 157 and 1592 for versions 1 and 2 respectively. It can be a single SNMP message (or "trap") or multiple messages. In the latter case, the messages must be clearly identified so that the repair module MR can easily determine that they are related to the same diagnosis D.
  • the first message contains the general information on the diagnosis, while the following message (s) contain details information.
  • the first message may contain the diagnostic information relating to the LSP itself, while the subsequent messages contain information relating to the equipment (or routers) traversed by the LSP.
  • This determination can be made using the ticket which will be explained below: all the SNMP messages carrying the same diagnosis D will have the same ticket.
  • the two modules can communicate via a software bus CORBA type (Common Object Reqüest Broker Architecture) as defined by the OMG (Open Management Group) or COM / DCOM company Microsoft.
  • CORBA type Common Object Reqüest Broker Architecture
  • OMG Open Management Group
  • COM / DCOM company Microsoft COM / DCOM company Microsoft
  • a ticket which is a number that identifies a session for finding the correct diagnosis for a given problem. As said before, if the same diagnosis is carried by several messages, in particular because the volume of information is too important, each of these messages will have the same ticket.
  • OIDs SNMP Object IDentifiers
  • the left part (1 .3.6.1.4.1.637.) Corresponds to the Alcatel identifier, the next part (4.) corresponds to the application (ie the module of MD diagnosis) and the last part corresponds to the attributes specific to the SNMP trap fields.
  • the repair module D Upon receipt of a message containing a diagnosis D, the repair module D will seek to determine one or more corrective actions.
  • Figure 2 shows a possible functional architecture of the MR repair module. This architecture is functional, it can give rise to different achievements.
  • the repair module MR comprises a main module MP, an HMI human-machine interface, a database B and a configuration module MC.
  • the D diagnostics are received by the main module MP, whose purpose is to determine the corrective actions and to transmit the commands corresponding to these corrective actions either to a mediation module MED, or directly to the elements of the network, not represented on the Fig.
  • This determination is therefore made from the diagnosis D and the information it contains and that we have seen previously. However, it may also depend on other such information - an internal logic, ie a formalization in a particular language, of the policy that makes it possible to determine a corrective action from at least one diagnosis.
  • This internal logic is stored in a database B.
  • parameters determined by the user in particular by means of the interface HMI and the configuration module MC and stored in the database B (or possibly in another database separate data).
  • Information transmitted by the user via the HMI interface during the development of the diagnosis in particular in response to a proposal made by the main module MP.
  • the internal logic is formalized in the form of a set of rules. These rules make it possible to associate the diagnostics C that can be received from the diagnostic module MD with corrective actions. They can be entered in base B via the MC configuration module and the HMI interface. These rules can typically be expressed as:
  • the HMI interface can display them on the screen and allow the user to choose the one that seems most appropriate.
  • the repair module MR will then transmit to the elements of the communication network, the commands C corresponding to the corrective actions chosen by the operator.
  • other implementations are also possible.
  • the advantage of using neural networks is their ability to learn and operate in the uncertain. For example, they may be able to apply the most likely corrective action to the most likely outage, even if a certain amount of information is missing.
  • the problem is the difficulty of apprehension of such a technology by the operators of the communication networks.
  • the commands C can be transmitted to the elements of the communication network N, via a mediation module MED.
  • the aim of this mediation module is to transform commands C formulated in a generic language by the repair module MR into specific commands C 1 , C 2 conforming to control languages specific to the different equipment of the communication network.
  • the equipment items N 1 and N 2 may be of a different type, for example an IP router and a media gateway. to connect this communication network N to another network, not shown. It may also be equipment of similar types but manufactured by different suppliers. In these two situations, the commands to be used to configure them must be adapted to the command language understood by the equipment.
  • the same generic command C (for example, "open an interface”) will therefore give rise to different specific commands for the two devices N 1 and N 2 , each of these specific commands being in accordance with the equipment's own language, and in particular with their CLI interface (Commancf An Interface). Different products and technologies can be used to realize this MED mediation module.
  • a policy-based management architecture can be used.
  • the repair module MR makes it possible to automate the repair of the communication network or to semi-automate it in the case where an intervention of the operator is requested.
  • the complexity of communication networks is such that it is almost impossible to fully control the impact of corrective actions.
  • the repair can be ineffective to solve the problem (for example, because the diagnosis was not good, but also because the corrective actions were not fully adapted), and secondly , it is also possible that the actual repair of the fault can cause a new failure. For example, by repairing a first failure, the transmission of traffic to network elements that until then were isolated, and in doing so, we can highlight a breakdown among these network elements.
  • the Applicant therefore considered the additional technical problem of minimizing the insufficient or unwanted impacts on the communication network.
  • a diagnostic request CD is transmitted to the diagnostic module MD, so that it determines a new diagnosis.
  • This diagnostic request CD can contain the value of the ticket, so that this new diagnosis (and any subsequent) is linked to the first diagnosis within a session determined by the same ticket. Thanks to this re-looping to the diagnostic module MD, it can check the correction of the fault and detect the appearance of a new failure. It can then trigger a new repair by the repair module MR, and so on.
  • This preferred embodiment of the invention therefore makes it possible to carry out a control loop of the actions performed on the managed communication network.
  • this embodiment may pose the additional problem of causing infinite loops, that is to say that each repair causes a new failure cyclically.
  • an iteration counter can be implemented. For example, this iteration counter is incremented each time a new diagnosis is triggered. When this iteration counter reaches a certain predefined threshold, the loop is stopped and the system can indicate to the operator through the HMI interface that no solution has been found.
  • the counter can be initialized to the value of this threshold and decremented each time the new diagnosis is triggered. The stop criterion is then the transition to 0 of this iteration counter.
  • this iteration counter can influence the internal logic of the repair module. For example, in the situation where this internal logic is implemented as a set of rules, these rules may depend on the value of the iteration counter.
  • the operator can configure these dependencies according to the constraints he sets. For example, on an unimportant LSP, the operator can associate a high iteration threshold (notably because the resulting cost is less than the triggering of the traffic optimization module, which we will see later). For a very important LSP with an SLA with strong penalties, the iteration threshold can be very low and the operator wishes above all to maintain the service even if it is necessary to trigger the traffic optimization module. Eventually, if in the end, the diagnostics are more serious problems on the communication network than they were at a given moment, a back tracking mechanism can be implemented to return to this configuration of the network. . To do this, the repair module contains a history of corrective actions and / or commands issued and diagnostics received.
  • the repair module can use this history to return the network in the state corresponds to this iteration. It is enough then to transmit the inverse commands to restore this previous situation. Failing to truly resolve the failure, this implementation provides a compromise by improving the state of the network.
  • a message is sent to the operator via the human-machine interface HMI to indicate that the problem could not be solved and the situation of the network.
  • a traffic optimization module TE (or Traffic Engineering, in English) can be used. The purpose of this traffic optimization module is to redirect traffic so that it avoids the problem, rather than truly solving the problem.
  • the TE module can modify the routing tables of other routers, the LSPs passing through this router, and so on. so that no more traffic is directed to this router.
  • the equipment in question remains inoperative, the behavior of the network can become fully operational.
  • a diagnosis identifying a breakdown can lead to four conclusions: either it has repair or there is no repair and the operator is warned, or there is no repair but triggering of the tool.
  • traffic optimization the MR repair module places the network in the "minus" bad state by using the rollback function and warns the operator.
  • FIG. 3 schematizes the progress of the steps of the method according to the invention.
  • a first step “Diag” is to trigger the determination of a diagnosis. This step is triggered, typically, as a result of receiving a communication network alarm or a customer complaint. Then, the system determines whether the diagnosis identifies a fault (box "P"). If there is no fault, the process stops (box “Fl") If there is a fault, the process consists in determining certain criteria of stops ("Stop?"). It is a matter of determining whether the iteration number has reached the predefined threshold, or whether the diagnosis corresponds to a diagnosis already established during an earlier iteration. If these stop criteria are met, the system stops (box "F2"). If not, the method implements a "Rep” step of determining corrective actions and deploying commands associated with these corrective actions to the communication network.
  • a new diagnostic step "Diag” is triggered and the process then goes through the various steps and tests described above.
  • the diagnostic module is put into operation by receiving alarms from the communication network or by a complaint from a client.
  • a complaint from a customer CC or alarms A received from the communication network have highlighted a problem within this network.
  • a first diagnosis reveals that the interface X of a router Y is out of service ("down" in English).
  • the diagnostic module MD transmits this diagnosis to the repair module, with a ticket value and an iteration value of 1.
  • the repair module MR determines the corrective action associated with this diagnosis.
  • the corrective action is, in this example, to put the interface X in service (" ⁇ p" in English).
  • a generic command associated with this corrective action is transmitted to the mediation module MED, which determines the appropriate command in the language specific to the router Y.
  • the repair module MR transmits a diagnostic request CD to the diagnostic module MD, specifying the same ticket value.
  • the diagnostic module MD redoes a diagnosis of the communication network and determines that the interface X of the router Y is indeed operational (" ⁇ p") and that the network is operating normally. He can then transmit a diagnostic D to the repair module to signal that the network is functioning normally. After which, the problem being solved, the repair module MR does not trigger any action and the method of the invention ends.
  • the diagnostic module also determines that the interface X of the router Y is down ("down").
  • the diagnosis D is transmitted to the repair module with a ticket, and the repair module MR determines a corrective action.
  • the generic command associated with this corrective action is transmitted to the MED mediation module which converts it into a command conforming to the specific language of the router Y.
  • the MR repair module transmits a CD diagnostic request to the diagnostic module MD, with the same ticket.
  • the diagnostic module MD triggers a new diagnostic determination.
  • the new diagnostic D notes that the X interface of the Y router is still out of service ("cfown"). This diagnosis D is transmitted to the repair module MR with the same ticket, and an iteration value equal to 2.
  • the MR repair module can see that although the iteration is 2, the problem remains the same. He can then decide to transmit a command to the traffic optimization module TE (Traffic Engineering).
  • TE Traffic Engineering
  • This TE traffic optimization module can then modify the paths in place in the communication network so that they avoid the interface X of the router Y. In this way, although the interface in question is out of order, the communication network can behave normally.
  • the diagnostic module MD also determines that the interface X of the router Y is out of order ("cfown").
  • the diagnosis D is transmitted to the repair module with a ticket number and an iteration value equal to 1.
  • the repair module determines the generic command to be transmitted to the mediation module MED which converts it into a command according to the language specific to the router Y in order to put the interface X in service (" ⁇ p").
  • the MR repair module transmits a CD diagnostic request to the diagnostic module MD, with the same ticket number.
  • the diagnostic module then triggers a new diagnostic determination and a new problem emerges, for example a misconfiguration of the Border Gateway Protocol (BGP).
  • BGP Border Gateway Protocol
  • This diagnosis D is transmitted to the repair module with an iteration number equal to 2 and the same ticket value.
  • the repair module MR determines the appropriate corrective actions, which may be here to reconfigure the BGP protocol. Generic commands are then passed to the mediation module MED which converts them into specific commands.
  • the repair module MR transmits a diagnostic request CD to the diagnostic module MD, specifying the same ticket number. It appears from the new diagnosis D that the same interface X of the router Y is out of service ("down").
  • This diagnosis D is transmitted to the repair module MR with an iteration value equal to 3 and always the same ticket value.
  • the repair module can see that the iteration value is equal to 3 and that the diagnosis is the same as that of iteration 1. He can then determine that he is unable to solve the problem and transmit to the human-machine interface HMI a command causing the display of a message expressing this disability as well as information relating to the diagnosis.
  • the repair module MR of the invention can adopt different behaviors, such as to warn the operator , trigger a traffic optimization module, trigger repair commands, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Telephonic Communication Services (AREA)
PCT/FR2005/050530 2004-07-30 2005-07-01 Systeme de gestion de reseau de communication pour reparation automatique de pannes Ceased WO2006021702A1 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US11/572,904 US8149718B2 (en) 2004-07-30 2005-07-01 Communication network management system for automatic repair of failures
JP2007523126A JP2008508760A (ja) 2004-07-30 2005-07-01 自動障害修復のための通信ネットワーク管理システム
EP05787400A EP1774705A1 (fr) 2004-07-30 2005-07-01 Systeme de gestion de reseau de communication pour reparation automatique de pannes

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0408505A FR2873879B1 (fr) 2004-07-30 2004-07-30 Systeme de gestion de reseau de communication pour reparation automatique de pannes
FR0408505 2004-07-30

Publications (1)

Publication Number Publication Date
WO2006021702A1 true WO2006021702A1 (fr) 2006-03-02

Family

ID=34947039

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/FR2005/050530 Ceased WO2006021702A1 (fr) 2004-07-30 2005-07-01 Systeme de gestion de reseau de communication pour reparation automatique de pannes

Country Status (6)

Country Link
US (1) US8149718B2 (enExample)
EP (1) EP1774705A1 (enExample)
JP (1) JP2008508760A (enExample)
CN (1) CN101027872A (enExample)
FR (1) FR2873879B1 (enExample)
WO (1) WO2006021702A1 (enExample)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080192119A1 (en) * 2007-02-14 2008-08-14 At&T Knowledge Ventures, Lp System and method of managing video content quality
US8213321B2 (en) 2007-02-01 2012-07-03 Deere & Company Controller area network condition monitoring and bus health on in-vehicle communications networks

Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7536595B1 (en) * 2005-10-19 2009-05-19 At&T Intellectual Property, Ii, L.P. Systems, devices, and methods for initiating recovery
JP2007128437A (ja) * 2005-11-07 2007-05-24 Hitachi Ltd ディスクアレイ装置及びその経路障害検出方法
US20070106784A1 (en) * 2005-11-08 2007-05-10 Dickman David T Systems, methods and apparatus to identify network maintenance zones
JP4701148B2 (ja) * 2006-03-02 2011-06-15 アラクサラネットワークス株式会社 障害回復システム及びサーバ
US7788544B2 (en) * 2006-05-03 2010-08-31 Computer Associates Think, Inc. Autonomous system state tolerance adjustment for autonomous management systems
CN101197621B (zh) * 2007-12-07 2012-03-07 中兴通讯股份有限公司 一种对网管系统故障进行远程诊断定位的方法及其系统
CN101459925B (zh) * 2007-12-11 2010-10-20 中国移动通信集团公司 电信网络投诉管理系统及方法
WO2009082836A1 (en) * 2007-12-26 2009-07-09 Zte Corporation Fault handling method and device
CN101217696B (zh) * 2007-12-28 2012-09-05 中国移动通信集团浙江有限公司 一种移动用户投诉综合处理系统及其方法
US7995575B2 (en) * 2008-01-02 2011-08-09 Cisco Technology, Inc. Packet error handling
US8588225B1 (en) * 2008-07-07 2013-11-19 Cisco Technology, Inc. Physical resource to virtual service network mapping in a template based end-to-end service provisioning
DE102009033215A1 (de) * 2008-07-15 2010-01-21 Airbus Operations Gmbh Netzwerkverwaltungs-System für ein Flugzeug
US7954010B2 (en) * 2008-12-12 2011-05-31 At&T Intellectual Property I, L.P. Methods and apparatus to detect an error condition in a communication network
CN101801015B (zh) * 2009-02-06 2014-03-12 中兴通讯股份有限公司 一种小区退服故障的处理方法及装置
CN101924661B (zh) * 2009-06-17 2015-07-22 中兴通讯股份有限公司 告警的处理方法及装置
WO2011014169A1 (en) 2009-07-30 2011-02-03 Hewlett-Packard Development Company, L.P. Constructing a bayesian network based on received events associated with network entities
US8825845B1 (en) * 2010-11-10 2014-09-02 Open Invention Network, Llc Managing a network element operating on a network
CN102325036B (zh) * 2011-05-17 2016-09-14 南京中兴软件有限责任公司 一种网络系统的故障诊断方法、系统及装置
US9160620B2 (en) * 2011-11-30 2015-10-13 GM Global Technology Operations LLC Integrated fault diagnosis and prognosis for in-vehicle communications
US9021310B1 (en) * 2012-02-14 2015-04-28 Amazon Technologies, Inc. Policy-driven automatic network fault remediation
CN103684828B (zh) * 2012-09-18 2018-08-03 长春亿阳计算机开发有限公司 一种电信设备故障的处理方法和装置
CN103001801B (zh) * 2012-11-30 2016-12-21 北京奇虎科技有限公司 网络修复方法和装置
CN103023699B (zh) * 2012-11-30 2016-12-21 北京奇虎科技有限公司 一种网络修复方法和系统
US9071956B2 (en) * 2012-12-03 2015-06-30 Qualcomm Incorporated Systems and methods for dynamic enablement of wireless communication device functionalities
US20140351414A1 (en) * 2013-05-24 2014-11-27 Alcatel Lucent Systems And Methods For Providing Prediction-Based Dynamic Monitoring
US9578047B2 (en) * 2015-01-13 2017-02-21 GM Global Technology Operations LLC Method and system for reflectometry based communication network monitoring, intrusion detection, and message authentication
CN105262622A (zh) * 2015-10-27 2016-01-20 北京极科极客科技有限公司 一种路由器的优化和诊断的方法及系统
US9973441B2 (en) * 2015-11-07 2018-05-15 King Abdulaziz City Of Science And Technology (Kacst) Method and system for establishing routes in wireless ad-hoc networks utilizing Bayesian approach
US10355918B2 (en) * 2016-01-20 2019-07-16 Level 3 Communications, Llc System and method for automatically repairing a network element
US10162698B2 (en) * 2016-03-25 2018-12-25 Dropbox, Inc. System and method for automated issue remediation for information technology infrastructure
US10142185B2 (en) 2016-06-08 2018-11-27 At&T Intellectual Property I, L.P. Content quality assessment and prediction via flows
JP6718398B2 (ja) * 2017-02-15 2020-07-08 日本電信電話株式会社 サービス復旧装置およびサービス復旧方法
CN108540298B (zh) * 2017-03-01 2022-06-17 中兴通讯股份有限公司 一种自动处理垃圾业务的方法及装置
JP2018170618A (ja) * 2017-03-29 2018-11-01 Kddi株式会社 障害自動復旧システム、制御装置、手順作成装置およびプログラム
DE102018122445A1 (de) * 2017-09-13 2019-03-14 Fisher-Rosemount Systems, Inc. Assistent anwendung für ein modulares steuerungssystem
US10771331B2 (en) 2018-11-07 2020-09-08 Cisco Technology, Inc. Closed loop control for fixing network configuration issues to aid in device classification
CN111200827B (zh) * 2018-11-19 2023-03-21 华硕电脑股份有限公司 网络系统、无线网络延伸器以及网络供应端
CN109560970A (zh) * 2018-12-24 2019-04-02 大唐软件技术股份有限公司 一种网络故障愈合方法、装置以及系统
US10985969B2 (en) * 2019-02-19 2021-04-20 Juniper Networks, Inc. Systems and methods for a virtual network assistant
WO2020235926A1 (en) * 2019-05-20 2020-11-26 Samsung Electronics Co., Ltd. Methods and systems for recovery of network elements in a communication network
US11570038B2 (en) 2020-03-31 2023-01-31 Juniper Networks, Inc. Network system fault resolution via a machine learning model
US11314583B2 (en) 2020-08-18 2022-04-26 Micron Technology, Inc. Memory data correction using multiple error control operations
US11743151B2 (en) 2021-04-20 2023-08-29 Juniper Networks, Inc. Virtual network assistant having proactive analytics and correlation engine using unsupervised ML model
US11770290B2 (en) 2021-08-13 2023-09-26 Juniper Networks, Inc. Network management actions based on access point classification
US12450400B2 (en) * 2023-10-31 2025-10-21 Dell Products L.P. Out of band component validation

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998024222A2 (en) * 1996-11-27 1998-06-04 Telefonaktiebolaget Lm Ericsson (Publ) Software fault management system
US6294991B1 (en) * 1998-09-08 2001-09-25 Mci Communications Corporation Method and system therefor for ensuring a true activation of distributed restoration in a telecommunications network

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6654914B1 (en) * 1999-05-28 2003-11-25 Teradyne, Inc. Network fault isolation
JP2000358029A (ja) * 1999-06-15 2000-12-26 Nec Corp 自動障害診断ネットワークシステム及びネットワークの自動障害診断方法
US6718377B1 (en) * 1999-08-13 2004-04-06 Lucent Technologies Inc. Telecommunications network management system interface
CA2411821A1 (en) * 2000-06-16 2001-12-20 Verisae, Inc. Enterprise asset management system and method
US20020124070A1 (en) * 2001-03-02 2002-09-05 Pulsipher Eric A. System for providing related information of a network error event in a hand-held device
JP4149178B2 (ja) * 2001-03-09 2008-09-10 松下電器産業株式会社 リモートメンテナンスシステム
FR2835674B1 (fr) 2002-02-07 2006-02-24 Cit Alcatel Deploiement des regles par un dispositif de gestion de services, en fonction d'informations sur les equipements du reseau
JP2003244150A (ja) * 2002-02-15 2003-08-29 Toshiba Corp ネットワーク復旧システム
US7091846B2 (en) * 2002-03-18 2006-08-15 Siemens Communications, Inc. Methods and apparatus for handling information regarding an alarm for a communication network
FR2843260B1 (fr) 2002-07-31 2005-04-08 Cit Alcatel Systeme de gestion de reseau par regles comportant un moteur d'inference
JP2004164615A (ja) * 2002-10-11 2004-06-10 Seiko Epson Corp 作業担当者支援方法及び作業担当者支援プログラム
US7301909B2 (en) * 2002-12-20 2007-11-27 Compucom Systems, Inc. Trouble-ticket generation in network management environment
US7620848B1 (en) * 2003-11-25 2009-11-17 Cisco Technology, Inc. Method of diagnosing and repairing network devices based on scenarios
US7434099B2 (en) * 2004-06-21 2008-10-07 Spirent Communications Of Rockville, Inc. System and method for integrating multiple data sources into service-centric computer networking services diagnostic conclusions
US20060221913A1 (en) * 2005-03-31 2006-10-05 Adc Telecommunications, Inc. Integrated network management of a software defined radio system
US8068425B2 (en) * 2008-04-09 2011-11-29 Embarq Holdings Company, Llc System and method for using network performance information to determine improved measures of path states

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998024222A2 (en) * 1996-11-27 1998-06-04 Telefonaktiebolaget Lm Ericsson (Publ) Software fault management system
US6294991B1 (en) * 1998-09-08 2001-09-25 Mci Communications Corporation Method and system therefor for ensuring a true activation of distributed restoration in a telecommunications network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GARIJO M ET AL: "A MULTIAGENT SYSTEM FOR COOPERATIVE NETWORK-FAULT MANAGEMENT", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON THE PRACTICAL APPLICATION OF INTELLIGENT AGENTS AND MULTI-AGENT TECHNOLOGY, XX, XX, 22 April 1996 (1996-04-22), pages 279 - 294, XP002068055 *
JIANN-LIANG CHEN ET AL: "A fuzzy expert system for network fault management", SYSTEMS, MAN AND CYBERNETICS, 1996., IEEE INTERNATIONAL CONFERENCE ON BEIJING, CHINA 14-17 OCT. 1996, NEW YORK, NY, USA,IEEE, US, vol. 1, 14 October 1996 (1996-10-14), pages 328 - 331, XP010206647, ISBN: 0-7803-3280-6 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8213321B2 (en) 2007-02-01 2012-07-03 Deere & Company Controller area network condition monitoring and bus health on in-vehicle communications networks
US20080192119A1 (en) * 2007-02-14 2008-08-14 At&T Knowledge Ventures, Lp System and method of managing video content quality
US8839325B2 (en) * 2007-02-14 2014-09-16 At&T Intellectual Property I, L.P. System and method of managing video content quality

Also Published As

Publication number Publication date
FR2873879A1 (fr) 2006-02-03
JP2008508760A (ja) 2008-03-21
US8149718B2 (en) 2012-04-03
US20070260911A1 (en) 2007-11-08
CN101027872A (zh) 2007-08-29
FR2873879B1 (fr) 2006-10-27
EP1774705A1 (fr) 2007-04-18

Similar Documents

Publication Publication Date Title
EP1774705A1 (fr) Systeme de gestion de reseau de communication pour reparation automatique de pannes
EP1463238B1 (fr) Dispositif de gestion locale de procédés d'assurance pour un équipement de réseau de communications
EP0951155B1 (fr) Procédé et système d'administration de réseaux et de systèmes
US7434099B2 (en) System and method for integrating multiple data sources into service-centric computer networking services diagnostic conclusions
EP3087701B1 (fr) Procede de diagnostic de fonctions service dans un reseau ip
US11533215B2 (en) Programmable diagnosis model for correlation of network events
JP4509093B2 (ja) エンド・ツー・エンドのテスト及び診断マネジャ
US20220179726A1 (en) Failure impact analysis of network events
FR2809513A1 (fr) Controle de qualite de service, notamment de telecommunication
Harrington Guidelines for Considering Operations and Management of New Protocols and Protocol Extensions
FR2860369A1 (fr) Localisation de points d'entree de flux dans un reseau de communications
US11956116B2 (en) Programmable diagnosis model for correlation of network events
Bahnasse et al. Smart hybrid SDN approach for MPLS VPN management on digital environment
EP1401146A1 (fr) Dispositif et procédé de planification de configuration d'un réseau de communications par prévision d'évolution
Varga et al. Integration of service-level monitoring with fault management for end-to-end multi-provider ethernet services
EP1633081A1 (fr) DIispositif de diagnostic modulaire à base de connaissances évolutive, pour un réseau de communications
EP1635507B1 (fr) Dispositif de diagnostic à modèles de diagnostic adaptatifs, pour un réseau de communications
US11088928B2 (en) Service aware conditional path monitoring
Yu et al. A practical scheme for MPLS fault monitoring and alarm correlation in backbone networks
EP2472783B1 (fr) Procédé de selection de noeuds de bordure inter-domaines
Al-Fuqaha et al. Intelligent service monitoring and support
Feng et al. Detecting Service Disruptions in Large BGP/MPLS VPN Networks
Acharya et al. Presence based network topology tracing system for VoIP networks
Willm Administration de réseaux informatiques: protocole SNMP
Festor et al. Operations & Management: 12th International Workshop on Distributed Systems, DSOM 2001: proceedings

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2005787400

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2007523126

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 11572904

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 200580032326.1

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 2005787400

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 11572904

Country of ref document: US