CN112887135A - Method and device for multi-point fault recovery based on state PCE - Google Patents

Method and device for multi-point fault recovery based on state PCE Download PDF

Info

Publication number
CN112887135A
CN112887135A CN202110086658.9A CN202110086658A CN112887135A CN 112887135 A CN112887135 A CN 112887135A CN 202110086658 A CN202110086658 A CN 202110086658A CN 112887135 A CN112887135 A CN 112887135A
Authority
CN
China
Prior art keywords
recovery
fault
topology
route
link
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110086658.9A
Other languages
Chinese (zh)
Other versions
CN112887135B (en
Inventor
胡续坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fiberhome Telecommunication Technologies Co Ltd
Original Assignee
Fiberhome Telecommunication Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fiberhome Telecommunication Technologies Co Ltd filed Critical Fiberhome Telecommunication Technologies Co Ltd
Priority to CN202110086658.9A priority Critical patent/CN112887135B/en
Publication of CN112887135A publication Critical patent/CN112887135A/en
Application granted granted Critical
Publication of CN112887135B publication Critical patent/CN112887135B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a method and a device for multi-point fault recovery based on a state PCE (path computation element), wherein the state PCE simulates various different fault topologies for current service according to TE (traffic engineering) link topology and service information of the current service to perform simulation computation so as to obtain recovery routes corresponding to the different fault topologies; the state PCE respectively abstracts each TE link, each fault topology and each recovery route calculated by simulation, and saves the simulation result of the recovery route corresponding to each fault topology in an abstract form; when a TE link fault occurs in the original route of the current service, the state PCE updates the fault topology in real time and abstracts the fault topology, then searches a corresponding recovery route from the simulation result based on the abstracted real-time fault topology, and switches the current service to the recovery route. The scheme effectively saves the rerouting recovery time, improves the rerouting recovery success rate, and can reduce the processing pressure of the PCE server in real-time routing calculation.

Description

Method and device for multi-point fault recovery based on state PCE
Technical Field
The invention belongs to the technical field of control planes, and particularly relates to a method and a device for multi-point fault recovery based on a state PCE.
Background
With the rapid development of a ROADM (Reconfigurable Optical Add-Drop Multiplexer) all-Optical network, the network scale, nodes and traffic volume increase rapidly, and the efficiency and success rate of real-time calculation based on damage of a control plane become the bottleneck of stable operation of the existing network. At present, an operator requires 100% recovery for a single point fault of an engineering network, switching time requires S-level and is as short as possible, and requirements for fast multipoint faults and high recovery rate have been provided. In a TE (Traffic Engineering) link topology, a single point failure means that only one TE link fails in the entire topology, and a multipoint failure means that at least two TE links fail in the entire topology, and the failures occur in a sequence. Currently, the methods for performing TE link failure recovery mainly include the following two methods:
1) distributed control plane: the distributed control plane mechanism determines that the service switching control logic can only be at the first node, when a certain TE link fails, the failed node transmits the failure information to each service first node, and the services of different first nodes calculate the route at the same time, so that resource conflict cannot be avoided, resource preemption is easy to occur, and the service recovery rate is low. That is to say, the distributed control level lacks a global view and cannot sense real-time topology changes, so that the distributed control level can only perform fault simulation recovery for single-point faults of the whole network, and resource conflicts cannot be effectively avoided when multi-point faults occur.
Taking fig. 1 as an example, the full topology has 10 TE links, and assuming that three ordered faults occur (i.e. three-point fault, specifically, fiber breakage occurs in order in 3 out of 10 TE links), there are totally all three ordered faults (i.e. three-point faults, specifically, fiber breakage occurs in order in 10 TE links)
Figure BDA0002910980980000011
The fiber breaking combination mode is characterized in that each fiber breaking combination corresponds to one fault topology. The two same fiber breaking points have different failure topologies and different corresponding recovery routes due to the difference of fiber breaking sequences, and each fiber breaking combination corresponds to a corresponding recovery route of the service. For example, for the original traffic route A-B-E, if the link B-E fails, the corresponding restoration route is A-C-D-E. When only the link B-E of the whole network fails, the failure topology is shown in figure 1, the node A is used as a service first node and can refer to a route A-C-D-E to recover the service; when the link B-E fails first and the link C-D fails later, the route A-C-D-E can be referred to recover the service; however, when link C-D fails first and link B-E fails later, the failure topology is as shown in FIG. 2, sinceThe first node a of the distributed leveling service cannot sense the failure of the link C-D, and at this time, the a-C-D-E is still used as a recovery route, so that the recovery failure occurs. Therefore, the distributed control level can only process single-point faults of the whole network and cannot deal with multi-point faults.
2) Centralized PCE (Path Computation Element) computes routes in real time: the method can simulate and recover multi-point faults, but each time a fault occurs, the recovery route needs to be calculated in real time according to fault topology, the PCE server has high real-time route calculation pressure, and when a large-scale network is responded, the calculation efficiency of the recovery route is very low due to factors such as more services, complex topology, damage and the like, so that the service recovery time is influenced, and the rapid recovery of the fault cannot be realized.
In summary, the distributed control plane has a bottleneck that only a single point fault in the whole network can be processed, and cannot cope with a multi-point fault, and the centralized PCE real-time computation method has the problems of high computation pressure of the PCE server, low computation efficiency, and long service fault recovery time, so that a scheme for rapidly simulating and recovering the multi-point fault is urgently needed to be designed.
Disclosure of Invention
Aiming at the defects or improvement requirements in the prior art, the invention provides a scheme for multi-point fault recovery based on a state PCE (path computation element), aiming at realizing the rapid recovery of the multi-point fault through multi-point fault simulation, abstraction and pre-calculation, thereby solving the technical problems that the traditional scheme cannot cope with the multi-point fault and has large real-time calculation pressure, low efficiency and long service fault recovery time.
To achieve the above object, according to an aspect of the present invention, there is provided a method for multi-point failure recovery based on a state PCE, including:
the state PCE simulates various different fault topologies aiming at the current service according to the TE link topology and the service information of the current service to carry out simulation calculation so as to obtain recovery routes corresponding to the different fault topologies;
the state PCE respectively abstracts each TE link, each fault topology and each recovery route calculated by simulation, and saves the simulation result of the recovery route corresponding to each fault topology in an abstract form;
when a TE link fault occurs in the original route of the current service, the state PCE updates the fault topology in real time and abstracts the fault topology, then searches a corresponding recovery route from the simulation result based on the abstracted real-time fault topology, and switches the current service to the recovery route.
Preferably, after the switching the current service to the restoration route, the method further includes:
and performing single-wave detection on the current service based on the searched recovery route, if the single-wave detection is abnormal, restarting route calculation by the state PCE, and switching the current service to the recalculated recovery route.
Preferably, after the switching the current service to the recalculated restoration route, the method further includes:
and performing single-wave detection on the current service based on the recalculated recovery route, and if the single-wave detection is normal, updating the simulation result based on the recalculated recovery route.
Preferably, the state PCE abstracts each TE link, each fault topology, and each recovery route calculated by simulation, specifically:
performing unique digital coding on each TE link in the TE link topology to finish the abstraction of each TE link;
abstracting each fault topology by using a TE link abstraction result according to the fault times and the fault occurrence sequence contained in the fault topology;
and abstracting each recovery route calculated by simulation by taking the corresponding fault topology abstract result as a key word according to the number of TE links, the connection sequence of the TE links and the head and tail nodes of each TE link contained in the recovery route.
Preferably, for any fault topology, the corresponding fault topology abstraction result is (X1, X2.., Xn), which characterizes that n times of faults occur in the TE link topology, and the TE links corresponding to codes X1, X2.., Xn fail in sequence.
Preferably, when a TE link failure occurs in the original route of the current service, the PCC reports the TE link failure to the state PCE, so that the state PCE updates the failure topology in real time and abstracts the failure topology.
Preferably, when the corresponding restoration route is searched from the simulation result based on the abstracted real-time fault topology, if the search is successful, the current service is switched to the searched restoration route; and if the search fails, the state PCE initiates routing calculation, records the calculated recovery route, and updates the simulation result when waiting for simulation calculation.
Preferably, when the failure in the original route of the current service is recovered, the state PCE updates the failure topology in real time and clears the failure record, and simultaneously, the recovered route of the current service route is returned to the original route.
Preferably, when the topology of the TE link changes, the state PCE performs simulation computation again for various different fault topologies simulated by the current service to obtain recovery routes corresponding to the different fault topologies, and further updates the simulation result.
According to another aspect of the present invention, an apparatus for performing multi-point failure recovery based on a state PCE is provided, including the state PCE, where the state PCE includes at least one processor and a memory, where the at least one processor and the memory are connected through a data bus, and the memory stores an instruction executable by the at least one processor, and the instruction, after being executed by the processor, is configured to complete the method for performing multi-point failure recovery based on the state PCE according to the first aspect.
Generally, compared with the prior art, the technical scheme of the invention has the following beneficial effects: in the fault recovery method provided by the invention, before the fault occurs, the idle time of a CPU of a state PCE is fully utilized to simulate various different multi-point fault combinations for simulation pre-calculation, and a simulation result is abstracted, simplified and then stored; therefore, when a fault occurs, the simulation result stored based on pre-calculation can be quickly matched, the recovery route corresponding to the current fault is found, and then the service switching is completed. Therefore, the bottleneck of single-point failure can be broken through, and the failure recovery under the condition of multi-point failure is effectively solved; compared with real-time calculation, the fault simulation pre-calculation mode effectively saves the rerouting recovery time, improves the rerouting recovery success rate and can reduce the processing pressure of the PCE server in real-time routing calculation.
Drawings
Fig. 1 is a schematic diagram of a fault topology when a single point of fault occurs in a TE link topology according to an embodiment of the present invention;
fig. 2 is a fault topology diagram when two-point faults occur in a TE link topology according to an embodiment of the present invention;
fig. 3 is a flowchart of a method for performing multi-point failure recovery based on a state PCE according to an embodiment of the present invention;
fig. 4 is a TE link topology diagram provided by an embodiment of the present invention;
fig. 5 is an abstract process code diagram of TE links, fault topology, and restoration routes provided by an embodiment of the present invention;
fig. 6 is a topology diagram for performing two-point failure recovery based on a state PCE according to an embodiment of the present invention;
fig. 7 is a schematic topology diagram when performing a recovery route switching after a single-wave detection anomaly is provided in an embodiment of the present invention;
fig. 8 is a complete flowchart of multi-point failure recovery based on a state PCE according to an embodiment of the present invention;
FIG. 9 is a logic diagram of a conventional scheme for single point of failure recovery based on a distributed control plane;
FIG. 10 is a logical diagram of a state PCE based multi-point failure recovery provided by an embodiment of the present invention;
fig. 11 is a block diagram of an apparatus architecture for performing multi-point failure recovery based on a state PCE according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
Example 1
In order to solve the technical problems that the conventional scheme cannot cope with multi-point failures and has high real-time computation pressure, low efficiency and long service failure recovery time, an embodiment of the present invention provides a method for performing multi-point failure recovery based on a state PCE, as shown in fig. 3, the method mainly includes the following steps (the complete process may refer to fig. 8):
and step 10, simulating and calculating by the state PCE aiming at various different fault topologies simulated by the current service according to the TE link topology and the service information of the current service to obtain recovery routes corresponding to the different fault topologies.
Firstly, PCC (Path Computation Clients) reports service information of TE link topology and current service to a state PCE, where the service information mainly refers to an original route of the service and represents a service trend; taking the TE link topology shown in fig. 4 as an example, the service information of the service a-E mainly refers to the original route of the service going to the service a-B-E. Then, having the complete TE link topology and traffic information, next, the fault simulation and emulation calculations can be performed for the current traffic. Because the simulation of multi-point faults is carried out, the number of faults (namely the number of fault links) and the fault occurrence sequence in the whole TE link topology are all factors to be considered, so that fault simulation of various fiber breaking combinations (corresponding to various different fault topologies) is required, the pre-calculation result is also mass data, but because the simulation calculation is carried out before the faults rather than the real-time calculation when the faults occur, the CPU idle calculation force of the state PCE can be fully utilized, and the processing pressure of the real-time calculation of the PCE is reduced.
Specifically, if the whole TE link topology has M total TE links, it is necessary to simulate the occurrence of 1 time, 2 times, and 3 times in the whole TE link topology respectivelyFault topology at M faults; when the number of failures is M (M is 1,2, 3.., M), that is, when M-point failures are simulated, the failure occurrence sequence is different, and the total number of failures is M
Figure BDA0002910980980000061
Different fault topologies are contemplated. If each service is subjected to fault simulation, for the current service, when m faults occur in the simulation, at least one fault is required to occur on the original route of the current service, that is, at least one TE link is required to fail on the original route of the current service, and only then the current service has a service switching requirement. Still taking the TE link topology shown in fig. 4 as an example, the original route of the current traffic a-E is a-B-E, i.e., the original route includes link a-B and link B-E. For the service A-E, if single-point fault simulation is carried out, only one link in the whole topology is in fault, and the fault link is a link A-B or a link B-E; if two-point fault simulation is carried out, only two links in the whole topology have faults, and at least one fault link is a link A-B or a link B-E; if three-point fault simulation is carried out, only three links in the whole topology have faults, and at least one fault link is a link A-B or a link B-E; and so on. For each fault topology, obtaining a corresponding recovery route through simulation calculation; in a specific embodiment, when a link B-E only fails in the whole network, a corresponding recovery route obtained under the failure topology is a-C-D-E, as shown in fig. 1; when the link C-D fails first and the link B-E fails later in the whole network, the corresponding restoration route obtained under the failure topology is a-B-D-E, as shown in fig. 2.
And step 20, the state PCE abstracts each TE link, each fault topology and each recovery route calculated by simulation respectively, and saves the simulation result of the recovery route corresponding to each fault topology in an abstract form.
Through abstraction processing, dynamic fault topology can be materialized, simulation results are further simplified and stored, and maintenance and management are facilitated. The whole abstraction process can refer to the code implementation shown in fig. 5, and the concrete method is as follows:
1) and performing unique digital coding on each TE link in the TE link topology to finish the abstraction of each TE link. Taking the TE link topology shown in fig. 4 as an example, the full topology has 10 TE links, so 10 TE links can be encoded by numbers 1-10 respectively; of course, random number encoding may also be employed. For example, in one particular embodiment, link A-C is encoded as 1, link A-B is encoded as 2, link B-D is encoded as 3, link D-G is randomly encoded as 12, and link F-G is randomly encoded as 17, as shown in FIG. 4; the codes of the other links are not described in detail.
2) And abstracting each fault topology by using the TE link abstraction result according to the fault times and the fault occurrence sequence contained in the fault topology. Specifically, for any fault topology, the corresponding abstract result is (X1, X2,. and Xn), which characterizes that n times of faults occur in the TE link topology, and the TE links corresponding to the codes X1, X2,. and Xn sequentially fail in sequence. Taking the TE link topology shown in fig. 4 as an example, (3,1,12) indicates that three fiber-broken faults occur in the current topology, that is, three links have faults, and the three links are, in order, link B-D, link a-C, and link D-G according to the sequence of the faults.
3) And abstracting each recovery route calculated by simulation by taking the corresponding fault topology abstract result as a key word and ignoring contents such as a protocol according to the number of TE links, the connection sequence of the TE links and the head-tail nodes of each TE link, wherein the TE links, the connection sequence of the TE links and the head-tail nodes of each TE link are contained in the recovery route, and the abstract result contains relay information of the recovery route. Taking the TE link topology shown in fig. 4 as an example, (17,2,1) represents that fiber-breaking faults occur in the links F-G, A-B and a-C in the current topology in turn, assuming that the current service is service F-B, and the recovery route obtained under the (17,2,1) fault topology is service F-C-D-B, the corresponding recovery route abstraction result can refer to fig. 5, and the head-tail node information of each link in the recovery route F-C-D-B is recorded in turn, so that the recovery route can be directly restored based on the abstraction result.
After abstraction is completed according to the method, the mapping relation between each fault topology abstract result and the corresponding recovery route abstract result can be directly stored in a key value pair mode, namely, the recovery route simulation results corresponding to various different fault topologies are stored, and therefore a rich sample base is provided for subsequent real-time fault matching.
And step 30, when a TE link fault occurs in the original route of the current service, the state PCE updates and abstracts the fault topology in real time, further searches a corresponding recovery route from the simulation result based on the abstracted real-time fault topology, and switches the current service to the recovery route.
When a failure topology is changed, two situations of failure occurrence and failure recovery generally exist, and the state PCE may perform corresponding operations according to the failure occurrence or recovery situation, specifically as follows:
1) when a TE link fault occurs in the original route of the current service, the PCC can report the fault to the state PCE in real time, so that the state PCE can update the fault topology in real time strictly according to the fault occurrence sequence and abstract the current fault topology; and then searching a recovery route abstract result corresponding to the current fault topology abstract result from the simulation result stored in the pre-calculation mode. If the search is successful, the corresponding recovery route is obtained, and the current service can be directly switched to the searched recovery route from the original route to finish the service transmission; if the search fails, it is proved that the simulation result does not have a record corresponding to the current fault topology abstract result, and a fault is likely to occur under the condition that the simulation is not completed, the state PCE initiates routing computation first, records the computed recovery route, and updates the simulation result when waiting for the simulation computation. When the subsequent simulation calculation is performed to the multipoint fault corresponding to the current fault topology, the corresponding key value pair is directly stored in the simulation result without repeating the simulation calculation for the multipoint fault, that is, the simulation result is updated as mentioned above.
2) When the failure in the original route of the current service is recovered, the PCC can report the failure recovery message to the state PCE in real time, so that the state PCE updates the failure topology in real time and clears the failure record, and meanwhile, the recovery route of the current service route is returned to the original route again in consideration of the fact that the original route has better performance compared with the recovery route.
Taking fig. 6 as an example, assuming that an original service route is a-B-E, if the PCC reports "link C-D fails first and link B-E fails later" to the state PCE at present, the state PCE may find a corresponding restoration route as a-B-D-E from a simulation result, and thus switch the service a-E from the original route a-B-E to the restoration route a-B-D-E, as shown in fig. 6; and if the subsequent PCC reports the message of 'failure recovery' to the state PCE, the state PCE returns the service A-E to the original route A-B-E again. If the current PCC reports 'failure after link B-E' to the state PCE, and the state PCE can search and find a corresponding recovery route as A-C-D-E from a simulation result, the service A-E is switched from the original route A-B-E to the recovery route A-C-D-E; similarly, if the subsequent PCC reports a failure recovery message to the state PCE, the state PCE returns the service a-E to the original route a-B-E again. Because the pre-calculation result is mass data, a rich sample library is provided, and the simulation result is stored in an abstract form for maintenance and management; therefore, when a fault occurs, the rapid matching of massive recovery routes can be realized according to the current fault topology abstract result, and the service issuing is completed, so that the service is rapidly and accurately recovered.
Further, to ensure the stability after the service is recovered, after the current service is switched to the recovery route, that is, after the step 30, referring to fig. 3, the method further includes:
step 40, based on the searched recovery route, performing single wave detection on the current service, if the single wave detection is abnormal, the state PCE reinitiates the route calculation, and switches the current service to the recalculated recovery route.
As shown in fig. 7, the original service route is a-B-E, and when the link C-D fails first and the link B-E fails later, the service a-E is switched to the restoration route a-B-D-E according to the simulation result. At this time, single wave detection is carried out based on the recovery route A-B-D-E, single wave abnormality exists on the link D-E, and the fact that the situation that the service is unstable occurs if the service is switched to the A-B-D-E is proved, therefore, the state PCE initiates routing calculation again, new recovery route A-B-D-G-E is obtained through calculation, and the service is further switched to the route A-B-D-G-E from the route A-B-D-E.
And step 50, performing single-wave detection on the current service based on the recalculated recovery route, and if the single-wave detection is normal, updating the simulation result based on the recalculated recovery route.
Continuing to take fig. 7 as an example, at this time, the service a-E has been switched to the recalculated restoration route a-B-D-G-E, and then the single-wave detection is performed based on the restoration route a-B-D-G-E; assuming that the single-wave detection is normal, the service is proved to be stable on the new recovery route A-B-D-G-E, at the moment, the recovery route can be fed back to the state PCE, the state PCE learns and records the new recovery route A-B-D-G-E, the original stored recovery route A-B-D-E in the simulation result is replaced, the mapping relation between the fault topology of 'the first fault of the link C-D and the later fault of the link B-E' and the recovery route A-B-D-G-E is formed, and the incremental updating of the recovery route is realized.
Further, when the TE link topology changes, for example, a new TE link is added or a certain TE link is deleted, the state PCE performs simulation calculation again for various different fault topologies simulated by the current service to obtain recovery routes corresponding to different fault topologies, and further updates the simulation result, and the specific simulation process is not repeated.
The overall scheme idea of performing fault recovery by using a distributed control plane is shown in fig. 9, and only a single-point fault can be dealt with, the single-point fault is reported to a service head node when the single-point fault occurs, fault recovery is performed according to a single-point simulation result, and a path is automatically calculated when the fault recovery fails. The TE link, the fault topology and the recovery route abstraction are introduced into the fault recovery method provided by the invention, the overall scheme idea is shown in FIG. 10, and a mode of fault simulation precomputation and real-time calculation is adopted, so that compared with a distributed mode which can only support single-node fault simulation recovery, the fault recovery method can abstract according to the real-time change condition of the fault topology, further accurately and quickly match out the corresponding recovery route, and realize quick service recovery during multipoint fault; and meanwhile, a single-wave detection function and a feedback mechanism are provided, the state PCE actively learns a single-wave optimization result, updates a recovery route corresponding to the fault, updates a fault simulation result according to the optimized result increment, fully utilizes PCE idle calculation force, optimizes the switching success rate and switching time of the multi-point fault, and improves the operation stability of the project.
In summary, the fault recovery method provided by the embodiment of the invention has the following beneficial effects:
before a fault occurs, simulating various different multi-point fault combinations to perform simulation pre-calculation, abstracting a TE link, a fault topology and a recovery route, effectively and accurately describing the recovery route corresponding to the fault topology, solving the bottleneck that distributed control can only process single-point faults of the whole network, effectively dealing with the multi-point faults, improving the success rate of rerouting recovery and reducing the processing pressure of real-time routing calculation of a state PCE;
after receiving the fault report, the state PCE can abstract according to the real-time fault topology and search the simulation calculation result to realize rapid route recovery, thereby effectively reducing the service recovery time and ensuring that the rerouting recovery time is possible in S level;
before a fault occurs, the CPU idle computing power of the state PCE is fully utilized to carry out simulation computation, the requirement on the performance of a master control panel of a large-scale network is lowered, and the equipment cost can be remarkably lowered;
the fault simulation result can be provided for operation and maintenance personnel in advance, and the fixed recovery path is more convenient for engineering operation and maintenance management compared with a random calculation result.
Example 2
On the basis of the method for performing multi-point failure recovery based on a state PCE provided in embodiment 1, the present invention further provides an apparatus for performing multi-point failure recovery based on a state PCE, which is capable of implementing the above method, as shown in fig. 11, which is an architecture diagram of the apparatus in the embodiment of the present invention. The apparatus for multi-point failure recovery based on a state PCE of the present embodiment includes one or more processors 21 and a memory 22. In fig. 11, one processor 21 is taken as an example.
The processor 21 and the memory 22 may be connected by a bus or other means, and fig. 11 illustrates the connection by a bus as an example.
The memory 22, which is a non-volatile computer-readable storage medium for a method for multi-point failure recovery based on a state PCE, may be used to store non-volatile software programs, non-volatile computer-executable programs, and modules, such as the method for multi-point failure recovery based on a state PCE in embodiment 1. The processor 21 executes various functional applications and data processing of the apparatus for multi-point failure recovery based on a state PCE by running a nonvolatile software program, instructions and modules stored in the memory 22, that is, implements the method for multi-point failure recovery based on a state PCE of embodiment 1.
The memory 22 may include high speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, the memory 22 may optionally include memory located remotely from the processor 21, and these remote memories may be connected to the processor 21 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The program instructions/modules are stored in the memory 22 and, when executed by the one or more processors 21, perform the method for multi-point failure recovery based on a state PCE in embodiment 1 above, for example, perform the various steps shown in fig. 1 and 8 described above.
Those of ordinary skill in the art will appreciate that all or part of the steps of the various methods of the embodiments may be implemented by associated hardware as instructed by a program, which may be stored on a computer-readable storage medium, which may include: read Only Memory (ROM), Random Access Memory (RAM), magnetic or optical disks, and the like.
It will be understood by those skilled in the art that the foregoing is only a preferred embodiment of the present invention, and is not intended to limit the invention, and that any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (10)

1. A method for multi-point fault recovery based on a state PCE is characterized by comprising the following steps:
the state PCE simulates various different fault topologies aiming at the current service according to the TE link topology and the service information of the current service to carry out simulation calculation so as to obtain recovery routes corresponding to the different fault topologies;
the state PCE respectively abstracts each TE link, each fault topology and each recovery route calculated by simulation, and saves the simulation result of the recovery route corresponding to each fault topology in an abstract form;
when a TE link fault occurs in the original route of the current service, the state PCE updates the fault topology in real time and abstracts the fault topology, then searches a corresponding recovery route from the simulation result based on the abstracted real-time fault topology, and switches the current service to the recovery route.
2. The method for multi-point failure recovery based on state PCE of claim 1, wherein after the switching of the current traffic onto the recovery route, the method further comprises:
and performing single-wave detection on the current service based on the searched recovery route, if the single-wave detection is abnormal, restarting route calculation by the state PCE, and switching the current service to the recalculated recovery route.
3. The method for multi-point failure recovery based on a state PCE of claim 2, wherein after the switching of the current traffic onto the recalculated recovery route, the method further comprises:
and performing single-wave detection on the current service based on the recalculated recovery route, and if the single-wave detection is normal, updating the simulation result based on the recalculated recovery route.
4. The method for multi-point failure recovery based on a state PCE of claim 1, wherein the state PCE abstracts each TE link, each failure topology, and each recovery route calculated by simulation, specifically:
performing unique digital coding on each TE link in the TE link topology to finish the abstraction of each TE link;
abstracting each fault topology by using a TE link abstraction result according to the fault times and the fault occurrence sequence contained in the fault topology;
and abstracting each recovery route calculated by simulation by taking the corresponding fault topology abstract result as a key word according to the number of TE links, the connection sequence of the TE links and the head and tail nodes of each TE link contained in the recovery route.
5. The method of claim 4, wherein for any failure topology, the corresponding abstract result of the failure topology is (X1, X2.., Xn), which characterizes n failures in the TE link topology, and the TE links corresponding to the codes X1, X2.., Xn are failed in sequence.
6. The method for multi-point failure recovery based on a state PCE of claim 1, wherein when a TE link failure occurs in an original route of the current service, the PCC reports the TE link failure to the state PCE, so that the state PCE updates a failure topology in real time and abstracts the failure topology.
7. The method for multi-point failure recovery based on state PCE of claim 1, wherein when the real-time failure topology based on abstraction searches for a corresponding recovery route from the simulation result, if the search is successful, the current service is switched to the searched recovery route; and if the search fails, the state PCE initiates routing calculation, records the calculated recovery route, and updates the simulation result when waiting for simulation calculation.
8. The method for multi-point failure recovery based on a state PCE of any of claims 1-7, wherein when a failure in the original route of the current traffic is recovered, the state PCE updates the failure topology and clears the failure record in real time, while returning the recovered route of the current traffic route back to the original route.
9. The method for multi-point failure recovery based on a state PCE of any of claims 1-7, wherein when the topology change occurs in the TE link topology, the state PCE performs simulation computation again for various different failure topologies simulated by the current service to obtain recovery routes corresponding to the different failure topologies, and further updates the simulation result.
10. An apparatus for performing multi-point failure recovery based on a state PCE, comprising the state PCE, wherein the state PCE comprises at least one processor and a memory, the at least one processor and the memory are connected by a data bus, and the memory stores an instruction executable by the at least one processor, and the instruction is used for completing the multi-point failure recovery based on the state PCE according to any one of claims 1 to 9 after being executed by the processor.
CN202110086658.9A 2021-01-22 2021-01-22 Method and device for multi-point fault recovery based on state PCE Active CN112887135B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110086658.9A CN112887135B (en) 2021-01-22 2021-01-22 Method and device for multi-point fault recovery based on state PCE

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110086658.9A CN112887135B (en) 2021-01-22 2021-01-22 Method and device for multi-point fault recovery based on state PCE

Publications (2)

Publication Number Publication Date
CN112887135A true CN112887135A (en) 2021-06-01
CN112887135B CN112887135B (en) 2022-11-08

Family

ID=76050116

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110086658.9A Active CN112887135B (en) 2021-01-22 2021-01-22 Method and device for multi-point fault recovery based on state PCE

Country Status (1)

Country Link
CN (1) CN112887135B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113286208A (en) * 2021-06-15 2021-08-20 烽火通信科技股份有限公司 Passive fault recovery method and system in resource shortage
WO2023015897A1 (en) * 2021-08-13 2023-02-16 烽火通信科技股份有限公司 Intelligent control method, apparatus and system for optical network
CN116016202A (en) * 2022-08-05 2023-04-25 烽火通信科技股份有限公司 Network fault simulation method and system based on twin PCE
CN116155701A (en) * 2023-02-11 2023-05-23 北京天弛网络有限公司 Method, device, equipment and readable storage medium for calculating service optimal path

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101645847A (en) * 2009-08-25 2010-02-10 中兴通讯股份有限公司 Method and system for improving service resumption speed in transmission network
CN105933174A (en) * 2016-07-12 2016-09-07 重庆邮电大学 Pre-calculating recovery method based on distance self-adaptive route and spectrum distribution
US20170331706A1 (en) * 2014-12-05 2017-11-16 Zte Corporation Method and Device for Implementing Capacity Planning
CN110391977A (en) * 2018-04-18 2019-10-29 中兴通讯股份有限公司 A kind of method, system and the storage medium of network failure protection
CN111147287A (en) * 2019-12-10 2020-05-12 网络通信与安全紫金山实验室 Network simulation method and system in SDN scene

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101645847A (en) * 2009-08-25 2010-02-10 中兴通讯股份有限公司 Method and system for improving service resumption speed in transmission network
US20170331706A1 (en) * 2014-12-05 2017-11-16 Zte Corporation Method and Device for Implementing Capacity Planning
CN105933174A (en) * 2016-07-12 2016-09-07 重庆邮电大学 Pre-calculating recovery method based on distance self-adaptive route and spectrum distribution
CN110391977A (en) * 2018-04-18 2019-10-29 中兴通讯股份有限公司 A kind of method, system and the storage medium of network failure protection
CN111147287A (en) * 2019-12-10 2020-05-12 网络通信与安全紫金山实验室 Network simulation method and system in SDN scene

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113286208A (en) * 2021-06-15 2021-08-20 烽火通信科技股份有限公司 Passive fault recovery method and system in resource shortage
CN113286208B (en) * 2021-06-15 2022-04-26 烽火通信科技股份有限公司 Passive fault recovery method and system in resource shortage
WO2023015897A1 (en) * 2021-08-13 2023-02-16 烽火通信科技股份有限公司 Intelligent control method, apparatus and system for optical network
CN116016202A (en) * 2022-08-05 2023-04-25 烽火通信科技股份有限公司 Network fault simulation method and system based on twin PCE
CN116016202B (en) * 2022-08-05 2024-04-26 烽火通信科技股份有限公司 Network fault simulation method and system based on twin PCE
CN116155701A (en) * 2023-02-11 2023-05-23 北京天弛网络有限公司 Method, device, equipment and readable storage medium for calculating service optimal path

Also Published As

Publication number Publication date
CN112887135B (en) 2022-11-08

Similar Documents

Publication Publication Date Title
CN112887135B (en) Method and device for multi-point fault recovery based on state PCE
US7518986B1 (en) Push-based hierarchical state propagation within a multi-chassis network device
US6757242B1 (en) System and multi-thread method to manage a fault tolerant computer switching cluster using a spanning tree
US8018844B2 (en) Reliable message transfer over an unreliable network
WO2014210501A1 (en) Asynchronous message passing for large graph clustering
CN111314802B (en) Optical fiber cutting method and device, SDN controller, system and storage medium
CN112887148B (en) Method and device for simulating and predicting network flow
Sutra et al. Fast genuine generalized consensus
CN105072194A (en) Structure and method for recovering stored data in distributed file system
CN113505021B (en) Fault tolerance method and system based on multi-master-node master-slave distributed architecture
CN114826466A (en) Selection and switching method of redundant master clock
CN101026562A (en) Method and device for selecting reserve LPS
Zhang et al. Adaptive failure prediction using long short-term memory in optical network
Stidsen et al. Optimal routing with failure‐independent path protection
JP2012015837A (en) Path calculation device, data transfer device, path calculation method, data transfer method and program
CN113268382B (en) Method and device for switching fragment nodes in block chain system
CN101459551B (en) Method, apparatus for path recovery and path computing unit
CN108667508B (en) Shared link risk group generation method and device
CN105571604A (en) Co-evolved route optimizing method in dynamic road network environment
CN112866052A (en) Network link state detection method, and test matrix training method and device
Stidsen et al. Complete rerouting protection
CN116016202B (en) Network fault simulation method and system based on twin PCE
CN112054921B (en) Network verification method and device, computer storage medium and electronic equipment
CN116909759B (en) Information processing method, device and equipment, medium and controller cluster
CN116319626A (en) Distributed controller data synchronization method based on shortest path tree

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant