WO2008119294A1 - Procédé et matériel de restauration du commerce en réseau - Google Patents

Procédé et matériel de restauration du commerce en réseau Download PDF

Info

Publication number
WO2008119294A1
WO2008119294A1 PCT/CN2008/070589 CN2008070589W WO2008119294A1 WO 2008119294 A1 WO2008119294 A1 WO 2008119294A1 CN 2008070589 W CN2008070589 W CN 2008070589W WO 2008119294 A1 WO2008119294 A1 WO 2008119294A1
Authority
WO
WIPO (PCT)
Prior art keywords
service layer
node
tunnel
end node
determining
Prior art date
Application number
PCT/CN2008/070589
Other languages
English (en)
French (fr)
Inventor
Peng Zheng
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Publication of WO2008119294A1 publication Critical patent/WO2008119294A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/40Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass for recovering from a failure of a protocol instance or entity, e.g. service redundancy protocols, protocol state redundancy or protocol service redirection

Definitions

  • the present invention relates to the field of network communication technologies, and in particular, to a method and apparatus for recovering network services. Background technique
  • TDM+WDM and TDM different particles such as VC12 and VC4 of Synchronous Digital Hierarchy.
  • the network called the bearer location is the service layer
  • the upper layer network relative to the service layer is called the client layer.
  • the services carried in these networks have great similarities in configuration and dynamic recovery.
  • the service layer generally has larger particles, and the response to network failures is relatively direct and fast.
  • the recovery efficiency of faults is high, but the end-to-end flexibility is poor.
  • the client layer generally has smaller particles and a larger number. When the fault occurs, the impact surface is wider, the fault recovery efficiency is lower, but the end-to-end flexibility is higher.
  • FIG. 5 it is a schematic diagram of a customer-area label switching path (LSP) of a small-grained service layer tunnel, which is an effective way to improve resource utilization and overall network recovery efficiency.
  • LSP customer-area label switching path
  • Resilience but also to ensure that the network is less turbulent in the event of a failure.
  • the above objectives are contradictory in many cases. Only through the coordination and compromise of different layers of tunnel scheduling can the final satisfactory effect be achieved.
  • the bottom-up recovery method is used. After determining the fault information in the fault notification, the service layer tunnel is first restored.
  • the client layer is directly restored. Therefore, quickly determining whether the service layer tunnel can be successfully restored is an important issue related to service recovery performance. For example, if the VC12 LSP is used to carry the VC12 LSP, the VC12 LSP needs to be successfully restored to improve the recovery performance of the VC12 LSP. The recovery performance plays an important role.
  • the client layer signaling timeout wait method is used to determine whether the service layer tunnel can be successfully recovered, thereby determining whether the client layer performs recovery. If the service layer tunnel can be restored normally, the client layer service is restored after the service layer tunnel is restored. Otherwise, the client layer waits until the recovery timer expires, and initiates end-to-end recovery of the client layer service. At this time, the client layer may Re-select other service layer tunnels as your own business bearer layer.
  • the embodiment of the invention provides a method and a device for restoring network services, which can shorten the overall recovery time of the customer service and meet the requirements of the network service for recovery performance.
  • a method for restoring a network service which is provided by the embodiment of the present invention, includes:
  • Receiving a failure notification determining whether the tunnel end node of the service layer where the fault is located is invalid; if the tunnel end node of the service layer fails, performing recovery of the client layer tunnel; if the tunnel end node of the service layer does not fail, performing service Recovery of layer tunnels.
  • An embodiment of the present invention further provides an apparatus for recovering network services, including a determining module and a fault recovery module;
  • the determining module is configured to receive a fault notification, determine whether the tunnel end node of the service layer where the fault is located is invalid, and send the determination result to the fault recovery module;
  • the fault recovery module is configured to receive the judgment result; if the judgment result is that the tunnel end node of the service layer is invalid, perform recovery of the client layer tunnel; if the judgment result is a tunnel of the service layer If the last node does not fail, the service layer tunnel is restored.
  • the bottom-up tunnel recovery strategy is combined with whether the last node is invalid, to determine whether the service layer tunnel is restored or the client service is directly restored, thereby ensuring the overall recovery efficiency of the network and ensuring The superior recovery performance of a single service in various situations.
  • FIG. 1 is a flowchart of a method for restoring a network service according to Embodiment 1 of the present invention
  • FIG. 2 is a schematic diagram of a VC-4 tunnel carrying a VC-12 LSP according to Embodiment 1 of the present invention
  • a structural diagram of a device for recovering network services
  • FIG. 4 is a structural diagram of an apparatus for restoring a network service in Embodiment 3 of the present invention.
  • FIG. 5 is a schematic diagram of a prior art segmentation service layer tunnel carrying small granular client layer LSPs. detailed description
  • the recovery timer is started, and the timer timeout period is generally set to the second level. If the service layer tunnel can be restored normally, the client layer service is restored after the service layer tunnel is restored. Otherwise, the client layer waits until the recovery timer expires, and initiates end-to-end recovery of the client layer service. At this time, the client layer may Re-select other service layer tunnels as your own business bearer layer. However, the recovery result of the service layer is determined by the method of timeout waiting. In the case where the service node tunnel end node fails, the first and last nodes are the source and sink of the local layer tunnel signaling, and the signaling occurs when the source and sink fail.
  • the embodiments of the present invention provide a method and device for restoring network services, which can shorten the overall recovery time of the customer service and meet the requirements of the network service for recovery performance.
  • the client layer is carried on the segmentation tunnel.
  • the service layer tunnel head node receives the failure notification, if the terminal node of the service layer is determined to be invalid, the first node of the service layer tunnel notifies the client service. The first node, to make it self-recovery; if it is determined that the end node of the service layer has not failed, the client layer service will not perform any action, waiting for the service layer to recover.
  • an embodiment of the present invention discloses a method for restoring a network service, which specifically includes the following steps:
  • Step 101 When any fault occurs in the optical fiber, the corresponding service layer tunnel head node and the client layer LSP head node receive the fault notification, and the fault notification includes the node identifier and the port identifier where the alarm occurs. At this point, the client layer LSP does not perform any action, waiting for the first node of the service layer tunnel to first locate the fault location.
  • Step 102 The service layer tunnel first node that receives the fault notification traverses the path resource information of all service layer tunnels associated with the node and port where the alarm occurs.
  • Step 103 Determine, according to the path resource information traversed by the first node of the service layer tunnel, whether the port on which the alarm is generated is an egress port of the upstream node of the end node of the service layer tunnel. If yes, go to step 104; otherwise, go to step 105.
  • Step 104 The port where the alarm occurs is the egress port of the upstream node of the service node tunnel end node, so it is determined that the end node of the service layer tunnel may be invalid, and further determination is needed whether the end node is reachable.
  • the route query timer Since it is necessary to wait for the fault state route to be spread before determining whether the last node is reachable, the route query timer is started, and the timing is 50 milliseconds. The reason why the timer is used to wait for the fault state to complete the flooding is to confirm that the node is invalid when all the ports of the node have an alarm. The route query timer may not be started. As long as there is a port alarm, the step of determining whether the last node is reachable is directly performed.
  • step 105 is performed. If the last node is unreachable, step 106 is performed.
  • the last node is reachable, you can use the first and last nodes of each tunnel as the source and sink, and perform route calculation in the traffic engineering (TE, Traffic Engineering) database. If the end node is reachable, if the route cannot be calculated, the calculation fails, indicating that the last node is unreachable, and it can be determined that the last node has failed.
  • TE Traffic Engineering
  • the BFD (Bidirecation Forwarding Detection) protocol can be used to determine whether the end node is reachable.
  • a Hello (handshake) message is sent between the first and last nodes (for example, 10 milliseconds) to determine the working state of the interface between the two points. If the first node does not receive the handshake message of the last node for a long time, it indicates the end. If the node is not working properly, it can be determined that the last node is unreachable and has failed.
  • This method is more common in a pure IP network. In a network that can report port faults in time, the state of the port can be directly determined. Therefore, the BFD protocol timing status query protocol is not used to detect the fault, but the BFD is used.
  • the protocol method can also achieve the same effect of judging the failure of the end node.
  • Step 105 The tunnel first node initiates recovery of the service layer tunnel.
  • Step 106 If the last node is unreachable, the last node is invalid, and the service layer tunnel cannot be recovered. At this time, the first node of all client services carried by the tunnel is searched, and an announcement of the underlying recovery failure is sent to the first nodes. The first node performs recovery of the client layer tunnel separately.
  • the following is an example of the present invention by taking as an example a plurality of VC-12 LSPs are carried on a plurality of segmented VC-4 tunnels.
  • the client layer is two VC-12 LSPs: VC-12 LSP A and VC-12 LSP B, and the service layer is three segmented VC-4 tunnels: FA1, FA2, and FA3.
  • the above two VC-12 LSPs are carried on three VC-4 tunnels.
  • 1, 2, 3, 4, and 5 are nodes.
  • the embodiments of the present invention are specifically described by taking faults at three points A, B, and C as an example.
  • node 2 When a fiber breakage fault occurs at node A, node 2 receives the fault notification.
  • the fault notification includes: The alarm node is node 2, and the port is A.
  • the node 2 searches for the HOP information of the VC-4 tunnel with the node as the first node, and finds that the port A is not the egress of the upstream node 3 of the end node 4 of the tunnel, and immediately initiates the rerouting recovery of the tunnel FA2.
  • the 50 millisecond timer is started first. After the timer expires, the TE database is searched for nodes 2 and 4. The route calculation is performed for the source and sink nodes. If the route is calculated, the node 4 is reachable, not the failed node, and the re-route recovery of the tunnel FA2 is immediately initiated.
  • node 2 searches for the first node of each upper layer VC-12 LSP passing through the node, in this example, corresponding to VC-12 LSP A and VC-12 LSP B.
  • the first node is node 2 and node 1, respectively, and then informs node 1 and node 2 to initiate LSP A and LSP B rerouting recovery through the remote notification mechanism.
  • node 2 receives the fault notification.
  • the fault notification includes: The alarm node is node 4 and the port is C.
  • Node 2 searches for the HOP information of the VC-4 tunnel with the node as the head node, and finds that port C is the ingress port of the last node 4 of the tunnel. If the end node is not invalid, the tunnel FA2 re-route recovery is initiated immediately.
  • an embodiment of the present invention further provides an apparatus for recovering network services, including a determining module 201 and a fault recovery module 202.
  • the judging module 201 is configured to determine, after receiving the fault notification, whether the service layer tunnel end node related to the fault is invalid, and send the judgment result to the fault recovery module 202;
  • the fault recovery module 202 is configured to receive a judgment result sent by the determining module, if the service layer tunnel end node fails, perform a client layer tunnel recovery; if the service layer tunnel end node does not fail, perform the good Recovery of the service layer tunnel.
  • the determining module 201 may specifically include an upstream node determining unit 201a, a waiting unit 201c, and a last node reachability determining unit 201b;
  • the upstream node determining unit 201a is configured to determine whether the port where the alarm is generated is an egress port of the upstream node of the tunnel end node, and if yes, send the determination result to the waiting unit 201c, if not, the service layer tunnel The last node has not expired;
  • the waiting unit 201c is configured to wait for the fault state route to be diffused, and send the signal that has been waited for to the last node reachability determining unit 201b;
  • the terminal node reachability determining unit 201b is configured to: after receiving the signal that the waiting unit 201c sends the waiting signal, determine whether the last node of the service layer tunnel is reachable, if the end node of the service layer tunnel is unreachable, The end node of the service layer tunnel is invalid. If the end node of the service layer tunnel is reachable, otherwise the end node of the tunnel of the service layer is not invalid.
  • the last node reachability determining unit 201b includes a route calculation determining unit, configured to perform routing calculation in the traffic engineering TE database by using the first and last nodes of the service layer tunnel as a source and a sink, and if the route can be calculated, The end node of the service layer tunnel is reachable. If the route cannot be calculated, the end node of the tunnel of the service layer is unreachable.
  • the terminal node reachability determining unit 201b may also include a timing query unit, configured to send a Hello message to the service layer tunnel end node periodically by running the BFD protocol, if the service layer tunnel head node is set. If the handshake message of the last node of the service layer tunnel is not received within the time, it is determined that the end node of the service layer tunnel is unreachable.
  • a timing query unit configured to send a Hello message to the service layer tunnel end node periodically by running the BFD protocol, if the service layer tunnel head node is set. If the handshake message of the last node of the service layer tunnel is not received within the time, it is determined that the end node of the service layer tunnel is unreachable.
  • the reason for waiting for the unit to wait for the fault state route to be spread is that it is necessary to confirm that the node is invalid when all the ports of the node have an alarm. It is also possible to use the waiting unit without any use. As long as there is a port alarm, the step of determining whether the last node is reachable is directly performed.
  • the determining module 201 may specifically include an upstream node determining unit 201a and a last node reachability determining unit 201b.
  • the upstream node determining unit 201a is configured to determine whether the port where the alarm is generated is an egress port of the upstream node of the tunnel end node, and if yes, send the determination result to the end node reachability determining unit 202b, if not, The service layer tunnel end node has not failed.
  • the last node reachability determining unit 201b is configured to receive the determination result sent by the upstream node determining unit 201a, and if the port where the alarm occurs is an egress port of the upstream node of the service layer tunnel end node, determine the service. Whether the end node of the layer tunnel is reachable. If the end node of the service layer tunnel is unreachable, the end node of the service layer tunnel is invalid. If the end node of the service layer tunnel is reachable, otherwise the end node of the tunnel of the service layer is not invalid. .

Description

一种恢复网络业务的方法及装置 本申请要求于 2007 年 03 月 29 日提交中国专利局、 申请号为 200710064922.9、 发明名称为 "一种提高多层网络业务恢复性能的方法及装 置"的中国专利申请的优先权, 其全部内容通过引用结合在本申请中。 技术领域
本发明涉及网络通信技术领域, 特别涉及一种恢复网络业务的方法及 装置。 背景技术
随着网络大融合和网络扁平化的发展, 原有不同类型网络之间的界限 逐渐模糊, 在统一控制平面下的广义的多层网络成为追求的目标, 最终实 现不同层网络之间灵活高效的混合调度, 以优化网络业务的管理和提高网 络的生存性。 在多层网络的混合调度中, 釆用分层的隧道技术能够有效地 提高业务调度的效率, 屏蔽不同层业务之间的影响, 使得在业务调度中只 需要关注不同层隧道之间的接口以及策略的协调。 可能的多层网络混合调 度的情况有: 因特网协议 IP ( Internet Protocol ) +时分复用 TDM ( Time Division Multiplexing ) 、 IP+波分复用 WDM ( Wavelength Division
Multiplexing ) 、 TDM+WDM以及 TDM不同颗粒, 如同步数字体系 SDH ( Synchronous Digital Hierarchy )的 VC12和 VC4。 在这些多层网络中称处 于承载位置的网络为服务层, 相对于服务层的上层网络被称为客户层, 这 些网络中承载的业务在配置和动态恢复方面存在着很大的相似性。 服务层 一般对应的颗粒较大, 对网络故障的响应较为直接和快速, 故障的恢复效 率较高, 但端到端的灵活性较差; 而客户层一般对应颗粒较小, 数量较多, 在网络故障发生的时候影响面较广, 故障恢复效率较低, 但端到端灵活性 较高。
参见图 5 ,为釆用分段的服务层隧道来承载小颗粒的客户层标签交换路 径 LSP (Label Switch Path)的示意图, 这种承载方式是提高资源利用率和网 络整体恢复效率的有效途径, 但需要解决新建和故障恢复等情况下分段隧 道和上层 LSP的合理调度问题, 既要保证客户层业务具有较高性能的保护 恢复能力, 又要保证网络在发生故障的时候动荡较小。 以上的目标在很多 情况下是存在矛盾的, 只有通过不同层隧道调度的协调和折中, 才能达到 最终满意的效果。 在目前的多层隧道的恢复策略中釆用自底向上的恢复方 式, 在判定故障通告中的故障信息之后首先恢复服务层隧道, 如果服务层 隧道恢复失败则直接恢复客户层。 因而快速地判定服务层隧道是否能成功 恢复则是关系到业务恢复性能的重要问题。例如,用 SDH的分段 VC4隧道 来承载 VC12LSP的情况下, 为了提高 VC12LSP的恢复性能, 同时又考虑 整体恢复效率的因素, 故障发生时需要快速判定 VC4隧道能否成功恢复, 此功能对提高 VC12LSP的恢复性能起重要的作用。
现有技术中, 釆用客户层信令超时等待的方法判定服务层隧道是否能 够成功恢复, 从而决定客户层是否进行恢复。 如果服务层隧道可以进行正 常的恢复, 则服务层隧道恢复后客户层业务就恢复, 否则, 客户层进行等 待直到恢复定时器超时, 发起客户层业务的端到端恢复, 此时客户层可能 会重新选择其他服务层隧道作为自己的业务承载层。
但是, 在出现服务层隧道末节点失效的情况下, 由于首末节点作为本 层隧道信令的源和宿出现故障时信令会导致服务层无法及时恢复, 因而客 户业务的整体恢复性能受到很大影响, 很难满足网络业务对恢复性能的要 求。 发明内容
本发明实施例提供了一种恢复网络业务的方法及装置, 可缩短客户业 务的整体恢复时间, 满足网络业务对恢复性能的要求。 本发明实施例提供 的一种恢复网络业务的方法, 包括:
接收故障通告, 判断故障所在服务层的隧道末节点是否失效; 如果所述服务层的隧道末节点失效, 则进行客户层隧道的恢复; 如果所述服务层的隧道末节点没有失效, 则进行服务层隧道的恢复。
本发明实施例还提供一种恢复网络业务的装置, 包括判断模块和故障 恢复模块;
所述判断模块, 用于接收故障通告, 判断故障所在的服务层的隧道末 节点是否失效, 并将判断结果发送给所述故障恢复模块; 所述故障恢复模块, 用于接收所述判断结果; 如果所述判断结果是所 述服务层的隧道末节点失效, 则进行客户层隧道的恢复; 如果所述判断结 果是所述服务层的隧道末节点没有失效, 则进行服务层隧道的恢复。
本发明实施例中, 通过自底向上的隧道恢复策略并结合末节点是否失 效, 以判断是进行服务层隧道的恢复还是直接进行客户业务的恢复, 既保 证了网络整体恢复效率的提升, 又保证了单条业务在各种情况下较优的恢 复性能。 附图说明
图 1是本发明实施例 1中的一种恢复网络业务的方法的流程图; 图 2是本发明实施例 1中的 VC-4隧道承载 VC-12LSP的示意图; 图 3是本发明实施例 2中恢复网络业务的装置的结构图;
图 4是本发明实施例 3中恢复网络业务的装置的结构图;
图 5是现有技术中釆用分段服务层隧道承载小颗粒客户层 LSP的示意 图。 具体实施方式
现有的业务恢复的技术方案中, 当客户业务首节点收到故障通告时启 动恢复定时器, 定时器超时时长一般设为秒级。 如果服务层隧道可以进行 正常的恢复, 则服务层隧道恢复后客户层业务就恢复, 否则, 客户层进行 等待直到恢复定时器超时, 发起客户层业务的端到端恢复, 此时客户层可 能会重新选择其他服务层隧道作为自己的业务承载层。 但是, 通过超时等 待的方法判定服务层的恢复结果, 在出现服务层隧道末节点失效的情况下, 由于首末节点是本层隧道信令的源和宿, 在源和宿出现故障时信令是无法 绕开的, 因而服务层是无法恢复的, 客户端需要等到定时器超时才能进行 恢复。 由于服务层隧道正常恢复一般是几十毫秒, 而恢复定时器超时时长 为秒级, 因而客户业务的整体恢复性能受到很大影响, 很难满足网络业务 对恢复性能的要求。 鉴于此, 本发明实施例提供了一种恢复网络业务的方 法及装置, 可缩短客户业务的整体恢复时间, 满足网络业务对恢复性能的 要求。
下面结合附图和具体实施例对本发明作进一步说明, 但本发明不局限 于以下实施例。
本发明实施例提供的技术方案中, 客户层承载在分段隧道上, 当服务 层隧道首节点接收到故障通告, 如果判定服务层的末节点失效, 则由服务 层隧道的首节点通知客户业务的首节点, 使其自行恢复; 如果判定服务层 末节点没有失效, 客户层业务不会执行任何动作, 等待服务层进行恢复。
实施例 1
参见图 1 , 本发明实施例公开了一种恢复网络业务的方法, 具体包括以 下步骤:
步骤 101 : 当光纤任意一处发生了故障, 相应的服务层隧道首节点和 客户层 LSP首节点就会接收到故障通告,故障通告中包含有发生告警的节点 标识和端口标识。 此时客户层 LSP并不作任何动作, 等待服务层隧道首节点 首先定位故障的位置。
步骤 102: 收到故障通告的服务层隧道首节点遍历与发生告警的节点和 端口相关的所有服务层隧道的路径资源信息。
步骤 103: 根据服务层隧道首节点遍历的路径资源信息判断发生告警的 端口是否为该服务层隧道末节点的上游节点的出端口, 如果是, 执行步骤 104, 否则执行步骤 105。
步骤 104: 发生告警的端口是服务层隧道末节点的上游节点的出端口, 因而判定可能是该服务层隧道的末节点失效, 需要进一步判定末节点是否 可达。
由于在判定末节点是否可达之前需要等待故障状态路由扩散完毕, 此 时启动路由查询定时器, 定时时长为 50毫秒。 釆用定时器等待故障状态路 由扩散完毕的原因是需要确认在该节点的所有端口都发生告警的情况下, 该节点才算失效。 也可以不启动该路由查询定时器, 只要有一个端口告警, 就直接进行判断末节点是否可达的步骤。
路由查询定时器超时后, 判断末节点是否可达, 如果末节点可达, 执 行步骤 105 , 如果末节点不可达, 执行步骤 106。
判断末节点是否可达可以以各隧道的首末节点作为源和宿, 在流量工 程(TE, Traffic Engineering )数据库中进行路由计算, 如果可以计算出路 由, 说明末节点可达, 如果无法计算出路由, 即计算失败, 说明末节点不 可达, 就能判定末节点已经失效。
判断末节点是否可达也可以釆用双向转发检测 BFD ( Bidirecation Forwarding Detection )协议来实现。通过运行 BFD协议在首末节点间定时(如 10毫秒)发送 Hello (握手)报文, 以判定两点间接口的工作状态, 如果首 节点长时间收不到末节点的握手消息, 则说明末节点已经无法正常工作, 就能判定末节点不可达, 已经失效。 这种做法在纯 IP网络中较为常见, 在可 以及时上报端口故障的网络中可以直接判定端口的状态, 所以并不依赖于 此种通过 BFD协议定时状态查询的协议来检测故障,但釆用 BFD协议的方法 也是可以达到同样的判断末节点失效的效果的。
步骤 105: 隧道首节点发起服务层隧道的恢复。
步骤 106: 末节点不可达, 则末节点失效, 此服务层隧道无法恢复, 这 时查找此隧道承载的所有客户业务的首节点, 并向这些首节点发送底层恢 复失败的通告, 由这些客户层首节点分别进行客户层隧道的恢复。
下面以多条 VC-12 LSP承载在多条分段的 VC-4隧道上为例,具体说明本 发明实施例。
参见图 2, 客户层为两条 VC-12 LSP: VC-12 LSP A和 VC-12 LSP B, 服 务层为 3条分段的 VC-4隧道: FA1、 FA2和 FA3。 上述两条 VC-12LSP承载在 3条 VC-4隧道上。 其中 1、 2、 3、 4、 5分别为节点。 分别以 A、 B、 C三处发 生故障为例, 具体说明本发明实施例。
1 ) 当 A处发生断纤故障的时候, 节点 2接收到故障通告, 该故障通告中 包含: 告警节点为节点 2, 端口为 A。 节点 2查找以本节点为首节点的 VC-4 隧道的 HOP信息, 发现端口 A不是本隧道末节点 4的上游节点 3的出口, 则立 即发起隧道 FA2的重路由恢复。
2 ) 当 B处发生断纤故障的时候, 节点 2接收到故障通告, 该故障通告中 包含: 告警节点为节点 3 , 端口为 B。 节点 2查找以本节点为首节点的 VC-4 隧道的 HOP信息, 发现端口 B是本隧道末节点 4的上游节点 3的出端口, 说明 节点 4有可能失效。
这时先启动 50毫秒定时器, 定时器超时后, 查找 TE数据库以节点 2、 4 为源宿节点进行路由计算, 如果计算出了路由, 说明节点 4可达, 不是失效 节点, 则立即发起隧道 FA2的重路由恢复。
如果无法计算出路由, 说明节点 4不可达, 是失效节点, 则节点 2查找 经过本节点的各上层 VC-12LSP的首节点,本例中对应 VC-12 LSP A和 VC-12 LSP B , 其首节点分别为 2号节点和 1号节点, 然后通过远端通知机制通知节 点 1和节点 2发起 LSP A和 LSP B重路由恢复。
3 ) 当 C处发生断纤故障的时候, 节点 2接收到故障通告, 该故障通告中 包含: 告警节点为节点 4 , 端口为 C。 节点 2查找以本节点为首节点的 VC-4 隧道的 HOP信息, 发现端口 C是本隧道末节点 4的入端口, 认为末节点没有 失效, 则立即发起隧道 FA2的重路由恢复。
实施例 2
参见图 3 ,本发明实施例还提供了一种恢复网络业务的装置, 包括判断 模块 201和故障恢复模块 202。
判断模块 201 ,用于在收到故障通告后判断与故障相关的服务层隧道末 节点是否失效, 并将判断结果发送给故障恢复模块 202;
故障恢复模块 202, 用于接收所述判断模块发送的判断结果, 如果所述 服务层隧道末节点失效, 则进行客户层隧道的恢复; 如果所述服务层隧道 末节点没有失效, 则进行 Λ良务层隧道的恢复。
其中判断模块 201可以具体包括上游节点判断单元 201 a、等待单元 201 c 和末节点可达性判断单元 201b;
上游节点判断单元 201a用于判断所述发生告警的端口是否为所述隧道 末节点的上游节点的出端口, 如果是, 则将判断结果发送给等待单元 201c, 如果不是, 则所述服务层隧道末节点没有失效;
等待单元 201c, 用于等待故障状态路由扩散完毕, 并将已等待完毕的 信号发送给所述末节点可达性判断单元 201b;
末节点可达性判断单元 201b,用于接收所述等待单元 201c发送的已等 待完毕的信号后, 判断所述服务层隧道末节点是否可达, 如果所述服务层 隧道末节点不可达, 则服务层隧道末节点失效, 如果所述服务层隧道末节 点可达, 否则所述 ^良务层隧道末节点没有失效。 末节点可达性判断单元 201b包括路由计算判断单元, 用于以所述服务 层隧道的首末节点作为源和宿, 在流量工程 TE数据库中进行路由计算, 如 果可以计算出路由, 则所述服务层隧道末节点可达, 如果无法计算出路由, 则所述 Λ良务层隧道末节点不可达。
末节点可达性判断单元 201b也可以包括一个定时查询单元, 用于服 务层隧道首节点通过运行 BFD协议向服务层隧道末节点定时发送 Hello报 文, 如果所述服务层隧道首节点在设定时间内收不到服务层隧道末节点的 握手消息, 则判断服务层隧道末节点不可达。
实施例 3
在故障判断模块 201中等待单元等待故障状态路由扩散完毕的原因是 需要确认在该节点的所有端口都发生告警的情况下, 该节点才算失效。 也 可以不釆用该等待单元, 只要有一个端口告警, 就直接进行判断末节点是 否可达的步骤。
参见图 4, 判断模块 201可以具体包括上游节点判断单元 201a和末节 点可达性判断单元 201b。
上游节点判断单元 201a用于判断所述发生告警的端口是否为所述隧道 末节点的上游节点的出端口, 如果是, 则将判断结果发送给末节点可达性 判断单元 202b, 如果不是, 则所述服务层隧道末节点没有失效。
末节点可达性判断单元 201b用于接收所述上游节点判断单元 201a发 送的判断结果, 如果所述发生告警的端口是所述服务层隧道末节点的上游 节点的出端口, 则判断所述服务层隧道末节点是否可达, 如果所述服务层 隧道末节点不可达, 则服务层隧道末节点失效, 如果所述服务层隧道末节 点可达, 否则所述 ^良务层隧道末节点没有失效。
其它模块的内容和连接关系与实施例 2 中所述的装置相同, 这里不再 赘述。
上述实施例中, 服务层隧道首节点通过故障定位和路由检测的手段, 通过判断与故障相关的末节点是否失效, 可以在很短时间内判定出服务层 隧道是否可以恢复成功, 这样可缩短客户业务的整体恢复时间, 满足网络 业务对恢复性能的要求。 既保证了网络整体恢复效率的提升, 又保证了单 条业务在各种情况下最优的恢复性能。
以上所述的实施例, 只是用于说明本发明原理和特性的示范性的优选 的具体实施例, 本领域的技术人员在不脱离本发明的实质和原理的前提下 进行的通常变化和等同替换都应包含在本发明的保护范围内。

Claims

权 利 要 求
1. 一种恢复网络业务的方法, 其特征在于, 包括:
接收故障通告, 判断故障所在服务层的隧道末节点是否失效; 如果所述服务层的隧道末节点失效, 则进行客户层隧道的恢复; 如果所述服务层的隧道末节点没有失效, 则进行服务层隧道的恢复。
2. 如权利要求 1所述的方法, 其特征在于, 所述判断故障所在服务层 的隧道末节点是否失效的步骤, 包括:
遍历告警节点和告警端口的隧道的路径资源信息, 确定所述告警端口 是所述服务层的隧道末节点的上游节点的出端口, 确定所述服务层的隧道 末节点不可达, 确定所述 Λ良务层的隧道末节点失效。
3. 如权利要求 1所述的方法, 其特征在于, 所述判断故障所在服务层 的隧道末节点是否失效的步骤, 包括:
遍历告警节点和告警端口的隧道的路径资源信息, 确定所述告警端口 不是所述服务层的隧道末节点的上游节点的出端口, 确定所述服务层的隧 道末节点未失效。
4. 如权利要求 1所述的方法, 其特征在于, 所述判断故障所在服务层 的隧道末节点是否失效的步骤, 包括:
遍历告警节点和告警端口的隧道的路径资源信息, 确定所述告警端口 是所述服务层的隧道末节点的上游节点的出端口, 确定所述服务层的隧道 末节点可达, 确定所述服务层的隧道末节点未失效。
5. 如权利要求 2 或 4所述的方法, 其特征在于, 所述判断故障所在服 务层的隧道末节点是否失效的步骤, 包括: 在判断所述服务层的隧道末节 点是否可达的步骤之前, 等待故障状态路由扩散完毕。
6. 如权利要求 5所述的方法, 其特征在于, 所述判断所述服务层的隧 道末节点是否可达的步骤具体包括:
以所述服务层的隧道的首节点和末节点作为源和宿, 在流量工程数据 库中进行路由计算, 如果计算出路由, 则确定所述服务层的隧道末节点可 达, 如果无法计算出路由, 则确定所述服务层的隧道末节点不可达。
7. 如权利要求 5所述的方法, 其特征在于, 所述判断所述服务层的隧 道末节点是否可达的步骤具体包括:
所述服务层的隧道首节点通过运行双向转发检测协议向所述服务层的 隧道末节点定时发送握手消息, 如果所述服务层的隧道首节点在设定时间 内收到所述服务层的隧道末节点反馈的握手响应消息, 则所述服务层的隧 道末节点可达; 如果所述服务层的隧道首节点在设定时间内未收到所述服 务层的隧道末节点反馈的握手响应消息, 则所述服务层的隧道末节点不可 达。
8. 一种恢复网络业务的装置, 其特征在于, 包括判断模块和故障恢 复模块;
所述判断模块, 用于接收故障通告, 判断故障所在的服务层的隧道末 节点是否失效, 并将判断结果发送给所述故障恢复模块;
所述故障恢复模块, 用于接收所述判断结果; 如果所述判断结果是所 述服务层的隧道末节点失效, 则进行客户层隧道的恢复; 如果所述判断结 果是所述服务层的隧道末节点没有失效, 则进行服务层隧道的恢复。
9. 如权利要求 8所述的装置, 其特征在于, 所述判断模块具体包括: 上游节点判断单元和末节点可达性判断单元;
所述上游节点判断单元, 用于确定告警端口是所述服务层的隧道末节 点的上游节点的出端口, 将确定结果发送给所述末节点可达性判断单元; 确定所述告警端口不是所述服务层的隧道末节点的上游节点的出端口, 确 定所述 ^良务层的隧道末节点未失效;
所述末节点可达性判断单元, 用于接收所述确定结果, 触发判断所述 服务层的隧道末节点是否可达; 若所述服务层的隧道末节点不可达, 则确 定所述服务层的隧道末节点失效; 若所述服务层的隧道末节点可达, 则确 定所述 ^良务层的隧道末节点未失效。
10. 如权利要求 8所述的装置,其特征在于,所述判断模块具体包括: 上游节点判断单元、 等待单元和末节点可达性判断单元;
所述上游节点判断单元, 用于确定告警端口是所述服务层的隧道末节 点的上游节点的出端口, 将确定结果发送给所述末节点可达性判断单元; 确定所述告警端口不是所述服务层的隧道末节点的上游节点的出端口, 确 定所述 ^良务层的隧道末节点未失效;
所述等待单元, 用于等待故障状态路由扩散完毕, 并将已等待完毕的 信号发送给所述末节点可达性判断单元;
所述末节点可达性判断单元, 用于接收所述已等待完毕的信号, 触发 判断所述服务层的隧道末节点是否可达; 若所述服务层的隧道末节点不可 达, 则确定所述服务层的隧道末节点失效; 若所述服务层的隧道末节点可 达, 则确定所述服务层的隧道末节点未失效。
11. 如权利要求 9或 10所述的装置, 其特征在于, 所述末节点可达性 判断单元包括:
路由计算判断单元, 用于以所述服务层的隧道首节点和末节点作为源 和宿, 在流量工程数据库中进行路由计算, 如果计算出路由, 则确定所述 服务层的隧道末节点可达, 如果无法计算出路由, 则确定所述服务层的隧 道末节点不可达。
12. 如权利要求 9或 10所述的装置, 其特征在于, 所述末节点可达 性判断单元包括:
定时查询单元, 用于所述服务层的隧道首节点通过运行双向转发检测 协议向所述服务层的隧道末节点定时发送握手消息, 如果所述服务层的隧 道首节点在设定时间内收不到所述服务层的隧道末节点反馈的握手响应消 息, 则确定所述服务层的隧道末节点不可达。
PCT/CN2008/070589 2007-03-29 2008-03-26 Procédé et matériel de restauration du commerce en réseau WO2008119294A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN 200710064922 CN100512154C (zh) 2007-03-29 2007-03-29 一种提高多层网络业务恢复性能的方法及装置
CN200710064922.9 2007-03-29

Publications (1)

Publication Number Publication Date
WO2008119294A1 true WO2008119294A1 (fr) 2008-10-09

Family

ID=38795829

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2008/070589 WO2008119294A1 (fr) 2007-03-29 2008-03-26 Procédé et matériel de restauration du commerce en réseau

Country Status (2)

Country Link
CN (1) CN100512154C (zh)
WO (1) WO2008119294A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114567542A (zh) * 2022-02-16 2022-05-31 烽火通信科技股份有限公司 硬管道专线逐跳业务检测方法、装置、设备及存储介质

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100512154C (zh) * 2007-03-29 2009-07-08 华为技术有限公司 一种提高多层网络业务恢复性能的方法及装置
CN101621497B (zh) * 2008-06-30 2012-11-21 华为技术有限公司 一种多层网络中的业务保护的方法和系统
CN101505246B (zh) * 2009-03-02 2011-08-24 中兴通讯股份有限公司 网络隧道分段保护方式的故障检测方法和修复方法
CN102136965B (zh) 2010-12-24 2013-12-04 华为技术有限公司 一种隧道故障检测方法和流量工程节点
WO2013091173A1 (zh) * 2011-12-20 2013-06-27 中兴通讯股份有限公司 一种自动解除动态隧道与服务层告警关联的方法和装置
CN103580904A (zh) * 2012-08-08 2014-02-12 中兴通讯股份有限公司 域间故障信息的发送方法及装置
CN102833108B (zh) * 2012-08-30 2016-03-30 华为技术有限公司 故障点位置信息处理方法及设备
CN102833109B (zh) * 2012-08-30 2015-01-07 华为技术有限公司 故障点位置信息处理方法及设备
CN104283711B (zh) * 2014-09-29 2018-01-26 中国联合网络通信集团有限公司 基于双向转发检测bfd的故障检测方法、节点及系统
CN114125911B (zh) * 2021-10-21 2023-09-29 中盈优创资讯科技有限公司 一种5g专线业务中断故障定位方法及装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5657320A (en) * 1995-06-06 1997-08-12 Mci Corporation Method and system for resolving contention of spare capacity circuits of a telecommunications network
CN1512716A (zh) * 2002-12-30 2004-07-14 北京邮电大学 一种多层网络故障恢复方法
EP1610496A1 (en) * 2004-06-18 2005-12-28 Avaya Technology Corp. Rapid fault detection and recovery for internet protocol telephony
CN1885789A (zh) * 2005-06-23 2006-12-27 华为技术有限公司 自动交换光网络中通道故障的处理方法
CN101056203A (zh) * 2007-03-29 2007-10-17 华为技术有限公司 一种提高多层网络业务恢复性能的方法及装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5657320A (en) * 1995-06-06 1997-08-12 Mci Corporation Method and system for resolving contention of spare capacity circuits of a telecommunications network
CN1512716A (zh) * 2002-12-30 2004-07-14 北京邮电大学 一种多层网络故障恢复方法
EP1610496A1 (en) * 2004-06-18 2005-12-28 Avaya Technology Corp. Rapid fault detection and recovery for internet protocol telephony
CN1885789A (zh) * 2005-06-23 2006-12-27 华为技术有限公司 自动交换光网络中通道故障的处理方法
CN101056203A (zh) * 2007-03-29 2007-10-17 华为技术有限公司 一种提高多层网络业务恢复性能的方法及装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114567542A (zh) * 2022-02-16 2022-05-31 烽火通信科技股份有限公司 硬管道专线逐跳业务检测方法、装置、设备及存储介质
CN114567542B (zh) * 2022-02-16 2023-09-15 烽火通信科技股份有限公司 硬管道专线逐跳业务检测方法、装置、设备及存储介质

Also Published As

Publication number Publication date
CN101056203A (zh) 2007-10-17
CN100512154C (zh) 2009-07-08

Similar Documents

Publication Publication Date Title
WO2008119294A1 (fr) Procédé et matériel de restauration du commerce en réseau
EP1898584B1 (en) A method and device for recovering the share mesh network
EP1845656B1 (en) A method for implementing master and backup transmission path
EP1973267B1 (en) A service failure restoration method and system
EP1986371B1 (en) A failure processing method and a system and a device thereof
US20020171886A1 (en) Automatic control plane recovery for agile optical networks
US8335154B2 (en) Method and system for providing fault detection and notification for composite transport groups
WO2006026930A1 (fr) Procede de restauration d'un service de maillage optique
WO2007009360A1 (fr) Procede et appareil pour retablir une liaison defaillante
WO2008025285A1 (fr) Procédé et dispositif pour obtenir un service de multidiffusion avec des types multiples de protection et de récupération
WO2006025296A1 (ja) 障害回復方法およびネットワーク装置ならびにプログラム
CN101176303A (zh) 一种业务倒换的方法和网络节点
WO2008046358A1 (fr) Procédé et dispositif destinés à réaliser une pénétration d'un statut de liaison de réseau point à multipoint
WO2010124557A1 (zh) 环网保护方法、网络节点及环网络
WO2007109937A1 (en) Resilient optical burst ring failure protecting method, apparatus and failure processing method
WO2006069549A1 (fr) Procede de protection de service de donnees dans un reseau d'emission metropolitain
CN101577719A (zh) 一种双机热备方法、装置和系统
WO2011157130A2 (zh) 路径建立方法和装置
WO2011076024A1 (zh) 传送多协议标签交换网络系统和链路保护方法
WO2013178098A1 (zh) 域间故障信息的发送方法及装置
WO2010028560A1 (zh) 在mesh网络中实现永久环网保护的方法
EP2302836B1 (en) Method, system and device for recovering service
WO2010121459A1 (zh) 一种自动交换光网络中实现保护与恢复的方法及系统
WO2012106976A1 (zh) 一种故障保护方法及设备
WO2007036101A1 (fr) Systeme et procede de protection de voie de service multidiffusion

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08715325

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08715325

Country of ref document: EP

Kind code of ref document: A1