WO2010083764A1 - 一种协调自动保护倒换操作与恢复操作的装置及方法 - Google Patents
一种协调自动保护倒换操作与恢复操作的装置及方法 Download PDFInfo
- Publication number
- WO2010083764A1 WO2010083764A1 PCT/CN2010/070303 CN2010070303W WO2010083764A1 WO 2010083764 A1 WO2010083764 A1 WO 2010083764A1 CN 2010070303 W CN2010070303 W CN 2010070303W WO 2010083764 A1 WO2010083764 A1 WO 2010083764A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- protection
- alarm
- recovery
- protocol unit
- channel
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L12/00—Data switching networks
- H04L12/28—Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
- H04L12/42—Loop networks
- H04L12/437—Ring fault isolation or reconfiguration
Definitions
- the present invention relates to an optical network, and in particular, to an apparatus and method for coordinating automatic protection switching (APS) operation and recovery operation.
- APS automatic protection switching
- BACKGROUND With the increasing scale of networks and the need for high-quality services, optical networks have increasingly higher requirements for network survivability.
- Network survivability means that the network can maintain an acceptable level of quality of service in the event of a failure.
- APS operation and recovery operations are the main means to improve optical network survivability and support QoS (Quality of Service) requirements.
- QoS Quality of Service
- the basic idea of the APS operation is to pre-configure the protection channel for the service. When the working channel fails, the working channel detection unit of the corresponding node detects the alarm and reports it to the protection protocol unit of the node.
- the protection protocol unit receives the working channel alarm. After the APS operation is started, the protection protocol unit runs the set protection protocol algorithm, and performs signaling interaction with the protection protocol units of other nodes in the protection channel. Then, the protection protocol units of each node in the protection channel respectively go to the node. The execution unit sends a switching instruction. Finally, the execution unit of each node in the protection channel performs an APS operation to switch the service to the protection channel to ensure normal operation of the service.
- the APS operation has the advantages of fast switching time and short service interruption time, but the network resource utilization rate is relatively low.
- the recovery operation does not pre-configure the protection channel for the service.
- the working channel detection unit of the corresponding node detects the alarm and sends it to the recovery protocol unit of the node.
- the recovery protocol unit receives the working channel.
- the recovery operation is started, that is, the recovery protocol unit recalculates a new channel for the service from the idle resources of the current network, and performs signaling interaction with the recovery protocol unit of other nodes in the new channel.
- each node in the new channel The recovery protocol unit sends a switching instruction to the execution unit of the node respectively.
- the execution unit of each node in the new channel performs a recovery operation to switch the service to the new channel to ensure normal operation of the service.
- the device includes a working channel detecting unit, a protection protocol unit, and a recovery protocol unit;
- the working channel detecting unit detects the working channel alarm, it is added to the protection protocol unit and the recovery protocol unit; the protection protocol unit starts the APS operation immediately after receiving the working channel alarm, and the recovery protocol unit receives the working channel alarm. After waiting for the preset duration holdoff, check whether the working channel detection unit still detects the alarm. If yes, start the recovery operation. Otherwise, the service is running normally, and no recovery operation is required.
- the technical problem to be solved by the present invention is to provide an apparatus and method for coordinating APS operation and recovery operations, which reduces the time of service loss in the case of APS function failure.
- a device for coordinating APS operation and recovery operation comprising a working channel detecting unit, a protection protocol unit and a recovery protocol unit; and the working channel detecting unit is used for the service
- the working channel performs fault monitoring, and when the working channel fails, the alarm is sent to the protection protocol unit and the recovery protocol unit;
- the protection protocol unit is configured to receive the working channel alarm and determine whether an immediate start is needed.
- the recovery protocol unit is configured to receive the work channel alarm and the immediate start recovery operation notification, when receiving the When the recovery operation notification is started immediately, or when the received working channel alarm timeout still exists, the recovery operation is started.
- the device further includes a protection channel detecting unit, configured to perform fault monitoring on the protection channel of the service, and alert the protection protocol unit when the protection channel fails; the protection protocol unit is further configured to receive the Protect channel alarms.
- the protection protocol unit is further configured to determine the type of the received alarm, and record the alarm information when the protection channel alarm is received and mark the status of the protection channel as having an alarm.
- a method for coordinating the APS operation and the recovery operation includes the following steps: Step a: When a working channel of the current service generates a fault, the working channel detecting unit sends an alarm to the protection protocol unit of the node and the recovery protocol unit; After the receiving the alarm of the working channel, the recovery protocol unit starts a timer, and the protection protocol unit determines whether the recovery operation needs to be started immediately after receiving the alarm of the working channel, and if yes, proceeds to step c; step c: the protection The protocol unit notifies the recovery protocol unit to immediately start the recovery operation, and the recovery protocol unit starts the recovery operation immediately after receiving the notification, and the alarm processing is completed.
- the protection protocol unit starts an APS operation; the recovery protocol unit waits for the timer to expire before checking whether the working channel detection unit still has an alarm. If yes, the recovery operation is started, and the alarm processing is completed. Otherwise, the alarm processing is completed.
- the protection channel detection unit sends a protection channel alarm to the protection protocol unit of the local node. After receiving the alarm, the protection protocol unit determines the type of the alarm, and if it is a working channel alarm, proceeds to step b; if it is a protection channel alarm, records the alarm information and marks the protection channel status In order to have an alarm.
- the protection protocol unit uses the following method to determine whether the recovery operation needs to be started immediately: Step A: Check whether the status of the protection channel of the current service is an alarm, and if so, the recovery operation needs to be started immediately; otherwise, step B is performed; Step B: Check whether the protection group of the service is in the non-enabled state. If yes, you need to start the recovery operation immediately. Otherwise, you do not need to start the recovery operation immediately.
- the protection protocol unit is internal but not limited to a Data Communication Network (DCN), a High Level Data Link Control (HDLC) protocol bus, and a Central Processing Unit (CPU).
- DCN Data Communication Network
- HDLC High Level Data Link Control
- CPU Central Processing Unit
- the protection channel detecting unit sends an alarm to the protection protocol unit by means of a DCN or HDLC protocol bus.
- the beneficial effects of the present invention are mainly manifested in:
- the method for coordinating APS operation and recovery operation according to the present invention can be implemented by the device for coordinating APS operation and recovery operation according to the present invention; a protection channel detecting unit is added to the device for detecting The protection channel of the service realizes the monitoring of the protection channel failure. At the same time, the communication mechanism between the protection protocol unit and the recovery protocol unit is added, and the information interaction between the two is realized.
- the protection protocol unit receives the service. After the working channel is alarmed, it is first determined whether the recovery operation needs to be started immediately according to the status of the protection channel.
- FIG. 1 is a schematic structural view of a device for coordinating APS and service recovery operations
- FIG. 2 is a schematic structural view of the device according to the present invention
- FIG. 3 is a flow chart of the method of the present invention
- FIG. 5 is a schematic diagram of a network topology structure according to Embodiment 1 of the present invention
- FIG. 7 is a schematic diagram of a network topology structure according to Embodiment 2 of the present invention.
- a device for coordinating APS operation and recovery operation is used for each node in an optical network, including a working channel detecting unit, a protection channel detecting unit, a protection protocol unit, and a recovery protocol unit; and the working channel detecting unit is configured to The working channel of the service performs fault monitoring, and when the working channel fails, the alarm is sent to the protection protocol unit and the recovery protocol unit;
- the protection channel detection unit is configured to perform fault monitoring on the protection channel of the service, and the alarm is sent to the protection protocol unit when the protection channel is faulty;
- the protection channel detection unit reports the alarm to the protection protocol unit through the DCN and the HDLC protocol bus;
- the protection protocol unit is configured to receive the reported alarm, and is used to determine the type of the received alarm, record the alarm information when the protection channel alarm is received,
- the recovery protocol unit is configured to receive the working channel alarm and start the timer, and start the recovery operation when the notification of the immediate activation of the protection protocol unit is received, or when the timer expires but the working channel detection unit still has an alarm. That is to say, the recovery protocol unit is configured to receive the working channel alarm and immediately initiate the recovery operation notification, and start the recovery operation when receiving the immediate start recovery operation notification, or when the received working channel alarm timeout still exists.
- a protection channel detecting unit is added to the foregoing device, which is used for detecting a protection channel of the service, and realizes monitoring of the protection channel failure, and at the same time, increases a communication mechanism between the protection protocol unit and the recovery protocol unit. The interaction of information between the two is realized. Referring to FIG.
- the method for coordinating the APS operation and the recovery operation specifically includes the following steps: Step 301: The working channel detecting unit and the protection channel detecting unit respectively monitor the status of the working channel and the protection channel of the current service, if the current service If the working channel fails, the working channel detecting unit is alerted to the protection protocol unit and the recovery protocol unit; if the protection channel of the current service fails, the protection channel detecting unit sends an alarm to the protection channel.
- Protecting the protocol unit Step 302: The protection protocol unit and the recovery protocol unit simultaneously process the received alarm; and start the corresponding operation according to the processing result.
- the protection protocol unit after receiving the service channel alarm of the service, the protection protocol unit first determines whether the recovery operation needs to be started immediately according to the status of the protection channel, and the APS function fails. Then, the recovery protocol unit in the notification timer timeout waiting process immediately starts the recovery operation, thereby reducing the damage time of the service.
- the specific steps of the protection protocol unit for processing the alarm are as shown in FIG.
- Step 401 The protection protocol unit determines the type of the received alarm, and if it is a protection channel alarm, step 4 is performed; if it is a working channel alarm, Step 404: Step 402: Record the alarm information and mark the status of the protection channel as having an alarm, and the alarm processing ends; Step 403: Determine whether the recovery operation needs to be started immediately, and if yes, execute step 404; otherwise, execute the step 4 405; This step checks the status of the current service protection channel. If it is available, you do not need to start the recovery operation immediately. If it is not available, you need to start the recovery operation immediately. The protection channel is unavailable.
- Step 4 404: Notify the recovery protocol unit to start the recovery operation; Step 405: Start the APS operation.
- step 4 is performed; step 503: determining whether the timer expires, if yes, executing step 504; otherwise, performing step 502; step 504; checking whether the working channel detecting unit has an alarm, and if yes, performing the step 505; Otherwise, the alarm processing ends; Step 505: The recovery operation is started, and the alarm processing ends.
- a specific processing procedure for the recovery protocol unit to process the alarm is implemented. As shown in FIG.
- FIG. 6 it is a schematic diagram of a network topology structure of an embodiment in which the service has 1+1 protection and recovery attributes; in the figure, a pair of services 1 exists between node A and node C, and the working path is Node A, Node B, and Node C, the service has a 1+1 protection attribute, and the protection path is Node A, Node I, and Node C. Meanwhile, the service also has a recovery attribute, and both the working path and the protection path fail. When the business needs to be restored.
- Step 6a The protection protocol unit of the node A and the node C receives the alarm;
- Step 6b The type of the protection protocol unit to the alarm resource The judgment is made to determine the protection channel alarm;
- Step 6c The protection protocol unit records the alarm information, and marks the status of the protection channel of the service 1 as "there is an alarm, and the interval 1 of the working path of the service 1 is also set.
- Step 6a The protection protocol unit determines the type of the alarm resource, and determines the working channel alarm
- Step 6b The protection protocol unit checks the status of the protection channel of the service 1, In order to have an alarm, it is necessary to immediately start the recovery operation
- Step 6c The protection protocol unit issues a notification to the recovery protocol unit to "start the recovery operation immediately.”
- the process of recovering the protocol unit after the failure of the segment 1 is as follows: Step 6A: The recovery protocol unit receives the working channel alarm of service 1; Step 6B: Start the timer with the holdoff time being holdoff, where the value of holdoff is set to
- Step 6C During the timeout period of the waiting timer, the notification is received from the protection protocol unit.
- Step 6D The recovery operation is started immediately, that is, the service recovery mechanism of the ASON is started, so that the service 1 is rerouted as soon as possible, and the service loss time is reduced.
- step 6D through the service recovery mechanism of the ASON, the recovery path finally obtained by rerouting the service 1 is node A, node G, node F, node E, node D to node C, that is, the thin dotted line in FIG. path.
- FIG. 7 is a schematic diagram of a network topology structure of an embodiment of the present invention in the case where a service has a multiplex section sharing protection and recovery attribute; wherein, there is a pair of services between node E and node G (recorded as service 2)
- the working path is node E, node F and node G; service 2 has multiplex section shared protection attribute, and when the span 5 fails, the protection path of service 2 is node E, node D, node, node K, node J, Node I, Node G, Node F, and then to Node G.
- Step 7a The protection protocol unit of the node I and the node J receives the alarm;
- 7b The protection protocol unit determines the type of the alarm, and determines that it is a protection channel alarm.
- Step 7c The protection protocol unit records the alarm information, and marks the status of the protection channel of the service 2 as "with alarm”.
- Step 7a The protection protocol unit determines the alarm type and determines the working channel alarm
- Step 7b The protection protocol unit checks the service 2 The status of the protection channel is alarmed. Since the working channel and the protection channel of service 2 have fault alarms, the recovery operation needs to be started immediately.
- Step 7c The protection protocol unit issues an immediate recovery operation to the recovery protocol unit. Pass Know.
- Step 7A The recovery protocol unit receives the alarm of the working channel resource of the service 2;
- Step 7B Start the timer whose hold time is holdoff, where the value of the holdoff is set to 50ms (because the switching time of the multiplex section shared protection is generally within 50ms);
- Step 7C During the waiting timer timeout period, the recovery protocol unit receives the notification of immediate startup recovery from the protection protocol unit;
- Step 7D Start recovery immediately Operation, that is, start the ASON service recovery mechanism to reroute the service 2 as soon as possible to reduce the time of business damage.
- step 7D through the service recovery mechanism of the ASON, the recovery path finally obtained by rerouting the service 2 is the node E, the node H, and the node G, as shown by the thin dotted line in FIG. 7 .
- the service 2 is restored by using the ASON without waiting for the holdoff time, thereby reducing the interruption time of the service 2.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Maintenance And Management Of Digital Transmission (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
本发明涉及一种协调自动保护倒换操作与恢复操作的装置及方法,所述装置包括工作通道检测单元、保护通道检测单元、保护协议单元以及恢复协议单元;所述方法为:当前业务的工作通道产生故障时,工作通道检测单元上报工作通道告警给本节点的保护协议单元以及恢复协议单元; 恢复协议单元接收所述工作通道告警后启动定时器, 所述保护协议单元接收所述工作通道告警后判断是否需要立即启动恢复操作,若是,则保护协议单元通知恢复协议单元立即启动恢复操作,恢复协议单元收到通知后立即启动恢复操作。本发明在自动保护倒换功能失效的情况下减少了业务的受损时间。
Description
一种协调自动保护倒换操作与恢复操作的装置及方法 技术领域 本发明涉及光网络, 尤其涉及一种协调自动保护倒换 ( Automatic Protection Switching , APS ) 操作与恢复操作的装置及方法。 背景技术 随着网络规模的日益扩大与高质量业务的需要, 目前, 光网络对网络生 存性的要求越来越高。 网络生存性是指网络在失效情况下仍可维持可接受的 业务质量等级。 APS 操作与恢复操作是提高光网络生存性、 支持业务传输 QoS ( Quality of Service, 月艮务质量) 需求的主要手段。 APS 操作的基本思想是为业务预先配置保护通道, 工作通道出现故障 时, 相应节点的工作通道检测单元检测到告警, 并将其上报给本节点的保护 协议单元; 该保护协议单元接收工作通道告警后启动 APS操作, 即该保护协 议单元运行设定的保护协议算法, 并与保护通道中其它节点的保护协议单元 进行信令交互; 然后, 保护通道中各个节点的保护协议单元分别向本节点的 执行单元下发切换指令; 最后, 保护通道中各个节点的执行单元执行 APS操 作, 将业务切换到保护通道, 保证业务的正常运行。 APS操作具有倒换时间 快、 业务中断时间短的优点, 但是网络资源利用率相对较低。 恢复操作并不为业务预先配置保护通道, 工作通道出现故障时, 相应节 点的工作通道检测单元检测到告警, 并将其上 ·ί艮给本节点的恢复协议单元; 该恢复协议单元接收工作通道告警后启动恢复操作, 即该恢复协议单元从当 前网络的空闲资源中为业务重新计算出一条新通道, 并与新通道中其它节点 的恢复协议单元进行信令交互; 然后, 新通道中各个节点的恢复协议单元分 别向本节点的执行单元下发切换指令; 最后, 新通道中各个节点的执行单元 执行恢复操作, 将业务切换到新通道, 保证业务的正常运行。 恢复操作具有 较高的网络资源利用率, 但是需要对业务通道进行实时计算, 业务受损时间 相对较长。 对于较高 QoS要求的业务, 往往同时为该业务配备 APS功能与恢复功 能。 参照图 1 , 装置包括工作通道检测单元、 保护协议单元与恢复协议单元;
工作通道检测单元检测到工作通道告警时, 将其上 ·ί艮给保护协议单元与恢复 协议单元; 保护协议单元接收到工作通道告警后立即启动 APS操作, 而恢复 协议单元接收到工作通道告警后等待预设时长 holdoff之后,查看工作通道检 测单元是否仍检测到告警, 若是, 启动恢复操作, 否则, 业务已正常运行, 无需启动恢复操作。 由于保护协议单元与恢复协议单元之间缺乏信令交互, 在 APS功能失效的情况下, 如业务的工作通道与保护通道均存在故障, 恢复 操作应立即启动时, 恢复协议单元必须等待 holdoff之后才启动恢复操作,使 得业务的受损时间增加了 holdoff, 拉长了业务中断时间。 发明内容 本发明要解决的技术问题是提供一种协调 APS操作与恢复操作的装置 及方法, 减少了 APS功能失效的情况下业务的受损时间。 本发明解决其技术问题所釆用的技术方案是: 一种协调 APS操作与恢复操作的装置, 包括工作通道检测单元、 保护 协议单元以及恢复协议单元; 所述工作通道检测单元用于对业务的工作通道进行故障监视,并在所述 工作通道出现故障时上 4艮告警给所述保护协议单元和恢复协议单元; 所述保护协议单元用于接收所述工作通道告警,并判断是否需要立即启 动恢复操作, 且需要时, 通知所述恢复协议单元立即启动恢复操作, 否则, 启动 APS操作; 所述恢复协议单元用于接收所述工作通道告警和所述立即启动恢复操 作通知, 当接收到所述立即启动恢复操作通知时, 或者当接收到的所述工作 通道告警超时依然存在时, 启动恢复操作。 所述装置还包括保护通道检测单元,用于对业务的保护通道进行故障监 视, 并在所述保护通道出现故障时上 告警给所述保护协议单元; 所述保护 协议单元还用于接收所述保护通道告警。 所述保护协议单元还用于判断接收到的告警的类型,并在接收到保护通 道告警时记录所述告警信息并将所述保护通道的状态标记为有告警。
一种协调 APS操作与恢复操作的方法, 包括以下步骤: 步骤 a: 当前业务的工作通道产生故障时, 工作通道检测单元上 4艮工作 通道告警给本节点的保护协议单元以及恢复协议单元; 步骤 b: 所述恢复协议单元接收所述工作通道告警后启动定时器, 所述 保护协议单元接收所述工作通道告警后判断是否需要立即启动恢复操作, 若 是, 进入步骤 c; 步骤 c: 所述保护协议单元通知所述恢复协议单元立即启动恢复操作, 所述恢复协议单元收到通知后立即启动恢复操作, 本次告警处理完毕。 所述步骤 b中, 若判断的结果为无需立即启动恢复操作, 则所述保护协 议单元启动 APS操作;所述恢复协议单元等待所述定时器超时后查看所述工 作通道检测单元是否依然存在告警, 若是, 则启动恢复操作, 之后本次告警 处理完毕, 否则, 本次告警处理完毕。 上述方法中, 若当前业务的保护通道产生故障, 则保护通道检测单元上 艮保护通道告警给所述本节点的保护协议单元。 所述保护协议单元接收到告警后, 判断所述告警的类型, 若为工作通道 告警, 则转入步骤 b执行; 若为保护通道告警, 则记录所述告警信息并将所 述保护通道状态标记为有告警。 所述步骤 b中,保护协议单元釆用如下方法判断是否需要立即启动恢复 操作: 步骤 A: 查看当前业务的保护通道状态是否为有告警, 若是, 需要立即 启动恢复操作, 否则, 执行步骤 B; 步骤 B: 查看所述业务的保护组是否处于非使能状态, 若是, 需要立即 启动恢复操作, 否则, 无需立即启动恢复操作。 所述保护协议单元通过但不限于数据通信网络 ( Data Communication Network, DCN ), 高级数据链路控制( High Level Data Link Control, HDLC ) 协议总线、 中央处理器 ( Central Processing Unit , 简称为 CPU ) 内部进程间 通讯方式通知所述恢复协议单元立即启动恢复操作。
所述保护通道检测单元通过但不限于 DCN、 HDLC协议总线方式将告 警上 ~¾给所述保护协议单元。 本发明的有益效果主要表现在: 本发明所述协调 APS操作与恢复操作 的方法可通过本发明所述协调 APS操作与恢复操作的装置实现;该装置中增 加了保护通道检测单元, 用于检测业务的保护通道, 实现了对保护通道故障 的监视, 同时, 增加了保护协议单元与恢复协议单元之间的通讯机制, 实现 了两者间信息的交互; 该方法中, 保护协议单元接收到业务的工作通道告警 后, 首先根据保护通道的状况确定是否需要立即启动恢复操作, 在 APS功能 失效的情况下, 通知定时器超时等待过程中的恢复协议单元立即启动恢复操 作, 从而减少了业务的受损时间。 附图说明 图 1为目前协调 APS与业务恢复操作的装置结构示意图; 图 2为本发明所述装置的结构示意图; 图 3是本发明所述方法的流程图; 图 4是保护协议单元处理告警的流程图; 图 5是恢复协议单元处理告警的流程图; 图 6是本发明实施例一的网络拓朴结构示意图; 图 7是本发明实施例二的网络拓朴结构示意图。 具体实施方式 图 1 已在背景技术中加以描述, 此处不再赞述, 下面结合其它附图对本 发明作进一步的描述。 参照图 2, —种协调 APS操作与恢复操作的装置, 用于光网络中每个节 点, 包括工作通道检测单元、 保护通道检测单元、 保护协议单元以及恢复协 议单元; 工作通道检测单元用于对业务的工作通道进行故障监视,并在工作通道 出现故障时上 4艮告警给保护协议单元和恢复协议单元;
保护通道检测单元用于对业务的保护通道进行故障监视,并在保护通道 出现故障时上 4艮告警给保护协议单元; 保护通道检测单元通过 DCN、 HDLC 协议总线方式将告警上报给保护协议单元; 保护协议单元用于接收上报的告警, 并用于判断接收到的告警的类型, 在接收到保护通道告警时记录告警信息并将保护通道的状态标记为有告警; 以及在接收到工作通道告警时判断是否需要立即启动恢复操作, 需要时, 通 知恢复协议单元立即启动恢复操作, 不需要时, 启动 APS操作; 保护协议单 元通过 DCN、 HDLC协议总线、 CPU 内部进程间通讯方式通知恢复协议单 元启动恢复操作。 恢复协议单元用于接收工作通道告警并启动定时器,以及在接收到保护 协议单元的立即启动恢复操作通知时, 或者在定时器超时但工作通道检测单 元依然存在告警时, 启动恢复操作。 也就是说,恢复协议单元用于接收工作通道告警和立即启动恢复操作通 知, 当接收到立即启动恢复操作通知时, 或者当接收到的工作通道告警超时 依然存在时, 启动恢复操作。 通过该实施例, 在上述装置中增加了保护通道检测单元, 用于检测业务 的保护通道, 实现了对保护通道故障的监视, 同时, 增加了保护协议单元与 恢复协议单元之间的通讯机制, 实现了两者间信息的交互。 参照图 3 , 本发明所述协调 APS操作与恢复操作的方法, 具体包括以下 步骤: 步骤 301: 工作通道检测单元和保护通道检测单元分别监测当前业务的 工作通道和保护通道的状态, 若当前业务的工作通道发生故障, 则工作通道 检测单元上 ·ί艮工作通道告警给保护协议单元和恢复协议单元; 若当前业务的 保护通道发生故障, 则保护通道检测单元上 ~¾保护通道告警给所述保护协议 单元; 步骤 302: 保护协议单元和恢复协议单元同时处理接收到的告警; 并根 据处理结果启动相应操作。 通过该实施例, 保护协议单元接收到业务的工作通道告警后, 首先根据 保护通道的状况确定是否需要立即启动恢复操作, 在 APS 功能失效的情况
下, 通知定时器超时等待过程中的恢复协议单元立即启动恢复操作, 从而减 少了业务的受损时间。 保护协议单元处理告警的具体步骤如图 4所示, 包括: 步骤 401 :保护协议单元判断接收到的告警的类型,若为保护通道告警, 则执行步 4聚 402; 若为工作通道告警, 则执行步 4聚 403; 步骤 402: 记录告警信息并将保护通道的状态标记为有告警, 本次告警 处理结束; 步骤 403 : 判断是否需要立即启动恢复操作, 若是, 执行步骤 404; 否 则, 执行步 4聚 405; 本步骤通过查看当前业务保护通道的状态, 若可用, 则不需要立即启动 恢复操作; 若不可用, 则需要立即启动恢复操作, 保护通道不可用情况包括:
( 1 ) 存在故障告警时; ( 2 ) 保护通道处于 "非使能"状态时; ( 3 ) 保护通道已经不能保证业务的正常运行。 步 4聚 404: 通知恢复协议单元启动恢复操作; 步骤 405 : 启动 APS操作。 通过该实施例, 实现了保护协议单元处理告警的具体处理过程。 恢复协议单元处理告警的具体步骤如图 5所示, 包括: 步骤 501 : 恢复协议单元启动定时时时长为 holdoff的定时器; 步骤 502: 判断是否收到来自保护协议单元的通知, 若是, 则执行步骤
505; 否则, 执行步 4聚 503; 步骤 503 : 判断定时器是否到时, 若是, 则执行步骤 504; 否则, 执行 步骤 502; 步骤 504;查看工作通道检测单元是否存在告警,若是,则执行步骤 505; 否则, 本次告警处理结束;
步骤 505 : 启动恢复操作, 本次告警处理结束。 通过该实施例, 实现了恢复协议单元处理告警的具体处理过程。 如图 6所示,是本发明在业务具有 1+1保护与恢复属性情况下的实施例 网络拓朴结构示意图; 图中, 节点 A与节点 C之间存在一对业务 1 , 其工作 路径为节点 A、 节点 B与节点 C, 该业务具有 1+1保护属性, 其保护路径为 节点 A、 节点 I与节点 C; 同时, 该业务还具有恢复属性, 当其工作路径与 保护路径都出现故障时, 需要对业务进行恢复操作。 假设业务 1的保护路径中的跨段 9出现故障,则保护协议单元的处理流 程为: 步骤 6a: 节点 A与节点 C的保护协议单元接收到告警; 步骤 6b: 保护协议单元对告警资源的类型进行判断, 确定为保护通道 告警; 步骤 6c: 保护协议单元记录该告警信息, 并将业务 1 的保护通道的状 态标记为"有告警,,。 设此时业务 1的工作路径的跨段 1也出现了故障,则保护协议单元的 处理流程为: 步骤 6a,: 保护协议单元对告警资源的类型进行判断, 确定为工作通道 告警; 步骤 6b,: 保护协议单元查看业务 1的保护通道的状态, 为有告警, 因 此需要立即启动恢复操作; 步骤 6c,: 保护协议单元向恢复协议单元发出"立即启动恢复操作,,的通 知。 跨段 1 出现故障后恢复协议单元的处理流程为: 步骤 6A: 恢复协议单元接收到业务 1的工作通道告警; 步骤 6B: 启动定时时间为 holdoff的定时器, 其中 holdoff的值设置为
50ms (因为 1+1保护的倒换时间一般都在 50ms以内);
步骤 6C: 等待定时器超时期间, 接收到了来自于保护协议单元的通知; 步骤 6D: 立即启动恢复操作, 即启动 ASON的业务恢复机制, 以尽快 为业务 1进行重路由, 减少业务受损时间。 在步骤 6D中, 通过 ASON的业务恢复机制, 最终给业务 1重路由得到 的恢复路径为节点 A、 节点 G、 节点 F、 节点 E、 节点 D到节点 C, 即图 6 中的细虚线所示路径。 在本实施例中, 当业务 1的工作通道与保护通道都出 现故障告警时, 因为没有等待 holdoff时间就利用 ASON对业务 1进行了恢 复操作, 因此减少了业务 1的中断时间。 如图 7所示,是本发明在业务具有复用段共享保护与恢复属性情况下的 实施例网络拓朴结构示意图; 其中, 节点 E与节点 G之间存在一对业务(记 为业务 2 ), 其工作路径为节点 E、 节点 F与节点 G; 业务 2具有复用段共享 保护属性, 当跨段 5 出现故障后, 业务 2的保护路径为节点 E、 节点 D、 节 点 、 节点 K、 节点 J、 节点 I、 节点 G、 节点 F、 然后再到节点 G; 同时, 业务 2还具有恢复属性, 当其工作路径与保护路径都出现故障时, 需要对业 务进行恢复操作。 具体地, 本实施例涉及以下步骤: 当业务 2的保护路径中的跨段 10出现故障后, 保护协议单元的处理流 程为: 步骤 7a: 节点 I与节点 J的保护协议单元接收到告警; 步骤 7b: 保护协议单元判断告警的类型, 确定为保护通道告警; 步骤 7c: 保护协议单元记录该告警信息, 将业务 2的保护通道的状态 标记为 "有告警"。 当业务 2的工作路径的跨段 5出现故障后,保护协议单元的处理流程为: 步骤 7a,: 保护协议单元判断告警类型, 确定为工作通道告警; 步骤 7b,: 保护协议单元查看业务 2的保护通道的状态, 为有告警, 由 于业务 2的工作通道与保护通道都存在故障告警, 所以需要立即启动恢复操 作; 步骤 7c,: 保护协议单元向恢复协议单元发出"立即启动恢复操作,,的通
知。 当跨段 5出现故障后恢复协议单元的处理流程为: 步骤 7A: 恢复协议单元接收到业务 2的工作通道资源的告警; 步骤 7B: 启动定时时间为 holdoff的定时器, 其中 holdoff的值设置为 50ms (因为复用段共享保护的倒换时间一般都在 50ms以内); 步骤 7C: 等待定时器超时期间, 恢复协议单元接收到了来自于保护协 议单元的立即启动恢复的通知; 步骤 7D: 立即启动恢复操作, 即启动 ASON的业务恢复机制, 以尽快 为业务 2进行重路由, 减少业务受损时间。 在步骤 7D中, 通过 ASON的业务恢复机制, 最终给业务 2重路由得到 的恢复路径为节点 E、 节点 H与节点 G, 如图 7中的细虚线所示路径。 在本 实施例中, 当业务 2的工作通道与保护通道都出现故障告警时, 因为没有等 待 holdoff时间就利用 ASON对业务 2进行了恢复操作, 因此减少了业务 2 的中断时间。 以上所述仅为本发明的较佳实施例而已,并非用于限制本发明的保护范 围。 应当理解的是, 对本发明技术所在领域的普通技术人员来说, 可以根据 本发明的技术方案及其构思进行相应的等同改变或替换, 而所有这些改变或 替换, 都应属于本发明所附权利要求的保护范围。
Claims
权 利 要 求 书 一种协调自动保护倒换操作与恢复操作的装置, 包括工作通道检测单 元、 保护十办议单元以及恢复十办议单元; 所述工作通道检测单元用于对 业务的工作通道进行故障监视, 并在所述工作通道出现故障时上 ^艮工 作通道告警给所述保护协议单元和所述恢复协议单元; 其特征在于: 所述保护协议单元用于接收所述工作通道告警, 并对所述工作通 道告警判断是否需要立即启动恢复操作, 且需要时, 通知所述恢复协 议单元立即启动恢复操作, 否则, 启动自动保护倒换操作;
所述恢复协议单元用于接收所述工作通道告警和所述立即启动 恢复操作通知, 当接收到所述立即启动恢复操作通知时, 或者当接收 到的所述工作通道告警超时依然存在时, 启动恢复操作。 如权利要求 1 所述的协调自动保护倒换操作与恢复操作的装置, 其特 征在于: 所述装置还包括保护通道检测单元, 用于对业务的保护通道 进行故障监视, 并在所述保护通道出现故障时上报保护通道告警给所 述保护协议单元; 所述保护协议单元还用于接收所述保护通道告警。 如权利要求 2所述的协调自动保护倒换操作与恢复操作的装置, 其特 征在于: 所述保护协议单元还用于判断接收到的告警的类型, 并在接 收到所述保护通道告警时记录所述保护通道告警的信息并将所述保护 通道的状态标记为有告警。 一种协调自动保护倒换操作与恢复操作的方法, 其特征在于, 包括以 下步骤:
当前业务的工作通道产生故障时, 工作通道检测单元上 4艮工作通 道告警给本节点的保护协议单元以及恢复协议单元;
所述恢复协议单元接收所述工作通道告警后启动定时器, 所述保 护协议单元接收所述工作通道告警后判断是否需要立即启动恢复操 作, 若是, 所述保护协议单元通知所述恢复协议单元立即启动恢复操 作, 所述恢复协议单元收到通知后立即启动恢复操作, 本次告警处理 完毕。
5. 如权利要求 4所述的协调自动保护倒换操作与恢复操作的方法, 其特 征在于: 在所述保护协议单元接收所述工作通道告警后判断是否需要 立即启动恢复操作之后, 若判断的结果为无需立即启动恢复操作, 则 所述保护协议单元启动自动保护倒换操作; 所述恢复协议单元等待所 述定时器超时后查看所述工作通道检测单元是否依然存在告警,若是, 则启动恢复操作, 之后本次告警处理完毕, 否则, 本次告警处理完毕。
6. 如权利要求 4所述的协调自动保护倒换操作与恢复操作的方法, 其特 征在于: 若当前业务的保护通道产生故障, 则保护通道检测单元上 4艮 保护通道告警给所述本节点的保护协议单元。
7. 如权利要求 4或 6所述的协调自动保护倒换操作与恢复操作的方法, 其特征在于: 所述保护协议单元接收到告警后, 判断所述告警的类型, 若为所述工作通道告警, 则所述恢复协议单元接收所述工作通道告警 后启动定时器, 所述保护协议单元接收所述工作通道告警后判断是否 需要立即启动恢复操作; 若为所述保护通道告警, 则记录所述保护通 道告警的信息并将所述保护通道的状态标记为有告警。
8. 如权利要求 7所述的协调自动保护倒换操作与恢复操作的方法, 其特 征在于, 所述保护协议单元釆用如下方法判断是否需要立即启动恢复 操作:
查看当前业务的所述保护通道的状态是否为有告警, 若是, 需要 立即启动恢复操作, 否则, 进一步查看所述业务的保护组是否处于非 使能状态, 若是, 需要立即启动恢复操作, 否则, 无需立即启动恢复 操作。
9. 如权利要求 4所述的协调自动保护倒换操作与恢复操作的方法, 其特 征在于, 所述保护协议单元通过但不限于数据通信网络、 高级数据链 路控制协议总线、 CPU内部进程间通讯方式通知所述恢复协议单元立 即启动恢复操作。
10. 如权利要求 6所述的协调自动保护倒换操作与恢复操作的方法, 其特 征在于, 所述保护通道检测单元通过但不限于数据通信网络、 高级数 据链路控制协议总线方式将告警上报给所述保护协议单元。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP10733237.1A EP2391060B1 (en) | 2009-01-22 | 2010-01-21 | Device and method for coordinating automatic protection switching operation and recovery operation |
US13/144,475 US8775869B2 (en) | 2009-01-22 | 2010-01-21 | Device and method for coordinating automatic protection switching operation and recovery operation |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910105244.5A CN101790110B (zh) | 2009-01-22 | 2009-01-22 | 一种协调自动保护倒换操作与恢复操作的装置及方法 |
CN200910105244.5 | 2009-01-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2010083764A1 true WO2010083764A1 (zh) | 2010-07-29 |
Family
ID=42355552
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2010/070303 WO2010083764A1 (zh) | 2009-01-22 | 2010-01-21 | 一种协调自动保护倒换操作与恢复操作的装置及方法 |
Country Status (4)
Country | Link |
---|---|
US (1) | US8775869B2 (zh) |
EP (1) | EP2391060B1 (zh) |
CN (1) | CN101790110B (zh) |
WO (1) | WO2010083764A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012131695A1 (en) * | 2011-03-31 | 2012-10-04 | Tejas Networks Limited | A method and system of protection switching in a network element |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102882590A (zh) * | 2011-07-15 | 2013-01-16 | 中兴通讯股份有限公司 | 一种双向工作路径故障消失后的处理方法及装置、系统 |
CN103001810B (zh) * | 2012-12-26 | 2015-12-23 | 盛科网络(苏州)有限公司 | 网络业务通道保护切换方法及系统 |
CN105515971B (zh) * | 2014-10-14 | 2019-11-05 | 中兴通讯股份有限公司 | 一种资源复用的分层Qos调度的实现方法及装置 |
JP2017038303A (ja) * | 2015-08-12 | 2017-02-16 | 富士通株式会社 | 受信装置及び警報情報の転送方法 |
CN107171820B (zh) * | 2016-03-08 | 2019-12-31 | 北京京东尚科信息技术有限公司 | 信息传输、发送、获取方法和装置 |
CN113518271B (zh) | 2020-04-10 | 2024-02-13 | 上海诺基亚贝尔股份有限公司 | 无源光网络中用于信道管理的方法、装置和系统 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1710869A (zh) * | 2005-07-14 | 2005-12-21 | 广东省电信有限公司研究院 | 自动交换光网络中连接的增强型主备保护的实现方法 |
CN1764132A (zh) * | 2004-10-22 | 2006-04-26 | 华为技术有限公司 | 一种备用通道好坏检测的方法 |
CN1815994A (zh) * | 2005-02-02 | 2006-08-09 | 华为技术有限公司 | 智能光网络双向复用段环网络保护倒换失败的检测方法 |
CN1874201A (zh) * | 2006-06-20 | 2006-12-06 | 中兴通讯股份有限公司 | 在接收设备共享配置下的光网络保护触发方法及装置 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2081040A1 (en) * | 1993-04-13 | 1994-10-14 | Nizar Ladha | Alarm panel with cellular telephone backup |
JP3450611B2 (ja) * | 1996-09-18 | 2003-09-29 | 富士通株式会社 | 障害情報管理装置 |
JP3581765B2 (ja) * | 1996-09-20 | 2004-10-27 | 株式会社日立コミュニケーションテクノロジー | 複合リング形ネットワークシステムにおけるパス切替方法及び装置 |
IT1308858B1 (it) * | 1999-10-29 | 2002-01-11 | Cit Alcatel | Metodo di segnalazione di guasti in reti di telecomunicazioni. |
US20030030862A1 (en) * | 2001-06-01 | 2003-02-13 | Joseph Trier | Device and method for monitoring signal characteristics of optical signals in an optical communications network |
US20040132409A1 (en) * | 2002-08-28 | 2004-07-08 | Siemens Aktiengesellschaft | Test method for message paths in communications networks and redundant network arrangements |
CN100395994C (zh) * | 2005-06-23 | 2008-06-18 | 华为技术有限公司 | 自动交换光网络中通道故障的处理方法 |
US20070046261A1 (en) * | 2005-08-17 | 2007-03-01 | Wojciech Porebski | Method and apparatus for temperature, conductance and/or impedance testing in remote application of battery monitoring systems |
US7647533B2 (en) * | 2006-04-25 | 2010-01-12 | Alcatel Lucent | Automatic protection switching and error signal processing coordination apparatus and methods |
CN101145951B (zh) * | 2007-06-21 | 2010-06-16 | 中兴通讯股份有限公司 | 通信网络1+1保护级联组网的实现方法 |
US7830784B2 (en) * | 2007-06-29 | 2010-11-09 | Verizon Patent And Licensing Inc. | Intelligent network restoration |
CN101150462A (zh) * | 2007-10-18 | 2008-03-26 | 中兴通讯股份有限公司 | 保护倒换方法 |
-
2009
- 2009-01-22 CN CN200910105244.5A patent/CN101790110B/zh active Active
-
2010
- 2010-01-21 WO PCT/CN2010/070303 patent/WO2010083764A1/zh active Application Filing
- 2010-01-21 US US13/144,475 patent/US8775869B2/en active Active
- 2010-01-21 EP EP10733237.1A patent/EP2391060B1/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1764132A (zh) * | 2004-10-22 | 2006-04-26 | 华为技术有限公司 | 一种备用通道好坏检测的方法 |
CN1815994A (zh) * | 2005-02-02 | 2006-08-09 | 华为技术有限公司 | 智能光网络双向复用段环网络保护倒换失败的检测方法 |
CN1710869A (zh) * | 2005-07-14 | 2005-12-21 | 广东省电信有限公司研究院 | 自动交换光网络中连接的增强型主备保护的实现方法 |
CN1874201A (zh) * | 2006-06-20 | 2006-12-06 | 中兴通讯股份有限公司 | 在接收设备共享配置下的光网络保护触发方法及装置 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012131695A1 (en) * | 2011-03-31 | 2012-10-04 | Tejas Networks Limited | A method and system of protection switching in a network element |
Also Published As
Publication number | Publication date |
---|---|
CN101790110B (zh) | 2012-12-19 |
EP2391060B1 (en) | 2016-08-31 |
CN101790110A (zh) | 2010-07-28 |
US20110276825A1 (en) | 2011-11-10 |
US8775869B2 (en) | 2014-07-08 |
EP2391060A4 (en) | 2015-01-21 |
EP2391060A1 (en) | 2011-11-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2010083764A1 (zh) | 一种协调自动保护倒换操作与恢复操作的装置及方法 | |
CN100459601C (zh) | 网络中主备网关设备的实现方法 | |
WO2007056929A1 (fr) | Procede et appareil pour mettre en oeuvre la protection de groupes au sein d'un reseau mpls | |
JP5513342B2 (ja) | パケット中継装置 | |
WO2009082923A1 (fr) | Procédé de traitement de défaut de liaison et dispositif de transfert de données | |
WO2008055436A1 (fr) | Procédé de commande d'état de redémarrage progressif et de routeur | |
WO2008046358A1 (fr) | Procédé et dispositif destinés à réaliser une pénétration d'un statut de liaison de réseau point à multipoint | |
WO2006136072A1 (fr) | Procédé pour traiter une panne de canal dans un réseau optique automatiquement commuté | |
WO2017124791A1 (zh) | 链路检测方法及装置 | |
WO2011143876A1 (zh) | 业务节点主备切换方法和装置 | |
WO2011000240A1 (zh) | 呼叫中心坐席故障处理方法及系统 | |
WO2008119294A1 (fr) | Procédé et matériel de restauration du commerce en réseau | |
WO2013185567A1 (zh) | 一种分组传送网络保护倒换装置和方法 | |
US20190245735A1 (en) | Server apparatus, cluster system, cluster control method and program | |
WO2008043285A1 (fr) | Procédé et dispositif de protection pour réseau mixte | |
WO2007118395A1 (fr) | Procédé de prise en charge de service fondé sur la tolérance aux sinistres d'un dispositif, appareil de commutation de service et machine de réserve | |
WO2015196676A1 (zh) | 组网保护方法、装置及组网中的汇聚主用网元 | |
JP2009303092A (ja) | ネットワーク装置および回線切替方法 | |
WO2009082894A1 (fr) | Procédé, système et équipement pour la mise en œuvre d'une commutation automatique de protection entre cartes principales et de réserve | |
WO2009055995A1 (en) | Maintaining method for automatic switched optical network system when operation engenders alarm | |
WO2010003323A1 (zh) | 一种链路故障恢复的方法、系统和装置 | |
WO2010121459A1 (zh) | 一种自动交换光网络中实现保护与恢复的方法及系统 | |
WO2006089490A1 (fr) | Méthode d’implémentation de fec bfd | |
WO2011017900A1 (zh) | 一种以太网隧道分段保护方法及系统 | |
CN111865637B (zh) | 一种故障恢复方法及系统 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10733237 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13144475 Country of ref document: US |
|
REEP | Request for entry into the european phase |
Ref document number: 2010733237 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010733237 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |