WO2006005251A1 - Method and system for carrying out the function of switching in a communication system - Google Patents

Method and system for carrying out the function of switching in a communication system Download PDF

Info

Publication number
WO2006005251A1
WO2006005251A1 PCT/CN2005/000995 CN2005000995W WO2006005251A1 WO 2006005251 A1 WO2006005251 A1 WO 2006005251A1 CN 2005000995 W CN2005000995 W CN 2005000995W WO 2006005251 A1 WO2006005251 A1 WO 2006005251A1
Authority
WO
WIPO (PCT)
Prior art keywords
switching
communication system
standby
primary
data
Prior art date
Application number
PCT/CN2005/000995
Other languages
French (fr)
Chinese (zh)
Inventor
Zhaoxia Lv
Wei Tan
Original Assignee
Huawei Technologies Co. Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co. Ltd. filed Critical Huawei Technologies Co. Ltd.
Publication of WO2006005251A1 publication Critical patent/WO2006005251A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/40Bus networks
    • H04L12/40169Flexible bus arrangements
    • H04L12/40176Flexible bus arrangements involving redundancy
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/55Prevention, detection or correction of errors
    • H04L49/552Prevention, detection or correction of errors by ensuring the integrity of packets received through redundant connections
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/40Bus networks
    • H04L12/403Bus networks with centralised control, e.g. polling

Definitions

  • the present invention relates to communication technologies, and more particularly to switching techniques in communication devices.
  • Telecommunications networks require reliable, uninterrupted service for their subscribers, especially in applications such as e-money, order processing, customer service, inventory management, e-mail and Internet access. Sex becomes more important than ever, and its usability requirements can reach 99.999% or even higher. Therefore, network survivability has become an important factor affecting network design and construction, and the equipment used in telecommunications networks also needs high reliability.
  • Redundant backup refers to the replacement of multiple devices with the same function. Especially when the main device fails or requires maintenance, the backup device takes over the main device to continue communication, ensuring uninterrupted operation of the communication system.
  • Load sharing uses the same principle, using multiple devices to work alternately or collaboratively, increasing system reliability.
  • Redundant backup and active/standby switching technologies have multiple backup modes, such as 1+1, 1: 1, 1: N, and so on.
  • 1 + 1 backup is a simple protection method.
  • the primary and backup units in the system form a logical business function unit, that is, the protected equipment unit has a standby unit.
  • the main module is responsible for real-time processing of services, and the standby service is maintained.
  • the data of the active unit is the same.
  • the switchover is initiated to take over the service on the original active unit.
  • the spare unit in the 1 : 1 backup starts working only after a failure.
  • the 1 : N backup means that multiple device units share a spare unit. When any one of the primary units fails, the standby unit takes over.
  • a complex telecommunication device contains a plurality of processing units that work together, and communication needs to be performed between the units; and in the entire communication network, communication between different node devices is also required.
  • the switchover In order to ensure the self-healing capability of the device under network fault conditions, ensure data integrity and maintain service quality, the switchover must be completed within a specified time limit to restore the service as soon as possible, so as to minimize the impact on the communication system.
  • components that are mutually standby are logically the same functional entity, normal work.
  • the standby unit does not provide services externally, but needs to be consistent with the data of the primary unit as much as possible to ensure that when the primary unit fails, the standby unit is upgraded to the primary unit, and the service loss is minimal.
  • the manual switching is performed by the operator to send a switching command through the maintenance station.
  • the automatic fault switching refers to the failure detection of the main board by the system fault detection module.
  • the active/standby switchover is triggered.
  • the switchover of the board is used as an example.
  • the automatic manual switchover is caused by the abnormality of the peripheral modules involved in the active board or because the main board is to be pulled out or because of some
  • the active and standby boards also need to be switched back to the original active/standby relationship. In this case, the system automatically triggers. Active/standby switchover.
  • the identity of the primary and backup units is determined by the underlying hardware.
  • the standby unit immediately goes up to the primary use, and then notifies the upper application module, and further informs the relevant other orders.
  • the board has been switched between the active and standby boards.
  • FIG. 1 shows the structure of a switching system employed in the prior art communication system.
  • the active and standby units are connected to other devices of the communication system through the circuit switching module.
  • the circuit switching module is implemented through a switched network.
  • the switching network is switched by the physical address exchange of the primary and backup units.
  • Each business processing unit maintains a physical connection with both the primary and backup units.
  • the underlying hardware performs circuit switching, switches the standby unit to the working plane, separates the active unit, and then notifies all upper application modules and service modules to communicate with the new active unit.
  • the switching technology used in the communication system is directly carried out by the underlying hardware, and the upper application module and the service module are in a passive state, which is not conducive to high-level equipment maintenance; and since the switching is direct It is implemented by circuit switching, so it has no transparency to the upper layer application, and the switching has a great influence on the system; again, the switching process is mechanically completed by the circuit, which is not flexible and cannot support automatic manual switching; in general, device switching It also requires data transfer, reconfiguration of spare devices, etc., which requires a long delay. As a result, the demand for real-time services and high service quality is far from being met in the telecommunication network.
  • the above solutions have the following problems: Since the device switching is initiated and executed by the underlying hardware, the upper-layer application is in the passive processing state and is not supported by the high-level device maintenance; due to the data transfer between the primary and backup devices and the standby device. Reconfiguration takes more time, so the switching operation has a large delay and cannot meet real-time requirements. Since the switching is realized by physical address conversion or circuit switching, it has no transparency and flexibility, so it has a greater impact on the entire communication system. .
  • the present invention provides a method for switching devices in a communication system, including:
  • step A The process of the reverse preprocessing described in step A includes:
  • step B Determine whether the system resources meet the switching conditions. If yes, go to step B. Otherwise, terminate the switching process.
  • the process of the reverse preprocessing described in step A may further include:
  • step A2. Determine whether the service resource meets the switching condition. If yes, go to step B. Otherwise, terminate the switching process.
  • the method further comprises the steps of:
  • the service data of the active device is backed up to the standby device to perform smooth connection operation of the service data.
  • step C After the step C, a step is further included,
  • the switching of the underlying communication channel is implemented by a mapping mechanism of a logical address and a physical address.
  • the data synchronization is implemented by using a warm backup between the active device and the standby device.
  • the method further includes: determining whether a switching abnormality occurs, and if so, performing a switching recovery process.
  • the switching recovery process includes:
  • the switching recovery process may also include:
  • the configuration of the primary device is restored, and the primary device is notified to operate as a primary device.
  • the switching recovery process may further include:
  • the present invention also provides a switching system for a device in a communication system, comprising: a control module, configured to control execution of a switching process;
  • a main execution module configured to perform, according to an instruction of the control module, a switching process of an underlying communication channel and configuration data related to the active device;
  • an alternate execution module configured to perform, according to an instruction of the control module, a switching process of an underlying communication channel and configuration data related to the standby device.
  • the primary execution module and the standby execution module are further configured to back up service data of the primary device to the standby device and smooth processing of service data according to an instruction of the control module.
  • the technical solution of the present invention is different from the prior art in that the division control module and the main standby execution module separate the switching control function and the execution function to facilitate the switching control; the switching process is divided into multiple execution steps and states.
  • the switchover process control and state transition are implemented.
  • the logical address and physical address mapping mechanism are adopted to implement transparent and fast switching of the underlying communication channel.
  • the data backup and service smoothing mechanism before and after switching, and the warm backup mode of the active and standby devices are used to improve the switching. Speed and reliability; for the abnormal situation in the switching process, switching recovery is achieved.
  • the difference in this technical solution brings about a more obvious beneficial effect, namely the control function and
  • the separation of the execution functions improves the efficiency and flexibility of the switching;
  • the implementation of the switching process state migration improves the flexibility of the switching and supports the switching recovery;
  • the address mapping mechanism implements the transparency of the switching and reduces the switching to the system.
  • the impact of the switching speed is improved; data backup and service smoothing and warm backup mode improve the switching speed and reliability; and the switching recovery mechanism enhances the robustness of the switching.
  • FIG. 1 is a schematic structural view of a switching system used in a conventional communication system
  • FIG. 2 is a schematic structural view of a switching system according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of an address mapping mechanism in a switching system according to an embodiment of the present invention.
  • FIG. 4 is a flowchart of a switching method according to an embodiment of the present invention.
  • the invention adopts a step-by-step process under control to complete the switching process, and logically divides the entire switching operation into a plurality of sub-steps, thereby improving the flexibility of the switching operation.
  • the execution function and the control function are separated, and are respectively implemented by the execution module and the control module.
  • the transparency of the switching operation is achieved by mapping the logical address and the physical address, and the impact of the switching on the entire communication system is minimized.
  • the temperature backup mechanism of the data is used to maintain synchronization between the primary and backup units, thereby improving the speed of performing the active/standby switchover.
  • the service smoothing method is adopted to ensure a smooth transition to the service during the switching.
  • the primary backup device involved is a single board device.
  • the control module 201 is configured to control the execution of the switching step, and is connected to the active execution module 202 and the standby execution module 203.
  • the primary execution module 202 is configured to perform a switching step on the active board under the control of the control module 201.
  • the standby execution module 203 is configured to perform a switching step on the standby board under the controllability of the control module.
  • the active execution module 202 is distributed on the active board, and the standby execution module 203 is distributed on the standby board.
  • the control module 201 can be distributed on the main standby board or separately distributed on another board. Separating the control function and the execution function into separate modules, so that the switching operation can be implemented step by step, which enhances the reliability and flexibility of the switching, so that the switching can be step by step. Reply back.
  • Figure 3 illustrates an address mapping mechanism in a switching system in accordance with one embodiment of the present invention.
  • First define logical addresses and physical addresses for each board.
  • the application layer modules use logical addresses to communicate with each other.
  • An address mapping relationship exists between the logical address and the physical address.
  • the address mapping relationship is maintained by the underlying module.
  • the underlying module switches the mapping relationship between the logical address and the physical address, that is, transparent switching is implemented.
  • the communication system includes the device board A, the device board B, and its standby board B.
  • the board B is the main board
  • the board B' is the standby board.
  • the physical address of the board B is 301
  • the physical address of the board B' is 302.
  • the physical address 301 of the board B corresponds to the logical address 303
  • the physical address 302 of the board ⁇ corresponds to the logical address 304.
  • the other boards in the communication system such as the board, access the active board through logical address 303 to communicate with the active board.
  • the address mapping mechanism switches the address mapping relationship, mapping the physical address 301 of the active board to the logical address 304, and mapping the physical address 302 of the standby board B' to the logical address 303.
  • the board corresponding to the physical address 302 is accessed through the mapping relationship, so that the board actually communicates with the board B'.
  • the board B' becomes the active board, and the board B becomes the standby board.
  • the upper application module, or other boards, are not affected before and after the entire switching operation.
  • the above address mapping mechanism is adopted to implement fast switching of the underlying communication channel, which realizes fast processing of switching and reduces service interruption time. Since the operation of the address mapping relationship does not involve hardware switching, the reliability of the switching operation is improved. At the same time, for other service modules, the impact of switching is shielded and transparent.
  • FIG. 4 illustrates a flow of a switching method in accordance with one embodiment of the present invention. According to the foregoing, the switching process is divided into multiple steps, and the control module completes the control function, and the execution module completes the specific operation.
  • the trigger message is first transmitted to the control module, and the control module initiates the switching operation.
  • the control module performs a pre-processing of the switching to determine whether the system resource satisfies the switching condition. Because the switching operation requires certain system resource conditions to complete, before the switching is performed, in order to ensure the reliability of the switching, the control module first performs system resource judgment. For example, if the configuration of the standby board is complete, if the standby board is not configured, the active board cannot be used for communication. The switching operation cannot be performed. The control module will terminate the switching process. In the case of manual switching, the control module also determines whether the system resources are sufficient. For example, when the standby board runs abnormally, a warning signal is issued to notify the operator that the switching cannot be performed.
  • step 402 the control module sends a command to the active execution module and the standby execution module, and the primary execution module and the standby execution module perform reverse switching preprocessing to determine whether the service resource satisfies the switching condition. Because the switching operation has a certain impact on the service, the execution module must judge the currently running service before performing the switching. For example, when the service is busy or the real-time requirement is high, it is judged that the switching condition is not satisfied. , terminate the switch or postpone the switch, or issue a warning that the switch is unsafe.
  • the service execution backup process is performed by the active execution module and the standby execution module.
  • the data between the standby board and the active board cannot be synchronized. Even if the data is used in the warm backup mode, the data cannot be completely synchronized. Therefore, you need to back up the data of the active board to the standby board before the switchover is performed. Make the primary and backup data completely consistent, and ensure that the business communication can be smoothly connected after the switching occurs.
  • step 404 under the control of the control module, the switching process of the underlying communication channel is completed by the primary execution module and the standby execution module, that is, the address mapping relationship between the logical address and the physical address is modified. Map the logical address of the original active board to the physical address of the original standby board. This step completes the transparent switchover of the active and standby boards, and blocks the impact on the high-level application modules and other unrelated boards, and can be completed in a faster time.
  • step 405 under the control of the control module, the system-related configuration data switching process is completed by the main execution module and the standby execution module.
  • a device board needs to be configured to work properly. Therefore, after the switchover is completed, system-related configuration data processing is required. For example, you need to configure the status of the active and standby boards on the bottom layer so that the active board does not send data.
  • the standby board communicates with other boards in the system.
  • step 406 the main processing module and the standby execution module complete the smooth processing of the service data.
  • the execution module monitors the communication service of the primary and backup boards, and adopts corresponding policies in special cases to ensure that services are not as good as possible. Affected by the switching operation. For example, Check the communication data of the board. If the communication is abnormal, send a warning signal. If the active and standby boards are in the same state, the standby board is used. The data communicates and causes the active board to enter the standby state.
  • step 402 is omitted in the flow of the above-described switching method, if it is not necessary.
  • the active board is faulty, and the standby unit must take over the communication.
  • the impact on the service is unavoidable.
  • the data is designed to be a warm backup method, and the actual impact on the business will be small.
  • step 403 is omitted.
  • the main board is faulty due to a hardware failure, it is impossible to perform the recovery operation. Therefore, it is not necessary to back up the configuration data.
  • a board that serves as an active/standby redundant backup adopts a warm backup mode to ensure data synchronization and service smooth switching.
  • the service data on the main board changes, the service data is backed up to the standby board.
  • the running data on the active and standby boards is always consistent.
  • the manual switch is used to back up the service data to the standby board.
  • the warm backup is performed. Therefore, when the switchover occurs, the backup is required.
  • the amount of data is very small, and the time taken for data backup is relatively small, thereby increasing the speed of switching and reducing the impact on system and service communication.
  • the method of performing the switching of the switching process is also used to implement the switching recovery.
  • the switchover recovery means that the system can perform the switchover recovery according to the situation during the process of the switchover or the switchover.
  • the master/slave board is restored to the state before the switchover. This provides support for automatic manual switching in certain special situations. For example, the board switching process initiated by the operation and maintenance personnel requires the offline operation of the main board for a certain period of time.
  • the switching process is first divided into a plurality of stages, and the state control method is used to control the switching.
  • the switching recovery processing can be performed, and the state before the switching is restored as much as possible. Details below In the switching step, when an abnormal situation occurs, the switching recovery process is as follows:
  • step 401 or step 402 in the pre-processing stage of the switching, since the substantial switching operation has not been performed, if the switching fails, for example, the switching condition determination is not satisfied, the switching process is directly ended, and no switching is required. restore.
  • step 403 in the data backup phase, the data backup does not affect the normal operation of the original active board, but the data is backed up on the standby board. Therefore, when the switchover fails, the switchover recovery operation only needs to restore the standby board. For example, simply reset and restart the standby board.
  • step 404 if an abnormality occurs in the underlying communication channel switching phase, the switchover recovery operation is performed, and the underlying communication channel is switched to the state before the switchover, and the configuration of the original active board is restored, and the device is notified to operate as the active board.
  • step 405 in the system configuration data switching phase, if an abnormality occurs, the switching recovery operation is performed, and the related configuration data is restored to the original active/standby correspondence relationship, and then the underlying communication channel is switched to the state before the switching, and finally the original primary recovery is resumed.
  • the service module of the board is notified that it is running as the active board.
  • the system smooth upgrade and hardware maintenance of the uninterrupted service are implemented by the above-mentioned switching recovery mechanism. For example, if you use the switchover recovery, you can remove the active board from the system for a period of time, perform operation or maintenance, and then resume the operation of the active board by the switchover recovery mechanism. In this process, the standby board is used. The board replaces the main board, so the service is not interrupted.

Abstract

The present invention discloses a method and system for carrying out the function of switching in a communication system. The invention separates a switch-controlling function and a performing function by dividing a controlling module and a master-backup performing module. The switch process comprises several steps and phases to achieve process controlling and phase shifting. The mapping between a logic address and a physical address is used to achieve a rapid switching for a communication channel in a lower layer. The data backup and smoothing process and the hot backup are used to improve the speed and the reliability of the switching. Theswitching recovery is achieved when a failure is occurred.

Description

通信系统中设备的倒换方法及系统  Method and system for switching equipment in communication system
技术领域 本发明涉及通信技术, 特别涉及通信设备中的倒换技术。 TECHNICAL FIELD The present invention relates to communication technologies, and more particularly to switching techniques in communication devices.
背景技术 Background technique
电信网络要求能为其用户提供可靠的不间断的服务, 尤其是在一些重 要业务的应用中, 如电子货币、 订单处理、 客户服务、 库存管理、' 电子邮 件和国际互联网接入等, 业务生存性变得比以往更加重要, 其可用性要求 能达到 99. 999%甚至更高。 因此, 网络生存能力成为影响网络设计与构建 的重要因素, 而在电信网络中使用的设备也相应的需要有很高的可靠性。  Telecommunications networks require reliable, uninterrupted service for their subscribers, especially in applications such as e-money, order processing, customer service, inventory management, e-mail and Internet access. Sex becomes more important than ever, and its usability requirements can reach 99.999% or even higher. Therefore, network survivability has become an important factor affecting network design and construction, and the equipment used in telecommunications networks also needs high reliability.
在通信系统中, 冗余备份和负荷分担是常用的增强设备可靠性的方法。 冗余备份是指采用多个相同功能的设备进行替换工作, 特别是在当主用设 备出现故障或需要维护的情况下, 备用设备接替主用设备继续进行通信, 保证通信系统的不中断运行。 负荷分担采用相同的原理, 利用多个设备的 交替工作或协同工作, 增加系统的可靠性。  In communication systems, redundant backup and load sharing are common methods of enhancing device reliability. Redundant backup refers to the replacement of multiple devices with the same function. Especially when the main device fails or requires maintenance, the backup device takes over the main device to continue communication, ensuring uninterrupted operation of the communication system. Load sharing uses the same principle, using multiple devices to work alternately or collaboratively, increasing system reliability.
对于采用冗余备份的组件, 必须提供主备倒换的机制以实现可靠性和 可维护性的需求。 冗余备份和主备倒换技术有多种备份模式, 比如 1+1, 1: 1, 1: N等。 1 + 1备份是一种简单的保护方式, 系统中的主备用单元组成 一个逻辑上的业务功能单元, 即被保护设备单元拥有一个备用单元, 主用 模块负责业务的实时处理, 备用业务保持与主用单元的数据一致, 在检测 到原主用单元故障时发起倒换, 接管原主用单元上的业务。 与 1 + 1备份不 同的 1 : 1备份中备用单元只在出现故障后开始工作。 而 1 : N备份就是多 个设备单元共享一个备用单元, 当任意一个主用单元出现故障时, 备用单 元即接替其工作。  For components with redundant backup, an active/standby switchover mechanism must be provided for reliability and maintainability. Redundant backup and active/standby switching technologies have multiple backup modes, such as 1+1, 1: 1, 1: N, and so on. 1 + 1 backup is a simple protection method. The primary and backup units in the system form a logical business function unit, that is, the protected equipment unit has a standby unit. The main module is responsible for real-time processing of services, and the standby service is maintained. The data of the active unit is the same. When the fault of the original primary unit is detected, the switchover is initiated to take over the service on the original active unit. Unlike the 1 + 1 backup, the spare unit in the 1 : 1 backup starts working only after a failure. The 1 : N backup means that multiple device units share a spare unit. When any one of the primary units fails, the standby unit takes over.
通常, 一个复杂的电信设备包含多个协同工作的处理单元, 各单元之 间需要进行通信; 而在整个通信网络中, 不同节点设备之间也需要通信。 为了保证在网络故障条件下设备的自愈能力, 确保数据的完整性和维持服 务质量, 倒换必须在指定的时限内完成, 尽快恢复业务, 使对通信系统影 响最小。 一般的说, 互为主备用的组件逻辑上是同一个功能实体, 正常工 作时只有主用单元对外提供服务, 备用单元对外不提供服务、 但需要尽量 与主用单元的数据保持一致, 以保证主用单元故障时, 备用单元升为主用 单元后, 业务损失最少。 而这些功能都与所采用的倒换技术的性能密切相 关。 因此, 保护倒换机制的性能对于系统的可靠性乃至整个网络的生存能 力都有着重大的影响。 Generally, a complex telecommunication device contains a plurality of processing units that work together, and communication needs to be performed between the units; and in the entire communication network, communication between different node devices is also required. In order to ensure the self-healing capability of the device under network fault conditions, ensure data integrity and maintain service quality, the switchover must be completed within a specified time limit to restore the service as soon as possible, so as to minimize the impact on the communication system. Generally speaking, components that are mutually standby are logically the same functional entity, normal work. At the time of operation, only the primary unit provides external services, and the standby unit does not provide services externally, but needs to be consistent with the data of the primary unit as much as possible to ensure that when the primary unit fails, the standby unit is upgraded to the primary unit, and the service loss is minimal. These features are closely related to the performance of the switching technology used. Therefore, the performance of the protection switching mechanism has a significant impact on the reliability of the system and the viability of the entire network.
根据倒换触发的原因不同, 可以分为人工倒换、 自动故障倒换和自动 人工倒换等。 其中, 人工倒换, 是指通过维护台由操作人员发送倒换命令 等方式人为的触发主备倒换; 自动故障倒换, 是指由系统故障检测模块, 检测到主用单板发生故障, 并由系统自动触发的主备倒换; 以单板设备的 倒换为例: 自动人工倒换, 是指由于主用单板涉及到的周边模块发生异常、 或者由于主用单板要被拔出、 或者由于某些单板的特殊要求, 而需要将主 用单板停止使用一段时间进行维护, 并且在发生过倒换后的主备板还需要 倒换回原始的主备关系, 在这种情况下发生的系统自动触发的主备倒换。  According to the reasons of the triggering, it can be divided into manual switching, automatic fault switching, and automatic manual switching. The manual switching is performed by the operator to send a switching command through the maintenance station. The automatic fault switching refers to the failure detection of the main board by the system fault detection module. The active/standby switchover is triggered. The switchover of the board is used as an example. The automatic manual switchover is caused by the abnormality of the peripheral modules involved in the active board or because the main board is to be pulled out or because of some For the special requirements of the board, you need to stop the main board for a period of time for maintenance. After the switchover occurs, the active and standby boards also need to be switched back to the original active/standby relationship. In this case, the system automatically triggers. Active/standby switchover.
目前的通信系统中, 主、 备用单元的身份是由底层硬件确定的, 当硬 件发现主用单元异常时, 备用单元立即升为主用, 然后再通知上层应用模 块, 并进一步通知到相关其它单板发生了主备板切换。  In the current communication system, the identity of the primary and backup units is determined by the underlying hardware. When the hardware finds that the primary unit is abnormal, the standby unit immediately goes up to the primary use, and then notifies the upper application module, and further informs the relevant other orders. The board has been switched between the active and standby boards.
图 1 示出了现有通信系统中所采用的一种倒换系统的结构。 主用、 备 用单元通过电路切换模块与通信系统其他设备连接。 电路交换模块是通过 交换网络实现的, 在一般情况下, 交换网络是通过主备用单元的物理地址 的交换完成切换的。 各业务处理单元与主、 备用单元都保持物理连接。 当 倒换条件触发设备单元发生倒换时, 底层硬件进行电路切换, 将备用单元 切换到工作平面上, 主用单元分离, 然后通知所有上层应用模块、 业务模 块与新的主用单元发生通信。  Figure 1 shows the structure of a switching system employed in the prior art communication system. The active and standby units are connected to other devices of the communication system through the circuit switching module. The circuit switching module is implemented through a switched network. In general, the switching network is switched by the physical address exchange of the primary and backup units. Each business processing unit maintains a physical connection with both the primary and backup units. When the switching condition triggers the switching of the equipment unit, the underlying hardware performs circuit switching, switches the standby unit to the working plane, separates the active unit, and then notifies all upper application modules and service modules to communicate with the new active unit.
可见, 目前通信系统中所采用的倒换技术, 由于主、 备仲裁、 倒换通 知等都直接由底层硬件进行, 上层应用模块和业务模块处于被动状态, 不 利于高层的设备维护; 而且由于倒换是直接由电路交换实现, 因此对上层 应用不具有透明性, 倒换对系统造成影响较大; 再次, 倒换过程由电路机 械地完成, 不具有灵活性, 不能支持自动人工倒换; 在一般情况下, 设备 倒换时还需要进行数据转移、 备用设备的重新配置等, 需要较长的时延, 导致在电信网络中远远不能满足实时业务、 高服务质量的需求。 It can be seen that the switching technology used in the communication system is directly carried out by the underlying hardware, and the upper application module and the service module are in a passive state, which is not conducive to high-level equipment maintenance; and since the switching is direct It is implemented by circuit switching, so it has no transparency to the upper layer application, and the switching has a great influence on the system; again, the switching process is mechanically completed by the circuit, which is not flexible and cannot support automatic manual switching; in general, device switching It also requires data transfer, reconfiguration of spare devices, etc., which requires a long delay. As a result, the demand for real-time services and high service quality is far from being met in the telecommunication network.
在实际应用中, 上述方案存在以下问题: 由于设备倒换是由底层硬件 发起和执行的, 上层应用处于被动处理状态, 对于高层设备维护不支持; 由于主、 备用设备之间数据转移和备用设备的重新配置耗用较多时间, 因 此倒换操作延时大, 不能满足实时需求; 由于倒换是通过物理地址变换或 电路切换实现的, 不具有透明性和灵活性, 因此对于整个通信系统的影响 较大。  In practical applications, the above solutions have the following problems: Since the device switching is initiated and executed by the underlying hardware, the upper-layer application is in the passive processing state and is not supported by the high-level device maintenance; due to the data transfer between the primary and backup devices and the standby device. Reconfiguration takes more time, so the switching operation has a large delay and cannot meet real-time requirements. Since the switching is realized by physical address conversion or circuit switching, it has no transparency and flexibility, so it has a greater impact on the entire communication system. .
发明内容 Summary of the invention
有鉴于此, 本发明的主要目的在于提供一种通信系统中设备的倒换方 法及系统, 使得倒换的速度更快、 对正在运行的业务影响更小并对其他单 板具有透明性。  In view of this, it is a primary object of the present invention to provide a method and system for switching devices in a communication system that enables faster switching, less impact on running services, and transparency to other boards.
. 为实现上述目的, 本发明提供了一种通信系统中设备的倒换方法, 包 括:  To achieve the above object, the present invention provides a method for switching devices in a communication system, including:
A、 进行倒换预处理;  A. Perform reverse switching preprocessing;
B、 完成对主用设备和备用设备之间的底层通信通道的切换处理; B. Completing the switching process of the underlying communication channel between the primary device and the standby device;
C、 完成主用设备和备用设备配置数据的切换处理。 C. Complete the switching process of the configuration data of the primary device and the standby device.
步骤 A所述倒换预处理的过程包括:  The process of the reverse preprocessing described in step A includes:
Al、 判断系统资源是否满足倒换条件, 如果满足, 进入步骤 B, 否则, 终止倒换流程。  Al. Determine whether the system resources meet the switching conditions. If yes, go to step B. Otherwise, terminate the switching process.
步骤 A所述倒换预处理的过程还可以包括:  The process of the reverse preprocessing described in step A may further include:
A2、 判断业务资源是否满足倒换条件, 如果满足, 进入步骤 B, 否则 终止倒换流程。  A2. Determine whether the service resource meets the switching condition. If yes, go to step B. Otherwise, terminate the switching process.
在所述步骤 A和所述步骤 B之间, 还包含步骤:  Between the step A and the step B, the method further comprises the steps of:
将所述主用设备的业务数据备份到所述备用设备, 进行业务数据的平 滑接续操作。  The service data of the active device is backed up to the standby device to perform smooth connection operation of the service data.
在所述步骤 C之后还包含步骤,  After the step C, a step is further included,
进行业务数据的平滑处理操作, 保证业务的平滑接续。  Smooth processing of business data to ensure smooth connection of services.
在本发明中, 通过逻辑地址和物理地址的映射机制实现所述底层通信 通道的切换。 所述主用设备和所述备用设备之间采用温备份的方式实现数据同步。 上述方法还包括, 判断是否出现倒换异常, 如果出现, 执行倒换恢复 流程。 In the present invention, the switching of the underlying communication channel is implemented by a mapping mechanism of a logical address and a physical address. The data synchronization is implemented by using a warm backup between the active device and the standby device. The method further includes: determining whether a switching abnormality occurs, and if so, performing a switching recovery process.
所述倒换恢复流程包括:  The switching recovery process includes:
对所述备用设备进行数据恢复。  Data recovery is performed on the standby device.
所述倒换恢复流程也可以包括:  The switching recovery process may also include:
将所述底层通信通道切换恢复到倒换前的状态;  Reverting the underlying communication channel to a state before the switching;
恢复所述主用设备的配置, 并通知所述主用设备作为主用运行。 所述倒换恢复流程还可以包括:  The configuration of the primary device is restored, and the primary device is notified to operate as a primary device. The switching recovery process may further include:
恢复所述配置数据为原主、 备设备的对应关系;  Restoring the configuration data as the correspondence between the original primary and backup devices;
切换恢复所述底层通信通道到倒换前的状态;  Switching to restore the state of the underlying communication channel to before the switching;
恢复原所述主用设备的业务模块, 并通知其作为主用运行。  Restore the service module of the original primary device and notify it to operate as the primary.
本发明同时提供了一种通信系统中设备的倒换系统, 包括: 控制模块, 用于控制倒换流程的执行;  The present invention also provides a switching system for a device in a communication system, comprising: a control module, configured to control execution of a switching process;
主用执行模块, 用于根据所述控制模块的指令, 执行与主用设备相关 的底层通信通道和配置数据的切换处理;  a main execution module, configured to perform, according to an instruction of the control module, a switching process of an underlying communication channel and configuration data related to the active device;
备用执行模块, 用于根据所述控制模块的指令, 执行与备用设备相关 的底层通信通道和配置数据的切换处理。  And an alternate execution module, configured to perform, according to an instruction of the control module, a switching process of an underlying communication channel and configuration data related to the standby device.
其中, 所述主用执行模块和备用执行模块还用于根据所述控制模块的 指令, 将所述主用设备的业务数据备份到所述备用设备, 以及业务数据的 平滑处理。  The primary execution module and the standby execution module are further configured to back up service data of the primary device to the standby device and smooth processing of service data according to an instruction of the control module.
通过比较可以发现, 本发明的技术方案与现有技术的区别在于, 划分 控制模块和主备用执行模块, 将倒换控制功能和执行功能分离, 方便倒换 控制; 倒换流程划分为多个执行步骤和状态, 实现倒换流程控制和状态迁 移; 采用逻辑地址和物理地址映射机制, 实现底层通信通道的透明快速切 换; 采用倒换前后的数据备份和业务平滑处理机制, 以及主备设备的温备 份方式, 提高倒换的速度和可靠性; 针对倒换流程中的异常情况, 实现了 倒换恢复。  By comparison, it can be found that the technical solution of the present invention is different from the prior art in that the division control module and the main standby execution module separate the switching control function and the execution function to facilitate the switching control; the switching process is divided into multiple execution steps and states. The switchover process control and state transition are implemented. The logical address and physical address mapping mechanism are adopted to implement transparent and fast switching of the underlying communication channel. The data backup and service smoothing mechanism before and after switching, and the warm backup mode of the active and standby devices are used to improve the switching. Speed and reliability; for the abnormal situation in the switching process, switching recovery is achieved.
这种技术方案上的区别, 带来了较为明显的有益效果, 即控制功能和 执行功能的分离提高了倒换的效率和灵活性; 倒换流程状态迁移实现方式 提高了倒换的灵活性和实现了对倒换恢复的支持; 地址映射机制实现了倒 换的透明性, 并降低了倒换对系统的影响, 提高了倒换速度; 数据备份和 业务平滑以及温备份方式, 提高了倒换速度和可靠性; 倒换恢复机制增强 了倒换的鲁棒性。 The difference in this technical solution brings about a more obvious beneficial effect, namely the control function and The separation of the execution functions improves the efficiency and flexibility of the switching; the implementation of the switching process state migration improves the flexibility of the switching and supports the switching recovery; the address mapping mechanism implements the transparency of the switching and reduces the switching to the system. The impact of the switching speed is improved; data backup and service smoothing and warm backup mode improve the switching speed and reliability; and the switching recovery mechanism enhances the robustness of the switching.
附图说明 图 1是现有通信系统中所采用的一种倒换系统的结构示意图; 图 2是才艮据本发明的一个实施例的倒换系统的结构示意图; BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a schematic structural view of a switching system used in a conventional communication system; FIG. 2 is a schematic structural view of a switching system according to an embodiment of the present invention;
图 3是根据本发明的一个实施例的倒换系统中地址映射机制示意图; 图 4是才艮据本发明的一个实施例的倒换方法流程图。  3 is a schematic diagram of an address mapping mechanism in a switching system according to an embodiment of the present invention; and FIG. 4 is a flowchart of a switching method according to an embodiment of the present invention.
具体实施方式 为使本发明的目的、 技术方案和优点更加清楚, 下面将结合附图对本 发明作进一步地详细描述。 DETAILED DESCRIPTION OF THE INVENTION In order to make the objects, technical solutions and advantages of the present invention more comprehensible, the present invention will be further described in detail with reference to the accompanying drawings.
本发明采用控制下的分步流程完成倒换过程, 将整个倒换操作在逻辑 上分为多个子步骤, 提高倒换操作的灵活性。 将执行功能和控制功能分离, 分别由执行模块和控制模块实现。 利用逻辑地址和物理地址的映射实现倒 换操作的透明性, 将倒换对整个通信系统的影响降至最小。 采用数据的温 备份机制, 保持主备用单元之间的同步, 提高执行主备倒换的速度。 采用 业务平滑处理方法, 保证倒换时对业务平滑过渡。  The invention adopts a step-by-step process under control to complete the switching process, and logically divides the entire switching operation into a plurality of sub-steps, thereby improving the flexibility of the switching operation. The execution function and the control function are separated, and are respectively implemented by the execution module and the control module. The transparency of the switching operation is achieved by mapping the logical address and the physical address, and the impact of the switching on the entire communication system is minimized. The temperature backup mechanism of the data is used to maintain synchronization between the primary and backup units, thereby improving the speed of performing the active/standby switchover. The service smoothing method is adopted to ensure a smooth transition to the service during the switching.
在本发明的各个实施例中 , 所涉及的主备用设备为单板设备。  In various embodiments of the invention, the primary backup device involved is a single board device.
图 2示出了根据本发明的一个实施例的倒换系统的结构, 包含控制模 块 201、 主用执行模块 202和备用执行模块 203。 其中, 控制模块 201用于 控制倒换步骤的执行, 与主用执行模块 202和备用执行模块 203都相连; 主用执行模块 202用于在控制模块 201的控制下在主用单板上执行倒换步 骤; 备用执行模块 203用于在控制模块的控制性下在备用单板上执行倒换 步骤。 其中主用执行模块 202分布在主用单板上, 备用执行模块 203分布 在备用单板上。 而控制模块 201可以分布在主备用单板上, 也可以单独分 布在另外的单板上。 将控制功能和执行功能分离为独立的模块, 使得倒换 操作能够按步骤实现, 增强了倒换的可靠性和灵活性, 使得倒换能够按步 骤进行回复。 2 shows the structure of a switching system including a control module 201, a main execution module 202, and an alternate execution module 203, in accordance with an embodiment of the present invention. The control module 201 is configured to control the execution of the switching step, and is connected to the active execution module 202 and the standby execution module 203. The primary execution module 202 is configured to perform a switching step on the active board under the control of the control module 201. The standby execution module 203 is configured to perform a switching step on the standby board under the controllability of the control module. The active execution module 202 is distributed on the active board, and the standby execution module 203 is distributed on the standby board. The control module 201 can be distributed on the main standby board or separately distributed on another board. Separating the control function and the execution function into separate modules, so that the switching operation can be implemented step by step, which enhances the reliability and flexibility of the switching, so that the switching can be step by step. Reply back.
图 3示出了根据本发明的一个实施例的倒换系统中的地址映射机制。 首先对每个单板定义逻辑地址和物理地址。 应用层模块之间使用逻辑地址 进行通信, 逻辑地址与物理地址之间存在地址映射关系, 该地址映射关系 由底层模块维护。 倒换时, 底层模块切换逻辑地址与物理地址之间的映射 关系, 即实现了透明倒换。  Figure 3 illustrates an address mapping mechanism in a switching system in accordance with one embodiment of the present invention. First define logical addresses and physical addresses for each board. The application layer modules use logical addresses to communicate with each other. An address mapping relationship exists between the logical address and the physical address. The address mapping relationship is maintained by the underlying module. When switching, the underlying module switches the mapping relationship between the logical address and the physical address, that is, transparent switching is implemented.
比如, 通信系统中包含设备单板 A、 设备单板 B及其备用单板 B 。 当 正常工作时, 单板 B为主用单板, 单板 B'为备用单板, 单板 B的物理地址 为 301, 单板 B'的物理地址为 302。 通过地址映射, 单板 B的物理地址 301 与逻辑地址 303对应, 单板 Β'的物理地址 302与逻辑地址 304对应。 通信 系统中的其他单板, 比如单板 Α, 通过逻辑地址 303访问主用单板, 与主 用单板 Β进行通信。 当倒换发生时, 地址映射机制切换地址映射关系, 将 主用单板 Β的物理地址 301映射到逻辑地址 304 , 将备用单板 B'的物理地 址 302映射到逻辑地址 303。 此后其他单板 Α访问逻辑地址 303时, 通过 映射关系即访问物理地址 302所对应的单板, 因此实际上是与单板 B'进行 通信。 这时单板 B'成为主用单板, 单板 B成为备用单板。 而在整个倒换操 作发生前后, 上层应用模块, 或其他单板都没有受到影响。  For example, the communication system includes the device board A, the device board B, and its standby board B. When the board is working normally, the board B is the main board, and the board B' is the standby board. The physical address of the board B is 301, and the physical address of the board B' is 302. The physical address 301 of the board B corresponds to the logical address 303, and the physical address 302 of the board 与 corresponds to the logical address 304. The other boards in the communication system, such as the board, access the active board through logical address 303 to communicate with the active board. When the switchover occurs, the address mapping mechanism switches the address mapping relationship, mapping the physical address 301 of the active board to the logical address 304, and mapping the physical address 302 of the standby board B' to the logical address 303. After the other board accesses the logical address 303, the board corresponding to the physical address 302 is accessed through the mapping relationship, so that the board actually communicates with the board B'. At this time, the board B' becomes the active board, and the board B becomes the standby board. The upper application module, or other boards, are not affected before and after the entire switching operation.
采用上述地址映射机制, 实现快速的底层通信通道的切换, 实现了倒 换的快速处理, 并减少业务中断时间。 由于地址映射关系的操作不涉及硬 件切换, 因此提高了倒换操作的可靠性。 同时, 对于其它业务模块来说, 屏蔽了倒换带来的影响, 具有透明性。  The above address mapping mechanism is adopted to implement fast switching of the underlying communication channel, which realizes fast processing of switching and reduces service interruption time. Since the operation of the address mapping relationship does not involve hardware switching, the reliability of the switching operation is improved. At the same time, for other service modules, the impact of switching is shielded and transparent.
图 4示出了根据本发明的一个实施例的倒换方法流程。 根据前述, 将 倒换流程分为多个步骤执行, 并由控制模块完成控制功能, 执行模块完成 具体操作。  Figure 4 illustrates a flow of a switching method in accordance with one embodiment of the present invention. According to the foregoing, the switching process is divided into multiple steps, and the control module completes the control function, and the execution module completes the specific operation.
在各种触发方式下, 当倒换触发后, 触发消息首先传到控制模块, 由 控制模块启动倒换操作。  In various trigger modes, after the trigger is triggered, the trigger message is first transmitted to the control module, and the control module initiates the switching operation.
首先进入步骤 401, 由控制模块进行倒换预处理, 判断系统资源是否 满足倒换条件。 由于倒换操作需要一定的系统资源条件才能完成, 因此在 倒换进行之前, 为了保证倒换的可靠性, 由控制模块先进行系统资源判断。 比如, 判断对备用单板的配置是否已经完成, 若备用单板还没有配置, 则 无法接替主用单板进行通信, 倒换操作不能进行, 此时控制模块将终止倒 换流程。 对于人工倒换的情况, 控制模块还将判断系统资源是否充足, 比 如当备用单板运行异常时, 发出警告信号, 通知操作员不能进行倒换。 First, proceeding to step 401, the control module performs a pre-processing of the switching to determine whether the system resource satisfies the switching condition. Because the switching operation requires certain system resource conditions to complete, before the switching is performed, in order to ensure the reliability of the switching, the control module first performs system resource judgment. For example, if the configuration of the standby board is complete, if the standby board is not configured, the active board cannot be used for communication. The switching operation cannot be performed. The control module will terminate the switching process. In the case of manual switching, the control module also determines whether the system resources are sufficient. For example, when the standby board runs abnormally, a warning signal is issued to notify the operator that the switching cannot be performed.
接着进入步骤 402, 控制模块发送命令给主用执行模块和备用执行模 块, 由主用执行模块和备用执行模块进行倒换预处理, 判断业务资源是否 满足倒换条件。 由于倒换操作对业务有一定的影响, 因此在进行倒换之前, 必须由执行模块对当前正在运行的业务进行判断, 比如, 当业务较繁忙或 对实时性的要求较高时, 判断倒换条件不满足, 终止倒换或推迟倒换, 或 者发出倒换不安全的警告。  Then, in step 402, the control module sends a command to the active execution module and the standby execution module, and the primary execution module and the standby execution module perform reverse switching preprocessing to determine whether the service resource satisfies the switching condition. Because the switching operation has a certain impact on the service, the execution module must judge the currently running service before performing the switching. For example, when the service is busy or the real-time requirement is high, it is judged that the switching condition is not satisfied. , terminate the switch or postpone the switch, or issue a warning that the switch is unsafe.
接着进入步骤 403, 当倒换条件满足之后, 由主用执行模块和备用执 行模块进行业务数据备份处理。 由于备用单板和主用单板之间数据不能同 步, 即使采用温备份的方式也不能保证数据的完全同步, 因此在倒换真正 执行之前, 需要先将主用单板的数据备份到备用单板, 使得主备用数据完 全一致, 保证在倒换发生后, 业务通信能平滑接续。  Then, proceeding to step 403, after the switching condition is satisfied, the service execution backup process is performed by the active execution module and the standby execution module. The data between the standby board and the active board cannot be synchronized. Even if the data is used in the warm backup mode, the data cannot be completely synchronized. Therefore, you need to back up the data of the active board to the standby board before the switchover is performed. Make the primary and backup data completely consistent, and ensure that the business communication can be smoothly connected after the switching occurs.
接着进入步骤 404, 在控制模块控制下, 由主用执行模块和备用执行 模块完成对底层通信通道的切换处理, 即逻辑地址和物理地址的地址映射 关系修改。 将原主用单板的逻辑地址映射到原备用单板的物理地址上。 该 步骤完成了主备用单板的透明切换, 屏蔽了对高层应用模块和其他不相关 单板的影响, 而且可以在较快的时间内完成。  Then, proceeding to step 404, under the control of the control module, the switching process of the underlying communication channel is completed by the primary execution module and the standby execution module, that is, the address mapping relationship between the logical address and the physical address is modified. Map the logical address of the original active board to the physical address of the original standby board. This step completes the transparent switchover of the active and standby boards, and blocks the impact on the high-level application modules and other unrelated boards, and can be completed in a faster time.
接着进入步骤 405 , 在控制模块控制下, 由主用执行模块和备用执行 模块完成与系统相关的配置数据切换处理。 在通信系统中, 设备单板往往 需要进行配置才能正常工作, 因此在倒换完成之后, 还需要进行系统相关 的配置数据处理。 比如, 需要在底层配置主备用单板的状态, 使得主用单 板不再发送数据, 而备用单板与系统其他单板进行通信。  Next, proceeding to step 405, under the control of the control module, the system-related configuration data switching process is completed by the main execution module and the standby execution module. In a communication system, a device board needs to be configured to work properly. Therefore, after the switchover is completed, system-related configuration data processing is required. For example, you need to configure the status of the active and standby boards on the bottom layer so that the active board does not send data. The standby board communicates with other boards in the system.
最后进入步骤 406, 由主用执行模块和备用执行模块完成业务数据的 平滑处理。 在主备用单板完成倒换之后, 需要保证通信业务的平滑接续, 因此在该步骤, 执行模块对主备用单板的通信业务进行监测, 并在特殊情 况下采用相应的策略, 以保证业务尽量不受倒换操作的影响。 比如, 对备 用单板的通信数据进行核查, 如果通信出现异常, 则发送警告信号, 或者 在业务平滑期主用单板和备用单板同时工作, 当主备用单板数据完全同步 的时候, 才使用备用单板的数据进行通信, 并使得主用单板进入备用状态。 Finally, in step 406, the main processing module and the standby execution module complete the smooth processing of the service data. After the main and standby boards complete the switchover, you need to ensure the smooth connection of the communication service. Therefore, in this step, the execution module monitors the communication service of the primary and backup boards, and adopts corresponding policies in special cases to ensure that services are not as good as possible. Affected by the switching operation. For example, Check the communication data of the board. If the communication is abnormal, send a warning signal. If the active and standby boards are in the same state, the standby board is used. The data communicates and causes the active board to enter the standby state.
在本发明的一个实施例中, 在不必要的情况下, 上述倒换方法的流程 中, 将步骤 402省略。 比如, 在硬件故障引起的自动倒换下, 此时主用单 板出现故障, 必须由备用单元接替其进行通信, 此时对业务造成的影响是 不可避免的, 但由于主备用单板运行的业务数据设计为温备份方式, 实际 对业务的影响也会 ί艮小。  In one embodiment of the present invention, step 402 is omitted in the flow of the above-described switching method, if it is not necessary. For example, in the automatic switchover caused by a hardware fault, the active board is faulty, and the standby unit must take over the communication. The impact on the service is unavoidable. The data is designed to be a warm backup method, and the actual impact on the business will be small.
在本发明的另一个实施例中, 在不必要的情况下, 如硬件故障引起的 自动倒换, 上述倒换方法的流程中, 将步骤 403省略。 由于硬件故障时, 主用单板发生故障, 不可能进行倒换恢复操作, 因此配置数据没有必要进 行备份。  In another embodiment of the present invention, in an unnecessary case, such as automatic switching caused by a hardware failure, in the flow of the above switching method, step 403 is omitted. When the main board is faulty due to a hardware failure, it is impossible to perform the recovery operation. Therefore, it is not necessary to back up the configuration data.
在本发明的一个实施例中, 为了实现平滑倒换设计, 作为主备冗余备 份的单板, 采用温备份方式, 保证数据同步和业务平滑倒换。 温备份是指, 在正常运行过程中, 主板上的业务数据发生改变时, 立即将业务数据备份 到备板上, 使主备单板上的运行数据始终保持一致。 而在倒换时, 特别是 在人工倒换或自动人工倒换下, 允许主用单板将业务数据备份到备用单板 上, 由于正常工作时已经实施了温备份, 因此在倒换发生时, 所需要备份 的数据量是非常小的, 数据备份所耗时间也是比较少的, 由此提高倒换的 速度, 降低对系统和业务通信造成的影响。  In an embodiment of the present invention, in order to implement a smooth switching design, a board that serves as an active/standby redundant backup adopts a warm backup mode to ensure data synchronization and service smooth switching. When the service data on the main board changes, the service data is backed up to the standby board. The running data on the active and standby boards is always consistent. In the case of a switchover, the manual switch is used to back up the service data to the standby board. In the normal operation, the warm backup is performed. Therefore, when the switchover occurs, the backup is required. The amount of data is very small, and the time taken for data backup is relatively small, thereby increasing the speed of switching and reducing the impact on system and service communication.
在本发明的一个实施例中, 还利用倒换流程的步骤执行方法实现倒换 恢复。 倒换恢复是指在倒换发送过程中或者倒换发生后的一定时间之内, 系统都可以根据情况进行倒换恢复, 尽量使得主备用单板恢复到倒换之前 的状态。 这实现了在某些特殊需要情况下对于自动人工倒换的支持, 比如 由操作维护人员发起的单板倒换过程, 需要对主用单板进行一定时间的离 线操作。  In an embodiment of the present invention, the method of performing the switching of the switching process is also used to implement the switching recovery. The switchover recovery means that the system can perform the switchover recovery according to the situation during the process of the switchover or the switchover. The master/slave board is restored to the state before the switchover. This provides support for automatic manual switching in certain special situations. For example, the board switching process initiated by the operation and maintenance personnel requires the offline operation of the main board for a certain period of time.
根据本发明的一个实施例的倒换方法, 首先将倒换流程划分为多个阶 段, 采用状态控制的方法控制倒换的进行。 当任意一个阶段出现处理异常 时, 都可以进行倒换恢复处理, 尽可能恢复到倒换前的状态。 下面详细给 出了在倒换步骤中, 当出现异常情况时, 倒换恢复流程如下: According to the switching method of one embodiment of the present invention, the switching process is first divided into a plurality of stages, and the state control method is used to control the switching. When a processing exception occurs at any one stage, the switching recovery processing can be performed, and the state before the switching is restored as much as possible. Details below In the switching step, when an abnormal situation occurs, the switching recovery process is as follows:
步骤 401或步骤 402中, 在倒换的预处理阶段, 由于还没有进行实质 的倒换操作, 因此在此阶段, 如果倒换失败, 比如倒换条件判断不满足时, 将直接结束倒换流程, 不需要进行倒换恢复。  In step 401 or step 402, in the pre-processing stage of the switching, since the substantial switching operation has not been performed, if the switching fails, for example, the switching condition determination is not satisfied, the switching process is directly ended, and no switching is required. restore.
步骤 403 中, 在数据备份阶段, 由于数据备份不影响原主用单板的正 常运行, 只是将数据备份在备用单板, 因此倒换失败时, 进行的倒换恢复 操作只需对备用单板进行恢复, 比如简单的将备用单板复位重启。  In step 403, in the data backup phase, the data backup does not affect the normal operation of the original active board, but the data is backed up on the standby board. Therefore, when the switchover fails, the switchover recovery operation only needs to restore the standby board. For example, simply reset and restart the standby board.
步骤 404 中, 在底层通信通道切换阶段, 如果出现异常, 则进行倒换 恢复操作, 将底层通信通道切换恢复到倒换前的状态, 并恢复原主用单板 的配置, 通知其作为主用单板运行。  In step 404, if an abnormality occurs in the underlying communication channel switching phase, the switchover recovery operation is performed, and the underlying communication channel is switched to the state before the switchover, and the configuration of the original active board is restored, and the device is notified to operate as the active board. .
步骤 405 中, 在系统相关配置数据切换阶段, 如果出现异常, 则进行 倒换恢复操作, 先恢复相关配置数据为原主备对应关系, 然后将底层通信 通道切换恢复到倒换前的状态, 最后恢复原主用单板的业务模块等, 通知 其作为主用单板运行。  In step 405, in the system configuration data switching phase, if an abnormality occurs, the switching recovery operation is performed, and the related configuration data is restored to the original active/standby correspondence relationship, and then the underlying communication channel is switched to the state before the switching, and finally the original primary recovery is resumed. The service module of the board is notified that it is running as the active board.
在本发明的一个实施例中, 通过上述倒换恢复机制实现不中断业务的 系统平滑升级和硬件维护。 比如, 釆用倒换恢复, 可以将主用单板从系统 中卸下一段时间, 进行操作或维护, 然后由倒换恢复机制, 重新恢复主用 单板的运行, 而在这一过程中由备用单板替代主用单板运行, 因此业务没 有发生中断。  In an embodiment of the present invention, the system smooth upgrade and hardware maintenance of the uninterrupted service are implemented by the above-mentioned switching recovery mechanism. For example, if you use the switchover recovery, you can remove the active board from the system for a period of time, perform operation or maintenance, and then resume the operation of the active board by the switchover recovery mechanism. In this process, the standby board is used. The board replaces the main board, so the service is not interrupted.
虽然通过参照本发明的某些优选实施例, 已经对本发明进行了图示和 描述, 但本领域的普通技术人员应该明白, 可以在形式上和细节上对其作 各种各样的改变, 而不偏离所附权利要求书所限定的本发明的精神和范围。  Although the present invention has been illustrated and described with reference to the preferred embodiments of the present invention, those skilled in the art The spirit and scope of the invention is defined by the appended claims.

Claims

权 利 要 求 Rights request
1. 一种通信系统中设备的倒换方法, 其特征在于, 包括:  A method for switching a device in a communication system, comprising:
A、 进行倒换预处理;  A. Perform reverse switching preprocessing;
B、 完成对主用设备和备用设备之间的底层通信通道的切换处理; B. Completing the switching process of the underlying communication channel between the primary device and the standby device;
C、 完成主用设备和备用设备配置数据的切换处理。 C. Complete the switching process of the configuration data of the primary device and the standby device.
2. 根据权利要求 1所述的通信系统中设备的倒换方法, 其特征在于, 步骤 A所述倒换预处理的过程包括:  2. The method for switching a device in a communication system according to claim 1, wherein the process of switching the pre-processing in step A comprises:
Al、 判断系统资源是否满足倒换条件, 如果满足, 进入步骤 B, 否贝' J , 终止倒换流程。  Al. Determine whether the system resources meet the switching conditions. If yes, go to step B, and no, stop the switching process.
3. 根据权利要求 1或 1所述的通信系统中设备的倒换方法, 其特征 在于, 步骤 A所述倒换预处理的过程包括:  The method for switching the device in the communication system according to claim 1 or 1, wherein the process of switching the preprocessing in step A comprises:
A2、 判断业务资源是否满足倒换条件, 如果满足, 进入步骤 B, 否则 终止倒换流程。  A2. Determine whether the service resource meets the switching condition. If yes, go to step B. Otherwise, terminate the switching process.
4. 根据权利要求 1所述的通信系统中设备的倒换方法, 其特征在于, 在所述步骤 A和所述步骤 B之间 , 还包含步骤:  4. The method for switching devices in a communication system according to claim 1, wherein between the step A and the step B, the method further comprises the steps of:
将所述主用设备的业务数据备份到所述备用设备, 进行业务数据的平 滑接续操作。  The service data of the active device is backed up to the standby device to perform smooth connection operation of the service data.
5. 根据权利要求 1所述的通信系统中设备的倒换方法, 其特征在于, 在所述步骤 C之后还包含步骤,  5. The method for switching devices in a communication system according to claim 1, further comprising a step after said step C,
进行业务数据的平滑处理操作, 保证业务的平滑接续。  Smooth processing of business data to ensure smooth connection of services.
6. 根据权利要求 1所述的通信系统中设备的倒换方法, 其特征在于,  6. The method for switching devices in a communication system according to claim 1, wherein:
7. 根据权利要求 1 所述的通信系统中设备的倒换方法, 其特征在于 还包括, 所述主用设备和所述备用设备之间采用温备份的方式实现数据同 步。 The method for switching a device in a communication system according to claim 1, further comprising: performing data synchronization between the active device and the standby device by using a warm backup.
8. 根据权利要求 1 - 7中任意一条权利要求所述的通信系统中设备的 倒换方法, 其特征在于还包括, 判断是否出现倒换异常, 如果出现, 执行 倒换恢复流程。  The method for switching a device in a communication system according to any one of claims 1-7, further comprising: determining whether a switching abnormality occurs, and if so, performing a switching recovery process.
9. 根据权利要求 8所述的通信系统中设备的倒换方法, 其特征在于, 所述倒换恢复流程包括: 9. The method of switching devices in a communication system according to claim 8, wherein: The switching recovery process includes:
对所述备用设备进行数据恢复。  Data recovery is performed on the standby device.
10.根据权利要求 9所述的通信系统中设备的倒换方法,其特征在于, 所述倒换恢复流程包括:  The method for switching a device in a communication system according to claim 9, wherein the switching recovery process comprises:
将所述底层通信通道切换恢复到倒换前的状态;  Reverting the underlying communication channel to a state before the switching;
恢复所述主用设备的配置, 并通知所述主用设备作为主用运行。 The configuration of the primary device is restored, and the primary device is notified to operate as a primary device.
11.根据权利要求 8所述的通信系统中设备的倒换方法,其特征在于, 所述倒换恢复流程包括: The method for switching a device in a communication system according to claim 8, wherein the switching recovery process comprises:
恢复所述配置数据为原主、 备设备的对应关系;  Restoring the configuration data as the correspondence between the original primary and backup devices;
切换恢复所述底层通信通道到倒换前的状态;  Switching to restore the state of the underlying communication channel to before the switching;
恢复原所述主用设备的业务模块, 并通知其作为主用运行。  Restore the service module of the original primary device and notify it to operate as the primary.
12. 一种通信系统中设备的倒换系统, 其特征在于包括:  12. A switching system for a device in a communication system, comprising:
控制模块, 用于控制倒换流程的执行;  a control module, configured to control execution of the switching process;
主用执行模块, 用于根据所述控制模块的指令, 执行与主用设备相关 的底层通信通道和配置数据的切换处理;  a main execution module, configured to perform, according to an instruction of the control module, a switching process of an underlying communication channel and configuration data related to the active device;
备用执行模块, 用于根据所述控制模块的指令, 执行与备用设备相关 的底层通信通道和配置数据的切换处理。  And an alternate execution module, configured to perform, according to an instruction of the control module, a switching process of an underlying communication channel and configuration data related to the standby device.
13. 根据权利要求 12 所述的通信系统中设备的倒换系统, 其特征在 于, 所述主用执行模块和备用执行模块还用于根据所述控制模块的指令, 将所述主用设备的业务数据备份到所述备用设备, 以及业务数据的平滑处  The switching system of the device in the communication system according to claim 12, wherein the primary execution module and the standby execution module are further configured to: service the primary device according to an instruction of the control module Data backup to the backup device, and smoothing of business data
PCT/CN2005/000995 2004-07-13 2005-07-07 Method and system for carrying out the function of switching in a communication system WO2006005251A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200410071746.8 2004-07-13
CNB2004100717468A CN100395962C (en) 2004-07-13 2004-07-13 Method and system for equipment switching in communication system

Publications (1)

Publication Number Publication Date
WO2006005251A1 true WO2006005251A1 (en) 2006-01-19

Family

ID=35783513

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2005/000995 WO2006005251A1 (en) 2004-07-13 2005-07-07 Method and system for carrying out the function of switching in a communication system

Country Status (2)

Country Link
CN (1) CN100395962C (en)
WO (1) WO2006005251A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111478947A (en) * 2020-03-20 2020-07-31 珠海高凌信息科技股份有限公司 Real-time synchronization method and system for main control board and standby control board

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101247213A (en) * 2007-02-16 2008-08-20 华为技术有限公司 Method and system for master/standby rearrangement
CN101222358B (en) * 2008-01-22 2010-06-09 华为技术有限公司 Configuration emigration method and device
CN101267345B (en) * 2008-03-10 2010-12-08 中兴通讯股份有限公司 Service node backup method and distributed system
CN101599852B (en) * 2008-06-06 2011-08-24 中兴通讯股份有限公司 Communication method of a plurality of modules among single boards, single board with a plurality of software modules and communication device
WO2011157149A2 (en) * 2011-05-31 2011-12-22 华为技术有限公司 Method, communication device and system, and service request device for main/standby switch between communication devices
CN102265555B (en) * 2011-05-31 2015-04-29 华为技术有限公司 Conversion method between an main communicational equipment and a spare communicational equipment, the communicational equipments and a service request equipment
CN103068034B (en) * 2013-01-29 2016-05-11 大唐移动通信设备有限公司 A kind of method that data are synchronous and device
CN112714461B (en) * 2021-01-29 2022-05-31 四川安迪科技实业有限公司 DAMA satellite network central station protection switching method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1321004A (en) * 2000-04-25 2001-11-07 华为技术有限公司 Method and equipment for swapping active with standby switches
US20030033268A1 (en) * 2001-08-09 2003-02-13 Srikanth Muthya Data access component providing transparent access to a plurality of medical imaging databases

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1321004A (en) * 2000-04-25 2001-11-07 华为技术有限公司 Method and equipment for swapping active with standby switches
US20030033268A1 (en) * 2001-08-09 2003-02-13 Srikanth Muthya Data access component providing transparent access to a plurality of medical imaging databases

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111478947A (en) * 2020-03-20 2020-07-31 珠海高凌信息科技股份有限公司 Real-time synchronization method and system for main control board and standby control board

Also Published As

Publication number Publication date
CN100395962C (en) 2008-06-18
CN1722628A (en) 2006-01-18

Similar Documents

Publication Publication Date Title
WO2006005251A1 (en) Method and system for carrying out the function of switching in a communication system
US8015431B2 (en) Cluster system and failover method for cluster system
EP2281240B1 (en) Maintaining data integrity in data servers across data centers
US8032786B2 (en) Information-processing equipment and system therefor with switching control for switchover operation
WO2016058307A1 (en) Fault handling method and apparatus for resource
CN101160794B (en) Disaster recovery system and method of service controlling device in intelligent network
CN105159798A (en) Dual-machine hot-standby method for virtual machines, dual-machine hot-standby management server and system
CN110912991A (en) Super-fusion-based high-availability implementation method for double nodes
CN105302661A (en) System and method for implementing virtualization management platform high availability
CN103346903A (en) Dual-machine backup method and device
KR20070026327A (en) Redundant routing capabilities for a network node cluster
CN101237413B (en) Method for realizing high-availability of control part under forward and control separated network architecture
CN103532753A (en) Double-computer hot standby method based on memory page replacement synchronization
CN101594383A (en) A kind of service of double controller storage system and controller state method for supervising
WO2005125228A1 (en) Upgradable communication system and upgrading method of communication system
KR20030054165A (en) Processor board dualizing apparatus and method for mobile communication system
CN100496030C (en) Integrated access media gateway device based on the high-available system
CN112511326A (en) Switching method, device, equipment and storage medium
CN102487332B (en) Fault processing method, apparatus thereof and system thereof
CN103297279A (en) Switching method of main and backup single disks of software control in multi-software process system
EP2538612A1 (en) Method and device for managing service
WO2021238579A1 (en) Method for managing sata hard disk by means of storage system, and storage system
JP2010231257A (en) High availability system and method for handling failure of high availability system
JP6856574B2 (en) Service continuation system and service continuation method
JP5353378B2 (en) HA cluster system and clustering method thereof

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

122 Ep: pct application non-entry in european phase