WO2023273945A1 - 协议高可用保护系统及保护方法 - Google Patents

协议高可用保护系统及保护方法 Download PDF

Info

Publication number
WO2023273945A1
WO2023273945A1 PCT/CN2022/099981 CN2022099981W WO2023273945A1 WO 2023273945 A1 WO2023273945 A1 WO 2023273945A1 CN 2022099981 W CN2022099981 W CN 2022099981W WO 2023273945 A1 WO2023273945 A1 WO 2023273945A1
Authority
WO
WIPO (PCT)
Prior art keywords
main control
control unit
protocol
database
message
Prior art date
Application number
PCT/CN2022/099981
Other languages
English (en)
French (fr)
Inventor
吴道揆
陈卫立
王万里
曹义
孙兴强
吕朋伟
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2023273945A1 publication Critical patent/WO2023273945A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/40Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass for recovering from a failure of a protocol instance or entity, e.g. service redundancy protocols, protocol state redundancy or protocol service redirection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • G06F16/273Asynchronous replication or reconciliation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/544Buffers; Shared memory; Pipes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes

Definitions

  • the present application relates to the technical field of network equipment, in particular to a protocol high-availability protection system and protection method.
  • the high-availability protection of the control plane refers to performing active-standby switchover on the main control unit of the device, keeping the protocol uninterrupted, avoiding the shock and re-convergence of the control plane from affecting the traffic forwarding on the forwarding plane, and improving the network service quality.
  • the weak consistency scheme of the master and backup master control units of the database can guarantee the final consistency between the master and backup, but this consistency lacks strict guarantees.
  • the main control unit modifies the data, before the data is synchronized to the standby main control unit, the data between the main and standby main control units is inconsistent. Initiating an active-standby switchover within this period will result in the failure of the protocol function, which will cause oscillation and re-convergence of the control plane. All such schemes based on weak consistency have certain limitations in their scope of application. It can only support high availability based on soft state protocols.
  • the strong consistency scheme of the master and backup master control units of the database which can ensure the strict consistency of the master, backup master, and backup master control units.
  • the method to achieve this strong consistency is to use blocking synchronous replication. That is to say, after the content of the database is modified on the main control unit, the content must be synchronized to one or more protection nodes, and the modification action will be returned successfully only after the response is confirmed.
  • This method ensures strong consistency between the master and backup master control units, but each modification involves data synchronization and waiting across nodes, which will lead to serious performance degradation of the database.
  • the embodiment of the present application provides a protocol high availability protection system, including: a first main control unit and a second main control unit connected to the first main control unit; the first main control unit is used to receive other The system sends a message signal to the first main control unit, processes the message signal to obtain processed data, and stores the processed data to the first main control unit and the second main control unit unit; the first main control unit is used to send a message reception confirmation signal to the other system after the second main control unit obtains the processed data; it also includes a connection with the first main control module A line card unit, the line card unit is connected to the second main control module, and the line card unit is used to receive the processing data and perform traffic forwarding according to the processing data.
  • the embodiment of the present application also provides a protocol high availability protection method, which is applied to a protocol high availability protection system including a first main control unit and a second main control unit, including: the first main control unit receives messages sent by other systems message signal, the first main control unit processes the message signal to obtain processed data, and the first main control unit stores the processed data in the first main control unit and the second main control unit Two main control units: the first main control unit sends a message reception confirmation signal to the other systems after the second main control unit acquires the processed data.
  • Fig. 1 is a schematic structural diagram of the protocol high availability protection system provided by the first embodiment of the present application
  • Fig. 2 is a flow chart of the protocol high availability protection method provided by the second embodiment of the present application.
  • Fig. 3 is a flowchart one of the sending process of the TCP protocol high availability protection method
  • Fig. 4 is the flowchart two of the receiving process of TCP protocol high availability protection method
  • Fig. 5 is a flow chart of the high availability protection method of the OSPF protocol.
  • the purpose of the embodiments of the present application is to provide a protocol high availability protection system and protection method, so as to realize the high availability of the protocol.
  • the first main control unit after the first main control unit receives message signals sent by other systems and processes the message signals, it stores the processing data obtained by processing the message signals in the first main control unit and synchronously stores them in the The second main control unit sends a message reception confirmation signal to other systems after the second main control unit obtains the processing data, thereby realizing the synchronization of the processing data of the message signals of the first main control unit and the second main control unit,
  • the second main control unit has not yet obtained the processing data of the message signal, if the switching of the main control unit occurs, and the first main control unit is converted to the second main control unit, since the message receiving confirmation signal has not been sent, this When other systems will resend the message signal, the second main control unit can also re-receive the message signal and process it without re-convergence; after the second main control unit obtains the processing data of the message signal, if The switchover of the main control unit occurs, and the first main control unit is converted into the second main control unit.
  • the second main control unit Since the second main control unit has synchronously stored the processing data of the message signal, the second main control unit can The processing data of the signal continues to process the message signal, and there will be no re-convergence of the main control unit, so as to realize the high availability of the protocol.
  • the first embodiment of the present application relates to a protocol high-availability protection system, the specific structure is shown in Figure 1, including: a first main control unit 10 and a second main control unit 20 connected to the first main control unit 10; wherein , the first main control unit 10 is used to receive message signals sent by other systems. After receiving the message signals, the first main control unit 10 processes the message signals and obtains processed data, and then converts the obtained processed data Stored in the first main control unit 10 and the second main control unit 20; when the second main control unit 20 obtains the processed data of the message signal, the first main control unit 10 sends a message reception confirmation signal to other systems.
  • it also includes a line card unit 30 connected to the first main control module 10, the line card unit 30 is connected to the second main control module 20, and the line card unit 30 is used for receiving processing data and forwarding traffic according to the processing data.
  • the first main control unit 10 After the first main control unit 10 receives the message signal sent by other systems and processes the message signal, it stores the processed data obtained by processing the message signal into the first main control unit 10 and simultaneously stores it in the second main control unit 10.
  • the main control unit 20 sends a message reception confirmation signal to other systems after the second main control unit 20 obtains the processed data, thereby realizing the processing data of the message signals of the first main control unit 10 and the second main control unit 20 Synchronization, when the second main control unit 20 has not yet obtained the processing data of the message signal, if the switching of the main control unit occurs, the first main control unit 10 is converted to the second main control unit 20, since the message has not yet been sent Receiving the confirmation signal, at this time, other systems will resend the message signal, and the second main control unit 20 can also receive the message signal again and process it without re-convergence; After the processing data of the signal, if the switching of the main control unit occurs, the first main control unit 10 is converted to the second main control unit 20.
  • the second main control unit 20 Since the second main control unit 20 has synchronously stored the processing data of the message signal, the second The main control unit 20 can continue to process the message signal according to the synchronously stored message signal processing data, and there will be no re-convergence of the main control unit, thereby realizing high availability of the protocol.
  • the first main control unit 10 includes a first protocol component 11 and a first database 12 communicatively connected with the first protocol component 11, and the second main control unit 20 includes a second protocol component 21 and a communication connection with the first protocol component 11.
  • the second database 22 to which the second protocol component 21 is communicatively connected.
  • the first protocol component 11 is used to receive the message signal, and process the message signal to obtain processing data, then store the processing data into the first database 12, and store it into the second database 22 synchronously; when the first When switching between the main control unit 10 and the second main control unit 20 , the second protocol component 21 can obtain the stored processing data from the second database 22 .
  • the first database 12 includes a first sending module 13, and the first sending module 13 is used for storing and sending a message reception confirmation signal to other systems. After the first protocol component 11 receives foreign language signals from other systems, it will generate a message reception confirmation signal and store the message reception confirmation signal in the first sending module 13.
  • the first sending module 13 sends the message reception confirmation signal stored in it to other systems, and other systems will no longer send the sent message signal after receiving the message reception confirmation signal, if the second main control unit 20 The processing data has not been obtained yet and the switching of the first main control unit 10 and the second main control unit 20 takes place, then the message reception confirmation signal is stored in the first sending module 13 and has not yet been sent, and other systems have not received it after a period of time. After receiving the message reception confirmation signal, the message signal will be resent.
  • the message signal can be received by the second protocol component 21 in the second main control unit 20 and re-processed, thereby avoiding the loss of the processing of the message signal, and also No reconvergence of the master unit takes place, resulting in high availability of the protocol.
  • the first database 12 is communicatively connected to the second database 22 , and the first database 12 is used to receive processing data from the first protocol component 11 and transmit the processing data to the second database 22 .
  • the processed data is transmitted to the second database 22 through the first database 12.
  • the first database 12 can directly transmit the processed data to the second database 22.
  • the processed data can be directly stored in the second database 22 without processing the processed data, thereby effectively improving the transmission and storage efficiency of the processed data.
  • the aforementioned transmission of processed data from the first database 12 to the second database 22 is only an illustration of a specific example in this embodiment, and does not constitute a limitation. In other embodiments of the present application, Other methods such as simultaneously transmitting the processed data to the first database 12 and the second database 22 through the first protocol component 11 may also be used, and the details may be flexibly set according to actual needs.
  • the second database 22 includes a second sending module 23, and the second sending module 23 is used to send a processing data reception confirmation signal to the first database 12 after the second database 22 receives the processing data; After processing the data reception confirmation signal, the first sending module 13 sends the stored message reception confirmation signal to other systems.
  • the second sending module 23 is set to send the processing data receiving confirmation signal to the first database 12, thereby ensuring that the second database 22 has received the message signal when the first sending module 13 sends the stored message receiving confirmation signal to other systems processing data, so as to ensure the data synchronization of the first database 12 and the second database 22.
  • the network protocol run by the first protocol component 11 is the same as the network protocol run by the second protocol component 21, for example, the first protocol component 11 and the second protocol component 21 can run TCP (Transmission Control Protocol, Transmission Control Protocol) or OSPF (Open Shortest Path First, Open Shortest Path First) and other network communication protocols.
  • TCP Transmission Control Protocol
  • OSPF Open Shortest Path First, Open Shortest Path First
  • both the first database and the second database are SDDM (Shared-Memory Distributed Data Manager, Shared-Memory Distributed Data Manager).
  • SDDM is a distributed data storage system based on shared memory. The system can provide extremely high read and write performance through shared memory, and at the same time ensure data synchronization, timing linkage and timing collaboration between the first database and the second database, ensuring that the second Strict consistency of information between a database and a second database.
  • the second embodiment of the present application provides a protocol high availability protection method, which is applied to a protocol high availability protection system including a first main control unit and a second main control unit, as shown in FIG. 2 , including the following steps:
  • Step S101 the first main control unit receives message signals sent by other systems.
  • other systems may be other systems running on a terminal different from the protocol high-availability protection system, or other systems running on the same terminal as the protocol high-availability protection system.
  • Flexible settings can be made according to actual needs.
  • the network protocol running in the first main control unit is the TCP protocol
  • other systems can be clients of the current terminal
  • the network protocol running in the first main control unit is the OSPF protocol
  • other systems can be running on LSAs on other endpoints advertise the network topology.
  • Step S102 the first main control unit processes the message signal to obtain processed data.
  • the first main control unit processes the received message signal through its running network protocol.
  • corresponding processing data will be generated, such as protocol state change data, etc., which Part of the protocol state change data includes the state data required by the protocol itself, and also includes the forwarding information items calculated according to the protocol message and used to guide the work of the forwarding plane.
  • Step S103 the first main control unit stores the processing data to the first main control unit and the second main control unit.
  • Step S104 After the second main control unit acquires the processed data, the first main control unit sends a message reception confirmation signal to other systems.
  • the second main control unit after the second main control unit acquires the processed data, the second main control unit will generate and send a processed data reception confirmation signal to the first main control unit, and the first main control unit receives the second
  • the step of sending a message receipt confirmation signal to other systems will be executed only after the processing data sent by the main control unit receives the confirmation signal, so as to ensure the synchronization of the processing data between the first main control unit and the second main control unit.
  • Step S105 the first main control unit or the second main control unit sends the processing data to the line card unit.
  • the first main control unit sends the processing data to the line card unit after sending a packet reception confirmation signal to other systems, and the line card unit performs traffic forwarding according to the processing data.
  • the first main control unit completes the processing of a message signal and obtains all the processed data of the message signal, no matter whether the processed data is synchronized with the second main control unit or not storage, can continue to receive another message signal for processing, so as to ensure that there is no fault in the process of message signal processing, and improve the efficiency of message processing.
  • this embodiment is an embodiment of the protocol high availability protection method corresponding to the first embodiment, and this embodiment can be implemented in cooperation with the first embodiment.
  • the relevant technical details and technical effects mentioned in the first embodiment are still valid in this embodiment, and will not be repeated here to reduce repetition.
  • the relevant technical details and technical effects mentioned in this embodiment can also be applied in the first embodiment.
  • TCP is a network transport layer protocol, which realizes reliable transmission of information on the network through a send-response-sliding window mechanism.
  • the reliable transmission of TCP is divided into two directions: sending and receiving, which are described as follows:
  • the sending process of TCP is shown in Figure 3, including:
  • Step S201 the first protocol component receives a packet signal sent by the client.
  • Step S202 the first protocol component processes the message signal to obtain the content of the sending window, and stores the content of the sending window in the first database.
  • Step S203 the first protocol component generates a packet reception confirmation signal and stores it in the first sending module.
  • Step S204 the first database synchronizes the sending window content to the second database.
  • Step S205 After receiving the content of the sending window, the second database stores the content of the sending window, and sends a processing data reception confirmation signal to the first database through the second sending module.
  • Step S206 After the first database receives the processing data reception confirmation signal, the first sending module sends the stored message reception confirmation signal to the client.
  • Step S207 The first protocol component sends the content of the sending window stored in the first database to the line card unit.
  • Step S208 After receiving the content of the sending window, the line card unit sends a response message to the first protocol component.
  • Step S209 The first protocol component updates the sending window content stored in the first database according to the response message.
  • Step S210 the first database synchronizes the updated sending window content to the second database.
  • switching the main control unit at any time can ensure that the TCP connection is not interrupted and information is not lost. If the main control unit is converted before step S206, the first sending module does not send the stored message reception confirmation signal to the client, and the client does not receive the message reception confirmation signal for a certain period of time and will send the message to the converted main control module again. Send a message signal; if the main control unit is switched after step S206, the sliding window information has been synchronized to the second database, and the second protocol component in the second main control module can obtain the content of the sending window from the second database after switching and continue processing to ensure the reliability of TCP transmission.
  • the receiving process of TCP is shown in Figure 4, including:
  • Step S301 the first protocol component receives a packet signal sent by an external terminal.
  • Step S302 the first protocol component processes the message signal to obtain the content of the sending window, and stores the content of the sending window in the first database.
  • Step S303 the first protocol component generates a packet reception confirmation signal and stores it in the first sending module.
  • Step S304 the first database synchronizes the sending window content to the second database.
  • Step S305 After receiving the content of the sending window, the second database stores the content of the sending window, and sends a processing data reception confirmation signal to the first database through the second sending module.
  • Step S306 After the first database receives the processing data reception confirmation signal, the first sending module sends the stored message reception confirmation signal to the client.
  • Step S307 the first protocol component sends the content of the sending window stored in the first database to the line card unit.
  • step S307 the first sending module does not send the stored message reception confirmation signal to the external terminal, and the external terminal does not receive the message reception confirmation signal for a certain period of time and will send the message to the converted main control module again. Send message signal. Therefore, the reliability of TCP transmission during the conversion process of the main control module is guaranteed.
  • the OSPF protocol is an IP network routing protocol, which collects and notifies the network topology through the LSA, and calculates the route according to the network topology, using It is used to guide the forwarding engine to send traffic.
  • IP network routing protocol which collects and notifies the network topology through the LSA, and calculates the route according to the network topology, using It is used to guide the forwarding engine to send traffic.
  • Step S401 the external terminal sends a message signal including the network topology to the first protocol component through the LSA, and the first protocol component accepts the LSA.
  • Step S402 the first protocol component stores the LSA information into the first database, and at the same time starts the shortest path first (SPF) route calculation, and stores the calculated route information into the first database.
  • SPF shortest path first
  • Step S403 The first protocol component generates a packet reception confirmation signal and stores it in the first sending module.
  • Step S404 the first database synchronizes the LSA information and routing information to the second database.
  • Step S405 After receiving the LSA information and the routing information, the second database stores the LSA information and the routing information, and sends a processed data reception confirmation signal to the first database through the second sending module.
  • Step S406 After the first database receives the processing data reception confirmation signal, the first sending module sends the stored message reception confirmation signal to the client.
  • Step S407 the first protocol component sends the content of the sending window stored in the first database to the line card unit.
  • the main control unit switching the main control unit at any time can ensure that the routing information in OSPF is not lost. If the main control unit is converted before step S407, the first sending module does not send the stored message reception confirmation signal to the external terminal, and the external terminal does not receive the message reception confirmation signal for a certain period of time and will send the message to the converted main control module again. Send message signal. In this way, the reliability of OSPF transmission during the conversion process of the main control module is ensured.
  • modules involved in this embodiment are logical modules.
  • a logical unit can be a physical unit, or a part of a physical unit, or multiple physical units. Combination of units.
  • units that are not closely related to solving the technical problems proposed in the present application are not introduced in this embodiment, but this does not mean that there are no other units in this embodiment.
  • the program is stored in a storage medium and includes several instructions to make a device (which can be a single-chip , chip, etc.) or a processor (processor) executes all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only database (ROM, Read-Only Memory), random access database (RAM, Random Access Memory), magnetic disk or optical disc, etc., which can store program codes. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

本申请涉及网络设备技术领域,公开了一种协议高可用保护系统及保护方法。其中,协议高可用保护系统,包括:第一主控单元和与第一主控单元连接的第二主控单元;第一主控单元用于接收其它系统发送的报文信号、对报文信号进行处理以获取处理数据、并将处理数据存储至第一主控单元和第二主控单元;第一主控单元用于在第二主控单元获取处理数据后、向其它系统发送报文接收确认信号;还包括与第一主控模块连接的线卡单元,线卡单元与第二主控模块连接,线卡单元用于接收处理数据,并根据处理数据进行流量转发。

Description

协议高可用保护系统及保护方法
交叉引用
本申请基于申请号为“202110729464.6”、申请日为2021年6月29日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此以引入方式并入本申请。
技术领域
本申请涉及网络设备技术领域,特别涉及一种协议高可用保护系统及保护方法。
背景技术
随着网络带宽的日益提升,客户对网络设备,特别是网络设备中的控制面高可用性要求越来越高。控制面的每一次震荡和重新收敛,在完成收敛之前,将会导致网络断流和环路。控制面的高可用性保护,是指在设备的主控单元进行主备倒换,保持协议不中断,避免控制面的震荡和重新收敛影响转发面的流量转发,提升网络服务质量。
实现协议高可用的有几种方式:
1、控制面的各协议自身实现主备状态的同步和倒换保护,这种方法实现复杂度高,缺乏方案的统一性和简洁性。
2、借助高可用数据库来实现控制面的高可用保护。而数据库的实现又可以进一步细分为两种方式:
(1)数据库的主、备主控单元弱一致性方案,该方案能保证主备之间的最终一致性,但这种一致性是缺乏严格保障的。主主控单元修改数据后,该数据同步到备主控单元之前,主、备主控单元之间数据是不一致的。在这段时间内发起主备倒换,将会导致协议的功能失效,从而引发控制面的震荡和重新收敛。所有这种基于弱一致性的方案应用范围有一定的局限性。只能支撑基于软状态 协议的高可用性。
(2)数据库的主、备主控单元强一致性方案,该方案能保证主、备主、备主控单元的严格一致性,通常实现该强一致性的方法是采用阻塞式同步复制方式,也就是说,在主主控单元上修改数据库内容后,该内容必须同步到一个或多个保护节点,并通过应答确认后,该修改动作才成功返回。这种方式保证了主、备主控单元之间的强一致性,但每一次修改涉及到跨节点的数据同步和等待,该方式会导致数据库的性能下降严重。
因此,如何保证主、备主控单元切换时网络协议的高可用性成为一个亟待解决的问题。
发明内容
本申请的实施方式提供了一种协议高可用保护系统,包括:第一主控单元和与所述第一主控单元连接的第二主控单元;所述第一主控单元用于接收其它系统向所述第一主控单元发送的报文信号、对所述报文信号进行处理以获取处理数据、并将所述处理数据存储至所述第一主控单元和所述第二主控单元;所述第一主控单元用于在所述第二主控单元获取所述处理数据后、向所述其它系统发送报文接收确认信号;还包括与所述第一主控模块连接的线卡单元,所述线卡单元与所述第二主控模块连接,所述线卡单元用于接收所述处理数据,并根据所述处理数据进行流量转发。
本申请的实施方式还提供了一种协议高可用保护方法,应用于包括第一主控单元和第二主控单元的协议高可用保护系统,包括:所述第一主控单元接收其它系统发送的报文信号,所述第一主控单元对所述报文信号进行处理以获取处理数据,所述第一主控单元将所述处理数据存储至所述第一主控单元和所述第二主控单元;所述第一主控单元在所述第二主控单元获取所述处理数据后,向所述其它系统发送报文接收确认信号。
附图说明
图1是本申请第一实施方式所提供的协议高可用保护系统的结构示意图;
图2是本申请第二实施方式所提供的协议高可用保护方法的流程图;
图3是TCP协议高可用保护方法的发送过程的流程图一;
图4是TCP协议高可用保护方法的接收过程的流程图二;
图5是OSPF协议高可用保护方法的流程图。
具体实施方式
为使本申请的目的、技术方案和优点更加清楚,下面将结合附图对本申请的各实施方式进行详细的阐述。然而,本领域的普通技术人员可以理解,在本申请各实施方式中,为了使读者更好地理解本申请而提出了许多技术细节。但是,即使没有这些技术细节和基于以下各实施方式的种种变化和修改,也可以实现本申请所要求保护的技术方案。
本申请实施方式的目的在于提供一种协议高可用保护系统及保护方法,实现协议的高可用性。
本申请实施方式第一主控单元接收到其它系统发送的报文信号、并对报文信号进行处理后,将处理报文信号获取的处理数据存储至第一主控单元的同时,同步存储至第二主控单元,在第二主控单元获取到处理数据后向其它系统发送报文接收确认信号,从而实现第一主控单元和第二主控单元的报文信号的处理数据的同步,在第二主控单元尚未获取到报文信号的处理数据时,如果发生主控单元的倒换,由第一主控单元转换为第二主控单元,则由于尚未发送报文接收确认信号,此时其它系统会重新发送报文信号,第二主控单元也可以重新接收报文信号并进行处理,而不会发生重新收敛;在第二主控单元获取到报文信号的处理数据后,如果发生主控单元的倒换,由第一主控单元转换为第二主控单元,由于第二主控单元已经同步存储有报文信号的处理数据,第二主控单元可以根据同步存储的报文信号的处理数据继续对报文信号进行处理,也不会发生主控单元的重新收敛,从而实现协议的高可用性。
本申请的第一实施方式涉及一种协议高可用保护系统,具体结构如图1所示,包括:第一主控单元10和与第一主控单元10连接的第二主控单元20;其中,第一主控单元10用于接收其它系统发送的报文信号,在接收到报文信号后,第一主控单元10对报文信号进行处理并获得处理数据,然后将获取到的处理数据存储至第一主控单元10和第二主控单元20;当第二主控单元20获取到报文 信号的处理数据后,第一主控单元10向其它系统发送报文接收确认信号。此外,还包括与第一主控模块10连接的线卡单元30,线卡单元30与第二主控模块20连接,线卡单元30用于接收处理数据,并根据处理数据进行流量转发。
第一主控单元10接收到其它系统发送的报文信号、并对报文信号进行处理后,将处理报文信号获取的处理数据存储至第一主控单元10的同时,同步存储至第二主控单元20,在第二主控单元20获取到处理数据后向其它系统发送报文接收确认信号,从而实现第一主控单元10和第二主控单元20的报文信号的处理数据的同步,在第二主控单元20尚未获取到报文信号的处理数据时,如果发生主控单元的倒换,由第一主控单元10转换为第二主控单元20,则由于尚未发送报文接收确认信号,此时其它系统会重新发送报文信号,第二主控单元20也可以重新接收报文信号并进行处理,而不会发生重新收敛;在第二主控单元20获取到报文信号的处理数据后,如果发生主控单元的倒换,由第一主控单元10转换为第二主控单元20,由于第二主控单元20已经同步存储有报文信号的处理数据,第二主控单元20可以根据同步存储的报文信号的处理数据继续对报文信号进行处理,也不会发生主控单元的重新收敛,从而实现协议的高可用性。
具体的,如图2所示,第一主控单元10包括第一协议组件11和与第一协议组件11通信连接的第一数据库12,第二主控单元20包括第二协议组件21和与第二协议组件21通信连接的第二数据库22。其中,第一协议组件11用于接收报文信号,并对报文信号进行处理以获取处理数据,然后将处理数据存储至第一数据库12、并同步存储到第二数据库22;当发生第一主控单元10和第二主控单元20的切换时,第二协议组件21可以从第二数据库22中获取存储的处理数据。
此外,在本申请中,第一数据库12包括第一发报模块13,第一发报模块13用于存储并向其它系统发送报文接收确认信号。在第一协议组件11接收到其它系统的外文信号后,会产生报文接收确认信号并将报文接收确认信号存储在第一发报模块13中,当第二主控单元20获取到处理数据后,第一发报模块13将其存储的报文接收确认信号发送至其它系统,其它系统接收到报文接收确认信号后即不再发送已发送的报文信号,若第二主控单元20尚未获取到处理数 据而发生第一主控单元10和第二主控单元20的切换,则报文接收确认信号存储在第一发报模块13中而尚未发送,其它系统在一段时间间隔后未接收到报文接收确认信号则会重发报文信号,此时报文信号可以被第二主控单元20中的第二协议组件21接收并重新进行处理,从而避免报文信号的处理丢失,也不会发生主控单元的重新收敛,从而实现协议的高可用性。
具体的,在本实施方式中,第一数据库12与第二数据库22通信连接,第一数据库12用于从第一协议组件11接收处理数据并将处理数据传输至第二数据库22。通过第一数据库12向第二数据库22传输处理数据,当存储的过程中需要对处理数据进行压缩或者是其它方式的处理后存储时,第一数据库12可以直接向第二数据库22传输处理后的处理数据,第二数据库22无需对处理数据进行处理而可以直接存储,从而有效的提升处理数据的传输和存储效率。可以理解的是,前述通过第一数据库12向第二数据库22传输处理数据仅为本实施方式中的一种具体的实施例的举例说明,并不构成限定,在本申请的其它实施方式中,也可以是通过第一协议组件11同时向第一数据库12和第二数据库22传输处理数据等其它方式,具体可以根据实际需要进行灵活的设置。
此外,第二数据库22包括第二发报模块23,第二发报模块23用于在第二数据库22接收到处理数据后向第一数据库12发送处理数据接收确认信号;第一数据库12接收到处理数据接收确认信号后,第一发报模块13将存储的报文接收确认信号发送至其它系统。设置第二发报模块23向第一数据库12发送处理数据接收确认信号,从而保证第一发报模块13将存储的报文接收确认信号发送至其它系统时第二数据库22已经接收到报文信号的处理数据,从而保证第一数据库12和第二数据库22的数据同步。
优选的,在本实施方式中,第一协议组件11所运行的网络协议和第二协议组件21所运行的网络协议相同,例如,第一协议组件11和第二协议组件21可以运行TCP(Transmission Control Protocol,传输控制协议)或OSPF(Open Shortest Path First,开放式最短路径优先)等网络通信协议。当发生第一主控单元10和第二主控单元20的转换时,第一协议组件11所运行的网络协议和第二协议组件21所运行的网络协议相同可以保证报文信号的继续处理,从而进一步的实现协议的高可用性。
优选的,在本实施方式中,第一数据库和第二数据库均为SDDM(Shared-Memory Distributed Data Manager,共享内存分布式数据管理器)。SDDM为一种基于共享内存的分布式数据存储系统,该系统通过共享内存可以提供极高的读写性能,同时保证第一数据库和第二数据库之间数据同步、时序联动和时序协作,保证第一数据库和第二数据库之间信息的严格一致。
本申请第二实施方式提供了一种协议高可用保护方法,应用于包括第一主控单元和第二主控单元的协议高可用保护系统,如图2所示,包括以下步骤:
步骤S101:第一主控单元接收其它系统发送的报文信号。
具体的,在本实施方式中,其它系统既可以是运行在与协议高可用保护系统不同的终端上的其它系统,也可以是与协议高可用保护系统运行在同一终端上的其他系统,具体可以根据实际需要进行灵活的设置。例如,当第一主控单元中运行的网络协议为TCP协议时,其他系统可以是当前终端的客户端;当第一主控单元中运行的网络协议为OSPF协议时,其它系统可以是运行在其它终端上的LSA通告网络拓扑。
步骤S102:第一主控单元对报文信号进行处理以获取处理数据。
具体的,在本实施方式中,第一主控单元通过其运行的网络协议对接收的报文信号进行处理,协议处理过程中,会产生相应的处理数据,例如协议状态的变化数据等,这部分协议状态的变化数据包括协议自身处理需要的状态数据,同时也包括根据协议报文计算出来,用于指导转发面工作的转发信息条目。
步骤S103:第一主控单元将处理数据存储至第一主控单元和第二主控单元。
步骤S104:第一主控单元在第二主控单元获取处理数据后,向其它系统发送报文接收确认信号。
具体的,在本实施方式中,第二主控单元在获取处理数据后,第二主控单元会产生并向第一主控单元发送处理数据接收确认信号,第一主控单元接收到第二主控单元发送的处理数据接收确认信号后,才会执行向其它系统发送报文接收确认信号步骤,保证第一主控单元和第二主控单元之间处理数据的同步。
步骤S105:第一主控单元或者第二主控单元向线卡单元发送处理数据。
具体的,在本实施方式中,第一主控单元在向其它系统发送报文接收确认信号后,将处理数据发送至线卡单元,线卡单元根据处理数据进行流量转发。
需要说明的是,在本实施方式中,只要第一主控单元完成了对一条报文信号的处理,获得了该报文信号的全部处理数据,无论处理数据是否与第二主控单元完成同步存储,都可以继续接受另一条报文信号进行处理,从而保证报文信号的处理过程不出现断层,提升报文的处理效率。
不难发现,本实施方式为与第一实施方式相对应的协议高可用保护方法的实施例,本实施方式可与第一实施方式互相配合实施。第一实施方式中提到的相关技术细节和技术效果在本实施方式中依然有效,为了减少重复,这里不再赘述。相应地,本实施方式中提到的相关技术细节和技术效果也可应用在第一实施方式中。
上面各种方法的步骤划分,只是为了描述清楚,实现时可以合并为一个步骤或者对某些步骤进行拆分,分解为多个步骤,只要包含相同的逻辑关系,都在本专利的保护范围内;对算法中或者流程中添加无关紧要的修改或者引入无关紧要的设计,但不改变其算法和流程的核心设计都在该专利的保护范围内。
下面,将对第一协议组件和第二协议组件中所运行的网络协议进行举例说明,具体如下。
当第一协议组件和第二协议组件中所运行的网络协议为TCP时,TCP是一种网络传输层协议,该协议通过发送-应答-滑动窗口机制,实现了信息在网络上的可靠传输。TCP的可靠传输分为发送和接受两个方向,分别描述如下:
TCP的发送过程如图3所示,包括:
步骤S201:第一协议组件接收客户端发送的报文信号。
步骤S202:第一协议组件对报文信号进行处理得到发送窗口内容,并将发送窗口内容存储到第一数据库中。
步骤S203:第一协议组件产生报文接收确认信号并存储在第一发报模块中。
步骤S204:第一数据库将发送窗口内容同步到第二数据库。
步骤S205:第二数据库接收到发送窗口内容后,对发送窗口内容进行存储,并通过第二发报模块向第一数据库发送处理数据接收确认信号。
步骤S206:第一数据库在接收到处理数据接收确认信号后,第一发报模块将存储的报文接收确认信号发送至客户端。
步骤S207:第一协议组件将第一数据库中存储的发送窗口内容发送至线卡 单元。
步骤S208:线卡单元接收到发送窗口内容后,向第一协议组件发送应答报文。
步骤S209:第一协议组件根据应答报文更新第一数据库中存储的发送窗口内容。
步骤S210:第一数据库将更新后的发送窗口内容同步至第二数据库。
由此可见,在此过程中,任意时刻转换主控单元都能保证TCP链接不中断,信息不丢失。如果步骤S206之前转换主控单元,第一发报模块未将存储的报文接收确认信号发送至客户端,客户端在一定时长未接收到报文接收确认信号会重新向转换后的主控模块发送报文信号;如果步骤S206之后转换主控单元,该滑动窗口信息已经同步到第二数据库,倒换后第二主控模块中的第二协议组件可以从第二数据库获取到发送窗口内容并继续处理,保证TCP传输的可靠性。
TCP的接收过程如图4所示,包括:
步骤S301:第一协议组件接收外部终端发送的报文信号。
步骤S302:第一协议组件对报文信号进行处理得到发送窗口内容,并将发送窗口内容存储到第一数据库中。
步骤S303:第一协议组件产生报文接收确认信号并存储在第一发报模块中。
步骤S304:第一数据库将发送窗口内容同步到第二数据库。
步骤S305:第二数据库接收到发送窗口内容后,对发送窗口内容进行存储,并通过第二发报模块向第一数据库发送处理数据接收确认信号。
步骤S306:第一数据库在接收到处理数据接收确认信号后,第一发报模块将存储的报文接收确认信号发送至客户端。
步骤S307:第一协议组件将第一数据库中存储的发送窗口内容发送至线卡单元。
由此可见,在此过程中,任意时刻转换主控单元都能保证TCP链接不中断,信息不丢失。如果步骤S307之前转换主控单元,第一发报模块未将存储的报文接收确认信号发送至外部终端,外部终端在一定时长未接收到报文接收确认信号会重新向转换后的主控模块发送报文信号。从而保证主控模块转换过程中TCP传输的可靠性。
当第一协议组件和第二协议组件中所运行的网络协议为OSPF协议时,OSPF协议是一种IP网络路由协议,该协议通过LSA收集和通告网络拓扑,并根据该网络拓扑计算路由,用于指导转发引擎进行流量发送。具体步骤如图5所示,包括:
步骤S401:外部终端通过LSA向第一协议组件发送包括网络拓扑的报文信号,第一协议组件接受该LSA。
步骤S402:第一协议组件将LSA信息存储进第一数据库,同时启动最短路径优先(SPF)路由计算,将计算得到的路由信息存储进第一数据库。
步骤S403:第一协议组件产生报文接收确认信号并存储在第一发报模块中。
步骤S404:第一数据库将LSA信息和路由信息同步到第二数据库。
步骤S405:第二数据库接收到LSA信息和路由信息后,对LSA信息和路由信息进行存储,并通过第二发报模块向第一数据库发送处理数据接收确认信号。
步骤S406:第一数据库在接收到处理数据接收确认信号后,第一发报模块将存储的报文接收确认信号发送至客户端。
步骤S407:第一协议组件将第一数据库中存储的发送窗口内容发送至线卡单元。
由此可见,在此过程中,任意时刻转换主控单元都能保证OSPF中的路由信息不丢失。如果步骤S407之前转换主控单元,第一发报模块未将存储的报文接收确认信号发送至外部终端,外部终端在一定时长未接收到报文接收确认信号会重新向转换后的主控模块发送报文信号。从而保证主控模块转换过程中OSPF传输的可靠性。
值得一提的是,本实施方式中所涉及到的各模块均为逻辑模块,在实际应用中,一个逻辑单元可以是一个物理单元,也可以是一个物理单元的一部分,还可以以多个物理单元的组合实现。此外,为了突出本申请的创新部分,本实施方式中并没有将与解决本申请所提出的技术问题关系不太密切的单元引入,但这并不表明本实施方式中不存在其它的单元。
本领域技术人员可以理解实现上述实施例方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序存储在一个存储介质中,包括若干 指令用以使得一个设备(可以是单片机,芯片等)或处理器(processor)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读数据库(ROM,Read-Only Memory)、随机存取数据库(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。
本领域的普通技术人员可以理解,上述各实施方式是实现本申请的具体实施例,而在实际应用中,可以在形式上和细节上对其作各种改变,而不偏离本申请的精神和范围。

Claims (11)

  1. 一种协议高可用保护系统,包括:
    第一主控单元和与所述第一主控单元连接的第二主控单元;
    所述第一主控单元用于接收其它系统发送的报文信号、对所述报文信号进行处理以获取处理数据、并将所述处理数据存储至所述第一主控单元和所述第二主控单元;
    所述第一主控单元用于在所述第二主控单元获取所述处理数据后、向所述其它系统发送报文接收确认信号;
    还包括与所述第一主控模块连接的线卡单元,所述线卡单元与所述第二主控模块连接,所述线卡单元用于接收所述处理数据,并根据所述处理数据进行流量转发。
  2. 根据权利要求1所述的协议高可用保护系统,其中,所述第一主控单元包括第一协议组件和与所述第一协议组件通信连接的第一数据库,所述第二主控单元包括第二协议组件和与所述第二协议组件通信连接的第二数据库;
    所述第一协议组件用于接收所述报文信号、对所述报文信号进行处理以获取处理数据、并将所述处理数据存储至所述第一数据库和所述第二数据库,所述第二协议组件用于获取所述第二数据库中存储的所述处理数据。
  3. 根据权利要求2所述的协议高可用保护系统,其中,所述第一数据库包括第一发报模块,所述第一发报模块用于存储并向所述其它系统发送所述报文接收确认信号。
  4. 根据权利要求2或3所述的协议高可用保护系统,其中,所述第一数据库与所述第二数据库通信连接,所述第一数据库用于从所述第一协议组件接收所述处理数据并将所述处理数据传输至所述第二数据库。
  5. 根据权利要求4所述的协议高可用保护系统,其中,所述第二数据库包括第二发报模块,所述第二发报模块用于在所述第二数据库接收到所述处理数据后向所述第一数据库发送处理数据接收确认信号;
    所述第一发报模块用于在接收到所述处理数据接收确认信号后、向所述其它系统发送所述报文接收确认信号。
  6. 根据权利要求2至5中任意一项所述的协议高可用保护系统,其中,所 述第一协议组件所运行的网络协议和所述第二协议组件所运行的网络协议相同。
  7. 根据权利要求6所述的协议高可用保护系统,其中,所述通信协议为TCP或OSPF。
  8. 根据权利要求2至7中任意一项所述的协议高可用保护系统,其中,所述第一数据库和所述第二数据库均为SDDM。
  9. 一种协议高可用保护方法,应用于包括第一主控单元和第二主控单元的协议高可用保护系统,包括:
    所述第一主控单元接收其它系统发送的报文信号,所述第一主控单元对所述报文信号进行处理以获取处理数据,所述第一主控单元将所述处理数据存储至所述第一主控单元和所述第二主控单元;
    所述第一主控单元在所述第二主控单元获取所述处理数据后,向所述其它系统发送报文接收确认信号;
    所述第一主控单元或者所述第二主控单元向线卡单元发送所述处理数据。
  10. 根据权利要求9所述的协议高可用保护方法,其中,所述第二主控单元获取所述处理数据后,还包括:
    所述第二主控单元产生并向所述第一主控单元发送处理数据接收确认信号;
    所述第一主控单元接收到所述处理数据接收确认信号后、向所述其它系统发送报文接收确认信号。
  11. 根据权利要求9或10所述的协议高可用保护方法,其中,所述第一主控单元对所述报文信号进行处理以获取处理数据后,还包括:
    当所述第一主控单元处理完成所述报文信号后,接收新的报文信号。
PCT/CN2022/099981 2021-06-29 2022-06-20 协议高可用保护系统及保护方法 WO2023273945A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110729464.6A CN115550474A (zh) 2021-06-29 2021-06-29 协议高可用保护系统及保护方法
CN202110729464.6 2021-06-29

Publications (1)

Publication Number Publication Date
WO2023273945A1 true WO2023273945A1 (zh) 2023-01-05

Family

ID=84691215

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/099981 WO2023273945A1 (zh) 2021-06-29 2022-06-20 协议高可用保护系统及保护方法

Country Status (2)

Country Link
CN (1) CN115550474A (zh)
WO (1) WO2023273945A1 (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1889464A (zh) * 2006-07-26 2007-01-03 华为技术有限公司 一种实现通信接管的方法及装置
CN101764738A (zh) * 2008-12-25 2010-06-30 华为技术有限公司 一种支持tcp协议连接的备份方法及其设备
CN103944974A (zh) * 2014-04-02 2014-07-23 华为技术有限公司 一种协议报文处理方法、控制器故障处理方法及相关设备
US20150142755A1 (en) * 2012-08-24 2015-05-21 Hitachi, Ltd. Storage apparatus and data management method
CN106533771A (zh) * 2016-11-24 2017-03-22 杭州华三通信技术有限公司 一种网络设备以及控制信息传输方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1889464A (zh) * 2006-07-26 2007-01-03 华为技术有限公司 一种实现通信接管的方法及装置
CN101764738A (zh) * 2008-12-25 2010-06-30 华为技术有限公司 一种支持tcp协议连接的备份方法及其设备
US20150142755A1 (en) * 2012-08-24 2015-05-21 Hitachi, Ltd. Storage apparatus and data management method
CN103944974A (zh) * 2014-04-02 2014-07-23 华为技术有限公司 一种协议报文处理方法、控制器故障处理方法及相关设备
CN106533771A (zh) * 2016-11-24 2017-03-22 杭州华三通信技术有限公司 一种网络设备以及控制信息传输方法

Also Published As

Publication number Publication date
CN115550474A (zh) 2022-12-30

Similar Documents

Publication Publication Date Title
US9705735B2 (en) System and method using RSVP hello suppression for graceful restart capable neighbors
EP3373547B1 (en) Method for realizing disaster tolerance backup
US7974315B2 (en) Method for implementing synchronization of link state database, router, line card and master board
US9659075B2 (en) Providing high availability in an active/active appliance cluster
US9049230B2 (en) Backup method and backup device for TCP connection
EP1261186A1 (en) System and method for TCP connection protection switching
CN108337161B (zh) 一种mlag接口故障三层数据流量平滑切换的方法
US20070064698A1 (en) Method and apparatus for transferring BGP state information during asynchronous startup
CN103747091A (zh) 一种嵌入式设备的分布式数据同步系统及方法
CN112787960B (zh) 一种堆叠分裂的处理方法、装置、设备及存储介质
US8483049B2 (en) System and method for communications system routing component level high availability
KR20150002474A (ko) 통신 네트워크에서 장애 복구 방법
US20090154341A1 (en) Method And Apparatus For Providing Network Redundancy
US11985027B2 (en) Systems and methods for seamless failover in branch deployments by superimposing clustering solution on VRRP
WO2023273945A1 (zh) 协议高可用保护系统及保护方法
CN106533771B (zh) 一种网络设备以及控制信息传输方法
CN116915353A (zh) 作为精确时间协议边界时钟的堆叠网络设备
WO2019134572A1 (zh) 基于sdn的光传送网保护恢复方法、设备及存储介质
CN114553761B (zh) 一种异常处理方法、装置、网络设备及存储介质
JP5011728B2 (ja) ホットスタンバイ方式を採用したシステム及びそれに用いる同期方法
JP2013179430A (ja) 装置間同期および切替方法、並びに装置間切替システム
CN117555688A (zh) 基于双活中心的数据处理方法、系统、设备及存储介质
CN117354250A (zh) 一种报文处理方法及装置
JP2013012827A (ja) 通信システムおよび通信装置
CN106210432A (zh) 软件传真接收系统及方法

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22831763

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22831763

Country of ref document: EP

Kind code of ref document: A1