CN109981353B - Method and system for protecting adjacent station redundancy in frame type network communication equipment - Google Patents

Method and system for protecting adjacent station redundancy in frame type network communication equipment Download PDF

Info

Publication number
CN109981353B
CN109981353B CN201910168712.7A CN201910168712A CN109981353B CN 109981353 B CN109981353 B CN 109981353B CN 201910168712 A CN201910168712 A CN 201910168712A CN 109981353 B CN109981353 B CN 109981353B
Authority
CN
China
Prior art keywords
station equipment
control unit
main control
equipment
adjacent station
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910168712.7A
Other languages
Chinese (zh)
Other versions
CN109981353A (en
Inventor
韦文
师进
马兰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CRSC Research and Design Institute Group Co Ltd
Original Assignee
CRSC Research and Design Institute Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CRSC Research and Design Institute Group Co Ltd filed Critical CRSC Research and Design Institute Group Co Ltd
Priority to CN201910168712.7A priority Critical patent/CN109981353B/en
Publication of CN109981353A publication Critical patent/CN109981353A/en
Application granted granted Critical
Publication of CN109981353B publication Critical patent/CN109981353B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/22Arrangements for detecting or preventing errors in the information received using redundant apparatus to increase reliability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/10Active monitoring, e.g. heartbeat, ping or trace-route

Abstract

The invention relates to a method and a system for protecting adjacent station redundancy in frame type network communication equipment, which comprises the following steps of monitoring the running states of a plurality of paths of main control boards of local station equipment and adjacent station equipment through a back-off control unit; the back-cut control unit of the local station equipment and the back-cut control unit of the adjacent station equipment synchronize the running state monitoring information of the multi-channel main control boards of the local station equipment and the adjacent station equipment. According to the running states of the multi-path main control boards of the local station equipment and the adjacent station equipment, the back cut control unit of the local station equipment selects one path of the multi-path main control board of the local station equipment or the adjacent station equipment to communicate with the exchange board of the local station equipment to complete the back cut action.

Description

Method and system for protecting adjacent station redundancy in frame type network communication equipment
Technical Field
The invention belongs to the technical field of network communication, and particularly relates to a method for protecting adjacent station redundancy in frame type network communication equipment.
Background
The existing network technology improves the stability of the network in various ways, and is mainly embodied as protection of different parts in the network. The protection method is that the corresponding parts are divided into a main part and a standby part, when the main part fails, the standby part can be switched to automatically, so that the original functions of the network are not affected by the failure.
The network protection technology includes protection of devices, protection of communication links, protection of sub-networks and the like, and the adopted technical methods are different. The equipment is protected mainly by configuring a plurality of sets of equipment redundancy in a network and configuring a plurality of board card redundancy protection in the equipment.
At present, high-end network communication equipment mainly adopts a frame structure, and can realize equipment protection by carrying out redundant design on a plurality of board cards in the equipment, wherein a switch board and a main control board are core board cards of the equipment, the switch board provides the network communication performance of the whole equipment, and the main control board is responsible for providing drive and management control for the switch board. In order to improve the reliability and availability of the equipment, a plurality of redundant core board cards are usually configured to mutually play a role in redundancy protection, a single core board card fails to cause equipment failure, and the equipment can still normally operate. At present, the redundant protection of the board card can be realized in several ways: (1)1+1 or 1:1 protection, namely the number of the main board cards is the same as that of the standby board cards, so as to form one-to-one protection; (2) and (4) N + M protection, namely the number of the main board cards is different from that of the standby board cards (more main board cards are generally available), so that flexible protection is formed.
Generally, the process involved in the process of switching back the failure of the active board card and the standby board card in a single device includes: (1) the method comprises the steps that when the equipment normally operates, state synchronization is kept between a main board card and a standby board card in the equipment, and the operation state is monitored in real time; (2) when a fault occurs, the fault is timely discovered by monitoring and each board card in the equipment is informed; (3) each board card in the equipment performs fault reverse switching based on a protection strategy, the software/hardware function realized on the original main board card is reversely switched to the standby board card to be realized, and the inter-board-card communication between the original other board cards and the main board card is reversely switched to the inter-board-card communication between the other board cards and the standby board card.
However, in the existing equipment protection technical solution, only redundancy protection between boards is considered in the equipment, and no board redundancy protection over equipment cooperation is adopted, so that the existing frame-type network communication equipment (such as a high-end switch, a router, a transmission device, and the like) cannot cope with the situation that a plurality of core boards fail simultaneously, and when no core board is available on a single device, the whole equipment is unavailable.
Therefore, a network communication device capable of establishing adjacent station device redundancy protection is needed, and on the basis of keeping the function of board redundancy protection in the device, a board redundancy protection mode of cross-device cooperation is added, so that higher device reliability and availability are obtained from the network or system level.
Disclosure of Invention
Aiming at the problems, the invention provides a method for protecting the redundancy of adjacent stations in frame type network communication equipment, which comprises the following steps:
the local station equipment and the adjacent station equipment are connected to one path of the multi-path main control board through the back-cut control unit to communicate with the exchange board;
the method comprises the steps that a back cut control unit of local station equipment monitors the running state of a multi-channel main control board of the local station equipment, and a back cut control unit of adjacent station equipment monitors the running state of the multi-channel main control board of the adjacent station equipment;
the back cut control unit of the local station equipment and the back cut control unit of the adjacent station equipment synchronize the running state monitoring information of the multi-path main control boards of the local station equipment and the adjacent station equipment;
according to the running states of the multi-channel main control boards of the local station equipment and the adjacent station equipment, the back cut control unit of the local station equipment selects one channel of the multi-channel main control boards of the local station equipment or the adjacent station equipment to communicate with the exchange board of the local station equipment, and the back cut action is completed.
Preferably, the reverse switching control unit of the local station device and the reverse switching control unit of the adjacent station device execute a handshake interaction protocol to synchronize the running state monitoring information of the multiple main control boards of the local station device and the adjacent station device.
Preferably, the handshake interaction protocol is executed based on an ethernet UDP message communication manner.
Preferably, the reverse switching control unit of the local station device and the reverse switching control unit of the adjacent station device execute a handshake interaction protocol, including remote discovery and real-time information feedback monitoring;
the remote end finds that: the reverse switching control unit of the local station equipment monitors the message sent by the adjacent station equipment, verifies the sent message, replies the reverse switching control unit of the adjacent station equipment after the verification is passed, verifies the reply by the reverse switching control unit of the adjacent station equipment, and completes remote discovery after the verification is passed;
real-time monitoring information returning: after the remote end discovery is completed, the reverse switching control unit of the adjacent station equipment keeps periodically sending heartbeats to the reverse switching control unit of the local station, the reverse switching control unit of the local station equipment replies and confirms each heartbeat message, carries the corresponding message serial number of the heartbeat message, and maintains the connection state of the reverse switching control unit of the local station equipment and the reverse switching control unit of the adjacent station equipment.
Preferably, the heartbeat message carries a message sequence number and monitoring information of the running state of the multi-path main control board of the adjacent station equipment.
Preferably, the handshake interaction protocol executed by the reverse switching control unit of the local station device and the reverse switching control unit of the adjacent station device further includes fault information notification and a request for taking over protection;
fault information informs and requests to take over protection: when the reverse switching control unit of the local station equipment monitors that all the multiple main control boards of the local station equipment have faults, if the adjacent station equipment still has a main control board which normally works, the reverse switching control unit of the local station equipment sends a burst message carrying fault information of the local station equipment to the remote reverse switching control unit and requests the adjacent station equipment to carry out take-over protection, and the reverse switching control unit of the adjacent station equipment replies and confirms the burst message and executes actions.
Preferably, the burst message is an aperiodic message, and carries the fault information of the multi-path main control board of the local station device.
A kind of adjacent station redundant protection system in the frame type network communication equipment, including at least a series of adjacent station apparatuses and at least a series of this station apparatus, this station apparatus and adjacent station apparatus include exchange board, multipath master control board and connect to exchange board and multipath master control board between the back cut control unit;
the local station equipment and the adjacent station equipment are connected to one path of the multi-path main control board through the back-cut control unit to communicate with the exchange board;
the method comprises the steps that a reverse switching control unit monitors the running state of a multi-channel main control board of local station equipment in the local station equipment, and the reverse switching control unit monitors the running state of the multi-channel main control board of adjacent station equipment in the adjacent station equipment;
the back cut control unit synchronizes the running states of the multi-path main control boards of the local station equipment and the adjacent station equipment in the local station equipment and the adjacent station equipment;
and the back cut control unit selects one of the multiple main control boards of the local station equipment or the adjacent station equipment from the local station equipment to communicate with the exchange board of the local station equipment according to the running states of the multiple main control boards of the local station equipment and the adjacent station equipment.
Preferably, the back-cut control unit comprises a selection module, a bridging module and a remote interface;
a selection module: selecting one of the multiple main control boards from the local station equipment and the adjacent station equipment to carry out single-path output;
a bridge module: converting the single-path output into multi-path output, driving the exchange board, and establishing the driving and management of the main control board on the exchange board;
a remote interface: and a driving passage is established at a long distance through a network cable or an optical fiber, so that the equipment of the local station is connected with the multi-path output of the equipment of the adjacent station.
Preferably, in the local station device, the multiple main control boards are connected to an input end of the selection module, an output end of the selection module is connected to an input end of the bridge module, an output end of the bridge module is connected to the switch board, the remote interface is connected to an output end of the bridge module, and the remote interface of the adjacent station device is connected to an input end of the local station device selection module.
Preferably, the reverse switching control unit includes at least one remote interface, which ensures that a group of local station devices is connected to at least one group of neighboring station devices, and a group of neighboring station devices is connected to at least one group of local station devices.
Preferably, the local station device and the adjacent station device further include a backplane, and the exchange board, the primary main control board, and the standby main control board are all disposed on the backplane.
Preferably, the back-off control unit monitors hardware in-place signals or software in-place signals of the main control board through the back board.
Preferably, the backplane is provided with an in-place signal transmission line for receiving a hardware in-place signal of the main control board.
Preferably, an ethernet channel is arranged on the back board and is used for receiving the software heartbeat signal of the main control board.
Preferably, the switch board and the multi-path main control board support hot plug functions.
According to the technical scheme, a board card protection means of crossing equipment is added on the board card protection means in the original equipment, which is equivalent to more standby main control boards when the main control boards are in failure, so that the cooperation capability among crossing equipment is fully exerted, the protection mode of the whole network is richer, and the reliability, the availability and the stability of the network of the equipment are improved.
The technical scheme not only keeps the board card redundancy protection function of the local station equipment and the adjacent station equipment, but also increases the board card redundancy protection mode of cross-equipment cooperation, and obtains higher equipment reliability and availability from the network or system level.
The back-cut control unit designed by the technical scheme expands the driving channel between the main control board and the exchange board from the channel in the equipment to the driving channel in the equipment and across the equipment, and can support the adjacent station to take over through the channel, namely the adjacent station redundancy protection.
In the technical scheme, the handshake interaction protocol between the switching control units of the cross-span equipment can be realized as follows: discovering a remote back-off control unit and establishing communication connection; transmitting real-time monitoring information of the adjacent station equipment master control board by the adjacent station equipment back-cut control unit to the local equipment so as to inform the local equipment whether the master control board of the adjacent station equipment can be used for redundancy protection; and informing the adjacent station equipment reverse switching control unit of the relevant fault information of the main control board of the local station equipment.
The back-off control unit of the technical scheme can monitor hardware in-place signals or software heartbeat signals of the main control board or the standby main control board of the equipment in real time so as to find faults.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 illustrates a flow chart of a method of neighbor redundancy protection in a subrack network communication device of the present invention;
FIG. 2 is a schematic diagram illustrating the paths of a neighboring station redundancy protection system in a subrack network communication device in the normal operation of the neighboring station redundancy protection system in the local station device;
FIG. 3 is a schematic diagram illustrating the path of the fail-over protection within the subrack network communication device of the adjacent site redundancy protection system of the present invention;
FIG. 4 is a schematic diagram illustrating the path of the fail-over protection between adjacent site devices for the adjacent site redundancy protection system in a subrack network communication device of the present invention;
FIG. 5 is a path diagram illustrating a one-to-many protection scenario for fail-over protection between adjacent site devices by an adjacent site redundancy protection system in a subrack network communication device of the present invention;
FIG. 6 is a flow chart of the adjacent station redundancy protection system in the subrack network communication equipment for fault switching protection between adjacent station equipments;
fig. 7 is a schematic diagram illustrating a backplane path for monitoring the primary/standby main control boards in the adjacent station redundancy protection system in the subrack network communication device according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It will also be understood that when an element is referred to as being "between," "connected to" or "coupled to" another element, it can be directly between, connected or coupled to the other element or intervening elements may be present. In contrast, when an element is referred to as being "directly between," directly connected to, "or" coupled to "another element, there are no intervening elements present.
Fig. 1 shows a flowchart of a method for protecting redundancy of neighboring stations in a frame-type network communication device, where both the local station device and the neighboring station device access one of multiple main control boards through a reverse switching control unit to communicate with an exchange board;
the method comprises the steps that a back cut control unit of local station equipment monitors the running state of a multi-channel main control board of the local station equipment, and a back cut control unit of adjacent station equipment monitors the running state of the multi-channel main control board of the adjacent station equipment;
the back cut control unit of the local station equipment and the back cut control unit of the adjacent station equipment synchronize the running state monitoring information of the multi-path main control boards of the local station equipment and the adjacent station equipment;
according to the running states of the multi-channel main control boards of the local station equipment and the adjacent station equipment, the back cut control unit of the local station equipment selects one channel of the multi-channel main control boards of the local station equipment or the adjacent station equipment to communicate with the exchange board of the local station equipment, and the back cut action is completed.
And the back-cut control unit of the local station equipment and the back-cut control unit of the adjacent station equipment execute a handshake interaction protocol to synchronize the running state monitoring information of the multiple main control boards of the local station equipment and the adjacent station equipment.
The handshake interaction protocol is executed based on the UDP message communication mode of the Ethernet.
The method for executing handshake interaction protocol between the reverse switching control unit of the station equipment and the reverse switching control unit of the adjacent station equipment comprises the following steps:
the remote end finds that: the reverse switching control unit of the local station equipment monitors the message sent by the adjacent station equipment, verifies the sent message, replies the reverse switching control unit of the adjacent station equipment after the verification is passed, verifies the reply by the reverse switching control unit of the adjacent station equipment, and completes remote discovery after the verification is passed;
real-time monitoring information returning: after the remote end discovery is completed, the reverse switching control unit of the adjacent station equipment keeps periodically sending heartbeats to the reverse switching control unit of the local station, the reverse switching control unit of the local station equipment replies and confirms each heartbeat message, carries the corresponding message serial number of the heartbeat message, and maintains the connection state of the reverse switching control unit of the local station equipment and the reverse switching control unit of the adjacent station equipment.
The heartbeat message carries the message serial number and the monitoring information of the running state of the multi-path main control board of the adjacent station equipment.
The handshake interactive protocol executed by the handshake control unit of the station equipment and the handshake control unit of the adjacent station equipment further comprises:
fault information informs and requests to take over protection: when the reverse switching control unit of the local station equipment monitors that all the multiple main control boards of the local station equipment have faults, if the adjacent station equipment still has a main control board which normally works, the reverse switching control unit of the local station equipment sends a burst message carrying fault information of the local station equipment to the remote reverse switching control unit and requests the adjacent station equipment to take over protection, and the reverse switching control unit of the adjacent station equipment replies and confirms the burst message and executes actions.
The burst message is an aperiodic message and carries the fault information of the multi-path main control board of the local station equipment.
The adjacent station equipment redundancy protection technology of the invention realizes the main control board redundancy protection in the network communication equipment and the main control board redundancy protection of cross-equipment cooperation, and the specific protection is as follows:
1) when the main control board of the local station equipment fails, the standby main control board on the local station equipment takes over the functions of the main control board;
2) when the main control board and the standby main control board of the local station equipment both fail, the current main control board on the adjacent station equipment takes over the functions of the main control board of the local station equipment.
Illustratively, as shown in fig. 2, in terms of hardware design, the back cut control unit includes the following modules:
a bridge module: the module has the function that one input end can drive a plurality of output ends simultaneously;
a selection module: the module has the function of selecting one path from a plurality of paths of input ends to ensure that the input end is in butt joint with the output end;
a remote interface: the interface provides the function of remotely building a drive channel through a network cable or an optical fiber.
Fig. 6 shows a flow chart of the protection of the reverse switching of the failure between adjacent station devices in the present invention, a method for protecting the redundancy of adjacent stations in a frame type network communication device, comprising:
step 1: when the local station equipment and the adjacent station equipment normally operate, the main master control board of the local station equipment synchronizes the state of the local station equipment to the main master control board of the adjacent station equipment, and the operation state is monitored in real time;
in order to realize the monitoring of the running state of the main control board of the station equipment, each switching control unit of the station equipment monitors the hardware in-place signal or the software heartbeat signal of the main control board of the station equipment in real time. Fig. 7 shows a schematic diagram of a backplane path for monitoring the active/standby main control boards in the device, and the specific method is as follows:
an in-place signal transmission line is arranged on the back plate and is connected with the main/standby main control board and the back-off control unit 1/2; when the main/standby main control board is inserted into the backplane, the backplane outputs a hardware in-place signal to the backplane, and illustratively, the backplane output hardware in-place signal is a constant 3.3V voltage and is transmitted to the inverse switching control unit 1/2 through an in-place signal transmission line; when the active/standby main control board fails, the in-place signal will fail, and for example, when the active/standby main control board fails, the voltage returns to zero, and can be detected by the back-off control unit 1/2;
or an ethernet channel is arranged on the device backplane and connected with the main/standby main control board and the back-off control unit 1/2; software of the master control board/standby control board periodically sends out a UDP message indicating a Heartbeat signal, and illustratively, the software of the master control board/standby control board sends out a UDP message carrying a "Heartbeat" message every 200ms, and the UDP message is transmitted to the back-off control unit 1/2 through an ethernet channel; when the primary/standby main control board fails, the sending of the heartbeat message is stopped, which can be detected by the back-off control unit 1/2.
Correspondingly, each back-off control unit of the adjacent station equipment monitors the hardware in-place signal or the software heartbeat signal of the main control board or the standby main control board of the adjacent station equipment in real time.
In order to detect a potentially available main control board on the adjacent station device, the reverse switching control unit of the local station device needs to execute a certain handshake interaction protocol with the reverse switching control unit of the adjacent station device, and the interaction needs to be performed through connection on a remote interface. One of the functions of the handshake interaction protocol is to discover a remote back-off control unit and establish communication connection; and the second function is to transmit the real-time monitoring information of the adjacent station equipment reverse switching control unit to the adjacent station equipment main control board back to the local equipment so as to inform the local equipment whether the main control board of the adjacent station equipment can be used for redundancy protection.
Step 2: when the main control board/standby main control board in the local station equipment is in fault, the fault is timely discovered by monitoring, and the main control boards of the local station equipment and the adjacent station equipment are informed that the main control board and the adjacent station equipment take over protection;
when the local station equipment back-off control unit monitors that the main control board or the standby control board in the local station equipment has faults, the adjacent station equipment back-off control unit is informed of relevant fault information by means of the handshake interaction protocol in the step 1. The third function of the handshake interaction protocol is to inform the relevant failure information of the main control board of the local station equipment to the reverse switching control unit of the adjacent station equipment and request the adjacent station equipment to take over protection.
The handshake interaction protocol is implemented by the way that both parties perform UDP message communication based on ethernet over the remote optical fiber, and the specific working process is described as follows:
(1) remote discovery process:
a. the station device reverse switching control unit monitors a designated port (such as 8001);
b. a reverse switching control unit of adjacent station equipment sends a 'Hello' message to a designated port, and carries a handshake interaction protocol version number;
c. after receiving the message, the reverse switching control unit of the station equipment replies a Hello message to the reverse switching control unit of the adjacent station equipment if the protocol version number is verified to be consistent, and the message carries a handshake interaction protocol version number;
after receiving the reply message, the reverse switching control unit of the adjacent station equipment verifies that the protocol version numbers are consistent;
the remote discovery process is completed and the connection state is successfully entered.
(2) The real-time monitoring information returning process comprises the following steps:
a. after the discovery process of the adjacent station equipment is completed, the backward switching control unit of the adjacent station equipment keeps periodically sending heartbeats, namely a Heartbeat message, to the backward switching control unit of the local station equipment;
b. the heartbeat message carries a message serial number and monitoring information, wherein the monitoring information comprises the running state (normal/fault) of the main control board of the adjacent station equipment detected by the reverse switching control unit of the adjacent station equipment in the step 1;
c. the reverse switching control unit of the station equipment replies and confirms each heartbeat message and carries the corresponding message serial number of the heartbeat message so as to maintain the connection state with the reverse switching control unit of the adjacent station equipment.
(3) The fault information informs and requests to take over the protection process:
a. when the main control board or the standby main control board of the local station equipment fails, the condition is detected by the reverse switching control unit of the local station equipment according to the step 1;
b. according to the message content returned by the real-time monitoring information, if the adjacent station equipment still has a main control board or a standby main control board which normally works, the local station equipment back-cut control unit sends a burst message (aperiodic message) to the back-cut control unit of the adjacent station equipment, wherein the message carries the fault information of the local station equipment and requests the adjacent station equipment to take over protection;
c. the reverse switching control unit of the adjacent station equipment replies and confirms the burst message and starts to execute the fault reverse switching action of the step 3;
d. and after receiving the reply confirmation message, the local station equipment switching back control unit starts to execute the fault switching back action of the step 3.
And step 3: each main control board of the station equipment carries out fault switching based on a protection strategy, the software/hardware functions realized on the original main control board/standby main control board are switched to the main control board of the adjacent station equipment to realize the fault switching, and the communication between the main control boards of the original station equipment and the main control boards of the main control board/standby main control board is switched to the communication between the main control boards of the main control board of the station equipment and the main control boards of the adjacent station equipment.
The failure of the equipment of the station is switched to the following state: the selection modules of the backward cutting control unit 1 and the backward cutting control unit 2 of the station equipment select the port 3;
the fault reverse switching of the adjacent station equipment is as follows: the adjacent station equipment back cut control unit 3 and the back cut control unit 4 inform the remote ends of the main control panel of the current adjacent station equipment that the exchange board 1 and the exchange board 2 need to be taken over by the adjacent station equipment respectively; the adjacent station device main master control board is designed with a software function for driving the multiple switching boards simultaneously by means of the bridge module of the back-off control unit, and then the path of fig. 3 can initiate driving connection to the switching board 1 and the switching board 2 of the local station device to successfully take over all the switching boards of the local station device.
It should be noted that the failure switching needs each main control board to support the hot plug function, that is, the new main control board replaces the exchange board on the corresponding channel driven by the original main control board without powering on the device again. The hot plug function ensures that the service interruption time is short in the process of fault protection switching-back, and the requirement of service continuous operation is met.
In the following, a group of main control boards, a group of standby control boards, two switch boards (switch board 1, switch board 2), and two switching control units are configured in the network communication device to exemplify normal operation and fault scenarios of the device and corresponding fault protection switching modes, but the adjacent station redundancy protection method provided by the present invention is also applicable to other network communication devices.
Fig. 2 shows a schematic diagram of a path during normal operation in the device of the present invention, wherein an in-place signal transmission line is arranged on a backplane, so that both the main control board and the standby control board are connected to the back-off control unit 1 and the back-off control unit 2; when the main control board and the standby main control board are inserted into the backplane, constant 3.3V in-place signals of the main control board and the standby main control board are output to the backplane, and are transmitted to the inverse switching control unit 1 and the inverse switching control unit 2 through in-place signal transmission lines, and at the moment, the main control board of the local equipment is judged to be in a normal state.
Meanwhile, the reverse switching control unit 1 of the local station equipment is connected with the remote interface 1 of the reverse switching control unit 3 of the adjacent station equipment through the input port 3 of the selection module, the reverse switching control unit 2 of the local station equipment is connected with the remote interface 1 of the reverse switching control unit 4 of the adjacent station equipment through the input port 3 of the selection module, and a handshake protection protocol between the local station equipment and the adjacent station equipment is realized based on an Ethernet UDP message communication mode.
In an exemplary manner, the first and second electrodes are,
the station equipment backward switching control unit 1 and the backward switching control unit 2 monitor the designated port 8001;
the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station device send a Hello message to the designated port 8001, and carry a handshake interaction protocol version number;
after receiving the messages, the reverse switching control unit 1 of the station equipment and the reverse switching control unit 2 of the station equipment verify that the protocol version numbers are consistent, reply a Hello message to the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment, and carry a handshake interaction protocol version number;
after receiving the reply message, the reverse switching control unit of the adjacent station equipment verifies that the protocol version numbers are consistent;
the remote discovery process is completed and the connection state is successfully entered.
After the discovery process of the adjacent station equipment is completed, the back cut control unit 3 and the back cut control unit 4 of the adjacent station equipment keep periodically sending Heartbeat, namely a Heartbeat message to the back cut control unit of the local station equipment;
the heartbeat message carries a message serial number and monitoring information, and the monitoring information contains information that the main control board or the standby control board of the adjacent station equipment normally operates, which is detected by the adjacent station equipment reverse switching control unit 3 and the adjacent station equipment control unit 4;
the local station equipment back-cut control unit 1 and the back-cut control unit 2 reply and confirm each heartbeat message and carry the corresponding message serial number thereof so as to maintain the connection state with the adjacent station equipment back-cut control unit 3 and the adjacent station equipment control unit 4.
At this time, the port 1 of the main master control board sequentially drives the switch board 1 through the port 1 of the selection module and the port 2 of the bridge module of the reverse switching control unit 1; the port 2 of the main control board drives the exchange board 2 through the port of the selection module 1 and the port of the bridge module 2 of the reverse switching control unit 2 in sequence; the standby main control board is in a standby state.
FIG. 3 shows a path schematic of fail-over protection within the present station equipment of the present invention; wherein, an in-place signal transmission line is arranged on the back plate, so that the main control board and the standby main control board are both connected with the back cut control unit 1 and the back cut control unit 2; when the main control board and the standby control board are inserted into the backplane, the constant 3.3V in-place signal of the standby control board is output to the backplane, and the main control board cannot input the constant 3.3V in-place signal to the backplane, and the constant 3.3V in-place signal is transmitted to the back-off control unit 1 and the back-off control unit 2 through the in-place signal transmission line, at this time, it is determined that the main control board of the device at the station is in a fault state, and it is determined that the standby control board is in a normal state.
Meanwhile, the reverse switching control unit 1 of the local station equipment is connected with the remote interface 1 of the reverse switching control unit 3 of the adjacent station equipment through the input port 3 of the selection module, the reverse switching control unit 2 of the local station equipment is connected with the remote interface 1 of the reverse switching control unit 4 of the adjacent station equipment through the input port 3 of the selection module, and a handshake protection protocol between the local station equipment and the adjacent station equipment is realized based on an Ethernet UDP message communication mode.
In an exemplary manner, the first and second electrodes are,
the reverse switching control unit 1 and the reverse switching control unit 2 of the local station device monitor the designated port 8001;
the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station device send a Hello message to the designated port 8001, and carry a handshake interaction protocol version number;
after receiving the messages, the reverse switching control unit 1 of the station equipment and the reverse switching control unit 2 of the station equipment verify that the protocol version numbers are consistent, reply a Hello message to the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment, and carry a handshake interaction protocol version number;
after receiving the reply message, the reverse switching control unit of the adjacent station equipment verifies that the protocol version numbers are consistent;
the remote discovery process is completed and the connection state is successfully entered.
After the discovery process of the adjacent station equipment is completed, the backward switching control unit 3 and the backward switching control unit 4 of the adjacent station equipment keep periodically sending heartbeats, namely a Heartbeat message, to the backward switching control unit of the local station equipment;
the heartbeat message carries a message serial number and monitoring information, and the monitoring information contains information that the main control board or the standby control board of the adjacent station equipment normally operates, which is detected by the adjacent station equipment reverse switching control unit 3 and the adjacent station equipment control unit 4;
the local station equipment back-cut control unit 1 and the back-cut control unit 2 reply and confirm each heartbeat message and carry the corresponding message serial number thereof so as to maintain the connection state with the adjacent station equipment back-cut control unit 3 and the adjacent station equipment control unit 4.
At the moment, the standby main control board is started, and the selection modules of the backward cutting control unit 1 and the backward cutting control unit 2 are all backward cut to the port 2; the port 1 of the standby main control board drives the exchange board 1 through the port 2 of the selection module and the port 2 of the bridge module of the reverse cutting control unit 1 in sequence; the port 2 of the standby main control board drives the switch board 2 through the port 2 of the selection module and the port 2 of the bridge module of the reverse switching control unit 2 in sequence. And the switching board of the local station equipment in the scene is taken over by the standby main control board of the local station equipment.
FIG. 4 shows a path diagram of reverse fault protection between neighboring station devices; wherein, an in-place signal transmission line is arranged on the back plate, so that the main control board and the standby main control board are both connected with the back cut control unit 1 and the back cut control unit 2; when the main control board and the standby control board are inserted into the backplane, the main control board and the standby control board cannot input constant 3.3V in-situ signals to the backplane, and at the moment, it can be judged that the main control board and the standby control board of the local station equipment are both in a fault state. The reverse switching control unit 1 of the local station equipment is connected with the remote interface 1 of the reverse switching control unit 3 of the adjacent station equipment through the input port 3 of the selection module, the reverse switching control unit 2 of the local station equipment is connected with the remote interface 1 of the reverse switching control unit 4 of the adjacent station equipment through the input port 3 of the selection module, and a handshake protection protocol between the local station equipment and the adjacent station equipment is realized based on an Ethernet UDP message communication mode.
Illustratively, the handshake protection protocol between the local station device and the neighboring station device is as follows:
(1) remote discovery process:
a. the station equipment backward switching control unit 1 and the backward switching control unit 2 monitor the designated port 8001;
b. the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station device send a Hello message to the designated port 8001, and carry a handshake interaction protocol version number;
c. after receiving the messages, the reverse switching control unit 1 of the station equipment and the reverse switching control unit 2 of the station equipment verify that the protocol version numbers are consistent, reply a Hello message to the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment, and carry a handshake interaction protocol version number;
after receiving the reply message, the reverse switching control unit of the adjacent station equipment verifies that the protocol version numbers are consistent;
the remote discovery process is completed and the connection state is successfully entered.
(2) The real-time monitoring information returning process comprises the following steps:
a. after the discovery process of the adjacent station equipment is completed, the back cut control unit 3 and the back cut control unit 4 of the adjacent station equipment keep periodically sending Heartbeat, namely a Heartbeat message to the back cut control unit of the local station equipment;
b. the heartbeat message carries a message serial number and monitoring information, and the monitoring information contains information that the main control board or the standby control board of the adjacent station equipment normally operates, which is detected by the adjacent station equipment reverse switching control unit 3 and the adjacent station equipment control unit 4;
c. the local station equipment back-cut control unit 1 and the back-cut control unit 2 reply and confirm each heartbeat message and carry the corresponding message serial number thereof so as to maintain the connection state with the adjacent station equipment back-cut control unit 3 and the adjacent station equipment control unit 4.
(3) The fault information informs and requests to take over the protection process:
a. when detecting that both the main control board and the standby main control board of the local station equipment are in failure, the condition is detected by the local station equipment switching control unit;
b. according to the message content returned by the real-time monitoring information, the main control board/standby main control board which normally works is found in the adjacent station equipment, and the reverse switching control unit 1 and the reverse switching control unit 2 of the local station equipment send burst messages (aperiodic messages) to the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment, wherein the messages carry the fault information of the local station equipment and request the adjacent station equipment to take over protection;
c. after replying and confirming the burst message, the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment execute a fault reverse switching action, and the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment inform the remote ends of the main control panel of the current adjacent station equipment that the exchange board 1 and the exchange board 2 need to be taken over by the adjacent station equipment respectively; the adjacent station equipment main control board is designed with a software function for driving a plurality of paths of exchange boards simultaneously by means of a bridge module of a back-cut control unit, and then a driving connection can be initiated to the exchange board 1 and the exchange board 2 of the local station equipment by means of the path in FIG. 4;
d. after the local station device reverse switching control unit receives the reply acknowledgement message, the selection modules of the local station device reverse switching control unit 1 and the reverse switching control unit 2 both select the port 3, and the main control board of the adjacent station device successfully takes over all the switch boards of the local station device.
In order to support cross-equipment redundancy protection, a network cable or an optical fiber is needed to connect a port of a selection module of the local station equipment back cut control unit with a remote interface of an adjacent station equipment back cut control unit. In this example, the port of the selection module 3 of the local station device reverse cutting control unit 1 may be connected to the remote interface 1 of the adjacent station device reverse cutting control unit 3, and the port of the selection module 3 of the local station device reverse cutting control unit 2 may be connected to the remote interface 1 of the adjacent station device reverse cutting control unit 4.
When the main control board and the standby main control board of the local station equipment both fail, the selection modules of the reverse switching control unit 1 and the reverse switching control unit 2 of the local station equipment are all reversely switched to 3 ports; the port 1 of the main control board of the adjacent station equipment sequentially drives the exchange board 1 of the local station equipment through the port 1 of the selection module 1, the port 1 of the bridge module and the remote interface 1 of the reverse switching control unit 3, and the port 3 of the selection module 3 and the port 2 of the bridge module of the reverse switching control unit 1 of the local station equipment; the ports 2 of the main control board of the adjacent station device sequentially drive the switch board 2 of the local station device through the port 1 of the selection module 1, the port 1 of the bridge module, and the remote interface 1 of the reverse switching control unit 4, and the port 3 of the reverse switching control unit 2 and the port 2 of the bridge module of the local station device. In the scene, the exchange board of the local station equipment is taken over by the current main control board of the adjacent station equipment.
FIG. 5 is a path diagram illustrating a one-to-many protection scenario for fail-over protection between neighboring station devices; the station 1 device and the station 2 device both have a failure in both the main control board and the standby main control board, and the back-off control unit of the station 1 device and the back-off control unit of the station 2 device are respectively connected to the remote interface 1 and the remote interface 2 of the back-off control unit of the same adjacent station device. At this time, the bridge module of the reverse switching control unit of the adjacent station device performs the function that one input end simultaneously drives three output ends, so that the switching boards of the station 1 device and the station 2 device can be simultaneously taken over by the current main control board of the adjacent station device.
Illustratively, a port of a selection module 3 of a reverse cutting control unit 1 of a station 1 device is connected with a remote interface 1 of a reverse cutting control unit 3 of an adjacent station device, and a port of a selection module 3 of a reverse cutting control unit 2 of the station 1 device is connected with a remote interface 1 of a reverse cutting control unit 4 of the adjacent station device; the port of the selection module 3 of the backward switching control unit 5 of the station 2 equipment is connected with the remote interface 2 of the backward switching control unit 3 of the adjacent station equipment, and the port of the selection module 3 of the backward switching control unit 6 of the station 2 equipment is connected with the remote interface 2 of the backward switching control unit 4 of the adjacent station equipment. Site 1 equipment and site 2 equipment are provided with on-site signal transmission lines on back plates, a main control board and a standby main control board are connected with an inverse switching control unit 1 and an inverse switching control unit 2 in the site 1 equipment, and the main control board and the standby main control board are connected with an inverse switching control unit 5 and an inverse switching control unit 6 in the site 2 equipment; when the main control board and the standby control board of the station 1 device or the station 2 device are inserted into the backplane, the main control board and the standby control board cannot input a constant 3.3V in-place signal to the backplane, and at this time, it can be determined that the main control board and the standby control board of the station are both in a fault state. Illustratively, when both the site 1 device and the site 2 device are in a failure state, the back-off control unit 1 of the site 1 device is connected to the remote interface 1 of the back-off control unit 3 of the adjacent site device through the input port 3 of the selection module, and the back-off control unit 2 of the site 1 device is connected to the remote interface 1 of the back-off control unit 4 of the adjacent site device through the input port 3 of the selection module; the back-cut control unit 5 of the station 2 device is connected with the remote interface 2 of the back-cut control unit 3 of the adjacent station device through the input port 3 of the selection module, and the back-cut control unit 6 of the station 2 device is connected with the remote interface 2 of the back-cut control unit 4 of the adjacent station device through the input port 3 of the selection module. A handshake protection protocol between the site 1 device and the adjacent site device and between the site 2 device and the adjacent site device is realized based on an Ethernet UDP message communication mode.
Illustratively, the handshake protection protocol between the local station device and the neighboring station device is as follows:
(1) remote discovery process:
a. the back-cut control unit 1 and the back-cut control unit 2 of the site 1 device and the back-cut control unit 5 and the back-cut control unit 6 of the site 2 device monitor the designated port 8001;
b. the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station device send a Hello message to the designated port 8001, and carry a handshake interaction protocol version number;
c. after receiving the messages, the back-off control unit 1 and the back-off control unit 2 of the site 1 device and the back-off control unit 5 and the back-off control unit 6 of the site 2 device verify that the protocol version numbers are consistent, reply a Hello message to the back-off control unit 3 and the back-off control unit 4 of the adjacent site device, and carry a handshake interaction protocol version number;
after receiving the reply message, the reverse switching control unit of the adjacent station equipment verifies that the protocol version numbers are consistent;
the remote discovery process is completed and the connection state is successfully entered.
(2) The real-time monitoring information returning process comprises the following steps:
a. after the discovery process of the adjacent station equipment is completed, the back cut control unit 3 and the back cut control unit 4 of the adjacent station equipment keep periodically sending heartbeats, namely a 'Heartbeat' message to the back cut control units of the site 1 equipment and the site 2 equipment;
b. the heartbeat message carries a message serial number and monitoring information, and the monitoring information contains information that the main control board or the standby control board of the adjacent station equipment normally operates, which is detected by the adjacent station equipment reverse switching control unit 3 and the adjacent station equipment control unit 4;
c. the back-cut control unit 1 and the back-cut control unit 2 of the site 1 device and the back-cut control unit 5 and the back-cut control unit 6 of the site 2 device reply and confirm each heartbeat message, and carry the corresponding message serial number thereof, so as to maintain the connection state with the back-cut control unit 3 and the adjacent station device control unit 4 of the adjacent station device.
(3) The fault information informs and requests to take over the protection process:
a. when detecting that both the main control board and the standby main control board of the local station equipment are in failure, the condition is detected by the back-off control units of the station 1 equipment and the station 2 equipment;
b. according to the message content returned by the real-time monitoring information, the main control board/standby main control board which normally works is found out to be still available for the adjacent station equipment, the back-cut control unit 1 of the station 1 equipment and the back-cut control unit 5 of the station 2 equipment send burst messages (non-periodic messages) to the back-cut control unit 3 of the adjacent station equipment, and the back-cut control unit 2 of the station 1 equipment and the back-cut control unit 6 of the station 2 equipment send the burst messages (non-periodic messages) to the back-cut control unit 4 of the adjacent station equipment, wherein the messages carry the fault information of the local station equipment and request the adjacent station equipment to take over protection;
c. after replying and confirming the burst message, the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment execute a fault reverse switching action, and the reverse switching control unit 3 and the reverse switching control unit 4 of the adjacent station equipment inform that the far end of the main control panel of the current adjacent station equipment is respectively provided with the exchange board 1 and the exchange board 2 of the station 1 equipment and the exchange board 1 and the exchange board 2 of the station 2 to be taken over by the adjacent station equipment; the adjacent station equipment main control board is designed with a software function for driving a plurality of paths of exchange boards simultaneously by means of a bridge module of a back-cut control unit, and then a path in figure 5 can initiate driving connection to the station 1 equipment exchange board 1 and the station 2 exchange board 2;
d. after the reverse switching control units of the site 1 device and the site 2 device receive the reply acknowledgement message, the reverse switching control unit 1 and the reverse switching control unit 2 of the site 1 device and the reverse switching control unit 5 and the reverse switching control unit 6 of the site 2 device all select the port 3, and the main control board of the adjacent site device successfully takes over all the switch boards of the site 1 device and the site 2 device.
In order to support cross-equipment redundancy protection, network cables or optical fibers are needed to connect the ports of the selection modules of the back-cut control units of the site 1 equipment and the site 2 equipment with the remote interfaces of the back-cut control units of the adjacent site equipment.
When the main control board and the standby main control board of the station 1 equipment both fail, the selection modules of the back-off control unit 1 and the back-off control unit 2 of the station 1 equipment are back-off to 3 ports; a port 1 of a main control board of adjacent station equipment sequentially drives an exchange board 1 of the station 1 equipment through a port 1 of a selection module 1, a port 1 of a bridging module and a remote interface 1 of an inverse switching control unit 3, and a port 3 of a selection module 3 and a port 2 of the inverse switching control unit 1 of the station 1 equipment; the port 2 of the main control board of the adjacent station device sequentially drives the switch board 2 of the station 1 device through the port 1 of the selection module, the port 1 of the bridge module and the zoom-out interface 1 of the reverse switching control unit 4, and the port 3 of the reverse switching control unit 2 and the port 2 of the bridge module of the station 1 device. When the main control board and the standby main control board of the site 2 equipment both fail, the selection modules of the back-off control unit 5 and the back-off control unit 6 of the site 2 equipment are back-off to the 3 ports; a port 1 of a main control board of adjacent station equipment sequentially drives an exchange board 1 of station 2 equipment through a port 1 of a selection module, a port 3 of a bridging module and a remote interface 2 of an inverse switching control unit 3, and a port 3 of a selection module and a port 2 of a bridge module 5 of the station 2 equipment; the port 2 of the main control board of the adjacent station device sequentially drives the switch board 2 of the station 2 device through the port 1 of the selection module, the port 3 of the bridge module and the zoom-out interface 2 of the reverse switching control unit 4, and the port 3 of the selection module and the port 2 of the bridge module 6 of the station 2 device. In the scene, the exchange boards of the site 1 device and the site 2 device are taken over by the current main control board of the adjacent site device.
It should be noted that the number of the main control boards, the number of the switch boards, and the number of the module ports of the back-off control unit of the network communication device are all exemplified in the explanation of the principle, and the technical solution is also applicable to the case where the number of the module ports of the main control boards, the switch boards, and the back-off control units of the network communication device is other.
According to fig. 5, the number of output ports of the bridge module and the number of remote interfaces of the reverse switching control unit determine how many devices of the failed stations can be simultaneously protected by the neighboring station device, that is, the "one-to-many" protection capability of the neighboring station device. At this time, the adjacent station device reverse switching control unit needs to maintain the connection of the handshake interaction protocol with the multiple station device reverse switching control units, and respond to the adjacent station takeover protection request of the multiple station device reverse switching control units.
Similarly, if the number of input ports of the local station device selection module is increased, the local station device control unit may connect remote interfaces of multiple neighboring station device control units, and multiple potential neighboring station devices may provide takeover protection when the local station device fails, that is, the neighboring station devices have "many-to-one" protection capability. At this time, the local station equipment switching control unit needs to maintain the connection of the handshake interaction protocol with the multiple adjacent station equipment switching control units, and when the local station fails, one of the normally working adjacent station equipment is selected randomly or in sequence, and an adjacent station takeover protection request is sent to the switching control unit.
The technical scheme includes the design of newly added hardware and software, and comprises a newly added hardware switching control unit, a software handshake interaction protocol, a function of simultaneously driving a multi-path exchange board by means of a bridge module, and the like. The board card protection means of the cross-equipment is added on the board card protection means in the original equipment, which is equivalent to more standby main control boards when the main control boards are in failure, so that the cross-equipment cooperation capability is fully exerted, the protection mode of the whole network is richer, and the equipment reliability, availability and network stability are improved.
The technical scheme not only keeps the board card redundancy protection function of the local station equipment and the adjacent station equipment, but also increases the board card redundancy protection mode of cross-equipment cooperation, and obtains higher equipment reliability and availability from the network or system level.
The back-cut control unit designed by the technical scheme expands the driving channel between the main control board and the exchange board from the channel in the equipment to the driving channel in the equipment and between the cross equipment, and can support the takeover of the adjacent station through the channel, namely the redundancy protection of the adjacent station.
The back-off control unit of the technical scheme can monitor hardware in-place signals or software heartbeat signals of the main control board or the standby main control board of the equipment in real time so as to find faults.
The handshake interaction protocol between the switching control units of the cross-equipment can realize that: discovering a remote back-off control unit and establishing communication connection; transmitting real-time monitoring information of the adjacent station equipment master control board by the adjacent station equipment back-cut control unit to the local equipment so as to inform the local equipment whether the master control board of the adjacent station equipment can be used for redundancy protection; and informing the adjacent station equipment switching control unit of the relevant fault information of the main control board of the local station equipment.
The back-off control unit in the technical scheme may be constructed by a plurality of independent hardware modules, may be implemented by hardware circuit programming using an FPGA (Field Programmable Gate Array), and may be implemented by designing a dedicated ASIC (Application Specific Integrated circuit) chip.
Although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (15)

1. A method for protecting adjacent station redundancy in frame type network communication equipment is characterized by comprising the following steps:
the local station equipment and the adjacent station equipment are connected to one path of the multi-path main control board through the back-cut control unit to communicate with the exchange board;
the method comprises the steps that a back cut control unit of local station equipment monitors the running state of a multi-channel main control board of the local station equipment, and a back cut control unit of adjacent station equipment monitors the running state of the multi-channel main control board of the adjacent station equipment;
the back cut control unit of the local station equipment and the back cut control unit of the adjacent station equipment synchronize the running state monitoring information of the multi-path main control boards of the local station equipment and the adjacent station equipment; the reverse switching control unit of the local station equipment and the reverse switching control unit of the adjacent station equipment execute a handshake interaction protocol to synchronize the running state monitoring information of the multiple main control boards of the local station equipment and the adjacent station equipment; the reverse switching control unit of the local station equipment and the reverse switching control unit of the adjacent station equipment execute a handshake interaction protocol, including remote discovery and real-time monitoring information return;
finishing the backward switching action according to the running states of the multi-path main control boards of the local station equipment and the adjacent station equipment; when the main control board of the local station equipment fails, the standby main control board on the local station equipment takes over the functions of the main control board; and when the main control board and the standby main control board of the local station equipment both fail, taking over the functions of the main control board of the local station equipment by the current main control board on the adjacent station equipment.
2. The method for protecting adjacent station redundancy in a subrack network communication device according to claim 1, wherein the handshake interaction protocol is executed based on an ethernet UDP packet communication manner.
3. The subrack redundancy protection method in a subrack network communication device according to claim 1,
the remote discovery comprises that the reverse switching control unit of the local station equipment monitors the message sent by the adjacent station equipment, verifies the sent message, replies the reverse switching control unit of the adjacent station equipment after the verification is passed, the reverse switching control unit of the adjacent station equipment verifies the reply, and the remote discovery is completed after the verification is passed;
after the real-time monitoring information is returned, the back-off control unit of the adjacent station equipment periodically sends heartbeats to the back-off control unit of the local station equipment after the far-end discovery is completed, the back-off control unit of the local station equipment replies and confirms each heartbeat message, carries the corresponding message serial number of the heartbeat message, and maintains the connection state of the back-off control unit of the local station equipment and the back-off control unit of the adjacent station equipment.
4. The adjacent station redundancy protection method in the subrack network communication device according to claim 3, wherein the heartbeat message carries a message sequence number and monitoring information of the operation state of the multipath main control board of the adjacent station device.
5. The adjacent station redundancy protection method in the subrack network communication device according to claim 4, wherein the handshake interaction protocol executed by the reverse switching control unit of the local station device and the reverse switching control unit of the adjacent station device further includes fault information notification and request to take over protection;
fault information informs and requests to take over protection: when the reverse switching control unit of the local station equipment monitors that all the multiple main control boards of the local station equipment have faults, if the adjacent station equipment still has a main control board which normally works, the reverse switching control unit of the local station equipment sends a burst message carrying fault information of the local station equipment to the remote reverse switching control unit and requests the adjacent station equipment to take over protection, and the reverse switching control unit of the adjacent station equipment replies and confirms the burst message and executes actions.
6. The adjacent station redundancy protection method in the subrack network communication device according to claim 5, wherein the burst message is an aperiodic message carrying fault information of the multi-path main control board of the local station device.
7. A system for protecting the redundancy of adjacent stations in frame-type network communication equipment is characterized by that,
the station equipment and the adjacent station equipment comprise an exchange board, a multi-path main control board and an inverse switching control unit connected between the exchange board and the multi-path main control board;
the local station equipment and the adjacent station equipment are connected to one path of the multi-path main control board through the back-cut control unit to communicate with the exchange board;
the method comprises the steps that a reverse switching control unit monitors the running state of a multi-channel main control board of local station equipment in the local station equipment, and the reverse switching control unit monitors the running state of the multi-channel main control board of adjacent station equipment in the adjacent station equipment;
the back cut control unit synchronizes the running states of the multi-path main control boards of the local station equipment and the adjacent station equipment in the local station equipment and the adjacent station equipment; the reverse switching control unit of the local station equipment and the reverse switching control unit of the adjacent station equipment execute a handshake interaction protocol to synchronize the running state monitoring information of the multiple main control boards of the local station equipment and the adjacent station equipment; the reverse switching control unit of the local station equipment and the reverse switching control unit of the adjacent station equipment execute a handshake interaction protocol which comprises remote discovery and real-time monitoring information return;
the back cut control unit completes back cut action according to the running states of the multi-path main control boards of the local station equipment and the adjacent station equipment; when the main control board of the local station equipment fails, the standby main control board on the local station equipment takes over the functions of the main control board; and when the main control board and the standby main control board of the local station equipment both fail, taking over the functions of the main control board of the local station equipment by the current main control board on the adjacent station equipment.
8. The subrack network communication device of claim 7, wherein,
the reverse cutting control unit comprises a selection module, a bridging module and a remote interface;
a selection module: selecting one of the multiple main control boards from the local station equipment and the adjacent station equipment to carry out single-path output;
a bridge module: converting the single-path output into multi-path output, driving the exchange board, and establishing the driving and management of the main control board on the exchange board;
a remote interface: and a driving passage is established at a long distance through a network cable or an optical fiber, so that the equipment of the local station is connected with the multi-path output of the equipment of the adjacent station.
9. The adjacent station redundancy protection system in the subrack network communication device of claim 8, wherein in the local station device, the multiple main control boards are connected to the input end of the selection module, the output end of the selection module is connected to the input end of the bridge module, the output end of the bridge module is connected to the switch board, the remote interface is connected to the output end of the bridge module, and the remote interface of the adjacent station device is connected to the input end of the local station device selection module.
10. The adjacent station redundancy protection system in subrack network communication equipment of claim 8 or 9, wherein the reverse switching control unit comprises at least one remote interface, ensuring that a group of local station equipment is connected with at least one group of adjacent station equipment, and a group of adjacent station equipment is connected with at least one group of local station equipment.
11. The adjacent station redundancy protection system in the subrack network communication device of claim 7, wherein the local station device and the adjacent station device further comprise a backplane, and the switch board, the primary main control board and the standby main control board are all disposed on the backplane.
12. The adjacent station redundancy protection system in subrack network communication equipment of claim 11, wherein the back-off control unit monitors the hardware-in-place signal or the software-in-place signal of the main control board through the backplane.
13. The subrack redundancy protection system in subrack network communication equipment of claim 12, wherein the backplane has a bit signal transmission line for receiving the hardware bit signal of the main control board.
14. The subrack redundancy protection system in subrack network communication equipment of claim 13, wherein the backplane is configured with an ethernet channel for receiving software heartbeat signals of the main control board.
15. The system of claim 7, wherein the switch board and the multi-way master control board support hot plug function.
CN201910168712.7A 2019-03-06 2019-03-06 Method and system for protecting adjacent station redundancy in frame type network communication equipment Active CN109981353B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910168712.7A CN109981353B (en) 2019-03-06 2019-03-06 Method and system for protecting adjacent station redundancy in frame type network communication equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910168712.7A CN109981353B (en) 2019-03-06 2019-03-06 Method and system for protecting adjacent station redundancy in frame type network communication equipment

Publications (2)

Publication Number Publication Date
CN109981353A CN109981353A (en) 2019-07-05
CN109981353B true CN109981353B (en) 2022-05-24

Family

ID=67078043

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910168712.7A Active CN109981353B (en) 2019-03-06 2019-03-06 Method and system for protecting adjacent station redundancy in frame type network communication equipment

Country Status (1)

Country Link
CN (1) CN109981353B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111917526B (en) * 2020-07-31 2022-12-23 许继集团有限公司 Extensible cross-redundancy communication interface device and method
CN112202323B (en) * 2020-08-25 2021-08-20 中国南方电网有限责任公司超高压输电公司广州局 Redundancy improving method for flexible direct current valve control protection system
CN115150317B (en) * 2022-06-22 2023-09-12 杭州迪普科技股份有限公司 Routing table entry issuing method and device, electronic equipment and computer readable medium

Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20020028288A (en) * 2000-10-09 2002-04-17 박종섭 Device for connecting automatically exchange main board for channel redundancy board in mobile communication bts system
CN1738214A (en) * 2004-08-17 2006-02-22 华为技术有限公司 Inter-board channel protection device and method
CN1937782A (en) * 2006-08-04 2007-03-28 华为数字技术有限公司 Frame device and its message transmitting method
CN1949701A (en) * 2006-09-21 2007-04-18 华为技术有限公司 Method, system and apparatus for service switching
CN101188539A (en) * 2006-11-21 2008-05-28 日本电气株式会社 Resilient packet ring (RPR) network system, RPR node device, and redundancy method for the same
CN101841735A (en) * 2009-03-17 2010-09-22 华为技术有限公司 Frame-type switch, stack system and fault treatment method after stack
CN101958821A (en) * 2010-10-29 2011-01-26 国电南瑞科技股份有限公司 Transregional double ring network topology of electric power dispatch automation system
CN101989927A (en) * 2010-10-20 2011-03-23 华为技术有限公司 Method, device and equipment for processing redundancy inversion
CN102193543A (en) * 2011-03-25 2011-09-21 上海磁浮交通发展有限公司 Control system based on profibus redundant network topological structure and switching method of control system
CN102866690A (en) * 2012-09-18 2013-01-09 南京国电南自美卓控制系统有限公司 Redundancy switching method among redundancy process control stations in distributed control system
CN102991538A (en) * 2012-12-26 2013-03-27 北京科信华技术有限公司 Device and method for automatically switching point-to-point communication
CN203071960U (en) * 2013-01-31 2013-07-17 长城信息产业股份有限公司 Redundant type switch
CN105072021A (en) * 2015-08-11 2015-11-18 国电南瑞科技股份有限公司 Cross-segment message forwarding method for dispatching automation systems
CN205068150U (en) * 2015-10-20 2016-03-02 中车大连电力牵引研发中心有限公司 Redundant control system
CN107255918A (en) * 2017-06-19 2017-10-17 北京航天发射技术研究所 A kind of remote control computer hot backup redundancy control method for automatically switching
CN206759478U (en) * 2017-03-16 2017-12-15 南京讯汇科技发展有限公司 A kind of network switch system
CN108173691A (en) * 2017-12-27 2018-06-15 杭州迪普科技股份有限公司 A kind of method and device of cross-equipment aggregation

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030233473A1 (en) * 2002-05-07 2003-12-18 International Business Machines Corporation Method for configuring logical connections to a router in a data communication system
CN101252426B (en) * 2007-09-11 2011-05-11 北京东土科技股份有限公司 Method for realizing high-reliability distributed redundant looped network
CN102487357A (en) * 2010-12-03 2012-06-06 杭州华三通信技术有限公司 Control method and device for continuous forwarding after main and standby switching
CN102223304B (en) * 2011-06-16 2014-08-27 杭州华三通信技术有限公司 Method and device for continuously forwarding after main/standby switch
US20180006833A1 (en) * 2016-06-29 2018-01-04 Argela Yazilim ve Bilisim Teknolojileri San. ve Tic. A.S. System and method for controller-initiated simultaneous discovery of the control tree and data network topology in a software defined network
US10581688B2 (en) * 2017-06-19 2020-03-03 Quanta Computer Inc. Methods for automatically configuring multiple chassis link aggregation group (MC-LAG)

Patent Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20020028288A (en) * 2000-10-09 2002-04-17 박종섭 Device for connecting automatically exchange main board for channel redundancy board in mobile communication bts system
CN1738214A (en) * 2004-08-17 2006-02-22 华为技术有限公司 Inter-board channel protection device and method
CN1937782A (en) * 2006-08-04 2007-03-28 华为数字技术有限公司 Frame device and its message transmitting method
CN1949701A (en) * 2006-09-21 2007-04-18 华为技术有限公司 Method, system and apparatus for service switching
CN101188539A (en) * 2006-11-21 2008-05-28 日本电气株式会社 Resilient packet ring (RPR) network system, RPR node device, and redundancy method for the same
CN101841735A (en) * 2009-03-17 2010-09-22 华为技术有限公司 Frame-type switch, stack system and fault treatment method after stack
WO2011150829A1 (en) * 2010-10-20 2011-12-08 华为技术有限公司 Method, apparatus and device for processing redundancy switch
CN101989927A (en) * 2010-10-20 2011-03-23 华为技术有限公司 Method, device and equipment for processing redundancy inversion
CN101958821A (en) * 2010-10-29 2011-01-26 国电南瑞科技股份有限公司 Transregional double ring network topology of electric power dispatch automation system
CN102193543A (en) * 2011-03-25 2011-09-21 上海磁浮交通发展有限公司 Control system based on profibus redundant network topological structure and switching method of control system
CN102866690A (en) * 2012-09-18 2013-01-09 南京国电南自美卓控制系统有限公司 Redundancy switching method among redundancy process control stations in distributed control system
CN102991538A (en) * 2012-12-26 2013-03-27 北京科信华技术有限公司 Device and method for automatically switching point-to-point communication
CN203071960U (en) * 2013-01-31 2013-07-17 长城信息产业股份有限公司 Redundant type switch
CN105072021A (en) * 2015-08-11 2015-11-18 国电南瑞科技股份有限公司 Cross-segment message forwarding method for dispatching automation systems
CN205068150U (en) * 2015-10-20 2016-03-02 中车大连电力牵引研发中心有限公司 Redundant control system
CN206759478U (en) * 2017-03-16 2017-12-15 南京讯汇科技发展有限公司 A kind of network switch system
CN107255918A (en) * 2017-06-19 2017-10-17 北京航天发射技术研究所 A kind of remote control computer hot backup redundancy control method for automatically switching
CN108173691A (en) * 2017-12-27 2018-06-15 杭州迪普科技股份有限公司 A kind of method and device of cross-equipment aggregation

Also Published As

Publication number Publication date
CN109981353A (en) 2019-07-05

Similar Documents

Publication Publication Date Title
CN109981353B (en) Method and system for protecting adjacent station redundancy in frame type network communication equipment
US7613108B2 (en) Redundant supervisory control system, and redundancy switching method of the same
CN102014019B (en) Aggregated link switching method and device
JP4527447B2 (en) Network relay device and control method thereof
CN101594383B (en) Method for monitoring service and status of controllers of double-controller storage system
JP2005521334A (en) Network management system
CN102882704B (en) Link protection method in the soft reboot escalation process of a kind of ISSU and equipment
JPH0351142B2 (en)
CN103560955A (en) Method and device for switching between redundancy devices
US7564780B2 (en) Time constrained failure recovery in communication networks
RU2563142C2 (en) Base station system for railway application and method of creating base station network
JP2008544678A (en) Communication network system
KR100794520B1 (en) Securtioy system and method for controlling a traffic using the same
WO2012000338A1 (en) Method and system for achieving main/standby switch for single boards
JP2009206540A (en) Line terminating equipment, redundant communication system, redundant communication method and redundant communication program
CN109120520B (en) Fault processing method and equipment
CN108667640A (en) Communication means and equipment, network access system
JP2011259064A (en) Optical communication network system, slave station communication device and master station communication device
US7210069B2 (en) Failure recovery in a multiprocessor configuration
CN111181766B (en) Redundant FC network system and method for realizing dynamic configuration of switch
RU2667806C2 (en) Pseudo-wire (pw) dual-homing protection switching method and system
JP2006254096A (en) Ip telephone system
CN112698991A (en) Dual-computer hot standby system and method based on 1 to 1 heartbeat
JPH08251216A (en) Data transmitter
CN110838994A (en) Redundant Ethernet link state monitoring system based on broadcast protocol

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant