CN114615148A - Rapid link aggregation failover method - Google Patents

Rapid link aggregation failover method Download PDF

Info

Publication number
CN114615148A
CN114615148A CN202210259048.9A CN202210259048A CN114615148A CN 114615148 A CN114615148 A CN 114615148A CN 202210259048 A CN202210259048 A CN 202210259048A CN 114615148 A CN114615148 A CN 114615148A
Authority
CN
China
Prior art keywords
data
user
intf
abnormal
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210259048.9A
Other languages
Chinese (zh)
Inventor
陈玄
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xinhe Semiconductor Technology Wuxi Co Ltd
Original Assignee
Xinhe Semiconductor Technology Wuxi Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xinhe Semiconductor Technology Wuxi Co Ltd filed Critical Xinhe Semiconductor Technology Wuxi Co Ltd
Priority to CN202210259048.9A priority Critical patent/CN114615148A/en
Publication of CN114615148A publication Critical patent/CN114615148A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0896Bandwidth or capacity management, i.e. automatically increasing or decreasing capacities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/12Arrangements for detecting or preventing errors in the information received by using return channel
    • H04L1/16Arrangements for detecting or preventing errors in the information received by using return channel in which the return channel carries supervisory signals, e.g. repetition request signals
    • H04L1/18Automatic repetition systems, e.g. Van Duuren systems
    • H04L1/1806Go-back-N protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/41Flow control; Congestion control by acting on aggregated flows or links

Abstract

The invention provides a rapid link aggregation failover method, which reduces software burden, reduces packet loss rate, accelerates the overall abnormal recovery speed and can better ensure the stability of a network by automatically switching abnormal link data streams and actively retransmitting abnormal interface data.

Description

Rapid link aggregation failover method
Technical Field
The invention relates to the technical field of communication, in particular to a quick link aggregation failover method.
Background
Link Aggregation (LAG), which refers to binding multiple ethernet physical links together to form a logical Link to implement load sharing of the throughput of the ingress/egress traffic at each member port, and the switch determines from which member port the network packet is sent to the switch at the opposite end according to a port load sharing policy configured by a user. Meanwhile, the reliability of the links can be effectively improved through mutual dynamic backup of the bound links.
The main functions of link aggregation are:
1. the bandwidth is increased: combining the capacities of the plurality of links into one logical link;
2. automatic failover/failback: transferring communications from the failed link to an active link in the aggregation;
3. load balancing: both incoming and outgoing communications are assigned according to a load balancing policy (e.g., source and destination MAC or IP addresses) selected by the user;
4. the management is improved: all interfaces are managed as a unit;
5. network address pool consumption is reduced: one IP address may be assigned to the entire aggregation.
The defects and shortcomings of the prior art are as follows:
when the exchanger detects that one member port link is fault, it stops transmitting package on the port, and recalculates the transmitting port of message in the rest links according to the load sharing strategy, and the fault port is recovered and then acts as the receiving/transmitting port again.
At present, the fault tolerance in link aggregation is realized on the basis of software, when a link has a problem, the dynamic configuration needs to be carried out by the software, an abnormal link is removed from an aggregation member group, and the flow is switched to a normal link. The method increases the burden of software, has larger delay and also increases the network packet loss rate.
In order to solve the defects of the prior art, the invention provides a rapid link aggregation failover method, which automatically switches abnormal link data streams after detecting the link abnormality, reduces software burden by actively retransmitting abnormal interface data, reduces packet loss rate, accelerates the overall abnormal data stream recovery speed, and can better guarantee the stability of a network.
Disclosure of Invention
In order to solve the defects in the prior art, the invention provides a rapid link aggregation failure transfer method, which reduces software burden, reduces packet loss rate, accelerates the recovery speed of the whole abnormal data stream and can better guarantee the stability of a network by automatically switching abnormal link data streams and actively retransmitting abnormal interface data after detecting the link abnormality.
In order to achieve the purpose, the invention adopts the following technical scheme.
In the embodiment of the invention, a method for fast failure transfer of link aggregation is provided, which comprises the following steps:
s1, when the hardware detects the link abnormity, it will send the interface information of the abnormal link to the USER _ INTF module; if the USER _ INTF determines that the interface belongs to a certain LAG group, S2 is continued;
s2 USER _ INTF disconnects the data receiving and sending between the abnormal interface USER side and the MAC; a loopback channel in the USER _ INTF is opened, the DATA of the TX _ DATA is directly connected to the RX _ DATA, and a loopback mark is marked on a related message;
s3, when receiving the interface information of the abnormal link fed back by the USER _ INTF, the LAG member selection module shields the interface corresponding to the abnormal link from LAG member; the USER _ INTF puts the message with the loopback mark at the entrance of the chip again and incorporates the data forwarding process;
s4, when the software detects that the LAG group member link is abnormal, the abnormal member deleting work is started, and the abnormal link recovery is waited; and the USER _ INTF detects that the interface corresponding to the abnormal link is deleted from the LAG member group and enters a waiting interface reset state.
Further, the USER _ INTF disconnects the DATA reception RX _ DATA between the MAC and the USER side corresponding to the abnormal interface, and notifies the USER side of error information.
Further, USER _ INTF disconnects the DATA transmission between the MAC and the USER side to which the abnormal interface corresponds, and discards all DATA received before the eop flag of TX _ DATA is set high.
Further, when detecting that eop flag of TX _ DATA corresponding to the abnormal interface is set high by USER _ INTF, the loopback channel in USER _ INTF is opened, the DATA of TX _ DATA is directly connected to RX _ DATA, and loopback flag is marked on the corresponding message.
Further, the LAG member selection module exists in the USER _ INTF module, and is configured to detect an interface abnormality and perform automatic data stream switching.
In an embodiment of the present invention, a computer device is further provided, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the computer program, the processor implements the foregoing method for fast link aggregation failover.
The invention has the beneficial effects that aiming at the problems in the existing communication system, the invention provides a rapid link aggregation failover method, which reduces the software burden, reduces the packet loss rate, accelerates the overall abnormal recovery speed and can better ensure the stability of the network by automatically switching the abnormal link data stream and actively retransmitting the abnormal interface data.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts. In the drawings:
FIG. 1 is a schematic flow chart of the present invention;
FIG. 2 is a diagram illustrating a data flow of a standard Ethernet interface according to a first embodiment of the present invention;
FIG. 3 is a diagram illustrating an example of an abnormal interface data flow according to a first embodiment of the present invention;
fig. 4 is a diagram illustrating a specific implementation procedure of failover according to a first embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the embodiments of the present invention are further described in detail below with reference to the accompanying drawings. The following embodiments are only used to more clearly illustrate the system method and technical solution of the present invention, and the protection scope of the present application is not limited thereby.
According to the embodiment of the invention, the invention provides a rapid link aggregation failover method, and by detecting the link abnormality, the abnormal link data flow is automatically switched, and the mode of actively retransmitting the abnormal interface data reduces the software burden, reduces the packet loss rate, accelerates the recovery speed of the whole abnormal data flow, and can better ensure the stability of the network.
The principles and spirit of the present invention are explained in detail below with reference to several representative embodiments of the invention.
Fig. 1 is a process flow diagram of a fast link aggregation failover method. As shown in fig. 1, the method comprises the steps of:
s1, when the hardware detects the link abnormity, it will send the interface information of the abnormal link to the USER _ INTF module; if the USER _ INTF determines that the interface belongs to a certain LAG group, S2 is continued;
s2 USER _ INTF disconnects the data receiving and sending between the abnormal interface USER side and the MAC; a loopback channel in the USER _ INTF is opened, the DATA of the TX _ DATA is directly connected to the RX _ DATA, and a loopback mark is marked on a related message;
in specific implementation, the USER _ INTF disconnects the DATA reception RX _ DATA between the MAC and the USER side corresponding to the abnormal interface, and notifies the USER side of error information.
In specific implementation, USER _ INTF disconnects the DATA transmission between the MAC and the USER side corresponding to the abnormal interface, and discards all DATA received before eop of TX _ DATA is set high.
In specific implementation, when USER _ INTF detects that eop flag of TX _ DATA corresponding to an abnormal interface is set high, a loopback channel in USER _ INTF is opened, the TX _ DATA is directly connected to RX _ DATA, and loopback flag is marked on a corresponding message.
S3, when receiving the interface information of the abnormal link fed back by the USER _ INTF, the LAG member selection module shields the interface corresponding to the abnormal link from LAG member; the USER _ INTF puts the message with the loopback mark at the entrance of the chip again and incorporates the data forwarding process;
and the LAG member selection module is arranged in the USER _ INTF module and is used for detecting interface abnormality and automatically switching data streams.
S4, when the software detects that the LAG group member link is abnormal, the abnormal member deleting work is started, and the abnormal link recovery is waited; and the USER _ INTF detects that the interface corresponding to the abnormal link is deleted from the LAG member group and enters a waiting interface reset state.
It should be noted that although the operations of the method of the present invention have been described in the above embodiments and the accompanying drawings in a particular order, this does not require or imply that these operations must be performed in this particular order or that all of the illustrated operations must be performed to achieve desirable results. Additionally or alternatively, certain steps may be omitted, multiple steps combined into one step execution, and/or one step broken down into multiple step executions.
For a more clear explanation of the above-mentioned fast link aggregation failover method, a specific embodiment is described below, but it should be noted that this embodiment is only for better explaining the present invention and should not be construed as an undue limitation to the present invention.
The first embodiment is as follows:
DATA flow of the standard ethernet interface as shown in fig. 2, the user side receives and transmits ethernet messages transmitted by an mac (media Access control) controller through parallel DATA interfaces RX _ DATA and TX _ DATA, where RX _ DATA is used to receive messages and TX _ DATA is used to transmit messages.
When the interface is abnormal, if the TX _ DATA is not switched to a new aggregation group member in time, a large amount of packet loss may be caused.
The invention provides a rapid link aggregation failover method, which adds an LAG member selection module in a USER _ INTF (USER interface) module between an MAC (media access control) and a USER side, and is used for detecting interface abnormality and automatically switching data streams, and the structure of the LAG member selection module is shown in figure 3.
A fast link aggregation failover method, the specific implementation steps are shown in fig. 4, and specifically are as follows:
the method comprises the following steps: powering on the equipment;
step two: each port link is on-line, and the link is opened to enter a working mode;
step one and step two are the normal working state of the link.
Step three: when the hardware detects that the Link is abnormal (Link _ down, Link state down), judging whether the interface belongs to a certain LAG group (marked by LAG _ mask), if so, continuing the step four, otherwise, not operating;
the link exception is wider in representing range, including interface exception, and also belongs to link exception when a scene interface is normal but a network cable connecting the interface is broken, and in these scenes, the related interfaces all belong to a link down state and need to be deleted from the LAG group.
When the Link is abnormal, the interface sends the Link _ down state to the USER _ INTF, and the USER _ INTF stores the state and feeds back the state to the LAG member selection module. And the LAG member selection module deletes the corresponding Link from the LAG group members according to the Link _ down state.
Step four: USER _ INTF disconnects DATA reception between the USER side and the MAC, and if RX _ DATA is receiving DATA at this time, error information is passed to the USER side.
If the RX _ DATA is receiving DATA at this time, marking an error and an end of packet (eop, tail packet) high, and informing the user side that the currently received ethernet message is finished and has an error, and the ethernet message needs to be discarded;
step five: USER _ INTF disconnects DATA transmission between the USER side and the MAC, waits for the end of packet of TX _ DATA to go high, and discards all received DATA until then.
Step six: when detecting that end of packet is set high, USER _ INTF opens a loopback channel (loopback) in USER _ INTF, directly connecting the DATA of TX _ DATA to RX _ DATA, and marking a loopback mark on a message;
the LAG member selection module of the USER _ INTF shields the abnormal link from LAG member members (for example, sets a mask bit) according to the feedback of the USER _ INTF, ensures other data flows of the same LAG group, and does not actively select the current abnormal link;
the message with the loopback mark needs to be re-entered into the data forwarding process at the chip inlet, and a new link is re-selected and sent at the LAG group member selection module.
Step seven: when the software detects that the LAG group member link is abnormal, the abnormal member is deleted, and the recovery of the abnormal link is waited;
it is noted that; step three, the hardware logic detects the abnormal state of the link. And step seven is the status from the CPU software read link. This step is mainly to synchronize the software and hardware, and the CPU can sense the LAG change and the cause of the LAG change.
Step eight: when an LAG member selection module in the USER _ INTF detects that an interface corresponding to the abnormal link is deleted from an LAG member group, entering a waiting interface reset state; and after the interface is recovered, when the software is reset and reset (returning to the step one), clearing the previous abnormal forwarding operation.
The method comprises the following key steps of actively discarding MAC received data and informing a user side of error state to ensure that user logic cannot cause abnormity due to the reception of residual packets; before abnormal LAG group member shielding and member elimination are not implemented, a sending interface is reselected through a loopback channel to reduce packet loss; to implement the operation, the data needs to be actively guided to a loopback channel (the current commercial exchange chip is basically provided with a loopback data path); and actively shielding the LAG group members, and forcibly carrying out rerouting to further reduce the packet loss rate.
The invention has the advantages that aiming at the problems in the existing communication system, the invention provides a rapid link aggregation failover method, which reduces the software burden, reduces the packet loss rate, accelerates the overall abnormal recovery speed and can better ensure the stability of the network by automatically switching the abnormal link data stream and actively retransmitting the abnormal interface data.
The applicant of the present invention has made detailed description and description of the embodiments of the present invention with reference to the drawings, which are included in the specification, the embodiments of the present invention are merely preferred embodiments of the present invention, and the detailed description is only for the purpose of helping the reader to better understand the spirit of the present invention, and not for limiting the scope of the present invention, but rather, any improvement or modification made based on the spirit of the present invention should fall within the scope of the present invention.

Claims (5)

1. A fast link aggregation failover method is characterized in that the method comprises the following steps:
s1, when the hardware detects the link abnormity, it will send the interface information of the abnormal link to the USER _ INTF module; if the USER _ INTF determines that the interface belongs to a certain LAG group, S2 is continued;
s2 USER _ INTF disconnects the data receiving and sending between the abnormal interface USER side and the MAC; a loopback channel in the USER _ INTF is started, the DATA of the TX _ DATA is directly connected to the RX _ DATA, and a loopback mark is marked on a corresponding message;
s3, when receiving the interface information of the abnormal link fed back by the USER _ INTF, the LAG member selection module shields the interface corresponding to the abnormal link from LAG member; the USER _ INTF puts the message with the loopback mark at the entrance of the chip again and incorporates the data forwarding process;
s4, when the software detects that the LAG group member link is abnormal, the abnormal member deleting work is started, and the abnormal link recovery is waited; and the USER _ INTF detects that the interface corresponding to the abnormal link is deleted from the LAG member group and enters a waiting interface reset state.
2. The method of claim 1, wherein the method comprises: and the LAG member selection module is arranged in the USER _ INTF module and is used for detecting interface abnormality and automatically switching data streams.
3. The method according to claim 1, wherein the method comprises: and the USER _ INTF disconnects the DATA reception RX _ DATA between the MAC and the USER side corresponding to the abnormal interface and informs the USER side of error information.
4. The method according to claim 1, wherein the method comprises: USER _ INTF disconnects the DATA transmission TX _ DATA between the MAC and the USER side to which the anomalous interface corresponds and discards all DATA received before the eop flag of TX _ DATA is high.
5. The method according to claim 1, wherein the method comprises: when detecting that eop flag of TX _ DATA corresponding to the abnormal interface is set high, USER _ INTF opens a loopback channel in USER _ INTF, directly connects the DATA of TX _ DATA to RX _ DATA, and marks a loopback flag for the corresponding message.
CN202210259048.9A 2022-03-16 2022-03-16 Rapid link aggregation failover method Pending CN114615148A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210259048.9A CN114615148A (en) 2022-03-16 2022-03-16 Rapid link aggregation failover method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210259048.9A CN114615148A (en) 2022-03-16 2022-03-16 Rapid link aggregation failover method

Publications (1)

Publication Number Publication Date
CN114615148A true CN114615148A (en) 2022-06-10

Family

ID=81862329

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210259048.9A Pending CN114615148A (en) 2022-03-16 2022-03-16 Rapid link aggregation failover method

Country Status (1)

Country Link
CN (1) CN114615148A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101478438A (en) * 2009-01-13 2009-07-08 中兴通讯股份有限公司 Ethernet OAM link discovery method
CN101505270A (en) * 2009-03-24 2009-08-12 神州数码网络(北京)有限公司 Method for aggregated link rapid convergence of distributed switch
CN103795570A (en) * 2014-01-23 2014-05-14 杭州华三通信技术有限公司 Unicast message recovery method and device of stacking switch system with ring topology
CN112165429A (en) * 2020-09-11 2021-01-01 烽火通信科技股份有限公司 Link aggregation convergence method and device for distributed switching equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101478438A (en) * 2009-01-13 2009-07-08 中兴通讯股份有限公司 Ethernet OAM link discovery method
CN101505270A (en) * 2009-03-24 2009-08-12 神州数码网络(北京)有限公司 Method for aggregated link rapid convergence of distributed switch
CN103795570A (en) * 2014-01-23 2014-05-14 杭州华三通信技术有限公司 Unicast message recovery method and device of stacking switch system with ring topology
CN112165429A (en) * 2020-09-11 2021-01-01 烽火通信科技股份有限公司 Link aggregation convergence method and device for distributed switching equipment

Similar Documents

Publication Publication Date Title
US7835265B2 (en) High availability Ethernet backplane architecture
US7639605B2 (en) System and method for detecting and recovering from virtual switch link failures
US20080068985A1 (en) Network redundancy method and middle switch apparatus
US9137099B2 (en) Method, device, and system for link aggregation failure protection
US20090019130A1 (en) Network relay system and control method thereof
US20060209787A1 (en) Load distributing apparatus and load distributing method
EP1261186A1 (en) System and method for TCP connection protection switching
US20020176355A1 (en) Snooping standby router
US7693169B2 (en) Transmission apparatus and frame transmission method
US9628324B2 (en) Openflow switch and failure recovery method in openflow network
US20120236859A1 (en) Method & apparatus for configuring a link aggregation group on a stacked switch
EP2627039B1 (en) Method and device for switching aggregation links
WO2007121633A1 (en) A method for rapid rerouting when multiple ports sharing the transmission links
US8098575B2 (en) Packet relay method and device
CN111800336A (en) Routing transmission implementation method based on multi-channel network link aggregation
US7952994B2 (en) Redundancy method and switch apparatus
CN101895455A (en) Method for fast protecting ring network based on multi-protocol label switching transport profile (MPLS-TP)
CN113645312A (en) Method and device for protecting sub-ring network link based on ERPS protocol
CN114615148A (en) Rapid link aggregation failover method
CN102857423A (en) Service flow transferring method and node in distributed link polymerization system
CN102045259B (en) Packet switching equipment and method for managing customer service
US20080298231A1 (en) Ring node and redundancy method
CN115152192B (en) PCE controlled network reliability
CN114039907A (en) Method for improving link stability based on network card binding
KR101442567B1 (en) Seamless network communication method using frame based routing on the ring topology

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination