WO2022048281A1 - 一种改善链路聚合协议超时的方法和设备 - Google Patents

一种改善链路聚合协议超时的方法和设备 Download PDF

Info

Publication number
WO2022048281A1
WO2022048281A1 PCT/CN2021/103319 CN2021103319W WO2022048281A1 WO 2022048281 A1 WO2022048281 A1 WO 2022048281A1 CN 2021103319 W CN2021103319 W CN 2021103319W WO 2022048281 A1 WO2022048281 A1 WO 2022048281A1
Authority
WO
WIPO (PCT)
Prior art keywords
switch
timeout
response
port
abnormal
Prior art date
Application number
PCT/CN2021/103319
Other languages
English (en)
French (fr)
Inventor
卢彦呈
Original Assignee
苏州浪潮智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 苏州浪潮智能科技有限公司 filed Critical 苏州浪潮智能科技有限公司
Priority to US18/012,928 priority Critical patent/US11683257B1/en
Publication of WO2022048281A1 publication Critical patent/WO2022048281A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/10Active monitoring, e.g. heartbeat, ping or trace-route
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/24Multipath
    • H04L45/245Link aggregation, e.g. trunking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/16Threshold monitoring
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0604Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
    • H04L41/0627Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time by acting on the notification or alarm source
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0659Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/50Testing arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/28Routing or path finding of packets in data switching networks using route fault recovery
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/42Centralised routing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/70Routing based on monitoring results
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0803Configuration setting
    • H04L41/0806Configuration setting for initial configuration or provisioning, e.g. plug-and-play
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/34Signalling channels for network management communication
    • H04L41/344Out-of-band transfers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/50Reducing energy consumption in communication networks in wire-line communication networks, e.g. low power modes or reduced link rate

Definitions

  • the field relates to the computer field, and more particularly to a method and apparatus for improving link aggregation protocol timeouts.
  • Out-of-band management refers to the use of independent management channels for device maintenance. It allows system administrators to remotely monitor and manage servers, routers, network switches and other network devices. In contrast, in-band management refers to the use of conventional data channels to manage devices. The obvious limitation of In-Band Management is that it is susceptible to attack or compromise of the managed device. Out-of-band management addresses this limitation by deploying a management channel that is physically isolated from the data channel.
  • Link aggregation is to aggregate multiple physical ports together to form a logical port to achieve load sharing of outgoing and incoming traffic throughput on each member port.
  • the switch determines network packets according to the port load sharing policy configured by the user. Which member port to send to the peer switch. When the switch detects that the link of one of the member ports is faulty, it stops sending packets on this port, and recalculates the sending port of the packets in the remaining links according to the load sharing policy. After the faulty port recovers, it will serve again. Send and receive port.
  • LACP Link Aggregation Control Protocol
  • the two machines regularly exchange LACPDUs (Link Aggregation Control Protocol Data Unit, Aggregation Link Control Protocol). data unit), the switch can know the aggregation link settings of the peer end and coordinate the appropriate configuration, and can also know whether the peer end is operating normally.
  • LACPDU Link Aggregation Control Protocol Data Unit
  • the switch can know the aggregation link settings of the peer end and coordinate the appropriate configuration, and can also know whether the peer end is operating normally.
  • the establishment of an aggregated link is generally carried out by using the LACP protocol. After the link is established, LACPDU packets are continuously sent on the link. In practical applications, the problem often encountered is that the number of packets that can be sent to the CPU (Central Processing Unit) per second is limited on the port to avoid overloading the CPU, and LACPDU packets need to be sent to the CPU.
  • CPU Central Processing Unit
  • the network status is unstable, such as a broadcast storm or a loop occurs, there will be too many packets on the port that need to be sent to the CPU for processing, such as ARP (Address Resolution Protocol) packets. It is very likely that the LACPDU packets will be squeezed out, causing the port to make a misjudgment and move the port out of the aggregated link, making the port on the aggregated link in an unstable state.
  • ARP Address Resolution Protocol
  • the purpose of the embodiments of the present invention is to propose a method and device for improving the link aggregation protocol timeout.
  • the problem that the stability of in-band management may be affected by in-band data flow can be avoided, and the stability of in-band management can be avoided.
  • the stability of the aggregated link is further improved, and the flexibility of forward compatibility is retained.
  • an aspect of the embodiments of the present invention provides a method for improving the link aggregation protocol timeout, including the following steps:
  • the abnormal port of the second switch is removed from the aggregated link.
  • determining, by the network control center, whether the second switch port is abnormal includes:
  • the first switch In response to the number of timeouts being greater than the threshold number of times, the first switch submits a request for querying the port status of the second switch;
  • the network control center In response to the network control center receiving the request, the network control center obtains the state of the second switch port and determines whether the state of the second switch port is abnormal;
  • the network control center sends the judgment result of whether it is abnormal to the first switch.
  • setting the timeout period and timeout threshold times for the switch to receive LACPDU packets includes:
  • the threshold number of times is three.
  • the method further includes: modifying a field of a reserved block of the LACPDU packet to switch whether the switch aggregated link supports an out-of-band management mode.
  • Another aspect of the embodiments of the present invention also provides a device for improving link aggregation protocol timeout, the device includes:
  • the setting module is configured to set the timeout time and timeout threshold times for the switch to receive the LACPDU packet;
  • a statistics module configured to count the number of timeouts in response to the timeout of the first switch receiving the LACPDU packet from the second switch;
  • the judgment module is configured to judge whether the second switch port is abnormal through the network control center in response to the timeout times being greater than the threshold times and the switch aggregation link being in the out-of-band management mode;
  • the removal module is configured to remove the abnormal port of the second switch from the aggregated link in response to the abnormality of the port of the second switch.
  • the judgment module is further configured to:
  • the first switch In response to the number of timeouts being greater than the threshold number of times, the first switch submits a request for querying the port status of the second switch;
  • the network control center In response to the network control center receiving the request, the network control center obtains the state of the second switch port and determines whether the state of the second switch port is abnormal;
  • the network control center sends the judgment result of whether it is abnormal to the first switch.
  • the setting module is further configured to:
  • the threshold number of times is three.
  • the device further includes a modification module, and the modification module is configured to modify a field of the reserved block of the LACPDU packet to switch whether the switch aggregation link supports the out-of-band management mode.
  • the present invention has the following beneficial technical effects: in the method for improving the link aggregation protocol timeout provided by the embodiment of the present invention, the timeout time and timeout threshold times for receiving LACPDU packets by the switch are set; in response to the first switch receiving the LACPDU from the second switch Packet timeout, count the timeout times; in response to the timeout times greater than the threshold times and the switch aggregation link is in the out-of-band management mode, the network control center determines whether the second switch port is abnormal; in response to the second switch port being abnormal, the second switch is abnormal.
  • the technical solution of removing the aggregated link from the port can avoid the problem that in-band management may be affected by the in-band data flow, which can further improve the stability of the aggregated link and retain the flexibility of forward compatibility.
  • FIG. 1 is a schematic flowchart of a method for improving link aggregation protocol timeout according to an embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a device for improving link aggregation protocol timeout according to an embodiment of the present invention.
  • Figure 1 shows a schematic flow chart of the method.
  • the method may include the following steps:
  • S1 sets the timeout time and timeout threshold times for the switch to receive LACPDU packets.
  • users can set long intervals (30 seconds) or short intervals (1 second) to send LACPDU packets;
  • S3 responds that the number of timeouts is greater than the threshold and the aggregated link of the switch is in out-of-band management mode, and judges whether the second switch port is abnormal through the network control center.
  • an aggregated link status monitoring and control module can be added to the control center, so that the switch can query the aggregated link status of the peer switch through the control center, so that the aggregated link status is not affected by in-band data streams. influence;
  • S4 removes the abnormal port of the second switch from the aggregated link in response to the abnormality of the port of the second switch. After judging that the port of the second switch is abnormal, the network control center may monitor the abnormal port regularly. Add the port to the aggregated link.
  • the technical solution of the present invention can avoid the problem that the stability of in-band management may be affected by the in-band data flow, further improve the stability of the aggregated link, and retain the flexibility of forward compatibility.
  • determining whether the second switch port is abnormal by the network control center includes:
  • the first switch In response to the number of timeouts being greater than the threshold number of times, the first switch submits a request for querying the port status of the second switch;
  • the network control center In response to the network control center receiving the request, the network control center obtains the state of the second switch port and determines whether the state of the second switch port is abnormal;
  • the network control center sends the judgment result of whether it is abnormal to the first switch.
  • the timeout count of the first switch receiving LACPDU packets reaches three times, it submits the aggregated link query request of the second switch to the aggregated link status monitoring and control module of the control center.
  • the aggregated link status monitoring and control module confirms that the If the information in the table expires, let the aggregated link status monitoring and control module send a status request to the second switch, and the second switch returns the link status and then sends the query result back to the first switch.
  • the first switch decides whether to close the port according to the link status of the opposite end.
  • setting the timeout period and timeout threshold times for the switch to receive LACPDU packets includes:
  • the threshold number of times is three.
  • the method further includes: modifying the field of the reserved block of the LACPDU packet to switch whether the switch aggregation link supports the out-of-band management mode.
  • the location where the LACP field is modified is the reserved section. In the standard, this section reserves a total of 50 bytes for extended functions. Therefore, modifying this field will not affect the existing functions.
  • the first byte of the first byte is used. The ones digit is used to judge whether the out-of-band management mode is supported, 1 is for out-of-band management, and 0 is for in-band management. The advantage of this is that it can be forward compatible with old devices that do not support this technology. If the reserved block in the received LACPDU packet is not modified, the in-band management method will continue to be used to maintain the aggregated link state.
  • the technical solution of the present invention can avoid the problem that the stability of in-band management may be affected by the in-band data flow, further improve the stability of the aggregated link, and retain the flexibility of forward compatibility.
  • the above-mentioned programs can be stored in a computer-readable storage medium.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM), and the like.
  • the above computer program embodiments can achieve the same or similar effects as any of the foregoing method embodiments corresponding thereto.
  • the methods disclosed according to the embodiments of the present invention may also be implemented as a computer program executed by the CPU, and the computer program may be stored in a computer-readable storage medium.
  • the computer program is executed by the CPU, the above-mentioned functions defined in the methods disclosed in the embodiments of the present invention are executed.
  • the device 200 includes:
  • Setting module 201 the setting module is configured to set the timeout time and timeout threshold times for the switch to receive the LACPDU packet;
  • the statistics module is configured to count the number of timeouts in response to the timeout of the first switch receiving the LACPDU packet from the second switch;
  • Judging module 203 the judging module is configured to judge whether the second switch port is abnormal through the network control center in response to the timeout times being greater than the threshold times and the switch aggregation link being in the out-of-band management mode;
  • the removal module 204 is configured to, in response to the abnormality of the second switch port, remove the abnormal port of the second switch from the aggregated link.
  • the judgment module is further configured to:
  • the first switch In response to the number of timeouts being greater than the threshold number of times, the first switch submits a request for querying the port status of the second switch;
  • the network control center In response to the network control center receiving the request, the network control center obtains the state of the second switch port and determines whether the state of the second switch port is abnormal;
  • the network control center sends the judgment result of whether it is abnormal to the first switch.
  • the setting module is further configured to:
  • the threshold number of times is three.
  • the device further includes a modification module, and the modification module is configured to modify the field of the reserved block of the LACPDU packet to switch whether the switch aggregation link supports the out-of-band management mode.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Health & Medical Sciences (AREA)
  • Cardiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

本发明提供了一种改善链路聚合协议超时的方法和设备,该方法包括以下步骤:设定交换机接收LACPDU封包的超时时间和超时阈值次数;响应于第一交换机接收来自第二交换机的LACPDU封包超时,统计超时次数;响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断第二交换机端口是否异常;响应于第二交换机端口异常,将第二交换机异常端口移除聚合链路。通过使用本发明的方案,能够避免带内管理可能被带内数据流影响稳定性的问题,能够使聚合链路的稳定性能获得更进一步的提升,并且保留了向前兼容的弹性。

Description

一种改善链路聚合协议超时的方法和设备
本申请要求于2020年9月4日提交中国国家知识产权局,申请号为202010921798.9,发明名称为“一种改善链路聚合协议超时的方法和设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本领域涉及计算机领域,并且更具体地涉及一种改善链路聚合协议超时的方法和设备。
背景技术
带外管理(Out-of-band management)是指使用独立管理信道进行装置维护。它允许系统管理员远程监控和管理服务器、路由器、网络交换机和其他网络装置。相对的,带内管理是指使用常规数据信道来管理装置。带内管理(In-Band Management)的明显限制是这种管理容易受到被管理装置受攻击或损害的影响。带外管理通过部署与数据信道物理隔离的管理信道来解决这个限制。
聚合链路(Link aggregation)是将多个物理埠汇聚在一起,形成一个逻辑端口,以实现出和入流量吞吐量在各成员端口的负荷分担,交换机根据用户配置的端口负荷分担策略决定网络封包从哪个成员埠发送到对端的交换机。当交换机检测到其中一个成员埠的链路发生故障时,就停止在此埠上发送封包,并根据负荷分担策略在剩下的链路中重新计算报文的发送埠,故障埠恢复后再次担任收发埠。
聚合链路控制协议(Link Aggregation Control Protocol,LACP)是一种让交换机之间可以动态形成聚合链路的协议,透过两台机器定时交换LACPDU(Link Aggregation Control Protocol Data Unit,聚合链路控制协议数据单元),交换机可以知道对端的聚合链路设定并且协调出适合的配置,也能知道对端是否正常运作中。目前聚合链路的建立一般是使用LACP协议来进行,链路建成后会持续在链路上发送LACPDU的封包。在实际应用的场合,经常会碰到的问题是,在端口上会限制每秒能够送往CPU(Central Processing Unit,中央处理器)的封包数以避免CPU超载,而LACPDU封包是需要送交CPU做处理的,如果网络状态不稳定,比如发生广播风暴,或是有回路产生,这时端口上会有太多需要送往CPU处理的封包,比如ARP(Address Resolution Protocol,地址解析协议)封包,那就很可能会把LACPDU封包挤掉,使得端口产生误判将端口移出聚合链路,让聚合链路上的端口呈现不稳定的状态。
发明内容
有鉴于此,本发明实施例的目的在于提出一种改善链路聚合协议超时的方法和设备,通过使用本发明的方法,能够避免带内管理可能被带内数据流影响稳定性的问题,能够使聚合链路的稳定性能获得更进一步的提升,并且保留了向前兼容的弹性。
基于上述目的,本发明的实施例的一个方面提供了一种改善链路聚合协议超时的方法,包括以下步骤:
设定交换机接收LACPDU封包的超时时间和超时阈值次数;
响应于第一交换机接收来自第二交换机的LACPDU封包超时,统计超时次数;
响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式, 通过网络控制中心判断第二交换机端口是否异常;
响应于第二交换机端口异常,将第二交换机异常端口移除聚合链路。
根据本发明的一个实施例,响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断第二交换机端口是否异常包括:
响应于超时次数大于阈值次数,第一交换机提交查询第二交换机端口状态的请求;
响应于网络控制中心接收到请求,网络控制中心获取第二交换机端口状态并判断第二交换机端口的状态是否异常;
网络控制中心将是否异常的判断结果发送到第一交换机。
根据本发明的一个实施例,设定交换机接收LACPDU封包的超时时间和超时阈值次数包括:
设定接收LACPDU封包的阈值时间;
响应于接收LACPDU封包的时间大于阈值时间判断为超时一次并重新开始计时。
根据本发明的一个实施例,阈值次数为三次。
根据本发明的一个实施例,方法还包括:修改LACPDU封包的reserved(保留)区块的字段以切换交换机聚合链路是否支持带外管理模式。
本发明的实施例的另一个方面,还提供了一种改善链路聚合协议超时的设备,设备包括:
设定模块,设定模块配置为设定交换机接收LACPDU封包的超时时间和超时阈值次数;
统计模块,统计模块配置为响应于第一交换机接收来自第二交换机的 LACPDU封包超时,统计超时次数;
判断模块,判断模块配置为响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断第二交换机端口是否异常;
移除模块,移除模块配置为响应于第二交换机端口异常,将第二交换机异常端口移除聚合链路。
根据本发明的一个实施例,判断模块还配置为:
响应于超时次数大于阈值次数,第一交换机提交查询第二交换机端口状态的请求;
响应于网络控制中心接收到请求,网络控制中心获取第二交换机端口状态并判断第二交换机端口的状态是否异常;
网络控制中心将是否异常的判断结果发送到第一交换机。
根据本发明的一个实施例,设定模块还配置为:
设定接收LACPDU封包的阈值时间;
响应于接收LACPDU封包的时间大于阈值时间判断为超时一次并重新开始计时。
根据本发明的一个实施例,阈值次数为三次。
根据本发明的一个实施例,设备还包括修改模块,修改模块配置为修改LACPDU封包的reserved区块的字段以切换交换机聚合链路是否支持带外管理模式。
本发明具有以下有益技术效果:本发明实施例提供的改善链路聚合协议超时的方法,通过设定交换机接收LACPDU封包的超时时间和超时阈值次数;响应于第一交换机接收来自第二交换机的LACPDU封包超时,统计 超时次数;响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断第二交换机端口是否异常;响应于第二交换机端口异常,将第二交换机异常端口移除聚合链路的技术方案,能够避免带内管理可能被带内数据流影响稳定性的问题,能够使聚合链路的稳定性能获得更进一步的提升,并且保留了向前兼容的弹性。
附图说明
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的实施例。
图1为根据本发明一个实施例的改善链路聚合协议超时的方法的示意性流程图;以及
图2为根据本发明一个实施例的改善链路聚合协议超时的设备的示意图。
具体实施方式
为使本发明的目的、技术方案和优点更加清楚明白,以下结合具体实施例,并参照附图,对本发明实施例进一步详细说明。
基于上述目的,本发明的实施例的第一个方面,提出了一种改善链路聚合协议超时的方法的一个实施例。图1示出的是该方法的示意性流程图。
如图1中所示,该方法可以包括以下步骤:
S1设定交换机接收LACPDU封包的超时时间和超时阈值次数,根据IEEE802.3ad的标准,用户可以设定使用长间隔(30秒)或短间隔(1秒) 来发送LACPDU封包;
S2响应于第一交换机接收来自第二交换机的LACPDU封包超时,统计超时次数;
S3响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断第二交换机端口是否异常,由于一般数据中心都会有一个网络控制中心(Controller)用来监控数据中心的所有网络设备,可以在控制中心里追加聚合链路状态监控和控制模块,让交换机得以透过控制中心来查询对端交换机的聚合链路状态,使聚合链路的状态不受带内数据流的影响;
S4响应于第二交换机端口异常,将第二交换机异常端口移除聚合链路,网络控制中心在判断第二交换机端口异常后,可以定时监控该异常端口,当该异常端口的状态正常时,重新将该端口加入到聚合链路中。
通过本发明的技术方案,能够避免带内管理可能被带内数据流影响稳定性的问题,能够使聚合链路的稳定性能获得更进一步的提升,并且保留了向前兼容的弹性。
在本发明的一个优选实施例中,响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断第二交换机端口是否异常包括:
响应于超时次数大于阈值次数,第一交换机提交查询第二交换机端口状态的请求;
响应于网络控制中心接收到请求,网络控制中心获取第二交换机端口状态并判断第二交换机端口的状态是否异常;
网络控制中心将是否异常的判断结果发送到第一交换机。当第一交换机接收LACPDU封包超时计数达到三次时,提交第二交换机的聚合链路查 询的请求到控制中心的聚合链路状态监控和控制模块,此时聚合链路状态监控和控制模块确认数据库中的链路状态表,如果表中信息过期则让聚合链路状态监控和控制模块对第二交换机发送状态请求,第二交换机回传链路状态后再把查询结果回传到第一交换机,此时第一交换机再根据对端链路状态决定是否需要关闭端口。
在本发明的一个优选实施例中,设定交换机接收LACPDU封包的超时时间和超时阈值次数包括:
设定接收LACPDU封包的阈值时间;
响应于接收LACPDU封包的时间大于阈值时间判断为超时一次并重新开始计时。
在本发明的一个优选实施例中,阈值次数为三次。
在本发明的一个优选实施例中,方法还包括:修改LACPDU封包的reserved区块的字段以切换交换机聚合链路是否支持带外管理模式。修改LACP字段的位置是reserved区段,在标准中该区段共计保留五十个字节作为扩展功能之用,故修改该字段不会影响既有功能,使用其中第一个字节的第一个位作为是否支持带外管理模式的判断,1为带外管理,0为带内管理。这样的好处是可以向前兼容没有支持本技术的旧设备,如果收到的LACPDU封包里reserved区块没有修改,那就继续使用带内管理的方式维护聚合链路状态。
通过本发明的技术方案,能够避免带内管理可能被带内数据流影响稳定性的问题,能够使聚合链路的稳定性能获得更进一步的提升,并且保留了向前兼容的弹性。
需要说明的是,本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,可以通过计算机程序来指令相关硬件来完成,上述的 程序可存储于计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中存储介质可为磁碟、光盘、只读存储器(Read-Only Memory,ROM)或随机存取存储器(Random Access Memory,RAM)等。上述计算机程序的实施例,可以达到与之对应的前述任意方法实施例相同或者相类似的效果。
此外,根据本发明实施例公开的方法还可以被实现为由CPU执行的计算机程序,该计算机程序可以存储在计算机可读存储介质中。在该计算机程序被CPU执行时,执行本发明实施例公开的方法中限定的上述功能。
基于上述目的,本发明的实施例的第二个方面,提出了一种改善链路聚合协议超时的设备,如图2所示,设备200包括:
设定模块201,设定模块配置为设定交换机接收LACPDU封包的超时时间和超时阈值次数;
统计模块202,统计模块配置为响应于第一交换机接收来自第二交换机的LACPDU封包超时,统计超时次数;
判断模块203,判断模块配置为响应于超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断第二交换机端口是否异常;
移除模块204,移除模块配置为响应于第二交换机端口异常,将第二交换机异常端口移除聚合链路。
在本发明的一个优选实施例中,判断模块还配置为:
响应于超时次数大于阈值次数,第一交换机提交查询第二交换机端口状态的请求;
响应于网络控制中心接收到请求,网络控制中心获取第二交换机端口状态并判断第二交换机端口的状态是否异常;
网络控制中心将是否异常的判断结果发送到第一交换机。
在本发明的一个优选实施例中,设定模块还配置为:
设定接收LACPDU封包的阈值时间;
响应于接收LACPDU封包的时间大于阈值时间判断为超时一次并重新开始计时。
在本发明的一个优选实施例中,阈值次数为三次。
在本发明的一个优选实施例中,设备还包括修改模块,修改模块配置为修改LACPDU封包的reserved区块的字段以切换交换机聚合链路是否支持带外管理模式。
上述实施例,特别是任何“优选”实施例是实现的可能示例,并且仅为了清楚地理解本发明的原理而提出。可以在不脱离本文所描述的技术的精神和原理的情况下对上述实施例进行许多变化和修改。所有修改旨在被包括在本公开的范围内并且由所附权利要求保护。

Claims (10)

  1. 一种改善链路聚合协议超时的方法,其特征在于,包括以下步骤:
    设定交换机接收LACPDU封包的超时时间和超时阈值次数;
    响应于第一交换机接收来自第二交换机的LACPDU封包超时,统计所述超时次数;
    响应于所述超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断所述第二交换机端口是否异常;
    响应于所述第二交换机端口异常,将所述第二交换机异常端口移除所述聚合链路。
  2. 根据权利要求1所述的方法,其特征在于,响应于所述超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断所述第二交换机端口是否异常包括:
    响应于所述超时次数大于所述阈值次数,所述第一交换机提交查询所述第二交换机端口状态的请求;
    响应于所述网络控制中心接收到所述请求,所述网络控制中心获取所述第二交换机端口状态并判断所述第二交换机端口的状态是否异常;
    所述网络控制中心将是否异常的判断结果发送到所述第一交换机。
  3. 根据权利要求1所述的方法,其特征在于,设定交换机接收LACPDU封包的超时时间和超时阈值次数包括:
    设定接收LACPDU封包的阈值时间;
    响应于接收LACPDU封包的时间大于所述阈值时间判断为超时一次并重新开始计时。
  4. 根据权利要求1所述的方法,其特征在于,所述阈值次数为三次。
  5. 根据权利要求1所述的方法,其特征在于,还包括:修改LACPDU封包的reserved区块的字段以切换所述交换机聚合链路是否支持带外管理模式。
  6. 一种改善链路聚合协议超时的设备,其特征在于,所述设备包括:
    设定模块,所述设定模块配置为设定交换机接收LACPDU封包的超时时间和超时阈值次数;
    统计模块,所述统计模块配置为响应于第一交换机接收来自第二交换机的LACPDU封包超时,统计所述超时次数;
    判断模块,所述判断模块配置为响应于所述超时次数大于阈值次数并且交换机聚合链路处于带外管理模式,通过网络控制中心判断所述第二交换机端口是否异常;
    移除模块,所述移除模块配置为响应于所述第二交换机端口异常,将所述第二交换机异常端口移除所述聚合链路。
  7. 根据权利要求6所述的设备,其特征在于,所述判断模块还配置为:
    响应于所述超时次数大于所述阈值次数,所述第一交换机提交查询所述第二交换机端口状态的请求;
    响应于所述网络控制中心接收到所述请求,所述网络控制中心获取所述第二交换机端口状态并判断所述第二交换机端口的状态是否异常;
    所述网络控制中心将是否异常的判断结果发送到所述第一交换机。
  8. 根据权利要求6所述的设备,其特征在于,所述设定模块还配置为:
    设定接收LACPDU封包的阈值时间;
    响应于接收LACPDU封包的时间大于所述阈值时间判断为超时一次并重新开始计时。
  9. 根据权利要求6所述的设备,其特征在于,所述阈值次数为三次。
  10. 根据权利要求6所述的设备,其特征在于,还包括修改模块,所述修改模块配置为修改LACPDU封包的reserved区块的字段以切换所述交换机聚合链路是否支持带外管理模式。
PCT/CN2021/103319 2020-09-04 2021-06-29 一种改善链路聚合协议超时的方法和设备 WO2022048281A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/012,928 US11683257B1 (en) 2020-09-04 2021-06-29 Method and device for improving link aggregation protocol timeout

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010921798.9 2020-09-04
CN202010921798.9A CN112134797B (zh) 2020-09-04 2020-09-04 一种改善链路聚合协议超时的方法和设备

Publications (1)

Publication Number Publication Date
WO2022048281A1 true WO2022048281A1 (zh) 2022-03-10

Family

ID=73847981

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/103319 WO2022048281A1 (zh) 2020-09-04 2021-06-29 一种改善链路聚合协议超时的方法和设备

Country Status (3)

Country Link
US (1) US11683257B1 (zh)
CN (1) CN112134797B (zh)
WO (1) WO2022048281A1 (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112134797B (zh) * 2020-09-04 2022-05-13 苏州浪潮智能科技有限公司 一种改善链路聚合协议超时的方法和设备
CN115150453B (zh) * 2022-06-30 2023-08-11 苏州浪潮智能科技有限公司 一种避免lacp协议超时的方法、系统、设备和存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104917624A (zh) * 2014-03-10 2015-09-16 华耀(中国)科技有限公司 一种链路聚合路径的健康检查系统及方法
US20180176036A1 (en) * 2016-12-20 2018-06-21 Dell Products, Lp Switch Dependent Teaming for High Availability of Virtual Network Functions
CN112134797A (zh) * 2020-09-04 2020-12-25 苏州浪潮智能科技有限公司 一种改善链路聚合协议超时的方法和设备

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101094157B (zh) * 2007-08-20 2011-09-21 中兴通讯股份有限公司 利用链路聚合实现网络互连的方法
JP5513342B2 (ja) * 2010-02-26 2014-06-04 アラクサラネットワークス株式会社 パケット中継装置
CN101945042B (zh) * 2010-09-03 2014-01-01 中兴通讯股份有限公司 一种链路聚合的方法及设备
CN103188172B (zh) * 2011-12-28 2018-03-27 中兴通讯股份有限公司 链路聚合的异常恢复方法和交换设备
CN104219075B (zh) * 2013-05-31 2017-10-24 新华三技术有限公司 一种基于开放流协议的lacp环路检测方法和装置
CN104954271B (zh) * 2014-03-26 2018-11-30 国际商业机器公司 Sdn网络中的数据包处理方法和装置
US9866469B2 (en) * 2015-05-18 2018-01-09 Dell Products L.P. Shutdown response system
US10110469B2 (en) * 2016-07-21 2018-10-23 Cisco Technology, Inc. Detecting and preventing network loops

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104917624A (zh) * 2014-03-10 2015-09-16 华耀(中国)科技有限公司 一种链路聚合路径的健康检查系统及方法
US20180176036A1 (en) * 2016-12-20 2018-06-21 Dell Products, Lp Switch Dependent Teaming for High Availability of Virtual Network Functions
CN112134797A (zh) * 2020-09-04 2020-12-25 苏州浪潮智能科技有限公司 一种改善链路聚合协议超时的方法和设备

Also Published As

Publication number Publication date
CN112134797A (zh) 2020-12-25
CN112134797B (zh) 2022-05-13
US11683257B1 (en) 2023-06-20
US20230198881A1 (en) 2023-06-22

Similar Documents

Publication Publication Date Title
US10257265B2 (en) Redundancy network protocol system
US9270524B2 (en) Method and device for LACP link switching and data transmission
US8451713B2 (en) Special marker message for link aggregation marker protocol
WO2022048281A1 (zh) 一种改善链路聚合协议超时的方法和设备
US8306039B2 (en) Methods and systems for automatic transport path selection for multi-homed entities in stream control transmission protocol
US7903548B2 (en) BFD rate-limiting and automatic session activation
US8027246B2 (en) Network system and node apparatus
US9565113B2 (en) Adaptive link aggregation and virtual link aggregation
US10097465B2 (en) Data transfer between endpoints using a multipath connection
US20130235722A1 (en) Method and system for power control based on data flow awareness in a packet network switch
US6538990B1 (en) Method and system for congestion flow control in a high speed network
CN106712899B (zh) 一种端口速率调整方法及设备
US10200277B2 (en) Influencing path selection during a multipath connection
WO2009009977A1 (fr) Procédé de commutation maître/secours pour un dispositif de routage et système de secours pour un dispositif de routage
WO2009023996A1 (fr) Procédé de mise en œuvre d'une interconnexion de réseau par l'intermédiaire d'une agrégation de liaisons
WO2008080314A1 (fr) Procédé, moteur de retransmission et dispositif de communication pour la commande d'accès aux messages
US20120063304A1 (en) Manageability Tools for Lossless Networks
US7561587B2 (en) Method and system for providing layer-4 switching technologies
WO2015070383A1 (zh) 一种链路聚合的方法、装置和系统
WO2014040466A1 (zh) 控制ap的方法和装置
WO2021093797A1 (zh) 一种信息上报方法和信息处理方法及设备
EP3376717B1 (en) Link discovery method and apparatus
WO2015192306A1 (zh) 软件定义网络中的控制设备、系统和传输通道控制方法
US20080095148A1 (en) Mechanism for automatic global network configuration and switch parameter setting using radius/AAA
WO2017000861A1 (zh) 交换机虚拟局域网中mac地址的学习方法及装置

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21863349

Country of ref document: EP

Kind code of ref document: A1