CN108768757B - Fault processing method and device and distributed network equipment - Google Patents

Fault processing method and device and distributed network equipment Download PDF

Info

Publication number
CN108768757B
CN108768757B CN201810838987.2A CN201810838987A CN108768757B CN 108768757 B CN108768757 B CN 108768757B CN 201810838987 A CN201810838987 A CN 201810838987A CN 108768757 B CN108768757 B CN 108768757B
Authority
CN
China
Prior art keywords
card
switch
isolation
control board
service board
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810838987.2A
Other languages
Chinese (zh)
Other versions
CN108768757A (en
Inventor
胡滕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Maipu Communication Technology Co Ltd
Original Assignee
Maipu Communication Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Maipu Communication Technology Co Ltd filed Critical Maipu Communication Technology Co Ltd
Priority to CN201810838987.2A priority Critical patent/CN108768757B/en
Publication of CN108768757A publication Critical patent/CN108768757A/en
Application granted granted Critical
Publication of CN108768757B publication Critical patent/CN108768757B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/069Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/55Prevention, detection or correction of errors
    • H04L49/557Error correction, e.g. fault recovery or fault tolerance

Abstract

The invention provides a fault processing method, a fault processing device and distributed network equipment, wherein when a master control board card in the distributed network equipment obtains an isolation application initiated by a switch card, if the switch card is determined to meet a preset isolation condition, all service board cards are informed to stop forwarding cross-card data traffic to the switch card, the isolation of the switch card is completed, so that the switch card cannot participate in cross-card data traffic forwarding between the service board cards, and the cross-card service traffic forwarding between the service board cards cannot pass through the switch card, so that the service traffic of the cross-service board cards even packet loss and even flow break caused by complete fault of a link aggregation group between the switch card and the service board cards are avoided, and the influence on the cross-service board card forwarding traffic is reduced.

Description

Fault processing method and device and distributed network equipment
Technical Field
The present invention relates to the field of data communication, and in particular, to a method and an apparatus for processing a fault, and a distributed network device.
Background
Currently, a mainstream distributed rack network switching device includes a main control board (MPU), a plurality of service boards (LPUs) for performing data stream interaction, and one or more switch cards (SFUs) connected to the service boards. The service flow among the service board cards is realized by forwarding of the switch card, the main control board card is used for controlling and managing the distributed network equipment, the switch card and the service board cards are connected through an internal link aggregation group, and each link aggregation group comprises a plurality of high-speed channels.
In a distributed network device, once multiple high-speed channels between an LPU and an SFU completely fail, the failure may cause a packet loss or even a flow interruption of traffic across LPU boards. Therefore, the distributed network device needs to process the link failure, and reduces the influence on the normal service flow.
Disclosure of Invention
In view of the above, embodiments of the present invention provide a method, an apparatus, and a distributed network device for fault handling, so as to solve the above problem.
In a first aspect, an embodiment of the present invention provides a fault handling method, where the method is applied to a master control board of a distributed network device, and includes: the master control board card acquires an isolation application initiated by a switch card, wherein the isolation application is initiated when the switch card detects that all connecting channels between any service board card and the switch card are abnormal; and when the master control board card determines that the switch card meets the preset isolation condition, informing all the service board cards of stopping forwarding the cross-card data traffic to the switch card.
In a second aspect, an embodiment of the present invention provides a fault handling method, which is applied to a switch card of a distributed network device, and the method includes: when detecting that all connecting channels between the switching card and any service board card are abnormal, the switching card initiates an isolation application to a main control board card; and the switch card acquires an isolation determination command fed back by the main control board card based on a preset isolation condition and stops participating in receiving and sending of cross-card data traffic.
In a third aspect, an embodiment of the present invention provides a fault handling method, which is applied to a service board of a distributed network device, where the method includes: the service board card obtains a notice from the main control board card for stopping forwarding the cross-card data traffic to the switch card meeting the isolation condition; and the service board card stops forwarding the cross-card data traffic to the switch card.
In a fourth aspect, an embodiment of the present invention provides a fault handling method, which is applied to a distributed network device, and the method includes: when detecting that all connection channels between the switching card and any service board card are abnormal, the switching card initiates an isolation application to the main control board card; the master control board card acquires an isolation application sent by the switch card, and notifies all service board cards of stopping forwarding the cross-card data traffic to the switch card when the switch card is determined to meet a preset isolation condition; and the service board card acquires a notice of stopping forwarding the cross-card data traffic to the switch card meeting the isolation condition from the main control board card, and stops forwarding the cross-card data traffic to the switch card.
In a fifth aspect, an embodiment of the present invention provides a fault handling apparatus, which is applied to a main control board of a distributed network device, where the apparatus includes: the system comprises an acquisition module, a processing module and a processing module, wherein the acquisition module is used for acquiring an isolation application initiated by a switch card, and the isolation application is initiated by the switch card when the switch card detects that all connecting channels between any service card and the switch card are abnormal; and the judgment execution module is used for informing all the service board cards of stopping forwarding the cross-card data traffic to the switch card when the switch card is determined to meet the preset isolation condition.
In a sixth aspect, an embodiment of the present invention provides a fault handling apparatus, which is applied to a switch card of a distributed network device, and the apparatus includes: the system comprises an initiating module, a main control board card and a service board card, wherein the initiating module is used for initiating an isolation application to the main control board card when detecting that all connecting channels between the main control board card and any service board card are abnormal; and the execution module is used for acquiring an isolation determination command fed back by the main control board card based on a preset isolation condition and stopping participating in the transceiving of the cross-card data traffic.
In a seventh aspect, an embodiment of the present invention provides a fault handling apparatus, which is applied to a service board of a distributed network device, where the apparatus includes: the acquisition module is used for acquiring a notice from the main control board card for stopping forwarding the cross-card data traffic to the switch card meeting the isolation condition; and the forwarding stopping module is used for stopping forwarding the cross-card data traffic to the switch card.
In an eighth aspect, an embodiment of the present invention provides a distributed network device, including a master control board, a service board connected to the master control board, and a switch card, where the service board is connected to at least one switch card, and the switch card is configured to initiate an isolation application to the master control board when all connection channels with any service board are detected to be abnormal; the master control board card is used for acquiring the isolation application and informing all the service board cards of stopping forwarding the cross-card data traffic to the switch card when the switch card is determined to meet the preset isolation condition; the service board card is used for stopping forwarding the cross-card data traffic to the switch card after acquiring the notification of stopping forwarding the cross-card data traffic to the switch card from the main control board card.
Compared with the prior art, according to the fault processing method, the fault processing device and the distributed network equipment provided by the embodiments of the invention, when the master control board card in the distributed network equipment obtains the isolation application initiated by the switch card, if the switch card is determined to meet the preset isolation condition, all the service board cards are notified to stop forwarding the cross-card data traffic to the switch card, and the isolation of the switch card is completed, so that the switch card cannot participate in the cross-card data traffic forwarding between the service board cards, and then the service board cards cannot pass through the switch card when the cross-board service traffic forwarding is performed between the service board cards, thereby avoiding the service traffic packet loss and even the flow interruption of the cross-service board cards when the link aggregation group between the switch card and the service board cards completely fails, and reducing the influence on the cross-service board card forwarding traffic.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
FIG. 1a is a schematic diagram of a distributed network device according to an embodiment of the present invention;
fig. 1b is a schematic diagram illustrating a failure of a high-speed channel inside a distributed network device according to an embodiment of the present invention;
fig. 2 is a flowchart of a fault handling method according to an embodiment of the present invention;
fig. 3 is a second flowchart of a fault handling method according to an embodiment of the present invention;
fig. 4 is a third flowchart of a fault handling method according to an embodiment of the present invention;
FIG. 5 is a fourth flowchart of a fault handling method according to an embodiment of the present invention;
FIG. 6 is a fifth flowchart of a fault handling method according to an embodiment of the present invention;
fig. 7 is a block diagram of a fault handling apparatus according to an embodiment of the present invention;
fig. 8 is a second block diagram of a fault handling apparatus according to the embodiment of the present invention;
fig. 9 is a third block diagram of a fault handling apparatus according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
First, the terms involved in the embodiments of the present invention will be briefly described:
MPU: a master control board card of network equipment with a distributed rack structure.
SFU: and (4) exchanging cards.
An LPU: and (4) a service board card.
The protection protocol is as follows: a protocol for maintaining backplane high speed channel link status.
SFU isolation: the SFU is no longer involved in forwarding across the service boards.
SFU isolation recovery: and enabling the isolated SFU to participate in forwarding across the service board card again.
Aiming at the current mainstream distributed rack network switching equipment, the service flow among all the service board cards is realized by forwarding of the switch card, the main control board card is used for controlling and managing the distributed network equipment, the switch card and the service board cards are connected through an internal link aggregation group, and each link aggregation group comprises a plurality of high-speed channels.
However, in a distributed network device formed by LPUs and SFUs, once multiple high-speed channels between the LPUs and the SFUs are completely blocked, the fault may cause packet loss or even flow interruption of traffic flow across LPU boards. Therefore, the distributed network device needs to process the link failure, and reduces the influence on the normal service flow.
In order to solve the above problem, embodiments of the present invention provide a fault processing method, an apparatus, and a distributed network device.
Fig. 1a is a schematic diagram of a distributed network device 100 according to an embodiment of the present invention. The distributed network device 100 may include a main control board card, a service board card connected to the main control board card, and a switch card, where the service board card is connected to at least one switch card through an internal link aggregation group, and each link aggregation group includes multiple high-speed channels.
It should be noted that the fault handling method and apparatus provided in the embodiment of the present invention are implemented based on the distributed network device 100.
Referring to fig. 2, fig. 2 is a flowchart of a fault handling method according to an embodiment of the present invention, where the method is described from the perspective of a master board of a distributed network device. The flow shown in fig. 2 will be described in detail below, and the method includes:
step S110: the master control board card obtains an isolation application initiated by the switch card, and the isolation application is initiated when the switch card detects that any service board card and all connecting channels between the switch cards are abnormal.
When the switch card SFU in the distributed network device sends a data packet to the service board LPU through the keep-alive protocol, the LPU can feed back the data packet to the SFU after receiving the data packet. When the SFU receives the feedback data packet, the characterization internal high-speed channel connection is normal, and when the feedback data packet cannot be received, the characterization internal high-speed channel connection is completely abnormal.
When the SFU detects that all internal high-speed channels connected with a certain service board LPU and the SFU are abnormal, because the keep-alive protocol is a bidirectional protocol, the LPU side also detects that all internal high-speed channels connected with the SFU are also abnormal.
As shown in FIG. 1b, the internal high speed path between SFU0 and LPU0 is completely failed. In this case, the SFU0 may initiate an isolation application to the master board MPU. After acquiring the isolation application, the MPU may determine whether the SFU0 initiating the isolation application needs to be isolated based on preset isolation conditions.
Step S120: and when the master control board card determines that the switch card meets the preset isolation condition, informing all the service board cards of stopping forwarding the cross-card data traffic to the switch card.
As mentioned in the foregoing, after acquiring the isolation application, the MPU may determine whether the SFU initiating the isolation application needs to be isolated based on a preset isolation condition.
The preset isolation condition may be:
besides the SFU0 initiating the isolation application, the distributed network device further includes at least one other switch card, for example, SFU1, SFU2 … …, and the like, and a connection channel in a normal connection state exists between the service board LPU0 having a complete failure of the internal high-speed channel between the service board LPU0 and the SFU0 and at least one other switch card, for example, a connection channel in a normal connection state exists between the LPU0 and the SFU 1. It should be noted that the normal connection channel herein does not necessarily mean that the connection states of all connection channels in the internal link aggregation group are normal, as long as there is at least one normal connection channel.
When the MPU determines that the distributed network device where the SFU0 and the SFU0 initiating the isolation application are located meets the preset isolation condition, that is, it is determined that the SFU0 meets the preset isolation condition, and then may notify all LPUs included in the distributed network device to stop forwarding the cross-card data traffic to the SFU 0.
When all the LPUs stop forwarding the cross-card data traffic to the SFU0, isolation of the SFU0 is completed, so that the SFU0 cannot participate in cross-card data traffic forwarding between the LPUs, and when cross-board service traffic forwarding is performed between the LPUs, the cross-board service traffic will not pass through the SFU0, thereby avoiding packet loss and even flow interruption of the service traffic across the LPUs when a link aggregation group between the SFU0 and the LPUs completely fails, and reducing influence on the cross-LPU forwarding traffic.
As an optional implementation, please refer to fig. 3, where the main control board MPU notifies all the service boards LPUs to stop forwarding the cross-card data traffic to the switch card SFU0, and after completing isolation of the SFU0, in order to recover the traffic forwarding across the service boards as soon as possible and minimize loss due to internal high-speed channel failure, the method may further include:
step S130: and when the master control board card determines that the switch card meets the isolation removing condition, informing all the service board cards to resume forwarding the cross-card data traffic to the switch card.
As an optional implementation manner, the isolation release condition may be:
and the connection channels between the isolated switch card SFU and all the LPUs are recovered to be normal.
In this embodiment, when the MPU determines that the connection channels between the previously isolated SFU0 and the LPU0 return to normal, and the connection channels with all LPUs are in a normal state, it may be determined that the SFU0 satisfies the isolation release condition.
As another optional implementation, the isolation release condition may be:
the isolated switch card SFU is pulled out or the service board card LPU at the opposite end of the isolated switch card SFU is pulled out.
When the SFU or the LPU is unplugged, the MPU may determine that the corresponding SFU is unplugged or determine that the corresponding LPU is unplugged, when the MPU may acquire a card unplugging interruption event generated by the corresponding SFU or the corresponding LPU.
In such an embodiment, when the MPU determines that the previously isolated SFU0 was unplugged, or when the MPU determines that the service card LPU0 on the opposite side of the previously isolated SFU0, it may be determined that the switch card SFU0 satisfies the isolation release condition.
As another alternative, the isolation release condition may be:
and the connection channels between the service board LPU at the opposite end of the previously isolated SFU and the rest switch cards in the distributed network equipment comprising the SFU are all disconnected.
In this embodiment, when the MPU determines that all connection channels between the LPU0 of the opposite end of the previously isolated SFU0 and the remaining switch cards in the distributed network device including SFU0 are disconnected, it may be determined that SFU0 satisfies the isolation release condition.
When the MPU determines that the isolated SFU0 satisfies the isolation release condition, the MPU may notify all LPUs to forward the cross-card data traffic to the SFU0, so that the SFU0 participates in the cross-card data traffic transmission and reception, and the loss caused by the internal high-speed channel failure is minimized.
In the fault handling method provided by the embodiment of the present invention, when a master control board in a distributed network device obtains an isolation application initiated by a switch card, if it is determined that the switch card satisfies a preset isolation condition, all service boards are notified to stop forwarding cross-card data traffic to the switch card, and isolation of the switch card is completed, so that the switch card cannot participate in cross-card data traffic forwarding between the service boards, and thus when inter-board service traffic forwarding is performed between the service boards, the switch card cannot be passed through, thereby avoiding packet loss and even flow break of service traffic of the cross-service boards when a link aggregation group between the switch card and the service boards completely fails, and reducing an influence on the flow forwarding of the cross-service boards.
Referring to fig. 4, an embodiment of the present invention further provides a fault handling method, which is described from the perspective of a switch card of a distributed network device. The flow shown in fig. 4 will be explained, and the method includes:
step S210: and when detecting that all the connecting channels between the switching card and any service board card are abnormal, the switching card initiates an isolation application to the main control board card.
Step S220: and the switch card acquires an isolation determination command fed back by the main control board card based on a preset isolation condition and stops participating in receiving and sending of cross-card data traffic.
In addition, referring to fig. 5, an embodiment of the present invention further provides a fault handling method, which is described from the perspective of a service board of a distributed network device. The flow shown in fig. 5 will be explained, and the method includes:
step S310: and the service board card acquires a notice from the main control board card for stopping forwarding the cross-card data traffic to the switch card meeting the isolation condition.
Step S320: and the service board card stops forwarding the cross-card data traffic to the switch card.
In addition, referring to fig. 6, an embodiment of the present invention further provides a fault handling method, which is implemented by distributed network devices. The method comprises the following steps:
step S410: and when detecting that all the connecting channels between the switching card and any service board card are abnormal, the switching card initiates an isolation application to the main control board card.
Step S420: the master control board card obtains the isolation application sent by the switch card, and when the switch card is determined to meet the preset isolation condition, all the service board cards are informed to stop forwarding the cross-card data traffic to the switch card.
Step S430: and the service board card acquires a notice of stopping forwarding the cross-card data traffic to the switch card meeting the isolation condition from the main control board card, and stops forwarding the cross-card data traffic to the switch card.
In addition, referring to fig. 7, which corresponds to the fault processing method shown in fig. 2, an embodiment of the present invention further provides a fault processing apparatus 400, which is applied to a master control board of a distributed network device, where the apparatus includes:
an obtaining module 410, configured to obtain an isolation application initiated by a switch card, where the isolation application is initiated when the switch card detects that all connection channels between any service board card and the switch card are abnormal;
a judgment execution module 420, configured to notify all the service board cards to stop forwarding the cross-card data traffic to the switch card when it is determined that the switch card meets the preset isolation condition.
In this embodiment, please refer to the contents described in the embodiments shown in fig. 1 to fig. 3 for the process of implementing each function of each functional module of the fault handling apparatus 400, which is not described herein again.
In addition, referring to fig. 8, corresponding to the fault handling method shown in fig. 4, an embodiment of the present invention further provides a fault handling apparatus 500, which is applied to a switch card of a distributed network device, where the apparatus includes:
an initiating module 510, configured to initiate an isolation application to a master control board when all connection channels with any service board are detected to be abnormal;
and the execution module 520 is configured to acquire an isolation determination command fed back by the main control board card based on a preset isolation condition, and stop participating in the transceiving of the cross-card data traffic.
In this embodiment, please refer to the contents described in the embodiments shown in fig. 1 to fig. 4 for the process of implementing each function of each functional module of the fault handling apparatus 500, which is not described herein again.
In addition, referring to fig. 9, which corresponds to the fault processing method shown in fig. 5, an embodiment of the present invention further provides a fault processing apparatus 600, which is applied to a service board of a distributed network device, where the apparatus includes:
an obtaining module 610, configured to obtain, from the master control board, a notification to stop forwarding the cross-card data traffic to the switch card that meets the isolation condition;
a stop forwarding module 620, configured to stop forwarding the cross-card data traffic to the switch card.
In this embodiment, please refer to the contents described in the embodiments shown in fig. 1 to fig. 5 for the process of implementing each function of each functional module of the fault handling apparatus 500, which is not described herein again.
In addition, referring to fig. 1, an embodiment of the present invention further provides a distributed network device 100, where the distributed network device 100 may include a master board, a service board connected to the master board, and a switch card. The service board card and at least one switch card are connected through an internal link aggregation group, and each link aggregation group comprises a plurality of high-speed channels.
The switch card is used for initiating an isolation application to the main control board card when detecting that all connecting channels between the switch card and any service board card are abnormal; the master control board card is used for acquiring the isolation application and informing all the service board cards of stopping forwarding the cross-card data traffic to the switch card when the switch card is determined to meet the preset isolation condition; the service board card is used for stopping forwarding the cross-card data traffic to the switch card after acquiring the notification of stopping forwarding the cross-card data traffic to the switch card from the main control board card.
In summary, the fault processing method, the fault processing apparatus, and the distributed network device provided in the embodiments of the present invention are a fault processing method, an apparatus, and a distributed network device, where when a main control board in a distributed network device obtains an isolation application initiated by a switch card, if it is determined that the switch card meets a preset isolation condition, informing all the service board cards to stop forwarding the cross-card data traffic to the switch card, completing the isolation of the switch card, so that the switch card can not participate in the inter-board service traffic forwarding between the service board cards, and the inter-board service traffic forwarding between the service board cards can not pass through the switch card, therefore, the phenomenon that when the link aggregation group between the switch card and the service board card completely breaks down, the service flow of the cross-service board card loses packets and even breaks off is avoided, and the influence on the flow forwarded by the cross-service board card is reduced.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method can be implemented in other ways. The apparatus embodiments described above are merely illustrative, and for example, the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In addition, the functional modules in the embodiments of the present invention may be integrated together to form an independent part, or each module may exist separately, or two or more modules may be integrated to form an independent part.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes. It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention. It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A fault processing method is characterized in that the method is applied to a main control board card of distributed network equipment, and the method comprises the following steps:
the master control board card acquires an isolation application initiated by a switch card, wherein the isolation application is initiated when the switch card detects that all connection channels between any service board card and the switch card are abnormal;
when the main control board card determines that the switch card meets the preset isolation condition, all the service board cards are informed to stop forwarding the cross-card data traffic to the switch card,
the preset isolation conditions are as follows: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards.
2. The method of claim 1, wherein after the notifying all traffic boards to stop forwarding cross-card data traffic to the switch card, the method further comprises:
and when the master control board card determines that the switch card meets the isolation removing condition, informing all the service board cards to resume forwarding the cross-card data traffic to the switch card.
3. The method of claim 2, wherein the determining, by the master board card, that the switch card satisfies the isolation release condition comprises:
the master control board card judges whether the connection channels between the switch card and any service board card are normal or not; if so, the master control board card determines that the switch card meets the isolation removing condition; or
The master control board card judges whether any service board card is pulled out or whether the switch card is pulled out; if so, the main control board card determines that the switch card meets the isolation removing condition; or
The master control board card judges whether connection channels between any service board card and the rest switch cards in the distributed network equipment including the switch cards are all disconnected; if so, the main control board card determines that the switch card meets the isolation removing condition.
4. A method for processing a failure, wherein the method is applied to a switch card of a distributed network device, and the method comprises:
when detecting that all connecting channels between the switching card and any service board card are abnormal, the switching card initiates an isolation application to a main control board card;
the switch card acquires an isolation determination command fed back by the main control board card based on a preset isolation condition, stops participating in the transceiving of the cross-card data traffic,
the preset isolation conditions are as follows: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards.
5. A fault processing method is characterized in that the method is applied to a service board card of distributed network equipment, and comprises the following steps:
the service board card acquires a notice of stopping forwarding the cross-card data traffic to the switch card meeting the isolation condition from the main control board card, wherein the switch card initiates an isolation application to the main control board when detecting that all connecting channels between any service board card and the switch card are abnormal, and the main control board initiates a notice of stopping forwarding the cross-card data traffic to the switch card to all the service board cards when determining that the switch card meets the preset isolation condition; the preset isolation conditions are as follows: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards;
and the service board card stops forwarding the cross-card data traffic to the switch card.
6. A fault handling method applied to a distributed network device, the method comprising:
when detecting that all connecting channels between the switch card and any service board card are abnormal, the switch card initiates an isolation application to a main control board card;
the master control board card acquires an isolation application sent by the switch card, and notifies all service board cards of stopping forwarding the cross-card data traffic to the switch card when the switch card is determined to meet a preset isolation condition;
the service board card obtains a notice of stopping forwarding the cross-card data traffic to the switch card meeting the isolation condition from the main control board card, stops forwarding the cross-card data traffic to the switch card,
the preset isolation conditions are as follows: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards.
7. The utility model provides a fault handling device which characterized in that, is applied to the master control integrated circuit board of distributed network equipment, the device includes:
the system comprises an acquisition module, a processing module and a processing module, wherein the acquisition module is used for acquiring an isolation application initiated by a switch card, and the isolation application is initiated by the switch card when the switch card detects that all connecting channels between any service card and the switch card are abnormal;
a judgment execution module for notifying all the service board cards to stop forwarding the cross-card data traffic to the switch card when the switch card is determined to meet the preset isolation condition,
the preset isolation conditions are as follows: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards.
8. A fault handling apparatus, applied to a switch card of a distributed network device, the apparatus comprising:
the initiating module is used for initiating an isolation application to the main control board card when detecting that all connecting channels between the switch card and any service board card are abnormal;
the execution module is configured to acquire an isolation determination command fed back by the main control board card based on a preset isolation condition, and stop participating in the transceiving of the cross-card data traffic, where the preset isolation condition is: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards.
9. A fault handling device is applied to a service board card of a distributed network device, and the device comprises:
the system comprises an acquisition module, a main control board and a switching card, wherein the acquisition module is used for acquiring a notice of stopping forwarding the cross-card data traffic to the switching card meeting an isolation condition from the main control board, the switching card initiates an isolation application to the main control board when detecting that all connecting channels between any service board card and the switching card are abnormal, and the main control board initiates a notice of stopping forwarding the cross-card data traffic to the switching card to all service board cards when determining that the switching card meets the preset isolation condition; the preset isolation conditions are as follows: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards;
and the forwarding stopping module is used for stopping forwarding the cross-card data traffic to the switch card.
10. A distributed network device is characterized by comprising a main control board card, a service board card connected with the main control board card and a switch card, wherein the service board card is connected with at least one switch card,
the switch card is used for initiating an isolation application to the main control board card when detecting that all connecting channels between the switch card and any service board card are abnormal;
the master control board card is used for acquiring the isolation application and informing all the service board cards of stopping forwarding the cross-card data traffic to the switch card when the switch card is determined to meet the preset isolation condition;
the service board card is used for stopping forwarding the cross-card data traffic to the switch card after acquiring the notice of stopping forwarding the cross-card data traffic to the switch card from the main control board card,
the preset isolation conditions are as follows: the distributed network equipment also comprises other switch cards except the switch card, and a connection channel with a normal connection state exists between any service board card and at least one other switch card in the other switch cards.
CN201810838987.2A 2018-07-26 2018-07-26 Fault processing method and device and distributed network equipment Active CN108768757B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810838987.2A CN108768757B (en) 2018-07-26 2018-07-26 Fault processing method and device and distributed network equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810838987.2A CN108768757B (en) 2018-07-26 2018-07-26 Fault processing method and device and distributed network equipment

Publications (2)

Publication Number Publication Date
CN108768757A CN108768757A (en) 2018-11-06
CN108768757B true CN108768757B (en) 2021-10-08

Family

ID=63971527

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810838987.2A Active CN108768757B (en) 2018-07-26 2018-07-26 Fault processing method and device and distributed network equipment

Country Status (1)

Country Link
CN (1) CN108768757B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109347685B (en) * 2018-11-21 2021-01-26 新华三技术有限公司 Network equipment, logic device and notification information transmission method
CN109302359A (en) * 2018-11-27 2019-02-01 迈普通信技术股份有限公司 A kind of network equipment, link converting method and device
CN110098962B (en) * 2019-04-25 2022-06-07 新华三技术有限公司 Traffic transmission method and device, electronic equipment and readable storage medium
CN112003782B (en) * 2020-09-02 2022-05-24 新华三信息安全技术有限公司 Fault processing method, device, network equipment and machine readable storage medium
CN112187679B (en) * 2020-09-22 2022-11-11 锐捷网络股份有限公司 Message processing method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101047538A (en) * 2006-03-31 2007-10-03 上海贝尔阿尔卡特股份有限公司 Seamless switch-over system of data link based on Ethernet exchange and its method
CN101895423A (en) * 2010-07-15 2010-11-24 华为技术有限公司 Data transmission method and system in Ethernet
CN104168134A (en) * 2014-07-21 2014-11-26 上海寰创通信科技股份有限公司 Distributed wireless hotspot control system with uninterrupted data service after single-board abnormity
CN105357114A (en) * 2014-08-21 2016-02-24 杭州迪普科技有限公司 Distributed network equipment
US9594716B1 (en) * 2013-07-01 2017-03-14 Juniper Networks, Inc. Pluggable module for signal relay between communication cards

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101047538A (en) * 2006-03-31 2007-10-03 上海贝尔阿尔卡特股份有限公司 Seamless switch-over system of data link based on Ethernet exchange and its method
CN101895423A (en) * 2010-07-15 2010-11-24 华为技术有限公司 Data transmission method and system in Ethernet
US9594716B1 (en) * 2013-07-01 2017-03-14 Juniper Networks, Inc. Pluggable module for signal relay between communication cards
CN104168134A (en) * 2014-07-21 2014-11-26 上海寰创通信科技股份有限公司 Distributed wireless hotspot control system with uninterrupted data service after single-board abnormity
CN105357114A (en) * 2014-08-21 2016-02-24 杭州迪普科技有限公司 Distributed network equipment

Also Published As

Publication number Publication date
CN108768757A (en) 2018-11-06

Similar Documents

Publication Publication Date Title
CN108768757B (en) Fault processing method and device and distributed network equipment
EP2352253A1 (en) Method and apparatus for protecting link aggregation group of ethernet ring
CN106533736B (en) Network equipment restarting method and device
JP6724998B2 (en) Server device, cluster system, cluster control method and program
CN101729426B (en) Method and system for quickly switching between master device and standby device of virtual router redundancy protocol (VRRP)
CN105790825B (en) Method and device for hot backup of controller in distributed protection
WO2016086582A1 (en) Signal detection method and device
CN108804260B (en) SRIO system switching method and device
JPWO2008105031A1 (en) Cluster system and node switching method
CN110912815B (en) Network element ring formation preventing method, device, equipment and readable storage medium
CN105790965A (en) Method and device for link switching and line card
WO2016065751A1 (en) Method for recovering link communication, service line card and system
CN111698157A (en) Link management method, board card and switch
CN109120558B (en) Method and system for automatically eliminating single board port fault
CN110661599A (en) HA implementation method, device and storage medium between main node and standby node
JP5050014B2 (en) Network monitoring system
CN112422428B (en) Link state acquisition method and device, electronic equipment and readable storage medium
CN109347679A (en) Dual master control plate hot standby switch method, apparatus, access gateway and safety network system
KR100832543B1 (en) High availability cluster system having hierarchical multiple backup structure and method performing high availability using the same
CN114095462A (en) Fault tolerance method and system for SRIO communication system of radar processor
CN106789139B (en) Multipoint fault processing method and device
JP6762032B2 (en) Power receiving device and control method
CN114301763A (en) Distributed cluster fault processing method and system, electronic device and storage medium
CN109634787B (en) Distributed file system monitor switching method, device, equipment and storage medium
CN110321261B (en) Monitoring system and monitoring method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant