CN109861745B - Fault processing method and cluster router - Google Patents

Fault processing method and cluster router Download PDF

Info

Publication number
CN109861745B
CN109861745B CN201910002876.2A CN201910002876A CN109861745B CN 109861745 B CN109861745 B CN 109861745B CN 201910002876 A CN201910002876 A CN 201910002876A CN 109861745 B CN109861745 B CN 109861745B
Authority
CN
China
Prior art keywords
cascade port
port
cluster router
cascade
cable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910002876.2A
Other languages
Chinese (zh)
Other versions
CN109861745A (en
Inventor
谢龙
李军军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou H3C Technologies Co Ltd
Original Assignee
Hangzhou H3C Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou H3C Technologies Co Ltd filed Critical Hangzhou H3C Technologies Co Ltd
Priority to CN201910002876.2A priority Critical patent/CN109861745B/en
Publication of CN109861745A publication Critical patent/CN109861745A/en
Application granted granted Critical
Publication of CN109861745B publication Critical patent/CN109861745B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a fault processing method and a cluster router, which relate to the technical field of communication and comprise the following steps: receiving a port event of a cascading port in the cluster router; if the port event is an UP event, clearing the aging accumulation times of the cascade port and the error rates of a plurality of cables contained in the optical fiber connected with the cascade port; detecting the error rate of the cable corresponding to the cascade port under the condition that a preset detection condition is reached; if the detection result shows that the error rate of the cable is higher than the preset value, the cable is set to be in a DOWN state, and the invention actively checks the failed cable and carries out accurate action processing, thereby ensuring the stability and reliability of the data of the cluster router and further solving the technical problems of poor reliability and unstable data transmission of the existing cluster router.

Description

Fault processing method and cluster router
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a fault handling method and a cluster router.
Background
With the increasing popularity and development of the internet, the router device as the core of the network is facing new challenges, and the traditional router can no longer meet the capacity requirement of data exchange. The cluster router is a system formed by mutually and directly connecting a plurality of single routers.
Currently, a cluster router is mainly divided into a back-to-back cluster router and a Multi-frame cluster router, where frames are connected through a cascade port, where the cascade port refers to an optical fiber connector (Multi-fiber Push On) and includes a plurality of optical fibers therein. The cascade port converts the optical signal into an electrical signal and transmits the electrical signal to a switching chip fe (fabric element) of the single board. If optical fiber pollution occurs or optical fiber is loose, an optical fiber fault may be caused, and a packet loss phenomenon may occur continuously at a cascade port in a working state, which may seriously affect the performance of the cluster router, and may even cause a problem that the cascade port cannot enter the working state. No effective solution has been proposed to the above problems.
Disclosure of Invention
In view of this, the present invention provides a method and an apparatus for processing a failure of an interface in a cluster router, and a cluster router.
In a first aspect, an embodiment of the present invention provides a method for processing a failure of an interface in a cluster router, where the method is applied to a switch chip FE in the cluster router, and the method includes: receiving a port event of a cascading port in the cluster router; if the port event is an UP event, clearing the aging accumulated times of the cascade port and the error rates of a plurality of cables contained in the optical fiber connected with the cascade port; detecting the error rate of the internal cable corresponding to the cascade port under the condition of reaching a preset detection condition; and if the detection result is that the error rate of the cable is higher than a preset value, setting the cable to be in a DOWN state.
Further, the method further comprises: if the preset number of cables in the plurality of cables corresponding to the cascade port are set to be in the DOWN state, setting the cascade port to be in the DOWN state, and performing isolation processing on the cascade port.
Further, the isolating the cascade port includes: and after the cascade port is set to be in the DOWN state, deleting a routing table entry corresponding to the cascade port in a routing table of the cluster router.
Further, after isolating the cascading ports, the method further comprises: and if the cascade port is detected to be changed from the UP state to the DOWN state, setting the aging accumulated times of the cascade port as a first identifier, and executing isolation recovery operation on the internal cable, wherein the first identifier indicates that the detection on the cable of the optical fiber connected with the cascade port is finished.
Further, judging whether the cascade port meets a preset detection condition by the following method specifically includes: if the aging accumulated times of the cascade port is larger than or equal to a preset threshold value, determining that the preset detection condition is met; and/or determining that the preset detection condition is met if a preset time elapses from a time point when the aging accumulation frequency of the cascade port is cleared, the state of the cascade port continues to be an UP state, and the aging accumulation frequency of the cascade port continues to be a second identifier, where the second identifier indicates that detection of an internal cable of an optical fiber connected to the cascade port is started.
In a second aspect, the present invention discloses a cluster router, including: at least two subracks and an optical fiber connected between the subracks, the optical fiber comprising a plurality of cables;
the machine frames comprise SFUs, each SFU is provided with an exchange chip and a cascade port connected with the exchange chip, and the optical fibers are respectively inserted into the cascade ports of the two machine frames;
the switching chip is used for executing the method of any one of the above items.
Further, the cluster router is a back-to-back cluster router, and the back-to-back cluster router includes two wire clamp frames.
Further, the cluster router is a multi-frame cluster router, and the subrack includes: at least two wire clamp frames and a switching frame.
According to the method, the cable with the fault can be found in time, the cable is set to be in the DOWN state, the cable with the fault is actively checked, and accurate action processing is performed, so that stability and reliability of data transmission of the cluster router are guaranteed, and the technical problems that the existing cluster router is poor in reliability and unstable in data transmission are solved.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is a schematic diagram of a multi-box cluster router, according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a back-to-back cluster router according to an embodiment of the present invention;
fig. 3 is a flow chart of a fault handling method according to an embodiment of the invention.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
According to an embodiment of the present invention, an embodiment of a cluster router is provided.
Fig. 1 and fig. 2 are schematic diagrams of cluster routers according to an embodiment of the present invention. In the present application, the cluster router may be a back-to-back cluster router (as shown in fig. 2) or a multi-frame cluster router (as shown in fig. 1). When the cluster router is a back-to-back cluster router, the cluster router comprises two line Card frames (LCCs) (line Card sessions), and the two LCCs are directly connected through a cascade port; when the cluster router is a multi-frame cluster router, the cluster router comprises LCCs and switching frames (Fabric Central sessions), wherein FCC (fluid center) is arranged between the LCCs, and the LCCs are connected with the FCC through cascade ports. An exchange board SFU (switch Fabric Unit) of the LCC and an SFU of the FCC are provided with exchange chips FE (Fabric element), the FE is connected through a cascade port, and an optical fiber connected with the cascade port comprises a plurality of cables; the FE is used for the fault handling method provided by the application.
FIG. 1 is a schematic diagram of a multi-box cluster router, according to an embodiment of the present invention; fig. 2 is a schematic diagram of a back-to-back cluster router according to an embodiment of the present invention. The structure of the cluster router will be described in detail with reference to fig. 1 and 2.
As shown in fig. 1, a multi-box (one FCC + two LCCs in fig. 1) cluster router is shown, and as shown in fig. 2, a back-to-back cluster router is shown. As shown in fig. 1, the multi-frame cluster router includes one FCC and two LCCs, and the FCC is disposed between the two LCCs. As shown in fig. 2, a back-to-back cluster router includes two LCCs, and the two LCCs are connected.
In the back-to-back cluster router and the multi-frame cluster router, the connection between each Line Card (Line Card) and the FE13 in the data input direction forms the first-level connection of the switching plane, and the connection between each LC and the FE13 in the data output direction forms the third-level connection of the switching plane. As shown in fig. 1, in a multi-frame cluster router, the second stage of the switching plane is implemented in the FCC, FE2 in fig. 1. As shown in fig. 2, if the LCC2 in fig. 2 is a line card box in the data-in direction, the second stage of the switch plane is implemented in the LCC2, i.e., FE2 in fig. 2.
The interaction of flow data in the cluster router data channel mainly passes through the optic fibre between the frame, and each cascade port passes through the fiber connection, contained many cables in the optic fibre, the optic fibre is through one-to-one butt joint to the FE of photoelectric conversion module. When a user performs plugging operation on the cascade port or when a fault such as physical collision occurs on the cascade interface, uncertain factors such as pollution of the cascade port or loosening of cables may occur, and at this time, signals of part of cables on one cascade port are weakened. Signal degradation of a portion of the cable will cause the cascade port connected to it on FE to fail to go UP, or the cable will generate a Cyclic Redundancy Check (CRC). Such a failed data transmission link not only causes the cluster router to perform wrong traffic statistics, but also causes the cluster router to continuously lose packets and become unreliable.
To address this concern, the present invention provides a fault handling method, it being noted that the steps illustrated in the flowchart of the figure may be performed in a computer system such as a set of computer executable instructions, and that while a logical order is illustrated in the flowchart, in some cases the steps illustrated or described may be performed in an order different than presented herein.
Fig. 3 is a flowchart of a failure handling method according to an embodiment of the present invention, as shown in fig. 3, applied to an FE of a cluster router, where the method includes the following steps:
step S302, receiving a port event of a cascade port in the cluster router;
step S304, if the port event is an UP event, clearing the aging accumulation times of the cascade port and the error rate of a plurality of cables contained in the optical fiber connected with the cascade port;
and step S306, detecting the error rate of the cable connected with the cascade port under the condition that a preset detection condition is reached.
In the cluster router shown in fig. 1 and 2, the SFU may be provided with a photoelectric conversion unit (not shown in the figure), and the electrical signal output by the FE is converted into an optical signal by the photoelectric conversion unit and output to the cascade port. And then, transmitting the optical signal to a machine frame at the opposite end through the optical fiber plugged in the cascade port, thereby completing the forwarding of the cross-machine frame in the cluster router. And a plurality of pins of the FE are led out and then connected to a connector and are plugged into the photoelectric conversion unit through the connector, and then the photoelectric conversion unit is connected to the cascade port, wherein one pin corresponds to one cable on one optical fiber, and data transmitted and received on the pin is transmitted through the corresponding cable on the optical fiber.
A plurality of first registers may be integrated inside the FE, and each first register corresponds to one pin and is used to record an error rate when data is received at the pin. In addition, a plurality of second registers may be further integrated inside the FE, and each second register may correspond to one pin and be used to control the enabling of data input and output of the pin. For example, when the value in the second register is "0", the input and output of data on the corresponding pin are disabled, that is, the cable is set to the DOWN state; when the value in the second register is "1", the input/output of the data on the corresponding pin is enabled, that is, the cable is set to the UP state.
Port events of a tandem port include UP events and DOWN events. If the received port event is determined to be an UP event, the aging accumulated times of the cascade port can be eliminated; the error rate of the plurality of cables included in the optical fiber connected to the cascade port, that is, the number of CRCs generated by the plurality of cables of the cascade port, may also be cleared.
After receiving the UP event of the cascade port, the FE records the state of the cascade port, that is, records the state as the UP state. At this time, the aging identification information is marked as the first identification, i.e., the aging timing for the tandem port is started. The FE may record the aging accumulation times of each cascade port, and count the aging accumulation times once after the aging timer reaches a preset time. For example, when the value of the aging flag information is set to 1, it indicates that aging timing is started for the cascade port and fault detection is started for the cable of the optical fiber connected to the cascade port, and when the value of the aging flag information is set to 0, it indicates that aging timing for the cascade port is stopped and detection for the cable of the optical fiber connected to the cascade port is ended.
When an UP event of a cascade port is received, it indicates that the cascade port has connected an optical fiber, and the cascade port can form a cluster with a machine frame at the opposite end to start row data transmission. Then, the FE needs to set the value recorded in each second register to "1" to enable, allow the pin to perform data input and output, clear the error rate already recorded in each second register, and restart recording. During data transmission, virtual connection or disconnection may occur in a portion of the cable on the optical fiber. Due to the occurrence of such a situation, data packet loss may be caused, resulting in accumulation of the error rate.
In this embodiment, the preset detection condition is determined according to the number of aging accumulation times of the cascade port, the state (UP state or DOWN state) of the cascade port, and the aging identification information of the cascade port, which will be described in the following embodiments.
In this embodiment, when the cascade port reaches the preset detection condition, the error rate (i.e., the number of cyclic redundancy check codes CRC) of the cascade port cable may be detected and recorded in the corresponding first register.
Optionally, in this embodiment, whether the cascade port meets the preset detection condition may be determined in the following three ways.
The first method is as follows:
and if the aging accumulated times of the cascade port is greater than or equal to a preset threshold value, determining that the preset detection condition is met.
Specifically, in this embodiment, if the port event received in step S302 is an UP event, the timer is started after the accumulated aging number of the cascade port and the error rates of the plurality of cables included in the optical fiber connected to the cascade port are cleared. And if the timing time is up, acquiring the aging accumulated times of the cascade port, and judging whether the aging accumulated times of the cascade port is greater than or equal to a preset threshold value. If yes, determining that the preset detection condition is met, and detecting the error rate of the cable. And if the detection result is that the error rate of the cable is higher than a preset value, setting the cable to be in a DOWN state.
The second method comprises the following steps:
and if a preset time elapses from the moment of clearing the aging accumulation times of the cascade port, the state of the cascade port continues to be an UP state, and the aging identification information of the cascade port continues to be a second identification, determining that the preset detection condition is met, wherein the second identification indicates that the detection of the cable of the optical fiber connected to the cascade port is started.
Specifically, in this embodiment, if the port event received in step S302 is an UP event, the timer is started after the accumulated aging number of the cascade port and the error rates of the plurality of cables of the optical fiber connected to the cascade port are cleared. If the timing time is UP, the preset time is elapsed from the moment of clearing the aging accumulated times of the cascade port, whether the state of the cascade port is in an UP state continuously or not is judged, and the aging identification information of the cascade port is in a second identification continuously. And if so, determining that the preset detection condition is met, wherein the second identifier indicates that the detection of the cable of the optical fiber connected with the cascade port is started.
The third method comprises the following steps:
and if the aging accumulated times of the cascade port is greater than or equal to a preset threshold value, determining that the preset detection condition is met.
And if a preset time elapses from the moment of clearing the aging accumulation times of the cascade port, the state of the cascade port continues to be an UP state, and the aging identification information of the cascade port continues to be a second identification, determining that the preset detection condition is met, wherein the second identification indicates that the detection of the cable of the optical fiber connected to the cascade port is started. And if the detection result is that the error rate of the cable is higher than a preset value, setting the cable to be in a DOWN state.
Based on the three ways, the aging accumulation times, the timing time of the timer and the preset value of the error rate can be set according to empirical values, and the set range is not limited.
And step S308, if the detection result is that the error rate of the cable is higher than a preset value, setting the cable to be in a DOWN state.
Specifically, in this embodiment, if the port event received in step S302 is an UP event, the timer is started after the accumulated aging number of the cascade port and the error rates of the plurality of cables included in the optical fiber connected to the cascade port are cleared. If the timing time is up, acquiring the aging accumulated times of the cascade port, and judging whether the aging accumulated times of the cascade port is greater than or equal to a preset threshold, where the preset threshold may be configured according to actual requirements, for example, 3 times. And judging whether the state of the cascade port is continuously in an UP state or not and judging whether the aging identification information of the cascade port is continuously a second identification after the preset time elapses from the moment of clearing the aging accumulation times of the cascade port. And if so, determining that the preset detection condition is met, wherein the second identifier indicates that the detection of the cable of the optical fiber connected with the cascade port is started. And if the detection result is that the error rate of the cable is higher than a preset value, setting the cable to be in a DOWN state.
In an alternative embodiment, the method further comprises the steps of: if a preset number of cables in a plurality of cables of the optical fiber connected with the cascade port are set to be in a DOWN state, setting the cascade port to be in the DOWN state, and performing isolation processing on the cascade port.
A certain routing table entry is stored in the FE to instruct which port the received data will be forwarded through, and when a packet is received, if the output interface of the matched routing table entry is a cascade port, cross-chassis forwarding needs to be performed through the cascade port. When a plurality of cables in one optical fiber are already in a DOWN state in which data input and output cannot be performed, a situation of data blocking may occur when the cables are actually matched to the routing table entry for forwarding, and a transmission requirement cannot be met, the routing table entry related to the cascade port corresponding to the optical fiber may be deleted, so that the problem of data blocking caused by the fact that the received message is continuously matched to the routing table entry is avoided.
Because the optical fibers connected to the cascade ports include a plurality of cables and are connected between the two machine frames through the plurality of optical fibers, that is, redundancy is configured during connection to ensure that when a certain optical fiber is disconnected and the like, the cluster router is not seriously affected, but if the cascade ports cannot sense the cable fault, data can be continuously transmitted on the cable, and the loss of the data can exist all the time. At this time, on one optical fiber, if the number of cable faults contained therein increases, the loss of data continues to increase. Therefore, when the FE can determine that the number of faulty cables in the optical fiber connected to the cascade port of the FE reaches a preset number, the FE directly turns off the optical fiber, which can avoid such continuous data loss, thereby avoiding expansion of the influence.
The preset number may be set according to actual needs, for example, a fixed numerical value may be preset as the preset number according to the total number of the optical fibers connected to the cascade port; a proportional value, for example, 0.6 or 0.7, may also be preset, and if it is determined that the number of cables set to the DOWN state exceeds the total number of 0.6 cables (or the total number of 0.7 internal cables), the cascade port is isolated. In this embodiment, the determination process of the preset number is not specifically limited, and is used for setting according to actual needs.
It should be noted that, in this embodiment, after the isolation processing is performed on the cascade port, the timer is restarted, the timer waits for the timing time of the next period to arrive, and the processes described in the above step S302 to step S308 are repeated, which is not described again here.
It should be noted that, in this embodiment, if the received port event is an UP event, the timer may be started after a time delay, and after the timer is started, the above steps S306 and S308 are executed.
After delaying for a period of time, the timer is restarted, so that the accuracy of the action processing of the cascade port can be ensured when the cascade port is changed from the DOWN state to the UP state and the state of the cascade port is kept in a stable state, and meanwhile, the fault of the processing equipment in the operation process can be periodically monitored, and the reliability of the cluster equipment is ensured.
In summary, in this embodiment, after the cascade port of the cluster router changes from the DOWN state to the UP state and is in the stable UP state, whether a cable of the cascade port has a fault is periodically checked, and if it is found that a pin of an FE corresponding to the cable is in an enable and DOWN state, or a CRC (error rate of the cable) of data received on a certain pin is greater than a preset value, the cable may be set to the DOWN state.
By the processing mode, the problem of packet loss of the cluster router caused by continuous failure of cables corresponding to the cascade ports in the cluster router is effectively solved, and the method can ensure the stability and reliability of the cluster router.
In this embodiment, after performing the isolation processing on the cascade port, the method further includes:
if the cascade port is detected to be changed from the UP state to the DOWN state, the aging identification information of the cascade port is set to be a first identification, for example, the value of the aging identification information is set to be 1, and isolation recovery operation is performed on the cable. Therefore, when the user checks the optical fiber connection condition and restores the connection, the corresponding routing table entry can be generated again according to the connection with the machine frame of the opposite end, so as to avoid the problem that the user cannot perform data transmission with the machine frame of the opposite end according to the FE arranged in the SFU on the machine frame after repairing the optical fiber.
According to the method, the cable with the fault can be found in time, the cable is set to be in the DOWN state, the cable with the fault is actively checked, and accurate action processing is performed, so that stability and reliability of cluster router data are guaranteed, and the technical problems that an existing cluster router is poor in reliability and unstable in data transmission are solved.
In addition, in the description of the embodiments of the present invention, unless otherwise explicitly specified or limited, the terms "mounted," "connected," and "connected" are to be construed broadly, e.g., as meaning either a fixed connection, a removable connection, or an integral connection; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units is only one logical division, and there may be other divisions when actually implemented, and for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection of devices or units through some communication interfaces, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a non-volatile computer-readable storage medium executable by a processor. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
Finally, it should be noted that: the above-mentioned embodiments are only specific embodiments of the present invention, which are used for illustrating the technical solutions of the present invention and not for limiting the same, and the protection scope of the present invention is not limited thereto, although the present invention is described in detail with reference to the foregoing embodiments, those skilled in the art should understand that: any person skilled in the art can modify or easily conceive the technical solutions described in the foregoing embodiments or equivalent substitutes for some technical features within the technical scope of the present disclosure; such modifications, changes or substitutions do not depart from the spirit and scope of the embodiments of the present invention, and they should be construed as being included therein. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. A fault handling method is characterized in that the fault handling method is applied to a switching chip FE in a cluster router, the cluster router comprises at least two machine frames and optical fibers connected between the machine frames, the optical fibers comprise a plurality of cables, the machine frames comprise switching boards, the switching boards are provided with switching chips and cascade ports connected with the switching chips, and the optical fibers are respectively inserted into the cascade ports of the two machine frames; the method comprises the following steps:
receiving a port event of a cascading port in the cluster router;
if the port event is an UP event, clearing the aging accumulated times of the cascade port and the error rates of a plurality of cables contained in the optical fiber connected with the cascade port;
detecting the error rate of the cable corresponding to the cascade port under the condition that a preset detection condition is reached;
if the detection result is that the error rate of the cable is higher than a preset value, setting the cable to be in a DOWN state;
if a preset number of cables in the plurality of cables corresponding to the cascade port are set to be in a DOWN state, setting the cascade port to be in the DOWN state, and isolating the cascade port;
and if the cascade port is detected to be changed from the UP state to the DOWN state, setting the aging identification information of the cascade port as a first identification, and executing isolation recovery operation on the cable, wherein the first identification indicates that the detection on the cable of the optical fiber connected with the cascade port is finished.
2. The method of claim 1, wherein isolating the cascaded port comprises:
and after the cascade port is set to be in the DOWN state, deleting a routing table entry corresponding to the cascade port in a routing table of the cluster router.
3. The method according to claim 1, wherein determining whether the cascade port satisfies a preset detection condition includes:
if the aging accumulated times of the cascade port is larger than or equal to a preset threshold value, determining that the preset detection condition is met; and/or
And if a preset time elapses from the moment of clearing the aging accumulation times of the cascade port, the state of the cascade port continues to be an UP state, and the aging identification information of the cascade port continues to be a second identification, determining that the preset detection condition is met, wherein the second identification indicates that the detection of the cable of the optical fiber connected to the cascade port is started.
4. A cluster router, comprising: at least two subracks and an optical fiber connected between the subracks, the optical fiber comprising a plurality of cables;
the machine frames comprise exchange plates, exchange chips and cascade ports connected with the exchange chips are arranged on the exchange plates, and the optical fibers are respectively inserted into the cascade ports of the two machine frames;
the switch chip is used for executing the method of any one of the preceding claims 1 to 3.
5. The cluster router of claim 4, wherein the cluster router is a back-to-back cluster router, the back-to-back cluster router comprising two wire-clamping frames.
6. The cluster router of claim 4, wherein the cluster router is a multi-chassis cluster router, the chassis comprising: at least two wire clamp frames and a switching frame.
CN201910002876.2A 2019-01-02 2019-01-02 Fault processing method and cluster router Active CN109861745B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910002876.2A CN109861745B (en) 2019-01-02 2019-01-02 Fault processing method and cluster router

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910002876.2A CN109861745B (en) 2019-01-02 2019-01-02 Fault processing method and cluster router

Publications (2)

Publication Number Publication Date
CN109861745A CN109861745A (en) 2019-06-07
CN109861745B true CN109861745B (en) 2021-05-07

Family

ID=66893809

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910002876.2A Active CN109861745B (en) 2019-01-02 2019-01-02 Fault processing method and cluster router

Country Status (1)

Country Link
CN (1) CN109861745B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113037364B (en) * 2019-12-25 2022-06-24 华为技术有限公司 Method and equipment for service protection switching

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0854599A2 (en) * 1997-01-17 1998-07-22 Scientific-Atlanta, Inc. System and method for detecting, correcting and discarding corrupted data packets in a cable data delivery system
CN103959710A (en) * 2012-11-22 2014-07-30 华为技术有限公司 Method, device and system for backing up device information
CN103997422A (en) * 2014-05-05 2014-08-20 大唐移动通信设备有限公司 Fault processing method and device for IP interface board
CN104539461A (en) * 2015-01-09 2015-04-22 烽火通信科技股份有限公司 Device and method for inhibitory protocol port oscillation reporting
CN109039799A (en) * 2018-06-26 2018-12-18 新华三技术有限公司 Link state detection method, device and the network equipment
CN109039761A (en) * 2018-08-24 2018-12-18 新华三技术有限公司 Faulty link treating method and apparatus in clustered control channel

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1202599A3 (en) * 2000-10-31 2004-03-24 Chiaro Networks Ltd. System and method for routing IP packets using an optical core
CN101707536B (en) * 2009-11-25 2011-09-07 成都市华为赛门铁克科技有限公司 Fault detection method, line card and main control card
JP2013150176A (en) * 2012-01-19 2013-08-01 Hitachi Ltd Transmission system
CN102932062B (en) * 2012-10-24 2016-07-27 沈越 Intelligent optical fiber connects distribution system and adapter port state information identification method
JP6345390B2 (en) * 2013-05-23 2018-06-20 日本電信電話株式会社 Optical switch module and optical switching device
US9496951B2 (en) * 2013-08-08 2016-11-15 Mark E. Boduch Method and apparatus for performing path protection for rate-adaptive optics
CN103888335A (en) * 2013-12-05 2014-06-25 国家电网公司 Transformer station information network interconnection method
CN105681138A (en) * 2014-11-21 2016-06-15 中国移动通信集团广东有限公司 Network link networking method and system
CN106789753B (en) * 2015-11-24 2020-06-26 新华三技术有限公司 Line card frame, multi-frame cluster router and message processing method
CN207037789U (en) * 2017-06-26 2018-02-23 深圳回收宝科技有限公司 The data acquisition device and its reclaimer of portable electric appts
CN207369037U (en) * 2017-11-16 2018-05-15 广东电网有限责任公司韶关供电局 A kind of optical cable multichannel fibre core real-time monitoring device and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0854599A2 (en) * 1997-01-17 1998-07-22 Scientific-Atlanta, Inc. System and method for detecting, correcting and discarding corrupted data packets in a cable data delivery system
CN103959710A (en) * 2012-11-22 2014-07-30 华为技术有限公司 Method, device and system for backing up device information
CN103997422A (en) * 2014-05-05 2014-08-20 大唐移动通信设备有限公司 Fault processing method and device for IP interface board
CN104539461A (en) * 2015-01-09 2015-04-22 烽火通信科技股份有限公司 Device and method for inhibitory protocol port oscillation reporting
CN109039799A (en) * 2018-06-26 2018-12-18 新华三技术有限公司 Link state detection method, device and the network equipment
CN109039761A (en) * 2018-08-24 2018-12-18 新华三技术有限公司 Faulty link treating method and apparatus in clustered control channel

Also Published As

Publication number Publication date
CN109861745A (en) 2019-06-07

Similar Documents

Publication Publication Date Title
US7599301B2 (en) Communications network tap with heartbeat monitor
US20080117827A1 (en) Method and system for verifying connectivity of logical link
JP2867860B2 (en) Cross connect device
CN103404100B (en) The control method of relay, relay and network system
JPH11507780A (en) Method and system for identifying a fault location in a communication network
CN110677292A (en) Optical interface rate configuration method and device
US5522047A (en) Graceful insertion of a tree into a ring network
US6813240B1 (en) Method of identifying low quality links in a telecommunications network
CN109861745B (en) Fault processing method and cluster router
RU2602977C2 (en) Method for detecting correctness of optic fibre connection based on radio remote unit (rru) looped network and base band unit (bbu)
CN109120338A (en) Network failure locating method, device, equipment and medium
CN111669255A (en) Loopback test method, device, circuit, equipment and medium for communication equipment network port
JP2006285519A (en) Fault diagnostic method for data transfer system, data transfer system, and data storage device
CN103957130B (en) Fault detect and restoration methods and system
CN117527653A (en) Cluster heartbeat management method, system, equipment and medium
CN109039761B (en) Method and device for processing fault link in cluster control channel
CN115549775B (en) Processing method for optical signal transmission abnormality, optical transmission equipment and system
US11962401B2 (en) Reducing connection validation (CV) time in an optical node
CN106301882B (en) Cable with spare signal line for automatic fault recovery
CN114095462A (en) Fault tolerance method and system for SRIO communication system of radar processor
CN104348676A (en) Link detection method and device based on operation administration and maintenance
CN108134689B (en) Method and device for checking faults of cascade interface inside frame type equipment
CN104038365A (en) Diagnostic Port For Inter-switch And Node Link Testing In Electrical, Optical And Remote Loopback Modes
EP2180636A1 (en) Method for signaling a unidirectional failure of a packet-switched link
CN114024836B (en) STN network A ring breakpoint determination method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant