WO2019079961A1 - Method and device for determining shared risk link group

Method and device for determining shared risk link group

Info

Publication number
WO2019079961A1
Authority
WO
WIPO (PCT)
Prior art keywords
log
interface
identifies
shared risk
determining
Prior art date
Application number
PCT/CN2017/107448
Other languages
French (fr)
Chinese (zh)
Inventor
张耀坤
孙春霞
张大冬
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Priority to PCT/CN2017/107448 (WO2019079961A1)
Priority to CN201780009064.XA (CN109964450B)
Publication of WO2019079961A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/069 Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0604 Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0811 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823 Errors, e.g. transmission errors
    • H04L43/0829 Packet loss
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0852 Delays
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0852 Delays
    • H04L43/087 Jitter

Definitions

  • the present application relates to the field of communications technologies, and in particular, to a method and an apparatus for determining a Shared Risk Link Group (SRLG).
  • SRLG Shared Risk Link Group
  • Some of the independent physical interfaces/links in a router network share the same risk of failure: when one physical interface/link fails, the physical interfaces/links with the same risk of failure also go down. For example, some links pass through the same optical transmission equipment, and when a transmission failure occurs, these links go down simultaneously. These links can be referred to as shared risk links. If two critical paths in the network (such as the active and standby paths used for link protection) contain shared risk links that pass through the same transmission equipment, then when a transmission failure occurs, service protection at the router level fails completely, resulting in a complete service interruption.
  • the router industry already has the concept and function of a shared risk link group: which links belong to the same SRLG, and therefore have the same fault risk, can be set manually.
  • when Traffic Engineering (TE) selects paths for hot backup, based on a Constraint-based Routing Label Distribution Protocol (CR-LDP) label switched path (CR-LSP) or Fast Reroute (FRR), links that share risk are not used together as the primary and backup paths.
  • CR-LDP Constraint-based Routing Label Distribution Protocol
  • FRR Fast Reroute
  • however, which paths in the network belong to the same SRLG must be identified by humans; for various objective reasons, it is difficult for either the network service provider or the customer to find out which links have the same risk. Therefore, although the SRLG function is currently available, SRLGs must be identified and added manually. This is very difficult in practice, so two links that back each other up can easily end up sharing risk, and the risk to service transmission is high.
  • the technical problem to be solved by the embodiments of the present application is to provide a method and a device for determining a shared risk link group, so as to realize convenient and accurate determination of a shared risk link group and reduce service transmission risk.
  • an embodiment of the present application provides a method for determining a shared risk link group, which may include:
  • acquiring a first log set of one or more network devices, where the first log set includes a log that is not caused by a restart of the network device or by an interface board failure of the network device;
  • filtering the logs in the first log set according to a log filtering rule to obtain a second log set; and
  • obtaining a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface, and, when it is determined that the first log and the second log meet a preset condition, determining that the first link corresponding to the first interface and the second link corresponding to the second interface belong to the same shared risk link group.
  • by obtaining the first log set of one or more network devices in the network, the first log set can be filtered according to the log filtering rule, using log analysis at the Internet Protocol (IP) level, to obtain the logs of faults on the interfaces of the network devices; these logs form the second log set. This achieves automatic identification of transport-layer information. Determining the shared risk link groups in the second log set according to whether the preset condition is met spares the customer from identifying and entering a large number of SRLGs and avoids the inaccuracy of manual identification. Service deployment can also be adjusted dynamically according to the obtained SRLG set, which is of practical significance especially in scenarios where a shared risk link confirmation mechanism based on IP + optical cannot be deployed. The method is a useful supplement to the current SRLG function and can significantly improve the stability and security of network service transmission.
  • IP Internet Protocol
  • the log filtering rule includes at least one of the following:
  • the useful information of the transport layer can be obtained, which is beneficial to narrowing the data range of the SRLG set mining and identification, and improving the effectiveness and efficiency of the information analysis.
  • determining that the first log and the second log meet the preset condition includes any one of the following:
  • the first log identifies that the status of the first interface changes to up, the second log identifies that the status of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold; or
  • the first log identifies that the status of the first interface changes to down, the second log identifies that the status of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold; or
  • the first log identifies that the first interface has an error, the second log identifies that the second interface has an error, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold; or
  • the first log identifies that the first interface is operation, administration and maintenance (OAM) up, the second log identifies that the second interface is OAM up, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; or
  • the first log identifies that the first interface is OAM down, the second log identifies that the second interface is OAM down, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
  • the second log set includes a network device identifier, a log generation interface, a log generation time, and a log occurrence event.
  • the determining device can accurately know the location and type of the fault, thereby obtaining an accurate SRLG analysis and recognition result.
  • the method further includes:
  • the logs of any two interfaces in the second log set may be evaluated against the preset conditions in a manner similar to the first aspect.
  • the shared risk between the interfaces is thereby determined, so that a set of SRLGs can be obtained, which makes it convenient for users and network devices to avoid shared risk links.
  • the method further includes:
  • the determined shared risk link group is reported to the user equipment for alarming.
  • the determined shared risk link group is delivered to the network device, or the determined shared risk link group is delivered to the network device according to the indication of the user equipment.
  • an apparatus for determining a shared risk link group which may include:
  • a transceiver unit, configured to acquire a first log set of one or more network devices, where the first log set includes a log that is not caused by a restart of the network device or by an interface board failure of the network device;
  • a processing unit, configured to filter the logs in the first log set according to the log filtering rule to obtain a second log set, and obtain a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface, and, when it is determined that the first log and the second log meet a preset condition, determine that the first link corresponding to the first interface and the second link corresponding to the second interface belong to the same shared risk link group.
  • the log filtering rule includes at least one of the following:
  • when determining that the first log and the second log meet the preset condition, the processing unit is specifically configured to determine that any one of the following holds:
  • the first log identifies that the status of the first interface changes to up, the second log identifies that the status of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold; or
  • the first log identifies that the status of the first interface changes to down, the second log identifies that the status of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold; or
  • the first log identifies that the first interface has an error, the second log identifies that the second interface has an error, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold; or
  • the first log identifies that the first interface is operation, administration and maintenance (OAM) up, the second log identifies that the second interface is OAM up, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; or
  • the first log identifies that the first interface is OAM down, the second log identifies that the second interface is OAM down, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
  • the second log set includes a network device identifier, a log generation interface, a log generation time, and a log occurrence event.
  • the processing unit is further configured to:
  • the processing unit is further configured to:
  • the determined shared risk link group is reported to the user equipment for alarming.
  • the determined shared risk link group is delivered to the network device, or the determined shared risk link group is delivered to the network device according to the indication of the user equipment.
  • an apparatus for determining a shared risk link group which may include:
  • a processor, a memory, and a bus, where the processor and the memory are connected by the bus, the memory is configured to store a set of program code, and the processor is configured to invoke the program code stored in the memory and execute the method of the embodiments of the present application.
  • an embodiment of the present application provides an apparatus for determining a shared risk link group, where the apparatus includes: a main control board and an interface board.
  • the main control board includes: a first processor and a first memory.
  • the interface board includes: a second processor, a second memory, and an interface card.
  • the main control board is coupled to the interface board.
  • the first memory can be used to store program code, and the first processor is configured to call the program code in the first memory to perform the following operations:
  • acquire a first log set of one or more network devices, where the first log set includes a log that is not caused by a restart of the network device or by an interface board failure of the network device; filter the logs in the first log set according to a log filtering rule to obtain a second log set; and obtain a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface, and, when it is determined that the first log and the second log meet a preset condition, determine that the first link corresponding to the first interface and the second link corresponding to the second interface belong to the same shared risk link group.
  • the second memory can be used to store program code
  • the second processor is configured to call the program code in the second memory to perform the following operations:
  • trigger the interface card to send a log acquisition message to one or more network devices, so that the network devices return logs that are not caused by a restart of the network device or by an interface board failure of the network device.
  • an inter-process communication (IPC) control channel is established between the main control board and the interface board.
  • IPC inter-process communication
  • an embodiment of the present application provides a computer readable storage medium having instructions stored therein that, when run on a computer, implement the method in the first aspect or in any possible implementation of the first aspect.
  • FIG. 1 is a schematic structural diagram of a system for determining an SRLG in an SDN network according to an embodiment of the present application
  • FIG. 2 is a schematic flowchart of a method for determining an SRLG according to an embodiment of the present disclosure
  • FIG. 3 is a schematic flowchart of another method for determining an SRLG according to an embodiment of the present disclosure
  • FIG. 4 is a schematic flowchart of still another method for determining an SRLG according to an embodiment of the present disclosure
  • FIG. 5 is a schematic structural diagram of an apparatus for determining an SRLG according to an embodiment of the present disclosure
  • FIG. 6 is a schematic structural diagram of another apparatus for determining an SRLG according to an embodiment of the present disclosure.
  • FIG. 7 is a schematic structural diagram of another apparatus for determining an SRLG according to an embodiment of the present application.
  • FIG. 1 is a schematic diagram of a system architecture for determining an SRLG in a Software Defined Network (SDN) according to an embodiment of the present application. From top to bottom, the SDN architecture may be divided into five parts: application layer, northbound interface, control layer, southbound interface, and network device layer.
  • SDN Software Defined Network
  • Application layer: includes a variety of different services and applications. Through the programming interface provided by the control layer, the application layer programs the underlying devices to develop various business applications and enable business innovation.
  • Northbound interface: the interface connecting the application layer and the control layer, so that various SDN applications can easily call the underlying network resources.
  • Control layer: the core of the entire SDN network. It centrally manages all the devices in the network and regards the entire network as a unified resource pool, allocating resources flexibly and dynamically according to the different needs of users and the global network topology. Toward the lower layer, it communicates with the network device layer through standard protocols; toward the upper layer, it provides the application layer with control over network resources through an open interface.
  • Southbound interface: the interface connecting the control layer and the network device layer, used to control network device status and data traffic forwarding.
  • Network device layer: the hardware device layer, which contains all the physical devices in the network that make up the entire IT infrastructure, such as switches and routers.
  • the network device layer is mainly responsible for data processing, forwarding, and the like.
  • the control layer may include a controller, and the network device layer may include, but is not limited to, network devices such as routers. In addition to the routers shown in the figure, it may also include other network devices such as switches, gateway devices, and security devices.
  • Each router can be interconnected to form a router network topology that contains multiple paths.
  • the apparatus for determining the SRLG in the present application may be implemented by an SDN controller (SNC) or another back-end server in the SDN network; in an ordinary router network, it may be implemented by the main control Central Processing Unit (CPU) of each router.
  • SNC SDN controller
  • CPU Central Processing Unit
  • FIG. 2 is a schematic flowchart of a method for determining an SRLG according to an embodiment of the present disclosure
  • log data of a specified network device, or of all network devices in the network, within a preset time period may be acquired.
  • the preset time period may be one year or one month, and is not limited in any embodiment of the present application.
  • logs caused by a restart of the network device may be excluded, such as logs of a physical interface going up and/or down caused by a manual shutdown operation; logs such as a physical interface going up and/or down caused by an interface board failure of the network device may also be excluded.
  • S203: Obtain a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface; when it is determined that the first log and the second log meet the preset condition, determine that the first link corresponding to the first interface and the second link corresponding to the second interface belong to the same shared risk link group.
  • the log filtering rule includes at least one of the following:
  • Bidirectional Forwarding Detection (BFD) is a mechanism that provides fast, millisecond-level detection of link faults. BFD can be linked with upper-layer routing protocols to achieve fast convergence and ensure service continuity.
  • Ethernet OAM is a tool for monitoring network faults. It is mainly used to solve the link problems common in the last mile of Ethernet access. Users can monitor the link status between the two devices by enabling Ethernet OAM on two point-to-point devices.
  • Ethernet OAM can effectively improve the management and maintenance capabilities of Ethernet and ensure the stable operation of the network. Its main functions include:
  • Link performance monitoring: monitors various aspects of link performance, including measurement of packet loss, delay, and jitter, and statistics on various types of traffic.
  • The link is checked by sending detection packets, and the network administrator is notified when the link fails.
  • Loopback test: link faults are detected by looping back non-Ethernet-OAM packets.
  • in addition, during the log filtering in step S202, logs caused by a restart of the network device may be excluded, such as physical interface up and/or down logs caused by a manual shutdown operation; logs such as physical interface up and/or down logs caused by an interface board failure of the network device may also be excluded.
  • any one of the above log filtering rules may be used alone, or two or more may be used in combination. After filtering by the log filtering rules, the logs related to physical interfaces going up and/or down are obtained, and the SRLG can then be mined from them.
  • the logs in the second log collection can be sorted by time.
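The exclusion and time ordering described above could be sketched as follows. This is a minimal sketch, not the applicant's implementation: it assumes each log is a dictionary with hypothetical `time` and `cause` keys, and the cause labels `device_restart` and `board_failure` are illustrative rather than taken from any real device log format. Apart from router R14, its interface, and its timestamp, the sample entries are made up for illustration.

```python
from datetime import datetime

# Hypothetical cause labels for logs that should not be mined for SRLGs.
EXCLUDED_CAUSES = {"device_restart", "board_failure"}

def build_second_log_set(first_log_set):
    """Drop logs caused by device restarts or interface board failures,
    then sort the remaining logs by generation time."""
    kept = [log for log in first_log_set if log.get("cause") not in EXCLUDED_CAUSES]
    return sorted(kept, key=lambda log: log["time"])

first_log_set = [
    {"device": "R14", "interface": "GigabitEthernet0/8/7", "event": "down",
     "cause": "link", "time": datetime(2017, 6, 15, 17, 51, 30)},
    {"device": "R14", "interface": "GigabitEthernet0/8/1", "event": "down",
     "cause": "device_restart", "time": datetime(2017, 6, 15, 17, 40, 0)},
    {"device": "R15", "interface": "GigabitEthernet0/1/0", "event": "down",
     "cause": "link", "time": datetime(2017, 6, 15, 17, 51, 29)},
]
second_log_set = build_second_log_set(first_log_set)
```

Sorting by time up front means that later steps only need to compare each log with the few logs that follow it closely in time.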
  • the second log set may include, but is not limited to, a network device identifier, a log generation interface, a log generation time, and a log occurrence event.
  • for example, the record with sequence number 1 indicates that at 2017-06-15 17:51:30 a CRC fault occurred on interface GigabitEthernet0/8/7 of the network device, router R14, and an error was logged. If a similar fault occurs on another interface at the same time, the two interfaces may share risk.
  • the SRLG can be obtained from the log mining of each interface included in the log collection.
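One possible in-memory form of a record in the second log set is sketched below, using the four fields named above. The class name `LogRecord`, the field names, and the `error` event label are assumptions made for illustration; only the device, interface, and timestamp values come from the example in the text.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass(frozen=True)
class LogRecord:
    device_id: str   # network device identifier
    interface: str   # interface that generated the log
    time: datetime   # log generation time
    event: str       # event recorded by the log

    @property
    def link_key(self):
        """Identify the interface (and thus its link) across the network."""
        return (self.device_id, self.interface)

record = LogRecord(
    device_id="R14",
    interface="GigabitEthernet0/8/7",
    time=datetime.strptime("2017-06-15 17:51:30", "%Y-%m-%d %H:%M:%S"),
    event="error",  # hypothetical label for the CRC fault
)
```

Keying on the pair of device identifier and interface name keeps interfaces with the same name on different routers distinct.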
  • determining that the first log and the second log meet the preset condition includes any one of the following:
  • the first log identifies that the status of the first interface changes to up, the second log identifies that the status of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold; or
  • the first log identifies that the status of the first interface changes to down, the second log identifies that the status of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold; or
  • the first log identifies that the first interface has an error, the second log identifies that the second interface has an error, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold; or
  • the first log identifies that the first interface is operation, administration and maintenance (OAM) up, the second log identifies that the second interface is OAM up, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; or
  • the first log identifies that the first interface is OAM down, the second log identifies that the second interface is OAM down, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
  • the links corresponding to the two interfaces can be regarded as a possible shared risk link group and recorded in the SRLG set.
  • the “simultaneous” described in the embodiments of the present application is not the same time in the strict sense. Due to factors such as the transmission time of the line, there may be a very small time difference between the logs of the two physical interfaces/links. Therefore, in the present application, “simultaneously” includes both strictly the same time and two times that are very close together, with a small difference such as 500 milliseconds or 1 second.
  • the first threshold to the fifth threshold in the embodiment of the present application may be a small time difference or a value arbitrarily close to 0.
  • the specific values of the five thresholds may be the same or different, and are not limited in any embodiment of the present application.
  • the pseudo code of the preset condition may include the following parts:
  • when any of the above preset conditions is met, true is returned, and the first link (link A) and the second link (link B) are regarded as a possible shared risk link group and recorded in the SRLG set; in the default case, that is, when none of the above preset conditions is met, false is returned, and link A and link B are regarded as not sharing risk and need not be recorded in the SRLG set.
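The true/false behaviour described above could look like the following minimal sketch. It is not the pseudo code of the application: the function name `meets_preset_condition`, the event labels, and the concrete threshold values are assumptions chosen for illustration (the text only says that the five thresholds are small and may be equal or different).

```python
from datetime import datetime, timedelta

# Hypothetical event labels mapped to the first through fifth thresholds.
THRESHOLDS = {
    "up": timedelta(seconds=1),
    "down": timedelta(seconds=1),
    "error": timedelta(seconds=1),
    "oam_up": timedelta(milliseconds=500),
    "oam_down": timedelta(milliseconds=500),
}

def meets_preset_condition(event_a, time_a, event_b, time_b):
    """Return True if both logs record the same kind of event and were
    generated within that event's threshold of each other."""
    if event_a != event_b:
        return False
    threshold = THRESHOLDS.get(event_a)
    if threshold is None:  # default case: event type not covered
        return False
    return abs(time_a - time_b) < threshold

t = datetime(2017, 6, 15, 17, 51, 30)
same = meets_preset_condition("down", t, "down", t + timedelta(milliseconds=300))
apart = meets_preset_condition("down", t, "down", t + timedelta(seconds=5))
mixed = meets_preset_condition("up", t, "down", t)
```

Here `same` is the only case that returns true: both interfaces go down within 300 milliseconds of each other, while `apart` exceeds the threshold and `mixed` compares different event types.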
  • the service deployment can be dynamically adjusted according to the obtained SRLG set, which is of practical significance especially in scenarios where a shared risk link confirmation mechanism based on IP + optical cannot be deployed. It is a very useful supplement to the current SRLG function and can significantly improve the stability and security of network service transmission.
  • FIG. 3 is a schematic flowchart of another method for determining an SRLG according to an embodiment of the present disclosure.
  • steps S301-S303 are the same as steps S201-S203, and details are not described herein again.
  • the method further includes:
  • the processing method of the second log set including the logs of the two interfaces is provided.
  • the processing method is similar to the foregoing, and details are not described herein again.
  • the shared risk link groups that meet the preset conditions can be sequentially filtered out to generate a set of SRLGs.
  • a shared risk link group relationship table may be generated according to the comparison with the preset conditions.
  • the shared risk link group relationship table includes identifiers of two links that fail simultaneously and frequency of simultaneous failures.
  • the process of outputting the SRLG according to the second log set may include:
  • Input: the second log set.
  • Parameter: the detection time threshold time1, for example, 500 milliseconds or 1 second. Two physical interfaces whose up/down logs are generated within time1 of each other are considered to go up/down at the same time and may share risk.
  • Step 1: Traverse the second log set and output the SRLG relationship table.
  • Step 2: Traverse the SRLG relationship table and output all SRLG sets.
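The two traversal steps could be sketched as follows. This is a hedged illustration under stated assumptions: logs are given as `(interface_id, event, time)` tuples already sorted by time, the function names are hypothetical, and the grouping rule in step 2 (merging pairs that share an interface into one group, subject to a hypothetical `min_count` frequency cutoff) is an assumption, since the text does not specify how sets are formed from the table. The sample interface identifiers other than R14 are made up.

```python
from collections import Counter
from datetime import datetime, timedelta

def build_relation_table(logs, time1):
    """Step 1: count, for each pair of interfaces, how often they logged the
    same event within time1 of each other. `logs` must be sorted by time."""
    table = Counter()
    for i, (if_a, ev_a, t_a) in enumerate(logs):
        for if_b, ev_b, t_b in logs[i + 1:]:
            if t_b - t_a >= time1:
                break  # later logs are even further away in time
            if if_a != if_b and ev_a == ev_b:
                table[frozenset((if_a, if_b))] += 1
    return table

def extract_srlgs(table, min_count=1):
    """Step 2 (assumed grouping): merge pairs seen at least min_count times
    into connected groups, so A-B and B-C yield one group {A, B, C}."""
    groups = []
    for pair, count in table.items():
        if count < min_count:
            continue
        merged = set(pair)
        rest = []
        for group in groups:
            if group & merged:
                merged |= group
            else:
                rest.append(group)
        groups = rest + [merged]
    return groups

t0 = datetime(2017, 6, 15, 17, 51, 30)
ms = timedelta(milliseconds=1)
logs = [
    ("R14:GE0/8/7", "down", t0),
    ("R15:GE0/1/0", "down", t0 + 200 * ms),
    ("R16:GE0/2/0", "down", t0 + 400 * ms),
    ("R17:GE0/3/0", "down", t0 + 60000 * ms),
]
table = build_relation_table(logs, time1=timedelta(seconds=1))
srlgs = extract_srlgs(table)
```

Raising `min_count` would require a pair to fail together several times before it is treated as sharing risk, which is one way to use the frequency column of the relationship table.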
  • the following table gives one possible form of the SRLG relationship table:
  • FIG. 4 is a schematic flowchart of still another method for determining an SRLG according to an embodiment of the present disclosure.
  • steps S401-S406 are the same as steps S301-S306 in the foregoing embodiment, and details are not described herein again.
  • the method further includes:
  • the determined shared risk link group is delivered to the network device, or the determined shared risk link group is delivered to the network device after the user confirms.
  • after the physical links that may share transmission risk are determined by the method in the foregoing embodiments, they can be added to the SRLG set. The SRLG set can be presented to the customer for confirmation, or delivered automatically to the network devices, which eliminates the need for the customer to identify SRLGs manually.
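The two delivery modes described above (automatic delivery, or delivery after user confirmation) could be sketched as follows. The function `deliver_srlgs` and its callback parameters are hypothetical names introduced for illustration; how the set is actually sent to a device is outside the scope of this sketch.

```python
def deliver_srlgs(srlg_set, send_to_devices, confirm=None):
    """Deliver the SRLG set to the network devices. If a confirm callback is
    given, only the groups the user approves are delivered; otherwise the
    whole set is delivered automatically."""
    approved = [group for group in srlg_set if confirm is None or confirm(group)]
    if approved:
        send_to_devices(approved)
    return approved

delivered = []
srlg_set = [{"link A", "link B"}, {"link C", "link D"}]

# Automatic delivery: no user confirmation step.
auto = deliver_srlgs(srlg_set, delivered.append)

# Delivery after confirmation: the user rejects any group containing link C.
confirmed = deliver_srlgs(
    srlg_set, delivered.append, confirm=lambda group: "link C" not in group
)
```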
  • FIG. 5 is a schematic diagram of an apparatus for determining an SRLG according to an embodiment of the present disclosure; the apparatus may include:
  • the transceiver unit 100, configured to acquire a first log set of one or more network devices, where the first log set includes a log that is not caused by a restart of the network device or by an interface board failure of the network device;
  • the processing unit 200 is configured to filter the logs in the first log set according to the log filtering rule to obtain a second log set, and obtain the first log and the second log from the second log set, where the first The log is a log generated by the first interface, and the second log is a log generated by the second interface.
  • the log filtering rule includes at least one of the following:
  • when determining that the first log and the second log meet the preset condition, the processing unit 200 is specifically configured to determine that any one of the following holds:
  • the first log identifies that the status of the first interface changes to up, the second log identifies that the status of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold; or
  • the first log identifies that the status of the first interface changes to down, the second log identifies that the status of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold; or
  • the first log identifies that the first interface has an error, the second log identifies that the second interface has an error, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold; or
  • the first log identifies that the first interface is operation, administration and maintenance (OAM) up, the second log identifies that the second interface is OAM up, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; or
  • the first log identifies that the first interface is OAM down, the second log identifies that the second interface is OAM down, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
  • the second log set includes a network device identifier, a log generation interface, a log generation time, and a log occurrence event.
  • processing unit 200 is further configured to:
  • processing unit 200 is further configured to:
  • the determined shared risk link group is delivered to the network device, or the determined shared risk link group is delivered to the network device after the user confirms.
  • FIG. 6 is a schematic diagram of another apparatus for determining an SRLG according to an embodiment of the present disclosure.
  • the apparatus may include a processor 110 , a memory 120 , and a bus 130 .
  • the processor 110 and the memory 120 are connected by a bus 130; the memory 120 is used to store instructions, and the processor 110 is used to execute the instructions stored in the memory 120 to implement the steps in the methods corresponding to Figures 2 to 4 above.
  • the device may further include an input port 140 and an output port 150.
  • the processor 110, the memory 120, the input port 140 and the output port 150 can be connected by a bus 130.
  • the processor 110 is configured to execute instructions stored in the memory 120 to control the input port 140 to receive signals, and control the output port 150 to send signals to complete the steps performed by the controller in the above method.
  • the input port 140 and the output port 150 may be the same or different physical entities. When they are the same physical entity, they can be collectively referred to as input and output ports.
  • the memory 120 may be integrated in the processor 110 or may be provided separately from the processor 110.
  • the functions of the input port 140 and the output port 150 can be implemented by a transceiver circuit or by a dedicated transceiver chip.
  • the processor 110 can be implemented by a dedicated processing chip, a processing circuit, a processor, or a general purpose chip.
  • a device provided by an embodiment of the present application may be implemented by using a general-purpose computer.
  • the program code for the functions of the processor 110, the input port 140 and the output port 150 is stored in a memory, and the general purpose processor implements the functions of the processor 110, the input port 140 and the output port 150 by executing code in the memory.
  • FIG. 6 shows only one memory and processor for ease of illustration. In an actual controller, there may be multiple processors and memories.
  • the memory may also be referred to as a storage medium or a storage device, and the like.
  • an embodiment of the present application provides an apparatus 210 for determining an SRLG.
  • the device 210 includes a main control board 220 and an interface board 230.
  • the main control board 220 includes a processor 221 and a memory 222.
  • the interface board 230 includes a processor 231, a memory 232, and an interface card 233.
  • the main control board 220 and the interface board 230 are coupled.
  • the memory 222 can be used to store the program code of the main control board 220, and the processor 221 is used to call the program code in the memory 222 to perform the following operations:
  • acquire a first log set of one or more network devices, where the first log set includes logs that are not caused by a restart of the network device or by an interface board failure of the network device; filter the logs in the first log set according to a log filtering rule to obtain a second log set
  • obtain a first log and a second log from the second log set, where the first log is a log generated by the first interface and the second log is a log generated by the second interface; when it is determined that the first log and the second log meet a preset condition, determine that the first link corresponding to the first interface and the second link corresponding to the second interface belong to the same shared risk link group.
  • the memory 232 can be used to store program code of the interface board 230, and the processor 231 is used to call the program code in the memory 232 to perform the following operations:
  • trigger the interface card 233 to send a log acquisition message to one or more network devices, so that the network device returns logs that are not caused by a restart of the network device or by an interface board failure of the network device.
  • an inter-process communication (IPC) control channel is established between the main control board 220 and the interface board 230.
  • the device 210 can be an SDN controller, a router, or another back-end server.
  • the device 210 can implement the functions of the device for determining the SRLG in the foregoing method embodiments. For the specific implementation steps, refer to the foregoing method embodiments, and details are not described herein.
  • the processor may be a central processing unit (CPU), or it may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, etc.
  • the general-purpose processor may be a microprocessor or any conventional processor or the like.
  • the memory can include read only memory and random access memory and provides instructions and data to the processor.
  • a portion of the memory may also include a non-volatile random access memory.
  • the bus may also include a power bus, a control bus, and a status signal bus.
  • the various buses are labeled as buses in the figure.
  • each step of the above method may be completed by an integrated logic circuit of hardware in a processor or an instruction in a form of software.
  • the steps of the method disclosed in the embodiments of the present application may be performed directly by a hardware processor, or by a combination of hardware and software modules in the processor.
  • the software module can be located in a conventional storage medium such as random access memory, flash memory, read only memory, programmable read only memory or electrically erasable programmable memory, registers, and the like.
  • the storage medium is located in the memory, and the processor reads the information in the memory and combines the hardware to complete the steps of the above method. To avoid repetition, it will not be described in detail here.
  • the embodiment of the present application further provides a system including the foregoing device and one or more network devices.
  • the disclosed systems, devices, and methods may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the computer program product includes one or more computer instructions.
  • the computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable device.
  • the computer instructions can be stored in a computer readable storage medium or transferred from one computer readable storage medium to another; for example, the computer instructions can be transferred from one website, computer, server, or data center to another website, computer, server, or data center by wire (e.g., coaxial cable, optical fiber, digital subscriber line (DSL)) or wirelessly (e.g., infrared, radio, microwave).
  • the computer readable storage medium can be any available media that can be accessed by a computer or a data storage device such as a server, data center, or the like that includes one or more available media.
  • the usable medium may be a magnetic medium (eg, a floppy disk, a hard disk, a magnetic tape), an optical medium (eg, a DVD), or a semiconductor medium (such as a solid state disk (SSD)).

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

Disclosed in the embodiments of the present application are a method and device for determining a shared risk link group, the method comprising: acquiring a first log set of one or more network devices, the first log set comprising logs that are not generated due to a restart of the network device(s) or a failure of an interface board of the network device(s); filtering logs in the first log set according to a log filtering rule so as to obtain a second log set; and acquiring from the second log set a first log generated by a first interface and a second log generated by a second interface, and determining that a first link corresponding to the first interface and a second link corresponding to the second interface belong to a same shared risk link group when determining that the first log and the second log meet a preset condition. By employing the embodiments of the present application, a shared risk link group may be conveniently and accurately determined, thereby reducing the risks of service transmission.

Description

Method and device for determining shared risk link group
Technical field
The present application relates to the field of communications technologies, and in particular, to a method and an apparatus for determining a shared risk link group (SRLG).
Background
Some mutually independent physical interfaces/links in a router network share the same failure risk: when one of them fails, the other physical interfaces/links with the same failure risk go down at the same time. For example, some links traverse the same optical transmission, and when a transmission failure occurs, these links go down simultaneously. Such links can be referred to as shared risk links. If shared risk links exist on two critical paths in the network (such as the primary and backup paths used for link protection) and traverse the same transmission, a transmission failure completely defeats service protection at the router level and leads to a complete service interruption.
In the prior art, the router industry has the concept and function of a shared risk link group: which links belong to the same SRLG, and therefore have the same failure risk, can be set manually. When traffic engineering (TE) selects a path for hot standby, an exclusion calculation is performed for Constraint-based Routing Label Distribution Protocol (CR-LDP) label switched paths (CR-LSPs) or fast reroute (FRR) bypass tunnels, so that shared risk links are not used as the primary and backup paths respectively. However, which paths in the network belong to the same SRLG has to be identified manually; for various objective reasons, it is hard for either the network service provider or the customer to work out which links share transmission and therefore carry the same risk. Therefore, although the SRLG function exists today, it relies on manual identification and manual configuration, which is very difficult in practice and easily leaves two links that back each other up sharing a risk, so the risk to service transmission is high.
Summary of the invention
The technical problem to be solved by the embodiments of the present application is to provide a method and a device for determining a shared risk link group, so that a shared risk link group can be determined conveniently and accurately and the risk to service transmission is reduced.
In a first aspect, an embodiment of the present application provides a method for determining a shared risk link group, which may include:
acquiring a first log set of one or more network devices, where the first log set includes logs that are not caused by a restart of the network device or by an interface board failure of the network device;
filtering the logs in the first log set according to a log filtering rule to obtain a second log set; and
obtaining a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface, and, when it is determined that the first log and the second log meet a preset condition, determining that a first link corresponding to the first interface and a second link corresponding to the second interface belong to the same shared risk link group.
By acquiring the first log set of one or more network devices in the network, the first log set can be filtered at the Internet Protocol (IP) layer, by means of log analysis and according to the log filtering rule, to obtain the logs of the corresponding faults on network device interfaces and form the second log set; useful information about the transport layer is thus identified automatically. The shared risk link groups in the second log set are then determined according to whether the preset condition is met. This spares the customer the large effort of identifying SRLGs manually and avoids the inaccuracy of manual identification. In addition, service deployment can be adjusted dynamically according to the resulting set of SRLGs. This is of practical significance especially in scenarios where a shared risk link confirmation mechanism based on IP plus optical cannot be deployed; it is a very useful complement to the current SRLG function and can significantly improve the stability and security of network service transmission.
In a possible implementation, the log filtering rule includes at least one of the following:
filtering to obtain logs of a physical interface going up and/or down;
filtering to obtain logs of bit errors on a physical interface;
filtering to obtain logs of bidirectional forwarding detection (BFD) up and/or BFD down on a physical interface; and
filtering to obtain logs of operation, administration and maintenance (OAM) up and/or OAM down on a physical interface.
With these log filtering rules, useful information about the transport layer can be obtained, which narrows the range of data to be mined and identified for the SRLG set and improves the effectiveness and efficiency of the analysis.
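The filtering step can be pictured with a minimal Python sketch. Everything named here is an assumption made for illustration: the `LogEntry` record, the event labels such as `IF_DOWN`, and the `filter_logs` helper do not come from the application, and real device logs use vendor-specific formats that would need parsing first.

```python
from dataclasses import dataclass

# Hypothetical event labels standing in for the four filtering rules.
KEPT_EVENTS = {
    "IF_UP", "IF_DOWN",      # physical interface up / down
    "BIT_ERROR",             # bit errors on a physical interface
    "BFD_UP", "BFD_DOWN",    # bidirectional forwarding detection up / down
    "OAM_UP", "OAM_DOWN",    # operation, administration and maintenance up / down
}


@dataclass(frozen=True)
class LogEntry:
    device: str       # network device identifier
    interface: str    # interface that generated the log
    timestamp: float  # generation time, in seconds
    event: str        # log event label (hypothetical encoding)


def filter_logs(first_log_set, kept_events=KEPT_EVENTS):
    """Return the second log set: entries whose event matches a filtering rule."""
    return [entry for entry in first_log_set if entry.event in kept_events]


first_log_set = [
    LogEntry("R1", "GE0/0/1", 100.0, "IF_DOWN"),
    LogEntry("R1", "GE0/0/1", 101.0, "USER_LOGIN"),  # unrelated log, dropped
    LogEntry("R2", "GE0/0/3", 100.4, "BFD_DOWN"),
]
second_log_set = filter_logs(first_log_set)
```

Passing a smaller `kept_events` set corresponds to applying only some of the rules, which matches the "at least one of the following" wording.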
In a possible implementation, determining that the first log and the second log meet the preset condition includes:
determining that the first log and the second log satisfy at least one of the following conditions:
the first log identifies that the status of the first interface changes to up, the second log identifies that the status of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold;
the first log identifies that the status of the first interface changes to down, the second log identifies that the status of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold;
the first log identifies that the first interface has a bit error, the second log identifies that the second interface has a bit error, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold;
the first log identifies that the first interface is operation, administration and maintenance (OAM) up, the second log identifies that the second interface is OAM up, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; and
the first log identifies that the first interface is OAM down, the second log identifies that the second interface is OAM down, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
Determining the SRLG by the above conditions makes it possible to accurately identify links that share a risk; the result can serve as a reference for the customer or be sent directly to the network device as a basis for path selection.
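A minimal Python sketch of the five conditions follows. The `LogEntry` record, the event labels, and the threshold values are assumptions chosen for illustration; the application gives neither concrete threshold values nor a log encoding.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogEntry:
    device: str
    interface: str
    timestamp: float  # generation time, in seconds
    event: str        # hypothetical event label


# One threshold per condition (first to fifth); the values are placeholders.
THRESHOLDS = {
    "IF_UP": 1.0,
    "IF_DOWN": 1.0,
    "BIT_ERROR": 2.0,
    "OAM_UP": 1.0,
    "OAM_DOWN": 1.0,
}


def meets_preset_condition(first, second, thresholds=THRESHOLDS):
    """True if both logs report the same event type and their generation
    times differ by less than the threshold configured for that event."""
    if first.event != second.event:
        return False
    limit = thresholds.get(first.event)
    if limit is None:  # event type not covered by any condition
        return False
    return abs(first.timestamp - second.timestamp) < limit


log_a = LogEntry("R1", "GE0/0/1", 100.0, "IF_DOWN")
log_b = LogEntry("R2", "GE0/0/3", 100.4, "IF_DOWN")
log_c = LogEntry("R3", "GE0/0/2", 180.0, "IF_DOWN")
same_group = meets_preset_condition(log_a, log_b)       # 0.4 s apart
different_group = meets_preset_condition(log_a, log_c)  # 80 s apart
```

Keeping a separate threshold per event type mirrors the first to fifth thresholds in the text, so each one can be tuned independently.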
In a possible implementation, the second log set includes a network device identifier, a log generation interface, a log generation time, and a log event.
Carrying the network device identifier, the log generation interface, the log generation time, and the log event lets the determining device know exactly where a fault occurred and of what type, which yields an accurate SRLG analysis and identification result.
In a possible implementation, the method further includes:
obtaining a third log from the second log set, where the third log is a log generated by a third interface;
when it is determined that the first log and the third log meet the preset condition, determining that the first link corresponding to the first interface and a third link corresponding to the third interface belong to the same shared risk link group; and
when it is determined that the second log and the third log meet the preset condition, determining that the second link corresponding to the second interface and the third link corresponding to the third interface belong to the same shared risk link group.
When there are three or more interfaces, the shared risk between every two interfaces can be determined according to the preset condition in a manner similar to the first aspect, so that a set of SRLGs is obtained, which helps users and network devices avoid them.
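The pairwise determination over three or more interfaces can be sketched as below. Merging the pairwise results into groups with a union-find structure is an assumption of this sketch, not something the application specifies; the tuple layout, the link names, and the one-second window in `same_risk` are likewise hypothetical.

```python
from itertools import combinations


def group_links(logs, same_risk):
    """Group links whose logs pairwise satisfy same_risk(a, b).

    logs: iterable of (link_id, event, timestamp) tuples.
    Returns a list of sets, each holding the links of one inferred SRLG.
    """
    parent = {}

    def find(link):
        parent.setdefault(link, link)
        while parent[link] != link:
            parent[link] = parent[parent[link]]  # path halving
            link = parent[link]
        return link

    # Check every pair of logs and merge the groups of matching links.
    for a, b in combinations(list(logs), 2):
        if a[0] != b[0] and same_risk(a, b):
            parent[find(a[0])] = find(b[0])

    groups = {}
    for link in list(parent):
        groups.setdefault(find(link), set()).add(link)
    return [members for members in groups.values() if len(members) > 1]


def same_risk(a, b):
    # Placeholder condition: same event type within a 1-second window.
    return a[1] == b[1] and abs(a[2] - b[2]) < 1.0


logs = [
    ("R1:GE0/0/1", "IF_DOWN", 100.0),
    ("R2:GE0/0/3", "IF_DOWN", 100.4),
    ("R3:GE0/0/2", "IF_DOWN", 100.7),
    ("R4:GE0/0/9", "IF_DOWN", 250.0),  # isolated event, joins no group
]
srlgs = group_links(logs, same_risk)
```

Because union-find merges transitively, two links can land in the same group even if only their logs with a third link matched; an implementation that needs strictly pairwise evidence would keep the pair list instead.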
In a possible implementation, the method further includes:
reporting the determined shared risk link group to user equipment as an alarm; and
delivering the determined shared risk link group to the network device, or delivering the determined shared risk link group to the network device according to an indication from the user equipment.
In a second aspect, an embodiment of the present application provides an apparatus for determining a shared risk link group, which may include:
a transceiver unit, configured to acquire a first log set of one or more network devices, where the first log set includes logs that are not caused by a restart of the network device or by an interface board failure of the network device; and
a processing unit, configured to: filter the logs in the first log set according to a log filtering rule to obtain a second log set; obtain a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface; and, when it is determined that the first log and the second log meet a preset condition, determine that a first link corresponding to the first interface and a second link corresponding to the second interface belong to the same shared risk link group.
In a possible implementation, the log filtering rule includes at least one of the following:
filtering to obtain logs of a physical interface going up and/or down;
filtering to obtain logs of bit errors on a physical interface;
filtering to obtain logs of bidirectional forwarding detection (BFD) up and/or BFD down on a physical interface; and
filtering to obtain logs of operation, administration and maintenance (OAM) up and/or OAM down on a physical interface.
In a possible implementation, when determining that the first log and the second log meet the preset condition, the processing unit is specifically configured to:
determine that the first log and the second log satisfy at least one of the following conditions:
the first log identifies that the status of the first interface changes to up, the second log identifies that the status of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold;
the first log identifies that the status of the first interface changes to down, the second log identifies that the status of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold;
the first log identifies that the first interface has a bit error, the second log identifies that the second interface has a bit error, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold;
the first log identifies that the first interface is operation, administration and maintenance (OAM) up, the second log identifies that the second interface is OAM up, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; and
the first log identifies that the first interface is OAM down, the second log identifies that the second interface is OAM down, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
In a possible implementation, the second log set includes a network device identifier, a log generation interface, a log generation time, and a log event.
In a possible implementation, the processing unit is further configured to:
obtain a third log from the second log set, where the third log is a log generated by a third interface;
when it is determined that the first log and the third log meet the preset condition, determine that the first link corresponding to the first interface and a third link corresponding to the third interface belong to the same shared risk link group; and
when it is determined that the second log and the third log meet the preset condition, determine that the second link corresponding to the second interface and the third link corresponding to the third interface belong to the same shared risk link group.
In a possible implementation, the processing unit is further configured to:
report the determined shared risk link group to user equipment as an alarm; and
deliver the determined shared risk link group to the network device, or deliver the determined shared risk link group to the network device according to an indication from the user equipment.
In a third aspect, an embodiment of the present application provides an apparatus for determining a shared risk link group, which may include:
a processor, a memory, and a bus, where the processor and the memory are connected by the bus, the memory is configured to store a set of program code, and the processor is configured to invoke the program code stored in the memory to perform the steps in the first aspect or in any implementation of the first aspect of the embodiments of the present application.
In a fourth aspect, an embodiment of the present application provides an apparatus for determining a shared risk link group, where the apparatus includes a main control board and an interface board. The main control board includes a first processor and a first memory. The interface board includes a second processor, a second memory, and an interface card. The main control board is coupled to the interface board. The first memory can be used to store program code, and the first processor is configured to call the program code in the first memory to perform the following operations:
acquiring a first log set of one or more network devices, where the first log set includes logs that are not caused by a restart of the network device or by an interface board failure of the network device; filtering the logs in the first log set according to a log filtering rule to obtain a second log set; obtaining a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface; and, when it is determined that the first log and the second log meet a preset condition, determining that a first link corresponding to the first interface and a second link corresponding to the second interface belong to the same shared risk link group.
The second memory can be used to store program code, and the second processor is configured to call the program code in the second memory to perform the following operation:
triggering the interface card to send a log acquisition message to one or more network devices, so that the network device returns logs that are not caused by a restart of the network device or by an interface board failure of the network device.
In a possible implementation, an inter-process communication (IPC) control channel is established between the main control board and the interface board.
In a fifth aspect, an embodiment of the present application provides a computer-readable storage medium storing instructions that, when run on a computer, implement the method in the first aspect or in any possible implementation of the first aspect.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present application or in the background more clearly, the drawings used in the embodiments of the present application or in the background are described below.
FIG. 1 is a schematic diagram of a system architecture for determining an SRLG in an SDN network according to an embodiment of the present application;
FIG. 2 is a schematic flowchart of a method for determining an SRLG according to an embodiment of the present application;
FIG. 3 is a schematic flowchart of another method for determining an SRLG according to an embodiment of the present application;
FIG. 4 is a schematic flowchart of still another method for determining an SRLG according to an embodiment of the present application;
FIG. 5 is a schematic structural diagram of an apparatus for determining an SRLG according to an embodiment of the present application;
FIG. 6 is a schematic structural diagram of another apparatus for determining an SRLG according to an embodiment of the present application;
FIG. 7 is a schematic structural diagram of still another apparatus for determining an SRLG according to an embodiment of the present application.
具体实施方式Detailed ways
下面结合本申请实施例中的附图对本申请的实施例进行描述。The embodiments of the present application are described below in conjunction with the accompanying drawings in the embodiments of the present application.
本申请的说明书和权利要求书及上述附图中的术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "comprising" and "having", and any variations thereof, are intended to cover a non-exclusive inclusion in the specification and claims of the application. For example, a process, method, system, product, or device that comprises a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units not listed, or alternatively Other steps or units inherent to these processes, methods, products or equipment.
Referring to FIG. 1, FIG. 1 is a schematic diagram of a system architecture for determining an SRLG in a software-defined network (SDN) according to an embodiment of the present application. From top to bottom, the SDN architecture can be divided into five parts: the application layer, the northbound interface, the control layer, the southbound interface, and the network device layer.
Application layer: includes various services and applications. Through the programming interface provided by the control layer, the application layer programs the underlying devices to develop various service applications and enable service innovation.
Northbound interface: the interface connecting the application layer and the control layer, which allows SDN applications to conveniently invoke the underlying network resources.
Control layer: the core of the entire SDN network. It centrally manages all devices in the network, treats the entire network as a unified resource pool, and allocates resources flexibly and dynamically according to the different needs of users and the global network topology. Toward the lower layer, it communicates with the network device layer through standard protocols; toward the upper layer, it provides the application layer with control over network resources through open interfaces.
Southbound interface: the interface connecting the control layer and the network devices, which implements management and control of network device status and data traffic forwarding.
Network device layer: the hardware device layer, which contains all the physical devices in the network that make up the entire IT infrastructure, such as switches and routers. The network device layer is mainly responsible for data processing, forwarding, and the like.
In the embodiments of the present application, the control layer may include a controller, and the network device layer may include, but is not limited to, network devices such as routers. In addition to the routers shown in FIG. 1, it may also include other network devices such as switches, gateway devices and security devices. The routers may be interconnected to form a router network topology that contains multiple paths.
It should be noted that, in an SDN network, the apparatus that performs the SRLG determination of the present application may be implemented by an SDN controller (SNC) or another back-end server. In an ordinary router network, the processing may be performed by the main control central processing unit (CPU) of each router, with the different routers cooperating to complete it.
The method for determining an SRLG in the present application is described in detail below with reference to FIG. 2 to FIG. 3.
Referring to FIG. 2, FIG. 2 is a schematic flowchart of a method for determining an SRLG according to an embodiment of the present application. The method includes the following steps:
S201. Acquire a first log set of one or more network devices, where the first log set contains logs that are not generated by a restart of the network device or by an interface board fault of the network device.
Optionally, the log data of specified network devices, or of all network devices in the network, within a preset time period may be acquired. For example, the preset time period may be one year or one month, which is not limited in the embodiments of the present application.
Optionally, when the first log set is acquired, logs caused by a restart of a network device, such as physical interface up and/or down logs caused by a manual shutdown, may be excluded in order to reduce the amount of data to be analyzed. Logs caused by an interface board fault of a network device, such as the resulting physical interface up and/or down logs, may also be excluded.
S202. Filter the logs in the first log set according to a log filtering rule to obtain a second log set.
S203. Obtain a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface; when it is determined that the first log and the second log meet a preset condition, determine that a first link corresponding to the first interface and a second link corresponding to the second interface belong to the same shared risk link group.
Optionally, the log filtering rule includes at least one of the following:
filtering to obtain logs of a physical interface going up and/or down;
filtering to obtain logs of bit errors on a physical interface;
filtering to obtain logs of Bidirectional Forwarding Detection (BFD) up and/or BFD down of a physical interface; and
filtering to obtain logs of Operation, Administration and Maintenance (OAM) up and/or OAM down of a physical interface.
BFD is a bidirectional forwarding detection mechanism that provides millisecond-level detection and enables fast link detection. By working together with upper-layer routing protocols, BFD enables fast route convergence and helps ensure service continuity.
Ethernet OAM is a tool for monitoring network faults. At present it is mainly used to solve link problems that are common in the "last mile" of Ethernet access. By enabling the Ethernet OAM function on two devices connected point to point, a user can monitor the status of the link between the two devices.
Ethernet OAM can effectively improve the management and maintenance capabilities of Ethernet and help keep the network running stably. Its main functions include:
1. Link performance monitoring: monitoring various performance indicators of the link, including measurement of packet loss, delay and jitter, and statistics on various types of traffic.
2. Fault detection and alarm: probing link connectivity by sending detection packets, and notifying the network administrator promptly when the link fails.
3. Loopback test: detecting link faults through the loopback of non-Ethernet-OAM protocol packets.
Optionally, if various logs related to the network devices were acquired in step S201, the logs caused by a restart of a network device, such as physical interface up and/or down logs caused by a manual shutdown, may instead be excluded during the log filtering in step S202. Logs caused by an interface board fault of a network device, such as the resulting physical interface up and/or down logs, may also be excluded at this stage.
It should be noted that any one of the above log filtering rules may be used alone, any two or more may be used in combination, or all of them may be used. After filtering according to the log filtering rules, the logs of the devices related to physical interface up and/or down events are obtained, and the SRLG is then mined from them.
Optionally, for ease of viewing and mining, the logs in the second log set may be sorted by time. The second log set may include, but is not limited to, the network device identifier, the interface on which the log occurred, the time at which the log occurred, and the event reported by the log.
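The filtering and time ordering described above can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the implementation of the application: the names `Event`, `LogRecord`, `RULES` and `filter_logs`, the event labels, and the grouping of rules by name are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum


class Event(Enum):
    # Event kinds named by the log filtering rules. OTHER is a hypothetical
    # label standing for any log that no rule selects.
    IF_UP = "if_up"
    IF_DOWN = "if_down"
    BIT_ERROR = "bit_error"
    BFD_UP = "bfd_up"
    BFD_DOWN = "bfd_down"
    OAM_UP = "oam_up"
    OAM_DOWN = "oam_down"
    OTHER = "other"


@dataclass(frozen=True)
class LogRecord:
    device: str      # network device identifier, e.g. "R14"
    interface: str   # interface on which the log occurred
    time: datetime   # time at which the log occurred
    event: Event     # event reported by the log


# Each filtering rule keeps a subset of event kinds (rule names are assumed).
RULES = {
    "physical": {Event.IF_UP, Event.IF_DOWN},
    "bit_error": {Event.BIT_ERROR},
    "bfd": {Event.BFD_UP, Event.BFD_DOWN},
    "oam": {Event.OAM_UP, Event.OAM_DOWN},
}


def filter_logs(first_set, rule_names=tuple(RULES)):
    """Apply the selected rules and return the second log set sorted by time."""
    wanted = set().union(*(RULES[name] for name in rule_names))
    kept = [log for log in first_set if log.event in wanted]
    return sorted(kept, key=lambda log: log.time)
```

Calling `filter_logs(first_set)` applies all four rules, while `filter_logs(first_set, ["bit_error"])` applies a single rule, mirroring the statement that the rules may be used alone or in combination.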
The following table gives one possible form of the second log set:
[Table image: Figure PCTCN2017107448-appb-000001]
For example, the data in row 1 indicates that on the network device, router R14, a CRC fault occurred on interface GigabitEthernet0/8/7 at 2017-06-15 17:51:30, resulting in bit errors. If a similar fault occurs on another interface at the same time, the two interfaces may share a risk.
After the second log set is obtained, the SRLG can be mined from the logs of the interfaces that it contains.
Optionally, taking the first log and the second log contained in the second log set as an example, determining that the first log and the second log meet the preset condition includes:
determining that the first log and the second log meet at least one of the following conditions:
the first log indicates that the state of the first interface changes to up, the second log indicates that the state of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold;
the first log indicates that the state of the first interface changes to down, the second log indicates that the state of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold;
the first log indicates that bit errors occur on the first interface, the second log indicates that bit errors occur on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold;
the first log indicates OAM up on the first interface, the second log indicates OAM up on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; and
the first log indicates OAM down on the first interface, the second log indicates OAM down on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
Any one of the above preset conditions may be used alone, or several may be used together.
It should be noted that when two interfaces experience the same fault at the same time, the links corresponding to the two interfaces may be regarded as a possible shared risk link group and recorded in the SRLG set. When two interfaces experience different faults at the same time, the links corresponding to the two interfaces may also be regarded as a possible shared risk link group and recorded in the SRLG set, especially when different faults occur simultaneously many times.
It should also be noted that "at the same time" in the embodiments of the present application does not mean the same instant in the strict sense. Because of factors such as line transmission time, there may be a very small time difference between two physical interfaces or links. Therefore, "at the same time" in the present application covers both the strictly identical instant and two instants that are very close together, separated by a small time difference such as 500 milliseconds or 1 second. The first to fifth thresholds in the embodiments of the present application may each be a small time difference or a value arbitrarily close to 0. The specific values of the five thresholds may be the same or different, which is not limited in the embodiments of the present application.
Optionally, the pseudo code of the preset condition may include the following parts:
[Pseudo code image: Figure PCTCN2017107448-appb-000002]
That is, when one of the cases in the above preset condition occurs, true is returned, and the first link (link A) and the second link (link B) are regarded as a possible shared risk link group and recorded in the SRLG set. In the default case, that is, when none of the cases in the above preset condition is met, false is returned, and link A and link B are regarded as not forming a shared risk link group and need not be recorded in the SRLG set.
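The true/false decision described above can be illustrated with a short Python sketch. It is only a sketch under assumptions: the `Log` class, the event labels, the function name `meets_preset_condition` and the threshold values are hypothetical, and it covers only the five same-event conditions listed earlier, not the case of different simultaneous faults.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Log:
    interface: str   # e.g. "R14.GigabitEthernet0/8/7"
    time: datetime   # generation time of the log
    event: str       # "up", "down", "bit_error", "oam_up" or "oam_down"


# First to fifth thresholds, one per event kind. The values are illustrative.
THRESHOLDS = {
    "up": timedelta(milliseconds=500),
    "down": timedelta(milliseconds=500),
    "bit_error": timedelta(seconds=1),
    "oam_up": timedelta(seconds=1),
    "oam_down": timedelta(seconds=1),
}


def meets_preset_condition(first: Log, second: Log) -> bool:
    """Return True when both logs report the same event within its threshold."""
    if first.interface == second.interface:
        return False  # a link is not compared with itself
    if first.event != second.event:
        return False  # default case in this sketch
    threshold = THRESHOLDS.get(first.event)
    if threshold is None:
        return False  # event kind not covered by any condition
    return abs(first.time - second.time) < threshold
```

Keeping one threshold per event kind reflects the statement that the five thresholds may be equal or different.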
By acquiring the first log set of one or more network devices in the network, the first log set can be filtered at the IP layer, by means of log analysis and according to the log filtering rule, to obtain the logs of the corresponding faults of the network device interfaces, which form the second log set. Useful transport-layer information is thereby identified automatically. The shared risk link groups in the second log set are then determined according to whether the preset condition is met, which spares the customer the large effort of identifying SRLGs manually and avoids the inaccuracy of manual identification. In addition, service deployment can be adjusted dynamically according to the obtained SRLG set. This is of practical significance particularly at sites where a shared risk link confirmation mechanism based on IP plus optical cannot be deployed. It is a useful supplement to the current SRLG function and can significantly improve the stability and security of network service transmission.
Referring to FIG. 3, FIG. 3 is a schematic flowchart of another method for determining an SRLG according to an embodiment of the present application. In this embodiment, steps S301 to S303 are the same as steps S201 to S203 and are not described again here. After step S303, the method further includes:
S304. Obtain a third log from the second log set, where the third log is a log generated by a third interface.
S305. When it is determined that the first log and the third log meet the preset condition, determine that the first link corresponding to the first interface and a third link corresponding to the third interface belong to the same shared risk link group.
S306. When it is determined that the second log and the third log meet the preset condition, determine that the second link corresponding to the second interface and the third link corresponding to the third interface belong to the same shared risk link group.
The above describes the processing when the second log set includes logs of more than two interfaces. When a fourth log also exists, the processing is similar and is not described again here. By comparing the logs of the interfaces two by two against the preset condition, the shared risk link groups that meet the preset condition can be screened out one by one to generate an SRLG set.
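The two-by-two comparison described above extends naturally to any number of logs. The following Python sketch assumes that each log is a dictionary with hypothetical keys `interface`, `event` and `time` (in seconds), and uses a simplified stand-in predicate, `same_event_within`, in place of the full preset condition.

```python
from itertools import combinations


def shared_risk_pairs(logs, meets_condition):
    """Compare the logs two by two and collect the interface pairs that match."""
    pairs = set()
    for first, second in combinations(logs, 2):
        if meets_condition(first, second):
            pairs.add(frozenset((first["interface"], second["interface"])))
    return pairs


def same_event_within(seconds):
    """Build a simplified predicate: same event, timestamps closer than `seconds`."""
    def check(first, second):
        return (first["interface"] != second["interface"]
                and first["event"] == second["event"]
                and abs(first["time"] - second["time"]) < seconds)
    return check
```

With three logs, `combinations` yields the pairs (first, second), (first, third) and (second, third), which corresponds to the checks in steps S303, S305 and S306.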
Optionally, during the screening, a shared risk link group relationship table may be generated according to the results of the comparison against the preset condition.
A set of shared risk link groups is then output according to the information in the shared risk link group relationship table.
The shared risk link group relationship table includes the identifiers of two links that fail at the same time and the number of times they fail at the same time.
Specifically, the process of outputting the SRLG set according to the second log set may include:
Input: the second log set;
Detection time threshold time1: for example, 500 milliseconds or 1 second. Two physical interfaces that go up or down (or the like) together within this time are considered to possibly share a transmission risk;
Output: the set of shared risk link groups (SRLG set).
Step 1: Traverse the second log set and output the SRLG relationship table.
[Image: Figure PCTCN2017107448-appb-000003]
Step 2: Traverse the SRLG relationship table and output all SRLG sets.
The following table gives one possible form of the SRLG relationship table:
[Table image: Figure PCTCN2017107448-appb-000004]
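Step 1, building the relationship table from the second log set, might look like the following Python sketch. The tuple layout `(time_in_seconds, interface, event)`, the function name `build_relation_table` and the choice to count only identical events are assumptions made for illustration. The parameter `time1` plays the role of the detection time threshold.

```python
from collections import Counter


def build_relation_table(logs, time1=1.0):
    """Count how often two interfaces report the same event within time1 seconds.

    `logs` is a list of (time_in_seconds, interface, event) tuples.
    """
    table = Counter()
    ordered = sorted(logs)  # sort by time first
    for i, (t_a, if_a, ev_a) in enumerate(ordered):
        for t_b, if_b, ev_b in ordered[i + 1:]:
            if t_b - t_a >= time1:
                break  # logs are time-ordered, so later ones are further away
            if if_a != if_b and ev_a == ev_b:
                table[tuple(sorted((if_a, if_b)))] += 1
    return table
```

Each key of the returned table is a pair of link identifiers and each value is the number of simultaneous events, matching the two kinds of information the relationship table is said to hold. An interface that flaps several times inside one window would be counted more than once in this sketch.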
Examples of shared risk link groups in the output SRLG set are as follows:
{R16.GigabitEthernet0/8/1, R11.GigabitEthernet1/1/0, R1.GigabitEthernet12/0/1}, which indicates that the links corresponding to the above three interfaces share a risk.
Another example is {R3.GigabitEthernet0/3/0, R9.GigabitEthernet9/0/11}, which indicates that the links corresponding to the above two interfaces share a risk.
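Step 2, turning the relationship table into SRLG sets, can be sketched in Python as below. The source does not state how pairs are merged into larger groups, so the transitive merging with a union-find structure, the `min_count` frequency cut-off and the function name `srlg_sets` are all assumptions of this sketch.

```python
def srlg_sets(relation_table, min_count=1):
    """Merge link pairs from the relationship table into shared risk link groups."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Union the two links of every pair that occurs often enough.
    for (link_a, link_b), count in relation_table.items():
        if count >= min_count:
            parent[find(link_a)] = find(link_b)

    # Collect the links under their group representative.
    groups = {}
    for link in list(parent):
        groups.setdefault(find(link), set()).add(link)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])
```

Under these assumptions, a table in which R16.GigabitEthernet0/8/1 pairs with R11.GigabitEthernet1/1/0, and R11.GigabitEthernet1/1/0 pairs with R1.GigabitEthernet12/0/1, produces one group of three links, as in the first example above.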
Referring to FIG. 4, FIG. 4 is a schematic flowchart of still another method for determining an SRLG according to an embodiment of the present application. In this embodiment, steps S401 to S406 are the same as steps S301 to S306 in the embodiment shown in FIG. 3 and are not described again here. After step S406, the method further includes:
S407. Report the determined shared risk link group to the user as an alarm.
S408. Deliver the determined shared risk link group to the network device, or deliver the determined shared risk link group to the network device after the user confirms it.
That is, after the physical links that may share a transmission risk have been mined and determined by the methods in the above embodiments, they can be added to the SRLG set. The SRLG set can either be presented to the customer for decision and confirmation or be delivered automatically to the network devices, which removes the need for manual identification by the customer.
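The two delivery modes of S408 can be illustrated with a small Python sketch. The function name `handle_srlg` and the callback parameters `report`, `deliver` and `confirm` are hypothetical. How alarms are raised and how groups are pushed to devices is not specified in the source and is left to the callbacks.

```python
def handle_srlg(groups, report, deliver, confirm=None):
    """Report each group to the user, then deliver it, optionally after confirmation."""
    delivered = []
    for group in groups:
        report(group)  # S407: raise an alarm to the user
        # S408: deliver directly, or only after the user confirms
        if confirm is None or confirm(group):
            deliver(group)
            delivered.append(group)
    return delivered
```

Passing `confirm=None` models automatic delivery, while passing a function models delivery after user confirmation.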
Referring to FIG. 5, FIG. 5 is a schematic structural diagram of an apparatus for determining an SRLG according to an embodiment of the present application. The apparatus may include:
a transceiver unit 100, configured to acquire a first log set of one or more network devices, where the first log set contains logs that are not generated by a restart of the network device or by an interface board fault of the network device; and
a processing unit 200, configured to: filter the logs in the first log set according to a log filtering rule to obtain a second log set; obtain a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface; and, when it is determined that the first log and the second log meet a preset condition, determine that a first link corresponding to the first interface and a second link corresponding to the second interface belong to the same shared risk link group.
Optionally, the log filtering rule includes at least one of the following:
filtering to obtain logs of a physical interface going up and/or down;
filtering to obtain logs of bit errors on a physical interface;
filtering to obtain logs of BFD up and/or BFD down of a physical interface; and
filtering to obtain logs of OAM up and/or OAM down of a physical interface.
Optionally, when determining that the first log and the second log meet the preset condition, the processing unit 200 is specifically configured to:
determine that the first log and the second log meet at least one of the following conditions:
the first log indicates that the state of the first interface changes to up, the second log indicates that the state of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold;
the first log indicates that the state of the first interface changes to down, the second log indicates that the state of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold;
the first log indicates that bit errors occur on the first interface, the second log indicates that bit errors occur on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold;
the first log indicates OAM up on the first interface, the second log indicates OAM up on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; and
the first log indicates OAM down on the first interface, the second log indicates OAM down on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
Optionally, the second log set includes the network device identifier, the interface on which the log occurred, the time at which the log occurred, and the event reported by the log.
Optionally, the processing unit 200 is further configured to:
obtain a third log from the second log set, where the third log is a log generated by a third interface;
when it is determined that the first log and the third log meet the preset condition, determine that the first link corresponding to the first interface and a third link corresponding to the third interface belong to the same shared risk link group; and
when it is determined that the second log and the third log meet the preset condition, determine that the second link corresponding to the second interface and the third link corresponding to the third interface belong to the same shared risk link group.
Optionally, the processing unit 200 is further configured to:
report the determined shared risk link group to the user as an alarm; and
deliver the determined shared risk link group to the network device, or deliver the determined shared risk link group to the network device after the user confirms it.
Referring to FIG. 6, FIG. 6 is a schematic structural diagram of another apparatus for determining an SRLG according to an embodiment of the present application. As shown in FIG. 6, the apparatus may include a processor 110, a memory 120 and a bus 130. The processor 110 and the memory 120 are connected through the bus 130. The memory 120 is configured to store instructions, and the processor 110 is configured to execute the instructions stored in the memory 120 to implement the steps of the methods corresponding to FIG. 2 to FIG. 4 above.
Further, the apparatus may also include an input port 140 and an output port 150. The processor 110, the memory 120, the input port 140 and the output port 150 may be connected through the bus 130.
The processor 110 is configured to execute the instructions stored in the memory 120 to control the input port 140 to receive signals and to control the output port 150 to send signals, thereby completing the steps performed by the controller in the above methods. The input port 140 and the output port 150 may be the same physical entity or different physical entities. When they are the same physical entity, they may be collectively referred to as an input/output port. The memory 120 may be integrated in the processor 110 or may be provided separately from the processor 110.
As one implementation, the functions of the input port 140 and the output port 150 may be implemented by a transceiver circuit or a dedicated transceiver chip. The processor 110 may be implemented by a dedicated processing chip, a processing circuit, a processor or a general-purpose chip.
As another implementation, the apparatus provided in the embodiments of the present application may be implemented using a general-purpose computer. That is, program code implementing the functions of the processor 110, the input port 140 and the output port 150 is stored in the memory, and a general-purpose processor implements the functions of the processor 110, the input port 140 and the output port 150 by executing the code in the memory.
For the concepts, explanations, detailed descriptions and other steps of the apparatus that relate to the technical solutions provided in the embodiments of the present application, refer to the descriptions of these contents in the foregoing methods or other embodiments. Details are not repeated here.
Those skilled in the art will understand that, for ease of description, FIG. 6 shows only one memory and one processor. In an actual controller, there may be multiple processors and memories. The memory may also be referred to as a storage medium or a storage device, which is not limited in the embodiments of the present application.
Referring to FIG. 7, an embodiment of the present application provides an apparatus 210 for determining an SRLG. The apparatus 210 includes a main control board 220 and an interface board 230. The main control board 220 includes a processor 221 and a memory 222. The interface board 230 includes a processor 231, a memory 232 and an interface card 233. The main control board 220 and the interface board 230 are coupled.
The memory 222 may be used to store the program code of the main control board 220, and the processor 221 is configured to call the program code in the memory 222 to perform the following operations:
acquiring a first log set of one or more network devices, where the first log set contains logs that are not generated by a restart of the network device or by an interface board fault of the network device; filtering the logs in the first log set according to a log filtering rule to obtain a second log set; obtaining a first log and a second log from the second log set, where the first log is a log generated by a first interface and the second log is a log generated by a second interface; and, when it is determined that the first log and the second log meet a preset condition, determining that a first link corresponding to the first interface and a second link corresponding to the second interface belong to the same shared risk link group.
The memory 232 may be used to store the program code of the interface board 230, and the processor 231 is configured to call the program code in the memory 232 to perform the following operation:
triggering the interface card 233 to send a log acquisition message to one or more network devices, so as to trigger the network devices to return logs that are not generated by a restart of the network device or by an interface board fault of the network device.
In a possible implementation, an inter-process communication (IPC) control channel is established between the main control board 220 and the interface board 230.
The apparatus 210 may be an SDN controller, a router or another back-end server. The apparatus 210 can implement the functions of the apparatus for determining an SRLG in the foregoing method embodiments. For the specific steps performed, refer to the foregoing method embodiments. Details are not repeated here.
It should be understood that, in the embodiments of this application, the processor may be a central processing unit (CPU), or may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or another programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
The memory may include read-only memory and random access memory, and provides instructions and data to the processor. A part of the memory may also include non-volatile random access memory.
In addition to a data bus, the bus may include a power bus, a control bus, a status signal bus, and the like. For clarity, however, all of these buses are labeled simply as the bus in the figure.
In an implementation, each step of the foregoing method may be completed by an integrated logic circuit of hardware in the processor or by instructions in the form of software. The steps of the method disclosed in the embodiments of this application may be performed directly by a hardware processor, or by a combination of hardware and software modules in the processor. The software module may reside in a storage medium that is mature in the art, such as random access memory, flash memory, read-only memory, programmable read-only memory, electrically erasable programmable memory, or a register. The storage medium is located in the memory; the processor reads the information in the memory and completes the steps of the foregoing method in combination with its hardware. To avoid repetition, details are not described here.
According to the method provided in the embodiments of this application, an embodiment of this application further provides a system that includes the foregoing apparatus and one or more network devices.
It should also be understood that "first", "second", "third", "fourth", and the other numerals used herein serve only to distinguish items for ease of description and are not intended to limit the scope of the embodiments of this application.
It should be understood that the term "and/or" herein merely describes an association between associated objects and indicates that three relationships may exist. For example, "A and/or B" may indicate three cases: A alone, both A and B, and B alone. In addition, the character "/" herein generally indicates an "or" relationship between the objects before and after it.
It should be understood that, in the embodiments of this application, the sequence numbers of the foregoing processes do not imply an order of execution. The order in which the processes are executed should be determined by their functions and internal logic, and the sequence numbers should not limit the implementation of the embodiments of this application in any way.
A person of ordinary skill in the art will appreciate that the illustrative logical blocks and steps described in connection with the embodiments disclosed herein can be implemented by electronic hardware or by a combination of computer software and electronic hardware. Whether these functions are performed in hardware or in software depends on the particular application and the design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each particular application, but such an implementation should not be considered to go beyond the scope of this application.
In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus, and method may be implemented in other ways. The apparatus embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through some interfaces, apparatuses, or units, and may be electrical, mechanical, or of another form.
The foregoing embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in the embodiments of this application are produced in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable apparatus. The computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another. For example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by wired means (for example, coaxial cable, optical fiber, or digital subscriber line (DSL)) or wireless means (for example, infrared, radio, or microwave). The computer-readable storage medium may be any usable medium that a computer can access, or a data storage device, such as a server or data center, that integrates one or more usable media. The usable medium may be a magnetic medium (for example, a floppy disk, hard disk, or magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
The foregoing describes only specific implementations of this application, but the protection scope of this application is not limited thereto. Any variation or replacement that a person skilled in the art can readily conceive within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (18)

  1. A method for determining a shared risk link group, comprising:
    obtaining a first log set of one or more network devices, wherein the first log set contains logs that are not generated by a restart of the network device or by a failure of an interface board of the network device;
    filtering the logs in the first log set according to a log filtering rule to obtain a second log set; and
    obtaining a first log and a second log from the second log set, wherein the first log is a log generated by a first interface and the second log is a log generated by a second interface, and, when it is determined that the first log and the second log satisfy a preset condition, determining that a first link corresponding to the first interface and a second link corresponding to the second interface belong to a same shared risk link group.
  2. The method according to claim 1, wherein the log filtering rule comprises at least one of the following:
    filtering to obtain logs of a physical interface going up and/or going down;
    filtering to obtain logs of bit errors occurring on a physical interface;
    filtering to obtain logs of bidirectional forwarding detection (BFD) up and/or BFD down on a physical interface; and
    filtering to obtain logs of operation, administration, and maintenance (OAM) up and/or OAM down on a physical interface.
  3. The method according to claim 1 or 2, wherein determining that the first log and the second log satisfy the preset condition comprises:
    determining that the first log and the second log satisfy at least one of the following conditions:
    the first log indicates that the state of the first interface changes to up, the second log indicates that the state of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold;
    the first log indicates that the state of the first interface changes to down, the second log indicates that the state of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold;
    the first log indicates that a bit error occurs on the first interface, the second log indicates that a bit error occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold;
    the first log indicates that operation, administration, and maintenance (OAM) up occurs on the first interface, the second log indicates that OAM up occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; and
    the first log indicates that OAM down occurs on the first interface, the second log indicates that OAM down occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
  4. The method according to any one of claims 1 to 3, wherein the second log set comprises a network device identifier, a log-generating interface, a log generation time, and a log event.
  5. The method according to any one of claims 1 to 4, further comprising:
    obtaining a third log from the second log set, wherein the third log is a log generated by a third interface;
    when it is determined that the first log and the third log satisfy the preset condition, determining that the first link corresponding to the first interface and a third link corresponding to the third interface belong to a same shared risk link group; and
    when it is determined that the second log and the third log satisfy the preset condition, determining that the second link corresponding to the second interface and the third link corresponding to the third interface belong to a same shared risk link group.
  6. The method according to any one of claims 1 to 5, further comprising:
    reporting the determined shared risk link group to user equipment as an alarm; and
    delivering the determined shared risk link group to the network device, or delivering the determined shared risk link group to the network device according to an instruction from the user equipment.
  7. An apparatus for determining a shared risk link group, comprising:
    a transceiver unit, configured to obtain a first log set of one or more network devices, wherein the first log set contains logs that are not generated by a restart of the network device or by a failure of an interface board of the network device; and
    a processing unit, configured to: filter the logs in the first log set according to a log filtering rule to obtain a second log set; obtain a first log and a second log from the second log set, wherein the first log is a log generated by a first interface and the second log is a log generated by a second interface; and, when it is determined that the first log and the second log satisfy a preset condition, determine that a first link corresponding to the first interface and a second link corresponding to the second interface belong to a same shared risk link group.
  8. The apparatus according to claim 7, wherein the log filtering rule comprises at least one of the following:
    filtering to obtain logs of a physical interface going up and/or going down;
    filtering to obtain logs of bit errors occurring on a physical interface;
    filtering to obtain logs of bidirectional forwarding detection (BFD) up and/or BFD down on a physical interface; and
    filtering to obtain logs of operation, administration, and maintenance (OAM) up and/or OAM down on a physical interface.
  9. The apparatus according to claim 7 or 8, wherein, when determining that the first log and the second log satisfy the preset condition, the processing unit is specifically configured to:
    determine that the first log and the second log satisfy at least one of the following conditions:
    the first log indicates that the state of the first interface changes to up, the second log indicates that the state of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold;
    the first log indicates that the state of the first interface changes to down, the second log indicates that the state of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold;
    the first log indicates that a bit error occurs on the first interface, the second log indicates that a bit error occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold;
    the first log indicates that operation, administration, and maintenance (OAM) up occurs on the first interface, the second log indicates that OAM up occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; and
    the first log indicates that OAM down occurs on the first interface, the second log indicates that OAM down occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
  10. The apparatus according to any one of claims 7 to 9, wherein the second log set comprises a network device identifier, a log-generating interface, a log generation time, and a log event.
  11. The apparatus according to any one of claims 7 to 10, wherein the processing unit is further configured to:
    obtain a third log from the second log set, wherein the third log is a log generated by a third interface;
    when it is determined that the first log and the third log satisfy the preset condition, determine that the first link corresponding to the first interface and a third link corresponding to the third interface belong to a same shared risk link group; and
    when it is determined that the second log and the third log satisfy the preset condition, determine that the second link corresponding to the second interface and the third link corresponding to the third interface belong to a same shared risk link group.
  12. The apparatus according to any one of claims 7 to 11, wherein the processing unit is further configured to:
    report the determined shared risk link group to user equipment as an alarm; and
    deliver the determined shared risk link group to the network device, or deliver the determined shared risk link group to the network device according to a user instruction.
  13. An apparatus for determining a shared risk link group, comprising:
    a processor, a memory, and a bus, wherein the processor and the memory are connected by the bus, the memory is configured to store a set of program code, and the processor is configured to invoke the program code stored in the memory to perform the following operations:
    obtaining a first log set of one or more network devices, wherein the first log set contains logs that are not generated by a restart of the network device or by a failure of an interface board of the network device;
    filtering the logs in the first log set according to a log filtering rule to obtain a second log set; and
    obtaining a first log and a second log from the second log set, wherein the first log is a log generated by a first interface and the second log is a log generated by a second interface, and, when it is determined that the first log and the second log satisfy a preset condition, determining that a first link corresponding to the first interface and a second link corresponding to the second interface belong to a same shared risk link group.
  14. The apparatus according to claim 13, wherein the log filtering rule comprises at least one of the following:
    filtering to obtain logs of a physical interface going up and/or going down;
    filtering to obtain logs of bit errors occurring on a physical interface;
    filtering to obtain logs of bidirectional forwarding detection (BFD) up and/or BFD down on a physical interface; and
    filtering to obtain logs of operation, administration, and maintenance (OAM) up and/or OAM down on a physical interface.
  15. The apparatus according to claim 13 or 14, wherein, when determining that the first log and the second log satisfy the preset condition, the processor is specifically configured to:
    determine that the first log and the second log satisfy at least one of the following conditions:
    the first log indicates that the state of the first interface changes to up, the second log indicates that the state of the second interface changes to up, and the interval between the generation time of the first log and the generation time of the second log is less than a first threshold;
    the first log indicates that the state of the first interface changes to down, the second log indicates that the state of the second interface changes to down, and the interval between the generation time of the first log and the generation time of the second log is less than a second threshold;
    the first log indicates that a bit error occurs on the first interface, the second log indicates that a bit error occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a third threshold;
    the first log indicates that operation, administration, and maintenance (OAM) up occurs on the first interface, the second log indicates that OAM up occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fourth threshold; and
    the first log indicates that OAM down occurs on the first interface, the second log indicates that OAM down occurs on the second interface, and the interval between the generation time of the first log and the generation time of the second log is less than a fifth threshold.
  16. The apparatus according to any one of claims 13 to 15, wherein the second log set comprises a network device identifier, a log-generating interface, a log generation time, and a log event.
  17. The apparatus according to any one of claims 13 to 16, wherein the processor is further configured to:
    obtain a third log from the second log set, wherein the third log is a log generated by a third interface;
    when it is determined that the first log and the third log satisfy the preset condition, determine that the first link corresponding to the first interface and a third link corresponding to the third interface belong to a same shared risk link group; and
    when it is determined that the second log and the third log satisfy the preset condition, determine that the second link corresponding to the second interface and the third link corresponding to the third interface belong to a same shared risk link group.
  18. The apparatus according to any one of claims 13 to 17, wherein the processing unit is further configured to:
    report the determined shared risk link group to user equipment as an alarm; and
    deliver the determined shared risk link group to the network device, or deliver the determined shared risk link group to the network device according to an instruction from the user equipment.
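As an illustration of the preset conditions with separate thresholds and of the extension to a third log recited in the claims, the following Python sketch may help. It is a reading aid under stated assumptions, not the claimed implementation: the threshold values, the event labels, and the `(link, event, time)` tuple layout are hypothetical, and merging pairwise results transitively with a union-find structure is a design choice of this sketch that the claims do not prescribe.

```python
from itertools import combinations

# Assumed values, in seconds, for the first to fifth thresholds.
THRESHOLDS = {
    "if_up": 2.0,      # first threshold
    "if_down": 1.0,    # second threshold
    "bit_error": 5.0,  # third threshold
    "oam_up": 2.0,     # fourth threshold
    "oam_down": 1.0,   # fifth threshold
}


def meets_preset_condition(log_a, log_b, thresholds=THRESHOLDS):
    """Each log is a (link, event, time_in_seconds) tuple."""
    link_a, event_a, time_a = log_a
    link_b, event_b, time_b = log_b
    # Both logs must report the same known event on different links.
    if link_a == link_b or event_a != event_b or event_a not in thresholds:
        return False
    return abs(time_a - time_b) < thresholds[event_a]


def group_links(logs, thresholds=THRESHOLDS):
    """Merge links whose logs pairwise satisfy the condition into groups."""
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(logs, 2):
        if meets_preset_condition(a, b, thresholds):
            parent[find(a[0])] = find(b[0])

    groups = {}
    for link in list(parent):
        groups.setdefault(find(link), set()).add(link)
    return [g for g in groups.values() if len(g) > 1]
```

With this merging rule, a first link and a third link end up in one group whenever each is correlated with the second link, even if their own logs are further apart than the threshold. An implementation that wants strictly pairwise groups would skip the union step and report the pairs directly.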
PCT/CN2017/107448 2017-10-24 2017-10-24 Method and device for determining shared risk link group WO2019079961A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2017/107448 WO2019079961A1 (en) 2017-10-24 2017-10-24 Method and device for determining shared risk link group
CN201780009064.XA CN109964450B (en) 2017-10-24 2017-10-24 Method and device for determining shared risk link group

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/107448 WO2019079961A1 (en) 2017-10-24 2017-10-24 Method and device for determining shared risk link group

Publications (1)

Publication Number Publication Date
WO2019079961A1 true WO2019079961A1 (en) 2019-05-02

Family

ID=66246174

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/107448 WO2019079961A1 (en) 2017-10-24 2017-10-24 Method and device for determining shared risk link group

Country Status (2)

Country Link
CN (1) CN109964450B (en)
WO (1) WO2019079961A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021143524A1 (en) * 2020-01-15 2021-07-22 华为技术有限公司 Fault detection method, and apparatus
CN113691395A (en) * 2021-08-04 2021-11-23 曙光信息产业(北京)有限公司 Network operation and maintenance method and device, computer equipment and storage medium
CN117074915A (en) * 2023-08-23 2023-11-17 扬州万方科技股份有限公司 Board card soft and hard restarting test system and method based on characteristic keyword monitoring technology

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113821491A (en) * 2021-02-22 2021-12-21 京东科技控股股份有限公司 Method, apparatus, server and medium for generating network device log template

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1879356A (en) * 2003-11-21 2006-12-13 思科技术公司 Method and apparatus for determining network routing information based on shared risk link group information
WO2008058574A1 (en) * 2006-11-16 2008-05-22 Telefonaktiebolaget Lm Ericsson (Publ) Provision of alternative communications paths in a communications network
CN102868563A (en) * 2012-09-30 2013-01-09 华信邮电咨询设计研究院有限公司 Cross layer mapping management method of share risk link group
WO2016095710A1 (en) * 2014-12-15 2016-06-23 中兴通讯股份有限公司 Method and device for adjusting srlg

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7701848B2 (en) * 2003-07-11 2010-04-20 Chunming Qiao Efficient trap avoidance and shared protection method in survivable networks with shared risk link groups and a survivable network
CN105991188B (en) * 2015-02-16 2019-09-10 阿里巴巴集团控股有限公司 A kind of method and device detecting sharing memory

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021143524A1 (en) * 2020-01-15 2021-07-22 华为技术有限公司 Fault detection method, and apparatus
CN113691395A (en) * 2021-08-04 2021-11-23 曙光信息产业(北京)有限公司 Network operation and maintenance method and device, computer equipment and storage medium
CN117074915A (en) * 2023-08-23 2023-11-17 扬州万方科技股份有限公司 Board card soft and hard restarting test system and method based on characteristic keyword monitoring technology
CN117074915B (en) * 2023-08-23 2024-05-17 扬州万方科技股份有限公司 Board card soft and hard restarting test method based on characteristic keyword monitoring technology

Also Published As

Publication number Publication date
CN109964450B (en) 2020-06-26
CN109964450A (en) 2019-07-02

Similar Documents

Publication Publication Date Title
US10917322B2 (en) Network traffic tracking using encapsulation protocol
US10103851B2 (en) Network link monitoring and testing
CN104798341B (en) Service level is characterized on electric network
WO2019079961A1 (en) Method and device for determining shared risk link group
WO2015090098A1 (en) Method and apparatus for realizing fault location
CN108632099B (en) Fault detection method and device for link aggregation
CN112737800B (en) Service node fault positioning method, call chain generating method and server
CN113542008A (en) Network OAM method and device
CN109960634A (en) A kind of method for monitoring application program, apparatus and system
WO2021018309A1 (en) Method, device and system for determination of message transmission path, and computer storage medium
CN108449228A (en) Message processing method and the network equipment
EP3232620B1 (en) Data center based fault analysis method and device
CN103023815B (en) Aggregation link load sharing method and device
US9280426B2 (en) System and method for server redundancy
CN108599978B (en) Cloud monitoring method and device
WO2012106914A1 (en) Dynamic tunnel fault diagnosis method, device and system
CN107005440B (en) method, device and system for positioning link fault
Evang et al. Crosslayer network outage classification using machine learning
CN116248479A (en) Network path detection method, device, equipment and storage medium
CN101431435A (en) Connection-oriented service configuration and management method
Gao et al. Xshot: Light-weight link failure localization using crossed probing cycles in SDN
CN113766363A (en) Fault monitoring method and device and computing equipment
WO2024041213A1 (en) Communication method, apparatus and system, and storage medium
US20240171498A1 (en) Detecting in-transit inband telemetry packet drops
CN104796276B (en) A kind of link switch-over method and system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17929610

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17929610

Country of ref document: EP

Kind code of ref document: A1