CN112770197A - Method, device, equipment and storage medium for determining OTN equipment fault reason - Google Patents

Method, device, equipment and storage medium for determining OTN equipment fault reason Download PDF

Info

Publication number
CN112770197A
CN112770197A CN202011631074.7A CN202011631074A CN112770197A CN 112770197 A CN112770197 A CN 112770197A CN 202011631074 A CN202011631074 A CN 202011631074A CN 112770197 A CN112770197 A CN 112770197A
Authority
CN
China
Prior art keywords
information
otn
equipment
determining
layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011631074.7A
Other languages
Chinese (zh)
Inventor
杨肖潇
杨俊杰
卢道和
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WeBank Co Ltd
Original Assignee
WeBank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WeBank Co Ltd filed Critical WeBank Co Ltd
Priority to CN202011631074.7A priority Critical patent/CN112770197A/en
Publication of CN112770197A publication Critical patent/CN112770197A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • H04Q2011/0079Operation or maintenance aspects
    • H04Q2011/0081Fault tolerance; Redundancy; Recovery; Reconfigurability

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The embodiment of the application discloses a method, a device, equipment and a storage medium for determining the reason of OTN equipment failure, wherein the method comprises the following steps: acquiring alarm information to be processed; the alarm information to be processed is alarm information of network equipment connected with the OTN equipment; extracting key equipment parameters from the alarm information to be processed, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of a corresponding physical port; determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table; and determining the fault reason of the OTN equipment based on the matching result.

Description

Method, device, equipment and storage medium for determining OTN equipment fault reason
Technical Field
The present application relates to computer technology, and relates to, but is not limited to, a method, an apparatus, a device, and a storage medium for determining a cause of a failure of an OTN device.
Background
In the related art, the alarm information monitoring platform generally collects the alarm information of the OTN device and the alarm information of the network device in full, and respectively converges the alarm information of the OTN device and the alarm information of the network device, thereby completing the whole alarm convergence process.
The problems in the related art are: the alarm of the OTN device is connected to the total alarm system, which requires a certain development cost. Since the OTN device is relatively closed, the interface for reporting data by the network manager is usually privatized and customized. The access to the main network management system has high corresponding development cost and long development period. And after the proprietary protocol is developed and the later version of the network management is updated, new rules may need to be adapted again. In conclusion, the software development cost is high, and the period is long.
Disclosure of Invention
In view of this, embodiments of the present application provide a method, an apparatus, a device, and a storage medium for determining a cause of a fault of an OTN device, so as to solve the problems in the related art, such as a high software development cost and a long period, that are used for converging an alarm of the OTN device.
In a first aspect, an embodiment of the present application provides a method for determining a cause of a failure of an OTN device, where the method includes: acquiring alarm information to be processed; the alarm information to be processed is alarm information of network equipment connected with the OTN equipment; extracting key equipment parameters from the alarm information to be processed, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of a corresponding physical port; determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table; and determining the fault reason of the OTN equipment based on the matching result.
In a second aspect, an embodiment of the present application provides an apparatus for determining a cause of a failure of an OTN device, including: the acquisition module is used for acquiring alarm information to be processed; the alarm information to be processed is alarm information of network equipment connected with the OTN equipment; an extraction module, configured to extract a key device parameter from the alarm information to be processed, where the key device parameter is used to indicate an identifier of the network device and an identifier of a corresponding physical port; the matching module is used for determining a matching result with the level information of the OTN equipment in the mapping table based on the key equipment parameters; and the determining module is used for determining the fault reason of the OTN equipment based on the matching result.
In a third aspect, an embodiment of the present application provides a computer device, including a memory and a processor, where the memory stores a computer program that is executable on the processor, and the processor executes the computer program to implement the steps in any of the methods for determining a cause of a failure of an OTN device according to the embodiments of the present application.
In a fourth aspect, the present application provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps in the method for determining a cause of a failure of an OTN device according to any one of the embodiments of the present application.
In the embodiment of the application, key equipment parameters are extracted from the acquired alarm information of the network equipment to be processed, and the fault reason of the OTN equipment is determined according to the matching result of the key equipment parameters and the level information of the OTN equipment in the mapping table. Therefore, according to the technical scheme provided by the embodiment of the application, the alarm information of the OTN equipment does not need to be acquired, then the key equipment parameters of the network equipment are extracted from the alarm information of the network equipment, and the key equipment parameters are matched with the hierarchy information of the OTN equipment, so that the fault reason of the OTN equipment can be deduced according to the alarm information of the network equipment. Because the alarm information of the OTN equipment is not collected, and software for processing the alarm information of the OTN equipment is not needed, the problems of high software development cost and long development period are solved from the source.
Drawings
Fig. 1is a schematic flowchart of a method for determining a cause of a failure of an OTN device according to an embodiment of the present application;
fig. 2is a schematic diagram of a logarithmic function relationship between an initial value of an abnormal probability of an object in an OPU layer and a first ratio according to an embodiment of the present application;
fig. 3is a schematic diagram of a method for determining a cause of a fault of an OTN device in the related art;
fig. 4 is a schematic diagram of a method for determining a cause of a fault of an OTN device in the related art;
fig. 5 is a schematic diagram of a method for determining a cause of a failure of an OTN device in an embodiment of the present application;
FIG. 6 is a schematic diagram illustrating a process of alarm formatting according to an embodiment of the present application;
FIG. 7is a schematic flow chart illustrating a method for calculating a probability convergence warning according to an embodiment of the present application;
fig. 8is a schematic structural diagram of a device for determining a cause of a fault of an OTN device according to an embodiment of the present application;
fig. 9 is a hardware entity diagram of a computer device according to an embodiment of the present application.
Detailed Description
The technical solution of the present application is further elaborated below with reference to the drawings and the embodiments.
Fig. 1is a schematic flow chart of an implementation process of a method for determining a cause of a fault of an OTN device provided in an embodiment of the present application, and is applied to a monitoring platform, as shown in fig. 1, the method includes:
step 102: acquiring alarm information to be processed; the alarm information to be processed is alarm information of network equipment connected with the OTN equipment;
the OTN (optical transport network) refers to a transport network system that implements transport, multiplexing, routing, and monitoring of service signals in an optical domain, and ensures performance indexes and survivability thereof. The OTN device may be a device on an OTN system, and the OTN device may be an optical terminal device, an optical line device, an optical relay device, an optical add/drop multiplexer, and the like; the OTN system comprises OTN equipment and a network management server, wherein the OTN equipment is connected with the network equipment so as to realize the transmission of signals between different network equipment through the OTN equipment; the network devices may be computers, gateways, routers, bridges, and the like.
When the Network equipment is in fault, the monitoring unit of the Network equipment can give out an alarm signal according to the fault condition and send out alarm information to the monitoring platform through Simple Network Management Protocol (SNMP); the alarm information to be processed may be original alarm information of the network device, or alarm information obtained by screening the original alarm information.
The monitoring platform is also called a master alarm system, is a monitoring platform which is set up for monitoring the abnormality of all equipment in operation in a production network, can collect logs of all abnormal alarm related equipment, matches keywords to generate alarm information, and has an interface capability to receive the alarm information provided by other systems.
In the implementation process, the alarm information to be processed may be at least one piece of original alarm information sent by the network device and received by the monitoring platform, or may be the alarm information obtained by formatting the original alarm information. For example, at least one piece of original alarm information sent by the network device and received by the monitoring platform within a preset time period may be used as the alarm information to be processed; for another example, at least one piece of original alarm information in a preset time period may be formatted according to a timestamp generated by the original alarm information, and the formatted original alarm information may be used as the alarm information to be processed.
Step 104: extracting key equipment parameters from the alarm information to be processed, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of a corresponding physical port;
the identifier of the network device may be an IP (Internet Protocol) address of the network device, a network device name of the network device, and the like, and the identifier of the physical port may be a physical port name of the physical port.
Step 106: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
the mapping table may be understood as a resource table between the network device and the OTN device, and may pre-store the identifier of the network device, the identifier of the physical port, and the hierarchy information of the OTN device in advance during implementation, where the matching result includes matching and mismatching of the key device parameter and the hierarchy information of the OTN device. The hierarchy information of the OTN device represents the relationship among the layers of the OTN device, see the relationship among the layers in the OTS layer, the OCH layer and the OPU layer in the OTN device in table 1.
Assuming that the identifier of the network device extracted from the alarm information to be processed is a target network device name and the identifier of the extracted physical port is a target physical port name, the target network device name and the target physical port name may be considered to be matched with the hierarchical information of the OTN device under the condition that the network device name and the physical port name which are the same as the target network device name and the target physical port name are found from the mapping table.
And under the condition that the network equipment name and the physical port name which are the same as the target network equipment name and the target physical port name are not found from the mapping table, the target network equipment name and the target physical port name are considered to be not matched with the hierarchy information of the OTN equipment.
Step 108: and determining the fault reason of the OTN equipment based on the matching result.
And determining the OTN hierarchy which is possibly failed according to the matching result, and determining the failure reason of the OTN equipment according to the hierarchy information of the OTN hierarchy.
In some embodiments, the level information in the mapping table is the abnormal probability of the possible object, and the level information matched with the mapping table can be obtained from the matching result, referring to the embodiment of table 3, the number of the received alarm information is 8, and the number of the received alarm information matched with the mapping table is 6. And taking the target object with the maximum abnormal probability or the target object with the abnormal probability meeting the preset condition as the fault reason of the OTN equipment from the matched 6 pieces of layer-level information. Referring again to the embodiment of table 3, the probability of the second alarm message and the eighth alarm message is the largest, and the output may be: the 'maximum Fiber-AA-BB-01 anomaly probability of the OTS layer' and 'the second maximum 192.4THz anomaly probability of the OCH layer' are adopted, so that the fault reasons can be the Fiber-AA-BB-01 anomaly of the OTS layer and the 192.4THz anomaly of the OCH layer.
In other embodiments, the level information in the mapping table may have no abnormal probability of the object, and the level information matched with the mapping table may still be obtained from the matching result, and the matched target object may be obtained based on the matched level information, for example, the target object in the OTS information, the target object in the OCH information, and the target object in the OPU information; then, judging whether two target objects with physical connection relation are in a target level record table, for example, firstly, obtaining a target level record table between an OTS layer and an OCH layer according to the target objects in the OTS information; here, the target level record table records an object set of the OCH layer associated with a target object in the OTS information; in the target level record table, it is determined whether the target object in the OCH information is in the target level record table, i.e. whether the target object is in the object set of the OCH layer. If so, the target object in the OTS information is taken as the failure reason. If not, continuously acquiring a target level record table between the OCH layer and the OPU layer according to the target object in the OCH information; and in the target level record table, determining whether the target object in the OPU information is in the target level record table, if so, taking the target object in the OCH information as a fault reason, otherwise, taking the target object in the OPU information as the fault reason.
In the embodiment of the application, key equipment parameters are extracted from the acquired alarm information of the network equipment to be processed, and the fault reason of the OTN equipment is determined according to the matching result of the key equipment parameters and the level information of the OTN equipment in the mapping table. Therefore, according to the technical scheme provided by the embodiment of the application, the alarm information of the OTN equipment does not need to be acquired, then the key equipment parameters of the network equipment are extracted from the alarm information of the network equipment, and the key equipment parameters are matched with the hierarchy information of the OTN equipment, so that the fault reason of the OTN equipment can be deduced according to the alarm information of the network equipment. Because the alarm information of the OTN equipment is not collected, and software for processing the alarm information of the OTN equipment is not needed, the problems of high software development cost and long development period are solved from the source.
The embodiment of the present application further provides a method for determining a cause of a fault of an OTN device, which is applied to a monitoring platform, and the method includes:
step S202: acquiring at least one piece of original alarm information from at least one piece of network equipment;
the original alarm information may be the alarm information which is sent to the monitoring platform by the monitoring unit of the network device through the SNMP protocol without preprocessing.
In one embodiment, obtaining at least one original alarm message from at least one network device comprises: the method comprises the steps that a monitoring platform receives at least one piece of original alarm information sent by at least one piece of network equipment in a preset time period; in another embodiment, obtaining at least one original alarm message from at least one network device comprises: at least one piece of original alarm information in a preset time period can be acquired according to the timestamp generated by the original alarm information.
Step S204: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
here, since at least one piece of original alarm information may be from a plurality of network devices, and formats of alarm SNMP logs of vendors to which the respective network devices belong are different, at least one piece of original alarm information may be preprocessed in a formatted manner.
Step S206: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
the key device parameters may be extracted from each of the alarm messages, and the matching result corresponding to the hierarchy information of the OTN device in the mapping table may be determined based on each of the key parameters.
Step S208: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
step S210: and determining the fault reason of the OTN equipment based on the matching result of the at least one piece of alarm information.
The fault reason of the OTN equipment can be determined by comprehensively analyzing the matching results of the plurality of alarm messages.
In the embodiment of the application, the at least one piece of original alarm information is obtained, and the at least one piece of original alarm information is subjected to formatting pretreatment to obtain the alarm information to be processed, so that the standardability of the alarm information can be improved.
The embodiment of the present application further provides a method for determining a cause of a fault of an OTN device, which is applied to a monitoring platform, and the method includes:
step S302: acquiring at least one piece of original alarm information from at least one piece of network equipment;
step S304: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
step S306: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
step S308: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
the matching result comprises matching and mismatching between the key equipment parameter and the hierarchy information of the OTN equipment, and abnormal probability of abnormality of an object in the hierarchy information of the OTN equipment under the matching condition;
step S310: determining the weight of each level of information in the OTN equipment;
wherein the OTN device may include an OPU (Optical Payload Unit) layer, an OCH (Optical Channel) layer, and an OTS (Optical Transmission Section) layer;
table 1is a mapping table provided in this embodiment, where the mapping table includes the network device name and the physical port name, and hierarchical information of an OPU layer, an OCH layer, and an OTS layer of the OTN device:
TABLE 1
Figure BDA0002880073830000081
The hierarchy information of the OPU layer is OPU information, and the OPU information comprises name information and port information of optical transmission equipment, the port rate of the OPU layer and the abnormal probability of an object in the OPU layer; the abnormal probability of the object in the OPU layer can be preset; the name information of the optical transmission device may be the client-side device in table 1, and may be "SZ-DW-AA", for example; the port information may be a client-side port in Table 1, and may be, for example, "C-1/3"; the port rate of the OPU layer may be the port rate in table 1, and may be, for example, "10 Gb/s"; the probability of abnormality of an object in the OPU layer may be the first probability in table 1, and may be "30%" for example.
The OCH information is information of the OCH layer related to the OPU layer, and comprises channel frequency of a channel, an initial station and a termination station, channel speed and abnormal probability of an object in the OCH layer; the abnormal probability of the object in the OCH layer can be preset; the channel frequency may be "192.1 THz"; the origination site may be "AA"; the termination site may be "BB"; the channel rate may be "10 Gb/s"; the anomaly probability of the object in the OCH layer may be the second probability in table 1, and may be "40%" for example.
The hierarchy information of the OTS layer is OTS information, and the OTS information comprises operator optical fiber information, total bandwidth information of the service currently borne by the OTS layer and the abnormal probability of an object in the OTS layer; the abnormal probability of the object in the OTS layer can be preset; the operator Fiber information may be "Fiber-AA-BB-01", the total bandwidth information of the traffic now carried by the OTS layer may be the carried rate in table 1, and the anomaly probability of the object in the OTS layer may be a third probability in table 1, for example, "30%".
The OPU layer is an information structure for adapting the transmission of customer information over an optical channel; the optical path payload unit is used to combine the client information, overhead needed to adapt the client signal rate and OPUK payload rate, and other OPUK overhead to support client signaling. These overheads are specific adaptation overheads, the capacity of the OPUK is divided by k, k being 0, 1, 2e, 3, 4.
The OCH layer is the core of the whole OTN and is the main functional carrier of the OTN; the OCH layer is mainly responsible for routing and wavelength allocation of customer signals of various formats and provides networking functions for end-to-end optical channels for transparent transmission of subscriber signals of various formats.
The OTS layer is responsible for handling optical transmission segment overhead to ensure integrity of optical transmission segment adaptation information while enabling detection and control of optical amplifiers or repeaters.
The weight can be preset or obtained by calculation; each OTN device includes hierarchical information of three layers, i.e., OPU, OCH, and OTS, and weights of the hierarchical information of different OTN devices in the same OTN hierarchy may be the same or different.
Step S312: and determining the fault reason of the OTN equipment based on the weight of each level of information in the OTN equipment and the abnormal probability of the object in each level of information in the OTN equipment.
Each layer level information of the OTN device may include at least one object, and each object corresponds to an abnormal probability that an abnormality may occur; an initial value of an abnormal probability of an object in the hierarchy information of each OTN device, that is, an initial abnormal probability, may be determined according to a first ratio between a channel rate of an OCH channel in the OTN device and an OPU port rate, and when the first ratio is equal to 1, the initial abnormal probability of the OCH layer may be required to be the maximum, for example, may be 40%; the initial anomaly probability of the OPU layer is small, for example, may be 30%; the initial anomaly probability of the OTS layer is minimal, and may be 30%, for example; when the first ratio is greater than 1, it may be required that the initial anomaly probability of the OPU layer is the largest, for example, may be 80% or 60%, the initial anomaly probability of the OCH layer is smaller, for example, may be 15% or 30%, and the initial anomaly probability of the OTS layer is the smallest, for example, may be 5% or 10%. The size relation among the initial abnormal probabilities of the OTS layer, the OCH layer and the OPU layer is determined by judging whether the first ratio is larger than 1, so that the determination of the initial abnormal probabilities of the OTN layers is more flexible and accurate.
Further, there may be a logarithmic function relationship between an initial value y of the anomaly probability of the object in the OPU layer and a first ratio x between a channel rate of the OCH in the OTN device and a port rate of the OPU, where y is a dependent variable and x is an independent variable.
In some embodiments, the logarithmic function may be a natural logarithmic function. For example, for a certain network, the expression of y and x can be expressed by formula (1):
y=0.20ln(x)+0.31 (1);
the above 0.20 and 0.31 are preferred values obtained by multiple tests in this embodiment, and in other embodiments, other values may be set according to actual needs, and are not limited specifically. Referring to fig. 2, in the case that the first ratio is 10, the initial abnormal probability of the OPU layer is about 77%, the initial abnormal probability of the OCH layer may be 20%, and the initial abnormal probability of the OTS layer may be 7%; in a case that the first ratio is 1, an initial abnormal probability of the OPU layer is about 31%, the initial abnormal probability of the OCH layer may be 40%, and the initial abnormal probability of the OTS layer may be 30%. The initial value of the abnormal probability of the object in the OPU layer is determined according to the logarithmic function relationship, and then the initial values of the abnormal probabilities of the OCH layer and the OTS layer are determined according to the initial value of the abnormal probability of the OPU layer, so that the accuracy of determining the abnormal probability can be further improved.
In the embodiment of the application, the fault reason of the OTN equipment is determined according to the weight of each level of information in the OTN equipment and the abnormal probability of the object in each level of information in the OTN equipment, so that the fault reason can be determined more accurately; in addition, the initial value of the abnormal probability of the object abnormal in each layer level information of the OTN equipment is determined according to the first ratio between the OCH channel rate and the OPU port rate, so that the determination of the initial value of the abnormal probability can be more accurate.
The embodiment of the present application further provides a method for determining a cause of a fault of an OTN device, which is applied to a monitoring platform, and the method includes:
step S402: acquiring at least one piece of original alarm information from at least one piece of network equipment;
step S404: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
step S406: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
step S408: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
the matching result comprises matching and mismatching between the key equipment parameter and the hierarchy information of the OTN equipment, and abnormal probability of abnormality of an object in the hierarchy information of the OTN equipment under the matching condition;
step S410: determining the matching quantity between the key equipment parameter of each alarm information and the level information of the OTN equipment in the mapping table based on the matching result of each alarm information;
assuming that the number of the acquired alarm information to be processed is 8, and in the 8 pieces of alarm information, the matching between the key device parameters of 6 pieces of alarm information and the hierarchy information of the OTN device in the mapping table is performed, it may be determined that the number of the matching between the key device parameters of the alarm information and the hierarchy information of the OTN device in the mapping table is 6.
Step S412: determining a first ratio between the OCH rate and the OPU rate in the OTN device;
wherein the OCH rate may be a channel rate of the OCH layer, and the OPU rate may be a port rate of the OPU layer; referring to table 1, the first ratios corresponding to different physical port names are different, and under the condition that the physical port name is "Eth 1/1", the ratio of the OCH rate to the OPU rate is 10Gb/s to 10Gb/s, which is 1; under the condition that the name of the physical port is T1/3, the ratio of the OCH rate to the OPU rate is 100Gb/s to 10Gb/s, namely 10; under the condition that the name of the physical port is T2/3, the ratio of the OCH rate to the OPU rate is 100Gb/s to 100Gb/s, namely 1; under the condition that the name of the physical port is F2/4, the ratio of the OCH rate to the OPU rate is 40Gb/s to 10Gb/s, namely 4; in one embodiment, the maximum ratio 10 of the OCH rate to the OPU rate may be selected as the first ratio; in another embodiment, any ratio of the OCH rate to the OPU rate may be selected as the first ratio.
Step S414: determining a second ratio between the number of matches and the first ratio;
step S416: and determining the weight of each layer level information in the OTN equipment based on the second ratio.
Table 2 shows a method for determining the weight of each level of information in the OTN device:
TABLE 2
Figure BDA0002880073830000121
Referring to table 2, in the case that the second ratio is less than or equal to 1, that is, in the case that the number of matches is less than the first ratio, it may be determined that the weights of the OPU layer, the OCH layer, and the OTS layer of the OTN device are 1.2, 0.8, and 0.4, respectively; when the second ratio is greater than 1 and less than or equal to 2, that is, the number of matches is greater than the first ratio and is less than twice the first ratio, it may be determined that the weights of the OPU layer, the OCH layer, and the OTS layer of the OTN device are 0.4, 1.2, and 0.8, respectively; in case that the second ratio is greater than 2, that is, the number of matches is greater than twice the first ratio, it may be determined that the weights of the OPU layer, the OCH layer, and the OTS layer of the OTN device are 0.4, 0.8, and 1.2, respectively.
Step S418: and determining the fault reason of the OTN equipment based on the weight of each level of information in the OTN equipment and the abnormal probability of the object in each level of information in the OTN equipment.
In the embodiment of the application, the weight of each layer level information in the OTN equipment is determined according to the matching number and the first ratio between the OCH rate and the OPU rate, so that the determination of the weight is more accurate.
The embodiment of the present application further provides a method for determining a cause of a fault of an OTN device, which is applied to a monitoring platform, and the method includes:
step S502: acquiring at least one piece of original alarm information from at least one piece of network equipment;
step S504: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
step S506: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
step S508: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
the matching result comprises matching and mismatching between the key equipment parameter and the hierarchy information of the OTN equipment, and abnormal probability of abnormality of an object in the hierarchy information of the OTN equipment under the matching condition;
step S510: determining the matching quantity between the key equipment parameter of each alarm information and the level information of the OTN equipment in the mapping table based on the matching result of each alarm information;
step S512: determining a first ratio between the OCH rate and the OPU rate in the OTN device;
step S514: determining a second ratio between the number of matches and the first ratio;
wherein, assuming that the number of matches is 6 and the first ratio is 4, the second ratio is 1.5.
Step S516: and determining the weight of each layer level information in the OTN equipment based on the second ratio.
Since 1< the second ratio ≦ 2, it can be determined that the weights of the OPU layer, the OCH layer, and the OTS layer are 0.4, 1.2, and 0.8 in this order.
Step S518: for an object in each level information, determining the final probability of the object by accumulation based on the number of times of matching the object, the weight of the level information where the object is located and the abnormal probability of the object;
table 3is another mapping table provided in this embodiment, where the mapping table includes the network device name and the physical port name, and level information of the OPU layer, the OCH layer, and the OTS layer of the OTN device:
TABLE 3
Figure BDA0002880073830000141
Figure BDA0002880073830000151
Referring to Table 3, assuming that the weight of the hierarchy information of the OTS layer is 0.8, the final probability of the object "Fiber-AA-BB-01" of the OTS layer may be: 30% by 0.8+ 10% by 0.8+ 30% by 0.8-1.12.
Assuming that the weight of the level information of the OCH layer is 1.2, the final probabilities of the objects "192.1 THz", "192.2 THz", "192.3 THz", "192.5 THz" of the OCH layer can be: 40%. 1.2 ═ 0.48; the final probability of the object "192.4 THz" of the OCH layer may be: 30%. 1.2+ 30%. 1.2 ═ 0.72.
Assuming that the weight of the hierarchy information of the OPU layer is 0.4, since the objects of the OPU layer only appear once, and there is no accumulation, the maximum value of the final probability of the objects of the OPU layer may be 60% by 0.4 — 0.24.
Step S520: sequencing the final probabilities of all the objects to obtain a sequencing result;
the final probability of the object is ranked, and a ranking result can be obtained: 1.12>0.72>0.48> 0.24.
Step S522: and determining the fault reason of the OTN equipment based on the sequencing result.
Wherein, because the final probability of the object 'Fiber-AA-BB-01' of the OTS layer is the largest, it can be concluded that the failure cause of the OTN equipment may be related to the optical Fiber with the operator optical Fiber number 'Fiber-AA-BB-01'.
In the embodiment of the application, the final probability of the object is determined in an accumulation mode based on the number of times of matching the object, the weight of the level information where the object is located and the abnormal probability of the object, and the fault reason of the OTN equipment is determined according to the sequencing result of the final probability, so that the final probability can be determined more accurately.
The embodiment of the present application further provides a method for determining a cause of a fault of an OTN device, which is applied to a monitoring platform, and the method includes:
step S602: acquiring at least one piece of original alarm information from at least one piece of network equipment;
step S604: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
step S606: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
step S608: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
the matching result comprises matching and mismatching between the key equipment parameter and the hierarchy information of the OTN equipment, and abnormal probability of abnormality of an object in the hierarchy information of the OTN equipment under the matching condition;
step S610: determining the matching quantity between the key equipment parameter of each alarm information and the level information of the OTN equipment in the mapping table based on the matching result of each alarm information;
step S612: determining a first ratio between the OCH rate and the OPU rate in the OTN device;
step S614: determining a second ratio between the number of matches and the first ratio;
step S616: and determining the weight of each layer level information in the OTN equipment based on the second ratio.
Step S618: for an object in each level information, determining the final probability of the object in an accumulation mode based on the number of times of matching the object, the weight of the level information where the object is located and the abnormal probability of the object;
step S620: sequencing the final probabilities of all the objects to obtain a sequencing result;
step S622: screening out target objects meeting preset conditions based on the sorting result;
step S624: and determining the fault reason of the OTN equipment based on the target object and the hierarchy information of the layer where the target object is located.
Wherein the preset conditions may be that the final probability in the ranking results is the largest and the second largest; then "Fiber-AA-BB-01" of the OTS layer with a final probability of 1.12 and "192.4 THz" of the OCH layer with a final probability of 0.72 can be determined as target objects; it can be determined that the cause of the failure of the OTN device can be an operator major optical interruption, i.e. an optical interruption with an operator Fiber number "Fiber-AA-BB-01".
In the embodiment of the application, the target object is screened out according to the final probability sorting result of the object, and the fault reason of the OTN equipment is determined according to the target object and the hierarchy information of the layer where the target object is located, so that the fault reason of the OTN equipment can be determined more accurately.
The embodiment of the present application further provides a method for determining a cause of a fault of an OTN device, which is applied to a monitoring platform, and the method includes:
step S702: acquiring at least one piece of original alarm information from at least one piece of network equipment;
step S704: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
step S706: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
step S708: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
the matching result comprises matching and mismatching between the key equipment parameter and the hierarchy information of the OTN equipment, and abnormal probability of abnormality of an object in the hierarchy information of the OTN equipment under the matching condition;
step S710: determining the matching quantity between the key equipment parameter of each alarm information and the level information of the OTN equipment in the mapping table based on the matching result of each alarm information;
step S712: determining a first ratio between the OCH rate and the OPU rate in the OTN device;
step S714: determining a second ratio between the number of matches and the first ratio;
step S716: and determining the weight of each layer level information in the OTN equipment based on the second ratio.
Step S718: for an object in each level information, determining the final probability of the object by accumulation based on the number of times of matching the object, the weight of the level information where the object is located and the abnormal probability of the object;
step S720: sequencing the final probabilities of all the objects to obtain a sequencing result;
step S722: screening out target objects meeting preset conditions based on the sorting result;
step S724: determining a fault reason of the OTN equipment based on the target object and the hierarchy information of the layer where the target object is located;
step S726: outputting the fault reason to prompt a network manager of the OTN equipment;
step S728: obtaining a feedback result of the network management of the OTN equipment to the fault reason;
step S730: and maintaining or updating the abnormal probability of the object in each level of information in the mapping table based on the feedback result.
The master alarm system may output the failure cause to prompt a network manager of the OTN device, and the network manager may check an actual failure scene to obtain a feedback result of the failure cause, that is, may obtain an actual failure, and if the OTN level where the actual failure is located matches the OTN level where the object having the maximum value of the final probability is located, maintain the abnormal probability of the object in the information of each level in the mapping table; if the OTN level where the actual fault is located is matched with the OTN level where the object with the second largest value of the final probability is located, the probability of the OTN layer where the object with the second largest value is located may be increased by 10%, and the probabilities of the other OTN layers may be decreased by 10%; if the OTN level where the actual failure is located is not matched with the OTN level where the object with the maximum value of the final probability is located or the OTN level where the object with the second maximum value is located, the probability of the actual correct OTN layer may be increased by 20%, and the probabilities of the other OTN layers may be decreased by 20%.
In the embodiment of the application, the abnormal probability of the object in each level of information in the mapping table is dynamically maintained or updated according to the actual fault, so that the fault reason of the OTN equipment can be more accurately determined.
In the related art, there are two main methods related to an alarm convergence method of an optical transport network OTN device (which may also be referred to as a method for determining a failure cause of the OTN device):
a first alarm convergence method is shown in fig. 3, in which a total alarm system collects alarm information of an OTN device and alarm information of a network device in an OTN system, and performs uniform convergence on the collected alarm information of the OTN device and the collected alarm information of the network device; when the OTN equipment fails, pushing the alarm information of the OTN equipment to an OTN network management system of a manufacturer to which the OTN equipment belongs through a private protocol of the manufacturer to which the OTN equipment belongs; then, the OTN network management systems of each manufacturer push the alarm information of the corresponding OTN device to the master alarm system through a Telemetry (telemeasurement) technology or a Common Object Request Broker Architecture (CORBA); meanwhile, the alarm information of the Network device is pushed to the total alarm system through a Simple Network Management Protocol (SNMP).
The total alarm system converges the alarm information of the network equipment according to the connection relation between the network equipment and the OTN equipment by comparing the alarm information of the OTN equipment and the alarm information of the network equipment which occur in a specific time period; and finally, converging the alarm information of the OTN equipment according to the hierarchical relation of the OPU, the OCH and the OTS. And finally, converging to obtain the root fault reason of the OTN equipment, and finishing the whole alarm convergence process.
Telemetry is a remote data acquisition technology for monitoring the performance and faults of equipment, and the Telemetry technology adopts a push mode to acquire abundant monitoring data in time, so that the network fault can be quickly positioned, and the problem of network operation and maintenance is solved. Push mode is a technique based on a client/server mechanism where the server actively sends information to the client.
CORBA is a standard Object-oriented application program system specification (OMG) made by the Object Management Group (Object Management Group), which is a Distributed processing Environment (Distributed Computing Environment) solution for Object Management organizationDCE), a solution proposed by the interconnection of hardware and software systems; the role of the OMG organization is to provide a common framework for application development, to develop industrial guidelines and pairsThe development of object technology is accelerated according to management specifications.
A second alarm convergence method is shown in fig. 4, and first converges the alarm information of the OTN device through an OTN network management system of a vendor to which the OTN device belongs, and then pushes the converged alarm information of the OTN device to a master alarm system; when the OTN equipment fails, specific alarm information of the OTN equipment is pushed to an OTN network management system of the affiliated manufacturer through a private protocol of the affiliated manufacturer, and the OTN network management system converges according to the hierarchical relationship of OPU, OCH and OTS through a preset alarm convergence rule; and pushing the alarm information of the converged OTN equipment to a master alarm system through telemeasurement or CORBA corresponding to OTN network management systems of different manufacturers, and simultaneously pushing the alarm information of the network equipment to the master alarm system through SNMP.
The total alarm system converges the alarm information of the network equipment by comparing the alarm information of the converged OTN equipment and the alarm information of the network equipment in a specific time period according to the pre-entered corresponding relationship between the network equipment and the OTN equipment, and finally completes the whole alarm convergence process.
However, in the two alarm convergence methods, the alarm information on the OTN system is accessed to the total alarm system, and a certain development cost is required. Because the OTN system is relatively closed, the interfaces for reporting data by the OTN network management system of the OTN system are usually privatized and customized. The access of the master alarm system has high corresponding development cost and long development period, and the proprietary protocol may need to adapt to new rules after the development is completed and the later version of the network management is updated.
In addition, when the OTN system fails, a large amount of alarm information of the OTN device and alarm information of the network device may appear at the same time. The alarm formats of all OTN manufacturers are inconsistent with the alarm formats of network equipment manufacturers, and the methods for alarm convergence in the OTN equipment scene are also inconsistent, so that if the methods are directly used for analysis, the complexity of calculation is relatively high, and further, the time for analyzing and processing the fault is relatively long.
Moreover, if a large number of alarms of the network device and alarms of the OTN device are simultaneously sent in a short time, the total alarm system may occasionally lose alarms, and once the alarm information of the OTN system or the alarm information of the network device is lost, the calculation convergence may be affected, and even according to the previously established algorithm, the convergence failure may occur.
According to the embodiment of the application, the complexity of alarm convergence under the situation related to optical transmission can be reduced by analyzing the characteristics of the optical transport network and using a probability iterative learning mode, and the time of alarm convergence is shortened.
The method and the device for processing the alarm information of the network equipment can preprocess the alarm information of the network equipment by using a formatting method, and match the abnormal probability of the object in the hierarchical information of the OTN equipment corresponding to the pre-recorded network equipment based on the corresponding relation between the network equipment and the OTN equipment pre-recorded in the mapping table.
According to the embodiment of the application, the alarm convergence result can be calculated by using a probability iteration and weight assignment method, the convergence result can be confirmed again, and the corresponding probability is trained to increase the accuracy of subsequent convergence calculation.
Referring to fig. 5, in a scenario where an OTN system is abnormal and affects a network device, the method may select only the alarm information of the network device for formatting preprocessing in response to a situation that different OTN device manufacturers and network device manufacturers have many types of alarm information, a large number of alarm information, and possibly alarm loss, and converge the alarm information caused by the abnormality of the OTN system by using a specific probability method; after alarm convergence is completed, iterative optimization is performed according to the actual fault reason, and the probability value under the scene corresponding to the next fault is updated. With the method, as the number of times of convergence increases, the corresponding convergence judgment can be trained more accurately.
According to the alarm convergence method provided by the embodiment of the application, the alarm information on the OTN system is not acquired or directly shielded, and the alarm information of the OTN system in the whole amount is not directly calculated at the beginning of the process; the method comprises the steps of carrying out formatting pretreatment on network equipment alarms collected by a main alarm system, and then calculating the probability to quickly converge the alarms of the network equipment and give a convergence conclusion; after convergence is completed, the difference between the actual fault and the probability convergence result can be compared, and the corresponding probability condition is updated to be used as a new basis for next convergence.
Referring to fig. 6 and 7, an embodiment of the present application may provide an alarm convergence method:
step 602: acquiring a network equipment alarm;
the main alarm system acquires alarm information of the network equipment through SNMP;
wherein, the alarm information of the down type of the physical port can be regarded as the alarm information which is valuable for deducing the fault reason of the OTN equipment; in the embodiment of the application, the alarm information of the OTN system is not involved, and only the alarm information of the network device is analyzed, so that the alarm information of all the OTN systems can be converged first (that is, the alarm information of all the OTN systems does not participate in the subsequent calculation), and only the alarm information of the network device is left.
Step 604: screening alarm physical down types;
the master alarm system screens alarm information of a physical port down type;
the total alarm system can perform preliminary screening on the alarm information of the network equipment, and can perform screening through keywords such as DOWN, DOWN and the like in a network alarm log, and only leave the alarm information of the type of the DOWN physical port in the alarm information of the network equipment; if the type of network equipment alarm is not satisfied, the next processing is not carried out.
Step 606: formatting an alarm extraction device and a port;
the master alarm system carries out formatting treatment on the alarm information of the down type of the physical port so as to extract the name of the network equipment and the name of the physical port;
wherein, the network device name and the physical port name can be extracted from the alarm information of the physical port down type; because the formats of the alarm SNMP logs of various manufacturers are different, but the alarm information of the down type of the physical port comprises the name of the network equipment to be alarmed and the specific physical port information of the down type, the name of the network equipment and the name of the physical port can be extracted by combining a regular expression and a keyword, and the name of the network equipment can be customized by a user, so the user can extract the name of the network equipment through the regular expression according to the rule; although names of different manufacturers are different, the physical ports in the network alarm information may be matched by requiring that keywords such as "Ethernet", "Gigabit Ethernet", "Fast Ethernet", and the like are included, and keywords such as "port-channel", "vlan (virtual local area network)" are not included.
In the embodiment of the present application, the following 3 specific pieces of alarm information of the network device may be taken as an example for description:
the first piece of alarm information may be: oct 2920: 12: 5010.126.231.782020 SZ-NET-B%% 10IFNET/5/LINK _ UPDOWN: Line protocol state on the interface force-gigabit Ethernet2/0/1changed to down.
The second warning message may be: SZ-NET-D:2020Oct 2912: 17:53.90 CST% ETHPORT-5-IF _ DOWN _ LINK _ FAILURE: Interface Ethernet1/17is DOWN (Link FAILURE) Timestamp: 2020-10-2912: 17:53.
The third piece of alarm information may be: SZ-NET-D:2020Oct 2912: 17:59.902 CST:% ETH _ PORT _ CHANNEL-5-PORT _ UP: PORT-CHANNEL1301: Ethernet1/18is UP.
Because the network equipment name is self-defined by a user, the network equipment name can be obtained through a specific regular expression as 'SZ-NET-B', and a corresponding physical port can be matched as 'Forty-Gigabit Ethernet 2/0/1' according to a keyword, so that the first piece of alarm information meets the requirement. Similarly, the name of the network device "SZ-NET-D" and the name of the physical port "Ethernet 1/17" can be extracted from the second alarm message; the third warning message contains the "Ethernet", but also contains the "port-channel", and does not contain the keywords such as "DOWN", etc., which does not meet the requirement, and the third warning message can be discarded.
Step 608: matching the pre-input information to obtain the latest probability;
step 610: whether probability information can be matched; if yes, go to step 612; if not, go to step 614;
step 612: placing the alarm into a cache pool;
step 614: discard the alert;
the master alarm system matches the network equipment name and the physical port name with the pre-entered hierarchy information of the OTN equipment, and obtains the latest probability that the object of the hierarchy information of the OTN equipment corresponding to the network equipment name and the physical port name is abnormal;
if the network device name and the physical port name can be matched with the hierarchy information of a specific OTN device, acquiring the OTN device information of three layers of OPU, OCH and OTS of the hierarchy information and probability information corresponding to the hierarchy, and putting the alarm information corresponding to the network device name and the physical port name into a cache pool; and if the network equipment name and the physical port name are not matched with the hierarchy information of any OTN equipment, discarding the alarm information corresponding to the network equipment name and the physical port name, and carrying a keyword of 'not relating to an OTN system' in the alarm information.
Step 616: whether a new increase exists within 15 seconds; if yes, go to step 608; if not, go to step 618;
step 618: counting the number of alarms;
the total alarm system judges whether new alarm information of the network equipment which can be matched with the hierarchy information of the OTN equipment is generated within a specific time length;
if no new alarm information of the network device capable of matching the hierarchy information of the OTN device is generated within the time length, counting the number of the alarm information of all the matched network devices acquired within the time length, and ending the alarm formatting process.
The format of the hierarchy information of the pre-entered OTN device is shown in table 1.
Referring to table 1, each network device name and physical port name may form a key index of hierarchy information of the pre-entered OTN device, and according to the key index, an OPU layer of the OTN carrying transmission of the network device may be associated, where the OPU layer includes optical transmission device name information and specific port information, a port rate of the OPU layer, and a probability of possible abnormality, where the optical transmission device name information may be the client-side device in table 1, and the port information may be the client-side port in table 1; then, associating information of the OCH layer corresponding to the OPU, including the channel frequency of the channel, the starting station and the ending station, the channel speed corresponding to the channel, and the probability of possible abnormality; and finally, the operator optical fiber information associated to the OTS layer corresponding to the OCH includes the optical fiber number of the operator, the total bandwidth information of the service currently carried by the OTS, namely, the carried rate, and the probability of corresponding possible abnormality.
In an embodiment, the total alarm system may determine an initial abnormal probability of the hierarchy information of each OTN device according to a ratio of an OCH channel rate to an OPU port rate in the hierarchy information of each OTN device;
the hierarchy information of the OTN equipment is pre-recorded in the master alarm system according to the actually used OTN system of the network equipment, and is used for inquiring the hierarchy information of the corresponding OTN equipment according to the name of the network equipment and the name of the physical port. The abnormal probability corresponding to the hierarchy of each OTN device has an initialized value, namely the initial abnormal probability of the hierarchy information of the OTN device; this value needs to be determined with reference to the ratio of the OCH channel rate and the OPU port rate, where the ratio is 1: 1, the initial abnormal probability of the OCH layer may be required to be the maximum, for example, may be 40%; the initial anomaly probability of the OPU layer is small, for example, may be 30%; the initial anomaly probability of the OTS layer is minimal, and may be 30%, for example; when the ratio is greater than 1: 1, the initial abnormal probability of the OPU layer may be required to be the largest, for example, may be 80% or 60%, the initial abnormal probability of the OCH layer is smaller, for example, may be 15% or 30%, and the initial abnormal probability of the OTS layer is the smallest, for example, may be 5% or 10%, in which case, the initial abnormal probability of the OPU layer and the ratio of the "channel rate of OCH" to the "port rate of OPU" are changed in a "logarithmic curve"; assuming that the ratio is x and the initial anomaly probability of the OPU layer is y, the initial anomaly probability of the OPU layer may be determined according to formula (1):
y=0.20ln(x)+0.31 (1);
referring to fig. 2, in the case that the ratio is 10, the initial anomaly probability of the OPU layer may be 80%; in the case where the ratio is 20, the initial anomaly probability of the OPU layer may be 90%.
Through the alarm formatting from the above step 602 to step 618, the specific alarm information of the network device carried by the OTN system may be obtained, and the level information and the number of the OTN devices on the OTN system associated with the alarm information and the probability that the level information of the OTN device corresponding to the alarm information of each network device is abnormal are determined.
After determining the initial abnormal probability of the hierarchy information of each OTN device, referring to fig. 7, the alarm convergence method further includes calculating a probability convergence alarm in the following steps 620 to 632:
step 620: counting the number of alarms;
step 622: matching alarm coefficients of all layers;
the total alarm system obtains the number, and the total alarm system can determine the weight of each layer level information in each OTN device according to the number and the maximum value of the ratio;
the number may also be understood as the number of alarm information of the network device that matches the hierarchical information of the OTN device, and the total alarm system may correspondingly search the weight of each layer of each corresponding OTN device in the number according to the number of alarm information that meets the requirement and has an abnormal probability counted in a specific time.
In one embodiment, the weights of different rules may be selected according to the maximum value of the ratio of the OCH channel rate to the OPU port rate in the OTN system; in the same OTN system, if there are different ratios, such as 4: 1 and 10: 1, then the maximum 10: 1is "the ratio of the channel rate of OCH to the port rate of OPU", that is, 10 is used as the reference for judging the weight.
It should be understood that OPU, OCH, and OTS are three hierarchical relationships, wherein the OPU layer is at the bottom layer of the three hierarchical levels, and the number of corresponding hierarchical information is the largest; the OCH layer is arranged in the middle layer of the three levels, and the number of corresponding level information is in the middle; the OTS layer is located at an upper layer of the three levels, and the corresponding level information is fewer in number. Therefore, when the weight is judged, the weight can be confirmed by comparing the total alarm quantity of the network equipment which is matched with the level information of the OTN equipment according to the requirement with the ratio of the OCH channel rate and the OPU port rate, so that the judgment accuracy is improved. Specifically, referring to table 2, in the case that the number is equal to or less than the ratio, the weights of the OPU layer, the OCH layer, and the OTS layer may be determined to be 1.2, 0.8, and 0.4, respectively; in the case that the number is greater than the ratio and is less than or equal to twice the ratio, the weights of the OPU layer, the OCH layer, and the OTS layer may be determined to be 0.4, 1.2, and 0.8, respectively; in the case where the number is more than twice the ratio, the weights of the OPU layer, the OCH layer, and the OTS layer may be determined to be 0.4, 0.8, and 1.2, respectively.
Step 624: calculating probability and alarm convergence;
step 626: outputting a fault reason;
the total alarm system calculates the probability of the OTN system and determines the fault reason of the OTN equipment according to the probability; the master alarm system can output the fault reason to prompt the network management of the OTN equipment to remove the core and check the actual fault.
Weighting and calculating the collected alarm information of all the network devices by using the abnormal probability of the object in the level information of each OTN device and the weights of the OPU layer, the OCH layer and the OTS layer where the object is located; the final probability of the object is determined by accumulation. The final probabilities of the objects can be sequenced to obtain a sequencing result, specific object information in the hierarchy information of the target layer and the target layer of the OTN equipment corresponding to the two maximum final probabilities can be output, and the fault reason of the OTN equipment is determined according to the hierarchy and the object information; and finally, deleting the alarm information of the network equipment to finish the whole alarm convergence.
Step 628: checking whether the result meets the actual condition; if yes, go to step 632; if not, go to step 630.
Wherein the checking result can be considered as an actual fault.
Step 630: updating the abnormal probability of each matched layer;
step 632: the flow ends.
The main alarm system compares the actual fault with the predicted fault reason; and adjusting the abnormal probability of the object in the level information of each OTN device according to the comparison result.
In one embodiment, after the whole alarm convergence is completed, the actual fault scene can be checked by means of an OTN network management system of the OTN system; if the OTN hierarchy where the actual fault is located is matched with the hierarchy of the OTN equipment corresponding to the object with the maximum final probability in the corresponding hierarchy information, the abnormal probability of the object with the abnormality in the hierarchy information of each OTN equipment is kept unchanged, and the process is ended; if the OTN hierarchy where the actual fault is located matches the hierarchy of the OTN device corresponding to the object having the second highest final probability, the probability of the object having the abnormality in the hierarchy information of the target layer of the OTN device corresponding to the second highest final probability may be increased by 10%, and the probability of the object having the abnormality in the hierarchy information of other layers may be decreased by 10% in total, for example, the probability of the object having the abnormality in the hierarchy information of each layer in other layers may be decreased by 5% respectively. If the OTN hierarchy where the actual fault is located and the hierarchies of the OTN devices corresponding to the objects in the corresponding hierarchy information and the objects with the first and second highest final probabilities are not matched, the abnormal probability of the object occurring abnormality in the hierarchy information of the layer which should be correct actually may be increased by 20%, and the probabilities of the other layers may be decreased by 20%, for example, the abnormal probability of the object occurring abnormality in the hierarchy information of each layer in the other hierarchies may be decreased by 10%, respectively, the actual correct hierarchy is the hierarchy of the OTN device corresponding to the object with the smallest final probability, and the process ends.
The method using probability iteration in the embodiment of the application is mainly used for solving the problem that when a large amount of alarm information of network equipment in a network environment and alarm information of OTN equipment are concurrent, part of alarm information is lost, so that the method is more suitable for the network alarm environment in the field through training by using the probability iteration method.
The embodiment of the present application further provides a method for determining a cause of a fault of an OTN device, where the method includes:
step S802: under the condition of a certain OTN system failure, the total alarm system receives the pushing of the alarm information of the network equipment;
wherein, the alarm information of the network device may be the following 8 pieces of alarm information:
the first piece of alarm information: SZ-NET-A:2020Oct 2920: 12:48 CST:% ETHPORT-5-IF _ DOWN _ INITIALIZING: Interface Ethernet1/1is DOWN (Initializing).
And the second alarm message: SZ-NET-A:2020Oct 2920: 12:49 CST:% ETHPORT-5-IF _ DOWN _ INITIALIZING: Interface Ethernet1/2is DOWN (Link failure).
The third piece of alarm information: SZ-NET-A:2020Oct 2920: 12:49 CST:% ETHPORT-5-IF _ DOWN _ INITIALIZING: Interface Port-channel 1is DOWN (Link failure).
The fourth warning message: SZ-NET-A:2020Oct 2920: 12:49 CST:% ETHPORT-5-IF _ DOWN _ INITIALIZING: Interface Ethernet1/3is DOWN (Link failure).
And fifth alarm information: oct 2920: 12: 5010.126.231.782020 SZ-NET-B%% 10IFNET/5/LINK _ UPDOWN: Line protocol state on the interface force-Gigabit Ethernet2/0/1changed to down.
The sixth piece of alarm information: oct 2920: 12: 5010.126.231.782020 SZ-NET-B%% 10IFNET/5/LINK _ UPDOWN: Line protocol state on the interface force-Gigabit Ethernet2/0/2changed to down.
The seventh piece of alarm information: SZ-NET-C:2020Oct 2920: 12:51 CST% ETHPORT-5-IF _ DOWN _ INITIALIZING: Interface Ethernet3/1is DOWN (Initializing).
The eighth alarm message: SZ-NET-C:2020Oct 2920: 12:59 CST:% ETHPORT-5-IF _ DOWN _ INITIALIZING: Interface Ethernet6/2is DOWN (Initializing).
Step S804: performing formatting pretreatment on the alarm information of the network equipment to obtain a network equipment name and a physical port name corresponding to the alarm information;
the alarm information of the network equipment can be screened, and the third alarm information is not a specific physical port and is screened out; the eighth alarm information is screened because the corresponding probability cannot be found in the total alarm system; the combination of the network device name and the keyword of the physical port name in a single piece of alarm information can be extracted through a regular expression and the keyword, and the data formatted in the following table 4, that is, the network device name and the physical port name corresponding to the alarm information, is obtained:
TABLE 4
Figure BDA0002880073830000281
Step S806: in the total alarm system, determining the hierarchy information of the OTN equipment matched with the network equipment name and the physical port name, and determining the abnormal probability of the object in the hierarchy information of the OTN equipment;
the table 3is a mapping table, where the mapping table includes the network device name, the physical port name, and the hierarchy information of the OTN device; for convenience of presentation, table 3 hides part of information in the mapping table, and highlights the abnormal probability of the object in the hierarchical information.
TABLE 3
Figure BDA0002880073830000291
Figure BDA0002880073830000301
Step S808: the total alarm system determines the weight of each layer level information in each OTN device according to the number of the layer level information in the OTN device matched with the network device name and the physical port name and the ratio of the OCH channel rate to the OPU port rate;
calculating the number of hierarchical information in the OTN equipment matched with the network equipment name and the physical port name to be 6, wherein in the OTN system with the same OTS, the maximum ratio of the OCH channel rate to the OPU port rate is 4; if the number of alarm messages of the network device meeting the requirement is 6, then since 6 is between one-time ratio 4 and two-time ratio 8, the weight combination selected according to the requirement may be that the weight of the OPU layer is 0.4, the weight of the OCH layer is 1.2, and the weight of the OTS layer is 0.8.
Step S810, the total alarm system calculates the probability of the OTN system;
wherein, the abnormal probability of the abnormality of the 'Fiber-AA-BB-01' of the OTS layer can be as follows: 30% by 0.8+ 10% by 0.8+ 30% by 0.8-1.12.
The abnormal probability of "192.1 THz", "192.2 THz", "192.3 THz", "192.5 THz" of the OCH layer being abnormal may be: 40%. 1.2 ═ 0.48; the anomaly probability of "192.4 THz" being anomalous may be: 30%. 1.2+ 30%. 1.2 ═ 0.72
Since the objects in the hierarchy information of the OPU layer only appear once, there is no accumulation, and thus the maximum value may be: 60%. 0.4 ═ 0.24.
Step S812: the total alarm system determines the fault reason of the OTN equipment according to the probability;
wherein, because the probability of the second alarm information and the eighth alarm information is the maximum, the first, the third, the fourth, the fifth and the seventh alarm information can be deleted, and the output 'the Fiber-AA-BB-01 abnormal probability of the OTS layer is the maximum', and the output 'the 192.4THz abnormal probability of the OCH layer is the second maximum'.
Step S814: the main alarm system compares the actual fault with the determined fault reason of the OTN equipment, and adjusts the abnormal probability of the object in the level information of each OTN equipment according to the comparison result;
the total alarm system can check the actual fault condition of the OTN system by means of the OTN network management system of the OTN system, and confirms that the main light interruption of an operator is confirmed, namely the optical Fiber interruption with the operator optical Fiber number of 'Fiber-AA-BB-01', so that the alarm convergence is accurate and consistent with the actual condition, and the probability of correspondingly updating an object in the level information of the OTN equipment in a mapping table of the network equipment is not needed; and if the actual fault is inconsistent with the fault reason, correspondingly updating the probability of the object in the hierarchy information of the OTN equipment in the mapping table of the network equipment.
The embodiment of the present application further provides a method for determining a cause of a failure of an OTN device, where the method includes:
step S902: acquiring at least one piece of original alarm information from at least one piece of network equipment;
step S904: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
step S906: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
step S908: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
wherein the matching result comprises matching and mismatching between the key device parameter and the hierarchy information of the OTN device; the OTN equipment comprises an OPU layer, an OCH layer and an OTS layer; each layer level information in the OTN device may include: the OPU information comprises name information of the optical transmission equipment, port information and port rate of an OPU layer; OCH information including channel frequency, channel rate, start site and stop site of a channel; and the OTS information comprises the optical fiber information of the operator and the total bandwidth information of the service carried by the OTS layer.
Step S910: acquiring the hierarchy information of the matched OTN equipment under the condition that the matching result shows that the key equipment parameters are matched with the hierarchy information of the OTN equipment;
table 5 is a part of the mapping table, and table 5 includes the level information of the OTN device matching the key device parameter.
TABLE 5
Figure BDA0002880073830000321
Figure BDA0002880073830000331
Step S912: determining an adaptation result between the level information of the matched OTN equipment and a preset record table of the OTN equipment, wherein the record table is used for recording the connection relationship between objects in each level in the OTN equipment;
the record table may include a first hierarchical record table (a hierarchical record table of the OTS layer and the OCH layer), and a second hierarchical record table (a hierarchical record table of the OCH layer and the OPU layer), where the first hierarchical record table may record a connection relationship between an object in the OTS layer and an object in the OCH layer, and the second hierarchical record table may record a connection relationship between an object in the OCH layer and an object in the OPU layer.
Table 6 is a hierarchical recording table of the OTS layer and the OCH layer provided in this embodiment, and table 7is a hierarchical recording table of the OCH layer and the OPU layer provided in this embodiment; it should be noted that only part of the information in the hierarchical record table is listed in table 6 and table 7.
TABLE 6
Channel name AA-BB-192.1 XX-YY-196.0 CC-DD-192.2
Bandwidth (Gb/s) 10 10 100
Endpoint A-device SZ-DWDW-AA SZ-DWDW-XX SZ-DWDW-CC
Endpoint A-device port L-1/1 L-1/2 L-2/1
Endpoint Z-device SZ-DWDW-BB SZ-DWDW-YY SZ-DWDW-DD
Endpoint Z-device port L-1/1 L-1/2 L-2/1
Channel frequency 192.1 196.0 192.2
Operator fiber numbering Fiber-AA-BB-01 Fiber-XX-YY-01 Fiber-CC-DD-01
TABLE 7
Figure BDA0002880073830000332
Figure BDA0002880073830000341
Step S914: and determining the fault reason of the OTN equipment based on the adaptation result.
In the embodiment of the application, the fault reason of the OTN equipment is determined according to the adaptation result between the level information of the OTN equipment matched with the key equipment parameters in the mapping table and the preset record table of the OTN equipment, so that the fault reason of the OTN equipment can be determined more conveniently.
The embodiment of the present application further provides a method for determining a cause of a failure of an OTN device, where the method includes:
step S1002: acquiring at least one piece of original alarm information from at least one piece of network equipment;
step S1004: preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information; the at least one piece of alarm information is alarm information of network equipment connected with the OTN equipment;
step S1006: extracting key equipment parameters from the at least one piece of alarm information, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of the corresponding physical port;
step S1008: determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
the matching result comprises matching and mismatching between the key equipment parameter and the hierarchy information of the OTN equipment;
the OTN equipment comprises an OPU layer, an OCH layer and an OTS layer; each layer level information in the OTN equipment comprises: the OPU information comprises name information of the optical transmission equipment, port information and port rate of an OPU layer; OCH information including channel frequency, channel rate, start site and stop site of a channel; and the OTS information comprises the optical fiber information of the operator and the total bandwidth information of the service carried by the OTS layer.
Step S1010: acquiring the hierarchy information of the matched OTN equipment under the condition that the matching result shows that the key equipment parameters are matched with the hierarchy information of the OTN equipment;
step S1012: acquiring a target object in the OTS information from the hierarchy information of the matched OTN equipment;
referring to Table 5, the target object in the OTS information may be the operator Fiber number "Fiber-AA-BB-01".
Step S1014: determining a first adaptation result between the OTS information and an object of an OCH layer in a record table of the OTN equipment based on the target object of the OTS information;
the first adaptation result includes adaptation and non-adaptation between the target object of the OTS information and the object of the OCH layer in a first level record table (the OTS layer and the level record table of the OCH layer), and a first number of pieces of level information of the OCH layer corresponding to the target object of the OTS information and contents in each piece of level information in the mapping table may be determined first, and then a second number of pieces of level information of the OCH layer corresponding to the target object of the OTS information and contents in each piece of level information in the first level record table may be determined.
If the contents in the first number and each piece of hierarchical information completely conform to the contents in the second number and each piece of hierarchical information, the target object of the OTS information is considered to be matched with the object of the OCH layer in the first hierarchical record table; if the first number is not equal to the second number, or although the first number is equal to the second number, the content in each piece of hierarchical information is not completely consistent, the target object of the OTS information is not adapted to the object of the OCH layer in the first hierarchical record table; it should be noted that, because the content in table 5 is a subset of the content in table 6, the same level information of the OCH layer can be always found in table 6 for any level information of the OCH layer in table 5, and further, it can be determined whether the first number and the second number are equal to determine that the target object of the OTS information is adapted to the object of the OCH layer in the first level record table.
Referring to table 5, there are 6 total hierarchy information of the OCH layers corresponding to the operator optical Fiber number "Fiber-AA-BB-01", where two are completely the same, that is, the channel frequencies in the hierarchy information of the two OCH layers are both "192.4 THz", the starting site is both "AA", the terminating site is both "BB", and the channel rates are both "40 Gb/s", so that it can be determined that there are 5 total different hierarchy information of the OCH layers corresponding to the object operator optical Fiber number "Fiber-AA-BB-01" in the hierarchy information of the OTS layer in table 5.
Referring to table 6, the objects of OCH in the first hierarchical record table (the hierarchical record table of the OTS layer and the OCH layer) may include "channel name", "bandwidth", "end a-device port", "end Z-device port", and "channel frequency", then the objects of the OCH layer corresponding to the operator Fiber number "Fiber-AA-BB-01" can be determined as the channel name "AA-BB-192.1", the bandwidth "10 Gb/s", the end point A-device "SZ-DWDWDWDW-AA", the end point A-device port "L-1/1", the end point Z-device "SZ-DWDWDWDWDWDWDW-BB", the end point Z-device port "L-1/1" and the channel frequency "192.1".
In table 6, if the hierarchy level difference information of the OCH layer corresponding to the operator Fiber number "Fiber-AA-BB-01" is equal to 5, it may be considered that the first adaptation result is that the target object of the OTS layer is adapted to the object of the OCH layer; if the level different level information of the OCH layer corresponding to the operator optical Fiber number 'Fiber-AA-BB-01' is more than 5, the first adaptation result can be regarded as that the target object of the OTS layer is not adapted to the object of the OCH layer; it should be noted that, because the content in table 5 is a subset of the content in table 6, the number of different hierarchy levels of the OCH layer corresponding to the operator Fiber number "Fiber-AA-BB-01" in table 6 is at least 5, and the information in the hierarchy record table of the OTS layer and the OCH layer is not fully listed in table 6.
Step S1016: determining a fault reason of the OTN equipment based on the target object of the OTS information under the condition that the first adaptation result shows that the target object of the OTS layer is adapted to the object of the OCH layer;
wherein, when the target object of the OTS layer is adapted to the object of the OCH layer, it is indicated that it is likely that the OTS layer of the OTN device has a fault, and a fault cause of the OTN device may be determined according to the target object of the OTS information.
Step S1018: under the condition that the first adaptation result shows that the target object of the OTS layer is not adapted to the object of the OCH layer, acquiring the target object in the OCH information from the level information of the matched OTN equipment;
in this case, when the target object of the OTS layer is not adapted to the object of the OCH layer, it is indicated that it is very likely that the OCH layer or the OPU layer of the OTN device fails instead of the OTS layer of the OTN device, and therefore, the target object in the OCH information may be further obtained from the hierarchy information of the matched OTN device.
Step S1020: determining a second adaptation result between the object of the OPU layer in the record table of the OTN equipment and the object of the OPU layer based on the target object in the OCH information;
the second adaptation result includes adaptation and non-adaptation between the target object of the OCH information and an object of an OPU layer in a second hierarchical recording table, and the third number of pieces of hierarchical information of the OPU layer corresponding to the target object of the OCH information and the content in each piece of hierarchical information in the mapping table may be determined first, and then the fourth number of pieces of hierarchical information of the OPU layer corresponding to the target object of the OCH information and the content in each piece of hierarchical information in the second hierarchical recording table may be determined.
If the contents in the third number and each piece of hierarchical information completely conform to the contents in the fourth number and each piece of hierarchical information, considering that the target object of the OCH information is matched with the object of the OPU layer in the second hierarchical record table; if the third number is not equal to the fourth number, the target object of the OCH information is not adapted to the object of the OPU layer in the second-level recording table; it should be noted that, because the contents in table 5 are a subset of the contents in table 7, the same hierarchy information of the OPU layer can always be found in table 7 for the hierarchy information of any OPU layer in table 5, and further, only whether the third number and the fourth number are equal can be determined to determine that the target object of the OCH information is adapted to the object of the OPU layer in the second hierarchical recording table.
Step S1022: determining a fault reason of the OTN equipment based on the target object in the OCH information under the condition that the second adaptation result shows that the target object in the OCH information is adapted to the object of the OPU layer;
when the target object of the OCH layer is adapted to the object of the OPU layer, it is indicated that a fault is likely to occur in the OCH layer of the OTN device, and a fault cause of the OTN device may be determined according to the target object of the OCH information.
Step S1024: under the condition that the second adaptation result shows that the target object in the OCH information is not adapted to the object of the OPU layer, acquiring the target object in the OPU information from the level information of the matched OTN equipment; and determining the fault reason of the OTN equipment based on the target object in the OPU information.
When the target object of the OCH layer is not adapted to the object of the OPU layer, it is indicated that it is highly likely that the OPU layer of the OTN device has a fault, and the fault reason of the OTN device can be determined according to the target object of the OPU information.
In the embodiment of the application, the OTN hierarchy which is possibly caused by the fault reason of the OTN equipment is determined according to the adaptation result between the hierarchy information of the OTN equipment matched with the key equipment parameters in the mapping table and the preset record table of the OTN equipment, and the fault reason of the OTN equipment is determined according to the target object of the OTN layer, so that the fault reason of the OTN equipment can be determined more conveniently and accurately.
The embodiment of the present application further provides a method for determining a cause of a failure of an OTN device, where the method includes:
step S1102: the main alarm system acquires alarm information of the network equipment;
step S1104: the master alarm system screens alarm information of a physical port down type;
step S1106: the master alarm system carries out formatting treatment on the alarm information of the down type of the physical port so as to extract the name of the network equipment and the name of the physical port;
step S1108: the master alarm system matches the network equipment name and the physical port name with the hierarchy information of the OTN equipment which is pre-input, and obtains an object of the hierarchy information of the OTN equipment corresponding to the network equipment name and the physical port name;
when the performance of the master alarm system is strong enough, and the communication path between the network device and the master alarm system is stable enough, the probability options corresponding to the network device name and the physical port name in the hierarchical information of the OTN device can be deleted without losing the alarm information of all the network devices.
Step S1110: acquiring a hierarchical recording table of the OTS layer and the OCH layer;
step S1112: acquiring a hierarchical recording table of the OCH layer and the OPU layer;
in the alarm convergence process, the process of formatting the alarm information is unchanged, the alarm convergence process is that the alarm information is matched with the level information of the OTN equipment, then the level information of the OCH layer is traversed according to a target object in the OTS information of the OTS layer, and if the alarm information of all the network equipment is completely matched with the 'level record table of the OTS layer and the OCH layer', the information of the object in the level information of the OTS layer can be output to converge all the alarm information.
And if the OTS layer does not completely contain the hierarchy information of all OCH layers in the hierarchy record table of the OTS layer and the OCH layers, the object corresponding to the OTS layer is considered to have no abnormality. Traversing the object in the hierarchy information of each OCH layer according to a hierarchy record table of the OCH layers and the OPU layer until all the network equipment alarm information is contained; and outputting the hierarchical information and the object of the abnormal OTN equipment, and converging the alarm information of all the network equipment from the master alarm system.
Based on the foregoing embodiments, the present application provides an apparatus for determining a cause of a failure of an OTN device, where the apparatus includes each included unit and each module included in each unit, and may be implemented by a processor in a computer device; of course, the implementation can also be realized through a specific logic circuit; in implementation, the processor may be a Central Processing Unit (CPU), a Microprocessor (MPU), a Digital Signal Processor (DSP), a Field Programmable Gate Array (FPGA), or the like.
Fig. 8is a schematic structural diagram of a device for determining a cause of an OTN device failure according to an embodiment of the present application, and as shown in fig. 8, the device 800 includes an obtaining module 801, an extracting module 802, a matching module 803, and a determining module 804, where:
an obtaining module 801, configured to obtain alarm information to be processed; the alarm information to be processed is alarm information of network equipment connected with the OTN equipment; an extracting module 802, configured to extract a key device parameter from the alarm information to be processed, where the key device parameter is used to indicate an identifier of the network device and an identifier of a corresponding physical port; a matching module 803, configured to determine, based on the key device parameter, a matching result with the hierarchy information of the OTN device in the mapping table; a determining module 804, configured to determine a failure cause of the OTN device based on the matching result.
In one embodiment, the obtaining module 801 includes: the acquisition submodule is used for acquiring at least one piece of original alarm information from at least one piece of network equipment; the preprocessing submodule is used for carrying out formatting preprocessing on each piece of the at least one piece of original alarm information to obtain the at least one piece of alarm information;
correspondingly, the determining module 804 is configured to determine that the failure cause of the OTN device is determined based on the matching result of the at least one piece of alarm information.
In one embodiment, the matching result includes a match and a mismatch between the key device parameter and the hierarchy information of the OTN device; the OTN equipment comprises an OPU layer, an OCH layer and an OTS layer; each layer level information in the OTN equipment comprises: the OPU information comprises name information of the optical transmission equipment, port information and port rate of an OPU layer; OCH information including channel frequency, channel rate, start site and stop site of a channel; and the OTS information comprises the optical fiber information of the operator and the total bandwidth information of the service carried by the OTS layer.
In one embodiment, the matching result further includes an abnormal probability that an object is abnormal in information of each hierarchy of the OTN device in case of matching;
the determining module includes: a first determining unit, configured to determine weights of information of each layer in the OTN device; and the second determining unit is used for determining the fault reason of the OTN equipment based on the weight of each level of information in the OTN equipment and the abnormal probability of the object in each level of information in the OTN equipment.
In one embodiment, the first determining unit includes: the first determining subunit is configured to determine, based on a matching result of each piece of alarm information, a number of matches between a key device parameter of each piece of alarm information and level information of the OTN device in the mapping table; a second determining subunit, configured to determine a first ratio between a rate of OCHs in the OTN device and an OPU rate; a third determining subunit, configured to determine a second ratio between the number of matches and the first ratio; and a fourth determining subunit, configured to determine, based on the second ratio, a weight of each level of information in the OTN device.
In one embodiment, the second determination unit includes: a fourth determining subunit, configured to determine, for an object in each level of information, a final probability of the object by accumulation based on the number of times the object is matched, a weight of the level of information where the object is located, and an abnormal probability of the object being abnormal; the sequencing subunit is used for sequencing the final probabilities of all the objects to obtain a sequencing result; and the fifth determining subunit is configured to determine, based on the sorting result, a cause of the failure of the OTN device.
In an embodiment, the fifth determining subunit is configured to, based on the sorting result, screen out a target object that meets a preset condition; and determining the fault reason of the OTN equipment based on the target object and the hierarchy information of the layer where the target object is located.
In one embodiment, the OTN device comprises an OPU layer, an OCH layer and an OTS layer; each layer level information in the OTN equipment comprises: the OPU information comprises name information of the optical transmission equipment, port information, port rate of an OPU layer and abnormal probability of an object in the OPU layer; OCH information including channel frequency of a channel, an abnormal probability of an object in an originating site and a terminating site and an OPU layer; the OTS information includes the operator optical fiber information, the total bandwidth information of the service currently carried by the OTS layer, and the abnormal probability of the object in the OTS layer.
In one embodiment, the OPU information further includes an abnormal probability of an object in the OPU layer; the OCH information also comprises the abnormal probability of an object in the OPU layer; the OTS information also comprises the abnormal probability of the object in the OTS layer; the initial value y of the abnormal probability of the object in the OPU layer has a logarithmic function relationship with the first ratio x between the OCH rate and the OPU rate in the OTN equipment, wherein the y is a dependent variable, and the x is an independent variable.
In one embodiment, the apparatus further comprises: the output module is used for outputting the fault reason so as to prompt the network manager of the OTN equipment; a re-obtaining module, configured to obtain a result of the network manager of the OTN device feeding back the failure cause; and the updating module is used for maintaining or updating the abnormal probability of the object in each level of information in the mapping table based on the feedback result.
In some embodiments, the determining module 804 includes:
an obtaining unit, configured to obtain hierarchy information of the matched OTN device when the matching result indicates that the key device parameter is matched with the hierarchy information of the OTN device;
a third determining unit, configured to determine an adaptation result between the level information of the matched OTN device and a preset record table of the OTN device, where the record table is used to record a connection relationship between layers in the OTN device; and determining the fault reason of the OTN equipment based on the adaptation result.
In some embodiments, the third determining unit is configured to obtain a target object in the OTS information from the hierarchy information of the matched OTN device; determining a first adaptation result between the OTS information and an object of an OCH layer in a record table of the OTN equipment based on the target object of the OTS information; and determining a fault reason of the OTN equipment based on the target object of the OTS information under the condition that the first adaptation result shows that the target object of the OTS layer is adapted to the object of the OCH layer.
In some embodiments, the third determining unit is configured to, in a case that the first adaptation result indicates that the target object of the OTS layer is not adapted to the object of the OCH layer, obtain the target object in the OCH information from the hierarchy information of the matched OTN device; determining a second adaptation result between the object of the OPU layer in the record table of the OTN equipment and the object of the OPU layer based on the target object in the OCH information; determining a fault reason of the OTN equipment based on the target object in the OCH information under the condition that the second adaptation result shows that the target object in the OCH information is adapted to the object of the OPU layer; under the condition that the second adaptation result shows that the target object in the OCH information is not adapted to the object of the OPU layer, acquiring the target object in the OPU information from the level information of the matched OTN equipment; and determining the fault reason of the OTN equipment based on the target object in the OPU information.
The above description of the apparatus embodiments, similar to the above description of the method embodiments, has similar beneficial effects as the method embodiments. For technical details not disclosed in the embodiments of the apparatus of the present application, reference is made to the description of the embodiments of the method of the present application for understanding.
It should be noted that, in the embodiment of the present application, if the method for determining the cause of the OTN device failure is implemented in the form of a software functional module, and is sold or used as an independent product, the method may also be stored in a computer-readable storage medium. Based on such understanding, the technical solutions of the embodiments of the present application may be essentially implemented or a part contributing to the related art may be embodied in the form of a software product stored in a storage medium, and including a plurality of instructions for enabling a computer device (which may be a mobile phone, a tablet computer, a desktop computer, a personal digital assistant, a navigator, a digital phone, a video phone, a television, a sensing device, etc.) to execute all or part of the methods described in the embodiments of the present application. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read Only Memory (ROM), a magnetic disk, or an optical disk. Thus, embodiments of the present application are not limited to any specific combination of hardware and software.
Correspondingly, an embodiment of the present application provides a computer device, fig. 9 is a schematic diagram of a hardware entity of the computer device in the embodiment of the present application, and as shown in fig. 9, the hardware entity of the computer device 900 includes: the OTN equipment failure cause determination method comprises a memory 901 and a processor 902, wherein the memory 901 stores a computer program which can be run on the processor 902, and the processor 902 executes the program to implement the steps of the method for determining the OTN equipment failure cause according to the above embodiments.
The Memory 901 is configured to store instructions and applications executable by the processor 902, and may also buffer data (e.g., image data, audio data, voice communication data, and video communication data) to be processed or already processed by the processor 902 and modules in the computer apparatus 900, and may be implemented by a FLASH Memory (FLASH) or a Random Access Memory (RAM).
Correspondingly, the present application provides a computer readable storage medium, on which a computer program is stored, which when executed by a processor implements the steps in the method provided in the above embodiments.
Here, it should be noted that: the above description of the storage medium and device embodiments is similar to the description of the method embodiments above, with similar advantageous effects as the method embodiments. For technical details not disclosed in the embodiments of the storage medium and apparatus of the present application, reference is made to the description of the embodiments of the method of the present application for understanding.
It should be appreciated that reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present application. Thus, the appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. It should be understood that, in the various embodiments of the present application, the sequence numbers of the above-mentioned processes do not mean the execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present application. The above-mentioned serial numbers of the embodiments of the present application are merely for description and do not represent the merits of the embodiments.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The above-described device embodiments are merely illustrative, for example, the division of the unit is only a logical functional division, and there may be other division ways in actual implementation, such as: multiple units or components may be combined, or may be integrated into another system, or some features may be omitted, or not implemented. In addition, the coupling, direct coupling or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between the devices or units may be electrical, mechanical or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units; can be located in one place or distributed on a plurality of network units; some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment. In addition, all functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may be separately regarded as one unit, or two or more units may be integrated into one unit; the integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.
Those of ordinary skill in the art will understand that: all or part of the steps for realizing the method embodiments can be completed by hardware related to program instructions, the program can be stored in a computer readable storage medium, and the program executes the steps comprising the method embodiments when executed; and the aforementioned storage medium includes: various media that can store program codes, such as a removable Memory device, a Read Only Memory (ROM), a magnetic disk, or an optical disk. Alternatively, the integrated units described above in the present application may be stored in a computer-readable storage medium if they are implemented in the form of software functional modules and sold or used as independent products. Based on such understanding, the technical solutions of the embodiments of the present application may be essentially implemented or a part contributing to the related art may be embodied in the form of a software product stored in a storage medium, and including a plurality of instructions for enabling a computer device (which may be a mobile phone, a tablet computer, a desktop computer, a personal digital assistant, a navigator, a digital phone, a video phone, a television, a sensing device, etc.) to execute all or part of the methods described in the embodiments of the present application. And the aforementioned storage medium includes: a removable storage device, a ROM, a magnetic or optical disk, or other various media that can store program code.
The methods disclosed in the several method embodiments provided in the present application may be combined arbitrarily without conflict to obtain new method embodiments. Features disclosed in several of the product embodiments provided in the present application may be combined in any combination to yield new product embodiments without conflict. The features disclosed in the several method or apparatus embodiments provided in the present application may be combined arbitrarily, without conflict, to arrive at new method embodiments or apparatus embodiments.
The above description is only for the embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of changes or substitutions within the technical scope of the present application, and shall be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (14)

1. A method for determining the cause of OTN equipment failure is applied to a monitoring platform, and comprises the following steps:
acquiring alarm information to be processed; the alarm information to be processed is alarm information of network equipment connected with the OTN equipment;
extracting key equipment parameters from the alarm information to be processed, wherein the key equipment parameters are used for indicating the identification of the network equipment and the identification of a corresponding physical port;
determining a matching result between the key equipment parameter and the level information of the OTN equipment in a mapping table;
and determining the fault reason of the OTN equipment based on the matching result.
2. The method of claim 1, wherein the obtaining the alarm information to be processed comprises:
acquiring at least one piece of original alarm information from at least one piece of network equipment;
preprocessing formatting each piece of the at least one piece of original alarm information to obtain at least one piece of alarm information;
correspondingly, the determining the failure cause of the OTN device based on the matching result includes: and determining the fault reason of the OTN equipment based on the matching result of the at least one piece of alarm information.
3. The method according to claim 1, wherein the matching result comprises a match and a mismatch between the key device parameter and the hierarchy information of the OTN device;
the matching result also comprises the abnormal probability of the object in the information of each layer level of the OTN equipment being abnormal under the matching condition;
the determining the fault reason of the OTN device based on the matching result of each alarm information includes:
determining the weight of each level of information in the OTN equipment;
and determining the fault reason of the OTN equipment based on the weight of each level of information in the OTN equipment and the abnormal probability of the object in each level of information in the OTN equipment.
4. The method of claim 3, wherein the determining the weight of each level of information in the OTN device comprises:
determining the matching quantity between the key equipment parameter of each alarm information and the level information of the OTN equipment in the mapping table based on the matching result of each alarm information;
determining a first ratio between the OCH rate and the OPU rate in the OTN device;
determining a second ratio between the number of matches and the first ratio;
and determining the weight of each layer level information in the OTN equipment based on the second ratio.
5. The method according to claim 3, wherein the determining the failure cause of the OTN device based on the weight of each level information in the OTN device and the abnormal probability of the object in each level information of the OTN device appearing abnormally comprises:
for an object in each level information, determining the final probability of the object in an accumulation mode based on the number of times of matching the object, the weight of the level information where the object is located and the abnormal probability of the object;
sequencing the final probabilities of all the objects to obtain a sequencing result;
and determining the fault reason of the OTN equipment based on the sequencing result.
6. The method according to claim 5, wherein the determining the cause of the failure of the OTN device based on the sorting result comprises:
screening out target objects meeting preset conditions based on the sorting result;
and determining the fault reason of the OTN equipment based on the target object and the hierarchy information of the layer where the target object is located.
7. The method according to any one of claims 1 to 6,
the OTN equipment comprises an OPU layer, an OCH layer and an OTS layer;
each layer level information in the OTN equipment comprises:
the OPU information comprises name information of the optical transmission equipment, port information and port rate of an OPU layer;
OCH information including channel frequency, channel rate, start site and stop site of a channel;
and the OTS information comprises the optical fiber information of the operator and the total bandwidth information of the service carried by the OTS layer.
8. The method of claim 7, wherein the OPU information further comprises an anomaly probability of an object in an OPU layer;
the OCH information also comprises the abnormal probability of an object in the OPU layer;
the OTS information also comprises the abnormal probability of the object in the OTS layer;
the initial value y of the abnormal probability of the object in the OPU layer has a logarithmic function relationship with the first ratio x between the OCH rate and the OPU rate in the OTN equipment, wherein the y is a dependent variable, and the x is an independent variable.
9. The method of claim 7, further comprising:
outputting the fault reason to prompt a network manager of the OTN equipment;
obtaining a feedback result of the network management of the OTN equipment to the fault reason;
and maintaining or updating the abnormal probability of the object in each level of information in the mapping table based on the feedback result.
10. The method according to claim 7, wherein the determining the fault cause of the OTN device based on the matching result of each alarm message comprises:
acquiring the hierarchy information of the matched OTN equipment under the condition that the matching result shows that the key equipment parameters are matched with the hierarchy information of the OTN equipment;
determining an adaptation result between the level information of the matched OTN equipment and a preset record table of the OTN equipment, wherein the record table is used for recording the connection relationship between objects in each level in the OTN equipment;
and determining the fault reason of the OTN equipment based on the adaptation result.
11. The method according to claim 10, wherein the determining an adaptation result between the hierarchy information of the matched OTN device and a record table of a preset OTN device, and determining a failure cause of the OTN device based on the adaptation result comprises:
acquiring a target object in the OTS information from the hierarchy information of the matched OTN equipment;
determining a first adaptation result between the OTS information and an object of an OCH layer in a record table of the OTN equipment based on the target object of the OTS information;
determining a fault reason of the OTN equipment based on the target object of the OTS information under the condition that the first adaptation result shows that the target object of the OTS layer is adapted to the object of the OCH layer;
under the condition that the first adaptation result shows that the target object of the OTS layer is not adapted to the object of the OCH layer, acquiring the target object in the OCH information from the level information of the matched OTN equipment;
determining a second adaptation result between the object of the OPU layer in the record table of the OTN equipment and the object of the OPU layer based on the target object in the OCH information;
determining a fault reason of the OTN equipment based on the target object in the OCH information under the condition that the second adaptation result shows that the target object in the OCH information is adapted to the object of the OPU layer;
under the condition that the second adaptation result shows that the target object in the OCH information is not adapted to the object of the OPU layer, acquiring the target object in the OPU information from the level information of the matched OTN equipment; and determining the fault reason of the OTN equipment based on the target object in the OPU information.
12. An apparatus for determining a cause of a failure of an OTN device, the apparatus comprising:
the acquisition module is used for acquiring alarm information to be processed; the alarm information to be processed is alarm information of network equipment connected with the OTN equipment;
an extraction module, configured to extract a key device parameter from the alarm information to be processed, where the key device parameter is used to indicate an identifier of the network device and an identifier of a corresponding physical port;
the matching module is used for determining a matching result with the level information of the OTN equipment in the mapping table based on the key equipment parameters;
and the determining module is used for determining the fault reason of the OTN equipment based on the matching result.
13. A computer device comprising a memory and a processor, said memory storing a computer program operable on the processor, wherein the processor implements the steps of the method of determining a cause of a failure of an OTN device of any of claims 1 to 11 when executing said program.
14. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method for determining a cause of a failure of an OTN device according to any one of claims 1 to 11.
CN202011631074.7A 2020-12-31 2020-12-31 Method, device, equipment and storage medium for determining OTN equipment fault reason Pending CN112770197A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011631074.7A CN112770197A (en) 2020-12-31 2020-12-31 Method, device, equipment and storage medium for determining OTN equipment fault reason

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011631074.7A CN112770197A (en) 2020-12-31 2020-12-31 Method, device, equipment and storage medium for determining OTN equipment fault reason

Publications (1)

Publication Number Publication Date
CN112770197A true CN112770197A (en) 2021-05-07

Family

ID=75699573

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011631074.7A Pending CN112770197A (en) 2020-12-31 2020-12-31 Method, device, equipment and storage medium for determining OTN equipment fault reason

Country Status (1)

Country Link
CN (1) CN112770197A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113281616A (en) * 2021-05-28 2021-08-20 国网冀北电力有限公司承德供电公司 Power transmission line fault diagnosis method, system and framework based on expert system
CN113837672A (en) * 2021-11-26 2021-12-24 深圳普菲特信息科技股份有限公司 Equipment maintenance management method based on industrial Internet
CN114374899A (en) * 2021-12-30 2022-04-19 北京格林威尔科技发展有限公司 Method and device for positioning service fault of optical transport network
CN116488724A (en) * 2023-06-25 2023-07-25 成都实时技术股份有限公司 Optical fiber communication test method, medium and system using same

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011063834A1 (en) * 2009-11-25 2011-06-03 Telefonaktiebolaget Lm Ericsson (Publ) Optical transport network alarms
WO2015051638A1 (en) * 2013-10-08 2015-04-16 华为技术有限公司 Fault location method and device
US20160056914A1 (en) * 2014-08-25 2016-02-25 Ciena Corporation Otn adaptation for support of subrate granularity and flexibility and for distribution across multiple modem engines
CN106685682A (en) * 2015-11-10 2017-05-17 中国移动通信集团公司 Fault processing method, network device and system
CN107196804A (en) * 2017-06-01 2017-09-22 国网山东省电力公司信息通信公司 Power system terminal communication access network Centralized Alarm Monitoring system and method
WO2018028573A1 (en) * 2016-08-12 2018-02-15 中兴通讯股份有限公司 Method and device for fault handling, and controller
CN110149227A (en) * 2019-05-16 2019-08-20 平安科技(深圳)有限公司 The method and device of network alarm
CN110247725A (en) * 2019-06-24 2019-09-17 国网河北省电力有限公司经济技术研究院 The line fault investigation method, apparatus and terminal device of OTN network
CN111030857A (en) * 2019-12-06 2020-04-17 深圳前海微众银行股份有限公司 Network alarm method, device, system and computer readable storage medium
CN111176879A (en) * 2019-12-31 2020-05-19 中国建设银行股份有限公司 Fault repairing method and device for equipment
CN111585819A (en) * 2020-05-12 2020-08-25 广东电网有限责任公司电力调度控制中心 Distribution network communication equipment fault analysis method and system
CN112087334A (en) * 2020-09-09 2020-12-15 中移(杭州)信息技术有限公司 Alarm root cause analysis method, electronic device and storage medium

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011063834A1 (en) * 2009-11-25 2011-06-03 Telefonaktiebolaget Lm Ericsson (Publ) Optical transport network alarms
WO2015051638A1 (en) * 2013-10-08 2015-04-16 华为技术有限公司 Fault location method and device
US20160056914A1 (en) * 2014-08-25 2016-02-25 Ciena Corporation Otn adaptation for support of subrate granularity and flexibility and for distribution across multiple modem engines
CN106685682A (en) * 2015-11-10 2017-05-17 中国移动通信集团公司 Fault processing method, network device and system
WO2018028573A1 (en) * 2016-08-12 2018-02-15 中兴通讯股份有限公司 Method and device for fault handling, and controller
CN107196804A (en) * 2017-06-01 2017-09-22 国网山东省电力公司信息通信公司 Power system terminal communication access network Centralized Alarm Monitoring system and method
CN110149227A (en) * 2019-05-16 2019-08-20 平安科技(深圳)有限公司 The method and device of network alarm
CN110247725A (en) * 2019-06-24 2019-09-17 国网河北省电力有限公司经济技术研究院 The line fault investigation method, apparatus and terminal device of OTN network
CN111030857A (en) * 2019-12-06 2020-04-17 深圳前海微众银行股份有限公司 Network alarm method, device, system and computer readable storage medium
CN111176879A (en) * 2019-12-31 2020-05-19 中国建设银行股份有限公司 Fault repairing method and device for equipment
CN111585819A (en) * 2020-05-12 2020-08-25 广东电网有限责任公司电力调度控制中心 Distribution network communication equipment fault analysis method and system
CN112087334A (en) * 2020-09-09 2020-12-15 中移(杭州)信息技术有限公司 Alarm root cause analysis method, electronic device and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
林绵锋, 刘雪原, 方来付, 顾畹仪: "光传送网的故障管理", 光通信研究, no. 06, 30 December 2000 (2000-12-30) *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113281616A (en) * 2021-05-28 2021-08-20 国网冀北电力有限公司承德供电公司 Power transmission line fault diagnosis method, system and framework based on expert system
CN113837672A (en) * 2021-11-26 2021-12-24 深圳普菲特信息科技股份有限公司 Equipment maintenance management method based on industrial Internet
CN113837672B (en) * 2021-11-26 2022-03-15 深圳普菲特信息科技股份有限公司 Equipment maintenance management method based on industrial Internet
CN114374899A (en) * 2021-12-30 2022-04-19 北京格林威尔科技发展有限公司 Method and device for positioning service fault of optical transport network
CN114374899B (en) * 2021-12-30 2023-12-15 北京格林威尔科技发展有限公司 Method and device for positioning service faults of optical transmission network
CN116488724A (en) * 2023-06-25 2023-07-25 成都实时技术股份有限公司 Optical fiber communication test method, medium and system using same
CN116488724B (en) * 2023-06-25 2023-09-15 成都实时技术股份有限公司 Optical fiber communication test method, medium and system using same

Similar Documents

Publication Publication Date Title
CN112770197A (en) Method, device, equipment and storage medium for determining OTN equipment fault reason
CN100479385C (en) Integral maintaining method and system for multi-equipment
WO2020034900A1 (en) Method and apparatus for obtaining logical topology information of odn, device, and storage medium
US11316607B2 (en) Machine learning techniques for selecting paths in multi-vendor reconfigurable optical add/drop multiplexer networks
KR102418969B1 (en) System and method for predicting communication apparatuses failure based on deep learning
US7889989B2 (en) Method for implementing tandem concatenation monitoring automatically and apparatus thereof
WO2019184640A1 (en) Indicator determination method and related device thereto
CN103199919A (en) Multi-parameter-sensed high-accuracy network fault screening and positioning system and method
CN113326946A (en) Method, device and storage medium for updating application recognition model
US20170149646A1 (en) Path computation in multi-layer networks
CN114266294A (en) Training method of classification model, and fault analysis method and device of target link
CN110830539A (en) Network storage system and method
US20220200870A1 (en) Inter-terminal connection state prediction method and apparatus and analysis device
CN114186750A (en) Routing planning method and device for hierarchical requirements of power business of smart power grid
CN114866431A (en) Method and device for predicting SFC network fault based on INT and processor
JP6950576B2 (en) Network systems, topology management methods, and programs
CN115827363A (en) Resource alarm analysis method and device, electronic equipment and storage medium
CN114142923A (en) Optical cable fault positioning method, device, equipment and readable medium
US8059548B1 (en) Automatic displaying of alarms in a communications network
WO2023041992A1 (en) Systems and methods for performing root cause analysis
US7907549B2 (en) Modeling a ring network using a ring object
US20090162075A1 (en) Transmission system and transmission method
Wubete et al. Importance of optical metrics for IGP configuration change prediction
WO2021249629A1 (en) Device and method for monitoring communication networks
CN117176839B (en) Remote measurement message transmission method, device, communication equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination