CN112910669A - Fault intelligent processing method, device and system - Google Patents

Fault intelligent processing method, device and system Download PDF

Info

Publication number
CN112910669A
CN112910669A CN201911218121.2A CN201911218121A CN112910669A CN 112910669 A CN112910669 A CN 112910669A CN 201911218121 A CN201911218121 A CN 201911218121A CN 112910669 A CN112910669 A CN 112910669A
Authority
CN
China
Prior art keywords
fault
alarm
information
work order
fault alarm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911218121.2A
Other languages
Chinese (zh)
Other versions
CN112910669B (en
Inventor
何庆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongying Youchuang Information Technology Co Ltd
Original Assignee
Zhongying Youchuang Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongying Youchuang Information Technology Co Ltd filed Critical Zhongying Youchuang Information Technology Co Ltd
Priority to CN201911218121.2A priority Critical patent/CN112910669B/en
Publication of CN112910669A publication Critical patent/CN112910669A/en
Application granted granted Critical
Publication of CN112910669B publication Critical patent/CN112910669B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery

Abstract

The invention provides a fault intelligent processing method, a device and a system, wherein the method comprises the following steps: a professional network management system receives fault alarm information, performs alarm preprocessing on the fault alarm information, performs corresponding configuration issuing if the fault alarm information indicates a fault which can be recovered through configuration modification, generates a work order according to the fault alarm information if the fault alarm information indicates a fault which can not be recovered through configuration modification, and sends the work order to a service guarantee system; and the service guarantee system distributes the work orders to corresponding operation and maintenance personnel. According to the scheme, for the fault which can be recovered through configuration modification, the corresponding configuration issuing can be carried out through the professional network management system, the fault which can not be recovered through configuration modification can be processed without manually arriving at a site, a work order is generated for the fault which can not be recovered through configuration modification, the work order is sent to the service guarantee system to be distributed, the fault is processed according to the condition, and the operation and maintenance efficiency can be improved.

Description

Fault intelligent processing method, device and system
Technical Field
The invention relates to the technical field of fault processing, in particular to a fault intelligent processing method, device and system.
Background
The conventional fault handling method is as follows:
when equipment fails, the professional network manager generates an alarm, the alarm is forwarded to the comprehensive alarm system and the service guard, and the service guard sends a list to maintenance personnel; maintenance personnel can only arrive the on-site processing, and when a fault is processed, information such as a light path, a route and the like needs to be checked, and a plurality of systems need to be logged in during the process, so that the operation and maintenance efficiency is very complicated, and is low.
Disclosure of Invention
The embodiment of the invention provides a fault intelligent processing method, device and system, and solves the technical problems that in the prior art, maintenance personnel only can arrive at the site for processing, and the operation and maintenance efficiency is low.
The embodiment of the invention provides an intelligent fault processing method, which comprises the following steps:
a professional network management system receives fault alarm information, performs alarm preprocessing on the fault alarm information, performs corresponding configuration issuing if the fault alarm information indicates a fault which can be recovered through configuration modification, generates a work order according to the fault alarm information if the fault alarm information indicates a fault which can not be recovered through configuration modification, and sends the work order to a service guarantee system;
and the service guarantee system distributes the work orders to corresponding operation and maintenance personnel.
The embodiment of the invention also provides an intelligent fault processing method, which comprises the following steps:
receiving fault alarm information, performing alarm preprocessing on the fault alarm information, if the fault alarm information indicates a fault which can be recovered through configuration modification, issuing corresponding configuration, and if the fault alarm information indicates a fault which can not be recovered through configuration modification, generating a work order according to the fault alarm information, and sending the work order to a service guarantee system.
The embodiment of the invention also provides a fault intelligent processing system, which comprises: professional network management system and service guarantee system, wherein, professional network management system and the service guarantee system are used for: the intelligent fault processing method is realized.
The embodiment of the invention also provides a fault intelligent processing device, which comprises:
the fault warning information receiving module is used for receiving fault warning information;
the warning preprocessing module is used for carrying out warning preprocessing on the fault warning information;
the configuration issuing module is used for issuing corresponding configuration if the fault warning information indicates that the fault which can be recovered through configuration modification is detected;
and the work order generation issuing module is used for generating a work order according to the fault alarm information and sending the work order to a service guarantee system if the fault alarm information indicates that the fault which can not be recovered through configuration modification is present.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, and the computer readable storage medium stores a computer program for executing the method.
In the embodiment of the invention, the fault which can be recovered by configuration modification can be processed by corresponding configuration issuing through the professional network management system without manually reaching the site for processing, and the fault which can not be recovered by configuration modification generates the work order and sends the work order to the service guarantee system for dispatching, so that the fault is processed according to the condition, and the operation and maintenance efficiency can be improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flow chart (system angle) of a fault intelligent processing method provided by an embodiment of the present invention;
fig. 2 is a flow chart (system angle) of a fault intelligent processing method provided by the embodiment of the present invention (two);
fig. 3 is a flow chart (system angle) of a fault intelligent processing method provided by an embodiment of the present invention (three);
fig. 4 is a flowchart (system angle) of a fault intelligent processing method according to an embodiment of the present invention (four);
fig. 5 is a flowchart (system angle) of a fault intelligent processing method provided in an embodiment of the present invention (five);
fig. 6 is a flowchart (system angle) of a fault intelligent processing method according to an embodiment of the present invention (six);
fig. 7 is a schematic diagram of an alarm receipt according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In an embodiment of the present invention, an intelligent fault handling method is provided, as shown in fig. 1, the method includes:
step 101: a professional network management system receives fault alarm information, performs alarm preprocessing on the fault alarm information, performs corresponding configuration issuing if the fault alarm information indicates a fault which can be recovered through configuration modification, generates a work order according to the fault alarm information if the fault alarm information indicates a fault which can not be recovered through configuration modification, and sends the work order to a service guarantee system;
step 102: and the service guarantee system distributes the work orders to corresponding operation and maintenance personnel.
In the embodiment of the present invention, as shown in fig. 2, in step 101, the performing, by the private network management system, alarm preprocessing on the fault alarm information includes:
the professional network management system determines the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements according to the fault alarm information, and determines whether the fault can be recovered through configuration modification according to the fault alarm type; and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements.
In the embodiment of the invention, a plurality of faults are possibly triggered by the same reason, so that maintenance personnel can not clearly distinguish the main faults from the auxiliary faults and can only check the faults one by one, and the repeated workload is large during processing.
Based on this, as shown in fig. 3, the receiving, by the private network management system, the failure warning information in step 101, and performing warning preprocessing on the failure warning information, further includes:
the professional network management system receives the plurality of fault alarm information, determines a fault alarm type corresponding to each fault alarm information in the plurality of fault alarm information, determines a master-slave relationship of the plurality of fault alarm types for the plurality of fault alarm types caused by the same reason, and generates a work order according to the fault alarm information corresponding to the master fault alarm type based on the master-slave relationship.
The master-slave analysis of alarm types is shown in table 1:
TABLE 1
Figure BDA0002300055890000041
The corresponding master-slave relationship needs to be embodied on a professional network management system, such as adding a field or presenting slave alarm information of which alarm the alarm is on the original field, and the like without forwarding.
Based on this, it is necessary to modify the alarm forwarding interface of the professional network management system, modify the order triggering interface of the professional network management system and the service support system, and all require to add four field information of fault alarm type, fault root alarm position, fault problem phenomenon description, remote and field processing element.
And combing alarm preprocessing positioning logic by taking the alarm type as a unit. After the fault of the related alarm type occurs, the professional network management system firstly carries out preprocessing positioning operation according to rules, and the values of four fields of the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements are determined. The professional network management system adds the four field content information when forwarding to the service guarantee system. When the service guarantee system dispatches the work order, the preprocessing information is required to be automatically inserted into the work order, so that operation and maintenance personnel can conveniently perform processing analysis.
Therefore, when a fault occurs, the method can process the fault through the alarm preprocessing and the work order self-healing processing function; when the alarm can not be processed, master-slave alarms can be judged through alarm compression, positioning is carried out rapidly, only master alarm lists are arranged, and the list dispatching amount is reduced, so that the workload of operation and maintenance personnel is reduced.
The following examples illustrate the failure and the manner of handling.
(1) OLT equipment off-line
(11) Checking the state of an uplink device port of the OLT, if the uplink port DOWN indicates that the OLT device DOWN or a transmission link is broken or an optical module is abnormal, and distributing a branch company for processing;
the remote and field processing elements are described as: "the uplink port DOWN, OLT device DOWN or transmission link failure or optical module is abnormal, please distribute branch company for processing".
The field is displayed on the alarm board of the professional network management system and is forwarded to a comprehensive alarm module (which is a module existing in the prior art mainly for carrying out an alarm function) of the professional network management system.
(12) If the uplink port UP and the BAS are directly hung, whether the BAS static route is configured or not and whether the VLAN is transparently transmitted or not are checked, if the BAS static route is not configured, the static route configuration and the relevant VLAN is added for transparent transmission;
after adding the static routing configuration and the relevant VLAN for transparent transmission, performing the operation of the OLT ping, and if the ping is not needed to be forwarded to the comprehensive notice;
the remote and field processing elements are described as: "static routing, VLAN configuration is lacked, supplemented, and the ping result is 0% pkg loss after supplementation".
The alarm panel presents the process information to be forwarded and added at the same time.
(13) And if the uplink port is UP, if the switch is connected, checking whether the OLT manages the VLAN or not under the interface of the switch in a transparent transmission mode, if the VLAN is not managed in a transparent transmission mode, managing the VLAN under the interface in a transparent transmission mode, and ping to test the state of the OLT after configuration. If the ping is not required to be forwarded to the comprehensive notice, and if the fault is not recovered after the ping detection. The comprehensive alarm module needs to be forwarded, and the alarm board presents and simultaneously forwards and adds the processing information.
The remote and field processing elements are described as: the OLT manages VLAN configuration missing, is supplemented, and has a ping result of 0% pkg loss after the supplementation.
(14) And the configuration is not abnormal, the state of the OLT needs to be checked on site, and the order is dispatched to the branch company.
The remote and field processing elements are described as: "the UP port UP, configuration is not abnormal, and the OLT status needs to be checked on site, please dispatch the branch to process".
The field is displayed on a professional network management system alarm board and is forwarded to a comprehensive alarm module.
(2) Switch equipment off-line
(21) If the BAS port DOWN is connected to the switch, the fact that power failure or optical fiber breakage is caused is indicated, and the order needs to be dispatched to a branch company for processing;
the remote and field processing elements are described as: "the up connection port DOWN, caused by power failure or optical fiber breakage, please distribute the branch for processing".
The field is displayed on a professional network management system alarm board and is forwarded to a comprehensive alarm module.
(22) If the switch is connected with the BAS port UP, checking the transparent transmission condition of the switch management vlan and the static route adding condition on the BAS, if configuration is lost, adding the PING switch;
if the ping check is not successful, the processing information is forwarded to the comprehensive alarm module, and the alarm board displays the processing information and simultaneously forwards the processing information;
the remote and field processing elements are described as: the switch manages vlan transparent transmission abnormity, BAS static routing abnormity is supplemented, and the supplemented ping result is 0% pkg loss.
(23) If the switch is connected with the BAS, whether the vlan is transmitted through the connected switch or not and whether a static route is added to the BAS or not are respectively checked, if configuration is lost, the PING switch is added, and the list is automatically returned after the PING switch is normal.
If the ping check is not successful, the information needs to be forwarded to the comprehensive alarm module, and the alarm board presents the information and forwards and adds the processing information
The remote and field processing elements are described as: 'vlan transparent transmission abnormality of an uplink exchanger, BAS static routing abnormality, supplemented and supplemented ping result of 0% pkg loss'.
(3) OLT port receiving no light
(31) Checking the optical power transmitted by the PON port, if the transmitted optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
the remote and field processing elements are described as: "optical module is abnormal, optical module needs to be processed on site, please be distributed to branch company for processing".
The fault alarm type field takes value of optical module abnormity, and the fault root alarm position takes value: the name of the A-end equipment comprises equipment IP, port information and fault problem phenomenon description values: fault time, optical module fault, device name, device IP, frame/slot/subslot/port;
the fields are displayed on a professional network management system alarm board and are forwarded to a comprehensive alarm module.
(32) Sending light power normally, checking the user state at the PON port, generating LOS alarm and optical fiber fault if all the users are off-line, and dispatching the order division company.
The remote and on-site processing elements are as follows: the user is off-line, an LOS alarm is generated, the optical fiber fault is generated, and a distribution branch company is required to process the optical fiber fault.
The fault alarm type field takes a value of an optical fiber fault, and the fault root alarm position takes a value: the name of the A-end equipment comprises equipment IP, port information and fault problem phenomenon description values: time of failure, failure of optical fibers, equipment name, equipment IP, frame/slot/subslot/port.
The fields are displayed on a professional network management system alarm board and are forwarded to a comprehensive alarm module.
(33) And transmitting the optical power normally, checking the user state under the PON port, and automatically intercepting the list by the professional network management system due to power failure of the optical modem of the user if the users are all off-line.
The remote and field processing elements are described as: the user is off-line, and the professional network management system automatically intercepts the order due to power failure of the optical modem of the user.
The fault alarm type field value user is powered down, and the fault root alarm position value is as follows: the name of the A-end equipment comprises equipment IP, port information and fault problem phenomenon description values: failure time, user power down, device name, device IP, frame/slot/sub-slot/port.
The fields are displayed on a professional network management system alarm board without forwarding the comprehensive alarm module.
(34) And checking the optical power transmitted by the PON port, wherein if the transmitted optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly dispatch the order.
The remote and field processing elements are described as: "optical module is abnormal, optical module needs to be processed on site, please be distributed to branch company for processing".
The fault alarm type field takes value of optical module abnormity, and the fault root alarm position takes value: the name of the A-end equipment comprises equipment IP, port information and fault problem phenomenon description values: fault time, optical module fault, device name, device IP, frame/slot/subslot/port;
the fields are displayed on a professional network management system alarm board and are forwarded to a comprehensive alarm module.
(35) The transmitted light power is normal, the light receiving condition of a PON port is checked, the light receiving is abnormal, the light attenuation is too large, the light brazing rod connection condition needs to be checked on site, and a dispatching division company is used;
the remote and field processing elements are described as: "receive light unusual, light decay is too big, need the on-the-spot inspection light borer connection condition, please distribute the branch company to handle".
The fault alarm type field value receives light abnormity, and the fault root alarm position value: the name of the A-end equipment comprises equipment IP, port information and fault problem phenomenon description values: fault time, optical module fault, device name, device IP, frame/slot/subslot/port;
the fields are displayed on a professional network management system alarm board and are forwarded to a comprehensive alarm module.
(4) ONU single board state anomaly
Logging in ONU equipment to check the state of the single board, remotely resetting the single board at first, and directly returning the single board if the state of the single board is normal after resetting; if the device is not recovered after reset, remotely restarting the device again, observing the state of the single board after restarting, and automatically returning the order by a normal professional network management system; if the single board is not recovered, the single board may fail, and the dispatching division company checks the single board on site.
Fault alarm type field value: and (4) hardware board cards.
The remote and field process element descriptions are defined as follows: after the single board is reset remotely, the single board state is normal, and after the equipment is restarted, the single board state is normal;
in the above two types, the professional network management system does not forward the alarm, but needs to present the alarm in the professional network management system. If the board card is reset and the equipment is restarted, the remote and field processing element description is defined as follows according to the situation:
after reset and restart, the single board is still abnormal, the single board fails, and the single board is requested to be sent to a branch company for on-site inspection.
The fields are displayed on a professional network management system alarm board and are forwarded to a comprehensive alarm module.
In the embodiment of the present invention, as shown in fig. 4, the method further includes:
step 1011: after the corresponding configuration is issued, the professional network management system performs fault recovery verification to determine whether the fault is recovered, if so, the recovery information and the processing information are added to the corresponding work order, and the corresponding work order is sent to the service guarantee system;
step 1012: and the service guarantee system records the corresponding work order and does not dispatch the work order.
The following examples are given.
(1) OLT equipment off-line
(11) If the uplink port UP and the BAS are directly hung, whether the BAS static route is configured or not and whether the VLAN is transparently transmitted or not are checked, if the BAS static route is not configured, the static route configuration and the relevant VLAN is added for transparent transmission;
after adding the static route configuration and the relevant VLAN, performing the operation of the OLT ping, and if the ping is not successful, forwarding the ping to the comprehensive alarm module.
The remote and field processing elements are described as: static routing, VLAN configuration loss, supplementation, 0% pkg loss of ping result after supplementation, and alarm recovery.
The alarm panel presents the process information to be forwarded and added at the same time.
(12) And if the uplink port is UP, if the switch is connected, checking whether the OLT manages the VLAN or not under the interface of the switch in a transparent transmission mode, if the VLAN is not managed in a transparent transmission mode, managing the VLAN under the interface in a transparent transmission mode, and ping to test the state of the OLT after configuration.
If the ping is not communicated, the ping needs to be forwarded to the comprehensive alarm module, if the fault is not recovered after the ping detection, the comprehensive alarm module needs to be forwarded, and the alarm board displays that the processing information is forwarded and added at the same time.
The remote and field processing elements are described as: the OLT manages VLAN configuration loss, is supplemented, and after the supplementation, the ping result is 0% pkg loss and the alarm is recovered.
(2) Switch equipment off-line
(21) If the switch is connected with the BAS port UP, checking the transparent transmission condition of the switch management vlan and the static route adding condition on the BAS, if configuration is lost, adding the PING switch;
if the ping check is not successful, the processing information is forwarded to the comprehensive alarm module, and the alarm board displays the processing information and simultaneously forwards the processing information;
the remote and field processing elements are described as: the exception of vlan transparent transmission is managed by the switch, the exception of BAS static routing is supplemented, the ping result is 0% pkg loss after the supplementation, and the alarm is recovered.
(22) If the switch is connected with the BAS, whether the vlan is transmitted through the connected switch or not and whether a static route is added to the BAS or not are respectively checked, if configuration is lost, the PING switch is added, and the list is automatically returned after the PING switch is normal.
If the ping check is not successful, the processing information is forwarded to the comprehensive alarm module, and the alarm board displays the processing information and simultaneously forwards the processing information;
the remote and field processing elements are described as: 'the vlan transparent transmission of the upper connection exchanger is abnormal, the BAS static routing is abnormal, the upper connection exchanger is supplemented, the ping result is 0% pkg loss after the supplement, and the alarm is recovered'.
(3) ONU single board state anomaly
Logging in ONU equipment to check the state of the single board, remotely resetting the single board at first, and directly returning the single board if the state of the single board is normal after resetting; if the device is not recovered after reset, remotely restarting the device again, observing the state of the single board after restarting, and automatically returning the order by the normal network manager; if the single board is not recovered, the single board may fail, and the dispatching division company checks the single board on site.
Fault alarm type field value: and (4) hardware board cards.
The remote and field process element descriptions are defined as follows: after the single board is reset remotely, the state of the single board is normal and the alarm is recovered, and after the equipment is restarted, the state of the single board is normal and the alarm is recovered "
The fields are displayed on the network management alarm board and are forwarded to the comprehensive alarm module.
In the embodiment of the present invention, as shown in fig. 5, the method further includes:
step 103: the professional network management system sends fault alarm clearing information to the service guarantee system;
step 104: and the service support system receives fault alarm clearing information, stops the corresponding work order distribution based on the fault alarm clearing information, receives the fault reason of manual feedback, adds the fault reason into the corresponding work order, and stores the corresponding work order.
For the automatic receipt flow, when the service guarantee system receives the alarm clearing information sent by the professional network management system, the automatic receipt and the automatic confirmation are carried out, the field of 'manual feedback obstacle reason' is added in the service guarantee system, and the work order can be filed only after the manual feedback obstacle reason is filled.
In the embodiment of the present invention, as shown in fig. 6, the method further includes:
step 105: the service guarantee system sends an alarm receipt confirmation instruction to the professional network management system;
step 106: the professional network management system collects and processes the alarm information according to the alarm receipt confirmation instruction and returns the collected and processed result to the service guarantee system;
step 107: and the service support system adds the collected processing result to a corresponding work order and stores the corresponding work order.
For the manual 'application receipt' process, an 'alarm receipt confirmation' button is newly added from the service guarantee system through the service guarantee system and the professional network management system, the process of calling the test script packaged by the professional network management system is triggered, and the automatic script information result is inserted into the feedback information of the service guarantee system. For the obstacle of manual order settlement, a service guarantee system and a professional network management system are required to have the function of reverse obstacle cancellation. And after the service guarantee system manually checks the order, the clearing information is returned to the professional network management system.
For example.
The alarm receipt confirmation processing logic of the service guarantee system comprises the following steps:
(1) and (4) business rules:
clicking the 'alarm receipt confirmation' button can directly collect and process the receipt alarm information of the selected alarm.
At present, professional alarm receipt confirmation of PON and IPRAN is temporarily realized.
(2) Page logic
And selecting an alarm, and clicking an alarm receipt confirmation button:
when the "professional" of the alarm is "PON", the alarm receipt fault processing of the PON integrated network management system (i.e. a specific professional network management system) is directly triggered, and a receipt alarm information processing interface provided by the PON integrated network management system is called in real time. After the PON integrated network management system receives the preprocessing instruction pushed by the service provisioning system, the PON integrated network management system needs to collect and process the information of the alarm, and returns the processing result to the service provisioning system within a specified time, and the predetermined time is temporarily set to 60S. After receiving the processing information returned by the PON integrated network management system, the service support system is to be displayed on the pop-up frame when the "alarm receipt acknowledgement" button is clicked.
When the 'professional' of the alarm is 'IPRAN', the alarm receipt fault processing of the IPRAN network management system (namely a specific professional network management system) is directly triggered, and a receipt alarm information processing interface provided by the IPRAN network management system is called in real time. After receiving the preprocessing instruction pushed by the service guarantee system, the IPRAN network management system needs to collect and process the alarm information and return the processing result to the service guarantee system within a specified time, and the preset time is temporarily set to be 60S. After receiving the preprocessing information returned by the IPRAN network management system, the service guarantee system is displayed on a bullet frame when a 'warning receipt confirmation' button is clicked.
When the alarm is given "professional" not "PON, IPRAN" (professional automatic identification according to alarm work order), directly popping up a prompt box: the profession has no alarm receipt fault processing capability for the moment.
Entering the alarm receipt fault processing interface:
specialization: and selecting the 'specialty' corresponding to the selected alarm work order.
Fault single number: and taking the 'fault list number' of the fault list corresponding to the selected alarm.
Professional network management warning serial number: and (4) taking the 'alarm serial number' of the professional network management alarm corresponding to the selected alarm.
And the alarm receipt confirmation processing interface returns the parameters: can be set according to actual conditions.
And selecting an alarm, clicking an alarm receipt confirmation button, directly popping up an interface, and presenting receipt confirmation processing result information. If return in 60S: the information is presented on the bullet box.
The function button "insert feedback" appears on the bullet box. Clicking the button can update the piece of alarm receipt confirmation processing information into a feedback list of the fault receipt.
And (3) feedback content: "alarm serial number": XXX; \ n "pretreatment results": XXX; \ n "preprocessing information": XXX.
The feedback personnel: the current operator is defaulted.
And a feedback department: and defaulting the department to which the current operator belongs.
Feedback time: the current time is defaulted.
The feedback source is as follows: and (5) manual feedback.
If over 60S has not returned: the pre-processing is interrupted and the information is presented on the bullet box: and the interface of the professional network management system returns overtime. And closing the preprocessing result information presentation page after clicking the 'determination'.
If the time exceeds 60S, the method returns to the following steps: the service provisioning system discards directly.
The specific alarm receipt is shown in fig. 7.
The access network alarms and returns the order to confirm the logic:
(1) OLT equipment offline alarm
Logging in an OLT uplink device to ping, testing and checking the OLT connectivity, wherein the normal connectivity indicates that an OLT management link is normal; checking whether the state of an uplink port of the OLT is UP; checking the state flow state of an uplink port of the OLT; the above information indicates that the service is normal.
(2) Switch equipment off-line
And ping the uplink equipment of the switch to test whether the switch is normal or not and check whether the corresponding port is UP or not.
(3) PON port receiving no light alarm
Whether an online user exists in the optical modem under the PON port or not is checked, the user is online, the light receiving condition of the corresponding user OLT side, display on optical-info 00, and the light receiving value range is from-29 to-7.
(4) OLT PON received optical power abnormity warning
And checking whether the optical power transmitted by the OLT PON port is normal or not, wherein the value range of the optical power is 2db-7 db.
And checking whether the optical power of the PON port on the optical network terminal receiving ONT side is normal.
(5) ONU single-board state abnormity alarm confirmation
Logging in the ONU equipment, checking the single board state, and if the single board state is normal, indicating that the single board state is recovered.
Based on the same inventive concept, the embodiment of the present invention further provides a fault intelligent processing system, including: professional network management system and service guarantee system, wherein, professional network management system and the service guarantee system are used for: the intelligent fault processing method is realized.
The embodiment of the invention also provides a fault intelligent processing method (in the aspect of a professional network management system), which comprises the following steps:
receiving fault alarm information, performing alarm preprocessing on the fault alarm information, if the fault alarm information indicates a fault which can be recovered through configuration modification, issuing corresponding configuration, and if the fault alarm information indicates a fault which can not be recovered through configuration modification, generating a work order according to the fault alarm information, and sending the work order to a service guarantee system.
In the embodiment of the present invention, the method further includes: and after the corresponding configuration is issued, fault recovery verification is carried out, whether the fault is recovered or not is determined, and if the fault is recovered, the recovery information and the processing information are added into the corresponding work order.
In the embodiment of the present invention, the performing of the alarm preprocessing on the fault alarm information includes:
determining fault alarm type, fault root alarm position, fault problem phenomenon description, remote and field processing elements according to the fault alarm information, and determining whether the fault can be recovered through configuration modification according to the fault alarm type; and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements.
In the embodiment of the present invention, the performing alarm preprocessing on the fault alarm information further includes:
determining a fault alarm type corresponding to each fault alarm information in the received multiple fault alarm information, determining a master-slave relationship of the multiple fault alarm types for the multiple fault alarm types caused by the same reason, and determining to generate a work order according to the fault alarm information corresponding to the master fault alarm type based on the master-slave relationship.
In the embodiment of the present invention, the method further includes:
and receiving an alarm receipt confirmation instruction sent by the service support system, collecting and processing alarm information according to the alarm receipt confirmation instruction, and returning a collection processing result to the service support system.
Based on the same inventive concept, the embodiment of the present invention further provides an intelligent fault handling apparatus, as described in the following embodiments. Because the principle of the fault intelligent processing device for solving the problems is similar to that of the fault intelligent processing method, the implementation of the fault intelligent processing device can refer to the implementation of the fault intelligent processing method, and repeated parts are not described again. As used hereinafter, the term "unit" or "module" may be a combination of software and/or hardware that implements a predetermined function. Although the means described in the embodiments below are preferably implemented in software, an implementation in hardware, or a combination of software and hardware is also possible and contemplated.
This intelligent processing apparatus of trouble includes:
the fault warning information receiving module is used for receiving fault warning information;
the warning preprocessing module is used for carrying out warning preprocessing on the fault warning information;
the configuration issuing module is used for issuing corresponding configuration if the fault warning information indicates that the fault which can be recovered through configuration modification is detected;
and the work order generation issuing module is used for generating a work order according to the fault alarm information and sending the work order to a service guarantee system if the fault alarm information indicates that the fault which can not be recovered through configuration modification is present.
In the embodiment of the present invention, the method further includes:
and the fault recovery checking module is used for carrying out fault recovery checking after the corresponding configuration is issued, determining whether the fault is recovered, and if the fault is recovered, adding the recovery information and the processing information into the corresponding work order.
In the embodiment of the present invention, the alarm preprocessing module is specifically configured to:
determining fault alarm type, fault root alarm position, fault problem phenomenon description, remote and field processing elements according to the fault alarm information, and determining whether the fault can be recovered through configuration modification according to the fault alarm type;
the work order generation issuing module is specifically used for: and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements.
In the embodiment of the present invention, the alarm preprocessing module is specifically configured to:
determining a fault alarm type corresponding to each fault alarm information in the received multiple fault alarm information, and determining the master-slave relation of the multiple fault alarm types for the multiple fault alarm types caused by the same reason;
the work order generation issuing module is specifically used for: and determining to generate a work order according to fault alarm information corresponding to the main fault alarm type based on the master-slave relationship.
In the embodiment of the present invention, the method further includes:
and the alarm information collection processing module is used for receiving an alarm receipt confirmation instruction sent by the service support system, collecting and processing alarm information according to the alarm receipt confirmation instruction, and returning a collection processing result to the service support system.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor implements the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, and the computer readable storage medium stores a computer program for executing the method.
In summary, the fault intelligent processing method, device and system provided by the invention have the following advantages:
1. reduce unnecessary dispatch alarm tickets: by optimizing the alarm distribution rule of the professional network management system, the correlation merging between alarms is realized, and the alarm unit quantity which does not need to be processed is reduced.
2. The alarm processing is automatically completed through a professional network management system: for part of the OLT equipment to be offline and the switch equipment to be offline, it is desirable to automatically implement alarm recovery by a fault intelligent preprocessing function under the condition of definite lack of configuration or configuration error, that is, after professional direct processing is completed, a fault list is not distributed. For the alarms which are successfully preprocessed (self-healing is realized), the orders are required to be sent out, and the alarm occurrence traces are reserved for checking and recording, so that the fault preprocessing index is improved.
3. The display of the preprocessing information on the alarm list is realized: the professional network management system automatically logs in the equipment to carry out related preprocessing instructions, defines fault alarm information, and forwards the preprocessing information result and the alarm to the service guarantee system together, and the related preprocessing instruction information can be presented in the work order content at the same time.
4. Alarm related information can be automatically collected in fault processing: in the fault processing, alarm information collection can be initiated, such as collection of port optical power, error code information, flow information and the like, and manual login of equipment operation is not needed.
5. And (3) fault recovery confirmation before returning the fault work order: and a link of manually feeding back the reason of the fault is realized, and a link of manually confirming whether the fault alarm is accurate is added. And the automatic processing function of partial faults is realized. And the recovery confirmation and the business confirmation of the fault during manual receipt are realized.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes may be made to the embodiment of the present invention by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (19)

1. An intelligent fault handling method is characterized by comprising the following steps:
a professional network management system receives fault alarm information, performs alarm preprocessing on the fault alarm information, performs corresponding configuration issuing if the fault alarm information indicates a fault which can be recovered through configuration modification, generates a work order according to the fault alarm information if the fault alarm information indicates a fault which can not be recovered through configuration modification, and sends the work order to a service guarantee system;
and the service guarantee system distributes the work orders to corresponding operation and maintenance personnel.
2. The intelligent fault handling method of claim 1, further comprising: after the corresponding configuration is issued, the professional network management system performs fault recovery verification to determine whether the fault is recovered, if so, the recovery information and the processing information are added to the corresponding work order, and the corresponding work order is sent to the service guarantee system;
and the service guarantee system records the corresponding work order and does not dispatch the work order.
3. The intelligent fault processing method according to claim 2, wherein the performing of the alarm preprocessing on the fault alarm information by the professional network management system comprises:
the professional network management system determines the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements according to the fault alarm information, and determines whether the fault can be recovered through configuration modification according to the fault alarm type; and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements.
4. The intelligent fault processing method according to claim 1, wherein a professional network management system receives fault alarm information and performs alarm preprocessing on the fault alarm information, further comprising:
the professional network management system receives the plurality of fault alarm information, determines a fault alarm type corresponding to each fault alarm information in the plurality of fault alarm information, determines a master-slave relationship of the plurality of fault alarm types for the plurality of fault alarm types caused by the same reason, and generates a work order according to the fault alarm information corresponding to the master fault alarm type based on the master-slave relationship.
5. The intelligent fault handling method of claim 1, further comprising:
the professional network management system sends fault alarm clearing information to the service guarantee system;
and the service support system receives fault alarm clearing information, stops the corresponding work order distribution based on the fault alarm clearing information, receives the fault reason of manual feedback, adds the fault reason into the corresponding work order, and stores the corresponding work order.
6. The intelligent fault handling method of claim 1, further comprising:
the service guarantee system sends an alarm receipt confirmation instruction to the professional network management system;
the professional network management system collects and processes the alarm information according to the alarm receipt confirmation instruction and returns the collected and processed result to the service guarantee system;
and the service support system adds the collected processing result to a corresponding work order and stores the corresponding work order.
7. An intelligent fault handling method is characterized by comprising the following steps:
receiving fault alarm information, performing alarm preprocessing on the fault alarm information, if the fault alarm information indicates a fault which can be recovered through configuration modification, issuing corresponding configuration, and if the fault alarm information indicates a fault which can not be recovered through configuration modification, generating a work order according to the fault alarm information, and sending the work order to a service guarantee system.
8. The intelligent fault handling method of claim 7, further comprising: and after the corresponding configuration is issued, fault recovery verification is carried out, whether the fault is recovered or not is determined, and if the fault is recovered, the recovery information and the processing information are added into the corresponding work order.
9. The intelligent fault handling method according to claim 8, wherein performing alarm preprocessing on the fault alarm information includes:
determining fault alarm type, fault root alarm position, fault problem phenomenon description, remote and field processing elements according to the fault alarm information, and determining whether the fault can be recovered through configuration modification according to the fault alarm type; and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements.
10. The intelligent fault handling method according to claim 7, wherein the alarm preprocessing is performed on the fault alarm information, and further comprising:
determining a fault alarm type corresponding to each fault alarm information in the received multiple fault alarm information, determining a master-slave relationship of the multiple fault alarm types for the multiple fault alarm types caused by the same reason, and determining to generate a work order according to the fault alarm information corresponding to the master fault alarm type based on the master-slave relationship.
11. The intelligent fault handling method of claim 7, further comprising:
and receiving an alarm receipt confirmation instruction sent by the service support system, collecting and processing alarm information according to the alarm receipt confirmation instruction, and returning a collection processing result to the service support system.
12. An intelligent fault handling system, comprising: professional network management system and service guarantee system, wherein, professional network management system and the service guarantee system are used for: the intelligent fault handling method according to any one of claims 1 to 6 is implemented.
13. An intelligent fault handling device, comprising:
the fault warning information receiving module is used for receiving fault warning information;
the warning preprocessing module is used for carrying out warning preprocessing on the fault warning information;
the configuration issuing module is used for issuing corresponding configuration if the fault warning information indicates that the fault which can be recovered through configuration modification is detected;
and the work order generation issuing module is used for generating a work order according to the fault alarm information and sending the work order to a service guarantee system if the fault alarm information indicates that the fault which can not be recovered through configuration modification is present.
14. The fault intelligent processing device according to claim 13, further comprising:
and the fault recovery checking module is used for carrying out fault recovery checking after the corresponding configuration is issued, determining whether the fault is recovered, and if the fault is recovered, adding the recovery information and the processing information into the corresponding work order.
15. The intelligent fault handling device of claim 14, wherein the alarm preprocessing module is specifically configured to:
determining fault alarm type, fault root alarm position, fault problem phenomenon description, remote and field processing elements according to the fault alarm information, and determining whether the fault can be recovered through configuration modification according to the fault alarm type;
the work order generation issuing module is specifically used for: and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements.
16. The intelligent fault handling device of claim 13, wherein the alarm preprocessing module is specifically configured to:
determining a fault alarm type corresponding to each fault alarm information in the received multiple fault alarm information, and determining the master-slave relation of the multiple fault alarm types for the multiple fault alarm types caused by the same reason;
the work order generation issuing module is specifically used for: and determining to generate a work order according to fault alarm information corresponding to the main fault alarm type based on the master-slave relationship.
17. The fault intelligent processing device according to claim 13, further comprising:
and the alarm information collection processing module is used for receiving an alarm receipt confirmation instruction sent by the service support system, collecting and processing alarm information according to the alarm receipt confirmation instruction, and returning a collection processing result to the service support system.
18. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of any of claims 1 to 11 when executing the computer program.
19. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program for executing the method of any one of claims 1 to 11.
CN201911218121.2A 2019-12-03 2019-12-03 Intelligent fault processing method, device and system Active CN112910669B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911218121.2A CN112910669B (en) 2019-12-03 2019-12-03 Intelligent fault processing method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911218121.2A CN112910669B (en) 2019-12-03 2019-12-03 Intelligent fault processing method, device and system

Publications (2)

Publication Number Publication Date
CN112910669A true CN112910669A (en) 2021-06-04
CN112910669B CN112910669B (en) 2023-08-08

Family

ID=76103993

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911218121.2A Active CN112910669B (en) 2019-12-03 2019-12-03 Intelligent fault processing method, device and system

Country Status (1)

Country Link
CN (1) CN112910669B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113724437A (en) * 2021-08-30 2021-11-30 四川虹美智能科技有限公司 Unattended alarm method and system for unattended selling cabinet
CN113726555A (en) * 2021-08-02 2021-11-30 华迪计算机集团有限公司 System and method suitable for auxiliary analysis alarm of data communication network
CN115001946A (en) * 2022-06-01 2022-09-02 中国建设银行股份有限公司 Wrong packet flash failure processing method and system, electronic equipment and storage medium
CN115242603A (en) * 2022-07-20 2022-10-25 北京天一恩华科技股份有限公司 Fault state management method and device, electronic equipment and storage medium
CN117610667A (en) * 2024-01-17 2024-02-27 湖南傲思软件股份有限公司 Fault handling expert system, method and computer equipment based on open source large model

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101222742A (en) * 2007-11-22 2008-07-16 中国移动通信集团山东有限公司 Alarm self-positioning and self-processing method and system for mobile communication network guard system
CN101257694A (en) * 2007-11-22 2008-09-03 中国移动通信集团山东有限公司 Mobile communication network managing intelligent sending method based on information support system
WO2016062109A1 (en) * 2014-10-22 2016-04-28 中兴通讯股份有限公司 Fault category determination method and apparatus in passive optical network (pon)
WO2018137520A1 (en) * 2017-01-24 2018-08-02 华为技术有限公司 Service recovery method and apparatus
CN108401164A (en) * 2018-02-12 2018-08-14 中国联合网络通信集团有限公司 Fault handling method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101222742A (en) * 2007-11-22 2008-07-16 中国移动通信集团山东有限公司 Alarm self-positioning and self-processing method and system for mobile communication network guard system
CN101257694A (en) * 2007-11-22 2008-09-03 中国移动通信集团山东有限公司 Mobile communication network managing intelligent sending method based on information support system
WO2016062109A1 (en) * 2014-10-22 2016-04-28 中兴通讯股份有限公司 Fault category determination method and apparatus in passive optical network (pon)
WO2018137520A1 (en) * 2017-01-24 2018-08-02 华为技术有限公司 Service recovery method and apparatus
CN108401164A (en) * 2018-02-12 2018-08-14 中国联合网络通信集团有限公司 Fault handling method and device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
尤龙: "基于告警分析的故障智能预处理研究", 《通信与信息技术》 *
鲍伟华等: "移动核心网设备故障智能预处理系统研究", 《移动通信》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113726555A (en) * 2021-08-02 2021-11-30 华迪计算机集团有限公司 System and method suitable for auxiliary analysis alarm of data communication network
CN113724437A (en) * 2021-08-30 2021-11-30 四川虹美智能科技有限公司 Unattended alarm method and system for unattended selling cabinet
CN115001946A (en) * 2022-06-01 2022-09-02 中国建设银行股份有限公司 Wrong packet flash failure processing method and system, electronic equipment and storage medium
CN115242603A (en) * 2022-07-20 2022-10-25 北京天一恩华科技股份有限公司 Fault state management method and device, electronic equipment and storage medium
CN115242603B (en) * 2022-07-20 2024-01-02 北京天一恩华科技股份有限公司 Fault state management method and device, electronic equipment and storage medium
CN117610667A (en) * 2024-01-17 2024-02-27 湖南傲思软件股份有限公司 Fault handling expert system, method and computer equipment based on open source large model
CN117610667B (en) * 2024-01-17 2024-04-26 湖南傲思软件股份有限公司 Fault handling expert system, method and computer equipment based on open source large model

Also Published As

Publication number Publication date
CN112910669B (en) 2023-08-08

Similar Documents

Publication Publication Date Title
CN112910669A (en) Fault intelligent processing method, device and system
CN107508722B (en) Service monitoring method and device
CN102355368B (en) Fault processing method of network equipment and system
CN105049223B (en) A kind of power telecom network defect troubleshooting decision assistant analysis method
CN113179182B (en) Network supervision method, device, equipment and storage medium
CN103873283B (en) Ethernet passive optical network alert processing method and system
US11392873B2 (en) Systems and methods for simulating orders and workflows in an order entry and management system to test order scenarios
KR101171274B1 (en) System and method with erp system to manage planet risk during outage
CN109450527A (en) Failure determination method, device, computer equipment and storage medium
CN111010298B (en) PON (passive optical network) network fault monitoring method and device
CN108989128B (en) Fault positioning method and device based on networking structure
CN110519122A (en) A kind of network quality automatic monitoring device and method based on Mtr
JP2013130901A (en) Monitoring server and network device recovery system using the same
CN114025260A (en) Optical access network fault positioning method, device, equipment and medium
CN110609761B (en) Method and device for determining fault source, storage medium and electronic equipment
CN116506340A (en) Flow link testing method and device, electronic equipment and storage medium
CN114422386B (en) Monitoring method and device for micro-service gateway
CN112134764B (en) Method and device for determining connectivity of test environment
CN113179180A (en) Basalt client disaster fault repairing method, basalt client disaster fault repairing device and basalt client disaster storage medium
CN112995071A (en) Problem chip positioning method
CN114338688B (en) Data management method and device
CN107784731B (en) System for calculating time for uniformly replenishing cash of ATM (automatic teller machine) machine
TWI544756B (en) Group circuit obstacle detection system and its method
CN109510727A (en) A kind of method and system being automatically positioned ONU failure
CN113438117B (en) Network element work order processing method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant