CN112910669B - Intelligent fault processing method, device and system - Google Patents

Intelligent fault processing method, device and system Download PDF

Info

Publication number
CN112910669B
CN112910669B CN201911218121.2A CN201911218121A CN112910669B CN 112910669 B CN112910669 B CN 112910669B CN 201911218121 A CN201911218121 A CN 201911218121A CN 112910669 B CN112910669 B CN 112910669B
Authority
CN
China
Prior art keywords
fault
alarm
port
site
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911218121.2A
Other languages
Chinese (zh)
Other versions
CN112910669A (en
Inventor
何庆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unihub China Information Technology Co Ltd
Original Assignee
Unihub China Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unihub China Information Technology Co Ltd filed Critical Unihub China Information Technology Co Ltd
Priority to CN201911218121.2A priority Critical patent/CN112910669B/en
Publication of CN112910669A publication Critical patent/CN112910669A/en
Application granted granted Critical
Publication of CN112910669B publication Critical patent/CN112910669B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention provides a fault intelligent processing method, device and system, wherein the method comprises the following steps: the professional network management system receives fault alarm information, carries out alarm pretreatment on the fault alarm information, carries out corresponding configuration issuing if the fault alarm information indicates a fault which can be recovered through configuration modification, and generates a work order according to the fault alarm information if the fault alarm information indicates a fault which cannot be recovered through configuration modification, and sends the work order to a service guarantee system; the service guarantee system distributes the work order to corresponding operation and maintenance personnel. The scheme can process the fault which can be recovered through configuration modification by carrying out corresponding configuration issuing through the professional network management system without manually arriving at the site for processing, and can generate a work order for the fault which cannot be recovered through configuration modification and send the work order to the service support system for dispatch, so that the fault is processed according to the situation, and the operation and maintenance efficiency can be improved.

Description

Intelligent fault processing method, device and system
Technical Field
The present invention relates to the field of fault processing technologies, and in particular, to a fault intelligent processing method, device, and system.
Background
The traditional fault handling method is as follows:
when the equipment fails, the professional network manager can generate an alarm, and the alarm is transferred to the comprehensive report system and the service insurance, and the service insurance meeting sends a bill to maintenance personnel; maintenance personnel can only reach on-site processing, and when processing faults, information such as light paths, routes and the like needs to be checked, and a plurality of systems need to be logged in during the period, so that the operation and maintenance are very complicated and the operation and maintenance efficiency is low.
Disclosure of Invention
The embodiment of the invention provides a fault intelligent processing method, device and system, which solve the technical problems that maintenance personnel can only reach the site to process fussy and the operation and maintenance efficiency is low in the prior art.
The embodiment of the invention provides a fault intelligent processing method, which comprises the following steps:
the professional network management system receives fault alarm information, carries out alarm pretreatment on the fault alarm information, carries out corresponding configuration issuing if the fault alarm information indicates a fault which can be recovered through configuration modification, and generates a work order according to the fault alarm information if the fault alarm information indicates a fault which cannot be recovered through configuration modification, and sends the work order to a service guarantee system;
the service guarantee system distributes the work order to corresponding operation and maintenance personnel.
The embodiment of the invention also provides a fault intelligent processing method, which comprises the following steps:
and receiving fault alarm information, carrying out alarm preprocessing on the fault alarm information, if the fault alarm information indicates a fault which can be recovered through configuration modification, carrying out corresponding configuration issuing, and if the fault alarm information indicates a fault which cannot be recovered through configuration modification, generating a work order according to the fault alarm information, and sending the work order to a service guarantee system.
The embodiment of the invention also provides a fault intelligent processing system, which comprises: the system comprises a professional network management system and a service guarantee system, wherein the professional network management system and the service guarantee system are used for: the fault intelligent processing method is realized.
The embodiment of the invention also provides a fault intelligent processing device, which comprises:
the fault alarm information receiving module is used for receiving the fault alarm information;
the alarm preprocessing module is used for carrying out alarm preprocessing on the fault alarm information;
the configuration issuing module is used for issuing corresponding configuration if the fault alarm information indicates a fault which can be recovered through configuration modification;
And the work order generation and issuing module is used for generating a work order according to the fault alarm information and sending the work order to a service guarantee system if the fault alarm information indicates a fault which cannot be recovered through configuration modification.
The embodiment of the invention also provides computer equipment, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor realizes the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, which stores a computer program for executing the method.
In the embodiment of the invention, the fault which can be recovered through configuration modification can be processed through corresponding configuration issuing of the professional network management system without manually arriving at the site for processing, and the fault which cannot be recovered through configuration modification is generated, and the work order is sent to the service guarantee system for dispatch, so that the fault is processed according to the situation, and the operation and maintenance efficiency can be improved.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart (system angle) of a fault intelligent processing method (I) provided by an embodiment of the invention;
FIG. 2 is a flow chart (system angle) (II) of a fault intelligent processing method according to an embodiment of the present invention;
FIG. 3 is a flow chart (system angle) (III) of a fault intelligent processing method provided by an embodiment of the invention;
fig. 4 is a flowchart (system angle) of a fault intelligent processing method according to an embodiment of the present invention;
FIG. 5 is a flowchart (system angle) of a fault intelligent processing method according to an embodiment of the present invention;
FIG. 6 is a flowchart (system angle) (six) of a fault intelligent processing method according to an embodiment of the present invention;
FIG. 7 is a schematic diagram of an alarm receipt according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
In an embodiment of the present invention, a fault intelligent processing method is provided, as shown in fig. 1, where the method includes:
step 101: the professional network management system receives fault alarm information, carries out alarm pretreatment on the fault alarm information, carries out corresponding configuration issuing if the fault alarm information indicates a fault which can be recovered through configuration modification, and generates a work order according to the fault alarm information if the fault alarm information indicates a fault which cannot be recovered through configuration modification, and sends the work order to a service guarantee system;
step 102: the service guarantee system distributes the work order to corresponding operation and maintenance personnel.
In the embodiment of the present invention, as shown in fig. 2, in step 101, the proprietary network management system performs an alarm preprocessing on the fault alarm information, including:
the professional network management system determines a fault alarm type, a fault root alarm position, a fault problem phenomenon description and a remote and on-site processing element according to the fault alarm information, and determines whether the fault is recoverable through configuration modification according to the fault alarm type; generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and on-site processing elements.
In the embodiment of the invention, a plurality of faults are possibly triggered by the same reason, so that maintenance personnel can not know the main faults and the auxiliary faults and can only check one by one, and the repeated workload is high during processing.
Based on this, as shown in fig. 3, in step 101, the proprietary network management system receives the fault alarm information, performs alarm preprocessing on the fault alarm information, and further includes:
the professional network management system receives a plurality of fault alarm information, determines a fault alarm type corresponding to each fault alarm information in the plurality of fault alarm information, determines a master-slave relationship of the plurality of fault alarm types for the plurality of fault alarm types caused by the same reason, and determines to generate a work order according to the fault alarm information corresponding to the master fault alarm type based on the master-slave relationship.
Master-slave analysis of alarm types is shown in Table 1:
TABLE 1
The corresponding master-slave relationship needs to be embodied on a professional network management system, such as adding a field or presenting the slave alarm information of which alarm is on the original field, not forwarding, and the like.
Based on the above, the alarm forwarding interface of the professional network management system needs to be modified, and the single trigger interface of the professional network management system and the service guarantee system needs to be modified, so that four field information of fault alarm types, fault root alarm positions, fault problem phenomenon descriptions and remote and field processing elements are required to be added.
And combing alarm preprocessing positioning logic by taking the alarm type as a unit. After the fault of the related alarm type occurs, the professional network management system carries out preprocessing and positioning operation according to rules to clearly determine the values of four fields of the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements. The professional network management system adds the content information of the four fields when forwarding to the service guarantee system. When the service guarantee system distributes the work order, the preprocessing information is required to be automatically inserted into the work order, so that operation and maintenance personnel can process and analyze the work order.
According to the method, when faults are generated, the fault can be processed through the alarm preprocessing and the work order self-healing processing function; when the processing is not finished, the master-slave alarms can be judged through the alarm compression, the master-slave alarms are rapidly positioned, only the master alarm list is arranged, and the list dispatching quantity is reduced, so that the workload of operation and maintenance personnel is reduced.
The failure and handling mode are illustrated below.
(1) OLT equipment off-line
(11) Checking the port state of the uplink equipment of the OLT, and if the uplink port is DOWN, indicating that the OLT equipment is DOWN or a transmission link is broken or an optical module is abnormal, and dispatching the processing of the distribution company;
the remote and on-site processing elements are described as: "uplink port DOWN, OLT device DOWN or transmission link DOWN or optical module DOWN, please dispatch the branch office to handle.
This field is presented in the professional network management system alarm board and forwarded to the comprehensive alarm module of the professional network management system (which is mainly used for performing the alarm function, and existing modules).
(12) If the port UP is connected UP, if the BAS is hung directly, checking whether the static route of the BAS is configured or not and whether the VLAN is transmitted transparently or not, if not, adding the static route configuration and the VLAN related to the transmission transparently;
after the static route configuration and the transparent transmission related VLAN are added, the OLT ping operation is carried out, if the ping is not always needed to be forwarded to comprehensive notice;
the remote and on-site processing elements are described as: "static routing, VLAN configuration missing, supplemented, ping result 0% pkg loss after supplementation".
The alert board presents the simultaneous forwarding of the added processing information.
(13) If the uplink port UP is connected, if the uplink switch is connected, checking whether the OLT management VLAN, such as the non-transparent management VLAN, is transmitted under the switch interface, and after configuration, the OLT state is tested by ping. If the ping is not always needed to be forwarded to the comprehensive notice, if the failure is not recovered after the ping is detected. And a comprehensive alarm module is required to be forwarded, and the alarm board presents the processing information which is forwarded and added simultaneously.
The remote and on-site processing elements are described as: "OLT manages VLAN configuration loss, supplemented, ping result 0% pkg loss after supplementation.
(14) The configuration is not abnormal, the OLT state is required to be checked on site, and the order is dispatched to the branch company.
The remote and on-site processing elements are described as: the uplink port UP is configured without abnormality, and the OLT state needs to be checked on site to ask the distribution company to process.
The field is presented on a professional network management system alarm board and forwarded to the comprehensive alarm module.
(2) Switch equipment off-line
(21) If the switch is connected with the BAS port DOWN in an uplink manner, indicating that power failure or optical fiber breakage is caused, and dispatching to branch companies is needed;
the remote and on-site processing elements are described as: "uplink port DOWN, power failure or optical fiber break, please dispatch company to handle it".
The field is presented on a professional network management system alarm board and forwarded to the comprehensive alarm module.
(22) If the switch is connected with the BAS port UP in an uplink mode, checking the vlan transparent transmission condition managed by the switch and the static route addition condition on the BAS, if configuration is lost, the PING switch is added;
if the ping verification is not always required to be forwarded to the comprehensive alarm module, the alarm board presents the processing information which is forwarded and added at the same time;
the remote and on-site processing elements are described as: "switch management vlan transparent transmission exception, BAS static routing exception, supplemented with ping result 0% pkg loss".
(23) If the switch is connected to the BAS in an uplink mode, whether the uplink switch transmits the vlan or not and whether static routes are added to the BAS or not are checked respectively, if the static routes are configured to be missing, the PING switch is added, and automatic receipt is performed after the static routes are normal.
If the ping check is not always needed to be forwarded to the comprehensive alarm module, the alarm board presents the simultaneous forwarding and increasing the processing information
The remote and on-site processing elements are described as: "upstream switch vlan transparent transmission abnormality, BAS static route abnormality, supplementary, ping result 0% pkg loss.
(3) OLT (optical line terminal) port receiving no light
(31) Checking the sending optical power of the PON port, if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
the remote and on-site processing elements are described as: "optical module is abnormal, the optical module needs to be processed on site, please be processed by a distribution company".
The fault alarm type field value optical module is abnormal, and the fault root alarm position value: the equipment name of the A end is equipment IP, port information and fault problem description value: the failure time, the optical module failure, the equipment name, the equipment IP, the frame/slot/sub-slot/port;
the fields are presented on the professional network management system alarm board and forwarded to the comprehensive alarm module.
(32) And the sending optical power is normal, the user state under the PON port is checked, and if all the users are offline, LOS alarm, optical fiber fault and order division are generated.
The remote and on-site processing elements are as follows: "user offline, generate LOS alert, optical drill failure, please dispatch branch office to process.
The fault alarm type field takes the value of the optical drill fault and the fault root alarm position takes the value of the fault root alarm position: the equipment name of the A end is equipment IP, port information and fault problem description value: failure time, optical drill failure, device name: device IP: box/slot/sub-slot/port.
The fields are presented on the professional network management system alarm board and forwarded to the comprehensive alarm module.
(33) And if the user is offline, the user light cat is powered down, and the professional network management system automatically cuts a bill.
The remote and on-site processing elements are described as: the user is offline, and the professional network management system automatically cuts a bill caused by power failure of the user light cat.
The user with the value of the fault alarm type field is powered down, and the value of the fault root alarm position is: the equipment name of the A end is equipment IP, port information and fault problem description value: the failure time, the user power down, the device name, device IP, box/slot/sub-slot/port.
The above fields present no forwarding comprehensive alarm module in the professional network management system alarm board.
(34) And (3) checking the sending optical power of the PON port, wherein if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill.
The remote and on-site processing elements are described as: "optical module is abnormal, the optical module needs to be processed on site, please be processed by a distribution company".
The fault alarm type field value optical module is abnormal, and the fault root alarm position value: the equipment name of the A end is equipment IP, port information and fault problem description value: the failure time, the optical module failure, the equipment name, the equipment IP, the frame/slot/sub-slot/port;
the fields are presented on the professional network management system alarm board and forwarded to the comprehensive alarm module.
(35) The transmitting optical power is normal, the light receiving condition of a PON port is checked, the light receiving is abnormal, the light attenuation is too large, the condition of optical fiber connection is required to be checked on site, and a bill division company is assigned;
the remote and on-site processing elements are described as: "abnormal light reception, excessive light attenuation, on-site inspection of the optical fiber connection condition, please send the branch company to deal with.
The value of the fault alarm type field is abnormal in light receiving, and the value of the fault root alarm position is: the equipment name of the A end is equipment IP, port information and fault problem description value: the failure time, the optical module failure, the equipment name, the equipment IP, the frame/slot/sub-slot/port;
the fields are presented on the professional network management system alarm board and forwarded to the comprehensive alarm module.
(4) ONU single board state abnormality
Logging in ONU equipment to check the state of the single board, firstly resetting the single board remotely, and if the state of the single board is normal after resetting, returning a bill directly; if the device is not recovered after reset, remotely restarting the device again, observing the state of the single board after restarting, and normally and automatically returning a receipt of a professional network management system; if not, the single board may be failed, and the dispatching company checks the single board on site.
The fault alert type field takes the value: a hardware board.
The remote and on-site process element descriptions are defined as follows: after the single board is reset remotely, the single board is in a normal state, and after the equipment is restarted, the single board is in a normal state;
the two types of the above systems are not forwarded by the professional network management system, but are needed to be presented in the professional network management system. If the board card is reset and the equipment is restarted, the remote and on-site processing element description is defined as follows:
the single board is still abnormal after reset and restart, and the single board fails, please send the single board division company to check the single board on site.
The fields are presented on the professional network management system alarm board and forwarded to the comprehensive alarm module.
In an embodiment of the present invention, as shown in fig. 4, the method further includes:
step 1011: after the corresponding configuration is issued, the professional network management system performs fault recovery verification to determine whether the fault is recovered, if so, the recovery information and the processing information are added into the corresponding work order, and the corresponding work order is sent to the service guarantee system;
Step 1012: the service guarantee system records the corresponding work order and does not dispatch the work order.
The following is an example.
(1) OLT equipment off-line
(11) If the port UP is connected UP, if the BAS is hung directly, checking whether the static route of the BAS is configured or not and whether the VLAN is transmitted transparently or not, if not, adding the static route configuration and the VLAN related to the transmission transparently;
and after the static routing configuration and the transparent transmission related VLAN are added, the OLT ping operation is performed, and if the ping is not always required to be forwarded to the comprehensive alarm module.
The remote and on-site processing elements are described as: "static route, VLAN configuration is missing, supplemented, ping result 0% pkg loss after supplementation, alarm is recovered".
The alert board presents the simultaneous forwarding of the added processing information.
(12) If the uplink port UP is connected, if the uplink switch is connected, checking whether the OLT management VLAN, such as the non-transparent management VLAN, is transmitted under the switch interface, and after configuration, the OLT state is tested by ping.
If the ping is not always required to be forwarded to the comprehensive alarm module, if the failure is not recovered after the ping is detected, the comprehensive alarm module is required to be forwarded, and the alarm board displays the processing information which is forwarded and added.
The remote and on-site processing elements are described as: "OLT manages VLAN configuration missing, supplemented, ping result 0% pkg loss after supplementation, alarm restored".
(2) Switch equipment off-line
(21) If the switch is connected with the BAS port UP in an uplink mode, checking the vlan transparent transmission condition managed by the switch and the static route addition condition on the BAS, if configuration is lost, the PING switch is added;
if the ping verification is not always required to be forwarded to the comprehensive alarm module, the alarm board presents the processing information which is forwarded and added at the same time;
the remote and on-site processing elements are described as: "the switch manages vlan transparent transmission abnormality, BAS static route abnormality, it has been supplemented, after supplementing ping result 0% pkg loss, the alarm has been recovered".
(22) If the switch is connected to the BAS in an uplink mode, whether the uplink switch transmits the vlan or not and whether static routes are added to the BAS or not are checked respectively, if the static routes are configured to be missing, the PING switch is added, and automatic receipt is performed after the static routes are normal.
If the ping verification is not always required to be forwarded to the comprehensive alarm module, the alarm board presents the processing information which is forwarded and added at the same time;
the remote and on-site processing elements are described as: ' the vlan transparent transmission of the uplink switch is abnormal, the BAS static route is abnormal, the supplementary ping result is 0% pkg loss after the supplementary ping, and the alarm is recovered.
(3) ONU single board state abnormality
Logging in ONU equipment to check the state of the single board, firstly resetting the single board remotely, and if the state of the single board is normal after resetting, returning a bill directly; if the device is not recovered after reset, remotely restarting the device again, observing the state of the single board after restarting, and automatically returning a bill by a normal network manager; if not, the single board may be failed, and the dispatching company checks the single board on site.
The fault alert type field takes the value: a hardware board.
The remote and on-site process element descriptions are defined as follows: after the single board is reset remotely, the single board is in normal state, the alarm is restored, and after the equipment is restarted, the single board is in normal state, and the alarm is restored "
The fields are presented on the network management alarm board and forwarded to the comprehensive alarm module.
In an embodiment of the present invention, as shown in fig. 5, the method further includes:
step 103: the professional network management system sends fault alarm clearing information to the service guarantee system;
step 104: the service guarantee system receives fault alarm clearing information, stops corresponding work order dispatch based on the fault alarm clearing information, receives fault reasons of manual feedback, adds the fault reasons to the corresponding work orders, and stores the corresponding work orders.
The method is characterized in that for the automatic receipt flow, when the service support system receives alarm clearing information sent by the professional network management system, automatic receipt is carried out, automatic confirmation is carried out, a field of 'manual feedback obstacle cause' is added in the service support system, and the work order can be archived only after the manual feedback obstacle cause is filled.
In an embodiment of the present invention, as shown in fig. 6, the method further includes:
Step 105: the service guarantee system sends an alarm receipt confirmation instruction to the professional network management system;
step 106: the professional network management system collects alarm information according to the alarm receipt confirmation instruction and returns the collection processing result to the service guarantee system;
step 107: and the service guarantee system adds the collection processing result to the corresponding work order and stores the corresponding work order.
The method is characterized in that for the manual 'application receipt' flow, an alarm receipt confirmation button is newly added from the service guarantee system through the service guarantee system and the professional network management system, and the encapsulated test script flow of the professional network management system is triggered and invoked, so that an automatic script information result is inserted into feedback information of the service guarantee system. For the obstacle of manual statement, the service guarantee system and the professional network management system are required to have the function of reverse obstacle blocking. And after the service guarantee system manually forms a bill, the clearing information is returned to the professional network management system.
For example.
Alarm receipt confirmation processing logic of the service guarantee system:
(1) Business rules:
and clicking the alarm receipt confirmation button can directly collect and process receipt alarm information for the selected alarm.
At present, the alarm receipt confirmation of the PON and IPRAN professions is temporarily realized.
(2) Page logic
When the alarm is selected and the alarm receipt confirmation button is clicked:
when the 'professional' of the alarm is 'PON', the alarm receipt fault processing of the PON comprehensive network management system (namely a specific professional network management system) is directly triggered, and a receipt alarm information processing interface provided by the PON comprehensive network management system is called in real time. After the PON comprehensive network management system receives the preprocessing instruction sent by the service guarantee system, information collection processing is carried out on the alarm, the processing result is returned to the service guarantee system within a set time, and the preset time is tentative for 60S. After receiving the processing information returned by the PON comprehensive network management system, the service guarantee system is presented on a bullet frame when clicking an alarm receipt confirmation button.
When the professional of the alarm is IPRAN, the alarm receipt fault processing of the IPRAN network management system (namely a specific professional network management system) is directly triggered, and a receipt alarm information processing interface provided by the IPRAN network management system is called in real time. After the IPRAN network management system receives the preprocessing instruction sent by the service guarantee system, information collection processing is carried out on the alarm, the processing result is returned to the service guarantee system within a set time, and the preset time is tentative for 60S. After receiving the preprocessing information returned by the IPRAN network management system, the service guarantee system is presented on a bullet frame when clicking an alarm receipt confirmation button.
When the alarm is 'professional' not 'PON and IPRAN' (professional is automatically identified according to an alarm work order), a prompt box is directly popped up: the profession has no alarm receipt fault processing capability.
Alarm receipt fault handling interface parameter:
and (3) profession: and taking the 'specialty' corresponding to the selected alarming worksheet.
Failure list number: and taking the 'failure list number' of the failure list corresponding to the selected alarm.
Professional network management alarm serial number: and taking the alarm serial number of the professional network management alarm corresponding to the selected alarm.
The alarm receipt confirmation processing interface returns the parameters: can be set according to actual conditions.
After the alarm is selected and the alarm receipt confirmation button is clicked, the interface is directly popped up and the receipt confirmation processing result information is presented. If return within 60S: information is presented on the bullet box.
The function button "insert feedback" appears on the bullet frame. Clicking the button can update the alarm receipt confirmation processing information to the feedback list of the trouble ticket.
The feedback content is as follows: "alarm Water number": XXX; n "pretreatment result": XXX; n "preprocessing information": XXX.
Feedback personnel: default to the current operator.
Feedback department: default to the department to which the current operator belongs.
Feedback time: default to the current time.
Feedback source: and (5) manual feedback.
If the above 60S is not returned: the preprocessing is interrupted and information is presented on the bullet box: the interface of the professional network management system returns a timeout. And closing the preprocessing result information presentation page after clicking the 'determining'.
If the time exceeds 60S, returning: the service assurance system drops directly.
A specific alarm receipt is shown in fig. 7.
Access network alarm receipt confirmation logic:
(1) Off-line alarm for OLT equipment
Logging in an OLT (optical line terminal) uplink device to perform ping, testing and checking the connectivity of the OLT, wherein the connectivity is normal to indicate that an OLT management link is normal; checking whether the uplink port state of the OLT is UP; checking the state flow state of the uplink port of the OLT; the above information normally indicates that the service is normal.
(2) Switch equipment off-line
The device ping tests whether the switch is normal or not on the switch uplink and checks whether the corresponding port is UP or not.
(3) PON port receiving no light alarm
Checking whether an on-line user exists in a cat below the PON port, checking the light receiving condition of the corresponding user OLT side, display ont optical-info 0 0, and receiving light in a value range of-29 to-7.
(4) OLT PON received optical power abnormality warning
And checking whether the optical power transmitted by the PON port of the OLT is normal or not, wherein the value range of the optical power is 2db-7db.
And checking whether the optical power of the PON port receiving ONT is normal or not.
(5) ONU single board state abnormity alarm confirmation
Logging in ONU equipment, checking the state of the single board, and if the single board state is normal, indicating that the recovery is carried out.
Based on the same inventive concept, the embodiment of the invention also provides a fault intelligent processing system, which comprises: the system comprises a professional network management system and a service guarantee system, wherein the professional network management system and the service guarantee system are used for: the fault intelligent processing method is realized.
The embodiment of the invention also provides a fault intelligent processing method (the angle of a professional network management system), which comprises the following steps:
and receiving fault alarm information, carrying out alarm preprocessing on the fault alarm information, if the fault alarm information indicates a fault which can be recovered through configuration modification, carrying out corresponding configuration issuing, and if the fault alarm information indicates a fault which cannot be recovered through configuration modification, generating a work order according to the fault alarm information, and sending the work order to a service guarantee system.
In an embodiment of the present invention, the method further includes: after the corresponding configuration is issued, fault recovery verification is carried out, whether the fault is recovered or not is determined, and if the fault is recovered, the recovery information and the processing information are added into the corresponding work order.
In the embodiment of the invention, the alarm preprocessing of the fault alarm information comprises the following steps:
determining a fault alarm type, a fault root alarm position, a fault problem phenomenon description and remote and field processing elements according to the fault alarm information, and determining whether the fault is recoverable through configuration modification according to the fault alarm type; generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and on-site processing elements.
In the embodiment of the invention, the alarm preprocessing is performed on the fault alarm information, and the method further comprises the following steps:
determining a fault alarm type corresponding to each fault alarm message in the received multiple fault alarm messages, determining a master-slave relation of the multiple fault alarm types for the multiple fault alarm types caused by the same reason, and determining to generate a work order according to the fault alarm message corresponding to the master fault alarm type based on the master-slave relation.
In an embodiment of the present invention, the method further includes:
and receiving an alarm receipt confirmation instruction sent by the service guarantee system, collecting alarm information according to the alarm receipt confirmation instruction, and returning a collection processing result to the service guarantee system.
Based on the same inventive concept, the embodiment of the invention also provides a fault intelligent processing device, as described in the following embodiment. Because the principle of the fault intelligent processing device for solving the problem is similar to that of the fault intelligent processing method, the implementation of the fault intelligent processing device can be referred to the implementation of the fault intelligent processing method, and the repetition is not repeated. As used below, the term "unit" or "module" may be a combination of software and/or hardware that implements the intended function. While the means described in the following embodiments are preferably implemented in software, implementation in hardware, or a combination of software and hardware, is also possible and contemplated.
The fault intelligent processing device comprises:
the fault alarm information receiving module is used for receiving the fault alarm information;
the alarm preprocessing module is used for carrying out alarm preprocessing on the fault alarm information;
the configuration issuing module is used for issuing corresponding configuration if the fault alarm information indicates a fault which can be recovered through configuration modification;
and the work order generation and issuing module is used for generating a work order according to the fault alarm information and sending the work order to a service guarantee system if the fault alarm information indicates a fault which cannot be recovered through configuration modification.
In an embodiment of the present invention, the method further includes:
and the fault recovery verification module is used for carrying out fault recovery verification after carrying out corresponding configuration issuing, determining whether the fault is recovered, and adding the recovery information and the processing information into the corresponding work order if the fault is recovered.
In the embodiment of the invention, the alarm preprocessing module is specifically used for:
determining a fault alarm type, a fault root alarm position, a fault problem phenomenon description and remote and field processing elements according to the fault alarm information, and determining whether the fault is recoverable through configuration modification according to the fault alarm type;
the work order generation and issuing module is specifically used for: generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and on-site processing elements.
In the embodiment of the invention, the alarm preprocessing module is specifically used for:
determining a fault alarm type corresponding to each fault alarm message in the received plurality of fault alarm messages, and determining a master-slave relationship of the plurality of fault alarm types for the plurality of fault alarm types caused by the same reason;
the work order generation and issuing module is specifically used for: and determining to generate a work order according to the fault alarm information corresponding to the main fault alarm type based on the master-slave relation.
In an embodiment of the present invention, the method further includes:
and the alarm information collection processing module is used for receiving an alarm receipt confirmation instruction sent by the service guarantee system, carrying out alarm information collection processing according to the alarm receipt confirmation instruction, and returning a collection processing result to the service guarantee system.
The embodiment of the invention also provides computer equipment, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor realizes the method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, which stores a computer program for executing the method.
In summary, the fault intelligent processing method, device and system provided by the invention have the following advantages:
1. reducing the alarm list which is not necessarily distributed: by optimizing alarm dispatch rules of the professional network management system, the association merging among alarms is realized, and the alarm single quantity which does not need to be processed is reduced.
2. And automatically completing alarm processing through a professional network management system: aiming at the offline of partial OLT equipment, the offline of the switch equipment is hoped to automatically realize alarm recovery through a fault intelligent preprocessing function under the condition of definite lack of configuration or configuration error, namely, after professional direct processing is finished, a fault list is not dispatched. For the alarms which are successfully preprocessed (realizing self-healing), the alarms need to be sent out, and the occurrence trace of the alarms is reserved for checking records, so that the fault preprocessing index is improved.
3. The preprocessing information is displayed on an alarm list: the professional network management system automatically logs in the equipment to carry out related preprocessing instruction, and definitely sends out fault alarm information, and forwards the result of the preprocessing information and the alarm to the service guarantee system together, wherein the related preprocessing instruction information can be simultaneously presented in the work order content.
4. Alarm related information can be automatically collected in fault processing: in fault processing, alarm information collection, such as collecting port optical power, error code information, flow information and the like, can be initiated without manually logging in equipment operation.
5. And (3) confirming fault recovery before a fault work order receipt: the manual feedback of the obstacle cause link is realized, and a manual confirmation link of whether the fault alarm is accurate or not is added. The automatic processing function of partial faults is realized. And the recovery confirmation and service confirmation links of faults in the manual receipt process are realized.
It will be appreciated by those skilled in the art that embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, and various modifications and variations can be made to the embodiments of the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (13)

1. The intelligent fault processing method is characterized by comprising the following steps of:
the professional network management system receives the fault alarm information and carries out alarm preprocessing on the fault alarm information; wherein, include:
the professional network management system receives a plurality of fault alarm information, determines a fault alarm type, a fault root alarm position, a fault problem phenomenon description and a remote and on-site processing element according to the fault alarm information, and determines whether the fault is recoverable through configuration modification according to the fault alarm type; determining master-slave relation of a plurality of fault alarm types for a plurality of fault alarm types caused by the same reason according to the fault alarm type corresponding to each fault alarm message in the plurality of fault alarm messages, determining fault alarm information corresponding to the master fault alarm type based on the master-slave relation, and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements;
If the fault alarm information indicates a fault which can be recovered through configuration modification, corresponding configuration issuing is carried out, and if the fault alarm information indicates a fault which cannot be recovered through configuration modification, a work order is generated according to the fault alarm information, and the work order is sent to a service guarantee system;
wherein, the fault alarm type at least comprises: the OLT equipment is off-line, the exchanger equipment is off-line, the OLT port receives no light and the ONU single board has abnormal state;
q1, when the OLT equipment is off-line,
q11, checking the port state of the uplink equipment of the OLT, and if the uplink port is DOWN, indicating that the OLT equipment is DOWN or a transmission link is broken or an optical module is abnormal, and dispatching the distribution company to process;
the remote and on-site processing elements are described as: uplink port DOWN, OLT equipment DOWN or transmission link break or optical module abnormality, please dispatch branch company to process;
the described fields are presented in the warning board of the professional network management system and forwarded to the comprehensive warning module of the professional network management system;
q12, if the uplink port UP is connected, if the BAS is directly hung, checking whether the static route of the BAS is configured or not and whether the VLAN is transmitted through or not, if the static route is not configured, adding the static route configuration and the VLAN related to the transmission through;
after adding static route configuration and transparent transmission related VLAN, carrying out OLT ping operation, if ping is not always needed to be forwarded to the comprehensive alarm module;
The remote and on-site processing elements are described as: static routing, VLAN configuration is deleted, supplementary and ping result after supplementary is 0% pkg loss;
the described fields forward the added processing information while the alarm board is presented;
q13, if the uplink port UP, if the uplink switch is used, checking whether the OLT management VLAN is transmitted through under the switch interface, if not, transmitting the management VLAN under the interface, and after configuration, ping to test the OLT state; if the ping is not always required to be forwarded to the comprehensive alarm module, if the failure is not recovered after the ping is detected, forwarding the failure to the comprehensive alarm module, and forwarding the processing information at the same time when the failure is presented on an alarm board;
the remote and on-site processing elements are described as: the OLT manages VLAN configuration deletion, and the ping result after supplementation is 0% pkg loss;
q14, configuring no abnormality, checking the OLT state on site, and dispatching a bill to a branch company;
the remote and on-site processing elements are described as: the uplink port UP is configured without abnormality, and the OLT state is required to be checked on site to ask the distribution company to process; the described fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q2, when the switch equipment is off-line,
q21, if the switch upper BAS port DOWN shows that power failure or optical fiber breakage is caused, the switch needs to dispatch a bill to a branch company for processing;
The remote and on-site processing elements are described as: the uplink port DOWN is caused by power failure or optical fiber breakage, and the uplink port DOWN is required to be processed by a distribution company;
the described fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q22, if the switch is connected with the BAS port UP in an uplink mode, checking the vlan transparent transmission condition managed by the switch and the static route addition condition on the BAS, if configuration is lost, adding the PING switch;
if the ping check is not successful, forwarding to the comprehensive alarm module, and forwarding the added processing information while the alarm board is presented;
the remote and on-site processing elements are described as: the switch manages vlan transparent transmission abnormality, BAS static route abnormality, supplementary, ping result 0% pkg loss after supplementary;
q23, if the switch is connected to the BAS by the switch, respectively checking whether the connected switch is connected with the vlan or not, and if static route is added to the BAS, if the configuration is missing, the PING switch is added, and if the configuration is normal, the automatic receipt is returned;
if the ping verification is not always required to be forwarded to the comprehensive alarm module, the alarm board presents the remote and on-site processing elements for simultaneously forwarding the added processing information as follows: the vlan transparent transmission of the uplink switch is abnormal, the BAS static route is abnormal, and the supplementary ping result is 0% pkg loss;
Q3, when the OLT port receives no light,
q31, checking the sending optical power of the PON port, if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
the remote and on-site processing elements are described as: the optical module is abnormal, the optical module needs to be processed on site, and a distribution company is required to process the optical module;
the fault alarm type field value optical module is abnormal, and the fault root alarm position value: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q32, the transmitted optical power is normal, the user state under the PON port is checked, if the users are all offline, LOS alarm is generated, the optical fiber is in fault, and the order division is dispatched;
the remote and on-site processing elements are as follows: the user goes offline, generates LOS alarm and optical fiber fault, and asks the distribution company to process;
the fault alarm type field takes the value of the optical drill fault and the fault root alarm position takes the value of the fault root alarm position: name of a terminal A device: device IP: port information, fault problem phenomenon description value: fault time, light drill fault, device name: device IP: a box/slot/sub-slot/port;
The fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q33, the transmitted light power is normal, the user state under the PON port is checked, if the users are all offline, the professional network management system automatically cuts a bill due to the power failure of the user light cat;
the remote and on-site processing elements are described as: the user is offline, and the professional network management system automatically intercepts a bill due to the power failure of the user light cat;
the user with the value of the fault alarm type field is powered down, and the value of the fault root alarm position is: name of a terminal A device: device IP: port information, fault problem phenomenon description value: fault time, user power down, device name: device IP: a box/slot/sub-slot/port;
the fields show that the comprehensive alarm module is not forwarded on the alarm board;
q34, checking the sending optical power of the PON port, if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
the remote and on-site processing elements are described as: the optical module is abnormal, the optical module needs to be processed on site, and a distribution company is required to process the optical module;
the fault alarm type field value optical module is abnormal, and the fault root alarm position value: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
The fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q35, the transmitted light power is normal, the light receiving condition of the PON port is checked, the light receiving is abnormal, the light attenuation is overlarge, the light drill connection condition is required to be checked on site, and the order division company is assigned;
the remote and on-site processing elements are described as: abnormal light receiving and overlarge light attenuation, and the condition of optical drill connection needs to be checked on site to be processed by a distribution company;
the value of the fault alarm type field is abnormal in light receiving, and the value of the fault root alarm position is: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q4, when the ONU single board state is abnormal,
logging in ONU equipment to check the state of the single board, firstly resetting the single board remotely, and if the state of the single board is normal after resetting, returning a bill directly; if the device is not recovered after reset, remotely restarting the device again, observing the state of the single board after restarting, and normally and automatically returning a receipt of a professional network management system; if not, the single board is possibly broken down, and the dispatching company checks the single board on site;
the fault alert type field takes the value: a hardware board card;
The remote and on-site process element descriptions are defined as follows: after the single board is reset remotely, the single board is in a normal state, and after the equipment is restarted, the single board is in a normal state;
the two types of the above systems are not forwarded by the professional network management system, but are needed to be presented in the professional network management system; if the board card is reset and the equipment is restarted, the remote and on-site processing element description is defined as follows:
after reset and restart, the single board is still abnormal, the single board fails, and the dispatching branch company is required to check the single board on site;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
the service guarantee system distributes the work order to corresponding operation and maintenance personnel.
2. The fault-intellectualized processing method of claim 1, further comprising: after the corresponding configuration is issued, the professional network management system performs fault recovery verification to determine whether the fault is recovered, if so, the recovery information and the processing information are added into the corresponding work order, and the corresponding work order is sent to the service guarantee system;
the service guarantee system records the corresponding work order and does not dispatch the work order.
3. The fault-intellectualized processing method of claim 1, further comprising:
The professional network management system sends fault alarm clearing information to the service guarantee system;
the service guarantee system receives fault alarm clearing information, stops corresponding work order dispatch based on the fault alarm clearing information, receives fault reasons of manual feedback, adds the fault reasons to the corresponding work orders, and stores the corresponding work orders.
4. The fault-intellectualized processing method of claim 1, further comprising:
the service guarantee system sends an alarm receipt confirmation instruction to the professional network management system;
the professional network management system collects alarm information according to the alarm receipt confirmation instruction and returns the collection processing result to the service guarantee system;
and the service guarantee system adds the collection processing result to the corresponding work order and stores the corresponding work order.
5. The intelligent fault processing method is characterized by comprising the following steps of:
receiving fault alarm information, and carrying out alarm preprocessing on the fault alarm information; wherein, include:
the professional network management system receives a plurality of fault alarm information, determines a fault alarm type, a fault root alarm position, a fault problem phenomenon description and a remote and on-site processing element according to the fault alarm information, and determines whether the fault is recoverable through configuration modification according to the fault alarm type; determining master-slave relation of a plurality of fault alarm types for a plurality of fault alarm types caused by the same reason according to the fault alarm type corresponding to each fault alarm message in the plurality of fault alarm messages, determining fault alarm information corresponding to the master fault alarm type based on the master-slave relation, and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and field processing elements;
If the fault alarm information indicates a fault which can be recovered through configuration modification, corresponding configuration issuing is carried out, and if the fault alarm information indicates a fault which cannot be recovered through configuration modification, a work order is generated according to the fault alarm information, and the work order is sent to a service guarantee system;
wherein, the fault alarm type at least comprises: the OLT equipment is off-line, the exchanger equipment is off-line, the OLT port receives no light and the ONU single board has abnormal state;
q1, when the OLT equipment is off-line,
q11, checking the port state of the uplink equipment of the OLT, and if the uplink port is DOWN, indicating that the OLT equipment is DOWN or a transmission link is broken or an optical module is abnormal, and dispatching the distribution company to process;
the remote and on-site processing elements are described as: uplink port DOWN, OLT equipment DOWN or transmission link break or optical module abnormality, please dispatch branch company to process;
the described fields are presented in the warning board of the professional network management system and forwarded to the comprehensive warning module of the professional network management system;
q12, if the uplink port UP is connected, if the BAS is directly hung, checking whether the static route of the BAS is configured or not and whether the VLAN is transmitted through or not, if the static route is not configured, adding the static route configuration and the VLAN related to the transmission through;
after adding static route configuration and transparent transmission related VLAN, carrying out OLT ping operation, if ping is not always needed to be forwarded to the comprehensive alarm module;
The remote and on-site processing elements are described as: static routing, VLAN configuration is deleted, supplementary and ping result after supplementary is 0% pkg loss;
the described fields forward the added processing information while the alarm board is presented;
q13, if the uplink port UP, if the uplink switch is used, checking whether the OLT management VLAN is transmitted through under the switch interface, if not, transmitting the management VLAN under the interface, and after configuration, ping to test the OLT state; if the ping is not always required to be forwarded to the comprehensive alarm module, if the failure is not recovered after the ping is detected, forwarding the failure to the comprehensive alarm module, and forwarding the processing information at the same time when the failure is presented on an alarm board;
the remote and on-site processing elements are described as: the OLT manages VLAN configuration deletion, and the ping result after supplementation is 0% pkg loss;
q14, configuring no abnormality, checking the OLT state on site, and dispatching a bill to a branch company;
the remote and on-site processing elements are described as: the uplink port UP is configured without abnormality, and the OLT state is required to be checked on site to ask the distribution company to process; the described fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q2, when the switch equipment is off-line,
q21, if the switch upper BAS port DOWN shows that power failure or optical fiber breakage is caused, the switch needs to dispatch a bill to a branch company for processing;
The remote and on-site processing elements are described as: the uplink port DOWN is caused by power failure or optical fiber breakage, and the uplink port DOWN is required to be processed by a distribution company;
the described fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q22, if the switch is connected with the BAS port UP in an uplink mode, checking the vlan transparent transmission condition managed by the switch and the static route addition condition on the BAS, if configuration is lost, adding the PING switch;
if the ping check is not successful, forwarding to the comprehensive alarm module, and forwarding the added processing information while the alarm board is presented;
the remote and on-site processing elements are described as: the switch manages vlan transparent transmission abnormality, BAS static route abnormality, supplementary, ping result 0% pkg loss after supplementary;
q23, if the switch is connected to the BAS by the switch, respectively checking whether the connected switch is connected with the vlan or not, and if static route is added to the BAS, if the configuration is missing, the PING switch is added, and if the configuration is normal, the automatic receipt is returned;
if the ping verification is not always required to be forwarded to the comprehensive alarm module, the alarm board presents the remote and on-site processing elements for simultaneously forwarding the added processing information as follows: the vlan transparent transmission of the uplink switch is abnormal, the BAS static route is abnormal, and the supplementary ping result is 0% pkgloss;
Q3, when the OLT port receives no light,
q31, checking the sending optical power of the PON port, if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
the remote and on-site processing elements are described as: the optical module is abnormal, the optical module needs to be processed on site, and a distribution company is required to process the optical module;
the fault alarm type field value optical module is abnormal, and the fault root alarm position value: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q32, the transmitted optical power is normal, the user state under the PON port is checked, if the users are all offline, LOS alarm is generated, the optical fiber is in fault, and the order division is dispatched;
the remote and on-site processing elements are as follows: the user goes offline, generates LOS alarm and optical fiber fault, and asks the distribution company to process;
the fault alarm type field takes the value of the optical drill fault and the fault root alarm position takes the value of the fault root alarm position: name of a terminal A device: device IP: port information, fault problem phenomenon description value: fault time, light drill fault, device name: device IP: a box/slot/sub-slot/port;
The fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q33, the transmitted light power is normal, the user state under the PON port is checked, if the users are all offline, the professional network management system automatically cuts a bill due to the power failure of the user light cat;
the remote and on-site processing elements are described as: the user is offline, and the professional network management system automatically intercepts a bill due to the power failure of the user light cat;
the user with the value of the fault alarm type field is powered down, and the value of the fault root alarm position is: name of a terminal A device: device IP: port information, fault problem phenomenon description value: fault time, user power down, device name: device IP: a box/slot/sub-slot/port;
the fields show that the comprehensive alarm module is not forwarded on the alarm board;
q34, checking the sending optical power of the PON port, if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
the remote and on-site processing elements are described as: the optical module is abnormal, the optical module needs to be processed on site, and a distribution company is required to process the optical module;
the fault alarm type field value optical module is abnormal, and the fault root alarm position value: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
The fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q35, the transmitted light power is normal, the light receiving condition of the PON port is checked, the light receiving is abnormal, the light attenuation is overlarge, the light drill connection condition is required to be checked on site, and the order division company is assigned;
the remote and on-site processing elements are described as: abnormal light receiving and overlarge light attenuation, and the condition of optical drill connection needs to be checked on site to be processed by a distribution company;
the value of the fault alarm type field is abnormal in light receiving, and the value of the fault root alarm position is: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q4, when the ONU single board state is abnormal,
logging in ONU equipment to check the state of the single board, firstly resetting the single board remotely, and if the state of the single board is normal after resetting, returning a bill directly; if the device is not recovered after reset, remotely restarting the device again, observing the state of the single board after restarting, and normally and automatically returning a receipt of a professional network management system; if not, the single board is possibly broken down, and the dispatching company checks the single board on site;
the fault alert type field takes the value: a hardware board card;
The remote and on-site process element descriptions are defined as follows: after the single board is reset remotely, the single board is in a normal state, and after the equipment is restarted, the single board is in a normal state;
the two types of the above systems are not forwarded by the professional network management system, but are needed to be presented in the professional network management system; if the board card is reset and the equipment is restarted, the remote and on-site processing element description is defined as follows:
after reset and restart, the single board is still abnormal, the single board fails, and the dispatching branch company is required to check the single board on site;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module.
6. The fault-intellectualized processing method of claim 5, further comprising: after the corresponding configuration is issued, fault recovery verification is carried out, whether the fault is recovered or not is determined, and if the fault is recovered, recovery information and processing information are added into the corresponding work order.
7. The fault-intellectualized processing method of claim 5, further comprising:
and receiving an alarm receipt confirmation instruction sent by the service guarantee system, collecting alarm information according to the alarm receipt confirmation instruction, and returning a collection processing result to the service guarantee system.
8. A fault-intellectualized processing system, comprising: the system comprises a professional network management system and a service guarantee system, wherein the professional network management system and the service guarantee system are used for: a fault intellectualization processing method according to any of claims 1 to 4 is implemented.
9. An intelligent fault handling device, comprising:
the fault alarm information receiving module is used for receiving the fault alarm information;
the alarm preprocessing module is used for carrying out alarm preprocessing on the fault alarm information;
the configuration issuing module is used for issuing corresponding configuration if the fault alarm information indicates a fault which can be recovered through configuration modification;
the work order generation and issuing module is used for generating a work order according to the fault alarm information and sending the work order to a service guarantee system if the fault alarm information indicates a fault which cannot be recovered through configuration modification;
wherein, the fault alarm type at least comprises: the OLT equipment is off-line, the exchanger equipment is off-line, the OLT port receives no light and the ONU single board has abnormal state;
q1, when the OLT equipment is off-line,
q11, checking the port state of the uplink equipment of the OLT, and if the uplink port is DOWN, indicating that the OLT equipment is DOWN or a transmission link is broken or an optical module is abnormal, and dispatching the distribution company to process;
The remote and on-site processing elements are described as: uplink port DOWN, OLT equipment DOWN or transmission link break or optical module abnormality, please dispatch branch company to process;
the described fields are presented in the warning board of the professional network management system and forwarded to the comprehensive warning module of the professional network management system;
q12, if the uplink port UP is connected, if the BAS is directly hung, checking whether the static route of the BAS is configured or not and whether the VLAN is transmitted through or not, if the static route is not configured, adding the static route configuration and the VLAN related to the transmission through;
after adding static route configuration and transparent transmission related VLAN, carrying out OLT ping operation, if ping is not always needed to be forwarded to the comprehensive alarm module;
the remote and on-site processing elements are described as: static routing, VLAN configuration is deleted, supplementary and ping result after supplementary is 0% pkg loss;
the described fields forward the added processing information while the alarm board is presented;
q13, if the uplink port UP, if the uplink switch is used, checking whether the OLT management VLAN is transmitted through under the switch interface, if not, transmitting the management VLAN under the interface, and after configuration, ping to test the OLT state; if the ping is not always required to be forwarded to the comprehensive alarm module, if the failure is not recovered after the ping is detected, forwarding the failure to the comprehensive alarm module, and forwarding the processing information at the same time when the failure is presented on an alarm board;
The remote and on-site processing elements are described as: the OLT manages VLAN configuration deletion, and the ping result after supplementation is 0% pkg loss;
q14, configuring no abnormality, checking the OLT state on site, and dispatching a bill to a branch company;
the remote and on-site processing elements are described as: the uplink port UP is configured without abnormality, and the OLT state is required to be checked on site to ask the distribution company to process; the described fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q2, when the switch equipment is off-line,
q21, if the switch upper BAS port DOWN shows that power failure or optical fiber breakage is caused, the switch needs to dispatch a bill to a branch company for processing;
the remote and on-site processing elements are described as: the uplink port DOWN is caused by power failure or optical fiber breakage, and the uplink port DOWN is required to be processed by a distribution company;
the described fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q22, if the switch is connected with the BAS port UP in an uplink mode, checking the vlan transparent transmission condition managed by the switch and the static route addition condition on the BAS, if configuration is lost, adding the PING switch;
if the ping check is not successful, forwarding to the comprehensive alarm module, and forwarding the added processing information while the alarm board is presented;
the remote and on-site processing elements are described as: the switch manages vlan transparent transmission abnormality, BAS static route abnormality, supplementary, ping result 0% pkg loss after supplementary;
Q23, if the switch is connected to the BAS by the switch, respectively checking whether the connected switch is connected with the vlan or not, and if static route is added to the BAS, if the configuration is missing, the PING switch is added, and if the configuration is normal, the automatic receipt is returned;
if the ping verification is not always required to be forwarded to the comprehensive alarm module, the alarm board presents the remote and on-site processing elements for simultaneously forwarding the added processing information as follows: the vlan transparent transmission of the uplink switch is abnormal, the BAS static route is abnormal, and the supplementary ping result is 0% pkg loss;
q3, when the OLT port receives no light,
q31, checking the sending optical power of the PON port, if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
the remote and on-site processing elements are described as: the optical module is abnormal, the optical module needs to be processed on site, and a distribution company is required to process the optical module;
the fault alarm type field value optical module is abnormal, and the fault root alarm position value: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q32, the transmitted optical power is normal, the user state under the PON port is checked, if the users are all offline, LOS alarm is generated, the optical fiber is in fault, and the order division is dispatched;
The remote and on-site processing elements are as follows: the user goes offline, generates LOS alarm and optical fiber fault, and asks the distribution company to process;
the fault alarm type field takes the value of the optical drill fault and the fault root alarm position takes the value of the fault root alarm position: name of a terminal A device: device IP: port information, fault problem phenomenon description value: fault time, light drill fault, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q33, the transmitted light power is normal, the user state under the PON port is checked, if the users are all offline, the professional network management system automatically cuts a bill due to the power failure of the user light cat;
the remote and on-site processing elements are described as: the user is offline, and the professional network management system automatically intercepts a bill due to the power failure of the user light cat;
the user with the value of the fault alarm type field is powered down, and the value of the fault root alarm position is: name of a terminal A device: device IP: port information, fault problem phenomenon description value: fault time, user power down, device name: device IP: a box/slot/sub-slot/port;
the fields show that the comprehensive alarm module is not forwarded on the alarm board;
q34, checking the sending optical power of the PON port, if the sending optical power is abnormal, the optical module is abnormal, and the optical module needs to be processed on site to directly send a bill;
The remote and on-site processing elements are described as: the optical module is abnormal, the optical module needs to be processed on site, and a distribution company is required to process the optical module;
the fault alarm type field value optical module is abnormal, and the fault root alarm position value: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q35, the transmitted light power is normal, the light receiving condition of the PON port is checked, the light receiving is abnormal, the light attenuation is overlarge, the light drill connection condition is required to be checked on site, and the order division company is assigned;
the remote and on-site processing elements are described as: abnormal light receiving and overlarge light attenuation, and the condition of optical drill connection needs to be checked on site to be processed by a distribution company;
the value of the fault alarm type field is abnormal in light receiving, and the value of the fault root alarm position is: name of a terminal A device: device IP: port information, fault problem phenomenon description value: failure time, optical module failure, device name: device IP: a box/slot/sub-slot/port;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module;
q4, when the ONU single board state is abnormal,
Logging in ONU equipment to check the state of the single board, firstly resetting the single board remotely, and if the state of the single board is normal after resetting, returning a bill directly; if the device is not recovered after reset, remotely restarting the device again, observing the state of the single board after restarting, and normally and automatically returning a receipt of a professional network management system; if not, the single board is possibly broken down, and the dispatching company checks the single board on site;
the fault alert type field takes the value: a hardware board card;
the remote and on-site process element descriptions are defined as follows: after the single board is reset remotely, the single board is in a normal state, and after the equipment is restarted, the single board is in a normal state;
the two types of the above systems are not forwarded by the professional network management system, but are needed to be presented in the professional network management system; if the board card is reset and the equipment is restarted, the remote and on-site processing element description is defined as follows:
after reset and restart, the single board is still abnormal, the single board fails, and the dispatching branch company is required to check the single board on site;
the fields are presented on the alarm board and forwarded to the comprehensive alarm module; the alarm preprocessing module is specifically used for:
determining a fault alarm type, a fault root alarm position, a fault problem phenomenon description and remote and field processing elements according to the fault alarm information, and determining whether the fault is recoverable through configuration modification according to the fault alarm type; determining a master-slave relationship of a plurality of fault alarm types for a plurality of fault alarm types caused by the same reason according to the fault alarm type corresponding to each of the received plurality of fault alarm information;
The work order generation and issuing module is specifically used for:
and determining fault alarm information corresponding to the main fault alarm type based on the master-slave relationship, and generating a work order according to the fault alarm type, the fault root alarm position, the fault problem phenomenon description and the remote and on-site processing elements.
10. The fault-intellectualized processing device of claim 9, further comprising:
and the fault recovery verification module is used for carrying out fault recovery verification after carrying out corresponding configuration issuing, determining whether the fault is recovered, and adding recovery information and processing information into a corresponding work order if the fault is recovered.
11. The fault-intellectualized processing device of claim 9, further comprising:
and the alarm information collection processing module is used for receiving an alarm receipt confirmation instruction sent by the service guarantee system, carrying out alarm information collection processing according to the alarm receipt confirmation instruction, and returning a collection processing result to the service guarantee system.
12. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of any of claims 1 to 7 when executing the computer program.
13. A computer readable storage medium, characterized in that the computer readable storage medium stores a computer program for executing the method of any one of claims 1 to 7.
CN201911218121.2A 2019-12-03 2019-12-03 Intelligent fault processing method, device and system Active CN112910669B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911218121.2A CN112910669B (en) 2019-12-03 2019-12-03 Intelligent fault processing method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911218121.2A CN112910669B (en) 2019-12-03 2019-12-03 Intelligent fault processing method, device and system

Publications (2)

Publication Number Publication Date
CN112910669A CN112910669A (en) 2021-06-04
CN112910669B true CN112910669B (en) 2023-08-08

Family

ID=76103993

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911218121.2A Active CN112910669B (en) 2019-12-03 2019-12-03 Intelligent fault processing method, device and system

Country Status (1)

Country Link
CN (1) CN112910669B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113726555A (en) * 2021-08-02 2021-11-30 华迪计算机集团有限公司 System and method suitable for auxiliary analysis alarm of data communication network
CN113724437B (en) * 2021-08-30 2022-08-19 四川虹美智能科技有限公司 Unattended alarm method and system for unattended selling cabinet
CN115001946A (en) * 2022-06-01 2022-09-02 中国建设银行股份有限公司 Wrong packet flash failure processing method and system, electronic equipment and storage medium
CN115242603B (en) * 2022-07-20 2024-01-02 北京天一恩华科技股份有限公司 Fault state management method and device, electronic equipment and storage medium
CN115766386A (en) * 2022-11-15 2023-03-07 中盈优创资讯科技有限公司 Method, device and equipment for real-time positioning and inquiring current alarm state
CN115801536A (en) * 2022-12-26 2023-03-14 中国联合网络通信集团有限公司 Alarm information processing method, device, server and storage medium
CN117610667B (en) * 2024-01-17 2024-04-26 湖南傲思软件股份有限公司 Fault handling expert system, method and computer equipment based on open source large model

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101222742A (en) * 2007-11-22 2008-07-16 中国移动通信集团山东有限公司 Alarm self-positioning and self-processing method and system for mobile communication network guard system
CN101257694A (en) * 2007-11-22 2008-09-03 中国移动通信集团山东有限公司 Mobile communication network managing intelligent sending method based on information support system
WO2016062109A1 (en) * 2014-10-22 2016-04-28 中兴通讯股份有限公司 Fault category determination method and apparatus in passive optical network (pon)
WO2018137520A1 (en) * 2017-01-24 2018-08-02 华为技术有限公司 Service recovery method and apparatus
CN108401164A (en) * 2018-02-12 2018-08-14 中国联合网络通信集团有限公司 Fault handling method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101222742A (en) * 2007-11-22 2008-07-16 中国移动通信集团山东有限公司 Alarm self-positioning and self-processing method and system for mobile communication network guard system
CN101257694A (en) * 2007-11-22 2008-09-03 中国移动通信集团山东有限公司 Mobile communication network managing intelligent sending method based on information support system
WO2016062109A1 (en) * 2014-10-22 2016-04-28 中兴通讯股份有限公司 Fault category determination method and apparatus in passive optical network (pon)
WO2018137520A1 (en) * 2017-01-24 2018-08-02 华为技术有限公司 Service recovery method and apparatus
CN108401164A (en) * 2018-02-12 2018-08-14 中国联合网络通信集团有限公司 Fault handling method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于告警分析的故障智能预处理研究;尤龙;《通信与信息技术》;20160725(第04期);第81-83页 *

Also Published As

Publication number Publication date
CN112910669A (en) 2021-06-04

Similar Documents

Publication Publication Date Title
CN112910669B (en) Intelligent fault processing method, device and system
CN107995049B (en) Cross-region synchronous fault monitoring method, device and system for power safety region
CN1992636B (en) System and method for processing warning information
CN106789323A (en) A kind of communication network management method and its device
CN105529831B (en) A kind of secondary equipment of intelligent converting station failure Computer Aided Analysis System
CN103295155B (en) Security core service system method for supervising
CN105049223B (en) A kind of power telecom network defect troubleshooting decision assistant analysis method
CN107370537B (en) A kind of detection method of Fibre-optical Pilot Protection
CN103973359A (en) OTN protection switching automation testing system, method and device
CN109450527A (en) Failure determination method, device, computer equipment and storage medium
CN101090296A (en) Diagnostic method and device for breakout of cable
CN102355368A (en) Fault processing method of network equipment and system
CN111010298B (en) PON (passive optical network) network fault monitoring method and device
CN113453260B (en) Method for realizing random selection and guarantee of 5G transmission sub-slices based on dynamic scheduling algorithm
CN108989128B (en) Fault positioning method and device based on networking structure
CN110768845A (en) Intelligent substation process level virtual connection fault positioning system
CN110752959A (en) Intelligent substation process layer physical link fault positioning system
CN107356284A (en) A kind of detection method, apparatus and system
CN112701783B (en) Distribution network operation and maintenance monitoring equipment and monitoring method
CN112134764B (en) Method and device for determining connectivity of test environment
CN113946465A (en) Big data fault processing method and related equipment thereof
CN114143160A (en) Cloud platform automation operation and maintenance system
CN107070694A (en) A kind of method in distributed communication network medium-long range batch debugging route-exchanging device
CN113179180A (en) Basalt client disaster fault repairing method, basalt client disaster fault repairing device and basalt client disaster storage medium
CN111030037B (en) Optical fiber differential protection fault analysis system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant