CN110224874B - Equipment fault processing method and device - Google Patents

Equipment fault processing method and device Download PDF

Info

Publication number
CN110224874B
CN110224874B CN201910565044.1A CN201910565044A CN110224874B CN 110224874 B CN110224874 B CN 110224874B CN 201910565044 A CN201910565044 A CN 201910565044A CN 110224874 B CN110224874 B CN 110224874B
Authority
CN
China
Prior art keywords
fault
target
equipment
parameter
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910565044.1A
Other languages
Chinese (zh)
Other versions
CN110224874A (en
Inventor
赵春伟
李涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Apas Technology Co ltd
Original Assignee
Zhengzhou Apas Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Apas Technology Co ltd filed Critical Zhengzhou Apas Technology Co ltd
Priority to CN201910565044.1A priority Critical patent/CN110224874B/en
Publication of CN110224874A publication Critical patent/CN110224874A/en
Application granted granted Critical
Publication of CN110224874B publication Critical patent/CN110224874B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/069Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The embodiment of the application provides a method and a device for processing equipment faults, and belongs to the field of network control. The method comprises the following steps: extracting corresponding configuration files of each device, and analyzing the configuration files respectively to obtain parameter attributes corresponding to the devices respectively; aggregating the parameter attributes according to fault types respectively to form fault classification; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment; monitoring each device in real time, and acquiring corresponding target parameters according to fault classification of a target fault once the target fault is monitored; and generating a fault report according to the target parameters and sending the fault report. The method and the device can analyze the log information corresponding to the fault in time, and improve the processing efficiency of the equipment fault in the network.

Description

Equipment fault processing method and device
Technical Field
The present application relates to the field of computer technologies, and in particular, to a method and an apparatus for processing an equipment fault.
Background
When a network fails frequently, if the failed network device or link is not located and processed in time, the failure may spread to the whole network, i.e. network oscillation occurs. Network shock can cause total network disruption and service interruption. For example, if a clock failure occurs in a certain routing device, the system time of the routing device is hundreds of times faster than that of other routing devices in the network, which may cause other routing devices in the entire network to repeatedly delete and generate the entry corresponding to the routing device, thereby severely consuming resources of other routing devices, and when the resources of other routing devices are exhausted, causing a total network outage and a total network service interruption.
In the past, equipment often needs to be manually inspected or manually operated to find faults existing in an equipment system, but the fault finding process is often not timely and accurate enough, and after the faults are found, a large number of logs are manually extracted from the system to be analyzed, so that the faults of the hardware can be located, and the equipment maintenance efficiency is reduced. In addition, with the increasing number of devices in the network and the number of corresponding manufacturers of the devices, the commands and indexes provided by each manufacturer are different.
Disclosure of Invention
The embodiment of the application aims to provide a method and a device for processing equipment faults so as to realize fault classification and monitoring of various types of equipment in a network and improve the fault processing efficiency.
In order to solve the above technical problem, the embodiments of the present application are implemented as follows:
according to a first aspect of an embodiment of the present application, a method for processing an equipment fault is provided, where the method includes:
extracting corresponding configuration files of each device, and analyzing the configuration files respectively to obtain parameter attributes corresponding to the devices respectively;
aggregating the parameter attributes according to fault types respectively to form fault classification; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment;
monitoring each device in real time, and acquiring corresponding target parameters according to fault classification of a target fault once the target fault is monitored;
and generating a fault report according to the target parameters and sending the fault report.
In one embodiment of the present application, when the configuration files are extracted according to the manufacturers corresponding to the devices, respectively, and the configuration files corresponding to the devices are extracted and analyzed,
segmenting the text content in the configuration file according to the corresponding types of the manufacturer and the equipment respectively;
integrating corresponding positions and paragraph marks of the feature words obtained by word segmentation in the configuration file respectively to obtain target formats corresponding to the configuration file respectively;
and analyzing the configuration file according to the target format to obtain the corresponding relation between the parameter attribute and the fault keyword.
In an embodiment of the present application, when the parameter attributes are aggregated according to the failure types,
extracting the fault keywords to match with a set fault type to obtain the mapping relation;
and respectively extracting the mapping relation and the corresponding relation between the parameter attribute and the fault keyword, and aggregating according to the fault type to obtain the respective corresponding parameter attribute of the fault classification.
In an embodiment of the present application, when the extracted fault keyword is matched with a predetermined fault type,
calling a given near-meaning word library, and extracting the near-meaning words corresponding to the fault keywords;
and matching the similar meaning words with the corresponding names of the fault types so as to improve the matching rate of the fault keywords.
In an embodiment of the present application, when the corresponding target parameter is obtained according to the fault classification to which the target fault belongs,
positioning corresponding target equipment according to the target fault;
calling corresponding target parameters in the target equipment according to the fault classification;
and triggering corresponding operation instructions according to the target parameters respectively, and acquiring respective corresponding parameter values of the target parameters respectively.
In an embodiment of the present application, when generating a fault report according to the target parameter,
correlating the target parameters with the corresponding parameter values to form corresponding log information, thereby generating a fault report corresponding to the target fault;
and sending the fault report to a manufacturer of the target equipment.
According to a second aspect of the embodiments of the present application, there is provided an apparatus for processing a device failure, the apparatus including:
the analysis module is used for extracting the corresponding configuration files of the equipment and respectively analyzing the configuration files to obtain the parameter attributes respectively corresponding to the equipment;
the clustering module is used for aggregating the parameter attributes according to fault types respectively to form fault classifications; the fault classification and the parameter attribute are stored in an associated mode and used for uniformly dividing faults of the equipment;
the monitoring module is used for monitoring each device in real time, and acquiring corresponding target parameters according to the fault classification to which the target fault belongs once the target fault is monitored;
and the sending module is used for generating a fault report according to the target parameters and sending the fault report.
In an embodiment of the application, the parsing module specifically includes:
the word segmentation unit is used for segmenting the text content in the configuration file according to the corresponding types of the manufacturer and the equipment;
the integration unit is used for integrating the corresponding positions and paragraph marks of the feature words obtained by word segmentation in the configuration file respectively to obtain the target formats corresponding to the configuration files respectively;
and the analysis unit is used for respectively analyzing the configuration files according to the target formats to obtain the corresponding relation between the parameter attributes and the fault keywords.
In an embodiment of the present application, the clustering module specifically includes,
the matching unit is used for extracting the fault keywords to match with a set fault type to obtain the mapping relation;
and the aggregation unit is used for respectively extracting the mapping relation and the corresponding relation between the parameter attribute and the fault keyword, and aggregating according to the fault type to obtain the respective corresponding parameter attribute of the fault classification.
In an embodiment of the present application, the matching unit specifically includes:
the extraction submodule is used for calling a set synonym library and extracting the synonym corresponding to the fault keyword;
and the matching sub-module is used for matching the similar meaning words with the corresponding names of the fault types so as to improve the matching rate of the fault keywords.
In an embodiment of the present application, the monitoring module specifically includes:
the positioning unit is used for positioning corresponding target equipment according to the target fault;
the calling unit is used for calling corresponding target parameters in the target equipment according to the fault classification;
and the triggering unit is used for respectively triggering corresponding operation instructions according to the target parameters and respectively acquiring the parameter values corresponding to the target parameters.
In an embodiment of the present application, the sending module specifically includes:
the correlation unit is used for correlating the target parameters with the corresponding parameter values to form corresponding log information so as to generate a fault report corresponding to the target fault;
and the output unit is used for sending the fault report to a manufacturer of the target equipment.
According to the technical scheme provided by the embodiment of the application, the embodiment of the application extracts the corresponding configuration files of the equipment and analyzes the configuration files respectively to obtain the parameter attributes corresponding to the equipment respectively; aggregating the parameter attributes according to fault types respectively to form fault classification; monitoring each device in real time, and acquiring corresponding target parameters according to fault classification of a target fault once the target fault is monitored; and generating a fault report according to the target parameters and sending the fault report. According to the method and the device, after the configuration files of the devices are analyzed, integrated and matched, the parameter attributes are cached according to the fault types, the corresponding parameter acquisition instruction is triggered after the target fault is monitored to be matched, the corresponding parameter attributes are acquired to generate the fault report, and the fault report is sent to the corresponding device manufacturer or operation and maintenance personnel, so that the log information corresponding to the fault is analyzed in time, and the processing efficiency of the device fault in the network is improved.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, it is obvious that the drawings in the following description are only some embodiments described in the present specification, and for those skilled in the art, other drawings can be obtained according to the drawings without any creative effort.
FIG. 1 is a flow diagram of a method of handling a device failure according to one embodiment of the present application;
FIG. 2 is a schematic structural diagram of an electronic device corresponding to device failure handling in a network according to an embodiment of the present application;
fig. 3 is a schematic structural diagram of a device for processing an equipment failure according to an embodiment of the present application.
Detailed Description
In order to make those skilled in the art better understand the technical solutions in the present specification, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present specification, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present specification without any creative effort shall fall within the protection scope of the present specification.
The embodiment of the application provides a method and a device for processing equipment faults.
First, a method for processing an equipment fault provided in an embodiment of the present application is described below.
At present, equipment faults in a network are often inspected in a manual mode, operation logs corresponding to the equipment in the network are periodically analyzed, working states corresponding to the equipment are judged, once the network faults are found, the equipment needs to be inspected one by one, target equipment with the faults is determined, and efficiency is very low. Some log analysis tools also collect logs corresponding to devices of various manufacturers, perform alarm after filtering parameter attributes, and perform analysis by operation and maintenance personnel, because devices in a network are from different manufacturers, the operation and maintenance personnel need to be familiar with the devices of different manufacturers, and the operation and maintenance personnel perform log analysis and then process the logs, so that the efficiency is low.
In this embodiment, first, configuration files corresponding to devices of manufacturers are collected, and the configuration files of the manufacturers are analyzed, that is, the configuration files corresponding to the devices are analyzed, parameter attributes obtained by analysis are aggregated uniformly according to the fault types of the devices, and the fault types corresponding to the devices are integrated in a semantic analysis manner; once the target fault is monitored, matching of the fault type is carried out, the target fault corresponding to the abnormal parameter attribute is determined, the fault type corresponding to the target fault is determined in a data analysis mode, the target parameter is called according to the matched fault type, a corresponding report is generated and sent to a manufacturer or operation and maintenance personnel, the operation and maintenance personnel or the manufacturer can rapidly process the fault, the operation and maintenance personnel do not need to be familiar with equipment parameters of all manufacturers, the information processing efficiency is improved, and the equipment maintenance work in a network is simplified.
Fig. 1 is a flowchart of a method for processing a device failure according to an embodiment of the present application, and as shown in fig. 1, the method may include the following steps:
in step 101, the corresponding configuration files of the devices are extracted and analyzed, so as to obtain the parameter attributes corresponding to the devices.
In this embodiment, the devices of each manufacturer are centrally managed in an information extraction manner, the configuration files corresponding to the devices are respectively extracted through the specific interfaces, the formats of the configuration files are read, and then the configuration files are analyzed, so as to obtain the parameter attributes obtained after analysis.
When equipment of a new manufacturer is accessed into the network, the corresponding configuration file is read according to the set data interface for analysis, and the parameter attribute corresponding to the new equipment is obtained.
In this embodiment, when the configuration file is a fault manual corresponding to the device, the configuration file may be analyzed to obtain corresponding parameter attributes, which is detailed as follows:
step a: segmenting the text content in the configuration file according to the corresponding types of the manufacturers and the equipment respectively;
analyzing a text structure corresponding to the configuration file, segmenting the text content in the configuration file, namely segmenting the configuration file by taking a segment, a line and a separator as a unit respectively to obtain a corresponding segmentation structure, removing corresponding virtual words such as a, the and the like, screening by combining word frequency, and removing common words to obtain corresponding characteristic words.
Step b: integrating corresponding positions and paragraph marks of the feature words obtained by word segmentation in the configuration file respectively to obtain target formats corresponding to the configuration file respectively;
extracting keywords corresponding to the fault and keywords corresponding to the parameter attribute from the feature words, and respectively counting the positions of the keywords in the configuration file and the paragraphs where the keywords are located, so as to fit the relative positions of the recorded fault keywords and the parameter attribute in the configuration file, and obtain the target formats corresponding to the configuration files respectively.
Step c: and analyzing the configuration file according to the target format to obtain the corresponding relation between the parameter attribute and the fault keyword.
And analyzing the configuration file according to the target format to obtain the corresponding relation between the fault keyword and the parameter attribute, wherein the corresponding parameter attribute is obtained for judgment once a certain fault occurs in the equipment, so that the fault parameter is quickly called.
Step 102: aggregating the parameter attributes according to fault types respectively to form fault classification; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment;
specifically, the parameter attributes are aggregated according to the fault types, and the method comprises the following steps:
step A: extracting the fault keywords to match with a set fault type to obtain the mapping relation;
because the document descriptions of the manufacturers are different and the description modes of the same fault may be different, the fault keyword in step 101 needs to be matched with a predetermined fault type to maintain the consistency of the fault type and obtain the mapping relationship between the fault type and the fault keyword. And calling a word library of the similar meaning words, extracting the similar meaning words corresponding to the fault keywords, and matching the similar meaning words with the fault type name together with the fault keywords to improve the matching rate of the fault keywords, so that the part describing the fault in the configuration document is recalled to the maximum extent.
And B, step B: and respectively extracting the mapping relation and the corresponding relation between the parameter attribute and the fault keyword, and aggregating according to the fault type to obtain the respective corresponding parameter attribute of the fault classification.
And performing association matching according to the mapping relation between the fault type and the fault keyword and the corresponding relation between the parameter attribute and the fault keyword, and aggregating by taking the fault type as a unit to obtain the parameter attribute corresponding to the fault classification.
In this embodiment, the fault classification is used to uniformly divide the equipment faults and store the equipment faults in association with the parameter attributes subjected to the integration processing. And caching the parameter attributes corresponding to the fault classification to the control end, and once new equipment is accessed to the network, analyzing the configuration file of the new equipment again, classifying and dividing according to the mapping relation between the obtained parameter attributes and the cached < parameter attributes, fault classification >, and thus integrating the newly accessed equipment into the fault processing system of the embodiment.
Step 103: monitoring each device in real time, and acquiring corresponding target parameters according to fault classification of a target fault once the target fault is monitored;
in this embodiment, when the corresponding target parameter is obtained according to the fault classification to which the target fault belongs,
step a): positioning corresponding target equipment according to the target fault;
monitoring the working state corresponding to each device in the network in real time, monitoring a target fault once monitoring that the output of a certain index corresponding to a certain node in the network is abnormal and is larger or smaller than a normal working threshold, taking the node (a server, a gateway, a router and the like) corresponding to the target fault as the corresponding target device, and outputting a fault name corresponding to the target device for analysis by operation and maintenance personnel.
In this embodiment, the number of the target devices is one or more, but when there are a plurality of target devices, the failure of one device affects the operating state of the entire link, so when there are a plurality of target devices, analysis is performed according to the data flow direction, and the device located upstream in the link is taken as the target device.
In other embodiments, if the target device is not configured with the function of outputting the fault name, analyzing according to the log corresponding to the target device, acquiring the parameter attribute corresponding to the target device, matching according to the abnormal parameter attribute and the incidence relation between the fault classification cached by the control end in step 102 and the parameter attribute, and determining the fault classification to which the target device belongs. And if the matching result has a plurality of fault classifications, sorting according to the matching rate, and selecting the fault classification with the highest matching rate as the target equipment.
Step b): calling corresponding target parameters in the target equipment according to the fault classification;
after the target equipment is determined, determining the fault classification corresponding to the target fault according to the target fault output by the target equipment, and determining the target parameter corresponding to the fault classification according to the incidence relation between the fault classification cached by the control end and the parameter attribute.
In the real-time monitoring process, abnormal parameter attributes are monitored to cause target faults, so the target parameters include other to-be-monitored parameter attributes besides the monitored abnormal parameter attributes, and the to-be-monitored parameter attributes need to be acquired according to a trigger command so as to analyze the working state of the target equipment.
Step c): and triggering corresponding operation instructions according to the target parameters respectively, and acquiring respective corresponding parameter values of the target parameters respectively.
In this embodiment, the target device with the fault is extracted, the operation instructions corresponding to the target parameters are respectively triggered in the target devices, the outputs corresponding to the operation instructions are extracted, and then the parameter values corresponding to the corresponding target parameters are obtained, so as to obtain the parameter states corresponding to the target devices with the fault, so that the target parameters and the corresponding parameter values are respectively written into the fault report in the following process.
Step 104: and generating a fault report according to the target parameters and sending the fault report.
In this embodiment, the target parameter and the parameter value corresponding to the target parameter are extracted, the target parameter and the parameter value are associated and then de-duplicated, and after the repeated target parameter and the corresponding parameter are proposed, the repeated target parameter and the corresponding parameter are written into the document template in the form of a predetermined table, respectively, to form a fault report, so that the log information of the target device in the state of the target fault is accurately reflected.
After the fault report is generated, the fault report is sent to a mailbox of a manufacturer or an administrator corresponding to the target equipment, the target equipment is subjected to fault alarm, and meanwhile, the target parameters corresponding to the target fault are sent to the manufacturer or the administrator corresponding to the target equipment, so that the manufacturer or the administrator corresponding to the target equipment does not need to run related operating instructions one by one, look up the related parameters, directly and quickly analyze all the output target parameters of the target fault, further repair the target fault, improve the equipment maintenance efficiency and reduce the equipment maintenance complexity.
Extracting corresponding configuration files of all equipment and analyzing the configuration files respectively to obtain parameter attributes corresponding to the equipment respectively; aggregating the parameter attributes according to fault types respectively to form fault classification; monitoring each device in real time, and acquiring corresponding target parameters according to fault classification of a target fault once the target fault is monitored; and generating a fault report according to the target parameters and sending the fault report. According to the method and the device, after the configuration files of the devices are analyzed, integrated and matched, the parameter attributes are cached according to the fault types, the corresponding parameter acquisition instruction is triggered after the target fault is monitored to be matched, the corresponding parameter attributes are acquired to generate the fault report, and the fault report is sent to the corresponding device manufacturer or operation and maintenance personnel, so that the log information corresponding to the fault is analyzed in time, and the processing efficiency of the device fault in the network is improved.
In another embodiment, when a device failure in a network is handled,
step 201: extracting corresponding configuration files of each device, and analyzing the configuration files respectively to obtain parameter attributes corresponding to the devices respectively;
in the embodiment, firstly, the commands or interfaces provided by each manufacturer need to be packaged, and the packaging mode needs to have stronger universality, so that the problem that when a new manufacturer server is added, the new manufacturer server can be packaged similarly, and the equipment is compatible with the existing equipment in the network is solved.
In this embodiment, if the command provided by the Dell vendor to view the states of the Raid card and the hard disk is MegaCli, the monitoring index is taken out through the command, the implementation manner is that this is used as a configuration item and is analyzed by a program, and the configuration file is exemplified as follows:
Figure BDA0002109326900000081
/>
Figure BDA0002109326900000091
in this embodiment, the configuration file represents the parameter attribute of the device through param, health _ check, expect, and the like, and may be obtained through parsing.
Step 202: aggregating the parameter attributes according to fault types respectively to form fault classification; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment;
and extracting param, health _ check, expect and related synonyms of corresponding parameter values of block, count and the like for analysis to form fault classification representing damage of the magnetic disk so as to be compatible with corresponding equipment of more manufacturers and further perform unified processing on the faults of the equipment.
Step 203: monitoring each device in real time, and acquiring corresponding target parameters according to fault classification of a target fault once the target fault is monitored;
in this embodiment, if it is monitored that a read-write Error occurs in a disk corresponding to a certain device, a corresponding Error type is obtained, that is, a "-PDList-aALL-NoLog | grep-i Error" command is called to obtain a corresponding target parameter 'Media Error Count:'.
Step 204: and generating a fault report according to the target parameters and sending the fault report.
And generating a fault report by the acquired 'Media Error Count', the corresponding parameter value and the threshold value '40' and sending the fault report to operation and maintenance personnel, so that the operation and maintenance personnel can judge conveniently, if the parameter value is larger than 40, namely the bad track of the disk, the read-write Error of the disk is determined, otherwise, other errors are determined, and other instructions are required to be operated in the target equipment for troubleshooting.
In another alternative embodiment, when handling a device failure:
fig. 2 is a schematic structural diagram of an electronic device according to an embodiment of the present application. On the hardware level, the electronic device comprises a processor and optionally an internal bus, a network interface and a memory. The Memory may include a Memory, such as a Random-Access Memory (RAM), and may further include a non-volatile Memory, such as at least 1 disk Memory. Of course, the electronic device may also include hardware required for other services.
The processor, the network interface, and the memory may be connected to each other by an internal bus, which may be an ISA (Industry Standard Architecture) bus, a PCI (Peripheral Component Interconnect) bus, an EISA (Extended Industry Standard Architecture) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one double-headed arrow is shown in FIG. 2, but this does not indicate only one bus or one type of bus.
And the memory is used for storing programs. In particular, the program may include program code comprising computer operating instructions. The memory may include both memory and non-volatile storage and provides instructions and data to the processor.
The processor reads the corresponding computer program from the nonvolatile memory into the memory and then runs the computer program, and the processing device of the equipment failure is formed on the logic level. The processor is used for executing the program stored in the memory and is specifically used for executing the following operations:
extracting corresponding configuration files of each device, and analyzing the configuration files respectively to obtain parameter attributes corresponding to the devices respectively;
aggregating the parameter attributes according to fault types respectively to form fault classification; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment;
monitoring each device in real time, and acquiring corresponding target parameters according to fault classification of a target fault once the target fault is monitored;
and generating a fault report according to the target parameters and sending the fault report.
The method for processing the device failure according to the embodiment shown in fig. 2 of the present application may be applied to a processor, or may be implemented by a processor. The processor may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in a processor or instructions in the form of software. The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; but also Digital Signal Processors (DSPs), application Specific Integrated Circuits (ASICs), field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components. The various methods, steps, and logic blocks disclosed in the embodiments of the present application may be implemented or performed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present application may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software module may be located in ram, flash memory, rom, prom, or eprom, registers, etc. storage media as is well known in the art. The storage medium is located in a memory, and a processor reads information in the memory and combines hardware thereof to complete the steps of the method.
The electronic device may also execute the method in fig. 1, and implement the function of the device failure processing apparatus in the embodiment shown in fig. 1, which is not described herein again in this embodiment of the present application.
Of course, besides the software implementation, the electronic device in this specification does not exclude other implementations, such as logic devices or a combination of software and hardware, and the like, that is, the execution subject of the following processing flow is not limited to each logic unit, and may also be hardware or logic devices.
Fig. 3 is a schematic structural diagram of a device for processing a device failure according to an embodiment of the present application. Referring to fig. 3, in a software implementation, the apparatus 300 for processing a device failure in a picture may include: a parsing module 301, a clustering module 302, a monitoring module 303, and a sending module 304, wherein,
the analysis module 301 is configured to extract corresponding configuration files of the devices and analyze the configuration files respectively to obtain parameter attributes corresponding to the devices;
a clustering module 302, configured to aggregate the parameter attributes according to fault types, respectively, to form fault classifications; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment; (ii) a
The monitoring module 303 is configured to monitor each device in real time, and once a target fault is monitored, obtain a corresponding target parameter according to a fault classification to which the target fault belongs;
and a sending module 304, configured to generate a fault report according to the target parameter and send the fault report.
The parsing module 301 specifically includes:
the word segmentation unit is used for segmenting the text content in the configuration file according to the corresponding types of the manufacturer and the equipment;
the integration unit is used for integrating corresponding positions and paragraph marks of all feature words obtained by word segmentation in the configuration file respectively to obtain target formats corresponding to the configuration file respectively;
and the analysis unit is used for respectively analyzing the configuration files according to the target formats to obtain the corresponding relation between the parameter attributes and the fault keywords.
The clustering module 302, in particular comprising,
the matching unit is used for extracting the fault keywords to match with a set fault type to obtain the mapping relation;
and the aggregation unit is used for respectively extracting the mapping relation and the corresponding relation between the parameter attribute and the fault keyword, and aggregating according to the fault type to obtain the parameter attribute corresponding to each fault classification.
The matching unit specifically includes:
the extraction submodule is used for calling a set near-meaning word bank and extracting the near-meaning words corresponding to the fault keywords;
and the matching sub-module is used for matching the similar meaning words with the corresponding names of the fault types so as to improve the matching rate of the fault keywords.
The monitoring module 303 specifically includes:
the positioning unit is used for positioning corresponding target equipment according to the target fault;
the calling unit is used for calling corresponding target parameters in the target equipment according to the fault classification;
and the triggering unit is used for respectively triggering corresponding operation instructions according to the target parameters and respectively acquiring the parameter values corresponding to the target parameters.
The sending module 304 specifically includes:
the correlation unit is used for correlating the target parameters with the corresponding parameter values to form corresponding log information so as to generate a fault report corresponding to the target fault;
and the output unit is used for sending the fault report to a manufacturer of the target equipment.
In short, the above description is only a preferred embodiment of the present disclosure, and is not intended to limit the scope of the present disclosure. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present specification shall be included in the protection scope of the present specification.
The systems, devices, modules or units illustrated in the above embodiments may be implemented by a computer chip or an entity, or by a product with certain functions. One typical implementation device is a computer. In particular, the computer may be, for example, a personal computer, a laptop computer, a cellular telephone, a camera phone, a smartphone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or a combination of any of these devices.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static Random Access Memory (SRAM), dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), read Only Memory (ROM), electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrases "comprising a," "8230," "8230," or "comprising" does not exclude the presence of other like elements in a process, method, article, or apparatus comprising the element.
All the embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from other embodiments. In particular, for the system embodiment, since it is substantially similar to the method embodiment, the description is simple, and for the relevant points, reference may be made to the partial description of the method embodiment.

Claims (8)

1. A method for handling a device failure, the method comprising:
extracting corresponding configuration files of each device and analyzing the configuration files respectively to obtain the corresponding relation between the parameter attribute of each device and the fault keyword;
extracting the fault keywords to match with a set fault type to obtain a mapping relation; extracting the mapping relation and the corresponding relation between the parameter attribute and the fault keyword respectively, and aggregating according to the fault type to obtain the respective corresponding parameter attribute of the fault classification; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment;
monitoring each device in real time, and acquiring corresponding target parameters according to the fault classification to which the target fault belongs according to the incidence relation between the fault classification and the parameter attribute once the target fault is monitored;
and generating a fault report according to the target parameters and sending the fault report.
2. The method of claim 1, wherein the configuration files are extracted according to respective manufacturers of the devices, and when the configuration files corresponding to the devices are extracted and analyzed respectively,
segmenting the text content in the configuration file according to the corresponding types of the manufacturer and the equipment respectively;
integrating corresponding positions and paragraph marks of the feature words obtained by word segmentation in the configuration file respectively to obtain target formats corresponding to the configuration file respectively;
and analyzing the configuration file respectively according to the target format to obtain the corresponding relation between the parameter attribute and the fault keyword.
3. The method of claim 1, wherein extracting the fault key matches a predetermined fault type,
calling a given near meaning word library, and extracting the near meaning words corresponding to the fault keywords;
and matching the similar meaning words with the corresponding names of the fault types so as to improve the matching rate of the fault keywords.
4. The method of claim 1, comprising: when the corresponding target parameters are obtained according to the fault classification to which the target fault belongs,
positioning corresponding target equipment according to the target fault;
calling corresponding target parameters in the target equipment according to the fault classification;
and triggering corresponding operation instructions according to the target parameters respectively, and acquiring respective corresponding parameter values of the target parameters respectively.
5. The method of claim 4, wherein when generating a fault report based on the target parameter,
correlating the target parameters with corresponding parameter values to form corresponding log information, thereby generating a fault report corresponding to the target fault;
and sending the fault report to a manufacturer of the target equipment.
6. An apparatus for handling a device failure, the apparatus comprising:
the analysis module is used for extracting corresponding configuration files of all the equipment and respectively analyzing the configuration files to obtain the corresponding relation between the parameter attribute of each equipment and the fault keyword;
the clustering module comprises a matching unit and an aggregation unit, wherein the matching unit is used for extracting the fault keywords to match with the set fault types to obtain a mapping relation; the aggregation unit is used for respectively extracting the mapping relation and the corresponding relation between the parameter attribute and the fault keyword, and aggregating according to the fault type to obtain the respective corresponding parameter attribute of the fault classification; the fault classification and the parameter attribute are stored in an associated mode and are used for uniformly dividing the faults of the equipment;
the monitoring module is used for monitoring each device in real time, and acquiring corresponding target parameters according to the fault classification to which the target fault belongs according to the incidence relation between the fault classification and the parameter attribute once the target fault is monitored;
and the sending module is used for generating a fault report according to the target parameters and sending the fault report.
7. The apparatus according to claim 6, wherein the parsing module specifically includes:
the word segmentation unit is used for segmenting the text content in the configuration file according to the corresponding types of the manufacturer and the equipment;
the integration unit is used for integrating corresponding positions and paragraph marks of all feature words obtained by word segmentation in the configuration file respectively to obtain target formats corresponding to the configuration file respectively;
and the analysis unit is used for respectively analyzing the configuration files according to the target formats to obtain the corresponding relation between the parameter attributes and the fault keywords.
8. The apparatus according to claim 6, wherein the matching unit specifically includes:
the extraction submodule is used for calling a set near-meaning word bank and extracting the near-meaning words corresponding to the fault keywords;
and the matching sub-module is used for matching the synonym with the corresponding name of the fault type so as to improve the matching rate of the fault keyword.
CN201910565044.1A 2019-06-27 2019-06-27 Equipment fault processing method and device Active CN110224874B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910565044.1A CN110224874B (en) 2019-06-27 2019-06-27 Equipment fault processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910565044.1A CN110224874B (en) 2019-06-27 2019-06-27 Equipment fault processing method and device

Publications (2)

Publication Number Publication Date
CN110224874A CN110224874A (en) 2019-09-10
CN110224874B true CN110224874B (en) 2023-04-07

Family

ID=67814962

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910565044.1A Active CN110224874B (en) 2019-06-27 2019-06-27 Equipment fault processing method and device

Country Status (1)

Country Link
CN (1) CN110224874B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110855503A (en) * 2019-11-22 2020-02-28 叶晓斌 Fault cause determining method and system based on network protocol hierarchy dependency relationship
CN112134356A (en) * 2020-02-11 2020-12-25 吴龙圣 Smart power grid fault monitoring method and system based on big data analysis
CN111694686B (en) * 2020-06-03 2023-08-04 北京百度网讯科技有限公司 Processing method and device for abnormal service, electronic equipment and storage medium
CN112148733A (en) * 2020-09-15 2020-12-29 珠海格力电器股份有限公司 Method, device, electronic device and computer readable medium for determining fault type
CN112241163A (en) * 2020-10-28 2021-01-19 广州极飞科技有限公司 Unmanned aerial vehicle fault processing method, device, equipment and storage medium
CN114630352B (en) * 2020-12-11 2023-08-15 中国移动通信集团湖南有限公司 Fault monitoring method and device for access equipment
CN112818008A (en) * 2021-01-21 2021-05-18 中广核工程有限公司 Intelligent diagnosis method, system, medium and electronic equipment for nuclear power debugging faults

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109376873A (en) * 2018-09-11 2019-02-22 平安科技(深圳)有限公司 O&M method, apparatus, electronic equipment and computer readable storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6424562B2 (en) * 2014-10-23 2018-11-21 富士ゼロックス株式会社 Failure prediction device, failure prediction system, and program

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109376873A (en) * 2018-09-11 2019-02-22 平安科技(深圳)有限公司 O&M method, apparatus, electronic equipment and computer readable storage medium

Also Published As

Publication number Publication date
CN110224874A (en) 2019-09-10

Similar Documents

Publication Publication Date Title
CN110224874B (en) Equipment fault processing method and device
US9832280B2 (en) User profile configuring method and device
CN106656536B (en) Method and equipment for processing service calling information
CN113254255B (en) Cloud platform log analysis method, system, device and medium
CN110377569B (en) Log monitoring method, device, computer equipment and storage medium
CN113342564B (en) Log auditing method and device, electronic equipment and medium
EP3425524A1 (en) Cloud platform-based client application data calculation method and device
CN111881011A (en) Log management method, platform, server and storage medium
US10628250B2 (en) Search for information related to an incident
CN105630682A (en) System and method for automatically collecting and analyzing collapse of mobile terminal
CN109495291B (en) Calling abnormity positioning method and device and server
CN110647472A (en) Breakdown information statistical method and device, computer equipment and storage medium
CN111400361A (en) Data real-time storage method and device, computer equipment and storage medium
CN111581057B (en) General log analysis method, terminal device and storage medium
CN111177193A (en) Flink-based log streaming processing method and system
CN112306820A (en) Log operation and maintenance root cause analysis method and device, electronic equipment and storage medium
CN106789158A (en) Damage identification method and system are insured in a kind of cloud service
CN116841779A (en) Abnormality log detection method, abnormality log detection device, electronic device and readable storage medium
CN109818808B (en) Fault diagnosis method and device and electronic equipment
US11675647B2 (en) Determining root-cause of failures based on machine-generated textual data
CN111966339B (en) Buried point parameter input method and device, computer equipment and storage medium
CN113392000A (en) Test case execution result analysis method, device, equipment and storage medium
CN113326064A (en) Method for dividing business logic module, electronic equipment and storage medium
CN112241355A (en) Link tracking method, system, computer readable storage medium and electronic device
CN115186001A (en) Patch processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant