CN110704224A - Online fault processing method and system - Google Patents

Publication number
CN110704224A
Authority
CN
China
Prior art keywords
fault
online
data
reason
type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910879471.7A
Other languages
Chinese (zh)
Inventor
赵晓璐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Microphone Culture Media Co Ltd
Original Assignee
Shanghai Microphone Culture Media Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Microphone Culture Media Co Ltd
Priority to CN201910879471.7A
Publication of CN110704224A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/07: Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703: Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079: Root cause analysis, i.e. error or fault diagnosis

Abstract

The invention discloses an online fault processing method and system. The method comprises: when an online fault occurs, collecting the operation data information of the system at the moment of the fault; determining the fault type of the online fault based on that operation data information; searching a database for the fault causes and solutions that match the fault type, wherein each fault cause corresponds to at least one solution and is preset with a corresponding probability value; recommending an optimal fault cause and its corresponding solution according to the probability values of the fault causes; generating a fault report based on the optimal fault cause and the corresponding solution; and pushing the fault report to the corresponding operation and maintenance terminal. The invention automatically acquires and analyzes the fault information of an online system fault and pushes a corresponding solution to operation and maintenance personnel, thereby optimizing the online fault processing flow, improving processing efficiency, and reducing the workload of the operation and maintenance personnel.

Description

Online fault processing method and system
Technical Field
The invention relates to the technical field of software development and maintenance, in particular to an online fault processing method and system.
Background
During the continuous operation of an online IT system, faults that affect system stability usually occur. To keep the system running, each fault must be located accurately and resolved quickly as soon as it appears. In the prior art, faults are generally detected and repaired by experienced system operation and maintenance personnel and system developers. If the number or skill of these personnel cannot meet the demand, faults are not found and resolved in time, so the fault resolution cycle is long and normal operation of the system is seriously affected. Because the detection and handling of online system faults depends entirely on manual work, labor cost is high and fault processing efficiency is low.
Disclosure of Invention
The invention aims to solve at least one of the technical problems in the prior art. In particular, it provides an online fault processing method and system that automatically acquire and analyze the fault information of an online system fault and push a corresponding solution to operation and maintenance personnel. This optimizes the online fault processing flow, improves processing efficiency, reduces the workload of operation and maintenance personnel, and lowers the requirements on the number and skill of the personnel needed to operate and maintain the system.
In order to achieve the above object of the present invention, according to a first aspect of the present invention, there is provided an online fault handling method including the steps of:
when an online fault occurs, acquiring operation data information of a system at the moment of the fault;
determining the fault type of the online fault based on the operation data information at the fault moment;
searching fault reasons and solutions matched with the fault types in a database, wherein each fault reason corresponds to at least one solution, and each fault reason is preset with a corresponding probability value;
recommending an optimal fault reason and a corresponding solution according to the probability value of the fault reason;
generating a fault report based on the optimal fault reason and the corresponding solution;
and pushing the fault report to the corresponding operation and maintenance terminal.
Preferably, the method further comprises:
and monitoring the running state of the system in real time.
Preferably, monitoring the running state of the system in real time includes:
acquiring running state data of the system according to preset interval time;
comparing the running state data with preset state data of normal running of the system;
and judging whether the system has an online fault according to the comparison result.
Preferably, the determining the fault type of the online fault based on the operation data information at the fault time includes:
extracting fault data from the operation data information at the fault moment, wherein the fault data is data generated due to the online fault of the system;
and matching the corresponding fault type in a fault type table preset in the database according to the type of the fault data.
Preferably, the method further comprises:
acquiring fault frequency information of the fault type within preset time, and pushing a system maintenance suggestion to a corresponding operation and maintenance terminal based on the fault frequency information.
According to a second aspect of the present invention, there is provided an online fault handling system, the system comprising:
the operation data acquisition module is used for acquiring operation data information of a system at a fault moment when an online fault occurs;
the fault type judging module is used for determining the fault type of the online fault based on the operation data information at the fault moment;
the fault cause searching module is used for searching a fault cause and a solution which are matched with the fault type in a database, wherein each fault cause corresponds to at least one solution, and each fault cause is preset with a corresponding probability value;
the solution recommending module recommends an optimal fault reason and a corresponding solution according to the probability value of the fault reason;
the fault report generation module is used for generating a fault report based on the optimal fault reason and the corresponding solution;
and the fault report pushing module is used for pushing the fault report to the corresponding operation and maintenance terminal.
Preferably, the system further comprises:
and the running state monitoring module is used for monitoring the running state of the system in real time.
Preferably, the operation state monitoring module includes:
the state data acquisition unit is used for acquiring the running state data of the system according to preset interval time;
the state data comparison unit is used for comparing the running state data with the preset state data of normal running of the system;
and the online fault judging unit is used for judging whether the system has online faults according to the comparison result.
Preferably, the fault type determining module includes:
the fault data extraction unit is used for extracting fault data from the operation data information at the fault moment, wherein the fault data is data generated by on-line fault of a system;
and the fault type matching unit is used for matching the corresponding fault type in a fault type table preset in the database according to the type of the fault data.
Preferably, the system further comprises:
and the maintenance suggestion pushing module is used for acquiring the fault frequency information of the fault type within the preset time and pushing the system maintenance suggestions to the corresponding operation and maintenance terminals based on the fault frequency information.
According to the above scheme, the invention provides an online fault processing method and system. The method comprises: when an online fault occurs, collecting the operation data information of the system at the moment of the fault; determining the fault type of the online fault based on that operation data information; searching a database for the fault causes and solutions that match the fault type, wherein each fault cause corresponds to at least one solution and is preset with a corresponding probability value; recommending an optimal fault cause and its corresponding solution according to the probability values of the fault causes; generating a fault report based on the optimal fault cause and the corresponding solution; and pushing the fault report to the corresponding operation and maintenance terminal. The invention automatically acquires and analyzes the fault information of an online system fault and pushes a corresponding solution to operation and maintenance personnel. This optimizes the online fault processing flow, improves processing efficiency, reduces the workload of operation and maintenance personnel, and lowers the requirements on the number and skill of the personnel needed to operate and maintain the system.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a schematic flow chart of an online fault handling method according to a preferred embodiment of the present invention;
FIG. 2 is a schematic structural diagram of an online fault handling system according to a preferred embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
It will be understood by those skilled in the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
In the description of the present invention, it is to be understood that the terms "longitudinal", "lateral", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", and the like, indicate orientations or positional relationships based on those shown in the drawings, and are used merely for convenience of description and for simplicity of description, and do not indicate or imply that the referenced devices or elements must have a particular orientation, be constructed in a particular orientation, and be operated, and thus, are not to be construed as limiting the present invention.
In the description of the present invention, unless otherwise specified and limited, the terms "mounted" and "connected" are to be interpreted broadly: a connection may be, for example, a mechanical connection or an electrical connection, a communication between two elements, a direct connection, or an indirect connection via an intermediate medium. The specific meaning of these terms can be understood by those skilled in the art according to the specific situation.
According to a first aspect of the present invention, the present invention provides an online fault handling method, as shown in fig. 1, which may include the steps of:
S101, when an online fault occurs, acquiring operation data information of the system at the moment of the fault;
During the operation of an online system, system stability is the lifeline of an enterprise, so a fault must be located accurately and resolved quickly as soon as it appears. Because an online system generally runs continuously, some faults are inevitable, and operation and maintenance personnel must handle them promptly. To repair a fault quickly, when an online fault occurs while the system is running, the operation data information of the system at the moment of the fault is collected first, and the fault cause is then sought in that information, so that the fault can be repaired quickly and in a targeted manner.
S102, determining the fault type of the online fault based on the running data information at the fault moment;
After the operation data information at the moment of the fault is acquired, the fault self-diagnosis system determines the fault type of the online fault from that information, so that the corresponding fault cause and matching solution can subsequently be looked up by fault type. Specifically, the fault type may be determined as follows: first, fault data is extracted from the operation data information at the moment of the fault, the fault data being data generated as a result of the online fault; then the corresponding fault type is matched, according to the type of the fault data, in a fault type table preset in the database.
It should be noted that when an online fault occurs, some abnormal data inevitably appears in the operation data at the moment of the fault, and this abnormal data is often directly related to the fault cause. The abnormal data is therefore extracted from the operation data information as the fault data and analyzed to find the fault cause quickly. Different types of fault data correspond to different fault types. Accordingly, online faults over an earlier period are analyzed statistically to find the correlation between fault data and fault types, and the result is stored in the database as a fault type table that maps fault data to fault types. During fault self-diagnosis, the fault type can then be matched by analyzing the fault data.
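The extraction and matching described for S102 can be illustrated with a minimal sketch. The patent does not specify how fault data is represented, so the log-line format, the table contents, and the function names `extract_fault_data` and `match_fault_type` below are all assumptions made for illustration.

```python
from typing import Optional

# Hypothetical fault type table: kind of fault data -> fault type.
# In the described method this table is preset in the database.
FAULT_TYPE_TABLE = {
    "OutOfMemoryError": "memory_exhaustion",
    "ConnectionTimeout": "network_fault",
    "DiskFull": "storage_fault",
}

def extract_fault_data(operation_data: list[str]) -> list[str]:
    """Keep only the records that contain a known kind of abnormal data."""
    return [rec for rec in operation_data
            if any(kind in rec for kind in FAULT_TYPE_TABLE)]

def match_fault_type(fault_data: list[str]) -> Optional[str]:
    """Return the fault type of the first record whose kind is in the table."""
    for rec in fault_data:
        for kind, fault_type in FAULT_TYPE_TABLE.items():
            if kind in rec:
                return fault_type
    return None

# Assumed snapshot of operation data at the moment of the fault.
snapshot = [
    "12:00:01 INFO request served",
    "12:00:02 ERROR ConnectionTimeout while calling payment service",
]
print(match_fault_type(extract_fault_data(snapshot)))
```

Substring matching is only a stand-in here; a real fault type table could key on error codes or metric names instead.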
S103, searching fault reasons and solutions matched with the fault types in a database, wherein each fault reason corresponds to at least one solution, and each fault reason is preset with a corresponding probability value;
After the fault type is determined, the fault causes and solutions that match the fault type are looked up in the database. The correspondence between fault types, fault causes and solutions is likewise obtained by analyzing and handling online faults over an earlier period, and is stored in the database in advance. Each fault type may correspond to one or more fault causes. Based on the analysis of earlier fault data, a probability value is set for each of the fault causes that correspond to the same fault type; the higher the probability value, the more likely it is that the fault type was caused by that fault cause. In addition, at least one solution is preset for each fault cause that can lead to a given fault type. In this way, once the fault type is determined, the corresponding fault causes and solutions can be matched automatically in the database.
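One possible shape for the stored correspondence is sketched below, assuming an in-memory dictionary in place of the database. The `FaultCause` class, the `CAUSE_DB` contents and the probability values are hypothetical; the only constraints taken from the text are that each cause carries a preset probability value and at least one solution.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class FaultCause:
    description: str
    probability: float          # preset from analysis of earlier faults
    solutions: tuple[str, ...]  # at least one solution per cause

    def __post_init__(self) -> None:
        # Enforce the "at least one solution" rule from the description.
        if not self.solutions:
            raise ValueError("each fault cause needs at least one solution")

# Hypothetical stand-in for the database: fault type -> candidate causes.
CAUSE_DB: dict[str, list[FaultCause]] = {
    "network_fault": [
        FaultCause("downstream service overloaded", 0.6,
                   ("scale out the service",)),
        FaultCause("misconfigured firewall rule", 0.3,
                   ("roll back the rule change",)),
        FaultCause("DNS resolution failure", 0.1,
                   ("switch resolver", "flush DNS cache")),
    ],
}

def find_causes(fault_type: str) -> list[FaultCause]:
    """Look up every cause (with its solutions) recorded for a fault type."""
    return CAUSE_DB.get(fault_type, [])
```

An unknown fault type yields an empty list rather than an error, which leaves the decision about unmatched faults to the caller.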
S104, recommending an optimal fault reason and a corresponding solution according to the probability value of the fault reason;
After the fault causes and solutions that match the fault type are found, the system automatically recommends the optimal fault cause and its corresponding solution according to the probability values of the fault causes that can lead to the fault type. The optimal fault cause is the one most likely to have caused the fault type, that is, the fault cause with the highest probability value. Recommending the optimal fault cause gives operation and maintenance personnel a direction and a solution for repairing the fault, shortens the time they spend detecting and handling it, and improves fault processing efficiency.
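Selecting the cause with the highest probability value amounts to an argmax over the candidates. The sketch below assumes each candidate is a `(cause, probability, solutions)` tuple; that layout and the name `recommend` are illustrative choices, not part of the patent.

```python
from typing import Optional

# Assumed candidate layout: (cause, probability value, solutions).
Cause = tuple[str, float, list[str]]

def recommend(causes: list[Cause]) -> Optional[Cause]:
    """Return the candidate with the highest probability value, if any."""
    if not causes:
        return None
    return max(causes, key=lambda c: c[1])

candidates = [
    ("misconfigured firewall rule", 0.3, ["roll back the rule change"]),
    ("downstream service overloaded", 0.6, ["scale out the service"]),
    ("DNS resolution failure", 0.1, ["switch resolver"]),
]
best = recommend(candidates)
```

When two causes share the highest value, `max` returns the first one encountered; the text does not say how ties should be broken.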
S105, generating a fault report based on the optimal fault reason and the corresponding solution;
The system generates a fault report from the recommended optimal fault cause and the corresponding solution. The fault report provides operation and maintenance personnel with technical guidance and data support for resolving the online fault.
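The patent does not define the layout of the fault report. As a sketch only, the report could be serialized as JSON; the field names and the function `build_fault_report` below are assumptions.

```python
import json

def build_fault_report(fault_type: str, cause: str, probability: float,
                       solutions: list[str], occurred_at: str) -> str:
    """Assemble the recommended cause and its solutions into a JSON report."""
    report = {
        "occurred_at": occurred_at,        # moment of the fault
        "fault_type": fault_type,
        "recommended_cause": cause,
        "probability": probability,
        "solutions": solutions,
    }
    return json.dumps(report, ensure_ascii=False, indent=2)

example_report = build_fault_report(
    "network_fault", "downstream service overloaded", 0.6,
    ["scale out the service"], "2024-01-01T12:00:02Z",
)
```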
S106, pushing the fault report to the corresponding operation and maintenance terminal.
Finally, the generated fault report is pushed to the operation and maintenance terminal, so that the personnel holding the terminal can learn the fault information and the corresponding handling measures remotely and quickly, and can be guided by the report to resolve the online fault quickly. After the fault report is pushed to the terminal, an alarm prompt tone may be used to prompt the personnel to respond quickly.
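How a report reaches "the corresponding" terminal is not detailed in the text. The sketch below assumes a routing table keyed by fault type and models the terminal as an in-memory object; `OpsTerminal`, `push_report` and the alarm counter standing in for the prompt tone are all hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class OpsTerminal:
    owner: str
    inbox: list[str] = field(default_factory=list)
    alarms: int = 0

    def receive(self, report: str) -> None:
        self.inbox.append(report)
        self.alarms += 1  # stands in for the alarm prompt tone

def push_report(fault_type: str, report: str,
                routing: dict[str, OpsTerminal]) -> bool:
    """Deliver the report to the terminal registered for this fault type."""
    terminal = routing.get(fault_type)
    if terminal is None:
        return False      # no terminal registered for this fault type
    terminal.receive(report)
    return True

network_terminal = OpsTerminal(owner="network on-call")
routing = {"network_fault": network_terminal}
delivered = push_report("network_fault", "example report", routing)
```

In a deployed system the `receive` call would be replaced by whatever transport the terminals use, such as a message queue or a mobile push service.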
In this embodiment, the method may further include:
and monitoring the running state of the system in real time.
By monitoring the running state of the system in real time, the data acquisition mechanism can be triggered as soon as an online fault occurs, so that the operation data information of the system at the moment of the fault is collected immediately. Specifically, the running state of the system may be monitored in real time according to the following steps:
acquiring running state data of the system according to preset interval time;
comparing the running state data with preset state data of normal running of the system;
and judging whether the system has an online fault according to the comparison result.
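The three monitoring steps above can be sketched as a polling loop. The metric names, the normal ranges in `NORMAL_RANGES`, and the helper names are assumptions; the text only states that state data is sampled at a preset interval and compared with preset data for normal operation.

```python
import time
from typing import Callable, Optional

# Hypothetical preset ranges for normal operation: metric -> (low, high).
NORMAL_RANGES = {"cpu_percent": (0.0, 85.0), "error_rate": (0.0, 0.01)}

def deviations(sample: dict[str, float]) -> dict[str, float]:
    """Return the metrics in `sample` that fall outside their normal range."""
    return {
        name: value
        for name, value in sample.items()
        if name in NORMAL_RANGES
        and not (NORMAL_RANGES[name][0] <= value <= NORMAL_RANGES[name][1])
    }

def monitor(read_state: Callable[[], dict[str, float]],
            interval_s: float,
            max_samples: int,
            sleep: Callable[[float], None] = time.sleep
            ) -> Optional[dict[str, float]]:
    """Sample at a fixed interval; return the first deviations found."""
    for _ in range(max_samples):
        found = deviations(read_state())
        if found:
            return found      # an online fault is judged to have occurred
        sleep(interval_s)
    return None               # no fault within the sampled period
```

The `sleep` parameter is injectable so the loop can be exercised without waiting; a returned dictionary would be the trigger for collecting the operation data at the moment of the fault.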
In this embodiment, the method may further include:
acquiring fault frequency information of a fault type within preset time, and pushing a system maintenance suggestion to a corresponding operation and maintenance terminal based on the fault frequency information.
To allow targeted maintenance of the system in advance of further online faults of a given fault type, the fault frequency information of the fault type within a preset time can be obtained and analyzed statistically. If the number of occurrences of the fault type within the preset time is greater than a preset frequency threshold, a corresponding system maintenance suggestion is formed and pushed to the operation and maintenance terminal to remind the personnel to maintain the system in time. This prevents online faults of the same fault type from occurring at high frequency and further ensures stable operation of the system.
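The frequency check can be sketched as counting occurrences inside a sliding window and comparing the count with the threshold. The function name `maintenance_suggestion`, its parameters and the wording of the suggestion are assumptions made for illustration.

```python
from datetime import datetime, timedelta
from typing import Optional

def maintenance_suggestion(fault_type: str,
                           fault_times: list[datetime],
                           now: datetime,
                           window: timedelta,
                           threshold: int) -> Optional[str]:
    """Suggest maintenance if the fault type occurred more than
    `threshold` times within `window` before `now`."""
    recent = [t for t in fault_times if now - window <= t <= now]
    if len(recent) > threshold:
        return (f"{fault_type}: {len(recent)} faults in the last {window}; "
                "schedule preventive maintenance")
    return None

now = datetime(2024, 1, 1, 12, 0)
history = [now - timedelta(hours=h) for h in (1, 5, 30, 50)]
suggestion = maintenance_suggestion(
    "network_fault", history, now, timedelta(days=1), threshold=1
)
```

With the assumed history, two faults fall inside the one-day window, which exceeds a threshold of one and produces a suggestion; raising the threshold to two would produce none.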
According to the above scheme, the invention provides an online fault processing method. When an online fault occurs, the operation data information of the system at the moment of the fault is acquired; the fault type of the online fault is then determined based on that information; the fault causes and solutions that match the fault type are then looked up in a database, wherein each fault cause corresponds to at least one solution and is preset with a corresponding probability value; an optimal fault cause and its corresponding solution are then recommended according to the probability values of the fault causes; a fault report is then generated based on the optimal fault cause and the corresponding solution; and finally the fault report is pushed to the corresponding operation and maintenance terminal. The invention automatically acquires and analyzes the fault information of an online system fault and pushes a corresponding solution to operation and maintenance personnel. This optimizes the online fault processing flow, improves processing efficiency, reduces the workload of operation and maintenance personnel, and lowers the requirements on the number and skill of the personnel needed to operate and maintain the system.
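The whole flow from S102 to S106 can be tied together in one function, as in the compact sketch below. The tables `FAULT_TYPES` and `CAUSES`, the plain-text report layout and the `push` callback are illustrative assumptions rather than the patented implementation.

```python
from typing import Callable, Optional

# Hypothetical preset tables standing in for the database.
FAULT_TYPES = {"ConnectionTimeout": "network_fault"}
CAUSES = {
    "network_fault": [
        ("misconfigured firewall rule", 0.3, "roll back the rule change"),
        ("downstream service overloaded", 0.7, "scale out the service"),
    ],
}

def handle_online_fault(operation_data: list[str],
                        push: Callable[[str], None]) -> Optional[str]:
    """Run S102 to S106 on data collected at the moment of the fault."""
    # S102: determine the fault type from the fault data.
    fault_type = next(
        (ft for rec in operation_data
         for kind, ft in FAULT_TYPES.items() if kind in rec),
        None,
    )
    if fault_type is None:
        return None
    # S103: look up the candidate causes and solutions.
    causes = CAUSES.get(fault_type, [])
    if not causes:
        return None
    # S104: recommend the cause with the highest probability value.
    cause, probability, solution = max(causes, key=lambda c: c[1])
    # S105: generate the fault report.
    report = (f"fault type: {fault_type}\n"
              f"likely cause: {cause} (p={probability})\n"
              f"solution: {solution}")
    # S106: push the report to the operation and maintenance terminal.
    push(report)
    return report

sent: list[str] = []
result = handle_online_fault(["ERROR ConnectionTimeout"], sent.append)
```

Here `sent.append` plays the role of the terminal; any callable that accepts the report text could be substituted.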
According to a second aspect of the present invention, there is provided an online fault handling system, as shown in fig. 2, which may include:
The operation data acquisition module 201 is used for acquiring operation data information of the system at the moment of the fault when an online fault occurs;
During the operation of an online system, system stability is the lifeline of an enterprise, so a fault must be located accurately and resolved quickly as soon as it appears. Because an online system generally runs continuously, some faults are inevitable, and operation and maintenance personnel must handle them promptly. To repair a fault quickly, when an online fault occurs while the system is running, the operation data information of the system at the moment of the fault is collected first, and the fault cause is then sought in that information, so that the fault can be repaired quickly and in a targeted manner.
The fault type judging module 202 is configured to determine a fault type of an online fault based on operation data information at a fault time;
After the operation data information at the moment of the fault is acquired, the fault self-diagnosis system determines the fault type of the online fault from that information, so that the corresponding fault cause and matching solution can subsequently be looked up by fault type. Specifically, in this embodiment, the fault type judging module includes: a fault data extraction unit, used for extracting fault data from the operation data information at the moment of the fault, the fault data being data generated as a result of the online fault; and a fault type matching unit, used for matching the corresponding fault type, according to the type of the fault data, in a fault type table preset in the database.
It should be noted that when an online fault occurs, some abnormal data inevitably appears in the operation data at the moment of the fault, and this abnormal data is often directly related to the fault cause. The abnormal data is therefore extracted from the operation data information as the fault data and analyzed to find the fault cause quickly. Different types of fault data correspond to different fault types. Accordingly, online faults over an earlier period are analyzed statistically to find the correlation between fault data and fault types, and the result is stored in the database as a fault type table that maps fault data to fault types. During fault self-diagnosis, the fault type can then be matched by analyzing the fault data.
The fault cause searching module 203 is configured to search a database for fault causes and solutions matched with the fault types, where each fault cause corresponds to at least one solution, and each fault cause is preset with a corresponding probability value;
After the fault type is determined, the fault causes and solutions that match the fault type are looked up in the database. The correspondence between fault types, fault causes and solutions is likewise obtained by analyzing and handling online faults over an earlier period, and is stored in the database in advance. Each fault type may correspond to one or more fault causes. Based on the analysis of earlier fault data, a probability value is set for each of the fault causes that correspond to the same fault type; the higher the probability value, the more likely it is that the fault type was caused by that fault cause. In addition, at least one solution is preset for each fault cause that can lead to a given fault type. In this way, once the fault type is determined, the corresponding fault causes and solutions can be matched automatically in the database.
The solution recommending module 204 recommends an optimal fault reason and a corresponding solution according to the probability value of the fault reason;
After the fault causes and solutions that match the fault type are found, the system automatically recommends the optimal fault cause and its corresponding solution according to the probability values of the fault causes that can lead to the fault type. The optimal fault cause is the one most likely to have caused the fault type, that is, the fault cause with the highest probability value. Recommending the optimal fault cause gives operation and maintenance personnel a direction and a solution for repairing the fault, shortens the time they spend detecting and handling it, and improves fault processing efficiency.
A fault report generation module 205, configured to generate a fault report based on the optimal fault cause and the corresponding solution;
The system generates a fault report from the recommended optimal fault cause and the corresponding solution. The fault report provides operation and maintenance personnel with technical guidance and data support for resolving the online fault.
The fault report pushing module 206 is configured to push the fault report to the corresponding operation and maintenance terminal.
Finally, the generated fault report is pushed to the operation and maintenance terminal, so that the personnel holding the terminal can learn the fault information and the corresponding handling measures remotely and quickly, and can be guided by the report to resolve the online fault quickly. After the fault report is pushed to the terminal, an alarm prompt tone may be used to prompt the personnel to respond quickly.
In this embodiment, the system may further include:
and the running state monitoring module is used for monitoring the running state of the system in real time.
By monitoring the running state of the system in real time, the data acquisition mechanism can be triggered as soon as an online fault occurs, so that the operation data information of the system at the moment of the fault is collected immediately.
Specifically, in this embodiment, the operation state monitoring module includes:
the state data acquisition unit is used for acquiring the running state data of the system according to preset interval time;
the state data comparison unit is used for comparing the running state data with the preset state data of normal running of the system;
and the online fault judging unit is used for judging whether the system has online faults according to the comparison result.
In this embodiment, the system may further include:
and the maintenance suggestion pushing module is used for acquiring the fault frequency information of the fault type within the preset time and pushing the system maintenance suggestions to the corresponding operation and maintenance terminal based on the fault frequency information.
To allow targeted maintenance of the system in advance of further online faults of a given fault type, the fault frequency information of the fault type within a preset time can be obtained and analyzed statistically. If the number of occurrences of the fault type within the preset time is greater than a preset frequency threshold, a corresponding system maintenance suggestion is formed and pushed to the operation and maintenance terminal to remind the personnel to maintain the system in time. This prevents online faults of the same fault type from occurring at high frequency and further ensures stable operation of the system.
According to the above scheme, the invention provides an online fault processing system. When an online fault occurs, the operation data information of the system at the moment of the fault is acquired; the fault type of the online fault is then determined based on that information; the fault causes and solutions that match the fault type are then looked up in a database, wherein each fault cause corresponds to at least one solution and is preset with a corresponding probability value; an optimal fault cause and its corresponding solution are then recommended according to the probability values of the fault causes; a fault report is then generated based on the optimal fault cause and the corresponding solution; and finally the fault report is pushed to the corresponding operation and maintenance terminal. The invention automatically acquires and analyzes the fault information of an online system fault and pushes a corresponding solution to operation and maintenance personnel. This optimizes the online fault processing flow, improves processing efficiency, reduces the workload of operation and maintenance personnel, and lowers the requirements on the number and skill of the personnel needed to operate and maintain the system.
In this embodiment, the modules/units integrated in the online fault handling system may be stored in a computer-readable storage medium if they are implemented in the form of software functional units and sold or used as independent products. Based on this understanding, all or part of the flow of the method embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium; when the computer program is executed by a processor, the steps of the method embodiments are implemented. The computer program comprises computer program code, which may be in the form of source code, object code, an executable file, or some intermediate form. The computer-readable medium may include any entity or device capable of carrying the computer program code, such as a recording medium, a USB disk, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a Read-Only Memory (ROM), a Random Access Memory (RAM), an electrical carrier signal, a telecommunications signal, or a software distribution medium. It should be noted that the content of the computer-readable medium may be appropriately increased or decreased as required by legislation and patent practice in a given jurisdiction; for example, in some jurisdictions, according to legislation and patent practice, computer-readable media do not include electrical carrier signals and telecommunications signals.
Those skilled in the art will appreciate that, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. An online fault handling method, the method comprising:
when an online fault occurs, acquiring operation data information of a system at the moment of the fault;
determining the fault type of the online fault based on the operation data information at the fault moment;
searching a database for fault reasons and solutions matching the fault type, wherein each fault reason corresponds to at least one solution, and each fault reason has a preset probability value;
recommending an optimal fault reason and a corresponding solution according to the probability value of the fault reason;
generating a fault report based on the optimal fault reason and the corresponding solution;
and pushing the fault report to the corresponding operation and maintenance terminal.
2. The online fault handling method of claim 1, further comprising:
monitoring the running state of the system in real time.
3. The online fault handling method according to claim 2, wherein monitoring the running state of the system in real time comprises:
acquiring running state data of the system at a preset time interval;
comparing the running state data with preset state data for normal operation of the system;
and judging whether the system has an online fault according to the comparison result.
4. The online fault handling method according to claim 1, wherein determining the fault type of the online fault based on the operation data information at the fault moment comprises:
extracting fault data from the operation data information at the fault moment, wherein the fault data is data generated due to the online fault of the system;
and matching the corresponding fault type in a fault type table preset in the database according to the type of the fault data.
5. The online fault handling method according to any one of claims 1-4, further comprising:
acquiring fault frequency information of the fault type within a preset time, and pushing a system maintenance suggestion to the corresponding operation and maintenance terminal based on the fault frequency information.
6. An online fault handling system, the system comprising:
an operation data acquisition module, used for acquiring operation data information of a system at a fault moment when an online fault occurs;
a fault type judging module, used for determining the fault type of the online fault based on the operation data information at the fault moment;
a fault cause searching module, used for searching a database for fault reasons and solutions matching the fault type, wherein each fault reason corresponds to at least one solution, and each fault reason has a preset probability value;
a solution recommending module, used for recommending an optimal fault reason and a corresponding solution according to the probability value of the fault reason;
a fault report generation module, used for generating a fault report based on the optimal fault reason and the corresponding solution;
and a fault report pushing module, used for pushing the fault report to the corresponding operation and maintenance terminal.
7. The online fault handling system of claim 6, further comprising:
a running state monitoring module, used for monitoring the running state of the system in real time.
8. The online fault handling system of claim 7, wherein the running state monitoring module comprises:
a state data acquisition unit, used for acquiring running state data of the system at a preset time interval;
a state data comparison unit, used for comparing the running state data with preset state data for normal operation of the system;
and an online fault judging unit, used for judging whether the system has an online fault according to the comparison result.
9. The online fault handling system of claim 6, wherein the fault type judging module comprises:
a fault data extraction unit, used for extracting fault data from the operation data information at the fault moment, wherein the fault data is data generated due to the online fault of the system;
and a fault type matching unit, used for matching the corresponding fault type in a fault type table preset in the database according to the type of the fault data.
10. The online fault handling system according to any one of claims 6-9, further comprising:
a maintenance suggestion pushing module, used for acquiring fault frequency information of the fault type within a preset time and pushing a system maintenance suggestion to the corresponding operation and maintenance terminal based on the fault frequency information.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910879471.7A CN110704224A (en) 2019-09-18 2019-09-18 Online fault processing method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910879471.7A CN110704224A (en) 2019-09-18 2019-09-18 Online fault processing method and system

Publications (1)

Publication Number Publication Date
CN110704224A true CN110704224A (en) 2020-01-17

Family

ID=69194568

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910879471.7A Pending CN110704224A (en) 2019-09-18 2019-09-18 Online fault processing method and system

Country Status (1)

Country Link
CN (1) CN110704224A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107562034A (en) * 2017-07-14 2018-01-09 宝沃汽车(中国)有限公司 Fault handling method and processing system on line
CN108501980A (en) * 2018-03-23 2018-09-07 固安信通信号技术股份有限公司 The monitoring method and terminal device of track circuit equipment
CN110198526A (en) * 2019-05-30 2019-09-03 北京市众诚恒祥能源投资管理有限公司 A kind of combustor fault diagnosis system based on Internet of Things

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination