CN117648206A - System abnormal information monitoring and analyzing method, device, equipment and storage medium - Google Patents


Publication number
CN117648206A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311537203.XA
Other languages
Chinese (zh)
Inventor
苏祺云
胡凌壮
袁骞
李显
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Kaidisi Intelligent Technology Co ltd
Original Assignee
Shenzhen Kaidisi Intelligent Technology Co ltd


Abstract

The invention discloses a system anomaly information monitoring and analyzing method, a system and equipment, wherein the method comprises the following steps: acquiring abnormal information of a plurality of service systems received by a message middleware server in real time; acquiring a predefined configuration file, acquiring message information of abnormal information according to the configuration file, and storing the abnormal information in a database in a preset mode; identifying a target service system generating abnormal information according to the message information, acquiring cache information of the target service system, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the cache information of the target service system; acquiring an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold. The method solves the problems that abnormal information in each system is scattered and is difficult to check and easy to miss, and thus improves the capability of daily checking and analyzing the system abnormality.

Description

System abnormal information monitoring and analyzing method, device, equipment and storage medium
Technical Field
The present invention relates to the field of server monitoring, and in particular, to a method, a system, an apparatus, and a storage medium for monitoring and analyzing system anomaly information.
Background
Currently, most systems deploy a load balancing service to ensure service stability and throughput. When the same system is deployed on a plurality of servers, the load balancing service distributes traffic across the servers, which eliminates single points of failure in the system.
However, when the same system is deployed on a plurality of servers, its abnormal logs are scattered across those servers. After a problem occurs, it cannot be determined quickly which server's instance is affected, and the relevant operation and maintenance personnel or developers must log in, one by one, to every server on which the system is deployed to search for the corresponding abnormal logs. This makes the abnormal logs harder to locate, makes abnormalities easy to overlook, and prolongs the time needed to locate the abnormal logs.
Accordingly, there is a need for improvement and development in the art.
Disclosure of Invention
The main object of the invention is to provide a system abnormal information monitoring and analyzing method, system, equipment and storage medium, so as to solve the following problem in the prior art: when the same system is deployed on a plurality of servers, the abnormal logs are distributed across different servers, so that after a problem occurs the abnormal logs are harder to locate, abnormalities are easily overlooked, and the time needed to locate the abnormal logs is prolonged.
In order to achieve the object, a first aspect of the present invention provides a system anomaly information monitoring and analyzing method, wherein the system anomaly information monitoring and analyzing method includes:
acquiring abnormal information of a plurality of service systems received by a message middleware server in real time;
acquiring a predefined configuration file, acquiring message information of the abnormal information according to the configuration file, and storing the abnormal information into a database in a preset mode;
identifying a target service system generating the abnormal information according to the message information, acquiring target service system cache information, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the target service system cache information;
and acquiring an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold.
Optionally, the step of monitoring the abnormal information of the plurality of service systems in real time includes:
deploying message middleware service ends, and respectively deploying message middleware client ends in a plurality of service systems;
defining the configuration file, and storing the configuration file in a system where the message middleware server is located and a system where the message middleware client is located, wherein the configuration file comprises a service system identifier, an anomaly type code and the anomaly threshold value.
Optionally, the step of obtaining, in real time, the abnormal information of the plurality of service systems received by the message middleware server includes:
monitoring the abnormal information of a plurality of service systems in real time according to the message middleware server;
when abnormal information occurs in the service system, pushing the abnormal information to the message middleware server according to the message middleware client;
and acquiring the abnormal information of the plurality of service systems received by the message middleware server according to the abnormal information pushed to the message middleware server.
Optionally, the anomaly information includes anomaly content information, a current service system identifier and a current anomaly type code;
the step of obtaining the predefined configuration file, obtaining the message information of the abnormal information according to the configuration file, and storing the abnormal information in a database in a preset manner comprises the following steps:
acquiring the configuration file, and acquiring message information in the abnormal information according to the configuration file, wherein the message information comprises abnormal source information, abnormal type information, abnormal content information and abnormal time;
and constructing a message object according to the abnormal source information, the abnormal type information, the abnormal content information and the abnormal time, and storing the message object in a preset format into the database.
Optionally, the step of identifying the target service system generating the anomaly information according to the message information, obtaining the cache information of the target service system, and calculating and storing the current anomaly times of the target service system according to the historical anomaly times in the cache information of the target service system includes:
identifying a target service system generating the abnormal information according to the abnormal source information in the message information;
acquiring target service system cache information from the cache, and acquiring the historical abnormal times according to the target service system cache information;
and adding one to the historical abnormal times to obtain the current abnormal times, and storing the current abnormal times.
Optionally, the step of obtaining the abnormality threshold according to the configuration file, and notifying the user according to a preset notification mode when the current abnormality number exceeds the abnormality threshold includes:
acquiring the abnormal threshold according to the configuration file, and judging whether the current abnormal times exceed the abnormal threshold according to the abnormal threshold;
and when the current abnormal times exceeds the abnormal threshold, notifying a user according to a preset notification mode, and updating the abnormal times in the cache information of the target service system to 0.
Optionally, the step of obtaining the abnormality threshold according to the configuration file, when the current abnormality number exceeds the abnormality threshold, notifying the user according to a preset notification mode further includes:
acquiring the abnormal information according to the database;
and displaying the abnormal information on a display interface.
The second aspect of the present invention provides a system anomaly information monitoring and analyzing system, wherein the system anomaly information monitoring and analyzing system includes:
the abnormal information receiving module is used for acquiring abnormal information of a plurality of service systems received by the message middleware server in real time;
the abnormal information storage module is used for acquiring a predefined configuration file, acquiring message information of the abnormal information according to the configuration file, and storing the abnormal information into a database in a preset mode;
the abnormal times acquisition module is used for identifying a target service system generating the abnormal information according to the message information, acquiring cache information of the target service system, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the cache information of the target service system;
and the notification module is used for acquiring an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold.
A third aspect of the present invention provides an apparatus including a memory, a processor, and a system abnormality information monitoring analysis program stored on the memory and operable on the processor, the system abnormality information monitoring analysis program implementing the steps of any one of the system abnormality information monitoring analysis methods when executed by the processor.
A fourth aspect of the present invention provides a computer-readable storage medium having stored thereon a system abnormality information monitoring analysis program which, when executed by a processor, implements the steps of any one of the system abnormality information monitoring analysis methods.
From the above, in the scheme of the invention, the abnormal information of a plurality of service systems received by the message middleware server is obtained in real time; acquiring a predefined configuration file, acquiring message information of the abnormal information according to the configuration file, and storing the abnormal information into a database in a preset mode; identifying a target service system generating the abnormal information according to the message information, acquiring target service system cache information, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the target service system cache information; and acquiring an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold.
Compared with the prior art, the invention solves the problem that, when the same system is deployed on a plurality of servers, the abnormal logs become harder to locate after a problem occurs, abnormalities are easily overlooked, and the time needed to locate the abnormal logs is prolonged. After abnormal information is received, it is classified, organized and stored in a database, so that a user can conveniently review the abnormal conditions of all systems on a regular basis. In addition, for repeated abnormal information from the same system, the number of times abnormal information has been generated is counted, and when that number exceeds the abnormal threshold the user is notified, so that the user learns of urgent abnormal conditions in time and the safety of the system is ensured.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic flow chart of a system anomaly information monitoring and analyzing method according to an embodiment of the present invention;
FIG. 2 is a schematic flow chart of an embodiment of a system anomaly information monitoring and analyzing method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a system anomaly information monitoring and analyzing system according to an embodiment of the present invention;
FIG. 4 is a schematic block diagram of an internal structure of an apparatus according to an embodiment of the present invention.
Detailed Description
In the following description, for purposes of explanation and not limitation, specific details are set forth such as the particular system architecture, techniques, etc., in order to provide a thorough understanding of the embodiments of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.
It should be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the invention is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in this specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in the present specification and the appended claims refers to any and all possible combinations of one or more of the associated listed items, and includes such combinations.
As used in this specification and the appended claims, the term "if" may be interpreted in the context of "when …" or "once" or "in response to a determination" or "in response to a classification". Similarly, the phrase "if determined" or "if classified to [ described condition or event ]" may be interpreted in the context of meaning "upon determination" or "in response to determination" or "upon classification to [ described condition or event ]" or "in response to classification to [ described condition or event ]".
The following description of the embodiments of the present invention will be made more fully hereinafter with reference to the accompanying drawings, in which embodiments of the invention are shown, it being evident that the embodiments described are only some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways other than those described herein, and persons skilled in the art will readily appreciate that the present invention is not limited to the specific embodiments disclosed below.
Exemplary method
As shown in fig. 1, an embodiment of the present invention provides a system anomaly information monitoring and analyzing method, specifically, the system anomaly information monitoring and analyzing method includes the following steps:
Step S100, obtaining abnormal information of a plurality of service systems received by the message middleware server in real time.
It should be noted that message middleware is a cross-process, asynchronous communication mechanism for passing messages between upstream and downstream systems. The message system ensures reliable delivery of messages and can be used to collect logs and abnormal information. In the embodiment of the application, the message middleware may be RabbitMQ (Rabbit Message Queue) message middleware. The RabbitMQ server is written in the Erlang language, its clustering and failover are built on the Open Telecom Platform (OTP) framework, and client libraries for communicating with the broker are available for all major programming languages.
Through the message middleware clients deployed in the service systems, the deployed message middleware server can receive, in real time, the abnormal information generated by those service systems. Therefore, when abnormal information appears in any of the service systems, the message middleware client sends it to the message middleware server, so that the abnormal logs from every system can be viewed centrally without opening each service system one by one to check for abnormalities, which makes it easier for staff to inspect system abnormalities.
Further, the step of monitoring the abnormal information of the plurality of service systems in real time includes:
deploying message middleware service ends, and respectively deploying message middleware client ends in a plurality of service systems;
defining the configuration file, and storing the configuration file in a system where the message middleware server is located and a system where the message middleware client is located, wherein the configuration file comprises a service system identifier, an anomaly type code and the anomaly threshold value.
Specifically, the application addresses the problem that, when the same system is deployed on a plurality of servers, the abnormal logs are distributed across different servers, so that after a problem occurs the abnormal logs are harder to locate, abnormalities are easily overlooked, and the time needed to locate the abnormal logs is prolonged. A message middleware server therefore needs to be deployed in the local system that applies the system abnormal information monitoring and analyzing method, and a message middleware client needs to be deployed in each of the plurality of service systems. In the embodiment of the application, the message middleware client and the message middleware server both use RabbitMQ message middleware. The message middleware server subscribes to the RabbitMQ messages, namely the abnormal information, of the message middleware clients of the plurality of service systems. It should be noted that, in this application, the plurality of service systems are the service systems selected in advance by the user as those whose abnormal information is to be obtained.
The configuration file comprises three fields, namely an anomaly threshold, an anomaly type code and a system identifier, and is stored in a database as structured data. After the abnormal information of the plurality of service systems received by the message middleware server is obtained, the configuration file is extracted from the database and used to process the abnormal information. The anomaly threshold is preset by the user: when multiple pieces of abnormal information are received from the same service system and their number exceeds the anomaly threshold, the user is notified. The anomaly type codes are Error, Warn and Info, where Error denotes an error, Warn denotes a warning, and Info denotes general information. These three anomaly type codes correspond to the log levels of the system logs; after the abnormal information of the plurality of service systems received by the message middleware server is obtained, the anomaly type code of the current abnormal information is determined by parsing the content of the abnormal information and matching it against these codes. The system identifiers are user-defined; each identifier corresponds to a unique service system, so the system identifier determines which service system generated the abnormal information.
The configuration file is also stored on the message middleware client side, so that when a service system generates an abnormality, the abnormal information transmitted to the message middleware server comprises the generated abnormal content, the service system identifier and the anomaly type code.
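The three configuration fields described above can be illustrated with a minimal Python sketch. The JSON layout, the field names (`system_id`, `anomaly_threshold`, `anomaly_type_codes`), the example service names, and the choice of one threshold per service system are all assumptions made for illustration; the source only states that the configuration holds a system identifier, an anomaly type code and an anomaly threshold.

```python
import json
from dataclasses import dataclass

# Hypothetical configuration content; every name and value here is illustrative.
CONFIG_JSON = """
{
  "systems": [
    {"system_id": "order-service", "anomaly_threshold": 20},
    {"system_id": "billing-service", "anomaly_threshold": 5}
  ],
  "anomaly_type_codes": ["Error", "Warn", "Info"]
}
"""


@dataclass(frozen=True)
class SystemConfig:
    system_id: str          # user-defined identifier, unique per service system
    anomaly_threshold: int  # the user is notified once the count exceeds this


def load_config(text):
    """Parse the configuration and index the service systems by identifier."""
    raw = json.loads(text)
    systems = {}
    for entry in raw["systems"]:
        if entry["system_id"] in systems:
            raise ValueError("system identifiers must be unique")
        systems[entry["system_id"]] = SystemConfig(
            entry["system_id"], int(entry["anomaly_threshold"])
        )
    return systems, frozenset(raw["anomaly_type_codes"])


SYSTEMS, TYPE_CODES = load_config(CONFIG_JSON)
```

Indexing by identifier reflects the statement that each identifier corresponds to exactly one service system, which is why duplicates are rejected in this sketch.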
Further, the step of obtaining, in real time, the abnormal information of the plurality of service systems received by the message middleware server includes:
monitoring the abnormal information of a plurality of service systems in real time according to the message middleware server;
when abnormal information occurs in the service system, pushing the abnormal information to the message middleware server according to the message middleware client;
and acquiring the abnormal information of the plurality of service systems received by the message middleware server according to the abnormal information pushed to the message middleware server.
Through the message middleware server deployed locally and the message middleware clients deployed in the service systems, abnormal information of the plurality of service systems can be obtained at any time, which achieves the monitoring effect. Because the message middleware server subscribes to the RabbitMQ messages of the message middleware clients, when abnormal information appears in a service system the message middleware client pushes the corresponding abnormal information to the message middleware server, and the message middleware server thus receives the abnormal information of the plurality of service systems.
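The push-and-receive pattern described above can be sketched in Python as follows. This is only an in-process simulation: a standard-library `queue.Queue` stands in for the RabbitMQ queue, threads stand in for the service systems, and the function names, field names and example messages are hypothetical.

```python
import queue
import threading
from datetime import datetime, timezone

# In-process stand-in for the queue held by the message middleware server
# (assumption: a real deployment would use a RabbitMQ queue instead).
anomaly_queue = queue.Queue()


def push_anomaly(system_id, type_code, content):
    """Client side: attach identifier, type code and time, then push."""
    anomaly_queue.put({
        "system_id": system_id,
        "type_code": type_code,
        "content": content,
        "occurred_at": datetime.now(timezone.utc).isoformat(),
    })


def receive_anomalies(expected, timeout=1.0):
    """Server side: collect `expected` messages, waiting up to `timeout` each."""
    return [anomaly_queue.get(timeout=timeout) for _ in range(expected)]


# Two hypothetical service systems push abnormal information concurrently.
clients = [
    threading.Thread(target=push_anomaly,
                     args=("order-service", "Error", "checkout failed")),
    threading.Thread(target=push_anomaly,
                     args=("billing-service", "Warn", "slow database query")),
]
for client in clients:
    client.start()
for client in clients:
    client.join()

received = receive_anomalies(expected=2)
```

The point of the sketch is that the server never polls the individual service systems; it only consumes whatever the clients have pushed.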
Step S200, a predefined configuration file is obtained, message information of the abnormal information is obtained according to the configuration file, and the abnormal information is stored in a database in a preset mode.
Specifically, since the configuration file is defined in advance, and the exception message includes the service system identifier and the exception type code, that is, the message information, the exception information can be stored correspondingly according to the message information. After the corresponding abnormal information is received, the abnormal information is classified, sorted and stored in the database, so that a user can conveniently check the abnormal conditions of all systems regularly.
Further, the anomaly information comprises anomaly content information, a current service system identifier and a current anomaly type code;
the step of obtaining the predefined configuration file, obtaining the message information of the abnormal information according to the configuration file, and storing the abnormal information in a database in a preset manner comprises the following steps:
acquiring the configuration file, and acquiring message information in the abnormal information according to the configuration file, wherein the message information comprises abnormal source information, abnormal type information, abnormal content information and abnormal time;
And constructing a message object according to the abnormality source information, the abnormality type information, the abnormality content information and the abnormality time, and storing the message object in a preset format into the database.
The abnormal information is the abnormality generated by the service system. It comprises the abnormal content information together with the service system identifier and the anomaly type code taken from the configuration file. The service system identifier in the abnormal information corresponds to the service system that generated the current abnormal content information and is recorded as the current service system identifier; the current anomaly type code is the anomaly type code corresponding to the transmitted abnormal information.
When the message middleware server receives the abnormal information, the configuration file of the system where the message middleware server is located is obtained, and the message information in the abnormal information, namely the abnormal source information, the abnormal type information, the abnormal content information and the abnormal time, can be identified according to the configuration file. The abnormal source information is obtained from the current service system identifier in the abnormal information; the abnormal type information is obtained from the current anomaly type code in the abnormal information; the abnormal content information is the content carried in the abnormal information; and the abnormal time is the time at which the abnormality was generated, which the message middleware client adds when sending the abnormal information so that the user can know when the abnormality occurred.
A message object is constructed from the abnormal source information, the abnormal type information, the abnormal content information and the abnormal time; that is, the received message is stored in the database as a single object made up of these four items, which is referred to as the message object in the embodiment of the application. Before being stored, the message object is converted to JSON format, and it is then stored in a MongoDB database.
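As a rough illustration of building the message object and serializing it to JSON, consider the Python sketch below. The class name, field names and known identifiers are assumptions, and a plain list stands in for the database collection so that the example runs without a MongoDB server.

```python
import json
from dataclasses import asdict, dataclass

# Hypothetical values mirroring the configuration file (assumptions).
KNOWN_SYSTEM_IDS = {"order-service", "billing-service"}
KNOWN_TYPE_CODES = {"Error", "Warn", "Info"}


@dataclass
class MessageObject:
    source: str        # abnormal source information (service system identifier)
    anomaly_type: str  # abnormal type information (anomaly type code)
    content: str       # abnormal content information
    occurred_at: str   # abnormal time, added by the client when sending


def build_message_object(anomaly):
    """Validate the identifier and type code, then build the message object."""
    if anomaly["system_id"] not in KNOWN_SYSTEM_IDS:
        raise ValueError(f"unknown service system: {anomaly['system_id']}")
    if anomaly["type_code"] not in KNOWN_TYPE_CODES:
        raise ValueError(f"unknown anomaly type code: {anomaly['type_code']}")
    return MessageObject(
        source=anomaly["system_id"],
        anomaly_type=anomaly["type_code"],
        content=anomaly["content"],
        occurred_at=anomaly["occurred_at"],
    )


stored_documents = []  # in-memory stand-in for the database collection


def store_message_object(message):
    """Serialize the message object to JSON and append it to the store."""
    document = json.dumps(asdict(message), ensure_ascii=False)
    stored_documents.append(document)
    return document


example = {
    "system_id": "order-service",
    "type_code": "Error",
    "content": "checkout failed",
    "occurred_at": "2023-11-17T08:00:00+00:00",
}
document = store_message_object(build_message_object(example))
```

Rejecting unknown identifiers and type codes is a design choice of this sketch; the source does not say how unmatched values are handled.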
Step S300, identifying the target service system generating the abnormal information according to the message information, acquiring the cache information of the target service system, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the cache information of the target service system.
It should be noted that the target service system cache information is the cache information of one service system, in which the number of abnormalities of that target service system is cached. Because the message information contains the abnormal source information, the target service system that generated the abnormal information can be identified from the message information, and the cache information stored for that target service system is acquired accordingly, so that the current abnormal times can be obtained by adding the current abnormality to the historical abnormal times.
The step of identifying the target service system generating the abnormal information according to the message information, obtaining the cache information of the target service system, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the cache information of the target service system comprises the following steps:
identifying a target service system generating the abnormal information according to the abnormal source information in the message information;
acquiring target service system cache information from the cache, and acquiring the historical abnormal times according to the target service system cache information;
and adding one to the historical abnormal times to obtain the current abnormal times, and storing the current abnormal times.
To explain further, each time the message middleware server receives one piece of abnormal information, it is counted once, the target service system that generated the abnormal information is identified, and the count is stored in the cache. In the embodiment of the application, the cache may be a Redis cache. Each time abnormal information is received, the count of the corresponding target service system is taken out of the Redis cache and incremented by one to obtain the current abnormal times, which is then stored back in the Redis cache as the latest historical abnormal times.
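The read, add one, write back sequence can be sketched in Python as below. The class `AnomalyCounterCache` and the function `record_anomaly` are hypothetical names, and a dictionary stands in for the Redis cache so that the example is self-contained.

```python
class AnomalyCounterCache:
    """Dictionary-backed stand-in for the cache (assumption: Redis in practice)."""

    def __init__(self):
        self._counts = {}

    def get(self, system_id):
        # A service system with no cached entry has a historical count of 0.
        return self._counts.get(system_id, 0)

    def set(self, system_id, count):
        self._counts[system_id] = count


def record_anomaly(cache, system_id):
    """Add one to the historical count and store it as the current count."""
    historical = cache.get(system_id)
    current = historical + 1
    cache.set(system_id, current)  # becomes the latest historical count
    return current


cache = AnomalyCounterCache()
first = record_anomaly(cache, "order-service")
second = record_anomaly(cache, "order-service")
other = record_anomaly(cache, "billing-service")
```

Note that a separate read followed by a write is not atomic. If several consumers shared one cache, an atomic increment such as the Redis `INCR` command would likely be preferable, although the source does not discuss concurrency.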
Step S400, obtaining an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold.
Specifically, for repeated abnormal information from the same system, the number of times abnormal information has been generated is evaluated, and when that number exceeds the abnormal threshold the user is notified of the abnormal condition, so that the user learns of urgent abnormal conditions in time and the safety of the system is ensured.
Further, the step of obtaining the abnormality threshold according to the configuration file, and notifying the user according to a preset notification mode when the current abnormality number exceeds the abnormality threshold includes:
acquiring the abnormal threshold according to the configuration file, and judging whether the current abnormal times exceed the abnormal threshold according to the abnormal threshold;
and when the current abnormal times exceeds the abnormal threshold, notifying a user according to a preset notification mode, and updating the abnormal times in the cache information of the target service system to 0.
Specifically, in one embodiment of the present application, the anomaly threshold may be set to 20; that is, when the current abnormal times exceeds 20, the user is notified and the abnormal times in the cache information of the target service system is updated to 0. In one embodiment of the present application, the user may be notified by email or short message.
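The threshold check and reset can be sketched in Python as follows. The function name `check_and_notify` and the callback-based notification are assumptions; the callback stands in for whatever email or short message channel is configured, and a dictionary stands in for the cache.

```python
def check_and_notify(counts, system_id, threshold, notify):
    """Notify and reset the count when it strictly exceeds the threshold."""
    current = counts.get(system_id, 0)
    if current > threshold:
        notify(system_id, current)  # e.g. send an email or short message
        counts[system_id] = 0       # reset the cached count after notifying
        return True
    return False


sent = []  # records notifications instead of actually sending them


def fake_notify(system_id, count):
    sent.append((system_id, count))


counts = {"order-service": 20}
# A count of 20 does not exceed a threshold of 20, so nothing is sent.
at_threshold = check_and_notify(counts, "order-service", 20, fake_notify)
counts["order-service"] += 1
# A count of 21 exceeds the threshold: notify once and reset to 0.
over_threshold = check_and_notify(counts, "order-service", 20, fake_notify)
```

The strict comparison follows the wording "exceeds the abnormal threshold"; with a threshold of 20, the notification in this sketch fires on the 21st abnormality.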
The step of obtaining the abnormal threshold according to the configuration file, and notifying the user according to a preset notification mode when the current abnormal times exceed the abnormal threshold, further comprises:
acquiring the abnormal information according to the database;
and displaying the abnormal information on a display interface.
In the embodiment of the application, after the abnormal information is received, the abnormal information can be extracted from the database and displayed on the display interface at the front end through a visualization method, so that the abnormal information can be provided for operation and maintenance personnel to perform daily centralized analysis and monitoring.
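As a rough sketch of how stored abnormal information might be pulled from the database and prepared for display, the example below uses an in-memory SQLite database and a plain-text table. The table name `anomaly_message`, its columns, the grouping by source and type, and the sample rows are all assumptions made for illustration; the source does not name a database product or a display format.

```python
import sqlite3


def summarize_anomalies(conn):
    """Return (source, anomaly_type, count) rows, most frequent first."""
    cursor = conn.execute(
        "SELECT source, anomaly_type, COUNT(*) AS n "
        "FROM anomaly_message "
        "GROUP BY source, anomaly_type "
        "ORDER BY n DESC, source, anomaly_type"
    )
    return cursor.fetchall()


def render_table(rows):
    """Format summary rows as fixed-width text lines for a simple display."""
    lines = [f"{'source':<16}{'type':<8}{'count':>5}"]
    for source, anomaly_type, n in rows:
        lines.append(f"{source:<16}{anomaly_type:<8}{n:>5}")
    return "\n".join(lines)


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE anomaly_message ("
    "source TEXT, anomaly_type TEXT, content TEXT, occurred_at TEXT)"
)
# Made-up sample rows, used only to exercise the query.
conn.executemany(
    "INSERT INTO anomaly_message VALUES (?, ?, ?, ?)",
    [
        ("order-system", "E001", "database timeout", "2024-01-01T10:00:00"),
        ("order-system", "E001", "database timeout", "2024-01-01T10:05:00"),
        ("payment-system", "E002", "null reference", "2024-01-01T10:07:00"),
    ],
)
print(render_table(summarize_anomalies(conn)))
```

A front end would replace `render_table` with its own charting or table component; the query is the part that gives operation and maintenance personnel one view across all systems.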
The flow of the system abnormal information monitoring and analyzing method is further described with reference to fig. 2. Specifically, a plurality of application servers push abnormal messages to the message queue in real time. The message middleware server receives the messages, stores them, and makes them available for visualization. At the same time, it judges whether the number of abnormal messages has reached the threshold, and when it has, raises an alarm and notifies the user.
Compared with the prior art, a message middleware client is deployed in each service system and a message middleware server is deployed locally for the service systems. When abnormal information appears in any of the service systems, the deployed message middleware client sends it to the message middleware server, so that the abnormal logs from every system can be checked in one place, without opening each service system one by one to see whether it contains abnormal logs. This makes it convenient for staff to check system abnormalities. After the abnormal information is received, it is classified, sorted, and stored in a database, so that users can conveniently review the abnormal conditions of all systems on a regular basis. In addition, for multiple pieces of abnormal information from the same system, the number of times the abnormal information has been generated is counted; when that number reaches the abnormal threshold, the user is notified of the abnormal condition, so that the user learns of urgent abnormal conditions in time and the safety of the system is ensured.
Exemplary apparatus
As shown in fig. 3, corresponding to the system anomaly information monitoring and analyzing method, the embodiment of the present invention further provides a system anomaly information monitoring and analyzing system, where the system anomaly information monitoring and analyzing system includes:
the abnormal information receiving module 31 is configured to obtain abnormal information of a plurality of service systems received by the message middleware server in real time;
the abnormal information storage module 32 is configured to obtain a predefined configuration file, obtain message information of the abnormal information according to the configuration file, and store the abnormal information in a database in a preset manner;
an abnormal times acquisition module 33, configured to identify a target service system that generates the abnormal information according to the message information, acquire target service system cache information, and calculate and store a current abnormal times of the target service system according to a historical abnormal times in the target service system cache information;
and the notification module 34 is configured to obtain an anomaly threshold according to the configuration file, and notify a user according to a preset notification mode when the current anomaly number exceeds the anomaly threshold.
It should be noted that, the specific structure and implementation manner of the system anomaly information monitoring and analyzing system and each module or unit thereof may refer to corresponding descriptions in the method embodiments, and are not repeated herein.
The manner in which the modules of the system anomaly information monitoring and analyzing system are divided is not unique and is not specifically limited herein.
Based on the embodiment, the invention also provides equipment, and the functional block diagram of the equipment can be shown in fig. 4. The device comprises a processor 10, a memory 20, a network interface and a display screen 30 connected by a system bus. In one embodiment, when the processor 10 executes the system exception information monitoring and analyzing program 40 in the memory 20, the following steps are implemented:
acquiring abnormal information of a plurality of service systems received by a message middleware server in real time;
acquiring a predefined configuration file, acquiring message information of the abnormal information according to the configuration file, and storing the abnormal information into a database in a preset mode;
identifying a target service system generating the abnormal information according to the message information, acquiring target service system cache information, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the target service system cache information;
and acquiring an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold.
Optionally, before the step of monitoring the abnormal information of the plurality of service systems in real time, the steps further include:
deploying a message middleware server, and deploying a message middleware client in each of the plurality of service systems;
defining the configuration file, and storing the configuration file in a system where the message middleware server is located and a system where the message middleware client is located, wherein the configuration file comprises a service system identifier, an anomaly type code and the anomaly threshold value.
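A configuration file of this kind might look like the sketch below. The source only states that the file contains a service system identifier, an anomaly type code, and the anomaly threshold; the JSON format, the key names `service_systems`, `anomaly_type_codes`, and `anomaly_threshold`, and the sample values are assumptions for illustration.

```python
import json

# Assumed JSON layout; the source only names the three kinds of fields.
CONFIG_TEXT = """
{
  "service_systems": {"SYS01": "order-system", "SYS02": "payment-system"},
  "anomaly_type_codes": {"E001": "database timeout", "E002": "null reference"},
  "anomaly_threshold": 20
}
"""

REQUIRED_KEYS = ("service_systems", "anomaly_type_codes", "anomaly_threshold")


def load_config(text):
    """Parse the configuration and check that the three expected fields are present."""
    config = json.loads(text)
    missing = [key for key in REQUIRED_KEYS if key not in config]
    if missing:
        raise ValueError(f"configuration is missing: {', '.join(missing)}")
    return config


config = load_config(CONFIG_TEXT)
print(config["anomaly_threshold"])  # prints: 20
```

Storing the same file on both the server side and the client side, as the step describes, lets both ends agree on what each identifier and type code means.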
Optionally, the step of obtaining, in real time, the abnormal information of the plurality of service systems received by the message middleware server includes:
monitoring the abnormal information of a plurality of service systems in real time through the message middleware server;
when abnormal information occurs in a service system, pushing the abnormal information to the message middleware server through the message middleware client;
and acquiring the abnormal information of the plurality of service systems received by the message middleware server from the abnormal information pushed to the message middleware server.
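The push from client to server can be illustrated with a toy in-process channel. In this sketch `queue.Queue` from the Python standard library stands in for the message middleware, which the source does not name; the class `MiddlewareClient`, the function `receive_all`, and the message field names are hypothetical.

```python
import queue


class MiddlewareClient:
    """Hypothetical client deployed inside one service system."""

    def __init__(self, system_id, channel):
        self.system_id = system_id
        self.channel = channel

    def push_anomaly(self, type_code, content):
        # The message carries the system identifier, the type code, and the content.
        self.channel.put(
            {"system_id": self.system_id, "type_code": type_code, "content": content}
        )


def receive_all(channel):
    """Server side: drain every message currently waiting in the channel."""
    received = []
    while True:
        try:
            received.append(channel.get_nowait())
        except queue.Empty:
            return received


channel = queue.Queue()  # stand-in for the message middleware
MiddlewareClient("SYS01", channel).push_anomaly("E001", "database timeout")
MiddlewareClient("SYS02", channel).push_anomaly("E002", "null reference")
messages = receive_all(channel)
print([m["system_id"] for m in messages])  # prints: ['SYS01', 'SYS02']
```

The point of the design is that the service systems only push; the server never has to poll each system or open its logs.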
Optionally, the anomaly information includes anomaly content information, a current service system identifier and a current anomaly type code;
The step of obtaining the predefined configuration file, obtaining the message information of the abnormal information according to the configuration file, and storing the abnormal information in a database in a preset manner comprises the following steps:
acquiring the configuration file, and acquiring message information in the abnormal information according to the configuration file, wherein the message information comprises abnormal source information, abnormal type information, abnormal content information and abnormal time;
and constructing a message object according to the abnormal source information, the abnormal type information, the abnormal content information and the abnormal time, and storing the message object in a preset format into the database.
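One possible reading of these two steps is sketched below: the raw identifier and type code in the abnormal information are resolved against the configuration, a message object is built from the four fields, and the object is written to a database. The lookup-by-configuration interpretation, the `AnomalyMessage` dataclass, the SQLite table, and the sample values are assumptions and are not taken from the source.

```python
import sqlite3
from dataclasses import dataclass, astuple


@dataclass
class AnomalyMessage:
    """Message object with the four fields named in the step above."""
    source: str
    anomaly_type: str
    content: str
    occurred_at: str


def build_message(raw, config, occurred_at):
    """Resolve the raw identifier and type code against the configuration."""
    return AnomalyMessage(
        source=config["service_systems"][raw["system_id"]],
        anomaly_type=config["anomaly_type_codes"][raw["type_code"]],
        content=raw["content"],
        occurred_at=occurred_at,
    )


def store_message(conn, message):
    """Insert the message as one row; column order matches the dataclass fields."""
    conn.execute("INSERT INTO anomaly_message VALUES (?, ?, ?, ?)", astuple(message))


# Hypothetical configuration and sample input.
config = {
    "service_systems": {"SYS01": "order-system"},
    "anomaly_type_codes": {"E001": "database timeout"},
}
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE anomaly_message ("
    "source TEXT, anomaly_type TEXT, content TEXT, occurred_at TEXT)"
)
raw = {"system_id": "SYS01", "type_code": "E001", "content": "connection pool exhausted"}
store_message(conn, build_message(raw, config, "2024-01-01T10:00:00"))
print(conn.execute("SELECT * FROM anomaly_message").fetchall())
```

Keeping the source and type as separate columns is what later allows the stored records to be classified and counted per service system.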
Optionally, the step of identifying the target service system generating the anomaly information according to the message information, obtaining the cache information of the target service system, and calculating and storing the current anomaly times of the target service system according to the historical anomaly times in the cache information of the target service system includes:
identifying a target service system generating the abnormal information according to the abnormal source information in the message information;
acquiring target service system cache information from the cache, and acquiring the historical abnormal times according to the target service system cache information;
and adding one to the historical abnormal times to obtain the current abnormal times, and storing the current abnormal times.
Optionally, the step of obtaining the abnormality threshold according to the configuration file, and notifying the user according to a preset notification mode when the current abnormality number exceeds the abnormality threshold includes:
acquiring the abnormal threshold according to the configuration file, and judging whether the current abnormal times exceed the abnormal threshold according to the abnormal threshold;
and when the current abnormal times exceeds the abnormal threshold, notifying a user according to a preset notification mode, and updating the abnormal times in the cache information of the target service system to 0.
Optionally, the step of obtaining the abnormality threshold according to the configuration file, when the current abnormality number exceeds the abnormality threshold, notifying the user according to a preset notification mode further includes:
acquiring the abnormal information according to the database;
and displaying the abnormal information on a display interface.
The processor of the device is configured to provide computing and control capabilities. The memory of the device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a system abnormal information monitoring and analyzing program. The internal memory provides an environment for running the operating system and the system abnormal information monitoring and analyzing program stored in the non-volatile storage medium. The network interface of the device is used to communicate with external devices via a network connection. When executed by the processor, the system abnormal information monitoring and analyzing program implements the steps of any of the system abnormal information monitoring and analyzing methods described above. The display screen of the device may be a liquid crystal display or an electronic ink display.
It will be appreciated by persons skilled in the art that the functional block diagram shown in FIG. 4 is merely a block diagram of some of the structures associated with the present inventive arrangements and is not limiting of the apparatus to which the present inventive arrangements may be applied, as a specific apparatus may include more or fewer components than shown, or may combine some of the components, or have a different arrangement of components.
In one embodiment, an apparatus is provided that includes a memory, a processor, and a system exception information monitoring and analysis program stored on the memory and executable on the processor, the system exception information monitoring and analysis program implementing the steps of any of the system exception information monitoring and analysis methods provided by embodiments of the present invention when executed by the processor.
The embodiment of the invention also provides a computer readable storage medium, wherein the computer readable storage medium is stored with a system abnormal information monitoring and analyzing program, and the steps of any system abnormal information monitoring and analyzing method provided by the embodiment of the invention are realized when the system abnormal information monitoring and analyzing program is executed by a processor.
It should be understood that the sequence numbers of the steps in the embodiments do not imply an order of execution. The execution order of each process should be determined by its function and internal logic, and the sequence numbers should not limit the implementation process of the embodiments of the present invention.
It will be apparent to those skilled in the art that, for convenience and brevity of description, only the division of the functional units and modules is illustrated, and in practical application, the functional distribution may be performed by different functional units and modules, that is, the internal structure of the apparatus is divided into different functional units or modules, so as to perform all or part of the functions described above. The functional units and modules in the embodiment may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit, where the integrated units may be implemented in a form of hardware or a form of a software functional unit. In addition, the specific names of the functional units and modules are only for distinguishing from each other, and are not used for limiting the protection scope of the present invention. The specific working process of the units and modules in the device may refer to the corresponding process in the foregoing method embodiment, which is not described herein again.
Each of the above embodiments is described with its own emphasis. For parts that are not described or depicted in detail in one embodiment, reference may be made to the related descriptions of other embodiments.
Those of ordinary skill in the art will appreciate that the elements and algorithm steps of the examples described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the embodiments provided herein, it should be understood that the disclosed system/apparatus and method may be implemented in other ways. For example, the system/device embodiments described above are merely illustrative, e.g., the division of the modules or units is merely a logical functional division, and may be implemented in other ways, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted, or not performed.
The integrated modules/units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the flow of the method of the above embodiments may be implemented by a computer program instructing the relevant hardware. The computer program may be stored on a computer readable storage medium and, when executed by a processor, may implement the steps of the respective method embodiments. The computer program comprises computer program code, which may be in source code form, object code form, an executable file, or some intermediate form. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM), a random access memory (RAM), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so forth. It should be noted that the content of the computer readable storage medium can be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdiction.
The above embodiments are only intended to illustrate the technical solution of the present invention, not to limit it. Although the invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that the technical solutions described in the foregoing embodiments can still be modified, or some of their technical features can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the embodiments of the invention and remain within the scope of the invention.

Claims (10)

1. The system abnormal information monitoring and analyzing method is characterized by comprising the following steps of:
acquiring abnormal information of a plurality of service systems received by a message middleware server in real time;
acquiring a predefined configuration file, acquiring message information of the abnormal information according to the configuration file, and storing the abnormal information into a database in a preset mode;
identifying a target service system generating the abnormal information according to the message information, acquiring target service system cache information, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the target service system cache information;
and acquiring an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold.
2. The system anomaly information monitoring and analyzing method according to claim 1, wherein before the step of monitoring anomaly information of a plurality of service systems in real time, the method further comprises:
deploying a message middleware server, and deploying a message middleware client in each of the plurality of service systems;
defining the configuration file, and storing the configuration file in a system where the message middleware server is located and a system where the message middleware client is located, wherein the configuration file comprises a service system identifier, an anomaly type code and the anomaly threshold value.
3. The system anomaly information monitoring and analyzing method according to claim 2, wherein the step of acquiring anomaly information of the plurality of service systems received by the message middleware server in real time includes:
monitoring the abnormal information of a plurality of service systems in real time according to the message middleware server;
when abnormal information occurs in the service system, pushing the abnormal information to the message middleware server according to the message middleware client;
and acquiring the abnormal information of the plurality of service systems received by the message middleware server according to the abnormal information pushed to the message middleware server.
4. The system anomaly information monitoring and analyzing method according to claim 2, wherein the anomaly information includes anomaly content information, current service system identification, and current anomaly type code;
the step of obtaining the predefined configuration file, obtaining the message information of the abnormal information according to the configuration file, and storing the abnormal information in a database in a preset manner comprises the following steps:
acquiring the configuration file, and acquiring message information in the abnormal information according to the configuration file, wherein the message information comprises abnormal source information, abnormal type information, abnormal content information and abnormal time;
and constructing a message object according to the abnormality source information, the abnormality type information, the abnormality content information and the abnormality time, and storing the message object in a preset format into the database.
5. The system anomaly information monitoring and analyzing method according to claim 4, wherein the step of identifying the target service system generating the anomaly information based on the message information, obtaining target service system cache information, and calculating and storing the current anomaly times of the target service system based on the historical anomaly times in the target service system cache information comprises:
identifying a target service system generating the abnormal information according to the abnormal source information in the message information;
acquiring target service system cache information from the cache, and acquiring abnormal times according to the target service system cache information;
and adding one to the abnormal times to obtain the current abnormal times, and storing the current abnormal times.
6. The system anomaly information monitoring and analyzing method according to claim 1, wherein the step of acquiring an anomaly threshold value according to the configuration file, and notifying the user according to a preset notification mode when the current anomaly number exceeds the anomaly threshold value comprises:
acquiring the abnormal threshold according to the configuration file, and judging whether the current abnormal times exceed the abnormal threshold according to the abnormal threshold;
and when the current abnormal times exceeds the abnormal threshold, notifying a user according to a preset notification mode, and updating the abnormal times in the cache information of the target service system to 0.
7. The system anomaly information monitoring and analyzing method according to claim 1, wherein the step of acquiring the anomaly threshold according to the configuration file, when the current anomaly number exceeds the anomaly threshold, notifying the user according to a preset notification mode further comprises:
acquiring the abnormal information according to the database;
and displaying the abnormal information on a display interface.
8. The system anomaly information monitoring and analyzing system is characterized by comprising:
the abnormal information receiving module is used for acquiring abnormal information of a plurality of service systems received by the message middleware server in real time;
the abnormal information storage module is used for acquiring a predefined configuration file, acquiring message information of the abnormal information according to the configuration file, and storing the abnormal information into a database in a preset mode;
the abnormal times acquisition module is used for identifying a target service system generating the abnormal information according to the message information, acquiring cache information of the target service system, and calculating and storing the current abnormal times of the target service system according to the historical abnormal times in the cache information of the target service system;
and the notification module is used for acquiring an abnormal threshold according to the configuration file, and notifying a user according to a preset notification mode when the current abnormal times exceed the abnormal threshold.
9. An apparatus comprising a memory, a processor, and a system exception information monitoring and analysis program stored on the memory and executable on the processor, the system exception information monitoring and analysis program when executed by the processor implementing the steps of the system exception information monitoring and analysis method of any one of claims 1-7.
10. A computer-readable storage medium, wherein a system abnormality information monitoring analysis program is stored on the computer-readable storage medium, and the system abnormality information monitoring analysis program, when executed by a processor, implements the steps of the system abnormality information monitoring analysis method according to any one of claims 1 to 7.
CN202311537203.XA 2023-11-16 2023-11-16 System abnormal information monitoring and analyzing method, device, equipment and storage medium Pending CN117648206A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311537203.XA CN117648206A (en) 2023-11-16 2023-11-16 System abnormal information monitoring and analyzing method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311537203.XA CN117648206A (en) 2023-11-16 2023-11-16 System abnormal information monitoring and analyzing method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN117648206A true CN117648206A (en) 2024-03-05

Family

ID=90047014

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311537203.XA Pending CN117648206A (en) 2023-11-16 2023-11-16 System abnormal information monitoring and analyzing method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN117648206A (en)

Similar Documents

Publication Publication Date Title
EP3121725A1 (en) Information processing device and monitoring method
CN100549975C (en) Computer maintenance support system and analysis server
CN112965874B (en) Configurable monitoring alarm method and system
KR102068622B1 (en) Failure prediction system for heterogeneous network security system
CN110740061B (en) Fault early warning method and device and computer storage medium
US20070174732A1 (en) Monitoring system and method
CN109947616A (en) A kind of automatically-monitored operational system of the cloud operating system based on OpenStack technology
EP2608061A1 (en) Method and system for update message tracking and checking
US20030093516A1 (en) Enterprise management event message format
CN110035117A (en) One kind is based on configurable monitoring script monitoring system and monitoring method
CN114095522A (en) Vehicle monitoring method, service system, management terminal, vehicle and storage medium
CN114356499A (en) Kubernetes cluster alarm root cause analysis method and device
CN114024834A (en) Fault positioning method and device, electronic equipment and readable storage medium
CN109240863A (en) A kind of cpu fault localization method, device, equipment and storage medium
CN115658420A (en) Database monitoring method and system
EP2899918A1 (en) Method, apparatus and system for detecting network element load imbalance
CN112910733A (en) Full link monitoring system and method based on big data
CN117648206A (en) System abnormal information monitoring and analyzing method, device, equipment and storage medium
CN113760634A (en) Data processing method and device
CN117220917A (en) Network real-time monitoring method based on cloud computing
CN115981950A (en) Monitoring alarm method, device, equipment and computer readable storage medium
CN116055303A (en) Link monitoring processing method and device, electronic equipment and storage medium
JP4081258B2 (en) Management server system
CN114640567A (en) Apache log analysis method and device
CN113835961A (en) Alarm information monitoring method, device, server and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination