CN110389875B - Method, apparatus and storage medium for monitoring the operating state of a computer system - Google Patents

Method, apparatus and storage medium for monitoring the operating state of a computer system Download PDF

Info

Publication number
CN110389875B
CN110389875B CN201910249543.XA CN201910249543A CN110389875B CN 110389875 B CN110389875 B CN 110389875B CN 201910249543 A CN201910249543 A CN 201910249543A CN 110389875 B CN110389875 B CN 110389875B
Authority
CN
China
Prior art keywords
abnormal event
computer system
type
dictionary tree
event type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910249543.XA
Other languages
Chinese (zh)
Other versions
CN110389875A (en
Inventor
马聪
郭一方
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Unionpay Co Ltd
Original Assignee
China Unionpay Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Unionpay Co Ltd filed Critical China Unionpay Co Ltd
Priority to CN201910249543.XA priority Critical patent/CN110389875B/en
Publication of CN110389875A publication Critical patent/CN110389875A/en
Application granted granted Critical
Publication of CN110389875B publication Critical patent/CN110389875B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Debugging And Monitoring (AREA)
  • Computer And Data Communications (AREA)

Abstract

The present invention relates to a method, apparatus and storage medium for monitoring the operating state of a computer system. A method for monitoring the operational status of a computer system according to one aspect of the present invention comprises the steps of: for a process, obtaining one or more abnormal event types which may occur in the computer system during execution, wherein each abnormal event type has corresponding codes; constructing a dictionary tree associated with the processing procedure, wherein each abnormal event type corresponds to one branch of the dictionary tree, characters contained in the codes of the abnormal event type correspond to nodes of the branch one by one, and the appearance sequence of the characters is consistent with the node connection sequence; and utilizing the dictionary tree to record each abnormal event type to monitor the running state of the computer system.

Description

Method, apparatus and storage medium for monitoring the operating state of a computer system
Technical Field
The present invention relates to computer technology, and more particularly, to a method for monitoring the operating state of a computer system, an apparatus for implementing the method, and a computer readable storage medium containing a computer program for implementing the method.
Background
With the expansion of online transaction traffic, the data throughput of computer systems is increasing and the processing logic is becoming complex. To ensure stable operation of the service, monitoring of the operation state of the computer system is becoming more important.
The current monitoring mainly adopts the following two modes: monitoring based on system resources and monitoring based on system business processes. The former takes the service condition of server resources as a main monitoring target, has the advantages of strong real-time performance and high sensitivity, but cannot realize the real-time monitoring of a specific business processing process; the latter monitors the log through statistical analysis, so that the monitoring of the process level can be realized, but due to the complexity of the log acquisition and analysis process, the monitoring has a certain time delay, and the requirement of a low fault tolerance system (such as a transaction system) cannot be met.
There is therefore a need for a method and apparatus for monitoring the operational state of a computer system that enables real-time and highly sensitive monitoring of the state of the system at the process level.
Disclosure of Invention
It is an object of the present invention to provide a method and apparatus for monitoring the operational state of a computer system that enables real-time and highly sensitive monitoring of the state of the system at the process level.
A method for monitoring the operational status of a computer system according to one aspect of the present invention comprises the steps of:
for a process, obtaining one or more abnormal event types which may occur in the computer system during execution, wherein each abnormal event type has corresponding codes;
constructing a dictionary tree associated with the processing procedure, wherein each abnormal event type corresponds to one branch of the dictionary tree, characters contained in the codes of the abnormal event type correspond to nodes of the branch one by one, and the appearance sequence of the characters is consistent with the node connection sequence; and
and recording each abnormal event type by utilizing the dictionary tree to realize monitoring of the running state of the computer system.
Optionally, in the above method, the types of encoding include a complete encoding type matching a specific abnormal event type and a prefix encoding type matching a plurality of abnormal event types containing the same prefix character.
Optionally, in the above method, each abnormal event type is recorded in the following manner: and allocating a counter and a time sequence queue for each branch end node, wherein the counter records the occurrence times of corresponding abnormal event types, and the time sequence queue records the occurrence times of corresponding abnormal event types.
Optionally, in the above method, further comprising:
and if the number of times of occurrence of the corresponding type of abnormal event in the sliding time window is determined to be greater than a set threshold value based on the counter and the time sequence queue, generating an alarm message.
Optionally, in the above method, further comprising:
statistics of the operating state of the computer system are generated based on the counter and a record of the time series queue.
Optionally, in the above method, the abnormal event type includes one or more of the following: memory overflow, memory leakage, communication timeout, communication interruption, and data table operation error.
An apparatus for monitoring the operating state of a computer system according to another aspect of the present invention comprises:
a first module configured to obtain, for a process, one or more exception event types that may occur during execution of the computer system, wherein each of the exception event types has a corresponding code;
a second module configured to construct a dictionary tree associated with the process, wherein each of the abnormal event types corresponds to a branch of the dictionary tree, the encoding of the abnormal event type includes characters that are in one-to-one correspondence with nodes of the branch and an appearance order of the characters is consistent with a node connection order; and
and a third module configured to record each abnormal event type by using the dictionary tree to monitor the running state of the computer system.
Optionally, the apparatus is integrated within the computer system.
In an apparatus for monitoring the operational status of a computer system according to another aspect of the present invention, comprising:
a memory;
a processor; and
a computer program stored on the memory and executable on the processor to perform the steps of:
for a process, obtaining one or more abnormal event types which may occur in the computer system during execution, wherein each abnormal event type has corresponding codes;
constructing a dictionary tree associated with the processing procedure, wherein each abnormal event type corresponds to one branch of the dictionary tree, characters contained in the codes of the abnormal event type correspond to nodes of the branch one by one, and the appearance sequence of the characters is consistent with the node connection sequence; and
and recording each abnormal event type by utilizing the dictionary tree to realize monitoring of the running state of the computer system.
It is a further object of the invention to provide a computer readable storage medium having stored thereon a computer program which when executed by a processor implements a method as described above.
According to one or more embodiments of the present invention, the exception event type is represented in a code of limited field length, and is recorded using a dictionary tree, with a statistical query complexity of O (field length), with far better efficiency than index-based relational database query statistics; in addition, real-time fuzzy matching query statistics can be realized by utilizing the prefix coding type, the time complexity is O (the field length of prefix coding), and the efficiency is better than that of an elastic search fuzzy query. In addition, by configuring a time-series queue for each branch's end node to record an abnormal event, a real-time monitoring alarm mechanism for system status within a configurable time window can be realized, and has extremely high sensitivity.
Drawings
The foregoing and/or other aspects and advantages of the present invention will become more apparent and more readily appreciated from the following description of the various aspects taken in conjunction with the accompanying drawings in which like or similar elements are designated with the same reference numerals. The drawings include:
FIG. 1 is a flow chart of a method for monitoring the operational status of a computer system in accordance with one embodiment of the present invention.
FIG. 2 is a flow chart of a dictionary tree construction and initialization process in accordance with another embodiment of the present invention.
FIG. 3 is a schematic diagram of a dictionary tree generated in accordance with the process shown in FIG. 2.
FIG. 4 is a flow chart of an operational status anomaly monitoring process in accordance with another embodiment of the present invention.
FIG. 5 is a flow chart of a process for generating an operational status statistical query result according to another embodiment of the present invention.
FIG. 6 is a block diagram of an apparatus for monitoring the operational status of a computer system in accordance with another embodiment of the present invention.
FIG. 7 is a block diagram of an apparatus for monitoring the operational status of a computer system in accordance with another embodiment of the present invention.
Detailed Description
The present invention will now be described more fully hereinafter with reference to the accompanying drawings, in which illustrative embodiments of the invention are shown. This invention may, however, be embodied in different forms and should not be construed as limited to the embodiments set forth herein. The above-described embodiments are provided to fully convey the disclosure herein and to more fully convey the scope of the invention to those skilled in the art.
In this specification, terms such as "comprising" and "including" mean that there are other elements and steps not directly or explicitly recited in the description and claims, nor does the inventive solution exclude the presence of other elements or steps.
In a computer system, when an error or exception occurs in a particular process being performed, a code (also known as an error code) is typically returned that indicates the type of exception event or type of error. According to one aspect of the invention, the above-described encoding is recorded using a dictionary tree, whereby real-time monitoring of the operating state can be achieved at the process level. It is to be understood that monitoring herein includes, but is not limited to, condition monitoring, fault alerting, and operation statistical analysis, among others. In addition, an exception event should be construed broadly as a variety of events occurring during execution of a process other than normal or occurring in error, including, for example, but not limited to, memory overflow, memory leak, communication timeout, communication interruption, and data table operation error.
In one or more embodiments of the invention, an associated dictionary tree is built for each process, the dictionary tree having the following structure: the dictionary tree has one or more branches, each exception type corresponding to one of the branches of the dictionary tree; all branches originate from a root node and comprise a plurality of nodes, the number of nodes is consistent with the number of characters contained in the codes of the abnormal event type, wherein the characters contained in the codes of the abnormal event type are in one-to-one correspondence with the nodes of the branch, and the appearance sequence of the characters is consistent with the node connection sequence (namely the sequence in which the nodes of each branch are connected in turn).
The last node of each branch is the end node, and optionally, the monitoring attribute is assigned to each branch by assigning a counter to the end node, which can record the number of occurrences of the corresponding type of abnormal event, and a time-series queue, which can record the time of occurrence of the corresponding type of abnormal event.
In one or more embodiments of the invention, the types of encoding previously described include a complete encoding type that matches a particular exception event type and a prefix encoding type that matches a plurality of exception event types that contain the same prefix character. Accordingly, the end node of the branch corresponding to the prefix coding type may be an intermediate node of the branch corresponding to the complete coding type.
The running state of the computer system can be monitored by using the counter and the time sequence queue. For example, the number of occurrences of the corresponding type of abnormal event within the sliding time window may be determined based on the counter and the time series queue, and then compared with a set threshold value, and if greater than the set threshold value, an operation of generating an alarm message is triggered. For another example, statistics of the operating state of the computer system may be generated using the number of times the abnormal event occurs recorded by the counter and the time the abnormal event occurs recorded by the time series queue.
FIG. 1 is a flow chart of a method for monitoring the operational status of a computer system in accordance with one embodiment of the present invention.
As shown in FIG. 1, at step 110, the means for monitoring the operational status of the computer system obtains the type { E of an exception event that may occur during execution of a process P i Where i= … … n }. Optionally, each exception event type has a corresponding encoded STR i The code contains one or more characters. Illustratively, the range of values of the character(character set) contains the numbers {0, … …,9}, the cases { a, …, Z, a, …, Z } and the special characters { _+=% }.
Proceeding to step 120, the means for monitoring the operational status of the computer system builds a dictionary tree T associated with process P based on knowledge of the type of abnormal event of process P, which may have a structure as described above.
Proceeding to step 130, the means for monitoring the operational status of the computer system records the respective exception event type E using the constructed dictionary tree T i To enable monitoring of the operating state of the computer system.
It is noted that the means for monitoring the operating state of the computer system may be integrated within the monitored computer system or may exist as a component independent of the monitored computer system.
FIG. 2 is a flow chart of a dictionary tree construction and initialization process in accordance with another embodiment of the present invention.
As shown in FIG. 2, at step 210, a set { E } of exception event types that may occur during execution of a process P is obtained i Where i= … … n }. In the present embodiment, each exception event type E i Is assigned an error code or code, denoted STR i . The code may be a string of a limited number of characters m, e.g. denoted STR i (s 1 ,s 2 ,…,s m )。
As previously described, the types of encoding include a full encoding type and a prefix encoding type. An example of a complete coding type is for example "ABC3001", and an example of a prefix coding type is for example "ABC 4", wherein the character "x" is a wildcard. The code may be stored in a database to be invoked when the dictionary tree construction process is performed.
Step 220 is then entered, where the root node root of the dictionary tree is initialized as the starting node.
Step 230 is then entered to determine whether the set of exception event types { E } has been traversed i Where i= … … n }, i.e. whether an attempt has been made to encode E for each i Construction of the correspondingBranching. If so, the build and initialization process ends, otherwise step 240 is entered.
At step 240, a determination is made as to whether code E corresponding to the currently constructed branch has been traversed in sequence i Of (c), i.e. code E i Whether or not each character of (a) has been mapped in turn to a corresponding node in the dictionary tree, wherein the nodes encode E in accordance with the character i Sequentially connected to form a sequence corresponding to code E i Is a branch of (2). If so, return to step 230, otherwise proceed to step 250. To code E i For "ABC3001" for example, as shown in fig. 3, the characters A, B, C, 3, 0, 1 will be mapped sequentially to corresponding nodes in the lexicon tree, which are connected sequentially to get corresponding branches. Assuming that the mapping of the last character 1 has been completed, the process shown in fig. 2 will return to step 230, otherwise step 250 is entered.
At step 250, slave code E i Extracting the character to be mapped currently. Step 260 is then entered to determine if the character currently to be mapped already has a corresponding node or matching node in the dictionary tree, if so, step 240 is returned, otherwise step 270 is entered.
In step 270, a corresponding node is constructed for the character currently to be mapped.
The mapping process is further described below with the aid of the dictionary tree T shown in fig. 3. Assume that a branch encoding "ABC3001" is constructed after the construction of the branch encoding "ABC 4" is completed. In order, character A in code "ABC3001" is first mapped to a node at level 1 of the lexicon tree. Since the code "ABC4 x" has the same starting character "a" as "ABC3001" and the corresponding node has been generated for the starting character a in the dictionary tree T when constructing the branch of the code "ABC4 x", the character a in "ABC3001" is judged to have a matching node and this node can be shared with the code "ABC4 x". Likewise, for characters B and C in "ABC3001", it already has matching nodes at layers 2 and 3 in the dictionary tree T, and thus the corresponding nodes can be shared with the code "ABC4 x". But for other characters (3, 0, 1) in "ABC3001" there are no matching nodes, so a new node needs to be built in step 270.
After step 270 is performed, the process shown in fig. 2 proceeds to step 280, where it is determined whether the node to which the character currently being processed corresponds or matches is a final node (circle shown in phantom in fig. 3), and if so, step 290 is entered, otherwise, step 240 is returned.
In step 290, the node corresponding to the character to be processed is initialized with the final node attribute. In the present embodiment, the initialization of the attribute includes, but is not limited to, setting of a terminal node type identifier (a complete encoding type matching a specific abnormal event type and prefix encoding types matching a plurality of pieces of prefix encoding types containing the same prefix character), setting of an initial value of a terminal node counter, and initialization setting of a time-series queue of the terminal node (for example, setting the length of the initial time-series queue to 0), and the like.
After step 290 is performed, the process shown in FIG. 2 returns to step 230.
It should be noted that the dictionary tree construction and initialization process shown in fig. 2 may be performed in the apparatus for monitoring the operation state of the computer system, or may be performed in other apparatuses.
FIG. 4 is a flow chart of an operational status anomaly monitoring process in accordance with another embodiment of the present invention. The monitoring process may be implemented by a device for monitoring the operating state of a computer.
As shown in FIG. 4, in step 411, a code or error code E of an exception event is received i . Step 412 is then entered into the root node root of the dictionary tree.
Step 413 is then entered to determine whether the currently received code E has been traversed in sequence i Is a character of the character set. If so, the monitoring process is ended, otherwise step 414 is entered.
At step 414, slave code E i Extracting the character to be processed currently. Step 415 is then entered to determine if the currently processed character already has a corresponding node or matching node in the dictionary tree, if so, step 416 is entered, otherwise step 417 is entered.
In step 416, it is determined whether the node corresponding to the currently processed character is a terminal node, if so, step 418 is entered, otherwise step 413 is returned.
At step 418, the counter value is incremented. After step 418, step 419 is entered where the current sliding time window is calculated. Next, at step 420, queue elements in the time series queue that fall outside the current sliding time window are removed and new queue elements are added at the tail of the time series queue to record the time when the current abnormal event occurred, thereby generating an updated time series queue.
After step 420, step 421 is entered, it is determined whether the length of the updated time-series queue is greater than a preset threshold, if so, step 422 is entered, an alarm message indicating that an abnormality occurs in the running state of the computer is generated, and step 411 is returned, otherwise, the process of fig. 4 is ended.
Go back to another branch 417 of step 415. In step 417, a matching node is constructed for the currently processed character. After step 417, step 423 is entered to determine if the node to which the currently processed character corresponds or matches is a terminating node, if so, step 424 is entered, otherwise step 413 is returned.
In step 424, the node corresponding to the character currently to be processed is initialized with the final node attribute. In the present embodiment, the initialization of the attribute includes, but is not limited to, setting of a terminal node type identifier (a complete encoding type matching a specific abnormal event type and prefix encoding types matching a plurality of the same prefix characters), setting of an initial value of a terminal node counter (e.g., set to 1), and initialization setting of a time-series queue of the terminal node (e.g., set the length of the initial time-series queue to 1 and add a new queue element to record the time when the abnormal event occurs), and the like.
After step 424 is performed, the process shown in FIG. 4 returns to step 413.
FIG. 5 is a flow chart of a process for generating an operational status statistical query result according to another embodiment of the present invention. The process may be implemented by a device for monitoring the operating state of a computer.
As shown in FIG. 5, at step 511, a code or error code E containing an exception event is received from the computer system i Is provided. Step 512 is then entered into the root node root of the dictionary tree.
Step 513 is then entered to determine if the currently received code E has been traversed in sequence i Is a character of the character set. If so, step 514 is entered, otherwise step 515 is entered.
In step 514, it is determined whether the node corresponding to the currently processed character is a terminal node, if so, step 516 is entered, otherwise step 517 is entered.
At step 516, the count value of the counter associated with the end node and the time series queue are output. In step 517, a message is output indicating that the current statistical query request is invalid.
Returning to another branch 515 of step 513. At step 515, slave code E i Extracting the character currently processed. Next, step 518 is entered to determine whether the currently processed character already has a corresponding node or matching node in the dictionary tree, and if so, step 513 is returned, otherwise step 517 is entered.
FIG. 6 is a block diagram of an apparatus for monitoring the operational status of a computer system in accordance with another embodiment of the present invention.
As shown in fig. 6, the apparatus 60 for monitoring an operation state of a computer system according to the present embodiment includes a first module 610, a second module 620, and a third module 630. The first module 610 is configured to obtain, for a process, one or more exception event types that may occur during execution of the computer system, wherein each of the exception event types has a corresponding code. The second module 620 is configured to construct a dictionary tree associated with the process, wherein each of the exception event types corresponds to a branch of the dictionary tree, the encoding of the exception event type includes characters that correspond one-to-one with nodes of the branch and the order of occurrence of the characters is consistent with the order of node connection. The third module 630 is configured to record each exception type using the dictionary tree to enable monitoring of the operational state of the computer system.
FIG. 7 is a block diagram of an apparatus for monitoring the operational status of a computer system in accordance with another embodiment of the present invention.
The apparatus 70 shown in fig. 7 comprises a memory 710, a processor 720 and a computer program 730 stored on the memory 710 and executable on the processor 720, wherein the computer program 730 is executable by running on the processor 720 to perform the method as described above with the aid of the embodiments of fig. 1-5.
According to one aspect of the invention, a computer-readable storage medium is provided, on which a computer program is stored which, when being executed by a processor, carries out the method of the embodiments described with reference to fig. 1-5.
The embodiments and examples set forth herein are presented to best explain the embodiments in accordance with the present technology and its particular application and to thereby enable those skilled in the art to make and use the invention. However, those skilled in the art will recognize that the foregoing description and examples have been presented for the purpose of illustration and example only. The description as set forth is not intended to cover various aspects of the invention or to limit the invention to the precise form disclosed.
In view of the foregoing, the scope of the present disclosure is determined by the following claims.

Claims (13)

1. A method for monitoring the operational status of a computer system, the method comprising the steps of:
for a process, obtaining one or more abnormal event types which may occur in the computer system during execution, wherein each abnormal event type has corresponding codes;
constructing a dictionary tree associated with the processing procedure, wherein each abnormal event type corresponds to one branch of the dictionary tree, characters contained in the codes of the abnormal event type correspond to nodes of the branch one by one, and the appearance sequence of the characters is consistent with the node connection sequence; and
recording each exception event type using the dictionary tree to enable monitoring of the operational state of the computer system,
wherein each anomaly event type is recorded in the following manner: and allocating a counter and a time sequence queue for each branch end node, wherein the counter records the occurrence times of corresponding abnormal event types, and the time sequence queue records the occurrence times of corresponding abnormal event types.
2. The method of claim 1, wherein the types of encoding include a complete encoding type that matches a particular exception type and a prefix encoding type that matches a plurality of identical prefix characters.
3. The method of claim 2, further comprising:
and if the number of times of occurrence of the corresponding type of abnormal event in the sliding time window is determined to be greater than a set threshold value based on the counter and the time sequence queue, generating an alarm message.
4. The method of claim 2, further comprising:
statistics of the operating state of the computer system are generated based on the counter and a record of the time series queue.
5. The method of claim 1, wherein the exception event type comprises one or more of: memory overflow, memory leakage, communication timeout, communication interruption, and data table operation error.
6. An apparatus for monitoring the operational status of a computer system, comprising:
a first module configured to obtain, for a process, one or more exception event types that may occur during execution of the computer system, wherein each of the exception event types has a corresponding code;
a second module configured to construct a dictionary tree associated with the process, wherein each of the abnormal event types corresponds to a branch of the dictionary tree, the encoding of the abnormal event type includes characters that are in one-to-one correspondence with nodes of the branch and an appearance order of the characters is consistent with a node connection order; and
a third module configured to record each abnormal event type using the dictionary tree to enable monitoring of an operational state of the computer system,
wherein each anomaly event type is recorded in the following manner: and allocating a counter and a time sequence queue for each branch end node, wherein the counter records the occurrence times of corresponding abnormal event types, and the time sequence queue records the occurrence times of corresponding abnormal event types.
7. The apparatus of claim 6, wherein the apparatus is integrated within the computer system.
8. An apparatus for monitoring an operational state of a computer system, the apparatus comprising:
a memory;
a processor; and
a computer program stored on the memory and executable on the processor to perform the steps of:
for a process, obtaining one or more abnormal event types which may occur in the computer system during execution, wherein each abnormal event type has corresponding codes;
constructing a dictionary tree associated with the processing procedure, wherein each abnormal event type corresponds to one branch of the dictionary tree, characters contained in the codes of the abnormal event type correspond to nodes of the branch one by one, and the appearance sequence of the characters is consistent with the node connection sequence; and
recording each exception event type using the dictionary tree to enable monitoring of the operational state of the computer system,
wherein each anomaly event type is recorded in the following manner: and allocating a counter and a time sequence queue for each branch end node, wherein the counter records the occurrence times of corresponding abnormal event types, and the time sequence queue records the occurrence times of corresponding abnormal event types.
9. The apparatus of claim 8, wherein the types of encoding include a complete encoding type that matches a particular exception type and a prefix encoding type that matches a plurality of identical prefix characters.
10. The apparatus of claim 8, wherein the following steps are further performed:
and if the number of times of occurrence of the corresponding type of abnormal event in the sliding time window is determined to be greater than a set threshold value based on the counter and the time sequence queue, generating an alarm message.
11. The apparatus of claim 8, wherein the following steps are further performed:
and generating a statistical result of the running state of the computer system by using the counter and the record of the time sequence queue.
12. The apparatus of claim 8, wherein the exception event type comprises one or more of: memory overflow, memory leakage, communication timeout, communication interruption, and data table operation error.
13. A computer readable storage medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the method according to any of claims 1-5.
CN201910249543.XA 2019-03-29 2019-03-29 Method, apparatus and storage medium for monitoring the operating state of a computer system Active CN110389875B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910249543.XA CN110389875B (en) 2019-03-29 2019-03-29 Method, apparatus and storage medium for monitoring the operating state of a computer system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910249543.XA CN110389875B (en) 2019-03-29 2019-03-29 Method, apparatus and storage medium for monitoring the operating state of a computer system

Publications (2)

Publication Number Publication Date
CN110389875A CN110389875A (en) 2019-10-29
CN110389875B true CN110389875B (en) 2023-06-06

Family

ID=68284886

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910249543.XA Active CN110389875B (en) 2019-03-29 2019-03-29 Method, apparatus and storage medium for monitoring the operating state of a computer system

Country Status (1)

Country Link
CN (1) CN110389875B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112434770B (en) * 2020-09-10 2024-04-12 张家港市爱上旅途网络科技有限公司 Two-dimensional code display method and system based on dot matrix screen
CN116501531B (en) * 2023-06-19 2023-09-08 成都移信通科技有限公司 Software plug-in configuration method and system for monitoring software operation data security

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9037922B1 (en) * 2012-05-01 2015-05-19 Amazon Technololgies, Inc. Monitoring and analysis of operating states in a computing environment
CN105183788A (en) * 2015-08-20 2015-12-23 及时标讯网络信息技术(北京)有限公司 Operation method for Chinese AC automatic machine based on retrieval of keyword dictionary tree
CN107679036A (en) * 2017-10-12 2018-02-09 南京网数信息科技有限公司 A kind of wrong word monitoring method and system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9037922B1 (en) * 2012-05-01 2015-05-19 Amazon Technololgies, Inc. Monitoring and analysis of operating states in a computing environment
CN105183788A (en) * 2015-08-20 2015-12-23 及时标讯网络信息技术(北京)有限公司 Operation method for Chinese AC automatic machine based on retrieval of keyword dictionary tree
CN107679036A (en) * 2017-10-12 2018-02-09 南京网数信息科技有限公司 A kind of wrong word monitoring method and system

Also Published As

Publication number Publication date
CN110389875A (en) 2019-10-29

Similar Documents

Publication Publication Date Title
US9189319B2 (en) Management system for outputting information denoting recovery method corresponding to root cause of failure
US11614990B2 (en) Automatic correlation of dynamic system events within computing devices
CN110661659B (en) Alarm method, device and system and electronic equipment
CN110995482B (en) Alarm analysis method and device, computer equipment and computer readable storage medium
US8392385B2 (en) Flexible event data content management for relevant event and alert analysis within a distributed processing system
US10929217B2 (en) Multi-variant anomaly detection from application telemetry
US8730816B2 (en) Dynamic administration of event pools for relevant event and alert analysis during event storms
CN107864063B (en) Abnormity monitoring method and device and electronic equipment
US9178936B2 (en) Selected alert delivery in a distributed processing system
CN110389875B (en) Method, apparatus and storage medium for monitoring the operating state of a computer system
CN111160021A (en) Log template extraction method and device
JP2007174235A (en) Attribute information collection device, attribute information collection method and attribute information collection program
CN110795264A (en) Monitoring management method and system and intelligent management terminal
US8136012B2 (en) Method and system for updating topology changes of a computer network
CN113626238B (en) ctdb service health state monitoring method, system, device and storage medium
CN107330031B (en) Data storage method and device and electronic equipment
CN117376092A (en) Fault root cause positioning method, device, equipment and storage medium
CN111124542A (en) Configuration information management system
WO2021249629A1 (en) Device and method for monitoring communication networks
CN106777981B (en) Behavior data verification method and device
CN117061332B (en) Fault diagnosis method and system based on probability directed graph deep learning
CN111506446B (en) Interface fault detection method and server
CN114422324B (en) Alarm information processing method and device, electronic equipment and storage medium
US20230185650A1 (en) Method for managing a plurality of events
CN111221818A (en) Alarm information analysis method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant