CN109597702A - Root cause analysis method, apparatus, equipment and the storage medium of messaging bus exception - Google Patents

Root cause analysis method, apparatus, equipment and the storage medium of messaging bus exception Download PDF

Info

Publication number
CN109597702A
CN109597702A CN201811472556.5A CN201811472556A CN109597702A CN 109597702 A CN109597702 A CN 109597702A CN 201811472556 A CN201811472556 A CN 201811472556A CN 109597702 A CN109597702 A CN 109597702A
Authority
CN
China
Prior art keywords
messaging bus
time series
series data
exception
tps
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811472556.5A
Other languages
Chinese (zh)
Other versions
CN109597702B (en
Inventor
叶金瓒
王雅琪
张娟
尤嘉
朱楚廷
覃文丰
李泽宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WeBank Co Ltd
Original Assignee
WeBank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WeBank Co Ltd filed Critical WeBank Co Ltd
Priority to CN201811472556.5A priority Critical patent/CN109597702B/en
Publication of CN109597702A publication Critical patent/CN109597702A/en
Application granted granted Critical
Publication of CN109597702B publication Critical patent/CN109597702B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079Root cause analysis, i.e. error or fault diagnosis

Abstract

The invention discloses a kind of root cause analysis methods of messaging bus exception, this method comprises: the TPS time series data and the preset TPS time series data for calling relevant operation system to the messaging bus of acquisition messaging bus;Judge the TPS time series data of the messaging bus with the presence or absence of abnormal;If the TPS time series data of the messaging bus exist it is abnormal, by preset clustering algorithm, determine respectively the messaging bus TPS time series data and the preset outlier called to the messaging bus in the TPS time series data of relevant operation system;Judge whether there is the operation system for occurring outlier simultaneously with the messaging bus, and if it exists, then determine it is described with outlier occurs simultaneously in messaging bus operation system is messaging bus exception root because.The invention also discloses root cause analysis device, equipment and a kind of storage mediums of a kind of messaging bus exception.The present invention realizes low cost, efficient messaging bus exception root cause analysis.

Description

Root cause analysis method, apparatus, equipment and the storage medium of messaging bus exception
Technical field
The present invention relates to the root cause analysis method, apparatus of field of communication technology more particularly to messaging bus exception, equipment and Storage medium.
Background technique
Messaging bus (Message Queue, abbreviation MQ) is a kind of communication mechanism of striding course, between upstream and downstream Message is transmitted, messaging service can be provided between application program, is provided safeguard for interconnecting for each application program.
The solution of traditional messaging bus exception basic reason analysis mainly includes following two:
The first, the calling data of all application programs relevant to messaging bus is all recorded, specifically, just It is that the calling data of all application programs relevant to messaging bus will be recorded, is called including entry call and outlet Data, to realize accurate root cause analysis.The defect of this mode is: 1) cost is too high.Related to messaging bus tune System is required to be transformed, and the record for calling data is supported after transformation, to support root cause analysis with data, due to practical enterprise There are substantial amounts, the features of call relation complexity for the system of industry, thus a large amount of system is transformed will lead to it is excessively high Cost;2) transformation is difficult.It there are part system is outsourcing in enterprise, these systems may be bulk/volume buying and can not provide two Secondary transformation.
Second, artificial micro-judgment, specifically, be exactly when the size of message of messaging bus occurs abnormal, looked for through The personnel tested come check application program TPS (Transaction Per Second, i.e., transaction that master slave system per second is capable of handling or The quantity of affairs) time series data, the reason for the size of message exception for leading to messaging bus is judged with this.This mode cost of labor compared with Height, and can not be met the requirements in the timeliness of root cause analysis.
Summary of the invention
It is a primary object of the present invention to propose a kind of root cause analysis method, apparatus of messaging bus exception, equipment and deposit Storage media, it is intended to realize low cost, efficient messaging bus exception root cause analysis.
To achieve the above object, the present invention provides a kind of root cause analysis method of messaging bus exception, the method includes Following steps:
It acquires the TPS time series data of messaging bus and preset calls relevant operation system to the messaging bus TPS time series data;
Judge the TPS time series data of the messaging bus with the presence or absence of abnormal;
If the TPS time series data of the messaging bus exist it is abnormal, by preset clustering algorithm, determine respectively described in It the TPS time series data of messaging bus and preset is called in the TPS time series data of relevant operation system to the messaging bus Outlier;
Judge whether there is the operation system for occurring outlier simultaneously with the messaging bus, and if it exists, described in then determining With the messaging bus occur simultaneously outlier operation system be the messaging bus exception root because.
Preferably, the TPS time series data of the judgement messaging bus includes: with the presence or absence of abnormal step
Judge whether the TPS time series data of the messaging bus is greater than or equal to preset threshold;
If so, it is abnormal to determine that the TPS time series data of the messaging bus exists.
Preferably, the preset clustering algorithm is DBSCAN clustering algorithm, described by preset clustering algorithm, respectively When determining the TPS time series data and the preset TPS for calling relevant operation system to the messaging bus of the messaging bus Ordinal number according in outlier the step of include:
TPS time series data and preset business system relevant to messaging bus calling for the messaging bus The TPS time series data of system searches for cluster by checking every in data set Eps neighborhood;
The quantity for the point for including in the Eps neighborhood of certain point if it exists be greater than or equal to preset quantity, then create one with The point is the cluster of kernel object;
For each cluster of creation, assemble the object reachable from the direct density of its kernel object;
When not new point is added to any cluster, the point of any cluster will be not belonging in the data set as outlier.
Preferably, the acquisition TPS time series data of messaging bus and preset relevant to messaging bus calling After the step of TPS time series data of operation system, further includes:
By the TPS time series data of the collected messaging bus and preset relevant to messaging bus calling After the TPS time series data of operation system is converted to preset standard data format, store into presetting database;
It is described by preset clustering algorithm, determine respectively the messaging bus TPS time series data and preset and institute Before stating the step of messaging bus calls the outlier in the TPS time series data of relevant operation system, further includes:
Read the TPS time series data of the messaging bus stored in the database and preset total with the message Line calls the TPS time series data of relevant operation system.
Preferably, the root cause analysis method of the messaging bus exception further include:
The root cause analysis result of the messaging bus is fed back into WEB front-end.
In addition, to achieve the above object, the present invention also provides a kind of root cause analysis device of messaging bus exception, the dresses It sets and includes:
Acquisition module, for acquiring the TPS time series data of messaging bus and preset related to messaging bus calling Operation system TPS time series data;
Abnormal judgment module, for judging the TPS time series data of the messaging bus with the presence or absence of abnormal;
Outlier determining module, if the TPS time series data for the messaging bus has exception, by preset poly- Class algorithm, determine respectively the messaging bus TPS time series data and preset business relevant to messaging bus calling Outlier in the TPS time series data of system;
Root is because of judgment module, for judging whether there is the operation system for occurring outlier simultaneously with the messaging bus, If it exists, then the operation system for determining that outlier occurs simultaneously in the described and messaging bus is the root of the messaging bus exception Cause.
Preferably, the abnormal judgment module, be also used to judge the TPS time series data of the messaging bus whether be greater than or Equal to preset threshold;If so, it is abnormal to determine that the TPS time series data of the messaging bus exists.
Preferably, the preset clustering algorithm is DBSCAN clustering algorithm;
The outlier determining module, if the TPS time series data for being also used to the messaging bus has exception, for institute Ordinal number when stating the TPS time series data and the preset TPS for calling relevant operation system to the messaging bus of messaging bus According to by checking that every in data set Eps neighborhood searches for cluster;The number for the point for including in the Eps neighborhood of certain point if it exists Amount is greater than or equal to preset quantity, then creating one with this is the cluster of kernel object;For each cluster of creation, assemble from it The reachable object of the direct density of kernel object;When not new point is added to any cluster, will be not belonging to appoint in the data set The point of what cluster is as outlier.
Preferably, the root cause analysis device of the messaging bus exception further includes format converting module;
The format converting module, for by the TPS time series data of the collected messaging bus and preset and institute It states after messaging bus calls the TPS time series data of relevant operation system to be converted to preset standard data format, stores to pre- If in database;
The outlier determining module is also used to read the TPS timing of the messaging bus stored in the database Data and the preset TPS time series data that relevant operation system is called to the messaging bus.
In addition, to achieve the above object, it is described to disappear the present invention also provides a kind of root cause analysis equipment of messaging bus exception The root cause analysis equipment of breath bus exception includes: memory, processor and is stored on the memory and can be in the processing The root cause analysis program of the messaging bus exception run on device, the root cause analysis program of the messaging bus exception is by the processing The step of device realizes the root cause analysis method of messaging bus exception as described above when executing.
In addition, to achieve the above object, the present invention also provides a kind of storage medium, being stored with message on the storage medium The root cause analysis program of the root cause analysis program of bus exception, the messaging bus exception realizes institute as above when being executed by processor The step of root cause analysis method for the messaging bus exception stated.
The root cause analysis method of messaging bus exception proposed by the present invention acquires the TPS time series data of messaging bus first, And the preset TPS time series data that relevant operation system is called to the messaging bus;Then, judge the messaging bus TPS time series data is with the presence or absence of abnormal;If the TPS time series data of the messaging bus has exception, pass through preset cluster Algorithm determine messaging bus and it is preset call to the messaging bus in the TPS time series data of relevant operation system from Group's point then can be determined that the system is to cause message total if there is occurring the operation system of outlier simultaneously with messaging bus The basic reason of line exception.This mode compared with the prior art, without by all application programs relevant to messaging bus It calls data all to record, participates in analysis without artificial, to realize low cost, efficient messaging bus exception root Because of analysis.
Detailed description of the invention
Fig. 1 is the device structure schematic diagram for the hardware running environment that the embodiment of the present invention is related to;
Fig. 2 is the flow diagram of the root cause analysis method first embodiment of messaging bus exception of the present invention;
Fig. 3 is that outlier is shown in the TPS time series data of the collected messaging bus of the embodiment of the present invention and operation system It is intended to.
The embodiments will be further described with reference to the accompanying drawings for the realization, the function and the advantages of the object of the present invention.
Specific embodiment
It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not intended to limit the present invention.
As shown in Figure 1, Fig. 1 is the device structure schematic diagram for the hardware running environment that the embodiment of the present invention is related to.
The root cause analysis equipment of messaging bus exception of the embodiment of the present invention can be PC machine or background server.
As shown in Figure 1, the terminal may include: processor 1001, such as CPU, network interface 1004, user interface 1003, memory 1005, communication bus 1002.Wherein, communication bus 1002 is for realizing the connection communication between these components. User interface 1003 may include display screen (Display), input unit such as keyboard (Keyboard), optional user interface 1003 can also include standard wireline interface and wireless interface.Network interface 1004 optionally may include that the wired of standard connects Mouth, wireless interface (such as WI-FI interface).Memory 1005 can be high speed RAM memory, be also possible to stable memory (non-volatile memory), such as magnetic disk storage.Memory 1005 optionally can also be independently of aforementioned processor 1001 storage device.
It will be understood by those skilled in the art that device structure shown in Fig. 1 does not constitute the restriction to equipment, can wrap It includes than illustrating more or fewer components, perhaps combines certain components or different component layouts.
As shown in Figure 1, as may include that operating system, network are logical in a kind of memory 1005 of computer storage medium Believe the root cause analysis program of module, Subscriber Interface Module SIM and messaging bus exception.
In terminal shown in Fig. 1, network interface 1004 is mainly used for connecting background server, carries out with background server Data communication;User interface 1003 is mainly used for connecting client (user terminal), carries out data communication with client;And processor 1001 can be used for calling the root cause analysis program of the messaging bus exception stored in memory 1005, and it is total to execute following message Operation in the root cause analysis embodiment of the method for line exception.
Based on above-mentioned hardware configuration, the root cause analysis embodiment of the method for messaging bus exception of the present invention is proposed.
It is the flow diagram of the root cause analysis method first embodiment of messaging bus exception of the present invention referring to Fig. 2, Fig. 2, The described method includes:
Step S10, acquire messaging bus TPS time series data and preset industry relevant to messaging bus calling The TPS time series data of business system;
The root cause analysis method of the present embodiment messaging bus exception is realized that user can take on the backstage by background server The root cause analysis instruction for the corresponding WEB front-end triggering messaging bus exception of device of being engaged in, so that background server is carried out according to the instruction The root cause analysis of messaging bus exception.
In the present embodiment, when carrying out the root cause analysis of messaging bus exception, first from data source, i.e., each operation system is adopted Ordinal number when collecting the TPS time series data and the preset TPS for calling relevant operation system to the messaging bus of messaging bus According to forming corresponding data set.Wherein, TPS time series data be used for characterize system TPS (Transaction Per Second, The quantity of transaction or affairs that master slave system i.e. per second is capable of handling) this device parameter changes with time situation comprising be It unites in the TPS value of several different time points;Operation system relevant to messaging bus calling include institute it is in need directly or Connect the operation system for calling messaging bus.
Specifically, it when acquiring TPS time series data, can be acquired according to preset frequency acquisition, such as in 1 hour It is interior, the TPS value an of messaging bus was acquired every 1 minute and the TPS value of relevant operation system is called to messaging bus, thus The TPS time series data collection of messaging bus is obtained, and calls the TPS time series data of relevant each operation system to messaging bus Collection.Certainly, the frequency acquisition of time series data and acquisition time section can flexible setting according to actual needs, the present embodiment to this not It limits.
Step S20 judges the TPS time series data of the messaging bus with the presence or absence of abnormal;
In the step, judge the TPS time series data of above-mentioned collected messaging bus with the presence or absence of abnormal.
In a judgment mode, step S20 be may further include: judge the TPS time series data of the messaging bus Whether preset threshold is greater than or equal to;If so, it is abnormal to determine that the TPS time series data of the messaging bus exists.For example, in advance The TPS threshold value of messaging bus is set as 100, then when the TPS value of collected messaging bus is greater than or equal to 100, that is, is determined The TPS value is abnormal.
In another judgment mode, the TPS value of collected messaging bus and the ratio of preset threshold can also be calculated, if Ratio falls in pre-set interval, then determines that the TPS value is abnormal.
If the TPS time series data of the messaging bus exists abnormal, S30 is thened follow the steps, by preset clustering algorithm, The TPS time series data of the messaging bus is determined respectively and preset calls relevant operation system to the messaging bus Outlier in TPS time series data;
In the step, if the TPS time series data of messaging bus has exception, message is determined by preset clustering algorithm Bus and the preset outlier called to the messaging bus in the TPS time series data of relevant operation system.It is so-called to peel off Point refers in a time series, the extreme large and extreme small of the mean level far from sequence.
In the present embodiment, when using DBSCAN (Density-Based Spatial Clustering of Applications with Noise has noisy density clustering method) clustering algorithm when, step S30 can wrap It includes: TPS time series data for the messaging bus and preset calling relevant operation system to the messaging bus TPS time series data searches for cluster by checking every in data set Eps neighborhood;Include in the Eps neighborhood of certain point if it exists The quantity of point be greater than or equal to preset quantity, then creating one with this is the cluster of kernel object;For each cluster of creation, Assemble the object reachable from the direct density of its kernel object;It, will be in the data set when not new point is added to any cluster The point of any cluster is not belonging to as outlier.
DBSCAN is a more representational density-based algorithms in machine learning algorithm.With division and layer Secondary clustering method is different, and cluster is defined as the maximum set of the connected point of density by it, can be with region highdensity enough It is divided into cluster, and can find the cluster of arbitrary shape in the spatial database of noise.The basic principle of DBSCAN algorithm are as follows:
1) cluster is searched for by checking every in data set Eps neighborhood, the point as the Eps neighborhood of fruit dot p includes is more than MinPts, then one is created using p as the cluster of kernel object;
2) then, DBSCAN iteratively assembles the object reachable from the direct density of these kernel objects, this process may It is related to some density up to the merging of cluster;
3) when not new point is added to any cluster, which terminates, and the point for being not belonging to any cluster is then known as peeling off Point.
Wherein, Eps neighborhood: the neighborhood in given object radius Eps is known as the Eps neighborhood of the object.MinPts: it describes The distance of a certain sample is the threshold value of number of samples in the neighborhood of ∈.Kernel object: if the sample in given object Ε neighborhood Points are more than or equal to MinPts, then the object is referred to as kernel object.Direct density is reachable: for sample set D, if sample point Q is in the Ε neighborhood of p, and p is kernel object, then object q is reachable from the direct density of object p.
The specific implementation flow of DBSCAN algorithm is as follows:
Input: sample set D=(x1, x2 ..., xm), Neighbourhood parameter (∈, MinPts), sample distance metric mode.
Output: cluster divides C.
1) kernel object set is initializedInitialization cluster number of clusters k=0, initializes non-access-sample setCluster divides
2) for j=1,2 ... m is found out all kernel objects by following step:
A) by distance metric mode, ∈-neighborhood subsample collection N ∈ (xj) of sample xj is found.
If b) collection number of samples in subsample meets | N ∈ (xj) | kernel object sample is added in sample xj by >=MinPts Set: Ω=Ω ∪ { xj }.
If 3) kernel object setThen algorithm terminates, and is otherwise transferred to step 4.
4) in kernel object set omega, a kernel object o is randomly choosed, initializes current cluster kernel object queue Ω Cur={ o } initializes classification sequence number k=k+1, initializes current cluster sample set Ck={ o }, updates non-access-sample set
If 5) current cluster kernel object queueThen current clustering cluster Ck generation finishes, and updates cluster and divides C ={ C1, C2 ..., Ck } updates kernel object set omega=Ω-Ck, is transferred to step 3.
6) a kernel object o ' is taken out in current cluster kernel object queue Ω cur, is looked for by neighborhood distance threshold ∈ All ∈-neighborhood subsample collection N ∈ (o ') out is enabledUpdate current cluster sample set Ck=Ck ∪ Δ updates non-access-sample setΩ cur=Ω cur ∪ (Δ ∩ Ω)-o ' is updated, step 5 is transferred to.
Export result are as follows: cluster division C=C1, C2 ..., Ck }.
By DBSCAN clustering algorithm, the efficient analysis of outlier is realized, is the root of subsequent message bus exception because dividing Analysis provides premise.
It should be noted that other than DBSCAN clustering algorithm, can also be determined respectively using other clustering algorithms described in It the TPS time series data of messaging bus and preset is called in the TPS time series data of relevant operation system to the messaging bus Outlier, when specific implementation, can flexible setting.
Step S40 judges whether there is the operation system for occurring outlier simultaneously with the messaging bus, and if it exists, then Determine it is described with outlier occurs simultaneously in messaging bus operation system is messaging bus exception root because.
In the step, according to the outlier of above-mentioned determination, judges whether there is and occur outlier simultaneously with messaging bus Operation system, referring to Fig. 3, Fig. 3 be in the collected messaging bus of the embodiment of the present invention and the TPS time series data of operation system from The schematic diagram of group's point, by taking the TPS time series data for acquiring messaging bus, A system and B system as an example, wherein A system and B system are The calling related system of messaging bus.Fig. 3 shows the line chart that data of acquisition generate per minute, amounts to hour Data volume.
By DBSCAN algorithm, the outlier for analyzing the TPS time series data of messaging bus is as shown in the table:
Serial number TPS value Whether outlier
44 90 It is
45 122 It is
46 145 It is
The outlier for analyzing the TPS time series data of A system is as shown in the table:
Serial number TPS value Whether outlier
44 201 It is
45 277 It is
46 323 It is
It is not difficult to find that messaging bus and the TPS time series data of A system exist simultaneously on 44,45,46 equi-time points in figure Outlier, and B system does not have outlier illustrates to be particularly likely that messaging bus caused by the size of message of A system is uprushed is different at this time Often, therefore it can be determined that A system is that messaging bus is caused abnormal basic reason occur.
In the present embodiment, the TPS time series data and the preset and messaging bus for acquiring messaging bus first call The TPS time series data of relevant operation system;Then, judge the TPS time series data of the messaging bus with the presence or absence of abnormal;If The TPS time series data of the messaging bus exist it is abnormal, then by preset clustering algorithm determine messaging bus and it is preset and The messaging bus calls the outlier in the TPS time series data of relevant operation system, if there is with messaging bus simultaneously There is the operation system of outlier, then can be determined that the system is to lead to the basic reason of messaging bus exception.This mode phase Than in the prior art, without all recording the calling data of all application programs relevant to messaging bus, without people Work participates in analysis, to realize low cost, efficient messaging bus exception root cause analysis.
Further, it is based on above-mentioned first embodiment, proposes the root cause analysis method second of messaging bus exception of the present invention Embodiment.
In the present embodiment, after step S10, ordinal number when can also include: the TPS by the collected messaging bus According to and preset call the TPS time series data of relevant operation system to be converted to preset normal data to the messaging bus After format, store into presetting database;
It can also include: when reading the TPS of the messaging bus stored in the database at this time before step S30 Ordinal number evidence and the preset TPS time series data that relevant operation system is called to the messaging bus.
In the present embodiment, since the format of the TPS time series data got from different business systems may be different, for just In subsequent analysis, when the TPS time series data and preset business relevant to messaging bus calling for collecting messaging bus When the TPS time series data of system, collected data can be uniformly converted into preset standard data format, then store to In preset database;When the TPS time series data for judging messaging bus is deposited when abnormal, stored from reading database at this time The TPS time series data of messaging bus and the preset TPS time series data that relevant operation system is called to the messaging bus, And abnormal root cause analysis is carried out according to the data read.In this way, storage and the root of TPS time series data are realized Calling when because of analysis.
Further, the root cause analysis method of the messaging bus exception can be comprising steps of by the messaging bus Root cause analysis result feed back to WEB front-end.
The root cause analysis result of messaging bus is fed back into WEB front-end, so that WEB front-end shows root cause analysis result In respective page, so it is convenient for the reason of operation maintenance personnel timely learning messaging bus exception.
The present invention also provides a kind of root cause analysis devices of messaging bus exception.The root of messaging bus exception of the present invention is because dividing Analysis apparatus includes:
Acquisition module, for acquiring the TPS time series data of messaging bus and preset related to messaging bus calling Operation system TPS time series data;
Abnormal judgment module, for judging the TPS time series data of the messaging bus with the presence or absence of abnormal;
Outlier determining module, if the TPS time series data for the messaging bus has exception, by preset poly- Class algorithm, determine respectively the messaging bus TPS time series data and preset business relevant to messaging bus calling Outlier in the TPS time series data of system;
Root is because of judgment module, for judging whether there is the operation system for occurring outlier simultaneously with the messaging bus, If it exists, then the operation system for determining that outlier occurs simultaneously in the described and messaging bus is the root of the messaging bus exception Cause.
Further, the abnormal judgment module, is also used to judge whether the TPS time series data of the messaging bus is greater than Or it is equal to preset threshold;If so, it is abnormal to determine that the TPS time series data of the messaging bus exists.
Further, the preset clustering algorithm is DBSCAN clustering algorithm;
The outlier determining module, if the TPS time series data for being also used to the messaging bus has exception, for institute Ordinal number when stating the TPS time series data and the preset TPS for calling relevant operation system to the messaging bus of messaging bus According to by checking that every in data set Eps neighborhood searches for cluster;The number for the point for including in the Eps neighborhood of certain point if it exists Amount is greater than or equal to preset quantity, then creating one with this is the cluster of kernel object;For each cluster of creation, assemble from it The reachable object of the direct density of kernel object;When not new point is added to any cluster, will be not belonging to appoint in the data set The point of what cluster is as outlier.
Further, the root cause analysis device of the messaging bus exception further includes format converting module;
The format converting module, for by the TPS time series data of the collected messaging bus and preset and institute It states after messaging bus calls the TPS time series data of relevant operation system to be converted to preset standard data format, stores to pre- If in database;
The outlier determining module is also used to read the TPS timing of the messaging bus stored in the database Data and the preset TPS time series data that relevant operation system is called to the messaging bus.
The root cause analysis method that the method that above-mentioned each program module is realized can refer to messaging bus exception of the present invention is implemented Example, details are not described herein again.
The present invention also provides a kind of root cause analysis equipment of messaging bus exception.
The root cause analysis equipment of messaging bus exception of the present invention includes: memory, processor and is stored in the memory The root cause analysis program of messaging bus exception that is upper and can running on the processor, the root of the messaging bus exception is because dividing The step of analysis program realizes the root cause analysis method of messaging bus exception as described above when being executed by the processor.
Wherein, the root cause analysis program of the messaging bus exception run on the processor is performed realized side Method can refer to each embodiment of root cause analysis method of messaging bus exception of the present invention, and details are not described herein again.
The present invention also provides a kind of storage mediums.
The root cause analysis program of messaging bus exception, the root of the messaging bus exception are stored on storage medium of the present invention Because of the step of realizing the root cause analysis method of messaging bus exception as described above when analysis program is executed by processor.
Wherein, the root cause analysis program of the messaging bus exception run on the processor is performed realized side Method can refer to each embodiment of root cause analysis method of messaging bus exception of the present invention, and details are not described herein again.
It should be noted that, in this document, the terms "include", "comprise" or its any other variant are intended to non-row His property includes, so that the process, method, article or the system that include a series of elements not only include those elements, and And further include other elements that are not explicitly listed, or further include for this process, method, article or system institute it is intrinsic Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including being somebody's turn to do There is also other identical elements in the process, method of element, article or system.
The serial number of the above embodiments of the invention is only for description, does not represent the advantages or disadvantages of the embodiments.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can be realized by means of software and necessary general hardware platform, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on this understanding, technical solution of the present invention substantially in other words does the prior art The part contributed out can be embodied in the form of software products, which is stored in one as described above In storage medium (such as ROM/RAM, magnetic disk, CD), including some instructions are used so that terminal device (it can be mobile phone, Computer, server, air conditioner or network equipment etc.) execute method described in each embodiment of the present invention.
The above is only a preferred embodiment of the present invention, is not intended to limit the scope of the invention, all to utilize this hair Equivalent structure or equivalent flow shift made by bright specification and accompanying drawing content is applied directly or indirectly in other relevant skills Art field, is included within the scope of the present invention.

Claims (11)

1. a kind of root cause analysis method of messaging bus exception, which is characterized in that the root cause analysis side of the messaging bus exception Method includes the following steps:
When acquiring the TPS time series data and the preset TPS for calling relevant operation system to the messaging bus of messaging bus Ordinal number evidence;
Judge the TPS time series data of the messaging bus with the presence or absence of abnormal;
If the TPS time series data of the messaging bus has exception, by preset clustering algorithm, the message is determined respectively The TPS time series data of bus and it is preset call to the messaging bus in the TPS time series data of relevant operation system from Group's point;
Judge whether there is the operation system for occurring outlier simultaneously with the messaging bus, and if it exists, then determine described and institute State messaging bus and meanwhile occur outlier operation system be the messaging bus exception root because.
2. the root cause analysis method of messaging bus exception as described in claim 1, which is characterized in that the judgement message The TPS time series data of bus whether there is abnormal step
Judge whether the TPS time series data of the messaging bus is greater than or equal to preset threshold;
If so, it is abnormal to determine that the TPS time series data of the messaging bus exists.
3. the root cause analysis method of messaging bus exception as claimed in claim 2, which is characterized in that the preset cluster is calculated Method is DBSCAN clustering algorithm, described by preset clustering algorithm, determines the TPS time series data of the messaging bus respectively, And the step of preset outlier called in the TPS time series data of relevant operation system to the messaging bus, includes:
TPS time series data for the messaging bus and preset relevant operation system is called to the messaging bus TPS time series data searches for cluster by checking every in data set Eps neighborhood;
The quantity for the point for including in the Eps neighborhood of certain point if it exists is greater than or equal to preset quantity, then creates one with the point For the cluster of kernel object;
For each cluster of creation, assemble the object reachable from the direct density of its kernel object;
When not new point is added to any cluster, the point of any cluster will be not belonging in the data set as outlier.
4. the root cause analysis method of messaging bus exception as claimed any one in claims 1 to 3, which is characterized in that described Ordinal number when acquiring the TPS time series data and the preset TPS for calling relevant operation system to the messaging bus of messaging bus According to the step of after, further includes:
By the TPS time series data of the collected messaging bus and preset business relevant to messaging bus calling After the TPS time series data of system is converted to preset standard data format, store into presetting database;
It is described by preset clustering algorithm, determine the TPS time series data of the messaging bus respectively and preset disappear with described Before the step of breath bus calls the outlier in the TPS time series data of relevant operation system, further includes:
Read the TPS time series data and the preset and messaging bus tune of the messaging bus stored in the database With the TPS time series data of relevant operation system.
5. the root cause analysis method of messaging bus exception as claimed any one in claims 1 to 3, which is characterized in that described The root cause analysis method of messaging bus exception further include:
The root cause analysis result of the messaging bus is fed back into WEB front-end.
6. a kind of root cause analysis device of messaging bus exception, which is characterized in that the root cause analysis of the messaging bus exception fills It sets and includes:
Acquisition module, for acquire messaging bus TPS time series data and preset industry relevant to messaging bus calling The TPS time series data of business system;
Abnormal judgment module, for judging the TPS time series data of the messaging bus with the presence or absence of abnormal;
Outlier determining module is calculated if the TPS time series data for the messaging bus has exception by preset cluster Method, determine respectively the messaging bus TPS time series data and preset operation system relevant to messaging bus calling TPS time series data in outlier;
Root is because of judgment module, for judging whether there is the operation system for occurring outlier simultaneously with the messaging bus, if depositing , then determine it is described with outlier occurs simultaneously in messaging bus operation system is messaging bus exception root because.
7. the root cause analysis device of messaging bus exception as claimed in claim 6, which is characterized in that
The exception judgment module, is also used to judge whether the TPS time series data of the messaging bus is greater than or equal to default threshold Value;If so, it is abnormal to determine that the TPS time series data of the messaging bus exists.
8. the root cause analysis device of messaging bus exception as claimed in claim 7, which is characterized in that the preset cluster is calculated Method is DBSCAN clustering algorithm;
The outlier determining module disappears if the TPS time series data for being also used to the messaging bus has exception for described The TPS time series data and the preset TPS time series data for calling relevant operation system to the messaging bus of bus are ceased, is led to Every in inspection data set Eps neighborhood is crossed to search for cluster;The quantity for the point for including in the Eps neighborhood of certain point if it exists is greater than Or be equal to preset quantity, then creating one with this is the cluster of kernel object;For each cluster of creation, assemble from its core pair As the reachable object of direct density;When not new point is added to any cluster, any cluster will be not belonging in the data set Point is used as outlier.
9. the root cause analysis device of the messaging bus exception as described in any one of claim 6 to 8, which is characterized in that described The root cause analysis device of messaging bus exception further includes format converting module;
The format converting module, for preset disappearing the TPS time series data of the collected messaging bus and with described After breath bus calls the TPS time series data of relevant operation system to be converted to preset standard data format, store to present count According in library;
The outlier determining module is also used to read the TPS time series data of the messaging bus stored in the database, And the preset TPS time series data that relevant operation system is called to the messaging bus.
10. a kind of root cause analysis equipment of messaging bus exception, which is characterized in that the root cause analysis of the messaging bus exception is set Standby includes: memory, processor and to be stored in the messaging bus that can be run on the memory and on the processor abnormal Root cause analysis program, realize when the root cause analysis program of the messaging bus exception is executed by the processor such as claim The step of root cause analysis method of the exception of messaging bus described in any one of 1 to 5.
11. a kind of storage medium, which is characterized in that be stored with the root cause analysis journey of messaging bus exception on the storage medium It is realized as described in any one of claims 1 to 5 when the root cause analysis program of sequence, the messaging bus exception is executed by processor Messaging bus exception root cause analysis method the step of.
CN201811472556.5A 2018-12-03 2018-12-03 Root cause analysis method, device, equipment and storage medium for message bus abnormity Active CN109597702B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811472556.5A CN109597702B (en) 2018-12-03 2018-12-03 Root cause analysis method, device, equipment and storage medium for message bus abnormity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811472556.5A CN109597702B (en) 2018-12-03 2018-12-03 Root cause analysis method, device, equipment and storage medium for message bus abnormity

Publications (2)

Publication Number Publication Date
CN109597702A true CN109597702A (en) 2019-04-09
CN109597702B CN109597702B (en) 2022-04-26

Family

ID=65960982

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811472556.5A Active CN109597702B (en) 2018-12-03 2018-12-03 Root cause analysis method, device, equipment and storage medium for message bus abnormity

Country Status (1)

Country Link
CN (1) CN109597702B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112035320A (en) * 2020-08-31 2020-12-04 维沃移动通信有限公司 Service monitoring method and device, electronic equipment and readable storage medium
CN113515102A (en) * 2020-04-10 2021-10-19 北京京东乾石科技有限公司 Exception attribution method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102289585A (en) * 2011-08-15 2011-12-21 重庆大学 Real-time monitoring method for energy consumption of public building based on data mining
CN102360378A (en) * 2011-10-10 2012-02-22 南京大学 Outlier detection method for time-series data
CN104462802A (en) * 2014-11-26 2015-03-25 浪潮电子信息产业股份有限公司 Method for analyzing outlier data in large-scale data
US20170019308A1 (en) * 2015-07-14 2017-01-19 Netflix, Inc. Server outlier detection

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102289585A (en) * 2011-08-15 2011-12-21 重庆大学 Real-time monitoring method for energy consumption of public building based on data mining
CN102360378A (en) * 2011-10-10 2012-02-22 南京大学 Outlier detection method for time-series data
CN104462802A (en) * 2014-11-26 2015-03-25 浪潮电子信息产业股份有限公司 Method for analyzing outlier data in large-scale data
US20170019308A1 (en) * 2015-07-14 2017-01-19 Netflix, Inc. Server outlier detection

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113515102A (en) * 2020-04-10 2021-10-19 北京京东乾石科技有限公司 Exception attribution method and device
CN112035320A (en) * 2020-08-31 2020-12-04 维沃移动通信有限公司 Service monitoring method and device, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN109597702B (en) 2022-04-26

Similar Documents

Publication Publication Date Title
US9590880B2 (en) Dynamic collection analysis and reporting of telemetry data
US11451622B1 (en) Multi-tier resource and load orchestration
CN111339071B (en) Method and device for processing multi-source heterogeneous data
CN113377850A (en) Big data technology platform of cognitive Internet of things
US10721201B2 (en) Systems and methods for generating a message topic training dataset from user interactions in message clients
US20170109668A1 (en) Model for Linking Between Nonconsecutively Performed Steps in a Business Process
US10803133B2 (en) System for decomposing events from managed infrastructures that includes a reference tool signalizer
CN113632099A (en) Distributed product defect analysis system, method and computer readable storage medium
US11042525B2 (en) Extracting and labeling custom information from log messages
US20120239596A1 (en) Classification of stream-based data using machine learning
CN109656999A (en) Method of data synchronization, equipment, storage medium and the device of big data quantity
EP3693898A1 (en) Method and system for detecting and preventing an imminent failure in a target system
US20200175337A1 (en) Artificial intelligence system for inspecting image reliability
CN113282611B (en) Method, device, computer equipment and storage medium for synchronizing stream data
US11329860B2 (en) System for decomposing events that includes user interface
CN114297935A (en) Airport terminal building departure optimization operation simulation system and method based on digital twin
CN109597702A (en) Root cause analysis method, apparatus, equipment and the storage medium of messaging bus exception
US11115338B2 (en) Intelligent conversion of internet domain names to vector embeddings
US8700756B2 (en) Systems, methods and devices for extracting and visualizing user-centric communities from emails
US20170109640A1 (en) Generation of Candidate Sequences Using Crowd-Based Seeds of Commonly-Performed Steps of a Business Process
US11663109B1 (en) Automated seasonal frequency identification
CN104933077A (en) Rule-based multi-file information analysis method
CN114757157B (en) Method, apparatus, device and medium for generating an aircraft kit
US20190036760A1 (en) System for decomposing events from managed infrastructures with semantic clustering
CN114330720A (en) Knowledge graph construction method and device for cloud computing and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant