CN114371980A - Call link performance monitoring method and device, computer equipment and storage medium - Google Patents

Call link performance monitoring method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN114371980A
CN114371980A CN202210037019.8A CN202210037019A CN114371980A CN 114371980 A CN114371980 A CN 114371980A CN 202210037019 A CN202210037019 A CN 202210037019A CN 114371980 A CN114371980 A CN 114371980A
Authority
CN
China
Prior art keywords
application
information
performance index
monitoring
call
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210037019.8A
Other languages
Chinese (zh)
Inventor
屈清涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pingan Payment Technology Service Co Ltd
Original Assignee
Pingan Payment Technology Service Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pingan Payment Technology Service Co Ltd filed Critical Pingan Payment Technology Service Co Ltd
Priority to CN202210037019.8A priority Critical patent/CN114371980A/en
Publication of CN114371980A publication Critical patent/CN114371980A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3051Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3093Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes

Abstract

The embodiment of the invention relates to the technical field of data analysis, and provides a method for monitoring call link performance, which comprises the following steps: acquiring call log information between a plurality of application systems acquired in a point burying mode from a preset monitoring platform; acquiring device information corresponding to a plurality of application systems from a preset device management system; acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system; establishing an incidence relation between the performance index information of each application system and the corresponding equipment information; and generating a visualized call link performance monitoring graph according to the call log information among the plurality of application systems and the incidence relation. The embodiment of the invention can quickly position the problematic platform.

Description

Call link performance monitoring method and device, computer equipment and storage medium
Technical Field
The embodiment of the invention relates to the technical field of data processing, in particular to a method and a device for monitoring call link performance, computer equipment and a storage medium.
Background
With the diversified development of enterprise services, the service platform of an enterprise is gradually converted towards a medium-large service system, and the medium-large service platform is relatively comprehensive in supported services and relatively complex in technology, and part of services can also relate to the support of a third party, so that the performance tracking of the conventional service platform has some problems, such as: when performance problems arise, it is difficult to locate quickly.
Disclosure of Invention
In view of this, embodiments of the present invention provide a method, an apparatus, a computer device, and a computer-readable storage medium for monitoring call link performance, which are used to solve the problem that it is difficult to quickly locate a call link when a problem occurs in an existing service platform.
In order to achieve the above object, an embodiment of the present invention provides a method for monitoring call link performance, which is applied to a service platform including a plurality of application systems, and includes:
acquiring call log information between a plurality of application systems acquired in a point burying mode from a preset monitoring platform;
acquiring device information corresponding to a plurality of application systems from a preset device management system;
acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system;
establishing an incidence relation between the performance index information of each application system and the corresponding equipment information;
and generating a visualized call link performance monitoring graph according to the call log information among the plurality of application systems and the incidence relation.
Optionally, the obtaining performance index information of a plurality of application systems from a preset application performance index monitoring system includes:
and acquiring performance index information of a plurality of application systems from the application performance index monitoring system through an application program interface provided by a preset data visualization tool.
Optionally, the method further comprises:
configuring interface access quantity of a preset interface contained in a preset application system, wherein the preset application system is at least one of a plurality of application systems;
detecting whether the access quantity of the preset interface is greater than or equal to the configured interface access quantity;
and if the access quantity of the preset interface is detected to be larger than or equal to the configured interface access quantity, displaying a calling link performance monitoring graph containing the preset interface.
Optionally, the method further comprises:
configuring alarm thresholds of performance index data of a plurality of the application systems;
detecting whether the performance index data of the plurality of application systems exceeds an alarm threshold corresponding to each application system;
and if the performance index data of the application system exceeds the alarm threshold corresponding to the application system, outputting alarm information.
Optionally, the configuring the alarm threshold of the performance indicator data of the plurality of application systems includes:
acquiring historical performance index data of a plurality of application systems in a historical time period;
and configuring an alarm threshold corresponding to the performance index data of each application system according to the historical performance index data.
Optionally, if the performance index data of the application system exceeds the alarm threshold corresponding to the application system, after the step of outputting the alarm information, the method further includes:
determining an application system corresponding to the alarm information;
and displaying a calling link performance monitoring graph containing the application system corresponding to the alarm information.
Optionally, the generating a visualized call link performance monitoring graph according to the call log information between the plurality of application systems and the association relationship includes:
generating a calling link sequence diagram according to calling log information among the application systems, wherein the calling link comprises calling relations of at least two application systems;
and performing association display on the performance index information and the equipment information and the application system contained in the calling link sequence diagram according to the association relation to obtain a visual calling link performance monitoring diagram.
In order to achieve the above object, an embodiment of the present invention further provides a call link performance monitoring apparatus, where the call link performance monitoring apparatus includes:
the system comprises a first acquisition module, a second acquisition module and a third acquisition module, wherein the first acquisition module is used for acquiring call log information between a plurality of application systems acquired in a point burying mode from a preset monitoring platform;
the second acquisition module is used for acquiring the device information corresponding to the plurality of application systems from a preset device management system;
the third acquisition module is used for acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system;
the establishing module is used for establishing the incidence relation between the performance index information of each application system and the corresponding equipment information;
and the generating module is used for generating a visualized call link performance monitoring graph according to the call log information among the application systems and the incidence relation.
To achieve the above object, an embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of invoking the link performance monitoring method as described above when executing the computer program.
To achieve the above object, an embodiment of the present invention further provides a computer-readable storage medium, in which a computer program is stored, where the computer program is executable by at least one processor, so as to cause the at least one processor to execute the steps of invoking the link performance monitoring method as described above.
According to the call link performance monitoring method, the call link performance monitoring device, the computer equipment and the computer readable storage medium, call log information among a plurality of application systems collected in a point burying mode is obtained from a preset monitoring platform; acquiring device information corresponding to a plurality of application systems from a preset device management system; acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system; establishing an incidence relation between the performance index information of each application system and the corresponding equipment information; and generating a visualized call link performance monitoring graph according to the call log information among the plurality of application systems and the incidence relation. In the embodiment, the visual call link performance monitoring graph is generated through the call log information among the application systems, the performance index information of the application systems and the corresponding equipment information, so that the call details among the application systems can be visually displayed through the call link performance monitoring graph, and the problems existing in the call process can be conveniently found.
Drawings
Fig. 1 is a flowchart illustrating steps of an embodiment of a method for invoking link performance monitoring according to the present invention.
Fig. 2 is a flowchart illustrating a detailed procedure of generating a visualized call link performance monitoring graph according to the call log information and the association relationship between the plurality of application systems according to an embodiment of the present invention.
Fig. 3 is a flowchart illustrating steps of another embodiment of a method for invoking link performance monitoring according to the present invention.
Fig. 4 is a flowchart illustrating steps of another embodiment of a method for invoking link performance monitoring according to the present invention.
Fig. 5 is a flowchart illustrating a detailed process of configuring alarm thresholds of performance indicator data of a plurality of application systems according to an embodiment of the present invention.
Fig. 6 is a flowchart illustrating steps of another embodiment of a method for invoking link performance monitoring according to the present invention.
Fig. 7 is a schematic diagram of program modules of a call link performance monitoring apparatus according to an embodiment of the present invention.
Fig. 8 is a schematic diagram of a hardware structure of a computer device according to an embodiment of the present invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
The advantages of the invention are further illustrated in the following description of specific embodiments in conjunction with the accompanying drawings.
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
The terminology used in the present disclosure is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. As used in this disclosure and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items.
It is to be understood that although the terms first, second, third, etc. may be used herein to describe various information, such information should not be limited to these terms. These terms are only used to distinguish one type of information from another. For example, first information may also be referred to as second information, and similarly, second information may also be referred to as first information, without departing from the scope of the present disclosure. The word "if" as used herein may be interpreted as "at … …" or "when … …" or "in response to a determination", depending on the context.
In the description of the present invention, it should be understood that the numerical references before the steps do not identify the order of performing the steps, but merely serve to facilitate the description of the present invention and to distinguish each step, and thus should not be construed as limiting the present invention.
Referring to fig. 1, a flowchart of a method for invoking link performance monitoring according to a first embodiment of the present invention is shown. It is to be understood that the flow charts in the embodiments of the present method do not limit the order in which the steps are performed. The following description will exemplarily refer to a call link performance monitoring apparatus (hereinafter, referred to as "training apparatus") as an execution subject, where the training apparatus may be applied to a computer device, and the computer device may be a mobile phone, a tablet personal computer (tablet personal computer), a laptop computer (laptop computer), a server, or other devices having a data transmission function. The method comprises the following specific steps:
and step S10, obtaining call log information between a plurality of application systems collected in a buried point mode from a preset monitoring platform.
In particular, the monitoring platform may be a CAT monitoring platform. The CAT monitoring platform is essentially a real-time monitoring system which is mainly embodied in monitoring reports Transaction, event, promble, heartbeat and the like, and the customized monitoring model and the customized real-time analysis report of the CAT system are also the core advantages of the CAT system. The CAT monitoring platform is prone to index and link event monitoring, and is not suitable for scenes with a large number of service logs, because searching and analyzing cannot be performed, the CAT only sees the latest sample data and data with problems. The ELK should be used for a scenario with a large number of service logs.
The service platform in this embodiment needs to access the CAT monitoring platform in advance, and perform point burying in the service system, so that the CAT monitoring platform can collect call log information between application systems in the service system in a point burying manner.
The call log information may include traceid, name of application system of the requesting party, name of application system of the called party, call interface, call time, interface time consumption, service ip, call port, and other information.
Traceid is a serial number used to uniquely identify different invocation requests, that is, different invocation requests between application systems have different traceids. Each invocation request triggered by the application system has a corresponding Traceid.
It should be noted that the application system in this embodiment refers to a system that implements a specific function in the service platform. For example, if the service platform is a shopping platform, each application system in the shopping platform may be an order system, a user system, a settlement system, a commodity system, and the like.
Step S11, obtaining device information corresponding to the plurality of application systems from a preset device management system.
Specifically, the device management system is a system for managing devices (servers) that support respective application systems to implement business functions. In this embodiment, the service system needs to access the device management system in advance, so that the device management system can subsequently acquire device information corresponding to each application system.
The device information includes machine model, machine number, CPU utilization, memory utilization, and the like.
By way of example, the device supporting the application system a to implement the service function includes 30 servers, and the CPU usage rate and the memory usage rate of all the servers are 30% and 20%. The device supporting the application system B to realize the service function comprises 20 servers, wherein the CPU utilization rate of all the servers is 40%, and the memory utilization rate is 30%.
Step S12, obtaining performance index information of a plurality of application systems from a preset application performance index monitoring system.
Specifically, the application performance index monitoring system may be Prometheus (Prometheus) monitoring system, which is a set of open source monitoring & alarm & time series database combinations developed by sourdcloud corporation. The performance index information of each application system can be monitored by the monitoring system.
In this embodiment, the service system needs to access the application performance index monitoring system in advance, so that the performance index information of each application system can be subsequently acquired by the application performance index monitoring system.
The performance index information may include a dubbo thread number, an interface consumption time, an interface call QPS (Query Per Second)/TPS (Transactions Per Second, number of transmissions Per Second), a database CPU, an active session number act, a maximum connection number max conn, a used connection number use conn, a current QPS/TPS, an interface performance, an interface influence table, an interface parameter, a parameter output, a current limit index, a degradation index, and the like.
In an exemplary embodiment, the obtaining performance index information of a plurality of application systems from a preset application performance index monitoring system includes: and acquiring performance index information of a plurality of application systems from the application performance index monitoring system through an application program interface provided by a preset data visualization tool.
In particular, the data visualization tool may be grafana. Wherein, grafana is an open source data visualization tool developed by Go language, can be used for data monitoring and data statistics, and has an alarm function.
Step S13, an association between the performance index information of each application system and the corresponding device information is established.
Specifically, after obtaining the performance index information of each application system and the device information of each application system, the two may be associated and bound, so that the device information may be displayed simultaneously when the performance index information of each application system is displayed.
Step S14, generating a visualized call link performance monitoring graph according to the call log information among the application systems and the incidence relation.
Specifically, the call log information includes call relationships between the respective application systems. For example, the call log information is used for calling the application system B for the application system a, and the application system B is used for calling the application system C, so that the call links of each piece of application information can be obtained by calling the log information, and the association relationship includes the performance index information and the device information of each application system, so that the visual call link performance monitoring graph can be obtained by displaying the association relationship and the call links in a combined manner. Therefore, details in each application system can be seen through the call link performance monitoring graph, and when a certain application system fails, the application system with a problem can be quickly positioned according to the call link performance monitoring graph.
In an exemplary embodiment, referring to fig. 2, the generating a visualized call link performance monitoring graph according to the call log information and the association relationship between a plurality of the application systems includes:
step S20, generating a call link sequence diagram according to the call log information between the plurality of application systems, where the call link includes the call relationship between at least two application systems.
Specifically, each piece of call log information includes a call relationship between application systems, and the call relationship between the application systems included in the call log information is displayed in time sequence, so that a call link timing chart can be obtained. The display according to the time sequence means that the display is performed according to a calling party and a called party between the application systems, namely the calling party is displayed in front and the called party is displayed behind.
And step S21, performing association display on the performance index information, the equipment information and the application system contained in the calling link sequence diagram according to the association relation to obtain a visual calling link performance monitoring diagram.
Specifically, after the call link timing chart is generated, the application system in the call link timing chart may be displayed in association with the performance index information and the device information, so that a call link performance monitoring chart may be obtained, and details in the application system may be displayed at the same time through the call link performance monitoring. Therefore, when a certain application system has a fault, the application system with the problem can be quickly positioned according to the calling link performance monitoring graph.
According to the call link performance monitoring method, the call link performance monitoring device, the computer equipment and the computer readable storage medium, call log information among a plurality of application systems collected in a point burying mode is obtained from a preset monitoring platform; acquiring device information corresponding to a plurality of application systems from a preset device management system; acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system; establishing an incidence relation between the performance index information of each application system and the corresponding equipment information; and generating a visualized call link performance monitoring graph according to the call log information among the plurality of application systems and the incidence relation. In the embodiment, the visual call link performance monitoring graph is generated through the call log information among the application systems, the performance index information of the application systems and the corresponding equipment information, so that the call details among the application systems can be visually displayed through the call link performance monitoring graph, and the problems existing in the call process can be conveniently found.
In an exemplary embodiment, referring to fig. 3, the method further comprises:
step S30, configuring an interface access amount of a preset interface included in a preset application system, where the preset application system is at least one of the plurality of application systems.
Specifically, the interface access amount refers to the number of times of calling the preset interface within a preset time period. The interface access amount can be flexibly configured and modified according to actual conditions. The preset time period can also be flexibly set and modified according to actual conditions, for example, the preset time period is 1 minute, and the interface access amount is 1000 times.
Step S31, detecting whether the access amount of the preset interface is greater than or equal to the configured interface access amount.
Step S32, if it is detected that the access amount of the preset interface is greater than or equal to the configured interface access amount, displaying a call link performance monitoring graph including the preset interface.
Specifically, it may be detected in real time or at regular time whether the access amount of the preset interface is greater than or equal to a pre-configured interface access amount, and when it is detected that the access amount of the preset interface is greater than or equal to the configured interface access amount, which indicates that a hot event occurs currently, a call link performance monitoring graph including the preset interface may be immediately displayed, so that a user may view specific details through the currently displayed call link performance monitoring graph.
In an embodiment, when it is detected that the access amount of the preset interface is smaller than the pre-configured interface access amount, processing may be performed according to a normal flow.
As an example, the preset interface is an order interface, and when it is detected that the number of times of calling the order interface in a short time exceeds the access amount of the interface, the preset interface automatically transfers to a calling link performance monitoring graph including the order interface, so that a user can query the reason for a large number of order situations in a short time through the calling link performance monitoring graph.
In an exemplary embodiment, referring to fig. 4, the method further comprises:
step S40, configuring alarm thresholds of performance index data of a plurality of the application systems.
Specifically, the alarm threshold may be flexibly set and modified according to actual conditions. The alarm thresholds configured by different application systems may be the same or different. The performance index data is preferably the number of dubbo threads of each application system, the interface time consumption, the interface call QPS/TPS, the active session number act, the maximum connection number max conn and the used connection number use conn.
Step S41, detecting whether the performance index data of the plurality of application systems exceeds the alarm threshold corresponding to each application system.
Specifically, whether the performance index data of the multiple application systems exceeds the alarm threshold corresponding to each application system may be detected in real time through multiple threads, that is, each thread detects the performance index data of one application system, so as to determine whether the detected performance index data exceeds the alarm threshold corresponding to the application system.
In an embodiment, one thread may also be used to perform time-sharing detection on the performance index data of a plurality of application systems, that is, the thread detects the performance index data of a first application system at a first time point to determine whether the detected performance index data exceeds an alarm threshold corresponding to the application system; the performance index data of the second application system may be detected at the second point in time by the thread to determine whether the detected performance index data exceeds an alarm threshold corresponding to the application system. Similarly, the performance index data of the nth application system may be detected at the nth time point through the thread to determine whether the detected performance index data exceeds an alarm threshold corresponding to the application system. Wherein, the first time point, the second time point, …, and the nth time point refer to different time periods. The first application system, the second application system, …, and the nth application system refer to one of the plurality of application systems.
And step S42, if the performance index data of the application system exceeds the alarm threshold corresponding to the application system, outputting alarm information.
Specifically, in the detection process, if it is detected that the performance index data of the application system exceeds the alarm threshold corresponding to the application system, an alarm may be given to the application system. In a specific scenario, the alarm operation may be implemented by outputting alarm information. In this embodiment, the alarm information may include a system name of the application system, specifically which performance index data of the application system exceeds the alarm threshold, a specific value of the performance index data of the alarm, and the like.
In one embodiment, the alarm information may also include a reason for the possible occurrence of the alarm information, a treatment recommendation, and the like.
In this embodiment, the performance index data of the application system is preset with the alarm threshold, so that when the performance index data is detected to exceed the alarm threshold, an alarm can be given in time, so that a user can process alarm information in time.
In an embodiment, when the alarm information is output, in order to enable the administrator user to process the alarm information in time, the alarm information may be sent to the administrator user by a short message, a mail, or the like.
In an exemplary embodiment, referring to FIG. 5, configuring alarm thresholds for performance indicator data for a plurality of the application systems comprises: step S50, obtaining historical performance index data of a plurality of the application systems in a historical time period.
Specifically, the historical time period may be flexibly set and modified according to actual conditions, for example, the historical time period is a past week time. The historical performance index data is performance index data of each application system acquired in a historical time period.
And step S51, configuring alarm threshold values corresponding to the performance index data of each application system according to the historical performance index data.
Specifically, after the historical performance index data of each application system is acquired, the acquired historical performance index data may be averaged, and the obtained average value may be used as the alarm threshold, or a median value, a maximum value, and the like in the historical performance index data may be used as the alarm threshold.
In an exemplary embodiment, referring to fig. 6, after the step of outputting the warning information if the performance index data of the application system exceeds the warning threshold corresponding to the application system, the method further includes: step S60, determining an application system corresponding to the alarm information; and step S61, displaying a call link performance monitoring chart of the application system corresponding to the alarm information.
Specifically, when an alarm condition of one application system is detected, a corresponding call link performance monitoring graph is displayed, so that each application system included in the call link performance monitoring graph can be analyzed in time, and possible problems can be found out.
Referring to fig. 7, a schematic diagram of program modules of a call link performance monitoring apparatus 700 (hereinafter, referred to as "monitoring apparatus 700") according to an embodiment of the invention is shown. The monitoring apparatus 700 may be applied to a computer device, which may be a mobile phone, a tablet personal computer (tablet personal computer), a laptop computer (laptop computer), a server, or other devices having a data transmission function. In this embodiment, the monitoring apparatus 700 may include or be divided into one or more program modules, and the one or more program modules are stored in a storage medium and executed by one or more processors to implement the present invention and implement the above-described call link performance monitoring method. The program module referred to in the embodiments of the present invention refers to a series of computer program instruction segments capable of performing specific functions, and is more suitable for describing the execution process of the call link performance monitoring method in the storage medium than the program itself. In this embodiment, the call link performance monitoring apparatus 700 includes a first obtaining module 701, a second obtaining module 702, a third obtaining module 703, an establishing module 704, and a generating module 705. The following description will specifically describe the functions of the program modules of the present embodiment:
the first obtaining module 701 is configured to obtain, from a preset monitoring platform, call log information between multiple application systems, which is collected in a point burying manner.
In particular, the monitoring platform may be a CAT monitoring platform. The CAT monitoring platform is essentially a real-time monitoring system which is mainly embodied in monitoring reports Transaction, event, promble, heartbeat and the like, and the customized monitoring model and the customized real-time analysis report of the CAT system are also the core advantages of the CAT system. The CAT monitoring platform is prone to index and link event monitoring, and is not suitable for scenes with a large number of service logs, because searching and analyzing cannot be performed, the CAT only sees the latest sample data and data with problems. The ELK should be used for a scenario with a large number of service logs.
The service platform in this embodiment needs to access the CAT monitoring platform in advance, and perform point burying in the service system, so that the CAT monitoring platform can collect call log information between application systems in the service system in a point burying manner.
The call log information may include traceid, name of application system of the requesting party, name of application system of the called party, call interface, call time, interface time consumption, service ip, call port, and other information.
Traceid is a serial number used to uniquely identify different invocation requests, that is, different invocation requests between application systems have different traceids. Each invocation request triggered by the application system has a corresponding Traceid.
It should be noted that the application system in this embodiment refers to a system that implements a specific function in the service platform. For example, if the service platform is a shopping platform, each application system in the shopping platform may be an order system, a user system, a settlement system, a commodity system, and the like.
A second obtaining module 702, configured to obtain device information corresponding to multiple application systems from a preset device management system.
Specifically, the device management system is a system for managing devices (servers) that support respective application systems to implement business functions. In this embodiment, the service system needs to access the device management system in advance, so that the device management system can subsequently acquire device information corresponding to each application system.
The device information includes machine model, machine number, CPU utilization, memory utilization, and the like.
By way of example, the device supporting the application system a to implement the service function includes 30 servers, and the CPU usage rate and the memory usage rate of all the servers are 30% and 20%. The device supporting the application system B to realize the service function comprises 20 servers, wherein the CPU utilization rate of all the servers is 40%, and the memory utilization rate is 30%.
A third obtaining module 703 is configured to obtain performance index information of a plurality of application systems from a preset application performance index monitoring system.
Specifically, the application performance index monitoring system may be Prometheus (Prometheus) monitoring system, which is a set of open source monitoring & alarm & time series database combinations developed by sourdcloud corporation. The performance index information of each application system can be monitored by the monitoring system.
In this embodiment, the service system needs to access the application performance index monitoring system in advance, so that the performance index information of each application system can be subsequently acquired by the application performance index monitoring system.
The performance index information may include a dubbo thread number, an interface consumption time, an interface call QPS (Query Per Second)/TPS (Transactions Per Second, number of transmissions Per Second), a database CPU, an active session number act, a maximum connection number max conn, a used connection number use conn, a current QPS/TPS, an interface performance, an interface influence table, an interface parameter, a parameter output, a current limit index, a degradation index, and the like.
In an exemplary embodiment, the third obtaining module 703 is further configured to obtain performance index information of a plurality of application systems from the application performance index monitoring system through an application program interface provided by a preset data visualization tool.
In particular, the data visualization tool may be grafana. Wherein, grafana is an open source data visualization tool developed by Go language, can be used for data monitoring and data statistics, and has an alarm function.
The establishing module 704 is configured to establish an association relationship between the performance index information of each application system and the corresponding device information.
Specifically, after obtaining the performance index information of each application system and the device information of each application system, the performance index information of each application system and the device information of each application system may be associated and bound, so that the device information may be displayed simultaneously when the performance index information of each application system is displayed.
A generating module 705, configured to generate a visualized call link performance monitoring graph according to the call log information between multiple application systems and the association relationship.
Specifically, the call log information includes call relationships between the respective application systems. For example, the call log information is used for calling the application system B for the application system a, and the application system B is used for calling the application system C, so that the call links of each piece of application information can be obtained by calling the log information, and the association relationship includes the performance index information and the device information of each application system, so that the visual call link performance monitoring graph can be obtained by displaying the association relationship and the call links in a combined manner. Therefore, details in each application system can be seen through the call link performance monitoring graph, and when a certain application system fails, the application system with a problem can be quickly positioned according to the call link performance monitoring graph.
In an exemplary embodiment, the generating module 705 is further configured to generate a call link timing chart according to call log information between a plurality of the application systems, where the call link includes a call relationship between at least two application systems.
Specifically, each piece of call log information includes a call relationship between application systems, and the call relationship between the application systems included in the call log information is displayed in time sequence, so that a call link timing chart can be obtained. The display according to the time sequence means that the display is performed according to a calling party and a called party between the application systems, namely the calling party is displayed in front and the called party is displayed behind.
The generating module 705 is further configured to perform association display on the performance index information, the device information, and the application system included in the call link timing diagram according to the association relationship, so as to obtain a visual call link performance monitoring diagram.
Specifically, after the call link timing chart is generated, the application system in the call link timing chart may be displayed in association with the performance index information and the device information, so that a call link performance monitoring chart may be obtained, and details in the application system may be displayed at the same time through the call link performance monitoring. Therefore, when a certain application system has a fault, the application system with the problem can be quickly positioned according to the calling link performance monitoring graph.
According to the call link performance monitoring method, the call link performance monitoring device, the computer equipment and the computer readable storage medium, call log information among a plurality of application systems collected in a point burying mode is obtained from a preset monitoring platform; acquiring device information corresponding to a plurality of application systems from a preset device management system; acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system; establishing an incidence relation between the performance index information of each application system and the corresponding equipment information; and generating a visualized call link performance monitoring graph according to the call log information among the plurality of application systems and the incidence relation. In the embodiment, the visual call link performance monitoring graph is generated through the call log information among the application systems, the performance index information of the application systems and the corresponding equipment information, so that the call details among the application systems can be visually displayed through the call link performance monitoring graph, and the problems existing in the call process can be conveniently found.
In an exemplary embodiment, the monitoring apparatus 700 further includes a configuration module, a detection module, and a display module.
The configuration module is configured to configure an interface access amount of a preset interface included in a preset application system, where the preset application system is at least one of the plurality of application systems.
Specifically, the interface access amount refers to the number of times of calling the preset interface within a preset time period. The interface access amount can be flexibly configured and modified according to actual conditions. The preset time period can also be flexibly set and modified according to actual conditions, for example, the preset time period is 1 minute, and the interface access amount is 1000 times.
The detection module is used for detecting whether the access quantity of the preset interface is larger than or equal to the configured interface access quantity.
And the display module is used for displaying a calling link performance monitoring chart containing the preset interface if the fact that the access quantity of the preset interface is greater than or equal to the configured interface access quantity is detected.
Specifically, it may be detected in real time or at regular time whether the access amount of the preset interface is greater than or equal to a pre-configured interface access amount, and when it is detected that the access amount of the preset interface is greater than or equal to the configured interface access amount, which indicates that a hot event occurs currently, a call link performance monitoring graph including the preset interface may be immediately displayed, so that a user may view specific details through the currently displayed call link performance monitoring graph.
In an embodiment, when it is detected that the access amount of the preset interface is smaller than the pre-configured interface access amount, processing may be performed according to a normal flow.
As an example, the preset interface is an order interface, and when it is detected that the number of times of calling the order interface in a short time exceeds the access amount of the interface, the preset interface automatically transfers to a calling link performance monitoring graph including the order interface, so that a user can query the reason for a large number of order situations in a short time through the calling link performance monitoring graph.
In an exemplary embodiment, the monitoring apparatus 700 further includes an output module.
The configuration module is further configured to configure alarm thresholds of the performance index data of the plurality of application systems.
The detection module is further configured to detect whether performance index data of the plurality of application systems exceeds an alarm threshold corresponding to each application system.
Specifically, whether the performance index data of the multiple application systems exceeds the alarm threshold corresponding to each application system may be detected in real time through multiple threads, that is, each thread detects the performance index data of one application system, so as to determine whether the detected performance index data exceeds the alarm threshold corresponding to the application system.
In an embodiment, one thread may also be used to perform time-sharing detection on the performance index data of a plurality of application systems, that is, the thread detects the performance index data of a first application system at a first time point to determine whether the detected performance index data exceeds an alarm threshold corresponding to the application system; the performance index data of the second application system may be detected at the second point in time by the thread to determine whether the detected performance index data exceeds an alarm threshold corresponding to the application system. Similarly, the performance index data of the nth application system may be detected at the nth time point through the thread to determine whether the detected performance index data exceeds an alarm threshold corresponding to the application system. Wherein, the first time point, the second time point, …, and the nth time point refer to different time periods. The first application system, the second application system, …, and the nth application system refer to one of the plurality of application systems.
The output module is further configured to output alarm information if the performance index data of the application system exceeds an alarm threshold corresponding to the application system.
Specifically, in the detection process, if it is detected that the performance index data of the application system exceeds the alarm threshold corresponding to the application system, an alarm may be given to the application system. In a specific scenario, the alarm operation may be implemented by outputting alarm information. In this embodiment, the alarm information may include a system name of the application system, specifically which performance index data of the application system exceeds the alarm threshold, a specific value of the performance index data of the alarm, and the like.
In one embodiment, the alarm information may also include a reason for the possible occurrence of the alarm information, a treatment recommendation, and the like.
In an exemplary embodiment, the configuration module is further configured to obtain historical performance indicator data of a plurality of the application systems in a historical time period.
Specifically, the historical time period may be flexibly set and modified according to actual conditions, for example, the historical time period is a past week time. The historical performance index data is performance index data of each application system acquired in a historical time period.
The configuration module is further configured to configure an alarm threshold corresponding to the performance index data of each application system according to the historical performance index data.
Specifically, after the historical performance index data of each application system is acquired, the acquired historical performance index data may be averaged, and the obtained average value may be used as the alarm threshold, or a median value, a maximum value, and the like in the historical performance index data may be used as the alarm threshold.
In an exemplary embodiment, the monitoring apparatus 700 further includes a determination module.
And the determining module is used for determining the application system corresponding to the alarm information.
The display module is further configured to display a call link performance monitoring graph including the application system corresponding to the alarm information.
Specifically, when an alarm condition of one application system is detected, a corresponding call link performance monitoring graph is displayed, so that each application system included in the call link performance monitoring graph can be analyzed in time, and possible problems can be found out.
Fig. 8 is a schematic diagram of a hardware architecture of a computer apparatus 800 according to an embodiment of the present invention. In this embodiment, the computer device 800 is a device capable of automatically performing numerical calculation and/or information processing according to an instruction set or stored in advance. As shown, the computer apparatus 800 includes, but is not limited to, at least a memory 801, a processor 802, and a network interface 803 communicatively connected to each other via a device bus. Wherein:
in this embodiment, the memory 801 includes at least one type of computer-readable storage medium including a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory, etc.), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a Read Only Memory (ROM), an Electrically Erasable Programmable Read Only Memory (EEPROM), a Programmable Read Only Memory (PROM), a magnetic memory, a magnetic disk, an optical disk, and the like. In some embodiments, the storage 801 may be an internal storage unit of the computer device 800, such as a hard disk or a memory of the computer device 800. In other embodiments, the memory 801 may also be an external storage device of the computer device 800, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), or the like, provided on the computer device 800. Of course, the memory 801 may also include both internal and external memory units to the computer device 800. In this embodiment, the memory 801 is generally used for storing an operating device and various application software installed in the computer apparatus 800, such as a program code for calling the link performance monitoring apparatus 700. In addition, the memory 801 can also be used to temporarily store various types of data that have been output or are to be output.
Processor 802 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data Processing chip in some embodiments. The processor 802 generally operates to control the overall operation of the computer device 800. In this embodiment, the processor 802 is configured to run program codes stored in the memory 801 or process data, for example, run the call link performance monitoring apparatus 700, so as to implement the call link performance monitoring method in the above embodiments.
The network interface 803 may include a wireless network interface or a wired network interface, and the network interface 803 is generally used for establishing a communication link between the computer apparatus 800 and other electronic devices. For example, the network interface 803 is used to connect the computer apparatus 800 to an external terminal via a network, establish a data transmission channel and a communication connection between the computer apparatus 800 and the external terminal, and the like. The network may be a wireless or wired network such as an Intranet (Intranet), the Internet (Internet), a Global System of Mobile communication (GSM), Wideband Code Division Multiple Access (WCDMA), 4G network, 5G network, Bluetooth (Bluetooth), Wi-Fi, and the like.
It is noted that fig. 8 only shows computer device 800 with components 801 and 803, but it is understood that not all of the shown components are required and that more or less components may be implemented instead.
In this embodiment, the call link performance monitoring apparatus 700 stored in the memory 801 may be further divided into one or more program modules, and the one or more program modules are stored in the memory 801 and executed by one or more processors (in this embodiment, the processor 802) to implement the call link performance monitoring method of the present invention.
The present embodiment also provides a computer-readable storage medium, such as a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory, etc.), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a read-only memory (ROM), an electrically erasable programmable read-only memory (EEPROM), a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, a server, an App application mall, etc., on which a computer program is stored, which when executed by a processor implements corresponding functions. The computer readable storage medium of this embodiment is used for storing the call link performance monitoring apparatus 700 to implement the call link performance monitoring method of the present invention when executed by a processor.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner.
The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (10)

1. A method for monitoring call link performance is applied to a service platform comprising a plurality of application systems, and is characterized by comprising the following steps:
acquiring call log information between a plurality of application systems acquired in a point burying mode from a preset monitoring platform;
acquiring device information corresponding to a plurality of application systems from a preset device management system;
acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system;
establishing an incidence relation between the performance index information of each application system and the corresponding equipment information;
and generating a visualized call link performance monitoring graph according to the call log information among the plurality of application systems and the incidence relation.
2. The call link performance monitoring method according to claim 1, wherein the obtaining performance index information of a plurality of application systems from a preset application performance index monitoring system comprises:
and acquiring performance index information of a plurality of application systems from the application performance index monitoring system through an application program interface provided by a preset data visualization tool.
3. The call link performance monitoring method of claim 1, wherein the method further comprises:
configuring interface access quantity of a preset interface contained in a preset application system, wherein the preset application system is at least one of a plurality of application systems;
detecting whether the access quantity of the preset interface is greater than or equal to the configured interface access quantity;
and if the access quantity of the preset interface is detected to be larger than or equal to the configured interface access quantity, displaying a calling link performance monitoring graph containing the preset interface.
4. The call link performance monitoring method of claim 1, wherein the method further comprises:
configuring alarm thresholds of performance index data of a plurality of the application systems;
detecting whether the performance index data of the plurality of application systems exceeds an alarm threshold corresponding to each application system;
and if the performance index data of the application system exceeds the alarm threshold corresponding to the application system, outputting alarm information.
5. The call link performance monitoring method of claim 4, wherein said configuring an alarm threshold for performance metric data for a plurality of said application systems comprises:
acquiring historical performance index data of a plurality of application systems in a historical time period;
and configuring an alarm threshold corresponding to the performance index data of each application system according to the historical performance index data.
6. The call link performance monitoring method according to claim 4, wherein after the step of outputting the alarm information if the performance index data of the application system exceeds the alarm threshold corresponding to the application system, the method further comprises:
determining an application system corresponding to the alarm information;
and displaying a calling link performance monitoring graph containing the application system corresponding to the alarm information.
7. The call link performance monitoring method according to any one of claims 1 to 6, wherein the generating a visualized call link performance monitoring graph according to the call log information and the association relationship between the plurality of application systems includes:
generating a calling link sequence diagram according to calling log information among the application systems, wherein the calling link comprises calling relations of at least two application systems;
and performing association display on the performance index information and the equipment information and the application system contained in the calling link sequence diagram according to the association relation to obtain a visual calling link performance monitoring diagram.
8. A call link performance monitoring apparatus, comprising:
the system comprises a first acquisition module, a second acquisition module and a third acquisition module, wherein the first acquisition module is used for acquiring call log information between a plurality of application systems acquired in a point burying mode from a preset monitoring platform;
the second acquisition module is used for acquiring the device information corresponding to the plurality of application systems from a preset device management system;
the third acquisition module is used for acquiring performance index information of a plurality of application systems from a preset application performance index monitoring system;
the establishing module is used for establishing the incidence relation between the performance index information of each application system and the corresponding equipment information;
and the generating module is used for generating a visualized call link performance monitoring graph according to the call log information among the application systems and the incidence relation.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of invoking the link performance monitoring method according to any one of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium, having stored therein a computer program executable by at least one processor to cause the at least one processor to perform the steps of invoking the link performance monitoring method according to any one of claims 1-7.
CN202210037019.8A 2022-01-13 2022-01-13 Call link performance monitoring method and device, computer equipment and storage medium Pending CN114371980A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210037019.8A CN114371980A (en) 2022-01-13 2022-01-13 Call link performance monitoring method and device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210037019.8A CN114371980A (en) 2022-01-13 2022-01-13 Call link performance monitoring method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN114371980A true CN114371980A (en) 2022-04-19

Family

ID=81144717

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210037019.8A Pending CN114371980A (en) 2022-01-13 2022-01-13 Call link performance monitoring method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114371980A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115904872A (en) * 2022-12-01 2023-04-04 深圳市雁联计算系统有限公司 Transaction link monitoring method, system and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115904872A (en) * 2022-12-01 2023-04-04 深圳市雁联计算系统有限公司 Transaction link monitoring method, system and storage medium
CN115904872B (en) * 2022-12-01 2024-03-12 深圳市雁联计算系统有限公司 Transaction link monitoring method, system and storage medium

Similar Documents

Publication Publication Date Title
US10956424B2 (en) Application recommending method and system, and server
US9369521B2 (en) Naming of distributed business transactions
CN113176978B (en) Monitoring method, system, equipment and readable storage medium based on log file
CN112631913B (en) Method, device, equipment and storage medium for monitoring operation faults of application program
CN110232010A (en) A kind of alarm method, alarm server and monitoring server
US20080244616A1 (en) System and method for message lifetime management
CN107566172B (en) Active management method and system based on storage system
CN109409948B (en) Transaction abnormity detection method, device, equipment and computer readable storage medium
CN114371980A (en) Call link performance monitoring method and device, computer equipment and storage medium
CN114553682A (en) Real-time alarm method, system, computer equipment and storage medium
CN110727563A (en) Cloud service alarm method and device for preset customer
CN110674149A (en) Service data processing method and device, computer equipment and storage medium
CN111130944B (en) System monitoring method and system
CN111710403A (en) Medical equipment supervision method, equipment and readable storage medium
CN108234658B (en) Method and device for sensing health condition of server cluster and server
CN110113208A (en) Alarm information processing method, device, equipment and computer readable storage medium
CN115499302A (en) Monitoring method and device of business system, readable storage medium and electronic equipment
CN109766238B (en) Session number-based operation and maintenance platform performance monitoring method and device and related equipment
US10296967B1 (en) System, method, and computer program for aggregating fallouts in an ordering system
CN113608982A (en) Function execution performance monitoring method and device, computer equipment and storage medium
CN112596974A (en) Full link monitoring method, device, equipment and storage medium
CN114024867B (en) Network anomaly detection method and device
CN112653567B (en) Monitoring method, monitoring device, computer equipment and storage medium
CN113920698B (en) Early warning method, device, equipment and medium for interface abnormal call
CN116484373B (en) Abnormal process checking and killing method, system, device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination