WO2024060776A1 - Service health status display method and apparatus, and device and storage medium - Google Patents

Service health status display method and apparatus, and device and storage medium

Info

Publication number
WO2024060776A1
WO2024060776A1 PCT/CN2023/104819 CN2023104819W WO2024060776A1 WO 2024060776 A1 WO2024060776 A1 WO 2024060776A1 CN 2023104819 W CN2023104819 W CN 2023104819W WO 2024060776 A1 WO2024060776 A1 WO 2024060776A1
Authority
WO
WIPO (PCT)
Prior art keywords
service
query
status
health
time range
Prior art date
Application number
PCT/CN2023/104819
Other languages
French (fr)
Chinese (zh)
Inventor
郭良
侯瑞军
Original Assignee
华为云计算技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为云计算技术有限公司 filed Critical 华为云计算技术有限公司
Publication of WO2024060776A1 publication Critical patent/WO2024060776A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/32 Monitoring with visual or acoustical indication of the functioning of the machine

Definitions

  • the present application relates to the field of computer technology, and in particular to a method, device, equipment and storage medium for displaying service health status.
  • the display method of the health status of services in related technologies cannot accurately represent the health status of services, causing operation and maintenance personnel to be unable to accurately locate service faults.
  • Embodiments of the present application provide a method, device, equipment and storage medium for displaying service health status, which can display the status identifier of the service based on the impact of the frequency of the original health status appearing within the query time range on the final health status, thereby improving the accuracy of representing the health status of a service.
  • embodiments of the present application provide a method for displaying service health status, including:
  • the indicator data of the service is used to reflect the original health status of the service
  • the status identifier of the service is displayed for users to locate faults based on the status identifier of the service.
  • the status identifier is used to indicate the final health state of the service, and the weight is used to indicate the degree of influence of the frequency of the original health state on the final health state.
  • the embodiment of this application displays the status identifier of the service by combining the weight of the original health state and the frequency of the original health state within the query time, so that the status identifier of the service includes the impact of the frequency of the original health state on the final health state, which improves the accuracy of representing the health status of the service and thereby improves the accuracy of service fault location.
  • the method for displaying the service health status further includes:
  • the corresponding weights of multiple original health states are determined
  • the weight determination model is used to indicate the correspondence between the frequency of multiple original health states of the service and the final health state of the service.
  • the weight is determined through the weight determination model, so that the weight of the original health state includes the degree of influence of the frequency of the original health state on the final health state.
  • the display method of service health status also includes:
  • the status identifier of the service is displayed, including:
  • the status identifier of the service is displayed based on the status score corresponding to the service and the corresponding score ranges of multiple final health states.
  • the display method of service health status also includes:
  • determining the indicator data of the service within the query time range includes:
  • the indicator data of the service is collected from all indicator data of the service.
  • the data amount of the service indicator data is not greater than the target amount
  • the target number is determined based on the preconfigured collection step, which is used to indicate the length of the time interval between the indicator data of the service.
  • determining the query step size of the data query includes:
  • the query step is determined based on the collection step, the preset time length threshold and the query time range.
  • the query step size is determined adaptively through the collection step size, thereby avoiding missed reports of service failures, thus balancing computing resources and the accuracy of health monitoring.
  • the status identification includes a color identification or a shape identification.
  • the display method of service health status also includes:
  • the raw health status of the target service at multiple moments in the query time range is displayed.
  • embodiments of the present application provide an apparatus for displaying service health status, including:
  • the first determination module is used to determine the query time range of data query
  • the second determination module is used to determine the indicator data of the service within the query time range.
  • the indicator data of the service is used to reflect the original health status of the service;
  • the third determination module is used to determine the frequency of each of the multiple original health states of the service within the query time range based on the indicator data of the service within the query time range;
  • the display module is used to display the status identifier of the service based on the corresponding weights of the multiple original health states of the service and the frequency of occurrence of the service in the multiple original health states within the query time, so that the user can perform operations based on the status identifier of the service.
  • the status identifier is used to indicate the final health status of the service
  • the weight is used to indicate the impact of the frequency of the original health status on the final health status.
  • the embodiment of this application displays the status identifier of the service by combining the weight of the original health state and the frequency of the original health state within the query time, so that the status identifier of the service includes the impact of the frequency of the original health state on the final health state, which improves the accuracy of representing the health status of the service and thereby improves the accuracy of service fault location.
  • the service health status display device also includes:
  • a fourth determination module is used to determine the weights corresponding to the various original health states according to a pre-configured weight determination model and a preset time length threshold;
  • the weight determination model is used to indicate the correspondence between the frequency of multiple original health states of the service and the final health state of the service.
  • the weight is determined through the weight determination model, so that the weight of the original health state includes the degree of influence of the frequency of the original health state on the final health state.
  • the fourth determination module is also used to determine the state score ranges corresponding to the multiple final health states based on the respective weights and preset time length thresholds of the multiple original health states;
  • the calculation module is used to calculate the status score corresponding to the service based on the respective weights of the multiple original health states of the service and the respective frequencies of the multiple original health states within the query time;
  • the display module is used to display the status identifier of the service according to the status score corresponding to the service and the score ranges corresponding to the multiple final health states.
  • the first determination module is also used to determine the query step size of the data query
  • the second determination module is used to collect the indicator data of the service from all the indicator data of the service according to the query step size and the query time range.
  • the data amount of the service indicator data is not greater than the target amount
  • the target number is determined based on the preconfigured collection step, which is used to indicate the length of the time interval between the indicator data of the service.
  • the first determination module is used to determine the collection step, where the collection step is used to indicate the length of the time interval between all indicator data of the service, and to determine the query step size according to the collection step, the preset time length threshold and the query time range.
  • the query step size is determined adaptively through the collection step size, thereby avoiding missed reports of service failures, thus balancing computing resources and the accuracy of health monitoring.
  • the status identification includes a color identification or a shape identification.
  • the device further includes:
  • the receiving module is used to receive the user's operation on the status identification of the target service
  • the display module is also used to display the original health status of the target service at multiple moments within the query time range based on operations.
  • embodiments of the present application provide an apparatus for displaying service health status, including: at least one memory for storing a program; and at least one processor for executing the program stored in the memory. When the program stored in the memory is executed, the processor is used to execute the method provided in the first aspect.
  • embodiments of the present application provide an apparatus for displaying service health status, characterized in that the device runs computer program instructions to execute the method provided in the second aspect.
  • the device may be a chip or a processor.
  • the apparatus may include a processor, which may be coupled to a memory, read instructions in the memory and execute the method provided in the first aspect according to the instructions.
  • the memory may be integrated into the chip or processor, or may be independent of the chip or processor.
  • embodiments of the present application provide a computer storage medium. Instructions are stored in the computer storage medium. When the instructions are run on a computer, they cause the computer to execute the method provided in the first aspect.
  • embodiments of the present application provide a computer program product containing instructions. When the instructions are run on a computer, they cause the computer to execute the method provided in the first aspect.
  • Figure 1 is a system architecture diagram of a business system provided by an embodiment of the present application.
  • Figure 2 is a schematic flow chart of a service indicator data collection method provided by an embodiment of the present application.
  • Figure 3 is a schematic flowchart of a method for displaying service health status provided by an embodiment of the present application
  • Figure 4 is a schematic diagram of a human-computer interaction interface provided by an embodiment of the present application.
  • FIG. 5 is a schematic diagram of another human-computer interaction interface provided by an embodiment of the present application.
  • Figure 6 is a schematic diagram of yet another human-computer interaction interface provided by an embodiment of the present application.
  • Figure 7 is a schematic diagram of a service health status display interface provided by an embodiment of the present application.
  • Figure 8 is a schematic diagram of another service health status display interface provided by an embodiment of the present application.
  • Figure 9 is a schematic structural diagram of a service health status display device provided by an embodiment of the present application.
  • Figure 10 is a schematic structural diagram of a computing device provided by an embodiment of the present application.
  • the term "and/or" is only an association relationship describing associated objects, indicating that there can be three relationships.
  • A and/or B can mean three situations: A exists alone, A and B exist at the same time, and B exists alone.
  • the term "plurality" means two or more.
  • multiple systems refer to two or more systems
  • multiple terminals refer to two or more terminals.
  • first and second are only used for descriptive purposes and cannot be understood as indicating or implying relative importance or implicitly indicating the indicated technical features. Therefore, features defined as “first” and “second” may explicitly or implicitly include one or more of these features.
  • the terms “including,” “includes,” “having,” and variations thereof all mean “including but not limited to,” unless otherwise specifically emphasized.
  • FIG 1 is an architectural diagram of a business system involved in the embodiment of this application.
  • the business system includes the following network devices: a management platform 110 and several host devices 120.
  • the management platform 110 may be a single computing device, or it may be a service cluster composed of multiple computing devices, or it may be a cloud computing center, or it may be a hyper terminal.
  • the computing device involved in this solution can be used to provide cloud services, and it can establish communication connections with several host devices to provide computing functions and/or storage functions for the host devices.
  • the management platform involved in the embodiments of this application may be a hardware device, or may be embedded in a virtualized environment.
  • the management platform involved in this solution may be a virtual machine executed on a hardware device including one or more other virtual machines.
  • the host device 120 may be a physical host or a virtual host.
  • the management platform 110 and several host devices 120 form a business system.
  • the business system provides several different microservices to the outside world, and each microservice is run by one or more host devices 120 .
  • the management platform 110 can collect indicator data of microservices run by the host device 120.
  • the management platform 110 monitors the health of the microservices based on the collected indicator data of the microservices of various businesses, so that the operation and maintenance staff of the microservices (for convenience of description, called users) can determine the health of the microservices through the management platform and, based on that health, locate the time when a microservice fails.
  • the display method of service health status in related technologies only represents the health status of the service at the time of query, although the health status of the service at each time point in a historical period also affects the health status of the service at the time of query.
  • displaying only the health status of the service at the time of query, without considering the health status of the service at each time point in the historical period, cannot reflect how the health status of the service changed over the entire time period queried by the user, and therefore does not satisfy users' needs for a problem overview and further fault location. As a result, the display method of service health status in the related technology cannot accurately represent the health of the service.
  • the priorities of different health states are determined according to the impact of the health state of the service on the user, and the priorities of the health states are sorted, with the health states with higher priorities being used as representatives, and the identifiers are displayed according to the corresponding color identifiers of the health states.
  • the priority order of the health states is: abnormal state > lossy state > normal state.
  • the identification color corresponding to the abnormal state is red
  • the identification color corresponding to the lossy state is yellow
  • the identification color corresponding to the normal state is green.
  • since the priority of the abnormal state is higher than that of the normal state, the abnormal state is used as the representative state within the query time, and the final identification color is red.
  • the above display method of service health status does not consider the frequency of occurrence of each health state in the past 24 hours but only the priority. In effect, the health status at other time points is omitted, so a large amount of original information is lost, and the health status at the omitted time points does not contribute to the representation of the final health state. From a practical point of view, when the user checks the topology, he finds that a certain service is marked in red; however, when locating the abnormal state, he finds that it occurred only for a moment and then quickly returned to normal. Displaying the service mark in red in this case is obviously unreasonable.
  • the related technology counts the frequency of occurrence of different health states of the service within the query time range, and uses the health state with the highest frequency as the representative state, and displays the color identifier corresponding to the health state with the highest frequency.
  • the identifier color corresponding to the abnormal state is red
  • the identifier color corresponding to the lossy state is yellow
  • the identifier color corresponding to the normal state is green.
  • the normal state will be used as a 24-hour state representative, and the identifier of the service will be displayed as a green identifier.
  • the above method causes abnormal states that do not have a frequency advantage to be directly ignored, and there is a risk of underreporting abnormal states.
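  • the two related-art approaches described above can be contrasted with a minimal sketch. The function names and the sample data are hypothetical and used only for illustration; the priority order and colors are those stated in the text.

```python
from collections import Counter

# Priority order from the text: abnormal > lossy > normal.
PRIORITY = {"normal": 0, "lossy": 1, "abnormal": 2}
COLOR = {"normal": "green", "lossy": "yellow", "abnormal": "red"}


def color_by_priority(states):
    """Related-art approach 1: the highest-priority state represents the range."""
    return COLOR[max(states, key=PRIORITY.__getitem__)]


def color_by_most_frequent(states):
    """Related-art approach 2: the most frequent state represents the range."""
    state, _count = Counter(states).most_common(1)[0]
    return COLOR[state]


# Hypothetical sample: one abnormal minute in 24 hours of otherwise normal data.
samples = ["normal"] * 1439 + ["abnormal"]
print(color_by_priority(samples))       # a single abnormal point turns the mark red
print(color_by_most_frequent(samples))  # the abnormal point is ignored entirely
```

  • with the same input, the first approach over-reports a momentary abnormality and the second under-reports it, which is the gap the weighted approach of this application is intended to close.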
  • embodiments of the present application provide a method, device, equipment and storage medium for displaying the service health status, which can display the status identification of the service based on the impact of the frequency of the original health status appearing within the query time range on the final health status, thereby improving the accuracy of representing the service health status.
  • FIG 2 is a schematic flowchart of a service indicator data collection method provided by an embodiment of the present application.
  • the service indicator data collection method provided by the embodiment of this application is applied to the business system shown in Figure 1.
  • the service indicator data collection method provided by the embodiment of the present application includes S201-S203.
  • S201 The host device collects all indicator data of services run by the host device.
  • when the host device is running the service, it can generate a running log of the service.
  • the running log records all the indicator data of the host device when running the service.
  • the host device can extract the indicator data of the service from the running log.
  • the host device can receive an indicator collection instruction sent by the management platform, and the host device responds to the indicator collection instruction and extracts the indicator data of the service from the operation log.
  • all indicator data of the service can be collected through k8s indicators.
  • the indicator data of the service carries a timestamp, and the timestamp is used to indicate the generation time of the indicator data.
  • S202 The host device sends all indicator data of the service to the management platform.
  • the host device communicates with the management platform to enable data transmission.
  • the host device has multiple ways of sending service indicator data to the management platform.
  • the management platform can use the metric collection method of Prometheus to collect service indicator data.
  • a client can be installed on the host device and linked with the server in the management platform to collect service indicator data.
  • the host device can actively send all indicator data of the service to the management platform.
  • the host device can respond to the indicator collection instruction, collect all indicator data of the service, and send all indicator data of the service to the management platform.
  • the host device can passively send all indicator data of the service to the management platform. In order to save the storage resources of the host device, the host device can actively send all the indicator data of the service to the management platform after extracting all the indicator data of the service.
  • the indicator data of the service can be collected according to a certain collection step length.
  • the collection step length is configured by the user through the management platform, and the management platform can send the collection step length to the host device, so that the host device collects all indicator data of the service according to the collection step length.
  • S203 The management platform stores the indicator data of the service.
  • the management platform can store the indicator data in time sequence and store the indicator data as time series data to facilitate subsequent retrieval in time order.
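  • as an illustration of storing indicator data as time series data for later retrieval in time order, the following is a minimal in-memory sketch. The class IndicatorStore and its methods are hypothetical; an actual management platform would more likely rely on a time series database.

```python
import bisect


class IndicatorStore:
    """Hypothetical in-memory store that keeps indicator data ordered by timestamp."""

    def __init__(self):
        self._timestamps = []  # sorted timestamps, in seconds
        self._values = []      # indicator values, parallel to _timestamps

    def add(self, timestamp, value):
        # Insert at the sorted position so late-arriving data stays in order.
        idx = bisect.bisect_right(self._timestamps, timestamp)
        self._timestamps.insert(idx, timestamp)
        self._values.insert(idx, value)

    def query(self, start, end):
        """Return (timestamp, value) pairs with start <= timestamp <= end."""
        lo = bisect.bisect_left(self._timestamps, start)
        hi = bisect.bisect_right(self._timestamps, end)
        return list(zip(self._timestamps[lo:hi], self._values[lo:hi]))


store = IndicatorStore()
for ts, state in [(120, "normal"), (60, "normal"), (180, "abnormal")]:
    store.add(ts, state)
print(store.query(60, 120))
```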
  • Figure 3 is a schematic flowchart of a method for displaying service health status provided by an embodiment of the present application.
  • the method for displaying service health status provided by the embodiment of the present application is applied to the business system shown in Figure 1.
  • the method for displaying service health status provided by the embodiment of the present application includes S301-S304.
  • S301 The management platform determines a query time range for data query.
  • the query time range refers to the time range of the data that the user needs to query, including the start time and end time of the data query. For example, if the query time range is the past 24 hours and the current time is 6 pm on June 15, 2022, then the query time range is from 6 pm on June 14, 2022 to 6 pm on June 15, 2022.
  • the management device includes a display, and the display is used to display a human-computer interaction interface for data query.
  • the user can input the query time range of the data query through the human-computer interaction interface of the data query.
  • the management device can determine the query time range of the data query after receiving the data query instruction.
  • the user can input data query instructions through a human-computer interaction interface for data query.
  • the human-computer interaction interface 40 of the display displays a data query window 41
  • the data query window 41 displays “data query”, and displays a “confirm” logo 42 and a “cancel” logo 43 .
  • the management platform can generate a data query instruction.
  • the query time range may be determined according to the query duration pre-configured by the management platform. For example, the management platform defaults to the query duration of the past 24 hours. After receiving the data query instruction, the management platform can calculate the query time range according to the current time and the query duration.
  • the management platform can obtain the query time range through a human-computer interaction interface for data query. Users can enter the query time range through the human-computer interaction interface.
  • the management platform can configure multiple query durations. For example, as shown in Figure 5, when the user clicks the "Confirm" logo 52, the human-computer interaction interface 50 displays the query duration window 51.
  • the query duration window 51 displays multiple query durations: the past 24 hours, the past 72 hours, and the past week. A query duration confirmation mark 53 is also displayed for the user to click.
  • the management platform can display various query durations through the human-computer interaction interface of the data query table, and users can select the query duration through the human-computer interaction interface.
  • the management platform determines the query time range of data query based on the query duration selected by the user through the human-computer interaction interface and the current time.
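  • the calculation of the query time range from the current time and a query duration can be sketched as follows. The function name query_time_range is hypothetical; the values reproduce the 24-hour example given under S301.

```python
from datetime import datetime, timedelta


def query_time_range(now, duration):
    """Return (start, end) for a query covering the given duration up to now."""
    return now - duration, now


# Example from the text: the past 24 hours, queried at 6 pm on June 15, 2022.
start, end = query_time_range(datetime(2022, 6, 15, 18, 0), timedelta(hours=24))
print(start, end)
```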
  • S302 The management platform determines the indicator data of the service within the query time range.
  • Services can be services in a variety of business scenarios, such as services in mobile phone payment business scenarios, services in book query scenarios, services in purchase of items scenarios, etc.
  • a service can be a business service or multiple microservices that make up a business service.
  • the indicator data of the service can reflect the health status of the service, so that the management platform can monitor the health of the service based on the indicator data of the service.
  • the management platform stores all indicator data of the service, and the management platform stores all indicator data of the service in chronological order.
  • the management platform can retrieve the indicator data within the query time range, that is, the management platform samples all indicator data of the service to determine the indicator data of the service within the query time range.
  • the indicator data includes the original health status of the service and a timestamp. The management platform can obtain the indicator data of the service within the query time range based on the timestamp in the indicator data.
  • the management platform samples the indicator data of services within the query time range and performs health monitoring.
  • all service indicator data can be sampled according to the query step size to obtain the service indicator data within the query time range.
  • the service indicator data refers to the data sampled from all service indicator data.
  • the query step size may be input by the user through a human-computer interaction interface for data query.
  • the human-computer interaction interface 60 of the data query can display the query step input box 61 and the query step unit 62 (such as minutes, seconds, hours).
  • the user can confirm the data query through the "confirm" mark 63.
  • the management platform generates data query instructions based on the query step size input by the user. In other words, the data query instructions include the query step size.
  • if the collection step size is equal to the query step size, all indicator data of the service within the query time range is presented accurately, with no missed reports.
  • however, if the query time range is large, the amount of data will be very large, causing the interface to slow down.
  • if the collection step size is smaller than the query step size, the advantage is that the amount of data can be controlled.
  • the disadvantage is that some of the originally collected sampling points may be skipped, resulting in missing data.
  • if a skipped sampling point happens to be abnormal, the exception cannot be reported accurately, resulting in a poor user experience.
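  • the trade-off between the two step-size choices can be seen in a small sketch. The function sample and the data are hypothetical, and the sketch assumes the query step is a whole multiple of the collection step.

```python
def sample(points, collection_step, query_step):
    """Keep every (query_step // collection_step)-th collected point."""
    stride = query_step // collection_step
    return points[::stride]


# Hypothetical data: one point per minute for an hour, abnormal only at minute 7.
collected = ["normal"] * 60
collected[7] = "abnormal"

full = sample(collected, collection_step=1, query_step=1)
coarse = sample(collected, collection_step=1, query_step=5)

print(len(full), "abnormal" in full)      # every point kept, abnormality visible
print(len(coarse), "abnormal" in coarse)  # fewer points, abnormality skipped
```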
  • the management platform can adaptively determine the query step size.
  • the management platform determines the query step length according to the collection step length, a preset time length threshold and a query time range.
  • the time period that users can query is not infinite. Therefore, you can limit the upper limit of query points by configuring the time length threshold.
  • the upper limit of the number of query points determines the resolution. For example, if the user wants a single abnormal point to be reported in red and the upper limit of the number of query points is 600, then 1/600 is used as the minimum resolution when calculating the weight (see the subsequent description for details).
  • the query step size is equal to the collection step size.
  • the query step satisfies the following formula (1):
  • querytime represents the query time range
  • T represents the preset time length threshold
  • scrape_interval represents the collection step. It should be noted that the units of all parameters in the formula are the same. For example, all parameters in the formula are in minutes. The specific parameters can be set as needed and are not limited here.
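  • formula (1) itself is not reproduced in this text. The following sketch is therefore only one hypothetical rule that is consistent with the surrounding description (the query step equals the collection step for short ranges, and the number of returned points is capped by the time length threshold); it should not be read as the formula of this application. The parameter names querytime, T and scrape_interval follow the definitions above, and all values share the same unit.

```python
import math


def query_step(querytime, T, scrape_interval):
    """Hypothetical adaptive rule, not formula (1) of the application.

    The step grows in whole multiples of the collection step so that at most
    T / scrape_interval points are returned for any query time range.
    """
    if querytime <= T:
        # Short ranges: query at the collection resolution, so nothing is skipped.
        return scrape_interval
    return math.ceil(querytime / T) * scrape_interval


# All values in minutes; T = 1440 is an assumed threshold for illustration.
print(query_step(1440, 1440, 1))  # past 24 hours
print(query_step(4320, 1440, 1))  # past 72 hours
```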
  • S303 Determine, based on the indicator data of the service within the query time range, the frequency at which each of the multiple original health states of the service occurs within the query time range.
  • the raw health status of a service is used to indicate the status of the service while it is running.
  • Users can customize the division and number of original health states according to the monitored objects and monitoring needs.
  • the original health state of a service can be divided into lossy state, abnormal state and normal state.
  • the lossy state means that there is an unhealthy state during the running of the service, but the service can run normally and the user cannot detect it;
  • the abnormal state means that there is an unhealthy state during the running of the service and the service cannot be provided normally, which has a great impact on the user.
  • the original health status of a service may differ at different moments within the query time range. The indicator data of the service at different moments within the query time range is analyzed statistically to determine the original health status of the service at each moment. Then, the frequency of occurrence of each original health state within the query time range is counted.
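  • counting the frequency of each original health state can be sketched as follows, assuming each sampled indicator record is a (timestamp, state) pair. The record layout and the function name state_frequencies are assumptions made for illustration.

```python
from collections import Counter


def state_frequencies(samples):
    """Count how many sampled points fall into each original health state."""
    return Counter(state for _timestamp, state in samples)


# Hypothetical samples within the query time range (timestamps in seconds).
samples = [(0, "normal"), (60, "lossy"), (120, "normal"), (180, "abnormal")]
freq = state_frequencies(samples)
print(freq)
```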
  • S304 Display the status identifier of the service according to the weights corresponding to the multiple original health states of the service and the frequency of occurrence of each of the multiple original health states within the query time, so that the user can locate faults according to the status identifier of the service.
  • the status identifier is used to indicate the final health state of the service.
  • the weight is used to represent the degree of impact of the frequency of the original health state on the final health state.
  • the final health status can be divided by users according to their own needs. Since the frequency of occurrence of different original health states has different effects on different users, the user sets the final health state according to the frequency of occurrence of different original health states.
  • the final health status that users care about falls into the following four types: the original health status of the service is always normal within the query time range (for convenience of description, expressed as "Healthy" below); a lossy state has occurred in the original health status of the service within the query time range (expressed as "Degraded" below); an abnormal state has occurred in the original health status of the service within the query time range (expressed as "Failure" below) and the frequency of the abnormal state is no more than 10%; and an abnormal state has occurred in the original health status of the service within the query time range ("Failure") and the frequency of the abnormal state is higher than 10%.
  • the weight of the original health state is used to represent the degree of influence of the frequency of the original health state within the query time range on the final health state.
  • the management platform calculates the status score of the service within the query time range based on the weight of each original health state and the frequency of that original health state within the query time range, and then displays the status identifier corresponding to the status score of the service.
  • s_i represents the i-th original health state of the service
  • F(s_i) represents the frequency of the i-th original health state within the query time range
  • W(s_i) represents the weight of the i-th original health state.
  • i is a positive integer.
  • the management platform stores the correspondence between the status identifier and the status score range.
  • status identifier 1 is used to indicate that the original health status of the service is normal (Healthy) throughout the query time range
  • status identifier 2 is used to indicate that a lossy state (Degraded) occurred in the original health status of the service within the query time range.
  • the status score range corresponding to status identifier 2 is 1 < score ≤ 10.
  • status identifier 3 is used to indicate that an abnormal state (Failure) occurred in the original health status of the service within the query time range and the frequency of the abnormal state is not higher than 10%
  • the status score range corresponding to status identifier 3 is 10 < score ≤ 1440.
  • status identifier 4 is used to indicate that an abnormal state (Failure) occurred in the original health status of the service within the query time range and the frequency of the abnormal state is higher than 10%.
  • the status score range corresponding to status identifier 4 is 1440 < score.
  • the user selects a query time range of 24 hours, that is, 1440 minutes: one minute has an abnormal state (Failure), and the rest are normal states (Healthy).
  • the user selects a query time range of 1 hour, that is, 60 minutes.
  • the status identifier can be a color identifier or a shape identifier. No limitation is made here.
  • the weight of the original health state and the state score range can be configured by the user as needed.
  • the management platform can calculate the corresponding weights of multiple original health states through a weight determination model. Specifically, according to the preconfigured weight determination model and the preset time length threshold, the respective corresponding weights and state score ranges of the multiple original health states are determined.
  • the weight determination model is used to indicate the correspondence between the frequency of the original health state of the service and the final health state of the service.
  • the weight determination model can be viewed as a multi-variable set of inequalities. Users can configure the corresponding relationship between the status score range and the frequency range of the original health status according to their own needs.
  • the state score range has 4 segments, namely (0, S1], (S1, S2], (S2, S3], (S3, ∞), where a larger score indicates a more dangerous state.
  • the time length threshold of 24 hours determines the minimum resolution, that is, the minimum resolution is 1/1440.
  • the correspondence between the frequency of the original health state of the service and the final health state of the service is shown in Table 1:
  • W 1 is the weight corresponding to the normal state (Healthy)
  • W 2 is the weight corresponding to the lossy state (Degraded)
  • W 3 is the weight corresponding to the abnormal state (Failure).
  • the state score is strictly greater than 10.
  • the state score is greater than 1.
  • a business may include multiple services.
  • the book purchasing business includes services: book title search service 11, book display service 12, book evaluation service 13, and order payment service 14.
  • the correlation between the book title search service 11, the book display service 12, the book evaluation service 13, and the order payment service 14 is shown in Figure 7.
  • the book title search service 11, the book display service 12 and the book evaluation service 13 are all in the normal state within 24 hours
  • the status identifiers of the book title search service 11, the book display service 12 and the book evaluation service 13 are all triangle identifiers.
  • the order payment service 14 is in an abnormal state within 24 hours
  • the status identifier of the order payment service 14 is a square identifier.
  • the management platform can display the final health status of multiple services under the same business through a human-computer interaction interface. Furthermore, the management platform can display the final health status of respective services under multiple businesses.
  • users can view the original health status of the service at various moments through the status identifier. Specifically, a user's operation on the status identifier of the target service is received; according to the operation, the original health status of the target service at multiple moments within the query time range is displayed.
  • the status identifiers of the book title search service 11, the book display service 12 and the book evaluation service 13 are all triangle identifiers, while the status identifier of the order payment service 14 is a square identifier.
  • the user can operate the status identifier of the order payment service 14 through the human-computer interaction interface, and the management platform then displays the original health status of the order payment service 14 at each sampling moment in the past 24 hours.
  • embodiments of the present application provide a display device for service health status.
  • FIG. 9 is a schematic structural diagram of a service health status display device provided by an embodiment of the present application.
  • the service health status display device provided by the embodiment of the present application can be applied to the management platform as shown in Figure 1.
  • the display device for service health status provided by the embodiment of the present application includes a first determination module 901, a second determination module 902, a third determination module 903 and a display module 904.
  • the first determination module 901 is used to determine the query time range of data query
  • the second determination module 902 is used to determine the indicator data of the service within the query time range.
  • the indicator data of the service is used to reflect the original health status of the service;
  • the third determination module 903 is configured to determine the frequency of each of the multiple original health states of the service within the query time range based on the indicator data of the service within the query time range;
  • the display module 904 is configured to display the status identifier of the service according to the weights corresponding to the multiple original health states of the service and the frequency of occurrence of each of the multiple original health states within the query time range, so that users can locate faults based on the status identifier of the service.
  • the status identifier is used to indicate the final health status of the service.
  • the weight is used to represent the degree of impact of the frequency of the original health status on the final health status.
  • the embodiment of this application displays the status identifier of the service by combining the weight of the original health state and the frequency of the original health state within the query time range, so that the status identifier of the service reflects the degree of influence of the frequency of the original health state on the final health state. This improves the accuracy with which the health status of the service is represented, thereby improving the accuracy of service fault location.
  • the service health status display device also includes:
  • the fourth determination module is used to determine the corresponding weights of the multiple original health states according to the preconfigured weight determination model and the preset time length threshold;
  • the weight determination model is used to indicate the corresponding relationship between the frequency of multiple original health states of the service and the final health state of the service.
  • the weight is determined by the weight determination model, so that the weight of the original health state includes the influence of the frequency of occurrence of the original health state on the final health state.
  • the fourth determination module is also used to determine the state score ranges corresponding to the multiple final health states according to the weights corresponding to the multiple original health states and the preset time length threshold;
  • a calculation module configured to calculate the status score corresponding to the service based on the corresponding weights of the multiple original health states of the service and the frequency of each of the multiple original health states appearing within the query time;
  • the display module is configured to display the status identification of the service according to the status score corresponding to the service and the score ranges corresponding to the multiple final health states.
  • the first determination module is also used to determine the query step size of the data query
  • the second determination module is configured to collect indicator data of the service from all indicator data of the service according to the query step size and the query time range.
  • the data amount of the service indicator data is not greater than the target amount
  • the target number is determined according to a preconfigured collection step, and the collection step is used to indicate the length of the time interval between the indicator data of the service.
  • the first determination module is used to determine the collection step, where the collection step is used to indicate the length of the time interval between all indicator data of the service, and to determine the query step size according to the collection step, the preset time length threshold and the query time range.
  • the query step size is determined adaptively through the collection step size, thereby avoiding missed reports of service failures, thus balancing computing resources and the accuracy of health monitoring.
  • the status identifier includes a color identifier or a shape identifier.
  • the device further includes:
  • the receiving module is used to receive the user's operation on the status identifier of the target service
  • the display module is also configured to display the original health status of the target service at multiple moments within the query time range according to the operation.
  • the device embodiment described in Figure 9 is only illustrative.
  • the division of modules is only a logical function division. In actual implementation, there may be other division methods.
  • multiple modules or components may be combined or can be integrated into another system, or some features can be ignored, or not implemented.
  • Each functional module in each embodiment of the present application can be integrated into one processing module, or each module can exist physically alone, or two or more modules can be integrated into one module.
  • the above-mentioned modules in Figure 9 can be implemented in the form of hardware, software functional units, or a combination of software and hardware.
  • Figure 10 shows a schematic structural diagram of a computing device provided by an embodiment of the present application.
  • the computing device may be a server or the like.
  • the management platform in Figure 1 includes at least one computing device.
  • the computing device includes: a processor 1001, a memory 1002, and a communication interface 1003.
  • the processor 1001, the memory 1002 and the communication interface 1003 are connected through a bus 1010.
  • Memory 1002 includes operating system and program code modules.
  • Memory 1002 may include bulk storage for data or instructions.
  • the memory 1002 may include a hard disk drive (HDD), a floppy disk drive, flash memory, an optical disk, a magneto-optical disk, a magnetic tape, or a Universal Serial Bus (USB) drive, or a combination of two or more of the above.
  • Memory 1002 may include removable or non-removable (or fixed) media, where appropriate.
  • the memory 1002 may be internal or external to the integrated gateway disaster recovery device.
  • memory 1002 is non-volatile solid-state memory.
  • the memory may include read-only memory (ROM), random access memory (RAM), magnetic disk storage media devices, optical storage media devices, flash memory devices, electrical, optical or other physical/tangible memory storage devices.
  • typically, the memory includes one or more tangible (non-transitory) computer-readable storage media (e.g., memory devices) encoded with software including computer-executable instructions, and when the software is executed (e.g., by one or more processors), it is operable to perform the operations described with reference to the methods in the present application.
  • the processor 1001 reads and executes the computer program instructions stored in the memory 1002 to implement any of the service health status display methods in the above embodiments.
  • the electronic device may also include a communication interface 1003 and a bus 1010. Among them, as shown in Figure 10, the processor 1001, the memory 1002, and the communication interface 1003 are connected through the bus 1010 and complete communication with each other.
  • the communication interface 1003 is mainly used to implement communication between modules, devices, units and/or equipment in the embodiments of this application.
  • Bus 1010 includes hardware, software, or both, coupling components of an electronic device to one another.
  • the bus may include an Accelerated Graphics Port (AGP) or other graphics bus, an Enhanced Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), a HyperTransport (HT) interconnect, an Industry Standard Architecture (ISA) bus, an InfiniBand interconnect, a Low Pin Count (LPC) bus, a memory bus, a Micro Channel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI-Express (PCIe) bus, a Serial Advanced Technology Attachment (SATA) bus, a Video Electronics Standards Association Local Bus (VLB), or another suitable bus, or a combination of two or more of these.
  • bus 1010 may include one or more buses.
  • the processor in the embodiments of the present application can be a central processing unit (CPU), or another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA) or another programmable logic device, a transistor logic device, a hardware component, or any combination thereof.
  • a general-purpose processor can be a microprocessor or any conventional processor.
  • the method steps in the embodiments of the present application can be implemented by hardware or by a processor executing software instructions.
  • Software instructions can be composed of corresponding software modules, and the software modules can be stored in random access memory (RAM), flash memory, read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), a register, a hard disk, a removable hard disk, a CD-ROM, or any other form of storage medium well known in the art.
  • An exemplary storage medium is coupled to the processor such that the processor can read information from the storage medium and write information to the storage medium.
  • the storage medium can also be an integral part of the processor.
  • the processor and storage media may be located in an ASIC.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device.
  • the computer instructions may be stored in or transmitted over a computer-readable storage medium.
  • the computer instructions may be transmitted from one website, computer, server or data center to another website, computer, server or data center through wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, radio, microwave, etc.) means.
  • the computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device, such as a server or data center, that integrates one or more available media.
  • the available media may be magnetic media (e.g., floppy disk, hard disk, magnetic tape), optical media (e.g., DVD), or semiconductor media (e.g., solid state disk (SSD)), etc.

Abstract

The present application provides a service health status display method and apparatus, and a device and a storage medium. In embodiments, the method comprises: determining a query time range of a data query; determining indicator data of a service within the query time range, the indicator data of the service being used for reflecting original health statuses of the service; determining, according to the indicator data of the service within the query time range, the frequency of occurrence of each of multiple original health statuses of the service within the query time range; and displaying a status identifier of the service according to the weight corresponding to each of the multiple original health statuses of the service and the frequency of occurrence of the service in the multiple original health statuses within the query time, the status identifier being used for indicating a final health status of the service, and the weight being used for representing the degree of influence of the frequency of occurrence of the original health status on the final health status. In this way, by means of the technical solution provided by the embodiments of the present application, the accuracy of representing the health status of a service can be improved, thereby improving the accuracy of service fault location.

Description

Service health status display method, device, equipment and storage medium
This application claims priority to Chinese Patent Application No. 202211138376.X, filed with the China National Intellectual Property Administration on September 19, 2022 and entitled "Service health status display method, device, equipment and storage medium", the entire contents of which are incorporated herein by reference.
Technical field
The present application relates to the field of computer technology, and in particular to a method, device, equipment and storage medium for displaying service health status.
Background
The rapid development of computer technology has brought stronger processing power, more storage space and faster networks. This allows applications to serve more users and exposes them to greater load pressure. As the internal logic of applications grows more complex, their development and their operation and maintenance face more and more challenges. A business is usually completed by multiple services that communicate with each other, so operation and maintenance personnel need to monitor the health status of each service.
Currently, the way service health status is displayed in related technologies represents only the health status at the time of the query, which does not meet the needs of operation and maintenance personnel for a problem overview and further fault location.
Therefore, the way service health status is displayed in related technologies cannot accurately represent the health status of a service, so operation and maintenance personnel cannot accurately locate service faults.
Summary of the invention
Embodiments of the present application provide a method, device, equipment and storage medium for displaying service health status, which can display the status identifier of a service based on the influence that the frequency of each original health state within the query time range has on the final health state, thereby improving the accuracy with which the health status of the service is represented.
In the first aspect, embodiments of the present application provide a method for displaying service health status, including:
determining the query time range of a data query;
determining the indicator data of the service within the query time range, where the indicator data of the service is used to reflect the original health status of the service;
determining, based on the indicator data of the service within the query time range, the frequency with which each of the multiple original health states of the service occurs within the query time range;
displaying the status identifier of the service according to the weights corresponding to the multiple original health states of the service and the frequency of occurrence of each of the multiple original health states within the query time range, so that users can locate faults based on the status identifier of the service, where the status identifier is used to indicate the final health state of the service, and the weight is used to indicate the degree of influence of the frequency of occurrence of an original health state on the final health state.
By combining the weight of each original health state with the frequency of that state within the query time range, the embodiments of the present application display a status identifier that reflects the degree of influence of the frequency of the original health state on the final health state. This improves the accuracy with which the health status of the service is represented, thereby improving the accuracy of service fault location.
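The frequency-counting step of the first aspect can be sketched roughly as follows. This is a minimal illustration, not the implementation of the application: the error-rate indicator, the thresholds inside `classify`, and all function names are assumptions; only the three original health states (Healthy, Degraded, Failure) come from the embodiments.

```python
from collections import Counter
from fractions import Fraction

STATES = ("Healthy", "Degraded", "Failure")

def classify(error_rate):
    """Map one indicator sample to an original health state.

    The thresholds are hypothetical; the application does not fix them.
    """
    if error_rate >= 0.5:
        return "Failure"
    if error_rate >= 0.05:
        return "Degraded"
    return "Healthy"

def state_frequencies(samples):
    """Return, for each original health state, the fraction of samples in it."""
    if not samples:
        raise ValueError("no indicator data in the query time range")
    counts = Counter(classify(s) for s in samples)
    total = len(samples)
    # Counter returns 0 for states that never occurred.
    return {state: Fraction(counts[state], total) for state in STATES}

# One sample per minute over 24 hours: one failing minute, the rest healthy.
samples = [0.0] * 1439 + [0.9]
freqs = state_frequencies(samples)
```

Exact fractions are used instead of floats so that later comparisons against score boundaries are not affected by rounding.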
In a possible implementation, the method for displaying service health status further includes:
determining the weights corresponding to the multiple original health states according to a preconfigured weight determination model and a preset time length threshold;
where the weight determination model is used to indicate the correspondence between the frequencies of the multiple original health states of the service and the final health state of the service.
In this way, the weights are determined through the weight determination model, so that the weight of an original health state reflects the degree of influence of the frequency of that state on the final health state.
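One way to read a weight determination model is as a set of inequalities that candidate weights must satisfy. The sketch below assumes the status score is the frequency-weighted sum of the weights, that frequencies are fractions of the query time range, and that the score boundaries are 1, 10 and 1440 with a minimum resolution of 1/1440 and a 10% failure limit, as in the example embodiment. The specific inequalities and the function name are assumptions derived from the four final health states described in the embodiments, not the application's own table.

```python
from fractions import Fraction

def weights_are_consistent(w_healthy, w_degraded, w_failure,
                           s1=1, s2=10, s3=1440,
                           slots=1440, failure_limit=Fraction(1, 10)):
    """Check hypothetical constraints linking weights to score ranges."""
    r = Fraction(1, slots)  # minimum resolution: one slot out of `slots`
    checks = [
        # All slots Healthy: the score stays in (0, S1].
        0 < w_healthy <= s1,
        # A single Degraded slot must already push the score above S1.
        r * w_degraded + (1 - r) * w_healthy > s1,
        # Every slot Degraded must still stay within (S1, S2].
        w_degraded <= s2,
        # A single Failure slot must push the score above S2.
        r * w_failure + (1 - r) * w_healthy > s2,
        # Failure at exactly the limit (10%) must not exceed S3.
        failure_limit * w_failure + (1 - failure_limit) * w_healthy <= s3,
        # One more Failure slot beyond the limit must exceed S3.
        (failure_limit + r) * w_failure
        + (1 - failure_limit - r) * w_healthy > s3,
    ]
    return all(checks)

ok = weights_are_consistent(1, 10, 14391)
too_heavy = weights_are_consistent(1, 10, 14400)
```

Under these assumed constraints, the weights (1, 10, 14391) pass, while raising the Failure weight to 14400 fails the 10% boundary check.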
In a possible implementation, the method for displaying service health status further includes:
determining the state score ranges corresponding to the multiple final health states according to the weights corresponding to the multiple original health states and the preset time length threshold;
and displaying the status identifier of the service according to the weights corresponding to the multiple original health states of the service and the frequency of occurrence of each of the multiple original health states within the query time range includes:
calculating the status score corresponding to the service based on the weights corresponding to the multiple original health states of the service and the frequency of occurrence of each of the multiple original health states within the query time range;
displaying the status identifier of the service based on the status score corresponding to the service and the score ranges corresponding to the multiple final health states.
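The two steps above (compute a status score, then map it to a status identifier) can be sketched as follows. The sketch assumes the score is the sum over original health states of frequency times weight, uses the boundaries 1, 10 and 1440 from the example embodiment with right-closed ranges, and picks illustrative weights; the weights, the helper names and the inclusive upper bounds are assumptions.

```python
from fractions import Fraction

# Hypothetical weights; the application leaves them configurable.
WEIGHTS = {"Healthy": 1, "Degraded": 10, "Failure": 14391}
# Upper bounds of the first three score ranges in the example embodiment.
UPPER_BOUNDS = (1, 10, 1440)

def status_score(freqs):
    """Weighted sum of the frequencies of the original health states."""
    return sum(Fraction(f) * WEIGHTS[state] for state, f in freqs.items())

def status_identifier(score):
    """Return 1..4 according to which score range contains the score."""
    for identifier, upper in enumerate(UPPER_BOUNDS, start=1):
        if score <= upper:
            return identifier
    return len(UPPER_BOUNDS) + 1

def freqs_from_minutes(healthy, degraded, failure):
    """Build frequencies from per-state minute counts (illustrative helper)."""
    total = healthy + degraded + failure
    return {
        "Healthy": Fraction(healthy, total),
        "Degraded": Fraction(degraded, total),
        "Failure": Fraction(failure, total),
    }

# 24-hour query range, one abnormal minute, the rest normal.
one_failure = status_identifier(status_score(freqs_from_minutes(1439, 0, 1)))
```

With these assumed weights, a single abnormal minute in 24 hours already moves the service out of the Healthy and Degraded ranges, while the fourth identifier is reached only once abnormal minutes exceed 10% of the range.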
In a possible implementation, the method for displaying service health status further includes:
determining the query step size of the data query;
and determining the indicator data of the service within the query time range includes:
collecting the indicator data of the service from all indicator data of the service according to the query step size and the query time range.
In this way, because the volume of indicator data of a service is large, all indicator data of the service can be sampled according to the query step size, thereby balancing computing resources against the accuracy of health monitoring.
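Sampling by query step size can be sketched as below. The in-memory dictionary keyed by timestamp, the use of seconds as the time unit and the function name are assumptions for illustration; a real management platform would typically query a time-series store instead.

```python
def sample_indicator_data(all_points, start, end, query_step):
    """Pick the stored points at start, start + step, ... within [start, end).

    all_points maps a timestamp in seconds to an indicator value.
    """
    if query_step <= 0:
        raise ValueError("query_step must be positive")
    return [(t, all_points[t])
            for t in range(start, end, query_step)
            if t in all_points]

# Hypothetical store: one point every 60 s for one hour.
all_points = {t: 0.0 for t in range(0, 3600, 60)}

# Query the hour with a 300 s step instead of reading every stored point.
sampled = sample_indicator_data(all_points, 0, 3600, 300)
```

A larger query step reads fewer points per query, which is the trade-off between computing resources and monitoring accuracy mentioned above.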
In a possible implementation, the data volume of the indicator data of the service is not greater than a target number;
where the target number is determined according to a preconfigured collection step, and the collection step is used to indicate the length of the time interval between the indicator data of the service.
In a possible implementation, determining the query step size of the data query includes:
determining the collection step, where the collection step is used to indicate the length of the time interval between all indicator data of the service;
determining the query step size according to the collection step, the preset time length threshold and the query time range.
In this way, the query step size is determined adaptively from the collection step, which avoids missed reports of service failures and thereby balances computing resources against the accuracy of health monitoring.
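The application does not give a formula for the adaptive query step size, so the following is only one plausible rule, stated as an assumption: the target number of points is the time length threshold divided by the collection step, and the query step is the smallest multiple of the collection step that keeps the number of sampled points at or below that target. All names and the time unit (seconds) are hypothetical.

```python
def target_count(collection_step, time_threshold):
    """Assumed target number: points collected within the time threshold."""
    if collection_step <= 0 or time_threshold < collection_step:
        raise ValueError("time_threshold must cover at least one collection step")
    return time_threshold // collection_step

def query_step(collection_step, time_threshold, query_range):
    """Smallest multiple of the collection step that respects the target."""
    limit = target_count(collection_step, time_threshold)
    max_span = limit * collection_step           # range coverable at full resolution
    multiple = max(1, -(-query_range // max_span))  # ceiling division
    return multiple * collection_step

# Collection every 60 s, 24-hour threshold (86400 s).
step_1h = query_step(60, 86400, 3600)        # short range: full resolution
step_7d = query_step(60, 86400, 7 * 86400)   # long range: coarser step
```

Under this rule, ranges up to the threshold are queried at the collection step itself, so no collected sample is skipped there; longer ranges use a proportionally coarser step.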
In a possible implementation, the status identifier includes a color identifier or a shape identifier.
In a possible implementation, there are multiple services.
In a possible implementation, the method for displaying service health status further includes:
receiving a user's operation on the status identifier of a target service;
displaying, according to the operation, the original health status of the target service at multiple moments within the query time range.
This makes it convenient for users to view the original health status of the service at multiple moments.
In the second aspect, embodiments of the present application provide a display device for service health status, including:
a first determination module, used to determine the query time range of a data query;
a second determination module, used to determine the indicator data of the service within the query time range, where the indicator data of the service is used to reflect the original health status of the service;
a third determination module, used to determine, based on the indicator data of the service within the query time range, the frequency with which each of the multiple original health states of the service occurs within the query time range;
a display module, used to display the status identifier of the service according to the weights corresponding to the multiple original health states of the service and the frequency of occurrence of each of the multiple original health states within the query time range, so that users can locate faults based on the status identifier of the service, where the status identifier is used to indicate the final health state of the service, and the weight is used to indicate the degree of influence of the frequency of occurrence of an original health state on the final health state.
By combining the weight of each original health state with the frequency of that state within the query time range, the embodiments of the present application display a status identifier that reflects the degree of influence of the frequency of the original health state on the final health state. This improves the accuracy with which the health status of the service is represented, thereby improving the accuracy of service fault location.
In a possible implementation, the service health status display device further includes:
a fourth determination module, used to determine the weights corresponding to the multiple original health states according to a preconfigured weight determination model and a preset time length threshold;
where the weight determination model is used to indicate the correspondence between the frequencies of the multiple original health states of the service and the final health state of the service.
In this way, the weights are determined through the weight determination model, so that the weight of an original health state reflects the degree of influence of the frequency of that state on the final health state.
In a possible implementation, the fourth determination module is also used to determine the state score ranges corresponding to the multiple final health states according to the weights corresponding to the multiple original health states and the preset time length threshold;
a calculation module is used to calculate the status score corresponding to the service based on the weights corresponding to the multiple original health states of the service and the frequency of occurrence of each of the multiple original health states within the query time range;
the display module is used to display the status identifier of the service according to the status score corresponding to the service and the score ranges corresponding to the multiple final health states.
In a possible implementation, the first determination module is also used to determine the query step size of the data query;
the second determination module is used to collect the indicator data of the service from all indicator data of the service according to the query step size and the query time range.
In this way, because the volume of indicator data of a service is large, all indicator data of the service can be sampled according to the query step size, thereby balancing computing resources against the accuracy of health monitoring.
In a possible implementation, the amount of indicator data of the service is not greater than a target quantity;
where the target quantity is determined according to a pre-configured collection step size, and the collection step size indicates the length of the time interval between items of indicator data of the service.
In a possible implementation, the first determination module is configured to determine the collection step size, where the collection step size indicates the length of the time interval between all items of indicator data of the service, and to determine the query step size according to the collection step size, the preset time length threshold and the query time range.
In this way, the query step size is determined adaptively from the collection step size, which avoids missed reports of service faults while balancing computing resources against the accuracy of health monitoring.
In a possible implementation, the status identifier includes a color identifier or a shape identifier.
In a possible implementation, there are multiple services.
In a possible implementation, the apparatus further includes:
a receiving module, configured to receive a user operation on the status identifier of a target service;
the display module is further configured to display, according to the operation, the original health states of the target service at multiple moments within the query time range.
In this way, users can conveniently view the original health states of the service at multiple moments.
In a third aspect, embodiments of this application provide a service health status display apparatus, including: at least one memory, configured to store a program; and at least one processor, configured to execute the program stored in the memory. When the program stored in the memory is executed, the processor performs the method provided in the first aspect.
In a fourth aspect, embodiments of this application provide a service health status display apparatus, characterized in that the apparatus runs computer program instructions to perform the method provided in the second aspect. For example, the apparatus may be a chip or a processor.
In one example, the apparatus may include a processor, which may be coupled to a memory, read instructions in the memory, and perform the method provided in the first aspect according to the instructions. The memory may be integrated into the chip or processor, or may be separate from the chip or processor.
In a fifth aspect, embodiments of this application provide a computer storage medium. The computer storage medium stores instructions that, when run on a computer, cause the computer to perform the method provided in the first aspect.
In a sixth aspect, embodiments of this application provide a computer program product containing instructions that, when run on a computer, cause the computer to perform the method provided in the first aspect.
Description of drawings
Figure 1 is a system architecture diagram of a business system provided by an embodiment of this application;
Figure 2 is a schematic flowchart of a method for collecting indicator data of a service provided by an embodiment of this application;
Figure 3 is a schematic flowchart of a method for displaying service health status provided by an embodiment of this application;
Figure 4 is a schematic diagram of a human-computer interaction interface provided by an embodiment of this application;
Figure 5 is a schematic diagram of another human-computer interaction interface provided by an embodiment of this application;
Figure 6 is a schematic diagram of yet another human-computer interaction interface provided by an embodiment of this application;
Figure 7 is a schematic diagram of a service health status display interface provided by an embodiment of this application;
Figure 8 is a schematic diagram of another service health status display interface provided by an embodiment of this application;
Figure 9 is a schematic structural diagram of a service health status display apparatus provided by an embodiment of this application;
Figure 10 is a schematic structural diagram of a computing device provided by an embodiment of this application.
Detailed description
To make the purpose, technical solutions and advantages of the embodiments of this application clearer, the technical solutions in the embodiments of this application are described below with reference to the accompanying drawings.
In the description of the embodiments of this application, words such as "exemplary", "such as" or "for example" are used to indicate an example, illustration or explanation. Any embodiment or design described as "exemplary", "such as" or "for example" in the embodiments of this application should not be construed as preferred or advantageous over other embodiments or designs. Rather, these words are intended to present the relevant concepts in a concrete manner.
In the description of the embodiments of this application, the term "and/or" merely describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" can mean: A exists alone, B exists alone, or both A and B exist. In addition, unless otherwise stated, the term "multiple" means two or more. For example, multiple systems means two or more systems, and multiple terminals means two or more terminals.
In addition, the terms "first" and "second" are used for descriptive purposes only and should not be understood as indicating or implying relative importance or implicitly specifying the technical features indicated. A feature qualified by "first" or "second" may therefore explicitly or implicitly include one or more of that feature. The terms "including", "comprising", "having" and their variations all mean "including but not limited to", unless otherwise specifically emphasized.
Figure 1 is an architecture diagram of a business system involved in the embodiments of this application. The business system includes the following network devices: a management platform 110 and several host devices 120.
The management platform 110 may be a single computing device, a service cluster composed of multiple computing devices, a cloud computing center, or a super terminal.
In one example, the computing device involved in this solution can be used to provide cloud services. It can establish communication connections with several host devices and thereby provide computing functions and/or storage functions for the host devices.
The management platform involved in the embodiments of this application may be a hardware device or may be deployed in a virtualized environment. For example, the management platform involved in this solution may be a virtual machine running on a hardware device that hosts one or more other virtual machines.
The host device 120 may be a physical host or a virtual host.
In the embodiments of this application, the management platform 110 and several host devices 120 form a business system. The business system provides microservices for several different businesses, and each microservice is run by one or more host devices 120. The management platform 110 can collect indicator data of the microservices run by the host devices 120. The management platform 110 monitors the health of the microservices based on the collected indicator data of the microservices of the various businesses, so that the operation and maintenance staff of the microservices (referred to as users for convenience of description) can determine the health of a microservice through the management platform and, based on that health, locate the time at which the microservice failed.
Currently, the way service health status is displayed in related technologies represents only the health status of the service at the moment of the query. However, the health status of the service at each time point within a historical period influences its health status at the moment of the query. Displaying only the health status at the moment of the query ignores the influence of the health status at each time point in the historical period, and it cannot reflect how the health status of the service changed over the entire period queried by the user. This does not meet the user's need for a problem overview and further fault location. Therefore, the way service health status is displayed in related technologies cannot accurately represent the health of the service.
During technical research, the inventors of this application found that, in one related technique, priorities are assigned to the different health states according to how severely each state affects users, the health states are ranked by priority, and the higher-priority health state is taken as the representative, whose color identifier is then displayed. For example, the priority order of the health states is: abnormal state > lossy state > normal state. The abnormal state is marked in red, the lossy state in yellow, and the normal state in green. Suppose the service topology of the past 24 hours is queried and the service was in the normal state for 23 hours, 59 minutes and 59 seconds, with an abnormal state occurring for only 1 second. Because the abnormal state has a higher priority than the normal state, the abnormal state is taken as the representative state for the query time, and the identifier is finally displayed in red. However, this display method considers only priority and ignores how frequently each health state occurred in the past 24 hours. It therefore omits the health states at all other time points and, in essence, discards a large amount of original information: the health states at the omitted time points contribute nothing to the representation of the final health state. In practice, a user viewing the topology finds a service marked in red, but on investigating the abnormal state discovers that it occurred only for an instant and the service quickly returned to normal. Marking the service in red is clearly unreasonable in this case.
Alternatively, a related technique counts how frequently the different health states of the service occur within the query time range, takes the most frequent health state as the representative state, and displays the color identifier corresponding to that state. The abnormal state is marked in red, the lossy state in yellow, and the normal state in green. For example, suppose the service topology of a 24-hour period is queried, and a service was in the abnormal state for 10 of the 24 hours and in the normal state for the remaining 14 hours. Based on the frequency of the different health states, the normal state is taken as the representative for the 24 hours, and the service is marked in green. However, with this approach an abnormal state that is not the most frequent is ignored outright, so there is a risk that abnormal states go unreported.
In summary, related technologies cannot accurately represent the health of a service.
Based on this, the embodiments of this application provide a method, apparatus, device and storage medium for displaying service health status, which can display the status identifier of a service based on the influence that the frequency of each original health state within the query time range has on the final health state, thereby improving the accuracy with which the health of the service is represented.
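The two related-art strategies described above, and the way each one fails, can be illustrated with a short sketch. The state names, the sample lists and the function names `by_priority` and `by_frequency` are hypothetical and serve only to reproduce the two examples from the text.

```python
from collections import Counter
from typing import List

# Priority order from the text: abnormal > lossy > normal.
PRIORITY = {"normal": 0, "lossy": 1, "abnormal": 2}


def by_priority(states: List[str]) -> str:
    """Related-art strategy 1: the highest-priority state represents the range."""
    return max(states, key=PRIORITY.__getitem__)


def by_frequency(states: List[str]) -> str:
    """Related-art strategy 2: the most frequent state represents the range."""
    return Counter(states).most_common(1)[0][0]


# 24 hours sampled once per second, with a single abnormal second.
one_blip = ["normal"] * 86399 + ["abnormal"]
# 24 hours sampled once per hour: 10 abnormal hours, 14 normal hours.
ten_hours = ["abnormal"] * 10 + ["normal"] * 14

if __name__ == "__main__":
    print(by_priority(one_blip))    # one second of failure dominates the display
    print(by_frequency(ten_hours))  # ten hours of failure are hidden
```

The first call yields the abnormal state although it lasted one second, and the second yields the normal state although the service was abnormal for ten hours, which matches the two shortcomings the text identifies.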
Next, the method for collecting indicator data of a service provided by the embodiments of this application is described on the basis of the business system in the embodiment corresponding to Figure 1.
Figure 2 is a schematic flowchart of a method for collecting indicator data of a service provided by an embodiment of this application. The method is applied to the business system shown in Figure 1. As shown in Figure 2, the method includes S201-S203.
S201: The host device collects all indicator data of the service that it runs.
While running the service, the host device can generate a running log of the service. The running log records all indicator data of the service produced while the host device runs it.
The host device can extract the indicator data of the service from the running log. In a possible implementation, the host device receives an indicator collection instruction sent by the management platform and, in response to that instruction, extracts the indicator data of the service from the running log.
In a possible implementation, all indicator data of the service can be collected through Kubernetes (k8s) metrics.
Here, the indicator data of the service carries a timestamp, which indicates when the indicator data was generated.
S202: The host device sends all indicator data of the service to the management platform.
The host device has a communication connection with the management platform, which enables data transmission. The host device can send the indicator data of the service to the management platform in several ways.
The management platform may collect the indicator data of the service in the way Prometheus acquires metrics. For example, a client can be installed on the host device and work together with a server in the management platform to collect the indicator data of the service.
In a possible implementation, the host device sends all indicator data of the service to the management platform passively: in response to an indicator collection instruction, the host device collects all indicator data of the service and sends it to the management platform.
In another possible implementation, the host device sends all indicator data of the service to the management platform actively: to save storage resources on the host device, the host device sends all indicator data of the service to the management platform on its own initiative once the data has been extracted.
In this embodiment, because the volume of indicator data of the service is large, the indicator data can be collected at a certain collection step size. Here, the collection step size is configured by the user through the management platform. The management platform sends the collection step size to the host device, and the host device collects all indicator data of the service according to the collection step size.
S203: The management platform stores the indicator data of the service.
The management platform can store the indicator data in chronological order as time-series data, which makes subsequent retrieval by time straightforward.
Next, the method for displaying service health status provided by the embodiments of this application is described on the basis of the business system in the embodiment corresponding to Figure 1.
Figure 3 is a schematic flowchart of a method for displaying service health status provided by an embodiment of this application. The method is applied to the business system shown in Figure 1. As shown in Figure 3, the method includes S301-S304.
S301: The management platform determines the query time range of the data query.
The query time range is the time range of the data that the user needs to query, and it includes the start time and end time of the data query. For example, if the query time range is the past 24 hours and the current time is 6 p.m. on June 15, 2022, then the query time range is from 6 p.m. on June 14, 2022 to 6 p.m. on June 15, 2022.
In a possible implementation, the management platform includes a display, which is used to display a human-computer interaction interface for data queries. For example, the user can enter the query time range of the data query through this interface.
The management platform can determine the query time range of the data query after receiving a data query instruction. For example, the user can enter the data query instruction through the human-computer interaction interface for data queries. As shown in Figure 4, the human-computer interaction interface 40 of the display shows a data query window 41, which displays "data query" together with a "confirm" control 42 and a "cancel" control 43. When the user clicks the "confirm" control 42, the management platform generates the data query instruction.
The query time range can be determined in several ways, as follows.
In a possible implementation, the query time range is determined according to a query duration pre-configured on the management platform. For example, if the default query duration of the management platform is the past 24 hours, then after receiving the data query instruction the management platform calculates the query time range from the current time and the query duration.
In another possible implementation, the management platform obtains the query time range through the human-computer interaction interface for data queries. The user enters the query time range through this interface.
In yet another possible implementation, the management platform is configured with multiple query durations. For example, as shown in Figure 5, after the user clicks the "confirm" control 52, the human-computer interaction interface 50 shows a query duration window 51, which lists several query durations (the past 24 hours, the past 72 hours, and the past week) together with a query duration confirmation control 53, which the user clicks to confirm the selection. That is, the management platform displays the available query durations through the human-computer interaction interface for data queries, and the user selects one of them through that interface. The management platform then determines the query time range of the data query from the selected query duration and the current time.
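Deriving the query time range from the current time and a configured query duration can be sketched as follows. The function name `query_time_range` and its default duration are illustrative assumptions; the date used is the one from the example in S301.

```python
from datetime import datetime, timedelta
from typing import Tuple


def query_time_range(
    now: datetime, duration: timedelta = timedelta(hours=24)
) -> Tuple[datetime, datetime]:
    """Return (start, end) of the query, counted back from the current time."""
    return now - duration, now


if __name__ == "__main__":
    # Current time: 6 p.m. on June 15, 2022; default duration: past 24 hours.
    start, end = query_time_range(datetime(2022, 6, 15, 18, 0))
    print(start, end)
```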
S302: The management platform determines the indicator data of the service within the query time range.
The service may be a service in any of various business scenarios, for example, a service in a mobile phone bill payment scenario, a book lookup scenario, or an item purchase scenario. The service may be a business service or one of the multiple microservices that make up a business service.
The indicator data of a service reflects the health status of the service, which allows the management platform to monitor the health of the service based on its indicator data.
The management platform stores all indicator data of the service, in chronological order. Optionally, the management platform retrieves the indicator data within the query time range, that is, it samples from all indicator data of the service to determine the indicator data of the service within the query time range. Optionally, the indicator data includes the original health state of the service and a timestamp. The management platform can obtain the indicator data of the service within the query time range according to the timestamps in the indicator data.
In a possible implementation, to ensure the accuracy of health monitoring, the management platform takes all indicator data of the service within the query time range and performs health monitoring on it.
In another possible implementation, because the volume of indicator data of the service is large, all indicator data of the service can be sampled according to a query step size to obtain the indicator data of the service within the query time range, which balances computing resources against the accuracy of health monitoring. Note that "the indicator data of the service" here refers to the data sampled from all indicator data of the service.
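Retrieving the points inside the query time range from chronologically stored data and then thinning them by a query step size can be sketched as follows. The record layout (timestamp in seconds paired with a state string) and the function name `sample_range` are assumptions made for this illustration.

```python
from bisect import bisect_left, bisect_right
from typing import List, Tuple

Record = Tuple[int, str]  # (timestamp in seconds, original health state)


def sample_range(
    records: List[Record], start: int, end: int, query_step: int
) -> List[Record]:
    """Pick records in [start, end] that are at least query_step apart.

    `records` must already be sorted by timestamp, as time-series
    storage in chronological order would provide.
    """
    keys = [ts for ts, _ in records]
    lo = bisect_left(keys, start)   # first record at or after start
    hi = bisect_right(keys, end)    # one past the last record at or before end
    sampled: List[Record] = []
    next_due = start
    for ts, state in records[lo:hi]:
        if ts >= next_due:
            sampled.append((ts, state))
            next_due = ts + query_step
    return sampled


if __name__ == "__main__":
    data = [(t, "normal") for t in range(0, 600, 60)]  # one point per minute
    print(sample_range(data, 120, 480, 120))
```

With a query step equal to the collection step every point in the range is kept; a larger query step trades completeness for a smaller result set.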
As one example, the query step size may be entered by the user through the human-computer interaction interface for data queries. As shown in Figure 6, when the user confirms a data query, the human-computer interaction interface 60 for data queries can display a query step size input box 61 and a query step size unit 62 (such as minutes, seconds or hours). The user confirms the data query through the "confirm" control 63. The management platform generates the data query instruction according to the query step size entered by the user; in other words, the data query instruction includes the query step size.
As another example, when the collection step size equals the query step size, all indicator data of the service within the query time range is presented accurately, with nothing omitted. However, when the query time range is large, the data volume becomes very large and the interface slows down. When the collection step size is smaller than the query step size, the advantage is that the data volume can be controlled. The disadvantage is that some collected original sampling points may be skipped, so data can be missed: if a skipped sampling point happens to be abnormal, the abnormality cannot be reported accurately, and the user experience suffers. To balance computing resources against the accuracy of health monitoring, the management platform can determine the query step size adaptively.
Specifically, the management platform determines the query step size according to the collection step size, the preset time length threshold and the query time range.
The period a user can query is not unbounded. A time length threshold can therefore be configured to cap the number of query points. The cap on the number of query points determines the resolution. For example, if the user wants a single abnormal point to be reported in red and the cap is 600 query points, then 1/600 must be used as the minimum resolution when the weights are subsequently calculated (see the later description for details).
When the query time range is less than or equal to the time length threshold, the query step size equals the collection step size. When the query time range is greater than the time length threshold, the query step size querystep satisfies the following formula (1):
where querytime denotes the query time range, T denotes the preset time length threshold, and scrape_interval denotes the collection step size. Note that all parameters in the formula use the same unit, for example minutes. The unit can be set as needed and is not limited here.
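Formula (1) itself does not appear in this text, so the sketch below does not reproduce it. It assumes one form that is consistent with the surrounding description: the query step size equals the collection step size up to the threshold, and beyond it the step grows as a whole multiple of the collection step size so that the number of query points never exceeds T / scrape_interval. The function name `query_step` and this specific form are assumptions.

```python
import math


def query_step(querytime: float, T: float, scrape_interval: float) -> float:
    """Adaptive query step; all arguments share one unit (e.g. minutes).

    Assumed form for the case querytime > T (not taken from formula (1)):
        querystep = ceil(querytime / T) * scrape_interval
    """
    if querytime <= T:
        return scrape_interval
    return math.ceil(querytime / T) * scrape_interval


if __name__ == "__main__":
    # Threshold of 600 minutes, one collected point per minute.
    print(query_step(60, 600, 1))    # short query: every collected point
    print(query_step(1440, 600, 1))  # 24-hour query: coarser step
```

Under this assumed form a 24-hour query with a 600-minute threshold and a 1-minute collection step uses a 3-minute step, giving 480 query points, which stays below the cap of 600.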
S303: According to the indicator data of the service within the query time range, determine the frequency at which each of the multiple original health states of the service occurs within the query time range.
The original health state of a service indicates the state of the service while it is running. Users can define how the original health states are divided, and how many there are, according to the monitored object and their monitoring needs. For example, the original health states of a service can be divided into a lossy state, an abnormal state and a normal state. The lossy state means that the service is unhealthy in some respect while running but still operates normally, so users do not notice. The abnormal state means that the service is unhealthy while running and can no longer be provided normally, which has a severe impact on users.
Within the query time range, the original health state of the service may differ from moment to moment. The indicator data of the service at different moments within the query time range is analyzed statistically to determine the original health state of the service at each moment. The frequency at which each original health state occurs within the query time range is then counted.
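Counting how often each original health state occurs among the sampled points can be sketched as follows. The state names and the function name `state_frequencies` are illustrative assumptions, and frequency is taken here to mean the share of sampled points in each state.

```python
from collections import Counter
from typing import Dict, List


def state_frequencies(states: List[str]) -> Dict[str, float]:
    """Return the fraction of sampled points spent in each original state."""
    if not states:
        raise ValueError("no indicator data in the query time range")
    counts = Counter(states)
    total = len(states)
    return {state: n / total for state, n in counts.items()}


if __name__ == "__main__":
    sampled = ["normal"] * 8 + ["lossy"] + ["abnormal"]
    print(state_frequencies(sampled))
```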
S304: Display the status identifier of the service according to the weight corresponding to each of the multiple original health states of the service and the frequency at which each original health state occurs within the query time range, so that the user can locate faults based on the status identifier of the service. The status identifier indicates the final health state of the service, and the weight represents how strongly the frequency of an original health state influences the final health state.
The final health states can be defined by users according to their own needs. Because the frequency with which different original health states occur affects different users differently, users define the final health states in terms of those frequencies. For example, a user may care about the following four final health states: the original health state of the service is normal throughout the query time range (denoted Healthy' below); a lossy state occurs within the query time range (denoted Degraded' below); an abnormal state occurs within the query time range with a frequency no higher than 10% (denoted Failure' below); and an abnormal state occurs within the query time range with a frequency higher than 10% (denoted Failure'' below).
The weight of an original health state represents how strongly the frequency with which that state occurs within the query time range influences the final health state.
The management platform calculates the status score of the service within the query time range from the weight of each original health state and the frequency with which that state occurs within the query time range, and then displays the status identifier corresponding to the status score.
The status score of the service within the query time range, score, satisfies the following formula (2):

$$\text{score} = \sum_{i} W(s_i)\, F(s_i) \tag{2}$$

where $s_i$ denotes the i-th original health state of the service, $F(s_i)$ denotes the frequency with which the i-th original health state occurs within the query time range, $W(s_i)$ denotes the weight of the i-th original health state, and i is a positive integer.
In this embodiment, the management platform stores the correspondence between status identifiers and status score ranges. For example, status identifier 1 indicates that the original health state of the service is normal (Healthy) throughout the query time range, and its status score range is score = 1. Status identifier 2 indicates that a lossy state (Degraded) occurred within the query time range, and its status score range is 1 < score ≤ 10. Status identifier 3 indicates that an abnormal state (Failure) occurred within the query time range with a frequency no higher than 10%, and its status score range is 10 < score ≤ 1440. Status identifier 4 indicates that an abnormal state (Failure) occurred within the query time range with a frequency higher than 10%, and its status score range is score > 1440.
For example, when score = 1, a green status identifier is displayed: the state was normal (Healthy) throughout the query time range. When 1 < score ≤ 10, a yellow status identifier is displayed: a lossy state (Degraded) occurred within the query time range. When 10 < score ≤ 1440, an orange status identifier is displayed: an abnormal state (Failure) occurred within the query time range with a frequency no higher than 10%. When score > 1440, a red status identifier is displayed: an abnormal state (Failure) occurred within the query time range with a frequency higher than 10%.
By looking at the status identifier, the user knows exactly which of the four final health states it represents. For example, suppose the user selects a query time range of 24 hours, that is, 1440 minutes, in which the service was abnormal (Failure) for one minute and normal (Healthy) for the rest. The status score is score = 1439/1440 + 0 + 14400 × 1/1440 = 10.9993. Since 10 < score ≤ 1440, the orange identifier is displayed. As another example, suppose the user selects a query time range of 1 hour, that is, 60 minutes, in which the service was abnormal (Failure) for ten minutes and normal (Healthy) for the rest. The status score is score = 50/60 + 0 + 14400 × 10/60 = 2400.8333. Since score > 1440, the red identifier is displayed.
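The two worked examples can be reproduced with a short sketch. It assumes the status score is the weighted sum of the state frequencies, which is what the arithmetic in the examples implies, and uses the example weights (1, 10, 14400) and score ranges from the text; the names `status_score` and `status_identifier` and the color strings are illustrative only.

```python
from fractions import Fraction

# Example weights from the text: Healthy = 1, Degraded = 10, Failure = 14400.
WEIGHTS = {"Healthy": 1, "Degraded": 10, "Failure": 14400}


def status_score(freqs):
    """Weighted sum of the frequencies of the original health states."""
    return sum(WEIGHTS[state] * freq for state, freq in freqs.items())


def status_identifier(score):
    """Map a status score to the color identifier of its score range."""
    if score <= 1:
        return "green"   # normal throughout the query time range
    if score <= 10:
        return "yellow"  # a lossy state occurred
    if score <= 1440:
        return "orange"  # abnormal state, frequency no higher than 10%
    return "red"         # abnormal state, frequency higher than 10%


# 24-hour query with one abnormal minute, then 1-hour query with ten.
day = {"Healthy": Fraction(1439, 1440), "Degraded": Fraction(0),
       "Failure": Fraction(1, 1440)}
hour = {"Healthy": Fraction(50, 60), "Degraded": Fraction(0),
        "Failure": Fraction(10, 60)}

for freqs in (day, hour):
    score = status_score(freqs)
    print(round(float(score), 4), status_identifier(score))
# prints "10.9993 orange" and then "2400.8333 red"
```

The same one-minute outage that would be almost invisible in a plain average is pushed past the score of 10 by the large Failure weight, which is the effect the weighting is designed to achieve.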
Here, the status identifier may be a color identifier or a shape identifier, which is not limited here.
In one possible implementation, the weights of the original health states and the status score ranges can be configured by the user as needed.
In another possible implementation, the management platform can calculate the weight corresponding to each of the multiple original health states by means of a weight determination model. Specifically, the weight and the status score range corresponding to each of the multiple original health states are determined according to a preconfigured weight determination model and a preset time length threshold.
The weight determination model indicates the correspondence between the frequencies of the original health states of the service and the final health state of the service. The weight determination model can be regarded as a system of inequalities in several variables. Users can configure the correspondence between status score ranges and frequency ranges of the original health states according to their own needs.
For example, the status score range has four segments: (0, S1], (S1, S2], (S2, S3], and (S3, ∞), where a larger score indicates a more dangerous state. The minimum resolution is determined by a time length threshold of 24 hours, that is, the minimum resolution is 1/1440. The correspondence between the frequencies of the original health states of the service and the final health state of the service is shown in Table 1:
Table 1
From the status score formula (2), the following system of inequalities (3), that is, the weight determination model, can be obtained.
Here, W1 is the weight corresponding to the normal state (Healthy), W2 is the weight corresponding to the lossy state (Degraded), and W3 is the weight corresponding to the abnormal state (Failure).
Take one feasible solution that satisfies the constraints: W1 = 1, W2 = 10, W3 = 14400.
It can be rigorously shown that this feasible solution meets the requirements. From the weight values in the feasible solution, S1 = 1, S2 = 10, and S3 = 1440, so the score segments (that is, the status score ranges) are (0, 1], (1, 10], (10, 1440], and (1440, ∞). It follows that, within the query time range:
1) Whenever an abnormal state occurs, the status score is strictly greater than 10.
2) When the frequency of the abnormal state is below 10%, the status score is always less than 1440.
3) When only the normal state occurs, the status score is exactly 1.
4) Whenever a lossy state occurs, the status score is greater than 1.
Therefore, there are four final health states, and they reflect not only the weights of the original health states but also the frequencies with which the original health states occur within the query time range.
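Properties 1) to 4) can be checked by brute force at the stated resolution. The sketch below is a verification aid under assumptions, not part of the application: it takes the example weights (1, 10, 14400), assumes the score is the weighted sum of frequencies, and enumerates every way of splitting n = 1440 one-minute samples among the three states. The function name `check_properties` is hypothetical.

```python
N = 1440  # samples in the 24-hour threshold window, one per minute
W_HEALTHY, W_DEGRADED, W_FAILURE = 1, 10, 14400


def check_properties(n=N):
    """Enumerate every split of n samples and assert properties 1) to 4)."""
    for failure in range(n + 1):
        for degraded in range(n - failure + 1):
            healthy = n - failure - degraded
            # score * n, kept as an integer to avoid rounding error
            scaled = (W_HEALTHY * healthy + W_DEGRADED * degraded
                      + W_FAILURE * failure)
            if failure > 0:
                assert scaled > 10 * n      # 1) score > 10
            if 10 * failure < n:
                assert scaled < 1440 * n    # 2) score < 1440
            if healthy == n:
                assert scaled == n          # 3) score == 1
            if degraded > 0:
                assert scaled > n           # 4) score > 1
    return True


print(check_properties())
```

Property 1) depends on the resolution: with more than 1440 samples in the window, a single abnormal sample would contribute less than 10 to the score, which is why the weights are tied to the time length threshold.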
In some embodiments, a business may include multiple services. For example, as shown in Figure 7, the book purchasing business includes the following services: book title search service 11, book display service 12, book evaluation service 13, and order payment service 14. The relationships among these four services are shown in Figure 7. Book title search service 11, book display service 12, and book evaluation service 13 were all in the normal state throughout the 24 hours, so their status identifiers are all triangles. Order payment service 14 entered an abnormal state within the 24 hours, so its status identifier is a square. The management platform can display the final health states of multiple services under the same business through a human-computer interaction interface. Further, the management platform can display the final health states of the services under each of multiple businesses.
In some embodiments, the user can view the original health state of a service at each moment through its status identifier. Specifically, an operation performed by the user on the status identifier of a target service is received; in response to the operation, the original health states of the target service at multiple moments within the query time range are displayed.
For example, as shown in Figure 8, in the book purchasing business the status identifiers of book title search service 11, book display service 12, and book evaluation service 13 are all triangles, while the status identifier of order payment service 14 is a square. The user can operate on the status identifier of order payment service 14 through the human-computer interaction interface, and the management platform then displays the original health state of order payment service 14 at each sampling moment in the past 24 hours.
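One way to present the per-moment original health states after such an operation is to collapse consecutive samples with the same state into intervals. The sketch below is only an assumption about presentation; the text does not say how the moments are laid out, and the sample data and the name `state_runs` are hypothetical.

```python
from itertools import groupby


def state_runs(samples):
    """Collapse (timestamp, state) samples into (start, end, state) runs."""
    runs = []
    for state, group in groupby(samples, key=lambda item: item[1]):
        members = list(group)
        runs.append((members[0][0], members[-1][0], state))
    return runs


# Hypothetical per-minute samples for order payment service 14:
# abnormal during minutes 600 to 602, normal otherwise.
samples = [(t, "Failure" if 600 <= t < 603 else "Healthy")
           for t in range(1440)]

for start, end, state in state_runs(samples):
    print(f"{start:4d}-{end:4d} min: {state}")
```

Grouping by run keeps a 24-hour drill-down readable while still exposing exactly when the service left the normal state.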
Based on the method for displaying service health status shown in Figure 3, an embodiment of the present application provides an apparatus for displaying service health status.
Figure 9 is a schematic structural diagram of an apparatus for displaying service health status provided by an embodiment of the present application. The apparatus can be applied to the management platform shown in Figure 1. As shown in Figure 9, the apparatus includes a first determination module 901, a second determination module 902, a third determination module 903, and a display module 904.
The first determination module 901 is configured to determine the query time range of a data query;
The second determination module 902 is configured to determine the indicator data of the service within the query time range, where the indicator data of the service reflects the original health state of the service;
The third determination module 903 is configured to determine, based on the indicator data of the service within the query time range, the frequency at which each of the multiple original health states of the service occurs within the query time range;
The display module 904 is configured to display the status identifier of the service according to the weight corresponding to each of the multiple original health states of the service and the frequency with which the service is in each of the multiple original health states within the query time range, so that the user can locate faults based on the status identifier of the service, where the status identifier indicates the final health state of the service and the weight represents how strongly the frequency of an original health state influences the final health state.
In this embodiment of the present application, the status identifier of the service is displayed by combining the weights of the original health states with the frequencies at which they occur within the query time range, so that the status identifier reflects how strongly those frequencies influence the final health state. This improves the accuracy with which the health status of the service is represented and, in turn, the accuracy of service fault location.
In one possible implementation, the apparatus for displaying service health status further includes:
a fourth determination module, configured to determine the weight corresponding to each of the multiple original health states according to a preconfigured weight determination model and a preset time length threshold;
where the weight determination model indicates the correspondence between the frequencies of the multiple original health states of the service and the final health state of the service.
In this way, the weights are determined by the weight determination model, so that the weight of each original health state captures how strongly the frequency of that state influences the final health state.
In one possible implementation, the fourth determination module is further configured to determine the status score range corresponding to each of the multiple final health states according to the weights corresponding to the multiple original health states and the preset time length threshold;
a calculation module is configured to calculate the status score of the service according to the weight corresponding to each of the multiple original health states of the service and the frequency at which each original health state occurs within the query time range;
the display module is configured to display the status identifier of the service according to the status score of the service and the score ranges corresponding to the multiple final health states.
In one possible implementation, the first determination module is further configured to determine the query step of the data query;
the second determination module is configured to collect the indicator data of the service from all indicator data of the service according to the query step and the query time range.
In this way, because the volume of indicator data for a service is large, all indicator data of the service can be sampled according to the query step, balancing computing resources against the accuracy of health monitoring.
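Sampling by query step can be sketched as below. The text does not specify how a point is chosen within each step, so this sketch assumes the first stored point at or after each step boundary is kept; the function name `sample_by_query_step` and the data layout are hypothetical.

```python
def sample_by_query_step(points, start, end, query_step):
    """Keep one stored point per query step inside [start, end).

    points is a list of (timestamp, value) pairs sorted by timestamp.
    All times share one unit (minutes in the example below).
    """
    if query_step <= 0:
        raise ValueError("query_step must be positive")
    selected = []
    next_time = start
    for timestamp, value in points:
        if timestamp < start or timestamp >= end:
            continue
        if timestamp >= next_time:
            selected.append((timestamp, value))
            # Advance to the first step boundary after this point.
            steps = (timestamp - start) // query_step + 1
            next_time = start + steps * query_step
    return selected


# Stored points every minute for one hour, queried with a 5-minute step.
points = [(t, "Healthy") for t in range(60)]
sampled = sample_by_query_step(points, start=0, end=60, query_step=5)
print(len(sampled), [t for t, _ in sampled][:3])
```

A larger query step reduces the number of points processed per query, at the cost of possibly skipping a short-lived abnormal sample, which is the trade-off the paragraph above describes.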
In one possible implementation, the amount of indicator data of the service is not greater than a target quantity;
where the target quantity is determined according to a preconfigured collection step, and the collection step indicates the length of the time interval between items of indicator data of the service.
In one possible implementation, the first determination module is configured to determine the collection step, where the collection step indicates the length of the time interval between all items of indicator data of the service, and to determine the query step according to the collection step, the preset time length threshold, and the query time range.
In this way, the query step is determined adaptively from the collection step, which avoids missed reports of service faults and balances computing resources against the accuracy of health monitoring.
In one possible implementation, the status identifier includes a color identifier or a shape identifier.
In one possible implementation, there are multiple services.
In one possible implementation, the apparatus further includes:
a receiving module, configured to receive an operation performed by the user on the status identifier of a target service;
the display module is further configured to display, in response to the operation, the original health states of the target service at multiple moments within the query time range.
In this way, the user can conveniently view the original health state of the service at multiple moments.
The apparatus embodiment described in Figure 9 is merely illustrative. For example, the division into modules is only a division by logical function; in an actual implementation other divisions are possible, for example multiple modules or components may be combined or integrated into another system, or some features may be omitted or not performed. The functional modules in the embodiments of the present application may be integrated into one processing module, each module may exist physically on its own, or two or more modules may be integrated into one module. The modules in Figure 9 may be implemented in hardware, as software functional units, or as a combination of software and hardware.
Referring to Figure 10, Figure 10 shows a schematic structural diagram of a computing device provided by an embodiment of the present application. The computing device may be a server or the like. The management platform in Figure 1 includes at least one computing device. As shown in Figure 10, the computing device includes a processor 1001, a memory 1002, and a communication interface 1003. The processor 1001, the memory 1002, and the communication interface 1003 are connected through a bus 1004. The memory 1002 includes an operating system and program code modules.
The memory 1002 may include mass storage for data or instructions. By way of example and not limitation, the memory 1002 may include a hard disk drive (HDD), a floppy disk drive, flash memory, an optical disk, a magneto-optical disk, magnetic tape, a Universal Serial Bus (USB) drive, or a combination of two or more of these. Where appropriate, the memory 1002 may include removable or non-removable (fixed) media. Where appropriate, the memory 1002 may be internal or external to the computing device. In particular embodiments, the memory 1002 is non-volatile solid-state memory.
The memory may include read-only memory (ROM), random access memory (RAM), magnetic disk storage media devices, optical storage media devices, flash memory devices, and electrical, optical, or other physical or tangible memory storage devices. Thus, in general, the memory includes one or more tangible (non-transitory) computer-readable storage media (for example, memory devices) encoded with software comprising computer-executable instructions, and when the software is executed (for example, by one or more processors) it is operable to perform the operations described with reference to the methods of the present application.
The processor 1001 reads and executes the computer program instructions stored in the memory 1002 to implement any of the methods for displaying service health status in the above embodiments.
In one example, the electronic device may further include a communication interface 1003 and a bus 1010. As shown in Figure 10, the processor 1001, the memory 1002, and the communication interface 1003 are connected through the bus 1010 and communicate with one another over it.
The communication interface 1003 is mainly used to implement communication between the modules, apparatuses, units, and/or devices in the embodiments of the present application.
The bus 1010 includes hardware, software, or both, and couples the components of the electronic device to one another. By way of example and not limitation, the bus may include an Accelerated Graphics Port (AGP) or other graphics bus, an Enhanced Industry Standard Architecture (EISA) bus, a front-side bus (FSB), a HyperTransport (HT) interconnect, an Industry Standard Architecture (ISA) bus, an InfiniBand interconnect, a low-pin-count (LPC) bus, a memory bus, a Micro Channel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI-Express (PCI-X) bus, a Serial Advanced Technology Attachment (SATA) bus, a Video Electronics Standards Association local (VLB) bus, another suitable bus, or a combination of two or more of these. Where appropriate, the bus 1010 may include one or more buses. Although the embodiments of the present application describe and illustrate a particular bus, the present application contemplates any suitable bus or interconnect.
It can be understood that the processor in the embodiments of the present application may be a central processing unit (CPU), or another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a transistor logic device, a hardware component, or any combination thereof. The general-purpose processor may be a microprocessor or any conventional processor.
The method steps in the embodiments of the present application may be implemented in hardware or by a processor executing software instructions. The software instructions may consist of corresponding software modules, which may be stored in random access memory (RAM), flash memory, read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), a register, a hard disk, a removable hard disk, a CD-ROM, or any other form of storage medium well known in the art. An exemplary storage medium is coupled to the processor so that the processor can read information from, and write information to, the storage medium. The storage medium may also be an integral part of the processor. The processor and the storage medium may be located in an ASIC.
The above embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in the embodiments of the present application are produced in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable apparatus. The computer instructions may be stored in a computer-readable storage medium or transmitted by means of the computer-readable storage medium. The computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by wired means (for example, coaxial cable, optical fiber, or digital subscriber line (DSL)) or wireless means (for example, infrared, radio, or microwave). The computer-readable storage medium may be any available medium that a computer can access, or a data storage device such as a server or data center that integrates one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, hard disk, or magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
It can be understood that the various numerals used in the embodiments of the present application are merely for ease of description and are not intended to limit the scope of the embodiments of the present application.

Claims (22)

  1. A method for displaying service health status, characterized by comprising:
    determining a query time range of a data query;
    determining indicator data of a service within the query time range, wherein the indicator data of the service reflects an original health state of the service;
    determining, based on the indicator data of the service within the query time range, the frequency at which each of multiple original health states of the service occurs within the query time range;
    displaying a status identifier of the service according to the weight corresponding to each of the multiple original health states of the service and the frequency with which the service is in each of the multiple original health states within the query time, so that a user can locate faults based on the status identifier of the service, wherein the status identifier indicates a final health state of the service, and the weight represents how strongly the frequency with which an original health state occurs within the query time range influences the final health state.
  2. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1, further comprising:
    根据预先配置的权重确定模型和预设的时间长度阈值,确定所述多种原始健康状态各自对应的权重;Determine the respective weights corresponding to the multiple original health states according to the preconfigured weight determination model and the preset time length threshold;
    其中,所述权重确定模型用于指示服务的多种原始健康状态的频率与服务的最终健康状态的对应关系。Wherein, the weight determination model is used to indicate the corresponding relationship between the frequency of multiple original health states of the service and the final health state of the service.
  3. The method according to claim 2, wherein the method further comprises:
    determining status score ranges respectively corresponding to multiple final health states according to the weights respectively corresponding to the multiple original health states and the preset time length threshold;
    wherein the displaying of the status identifier of the service according to the weights respectively corresponding to the multiple original health states of the service and the frequencies at which the service is in the multiple original health states within the query time comprises:
    calculating a status score corresponding to the service according to the weights respectively corresponding to the multiple original health states of the service and the frequencies at which the multiple original health states each occur within the query time;
    displaying the status identifier of the service according to the status score corresponding to the service and the score ranges respectively corresponding to the multiple final health states.
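The score calculation and range lookup of claim 3 can be sketched as follows. The claim does not fix a formula, so this sketch assumes a simple weighted sum of frequencies; the weights, the score ranges, and the colour labels are hypothetical values chosen only to make the example concrete.

```python
def status_score(frequencies, weights):
    """Weighted sum of state frequencies (an assumed scoring formula)."""
    return sum(weights.get(state, 0) * count for state, count in frequencies.items())


def final_status(score, score_ranges):
    """Return the label whose half-open range [low, high) contains the score."""
    for label, (low, high) in score_ranges.items():
        if low <= score < high:
            return label
    return "unknown"


# Hypothetical weights per original health state.
weights = {"normal": 0, "warning": 1, "critical": 5}
# Hypothetical score ranges per final health state, shown as colour identifiers.
score_ranges = {"green": (0, 1), "yellow": (1, 5), "red": (5, float("inf"))}

frequencies = {"normal": 8, "warning": 1, "critical": 1}
score = status_score(frequencies, weights)
label = final_status(score, score_ranges)
```

With these assumed values a single critical sample outweighs eight normal ones, which is the kind of effect the weights are meant to express: rare but severe states still dominate the displayed status.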
  4. The method according to claim 1, wherein the method further comprises:
    determining a query step size of the data query;
    wherein the determining of the indicator data of the service within the query time range comprises:
    collecting the indicator data of the service from all indicator data of the service according to the query step size and the query time range.
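One way to read the collection step of claim 4 is as downsampling: from all stored indicator data, keep at most one sample per query step inside the query time range. The sketch below follows that reading; the function name `collect_by_step` and the `(timestamp, value)` sample format are assumptions for illustration, not the application's implementation.

```python
def collect_by_step(samples, start, end, step):
    """Keep the first sample in each step-sized window of [start, end].

    samples: iterable of (timestamp, value) pairs (hypothetical format).
    step: query step size, in the same unit as the timestamps.
    """
    picked = []
    next_ts = start
    for ts, value in sorted(samples):
        if ts < start or ts > end:
            continue
        if ts >= next_ts:
            picked.append((ts, value))
            # Advance to the first window boundary strictly after this sample.
            next_ts = start + ((ts - start) // step + 1) * step
    return picked


# Hypothetical data collected every 60 seconds, queried with a 120-second step.
all_samples = [(t, t // 60) for t in range(0, 600, 60)]
picked = collect_by_step(all_samples, start=0, end=540, step=120)
```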
  5. The method according to claim 4, wherein the data volume of the indicator data of the service is not greater than a target number;
    wherein the target number is determined according to a preconfigured collection step size, and the collection step size indicates the length of the time interval between pieces of indicator data of the service.
  6. The method according to claim 4, wherein the determining of the query step size of the data query comprises:
    determining a collection step size, wherein the collection step size indicates the length of the time interval between all pieces of indicator data of the service;
    determining the query step size according to the collection step size, a preset time length threshold and the query time range.
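Claims 5 and 6 name the inputs to the query step size but not the rule that combines them. The sketch below is one plausible rule, stated as an assumption: the target number is taken to be the number of collection intervals that fit into the time length threshold, and the query step size is the smallest multiple of the collection step size that keeps the number of queried points at or below that target. The function name and the example values are hypothetical.

```python
import math


def query_step(collection_step, length_threshold, query_range):
    """Smallest multiple of collection_step keeping the point count bounded.

    All arguments are durations in the same unit (for example, seconds).
    The bound used here (length_threshold // collection_step) is an assumption.
    """
    target = max(1, length_threshold // collection_step)
    multiples = max(1, math.ceil(query_range / (target * collection_step)))
    return multiples * collection_step


# Hypothetical values: data every 60 s, threshold of one hour.
one_hour = query_step(60, 3600, 3600)
one_day = query_step(60, 3600, 86400)
```

Under these assumptions a one-hour query uses the collection step unchanged, while a one-day query is widened so that roughly the same number of points is returned for either range.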
  7. The method according to any one of claims 1-6, wherein the status identifier comprises a color identifier or a shape identifier.
  8. The method according to any one of claims 1-6, wherein there are multiple services.
  9. The method according to any one of claims 1-6, wherein the method further comprises:
    receiving a user operation on the status identifier of a target service;
    displaying, according to the operation, the original health states of the target service at multiple moments within the query time range.
  10. A device for displaying a service health status, comprising:
    a first determination module, configured to determine a query time range of a data query;
    a second determination module, configured to determine indicator data of a service within the query time range, wherein the indicator data of the service reflects an original health state of the service;
    a third determination module, configured to determine, according to the indicator data of the service within the query time range, the frequency at which each of multiple original health states of the service occurs within the query time range;
    a display module, configured to display a status identifier of the service according to the weights respectively corresponding to the multiple original health states of the service and the frequencies at which the service is in the multiple original health states within the query time, so that a user can locate faults according to the status identifier of the service, wherein the status identifier indicates a final health state of the service, and each weight represents the degree to which the frequency of an original health state influences the final health state.
  11. The device according to claim 10, wherein the device further comprises:
    a fourth determination module, configured to determine the weights respectively corresponding to the multiple original health states according to a preconfigured weight determination model and a preset time length threshold;
    wherein the weight determination model indicates the correspondence between the frequencies of the multiple original health states of a service and the final health state of the service.
  12. The device according to claim 11, wherein the fourth determination module is further configured to determine status score ranges respectively corresponding to multiple final health states according to the weights respectively corresponding to the multiple original health states and the preset time length threshold;
    a calculation module, configured to calculate a status score corresponding to the service according to the weights respectively corresponding to the multiple original health states of the service and the frequencies at which the multiple original health states each occur within the query time;
    the display module is configured to display the status identifier of the service according to the status score corresponding to the service and the score ranges respectively corresponding to the multiple final health states.
  13. The device according to claim 10, wherein the first determination module is further configured to determine a query step size of the data query;
    the second determination module is configured to collect the indicator data of the service from all indicator data of the service according to the query step size and the query time range.
  14. The device according to claim 13, wherein the data volume of the indicator data of the service is not greater than a target number;
    wherein the target number is determined according to a preconfigured collection step size, and the collection step size indicates the length of the time interval between pieces of indicator data of the service.
  15. The device according to claim 13, wherein the first determination module is configured to: determine a collection step size, wherein the collection step size indicates the length of the time interval between all pieces of indicator data of the service; and determine the query step size according to the collection step size, a preset time length threshold and the query time range.
  16. The device according to any one of claims 10-15, wherein the status identifier comprises a color identifier or a shape identifier.
  17. The device according to any one of claims 10-15, wherein there are multiple services.
  18. The device according to any one of claims 10-15, wherein the device further comprises:
    a receiving module, configured to receive a user operation on the status identifier of a target service;
    the display module is further configured to display, according to the operation, the original health states of the target service at multiple moments within the query time range.
  19. A device for displaying a service health status, comprising:
    at least one memory, configured to store a program;
    at least one processor, configured to execute the program stored in the memory, wherein, when the program stored in the memory is executed, the processor performs the method according to any one of claims 1-9.
  20. A device for displaying a service health status, wherein the device runs computer program instructions to perform the method according to any one of claims 1-9.
  21. A computer storage medium storing instructions that, when run on a computer, cause the computer to perform the method according to any one of claims 1-9.
  22. A computer program product containing instructions that, when run on a computer, cause the computer to perform the method according to any one of claims 1-9.
PCT/CN2023/104819 2022-09-19 2023-06-30 Service health status display method and apparatus, and device and storage medium WO2024060776A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211138376.X 2022-09-19
CN202211138376.XA CN117762747A (en) 2022-09-19 2022-09-19 Method, device, equipment and storage medium for displaying service health status

Publications (1)

Publication Number Publication Date
WO2024060776A1 (en) 2024-03-28

Family

ID=90318575

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/104819 WO2024060776A1 (en) 2022-09-19 2023-06-30 Service health status display method and apparatus, and device and storage medium

Country Status (2)

Country Link
CN (1) CN117762747A (en)
WO (1) WO2024060776A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6959265B1 (en) * 2003-10-07 2005-10-25 Serden Technologies, Inc. User-centric measurement of quality of service in a computer network
CN102982231A (en) * 2012-10-30 2013-03-20 北京大学 Quantitative calculation method for software confidence level
US8819704B1 (en) * 2011-08-05 2014-08-26 Google Inc. Personalized availability characterization of online application services
CN113407597A (en) * 2021-06-28 2021-09-17 阿特拉斯·科普柯(无锡)压缩机有限公司 Abnormity early warning method and device, storage medium and computer equipment

Also Published As

Publication number Publication date
CN117762747A (en) 2024-03-26

Similar Documents

Publication Publication Date Title
WO2021147481A1 (en) Monitoring method and apparatus, and electronic device
CN111459782B (en) Method and device for monitoring service system, cloud platform system and server
CN110674009B (en) Application server performance monitoring method and device, storage medium and electronic equipment
CN107704387B (en) Method, device, electronic equipment and computer readable medium for system early warning
CN111147306B (en) Fault analysis method and device of Internet of things equipment and Internet of things platform
CN115396289A (en) Fault alarm determination method and device, electronic equipment and storage medium
CN113485862B (en) Method and device for managing service faults, electronic equipment and storage medium
CN113656252B (en) Fault positioning method, device, electronic equipment and storage medium
WO2022088803A1 (en) System information analysis method and apparatus based on cloud environment, electronic device, and medium
WO2024060776A1 (en) Service health status display method and apparatus, and device and storage medium
CN111654405B (en) Method, device, equipment and storage medium for fault node of communication link
WO2023134285A1 (en) Risk management method and risk management apparatus
CA2793952C (en) Extracting data related to clinical diagnostic instruments
CN114595765A (en) Data processing method and device, electronic equipment and storage medium
WO2021184588A1 (en) Cluster optimization method and device, server, and medium
CN114697247A (en) Fault detection method, device, equipment and storage medium of streaming media system
TWI822474B (en) Mobile network management system and method for private network
CN110347549A (en) A kind of computer fault alarm system and method, information data processing terminal
US11941284B2 (en) Management system, QoS violation detection method, and QoS violation detection program
CN117056110B (en) System fault investigation method and device, electronic equipment and storage medium
JP7245211B2 (en) Anomaly detection device, anomaly detection method, and program
CN114697319B (en) Tenant service management method and device for public cloud
WO2023173766A1 (en) Port traffic acquisition method and apparatus, storage medium, and electronic device
CN116340108A (en) Buried point data detection method and device, electronic equipment and readable storage medium
CN113942548A (en) Method and device for realizing standardized maintenance terminal of temporary speed limiting server