WO2024060776A1 - Procédé et appareil d'affichage d'état de santé de service, et dispositif et support de stockage - Google Patents

Procédé et appareil d'affichage d'état de santé de service, et dispositif et support de stockage Download PDF

Info

Publication number
WO2024060776A1
WO2024060776A1 PCT/CN2023/104819 CN2023104819W WO2024060776A1 WO 2024060776 A1 WO2024060776 A1 WO 2024060776A1 CN 2023104819 W CN2023104819 W CN 2023104819W WO 2024060776 A1 WO2024060776 A1 WO 2024060776A1
Authority
WO
WIPO (PCT)
Prior art keywords
service
query
status
health
time range
Prior art date
Application number
PCT/CN2023/104819
Other languages
English (en)
Chinese (zh)
Inventor
郭良
侯瑞军
Original Assignee
华为云计算技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为云计算技术有限公司 filed Critical 华为云计算技术有限公司
Publication of WO2024060776A1 publication Critical patent/WO2024060776A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine

Definitions

  • the present application relates to the field of computer technology, and in particular to a method, device, equipment and storage medium for displaying service health status.
  • the display method of the health status of services in related technologies cannot accurately represent the health status of services, causing operation and maintenance personnel to be unable to accurately locate service faults.
  • Embodiments of the present application provide a method, device, equipment and storage medium for displaying service health status, which can display the status identifier of the service based on the impact of the frequency of the original health status appearing within the query time range on the final health status, thereby enabling Improves the accuracy of representing the health status of a service.
  • embodiments of the present application provide a method for displaying service health status, including:
  • the indicator data of the service is used to reflect the original health status of the service
  • the status identifier of the service is displayed for users to locate faults based on the status identifier of the service.
  • the status identifier It is used to indicate the final health state of the service, and the weight is used to indicate the degree of influence of the frequency of the original health state on the final health state.
  • the embodiment of this application displays the status identifier of the service by combining the weight of the original health state and the frequency of the original health state within the query time, so that the status identifier of the service includes the impact of the frequency of the original health state on the final health state, improving Indicates the accuracy of the health status of the service, thereby improving the accuracy of service fault location.
  • the method for displaying the service health status further includes:
  • the corresponding weights of multiple original health states are determined
  • the weight determination model is used to indicate the correspondence between the frequency of multiple original health states of the service and the final health state of the service.
  • the weight is determined through the weight determination model, so that the weight of the original health state includes the degree of influence of the frequency of the original health state on the final health state.
  • the display method of service health status also includes:
  • the status identifier of the service is displayed, including:
  • the status identifier of the service is displayed based on the status score corresponding to the service and the corresponding score ranges of multiple final health states.
  • the display method of service health status also includes:
  • Determine the metric data served within the query time range including:
  • the indicator data of the service is collected from all indicator data of the service.
  • the data amount of the service indicator data is not greater than the target amount
  • the target number is determined based on the preconfigured collection step, which is used to indicate the length of the time interval between the indicator data of the service.
  • determining the query step size of the data query includes:
  • the query step is determined based on the collection step, the preset time length threshold and the query time range.
  • the query step size is determined adaptively through the collection step size, thereby avoiding missed reports of service failures, thus balancing computing resources and the accuracy of health monitoring.
  • the status identification includes a color identification or a shape identification.
  • the display method of service health status also includes:
  • the raw health status of the target service at multiple moments in the query time range is displayed.
  • embodiments of the present application provide a display device for serving health status, including:
  • the first determination module is used to determine the query time range of data query
  • the second determination module is used to determine the indicator data of the service within the query time range.
  • the indicator data of the service is used to reflect the original health status of the service;
  • the third determination module is used to determine the frequency of each of the multiple original health states of the service within the query time range based on the indicator data of the service within the query time range;
  • the display module is used to display the status identifier of the service based on the corresponding weights of the multiple original health states of the service and the frequency of occurrence of the service in the multiple original health states within the query time, so that the user can perform operations based on the status identifier of the service.
  • the status identifier is used to indicate the final health status of the service
  • the weight is used to indicate the impact of the frequency of the original health status on the final health status.
  • the embodiment of this application displays the status identifier of the service by combining the weight of the original health state and the frequency of the original health state within the query time, so that the status identifier of the service includes the impact of the frequency of the original health state on the final health state, improving Indicates the accuracy of the health status of the service, thereby improving the accuracy of service fault location.
  • the service health status display device also includes:
  • a fourth determination module is used to determine the weights corresponding to the various original health states according to a pre-configured weight determination model and a preset time length threshold;
  • the weight determination model is used to indicate the correspondence between the frequency of multiple original health states of the service and the final health state of the service.
  • the weight is determined through the weight determination model, so that the weight of the original health state includes the degree of influence of the frequency of the original health state on the final health state.
  • the fourth determination module is also used to determine the state score ranges corresponding to the multiple final health states based on the respective weights and preset time length thresholds of the multiple original health states;
  • the calculation module is used to calculate the status score corresponding to the service based on the respective weights of the multiple original health states of the service and the respective frequencies of the multiple original health states within the query time;
  • the display module is used to display the status identifier of the service according to the status score corresponding to the service and the score ranges corresponding to the multiple final health states.
  • the first determination module is also used to determine the query step size of the data query
  • the second determination module is used to collect the indicator data of the service from all the indicator data of the service according to the query step size and the query time range.
  • the data amount of the service indicator data is not greater than the target amount
  • the target number is determined based on the preconfigured collection step, which is used to indicate the length of the time interval between the indicator data of the service.
  • the first determination module is used to determine the collection step, and the collection step is used to indicate the length of the time interval between all indicator data of the service; according to the collection step, the preset time length threshold and Query the time range and determine the query step size.
  • the query step size is determined adaptively through the collection step size, thereby avoiding missed reports of service failures, thus balancing computing resources and the accuracy of health monitoring.
  • the status identification includes a color identification or a shape identification.
  • the device further includes:
  • the receiving module is used to receive the user's operation on the status identification of the target service
  • the display module is also used to display the original health status of the target service at multiple moments within the query time range based on operations.
  • embodiments of the present application provide a display device for serving health status, including: at least one memory for storing a program; at least one processor for executing the program stored in the memory. When the program stored in the memory is executed When, the processor is used to execute the method provided in the first aspect.
  • embodiments of the present application provide a display device for serving health status, characterized in that the device runs computer program instructions to execute the method provided in the second aspect.
  • the device may be a chip or a processor.
  • the apparatus may include a processor, which may be coupled to a memory, read instructions in the memory and execute the method provided in the first aspect according to the instructions.
  • the memory may be integrated into the chip or processor, or may be independent of the chip or processor.
  • embodiments of the present application provide a computer storage medium. Instructions are stored in the computer storage medium. When the instructions are run on a computer, they cause the computer to execute the method provided in the first aspect.
  • embodiments of the present application provide a computer program product containing instructions. When the instructions are run on a computer, they cause the computer to execute the method provided in the first aspect.
  • Figure 1 is a system architecture diagram of a business system provided by an embodiment of the present application.
  • Figure 2 is a schematic flow chart of a service indicator data collection method provided by an embodiment of the present application.
  • Figure 3 is a schematic flowchart of a method for displaying service health status provided by an embodiment of the present application
  • Figure 4 is a schematic diagram of a human-computer interaction interface provided by an embodiment of the present application.
  • FIG. 5 is a schematic diagram of another human-computer interaction interface provided by an embodiment of the present application.
  • Figure 6 is a schematic diagram of yet another human-computer interaction interface provided by an embodiment of the present application.
  • Figure 7 is a schematic diagram of a service health status display interface provided by an embodiment of the present application.
  • Figure 8 is a schematic diagram of another service health status display interface provided by an embodiment of the present application.
  • Figure 9 is a schematic structural diagram of a service health status display device provided by an embodiment of the present application.
  • Figure 10 is a schematic structural diagram of a computing device provided by an embodiment of the present application.
  • the term "and/or" is only an association relationship describing associated objects, indicating that there can be three relationships.
  • a and/or B can mean: A alone exists, and A alone exists. There is B, and there are three situations A and B at the same time.
  • the term "plurality" means two or more.
  • multiple systems refer to two or more systems
  • multiple terminals refer to two or more terminals.
  • first and second are only used for descriptive purposes and cannot be understood as indicating or implying relative importance or implicitly indicating the indicated technical features. Therefore, features defined as “first” and “second” may explicitly or implicitly include one or more of these features.
  • the terms “including,” “includes,” “having,” and variations thereof all mean “including but not limited to,” unless otherwise specifically emphasized.
  • FIG 1 is an architectural diagram of a business system involved in the embodiment of this application.
  • the business system includes the following network devices: a management platform 110 and several host devices 120.
  • the management platform 110 may be a single computing device, or it may be a service cluster composed of multiple computing devices, or it may be a cloud computing center, or it may be a hyper terminal.
  • the computing device involved in this solution can be used to provide cloud services, and it can establish communication connections with several host devices to provide computing functions and/or storage functions for the host devices.
  • the management platform involved in the embodiments of this application may be a hardware device, or may be embedded in a virtualized environment.
  • the management platform involved in this solution may be a virtual machine executed on a hardware device including one or more other virtual machines.
  • the host device 120 may be a physical host or a virtual host.
  • the management platform 110 and several host devices 120 form a business system.
  • the business system provides several different microservices to the outside world, and each microservice is run by one or more host devices 120 .
  • the management device 110 can collect indicator data of microservices run by the host device 120 .
  • the management platform 110 monitors the health of the microservices based on the collected indicator data of the microservices of various businesses, so that the operation and maintenance staff of the microservices (for convenience of description, called users) determine the microservices through the management platform.
  • the health of the microservice, and based on the health of the microservice locate the time when the microservice fails.
  • the display method of service health status in related technologies only represents the health status of the service at the time of query, and the health status of the service at each time point in a certain period of time in history has an impact on the health status of the service at the time of query.
  • only displays the health status of the service at the time of query without considering the impact of the health status of the service at each time point in the historical time period, and cannot reflect the changes in the health status of the service during the entire time period queried by the user, which does not satisfy Users’ needs for problem overview and further fault location. Therefore, the display method of the health status of the service in the related technology cannot accurately represent the health of the service.
  • the priorities of different health states are determined according to the impact of the health state of the service on the user, and the priorities of the health states are sorted, with the health states with higher priorities being used as representatives, and the identifiers are displayed according to the corresponding color identifiers of the health states.
  • the priority order of the health states is: abnormal state > lossy state > normal state.
  • the identification color corresponding to the abnormal state is red
  • the identification color corresponding to the lossy state is yellow
  • the identification color corresponding to the normal state is green.
  • the abnormal state Since the priority of the abnormal state is higher than the normal priority, the abnormal state is used as the representative state within the query time, and the final identification color is red.
  • the display method of the health status of the above-mentioned services did not consider the frequency of occurrence of the health state in the past 24 hours but only considered the priority. In fact, the health status of other time points was omitted, and essentially a large amount of original information was lost. The health status of the omitted time points did not contribute to the representation of the final health state. From a practical point of view, When the user checks the topology, he finds that a certain service is marked in red. However, when locating the abnormal state, he finds that the abnormal state only occurred for a moment and then quickly returned to normal. It can be seen that it is obviously unreasonable to display the service mark in red.
  • the related technology counts the frequency of occurrence of different health states of the service within the query time range, and uses the health state with the highest frequency as the representative state, and displays the color identifier corresponding to the health state with the highest frequency.
  • the identifier color corresponding to the abnormal state is red
  • the identifier color corresponding to the lossy state is yellow
  • the identifier color corresponding to the normal state is green.
  • the normal state will be used as a 24-hour state representative, and the identifier of the service will be displayed as a green identifier.
  • the above method causes abnormal states that do not have a frequency advantage to be directly ignored, and there is a risk of underreporting abnormal states.
  • embodiments of the present application provide a method, device, equipment and storage medium for displaying the service health status, which can display the status identification of the service based on the impact of the frequency of the original health status appearing within the query time range on the final health status, thereby It can improve the accuracy of service health performance.
  • FIG 2 is a schematic flowchart of a service indicator data collection method provided by an embodiment of the present application.
  • the service indicator data collection method provided by the embodiment of this application is applied to the management system shown in Figure 1.
  • the service indicator data collection method provided by the embodiment of the present application includes S201-S203.
  • S201 The host device collects all indicator data of services run by the host device.
  • the host device When the host device is running the service, it can generate a running log of the service.
  • the running log records all the indicator data of the host device when running the service.
  • Hosting devices can extract service metric data from operational logs.
  • the host device can receive an indicator collection instruction sent by the management platform, and the host device responds to the indicator collection instruction and extracts the indicator data of the service from the operation log.
  • all indicator data of the service can be collected through k8s indicators.
  • the indicator data of the service carries a timestamp, and the timestamp is used to indicate the generation time of the indicator data.
  • S202 The host device sends all indicator data of the service to the management platform.
  • the host device communicates with the management platform to enable data transmission.
  • the host device has multiple ways of sending service indicator data to the management platform.
  • the management device can use Prometheus' indicator acquisition method to collect service indicator data.
  • a client can be installed on the host device and linked with the server in the management platform to collect service indicator data.
  • the host device can actively send all indicator data of the service to the management platform.
  • the host device can respond to the indicator collection instruction, collect all indicator data of the service, and send all indicator data of the service to the management platform.
  • the host device can passively send all indicator data of the service to the management platform. In order to save the storage resources of the host device, the host device can actively send all the indicator data of the service to the management platform after extracting all the indicator data of the service.
  • the indicator data of the service can be collected according to a certain collection step length.
  • the collection step length is configured by the user through the management platform, and the management platform can send the collection step length to the host device, so that the host device collects all indicator data of the service according to the collection step length.
  • S203 Management platform storage service indicator data.
  • the management platform can store the indicator data in time sequence and store the indicator data as time series data to facilitate subsequent retrieval in time order.
  • Figure 3 is a schematic flowchart of a method for displaying service health status provided by an embodiment of the present application.
  • the method for displaying service health status provided by the embodiment of the present application is applied to the management system shown in Figure 1 .
  • the method for displaying service health status provided by the embodiment of the present application includes S301-S304.
  • S301 The management platform determines a query time range for data query.
  • the query time range refers to the time range of the data that the user needs to query, including the start time and deadline of the data query. For example, if the query time range is the past 24 hours and the current time is 6 pm on June 15, 2022, then the query time range is from 6 pm on June 14, 2022 to 6 pm on June 15, 2022.
  • the management device includes a display, and the display is used to display a human-computer interaction interface for data query.
  • the user can input the query time range of the data query through the human-computer interaction interface of the data query.
  • the management device can determine the query time range of the data query after receiving the data query instruction.
  • the user can input data query instructions through a human-computer interaction interface for data query.
  • the human-computer interaction interface 40 of the display displays a data query window 41
  • the data query window 41 displays “data query”, and displays a “confirm” logo 42 and a “cancel” logo 43 .
  • the management platform can generate a data query instruction.
  • the query time range may be determined according to the query duration pre-configured by the management platform. For example, the management platform defaults to the query duration of the past 24 hours. After receiving the data query instruction, the management platform can calculate the query time range according to the current time and the query duration.
  • the management platform can obtain the query time range through a human-computer interaction interface for data query. Users can enter the query time range through the human-computer interaction interface.
  • the management platform can configure multiple query durations. For example, as shown in Figure 5, when the user clicks the "Confirm" logo 52, the human-computer interaction interface 50 displays the query duration window 51.
  • the query duration Window 51 displays multiple query durations, the past 24 hours, the past 72 hours, and the past week. And the query duration confirmation mark 53 is displayed, and the user clicks on it.
  • the management platform can display various query durations through the human-computer interaction interface of the data query table, and users can select the query duration through the human-computer interaction interface.
  • the management platform determines the query time range of data query based on the query duration selected by the user through the human-computer interaction interface and the current time.
  • S302 The management platform determines the indicator data of the service within the query time range.
  • Services can be services in a variety of business scenarios, such as services in mobile phone payment business scenarios, services in book query scenarios, services in purchase of items scenarios, etc.
  • a service can be a business service or multiple microservices that make up a business service.
  • the indicator data of the service can reflect the health status of the service, so that the management platform can monitor the health of the service based on the indicator data of the service.
  • the management platform stores all indicator data of the service, and the management platform stores all indicator data of the service in chronological order.
  • the management platform can retrieve the indicator data within the query time range, that is, the management platform samples all indicator data of the service to determine the indicator data of the service within the query time range.
  • the metric data includes the service's raw health status and timestamp. The management platform can obtain the indicator data served within the query time range based on the timestamp in the indicator data.
  • the management platform samples the indicator data of services within the query time range and performs health monitoring.
  • all service indicator data can be sampled according to the query step size to obtain the service indicator data within the query time range.
  • the service indicator data refers to the data sampled from all service indicator data.
  • the query step size may be input by the user through a human-computer interaction interface for data query.
  • the human-computer interaction interface 60 of the data query can display the query step input box 61 and the query step unit 62 (such as minutes, seconds, hours).
  • the user can confirm the data query through the "confirm" mark 63.
  • the management platform generates data query instructions based on the query step size input by the user. In other words, the data query instructions include the query step size.
  • the collection step size is equal to the query step size
  • all indicator data served within the query time range are accurately presented with no false negatives.
  • the query time range is large, the amount of data will be very large, causing the interface to slow down.
  • the collection step size is smaller than the query step size, the advantage is that the amount of data can be controlled.
  • the disadvantage is that the captured original sampling points may be ignored, resulting in missing data.
  • the queried sampling points are abnormal and happen to be ignored, the exceptions cannot be accurately reported, resulting in poor user experience.
  • the management platform can adaptively determine the query step size.
  • the management platform determines the query step length according to the collection step length, a preset time length threshold and a query time range.
  • the time period that users can query is not infinite. Therefore, you can limit the upper limit of query points by configuring the time length threshold.
  • the upper limit of the number of query points determines the resolution. For example, if the user wants to report an abnormality of one point in red, if the upper limit of the number of query points is 600, then 1/600 will be used as the minimum resolution when calculating the weight (specific details See subsequent instructions).
  • the query step size is equal to the collection step size.
  • the query step satisfies the following formula (1):
  • querytime represents the query time range
  • T represents the preset time length threshold
  • scrape_interval represents the collection step. It should be noted that the units of all parameters in the formula are the same. For example, all parameters in the formula are in minutes. The specific parameters can be set as needed and are not limited here.
  • S303 Determine, based on the indicator data of the service within the query time range, the frequency at which each of the multiple original health states of the service occurs within the query time range.
  • the raw health status of a service is used to indicate the status of the service while it is running.
  • Users can customize the division and number of original health states according to the monitored objects and monitoring needs.
  • the original health state of a service can be divided into lossy state, abnormal state and normal state.
  • the lossy state means that there is an unhealthy state during the running of the service, but the service can run normally and the user cannot detect it;
  • the abnormal state means that there is an unhealthy state during the running of the service and the service cannot be provided normally, which has a great impact on the user.
  • the original health status of a service may be different at different moments within the query time range. Statistically analyze the indicator data of the service at different times within the query time range to determine the original health status of the service at different times. Then, the statistical service counts the frequency of occurrence of various original health states within the query time range.
  • S304 Display the status identifier of the service according to the corresponding weights of the multiple original health states of the service and the frequency of occurrence of the multiple original health states within the query time, so that the user can use the service according to the
  • the status identifier of the service is used to locate the fault.
  • the status identifier is used to indicate the final health state of the service.
  • the weight is used to represent the degree of impact of the frequency of the original health state on the final health state.
  • the final health status can be divided by users according to their own needs. Since the frequency of occurrence of different original health states has different effects on different users, the user sets the final health state according to the frequency of occurrence of different original health states.
  • the final health status that users care about is the following four: the original health status of the service is all normal within the query time range (for convenience of description, it is expressed as Healthy' below), the original health status of the service within the query time range A lossy state has occurred (for convenience of description, it is represented by Degraded' below), the original health status of the service has an abnormal state within the query time range (for convenience of description, it is represented by Failure' below), and the frequency of occurrence of the abnormal state No more than 10%, the original health status of the service has an abnormal state within the query time range (for convenience of description, it is expressed as "Failure” below) and the frequency of abnormal state is higher than 10%.
  • the weight of the original health state is used to represent the degree of influence of the frequency of the original health state within the query time range on the final health state.
  • the management platform calculates the status score of the service within the query time based on the weight of the original health status and the frequency of the original health status within the query time range. Displays the status ID corresponding to the status score of the service.
  • tube represents the original health state of the service
  • F(s i ) represents the frequency of the i-th original health state within the query time range
  • W(s i ) represents the weight of the i-th original health state.
  • i is a positive integer.
  • the management platform stores the correspondence between the status identifier and the status score range.
  • status identifier 1 is used to indicate that the original health status of the service is all normal (Healthy) within the query time range
  • Status ID 2 is used to indicate that the original health status of the service has become damaged (Degraded) within the query time range.
  • the status score range corresponding to Status ID 2 is 1 ⁇ score ⁇ 10.
  • Status ID 3 is used to indicate that an abnormal state (Failure) occurred in the original health state of the service within the query time range and the frequency of the abnormal state is not higher than 10%
  • the status score range corresponding to status identifier 3 is 10 ⁇ score ⁇ 1440.
  • Status ID 4 is used to indicate that the original health status of the service has an abnormal state (Failure) within the query time range and the frequency of the abnormal status is higher than 10%.
  • the score range corresponding to Status ID 4 is 1440 ⁇ score.
  • the user selects a query time range of 24 hours, that is, 1440 minutes: one minute has an abnormal state (Failure), and the rest are normal states (Healthy).
  • the user selects a query time range of 1 hour, that is, 60 minutes.
  • the status identifier can be a color identifier or a shape identifier. No limitation is made here.
  • the weight of the original health state and the state score range can be configured by the user as needed.
  • the management platform can calculate the corresponding weights of multiple original health states through a weight determination model. Specifically, according to the preconfigured weight determination model and the preset time length threshold, the respective corresponding weights and state score ranges of the multiple original health states are determined.
  • the weight determination model is used to indicate the correspondence between the frequency of the original health state of the service and the final health state of the service.
  • the weight determination model can be viewed as a multi-variable set of inequalities. Users can configure the corresponding relationship between the status score range and the frequency range of the original health status according to their own needs.
  • the state score range has 4 segments, namely (0, S1], (S1, S2], (S2, S3], (S3, ⁇ ). Among them, the larger the score, the more dangerous the state.
  • the time length threshold is 24 hours Determine the minimum resolution, that is, the minimum resolution is 1/1440.
  • the correspondence between the frequency of the original health state of the service and the final health state of the service is shown in Table 1:
  • W 1 is the weight corresponding to the normal state (Healthy)
  • W 2 is the weight corresponding to the lossy state (Degraded)
  • W 3 is the weight corresponding to the abnormal state (Failure).
  • the state score score is strictly greater than 10.
  • the state score score is greater than 1.
  • a business may include multiple services.
  • the book purchasing business includes services: book title search service 11, book display service 12, book evaluation service 13, and order payment service 14.
  • the correlation between the book title search service 11, the book display service 12, the book evaluation service 13, and the order payment service 14 is shown in Figure 7.
  • book name search service 11, book display service 12 and book evaluation service 13 are all in normal status within 24 hours
  • the status identifiers of book name search service 11, book display service 12 and book evaluation service 13 are all triangular identifiers.
  • the order payment service 14 is in an abnormal state within 24 hours
  • the status mark of the order payment service 14 is a square mark.
  • the management platform can display the final health status of multiple services under the same business through a human-computer interaction interface. Furthermore, the management platform can display the final health status of respective services under multiple businesses.
  • users can view the original health status of the service at various moments through the status identification. Specifically, a user's operation on the status identification of the target service is received; according to the operation, the original health status of the target service at multiple moments within the query time range is displayed.
  • the status identifiers of the book name search service 11, the book display service 12 and the book purchase service 13 are all triangle identifiers, while the status identifier of the order payment service 14 is a square identifier.
  • the user can By operating the status identifier of the order payment service 14 through the human-computer interaction interface, the management platform displays the original health status of the order payment service 14 at each sampling moment in the past 24 hours.
  • embodiments of the present application provide a display device for service health status.
  • FIG. 9 is a schematic structural diagram of a service health status display device provided by an embodiment of the present application.
  • the service health status display device provided by the embodiment of the present application can be applied to the management platform as shown in Figure 1.
  • the display device for service health status provided by the embodiment of the present application includes a first determination module 901, a second determination module 902, a third determination module 903 and a display module 904.
  • the first determination module 901 is used to determine the query time range of data query
  • the second determination module 902 is used to determine the indicator data of the service within the query time range.
  • the indicator data of the service is used to reflect the original health status of the service;
  • the third determination module 903 is configured to determine the frequency of each of the multiple original health states of the service within the query time range based on the indicator data of the service within the query time range;
  • the display module 904 is configured to display the status identifier of the service according to the corresponding weights of the multiple original health states of the service and the frequency of occurrence of the service in the multiple original health states within the query time, so as to It is used for users to locate faults based on the status identifier of the service.
  • the status identifier is used to indicate the final health status of the service.
  • the weight is used to represent the degree of impact of the frequency of the original health status on the final health status.
  • the embodiment of this application displays the status identifier of the service by combining the weight of the original health state and the frequency of the original health state within the query time, so that the status identifier of the service includes the impact of the frequency of the original health state on the final health state, improving Indicates the accuracy of the health status of the service, thereby improving the accuracy of service fault location.
  • the service health status display device also includes:
  • the fourth determination module is used to determine the corresponding weights of the multiple original health states according to the preconfigured weight determination model and the preset time length threshold;
  • the weight determination model is used to indicate the corresponding relationship between the frequency of multiple original health states of the service and the final health state of the service.
  • the weight is determined by the weight determination model, so that the weight of the original health state includes the influence of the frequency of occurrence of the original health state on the final health state.
  • the fourth determination module is also used to determine the state score ranges corresponding to the multiple final health states according to the respective weights and the preset time length threshold of the multiple original health states. ;
  • a calculation module configured to calculate the status score corresponding to the service based on the corresponding weights of the multiple original health states of the service and the frequency of each of the multiple original health states appearing within the query time;
  • the display module is configured to display the status identification of the service according to the status score corresponding to the service and the score ranges corresponding to the multiple final health states.
  • the first determination module is also used to determine the query step size of the data query
  • the second determination module is configured to collect indicator data of the service from all indicator data of the service according to the query step size and the query time range.
  • the data amount of the service indicator data is not greater than the target amount
  • the target number is determined according to a preconfigured collection step, and the collection step is used to indicate the length of the time interval between the indicator data of the service.
  • the first determination module is used to determine the collection step, and the collection step is used to indicate the length of the time interval between all indicator data of the service; according to the collection step, the preset Set the time length threshold and the query time range to determine the query step size.
  • the query step size is determined adaptively through the collection step size, thereby avoiding missed reports of service failures, thus balancing computing resources and the accuracy of health monitoring.
  • the status identification includes a color identification or a shape identification.
  • the device further includes:
  • the receiving module is used to receive the user's operation on the status identification of the target service
  • the display module is also configured to display the original health status of the target service at multiple moments within the query time range according to the operation.
  • the device embodiment described in Figure 9 is only illustrative.
  • the division of modules is only a logical function division. In actual implementation, there may be other division methods.
  • multiple modules or components may be combined or can be integrated into another system, or some features can be ignored, or not implemented.
  • Each functional module in each embodiment of the present application can be integrated into one processing module, or each module can exist physically alone, or two or more modules can be integrated into one module.
  • the above-mentioned modules in Figure 9 can be implemented in the form of hardware, software functional units, or a combination of software and hardware.
  • Figure 10 shows a schematic structural diagram of a computing device provided by an embodiment of the present application.
  • the computing device may be a server or the like.
  • the management platform in Figure 1 includes at least one computing device.
  • the computing device includes: a processor 1001, a memory 1002, and a communication interface 1003.
  • the processor 1001, the memory 1002 and the communication interface 1003 are connected through a bus 1004.
  • Memory 1002 includes operating system and program code modules.
  • Memory 1002 may include bulk storage for data or instructions.
  • the memory 1002 may include a hard disk drive (HDD), a floppy disk drive, flash memory, an optical disk, a magneto-optical disk, a magnetic tape, or a Universal Serial Bus (USB) drive or two or more A combination of many of the above.
  • Memory 1002 may include removable or non-removable (or fixed) media, where appropriate.
  • the memory 1002 may be internal or external to the integrated gateway disaster recovery device.
  • memory 1002 is non-volatile solid-state memory.
  • the memory may include read-only memory (ROM), random access memory (RAM), magnetic disk storage media devices, optical storage media devices, flash memory devices, electrical, optical or other physical/tangible memory storage devices.
  • ROM read-only memory
  • RAM random access memory
  • magnetic disk storage media devices magnetic disk storage media devices
  • optical storage media devices flash memory devices
  • electrical, optical or other physical/tangible memory storage devices typically, the memory includes one or more tangible (non-transitory) computer-readable storage media (e.g., memory devices) encoded with software including computer-executable instructions, and when the software is executed (e.g., by one or more processors), it is operable to perform the operations described with reference to the methods in the present application.
  • the processor 1001 reads and executes the computer program instructions stored in the memory 1002 to implement any of the service health status display methods in the above embodiments.
  • the electronic device may also include a communication interface 1003 and a bus 1010. Among them, as shown in Figure 10, the processor 1001, the memory 1002, and the communication interface 1003 are connected through the bus 1010 and complete communication with each other.
  • the communication interface 1003 is mainly used to implement communication between modules, devices, units and/or equipment in the embodiments of this application.
  • Bus 1010 includes hardware, software, or both, coupling components of an electronic device to one another.
  • the bus may include Accelerated Graphics Port (AGP) or other graphics bus, Enhanced Industry Standard Architecture (EISA) bus, Front Side Bus (FSB), HyperTransport (HT) interconnect, Industry Standard Architecture (ISA) Bus, Infinite Bandwidth Interconnect, Low Pin Count (LPC) Bus, Memory Bus, Micro Channel Architecture (MCA) Bus, Peripheral Component Interconnect (PCI) Bus, PCI-Express (PCI-X) Bus, Serial Advanced Technology Attachment (SATA) bus, Video Electronics Standards Association Local (VLB) bus or other suitable bus or a combination of two or more of these.
  • bus 1010 may include one or more buses.
  • processors in the embodiments of the present application can be a central processing unit (CPU), or other general-purpose processor, digital signal processor (DSP), or application-specific integrated circuit (application specific integrated circuit, ASIC), field programmable gate array (field programmable gate array, FPGA) or other programmable logic devices, transistor logic devices, hardware components or any combination thereof.
  • a general-purpose processor can be a microprocessor or any conventional processor.
  • the method steps in the embodiments of the present application can be implemented by hardware or by a processor executing software instructions.
  • Software instructions can be composed of corresponding software modules, and software modules can be stored in random access memory (random access memory, RAM), flash memory, read-only memory (read-only memory, ROM), programmable read-only memory (programmable rom) , PROM), erasable programmable read-only memory (erasable PROM, EPROM), electrically erasable programmable read-only memory (electrically EPROM, EEPROM), register, hard disk, mobile hard disk, CD-ROM or other well-known in the art any other form of storage media.
  • An exemplary storage medium is coupled to the processor such that the processor can read information from the storage medium and write information to the storage medium.
  • the storage medium can also be an integral part of the processor.
  • the processor and storage media may be located in an ASIC.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device.
  • the computer instructions may be stored in or transmitted over a computer-readable storage medium.
  • the computer instructions may be transmitted from one website, computer, server or data center to another website through wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) means. , computer, server or data center for transmission.
  • the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that contains one or more available media integrated.
  • the available media may be magnetic media (eg, floppy disk, hard disk, magnetic tape), optical media (eg, DVD), or semiconductor media (eg, solid state disk (SSD)), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
  • Debugging And Monitoring (AREA)

Abstract

La présente invention concerne un procédé et un appareil d'affichage d'état de santé de service, ainsi qu'un dispositif et un support de stockage. Selon des modes de réalisation, le procédé consiste à : déterminer un intervalle de temps d'interrogation d'une interrogation de données ; déterminer des données d'indicateur d'un service dans l'intervalle de temps d'interrogation, les données d'indicateur du service étant utilisées pour refléter des états de santé d'origine du service ; déterminer, en fonction des données d'indicateur du service dans l'intervalle de temps d'interrogation, la fréquence d'occurrence de chaque état parmi de multiples états de santé d'origine du service dans l'intervalle de temps d'interrogation ; et afficher un identifiant d'état du service en fonction d'un poids correspondant à chacun des multiples états de santé d'origine du service et des fréquences d'occurrence des multiples états de santé d'origine du service dans le temps d'interrogation, l'identifiant d'état étant utilisé pour indiquer un état de santé final du service, et le poids étant utilisé pour représenter le degré d'influence de la fréquence d'occurrence de l'état de santé d'origine sur l'état de santé final. De cette manière, au moyen de la solution technique décrite par les modes de réalisation de la présente invention, la précision de représentation de l'état de santé d'un service peut être améliorée, ce qui permet d'améliorer la précision de localisation de défauts de service.
PCT/CN2023/104819 2022-09-19 2023-06-30 Procédé et appareil d'affichage d'état de santé de service, et dispositif et support de stockage WO2024060776A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211138376.X 2022-09-19
CN202211138376.XA CN117762747A (zh) 2022-09-19 2022-09-19 服务健康状态的显示方法、装置、设备及存储介质

Publications (1)

Publication Number Publication Date
WO2024060776A1 true WO2024060776A1 (fr) 2024-03-28

Family

ID=90318575

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/104819 WO2024060776A1 (fr) 2022-09-19 2023-06-30 Procédé et appareil d'affichage d'état de santé de service, et dispositif et support de stockage

Country Status (2)

Country Link
CN (1) CN117762747A (fr)
WO (1) WO2024060776A1 (fr)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6959265B1 (en) * 2003-10-07 2005-10-25 Serden Technologies, Inc. User-centric measurement of quality of service in a computer network
CN102982231A (zh) * 2012-10-30 2013-03-20 北京大学 软件可信度的定量计算方法
US8819704B1 (en) * 2011-08-05 2014-08-26 Google Inc. Personalized availability characterization of online application services
CN113407597A (zh) * 2021-06-28 2021-09-17 阿特拉斯·科普柯(无锡)压缩机有限公司 异常预警方法与装置、存储介质、计算机设备

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6959265B1 (en) * 2003-10-07 2005-10-25 Serden Technologies, Inc. User-centric measurement of quality of service in a computer network
US8819704B1 (en) * 2011-08-05 2014-08-26 Google Inc. Personalized availability characterization of online application services
CN102982231A (zh) * 2012-10-30 2013-03-20 北京大学 软件可信度的定量计算方法
CN113407597A (zh) * 2021-06-28 2021-09-17 阿特拉斯·科普柯(无锡)压缩机有限公司 异常预警方法与装置、存储介质、计算机设备

Also Published As

Publication number Publication date
CN117762747A (zh) 2024-03-26

Similar Documents

Publication Publication Date Title
WO2021147481A1 (fr) Procédé et appareil de surveillance, et dispositif électronique
CN111459782B (zh) 监控业务系统的方法、装置、云平台系统和服务器
CN110674009B (zh) 应用服务器性能监测方法、装置、存储介质及电子设备
CN107704387B (zh) 用于系统预警的方法、装置、电子设备及计算机可读介质
JP2018129023A (ja) インダストリアル・インターネットオペレーションシステムに基づく安全性の検査方法と装置
CN111147306B (zh) 一种物联网设备的故障分析方法、装置以及物联网平台
CN115396289A (zh) 一种故障告警确定方法、装置、电子设备及存储介质
CN113485862B (zh) 业务故障的管理方法、装置、电子设备及存储介质
WO2022088803A1 (fr) Procédé et appareil d'analyse d'informations système basés sur un environnement cloud, dispositif électronique et support
WO2024060776A1 (fr) Procédé et appareil d'affichage d'état de santé de service, et dispositif et support de stockage
CN111654405B (zh) 通信链路的故障节点方法、装置、设备及存储介质
CN114513334B (zh) 风险管理方法和风险管理装置
US20180123866A1 (en) Method and apparatus for determining event level of monitoring result
CA2793952C (fr) Extraction de donnees concernant des instruments de diagnostic clinique
CN114595765A (zh) 数据处理方法、装置、电子设备及存储介质
WO2021073413A1 (fr) Procédé et appareil d'envoi de paramètres de performance de système, dispositif de gestion et support d'informations
WO2021184588A1 (fr) Procédé et dispositif d'optimisation de grappes, serveur et support
TWI822474B (zh) 用於企業專網之行動網路管理系統及方法
CN116723111B (zh) 业务请求的处理方法、系统及电子设备
US11941284B2 (en) Management system, QoS violation detection method, and QoS violation detection program
CN113225228B (zh) 数据处理方法及装置
CN117056110B (zh) 一种系统故障排查方法、装置、电子设备及存储介质
JP7245211B2 (ja) 異常検知装置、異常検知方法、およびプログラム
CN114697319B (zh) 一种公有云的租户业务管理方法及装置
WO2023173766A1 (fr) Procédé et appareil d'acquisition de trafic de port, support de stockage, et dispositif électronique

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23867071

Country of ref document: EP

Kind code of ref document: A1