WO2022021420A1 - Root cause localization method and apparatus, and electronic device - Google Patents

Root cause localization method and apparatus, and electronic device Download PDF

Info

Publication number
WO2022021420A1
WO2022021420A1 PCT/CN2020/106404 CN2020106404W WO2022021420A1 WO 2022021420 A1 WO2022021420 A1 WO 2022021420A1 CN 2020106404 W CN2020106404 W CN 2020106404W WO 2022021420 A1 WO2022021420 A1 WO 2022021420A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
child
parent node
performance index
parent
Prior art date
Application number
PCT/CN2020/106404
Other languages
French (fr)
Chinese (zh)
Inventor
黄晓光
张伟
岳晓贫
兰宇
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2020/106404 priority Critical patent/WO2022021420A1/en
Publication of WO2022021420A1 publication Critical patent/WO2022021420A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance

Definitions

  • the present application relates to the technical field of fault location, and in particular, to a root cause location method and device, and electronic equipment.
  • monitoring the performance of the wireless communication system is an important part, which is mainly used for the location of performance faults and the root cause of performance faults in the wireless communication system.
  • the operation and maintenance personnel have set many indicators to evaluate the performance of the wireless communication system, and to locate the performance fault and the root cause of the performance fault of the wireless communication system.
  • these indicators may include key performance indicators (Key Performance Indicator, KPI) in terms of access, retention, and mobility, as well as service experience indicators such as rate and delay.
  • KPI Key Performance Indicator
  • the performance fault location is to locate the abnormal value of the above indicators.
  • Root cause location of performance faults refers to analyzing the causes of abnormal values of the above indicators. After locating the performance fault and the root cause of the performance fault, the operation and maintenance personnel can eliminate the performance fault to ensure the user's wireless communication experience.
  • the fault tree and the specific values of the above indicators can be used to locate performance faults, and the indicators with abnormal values (abbreviated as abnormal indicators) can be determined; Whether the specific value of the index is greater than the threshold value corresponding to the index, if the specific value of the index corresponding to a child node is greater than the threshold value corresponding to the index, it can be determined that the above-mentioned performance failure root cause is that the index corresponding to the child node is abnormal.
  • the performance fault root cause location is performed by comparing the threshold value, which may affect the accuracy of the performance fault root cause location due to the inaccurate setting of the threshold value.
  • the indicators of each sub-node may have different degrees of influence on the abnormal indicators; it is relatively arbitrary to determine whether the indicators of each sub-node have an impact on the abnormal indicators by simply using the threshold judgment method, which will affect the accuracy of performance fault root cause location.
  • the present application provides a root cause locating method, device, and electronic device, which can improve the accuracy of root cause locating.
  • the present application provides a root cause locating method
  • the root cause locating method includes: acquiring a tree-shaped network of a first performance index; The sampling value of the second performance index; according to the sampling value of the first performance index and the sampling values of a plurality of second performance indicators, calculate the influence factor of each child node in the tree network on the parent node; according to each child node in the tree network For the influence factor of the parent node, the second performance index in the tree network that causes the abnormality of the first performance index is determined.
  • the tree-shaped network includes a first performance indicator and a plurality of second performance indicators that affect the first performance indicator, the first performance indicator corresponds to the root node of the tree-shaped network, and the plurality of second performance indicators corresponds to one of the root nodes in the tree-shaped network. other nodes.
  • the influence factor of each child node on the parent node is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node.
  • the influence factor of each child node in the tree network on the parent node can be calculated according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators.
  • the influence factor of a child node on the parent node is used to represent the influence degree of the performance index of the child node on the performance index of the parent node.
  • the above-mentioned multiple second performance indicators correspond to other nodes in the tree-shaped network except the root node; therefore, it can be concluded that the node corresponding to the above-mentioned second performance index can be is: a child node of the root node corresponding to the first performance index or a child node of the child node of the root node corresponding to the first performance index, and the like.
  • the influence factor of each child node in the tree network on the parent node can reflect the degree of influence of the above-mentioned multiple second performance indicators on the first performance indicator. Therefore, according to the influence factor of each child node in the above-mentioned tree network on the parent node, the second performance index in the tree network that causes the abnormality of the first performance index can be determined.
  • the method provided by the present application does not require manual participation, and can determine, through calculation, the sampling value of the first performance index obtained by periodic sampling and the sampling values of a plurality of second performance indicators, which causes the first performance index in the tree network. Abnormal secondary performance indicator. In this way, compared with manual participation in root cause localization, the efficiency and accuracy of root cause localization can be improved, and the cost can also be reduced.
  • the preset positioning period includes Q sampling moments, where Q ⁇ 2, and Q is an integer.
  • the above-mentioned acquisition of the sampling value of the first performance index and the sampling values of a plurality of second performance indicators obtained by sampling in the preset positioning period includes: at the qth sampling time of the preset positioning period, sampling to obtain the first performance index of the first performance index.
  • the execution frequency of performance fault localization can be controlled by presetting the size of the localization period, and the number of first performance indicators and the number of multiple second performance indicators used for performance fault localization can be controlled.
  • calculating the influence factor of each child node on the parent node in the tree network according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators includes: first, for the tree Each parent node in the shape network executes: the qth sampled value of the performance index of all child nodes of the first parent node is used as the input of the impact factor estimation model of the first parent node, and the impact of the first parent node is run.
  • Factor estimation model output the qth influence factor of each child node of the first parent node to the first parent node; then, for each child node of the first parent node: calculate the effect of the first child node on the first parent node.
  • the average value of the Q influence factors is used to obtain the influence factor of the first child node on the first parent node; wherein, the first child node is any child node of the first parent node.
  • the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different; the impact factor estimation model of the first parent node has the performance of all child nodes from the first parent node From the sampling value of the index, the ability to extract the influence factor of each child node of the first parent node on the first parent node.
  • q takes values in ⁇ 1, . value and the qth sampling value of each of the multiple second performance indicators, calculate the qth impact factor of each child node of each parent node on the parent node, that is, use the sampling value of the performance indicator collected at any sampling moment, Calculate the influence factor of the child node on the parent node at this adoption moment. Since the values of the performance indicators (including the first performance indicator and a plurality of second performance indicators) collected at different sampling times are different, the degree of influence between the performance indicators determined by the performance indicators with different values may also be different.
  • the influence factor of each sampling time is calculated by using the sampling values collected at each adopting time, and then the influence factors of all sampling time are averaged to determine the unique influence factor of each child node on the parent node.
  • the method of calculating the influence factors of each sampling time separately and then averaging, compared with using the sampling values collected at all sampling times to calculate the unique influence factor of each child node on the parent node the accuracy of the obtained influence factor is higher. .
  • the qth impact factor of all child nodes of each parent node on the parent node can be obtained at one time, thereby improving the efficiency of root cause location.
  • calculating the influence factor of each child node on the parent node in the tree network according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators includes: first, for the tree Each parent node in the shape network executes: the qth sampled value of the performance index of all child nodes of the first parent node is used as the input of the impact factor estimation model of the first parent node, and the impact of the first parent node is run.
  • the factor estimation model outputs the qth impact factor of each child node of the first parent node on the first parent node; then, if the pth sampling time in the Q sampling moments is the mutation moment of the performance index, the first Each child node of the parent node executes: calculate the average value of the first p-1 impact factors of the first child node to the first parent node, and obtain the impact factor of the first child node to the first parent node before mutation; calculate the first child node The average value of the p-th -Q-th impact factor of the node on the first parent node, and the impact factor of the first child node on the first parent node after the mutation is obtained; finally, the first child node before the mutation is calculated.
  • the difference or ratio of the influence factor of , and the influence factor of the first child node to the first parent node after mutation, and the difference or ratio is used as the influence factor of the first child node to the first parent node.
  • the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different.
  • the impact factor estimation model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampled values of performance indicators of all child nodes of the first parent node.
  • p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
  • the abnormality of the first performance index may be a sudden result.
  • the sudden abnormality of the first performance index there is a large mutation in the sampling value of the first performance index and the sampling values of multiple second performance indicators, then the multiple second performance indicators before and after the mutation have a significant impact on the first performance
  • the degree of influence of indicators may vary greatly. That is to say, there is a big difference between the influence factor of the first child node before the mutation on the first parent node and the influence factor of the first child node after the mutation on the first parent node. Therefore, in this design method, all the influencing factors of the first child node before and after mutation to the first parent node are averaged respectively.
  • the difference or ratio of the influence factor of the first child node on the first parent node before and after the mutation is calculated, and the difference or the ratio is used as the influence factor of the first child node on the first parent node. Since the difference or the ratio represents the change of the influence factor of the first child node on the first parent node before and after the mutation, and the greater the change of the influence factor of any child node on the parent node before and after the mutation, the The child node has a greater impact on causing the abnormality of the first performance index; therefore, the difference or the ratio is used for locating the root cause of the abnormality of the first performance index, which can improve the accuracy of locating the root cause of the abnormality of the first performance index.
  • the method before calculating the influence factor of each child node in the tree network on the parent node according to the sampled value of the first performance index and the sampled value of the plurality of second performance indicators, the method further includes: : Use the h-th sample value of the performance index of the first parent node and the h-th sample value of the performance index of all child nodes of the first parent node as training samples, train the preset attention model, and obtain the Impact factor prediction model.
  • h takes values in sequence in ⁇ 1,...,H ⁇ , H ⁇ 100, and H is an integer.
  • the above design method provides an implementation method for obtaining the influence factor prediction model of the first parent node.
  • calculating the influence factor of each child node in the tree network on the parent node according to the sampled value of the first performance index and the sampled values of a plurality of second performance indicators including: for the tree network Perform the following steps for each child node in to obtain the influence factor of each child node on the parent node: according to the Q sampling values of the performance index of the second child node, calculate the first residual of the performance index of the second child node; For the Q sampled values of the performance index of the parent node of the second child node, calculate the second residual of the performance index of the parent node of the second child node; calculate the correlation between the first residual and the second residual, and According to the correlation between the first residual and the second residual, the influence factor of the second child node on the parent node is obtained.
  • the second child node is any child node of the tree network; the first residual is used to represent the fluctuation of the Q sample values of the performance index of the second child node.
  • the second residual is used to represent the fluctuation of the Q sampled values of the performance index of the parent node of the second child node.
  • the first residual of the performance index of the second child node and the second residual of the performance index of the parent node of the second child node are calculated first;
  • the correlation between the first residual and the second residual is determined, and the influence factor of the second child node on the parent node is determined according to the correlation between the first residual and the second residual. Since the residual represents the fluctuation of the Q sampling values of the performance index of any node, if the residual of the performance index of any node has a large correlation with the residual of the performance index of the parent node of any node, then The fluctuation of the Q sampling values representing the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node.
  • the fluctuation of the Q sampling values of the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node, it can be determined that the performance index of any node affects the parent node. The greater the impact of the node's sex index.
  • the influence factor of the second child node on the parent node is determined according to the correlation between the first residual and the second residual, and then the influence factor is used to determine the second performance that causes the abnormality of the first performance index
  • the indicator refers to the degree of influence on the fluctuation of the Q sampling values of the performance index of the second child node on the fluctuation of the Q sampling values of the performance index of the second child node to determine the abnormality of the first performance index.
  • the second performance index can improve the accuracy of locating the root cause of the abnormality of the first performance index.
  • calculating the first residual of the performance index of the second child node according to the Q sampled values of the performance index of the second child node includes: using a time series decomposition algorithm to calculate the first residual from the second child node From the Q sampling values of the performance index of the node, extract the trend component of the performance index of the second child node and the periodic component of the performance index of the second child node; subtract the Q sampling values of the performance index of the second child node from the The trend component and the period component are used to obtain the first residual of the performance index of the second child node.
  • the trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node
  • the periodic component is used to represent the periodic change of the Q sampled values of the performance index of the second child node.
  • the above design method provides an implementation manner of obtaining the first residual of the performance index of the second child node, which can also be used to obtain the second residual of the performance index of the parent node of the second child node.
  • determining the second performance index in the tree network that causes the first performance index to be abnormal according to the influence factors of each child node in the tree network on the parent node, including: The influence factor of each child node on the parent node is calculated, and the influence factor of the leaf node in the tree network to the root node is calculated; the performance indicators of N leaf nodes in the tree network are determined as the second performance that causes the abnormality of the first performance indicator index.
  • the influence factor of N leaf nodes on the root node is greater than the influence factor of other leaf nodes in the tree network on the root node, N ⁇ 1, and N is an integer.
  • the leaf nodes are terminal nodes in the tree network, the root cause of the abnormality of the first performance index exists in all the leaf nodes in the tree network. Therefore, in the above design method, the influence factor of all leaf nodes in the tree network on the root node is directly calculated; and then the performance index of the leaf node with the larger influence factor on the root node is determined as the second factor that causes the abnormality of the first performance index. Performance.
  • the tree network includes a first preset node, the first preset node is not a leaf node, and the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to the preset node. Set the threshold.
  • the influence factor of M nodes on the root node is greater than the influence factor of other nodes in the tree network on the root node, M ⁇ 1, and M is an integer.
  • the influence factor of the child nodes of the first preset node on the first preset node does not exceed the preset threshold, that is, the first preset node.
  • the performance index of the child node of the first preset node has little influence on the performance index of the first preset node, it can be considered that the first preset node is similar to the leaf node, and the root cause of the abnormal first performance index may also exist in the tree network. in the first preset node in .
  • the influence factor of all leaf nodes in the tree network on the root node and the influence factor of the first preset node in the tree network on the root node are calculated; Among the nodes and the first preset node, a node with a larger influence factor on the root node is determined, and its performance index is the second performance index that causes the first performance index to be abnormal. In this way, both the leaf node that may include the root cause that causes the abnormality of the first performance index and the first preset node that may include the root cause that causes the abnormality of the first performance index are judged, and the first performance index that causes the abnormality can be determined more accurately. Abnormal secondary performance indicator.
  • the present application provides a root cause locating device, the device comprising: an acquisition module configured to acquire a tree-shaped network of a first performance index, and obtain a sampling value of the first performance index obtained by sampling within a preset positioning period and sampling values of a plurality of second performance indicators; a calculation module for calculating the influence factor of each child node in the tree network on the parent node according to the sampling values of the first performance index and the sampling values of a plurality of second performance indicators; positioning The module is configured to determine the second performance index in the tree network that causes the abnormality of the first performance index according to the influence factors of each child node in the tree network on the parent node.
  • the tree-shaped network includes a first performance indicator and a plurality of second performance indicators that affect the first performance indicator, the first performance indicator corresponds to the root node of the tree-shaped network, and the plurality of second performance indicators corresponds to one of the root nodes in the tree-shaped network. other nodes.
  • the influence factor of each child node on the parent node is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node.
  • the preset positioning period includes Q sampling moments, where Q ⁇ 2, and Q is an integer.
  • the acquisition module is specifically used to obtain the qth sampling value of the first performance index and the qth sampling value of the plurality of second performance indicators by sampling at the qth sampling time of the preset positioning cycle; wherein, q is in ⁇ 1 , ..., Q ⁇ and take values in turn.
  • the calculation module is specifically used to: perform for each parent node in the tree network: take the qth sampled value of the performance index of all child nodes of the first parent node as the first The input of the influence factor estimation model of the parent node, run the influence factor estimation model of the first parent node, and output the qth influence factor of each child node of the first parent node on the first parent node; Execute for each child node: calculate the average value of Q influence factors of the first child node to the first parent node, and obtain the influence factor of the first child node to the first parent node.
  • the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different.
  • the impact factor estimation model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampled values of performance indicators of all child nodes of the first parent node.
  • the first child node is any child node of the first parent node.
  • the calculation module is specifically used for: first, for each parent node in the tree network, execute: take the qth sampled value of the performance index of all the child nodes of the first parent node as Input the influence factor estimation model of the first parent node, run the influence factor estimation model of the first parent node, and output the qth influence factor of each child node of the first parent node on the first parent node; then, if Q The p-th sampling time in the sampling time is the mutation time of the performance index, which is executed for each child node of the first parent node: calculate the average value of the first p-1 impact factors of the first child node to the first parent node, Obtain the influence factor of the first child node on the first parent node before the mutation; The influence factor of the parent node; finally, calculate the difference or ratio of the influence factor of the first child node to the first parent node before the mutation and the influence factor of the first child node to the first parent node after the mutation, and use the difference or ratio as The
  • the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different.
  • the impact factor estimation model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampled values of performance indicators of all child nodes of the first parent node.
  • p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
  • the device further includes: a model training module, configured to calculate, according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators, the relationship between each child node in the tree network to the parent Before the impact factor of the node, the h-th sample value of the performance index of the first parent node and the h-th sample value of the performance index of all child nodes of the first parent node are used as training samples, and the preset attention model is trained to obtain The impact factor prediction model of the first parent node.
  • h takes values in sequence in ⁇ 1,...,H ⁇ , H ⁇ 100, and H is an integer.
  • the calculation module is specifically configured to perform the following steps for each child node in the tree network, so as to obtain the influence factor of each child node on the parent node: Q according to the performance index of the second child node calculate the first residual of the performance index of the second child node; according to the Q sampling values of the performance index of the parent node of the second child node, calculate the second residual of the performance index of the parent node of the second child node difference; calculate the correlation between the first residual and the second residual, and obtain the influence factor of the second child node on the parent node according to the correlation between the first residual and the second residual.
  • the second child node is any child node of the tree network.
  • the first residual is used to represent the fluctuation of the Q sampled values of the performance index of the second child node.
  • the second residual is used to represent the fluctuation of the Q sampled values of the performance index of the parent node of the second child node.
  • the calculation module is specifically used for: using a time series decomposition algorithm to extract the trend component of the performance index of the second child node and the first sub-node from the Q sampling values of the performance index of the second child node.
  • the periodic component of the performance index of the second child node; the Q sampled values of the performance index of the second child node are subtracted from the trend component and the periodic component to obtain the first residual of the performance index of the second child node.
  • the trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node
  • the periodic component is used to represent the periodic change of the Q sampled values of the performance index of the second child node.
  • the positioning module is specifically used to: calculate the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node in the tree network on the parent node;
  • the performance indicators of the N leaf nodes in the tree network are determined as the second performance indicators that cause the first performance indicator to be abnormal.
  • the influence factor of N leaf nodes on the root node is greater than the influence factor of other leaf nodes in the tree network on the root node, N ⁇ 1, and N is an integer.
  • the tree network includes a first preset node, the first preset node is not a leaf node, and the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to the preset node.
  • the positioning module is specifically used for: calculating the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node in the tree network on the parent node, and calculating the influence factor of the first preset node on the root node;
  • the performance indicators of M nodes in the leaf node and the first preset node are determined as the second performance indicators that cause the first performance indicator to be abnormal.
  • the influence factor of M nodes on the root node is greater than the influence factor of other nodes in the tree network on the root node, M ⁇ 1, and M is an integer.
  • the present application provides an electronic device, the electronic device comprising: a processor and a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions, such that the The electronic device executes the root cause locating method described in the first aspect and any possible design manners thereof.
  • the present application provides a computer-readable storage medium, where computer instructions are stored on the computer-readable storage medium, and when the computer instructions are executed on an electronic device, the electronic device is made to perform the first aspect and any of the methods described above.
  • a possible design approach described in the root cause localization method is made to perform the first aspect and any of the methods described above.
  • the present application provides a computer program product that, when the computer program product is run on an electronic device, causes the electronic device to execute the root cause described in the first aspect and any possible design manner thereof positioning method.
  • FIG. 1 is a flowchart 1 of a root cause locating method provided by an embodiment of the present application
  • FIG. 2 is a schematic structural diagram of a tree network of a wireless communication system according to an embodiment of the present application
  • FIG. 3 is a flow chart 2 of a root cause locating method provided by an embodiment of the present application.
  • FIG. 4 is a flowchart 3 of a root cause locating method provided by an embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of an impact factor prediction model of a parent node provided by an embodiment of the present application.
  • FIG. 6 is a flowchart four of a root cause locating method provided by an embodiment of the present application.
  • FIG. 7 is a flowchart 5 of a root cause locating method provided by an embodiment of the present application.
  • FIG. 8 is a flowchart of a method for calculating the influence factor of a child node on a parent node provided by an embodiment of the present application
  • FIG. 9 is a flowchart 6 of a root cause locating method provided by an embodiment of the present application.
  • FIG. 10 is a schematic structural diagram of a tree network of a first performance index provided by an embodiment of the present application.
  • FIG. 11 is a flowchart seven of a root cause locating method provided by an embodiment of the present application.
  • FIG. 12 is a flowchart of a method for determining a second performance indicator that causes an abnormality of the first performance indicator according to an embodiment of the present application
  • FIG. 13 is a schematic structural diagram of a root cause locating device according to an embodiment of the present application.
  • FIG. 14 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
  • first and second performance indicators are different performance indicators.
  • the operation and maintenance of the device or the system mainly includes fault location and root cause location of the device or system.
  • fault location refers to determining the fault of the device or system
  • fault root cause location refers to determining the cause of the failure of the device or system.
  • the failures of the equipment or the system are mainly divided into two types, namely performance failures and hardware failures.
  • the performance failure refers to the abnormal value of some performance indicators of the device or system; the performance indicators may include the above KPI indicators and the above service experience indicators.
  • Hardware failure refers to an abnormality in the hardware of the device or system; hardware failure may be caused by hardware damage or incorrect operation.
  • the root cause of the fault is mainly located for the performance fault of the device or the system, that is, the cause of the abnormal value of the performance index is determined.
  • the performance index of the numerical anomaly is referred to as the abnormal performance index for short.
  • a hardware failure can also lead to a performance failure; therefore, the cause of the abnormal value of the abnormal performance indicator may be other performance indicators other than the abnormal performance indicator of the device or the system, or a hardware failure.
  • the performance fault location is to specify a performance index (ie, an abnormal performance index) with an abnormal value among all the performance indexes of the device or system.
  • Root cause location of performance faults refers to analyzing the causes of numerical anomalies of abnormal performance indicators.
  • the number of abnormal performance indicators is also large.
  • the number of abnormal performance indicators per day accounts for about 3% of the total number of network elements in the wireless communication system. The number of performance metrics analyzed is greater.
  • the related scheme uses the fault tree of abnormal performance indicators to locate the root cause of abnormal performance indicators.
  • the fault tree is a directed acyclic graph (DAG), which consists of nodes representing performance indicators and directed edges connecting these nodes, and the directed edges between nodes represent the mutual relationship between nodes ( from the parent node to its child node).
  • DAG directed acyclic graph
  • the root node of the fault tree corresponds to the abnormal performance index
  • other nodes of the fault tree except the root node correspond to other performance indicators that have an impact on the abnormal performance index.
  • the performance index of each child node in the fault tree is greater than the threshold value corresponding to the performance index. If the value of the performance index corresponding to a child node is greater than the threshold value corresponding to the performance index, then It can be determined that the performance index of the child node causes the numerical value of the abnormal performance index to be abnormal.
  • the performance indicators of each sub-node may have different degrees of influence on the abnormal performance indicators, and it is rather arbitrary to determine whether the performance indicators of each sub-node have an impact on the abnormal performance indicators by simply using the threshold value judgment method, which may Some performance indicators whose impact on abnormal performance indicators is less than the corresponding threshold value are filtered out, and some performance indicators that have no effect on abnormal performance indicators may also be determined as the root cause of abnormal performance indicators. In this way, the accuracy of root cause location will be affected.
  • Embodiments of the present application provide a root cause locating method, device, and electronic device, and the root cause locating method can improve the accuracy of root cause locating.
  • the root cause locating method can be applied to any scenario that requires root cause locating, for example, a wireless communication system, or other scenarios where the influence relationship between performance indicators can be determined.
  • a root cause location method provided by this embodiment of the present application may include S101-S104.
  • the electronic device obtains a tree-shaped network of a first performance index; wherein, the tree-shaped network includes a first performance index and a plurality of second performance indicators that affect the first performance index, and the first performance index corresponds to a root node of the tree-shaped network, The plurality of second performance indicators correspond to other nodes in the tree network except the root node.
  • the first performance index is a performance index that is abnormal and needs root cause localization.
  • the tree network of the first performance indicator is a directed acyclic graph DAG, which consists of nodes representing performance indicators and directed edges connecting these nodes.
  • the directed edges between nodes represent the influence relationship between nodes (directed by child nodes). its parent node).
  • Any node other than the root node in the tree network corresponds to at least one second performance index, and the performance indexes corresponding to different nodes other than the root node in the tree network are different.
  • a tree network can also be called a fault tree.
  • the first performance indicator may be any one of all performance indicators of the electronic device, any one of all performance indicators of a wireless communication network, or any one of all performance indicators of other electronic devices, and so on.
  • the plurality of second performance indicators may include a plurality of performance indicators that may affect the first performance indicator. The plurality of second performance indicators are different from the first performance indicators.
  • the first performance indicator is the user rate
  • the multiple performance indicators that affect the first performance indicator may include: physical resource block (Physical Resource Block, PRB) utilization that directly affects the first performance indicator, PRB utilization Distribution ratio, and control channel element (control channel element, CCE) utilization rate, etc.; the path loss that indirectly affects the first performance index, etc.
  • PRB Physical Resource Block
  • CCE control channel element
  • the electronic device may receive a tree network corresponding to the first performance index input by the operation and maintenance personnel.
  • the electronic device may receive a tree-shaped network corresponding to the first performance index from other electronic devices.
  • the electronic device may receive an abnormal performance index that needs root cause localization, and use the performance index as the first performance index; The index constructs a tree network of the first performance index.
  • the tree-shaped network may be constructed according to artificial experience, or the tree-shaped network of the obtained first performance index may be constructed according to the relationship between all performance indicators of the electronic device.
  • a tree-shaped network of a wireless communication system includes a root node and a plurality of child nodes.
  • the first performance indicator corresponding to the root node is the user rate.
  • the plurality of child nodes include a child node representing resource capacity, a child node representing air interface quality, a child node representing some other performance indicators, a child node representing weak coverage, and a child node representing signal interference situation.
  • the performance indicators corresponding to child nodes representing resource capacity include physical resource block (Physical Resource Block, PRB) utilization, distribution ratio of PRB utilization, and control channel element (control channel element, CCE) utilization.
  • PRB Physical Resource Block
  • CCE control channel element
  • the performance indicators corresponding to the sub-nodes representing the air interface quality include block error rate (block error rate, BLER), average scheme level of modulation and coding strategy (Modulation and Coding Scheme, MCS), and channel quality indicator (Channel Quality Indicator, CQI) .
  • the performance indicators corresponding to the child nodes representing some other performance indicators include uplink interference power and handover success rate.
  • the performance index corresponding to the sub-node representing the weak coverage includes the path loss and the average value of the distance between all the user equipments and the base station in the wireless communication system.
  • the performance index corresponding to the child node representing the signal interference situation includes the uplink interference power.
  • the electronic device acquires the sampled value of the first performance index and the sampled values of a plurality of second performance indicators that are sampled within a preset positioning period.
  • the electronic device may acquire the sampled value of the first performance index and the sampled values of a plurality of second performance indicators collected in a preset positioning period before the current time and adjacent to the current time.
  • the preset positioning period may be one day or seven days, and so on.
  • the sampled value of the first performance indicator may include one or more sampled values; and the sampled value of each second performance indicator in the plurality of second performance indicators may include one or more sampled values.
  • the electronic device may also collect sampling values of all performance indicators (including the first performance index and a plurality of second performance indicators) of the electronic device in real time according to a preset sampling period, and combine the collected sampling values and The corresponding sampling time is saved.
  • the specific process of performing S102 by the electronic device may include: acquiring all sampled values of the first performance index and all sampled values of a plurality of second performance indicators at the sampling time within the above-mentioned preset positioning period.
  • the preset sampling period may be one hour or one day, and so on.
  • the preset positioning period may include one or more preset sampling periods, and each preset sampling period corresponds to a sampling moment.
  • the electronic device calculates the influence factor of each child node on the parent node in the tree network according to the sampled value of the first performance index and the sampled value of a plurality of second performance indexes; wherein, the influence factor of each child node on the parent node is calculated as It is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node.
  • the electronic device may perform for each child node in the tree network: using the sampled value of the performance index of the child node and the sampled value of the performance index of the parent node of the child node, calculate the influence factor of the child node on its parent node.
  • the child node may have one or more parent nodes, and the child node may also have one or more influence factors on its parent node.
  • the influence factor of the child node to its parent node is in a one-to-one correspondence with the parent node of the child node.
  • the electronic device may also perform for each parent node in the tree network: using the sampled value of the performance index of the parent node and the sampled value of the performance index of the child nodes of the parent node, calculate the pair of child nodes of the parent node.
  • the influence factor of this parent node may have one or more child nodes, and the parent node's child nodes may also have one or more influence factors on the parent node.
  • the influence factors of the child nodes of the parent node to the parent node are in a one-to-one correspondence with the child nodes of the parent node.
  • the relationship strength of the influence relationship between the nodes, or the influence degree of the performance index between the nodes can be represented by the influence factor.
  • the electronic device determines, according to the influence factors of each child node in the tree network on the parent node, the second performance index in the tree network that causes the first performance index to be abnormal.
  • the electronic device can select a node with a greater impact on the root node; the performance index of the node with a greater impact on the root node is the first performance index that causes the abnormality of the first performance index.
  • the electronic device may be repaired for the second performance index causing the abnormality of the first performance index, so as to solve the problem of the abnormality of the first performance index by adjusting the second performance index causing the abnormality of the first performance index.
  • the influence factor of each child node in the tree network on the parent node can be calculated according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators.
  • the influence factor of a child node on the parent node is used to represent the influence degree of the performance index of the child node on the performance index of the parent node.
  • the above-mentioned multiple second performance indicators correspond to other nodes in the tree-shaped network except the root node; therefore, it can be concluded that the node corresponding to the above-mentioned second performance index can be is: a child node of the root node corresponding to the first performance index or a child node of the child node of the root node corresponding to the first performance index, and the like.
  • the influence factor of each child node in the tree network on the parent node can reflect the degree of influence of the above-mentioned multiple second performance indicators on the first performance indicator. Therefore, according to the influence factor of each child node in the above-mentioned tree network on the parent node, the second performance index in the tree network that causes the abnormality of the first performance index can be determined.
  • the method provided by the present application does not require manual participation, and can determine, through calculation, the sampling value of the first performance index obtained by periodic sampling and the sampling values of a plurality of second performance indicators, which causes the first performance index in the tree network. Abnormal secondary performance indicator. In this way, compared with manual participation in root cause localization, the efficiency and accuracy of root cause localization can be improved, and the cost can also be reduced.
  • the electronic device can automatically calculate the influence factor of each child node in the tree network on the parent node, expanding the tree network of the first performance index will not increase the difficulty of root cause localization.
  • the preset positioning period includes Q sampling moments, where Q ⁇ 2, and Q is an integer.
  • the electronic device obtains the qth sampling value of the first performance index and the qth sampling value of the plurality of second performance indicators by sampling at the qth sampling time of the preset positioning period; Q ⁇ takes values in turn.
  • the qth sampled value of the plurality of second performance indicators includes the qth sampled value of each second performance indicator.
  • the sampled values of the first performance indicator sampled in the preset positioning period include Q sampled values of the first performance indicator; the sampled values of multiple second performance indicators sampled in the preset positioning period include the sampling values of each second performance indicator. Q sample values.
  • the electronic device may first calculate the number of samples in the tree network at each sampling moment according to the Q sampled values of the first performance indicator and the Q sampled values of each of the second performance indicators in the plurality of first performance indicators.
  • the influence factor of each child node on the parent node is calculated, and then the influence factor of each child node on the parent node in the tree network within the preset positioning period (ie the influence factor of each child node on the parent node in the tree network) is calculated.
  • the electronic device for each parent node in the tree network, the electronic device first calculates the influence factor of each child node on the parent node at each sampling moment, and then calculates the impact factor of each child node on the parent node within the preset positioning period
  • the impact factor (that is, the impact factor of each child node on the parent node).
  • S103 may include S301-S302. To illustrate the process of calculating the influence factor of each parent node's child nodes on the parent node more clearly, each parent node is referred to as the first parent node.
  • the electronic device performs for each parent node in the tree network: taking the qth sampled value of the performance index of all child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, and running The influence factor estimation model of the first parent node outputs the qth influence factor of each child node of the first parent node on the first parent node; wherein, q takes values in sequence in ⁇ 1,...,Q ⁇ .
  • the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different.
  • the impact factor prediction model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampling values of the performance indicators of all child nodes of the first parent node.
  • the electronic device pre-stores the influence factor prediction model of each parent node in the tree network.
  • the impact factor estimation model of the first parent node has the function of extracting the impact factor of each child node of the first parent node on the first parent node and the first parent node from the sampling values of the performance indicators of all child nodes of the first parent node. The ability to estimate the performance indicators of the parent node.
  • the electronic device performs for each child node of the first parent node: calculate the average value of Q influence factors of the first child node on the first parent node, and obtain the influence factor of the first child node on the first parent node; wherein, The first child node is any child node of the first parent node.
  • each parent node when calculating the influence factor on the first parent node is referred to as the first parent node.
  • the electronic device divides the sum of the Q influence factors of the first child node to the first parent node by Q to obtain the average value of the Q influence factors of the first child node to the first parent node, and uses the average value as the first The influence factor of the child node on the first parent node. Furthermore, the electronic device can obtain the influence factors of each child node of the first parent node on the first parent node.
  • the electronic device calculates the influence factor of each sampling time by using the sampling values collected at each adoption time, and then averages the influence factors of all the sampling time to determine the unique influence factor of each child node on the parent node. In this way, the method of calculating the influence factors of each sampling time separately and then averaging, compared with using the sampling values collected at all sampling times to calculate the unique influence factor of each child node on the parent node, the accuracy of the obtained influence factor is higher. .
  • the abnormality of the first performance index may be a result of slow deterioration of the first performance index, or a sudden result.
  • the changes of the sampling values of the first performance indicator and the plurality of second performance indicators are both small, and the plurality of second performance indicators at different sampling times have little effect on the first performance indicator.
  • the degree of influence of the performance index does not change much; therefore, the method of averaging the Q influence factors of the first child node on the first parent node in S302 can be used to calculate the influence of the first child node on the first parent node factor.
  • the plurality of second performance indicators before and after the mutation have a significant effect on the first performance index.
  • the change of the influence degree of that is to say, the influence factor of the first child node before the mutation on the first parent node and the influence factor of the first child node after the mutation on the first parent node are quite different.
  • the abnormality of the first performance index is sudden, the greater the change of the influence factor of any child node on the parent node before and after the mutation, the greater the influence of any child node on the abnormality of the first performance index. .
  • the electronic device calculates the specific process of the influence factor of the child node on the parent node for each parent node in the tree network, as shown in FIG. 4 , S103 can be Including S401-S402. To illustrate the process of calculating the influence factor of each parent node's child nodes on the parent node more clearly, each parent node is referred to as the first parent node.
  • the electronic device performs for each parent node in the tree network: taking the qth sampling value of the performance index of all the child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, and running The influence factor estimation model of the first parent node outputs the qth influence factor of each child node of the first parent node on the first parent node; wherein, q takes values in sequence in ⁇ 1,...,Q ⁇ .
  • the electronic device performs for each child node of the first parent node: calculate the first p-1 times of the first child node to the first parent node Calculate the average value of the impact factors to obtain the impact factor of the first child node on the first parent node before the mutation; The influence factor of a child node on the first parent node; calculate the difference or ratio between the influence factor of the first child node on the first parent node before the mutation and the influence factor of the first child node on the first parent node after the mutation. The value or ratio is used as the influence factor of the first child node to the first parent node.
  • p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
  • the difference value may be obtained by subtracting the influence factor of the first child node on the first parent node after the mutation from the influence factor of the first child node on the first parent node before the mutation, and the ratio may be the first child node after the mutation. It is obtained by dividing the impact factor on the first parent node by the impact factor on the first parent node of the first child node before mutation.
  • the electronic device determines that the abnormality of the first performance index is sudden when receiving the sudden change of the performance index as the pth sampling time.
  • the electronic device divides the sum of the first p-1 influence factors of the first child node to the first parent node by the first difference, and the first difference is equal to p-1, to obtain the first child node to the first parent node before mutation Then divide the sum of the p-th - Q-th influencing factors of the first child node to the first parent node by the second difference, and the second difference is equal to Qp, to obtain the first child node after mutation to the first The influence factor of a parent node.
  • the electronic device subtracts the difference between the influence factor of the first child node on the first parent node after the mutation and the influence factor of the first child node on the first parent node before the mutation, as the difference between the first child node and the first parent node.
  • Influence factor or, divide the influence factor of the first child node to the first parent node after the mutation by the ratio of the influence factor of the first child node to the first parent node before the mutation, as the ratio of the first child node to the first parent node Impact factor.
  • the electronic device can obtain the influence factors of each child node of the first parent node on the first parent node.
  • the abnormality of the first performance index may be a sudden result.
  • the sudden abnormality of the first performance index there is a large mutation in the sampling value of the first performance index and the sampling values of multiple second performance indicators, then the multiple second performance indicators before and after the mutation have a significant impact on the first performance
  • the degree of influence of indicators may vary greatly. That is to say, there is a big difference between the influence factor of the first child node before the mutation on the first parent node and the influence factor of the first child node after the mutation on the first parent node. Therefore, the electronic device averages all the influence factors of the first child node on the first parent node before and after the mutation, respectively.
  • the electronic device calculates the difference or ratio of the influence factor of the first child node to the first parent node before and after the mutation, and uses the difference or the ratio as the influence factor of the first child node to the first parent node. Since the difference or the ratio represents the change of the influence factor of the first child node on the first parent node before and after the mutation, and the greater the change of the influence factor of any child node on the parent node before and after the mutation, the The child node has a greater impact on causing the abnormality of the first performance index; therefore, the difference or the ratio is used for locating the root cause of the abnormality of the first performance index, which can improve the accuracy of locating the root cause of the abnormality of the first performance index.
  • the electronic device adopts the h-th sample value of the performance index of the first parent node in the above tree network and the performance index of all child nodes of the first parent node.
  • the h-th sampling value of is used as a training sample to train the preset attention model (Attention Model) to obtain the influence factor prediction model of the first parent node; where h takes values in sequence in ⁇ 1,...,H ⁇ , H ⁇ 100, H is an integer.
  • the electronic device can collect sample values of the first performance index and multiple sample values of the second performance index that the electronic device is applied to in various usage scenarios, so as to obtain the first performance index H sample values of , and H sample values of each second performance indicator in the plurality of second performance indicators.
  • the H sample values of the first performance index and the H sample values of each second performance index are in one-to-one correspondence.
  • the electronic device may arrange the H sample values of the first performance index and the H sample values of each second performance index into the first sample value, the second sample, ... , and the H th sample, the H sample values of the first performance index and the H sample values of each second performance index may be respectively arranged according to other arrangement rules.
  • h takes values in sequence in ⁇ 1,...,H ⁇ , and performs the process of training the preset attention model.
  • the preset attention model used for training when h takes a value in ⁇ 1, . It is assumed that the attention model, for example, the preset attention model used for training when the value of h is 2 is the preset attention model obtained by training when the value of h is 1.
  • the preset attention model obtained by training is the influence factor prediction model of the first parent node.
  • the electronic device uses the h-th sampled value of the performance indicators of all child nodes of the first parent node as the input of the preset attention model, runs the preset attention module, and outputs the performance of the first parent node.
  • the h-th estimated value of the index then calculate the absolute value of the difference between the h-th estimated value of the performance index of the first parent node and the h-th sample value of the performance index of the first parent node; then,
  • the parameters of the preset attention model are adjusted according to the absolute value of the difference, so as to reduce the absolute value of the difference.
  • the structure of the influence factor estimation model of the first parent node obtained by training is shown in Figure 5, assuming that all child nodes of the first parent node Y include child nodes.
  • Child node 2 and child node 3. Take the h-th sample value x1 of the performance index of child node 1, the h-th sample value x2 of the performance index of child node 2, and the h-th sample value x3 of the performance index of child node 3 as the input of the preset attention model .
  • the h-th sample value of the performance index of each child node is input to the corresponding Multilayer Perceptron (MLP); then the output of the MLP is normalized by the corresponding normalization function (for example, the softmax function).
  • MLP Multilayer Perceptron
  • the influence factor f1 of each child node on the first parent node Y is obtained. It includes: the influence factor f1(x1) of the child node x1 on the first parent node Y, the influence factor f1(x2) of the child node x2 on the first parent node Y, the influence factor f1 of the child node x3 on the first parent node Y (x3).
  • the h-th sample value of the performance index of each child node is multiplied by the influence factor of the corresponding child node on the first parent node, and all the multiplied values obtained include x1*f1(x1), x2* f1(x2), x3*f1(x3); the multiplied values of all child nodes are input to an MLP, and the MLP outputs the h-th estimated value f2(X) of the performance index of the first parent node.
  • FIG. 5 is only a schematic diagram of the influence factor estimation model of a parent node, and the embodiment of the present application does not limit the influence factor estimation model of any parent node.
  • the structure of the influence factor prediction model of different parent nodes is also different. Since the performance indicators of different parent nodes are different, and the performance indicators of the child nodes of different parent nodes are different, the parameters of the influence factor prediction models of different parent nodes are also different.
  • the influence factor estimation model of the first parent node has the function of extracting the effect of each child node of the first parent node on the first parent node from the sampling values of the performance indicators of all child nodes of the first parent node. Influence factor, and the ability to estimate the performance index of the first parent node.
  • the estimated value using the performance index of the first parent node is extracted from the sampling values of the performance index of all child nodes of the first parent node by using the impact factor prediction model of the first parent node, if the first parent node If there is a big difference between the estimated value of the performance index of the first parent node and the sample value of the performance index of the first parent node, it can be considered that there are factors other than the performance index of all the child nodes of the first parent node.
  • the performance indicators of all child nodes of the parent node have a greater impact on the sample values of the performance indicators of the first parent node. Other factors may refer to hardware failures or performance metrics not included in the tree network.
  • the electronic device may also determine other factors that cause the first performance index to be abnormal. Specifically, referring to FIG. 6 , the electronic device may execute S601-S604 without executing S103-S104 after S102. In order to illustrate the process of calculating the influence factor of each parent node's child nodes or other factors on the parent node more clearly, each parent node is referred to as the first parent node.
  • the electronic device performs for each parent node in the tree network: taking the qth sampled value of the performance index of all the child nodes of the first parent node as the input of the influence factor estimation model of the first parent node, and running
  • the impact factor estimation model of the first parent node outputs the qth impact factor of each child node of the first parent node on the first parent node and the qth estimated value of the performance index of the first parent node;
  • q takes values in sequence in ⁇ 1,...,Q ⁇ ; other factors refer to factors other than the performance indicators of all child nodes of the first parent node.
  • the electronic device performs for each child node of the first parent node: calculating the average value of Q influence factors of the first child node on the first parent node to obtain the influence factor of the first child node on the first parent node.
  • the first child node is any child node of the first parent node.
  • the electronic device calculates the average value of Q influence factors of other factors of the first parent node on the first parent node, and obtains the influence factors of other factors of the first parent node on the first parent node.
  • the electronic device determines, according to the influence factors of each child node in the tree network on the parent node and the influence factors of other factors of each parent node on the corresponding parent node, the first performance index in the tree network that causes the first performance index to be abnormal.
  • the second performance index or other factors that cause the first performance index to be abnormal are included in the first performance index in the tree network that causes the first performance index to be abnormal.
  • each parent node may be equal to the performance index of a new child node of the parent node.
  • the specific process of S604 please refer to the detailed description of S104, which is not repeated in this embodiment of the present application. .
  • the electronic device in addition to calculating the influence factor of each child node in the tree network on the parent node, the electronic device also calculates the influence factor of other factors of each parent node on the corresponding parent node.
  • the problem of inaccuracy of the second performance index determined from the plurality of second performance indexes and causing the abnormality of the first performance index can be avoided because the obtained multiple second performance indexes are not comprehensive enough. It is also possible to avoid the problem of inaccuracy of the second performance index determined from a plurality of second performance indicators that causes the first performance index to be abnormal when the first performance index is abnormal due to a hardware failure. Therefore, determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators and other factors is more accurate than determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators.
  • the electronic device may determine other than the second performance indicator in the tree network that causes the first performance indicator to be abnormal, and may also determine other performance indicators that cause the first performance indicator to be abnormal. factor. Specifically, referring to FIG. 7 , the electronic device may execute S701-S704 without executing S103-S104 after S102. In order to more clearly illustrate the process of calculating the influence factor of each parent node's child nodes or other factors on the parent node, each parent node is referred to as the first parent node.
  • the electronic device performs for each parent node in the tree network: taking the qth sampled value of the performance index of all the child nodes of the first parent node as the input of the influence factor estimation model of the first parent node, and running
  • the impact factor estimation model of the first parent node outputs the qth impact factor of each child node of the first parent node on the first parent node and the qth estimated value of the performance index of the first parent node;
  • q takes values in sequence in ⁇ 1,...,Q ⁇ .
  • the electronic device performs for each child node of the first parent node: calculate the first p-1 times of the first child node to the first parent node Calculate the average value of the impact factors to obtain the impact factor of the first child node on the first parent node before the mutation; The influence factor of a child node on the first parent node; calculate the difference or ratio between the influence factor of the first child node on the first parent node before the mutation and the influence factor of the first child node on the first parent node after the mutation. The value or ratio is used as the influence factor of the first child node to the first parent node.
  • the electronic device calculates the average value of the first p-1 influence factors of the other factors of the first parent node on the first parent node, and obtains the influence factors of other factors of the first parent node before the mutation on the first parent node; calculate the first parent node.
  • the difference or ratio between the influence factors of other factors of the node on the first parent node and the influence factors of other factors of the first parent node on the first parent node after mutation, and the difference or ratio is taken as the pair of other factors of the first parent node.
  • the electronic device determines, according to the influence factor of each child node in the tree network on the parent node, and the influence factor of other factors of each parent node on the corresponding parent node, the first performance index in the tree network that causes the abnormality of the first performance index.
  • the second performance index or other factors that cause the first performance index to be abnormal are abnormal.
  • each parent node may be equivalent to the performance index of a new child node of the parent node.
  • the electronic device can also determine the root cause of the abnormality of the first performance index from a plurality of second performance indicators and other factors.
  • the problem of inaccuracy of the second performance index determined from the plurality of second performance indexes and causing the abnormality of the first performance index can be avoided because the obtained multiple second performance indexes are not comprehensive enough. It is also possible to avoid the problem of inaccuracy of the second performance index determined from a plurality of second performance indicators that causes the first performance index to be abnormal when the first performance index is abnormal due to a hardware failure. Therefore, determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators and other factors is more accurate than determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators.
  • the electronic device first calculates the influence factor of each child node on the parent node at each sampling time, and then calculates the influence factor of each child node on the parent node within the preset positioning period (that is, the influence factor of each child node on the parent node).
  • the influence factor of the node or for each child node in the tree network, the influence factor of the child node on the parent node can be directly calculated.
  • the Q sampled values of the performance indicators of each node in the tree network are all arrays arranged in the order of the sampling moments, it can be known that the Q sampled values can be represented by the The trend component of the change trend, the periodic component that occurs periodically in the Q sampled values, and the residual characterizing the fluctuation of the Q sampled values are composed. Since the residual represents the fluctuation of the Q sampling values of the performance index of any node, if the residual of the performance index of any node has a large correlation with the residual of the performance index of the parent node of any node, then The fluctuation of the Q sampling values representing the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node. Therefore, the influence factor of any node on the parent node can be determined by using the correlation between the residual of the performance index of any node and the residual of the performance index of the parent node of any node.
  • the electronic device may determine the influence factor of any node on the parent node according to the correlation of the residuals. Specifically, as shown in FIG. 8 , the electronic device may execute S801-S803 for each child node in the tree network to calculate the influence factor of each child node on the parent node. In order to illustrate the process of calculating the influence factor of each child node on the parent node more clearly, each child node is referred to as a second child node.
  • the electronic device calculates the first residual of the performance index of the second child node according to the Q sampled values of the performance index of the second child node; wherein the second child node is any child node of the tree network; the first The residual is used to characterize the fluctuation of the Q sampled values of the performance index of the second child node.
  • the residual of the performance index of the second child node is referred to as the first residual.
  • the Q sampled values of the performance indicator of the second child node may include Q sampled values of one performance indicator, or Q sampled values of each performance indicator.
  • the first residual of the performance index of the second child node may include the first residual of one performance index, or the first residual of each performance index.
  • the electronic device calculates the performance according to the Q sampled values of each performance indicator of the second subnode The first residual of the indicator.
  • the electronic device may use a time series decomposition algorithm to extract the trend component of the performance index of the second child node and the trend component of the performance index of the second child node from the Q sampled values of the performance index of the second child node.
  • Periodic component then subtract the trend component and the periodic component from the Q sampled values of the performance index of the second child node to obtain the first residual of the performance index of the second child node.
  • the trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node
  • the periodic component is used to represent the periodic change of the Q sampled values of the performance index of the second child node.
  • the time series decomposition algorithm includes the seasonal trend decomposition method based on local weighted regression (Seasonal-Trend decomposition procedure based on Loess, STL decomposition method).
  • the electronic device performs the following steps for each performance indicator of the second child node to obtain the first residual of the performance indicator: from the Q sampled values of the performance indicator, extracting the trend component of the performance indicator and the Periodic component; then subtract the trend component and the periodic component from the Q sampled values of the performance index to obtain the first residual of the performance index.
  • the electronic device may use H sample values of the performance index of the second sub-node in advance to train a differential integrated moving average autoregressive model (Autoregressive Integrated Moving Average model, ARIMA model) based on the time series decomposition algorithm, and obtain The components of the second child node fit the model.
  • the component fitting model has the capability of extracting a trend component of the performance index of the second child node and a periodic component of the performance index of the second child node from a plurality of sampled values of the performance index of the second child node.
  • the electronic device can also train other models based on the time series decomposition algorithm to obtain the component fitting model of the second child node, for example, an autoregressive model (Autoregressive model, AR model), this
  • an autoregressive model Autoregressive model, AR model
  • the electronic device may input the Q sampled values of the performance indicators of the second child node into the component fitting model of the second child node, run the component fitting model of the second child node, and output the component fitting model of the second child node.
  • the trend component of the performance indicator and the periodic component of the performance indicator of the second child node may input the Q sampled values of the performance indicators of the second child node into the component fitting model of the second child node, run the component fitting model of the second child node, and output the component fitting model of the second child node.
  • the electronic device performs the following steps for each performance index of the second child node to obtain the trend component and periodic component of the performance index: inputting the Q sampled values of the performance index into the component fitting model of the second child node, Run the component fitting model of the second child node, and output the trend component of the performance index and the periodic component of the performance index.
  • the electronic device calculates a second residual of the performance index of the parent node of the second child node according to the Q sampled values of the performance index of the parent node of the second child node, where the second residual is used to characterize the performance index of the second child node.
  • the fluctuation of the Q sampled values of the parent node's performance index is used to characterize the performance index of the second child node.
  • the residual of the performance index of the parent node of the second child node is referred to as the second residual.
  • the parent node of the second child node may include one parent node or multiple parent nodes.
  • the second residual of the performance index of the parent node of the second child node may include the second residual of the performance index of one parent node of the second child node, or the second residual of the performance index of each parent node of the second child node Difference.
  • the Q sampled values of the performance indicator of any parent node of the second child node may include Q sampled values of one performance indicator, or Q sampled values of each performance indicator.
  • the second residual of the performance index of any parent node of the second child node may include the second residual of one performance index, or the second residual of each performance index.
  • the electronic device will perform an analysis on each parent node of the second child node.
  • the Q sampled values of the performance indicator are used to calculate the second residual of the performance indicator.
  • the electronic device calculates the influence factor of the second child node on the parent node according to the preset correlation coefficient algorithm, the first residual and the second residual.
  • the parent node of the second child node includes one parent node or multiple parent nodes, and the influence factor of the second child node on the parent node includes the influence factor of the second child node on a parent node or the second child node on each parent node impact factor.
  • the electronic device performs the calculation according to the preset correlation coefficient algorithm, the first residual of the performance index of the second child node, and the performance of each parent node of the second child node.
  • the second residual of the indicator calculates the influence factor of the second child node on each parent node.
  • the electronic device calculates the correlation between the first residual of the one performance index and the second residual of the one performance index, and uses the correlation as the influence factor of the second child node on the parent node.
  • the electronic device calculates the correlation between the first residual of each performance index of the second child node and the second residual of this one performance index, and obtains multiple correlations one-to-one corresponding to the multiple performance indexes of the second child node ; Then average the multiple correlations to obtain the influence factor of the second child node on this parent node.
  • the electronic device calculates the correlation between the first residual of one performance index of the second child node and the second residual of each performance index, and obtains multiple correlations one-to-one corresponding to the multiple performance indexes of the parent node; The multiple correlations are averaged to obtain the influence factor of the second child node on this one parent node.
  • the electronic device calculates the correlation between the first residual of each performance index of the second child node and the second residual of each performance index, and obtains multiple correlations corresponding to each performance index of the second child node; Average multiple correlations corresponding to each performance index of the second child node to obtain an average value corresponding to each performance index of the second child node; then, average the average values corresponding to all performance indexes of the second child node , to get the influence factor of each performance index of the second child node on this parent node.
  • the calculation formula of the preset correlation system may be as shown in formula (1):
  • U 1 is the first residual
  • U 2 is the second residual
  • r(U 1 , U 2 ) is the correlation between the first residual and the second residual (or called correlation coefficient)
  • Cov(U 1 , U 2 ) is the covariance of U 1 and U 2
  • Var[U 1 ] is the variance of U 1
  • Var[U 2 ] is the variance of U 2 .
  • the electronic device determines the influence factor of the second child node on the parent node according to the correlation between the first residual and the second residual, and then uses the influence factor to determine the second performance that causes the first performance index to be abnormal.
  • the indicator refers to the degree of influence on the fluctuation of the Q sampling values of the performance index of the second child node on the fluctuation of the Q sampling values of the performance index of the second child node to determine the abnormality of the first performance index. the second performance indicator.
  • the fluctuation of the Q sampling values of the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node, it can be determined that the performance index of any node affects the parent node. Therefore, determining the influence factor of the second child node on the parent node according to the correlation between the first residual and the second residual can improve the location of the root cause of the abnormal first performance index. accuracy.
  • the breadth-first algorithm (also called breadth-first algorithm) can be used to first calculate the influence factors of each child node in the tree network For the influence factor of the parent node, calculate the influence factor of each leaf node in the tree network on the root node; then determine the node with the largest influence factor on the root node from all the leaf nodes in the tree network, and determine the influence on the root node.
  • the performance indicator of the node with the largest factor is the second performance indicator that causes the first performance indicator to be abnormal.
  • S104 may include S901-S902.
  • the electronic device calculates the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node in the tree network on the parent node.
  • the electronic device calculates the influence factor of each leaf node on the root node for each leaf node in the tree network.
  • the influence factor of each leaf node on the root node may include one influence factor or multiple influence factors.
  • the total number of influence factors of each leaf node to the root node is equal to the total number of paths from each leaf node to the root node.
  • the electronic device multiplies all the influence factors on each path from the leaf node to the root node to obtain an influence factor of the leaf node on the root node.
  • the tree network includes a root node A and multiple child nodes, and the multiple child nodes include node B, node C, node D, node E, and node F. , Node G, and Node H.
  • the root node A is the parent node of node B, node C, and node D.
  • Node B is the parent of Node E and Node F.
  • Node C is the parent of node F.
  • Node D is the parent of Node G and Node H.
  • the influence factor of node B on root node A is 0.6, the influence factor of node C on root node A is 0.3, the influence factor of node D on node A is 0.5, the influence factor of node E on node B is 0.6, and the influence factor of node F on node A is 0.6.
  • the influence factor of B is 0.7, the influence factor of node F on node C is 0.1, the influence factor of node G on node D is 0.3, and the influence factor of node H on node D is 0.2.
  • the influence factor of node E on node B and the influence factor of node B on root node A can be multiplied, and the influence factor of node E on root node A can be obtained as 0.36.
  • Multiply the impact factor of node F on node B and the impact factor of node B on root node A and get an impact factor of node F on root node A as 0.42;
  • the influence factor of node A is multiplied, and another influence factor of node F to root node A is obtained as 0.03.
  • Multiplying the influence factor of node G on node D and the influence factor of node D on root node A another influence factor of node G on root node A is obtained as 0.15.
  • Multiplying the influence factor of node H on node D and the influence factor of node D on root node A the other influence factor of node H on root node A is 0.10.
  • the electronic device determines the performance indexes of the N leaf nodes in the tree network as the second performance indexes that cause the first performance index to be abnormal; wherein, the influence factor of the N leaf nodes on the root node is greater than that of other nodes in the tree network
  • the influence factor of the leaf node on the root node, N ⁇ 1, N is an integer.
  • the electronic device may select N leaf nodes from all the leaf nodes in the tree network, and determine the performance indicators of the N leaf nodes as the second performance indicators that cause the first performance indicator to be abnormal.
  • the electronic device may sort all leaf nodes in descending order of shadow factors according to the influence factors of all leaf nodes on the parent node, and then select the top N leaf nodes from the sorted leaf nodes. Wherein, if the influence factor of a leaf node to the root node includes multiple influence factors, the largest influence factor of the leaf node to the root node can be used for sorting.
  • the electronic device may further acquire path information from each of the N leaf nodes to the root node.
  • the path information may include connection relationships of all nodes on the path from a leaf node to the root node, and performance indicators of all nodes on the path from a leaf node to the root node.
  • the electronic device or the user can adjust the second performance index that causes the first performance index to be abnormal according to the path information and the second performance index that causes the first performance index to be abnormal, so as to solve the problem of the first performance index being abnormal.
  • the path information from the leaf node to the root node may include the path information corresponding to the maximum influence factor of the leaf node on the root node, or may include the leaf node to the root node. multiple path information.
  • the second performance indicator determined by the electronic device that causes the first performance indicator to be abnormal is the performance indicator of node F.
  • the path information from node F to root node A may include: information indicating that node F points to node B and node B points to node A, the performance index of node F, the performance index of node B and the performance index of node A.
  • the electronic device directly calculates the influence factors of all leaf nodes in the tree network on the root node; and then determines that the performance index of the leaf node with a larger influence factor on the root node is the second performance index that causes the abnormality of the first performance index.
  • the tree network includes a first preset node, the first preset node is not a leaf node, and the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to a preset threshold. Since the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to the preset threshold, the first preset node can be considered as an indecomposable node. Then, the influence factor of each leaf node in the tree network on the root node is calculated, and the influence factor of the first preset node on the root node is also calculated.
  • S104 may include S1101-S1102.
  • the electronic device calculates the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node on the parent node in the tree network, and calculates the influence factor of the first preset node on the root node.
  • the tree network may include one or more first preset nodes.
  • the electronic device calculates the influence factor of each first preset node in the tree network on the root node.
  • the electronic device may, for other nodes in the tree network except the leaf node and the root node, determine whether the influence factor of each child node of each other node on the parent node is less than or equal to the predetermined factor. Set the threshold. If the influence factor of all child nodes of each other node on the parent node is less than or equal to the preset threshold, then each other node is determined to be a first preset node. If the influence factor of at least one child node of each other node on the parent node is greater than the preset threshold, it is determined that each other node is not the first preset node.
  • the tree network includes two first preset nodes, namely node C and node D.
  • the electronic device determines the performance indicators of the M nodes in the leaf node and the first preset node as the second performance indicator that causes the first performance indicator to be abnormal; wherein, the influence factor of the M nodes on the root node is greater than the tree shape
  • the influence factor of other nodes in the network on the root node, M ⁇ 1, M is an integer.
  • the electronic device may select M nodes from all leaf nodes and all first preset nodes in the tree network; and then determine the performance indicators of the M nodes as the second performance indicators that cause the first performance indicator to be abnormal. Wherein, if the influence factor of a leaf node or a first preset node on the root node includes multiple influence factors, the maximum influence factor of the leaf node or the first preset node on the root node can be used to select M node.
  • the electronic device may further acquire path information from each of the M nodes to the root node.
  • the path information may include connection relationships of all nodes on the path from a node to the root node, and performance indicators of all nodes on the path from a node to the root node.
  • the electronic device or the user can adjust the second performance index that causes the first performance index to be abnormal according to the path information and the second performance index that causes the first performance index to be abnormal, so as to solve the problem of the first performance index being abnormal.
  • the path information from the node to the root node may include the path information corresponding to the maximum influence factor of the node on the root node, or may include multiple paths from the node to the root node information.
  • the influence factor of the child nodes of the first preset node on the first preset node does not exceed the preset threshold, that is, the first preset node.
  • the performance index of the child node of the first preset node has little influence on the performance index of the first preset node, it can be considered that the first preset node is similar to the leaf node, and the root cause of the abnormal first performance index may also exist in the tree network. in the first preset node in .
  • the electronic device calculates the influence factor of all leaf nodes in the tree network on the root node, and the influence factor of the first preset node in the tree network on the root node; In a preset node, a node with a larger influence factor on the root node is determined, and its performance index is the second performance index that causes the abnormality of the first performance index. In this way, both the leaf node that may include the root cause that causes the abnormality of the first performance index and the first preset node that may include the root cause that causes the abnormality of the first performance index are judged, and the first performance index that causes the abnormality can be determined more accurately. Abnormal secondary performance indicator.
  • the depth-first algorithm can be used to search the tree network from the root node to the parent node level by level.
  • the node with the largest influence factor is determined until the found node is a leaf node or the first preset node, and the performance index of the found node is determined to be the second performance index that causes the first performance index to be abnormal.
  • the specific process of S104 is described. As shown in FIG. 12, S104 may include S1201-S1208.
  • the electronic device takes the root node as the second parent node, and determines the child node with the largest impact factor on the second parent node according to the influence factors of each child node in the tree network on the parent node.
  • the electronic device determines whether the child node with the largest influence factor on the second parent node is a leaf node.
  • the electronic device takes the child node with the largest influence factor on the second parent node as the second parent node, and determines the child node with the largest influence factor on the second parent node according to the influence factors of each child node in the tree network on the parent node. child node.
  • the electronic device takes the child node with the largest influence factor of the second parent node as the child node to be selected.
  • the electronic device determines whether the total number of child nodes to be selected is less than N.
  • N is an integer.
  • the electronic device deletes the child node with the largest influence factor on the second parent node from the tree network.
  • the electronic device also deletes all parent nodes without child nodes from the tree network caused by deleting the child node with the largest influence factor on the second parent node.
  • N is equal to 2
  • S1201-S1207 are executed, the node E is deleted from the tree-shaped network.
  • the node E is deleted from the tree-shaped network.
  • the tree-shaped network after deleting node E continue to execute S1201-S1207, and delete node F from the tree-shaped network. Since node B has no child nodes after node F is deleted, node B is also deleted from the tree-shaped network.
  • the electronic device uses the obtained performance index of the child node to be selected as the second performance index that causes the first performance index to be abnormal.
  • the electronic device can search the tree network from the root node to the node with the largest influence factor on the parent node, until the found node is a leaf node, and determine that the performance index of the found node is:
  • the second performance index that causes the first performance index to be abnormal can also determine the path information from the found node to the root node.
  • the above-mentioned electronic device or root cause locating apparatus includes corresponding hardware structures and/or software modules for executing each function.
  • the embodiments of the present application can be implemented in hardware or a combination of hardware and computer software. Whether a function is performed by hardware or computer software driving hardware depends on the specific application and design constraints of the technical solution. Skilled persons may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the embodiments of the present application.
  • the electronic device or the root cause locating device may be divided into functional modules according to the foregoing method examples.
  • each functional module may be divided corresponding to each function, or two or more functions may be integrated into one processing unit. in the module.
  • the above-mentioned integrated modules can be implemented in the form of hardware, and can also be implemented in the form of software function modules. It should be noted that, the division of modules in the embodiments of the present application is schematic, and is only a logical function division, and there may be other division manners in actual implementation.
  • an embodiment of the present application provides a root cause locating device 1300 .
  • the obtaining module 1301 is used to support the root cause locating device 1300 to perform S101-S102 in the above method embodiments, and/or other processes for the techniques described herein.
  • the computing module 1302 is used to support the root cause locating device 1300 to perform S103, S301-S302, S401-S402, S601-S603, S701-S703, S801-S803 in the above method embodiments, and/or for the techniques described herein other processes.
  • the location module 1303 is used to support the root cause location device 1300 to perform S104, S604, S704, S901-S902, S1101-S1102, S1201-S1208, and/or other processes for the techniques described herein in the above method embodiments.
  • the above root cause locating apparatus 1300 may further include: a model training module.
  • the model training module is used to support the root cause locating apparatus 1300 to perform the process of training a preset attention model in the above method embodiments, and/or other processes for the techniques described herein.
  • the above root cause locating device 1300 includes but is not limited to the unit modules listed above.
  • the root cause locating apparatus 1300 may further include a storage unit for storing sampled values of performance indicators.
  • the specific functions that can be realized by the above functional units also include but are not limited to the functions corresponding to the method steps described in the above examples.
  • the detailed description of other units of the root cause locating device 1300 please refer to the detailed description of the corresponding method steps. This embodiment of the present application will not be repeated here.
  • the above-mentioned acquisition module 1301 , calculation module 1302 , and location module 1303 can be integrated into one processing module, and the above-mentioned storage unit can be a storage module of the root cause location device 1300 .
  • FIG. 13 shows a possible schematic structural diagram of the electronic device involved in the above embodiment.
  • the electronic electronic device 1400 includes: a processing module 1401 , a storage module 1402 and a communication module 1403 .
  • the processing module 1401 is used to control and manage the electronic device 1400 .
  • the storage module 1402 is used to store program codes and data of the electronic device 1400 .
  • the communication module 1403 is used to communicate with other devices. For example, the communication module is used to receive or send data to other devices.
  • the processing module 1401 may be a processor or a controller, for example, a CPU, a general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application-Specific Integrated Circuit, ASIC), a field programmable Gate Array (Field Programmable Gate Array, FPGA) or other programmable logic devices, transistor logic devices, hardware components or any combination thereof. It may implement or execute the various exemplary logical blocks, modules and circuits described in connection with this disclosure.
  • the processor may also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a DSP and a microprocessor, and the like.
  • the communication module 1403 may be a transceiver, a transceiver circuit, a communication interface, or the like.
  • the storage module 1402 may be a memory.
  • Embodiments of the present application also provide a computer-readable storage medium, where computer program codes are stored in the computer-readable storage medium.
  • the electronic device executes the computer program code, the electronic device realizes the execution of FIG. 1 , FIG. 3 , and FIG. 4.
  • the relevant method steps in any one of FIG. 6 , FIG. 7 , FIG. 8 , FIG. 9 , FIG. 11 and FIG. 12 implement the method in the above embodiment.
  • Embodiments of the present application also provide a computer program product, which, when the computer program product runs on an electronic device, enables the electronic device to execute FIG. 1 , FIG. 3 , FIG. 4 , FIG. 6 , FIG. 7 , FIG. 8 , FIG. 9 ,
  • the relevant method steps in any of Figures 11 and 12 implement the methods in the above-described embodiments.
  • the root cause locating device 1300, the electronic device 1400, the computer-readable storage medium or the computer program product provided in this application are all used to execute the corresponding method provided above, therefore, the beneficial effects that can be achieved can refer to the above The beneficial effects in the corresponding method provided in this article will not be repeated here.
  • the disclosed system, apparatus and method may be implemented in other manners.
  • the device embodiments described above are only illustrative.
  • the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods.
  • multiple units or components may be Incorporation may either be integrated into another system, or some features may be omitted, or not implemented.
  • the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium.
  • the technical solutions of the present application can be embodied in the form of software products in essence, or the parts that contribute to the prior art, or all or part of the technical solutions, and the computer software products are stored in a storage medium , which includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor to execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage medium includes: flash memory, removable hard disk, read-only memory, random access memory, magnetic disk or optical disk and other media that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

Embodiments of the present application provide a root cause localization method and a device, relating to the technical field of fault localization, and capable of improving the accuracy of root cause localization. The specific solution comprises: acquiring a tree network for a first performance indicator; acquiring a sampling value of the first performance indicator and sampling values of multiple second performance indicators obtained by means of sampling in a preset localization cycle; calculating, according to the sampling value of the first performance indicator and the sampling values of the multiple second performance indicators, influence factors of respective child nodes in relation to a parent node in the tree network; and determining, according to the influence factors of the respective child nodes in relation to the parent node in the tree network, a second performance indicator that has caused an abnormality in the first performance indicator in the tree network. The tree network comprises the first performance indicator and the multiple second performance indicators that influence the first performance indicator. The first performance indicator corresponds to a root node of the tree network, and the multiple second performance indicators correspond to nodes other than the root node in the tree network.

Description

一种根因定位方法及装置、电子设备A root cause locating method and device, and electronic equipment 技术领域technical field
本申请涉及故障定位技术领域,尤其涉及一种根因定位方法及装置、电子设备。The present application relates to the technical field of fault location, and in particular, to a root cause location method and device, and electronic equipment.
背景技术Background technique
在无线通信系统的运维中,监控无线通信系统的性能作为很重要的一个环节,其主要用于无线通信系统中的性能故障定位和性能故障根因定位。对此,运维人员设置了较多的指标用于评价无线通信系统的性能,对无线通信系统进行性能故障定位和性能故障根因定位。In the operation and maintenance of the wireless communication system, monitoring the performance of the wireless communication system is an important part, which is mainly used for the location of performance faults and the root cause of performance faults in the wireless communication system. In this regard, the operation and maintenance personnel have set many indicators to evaluate the performance of the wireless communication system, and to locate the performance fault and the root cause of the performance fault of the wireless communication system.
其中,这些指标可以包括接入类、保持性和移动性等方面的关键绩效指标(Key Performance Indicator,KPI),还包括速率和时延等业务体验指标。性能故障定位是指定位出上述各个指标中数值异常的指标。性能故障根因定位是指分析导致上述指标的数值异常的原因。定位出性能故障和性能故障根因后,运维人员可以排除该性能故障,以保证用户的无线通信使用体验。Among them, these indicators may include key performance indicators (Key Performance Indicator, KPI) in terms of access, retention, and mobility, as well as service experience indicators such as rate and delay. The performance fault location is to locate the abnormal value of the above indicators. Root cause location of performance faults refers to analyzing the causes of abnormal values of the above indicators. After locating the performance fault and the root cause of the performance fault, the operation and maintenance personnel can eliminate the performance fault to ensure the user's wireless communication experience.
目前,一些方案中,可以利用故障树和上述各个指标的具体数值进行性能故障定位,确定出数值异常的指标(简称为异常指标);然后,分别判断该异常指标在故障树中的各个子节点的指标的具体数值是否大于该指标对应的门限值,如果一个子节点对应的指标的具体数值大于该指标对应的门限值,则可以确定上述性能故障根因为该子节点对应的指标异常。At present, in some solutions, the fault tree and the specific values of the above indicators can be used to locate performance faults, and the indicators with abnormal values (abbreviated as abnormal indicators) can be determined; Whether the specific value of the index is greater than the threshold value corresponding to the index, if the specific value of the index corresponding to a child node is greater than the threshold value corresponding to the index, it can be determined that the above-mentioned performance failure root cause is that the index corresponding to the child node is abnormal.
但是,采用对比门限值的方式进行性能故障根因定位,可能会因为门限值设置不准确而影响性能故障根因定位的准确性。并且,各个子节点的指标对异常指标可能都有不同程度的影响;单纯采用门限判断的方式确定各个子节点的指标是否对异常指标产生影响比较武断,会影响性能故障根因定位的准确性。However, the performance fault root cause location is performed by comparing the threshold value, which may affect the accuracy of the performance fault root cause location due to the inaccurate setting of the threshold value. In addition, the indicators of each sub-node may have different degrees of influence on the abnormal indicators; it is relatively arbitrary to determine whether the indicators of each sub-node have an impact on the abnormal indicators by simply using the threshold judgment method, which will affect the accuracy of performance fault root cause location.
发明内容SUMMARY OF THE INVENTION
本申请提供一种根因定位方法及装置、电子设备,可以提高根因定位的准确度。The present application provides a root cause locating method, device, and electronic device, which can improve the accuracy of root cause locating.
第一方面,本申请提供一种根因定位方法,该根因定位方法包括:获取第一性能指标的树形网络;获取预设定位周期内采样得到的第一性能指标的采样值和多个第二性能指标的采样值;根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子;根据树形网络中各个子节点对父节点的影响因子,确定出树形网络中导致第一性能指标异常的第二性能指标。In a first aspect, the present application provides a root cause locating method, the root cause locating method includes: acquiring a tree-shaped network of a first performance index; The sampling value of the second performance index; according to the sampling value of the first performance index and the sampling values of a plurality of second performance indicators, calculate the influence factor of each child node in the tree network on the parent node; according to each child node in the tree network For the influence factor of the parent node, the second performance index in the tree network that causes the abnormality of the first performance index is determined.
其中,树形网络包括第一性能指标和影响第一性能指标的多个第二性能指标,第一性能指标对应树形网络的根节点,多个第二性能指标对应树形网络中除根节点之外的其他节点。每个子节点对父节点的影响因子用于表征每个子节点的性能指标对父节点的性能指标的影响程度。The tree-shaped network includes a first performance indicator and a plurality of second performance indicators that affect the first performance indicator, the first performance indicator corresponds to the root node of the tree-shaped network, and the plurality of second performance indicators corresponds to one of the root nodes in the tree-shaped network. other nodes. The influence factor of each child node on the parent node is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node.
上述方法中,可以根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子。一个子节点对父节点的影响因子用于表征该子节点的性能指标对父节点的性能指标的影响程度。其中,由于上述第一性能指标对应树形网络的根节点,上述多个第二性能指标对应树形网络中除根节点之外的其他节点;因此可以得出:上述第二性能指标对应的节点可以为:第一性能指标对应的根节点的子节点或者该第一性能指标对应的根节点的子节点的子节点等。由此可见,该树形网络中各个子节点对父节点的影响因子,可以反映出上述多 个第二性能指标对第一性能指标的影响程度。因此,根据上述树形网络中各个子节点对父节点的影响因子,可以确定出树形网络中导致第一性能指标异常的第二性能指标。In the above method, the influence factor of each child node in the tree network on the parent node can be calculated according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators. The influence factor of a child node on the parent node is used to represent the influence degree of the performance index of the child node on the performance index of the parent node. Wherein, since the above-mentioned first performance index corresponds to the root node of the tree-shaped network, the above-mentioned multiple second performance indicators correspond to other nodes in the tree-shaped network except the root node; therefore, it can be concluded that the node corresponding to the above-mentioned second performance index can be is: a child node of the root node corresponding to the first performance index or a child node of the child node of the root node corresponding to the first performance index, and the like. It can be seen that the influence factor of each child node in the tree network on the parent node can reflect the degree of influence of the above-mentioned multiple second performance indicators on the first performance indicator. Therefore, according to the influence factor of each child node in the above-mentioned tree network on the parent node, the second performance index in the tree network that causes the abnormality of the first performance index can be determined.
本申请提供的方法,不需要人工参与,便可以根据周期性采样得到的第一性能指标的采样值和多个第二性能指标的采样值,通过计算确定出树形网络中导致第一性能指标异常的第二性能指标。这样,相比于人工参与进行根因定位,可以提升根因定位的效率和准确度,还可以降低成本。The method provided by the present application does not require manual participation, and can determine, through calculation, the sampling value of the first performance index obtained by periodic sampling and the sampling values of a plurality of second performance indicators, which causes the first performance index in the tree network. Abnormal secondary performance indicator. In this way, compared with manual participation in root cause localization, the efficiency and accuracy of root cause localization can be improved, and the cost can also be reduced.
在一种可能的设计方式中,预设定位周期包括Q个采样时刻,Q≥2,Q为整数。上述获取预设定位周期内采样得到的第一性能指标的采样值和多个第二性能指标的采样值,包括:在预设定位周期的第q个采样时刻,采样得到第一性能指标的第q个采样值和多个第二性能指标的第q个采样值。其中,q在{1,……,Q}中依次取值。In a possible design manner, the preset positioning period includes Q sampling moments, where Q≥2, and Q is an integer. The above-mentioned acquisition of the sampling value of the first performance index and the sampling values of a plurality of second performance indicators obtained by sampling in the preset positioning period includes: at the qth sampling time of the preset positioning period, sampling to obtain the first performance index of the first performance index. The q sampled values and the qth sampled value of the plurality of second performance indicators. Among them, q takes values in sequence in {1,...,Q}.
在该设计方式中,可以通过预设定位周期的大小,控制性能故障定位的执行频率,以及控制用于性能故障定位的第一性能指标的个数和多个第二性能指标的个数。In this design mode, the execution frequency of performance fault localization can be controlled by presetting the size of the localization period, and the number of first performance indicators and the number of multiple second performance indicators used for performance fault localization can be controlled.
另一种可能的设计方式中,上述根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子,包括:首先,针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子;然后,针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的Q个影响因子的平均值,得到第一子节点对第一父节点的影响因子;其中,第一子节点是第一父节点的任一个子节点。其中,第一父节点为树形网络中的任一个父节点;不同父节点的影响因子预估模型不同;第一父节点的影响因子预估模型具有从第一父节点的所有子节点的性能指标的采样值中,提取出第一父节点的每个子节点对第一父节点的影响因子的能力。In another possible design method, calculating the influence factor of each child node on the parent node in the tree network according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators includes: first, for the tree Each parent node in the shape network executes: the qth sampled value of the performance index of all child nodes of the first parent node is used as the input of the impact factor estimation model of the first parent node, and the impact of the first parent node is run. Factor estimation model, output the qth influence factor of each child node of the first parent node to the first parent node; then, for each child node of the first parent node: calculate the effect of the first child node on the first parent node. The average value of the Q influence factors is used to obtain the influence factor of the first child node on the first parent node; wherein, the first child node is any child node of the first parent node. Among them, the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different; the impact factor estimation model of the first parent node has the performance of all child nodes from the first parent node From the sampling value of the index, the ability to extract the influence factor of each child node of the first parent node on the first parent node.
在该设计方式中,针对树形网络中的每个父节点,q依次在{1,……,Q}中取值,利用第q个采样时刻采集到的第一性能指标的第q个采样值和多个第二性能指标各自的第q个采样值,计算每个父节点的各个子节点对父节点的第q个影响因子,即利用任一个采样时刻采集到的性能指标的采样值,计算这一采用时刻下的子节点对父节点的影响因子。由于不同采样时刻下采集到的性能指标(包括第一性能指标和多个第二性能指标)的数值不同,而数值不同的性能指标所确定出的性能指标之间的影响程度也可能不同。因此,分别利用各个采用时刻采集到的的采样值计算各个采样时刻的影响因子,再对所有采样时刻的影响因子求平均,确定出每个子节点对父节点的唯一的影响因子。这样分别计算各个采样时刻的影响因子再求平均的方法,相较于用所有采样时刻下采集到的采样值计算每个子节点对父节点的唯一的影响因子,得到的影响因子的准确度更高。In this design method, for each parent node in the tree network, q takes values in {1, . value and the qth sampling value of each of the multiple second performance indicators, calculate the qth impact factor of each child node of each parent node on the parent node, that is, use the sampling value of the performance indicator collected at any sampling moment, Calculate the influence factor of the child node on the parent node at this adoption moment. Since the values of the performance indicators (including the first performance indicator and a plurality of second performance indicators) collected at different sampling times are different, the degree of influence between the performance indicators determined by the performance indicators with different values may also be different. Therefore, the influence factor of each sampling time is calculated by using the sampling values collected at each adopting time, and then the influence factors of all sampling time are averaged to determine the unique influence factor of each child node on the parent node. In this way, the method of calculating the influence factors of each sampling time separately and then averaging, compared with using the sampling values collected at all sampling times to calculate the unique influence factor of each child node on the parent node, the accuracy of the obtained influence factor is higher. .
其次,利用影响因子预估模型,可以一次性得到每个父节点的所有子节点对父节点的第q个影响因子,进而提高了根因定位的效率。Secondly, using the impact factor prediction model, the qth impact factor of all child nodes of each parent node on the parent node can be obtained at one time, thereby improving the efficiency of root cause location.
另一种可能的设计方式中,上述根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子,包括:首先,针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子;然后,若Q个采样时刻中的第p个采样时刻为性能指标的突变时刻,针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的前p-1个影响因子的平均值,得到突变前第一子节点对第一父节点的影响因子;计算第一子节点对第一父节点的第p个-第Q个影响因子的平均值,得到突变后第一子节点对第一父节点的影响因子;最后,计算突变前第一子节点对 第一父节点的影响因子与突变后第一子节点对第一父节点的影响因子的差值或比值,将差值或比值作为第一子节点对第一父节点的影响因子。In another possible design method, calculating the influence factor of each child node on the parent node in the tree network according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators includes: first, for the tree Each parent node in the shape network executes: the qth sampled value of the performance index of all child nodes of the first parent node is used as the input of the impact factor estimation model of the first parent node, and the impact of the first parent node is run. The factor estimation model outputs the qth impact factor of each child node of the first parent node on the first parent node; then, if the pth sampling time in the Q sampling moments is the mutation moment of the performance index, the first Each child node of the parent node executes: calculate the average value of the first p-1 impact factors of the first child node to the first parent node, and obtain the impact factor of the first child node to the first parent node before mutation; calculate the first child node The average value of the p-th -Q-th impact factor of the node on the first parent node, and the impact factor of the first child node on the first parent node after the mutation is obtained; finally, the first child node before the mutation is calculated. The difference or ratio of the influence factor of , and the influence factor of the first child node to the first parent node after mutation, and the difference or ratio is used as the influence factor of the first child node to the first parent node.
其中,第一父节点为树形网络中的任一个父节点;不同父节点的影响因子预估模型不同。第一父节点的影响因子预估模型具有从第一父节点的所有子节点的性能指标的采样值中,提取出第一父节点的每个子节点对第一父节点的影响因子的能力。p为小于或等于Q的正整数;第一子节点是第一父节点的任一个子节点。Wherein, the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different. The impact factor estimation model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampled values of performance indicators of all child nodes of the first parent node. p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
可以理解的是,由于第一性能指标异常可能是突发性的结果。对于突发性的第一性能指标异常,第一性能指标的采样值和多个第二性能指标的采样值都存在一个较大的突变,则突变前后的多个第二性能指标对第一性能指标的影响程度的变化可能较大。也就是说,突变前的第一子节点对第一父节点的影响因子和突变后的第一子节点对第一父节点的影响因子存在较大的差异。因此,在该设计方式中分别对突变前后的第一子节点对第一父节点的所有影响因子求平均。It can be understood that the abnormality of the first performance index may be a sudden result. For the sudden abnormality of the first performance index, there is a large mutation in the sampling value of the first performance index and the sampling values of multiple second performance indicators, then the multiple second performance indicators before and after the mutation have a significant impact on the first performance The degree of influence of indicators may vary greatly. That is to say, there is a big difference between the influence factor of the first child node before the mutation on the first parent node and the influence factor of the first child node after the mutation on the first parent node. Therefore, in this design method, all the influencing factors of the first child node before and after mutation to the first parent node are averaged respectively.
进一步地,该设计方式中计算突变前后第一子节点对第一父节点的影响因子的差值或比值,将该差值或该比值作为第一子节点对第一父节点的影响因子。由于该差值或该比值表示了第一子节点对第一父节点的影响因子在突变前后的变化,并且,任一个子节点对父节点的影响因子在突变前后的变化越大,该任一个子节点对导致第一性能指标异常的影响就越大;因此,该差值或该比值用于第一性能指标异常的根因定位,可以提高第一性能指标异常的根因定位的准确性。Further, in this design method, the difference or ratio of the influence factor of the first child node on the first parent node before and after the mutation is calculated, and the difference or the ratio is used as the influence factor of the first child node on the first parent node. Since the difference or the ratio represents the change of the influence factor of the first child node on the first parent node before and after the mutation, and the greater the change of the influence factor of any child node on the parent node before and after the mutation, the The child node has a greater impact on causing the abnormality of the first performance index; therefore, the difference or the ratio is used for locating the root cause of the abnormality of the first performance index, which can improve the accuracy of locating the root cause of the abnormality of the first performance index.
另一种可能的设计方式中,在上述根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子之前,该方法还包括:采用第一父节点的性能指标的第h个样本值和第一父节点的所有子节点的性能指标的第h个采样值作为训练样本,训练预设注意力模型,得到第一父节点的影响因子预估模型。其中,h在{1,……,H}中依次取值,H≥100,H为整数。In another possible design manner, before calculating the influence factor of each child node in the tree network on the parent node according to the sampled value of the first performance index and the sampled value of the plurality of second performance indicators, the method further includes: : Use the h-th sample value of the performance index of the first parent node and the h-th sample value of the performance index of all child nodes of the first parent node as training samples, train the preset attention model, and obtain the Impact factor prediction model. Among them, h takes values in sequence in {1,...,H}, H≥100, and H is an integer.
上述设计方式,给出了一种获取第一父节点的影响因子预估模型的一种实现方式。The above design method provides an implementation method for obtaining the influence factor prediction model of the first parent node.
另一种可能的设计方式中,上述根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子,包括:针对树形网络中的每个子节点执行以下步骤,以获取每个子节点对父节点的影响因子:根据第二子节点的性能指标的Q个采样值,计算第二子节点的性能指标的第一残差;根据第二子节点的父节点的性能指标的Q个采样值,计算第二子节点的父节点的性能指标的第二残差;计算第一残差和第二残差之间的相关性,并根据第一残差和第二残差之间的相关性,获取第二子节点对父节点的影响因子。In another possible design method, calculating the influence factor of each child node in the tree network on the parent node according to the sampled value of the first performance index and the sampled values of a plurality of second performance indicators, including: for the tree network Perform the following steps for each child node in to obtain the influence factor of each child node on the parent node: according to the Q sampling values of the performance index of the second child node, calculate the first residual of the performance index of the second child node; For the Q sampled values of the performance index of the parent node of the second child node, calculate the second residual of the performance index of the parent node of the second child node; calculate the correlation between the first residual and the second residual, and According to the correlation between the first residual and the second residual, the influence factor of the second child node on the parent node is obtained.
其中,第二子节点是树形网络的任一个子节点;第一残差用于表征第二子节点的性能指标的Q个采样值的波动情况。第二残差用于表征第二子节点的父节点的性能指标的Q个采样值的波动情况。Wherein, the second child node is any child node of the tree network; the first residual is used to represent the fluctuation of the Q sample values of the performance index of the second child node. The second residual is used to represent the fluctuation of the Q sampled values of the performance index of the parent node of the second child node.
在上述设计方式中,先计算第二子节点的性能指标的第一残差和第二子节点的父节点的性能指标的第二残差;然后,利用预设相关系统的计算公式,可以获取第一残差和第二残差之间的相关性,并根据第一残差和第二残差之间的相关性确定第二子节点对父节点的影响因子。由于残差表示了任一个节点的性能指标的Q个采样值的波动情况,如果任一个节点的性能指标的残差与任一个节点的父节点的性能指标的残差的相关性较大,则表示任一个节点的性能指标的Q个采样值的波动情况,对任一个节点的父节点的性能指标的Q个采样值的波动情况影响较大。而如果任一个节点的性能指标的Q个采样值的波动情况,对任一个节点的父节点的性能指标的Q个采样值的波动情况影响越大,则可以确定任一个节点的性能指标对父节点的性指标的影响越大。因此,上 述设计方式中,根据第一残差和第二残差之间的相关性确定第二子节点对父节点的影响因子,再用该影响因子确定导致第一性能指标异常的第二性能指标,就是根据第二子节点的性能指标的Q个采样值的波动情况,对第二子节点的父节点的性能指标的Q个采样值的波动情况的影响程度,确定导致第一性能指标异常的第二性能指标,可以提高第一性能指标异常的根因定位的准确性。In the above design method, the first residual of the performance index of the second child node and the second residual of the performance index of the parent node of the second child node are calculated first; The correlation between the first residual and the second residual is determined, and the influence factor of the second child node on the parent node is determined according to the correlation between the first residual and the second residual. Since the residual represents the fluctuation of the Q sampling values of the performance index of any node, if the residual of the performance index of any node has a large correlation with the residual of the performance index of the parent node of any node, then The fluctuation of the Q sampling values representing the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node. However, if the fluctuation of the Q sampling values of the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node, it can be determined that the performance index of any node affects the parent node. The greater the impact of the node's sex index. Therefore, in the above design method, the influence factor of the second child node on the parent node is determined according to the correlation between the first residual and the second residual, and then the influence factor is used to determine the second performance that causes the abnormality of the first performance index The indicator refers to the degree of influence on the fluctuation of the Q sampling values of the performance index of the second child node on the fluctuation of the Q sampling values of the performance index of the second child node to determine the abnormality of the first performance index. The second performance index can improve the accuracy of locating the root cause of the abnormality of the first performance index.
另一种可能的设计方式中,上述根据第二子节点的性能指标的Q个采样值,计算第二子节点的性能指标的第一残差,包括:利用时间序列分解算法,从第二子节点的性能指标的Q个采样值中,提取第二子节点的性能指标的趋势分量和第二子节点的性能指标的周期分量;将第二子节点的性能指标的Q个采样值,减去趋势分量和周期分量,得到第二子节点的性能指标的第一残差。其中,趋势分量用于表示第二子节点的性能指标的Q个采样值的变化趋势,周期分量用于表示第二子节点的性能指标的Q个采样值的周期性变动。In another possible design manner, calculating the first residual of the performance index of the second child node according to the Q sampled values of the performance index of the second child node includes: using a time series decomposition algorithm to calculate the first residual from the second child node From the Q sampling values of the performance index of the node, extract the trend component of the performance index of the second child node and the periodic component of the performance index of the second child node; subtract the Q sampling values of the performance index of the second child node from the The trend component and the period component are used to obtain the first residual of the performance index of the second child node. The trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node, and the periodic component is used to represent the periodic change of the Q sampled values of the performance index of the second child node.
上述设计方式,给出了一种获取第二子节点的性能指标的第一残差的一种实现方式,其也可以用于获取第二子节点的父节点的性能指标的第二残差。The above design method provides an implementation manner of obtaining the first residual of the performance index of the second child node, which can also be used to obtain the second residual of the performance index of the parent node of the second child node.
另一种可能的设计方式中,上述根据树形网络中各个子节点对父节点的影响因子,确定出树形网络中导致第一性能指标异常的第二性能指标,包括:根据树形网络中各个子节点对父节点的影响因子,计算树形网络中叶子节点对根节点的影响因子;将树形网络中的N个叶子节点的性能指标,确定为导致第一性能指标异常的第二性能指标。其中,N个叶子节点对根节点的影响因子大于树形网络中其他叶子节点对根节点的影响因子,N≥1,N为整数。In another possible design method, determining the second performance index in the tree network that causes the first performance index to be abnormal according to the influence factors of each child node in the tree network on the parent node, including: The influence factor of each child node on the parent node is calculated, and the influence factor of the leaf node in the tree network to the root node is calculated; the performance indicators of N leaf nodes in the tree network are determined as the second performance that causes the abnormality of the first performance indicator index. Among them, the influence factor of N leaf nodes on the root node is greater than the influence factor of other leaf nodes in the tree network on the root node, N≥1, and N is an integer.
可以理解的是,由于叶子节点为树形网络中的末端节点,导致第一性能指标异常的根因就存在与树形网络中的所有叶子节点中。因此,上述设计方式中,直接计算树形网络中的所有叶子节点对根节点的影响因子;再确定对根节点的影响因子较大的叶子节点的性能指标就是导致第一性能指标异常的第二性能指标。It can be understood that since the leaf nodes are terminal nodes in the tree network, the root cause of the abnormality of the first performance index exists in all the leaf nodes in the tree network. Therefore, in the above design method, the influence factor of all leaf nodes in the tree network on the root node is directly calculated; and then the performance index of the leaf node with the larger influence factor on the root node is determined as the second factor that causes the abnormality of the first performance index. Performance.
另一种可能的设计方式中,树形网络中包括第一预设节点,第一预设节点不是叶子节点,第一预设节点的子节点对第一预设节点的影响因子小于或等于预设阈值。上述根据树形网络中各个子节点对父节点的影响因子,确定出树形网络中导致第一性能指标异常的第二性能指标,包括:根据树形网络中各个子节点对父节点的影响因子,计算树形网络中叶子节点对根节点的影响因子,并计算第一预设节点对根节点的影响因子;将叶子节点和第一预设节点中的M个节点的性能指标,确定为导致第一性能指标异常的第二性能指标。其中,M个节点对根节点的影响因子大于树形网络中其他节点对根节点的影响因子,M≥1,M为整数。In another possible design method, the tree network includes a first preset node, the first preset node is not a leaf node, and the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to the preset node. Set the threshold. The above-mentioned determining, according to the influence factor of each child node in the tree network on the parent node, determines the second performance index in the tree network that causes the first performance index to be abnormal, including: according to the influence factor of each child node in the tree network on the parent node , calculate the influence factor of the leaf node on the root node in the tree network, and calculate the influence factor of the first preset node on the root node; determine the performance indicators of the leaf node and the M nodes in the first preset node as causing The second performance index in which the first performance index is abnormal. Among them, the influence factor of M nodes on the root node is greater than the influence factor of other nodes in the tree network on the root node, M≥1, and M is an integer.
可以理解的是,虽然树形网络中的第一预设节点不是叶子节点,但是第一预设节点的子节点对第一预设节点的影响因子不超过预设阈值,即第一预设节点的子节点的性能指标对第一预设节点的性能指标的影响程度较小,则可以认为第一预设节点和叶子节点类似,导致第一性能指标异常的根因也可能存在于树形网络中的第一预设节点中。因此,上述设计方式中,计算树形网络中的所有叶子节点对根节点的影响因子、以及树形网络中的第一预设节点对根节点的影响因子;再从树形网络中的所有叶子节点和第一预设节点中确定对根节点的影响因子较大的节点,其性能指标就是导致第一性能指标异常的第二性能指标。如此,对可能包括导致第一性能指标异常的根因的叶子节点和可能包括导致第一性能指标异常的根因的第一预设节点都进行判断,可以更准确地确定出导致第一性能指标异常的第二性能指标。It can be understood that although the first preset node in the tree network is not a leaf node, the influence factor of the child nodes of the first preset node on the first preset node does not exceed the preset threshold, that is, the first preset node. The performance index of the child node of the first preset node has little influence on the performance index of the first preset node, it can be considered that the first preset node is similar to the leaf node, and the root cause of the abnormal first performance index may also exist in the tree network. in the first preset node in . Therefore, in the above design method, the influence factor of all leaf nodes in the tree network on the root node and the influence factor of the first preset node in the tree network on the root node are calculated; Among the nodes and the first preset node, a node with a larger influence factor on the root node is determined, and its performance index is the second performance index that causes the first performance index to be abnormal. In this way, both the leaf node that may include the root cause that causes the abnormality of the first performance index and the first preset node that may include the root cause that causes the abnormality of the first performance index are judged, and the first performance index that causes the abnormality can be determined more accurately. Abnormal secondary performance indicator.
第二方面,本申请提供一种根因定位装置,该装置包括:获取模块,用于获取第一性能指标的树形网络,获取预设定位周期内采样得到的第一性能指标的采样值和多个第二性能指标的采样 值;计算模块,用于根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子;定位模块,用于根据树形网络中各个子节点对父节点的影响因子,确定出树形网络中导致第一性能指标异常的第二性能指标。In a second aspect, the present application provides a root cause locating device, the device comprising: an acquisition module configured to acquire a tree-shaped network of a first performance index, and obtain a sampling value of the first performance index obtained by sampling within a preset positioning period and sampling values of a plurality of second performance indicators; a calculation module for calculating the influence factor of each child node in the tree network on the parent node according to the sampling values of the first performance index and the sampling values of a plurality of second performance indicators; positioning The module is configured to determine the second performance index in the tree network that causes the abnormality of the first performance index according to the influence factors of each child node in the tree network on the parent node.
其中,树形网络包括第一性能指标和影响第一性能指标的多个第二性能指标,第一性能指标对应树形网络的根节点,多个第二性能指标对应树形网络中除根节点之外的其他节点。每个子节点对父节点的影响因子用于表征每个子节点的性能指标对父节点的性能指标的影响程度。The tree-shaped network includes a first performance indicator and a plurality of second performance indicators that affect the first performance indicator, the first performance indicator corresponds to the root node of the tree-shaped network, and the plurality of second performance indicators corresponds to one of the root nodes in the tree-shaped network. other nodes. The influence factor of each child node on the parent node is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node.
在一种可能的设计方式中,预设定位周期包括Q个采样时刻,Q≥2,Q为整数。获取模块,具体用于在预设定位周期的第q个采样时刻,采样得到第一性能指标的第q个采样值和多个第二性能指标的第q个采样值;其中,q在{1,……,Q}中依次取值。In a possible design manner, the preset positioning period includes Q sampling moments, where Q≥2, and Q is an integer. The acquisition module is specifically used to obtain the qth sampling value of the first performance index and the qth sampling value of the plurality of second performance indicators by sampling at the qth sampling time of the preset positioning cycle; wherein, q is in {1 , ..., Q} and take values in turn.
另一种可能的设计方式中,计算模块,具体用于:针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子;针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的Q个影响因子的平均值,得到第一子节点对第一父节点的影响因子。其中,第一父节点为树形网络中的任一个父节点;不同父节点的影响因子预估模型不同。第一父节点的影响因子预估模型具有从第一父节点的所有子节点的性能指标的采样值中,提取出第一父节点的每个子节点对第一父节点的影响因子的能力。第一子节点是第一父节点的任一个子节点。In another possible design manner, the calculation module is specifically used to: perform for each parent node in the tree network: take the qth sampled value of the performance index of all child nodes of the first parent node as the first The input of the influence factor estimation model of the parent node, run the influence factor estimation model of the first parent node, and output the qth influence factor of each child node of the first parent node on the first parent node; Execute for each child node: calculate the average value of Q influence factors of the first child node to the first parent node, and obtain the influence factor of the first child node to the first parent node. Wherein, the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different. The impact factor estimation model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampled values of performance indicators of all child nodes of the first parent node. The first child node is any child node of the first parent node.
另一种可能的设计方式中,计算模块,具体用于:首先,针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子;然后,若Q个采样时刻中的第p个采样时刻为性能指标的突变时刻,针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的前p-1个影响因子的平均值,得到突变前第一子节点对第一父节点的影响因子;计算第一子节点对第一父节点的第p个-第Q个影响因子的平均值,得到突变后第一子节点对第一父节点的影响因子;最后,计算突变前第一子节点对第一父节点的影响因子与突变后第一子节点对第一父节点的影响因子的差值或比值,将差值或比值作为第一子节点对第一父节点的影响因子。In another possible design method, the calculation module is specifically used for: first, for each parent node in the tree network, execute: take the qth sampled value of the performance index of all the child nodes of the first parent node as Input the influence factor estimation model of the first parent node, run the influence factor estimation model of the first parent node, and output the qth influence factor of each child node of the first parent node on the first parent node; then, if Q The p-th sampling time in the sampling time is the mutation time of the performance index, which is executed for each child node of the first parent node: calculate the average value of the first p-1 impact factors of the first child node to the first parent node, Obtain the influence factor of the first child node on the first parent node before the mutation; The influence factor of the parent node; finally, calculate the difference or ratio of the influence factor of the first child node to the first parent node before the mutation and the influence factor of the first child node to the first parent node after the mutation, and use the difference or ratio as The influence factor of the first child node on the first parent node.
其中,第一父节点为树形网络中的任一个父节点;不同父节点的影响因子预估模型不同。第一父节点的影响因子预估模型具有从第一父节点的所有子节点的性能指标的采样值中,提取出第一父节点的每个子节点对第一父节点的影响因子的能力。p为小于或等于Q的正整数;第一子节点是第一父节点的任一个子节点。Wherein, the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different. The impact factor estimation model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampled values of performance indicators of all child nodes of the first parent node. p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
另一种可能的设计方式中,该装置还包括:模型训练模块,用于在根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子之前,采用第一父节点的性能指标的第h个样本值和第一父节点的所有子节点的性能指标的第h个采样值作为训练样本,训练预设注意力模型,得到第一父节点的影响因子预估模型。其中,h在{1,……,H}中依次取值,H≥100,H为整数。In another possible design manner, the device further includes: a model training module, configured to calculate, according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators, the relationship between each child node in the tree network to the parent Before the impact factor of the node, the h-th sample value of the performance index of the first parent node and the h-th sample value of the performance index of all child nodes of the first parent node are used as training samples, and the preset attention model is trained to obtain The impact factor prediction model of the first parent node. Among them, h takes values in sequence in {1,...,H}, H≥100, and H is an integer.
另一种可能的设计方式中,计算模块,具体用于针对树形网络中的每个子节点执行以下步骤,以获取每个子节点对父节点的影响因子:根据第二子节点的性能指标的Q个采样值,计算第二子节点的性能指标的第一残差;根据第二子节点的父节点的性能指标的Q个采样值,计算第二子节点的父节点的性能指标的第二残差;计算第一残差和第二残差之间的相关性,并根据第一残差和 第二残差之间的相关性,获取第二子节点对父节点的影响因子。其中,第二子节点是树形网络的任一个子节点。第一残差用于表征第二子节点的性能指标的Q个采样值的波动情况。第二残差用于表征第二子节点的父节点的性能指标的Q个采样值的波动情况。In another possible design manner, the calculation module is specifically configured to perform the following steps for each child node in the tree network, so as to obtain the influence factor of each child node on the parent node: Q according to the performance index of the second child node calculate the first residual of the performance index of the second child node; according to the Q sampling values of the performance index of the parent node of the second child node, calculate the second residual of the performance index of the parent node of the second child node difference; calculate the correlation between the first residual and the second residual, and obtain the influence factor of the second child node on the parent node according to the correlation between the first residual and the second residual. Wherein, the second child node is any child node of the tree network. The first residual is used to represent the fluctuation of the Q sampled values of the performance index of the second child node. The second residual is used to represent the fluctuation of the Q sampled values of the performance index of the parent node of the second child node.
另一种可能的设计方式中,计算模块,具体用于:利用时间序列分解算法,从第二子节点的性能指标的Q个采样值中,提取第二子节点的性能指标的趋势分量和第二子节点的性能指标的周期分量;将第二子节点的性能指标的Q个采样值,减去趋势分量和周期分量,得到第二子节点的性能指标的第一残差。其中,趋势分量用于表示第二子节点的性能指标的Q个采样值的变化趋势,周期分量用于表示第二子节点的性能指标的Q个采样值的周期性变动。In another possible design method, the calculation module is specifically used for: using a time series decomposition algorithm to extract the trend component of the performance index of the second child node and the first sub-node from the Q sampling values of the performance index of the second child node. The periodic component of the performance index of the second child node; the Q sampled values of the performance index of the second child node are subtracted from the trend component and the periodic component to obtain the first residual of the performance index of the second child node. The trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node, and the periodic component is used to represent the periodic change of the Q sampled values of the performance index of the second child node.
另一种可能的设计方式中,定位模块,具体用于:根据树形网络中各个子节点对父节点的影响因子,计算树形网络中叶子节点对根节点的影响因子;In another possible design method, the positioning module is specifically used to: calculate the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node in the tree network on the parent node;
将树形网络中的N个叶子节点的性能指标,确定为导致第一性能指标异常的第二性能指标。其中,N个叶子节点对根节点的影响因子大于树形网络中其他叶子节点对根节点的影响因子,N≥1,N为整数。The performance indicators of the N leaf nodes in the tree network are determined as the second performance indicators that cause the first performance indicator to be abnormal. Among them, the influence factor of N leaf nodes on the root node is greater than the influence factor of other leaf nodes in the tree network on the root node, N≥1, and N is an integer.
另一种可能的设计方式中,树形网络中包括第一预设节点,第一预设节点不是叶子节点,第一预设节点的子节点对第一预设节点的影响因子小于或等于预设阈值。定位模块,具体用于:根据树形网络中各个子节点对父节点的影响因子,计算树形网络中叶子节点对根节点的影响因子,并计算第一预设节点对根节点的影响因子;将叶子节点和第一预设节点中的M个节点的性能指标,确定为导致第一性能指标异常的第二性能指标。其中,M个节点对根节点的影响因子大于树形网络中其他节点对根节点的影响因子,M≥1,M为整数。In another possible design method, the tree network includes a first preset node, the first preset node is not a leaf node, and the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to the preset node. Set the threshold. The positioning module is specifically used for: calculating the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node in the tree network on the parent node, and calculating the influence factor of the first preset node on the root node; The performance indicators of M nodes in the leaf node and the first preset node are determined as the second performance indicators that cause the first performance indicator to be abnormal. Among them, the influence factor of M nodes on the root node is greater than the influence factor of other nodes in the tree network on the root node, M≥1, and M is an integer.
第三方面,本申请提供一种电子设备,所述电子设备包括:处理器和用于存储所述处理器可执行指令的存储器;其中,所述处理器被配置为执行所述指令,使得所述电子设备执行如第一方面及其任一种可能的设计方式所述的根因定位方法。In a third aspect, the present application provides an electronic device, the electronic device comprising: a processor and a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions, such that the The electronic device executes the root cause locating method described in the first aspect and any possible design manners thereof.
第四方面,本申请提供一种计算机可读存储介质,该计算机可读存储介质上存储有计算机指令,当所述计算机指令在电子设备上运行时,使得电子设备执行如第一方面及其任一种可能的设计方式所述的根因定位方法。In a fourth aspect, the present application provides a computer-readable storage medium, where computer instructions are stored on the computer-readable storage medium, and when the computer instructions are executed on an electronic device, the electronic device is made to perform the first aspect and any of the methods described above. A possible design approach described in the root cause localization method.
第五方面,本申请提供一种计算机程序产品,当所述计算机程序产品在电子设备上运行时,使得所述电子设备执行如第一方面及其任一种可能的设计方式所述的根因定位方法。In a fifth aspect, the present application provides a computer program product that, when the computer program product is run on an electronic device, causes the electronic device to execute the root cause described in the first aspect and any possible design manner thereof positioning method.
本申请第二方面及其任一种可能的设计方式,以及第三方面、第四方面、和第五方面的所带来的技术效果可参见上述第一方面中不同设计方式所带来的技术效果,此处不再赘述。For the technical effects brought by the second aspect of the present application and any possible design methods thereof, as well as the third aspect, the fourth aspect, and the fifth aspect, reference may be made to the technologies brought about by different design methods in the above-mentioned first aspect The effect will not be repeated here.
附图说明Description of drawings
图1为本申请实施例提供的一种根因定位方法的流程图一;FIG. 1 is a flowchart 1 of a root cause locating method provided by an embodiment of the present application;
图2为本申请实施例提供的一种无线通信系统的树形网络的结构示意图;FIG. 2 is a schematic structural diagram of a tree network of a wireless communication system according to an embodiment of the present application;
图3为本申请实施例提供的一种根因定位方法的流程图二;FIG. 3 is a flow chart 2 of a root cause locating method provided by an embodiment of the present application;
图4为本申请实施例提供的一种根因定位方法的流程图三;FIG. 4 is a flowchart 3 of a root cause locating method provided by an embodiment of the present application;
图5为本申请实施例提供的一种父节点的影响因子预估模型的结构示意图;5 is a schematic structural diagram of an impact factor prediction model of a parent node provided by an embodiment of the present application;
图6为本申请实施例提供的一种根因定位方法的流程图四;6 is a flowchart four of a root cause locating method provided by an embodiment of the present application;
图7为本申请实施例提供的一种根因定位方法的流程图五;7 is a flowchart 5 of a root cause locating method provided by an embodiment of the present application;
图8为本申请实施例提供的一种计算子节点对父节点的影响因子的方法流程图;8 is a flowchart of a method for calculating the influence factor of a child node on a parent node provided by an embodiment of the present application;
图9为本申请实施例提供的一种根因定位方法的流程图六;9 is a flowchart 6 of a root cause locating method provided by an embodiment of the present application;
图10为本申请实施例提供的一种第一性能指标的树形网络的结构示意图;FIG. 10 is a schematic structural diagram of a tree network of a first performance index provided by an embodiment of the present application;
图11为本申请实施例提供的一种根因定位方法的流程图七;FIG. 11 is a flowchart seven of a root cause locating method provided by an embodiment of the present application;
图12为本申请实施例提供的一种确定导致第一性能指标异常的第二性能指标的方法流程图;12 is a flowchart of a method for determining a second performance indicator that causes an abnormality of the first performance indicator according to an embodiment of the present application;
图13为本申请实施例提供的一种根因定位装置的结构示意图;13 is a schematic structural diagram of a root cause locating device according to an embodiment of the present application;
图14为本申请实施例提供的一种电子设备的结构示意图。FIG. 14 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
具体实施方式detailed description
本申请实施例中所述的“第一”和“第二”等是用于区别不同的对象,或者用于区别对同一对象的不同处理,而不是用于描述对象的特定顺序。例如,第一性能指标和第二性能指标是不同的性能指标。The "first" and "second" etc. described in the embodiments of the present application are used to distinguish different objects, or used to distinguish different processing of the same object, rather than being used to describe a specific order of the objects. For example, the first performance indicator and the second performance indicator are different performance indicators.
为了保证任一种设备或系统的正常运行,对该设备或该系统的运维(即运行和维护)是十分重要的。对该设备或该系统的运维主要包括对该设备或系统进行故障定位和故障根因定位。其中,故障定位是指确定该设备或系统的故障;故障根因定位是指确定导致该设备或系统的故障的原因。In order to ensure the normal operation of any kind of equipment or system, the operation and maintenance (ie, operation and maintenance) of the equipment or the system is very important. The operation and maintenance of the device or the system mainly includes fault location and root cause location of the device or system. Among them, fault location refers to determining the fault of the device or system; fault root cause location refers to determining the cause of the failure of the device or system.
目前,该设备或该系统的故障主要分为两种,即性能故障和硬件故障。性能故障是指该设备或系统的一些性能指标的数值异常;性能指标可以包括上述KPI指标和上述业务体验指标。硬件故障是指该设备或系统的硬件出现异常;硬件故障可能是由于硬件损坏或错误操作等导致的。At present, the failures of the equipment or the system are mainly divided into two types, namely performance failures and hardware failures. The performance failure refers to the abnormal value of some performance indicators of the device or system; the performance indicators may include the above KPI indicators and the above service experience indicators. Hardware failure refers to an abnormality in the hardware of the device or system; hardware failure may be caused by hardware damage or incorrect operation.
本申请实施例中主要针对该设备或该系统的性能故障进行故障根因定位,即确定导致性能指标的数值异常的原因。该数值异常的性能指标简称为异常性能指标。由于硬件故障也会导致性能故障的发生;因此,导致异常性能指标的数值异常的原因可以是该设备或该系统的除异常性能指标之外的其他性能指标,也可能是硬件故障。In the embodiment of the present application, the root cause of the fault is mainly located for the performance fault of the device or the system, that is, the cause of the abnormal value of the performance index is determined. The performance index of the numerical anomaly is referred to as the abnormal performance index for short. A hardware failure can also lead to a performance failure; therefore, the cause of the abnormal value of the abnormal performance indicator may be other performance indicators other than the abnormal performance indicator of the device or the system, or a hardware failure.
具体地,针对该设备或该系统的性能故障,可以监控该设备或该系统的一些性能指标,根据这些性能指标的数值,对该设备或该系统进行性能故障定位和性能故障根因定位。定位出性能故障和性能故障根因后,运维人员可以排除该性能故障,以保证用户对该设备或该系统的使用体验。其中,性能故障定位是指定位出该设备或系统的所有性能指标中数值异常的性能指标(即异常性能指标)。性能故障根因定位是指分析导致异常性能指标的数值异常的原因。Specifically, for the performance fault of the device or the system, some performance indicators of the device or the system can be monitored, and according to the values of the performance indicators, the performance fault location and the root cause of the performance fault can be located for the device or the system. After locating the performance fault and the root cause of the performance fault, the operation and maintenance personnel can eliminate the performance fault to ensure the user experience of the device or the system. Wherein, the performance fault location is to specify a performance index (ie, an abnormal performance index) with an abnormal value among all the performance indexes of the device or system. Root cause location of performance faults refers to analyzing the causes of numerical anomalies of abnormal performance indicators.
相关方案由人工进行性能故障定位和性能故障根因定位,确定出异常性能指标、以及导致异常性能指标的数值异常的性能指标。由于人工速度有限,则存在根因定位的效率较低的问题。并且,一些设备或系统的性能指标的数量庞大,人工进行根因定位时容易出错,降低了根因定位的准确度。In the related scheme, performance fault location and root cause location of performance faults are performed manually, and abnormal performance indicators and performance indicators that cause numerical abnormality of abnormal performance indicators are determined. Due to the limited manual speed, there is a problem that the efficiency of root cause localization is low. In addition, some devices or systems have a huge number of performance indicators, and it is easy to make mistakes when manually locating the root cause, which reduces the accuracy of root cause locating.
例如,在无线通信系统的运维中,通过无线通信系统有大量的性能指标,则异常性能指标的数量也很大。以某个运营商的无线通信系统为例,其每天的异常性能指标的数量占无线通信系统的总网元个数的3%左右,那么,对每天的异常性能指标进行性能根因定位时需要分析的性能指标的数量更多。For example, in the operation and maintenance of a wireless communication system, there are a large number of performance indicators through the wireless communication system, and the number of abnormal performance indicators is also large. Taking the wireless communication system of a certain operator as an example, the number of abnormal performance indicators per day accounts for about 3% of the total number of network elements in the wireless communication system. The number of performance metrics analyzed is greater.
进一步,相关方案利用异常性能指标的故障树,对异常性能指标进行根因定位。其中,该故障树是一个有向无环图(directed acyclic graph,DAG),由代表性能指标的节点和连接这些节点的有向边构成,节点间的有向边代表了节点间的互相关系(由父节点指向其子节点)。该故障树的根节点对应异常性能指标,该故障树除根节点之外的其他节点对应对异常性能指标有影响的其他性能指标。Further, the related scheme uses the fault tree of abnormal performance indicators to locate the root cause of abnormal performance indicators. Among them, the fault tree is a directed acyclic graph (DAG), which consists of nodes representing performance indicators and directed edges connecting these nodes, and the directed edges between nodes represent the mutual relationship between nodes ( from the parent node to its child node). The root node of the fault tree corresponds to the abnormal performance index, and other nodes of the fault tree except the root node correspond to other performance indicators that have an impact on the abnormal performance index.
具体的,分别判断该故障树中的各个子节点的性能指标的数值是否大于该性能指标对应的门限值,如果一个子节点对应的性能指标的数值大于该性能指标对应的门限值,则可以确定该子节 点的性能指标导致异常性能指标的数值异常。Specifically, it is respectively judged whether the value of the performance index of each child node in the fault tree is greater than the threshold value corresponding to the performance index. If the value of the performance index corresponding to a child node is greater than the threshold value corresponding to the performance index, then It can be determined that the performance index of the child node causes the numerical value of the abnormal performance index to be abnormal.
但是,采用对比门限值的方式进行根因定位,可能会因为门限值设置不准确而影响根因定位的准确性。However, using the method of comparing the threshold value for root cause localization may affect the accuracy of root cause localization due to the inaccurate setting of the threshold value.
一方面,各个子节点的性能指标对异常性能指标可能都有不同程度的影响,而单纯采用门限值判断的方式确定各个子节点的性能指标是否对异常性能指标产生影响比较武断,可能会将一些对异常性能指标的影响小于对应门限值的性能指标过滤掉,还可能将一些对异常性能指标没有影响的性能指标确定为导致异常性能指标的根因。这样,会影响根因定位的准确性。On the one hand, the performance indicators of each sub-node may have different degrees of influence on the abnormal performance indicators, and it is rather arbitrary to determine whether the performance indicators of each sub-node have an impact on the abnormal performance indicators by simply using the threshold value judgment method, which may Some performance indicators whose impact on abnormal performance indicators is less than the corresponding threshold value are filtered out, and some performance indicators that have no effect on abnormal performance indicators may also be determined as the root cause of abnormal performance indicators. In this way, the accuracy of root cause location will be affected.
另一方面,由于各个子节点与根节点之间的距离不同;因此,针对不同的子节点需要设置不同的门限值用于判断子节点的性能指标是否对异常性能指标产生影响。子节点距离根节点越远,该子节点与根节点的影响关系越不明显,则该子节点对应的门限值越难以确定。并且,如果子节点对应的门限值设置不准确,则会影响根因定位的准确性。On the other hand, since the distance between each child node and the root node is different; therefore, different thresholds need to be set for different child nodes to judge whether the performance index of the child node has an impact on the abnormal performance index. The farther the child node is from the root node, the less obvious the influence relationship between the child node and the root node is, and the more difficult it is to determine the threshold value corresponding to the child node. Moreover, if the threshold value corresponding to the child node is not set accurately, the accuracy of root cause location will be affected.
进一步的,故障树上各个节点的性能指标的门限值难以确定,设置故障树中所有节点的性能指标的门限值也需要耗费较大的人力和成本。Further, it is difficult to determine the threshold value of the performance index of each node on the fault tree, and setting the threshold value of the performance index of all the nodes in the fault tree also requires a lot of manpower and cost.
本申请实施例提供一种根因定位方法及装置、电子设备,通过该根因定位方法可以提高根因定位的准确度。该根因定位方法可以应用于任何需要根因定位的场景,例如,无线通信系统,或其他的能够确定性能指标之间的影响关系的场景。Embodiments of the present application provide a root cause locating method, device, and electronic device, and the root cause locating method can improve the accuracy of root cause locating. The root cause locating method can be applied to any scenario that requires root cause locating, for example, a wireless communication system, or other scenarios where the influence relationship between performance indicators can be determined.
下面将结合附图对本申请实施例的实施方式进行详细描述。The implementation of the embodiments of the present application will be described in detail below with reference to the accompanying drawings.
参见图1,本申请实施例提供的一种根因定位方法可以包括S101-S104。Referring to FIG. 1 , a root cause location method provided by this embodiment of the present application may include S101-S104.
S101、电子设备获取第一性能指标的树形网络;其中,树形网络包括第一性能指标和影响第一性能指标的多个第二性能指标,第一性能指标对应树形网络的根节点,多个第二性能指标对应树形网络中除根节点之外的其他节点。S101. The electronic device obtains a tree-shaped network of a first performance index; wherein, the tree-shaped network includes a first performance index and a plurality of second performance indicators that affect the first performance index, and the first performance index corresponds to a root node of the tree-shaped network, The plurality of second performance indicators correspond to other nodes in the tree network except the root node.
其中,第一性能指标为存在异常的、需要进行根因定位的一个性能指标。第一性能指标的树形网络是一个有向无环图DAG,由代表性能指标的节点和连接这些节点的有向边构成,节点间的有向边代表了节点间的影响关系(由子节点指向其父节点)。该树形网络中除根节点之外的任意一个节点对应至少一个第二性能指标,并且,该树形网络中除根节点之外的不同节点所对应的性能指标不同。树形网络也可以称为故障树。Wherein, the first performance index is a performance index that is abnormal and needs root cause localization. The tree network of the first performance indicator is a directed acyclic graph DAG, which consists of nodes representing performance indicators and directed edges connecting these nodes. The directed edges between nodes represent the influence relationship between nodes (directed by child nodes). its parent node). Any node other than the root node in the tree network corresponds to at least one second performance index, and the performance indexes corresponding to different nodes other than the root node in the tree network are different. A tree network can also be called a fault tree.
其中,第一性能指标可以为电子设备的所有性能指标中的任意一个、无线通信网络的所有性能指标中的任意一个或者其他电子设备的所有性能指标中的任意一个等等。多个第二性能指标可以包括可能会对第一性能指标产生影响的多个性能指标。该多个第二性能指标和第一性能指标不同。The first performance indicator may be any one of all performance indicators of the electronic device, any one of all performance indicators of a wireless communication network, or any one of all performance indicators of other electronic devices, and so on. The plurality of second performance indicators may include a plurality of performance indicators that may affect the first performance indicator. The plurality of second performance indicators are different from the first performance indicators.
例如,第一性能指标为用户速率,对该第一性能指标产生影响的多个性能指标可以包括:直接影响第一性能指标的物理资源块(Physical Resource Block,PRB)利用率、PRB利用率的分布比例、以及控制信道单元(control channel element,CCE)利用率等等;间接影响第一性能指标的路径损耗等等。For example, the first performance indicator is the user rate, and the multiple performance indicators that affect the first performance indicator may include: physical resource block (Physical Resource Block, PRB) utilization that directly affects the first performance indicator, PRB utilization Distribution ratio, and control channel element (control channel element, CCE) utilization rate, etc.; the path loss that indirectly affects the first performance index, etc.
本申请实施例中,电子设备可以接收运维人员输入的第一性能指标对应树形网络。或者,电子设备可以接收来自其他电子设备的第一性能指标对应的树形网络。或者,电子设备可以接收存在异常的、需要进行根因定位的一个性能指标,将该性能指标作为第一性能指标;再根据电子设备的所有性能指标之间的联系,用电子设备的多个性能指标构建该第一性能指标的树形网络。In this embodiment of the present application, the electronic device may receive a tree network corresponding to the first performance index input by the operation and maintenance personnel. Alternatively, the electronic device may receive a tree-shaped network corresponding to the first performance index from other electronic devices. Alternatively, the electronic device may receive an abnormal performance index that needs root cause localization, and use the performance index as the first performance index; The index constructs a tree network of the first performance index.
其中,该树形网络可以是根据人工经验构建的,或者,根据电子设备的所有性能指标之间的 联系,构建得到的该第一性能指标的树形网络。Wherein, the tree-shaped network may be constructed according to artificial experience, or the tree-shaped network of the obtained first performance index may be constructed according to the relationship between all performance indicators of the electronic device.
示例性地,如图2所示的一种无线通信系统的树形网络,该树形网络包括根节点和多个子节点。其中,根节点对应的第一性能指标为用户速率。多个子节点包括表示资源容量的子节点、表示空口质量的子节点、表示某些其他性能指标的子节点、表示弱覆盖的子节点和表示信号干扰情况的子节点。表示资源容量的子节点对应的性能指标包括物理资源块(Physical Resource Block,PRB)利用率、PRB利用率的分布比例、以及控制信道单元(control channel element,CCE)利用率。表示空口质量的子节点对应的性能指标包括误块率(block error rate,BLER)、调制与编码策略(Modulation and Coding Scheme,MCS)的平均方案等级、以及信道质量指示(Channel Quality Indicator,CQI)。表示某些其他性能指标的子节点对应的性能指标包括上行干扰功率和切换成功率。表示弱覆盖的子节点对应的性能指标包括路径损耗、以及无线通信系统中的所有用户设备和基站之间的距离的平均值。表示信号干扰情况的子节点对应的性能指标包括上行干扰功率。Exemplarily, as shown in FIG. 2, a tree-shaped network of a wireless communication system includes a root node and a plurality of child nodes. The first performance indicator corresponding to the root node is the user rate. The plurality of child nodes include a child node representing resource capacity, a child node representing air interface quality, a child node representing some other performance indicators, a child node representing weak coverage, and a child node representing signal interference situation. The performance indicators corresponding to child nodes representing resource capacity include physical resource block (Physical Resource Block, PRB) utilization, distribution ratio of PRB utilization, and control channel element (control channel element, CCE) utilization. The performance indicators corresponding to the sub-nodes representing the air interface quality include block error rate (block error rate, BLER), average scheme level of modulation and coding strategy (Modulation and Coding Scheme, MCS), and channel quality indicator (Channel Quality Indicator, CQI) . The performance indicators corresponding to the child nodes representing some other performance indicators include uplink interference power and handover success rate. The performance index corresponding to the sub-node representing the weak coverage includes the path loss and the average value of the distance between all the user equipments and the base station in the wireless communication system. The performance index corresponding to the child node representing the signal interference situation includes the uplink interference power.
S102、电子设备获取预设定位周期内采样得到的第一性能指标的采样值和多个第二性能指标的采样值。S102. The electronic device acquires the sampled value of the first performance index and the sampled values of a plurality of second performance indicators that are sampled within a preset positioning period.
电子设备可以获取当前时刻之前的、且与当前时刻相邻的一个预设定位周期内采集到的第一性能指标的采样值和多个第二性能指标的采样值。The electronic device may acquire the sampled value of the first performance index and the sampled values of a plurality of second performance indicators collected in a preset positioning period before the current time and adjacent to the current time.
其中,预设定位周期可以为一天或七天等等。该第一性能指标的采样值可以包括一个或多个采样值;该多个第二性能指标中每个第二性能指标的采样值可以包括一个或多个采样值。The preset positioning period may be one day or seven days, and so on. The sampled value of the first performance indicator may include one or more sampled values; and the sampled value of each second performance indicator in the plurality of second performance indicators may include one or more sampled values.
在一些实施例中,电子设备还可以按照预设采样周期,实时采集电子设备的所有性能指标(包括第一性能指标和多个第二性能指标)的采样值,并将采集到的采样值和对应的采样时刻进行保存。电子设备执行S102的具体过程可以包括:获取采样时刻在上述预设定位周期内的第一性能指标的所有采样值和多个第二性能指标的所有采样值。In some embodiments, the electronic device may also collect sampling values of all performance indicators (including the first performance index and a plurality of second performance indicators) of the electronic device in real time according to a preset sampling period, and combine the collected sampling values and The corresponding sampling time is saved. The specific process of performing S102 by the electronic device may include: acquiring all sampled values of the first performance index and all sampled values of a plurality of second performance indicators at the sampling time within the above-mentioned preset positioning period.
其中,预设采样周期可以为一个小时或一天等等。预设定位周期可以包括一个或多个预设采样周期,每个预设采样周期对应一个采样时刻。The preset sampling period may be one hour or one day, and so on. The preset positioning period may include one or more preset sampling periods, and each preset sampling period corresponds to a sampling moment.
S103、电子设备根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子;其中,每个子节点对父节点的影响因子用于表征每个子节点的性能指标对父节点的性能指标的影响程度。S103, the electronic device calculates the influence factor of each child node on the parent node in the tree network according to the sampled value of the first performance index and the sampled value of a plurality of second performance indexes; wherein, the influence factor of each child node on the parent node is calculated as It is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node.
电子设备可以针对树形网络中的每个子节点执行:利用该子节点的性能指标的采样值和该子节点的父节点的性能指标的采样值,计算该子节点对其父节点的影响因子。其中,该子节点的父节点可以有一个或多个,该子节点对其父节点的影响因子也可以有一个或多个。该子节点对其父节点的影响因子和该子节点的父节点为一一对应。The electronic device may perform for each child node in the tree network: using the sampled value of the performance index of the child node and the sampled value of the performance index of the parent node of the child node, calculate the influence factor of the child node on its parent node. The child node may have one or more parent nodes, and the child node may also have one or more influence factors on its parent node. The influence factor of the child node to its parent node is in a one-to-one correspondence with the parent node of the child node.
或者,电子设备还可以针对树形网络中的每个父节点执行:利用该父节点的性能指标的采样值和该父节点的子节点的性能指标的采样值,计算该父节点的子节点对该父节点的影响因子。其中,该父节点的子节点可以有一个或多个,该父节点的子节点对该父节点的影响因子也可以有一个或多个。该父节点的子节点对该父节点的影响因子和该父节点的子节点为一一对应。Alternatively, the electronic device may also perform for each parent node in the tree network: using the sampled value of the performance index of the parent node and the sampled value of the performance index of the child nodes of the parent node, calculate the pair of child nodes of the parent node. The influence factor of this parent node. The parent node may have one or more child nodes, and the parent node's child nodes may also have one or more influence factors on the parent node. The influence factors of the child nodes of the parent node to the parent node are in a one-to-one correspondence with the child nodes of the parent node.
其中,节点间的影响关系的关系强度,或者说节点间的性能指标的影响程度可以通过影响因子来表示。Among them, the relationship strength of the influence relationship between the nodes, or the influence degree of the performance index between the nodes can be represented by the influence factor.
S104、电子设备根据树形网络中各个子节点对父节点的影响因子,确定出树形网络中导致第一性能指标异常的第二性能指标。S104. The electronic device determines, according to the influence factors of each child node in the tree network on the parent node, the second performance index in the tree network that causes the first performance index to be abnormal.
电子设备根据各个子节点对父节点的影响因子,可以选择对根节点的影响程度较大的节点; 该对根节点的影响程度较大的节点的性能指标,就是导致第一性能指标异常的第二性能指标。进而,可以针对该导致第一性能指标异常的第二性能指标,对电子设备进行维修,以通过调整该导致第一性能指标异常的第二性能指标,实现解决第一性能指标异常的问题。According to the influence factor of each child node on the parent node, the electronic device can select a node with a greater impact on the root node; the performance index of the node with a greater impact on the root node is the first performance index that causes the abnormality of the first performance index. Two performance indicators. Furthermore, the electronic device may be repaired for the second performance index causing the abnormality of the first performance index, so as to solve the problem of the abnormality of the first performance index by adjusting the second performance index causing the abnormality of the first performance index.
可以理解的是,上述方法中,可以根据第一性能指标的采样值和多个第二性能指标的采样值,计算树形网络中各个子节点对父节点的影响因子。一个子节点对父节点的影响因子用于表征该子节点的性能指标对父节点的性能指标的影响程度。其中,由于上述第一性能指标对应树形网络的根节点,上述多个第二性能指标对应树形网络中除根节点之外的其他节点;因此可以得出:上述第二性能指标对应的节点可以为:第一性能指标对应的根节点的子节点或者该第一性能指标对应的根节点的子节点的子节点等。由此可见,该树形网络中各个子节点对父节点的影响因子,可以反映出上述多个第二性能指标对第一性能指标的影响程度。因此,根据上述树形网络中各个子节点对父节点的影响因子,可以确定出树形网络中导致第一性能指标异常的第二性能指标。It can be understood that, in the above method, the influence factor of each child node in the tree network on the parent node can be calculated according to the sampled value of the first performance index and the sampled value of a plurality of second performance indicators. The influence factor of a child node on the parent node is used to represent the influence degree of the performance index of the child node on the performance index of the parent node. Wherein, since the above-mentioned first performance index corresponds to the root node of the tree-shaped network, the above-mentioned multiple second performance indicators correspond to other nodes in the tree-shaped network except the root node; therefore, it can be concluded that the node corresponding to the above-mentioned second performance index can be is: a child node of the root node corresponding to the first performance index or a child node of the child node of the root node corresponding to the first performance index, and the like. It can be seen that the influence factor of each child node in the tree network on the parent node can reflect the degree of influence of the above-mentioned multiple second performance indicators on the first performance indicator. Therefore, according to the influence factor of each child node in the above-mentioned tree network on the parent node, the second performance index in the tree network that causes the abnormality of the first performance index can be determined.
本申请提供的方法,不需要人工参与,便可以根据周期性采样得到的第一性能指标的采样值和多个第二性能指标的采样值,通过计算确定出树形网络中导致第一性能指标异常的第二性能指标。这样,相比于人工参与进行根因定位,可以提升根因定位的效率和准确度,还可以降低成本。The method provided by the present application does not require manual participation, and can determine, through calculation, the sampling value of the first performance index obtained by periodic sampling and the sampling values of a plurality of second performance indicators, which causes the first performance index in the tree network. Abnormal secondary performance indicator. In this way, compared with manual participation in root cause localization, the efficiency and accuracy of root cause localization can be improved, and the cost can also be reduced.
进一步地,如果获取到更多的影响第一性能指标的第二性能指标,只要已知更多的影响第一性能指标的第二性能指标对第一性能指标的影响路径,就可以对第一性能指标的树形网络进行扩展,使得第一性能指标的树形网络更精细,进而可以确定出更精准的导致第一性能指标异常的第二性能指标。由于电子设备可以自动计算出树形网络中各个子节点对父节点的影响因子,因此,扩展第一性能指标的树形网络也基本不会增加根因定位的实现难度。Further, if more second performance indicators that affect the first performance indicator are obtained, as long as the impact paths of more second performance indicators that affect the first performance The tree-shaped network of performance indicators is expanded, so that the tree-shaped network of the first performance indicator is more refined, and then a more accurate second performance indicator that causes the abnormality of the first performance indicator can be determined. Since the electronic device can automatically calculate the influence factor of each child node in the tree network on the parent node, expanding the tree network of the first performance index will not increase the difficulty of root cause localization.
本申请实施例中,预设定位周期包括Q个采样时刻,Q≥2,Q为整数。电子设备在预设定位周期的第q个采样时刻,采样得到第一性能指标的第q个采样值和多个第二性能指标的第q个采样值;其中,q在{1,……,Q}中依次取值。In this embodiment of the present application, the preset positioning period includes Q sampling moments, where Q≥2, and Q is an integer. The electronic device obtains the qth sampling value of the first performance index and the qth sampling value of the plurality of second performance indicators by sampling at the qth sampling time of the preset positioning period; Q} takes values in turn.
其中,多个第二性能指标的第q个采样值包括每个第二性能指标的第q个采样值。预设定位周期内采样得到的第一性能指标的采样值包括第一性能指标的Q个采样值;预设定位周期内采样得到的多个第二性能指标的采样值包括各个第二性能指标的Q个采样值。Wherein, the qth sampled value of the plurality of second performance indicators includes the qth sampled value of each second performance indicator. The sampled values of the first performance indicator sampled in the preset positioning period include Q sampled values of the first performance indicator; the sampled values of multiple second performance indicators sampled in the preset positioning period include the sampling values of each second performance indicator. Q sample values.
本申请实施例中,电子设备根据第一性能指标的Q个采样值和多个第一性能指标中每个第二性能指标的Q个采样值,可以先计算每个采样时刻的树形网络中各个子节点对父节点的影响因子,再计算预设定位周期内的树形网络中各个子节点对父节点的影响因子(即树形网络中各个子节点对父节点的影响因子)。In this embodiment of the present application, the electronic device may first calculate the number of samples in the tree network at each sampling moment according to the Q sampled values of the first performance indicator and the Q sampled values of each of the second performance indicators in the plurality of first performance indicators. The influence factor of each child node on the parent node is calculated, and then the influence factor of each child node on the parent node in the tree network within the preset positioning period (ie the influence factor of each child node on the parent node in the tree network) is calculated.
本申请实施例中,电子设备针对树形网络中的每个父节点,先计算每个采样时刻的各个子节点对父节点的影响因子,再计算预设定位周期内的各个子节点对父节点的影响因子(即各个子节点对父节点的影响因子)。具体地,如图3所示,S103可以包括S301-S302。为了更清楚说明计算每个父节点的子节点对父节点的影响因子的过程,将每个父节点称为第一父节点。In the embodiment of the present application, for each parent node in the tree network, the electronic device first calculates the influence factor of each child node on the parent node at each sampling moment, and then calculates the impact factor of each child node on the parent node within the preset positioning period The impact factor (that is, the impact factor of each child node on the parent node). Specifically, as shown in FIG. 3 , S103 may include S301-S302. To illustrate the process of calculating the influence factor of each parent node's child nodes on the parent node more clearly, each parent node is referred to as the first parent node.
S301、电子设备针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子;其中,q在{1,……,Q}中依次取值。S301, the electronic device performs for each parent node in the tree network: taking the qth sampled value of the performance index of all child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, and running The influence factor estimation model of the first parent node outputs the qth influence factor of each child node of the first parent node on the first parent node; wherein, q takes values in sequence in {1,...,Q}.
其中,第一父节点为树形网络中的任一个父节点;不同父节点的影响因子预估模型不同。第一父节点的影响因子预估模型具有从第一父节点的所有子节点的性能指标的采样值中,提取出第 一父节点的每个子节点对第一父节点的影响因子的能力。Wherein, the first parent node is any parent node in the tree network; the impact factor estimation models of different parent nodes are different. The impact factor prediction model of the first parent node has the ability to extract the impact factor of each child node of the first parent node on the first parent node from the sampling values of the performance indicators of all child nodes of the first parent node.
本申请实施例中,电子设备预先保存有树形网络中每个父节点的影响因子预估模型。第一父节点的影响因子预估模型具有从第一父节点的所有子节点的性能指标的采样值中,提取出第一父节点的每个子节点对第一父节点的影响因子、以及第一父节点的性能指标的预估值的能力。In the embodiment of the present application, the electronic device pre-stores the influence factor prediction model of each parent node in the tree network. The impact factor estimation model of the first parent node has the function of extracting the impact factor of each child node of the first parent node on the first parent node and the first parent node from the sampling values of the performance indicators of all child nodes of the first parent node. The ability to estimate the performance indicators of the parent node.
本申请实施例中,电子设备从{1,……,Q}中依次取值后,可以得到第一父节点的每个子节点对第一父节点的Q个影响因子。In the embodiment of the present application, after the electronic device sequentially takes values from {1, .
S302、电子设备针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的Q个影响因子的平均值,得到第一子节点对第一父节点的影响因子;其中,第一子节点是第一父节点的任一个子节点。S302, the electronic device performs for each child node of the first parent node: calculate the average value of Q influence factors of the first child node on the first parent node, and obtain the influence factor of the first child node on the first parent node; wherein, The first child node is any child node of the first parent node.
需要说明的是,为了更清楚说明计算每个子节点对第一父节点的影响因子的过程,将计算对第一父节点的影响因子时的每个父节点称为第一父节点。It should be noted that, in order to explain the process of calculating the influence factor of each child node on the first parent node more clearly, each parent node when calculating the influence factor on the first parent node is referred to as the first parent node.
电子设备将第一子节点对第一父节点的Q个影响因子的和除以Q,得到第一子节点对第一父节点的Q个影响因子的平均值,并将该平均值作为第一子节点对第一父节点的影响因子。进而,电子设备可以得到第一父节点的各个子节点对第一父节点的影响因子。The electronic device divides the sum of the Q influence factors of the first child node to the first parent node by Q to obtain the average value of the Q influence factors of the first child node to the first parent node, and uses the average value as the first The influence factor of the child node on the first parent node. Furthermore, the electronic device can obtain the influence factors of each child node of the first parent node on the first parent node.
可以理解的是,由于不同采样时刻下采集到的性能指标(包括第一性能指标和多个第二性能指标)的数值不同,而数值不同的性能指标所确定出的性能指标之间的影响程度也可能不同。因此,电子设备分别利用各个采用时刻采集到的的采样值计算各个采样时刻的影响因子,再对所有采样时刻的影响因子求平均,确定出每个子节点对父节点的唯一的影响因子。这样分别计算各个采样时刻的影响因子再求平均的方法,相较于用所有采样时刻下采集到的采样值计算每个子节点对父节点的唯一的影响因子,得到的影响因子的准确度更高。It can be understood that, due to the different values of the performance indicators (including the first performance indicator and multiple second performance indicators) collected at different sampling times, the degree of influence between the performance indicators determined by the performance indicators with different values may also be different. Therefore, the electronic device calculates the influence factor of each sampling time by using the sampling values collected at each adoption time, and then averages the influence factors of all the sampling time to determine the unique influence factor of each child node on the parent node. In this way, the method of calculating the influence factors of each sampling time separately and then averaging, compared with using the sampling values collected at all sampling times to calculate the unique influence factor of each child node on the parent node, the accuracy of the obtained influence factor is higher. .
本申请实施例中,第一性能指标异常可能是第一性能指标缓慢恶化的结果,或者是突发性的结果。In this embodiment of the present application, the abnormality of the first performance index may be a result of slow deterioration of the first performance index, or a sudden result.
其中,对于第一性能指标缓慢恶化导致的第一性能指标异常,第一性能指标和多个第二性能指标的采样值的变化都较小,不同采样时刻下多个第二性能指标对第一性能指标的影响程度的变化也不大;因此,可以采用S302中的对第一子节点对第一父节点的Q个影响因子求平均的方式,计算第一子节点对第一父节点的影响因子。Wherein, for the abnormality of the first performance indicator caused by the slow deterioration of the first performance indicator, the changes of the sampling values of the first performance indicator and the plurality of second performance indicators are both small, and the plurality of second performance indicators at different sampling times have little effect on the first performance indicator. The degree of influence of the performance index does not change much; therefore, the method of averaging the Q influence factors of the first child node on the first parent node in S302 can be used to calculate the influence of the first child node on the first parent node factor.
其中,对于突发性的第一性能指标异常,第一性能指标和多个第二性能指标的采样值都存在一个较大的突变,则突变前后的多个第二性能指标对第一性能指标的影响程度的变化可能较大,也就是说,突变前的第一子节点对第一父节点的影响因子和突变后的第一子节点对第一父节点的影响因子存在较大的差异。而且,由于第一性能指标异常是突发性的,则任一个子节点对父节点的影响因子在突变前后的变化越大,该任一个子节点对导致第一性能指标异常的影响就越大。因此,对于突发性的第一性能指标异常,得到每个采样时刻的树形网络中各个子节点对父节点的影响因子之后,获取突变前后的各个子节点对父节点的影响因子的变化情况,根据变化情况计算树形网络中各个子节点对父节点的影响因子。Wherein, for the sudden abnormality of the first performance index, there is a large mutation in the sampling values of the first performance index and the plurality of second performance indicators, then the plurality of second performance indicators before and after the mutation have a significant effect on the first performance index. The change of the influence degree of , that is to say, the influence factor of the first child node before the mutation on the first parent node and the influence factor of the first child node after the mutation on the first parent node are quite different. Moreover, since the abnormality of the first performance index is sudden, the greater the change of the influence factor of any child node on the parent node before and after the mutation, the greater the influence of any child node on the abnormality of the first performance index. . Therefore, for the sudden abnormality of the first performance index, after obtaining the influence factor of each child node on the parent node in the tree network at each sampling time, obtain the change of the influence factor of each child node on the parent node before and after the mutation , and calculate the influence factor of each child node on the parent node in the tree network according to the change.
本申请实施例中,对于突发性的第一性能指标异常,电子设备针对树形网络中的每个父节点计算子节点对父节点的影响因子的具体过程,参见图4所示,S103可以包括S401-S402。为了更清楚说明计算每个父节点的子节点对父节点的影响因子的过程,将每个父节点称为第一父节点。In the embodiment of the present application, for the sudden first performance index abnormality, the electronic device calculates the specific process of the influence factor of the child node on the parent node for each parent node in the tree network, as shown in FIG. 4 , S103 can be Including S401-S402. To illustrate the process of calculating the influence factor of each parent node's child nodes on the parent node more clearly, each parent node is referred to as the first parent node.
S401、电子设备针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估 模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子;其中,q在{1,……,Q}中依次取值。S401, the electronic device performs for each parent node in the tree network: taking the qth sampling value of the performance index of all the child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, and running The influence factor estimation model of the first parent node outputs the qth influence factor of each child node of the first parent node on the first parent node; wherein, q takes values in sequence in {1,...,Q}.
需要说明的是,S401的具体过程可以参见上述S301的详细描述,本申请实施例这里不予赘述。It should be noted that, for the specific process of S401, reference may be made to the detailed description of the foregoing S301, which is not repeated in this embodiment of the present application.
S402、若Q个采样时刻中的第p个采样时刻为性能指标的突变时刻,电子设备针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的前p-1个影响因子的平均值,得到突变前第一子节点对第一父节点的影响因子;计算第一子节点对第一父节点的第p个-第Q个影响因子的平均值,得到突变后第一子节点对第一父节点的影响因子;计算突变前第一子节点对第一父节点的影响因子与突变后第一子节点对第一父节点的影响因子的差值或比值,将差值或比值作为第一子节点对第一父节点的影响因子。S402. If the p-th sampling time in the Q sampling times is the sudden change of the performance index, the electronic device performs for each child node of the first parent node: calculate the first p-1 times of the first child node to the first parent node Calculate the average value of the impact factors to obtain the impact factor of the first child node on the first parent node before the mutation; The influence factor of a child node on the first parent node; calculate the difference or ratio between the influence factor of the first child node on the first parent node before the mutation and the influence factor of the first child node on the first parent node after the mutation. The value or ratio is used as the influence factor of the first child node to the first parent node.
其中,p为小于或等于Q的正整数;第一子节点是第一父节点的任一个子节点。Wherein, p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
其中,上述差值可以是突变后第一子节点对第一父节点的影响因子减去突变前第一子节点对第一父节点的影响因子得到的,上述比值可以是突变后第一子节点对第一父节点的影响因子除以突变前第一子节点对第一父节点的影响因子得到的。The difference value may be obtained by subtracting the influence factor of the first child node on the first parent node after the mutation from the influence factor of the first child node on the first parent node before the mutation, and the ratio may be the first child node after the mutation. It is obtained by dividing the impact factor on the first parent node by the impact factor on the first parent node of the first child node before mutation.
电子设备在接受到性能指标的突变时刻为第p个采样时刻的情况下,确定第一性能指标异常为突发性的。电子设备将第一子节点对第一父节点的前p-1个影响因子的和除以第一差值,第一差值等于p-1,得到突变前第一子节点对第一父节点的影响因子;再将第一子节点对第一父节点的第p个-第Q个影响因子的和除以第二差值,第二差值等于Q-p,得到突变后第一子节点对第一父节点的影响因子。最后,电子设备将突变后第一子节点对第一父节点的影响因子减去突变前第一子节点对第一父节点的影响因子的差值,作为第一子节点对第一父节点的影响因子;或者,将突变后第一子节点对第一父节点的影响因子除以突变前第一子节点对第一父节点的影响因子的比值,作为第一子节点对第一父节点的影响因子。进而,电子设备可以得到第一父节点的各个子节点对第一父节点的影响因子。The electronic device determines that the abnormality of the first performance index is sudden when receiving the sudden change of the performance index as the pth sampling time. The electronic device divides the sum of the first p-1 influence factors of the first child node to the first parent node by the first difference, and the first difference is equal to p-1, to obtain the first child node to the first parent node before mutation Then divide the sum of the p-th - Q-th influencing factors of the first child node to the first parent node by the second difference, and the second difference is equal to Qp, to obtain the first child node after mutation to the first The influence factor of a parent node. Finally, the electronic device subtracts the difference between the influence factor of the first child node on the first parent node after the mutation and the influence factor of the first child node on the first parent node before the mutation, as the difference between the first child node and the first parent node. Influence factor; or, divide the influence factor of the first child node to the first parent node after the mutation by the ratio of the influence factor of the first child node to the first parent node before the mutation, as the ratio of the first child node to the first parent node Impact factor. Furthermore, the electronic device can obtain the influence factors of each child node of the first parent node on the first parent node.
可以理解的是,由于第一性能指标异常可能是突发性的结果。对于突发性的第一性能指标异常,第一性能指标的采样值和多个第二性能指标的采样值都存在一个较大的突变,则突变前后的多个第二性能指标对第一性能指标的影响程度的变化可能较大。也就是说,突变前的第一子节点对第一父节点的影响因子和突变后的第一子节点对第一父节点的影响因子存在较大的差异。因此,电子设备分别对突变前后的第一子节点对第一父节点的所有影响因子求平均。It can be understood that the abnormality of the first performance index may be a sudden result. For the sudden abnormality of the first performance index, there is a large mutation in the sampling value of the first performance index and the sampling values of multiple second performance indicators, then the multiple second performance indicators before and after the mutation have a significant impact on the first performance The degree of influence of indicators may vary greatly. That is to say, there is a big difference between the influence factor of the first child node before the mutation on the first parent node and the influence factor of the first child node after the mutation on the first parent node. Therefore, the electronic device averages all the influence factors of the first child node on the first parent node before and after the mutation, respectively.
进一步地,电子设备计算突变前后第一子节点对第一父节点的影响因子的差值或比值,将该差值或该比值作为第一子节点对第一父节点的影响因子。由于该差值或该比值表示了第一子节点对第一父节点的影响因子在突变前后的变化,并且,任一个子节点对父节点的影响因子在突变前后的变化越大,该任一个子节点对导致第一性能指标异常的影响就越大;因此,该差值或该比值用于第一性能指标异常的根因定位,可以提高第一性能指标异常的根因定位的准确性。Further, the electronic device calculates the difference or ratio of the influence factor of the first child node to the first parent node before and after the mutation, and uses the difference or the ratio as the influence factor of the first child node to the first parent node. Since the difference or the ratio represents the change of the influence factor of the first child node on the first parent node before and after the mutation, and the greater the change of the influence factor of any child node on the parent node before and after the mutation, the The child node has a greater impact on causing the abnormality of the first performance index; therefore, the difference or the ratio is used for locating the root cause of the abnormality of the first performance index, which can improve the accuracy of locating the root cause of the abnormality of the first performance index.
本申请实施例中,电子设备在S301-S302和S401-S402之前,采用上述树形网络中的第一父节点的性能指标的第h个样本值和第一父节点的所有子节点的性能指标的第h个采样值作为训练样本,训练预设注意力模型(Attention Model),得到第一父节点的影响因子预估模型;其中,h在{1,……,H}中依次取值,H≥100,H为整数。In the embodiment of the present application, before S301-S302 and S401-S402, the electronic device adopts the h-th sample value of the performance index of the first parent node in the above tree network and the performance index of all child nodes of the first parent node. The h-th sampling value of is used as a training sample to train the preset attention model (Attention Model) to obtain the influence factor prediction model of the first parent node; where h takes values in sequence in {1,...,H}, H≥100, H is an integer.
其中,电子设备针对第一性能指标的树形网络,可以采集电子设备应用于多种使用场景下的第一性能指标的样本值和多个第二性能指标的样本值,以获取第一性能指标的H个样本值和多个 第二性能指标中每个第二性能指标的H个样本值。第一性能指标的H个样本值和每个第二性能指标的H个样本值为一一对应的。Wherein, for the tree network of the first performance index, the electronic device can collect sample values of the first performance index and multiple sample values of the second performance index that the electronic device is applied to in various usage scenarios, so as to obtain the first performance index H sample values of , and H sample values of each second performance indicator in the plurality of second performance indicators. The H sample values of the first performance index and the H sample values of each second performance index are in one-to-one correspondence.
其中,电子设备可以按照对应的采样时刻的先后,分别将第一性能指标的H个样本值和每个第二性能指标的H个样本值排列成第1个样本值、第2个样本、…、第H个样本,也可以按照其他的排列规则分别对第一性能指标的H个样本值和每个第二性能指标的H个样本值进行排列。The electronic device may arrange the H sample values of the first performance index and the H sample values of each second performance index into the first sample value, the second sample, ... , and the H th sample, the H sample values of the first performance index and the H sample values of each second performance index may be respectively arranged according to other arrangement rules.
需要说明的是,h在{1,……,H}中依次取值,并执行训练预设注意力模型的过程。h在{1,……,H}中取一个值的情况下用于训练的预设注意力模型,是h在{1,……,H}中取前一个值的情况下训练得到的预设注意力模型,例如,h取值为2的情况下用于训练的预设注意力模型是h取值为1的情况下训练得到的预设注意力模型。h取值为H的情况下训练得到的预设注意力模型,就是第一父节点的影响因子预估模型。It should be noted that h takes values in sequence in {1,...,H}, and performs the process of training the preset attention model. The preset attention model used for training when h takes a value in {1, . It is assumed that the attention model, for example, the preset attention model used for training when the value of h is 2 is the preset attention model obtained by training when the value of h is 1. When the value of h is H, the preset attention model obtained by training is the influence factor prediction model of the first parent node.
本申请实施例中,电子设备将第一父节点的所有子节点的性能指标的第h个采样值,作为预设注意力模型的输入,运行预设注意力模块,输出第一父节点的性能指标的第h个预估值;再计算第一父节点的性能指标的第h个预估值和第一父节点的性能指标的第h个样本值之间的差值的绝对值;然后,根据该差值的绝对值调整预设注意力模型的参数,以减小该差值的绝对值。In the embodiment of the present application, the electronic device uses the h-th sampled value of the performance indicators of all child nodes of the first parent node as the input of the preset attention model, runs the preset attention module, and outputs the performance of the first parent node. The h-th estimated value of the index; then calculate the absolute value of the difference between the h-th estimated value of the performance index of the first parent node and the h-th sample value of the performance index of the first parent node; then, The parameters of the preset attention model are adjusted according to the absolute value of the difference, so as to reduce the absolute value of the difference.
示例性地,利用第一父节点的预设注意力模型,训练得到的第一父节点的影响因子预估模型的结构如图5所示,假设第一父节点Y的所有子节点包括子节点1、子节点2和子节点3。将子节点1的性能指标的第h个样本值x1、子节点2的性能指标的第h个样本值x2和子节点3的性能指标的第h个样本值x3,作为预设注意力模型的输入。一方面,每个子节点的性能指标的第h个样本值输入到对应的多层感知机(Multilayer Perceptron,MLP);再通过对应的归一化函数(例如,softmax函数)对MLP的输出进行归一化,得到每个子节点对第一父节点Y的影响因子f1。具有包括:子节点x1对第一父节点Y的影响因子f1(x1)、子节点x2对第一父节点Y的影响因子f1(x2)、子节点x3对第一父节点Y的影响因子f1(x3)。另一方面,每个子节点的性能指标的第h个样本值和对应的子节点对第一父节点的影响因子相乘,得到的所有相乘后的数值包括x1*f1(x1)、x2*f1(x2)、x3*f1(x3);所有子节点的相乘后的数值输入到一个MLP,MLP输出第一父节点的性能指标的第h个预估值f2(X)。Exemplarily, using the preset attention model of the first parent node, the structure of the influence factor estimation model of the first parent node obtained by training is shown in Figure 5, assuming that all child nodes of the first parent node Y include child nodes. 1. Child node 2 and child node 3. Take the h-th sample value x1 of the performance index of child node 1, the h-th sample value x2 of the performance index of child node 2, and the h-th sample value x3 of the performance index of child node 3 as the input of the preset attention model . On the one hand, the h-th sample value of the performance index of each child node is input to the corresponding Multilayer Perceptron (MLP); then the output of the MLP is normalized by the corresponding normalization function (for example, the softmax function). Unification, the influence factor f1 of each child node on the first parent node Y is obtained. It includes: the influence factor f1(x1) of the child node x1 on the first parent node Y, the influence factor f1(x2) of the child node x2 on the first parent node Y, the influence factor f1 of the child node x3 on the first parent node Y (x3). On the other hand, the h-th sample value of the performance index of each child node is multiplied by the influence factor of the corresponding child node on the first parent node, and all the multiplied values obtained include x1*f1(x1), x2* f1(x2), x3*f1(x3); the multiplied values of all child nodes are input to an MLP, and the MLP outputs the h-th estimated value f2(X) of the performance index of the first parent node.
需要说明的是,图5仅仅是一个父节点的影响因子预估模型的示意图,本申请实施例不对任一个父节点的影响因子预估模型进行限制。另外,由于不同父节点的子节点的个数不同,则不同父节点的影响因子预估模型的结构也不同。由于不同父节点的性能指标不同,以及不同父节点的子节点的性能指标不同,则不同父节点的影响因子预估模型的参数也不同。It should be noted that FIG. 5 is only a schematic diagram of the influence factor estimation model of a parent node, and the embodiment of the present application does not limit the influence factor estimation model of any parent node. In addition, since the number of child nodes of different parent nodes is different, the structure of the influence factor prediction model of different parent nodes is also different. Since the performance indicators of different parent nodes are different, and the performance indicators of the child nodes of different parent nodes are different, the parameters of the influence factor prediction models of different parent nodes are also different.
本申请实施例中,第一父节点的影响因子预估模型具有从第一父节点的所有子节点的性能指标的采样值中,提取出第一父节点的每个子节点对第一父节点的影响因子、以及第一父节点的性能指标的预估值的能力。由于利用第一父节点的性能指标的预估值是利用第一父节点的影响因子预估模型从第一父节点的所有子节点的性能指标的采样值中提取出的,如果第一父节点的性能指标的预估值和第一父节点的性能指标的样本值存在较大差异,则可以认为存在除第一父节点的所有子节点的性能指标之外的其他因素,相较于第一父节点的所有子节点的性能指标,对第一父节点的性能指标的样本值的影响更大。其他因素可以是指硬件故障或未被包含在树形网络中的性能指标。因此,电子设备除了确定出树形网络中导致第一性能指标异常的第二性能指标,还可能确定出导致第一性能指标异常的其他因素。具体地,参见图6,电子设备可以在S102之后,不执行S103-S104,执行S601-S604。为了更清楚说明计算每个父节点的子节点或其他因素对父节点的影 响因子的过程,将每个父节点称为第一父节点。In the embodiment of the present application, the influence factor estimation model of the first parent node has the function of extracting the effect of each child node of the first parent node on the first parent node from the sampling values of the performance indicators of all child nodes of the first parent node. Influence factor, and the ability to estimate the performance index of the first parent node. Since the estimated value using the performance index of the first parent node is extracted from the sampling values of the performance index of all child nodes of the first parent node by using the impact factor prediction model of the first parent node, if the first parent node If there is a big difference between the estimated value of the performance index of the first parent node and the sample value of the performance index of the first parent node, it can be considered that there are factors other than the performance index of all the child nodes of the first parent node. The performance indicators of all child nodes of the parent node have a greater impact on the sample values of the performance indicators of the first parent node. Other factors may refer to hardware failures or performance metrics not included in the tree network. Therefore, in addition to determining the second performance index in the tree network that causes the first performance index to be abnormal, the electronic device may also determine other factors that cause the first performance index to be abnormal. Specifically, referring to FIG. 6 , the electronic device may execute S601-S604 without executing S103-S104 after S102. In order to illustrate the process of calculating the influence factor of each parent node's child nodes or other factors on the parent node more clearly, each parent node is referred to as the first parent node.
S601、电子设备针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子、第一父节点的性能指标的第q个预估值;再将第一父节点的性能指标的第q个预估值和第一父节点的性能指标的第q个采样值的差值的绝对值,除以第一父节点的性能指标的第q个采样值,得到第一父节点的其他因素对第一父节点的第q个影响因子。S601, the electronic device performs for each parent node in the tree network: taking the qth sampled value of the performance index of all the child nodes of the first parent node as the input of the influence factor estimation model of the first parent node, and running The impact factor estimation model of the first parent node outputs the qth impact factor of each child node of the first parent node on the first parent node and the qth estimated value of the performance index of the first parent node; The absolute value of the difference between the q-th estimated value of the performance indicator of a parent node and the q-th sampled value of the performance indicator of the first parent node, divided by the q-th sampled value of the performance indicator of the first parent node, Get the qth influencing factor of other factors of the first parent node on the first parent node.
其中,q在{1,……,Q}中依次取值;其他因素是指除第一父节点的所有子节点的性能指标之外的因素。Among them, q takes values in sequence in {1,...,Q}; other factors refer to factors other than the performance indicators of all child nodes of the first parent node.
需要说明的是,S601中与S301的相同部分的具体过程,可以参见S301中的该相同部分的详细描述,本申请实施例这里不予赘述。It should be noted that, for the specific process of the same part in S601 as that of S301, reference may be made to the detailed description of the same part in S301, which is not repeated in this embodiment of the present application.
S602、电子设备针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的Q个影响因子的平均值,得到第一子节点对第一父节点的影响因子。S602. The electronic device performs for each child node of the first parent node: calculating the average value of Q influence factors of the first child node on the first parent node to obtain the influence factor of the first child node on the first parent node.
其中,第一子节点是第一父节点的任一个子节点。The first child node is any child node of the first parent node.
需要说明的是,S602的具体过程,可以参见S301的详细描述,本申请实施例这里不予赘述。It should be noted that, for the specific process of S602, reference may be made to the detailed description of S301, which is not repeated in this embodiment of the present application.
S603、电子设备计算第一父节点的其他因素对第一父节点的Q个影响因子的平均值,得到第一父节点的其他因素对第一父节点的影响因子。S603. The electronic device calculates the average value of Q influence factors of other factors of the first parent node on the first parent node, and obtains the influence factors of other factors of the first parent node on the first parent node.
S604、电子设备根据树形网络中的各个子节点对父节点的影响因子,以及各个父节点的其他因素对对应的父节点的影响因子,确定出树形网络中导致第一性能指标异常的第二性能指标或导致第一性能指标异常的其他因素。S604. The electronic device determines, according to the influence factors of each child node in the tree network on the parent node and the influence factors of other factors of each parent node on the corresponding parent node, the first performance index in the tree network that causes the first performance index to be abnormal. The second performance index or other factors that cause the first performance index to be abnormal.
需要说明的是,可以将各个父节点的其他因素等同于父节点的一个新的子节点的性能指标,此时,S604的具体过程,可以参见S104的详细描述,本申请实施例这里不予赘述。It should be noted that other factors of each parent node may be equal to the performance index of a new child node of the parent node. In this case, for the specific process of S604, please refer to the detailed description of S104, which is not repeated in this embodiment of the present application. .
可以理解的是,电子设备除了计算树形网络中的各个子节点对父节点的影响因子,还计算各个父节点的其他因素对对应的父节点的影响因子。可以避免由于获取的多个第二性能指标不够全面,导致从多个第二性能指标中确定出的导致第一性能指标异常的第二性能指标不准确的问题。还可以避免硬件故障导致第一性能指标异常的情况下,从多个第二性能指标中确定出的导致第一性能指标异常的第二性能指标不准确的问题。因此,从多个第二性能指标和其他因素中确定导致第一性能指标异常的根因,相较于从多个第二性能指标中确定导致第一性能指标异常的根因,更准确。It can be understood that, in addition to calculating the influence factor of each child node in the tree network on the parent node, the electronic device also calculates the influence factor of other factors of each parent node on the corresponding parent node. The problem of inaccuracy of the second performance index determined from the plurality of second performance indexes and causing the abnormality of the first performance index can be avoided because the obtained multiple second performance indexes are not comprehensive enough. It is also possible to avoid the problem of inaccuracy of the second performance index determined from a plurality of second performance indicators that causes the first performance index to be abnormal when the first performance index is abnormal due to a hardware failure. Therefore, determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators and other factors is more accurate than determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators.
本申请实施例中,对于突发性的第一性能指标异常,电子设备除了确定出树形网络中导致第一性能指标异常的第二性能指标,还可能确定出导致第一性能指标异常的其他因素。具体地,参见图7,电子设备可以在S102之后,不执行S103-S104,执行S701-S704。为了更清楚说明计算每个父节点的子节点或其他因素对父节点的影响因子的过程,将每个父节点称为第一父节点。In this embodiment of the present application, for a sudden first performance indicator abnormality, the electronic device may determine other than the second performance indicator in the tree network that causes the first performance indicator to be abnormal, and may also determine other performance indicators that cause the first performance indicator to be abnormal. factor. Specifically, referring to FIG. 7 , the electronic device may execute S701-S704 without executing S103-S104 after S102. In order to more clearly illustrate the process of calculating the influence factor of each parent node's child nodes or other factors on the parent node, each parent node is referred to as the first parent node.
S701、电子设备针对树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为第一父节点的影响因子预估模型的输入,运行第一父节点的影响因子预估模型,输出第一父节点的每个子节点对第一父节点的第q个影响因子、第一父节点的性能指标的第q个预估值;再将第一父节点的性能指标的第q个预估值和第一父节点的性能指标的第q个采样值的差值的绝对值,除以第一父节点的性能指标的第q个采样值,得到第一父节点的其他因素对第一父节点的第q个影响因子。S701, the electronic device performs for each parent node in the tree network: taking the qth sampled value of the performance index of all the child nodes of the first parent node as the input of the influence factor estimation model of the first parent node, and running The impact factor estimation model of the first parent node outputs the qth impact factor of each child node of the first parent node on the first parent node and the qth estimated value of the performance index of the first parent node; The absolute value of the difference between the q-th estimated value of the performance indicator of a parent node and the q-th sampled value of the performance indicator of the first parent node, divided by the q-th sampled value of the performance indicator of the first parent node, Get the qth influencing factor of other factors of the first parent node on the first parent node.
其中,q在{1,……,Q}中依次取值。Among them, q takes values in sequence in {1,...,Q}.
需要说明的是,S701的具体过程可以参见上述S601的详细描述,本申请实施例这里不予赘述。It should be noted that, for the specific process of S701, reference may be made to the detailed description of the foregoing S601, which is not repeated in this embodiment of the present application.
S702、若Q个采样时刻中的第p个采样时刻为性能指标的突变时刻,电子设备针对第一父节点的每个子节点执行:计算第一子节点对第一父节点的前p-1个影响因子的平均值,得到突变前第一子节点对第一父节点的影响因子;计算第一子节点对第一父节点的第p个-第Q个影响因子的平均值,得到突变后第一子节点对第一父节点的影响因子;计算突变前第一子节点对第一父节点的影响因子与突变后第一子节点对第一父节点的影响因子的差值或比值,将差值或比值作为第一子节点对第一父节点的影响因子。S702. If the p-th sampling time in the Q sampling times is the sudden change of the performance index, the electronic device performs for each child node of the first parent node: calculate the first p-1 times of the first child node to the first parent node Calculate the average value of the impact factors to obtain the impact factor of the first child node on the first parent node before the mutation; The influence factor of a child node on the first parent node; calculate the difference or ratio between the influence factor of the first child node on the first parent node before the mutation and the influence factor of the first child node on the first parent node after the mutation. The value or ratio is used as the influence factor of the first child node to the first parent node.
需要说明的是,S702的具体过程,可以参见S402的详细描述,本申请实施例这里不予赘述。It should be noted that, for the specific process of S702, reference may be made to the detailed description of S402, which is not repeated in this embodiment of the present application.
S703、电子设备计算第一父节点的其他因素对第一父节点的前p-1个影响因子的平均值,得到突变前第一父节点的其他因素对第一父节点的影响因子;计算第一父节点的其他因素对第一父节点的第p个-第Q个影响因子的平均值,得到突变后第一父节点的其他因素对第一父节点的影响因子;计算突变前第一父节点的其他因素对第一父节点的影响因子与突变后第一父节点的其他因素对第一父节点的影响因子的差值或比值,将差值或比值作为第一父节点的其他因素对第一父节点的影响因子。S703. The electronic device calculates the average value of the first p-1 influence factors of the other factors of the first parent node on the first parent node, and obtains the influence factors of other factors of the first parent node before the mutation on the first parent node; calculate the first parent node. The average value of the p-th to Q-th influencing factors of the other factors of a parent node on the first parent node, to obtain the influence factors of other factors of the first parent node after the mutation on the first parent node; calculate the first parent node before the mutation The difference or ratio between the influence factors of other factors of the node on the first parent node and the influence factors of other factors of the first parent node on the first parent node after mutation, and the difference or ratio is taken as the pair of other factors of the first parent node. The influence factor of the first parent node.
需要说明的是,S703的具体过程,可以参见S402的详细描述,本申请实施例这里不予赘述。It should be noted that, for the specific process of S703, reference may be made to the detailed description of S402, which is not repeated in this embodiment of the present application.
S704、电子设备根据树形网络中的各个子节点对父节点的影响因子,以及各个父节点的其他因素对对应的父节点的影响因子,确定出树形网络中导致第一性能指标异常的第二性能指标或导致第一性能指标异常的其他因素。S704. The electronic device determines, according to the influence factor of each child node in the tree network on the parent node, and the influence factor of other factors of each parent node on the corresponding parent node, the first performance index in the tree network that causes the abnormality of the first performance index. The second performance index or other factors that cause the first performance index to be abnormal.
需要说明的是,可以将各个父节点的其他因素等同于父节点的一个新的子节点的性能指标,此时,S704的具体过程,可以参见S104的详细描述,本申请实施例这里不予赘述。It should be noted that other factors of each parent node may be equivalent to the performance index of a new child node of the parent node. In this case, for the specific process of S704, reference may be made to the detailed description of S104, which is not repeated in this embodiment of the present application. .
可以理解的是,电子设备对于突发性的第一性能指标异常,也可以从多个第二性能指标和其他因素中,确定导致第一性能指标异常的根因。可以避免由于获取的多个第二性能指标不够全面,导致从多个第二性能指标中确定出的导致第一性能指标异常的第二性能指标不准确的问题。还可以避免硬件故障导致第一性能指标异常的情况下,从多个第二性能指标中确定出的导致第一性能指标异常的第二性能指标不准确的问题。因此,从多个第二性能指标和其他因素中确定导致第一性能指标异常的根因,相较于从多个第二性能指标中确定导致第一性能指标异常的根因,更准确。It can be understood that, for the sudden abnormality of the first performance index, the electronic device can also determine the root cause of the abnormality of the first performance index from a plurality of second performance indicators and other factors. The problem of inaccuracy of the second performance index determined from the plurality of second performance indexes and causing the abnormality of the first performance index can be avoided because the obtained multiple second performance indexes are not comprehensive enough. It is also possible to avoid the problem of inaccuracy of the second performance index determined from a plurality of second performance indicators that causes the first performance index to be abnormal when the first performance index is abnormal due to a hardware failure. Therefore, determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators and other factors is more accurate than determining the root cause of the abnormality of the first performance index from the plurality of second performance indicators.
本申请实施例中,电子设备除了先计算每个采样时刻的各个子节点对父节点的影响因子,再计算预设定位周期内的各个子节点对父节点的影响因子(即各个子节点对父节点的影响因子),也可以针对树形网络中的每个子节点,直接计算子节点对父节点的影响因子。In this embodiment of the present application, the electronic device first calculates the influence factor of each child node on the parent node at each sampling time, and then calculates the influence factor of each child node on the parent node within the preset positioning period (that is, the influence factor of each child node on the parent node). The influence factor of the node), or for each child node in the tree network, the influence factor of the child node on the parent node can be directly calculated.
具体地,由于树形网络中每个节点的性能指标的Q个采样值都是按照采样时刻的先后顺序排列而成的数列,可以知道,该Q个采样值可以由表示该Q个采样值的变化趋势的趋势分量、该Q个采样值中的周期性出现的周期分量、以及表征该Q个采样值的波动情况的残差组成。由于残差表示了任一个节点的性能指标的Q个采样值的波动情况,如果任一个节点的性能指标的残差与任一个节点的父节点的性能指标的残差的相关性较大,则表示任一个节点的性能指标的Q个采样值的波动情况,对任一个节点的父节点的性能指标的Q个采样值的波动情况影响较大。因此,可以利用任一个节点的性能指标的残差和任一个节点的父节点的性能指标的残差之间的相关性,确定任一个节点对父节点的影响因子。Specifically, since the Q sampled values of the performance indicators of each node in the tree network are all arrays arranged in the order of the sampling moments, it can be known that the Q sampled values can be represented by the The trend component of the change trend, the periodic component that occurs periodically in the Q sampled values, and the residual characterizing the fluctuation of the Q sampled values are composed. Since the residual represents the fluctuation of the Q sampling values of the performance index of any node, if the residual of the performance index of any node has a large correlation with the residual of the performance index of the parent node of any node, then The fluctuation of the Q sampling values representing the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node. Therefore, the influence factor of any node on the parent node can be determined by using the correlation between the residual of the performance index of any node and the residual of the performance index of the parent node of any node.
本申请实施例中,电子设备可根据残差的相关性,确定任一个节点对父节点的影响因子。具体地,如图8所示,电子设备可以针对树形网络中的每个子节点执行S801-S803,以计算每个子节点对父节点的影响因子。为了更清楚说明计算每个子节点对父节点的影响因子的过程,将每个子节点称为第二子节点。In this embodiment of the present application, the electronic device may determine the influence factor of any node on the parent node according to the correlation of the residuals. Specifically, as shown in FIG. 8 , the electronic device may execute S801-S803 for each child node in the tree network to calculate the influence factor of each child node on the parent node. In order to illustrate the process of calculating the influence factor of each child node on the parent node more clearly, each child node is referred to as a second child node.
S801、电子设备根据第二子节点的性能指标的Q个采样值,计算第二子节点的性能指标的第一残差;其中,第二子节点是树形网络的任一个子节点;第一残差用于表征第二子节点的性能指标的Q个采样值的波动情况。S801. The electronic device calculates the first residual of the performance index of the second child node according to the Q sampled values of the performance index of the second child node; wherein the second child node is any child node of the tree network; the first The residual is used to characterize the fluctuation of the Q sampled values of the performance index of the second child node.
其中,将第二子节点的性能指标的残差称为第一残差。第二子节点的性能指标的Q个采样值可以包括一个性能指标的Q个采样值,或者各个性能指标的Q个采样值。第二子节点的性能指标的第一残差可以包括一个性能指标的第一残差,或者各个性能指标的第一残差。Wherein, the residual of the performance index of the second child node is referred to as the first residual. The Q sampled values of the performance indicator of the second child node may include Q sampled values of one performance indicator, or Q sampled values of each performance indicator. The first residual of the performance index of the second child node may include the first residual of one performance index, or the first residual of each performance index.
本申请实施例中,若第二子节点的性能指标的Q个采样值包括各个性能指标的Q个采样值,电子设备根据第二子节点的每个性能指标的Q个采样值,计算该性能指标的第一残差。In this embodiment of the present application, if the Q sampled values of the performance indicators of the second subnode include Q sampled values of each performance indicator, the electronic device calculates the performance according to the Q sampled values of each performance indicator of the second subnode The first residual of the indicator.
本申请实施例中,电子设备可以利用时间序列分解算法,从第二子节点的性能指标的Q个采样值中,提取第二子节点的性能指标的趋势分量和第二子节点的性能指标的周期分量;再将第二子节点的性能指标的Q个采样值,减去趋势分量和周期分量,得到第二子节点的性能指标的第一残差。其中,趋势分量用于表示第二子节点的性能指标的Q个采样值的变化趋势,周期分量用于表示第二子节点的性能指标的Q个采样值的周期性变动。In this embodiment of the present application, the electronic device may use a time series decomposition algorithm to extract the trend component of the performance index of the second child node and the trend component of the performance index of the second child node from the Q sampled values of the performance index of the second child node. Periodic component; then subtract the trend component and the periodic component from the Q sampled values of the performance index of the second child node to obtain the first residual of the performance index of the second child node. The trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node, and the periodic component is used to represent the periodic change of the Q sampled values of the performance index of the second child node.
其中,时间序列分解算法包括基于局部加权回归的季节趋势分解方法(Seasonal-Trend decomposition procedure based on Loess,STL分解法)。Among them, the time series decomposition algorithm includes the seasonal trend decomposition method based on local weighted regression (Seasonal-Trend decomposition procedure based on Loess, STL decomposition method).
具体地,电子设备针对第二子节点的每个性能指标执行以下步骤,以获取该性能指标的第一残差:从性能指标的Q个采样值中,提取性能指标的趋势分量和性能指标的周期分量;再将性能指标的Q个采样值,减去趋势分量和周期分量,得到性能指标的第一残差。Specifically, the electronic device performs the following steps for each performance indicator of the second child node to obtain the first residual of the performance indicator: from the Q sampled values of the performance indicator, extracting the trend component of the performance indicator and the Periodic component; then subtract the trend component and the periodic component from the Q sampled values of the performance index to obtain the first residual of the performance index.
本申请实施例中,电子设备可以预先利用第二子节点的性能指标的H个样本值,训练基于时间序列分解算法的差分整合移动平均自回归模型(Autoregressive Integrated Moving Average model,ARIMA模型),得到第二子节点的分量拟合模型。该分量拟合模型具有从第二子节点的性能指标的多个采样值中提取出第二子节点的性能指标的趋势分量和第二子节点的性能指标的周期分量的能力。In the embodiment of the present application, the electronic device may use H sample values of the performance index of the second sub-node in advance to train a differential integrated moving average autoregressive model (Autoregressive Integrated Moving Average model, ARIMA model) based on the time series decomposition algorithm, and obtain The components of the second child node fit the model. The component fitting model has the capability of extracting a trend component of the performance index of the second child node and a periodic component of the performance index of the second child node from a plurality of sampled values of the performance index of the second child node.
需要说明的是,除了上述ARIMA模型,电子设备还可以训练基于时间序列分解算法的其他模型,以得到第二子节点的分量拟合模型,例如,自回归模型(Autoregressive model,AR模型),本申请实施例不作限制。It should be noted that, in addition to the above ARIMA model, the electronic device can also train other models based on the time series decomposition algorithm to obtain the component fitting model of the second child node, for example, an autoregressive model (Autoregressive model, AR model), this The application examples are not limited.
本申请实施例中,电子设备可以将第二子节点的性能指标的Q个采样值输入第二子节点的分量拟合模型,运行第二子节点的分量拟合模型,输出第二子节点的性能指标的趋势分量和第二子节点的性能指标的周期分量。In this embodiment of the present application, the electronic device may input the Q sampled values of the performance indicators of the second child node into the component fitting model of the second child node, run the component fitting model of the second child node, and output the component fitting model of the second child node. The trend component of the performance indicator and the periodic component of the performance indicator of the second child node.
具体地,电子设备针对第二子节点的每个性能指标执行以下步骤,以获取该性能指标的趋势分量和周期分量:将性能指标的Q个采样值输入第二子节点的分量拟合模型,运行第二子节点的分量拟合模型,输出性能指标的趋势分量和性能指标的周期分量。Specifically, the electronic device performs the following steps for each performance index of the second child node to obtain the trend component and periodic component of the performance index: inputting the Q sampled values of the performance index into the component fitting model of the second child node, Run the component fitting model of the second child node, and output the trend component of the performance index and the periodic component of the performance index.
S802、电子设备根据第二子节点的父节点的性能指标的Q个采样值,计算第二子节点的父节点的性能指标的第二残差,第二残差用于表征第二子节点的父节点的性能指标的Q个采样值的波动情况。S802. The electronic device calculates a second residual of the performance index of the parent node of the second child node according to the Q sampled values of the performance index of the parent node of the second child node, where the second residual is used to characterize the performance index of the second child node. The fluctuation of the Q sampled values of the parent node's performance index.
其中,将第二子节点的父节点的性能指标的残差称为第二残差。第二子节点的父节点可以包括一个父节点或多个父节点。第二子节点的父节点的性能指标的第二残差可以包括第二子节点的一个父节点的性能指标的第二残差,或第二子节点的各个父节点的性能指标的第二残差。The residual of the performance index of the parent node of the second child node is referred to as the second residual. The parent node of the second child node may include one parent node or multiple parent nodes. The second residual of the performance index of the parent node of the second child node may include the second residual of the performance index of one parent node of the second child node, or the second residual of the performance index of each parent node of the second child node Difference.
其中,第二子节点的任一个父节点的性能指标的Q个采样值可以包括一个性能指标的Q个采样值,或者各个性能指标的Q个采样值。第二子节点的任一个父节点的性能指标的第二残差可以包括一个性能指标的第二残差,或者各个性能指标的第二残差。The Q sampled values of the performance indicator of any parent node of the second child node may include Q sampled values of one performance indicator, or Q sampled values of each performance indicator. The second residual of the performance index of any parent node of the second child node may include the second residual of one performance index, or the second residual of each performance index.
本申请实施例中,若第二子节点的任一个父节点的性能指标的Q个采样值包括多个性能指标的Q个采样值,电子设备对第二子节点的任一个父节点的每个性能指标的Q个采样值,计算该性能指标的第二残差。In this embodiment of the present application, if the Q sampled values of the performance indicators of any parent node of the second child node include Q sampled values of multiple performance indicators, the electronic device will perform an analysis on each parent node of the second child node. The Q sampled values of the performance indicator are used to calculate the second residual of the performance indicator.
需要说明的是,电子设备计算第二子节点的父节点的性能指标的第二残差的过程,与计算第二子节点的性能指标的第一残差的过程同理,本申请实施例这里不予赘述。It should be noted that the process of calculating the second residual of the performance index of the parent node of the second child node by the electronic device is the same as the process of calculating the first residual of the performance index of the second child node. I won't go into details.
S803、电子设备根据预设相关系数算法、第一残差和第二残差,计算第二子节点对父节点的影响因子。S803. The electronic device calculates the influence factor of the second child node on the parent node according to the preset correlation coefficient algorithm, the first residual and the second residual.
其中,第二子节点的父节点包括一个父节点或多个父节点,第二子节点对父节点的影响因子包括第二子节点对一个父节点的影响因子或第二子节点对各个父节点的影响因子。The parent node of the second child node includes one parent node or multiple parent nodes, and the influence factor of the second child node on the parent node includes the influence factor of the second child node on a parent node or the second child node on each parent node impact factor.
本申请实施例中,电子设备针对第二子节点的每个父节点,根据预设相关系数算法、第二子节点的性能指标的第一残差和第二子节点的每个父节点的性能指标的第二残差,计算第二子节点对每个父节点的影响因子。In the embodiment of the present application, for each parent node of the second child node, the electronic device performs the calculation according to the preset correlation coefficient algorithm, the first residual of the performance index of the second child node, and the performance of each parent node of the second child node. The second residual of the indicator calculates the influence factor of the second child node on each parent node.
具体地,若第二子节点的性能指标的第一残差包括一个性能指标的第一残差,第二子节点的一个父节点的性能指标的第二残差包括一个性能指标的第二残差,电子设备计算这一个性能指标的第一残差和这一个性能指标的第二残差的相关性,将相关性作为第二子节点对这一个父节点的影响因子。Specifically, if the first residual of the performance index of the second child node includes the first residual of one performance index, the second residual of the performance index of a parent node of the second child node includes the second residual of one performance index difference, the electronic device calculates the correlation between the first residual of the one performance index and the second residual of the one performance index, and uses the correlation as the influence factor of the second child node on the parent node.
若第二子节点的性能指标的第一残差包括多个性能指标的第一残差,第二子节点的一个父节点的性能指标的第二残差包括一个性能指标的第二残差,电子设备计算第二子节点的每个性能指标的第一残差和这一个性能指标的第二残差的相关性,得到与第二子节点的多个性能指标一一对应的多个相关性;再对该多个相关性求平均,得到第二子节点对这一个父节点的影响因子。If the first residual of the performance index of the second child node includes the first residual of multiple performance indexes, and the second residual of the performance index of a parent node of the second child node includes the second residual of one performance index, The electronic device calculates the correlation between the first residual of each performance index of the second child node and the second residual of this one performance index, and obtains multiple correlations one-to-one corresponding to the multiple performance indexes of the second child node ; Then average the multiple correlations to obtain the influence factor of the second child node on this parent node.
若第二子节点的性能指标的第一残差包括一个性能指标的第一残差,第二子节点的一个父节点的性能指标的第二残差包括多个性能指标的第二残差,电子设备计算第二子节点的一个性能指标的第一残差和各个性能指标的第二残差的相关性,得到与这一个父节点的多个性能指标一一对应的多个相关性;再对该多个相关性求平均,得到第二子节点对这一个父节点的影响因子。If the first residual of the performance index of the second child node includes the first residual of one performance index, and the second residual of the performance index of a parent node of the second child node includes the second residual of multiple performance indexes, The electronic device calculates the correlation between the first residual of one performance index of the second child node and the second residual of each performance index, and obtains multiple correlations one-to-one corresponding to the multiple performance indexes of the parent node; The multiple correlations are averaged to obtain the influence factor of the second child node on this one parent node.
若第二子节点的性能指标的第一残差包括多个性能指标的第一残差,第二子节点的一个父节点的性能指标的第二残差包括多个性能指标的第二残差,电子设备计算第二子节点的每个性能指标的第一残差和各个性能指标的第二残差的相关性,得到第二子节点的每个性能指标对应的多个相关性;再对第二子节点的每个性能指标对应的多个相关性求平均,得到第二子节点的每个性能指标对应的平均值;然后,对第二子节点的所有性能指标对应的平均值求平均,得到第二子节点的每个性能指标对这一个父节点的影响因子。If the first residual of the performance index of the second child node includes the first residual of multiple performance indexes, the second residual of the performance index of a parent node of the second child node includes the second residual of multiple performance indexes , the electronic device calculates the correlation between the first residual of each performance index of the second child node and the second residual of each performance index, and obtains multiple correlations corresponding to each performance index of the second child node; Average multiple correlations corresponding to each performance index of the second child node to obtain an average value corresponding to each performance index of the second child node; then, average the average values corresponding to all performance indexes of the second child node , to get the influence factor of each performance index of the second child node on this parent node.
本申请实施例中,预设相关系统的计算公式可以如公式(1)所示:In the embodiment of the present application, the calculation formula of the preset correlation system may be as shown in formula (1):
Figure PCTCN2020106404-appb-000001
Figure PCTCN2020106404-appb-000001
其中,U 1为第一残差,U 2为第二残差;r(U 1,U 2)为第一残差与第二残差的相关性(或者称 为相关系数);Cov(U 1,U 2)为U 1与U 2的协方差;Var[U 1]为U 1的方差;Var[U 2]为U 2的方差。 Among them, U 1 is the first residual, U 2 is the second residual; r(U 1 , U 2 ) is the correlation between the first residual and the second residual (or called correlation coefficient); Cov(U 1 , U 2 ) is the covariance of U 1 and U 2 ; Var[U 1 ] is the variance of U 1 ; Var[U 2 ] is the variance of U 2 .
可以理解的是,电子设备根据第一残差和第二残差之间的相关性确定第二子节点对父节点的影响因子,再用该影响因子确定导致第一性能指标异常的第二性能指标,就是根据第二子节点的性能指标的Q个采样值的波动情况,对第二子节点的父节点的性能指标的Q个采样值的波动情况的影响程度,确定导致第一性能指标异常的第二性能指标。而如果任一个节点的性能指标的Q个采样值的波动情况,对任一个节点的父节点的性能指标的Q个采样值的波动情况影响越大,则可以确定任一个节点的性能指标对父节点的性指标的影响越大;因此,根据第一残差和第二残差之间的相关性确定第二子节点对父节点的影响因子,能够提高第一性能指标异常的根因定位的准确性。It can be understood that the electronic device determines the influence factor of the second child node on the parent node according to the correlation between the first residual and the second residual, and then uses the influence factor to determine the second performance that causes the first performance index to be abnormal. The indicator refers to the degree of influence on the fluctuation of the Q sampling values of the performance index of the second child node on the fluctuation of the Q sampling values of the performance index of the second child node to determine the abnormality of the first performance index. the second performance indicator. However, if the fluctuation of the Q sampling values of the performance index of any node has a greater impact on the fluctuation of the Q sampling values of the performance index of the parent node of any node, it can be determined that the performance index of any node affects the parent node. Therefore, determining the influence factor of the second child node on the parent node according to the correlation between the first residual and the second residual can improve the location of the root cause of the abnormal first performance index. accuracy.
本申请实施例中,电子设备计算出树形网络中的所有子节点对父节点的影响因子之后,可以采用广度优先算法(又称为宽度优先算法),先根据树形网络中的各个子节点对父节点的影响因子,计算树形网络中的各个叶子节点对根节点的影响因子;再从树形网络的所有叶子节点中确定对根节点的影响因子最大的节点,确定对根节点的影响因子最大的节点的性能指标为导致第一性能指标异常的第二性能指标。具体地,如图9所示,S104可以包括S901-S902。In the embodiment of the present application, after the electronic device calculates the influence factors of all the child nodes in the tree network on the parent node, the breadth-first algorithm (also called breadth-first algorithm) can be used to first calculate the influence factors of each child node in the tree network For the influence factor of the parent node, calculate the influence factor of each leaf node in the tree network on the root node; then determine the node with the largest influence factor on the root node from all the leaf nodes in the tree network, and determine the influence on the root node. The performance indicator of the node with the largest factor is the second performance indicator that causes the first performance indicator to be abnormal. Specifically, as shown in FIG. 9 , S104 may include S901-S902.
S901、电子设备根据树形网络中各个子节点对父节点的影响因子,计算树形网络中叶子节点对根节点的影响因子。S901. The electronic device calculates the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node in the tree network on the parent node.
电子设备对树形网络中的每个叶子节点,计算每个叶子节点对根节点的影响因子。其中,每个叶子结点对根节点的影响因子可以包括一个影响因子,或者多个影响因子。每个叶子节点对根节点的影响因子的总个数,等于每个叶子节点到根节点的路径的总个数。The electronic device calculates the influence factor of each leaf node on the root node for each leaf node in the tree network. The influence factor of each leaf node on the root node may include one influence factor or multiple influence factors. The total number of influence factors of each leaf node to the root node is equal to the total number of paths from each leaf node to the root node.
本申请实施例中,电子设备针对每个叶子节点,将叶子节点到根节点的每一条路径上的所有影响因子进行相乘,得到叶子节点对根节点的一个影响因子。In the embodiment of the present application, for each leaf node, the electronic device multiplies all the influence factors on each path from the leaf node to the root node to obtain an influence factor of the leaf node on the root node.
示例性地,如图10所示的第一性能指标的树形网络,该树形网络包括根节点A和多个子节点,多个子节点包括节点B、节点C、节点D、节点E、节点F、节点G和节点H。其中,根节点A是节点B、节点C和节点D的父节点。节点B是节点E和节点F的父节点。节点C是节点F的父节点。节点D是节点G和节点H的父节点。节点B对根节点A的影响因子为0.6,节点C对根节点A的影响因子为0.3,节点D对节点A的影响因子为0.5,节点E对节点B的影响因子为0.6,节点F对节点B的影响因子为0.7,节点F对节点C的影响因子为0.1,节点G对节点D的影响因子为0.3,节点H对节点D的影响因子为0.2。Exemplarily, as shown in the tree network of the first performance index as shown in FIG. 10 , the tree network includes a root node A and multiple child nodes, and the multiple child nodes include node B, node C, node D, node E, and node F. , Node G, and Node H. The root node A is the parent node of node B, node C, and node D. Node B is the parent of Node E and Node F. Node C is the parent of node F. Node D is the parent of Node G and Node H. The influence factor of node B on root node A is 0.6, the influence factor of node C on root node A is 0.3, the influence factor of node D on node A is 0.5, the influence factor of node E on node B is 0.6, and the influence factor of node F on node A is 0.6. The influence factor of B is 0.7, the influence factor of node F on node C is 0.1, the influence factor of node G on node D is 0.3, and the influence factor of node H on node D is 0.2.
进而,可以将节点E对节点B的影响因子和节点B对根节点A的影响因子相乘,得到节点E对根节点A的影响因子为0.36。将节点F对节点B的影响因子和节点B对根节点A的影响因子相乘,得到节点F对根节点A的一个影响因子为0.42;将节点F对节点C的影响因子和节点C对根节点A的影响因子相乘,得到节点F对根节点A的另一个影响因子为0.03。将节点G对节点D的影响因子和节点D对根节点A的影响因子相乘,得到节点G对根节点A的另一个影响因子为0.15。将节点H对节点D的影响因子和节点D对根节点A的影响因子相乘,得到节点H对根节点A的另一个影响因子为0.10。Furthermore, the influence factor of node E on node B and the influence factor of node B on root node A can be multiplied, and the influence factor of node E on root node A can be obtained as 0.36. Multiply the impact factor of node F on node B and the impact factor of node B on root node A, and get an impact factor of node F on root node A as 0.42; The influence factor of node A is multiplied, and another influence factor of node F to root node A is obtained as 0.03. Multiplying the influence factor of node G on node D and the influence factor of node D on root node A, another influence factor of node G on root node A is obtained as 0.15. Multiplying the influence factor of node H on node D and the influence factor of node D on root node A, the other influence factor of node H on root node A is 0.10.
S902、电子设备将树形网络中的N个叶子节点的性能指标,确定为导致第一性能指标异常的第二性能指标;其中,N个叶子节点对根节点的影响因子大于树形网络中其他叶子节点对根节点的影响因子,N≥1,N为整数。S902. The electronic device determines the performance indexes of the N leaf nodes in the tree network as the second performance indexes that cause the first performance index to be abnormal; wherein, the influence factor of the N leaf nodes on the root node is greater than that of other nodes in the tree network The influence factor of the leaf node on the root node, N≥1, N is an integer.
电子设备可以从树形网络中的所有叶子节点中选出N个叶子节点,并将N个叶子节点的性能 指标确定为导致第一性能指标异常的第二性能指标。The electronic device may select N leaf nodes from all the leaf nodes in the tree network, and determine the performance indicators of the N leaf nodes as the second performance indicators that cause the first performance indicator to be abnormal.
在一些实施例中,电子设备可以按照所有叶子节点对父节点的影响因子,对所有叶子节点进行影子因子从高到低的排序,再从排序后的叶子节点中选出前N个叶子节点。其中,若一个叶子节点对根节点的影响因子包括多个影响因子,可以将该叶子节点对根节点的最大的影响因子用于排序。In some embodiments, the electronic device may sort all leaf nodes in descending order of shadow factors according to the influence factors of all leaf nodes on the parent node, and then select the top N leaf nodes from the sorted leaf nodes. Wherein, if the influence factor of a leaf node to the root node includes multiple influence factors, the largest influence factor of the leaf node to the root node can be used for sorting.
本申请实施例中,电子设备在S902之后,还可以获取N个叶子节点各自到根节点的路径信息。该路径信息可以包括一个叶子节点到根节点的路径上的所有节点的连接关系,以及一个叶子节点到根节点的路径上的所有节点的性能指标。进而,电子设备或用户可以根据该路径信息和导致第一性能指标异常的第二性能指标,调整该导致第一性能指标异常的第二性能指标,实现解决第一性能指标异常的问题。In this embodiment of the present application, after S902, the electronic device may further acquire path information from each of the N leaf nodes to the root node. The path information may include connection relationships of all nodes on the path from a leaf node to the root node, and performance indicators of all nodes on the path from a leaf node to the root node. Further, the electronic device or the user can adjust the second performance index that causes the first performance index to be abnormal according to the path information and the second performance index that causes the first performance index to be abnormal, so as to solve the problem of the first performance index being abnormal.
其中,若一个叶子节点到根节点的路径为多个,该叶子节点到根节点的路径信息可以包括该叶子节点对根节点的最大影响因子对应的路径信息,也可以包括该叶子节点对根节点的多个路径信息。Wherein, if there are multiple paths from a leaf node to the root node, the path information from the leaf node to the root node may include the path information corresponding to the maximum influence factor of the leaf node on the root node, or may include the leaf node to the root node. multiple path information.
示例性地,对于图10所示的树形网络,N等于1时,电子设备确定出的导致第一性能指标异常的第二性能指标为节点F的性能指标。节点F到根节点A的路径信息可以包括:表示节点F指向节点B和节点B指向节点A的信息,节点F的性能指标,节点B的性能指标和节点A的性能指标。Exemplarily, for the tree network shown in FIG. 10 , when N is equal to 1, the second performance indicator determined by the electronic device that causes the first performance indicator to be abnormal is the performance indicator of node F. The path information from node F to root node A may include: information indicating that node F points to node B and node B points to node A, the performance index of node F, the performance index of node B and the performance index of node A.
可以理解的是,由于叶子节点为树形网络中的末端节点,导致第一性能指标异常的根因就存在与树形网络中的所有叶子节点中。因此,电子设备直接计算树形网络中的所有叶子节点对根节点的影响因子;再确定对根节点的影响因子较大的叶子节点的性能指标就是导致第一性能指标异常的第二性能指标。It can be understood that since the leaf nodes are terminal nodes in the tree network, the root cause of the abnormality of the first performance index exists in all the leaf nodes in the tree network. Therefore, the electronic device directly calculates the influence factors of all leaf nodes in the tree network on the root node; and then determines that the performance index of the leaf node with a larger influence factor on the root node is the second performance index that causes the abnormality of the first performance index.
本申请实施例中,树形网络包括第一预设节点,第一预设节点不是叶子节点,第一预设节点的子节点对第一预设节点的影响因子小于或等于预设阈值。由于第一预设节点的子节点对第一预设节点的影响因子小于或等于预设阈值,可以认为第一预设节点为不可分解的节点。则计算树形网络中的各个叶子节点对根节点的影响因子,还计算第一预设节点对根节点的影响因子。再从树形网络的所有叶子节点和第一预设节点中确定对根节点的影响因子最大的节点,确定对根节点的影响因子最大的节点的性能指标为导致第一性能指标异常的第二性能指标。具体地,如图11所示,S104可以包括S1101-S1102。In the embodiment of the present application, the tree network includes a first preset node, the first preset node is not a leaf node, and the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to a preset threshold. Since the influence factor of the child nodes of the first preset node on the first preset node is less than or equal to the preset threshold, the first preset node can be considered as an indecomposable node. Then, the influence factor of each leaf node in the tree network on the root node is calculated, and the influence factor of the first preset node on the root node is also calculated. Then determine the node with the largest influence factor on the root node from all the leaf nodes and the first preset node of the tree network, and determine that the performance index of the node with the largest influence factor on the root node is the second one that causes the first performance index to be abnormal. Performance. Specifically, as shown in FIG. 11 , S104 may include S1101-S1102.
S1101、电子设备根据树形网络中各个子节点对父节点的影响因子,计算树形网络中叶子节点对根节点的影响因子,并计算第一预设节点对根节点的影响因子。S1101. The electronic device calculates the influence factor of the leaf node on the root node in the tree network according to the influence factor of each child node on the parent node in the tree network, and calculates the influence factor of the first preset node on the root node.
其中,树形网络可以包括一个或多个第一预设节点。电子设备计算树形网络中的每个第一预设节点对根节点的影响因子。Wherein, the tree network may include one or more first preset nodes. The electronic device calculates the influence factor of each first preset node in the tree network on the root node.
需要说明的是,电子设备计算第一预设节点对根节点的影响因子,与计算叶子节点对根节点的影响因子的过程同理,则S1101的具体过程,可以参加上述S901的详细描述,本申请实施例这里不予赘述。It should be noted that the process of calculating the influence factor of the first preset node on the root node by the electronic device is the same as the process of calculating the influence factor of the leaf node on the root node. For the specific process of S1101, you can refer to the detailed description of the above-mentioned S901. The application examples are not repeated here.
本申请实施例中,电子设备在S1101之前,可以对树形网络中除叶子节点和根节点之外的其他节点,判断每个其他节点的各个子节点对父节点的影响因子是否小于或等于预设阈值。若每个其他节点的所有子节点对父节点的影响因子小于或等于预设阈值,则确定每个其他节点为一个第一预设节点。若每个其他节点的至少一个子节点对父节点的影响因子大于预设阈值,则确定每个 其他节点不是第一预设节点。In this embodiment of the present application, before S1101, the electronic device may, for other nodes in the tree network except the leaf node and the root node, determine whether the influence factor of each child node of each other node on the parent node is less than or equal to the predetermined factor. Set the threshold. If the influence factor of all child nodes of each other node on the parent node is less than or equal to the preset threshold, then each other node is determined to be a first preset node. If the influence factor of at least one child node of each other node on the parent node is greater than the preset threshold, it is determined that each other node is not the first preset node.
示例性地,假设预设阈值为0.3,对于图10所示的树形网络,由于节点C的子节点F对节点C的影响因子为0.1,小于0.3;还有,节点D的子节点G对节点D的影响因子为0.3,并且节点D的子节点H对节点D的影响因子为0.2,小于0.3。因此,该树形网络包括两个第一预设节点,即节点C和节点D。Exemplarily, assuming that the preset threshold is 0.3, for the tree network shown in FIG. 10 , since the influence factor of the child node F of node C on node C is 0.1, which is less than 0.3; The influence factor of node D is 0.3, and the influence factor of node D's child node H on node D is 0.2, which is less than 0.3. Therefore, the tree network includes two first preset nodes, namely node C and node D.
S1102、电子设备将叶子节点和第一预设节点中的M个节点的性能指标,确定为导致第一性能指标异常的第二性能指标;其中,M个节点对根节点的影响因子大于树形网络中其他节点对根节点的影响因子,M≥1,M为整数。S1102. The electronic device determines the performance indicators of the M nodes in the leaf node and the first preset node as the second performance indicator that causes the first performance indicator to be abnormal; wherein, the influence factor of the M nodes on the root node is greater than the tree shape The influence factor of other nodes in the network on the root node, M≥1, M is an integer.
电子设备可以从树形网络中的所有叶子节点和所有第一预设节点中,选出M个节点;再将M个节点的性能指标,确定为导致第一性能指标异常的第二性能指标。其中,若一个叶子节点或一个第一预设节点对根节点的影响因子包括多个影响因子,可以将该叶子节点或该第一预设节点对根节点的最大的影响因子,用于选择M个节点。The electronic device may select M nodes from all leaf nodes and all first preset nodes in the tree network; and then determine the performance indicators of the M nodes as the second performance indicators that cause the first performance indicator to be abnormal. Wherein, if the influence factor of a leaf node or a first preset node on the root node includes multiple influence factors, the maximum influence factor of the leaf node or the first preset node on the root node can be used to select M node.
需要说明的是电子设备从所有叶子节点和所有第一预设节点中选出M个节点的具体过程,可以参加上述从树形网络中的所有叶子节点中选出N个叶子节点的详细描述,本申请实施例这里不予赘述。It should be noted that the specific process in which the electronic device selects M nodes from all leaf nodes and all first preset nodes can participate in the above detailed description of selecting N leaf nodes from all leaf nodes in the tree network. This embodiment of the present application will not be repeated here.
本申请实施例中,电子设备在S1102之后,还可以获取M个节点各自到根节点的路径信息。该路径信息可以包括一个节点到根节点的路径上的所有节点的连接关系,以及一个节点到根节点的路径上的所有节点的性能指标。进而,电子设备或用户可以根据该路径信息和导致第一性能指标异常的第二性能指标,调整该导致第一性能指标异常的第二性能指标,实现解决第一性能指标异常的问题。In this embodiment of the present application, after S1102, the electronic device may further acquire path information from each of the M nodes to the root node. The path information may include connection relationships of all nodes on the path from a node to the root node, and performance indicators of all nodes on the path from a node to the root node. Further, the electronic device or the user can adjust the second performance index that causes the first performance index to be abnormal according to the path information and the second performance index that causes the first performance index to be abnormal, so as to solve the problem of the first performance index being abnormal.
其中,若一个节点到根节点的路径为多个,该节点到根节点的路径信息可以包括该节点对根节点的最大影响因子对应的路径信息,也可以包括该节点对根节点的多个路径信息。Wherein, if there are multiple paths from a node to the root node, the path information from the node to the root node may include the path information corresponding to the maximum influence factor of the node on the root node, or may include multiple paths from the node to the root node information.
可以理解的是,虽然树形网络中的第一预设节点不是叶子节点,但是第一预设节点的子节点对第一预设节点的影响因子不超过预设阈值,即第一预设节点的子节点的性能指标对第一预设节点的性能指标的影响程度较小,则可以认为第一预设节点和叶子节点类似,导致第一性能指标异常的根因也可能存在于树形网络中的第一预设节点中。因此,电子设备计算树形网络中的所有叶子节点对根节点的影响因子、以及树形网络中的第一预设节点对根节点的影响因子;再从树形网络中的所有叶子节点和第一预设节点中确定对根节点的影响因子较大的节点,其性能指标就是导致第一性能指标异常的第二性能指标。如此,对可能包括导致第一性能指标异常的根因的叶子节点和可能包括导致第一性能指标异常的根因的第一预设节点都进行判断,可以更准确地确定出导致第一性能指标异常的第二性能指标。It can be understood that although the first preset node in the tree network is not a leaf node, the influence factor of the child nodes of the first preset node on the first preset node does not exceed the preset threshold, that is, the first preset node. The performance index of the child node of the first preset node has little influence on the performance index of the first preset node, it can be considered that the first preset node is similar to the leaf node, and the root cause of the abnormal first performance index may also exist in the tree network. in the first preset node in . Therefore, the electronic device calculates the influence factor of all leaf nodes in the tree network on the root node, and the influence factor of the first preset node in the tree network on the root node; In a preset node, a node with a larger influence factor on the root node is determined, and its performance index is the second performance index that causes the abnormality of the first performance index. In this way, both the leaf node that may include the root cause that causes the abnormality of the first performance index and the first preset node that may include the root cause that causes the abnormality of the first performance index are judged, and the first performance index that causes the abnormality can be determined more accurately. Abnormal secondary performance indicator.
本申请实施例中,电子设备计算出树形网络中的所有子节点对父节点的影响因子之后,可以采用深度优先算法,对树形网络从根节点开始一级一级地查找对父节点的影响因子最大的节点,直至查找到的节点为叶子节点或第一预设节点,确定查找到的节点的性能指标为导致第一性能指标异常的第二性能指标。具体地,以查找到的节点为叶子节点,则确定查找到的节点的性能指标为导致第一性能指标异常的第二性能指标为例,说明S104的具体过程。如图12所示,S104可以包括S1201-S1208。In the embodiment of the present application, after the electronic device calculates the influence factors of all the child nodes in the tree network on the parent node, the depth-first algorithm can be used to search the tree network from the root node to the parent node level by level. The node with the largest influence factor is determined until the found node is a leaf node or the first preset node, and the performance index of the found node is determined to be the second performance index that causes the first performance index to be abnormal. Specifically, taking the found node as a leaf node, and determining that the performance index of the found node is the second performance index causing the abnormality of the first performance index as an example, the specific process of S104 is described. As shown in FIG. 12, S104 may include S1201-S1208.
S1201、电子设备将根节点作为第二父节点,并根据树形网络中各个子节点对父节点的影响因子,确定对第二父节点的影响因子最大的子节点。S1201. The electronic device takes the root node as the second parent node, and determines the child node with the largest impact factor on the second parent node according to the influence factors of each child node in the tree network on the parent node.
S1202、电子设备判断对第二父节点的影响因子最大的子节点是不是叶子节点。S1202. The electronic device determines whether the child node with the largest influence factor on the second parent node is a leaf node.
电子设备若确定对第二父节点的影响因子最大的子节点不是叶子节点,执行S1203;若确定对第二父节点的影响因子最大的子节点是叶子节点,执行S1204。If the electronic device determines that the child node with the greatest influence factor on the second parent node is not a leaf node, execute S1203; if it is determined that the child node with the greatest influence factor on the second parent node is a leaf node, execute S1204.
S1203、电子设备将对第二父节点的影响因子最大的子节点作为第二父节点,并根据树形网络中各个子节点对父节点的影响因子,确定对第二父节点的影响因子最大的子节点。S1203. The electronic device takes the child node with the largest influence factor on the second parent node as the second parent node, and determines the child node with the largest influence factor on the second parent node according to the influence factors of each child node in the tree network on the parent node. child node.
电子设备在S1203之后,继续执行S1202。After S1203, the electronic device continues to execute S1202.
S1204、电子设备将对第二父节点的影响因子最大的子节点作为待选子节点。S1204, the electronic device takes the child node with the largest influence factor of the second parent node as the child node to be selected.
S1205、电子设备统计得到的待选子节点的总个数。S1205. The total number of child nodes to be selected obtained by the electronic device statistics.
S1206、电子设备判断待选子节点的总个数是否小于N。S1206, the electronic device determines whether the total number of child nodes to be selected is less than N.
其中,N≥1,N为整数。Among them, N≥1, N is an integer.
电子设备若确定对待选子节点的总个数小于N,执行S1207;若确定对待选子节点的总个数大于或等于N,执行S1208。If the electronic device determines that the total number of child nodes to be selected is less than N, execute S1207; if it is determined that the total number of child nodes to be selected is greater than or equal to N, execute S1208.
S1207、电子设备从树形网络中删除对第二父节点的影响因子最大的子节点。S1207, the electronic device deletes the child node with the largest influence factor on the second parent node from the tree network.
其中,电子设备还将删除对第二父节点的影响因子最大的子节点导致的、没有子节点的所有父节点从树形网络中删除。The electronic device also deletes all parent nodes without child nodes from the tree network caused by deleting the child node with the largest influence factor on the second parent node.
示例性地,假设N等于2,对于图10所示的树形网络,若执行S1201-S1207,从树形网络中删除了节点E。针对删除节点E后的树形网络,继续执行S1201-S1207,从树形网络中删除了节点F,由于删除节点F后导致节点B没有了子节点,则还从树形网络中删除节点B。Exemplarily, assuming that N is equal to 2, for the tree-shaped network shown in FIG. 10, if S1201-S1207 are executed, the node E is deleted from the tree-shaped network. For the tree-shaped network after deleting node E, continue to execute S1201-S1207, and delete node F from the tree-shaped network. Since node B has no child nodes after node F is deleted, node B is also deleted from the tree-shaped network.
电子设备在S1207之后,继续执行S1201。After S1207, the electronic device continues to execute S1201.
S1208、电子设备将得到的待选子节点的性能指标,作为导致第一性能指标异常的第二性能指标。S1208. The electronic device uses the obtained performance index of the child node to be selected as the second performance index that causes the first performance index to be abnormal.
可以理解的是,电子设备可以对树形网络从根节点开始一级一级地查找对父节点的影响因子最大的节点,直至查找到的节点为叶子节点,确定查找到的节点的性能指标为导致第一性能指标异常的第二性能指标,同时还能确定查找到的节点到根节点的路径信息。It can be understood that the electronic device can search the tree network from the root node to the node with the largest influence factor on the parent node, until the found node is a leaf node, and determine that the performance index of the found node is: The second performance index that causes the first performance index to be abnormal can also determine the path information from the found node to the root node.
可以理解的是,上述电子设备或根因定位装置为了实现上述功能,其包含了执行各个功能相应的硬件结构和/或软件模块。本领域技术人员应该很容易意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,本申请实施例能够以硬件或硬件和计算机软件的结合形式来实现。某个功能究竟以硬件还是计算机软件驱动硬件的方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请实施例的范围。It can be understood that, in order to realize the above-mentioned functions, the above-mentioned electronic device or root cause locating apparatus includes corresponding hardware structures and/or software modules for executing each function. Those skilled in the art should easily realize that, in conjunction with the units and algorithm steps of the examples described in the embodiments disclosed herein, the embodiments of the present application can be implemented in hardware or a combination of hardware and computer software. Whether a function is performed by hardware or computer software driving hardware depends on the specific application and design constraints of the technical solution. Skilled persons may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the embodiments of the present application.
本申请实施例可以根据上述方法示例对上述电子设备或根因定位装置进行功能模块的划分,例如,可以对应各个功能划分各个功能模块,也可以将两个或两个以上的功能集成在一个处理模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。需要说明的是,本申请实施例中对模块的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。In this embodiment of the present application, the electronic device or the root cause locating device may be divided into functional modules according to the foregoing method examples. For example, each functional module may be divided corresponding to each function, or two or more functions may be integrated into one processing unit. in the module. The above-mentioned integrated modules can be implemented in the form of hardware, and can also be implemented in the form of software function modules. It should be noted that, the division of modules in the embodiments of the present application is schematic, and is only a logical function division, and there may be other division manners in actual implementation.
在采用对应各个功能划分各个功能模块的情况下,如图13所示,本申请实施例提供根因定位装置1300,根因定位装置1300包括:获取模块1301、计算模块1302和定位模块1303。In the case where each functional module is divided according to each function, as shown in FIG. 13 , an embodiment of the present application provides a root cause locating device 1300 .
其中,获取模块1301用于支持根因定位装置1300执行上述方法实施例中的S101-S102,和/或用于本文所描述的技术的其它过程。计算模块1302用于支持根因定位装置1300执行上述方法 实施例中的S103,S301-S302,S401-S402,S601-S603,S701-S703,S801-S803,和/或用于本文所描述的技术的其它过程。定位模块1303用于支持根因定位装置1300执行上述方法实施例中的S104,S604,S704,S901-S902,S1101-S1102,S1201-S1208,和/或用于本文所描述的技术的其它过程。Wherein, the obtaining module 1301 is used to support the root cause locating device 1300 to perform S101-S102 in the above method embodiments, and/or other processes for the techniques described herein. The computing module 1302 is used to support the root cause locating device 1300 to perform S103, S301-S302, S401-S402, S601-S603, S701-S703, S801-S803 in the above method embodiments, and/or for the techniques described herein other processes. The location module 1303 is used to support the root cause location device 1300 to perform S104, S604, S704, S901-S902, S1101-S1102, S1201-S1208, and/or other processes for the techniques described herein in the above method embodiments.
进一步的,上述根因定位装置1300还可以包括:模型训练模块。该模型训练模块用于支持根因定位装置1300执行上述方法实施例中的训练预设注意力模型的过程,和/或用于本文所描述的技术的其它过程。Further, the above root cause locating apparatus 1300 may further include: a model training module. The model training module is used to support the root cause locating apparatus 1300 to perform the process of training a preset attention model in the above method embodiments, and/or other processes for the techniques described herein.
当然,上述根因定位装置1300包括但不限于上述所列举的单元模块。例如,该根因定位装置1300还可以包括用于保存性能指标的采样值的存储单元。并且,上述功能单元的具体所能够实现的功能也包括但不限于上述实例所述的方法步骤对应的功能,根因定位装置1300的其他单元的详细描述可以参考其所对应方法步骤的详细描述,本申请实施例这里不再赘述。Of course, the above root cause locating device 1300 includes but is not limited to the unit modules listed above. For example, the root cause locating apparatus 1300 may further include a storage unit for storing sampled values of performance indicators. In addition, the specific functions that can be realized by the above functional units also include but are not limited to the functions corresponding to the method steps described in the above examples. For the detailed description of other units of the root cause locating device 1300, please refer to the detailed description of the corresponding method steps. This embodiment of the present application will not be repeated here.
在采用集成单元的情况下,上述获取模块1301、计算模块1302和定位模块1303等可以集成在一个处理模块中实现,上述存储单元可以是根因定位装置1300的存储模块。In the case of using an integrated unit, the above-mentioned acquisition module 1301 , calculation module 1302 , and location module 1303 can be integrated into one processing module, and the above-mentioned storage unit can be a storage module of the root cause location device 1300 .
图13示出了上述实施例中所涉及的电子设备的一种可能的结构示意图。该电子电子设备1400包括:处理模块1401、存储模块1402和通信模块1403。FIG. 13 shows a possible schematic structural diagram of the electronic device involved in the above embodiment. The electronic electronic device 1400 includes: a processing module 1401 , a storage module 1402 and a communication module 1403 .
该处理模块1401用于对电子设备1400进行控制管理。该存储模块1402,用于保存电子设备1400的程序代码和数据。通信模块1403用于与其他设备通信。如通信模块用于接收或者向其他设备发送的数据。The processing module 1401 is used to control and manage the electronic device 1400 . The storage module 1402 is used to store program codes and data of the electronic device 1400 . The communication module 1403 is used to communicate with other devices. For example, the communication module is used to receive or send data to other devices.
其中,处理模块1401可以是处理器或控制器,例如可以是CPU,通用处理器,数字信号处理器(Digital Signal Processor,DSP),专用集成电路(Application-Specific Integrated Circuit,ASIC),现场可编程门阵列(Field Programmable Gate Array,FPGA)或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本申请公开内容所描述的各种示例性的逻辑方框,模块和电路。所述处理器也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,DSP和微处理器的组合等等。通信模块1403可以是收发器、收发电路或通信接口等。存储模块1402可以是存储器。The processing module 1401 may be a processor or a controller, for example, a CPU, a general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application-Specific Integrated Circuit, ASIC), a field programmable Gate Array (Field Programmable Gate Array, FPGA) or other programmable logic devices, transistor logic devices, hardware components or any combination thereof. It may implement or execute the various exemplary logical blocks, modules and circuits described in connection with this disclosure. The processor may also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a DSP and a microprocessor, and the like. The communication module 1403 may be a transceiver, a transceiver circuit, a communication interface, or the like. The storage module 1402 may be a memory.
本申请实施例还提供一种计算机可读存储介质,该计算机可读存储介质中存储有计算机程序代码,当上述电子设备执行该计算机程序代码时,实现该电子设备执行图1、图3、图4、图6、图7、图8、图9、图11和图12中任一附图中的相关方法步骤实现上述实施例中的方法。Embodiments of the present application also provide a computer-readable storage medium, where computer program codes are stored in the computer-readable storage medium. When the above-mentioned electronic device executes the computer program code, the electronic device realizes the execution of FIG. 1 , FIG. 3 , and FIG. 4. The relevant method steps in any one of FIG. 6 , FIG. 7 , FIG. 8 , FIG. 9 , FIG. 11 and FIG. 12 implement the method in the above embodiment.
本申请实施例还提供了一种计算机程序产品,当该计算机程序产品在电子设备上运行时,使得电子设备执行图1、图3、图4、图6、图7、图8、图9、图11和图12中任一附图中的相关方法步骤实现上述实施例中的方法。Embodiments of the present application also provide a computer program product, which, when the computer program product runs on an electronic device, enables the electronic device to execute FIG. 1 , FIG. 3 , FIG. 4 , FIG. 6 , FIG. 7 , FIG. 8 , FIG. 9 , The relevant method steps in any of Figures 11 and 12 implement the methods in the above-described embodiments.
其中,本申请提供的根因定位装置1300、电子设备1400、计算机可读存储介质或者计算机程序产品均用于执行上文所提供的对应的方法,因此,其所能达到的有益效果可参考上文所提供的对应的方法中的有益效果,此处不再赘述。Wherein, the root cause locating device 1300, the electronic device 1400, the computer-readable storage medium or the computer program product provided in this application are all used to execute the corresponding method provided above, therefore, the beneficial effects that can be achieved can refer to the above The beneficial effects in the corresponding method provided in this article will not be repeated here.
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。From the description of the above embodiments, those skilled in the art can clearly understand that for the convenience and brevity of the description, only the division of the above functional modules is used as an example for illustration. In practical applications, the above functions can be allocated as required. It is completed by different functional modules, that is, the internal structure of the device is divided into different functional modules, so as to complete all or part of the functions described above. For the specific working process of the system, apparatus and unit described above, reference may be made to the corresponding process in the foregoing method embodiments, and details are not described herein again.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the device embodiments described above are only illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be Incorporation may either be integrated into another system, or some features may be omitted, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:快闪存储器、移动硬盘、只读存储器、随机存取存储器、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the present application can be embodied in the form of software products in essence, or the parts that contribute to the prior art, or all or part of the technical solutions, and the computer software products are stored in a storage medium , which includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor to execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage medium includes: flash memory, removable hard disk, read-only memory, random access memory, magnetic disk or optical disk and other media that can store program codes.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。The above are only specific embodiments of the present application, but the protection scope of the present application is not limited to this. should be covered within the scope of protection of this application. Therefore, the protection scope of the present application should be subject to the protection scope of the claims.

Claims (20)

  1. 一种根因定位方法,其特征在于,所述方法包括:A root cause locating method, characterized in that the method comprises:
    获取第一性能指标的树形网络;其中,所述树形网络包括所述第一性能指标和影响所述第一性能指标的多个第二性能指标,所述第一性能指标对应所述树形网络的根节点,所述多个第二性能指标对应所述树形网络中除所述根节点之外的其他节点;Obtaining a tree-shaped network of first performance indicators; wherein, the tree-shaped network includes the first performance indicators and a plurality of second performance indicators that affect the first performance indicators, and the first performance indicators correspond to the tree The root node of the tree-shaped network, the plurality of second performance indicators correspond to other nodes in the tree-shaped network except the root node;
    获取预设定位周期内采样得到的所述第一性能指标的采样值和所述多个第二性能指标的采样值;acquiring the sampled value of the first performance index and the sampled values of the plurality of second performance indicators obtained by sampling within a preset positioning period;
    根据所述第一性能指标的采样值和所述多个第二性能指标的采样值,计算所述树形网络中各个子节点对父节点的影响因子;其中,每个子节点对父节点的影响因子用于表征所述每个子节点的性能指标对父节点的性能指标的影响程度;According to the sampled value of the first performance index and the sampled value of the plurality of second performance indicators, the influence factor of each child node in the tree network on the parent node is calculated; wherein, the influence of each child node on the parent node The factor is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node;
    根据所述树形网络中各个子节点对父节点的影响因子,确定出所述树形网络中导致所述第一性能指标异常的第二性能指标。According to the influence factor of each child node in the tree-shaped network on the parent node, the second performance index in the tree-shaped network that causes the abnormality of the first performance index is determined.
  2. 根据权利要求1所述的方法,其特征在于,所述预设定位周期包括Q个采样时刻,Q≥2,Q为整数;The method according to claim 1, wherein the preset positioning period includes Q sampling moments, where Q≥2, and Q is an integer;
    所述获取预设定位周期内采样得到的所述第一性能指标的采样值和所述多个第二性能指标的采样值,包括:The acquiring the sampled value of the first performance indicator and the sampled values of the plurality of second performance indicators sampled within the preset positioning period includes:
    在所述预设定位周期的第q个采样时刻,采样得到所述第一性能指标的第q个采样值和所述多个第二性能指标的第q个采样值;其中,q在{1,……,Q}中依次取值。At the qth sampling time of the preset positioning period, the qth sampling value of the first performance index and the qth sampling value of the plurality of second performance indicators are obtained by sampling; wherein, q is in {1 , ..., Q} and take values in turn.
  3. 根据权利要求2所述的方法,其特征在于,所述根据所述第一性能指标的采样值和所述多个第二性能指标的采样值,计算所述树形网络中各个子节点对父节点的影响因子,包括:The method according to claim 2, wherein, according to the sampled value of the first performance index and the sampled value of the plurality of second performance indicators, calculating the relative value of each child node in the tree network to the parent The impact factor of the node, including:
    针对所述树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为所述第一父节点的影响因子预估模型的输入,运行所述第一父节点的影响因子预估模型,输出所述第一父节点的每个子节点对所述第一父节点的第q个影响因子;其中,所述第一父节点为所述树形网络中的任一个父节点;不同父节点的影响因子预估模型不同;所述第一父节点的影响因子预估模型具有从所述第一父节点的所有子节点的性能指标的采样值中,提取出所述第一父节点的每个子节点对第一父节点的影响因子的能力;Execute for each parent node in the tree network: take the qth sampled value of the performance index of all child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, run The impact factor estimation model of the first parent node, outputting the qth impact factor of each child node of the first parent node on the first parent node; wherein, the first parent node is the tree Any parent node in the shape network; the impact factor estimation models of different parent nodes are different; the impact factor estimation model of the first parent node has sampling values from the performance indicators of all child nodes of the first parent node , extracting the ability of each child node of the first parent node to influence factors of the first parent node;
    针对所述第一父节点的每个子节点执行:计算第一子节点对所述第一父节点的Q个影响因子的平均值,得到所述第一子节点对所述第一父节点的影响因子;其中,所述第一子节点是所述第一父节点的任一个子节点。Perform for each child node of the first parent node: calculate the average value of Q influence factors of the first child node on the first parent node, and obtain the impact of the first child node on the first parent node factor; wherein, the first child node is any child node of the first parent node.
  4. 根据权利要求2所述的方法,其特征在于,所述根据所述第一性能指标的采样值和所述多个第二性能指标的采样值,计算所述树形网络中各个子节点对父节点的影响因子,包括:The method according to claim 2, wherein, according to the sampled value of the first performance index and the sampled value of the plurality of second performance indicators, calculating the relative value of each child node in the tree network to the parent The impact factor of the node, including:
    针对所述树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为所述第一父节点的影响因子预估模型的输入,运行所述第一父节点的影响因子预估模型,输出所述第一父节点的每个子节点对所述第一父节点的第q个影响因子;其中,所述第一父节点为所述树形网络中的任一个父节点;不同父节点的影响因子预估模型不同;所述第一父节点的影响因子预估模型具有从所述第一父节点的所有子节点的性能指标的采样值中,提取出所述第一父节点的每个子节点对第一父节点的影响因子的能力;Execute for each parent node in the tree network: take the qth sampled value of the performance index of all child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, run The impact factor estimation model of the first parent node, outputting the qth impact factor of each child node of the first parent node on the first parent node; wherein, the first parent node is the tree Any parent node in the shape network; the impact factor estimation models of different parent nodes are different; the impact factor estimation model of the first parent node has sampling values from the performance indicators of all child nodes of the first parent node , extracting the ability of each child node of the first parent node to influence factors of the first parent node;
    若所述Q个采样时刻中的第p个采样时刻为性能指标的突变时刻,针对所述第一父节点的每个子节点执行:计算第一子节点对所述第一父节点的前p-1个影响因子的平均值,得到突变前所 述第一子节点对所述第一父节点的影响因子;计算第一子节点对所述第一父节点的第p个-第Q个影响因子的平均值,得到突变后所述第一子节点对所述第一父节点的影响因子;计算突变前所述第一子节点对所述第一父节点的影响因子与突变后所述第一子节点对所述第一父节点的影响因子的差值或比值,将所述差值或比值作为所述第一子节点对所述第一父节点的影响因子;If the p-th sampling time in the Q sampling times is the sudden change of the performance index, perform for each child node of the first parent node: calculate the first p- of the first child node to the first parent node. The average value of 1 impact factor, to obtain the impact factor of the first child node on the first parent node before mutation; calculate the p-th -Qth impact factor of the first child node on the first parent node to obtain the influence factor of the first child node on the first parent node after mutation; calculate the influence factor of the first child node on the first parent node before mutation and the The difference or ratio of the influence factor of the child node to the first parent node, and the difference or ratio is used as the influence factor of the first child node to the first parent node;
    其中,p为小于或等于Q的正整数;所述第一子节点是所述第一父节点的任一个子节点。Wherein, p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
  5. 根据权利要求3或4所述的方法,其特征在于,在所述根据所述第一性能指标的采样值和所述多个第二性能指标的采样值,计算所述树形网络中各个子节点对父节点的影响因子之前,所述方法还包括:The method according to claim 3 or 4, characterized in that, in the calculation according to the sampled value of the first performance index and the sampled value of the plurality of second performance indicators, each child in the tree network is calculated Before the influence factor of the node on the parent node, the method further includes:
    采用所述第一父节点的性能指标的第h个样本值和所述第一父节点的所有子节点的性能指标的第h个采样值作为训练样本,训练预设注意力模型,得到所述第一父节点的影响因子预估模型;The h-th sample value of the performance index of the first parent node and the h-th sample value of the performance index of all child nodes of the first parent node are used as training samples to train the preset attention model to obtain the The impact factor prediction model of the first parent node;
    其中,h在{1,……,H}中依次取值,H≥100,H为整数。Among them, h takes values in sequence in {1,...,H}, H≥100, and H is an integer.
  6. 根据权利要求2所述的方法,其特征在于,所述根据所述第一性能指标的采样值和所述多个第二性能指标的采样值,计算所述树形网络中各个子节点对父节点的影响因子,包括:The method according to claim 2, wherein, according to the sampled value of the first performance index and the sampled value of the plurality of second performance indicators, calculating the relative value of each child node in the tree network to the parent The impact factor of the node, including:
    针对所述树形网络中的每个子节点执行以下步骤,以获取所述每个子节点对父节点的影响因子:Perform the following steps for each child node in the tree network to obtain the influence factor of each child node on the parent node:
    根据第二子节点的性能指标的Q个采样值,计算所述第二子节点的性能指标的第一残差;其中,所述第二子节点是所述树形网络的任一个子节点;所述第一残差用于表征所述第二子节点的性能指标的Q个采样值的波动情况;Calculate the first residual of the performance index of the second child node according to the Q sampled values of the performance index of the second child node; wherein the second child node is any child node of the tree network; The first residual is used to represent the fluctuation of the Q sampling values of the performance index of the second child node;
    根据所述第二子节点的父节点的性能指标的Q个采样值,计算所述第二子节点的父节点的性能指标的第二残差,所述第二残差用于表征所述第二子节点的父节点的性能指标的Q个采样值的波动情况;Calculate a second residual of the performance index of the parent node of the second child node according to the Q sampled values of the performance index of the parent node of the second child node, where the second residual is used to characterize the first The fluctuation of the Q sampling values of the performance index of the parent node of the two child nodes;
    计算所述第一残差和所述第二残差之间的相关性,并根据所述第一残差和所述第二残差之间的相关性,获取所述第二子节点对父节点的影响因子。Calculate the correlation between the first residual and the second residual, and according to the correlation between the first residual and the second residual, obtain the relationship between the second child node and the parent The influence factor of the node.
  7. 根据权利要求6所述的方法,其特征在于,所述根据第二子节点的性能指标的Q个采样值,计算所述第二子节点的性能指标的第一残差,包括:The method according to claim 6, wherein the calculating the first residual of the performance index of the second child node according to the Q sampling values of the performance index of the second child node comprises:
    利用时间序列分解算法,从所述第二子节点的性能指标的Q个采样值中,提取所述第二子节点的性能指标的趋势分量和所述第二子节点的性能指标的周期分量;其中,所述趋势分量用于表示所述第二子节点的性能指标的Q个采样值的变化趋势,所述周期分量用于表示所述第二子节点的性能指标的Q个采样值的周期性变动;Using a time series decomposition algorithm, extract the trend component of the performance index of the second child node and the periodic component of the performance index of the second child node from the Q sampled values of the performance index of the second child node; Wherein, the trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node, and the period component is used to represent the period of the Q sampled values of the performance index of the second child node sexual change;
    将所述第二子节点的性能指标的Q个采样值,减去所述趋势分量和所述周期分量,得到所述第二子节点的性能指标的第一残差。The trend component and the period component are subtracted from the Q sampled values of the performance index of the second child node to obtain a first residual of the performance index of the second child node.
  8. 根据权利要求1-7中任一项所述的方法,其特征在于,所述根据所述树形网络中各个子节点对父节点的影响因子,确定出所述树形网络中导致所述第一性能指标异常的第二性能指标,包括:The method according to any one of claims 1-7, characterized in that, according to the influence factor of each child node in the tree network on the parent node, determining that the tree network causes the first A second performance index with abnormal performance index, including:
    根据所述树形网络中各个子节点对父节点的影响因子,计算所述树形网络中叶子节点对所述根节点的影响因子;According to the influence factor of each child node in the tree network on the parent node, calculate the influence factor of the leaf node in the tree network on the root node;
    将所述树形网络中的N个叶子节点的性能指标,确定为导致所述第一性能指标异常的第二性能指标;其中,所述N个叶子节点对所述根节点的影响因子大于所述树形网络中其他叶子节点对所述根节点的影响因子,N≥1,N为整数。Determine the performance indicators of the N leaf nodes in the tree-shaped network as the second performance indicators that cause the first performance indicator to be abnormal; wherein, the impact factor of the N leaf nodes on the root node is greater than all Influence factor of other leaf nodes in the tree network on the root node, N≥1, N is an integer.
  9. 根据权利要求1-7中任一项所述的方法,其特征在于,所述树形网络中包括第一预设节点,所述第一预设节点不是叶子节点,所述第一预设节点的子节点对所述第一预设节点的影响因子小于或等于预设阈值;The method according to any one of claims 1-7, wherein the tree network includes a first preset node, the first preset node is not a leaf node, and the first preset node The influence factor of the child node on the first preset node is less than or equal to the preset threshold;
    其中,所述根据所述树形网络中各个子节点对父节点的影响因子,确定出所述树形网络中导致所述第一性能指标异常的第二性能指标,包括:Wherein, according to the influence factor of each child node in the tree network on the parent node, determining the second performance index in the tree network that causes the first performance index to be abnormal, including:
    根据所述树形网络中各个子节点对父节点的影响因子,计算所述树形网络中叶子节点对所述根节点的影响因子,并计算所述第一预设节点对所述根节点的影响因子;According to the influence factor of each child node in the tree network on the parent node, calculate the influence factor of the leaf node in the tree network on the root node, and calculate the influence factor of the first preset node on the root node. Impact factor;
    将所述叶子节点和所述第一预设节点中的M个节点的性能指标,确定为导致所述第一性能指标异常的第二性能指标;其中,所述M个节点对所述根节点的影响因子大于所述树形网络中其他节点对所述根节点的影响因子,M≥1,M为整数。Determining the performance indicators of M nodes in the leaf node and the first preset node as the second performance indicator that causes the first performance indicator to be abnormal; wherein the M nodes are responsible for the root node The influence factor of is greater than the influence factor of other nodes in the tree network on the root node, M≥1, and M is an integer.
  10. 一种根因定位装置,其特征在于,所述装置包括:A root cause locating device, characterized in that the device comprises:
    获取模块,用于获取第一性能指标的树形网络,获取预设定位周期内采样得到的所述第一性能指标的采样值和所述多个第二性能指标的采样值;其中,所述树形网络包括所述第一性能指标和影响所述第一性能指标的多个第二性能指标,所述第一性能指标对应所述树形网络的根节点,所述多个第二性能指标对应所述树形网络中除所述根节点之外的其他节点;an obtaining module, configured to obtain a tree-shaped network of the first performance index, and obtain the sampling value of the first performance index and the sampling value of the plurality of second performance indexes obtained by sampling within a preset positioning period; wherein, the The tree network includes the first performance indicator and a plurality of second performance indicators that affect the first performance indicator, the first performance indicator corresponds to the root node of the tree network, and the plurality of second performance indicators corresponding to other nodes in the tree network except the root node;
    计算模块,用于根据所述第一性能指标的采样值和所述多个第二性能指标的采样值,计算所述树形网络中各个子节点对父节点的影响因子;其中,每个子节点对父节点的影响因子用于表征所述每个子节点的性能指标对父节点的性能指标的影响程度;a calculation module, configured to calculate the influence factor of each child node in the tree network on the parent node according to the sampling value of the first performance index and the sampling value of the plurality of second performance indices; wherein, each child node The influence factor on the parent node is used to characterize the degree of influence of the performance index of each child node on the performance index of the parent node;
    定位模块,用于根据所述树形网络中各个子节点对父节点的影响因子,确定出所述树形网络中导致所述第一性能指标异常的第二性能指标。The positioning module is configured to determine the second performance index in the tree network that causes the abnormality of the first performance index according to the influence factor of each child node in the tree network on the parent node.
  11. 根据权利要求10所述的装置,其特征在于,所述预设定位周期包括Q个采样时刻,Q≥2,Q为整数;The device according to claim 10, wherein the preset positioning period includes Q sampling moments, where Q≥2, and Q is an integer;
    所述获取模块,具体用于在所述预设定位周期的第q个采样时刻,采样得到所述第一性能指标的第q个采样值和所述多个第二性能指标的第q个采样值;其中,q在{1,……,Q}中依次取值。The acquisition module is specifically configured to obtain the qth sampling value of the first performance index and the qth sampling value of the plurality of second performance indicators by sampling at the qth sampling time of the preset positioning period value; among them, q takes values in sequence in {1,...,Q}.
  12. 根据权利要求11所述的装置,其特征在于,The apparatus of claim 11, wherein:
    所述计算模块,具体用于:The computing module is specifically used for:
    针对所述树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采样值,作为所述第一父节点的影响因子预估模型的输入,运行所述第一父节点的影响因子预估模型,输出所述第一父节点的每个子节点对所述第一父节点的第q个影响因子;其中,所述第一父节点为所述树形网络中的任一个父节点;不同父节点的影响因子预估模型不同;所述第一父节点的影响因子预估模型具有从所述第一父节点的所有子节点的性能指标的采样值中,提取出所述第一父节点的每个子节点对第一父节点的影响因子的能力;Execute for each parent node in the tree network: take the qth sampled value of the performance index of all child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, run The impact factor estimation model of the first parent node, outputting the qth impact factor of each child node of the first parent node on the first parent node; wherein, the first parent node is the tree Any parent node in the shape network; the impact factor estimation models of different parent nodes are different; the impact factor estimation model of the first parent node has sampling values from the performance indicators of all child nodes of the first parent node , extracting the ability of each child node of the first parent node to influence factors of the first parent node;
    针对所述第一父节点的每个子节点执行:计算第一子节点对所述第一父节点的Q个影响因子的平均值,得到所述第一子节点对所述第一父节点的影响因子;其中,所述第一子节点是所述第一父节点的任一个子节点。Perform for each child node of the first parent node: calculate the average value of Q influence factors of the first child node on the first parent node, and obtain the impact of the first child node on the first parent node factor; wherein, the first child node is any child node of the first parent node.
  13. 根据权利要求11所述的装置,其特征在于,The apparatus of claim 11, wherein:
    所述计算模块,具体用于:The computing module is specifically used for:
    针对所述树形网络中的每个父节点执行:将第一父节点的所有子节点的性能指标的第q个采 样值,作为所述第一父节点的影响因子预估模型的输入,运行所述第一父节点的影响因子预估模型,输出所述第一父节点的每个子节点对所述第一父节点的第q个影响因子;其中,所述第一父节点为所述树形网络中的任一个父节点;不同父节点的影响因子预估模型不同;所述第一父节点的影响因子预估模型具有从所述第一父节点的所有子节点的性能指标的采样值中,提取出所述第一父节点的每个子节点对第一父节点的影响因子的能力;Execute for each parent node in the tree network: take the qth sampled value of the performance index of all child nodes of the first parent node as the input of the impact factor estimation model of the first parent node, run The impact factor estimation model of the first parent node, outputting the qth impact factor of each child node of the first parent node on the first parent node; wherein, the first parent node is the tree Any parent node in the shape network; the impact factor estimation models of different parent nodes are different; the impact factor estimation model of the first parent node has sampling values from the performance indicators of all child nodes of the first parent node , extracting the ability of each child node of the first parent node to influence factors of the first parent node;
    若所述Q个采样时刻中的第p个采样时刻为性能指标的突变时刻,针对所述第一父节点的每个子节点执行:计算第一子节点对所述第一父节点的前p-1个影响因子的平均值,得到突变前所述第一子节点对所述第一父节点的影响因子;计算第一子节点对所述第一父节点的第p个-第Q个影响因子的平均值,得到突变后所述第一子节点对所述第一父节点的影响因子;计算突变前所述第一子节点对所述第一父节点的影响因子与突变后所述第一子节点对所述第一父节点的影响因子的差值或比值,将所述差值或比值作为所述第一子节点对所述第一父节点的影响因子;If the p-th sampling time in the Q sampling times is the sudden change of the performance index, perform for each child node of the first parent node: calculate the first p- of the first child node to the first parent node. The average value of 1 impact factor, to obtain the impact factor of the first child node on the first parent node before mutation; calculate the p-th -Qth impact factor of the first child node on the first parent node to obtain the influence factor of the first child node on the first parent node after mutation; calculate the influence factor of the first child node on the first parent node before mutation and the The difference or ratio of the influence factor of the child node to the first parent node, and the difference or ratio is used as the influence factor of the first child node to the first parent node;
    其中,p为小于或等于Q的正整数;所述第一子节点是所述第一父节点的任一个子节点。Wherein, p is a positive integer less than or equal to Q; the first child node is any child node of the first parent node.
  14. 根据权利要求12或13所述的装置,其特征在于,所述装置还包括:The device according to claim 12 or 13, wherein the device further comprises:
    模型训练模块,用于在所述根据所述第一性能指标的采样值和所述多个第二性能指标的采样值,计算所述树形网络中各个子节点对父节点的影响因子之前,采用所述第一父节点的性能指标的第h个样本值和所述第一父节点的所有子节点的性能指标的第h个采样值作为训练样本,训练预设注意力模型,得到所述第一父节点的影响因子预估模型;A model training module, configured to calculate, according to the sampling value of the first performance index and the sampling value of the plurality of second performance indices, before calculating the influence factor of each child node in the tree network on the parent node, The h-th sample value of the performance index of the first parent node and the h-th sample value of the performance index of all child nodes of the first parent node are used as training samples to train the preset attention model to obtain the The impact factor prediction model of the first parent node;
    其中,h在{1,……,H}中依次取值,H≥100,H为整数。Among them, h takes values in sequence in {1,...,H}, H≥100, and H is an integer.
  15. 根据权利要求11所述的装置,其特征在于,The apparatus of claim 11, wherein:
    所述计算模块,具体用于针对所述树形网络中的每个子节点执行以下步骤,以获取所述每个子节点对父节点的影响因子:The computing module is specifically configured to perform the following steps for each child node in the tree network to obtain the influence factor of each child node on the parent node:
    根据第二子节点的性能指标的Q个采样值,计算所述第二子节点的性能指标的第一残差;其中,所述第二子节点是所述树形网络的任一个子节点;所述第一残差用于表征所述第二子节点的性能指标的Q个采样值的波动情况;Calculate the first residual of the performance index of the second child node according to the Q sampled values of the performance index of the second child node; wherein the second child node is any child node of the tree network; The first residual is used to represent the fluctuation of the Q sampling values of the performance index of the second child node;
    根据所述第二子节点的父节点的性能指标的Q个采样值,计算所述第二子节点的父节点的性能指标的第二残差,所述第二残差用于表征所述第二子节点的父节点的性能指标的Q个采样值的波动情况;Calculate a second residual of the performance index of the parent node of the second child node according to the Q sampled values of the performance index of the parent node of the second child node, where the second residual is used to characterize the first The fluctuation of the Q sampling values of the performance index of the parent node of the two child nodes;
    计算所述第一残差和所述第二残差之间的相关性,并根据所述第一残差和所述第二残差之间的相关性,获取所述第二子节点对父节点的影响因子。Calculate the correlation between the first residual and the second residual, and according to the correlation between the first residual and the second residual, obtain the relationship between the second child node and the parent The influence factor of the node.
  16. 根据权利要求15所述的装置,其特征在于,The apparatus of claim 15, wherein:
    所述计算模块,具体用于:The computing module is specifically used for:
    利用时间序列分解算法,从所述第二子节点的性能指标的Q个采样值中,提取所述第二子节点的性能指标的趋势分量和所述第二子节点的性能指标的周期分量;其中,所述趋势分量用于表示所述第二子节点的性能指标的Q个采样值的变化趋势,所述周期分量用于表示所述第二子节点的性能指标的Q个采样值的周期性变动;Using a time series decomposition algorithm, extract the trend component of the performance index of the second child node and the periodic component of the performance index of the second child node from the Q sampled values of the performance index of the second child node; Wherein, the trend component is used to represent the change trend of the Q sampled values of the performance index of the second child node, and the period component is used to represent the period of the Q sampled values of the performance index of the second child node sexual change;
    将所述第二子节点的性能指标的Q个采样值,减去所述趋势分量和所述周期分量,得到所述第二子节点的性能指标的第一残差。The trend component and the period component are subtracted from the Q sampled values of the performance index of the second child node to obtain a first residual of the performance index of the second child node.
  17. 根据权利要求10-16中任一项所述的装置,其特征在于,The device according to any one of claims 10-16, characterized in that,
    所述定位模块,具体用于:The positioning module is specifically used for:
    根据所述树形网络中各个子节点对父节点的影响因子,计算所述树形网络中叶子节点对所述根节点的影响因子;According to the influence factor of each child node in the tree network on the parent node, calculate the influence factor of the leaf node in the tree network on the root node;
    将所述树形网络中的N个叶子节点的性能指标,确定为导致所述第一性能指标异常的第二性能指标;其中,所述N个叶子节点对所述根节点的影响因子大于所述树形网络中其他叶子节点对所述根节点的影响因子,N≥1,N为整数。Determine the performance indicators of the N leaf nodes in the tree-shaped network as the second performance indicators that cause the first performance indicator to be abnormal; wherein, the impact factor of the N leaf nodes on the root node is greater than all Influence factor of other leaf nodes in the tree network on the root node, N≥1, N is an integer.
  18. 根据权利要求10-16中任一项所述的装置,其特征在于,所述树形网络中包括第一预设节点,所述第一预设节点不是叶子节点,所述第一预设节点的子节点对所述第一预设节点的影响因子小于或等于预设阈值;The device according to any one of claims 10-16, wherein the tree network includes a first preset node, the first preset node is not a leaf node, and the first preset node The influence factor of the child node on the first preset node is less than or equal to the preset threshold;
    其中,所述定位模块,具体用于:Wherein, the positioning module is specifically used for:
    根据所述树形网络中各个子节点对父节点的影响因子,计算所述树形网络中叶子节点对所述根节点的影响因子,并计算所述第一预设节点对所述根节点的影响因子;According to the influence factor of each child node in the tree network on the parent node, calculate the influence factor of the leaf node in the tree network on the root node, and calculate the influence factor of the first preset node on the root node. Impact factor;
    将所述叶子节点和所述第一预设节点中的M个节点的性能指标,确定为导致所述第一性能指标异常的第二性能指标;其中,所述M个节点对所述根节点的影响因子大于所述树形网络中其他节点对所述根节点的影响因子,M≥1,M为整数。Determining the performance indicators of M nodes in the leaf node and the first preset node as the second performance indicator that causes the first performance indicator to be abnormal; wherein the M nodes are responsible for the root node The influence factor of is greater than the influence factor of other nodes in the tree network on the root node, M≥1, and M is an integer.
  19. 一种电子设备,其特征在于,所述电子设备包括:处理器和用于存储所述处理器可执行指令的存储器;An electronic device, characterized in that the electronic device comprises: a processor and a memory for storing executable instructions of the processor;
    其中,所述处理器被配置为执行所述指令,使得所述电子设备执行如权利要求1-9中任一项所述的根因定位方法。Wherein, the processor is configured to execute the instructions, so that the electronic device executes the root cause locating method according to any one of claims 1-9.
  20. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质上存储有计算机指令,当所述计算机指令在电子设备上运行时,使得所述电子设备执行如权利要求1-9中任一项所述的根因定位方法。A computer-readable storage medium, characterized in that the computer-readable storage medium has computer instructions stored thereon, and when the computer instructions are executed on an electronic device, the electronic device is made to perform the execution as in claims 1-9. The root cause localization method of any one.
PCT/CN2020/106404 2020-07-31 2020-07-31 Root cause localization method and apparatus, and electronic device WO2022021420A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/106404 WO2022021420A1 (en) 2020-07-31 2020-07-31 Root cause localization method and apparatus, and electronic device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/106404 WO2022021420A1 (en) 2020-07-31 2020-07-31 Root cause localization method and apparatus, and electronic device

Publications (1)

Publication Number Publication Date
WO2022021420A1 true WO2022021420A1 (en) 2022-02-03

Family

ID=80036975

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/106404 WO2022021420A1 (en) 2020-07-31 2020-07-31 Root cause localization method and apparatus, and electronic device

Country Status (1)

Country Link
WO (1) WO2022021420A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116050713A (en) * 2023-03-13 2023-05-02 深圳迅策科技有限公司 Index calculation method, apparatus, electronic device and computer program product
CN116128571A (en) * 2023-04-12 2023-05-16 花瓣云科技有限公司 Advertisement exposure analysis method and related device
CN117708764B (en) * 2024-02-06 2024-05-03 青岛天高智慧科技有限公司 Intelligent analysis method for student consumption data based on campus card

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102946649A (en) * 2012-10-30 2013-02-27 浙江理工大学 Wireless sensor network topology construction method based on spanning tree
US20140254433A1 (en) * 2011-08-05 2014-09-11 Nec Corporation Sensor network system, sensor network control method, sensor node, sensor node control method, and sensor node control program
CN105634787A (en) * 2014-11-26 2016-06-01 华为技术有限公司 Evaluation method, prediction method and device and system for network key indicator
CN110851694A (en) * 2018-07-26 2020-02-28 王飞 Personalized recommendation system based on user memory network and tree structure depth model
CN110888755A (en) * 2019-11-15 2020-03-17 亚信科技(中国)有限公司 Method and device for searching abnormal root node of micro-service system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140254433A1 (en) * 2011-08-05 2014-09-11 Nec Corporation Sensor network system, sensor network control method, sensor node, sensor node control method, and sensor node control program
CN102946649A (en) * 2012-10-30 2013-02-27 浙江理工大学 Wireless sensor network topology construction method based on spanning tree
CN105634787A (en) * 2014-11-26 2016-06-01 华为技术有限公司 Evaluation method, prediction method and device and system for network key indicator
CN110851694A (en) * 2018-07-26 2020-02-28 王飞 Personalized recommendation system based on user memory network and tree structure depth model
CN110888755A (en) * 2019-11-15 2020-03-17 亚信科技(中国)有限公司 Method and device for searching abnormal root node of micro-service system

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116050713A (en) * 2023-03-13 2023-05-02 深圳迅策科技有限公司 Index calculation method, apparatus, electronic device and computer program product
CN116128571A (en) * 2023-04-12 2023-05-16 花瓣云科技有限公司 Advertisement exposure analysis method and related device
CN117708764B (en) * 2024-02-06 2024-05-03 青岛天高智慧科技有限公司 Intelligent analysis method for student consumption data based on campus card

Similar Documents

Publication Publication Date Title
CN108520357B (en) Method and device for judging line loss abnormality reason and server
US10361923B2 (en) Method and device for discovering network topology
CN106548357B (en) Client satisfaction evaluation method and system
WO2017028632A1 (en) Method of predicting distribution network operation reliability
CN111538951B (en) Abnormality positioning method and device
CN109803295B (en) Method and device for evaluating communication cell rectification priority
WO2022021420A1 (en) Root cause localization method and apparatus, and electronic device
US9740816B2 (en) Method and apparatus for network traffic simulation
CN111984544B (en) Device performance test method and device, electronic device and storage medium
CN106803799B (en) Performance test method and device
WO2018184304A1 (en) Method and device for detecting health state of network element
WO2016188498A1 (en) Wireless network throughput evaluating method and device
CN109242250A (en) A kind of user's behavior confidence level detection method based on Based on Entropy method and cloud model
CN112054943A (en) Traffic prediction method for mobile network base station
Wisse et al. Relieving the elicitation burden of Bayesian belief networks.
CN111935741A (en) Method, device and system for detecting poor quality cell of communication network
Murudkar et al. QoE-driven anomaly detection in self-organizing mobile networks using machine learning
CN109657122B (en) Academic team important member identification method based on academic big data
CN113867966A (en) Cloud resource scheduling method in hybrid cloud mode
CN102957572A (en) Method and device for evaluating performance of system
Ghanem et al. Centrality metrics in dynamic networks: a comparison study
CN108459991A (en) A method of obtaining equipment dependability data
US20170317890A1 (en) Inferring Component Parameters for Components in a Network
CN116523244A (en) Testing manpower risk early warning method based on outsourcing resources
CN115866235A (en) Video quality evaluation method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20947760

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20947760

Country of ref document: EP

Kind code of ref document: A1