WO2022028120A1 - Indicator detection model acquisition method and apparatus, fault locating method and apparatus, and device and storage medium - Google Patents

Indicator detection model acquisition method and apparatus, fault locating method and apparatus, and device and storage medium Download PDF

Info

Publication number
WO2022028120A1
WO2022028120A1 PCT/CN2021/100381 CN2021100381W WO2022028120A1 WO 2022028120 A1 WO2022028120 A1 WO 2022028120A1 CN 2021100381 W CN2021100381 W CN 2021100381W WO 2022028120 A1 WO2022028120 A1 WO 2022028120A1
Authority
WO
WIPO (PCT)
Prior art keywords
leaf node
key kpi
mobile communication
communication network
indicators
Prior art date
Application number
PCT/CN2021/100381
Other languages
French (fr)
Chinese (zh)
Inventor
冯媛
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2022028120A1 publication Critical patent/WO2022028120A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/06Testing, supervising or monitoring using simulated traffic
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0677Localisation of faults

Definitions

  • Embodiments of the present application relate to the field of failure analysis of semiconductor devices, and in particular, to a method, apparatus, device, and storage medium for acquiring an index detection model and locating faults.
  • AIOPS intelligent operation and maintenance
  • An embodiment of the present application provides a method for obtaining an index detection model for a mobile communication network, including: obtaining key KPI indexes of each leaf node in a mobile communication network according to a preset key KPI index system, where the key KPI index system includes: At least two types of key KPI indicators; after aggregating the key KPI indicators of each leaf node by category, extract the characteristic parameters of each type of key KPI indicators aggregated by each leaf node; The characteristic parameters of the model are used as the input of the preset model for model training, and the index detection model corresponding to each type of key KPI index is obtained.
  • the embodiment of the present application also provides a fault location method for a mobile communication network, including: using the index detection model obtained by the above method for obtaining an index detection model for a mobile communication network, to each leaf node in the mobile communication network Perform index detection; for the target leaf node whose detection result is that the key KPI index is abnormal, it is determined whether the target leaf node is a fault critical node based on the historical status of the target leaf node.
  • the embodiment of the present application also provides a device for obtaining an index detection model for a mobile communication network, including: an obtaining module, configured to obtain the key KPI index of each leaf node in the mobile communication network according to a preset key KPI index system,
  • the key KPI indicator system includes at least two types of key KPI indicators;
  • the training module is used to aggregate the key KPI indicators of each leaf node by category, and then extract the characteristic parameters of each type of key KPI indicators aggregated by each leaf node, And the feature parameters of the same type of key KPI indicators of each leaf node are used as the input of the preset model for model training, and the indicator detection model corresponding to each type of key KPI indicators is obtained.
  • the embodiment of the present application further provides a fault location device for a mobile communication network, including: an abnormality detection module, configured to use the index detection model obtained by the above-mentioned method for obtaining an index detection model for a mobile communication network to Each leaf node in the network performs index detection; the fault location module is used to determine whether the target leaf node is a fault critical node based on the historical status of the target leaf node for which the detection result is an abnormal key KPI index.
  • Embodiments of the present application also provide an electronic device, including a processor, a memory, and a communication bus connecting the processor and the memory; the memory stores a first computer program, and the first computer program can be executed by the processor to realize the above-mentioned functions and/or, the memory stores a second computer program, and the second computer program can be executed by the processor to implement the steps of the above method for locating faults in the mobile communication network.
  • Embodiments of the present application further provide a computer-readable storage medium, where the computer-readable storage medium stores a first computer program, and the first computer program can be executed by a processor to implement the above index detection model for a mobile communication network and/or, the computer-readable storage medium stores a second computer program, and the second computer program can be executed by the processor to implement the steps of the above method for locating faults in a mobile communication network.
  • FIG. 1 is a schematic flowchart of a method for obtaining an index detection model for a mobile communication network provided by Embodiment 1 of the present application;
  • FIG. 2 is a schematic diagram of a key KPI indicator system provided in Embodiment 1 of the present application.
  • FIG. 3 is a schematic flowchart of a method for locating a fault in a mobile communication network according to Embodiment 1 of the present application;
  • FIG. 4 is a schematic diagram of a process of locating a root cause node of a fault according to Embodiment 1 of the present application;
  • FIG. 5 is a schematic structural diagram of an apparatus for obtaining an index detection model for a mobile communication network according to Embodiment 2 of the present application;
  • FIG. 6 is a schematic structural diagram of a fault locating apparatus for a mobile communication network according to Embodiment 2 of the present application.
  • FIG. 7 is a schematic diagram of an index detection model acquisition and fault location process provided in Embodiment 2 of the present application.
  • FIG. 8 is a schematic diagram of probe deployment according to Embodiment 2 of the present application.
  • FIG. 9 is a schematic diagram of topology networking according to Embodiment 2 of the present application.
  • FIG. 10-1 is a schematic diagram of a characteristic parameter matrix of key KPI indicators of a cell according to Embodiment 2 of the present application;
  • 10-2 is a schematic diagram of a characteristic parameter matrix of key KPI indicators of a network element according to Embodiment 2 of the present application;
  • FIG. 11 is a schematic diagram of a link provided in Embodiment 2 of the present application.
  • FIG. 12 is a schematic structural diagram of an electronic device according to Embodiment 3 of the present application.
  • the embodiments of the present application mainly provide a method, device, equipment, and storage medium for obtaining an index detection model and locating faults, which solve the problem that in the related art, the accuracy of fault locating in mobile communication networks is poor, resulting in low fault repair efficiency and high cost, which in turn affects the network. performance issue.
  • the method, device, device, and storage medium for acquiring an indicator detection model and locating faults collect key KPI indicators and extract characteristic parameters for each leaf node in a mobile communication network through a preset key KPI indicator system. , and the extracted feature parameters are used as the input of the preset model to train the model to obtain the index detection model, and then the index detection model can be used to detect abnormal indexes of each leaf node in the mobile communication network, and can be combined with the leaves with abnormal indexes.
  • the historical status of the node can accurately determine whether the leaf node is a key fault node.
  • the threshold-based indicator detection and the fault location based on the experience of the operation and maintenance personnel in the related technology, it no longer relies on the threshold threshold to divide the abnormality and the experience of the operation and maintenance personnel. , which can effectively avoid false alarms and missed alarms, accurately detect leaf nodes with abnormal indicators, and determine whether the node is a key fault node based on the historical status of the leaf node, which greatly reduces the alarm range and improves the accuracy of fault location. This can improve the efficiency of subsequent fault repair, reduce repair costs, and improve network performance.
  • the KPI threshold is used in related technologies to determine the abnormality of the indicators, which leads to misjudgment of the indicators falling within the gray interval of the threshold edge, which leads to a large number of false alarms and false alarms, and it is difficult to truly reflect the network quality; and a large number of alarm information It will interfere with the operation and maintenance personnel to quickly and accurately determine the fault and analyze the cause, resulting in low fault repair efficiency and high cost, which will affect the network performance.
  • This embodiment first provides a method for obtaining an index detection model that can be used in a mobile communication network.
  • the index detection model obtained by this method can accurately detect whether the key KPI indicators of each leaf node in the mobile communication network are abnormal, and no longer rely on the threshold threshold to divide the abnormality and the experience of operation and maintenance personnel, and can effectively Avoid false alarms and missed alarms.
  • the method for obtaining an index detection model for a mobile communication network may include, but is not limited to:
  • S101 Acquire key KPI indicators of each leaf node in the mobile communication network according to a preset key KPI indicator system, where the key KPI indicator system includes at least two types of key KPI indicators.
  • the key KPI indicator types included in the key KPI indicator system in this embodiment may be flexibly set according to specific application scenarios and requirements.
  • VOLTE Vehicle over Long-Term Evolution, long-term evolution voice bearer
  • the key KPI indicator system is shown in Figure 2, which can include but not limited to at least two of the following types of key KPI indicators.
  • Flexible selection, and other types of key KPI indicators can also be expanded or replaced according to needs:
  • eSRVCC Enhanced Single Radio Voice Call Continuity, enhanced single wireless voice call continuity
  • the key KPI indicators in the same time period of each leaf node may be collected.
  • key KPI indicators of each leaf node in different time periods may also be collected.
  • the types of the key KPI indicators collected for each leaf node may be the same.
  • the collection may be performed periodically, for example, the collection period (that is, the collection granularity) may be set in hours, days, or months. ; Of course, optionally, in some application examples, periodic collection may not be used for collection.
  • registration signaling, call signaling, call drop signaling, eSRVCC handover signaling, voice quality signaling, etc. may be collected at corresponding locations in the mobile communication network, but not limited to.
  • the key KPI indicators of each leaf node are obtained, and the collected signaling may include but not limited to time information, user information, behavior information, location information, and key KPI indicators.
  • the above various key KPI indicators of the leaf node are aggregated to obtain a set of various KPI indicators above each leaf node.
  • the feature parameters extracted in this embodiment may include, but are not limited to, at least two of the following, which may be flexibly selected according to requirements, and may also be flexibly expanded or replaced according to requirements:
  • the mean value of key KPI indicators The mean value of key KPI indicators, the standard deviation of key KPI indicators, the minimum value of key KPI indicators, the maximum value of key KPI indicators, the quarter point of key KPI indicators, the median value of key KPI indicators, the The third-quarter point, the standard deviation mean of key KPI indicators, the variance mean of key KPI indicators, the Chebyshev statistical characteristics of key KPI indicators, the total variation of key KPI indicators, and the coefficient of variation of key KPI indicators.
  • S103 Use the characteristic parameters of the same type of key KPI indicators of each leaf node as the input of the preset model to perform model training, and obtain an indicator detection model corresponding to each type of key KPI indicators.
  • the characteristic parameters of each of the same type of key KPI indicators for the 80 leaf nodes are used as pre-
  • the input of the model is set for model training, so as to obtain the index detection model of the nine types of key KPI indicators in the above example.
  • this indicator detection model it is possible to detect whether the key KPI indicators of the corresponding type on the leaf nodes are abnormal.
  • the preset model in this embodiment may be flexibly selected according to specific application scenarios.
  • the preset model may be, but not limited to, a random forest model.
  • the characteristic parameters of each type of key KPI indicators after the aggregation of each leaf node are extracted include:
  • the size of the window and the second time window can be flexibly set according to specific application requirements.
  • the feature parameters of the same type of key KPI indicators of each leaf node are used as the input of the preset model for model training, including:
  • the feature parameters of the same type of key KPI indicators of each leaf node are converted into time series as the input of the random forest model, and the random forest model is trained to obtain the corresponding indicator detection model.
  • This embodiment also provides a fault location method for a mobile communication network, as shown in FIG. 3 , which includes but is not limited to:
  • the index detection of each leaf node in the mobile communication network may include but is not limited to:
  • the time series sequence of each leaf node is predicted through the index detection model, and when the difference between the predicted value and the actual value (that is, the actual value of the time series sequence of the leaf node) is greater than the set threshold, it is determined that the target leaf node corresponds to the index detection
  • the key KPI indicators of the model are abnormal.
  • the leaf node when indicators are detected for each leaf node, it is possible to detect whether the above-mentioned various key KPI indicators of the leaf node are normal, so as to determine whether various key KPI indicators are normal, and when there is a certain type of KPI indicators When abnormal, the leaf node can be considered as the target leaf node.
  • S302 For the target leaf node whose detection result is that the key KPI indicator is abnormal, determine whether the target leaf node is a fault critical node in combination with the historical state of the target leaf node. For example, this step may include, but is not limited to:
  • the historical state of the leaf node includes the historical state of the key KPI indicators determined to be abnormal in the above S301; and the retroactive historical time value can be flexibly set according to specific application scenarios, for example, it can be set to the previous several hours, several days Or a month or a few months and so on.
  • the fluctuation contribution of each target leaf node determined as the critical node of the failure can also be calculated, and according to the calculated result, the contribution of each target leaf node determined as the critical node of the failure
  • the root cause node of the fault is determined in the target leaf node, thereby further improving the accuracy of fault location.
  • the fluctuation contribution degree is a value that reflects the degree of influence on other leaf nodes connected to the target leaf node when the target leaf node fails.
  • the location process of the fault root cause node in the above example is shown in FIG. 4, including but not limited to:
  • S401 Acquire the fluctuation contribution degree of each target leaf node determined as the fault critical node.
  • S402 Determine the target leaf node with the largest fluctuation contribution as the fault root cause node.
  • the index detection model may further include but not limited to:
  • the topology structure of the mobile communication network is self-learned, so as to obtain the topology structure of the mobile communication network, and each network entity in the obtained topology structure is used as a leaf node .
  • the method for acquiring the indicator detection model and the method for locating faults adopts the learning method based on the historical characteristics (ie, characteristic parameters) of multi-dimensional KPI indicators.
  • the root cause node of the fault can be determined by calculating the maximum fluctuation contribution, which solves the problem of the ripple effect of the index fluctuation, thereby filtering out the error caused by the occasional index fluctuation. alarm, greatly reducing the scope of the alarm. In this way, the most comprehensive fault root cause node can be realized with the combination of as few dimensions and their attribute values as possible.
  • Embodiment 2 is a diagrammatic representation of Embodiment 1:
  • This embodiment also provides a device for obtaining an index detection model for a mobile communication network, as shown in FIG. 5 , which includes but is not limited to:
  • the acquisition module 501 is configured to acquire the key KPI indicators of each leaf node in the mobile communication network according to the preset key KPI indicator system, the key KPI indicator system includes at least two types of key KPI indicators, and the specific acquisition process is shown in the above embodiment. , and will not be repeated here.
  • the training module 502 is used to aggregate the obtained key KPI indicators of each leaf node by category, extract the characteristic parameters of each type of key KPI indicators after the aggregation of each leaf node, and separate the same type of key KPI indicators of each leaf node respectively.
  • the characteristic parameters of KPI indicators are used as the input of the preset model for model training, and the indicator detection model corresponding to each type of key KPI indicators is obtained.
  • the specific training process is shown in the above-mentioned embodiment, and details are not described herein again.
  • the functions of the acquisition module 501 and the training module 502 in this embodiment may be implemented by, but not limited to, a processor.
  • This embodiment also provides a fault location device for a mobile communication network, as shown in FIG. 6 , which includes but is not limited to:
  • the anomaly detection module 601 is configured to perform index detection on each leaf node in the mobile communication network by using the index detection model obtained by the above-described index detection model acquisition method for the mobile communication network.
  • the specific detection process is shown in the above examples, and details are not repeated here.
  • the fault location module 602 is used to determine whether the target leaf node is a fault key node for the target leaf node for which the detection result is that the key KPI index is abnormal, combined with the historical state of the target leaf node; the specific determination process is shown in the above embodiment. It is not repeated here.
  • the fault locating apparatus further includes a topology networking self-learning module 603, configured to perform self-learning on the topology structure of the mobile communication network according to the acquired user plane signaling and control plane signaling, thereby The topology structure of the mobile communication network is acquired, and each network entity in the acquired topology structure is used as a leaf node. In this way, when there is a networking change in the network, the latest networking structure can be obtained in real time without relying on a third-party data interface.
  • an application scenario of a mobile communication network based on a VoLTE network is used as an example for description.
  • the process of acquiring an indicator detection model for a mobile communication network and locating a fault is shown in Figure 7, including but not limited to network indicator collection S701, raw data cleaning S702, multi-dimensional indicator aggregation S703, and key KPI indicators Feature parameter extraction S704, model training S705, index anomaly identification S706, and fault root cause location S707 to complete the process of anomaly identification and root cause location.
  • Hard probes can be deployed on the Mw port between the I/M-CSCF and SBC/P-CSCF to collect registration signaling, call signaling, and dropped calls.
  • Class signaling the Rx port deployed on the P/SBC and PCRF collects call drop signaling;
  • the Sv port deployed between the MME and eMSC network elements collects eSRVCC handover signaling; the s1-u deployed between the cell and the SGW network element
  • the port collects voice quality signaling.
  • the collected signaling includes time information, user information, behavior information, location information, and key KPI indicators.
  • the key KPI system in this application example is shown in Figure 2.
  • An example of raw data cleaning may include: cleaning, stripping, and normalizing valid information in each collected signaling information, and standardizing and normalizing the data.
  • the on-site mobile communication network topology structure is self-learned.
  • An example of self-learning The schematic diagram of the topology networking is shown in Figure 9, and the cell Ci and each network element on each core network side in the figure are used as leaf nodes.
  • the multi-dimensional index aggregation is to aggregate the indicators of the cell dimension and each network element dimension for the cleaned data, and collect statistics at a set time granularity (for example, hours).
  • the feature parameter extraction of key KPI indicators includes the mean value of key KPI indicators, the standard deviation of key KPI indicators, the minimum value of key KPI indicators, the maximum value of key KPI indicators, and a quarter of key KPI indicators Quantile, median value of key KPI indicators, quartiles of key KPI indicators, standard deviation mean of key KPI indicators, mean variance of key KPI indicators, Chebyshev statistical characteristics of key KPI indicators, key KPIs The total variation of indicators, the coefficient of variation of key KPI indicators, and the characteristic parameters are converted into matrices, as shown in Figure 10-1 and Figure 10-2.
  • the first column of the matrix shown in Figure 10-1 is each cell as a leaf node, that is, the characteristic parameters of a certain type of key KPI indicators of the network entity on the wireless network side.
  • kpi01t1 to ci1 corresponding to the first row kpi01tn corresponds to the mean value of key KPI indicators, the standard deviation of key KPI indicators, the minimum value of key KPI indicators, the maximum value of key KPI indicators, and the quarter quantile of key KPI indicators of a certain type of key KPI indicators of ci1 point, the median value of key KPI indicators, the third-quarter point of key KPI indicators, the standard deviation mean of key KPI indicators, the variance mean of key KPI indicators, the Chebyshev statistical characteristics of key KPI indicators, the average value of key KPI indicators Total variation, coefficient of variation of key KPI indicators.
  • the first column of the matrix shown in Figure 10-2 is each network element serving as a leaf node, that is, a network entity on the core network side.
  • the model training in this application example includes converting the feature parameters shown in Figure 10-1 and Figure 10-2 into time series, which are used as the input of the random forest model for model training to obtain an indicator detection model.
  • the obtained indicator detection model can be used to predict the key KPI time series of each leaf node.
  • the indicator is determined to be an abnormal indicator. Determine the corresponding leaf node as the target leaf node. It is assumed here that the key KPI indicators of 30 cells are abnormal, and the key KPI indicators of 7 network elements are abnormal.
  • the fault root cause location in this application example may include: acquiring the historical status information of the target leaf node (that is, the above-mentioned 30 cells and 7 network elements), if the historical status is normal, the target leaf node is determined to be a key fault node; If the historical state is abnormal, the target leaf node is determined not to be the critical node of the failure. Suppose that after the above determination, it is found that 10 cells among the above 30 cells are fault critical nodes, and 3 network elements among the above 7 network elements are fault critical nodes.
  • FIG. 11 A schematic diagram of a link structure is shown in Figure 11.
  • the uplink packet loss rate indicator wave of the MME numbered 14XX87 is abnormal, and its fluctuation contribution is 0.86 (for example, if the number of other leaf nodes connected to the MME is 100, the abnormal packet loss rate indicator of the MME will affect the 86, the dynamic contribution of the MME can be determined to be 0.86; of course, other calculation methods can also be used, as long as the dynamic contribution defined in this embodiment can be accurately obtained), which is the maximum fluctuation contribution in the group , so it is determined that the root is the uplink packet loss rate indicator of the MME numbered 14XX87.
  • This embodiment also provides an electronic device, as shown in FIG. 12, including a processor 1201, a memory 1202, and a communication bus 1203 connecting the processor 1201 and the memory 1202;
  • the memory 1202 stores a first computer program, and the first computer program can be executed by the processor 1201 to implement the steps of the method for obtaining an index detection model for a mobile communication network as shown in the above embodiments;
  • the memory 1202 stores a second computer program, and the second computer program can be executed by the processor 1201 to implement the steps of the mobile communication network fault location method shown in the above embodiments.
  • This embodiment also provides a computer-readable storage medium, where the computer-readable storage medium stores a first computer program, and the first computer program can be executed by a processor, so as to implement the mobile communication as shown in the above embodiments
  • the computer-readable storage medium stores a first computer program, and the first computer program can be executed by the processor to implement the steps in the mobile communication network fault location method shown in the above embodiments.
  • the computer-readable storage medium in this embodiment includes volatile or non-volatile implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, computer program modules or other data , removable or non-removable media.
  • Computer-readable storage media include but are not limited to RAM (Random Access Memory, random access memory), ROM (Read-Only Memory, read-only memory), EEPROM (Electrically Erasable Programmable read only memory, electrically erasable programmable read only memory) ), flash memory or other memory technology, CD-ROM (Compact Disc Read-Only Memory), digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, magnetic tape, magnetic disk storage or other magnetic storage devices, Or any other medium that can be used to store the desired information and that can be accessed by a computer.
  • RAM Random Access Memory
  • ROM Read-Only Memory
  • EEPROM Electrically Erasable Programmable read only memory
  • flash memory or other memory technology
  • CD-ROM Compact Disc Read-Only Memory
  • This embodiment also provides a first computer program (or referred to as first computer software), the first computer program can be distributed on a computer-readable medium and executed by a computable device, so as to realize the above-mentioned and in some cases, at least one of the steps shown or described may be performed in an order different from that described in the above embodiments.
  • first computer program or referred to as first computer software
  • This embodiment also provides a second computer program (or called second computer software), the second computer program can be distributed on a computer-readable medium and executed by a computer-readable device, so as to realize the above-mentioned mobile communication and, in some cases, at least one of the steps shown or described may be performed in an order different from that described in the above embodiments.
  • second computer software or called second computer software
  • This embodiment also provides a computer program product, including a computer-readable device, on which any of the computer programs shown above are stored.
  • the computer-readable device may include the computer-readable storage medium as described above.
  • the functional modules/units in the system, and the device can be implemented as software (which can be implemented by computer program codes executable by a computing device). ), firmware, hardware, and their appropriate combination.
  • the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be composed of several physical components Components execute cooperatively.
  • Some or all physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit .
  • communication media typically embodies computer readable instructions, data structures, computer program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and can include any information delivery, as is well known to those of ordinary skill in the art medium. Therefore, the present application is not limited to any particular combination of hardware and software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Game Theory and Decision Science (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

Provided are an indicator detection model acquisition method and apparatus, a fault locating method and apparatus, and a device and a storage medium. The indicator detection model acquisition method in the present application comprises: according to a preset KPI system, acquiring KPIs of leaf nodes in a mobile communication network, wherein the KPI system comprises at least two types of KPIs; after the KPIs of the leaf nodes are respectively aggregated by type, extracting a feature parameter of each type of KPI after the aggregation of the leaf nodes; and respectively taking the feature parameter of the same type of KPI of the leaf nodes as an input of a preset model to perform model training, so as to obtain an indicator detection model corresponding to each type of KPI.

Description

指标检测模型获取及故障定位方法、装置、设备及存储介质Index detection model acquisition and fault location method, device, equipment and storage medium
交叉引用cross reference
本申请基于申请号为“202010783870.6”、申请日为2020年08月06日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此以引入方式并入本申请。This application is based on the Chinese patent application with the application number "202010783870.6" and the application date is August 6, 2020, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is hereby incorporated by reference. Apply.
技术领域technical field
本申请实施例涉及半导体器件失效分析领域,尤其涉及一种指标检测模型获取及故障定位方法、装置、设备及存储介质。Embodiments of the present application relate to the field of failure analysis of semiconductor devices, and in particular, to a method, apparatus, device, and storage medium for acquiring an index detection model and locating faults.
背景技术Background technique
在移动运营商IT运维领域,多维度多指标的故障根因节点定位一直是智能化运维(AIOPS)的核心问题。当某个网元节点关键性能指标出现异常时,运维人员需要快速、准确的定位到引发故障的原因,从而开展修复止损工作。但由于移动组网的复杂性和指标的多样性,导致故障根因节点的排查困难、耗时时间长,运维人员希望系统给出的故障根因节点结果尽可能的简洁。即以最少的维度和属性值覆盖故障原因,以便运维人员能够快速核实并修复。In the field of IT operation and maintenance of mobile operators, multi-dimensional and multi-index fault root cause node location has always been the core issue of intelligent operation and maintenance (AIOPS). When the key performance indicators of a certain network element node are abnormal, the operation and maintenance personnel need to quickly and accurately locate the cause of the failure, so as to carry out repair and stop loss work. However, due to the complexity of mobile networking and the diversity of indicators, it is difficult and time-consuming to troubleshoot root cause nodes. The operation and maintenance personnel hope that the results of root cause nodes given by the system are as concise as possible. That is, the cause of the fault is covered with the least dimension and attribute values, so that the operation and maintenance personnel can quickly verify and fix it.
目前移动运营商已开始对网络中的关键KPI性能指标进行监测,并使用KPI阈值对其进行指标异常进行判定。这种方法的会导致落在阈值边缘灰色区间内的指标误判,进而导致大量的虚警和误警,很难真实反映网络质量;而大量的告警信息会干扰运维人员快速、准确的判定故障及分析原因,造成故障修复效率低、成本高,进而影响网络性能。At present, mobile operators have begun to monitor key KPI performance indicators in the network, and use KPI thresholds to determine whether the indicators are abnormal. This method will lead to misjudgment of indicators that fall within the gray interval of the threshold edge, which will lead to a large number of false alarms and false alarms, and it is difficult to truly reflect the network quality; and a large number of alarm information will interfere with the rapid and accurate judgment of operation and maintenance personnel. Failure and analysis of the cause result in low efficiency and high cost of fault repair, which in turn affects network performance.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供了一种用于移动通信网络的指标检测模型获取方法,包括:根据预设的关键KPI指标体系,获取移动通信网络中各叶子节点的关键KPI 指标,关键KPI指标体系中包括至少两类关键KPI指标;将各叶子节点的关键KPI指标分别按类进行聚合后,提取各叶子节点聚合后的每一类关键KPI指标的特征参数;分别将各叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练,得到每一类关键KPI指标对应的指标检测模型。An embodiment of the present application provides a method for obtaining an index detection model for a mobile communication network, including: obtaining key KPI indexes of each leaf node in a mobile communication network according to a preset key KPI index system, where the key KPI index system includes: At least two types of key KPI indicators; after aggregating the key KPI indicators of each leaf node by category, extract the characteristic parameters of each type of key KPI indicators aggregated by each leaf node; The characteristic parameters of the model are used as the input of the preset model for model training, and the index detection model corresponding to each type of key KPI index is obtained.
本申请实施例还提供了一种移动通信网络的故障定位方法,包括:采用通过如上的用于移动通信网络的指标检测模型获取方法所获取的指标检测模型,对移动通信网络中的各叶子节点进行指标检测;对于检测结果为关键KPI指标存在异常的目标叶子节点,结合该目标叶子节点的历史状态,确定该目标叶子节点是否为故障关键节点。The embodiment of the present application also provides a fault location method for a mobile communication network, including: using the index detection model obtained by the above method for obtaining an index detection model for a mobile communication network, to each leaf node in the mobile communication network Perform index detection; for the target leaf node whose detection result is that the key KPI index is abnormal, it is determined whether the target leaf node is a fault critical node based on the historical status of the target leaf node.
本申请实施例还提供了一种用于移动通信网络的指标检测模型获取装置,包括:获取模块,用于根据预设的关键KPI指标体系,获取移动通信网络中各叶子节点的关键KPI指标,关键KPI指标体系中包括至少两类关键KPI指标;训练模块,用于将各叶子节点的关键KPI指标分别按类进行聚合后,提取各叶子节点聚合后的每一类关键KPI指标的特征参数,并分别将各叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练,得到每一类关键KPI指标对应的指标检测模型。The embodiment of the present application also provides a device for obtaining an index detection model for a mobile communication network, including: an obtaining module, configured to obtain the key KPI index of each leaf node in the mobile communication network according to a preset key KPI index system, The key KPI indicator system includes at least two types of key KPI indicators; the training module is used to aggregate the key KPI indicators of each leaf node by category, and then extract the characteristic parameters of each type of key KPI indicators aggregated by each leaf node, And the feature parameters of the same type of key KPI indicators of each leaf node are used as the input of the preset model for model training, and the indicator detection model corresponding to each type of key KPI indicators is obtained.
本申请实施例还提供了一种移动通信网络的故障定位装置,包括:异常检测模块,用于采用通过如上的用于移动通信网络的指标检测模型获取方法所获取的指标检测模型,对移动通信网络中的各叶子节点进行指标检测;故障定位模块,用于对于检测结果为关键KPI指标存在异常的目标叶子节点,结合该目标叶子节点的历史状态,确定该目标叶子节点是否故障关键节点。The embodiment of the present application further provides a fault location device for a mobile communication network, including: an abnormality detection module, configured to use the index detection model obtained by the above-mentioned method for obtaining an index detection model for a mobile communication network to Each leaf node in the network performs index detection; the fault location module is used to determine whether the target leaf node is a fault critical node based on the historical status of the target leaf node for which the detection result is an abnormal key KPI index.
本申请实施例还提供了一种电子设备,包括处理器、存储器和连接处理器和存储器的通信总线;存储器存储有第一计算机程序,第一计算机程序可被处理器执行,以实现如上的用于移动通信网络的指标检测模型获取方法的步骤;和/或,存储器存储有第二计算机程序,第二计算机程序可被处理器执行,以实现如上的移动通信网络的故障定位方法的步骤。Embodiments of the present application also provide an electronic device, including a processor, a memory, and a communication bus connecting the processor and the memory; the memory stores a first computer program, and the first computer program can be executed by the processor to realize the above-mentioned functions and/or, the memory stores a second computer program, and the second computer program can be executed by the processor to implement the steps of the above method for locating faults in the mobile communication network.
本申请实施例还提供了一种计算机可读存储介质,计算机可读存储介质存储有第一计算机程序,第一计算机程序可被处理器执行,以实现如上的用于移动通信网络的指标检测模型获取方法的步骤;和/或,计算机可读存储介质存储 有第二计算机程序,第二计算机程序可被处理器执行,以实现如上的移动通信网络的故障定位方法的步骤。Embodiments of the present application further provide a computer-readable storage medium, where the computer-readable storage medium stores a first computer program, and the first computer program can be executed by a processor to implement the above index detection model for a mobile communication network and/or, the computer-readable storage medium stores a second computer program, and the second computer program can be executed by the processor to implement the steps of the above method for locating faults in a mobile communication network.
本申请其他特征和相应的有益效果在说明书的后面部分进行阐述说明,且应当理解,至少部分有益效果从本申请说明书中的记载变的显而易见。Other features and corresponding beneficial effects of the present application are described in later parts of the specification, and it should be understood that at least some of the beneficial effects will become apparent from the description in the specification of the present application.
附图说明Description of drawings
图1为本申请实施例一提供的用于移动通信网络的指标检测模型获取方法流程示意图;1 is a schematic flowchart of a method for obtaining an index detection model for a mobile communication network provided by Embodiment 1 of the present application;
图2为本申请实施例一提供的关键KPI指标体系示意图;2 is a schematic diagram of a key KPI indicator system provided in Embodiment 1 of the present application;
图3为本申请实施例一提供的移动通信网络的故障定位方法流程示意图;3 is a schematic flowchart of a method for locating a fault in a mobile communication network according to Embodiment 1 of the present application;
图4为本申请实施例一提供的故障根因节点的定位过程示意图;4 is a schematic diagram of a process of locating a root cause node of a fault according to Embodiment 1 of the present application;
图5为本申请实施例二提供的用于移动通信网络的指标检测模型获取装置结构示意图;5 is a schematic structural diagram of an apparatus for obtaining an index detection model for a mobile communication network according to Embodiment 2 of the present application;
图6为本申请实施例二提供的移动通信网络的故障定位装置结构示意图;FIG. 6 is a schematic structural diagram of a fault locating apparatus for a mobile communication network according to Embodiment 2 of the present application;
图7为本申请实施例二提供的指标检测模型获取以及故障定位过程示意图;FIG. 7 is a schematic diagram of an index detection model acquisition and fault location process provided in Embodiment 2 of the present application;
图8为本申请实施例二提供的探针部署示意图;FIG. 8 is a schematic diagram of probe deployment according to Embodiment 2 of the present application;
图9为本申请实施例二提供的拓扑组网示意图;FIG. 9 is a schematic diagram of topology networking according to Embodiment 2 of the present application;
图10-1为本申请实施例二提供的小区的关键KPI指标特征参数矩阵示意图;FIG. 10-1 is a schematic diagram of a characteristic parameter matrix of key KPI indicators of a cell according to Embodiment 2 of the present application;
图10-2为本申请实施例二提供的网元的关键KPI指标特征参数矩阵示意图;10-2 is a schematic diagram of a characteristic parameter matrix of key KPI indicators of a network element according to Embodiment 2 of the present application;
图11为本申请实施例二提供的链路示意图;FIG. 11 is a schematic diagram of a link provided in Embodiment 2 of the present application;
图12为本申请实施例三提供的电子设备结构示意图。FIG. 12 is a schematic structural diagram of an electronic device according to Embodiment 3 of the present application.
具体实施方式detailed description
为了使本申请的目的、技术方案及优点更加清楚明白,下面通过具体实施方式结合附图对本申请实施例作进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the embodiments of the present application will be further described in detail below through specific implementation manners in conjunction with the accompanying drawings. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.
本申请实施例主要提供的一种指标检测模型获取及故障定位方法、装置、设备及存储介质,解决相关技术中,移动通信网络故障定位准确性差,导致故障修复效率低、成本高,进而影响网络性能的问题。The embodiments of the present application mainly provide a method, device, equipment, and storage medium for obtaining an index detection model and locating faults, which solve the problem that in the related art, the accuracy of fault locating in mobile communication networks is poor, resulting in low fault repair efficiency and high cost, which in turn affects the network. performance issue.
本申请实施例提供的指标检测模型获取及故障定位方法、装置、设备及存储介质,通过预设的关键KPI指标体系对移动通信网络中的各叶子节点进行关键KPI指标的采集以及特征参数的提取,并将提取到的特征参数作为预设模型的输入进行模型训练得到指标检测模型,进而可通过指标检测模型对移动通信网络中的各叶子节点进行指标异常的检测,并可结合指标异常的叶子节点的历史状态准确的确定出该叶子节点是否为故障关键节点,相对于相关技术中基于阈值的指标检测和依赖运维人员经验的故障原因定位,不再依赖门限阈值划分异常和运维人员经验,可以有效的避免虚警和漏警,能够准确检测出指标异常的叶子节点,并可结合该叶子节点的历史状态判定该节点是否为故障关键节点,大幅缩减了告警范围,提升了故障定位精准度,进而可提升后续故障修复效率、降低修复成本,并提升网络性能。The method, device, device, and storage medium for acquiring an indicator detection model and locating faults provided by the embodiments of the present application collect key KPI indicators and extract characteristic parameters for each leaf node in a mobile communication network through a preset key KPI indicator system. , and the extracted feature parameters are used as the input of the preset model to train the model to obtain the index detection model, and then the index detection model can be used to detect abnormal indexes of each leaf node in the mobile communication network, and can be combined with the leaves with abnormal indexes. The historical status of the node can accurately determine whether the leaf node is a key fault node. Compared with the threshold-based indicator detection and the fault location based on the experience of the operation and maintenance personnel in the related technology, it no longer relies on the threshold threshold to divide the abnormality and the experience of the operation and maintenance personnel. , which can effectively avoid false alarms and missed alarms, accurately detect leaf nodes with abnormal indicators, and determine whether the node is a key fault node based on the historical status of the leaf node, which greatly reduces the alarm range and improves the accuracy of fault location. This can improve the efficiency of subsequent fault repair, reduce repair costs, and improve network performance.
实施例一:Example 1:
下面结合附图和实施实例,对本申请的具体实施方式作进一步详细描述。The specific embodiments of the present application will be described in further detail below with reference to the accompanying drawings and implementation examples.
针对相关技术中使用KPI阈值对其进行指标异常进行判定,导致落在阈值边缘灰色区间内的指标误判,进而导致大量的虚警和误警,很难真实反映网络质量;而大量的告警信息会干扰运维人员快速、准确的判定故障及分析原因,造成故障修复效率低、成本高,进而影响网络性能的问题,本实施例首先提供了一种可用于移动通信网络的指标检测模型获取方法,通过该方法获取的指标检测模型可以对移动通信网络中各叶子节点的关键KPI指标是否出现异常进行准确的检测,不再依赖于不再依赖门限阈值划分异常和运维人员经验,可以有效的避免虚警和漏警。The KPI threshold is used in related technologies to determine the abnormality of the indicators, which leads to misjudgment of the indicators falling within the gray interval of the threshold edge, which leads to a large number of false alarms and false alarms, and it is difficult to truly reflect the network quality; and a large number of alarm information It will interfere with the operation and maintenance personnel to quickly and accurately determine the fault and analyze the cause, resulting in low fault repair efficiency and high cost, which will affect the network performance. This embodiment first provides a method for obtaining an index detection model that can be used in a mobile communication network. , the index detection model obtained by this method can accurately detect whether the key KPI indicators of each leaf node in the mobile communication network are abnormal, and no longer rely on the threshold threshold to divide the abnormality and the experience of operation and maintenance personnel, and can effectively Avoid false alarms and missed alarms.
参见图1所示,本实施例提供的用于移动通信网络的指标检测模型获取方法,可包括但不限于:Referring to FIG. 1 , the method for obtaining an index detection model for a mobile communication network provided by this embodiment may include, but is not limited to:
S101:根据预设的关键KPI指标体系,获取移动通信网络中各叶子节点的关键KPI指标,关键KPI指标体系中包括至少两类关键KPI指标。S101: Acquire key KPI indicators of each leaf node in the mobile communication network according to a preset key KPI indicator system, where the key KPI indicator system includes at least two types of key KPI indicators.
应当理解的是,本实施例中的关键KPI指标体系中所包括的关键KPI指标类型,可以根据具体应用场景和需求灵活设置。例如,对于VOLTE(Voice over Long-Term Evolution,长期演进语音承载)网络,关键KPI指标体系参见图2所示,可以包括但不限于以下几类关键KPI指标中的至少两种,具体可根据需 求灵活选用,也可根据需求扩展或替换其他类型的关键KPI指标:It should be understood that the key KPI indicator types included in the key KPI indicator system in this embodiment may be flexibly set according to specific application scenarios and requirements. For example, for a VOLTE (Voice over Long-Term Evolution, long-term evolution voice bearer) network, the key KPI indicator system is shown in Figure 2, which can include but not limited to at least two of the following types of key KPI indicators. Flexible selection, and other types of key KPI indicators can also be expanded or replaced according to needs:
初始注册成功率;Initial registration success rate;
重注册成功率;Re-registration success rate;
VoLTE网络接通率;VoLTE network connection rate;
V2V呼叫建立时长;V2V call establishment duration;
VoLET掉话率;VoLET call drop rate;
eSRVCC(Enhanced Single Radio Voice Call Continuity,增强的单一无线语音呼叫连续性)切换成功率;eSRVCC (Enhanced Single Radio Voice Call Continuity, enhanced single wireless voice call continuity) handover success rate;
eSRVCC切换平均时延;eSRVCC switching average delay;
VoLET MOS3.0占比;The proportion of VoLET MOS3.0;
上行丢包率。Upstream packet loss rate.
应当理解的是,本实施例中,对移动通信网络中的各叶子节点进行关键KPI指标的采集时,可以采集各叶子节点相同时间段内的关键KPI指标。可选地,在一些应用示例中,也可采集各叶子节点不同时间段内的关键KPI指标。It should be understood that, in this embodiment, when the key KPI indicators are collected for each leaf node in the mobile communication network, the key KPI indicators in the same time period of each leaf node may be collected. Optionally, in some application examples, key KPI indicators of each leaf node in different time periods may also be collected.
应当理解的是,本实施例中,对移动通信网络中的各叶子节点进行关键KPI指标的采集时,对各叶子节点采集的关键KPI指标的类型可以相同。可选地,在一些应用示例中,也可存在对部分叶子节点所采集的关键KPI指标的类型,与其他叶子节点所采集的关键KPI指标类型不同;具体可根据需求灵活设定。It should be understood that, in this embodiment, when the key KPI indicators are collected for each leaf node in the mobile communication network, the types of the key KPI indicators collected for each leaf node may be the same. Optionally, in some application examples, there may also be types of key KPI indicators collected for some leaf nodes, which are different from the types of key KPI indicators collected by other leaf nodes; the specific types can be flexibly set according to requirements.
应当理解的是,本实施例中,对移动通信网络中的各叶子节点进行关键KPI指标的采集时,可以周期性的采集,例如可以小时、天或者月设定采集周期(也即采集粒度);当然,可选地,在一些应用示例中,可也不采用周期性的采集方式进行采集。It should be understood that, in this embodiment, when the key KPI indicators are collected for each leaf node in the mobile communication network, the collection may be performed periodically, for example, the collection period (that is, the collection granularity) may be set in hours, days, or months. ; Of course, optionally, in some application examples, periodic collection may not be used for collection.
可选地,在一些实施例中,可以通过但不限于在移动通信网络中的相应位置采集注册信令、呼叫信令、掉话类信令、eSRVCC切换信令、语音质量类信令等从而获取到各叶子节点的关键KPI指标,采集的信令中可包括但不限于时间信息、用户信息、行为信息、位置信息,以及关键KPI指标。Optionally, in some embodiments, registration signaling, call signaling, call drop signaling, eSRVCC handover signaling, voice quality signaling, etc. may be collected at corresponding locations in the mobile communication network, but not limited to. The key KPI indicators of each leaf node are obtained, and the collected signaling may include but not limited to time information, user information, behavior information, location information, and key KPI indicators.
S102:将各叶子节点的关键KPI指标分别按类进行聚合后,提取各叶子节点聚合后的每一类关键KPI指标的特征参数。S102: After aggregating the key KPI indicators of each leaf node by category, extract characteristic parameters of each category of key KPI indicators aggregated by each leaf node.
例如,对于某一叶子节点,采集了其初始注册成功率、重注册成功率、VoLTE 网络接通率、V2V呼叫建立时长、VoLET掉话率、eSRVCC切换成功率、eSRVCC切换平均时延、VoLET MOS3.0占比、上行丢包率。则将该叶子节点的以上各类关键KPI指标进行聚合,得到各叶子节点以上各类KPI指标的集合。For example, for a leaf node, the initial registration success rate, re-registration success rate, VoLTE network connection rate, V2V call establishment duration, VoLET call drop rate, eSRVCC handover success rate, eSRVCC handover average delay, VoLET MOS3 .0 ratio, uplink packet loss rate. Then, the above various key KPI indicators of the leaf node are aggregated to obtain a set of various KPI indicators above each leaf node.
然后针对该叶子节点,提取其各类关键KPI指标的特征参数。本实施例中所提取的特征参数可包括但不限于以下中的至少两种,具体可根据需求灵活选用,也可根据需求灵活扩展或替换:Then, for the leaf node, the characteristic parameters of various key KPI indicators are extracted. The feature parameters extracted in this embodiment may include, but are not limited to, at least two of the following, which may be flexibly selected according to requirements, and may also be flexibly expanded or replaced according to requirements:
关键KPI指标的均值、关键KPI指标的标准差、关键KPI指标的最小值、关键KPI指标的最大值、关键KPI指标的四分之一分位点、关键KPI指标的中值、关键KPI指标的四分之三分位点、关键KPI指标的标准差均值、关键KPI指标的方差均值、关键KPI指标的切比雪夫统计特征、关键KPI指标的总变差、关键KPI指标的变异系数。The mean value of key KPI indicators, the standard deviation of key KPI indicators, the minimum value of key KPI indicators, the maximum value of key KPI indicators, the quarter point of key KPI indicators, the median value of key KPI indicators, the The third-quarter point, the standard deviation mean of key KPI indicators, the variance mean of key KPI indicators, the Chebyshev statistical characteristics of key KPI indicators, the total variation of key KPI indicators, and the coefficient of variation of key KPI indicators.
S103:分别将各叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练,得到每一类关键KPI指标对应的指标检测模型。S103: Use the characteristic parameters of the same type of key KPI indicators of each leaf node as the input of the preset model to perform model training, and obtain an indicator detection model corresponding to each type of key KPI indicators.
例如,假设当前移动网络中存在80个叶子节点,针对每个叶子节点采集了上述示例的九类关键KPI指标,则针对该80个叶子节点的每一同一类关键KPI指标的特征参数分别作为预设模型的输入进行模型训练,从而得到以上示例的九类关键KPI指标的指标检测模型。通过该指标检测模型即可对叶子节点上对应类型的关键KPI指标是否异常进行检测。For example, assuming that there are 80 leaf nodes in the current mobile network, and the nine types of key KPI indicators in the above example are collected for each leaf node, the characteristic parameters of each of the same type of key KPI indicators for the 80 leaf nodes are used as pre- The input of the model is set for model training, so as to obtain the index detection model of the nine types of key KPI indicators in the above example. Through this indicator detection model, it is possible to detect whether the key KPI indicators of the corresponding type on the leaf nodes are abnormal.
应当理解的是,本实施例中的预设模型可以根据具体应用场景灵活选用,例如一种示例中,该预设模型可以为但不限于随机森林模型。在本示例中,提取各叶子节点聚合后的每一类关键KPI指标的特征参数包括:It should be understood that the preset model in this embodiment may be flexibly selected according to specific application scenarios. For example, in an example, the preset model may be, but not limited to, a random forest model. In this example, the characteristic parameters of each type of key KPI indicators after the aggregation of each leaf node are extracted include:
针对每一叶子节点,提取聚合后的每一类关键KPI指标落入第一时间窗内的关键KPI指标的特征参数,和落入第二时间窗内的关键KPI指标的特征参数,其中第二时间窗大于第一时间窗,例如第一时间窗可以为7天,第二时间窗可以为30天,或者第一时间窗为4小时,第二时间窗为40小时等,也即第一时间窗和第二时间窗的大小可以根据具体应用需求灵活设定。For each leaf node, extract the characteristic parameters of the aggregated key KPI indicators of each type of key KPI indicators that fall within the first time window, and the characteristic parameters of the key KPI indicators that fall within the second time window, where the second The time window is greater than the first time window, for example, the first time window can be 7 days, the second time window can be 30 days, or the first time window is 4 hours, the second time window is 40 hours, etc., that is, the first time window The size of the window and the second time window can be flexibly set according to specific application requirements.
在本示例中,分别将各叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练包括:In this example, the feature parameters of the same type of key KPI indicators of each leaf node are used as the input of the preset model for model training, including:
分别将各叶子节点的同一类关键KPI指标的特征参数转换为时序序列作为 随机森林模型的输入,对随机森林模型进行训练,得到对应的指标检测模型。The feature parameters of the same type of key KPI indicators of each leaf node are converted into time series as the input of the random forest model, and the random forest model is trained to obtain the corresponding indicator detection model.
本实施例还提供了一种移动通信网络的故障定位方法,请参见图3所示,其包括但不限于:This embodiment also provides a fault location method for a mobile communication network, as shown in FIG. 3 , which includes but is not limited to:
S301:采用通过如上所示的用于移动通信网络的指标检测模型获取方法所获取的指标检测模型,对移动通信网络中的各叶子节点进行指标检测。S301: Using the index detection model obtained by the above-described method for obtaining an index detection model for a mobile communication network, perform index detection on each leaf node in the mobile communication network.
其中,通过指标检测模型,对移动通信网络中的各叶子节点进行指标检测可包括但不限于:Wherein, through the index detection model, the index detection of each leaf node in the mobile communication network may include but is not limited to:
通过指标检测模型对每一叶子节点的时序序列进行预测,在预测值与实际值(也即叶子节点的时序序列的实际值)之差大于设定阈值时,确定该目标叶子节点对应该指标检测模型的关键KPI指标异常。The time series sequence of each leaf node is predicted through the index detection model, and when the difference between the predicted value and the actual value (that is, the actual value of the time series sequence of the leaf node) is greater than the set threshold, it is determined that the target leaf node corresponds to the index detection The key KPI indicators of the model are abnormal.
在本实施例中,对各叶子节点进行指标检测时,可以对叶子节点的上述各类关键KPI指标是否正常进行检测,以确定其各类关键KPI指标是否都正常,当存在某一类KPI指标异常时,则可认为该叶子节点为目标叶子节点。In this embodiment, when indicators are detected for each leaf node, it is possible to detect whether the above-mentioned various key KPI indicators of the leaf node are normal, so as to determine whether various key KPI indicators are normal, and when there is a certain type of KPI indicators When abnormal, the leaf node can be considered as the target leaf node.
S302:对于检测结果为关键KPI指标存在异常的目标叶子节点,结合该目标叶子节点的历史状态,确定该目标叶子节点是否为故障关键节点。例如,该步骤可包括但不限于:S302: For the target leaf node whose detection result is that the key KPI indicator is abnormal, determine whether the target leaf node is a fault critical node in combination with the historical state of the target leaf node. For example, this step may include, but is not limited to:
在目标叶子节点的历史状态为关键KPI指标正常时,确定该目标叶子节点不是故障关键节点;When the historical status of the target leaf node is that the key KPI indicators are normal, it is determined that the target leaf node is not a fault critical node;
在目标叶子节点的历史状态为关键KPI指标异常时,确定该目标叶子节点是故障关键节点;When the historical state of the target leaf node is abnormal for the key KPI indicators, it is determined that the target leaf node is the key fault node;
本实施例中叶子节点的历史状态包括上述S301中确定为异常的关键KPI指标的历史状态;且追溯的历史时间值可以根据具体应用场景灵活设定,例如可以设置为前几个小时、几天或一个月或几个月等。In this embodiment, the historical state of the leaf node includes the historical state of the key KPI indicators determined to be abnormal in the above S301; and the retroactive historical time value can be flexibly set according to specific application scenarios, for example, it can be set to the previous several hours, several days Or a month or a few months and so on.
在本实施例中,为了避免复杂组网异常波动的涟漪效应,还可计算确定为故障关键节点的各目标叶子节点的波动贡献度,并会根据计算得到的结果从确定为故障关键节点的各目标叶子节点中确定出故障根因节点,从而进一步提升故障定位的准确率。本实施例中波动贡献度为体现目标叶子节点故障时,对与其连接的其他叶子节点的影响程度的值。In this embodiment, in order to avoid the ripple effect of abnormal fluctuations in the complex networking, the fluctuation contribution of each target leaf node determined as the critical node of the failure can also be calculated, and according to the calculated result, the contribution of each target leaf node determined as the critical node of the failure The root cause node of the fault is determined in the target leaf node, thereby further improving the accuracy of fault location. In this embodiment, the fluctuation contribution degree is a value that reflects the degree of influence on other leaf nodes connected to the target leaf node when the target leaf node fails.
当S302中确定为故障关键节点的目标叶子节点有多个时,上述示例的故障根因节点的定位过程请参见图4所示,包括但不限于:When there are multiple target leaf nodes determined to be the critical fault nodes in S302, the location process of the fault root cause node in the above example is shown in FIG. 4, including but not limited to:
S401:获取确定为故障关键节点的各目标叶子节点的波动贡献度。S401: Acquire the fluctuation contribution degree of each target leaf node determined as the fault critical node.
S402:确定波动贡献度最大的目标叶子节点为故障根因节点。S402: Determine the target leaf node with the largest fluctuation contribution as the fault root cause node.
可选地,在本实施例对一些示例中,通过指标检测模型,对移动通信网络中的各叶子节点进行指标检测之前,还可包括但不限于:Optionally, in some examples in this embodiment, before the index detection is performed on each leaf node in the mobile communication network by using the index detection model, the index detection model may further include but not limited to:
根据获取到的用户面信令和控制面信令,对移动通信网络的拓扑结构进行自学习,从而获取到移动通信网络的拓扑结构,并将获取到的拓扑结构中的各网络实体作为叶子节点。According to the obtained user plane signaling and control plane signaling, the topology structure of the mobile communication network is self-learned, so as to obtain the topology structure of the mobile communication network, and each network entity in the obtained topology structure is used as a leaf node .
本实施例提供的指标检测模型获取方法和故障定位方法,相对于传统基于阈值的指标检测和依赖运维人员经验的故障原因定位,采取了基于多维度KPI指标历史特征(即特征参数)学习方式,对关键KPI指标进行识别;并可结合现场的移动通信网络的拓扑结构进行故障根因节点分析;在异常识别过程中,将叶子节点的关键KPI指标与自身的历史数据做比较,不依赖门限阈值划分异常,可以有效的避免虚警和漏警,能够检测出明显的关键KPI指标异常以及有劣化趋势的异常;在故障根因节点定位部分,结合KPI异常检测的结果及该节点的历史状态,判定该节点是否为故障关键节点,并可结合组网的拓扑结构,通过计算最大波动贡献度确定故障根因节点,解决了指标波动的涟漪效应问题,从而过滤掉偶发性指标波动引起的误告警,大幅缩减了告警范围。从而实现以尽可能少的维度及其属性值的组合表现最为全面的故障根因节点。Compared with the traditional threshold-based indicator detection and fault cause location relying on the experience of operation and maintenance personnel, the method for acquiring the indicator detection model and the method for locating faults provided in this embodiment adopts the learning method based on the historical characteristics (ie, characteristic parameters) of multi-dimensional KPI indicators. , identify key KPI indicators; and can analyze the root cause node of the fault in combination with the topology of the mobile communication network on site; in the process of abnormal identification, compare the key KPI indicators of the leaf node with its own historical data without relying on the threshold The threshold is divided into anomalies, which can effectively avoid false alarms and missed alarms, and can detect obvious anomalies of key KPI indicators and anomalies with deterioration trends; in the part of fault root cause node location, combined with the results of KPI anomaly detection and the historical status of the node , to determine whether the node is the key node of the fault, and combined with the topology of the network, the root cause node of the fault can be determined by calculating the maximum fluctuation contribution, which solves the problem of the ripple effect of the index fluctuation, thereby filtering out the error caused by the occasional index fluctuation. alarm, greatly reducing the scope of the alarm. In this way, the most comprehensive fault root cause node can be realized with the combination of as few dimensions and their attribute values as possible.
实施例二:Embodiment 2:
本实施例还提供了一种用于移动通信网络的指标检测模型获取装置,参见图5所示,其包括但不限于:This embodiment also provides a device for obtaining an index detection model for a mobile communication network, as shown in FIG. 5 , which includes but is not limited to:
获取模块501,用于根据预设的关键KPI指标体系,获取移动通信网络中各叶子节点的关键KPI指标,关键KPI指标体系中包括至少两类关键KPI指标,具体获取过程参见上述实施例所示,在此不再赘述。The acquisition module 501 is configured to acquire the key KPI indicators of each leaf node in the mobile communication network according to the preset key KPI indicator system, the key KPI indicator system includes at least two types of key KPI indicators, and the specific acquisition process is shown in the above embodiment. , and will not be repeated here.
训练模块502,用于将获取的各叶子节点的关键KPI指标分别按类进行聚合后,提取各叶子节点聚合后的每一类关键KPI指标的特征参数,并分别将各 叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练,得到每一类关键KPI指标对应的指标检测模型。具体的训练过程参见上述实施例所示,在此不再赘述。The training module 502 is used to aggregate the obtained key KPI indicators of each leaf node by category, extract the characteristic parameters of each type of key KPI indicators after the aggregation of each leaf node, and separate the same type of key KPI indicators of each leaf node respectively. The characteristic parameters of KPI indicators are used as the input of the preset model for model training, and the indicator detection model corresponding to each type of key KPI indicators is obtained. The specific training process is shown in the above-mentioned embodiment, and details are not described herein again.
本实施例中获取模块501和训练模块502的功能可通过但不限于处理器实现。The functions of the acquisition module 501 and the training module 502 in this embodiment may be implemented by, but not limited to, a processor.
本实施例还提供了一种移动通信网络的故障定位装置,参见图6所示,其包括但不限于:This embodiment also provides a fault location device for a mobile communication network, as shown in FIG. 6 , which includes but is not limited to:
异常检测模块601,用于采用通过如上所示的用于移动通信网络的指标检测模型获取方法所获取的指标检测模型,对移动通信网络中的各叶子节点进行指标检测。具体检测过程参见上述各示例所示,在此不再赘述。The anomaly detection module 601 is configured to perform index detection on each leaf node in the mobile communication network by using the index detection model obtained by the above-described index detection model acquisition method for the mobile communication network. The specific detection process is shown in the above examples, and details are not repeated here.
故障定位模块602,用于对于检测结果为关键KPI指标存在异常的目标叶子节点,结合该目标叶子节点的历史状态,确定该目标叶子节点是否故障关键节点;具体确定过程参见上述实施例所示,在此不再赘述。The fault location module 602 is used to determine whether the target leaf node is a fault key node for the target leaf node for which the detection result is that the key KPI index is abnormal, combined with the historical state of the target leaf node; the specific determination process is shown in the above embodiment. It is not repeated here.
在本实施例的一些示例中,故障定位装置还包括拓扑组网自学习模块603,用于根据获取到的用户面信令和控制面信令,对移动通信网络的拓扑结构进行自学习,从而获取到移动通信网络的拓扑结构,并将获取到的拓扑结构中的各网络实体作为叶子节点。这样当网络中存在组网变化时,不依赖第三方数据接口,即可实时获取到最新的组网结构。In some examples of this embodiment, the fault locating apparatus further includes a topology networking self-learning module 603, configured to perform self-learning on the topology structure of the mobile communication network according to the acquired user plane signaling and control plane signaling, thereby The topology structure of the mobile communication network is acquired, and each network entity in the acquired topology structure is used as a leaf node. In this way, when there is a networking change in the network, the latest networking structure can be obtained in real time without relying on a third-party data interface.
为了便于理解,本实施例下面以一种应用示例做进一步说明。For ease of understanding, this embodiment is further described below with an application example.
在本应用示例中,以基于VoLTE网络的移动通信网络这一应用场景为示例进行说明。本应用示例中的用于移动通信网络的指标检测模型获取以及故障定位过程包括请参见图7所示,包括但不限于网络指标采集S701、原始数据清洗S702、多维度指标聚集S703、关键KPI指标的特征参数提取S704、模型训练S705、指标异常识别S706、故障根因定位S707进而完成异常识别,根因定位的过程。In this application example, an application scenario of a mobile communication network based on a VoLTE network is used as an example for description. In this application example, the process of acquiring an indicator detection model for a mobile communication network and locating a fault is shown in Figure 7, including but not limited to network indicator collection S701, raw data cleaning S702, multi-dimensional indicator aggregation S703, and key KPI indicators Feature parameter extraction S704, model training S705, index anomaly identification S706, and fault root cause location S707 to complete the process of anomaly identification and root cause location.
其中,一种示例的网络指标采集方式参见图8所示,可以通过硬探针部署在I/M-CSCF和SBC/P-CSCF间的Mw口,采集注册信令、呼叫信令、掉话类信令;部署在P/SBC和PCRF的Rx口采集掉话信令;部署在MME和eMSC网元间的Sv口,采集eSRVCC切换信令;部署在小区和SGW网元间的s1-u 口采集语音质量类信令。采集的信令包含时间信息、用户信息、行为信息、位置信息,以及关键KPI指标,本应用示例中的关键KPI体系参见图2所示。An example of the network indicator collection method is shown in Figure 8. Hard probes can be deployed on the Mw port between the I/M-CSCF and SBC/P-CSCF to collect registration signaling, call signaling, and dropped calls. Class signaling; the Rx port deployed on the P/SBC and PCRF collects call drop signaling; the Sv port deployed between the MME and eMSC network elements collects eSRVCC handover signaling; the s1-u deployed between the cell and the SGW network element The port collects voice quality signaling. The collected signaling includes time information, user information, behavior information, location information, and key KPI indicators. The key KPI system in this application example is shown in Figure 2.
一种示例的原始数据清洗可包括:原始数据清洗对采集的各信令信息中的有效信息进行清洗、剥离、归整,将数据标准化、规范化。同时还需要结合VoLTE业务逻辑,对现网的拓扑结构进行实时清洗,例如根据获取到的用户面指令和控制面指令对现场的移动通信网路拓扑结构进行自学习,一种自学习到的示例的拓扑组网示意图如图9所示,图中的小区Ci以及各核心网侧的各网元则作为叶子节点。An example of raw data cleaning may include: cleaning, stripping, and normalizing valid information in each collected signaling information, and standardizing and normalizing the data. At the same time, it is also necessary to combine the VoLTE business logic to clean the topology structure of the existing network in real time. For example, according to the obtained user plane instructions and control plane instructions, the on-site mobile communication network topology structure is self-learned. An example of self-learning The schematic diagram of the topology networking is shown in Figure 9, and the cell Ci and each network element on each core network side in the figure are used as leaf nodes.
多维度指标聚集的则是对清洗后的数据,分别进行小区维度及各网元维度的指标聚集,做以设定的时间粒度(例如小时)聚集统计。The multi-dimensional index aggregation is to aggregate the indicators of the cell dimension and each network element dimension for the cleaned data, and collect statistics at a set time granularity (for example, hours).
本应用示例中,对于关键KPI指标的特征参数提取,包括关键KPI指标的均值、关键KPI指标的标准差、关键KPI指标的最小值、关键KPI指标的最大值、关键KPI指标的四分之一分位点、关键KPI指标的中值、关键KPI指标的四分之三分位点、关键KPI指标的标准差均值、关键KPI指标的方差均值、关键KPI指标的切比雪夫统计特征、关键KPI指标的总变差、关键KPI指标的变异系数,并将特征参数转换为矩阵,如图10-1和图10-2所示。图10-1所示的矩阵的第一列为作为叶子节点的各小区,也即为无线网络侧的网络实体的某一类关键KPI指标的特征参数,对于第一行的ci1对应的kpi01t1至kpi01tn则分别对应ci1的某一类关键KPI指标的关键KPI指标的均值、关键KPI指标的标准差、关键KPI指标的最小值、关键KPI指标的最大值、关键KPI指标的四分之一分位点、关键KPI指标的中值、关键KPI指标的四分之三分位点、关键KPI指标的标准差均值、关键KPI指标的方差均值、关键KPI指标的切比雪夫统计特征、关键KPI指标的总变差、关键KPI指标的变异系数。图10-2所示的矩阵的第一列为作为叶子节点的各网元,即核心网侧的网络实体。In this application example, the feature parameter extraction of key KPI indicators includes the mean value of key KPI indicators, the standard deviation of key KPI indicators, the minimum value of key KPI indicators, the maximum value of key KPI indicators, and a quarter of key KPI indicators Quantile, median value of key KPI indicators, quartiles of key KPI indicators, standard deviation mean of key KPI indicators, mean variance of key KPI indicators, Chebyshev statistical characteristics of key KPI indicators, key KPIs The total variation of indicators, the coefficient of variation of key KPI indicators, and the characteristic parameters are converted into matrices, as shown in Figure 10-1 and Figure 10-2. The first column of the matrix shown in Figure 10-1 is each cell as a leaf node, that is, the characteristic parameters of a certain type of key KPI indicators of the network entity on the wireless network side. For the kpi01t1 to ci1 corresponding to the first row kpi01tn corresponds to the mean value of key KPI indicators, the standard deviation of key KPI indicators, the minimum value of key KPI indicators, the maximum value of key KPI indicators, and the quarter quantile of key KPI indicators of a certain type of key KPI indicators of ci1 point, the median value of key KPI indicators, the third-quarter point of key KPI indicators, the standard deviation mean of key KPI indicators, the variance mean of key KPI indicators, the Chebyshev statistical characteristics of key KPI indicators, the average value of key KPI indicators Total variation, coefficient of variation of key KPI indicators. The first column of the matrix shown in Figure 10-2 is each network element serving as a leaf node, that is, a network entity on the core network side.
本应用示例中的模型训练包括将图10-1和图10-2所示的特征参数转换为时序序列,作为随机森林模型输入进行模型训练,得到指标检测模型。The model training in this application example includes converting the feature parameters shown in Figure 10-1 and Figure 10-2 into time series, which are used as the input of the random forest model for model training to obtain an indicator detection model.
本应用示例中的指标异常识别则可采用得到的指标检测模型对各叶子节点进行关键KPI时序序列进行预测,当预测结果与实际结果之差大于预设阈值时,判定该指标为异常指标,从而确定对应的叶子节点为目标叶子节点。此处假设 有30个小区的关键KPI指标异常,7个网元的关键KPI指标异常。In the indicator anomaly identification in this application example, the obtained indicator detection model can be used to predict the key KPI time series of each leaf node. When the difference between the predicted result and the actual result is greater than the preset threshold, the indicator is determined to be an abnormal indicator. Determine the corresponding leaf node as the target leaf node. It is assumed here that the key KPI indicators of 30 cells are abnormal, and the key KPI indicators of 7 network elements are abnormal.
本应用示例中的故障根因定位可包括:获取目标叶子节点(即上述30个小区和7个网元)的历史状态信息,如历史状态为正常,则该目标叶子节点判定为故障关键节点;若历史状态即为异常,则该目标叶子节点判定为不是故障关键节点。假设经过如上判定后,发现上述30个小区中存在10个小区为故障关键节点,上述7个网元中存在3个网元为故障关键节点。The fault root cause location in this application example may include: acquiring the historical status information of the target leaf node (that is, the above-mentioned 30 cells and 7 network elements), if the historical status is normal, the target leaf node is determined to be a key fault node; If the historical state is abnormal, the target leaf node is determined not to be the critical node of the failure. Suppose that after the above determination, it is found that 10 cells among the above 30 cells are fault critical nodes, and 3 network elements among the above 7 network elements are fault critical nodes.
结合全网拓扑结构,分布计算各个故障关键节点的波动贡献度,一种链路结构示意图如图11所示。编号为14XX87的MME的上行丢包率指标波异常,其波动贡献度为0.86(例如假设该与该MME连接的其他叶子节点的个数为100个,该MME丢包率指标异常会影响到其中的86个,则可确定该MME的动贡献度为0.86;当然也可采用其他的计算方式,只要能准确获取到本实施例中定义的动贡献度即可),为组内最大波动贡献度,因此判定根因为编号14XX87的MME的上行丢包率指标。Combined with the topology structure of the entire network, the fluctuation contribution of each key fault node is distributed and calculated. A schematic diagram of a link structure is shown in Figure 11. The uplink packet loss rate indicator wave of the MME numbered 14XX87 is abnormal, and its fluctuation contribution is 0.86 (for example, if the number of other leaf nodes connected to the MME is 100, the abnormal packet loss rate indicator of the MME will affect the 86, the dynamic contribution of the MME can be determined to be 0.86; of course, other calculation methods can also be used, as long as the dynamic contribution defined in this embodiment can be accurately obtained), which is the maximum fluctuation contribution in the group , so it is determined that the root is the uplink packet loss rate indicator of the MME numbered 14XX87.
实施例三:Embodiment three:
本实施例还提供了一种电子设备,请参见图12所示,包括处理器1201、存储器1202和连接处理器1201和存储器1202的通信总线1203;This embodiment also provides an electronic device, as shown in FIG. 12, including a processor 1201, a memory 1202, and a communication bus 1203 connecting the processor 1201 and the memory 1202;
存储器1202存储有第一计算机程序,第一计算机程序可被处理器1201执行,以实现如上各实施例中所示的用于移动通信网络的指标检测模型获取方法的步骤;The memory 1202 stores a first computer program, and the first computer program can be executed by the processor 1201 to implement the steps of the method for obtaining an index detection model for a mobile communication network as shown in the above embodiments;
和/或,存储器1202存储有第二计算机程序,第二计算机程序可被处理器1201执行,以实现如上各实施例中所示的移动通信网络的故障定位方法的步骤。And/or, the memory 1202 stores a second computer program, and the second computer program can be executed by the processor 1201 to implement the steps of the mobile communication network fault location method shown in the above embodiments.
本实施例还提供了一种计算机可读存储介质,该计算机可读存储介质存储有第一计算机程序,第一计算机程序可被处理器执行,以实现如上各实施例所示的用于移动通信网络的指标检测模型获取方法中的步骤;This embodiment also provides a computer-readable storage medium, where the computer-readable storage medium stores a first computer program, and the first computer program can be executed by a processor, so as to implement the mobile communication as shown in the above embodiments The steps in the method for obtaining the index detection model of the network;
和/或,该计算机可读存储介质存储有第一计算机程序,第一计算机程序可被处理器执行,以实现如上各实施例所示的移动通信网络的故障定位方法中的步骤。And/or, the computer-readable storage medium stores a first computer program, and the first computer program can be executed by the processor to implement the steps in the mobile communication network fault location method shown in the above embodiments.
本实施例中的该计算机可读存储介质包括在用于存储信息(诸如计算机可 读指令、数据结构、计算机程序模块或其他数据)的任何方法或技术中实施的易失性或非易失性、可移除或不可移除的介质。计算机可读存储介质包括但不限于RAM(Random Access Memory,随机存取存储器),ROM(Read-Only Memory,只读存储器),EEPROM(Electrically Erasable Programmable read only memory,带电可擦可编程只读存储器)、闪存或其他存储器技术、CD-ROM(Compact Disc Read-Only Memory,光盘只读存储器),数字多功能盘(DVD)或其他光盘存储、磁盒、磁带、磁盘存储或其他磁存储装置、或者可以用于存储期望的信息并且可以被计算机访问的任何其他的介质。The computer-readable storage medium in this embodiment includes volatile or non-volatile implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, computer program modules or other data , removable or non-removable media. Computer-readable storage media include but are not limited to RAM (Random Access Memory, random access memory), ROM (Read-Only Memory, read-only memory), EEPROM (Electrically Erasable Programmable read only memory, electrically erasable programmable read only memory) ), flash memory or other memory technology, CD-ROM (Compact Disc Read-Only Memory), digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, magnetic tape, magnetic disk storage or other magnetic storage devices, Or any other medium that can be used to store the desired information and that can be accessed by a computer.
本实施例还提供了一种第一计算机程序(或称第一计算机软件),该第一计算机程序可以分布在计算机可读介质上,由可计算装置来执行,以实现如上所述的用于移动通信网络的指标检测模型获取方法;并且在某些情况下,可以采用不同于上述实施例所描述的顺序执行所示出或描述的至少一个步骤。This embodiment also provides a first computer program (or referred to as first computer software), the first computer program can be distributed on a computer-readable medium and executed by a computable device, so as to realize the above-mentioned and in some cases, at least one of the steps shown or described may be performed in an order different from that described in the above embodiments.
本实施例还提供了一种第二计算机程序(或称第二计算机软件),该第二计算机程序可以分布在计算机可读介质上,由可计算装置来执行,以实现如上所述的移动通信网络的故障定位方法;并且在某些情况下,可以采用不同于上述实施例所描述的顺序执行所示出或描述的至少一个步骤。This embodiment also provides a second computer program (or called second computer software), the second computer program can be distributed on a computer-readable medium and executed by a computer-readable device, so as to realize the above-mentioned mobile communication and, in some cases, at least one of the steps shown or described may be performed in an order different from that described in the above embodiments.
本实施例还提供了一种计算机程序产品,包括计算机可读装置,该计算机可读装置上存储有如上所示的任一计算机程序。本实施例中该计算机可读装置可包括如上所示的计算机可读存储介质。This embodiment also provides a computer program product, including a computer-readable device, on which any of the computer programs shown above are stored. In this embodiment, the computer-readable device may include the computer-readable storage medium as described above.
可见,本领域的技术人员应该明白,上文中所公开方法中的全部或某些步骤、系统、装置中的功能模块/单元可以被实施为软件(可以用计算装置可执行的计算机程序代码来实现)、固件、硬件及其适当的组合。在硬件实施方式中,在以上描述中提及的功能模块/单元之间的划分不一定对应于物理组件的划分;例如,一个物理组件可以具有多个功能,或者一个功能或步骤可以由若干物理组件合作执行。某些物理组件或所有物理组件可以被实施为由处理器,如中央处理器、数字信号处理器或微处理器执行的软件,或者被实施为硬件,或者被实施为集成电路,如专用集成电路。It can be seen that those skilled in the art should understand that all or some of the steps in the methods disclosed above, the functional modules/units in the system, and the device can be implemented as software (which can be implemented by computer program codes executable by a computing device). ), firmware, hardware, and their appropriate combination. In a hardware implementation, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be composed of several physical components Components execute cooperatively. Some or all physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit .
此外,本领域普通技术人员公知的是,通信介质通常包含计算机可读指令、数据结构、计算机程序模块或者诸如载波或其他传输机制之类的调制数据信号 中的其他数据,并且可包括任何信息递送介质。所以,本申请不限制于任何特定的硬件和软件结合。In addition, communication media typically embodies computer readable instructions, data structures, computer program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and can include any information delivery, as is well known to those of ordinary skill in the art medium. Therefore, the present application is not limited to any particular combination of hardware and software.
以上内容是结合具体的实施方式对本申请实施例所作的进一步详细说明,不能认定本申请的具体实施只局限于这些说明。对于本申请所属技术领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干简单推演或替换,都应当视为属于本申请的保护范围。The above content is a further detailed description of the embodiments of the present application in conjunction with specific implementations, and it cannot be considered that the specific implementation of the present application is limited to these descriptions. For those of ordinary skill in the technical field of the present application, without departing from the concept of the present application, some simple deductions or substitutions can be made, which should be regarded as belonging to the protection scope of the present application.

Claims (13)

  1. 一种用于移动通信网络的指标检测模型获取方法,包括:A method for obtaining an index detection model for a mobile communication network, comprising:
    根据预设的关键KPI指标体系,获取所述移动通信网络中各叶子节点的关键KPI指标,所述关键KPI指标体系中包括至少两类关键KPI指标;According to the preset key KPI index system, obtain the key KPI index of each leaf node in the mobile communication network, and the key KPI index system includes at least two types of key KPI indexes;
    将各叶子节点的所述关键KPI指标分别按类进行聚合后,提取各叶子节点聚合后的每一类关键KPI指标的特征参数;After aggregating the key KPI indicators of each leaf node by category, extract the characteristic parameters of each type of key KPI indicators aggregated by each leaf node;
    分别将各叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练,得到每一类关键KPI指标对应的指标检测模型。The characteristic parameters of the same type of key KPI indicators of each leaf node are respectively used as the input of the preset model for model training, and the indicator detection model corresponding to each type of key KPI indicators is obtained.
  2. 如权利要求1所述的用于移动通信网络的指标检测模型获取方法,其中,所述预设模型为随机森林模型,所述提取各叶子节点聚合后的每一类关键KPI指标的特征参数包括:The method for obtaining an index detection model for a mobile communication network according to claim 1, wherein the preset model is a random forest model, and the extracting characteristic parameters of each type of key KPI index aggregated by each leaf node include: :
    针对每一叶子节点,提取聚合后的每一类关键KPI指标落入第一时间窗内的关键KPI指标的特征参数,和落入第二时间窗内的关键KPI指标的特征参数,所述第二时间窗大于所述第一时间窗;For each leaf node, extract the characteristic parameters of the key KPI indicators that fall within the first time window for each type of key KPI indicators after aggregation, and the characteristic parameters of the key KPI indicators that fall within the second time window. The second time window is greater than the first time window;
    所述分别将各叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练包括:The model training using the characteristic parameters of the same type of key KPI indicators of each leaf node as the input of the preset model respectively includes:
    分别将各叶子节点的同一类关键KPI指标的特征参数转换为时序序列作为所述随机森林模型的输入,对所述随机森林模型进行训练,得到指标检测模型。The characteristic parameters of the same type of key KPI indicators of each leaf node are respectively converted into time series as the input of the random forest model, and the random forest model is trained to obtain an indicator detection model.
  3. 如权利要求1或2所述的用于移动通信网络的指标检测模型获取方法,其中,所述特征参数包括以下中的至少两种:The method for obtaining an index detection model for a mobile communication network according to claim 1 or 2, wherein the characteristic parameters include at least two of the following:
    关键KPI指标的均值、关键KPI指标的标准差、关键KPI指标的最小值、关键KPI指标的最大值、关键KPI指标的四分之一分位点、关键KPI指标的中值、关键KPI指标的四分之三分位点、关键KPI指标的标准差均值、关键KPI指标的方差均值、关键KPI指标的切比雪夫统计特征、关键KPI指标的总变差、关键KPI指标的变异系数。The mean value of key KPI indicators, the standard deviation of key KPI indicators, the minimum value of key KPI indicators, the maximum value of key KPI indicators, the quarter point of key KPI indicators, the median value of key KPI indicators, the The third-quarter point, the standard deviation mean of key KPI indicators, the variance mean of key KPI indicators, the Chebyshev statistical characteristics of key KPI indicators, the total variation of key KPI indicators, and the coefficient of variation of key KPI indicators.
  4. 如权利要求1至3中任一项所述的用于移动通信网络的指标检测模型获 取方法,其中,所述关键KPI指标体系包括以下关键KPI指标中的至少两类:The method for obtaining an index detection model for a mobile communication network according to any one of claims 1 to 3, wherein the key KPI index system includes at least two types of the following key KPI indexes:
    初始注册成功率,重注册成功率,长期演进语音承载VoLTE网络接通率,V2V呼叫建立时长,VoLET掉话率,增强的单一无线语音呼叫连续性eSRVCC切换成功率,eSRVCC切换平均时延,VoLET MOS3.0占比,上行丢包率。Initial registration success rate, re-registration success rate, VoLTE network connection rate over long-term evolution voice, V2V call setup duration, VoLET call drop rate, enhanced single wireless voice call continuity eSRVCC handover success rate, eSRVCC handover average delay, VoLET MOS3.0 proportion, uplink packet loss rate.
  5. 一种移动通信网络的故障定位方法,包括:A fault location method for a mobile communication network, comprising:
    采用通过如权利要求1至4中任一项所述的用于移动通信网络的指标检测模型获取方法所获取的指标检测模型,对所述移动通信网络中的各叶子节点进行指标检测;Using the index detection model obtained by the method for obtaining an index detection model for a mobile communication network according to any one of claims 1 to 4, index detection is performed on each leaf node in the mobile communication network;
    对于检测结果为关键KPI指标存在异常的目标叶子节点,结合该目标叶子节点的历史状态,确定该目标叶子节点是否为故障关键节点。For the target leaf node whose detection result is that the key KPI index is abnormal, it is determined whether the target leaf node is a fault critical node based on the historical status of the target leaf node.
  6. 如权利要求5所述的移动通信网络的故障定位方法,其中,包括:所述结合该目标叶子节点的历史状态,确定该目标叶子节点是否为故障关键节点包括:The method for locating faults in a mobile communication network according to claim 5, comprising: determining whether the target leaf node is a fault critical node in combination with the historical state of the target leaf node comprising:
    在所述目标叶子节点的历史状态为关键KPI指标正常时,确定该目标叶子节点不是故障关键节点;When the historical state of the target leaf node is that the key KPI indicator is normal, determine that the target leaf node is not a fault critical node;
    在所述目标叶子节点的历史状态为关键KPI指标异常时,确定该目标叶子节点是故障关键节点。When the historical state of the target leaf node is that the key KPI index is abnormal, it is determined that the target leaf node is a fault critical node.
  7. 如权利要求5或6所述的移动通信网络的故障定位方法,其中,通过所述指标检测模型,对所述移动通信网络中的各叶子节点进行指标检测包括:The method for locating faults in a mobile communication network according to claim 5 or 6, wherein, using the index detection model, performing index detection on each leaf node in the mobile communication network comprises:
    通过所述指标检测模型对每一所述叶子节点的时序序列进行预测,在预测值与实际值之差大于设定阈值时,确定该目标叶子节点对应该指标检测模型的关键KPI指标异常。The time sequence sequence of each leaf node is predicted by the index detection model, and when the difference between the predicted value and the actual value is greater than the set threshold, it is determined that the key KPI index of the target leaf node corresponding to the index detection model is abnormal.
  8. 如权利要求5至7中任一项所述的移动通信网络的故障定位方法,其中,当确定为故障关键节点的目标叶子节点有多个时,所述方法还包括:The method for locating faults in a mobile communication network according to any one of claims 5 to 7, wherein, when there are multiple target leaf nodes determined to be critical fault nodes, the method further comprises:
    获取确定为故障关键节点的各目标叶子节点的波动贡献度,所述波动贡献 度为体现目标叶子节点故障时,对与其连接的其他叶子节点的影响程度的值;Obtain the fluctuation contribution degree of each target leaf node determined as the fault key node, and the fluctuation contribution degree is a value that reflects the degree of influence on other leaf nodes connected to it when the target leaf node fails;
    确定所述波动贡献度最大的目标叶子节点为故障根因节点。It is determined that the target leaf node with the largest fluctuation contribution is the fault root cause node.
  9. 如权利要求5至8中任一项所述的移动通信网络的故障定位方法,其中,通过所述指标检测模型,对所述移动通信网络中的各叶子节点进行指标检测之前,还包括:The method for locating faults in a mobile communication network according to any one of claims 5 to 8, wherein, using the indicator detection model, before performing indicator detection on each leaf node in the mobile communication network, the method further comprises:
    根据获取到的用户面信令和控制面信令,对所述移动通信网络的拓扑结构进行自学习,从而获取到所述移动通信网络的拓扑结构,并将获取到的拓扑结构中的各网络实体作为叶子节点。According to the obtained user plane signaling and control plane signaling, self-learning is performed on the topology of the mobile communication network, so as to obtain the topology of the mobile communication network, and each network in the obtained topology is Entities act as leaf nodes.
  10. 一种用于移动通信网络的指标检测模型获取装置,包括:A device for obtaining an index detection model for a mobile communication network, comprising:
    获取模块,用于根据预设的关键KPI指标体系,获取所述移动通信网络中各叶子节点的关键KPI指标,所述关键KPI指标体系中包括至少两类关键KPI指标;an acquisition module, configured to acquire key KPI indicators of each leaf node in the mobile communication network according to a preset key KPI indicator system, where the key KPI indicator system includes at least two types of key KPI indicators;
    训练模块,用于将各叶子节点的所述关键KPI指标分别按类进行聚合后,提取各叶子节点聚合后的每一类关键KPI指标的特征参数,并分别将各叶子节点的同一类关键KPI指标的特征参数作为预设模型的输入进行模型训练,得到每一类关键KPI指标对应的指标检测模型。The training module is used for aggregating the key KPI indicators of each leaf node by category, extracting the characteristic parameters of each type of key KPI indicators aggregated by each leaf node, and dividing the same type of key KPI indicators of each leaf node. The characteristic parameters of the indicators are used as the input of the preset model for model training, and the indicator detection model corresponding to each type of key KPI indicators is obtained.
  11. 一种移动通信网络的故障定位装置,包括:A fault location device for a mobile communication network, comprising:
    异常检测模块,用于采用通过如权利要求1至4中任一项所述的用于移动通信网络的指标检测模型获取方法所获取的指标检测模型,对所述移动通信网络中的各叶子节点进行指标检测;An anomaly detection module, configured to use the index detection model obtained by the index detection model acquisition method for a mobile communication network according to any one of claims 1 to 4, to detect each leaf node in the mobile communication network perform index detection;
    故障定位模块,用于对于检测结果为关键KPI指标存在异常的目标叶子节点,结合该目标叶子节点的历史状态,确定该目标叶子节点是否故障关键节点。The fault locating module is used to determine whether the target leaf node is a fault critical node in combination with the historical status of the target leaf node for the target leaf node whose detection result is that the key KPI index is abnormal.
  12. 一种电子设备,包括处理器、存储器和连接所述处理器和存储器的通信总线;An electronic device comprising a processor, a memory, and a communication bus connecting the processor and the memory;
    所述存储器存储有第一计算机程序,所述第一计算机程序可被所述处理器 执行,以实现如权利要求1至4中任一项所述的用于移动通信网络的指标检测模型获取方法的步骤;The memory stores a first computer program, and the first computer program can be executed by the processor to implement the method for obtaining an index detection model for a mobile communication network according to any one of claims 1 to 4 A step of;
    和/或,and / or,
    所述存储器存储有第二计算机程序,所述第二计算机程序可被所述处理器执行,以实现如权利要求5至9中任一项所述的移动通信网络的故障定位方法的步骤。The memory stores a second computer program executable by the processor to implement the steps of the method for locating a fault in a mobile communication network according to any one of claims 5 to 9.
  13. 一种计算机可读存储介质,所述计算机可读存储介质存储有第一计算机程序,所述第一计算机程序可被处理器执行,以实现如权利要求1至4中任一项所述的用于移动通信网络的指标检测模型获取方法的步骤;A computer-readable storage medium, the computer-readable storage medium stores a first computer program, the first computer program can be executed by a processor, to realize the use according to any one of claims 1 to 4. The steps of a method for obtaining an index detection model for a mobile communication network;
    和/或,and / or,
    所述计算机可读存储介质存储有第二计算机程序,所述第二计算机程序可被处理器执行,以实现如权利要求5至9中任一项所述的移动通信网络的故障定位方法的步骤。The computer-readable storage medium stores a second computer program, and the second computer program can be executed by the processor to implement the steps of the method for locating a fault in a mobile communication network according to any one of claims 5 to 9 .
PCT/CN2021/100381 2020-08-06 2021-06-16 Indicator detection model acquisition method and apparatus, fault locating method and apparatus, and device and storage medium WO2022028120A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010783870.6 2020-08-06
CN202010783870.6A CN114095965A (en) 2020-08-06 2020-08-06 Index detection model obtaining and fault positioning method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022028120A1 true WO2022028120A1 (en) 2022-02-10

Family

ID=80116956

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/100381 WO2022028120A1 (en) 2020-08-06 2021-06-16 Indicator detection model acquisition method and apparatus, fault locating method and apparatus, and device and storage medium

Country Status (2)

Country Link
CN (1) CN114095965A (en)
WO (1) WO2022028120A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115102836A (en) * 2022-07-13 2022-09-23 中国联合网络通信集团有限公司 Network equipment fault analysis method and device and storage medium
CN115277370A (en) * 2022-09-23 2022-11-01 云智慧(北京)科技有限公司 Method, device and equipment for determining root cause node of operation and maintenance system

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115796847B (en) * 2023-02-10 2023-05-09 成都秦川物联网科技股份有限公司 Intelligent gas maintenance personnel management method, internet of things system and medium
CN116996133B (en) * 2023-09-27 2023-12-05 国网江苏省电力有限公司常州供电分公司 Identity authentication and eavesdropping positioning method for power line carrier communication equipment
CN117880055B (en) * 2024-03-12 2024-05-31 灵长智能科技(杭州)有限公司 Network fault diagnosis method, device, equipment and medium based on transmission layer index

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019036129A1 (en) * 2017-08-15 2019-02-21 Cognant Llc Temporal anomaly detection system and method
CN110474786A (en) * 2018-05-10 2019-11-19 上海大唐移动通信设备有限公司 Method and device based on random forest analysis VoLTE network failure reason
CN111126824A (en) * 2019-12-19 2020-05-08 中国移动通信集团江苏有限公司 Multi-index association model training method and multi-index anomaly analysis method
CN111177505A (en) * 2019-12-31 2020-05-19 中国移动通信集团江苏有限公司 Training method, recommendation method and device of index anomaly detection model

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019036129A1 (en) * 2017-08-15 2019-02-21 Cognant Llc Temporal anomaly detection system and method
CN110474786A (en) * 2018-05-10 2019-11-19 上海大唐移动通信设备有限公司 Method and device based on random forest analysis VoLTE network failure reason
CN111126824A (en) * 2019-12-19 2020-05-08 中国移动通信集团江苏有限公司 Multi-index association model training method and multi-index anomaly analysis method
CN111177505A (en) * 2019-12-31 2020-05-19 中国移动通信集团江苏有限公司 Training method, recommendation method and device of index anomaly detection model

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HUANG, WEI: "Research on AIOps Technology Based on Machine Learning", MASTER THESIS, 1 June 2019 (2019-06-01), China, pages 1 - 91, XP009533868, DOI: 10.26944/d.cnki.gbfju.2019.000297 *
LI DONGSHENG: "Algorithm Research and Implementation on Real Time Anomaly Detection of KPI Data", CHINA MASTER’S THESES FULL-TEXT DATABASE,INFORMATION SCIENCE AND TECHNOLOGY I140-243, 15 February 2020 (2020-02-15), pages 1 - 76, XP055894102 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115102836A (en) * 2022-07-13 2022-09-23 中国联合网络通信集团有限公司 Network equipment fault analysis method and device and storage medium
CN115277370A (en) * 2022-09-23 2022-11-01 云智慧(北京)科技有限公司 Method, device and equipment for determining root cause node of operation and maintenance system
CN115277370B (en) * 2022-09-23 2023-02-03 云智慧(北京)科技有限公司 Method, device and equipment for determining root cause node of operation and maintenance system

Also Published As

Publication number Publication date
CN114095965A (en) 2022-02-25

Similar Documents

Publication Publication Date Title
WO2022028120A1 (en) Indicator detection model acquisition method and apparatus, fault locating method and apparatus, and device and storage medium
WO2022048168A1 (en) Training method and device for failure prediction neural network model
JP6690011B2 (en) System and method for measuring effective customer impact of network problems in real time using streaming analysis
EP3379419B1 (en) Situation analysis
CN111126824B (en) Multi-index correlation model training method and multi-index anomaly analysis method
US20150195154A1 (en) Creating a Knowledge Base for Alarm Management in a Communications Network
EP3843445A1 (en) Abnormality detection method and apparatus, terminal, and storage medium
CN112398680B (en) Fault delimiting method and equipment
US9015312B2 (en) Network management system and method for identifying and accessing quality of service issues within a communications network
US10158522B2 (en) Network monitor and method for event based prediction of radio network outages and their root cause
JP6097889B2 (en) Monitoring system, monitoring device, and inspection device
CN110650060A (en) Processing method, equipment and storage medium for flow alarm
CN111756560A (en) Data processing method, device and storage medium
CN112242971A (en) Flow abnormity detection method, device, network equipment and storage medium
CN114338372B (en) Network information security monitoring method and system
CN115038088B (en) Intelligent network security detection early warning system and method
US20210359899A1 (en) Managing Event Data in a Network
CN113590429A (en) Server fault diagnosis method and device and electronic equipment
CN113037562A (en) Gateway fault assessment method and device and server
WO2022057501A1 (en) Method for identifying abnormal terminal, analysis apparatus and device, and storage medium
US11777786B2 (en) Method, device and computer program product for anomaly detection and root cause analysis
CN112491635A (en) Method, system, implementation equipment and storage medium for link quality detection
CN110990223A (en) Monitoring alarm method and device based on system log
CN113626236B (en) Fault diagnosis method, device, equipment and medium for distributed file system
CN110764975A (en) Early warning method and device for equipment performance and monitoring equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21852375

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM XXXX DATED 23/06/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21852375

Country of ref document: EP

Kind code of ref document: A1