WO2023045829A1 - Service abnormality prediction method and device, storage medium, and electronic device - Google Patents

Service abnormality prediction method and device, storage medium, and electronic device Download PDF

Info

Publication number
WO2023045829A1
WO2023045829A1 PCT/CN2022/119099 CN2022119099W WO2023045829A1 WO 2023045829 A1 WO2023045829 A1 WO 2023045829A1 CN 2022119099 W CN2022119099 W CN 2022119099W WO 2023045829 A1 WO2023045829 A1 WO 2023045829A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
time
real
historical
dimensional
Prior art date
Application number
PCT/CN2022/119099
Other languages
French (fr)
Chinese (zh)
Inventor
冯媛
杨翌晨
邵敏峰
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2023045829A1 publication Critical patent/WO2023045829A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/2431Multiple classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q50/40
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring

Definitions

  • Embodiments of the present disclosure relate to the communication field, and in particular, relate to a service anomaly prediction method, device, storage medium, and electronic device.
  • KPI Key Performance Indicators
  • Embodiments of the present disclosure provide a service anomaly prediction method, device, storage medium, and electronic device to at least solve the problem in the related art that operation and maintenance personnel start to diagnose anomalies after anomalies occur, resulting in untimely anomaly processing.
  • a business abnormality prediction method including:
  • the prediction result is input into the target random forest model trained in advance based on the historical business data, and the abnormal detection result of the business within the preset time output by the target random forest model is obtained.
  • the method before performing multi-dimensional indicator aggregation on the real-time business data to obtain multi-dimensional real-time data indicators, the method further includes:
  • the method also includes:
  • the constructed initial neural network model is trained according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
  • the method before performing multi-dimensional indicator aggregation on the historical business data to obtain multi-dimensional historical data indicators, the method further includes:
  • the method also includes:
  • the constructed initial random forest model is trained according to the feature matrix to obtain the trained target random forest model.
  • determining characteristic parameters according to the data aggregation index includes:
  • the obtained parameters are composed into the characteristic parameters.
  • the real-time service data includes at least: time information, behavior information, location information and KPI;
  • the historical service data includes at least: time information, behavior information, location information and KPI.
  • a service anomaly prediction device including:
  • the collection module is configured to collect real-time business data at the current time
  • the first aggregation module is configured to aggregate the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
  • the input module is configured to input the multi-dimensional real-time data index into the target neural network model trained in advance based on the historical business data, and obtain the output of the target neural network model within the preset time period after the current time. forecast result;
  • the anomaly detection module is configured to input the prediction result into a pre-trained target random forest model based on the historical business data, and obtain the abnormal detection result of the business within the preset time output by the target random forest model.
  • the device also includes:
  • the first stripping module is configured to strip invalid data in the real-time business data to obtain valid real-time business data
  • the first cleaning module is configured to normalize the non-normalized data in the valid real-time business data to obtain cleaned real-time business data.
  • the device also includes:
  • the first extraction module is configured to extract the historical business data
  • the second focusing module is configured to aggregate the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators
  • the first training module is configured to train the constructed initial neural network model according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
  • the device also includes:
  • the second stripping module is configured to strip invalid data in the historical business data to obtain valid historical business data
  • the second cleaning module is configured to normalize the non-normalized data in the valid historical business data to obtain cleaned historical business data.
  • the device also includes:
  • the second extraction module is configured to extract data aggregation indicators with a time window of N and a time window of M from the multi-dimensional historical data indicators respectively, wherein N is not equal to M, and both N and M are integers greater than 1;
  • a determining module configured to determine characteristic parameters according to the data aggregation index
  • a conversion module configured to convert the characteristic parameters into a characteristic matrix
  • the second training module is configured to train the constructed initial random forest model according to the feature matrix to obtain the trained target random forest model.
  • the determination module is further configured to
  • the obtained parameters are composed into the characteristic parameters.
  • the real-time service data includes at least: time information, behavior information, location information and KPI;
  • the historical service data includes at least: time information, behavior information, location information and KPI.
  • a computer-readable storage medium where a computer program is stored in the storage medium, wherein the computer program is set to execute any one of the above method embodiments when running in the steps.
  • an electronic device including a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform any of the above Steps in the method examples.
  • the real-time business data at the current time is collected; the real-time business data is aggregated with multi-dimensional indicators to obtain multi-dimensional real-time data indicators; the multi-dimensional real-time data indicators are input into the pre-trained historical business data
  • the target neural network model the forecast result of the business within the preset time period after the current time output by the target neural network model is obtained; the forecast result is input into the target random forest model trained in advance based on the historical business data
  • obtaining the abnormality detection results of the business within the preset time output by the target random forest model can solve the problem in related technologies that the operation and maintenance personnel only start to diagnose the abnormality after the abnormality occurs, which leads to the untimely processing of the abnormality. Predicting possible abnormalities in the future can enable operation and maintenance personnel to intervene and deal with them in advance, reduce the risk of complaints, and improve user experience.
  • FIG. 1 is a block diagram of a hardware structure of a mobile terminal according to a business anomaly prediction method according to an embodiment of the present disclosure
  • FIG. 2 is a flow chart of a business anomaly prediction method according to an embodiment of the present disclosure
  • FIG. 3 is a flow chart of a business anomaly prediction method according to an optional embodiment of the present disclosure
  • FIG. 4 is a flowchart of a KPI network intelligent early warning according to an embodiment of the present disclosure
  • FIG. 5 is a schematic diagram of a neural network model according to an embodiment of the present disclosure.
  • FIG. 6 is a network diagram of VoLTE probe collection and deployment according to an embodiment of the present disclosure.
  • Fig. 7 is a block diagram of a service anomaly prediction device according to an embodiment of the present disclosure.
  • Fig. 1 is a block diagram of the hardware structure of the mobile terminal of the business anomaly prediction method of the embodiment of the present disclosure.
  • the mobile terminal may include one or more (only shown in Fig. 1 1)
  • Processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a communication function
  • the transmission device 106 and the input and output device 108 may be executed in mobile terminals, computer terminals or similar computing devices.
  • FIG. 1 is only for illustration, and it does not limit the structure of the above mobile terminal.
  • the mobile terminal may also include more or fewer components than those shown in FIG. 1 , or have a different configuration from that shown in FIG. 1 .
  • the memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer program corresponding to the business anomaly prediction method in the embodiment of the present disclosure, and the processor 102 executes the computer program stored in the memory 104 by running the Various functional applications and service chain address pool slicing processing realize the above-mentioned method.
  • the memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
  • the memory 104 may further include a memory that is remotely located relative to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the transmission device 106 is used to receive or transmit data via a network.
  • the specific example of the above network may include a wireless network provided by the communication provider of the mobile terminal.
  • the transmission device 106 includes a network interface controller (NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet.
  • the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
  • RF Radio Frequency
  • FIG. 2 is a flowchart of a service anomaly prediction method according to an embodiment of the present disclosure. As shown in FIG. 2 , the process includes the following step:
  • Step S202 collecting real-time business data at the current time
  • the real-time service data includes at least: time information, behavior information, location information and KPI.
  • Step S204 aggregating the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators
  • Step S206 input the multi-dimensional real-time data index into the target neural network model trained in advance based on historical business data, and obtain the forecast result of the business within the preset time period after the current time output by the target neural network model ;
  • Step S208 inputting the prediction result into the target random forest model trained in advance based on the historical business data, and obtaining the abnormal detection result of the business within the preset time output by the target random forest model.
  • the real-time business data is cleaned to obtain the cleaned real-time business data, and further, the invalid data in the real-time business data is stripped to obtain an effective real-time Business data: standardize the non-standardized data in the effective real-time business data to obtain cleaned real-time business data.
  • Fig. 3 is a flowchart of a business anomaly prediction method according to an optional embodiment of the present disclosure, as shown in Fig. 3 , including:
  • Step S302 extracting historical business data
  • the historical service data at least includes: time information, behavior information, location information and KPI, and specifically, specific data may be collected from different locations.
  • Step S304 aggregating the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators
  • Step S306 Train the constructed initial neural network model according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
  • the historical business data is cleaned to obtain the cleaned historical business data, and further, the invalid data in the historical business data is stripped to obtain valid historical data
  • Business data standardize the non-normalized data in the effective historical business data to obtain cleaned historical business data.
  • This embodiment extracts historical business data, and aggregates hourly and daily granularity indicators for historical business data in each dimension; based on multi-dimensional historical business data, extracts the time series features of historical data after collection; constructs an initial neural network model, and extracts The historical business data after the feature is used to train the initial neural network model; the initial neural network model is often used in natural language translation. After the model is modified, it is used to predict the time series. The actual results show that the accuracy of the prediction is better than Common time series forecasting algorithms.
  • the target neural network model obtained after training can be used to predict the prediction results for a period of time in the future.
  • data aggregation indexes with a time window of N and a time window of M are respectively extracted from the multi-dimensional historical data indexes, wherein, N is not equal to M, and both N and M are integers greater than 1, For example, N is 7 days and M is 60 days.
  • N is 7 days and M is 60 days.
  • the time window is a data aggregation index of 60 days; the characteristic parameters are determined according to the data aggregation index; the characteristic parameters are converted into a characteristic matrix; the initial random forest model constructed is trained according to the characteristic matrix, and the trained results are obtained.
  • the target random forest model is a data aggregation index of 60 days; the characteristic parameters are determined according to the data aggregation index; the characteristic parameters are converted into a characteristic matrix; the initial random forest model constructed is trained according to the characteristic matrix, and the trained results are obtained.
  • the target random forest model is a data aggregation index of 60 days; the characteristic parameters are determined according to the data aggregation index; the characteristic parameters are converted
  • the hourly and daily granularity index aggregation is performed on the historical business data of each dimension; based on the multi-dimensional historical business data, the historical statistical characteristic parameters after the collection are extracted, and the model training is performed on the random forest network; conventional anomaly detection requires A large number of manually labeled samples, based on hundreds of millions of samples per hour, is almost impossible to achieve in the field of operation and maintenance.
  • the embodiment of the present disclosure adopts a semi-automatic labeling method combining statistics and manual labeling to edit samples, which can greatly reduce the cost of manual labeling and improve labeling efficiency.
  • the target random forest model obtained after training is used to detect and classify abnormal indicators (normal indicators, abnormal indicators), and give prediction conclusions about indicators that may appear abnormal.
  • the above-mentioned determination of characteristic parameters according to the data aggregation index may specifically include: respectively obtaining the following parameters from the data aggregation index: mean value, standard deviation, minimum value, maximum value, quarter point, median , three-quarters point, standard deviation mean, variance mean, Chebyshev statistical features, total variation, coefficient of variation; the obtained parameters form the characteristic parameters.
  • this embodiment provides a practical implementation plan for the operation and maintenance thinking of early identification, early diagnosis and early disposal through the evolution of technical capabilities. Based on the multi-dimensional KPI historical feature learning method, the future trend of KPI is predicted; in the process of abnormal identification, this embodiment is compared with the historical data of the dimension-KPI itself, and does not rely on threshold thresholds to divide abnormalities, which can effectively avoid false positives.
  • Alarms and missed alarms can detect obvious KPI abnormalities and abnormalities with a tendency to deteriorate; in the fault early warning interface, through deep integration with operation and maintenance work, according to abnormal scenarios, build golden business indicators-operation and maintenance KPI indicators, multi-dimensional layering Early warning system, so as to filter out false alarms caused by sporadic index fluctuations, greatly reduce the scope of alarms, and improve the optimization efficiency of operation and maintenance colleagues.
  • Fig. 4 is a flowchart of a KPI network intelligent early warning according to an embodiment of the present disclosure, as shown in Fig. 4 , including:
  • the probe After the probe collects the data, it cleans the basic data, and completes the basic index aggregation and abnormal data processing in the cleaning stage.
  • the aggregated data is subjected to feature extraction through feature engineering, which includes statistical features of single-time granularity of global data and time series features of dimensional data (that is, sequence prediction features).
  • the extracted features are used as the input of the AI model, and the time series sequence is input to the prediction model to obtain the prediction result of the specified dimension.
  • the predicted historical data and the predicted results are spliced and then input into the abnormal recognition model to determine the abnormal recognition.
  • the abnormal recognition model is a random forest abnormal recognition model, which can be obtained through historical business data training.
  • the historical business data includes at least: time information, Behavior information, location information, and KPIs. KPIs specifically include statistical tags and manual analysis tags.
  • Fig. 5 is a schematic diagram of a neural network model according to an embodiment of the present disclosure.
  • the model mainly includes three parts: an encoder Encoder, a decoder Decoder and an attention model Attention.
  • the encoding and decoding network uses RNN (Recurrent Neural Network , recurrent neural network) model.
  • the time series KPI index is used as the input of the Encoder model, and the feature is encoded and converted into a feature tensor through the Encoder, which is used as the input of the Decoder.
  • Self Attention different weights are given according to the degree of influence, and the feature matrix and Encoder results are concatenated as the input of the fully connected network, and finally the prediction result is output.
  • Fig. 6 is a network diagram of VoLTE probe collection and deployment according to an embodiment of the present disclosure, as shown in Fig. 6 , including: IMS (IP Multimedia Subsystem IP, multimedia system), EPC (Evolved Packet Core, all-IP grouping core network ), E-UTRAN (Evolved UMTS Terrestrial Radio Access Network, Evolved UMTS Terrestrial Radio Access Network), GERAN (GSM EDGE Radio Access Network GSM/EDGE, wireless communication network), eMSC/GMSC (Evolved Mobile Switching Center/Gateway Mobile Switching Center, Evolved Gateway Mobile Switching Center/Gateway Mobile Switching Center), DRA (Diameter Routing Agent, routing proxy node), DNS/ENUM (Domain Name System/Telephone Number Mapping working group, domain name system/telephone number mapping working group) and Three-in-one HSS (Home Subscriber Server, belonging to the subscriber server), among which, IMS includes: BGCF (Breakout Gateway Control Function,
  • the network elements involved in this embodiment include at least I/S-CSCF, P-CSCF/SBC, SEA-GW, MME, eNB, and eMSC/GMSC.
  • Partners hip Project referred to as 3GPP protocol standard collection interface, that is, the deployment location of the data collection probe.
  • This embodiment is based on VoLTE (Voice over Long Term Evolution, long-term evolution voice bearer) mobile communication network video service anomaly prediction, including network index collection, raw data cleaning, multi-dimensional index aggregation, KPI historical feature extraction, KPI index prediction, index anomaly identification process.
  • VoLTE Voice over Long Term Evolution, long-term evolution voice bearer
  • the hard probe is deployed on the Mw interface between the I/S-CSCF and P-CSCF/SBC to collect signaling such as registration, call, and call drop; it is deployed on the P/SBC and PCRF
  • the Rx port collects call drop signaling; it is deployed on the Sv port between the MME and the eMSC network element to collect ESRVCC (Evolved Single Radio Voice Call Continuity, evolved single radio mode voice call continuity) handover signaling; it is deployed in the cell and signaling gateway (Security Gateway, referred to as SGW) s1-u interface between network elements collects voice and video user plane signaling.
  • ESRVCC Evolved Single Radio Voice Call Continuity, evolved single radio mode voice call continuity
  • Raw data cleaning the collected signaling includes time information, user information, behavior information, location information, and KPI
  • the key KPI model indicators include at least: initial registration success rate, re-registration success rate, VoLTE network connection rate, V2V call establishment Duration, VoLTE call drop rate, ESRVCC handover success rate, ESRVCC average delay, MOS 3.0 (Mean Opinion Score, mean value of subjective evaluation of call quality) ratio, uplink packet loss rate.
  • Raw data cleaning cleans the effective information in the information, including the stripping of invalid data and the normalization of unplanned data.
  • Multi-dimensional index aggregation For the cleaned data, the indicators of the cell dimension and each network element dimension are aggregated separately, and hourly granular aggregation statistics are performed.
  • the aggregated indicator matrix is as follows:
  • KPI historical feature extraction respectively take the dimension of the window as N and the window as M-KPI aggregation index, obtain the mean, standard deviation, minimum value, maximum value, quarter point, median, and third quarter Points, mean standard deviation, mean variance, Chebyshev statistical features, total variation, and coefficient of variation are used as feature parameters, and the data is transformed into a feature matrix with the same structure as the above indicators.
  • index prediction For index prediction, the above index matrix is converted into a time series sequence format, and historical data is used as the input of the seq2seq+attention prediction model for model training. Real-time data is used as prediction data to predict future indicators and obtain prediction results.
  • the feature matrix is used as the input of the random forest model for model training. And use the prediction results to input the random forest model to perform anomaly detection on the predicted time series.
  • Fig. 7 is a block diagram of a business anomaly prediction device according to an embodiment of the present disclosure, as shown in Fig. 7 , including:
  • the collection module 72 is configured to collect real-time business data at the current time
  • the first aggregation module 74 is configured to aggregate the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
  • the input module 76 is configured to input the multi-dimensional real-time data index into the target neural network model pre-trained based on historical business data, and obtain the business within the preset time period after the current time output by the target neural network model prediction results;
  • the anomaly detection module 78 is configured to input the prediction result into the target random forest model trained in advance based on the historical business data, and obtain the abnormal detection result of the business within the preset time output by the target random forest model.
  • the device also includes:
  • the first stripping module is configured to strip invalid data in the real-time business data to obtain valid real-time business data
  • the first cleaning module is configured to normalize the non-normalized data in the valid real-time business data to obtain cleaned real-time business data.
  • the device also includes:
  • the first extraction module is configured to extract the historical business data
  • the second focusing module is configured to aggregate the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators
  • the first training module is configured to train the constructed initial neural network model according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
  • the device also includes:
  • the second stripping module is configured to strip invalid data in the historical business data to obtain valid historical business data
  • the second cleaning module is configured to normalize the non-normalized data in the valid historical business data to obtain cleaned historical business data.
  • the device also includes:
  • the second extraction module is configured to extract data aggregation indicators with a time window of N and a time window of M from the multi-dimensional historical data indicators respectively, wherein N is not equal to M, and both N and M are integers greater than 1;
  • a determining module configured to determine characteristic parameters according to the data aggregation index
  • a conversion module configured to convert the characteristic parameters into a characteristic matrix
  • the second training module is configured to train the constructed initial random forest model according to the feature matrix to obtain the trained target random forest model.
  • the determination module is further configured to
  • the obtained parameters are composed into the characteristic parameters.
  • the real-time service data includes at least: time information, behavior information, location information and KPI;
  • the historical service data includes at least: time information, behavior information, location information and KPI.
  • Embodiments of the present disclosure also provide a computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to execute the steps in any one of the above method embodiments when running.
  • the above-mentioned computer-readable storage medium may include but not limited to: U disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM) , mobile hard disk, magnetic disk or optical disk and other media that can store computer programs.
  • ROM read-only memory
  • RAM random access memory
  • mobile hard disk magnetic disk or optical disk and other media that can store computer programs.
  • Embodiments of the present disclosure also provide an electronic device, including a memory and a processor, where a computer program is stored in the memory, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
  • the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
  • each module or each step of the above-mentioned disclosure can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network composed of multiple computing devices In fact, they can be implemented in program code executable by a computing device, and thus, they can be stored in a storage device to be executed by a computing device, and in some cases, can be executed in an order different from that shown here. Or described steps, or they are fabricated into individual integrated circuit modules, or multiple modules or steps among them are fabricated into a single integrated circuit module for implementation. As such, the present disclosure is not limited to any specific combination of hardware and software.

Abstract

Embodiments of the present application provide a service abnormality prediction method and device, a storage medium, and an electronic device. The method comprises: acquiring real-time service data at a current time; performing multi-dimensional indicator aggregation on the real-time service data to obtain a multi-dimensional real-time data indicator; inputting the multi-dimensional real-time data indicator into a target neural network model to obtain a prediction result of the service within a preset time period after the current time output by the target neural network model; and inputting the prediction result into a target random forest model to obtain an abnormality detection result of the service within the preset time period output by the target random forest model. The problem in the related art of untimely abnormality processing caused by the operation and maintenance personnel starting abnormality diagnosis only after an abnormality occurs can be solved, and abnormalities that may occur in the network in future can be predicted, so that the operation and maintenance personnel can perform intervention and handling in advance, thereby reducing the risk of complaint.

Description

一种业务异常预测方法、装置、存储介质及电子装置A business abnormality prediction method, device, storage medium and electronic device
相关申请的交叉引用Cross References to Related Applications
本公开基于2021年09月24日提交的发明名称为“一种业务异常预测方法、装置、存储介质及电子装置”的中国专利申请CN202111124591.X,并且要求该专利申请的优先权,通过引用将其所公开的内容全部并入本公开。This disclosure is based on the Chinese patent application CN202111124591.X filed on September 24, 2021 with the title of "a service abnormality prediction method, device, storage medium and electronic device", and claims the priority of this patent application, which is incorporated by reference The contents disclosed therein are fully incorporated into the present disclosure.
技术领域technical field
本公开实施例涉及通信领域,具体而言,涉及一种业务异常预测方法、装置、存储介质及电子装置。Embodiments of the present disclosure relate to the communication field, and in particular, relate to a service anomaly prediction method, device, storage medium, and electronic device.
背景技术Background technique
在移动运营商运维领域,大数据分析系统中存储着成千上万的维度指标数据。网络运维人员通常根据人工经验,对核心指标进行定期巡检,发现指标异常后才能开始接下来的故障处置工作。但由于移动组网的复杂性和指标的多样性,导致故障根因的排查困难、耗时时间长,严重问题会甚至直接导致网络瘫痪,影响用户正常使用。In the field of operation and maintenance of mobile operators, thousands of dimensional index data are stored in the big data analysis system. Network operation and maintenance personnel usually conduct regular inspections on core indicators based on manual experience, and start the next troubleshooting work only after abnormal indicators are found. However, due to the complexity of the mobile network and the diversity of indicators, it is difficult and time-consuming to troubleshoot the root cause of the fault. Serious problems may even directly lead to network paralysis and affect the normal use of users.
目前移动运营商已开始对网络中的关键绩效指标(Key Performance Indicators,简称为KPI)进行监测,并使用阈值对其进行指标异常进行判定。在网络运维提前干预并没有突破性的应用实践。At present, mobile operators have begun to monitor Key Performance Indicators (KPI) in the network, and use thresholds to judge abnormal indicators. There is no breakthrough application practice for early intervention in network operation and maintenance.
针对相关技术中异常发生后运维人员才开始进行异常诊断,导致异常处理不及时的问题,尚未提出解决方案。Aiming at the problem in the related technology that the operation and maintenance personnel start to diagnose the abnormality after the abnormality occurs, which leads to the untimely processing of the abnormality, no solution has been proposed yet.
发明内容Contents of the invention
本公开实施例提供了一种业务异常预测方法、装置、存储介质及电子装置,以至少解决相关技术中异常发生后运维人员才开始进行异常诊断,导致异常处理不及时的问题。Embodiments of the present disclosure provide a service anomaly prediction method, device, storage medium, and electronic device to at least solve the problem in the related art that operation and maintenance personnel start to diagnose anomalies after anomalies occur, resulting in untimely anomaly processing.
根据本公开的一个实施例,提供了一种业务异常预测方法,包括:According to an embodiment of the present disclosure, a business abnormality prediction method is provided, including:
采集当前时间的实时业务数据;Collect real-time business data at the current time;
将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标;Aggregating the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
将所述多维度实时数据指标输入到预先基于历史业务数据训练好的目标神经网络模型中,得到所述目标神经网络模型输出的所述当前时间之后预设时间段内业务的预测结果;Inputting the multi-dimensional real-time data index into the target neural network model trained in advance based on historical business data, and obtaining the forecast result of the business within the preset time period after the current time output by the target neural network model;
将所述预测结果输入预先基于所述历史业务数据训练好的目标随机森林模型中,得到所述目标随机森林模型输出的所述预设时间内业务的异常检测结果。The prediction result is input into the target random forest model trained in advance based on the historical business data, and the abnormal detection result of the business within the preset time output by the target random forest model is obtained.
在一示例性实施例中,在将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标之前,所述方法还包括:In an exemplary embodiment, before performing multi-dimensional indicator aggregation on the real-time business data to obtain multi-dimensional real-time data indicators, the method further includes:
将所述实时业务数据中的无效数据进行剥离,得到有效实时业务数据;Stripping invalid data in the real-time business data to obtain valid real-time business data;
将所述有效实时业务数据中的非规范化数据进行规范化处理,得到清洗后的实时业务数据。Standardize the non-normalized data in the effective real-time business data to obtain cleaned real-time business data.
在一示例性实施例中,所述方法还包括:In an exemplary embodiment, the method also includes:
提取所述历史业务数据;Extracting the historical business data;
将所述历史业务数据进行多维度指标聚集,得到多维度历史数据指标;Aggregating the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators;
根据所述多维度历史数据指标对构建的初始神经网络模型进行训练,得到训练好的所述目标神经网络模型。The constructed initial neural network model is trained according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
在一示例性实施例中,在将所述历史业务数据进行多维度指标聚集,得到多维度历史数据指标之前,所述方法还包括:In an exemplary embodiment, before performing multi-dimensional indicator aggregation on the historical business data to obtain multi-dimensional historical data indicators, the method further includes:
将所述历史业务数据中的无效数据进行剥离,得到有效历史业务数据;Stripping invalid data from the historical business data to obtain valid historical business data;
将所述有效历史业务数据中的非规范化数据进行规范化处理,得到清洗后的历史业务数据。Normalize the non-normalized data in the valid historical business data to obtain cleaned historical business data.
在一示例性实施例中,所述方法还包括:In an exemplary embodiment, the method also includes:
分别从所述多维度历史数据指标中提取时间窗口为N与时间窗口为M的数据聚集指标,其中,N不等于M,N、M均为大于1的整数;Respectively extracting data aggregation indicators with a time window of N and a time window of M from the multi-dimensional historical data indicators, wherein N is not equal to M, and both N and M are integers greater than 1;
根据所述数据聚集指标确定特征参数;determining characteristic parameters according to the data aggregation index;
将所述特征参数转换为特征矩阵;converting the characteristic parameters into a characteristic matrix;
根据所述特征矩阵对构建的初始随机森林模型进行训练,得到训练好的所述目标随机森林模型。The constructed initial random forest model is trained according to the feature matrix to obtain the trained target random forest model.
在一示例性实施例中,根据所述数据聚集指标确定特征参数包括:In an exemplary embodiment, determining characteristic parameters according to the data aggregation index includes:
分别从所述数据聚集指标中获取以下参数:均值、标准差、最小值、最大值、四分之一分位点、中值、四分之三分位点、标准差均值、方差均值、切比雪夫统计特征、总变差、变异系数;The following parameters are respectively obtained from the data aggregation index: mean, standard deviation, minimum value, maximum value, quarter point, median, third quarter point, standard deviation mean, variance mean, cut Bischeff statistical characteristics, total variation, coefficient of variation;
将获取到的所述参数组成所述特征参数。The obtained parameters are composed into the characteristic parameters.
在一示例性实施例中,所述实时业务数据至少包括:时间信息、行为信息、位置信息以及KPI;In an exemplary embodiment, the real-time service data includes at least: time information, behavior information, location information and KPI;
所述历史业务数据至少包括:时间信息、行为信息、位置信息以及KPI。The historical service data includes at least: time information, behavior information, location information and KPI.
根据本公开的另一个实施例,还提供了一种业务异常预测装置,包括:According to another embodiment of the present disclosure, a service anomaly prediction device is also provided, including:
采集模块,设置为采集当前时间的实时业务数据;The collection module is configured to collect real-time business data at the current time;
第一聚集模块,设置为将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标;The first aggregation module is configured to aggregate the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
输入模块,设置为将所述多维度实时数据指标输入到预先基于历史业务数据训练好的目标神经网络模型中,得到所述目标神经网络模型输出的所述当前时间之后预设时间段内业务的预测结果;The input module is configured to input the multi-dimensional real-time data index into the target neural network model trained in advance based on the historical business data, and obtain the output of the target neural network model within the preset time period after the current time. forecast result;
异常检测模块,设置为将所述预测结果输入预先基于所述历史业务数据训练好的目标随机森林模型中,得到所述目标随机森林模型输出的所述预设时间内业务的异常检测结果。The anomaly detection module is configured to input the prediction result into a pre-trained target random forest model based on the historical business data, and obtain the abnormal detection result of the business within the preset time output by the target random forest model.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第一剥离模块,设置为将所述实时业务数据中的无效数据进行剥离,得到有效实时业务数据;The first stripping module is configured to strip invalid data in the real-time business data to obtain valid real-time business data;
第一清洗模块,设置为将所述有效实时业务数据中的非规范化数据进行规范化处理,得到清洗后的实时业务数据。The first cleaning module is configured to normalize the non-normalized data in the valid real-time business data to obtain cleaned real-time business data.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第一提取模块,设置为提取所述历史业务数据;The first extraction module is configured to extract the historical business data;
第二聚焦模块,设置为将所述历史业务数据进行多维度指标聚集,得到多维度历史数据指标;The second focusing module is configured to aggregate the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators;
第一训练模块,设置为根据所述多维度历史数据指标对构建的初始神经网络模型进行训练,得到训练好的所述目标神经网络模型。The first training module is configured to train the constructed initial neural network model according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第二剥离模块,设置为将所述历史业务数据中的无效数据进行剥离,得到有效历史业务数据;The second stripping module is configured to strip invalid data in the historical business data to obtain valid historical business data;
第二清洗模块,设置为将所述有效历史业务数据中的非规范化数据进行规范化处理,得到清洗后的历史业务数据。The second cleaning module is configured to normalize the non-normalized data in the valid historical business data to obtain cleaned historical business data.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第二提取模块,设置为分别从所述多维度历史数据指标中提取时间窗口为N与时间窗口为M的数据聚集指标,其中,N不等于M,N、M均为大于1的整数;The second extraction module is configured to extract data aggregation indicators with a time window of N and a time window of M from the multi-dimensional historical data indicators respectively, wherein N is not equal to M, and both N and M are integers greater than 1;
确定模块,设置为根据所述数据聚集指标确定特征参数;A determining module, configured to determine characteristic parameters according to the data aggregation index;
转换模块,设置为将所述特征参数转换为特征矩阵;A conversion module, configured to convert the characteristic parameters into a characteristic matrix;
第二训练模块,设置为根据所述特征矩阵对构建的初始随机森林模型进行训练,得到训练好的所述目标随机森林模型。The second training module is configured to train the constructed initial random forest model according to the feature matrix to obtain the trained target random forest model.
在一示例性实施例中,所述确定模块,还设置为In an exemplary embodiment, the determination module is further configured to
分别从所述数据聚集指标中获取以下参数:均值、标准差、最小值、最大值、四分之一分位点、中值、四分之三分位点、标准差均值、方差均值、切比雪夫统计特征、总变差、变异系数;The following parameters are respectively obtained from the data aggregation index: mean, standard deviation, minimum value, maximum value, quarter point, median, third quarter point, standard deviation mean, variance mean, cut Bischeff statistical characteristics, total variation, coefficient of variation;
将获取到的所述参数组成所述特征参数。The obtained parameters are composed into the characteristic parameters.
在一示例性实施例中,所述实时业务数据至少包括:时间信息、行为信息、位置信息以及KPI;In an exemplary embodiment, the real-time service data includes at least: time information, behavior information, location information and KPI;
所述历史业务数据至少包括:时间信息、行为信息、位置信息以及KPI。The historical service data includes at least: time information, behavior information, location information and KPI.
根据本公开的又一个实施例,还提供了一种计算机可读的存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。According to yet another embodiment of the present disclosure, there is also provided a computer-readable storage medium, where a computer program is stored in the storage medium, wherein the computer program is set to execute any one of the above method embodiments when running in the steps.
根据本公开的又一个实施例,还提供了一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行上述任一项方法实施例中的步骤。According to yet another embodiment of the present disclosure, there is also provided an electronic device, including a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform any of the above Steps in the method examples.
本公开实施例,采集当前时间的实时业务数据;将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标;将所述多维度实时数据指标输入到预先基于历史业务数据训练好的目标神经网络模型中,得到所述目标神经网络模型输出的所述当前时间之后预设时间段内业务的预测结果;将所述预测结果输入预先基于所述历史业务数据训练好的目标随机森林模型中,得到所述目标随机森林模型输出的所述预设时间内业务的异常检测结果,可以解决相关技术中异常发生后运维人员才开始进行异常诊断,导致异常处理不及时的问题,对网络未来可能发生的异常进行预测,可以使运维人员提前干预、提前处置,降低投诉风险,提升用户体验。In the embodiment of the present disclosure, the real-time business data at the current time is collected; the real-time business data is aggregated with multi-dimensional indicators to obtain multi-dimensional real-time data indicators; the multi-dimensional real-time data indicators are input into the pre-trained historical business data In the target neural network model, the forecast result of the business within the preset time period after the current time output by the target neural network model is obtained; the forecast result is input into the target random forest model trained in advance based on the historical business data In this method, obtaining the abnormality detection results of the business within the preset time output by the target random forest model can solve the problem in related technologies that the operation and maintenance personnel only start to diagnose the abnormality after the abnormality occurs, which leads to the untimely processing of the abnormality. Predicting possible abnormalities in the future can enable operation and maintenance personnel to intervene and deal with them in advance, reduce the risk of complaints, and improve user experience.
附图说明Description of drawings
图1是本公开实施例的业务异常预测方法的移动终端的硬件结构框图;FIG. 1 is a block diagram of a hardware structure of a mobile terminal according to a business anomaly prediction method according to an embodiment of the present disclosure;
图2是根据本公开实施例的业务异常预测方法的流程图;FIG. 2 is a flow chart of a business anomaly prediction method according to an embodiment of the present disclosure;
图3是根据本公开可选实施例的业务异常预测方法的流程图;FIG. 3 is a flow chart of a business anomaly prediction method according to an optional embodiment of the present disclosure;
图4是根据本公开实施例的KPI网络智能预警的流程图;FIG. 4 is a flowchart of a KPI network intelligent early warning according to an embodiment of the present disclosure;
图5是根据本公开实施例的神经网络模型的示意图;5 is a schematic diagram of a neural network model according to an embodiment of the present disclosure;
图6是根据本公开实施例的VoLTE探针采集部署的组网图;FIG. 6 is a network diagram of VoLTE probe collection and deployment according to an embodiment of the present disclosure;
图7是根据本公开实施例的业务异常预测装置的框图。Fig. 7 is a block diagram of a service anomaly prediction device according to an embodiment of the present disclosure.
具体实施方式Detailed ways
下文中将参考附图并结合实施例来详细说明本公开的实施例。Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings and in combination with the embodiments.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence.
本公开实施例中所提供的方法实施例可以在移动终端、计算机终端或者类似的运算装置中执行。以运行在移动终端上为例,图1是本公开实施例的业务异常预测方法的移动终端的硬件结构框图,如图1所示,移动终端可以包括一个或多个(图1中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)和用于存储数据的存储器104,其中,上述移动终端还可以包括用于通信功能的传输设备106以及输入输出设备108。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述移动终端的结构造成限定。例如,移动终端还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。The method embodiments provided in the embodiments of the present disclosure may be executed in mobile terminals, computer terminals or similar computing devices. Taking running on a mobile terminal as an example, Fig. 1 is a block diagram of the hardware structure of the mobile terminal of the business anomaly prediction method of the embodiment of the present disclosure. As shown in Fig. 1, the mobile terminal may include one or more (only shown in Fig. 1 1) Processor 102 (processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a communication function The transmission device 106 and the input and output device 108. Those skilled in the art can understand that the structure shown in FIG. 1 is only for illustration, and it does not limit the structure of the above mobile terminal. For example, the mobile terminal may also include more or fewer components than those shown in FIG. 1 , or have a different configuration from that shown in FIG. 1 .
存储器104可用于存储计算机程序,例如,应用软件的软件程序以及模块,如本公开实施例中的业务异常预测方法对应的计算机程序,处理器102通过运行存储在存储器104内的计算机程序,从而执行各种功能应用以及业务链地址池切片处理,即实现上述的方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至移动终端。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer program corresponding to the business anomaly prediction method in the embodiment of the present disclosure, and the processor 102 executes the computer program stored in the memory 104 by running the Various functional applications and service chain address pool slicing processing realize the above-mentioned method. The memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include a memory that is remotely located relative to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
传输装置106用于经由一个网络接收或者发送数据。上述的网络具体实例可包括移动终端的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,简称为NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,简称为RF)模块,其用于通过无线方式与互联网进行通讯。The transmission device 106 is used to receive or transmit data via a network. The specific example of the above network may include a wireless network provided by the communication provider of the mobile terminal. In one example, the transmission device 106 includes a network interface controller (NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet. In one example, the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
在本实施例中提供了一种运行于上述移动终端或网络架构的业务异常预测方法,图2是根据本公开实施例的业务异常预测方法的流程图,如图2所示,该流程包括如下步骤:In this embodiment, a service anomaly prediction method running on the above-mentioned mobile terminal or network architecture is provided. FIG. 2 is a flowchart of a service anomaly prediction method according to an embodiment of the present disclosure. As shown in FIG. 2 , the process includes the following step:
步骤S202,采集当前时间的实时业务数据;Step S202, collecting real-time business data at the current time;
本实施例中,所述实时业务数据至少包括:时间信息、行为信息、位置信息以及KPI。In this embodiment, the real-time service data includes at least: time information, behavior information, location information and KPI.
步骤S204,将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标;Step S204, aggregating the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
步骤S206,将所述多维度实时数据指标输入到预先基于历史业务数据训练好的目标神经网络模型中,得到所述目标神经网络模型输出的所述当前时间之后预设时间段内业务的预测结果;Step S206, input the multi-dimensional real-time data index into the target neural network model trained in advance based on historical business data, and obtain the forecast result of the business within the preset time period after the current time output by the target neural network model ;
步骤S208,将所述预测结果输入预先基于所述历史业务数据训练好的目标随机森林模型中,得到所述目标随机森林模型输出的所述预设时间内业务的异常检测结果。Step S208, inputting the prediction result into the target random forest model trained in advance based on the historical business data, and obtaining the abnormal detection result of the business within the preset time output by the target random forest model.
通过上述步骤S202至S208,可以解决相关技术中异常发生后运维人员才开始进行异常诊断,导致异常处理不及时的问题,对网络未来可能发生的异常进行预测,可以使运维人员提前干预、提前处置,降低投诉风险,提升用户体验。Through the above steps S202 to S208, it is possible to solve the problem in the related technology that the operation and maintenance personnel start to diagnose the abnormality after the abnormality occurs, which leads to the untimely processing of the abnormality, and predicting the abnormality that may occur in the network in the future can enable the operation and maintenance personnel to intervene in advance, Deal with it in advance to reduce the risk of complaints and improve user experience.
为了去除无效的实时业务数据,在上述步骤S204之前,将所述实时业务数据进行清洗,得到清洗后的实时业务数据,进一步的,将所述实时业务数据中的无效数据进行剥离,得到有效实时业务数据;将所述有效实时业务数据中的非规范化数据进行规范化处理,得到清洗后的实时业务数据。In order to remove invalid real-time business data, before the above step S204, the real-time business data is cleaned to obtain the cleaned real-time business data, and further, the invalid data in the real-time business data is stripped to obtain an effective real-time Business data: standardize the non-standardized data in the effective real-time business data to obtain cleaned real-time business data.
图3是根据本公开可选实施例的业务异常预测方法的流程图,如图3所示,包括:Fig. 3 is a flowchart of a business anomaly prediction method according to an optional embodiment of the present disclosure, as shown in Fig. 3 , including:
步骤S302,提取历史业务数据;Step S302, extracting historical business data;
本实施例中,所述历史业务数据至少包括:时间信息、行为信息、位置信息以及KPI,具体可以从不同位置采集具体数据。In this embodiment, the historical service data at least includes: time information, behavior information, location information and KPI, and specifically, specific data may be collected from different locations.
步骤S304,将所述历史业务数据进行多维度指标聚集,得到多维度历史数据指标;Step S304, aggregating the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators;
步骤S306,根据所述多维度历史数据指标对构建的初始神经网络模型进行训练,得到训练好的所述目标神经网络模型。Step S306: Train the constructed initial neural network model according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
为了去除无效的历史业务数据,在上述步骤S304之前,将所述历史业务数据进行清洗,得到清洗后的历史业务数据,进一步的,将所述历史业务数据中的无效数据进行剥离,得到有效历史业务数据;将所述有效历史业务数据中的非规范化数据进行规范化处理,得到清洗后的历史业务数据。In order to remove invalid historical business data, before the above step S304, the historical business data is cleaned to obtain the cleaned historical business data, and further, the invalid data in the historical business data is stripped to obtain valid historical data Business data: standardize the non-normalized data in the effective historical business data to obtain cleaned historical business data.
本实施例提取历史业务数据,分别对每个维度的历史业务数据进行小时、天粒度指标聚合;基于多维度历史业务数据,提取集合后的历史数据时序特征;构建初始神经网络模型,并将提取特征后的历史业务数据用于对初始神经网络模型训练;初始神经网络模型常用于自然语言翻译,对该模型进行改造后,用于对时间序列的预测,实际结果表明,预测的准确率优于常见时间序列预测算法。训练后得到的目标神经网络模型可以用于预测未来一段时间的预测结果。This embodiment extracts historical business data, and aggregates hourly and daily granularity indicators for historical business data in each dimension; based on multi-dimensional historical business data, extracts the time series features of historical data after collection; constructs an initial neural network model, and extracts The historical business data after the feature is used to train the initial neural network model; the initial neural network model is often used in natural language translation. After the model is modified, it is used to predict the time series. The actual results show that the accuracy of the prediction is better than Common time series forecasting algorithms. The target neural network model obtained after training can be used to predict the prediction results for a period of time in the future.
在一示例性实施例中,分别从所述多维度历史数据指标中提取时间窗口为N与时间窗口为M的数据聚集指标,其中,N不等于M,N、M均为大于1的整数,例如,N为7天,M为60天,采集历史数据时,则从当前时间开始,从多维度历史数据指标中提取时间窗口为7天的数据聚集指标,以及从多维度历史数据指标中提取时间窗口为60天的数据聚集指标;根据所述数据聚集指标确定特征参数;将所述特征参数转换为特征矩阵;根据所述特征矩阵对构建的初始随机森林模型进行训练,得到训练好的所述目标随机森林模型。In an exemplary embodiment, data aggregation indexes with a time window of N and a time window of M are respectively extracted from the multi-dimensional historical data indexes, wherein, N is not equal to M, and both N and M are integers greater than 1, For example, N is 7 days and M is 60 days. When collecting historical data, start from the current time, extract the data aggregation indicators with a time window of 7 days from the multi-dimensional historical data indicators, and extract from the multi-dimensional historical data indicators The time window is a data aggregation index of 60 days; the characteristic parameters are determined according to the data aggregation index; the characteristic parameters are converted into a characteristic matrix; the initial random forest model constructed is trained according to the characteristic matrix, and the trained results are obtained. The target random forest model.
本实施例分别对每个维度的历史业务数据进行小时、天粒度指标聚合;基于多维度历史业务数据,提取集合后的历史统计特征参数,并对随机森林网络进行模型训练;常规的异常检测需要大量的人工标记样本,基于每小时上亿的大数据量样本标记在运维领域几乎不可能实现。本公开实施例采用统计学与人工标记相结合半自动标签方式,就行样本编辑,可以大 幅降低人工标记成本,提升标记效率。训练后得到的目标随机森林模型用于对进行指标异常检测和分类(正常指标、异常指标),给出可能出现异常的指标预测结论。In this embodiment, the hourly and daily granularity index aggregation is performed on the historical business data of each dimension; based on the multi-dimensional historical business data, the historical statistical characteristic parameters after the collection are extracted, and the model training is performed on the random forest network; conventional anomaly detection requires A large number of manually labeled samples, based on hundreds of millions of samples per hour, is almost impossible to achieve in the field of operation and maintenance. The embodiment of the present disclosure adopts a semi-automatic labeling method combining statistics and manual labeling to edit samples, which can greatly reduce the cost of manual labeling and improve labeling efficiency. The target random forest model obtained after training is used to detect and classify abnormal indicators (normal indicators, abnormal indicators), and give prediction conclusions about indicators that may appear abnormal.
进一步的,上述根据所述数据聚集指标确定特征参数具体可以包括:分别从所述数据聚集指标中获取以下参数:均值、标准差、最小值、最大值、四分之一分位点、中值、四分之三分位点、标准差均值、方差均值、切比雪夫统计特征、总变差、变异系数;将获取到的所述参数组成所述特征参数。Further, the above-mentioned determination of characteristic parameters according to the data aggregation index may specifically include: respectively obtaining the following parameters from the data aggregation index: mean value, standard deviation, minimum value, maximum value, quarter point, median , three-quarters point, standard deviation mean, variance mean, Chebyshev statistical features, total variation, coefficient of variation; the obtained parameters form the characteristic parameters.
本实施例相对于相关技术中等到故障触发后运维人员才开始进行故障诊断的模式,通过技术能力演进,为提前识别、提前诊断、提前处置的运维思想给出切实可落地的执行方案。采取了基于多维度KPI历史特征学习方式,对KPI未来趋势进行预测;在异常识别过程中,本实施例与维度-KPI自身的历史数据做比较,不依赖门限阈值划分异常,可以有效的避免虚警和漏警,能够检测出明显的KPI异常以及有劣化趋势的异常;在故障预警界面,通过与运维工作深度结合,根据异常场景,构建黄金业务指标-运维KPI指标、多维度分层预警体系,从而过滤掉偶发性指标波动引起的误告警,大幅缩减了告警范围,提升运维同事优化效率。Compared with the mode in the related technology that the operation and maintenance personnel do not start fault diagnosis until the fault is triggered, this embodiment provides a practical implementation plan for the operation and maintenance thinking of early identification, early diagnosis and early disposal through the evolution of technical capabilities. Based on the multi-dimensional KPI historical feature learning method, the future trend of KPI is predicted; in the process of abnormal identification, this embodiment is compared with the historical data of the dimension-KPI itself, and does not rely on threshold thresholds to divide abnormalities, which can effectively avoid false positives. Alarms and missed alarms can detect obvious KPI abnormalities and abnormalities with a tendency to deteriorate; in the fault early warning interface, through deep integration with operation and maintenance work, according to abnormal scenarios, build golden business indicators-operation and maintenance KPI indicators, multi-dimensional layering Early warning system, so as to filter out false alarms caused by sporadic index fluctuations, greatly reduce the scope of alarms, and improve the optimization efficiency of operation and maintenance colleagues.
图4是根据本公开实施例的KPI网络智能预警的流程图,如图4所示,包括:Fig. 4 is a flowchart of a KPI network intelligent early warning according to an embodiment of the present disclosure, as shown in Fig. 4 , including:
探针采集数据后,将数据进行基础数据清洗,在清洗阶段完成基础指标聚集以及异常数据处理。After the probe collects the data, it cleans the basic data, and completes the basic index aggregation and abnormal data processing in the cleaning stage.
聚合后的数据通过特征工程进行特征提取,该特征包括全局数据单时间粒度的统计特征以及维度数据的时间序列特征(即时序预测特征)。The aggregated data is subjected to feature extraction through feature engineering, which includes statistical features of single-time granularity of global data and time series features of dimensional data (that is, sequence prediction features).
提取的特征作为AI模型的输入,时序序列输入到预测模型后,得到指定维度的预测结果。预测的历史数据与预测结果拼接后输入异常识别模型,进行异常识别的判定,其中,异常识别模型为随机森林异常识别模型,可以通过历史业务数据训练得到的,历史业务数据至少包括:时间信息、行为信息、位置信息以及KPI,KPI具体包括统计学标签与人工分析标签。The extracted features are used as the input of the AI model, and the time series sequence is input to the prediction model to obtain the prediction result of the specified dimension. The predicted historical data and the predicted results are spliced and then input into the abnormal recognition model to determine the abnormal recognition. Among them, the abnormal recognition model is a random forest abnormal recognition model, which can be obtained through historical business data training. The historical business data includes at least: time information, Behavior information, location information, and KPIs. KPIs specifically include statistical tags and manual analysis tags.
最终得到未来可能存在异常维度异常维度汇总结果。Finally, the summary result of abnormal dimensions that may exist in the future is obtained.
图5是根据本公开实施例的神经网络模型的示意图,如图5所示,该模型主要包括编码器Encoder、解码器Decoder和注意力模型Attention三大部分,编解码网络使用RNN(Recurrent Neural Network,递归神经网络)模型。时序KPI指标作为Encoder模型输入,通过Encoder对特征进行编码转换为特征张量,作为Decoder输入。结合Self Attention,根据影响程度赋予不同权重,特征矩阵和Encoder结果进行拼接作为全连接网络的输入,最终输出预测结果。Fig. 5 is a schematic diagram of a neural network model according to an embodiment of the present disclosure. As shown in Fig. 5, the model mainly includes three parts: an encoder Encoder, a decoder Decoder and an attention model Attention. The encoding and decoding network uses RNN (Recurrent Neural Network , recurrent neural network) model. The time series KPI index is used as the input of the Encoder model, and the feature is encoded and converted into a feature tensor through the Encoder, which is used as the input of the Decoder. Combined with Self Attention, different weights are given according to the degree of influence, and the feature matrix and Encoder results are concatenated as the input of the fully connected network, and finally the prediction result is output.
图6是根据本公开实施例的VoLTE探针采集部署的组网图,如图6所示,包括:IMS(IP Multimedia Subsystem IP,多媒体系统)、EPC(Evolved Packet Core,全IP的分组核心网)、E-UTRAN(Evolved UMTS Terrestrial Radio Access Network,演进的UMTS陆地无线接入网)、GERAN(GSM EDGE Radio Access Network GSM/EDGE,无线通讯网络)、eMSC/GMSC(Evolved Mobile Switching Center/Gateway Mobile Switching Center,演进网关移动交换中心/网关移动交换中心)、DRA(Diameter Routing Agent,路由代理节点)、DNS/ENUM(Domain Name System/Telephone Number Mapping working group,域名系统/电话号码映射工作组)以及三合一HSS(Home Subscriber Server,归属签约用户服务器),其中,IMS包括:BGCF(Breakout Gateway Control Function,出口网关控制功能)、MGCF(Media Gateway Control Function,媒体网关控制功能)、AS(Application Server,应用服务器)、I/S-CSCF (Interrogating/Serving Call Session Control Function,查询/服务CSCF)、IBCF(互联边界控制功能)、P-CSCF/SBC(Proxy Call Session Control Function/Session Border Controller,代理CSCF/会话边界控制器)以及PCRF(Policy and Charging Rules Function,策略与计费规则功能),EPC包括:SEA-GW(Serving Gateway,服务网关)、MME(Mobility Management Entity,移动管理节点),E-UTRAN包括eNB(eNode B,基站)。本实施例涉及的网元至少包括I/S-CSCF、P-CSCF/SBC、SEA-GW、MME、eNB以及eMSC/GMSC,各网元连接线上为第三代合作伙伴计划(The 3rd Genenration Partners hip Project,简称为3GPP)协议标准采集接口,即数据采集探针部署位置。Fig. 6 is a network diagram of VoLTE probe collection and deployment according to an embodiment of the present disclosure, as shown in Fig. 6 , including: IMS (IP Multimedia Subsystem IP, multimedia system), EPC (Evolved Packet Core, all-IP grouping core network ), E-UTRAN (Evolved UMTS Terrestrial Radio Access Network, Evolved UMTS Terrestrial Radio Access Network), GERAN (GSM EDGE Radio Access Network GSM/EDGE, wireless communication network), eMSC/GMSC (Evolved Mobile Switching Center/Gateway Mobile Switching Center, Evolved Gateway Mobile Switching Center/Gateway Mobile Switching Center), DRA (Diameter Routing Agent, routing proxy node), DNS/ENUM (Domain Name System/Telephone Number Mapping working group, domain name system/telephone number mapping working group) and Three-in-one HSS (Home Subscriber Server, belonging to the subscriber server), among which, IMS includes: BGCF (Breakout Gateway Control Function, egress gateway control function), MGCF (Media Gateway Control Function, media gateway control function), AS (Application Server , application server), I/S-CSCF (Interrogating/Serving Call Session Control Function, query/serving CSCF), IBCF (Internet Border Control Function), P-CSCF/SBC (Proxy Call Session Control Function/Session Border Controller, proxy CSCF/Session Border Controller) and PCRF (Policy and Charging Rules Function, policy and charging rule function), EPC includes: SEA-GW (Serving Gateway, service gateway), MME (Mobility Management Entity, mobile management node), E - UTRAN includes eNB (eNode B, base station). The network elements involved in this embodiment include at least I/S-CSCF, P-CSCF/SBC, SEA-GW, MME, eNB, and eMSC/GMSC. Partners hip Project, referred to as 3GPP) protocol standard collection interface, that is, the deployment location of the data collection probe.
本实施例基于VoLTE(Voice over Long Term Evolution,长期演进语音承载)移动通信网络视频业务异常预测,包括网络指标采集、原始数据清洗、多维度指标聚集、KPI历史特征提取、KPI指标预测、指标异常识别过程。This embodiment is based on VoLTE (Voice over Long Term Evolution, long-term evolution voice bearer) mobile communication network video service anomaly prediction, including network index collection, raw data cleaning, multi-dimensional index aggregation, KPI historical feature extraction, KPI index prediction, index anomaly identification process.
网络指标采集,如图6所示,硬探针部署在I/S-CSCF和P-CSCF/SBC间的Mw口,采集注册、呼叫、掉话类信令;部署在P/SBC和PCRF的Rx口采集掉话信令;部署在MME和eMSC网元间的Sv口,采集ESRVCC(Evolved Single Radio Voice Call Continuity,演进单无线模式语音呼叫连续性)切换信令;部署在小区和信令网关(Security Gateway,简称为SGW)网元间的s1-u口采集语音视频用户面信令。Network indicator collection, as shown in Figure 6, the hard probe is deployed on the Mw interface between the I/S-CSCF and P-CSCF/SBC to collect signaling such as registration, call, and call drop; it is deployed on the P/SBC and PCRF The Rx port collects call drop signaling; it is deployed on the Sv port between the MME and the eMSC network element to collect ESRVCC (Evolved Single Radio Voice Call Continuity, evolved single radio mode voice call continuity) handover signaling; it is deployed in the cell and signaling gateway (Security Gateway, referred to as SGW) s1-u interface between network elements collects voice and video user plane signaling.
原始数据清洗,采集的信令包含时间信息、用户信息、行为信息、位置信息,以及KPI,关键KPI模型指标至少包括:初始注册成功率,重注册成功率,VoLTE网络接通率,V2V呼叫建立时长,VoLTE掉话率,ESRVCC切换成功率,ESRVCC切换平均时延,MOS 3.0(Mean Opinion Score,通话质量主观评测均值)占比,上行丢包率。原始数据清洗对信息中的有效信息进行清洗,包括无效数据的剥离、将非规划化数据规范化。Raw data cleaning, the collected signaling includes time information, user information, behavior information, location information, and KPI, the key KPI model indicators include at least: initial registration success rate, re-registration success rate, VoLTE network connection rate, V2V call establishment Duration, VoLTE call drop rate, ESRVCC handover success rate, ESRVCC average delay, MOS 3.0 (Mean Opinion Score, mean value of subjective evaluation of call quality) ratio, uplink packet loss rate. Raw data cleaning cleans the effective information in the information, including the stripping of invalid data and the normalization of unplanned data.
多维度指标聚集,对于清洗后的数据,分别进行小区维度及各网元维度的指标聚集,做小时粒度聚集统计。聚集后的指标矩阵如下:Multi-dimensional index aggregation. For the cleaned data, the indicators of the cell dimension and each network element dimension are aggregated separately, and hourly granular aggregation statistics are performed. The aggregated indicator matrix is as follows:
Figure PCTCN2022119099-appb-000001
Figure PCTCN2022119099-appb-000001
KPI历史特征提取,分别取窗口为N及窗口为M的维度-KPI聚集指标,获取均值、标准差、最小值、最大值、四分之一分位点、中值、四分之三分位点、标准差均值、方差均值、切比雪夫统计特征、总变差、变异系数作为特征参数,并将数据转换为与上述指标结构相同的特征矩阵。KPI historical feature extraction, respectively take the dimension of the window as N and the window as M-KPI aggregation index, obtain the mean, standard deviation, minimum value, maximum value, quarter point, median, and third quarter Points, mean standard deviation, mean variance, Chebyshev statistical features, total variation, and coefficient of variation are used as feature parameters, and the data is transformed into a feature matrix with the same structure as the above indicators.
指标预测,将上述指标矩阵转换为时序序列格式,历史数据作为seq2seq+attention预测模型输入进行模型训练。实时数据作为预测数据,对未来的指标进行预测,得到预测结果。For index prediction, the above index matrix is converted into a time series sequence format, and historical data is used as the input of the seq2seq+attention prediction model for model training. Real-time data is used as prediction data to predict future indicators and obtain prediction results.
指标异常识别,将特征矩阵作为随机森林模型输入进行模型训练。并使用预测结果输入该随机森林模型对预测的时序序列进行异常检测。Indicator anomaly identification, the feature matrix is used as the input of the random forest model for model training. And use the prediction results to input the random forest model to perform anomaly detection on the predicted time series.
根据本公开的另一个实施例,还提供了一种业务异常预测装置,图7是根据本公开实施 例的业务异常预测装置的框图,如图7所示,包括:According to another embodiment of the present disclosure, there is also provided a business anomaly prediction device, and Fig. 7 is a block diagram of a business anomaly prediction device according to an embodiment of the present disclosure, as shown in Fig. 7 , including:
采集模块72,设置为采集当前时间的实时业务数据;The collection module 72 is configured to collect real-time business data at the current time;
第一聚集模块74,设置为将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标;The first aggregation module 74 is configured to aggregate the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
输入模块76,设置为将所述多维度实时数据指标输入到预先基于历史业务数据训练好的目标神经网络模型中,得到所述目标神经网络模型输出的所述当前时间之后预设时间段内业务的预测结果;The input module 76 is configured to input the multi-dimensional real-time data index into the target neural network model pre-trained based on historical business data, and obtain the business within the preset time period after the current time output by the target neural network model prediction results;
异常检测模块78,设置为将所述预测结果输入预先基于所述历史业务数据训练好的目标随机森林模型中,得到所述目标随机森林模型输出的所述预设时间内业务的异常检测结果。The anomaly detection module 78 is configured to input the prediction result into the target random forest model trained in advance based on the historical business data, and obtain the abnormal detection result of the business within the preset time output by the target random forest model.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第一剥离模块,设置为将所述实时业务数据中的无效数据进行剥离,得到有效实时业务数据;The first stripping module is configured to strip invalid data in the real-time business data to obtain valid real-time business data;
第一清洗模块,设置为将所述有效实时业务数据中的非规范化数据进行规范化处理,得到清洗后的实时业务数据。The first cleaning module is configured to normalize the non-normalized data in the valid real-time business data to obtain cleaned real-time business data.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第一提取模块,设置为提取所述历史业务数据;The first extraction module is configured to extract the historical business data;
第二聚焦模块,设置为将所述历史业务数据进行多维度指标聚集,得到多维度历史数据指标;The second focusing module is configured to aggregate the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators;
第一训练模块,设置为根据所述多维度历史数据指标对构建的初始神经网络模型进行训练,得到训练好的所述目标神经网络模型。The first training module is configured to train the constructed initial neural network model according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第二剥离模块,设置为将所述历史业务数据中的无效数据进行剥离,得到有效历史业务数据;The second stripping module is configured to strip invalid data in the historical business data to obtain valid historical business data;
第二清洗模块,设置为将所述有效历史业务数据中的非规范化数据进行规范化处理,得到清洗后的历史业务数据。The second cleaning module is configured to normalize the non-normalized data in the valid historical business data to obtain cleaned historical business data.
在一示例性实施例中,所述装置还包括:In an exemplary embodiment, the device also includes:
第二提取模块,设置为分别从所述多维度历史数据指标中提取时间窗口为N与时间窗口为M的数据聚集指标,其中,N不等于M,N、M均为大于1的整数;The second extraction module is configured to extract data aggregation indicators with a time window of N and a time window of M from the multi-dimensional historical data indicators respectively, wherein N is not equal to M, and both N and M are integers greater than 1;
确定模块,设置为根据所述数据聚集指标确定特征参数;A determining module, configured to determine characteristic parameters according to the data aggregation index;
转换模块,设置为将所述特征参数转换为特征矩阵;A conversion module, configured to convert the characteristic parameters into a characteristic matrix;
第二训练模块,设置为根据所述特征矩阵对构建的初始随机森林模型进行训练,得到训练好的所述目标随机森林模型。The second training module is configured to train the constructed initial random forest model according to the feature matrix to obtain the trained target random forest model.
在一示例性实施例中,所述确定模块,还设置为In an exemplary embodiment, the determination module is further configured to
分别从所述数据聚集指标中获取以下参数:均值、标准差、最小值、最大值、四分之一分位点、中值、四分之三分位点、标准差均值、方差均值、切比雪夫统计特征、总变差、变异系数;The following parameters are respectively obtained from the data aggregation index: mean, standard deviation, minimum value, maximum value, quarter point, median, third quarter point, standard deviation mean, variance mean, cut Bischeff statistical characteristics, total variation, coefficient of variation;
将获取到的所述参数组成所述特征参数。The obtained parameters are composed into the characteristic parameters.
在一示例性实施例中,所述实时业务数据至少包括:时间信息、行为信息、位置信息以及KPI;In an exemplary embodiment, the real-time service data includes at least: time information, behavior information, location information and KPI;
所述历史业务数据至少包括:时间信息、行为信息、位置信息以及KPI。The historical service data includes at least: time information, behavior information, location information and KPI.
本公开的实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有计算机程序,其中,该计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide a computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to execute the steps in any one of the above method embodiments when running.
在一个示例性实施例中,上述计算机可读存储介质可以包括但不限于:U盘、只读存储器(Read-Only Memory,简称为ROM)、随机存取存储器(Random Access Memory,简称为RAM)、移动硬盘、磁碟或者光盘等各种可以存储计算机程序的介质。In an exemplary embodiment, the above-mentioned computer-readable storage medium may include but not limited to: U disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM) , mobile hard disk, magnetic disk or optical disk and other media that can store computer programs.
本公开的实施例还提供了一种电子装置,包括存储器和处理器,该存储器中存储有计算机程序,该处理器被设置为运行计算机程序以执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide an electronic device, including a memory and a processor, where a computer program is stored in the memory, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
在一个示例性实施例中,上述电子装置还可以包括传输设备以及输入输出设备,其中,该传输设备和上述处理器连接,该输入输出设备和上述处理器连接。In an exemplary embodiment, the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
本实施例中的具体示例可以参考上述实施例及示例性实施方式中所描述的示例,本实施例在此不再赘述。For specific examples in this embodiment, reference may be made to the examples described in the foregoing embodiments and exemplary implementation manners, and details will not be repeated here in this embodiment.
显然,本领域的技术人员应该明白,上述的本公开的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本公开不限制于任何特定的硬件和软件结合。Obviously, those skilled in the art should understand that each module or each step of the above-mentioned disclosure can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network composed of multiple computing devices In fact, they can be implemented in program code executable by a computing device, and thus, they can be stored in a storage device to be executed by a computing device, and in some cases, can be executed in an order different from that shown here. Or described steps, or they are fabricated into individual integrated circuit modules, or multiple modules or steps among them are fabricated into a single integrated circuit module for implementation. As such, the present disclosure is not limited to any specific combination of hardware and software.
以上所述仅为本公开的优选实施例而已,并不用于限制本公开,对于本领域的技术人员来说,本公开可以有各种更改和变化。凡在本公开的原则之内,所作的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。The above descriptions are only preferred embodiments of the present disclosure, and are not intended to limit the present disclosure. For those skilled in the art, the present disclosure may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the principle of the present disclosure shall be included in the protection scope of the present disclosure.

Claims (10)

  1. 一种业务异常预测方法,包括:A business abnormality prediction method, comprising:
    采集当前时间的实时业务数据;Collect real-time business data at the current time;
    将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标;Aggregating the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
    将所述多维度实时数据指标输入到预先基于历史业务数据训练好的目标神经网络模型中,得到所述目标神经网络模型输出的所述当前时间之后预设时间段内业务的预测结果;Inputting the multi-dimensional real-time data index into the target neural network model trained in advance based on historical business data, and obtaining the forecast result of the business within the preset time period after the current time output by the target neural network model;
    将所述预测结果输入预先基于所述历史业务数据训练好的目标随机森林模型中,得到所述目标随机森林模型输出的所述预设时间内业务的异常检测结果。The prediction result is input into the target random forest model trained in advance based on the historical business data, and the abnormal detection result of the business within the preset time output by the target random forest model is obtained.
  2. 根据权利要求1所述的方法,其中,在将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标之前,所述方法还包括:The method according to claim 1, wherein, before aggregating the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators, the method further comprises:
    将所述实时业务数据中的无效数据进行剥离,得到有效实时业务数据;Stripping invalid data in the real-time business data to obtain valid real-time business data;
    将所述有效实时业务数据中的非规范化数据进行规范化处理,得到清洗后的实时业务数据。Standardize the non-normalized data in the effective real-time business data to obtain cleaned real-time business data.
  3. 根据权利要求1所述的方法,其中,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    提取所述历史业务数据;Extracting the historical business data;
    将所述历史业务数据进行多维度指标聚集,得到多维度历史数据指标;Aggregating the historical business data with multi-dimensional indicators to obtain multi-dimensional historical data indicators;
    根据所述多维度历史数据指标对构建的初始神经网络模型进行训练,得到训练好的所述目标神经网络模型。The constructed initial neural network model is trained according to the multi-dimensional historical data indicators to obtain the trained target neural network model.
  4. 根据权利要求3所述的方法,其中,在将所述历史业务数据进行多维度指标聚集,得到多维度历史数据指标之前,所述方法还包括:The method according to claim 3, wherein, before performing multi-dimensional indicator aggregation on the historical business data to obtain multi-dimensional historical data indicators, the method further comprises:
    将所述历史业务数据中的无效数据进行剥离,得到有效历史业务数据;Stripping invalid data from the historical business data to obtain valid historical business data;
    将所述有效历史业务数据中的非规范化数据进行规范化处理,得到清洗后的历史业务数据。Normalize the non-normalized data in the valid historical business data to obtain cleaned historical business data.
  5. 根据权利要求3所述的方法,其中,所述方法还包括:The method according to claim 3, wherein the method further comprises:
    分别从所述多维度历史数据指标中提取时间窗口为N与时间窗口为M的数据聚集指标,其中,N不等于M,N、M均为大于1的整数;Respectively extracting data aggregation indicators with a time window of N and a time window of M from the multi-dimensional historical data indicators, wherein N is not equal to M, and both N and M are integers greater than 1;
    根据所述数据聚集指标确定特征参数;determining characteristic parameters according to the data aggregation index;
    将所述特征参数转换为特征矩阵;converting the characteristic parameters into a characteristic matrix;
    根据所述特征矩阵对构建的初始随机森林模型进行训练,得到训练好的所述目标随机森林模型。The constructed initial random forest model is trained according to the feature matrix to obtain the trained target random forest model.
  6. 根据权利要求5所述的方法,其中,根据所述数据聚集指标确定特征参数包括:The method according to claim 5, wherein determining characteristic parameters according to the data aggregation index comprises:
    分别从所述数据聚集指标中获取以下参数:均值、标准差、最小值、最大值、四分之一分位点、中值、四分之三分位点、标准差均值、方差均值、切比雪夫统计特征、总变差、变异系数;The following parameters are respectively obtained from the data aggregation index: mean, standard deviation, minimum value, maximum value, quarter point, median, third quarter point, standard deviation mean, variance mean, cut Bischeff statistical characteristics, total variation, coefficient of variation;
    将获取到的所述参数组成所述特征参数。The obtained parameters are composed into the characteristic parameters.
  7. 根据权利要求1至6中任一项所述的方法,其中,A method according to any one of claims 1 to 6, wherein,
    所述实时业务数据至少包括:时间信息、行为信息、位置信息以及KPI;The real-time service data includes at least: time information, behavior information, location information and KPI;
    所述历史业务数据至少包括:时间信息、行为信息、位置信息以及KPI。The historical service data includes at least: time information, behavior information, location information and KPI.
  8. 一种业务异常预测装置,包括:A business anomaly prediction device, comprising:
    采集模块,设置为采集当前时间的实时业务数据;The collection module is configured to collect real-time business data at the current time;
    第一聚集模块,设置为将所述实时业务数据进行多维度指标聚集,得到多维度实时数据指标;The first aggregation module is configured to aggregate the real-time business data with multi-dimensional indicators to obtain multi-dimensional real-time data indicators;
    输入模块,设置为将所述多维度实时数据指标输入到预先基于历史业务数据训练好的目标神经网络模型中,得到所述目标神经网络模型输出的所述当前时间之后预设时间段内业务的预测结果;The input module is configured to input the multi-dimensional real-time data index into the target neural network model trained in advance based on the historical business data, and obtain the output of the target neural network model within the preset time period after the current time. forecast result;
    异常检测模块,设置为将所述预测结果输入预先基于所述历史业务数据训练好的目标随机森林模型中,得到所述目标随机森林模型输出的所述预设时间内业务的异常检测结果。The anomaly detection module is configured to input the prediction result into a pre-trained target random forest model based on the historical business data, and obtain the abnormal detection result of the business within the preset time output by the target random forest model.
  9. 一种计算机可读的存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行所述权利要求1至7任一项中所述的方法。A computer-readable storage medium, wherein a computer program is stored in the storage medium, wherein the computer program is configured to perform the method described in any one of claims 1 to 7 when running.
  10. 一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行所述权利要求1至7任一项中所述的方法。An electronic device, comprising a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform the method described in any one of claims 1 to 7.
PCT/CN2022/119099 2021-09-24 2022-09-15 Service abnormality prediction method and device, storage medium, and electronic device WO2023045829A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111124591.X 2021-09-24
CN202111124591.XA CN115859188A (en) 2021-09-24 2021-09-24 Service abnormity prediction method and device, storage medium and electronic device

Publications (1)

Publication Number Publication Date
WO2023045829A1 true WO2023045829A1 (en) 2023-03-30

Family

ID=85653212

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/119099 WO2023045829A1 (en) 2021-09-24 2022-09-15 Service abnormality prediction method and device, storage medium, and electronic device

Country Status (2)

Country Link
CN (1) CN115859188A (en)
WO (1) WO2023045829A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116628428A (en) * 2023-07-24 2023-08-22 华能信息技术有限公司 Data processing method and system
CN116701127A (en) * 2023-08-09 2023-09-05 睿至科技集团有限公司 Big data-based application performance monitoring method and platform
CN116827971A (en) * 2023-08-29 2023-09-29 北京国网信通埃森哲信息技术有限公司 Block chain-based carbon emission data storage and transmission method, device and equipment
CN117041121A (en) * 2023-10-07 2023-11-10 成都无声讯通科技有限责任公司 Internet of things anomaly monitoring method and system based on data mining
CN117196322A (en) * 2023-11-06 2023-12-08 深圳市明心数智科技有限公司 Intelligent wind control method, intelligent wind control device, computer equipment and storage medium
CN117389179A (en) * 2023-10-17 2024-01-12 司空定制家居科技有限公司 Remote intelligent centralized control method and system for inspection equipment
CN117389179B (en) * 2023-10-17 2024-05-03 北京思木企业管理咨询中心(有限合伙) Remote intelligent centralized control method and system for inspection equipment

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116381511B (en) * 2023-05-30 2023-08-29 中国华能集团清洁能源技术研究院有限公司 Battery state detection method and device, electronic equipment and storage medium
CN116528274B (en) * 2023-07-05 2023-09-22 腾讯科技(深圳)有限公司 Network quality regulation and control method and related equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111062464A (en) * 2019-10-24 2020-04-24 中国电力科学研究院有限公司 Power communication network reliability prediction and guarantee method and system based on deep learning
CN111914873A (en) * 2020-06-05 2020-11-10 华南理工大学 Two-stage cloud server unsupervised anomaly prediction method
US20200380335A1 (en) * 2019-05-30 2020-12-03 AVAST Software s.r.o. Anomaly detection in business intelligence time series
CN112115024A (en) * 2020-09-03 2020-12-22 上海上讯信息技术股份有限公司 Training method and device for fault prediction neural network model

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200380335A1 (en) * 2019-05-30 2020-12-03 AVAST Software s.r.o. Anomaly detection in business intelligence time series
CN111062464A (en) * 2019-10-24 2020-04-24 中国电力科学研究院有限公司 Power communication network reliability prediction and guarantee method and system based on deep learning
CN111914873A (en) * 2020-06-05 2020-11-10 华南理工大学 Two-stage cloud server unsupervised anomaly prediction method
CN112115024A (en) * 2020-09-03 2020-12-22 上海上讯信息技术股份有限公司 Training method and device for fault prediction neural network model

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116628428A (en) * 2023-07-24 2023-08-22 华能信息技术有限公司 Data processing method and system
CN116628428B (en) * 2023-07-24 2023-10-31 华能信息技术有限公司 Data processing method and system
CN116701127A (en) * 2023-08-09 2023-09-05 睿至科技集团有限公司 Big data-based application performance monitoring method and platform
CN116701127B (en) * 2023-08-09 2023-12-19 睿至科技集团有限公司 Big data-based application performance monitoring method and platform
CN116827971A (en) * 2023-08-29 2023-09-29 北京国网信通埃森哲信息技术有限公司 Block chain-based carbon emission data storage and transmission method, device and equipment
CN116827971B (en) * 2023-08-29 2023-11-24 北京国网信通埃森哲信息技术有限公司 Block chain-based carbon emission data storage and transmission method, device and equipment
CN117041121A (en) * 2023-10-07 2023-11-10 成都无声讯通科技有限责任公司 Internet of things anomaly monitoring method and system based on data mining
CN117041121B (en) * 2023-10-07 2023-12-19 成都无声讯通科技有限责任公司 Internet of Things anomaly monitoring method and system based on data mining
CN117389179A (en) * 2023-10-17 2024-01-12 司空定制家居科技有限公司 Remote intelligent centralized control method and system for inspection equipment
CN117389179B (en) * 2023-10-17 2024-05-03 北京思木企业管理咨询中心(有限合伙) Remote intelligent centralized control method and system for inspection equipment
CN117196322A (en) * 2023-11-06 2023-12-08 深圳市明心数智科技有限公司 Intelligent wind control method, intelligent wind control device, computer equipment and storage medium
CN117196322B (en) * 2023-11-06 2024-03-26 深圳市明心数智科技有限公司 Intelligent wind control method, intelligent wind control device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN115859188A (en) 2023-03-28

Similar Documents

Publication Publication Date Title
WO2023045829A1 (en) Service abnormality prediction method and device, storage medium, and electronic device
JP6690011B2 (en) System and method for measuring effective customer impact of network problems in real time using streaming analysis
EP2832040B1 (en) System and method for root cause analysis of mobile network performance problems
EP3286878B1 (en) Fault diagnosis in networks
WO2016017208A1 (en) Monitoring system, monitoring device, and inspection device
EP2894813A1 (en) Technique for creating a knowledge base for alarm management in a communications network
KR20180120558A (en) System and method for predicting communication apparatuses failure based on deep learning
WO2014019624A1 (en) Manipulation of streams of monitoring data
WO2022028120A1 (en) Indicator detection model acquisition method and apparatus, fault locating method and apparatus, and device and storage medium
CN105306246B (en) A kind of method, apparatus and server of the complaint of automatic-answering back device network class
US11777824B2 (en) Anomaly detection method and apparatus
EP3491784B1 (en) Estimation of losses in a video stream
CN104539556A (en) Method for providing satisfaction service quality for users under heterogeneous network environment
US11777786B2 (en) Method, device and computer program product for anomaly detection and root cause analysis
CN117093461A (en) Method, system, equipment and storage medium for time delay detection and analysis
Nougnanke et al. Low-overhead near-real-time flow statistics collection in SDN
CN111371575B (en) Method and device for delimiting call problem
Theera-Ampornpunt et al. Using big data for more dependability: a cellular network tale
CN116756518A (en) Cloud edge cooperative power defect identification method and related device based on federal learning
Khatib et al. Knowledge acquisition for fault management in LTE networks
US10742485B2 (en) Method for determining a sequence of events, a determination device for determining a sequence of events, and a providing device
CN116016288A (en) Flow monitoring method, device, equipment and storage medium of industrial equipment
WO2018064934A1 (en) Method and device for evaluating video quality
CN114785617A (en) 5G network application layer anomaly detection method and system
Babaria et al. FlowFormers: Transformer-based Models for Real-time Network Flow Classification