WO2021189844A1 - Detection method and apparatus for multivariate kpi time series, and device and storage medium - Google Patents

Detection method and apparatus for multivariate kpi time series, and device and storage medium Download PDF

Info

Publication number
WO2021189844A1
WO2021189844A1 PCT/CN2020/125003 CN2020125003W WO2021189844A1 WO 2021189844 A1 WO2021189844 A1 WO 2021189844A1 CN 2020125003 W CN2020125003 W CN 2020125003W WO 2021189844 A1 WO2021189844 A1 WO 2021189844A1
Authority
WO
WIPO (PCT)
Prior art keywords
preset
value
time series
gaussian distribution
kpi time
Prior art date
Application number
PCT/CN2020/125003
Other languages
French (fr)
Chinese (zh)
Inventor
邓悦
郑立颖
徐亮
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021189844A1 publication Critical patent/WO2021189844A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2477Temporal data queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means

Definitions

  • This application relates to the field of artificial intelligence technology, and in particular to a detection method, device, computer equipment, and computer-readable storage medium for multiple KPI time series.
  • Industrial equipment such as servers, spacecraft, robotic assistance systems, engines, etc.
  • machines usually generate multiple time series indicators so that the abnormal behavior of each device can be detected in time.
  • Multiple univariate time series from the same device or the same entity form a multivariate KPI time series.
  • the effect of directly using multiple KPI time series to detect entity anomalies at the entity level is better than that of multiple univariate time series.
  • the anomaly detection of multivariate KPI time series is mainly divided into the deterministic model and the random based model.
  • the deterministic model mainly reconstructs the "normal" time series behavior and uses the reconstruction error to perform multi-sensor anomaly detection.
  • the inventor realizes that the deterministic model requires a large number of labeled time series for training; and based on the randomness model, the value range of the historical multivariate KPI time series is obtained, and the current multivariate KPI time series is detected through the value range whether the current multivariate KPI time series is abnormal, and the accuracy of the detection is Lower.
  • This application provides a method for detecting multiple KPI time series.
  • the method for detecting multiple KPI time series includes the following steps:
  • the preset segmentation strategy and the multi-element KPI time series to be trained determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
  • the preset threshold and the reconstruction probability value of the to-be-detected value it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  • the present application also provides a detection device for multiple KPI time series, and the detection device for multiple KPI time series includes:
  • the obtaining module is used to obtain the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
  • a generating module configured to train a preset model according to the sub-multiple KPI time series to be trained, and generate a corresponding deterministic model
  • the first acquisition module is used to acquire the time series of multiple KPIs to be tested
  • the second obtaining module is configured to obtain the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
  • the determining module is configured to determine whether the time corresponding to the value to be detected is an abnormal point according to a preset threshold value and the reconstruction probability value of the value to be detected.
  • the present application also provides a computer device that includes a processor, a memory, and a computer program stored on the memory and executable by the processor, wherein when the computer program is executed by the processor , To achieve the following steps:
  • the preset segmentation strategy and the multi-element KPI time series to be trained determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
  • the preset threshold and the reconstruction probability value of the to-be-detected value it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  • the present application also provides a computer-readable storage medium having a computer program stored on the computer-readable storage medium, and when the computer program is executed by a processor, the following steps are implemented:
  • the preset segmentation strategy and the multi-element KPI time series to be trained determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
  • the preset threshold and the reconstruction probability value of the to-be-detected value it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  • FIG. 1 is a schematic flowchart of a method for detecting multiple KPI time series according to an embodiment of the application
  • FIG. 2 is a schematic flowchart of sub-steps of the method for detecting multiple KPI time series in FIG. 1;
  • FIG. 3 is a schematic diagram of decoding and encoding of a deterministic model in an embodiment of the application
  • FIG. 4 is a schematic block diagram of a detection device for multiple KPI time series provided by an embodiment of the application
  • FIG. 5 is a schematic block diagram of the structure of a computer device related to an embodiment of the application.
  • the embodiments of the present application provide a detection method, device, computer equipment, and computer-readable storage medium for multiple KPI time series.
  • the detection method of the multiple KPI time series can be applied to a computer device, and the computer device can be an electronic device such as a notebook computer or a desktop computer.
  • FIG. 1 is a schematic flowchart of a method for detecting multiple KPI time series according to an embodiment of the application.
  • the detection method of the multiple KPI time series includes step S101 to step S105.
  • Step S101 Determine the corresponding multi-segment sub-multiple KPI time sequence to be trained according to the preset segmentation strategy and the multi-element KPI time sequence to be trained, wherein the sub-multiple KPI time sequence to be trained is unlabeled.
  • the preset segmentation strategy includes the sliding window information of T+1, where T can be Is 0. For example, when T is 1, the length of the sliding window is 2 moments.
  • the obtained time series of the multivariate KPI to be trained includes 0-24 moments, based on the length of the sliding window of 2 moments, the 0-24 moments of the multivariate KPI time series to be trained are segmented to obtain 23 sub-multivariate KPI time series. . Or, when T is 0, the length of the sliding window is 1 moment.
  • the obtained multivariate KPI time series to be trained includes 0-24 moments, based on the length of the sliding window is 1 time, the 0-24 moments of the multivariate KPI time series to be trained are segmented to obtain 24 sub-multiple KPI time series . Among them, each segment of the multiple KPI time series corresponds to a pair of values at a time.
  • Step S102 Train a preset model according to the sub-multiple KPI time series to be trained, and generate a corresponding deterministic model.
  • the preset model is trained through the sub-multiple KPI time series to generate a corresponding deterministic model, where the sub-multiple KPI time series is a sub-multiple KPI time series without labels. For example, input the sub-multiple KPI time series into a preset model, extract the value corresponding to each time in the sub-multiple KPI time series through the network layer in the preset model, and obtain the vector features between the values corresponding to each time, based on the The vector features between the corresponding values at each time train the weight parameters of the network layer to generate the corresponding deterministic model.
  • step S102 includes: sub-step S1021 to sub-step S1024.
  • Sub-step S1021 input the sub-multiple KPI time sequence to be trained into the preset model, and extract the values in the sub-multiple KPI time sequence to be trained.
  • the numerical value may be the first numerical value at the first time, or the second numerical value at the first time, or the first numerical value at the second time, or the like.
  • sub-step S1022 the numerical value is coded according to a preset coding program, and the first Gaussian distribution and the first auxiliary variable of the numerical value are obtained.
  • the preset model includes an encoding program.
  • the value is encoded by the encoding program to obtain the first Gaussian distribution after encoding of the value.
  • a diagonal Gaussian vector is randomly selected from the first Gaussian distribution, and the diagonal Gaussian vector is taken as The first auxiliary variable for this value.
  • the preset encoding program includes a first preset neural network model and a first fully connected layer, and the value is encoded according to the preset encoding program to obtain the first Gaussian distribution of the value
  • the first auxiliary variable including: obtaining the first hidden variable of the first preset neural network model according to the value and the first preset neural network model; according to the first preset fully connected layer and the first preset neural network model The variable is hidden to obtain the first Gaussian distribution corresponding to the value; the first auxiliary variable of the value is obtained based on the first Gaussian distribution.
  • the preset model includes an encoding program and a decoding program, where the encoding program includes a first preset neural network GPU and a first fully connected layer h 1 ; the decoding program includes a second preset neural network Network GPU and the second fully connected layer h 2 .
  • the encoding program includes a first preset neural network GPU and a first fully connected layer h 1 ; the decoding program includes a second preset neural network Network GPU and the second fully connected layer h 2 .
  • w e , u e , and b e are variable parameters.
  • the parameter matrix of the first neural network or preset model the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as w e , U e , b e initial parameters. It is the update gate in the GRU, which is used to determine how to combine the newly input independent variable with the previous timing information.
  • the reset gate in the GRU used to decide how much previous timing information to keep.
  • the second preset formula In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as The initial parameters.
  • the third preset formula In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as The initial parameters.
  • the second hidden variable before the first fully connected layer is obtained, and the obtained first hidden variable is connected with the second hidden variable to obtain vector information.
  • the vector information is input into the first fully connected layer, the network layer in the first fully connected layer uses the preset ReLU as the activation function, and the mean value and standard deviation are obtained through the following two linear transformations and softplus activation function transformations.
  • the value obtained in the first hidden variables e t and before the first layer a second fully-connected hidden variable z t-1, the first and second hidden variables hidden variables e t z t-1 connected to afford Hidden variable vector (z t-1 , e t ), through the first linear transformation function of the first fully connected layer Get the first average of this value. in, Is a constant.
  • the second linear transformation function through the first fully connected layer Get the first standard deviation of this value. in, Is a constant, ⁇ is a very small constant, which is set in order to prevent the value from overflowing during calculation.
  • the first Gaussian distribution of the value is constructed through the first mean value and the first standard deviation. For example, when getting the first mean And the first standard deviation , Construct the specific Gaussian distribution at the time corresponding to the value
  • a diagonal Gaussian vector is randomly selected from the first Gaussian distribution as the first auxiliary variable of the value. For example, from this particular Gaussian distribution A diagonal Gaussian vector z t is randomly selected, and the diagonal Gaussian vector z t is used as the first auxiliary variable of the value.
  • Sub-step S1023 Decode the first auxiliary variable according to a preset decoding program to obtain a second Gaussian distribution corresponding to the value.
  • the preset model includes a decoding program.
  • the first auxiliary variable of the value is obtained, the first auxiliary variable is decoded by the preset decoding program to obtain the second Gaussian distribution after decoding the first auxiliary variable.
  • the decoding program includes a second preset neural network model and a second preset fully connected layer
  • the first auxiliary variable is decoded according to the preset decoding program to obtain the corresponding value
  • the second Gaussian distribution includes: obtaining the second hidden variable of the second preset neural network model according to the first auxiliary variable and the second preset neural network model; and obtaining the second hidden variable of the second preset neural network model according to the second preset fully connected layer and the The second hidden variable obtains the second Gaussian distribution corresponding to the value.
  • the decoding program includes a second preset neural network GPU and a second fully connected layer h 2.
  • the second preset neural network is obtained
  • the second hidden variable corresponding to the output of the second preset neural network is obtained by inputting the first auxiliary variable and the first hidden variable into the second preset neural network.
  • the first auxiliary variable z t is obtained, the first hidden variable d t-1 generated before the second preset neural network GPU is obtained, and the z t and d t-1 are input to the second preset neural network GPU through
  • the first preset formula in the first preset neural network GPU Among them, w e , u e , and b e are variable parameters.
  • the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected. It is the update gate in the GRU, which is used to determine how to combine the newly input independent variable with the previous timing information. It is the reset gate in the GRU, used to decide how much previous timing information to keep.
  • the second preset formula In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as The initial parameters. Obtained by the third preset formula in, In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as The initial parameters.
  • the second hidden variable of the second preset neural network When the second hidden variable of the second preset neural network is obtained, the second hidden variable is input into the second fully connected layer, and the preset ReLU is used as the activation function through the network layer in the second fully connected layer.
  • the following two linear transformations and softplus transformations get the mean and standard deviation. For example, through the first linear transformation function of the first fully connected layer Get the second mean of this value. in, Is a constant.
  • the second linear transformation function through the first fully connected layer Get the second standard deviation of this value. in, Is a constant, ⁇ is a very small constant, which is set in order to prevent the value from overflowing during calculation.
  • the second Gaussian distribution of the value is constructed through the second mean value and the second standard deviation. For example, when getting the second mean And the first standard deviation , Construct the specific Gaussian distribution at the time corresponding to the value Where the specific Gaussian distribution It is the second Gaussian distribution.
  • Sub-step S1024 training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution, and generating a corresponding deterministic model.
  • the loss function of the preset model is trained by obtaining the first Gaussian distribution and the second Gaussian distribution of the value to obtain the corresponding deterministic model. For example, by training the first loss function of the coding program with the obtained first Gaussian distribution, and training the second loss function of the decoding function with the obtained second Gaussian distribution, the corresponding deterministic function is obtained.
  • the training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate a corresponding deterministic model includes: obtaining a cumulative distribution in the second Gaussian distribution Probability; according to the cumulative distribution probability and the first preset loss function, obtain the reconstruction probability of the value; according to the first Gaussian distribution, obtain the regular term of the first Gaussian distribution; according to the regular term and the first Two preset loss functions to obtain a regularization term of the value; training the network parameters of the decoding program and the network parameters of the encoding program based on the reconstruction probability and the regularization term, and generate a corresponding deterministic model.
  • the reconstruction probability is calculated by calculating the original data x t in the reconstruction distribution
  • the cumulative distribution probability in The next step is to calculate the regularization term. Since the auxiliary variable is not directly fitted in the encoding process, but z t is obtained by fitting the mean and standard deviation of the auxiliary variable, and then resampling, the entire reconstruction process is affected by noise. Noise intensity, which is the standard deviation of the fit It is calculated by a neural network, so in order to better reconstruct the original data during the whole training process, the standard deviation will be compressed as close to 0 as possible. Once the standard deviation is close to 0, the randomness of sampling disappears, and only a fixed sample will be obtained, namely In this case, the model is equivalent to an ordinary autoencoder.
  • x t obeys the standard normal distribution, which avoids noise (ie ) Is compressed to zero, so it is guaranteed that the model can generate new samples that are different from the original data.
  • N(0,I) of the multivariate independent normal distribution and the standard normal distribution as this regular term and add it to the loss function.
  • N is the number of z t.
  • the network parameters of the encoder and decoder are continuously trained by maximizing the above loss function until the expected accuracy is achieved.
  • the method further includes: recording the reconstruction probability of each value in the sub-multiple KPI time series to be trained, and sorting the reconstruction probability of each value; Determine the target reconstruction probability corresponding to the preset sequence number in the reconstruction probability after sorting, and use the target reconstruction probability as the preset threshold.
  • Step S103 Obtain a multivariate KPI time series to be detected.
  • the time series of multiple KPIs to be detected is obtained through a preset segmentation strategy. For example, when a new value is detected on the multiple KPI time series, the multiple KPI time series are segmented based on the target window information in the preset segmentation strategy, and the multiple KPI time series to be detected containing the new value are obtained. .
  • Step S104 According to the multivariate KPI time series to be detected and the deterministic model, obtain the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected
  • the value to be detected in the multivariate KPI time sequence to be detected is obtained, and the value to be detected is a new value.
  • the value to be detected is input into the first preset neural network of the deterministic model to obtain hidden variables output by the first preset neural network.
  • the hidden variable is input into the first preset fully connected layer, and the first mean value and the first standard deviation obtained by the transformation of the first preset fully connected layer are obtained.
  • the first Gaussian distribution of the value to be detected is obtained.
  • the auxiliary variable of the value to be detected is obtained.
  • the hidden variables output by the second preset neural network are obtained.
  • the second mean value and the second standard deviation obtained by the transformation of the second preset fully connected layer are obtained.
  • a second Gaussian distribution map of the value to be detected is constructed to obtain the reconstruction probability of the value to be detected.
  • Step S105 Determine whether the time corresponding to the value to be detected is an abnormal point according to the preset threshold value and the reconstruction probability value of the value to be detected.
  • the reconstruction probability value of the value to be detected is compared with a preset threshold value to determine whether the time of the value to be detected is an abnormal point; if the reconstruction probability value of the value to be detected is less than the preset threshold value, the value to be detected is determined The moment of is an abnormal point.
  • a preset model is trained through an unlabeled multivariate KPI time series to generate a corresponding deterministic model, and the detected multivariate KPI time series is detected through the deterministic model to determine that the multivariate KPI time series is Whether the time corresponding to the value to be detected is an abnormal point.
  • Enhance the robustness of the model in the encoding process, the model based on determinism and the model based on randomness are merged to fully capture the time dependence between auxiliary variables in the latent space, so the input data can be better represented
  • the distribution is more suitable for the abnormal detection of multiple KPI time series data; the modeling function of the recurrent neural network with random variables is expanded, and the GRU model is integrated to fully capture the time dependence of time series data, which improves the accuracy of the model and greatly improves the accuracy of the model.
  • Reduce the complexity of the model save the CPU occupancy rate in the calculation process, and also reduce the required storage space.
  • This application determines the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained is unlabeled; according to the Training preset models for the sub-multiple KPI time series to be trained to generate the corresponding deterministic model; obtain the multi-element KPI time series to be detected; obtain the multi-element KPI time series to be detected and the deterministic model to obtain the The reconstruction probability value of the to-be-detected value in the detected multivariate KPI time series; according to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  • the KPI time series generates a deterministic model, which enhances the robustness of the model, making the deterministic model more suitable for abnormal detection of multiple KPI time series data, and improves the accuracy of the
  • FIG. 4 is a schematic block diagram of an apparatus for detecting multiple KPI time series according to an embodiment of the present application.
  • the detection device 400 for multiple KPI time series includes: a first determination module 401, a generation module 402, a first acquisition module 403, a second acquisition module 404, and a second determination module 405.
  • the first determination module 401 is configured to obtain the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained is none Label;
  • the generating module 402 is configured to train a preset model according to the sub-multiple KPI time series to be trained, and generate a corresponding deterministic model;
  • the first obtaining module 403 is used to obtain the multivariate KPI time series to be detected
  • the second obtaining module 404 is configured to obtain the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
  • the second determining module 405 is configured to determine whether the time corresponding to the value to be detected is an abnormal point according to the preset threshold value and the reconstruction probability value of the value to be detected.
  • the generating module 402 is also specifically used for:
  • the generating module 402 is also specifically used for:
  • the first auxiliary variable of the value is obtained based on the first Gaussian distribution.
  • the generating module 402 is also specifically used for:
  • the generating module 402 is also specifically used for:
  • the detection device of multiple KPI time series is also used for:
  • the second determining module 405 is also used for:
  • the reconstruction probability value of the to-be-detected value is less than the preset threshold, the time when the to-be-detected value is determined to be an abnormal point.
  • the apparatus provided in the foregoing embodiment may be implemented in the form of a computer program, and the computer program may run on the computer device shown in FIG. 5.
  • FIG. 5 is a schematic block diagram of the structure of a computer device according to an embodiment of the application.
  • the computer device may be a terminal.
  • the computer device includes a processor, a memory, and a network interface connected through a system bus, where the memory may be volatile or non-volatile.
  • the non-volatile storage medium can store an operating system and a computer program.
  • the computer program includes program instructions, and when the program instructions are executed, the processor can execute any method for detecting multiple KPI time series.
  • the processor is used to provide computing and control capabilities and support the operation of the entire computer equipment.
  • the internal memory provides an environment for the operation of the computer program in the non-volatile storage medium.
  • the processor can execute any method for detecting multiple KPI time series.
  • the network interface is used for network communication, such as sending assigned tasks.
  • the network interface is used for network communication, such as sending assigned tasks.
  • FIG. 5 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
  • the specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
  • the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), and application specific integrated circuits (Application Specific Integrated Circuits). Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
  • the processor is used to run a computer program stored in a memory to implement the following steps:
  • the preset threshold and the reconstruction probability value of the to-be-detected value it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  • the processor trains a preset model according to the sub-multiple KPI time series to be trained, and generates a corresponding deterministic model when it is implemented, and is used to implement:
  • the preset coding program of the processor includes a first preset neural network model and a first fully connected layer, and the numerical value is coded according to the preset coding program to obtain the value
  • the first Gaussian distribution and the first auxiliary variable are realized, they are used to realize:
  • the first auxiliary variable of the value is obtained based on the first Gaussian distribution.
  • the decoding program of the processor includes a second preset neural network model and a second preset fully connected layer, and the first auxiliary variable is decoded according to the preset decoding program to obtain the When the second Gaussian distribution corresponding to the above-mentioned value is realized, it is used to realize:
  • the processor trains the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution, and generates a corresponding deterministic model for implementation, it is used to realize:
  • the processor when the processor obtains the reconstruction probability of the numerical value and implements it, it is used to implement:
  • the processor determines whether the time when the value to be detected is an abnormal point realization based on a preset threshold and the reconstruction probability value of the value to be detected, the processor is used to implement:
  • the reconstruction probability value of the to-be-detected value is less than the preset threshold, the time when the to-be-detected value is determined to be an abnormal point.
  • the embodiments of the present application also provide a computer-readable storage medium.
  • the computer-readable storage medium may be volatile or non-volatile.
  • a computer program is stored on the computer-readable storage medium.
  • the program includes program instructions, and the method implemented when the program instructions are executed can refer to the various embodiments of the method for detecting multiple KPI time series in this application.
  • the computer-readable storage medium may be the internal storage unit of the computer device described in the foregoing embodiment, for example, the hard disk or memory of the computer device.
  • the computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk equipped on the computer device, a smart memory card (Smart Media Card, SMC), and a Secure Digital (SD) ) Card, Flash Card, etc.
  • a plug-in hard disk equipped on the computer device such as a smart memory card (Smart Media Card, SMC), and a Secure Digital (SD) ) Card, Flash Card, etc.
  • SD Secure Digital
  • the computer-readable storage medium may mainly include a storage program area and a storage data area, where the storage program area may store an operating system, an application program required by at least one function, etc.; the storage data area may store Data created by the use of nodes, etc.
  • the blockchain referred to in this application is a new application mode of computer technology such as storage of preset models and deterministic models, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Fuzzy Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Neurology (AREA)
  • Image Analysis (AREA)

Abstract

A detection method and apparatus for a multivariate KPI time series, and a computer device and a computer-readable storage medium, which relate to the technical field of artificial intelligence. The method comprises: according to a preset segmentation policy and a multivariate KPI time series to be trained, determining a plurality of corresponding segments of sub-multivariate KPI time series to be trained (S101), wherein the sub-multivariate KPI time series to be trained are untagged; training a preset model according to the sub-multivariate KPI time series to be trained, so as to generate a corresponding deterministic model (S102); acquiring a multivariate KPI time series to be tested (S103); according to the multivariate KPI time series to be tested and the deterministic model, acquiring a reconstruction probability value of a numerical value to be tested in the multivariate KPI time series to be tested (S104); and according to a preset threshold and the reconstruction probability value of the numerical value to be tested, determining whether a moment corresponding to the numerical value to be tested is an anomaly point (S105). The deterministic model is generated by means of the untagged multivariate KPI time series, the robustness of the model is enhanced, and the detection accuracy is improved.

Description

多元KPI时间序列的检测方法、装置、设备及存储介质Detection method, device, equipment and storage medium of multiple KPI time series
本申请要求于2020年9月22日提交中国专利局、申请号为CN2020110040443、名称为“多元KPI时间序列的检测方法、装置、设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on September 22, 2020, with the application number CN2020110040443, and the name "Multivariate KPI time series detection methods, devices, equipment and storage media", all of which are approved The reference is incorporated in this application.
技术领域Technical field
本申请涉及人工智能技术领域,尤其涉及一种多元KPI时间序列的检测方法、装置、计算机设备及计算机可读存储介质。This application relates to the field of artificial intelligence technology, and in particular to a detection method, device, computer equipment, and computer-readable storage medium for multiple KPI time series.
背景技术Background technique
工业设备,例如服务器、航天器、机器人辅助系统、引擎等通常会产生多个时间序列指标,以便可以及时检测到每个设备的行为异常。来自同一设备或同一个实体的多个单变量时间序列形成一个多元KPI时间序列。通常,直接使用多元KPI时间序列在实体级别检测实体异常的效果会优于多个单变量时间序列的异常检测效果。Industrial equipment, such as servers, spacecraft, robotic assistance systems, engines, etc., usually generate multiple time series indicators so that the abnormal behavior of each device can be detected in time. Multiple univariate time series from the same device or the same entity form a multivariate KPI time series. Generally, the effect of directly using multiple KPI time series to detect entity anomalies at the entity level is better than that of multiple univariate time series.
多元KPI时间序列的异常检测主要分为基于确定性模型和基于随机性模型,基于确定性模型的主要是重建“正常”时间序列行为,并使用重建误差进行多传感器异常检测。发明人意识到,确定性模型需要大量的带标签时间序列进行训练;而基于随机性模型获取历史多元KPI时间序列的数值范围,通过该数值范围检测当前多元KPI时间序列是否异常,检测的准确率较低。The anomaly detection of multivariate KPI time series is mainly divided into the deterministic model and the random based model. The deterministic model mainly reconstructs the "normal" time series behavior and uses the reconstruction error to perform multi-sensor anomaly detection. The inventor realizes that the deterministic model requires a large number of labeled time series for training; and based on the randomness model, the value range of the historical multivariate KPI time series is obtained, and the current multivariate KPI time series is detected through the value range whether the current multivariate KPI time series is abnormal, and the accuracy of the detection is Lower.
发明内容Summary of the invention
本申请提供一种多元KPI时间序列的检测方法,所述多元KPI时间序列的检测方法包括以下步骤:This application provides a method for detecting multiple KPI time series. The method for detecting multiple KPI time series includes the following steps:
根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;According to the preset segmentation strategy and the multi-element KPI time series to be trained, determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;Training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model;
获取待检测的多元KPI时间序列;Obtain the time series of multiple KPIs to be tested;
根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;Obtaining the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。According to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
本申请还提供一种多元KPI时间序列的检测装置,所述多元KPI时间序列的检测装置包括:The present application also provides a detection device for multiple KPI time series, and the detection device for multiple KPI time series includes:
得到模块,用于根据预置切分策略和待训练的多元KPI时间序列,得到对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;The obtaining module is used to obtain the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
生成模块,用于根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;A generating module, configured to train a preset model according to the sub-multiple KPI time series to be trained, and generate a corresponding deterministic model;
第一获取模块,用于获取待检测的多元KPI时间序列;The first acquisition module is used to acquire the time series of multiple KPIs to be tested;
第二获取模块,用于根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;The second obtaining module is configured to obtain the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
确定模块,用于根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。The determining module is configured to determine whether the time corresponding to the value to be detected is an abnormal point according to a preset threshold value and the reconstruction probability value of the value to be detected.
本申请还提供一种计算机设备,所述计算机设备包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的计算机程序,其中所述计算机程序被所述处理器执行 时,实现如下步骤:The present application also provides a computer device that includes a processor, a memory, and a computer program stored on the memory and executable by the processor, wherein when the computer program is executed by the processor , To achieve the following steps:
根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;According to the preset segmentation strategy and the multi-element KPI time series to be trained, determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;Training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model;
获取待检测的多元KPI时间序列;Obtain the time series of multiple KPIs to be tested;
根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;Obtaining the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。According to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,其中所述计算机程序被处理器执行时,实现如下步骤:The present application also provides a computer-readable storage medium having a computer program stored on the computer-readable storage medium, and when the computer program is executed by a processor, the following steps are implemented:
根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;According to the preset segmentation strategy and the multi-element KPI time series to be trained, determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;Training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model;
获取待检测的多元KPI时间序列;Obtain the time series of multiple KPIs to be tested;
根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;Obtaining the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。According to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
附图说明Description of the drawings
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following will briefly introduce the drawings used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of the present application. Ordinary technicians can obtain other drawings based on these drawings without creative work.
图1为本申请实施例提供的一种多元KPI时间序列的检测方法的流程示意图;FIG. 1 is a schematic flowchart of a method for detecting multiple KPI time series according to an embodiment of the application;
图2为图1中的多元KPI时间序列的检测方法的子步骤流程示意图;FIG. 2 is a schematic flowchart of sub-steps of the method for detecting multiple KPI time series in FIG. 1;
图3为本申请实施例中确定性模型的解码和编码示意图;FIG. 3 is a schematic diagram of decoding and encoding of a deterministic model in an embodiment of the application;
图4为本申请实施例提供的一种多元KPI时间序列的检测装置的示意性框图;FIG. 4 is a schematic block diagram of a detection device for multiple KPI time series provided by an embodiment of the application;
图5为本申请一实施例涉及的计算机设备的结构示意框图。FIG. 5 is a schematic block diagram of the structure of a computer device related to an embodiment of the application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics, and advantages of the purpose of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, rather than all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
附图中所示的流程图仅是示例说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解、组合或部分合并,因此实际执行的顺序有可能根据实际情况改变。The flowchart shown in the drawings is only an example, and does not necessarily include all contents and operations/steps, nor does it have to be executed in the described order. For example, some operations/steps can also be decomposed, combined or partially combined, so the actual execution order may be changed according to actual conditions.
本申请实施例提供一种多元KPI时间序列的检测方法、装置、计算机设备及计算机可读存储介质。其中,该多元KPI时间序列的检测方法可应用于计算机设备中,该计算机设备可以是笔记本电脑、台式电脑等电子设备。The embodiments of the present application provide a detection method, device, computer equipment, and computer-readable storage medium for multiple KPI time series. Wherein, the detection method of the multiple KPI time series can be applied to a computer device, and the computer device can be an electronic device such as a notebook computer or a desktop computer.
下面结合附图,对本申请的一些实施方式作详细说明。在不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。Hereinafter, some embodiments of the present application will be described in detail with reference to the accompanying drawings. In the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.
请参照图1,图1为本申请的实施例提供的一种多元KPI时间序列的检测方法的流程 示意图。Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a method for detecting multiple KPI time series according to an embodiment of the application.
如图1所示,该多元KPI时间序列的检测方法包括步骤S101至步骤S105。As shown in FIG. 1, the detection method of the multiple KPI time series includes step S101 to step S105.
步骤S101、根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签。Step S101: Determine the corresponding multi-segment sub-multiple KPI time sequence to be trained according to the preset segmentation strategy and the multi-element KPI time sequence to be trained, wherein the sub-multiple KPI time sequence to be trained is unlabeled.
示范性的,获取待训练多元KPI时间序列,通过预置切分策略对获取到的待训练多元KPI时间序列进行数据处理,预置切分策略包括T+1的滑动窗口信息,其中,T可以为0。例如,当T为1时,则滑动窗口的长度为2个时刻。当获取到的待训练多元KPI时间序列包括0-24个时刻,则基于滑动窗口的长度为2个时刻对待训练多元KPI时间序列的0-24个时刻进行切分,得到23段子多元KPI时间序列。或者,当T为0时,则滑动窗口的长度为1个时刻。当获取到的待训练多元KPI时间序列包括0-24个时刻,则基于滑动窗口的长度为1个时刻对待训练多元KPI时间序列的0-24个时刻进行切分,得到24段子多元KPI时间序列。其中,每一段子多元KPI时间序列上时刻对应有对个数值。Exemplarily, obtain the multivariate KPI time series to be trained, and perform data processing on the obtained multivariate KPI time series to be trained through a preset segmentation strategy. The preset segmentation strategy includes the sliding window information of T+1, where T can be Is 0. For example, when T is 1, the length of the sliding window is 2 moments. When the obtained time series of the multivariate KPI to be trained includes 0-24 moments, based on the length of the sliding window of 2 moments, the 0-24 moments of the multivariate KPI time series to be trained are segmented to obtain 23 sub-multivariate KPI time series. . Or, when T is 0, the length of the sliding window is 1 moment. When the obtained multivariate KPI time series to be trained includes 0-24 moments, based on the length of the sliding window is 1 time, the 0-24 moments of the multivariate KPI time series to be trained are segmented to obtain 24 sub-multiple KPI time series . Among them, each segment of the multiple KPI time series corresponds to a pair of values at a time.
步骤S102、根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型。Step S102: Train a preset model according to the sub-multiple KPI time series to be trained, and generate a corresponding deterministic model.
示范性的,通过子多元KPI时间序列对预置模型进行训练,生成对应的确定性模型,其中,该子多元KPI时间序列为不带标签的子多元KPI时间序列。例如,将该子多元KPI时间序列输入预置模型,通过预置模型中的网络层提取该子多元KPI时间序列中各个时刻对应的数值,获取各个时刻对应的数值之间的向量特征,基于该各个时刻对应的数值之间的向量特征对该网络层的权重参数进行训练,生成对应的确定性模型。或者,将该子多元KPI时间序列输入预置模型,为每个输入子多元KPI时间序列的数值样本建立特定分布以获取辅助变量,再根据辅助变量重建原始数值样本,通过最大化损失函数以改进预置模型的网络参数,生成对应的确定性模型。Exemplarily, the preset model is trained through the sub-multiple KPI time series to generate a corresponding deterministic model, where the sub-multiple KPI time series is a sub-multiple KPI time series without labels. For example, input the sub-multiple KPI time series into a preset model, extract the value corresponding to each time in the sub-multiple KPI time series through the network layer in the preset model, and obtain the vector features between the values corresponding to each time, based on the The vector features between the corresponding values at each time train the weight parameters of the network layer to generate the corresponding deterministic model. Or, input the submultivariate KPI time series into a preset model, establish a specific distribution for the numerical samples of each input submultivariate KPI time series to obtain auxiliary variables, and then reconstruct the original numerical samples based on the auxiliary variables, and improve by maximizing the loss function Preset the network parameters of the model to generate the corresponding deterministic model.
在一实施例中,具体地,参照图2,步骤S102包括:子步骤S1021至子步骤S1024。In an embodiment, specifically, referring to FIG. 2, step S102 includes: sub-step S1021 to sub-step S1024.
子步骤S1021、将所述待训练的子多元KPI时间序列输入所述预置模型,提取所述待训练的子多元KPI时间序列中的数值。Sub-step S1021, input the sub-multiple KPI time sequence to be trained into the preset model, and extract the values in the sub-multiple KPI time sequence to be trained.
将该子多元KPI时间序列输入预置模型,通过预置模型的输入层提取该子多元KPI时间序列中数值,其中,该子多元KPI时间序列包括多个时刻,且每个时刻对应有多个数值。该数值可以为第一时刻的第一数值也可以为第一时刻的第二数值,或者,为第二时刻的第一数值等。Input the sub-multiple KPI time series into the preset model, and extract the values in the sub-multiple KPI time series through the input layer of the preset model, where the sub-multiple KPI time series includes multiple moments, and each moment corresponds to multiple Numerical value. The numerical value may be the first numerical value at the first time, or the second numerical value at the first time, or the first numerical value at the second time, or the like.
子步骤S1022、根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量。In sub-step S1022, the numerical value is coded according to a preset coding program, and the first Gaussian distribution and the first auxiliary variable of the numerical value are obtained.
预置模型包括编码程序,通过该编码程序对该数值进行编码,得到该数值编码后的第一高斯分布,从该第一高斯分布中随机抽取一个对角高斯向量,将该对角高斯向量作为该数值的第一辅助变量。The preset model includes an encoding program. The value is encoded by the encoding program to obtain the first Gaussian distribution after encoding of the value. A diagonal Gaussian vector is randomly selected from the first Gaussian distribution, and the diagonal Gaussian vector is taken as The first auxiliary variable for this value.
在一实施例中,所述预置编码程序包括第一预置神经网络模型和第一全连接层,所述根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量,包括:根据所述数值和第一预置神经网络模型,得到所述第一预置神经网络模型的第一隐藏变量;根据第一预置全连接层和所述第一隐藏变量,得到所述数值对应的第一高斯分布;基于所述第一高斯分布得到所述数值的第一辅助变量。In an embodiment, the preset encoding program includes a first preset neural network model and a first fully connected layer, and the value is encoded according to the preset encoding program to obtain the first Gaussian distribution of the value And the first auxiliary variable, including: obtaining the first hidden variable of the first preset neural network model according to the value and the first preset neural network model; according to the first preset fully connected layer and the first preset neural network model The variable is hidden to obtain the first Gaussian distribution corresponding to the value; the first auxiliary variable of the value is obtained based on the first Gaussian distribution.
示范性的,如图3所示,预置模型包括编码程序和解码程序,其中,该编码程序包括第一预置神经网络GPU和第一全连接层h 1;解码程序包括第二预置神经网络GPU和第二全连接层h 2。在获取到子多元KPI时间序列中的数值时,获取第一预置神经网络GPU之前产生的第一隐藏变量,通过将该数值和第一隐藏变量输入该第一预置神经网络GPU中,得到第一预置神经网络GPU输出的第二隐藏变量。例如,获取到子多元KPI时间序列中的数值x t,获取第一预置神经网络GPU之前产生的第一隐藏变量e t-1,将该x t和e t-1输入第一预置 神经网络GPU,通过该第一预置神经网络GPU中第一预置公式
Figure PCTCN2020125003-appb-000001
其中,w e、u e、b e为变量参数,通过获取第一神经网络或预置模型的参数矩阵,该参数矩阵是以0为中心分布的,随机抽取该参数矩阵中的参数作为w e、u e、b e的初始参数。
Figure PCTCN2020125003-appb-000002
是GRU中的更新门,用于决定如何将新输入的自变量与之前的时序信息结合起来。
Figure PCTCN2020125003-appb-000003
是GRU中的重置门,用于决定保留多少此前的时序信息。通过第二预置公式得到
Figure PCTCN2020125003-appb-000004
其中,
Figure PCTCN2020125003-appb-000005
为为变量参数,通过获取第一神经网络或预置模型的参数矩阵,该参数矩阵是以0为中心分布的,随机抽取该参数矩阵中的参数作为
Figure PCTCN2020125003-appb-000006
的初始参数。通过第三预置公式得到
Figure PCTCN2020125003-appb-000007
其中,
Figure PCTCN2020125003-appb-000008
为为变量参数,通过获取第一神经网络或预置模型的参数矩阵,该参数矩阵是以0为中心分布的,随机抽取该参数矩阵中的参数作为
Figure PCTCN2020125003-appb-000009
的初始参数。
Exemplarily, as shown in FIG. 3, the preset model includes an encoding program and a decoding program, where the encoding program includes a first preset neural network GPU and a first fully connected layer h 1 ; the decoding program includes a second preset neural network Network GPU and the second fully connected layer h 2 . When the value in the sub-multiple KPI time series is obtained, the first hidden variable generated before the first preset neural network GPU is obtained, and the value and the first hidden variable are input into the first preset neural network GPU to obtain The second hidden variable output by the first preset neural network GPU. For example, obtain the value x t in the sub-multiple KPI time series , obtain the first hidden variable e t-1 generated before the GPU of the first preset neural network, and input the x t and e t-1 into the first preset neural network Network GPU, through the first preset formula in the first preset neural network GPU
Figure PCTCN2020125003-appb-000001
Among them, w e , u e , and b e are variable parameters. By obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as w e , U e , b e initial parameters.
Figure PCTCN2020125003-appb-000002
It is the update gate in the GRU, which is used to determine how to combine the newly input independent variable with the previous timing information.
Figure PCTCN2020125003-appb-000003
It is the reset gate in the GRU, used to decide how much previous timing information to keep. Obtained by the second preset formula
Figure PCTCN2020125003-appb-000004
in,
Figure PCTCN2020125003-appb-000005
In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as
Figure PCTCN2020125003-appb-000006
The initial parameters. Obtained by the third preset formula
Figure PCTCN2020125003-appb-000007
in,
Figure PCTCN2020125003-appb-000008
In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as
Figure PCTCN2020125003-appb-000009
The initial parameters.
在得到该数值的第一隐藏变量时,获取该第一全连接层之前的第二隐藏变量,将得到的第一隐藏变量与的第二隐藏变量连接,得到向量信息。将该向量信息输入第一全连接层,通过该第一全连接层中的网络层以预置ReLU作为激活函数,分别经过下面两个线性变换和softplus激活函数变换得到均值和标准差。例如,在得到该数值的第一隐藏变量e t以及该第一全连接层之前的第二隐藏变量z t-1,将第一隐藏变量e t与第二隐藏变量z t-1连接,得到隐藏变量向量(z t-1,e t),通过第一全连接层的第一线性变换函数
Figure PCTCN2020125003-appb-000010
得到该数值的第一均值。其中,
Figure PCTCN2020125003-appb-000011
为常数。通过第一全连接层的第二线性变换函数
Figure PCTCN2020125003-appb-000012
得到该数值的第一标准差。其中,
Figure PCTCN2020125003-appb-000013
为常数,∈是一个很小的常数,为了防止计算时数值溢出而设置。
When the first hidden variable of the value is obtained, the second hidden variable before the first fully connected layer is obtained, and the obtained first hidden variable is connected with the second hidden variable to obtain vector information. The vector information is input into the first fully connected layer, the network layer in the first fully connected layer uses the preset ReLU as the activation function, and the mean value and standard deviation are obtained through the following two linear transformations and softplus activation function transformations. For example, the value obtained in the first hidden variables e t and before the first layer a second fully-connected hidden variable z t-1, the first and second hidden variables hidden variables e t z t-1 connected to afford Hidden variable vector (z t-1 , e t ), through the first linear transformation function of the first fully connected layer
Figure PCTCN2020125003-appb-000010
Get the first average of this value. in,
Figure PCTCN2020125003-appb-000011
Is a constant. The second linear transformation function through the first fully connected layer
Figure PCTCN2020125003-appb-000012
Get the first standard deviation of this value. in,
Figure PCTCN2020125003-appb-000013
Is a constant, ε is a very small constant, which is set in order to prevent the value from overflowing during calculation.
在得到该数值的第一均值和第一标准差时,通过第一均值和第一标准差构建该数值的第一高斯分布。例如,在获取第一均值
Figure PCTCN2020125003-appb-000014
和第一标准差
Figure PCTCN2020125003-appb-000015
时,构建该数值对应时刻的特定高斯分布
Figure PCTCN2020125003-appb-000016
在构建该数值的第一高斯分布时,随机从该第一高斯分布抽取一个对角高斯向量作为该数值的第一辅助变量。例如,从该特定高斯分布
Figure PCTCN2020125003-appb-000017
随机抽取到一个对角高斯向量z t,将该对角高斯向量z t作为该数值的第一辅助变量。
When the first mean value and the first standard deviation of the value are obtained, the first Gaussian distribution of the value is constructed through the first mean value and the first standard deviation. For example, when getting the first mean
Figure PCTCN2020125003-appb-000014
And the first standard deviation
Figure PCTCN2020125003-appb-000015
, Construct the specific Gaussian distribution at the time corresponding to the value
Figure PCTCN2020125003-appb-000016
When constructing the first Gaussian distribution of the value, a diagonal Gaussian vector is randomly selected from the first Gaussian distribution as the first auxiliary variable of the value. For example, from this particular Gaussian distribution
Figure PCTCN2020125003-appb-000017
A diagonal Gaussian vector z t is randomly selected, and the diagonal Gaussian vector z t is used as the first auxiliary variable of the value.
子步骤S1023、根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布。Sub-step S1023: Decode the first auxiliary variable according to a preset decoding program to obtain a second Gaussian distribution corresponding to the value.
预置模型包括解码程序,在得到该数值的第一辅助变量时,通过预置解码程序对该第一辅助变量进行解码,得到该第一辅助变量解码后第二高斯分布。The preset model includes a decoding program. When the first auxiliary variable of the value is obtained, the first auxiliary variable is decoded by the preset decoding program to obtain the second Gaussian distribution after decoding the first auxiliary variable.
在一实施例中,所述解码程序包括第二预置神经网络模型和第二预置全连接层,所述根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布,包括:根据所述第一辅助变量和第二预置神经网络模型,得到所述第二预置神经网络模型的第二隐藏变量;根据第二预置全连接层和所述第二隐藏变量,得到所述数值对应的第二高斯分布。In an embodiment, the decoding program includes a second preset neural network model and a second preset fully connected layer, and the first auxiliary variable is decoded according to the preset decoding program to obtain the corresponding value The second Gaussian distribution includes: obtaining the second hidden variable of the second preset neural network model according to the first auxiliary variable and the second preset neural network model; and obtaining the second hidden variable of the second preset neural network model according to the second preset fully connected layer and the The second hidden variable obtains the second Gaussian distribution corresponding to the value.
示范性的,如图3所示,该解码程序包括第二预置神经网络GPU和第二全连接层h 2,在获取到该数值的第一辅助变量时,获取该第二预置神经网络GPU之前的第一隐藏变量,通过将该第一辅助变量和第一隐藏变量输入该第二预置神经网络,得到第二预置神经网络输出对应的第二隐藏变量。例如,获取到第一辅助变量z t,获取第二预置神经网络GPU之前产生的第一隐藏变量d t-1,将该z t和d t-1输入第二预置神经网络GPU,通过该第一预置神经网络GPU中第一预置公式
Figure PCTCN2020125003-appb-000018
其中,w e、 u e、b e为为变量参数,通过获取第一神经网络或预置模型的参数矩阵,该参数矩阵是以0为中心分布的,随机抽取该参数矩阵中的参数。
Figure PCTCN2020125003-appb-000019
是GRU中的更新门,用于决定如何将新输入的自变量与之前的时序信息结合起来。
Figure PCTCN2020125003-appb-000020
是GRU中的重置门,用于决定保留多少此前的时序信息。通过第二预置公式得到
Figure PCTCN2020125003-appb-000021
其中,
Figure PCTCN2020125003-appb-000022
为为变量参数,通过获取第一神经网络或预置模型的参数矩阵,该参数矩阵是以0为中心分布的,随机抽取该参数矩阵中的参数作为
Figure PCTCN2020125003-appb-000023
的初始参数。通过第三预置公式得到
Figure PCTCN2020125003-appb-000024
其中,
Figure PCTCN2020125003-appb-000025
为为变量参数,通过获取第一神经网络或预置模型的参数矩阵,该参数矩阵是以0为中心分布的,随机抽取该参数矩阵中的参数作为
Figure PCTCN2020125003-appb-000026
的初始参数。
Exemplarily, as shown in FIG. 3, the decoding program includes a second preset neural network GPU and a second fully connected layer h 2. When the first auxiliary variable of the value is obtained, the second preset neural network is obtained For the first hidden variable before the GPU, the second hidden variable corresponding to the output of the second preset neural network is obtained by inputting the first auxiliary variable and the first hidden variable into the second preset neural network. For example, the first auxiliary variable z t is obtained, the first hidden variable d t-1 generated before the second preset neural network GPU is obtained, and the z t and d t-1 are input to the second preset neural network GPU through The first preset formula in the first preset neural network GPU
Figure PCTCN2020125003-appb-000018
Among them, w e , u e , and b e are variable parameters. By obtaining the parameter matrix of the first neural network or the preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected.
Figure PCTCN2020125003-appb-000019
It is the update gate in the GRU, which is used to determine how to combine the newly input independent variable with the previous timing information.
Figure PCTCN2020125003-appb-000020
It is the reset gate in the GRU, used to decide how much previous timing information to keep. Obtained by the second preset formula
Figure PCTCN2020125003-appb-000021
in,
Figure PCTCN2020125003-appb-000022
In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as
Figure PCTCN2020125003-appb-000023
The initial parameters. Obtained by the third preset formula
Figure PCTCN2020125003-appb-000024
in,
Figure PCTCN2020125003-appb-000025
In order to be variable parameters, by obtaining the parameter matrix of the first neural network or preset model, the parameter matrix is distributed with 0 as the center, and the parameters in the parameter matrix are randomly selected as
Figure PCTCN2020125003-appb-000026
The initial parameters.
在得到该第二预置神经网路的第二隐藏变量时,将该第二隐藏变量输入第二全连接层,通过第二全连接层中的网络层以预置ReLU作为激活函数,分别经过下面两个线性变换和softplus变换得到均值和标准差。例如,通过第一全连接层的第一线性变换函数
Figure PCTCN2020125003-appb-000027
得到该数值的第二均值。其中,
Figure PCTCN2020125003-appb-000028
为常数。通过第一全连接层的第二线性变换函数
Figure PCTCN2020125003-appb-000029
得到该数值的第二标准差。其中,
Figure PCTCN2020125003-appb-000030
为常数,∈是一个很小的常数,为了防止计算时数值溢出而设置。
When the second hidden variable of the second preset neural network is obtained, the second hidden variable is input into the second fully connected layer, and the preset ReLU is used as the activation function through the network layer in the second fully connected layer. The following two linear transformations and softplus transformations get the mean and standard deviation. For example, through the first linear transformation function of the first fully connected layer
Figure PCTCN2020125003-appb-000027
Get the second mean of this value. in,
Figure PCTCN2020125003-appb-000028
Is a constant. The second linear transformation function through the first fully connected layer
Figure PCTCN2020125003-appb-000029
Get the second standard deviation of this value. in,
Figure PCTCN2020125003-appb-000030
Is a constant, ε is a very small constant, which is set in order to prevent the value from overflowing during calculation.
在得到该数值的第二均值和第二标准差时,通过第二均值和第二标准差构建该数值的第二高斯分布。例如,在获取第二均值
Figure PCTCN2020125003-appb-000031
和第一标准差
Figure PCTCN2020125003-appb-000032
时,构建该数值对应时刻的特定高斯分布
Figure PCTCN2020125003-appb-000033
其中特定高斯分布
Figure PCTCN2020125003-appb-000034
为第二高斯分布。
When the second mean value and the second standard deviation of the value are obtained, the second Gaussian distribution of the value is constructed through the second mean value and the second standard deviation. For example, when getting the second mean
Figure PCTCN2020125003-appb-000031
And the first standard deviation
Figure PCTCN2020125003-appb-000032
, Construct the specific Gaussian distribution at the time corresponding to the value
Figure PCTCN2020125003-appb-000033
Where the specific Gaussian distribution
Figure PCTCN2020125003-appb-000034
It is the second Gaussian distribution.
子步骤S1024、根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型。Sub-step S1024, training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution, and generating a corresponding deterministic model.
通过得到该数值的第一高斯分布和第二高斯分布训练预置模型的损失函数,得到对应的确定性模型。例如,通过得到的第一高斯分布训练编码程序的第一损失函数,通过得到的第二高斯分布训练解码函数的第二损失函数,得到对应的确定性函数。The loss function of the preset model is trained by obtaining the first Gaussian distribution and the second Gaussian distribution of the value to obtain the corresponding deterministic model. For example, by training the first loss function of the coding program with the obtained first Gaussian distribution, and training the second loss function of the decoding function with the obtained second Gaussian distribution, the corresponding deterministic function is obtained.
在一实施例中,所述根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型,包括:获取所述第二高斯分布中的累积分布概率;根据所述累积分布概率和第一预置损失函数,得到所述数值的重建概率;根据所述第一高斯分布,得到所述第一高斯分布的正则项;根据所述正则项和第二预置损失函数,得到所述数值的正则化项;基于所述重建概率和所述正则化项训练所述解码程序的网络参数和所述编码程序的网络参数,生成对应的确定性模型。In an embodiment, the training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate a corresponding deterministic model includes: obtaining a cumulative distribution in the second Gaussian distribution Probability; according to the cumulative distribution probability and the first preset loss function, obtain the reconstruction probability of the value; according to the first Gaussian distribution, obtain the regular term of the first Gaussian distribution; according to the regular term and the first Two preset loss functions to obtain a regularization term of the value; training the network parameters of the decoding program and the network parameters of the encoding program based on the reconstruction probability and the regularization term, and generate a corresponding deterministic model.
示范性的,重建概率通过计算原始数据x t在重建分布
Figure PCTCN2020125003-appb-000035
中的累积分布概率,即
Figure PCTCN2020125003-appb-000036
接下来是计算正则化项。由于编码过程中不是直接对辅助变量进行拟合,而是通过拟合辅助变量的均值和标准差,进而重采样得到z t,因此整个重构过程受到噪声的影响。噪声强度,也就是拟合的标准差
Figure PCTCN2020125003-appb-000037
是通过神经网络计算得到,所以整个训练过程为了能更好的重建原始数据,会尽量压缩其标准差接近于0。一旦标准差接近0,采样的随机性消失,只会得到一个固定的样本,即
Figure PCTCN2020125003-appb-000038
这种情况下,模型等同于一个普通的自编码器。
Exemplary, the reconstruction probability is calculated by calculating the original data x t in the reconstruction distribution
Figure PCTCN2020125003-appb-000035
The cumulative distribution probability in
Figure PCTCN2020125003-appb-000036
The next step is to calculate the regularization term. Since the auxiliary variable is not directly fitted in the encoding process, but z t is obtained by fitting the mean and standard deviation of the auxiliary variable, and then resampling, the entire reconstruction process is affected by noise. Noise intensity, which is the standard deviation of the fit
Figure PCTCN2020125003-appb-000037
It is calculated by a neural network, so in order to better reconstruct the original data during the whole training process, the standard deviation will be compressed as close to 0 as possible. Once the standard deviation is close to 0, the randomness of sampling disappears, and only a fixed sample will be obtained, namely
Figure PCTCN2020125003-appb-000038
In this case, the model is equivalent to an ordinary autoencoder.
例如,z t|x t服从标准的正态分布,就避免了噪声(即
Figure PCTCN2020125003-appb-000039
)被压缩为零的问题,因此保证模型可以生成与原始数据不同的新样本。
For example, z t |x t obeys the standard normal distribution, which avoids noise (ie
Figure PCTCN2020125003-appb-000039
) Is compressed to zero, so it is guaranteed that the model can generate new samples that are different from the original data.
因此,我们以多元独立正态分布与标准正态分布的KL散度KL[N(u,σ 2)]||N(0,I)作为 这个正则项加入损失函数中。
Figure PCTCN2020125003-appb-000040
其中,N是z t的个数。
Therefore, we use the KL divergence KL[N(u,σ 2 )]||N(0,I) of the multivariate independent normal distribution and the standard normal distribution as this regular term and add it to the loss function.
Figure PCTCN2020125003-appb-000040
Among them, N is the number of z t.
将上述两个损失函数进行合并,得到总损失函数:
Figure PCTCN2020125003-appb-000041
通过最大化上述损失函数来不断训练编码器及解码器的网络参数,直至达到预期精度。
Combine the above two loss functions to get the total loss function:
Figure PCTCN2020125003-appb-000041
The network parameters of the encoder and decoder are continuously trained by maximizing the above loss function until the expected accuracy is achieved.
在一实施例中,所述得到所述数值的重建概率之后,还包括:记录所述待训练的子多元KPI时间序列中各个数值的重建概率,并将各个所述数值的重建概率进行排序;确定排序后所述重建概率中预置序号对应的目标重建概率,并将所述目标重建概率作为预置阈值。In an embodiment, after obtaining the reconstruction probability of the value, the method further includes: recording the reconstruction probability of each value in the sub-multiple KPI time series to be trained, and sorting the reconstruction probability of each value; Determine the target reconstruction probability corresponding to the preset sequence number in the reconstruction probability after sorting, and use the target reconstruction probability as the preset threshold.
示范性的,记录预置模型训练中最后一次计算L recon的数据,我们得到了每个原始数据对应的重建概率,记作S i=log[p(x i|z i)],其中,i=1至N,合并为一个包含N个元素的集合S={S 1,S 2...S N}。重建概率S i越高意味着输入x i服从时间序列的正常模式,因此可以高度可信地对其进行重构。概率越小,说明能重构原始数据的可能性越小,因此异常的可能性就越大。由于异常数值占比很小,根据实际经验,我们选取所有S i数据的下5%分位数作为阈值。 Exemplarily, recording the data of the last calculation of L recon in the preset model training, we get the reconstruction probability corresponding to each original data, denoted as S i =log[p(x i |z i )], where i =1 to N, combined into a set S={S 1 , S 2 ... S N } containing N elements. Reconstruction means higher probability S i x i input time series subject to the normal mode, it can be reconstructed highly reliable manner. The smaller the probability, the smaller the possibility of reconstructing the original data, and therefore the greater the possibility of abnormality. A very small proportion due to abnormal values, based on practical experience, we selected 5% quantile S i of all data as a threshold.
步骤S103、获取待检测的多元KPI时间序列。Step S103: Obtain a multivariate KPI time series to be detected.
示范性的,通过预置切分策略获取待检测多元KPI时间序列。例如,检测到多元KPI时间序列上出现新的数值时,基于该预置切分策略中的目标窗口信息对该多元KPI时间序列进行切分,获取包含出现新的数值的待检测多元KPI时间序列。Exemplarily, the time series of multiple KPIs to be detected is obtained through a preset segmentation strategy. For example, when a new value is detected on the multiple KPI time series, the multiple KPI time series are segmented based on the target window information in the preset segmentation strategy, and the multiple KPI time series to be detected containing the new value are obtained. .
步骤S104、根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值Step S104: According to the multivariate KPI time series to be detected and the deterministic model, obtain the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected
示范性的,将该待检测多元KPI时间序列输入该确定性模型的,获取待待检测多元KPI时间序列中的待检测数值,该待检测数值为出现新的数值。将该待检测数值输入该该确定性模型的第一预置神经网络,得到该第一预置神经网络输出的隐藏变量。将该隐藏变量输入第一预置全连接层,获取第一预置全连接层变换得到的第一均值和第一标准差。根据第一均值和第一标准差,得到该待检测数值的第一高斯分布。基于该第一高斯分布,得到该待检测数值的辅助变量。基于第二预置神经网络和辅助变量,得到该第二预置神经网络输出的隐藏变量。基于该第二预置神经网络输出的隐藏变量和第二预置全连接层,得到第二预置全连接层变换得到的第二均值和第二标准差。基于的第二均值和第二标准差,构建该待检测数值的第二高斯分布图,得到该待检测数值的重建概率。Exemplarily, when the multivariate KPI time sequence to be detected is input into the deterministic model, the value to be detected in the multivariate KPI time sequence to be detected is obtained, and the value to be detected is a new value. The value to be detected is input into the first preset neural network of the deterministic model to obtain hidden variables output by the first preset neural network. The hidden variable is input into the first preset fully connected layer, and the first mean value and the first standard deviation obtained by the transformation of the first preset fully connected layer are obtained. According to the first mean value and the first standard deviation, the first Gaussian distribution of the value to be detected is obtained. Based on the first Gaussian distribution, the auxiliary variable of the value to be detected is obtained. Based on the second preset neural network and auxiliary variables, the hidden variables output by the second preset neural network are obtained. Based on the hidden variables output by the second preset neural network and the second preset fully connected layer, the second mean value and the second standard deviation obtained by the transformation of the second preset fully connected layer are obtained. Based on the second mean value and the second standard deviation, a second Gaussian distribution map of the value to be detected is constructed to obtain the reconstruction probability of the value to be detected.
步骤S105、根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。Step S105: Determine whether the time corresponding to the value to be detected is an abnormal point according to the preset threshold value and the reconstruction probability value of the value to be detected.
将该待检测数值的重建概率值与预置阈值进行比对,确定该待检测数值的时刻是否为异常点;若该待检测数值的重建概率值小于该预置阈值,则确定该待检测数值的时刻为异常点。The reconstruction probability value of the value to be detected is compared with a preset threshold value to determine whether the time of the value to be detected is an abnormal point; if the reconstruction probability value of the value to be detected is less than the preset threshold value, the value to be detected is determined The moment of is an abnormal point.
在本申请实施例中,通过无标签的多元KPI时间序列训练预置模型,生成对应的确定性模型,通过该确定性模型对该检测的多元KPI时间序列进行检测,确定该多元KPI时间序列中待检测数值对应的时刻是否为异常点。利用贝叶斯网络的知识根据不同的输入数据建立不同的高斯分布,然后从中抽样获取对应潜变量,再根据潜变量建立特定分布,再次抽样得到重建数据,因此可以生成与原始数据不同的变量,增强了模型的鲁棒性;在编码过程中,融合了基于确定性的模型和基于随机性的模型,充分捕获了潜在空间中辅助变量之间的时间依赖性,因此可以更好地表示输入数据的分布,更适用于多元KPI时序数据的 异常检测;扩展了具有随机变量的递归神经网络的建模功能,融合了GRU模型在充分捕捉时间序列数据的时间依赖性,提升了模型精度的同时大大降低模型复杂度,节约了运算过程中的CPU占用率,也减少了所需的存储空间。In the embodiment of the present application, a preset model is trained through an unlabeled multivariate KPI time series to generate a corresponding deterministic model, and the detected multivariate KPI time series is detected through the deterministic model to determine that the multivariate KPI time series is Whether the time corresponding to the value to be detected is an abnormal point. Use the knowledge of Bayesian network to establish different Gaussian distributions according to different input data, and then sample the corresponding latent variables from it, and then establish a specific distribution based on the latent variables, and then sample the reconstructed data again, so variables different from the original data can be generated. Enhance the robustness of the model; in the encoding process, the model based on determinism and the model based on randomness are merged to fully capture the time dependence between auxiliary variables in the latent space, so the input data can be better represented The distribution is more suitable for the abnormal detection of multiple KPI time series data; the modeling function of the recurrent neural network with random variables is expanded, and the GRU model is integrated to fully capture the time dependence of time series data, which improves the accuracy of the model and greatly improves the accuracy of the model. Reduce the complexity of the model, save the CPU occupancy rate in the calculation process, and also reduce the required storage space.
本申请通过根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;获取待检测的多元KPI时间序列;根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点,实现通过无标签的多元KPI时间序列生成确定性模型,增强了模型的鲁棒性,使得该确定性模型更适用于多元KPI时序数据的异常检测,并提高了检测的准确率。This application determines the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained is unlabeled; according to the Training preset models for the sub-multiple KPI time series to be trained to generate the corresponding deterministic model; obtain the multi-element KPI time series to be detected; obtain the multi-element KPI time series to be detected and the deterministic model to obtain the The reconstruction probability value of the to-be-detected value in the detected multivariate KPI time series; according to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point. The KPI time series generates a deterministic model, which enhances the robustness of the model, making the deterministic model more suitable for abnormal detection of multiple KPI time series data, and improves the accuracy of the detection.
请参照图4,图4为本申请实施例提供的一种多元KPI时间序列的检测装置的示意性框图。Please refer to FIG. 4. FIG. 4 is a schematic block diagram of an apparatus for detecting multiple KPI time series according to an embodiment of the present application.
如图4所示,该多元KPI时间序列的检测装置400,包括:第一确定模块401、生成模块402、第一获取模块403、第二获取模块404、第二确定模块405。As shown in FIG. 4, the detection device 400 for multiple KPI time series includes: a first determination module 401, a generation module 402, a first acquisition module 403, a second acquisition module 404, and a second determination module 405.
第一确定模块401,用于根据预置切分策略和待训练的多元KPI时间序列,得到对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;The first determination module 401 is configured to obtain the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained is none Label;
生成模块402,用于根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;The generating module 402 is configured to train a preset model according to the sub-multiple KPI time series to be trained, and generate a corresponding deterministic model;
第一获取模块403,用于获取待检测的多元KPI时间序列;The first obtaining module 403 is used to obtain the multivariate KPI time series to be detected;
第二获取模块404,用于根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;The second obtaining module 404 is configured to obtain the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
第二确定模块405,用于根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。The second determining module 405 is configured to determine whether the time corresponding to the value to be detected is an abnormal point according to the preset threshold value and the reconstruction probability value of the value to be detected.
其中,生成模块402具体还用于:Among them, the generating module 402 is also specifically used for:
将所述待训练的子多元KPI时间序列输入所述预置模型,提取所述待训练的子多元KPI时间序列中的数值;Input the sub-multiple KPI time series to be trained into the preset model, and extract the values in the sub-multiple KPI time series to be trained;
根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量;Encode the numerical value according to a preset coding program, and obtain the first Gaussian distribution and the first auxiliary variable of the numerical value;
根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布;Decode the first auxiliary variable according to a preset decoding program to obtain a second Gaussian distribution corresponding to the value;
根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型。Training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate a corresponding deterministic model.
其中,生成模块402具体还用于:Among them, the generating module 402 is also specifically used for:
根据所述数值和第一预置神经网络模型,得到所述第一预置神经网络模型的第一隐藏变量;Obtaining the first hidden variable of the first preset neural network model according to the value and the first preset neural network model;
根据第一预置全连接层和所述第一隐藏变量,得到所述数值对应的第一高斯分布;Obtaining the first Gaussian distribution corresponding to the value according to the first preset fully connected layer and the first hidden variable;
基于所述第一高斯分布得到所述数值的第一辅助变量。The first auxiliary variable of the value is obtained based on the first Gaussian distribution.
其中,生成模块402具体还用于:Among them, the generating module 402 is also specifically used for:
根据所述第一辅助变量和第二预置神经网络模型,得到所述第二预置神经网络模型的第二隐藏变量;Obtaining the second hidden variable of the second preset neural network model according to the first auxiliary variable and the second preset neural network model;
根据第二预置全连接层和所述第二隐藏变量,得到所述数值对应的第二高斯分布。According to the second preset fully connected layer and the second hidden variable, a second Gaussian distribution corresponding to the value is obtained.
其中,生成模块402具体还用于:Among them, the generating module 402 is also specifically used for:
获取所述第二高斯分布中的累积分布概率;Acquiring the cumulative distribution probability in the second Gaussian distribution;
根据所述累积分布概率和第一预置损失函数,得到所述数值的重建概率;Obtaining the numerical reconstruction probability according to the cumulative distribution probability and the first preset loss function;
根据所述第一高斯分布,得到所述第一高斯分布的正则项;Obtaining a regular term of the first Gaussian distribution according to the first Gaussian distribution;
根据所述正则项和第二预置损失函数,得到所述数值的正则化项;Obtaining a regularization term of the numerical value according to the regularization term and the second preset loss function;
基于所述重建概率和所述正则化项训练所述解码程序的网络参数和所述编码程序的网络参数,生成对应的确定性模型。Training the network parameters of the decoding program and the network parameters of the encoding program based on the reconstruction probability and the regularization term, and generate a corresponding deterministic model.
其中,多元KPI时间序列的检测装置还用于:Among them, the detection device of multiple KPI time series is also used for:
记录所述待训练的子多元KPI时间序列中各个数值的重建概率,并将各个所述数值的重建概率进行排序;Recording the reconstruction probability of each value in the sub-multiple KPI time series to be trained, and sorting the reconstruction probability of each value;
确定排序后所述重建概率中预置序号对应的目标重建概率,并将所述目标重建概率作为预置阈值。Determine the target reconstruction probability corresponding to the preset sequence number in the reconstruction probability after sorting, and use the target reconstruction probability as the preset threshold.
其中,第二确定模块405还用于:Wherein, the second determining module 405 is also used for:
将所述待检测数值的重建概率值与预置阈值进行比对,确定所述待检测数值的时刻是否为异常点;Comparing the reconstruction probability value of the to-be-detected value with a preset threshold to determine whether the moment of the to-be-detected value is an abnormal point;
若所述待检测数值的重建概率值小于所述预置阈值,则确定所述待检测数值的时刻为异常点。If the reconstruction probability value of the to-be-detected value is less than the preset threshold, the time when the to-be-detected value is determined to be an abnormal point.
需要说明的是,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的装置和各模块及单元的具体工作过程,可以参考前述多元KPI时间序列的检测方法实施例中的对应过程,在此不再赘述。It should be noted that those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described device and each module and unit can be referred to in the foregoing embodiment of the detection method for multiple KPI time series The corresponding process will not be repeated here.
上述实施例提供的装置可以实现为一种计算机程序的形式,该计算机程序可以在如图5所示的计算机设备上运行。The apparatus provided in the foregoing embodiment may be implemented in the form of a computer program, and the computer program may run on the computer device shown in FIG. 5.
请参阅图5,图5为本申请实施例提供的一种计算机设备的结构示意性框图。该计算机设备可以为终端。Please refer to FIG. 5, which is a schematic block diagram of the structure of a computer device according to an embodiment of the application. The computer device may be a terminal.
如图5所示,该计算机设备包括通过系统总线连接的处理器、存储器和网络接口,其中,存储器可以是易失性的,也可以是非易失性的。As shown in FIG. 5, the computer device includes a processor, a memory, and a network interface connected through a system bus, where the memory may be volatile or non-volatile.
非易失性存储介质可存储操作系统和计算机程序。该计算机程序包括程序指令,该程序指令被执行时,可使得处理器执行任意一种多元KPI时间序列的检测方法。The non-volatile storage medium can store an operating system and a computer program. The computer program includes program instructions, and when the program instructions are executed, the processor can execute any method for detecting multiple KPI time series.
处理器用于提供计算和控制能力,支撑整个计算机设备的运行。The processor is used to provide computing and control capabilities and support the operation of the entire computer equipment.
内存储器为非易失性存储介质中的计算机程序的运行提供环境,该计算机程序被处理器执行时,可使得处理器执行任意一种多元KPI时间序列的检测方法。The internal memory provides an environment for the operation of the computer program in the non-volatile storage medium. When the computer program is executed by the processor, the processor can execute any method for detecting multiple KPI time series.
该网络接口用于进行网络通信,如发送分配的任务等。本领域技术人员可以理解,图5中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface is used for network communication, such as sending assigned tasks. Those skilled in the art can understand that the structure shown in FIG. 5 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. The specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
应当理解的是,处理器可以是中央处理单元(Central Processing Unit,CPU),该处理器还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), and application specific integrated circuits (Application Specific Integrated Circuits). Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc. Among them, the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
其中,在一个实施例中,所述处理器用于运行存储在存储器中的计算机程序,以实现如下步骤:Wherein, in an embodiment, the processor is used to run a computer program stored in a memory to implement the following steps:
根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;Determine the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained is unlabeled;
根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;Training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model;
获取待检测的多元KPI时间序列;Obtain the time series of multiple KPIs to be tested;
根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;Obtaining the reconstruction probability value of the to-be-detected value in the multi-element KPI time series to be detected according to the multi-element KPI time series to be detected and the deterministic model;
根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。According to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
在一个实施例中,所述处理器根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型实现时,用于实现:In an embodiment, the processor trains a preset model according to the sub-multiple KPI time series to be trained, and generates a corresponding deterministic model when it is implemented, and is used to implement:
将所述待训练的子多元KPI时间序列输入所述预置模型,提取所述待训练的子多元KPI时间序列中的数值;Input the sub-multiple KPI time series to be trained into the preset model, and extract the values in the sub-multiple KPI time series to be trained;
根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量;Encode the numerical value according to a preset coding program, and obtain the first Gaussian distribution and the first auxiliary variable of the numerical value;
根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布;Decode the first auxiliary variable according to a preset decoding program to obtain a second Gaussian distribution corresponding to the value;
根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型。Training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate a corresponding deterministic model.
在一个实施例中,所述处理器所述预置编码程序包括第一预置神经网络模型和第一全连接层,所述根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量实现时,用于实现:In an embodiment, the preset coding program of the processor includes a first preset neural network model and a first fully connected layer, and the numerical value is coded according to the preset coding program to obtain the value When the first Gaussian distribution and the first auxiliary variable are realized, they are used to realize:
根据所述数值和第一预置神经网络模型,得到所述第一预置神经网络模型的第一隐藏变量;Obtaining the first hidden variable of the first preset neural network model according to the value and the first preset neural network model;
根据第一预置全连接层和所述第一隐藏变量,得到所述数值对应的第一高斯分布;Obtaining the first Gaussian distribution corresponding to the value according to the first preset fully connected layer and the first hidden variable;
基于所述第一高斯分布得到所述数值的第一辅助变量。The first auxiliary variable of the value is obtained based on the first Gaussian distribution.
在一个实施例中,所述处理器所述解码程序包括第二预置神经网络模型和第二预置全连接层,所述根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布实现时,用于实现:In an embodiment, the decoding program of the processor includes a second preset neural network model and a second preset fully connected layer, and the first auxiliary variable is decoded according to the preset decoding program to obtain the When the second Gaussian distribution corresponding to the above-mentioned value is realized, it is used to realize:
根据所述第一辅助变量和第二预置神经网络模型,得到所述第二预置神经网络模型的第二隐藏变量;Obtaining the second hidden variable of the second preset neural network model according to the first auxiliary variable and the second preset neural network model;
根据第二预置全连接层和所述第二隐藏变量,得到所述数值对应的第二高斯分布。According to the second preset fully connected layer and the second hidden variable, a second Gaussian distribution corresponding to the value is obtained.
在一个实施例中,所述处理器所述根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型实现时,用于实现:In an embodiment, when the processor trains the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution, and generates a corresponding deterministic model for implementation, it is used to realize:
获取所述第二高斯分布中的累积分布概率;Acquiring the cumulative distribution probability in the second Gaussian distribution;
根据所述累积分布概率和第一预置损失函数,得到所述数值的重建概率;Obtaining the numerical reconstruction probability according to the cumulative distribution probability and the first preset loss function;
根据所述第一高斯分布,得到所述第一高斯分布的正则项;Obtaining a regular term of the first Gaussian distribution according to the first Gaussian distribution;
根据所述正则项和第二预置损失函数,得到所述数值的正则化项;Obtaining a regularization term of the numerical value according to the regularization term and the second preset loss function;
基于所述重建概率和所述正则化项训练所述解码程序的网络参数和所述编码程序的网络参数,生成对应的确定性模型。Training the network parameters of the decoding program and the network parameters of the encoding program based on the reconstruction probability and the regularization term, and generate a corresponding deterministic model.
在一个实施例中,所述处理器所述得到所述数值的重建概率之后实现时,用于实现:In an embodiment, when the processor obtains the reconstruction probability of the numerical value and implements it, it is used to implement:
记录所述待训练的子多元KPI时间序列中各个数值的重建概率,并将各个所述数值的重建概率进行排序;Recording the reconstruction probability of each value in the sub-multiple KPI time series to be trained, and sorting the reconstruction probability of each value;
确定排序后所述重建概率中预置序号对应的目标重建概率,并将所述目标重建概率作为预置阈值。Determine the target reconstruction probability corresponding to the preset sequence number in the reconstruction probability after sorting, and use the target reconstruction probability as the preset threshold.
在一个实施例中,所述处理器所述根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值的时刻是否为异常点实现时,用于实现:In an embodiment, when the processor determines whether the time when the value to be detected is an abnormal point realization based on a preset threshold and the reconstruction probability value of the value to be detected, the processor is used to implement:
将所述待检测数值的重建概率值与预置阈值进行比对,确定所述待检测数值的时刻是否为异常点;Comparing the reconstruction probability value of the to-be-detected value with a preset threshold to determine whether the moment of the to-be-detected value is an abnormal point;
若所述待检测数值的重建概率值小于所述预置阈值,则确定所述待检测数值的时刻为 异常点。If the reconstruction probability value of the to-be-detected value is less than the preset threshold, the time when the to-be-detected value is determined to be an abnormal point.
本申请实施例还提供一种计算机可读存储介质,计算机可读存储介质可以是易失性的,也可以是非易失性的,所述计算机可读存储介质上存储有计算机程序,所述计算机程序中包括程序指令,所述程序指令被执行时所实现的方法可参照本申请多元KPI时间序列的检测方法的各个实施例。The embodiments of the present application also provide a computer-readable storage medium. The computer-readable storage medium may be volatile or non-volatile. A computer program is stored on the computer-readable storage medium. The program includes program instructions, and the method implemented when the program instructions are executed can refer to the various embodiments of the method for detecting multiple KPI time series in this application.
其中,所述计算机可读存储介质可以是前述实施例所述的计算机设备的内部存储单元,例如所述计算机设备的硬盘或内存。所述计算机可读存储介质也可以是所述计算机设备的外部存储设备,例如所述计算机设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。The computer-readable storage medium may be the internal storage unit of the computer device described in the foregoing embodiment, for example, the hard disk or memory of the computer device. The computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk equipped on the computer device, a smart memory card (Smart Media Card, SMC), and a Secure Digital (SD) ) Card, Flash Card, etc.
进一步地,所述计算机可读存储介质可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序等;存储数据区可存储根据区块链节点的使用所创建的数据等。Further, the computer-readable storage medium may mainly include a storage program area and a storage data area, where the storage program area may store an operating system, an application program required by at least one function, etc.; the storage data area may store Data created by the use of nodes, etc.
本申请所指区块链是预置模型和确定性模型的存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The blockchain referred to in this application is a new application mode of computer technology such as storage of preset models and deterministic models, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It should be noted that in this article, the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or system including a series of elements not only includes those elements, It also includes other elements that are not explicitly listed, or elements inherent to the process, method, article, or system. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, article, or system that includes the element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The serial numbers of the foregoing embodiments of the present application are for description only, and do not represent the superiority or inferiority of the embodiments. The above are only specific implementations of this application, but the protection scope of this application is not limited to this. Anyone familiar with the technical field can easily think of various equivalents within the technical scope disclosed in this application. Modifications or replacements, these modifications or replacements shall be covered within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种多元KPI时间序列的检测方法,其中,包括:A detection method for multiple KPI time series, including:
    根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;According to the preset segmentation strategy and the multi-element KPI time series to be trained, determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
    根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;Training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model;
    获取待检测的多元KPI时间序列;Obtain the time series of multiple KPIs to be tested;
    根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;Obtaining the reconstruction probability value of the to-be-detected value in the multi-element KPI time series to be detected according to the multi-element KPI time series to be detected and the deterministic model;
    根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。According to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  2. 如权利要求1所述的多元KPI时间序列的检测方法,其中,所述根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型,包括:5. The method for detecting multiple KPI time series according to claim 1, wherein the training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model comprises:
    将所述待训练的子多元KPI时间序列输入所述预置模型,提取所述待训练的子多元KPI时间序列中的数值;Input the sub-multiple KPI time series to be trained into the preset model, and extract the values in the sub-multiple KPI time series to be trained;
    根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量;Encode the numerical value according to a preset coding program, and obtain the first Gaussian distribution and the first auxiliary variable of the numerical value;
    根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布;Decode the first auxiliary variable according to a preset decoding program to obtain a second Gaussian distribution corresponding to the value;
    根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型。Training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate a corresponding deterministic model.
  3. 如权利要求2所述的多元KPI时间序列的检测方法,其中,所述预置编码程序包括第一预置神经网络模型和第一全连接层,所述根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量,包括:The method for detecting multiple KPI time series according to claim 2, wherein the preset coding program includes a first preset neural network model and a first fully connected layer, and the numerical value is performed according to the preset coding program. Encoding to obtain the first Gaussian distribution and the first auxiliary variable of the value includes:
    根据所述数值和第一预置神经网络模型,得到所述第一预置神经网络模型的第一隐藏变量;Obtaining the first hidden variable of the first preset neural network model according to the value and the first preset neural network model;
    根据第一预置全连接层和所述第一隐藏变量,得到所述数值对应的第一高斯分布;Obtaining the first Gaussian distribution corresponding to the value according to the first preset fully connected layer and the first hidden variable;
    基于所述第一高斯分布得到所述数值的第一辅助变量。The first auxiliary variable of the value is obtained based on the first Gaussian distribution.
  4. 如权利要求2所述的多元KPI时间序列的检测方法,其中,所述解码程序包括第二预置神经网络模型和第二预置全连接层,所述根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布,包括:The method for detecting multiple KPI time series according to claim 2, wherein the decoding program includes a second preset neural network model and a second preset fully connected layer, and the first The auxiliary variable is decoded to obtain the second Gaussian distribution corresponding to the value, including:
    根据所述第一辅助变量和第二预置神经网络模型,得到所述第二预置神经网络模型的第二隐藏变量;Obtaining the second hidden variable of the second preset neural network model according to the first auxiliary variable and the second preset neural network model;
    根据第二预置全连接层和所述第二隐藏变量,得到所述数值对应的第二高斯分布。According to the second preset fully connected layer and the second hidden variable, a second Gaussian distribution corresponding to the value is obtained.
  5. 如权利要求2所述的多元KPI时间序列的检测方法,其中,所述根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型,包括:The method for detecting a multivariate KPI time series according to claim 2, wherein the training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate the corresponding deterministic model comprises:
    获取所述第二高斯分布中的累积分布概率;Acquiring the cumulative distribution probability in the second Gaussian distribution;
    根据所述累积分布概率和第一预置损失函数,得到所述数值的重建概率;Obtaining the numerical reconstruction probability according to the cumulative distribution probability and the first preset loss function;
    根据所述第一高斯分布,得到所述第一高斯分布的正则项;Obtaining a regular term of the first Gaussian distribution according to the first Gaussian distribution;
    根据所述正则项和第二预置损失函数,得到所述数值的正则化项;Obtaining a regularization term of the numerical value according to the regularization term and the second preset loss function;
    基于所述重建概率和所述正则化项训练所述解码程序的网络参数和所述编码程序的网络参数,生成对应的确定性模型。Training the network parameters of the decoding program and the network parameters of the encoding program based on the reconstruction probability and the regularization term, and generate a corresponding deterministic model.
  6. 如权利要求5所述的多元KPI时间序列的检测方法,其中,所述得到所述数值的重建概率之后,还包括:The method for detecting multiple KPI time series according to claim 5, wherein after said obtaining the reconstruction probability of the value, it further comprises:
    记录所述待训练的子多元KPI时间序列中各个数值的重建概率,并将各个所述数值的重建概率进行排序;Recording the reconstruction probability of each value in the sub-multiple KPI time series to be trained, and sorting the reconstruction probability of each value;
    确定排序后所述重建概率中预置序号对应的目标重建概率,并将所述目标重建概率作为预置阈值。Determine the target reconstruction probability corresponding to the preset sequence number in the reconstruction probability after sorting, and use the target reconstruction probability as the preset threshold.
  7. 如权利要求1所述的多元KPI时间序列的检测方法,其中,所述根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值的时刻是否为异常点,包括:The method for detecting a multivariate KPI time series according to claim 1, wherein the determining whether the time at which the value to be detected is an abnormal point according to a preset threshold and the reconstruction probability value of the value to be detected comprises:
    将所述待检测数值的重建概率值与预置阈值进行比对,确定所述待检测数值的时刻是否为异常点;Comparing the reconstruction probability value of the to-be-detected value with a preset threshold to determine whether the moment of the to-be-detected value is an abnormal point;
    若所述待检测数值的重建概率值小于所述预置阈值,则确定所述待检测数值的时刻为异常点。If the reconstruction probability value of the to-be-detected value is less than the preset threshold, the time when the to-be-detected value is determined to be an abnormal point.
  8. 一种多元KPI时间序列的检测装置,其中,包括:A detection device for multiple KPI time series, which includes:
    第一确定模块,用于根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;The first determination module is used to determine the corresponding multi-segment sub-multiple KPI time series to be trained according to the preset segmentation strategy and the multi-element KPI time series to be trained, wherein the sub-multiple KPI time series to be trained is unlabeled ;
    生成模块,用于根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;A generating module, configured to train a preset model according to the sub-multiple KPI time series to be trained, and generate a corresponding deterministic model;
    第一获取模块,用于获取待检测的多元KPI时间序列;The first acquisition module is used to acquire the time series of multiple KPIs to be tested;
    第二获取模块,用于根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;The second obtaining module is configured to obtain the reconstruction probability value of the value to be detected in the multivariate KPI time series to be detected according to the multivariate KPI time series to be detected and the deterministic model;
    第二确定模块,用于根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。The second determining module is configured to determine whether the time corresponding to the value to be detected is an abnormal point according to the preset threshold value and the reconstruction probability value of the value to be detected.
  9. 一种计算机设备,其中,所述计算机设备包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的计算机程序,其中所述计算机程序被所述处理器执行时,实现如下步骤:A computer device, wherein the computer device includes a processor, a memory, and a computer program stored on the memory and executable by the processor, wherein when the computer program is executed by the processor, the The following steps:
    根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;According to the preset segmentation strategy and the multi-element KPI time series to be trained, determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
    根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;Training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model;
    获取待检测的多元KPI时间序列;Obtain the time series of multiple KPIs to be tested;
    根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;Obtaining the reconstruction probability value of the to-be-detected value in the multi-element KPI time series to be detected according to the multi-element KPI time series to be detected and the deterministic model;
    根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。According to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  10. 如权利要求9所述的计算机设备,其中,所述根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型,包括:9. The computer device according to claim 9, wherein the training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model comprises:
    将所述待训练的子多元KPI时间序列输入所述预置模型,提取所述待训练的子多元KPI时间序列中的数值;Input the sub-multiple KPI time series to be trained into the preset model, and extract the values in the sub-multiple KPI time series to be trained;
    根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量;Encode the numerical value according to a preset coding program, and obtain the first Gaussian distribution and the first auxiliary variable of the numerical value;
    根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布;Decode the first auxiliary variable according to a preset decoding program to obtain a second Gaussian distribution corresponding to the value;
    根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型。Training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate a corresponding deterministic model.
  11. 如权利要求10所述的计算机设备,其中,所述预置编码程序包括第一预置神经网络模型和第一全连接层,所述根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量,包括:The computer device according to claim 10, wherein the preset coding program includes a first preset neural network model and a first fully connected layer, and the numerical value is coded according to the preset coding program to obtain the The first Gaussian distribution of the value and the first auxiliary variable include:
    根据所述数值和第一预置神经网络模型,得到所述第一预置神经网络模型的第一隐藏变量;Obtaining the first hidden variable of the first preset neural network model according to the value and the first preset neural network model;
    根据第一预置全连接层和所述第一隐藏变量,得到所述数值对应的第一高斯分布;Obtaining the first Gaussian distribution corresponding to the value according to the first preset fully connected layer and the first hidden variable;
    基于所述第一高斯分布得到所述数值的第一辅助变量。The first auxiliary variable of the value is obtained based on the first Gaussian distribution.
  12. 如权利要求10所述的计算机设备,其中,所述解码程序包括第二预置神经网络模型和第二预置全连接层,所述根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布,包括:10. The computer device of claim 10, wherein the decoding program includes a second preset neural network model and a second preset fully connected layer, and the first auxiliary variable is decoded according to the preset decoding program, Obtaining the second Gaussian distribution corresponding to the value includes:
    根据所述第一辅助变量和第二预置神经网络模型,得到所述第二预置神经网络模型的第二隐藏变量;Obtaining the second hidden variable of the second preset neural network model according to the first auxiliary variable and the second preset neural network model;
    根据第二预置全连接层和所述第二隐藏变量,得到所述数值对应的第二高斯分布。According to the second preset fully connected layer and the second hidden variable, a second Gaussian distribution corresponding to the value is obtained.
  13. 如权利要求10所述的计算机设备,其中,所述根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型,包括:10. The computer device of claim 10, wherein the training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate the corresponding deterministic model comprises:
    获取所述第二高斯分布中的累积分布概率;Acquiring the cumulative distribution probability in the second Gaussian distribution;
    根据所述累积分布概率和第一预置损失函数,得到所述数值的重建概率;Obtaining the numerical reconstruction probability according to the cumulative distribution probability and the first preset loss function;
    根据所述第一高斯分布,得到所述第一高斯分布的正则项;Obtaining a regular term of the first Gaussian distribution according to the first Gaussian distribution;
    根据所述正则项和第二预置损失函数,得到所述数值的正则化项;Obtaining a regularization term of the numerical value according to the regularization term and the second preset loss function;
    基于所述重建概率和所述正则化项训练所述解码程序的网络参数和所述编码程序的网络参数,生成对应的确定性模型。Training the network parameters of the decoding program and the network parameters of the encoding program based on the reconstruction probability and the regularization term, and generate a corresponding deterministic model.
  14. 如权利要求13所述的计算机设备,其中,所述得到所述数值的重建概率之后,还实现以下步骤:The computer device according to claim 13, wherein after said obtaining the reconstruction probability of the value, the following steps are further implemented:
    记录所述待训练的子多元KPI时间序列中各个数值的重建概率,并将各个所述数值的重建概率进行排序;Recording the reconstruction probability of each value in the sub-multiple KPI time series to be trained, and sorting the reconstruction probability of each value;
    确定排序后所述重建概率中预置序号对应的目标重建概率,并将所述目标重建概率作为预置阈值。Determine the target reconstruction probability corresponding to the preset sequence number in the reconstruction probability after sorting, and use the target reconstruction probability as the preset threshold.
  15. 如权利要求9所述的计算机设备,其中,所述根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值的时刻是否为异常点,包括:9. The computer device according to claim 9, wherein the determining whether the time at which the value to be detected is an abnormal point according to a preset threshold and the reconstruction probability value of the value to be detected comprises:
    将所述待检测数值的重建概率值与预置阈值进行比对,确定所述待检测数值的时刻是否为异常点;Comparing the reconstruction probability value of the to-be-detected value with a preset threshold to determine whether the moment of the to-be-detected value is an abnormal point;
    若所述待检测数值的重建概率值小于所述预置阈值,则确定所述待检测数值的时刻为异常点。If the reconstruction probability value of the to-be-detected value is less than the preset threshold, the time when the to-be-detected value is determined to be an abnormal point.
  16. 一种计算机可读存储介质,其中,所述计算机可读存储介质上存储有计算机程序,其中所述计算机程序被处理器执行时,实现如下步骤:A computer-readable storage medium, wherein a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the following steps are implemented:
    根据预置切分策略和待训练的多元KPI时间序列,确定对应的待训练的多段子多元KPI时间序列,其中,所述待训练的子多元KPI时间序列为无标签;According to the preset segmentation strategy and the multi-element KPI time series to be trained, determine the corresponding multi-segment sub-multiple KPI time series to be trained, wherein the sub-multiple KPI time series to be trained are unlabeled;
    根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型;Training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model;
    获取待检测的多元KPI时间序列;Obtain the time series of multiple KPIs to be tested;
    根据所述待检测的多元KPI时间序列和所述确定性模型,获取所述待检测的多元KPI时间序列中待检测数值的重建概率值;Obtaining the reconstruction probability value of the to-be-detected value in the multi-element KPI time series to be detected according to the multi-element KPI time series to be detected and the deterministic model;
    根据预置阈值和所述待检测数值的重建概率值,确定所述待检测数值对应的时刻是否为异常点。According to the preset threshold and the reconstruction probability value of the to-be-detected value, it is determined whether the time corresponding to the to-be-detected value is an abnormal point.
  17. 如权利要求16所述的计算机可读存储介质,其中,所述根据所述待训练的子多元KPI时间序列训练预置模型,生成对应的确定性模型,包括:15. The computer-readable storage medium of claim 16, wherein the training a preset model according to the sub-multiple KPI time series to be trained to generate a corresponding deterministic model comprises:
    将所述待训练的子多元KPI时间序列输入所述预置模型,提取所述待训练的子多元KPI时间序列中的数值;Input the sub-multiple KPI time series to be trained into the preset model, and extract the values in the sub-multiple KPI time series to be trained;
    根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量;Encode the numerical value according to a preset coding program, and obtain the first Gaussian distribution and the first auxiliary variable of the numerical value;
    根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布;Decode the first auxiliary variable according to a preset decoding program to obtain a second Gaussian distribution corresponding to the value;
    根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型。Training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate a corresponding deterministic model.
  18. 如权利要求17所述的计算机可读存储介质,其中,所述预置编码程序包括第一预置神经网络模型和第一全连接层,所述根据预置编码程序对所述数值进行编码,获取所述数值的第一高斯分布和第一辅助变量,包括:17. The computer-readable storage medium of claim 17, wherein the preset coding program comprises a first preset neural network model and a first fully connected layer, and the numerical value is coded according to the preset coding program, Obtaining the first Gaussian distribution and the first auxiliary variable of the value includes:
    根据所述数值和第一预置神经网络模型,得到所述第一预置神经网络模型的第一隐藏变量;Obtaining the first hidden variable of the first preset neural network model according to the value and the first preset neural network model;
    根据第一预置全连接层和所述第一隐藏变量,得到所述数值对应的第一高斯分布;Obtaining the first Gaussian distribution corresponding to the value according to the first preset fully connected layer and the first hidden variable;
    基于所述第一高斯分布得到所述数值的第一辅助变量。The first auxiliary variable of the value is obtained based on the first Gaussian distribution.
  19. 如权利要求17所述的计算机可读存储介质,其中,所述解码程序包括第二预置神经网络模型和第二预置全连接层,所述根据预置解码程序对所述第一辅助变量进行解码,得到所述数值对应的第二高斯分布,包括:The computer-readable storage medium of claim 17, wherein the decoding program includes a second preset neural network model and a second preset fully connected layer, and the first auxiliary variable is processed according to the preset decoding program. Decoding to obtain the second Gaussian distribution corresponding to the value includes:
    根据所述第一辅助变量和第二预置神经网络模型,得到所述第二预置神经网络模型的第二隐藏变量;Obtaining the second hidden variable of the second preset neural network model according to the first auxiliary variable and the second preset neural network model;
    根据第二预置全连接层和所述第二隐藏变量,得到所述数值对应的第二高斯分布。According to the second preset fully connected layer and the second hidden variable, a second Gaussian distribution corresponding to the value is obtained.
  20. 如权利要求17所述的计算机可读存储介质,其中,所述根据所述第一高斯分布和第二高斯分布训练所述预置模型的损失函数,生成对应的确定性模型,包括:17. The computer-readable storage medium of claim 17, wherein the training the loss function of the preset model according to the first Gaussian distribution and the second Gaussian distribution to generate the corresponding deterministic model comprises:
    获取所述第二高斯分布中的累积分布概率;Acquiring the cumulative distribution probability in the second Gaussian distribution;
    根据所述累积分布概率和第一预置损失函数,得到所述数值的重建概率;Obtaining the numerical reconstruction probability according to the cumulative distribution probability and the first preset loss function;
    根据所述第一高斯分布,得到所述第一高斯分布的正则项;Obtaining a regular term of the first Gaussian distribution according to the first Gaussian distribution;
    根据所述正则项和第二预置损失函数,得到所述数值的正则化项;Obtaining a regularization term of the numerical value according to the regularization term and the second preset loss function;
    基于所述重建概率和所述正则化项训练所述解码程序的网络参数和所述编码程序的网络参数,生成对应的确定性模型。Training the network parameters of the decoding program and the network parameters of the encoding program based on the reconstruction probability and the regularization term, and generate a corresponding deterministic model.
PCT/CN2020/125003 2020-09-22 2020-10-30 Detection method and apparatus for multivariate kpi time series, and device and storage medium WO2021189844A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011004044.3 2020-09-22
CN202011004044.3A CN112131272B (en) 2020-09-22 2020-09-22 Method, device, equipment and storage medium for detecting multi-element KPI time sequence

Publications (1)

Publication Number Publication Date
WO2021189844A1 true WO2021189844A1 (en) 2021-09-30

Family

ID=73842418

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/125003 WO2021189844A1 (en) 2020-09-22 2020-10-30 Detection method and apparatus for multivariate kpi time series, and device and storage medium

Country Status (2)

Country Link
CN (1) CN112131272B (en)
WO (1) WO2021189844A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113963085A (en) * 2021-12-23 2022-01-21 树根互联股份有限公司 State characterization method and device of industrial system and electronic equipment
CN114065608A (en) * 2021-10-21 2022-02-18 深圳市卓立智能制造有限公司 Method and system for stably controlling output power of reciprocating electromagnetic pump and electronic equipment
CN115062686A (en) * 2022-05-07 2022-09-16 北京交通大学 Multi-KPI (Key performance indicator) time sequence abnormity detection method and system based on multi-angle features
CN116304540A (en) * 2023-05-23 2023-06-23 吉林大学 Ocean observation data processing method and device
CN117288441A (en) * 2023-10-16 2023-12-26 中国气象科学研究院 Langley calibration auxiliary method and system
CN117609881A (en) * 2023-11-29 2024-02-27 阿童木(广州)智能科技有限公司 Metal overlap detection method and system based on artificial intelligence

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112686330B (en) * 2021-01-07 2024-05-31 中国农业银行股份有限公司 KPI abnormal data detection method and device, storage medium and electronic equipment
CN113114529B (en) * 2021-03-25 2022-05-24 清华大学 KPI (Key Performance indicator) anomaly detection method and device based on condition variation automatic encoder and computer storage medium
CN113761021A (en) * 2021-08-17 2021-12-07 杭州涂鸦信息技术有限公司 Time sequence index data precision reduction processing method and device and computer equipment
CN115906144B (en) * 2021-08-26 2024-04-19 抖音视界有限公司 Data processing method, data processing device, electronic apparatus, and readable storage medium
CN114239952B (en) * 2021-12-14 2024-02-13 建信金融科技有限责任公司 Abnormality detection method, device, equipment and medium for key performance indexes
CN114494242A (en) * 2022-02-21 2022-05-13 平安科技(深圳)有限公司 Time series data detection method, device, equipment and computer storage medium
CN114881110B (en) * 2022-04-02 2023-07-21 西安交通大学 Real-time detection method for total pressure change mode in cabin of on-orbit spacecraft

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109492193A (en) * 2018-12-28 2019-03-19 同济大学 Abnormal network data based on depth machine learning model generate and prediction technique
CN109543943A (en) * 2018-10-17 2019-03-29 国网辽宁省电力有限公司电力科学研究院 A kind of electricity price inspection execution method based on big data deep learning
CN111277603A (en) * 2020-02-03 2020-06-12 杭州迪普科技股份有限公司 Unsupervised anomaly detection system and method
WO2020147480A1 (en) * 2019-01-14 2020-07-23 阿里巴巴集团控股有限公司 Stream processing-based monitoring index abnormality detection method, device and equipment

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4381361B2 (en) * 2005-08-31 2009-12-09 株式会社東芝 Time-series data analysis device, time-series data analysis method, and time-series data analysis program
US9075713B2 (en) * 2012-05-24 2015-07-07 Mitsubishi Electric Research Laboratories, Inc. Method for detecting anomalies in multivariate time series data
CN109101986A (en) * 2018-06-07 2018-12-28 国网山东省电力公司青岛供电公司 Power transmission and transformation equipment state method for detecting abnormality and system based on stack noise reduction self-encoding encoder
US20200097810A1 (en) * 2018-09-25 2020-03-26 Oracle International Corporation Automated window based feature generation for time-series forecasting and anomaly detection
CN110071913B (en) * 2019-03-26 2020-10-02 同济大学 Unsupervised learning-based time series anomaly detection method
CN109978379B (en) * 2019-03-28 2021-08-24 北京百度网讯科技有限公司 Time series data abnormity detection method and device, computer equipment and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543943A (en) * 2018-10-17 2019-03-29 国网辽宁省电力有限公司电力科学研究院 A kind of electricity price inspection execution method based on big data deep learning
CN109492193A (en) * 2018-12-28 2019-03-19 同济大学 Abnormal network data based on depth machine learning model generate and prediction technique
WO2020147480A1 (en) * 2019-01-14 2020-07-23 阿里巴巴集团控股有限公司 Stream processing-based monitoring index abnormality detection method, device and equipment
CN111277603A (en) * 2020-02-03 2020-06-12 杭州迪普科技股份有限公司 Unsupervised anomaly detection system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CHEN JUNSHENG, LI JIAN, CHEN WEIGEN, SUN PENG: "A Method for Detecting Anomaly Conditions of Wind Turbines Using Stacked Denoising Autoencoders with Sliding Window and Multiple Noise Ratios", TRANSACTIONS OF CHINA ELECTROTECHNICAL SOCIETY, vol. 35, no. 2, 31 January 2020 (2020-01-31), pages 346 - 358, XP055852391, ISSN: 1000-6753, DOI: 10.19595/j.cnki.1000-6753.tces.181577 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114065608A (en) * 2021-10-21 2022-02-18 深圳市卓立智能制造有限公司 Method and system for stably controlling output power of reciprocating electromagnetic pump and electronic equipment
CN114065608B (en) * 2021-10-21 2023-09-22 深圳市卓立智能制造有限公司 Output power stable control method and system for reciprocating electromagnetic pump and electronic equipment
CN113963085A (en) * 2021-12-23 2022-01-21 树根互联股份有限公司 State characterization method and device of industrial system and electronic equipment
CN113963085B (en) * 2021-12-23 2022-03-29 树根互联股份有限公司 State characterization method and device of industrial system and electronic equipment
CN115062686A (en) * 2022-05-07 2022-09-16 北京交通大学 Multi-KPI (Key performance indicator) time sequence abnormity detection method and system based on multi-angle features
CN116304540A (en) * 2023-05-23 2023-06-23 吉林大学 Ocean observation data processing method and device
CN117288441A (en) * 2023-10-16 2023-12-26 中国气象科学研究院 Langley calibration auxiliary method and system
CN117609881A (en) * 2023-11-29 2024-02-27 阿童木(广州)智能科技有限公司 Metal overlap detection method and system based on artificial intelligence
CN117609881B (en) * 2023-11-29 2024-04-30 阿童木(广州)智能科技有限公司 Metal overlap detection method and system based on artificial intelligence

Also Published As

Publication number Publication date
CN112131272A (en) 2020-12-25
CN112131272B (en) 2023-11-10

Similar Documents

Publication Publication Date Title
WO2021189844A1 (en) Detection method and apparatus for multivariate kpi time series, and device and storage medium
CN111709491B (en) Anomaly detection method, device, equipment and storage medium based on self-encoder
CN113011593B (en) Method and system for eliminating quantum measurement noise, electronic device and medium
CN111880998B (en) Service system anomaly detection method and device, computer equipment and storage medium
US20170177993A1 (en) Adaptive neural network management system
CN111914559B (en) Text attribute extraction method and device based on probabilistic graphical model and computer equipment
AU2019201789B2 (en) Failed and censored instances based remaining useful life (rul) estimation of entities
US20200311557A1 (en) Data-driven deep learning model generalization analysis and improvement
US20200410285A1 (en) Anomaly Augmented Generative Adversarial Network
WO2022227198A1 (en) Method and device for determining drug regimen of patient
CN111160485A (en) Regression training-based abnormal behavior detection method and device and electronic equipment
US20180365594A1 (en) Systems and methods for generative learning
CN112529767B (en) Image data processing method, device, computer equipment and storage medium
CN112131274B (en) Method, device, equipment and readable storage medium for detecting abnormal points of time sequence
CN110929733A (en) Denoising method and device, computer equipment, storage medium and model training method
CN112651467A (en) Training method and system and prediction method and system of convolutional neural network
CN114022480B (en) Medical image key point detection method and device based on statistics and shape topological graph
CN113255747B (en) Quantum multichannel convolutional neural classification method, system, terminal and storage medium
CN116910592B (en) Log detection method and device, electronic equipment and storage medium
US11727109B2 (en) Identifying adversarial attacks with advanced subset scanning
CN115329971B (en) Method and device for eliminating amplitude damping noise, electronic equipment and medium
CN116108369A (en) Fault detection method, device, equipment and medium for intelligent micro-grid protection
CN114186583B (en) Method and system for recovering abnormal signal of corrosion detection of tank wall of oil storage tank
Gao et al. Nonparametric modeling and break point detection for time series signal of counts
CN114548367B (en) Reconstruction method and device of multimodal data based on countermeasure network

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20927071

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20927071

Country of ref document: EP

Kind code of ref document: A1