CN106815639A - Outlier detection method and device for stream data - Google Patents

Outlier detection method and device for stream data

Info

Publication number
CN106815639A
CN106815639A (application CN201611229506.5A)
Authority
CN
China
Prior art keywords
data
LSTM model
class
detected
historical moment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611229506.5A
Other languages
Chinese (zh)
Inventor
Chen Long (陈龙)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Neusoft Corp
Original Assignee
Neusoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Neusoft Corp filed Critical Neusoft Corp
Priority to CN201611229506.5A priority Critical patent/CN106815639A/en
Publication of CN106815639A publication Critical patent/CN106815639A/en
Pending legal-status Critical Current

Links

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/08 - Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)

Abstract

The present invention relates to an outlier detection method and device for stream data, used to solve the technical problem in the prior art that the accuracy of outlier detection on stream data is relatively low. The method includes: training a Long Short-Term Memory recurrent neural network (LSTM) model according to the data detected by each class of sensor at historical moments and the measured outlier data at each historical moment, to form a trained LSTM model, where the data detected by each class of sensor at the historical moments serves as one input vector for training the LSTM model; and using the data detected by each class of sensor at the current moment as the input of the trained LSTM model, to obtain the outlier detection result output by the trained LSTM model.

Description

Outlier detection method and device for stream data
Technical field
The present invention relates to the field of data processing, and in particular to an outlier detection method and device for stream data.
Background art
Stream data is an ordered, large-volume sequence of data that arrives rapidly and continuously. In general, a data stream can be regarded as a dynamic data set that grows without bound over time. In many fields, for example the Internet of Things, aerospace measurement and control, meteorology, and financial services, stream data can be understood as the data detected by sensors of various classes at each moment.
Performing outlier detection on stream data helps predict, and diagnose, failures that may occur in the future. At present, research on anomaly detection for stream data is mainly based on statistics and machine learning.
Statistics-based methods analyze the distribution of statistical data and find the outliers that do not conform to that distribution. Such methods require knowledge of the data distribution, and the same outlier may be detected by different distribution models, that is, the mechanism by which an outlier is detected may not be unique. Machine-learning-based methods mainly treat outlier detection as a clustering or classification problem, which resolves the outlier ambiguity problem produced by statistical methods. However, the prior art generally uses shallow machine learning models and feeds the stream data directly into model learning without considering the intrinsic relations between data features, so the accuracy of outlier detection is relatively low.
Summary of the invention
The main object of the present invention is to provide an outlier detection method and device for stream data, to solve the technical problem in the prior art that the accuracy of outlier detection on stream data is relatively low.
To achieve the above object, a first aspect of the present invention provides an outlier detection method for stream data, where the stream data includes the data detected by each class of sensor at historical moments and the measured outlier data at each historical moment. The method includes:
training a Long Short-Term Memory recurrent neural network (LSTM) model according to the data detected by each class of sensor at the historical moments and the measured outlier data, to form a trained LSTM model, where the data detected by each class of sensor at the historical moments is used as one input vector for training the LSTM model; and
using the data detected by each class of sensor at the current moment as the input of the trained LSTM model, to obtain the outlier detection result output by the trained LSTM model.
Optionally, training the LSTM model according to the data detected by each class of sensor at the historical moments and the measured outlier data, to form the trained LSTM model, includes:
learning, by using a stream-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain a target probability value P(y=1|h) of a normal point, where the value of the normal-point label y is 1;
calculating a cross entropy according to the normal-point label and the target probability value P(y=1|h), where the cross entropy serves as the loss function of the model and is used to measure the similarity between the probability value and the label; and
adjusting the parameters of the LSTM model according to the loss function.
Optionally, learning, by using the stream-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, includes:
learning, by using the stream-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a hidden vector representation; and
passing the hidden vector representation through the logistic regression layer of the LSTM model, to obtain the target probability value P(y=1|h).
Optionally, learning, by using the stream-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, includes:
learning, by using the stream-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a first hidden vector representation;
passing the first hidden vector representation through the logistic regression layer of the LSTM model, to obtain a first probability value P(y=1|h1);
learning, by using the stream-data sliding window of the LSTM model, the features of the data detected by all the sensors at the historical moments, to obtain a second hidden vector representation;
passing the second hidden vector representation through the logistic regression layer of the LSTM model, to obtain a second probability value P(y=1|h2); and
calculating the target probability value P(y=1|h) from the first probability value P(y=1|h1) and the second probability value P(y=1|h2) by the following formula:
P(y=1|h) = sigmoid(αP(y=1|h1) + βP(y=1|h2));
where α > 0, β > 0, α + β = 1, and sigmoid is the activation function.
Optionally, calculating the cross entropy according to the normal-point label and the target probability value P(y=1|h) includes:
calculating the cross entropy L(θ) between the normal-point label and the target probability value P(y=1|h) by the following formula:
L(θ) = -(y·log P(y=1|h) + (1-y)·log(1-P(y=1|h))).
Optionally, adjusting the parameters of the LSTM model according to the loss function includes:
adjusting the model parameters by a stochastic gradient descent algorithm according to the loss function L(θ).
A second aspect of the present invention provides a detection device, including:
a training module, configured to train a Long Short-Term Memory recurrent neural network (LSTM) model according to the data detected by each class of sensor at the historical moments and the measured outlier data at each historical moment, to form a trained LSTM model, where the data detected by each class of sensor at the historical moments is used as one input vector for training the LSTM model; and
a detection module, configured to use the data detected by each class of sensor at the current moment as the input of the trained LSTM model, to obtain the outlier detection result output by the trained LSTM model.
Optionally, the training module includes:
a data feature learning module, configured to learn, by using the stream-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, where the value of the normal-point label y is 1;
a calculation module, configured to calculate the cross entropy according to the normal-point label and the target probability value P(y=1|h), where the cross entropy serves as the loss function of the model and is used to measure the similarity between the probability value and the label; and
a parameter adjustment module, configured to adjust the parameters of the LSTM model according to the loss function.
Optionally, the data feature learning module is specifically configured to:
learn, by using the stream-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a hidden vector representation; and
pass the hidden vector representation through the logistic regression layer of the LSTM model, to obtain the target probability value P(y=1|h).
Optionally, the data feature learning module is specifically configured to:
learn, by using the stream-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a first hidden vector representation;
pass the first hidden vector representation through the logistic regression layer of the LSTM model, to obtain a first probability value P(y=1|h1);
learn, by using the stream-data sliding window of the LSTM model, the features of the data detected by all the sensors at the historical moments, to obtain a second hidden vector representation;
pass the second hidden vector representation through the logistic regression layer of the LSTM model, to obtain a second probability value P(y=1|h2); and
calculate the target probability value P(y=1|h) from the first probability value P(y=1|h1) and the second probability value P(y=1|h2) by the following formula:
P(y=1|h) = sigmoid(αP(y=1|h1) + βP(y=1|h2));
where α > 0, β > 0, α + β = 1, and sigmoid is the activation function.
Optionally, the calculation module is configured to:
calculate the cross entropy L(θ) between the normal-point label and the target probability value P(y=1|h) by the following formula:
L(θ) = -(y·log P(y=1|h) + (1-y)·log(1-P(y=1|h))).
Optionally, the parameter adjustment module is specifically configured to:
adjust the model parameters by a stochastic gradient descent algorithm according to the loss function L(θ).
Through the above technical solution, the data detected by each class of sensor at the historical moments is used as one input vector for training the LSTM model. Owing to the intrinsic characteristics of the LSTM model, the relations between different input vectors are taken into account during training, which ensures that, when the LSTM model is trained with the data detected by the various classes of sensors, it can learn the connections between the various sensors to a high degree. Compared with the prior art, which does not consider the intrinsic relations between data features when detecting outliers in stream data, the present invention can detect outliers more accurately by using the trained LSTM model.
Other features and advantages of the present invention will be described in detail in the subsequent detailed description.
Brief description of the drawings
The accompanying drawings are provided for a further understanding of the present invention and constitute a part of the specification. Together with the following detailed description, they serve to explain the present invention but do not limit the present invention. In the drawings:
Fig. 1 is a schematic structural diagram of a computing unit of the LSTM model;
Fig. 2 is a schematic flowchart of an outlier detection method for stream data provided by an embodiment of the present invention;
Fig. 3 is a schematic diagram of a stream-data sliding learning window learning data features, provided by an embodiment of the present invention;
Fig. 4 is a schematic diagram of a training process of an LSTM model provided by an embodiment of the present invention;
Fig. 5 is a schematic diagram of a training process of another LSTM model provided by an embodiment of the present invention;
Fig. 6A is a schematic structural diagram of a detection device provided by an embodiment of the present invention;
Fig. 6B is a schematic structural diagram of another detection device provided by an embodiment of the present invention.
Detailed description of the embodiments
Specific embodiments of the present invention are described in detail below with reference to the accompanying drawings. It should be understood that the specific embodiments described here are only intended to illustrate and explain the present invention, and are not intended to limit the present invention.
To make it easier for those skilled in the art to understand the technical solutions provided by the embodiments of the present invention, the related technology involved is briefly introduced first.
The LSTM (Long Short-Term Memory recurrent neural network) model is a kind of time-recurrent neural network. By introducing a forget-gate mechanism f, it solves the information-loss and vanishing-gradient problems that the traditional recurrent neural network (RNN) suffers during propagation, and is more suitable for processing and predicting time series in which critical events are separated by long intervals and delays.
As shown in Fig. 1, the computation of each computing unit of the LSTM model is given by the following formulas:
i_t = σ(W^(i) x_t + U^(i) h_{t-1} + b^(i))
f_t = σ(W^(f) x_t + U^(f) h_{t-1} + b^(f))
c_t = f_t ⊙ c_{t-1} + i_t ⊙ tanh(W^(c) x_t + U^(c) h_{t-1} + b^(c))
o_t = σ(W^(o) x_t + U^(o) h_{t-1} + b^(o))
h_t = o_t ⊙ tanh(c_t)
Usually, the outliers produced in stream data are most closely related to its historical data, and analyzing the historical data helps predict and judge the abnormal conditions of the data. The above formulas show how the hidden vector representation h_t of any class of sensor at moment t is calculated. First, x_t is the vector representation of that class of sensor at moment t, whose components are the values collected at moment t by each sensor of the class, and h_{t-1} is the hidden vector representation at moment t-1. W^(i), U^(i) and b^(i) are the parameters for calculating the input-gate vector representation i_t; based on the sigmoid activation function, a vector representation i_t whose components lie between 0 and 1 is obtained. Similarly, W^(f), U^(f) and b^(f) are the parameters for calculating the forget-gate vector representation f_t. Then, based on the weighting by i_t and f_t, the important information of the stream data at moment t-1 is retained automatically while useless information is filtered out, and the memory-cell vector representation c_t at moment t is calculated. Likewise, based on the sigmoid activation function, the output-gate vector o_t of the stream data at moment t is obtained. Finally, the element-wise product of o_t and tanh(c_t) is calculated to obtain the hidden vector representation h_t of the stream data at moment t, which is used in calculating the hidden vector representation at moment t+1.
After the input vector x_t passes through the LSTM, the corresponding hidden vector representation h_t is obtained, containing the change information of the historical stream data before moment t. Training or predicting based on h_t, rather than on x_t alone, is more conducive to capturing the sensor information at each moment and improves the performance of the outlier detection model.
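The computing-unit formulas above can be sketched as a single forward step in Python/NumPy (a minimal sketch with hypothetical dimensions and randomly initialized parameters, not the patented implementation):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, params):
    """One LSTM computing-unit step: gates i_t, f_t, o_t, memory cell c_t, hidden vector h_t."""
    W_i, U_i, b_i = params["i"]
    W_f, U_f, b_f = params["f"]
    W_c, U_c, b_c = params["c"]
    W_o, U_o, b_o = params["o"]
    i_t = sigmoid(W_i @ x_t + U_i @ h_prev + b_i)                        # input gate
    f_t = sigmoid(W_f @ x_t + U_f @ h_prev + b_f)                        # forget gate
    c_t = f_t * c_prev + i_t * np.tanh(W_c @ x_t + U_c @ h_prev + b_c)  # memory cell
    o_t = sigmoid(W_o @ x_t + U_o @ h_prev + b_o)                        # output gate
    h_t = o_t * np.tanh(c_t)                                             # hidden vector
    return h_t, c_t

rng = np.random.default_rng(0)
n_in, n_hid = 3, 4  # hypothetical sizes: 3 sensors of one class, 4 hidden units
params = {g: (rng.normal(size=(n_hid, n_in)) * 0.1,
              rng.normal(size=(n_hid, n_hid)) * 0.1,
              np.zeros(n_hid)) for g in ("i", "f", "c", "o")}
h, c = np.zeros(n_hid), np.zeros(n_hid)
for x_t in rng.normal(size=(5, n_in)):  # a short stream of 5 moments
    h, c = lstm_step(x_t, h, c, params)
print(h.shape)  # (4,)
```

Because h_t = o_t ⊙ tanh(c_t) with o_t in (0, 1), every component of the hidden vector stays strictly inside (-1, 1).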
Based on the LSTM model, an embodiment of the present invention provides an outlier detection method for stream data, where the stream data includes the data detected by each class of sensor at historical moments and the measured outlier data at each historical moment. As shown in Fig. 2, the method includes:
S201: training a Long Short-Term Memory recurrent neural network (LSTM) model according to the data detected by each class of sensor at the historical moments and the measured outlier data, to form a trained LSTM model, where the data detected by each class of sensor at the historical moments is used as one input vector for training the LSTM model.
Specifically, the data detected by the same sensor at multiple historical moments forms a data sequence; this data sequence is used as an input vector, and each data item in the sequence is one element of the input vector.
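The construction of such input vectors can be sketched as follows (sensor readings and class names are hypothetical, chosen only to illustrate the grouping described above):

```python
import numpy as np

# Hypothetical history: readings of each sensor class at 3 historical moments.
# Each row is one moment; each column is one sensor of that class.
history = {
    "temperature": [[20.1, 20.3], [20.2, 20.4], [20.0, 20.5]],
    "humidity":    [[0.41],        [0.42],        [0.40]],
    "pressure":    [[1.1, 1.0, 0.9], [1.2, 1.0, 0.9], [1.1, 1.1, 0.8]],
}

# One input vector per sensor class: the data sequence detected over the
# historical moments, flattened so that each datum is one vector element.
input_vectors = {cls: np.asarray(seq, dtype=float).reshape(-1)
                 for cls, seq in history.items()}

for cls, vec in input_vectors.items():
    print(cls, vec.shape)
# temperature (6,)
# humidity (3,)
# pressure (9,)
```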
S202: using the data detected by each class of sensor at the current moment as the input of the trained LSTM model, to obtain the outlier detection result output by the trained LSTM model.
With reference to the above introduction to the LSTM model, the embodiment of the present invention uses the data detected by each class of sensor at the historical moments as one input vector for training the LSTM model. With n classes of sensors, this can be regarded as n time units of LSTM model operation; in each time unit, the input vector of the LSTM model is the data detected by one class of sensor at the historical moments. Owing to the intrinsic characteristics of the LSTM model, the relations between different input vectors are taken into account during training, which ensures that, when the LSTM model is trained with the data detected by the various classes of sensors, it can learn the connections between the various sensors to a high degree. Compared with the prior art, which does not consider the intrinsic relations between data features when detecting outliers in stream data, the embodiment of the present invention can detect outliers more accurately by using the trained LSTM model. In other words, the embodiment of the present invention treats outlier detection on stream data as a classification problem based on time series and, by taking the relations between the various sensors into account, improves the accuracy of outlier detection.
To make those skilled in the art understand the technical solutions provided by the embodiments of the present invention more clearly, the above steps are described in detail below.
Specifically, the above step S201 may include: learning, by using the stream-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, where the value of the normal-point label y is 1; calculating the cross entropy according to the normal-point label and the target probability value P(y=1|h), where the cross entropy serves as the loss function of the model and is used to measure the similarity between the probability value and the label; and adjusting the parameters of the LSTM model according to the loss function.
It is worth explaining that the size of the stream-data sliding learning window indicates the step length of each learning pass; it can be adjusted dynamically according to the input vector, or preset to a fixed value.
Fig. 3 is a schematic diagram of the stream-data sliding learning window learning data features. Each circle in the top layer of the figure represents one element of an input vector, that is, the data detected by a sensor at a certain historical moment. The size of the sliding learning window is 4, i.e. the features of 4 elements are learned each time. k1, k2 and k3 denote the neural network parameters of each layer; with reference to the computing unit shown in Fig. 1, the neural network parameters of each layer include the parameters involved in that computing unit. After processing by the neural network parameters of each layer, the corresponding hidden vector representations h_{t-2}, h_{t-1} and h_t are obtained. As can be seen from Fig. 1, the hidden vector representation of one step serves as input for computing the hidden vector representation of the next step.
In this way, after the data features of each input vector are learned in turn to obtain a hidden vector representation, the hidden vector representation is processed by the logistic regression layer to obtain the target probability value P(y=1|h) of a normal point. That is, the value of the normal-point label y is 1, and the value of the outlier label y is 0. The target probability value P(y=1|h) is the probability that the detected data belongs to a normal point, so the probability that the detected data belongs to an outlier is 1-P(y=1|h). In addition, the loss function is used to measure the similarity between the probability value and the label: if the similarity satisfies a preset threshold, training can be stopped to obtain the trained LSTM model; alternatively, training can also be stopped when the number of training iterations reaches a threshold, to obtain the trained LSTM model.
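The decision step described above, where the probability that a point is an outlier is 1-P(y=1|h) and a threshold separates the two classes, can be sketched as follows (the threshold value is hypothetical):

```python
def classify(p_normal, threshold=0.5):
    """Label a point from the target probability P(y=1|h) of being normal.
    Returns (label, p_outlier): label 1 = normal point, 0 = outlier."""
    p_outlier = round(1.0 - p_normal, 6)
    label = 1 if p_normal > threshold else 0
    return label, p_outlier

print(classify(0.93))  # (1, 0.07) -> normal point
print(classify(0.12))  # (0, 0.88) -> outlier
```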
In one possible implementation of the embodiment of the present invention, learning, by using the stream-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, includes: learning, by using the stream-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a hidden vector representation; and passing the hidden vector representation through the logistic regression layer of the LSTM model, to obtain the target probability value P(y=1|h).
Illustratively, suppose 3 classes of sensors each detect data at every historical moment from t_{n-w} to t_n, where the classes of sensors can be divided according to the type of data a sensor detects; for example, the above 3 classes may respectively be temperature sensors, humidity sensors and pressure sensors, and there may be multiple sensors of each class. Taking moment t_n as an example, the stream data at moment t_n is {x_t1, x_t2, x_h1, x_p1, x_p2, x_p3}, where {x_t1, x_t2} is the data detected by the temperature sensors at moment t_n, {x_h1} is the data detected by the humidity sensors at moment t_n, and {x_p1, x_p2, x_p3} is the data detected by the pressure sensors at moment t_n. Thus, in this possible implementation, the embodiment of the present invention uses the data detected by each class of sensor from t_{n-w} to t_n as the input vectors for training the LSTM model.
Specifically, as shown in Fig. 4, LSTM_t learns the data features of the data sequence detected by the temperature sensors from t_{n-w} to t_n, LSTM_h learns the data features of the data sequence detected by the humidity sensors from t_{n-w} to t_n, and LSTM_p learns the data features of the data sequence detected by the pressure sensors from t_{n-w} to t_n; after learning through multiple intermediate layers, the hidden vector representations h_t, h_h and h_p are obtained respectively. Further, h_t, h_h and h_p are used as the input vector h_s of the logistic regression layer, i.e. h_s = [h_t, h_h, h_p]; after processing by the activation function of the logistic regression layer, the target probability value P(y=1|h) is obtained.
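Assuming a stand-in encoder in place of the real LSTM_t, LSTM_h and LSTM_p layers, the concatenation h_s = [h_t, h_h, h_p] followed by the logistic regression layer can be sketched as follows (all values hypothetical):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(1)

def encode(seq, n_hid=4):
    """Stand-in for a per-class LSTM: maps a data sequence to a hidden vector.
    A real implementation would run the recurrent computing unit of Fig. 1."""
    seq = np.asarray(seq, dtype=float)
    return np.tanh(seq.mean() * np.ones(n_hid))  # toy hidden vector representation

# Data sequences of the three sensor classes from t_{n-w} to t_n (hypothetical).
h_t = encode([20.1, 20.2, 20.0, 20.3])  # temperature class
h_h = encode([0.41, 0.42, 0.40, 0.43])  # humidity class
h_p = encode([1.1, 1.2, 1.1, 1.0])      # pressure class

h_s = np.concatenate([h_t, h_h, h_p])   # input vector of the logistic regression layer
w_l = rng.normal(size=h_s.size) * 0.1   # logistic-regression weights (randomly initialized)
b = 0.0
p_normal = sigmoid(w_l @ h_s + b)       # target probability value P(y=1|h)
print(0.0 < p_normal < 1.0)  # True
```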
Illustratively, suppose h_s = [x_1, x_2, ..., x_n] and we need to learn y = a_1·x_1 + a_2·x_2 + ... + b, i.e. y = w_l·h_s + b. The following transformation can be made: let y = log(p/(1-p)), where p is a value in [0, 1] that can be regarded as the possibility of a certain event occurring. Further transforming y = log(p/(1-p)) gives:
p = 1/(1 + e^(-y))
The above is the functional form of the logistic regression model, also called the sigmoid function. From the above formula, p is a continuous and differentiable curve over [0, 1] on the whole real number field. When solving a binary classification problem, a threshold can be set: when p is greater than the threshold, the sample is classified into class A; otherwise it is classified into class B. Then, in the embodiment of the present invention, for the classification problem of normal points and outliers, the target probability P(y=1|h) that the detected data is a normal point is:
P(y=1|h) = 1/(1 + e^(-(w_l·h_s + b)))
and the target probability P(y=0|h) that the detected data is an outlier is:
P(y=0|h) = 1 - P(y=1|h).
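The relation between the log-odds y = log(p/(1-p)) and the sigmoid form of p can be checked numerically (a generic illustration, not code from the patent):

```python
import math

def sigmoid(y):
    # p = 1 / (1 + e^{-y}), the inverse of the log-odds y = log(p / (1 - p))
    return 1.0 / (1.0 + math.exp(-y))

def log_odds(p):
    return math.log(p / (1.0 - p))

p = 0.8
assert abs(sigmoid(log_odds(p)) - p) < 1e-12  # the two maps are inverses
print(sigmoid(0.0))  # 0.5: y = 0 corresponds to the natural threshold p = 0.5
```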
Further, the cross entropy L(θ) between the target probability P(y=1|h) and the label y can be obtained by the following formula:
L(θ) = -(y·log P(y=1|h) + (1-y)·log(1-P(y=1|h)));
It is worth explaining that the smaller the cross entropy, the more similar the target probability is to the label, so the cross entropy can be used as the loss function to adjust the parameters of each layer of the LSTM model. After the parameters are adjusted, with reference to the process shown in Fig. 4, the LSTM model continues to be trained on the data detected by the 3 classes of sensors from t_{n-w} to t_n, until the similarity between the trained target probability and the label reaches a preset threshold, or until the number of training iterations reaches a threshold; training is then stopped and the trained LSTM model is obtained.
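The cross entropy L(θ) above can be computed directly; the sketch below also illustrates the property that a smaller cross entropy means the target probability is closer to the label:

```python
import math

def cross_entropy(y, p):
    """Binary cross entropy L between the label y (1 = normal point, 0 = outlier)
    and the target probability value p = P(y=1|h)."""
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

# The closer the probability is to the label, the smaller the cross entropy:
assert cross_entropy(1, 0.99) < cross_entropy(1, 0.60)
assert cross_entropy(0, 0.01) < cross_entropy(0, 0.40)
print(round(cross_entropy(1, 0.5), 4))  # 0.6931 (= log 2)
```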
In another possible implementation of the embodiment of the present invention, learning, by using the stream-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, includes: learning, by using the stream-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a first hidden vector representation; passing the first hidden vector representation through the logistic regression layer of the LSTM model, to obtain a first probability value P(y=1|h1); learning, by using the stream-data sliding window of the LSTM model, the features of the data detected by all the sensors at the historical moments, to obtain a second hidden vector representation; passing the second hidden vector representation through the logistic regression layer of the LSTM model, to obtain a second probability value P(y=1|h2); and calculating the target probability value P(y=1|h) from the first probability value P(y=1|h1) and the second probability value P(y=1|h2) by the following formula:
P(y=1|h) = sigmoid(αP(y=1|h1) + βP(y=1|h2))
where α > 0, β > 0, α + β = 1, and sigmoid is the activation function.
The example of Fig. 4 is used for illustration. As shown in Fig. 5, with reference to the process described above for Fig. 4, after h_s = [h_t, h_h, h_p] is processed by the activation function of the logistic regression layer to obtain the first probability value P(y=1|h1), LSTM_a learns the features of the data detected by all the sensors from t_{n-w} to t_n to obtain the hidden vector representation h_a, and h_a is processed by the activation function of the logistic regression layer to obtain the second probability value P(y=1|h2). The final target probability value P(y=1|h) is the weighted sum of the first probability value P(y=1|h1) and the second probability value P(y=1|h2), where the weights α and β can be set in advance according to actual requirements.
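The weighted fusion P(y=1|h) = sigmoid(αP(y=1|h1) + βP(y=1|h2)) can be sketched as follows (the weights α and β are hypothetical values set in advance, as the text describes):

```python
import math

def fuse(p1, p2, alpha=0.6, beta=0.4):
    """Target probability P(y=1|h) = sigmoid(alpha*P(y=1|h1) + beta*P(y=1|h2))."""
    assert alpha > 0 and beta > 0 and abs(alpha + beta - 1.0) < 1e-12
    z = alpha * p1 + beta * p2
    return 1.0 / (1.0 + math.exp(-z))

p = fuse(0.9, 0.7)
print(0.5 < p < 1.0)  # True: both component probabilities point to a normal point
```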
After the target probability value P(y=1|h) is calculated in this possible implementation, the cross entropy L(θ) = -(y·log P(y=1|h) + (1-y)·log(1-P(y=1|h))) can likewise be calculated, and the parameters of the LSTM model are adjusted according to L(θ).
In the above possible implementation, when the LSTM model is trained, the data detected by each class of sensor is used as one input vector for training the LSTM model, so the connections between adjacent classes of sensors are taken into account. Meanwhile, in order to capture the connections between individual sensors of different classes, the data detected by the sensors of all classes is also directly input into the LSTM model as another input vector, which helps the LSTM model learn the deeper connections between sensors of different classes and further improves the detection accuracy.
Further, the embodiment of the present invention can also adjust the model parameters through the SGD (Stochastic Gradient Descent) algorithm according to the cross entropy L(θ).
Specifically, the parameter adjustment is shown by the following formulas:
θ = {W_s^(i), W_s^(f), W_s^(c), W_s^(o), U_s^(i), U_s^(f), U_s^(c), U_s^(o), b_s^(i), b_s^(f), b_s^(c), b_s^(o), w_l, b}, s = t, h, p, ..., a
θ̂ = θ - l·∂L(θ)/∂θ
Wherein, l is learning rate, and θ represents the parameter of current each layer of LSTM models,The parameter after adjustment is represented,Represent Cross entropy L (θ) is to θ derivations.Reference picture 1, the parameter of each computing unit includes W(i), U(i)And b(i), that is, calculate input gate to Measure the parameter for representing, and W(f), U(f)And b(f), that is, calculate the parameter for forgetting a vector representation, and W(c), U(c)And b(c), that is, calculate the parameter of memory elements vector representation, and W(o), U(o)And b(o), that is, the parameter of out gate vector representation is calculated, In addition, the relevant parameter that computing is used also including dot product etc..In corresponding to Fig. 4 and Fig. 5, the every LSTM shown in figuretRepresent Each layer, the computing unit shown in each layer of structure reference picture 1.Being calculated by above-mentioned formula for every layer of each parameter can In the hope of the parameter after adjustment.Wherein, LSTMh、LSTMpAnd LSTMaSimilarly, so, trained again according to the parameter after adjustment LSTM models, terminate, the LSTM moulds trained until the satisfactory numerical value of cross entropy or frequency of training reach threshold value Type.
It is worth noting that the stochastic gradient descent algorithm allows the parameters of each layer of the LSTM model to converge along the direction of steepest gradient, ensuring that the cross entropy decreases with each training iteration and comes ever closer to the required value.
An embodiment of the present invention further provides a detection apparatus 60 for implementing the abnormal point detection method for streaming data provided by the above method embodiment. As shown in Figure 6A, the detection apparatus 60 includes:

A training module 601, configured to train a long short-term memory (LSTM) recurrent neural network model according to the data detected by each class of sensor at historical moments and the measured abnormal point values at each historical moment, to form a trained LSTM model, wherein the data detected by each class of sensor at the historical moments serves as one input vector for training the LSTM model;

A detection module 602, configured to take the data detected by each class of sensor at the current moment as the input of the trained LSTM model, and obtain the abnormal point value detection result output by the trained LSTM model.
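As a rough illustration of how a sliding window over streaming data might produce one input vector per class of sensor, consider the sketch below. The sensor class names, readings, and window length are hypothetical, not taken from the patent:

```python
import numpy as np

def sliding_windows(stream, window):
    """Cut one sensor class's stream into overlapping windows; each window
    becomes one input vector for the LSTM at the corresponding moment."""
    return np.stack([stream[i:i + window] for i in range(len(stream) - window + 1)])

# Two hypothetical sensor classes with six historical readings each.
temperature = np.array([20.1, 20.3, 20.2, 35.0, 20.4, 20.2])
humidity    = np.array([0.40, 0.41, 0.40, 0.39, 0.80, 0.41])

inputs = {name: sliding_windows(s, window=3)
          for name, s in (("temperature", temperature), ("humidity", humidity))}
# inputs["temperature"] has shape (4, 3): four windows of three readings each
```

Each per-class window array would then feed the corresponding LSTM branch, while the concatenation across all sensors would feed the additional all-sensor input vector described above.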
Specifically, as shown in Figure 6B, the training module 601 includes:

A data feature learning module 6011, configured to learn, using the streaming-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, wherein the label y of a normal point takes the value 1;

A computing module 6012, configured to calculate the cross entropy according to the label of the normal point and the target probability value P(y=1|h), wherein the cross entropy serves as the loss function of the model and is used to measure the similarity between the probability value and the label;

A parameter adjustment module 6013, configured to adjust the parameters of the LSTM model according to the loss function.
Optionally, the data feature learning module 6011 is specifically configured to:

learn, using the streaming-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a hidden vector representation; and

pass the hidden vector representation through the logistic regression layer of the LSTM model to obtain the target probability value P(y=1|h).
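A logistic regression layer of this kind reduces to a sigmoid over an affine map of the hidden vector. A minimal sketch, with illustrative hidden values and weights that are ours, not the patent's:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def logistic_layer(h, w, b):
    """Map a hidden vector h to P(y=1|h), the probability of a normal point."""
    return sigmoid(np.dot(w, h) + b)

h = np.array([0.2, -0.5, 1.1])  # hidden vector from the LSTM (made-up values)
w = np.array([0.4,  0.3, 0.6])  # layer weights (made-up values)
p = logistic_layer(h, w, b=0.1)  # a probability in the open interval (0, 1)
```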
Optionally, the data feature learning module 6011 is specifically configured to:

learn, using the streaming-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a first hidden vector representation;

pass the first hidden vector representation through the logistic regression layer of the LSTM model to obtain the first probability value P(y=1|h1);

learn, using the streaming-data sliding window of the LSTM model, the features of the data detected by all the sensors at the historical moments, to obtain a second hidden vector representation;

pass the second hidden vector representation through the logistic regression layer of the LSTM model to obtain the second probability value P(y=1|h2); and

calculate the target probability value P(y=1|h) from the first probability value P(y=1|h1) and the second probability value P(y=1|h2) by the following formula:

P(y=1|h) = sigmoid(α·P(y=1|h1) + β·P(y=1|h2))

wherein α > 0, β > 0, α + β = 1, and sigmoid is the activation function.
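The fusion formula above can be written down directly. A small sketch, with hypothetical probability values and mixing weights:

```python
import math

def fuse(p1, p2, alpha, beta):
    """Target probability P(y=1|h) = sigmoid(alpha*P(y=1|h1) + beta*P(y=1|h2)),
    with the constraints alpha > 0, beta > 0 and alpha + beta = 1."""
    assert alpha > 0 and beta > 0 and abs(alpha + beta - 1.0) < 1e-9
    return 1.0 / (1.0 + math.exp(-(alpha * p1 + beta * p2)))

# Per-class branch reports 0.9, all-sensor branch reports 0.7, weighted 0.6/0.4:
p = fuse(p1=0.9, p2=0.7, alpha=0.6, beta=0.4)
```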
Optionally, the computing module 6012 is configured to:

calculate the cross entropy L(θ) of the label of the normal point and the target probability value P(y=1|h) by the following formula:

L(θ) = −(y·log P(y=1|h) + (1−y)·log(1−P(y=1|h))).
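The formula above is the standard binary cross entropy. As a quick sketch with illustrative values:

```python
import math

def cross_entropy(y, p):
    """L(theta) = -(y*log(p) + (1-y)*log(1-p)) between label y and P(y=1|h)."""
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

# For a normal point the label is y = 1, so the loss reduces to -log(p);
# a confident, correct prediction therefore gives a smaller loss:
loss_good = cross_entropy(y=1, p=0.95)
loss_bad  = cross_entropy(y=1, p=0.10)
```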
Optionally, the parameter adjustment module 6013 is specifically configured to:

adjust the model parameters by the stochastic gradient descent algorithm according to the loss function L(θ).
Those skilled in the art can clearly understand that, for convenience and brevity of description, only the above division into functional modules is used as an example; in practical applications, the above functions may be allocated to different functional modules as needed, i.e. the internal structure of the apparatus may be divided into different functional modules to complete all or part of the functions described above. For the specific working process of the functional modules described above, reference may be made to the corresponding process in the foregoing method embodiment, which is not repeated here.
With the above detection apparatus, the data detected by each class of sensor at the historical moments serves as an input vector for training the LSTM model. Owing to the inherent characteristics of the LSTM model, the relationships between different input vectors are considered during training, so that when the LSTM model is trained with the data detected by the various sensors, it is ensured that the LSTM model can learn the relationships among them. Compared with the prior art, which does not consider the relationships among data features when detecting abnormal points in streaming data, the present invention can detect abnormal points more accurately using the trained LSTM model.
In the embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the functional modules in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist physically alone. The above integrated unit may be implemented in the form of hardware, or in the form of hardware plus a software functional unit.

The above integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform part of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes various media capable of storing data, such as a USB flash disk, a removable hard disk, a RAM (Random Access Memory), a magnetic disk, or an optical disc.

The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Any change or replacement that can readily occur to those familiar with the technical field, within the technical scope disclosed by the present invention, shall be covered by the protection scope of the present invention. Therefore, the protection scope of the present invention shall be defined by the scope of the claims.

Claims (10)

1. An abnormal point detection method for streaming data, characterized in that the streaming data comprises data detected by each class of sensor at historical moments and measured abnormal point values at each historical moment, the method comprising:

training a long short-term memory (LSTM) recurrent neural network model according to the data detected by each class of sensor at the historical moments and the measured abnormal point values, to form a trained LSTM model, wherein the data detected by each class of sensor at the historical moments serves as one input vector for training the LSTM model; and

taking the data detected by each class of sensor at the current moment as the input of the trained LSTM model, and obtaining the abnormal point value detection result output by the trained LSTM model.
2. The method according to claim 1, characterized in that the training of the long short-term memory (LSTM) recurrent neural network model according to the data detected by each class of sensor at the historical moments and the measured abnormal point values, to form a trained LSTM model, comprises:

learning, using the streaming-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, wherein the label y of a normal point takes the value 1;

calculating the cross entropy according to the label of the normal point and the target probability value P(y=1|h), wherein the cross entropy serves as the loss function of the model and is used to measure the similarity between the probability value and the label; and

adjusting the parameters of the LSTM model according to the loss function.
3. The method according to claim 2, characterized in that the learning, using the streaming-data sliding window of the LSTM model, of the data detected by each class of sensor at the historical moments to obtain the target probability value P(y=1|h) of a normal point comprises:

learning, using the streaming-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a hidden vector representation; and

passing the hidden vector representation through the logistic regression layer of the LSTM model to obtain the target probability value P(y=1|h).
4. The method according to claim 2, characterized in that the learning, using the streaming-data sliding window of the LSTM model, of the data detected by each class of sensor at the historical moments to obtain the target probability value P(y=1|h) of a normal point comprises:

learning, using the streaming-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a first hidden vector representation;

passing the first hidden vector representation through the logistic regression layer of the LSTM model to obtain the first probability value P(y=1|h1);

learning, using the streaming-data sliding window of the LSTM model, the features of the data detected by all the sensors at the historical moments, to obtain a second hidden vector representation;

passing the second hidden vector representation through the logistic regression layer of the LSTM model to obtain the second probability value P(y=1|h2); and

calculating the target probability value P(y=1|h) from the first probability value P(y=1|h1) and the second probability value P(y=1|h2) by the following formula:

P(y=1|h) = sigmoid(α·P(y=1|h1) + β·P(y=1|h2));

wherein α > 0, β > 0, α + β = 1, and sigmoid is the activation function.
5. The method according to any one of claims 2 to 4, characterized in that the calculating of the cross entropy according to the label of the normal point and the target probability value P(y=1|h) comprises:

calculating the cross entropy L(θ) of the label of the normal point and the target probability value P(y=1|h) by the following formula:

L(θ) = −(y·log P(y=1|h) + (1−y)·log(1−P(y=1|h))).

6. The method according to claim 5, characterized in that the adjusting of the parameters of the LSTM model according to the loss function comprises:

adjusting the model parameters by the stochastic gradient descent algorithm according to the loss function L(θ).
7. A detection apparatus, characterized by comprising:

a training module, configured to train a long short-term memory (LSTM) recurrent neural network model according to the data detected by each class of sensor at historical moments and the measured abnormal point values at each historical moment, to form a trained LSTM model, wherein the data detected by each class of sensor at the historical moments serves as one input vector for training the LSTM model; and

a detection module, configured to take the data detected by each class of sensor at the current moment as the input of the trained LSTM model, and obtain the abnormal point value detection result output by the trained LSTM model.

8. The apparatus according to claim 7, characterized in that the training module comprises:

a data feature learning module, configured to learn, using the streaming-data sliding window of the LSTM model, the data detected by each class of sensor at the historical moments, to obtain the target probability value P(y=1|h) of a normal point, wherein the label y of a normal point takes the value 1;

a computing module, configured to calculate the cross entropy according to the label of the normal point and the target probability value P(y=1|h), wherein the cross entropy serves as the loss function of the model and is used to measure the similarity between the probability value and the label; and

a parameter adjustment module, configured to adjust the parameters of the LSTM model according to the loss function.
9. The apparatus according to claim 8, characterized in that the data feature learning module is specifically configured to:

learn, using the streaming-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a hidden vector representation; and

pass the hidden vector representation through the logistic regression layer of the LSTM model to obtain the target probability value P(y=1|h).
10. The apparatus according to claim 8, characterized in that the data feature learning module is specifically configured to:

learn, using the streaming-data sliding window of the LSTM model, the features of the data detected by each class of sensor at the historical moments respectively, to obtain a first hidden vector representation;

pass the first hidden vector representation through the logistic regression layer of the LSTM model to obtain the first probability value P(y=1|h1);

learn, using the streaming-data sliding window of the LSTM model, the features of the data detected by all the sensors at the historical moments, to obtain a second hidden vector representation;

pass the second hidden vector representation through the logistic regression layer of the LSTM model to obtain the second probability value P(y=1|h2); and

calculate the target probability value P(y=1|h) from the first probability value P(y=1|h1) and the second probability value P(y=1|h2) by the following formula:

P(y=1|h) = sigmoid(α·P(y=1|h1) + β·P(y=1|h2));

wherein α > 0, β > 0, α + β = 1, and sigmoid is the activation function.
CN201611229506.5A 2016-12-27 2016-12-27 The abnormal point detecting method and device of flow data Pending CN106815639A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611229506.5A CN106815639A (en) 2016-12-27 2016-12-27 The abnormal point detecting method and device of flow data


Publications (1)

Publication Number Publication Date
CN106815639A true CN106815639A (en) 2017-06-09

Family

ID=59110168

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611229506.5A Pending CN106815639A (en) 2016-12-27 2016-12-27 The abnormal point detecting method and device of flow data

Country Status (1)

Country Link
CN (1) CN106815639A (en)


Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109447305A (en) * 2018-06-23 2019-03-08 四川大学 A kind of trend forecasting method based on the long Memory Neural Networks in short-term of quantum weighting
CN109447305B (en) * 2018-06-23 2021-09-07 四川大学 Trend prediction method based on quantum weighted long-time and short-time memory neural network
CN112585926A (en) * 2018-08-23 2021-03-30 摩根士丹利服务集团有限公司 Distributed system component failure identification
CN109871764A (en) * 2019-01-16 2019-06-11 深兰科技(上海)有限公司 A kind of abnormal behaviour recognition methods, device and storage medium
CN110287439A (en) * 2019-06-27 2019-09-27 电子科技大学 A kind of network behavior method for detecting abnormality based on LSTM
CN110430183A (en) * 2019-07-31 2019-11-08 福建师范大学 The MH-LSTM method for detecting abnormality of dialogue-based characteristic similarity
CN112101400A (en) * 2019-12-19 2020-12-18 国网江西省电力有限公司电力科学研究院 Industrial control system abnormality detection method, equipment, server and storage medium
CN111351664A (en) * 2020-02-13 2020-06-30 成都运达科技股份有限公司 Bearing temperature prediction and alarm diagnosis method based on LSTM model
CN111611218A (en) * 2020-04-24 2020-09-01 武汉大学 Distributed abnormal log automatic identification method based on deep learning
CN112862127A (en) * 2021-04-23 2021-05-28 北京瑞莱智慧科技有限公司 Sensor data exception handling method and device, electronic equipment and medium
CN112862127B (en) * 2021-04-23 2021-07-23 北京瑞莱智慧科技有限公司 Sensor data exception handling method and device, electronic equipment and medium
CN115952465A (en) * 2023-03-10 2023-04-11 畅捷通信息技术股份有限公司 Sensor data anomaly detection method and device and computer storage medium

Similar Documents

Publication Publication Date Title
CN106815639A (en) The abnormal point detecting method and device of flow data
WO2021042935A1 (en) Bearing service life prediction method based on hidden markov model and transfer learning
CN108900546A (en) The method and apparatus of time series Network anomaly detection based on LSTM
US20200231466A1 (en) Intelligent systems and methods for process and asset health diagnosis, anomoly detection and control in wastewater treatment plants or drinking water plants
CN106326984A (en) User intention identification method and device and automatic answering system
CN108664690A (en) Long-life electron device reliability lifetime estimation method under more stress based on depth belief network
US11593648B2 (en) Methods and systems for detection and isolation of bias in predictive models
CN109523021A (en) A kind of dynamic network Structure Prediction Methods based on long memory network in short-term
US11379685B2 (en) Machine learning classification system
CN108647643A (en) A kind of packed tower liquid flooding state on-line identification method based on deep learning
CN112116002A (en) Determination method, verification method and device of detection model
Rodríguez‐Borbón et al. Reliability estimation for accelerated life tests based on a Cox proportional hazard model with error effect
CN110263808B (en) Image emotion classification method based on LSTM network and attention mechanism
CN111949496B (en) Data detection method and device
Jin et al. Toward predictive fault tolerance in a core-router system: Anomaly detection using correlation-based time-series analysis
CN111079348A (en) Method and device for detecting slowly-varying signal
Yangzhen et al. A software reliability prediction model: Using improved long short term memory network
Prisacaru et al. Degradation prediction of electronic packages using machine learning
Ding et al. A zero-shot soft sensor modeling approach using adversarial learning for robustness against sensor fault
KR20220049573A (en) Distance-based learning trust model
Lo Predicting software reliability with support vector machines
CN113156473A (en) Self-adaptive discrimination method for satellite signal environment of information fusion positioning system
CN107038143A (en) Belt conveyer scale method for diagnosing faults based on improved multilayer artificial immune network model
CN105468887B (en) Data analysis system and method
CN115293249A (en) Power system typical scene probability prediction method based on dynamic time sequence prediction

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170609