CN110751169A

CN110751169A - Time sequence classification method based on relation change among multivariate variables

Info

Publication number: CN110751169A
Application number: CN201910833290.0A
Authority: CN
Inventors: 蔡瑞初; 陈嘉伟; 温雯; 郝志峰; 陈炳丰; 李梓健
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2019-09-04
Filing date: 2019-09-04
Publication date: 2020-02-04
Anticipated expiration: 2039-09-04
Also published as: CN110751169B

Abstract

The invention provides a time sequence classification method based on relation change among multivariate variables, which comprises the following steps: acquiring sample data from the observation data set, calculating a partial correlation coefficient between every two variables of the sample data, and constructing a partial correlation coefficient matrix; coding the partial correlation coefficient matrix through a convolutional neural network to obtain a corresponding characteristic diagram; respectively stretching each feature map into feature vectors to be circularly input into the long and short memory neural network, thereby obtaining a hidden state for capturing variable inter-relation change modes; and inputting the hidden state into a label classifier, outputting a corresponding sample class, and finishing the classification of the time sequence. The time sequence classification method based on the relation change among the multivariate variables fully considers the relation among different variables in time sequence data, performs classification based on the relation mode of the variables, fully expresses the change mode of the relation among the different variables in the time sequence data, and has better robustness on the input noise value and high classification precision.

Description

Time sequence classification method based on relation change among multivariate variables

Technical Field

The invention relates to the technical field of data mining, in particular to a time sequence classification method based on relation change among multivariate variables.

Background

Applications of time series data in the fields of industrial systems, information systems, medical health, financial markets, etc. are becoming more and more common today. Therefore, the task of time-series classification has become an important and valuable research topic, such as abnormality detection and the like. Conventional similarity-based time sequence classification methods such as K-nearest neighbor (KNN) and Dynamic Time Warping (DTW). However, this type of method is sensitive only to the values of the variables and does not take into account the relationships between the different variables.

Another type of method that is currently popular is to perform a series of feature transformations on time series data, so as to mine patterns therein for classification, such as multi-layer perceptrons (MLPs), long short term memory neural networks (LSTM), Convolutional Neural Networks (CNN), and so on. Although this kind of method implicitly captures the relationship between different variables on the feature space, it is difficult to characterize the change pattern of the relationship between the variables. In the time series classification problem, a certain type of variation mode of the relation between variables often represents a classification category. For example, in an information system, it is common that "CPU temperature" is increased by an increase in "CPU usage" of a certain server, and "fan speed" is increased by an increase in "CPU temperature", so that "CPU temperature" is maintained relatively stable when "CPU usage" continues to increase. It can be seen that the relationship between "CPU usage" and "CPU temperature" changes from being dependent to being independent during this time. However, when a fan of a server fails, then both "CPU temperature" and "fan speed" may be irrelevant, and an increase in "CPU usage" causes the "CPU temperature" to continue to rise, even causing the server to be down. Therefore, the relationship between "CPU usage" and "CPU temperature" has not been independent for this period of time.

The relationship between the variables in the two categories is different, but the current method cannot express and classify the variation well.

Disclosure of Invention

The invention provides a time sequence classification method based on the relation change among multiple variables, aiming at overcoming the technical defects that the existing time sequence data classification method cannot effectively express the change mode of the relation among the variables and is human.

In order to solve the technical problems, the technical scheme of the invention is as follows:

a method for time-series classification based on relation change among multivariate variables comprises the following steps:

s1: acquiring an observation data set with a label;

s2: acquiring sample data from the observation data set, calculating a partial correlation coefficient between every two variables of the sample data, and constructing a partial correlation coefficient matrix to obtain a partial correlation coefficient matrix at each moment;

s3: the partial correlation coefficient matrix at each moment is used as an input convolutional neural network CNN, and the convolutional neural network encodes the partial correlation coefficient matrix to obtain a corresponding characteristic diagram;

s4: respectively stretching each feature map into feature vectors to be circularly input into a long and short memory neural network (LSTM), thereby obtaining a hidden state for capturing a variable mode among variable relations;

s5: and inputting the hidden state into a label classifier, outputting a corresponding sample class, and finishing the classification of the time sequence.

Wherein, the step S1 specifically includes:

sampling at fixed time by using a data acquisition device of an industrial system or an information system; different index values are obtained at each sampling moment, the system state corresponding to the moment is represented by a label variable, and an observation data set can be acquired after the system runs for a period of time, wherein:

characterization of the observation dataset as X ═ X₁,x₂,Λ,x_m]Wherein m is the number of samples; set at time tSample data x_t∈RⁿI.e. containing n variables, and each sample data corresponds to a tag variable y_tWherein y is_t∈R。

Wherein, the step S2 specifically includes:

s21: obtaining sample data X with time length of w from observation data set_t＝[x_t-w+1,x_t-w+2,Λ,x_t]Wherein X is_tIs a time slice in X and is used for calculating partial correlation coefficient matrix P_t∈R^n×nAs a relation matrix between variables at time t;

s22: the time sequences of two variables i and j in the period of time are respectively set asThen one coefficient in the partial correlation coefficient matrix

The calculation method is as follows:

wherein,

is a covariance matrix sigma_tOf the inverse matrix of (d), and the covariance matrix sigma_tElement (1) ofThe calculation method is as follows:

wherein,

andrespectively, the mean of the two variables over the time period.

S23: obtaining a partial correlation coefficient matrix P at each moment according to the calculation mode of the step S22_tAnd is used for representing the relation between different variables at each moment.

Wherein, the step S3 specifically includes: a partial correlation coefficient matrix P with a time length of l_t-l+1,P_t-l+2,ΛP_tInputting the convolution neural network, and coding the partial correlation coefficient matrix by the convolution neural network to obtain corresponding l characteristic graphs and corresponding labels y at each moment_t。

Wherein, in the step S4, the hidden state h_tFor capturing the l variable inter-relationship variation patterns.

In step S5, the label classifier adopts a full connection layer, and outputs the obtained sample class

Wherein the method further comprises step S6: and (4) repeatedly performing the steps S3-S5 by using a gradient descent method by using the cross entropy of the output sample class as a loss function so as to improve the classification precision.

Compared with the prior art, the technical scheme of the invention has the beneficial effects that:

the time sequence classification method based on the relation change among the multivariate variables fully considers the relation among different variables in time sequence data and classifies based on the relation mode of the variables, improves the defects of the existing method, fully expresses the change mode of different variable relations in the time sequence data, has better robustness on the input noise value and high classification precision, and can be applied to the time sequence classification problem in the fields of industrial systems, information systems, medical health, financial markets and the like.

Drawings

FIG. 1 is a schematic flow diagram of the process of the present invention.

Detailed Description

The drawings are for illustrative purposes only and are not to be construed as limiting the patent;

for the purpose of better illustrating the embodiments, certain features of the drawings may be omitted, enlarged or reduced, and do not represent the size of an actual product;

it will be understood by those skilled in the art that certain well-known structures in the drawings and descriptions thereof may be omitted.

The technical solution of the present invention is further described below with reference to the accompanying drawings and examples.

Example 1

As shown in fig. 1, a method for time-series classification based on relationship change between multivariate variables comprises the following steps:

s1: acquiring an observation data set with a label;

Wherein, the step S1 specifically includes:

characterization of the observation dataset as X ═ X₁,x₂,Λ,x_m]Wherein m is the number of samples; let the sample at time tData x_t∈RⁿI.e. containing n variables, and each sample data corresponds to a tag variable y_tWherein y is_t∈R。

More specifically, the step S2 specifically includes:

s22: the time sequences of two variables i and j in the period of time are respectively set as

Then one coefficient in the partial correlation coefficient matrix

The calculation method is as follows:

wherein,

is a covariance matrix sigma_tOf the inverse matrix of (d), and the covariance matrix sigma_tElement (1) of

The calculation method is as follows:

wherein,

andrespectively, the mean of the two variables over the time period.

More specifically, the step S3 specifically includes: a partial correlation coefficient matrix P with a time length of l_t-l+1,P_t-l+2,ΛP_tInputting the convolution neural network, and coding the partial correlation coefficient matrix by the convolution neural network to obtain corresponding l characteristic graphs and corresponding labels y at each moment_t。

More specifically, in the step S4, the hidden state h_tFor capturing the l variable inter-relationship variation patterns.

More specifically, in step S5, the label classifier adopts a full connection layer, and outputs the obtained sample class

More specifically, the method further includes step S6: and (4) repeatedly performing the steps S3-S5 by using a gradient descent method by using the cross entropy of the output sample class as a loss function so as to improve the classification precision.

In the specific implementation process, the time sequence classification method based on the relation change among the multivariate variables fully considers the relation among different variables in time sequence data, and simultaneously classifies based on the relation mode of the variables, so that the method overcomes the defects of the existing method, fully expresses the change mode of different variable relations in the time sequence data, has better robustness on the input noise value, has high classification precision, and can be applied to the time sequence classification problem in the fields of industrial systems, information systems, medical health, financial markets and the like.

It should be understood that the above-described embodiments of the present invention are merely examples for clearly illustrating the present invention, and are not intended to limit the embodiments of the present invention. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the claims of the present invention.

Claims

1. A method for time-series classification based on relation change among multivariate variables is characterized by comprising the following steps:

s1: acquiring an observation data set with a label;

s3: the partial correlation coefficient matrix at each moment is used as an input convolutional neural network, and the convolutional neural network encodes the partial correlation coefficient matrix to obtain a corresponding characteristic diagram;

s4: respectively stretching each feature map into feature vectors to be circularly input into the long and short memory neural network, thereby obtaining a hidden state for capturing variable inter-relation change modes;

2. The method according to claim 1, wherein the step S1 is specifically performed by:

characterization of the observation dataset as X ═ X₁,x₂,Λ,x_m]Wherein m is the number of samples; let sample data x at time t_t∈RⁿI.e. containing n variables, with one for each sample dataIndividual tag variable y_tWherein y is_t∈R。

3. The method according to claim 2, wherein the step S2 is specifically performed by:

Then one coefficient in the partial correlation coefficient matrix

The calculation method is as follows:

wherein,

The calculation method is as follows:

wherein,

andrespectively, the mean of the two variables over the time period.

4. The method according to claim 3, wherein the step S3 is specifically performed by: a partial correlation coefficient matrix P with a time length of l_t-l+1,P_t-l+2,ΛP_tInputting the convolution neural network, and coding the partial correlation coefficient matrix by the convolution neural network to obtain corresponding l characteristic graphs and corresponding labels y at each moment_t。

5. The method according to claim 4, wherein in step S4, the hidden state h is hidden_tFor capturing the l variable inter-relationship variation patterns.

6. The method according to claim 5, wherein in step S5, the label classifier employs a full connection layer, and outputs the obtained sample class

7. The method for time-series classification based on the relation change between multivariate variables according to any one of claims 1-6, further comprising the step of S6: and (4) repeatedly performing the steps S3-S5 by using a gradient descent method by using the cross entropy of the output sample class as a loss function so as to improve the classification precision.