CN110008253B

CN110008253B - Industrial data association rule mining and abnormal working condition prediction method

Info

Publication number: CN110008253B
Application number: CN201910244856.6A
Authority: CN
Inventors: 徐正国; 王豆; 陈积明; 程鹏; 孙优贤
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2019-03-28
Filing date: 2019-03-28
Publication date: 2021-02-23
Anticipated expiration: 2039-03-28
Also published as: CN110008253A

Abstract

The invention discloses a method for mining association rules from industrial data and predicting abnormal working conditions, applicable to fault prediction and health management of industrial processes. The invention introduces association rule mining into industrial equipment fault prediction and uses an association rule mining algorithm to find the associations among operating parameters. Reflecting the characteristics of industrial data, the method starts from the variation trends of the equipment's operating parameters: it generates a transaction set with the variation trend as the primary index, mines association rules among the parameters from that transaction set, and introduces the mined rules into the prediction of abnormal working conditions of the industrial equipment to obtain a more accurate prediction result. The method is of practical value for fault prediction and health management in engineering.

Description

Industrial data association rule mining and abnormal working condition prediction method

Technical Field

The invention belongs to the technical field of reliability maintenance engineering, and relates to an industrial data association rule mining and abnormal working condition prediction method based on a two-stage frequent item set generation strategy.

Background

With the continuing emergence of complex systems and the growing demand for real-time monitoring of industrial processes, modern industrial equipment is often fitted with multiple sensors that monitor its operating state. At the same time, multiple fault modes may occur during operation, and a single fault may correspond to several symptoms. In this situation the information from a single sensor cannot fully reflect the operating state of the equipment, and fault prediction based on multi-sensor information has emerged in response. Fault prediction based on multi-sensor information aims to analyze the operating state of the equipment using the combined sensor information, so as to make more reliable diagnoses and predictions. With the continuing development of sensing technology, the use of multiple sensors for condition monitoring, fault diagnosis and prediction of equipment has become a trend.

In the field of fault prediction, work that combines association rule mining with fault prediction is still rare. For time series data, equipment failure is usually reflected in the parameters or in features extracted from them, and prediction is usually carried out on the variation trend of those parameters or features. Mining the association rules among the parameters therefore yields more complete parameter information, that is, more complete information on the operating state of the equipment, and provides a basis for subsequent prediction.

Disclosure of Invention

Aiming at the current situation of the prior art, the invention aims to solve the problem that the association rule of sensor data is rarely considered in the existing data-driven prediction technology, provides an equipment abnormal working condition prediction method based on the operation parameter association rule, and constructs a more applicable wavelet neural network to perform abnormal working condition prediction (fault prediction).

The concept of the present invention will now be explained as follows:

The invention uses association rules to describe the associations among the operating parameters of an industrial process, and studies abnormal working condition prediction based on association rule mining of time series data. To mine association rules for time series data at the sequence level, the invention provides a time series association rule mining algorithm with a two-stage frequent item set generation process. In the first stage, the variation trend information of the time series is extracted as the basic pattern for mining, and the frequent item sets of time series change patterns are found. In the second stage, on the basis of those frequent item sets, the frequent item sets whose basic pattern is the sequence itself are found, and association rules are mined for every pair of sequences. The system variables involved in the mined association rules are then used to predict abnormal working conditions, and the association rules are introduced into a wavelet neural network to improve prediction accuracy. Because the method takes the association rules among operating parameters into account, it can obtain a more accurate fault prediction result.

According to the invention concept, the invention provides an industrial data association rule mining and predicting method based on a two-stage frequent item set generation strategy, which comprises the following specific steps:

Step 1: performing piecewise linear representation and symbolization on the time series data, and constructing a discrete data set suitable for association rule mining;

Step 2: generating the frequent item sets of the data set using a two-stage frequent item set mining algorithm;

Step 3: generating association rules from the frequent item sets, and extracting the association rules that meet the minimum support and minimum confidence thresholds;

Step 4: introducing the association rule mining results into a wavelet neural network and predicting the abnormal working condition of the industrial equipment.

Based on the above scheme, the following implementation manner can be specifically adopted for each step:

Preferably, step 1 comprises the following substeps:

Step 1.1: the measurement time series of the sensors are denoted $X_k^i = \{x_1^i, x_2^i, \ldots, x_k^i\}$, $i = 1, 2, \ldots, N$, where N is the number of sensors and k is the length of the time series; the fitting start point is recorded as start and the fitting end point as end, with end = start + h and the window length h initialized to 2; the fitting error threshold is $\omega_E$;

Step 1.2: for each $X_k^i$, piecewise fitting is performed as follows:

1.2.1 initialize the segmentation point counter count = 1;

1.2.2 for each fitting start point start, in turn, perform steps 1) to 4):

1) calculate end = start + h;

2) fit the data $\{x_{start}^i, \ldots, x_{end}^i\}$ using the least squares method, and calculate the fitting error ERR;

3) if the fitting error ERR is not greater than the fitting error threshold $\omega_E$, set h = h + 1 and return to step 1);

4) if the fitting error ERR is greater than the fitting error threshold $\omega_E$, obtain the fitted line segment for $\{x_{start}^i, \ldots, x_{end}^i\}$, record the segmentation point, set start = start + h, reset h = 2, and set count = count + 1;

1.2.3 repeat step 1.2.2 until end is greater than k, obtaining the fitted piecewise-linear time series and the sequence $P^i$ formed by the segmentation points;
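The growing-window fitting of step 1.2 can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function names `fit_line` and `segment` are hypothetical, indices are 0-based, the fitting error is assumed to be the sum of squared residuals, and the final window is closed when it reaches the end of the series.

```python
def fit_line(ys):
    """Least-squares line through (0, ys[0]), (1, ys[1]), ...

    Returns (slope, intercept, error); the error is assumed to be the
    sum of squared residuals.
    """
    n = len(ys)
    mean_x = (n - 1) / 2
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(ys))
    slope = sxy / sxx if sxx else 0.0
    intercept = mean_y - slope * mean_x
    err = sum((y - (slope * x + intercept)) ** 2 for x, y in enumerate(ys))
    return slope, intercept, err


def segment(series, err_threshold, h0=2):
    """Grow each window until the fitting error exceeds the threshold, then cut."""
    k = len(series)
    start, h = 0, h0
    cut_points = [0]   # segmentation points (0-based indices)
    segments = []      # (start, end, slope, intercept), end inclusive
    while start < k - 1:
        end = min(start + h, k - 1)
        slope, intercept, err = fit_line(series[start:end + 1])
        if err <= err_threshold and end < k - 1:
            h += 1     # error still acceptable: extend the window
            continue
        # Error exceeded the threshold (or the series ended): close the segment.
        segments.append((start, end, slope, intercept))
        cut_points.append(end)
        start, h = end, h0   # start = start + h, reset h
    return segments, cut_points


# A rising ramp followed by a falling ramp.
segments, cut_points = segment([0, 1, 2, 3, 4, 5, 4, 3, 2, 1, 0], err_threshold=0.1)
```

Following the text, the segment is cut at the first end point whose fitting error exceeds the threshold, so the cut can land one sample past the visible turning point; cutting one sample earlier would be an alternative design.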

Step 1.3: the fitted time series of any sensor is denoted $Y_k = \{y_1, y_2, \ldots, y_k\}$. The trend and numerical information of each fitted line segment is extracted, and a fitted line segment $s_i$ is represented as a triple consisting of its slope $k_i$, its span on the time axis, and the growth rate $r_i$ of the data $\{y_j, y_{j+1}, \ldots, y_{j+h}\}$ covered by the segment, where j is the starting point of the segment;

all line segments of the piecewise-linear time series $Y_k$ are represented as triples, giving the triple sequence $S_n = \{s_1, s_2, \ldots, s_n\}$, where n is the number of segments into which the time series $X_k$ is divided;

Step 1.4: the line segments in the triple sequence are clustered and symbolized so that the symbols represent the different change patterns of the equipment or system. The similarity $d_{ij}$ between line segments $s_i$ and $s_j$ is described by a Euclidean distance weighted by $\omega_k$ and $\omega_r$; the smaller $d_{ij}$ is, the more similar the change patterns of the two line segments;

then, according to the similarity index $d_{ij}$, $S_n$ is clustered using the K-means algorithm, and line segments of the same class are assigned the same symbol to represent the change pattern of the operating parameter, giving the symbolized sequence $F_n = \{f_1, f_2, \ldots, f_n\}$, where $f_1, f_2, \ldots, f_n$ are the symbols assigned to the 1st, 2nd, …, n-th line segments;
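Steps 1.3 and 1.4 can be illustrated with the sketch below. Several details are assumptions, because the exact formulas are not reproduced in this text: the growth rate is taken as the relative change over the segment, the distance is taken as a weighted Euclidean distance on slope and growth rate only, and the K-means initialization is a simple deterministic pick. The function names are hypothetical.

```python
import math


def to_triple(ys, start, end):
    """Triple (slope, span, growth rate) of the fitted segment ys[start..end]."""
    span = end - start
    slope = (ys[end] - ys[start]) / span      # ys holds the fitted (linear) values
    base = abs(ys[start]) or 1.0              # assumption: avoid division by zero
    growth = (ys[end] - ys[start]) / base     # assumption: relative change
    return slope, span, growth


def distance(s_i, s_j, w_k=0.5, w_r=0.5):
    """Assumed form of d_ij: weighted Euclidean distance on slope and growth rate."""
    return math.sqrt(w_k * (s_i[0] - s_j[0]) ** 2 + w_r * (s_i[2] - s_j[2]) ** 2)


def kmeans_symbols(triples, n_clusters, w_k=0.5, w_r=0.5, n_iter=20):
    """Plain K-means over the triples; returns one letter per line segment."""
    # Deterministic initial centres: evenly spaced picks from the input.
    centers = [triples[i * len(triples) // n_clusters] for i in range(n_clusters)]
    labels = [0] * len(triples)
    for _ in range(n_iter):
        labels = [
            min(range(n_clusters), key=lambda c: distance(t, centers[c], w_k, w_r))
            for t in triples
        ]
        for c in range(n_clusters):
            members = [t for t, lab in zip(triples, labels) if lab == c]
            if members:   # keep the previous centre if a cluster becomes empty
                centers[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return [chr(ord("a") + lab) for lab in labels]


triples = [(1.0, 5, 0.5), (1.1, 4, 0.55), (-1.0, 5, -0.5), (-0.9, 6, -0.45), (1.05, 3, 0.5)]
symbols = kmeans_symbols(triples, n_clusters=2)
```

Rising and falling segments end up with different letters, which is the symbolized sequence that the later mining steps consume.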

Step 1.5: for every two sensor measurement time series $X_k^i$ and $X_k^j$, their segmentation point sequences $P^i$ and $P^j$ are merged into $P^{ij}$, where $n_{ij} - 1$ is the number of segmentation points after merging; the symbolized sequences of the two series are then re-segmented according to the merged segmentation points to obtain the reconstructed symbolic sequences of $X_k^i$ and $X_k^j$.

Preferably, step 2 comprises the following substeps:

Step 2.1: for measurement time series $X_k^i$ and $X_k^j$, corresponding to operating parameters $V^i$ and $V^j$ respectively, the reconstructed symbolic sequences obtained from step 1 form a transaction set, each transaction being formed from the corresponding symbols of the two sequences; the line segment class symbols contained in the two sequences are denoted a and b respectively, with subscripts indexing the classes; the minimum support thresholds of the two stages are denoted $minsup_1$ and $minsup_2$;

Step 2.2: the support of each item is calculated in a single scan of the data set to obtain the frequent 1-item sets, as follows:

2.2.1: let $\sigma(\cdot)$ be the support count of an item or item set, initially 0; a line segment class symbol is denoted $t_k$, where t stands for a or b;

2.2.2: for each transaction containing $t_k$, calculate $\sigma(t_k) = \sigma(t_k) + 1$;

2.2.3: for each $t_k$, if its support is not less than the minimum support threshold $minsup_1$, $t_k$ is considered a frequent 1-item set, and $t_k$ and its support count are retained; if its support is less than $minsup_1$, $t_k$ is not a frequent 1-item set;

Step 2.3: the frequent 1-item sets $t_k$ obtained in step 2.2 are combined into 2-item sets, and their support is calculated to find the frequent 2-item sets, as follows:

2.3.1: let $a_p$ and $b_q$ denote the items retained after step 2.2 from the line segment class symbols of the two sequences, respectively;

2.3.2: for each $\{a_p, b_q\}$, perform the following steps:

1) for each transaction in which $\{a_p, b_q\}$ occurs, calculate $\sigma(\{a_p, b_q\}) = \sigma(\{a_p, b_q\}) + 1$;

2) if its support is not less than $minsup_1$, $\{a_p, b_q\}$ is considered a frequent 2-item set, and $\{a_p, b_q\}$ and its support count are retained;

Step 2.4: using the frequent 2-item sets $\{a_p, b_q\}$ obtained in step 2.3, the support of every two operating parameters over the whole data set is calculated to obtain the frequent item sets at the parameter level, as follows: for the item set $\{V^i, V^j\}$ formed by every two operating parameters $V^i$ and $V^j$, calculate $\sigma(\{V^i, V^j\}) = \mathrm{sum}(\sigma(\{a_p, b_q\}))$; if its support is not less than the minimum support threshold $minsup_2$, retain $\{V^i, V^j\}$, record the corresponding support, and calculate $\sigma(V^i) = \mathrm{sum}(\sigma(a_p))$ and $\sigma(V^j) = \mathrm{sum}(\sigma(b_q))$.
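The two-stage procedure of steps 2.2 to 2.4 can be sketched for one pair of parameters as follows. The sketch assumes that the two symbolic sequences have already been re-segmented to the same length, that each transaction is the pair of symbols at the same position, and that support is the count divided by the number of transactions; the function name `two_stage_support` is hypothetical.

```python
from collections import Counter


def two_stage_support(seq_i, seq_j, minsup1, minsup2):
    """Return the parameter-level support counts for {V_i, V_j}, or None."""
    if len(seq_i) != len(seq_j):
        raise ValueError("sequences must be re-segmented to the same length first")
    n = len(seq_i)

    # Stage 1a: frequent 1-item sets among the symbols of each sequence.
    freq_a = {a: c for a, c in Counter(seq_i).items() if c / n >= minsup1}
    freq_b = {b: c for b, c in Counter(seq_j).items() if c / n >= minsup1}

    # Stage 1b: frequent 2-item sets {a_p, b_q} built from the retained items.
    pair_counts = Counter(
        (a, b) for a, b in zip(seq_i, seq_j) if a in freq_a and b in freq_b
    )
    freq_pairs = {p: c for p, c in pair_counts.items() if c / n >= minsup1}

    # Stage 2: parameter-level support is the sum over the frequent 2-item sets.
    sigma_ij = sum(freq_pairs.values())
    if sigma_ij / n < minsup2:
        return None
    return {
        "pairs": freq_pairs,
        "sigma_ij": sigma_ij,
        "sigma_i": sum(freq_a.values()),   # sum of sigma(a_p) over retained items
        "sigma_j": sum(freq_b.values()),   # sum of sigma(b_q) over retained items
    }


result = two_stage_support("aaaabbbbac", "xxxxyyyyyz", minsup1=0.2, minsup2=0.2)
```

In this toy input the rare symbols `c` and `z` are dropped in the first stage and the single `(a, y)` co-occurrence is dropped as an infrequent 2-item set, so only the two dominant pattern pairs contribute to the parameter-level count.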

Preferably, step 3 comprises the following substeps:

Step 3.1: each item set $\{V^i, V^j\}$ obtained in step 2 that satisfies the support threshold yields the following association rules: $V^j \to V^i$ and $V^i \to V^j$; the minimum confidence threshold is denoted minconf;

Step 3.2: the confidence of each generated association rule is calculated and the rules are extracted as follows: for each association rule $V^i \to V^j$, calculate

$$conf(V^i \to V^j) = \frac{\sigma(\{V^i, V^j\})}{\sigma(V^i)}$$

if $conf(V^i \to V^j)$ is not less than the minimum confidence threshold minconf, the association rule $V^i \to V^j$ is retained, and its support and confidence $\omega^i$ are recorded.
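Rule extraction in step 3 reduces to a ratio of support counts. The sketch below assumes the standard definition of confidence (joint support count divided by the support count of the antecedent) and a hypothetical input layout in which each parameter pair maps to its three counts.

```python
def extract_rules(support_counts, minconf):
    """Keep the rules whose confidence reaches minconf.

    support_counts maps (i, j) to (sigma_ij, sigma_i, sigma_j).
    Returns {(antecedent, consequent): confidence}.
    """
    rules = {}
    for (i, j), (sigma_ij, sigma_i, sigma_j) in support_counts.items():
        # Each frequent pair yields two candidate rules, one per direction.
        for antecedent, consequent, sigma_ante in ((i, j, sigma_i), (j, i, sigma_j)):
            confidence = sigma_ij / sigma_ante
            if confidence >= minconf:
                rules[(antecedent, consequent)] = confidence
    return rules


rules = extract_rules({("V1", "V2"): (8, 9, 12)}, minconf=0.7)
```

Here only the direction V1 → V2 survives, because the same joint count is divided by a larger antecedent count in the reverse direction.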

Preferably, step 4 comprises the following substeps:

Step 4.1: any set of associated parameters extracted from the association rules is denoted $\{V^1, V^2, \ldots, V^u\}$, where u is the number of associated parameters and $V^u$ is the rule consequent, i.e. the target parameter; each association rule $V^i \to V^u$, $i = 1, 2, \ldots, u-1$, has a confidence, denoted $\omega^i$; abnormal working conditions are predicted for the target parameter $V^u$ using a wavelet neural network;

Step 4.2: constructing training samples: the preset prediction step is denoted l; from the complete training data set formed by the associated parameters $\{V^1, V^2, \ldots, V^u\}$ extracted by association rule mining, a matrix $I_{train}$ is constructed as the training input of the neural network, each column of $I_{train}$ being one training input sample, together with the corresponding training output $O_{train}$;

Step 4.3: training the wavelet neural network with the constructed training samples: the input parameters are $V^i$, $i = 1, 2, \ldots, u-1$, and the output parameter is $V^u$; at network initialization, the confidences $\omega^i$, $i = 1, 2, \ldots, u-1$, obtained from the association rules are used to set the initial weights between the network input layer and the hidden layer;
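Steps 4.2 and 4.3 can be illustrated as follows. The exact layout of the training matrices is not reproduced in this text, so the sketch assumes that each input sample holds the antecedent parameters at time t and that the matching output is the target parameter l steps later; the rule confidences are simply copied into every hidden node's input weights. Both function names are hypothetical.

```python
def build_samples(inputs, target, step):
    """Build l-step-ahead training pairs.

    inputs: list of u-1 equally long series (the rule antecedents).
    target: series of the target parameter.
    Each inner list of i_train is one training input sample (one column
    of the input matrix); o_train holds the target value `step` samples later.
    """
    n = len(target) - step
    i_train = [[series[t] for series in inputs] for t in range(n)]
    o_train = [target[t + step] for t in range(n)]
    return i_train, o_train


def initial_input_weights(confidences, n_hidden):
    """Input-to-hidden weights initialized from the rule confidences."""
    return [list(confidences) for _ in range(n_hidden)]


i_train, o_train = build_samples(
    inputs=[[1, 2, 3, 4, 5], [10, 20, 30, 40, 50]],
    target=[0.1, 0.2, 0.3, 0.4, 0.5],
    step=2,
)
weights = initial_input_weights([0.75, 0.74], n_hidden=3)
```

Starting every hidden node from the same confidence-based weights gives inputs with stronger rules a larger initial influence; training is then free to move the weights away from these values.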

Step 4.4: predicting on new data: the preset abnormal working condition threshold is denoted $\omega_p$; for newly acquired sensor measurement data, the model trained in step 4.3 is used to make an l-step prediction, and if the drift of the predicted target parameter value relative to its initial normal value exceeds the threshold $\omega_p$, an abnormal working condition is judged to have occurred.
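The decision rule of step 4.4 can be sketched as a simple drift check. The sketch assumes that the drift is measured as a relative deviation from a known normal value of the target parameter; the function name and the sample numbers are hypothetical.

```python
def first_abnormal_index(predictions, normal_value, omega_p):
    """Index of the first prediction whose relative drift exceeds omega_p, or None."""
    for t, predicted in enumerate(predictions):
        drift = abs(predicted - normal_value) / abs(normal_value)
        if drift > omega_p:
            return t
    return None


# Hypothetical predicted values drifting away from a normal value of 100.
index = first_abnormal_index([100.0, 100.5, 101.0, 101.6, 102.0], 100.0, omega_p=0.015)
```

With a threshold of 0.015 the first three predictions stay within tolerance and the fourth is flagged.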

Preferably, before the equipment fails, the model is rebuilt and retrained each time a predetermined number of new measurement data have been collected, so as to obtain a more accurate prediction result.

The industrial data association rule mining and predicting method based on the two-stage frequent item set generation strategy can be used for a complex industrial system measured by a sensor. By mining the association rule of the operation parameters of the industrial equipment, the corresponding parameter association is obtained, and the parameter association is introduced into wavelet neural network prediction, so that a more accurate prediction effect can be obtained. The method provides firm support for subsequent equipment maintenance planning, is beneficial to equipment maintenance management with strict reliability requirements, and has wide prospects in the aspect of practical engineering application.

Drawings

FIG. 1 shows the predicted result of variable 7 of IDV (13) in the example and the comparison with the actual value;

FIG. 2 shows the predicted result of the variable 11 of IDV (13) in the example and the comparison with the actual value;

FIG. 3 shows the predicted error rate of IDV (13) variable 7 in the example;

FIG. 4 shows the predicted error rate of the IDV (13) variable 11 in the example.

Detailed Description

The embodiments of the present invention will now be further described with reference to the accompanying drawings.

The following example uses Tennessee Eastman (TE) process simulation data to illustrate the specific operating steps and to verify the effectiveness of the method.

The data set is sampled at 3-minute intervals and records the measurement of each variable taken by each sensor at every sampling instant. Under each operating condition (the normal operating state and the fault states under 21 preset faults), the simulation generates two types of data set: a training set and a test set. Each training set contains the measured values of all 52 variables over a 25-hour simulation run. Except for the training set collected in the normal operating state, each of the other 21 training sets has its fault introduced after the simulation has run for 1 hour, and only the measurements of the following 24 hours are recorded. The training set for the normal operating state therefore has 500 observation samples, and the training sets for the 21 fault states each have 480 observation samples. Each of the 22 test sets contains the variable measurements collected over a 48-hour simulation run, that is, 960 samples. In the 21 fault test sets the corresponding fault is introduced after the simulation has run for 8 hours, so the first 160 observation samples are normal data and the last 800 are fault data. In the TE process simulation model only IDV (13) is a slowly varying fault, so this example uses the IDV (13) data for the experiments. The specific procedure of the industrial data association rule mining and abnormal working condition prediction method is as follows:
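The sample counts quoted above follow directly from the 3-minute sampling interval, as the short check below shows. The helper name `samples_in` is hypothetical.

```python
SAMPLING_INTERVAL_MINUTES = 3


def samples_in(hours):
    """Number of observations recorded over the given number of hours."""
    return hours * 60 // SAMPLING_INTERVAL_MINUTES


normal_training = samples_in(25)         # full 25-hour run
fault_training = samples_in(24)          # only the 24 hours after the fault
test_total = samples_in(48)              # 48-hour test run
test_normal = samples_in(8)              # fault introduced after 8 hours
test_fault = test_total - test_normal    # remaining samples are fault data
```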

Step 1: performing piecewise linear representation and symbolization on the time series data, and constructing a discrete data set suitable for association rule mining. This specifically comprises the following substeps:

Step 1.1: the measurement time series of the sensors are denoted $X_k^i = \{x_1^i, x_2^i, \ldots, x_k^i\}$, $i = 1, 2, \ldots, N$, where N is the number of sensors and k is the length of the time series; the fitting start point is recorded as start and the fitting end point as end, with end = start + h and the window length h initialized to 2; the fitting error threshold is $\omega_E$. It should be noted that in the present invention i and j denote sensor numbers when used as superscripts, and denote only ordinal numbers, unrelated to the sensor numbers, when used as subscripts.

Step 1.2: for each $X_k^i$, piecewise fitting is performed as follows:

1.2.1 initialize the segmentation point counter count = 1;

1.2.2 for each fitting start point start, in turn, perform steps 1) to 4):

1) calculate end = start + h;

2) fit the data $\{x_{start}^i, \ldots, x_{end}^i\}$ using the least squares method, and calculate the fitting error ERR;

3) if the fitting error ERR is not greater than the fitting error threshold $\omega_E$, set h = h + 1 and return to step 1);

4) if the fitting error ERR is greater than the fitting error threshold $\omega_E$, obtain the fitted line segment for $\{x_{start}^i, \ldots, x_{end}^i\}$, record the segmentation point, set start = start + h, reset h = 2, and set count = count + 1;

1.2.3 repeat step 1.2.2 until end is greater than k, obtaining the piecewise-linear time series after least squares fitting and the sequence $P^i$ formed by the segmentation points;

Step 1.3: the fitted time series of any sensor is denoted $Y_k = \{y_1, y_2, \ldots, y_k\}$; it consists of a number of line segments fitted by the least squares method described above. The trend and numerical information of each fitted line segment is extracted, and a fitted line segment $s_i$ is represented as a triple consisting of its slope $k_i$, its span on the time axis, and the growth rate $r_i$ of the data covered by the segment, where j is the starting point of the segment;

Step 1.4: the line segments in the triple sequence are clustered and symbolized so that the symbols represent the different change patterns of the equipment or system, in preparation for the subsequent association rule mining. The similarity $d_{ij}$ between line segments $s_i$ and $s_j$ is described by the Euclidean distance;

then, according to the similarity index $d_{ij}$, $S_n$ is clustered using the K-means algorithm, and line segments of the same class are assigned the same symbol to represent the change pattern of the operating parameter, giving the symbolized sequence $F_n = \{f_1, f_2, \ldots, f_n\}$, where $f_1, f_2, \ldots, f_n$ are the symbols assigned to the 1st, 2nd, …, n-th line segments;

Step 1.5: for every two sensor measurement time series $X_k^i$ and $X_k^j$, their segmentation point sequences $P^i$ and $P^j$ are merged into $P^{ij}$, where $n_{ij} - 1$ is the number of segmentation points after merging; the symbolized sequences of the two series are then re-segmented according to the merged segmentation points to obtain the reconstructed symbolic sequences of $X_k^i$ and $X_k^j$.

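Step 2: generating the frequent item sets of the data set using a two-stage frequent item set mining algorithm. This specifically comprises the following substeps:

Step 2.1: for measurement time series $X_k^i$ and $X_k^j$, corresponding to operating parameters $V^i$ and $V^j$ respectively, the reconstructed symbolic sequences obtained from step 1 form a transaction set, each transaction being formed from the corresponding symbols of the two sequences; the line segment class symbols contained in the two sequences are denoted a and b respectively, with subscripts indexing the classes; the minimum support thresholds of the two stages are denoted $minsup_1$ and $minsup_2$. In this example, the minimum support thresholds are set to $minsup_1 = 0.2$ and $minsup_2 = 0.2$.

Step 2.2: the support of each item is calculated in a single scan of the data set to obtain the frequent 1-item sets, as follows:

2.2.1: let $\sigma(\cdot)$ be the support count of an item or item set, initially 0; a line segment class symbol is denoted $t_k$, where t stands for a or b;

2.2.2: for each transaction containing $t_k$, calculate $\sigma(t_k) = \sigma(t_k) + 1$;

2.2.3: for each $t_k$, if its support is not less than the minimum support threshold $minsup_1$, $t_k$ is considered a frequent 1-item set, and $t_k$ and its support count are retained; otherwise $t_k$ is not a frequent 1-item set;

Step 2.3: the frequent 1-item sets $t_k$ obtained in step 2.2 are combined into 2-item sets, and their support is calculated to find the frequent 2-item sets, as follows:

2.3.1: let $a_p$ and $b_q$ denote the items retained after step 2.2 from the line segment class symbols of the two sequences, respectively;

2.3.2: for each $\{a_p, b_q\}$, perform the following steps:

1) for each transaction in which $\{a_p, b_q\}$ occurs, calculate $\sigma(\{a_p, b_q\}) = \sigma(\{a_p, b_q\}) + 1$;

2) if its support is not less than $minsup_1$, $\{a_p, b_q\}$ is considered a frequent 2-item set, and $\{a_p, b_q\}$ and its support count are retained;

Step 2.4: using the frequent 2-item sets $\{a_p, b_q\}$ obtained in step 2.3, the support of every two operating parameters over the whole data set is calculated to obtain the frequent item sets at the parameter level: for the item set $\{V^i, V^j\}$ formed by every two operating parameters, calculate $\sigma(\{V^i, V^j\}) = \mathrm{sum}(\sigma(\{a_p, b_q\}))$; if its support is not less than the minimum support threshold $minsup_2$, retain $\{V^i, V^j\}$, record the corresponding support, and calculate $\sigma(V^i) = \mathrm{sum}(\sigma(a_p))$ and $\sigma(V^j) = \mathrm{sum}(\sigma(b_q))$.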

Step 3: generating association rules from the frequent item sets, and extracting the association rules that meet the minimum support and minimum confidence thresholds. This specifically comprises the following substeps:

Step 3.1: each item set $\{V^i, V^j\}$ obtained in step 2 that satisfies the support threshold yields the following association rules: $V^j \to V^i$ and $V^i \to V^j$; the minimum confidence threshold is denoted minconf. In this example, the minimum confidence threshold is set to minconf = 0.7;

Step 3.2: the confidence of each generated association rule is calculated: for each association rule $V^i \to V^j$, if $conf(V^i \to V^j)$ is not less than the minimum confidence threshold minconf, the association rule $V^i \to V^j$ is retained, and its support and confidence $\omega^i$ are recorded.

In this step the association rules satisfying the threshold conditions are generated; some of the extracted associated parameters and their confidence values are shown in Table 1. Based on the results in Table 1, this example performs prediction with variable 7 and variable 11 as the target parameters.

Step 4: introducing the association rule mining results into a wavelet neural network and predicting the abnormal working condition of the industrial equipment. This specifically comprises the following substeps:

Step 4.1: any set of associated parameters extracted from the association rules is denoted $\{V^1, V^2, \ldots, V^u\}$, where u is the number of associated parameters and $V^u$ is the rule consequent, i.e. the target parameter; each association rule $V^i \to V^u$, $i = 1, 2, \ldots, u-1$, has a confidence, denoted $\omega^i$; abnormal working conditions are predicted for the target parameter $V^u$ using a wavelet neural network;

Step 4.2: constructing training samples: the preset prediction step is denoted l, which in this example is set to 10. From the complete training data set formed by the associated parameters $\{V^1, V^2, \ldots, V^u\}$ extracted by association rule mining, the training input $I_{train}$ and training output $O_{train}$ of the neural network are constructed. In particular, the training set here uses not only the fault data of the variables related to IDV (13) but also the data of those variables under normal operating conditions.

Step 4.3: training the wavelet neural network with the constructed training samples: the input parameters are $V^i$, $i = 1, 2, \ldots, u-1$, and the output parameter is $V^u$; at network initialization, the confidences $\omega^i$, $i = 1, 2, \ldots, u-1$, obtained from the association rules are used to set the initial weights between the network input layer and the hidden layer. In this example, for variable 7 the input layer has 4 nodes and the hidden layer has 8 nodes; for variable 11 the input layer has 3 nodes and the hidden layer has 6 nodes. The output layer has 1 node in both cases, the wavelet basis function is the Morlet mother wavelet in both cases, and the corresponding confidence values in Table 1 are used as the initial weights between the input layer and the hidden layer of the neural network;
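A forward pass through a wavelet network with Morlet hidden nodes and confidence-initialized input weights might look like the sketch below. It is an illustration under stated assumptions, not the network used in the example: the Morlet form cos(1.75x)·exp(−x²/2) is a commonly used real-valued variant, the per-node scale and shift values and the output weights are arbitrary starting values, and only the three variable 7 confidences listed in Table 1 are used as inputs, whereas the example's input layer has 4 nodes.

```python
import math


def morlet(x):
    """Real-valued Morlet mother wavelet (assumed form)."""
    return math.cos(1.75 * x) * math.exp(-0.5 * x * x)


def init_network(confidences, n_hidden):
    """Every hidden node starts with the rule confidences as its input weights."""
    return {
        "w_in": [list(confidences) for _ in range(n_hidden)],
        # Assumption: distinct scale and shift per node so nodes are not identical.
        "scale": [1.0 + 0.5 * j for j in range(n_hidden)],
        "shift": [0.1 * j for j in range(n_hidden)],
        "w_out": [1.0 / n_hidden] * n_hidden,
    }


def forward(net, x):
    """Single-output forward pass: weighted sum of Morlet hidden activations."""
    hidden = []
    for weights, scale, shift in zip(net["w_in"], net["scale"], net["shift"]):
        z = sum(w * xi for w, xi in zip(weights, x))
        hidden.append(morlet((z - shift) / scale))
    return sum(v * h for v, h in zip(net["w_out"], hidden))


# Confidences of the rules with variable 7 as consequent (Table 1), 8 hidden nodes.
net = init_network([0.7527, 0.7446, 0.7017], n_hidden=8)
y = forward(net, [0.1, -0.2, 0.05])
```

Training would then adjust the input weights, output weights, scales and shifts, for example by gradient descent; the confidences only determine where that search starts.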

Step 4.4: predicting on new data: the preset abnormal working condition threshold is denoted $\omega_p$; for newly acquired sensor measurement data, the model trained in step 4.3 is used to make an l-step prediction, and if the drift of the predicted target parameter value relative to its initial normal value exceeds the threshold $\omega_p$, an abnormal working condition is judged to have occurred. Before the equipment fails, the model is rebuilt and retrained each time a predetermined number $N^l$ of new measurement data have been collected, so as to obtain more accurate predictions; $N^l$ depends on the sensor sampling frequency and on the requirements of the actual industrial site. This example uses the first 300 data points of the test set (960 sample points in total) to verify the prediction and updates the neural network after every 10 data points. The threshold for the occurrence of an abnormal working condition (failure), i.e. the fraction by which a parameter deviates from its normal value, is set to $\omega_p = 0.015$.

Table 1 Association rules

Rule antecedent	Rule consequent	Confidence
Variable 13	Variable 7	0.7527
Variable 16	Variable 7	0.7446
Variable 36	Variable 7	0.7017
Variable 35	Variable 11	0.7513
Variable 36	Variable 11	0.7390

Table 2 Total prediction error rate

Target parameter	With association rules	Without association rules
Variable 7	1.0482	1.8548
Variable 11	0.8536	1.2135

Fig. 1 and Fig. 2 show the prediction results for variable 7 and variable 11. To verify the benefit of introducing the association rules, the results are compared with the predictions of a neural network that does not use the association rules. In Fig. 1 and Fig. 2, the vertical solid line marks the actual time at which the abnormal working condition occurs under the chosen threshold, and the two vertical broken lines mark the occurrence times predicted with and without the association rules, respectively. As can be seen from Fig. 1 and Fig. 2, the predictions obtained by the method of the invention track the true values more closely. The prediction is especially good on the first half of the test data, because that part is operating data in the normal state, for which the training set is relatively complete and the values are relatively concentrated. The method also gives a better prediction of the failure time: in Fig. 1 the predicted time lags the true time by 8 sampling points, and in Fig. 2 by 5 sampling points. Compared with the prediction without association rules, the method of the invention gives a more accurate result. The prediction error rates for variables 7 and 11 are shown in Fig. 3 and Fig. 4. To quantify the results further, the total prediction error rate was calculated and is shown in Table 2: introducing the association rules reduces the total prediction error rate from 1.8548 to 1.0482 for variable 7 and from 1.2135 to 0.8536 for variable 11.
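The size of the improvement reported in Table 2 can be expressed as a relative reduction. The short calculation below uses only the four values from the table; the helper name `relative_reduction` is hypothetical.

```python
# Total prediction error rates from Table 2: (with rules, without rules).
TABLE_2 = {
    "Variable 7": (1.0482, 1.8548),
    "Variable 11": (0.8536, 1.2135),
}


def relative_reduction(with_rules, without_rules):
    """Fraction by which the error rate drops when association rules are used."""
    return (without_rules - with_rules) / without_rules


reductions = {name: relative_reduction(*values) for name, values in TABLE_2.items()}
for name, value in reductions.items():
    print(f"{name}: {value:.1%} lower total prediction error rate")
```

On these figures the reduction is roughly 43% for variable 7 and roughly 30% for variable 11.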

Claims

1. A method for mining industrial data association rules and predicting abnormal working conditions is characterized by comprising the following specific steps:

Step 4: introducing the association rule mining results into a wavelet neural network and predicting the abnormal working condition of the industrial equipment;

the step 1 comprises the following substeps:

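Step 1.1: the measurement time series of the sensors are denoted $X_k^i = \{x_1^i, x_2^i, \ldots, x_k^i\}$, $i = 1, 2, \ldots, N$, where N is the number of sensors and k is the length of the time series; the fitting start point is recorded as start and the fitting end point as end, with end = start + h and h initialized to 2; the fitting error threshold is $\omega_E$;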

Step 1.2: for each

The piecewise fitting was performed as follows:

1.2.1 initializing a segmentation point count value of 1;

1.2.2 in turn for each starting point of the fit

Performing step 1) -step 4):

1) firstly, calculating end as start + h;

2) for data

Fitting by using a least square method, and calculating a fitting error ERR;

Line segment fitting sequence of

Recording the segmentation point when the start is equal to start + h

Resetting h to 2, count to count + 1;

And segmentation point

Composed sequence of segmentation points Pⁱ；

Step 1.3: time series after fitting any sensor

Wherein k is_iWhich represents the slope of the line segment,

j is the starting point of the line segment;

step 1.5: measuring time sequence for every two sensors

And

And

And

2. the method for mining industrial data association rules and predicting abnormal conditions as claimed in claim 1, wherein the step 2 comprises the following sub-steps:

step 2.1: for measuring time series

And

respectively corresponding operating parameters VⁱAnd V^jThe symbolized sequence of the measurement time sequence obtained from step 1 is

And

from which a transaction set is formed, i.e. each transaction is recorded as

And

the line segment type symbols included in (1) are respectively marked as

And

Is denoted by the class symbol t_kT represents a or b;

2.2.2: for each transaction

Calculating σ (t)_k)＝σ(t_k)+1；

2.2.3: for each t_kIf, if

And

the item retained in (1);

2.3.2 for each { a_p,b_qExecuting the following steps:

1) for each one exists in

Of (1) { a_p,b_q}, calculate σ ({ a)_p,b_q})＝σ({a_p,b_q})+1

2) If it is not

3. The method for mining industrial data association rules and predicting abnormal conditions as claimed in claim 2, wherein the step 3 comprises the following sub-steps:

if $conf(V^i \to V^j)$ is not less than the minimum confidence threshold minconf, the association rule $V^i \to V^j$ is retained, and its support and confidence $\omega^i$ are recorded.

4. The method for mining industrial data association rules and predicting abnormal operating conditions as claimed in claim 3, wherein the step 4 comprises the following sub-steps:

constructing the following matrix $I_{train}$ as the training input of the neural network:

5. The method as claimed in claim 1, wherein before the equipment fails, the model is reconstructed and trained after a predetermined number of measurement data are updated with the update of the data, so as to obtain a more accurate prediction result.