CN110298385B - exergy information and incremental SVDD (singular value decomposition) based online early fault detection method - Google Patents
exergy information and incremental SVDD (singular value decomposition) based online early fault detection method Download PDFInfo
- Publication number
- CN110298385B CN110298385B CN201910486042.3A CN201910486042A CN110298385B CN 110298385 B CN110298385 B CN 110298385B CN 201910486042 A CN201910486042 A CN 201910486042A CN 110298385 B CN110298385 B CN 110298385B
- Authority
- CN
- China
- Prior art keywords
- sample
- svdd
- incremental
- model
- support vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/19—Recognition using electronic means
- G06V30/192—Recognition using electronic means using simultaneous comparisons or correlations of the image signals with a plurality of references
- G06V30/194—References adjustable by an adaptive method, e.g. learning
Abstract
The invention provides a method based onAn online early failure detection method of information and increment SVDD. Based onAn online early fault detection method for describing SVDD by information and incremental support vector data comprises the steps of firstly standardizing process measurement data under normal working conditions to obtain a process training sample set and corresponding process training sample setsAn information value; calculating each measured variable andscreening out characteristic variables related to energy efficiency according to the accumulated mutual information contribution rate by using mutual information values between effects; establishing an SVDD initial model, and respectively carrying out self-adaptive updating of mutual information values and updating of an incremental SVDD model on newly added samples in the process to obtain an incremental SVDD state model under a normal working condition; and obtaining models in different states according to the measurement data in different states, and using the models for online early fault detection. The invention combines the advantages of an energy-based method and a data-based driving method, not only can solve the problem of time-varying characteristics of system parameters in the process, but also can effectively extract energy characteristic variables in the process, improve the speed and the precision of early fault detection and effectively realize the on-line detection of early faults.
Description
Technical Field
Background
The SVDD is a single-class classification algorithm based on a support domain, has intuitive data description and good popularization performance, can solve the problem that fault samples are difficult to obtain in the actual process, and is widely applied to the fields of fault diagnosis and the like. However, SVDD needs to solve the quadratic programming problem, the training complexity is high, and the training complexity is exponential to the number of samples. In the SVDD fault detection research, in order to reduce the SVDD computation complexity and improve the effectiveness of sample data, a feature sample is extracted by performing feature dimension reduction by using a statistical method, and then a SVDD classification model is established by using the feature sample to realize fault detection.As a concept combining the first and second laws of thermodynamics, it can be used to better understand the process, quantify the direction of low efficiency and differentiate the quality of energy, it is a common concept in many fields such as physics, chemistry, mechanical mechanics, etc., and it adoptsThe concept of (2) can also reduce data dimensionality. Based onThe information method can effectively reduce the modeling workload, increase the calculation efficiency, and maintain the similarity to a great extent while reducing the dimension of the model.Is effective in chemical process systemThe concrete expression of the information and the evaluation index of the overall energy quality of the process are based onThe information extraction method is to extract the information in the processThe effect is extracted, thereby achieving the effect of reducing the dimension.
However, in the actual industrial production process, the system process parameters change along with the advance of time, the fault characteristics change along with the change of the system process parameters, the fault detection model needs to train a new sample, at this time, the SVDD has to abandon the original trained model, new data needs to be retrained together with historical data, the algorithm complexity is known, and the calculation complexity of the algorithm increases exponentially along with the continuous update of the new data sample. Once a large amount of data samples are trained online, a large amount of computation time and storage space are wasted in the training process of the SVDD, so that the algorithm cannot meet the requirement of real-time updating of the system. Compared with the traditional batch SVDD, the incremental learning technology can inherit the learned knowledge and update the model according to the new data sample on the basis of the original model, so that the model knowledge has accumulativeness, and the time-varying problem of the actual process can be solved. Meanwhile, the historical energy characteristic variables cannot be applied to new process conditions and new data, and the energy characteristic variables need to be extracted from the sets of the historical data and the new data again. Typically, the most recent measured variable data highlights the characteristics of the current process system, while older historical data reflects system characteristics that are more distant from the most recent system state. Therefore, in order to reflect the current characteristics of the system in time, the influence of the historical data is reduced by adding weights to the historical data, and a forgetting factor can be added. In addition, the forgetting factor needs to be reasonably adjusted according to the change condition of the current system parameters. If the on-line extraction can be effectively carried out from the sample setInformation, the fault detection algorithm is used after the energy characteristic sample set is constructed, and the early fault detection efficiency can be obviously improved.
Disclosure of Invention
Aiming at the problem of time-varying characteristics of system parameters in the actual process and the problem that the screened energy characteristics are difficult to reflect the dynamic characteristics of the process, the invention respectively adopts an incremental learning algorithm and an energy characteristic extraction method based on a variable forgetting factor, and the method can adaptively extract the faults in the processAnd meanwhile, an incremental detection model which is continuously updated can be established, so that an early fault detection method is provided.
Based onThe online early fault detection method of the information and increment SVDD comprises the following steps:
(1) collecting process measurement variables andthe effect sample is standardized to obtainAnd Y0(ii) a The method comprises the following steps of performing data preprocessing on a sample, wherein the preprocessing is mainly performed by adopting data standardization, and the specific implementation steps are as follows:
for training sample data x1,x2,…xNSample xiThe calculation formula of the normalization process is as follows:
wherein x isi、Respectively representing the ith original off-line sample and the normalized sample, wherein theta is the arithmetic mean of all sample data, and sigma is the variance of all samples. Through standardization processing, detection errors caused by variable measuring ranges can be eliminated;
(2) in the calculation step 1And Y0With different measured variables from each otherMutual information value of effectsScreening and selecting according to the principle of accumulated mutual information contribution rateEfficient most relevant set of energy feature sample set X0The detailed steps for establishing the energy characteristic sample are as follows:
in order to reduce the influence of the sample size on mutual information estimation, a K-nearest neighbor method is adopted to calculate a mutual information value between two variables:
wherein N is the number of total samples, k is the number of neighbors, sxAnd syAre respectively represented asAnd Y0The number of nearest neighbor samples in the subspace, phi (·) is a digamma function. Meanwhile, in order to ensure that the mutual information can effectively extract the energy information, the cumulative mutual information contribution degree of the selected g measurement variables can be as follows:
CPMI indicates that the first g features are includedEffect information accounting systemThe ratio of the effective information is generally used to determine the number of features to be screened. Since the mutual information of each feature is greater than 0, the CPMI monotonically increases within the value range of the number of the screened features. In order to achieve relatively good feature dimension reduction effect, CPMI is required to be larger than a specified control limit;
(3) energy feature sample set X obtained in step 20And constructing an initial SVDD model gamma0The detailed steps for establishing the model are as follows:
given a training set x1,x2,…,xN]Where N is the number of samples. a and R represent the center and radius of the hypersphere, respectively. According to the principle of minimizing the structural risk, the hypersphere radius minimization problem can be described as the following quadratic programming problem with inequality constraint, and simultaneously, a relaxation variable xi is introducediAnd a penalty factor C:
for the quadratic programming problem with the constraint condition, a lagrangian multiplier can be introduced, the quadratic programming problem is often easier to solve after being converted into a dual form, and a kernel function can be introduced to project an original space to a high-dimensional space so as to solve the nonlinear problem:
solving by a quadratic programming Optimization (SMO) algorithm to obtain an optimal solution alphaiThereby obtaining an initial SVDD modeType gamma0The model parameters of (1);
(4) introducing incremental samplesAnd Y1Then, self-adaptively updating the mutual information value corresponding to the incremental sample according to the energy feature extraction rule based on the variable forgetting factorScreening out an energy feature set of the incremental sample according to a feature dimension reduction principleThe self-adaptive energy feature extraction method comprises the following steps:
after mutual information values corresponding to the newly added data samples are added with a forgetting factor rho, the newly added measurement samples at the t +1 th timeAnd correspondingEffect data sample Yt+1The mutual information value between the two is as follows:
where rho epsilon (0, 1)],Representing the column vector formed by the characteristics of the jth measured variable in the t +1 th new sample, m refers to the number of the measured variables,H(Yt+1)、respectively representing the information entropy of the variable and the joint entropy value between the two.
The variable forgetting factor method can adaptively adjust the size of the current forgetting factor according to the parameter change of the time-varying system, and the variable forgetting factor ρ of the mutual information value of the newly added sample at the (k + 1) th time is as follows:
in the formula CPMIt+1The cumulative mutual information contribution rate g of the characteristic sample corresponding to the t +1 th newly added datat+1The value is selected according to the cumulative mutual information contribution rate of the current sample, CPMIt+1The value is typically greater than 0.85. Finally, the CPMI from the current sample sett+1Screening out a new energy feature set
(5) According to the initial SVDD model gamma of step 30And step 4) energy characteristic sampleUpdating to obtain an incremental SVDD model gamma1The specific steps of incremental updating are as follows:
1) statistics and statistical limits;
given sample set X ═ X1,x2,…,xN]The dual form of the SVDD hypersphere radius optimization problem is as follows:
whereinDelta is the optimal compensation factor and the kernel function can be recorded as Kij=K(xi,xj) (ii) a From the KKT condition, W is the optimum solution for alphaiThe first derivative of δ needs to satisfy the following condition:
wherein Ω (x)i) Is a sample discriminant function of SVDD.
As can be seen from equation (10), the KKT condition of SVDD generally divides the training sample set into three categories: an intra-sphere non-support vector R, a standard support vector S, and a boundary support vector E. For new sample x just addeduThe model parameters need to be changed to make the extended sample set reach the KKT condition again, so as to obtain a new optimal solution, where the model coefficients change as:
wherein alpha isuIs the new sample x just addeduCorresponding Lagrange multiplier, Δ αuFor the variance of Lagrange multiplier of new sample, Delta alphajAnd adding the Lagrange multiplier variable quantity of the original sample set for the new sample. The standard support vector model coefficient variance can be written as:
let T equal to K-1For samples in the E, R set, its κi≡ 0, known from formula (13):
In the formula (I), the compound is shown in the specification,is a model boundary sensitivity factor, for samples within the S set, whichWhen the compound is brought into the formula (12), the following components are included:
2) sample attribute migration;
the sample attribute migration is based on the delta sample model coefficient variation Δ αuAnd adjusting the change of the original sample model coefficient to enable the extended sample set to satisfy the KKT condition again so as to tend to a new equilibrium state. In the process of migrating the attributes of the incremental samples, the attributes all meet delta alphau>0, six different sample attribute migration scenarios are introduced below:
①κsattribute value set corresponding to standard support vector setThe s-th number in (1) can be calculated by the formula (14)sIf it satisfies the upper limit Δ αs≤C-αsThen Δ αuAnd kappasSame number, xsChange from standard support vector to boundary support vector:
wherein λ is a migration factor.
If delta alphasAt a lower limit Δ αs≥-αsThen, delta αuAnd kappasOpposite sign, xsSupported by a standardVector becomes the intra-sphere non-support vector:
③ for the non-support vector x in the ballrThen there is dr>0,The r-th element as the non-SVM sensitization factor set in the sphere can be calculated according to the formula (16) to obtain delta dr. If Δ dr<0, intra-sphere non-support vector xrWould become the standard support vector:
support vector x for boundarieseThen there is dr<0, the e-th element of the set of non-SVM sensitivity factors within the sphere can be represented asCan be calculated to obtainIf Δ de>0, boundary support vector xeWould become the standard support vector:
commonly set increment sample xuWhen d is zerouIf x is greater than or equal to 0, the sample x is considereduAnd the model sample coefficients are not required to be updated if the target class samples are within the sphere. When d isu<0, incremental sample xuThen it becomes the standard support vector:
alpha in the incremental SVDD model training processuThe upper bound value is a penalty factor C when alpha isuWhen C is less than or equal to C, increment sample alphauWill become a boundary support vector with delta coefficient variation of:
λma=C-αu (22)
in incremental SVDD, let Δ αmax=min{λsp,λsm,λrs,λes,λu,λmaAs incremental samples alphauThe coefficient variation of the original model sample obtained by the equation (14) is delta alphai;
3) Updating a T matrix;
the T matrix model in equation (13) also needs to be updated in the incremental iteration process, and in order to reduce the complexity of calculating the inverse matrix, the following update strategy is adopted. For an arbitrary sample xl∈R∪E∪{xuBecome the standard support vector with T +1 updates of the process Tt+1Comprises the following steps:
when the standard supports vector xlWhen leaving set S, Tt+1Comprises the following steps:
(6) continuously introducing an incremental sample, and dynamically updating an incremental SVDD model gamma t times according to the steps 4) and 5)t. And for different fault state samples, calculating the incremental SVDD model gamma of different fault statesth;
(7) To online test sample setData standardization and energy feature extraction are carried out, and h fault state models corresponding to the data standardization and the energy feature extraction are calculatedRelative distance gamma of the moldh(xhi) Based on the principle of minimum relative distance, the fault state category corresponding to the sample is detected and identified, and the detailed steps are as follows:
the relative distance-based discrimination method can strengthen the discrimination rule and improve the fault state detection rate of the algorithm by dividing by the radius of the hypersphere in each fault state model. The relative distance can not only detect the fault, but also judge the severity of the fault. Using relative distance as criterion:
compared with the prior art, the invention has the beneficial effects that:
the invention combines the advantages of an energy-based method and a data-based driving method, not only can solve the problem of time-varying characteristics of system parameters in the process, but also can effectively extract energy characteristic variables in the process, improve the speed and the precision of early fault detection and effectively realize the on-line detection of early faults.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a schematic diagram of a commercial propylene rectification column;
FIG. 3 is a graph comparing incremental and batch model training times;
FIGS. 4 a-4 c are online early failure detection results of a prior SVDD method, wherein FIG. 4a is the sample's performance on a normal state model, FIG. 4b is the sample's performance on a moderate degradation state model, and FIG. 4c is the sample's performance on a severe degradation state model;
FIGS. 5 a-5 c are online early failure detection results of the incremental SVDD method, wherein FIG. 5a is the sample's performance on the normal state model, FIG. 5b is the sample's performance on the moderate degradation state model, and FIG. 5c is the sample's performance on the severe degradation state model;
fig. 6a to 6c are online early failure detection results of the method of the present invention, wherein fig. 6a is the representation of a sample on a normal state model, fig. 6b is the representation of a sample on a moderate degradation state model, and fig. 6c is the representation of a sample on a severe degradation state model.
Detailed Description
The invention will now be described in detail with reference to the plant examples and the accompanying drawings, in order to provide a thorough understanding of the effectiveness of the process and method of the invention as applied and practiced by way of example.
Based onThe information and increment SVDD online early fault detection method has the following specific implementation mode:
the industrial propylene rectifying tower is used as a research object, the feeding of the tower mainly comprises propane and propylene, the relative volatility of the propane and the propylene is close to 1, and in order to achieve a better separation effect, the two towers are connected in series for operation. The flow of a commercial propylene rectification column is shown in FIG. 2. The No. 1 propylene rectifying tower is equivalent to a stripping section in the rectifying process, the tower is provided with 77 layers of tower plates, and the tower bottom discharge is used as circulating propane under the condition of flow control. And pumping tower kettle materials of the No. 2 propylene rectifying tower into the top of the No. 1 propylene rectifying tower through a reflux pump to serve as the top reflux of the No. 1 propylene rectifying tower. The propylene product is passed through an overhead cooler and then to a propylene product guard bed to remove some of the oxide impurities. The industrial rectification case is based on ASPEN simulation of a propylene rectification tower of a certain plant, and the propylene rectification process is simulated by using Aspen Tech 8.4 software. In order to simulate the scaling fault of the heat exchanger in the rectification process, a double-material-flow heat exchanger module with detailed calculation is adopted in the process, the total heat transfer coefficient is calculated according to the membrane coefficient and the pipe wall resistance calculated by the geometric structure, and the condenser scaling factor is selected to generate step change to simulate the early fault of the rectification process. If the fouling factor of the condenser is increased, the heat transfer efficiency is reduced, the heat flux of the heat exchanger is reduced, and the process is effectiveIs wasted and the waste is generated,the effect will be reduced。The effect is taken as an index for measuring the effective energy utilization efficiency of the process, and the early failure condition of the rectification process can be obviously reflected, so that the step change of the fouling factor of the condenser is adopted to indicate whether the early failure occurs or not. During the simulation, the fouling factor parameter of the condenser of column 2 was varied from 0 to 8.6X 10-3(sqm K)/Watt to simulate early failure during rectification.
(1) Collecting process measurement variables andthe effect sample is standardized to obtainAnd Y0(ii) a The method comprises the following steps of performing data preprocessing on a sample, wherein the preprocessing is mainly performed by adopting data standardization, and the specific implementation steps are as follows:
for training sample data x1,x2,…xNSample xiThe calculation formula of the normalization process is as follows:
wherein x isi、Respectively representing the ith original off-line sample and the normalized sample, wherein theta is the arithmetic mean of all sample data, and sigma is the variance of all samples. Through standardization processing, detection errors caused by variable measuring ranges can be eliminated;
(2) in the calculation step 1And Y0With different measured variables from each otherMutual information value of effectsScreening and selecting according to the principle of accumulated mutual information contribution rateEfficient most relevant set of energy feature sample set X0The detailed steps for establishing the energy characteristic sample are as follows:
in order to reduce the influence of the sample size on mutual information estimation, a K-nearest neighbor method is adopted to calculate a mutual information value between two variables:
wherein N is the number of total samples, k is the number of neighbors, sxAnd syAre respectively represented asAnd Y0The number of nearest neighbor samples in the subspace, phi (·) is a digamma function. Meanwhile, in order to ensure that the mutual information can effectively extract the energy information, the cumulative mutual information contribution degree of the selected g measurement variables can be as follows:
CPMI indicates that the first g features are includedEffect information accounting systemThe ratio of the effective information is generally used to determine the number of features to be screened. Since the mutual information of each feature is greater than 0, the CPMI monotonically increases within the value range of the number of the screened features. To achieveThe relatively good characteristic dimension reduction effect needs to ensure that the CPMI is larger than the specified control limit, and the control limit is usually 85%;
(3) energy feature sample set X obtained in step 20And constructing an initial SVDD model gamma0The detailed steps for establishing the model are as follows:
given a training set x1,x2,…,xN]Where N is the number of samples. a and R represent the center and radius of the hypersphere, respectively. According to the principle of minimizing the structural risk, the hypersphere radius minimization problem can be described as the following quadratic programming problem with inequality constraint, and simultaneously, a relaxation variable xi is introducediAnd a penalty factor C:
for the quadratic programming problem with the constraint condition, a lagrangian multiplier can be introduced, the quadratic programming problem is often easier to solve after being converted into a dual form, and a kernel function can be introduced to project an original space to a high-dimensional space so as to solve the nonlinear problem:
solving by a quadratic programming Optimization (SMO) algorithm to obtain an optimal solution alphaiThereby obtaining an initial SVDD model gamma0The model parameters of (1);
(4) introducing incremental samplesAnd Y1Then, self-adaptively updating the mutual information value corresponding to the incremental sample according to the energy feature extraction rule based on the variable forgetting factorScreening out energy characteristics of incremental samples according to a characteristic dimension reduction principleCollectionThe self-adaptive energy feature extraction method comprises the following steps:
after mutual information values corresponding to the newly added data samples are added with a forgetting factor rho, the newly added measurement samples at the t +1 th timeAnd correspondingEffect data sample Yt+1The mutual information value between the two is as follows:
where rho epsilon (0, 1)],Representing the column vector formed by the characteristics of the jth measured variable in the t +1 th new sample, m refers to the number of the measured variables,H(Yt+1)、respectively representing the information entropy of the variable and the joint entropy value between the two.
The variable forgetting factor method can adaptively adjust the size of the current forgetting factor according to the parameter change of the time-varying system, and the variable forgetting factor ρ of the mutual information value of the newly added sample at the (k + 1) th time is as follows:
in the formula CPMIt+1The cumulative mutual information contribution rate g of the characteristic sample corresponding to the t +1 th newly added datat+1The value is selected according to the cumulative mutual information contribution rate of the current sample, CPMIt+1The value is typically greater than 0.85. Finally, the CPMI from the current sample sett+1Screening out a new energy feature set
(5) According to the initial SVDD model gamma of step 30And step 4) energy characteristic sampleUpdating to obtain an incremental SVDD model gamma1The specific steps of incremental updating are as follows:
1) statistics and statistical limits;
given sample set X ═ X1,x2,…,xN]The dual form of the SVDD hypersphere radius optimization problem is as follows:
whereinDelta is the optimal compensation factor and the kernel function can be recorded as Kij=K(xi,xj) (ii) a From the KKT condition, W is the optimum solution for alphaiThe first derivative of δ needs to satisfy the following condition:
wherein Ω (x)i) Is a sample discriminant function of SVDD.
As can be seen from equation (10), the KKT condition of SVDD generally divides the training sample set into three categories: an intra-sphere non-support vector R, a standard support vector S, and a boundary support vector E. For new sample x just addeduThe model parameters need to be changed to make the extended sample set reach the KKT condition again, so as to obtain a new optimal solution, where the model coefficients change as:
wherein alpha isuIs the new sample x just addeduCorresponding Lagrange multiplier, Δ αuFor the variance of Lagrange multiplier of new sample, Delta alphajAnd adding the Lagrange multiplier variable quantity of the original sample set for the new sample. The standard support vector model coefficient variance can be written as:
let T equal to K-1For samples in the E, R set, its κi≡ 0, known from formula (13):
In the formula (I), the compound is shown in the specification,is a model boundary sensitivity factor, for samples within the S set, whichWhen the compound is brought into the formula (12), the following components are included:
2) sample attribute migration;
the sample attribute migration is based on the delta sample model coefficient variation Δ αuAnd adjusting the change of the original sample model coefficient to enable the extended sample set to satisfy the KKT condition again so as to tend to a new equilibrium state. In the process of migrating the attributes of the incremental samples, the attributes all meet delta alphau>0, six different sample attribute migration scenarios are introduced below:
①κsattribute value set corresponding to standard support vector setThe s-th number in (1) can be calculated by the formula (14)sIf it satisfies the upper limit Δ αs≤C-αsThen Δ αuAnd kappasSame number, xsChange from standard support vector to boundary support vector:
wherein λ is a migration factor.
If delta alphasAt a lower limit Δ αs≥-αsThen, delta αuAnd kappasOpposite sign, xsFrom the standard support vector to the intra-sphere non-support vector:
③ for the non-support vector x in the ballrThen there is dr>0,The r-th element as the set of non-SVM sensitivity factors within the sphere, according to the formula(16) Can calculate to obtain delta dr. If Δ dr<0, intra-sphere non-support vector xrWould become the standard support vector:
support vector x for boundarieseThen there is dr<0, the e-th element of the set of non-SVM sensitivity factors within the sphere can be represented asCan be calculated to obtainIf Δ de>0, boundary support vector xeWould become the standard support vector:
commonly set increment sample xuWhen d is zerouIf x is greater than or equal to 0, the sample x is considereduAnd the model sample coefficients are not required to be updated if the target class samples are within the sphere. When d isu<0, incremental sample xuThen it becomes the standard support vector:
alpha in the incremental SVDD model training processuThe upper bound value is a penalty factor C when alpha isuWhen C is less than or equal to C, increment sample alphauWill become a boundary support vector with delta coefficient variation of:
λma=C-αu (22)
in incremental SVDD, let Δ αmax=min{λsp,λsm,λrs,λes,λu,λmaAsIncremental sample alphauThe coefficient variation of the original model sample obtained by the equation (14) is delta alphai;
3) Updating a T matrix;
the T matrix model in equation (13) also needs to be updated in the incremental iteration process, and in order to reduce the complexity of calculating the inverse matrix, the following update strategy is adopted. For an arbitrary sample xl∈R∪E∪{xuBecome the standard support vector with T +1 updates of the process Tt+1Comprises the following steps:
when the standard supports vector xlWhen leaving set S, Tt+1Comprises the following steps:
(6) continuously introducing an incremental sample, and dynamically updating an incremental SVDD model gamma t times according to the steps 4) and 5)t. And for different fault state samples, calculating the incremental SVDD model gamma of different fault statesth;
(7) To online test sample setCarrying out data standardization and energy feature extraction, and calculating the relative distance gamma of the h fault state modelsh(xhi) Based on the principle of minimum relative distance, the fault state category corresponding to the sample is detected and identified, and the detailed steps are as follows:
the relative distance-based discrimination method can strengthen the discrimination rule and improve the fault state detection rate of the algorithm by dividing by the radius of the hypersphere in each fault state model. The relative distance can not only detect the fault, but also judge the severity of the fault. Using relative distance as criterion:
the most essential difference between the incremental SVDD and the SVDD is structural inheritance, so the training time of the incremental SVDD is shorter than that of the traditional SVDD, as shown in fig. 3. According to the results, as the number of samples is continuously increased, the difference between the training time of the SVDD model and the incremental SVDD is larger and larger. The reason is that the traditional SVDD algorithm has to put the last training sample and the newly added sample together for retraining to solve the secondary optimization problem, so that the historical samples are repeatedly trained, and meanwhile, the samples which are just added are not reasonably screened; different from the batch-type SVDD, the inheritability of the structure brings the advantage that the incremental SVDD can update the model by using the last training result, and the incremental SVDD selectively performs learning training on the incremental samples, so that the training time can be obviously reduced.
The method of the invention is compared with the SVDD method and the incremental SVDD online early detection rate, and the detection results are shown in Table 1. Comparing the detection results of the SVDD method and the incremental SVDD method, the difference between the incremental algorithm and the non-incremental algorithm in the fault detection rate is not obvious. The incremental SVDD has smaller hypersphere space and more training data dimension and sample number than the non-incremental SVDD, thereby showing that the incremental SVDD can selectively reserve the samples which are most likely to be the support vectors, and shortening the training time without reducing the fault detection rate. That is to say, the incremental algorithm does not sacrifice the classification capability to improve the training speed of the model, which indicates that the incremental algorithm is more suitable for the problem of system dynamic change. The results in table 1 and fig. 4, 5, 6 also show that the algorithm failure detection rate without energy feature extraction is lower than the algorithm failure detection rate with energy feature extraction, demonstrating that the algorithm based on energy feature extraction has a lower failure detection rateThe feature dimension reduction technology for information extraction can effectively eliminate irrelevant noise information and extract fault change features, and improves the classification performance of the model, so that the actual production process is guided to a certain extent.
TABLE 1 detection rates of different methods in early failure states
The embodiments described in this specification are merely illustrative of implementations of the inventive concept and the scope of the present invention should not be considered limited to the specific forms set forth in the embodiments but rather by the equivalents thereof as may occur to those skilled in the art upon consideration of the present inventive concept.
Claims (1)
1. Based onThe online early fault detection method of the information and increment SVDD comprises the following steps:
1) collecting process measurement variables andthe effect sample is standardized to obtainAnd Y0(ii) a The method comprises the following steps of preprocessing a sample by data standardization, wherein the preprocessing comprises the following specific implementation steps:
for training sample data x1,x2,…xNSample xiThe calculation formula of the normalization process is as follows:
wherein x isi、Respectively represent the ith original separationLine samples and normalized samples, theta is the arithmetic mean of all sample data, and sigma is the variance of all samples;
2) in the calculation step 1)And Y0With different measured variables from each otherMutual information value of effectsScreening and selecting according to the principle of accumulated mutual information contribution rateEfficient most relevant set of energy feature sample set X0The detailed steps for establishing the energy characteristic sample are as follows:
in order to reduce the influence of the sample size on mutual information estimation, a K-nearest neighbor method is adopted to calculate a mutual information value between two variables:
wherein N is the number of total samples, k is the number of neighbors, sxAnd syAre respectively represented asAnd Y0The number of nearest neighbor samples in the subspace, phi (·) is a digamma function; meanwhile, in order to ensure that the mutual information can effectively extract the energy information, the cumulative mutual information contribution degree of the selected g measurement variables can be as follows:
CPMI represents pre-stageg features comprisingEffect information accounting systemThe proportion of the effective information is used for determining the number of the screened features; because the mutual information of each feature is greater than 0, the CPMI is monotonically increased within the value range of the screened feature number; in order to achieve relatively good feature dimension reduction effect, CPMI is required to be larger than a specified control limit;
3) the energy characteristic sample set X obtained in the step 2)0And constructing an initial SVDD model gamma0The detailed steps for establishing the model are as follows:
given a training set x1,x2,…,xN]Wherein N is the number of samples; a and R represent the center and radius of the hypersphere, respectively. According to the principle of minimizing the structural risk, the hypersphere radius minimization problem can be described as the following quadratic programming problem with inequality constraint, and simultaneously, a relaxation variable xi is introducediAnd a penalty factor C:
for the quadratic programming problem with the constraint condition, a lagrangian multiplier can be introduced, the quadratic programming problem is often easier to solve after being converted into a dual form, and a kernel function can be introduced to project an original space to a high-dimensional space so as to solve the nonlinear problem:
obtaining an optimal solution alpha through solving a quadratic programming optimization algorithm SMO algorithmiThereby obtaining an initial SVDD model gamma0The model parameters of (1);
4) introducing incremental samplesAnd Y1Then, self-adaptively updating the mutual information value corresponding to the incremental sample according to the energy feature extraction rule based on the variable forgetting factorScreening out an energy feature set of the incremental sample according to a feature dimension reduction principleThe self-adaptive energy feature extraction method comprises the following steps:
after mutual information values corresponding to the newly added data samples are added with a forgetting factor rho, the newly added measurement samples at the t +1 th timeAnd correspondingEffect data sample Yt+1The mutual information value between the two is as follows:
where rho epsilon (0, 1)],Representing the column vector formed by the characteristics of the jth measured variable in the t +1 th new sample, m refers to the number of the measured variables,H(Yt+1)、individual watchThe information entropy of the argument and the joint entropy value between the two;
the variable forgetting factor method can adaptively adjust the size of the current forgetting factor according to the parameter change of the time-varying system, and the variable forgetting factor ρ of the mutual information value of the newly added sample at the (k + 1) th time is as follows:
in the formula CPMIt+1The cumulative mutual information contribution rate g of the characteristic sample corresponding to the t +1 th newly added datat+1The value is selected according to the principle of the accumulated mutual information contribution rate of the current sample; finally, the CPMI from the current sample sett+1Screening out a new energy feature set
5) The initial SVDD model gamma according to the step 3)0And step 4) energy characteristic sampleUpdating to obtain an incremental SVDD model gamma1The specific steps of incremental updating are as follows:
5-1) statistics and statistical limits;
given sample set X ═ X1,x2,…,xN]The dual form of the SVDD hypersphere radius optimization problem is as follows:
whereinDelta is the optimal compensation factor and the kernel function can be recorded as Kij=K(xi,xj) (ii) a From the KKT condition, W is the optimum solution for alphaiThe first derivative of δ needs to satisfy the following condition:
wherein Ω (x)i) A sample discrimination function for SVDD;
as can be seen from equation (10), the KKT condition of SVDD generally divides the training sample set into three categories: an intra-sphere non-support vector R, a standard support vector S and a boundary support vector E; for new sample x just addeduThe model parameters need to be changed to make the extended sample set reach the KKT condition again, so as to obtain a new optimal solution, where the model coefficients change as:
wherein alpha isuIs the new sample x just addeduCorresponding Lagrange multiplier, Δ αuFor the variance of Lagrange multiplier of new sample, Delta alphajThe Lagrange multiplier variation of the original sample set after the new sample is added; the standard support vector model coefficient variance can be written as:
let T equal to K-1For samples in the E, R set, its κi≡ 0, known from formula (13):
In the formula (I), the compound is shown in the specification,is a model boundary sensitivity factor, for samples within the S set, whichWhen it is carried into formula (12):
5-2) sample attribute migration;
the sample attribute migration is based on the delta sample model coefficient variation Δ αuAdjusting the change of the original sample model coefficient to enable the extended sample set to satisfy the KKT condition again so as to tend to a new equilibrium state; in the process of migrating the attributes of the incremental samples, the attributes all meet delta alphau>0, the sample attribute migration cases are divided into the following six types:
①κsattribute value set corresponding to standard support vector setThe s-th number in (1) can be calculated by the formula (14)sIf it satisfies the upper limit Δ αs≤C-αsThen Δ αuAnd kappasSame number, xsChange from standard support vector to boundary support vector:
wherein λ is a migration factor;
if delta alphasAt a lower limit Δ αs≥-αsThen, delta αuAnd kappasOpposite sign, xsFrom the standard support vector to the intra-sphere non-support vector:
③ for the non-support vector x in the ballrThen there is dr>0,The r-th element as the non-SVM sensitization factor set in the sphere can be calculated according to the formula (16) to obtain delta dr(ii) a If Δ dr<0, intra-sphere non-support vector xrWould become the standard support vector:
support vector x for boundarieseThen there is dr<0, the e-th element of the set of non-SVM sensitivity factors within the sphere can be represented asCan be calculated to obtainIf Δ de>0, boundary support vector xeWould become the standard support vector:
commonly set increment sample xuWhen d is zerouIs more than or equal to 0Then the sample x is considereduIf the target class sample is the intra-sphere target class sample, the model sample coefficient does not need to be updated; when d isu<0, incremental sample xuThen it becomes the standard support vector:
alpha in the incremental SVDD model training processuThe upper bound value is a penalty factor C when alpha isuWhen C is less than or equal to C, increment sample alphauWill become a boundary support vector with delta coefficient variation of:
λma=C-αu (22)
in incremental SVDD, let Δ αmax=min{λsp,λsm,λrs,λes,λu,λmaAs incremental samples alphauThe coefficient variation of the original model sample obtained by the equation (14) is delta alphai;
5-3) updating the T matrix;
the T matrix model in the formula (13) also needs to be updated in the incremental iteration process, and in order to reduce the complexity of calculating an inverse matrix, the following updating strategy is adopted; for an arbitrary sample xl∈R∪E∪{xuBecome the standard support vector with T +1 updates of the process Tt+1Comprises the following steps:
when the standard supports vector xlWhen leaving set S, Tt+1Comprises the following steps:
6) continuously introducing incremental samples, and dynamically updating t times of incremental SVDD model gamma according to step 4) and step 5)t(ii) a And forCalculating the incremental SVDD model gamma of different fault states by using the same fault state sampleth;
7) To online test sample setCarrying out data standardization and energy feature extraction, and calculating the relative distance gamma of the h fault state modelsh(xhi) Based on the principle of minimum relative distance, the fault state category corresponding to the sample is detected and identified, and the detailed steps are as follows:
the discrimination method based on the relative distance can strengthen the judgment rule and improve the fault state detection rate of the algorithm by dividing the radius of the hypersphere in each fault state model; the relative distance can not only detect the fault, but also judge the severity of the fault; using relative distance as criterion:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910486042.3A CN110298385B (en) | 2019-06-05 | 2019-06-05 | exergy information and incremental SVDD (singular value decomposition) based online early fault detection method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910486042.3A CN110298385B (en) | 2019-06-05 | 2019-06-05 | exergy information and incremental SVDD (singular value decomposition) based online early fault detection method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110298385A CN110298385A (en) | 2019-10-01 |
CN110298385B true CN110298385B (en) | 2021-07-27 |
Family
ID=68027725
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910486042.3A Active CN110298385B (en) | 2019-06-05 | 2019-06-05 | exergy information and incremental SVDD (singular value decomposition) based online early fault detection method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110298385B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112184037B (en) * | 2020-09-30 | 2022-11-11 | 华中科技大学 | Multi-modal process fault detection method based on weighted SVDD |
CN113237920B (en) * | 2021-05-17 | 2022-04-22 | 西南交通大学 | Method for detecting fault heat source of valve-side sleeve of extra-high voltage converter transformer |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105868783A (en) * | 2016-03-31 | 2016-08-17 | 华东理工大学 | Reduction type support vector data description method based on information entropy |
CN106124988A (en) * | 2016-06-28 | 2016-11-16 | 江苏科技大学 | A kind of motor multi-state fault detection method based on RBF, multilamellar FDA and SVDD |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190042977A1 (en) * | 2017-08-07 | 2019-02-07 | Sas Institute Inc. | Bandwidth selection in support vector data description for outlier identification |
US20190095400A1 (en) * | 2017-09-28 | 2019-03-28 | Sas Institute Inc. | Analytic system to incrementally update a support vector data description for outlier identification |
-
2019
- 2019-06-05 CN CN201910486042.3A patent/CN110298385B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105868783A (en) * | 2016-03-31 | 2016-08-17 | 华东理工大学 | Reduction type support vector data description method based on information entropy |
CN106124988A (en) * | 2016-06-28 | 2016-11-16 | 江苏科技大学 | A kind of motor multi-state fault detection method based on RBF, multilamellar FDA and SVDD |
Non-Patent Citations (1)
Title |
---|
Improved data-based fault detection strategy and application to distillation columns;MADAKYARU M 等;《Process Safety and Environmental Protection》;20171231;第22-34页 * |
Also Published As
Publication number | Publication date |
---|---|
CN110298385A (en) | 2019-10-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Jeon et al. | Prediction of ship fuel consumption by using an artificial neural network | |
CN113256066B (en) | PCA-XGboost-IRF-based job shop real-time scheduling method | |
Singh et al. | A comprehensive review: Fault detection, diagnostics, prognostics, and fault modeling in HVAC systems | |
Wang et al. | Data-driven mode identification and unsupervised fault detection for nonlinear multimode processes | |
Deng et al. | Modified kernel principal component analysis using double-weighted local outlier factor and its application to nonlinear process monitoring | |
US11860608B2 (en) | Industrial equipment operation, maintenance and optimization method and system based on complex network model | |
Li et al. | Review on fault detection and diagnosis feature engineering in building heating, ventilation, air conditioning and refrigeration systems | |
Song et al. | Multisubspace elastic network for multimode quality-related process monitoring | |
Ma et al. | A novel hierarchical detection and isolation framework for quality-related multiple faults in large-scale processes | |
CN110298385B (en) | exergy information and incremental SVDD (singular value decomposition) based online early fault detection method | |
CN110598929B (en) | Wind power nonparametric probability interval ultrashort term prediction method | |
He et al. | Dynamic mutual information similarity based transient process identification and fault detection | |
Simmini et al. | A self-tuning KPCA-based approach to fault detection in chiller systems | |
Yao et al. | An enhanced dynamic artificial immune system based on simulated vaccine for early fault diagnosis with limited data | |
Zhang et al. | Experimental study on performance assessments of HVAC cross-domain fault diagnosis methods oriented to incomplete data problems | |
Bouhouche et al. | Combined use of principal component analysis and self organisation map for condition monitoring in pickling process | |
Tian et al. | An unsupervised data mining-based framework for evaluation and optimization of operation strategy of HVAC system | |
Liang et al. | The impact of improved PCA method based on anomaly detection on chiller sensor fault detection | |
Wei et al. | Virtual sensor development for multioutput nonlinear processes based on bilinear neighborhood preserving regression model with localized construction | |
CN115618506A (en) | Method for predicting power of single-shaft combined cycle gas turbine | |
Chen et al. | An enhanced DPCA fault diagnosis method based on hierarchical cluster analysis | |
Gao et al. | Compound fault diagnosis for cooling dehumidifier based on RBF neural network improved by kernel principle component analysis and adaptive genetic algorithm | |
Sallehuddin et al. | Forecasting small data set using hybrid cooperative feature selection | |
CN111914471A (en) | Rectification tower fault detection method based on rapid nuclear independent component analysis | |
Wang et al. | A reduced nonstationary discrete convolution kernel for multimode process monitoring |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |