CN113741364A - Multi-mode chemical process fault detection method based on improved t-SNE - Google Patents

Multi-mode chemical process fault detection method based on improved t-SNE Download PDF

Info

Publication number
CN113741364A
CN113741364A CN202110988594.1A CN202110988594A CN113741364A CN 113741364 A CN113741364 A CN 113741364A CN 202110988594 A CN202110988594 A CN 202110988594A CN 113741364 A CN113741364 A CN 113741364A
Authority
CN
China
Prior art keywords
dimensional
data
lof
space
low
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110988594.1A
Other languages
Chinese (zh)
Inventor
顾昊昱
张成功
钱平
王丽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Institute of Technology
Original Assignee
Shanghai Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Institute of Technology filed Critical Shanghai Institute of Technology
Priority to CN202110988594.1A priority Critical patent/CN113741364A/en
Publication of CN113741364A publication Critical patent/CN113741364A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS], computer integrated manufacturing [CIM]
    • G05B19/41875Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS], computer integrated manufacturing [CIM] characterised by quality surveillance of production
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/31From computer integrated manufacturing till monitoring
    • G05B2219/31337Failure information database
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Manufacturing & Machinery (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Complex Calculations (AREA)

Abstract

The invention relates to a multi-mode chemical process fault detection method based on improved t-SNE, which comprises the following steps: step S1: collecting multi-modal chemical process raw data XoAnd carrying out standardization processing to obtain high-dimensional data X; step S2: computing mahalanobis distance D between high-dimensional data pointsM(ii) a Step S3: carrying out feature extraction on the high-dimensional data X by adopting an improved t-distribution random neighbor embedding method t-SNE to obtain a low-dimensional matrix Y; step S4: obtaining a mapping matrix A from a high-dimensional space to a low-dimensional space; step S5: obtaining a residual error space E of the training data; step S6: solving a feature space and a residual space of the online data; step S7: constructing LOF statistics by using a local outlier factor LOF algorithm; step S8: calculating LOF statistics and corresponding control limits of training data; step S9: and calculating LOF statistic of the online data, and performing real-time fault detection. Compared with the prior art, the method can meet the requirement of multi-mode process monitoring and has high accuracy.

Description

Multi-mode chemical process fault detection method based on improved t-SNE
Technical Field
The invention relates to the field of chemical production process monitoring, in particular to a multi-mode chemical process fault detection method based on improved t-SNE.
Background
Under the support of technologies such as a distributed control system technology, an internet of things and big data, the chemical production process tends to be automated and intelligentized more and more. The process monitoring is a key link for ensuring the production safety and high efficiency, and the fault detection is the most basic and important link in the process monitoring. The process monitoring technology based on data driving, which is developed on the basis of the Multivariate Statistical Process Monitoring (MSPM) theory, can mine effective information from massive process data to realize modeling of the chemical process, thereby achieving the purpose of monitoring.
In the chemical production process, the balance point changes during production due to seasonal changes, demand fluctuations and other factors, so that the multi-modal characteristic of the production process is formed. The traditional MSPM method is generally only suitable for realizing process monitoring in a single-mode production process, and if the production modes are switched in multi-mode operation and production, the traditional method cannot realize effective modeling and monitoring. The model integration method is an idea of multi-modal process fault detection, and comprises the specific steps of segmenting each mode of a multi-modal process, and establishing a corresponding model for each mode on the basis to realize detection. The method adopts a mode of hard partitioning the multi-mode process, and the fault detection effect has strict requirements on the accurate partitioning of the modes. The other method is a global modeling method which adopts a softening separation method and does not need to accurately divide each mode of the multi-mode process.
the t-SNE algorithm is a manifold learning algorithm and converts the conventional thought of high-dimensional and low-dimensional similarity of Euclidean distance measures into the similarity of high-dimensional and low-dimensional data reflected by conditional probability. When the model established by the t-SNE is used for feature extraction, the structural characteristics of low-dimensional data and high-dimensional data can be guaranteed. However, when the classical t-SNE algorithm measures the distance between data points, the Euclidean distance is calculated and then converted into the probability, and the method is only suitable for modeling of the monomodal chemical process. After modeling is completed, corresponding statistic is constructed to realize fault detection, and traditional statistic T2SPE (solid phase extraction) needs to meet the requirement that data obey normal distribution, but the chemical process data after dimensionality reduction usually has the characteristic of non-Gaussian property, and T is used2And SPE statistics may affect control limitsAnd the final fault detection effect.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide a multi-modal chemical process fault detection method based on improved t-SNE.
The purpose of the invention can be realized by the following technical scheme:
a multi-modal chemical process fault detection method based on improved t-SNE comprises the following steps:
step S1: collecting multi-modal chemical process raw data XoAnd carrying out standardization processing to obtain high-dimensional data X;
step S2: computing mahalanobis distance D between high-dimensional data pointsM
Step S3: carrying out feature extraction on the high-dimensional data X by adopting an improved t-distribution random neighbor embedding method t-SNE to obtain a low-dimensional matrix Y;
step S4: obtaining a mapping matrix A from a high-dimensional space to a low-dimensional space;
step S5: obtaining a residual error space E of the training data;
step S6: solving a feature space and a residual space of the online data;
step S7: constructing LOF statistics by using a local outlier factor LOF algorithm;
step S8: calculating LOF statistics and corresponding control limits of training data;
step S9: and calculating LOF statistic of the online data, and performing real-time fault detection.
Preferably, the multi-modal chemical process raw data in step S1 is data generated when the chemical process normally runs in a plurality of different modalities, wherein the number of sampling points of each modality is the same; the multi-mode chemical process raw data is Xo=[x1,x2,...,xN]∈RD×NWhere D represents the number of process variables and N represents the number of sample points.
Preferably, the step S2 is specifically: x is the number ofi,xjAre two sample points in the high dimensional dataset X,xi,xjmahalanobis distance D in high dimensional spaceM(xi,xj) The calculation method is as follows:
Figure BDA0003231697030000021
where μ is the sample mean of the data set X, Σ-1As a covariance matrix of data set X, sigma-1The specific form is as follows:
Figure BDA0003231697030000022
wherein c isijIs a sample point xi,xjCovariance of cijThe expression is as follows:
cij=Cov(xi,xj)=E[xi-E(xi)][xj-E(xj)],i,j=1,2,...,D (3)
wherein E (x)i) And E (x)j) Respectively represent samples xiAnd xjThe mathematical expectation of (2).
Preferably, the step S3 is configured to obtain the low-dimensional feature matrix Y, and specifically includes the following steps:
step S301: calculating conditional probabilities between high-dimensional spatial data points;
recording high-dimensional space sample point xjIs xiHas a conditional probability of pj|iExpressed as:
Figure BDA0003231697030000031
wherein σiIs represented by XiIs the variance of the Gaussian distribution of the center point, k denotes the number of neighbors, xkDenotes xiAny one of the neighbors of;
step S302: calculating conditional probabilities between the low-dimensional spatial data points;
assuming final acquisitionThe low-dimensional feature matrix is Y ═ Y1,y2,...,yN]∈Rd×ND represents the finally obtained matrix with the low-dimensional characteristic of d dimension, and the high-dimensional data sample point xiAnd xjThe mapped point in the low-dimensional space is yiAnd yjWill be given by yiThe variance of the Gaussian distribution as the center point is set to
Figure BDA0003231697030000032
Accordingly, the sample point y in the low-dimensional space can be obtainediIs yjThe conditional probability of (2) is as follows:
Figure BDA0003231697030000033
wherein y iskDenotes yiAny one of the neighbors of;
step S303: similarity between high and low dimensional data distributions using KL divergence measures;
the similarity cost function C constructed by KL divergence is as follows:
Figure BDA0003231697030000034
wherein P isiRepresenting a high-dimensional spatial sample xiSet of conditional probabilities forming a neighbour relation with all other samples, QiRepresenting low dimensional spatial samples yiSet x of conditional probabilities forming a neighbor relation with all other samplesj,pj|iDenotes xiIs xjThe conditional probability of (a);
step S304: minimizing a cost function;
minimizing a cost function C, C to y, by means of gradient descentiAnd (3) solving the gradient, wherein the gradient formula is as follows:
Figure BDA0003231697030000035
step S305: simplifying a gradient formula;
using joint probability distribution instead of conditional probability, so that for arbitrary i and j, there is qji=qij,pji=pij(ii) a The target is achieved by a symmetrical method as follows:
Figure BDA0003231697030000041
the transformation cost function C is:
Figure BDA0003231697030000042
the simplified gradient formula is:
Figure BDA0003231697030000043
step S306: redefining the joint distribution probability of the low-dimensional data according to the t distribution of the students;
in order to solve the crowding problem of low-dimensional data during classification, student t distribution is used for replacing Gaussian distribution, and t distribution with degree of freedom of 1 is used for redefining qijThe concrete form is as follows:
Figure BDA0003231697030000044
the gradient formula is converted to the following expression:
Figure BDA0003231697030000045
step S307: introducing a large momentum term in gradient updating to avoid partial optimization in the gradient descending process, wherein the introduced large momentum term is as follows:
Figure BDA0003231697030000046
wherein
Figure BDA0003231697030000047
Represents the solution of the t-th iteration, α (t) represents the momentum of the t-th iteration, and η represents the learning efficiency.
Preferably, σ in the step S301iThe value is selected in relation to the Perplexity; the Perplexity is:
Figure BDA0003231697030000048
wherein H (p)i) Is piShannon entropy of (c):
Figure BDA0003231697030000049
the Perplexity value is set to 5 to 50.
Preferably, the step S5 is specifically: given that the feature space of the training data is Y, the mapping matrix from the high-dimensional space X to the low-dimensional space Y is a, and accordingly, the mapping matrix from the low-dimensional space Y to the high-dimensional space is denoted as B:
B=AT(AAT)-1 (16)
finding a high dimensional space
Figure BDA00032316970300000410
Figure BDA00032316970300000411
The residual space E of the training data is thus calculated:
Figure BDA00032316970300000412
preferably, the step S6 is specifically: obtaining on-line data xunNormalized to obtain xonObtaining the online high-dimensional data x by mapping the matrix AonLow-dimensional projection y ofon
yon=Axon (19)
Residual space e of online dataonNamely:
Figure BDA0003231697030000051
wherein
Figure BDA0003231697030000052
Representing a low-dimensional online data space yonProjection in the high dimension.
Preferably, the step S7 includes the steps of:
step S701: searching a local neighborhood, and calculating the distance of the farthest neighbor;
feature sample set for training data
Figure BDA0003231697030000053
Sample point y in (1)iLooking for yiK neighbors of (a) form a neighborhood
Figure BDA0003231697030000054
yiThe distance from its nearest neighbor is denoted k distance (y)i);
Step S702: obtaining a local reachable distance;
sample yiAnd its neighboring neighbor
Figure BDA0003231697030000055
Constructed to reach a distance of
Figure BDA0003231697030000056
The specific definition is as follows:
Figure BDA0003231697030000057
wherein the content of the first and second substances,
Figure BDA0003231697030000058
is a sample point yiAnd its neighbors
Figure BDA0003231697030000059
The Euclidean distance of;
step S703: calculating local reachable density;
sample point yiIs defined as lrd (y)i) The expression is as follows:
Figure BDA00032316970300000510
step S704: calculating a local outlier factor;
sample point yiThe local outlier factor of (a) is LOF (y)i) The form is as follows:
Figure BDA00032316970300000511
preferably, the step S8 is specifically: through the LOF statistic construction method in the step S7, local outlier factors of each sample point in the training data feature space are obtained, and a statistic matrix LOF is obtainedY=[LOF(y1),LOF(y2),…,LOF(yN)]Obtaining the control Limit LOF Using the Nuclear Density methodY_Limit(ii) a Similarly, a statistic matrix LOF of a training data residual space E is obtainedE=[LOF(e1),LOF(e2),…,LOF(eN)]And corresponding control limit LOFE_Limit
Preferably, the step S9 is specifically: residual spatial sample point e for online dataonFinding E in the training data residual space EonNeighborhood of (2)
Figure BDA00032316970300000512
Finally find eonLOF statistic LOF (e)on);
And fault detection is realized through the feature space and the residual error space statistics, and when the online data statistics are all larger than the control limit, the fault can be judged to occur.
Compared with the prior art, the invention has the following advantages:
1) when the Euclidean distance is used for calculating the distance, the problems of inconsistency of different dimensionalities of data and correlation between variables are often ignored, the Mahalanobis distance is an improvement on the Euclidean distance, the influence of the scale is eliminated through a covariance matrix on the basis of a total sample, and the interference of the correlation between the variables is eliminated; in the multi-modal chemical process, scale differences exist among the modes, so that the idea of calculating Euclidean distance in the traditional t-SNE algorithm is changed, the distance calculation mode is replaced by calculating the Mahalanobis distance, and the modeling of the multi-modal chemical process is not influenced by dimensions;
2) according to the invention, the fault detection of the multi-mode chemical process is realized by establishing the global model by improving the t-distribution random neighbor embedding t-SNE algorithm, and compared with a multi-model integration method, the global model does not need to meet the requirement of accurately dividing each mode;
3) the LOF statistic adopted by the invention can adapt to non-Gaussian distribution data, and the robustness is stronger, so that more accurate control limit and statistic are obtained by calculation, and finally, the fault detection with higher accuracy is realized.
Drawings
FIG. 1 is a flow chart of a multi-modal chemical process fault detection method based on improved t-SNE.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, shall fall within the scope of protection of the present invention.
As shown in FIG. 1, the invention provides a multi-modal chemical process fault detection method based on improved t-SNE. Firstly, collecting data of multiple modal chemical processes during normal operation, and carrying out standardized preprocessing to obtain training data; secondly, calculating the Mahalanobis distance of the training data, and realizing multi-mode chemical process modeling by improving a t-SNE algorithm; and finally, introducing local outlier factor LOF algorithm to construct statistics, and realizing real-time fault detection in the multi-modal chemical process.
The embodiment is realized by the following technical scheme, which specifically comprises the following steps:
step S1: collecting multi-modal chemical process data;
collecting data generated during normal operation of chemical process under multiple different modes, wherein the number of sampling points of each mode is the same, and forming high-dimensional original data Xo=[x1,x2,...,xN]∈RD×NWhere D represents the number of process variables and N represents the number of sample points.
Step S2: carrying out standardization processing on original high-dimensional data;
raw data XoAnd obtaining high-dimensional data X after standardization processing.
Step S3: calculating the mahalanobis distance between high-dimensional data points;
xi,xjis two sample points, X, in a high dimensional dataset Xi,xjMahalanobis distance D in high dimensional spaceM(xi,xj) The calculation method is as follows:
Figure BDA0003231697030000071
where μ is the sample mean of the data set X, Σ-1As a covariance matrix of data set X, sigma-1The specific form is as follows:
Figure BDA0003231697030000072
wherein c isijIs a sample point xi,xjCovariance of cijThe expression is as follows:
cij=Cov(xi,xj)=E[xi-E(xi)][xj-E(xj)],i,j=1,2,...,D (3)
step S4: extracting the characteristics of the high-dimensional data by using an improved t-distribution random neighbor embedding method;
calculating conditional probability among high-dimensional space data points;
high dimensional spatial sample point xjIs xiIs denoted as pj|i
Figure BDA0003231697030000073
Wherein σiIs represented by XiIs the variance of the Gaussian distribution of the center point, k denotes the number of neighbors, xkDenotes xiAny one of the neighbors.
σiThe values are selected in relation to a Perplexity, which is defined by the following equation:
Figure BDA0003231697030000074
wherein H (p)i) Is piShannon entropy of (c):
Figure BDA0003231697030000075
the value of the degree of confusion is generally set between 5 and 50, and σ is found by a binary searchiThe value of (c).
Calculating conditional probability among the low-dimensional space data points;
the finally obtained low-dimensional feature matrix is assumed to be Y ═ Y1,y2,...,yN]∈Rd×ND represents the finally obtained matrix with the low-dimensional characteristic of d dimension, and the high-dimensional data sample point xiAnd xjThe mapped point in the low-dimensional space is yiAnd yjWill be given by yiThe variance of the Gaussian distribution as the center point is set to
Figure BDA0003231697030000081
Accordingly, the sample point y in the low-dimensional space can be obtainediIs yjThe conditional probability of (2) is as follows:
Figure BDA0003231697030000082
wherein y iskDenotes yiAny one of the neighbors.
Measuring the similarity between two data distributions by using KL Divergence (Kullback-Leibler Divergence);
the similarity cost function C constructed by KL divergence is as follows:
Figure BDA0003231697030000083
fourthly, minimizing a cost function;
minimizing a cost function C, the cost function C to y, by means of gradient descentiAnd (3) solving the gradient, wherein the gradient formula is as follows:
Figure BDA0003231697030000084
simplifying gradient formula;
using joint probability distribution instead of conditional probability, so that for arbitrary i and j, there is qji=qij,pji=pij. The target is achieved by a symmetrical method as follows:
Figure BDA0003231697030000085
the cost function C is converted into the following expression:
Figure BDA0003231697030000086
the gradient formula can be simplified as:
Figure BDA0003231697030000087
redefining the joint distribution probability of the low-dimensional data according to the student t distribution;
in order to solve the crowding problem of low-dimensional data in classification, a student t distribution is used for replacing a Gaussian distribution. Redefining q using t-distribution with degree of freedom of 1ijThe concrete form is as follows:
Figure BDA0003231697030000088
the gradient formula is converted to the following expression:
Figure BDA0003231697030000091
seventhly, in order to avoid falling into local optimum in the gradient descending process, a large momentum term is added in the gradient updating process;
Figure BDA0003231697030000092
wherein
Figure BDA0003231697030000093
Represents the solution of the t-th iteration, α (t) represents the momentum of the t-th iteration, and η represents the learning efficiency. Obtaining low dimension by the above stepsThe feature matrix Y.
Step S5: obtaining a mapping matrix;
in order to realize real-time fault detection and reduce the calculation time loss of processing new data, a mapping matrix is required to obtain a low-dimensional feature space of online high-dimensional data, so a mapping matrix A for projecting the high-dimensional space to the low-dimensional space is required to be obtained. Moore-Penrose generalized inverse matrix X from X+A can be obtained:
A=YX+ (16)
Figure BDA0003231697030000094
step S6: obtaining a residual space of the training data;
given that the feature space of the training data is Y, the mapping matrix from the high-dimensional space X to the low-dimensional space Y is a, and accordingly, the mapping matrix from the low-dimensional space Y to the high-dimensional space is denoted as B:
B=AT(AAT)-1 (18)
finding a high dimensional space
Figure BDA0003231697030000095
Figure BDA0003231697030000096
The residual space E of the training data is thus calculated:
Figure BDA0003231697030000097
step S7: solving a feature space and a residual space of the online data;
obtaining on-line data xunNormalized to obtain xonObtaining the online high-dimensional data x by mapping the matrix AonLow-dimensional projection y ofon
yon=Axon (21)
Residual space e of online dataonNamely:
Figure BDA0003231697030000098
step S8: constructing LOF statistics by using a local outlier factor algorithm;
searching a local neighborhood, and calculating the distance of the farthest neighbor;
feature sample set for training data
Figure BDA0003231697030000099
Sample point y in (1)iIn other words, find yiK neighbors of (a) form a neighborhood
Figure BDA00032316970300000910
yiThe distance from its nearest neighbor is denoted k distance (y)i) (ii) a Secondly, obtaining a local reachable distance;
sample yiAnd its neighboring neighbor
Figure BDA0003231697030000101
Constructed to reach a distance of
Figure BDA0003231697030000102
The specific definition is as follows:
Figure BDA0003231697030000103
wherein the content of the first and second substances,
Figure BDA0003231697030000104
is a sample point yiAnd its neighbors
Figure BDA0003231697030000105
The euclidean distance of (c).
Thirdly, calculating local reachable density;
sample point yiIs defined as lrd (y)i) The expression is as follows:
Figure BDA0003231697030000106
fourthly, calculating local outlier factors;
sample point yiThe local outlier factor of (a) is LOF (y)i) The form is as follows:
Figure BDA0003231697030000107
step S8: calculating LOF statistics and corresponding control limits of training data;
by the method for constructing the statistics, the local outlier factor of each sample point in the training data feature space is obtained, and a statistics matrix LOF is obtainedY=[LOF(y1),LOF(y2),…,LOF(yN)]Obtaining the control Limit LOF Using the Nuclear Density methodY_Limit. The signal coefficient is 0.99, and alpha is 0.01. Through the formula (26), the LOF is derivedY_Limit
Figure BDA0003231697030000108
Similarly, a statistic matrix LOF of a training data residual space E is obtainedE=[LOF(e1),LOF(e2),…,LOF(eN)]And corresponding control limit LOFE_Limit。LOFE_LimitThe derivation is as follows:
Figure BDA0003231697030000109
step 9, calculating LOF statistic of online data to realize real-time fault detection;
sample points y for online data feature spaceonSearching in training data feature space YFinding and yonThe K sample points with the closest distance form yonNeighborhood of (2)
Figure BDA00032316970300001010
Finally, find yonLOF statistic LOF (y)on) The expression is as follows:
Figure BDA00032316970300001011
similarly, for the residual spatial sample point e of the online dataonFinding E in the training data residual space EonNeighborhood of (2)
Figure BDA00032316970300001012
Finally find eonLOF statistic LOF (e)on) The expression is as follows:
Figure BDA00032316970300001013
and fault detection is realized through the feature space and the residual error space statistics, and when the online data statistics are all larger than the control limit, the fault can be judged to occur. The determination conditions were as follows:
Figure BDA0003231697030000111
due to the change of production requirements in the chemical production process, a plurality of operation modes generally exist, and the traditional method is usually only suitable for the modeling of the chemical process with a single mode. Therefore, the invention provides an improved algorithm on the basis of the t-SNE algorithm, and the Mahalanobis distance is adopted to calculate the distance between data points so as to match the modeling of the multi-modal process. On the basis, the statistic is established through a local outlier factor LOF algorithm, so that the detection result is more robust.
While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A multi-modal chemical process fault detection method based on improved t-SNE is characterized by comprising the following steps:
step S1: collecting multi-modal chemical process raw data XoAnd carrying out standardization processing to obtain high-dimensional data X;
step S2: computing mahalanobis distance D between high-dimensional data pointsM
Step S3: carrying out feature extraction on the high-dimensional data X by adopting an improved t-distribution random neighbor embedding method t-SNE to obtain a low-dimensional matrix Y;
step S4: obtaining a mapping matrix A from a high-dimensional space to a low-dimensional space;
step S5: obtaining a residual error space E of the training data;
step S6: solving a feature space and a residual space of the online data;
step S7: constructing LOF statistics by using a local outlier factor LOF algorithm;
step S8: calculating LOF statistics and corresponding control limits of training data;
step S9: and calculating LOF statistic of the online data, and performing real-time fault detection.
2. The method according to claim 1, wherein the multi-modal chemical process raw data in the step S1 is data generated when the chemical process normally operates in a plurality of different modes, and the number of sampling points in each mode is the same; the multi-mode chemical process raw data is Xo=[x1,x2,...,xN]∈RD×NWhere D represents the number of variables of the process and N represents the number of sampling pointsAnd (4) the number.
3. The method for multi-modal chemical process fault detection based on improved t-SNE as claimed in claim 1, wherein the step S2 is specifically as follows: x is the number ofi,xjIs two sample points, X, in a high dimensional dataset Xi,xjMahalanobis distance D in high dimensional spaceM(xi,xj) The calculation method is as follows:
Figure FDA0003231697020000011
where μ is the sample mean of the data set X, Σ-1As a covariance matrix of data set X, sigma-1The specific form is as follows:
Figure FDA0003231697020000012
wherein c isijIs a sample point xi,xjCovariance of cijThe expression is as follows:
cij=Cov(xi,xj)=E[xi-E(xi)][xj-E(xj)],i,j=1,2,...,D (3)
wherein E (x)i) And E (x)j) Respectively represent samples xiAnd xjThe mathematical expectation of (2).
4. The multi-modal chemical process fault detection method based on improved t-SNE as claimed in claim 1, wherein the step S3 is used for obtaining a low-dimensional feature matrix Y, and specifically comprises the following steps:
step S301: calculating conditional probabilities between high-dimensional spatial data points;
recording high-dimensional space sample point xjIs xiHas a conditional probability of pj|iExpressed as:
Figure FDA0003231697020000021
wherein σiIs represented by XiIs the variance of the Gaussian distribution of the center point, k denotes the number of neighbors, xkDenotes xiAny one of the neighbors of;
step S302: calculating conditional probabilities between the low-dimensional spatial data points;
the finally obtained low-dimensional feature matrix is assumed to be Y ═ Y1,y2,...,yN]∈Rd×ND represents the finally obtained matrix with the low-dimensional characteristic of d dimension, and the high-dimensional data sample point xiAnd xjThe mapped point in the low-dimensional space is yiAnd yjWill be given by yiThe variance of the Gaussian distribution as the center point is set to
Figure FDA0003231697020000025
Accordingly, the sample point y in the low-dimensional space can be obtainediIs yjThe conditional probability of (2) is as follows:
Figure FDA0003231697020000022
wherein y iskDenotes yiAny one of the neighbors of;
step S303: similarity between high and low dimensional data distributions using KL divergence measures;
the similarity cost function C constructed by KL divergence is as follows:
Figure FDA0003231697020000023
wherein P isiRepresenting a high-dimensional spatial sample xiSet of conditional probabilities forming a neighbour relation with all other samples, QiRepresenting low dimensional spatial samples yiForm a close proximity with all other samplesSet of conditional probability constituents of relationships xj,pj|iDenotes xiIs xjThe conditional probability of (a);
step S304: minimizing a cost function;
minimizing a cost function C, C to y, by means of gradient descentiAnd (3) solving the gradient, wherein the gradient formula is as follows:
Figure FDA0003231697020000024
step S305: simplifying a gradient formula;
using joint probability distribution instead of conditional probability, so that for arbitrary i and j, there is qji=qij,pji=pij(ii) a The target is achieved by a symmetrical method as follows:
Figure FDA0003231697020000031
the transformation cost function C is:
Figure FDA0003231697020000032
the simplified gradient formula is:
Figure FDA0003231697020000033
step S306: redefining the joint distribution probability of the low-dimensional data according to the t distribution of the students;
in order to solve the crowding problem of low-dimensional data during classification, student t distribution is used for replacing Gaussian distribution, and t distribution with degree of freedom of 1 is used for redefining qijThe concrete form is as follows:
Figure FDA0003231697020000034
the gradient formula is converted to the following expression:
Figure FDA0003231697020000035
step S307: introducing a large momentum term in gradient updating to avoid partial optimization in the gradient descending process, wherein the introduced large momentum term is as follows:
Figure FDA0003231697020000036
wherein
Figure FDA0003231697020000037
Represents the solution of the t-th iteration, α (t) represents the momentum of the t-th iteration, and η represents the learning efficiency.
5. The method for multi-modal chemical process fault detection based on improved t-SNE as claimed in claim 4, wherein σ in the step S301iThe value is selected in relation to the Perplexity; the Perplexity is:
Figure FDA0003231697020000038
wherein H (p)i) Is piShannon entropy of (c):
Figure FDA0003231697020000039
the Perplexity value is set to 5 to 50.
6. The method for multi-modal chemical process fault detection based on improved t-SNE as claimed in claim 1, wherein the step S5 is specifically as follows: given that the feature space of the training data is Y, the mapping matrix from the high-dimensional space X to the low-dimensional space Y is a, and accordingly, the mapping matrix from the low-dimensional space Y to the high-dimensional space is denoted as B:
B=AT(AAT)-1 (16)
finding a high dimensional space
Figure FDA00032316970200000310
Figure FDA00032316970200000311
The residual space E of the training data is thus calculated:
Figure FDA0003231697020000041
7. the method for multi-modal chemical process fault detection based on improved t-SNE as claimed in claim 1, wherein the step S6 is specifically as follows: obtaining on-line data xunNormalized to obtain xonObtaining the online high-dimensional data x by mapping the matrix AonLow-dimensional projection y ofon
yon=Axon (19)
Residual space e of online dataonNamely:
Figure FDA0003231697020000042
wherein
Figure FDA0003231697020000043
Representing a low-dimensional online data space yonProjection in the high dimension.
8. The method for multi-modal chemical process fault detection based on improved t-SNE as claimed in claim 1, wherein the step S7 comprises the following steps:
step S701: searching a local neighborhood, and calculating the distance of the farthest neighbor;
feature sample set for training data
Figure FDA0003231697020000044
Sample point y in (1)iLooking for yiK neighbors of (a) form a neighborhood
Figure FDA0003231697020000045
yiThe distance from its nearest neighbor is denoted k distance (y)i);
Step S702: obtaining a local reachable distance;
sample yiAnd its neighboring neighbor
Figure FDA0003231697020000046
Constructed to reach a distance of
Figure FDA0003231697020000047
The specific definition is as follows:
Figure FDA0003231697020000048
wherein the content of the first and second substances,
Figure FDA0003231697020000049
is a sample point yiAnd its neighbors
Figure FDA00032316970200000410
The Euclidean distance of;
step S703: calculating local reachable density;
sample point yiLocal achievable density ofIs defined as lrd (y)i) The expression is as follows:
Figure FDA00032316970200000411
step S704: calculating a local outlier factor;
sample point yiThe local outlier factor of (a) is LOF (y)i) The form is as follows:
Figure FDA00032316970200000412
9. the method for multi-modal chemical process fault detection based on improved t-SNE as claimed in claim 1, wherein the step S8 is specifically as follows: through the LOF statistic construction method in the step S7, local outlier factors of each sample point in the training data feature space are obtained, and a statistic matrix LOF is obtainedY=[LOF(y1),LOF(y2),…,LOF(yN)]Obtaining the control Limit LOF Using the Nuclear Density methodY_Limit(ii) a Similarly, a statistic matrix LOF of a training data residual space E is obtainedE=[LOF(e1),LOF(e2),…,LOF(eN)]And corresponding control limit LOFE_Limit
10. The method for multi-modal chemical process fault detection based on improved t-SNE as claimed in claim 1, wherein the step S9 is specifically as follows: residual spatial sample point e for online dataonFinding E in the training data residual space EonNeighborhood of (2)
Figure FDA0003231697020000051
Finally find eonLOF statistic LOF (e)on);
And fault detection is realized through the feature space and the residual error space statistics, and when the online data statistics are all larger than the control limit, the fault can be judged to occur.
CN202110988594.1A 2021-08-26 2021-08-26 Multi-mode chemical process fault detection method based on improved t-SNE Pending CN113741364A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110988594.1A CN113741364A (en) 2021-08-26 2021-08-26 Multi-mode chemical process fault detection method based on improved t-SNE

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110988594.1A CN113741364A (en) 2021-08-26 2021-08-26 Multi-mode chemical process fault detection method based on improved t-SNE

Publications (1)

Publication Number Publication Date
CN113741364A true CN113741364A (en) 2021-12-03

Family

ID=78733127

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110988594.1A Pending CN113741364A (en) 2021-08-26 2021-08-26 Multi-mode chemical process fault detection method based on improved t-SNE

Country Status (1)

Country Link
CN (1) CN113741364A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114528921A (en) * 2022-01-20 2022-05-24 江苏大学 Transformer fault diagnosis method based on LOF algorithm and hybrid sampling

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105511445A (en) * 2015-12-01 2016-04-20 沈阳化工大学 Multi-modal process fault detection method based on local neighbor standardization matrix
CN111625576A (en) * 2020-05-15 2020-09-04 西北工业大学 Score clustering analysis method based on t-SNE
CN111949012A (en) * 2020-09-09 2020-11-17 兰州理工大学 Intermittent process fault detection method based on double-weight multi-neighborhood preserving embedding algorithm
CN112697268A (en) * 2020-12-04 2021-04-23 佳木斯电机股份有限公司 Motor anomaly detection integrated algorithm based on T-SNE

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105511445A (en) * 2015-12-01 2016-04-20 沈阳化工大学 Multi-modal process fault detection method based on local neighbor standardization matrix
CN111625576A (en) * 2020-05-15 2020-09-04 西北工业大学 Score clustering analysis method based on t-SNE
CN111949012A (en) * 2020-09-09 2020-11-17 兰州理工大学 Intermittent process fault detection method based on double-weight multi-neighborhood preserving embedding algorithm
CN112697268A (en) * 2020-12-04 2021-04-23 佳木斯电机股份有限公司 Motor anomaly detection integrated algorithm based on T-SNE

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DECHENG LIU ET AL.: "Fault Detection Based on Modified t-SNE", 2019CAA SYMPOSIUM ON FAULT DETECTION,SUPERVISION AND SAFETY FOR TECHNICAL PROCESSES(SAFEPROCESS), pages 269 - 273 *
马贺贺;胡益;侍洪波;: "基于马氏距离局部离群因子方法的复杂化工过程故障检测", 化工学报, no. 05, pages 1674 - 1682 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114528921A (en) * 2022-01-20 2022-05-24 江苏大学 Transformer fault diagnosis method based on LOF algorithm and hybrid sampling

Similar Documents

Publication Publication Date Title
CN110070141B (en) Network intrusion detection method
CN110532542B (en) Invoice false invoice identification method and system based on positive case and unmarked learning
US20220076150A1 (en) Method, apparatus and system for estimating causality among observed variables
CN104462196B (en) Multiple features combining Hash information search method
CN109858015B (en) Semantic similarity calculation method and device based on CTW (computational cost) and KM (K-value) algorithm
CN111461157B (en) Self-learning-based cross-modal Hash retrieval method
CN108733976B (en) Key protein identification method based on fusion biology and topological characteristics
CN114021799A (en) Day-ahead wind power prediction method and system for wind power plant
CN111612041A (en) Abnormal user identification method and device, storage medium and electronic equipment
CN110489424B (en) Tabular information extraction method and device, storage medium and electronic equipment
Zhou et al. Constructing ECOC based on confusion matrix for multiclass learning problems.
CN111597943B (en) Table structure identification method based on graph neural network
CN110674865A (en) Rule learning classifier integration method oriented to software defect class distribution unbalance
CN108561127B (en) Stratum pressure prediction method based on random simulation
CN110489423B (en) Information extraction method and device, storage medium and electronic equipment
Jiang et al. DeepDom: Predicting protein domain boundary from sequence alone using stacked bidirectional LSTM
CN114781688A (en) Method, device, equipment and storage medium for identifying abnormal data of business expansion project
Srimani et al. Adaptive data mining approach for PCB defect detection and classification
CN113741364A (en) Multi-mode chemical process fault detection method based on improved t-SNE
Celik et al. Biological cartography: Building and benchmarking representations of life
Shah et al. Analysis of different clustering algorithms for accurate knowledge extraction from popular datasets
CN114463587A (en) Abnormal data detection method, device, equipment and storage medium
CN112182337B (en) Method for identifying similar news from massive short news and related equipment
US11048730B2 (en) Data clustering apparatus and method based on range query using CF tree
Haraksim et al. Validation of likelihood ratio methods for forensic evidence evaluation handling multimodal score distributions

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination