CN110232062B

CN110232062B - Sewage treatment process monitoring method based on KPLS (kernel partial least squares) and FCM (fuzzy c-means)

Info

Publication number: CN110232062B
Application number: CN201910572930.7A
Authority: CN
Inventors: 周平; 张瑞垚; 王宏
Original assignee: Northeastern University China
Current assignee: Northeastern University China
Priority date: 2019-06-28
Filing date: 2019-06-28
Publication date: 2021-04-02
Anticipated expiration: 2039-06-28
Also published as: CN110232062A

Abstract

The invention relates to the technical field of sewage treatment quality monitoring and provides a sewage treatment process monitoring method based on KPLS and FCM. The method first collects data samples of the sewage treatment process under normal and abnormal working conditions, takes the data of the sewage treatment operation variables and of the effluent quality variables as the input and output data matrices respectively, and standardizes both matrices. It then constructs a KPLS model: the input samples are mapped to a high-dimensional feature space, a Gaussian kernel function is introduced to obtain the Gram matrix K, and the score matrix is solved. Next, the point density values of the input samples are calculated, a construction function is computed, and its image is plotted to determine the number of clusters. Finally, the score matrix is clustered with the FCM algorithm to obtain a membership matrix, and abnormal working conditions in the sewage treatment process are monitored according to the membership matrix. The invention can reduce the dimension of high-dimensional data, handle nonlinear data, determine the number of clusters accurately and conveniently, and improve the timeliness and accuracy of monitoring.

Description

Sewage treatment process monitoring method based on KPLS (kernel partial least squares) and FCM (fuzzy c-means)

Technical Field

The invention relates to the technical field of sewage treatment quality monitoring, in particular to a KPLS and FCM based sewage treatment process monitoring method.

Background

With the acceleration of urbanization and industrialization in China, society's demand for fresh water resources is increasing day by day, and the construction of urban domestic sewage treatment facilities must be accelerated to improve urban domestic sewage treatment capacity. The activated sludge process is currently the main method for treating urban sewage. Purification by activated sludge mainly comprises three stages: initial adsorption, microbial metabolism, and floc formation and settling. In essence, the microbial community in the activated sludge adsorbs, decomposes, and oxidizes the biodegradable organic matter in the sewage through a series of biochemical reactions, so that this organic matter is separated from the sewage and the sewage is purified.

At present, biochemical oxygen demand (BOD), chemical oxygen demand (COD), suspended solids (SS), ammonia nitrogen (NH), and total phosphorus (TP) are generally used as sewage discharge indexes. In the sewage treatment process, parameters such as influent flow rate, influent composition, pollutant concentration, and weather changes must be passively accepted, and the life activities of the microorganisms are affected by many factors such as dissolved oxygen concentration, microbial population, and the pH of the sewage, so it is very difficult to keep an urban sewage treatment plant running stably over the long term. A failure of the sewage treatment plant easily causes the effluent quality to fall below the standard, increases operating costs, and pollutes the environment. If an abnormal working condition in the sewage treatment process is not detected in time, no correct judgment can be made and no effective corrective measures can be taken promptly, which may cause irreversible losses. It is therefore especially important to monitor the sewage treatment process so that operators can identify abnormal working conditions accurately and take timely and accurate measures, thereby ensuring safe, stable, and smooth operation of sewage treatment and the quality of the effluent.

In recent years, sewage treatment process monitoring has turned to data mining methods, mainly because large amounts of data are available and widely usable, and there is an urgent need to convert these data into useful information and knowledge. Since sewage treatment process data carry no class labels and the occurrence of sewage treatment failures has little correlation with time, classification and sequential pattern mining are not suitable. Cluster analysis is an unsupervised classification technique in data mining that is well suited to analyzing data with little prior knowledge, so it is widely applied to sewage treatment process monitoring.

The fuzzy c-means (FCM) clustering algorithm is one of the classical clustering algorithms. FCM gives the degree of uncertainty with which a sample belongs to each class and thus establishes an uncertainty description of the sample with respect to the classes, which is more consistent with the objective world. However, sewage treatment process data are high-dimensional and nonlinear, and the traditional FCM algorithm cannot handle such data; this makes process monitoring more difficult, reduces the reliability of fault detection, strongly affects the effluent quality, and can cause economic losses and even accidents. In addition, the number of clusters in the FCM algorithm must be preset manually, which is a significant limitation in practical applications.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides the KPLS and FCM-based sewage treatment process monitoring method, which can reduce the dimension of high-dimensional data, process nonlinear data, accurately and conveniently determine the clustering number and improve the timeliness and the accuracy of sewage treatment process monitoring.

The technical scheme of the invention is as follows:

A sewage treatment process monitoring method based on KPLS and FCM, comprising the following steps:

Step 1: Collect data samples of the sewage treatment process under normal working conditions and data samples containing abnormal working conditions, where each data sample comprises m₁ sewage treatment operation variables and m₂ effluent quality variables; place the normal-condition data samples before the data samples containing abnormal conditions in time order to form a mixed data sample set; take the data of the m₁ sewage treatment operation variables in the mixed data sample set as the input data matrix X, and the data of the m₂ effluent quality variables in the mixed data sample set as the output data matrix Y;

Step 2: Preprocess the input data matrix X and the output data matrix Y; the preprocessing comprises calculating the mean and standard deviation of each variable in X and Y, and standardizing X and Y to zero mean and unit standard deviation;

Step 3: Construct a KPLS model for monitoring the sewage treatment process: map each input sample x in the input data matrix X to a high-dimensional feature space F, x → Φ(x) ∈ F, introduce a Gaussian kernel function to obtain the Gram matrix K of the input data matrix X, and center the Gram matrix K;

Step 4: Determine the number of principal components by cross-validation and solve the score matrix T;

Step 5: Calculate the point density value D_i of each input sample x_i in the input data matrix X, calculate the construction function S(j), plot the image of S(j), and determine the number of clusters c from the number of distinct slopes in the image of S(j);

Step 6: With c as the number of clusters, cluster the score matrix T with the FCM algorithm to obtain the membership matrix U, and monitor abnormal working conditions in the sewage treatment process according to U: if at some moment the membership degree of the sample to the cluster center of the normal working condition samples is less than μ, the sewage treatment process is abnormal at that sample.

The sewage treatment process adopts the activated sludge process. After primary treatment, the raw sewage enters the biochemical tank section for biological denitrification; one part is then returned through the internal recycle for further denitrification, and the other part enters the secondary sedimentation tank for settling. The biochemical tank section comprises biochemical tanks l ∈ {1,2,3,4,5}, where tanks l₁ ∈ {1,2} form the anoxic zone, which mainly carries out the denitrification reaction, and tanks l₂ ∈ {3,4,5} form the aerobic zone, which mainly carries out the nitrification reaction. In step 1, the m₁ sewage treatment operation variables comprise the influent flow rate, the influent ammonia nitrogen, the active heterotrophic biomass in tanks l ∈ {1,2,3,4,5}, the readily biodegradable organic matter in tanks l ∈ {1,2,3,4,5}, the nitrate nitrogen in tanks l₁ ∈ {1,2}, the active autotrophic biomass in tanks l₂ ∈ {3,4,5}, and the ammonia nitrogen in tanks l₂ ∈ {3,4,5}; the m₂ effluent quality variables comprise the biochemical oxygen demand, chemical oxygen demand, suspended solids, and ammonia nitrogen of the effluent.

The step 3 comprises the following steps:

Step 3.1: Construct the KPLS model for monitoring the sewage treatment process as

Φ = TP₁' + Φ_r

Y = TQ' + Y_r

Step 3.2: Map each input sample x in the input data matrix X to a high-dimensional feature space F, x → Φ(x) ∈ F, introduce a Gaussian kernel function to obtain the Gram matrix K of the input data matrix X, center the Gram matrix K, and convert the KPLS model into

K = TP₂' + E

Y = TQ' + Y_r

where the element in row i and column j of the Gram matrix K is K_ij = k(x_i, x_j) = <Φ(x_i), Φ(x_j)>; x_i and x_j are the ith and jth input samples in the input data matrix X; k(x_i, x_j) is the Gaussian kernel function; i, j ∈ {1, 2, ..., n}, and n is the number of samples in the input data matrix X; T is the score matrix of the high-dimensional data Φ = {Φ(x_i), i ∈ {1, 2, ..., n}}, T = [t₁, ..., t_A], and A is the number of principal components; P₁ = [p₁₁, ..., p_1A], P₂ = [p₂₁, ..., p_2A], and Q = [q₁, ..., q_A] are the loading matrices of the matrix Φ, the Gram matrix K, and the output data matrix Y, respectively; Φ_r, E, and Y_r are the modeling residuals of the matrix Φ, the Gram matrix K, and the output data matrix Y, respectively.

In step 4, the number of principal components A is determined by cross-validation and the score matrix T is solved as follows:

Step 4.1: Let u be any column of the output data matrix Y;

Step 4.2: Calculate the score vector: t = Ku;

Step 4.3: Normalize the score vector t: ‖t‖ → 1;

Step 4.4: Regress each column of the output data matrix Y on the score vector t: q = Y't;

Step 4.5: Calculate the new score vector of the output data matrix Y: u = Yq;

Step 4.6: Normalize the vector u: ‖u‖ → 1;

Step 4.7: Check whether u has converged: if so, go to step 4.8; otherwise, return to step 4.2;

Step 4.8: Deflate the matrices: K = (I − tt')K(I − tt'), Y = Y − tq', where I is the identity matrix; repeat steps 4.2 to 4.7 to calculate the next score vector, until A score vectors have been extracted.
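Steps 4.1 to 4.8 can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions rather than the patented implementation: the function name `kpls_scores`, the convergence tolerance, the iteration cap, and the synthetic data are hypothetical, the number of components A is passed in directly instead of being chosen by cross-validation, and the Gaussian kernel form exp(−‖x_i − x_j‖²/c₁) used to build the example Gram matrix is assumed.

```python
import numpy as np

def kpls_scores(K, Y, A, tol=1e-10, max_iter=500):
    """Extract A KPLS score vectors from a centered Gram matrix K and scaled Y."""
    K = K.copy()
    Y = Y.copy()
    n = K.shape[0]
    I = np.eye(n)
    T = np.zeros((n, A))
    for a in range(A):
        u = Y[:, 0].copy()                      # step 4.1: any column of Y
        for _ in range(max_iter):
            t = K @ u                           # step 4.2: t = Ku
            t /= np.linalg.norm(t)              # step 4.3: ||t|| -> 1
            q = Y.T @ t                         # step 4.4: q = Y't
            u_new = Y @ q                       # step 4.5: u = Yq
            u_new /= np.linalg.norm(u_new)      # step 4.6: ||u|| -> 1
            converged = np.linalg.norm(u_new - u) < tol   # step 4.7
            u = u_new
            if converged:
                break
        T[:, a] = t
        D = I - np.outer(t, t)
        K = D @ K @ D                           # step 4.8: deflate K
        Y = Y - np.outer(t, q)                  # step 4.8: deflate Y
    return T

# Synthetic example (assumed data, not from the patent)
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 5))
Y = np.column_stack([np.sin(X[:, 0]), X[:, 1] ** 2, X[:, 2] * X[:, 3], X[:, 4]])
Y = Y + 0.05 * rng.normal(size=Y.shape)
Y = (Y - Y.mean(axis=0)) / Y.std(axis=0, ddof=1)
d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
K = np.exp(-d2 / 25.0)                          # assumed Gaussian kernel, c1 = 5 * 5
J = np.eye(60) - np.ones((60, 60)) / 60
K = J @ K @ J                                   # centered Gram matrix
T = kpls_scores(K, Y, A=3)
```

Because K is deflated with (I − tt') on both sides, each new score vector is orthogonal to the previous ones, so the columns of T come out orthonormal.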

In step 3, the centered Gram matrix is

$$\tilde{K} = \left(E_n - \tfrac{1}{n}\,1_n 1_n'\right) K \left(E_n - \tfrac{1}{n}\,1_n 1_n'\right)$$

where $E_n$ is the n × n identity matrix, $1_n$ is the n-dimensional all-ones column vector, and $1_n'$ is the transpose of $1_n$.
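The centering of the Gram matrix can be illustrated with a short NumPy sketch. It assumes the usual double-centering form, (E_n − 1_n1_n'/n) K (E_n − 1_n1_n'/n), which is what the definitions of E_n and 1_n suggest; the function name `center_gram` and the small example matrix are hypothetical.

```python
import numpy as np

def center_gram(K):
    """Double-center a Gram matrix: (E_n - 1 1'/n) K (E_n - 1 1'/n)."""
    n = K.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n   # centering matrix
    return J @ K @ J

# Small symmetric example (assumed values)
K = np.array([[1.0, 0.5, 0.2],
              [0.5, 1.0, 0.3],
              [0.2, 0.3, 1.0]])
Kc = center_gram(K)
```

After centering, every row and every column of the matrix sums to zero, which corresponds to the mapped samples Φ(x_i) having zero mean in the feature space.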

Step 5 comprises the following steps:

Step 5.1: Calculate the point density value D_i of each input sample x_i in the input data matrix X as

[equation image not reproduced]

where

[equation image not reproduced]

and r_d is the effective radius of the neighborhood density;

Step 5.2: Calculate the construction function S(j) as

[equation image not reproduced]

Step 5.3: Plot the image of the construction function S(j), and take the number of distinct slopes in the image of S(j) as the number of clusters c.

Step 6 comprises the following steps:

Step 6.1: With c as the number of clusters, cluster the score matrix T with the FCM algorithm and construct the FCM objective function

$$J(U, V) = \sum_{i=1}^{n} \sum_{j=1}^{c} u_{ij}^{m}\, d_{ij}^{2}$$

where $\hat{t}_i$ is the ith row vector of the score matrix T, that is, the new A-dimensional sample obtained by reducing the $m_1$-dimensional input sample $x_i$; $u_{ij}$ is the membership degree of sample $\hat{t}_i$ to the jth cluster center $v_j$; the membership matrix is $U = (u_{ij})_{n \times c}$ and the cluster center matrix is $V = (v_j)_{c \times A}$; $m \in [1, +\infty)$ is the fuzzy index; and $d_{ij} = \lVert \hat{t}_i - v_j \rVert$ is the Euclidean distance between sample $\hat{t}_i$ and the jth cluster center $v_j$. The c cluster centers obtained by clustering the score matrix T comprise the cluster center of the normal working condition samples and the cluster centers of c − 1 classes of abnormal working condition samples;

Step 6.2: Solve the membership matrix U:

Step 6.2.1: Initialize the FCM algorithm parameters: set the fuzzy index m, the termination tolerance ε, and the maximum number of iterations count; set the iteration counter k = 1; randomly initialize the membership matrix $U^{(k)} = (u_{ij}^{(k)})_{n \times c}$ and the cluster center matrix $V^{(k)} = (v_j^{(k)})_{c \times A}$;

Step 6.2.2: Substitute $v_j^{(k)}$ into the formula

$$u_{ij}^{(k+1)} = \left[ \sum_{l=1}^{c} \left( \frac{\lVert \hat{t}_i - v_j^{(k)} \rVert}{\lVert \hat{t}_i - v_l^{(k)} \rVert} \right)^{\frac{2}{m-1}} \right]^{-1}$$

where $\hat{t}_i$ denotes the ith row vector of the score matrix T, to calculate the membership matrix of the (k+1)th iteration, $U^{(k+1)} = (u_{ij}^{(k+1)})_{n \times c}$;

Step 6.2.3: Substitute $u_{ij}^{(k+1)}$ into the formula

$$v_j^{(k+1)} = \frac{\sum_{i=1}^{n} \left(u_{ij}^{(k+1)}\right)^{m} \hat{t}_i}{\sum_{i=1}^{n} \left(u_{ij}^{(k+1)}\right)^{m}}$$

to calculate the cluster center matrix of the (k+1)th iteration, $V^{(k+1)} = (v_j^{(k+1)})_{c \times A}$;

Step 6.2.4: If $\lVert U^{(k+1)} - U^{(k)} \rVert < \varepsilon$ or the iteration counter k > count, stop iterating, take the result as the final membership matrix U, and go to step 6.3; otherwise, set k = k + 1 and return to step 6.2.2;

Step 6.3: Monitor abnormal working conditions in the sewage treatment process according to the membership matrix U: if the membership degree of the ith sample (the ith row of the score matrix T) to the cluster center of the normal working condition samples is less than μ, the sewage treatment process is abnormal at the ith sample.
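The iteration in step 6.2 can be sketched as follows, assuming the standard FCM membership and center updates (the equation images are not available in this text, so the exact form used in the patent is an assumption). The function name `fcm`, the default parameter values, the choice of initial centers from random samples, and the two-blob test data are all hypothetical.

```python
import numpy as np

def fcm(T, c, m=2.0, eps=1e-6, max_iter=300, seed=0):
    """Fuzzy c-means on the rows of T; returns memberships U (n x c) and centers V (c x A)."""
    rng = np.random.default_rng(seed)
    n = T.shape[0]
    # Step 6.2.1: random initial membership matrix and cluster centers
    U = rng.random((n, c))
    U /= U.sum(axis=1, keepdims=True)
    V = T[rng.choice(n, size=c, replace=False)].copy()
    for _ in range(max_iter):
        # Euclidean distances d_ij between every sample and every center
        d = np.linalg.norm(T[:, None, :] - V[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)                          # avoid division by zero
        # Step 6.2.2: membership update from the current centers
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        # Step 6.2.3: center update from the new memberships
        W = U_new ** m
        V = (W.T @ T) / W.sum(axis=0)[:, None]
        # Step 6.2.4: stop when the membership matrix no longer changes
        converged = np.linalg.norm(U_new - U) < eps
        U = U_new
        if converged:
            break
    return U, V

# Two well-separated synthetic groups standing in for score vectors (assumed data)
rng = np.random.default_rng(1)
T = np.vstack([rng.normal(0.0, 0.1, size=(40, 3)),
               rng.normal(2.0, 0.1, size=(60, 3))])
U, V = fcm(T, c=2, m=2.4)
labels = U.argmax(axis=1)
```

Each row of U sums to 1, so thresholding the column that belongs to the normal-condition cluster, as in step 6.3, is a direct comparison against μ.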

The invention has the beneficial effects that:

(1) The invention combines the KPLS algorithm with the FCM algorithm and constructs a KPLS model and an FCM model to describe the normal production process; no prior knowledge of the abnormal working conditions in the sewage treatment process is needed, and only the normal working condition data are used as labeled data. First, following a data-driven approach, a Gaussian kernel function is used to project the standardized process variables into a high-dimensional feature space, and a KPLS model for monitoring the sewage treatment process is established in that space. After the number of principal components is determined by cross-validation, the dimension of the high-dimensional input data is reduced, and the resulting score matrix T is used as the input data for cluster analysis in the FCM algorithm. This achieves dimension reduction and at the same time overcomes the limitation that FCM cannot handle nonlinear data.

(2) The invention calculates the construction function from the density function and determines the number of clusters from the construction function, so that the number of clusters is determined accurately and conveniently, overcoming the limitation that the number of clusters in the FCM algorithm must be preset manually.

(3) The invention clusters the score matrix T with the FCM algorithm to obtain the membership matrix U and monitors abnormal working conditions in the sewage treatment process according to U. The time at which an abnormal working condition occurs is detected from the sample memberships, and the number of abnormal working conditions can be identified at the same time. Monitoring is timely and accurate, which makes it easier for operators to monitor the sewage treatment process, judge fluctuations in effluent quality accurately, and take timely corrective measures, thereby ensuring stable, efficient, and safe operation of the sewage plant and the quality of the effluent.

Drawings

FIG. 1 is a flow chart of the KPLS and FCM based sewage treatment process monitoring method of the present invention;

FIG. 2 is a schematic diagram of the construction function in an embodiment of the present invention;

FIG. 3 is a schematic diagram of the membership degree of the monitoring samples to the cluster center of the normal working condition samples in an embodiment of the present invention;

FIG. 4 is a schematic diagram of the membership degree of the monitoring samples to the cluster center of the abnormal working condition samples in an embodiment of the present invention;

FIG. 5 is a schematic diagram of the clustering result of the score matrix T in an embodiment of the present invention.

Detailed Description

The invention will be further described with reference to the accompanying drawings and specific embodiments.

FIG. 1 shows a flow chart of the KPLS and FCM based sewage treatment process monitoring method of the present invention. The method comprises the following steps:

Step 1: Collect data samples of the sewage treatment process under normal working conditions and data samples containing abnormal working conditions, where each data sample comprises m₁ sewage treatment operation variables and m₂ effluent quality variables; place the normal-condition data samples before the data samples containing abnormal conditions in time order to form a mixed data sample set; take the data of the m₁ sewage treatment operation variables in the mixed data sample set as the input data matrix X, and the data of the m₂ effluent quality variables in the mixed data sample set as the output data matrix Y.

In this embodiment, the sewage treatment process adopts the activated sludge process. The activated sludge process is generally divided into primary, secondary, and tertiary treatment according to the degree of treatment. After primary treatment, the raw sewage enters the biochemical tank section for biological denitrification; one part is then returned through the internal recycle for further denitrification, and the other part enters the secondary sedimentation tank for settling. The biochemical tanks are the most important place where the biochemical reactions are completed and the sewage is purified. The biochemical tank section comprises biochemical tanks l ∈ {1,2,3,4,5}, where tanks l₁ ∈ {1,2} form the anoxic zone, which mainly carries out the denitrification reaction, and tanks l₂ ∈ {3,4,5} form the aerobic zone, which mainly carries out the nitrification reaction. In step 1, the m₁ sewage treatment operation variables comprise the influent flow rate, the influent ammonia nitrogen, the active heterotrophic biomass in tanks l ∈ {1,2,3,4,5}, the readily biodegradable organic matter in tanks l ∈ {1,2,3,4,5}, the nitrate nitrogen in tanks l₁ ∈ {1,2}, the active autotrophic biomass in tanks l₂ ∈ {3,4,5}, and the ammonia nitrogen in tanks l₂ ∈ {3,4,5}; the m₂ effluent quality variables comprise the biochemical oxygen demand, chemical oxygen demand, suspended solids, and ammonia nitrogen of the effluent.

The abnormal working conditions, such as sludge bulking, foaming, scum, toxic shock, and rainstorm weather, are well known to those skilled in the art. In this embodiment, 200 sewage treatment process data samples under normal working conditions and 800 sewage treatment process data samples containing a rainstorm-weather abnormal condition are collected to form a mixed data sample set of 1000 samples. The data of the m₁ = 20 sewage treatment operation variables in the mixed data sample set are taken as the input data matrix X ∈ R^(1000×20), and the data of the m₂ = 4 effluent quality variables in the mixed data sample set are taken as the output data matrix Y ∈ R^(1000×4).

Step 2: preprocessing an input data matrix X and an output data matrix Y; the preprocessing comprises the steps of calculating the mean value and the standard deviation of each variable in the input data matrix X and the output data matrix Y, and normalizing the input data matrix X and the output data matrix Y into data with zero mean value and unit standard deviation.
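The standardization in step 2 can be sketched as follows. This is a minimal NumPy illustration, not the patent's implementation: the function name `standardize`, the use of the sample standard deviation (`ddof=1`), the guard for constant columns, and the synthetic data are assumptions.

```python
import numpy as np

def standardize(M):
    """Scale each column (variable) of M to zero mean and unit standard deviation."""
    mean = M.mean(axis=0)
    std = M.std(axis=0, ddof=1)            # sample standard deviation (assumption)
    std = np.where(std == 0, 1.0, std)     # leave constant columns unscaled (assumption)
    return (M - mean) / std, mean, std

# Synthetic stand-in for the 1000 x 20 input data matrix X
rng = np.random.default_rng(0)
X = rng.normal(loc=5.0, scale=2.0, size=(1000, 20))
Xs, x_mean, x_std = standardize(X)
```

Returning the mean and standard deviation alongside the scaled matrix makes it possible to apply the same scaling to samples collected later.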

Step 3: Construct a KPLS model for monitoring the sewage treatment process: map each input sample x in the input data matrix X to a high-dimensional feature space F, x → Φ(x) ∈ F, introduce a Gaussian kernel function to obtain the Gram matrix K of the input data matrix X, and center the Gram matrix K.


The KPLS model is constructed with the nonlinear iterative partial least squares algorithm; KPLS stands for kernel projection to latent structures. In this embodiment, the Gaussian kernel function is

$$k(x_i, x_j) = \exp\!\left(-\frac{\lVert x_i - x_j \rVert^{2}}{c_1}\right)$$

where c₁ is the Gaussian kernel width parameter, determined empirically as 5m₁, that is, c₁ = 5 × m₁ = 100.
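As an illustration, the Gram matrix for a Gaussian kernel with width c₁ = 5m₁ can be computed as below. The kernel form exp(−‖x_i − x_j‖²/c₁) is an assumption, since the equation image is not available in this text; the function name `gaussian_gram` and the random input data are hypothetical.

```python
import numpy as np

def gaussian_gram(X, c1):
    """Gram matrix with K_ij = exp(-||x_i - x_j||^2 / c1) (assumed kernel form)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)   # squared pairwise distances
    d2 = np.maximum(d2, 0.0)                           # clip small negative round-off
    return np.exp(-d2 / c1)

m1 = 20                      # number of operation variables in the embodiment
c1 = 5 * m1                  # empirical kernel width, c1 = 100
X = np.random.default_rng(0).normal(size=(50, m1))    # synthetic standardized samples
K = gaussian_gram(X, c1)
```

The resulting matrix is symmetric with ones on the diagonal, and it would still need to be centered before the score vectors are extracted.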


In this embodiment, the number of principal components A is determined to be 3 by cross-validation.

Step 5: Calculate the point density value D_i of each input sample x_i in the input data matrix X, calculate the construction function S(j), plot the image of S(j), and determine the number of clusters c from the number of distinct slopes in the image of S(j).


As shown in FIG. 2, the slope of the construction function S(j) reflects the point density value of the sample data; in practical terms, S(j) has the same slope over data of the same class. As can be seen from FIG. 2, the image has distinct transitions around 200 and 700, so that it can be roughly divided into two parts, (0,500) ∪ (700,1000) and (500,700). Of the 1000 test data, the first 200 are normal working condition data and the last 800 are data containing abnormal working conditions. From this analysis, the mixed data sample set is judged to fall into two classes: class 1 is the normal working condition sample class and class 2 is the abnormal working condition sample class, so the number of clusters is determined to be c = 2.


The fuzzy index m affects the degree of fuzziness of the membership matrix. In this embodiment, the fuzzy index is set to m = 2.4, which gives the best algorithm performance.

In this embodiment, the membership degrees u_i1 and u_i2 of the monitoring samples to the cluster center v₁ of the normal working condition samples and the cluster center v₂ of the abnormal working condition samples are shown in FIG. 3 and FIG. 4, respectively, and the clustering result of the score matrix T is shown in FIG. 5. μ is set to 0.5. As can be seen from FIG. 3 and FIG. 4, at the 500th and 700th samples the membership degree of the sample to the cluster center of the normal working condition samples is less than 0.5, so the sewage treatment process is judged to be abnormal at the 700th sample. The monitoring method can therefore detect the occurrence of abnormal working conditions in the sewage treatment process in a timely manner.
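The threshold rule used for monitoring can be sketched in a few lines. The helper name `flag_abnormal`, the way the normal-condition cluster is identified (highest mean membership over the leading samples known to be normal), and the toy membership matrix are assumptions made for illustration only.

```python
import numpy as np

def flag_abnormal(U, n_normal, mu=0.5):
    """Return the normal-cluster column and the indices of samples flagged abnormal."""
    # Assumption: the first n_normal samples are known normal-condition data
    normal_idx = int(U[:n_normal].mean(axis=0).argmax())
    # A sample is abnormal if its membership to the normal cluster is below mu
    alarms = np.flatnonzero(U[:, normal_idx] < mu)
    return normal_idx, alarms

# Toy membership matrix with c = 2 clusters (assumed values)
U = np.array([[0.90, 0.10],
              [0.80, 0.20],
              [0.30, 0.70],
              [0.45, 0.55],
              [0.95, 0.05]])
normal_idx, alarms = flag_abnormal(U, n_normal=2, mu=0.5)
print(normal_idx, alarms.tolist())   # prints: 0 [2, 3]
```

With μ = 0.5 and two clusters, this rule is equivalent to assigning each sample to the cluster with the larger membership.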

It is to be understood that the above-described embodiments are only a few embodiments of the present invention, and not all embodiments. The above examples are only for explaining the present invention and do not constitute a limitation to the scope of protection of the present invention. All other embodiments, which can be derived by those skilled in the art from the above-described embodiments without any creative effort, namely all modifications, equivalents, improvements and the like made within the spirit and principle of the present application, fall within the protection scope of the present invention claimed.

Claims

1. A KPLS and FCM based sewage treatment process monitoring method is characterized by comprising the following steps:

Step 1: Collect data samples of the sewage treatment process under normal working conditions and data samples containing abnormal working conditions, where each data sample comprises m₁ sewage treatment operation variables and m₂ effluent quality variables; place the normal-condition data samples before the data samples containing abnormal conditions in time order to form a mixed data sample set; take the data of the m₁ sewage treatment operation variables in the mixed data sample set as the input data matrix X, and the data of the m₂ effluent quality variables in the mixed data sample set as the output data matrix Y;

Step 5 comprises the following steps:

where

[equation image not reproduced]

and r_d is the effective radius of the neighborhood density;

Step 5.2: Calculate the construction function S(j) as

[equation image not reproduced]

Step 5.3: Plot the image of the construction function S(j), and take the number of distinct slopes in the image of S(j) as the number of clusters c;

2. The KPLS and FCM based sewage treatment process monitoring method of claim 1, wherein the sewage treatment process adopts the activated sludge process; after primary treatment, the raw sewage enters the biochemical tank section for biological denitrification, one part is then returned through the internal recycle for further denitrification, and the other part enters the secondary sedimentation tank for settling; the biochemical tank section comprises biochemical tanks l ∈ {1,2,3,4,5}, where tanks l₁ ∈ {1,2} form the anoxic zone, which mainly carries out the denitrification reaction, and tanks l₂ ∈ {3,4,5} form the aerobic zone, which mainly carries out the nitrification reaction; in step 1, the m₁ sewage treatment operation variables comprise the influent flow rate, the influent ammonia nitrogen, the active heterotrophic biomass in tanks l ∈ {1,2,3,4,5}, the readily biodegradable organic matter in tanks l ∈ {1,2,3,4,5}, the nitrate nitrogen in tanks l₁ ∈ {1,2}, the active autotrophic biomass in tanks l₂ ∈ {3,4,5}, and the ammonia nitrogen in tanks l₂ ∈ {3,4,5}; the m₂ effluent quality variables comprise the biochemical oxygen demand, chemical oxygen demand, suspended solids, and ammonia nitrogen of the effluent.

3. A KPLS and FCM based sewage treatment process monitoring method according to claim 1 or 2, wherein said step 3 comprises the steps of:

Φ＝TP₁'+Φ_r

Y＝TQ'+Y_r

K＝TP₂'+E

Y＝TQ'+Y_r

where the element in row i and column j of the Gram matrix K is K_ij = k(x_i, x_j) = <Φ(x_i), Φ(x_j)>; x_i and x_j are the ith and jth input samples in the input data matrix X; k(x_i, x_j) is the Gaussian kernel function; i, j ∈ {1, 2, ..., n}, and n is the number of samples in the input data matrix X; T is the score matrix of the high-dimensional data Φ = {Φ(x_i), i ∈ {1, 2, ..., n}}, T = [t₁, ..., t_A], and A is the number of principal components; P₁ = [p₁₁, ..., p_1A], P₂ = [p₂₁, ..., p_2A], and Q = [q₁, ..., q_A] are the loading matrices of the matrix Φ, the Gram matrix K, and the output data matrix Y, respectively; Φ_r, E, and Y_r are the modeling residuals of the matrix Φ, the Gram matrix K, and the output data matrix Y, respectively.

4. The KPLS and FCM based sewage treatment process monitoring method of claim 3, wherein in step 4 the number of principal components A is determined by cross-validation and the score matrix T is solved by the following steps:

Step 4.1: Let u be any column of the output data matrix Y;

Step 4.2: Calculate the score vector: t = Ku;

Step 4.3: Normalize the score vector t: ‖t‖ → 1;

Step 4.5: Calculate the new score vector of the output data matrix Y: u = Yq;

Step 4.6: Normalize the vector u: ‖u‖ → 1;

5. The KPLS and FCM based sewage treatment process monitoring method of claim 3, wherein in step 3 the centered Gram matrix is