CN109582003A - Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis - Google Patents
Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis Download PDFInfo
- Publication number
- CN109582003A CN109582003A CN201811462243.1A CN201811462243A CN109582003A CN 109582003 A CN109582003 A CN 109582003A CN 201811462243 A CN201811462243 A CN 201811462243A CN 109582003 A CN109582003 A CN 109582003A
- Authority
- CN
- China
- Prior art keywords
- cluster
- point
- sample
- matrix
- discriminant analysis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B23/00—Testing or monitoring of control systems or parts thereof
- G05B23/02—Electric testing or monitoring
- G05B23/0205—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
- G05B23/0259—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterized by the response to fault detection
- G05B23/0262—Confirmation of fault detection, e.g. extra checks to confirm that a failure has indeed occurred
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01M—TESTING STATIC OR DYNAMIC BALANCE OF MACHINES OR STRUCTURES; TESTING OF STRUCTURES OR APPARATUS, NOT OTHERWISE PROVIDED FOR
- G01M13/00—Testing of machine parts
- G01M13/04—Bearings
- G01M13/045—Acoustic or vibration analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/20—Pc systems
- G05B2219/24—Pc safety
- G05B2219/24065—Real time diagnostics
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Automation & Control Theory (AREA)
- Acoustics & Sound (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Testing Of Devices, Machine Parts, Or Other Structures Thereof (AREA)
Abstract
Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, it is characterised in that: method includes the following steps: (1) collects the segmented rear composing training sample of vibration signal under bearing difference work shape;(2) feature extraction is carried out to the training sample that (1) obtains;(3) feature normalization of (2) is handled;(4) cluster labels collection is acquired using density peaks cluster to all characteristic sets of (3);(5) divergence and the interior divergence regularization term of cluster between using the cluster pseudo label of (4) to construct Local Clustering, and combined with divergence in the class scatter and class for having exemplar in FDA, determine final projection vector;(6) projection vector of (5) is utilized to seek projection set of the label characteristics collection in dimension reduction space;(7) the projection set training extreme learning machine of (6) is utilized;(8) vibration signal of collection is input in model after (2), (3) and (5) processing and determines operating condition.The present patent application is applied to the fault identification problem of bearing apparatus.
Description
Technical field:
The present application relates to a kind of bearing apparatus fault diagnosis fields, more particularly to one kind to be based on pseudo label semi-supervised kernel
Local Fei Sheer discriminant analysis bearing failure diagnosis.
Background technique:
In industrial circle, in order to increase the reliability of equipment performance, reduction is caused under yield due to mechanical disorder
The monitoring of the probability of drop, state of runtime machine is increasingly valued by people.Rotating machinery is in industrial department using the most
Extensive one kind mechanical equipment, many machinery such as steam turbine, compressor, blower and milling train belong to this kind.However, its core
Component bearing usually influences its normal work due to various various forms of failures, can even cause sometimes due to certain failure serious
Disastrous accident, and result in significant economic losses, therefore the research for carrying out fault diagnosis has highly important reality
Meaning.
Mechanical fault diagnosis is exactly to utilize signal processing and analyzing technology to the signal containing fault message measured,
It finds out characteristic parameter related with failure and is differentiated using real-time technique state of these characteristic parameters to equipment.Here it relates to
And to two aspect the problem of, first is that utilize signal processing technology carry out feature extraction;Second is that being carried out using mode identification technology
Fault diagnosis.In terms of signal characteristic abstraction, be broadly divided into: the mean value of the temporal signatures of signal such as signal, mean-square value, peak value,
Kurtosis and flexure etc.;Frequency domain character of signal such as energy spectrum, AR power spectrum etc.;And the time-frequency characteristics of signal such as wavelet analysis,
Hilbert transformation and Short Time Fourier Transform etc..In order to fully characterize different classes of failure and then improve discrimination, just
A variety of different characteristics are needed to be merged, this causes computation complexity to improve same but also the dimension of feature vector greatly increases
When also extend time of fault diagnosis.Therefore how to be able to achieve reasonable Data Dimensionality Reduction is just particularly important.Principal component point
The classic algorithm of (Principal Components Analysis, PCA) as Data Dimensionality Reduction is analysed, because feature can be effectively removed
Between linearly related keep the main information of primitive character simultaneously and be widely used in fault diagnosis field.Locality preserving projections
(Locality Preserving Projections, LPP) is the linear approximation of nonlinear method LaplacianEigenmap,
As a kind of new subspace analysis method, initial data non-linearity manifold office is difficult to keep because can solve principal component analytical method
The problem of portion's structure and be used widely.However, PCA and LPP belong to unsupervised dimension-reduction algorithm, in dimensionality reduction learning process
It fails to using known classification information to make the feature after dimensionality reduction be unfavorable for the differentiation between classification.Fisher discriminant analysis is made
There is supervision dimension reduction method for one, because by maximization class scatter and divergence in class can be minimized using existing classification information
Method optimizing reduced order subspace so that the feature after dimensionality reduction is conducive to the differentiation between classification and then is widely used in various classification
Field.Although the feature after FDA algorithm dimensionality reduction is conducive to improve the classification performance of algorithm, there is supervision dimensionality reduction since it belongs to
Method, therefore need a large amount of label informations that could obtain preferable Generalization Capability in advance.However in practical application, especially event
Hinder diagnostic field, being limited to obtain by various conditions largely has the sample of label very difficult, therefore usually occurs only a small amount of
The situation of a large amount of unlabeled exemplars residues with the presence of exemplar.Enough there are exemplar, FDA and its improvement due to lacking
Algorithm usually will appear over-fitting and then lead to Generalization Capability degradation.Therefore, how to utilize these largely without mark
This guidance of signed-off sample has supervision dimensionality reduction study to become the emphasis that scholars pay close attention to.In consideration of it, in order to using largely without label sample
This raising algorithm differentiates that performance, the present invention carry out clustering to sample using the clustering algorithm based on density peaks first and obtain
Then pseudo label keeps unlabeled exemplars by divergence in increase standardization item to the class of part FDA algorithm and class scatter
Cluster structural integrity, finally by with maintain exemplar class scatter maximize and class in divergence minimum part
FDA algorithm objective function solves best projection vector together.Through proposed by the present invention based on pseudo label semi-supervised kernel part
Coefficient vector after Fisher discriminant analysis method dimensionality reduction has better separating capacity and then is conducive to sentencing for subsequent classifier
Not, so that performance of fault diagnosis is greatly improved.
In terms of mode identification method, neural network and algorithm of support vector machine (SVM) are because of its good non-linear differentiation
Ability has been widely applied to fault diagnosis field.But the above method need training parameter it is more, cause the time longer and
It is easily ensnared into locally optimal solution.Extreme learning machine (extreme learning machine) ELM as it is a kind of it is easy to use,
Effective single hidden layer feedforward neural network learning algorithm, do not need the input weight for adjusting network during the execution of the algorithm and
The biasing of hidden member, and unique optimal solution can be generated, therefore have the advantages that pace of learning is fast and Generalization Capability is good, it is very suitable
Together in the very high fault diagnosis field of the classification problem under big data era, especially requirement of real-time.For this purpose, the present invention plans
It is combined based on the Fisher discriminant analysis of pseudo label semi-supervised kernel part and extreme learning machine to realize the fast of bearing apparatus failure
Speed diagnosis can realize Data Dimensionality Reduction from signal processing angle, when improving diagnosis while keeping different classes of separating capacity
Effect;Slave pattern identifies angle again can realize the quick diagnosis of fault category by extreme learning machine, reduce runing time.
Summary of the invention:
1, it is based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, it is characterised in that: this method
The following steps are included:
(1) it is collected using the vibration acceleration sensor being mounted on the spring bearing upper end casing of induction conductivity output shaft
Vibration signal of the bearing under different work shapes, if there are four types of work shapes altogether: normal condition, inner ring malfunction, outer ring failure shape
State and rolling element malfunction, when being respectively then that 1024 segment processings obtain entire to the signal progress length under various operating conditions
Domain signal set Sm∈R1024×m, wherein m indicates the number of entire time-domain signal training sample set, Sn∈R1024×nTo there is label
Time-domain signal sample set, wherein n indicates the number n < < m of label time-domain signal training sample set;
(2) feature extraction is carried out to the training samples information that step (1) obtains, obtains to shake under the various operating conditions of effecting reaction
Dynamic signal characteristic set, if sharing d feature, then entire training sample characteristic set X* m∈Rd×m;There is exemplar feature set
Close X* n∈Rd×n;
(3) the feature training sample set obtained to step (2) is standardized, and makes the numerical value of each characteristic index
Range determines that in mean value be 0, in the standardized normal distribution section that variance is 1;Entire training sample characteristic set after then normalizing
Xm∈Rd×m;There is exemplar characteristic set X after normalizationn∈Rd×n;
(4) the entire training sample characteristic set X after the normalization obtained to step (3)mIt is clustered and is calculated using density peaks
Method acquires cluster labels setAnd whether be boundary point identification sets platform
(5) the cluster labels set obtained using step (4)With identification sets platformBetween construction Local Clustering
Divergence SulbWith divergence S in Local ClusteringulwRegularization term, and have exemplar X in the Fisher discriminant analysis of partnInstitute is right
The local class scatter S answeredlbDivergence S in drawn game categorylwIt optimizes together, determines final projection vector Tss-KLFDA∈
Rm×r, wherein dimensionality reduction dimension is r < < d;
(6) projection vector T obtained in step (5) is utilizedss-KLFDASolve XmProjection vector in r dimension reduced order subspace
Set Zm∈Rr×mAnd XnProjection vector set Zn∈Rr×n;
(7) projection vector set Z after the dimensionality reduction acquired in step (6) is utilizedn∈Rr×nTraining extreme learning machine model Melm;
(8) it is collected by the vibration acceleration sensor being mounted on the spring bearing upper end casing of induction conductivity output shaft
The vibration signal of the bearing is 1024 segment processings according to the method for step (2) calculating feature vector through length
X is obtained after method normalization by step (3)new∈Rd×1, the projection vector T that is obtained using step (5)ss-KLFDASolve Xnew?
R ties up the projection vector set Z in reduced order subspacenew∈Rr×1It is then input to trained model MelmThe middle current bearing of determination
Final working condition.
2, according to claim 1 to be examined based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing fault
It is disconnected, which is characterized in that be to extract vibration signal time-domain snapshots 12 statistics to method used by the feature extraction in step (2)
Feature, including average value, root mean square, variance, standard deviation, rectified mean value, peak-to-peak value, kurtosis value, peak factor, wave
The shape factor, the kurtosis factor, the pulse factor, the nargin factor carry out 5 layers of small wavelength-division to vibration signal time-domain snapshots using DB4 small echo
The energy spectrum and energy spectrum entropy totally 12 dimension fault signatures of 5 details coefficients and 1 approximation component are solved and extracted, and vibration is believed
Number time-domain snapshots carry out 5 floor empirical mode decomposition and extract the energy spectrum and energy spectrum entropy of 5 Intrinsic mode functions and 1 remainder
Totally 12 dimension fault signature, amounts to d=32 dimensional feature after combination.
3, according to claim 1 to be examined based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing fault
It is disconnected, which is characterized in that method used by being standardized in step (3) to each index value is z-score method, tool
Steps are as follows for body: settingTo any indexIt is standardized place
Method used by managing is as follows:
μ is recorded simultaneouslyi, σi, the standardization of i=1 ..., d in case of new samples.
4, according to claim 1 to be examined based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing fault
It is disconnected, which is characterized in that acquisition cluster labels Ji Tai in step (3)And whether be boundary point identification sets platformUsed method is utilized based on density peaks clustering algorithm to all sample set XmClustering is carried out,
Specific step is as follows: data-oriented collection Xm={ x1, x2..., xi..., xm}∈Rd×m, wherein xi∈RdRepresent i-th of sample to
Amount, for each sample point xiIts local density values ρ is quantitatively calculated firstiWith Distance Density higher sample point away from
From δi, embody are as follows:
dijFor xiAnd xjEuclidean distance, dcFor distance is truncated, it is arranged so that the average distance number of each data point is
The 2% of data point sum, further setsIt indicatesThe lower sequence of descending arrangement, i.e. satisfaction:Then
The local density values ρ of all sample points is obtainediWith the distance δ of the higher sample point of Distance DensityiAfterwards, with part
Density piFor horizontal axis, distance δiDraw X-Y scheme for the longitudinal axis, be called decision diagram, choose those with higher local density and
The point of relatively high distance is as cluster centre, it is determined that after cluster centre, remaining each point is attributed to apart from it most
Close density is higher than cluster belonging to its point, if XmInclude ncA cluster,For the corresponding data point of each cluster centre
Number, i.e. mjA sample is the cluster centre of j-th of cluster,For the cluster labels set of all data points, i.e.,
ciIndicate data set XmIn i-th of data point xiBelong to ciA cluster, its initialization definitions are as follows:
dcPoint set, then the highest point of density in its borderline region is found for each cluster, and with the close of the point
Degree is used as valve primaryChang screens the noise spot of the cluster, i.e., only retains the point that density in cluster is greater than or equal to the threshold values, enableNormal point and boundary point identification are represented, if hi=1 is expressed as boundary point, otherwise hi=0 is expressed as normal point,
4, according to claim 1 to be examined based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing fault
It is disconnected, which is characterized in that T in step (5)ss-KLFDACalculation method is as described below:
Divergence S between construction Local ClusteringulbS is expressed as with matrix formulb=XmLulbXm T, wherein Lulb=Dulb-Wulb∈
Rm×m, Dulb∈Rm×mIt is a diagonal matrix, its i-th of diagonal entry isLikewise, office
Portion clusters interior divergence SulwS can also be expressed as with matrix formulw=XmLulwXm T, wherein Lulw=Dulw-Wulw∈Rm×m, Dulw
∈Rm×mIt is a diagonal matrix, its i-th of diagonal entry is
Here Wulb, WulwIt is the matrix of m × m, and
WhereinRepresentative belongs to cluster ciSample size,σi=| | xi-xi (k)|
|, xi (k)It is xiKth=7 neighbours, | | | | indicate Euclidean distance;
Construct local class scatter matrix SlbScatter Matrix S in drawn game categorylw, SlbIt can also be expressed as with matrix form
Slb=XnLlbXn T.Wherein, Llb=Dlb-Wlb∈Rn×n, Dlb∈Rn×nIt is a diagonal matrix, its i-th of diagonal entry isLikewise, SlwS can also be expressed as with matrix formlw=XnLlwXn T.Wherein, Llw=Dlw-Wlw∈
Rn×n, Dlw∈Rn×nIt is a diagonal matrix, its i-th of diagonal entry isHere Wlb, WlwIt is n
The matrix of × n, and
It indicates in class yiThere is the quantity of exemplar in ∈ { 1,2 ..., c }, c classification number is 4 here,
By LlbAnd LlwIt is extended to m m matrix by zero-padding, is embodied as follows:
Thenβ=0.5,
Construction feature equation KLsslbKa=λ KLsslwKa, above formula can regard generalized eigenvalue λ as1≥λ2>=..., >=λm
Generalized eigenvector corresponding with itsGeneralized-grads Theory.K represents nuclear matrix, wherein Kij=κ
(xi, xj), κ (xi, xj) it is gaussian kernel function: κ (xi, xj)=exp (- | | xi-xj||2/σ2), σ represents core width, here σ=
0.5;Therefore, final projection vector Tss-KLFDA∈Rm×rIt is expressed asφ(Xm) represent XmIn nuclear space
Projection vector.
5, according to claim 1 to be examined based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing fault
It is disconnected, which is characterized in that the method for solving of projection vector of the step (6) in r dimension reduced order subspace is as follows: for new samples x,
The feature representation of reduced order subspace is as follows:
X → z=TT ss-KLFDAφ (x)=(a1, a2..., ar)Tφ(Xm)Tφ(x)
=(a1, a2..., ar)TK (:, x)
Enable Ta ss-KLFDA=(a1, a2..., ar), then above formula can be further expressed asHere K (:, x)=[κ (x1, x), κ (x2, x) ..., κ (xm, x)]T。
6, according to claim 1 to be examined based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing fault
It is disconnected, which is characterized in that M in step (7)elmModel is trained by following method: hidden layer number L is determined first, output layer
Number is classification number c, is set as 4, as 4 kinds of operating conditions, random initializtion input weight and bias matrix here, is set as defeated
Enter weight PInput∈RL×r, hidden neuron biasing Binput∈RL×1, extend BinputFor B ∈ RL×(n), calculate hidden layer output matrix H
∈RL×(n):
Construct data category matrix of consequence T ∈ R(n)×c, tij=1, work as xijWhen ∈ j class, other are -1, acquire output weight
Matrix β ∈ RL×c, β=H+T, H+For the Moore-Penrose generalized inverse matrix of H, extreme learning machine mould after training is finally obtained
Type: Melm={ Pinput, Binput, β };
7, according to claim 1 to be examined based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing fault
It is disconnected, which is characterized in that the feature vector, X of new samples in step (8)newDetermination using the feature extraction side in step (2)
Method, that standardization processing method utilizes is the μ of step (3) storagei, σi, i=1,2 ... d carry out z-score standardization, utilize step
Suddenly the T that (5) obtainss-KLFDACalculate XnewProjection vector Z in r dimension reduced order subspacenew∈Rr×1, the determination of end-state by
MelmModel is determining,
Ttest=Htest T×β
Take max (Htest) corresponding to subscript be current bearing equipment working condition output.
Beneficial effects of the present invention:
1. Method for Bearing Fault Diagnosis of the invention, feature extracting method is using can be conducive to improve classification performance
Have a supervision dimension reduction method, while making full use of unlabeled exemplars information guiding to have label dimensionality reduction using semi-supervised learning mechanism
It practises, the dimensionality reduction feature made has more distinction, and can effectively avoid FDA and its innovatory algorithm because receiving exemplar quantity
Limit the over-fitting occurred.
2. Method for Bearing Fault Diagnosis of the invention carries out cluster point to sample using the clustering algorithm based on density peaks
Analysis obtains pseudo label, and compared with K-means algorithm and spectral clustering, which does not need specified cluster number in advance and initial
Cluster centre and the cluster that can recognize that various shape and size, therefore the pseudo label being very suitable under cluster number unknown situation
Solve problems.In addition, the extreme learning machine that the present invention is also exceedingly fast using training speed is greatly improved as the method for pattern-recognition
The timeliness of fault diagnosis.Therefore the present invention combines two kinds of algorithms, can improve diagnosis efficiency from signal processing angle, from
Pattern-recognition angle can be reduced runing time again.
3. Method for Bearing Fault Diagnosis of the invention, by increasing divergence and class scatter in regularization term to class simultaneously
Mode keeps unlabeled exemplars to cluster structural integrity, only considers that the global and local space structure of holding is consistent with PCA and LPP
Property it is different, this method can greatly enhance separating capacity between the class of dimensionality reduction feature and have good robustness.
Detailed description of the invention:
Attached drawing 1 is that cluster labels determine schematic diagram in the embodiment of the present invention 2.
Attached drawing 2 is that boundary point label determines schematic diagram in the embodiment of the present invention 2.
Attached drawing 3 is rotating machinery simulation test experiment platform structure figure in the embodiment of the present invention 5.
Attached drawing 4 is the vibration signal time-domain snapshots figure in the embodiment of the present invention 5 under each operating condition.
Attached drawing 5 is the preceding bidimensional characteristic profile in the embodiment of the present invention 5 after PCA projects dimensionality reduction.
Attached drawing 6 is the preceding bidimensional characteristic profile in the embodiment of the present invention 5 after LPP projects dimensionality reduction.
Attached drawing 7 is the preceding bidimensional characteristic profile in the embodiment of the present invention 5 after FDA projects dimensionality reduction.
Attached drawing 8 be in the embodiment of the present invention 5 after the Fisher discriminant analysis dimensionality reduction of pseudo label semi-supervised kernel part before two
Dimensional feature distribution map.
Attached drawing 9 is PCA, LPP, FDA and inventive algorithm classification performance comparison diagram in the embodiment of the present invention 5.
Attached drawing 10 is temporal signatures algorithms of different classification performance comparison diagram in the embodiment of the present invention 6.
Attached drawing 11 is wavelet field feature algorithms of different classification performance comparison diagram in the embodiment of the present invention 6.
Attached drawing 12 is EMD characteristics of decomposition algorithms of different classification performance comparison diagram in the embodiment of the present invention 6.
Attached drawing 13 is temporal signatures and the combination algorithms of different classification performance comparison of small echo characteristic of field in the embodiment of the present invention 6
Figure.
Attached drawing 14 is temporal signatures and EMD characteristic of field combination algorithms of different classification performance comparison diagram in the embodiment of the present invention 6.
Attached drawing 15 is wavelet field feature and the combination algorithms of different classification performance comparison of EMD characteristic of field in the embodiment of the present invention 6
Figure.
Attached drawing 16 is the performance comparison figure of algorithms of different under different dimensionality reduction dimension variations in the embodiment of the present invention 6.
Attached drawing 17 is the nicety of grading comparison in the embodiment of the present invention 6 after different dimension reduction methods and different classifications algorithm combination
Figure.
Specific embodiment:
Embodiment 1:
Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, the local Fisher differentiates
Parser comprises the following specific steps that:
Enable xi∈RdI-th of sample vector is represented, corresponding class label is yi∈ { 1,2 ..., c }, c are classification number.It enables
Xn={ x1, x2..., xi..., xn}∈Rd×nRepresentative has exemplar data matrix, Xm={ x1, x2..., xi..., xm}∈Rd×m
Population sample data matrix is represented, wherein m indicates the quantity of training sample, and n is the quantity for having exemplar, m > n.Enable Xm=
{Xn, Xu, XuIt is the set of unmarked sample.Assuming that zi∈Rr(1≤r≤d) is by matrix T ∈ Rd×rConvert obtained low-dimensional
The projective representation of subspace: zi=TTxi。
Local Fisher Discrimination Analysis Algorithm (LFDA) can be stated with following optimization problem:
Here, Slb, Slw∈Rd×dRespectively indicate Scatter Matrix in local class scatter matrix drawn game category, definition difference
Are as follows:
Here Wlb, WlwIt is the matrix of n × n, and
It indicates in class yiThere are the quantity of exemplar, A in ∈ { 1,2 ..., c }ijIt is heuristic based on local scaleization
xiAnd xjBetween similarity measurement, AijIt is defined as
Parameter σiIndicate xiLocalization scale parameter, be defined as σi=| | xi-xi (k)||
Wherein, xi (k)It is xiKth neighbour, be usually arranged as 7, | | | | indicate Euclidean distance.
Above-mentioned optimization problem can be solved with following generalized eigenvalue problem:
Assuming that final generalized eigenvalue is ordered as λ by sequence of successively decreasing1≥λ2…≥λd
Wherein λi, i=1,2 ..., d are corresponding generalized eigenvectorsGeneralized eigenvalue, most
Eventually, TLFDAIt can indicate are as follows:
The matrix of LFDA indicates
In order to facilitate our algorithm of description, we furthermore present the expression matrix form of LFDA.SlbIt can use down
The pairs of form in face is expressed:
Equally, SlbS can also be expressed as with matrix formlb=XnLlbXn T
Wherein, Llb=Dlb-Wlb∈Rn×n, Dlb∈Rn×nIt is a diagonal matrix, its i-th of diagonal entry is
Likewise, SlwS can also be expressed as with matrix formlw=XnLlwXn T
Wherein, Llw=Dlw-Wlw∈Rn×n, Dlw∈Rn×nIt is a diagonal matrix, its i-th of diagonal entry is
Therefore, the objective function of LFDA can be further expressed as with matrix form
Embodiment 2:
Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, the pseudo label generation is made
Density peaks clustering algorithm comprises the following specific steps that:
Data-oriented collection Xm={ x1, x2 ..., xi..., xm}∈Rd×m, wherein xi∈RdI-th of sample vector is represented, for
Each sample point xiIts local density values ρ is quantitatively calculated firstiWith the distance δ of the higher sample point of Distance Densityi, it
Be defined as follows:
Here parameter dcIt need to be specified in advance for truncation distance, dijRepresent xiAnd xjEuclidean distance.
Further setIt indicatesThe lower sequence of descending arrangement, i.e. satisfaction:
Obviously, from the equations above it is not difficult to find that be locally or globally for maximum sample point for density value, they
δiIt can be than the δ of other sample pointsjIt is worth much bigger.Therefore, those δiThe very big sample point of value is probably cluster centre.
The local density values ρ of all sample points is obtainediWith the distance δ of the higher sample point of Distance DensityiAfterwards, with part
Density piFor horizontal axis, distance δiX-Y scheme is drawn for the longitudinal axis, is called decision diagram.Choose those with higher local density and
The point of relatively high distance is as cluster centre.After cluster centre has been determined, remaining each point is attributed to apart from it most
Close density is higher than cluster belonging to its point.If XmInclude ncA cluster,For the corresponding data point of each cluster centre
Number, i.e. mjData point is the cluster centre of j-th of cluster.For the cluster labels set of all data points, i.e.,
ciIndicate data set XmIn i-th of data point xiBelong to ciA cluster.Its initialization definitions are as follows:
For data set XmIn all local densities compare xthiIn the big data point of a data point with xiNearest number
Strong point number, is defined as follows:
ForSample point, cluster labels is defined as:
The determining strategy of cluster labels for ease of description, provides cluster labels schematic diagram here.(serial number as shown in Figure 1
For qi, arranged according to density size descending), it is assumed that sample point 1 and sample point 2 are determining cluster centre, respectively represent cluster 1
With cluster 2.The cluster labels of sample point 3 should be clustered according to belonging to the point for being higher than it away from nearest density and are consistent.No
Hardly possible discovery is exactly to put 1, therefore 3 cluster labels of sample point are exactly 1 apart from the point that nearest density is higher than it with sample point 3.Together
Reason, the cluster labels of sample point 4 should be consistent with the point 3 that is higher than it away from nearest density, as cluster 1.And so on, sample
The cluster labels of this point 5 are 2, and the cluster labels of sample point 6 are also 2.
After the cluster labels of all sample points determine, for erased noise point, algorithm is each cluster definition first
One borderline region is assigned to the cluster but is less than d at a distance from the point in other clusterscPoint set.It then is every
A cluster finds the highest point of density in its borderline region, and the noise of the cluster is screened using the density of the point as threshold values
Point only retains the point that density in cluster is greater than or equal to the threshold values.It enablesRepresent cluster core and cluster
Halo mark, the former corresponds to normal point, and the latter corresponds to boundary point, if hi=1 is expressed as boundary point, otherwise hi=0 indicates to be positive
Chang Dian.It enablesFor sample xiAffiliated cluster ciCorresponding density threshold, then
For ease of description, schematic diagram is equally provided here.As illustrated in fig. 2, it is assumed that belong to cluster 1 in sample point 7 with
The cluster of point 5 of other clusters 2 is less than dc, then determine that the sample point 8 of cluster 1 is noise spot as threshold value using the density for putting 7.
It should be noted that parameter d in algorithmcInfluence of the determination to cluster result it is very big, if dcIt is excessive to be easy to cause
The local density values of sample point are all approximately equal to be divided into same cluster, generate and owe cluster phenomenon.If dcIt is too small, often
It is a to cluster the sample point meeting for including seldom, it is more likely that the case where same cluster is divided into several parts occur, generated
Cluster phenomenon.Herein according to experience, d is setcTo make the average distance number of each data point be the 1%- of data point sum
2%.
Embodiment 3:
Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, the pseudo label semi-supervised kernel
Local Fisher discriminant analysis Method for Bearing Fault Diagnosis comprises the following specific steps that:
Using based on density peaks clustering algorithm to all sample set XmClustering is carried out, the cluster of sample point is obtained
Tally set platformAnd whether be boundary point logo collectionWhat needs to be explained here is that the number n of clusterc
Do not need identical as classification number, this can better adapt to multi-modal data distribution situation.According to above- mentioned information construction office
The poly- class scatter S in portionulbWith divergence S in Local ClusteringulwRegularization term embodies as follows:
Here Wulb, WulwIt is the matrix of m × m, and
It indicates in cluster ci∈ { 1,2 ..., ncIn sample quantity.
First by LlbAnd LlwIt is extended to m m matrix by zero-padding, is embodied as follows:
Furthermore by SulbAlso S is expressed as with matrix formulb=XmLulbXm T
Wherein, Lulb=Dulb-Wulb∈Rm×n, Dulb∈Rm×mIt is a diagonal matrix, its i-th of diagonal entry is
Likewise, SulwS can also be expressed as with matrix formulw=XmLulwXm T
Wherein, Lulw=Dulw-Wulw∈Rm×m, Dulw∈Rm×mIt is a diagonal matrix, its i-th of diagonal entry is
Then
Therefore, the corresponding generalized eigenvalue problem of semi-supervised part Fisher Discrimination Analysis Algorithm can further indicate that as
Under:Introduce the further construction feature equation K of nuclear theoryLsslbKa=λ
KLsslwKa, above formula can regard generalized eigenvalue λ as1≥λ2>=..., >=λmGeneralized eigenvector corresponding with itsGeneralized-grads Theory.K represents nuclear matrix, wherein Kij=κ (xi, xj), κ (xi, xj) it is Gaussian kernel letter
Number: κ (xi, xj)=exp (- | | xi-xj||2/σ2), σ represents core width, here σ=0.5;Therefore, final projection vector
Tss-KLFDA∈Rm×rIt is expressed asφ
(Xm) represent XmIn the projection vector of nuclear space.
For new samples x, the feature representation of reduced order subspace is as follows:
X → z=TT ss-KLFDAφ (x)=(a1, a2..., ar)Tφ(Xm)Tφ(x)
=(a1, a2..., ar)TK (:, x)
Enable Ta ss-KLFDA=(a1, a2..., ar), then above formula can be further expressed as
Here K (:, x)=[κ (x1, x), κ (x2, x) ..., κ (xm, x)]T。
Embodiment 4:
Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, the extreme learning machine includes
Following specific steps:
For a neural networks with single hidden layer, it is assumed that there are n to have label training sample, wherein (zj, tj), zj=[zj1,
zj2... zjr]T∈Rr, tj=[tj1, tj2... tjc]T∈Rc.According to yi∈ { 1,2 ..., c } settingOther are 0.This
Invention output layer number c=4 has the output of some output layer of the neural networks with single hidden layer of L hidden node can for one
To indicate are as follows:
Wherein, g (x) is activation primitive, PInput i=[Pi1, PI, 2..., PI, r] it is input weight, βicFor i-th of hidden layer list
The output weight of corresponding c-th of the output unit of member, biIt is the biasing of i-th of Hidden unit.PInput i·zjIndicate PInput iAnd zjIt is interior
Product.The target of neural networks with single hidden layer study is the error minimum so that output, can be expressed as
There is βi, PInput iAnd bi, so that:
H β=T can be expressed as with matrix.Wherein, H is the output of hidden node, and β is output weight, and T is desired output.
In order to training neural networks with single hidden layer, it is intended that obtainWithSo that
Wherein, i=1 ..., L, this is equivalent to minimize loss function:
Traditional algorithm based on gradient descent method can be used to solve the above problem, but the study based on gradient is calculated
Method needs adjust all parameters during iteration, and the training time is longer.And in ELM algorithm, once input weight PInput i
With the biasing b of hidden layeriIt is determined at random, the output matrix H of hidden layer is just now uniquely determined.Training neural networks with single hidden layer can turn
It turns to and solves a linear system H β=T.And exporting weight beta can be determined:Wherein, H+It is matrix H
Moore-Penrose generalized inverse.And the provable solution acquiredNorm be the smallest and unique.
Embodiment 5:
In order to verify the diagnosis performance based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, this
Text has carried out following experiments.Experimental data derives from rotating machinery simulation test experiment platform, and structure is as shown in Figure 3.Wherein
Chief component are as follows: driving motor, transmission gear, transmission bearing, fictitious load and piezoelectric acceleration vibrating sensor and
Acquisition terminal etc..Test middle (center) bearing model N205EM (outer diameter 52mm, internal diameter 25mm, rolling element diameter 7.5mm, number 12
It is a).Bearing revolving speed is 1450r/min, sample frequency 12kHZ.Experiment simulates four kinds of operating statuses of rolling bearing: 1 is normal
State;2 inner ring failures;3 outer ring failures;4 rolling element failures.The sample point number of each vibration signal segment is L=1024, often
Vibration signal segment under a operating condition is as shown in Figure 4.Experimental situation: Windows7 operating system, at CPU:Intel i7,3.4G
Manage device, simulation software Matlab2010b.In addition, 1000 normal samples, inner ring fault sample, outer ring are respectively adopted herein
Fault sample and rolling element fault sample are for statistical analysis, and DB4 small echo is used to carry out Decomposition order to vibration signal segment as 5
Wavelet transformation and extract 5 details coefficients and 1 approximation component energy spectrum and energy spectrum entropy[33]Total 6 × 2=12 dimension event
Hinder feature and 5 layers of empirical mode decomposition[34]And extract the energy spectrum and energy spectrum entropy of 5 Intrinsic mode functions and 1 remainder
Fault signature amounts to 6 × 2=12 dimensional feature, and 32 dimension fault signatures are amounted to after Fusion Features.
In order to verify the dimensionality reduction performance of semi-supervised kernel part Fisher Discrimination Analysis Algorithm, normal sample is taken in experiment, it is interior
It encloses fault sample, outer ring fault sample and each 50 samples of rolling element fault sample and constitutes overall data set progress dimensionality reduction,
In each classification have exemplar number be 20, non-exemplar number be 30.And it is unsupervised with PCA and LPP two
Dimension-reduction algorithm and FDA dimension-reduction algorithm are compared.In view of the dimension of traditional FDA dimension reduction space is limited to classification number, experiment
Dimensionality reduction dimension is set r=3 by middle unification.Two before after the projection vector dimensionality reduction that all samples obtain after various algorithm optimizations
Dimensional feature is respectively displayed in Fig. 5,6,7 and Fig. 8.The drop that semi-supervised kernel part Fisher distinguished number proposed by the present invention obtains
Dimension data has apparent distinction, and not only same class has exemplar to flock together, but also different classes of no label sample
This also achieves and efficiently separates.
Pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing is based in order to quantitatively more proposed by the present invention
The performance of fault diagnosis, we take normal sample, and inner ring fault sample, outer ring fault sample and rolling element fault sample are each
1000 samples constitute overall data set and are tested.500 sample groups wherein chosen in each classification sample set are combined into
Training sample set, remaining 500 sample groups are combined into test sample set.By this paper algorithm with other 7 dimension-reduction algorithms into
Row comparative analysis, parameter setting method are same as above.Compare for convenience, in experiment choose nearest neighbor classifier as base classifier simultaneously
Performance Evaluating Indexes are used as using correct classification rate (Correct classification rate CCR), are in experimentation
Elimination Random Effect, we randomly choose 200 as having exemplar, remaining 300 conduct to the sample of each classification
Unlabeled exemplars, count their average correct classification rate for each algorithm independent operating 30 times, and experimental result is as shown in Figure 9.From
The experimental result of Fig. 9, which can be seen that algorithm proposed by the present invention, can fully consider that the Local Clustering structure between unlabeled exemplars is believed
Breath, and LFDA algorithm dimensionality reduction is instructed by two standardization items of divergence in poly- class scatter and cluster, so that the feature after dimensionality reduction
Differentiation performance is stronger, is more advantageous to the classifier classification in later period, therefore obtained classification performance is optimal.
Embodiment 6:
It is proposed by the present invention based on pseudo label semi-supervised kernel part Fei Sheer under different characteristic combined situation in order to compare
12 dimensions statistics temporal signatures, 12 dimension Wavelet Energy Spectrums are respectively adopted in experiment for the classification performance of discriminant analysis bearing failure diagnosis
With Energy-Entropy feature, the totally 6 groups of progress events of 12 dimension empirical mode decomposition energy spectrums and Energy-Entropy feature and their combination of two
Hinder diagnostic test.As above-mentioned experiment, normal sample, inner ring fault sample, outer ring fault sample and rolling element failure are chosen
Each 500 samples composing training data acquisition system of sample, remaining 500 samples composition test sample collection are closed.It is same to use recently
Adjacent classifier is correct classification rate (CCR) as base classifier, evaluation index, and other parameter settings are same as above.Dimensionality reduction dimension is r=
3, to eliminate Random Effect, each algorithm independent operating 30 times, experiment, which randomly selects 200 training samples and is used as, every time label
Sample, remaining to be used as unlabeled exemplars, final statistical result is as shown in fig. 10-15.From experimental result it can be seen that this hair
Bright proposition based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis under different characteristic combined situation
Classification performance is superior to other dimension-reduction algorithms, this also indicates that the dimensionality reduction feature of algorithm proposed by the present invention can not only be farthest
It maintains and distinguishes information between the class of exemplar, while it is consistent also to have taken into account the Local Clustering structure of unlabeled exemplars to each other
Property, so that the coefficient vector after projection all has good separating capacity under different characteristic combination.
Pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis is based in order to more proposed by the present invention
Nicety of grading comparative situation under different dimensionality reduction dimensions is based on difference herein by changing from dimensionality reduction dimension r comes from 2 to 11
The nicety of grading of the bearing fault recognition methods of dimension-reduction algorithm, wherein data set still chooses 1000 normal samples, inner ring failure
Sample, outer ring fault sample and rolling element fault sample, wherein 500 sample composing training data acquisition systems in each classification, remain
Remaining 500 samples composition test sample collection is closed, and feature is the feature combination in three domains.This experiment is equally classified using arest neighbors
Device is correct classification rate (CCR) as base classifier, evaluation index, and other parameter settings are same as above.Due to FDA algorithm dimensionality reduction dimension
Classification number need to be less than, therefore this experiment only has chosen PCA, LPP, SKMFA, KSFDA algorithm and this paper algorithm carries out performance pair
Than analysis.To eliminate Random Effect, each algorithm independent operating 30 times, experiment randomly selects 200 training sample conducts every time
There is exemplar, it is remaining to be used as unlabeled exemplars, take average classification accuracy rate as evaluation performance indicator, other parameter settings
Ibid, experimental result is as shown in figure 16.The experimental results showed that proposed by the present invention be based on pseudo label semi-supervised kernel part Fei Sheer
Classification performance of the discriminant analysis bearing failure diagnosis under different dimensionality reduction dimensions is all substantially better than the axis based on other dimension-reduction algorithms
Hold the classification performance of method for diagnosing faults.The experimental result again demonstrates the spy after dimension-reduction algorithm dimensionality reduction proposed by the present invention
Sign has good separating capacity.
Finally, in order to verify the bearing fault identity after combining herein based on SS-KLFDA algorithm with different classifications device
Can, we use Various Classifiers on Regional algorithm and compare experiment, including support vector machines (one-all), SVM
(one-one), RBF nerve net (RBFNN), multi-layer perception (MLP) nerve net (MLP), extreme learning machine ELM.SVM algorithm parameter is adopted
With Gaussian kernel, penalty factor and core width are through 5 cross validations using trellis search method from C={ 2-2, 2-1, 1,21, 22, 23,
26, 28, 210And σ={ 0.1,0.5,0.7,1,1.2,1.5,2,2.5,3 } determine, the RBF nuclear parameter of RBF algorithm from σ=
{ 0.1,0.5,0.7,1,1.2,1.5,2,2.5,3 } it is determined using 5 cross-validation methods, the Hidden unit of RBF, MLP and ELM
Number is 30, and dimensionality reduction dimension is r=3, and other parameter settings are same as above, each algorithm independent operating 30 times and to calculate classification accuracy rate flat
Mean value, experimental result are as shown in figure 17.It can be found that algorithm of the invention is calculated with various classifiers by the experimental result of the figure
Performance of fault diagnosis after method combination is superior to other dimension-reduction algorithms, which further demonstrates that through proposed by the present invention
SS-KFDA algorithm can effectively utilize unlabeled exemplars and keep the supervision algorithm study of Local Clustering Structural Guidelines to make to obtain
Separating capacity between dimensionality reduction feature and class with higher substantially increases the diagnostic accuracy of the classifier of same a combination thereof.
Claims (8)
1. being based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis, it is characterised in that: this method includes
Following steps:
(1) axis is collected using the vibration acceleration sensor being mounted on the spring bearing upper end casing of induction conductivity output shaft
Hold the vibration signal under different work shapes, if altogether there are four types of work shape: normal condition, inner ring malfunction, outer ring malfunction and
Rolling element malfunction, then carrying out length to the signal under various operating conditions respectively is that 1024 segment processings obtain entire time domain letter
Number set Sm∈R1024×m, wherein m indicates the number of entire time-domain signal training sample set, Sn∈R1024×nTo there is label time domain
Sample of signal set, wherein n indicates the number n < < m of label time-domain signal training sample set;
(2) feature extraction is carried out to the training samples information that step (1) obtains, obtains that letter can be vibrated under the various operating conditions of effecting reaction
Number characteristic set, if sharing d feature, then entire training sample characteristic set X* m∈Rd×m;There is exemplar characteristic set X* n
∈Rd×n;
(3) the feature training sample set obtained to step (2) is standardized, and makes the numberical range of each characteristic index
Determine that in mean value be 0, in the standardized normal distribution section that variance is 1;Entire training sample characteristic set X after then normalizingm∈Rd ×m;There is exemplar characteristic set X after normalizationn∈Rd×n;
(4) the entire training sample characteristic set X after the normalization obtained to step (3)mIt is acquired using density peaks clustering algorithm
Cluster labels setAnd whether be boundary point logo collection
(5) the cluster labels set obtained using step (4)And logo collectionDivergence between construction Local Clustering
SulbWith divergence S in Local ClusteringulwRegularization term, and have exemplar X in the Fisher discriminant analysis of partnCorresponding
Local class scatter SlbDivergence S in drawn game categorylwIt optimizes together, determines final projection vector Tss-KLFDA∈Rm×r,
Wherein dimensionality reduction dimension is r < < d;
(6) projection vector T obtained in step (5) is utilizedss-KLFDASolve XmProjection vector set in r dimension reduced order subspace
Zm∈Rr×mAnd XnProjection vector set Zn∈Rr×n;
(7) projection vector set Z after the dimensionality reduction acquired in step (6) is utilizedn∈Rr×nTraining extreme learning machine model Melm;
(8) axis is collected by the vibration acceleration sensor being mounted on the spring bearing upper end casing of induction conductivity output shaft
The vibration signal held is 1024 segment processings according to the method for step (2) calculating feature vector through length
X is obtained after method normalization by step (3)new∈Rd×1, the projection vector T that is obtained using step (5)ss-KLFDASolve Xnew?
R ties up the projection vector set Z in reduced order subspacenew∈Rr×1It is then input to trained model MelmThe middle current bearing of determination
Final working condition.
2. according to claim 1 be based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis,
It is characterized in that, is to extract 12 statistical natures of vibration signal time-domain snapshots to method used by the feature extraction in step (2),
Including average value, root mean square, variance, standard deviation, rectified mean value, peak-to-peak value, kurtosis value, peak factor, waveform because
Son, the kurtosis factor, the pulse factor, the nargin factor carry out 5 layers of wavelet decomposition simultaneously to vibration signal time-domain snapshots using DB4 small echo
The energy spectrum and energy spectrum entropy totally 12 dimension fault signature of 5 details coefficients and 1 approximation component are extracted, and when to vibration signal
Domain segment carries out 5 layers of empirical mode decomposition and extracts the energy spectrum and energy spectrum entropy totally 12 of 5 Intrinsic mode functions and 1 remainder
Fault signature is tieed up, d=32 dimensional feature is amounted to after combination.
3. according to claim 1 be based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis,
It is characterized in that, method used by being standardized in step (3) to each index value is z-score method, specific to walk
It is rapid as follows: to setTo any indexIt is standardized institute
The method of use is as follows:
μ is recorded simultaneouslyi, σi, the standardization of i=1 ..., d in case of new samples.
4. according to claim 1 be based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis,
It is characterized in that, to acquisition cluster labels set in step (3)And whether be boundary point logo collectionInstitute
The method of use is utilized based on density peaks clustering algorithm to all sample set XmClustering is carried out, specific steps are such as
Under: data-oriented collection Xm={ x1, x2..., xi..., xm}∈Rd×m, wherein xi∈RdI-th of sample vector is represented, for each
A sample point xiIts local density values ρ is quantitatively calculated firstiWith the distance δ of the higher sample point of Distance Densityi, specific table
It reaches are as follows:
dijFor xiAnd xjEuclidean distance, dcFor distance is truncated, it is arranged so that the average distance number of each data point is data
The 2% of point sum, further setsIt indicatesThe lower sequence of descending arrangement, i.e. satisfaction:Then
The local density values ρ of all sample points is obtainediWith the distance δ of the higher sample point of Distance DensityiAfterwards, with local density
ρiFor horizontal axis, distance δiX-Y scheme is drawn for the longitudinal axis, is called decision diagram, chooses those with higher local density and opposite
Compared with rise from point as cluster centre, it is determined that after cluster centre, remaining each point is attributed to away from nearest
Density is higher than cluster belonging to its point, if XmInclude ncA cluster,For the volume of the corresponding data point of each cluster centre
Number, i.e. mjA sample is the cluster centre of j-th of cluster,For the cluster labels set of all data points, i.e. ciTable
Show data set XmIn i-th of data point xiBelong to ciA cluster, its initialization definitions are as follows:It enablesFor data set XmIn all local densities compare xthiNumber
In the big data point in strong point with xiNearest data point number, is specifically defined are as follows:
ForSample point, cluster labels is defined as:When the cluster labels of all sample points determine it
Afterwards, for erased noise point, algorithm is that each cluster defines a borderline region first, that is, is assigned to the cluster but gathers with other
The distance of point in class is less than dcPoint set, then for it is each cluster find the highest point of density in its borderline region, and
Using the density of the point as threshold valuesThe noise spot of the cluster is screened, i.e., only retains density in cluster and is greater than or equal to the valve
The point of value enablesNormal point and boundary point identification are represented, if hi=1 is expressed as boundary point, otherwise hi=0 indicates to be positive
Chang Dian,
5. according to claim 1 be based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis,
It is characterized in that, T in step (5)ss-KLFDACalculation method is as described below:
Divergence S between construction Local ClusteringulbS is expressed as with matrix formulb=XmLulbXm T, wherein Lulb=Dulb-Wulb∈Rm×m,
Dulb∈Rm×mIt is a diagonal matrix, its i-th of diagonal entry isLikewise, part is poly-
Divergence S in classulwS can also be expressed as with matrix formulw=XmLulwXm T, wherein Lulw=Dulw-Wulw∈Rm×m, Dulw∈Rm ×mIt is a diagonal matrix, its i-th of diagonal entry is
Here Wulb, WulwIt is the matrix of m × m, and
WhereinRepresentative belongs to cluster ciSample size,σi=| | xi-xi (k)| |, xi (k)
It is xiKth=7 neighbours, | | | | indicate Euclidean distance;
Construct local class scatter matrix SlbScatter Matrix S in drawn game categorylw, SlbS can also be expressed as with matrix formlb=
XnLlbXn T.Wherein, Llb=Dlb-Wlb∈Rn×n, Dlb∈Rn×nIt is a diagonal matrix, its i-th of diagonal entry isLikewise, SlwS can also be expressed as with matrix formlw=XnLlwXn T.Wherein, Llw=Dlw-Wlw∈
Rn×n, Dlw∈Rn×nIt is a diagonal matrix, its i-th of diagonal entry isHere Wlb, WlwIt is
The matrix of n × n, and
nyiIt indicates in class yiThere is the quantity of exemplar in ∈ { 1,2 ..., c }, c classification number is 4 here,
By LlbAnd LlwIt is extended to m m matrix by zero-padding, is embodied as follows:
Thenβ=0.5,
Construction feature equation KLsslbKa=λ KLsslwKa, above formula can regard generalized eigenvalue λ as1≥λ2>=..., >=λmWith its correspondence
Generalized eigenvectorGeneralized-grads Theory.K represents nuclear matrix, wherein Kij=κ (xi, xj), κ (xi,
xj) it is gaussian kernel function: κ (xi, xj)=exp (- | | xi-xj||2/σ2), σ represents core width, here σ=0.5;Therefore, final throwing
Shadow vector Tss-KLFDA∈Rm×rIt is expressed as
φ(Xm) represent XmIn the projection vector of nuclear space.
6. according to claim 1 be based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis,
It is characterized in that, the method for solving of projection vector of the step (6) in r dimension reduced order subspace is as follows: for new samples x, dimensionality reduction
The feature representation of subspace is as follows:
X → z=TT ss-KLFDAφ (x)=(a1, a2..., ar)Tφ(Xm)Tφ(x)
=(a1, a2..., ar)TK (:, x)
Enable Ta ss-KLFDA=(a1, a2..., ar), then above formula can further be expressed as x → z=TaT ss-KLFDAK (:, x), here K
(:, x)=[κ (x1, x), κ (x2, x) ..., κ (xm, x)]T。
7. according to claim 1 be based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis,
It is characterized in that, M in step (7)elmModel is trained by following method: determining hidden layer number L, the number of output layer first
As classification number c is set as 4, as 4 kinds of operating conditions, random initializtion input weight and bias matrix here, is set as input power
Weight PInput∈RL×r, hidden neuron biasing Binput∈RL×1, extend BinputFor B ∈ RL×(n), calculate hidden layer output matrix H ∈ RL ×(n):
Construct data category matrix of consequence T ∈ R(n)×c, tij=1 works as xijWhen ∈ j class, other are -1, acquire output weight matrix β
∈RL×c, β=H+T, H+For the Moore-Penrose generalized inverse matrix of H, extreme learning machine model after training: M is finally obtainedelm
={ PInput, Binput, β }.
8. according to claim 1 be based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis,
It is characterized in that, the feature vector, X of new samples in step (8)newDetermination using the feature extracting method in step (2), mark
That standardization processing method utilizes is the μ of step (3) storagei, σi, i=1,2 ... d carry out z-score standardization, utilize step (5)
Obtained Tss-KLFDACalculate XnewProjection vector Z in r dimension reduced order subspacenew∈Rr×1, the determination of end-state is by MelmMould
Type is determining,
Ttest=Htest T×β
Take max (Htest) corresponding to subscript be current bearing equipment working condition output.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811462243.1A CN109582003B (en) | 2018-12-03 | 2018-12-03 | Bearing fault diagnosis method based on pseudo label semi-supervised kernel local Fisher discriminant analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811462243.1A CN109582003B (en) | 2018-12-03 | 2018-12-03 | Bearing fault diagnosis method based on pseudo label semi-supervised kernel local Fisher discriminant analysis |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109582003A true CN109582003A (en) | 2019-04-05 |
CN109582003B CN109582003B (en) | 2021-04-20 |
Family
ID=65926477
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811462243.1A Expired - Fee Related CN109582003B (en) | 2018-12-03 | 2018-12-03 | Bearing fault diagnosis method based on pseudo label semi-supervised kernel local Fisher discriminant analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109582003B (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110146120A (en) * | 2019-05-19 | 2019-08-20 | 浙江大学 | A kind of sensor fault diagnosis method and system |
CN110243603A (en) * | 2019-05-30 | 2019-09-17 | 沈阳化工大学 | Based on Welch conversion-radial direction base nerve net Fault Diagnosis of Roller Bearings |
CN110378917A (en) * | 2019-07-16 | 2019-10-25 | 青岛达芬奇科技有限公司 | Tooth dividing method based on peak value cluster |
CN110503025A (en) * | 2019-08-19 | 2019-11-26 | 重庆大学 | A kind of analog circuit Incipient Fault Diagnosis method based on semi-supervised coorinated training |
CN110610484A (en) * | 2019-08-21 | 2019-12-24 | 西安理工大学 | Printing dot quality detection method based on rotary projection transformation |
CN110672324A (en) * | 2019-09-02 | 2020-01-10 | 佛山科学技术学院 | Bearing fault diagnosis method and device based on supervised LLE algorithm |
CN110751121A (en) * | 2019-10-28 | 2020-02-04 | 北京理工大学 | Unsupervised radar signal sorting method based on clustering and SOFM |
CN110823574A (en) * | 2019-09-30 | 2020-02-21 | 安徽富煌科技股份有限公司 | Fault diagnosis method based on semi-supervised learning deep countermeasure network |
CN110991566A (en) * | 2019-12-26 | 2020-04-10 | 东北石油大学 | Method and device for diagnosing fault of wind driven generator in information fusion mode |
CN111061257A (en) * | 2019-12-30 | 2020-04-24 | 杭州电子科技大学 | Industrial process monitoring method based on dynamic global LPP |
CN111523376A (en) * | 2020-03-09 | 2020-08-11 | 浙江工业大学 | Self-interference micro-ring resonant cavity sensing classification identification method based on unsupervised learning |
CN111541505A (en) * | 2020-04-03 | 2020-08-14 | 武汉大学 | Time domain channel prediction method and system for OFDM wireless communication system |
CN111811818A (en) * | 2020-06-02 | 2020-10-23 | 桂林电子科技大学 | Rolling bearing fault diagnosis method based on AP clustering algorithm of specified clustering number |
CN112257862A (en) * | 2020-09-30 | 2021-01-22 | 重庆大学 | Semi-supervised identification method based on relational network marker sample expansion |
CN112459971A (en) * | 2020-11-30 | 2021-03-09 | 吉电(滁州)章广风力发电有限公司 | Abnormal vibration detection method for tower of wind generating set |
CN112541516A (en) * | 2019-12-17 | 2021-03-23 | 招商局重庆交通科研设计院有限公司 | Energy spectrum-based bridge static monitoring data principal component clustering method |
CN112925292A (en) * | 2021-01-24 | 2021-06-08 | 国网辽宁省电力有限公司电力科学研究院 | Generator set process monitoring and fault diagnosis method based on layered partitioning |
CN113657556A (en) * | 2021-09-23 | 2021-11-16 | 华北电力大学 | Gas turbine inlet guide vane system fault diagnosis method based on multivariate statistical analysis |
CN113741394A (en) * | 2021-09-06 | 2021-12-03 | 河海大学 | Industrial equipment fault diagnosis system based on semi-supervised incremental learning |
CN114648077A (en) * | 2022-05-18 | 2022-06-21 | 合肥高斯智能科技有限公司 | Method and device for multi-point industrial data defect detection |
CN114817856A (en) * | 2022-04-15 | 2022-07-29 | 重庆科技学院 | Beam-pumping unit fault diagnosis method based on structure information holding domain adaptive network |
CN115358353A (en) * | 2022-10-20 | 2022-11-18 | 众芯汉创(北京)科技有限公司 | Multi-source fusion transformer fault diagnosis method |
CN115374855A (en) * | 2022-08-23 | 2022-11-22 | 北京交通大学 | Method for grading railway emergency based on clustering |
CN116680550A (en) * | 2023-05-23 | 2023-09-01 | 南京航空航天大学 | Rolling bearing fault diagnosis method based on active learning under sample imbalance |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101667164B1 (en) * | 2015-10-02 | 2016-10-17 | 경북대학교 산학협력단 | Rotor speed-based bearing fault diagnosis method |
CN106843195A (en) * | 2017-01-25 | 2017-06-13 | 浙江大学 | Based on the Fault Classification that the integrated semi-supervised Fei Sheer of self adaptation differentiates |
CN107103125A (en) * | 2017-04-01 | 2017-08-29 | 宁波大学 | A kind of method for diagnosing faults based on two classification Fisher discriminant analyses |
CN107341504A (en) * | 2017-06-07 | 2017-11-10 | 同济大学 | A kind of Trouble Diagnostic Method of Machinery Equipment based on the popular study of time series data |
-
2018
- 2018-12-03 CN CN201811462243.1A patent/CN109582003B/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101667164B1 (en) * | 2015-10-02 | 2016-10-17 | 경북대학교 산학협력단 | Rotor speed-based bearing fault diagnosis method |
CN106843195A (en) * | 2017-01-25 | 2017-06-13 | 浙江大学 | Based on the Fault Classification that the integrated semi-supervised Fei Sheer of self adaptation differentiates |
CN107103125A (en) * | 2017-04-01 | 2017-08-29 | 宁波大学 | A kind of method for diagnosing faults based on two classification Fisher discriminant analyses |
CN107341504A (en) * | 2017-06-07 | 2017-11-10 | 同济大学 | A kind of Trouble Diagnostic Method of Machinery Equipment based on the popular study of time series data |
Non-Patent Citations (1)
Title |
---|
梁亮: "基于费舍尔判别分析法的故障诊断", 《计算机工程与设计》 * |
Cited By (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110146120B (en) * | 2019-05-19 | 2021-08-03 | 浙江大学 | Sensor fault diagnosis method and system |
CN110146120A (en) * | 2019-05-19 | 2019-08-20 | 浙江大学 | A kind of sensor fault diagnosis method and system |
CN110243603A (en) * | 2019-05-30 | 2019-09-17 | 沈阳化工大学 | Based on Welch conversion-radial direction base nerve net Fault Diagnosis of Roller Bearings |
CN110378917B (en) * | 2019-07-16 | 2023-04-07 | 青岛达芬奇科技有限公司 | Tooth segmentation method based on peak value clustering |
CN110378917A (en) * | 2019-07-16 | 2019-10-25 | 青岛达芬奇科技有限公司 | Tooth dividing method based on peak value cluster |
CN110503025A (en) * | 2019-08-19 | 2019-11-26 | 重庆大学 | A kind of analog circuit Incipient Fault Diagnosis method based on semi-supervised coorinated training |
CN110503025B (en) * | 2019-08-19 | 2023-04-18 | 重庆大学 | Analog circuit early fault diagnosis method based on semi-supervised cooperative training |
CN110610484A (en) * | 2019-08-21 | 2019-12-24 | 西安理工大学 | Printing dot quality detection method based on rotary projection transformation |
CN110672324A (en) * | 2019-09-02 | 2020-01-10 | 佛山科学技术学院 | Bearing fault diagnosis method and device based on supervised LLE algorithm |
CN110672324B (en) * | 2019-09-02 | 2021-03-26 | 佛山科学技术学院 | Bearing fault diagnosis method and device based on supervised LLE algorithm |
CN110823574A (en) * | 2019-09-30 | 2020-02-21 | 安徽富煌科技股份有限公司 | Fault diagnosis method based on semi-supervised learning deep countermeasure network |
CN110751121A (en) * | 2019-10-28 | 2020-02-04 | 北京理工大学 | Unsupervised radar signal sorting method based on clustering and SOFM |
CN112541516A (en) * | 2019-12-17 | 2021-03-23 | 招商局重庆交通科研设计院有限公司 | Energy spectrum-based bridge static monitoring data principal component clustering method |
CN112541516B (en) * | 2019-12-17 | 2022-08-16 | 重庆物康科技有限公司 | Energy spectrum-based bridge static monitoring data principal component clustering method |
CN110991566A (en) * | 2019-12-26 | 2020-04-10 | 东北石油大学 | Method and device for diagnosing fault of wind driven generator in information fusion mode |
CN110991566B (en) * | 2019-12-26 | 2022-09-27 | 东北石油大学 | Method and device for diagnosing fault of wind driven generator in information fusion mode |
CN111061257A (en) * | 2019-12-30 | 2020-04-24 | 杭州电子科技大学 | Industrial process monitoring method based on dynamic global LPP |
CN111061257B (en) * | 2019-12-30 | 2021-02-19 | 杭州电子科技大学 | Industrial process monitoring method based on dynamic global LPP |
CN111523376A (en) * | 2020-03-09 | 2020-08-11 | 浙江工业大学 | Self-interference micro-ring resonant cavity sensing classification identification method based on unsupervised learning |
CN111523376B (en) * | 2020-03-09 | 2023-07-28 | 浙江工业大学 | Self-interference micro-ring resonant cavity sensing classification identification method based on unsupervised learning |
CN111541505B (en) * | 2020-04-03 | 2021-04-27 | 武汉大学 | Time domain channel prediction method and system for OFDM wireless communication system |
CN111541505A (en) * | 2020-04-03 | 2020-08-14 | 武汉大学 | Time domain channel prediction method and system for OFDM wireless communication system |
CN111811818A (en) * | 2020-06-02 | 2020-10-23 | 桂林电子科技大学 | Rolling bearing fault diagnosis method based on AP clustering algorithm of specified clustering number |
CN111811818B (en) * | 2020-06-02 | 2022-02-01 | 桂林电子科技大学 | Rolling bearing fault diagnosis method based on AP clustering algorithm of specified clustering number |
CN112257862A (en) * | 2020-09-30 | 2021-01-22 | 重庆大学 | Semi-supervised identification method based on relational network marker sample expansion |
CN112257862B (en) * | 2020-09-30 | 2023-12-05 | 重庆大学 | Semi-supervised identification method based on relation network marking sample expansion |
CN112459971A (en) * | 2020-11-30 | 2021-03-09 | 吉电(滁州)章广风力发电有限公司 | Abnormal vibration detection method for tower of wind generating set |
CN112925292B (en) * | 2021-01-24 | 2024-05-14 | 国网辽宁省电力有限公司电力科学研究院 | Generator set process monitoring and fault diagnosis method based on layered and segmented |
CN112925292A (en) * | 2021-01-24 | 2021-06-08 | 国网辽宁省电力有限公司电力科学研究院 | Generator set process monitoring and fault diagnosis method based on layered partitioning |
CN113741394B (en) * | 2021-09-06 | 2023-08-15 | 河海大学 | Industrial equipment fault diagnosis system based on semi-supervised incremental learning |
CN113741394A (en) * | 2021-09-06 | 2021-12-03 | 河海大学 | Industrial equipment fault diagnosis system based on semi-supervised incremental learning |
CN113657556B (en) * | 2021-09-23 | 2023-12-26 | 华北电力大学 | Gas turbine inlet guide vane system fault diagnosis method based on multivariate statistical analysis |
CN113657556A (en) * | 2021-09-23 | 2021-11-16 | 华北电力大学 | Gas turbine inlet guide vane system fault diagnosis method based on multivariate statistical analysis |
CN114817856A (en) * | 2022-04-15 | 2022-07-29 | 重庆科技学院 | Beam-pumping unit fault diagnosis method based on structure information holding domain adaptive network |
CN114817856B (en) * | 2022-04-15 | 2024-05-28 | 重庆科技学院 | Beam-pumping unit fault diagnosis method based on structural information retention domain adaptation network |
CN114648077A (en) * | 2022-05-18 | 2022-06-21 | 合肥高斯智能科技有限公司 | Method and device for multi-point industrial data defect detection |
CN115374855A (en) * | 2022-08-23 | 2022-11-22 | 北京交通大学 | Method for grading railway emergency based on clustering |
CN115374855B (en) * | 2022-08-23 | 2024-05-14 | 北京交通大学 | Clustering-based method for grading railway emergency |
CN115358353A (en) * | 2022-10-20 | 2022-11-18 | 众芯汉创(北京)科技有限公司 | Multi-source fusion transformer fault diagnosis method |
CN116680550A (en) * | 2023-05-23 | 2023-09-01 | 南京航空航天大学 | Rolling bearing fault diagnosis method based on active learning under sample imbalance |
Also Published As
Publication number | Publication date |
---|---|
CN109582003B (en) | 2021-04-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109582003A (en) | Based on pseudo label semi-supervised kernel part Fei Sheer discriminant analysis bearing failure diagnosis | |
CN110132598B (en) | Fault noise diagnosis algorithm for rolling bearing of rotating equipment | |
Huang et al. | Rolling bearing fault diagnosis and performance degradation assessment under variable operation conditions based on nuisance attribute projection | |
CN106980822B (en) | A kind of rotary machinery fault diagnosis method based on selective ensemble study | |
CN112036301B (en) | Driving motor fault diagnosis model construction method based on intra-class feature transfer learning and multi-source information fusion | |
CN108709745B (en) | Rapid bearing fault identification method based on enhanced LPP algorithm and extreme learning machine | |
Zhong et al. | Unsupervised learning for expert-based software quality estimation. | |
CN108073158A (en) | Based on PCA and KNN density algorithm Wind turbines Method for Bearing Fault Diagnosis | |
Ranaee et al. | Application of the PSO–SVM model for recognition of control chart patterns | |
Wu et al. | A transformer-based approach for novel fault detection and fault classification/diagnosis in manufacturing: A rotary system application | |
Tao et al. | Bearing defect diagnosis based on semi-supervised kernel Local Fisher Discriminant Analysis using pseudo labels | |
Bai et al. | Research on feature selection for rotating machinery based on Supervision Kernel Entropy Component Analysis with Whale Optimization Algorithm | |
CN106202952A (en) | A kind of Parkinson disease diagnostic method based on machine learning | |
CN113255848A (en) | Water turbine cavitation sound signal identification method based on big data learning | |
Su et al. | A novel hybrid method based on KELM with SAPSO for fault diagnosis of rolling bearing under variable operating conditions | |
CN108647707B (en) | Probabilistic neural network creation method, failure diagnosis method and apparatus, and storage medium | |
CN110285976A (en) | Multi-dimensional time sequence information based on DBN drives Fault Diagnosis of Aeroengines method | |
Purohit et al. | Deep autoencoding GMM-based unsupervised anomaly detection in acoustic signals and its hyper-parameter optimization | |
CN110502989A (en) | A kind of small sample EO-1 hyperion face identification method and system | |
Chen et al. | Multiscale shared learning for fault diagnosis of rotating machinery in transportation infrastructures | |
CN105241665A (en) | Rolling bearing fault diagnosis method based on IRBFNN-AdaBoost classifier | |
CN102324007A (en) | Method for detecting abnormality based on data mining | |
Li et al. | Support cluster machine | |
CN111461565A (en) | Power supply side power generation performance evaluation method under power regulation | |
Ramakrishnan et al. | Epileptic eeg signal classification using multi-class convolutional neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20210420 Termination date: 20211203 |
|
CF01 | Termination of patent right due to non-payment of annual fee |