CN108052974B - Fault diagnosis method, system, equipment and storage medium - Google Patents

Fault diagnosis method, system, equipment and storage medium Download PDF

Info

Publication number
CN108052974B
CN108052974B CN201711320019.4A CN201711320019A CN108052974B CN 108052974 B CN108052974 B CN 108052974B CN 201711320019 A CN201711320019 A CN 201711320019A CN 108052974 B CN108052974 B CN 108052974B
Authority
CN
China
Prior art keywords
feature data
fault
distance
data
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711320019.4A
Other languages
Chinese (zh)
Other versions
CN108052974A (en
Inventor
张莉
薛杨涛
王邦军
凌兴宏
张召
李凡长
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou University
Original Assignee
Suzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou University filed Critical Suzhou University
Priority to CN201711320019.4A priority Critical patent/CN108052974B/en
Publication of CN108052974A publication Critical patent/CN108052974A/en
Application granted granted Critical
Publication of CN108052974B publication Critical patent/CN108052974B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation

Abstract

The application discloses a fault diagnosis method, a system, equipment and a storage medium, wherein the method comprises the following steps: respectively acquiring feature data sets corresponding to the initial feature set when the target equipment runs normally and faults to obtain training data comprising the normal feature data sets and the fault feature data sets; respectively calculating corresponding KL (karhunen-Loeve) distances between each feature data in the normal feature data set and corresponding feature data in the fault feature data set to obtain a KL distance set; performing cross validation on the training data by adopting a support vector machine classification; determining features related to fault operation from the initial feature set according to the verification result and the size of the KL distance in the KL distance set to obtain an optimal feature set; and when the data to be diagnosed of the target equipment is acquired, determining the characteristic data corresponding to the optimal characteristic set from the data to be diagnosed, and then carrying out corresponding fault diagnosis on the target equipment by using the characteristic data. The method and the device effectively improve the accuracy of the follow-up fault diagnosis result.

Description

Fault diagnosis method, system, equipment and storage medium
Technical Field
The present invention relates to the field of fault diagnosis technologies, and in particular, to a fault diagnosis method, system, device, and storage medium.
Background
At present, when people judge whether a piece of equipment breaks down, the mode of manual judgment is usually used for determining, so that on one hand, due to the influence of artificial subjective factors, the reliability of a fault diagnosis result is unstable, on the other hand, a large amount of labor cost needs to be consumed, and the diagnosis efficiency is low. In order to reduce labor cost and improve diagnosis efficiency, people gradually start to adopt a support vector machine to perform fault diagnosis on equipment, but the accuracy of a diagnosis result needs to be further improved.
In summary, how to improve the accuracy of the fault diagnosis result is a problem to be solved urgently at present.
Disclosure of Invention
In view of this, the present invention provides a fault diagnosis method, system, device and storage medium, which can improve the accuracy of the fault diagnosis result. The specific scheme is as follows:
in a first aspect, the present invention discloses a fault diagnosis method, including:
respectively acquiring feature data sets corresponding to the initial feature set when the target equipment runs normally and faults to obtain training data comprising the normal feature data sets and the fault feature data sets; wherein the initial set of features comprises a plurality of features;
respectively calculating corresponding KL (karhunen-Loeve) distances between each feature data in the normal feature data set and corresponding feature data in the fault feature data set to obtain a KL distance set;
performing cross validation on the training data by adopting a support vector machine to obtain a validation result;
determining features related to fault operation from the initial feature set according to the verification result and the size of the KL distance in the KL distance set to obtain an optimal feature set;
and when the data to be diagnosed of the target equipment is acquired, determining the characteristic data corresponding to the optimal characteristic set from the data to be diagnosed, and then performing corresponding fault diagnosis on the target equipment by using the characteristic data.
Optionally, the step of respectively calculating a KL distance corresponding to each feature data in the normal feature data set and a corresponding feature data in the fault feature data set includes:
respectively determining Gaussian distribution corresponding to each feature data in the normal feature data set and the fault feature data set;
and respectively calculating KL (KL) distances between the Gaussian distribution corresponding to each feature data in the normal feature data set and the Gaussian distribution corresponding to the corresponding feature data in the fault feature data set.
Optionally, the step of respectively determining a gaussian distribution corresponding to each feature data in the normal feature data set and the fault feature data set includes:
respectively carrying out standardization processing on each feature data in the normal feature data set and the fault feature data set, and then respectively determining the Gaussian distribution corresponding to each standardized feature data in the normal feature data set and the fault feature data set.
Optionally, the step of performing cross validation on the training data by using support vector machine classification includes:
and performing ten-fold cross validation on the training data by adopting support vector machine classification.
Optionally, the step of determining, according to the verification result and the size of the KL distance in the KL distance set, the feature related to the fault operation from the initial feature set to obtain an optimal feature set includes:
intensively screening KL distances meeting preset conditions from the KL distances to obtain a target KL distance;
screening the characteristics corresponding to the target KL distance from the initial characteristic set to obtain target characteristics;
and determining the characteristics related to the fault operation from the target characteristics according to the verification result to obtain an optimal characteristic set.
Optionally, the step of screening KL distances satisfying a preset condition from the KL distance set includes:
sorting the KL distance sets in a descending order to obtain sorted distance sets;
and screening out KL distances in a preset number from the sorted distance set.
Optionally, the step of screening KL distances satisfying a preset condition from the KL distance set includes:
and screening out the KL distance with the KL distance larger than a preset threshold value from the KL distance set.
In a second aspect, the present invention discloses a fault diagnosis system, comprising:
the characteristic data acquisition module is used for respectively acquiring characteristic data sets corresponding to the initial characteristic set when the target equipment runs in a normal mode and a fault mode to obtain training data comprising the normal characteristic data sets and the fault characteristic data sets; wherein the initial set of features comprises a plurality of features;
a KL distance calculating module, configured to calculate a corresponding KL distance between each feature data in the normal feature data set and a corresponding feature data in the fault feature data set, respectively, to obtain a KL distance set;
the cross validation module is used for performing cross validation on the training data by adopting a support vector machine to obtain a validation result;
the characteristic determining module is used for determining characteristics related to fault operation from the initial characteristic set according to the verification result and the size of the KL distance in the KL distance set to obtain an optimal characteristic set;
and the fault diagnosis module is used for determining the characteristic data corresponding to the optimal characteristic set from the data to be diagnosed when the data to be diagnosed of the target equipment is obtained, and then performing corresponding fault diagnosis on the target equipment by using the characteristic data.
In a third aspect, the invention discloses a fault diagnosis device, comprising a processor and a memory; wherein the processor implements the fault diagnosis method disclosed above when executing the fault diagnosis program stored in the memory.
In a fourth aspect, the present invention discloses a computer-readable storage medium for storing a fault diagnosis program; wherein the fault diagnosis program implements the fault diagnosis method disclosed above when executed by a processor.
It can be seen that, after the training data comprising a normal feature data set and a fault feature data set is obtained, the corresponding KL distance between each feature data in the normal feature data set and the corresponding feature data in the fault feature data set is calculated, cross validation is performed on the training data by using a support vector machine in a classified manner, then the features related to fault operation are determined from the initial feature set according to the validation result and the calculated KL distance, after the data to be diagnosed is obtained, the feature data corresponding to the features related to fault operation can be determined from the data to be diagnosed, and as the feature data can reflect whether the equipment is in a fault operation state, the equipment can be subsequently diagnosed by using the feature data. Therefore, before fault diagnosis is carried out on the equipment, the characteristics related to fault operation are determined based on the KL distance, and the characteristics can reflect the fault operation characteristics, so that the accuracy of subsequent fault diagnosis results is effectively improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.
FIG. 1 is a flow chart of a fault diagnosis method disclosed in the embodiments of the present invention;
FIG. 2a is a graph illustrating the results of monitoring "fault 21" using an SVM;
FIG. 2b is a diagram illustrating the monitoring result of "failure 21" when using KL-FS-SVM;
fig. 3 is a schematic structural diagram of a fault diagnosis system according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention discloses a fault diagnosis method, which is shown in figure 1 and comprises the following steps:
step S11: respectively acquiring feature data sets corresponding to the initial feature set when the target equipment runs normally and faults to obtain training data comprising the normal feature data sets and the fault feature data sets; wherein the initial set of features comprises a plurality of features.
It should be noted that, in this embodiment, when the feature data set corresponding to the initial feature set of the target device during fault operation is obtained, the feature data set corresponding to the initial feature set of the target device during operation under a type of fault may be obtained, so that a corresponding fault feature data set is finally obtained; of course, the embodiment may also acquire the feature data sets corresponding to the initial feature set when the target device operates under K types of faults, so that corresponding K types of fault feature data sets are finally obtained. Wherein the value of K is an integer greater than 1.
Step S12: and respectively calculating corresponding KL (Kullback-Leibler divergence) distances between each feature data in the normal feature data set and the corresponding feature data in the fault feature data set to obtain a KL distance set.
In this embodiment, the step of calculating the KL distance corresponding to each feature data in the normal feature data set and the corresponding feature data in the fault feature data set respectively may specifically include:
respectively determining Gaussian distribution corresponding to each feature data in the normal feature data set and the fault feature data set; and respectively calculating KL (KL) distances between the Gaussian distribution corresponding to each feature data in the normal feature data set and the Gaussian distribution corresponding to the corresponding feature data in the fault feature data set.
Considering that different dimensions and orders of magnitude may exist between different initially acquired feature data, for this reason, in this embodiment, when determining the gaussian distribution corresponding to the feature data, normalization processing needs to be performed on the feature data first. That is, the step of determining the gaussian distribution corresponding to each feature data in the normal feature data set and the fault feature data set respectively may specifically include:
respectively carrying out standardization processing on each feature data in the normal feature data set and the fault feature data set, and then respectively determining the Gaussian distribution corresponding to each standardized feature data in the normal feature data set and the fault feature data set.
It is understood that, for each type of fault, the present embodiment will perform the processing procedures disclosed in the above step S12 and the following steps S13, S14 and S15 for the corresponding fault feature data set.
Step S13: and performing cross validation on the training data by adopting a support vector machine to obtain a validation result.
The step of performing cross validation on the training data by using support vector machine classification may specifically include: and performing ten-fold cross validation on the training data by adopting support vector machine classification.
Step S14: and determining the characteristics related to the fault operation from the initial characteristic set according to the verification result and the size of the KL distance in the KL distance set to obtain an optimal characteristic set.
In this embodiment, the step of determining, according to the verification result and the size of the KL distance in the KL distance set, the feature related to the faulty operation from the initial feature set to obtain the optimal feature set may specifically include:
intensively screening KL distances meeting preset conditions from the KL distances to obtain a target KL distance; screening the characteristics corresponding to the target KL distance from the initial characteristic set to obtain target characteristics; and determining the characteristics related to the fault operation from the target characteristics according to the verification result to obtain an optimal characteristic set.
In a specific embodiment, the step of screening KL distances satisfying a preset condition from the KL distance set may specifically include:
sorting the KL distance sets in a descending order to obtain sorted distance sets; and screening out KL distances in a preset number from the sorted distance set.
It is understood that the preset number can be specifically set according to actual needs, and is not limited herein.
In another specific embodiment, the step of screening KL distances satisfying the preset condition from the KL distance set may specifically include:
and screening out the KL distance with the KL distance larger than a preset threshold value from the KL distance set.
It is understood that the preset threshold may be specifically set according to actual needs, and is not limited herein.
Step S15: and when the data to be diagnosed of the target equipment is acquired, determining the characteristic data corresponding to the optimal characteristic set from the data to be diagnosed, and then performing corresponding fault diagnosis on the target equipment by using the characteristic data.
In this embodiment, after the feature data corresponding to the optimal feature set is determined from the data to be diagnosed, corresponding fault diagnosis may be performed on the target device by using the feature data and combining a Support Vector Machine (SVM).
It can be seen that, in the embodiments of the present invention, after the training data including the normal feature data set and the fault feature data set is acquired, the KL distance corresponding to each feature data in the normal feature data set and the corresponding feature data in the fault feature data set is calculated, and cross validation is performed on the training data by using a support vector machine, and then the features related to fault operation are determined from the initial feature set according to the validation result and the calculated KL distance. Therefore, before fault diagnosis is carried out on the equipment, the method and the device for fault diagnosis determine the characteristics related to fault operation based on the KL distance, and the characteristics can reflect the characteristics of fault operation, so that the accuracy of subsequent fault diagnosis results is effectively improved.
Based on the foregoing embodiments, the embodiments of the present invention perform corresponding tests on a tennessman Process (TEP) data set. The TEP data set comprises a feature data set corresponding to normal operation and a feature data set corresponding to 21 different faults. For each fault, the training set has 480 fault feature data, the test set comprises 960 observation data, each observation data comprises 52 variables, the data of the test set starts with normal data, the 161 th sampling fails, all data are sampled every 3 minutes, and all data are generated by the TEP simulation software. In this embodiment, 500 training data in the normal feature data set and 480 training data of one fault are taken as input of the training set, and fault detection is performed on the test set of each fault. The specific implementation steps are as follows:
1) for collected normal characteristic data set in industrial process
Figure BDA0001504576560000071
And fault signature data set
Figure BDA0001504576560000072
N0And NkRespectively, the number of samples in the normal feature data set and the kth fault feature data set, m is the number of features, N is the number of fault categories, where N is1=500,NkWhen the value is 480, m is 52, n is 21, and then the characteristic data are respectively processed by standardization preprocessing. The normalized formula is:
Figure BDA0001504576560000073
wherein the content of the first and second substances,
Figure BDA0001504576560000074
is the mean value of the jth characteristic data of the normal characteristic data set,
Figure BDA0001504576560000075
is the standard deviation of the jth characteristic data of the normal characteristic data set.
2) Assuming that each feature data obeys different Gaussian distribution, respectively corresponding feature data of different classes of data
Figure BDA0001504576560000076
And estimating the generalized Gaussian distribution, wherein the distribution of each characteristic data can be approximated by a Gaussian density function:
Figure BDA0001504576560000077
wherein the content of the first and second substances,
Figure BDA0001504576560000078
different parameters
Figure BDA0001504576560000079
And
Figure BDA00015045765600000710
representing different gaussian density functions, i.e. the gaussian distributions are also different. Therefore, the invention provides that the Gaussian distribution of the jth characteristic data of the kth class is composed of parameters
Figure BDA00015045765600000711
And
Figure BDA00015045765600000712
it is decided that,
Figure BDA00015045765600000713
and
Figure BDA00015045765600000714
it is obtained by maximum likelihood estimation of the gaussian density function. The maximum likelihood estimation method calculates as follows:
Figure BDA00015045765600000715
Figure BDA00015045765600000716
Figure BDA00015045765600000717
where Ψ (z) ═ Φ' (z)/Φ (z), is first determined by the above formula
Figure BDA0001504576560000081
And
Figure BDA0001504576560000082
first, estimate initialization by using a moment method
Figure BDA0001504576560000083
Iteration is carried out by using Newton method
Figure BDA0001504576560000084
To obtain
Figure BDA0001504576560000085
f (β) is calculated as follows:
Figure BDA0001504576560000086
to obtain
Figure BDA0001504576560000087
After that, the air conditioner is started to work,
Figure BDA0001504576560000088
is calculated as follows:
Figure BDA0001504576560000089
to obtain
Figure BDA00015045765600000810
And
Figure BDA00015045765600000811
the gaussian distribution of the jth characteristic data of the kth class can be obtained.
3) The KL divergence measures the distance of two distributions, P and Q, and the formula is calculated:
Figure BDA00015045765600000812
therefore, substituting the Gaussian density function of the characteristic j into the KL distance to obtain the KL distance between the corresponding characteristic data of the characteristic j under the normal and kth fault conditions respectively:
Figure BDA00015045765600000813
4) for the same fault type k, KL distances corresponding to different characteristics j are expressed as
Figure BDA00015045765600000814
To pair
Figure BDA00015045765600000815
And sorting from large to small, and marking the feature set obtained by corresponding sorting as R.
5) And performing ten-fold cross validation on the training data by adopting the support vector machine classification, and taking the feature subset F with the best classification effect from the R, namely the optimal feature set.
After the optimal feature set is obtained, the present embodiment can be examined based on the test set. Specifically, the method comprises the following steps:
1) collecting test data of an industrial process in real time
Figure BDA00015045765600000816
(m is the number of features), here, 960 test samples, 52 feature m, and the test data is normalized, the corresponding normalization formula is:
Figure BDA00015045765600000817
2) and selecting the characteristics of the test data according to the obtained optimal characteristic set F to form input data, classifying by using a support vector machine, outputting a result, and judging whether the test sample has a fault or not and whether the test sample belongs to the fault.
According to the feature selection method based on the KL distance, provided by the embodiment of the invention, the normal feature data set and the fault feature data set of the TEP are used as training data, and the fault test data set of the TEP is tested. Experiments show that the combination of the feature selection method based on the KL distance and the support vector machine (namely KL-FS-SVM) provided by the embodiment of the invention can improve the fault diagnosis result of the traditional Support Vector Machine (SVM), and meanwhile, for complex process data, the feature selection method based on the KL distance is far superior to the traditional feature selection methods (such as Fscore and Relief). As shown in table one, the fault diagnosis results of different faults are found in this embodiment that the detection rates of the Fscore-SVM and the Relief-SVM in the "fault 20" are not as high as that of the SVM, which indicates that the features causing the fault are more and less different, and the detection rate obtained by selecting the first 47 features by the proposed feature selection method based on the KL distance is slightly higher than that of the SVM without feature selection, which indicates the effectiveness of the feature selection method proposed in the embodiment of the present invention. And in the KL-FS-SVM fault diagnosis model, the detection rates of the fault 11 and the fault 21 are greatly improved.
Watch 1
Type of failure SVM Fscore‐SVM Relief‐SVM KL‐FS‐SVM
Fault
11 83.50% 76.88% 84.75% 87.88%
Fault 20 80.63% 79.00% 78.88% 80.75%
Fault 21 12.88% 13.38% 42.13% 100%
Watch two
Type of failure Fscore Relief KL‐FS
Fault
11 43 15 2
Fault 20 27 29 47
Fault 21 33 35 1
In addition to the fault detection rate, referring to table two, comparing the feature numbers of the optimal feature set obtained by the Fscore, Relief and KL-FS feature selection methods, the number of the selected features of KL-FS is the smallest, and the diagnostic result of the SVM is improved to a great extent, especially the diagnostic performance of the "fault 21" is the most obvious, as shown in fig. 2a and fig. 2 b. Fig. 2a is a schematic diagram of a monitoring result of "fault 21" when the SVM is used, and fig. 2b is a schematic diagram of a monitoring result of "fault 21" when the KL-FS-SVM is used.
Correspondingly, the embodiment of the present invention further discloses a fault diagnosis system, as shown in fig. 3, the system includes:
the characteristic data acquisition module 11 is configured to acquire characteristic data sets corresponding to the initial characteristic set of the target device during normal operation and fault operation, respectively, to obtain training data including a normal characteristic data set and a fault characteristic data set; wherein the initial set of features comprises a plurality of features;
a KL distance calculating module 12, configured to calculate a corresponding KL distance between each feature data in the normal feature data set and a corresponding feature data in the fault feature data set, respectively, to obtain a KL distance set;
a cross validation module 13, configured to perform cross validation on the training data by using a support vector machine to obtain a validation result;
the characteristic determining module 14 is configured to determine characteristics related to fault operation from the initial characteristic set according to the verification result and the size of the KL distance in the KL distance set, so as to obtain an optimal characteristic set;
and the fault diagnosis module 15 is configured to determine, when data to be diagnosed of the target device is obtained, feature data corresponding to the optimal feature set from the data to be diagnosed, and then perform corresponding fault diagnosis on the target device by using the feature data.
For more specific working processes of the modules, reference may be made to corresponding contents disclosed in the foregoing embodiments, and details are not repeated here.
Furthermore, the invention also discloses a fault diagnosis device, which comprises a processor and a memory; wherein the processor implements the fault diagnosis method disclosed above when executing the fault diagnosis program stored in the memory. For more specific steps of the method, reference may be made to corresponding contents disclosed in the foregoing embodiments, and details are not repeated here.
Further, the present invention also discloses a computer readable storage medium for storing a fault diagnosis program; wherein the fault diagnosis program implements the fault diagnosis method disclosed above when executed by a processor. For more specific steps of the method, reference may be made to corresponding contents disclosed in the foregoing embodiments, and details are not repeated here.
The embodiments are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same or similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.
Those of skill would further appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative components and steps have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The fault diagnosis method, system, device and storage medium provided by the present invention are described in detail above, and the principle and the implementation of the present invention are explained in this document by applying specific examples, and the description of the above examples is only used to help understanding the method and the core idea of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (8)

1. A fault diagnosis method, comprising:
respectively acquiring feature data sets corresponding to the initial feature set when the target equipment runs normally and faults to obtain training data comprising the normal feature data sets and the fault feature data sets; wherein the initial set of features comprises a plurality of features;
respectively calculating corresponding KL (karhunen-Loeve) distances between each feature data in the normal feature data set and corresponding feature data in the fault feature data set to obtain a KL distance set;
performing cross validation on the training data by adopting a support vector machine to obtain a validation result;
determining features related to fault operation from the initial feature set according to the verification result and the size of the KL distance in the KL distance set to obtain an optimal feature set;
when the data to be diagnosed of the target equipment is obtained, determining feature data corresponding to the optimal feature set from the data to be diagnosed, and then performing corresponding fault diagnosis on the target equipment by using the feature data;
wherein the step of calculating the KL distance between each feature data in the normal feature data set and the corresponding feature data in the fault feature data set respectively includes:
respectively determining Gaussian distribution corresponding to each feature data in the normal feature data set and the fault feature data set;
respectively calculating KL (KL) distances between Gaussian distributions corresponding to each feature data in the normal feature data set and Gaussian distributions corresponding to corresponding feature data in the fault feature data set;
the step of determining the features related to the fault operation from the initial feature set according to the verification result and the size of the KL distance in the KL distance set to obtain an optimal feature set includes:
intensively screening KL distances meeting preset conditions from the KL distances to obtain a target KL distance;
screening the characteristics corresponding to the target KL distance from the initial characteristic set to obtain target characteristics;
and determining the characteristics related to the fault operation from the target characteristics according to the verification result to obtain an optimal characteristic set.
2. The method according to claim 1, wherein the step of determining the gaussian distribution corresponding to each of the normal characteristic data set and the fault characteristic data set respectively comprises:
respectively carrying out standardization processing on each feature data in the normal feature data set and the fault feature data set, and then respectively determining Gaussian distribution corresponding to each standardized feature data in the normal feature data set and the fault feature data set.
3. The method of fault diagnosis according to claim 1, wherein said step of cross-validating on said training data using support vector machine classification comprises:
and performing ten-fold cross validation on the training data by adopting support vector machine classification.
4. The fault diagnosis method according to claim 1, wherein the step of screening KL distances satisfying a preset condition from the KL distance set includes:
sorting the KL distance sets in a descending order to obtain sorted distance sets;
and screening out KL distances in a preset number from the sorted distance set.
5. The fault diagnosis method according to claim 1, wherein the step of screening KL distances satisfying a preset condition from the KL distance set includes:
and screening out the KL distance with the KL distance larger than a preset threshold value from the KL distance set.
6. A fault diagnosis system, comprising:
the characteristic data acquisition module is used for respectively acquiring characteristic data sets corresponding to the initial characteristic set when the target equipment runs in a normal mode and a fault mode to obtain training data comprising the normal characteristic data sets and the fault characteristic data sets; wherein the initial set of features comprises a plurality of features;
a KL distance calculating module, configured to calculate a corresponding KL distance between each feature data in the normal feature data set and a corresponding feature data in the fault feature data set, respectively, to obtain a KL distance set;
the cross validation module is used for performing cross validation on the training data by adopting a support vector machine to obtain a validation result;
the characteristic determining module is used for determining characteristics related to fault operation from the initial characteristic set according to the verification result and the size of the KL distance in the KL distance set to obtain an optimal characteristic set;
the fault diagnosis module is used for determining feature data corresponding to the optimal feature set from the data to be diagnosed when the data to be diagnosed of the target equipment is obtained, and then performing corresponding fault diagnosis on the target equipment by using the feature data;
the KL distance calculation module is further configured to determine a Gaussian distribution corresponding to each feature data in the normal feature data set and the fault feature data set respectively; respectively calculating KL (KL) distances between Gaussian distributions corresponding to each feature data in the normal feature data set and Gaussian distributions corresponding to corresponding feature data in the fault feature data set;
the characteristic determination module is further used for screening KL distances meeting preset conditions from the KL distance set to obtain a target KL distance; screening the characteristics corresponding to the target KL distance from the initial characteristic set to obtain target characteristics; and determining the characteristics related to the fault operation from the target characteristics according to the verification result to obtain an optimal characteristic set.
7. A fault diagnosis apparatus comprising a processor and a memory; wherein the processor implements the failure diagnosis method according to any one of claims 1 to 5 when executing the failure diagnosis program stored in the memory.
8. A computer-readable storage medium for storing a fault diagnosis program; wherein the fault diagnosing program implements the fault diagnosing method according to any one of claims 1 to 5 when executed by a processor.
CN201711320019.4A 2017-12-12 2017-12-12 Fault diagnosis method, system, equipment and storage medium Active CN108052974B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711320019.4A CN108052974B (en) 2017-12-12 2017-12-12 Fault diagnosis method, system, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711320019.4A CN108052974B (en) 2017-12-12 2017-12-12 Fault diagnosis method, system, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN108052974A CN108052974A (en) 2018-05-18
CN108052974B true CN108052974B (en) 2022-05-17

Family

ID=62124331

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711320019.4A Active CN108052974B (en) 2017-12-12 2017-12-12 Fault diagnosis method, system, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN108052974B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109902739A (en) * 2019-02-27 2019-06-18 厦门理工学院 A kind of mechanical equipment fault degree recognition methods, terminal device and storage medium
CN109976316B (en) * 2019-04-25 2021-06-18 山东科技大学 Fault-related variable selection method
CN110212519B (en) * 2019-05-23 2023-04-07 山西金晖隆开关股份有限公司 Data processing method for medium and low voltage distribution network
CN111078456B (en) * 2019-12-26 2023-05-16 新奥数能科技有限公司 Device fault diagnosis method, device, computer readable storage medium and electronic device
CN111275135B (en) * 2020-02-27 2023-09-12 苏州大学 Fault diagnosis method, device, equipment and medium
CN111707355A (en) * 2020-06-19 2020-09-25 浙江讯飞智能科技有限公司 Equipment operation state detection method, device, equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080203601A1 (en) * 2007-02-28 2008-08-28 Airbus Espana, S.L. Jig and method of manufacturing aircraft frames in a composite material
CN105389326A (en) * 2015-09-16 2016-03-09 中国科学院计算技术研究所 Image annotation method based on weak matching probability canonical correlation model
CN107067023A (en) * 2017-01-10 2017-08-18 苏州大学 A kind of method for diagnosing faults and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080203601A1 (en) * 2007-02-28 2008-08-28 Airbus Espana, S.L. Jig and method of manufacturing aircraft frames in a composite material
CN105389326A (en) * 2015-09-16 2016-03-09 中国科学院计算技术研究所 Image annotation method based on weak matching probability canonical correlation model
CN107067023A (en) * 2017-01-10 2017-08-18 苏州大学 A kind of method for diagnosing faults and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于概率分布测度的高炉故障检测研究;蔡晋辉等;《仪器仪表学报》;20140630;第1363-1369页 *

Also Published As

Publication number Publication date
CN108052974A (en) 2018-05-18

Similar Documents

Publication Publication Date Title
CN108052974B (en) Fault diagnosis method, system, equipment and storage medium
CN105512799B (en) Power system transient stability evaluation method based on mass online historical data
EP3105644B1 (en) Method of identifying anomalies
EP2342603B1 (en) Method and apparatus for creating state estimation models in machine condition monitoring
CN111177655B (en) Data processing method and device and electronic equipment
CN115688018B (en) Method for monitoring state and diagnosing faults of bearing under multiple working conditions
CN104216350A (en) System and method for analyzing sensed data
CN108921424B (en) Power data anomaly detection method, device, equipment and readable storage medium
EP2431877A2 (en) System and method for modeling conditional dependence for anomaly detection in machine condition monitoring
CN110717472B (en) Fault diagnosis method and system based on improved wavelet threshold denoising
CN107220500A (en) Performance degradation experiment Bayesian reliability appraisal procedure based on inverse Gaussian process
CN110221590B (en) Industrial process multi-fault diagnosis method based on discriminant analysis
CN116627116B (en) Process industry fault positioning method and system and electronic equipment
CN112034139A (en) Method and device for judging rock burst tendency grade and electronic equipment
CN111880957A (en) Program error positioning method based on random forest model
CN106682516A (en) Detection method, detection device and server of application programs
CN113361729B (en) Risk assessment method, device, terminal and storage medium based on maintenance plan
CN107798149B (en) Aircraft maintainability assessment method
US11132790B2 (en) Wafer map identification method and computer-readable recording medium
US20210089886A1 (en) Method for processing data based on neural networks trained by different methods and device applying method
CN113239075A (en) Construction data self-checking method and system
CN111881502A (en) Bridge state discrimination method based on fuzzy clustering analysis
CN112541017A (en) Industrial production process state monitoring method, device, equipment and storage medium
CN112860870A (en) Noise data identification method and equipment
CN111985826A (en) Visual quality grading method and system for multi-index industrial product

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant