CN105631596B - Equipment fault diagnosis method based on multi-dimensional piecewise fitting - Google Patents

Equipment fault diagnosis method based on multi-dimensional piecewise fitting Download PDF

Info

Publication number
CN105631596B
CN105631596B CN201511004841.0A CN201511004841A CN105631596B CN 105631596 B CN105631596 B CN 105631596B CN 201511004841 A CN201511004841 A CN 201511004841A CN 105631596 B CN105631596 B CN 105631596B
Authority
CN
China
Prior art keywords
fault
sample
data
time
abnormal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201511004841.0A
Other languages
Chinese (zh)
Other versions
CN105631596A (en
Inventor
丁书耕
张建辉
张华伟
安佰京
赵俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Luruan Digital Technology Co Ltd
Original Assignee
Shandong Luneng Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Luneng Software Technology Co Ltd filed Critical Shandong Luneng Software Technology Co Ltd
Priority to CN201511004841.0A priority Critical patent/CN105631596B/en
Publication of CN105631596A publication Critical patent/CN105631596A/en
Application granted granted Critical
Publication of CN105631596B publication Critical patent/CN105631596B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06313Resource planning in a project environment
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/80Management or planning
    • Y02P90/82Energy audits or management systems therefor

Landscapes

  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Engineering & Computer Science (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Development Economics (AREA)
  • Quality & Reliability (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Game Theory and Decision Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Educational Administration (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Testing And Monitoring For Control Systems (AREA)
  • Testing Of Devices, Machine Parts, Or Other Structures Thereof (AREA)

Abstract

A device fault diagnosis method based on multi-dimensional segmentation fitting sequentially comprises a fault diagnosis training step and a fault diagnosis operation step, the essence of data morphological characteristics is set, a multi-dimensional segmentation fitting algorithm and an optimized dynamic time warping algorithm are combined to realize the functions of mode expression and distance threshold extraction of modeling based on device fault data, and the problem that the similarity degree between fault data is difficult to efficiently and accurately depict in the current fault diagnosis technology is solved by the functions of device data fault type identification and reason diagnosis through the mode matching of the extracted morphological characteristics of found device abnormal data.

Description

Equipment fault diagnosis method based on multi-dimensional piecewise fitting
Technical Field
The invention relates to the technical field of fault diagnosis, in particular to an equipment fault diagnosis method based on multi-dimensional piecewise fitting.
Background
In recent years, with the continuous innovation and development of scientific technology and the rapid development of social industrialization, more and more industrial industrialization systems are loaded in large-scale industrial production occasions, and huge irreplaceable productivity is created in the application environment. Maintenance work for these devices is therefore particularly important and laborious. However, many implicit influencing factors may cause the system equipment to malfunction or even fail, so that countries and enterprises pay more and more attention to the status detection and fault diagnosis of the important system equipment.
Generally, these devices are increasingly sophisticated and coupled by interaction between different parts. The method of disassembling and disassembling the equipment is time-consuming and labor-consuming, the diagnosis effect is often not ideal, and the equipment performance is easy to be unstable. The method for reading the state of the important part of the equipment in real time by installing the monitoring sensor device to observe the running trend of the relevant parameters of the equipment and contrast the index threshold value is simple, intuitive and strong in reliability, and is widely popularized and used in various production occasions such as power plants, automobiles, satellites and the like at present. The efficiency of this approach is still limited by the ability and effort of plant maintenance personnel to work with complex unknown conditions that are prominent in the plant and are less experienced.
The above method is difficult to form a standard fault diagnosis system due to the dilemma of empirical knowledge expression, so a fault diagnosis method based on mathematical mining technology is gradually introduced into the fault diagnosis work of these large and complex devices. The mathematics mining technology integrates subject knowledge of modern cybernetics, computer science, artificial intelligence, signal processing, mode recognition, statistical mathematics and the like, and excavates information which is hidden in data and is beneficial to fault condition analysis by researching historical data and current real-time data of analysis equipment. Expert systems, association rules, neuron networks, Bayesian networks and the like are mathematical mining methods which are widely applied, and a set of research systems which are divided into fault analysis, fault modeling, fault detection, fault inference, fault decision and the like are gradually formed by the methods.
In a paper "vehicle fault diagnosis expert system with fusion example and rule inference" (journal of mechanical engineering, 2002, 7 th stage: 91-95), a brand-new mixed inference method with fusion examples and rules is provided in the paper, and a whole set of expert system for solving the problem that different diagnosis units are difficult to communicate information is established. It can be seen that the expert system can solve the fault diagnosis problem in the related field with the capability similar to the human expert level in the specific field, and the main characteristic is to form the own specific rule to analyze and solve the problem by relying on the existing human expert experience knowledge. The method is a characteristic of an expert system, but is limited by the lack of experience of human professional knowledge in unknown fields, the expert system often has a series of problems of low self-learning ability, low diagnosis success rate, difficulty in obtaining system knowledge and the like, and the problems still need to be further deeply researched and optimized by the method of the expert system.
In a paper "grid fault diagnosis based on association rule data mining technology" (protection and control of power system, 2009, 9 th period: 8-14), an article applies an association rule algorithm to grid fault diagnosis. And corresponding decision attributes and condition attributes are designated according to the fault characteristics, the establishment of an original decision table is completed, and meanwhile, an association rule algorithm is applied to carry out frequent item set mining and strong rule screening on decision table data, so that reasoning and diagnosis on fault information of various complex conditions are finally realized. However, the technology has some problems, for example, the common problems of the potential implicit failure mode such as poor discrimination, poor large-scale data processing capability, low efficiency of storing and updating the association rule, and the like, are still solved by the following related work.
In a paper "fault diagnosis technology research based on a BP neural network" (computer and modernization, 2009, volume 7), the article selects a general BP neural network to be applied to a fault diagnosis scene, and successfully applies the neural network to fault diagnosis according to the device level distribution characteristics under the condition of reasonably constructing a BP neural network model, so that the diagnosis efficiency and reliability are improved. The neural network has the greatest characteristic that the neural network can infinitely and nonlinearly approximate to an original data model under the condition of large sample quantity, but the inherent characteristics of overfitting, numerical randomness, unstable training and the like limit the application range of the neural network in the field of fault diagnosis, and other optimization algorithms need to be matched for diagnosis application.
In a thesis of Bayesian network fault diagnosis method for multi-feature information fusion (China mechanical engineering, 2010, 8 th: 940-945), an article takes a pump vibration signal as a research object, information fusion is carried out on multiple fault features such as a frequency domain and a time domain extracted by the signal by using a Bayesian parameter estimation method, then a complete fault classifier is established by constructing a Bayesian network, and fault types are identified by calculating a maximum posterior probability estimation value. The Bayesian network fault diagnosis method needs a priori statistical knowledge of a large amount of sample data, and meanwhile, the Bayesian network directed acyclic expression mode has a risk of error accumulation, and the method needs to be paid attention to the Bayesian network fault diagnosis method.
In view of the problems and risks of the common fault diagnosis technologies, the invention aims at fully mining the potential visual expression form of the data sample and measuring the difference effectiveness of the fault mode, and tries to better solve the problems of low efficiency and low recognition accuracy of fault diagnosis by establishing a new mode matching technology, thereby being beneficial to the continuous maintenance of the good state of the equipment.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, provides an equipment fault diagnosis method based on multi-dimensional piecewise fitting, and solves the problem that the similarity degree between fault data is difficult to describe efficiently and accurately in the current fault diagnosis technology.
The invention provides an equipment fault diagnosis method based on multi-dimensional piecewise fitting, which sequentially comprises the following steps:
step 1: the fault diagnosis training step specifically comprises the following steps:
(1.1) acquiring fault sample information from a database;
(1.2) sequentially carrying out piecewise linear fitting on each fault sample;
(1.3) extracting the characteristics of each section of data of the fault sample to obtain a characteristic matrix of the fault sample;
(1.4) carrying out fault characteristic conversion, eliminating characteristic dimension and obtaining a characteristic matrix of a fault sample with the dimension eliminated;
(1.5) calculating a mode distance threshold;
(1.6) associating the fault feature matrix and the pattern distance threshold value with each other, and storing to generate a fault pattern knowledge base;
step 2: the fault diagnosis operation steps specifically comprise:
(2.1) acquiring abnormal sample information from a real-time database;
(2.2) performing piecewise linear fitting on the current abnormal sample;
(2.3) carrying out abnormal feature extraction and transformation on the data subjected to segmentation fitting;
(2.4) sequentially calculating the pattern distance of the sample features in the fault pattern knowledge base by using the abnormal feature information;
(2.5) converting the mode distance into a mode similarity using a falling ridge pacing method;
and (2.6) outputting a final fault diagnosis result.
Further, the step (1.1) comprises the following specific steps: selecting a piece of research equipment with the number P of fault types being more than or equal to 2 and the occurrence frequency T of each fault being more than or equal to 2 to meet the requirements, selecting a sufficient number of observation points N, wherein N is more than or equal to 10, carrying out fault record searching on historical running state data of the equipment for a long enough time, picking out fault related measuring point information, starting and stopping time of a fault process and useful information of fault maintenance measure records from fault records by utilizing a set screening rule, and reading fault sample data from a power plant real-time database PI according to the useful information, wherein:
the fault sample data with n measuring points and m time points, all the measuring point data at the moment j are regarded as a n-dimensional column vector and are expressed as follows:
u(tj)=[uj1,uj2,uj3,...,ujn]
the sample data file is stored in a matrix form of m × n, and the specific form is as follows:
Figure BDA0000893564990000031
the row represents m fault time, the column represents n equipment observation points, the m and n values of the row and the column between each fault sample are different, each fault sample is endowed with a fault type identifier ID, and the fault type identifier ID determination method is that if all the samples contain X types of faults, the numerical range of the fault type identifier ID is as follows: 1-X.
Further, the step (1.2) comprises the following specific steps:
(1.2.1) mean filtering operation: filtering and eliminating noise pollution doped in sample data;
(1.2.2) carrying out fault sample segmentation initialization, namely carrying out segmentation initialization on the fault sample after filtering processing;
(1.2.3) combining every two initialized data segments to calculate a fitting error;
and (1.2.4) determining a segmentation cut point of the fault sample, and performing adaptive state segmentation on the fault sample.
Further, the step (1.3) comprises the following specific steps:
dividing the fault sample into
Figure BDA0000893564990000033
The x divided data segments from f1 *The data segment starts to carry out feature extraction, and the specific operations are as follows:
due to f1 *The data segment is
Figure BDA0000893564990000032
In the form of matrix, the row is time point number and is listed as measuring point number;
and (3) extracting features according to the dimension (column) of the measuring points, wherein the features comprise: slope k, duration l, mean m and variance v;
wherein,
Figure BDA0000893564990000041
the vector is linearly fitted according to the principle of least square method, and the fitting result is a linear equation p (x) a0i+a1ix, so slope characteristic k ═ a1iWherein a is0i、a1iIs a fitting constant;
duration characteristic l is
Figure BDA0000893564990000042
Vector length h, i.e., l ═ h;
mean characteristic m is
Figure BDA0000893564990000043
Average of all values of the vector, i.e.
Figure BDA0000893564990000044
The variance characteristic v is
Figure BDA0000893564990000045
Sum of fluctuation amplitudes of all values of the vector from the mean, i.e.
Figure BDA0000893564990000046
N is the total number of vectors;
segmenting data segments
Figure BDA0000893564990000047
Is converted into a feature matrix
Figure BDA0000893564990000048
The rest of
Figure BDA00008935649900000411
Extracting characteristics according to the operation method, converting the original time domain data matrix into a mathematical characteristic matrix, and finally dividing the data segment set
Figure BDA00008935649900000412
Conversion to:
Figure BDA0000893564990000049
further, the step (1.4) comprises the following specific steps: converting the slope k into an inclination angle alpha, converting the duration l into a time proportion, carrying out normalization processing on the mean value characteristic m, and carrying out characteristic matrix treatment on the fault sample
Figure BDA00008935649900000410
Converted into elimination dimensions
Figure BDA0000893564990000051
Further, the step (2.1) comprises the following specific steps:
according to the equipment state early warning system of the power plant, discovering that certain unknown abnormal state occurs in the equipment, and carrying out the following related operations:
(2.1.1) determining settings from the early warning systemStandby alarm generation time t1And a cut-off time t2
(2.1.2) determining from the early warning system the relevant observation point wp ═ x for the device alarm1,x2,...,xn'];
(2.1.3) according to the generation time t1And a cut-off time t2And a database sampling frequency fs, and obtaining the time point number m ═ fs × (t)2-t1) The number of device measurement points n equals length ([ x ])1,x2,...,xn']) Where length () is a calculated length function);
(2.1.4) acquiring abnormal sample data, namely the sample data with n 'of measuring points and m of time points, wherein all the measuring point data at the moment j are regarded as a column vector with n' dimension and are expressed as:
v(tj)=[vj1,vj2,vj3,...,vjn']
storing the sample data file in a matrix form of m multiplied by n', wherein the specific form is as follows:
Figure BDA0000893564990000052
where the rows represent m failure times and the columns represent n' device observation points.
Further, the step (2.2) comprises the following specific steps:
(2.2.1) mean filtering operation: filtering and eliminating noise pollution of each measuring point doped in abnormal sample data according to a mean filtering principle;
(2.2.2) carrying out segmentation initialization on the abnormal sample after filtering treatment;
(2.2.3) combining every two initialized data segments to calculate a fitting error;
and (2.2.4) determining an abnormal sample segmentation cut point based on a recursive combination mode.
Further, the step (2.5) comprises the following specific steps:
transforming pattern distances into pattern similarities according to the following formula based on the principle of the falling-ridge stepwise methodDegree of rotation
Figure BDA0000893564990000053
Figure BDA0000893564990000061
Wherein,
Figure BDA0000893564990000062
represents the pattern distance calculated by pattern recognition of the abnormal sample and the jth fault sample under the fault type i, and ThiRepresenting a mode distance threshold under fault type i;
according to the dimension of the fault type, each distance vector is divided into
Figure BDA0000893564990000063
Converted into similarity vectors
Figure BDA0000893564990000064
Finally, a similarity set of the abnormal sample FT and all samples of the fault knowledge base is obtained, and the similarity set is as follows:
Figure BDA0000893564990000065
further, the step (2.6) comprises the following specific steps:
and outputting a fault diagnosis result according to the similarity set result rho Sets and an agreed rule, wherein the rule is satisfied:
rule 1: if a plurality of fault samples with the similarity exceeding 90% with the abnormal samples exist, outputting the serial numbers of all the samples, simultaneously outputting the fault type to which the fault sample belongs, the confidence coefficient of the fault type to which the fault sample belongs, and the best matching position of the sample with the maximum similarity in the fault development stage, and if the confidence coefficient exceeds 50%, outputting suggested maintenance measures;
rule 2: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is greater than or equal to 60%, outputting the serial number of the maximum similarity sample, simultaneously outputting the fault type to which the fault sample belongs, the confidence coefficient of the fault type to which the fault sample belongs and the best matching position of the maximum similarity sample at the fault development stage, and if the confidence coefficient exceeds 50%, outputting the recommended maintenance measures;
rule 3: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is less than 60%, outputting the abnormal sample as an unusual working condition or an unknown fault type, and continuing to pay attention to the development of the abnormality.
The equipment fault diagnosis method of the invention can realize that:
1. the invention is a brand-new fault diagnosis technology independent of faults. The present invention starts from analyzing the mathematical morphological characteristics of historical fault sample data, and judges the similarity degree and the most possible development trend of the existing fault in time for the current abnormal sample.
2. The invention can efficiently and accurately carry out fault diagnosis work, realizes the data information compression target and eliminates the influence of redundant data noise by converting the sample data from the time domain space to the mathematical morphology characteristic space, thereby greatly improving the fault diagnosis efficiency.
3. In order to ensure the effectiveness of pattern expression, the invention establishes a multidimensional segmentation fitting method based on the data development state. In consideration of the characteristic that the coupling among all variables in the sample data is unclear, the optimal segmentation threshold of the current sample data is mined by adopting a self-adaptive algorithm, and the segmentation positions of all the variables are consistent, so that the development state of the segmented data in each stage is more hierarchical.
4. The invention has low requirement on sample data for fault diagnosis and does not need to carry out complex data preprocessing work. The prior related technology generally cannot realize fault diagnosis of sample data with inconsistent sample time and different variable numbers in a knowledge base, and the method can solve the problem by methods of variable trimming, dynamic time warping and the like.
5. The operation speed of the fault analysis and fault identification part is within the second level, so that not only can the fault diagnosis of a sample of a complete fault mode be realized, but also the fault location of a part of fault modes can be realized. Therefore, the invention can be completely transplanted to an equipment on-line state detection system for application.
Drawings
FIG. 1 is a comprehensive flow chart of the training phase and the operation phase of the fault diagnosis model
FIG. 2 dynamic time warping distance effect plot
FIG. 3 is a trend graph of key measuring points of three faults of a front pump of a steam pump
FIG. 4 is a diagram illustrating the effect of online fault diagnosis of abnormal samples
Detailed Description
Reference will now be made in detail to the embodiments of the present invention, the following examples of which are intended to be illustrative only and are not to be construed as limiting the scope of the invention.
Most of the prior related technologies are that each variable dimension of fault data is separately cut apart for calculation, but the invention comprehensively considers the problems of related information between different variables of sample data, realizes a segmentation fitting technology based on bottom-up, extracts various overall morphological characteristics of the fault data and simultaneously considers the numerical characteristics of the fault data. The method can realize effective compression of large-scale data, reduce the fault diagnosis time and fundamentally improve the fault diagnosis precision, so the method has the capability of quick fault diagnosis. The invention carries out the calculation of the characteristic difference of the fault samples by a dynamic time warping method, and can solve the problem that the similarity of the fault samples with different lengths is difficult to measure. The fault diagnosis method of the invention does not take the mode distance as the final measurement standard, but compares the mode distance with the similarity percentage, unifies the measurement standard of fault identification, and further improves the fault diagnosis precision again.
The invention provides an equipment fault diagnosis method based on multi-dimensional segment fitting, which is a brand-new fault diagnosis technical method for setting out the essence of data morphological features. The method mainly comprises two processes of establishing a model and operating the model.
Fig. 1 (left) is a flow chart of modeling of the present invention, and the whole modeling process mainly includes the following steps:
step 1: and acquiring fault sample information from a system database.
The obtained fault sample information comes from a device state detection system of a power plant, and the rough operation process is as follows: firstly, selecting a piece of research-available equipment with the number P (P > -2) of fault types and the occurrence frequency T (T > -2) of each fault meeting the requirement, selecting an observation point N (N > -10) with the number of feet, and searching the fault record of the historical operation state data of the equipment for a long enough time; then, useful information such as fault related measuring point information, starting and stopping time of a fault process, fault maintenance measure records and the like are picked out from the fault records by using a set screening rule; and finally, reading fault sample data from the power plant real-time database PI according to the information.
For example, for a fault sample data with n measurement points and m time points, all measurement point data at time j can be regarded as an n-dimensional column vector, which is expressed as:
u(tj)=[uj1,uj2,uj3,...,ujn]
the sample data file is stored in a matrix form of m × n, and the specific form is as follows:
Figure BDA0000893564990000081
according to the method, the storage form of all fault samples is as follows: the dimension of row represents m failure times, the dimension of column represents n device observation points, and the two values of the dimension (m, n) of the row and the column between each failure sample may not be the same. While each fault sample is given its fault type identification ID. The method for determining the fault type identifier ID comprises the following steps: if all samples contain X faults, the numerical range of the fault type identifier ID is as follows: 1 to X.
Step 2: sequentially carrying out multi-dimensional piecewise linear fitting on each fault sample
The step is a key technical part of the fault diagnosis invention, and is mainly used for carrying out self-adaptive state segmentation according to the overall development trend of a fault sample and carrying out data preprocessing preparation for subsequent fault feature extraction.
Step 2.1 mean filtering operation:
in order to effectively segment fault sample data, noise pollution doped in the sample data is filtered and eliminated. To quickly and efficiently implement the filtering of noisy data, a typical linear filtering algorithm is chosen: and (5) average filtering.
According to the principle of mean filtering, the specific filtering operation process is as follows: firstly, extracting the first observation point column vector data u of the fault sample data F-1The form is as follows:
Figure BDA0000893564990000091
will column vector u-1Filter template size viewed as a sequence of numbers
Figure BDA0000893564990000092
(round is a rounding function) if L is an even number, then 1 more is needed. After the size of the filtering template is determined, the substantive filtering operation is started:
1. the center position of the filtering template is placed in u11Where the sum of all data within the template is divided by L, the value is u11The output value of the filtering operation of (2) is:
Figure BDA0000893564990000093
2. the center position of the filtering template is moved forward continuously, and filtering output is carried out according to the same operation until all data are completely filtered, for example, the filtering operation is carried out at u1iThe output of the position, mean filtering is:
Figure BDA0000893564990000094
3. finally obtained column vector data u-1The filtered data of (2) is also a column of vectors, and the specific form is as follows:
Figure BDA0000893564990000095
all measuring points of fault sample data are sequentially subjected to filtering operation according to the 3 steps, so that the fault sample data F is converted into F*In the form of:
Figure BDA0000893564990000101
step 2.2 Fault sample segmentation initialization
Carrying out segmentation initialization on the filtered fault sample F, dividing each 2 pieces of data into 1 data segment, and dividing the fault sample into 1 data segment because the length of the fault sample is m
Figure BDA0000893564990000102
The most detailed data segment; if m is odd, the last segment is composed of 3 pieces of data, and the final segment is divided into
Figure BDA0000893564990000103
The most detailed data segment. The initialization effect of the segmentation is as follows:
Figure BDA0000893564990000104
wherein,
Figure BDA0000893564990000105
step 2.3 initialization data segment pairwise combination calculation fitting error
The main purpose of this step is to generate the above step after initialization segmentation
Figure BDA0000893564990000106
(or
Figure BDA0000893564990000107
) And combining each data segment with the data segment adjacent to the right of the data segment, and simultaneously calculating the generated fitting error.
The detailed merge operation is as follows: such as merge fi *And
Figure BDA0000893564990000108
comprises the following steps:
Figure BDA0000893564990000109
in the form of a matrix, e.g. 4 xn, with columns being the number of test points. Data for each column
Figure BDA00008935649900001010
And (5) fitting a straight line by a least square method. The least square method comprises the following specific steps:
the fitting objective was to fit the data
Figure BDA0000893564990000111
Using a first order polynomial p (x) ═ a0+a1x is expressed such that the cumulative sum of fitting errors I is (a)0+a1x-yi)2And minimum.
This is apparently a solution of I ═ I (a)0,a1) And (4) quadratic programming of extreme values. According to the Lagrange principle to obtain
Figure BDA0000893564990000112
Namely:
Figure BDA0000893564990000113
wherein m is 4n is 1
The above formula is a0,a1The linear equation system is expressed by matrix as
Figure BDA0000893564990000114
The above formula becomes a normal equation system, and the coefficient matrix is a symmetric positive definite matrix, so that a unique solution exists. Solving for a from the above formulak(k is 0,1), thus obtaining a polynomial:
p(x)=a0+a1x
the mean error due to the fit is called the fit error, which is recorded as:
Figure BDA0000893564990000115
after all measuring points are fitted, the fitting errors generated are summed
Figure BDA0000893564990000116
(N is the number of stations).
When all are
Figure BDA0000893564990000117
(or
Figure BDA0000893564990000118
) After the data segments are combined pairwise, three results are generated, namely, a data segment set is divided
Figure BDA0000893564990000119
Second, fitting error vector
Figure BDA00008935649900001110
Third, the sample integral segmentation position vector
Figure BDA00008935649900001111
Step 2.4 determining the segmentation cut points of the fault samples
The main purpose of the step is to realize the determination of the fault sample segmentation position in a recursive combination mode.
1. According to the invention, a numerical value MAX _ ERROR is set according to experience as an index for judging the fitting ERROR. If the error vector is fitted
Figure BDA0000893564990000121
If all values in the set are greater than the maximum fitting ERROR threshold MAX _ ERROR, then operation 1 is performed: confirmation
Figure BDA0000893564990000122
Is the final set of sample segmentation locations; otherwise, operation 2 is performed: combining fitting error vectors
Figure BDA0000893564990000123
The middle minimum value position J and the segmentation data segment of the right adjacent position J +1 thereof, and updating the segmentation data segment set F*Fitting error vector
Figure BDA0000893564990000124
And sample integral segmentation position vector
Figure BDA0000893564990000125
The specific updating operation is as follows: f*The length is reduced by 1, the front J-1 is kept unchanged, the value after J is shifted forward by 1 bit, and F is located at the position of JJ'=[FJ;FJ+1](ii) a According to updated F*Determining a new segmentation position vector
Figure BDA0000893564990000126
Figure BDA0000893564990000127
The length is reduced by 1, the front J-1 is kept unchanged, the value behind J is shifted forward by 1 bit, and the position of J is
Figure BDA0000893564990000128
merge is a fitting error function generated and designed according to a least square method;
2. judging the magnitude of the updated fitting ERROR vector value and the maximum fitting ERROR threshold MAX _ ERROR again, if all the fitting ERROR vector values are larger than the threshold MAX _ ERROR, executing operation 1, and otherwise, skipping out of the cycle operation and stopping;
3. circularly combining the segmentation data segments of the minimum fitting error position and the right adjacent position thereof until the final fitting error vector
Figure BDA0000893564990000129
All above the MAX ERROR value, the recursive merge loop stops.
After the above steps, after all the segmented data segments are completed by means of recursive combination, three results are generated, one is a set of segmented data segments
Figure BDA00008935649900001210
Second, fitting error vector
Figure BDA00008935649900001211
Third, the sample integral segmentation position vector
Figure BDA00008935649900001212
(where x is the number of final segmentations of the sample data). Therefore, the target of adaptively segmenting the state of the fault sample is realized, and preparation is made for next fault feature extraction.
And step 3: performing feature extraction on the fault sample segmentation data segment;
the method realizes the extraction of mathematical characteristics such as slope, time proportion, mean value, variance and the like for each segmented data segment of the current fault sample.
For example, the failure sample F*Is divided into
Figure BDA00008935649900001213
The x divided data segments. From f1 *The data segment begins feature extraction.
The specific operation of feature extraction is as follows:
due to f1 *The data segment is
Figure BDA0000893564990000131
In matrix form, the row is the number of time points and the column is the number of test points.
Feature extraction is carried out according to the dimension (column) of the measuring points, and the main features are as follows: slope k, duration l, mean m, variance v.
To be provided with
Figure BDA0000893564990000132
For example, how to perform the relevant feature extraction work is described:
the vector is linearly fitted according to the principle of least square method, and the fitting result is a linear equation p (x) a0i+a1ix. So that the slope characteristic k is a1i
Duration characteristic l is
Figure BDA0000893564990000133
Vector length, i.e., l ═ h;
mean characteristic m is
Figure BDA0000893564990000134
Average of all values of the vector, i.e.
Figure BDA0000893564990000135
The variance characteristic v is
Figure BDA0000893564990000136
Sum of fluctuation amplitudes of all values of the vector from the mean, i.e.
Figure BDA0000893564990000137
Thus dividing the data segment
Figure BDA0000893564990000138
Is converted into a feature matrix
Figure BDA0000893564990000139
The rest of
Figure BDA00008935649900001310
And (4) extracting the characteristics according to the operation method, and converting the original time domain data matrix into a mathematical characteristic matrix.
Final segmentation of data segment sets
Figure BDA00008935649900001311
Conversion to the form:
Figure BDA00008935649900001312
and 4, step 4: carrying out fault characteristic conversion and eliminating characteristic dimension;
1 slope k translates to inclination angle α: since k has a value range of [ - ∞, + ∞ [ - ]]In order to prevent two similar fault samples from generating an excessive difference of slope values due to the existence of fitting errors and finally causing misjudgment, the slope values are converted into angle values through an arc tangent function so as to eliminate the characteristic dimension. The slope characteristic conversion formula is as follows: alpha is alphaij=arctan(kij) Where i ∈ [1, x ]];j∈[1,n;]
Duration l translates into time ratio p: under the condition that the lengths of the fault samples are different, the segmentation duration cannot represent the length of a real and effective segmentation stage, and misjudgment is easy to generate, so that the segmentation duration is divided by the whole length of the original fault sample to eliminate the dimension of the duration characteristics. The time length characteristic conversion formula is as follows:
Figure BDA0000893564990000141
where i ∈ [1, x ]];
3 mean feature m normalization: under the condition that the segmentation numbers of the fault samples are different, the average value cannot represent the real and effective numerical characteristics of the segmentation stage, and misjudgment is easy to generate, so the total numerical quantity of the original fault samples should be divided to eliminate the dimension of the average value characteristics. The time length characteristic conversion formula is as follows:
Figure BDA0000893564990000142
where i ∈ [1, x ]];j∈[1,n];
Feature matrix of final fault samples
Figure BDA0000893564990000143
Converted into elimination dimensions
Figure BDA0000893564990000144
Such a matrix form;
and 5: calculating a mode distance threshold;
the purpose of this step is to find the mode distance between every two of all the samples obtained in step 1 and calculate the distance threshold for each fault category.
Through the four steps, all fault sample data { F1,F2,...,FsAll the data are converted into a feature matrix form set { fm } similar to the formula (1)1,fm2,...,fms}. And then calculating the mode distance of two different samples in the same type of fault samples by using a Dynamic Time Warping (DTW) method, and determining the mode distance threshold of each type of fault according to the calculation result.
The specific operation of calculating the mode distance by the Dynamic Time Warping (DTW) method is described below:
1 there are two sample feature matrices I under the assumption of failure type IAAnd IBThe specific form is as follows:
Figure BDA0000893564990000151
Figure BDA0000893564990000152
wherein, ai(i=1,2,...,x),bj(j ═ 1, 2.. times, x') are two sample feature matrices I, respectivelyAAnd IBThe row vector of (a) represents the mathematical characterization of each segmented data segment of the respective fault sample in all the measurement point dimensions.
2 two samples S under the same fault typeAAnd SBCharacteristic matrix I ofA、IBAccording to the dynamic time warping principle, and the optimal path is determined by combining the mode distance I with the mode distance I shown in the figure 2AAnd IBThe row vectors are copied and stretched to meet the condition that the length of the characteristic matrix columns of the two samples are aligned equally IAAnd IBThe mode distance calculation result of (2) is minimum. SAAnd SBThe formula for calculating the mode distance is as follows:
Figure BDA0000893564990000153
wherein, T [ i: - ] represents a multivariate sequence formed by the ith row vector to the last row vector of the characteristic matrix T;
MDbase(ai,bj) The Euclidean distance from the segmented data segment is represented, and the specific calculation method is as follows:
Figure BDA0000893564990000154
wherein eud (X, Y) represents the Euclidean distance between two vectors X, Y, i.e. the Euclidean distance
Figure BDA0000893564990000155
The four variables of lambda, rho and omega represent the weight of four characteristics of an inclination angle alpha, a time proportion p, a mean value proportion T and a variance v in the calculation mode distanceThe value is obtained. The sum of the four variables of lambda, rho and omega is 1, and all the variables are set to be 0.25 in the invention
According to the DTW calculation method described above, two samples S can be obtainedAAnd SBHas a mode distance of
Figure BDA0000893564990000156
If there are d fault samples in the fault type I, the d fault samples are obtained according to the permutation and combination mode and the calculation mode
Figure BDA0000893564990000161
In total
Figure BDA0000893564990000162
Individual mode distance results. And a pattern distance threshold Th of fault type I1Should be a vector
Figure BDA0000893564990000163
Maximum value of
Figure BDA0000893564990000164
According to the above method, the mode distance threshold of all fault types can be calculated, and a mode distance threshold vector is formed:
Thsets=[Th1,Th2,...,Thx]
step 6: integrating useful information to generate a failure mode knowledge base;
the useful information in the fault knowledge base should include the following parts:
1 feature matrix set FM ═ { FM ] extracted from all fault samples1,fm2,...,fms} (s is the number of failed samples);
2 measuring point name set PT ═ { PT associated with each fault type1,pt2,...,ptx} (x is the number of categories of the type of failure);
3 pattern distance threshold vector Thsets corresponding to each fault type [ Th ]1,Th2,...,Thx](x is thereforeNumber of categories of barrier types).
In order to correlate the two pieces of information (fault signature matrix and pattern distance threshold) with each other, the invention stores the fault information in a fault knowledge base in the following manner:
Figure BDA0000893564990000165
fig. 1 (right) is a flow chart of the model operation of the present invention, and the whole operation process mainly includes the following 6 steps:
step 1: acquiring abnormal sample information from a real-time database;
according to the condition early warning system of the power plant, the condition of the equipment is found to be in an unknown abnormal condition, and the following relevant operations are carried out: 1 determining the alarm generation time t of the equipment from the early warning system1And a cut-off time t2
2, determining a relevant observation point wp ═ x of the equipment alarm from the early warning system1,x2,...,xn'];
According to the generation time t1And a cut-off time t2And database sampling frequency fs, the number of available time points m ═ fs × (t)2-t1) The number of device measurement points n equals length ([ x ])1,x2,...,xn']) (length calculation length function).
Thus, the obtained abnormal sample data is sample data with n 'measuring points and m time points, and all the measuring point data at the moment j can be regarded as a column vector with n' dimension, which is expressed as:
v(tj)=[vj1,vj2,vj3,...,vjn']
the sample data file is stored in a matrix form of m × n', and the specific form is as follows:
Figure BDA0000893564990000171
according to the method, the obtained abnormal sample storage form is as follows: the row dimension represents m failure times and the column dimension represents n' device observation points.
Step 2: performing piecewise linear fitting on the current abnormal sample;
the method mainly carries out adaptive state segmentation according to the overall development trend of the abnormal sample, and carries out data preprocessing preparation for subsequent fault feature extraction. The method is operated according to piecewise linear fitting of a model training stage:
2.1 mean value Filter operation
Setting an abnormal sample
Figure BDA0000893564990000172
The length of the sample is 0.1 times as the length of the filtering template, and the noise pollution doped in each measuring point in the abnormal sample is filtered and eliminated according to the mean filtering principle. The specific operation is seen in the model training phase step 2.1.
All the measuring points of the abnormal sample data finish filtering operation according to corresponding steps in sequence, so that the abnormal sample data
Figure BDA0000893564990000173
Is converted into
Figure BDA0000893564990000174
The form is as follows:
Figure BDA0000893564990000175
2.2 Exception sample segmentation initialization
For filtered abnormal samples
Figure BDA0000893564990000176
Segment initialization is carried out, every 2 pieces of data are divided into 1 data segment, and the abnormal sample data is divided into 1 data segment due to the fact that the length of the abnormal sample data is m
Figure BDA0000893564990000177
The most detailed data segment; if m is odd, thenThe latter section consists of 3 pieces of data, and is finally divided into
Figure BDA0000893564990000178
The most detailed data segment. The initialization effect of the segmentation is as follows:
Figure BDA0000893564990000181
wherein,
Figure BDA0000893564990000182
2.3 computing fitting error by combining two initialized data segments
Generated after initializing the step 2.2 abnormal sample segment
Figure BDA0000893564990000183
(or
Figure BDA0000893564990000184
) And each data segment and the right adjacent data segment are fitted according to the least square principle to realize combination, and the fitting error sum generated by all the measuring points is calculated.
When all are
Figure BDA0000893564990000185
(or
Figure BDA0000893564990000186
) After the data segments are combined pairwise, three results are generated, 1, the data segment set is divided
Figure BDA0000893564990000187
2 fitting error vector
Figure BDA0000893564990000188
3-sample global segmentation position vector
Figure BDA0000893564990000189
2.4 recursive merging to determine outlier sample segmentation cut points
This step specifically operates the reference model construction phase, divided above
Figure BDA00008935649900001810
Continuously merging two adjacent data segments generating the minimum fitting error by the recursive merging mode of each data segment to generate 3 updated results, wherein 1 is a set of divided data segments
Figure BDA00008935649900001811
Second, fitting error vector
Figure BDA00008935649900001812
Third, the sample integral segmentation position vector
Figure BDA00008935649900001813
(where x is the number of final segmentations of the sample data). The goal of adaptive state segmentation of the anomaly samples is achieved,
Figure BDA00008935649900001814
and extracting the operation object for the next step of abnormal features.
And step 3: carrying out abnormal feature extraction and transformation on the data subjected to segmentation fitting;
the method realizes extraction of 4 mathematical characteristics of slope, time span, mean value and variance of each data segment after the current abnormal sample is segmented.
For example, abnormal samples
Figure BDA00008935649900001815
Is divided into
Figure BDA00008935649900001816
The x divided data segments. The specific operation of feature extraction may refer to step 3 of the model construction phase.
Each of the divided data segments
Figure BDA0000893564990000191
Extracting characteristics such as corresponding slope k, time span l, mean value m, variance v and the like according to the dimension of the measuring point, wherein the result after extraction is as follows:
Figure BDA0000893564990000192
when all the segmentation data segments of the abnormal sample are subjected to feature extraction according to the method, the segmentation data segment vector with the length of x
Figure BDA0000893564990000193
Can be converted into the following characteristic matrix form:
Figure DEST_PATH_GDA0000937340420000194
in order to prevent the abnormal samples and similar fault samples from causing final misjudgment when calculating the pattern distance due to the following reasons:
1, the existence of fitting errors causes overlarge difference of slope values, and the solution is to convert the slope value k into an angle value alpha through an arc tangent function so as to eliminate the characteristic dimension;
2, under the condition that the lengths of the fault samples are different, the segmentation time length l cannot represent the proportion of a real and effective segmentation stage, misjudgment is easy to generate, and the segmentation time length l is divided by the whole length m of the original fault sample to eliminate the dimension of time length characteristics;
3 under the condition that the number of the divided fault samples is different, the mean value cannot represent the real and effective numerical characteristics of the divided stages, and misjudgment is easy to occur, so the total numerical quantity of the original fault samples should be divided to eliminate the dimension of the mean value characteristics.
The specific operation method for eliminating the dimension can be seen in the characteristic transformation formula of step 4 of the model construction stage. Thus transformed, the features are transformed into the following dimensionless form:
Figure BDA0000893564990000195
abnormal sample
Figure BDA0000893564990000201
Conversion into the above feature matrix
Figure BDA0000893564990000202
Then, the step of calculating the distance of the sample mode of the fault knowledge base can be carried out.
And 4, step 4: sequentially calculating pattern distances for sample features in a fault pattern knowledge base by using the abnormal feature information;
firstly, extracting the stored information faultMessageSets in the fault knowledge base, wherein the specific mode is as follows:
Figure BDA0000893564990000203
then, a feature matrix FT and a measuring point information vector wp of the abnormal sample and first-type fault information V1={fm11,fm12,...,fm1,c1,pt1,Th1And fourthly, performing fault identification work. The specific operation is as follows:
a comparison measuring point information vector wp and a fault type fault measuring point information vector pt1The repetition rate f is calculated by the following formula:
Figure BDA0000893564990000204
sequentially carrying out pattern distance numerical dist on the feature matrix FT of the two abnormal samples and each feature matrix under the fault type 1 according to the following method1iAnd (3) calculating:
the abnormal sample may be a complete fault sample of a certain type or a partial fault sample in a development process of a certain type. For accurately calculating dist1iNumerical value, this step applies the telescopic windowThe sliding search technology is used for accurately positioning the fault part, so that the mode recognition function of the abnormal sample is realized, and the development stage information of the abnormal sample is captured.
2.1 first, all possible values of the shrink window WT are determined. The following contents are set in the invention:
1, all fault samples in a fault knowledge base are samples of a complete process;
2, the expansion ratio of the expansion window is xi-20%;
the value range of WT may be defined as round (length (ft)) 80% to round (length (ft)) 120%, and WT ═ n is set1,n2,...,nx];
Wherein round is a rounding function; length is a function of calculating the number of sample segments, i.e., the number of rows of the feature matrix.
2.2 then from the telescopic window n1Initially, at each sample feature matrix fm for fault type 11iAnd the mode distance is calculated by sliding.
If fm1iIf the length of the window is m, the sliding frequency of the telescopic window is m, and the range of pattern recognition on the fault sample is as follows:
[1,n1]~[m-n1+1,m]。
abnormal samples from [1, n ] of faulty samples1]The position starts to calculate the mode distance, each sliding step L is set to 1, and FT and fm are calculated1i[1:n1]The mode distance formula of (1):
dist11=κ*MDDTW(FT,fm1i[1:n1]
wherein, if the repetition rate η is equal to 0, then κ is 2; if the repetition rate eta is not equal to 0
Figure BDA0000893564990000211
MDDTW(FT,fm1i[1,n1]) To find FT and fm1i[1,n1]DTW distance function of (d).
The rest is [2, n ]2]~[m-n1+1,m]The ranges of (A) are all calculated according to the above 2.1-2.2 steps, and finallyObtaining a mode distance vector
Figure BDA0000893564990000216
At window size n1In the case of (2), the local best mode distance is dist11=min(distVector1) The local best matching position is:
L11=[nmin-n1+1,nmin]。
2.3 last other expansion windows x-1 value cases n2~nxCalculating the local optimal mode distance dist under each window value1iAnd local best matching position L1i=[nx-ni+1,nx]。
2.4 after all window values are matched, the following two important results are generated:
1 local mode distance vector
Figure BDA0000893564990000217
2 local optimum position vector
Figure BDA0000893564990000218
Selecting mode distance minimum from local mode distance vector
Figure BDA0000893564990000212
And corresponding best matching position
Figure BDA0000893564990000213
Feature matrices FT and fm as abnormal samples and failure samples1iThe pattern recognition end result of (1).
After the three abnormal sample feature matrixes FT are matched with all feature matrixes of other fault samples one by one, the pattern distance vector is generated
Figure BDA0000893564990000214
And a position vector
Figure BDA0000893564990000215
Remaining x-1 type faults ViSolving the pattern distance vector according to the mode and the abnormal samples FT and wp, and finally obtaining a pattern distance set
Figure BDA0000893564990000221
And a set of position vectors
Figure BDA0000893564990000222
And 5: converting the mode distance into mode similarity by using a falling-ridge distribution method;
the pattern distance obtained in the above step is the distance obtained by applying the DTW algorithm, which is essentially the euclidean distance, and the euclidean distance is related to the time length of the samples, and the greater the time length is, the euclidean distance is correspondingly increased, so that the similarity between the two samples cannot be visually judged, and therefore, the final fault diagnosis result can be obtained only after the pattern distance obtained in the above step is converted into the pattern similarity by adopting the ridge-descending distribution method in the present step.
The mode distance set composition component obtained in step 4 is abc mode distance
Figure BDA0000893564990000223
Pattern distance is converted to similarity according to the principle of the falling-ridge stepwise method according to the following formula
Figure BDA0000893564990000224
Figure BDA0000893564990000225
Wherein,
Figure BDA0000893564990000226
represents the pattern distance calculated by pattern recognition of the abnormal sample and the jth fault sample under the fault type i, and ThiRepresentsThe mode distance threshold for fault type i.
Thus, each distance vector is calculated according to the dimension of the fault type
Figure BDA0000893564990000227
Converted into similarity vectors
Figure BDA0000893564990000228
Finally, the similarity set of the obtained abnormal sample FT and all samples of the fault knowledge base is as follows:
Figure BDA0000893564990000229
step 6: and outputting a final fault diagnosis result.
According to the similarity set result rho Sets obtained in the step 5, the most effective and reliable fault diagnosis result is output according to the agreed rule:
rule 1: if a plurality of fault samples with the similarity exceeding 90% with the abnormal sample exist, outputting the serial numbers of all the samples, simultaneously outputting the type vector of the fault sample, the confidence level of the fault type, the best matching position Location of the sample with the maximum similarity in the fault development stage, and outputting the recommended maintenance action if the confidence level exceeds 50%;
rule 2: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is greater than or equal to 60%, outputting the serial number of the maximum similarity sample, and simultaneously outputting the type of the fault to which the fault sample belongs, the confidence level of the fault type to which the fault sample belongs, and the best matching position Location of the maximum similarity sample in the fault development stage; if the confidence level exceeds 50%, recommending action of the adopted maintenance measure;
rule 3: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is less than 60%, outputting the abnormal sample as an unusual working condition or an unknown fault type, and recommending to continuously pay attention to the development of the abnormality;
wherein the typeVector value satisfies the above rule
Figure BDA0000893564990000231
The value of i in (1); the confidence level value of a certain fault type is equal to the number of the fault types meeting the rule divided by the total number of the fault samples of the fault type; the maintenance measure repairAction can be obtained by inquiring the operation rule of the equipment according to the fault type.
The steam pump preposition pump of a #2 unit of a certain thermal power plant in the north is taken as a state monitoring object, the steam pump preposition pump is an important component of a steam feed water pump, the function is important, and the performance safety of boiler equipment is maintained. The equipment belongs to exposed facilities, has more observation parts and is easy to cause various faults, and the characteristic is suitable for the equipment fault diagnosis method designed by the invention. The implementation process of the present invention is further illustrated by the detailed description of the embodiment.
The implementation steps of the embodiment of the invention for the fault diagnosis of the pre-pump equipment of the steam pump of a certain power plant are as follows:
fault diagnosis knowledge base construction process of steam pump pre-pump equipment
Step 1: acquiring failure sample information of a front pump of the steam pump from a power plant PI database;
selecting 21 key parameters related to the operation of a front pump of the air pump, wherein the key parameters comprise actual power (MW), radial bearing temperature (DEG C) of a motor transmission end, thrust bearing temperature (DEG C), motor current (A) and the like, and therefore each piece of observation data of the equipment is 21-dimensional row vector:
u(tj)=[uj1,uj2,uj3,...,uj21]
searching fault records of the equipment from the 10 months in 2013 to the 10 months in 2015, determining that the fault type P is greater than or equal to 2 and the fault occurrence frequency T is greater than or equal to 2 according to a screening rule, and extracting the following information from the fault records of the equipment according to the screening rule: in the period, 3 faults (radial bush temperature sudden rise of a motor transmission end, thrust bush temperature sudden drop and motor current sudden change) occur totally, the number of the faults is 86 (radial bush temperature sudden rise of the motor transmission end is 25, thrust bush temperature sudden drop is 28 and motor current sudden change is 33), fault related measuring points are different (the radial bush temperature sudden rise of the motor transmission end is from measuring point 6 to measuring point 14, the thrust bush temperature sudden drop is from measuring point 15 to measuring point 17, the motor current sudden change is from measuring point 1 to measuring point 6 and from measuring point 18 to measuring point 21), and the fault duration time range is from 2 hours to 14 hours. (ii) a And finally, reading all fault sample data from the power plant real-time database PI according to the information.
The finally obtained fault samples comprise three parts of fault type number information (the radial tile temperature sudden rise number of a motor transmission end is 1, the thrust tile temperature sudden fall number is 2, and the motor current sudden change number is 3), fault sample data and fault type related measuring point information, wherein the sequence of each type of fault samples is sequenced according to the sequence of fault occurrence time:
Figure BDA0000893564990000241
wherein,iFjthe specific form of (1) is a matrix form, the row m represents the sample length, and the column represents the measuring point. As follows:
Figure BDA0000893564990000242
step 2: performing piecewise linear fitting on each Fault sample in Fault Sets;
from the sample1F1Starting until the sample3F33And each fault sample adopts a piecewise linear fitting technology, so that the self-adaptive state segmentation is realized according to the overall development trend of the fault sample, and data preprocessing preparation is carried out for subsequent fault feature extraction.
First, a mean filtering operation is performed. Sequentially extractingiFjPer station column vector u-iDoping according to the operation steps of the mean filtering principle in the specific embodimentFiltering and eliminating noise pollution in sample data to obtain filtered dataiFj *
After the filtering operation, the maximum fitting ERROR threshold MAX _ ERROR is empirically set to 0.2. The bottom-up state partitioning is implemented in a recursive merge manner. The following three results were obtained: 1 partitioning a set of data segments
Figure BDA0000893564990000243
2 fitting error vector
Figure BDA0000893564990000244
3-sample global segmentation position vector
Figure BDA0000893564990000245
Finally, each sample of the steam pump is arranged in front of the pumpiFjAll are processed into
Figure BDA0000893564990000246
And step 3: extracting the characteristics of each section of data of each segmented fault sample;
because the sample data time length is in the hour level, in order to facilitate the rapid and effective operation of the fault diagnosis, the mathematical characteristics such as the slope k, the time length l, the mean value m, the variance v and the like are extracted from each section of data of the fault sample to carry out data compression and the transformation of the feature space.
After feature extractioniFjEach piece of data fi *All converted into four-dimensional elements (k, l, m, v). The specific feature extraction formula can be seen in step 3 of the modeling stage of the specific embodiment. Thus, each fault sample realizes the process of converting the sample from the time domain space to the feature space:
Figure BDA0000893564990000251
and 4, step 4: carrying out fault characteristic conversion and eliminating characteristic dimension;
in order to prevent misjudgment of different fault samples due to dimension reasons when the mode distance is calculated, three characteristics of the slope, the duration and the mean value need to be converted into a slope angle, a time proportion and a mean value proportion. The specific operation method for eliminating the dimension can be seen in a characteristic transformation formula in the step 4 of the modeling stage. Through transformation, the characteristics are transformed into the following dimensionless forms:
Figure BDA0000893564990000252
so far, the conversion from the time domain space to the feature space is realized by all samples, and the conversion of the Fault sample set Fault Sets into the Fault sample feature set is as follows:
Figure BDA0000893564990000253
and 5: calculating a mode distance threshold value under each fault type;
through the 4 steps, all fault sample data are converted into a characteristic matrix form set form. In the step, a Dynamic Time Warping (DTW) method is used for calculating the mode distance of two different samples in the same type of fault samples, and the mode distance threshold of each type of fault is determined according to the calculation result.
For example, there are 25 sample feature matrices in the type 1 fault in the fault sample, and every two feature matrices calculate the DTW distance to obtain a vector DtwVector1=[Md1,Md2,...,Md25]. Pattern distance threshold Th of fault type 11=min(DtwVector1). The pattern distance threshold Th for fault types 2, 3 is obtained in the same way2、Th3
Step 6: generating a steam pump gas quality pump fault mode knowledge base;
summarizing the information obtained in the steps, the steam pump pre-pump fault knowledge base should include the following 3 parts of information:
1. and extracting a feature matrix set FaultFeatureSets from all fault samples.
2. Relevant measuring point name set PT ═ { PT) of three types of faults1,pt2,pt3};
3. Pattern distance threshold values Thsets ═ Th corresponding to three types of faults1,Th2,...,Thx]。
And integrating the above 3 parts of information to generate a complete steam pump preposed pump fault knowledge base in the following form:
Figure BDA0000893564990000261
fault on-line diagnosis process of steam pump pre-pump equipment
The plant-level monitoring system of the power plant finds that unknown abnormality occurs at a certain measuring point of the steam pump pre-pump equipment when 03 is 12 months and 15 days in 2014. In order to better help the professional personnel to carry out inspection and maintenance, the power plant personnel retrieve abnormal sample data from the real-time database according to the starting and ending time of the known abnormality and the related alarm measuring point information pt
Figure BDA0000893564990000262
And the fault diagnosis method of the invention is used for carrying out advanced prejudgment.
Abnormal sample data
Figure BDA0000893564990000263
After data processing operations such as piecewise linear fitting, abnormal feature extraction and feature dimension elimination, calculating to obtain abnormal sample data
Figure BDA0000893564990000264
Corresponding feature matrix
Figure BDA0000893564990000265
And (4) matrix. The above operations can be referred to as corresponding steps of the model operation phase of the embodiment.
Abnormal sample information
Figure BDA0000893564990000266
And pt is used for matching the sample information in the fault mode knowledge base one by one, and converting the mode distance into corresponding fault similarity.
The abnormal samples are matched with all fault samples in the knowledge base, and the DTW distance is calculated by using a sliding telescopic time window method to serve as the final mode distance. The set of available distances is in the form:
Figure BDA0000893564990000267
and (4) mapping the distances between the [0,1] numerical values by operating a falling-ridge distribution method, and converting the mode distances into similarity according to a unified standard. Embodiments A falling ridge distribution method of the model run phase is used herein. The available similarity sets are in the form:
Figure BDA0000893564990000271
outputting a judgment result of fault diagnosis according to an agreed fault diagnosis rule: the abnormity is pre-determined to be the radial tile temperature sudden-rise fault of the motor transmission end, and the determination reason is as follows: 15 fault samples with similarity of more than 90% with the abnormal sample in the front-mounted pump of the steam pump are (
Figure BDA0000893564990000272
Etc.), the confidence coefficient of the diagnosis fault type 1 is 60% and exceeds the confidence coefficient threshold value, and the fault sample with the maximum similarity is that the number of the segmentation segments is 7
Figure BDA0000893564990000273
The optimal matching positions are 1-4 segments, and the actual fault diagnosis effect diagram is shown in figure 4. The integral trends of the two samples are matched and consistent from the beginning to the middle position, so that the fault development stage is determined as the middle stage of fault development, and the query operation regulation suggests to take maintenance measures as a method for forcibly cooling the transmission end of the cold water impact motor
The fact shows that the diagnosis result after the professional staff arrive in time is really the radial tile temperature sudden-rise fault of the motor transmission end, and the operation of the front-mounted pump equipment of the steam pump is recovered to be normal after the front-mounted pump of the steam pump is forced to be cooled to a reasonable temperature range by adopting a cold water cooling method.
Although exemplary embodiments of the present invention have been described for illustrative purposes, those skilled in the art will appreciate that various modifications, additions, substitutions and the like can be made in form and detail without departing from the scope and spirit of the invention as disclosed in the accompanying claims, all of which are intended to fall within the scope of the claims, and that various steps in the various sections and methods of the claimed product can be combined together in any combination. Therefore, the description of the embodiments disclosed in the present invention is not intended to limit the scope of the present invention, but to describe the present invention. Accordingly, the scope of the present invention is not limited by the above embodiments, but is defined by the claims or their equivalents.

Claims (1)

1. A device fault diagnosis method based on multi-dimensional piecewise fitting is characterized by sequentially comprising the following steps: step 1: the fault diagnosis training step specifically comprises the following steps:
(1.1) acquiring fault sample information from a database;
(1.2) sequentially carrying out piecewise linear fitting on each fault sample;
(1.3) extracting the characteristics of each section of data of the fault sample to obtain a characteristic matrix of the fault sample;
(1.4) carrying out fault characteristic conversion, eliminating characteristic dimension and obtaining a characteristic matrix of a fault sample with the dimension eliminated;
(1.5) calculating a mode distance threshold;
(1.6) associating the fault feature matrix and the pattern distance threshold value with each other, and storing to generate a fault pattern knowledge base;
step 2: the fault diagnosis operation steps specifically comprise:
(2.1) acquiring abnormal sample information from a real-time database;
(2.2) performing piecewise linear fitting on the current abnormal sample;
(2.3) carrying out abnormal feature extraction and transformation on the data subjected to segmentation fitting;
(2.4) sequentially calculating the pattern distance of the sample features in the fault pattern knowledge base by using the abnormal feature information;
(2.5) converting the mode distance into a mode similarity using a falling ridge pacing method;
(2.6) outputting a final fault diagnosis result;
wherein, the step (1.1) comprises the following steps: selecting a piece of research equipment with the number P of fault types being more than or equal to 2 and the occurrence frequency T of each fault being more than or equal to 2 to meet the requirements, selecting the number of observation points as N, wherein N is more than or equal to 10, carrying out fault record searching on historical running state data of the equipment for a certain time, picking out fault related measurement point information, starting and stopping time of a fault process and useful information recorded by fault maintenance measures from fault records by utilizing a set screening rule, and reading fault sample data from a power plant real-time database PI according to the useful information, wherein:
the fault sample data with n measuring points and m time points, all the measuring point data at the moment j are regarded as a n-dimensional column vector and are expressed as follows:
u(tj)=[uj1,uj2,uj3,...,ujn]
the sample data file is stored in a matrix form of m × n, and the specific form is as follows:
Figure FDA0002582527450000021
the row represents m fault time, the column represents n equipment observation points, the m and n values of the row and the column between each fault sample are different, each fault sample is endowed with a fault type identifier ID, and the fault type identifier ID determination method is that if all the samples contain X types of faults, the numerical range of the fault type identifier ID is as follows: 1-X;
wherein, the step (1.2) comprises the following steps:
(1.2.1) mean filtering operation: filtering and eliminating noise pollution doped in sample data;
(1.2.2) carrying out fault sample segmentation initialization, namely carrying out segmentation initialization on the fault sample after filtering processing;
(1.2.3) combining every two initialized data segments to calculate a fitting error;
(1.2.4) determining a segmentation cutting point of a fault sample, and performing adaptive state segmentation on the fault sample;
wherein, the step (1.3) comprises the following steps:
dividing the fault sample into
Figure FDA0002582527450000022
The x divided data segments from f1 *The data segment starts to carry out feature extraction, and the specific operations are as follows:
due to f1 *The data segment is
Figure FDA0002582527450000023
In the form of matrix, the row is time point number and is listed as measuring point number;
and (3) extracting features according to the dimensions of the measuring points, wherein the features comprise: slope k, duration l, mean m and variance v;
wherein,
Figure FDA0002582527450000024
the vector is linearly fitted according to the principle of least square method, and the fitting result is a linear equation p (x) a0i+a1ix, so slope characteristic k ═ a1iWherein a is0i、a1iIs a fitting constant;
duration characteristic l is
Figure FDA0002582527450000025
Vector length h, i.e., l ═ h;
mean characteristic m is
Figure FDA0002582527450000031
Average of all values of the vector, i.e.
Figure FDA0002582527450000032
The variance characteristic v is
Figure FDA0002582527450000033
Sum of fluctuation amplitudes of all values of the vector from the mean, i.e.
Figure FDA0002582527450000034
N is the total number of vectors;
segmenting data segments
Figure FDA0002582527450000035
Is converted into a feature matrix
Figure FDA0002582527450000036
The rest of
Figure FDA0002582527450000037
Extracting characteristics according to the operation method, converting the original time domain data matrix into a mathematical characteristic matrix, and finally dividing the data segment set
Figure FDA0002582527450000038
Conversion to:
Figure FDA0002582527450000039
wherein, the step (2.1) comprises the following steps:
according to the equipment state early warning system of the power plant, discovering that certain unknown abnormal state occurs in the equipment, and carrying out the following related operations:
(2.1.1) determining the device alarm generation time t from the early warning system1And a cut-off time t2
(21.2) determining from the early warning system the relevant observation point wp ═ x for the device alarm1,x2,...,xn'];
(2.1.3) according to the generation time t1And a cut-off time t2And a database sampling frequency fs, and obtaining the time point number m ═ fs × (t)2-t1) The number of device measurement points n equals length ([ x ])1,x2,...,xn']) Where length () is the calculated length function;
(2.1.4) acquiring abnormal sample data, namely the sample data with n 'of measuring points and m of time points, wherein all the measuring point data at the moment j are regarded as a column vector with n' dimension and are expressed as:
v(tj)=[vj1,vj2,vj3,...,vjn']
storing the sample data file in a matrix form of m multiplied by n', wherein the specific form is as follows:
Figure FDA0002582527450000041
wherein the rows represent m failure times and the columns represent n' device observation points;
wherein, the step (2.2) comprises the following specific steps:
(2.2.1) mean filtering operation: filtering and eliminating noise pollution of each measuring point doped in abnormal sample data according to a mean filtering principle;
(2.2.2) carrying out segmentation initialization on the abnormal sample after filtering treatment;
(2.2.3) combining every two initialized data segments to calculate a fitting error;
and (2.2.4) determining an abnormal sample segmentation cut point based on a recursive combination mode.
CN201511004841.0A 2015-12-29 2015-12-29 Equipment fault diagnosis method based on multi-dimensional piecewise fitting Active CN105631596B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511004841.0A CN105631596B (en) 2015-12-29 2015-12-29 Equipment fault diagnosis method based on multi-dimensional piecewise fitting

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511004841.0A CN105631596B (en) 2015-12-29 2015-12-29 Equipment fault diagnosis method based on multi-dimensional piecewise fitting

Publications (2)

Publication Number Publication Date
CN105631596A CN105631596A (en) 2016-06-01
CN105631596B true CN105631596B (en) 2020-12-29

Family

ID=56046501

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511004841.0A Active CN105631596B (en) 2015-12-29 2015-12-29 Equipment fault diagnosis method based on multi-dimensional piecewise fitting

Country Status (1)

Country Link
CN (1) CN105631596B (en)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106096634B (en) * 2016-06-06 2018-04-06 北京理工大学 Fault detection method based on Adaptive windowing mental arithmetic method with interval halving algorithm
CN106096214B (en) * 2016-07-22 2019-08-27 杨志强 A kind of switchgear Fault Fuzzy Diagnostic method based on indirect thermometric mode
CN106789243B (en) * 2016-12-22 2019-06-14 烟台东方纵横科技股份有限公司 A kind of IT operational system with intelligent trouble analytic function
CN106697291A (en) * 2016-12-22 2017-05-24 深圳万智联合科技有限公司 Intelligent unmanned plane pesticide spraying device
CN110226140B (en) * 2017-01-25 2023-04-11 Ntn株式会社 State monitoring method and state monitoring device
JP6772963B2 (en) * 2017-06-05 2020-10-21 トヨタ自動車株式会社 Abnormality diagnosis device and abnormality diagnosis method
CN107748892B (en) * 2017-09-25 2020-11-17 西安理工大学 Human behavior data segmentation method based on Mahalanobis distance
CN108398656B (en) * 2018-02-27 2021-01-05 安徽建筑大学 High voltage light emitting diode street lamp, fault diagnosis method and readable storage medium
CN108414949B (en) * 2018-02-27 2021-06-01 安徽建筑大学 High-voltage light-emitting diode lamp fault diagnosis method and medium based on illumination characteristics
CN108829755B (en) * 2018-05-25 2021-04-13 国网山东省电力公司青岛供电公司 Transformer fault case matching search method based on iteration closest point
CN110646202B (en) * 2018-06-08 2021-09-03 西门子股份公司 Method and device for detecting wear of a bearing
CN109242835A (en) * 2018-08-24 2019-01-18 深圳市兆泰云智能科技有限公司 Vehicle bottom defect inspection method, device, equipment and system based on artificial intelligence
CN109239301B (en) * 2018-09-12 2021-06-01 江苏科技大学 Anchor chain flash welding quality online evaluation method
CN109507990B (en) * 2018-12-25 2021-06-15 中南大学 Fault tracing method and system
CN110187294B (en) * 2019-04-17 2024-08-02 安徽建筑大学 Fault diagnosis method and device for piecewise linear constant current drive LED light source
CN110298767B (en) * 2019-05-29 2020-05-12 山东科技大学 Thermal power plant time sequence variable abnormity monitoring method and system
CN112152830B (en) * 2019-06-28 2023-08-04 中国电力科学研究院有限公司 Intelligent fault root cause analysis method and system
CN111222774B (en) * 2019-12-30 2020-08-18 广州博士信息技术研究院有限公司 Enterprise data analysis method and device and server
CN112257754B (en) * 2020-09-24 2023-07-28 北京航天测控技术有限公司 Method and device for analyzing running state of spacecraft
CN112492567B (en) * 2020-11-19 2022-03-18 江苏恒宝智能系统技术有限公司 Fault analysis and solution method and device in emergency command communication
CN112487712B (en) * 2020-11-25 2024-08-23 中国科学院力学研究所 Intelligent identification method for buried suspended pipeline working condition parameters based on genetic algorithm and multidimensional dynamic time regularity
CN112849648B (en) * 2020-12-31 2023-04-28 重庆国际复合材料股份有限公司 Intelligent tray identification system and method
CN114973746B (en) * 2021-02-23 2023-09-26 腾讯科技(深圳)有限公司 Parking lot determining method, equipment and computer readable storage medium
CN113392874B (en) * 2021-05-13 2022-12-13 中车唐山机车车辆有限公司 Abnormal state diagnosis method and device for rail vehicle and terminal equipment
CN113496089B (en) * 2021-09-07 2021-11-16 江苏宝诺铸造有限公司 Internet of things equipment abnormal state online monitoring method and system based on artificial intelligence
CN114626666B (en) * 2021-12-11 2023-05-05 国网湖北省电力有限公司经济技术研究院 Engineering site progress identification system based on full-time space monitoring
CN114486259A (en) * 2022-01-05 2022-05-13 电子科技大学 Signal processing method of distributed optical fiber acoustic sensing system for optimizing variational modal decomposition
CN114693110B (en) * 2022-03-29 2023-07-04 杭州安脉盛智能技术有限公司 Abnormality monitoring method, abnormality monitoring system and storage medium of energy storage system
CN115956543A (en) * 2022-07-15 2023-04-14 生态环境部南京环境科学研究所 Fishing device and fishing method for wild giant salamander resource investigation
CN116104773B (en) * 2023-02-21 2024-04-12 重庆成峰水务工程有限责任公司 Comprehensive detection equipment and detection method for water pump faults

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103267652A (en) * 2013-05-24 2013-08-28 北京工业大学 Intelligent online diagnosis method for early failures of equipment
CN104462842A (en) * 2014-12-22 2015-03-25 厦门大学 Excavating diagnosis method of failure data based on bayesian network
CN104462846A (en) * 2014-12-22 2015-03-25 山东鲁能软件技术有限公司 Intelligent device failure diagnosis method based on support vector machine
CN105093040A (en) * 2015-09-09 2015-11-25 成都比善科技开发有限公司 Transformer station on-line monitoring system with high safety
CN105158598A (en) * 2015-08-15 2015-12-16 国家电网公司 Fault prediction method suitable for power equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103267652A (en) * 2013-05-24 2013-08-28 北京工业大学 Intelligent online diagnosis method for early failures of equipment
CN104462842A (en) * 2014-12-22 2015-03-25 厦门大学 Excavating diagnosis method of failure data based on bayesian network
CN104462846A (en) * 2014-12-22 2015-03-25 山东鲁能软件技术有限公司 Intelligent device failure diagnosis method based on support vector machine
CN105158598A (en) * 2015-08-15 2015-12-16 国家电网公司 Fault prediction method suitable for power equipment
CN105093040A (en) * 2015-09-09 2015-11-25 成都比善科技开发有限公司 Transformer station on-line monitoring system with high safety

Also Published As

Publication number Publication date
CN105631596A (en) 2016-06-01

Similar Documents

Publication Publication Date Title
CN105631596B (en) Equipment fault diagnosis method based on multi-dimensional piecewise fitting
CN105467975B (en) A kind of equipment fault diagnosis method
CN110322048B (en) Fault early warning method for production logistics conveying equipment
CN110969556B (en) Machine learning multidimensional multi-model fusion river channel water quality anomaly detection method and device
CN109146246B (en) Fault detection method based on automatic encoder and Bayesian network
CN111539553B (en) Wind turbine generator fault early warning method based on SVR algorithm and off-peak degree
CN110895526A (en) Method for correcting data abnormity in atmosphere monitoring system
CN111401573B (en) Working condition state modeling and model correcting method
CN106779200A (en) Based on the Wind turbines trend prediction method for carrying out similarity in the historical data
CN111275288A (en) XGboost-based multi-dimensional data anomaly detection method and device
CN109086793A (en) A kind of abnormality recognition method of wind-driven generator
Calvo-Bascones et al. A collaborative network of digital twins for anomaly detection applications of complex systems. Snitch Digital Twin concept
Liu et al. A two-stage deep autoencoder-based missing data imputation method for wind farm SCADA data
CN112032003B (en) Method for monitoring operation performance of large wind turbine generator
CN110717610A (en) Wind power prediction method based on data mining
CN115484102A (en) Industrial control system-oriented anomaly detection system and method
CN111080074A (en) System service security situation element obtaining method based on network multi-feature association
Vishnu et al. Recurrent neural networks for online remaining useful life estimation in ion mill etching system
CN115828466A (en) Fan main shaft component fault prediction method based on wide kernel convolution
Shi et al. Study of wind turbine fault diagnosis and early warning based on SCADA data
CN117763305A (en) Power time sequence data anomaly detection and repair method considering behavior commonality
Wendelberger et al. Monitoring Deforestation Using Multivariate Bayesian Online Changepoint Detection with Outliers
Han et al. Self-supervised deep clustering method for detecting abnormal data of wastewater treatment process
Lithoxoidou et al. Malfunction diagnosis in industrial process systems using data mining for knowledge discovery
Yang et al. Anomaly detection of dust removal system through gradient boosting decision tree algorithm

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder

Address after: 250101 5th floor, block B, Yinhe building, 2008 Xinluo street, high tech Zone, Jinan City, Shandong Province

Patentee after: Shandong luruan Digital Technology Co.,Ltd.

Address before: 250101 5th floor, block B, Yinhe building, 2008 Xinluo street, high tech Zone, Jinan City, Shandong Province

Patentee before: SHANDONG LUNENG SOFTWARE TECHNOLOGY Co.,Ltd.

CP01 Change in the name or title of a patent holder