CN105631596B - Equipment fault diagnosis method based on multi-dimensional piecewise fitting - Google Patents
Equipment fault diagnosis method based on multi-dimensional piecewise fitting Download PDFInfo
- Publication number
- CN105631596B CN105631596B CN201511004841.0A CN201511004841A CN105631596B CN 105631596 B CN105631596 B CN 105631596B CN 201511004841 A CN201511004841 A CN 201511004841A CN 105631596 B CN105631596 B CN 105631596B
- Authority
- CN
- China
- Prior art keywords
- fault
- sample
- data
- time
- abnormal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 84
- 238000003745 diagnosis Methods 0.000 title claims abstract description 74
- 230000002159 abnormal effect Effects 0.000 claims abstract description 71
- 230000011218 segmentation Effects 0.000 claims abstract description 60
- 238000000605 extraction Methods 0.000 claims abstract description 23
- 238000012549 training Methods 0.000 claims abstract description 7
- 239000013598 vector Substances 0.000 claims description 70
- 239000011159 matrix material Substances 0.000 claims description 49
- 238000001914 filtration Methods 0.000 claims description 35
- 230000008569 process Effects 0.000 claims description 14
- 238000006243 chemical reaction Methods 0.000 claims description 13
- 238000012423 maintenance Methods 0.000 claims description 13
- 238000005259 measurement Methods 0.000 claims description 9
- 230000009466 transformation Effects 0.000 claims description 7
- 238000012545 processing Methods 0.000 claims description 6
- 238000012216 screening Methods 0.000 claims description 6
- 238000011160 research Methods 0.000 claims description 5
- 230000003044 adaptive effect Effects 0.000 claims description 4
- 238000005070 sampling Methods 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 abstract description 16
- 238000004422 calculation algorithm Methods 0.000 abstract description 8
- 230000000877 morphologic effect Effects 0.000 abstract description 5
- 238000011161 development Methods 0.000 description 18
- 230000018109 developmental process Effects 0.000 description 18
- 238000004364 calculation method Methods 0.000 description 11
- 230000005540 biological transmission Effects 0.000 description 8
- 238000013528 artificial neural network Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 238000009826 distribution Methods 0.000 description 5
- 238000005065 mining Methods 0.000 description 5
- 230000005856 abnormality Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 238000003909 pattern recognition Methods 0.000 description 4
- 238000007781 pre-processing Methods 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 238000010276 construction Methods 0.000 description 3
- 230000008030 elimination Effects 0.000 description 3
- 238000003379 elimination reaction Methods 0.000 description 3
- 238000012544 monitoring process Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 230000009471 action Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000011089 mechanical engineering Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 238000000638 solvent extraction Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 1
- 125000002015 acyclic group Chemical group 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 238000009411 base construction Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000007257 malfunction Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000009966 trimming Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0631—Resource planning, allocation, distributing or scheduling for enterprises or organisations
- G06Q10/06313—Resource planning in a project environment
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/80—Management or planning
- Y02P90/82—Energy audits or management systems therefor
Landscapes
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Engineering & Computer Science (AREA)
- Economics (AREA)
- Strategic Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- Quality & Reliability (AREA)
- Biodiversity & Conservation Biology (AREA)
- Game Theory and Decision Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Educational Administration (AREA)
- Tourism & Hospitality (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Testing And Monitoring For Control Systems (AREA)
- Testing Of Devices, Machine Parts, Or Other Structures Thereof (AREA)
Abstract
A device fault diagnosis method based on multi-dimensional segmentation fitting sequentially comprises a fault diagnosis training step and a fault diagnosis operation step, the essence of data morphological characteristics is set, a multi-dimensional segmentation fitting algorithm and an optimized dynamic time warping algorithm are combined to realize the functions of mode expression and distance threshold extraction of modeling based on device fault data, and the problem that the similarity degree between fault data is difficult to efficiently and accurately depict in the current fault diagnosis technology is solved by the functions of device data fault type identification and reason diagnosis through the mode matching of the extracted morphological characteristics of found device abnormal data.
Description
Technical Field
The invention relates to the technical field of fault diagnosis, in particular to an equipment fault diagnosis method based on multi-dimensional piecewise fitting.
Background
In recent years, with the continuous innovation and development of scientific technology and the rapid development of social industrialization, more and more industrial industrialization systems are loaded in large-scale industrial production occasions, and huge irreplaceable productivity is created in the application environment. Maintenance work for these devices is therefore particularly important and laborious. However, many implicit influencing factors may cause the system equipment to malfunction or even fail, so that countries and enterprises pay more and more attention to the status detection and fault diagnosis of the important system equipment.
Generally, these devices are increasingly sophisticated and coupled by interaction between different parts. The method of disassembling and disassembling the equipment is time-consuming and labor-consuming, the diagnosis effect is often not ideal, and the equipment performance is easy to be unstable. The method for reading the state of the important part of the equipment in real time by installing the monitoring sensor device to observe the running trend of the relevant parameters of the equipment and contrast the index threshold value is simple, intuitive and strong in reliability, and is widely popularized and used in various production occasions such as power plants, automobiles, satellites and the like at present. The efficiency of this approach is still limited by the ability and effort of plant maintenance personnel to work with complex unknown conditions that are prominent in the plant and are less experienced.
The above method is difficult to form a standard fault diagnosis system due to the dilemma of empirical knowledge expression, so a fault diagnosis method based on mathematical mining technology is gradually introduced into the fault diagnosis work of these large and complex devices. The mathematics mining technology integrates subject knowledge of modern cybernetics, computer science, artificial intelligence, signal processing, mode recognition, statistical mathematics and the like, and excavates information which is hidden in data and is beneficial to fault condition analysis by researching historical data and current real-time data of analysis equipment. Expert systems, association rules, neuron networks, Bayesian networks and the like are mathematical mining methods which are widely applied, and a set of research systems which are divided into fault analysis, fault modeling, fault detection, fault inference, fault decision and the like are gradually formed by the methods.
In a paper "vehicle fault diagnosis expert system with fusion example and rule inference" (journal of mechanical engineering, 2002, 7 th stage: 91-95), a brand-new mixed inference method with fusion examples and rules is provided in the paper, and a whole set of expert system for solving the problem that different diagnosis units are difficult to communicate information is established. It can be seen that the expert system can solve the fault diagnosis problem in the related field with the capability similar to the human expert level in the specific field, and the main characteristic is to form the own specific rule to analyze and solve the problem by relying on the existing human expert experience knowledge. The method is a characteristic of an expert system, but is limited by the lack of experience of human professional knowledge in unknown fields, the expert system often has a series of problems of low self-learning ability, low diagnosis success rate, difficulty in obtaining system knowledge and the like, and the problems still need to be further deeply researched and optimized by the method of the expert system.
In a paper "grid fault diagnosis based on association rule data mining technology" (protection and control of power system, 2009, 9 th period: 8-14), an article applies an association rule algorithm to grid fault diagnosis. And corresponding decision attributes and condition attributes are designated according to the fault characteristics, the establishment of an original decision table is completed, and meanwhile, an association rule algorithm is applied to carry out frequent item set mining and strong rule screening on decision table data, so that reasoning and diagnosis on fault information of various complex conditions are finally realized. However, the technology has some problems, for example, the common problems of the potential implicit failure mode such as poor discrimination, poor large-scale data processing capability, low efficiency of storing and updating the association rule, and the like, are still solved by the following related work.
In a paper "fault diagnosis technology research based on a BP neural network" (computer and modernization, 2009, volume 7), the article selects a general BP neural network to be applied to a fault diagnosis scene, and successfully applies the neural network to fault diagnosis according to the device level distribution characteristics under the condition of reasonably constructing a BP neural network model, so that the diagnosis efficiency and reliability are improved. The neural network has the greatest characteristic that the neural network can infinitely and nonlinearly approximate to an original data model under the condition of large sample quantity, but the inherent characteristics of overfitting, numerical randomness, unstable training and the like limit the application range of the neural network in the field of fault diagnosis, and other optimization algorithms need to be matched for diagnosis application.
In a thesis of Bayesian network fault diagnosis method for multi-feature information fusion (China mechanical engineering, 2010, 8 th: 940-945), an article takes a pump vibration signal as a research object, information fusion is carried out on multiple fault features such as a frequency domain and a time domain extracted by the signal by using a Bayesian parameter estimation method, then a complete fault classifier is established by constructing a Bayesian network, and fault types are identified by calculating a maximum posterior probability estimation value. The Bayesian network fault diagnosis method needs a priori statistical knowledge of a large amount of sample data, and meanwhile, the Bayesian network directed acyclic expression mode has a risk of error accumulation, and the method needs to be paid attention to the Bayesian network fault diagnosis method.
In view of the problems and risks of the common fault diagnosis technologies, the invention aims at fully mining the potential visual expression form of the data sample and measuring the difference effectiveness of the fault mode, and tries to better solve the problems of low efficiency and low recognition accuracy of fault diagnosis by establishing a new mode matching technology, thereby being beneficial to the continuous maintenance of the good state of the equipment.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, provides an equipment fault diagnosis method based on multi-dimensional piecewise fitting, and solves the problem that the similarity degree between fault data is difficult to describe efficiently and accurately in the current fault diagnosis technology.
The invention provides an equipment fault diagnosis method based on multi-dimensional piecewise fitting, which sequentially comprises the following steps:
step 1: the fault diagnosis training step specifically comprises the following steps:
(1.1) acquiring fault sample information from a database;
(1.2) sequentially carrying out piecewise linear fitting on each fault sample;
(1.3) extracting the characteristics of each section of data of the fault sample to obtain a characteristic matrix of the fault sample;
(1.4) carrying out fault characteristic conversion, eliminating characteristic dimension and obtaining a characteristic matrix of a fault sample with the dimension eliminated;
(1.5) calculating a mode distance threshold;
(1.6) associating the fault feature matrix and the pattern distance threshold value with each other, and storing to generate a fault pattern knowledge base;
step 2: the fault diagnosis operation steps specifically comprise:
(2.1) acquiring abnormal sample information from a real-time database;
(2.2) performing piecewise linear fitting on the current abnormal sample;
(2.3) carrying out abnormal feature extraction and transformation on the data subjected to segmentation fitting;
(2.4) sequentially calculating the pattern distance of the sample features in the fault pattern knowledge base by using the abnormal feature information;
(2.5) converting the mode distance into a mode similarity using a falling ridge pacing method;
and (2.6) outputting a final fault diagnosis result.
Further, the step (1.1) comprises the following specific steps: selecting a piece of research equipment with the number P of fault types being more than or equal to 2 and the occurrence frequency T of each fault being more than or equal to 2 to meet the requirements, selecting a sufficient number of observation points N, wherein N is more than or equal to 10, carrying out fault record searching on historical running state data of the equipment for a long enough time, picking out fault related measuring point information, starting and stopping time of a fault process and useful information of fault maintenance measure records from fault records by utilizing a set screening rule, and reading fault sample data from a power plant real-time database PI according to the useful information, wherein:
the fault sample data with n measuring points and m time points, all the measuring point data at the moment j are regarded as a n-dimensional column vector and are expressed as follows:
u(tj)=[uj1,uj2,uj3,...,ujn]
the sample data file is stored in a matrix form of m × n, and the specific form is as follows:
the row represents m fault time, the column represents n equipment observation points, the m and n values of the row and the column between each fault sample are different, each fault sample is endowed with a fault type identifier ID, and the fault type identifier ID determination method is that if all the samples contain X types of faults, the numerical range of the fault type identifier ID is as follows: 1-X.
Further, the step (1.2) comprises the following specific steps:
(1.2.1) mean filtering operation: filtering and eliminating noise pollution doped in sample data;
(1.2.2) carrying out fault sample segmentation initialization, namely carrying out segmentation initialization on the fault sample after filtering processing;
(1.2.3) combining every two initialized data segments to calculate a fitting error;
and (1.2.4) determining a segmentation cut point of the fault sample, and performing adaptive state segmentation on the fault sample.
Further, the step (1.3) comprises the following specific steps:
dividing the fault sample intoThe x divided data segments from f1 *The data segment starts to carry out feature extraction, and the specific operations are as follows:
due to f1 *The data segment isIn the form of matrix, the row is time point number and is listed as measuring point number;
and (3) extracting features according to the dimension (column) of the measuring points, wherein the features comprise: slope k, duration l, mean m and variance v;
the vector is linearly fitted according to the principle of least square method, and the fitting result is a linear equation p (x) a0i+a1ix, so slope characteristic k ═ a1iWherein a is0i、a1iIs a fitting constant;
The variance characteristic v isSum of fluctuation amplitudes of all values of the vector from the mean, i.e.N is the total number of vectors;
The rest ofExtracting characteristics according to the operation method, converting the original time domain data matrix into a mathematical characteristic matrix, and finally dividing the data segment setConversion to:
further, the step (1.4) comprises the following specific steps: converting the slope k into an inclination angle alpha, converting the duration l into a time proportion, carrying out normalization processing on the mean value characteristic m, and carrying out characteristic matrix treatment on the fault sample
Further, the step (2.1) comprises the following specific steps:
according to the equipment state early warning system of the power plant, discovering that certain unknown abnormal state occurs in the equipment, and carrying out the following related operations:
(2.1.1) determining settings from the early warning systemStandby alarm generation time t1And a cut-off time t2;
(2.1.2) determining from the early warning system the relevant observation point wp ═ x for the device alarm1,x2,...,xn'];
(2.1.3) according to the generation time t1And a cut-off time t2And a database sampling frequency fs, and obtaining the time point number m ═ fs × (t)2-t1) The number of device measurement points n equals length ([ x ])1,x2,...,xn']) Where length () is a calculated length function);
(2.1.4) acquiring abnormal sample data, namely the sample data with n 'of measuring points and m of time points, wherein all the measuring point data at the moment j are regarded as a column vector with n' dimension and are expressed as:
v(tj)=[vj1,vj2,vj3,...,vjn']
storing the sample data file in a matrix form of m multiplied by n', wherein the specific form is as follows:
where the rows represent m failure times and the columns represent n' device observation points.
Further, the step (2.2) comprises the following specific steps:
(2.2.1) mean filtering operation: filtering and eliminating noise pollution of each measuring point doped in abnormal sample data according to a mean filtering principle;
(2.2.2) carrying out segmentation initialization on the abnormal sample after filtering treatment;
(2.2.3) combining every two initialized data segments to calculate a fitting error;
and (2.2.4) determining an abnormal sample segmentation cut point based on a recursive combination mode.
Further, the step (2.5) comprises the following specific steps:
transforming pattern distances into pattern similarities according to the following formula based on the principle of the falling-ridge stepwise methodDegree of rotation
Wherein,represents the pattern distance calculated by pattern recognition of the abnormal sample and the jth fault sample under the fault type i, and ThiRepresenting a mode distance threshold under fault type i;
according to the dimension of the fault type, each distance vector is divided intoConverted into similarity vectors
Finally, a similarity set of the abnormal sample FT and all samples of the fault knowledge base is obtained, and the similarity set is as follows:
further, the step (2.6) comprises the following specific steps:
and outputting a fault diagnosis result according to the similarity set result rho Sets and an agreed rule, wherein the rule is satisfied:
rule 1: if a plurality of fault samples with the similarity exceeding 90% with the abnormal samples exist, outputting the serial numbers of all the samples, simultaneously outputting the fault type to which the fault sample belongs, the confidence coefficient of the fault type to which the fault sample belongs, and the best matching position of the sample with the maximum similarity in the fault development stage, and if the confidence coefficient exceeds 50%, outputting suggested maintenance measures;
rule 2: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is greater than or equal to 60%, outputting the serial number of the maximum similarity sample, simultaneously outputting the fault type to which the fault sample belongs, the confidence coefficient of the fault type to which the fault sample belongs and the best matching position of the maximum similarity sample at the fault development stage, and if the confidence coefficient exceeds 50%, outputting the recommended maintenance measures;
rule 3: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is less than 60%, outputting the abnormal sample as an unusual working condition or an unknown fault type, and continuing to pay attention to the development of the abnormality.
The equipment fault diagnosis method of the invention can realize that:
1. the invention is a brand-new fault diagnosis technology independent of faults. The present invention starts from analyzing the mathematical morphological characteristics of historical fault sample data, and judges the similarity degree and the most possible development trend of the existing fault in time for the current abnormal sample.
2. The invention can efficiently and accurately carry out fault diagnosis work, realizes the data information compression target and eliminates the influence of redundant data noise by converting the sample data from the time domain space to the mathematical morphology characteristic space, thereby greatly improving the fault diagnosis efficiency.
3. In order to ensure the effectiveness of pattern expression, the invention establishes a multidimensional segmentation fitting method based on the data development state. In consideration of the characteristic that the coupling among all variables in the sample data is unclear, the optimal segmentation threshold of the current sample data is mined by adopting a self-adaptive algorithm, and the segmentation positions of all the variables are consistent, so that the development state of the segmented data in each stage is more hierarchical.
4. The invention has low requirement on sample data for fault diagnosis and does not need to carry out complex data preprocessing work. The prior related technology generally cannot realize fault diagnosis of sample data with inconsistent sample time and different variable numbers in a knowledge base, and the method can solve the problem by methods of variable trimming, dynamic time warping and the like.
5. The operation speed of the fault analysis and fault identification part is within the second level, so that not only can the fault diagnosis of a sample of a complete fault mode be realized, but also the fault location of a part of fault modes can be realized. Therefore, the invention can be completely transplanted to an equipment on-line state detection system for application.
Drawings
FIG. 1 is a comprehensive flow chart of the training phase and the operation phase of the fault diagnosis model
FIG. 2 dynamic time warping distance effect plot
FIG. 3 is a trend graph of key measuring points of three faults of a front pump of a steam pump
FIG. 4 is a diagram illustrating the effect of online fault diagnosis of abnormal samples
Detailed Description
Reference will now be made in detail to the embodiments of the present invention, the following examples of which are intended to be illustrative only and are not to be construed as limiting the scope of the invention.
Most of the prior related technologies are that each variable dimension of fault data is separately cut apart for calculation, but the invention comprehensively considers the problems of related information between different variables of sample data, realizes a segmentation fitting technology based on bottom-up, extracts various overall morphological characteristics of the fault data and simultaneously considers the numerical characteristics of the fault data. The method can realize effective compression of large-scale data, reduce the fault diagnosis time and fundamentally improve the fault diagnosis precision, so the method has the capability of quick fault diagnosis. The invention carries out the calculation of the characteristic difference of the fault samples by a dynamic time warping method, and can solve the problem that the similarity of the fault samples with different lengths is difficult to measure. The fault diagnosis method of the invention does not take the mode distance as the final measurement standard, but compares the mode distance with the similarity percentage, unifies the measurement standard of fault identification, and further improves the fault diagnosis precision again.
The invention provides an equipment fault diagnosis method based on multi-dimensional segment fitting, which is a brand-new fault diagnosis technical method for setting out the essence of data morphological features. The method mainly comprises two processes of establishing a model and operating the model.
Fig. 1 (left) is a flow chart of modeling of the present invention, and the whole modeling process mainly includes the following steps:
step 1: and acquiring fault sample information from a system database.
The obtained fault sample information comes from a device state detection system of a power plant, and the rough operation process is as follows: firstly, selecting a piece of research-available equipment with the number P (P > -2) of fault types and the occurrence frequency T (T > -2) of each fault meeting the requirement, selecting an observation point N (N > -10) with the number of feet, and searching the fault record of the historical operation state data of the equipment for a long enough time; then, useful information such as fault related measuring point information, starting and stopping time of a fault process, fault maintenance measure records and the like are picked out from the fault records by using a set screening rule; and finally, reading fault sample data from the power plant real-time database PI according to the information.
For example, for a fault sample data with n measurement points and m time points, all measurement point data at time j can be regarded as an n-dimensional column vector, which is expressed as:
u(tj)=[uj1,uj2,uj3,...,ujn]
the sample data file is stored in a matrix form of m × n, and the specific form is as follows:
according to the method, the storage form of all fault samples is as follows: the dimension of row represents m failure times, the dimension of column represents n device observation points, and the two values of the dimension (m, n) of the row and the column between each failure sample may not be the same. While each fault sample is given its fault type identification ID. The method for determining the fault type identifier ID comprises the following steps: if all samples contain X faults, the numerical range of the fault type identifier ID is as follows: 1 to X.
Step 2: sequentially carrying out multi-dimensional piecewise linear fitting on each fault sample
The step is a key technical part of the fault diagnosis invention, and is mainly used for carrying out self-adaptive state segmentation according to the overall development trend of a fault sample and carrying out data preprocessing preparation for subsequent fault feature extraction.
Step 2.1 mean filtering operation:
in order to effectively segment fault sample data, noise pollution doped in the sample data is filtered and eliminated. To quickly and efficiently implement the filtering of noisy data, a typical linear filtering algorithm is chosen: and (5) average filtering.
According to the principle of mean filtering, the specific filtering operation process is as follows: firstly, extracting the first observation point column vector data u of the fault sample data F-1The form is as follows:
will column vector u-1Filter template size viewed as a sequence of numbers(round is a rounding function) if L is an even number, then 1 more is needed. After the size of the filtering template is determined, the substantive filtering operation is started:
1. the center position of the filtering template is placed in u11Where the sum of all data within the template is divided by L, the value is u11The output value of the filtering operation of (2) is:
2. the center position of the filtering template is moved forward continuously, and filtering output is carried out according to the same operation until all data are completely filtered, for example, the filtering operation is carried out at u1iThe output of the position, mean filtering is:
3. finally obtained column vector data u-1The filtered data of (2) is also a column of vectors, and the specific form is as follows:
all measuring points of fault sample data are sequentially subjected to filtering operation according to the 3 steps, so that the fault sample data F is converted into F*In the form of:
step 2.2 Fault sample segmentation initialization
Carrying out segmentation initialization on the filtered fault sample F, dividing each 2 pieces of data into 1 data segment, and dividing the fault sample into 1 data segment because the length of the fault sample is mThe most detailed data segment; if m is odd, the last segment is composed of 3 pieces of data, and the final segment is divided intoThe most detailed data segment. The initialization effect of the segmentation is as follows:
step 2.3 initialization data segment pairwise combination calculation fitting error
The main purpose of this step is to generate the above step after initialization segmentation(or) And combining each data segment with the data segment adjacent to the right of the data segment, and simultaneously calculating the generated fitting error.
The detailed merge operation is as follows: such as merge fi *Andcomprises the following steps:in the form of a matrix, e.g. 4 xn, with columns being the number of test points. Data for each columnAnd (5) fitting a straight line by a least square method. The least square method comprises the following specific steps:
the fitting objective was to fit the dataUsing a first order polynomial p (x) ═ a0+a1x is expressed such that the cumulative sum of fitting errors I is (a)0+a1x-yi)2And minimum.
This is apparently a solution of I ═ I (a)0,a1) And (4) quadratic programming of extreme values. According to the Lagrange principle to obtain
wherein m is 4n is 1
The above formula is a0,a1The linear equation system is expressed by matrix as
The above formula becomes a normal equation system, and the coefficient matrix is a symmetric positive definite matrix, so that a unique solution exists. Solving for a from the above formulak(k is 0,1), thus obtaining a polynomial:
p(x)=a0+a1x
the mean error due to the fit is called the fit error, which is recorded as:
after all measuring points are fitted, the fitting errors generated are summed(N is the number of stations).
When all are(or) After the data segments are combined pairwise, three results are generated, namely, a data segment set is dividedSecond, fitting error vectorThird, the sample integral segmentation position vector
Step 2.4 determining the segmentation cut points of the fault samples
The main purpose of the step is to realize the determination of the fault sample segmentation position in a recursive combination mode.
1. According to the invention, a numerical value MAX _ ERROR is set according to experience as an index for judging the fitting ERROR. If the error vector is fittedIf all values in the set are greater than the maximum fitting ERROR threshold MAX _ ERROR, then operation 1 is performed: confirmationIs the final set of sample segmentation locations; otherwise, operation 2 is performed: combining fitting error vectorsThe middle minimum value position J and the segmentation data segment of the right adjacent position J +1 thereof, and updating the segmentation data segment set F*Fitting error vectorAnd sample integral segmentation position vector
The specific updating operation is as follows: f*The length is reduced by 1, the front J-1 is kept unchanged, the value after J is shifted forward by 1 bit, and F is located at the position of JJ'=[FJ;FJ+1](ii) a According to updated F*Determining a new segmentation position vector The length is reduced by 1, the front J-1 is kept unchanged, the value behind J is shifted forward by 1 bit, and the position of J ismerge is a fitting error function generated and designed according to a least square method;
2. judging the magnitude of the updated fitting ERROR vector value and the maximum fitting ERROR threshold MAX _ ERROR again, if all the fitting ERROR vector values are larger than the threshold MAX _ ERROR, executing operation 1, and otherwise, skipping out of the cycle operation and stopping;
3. circularly combining the segmentation data segments of the minimum fitting error position and the right adjacent position thereof until the final fitting error vectorAll above the MAX ERROR value, the recursive merge loop stops.
After the above steps, after all the segmented data segments are completed by means of recursive combination, three results are generated, one is a set of segmented data segmentsSecond, fitting error vectorThird, the sample integral segmentation position vector(where x is the number of final segmentations of the sample data). Therefore, the target of adaptively segmenting the state of the fault sample is realized, and preparation is made for next fault feature extraction.
And step 3: performing feature extraction on the fault sample segmentation data segment;
the method realizes the extraction of mathematical characteristics such as slope, time proportion, mean value, variance and the like for each segmented data segment of the current fault sample.
For example, the failure sample F*Is divided intoThe x divided data segments. From f1 *The data segment begins feature extraction.
The specific operation of feature extraction is as follows:
due to f1 *The data segment isIn matrix form, the row is the number of time points and the column is the number of test points.
Feature extraction is carried out according to the dimension (column) of the measuring points, and the main features are as follows: slope k, duration l, mean m, variance v.
the vector is linearly fitted according to the principle of least square method, and the fitting result is a linear equation p (x) a0i+a1ix. So that the slope characteristic k is a1i;
The variance characteristic v isSum of fluctuation amplitudes of all values of the vector from the mean, i.e.
The rest ofAnd (4) extracting the characteristics according to the operation method, and converting the original time domain data matrix into a mathematical characteristic matrix.
and 4, step 4: carrying out fault characteristic conversion and eliminating characteristic dimension;
1 slope k translates to inclination angle α: since k has a value range of [ - ∞, + ∞ [ - ]]In order to prevent two similar fault samples from generating an excessive difference of slope values due to the existence of fitting errors and finally causing misjudgment, the slope values are converted into angle values through an arc tangent function so as to eliminate the characteristic dimension. The slope characteristic conversion formula is as follows: alpha is alphaij=arctan(kij) Where i ∈ [1, x ]];j∈[1,n;]
Duration l translates into time ratio p: under the condition that the lengths of the fault samples are different, the segmentation duration cannot represent the length of a real and effective segmentation stage, and misjudgment is easy to generate, so that the segmentation duration is divided by the whole length of the original fault sample to eliminate the dimension of the duration characteristics. The time length characteristic conversion formula is as follows:where i ∈ [1, x ]];
3 mean feature m normalization: under the condition that the segmentation numbers of the fault samples are different, the average value cannot represent the real and effective numerical characteristics of the segmentation stage, and misjudgment is easy to generate, so the total numerical quantity of the original fault samples should be divided to eliminate the dimension of the average value characteristics. The time length characteristic conversion formula is as follows:where i ∈ [1, x ]];j∈[1,n];
and 5: calculating a mode distance threshold;
the purpose of this step is to find the mode distance between every two of all the samples obtained in step 1 and calculate the distance threshold for each fault category.
Through the four steps, all fault sample data { F1,F2,...,FsAll the data are converted into a feature matrix form set { fm } similar to the formula (1)1,fm2,...,fms}. And then calculating the mode distance of two different samples in the same type of fault samples by using a Dynamic Time Warping (DTW) method, and determining the mode distance threshold of each type of fault according to the calculation result.
The specific operation of calculating the mode distance by the Dynamic Time Warping (DTW) method is described below:
1 there are two sample feature matrices I under the assumption of failure type IAAnd IBThe specific form is as follows:
wherein, ai(i=1,2,...,x),bj(j ═ 1, 2.. times, x') are two sample feature matrices I, respectivelyAAnd IBThe row vector of (a) represents the mathematical characterization of each segmented data segment of the respective fault sample in all the measurement point dimensions.
2 two samples S under the same fault typeAAnd SBCharacteristic matrix I ofA、IBAccording to the dynamic time warping principle, and the optimal path is determined by combining the mode distance I with the mode distance I shown in the figure 2AAnd IBThe row vectors are copied and stretched to meet the condition that the length of the characteristic matrix columns of the two samples are aligned equally IAAnd IBThe mode distance calculation result of (2) is minimum. SAAnd SBThe formula for calculating the mode distance is as follows:
wherein, T [ i: - ] represents a multivariate sequence formed by the ith row vector to the last row vector of the characteristic matrix T;
MDbase(ai,bj) The Euclidean distance from the segmented data segment is represented, and the specific calculation method is as follows:
wherein eud (X, Y) represents the Euclidean distance between two vectors X, Y, i.e. the Euclidean distanceThe four variables of lambda, rho and omega represent the weight of four characteristics of an inclination angle alpha, a time proportion p, a mean value proportion T and a variance v in the calculation mode distanceThe value is obtained. The sum of the four variables of lambda, rho and omega is 1, and all the variables are set to be 0.25 in the invention
According to the DTW calculation method described above, two samples S can be obtainedAAnd SBHas a mode distance of
If there are d fault samples in the fault type I, the d fault samples are obtained according to the permutation and combination mode and the calculation modeIn totalIndividual mode distance results. And a pattern distance threshold Th of fault type I1Should be a vectorMaximum value of
According to the above method, the mode distance threshold of all fault types can be calculated, and a mode distance threshold vector is formed:
Thsets=[Th1,Th2,...,Thx]
step 6: integrating useful information to generate a failure mode knowledge base;
the useful information in the fault knowledge base should include the following parts:
1 feature matrix set FM ═ { FM ] extracted from all fault samples1,fm2,...,fms} (s is the number of failed samples);
2 measuring point name set PT ═ { PT associated with each fault type1,pt2,...,ptx} (x is the number of categories of the type of failure);
3 pattern distance threshold vector Thsets corresponding to each fault type [ Th ]1,Th2,...,Thx](x is thereforeNumber of categories of barrier types).
In order to correlate the two pieces of information (fault signature matrix and pattern distance threshold) with each other, the invention stores the fault information in a fault knowledge base in the following manner:
fig. 1 (right) is a flow chart of the model operation of the present invention, and the whole operation process mainly includes the following 6 steps:
step 1: acquiring abnormal sample information from a real-time database;
according to the condition early warning system of the power plant, the condition of the equipment is found to be in an unknown abnormal condition, and the following relevant operations are carried out: 1 determining the alarm generation time t of the equipment from the early warning system1And a cut-off time t2;
2, determining a relevant observation point wp ═ x of the equipment alarm from the early warning system1,x2,...,xn'];
According to the generation time t1And a cut-off time t2And database sampling frequency fs, the number of available time points m ═ fs × (t)2-t1) The number of device measurement points n equals length ([ x ])1,x2,...,xn']) (length calculation length function).
Thus, the obtained abnormal sample data is sample data with n 'measuring points and m time points, and all the measuring point data at the moment j can be regarded as a column vector with n' dimension, which is expressed as:
v(tj)=[vj1,vj2,vj3,...,vjn']
the sample data file is stored in a matrix form of m × n', and the specific form is as follows:
according to the method, the obtained abnormal sample storage form is as follows: the row dimension represents m failure times and the column dimension represents n' device observation points.
Step 2: performing piecewise linear fitting on the current abnormal sample;
the method mainly carries out adaptive state segmentation according to the overall development trend of the abnormal sample, and carries out data preprocessing preparation for subsequent fault feature extraction. The method is operated according to piecewise linear fitting of a model training stage:
2.1 mean value Filter operation
Setting an abnormal sampleThe length of the sample is 0.1 times as the length of the filtering template, and the noise pollution doped in each measuring point in the abnormal sample is filtered and eliminated according to the mean filtering principle. The specific operation is seen in the model training phase step 2.1.
All the measuring points of the abnormal sample data finish filtering operation according to corresponding steps in sequence, so that the abnormal sample dataIs converted intoThe form is as follows:
2.2 Exception sample segmentation initialization
For filtered abnormal samplesSegment initialization is carried out, every 2 pieces of data are divided into 1 data segment, and the abnormal sample data is divided into 1 data segment due to the fact that the length of the abnormal sample data is mThe most detailed data segment; if m is odd, thenThe latter section consists of 3 pieces of data, and is finally divided intoThe most detailed data segment. The initialization effect of the segmentation is as follows:
2.3 computing fitting error by combining two initialized data segments
Generated after initializing the step 2.2 abnormal sample segment(or) And each data segment and the right adjacent data segment are fitted according to the least square principle to realize combination, and the fitting error sum generated by all the measuring points is calculated.
When all are(or) After the data segments are combined pairwise, three results are generated, 1, the data segment set is divided2 fitting error vector3-sample global segmentation position vector
2.4 recursive merging to determine outlier sample segmentation cut points
This step specifically operates the reference model construction phase, divided aboveContinuously merging two adjacent data segments generating the minimum fitting error by the recursive merging mode of each data segment to generate 3 updated results, wherein 1 is a set of divided data segmentsSecond, fitting error vectorThird, the sample integral segmentation position vector(where x is the number of final segmentations of the sample data). The goal of adaptive state segmentation of the anomaly samples is achieved,and extracting the operation object for the next step of abnormal features.
And step 3: carrying out abnormal feature extraction and transformation on the data subjected to segmentation fitting;
the method realizes extraction of 4 mathematical characteristics of slope, time span, mean value and variance of each data segment after the current abnormal sample is segmented.
For example, abnormal samplesIs divided intoThe x divided data segments. The specific operation of feature extraction may refer to step 3 of the model construction phase.
Each of the divided data segmentsExtracting characteristics such as corresponding slope k, time span l, mean value m, variance v and the like according to the dimension of the measuring point, wherein the result after extraction is as follows:
when all the segmentation data segments of the abnormal sample are subjected to feature extraction according to the method, the segmentation data segment vector with the length of xCan be converted into the following characteristic matrix form:
in order to prevent the abnormal samples and similar fault samples from causing final misjudgment when calculating the pattern distance due to the following reasons:
1, the existence of fitting errors causes overlarge difference of slope values, and the solution is to convert the slope value k into an angle value alpha through an arc tangent function so as to eliminate the characteristic dimension;
2, under the condition that the lengths of the fault samples are different, the segmentation time length l cannot represent the proportion of a real and effective segmentation stage, misjudgment is easy to generate, and the segmentation time length l is divided by the whole length m of the original fault sample to eliminate the dimension of time length characteristics;
3 under the condition that the number of the divided fault samples is different, the mean value cannot represent the real and effective numerical characteristics of the divided stages, and misjudgment is easy to occur, so the total numerical quantity of the original fault samples should be divided to eliminate the dimension of the mean value characteristics.
The specific operation method for eliminating the dimension can be seen in the characteristic transformation formula of step 4 of the model construction stage. Thus transformed, the features are transformed into the following dimensionless form:
abnormal sampleConversion into the above feature matrixThen, the step of calculating the distance of the sample mode of the fault knowledge base can be carried out.
And 4, step 4: sequentially calculating pattern distances for sample features in a fault pattern knowledge base by using the abnormal feature information;
firstly, extracting the stored information faultMessageSets in the fault knowledge base, wherein the specific mode is as follows:
then, a feature matrix FT and a measuring point information vector wp of the abnormal sample and first-type fault information V1={fm11,fm12,...,fm1,c1,pt1,Th1And fourthly, performing fault identification work. The specific operation is as follows:
a comparison measuring point information vector wp and a fault type fault measuring point information vector pt1The repetition rate f is calculated by the following formula:
sequentially carrying out pattern distance numerical dist on the feature matrix FT of the two abnormal samples and each feature matrix under the fault type 1 according to the following method1iAnd (3) calculating:
the abnormal sample may be a complete fault sample of a certain type or a partial fault sample in a development process of a certain type. For accurately calculating dist1iNumerical value, this step applies the telescopic windowThe sliding search technology is used for accurately positioning the fault part, so that the mode recognition function of the abnormal sample is realized, and the development stage information of the abnormal sample is captured.
2.1 first, all possible values of the shrink window WT are determined. The following contents are set in the invention:
1, all fault samples in a fault knowledge base are samples of a complete process;
2, the expansion ratio of the expansion window is xi-20%;
the value range of WT may be defined as round (length (ft)) 80% to round (length (ft)) 120%, and WT ═ n is set1,n2,...,nx];
Wherein round is a rounding function; length is a function of calculating the number of sample segments, i.e., the number of rows of the feature matrix.
2.2 then from the telescopic window n1Initially, at each sample feature matrix fm for fault type 11iAnd the mode distance is calculated by sliding.
If fm1iIf the length of the window is m, the sliding frequency of the telescopic window is m, and the range of pattern recognition on the fault sample is as follows:
[1,n1]~[m-n1+1,m]。
abnormal samples from [1, n ] of faulty samples1]The position starts to calculate the mode distance, each sliding step L is set to 1, and FT and fm are calculated1i[1:n1]The mode distance formula of (1):
dist11=κ*MDDTW(FT,fm1i[1:n1]
wherein, if the repetition rate η is equal to 0, then κ is 2; if the repetition rate eta is not equal to 0MDDTW(FT,fm1i[1,n1]) To find FT and fm1i[1,n1]DTW distance function of (d).
The rest is [2, n ]2]~[m-n1+1,m]The ranges of (A) are all calculated according to the above 2.1-2.2 steps, and finallyObtaining a mode distance vectorAt window size n1In the case of (2), the local best mode distance is dist11=min(distVector1) The local best matching position is:
L11=[nmin-n1+1,nmin]。
2.3 last other expansion windows x-1 value cases n2~nxCalculating the local optimal mode distance dist under each window value1iAnd local best matching position L1i=[nx-ni+1,nx]。
2.4 after all window values are matched, the following two important results are generated:
Selecting mode distance minimum from local mode distance vectorAnd corresponding best matching positionFeature matrices FT and fm as abnormal samples and failure samples1iThe pattern recognition end result of (1).
After the three abnormal sample feature matrixes FT are matched with all feature matrixes of other fault samples one by one, the pattern distance vector is generatedAnd a position vector
Remaining x-1 type faults ViSolving the pattern distance vector according to the mode and the abnormal samples FT and wp, and finally obtaining a pattern distance setAnd a set of position vectors
And 5: converting the mode distance into mode similarity by using a falling-ridge distribution method;
the pattern distance obtained in the above step is the distance obtained by applying the DTW algorithm, which is essentially the euclidean distance, and the euclidean distance is related to the time length of the samples, and the greater the time length is, the euclidean distance is correspondingly increased, so that the similarity between the two samples cannot be visually judged, and therefore, the final fault diagnosis result can be obtained only after the pattern distance obtained in the above step is converted into the pattern similarity by adopting the ridge-descending distribution method in the present step.
The mode distance set composition component obtained in step 4 is abc mode distancePattern distance is converted to similarity according to the principle of the falling-ridge stepwise method according to the following formula
Wherein,represents the pattern distance calculated by pattern recognition of the abnormal sample and the jth fault sample under the fault type i, and ThiRepresentsThe mode distance threshold for fault type i.
Thus, each distance vector is calculated according to the dimension of the fault typeConverted into similarity vectors
Finally, the similarity set of the obtained abnormal sample FT and all samples of the fault knowledge base is as follows:
step 6: and outputting a final fault diagnosis result.
According to the similarity set result rho Sets obtained in the step 5, the most effective and reliable fault diagnosis result is output according to the agreed rule:
rule 1: if a plurality of fault samples with the similarity exceeding 90% with the abnormal sample exist, outputting the serial numbers of all the samples, simultaneously outputting the type vector of the fault sample, the confidence level of the fault type, the best matching position Location of the sample with the maximum similarity in the fault development stage, and outputting the recommended maintenance action if the confidence level exceeds 50%;
rule 2: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is greater than or equal to 60%, outputting the serial number of the maximum similarity sample, and simultaneously outputting the type of the fault to which the fault sample belongs, the confidence level of the fault type to which the fault sample belongs, and the best matching position Location of the maximum similarity sample in the fault development stage; if the confidence level exceeds 50%, recommending action of the adopted maintenance measure;
rule 3: if no fault sample with the similarity exceeding 90% with the abnormal sample exists and the maximum similarity value is less than 60%, outputting the abnormal sample as an unusual working condition or an unknown fault type, and recommending to continuously pay attention to the development of the abnormality;
wherein the typeVector value satisfies the above ruleThe value of i in (1); the confidence level value of a certain fault type is equal to the number of the fault types meeting the rule divided by the total number of the fault samples of the fault type; the maintenance measure repairAction can be obtained by inquiring the operation rule of the equipment according to the fault type.
The steam pump preposition pump of a #2 unit of a certain thermal power plant in the north is taken as a state monitoring object, the steam pump preposition pump is an important component of a steam feed water pump, the function is important, and the performance safety of boiler equipment is maintained. The equipment belongs to exposed facilities, has more observation parts and is easy to cause various faults, and the characteristic is suitable for the equipment fault diagnosis method designed by the invention. The implementation process of the present invention is further illustrated by the detailed description of the embodiment.
The implementation steps of the embodiment of the invention for the fault diagnosis of the pre-pump equipment of the steam pump of a certain power plant are as follows:
fault diagnosis knowledge base construction process of steam pump pre-pump equipment
Step 1: acquiring failure sample information of a front pump of the steam pump from a power plant PI database;
selecting 21 key parameters related to the operation of a front pump of the air pump, wherein the key parameters comprise actual power (MW), radial bearing temperature (DEG C) of a motor transmission end, thrust bearing temperature (DEG C), motor current (A) and the like, and therefore each piece of observation data of the equipment is 21-dimensional row vector:
u(tj)=[uj1,uj2,uj3,...,uj21]
searching fault records of the equipment from the 10 months in 2013 to the 10 months in 2015, determining that the fault type P is greater than or equal to 2 and the fault occurrence frequency T is greater than or equal to 2 according to a screening rule, and extracting the following information from the fault records of the equipment according to the screening rule: in the period, 3 faults (radial bush temperature sudden rise of a motor transmission end, thrust bush temperature sudden drop and motor current sudden change) occur totally, the number of the faults is 86 (radial bush temperature sudden rise of the motor transmission end is 25, thrust bush temperature sudden drop is 28 and motor current sudden change is 33), fault related measuring points are different (the radial bush temperature sudden rise of the motor transmission end is from measuring point 6 to measuring point 14, the thrust bush temperature sudden drop is from measuring point 15 to measuring point 17, the motor current sudden change is from measuring point 1 to measuring point 6 and from measuring point 18 to measuring point 21), and the fault duration time range is from 2 hours to 14 hours. (ii) a And finally, reading all fault sample data from the power plant real-time database PI according to the information.
The finally obtained fault samples comprise three parts of fault type number information (the radial tile temperature sudden rise number of a motor transmission end is 1, the thrust tile temperature sudden fall number is 2, and the motor current sudden change number is 3), fault sample data and fault type related measuring point information, wherein the sequence of each type of fault samples is sequenced according to the sequence of fault occurrence time:
wherein,iFjthe specific form of (1) is a matrix form, the row m represents the sample length, and the column represents the measuring point. As follows:
step 2: performing piecewise linear fitting on each Fault sample in Fault Sets;
from the sample1F1Starting until the sample3F33And each fault sample adopts a piecewise linear fitting technology, so that the self-adaptive state segmentation is realized according to the overall development trend of the fault sample, and data preprocessing preparation is carried out for subsequent fault feature extraction.
First, a mean filtering operation is performed. Sequentially extractingiFjPer station column vector u-iDoping according to the operation steps of the mean filtering principle in the specific embodimentFiltering and eliminating noise pollution in sample data to obtain filtered dataiFj *。
After the filtering operation, the maximum fitting ERROR threshold MAX _ ERROR is empirically set to 0.2. The bottom-up state partitioning is implemented in a recursive merge manner. The following three results were obtained: 1 partitioning a set of data segments2 fitting error vector3-sample global segmentation position vector
And step 3: extracting the characteristics of each section of data of each segmented fault sample;
because the sample data time length is in the hour level, in order to facilitate the rapid and effective operation of the fault diagnosis, the mathematical characteristics such as the slope k, the time length l, the mean value m, the variance v and the like are extracted from each section of data of the fault sample to carry out data compression and the transformation of the feature space.
After feature extractioniFjEach piece of data fi *All converted into four-dimensional elements (k, l, m, v). The specific feature extraction formula can be seen in step 3 of the modeling stage of the specific embodiment. Thus, each fault sample realizes the process of converting the sample from the time domain space to the feature space:
and 4, step 4: carrying out fault characteristic conversion and eliminating characteristic dimension;
in order to prevent misjudgment of different fault samples due to dimension reasons when the mode distance is calculated, three characteristics of the slope, the duration and the mean value need to be converted into a slope angle, a time proportion and a mean value proportion. The specific operation method for eliminating the dimension can be seen in a characteristic transformation formula in the step 4 of the modeling stage. Through transformation, the characteristics are transformed into the following dimensionless forms:
so far, the conversion from the time domain space to the feature space is realized by all samples, and the conversion of the Fault sample set Fault Sets into the Fault sample feature set is as follows:
and 5: calculating a mode distance threshold value under each fault type;
through the 4 steps, all fault sample data are converted into a characteristic matrix form set form. In the step, a Dynamic Time Warping (DTW) method is used for calculating the mode distance of two different samples in the same type of fault samples, and the mode distance threshold of each type of fault is determined according to the calculation result.
For example, there are 25 sample feature matrices in the type 1 fault in the fault sample, and every two feature matrices calculate the DTW distance to obtain a vector DtwVector1=[Md1,Md2,...,Md25]. Pattern distance threshold Th of fault type 11=min(DtwVector1). The pattern distance threshold Th for fault types 2, 3 is obtained in the same way2、Th3。
Step 6: generating a steam pump gas quality pump fault mode knowledge base;
summarizing the information obtained in the steps, the steam pump pre-pump fault knowledge base should include the following 3 parts of information:
1. and extracting a feature matrix set FaultFeatureSets from all fault samples.
2. Relevant measuring point name set PT ═ { PT) of three types of faults1,pt2,pt3};
3. Pattern distance threshold values Thsets ═ Th corresponding to three types of faults1,Th2,...,Thx]。
And integrating the above 3 parts of information to generate a complete steam pump preposed pump fault knowledge base in the following form:
fault on-line diagnosis process of steam pump pre-pump equipment
The plant-level monitoring system of the power plant finds that unknown abnormality occurs at a certain measuring point of the steam pump pre-pump equipment when 03 is 12 months and 15 days in 2014. In order to better help the professional personnel to carry out inspection and maintenance, the power plant personnel retrieve abnormal sample data from the real-time database according to the starting and ending time of the known abnormality and the related alarm measuring point information ptAnd the fault diagnosis method of the invention is used for carrying out advanced prejudgment.
Abnormal sample dataAfter data processing operations such as piecewise linear fitting, abnormal feature extraction and feature dimension elimination, calculating to obtain abnormal sample dataCorresponding feature matrixAnd (4) matrix. The above operations can be referred to as corresponding steps of the model operation phase of the embodiment.
Abnormal sample informationAnd pt is used for matching the sample information in the fault mode knowledge base one by one, and converting the mode distance into corresponding fault similarity.
The abnormal samples are matched with all fault samples in the knowledge base, and the DTW distance is calculated by using a sliding telescopic time window method to serve as the final mode distance. The set of available distances is in the form:
and (4) mapping the distances between the [0,1] numerical values by operating a falling-ridge distribution method, and converting the mode distances into similarity according to a unified standard. Embodiments A falling ridge distribution method of the model run phase is used herein. The available similarity sets are in the form:
outputting a judgment result of fault diagnosis according to an agreed fault diagnosis rule: the abnormity is pre-determined to be the radial tile temperature sudden-rise fault of the motor transmission end, and the determination reason is as follows: 15 fault samples with similarity of more than 90% with the abnormal sample in the front-mounted pump of the steam pump are (Etc.), the confidence coefficient of the diagnosis fault type 1 is 60% and exceeds the confidence coefficient threshold value, and the fault sample with the maximum similarity is that the number of the segmentation segments is 7The optimal matching positions are 1-4 segments, and the actual fault diagnosis effect diagram is shown in figure 4. The integral trends of the two samples are matched and consistent from the beginning to the middle position, so that the fault development stage is determined as the middle stage of fault development, and the query operation regulation suggests to take maintenance measures as a method for forcibly cooling the transmission end of the cold water impact motor
The fact shows that the diagnosis result after the professional staff arrive in time is really the radial tile temperature sudden-rise fault of the motor transmission end, and the operation of the front-mounted pump equipment of the steam pump is recovered to be normal after the front-mounted pump of the steam pump is forced to be cooled to a reasonable temperature range by adopting a cold water cooling method.
Although exemplary embodiments of the present invention have been described for illustrative purposes, those skilled in the art will appreciate that various modifications, additions, substitutions and the like can be made in form and detail without departing from the scope and spirit of the invention as disclosed in the accompanying claims, all of which are intended to fall within the scope of the claims, and that various steps in the various sections and methods of the claimed product can be combined together in any combination. Therefore, the description of the embodiments disclosed in the present invention is not intended to limit the scope of the present invention, but to describe the present invention. Accordingly, the scope of the present invention is not limited by the above embodiments, but is defined by the claims or their equivalents.
Claims (1)
1. A device fault diagnosis method based on multi-dimensional piecewise fitting is characterized by sequentially comprising the following steps: step 1: the fault diagnosis training step specifically comprises the following steps:
(1.1) acquiring fault sample information from a database;
(1.2) sequentially carrying out piecewise linear fitting on each fault sample;
(1.3) extracting the characteristics of each section of data of the fault sample to obtain a characteristic matrix of the fault sample;
(1.4) carrying out fault characteristic conversion, eliminating characteristic dimension and obtaining a characteristic matrix of a fault sample with the dimension eliminated;
(1.5) calculating a mode distance threshold;
(1.6) associating the fault feature matrix and the pattern distance threshold value with each other, and storing to generate a fault pattern knowledge base;
step 2: the fault diagnosis operation steps specifically comprise:
(2.1) acquiring abnormal sample information from a real-time database;
(2.2) performing piecewise linear fitting on the current abnormal sample;
(2.3) carrying out abnormal feature extraction and transformation on the data subjected to segmentation fitting;
(2.4) sequentially calculating the pattern distance of the sample features in the fault pattern knowledge base by using the abnormal feature information;
(2.5) converting the mode distance into a mode similarity using a falling ridge pacing method;
(2.6) outputting a final fault diagnosis result;
wherein, the step (1.1) comprises the following steps: selecting a piece of research equipment with the number P of fault types being more than or equal to 2 and the occurrence frequency T of each fault being more than or equal to 2 to meet the requirements, selecting the number of observation points as N, wherein N is more than or equal to 10, carrying out fault record searching on historical running state data of the equipment for a certain time, picking out fault related measurement point information, starting and stopping time of a fault process and useful information recorded by fault maintenance measures from fault records by utilizing a set screening rule, and reading fault sample data from a power plant real-time database PI according to the useful information, wherein:
the fault sample data with n measuring points and m time points, all the measuring point data at the moment j are regarded as a n-dimensional column vector and are expressed as follows:
u(tj)=[uj1,uj2,uj3,...,ujn]
the sample data file is stored in a matrix form of m × n, and the specific form is as follows:
the row represents m fault time, the column represents n equipment observation points, the m and n values of the row and the column between each fault sample are different, each fault sample is endowed with a fault type identifier ID, and the fault type identifier ID determination method is that if all the samples contain X types of faults, the numerical range of the fault type identifier ID is as follows: 1-X;
wherein, the step (1.2) comprises the following steps:
(1.2.1) mean filtering operation: filtering and eliminating noise pollution doped in sample data;
(1.2.2) carrying out fault sample segmentation initialization, namely carrying out segmentation initialization on the fault sample after filtering processing;
(1.2.3) combining every two initialized data segments to calculate a fitting error;
(1.2.4) determining a segmentation cutting point of a fault sample, and performing adaptive state segmentation on the fault sample;
wherein, the step (1.3) comprises the following steps:
dividing the fault sample intoThe x divided data segments from f1 *The data segment starts to carry out feature extraction, and the specific operations are as follows:
due to f1 *The data segment isIn the form of matrix, the row is time point number and is listed as measuring point number;
and (3) extracting features according to the dimensions of the measuring points, wherein the features comprise: slope k, duration l, mean m and variance v;
the vector is linearly fitted according to the principle of least square method, and the fitting result is a linear equation p (x) a0i+a1ix, so slope characteristic k ═ a1iWherein a is0i、a1iIs a fitting constant;
The variance characteristic v isSum of fluctuation amplitudes of all values of the vector from the mean, i.e.N is the total number of vectors;
The rest ofExtracting characteristics according to the operation method, converting the original time domain data matrix into a mathematical characteristic matrix, and finally dividing the data segment setConversion to:
wherein, the step (2.1) comprises the following steps:
according to the equipment state early warning system of the power plant, discovering that certain unknown abnormal state occurs in the equipment, and carrying out the following related operations:
(2.1.1) determining the device alarm generation time t from the early warning system1And a cut-off time t2;
(21.2) determining from the early warning system the relevant observation point wp ═ x for the device alarm1,x2,...,xn'];
(2.1.3) according to the generation time t1And a cut-off time t2And a database sampling frequency fs, and obtaining the time point number m ═ fs × (t)2-t1) The number of device measurement points n equals length ([ x ])1,x2,...,xn']) Where length () is the calculated length function;
(2.1.4) acquiring abnormal sample data, namely the sample data with n 'of measuring points and m of time points, wherein all the measuring point data at the moment j are regarded as a column vector with n' dimension and are expressed as:
v(tj)=[vj1,vj2,vj3,...,vjn']
storing the sample data file in a matrix form of m multiplied by n', wherein the specific form is as follows:
wherein the rows represent m failure times and the columns represent n' device observation points;
wherein, the step (2.2) comprises the following specific steps:
(2.2.1) mean filtering operation: filtering and eliminating noise pollution of each measuring point doped in abnormal sample data according to a mean filtering principle;
(2.2.2) carrying out segmentation initialization on the abnormal sample after filtering treatment;
(2.2.3) combining every two initialized data segments to calculate a fitting error;
and (2.2.4) determining an abnormal sample segmentation cut point based on a recursive combination mode.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511004841.0A CN105631596B (en) | 2015-12-29 | 2015-12-29 | Equipment fault diagnosis method based on multi-dimensional piecewise fitting |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511004841.0A CN105631596B (en) | 2015-12-29 | 2015-12-29 | Equipment fault diagnosis method based on multi-dimensional piecewise fitting |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105631596A CN105631596A (en) | 2016-06-01 |
CN105631596B true CN105631596B (en) | 2020-12-29 |
Family
ID=56046501
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201511004841.0A Active CN105631596B (en) | 2015-12-29 | 2015-12-29 | Equipment fault diagnosis method based on multi-dimensional piecewise fitting |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105631596B (en) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106096634B (en) * | 2016-06-06 | 2018-04-06 | 北京理工大学 | Fault detection method based on Adaptive windowing mental arithmetic method with interval halving algorithm |
CN106096214B (en) * | 2016-07-22 | 2019-08-27 | 杨志强 | A kind of switchgear Fault Fuzzy Diagnostic method based on indirect thermometric mode |
CN106789243B (en) * | 2016-12-22 | 2019-06-14 | 烟台东方纵横科技股份有限公司 | A kind of IT operational system with intelligent trouble analytic function |
CN106697291A (en) * | 2016-12-22 | 2017-05-24 | 深圳万智联合科技有限公司 | Intelligent unmanned plane pesticide spraying device |
CN110226140B (en) * | 2017-01-25 | 2023-04-11 | Ntn株式会社 | State monitoring method and state monitoring device |
JP6772963B2 (en) * | 2017-06-05 | 2020-10-21 | トヨタ自動車株式会社 | Abnormality diagnosis device and abnormality diagnosis method |
CN107748892B (en) * | 2017-09-25 | 2020-11-17 | 西安理工大学 | Human behavior data segmentation method based on Mahalanobis distance |
CN108398656B (en) * | 2018-02-27 | 2021-01-05 | 安徽建筑大学 | High voltage light emitting diode street lamp, fault diagnosis method and readable storage medium |
CN108414949B (en) * | 2018-02-27 | 2021-06-01 | 安徽建筑大学 | High-voltage light-emitting diode lamp fault diagnosis method and medium based on illumination characteristics |
CN108829755B (en) * | 2018-05-25 | 2021-04-13 | 国网山东省电力公司青岛供电公司 | Transformer fault case matching search method based on iteration closest point |
CN110646202B (en) * | 2018-06-08 | 2021-09-03 | 西门子股份公司 | Method and device for detecting wear of a bearing |
CN109242835A (en) * | 2018-08-24 | 2019-01-18 | 深圳市兆泰云智能科技有限公司 | Vehicle bottom defect inspection method, device, equipment and system based on artificial intelligence |
CN109239301B (en) * | 2018-09-12 | 2021-06-01 | 江苏科技大学 | Anchor chain flash welding quality online evaluation method |
CN109507990B (en) * | 2018-12-25 | 2021-06-15 | 中南大学 | Fault tracing method and system |
CN110187294B (en) * | 2019-04-17 | 2024-08-02 | 安徽建筑大学 | Fault diagnosis method and device for piecewise linear constant current drive LED light source |
CN110298767B (en) * | 2019-05-29 | 2020-05-12 | 山东科技大学 | Thermal power plant time sequence variable abnormity monitoring method and system |
CN112152830B (en) * | 2019-06-28 | 2023-08-04 | 中国电力科学研究院有限公司 | Intelligent fault root cause analysis method and system |
CN111222774B (en) * | 2019-12-30 | 2020-08-18 | 广州博士信息技术研究院有限公司 | Enterprise data analysis method and device and server |
CN112257754B (en) * | 2020-09-24 | 2023-07-28 | 北京航天测控技术有限公司 | Method and device for analyzing running state of spacecraft |
CN112492567B (en) * | 2020-11-19 | 2022-03-18 | 江苏恒宝智能系统技术有限公司 | Fault analysis and solution method and device in emergency command communication |
CN112487712B (en) * | 2020-11-25 | 2024-08-23 | 中国科学院力学研究所 | Intelligent identification method for buried suspended pipeline working condition parameters based on genetic algorithm and multidimensional dynamic time regularity |
CN112849648B (en) * | 2020-12-31 | 2023-04-28 | 重庆国际复合材料股份有限公司 | Intelligent tray identification system and method |
CN114973746B (en) * | 2021-02-23 | 2023-09-26 | 腾讯科技(深圳)有限公司 | Parking lot determining method, equipment and computer readable storage medium |
CN113392874B (en) * | 2021-05-13 | 2022-12-13 | 中车唐山机车车辆有限公司 | Abnormal state diagnosis method and device for rail vehicle and terminal equipment |
CN113496089B (en) * | 2021-09-07 | 2021-11-16 | 江苏宝诺铸造有限公司 | Internet of things equipment abnormal state online monitoring method and system based on artificial intelligence |
CN114626666B (en) * | 2021-12-11 | 2023-05-05 | 国网湖北省电力有限公司经济技术研究院 | Engineering site progress identification system based on full-time space monitoring |
CN114486259A (en) * | 2022-01-05 | 2022-05-13 | 电子科技大学 | Signal processing method of distributed optical fiber acoustic sensing system for optimizing variational modal decomposition |
CN114693110B (en) * | 2022-03-29 | 2023-07-04 | 杭州安脉盛智能技术有限公司 | Abnormality monitoring method, abnormality monitoring system and storage medium of energy storage system |
CN115956543A (en) * | 2022-07-15 | 2023-04-14 | 生态环境部南京环境科学研究所 | Fishing device and fishing method for wild giant salamander resource investigation |
CN116104773B (en) * | 2023-02-21 | 2024-04-12 | 重庆成峰水务工程有限责任公司 | Comprehensive detection equipment and detection method for water pump faults |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103267652A (en) * | 2013-05-24 | 2013-08-28 | 北京工业大学 | Intelligent online diagnosis method for early failures of equipment |
CN104462842A (en) * | 2014-12-22 | 2015-03-25 | 厦门大学 | Excavating diagnosis method of failure data based on bayesian network |
CN104462846A (en) * | 2014-12-22 | 2015-03-25 | 山东鲁能软件技术有限公司 | Intelligent device failure diagnosis method based on support vector machine |
CN105093040A (en) * | 2015-09-09 | 2015-11-25 | 成都比善科技开发有限公司 | Transformer station on-line monitoring system with high safety |
CN105158598A (en) * | 2015-08-15 | 2015-12-16 | 国家电网公司 | Fault prediction method suitable for power equipment |
-
2015
- 2015-12-29 CN CN201511004841.0A patent/CN105631596B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103267652A (en) * | 2013-05-24 | 2013-08-28 | 北京工业大学 | Intelligent online diagnosis method for early failures of equipment |
CN104462842A (en) * | 2014-12-22 | 2015-03-25 | 厦门大学 | Excavating diagnosis method of failure data based on bayesian network |
CN104462846A (en) * | 2014-12-22 | 2015-03-25 | 山东鲁能软件技术有限公司 | Intelligent device failure diagnosis method based on support vector machine |
CN105158598A (en) * | 2015-08-15 | 2015-12-16 | 国家电网公司 | Fault prediction method suitable for power equipment |
CN105093040A (en) * | 2015-09-09 | 2015-11-25 | 成都比善科技开发有限公司 | Transformer station on-line monitoring system with high safety |
Also Published As
Publication number | Publication date |
---|---|
CN105631596A (en) | 2016-06-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105631596B (en) | Equipment fault diagnosis method based on multi-dimensional piecewise fitting | |
CN105467975B (en) | A kind of equipment fault diagnosis method | |
CN110322048B (en) | Fault early warning method for production logistics conveying equipment | |
CN110969556B (en) | Machine learning multidimensional multi-model fusion river channel water quality anomaly detection method and device | |
CN109146246B (en) | Fault detection method based on automatic encoder and Bayesian network | |
CN111539553B (en) | Wind turbine generator fault early warning method based on SVR algorithm and off-peak degree | |
CN110895526A (en) | Method for correcting data abnormity in atmosphere monitoring system | |
CN111401573B (en) | Working condition state modeling and model correcting method | |
CN106779200A (en) | Based on the Wind turbines trend prediction method for carrying out similarity in the historical data | |
CN111275288A (en) | XGboost-based multi-dimensional data anomaly detection method and device | |
CN109086793A (en) | A kind of abnormality recognition method of wind-driven generator | |
Calvo-Bascones et al. | A collaborative network of digital twins for anomaly detection applications of complex systems. Snitch Digital Twin concept | |
Liu et al. | A two-stage deep autoencoder-based missing data imputation method for wind farm SCADA data | |
CN112032003B (en) | Method for monitoring operation performance of large wind turbine generator | |
CN110717610A (en) | Wind power prediction method based on data mining | |
CN115484102A (en) | Industrial control system-oriented anomaly detection system and method | |
CN111080074A (en) | System service security situation element obtaining method based on network multi-feature association | |
Vishnu et al. | Recurrent neural networks for online remaining useful life estimation in ion mill etching system | |
CN115828466A (en) | Fan main shaft component fault prediction method based on wide kernel convolution | |
Shi et al. | Study of wind turbine fault diagnosis and early warning based on SCADA data | |
CN117763305A (en) | Power time sequence data anomaly detection and repair method considering behavior commonality | |
Wendelberger et al. | Monitoring Deforestation Using Multivariate Bayesian Online Changepoint Detection with Outliers | |
Han et al. | Self-supervised deep clustering method for detecting abnormal data of wastewater treatment process | |
Lithoxoidou et al. | Malfunction diagnosis in industrial process systems using data mining for knowledge discovery | |
Yang et al. | Anomaly detection of dust removal system through gradient boosting decision tree algorithm |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder |
Address after: 250101 5th floor, block B, Yinhe building, 2008 Xinluo street, high tech Zone, Jinan City, Shandong Province Patentee after: Shandong luruan Digital Technology Co.,Ltd. Address before: 250101 5th floor, block B, Yinhe building, 2008 Xinluo street, high tech Zone, Jinan City, Shandong Province Patentee before: SHANDONG LUNENG SOFTWARE TECHNOLOGY Co.,Ltd. |
|
CP01 | Change in the name or title of a patent holder |