CN111401785A - Power system equipment fault early warning method based on fuzzy association rule - Google Patents
- Publication number
- CN111401785A (application CN202010274132.9A)
- Authority
- CN
- China
- Prior art keywords
- fuzzy
- data
- algorithm
- clustering
- early warning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/20—Administration of product repair or maintenance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/06—Energy or water supply
Abstract
The invention discloses a power system equipment fault early warning method based on fuzzy association rules. The method comprises the following steps: determining the optimal number of partitions of the power equipment data through hybrid iteration of K-means and information entropy, so as to achieve dynamic, adaptive boundary partitioning; introducing fuzzy sets to soften the attribute boundaries, and dividing the fuzzy intervals using fuzzy C-means; and using the Apriori algorithm, selecting an optimal pair of minimum support and minimum confidence as the main parameters of the mining algorithm, and mining association rules under these parameters to build a rule base, so as to analyze and predict the fault state of the power equipment. The method quantitatively obtains the optimal number of partitions during attribute discretization and achieves dynamic, adaptive attribute boundary division; compared with the traditional association rule method, it detects the fault state of the equipment quickly and accurately.
Description
Technical Field
The invention relates to a fault early warning method, in particular to a power system equipment fault early warning method based on a fuzzy association rule.
Background
With the continuous development of smart grids, the scale of power grids keeps expanding and the requirements on the operational safety of power systems keep rising. Condition detection and safe maintenance of power system equipment are therefore of great importance for ensuring safe operation, reducing the rate of sudden equipment failures and reducing overhaul costs.
Existing research in China and abroad on power system fault early warning takes several approaches. Some work uses the density-based DBSCAN clustering algorithm to calculate the relative proximity between sampled data and clusters of historical fault data in order to classify the data. Some work processes the data samples with three different normalization methods, feeds them to a fuzzy C-means algorithm and determines the fault type of each sample from its membership degrees. Some work clusters transformer data for different numbers of short-circuited turns, axial displacement and radial deformation, and uses the clustering results to interpret frequency response analysis for fault diagnosis. Some work uses an exponential form of the fuzzy C-means membership function to obtain a distance criterion, and uses the resulting membership matrix to partition transformer fault data. Most existing fault early warning methods for power equipment rely on a simple clustering algorithm. They cannot mine and analyze the implicit correlations among the data, cannot detect a fault trend early, and cannot win operation and maintenance personnel more time for maintenance, which leads to serious losses.
Timely and effective early warning of the fault state of power system equipment requires deep mining of the implicit associations among the data. The online monitoring data of power systems grow exponentially every day, and association rule algorithms can mine rules that are not intuitively apparent from large data sets and often yield unexpected rule combinations. For this reason they are widely applied in fields such as power system fault diagnosis, thermal power plant optimization and network security.
Disclosure of Invention
The invention mainly aims to provide a power system equipment fault early warning method based on a fuzzy association rule.
The technical solution adopted by the invention is as follows: a power system equipment fault early warning method based on fuzzy association rules, comprising the following steps:
determining the optimal number of partitions of the power equipment data through hybrid iteration of K-means and information entropy, so as to achieve dynamic, adaptive boundary partitioning;
introducing fuzzy sets to soften the attribute boundaries, and dividing the fuzzy intervals using fuzzy C-means;
using the Apriori algorithm, selecting an optimal pair of minimum support and minimum confidence as the main parameters of the mining algorithm, and mining association rules under these parameters to build a rule base, so as to analyze and predict the fault state of the power equipment.
Further, the method comprises the following:
the number and centers of the initial classes are obtained by hybrid iteration of the improved K-means and information entropy, and the final clustering result is obtained by FCM clustering;
given the sample set to be clustered, the specific steps of the power system equipment fault early warning method based on fuzzy association rules are as follows:
S2: as the number of clusters is gradually increased, for each cluster number j, calculating the cluster centers with the improved K-means algorithm, and then calculating the information entropy transition difference on the basis of the calculated data deviation;
S3: in the resulting sequence of information entropy transition differences, obtaining the number of clusters k at which the transition difference reaches its minimum, together with the cluster centers at that point;
S4: using the optimal number of clusters k and the class centers as initialization parameters to initialize the FCM algorithm;
S6: if the iteration termination condition is satisfied, stopping the clustering algorithm and outputting the membership matrix and the class centers; otherwise, going to S5 to continue the iteration;
S8: obtaining the final membership matrix and class centers, and assigning the data in the data set to the corresponding classes.
The invention has the advantages that:
(1) The optimal number of partitions for attribute discretization can be obtained quantitatively, achieving dynamic, adaptive attribute boundary division.
(2) Using the KEFCM algorithm and selecting the minimum support and minimum confidence effectively preserves the information at the data boundaries, so that rules of research value are not overlooked while mining association rules; the KEFCM algorithm also has high classification accuracy.
(3) Compared with the traditional association rule method, the method detects the fault state of the equipment quickly and accurately.
In addition to the objects, features and advantages described above, the present invention has other objects, features and advantages. The present invention will be described in further detail below with reference to the drawings.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this application, illustrate embodiments of the invention and, together with the description, serve to explain the invention and not to limit the invention.
FIG. 1 is a flow chart of a method of an embodiment of the present invention;
FIG. 2 is a graph of information entropy transition difference changes for an embodiment of the present invention;
FIG. 3 is a graph showing comparison results of three clustering algorithms according to the embodiment of the present invention;
FIG. 4 is a flow chart of obtaining the frequent 1-itemsets according to an embodiment of the present invention;
FIG. 5 is a flow chart of obtaining the frequent k-itemsets according to an embodiment of the present invention;
FIG. 6 is a graph showing the relationship between the mean matching rate and the confidence level in different support degrees according to the embodiment of the present invention;
FIG. 7 is a diagram of the relationship between the variance of the matching rate and the confidence level under different support degrees according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The invention provides a power system equipment fault early warning method based on a fuzzy association rule.
Clustering algorithm
K-means clustering:
K-means is one of the most widely used methods in cluster analysis. It is a simple iterative clustering algorithm that aims to minimize the distance between data within a class and maximize the distance between data of different classes. Consider a given data set $x = \{x_1, x_2, \dots, x_n\}$ with $x_i \in \mathbb{R}^d$, where n is the number of data and d is the dimension of the data. Each datum in x is assigned, according to its attributes, to one of k different classes. Each class has a cluster center; the center $c_j$ of the j-th class $C_j$ is represented by the mean of all data in that class:
$$c_j = \frac{1}{|C_j|} \sum_{x_i \in C_j} x_i$$
Given the cluster centers $c_1, \dots, c_k$, the Euclidean distance between a datum and a center is used as the partition criterion:
$$d_{ij} = \lVert x_i - c_j \rVert = \sqrt{\sum_{l=1}^{d} (x_{il} - c_{jl})^2}$$
Basic steps of the K-means algorithm:
S1: select k data in the data space as the initial cluster centers;
S2: for each datum in the data set x, compute the Euclidean distance d_ij to the cluster centers and assign the datum to the class of the nearest cluster center;
S3: take the mean of all data in each class as the new cluster center of that class, updating the cluster centers;
S4: check whether the cluster centers have changed; if not, output the result; otherwise, return to S2 and continue.
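As an illustration only, steps S1 to S4 can be sketched in Python with NumPy. The function name `kmeans`, the `max_iter` safeguard and the handling of empty classes are assumptions of this sketch and are not taken from the disclosure.

```python
import numpy as np

def kmeans(x, centers, max_iter=100):
    """Plain K-means; `centers` holds the k initial cluster centers (S1)."""
    x = np.asarray(x, dtype=float)
    centers = np.asarray(centers, dtype=float).copy()
    for _ in range(max_iter):
        # S2: Euclidean distance between every datum i and every center j,
        # then assignment of each datum to its nearest center
        d = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        # S3: each center becomes the mean of the data assigned to it
        new_centers = centers.copy()
        for j in range(len(centers)):
            members = x[labels == j]
            if len(members) > 0:  # assumption: an empty class keeps its old center
                new_centers[j] = members.mean(axis=0)
        # S4: stop once the centers no longer change
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels
```

The `max_iter` bound is only a guard against very slow convergence; on well-separated data the loop normally ends through the S4 check.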
The K-means algorithm yields clusters in which data objects in the same class are highly similar and data objects in different classes are dissimilar. However, the algorithm determines an initial partition from the initial cluster centers and then optimizes that partition, so the choice of initial cluster centers strongly affects the clustering result; a poor choice of initial values prevents an effective clustering result, which is the main problem of the K-means algorithm. The method by which the K-means algorithm selects cluster centers is therefore improved so that the initial cluster centers are as far from one another as possible. The cluster center initialization procedure is as follows:
S1: randomly select one point from the input data set x as the first cluster center;
S2: for each datum in x:
(2) calculate the probability p(x_i) that each datum is selected as the next cluster center;
(3) draw a random number r in [0, 1] and subtract the probability of the first datum from r; if the result is less than or equal to 0, select that datum as the next cluster center; if the result is greater than 0, continue by subtracting the probability of the next datum, repeating until the result is less than or equal to 0, and select the datum whose probability was subtracted last as the next cluster center;
S3: repeat S2 until all k cluster centers have been selected.
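The initialization procedure can be sketched as follows. The text does not reproduce how p(x_i) is computed, so this sketch assumes the common k-means++ choice, in which the probability is proportional to the squared distance from each datum to its nearest already-chosen center. The function name `init_centers` and the `rng` parameter are likewise hypothetical.

```python
import numpy as np

def init_centers(x, k, rng=None):
    """Pick k initial centers that tend to lie far apart (roulette-wheel selection)."""
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=float)
    # S1: the first center is a uniformly random datum
    centers = [x[rng.integers(len(x))]]
    while len(centers) < k:  # S3: repeat S2 until k centers exist
        # Assumed sub-step: squared distance to the nearest chosen center
        d2 = np.min([np.sum((x - c) ** 2, axis=1) for c in centers], axis=0)
        # (2) selection probability, assumed proportional to that squared distance
        p = d2 / d2.sum()
        # (3) subtract probabilities from r until the result is <= 0
        r = rng.random()
        chosen = int(np.argmax(p))  # fallback against floating-point round-off
        for i, p_i in enumerate(p):
            r -= p_i
            if p_i > 0 and r <= 0:
                chosen = i
                break
        centers.append(x[chosen])
    return np.array(centers)
```

Because a datum that is already a center has probability 0, it is never selected twice, provided the data set contains at least k distinct points.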
Information entropy:
Shannon introduced the concept of entropy into information theory in 1948 and used it to measure the amount of information contained in data. For a data set x divided into k classes, P_ij denotes the degree of deviation of the i-th sample point from the center of the j-th class; the smaller the value of P_ij, the smaller the probability that the i-th sample belongs to the j-th class and the farther the i-th sample is from the j-th class. The deviation P_ij is calculated as follows:
The overall information entropy of the classes is calculated as follows:
The difference in information entropy on passing from the (j-1)-th state to the j-th state is called the information entropy transition value, calculated as follows:
The difference between the entropy transition value from the (j-1)-th to the j-th state and that from the j-th to the (j+1)-th state is called the information entropy transition difference, calculated as follows:
As the number of clusters increases, the amount of data in each class decreases, the probability that each datum belongs to a given class increases, and the overall information entropy of the classes increases. As the number of classes grows from small to large, the partition passes from disorder to order and back to disorder: the initial disorder arises because the clustering is too coarse to reveal the overall characteristics of the data set, and the final disorder arises because the clustering is too fine and loses the overall picture of the data set. The information entropy transition difference of the data set can therefore be used to determine the optimal number of clusters: when the transition difference between (k-1 classes → k classes) and (k classes → k+1 classes) is at its minimum, there is no longer any need to increase the number of classes from k to k+1, and k is the optimal number of clusters.
Fuzzy C-means clustering:
The fuzzy C-means algorithm (FCM) is an unsupervised fuzzy clustering algorithm used to classify data distributed in a high-dimensional space into specific classes. The membership degree of each sample point with respect to all class centers is obtained by optimizing an objective function, which determines the class of each sample point and thus classifies the sample data automatically. The basic idea of the FCM algorithm is to partition a given sample set $x = \{x_1, \dots, x_n\}$ into k fuzzy clusters such that a given objective function J is minimized. The objective function J is defined as follows:
$$J = \sum_{i=1}^{n} \sum_{j=1}^{k} u_{ij}^{m} \lVert x_i - c_j \rVert^2$$
In the formula, m is the fuzzy index, $c_j$ is the center of the j-th class, and $u_{ij}$ is the element of the membership matrix giving the degree to which the i-th datum belongs to the j-th class.
In each iteration, the membership function is used to calculate the membership values, and the cluster centers and the membership matrix are updated. The basic steps of the algorithm are as follows:
S1: initialize the number of clusters k, the fuzzy weighting index m, the iteration termination threshold, the iteration count and the membership matrix;
S2: calculate the fuzzy cluster centers:
S4: if the iteration termination condition is satisfied, the objective function has reached its minimum; terminate the iteration and output the membership matrix and the class centers; otherwise, go to S2 and continue until the condition is satisfied.
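A minimal sketch of the FCM iteration is given below. It uses the textbook FCM update formulas for the centers and memberships, which are assumed here because the corresponding formulas are not reproduced in the text. The function name `fcm`, the default values of `m`, `eps` and `max_iter`, and the choice to derive the initial membership matrix from given centers are assumptions of the sketch.

```python
import numpy as np

def fcm(x, centers, m=2.0, eps=1e-5, max_iter=300):
    """Fuzzy C-means started from given class centers; returns (memberships, centers)."""
    x = np.asarray(x, dtype=float)
    c = np.asarray(centers, dtype=float).copy()

    def memberships(c):
        # Textbook update: u_ij = d_ij^(-2/(m-1)) / sum_l d_il^(-2/(m-1))
        d = np.linalg.norm(x[:, None, :] - c[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)  # avoid division by zero when a datum sits on a center
        inv = d ** (-2.0 / (m - 1.0))
        return inv / inv.sum(axis=1, keepdims=True)

    u = memberships(c)  # S1: initial membership matrix (derived from the centers)
    for _ in range(max_iter):
        w = u ** m
        # S2: fuzzy cluster centers as membership-weighted means
        c = (w.T @ x) / w.sum(axis=0)[:, None]
        # Assumed intermediate step: recompute the membership matrix
        u_new = memberships(c)
        # S4: stop when the membership matrix changes by less than the threshold
        converged = np.abs(u_new - u).max() < eps
        u = u_new
        if converged:
            break
    return u, c
```

Each row of the returned membership matrix sums to 1, and a hard class assignment can be read off with `u.argmax(axis=1)`.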
Implementation of the KEFCM algorithm:
The whole algorithm is divided into two stages: the first stage obtains the number and centers of the initial classes by hybrid iteration of the improved K-means and information entropy, and the second stage obtains the final clustering result by FCM clustering.
As shown in FIG. 1, given the sample set to be clustered, the overall steps of the KEFCM (fuzzy C-means algorithm based on K-means and entropy) algorithm are:
S2: as the number of clusters is gradually increased, for each cluster number j, calculate the cluster centers with the improved K-means algorithm, and then calculate the information entropy transition difference on the basis of the calculated data deviation;
S3: in the resulting sequence of information entropy transition differences, obtain the number of clusters k at which the transition difference reaches its minimum, together with the cluster centers at that point;
S4: use the optimal number of clusters k and the class centers as initialization parameters to initialize the FCM algorithm;
S6: if the change in the membership matrix relative to the membership matrix of the previous iteration is less than the iteration termination threshold, the clustering algorithm stops and outputs the membership matrix and the class centers; otherwise, go to S5 to continue the iteration;
S8: obtain the final membership matrix and class centers, and assign the data in the data set to the corresponding classes.
The invention quantitatively obtains the optimal number of partitions for attribute discretization and achieves dynamic, adaptive attribute boundary division;
using the KEFCM algorithm and selecting the minimum support and minimum confidence effectively preserves the information at the data boundaries, so that rules of research value are not overlooked while mining association rules, and the KEFCM algorithm has high classification accuracy;
compared with the traditional association rule method, the method detects the fault state of the equipment quickly and accurately.
Example verification:
The validity of the proposed algorithm is verified on a test data set. The UCI Wine data set is selected as the test data; it contains 178 samples with 13 features (such as Alcohol, Malic acid and Ash) and is divided into 3 classes in total. Partial sample data of the data set are shown in Table 1.
TABLE 1 Partial sample data of the Wine data set
The entropy transition differences obtained by combining information entropy with K-means iteration are shown in FIG. 2. The figure shows that the transition difference is smallest when the transition condition is 2, so the optimal number of classes for the data set is 3, which agrees with the actual number of classes in the test data set.
The three classes of the Wine data set contain 59, 71 and 48 samples respectively. The data set was classified with the K-means, FCM and KEFCM algorithms; the results are shown in FIG. 3, where the proposed KEFCM algorithm achieves the highest classification accuracy.
Basic theory of Apriori algorithm:
Apriori is a Boolean association rule algorithm for finding frequent itemsets. It proceeds by layer-by-layer iteration and generates frequent itemsets from candidate itemsets, that is, the (k-1)-itemsets L_{k-1} are used to generate the k-itemsets L_k. The procedures for obtaining the frequent 1-itemsets and the frequent k-itemsets are shown in FIGS. 4 and 5. First, the database is scanned and the count of each item is accumulated to find the items that satisfy the minimum support; the set of frequent 1-itemsets is denoted L_1. The set L_2 of frequent 2-itemsets is then found from L_1, and so on, until no itemset satisfying the condition can be obtained; the itemsets obtained at that point are called the maximal frequent itemsets.
The join step joins L_{k-1} with itself to generate the set of candidate k-itemsets, denoted C_k. The prune step deletes a candidate k-itemset if any of its (k-1)-item subsets is not in L_{k-1}.
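The layer-by-layer search with its join and prune steps can be sketched as follows. This is a compact illustration rather than an optimized implementation; the function name `apriori` and the representation of transactions as sets of interval labels are assumptions of the sketch.

```python
from itertools import combinations

def apriori(transactions, min_sup):
    """Return {frequent itemset: support} for all itemsets with support >= min_sup."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # fraction of transactions that contain every item of the itemset
        return sum(itemset <= t for t in transactions) / n

    # L_1: frequent 1-itemsets found by scanning the database
    items = {i for t in transactions for i in t}
    level = {frozenset([i]) for i in items if support(frozenset([i])) >= min_sup}
    frequent = {}
    k = 1
    while level:
        frequent.update({s: support(s) for s in level})
        k += 1
        # join step: unite pairs from L_{k-1} whose union has exactly k items -> C_k
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # prune step: drop candidates that have an infrequent (k-1)-item subset
        candidates = {c for c in candidates
                      if all(frozenset(s) in level for s in combinations(c, k - 1))}
        # L_k: candidates that satisfy the minimum support
        level = {c for c in candidates if support(c) >= min_sup}
    return frequent
```

The loop ends when no itemset of the current size satisfies the minimum support, matching the stopping condition described above.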
The validity of an association rule is determined by its support and confidence. By the definition of an association rule, for a database $D$, if A and B are itemsets of the transactions in $D$ with $A \cap B = \varnothing$, then the expression $A \Rightarrow B$ is an association rule, where A and B are the antecedent and the consequent of the rule. The support is the percentage of transactions in $D$ that contain $A \cup B$, as shown in formula (12):
$$\mathrm{support}(A \Rightarrow B) = P(A \cup B) \tag{12}$$
In the formula, A is the antecedent of the association rule and B is the consequent.
The confidence is the probability, within $D$, that B appears when A appears, as shown in formula (13):
$$\mathrm{confidence}(A \Rightarrow B) = P(B \mid A) = \frac{\mathrm{support}(A \cup B)}{\mathrm{support}(A)} \tag{13}$$
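Support and confidence can be computed directly from their definitions, as in the short sketch below. The function names and the four toy transactions are hypothetical and serve only to illustrate the calculation; they are not data from the experiments.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item of `itemset`."""
    itemset = frozenset(itemset)
    return sum(itemset <= frozenset(t) for t in transactions) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Probability that the consequent appears, given that the antecedent appears."""
    both = frozenset(antecedent) | frozenset(consequent)
    # assumes the antecedent occurs in at least one transaction
    return support(transactions, both) / support(transactions, antecedent)

# Hypothetical toy database of interval labels
rows = [
    {"11", "23", "02"},
    {"11", "23", "02"},
    {"11", "23", "03"},
    {"12", "21", "02"},
]
rule_support = support(rows, {"11", "23", "02"})
rule_confidence = confidence(rows, {"11", "23"}, {"02"})
```

In this toy database the rule (11, 23 → 02) holds in two of the three transactions that contain its antecedent.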
Experimental analysis:
preparing data:
Different transformer faults produce different characteristic fault gases. According to the current Chinese standard GB/T 7252-2001, "Guide to the analysis and judgment of dissolved gases in transformer oil", the five characteristic gases that influence the occurrence of transformer faults are H2, CH4, C2H2, C2H4 and C2H6; these five attributes are listed in Table 2. For the analysis, 1000 groups of historical gas-component data recorded under normal operation in 2017 and 600 groups of data recorded before and after a fault point in May 2018 were extracted.
TABLE 2 Transformer Properties
The 1000 groups of data for the five continuous attributes are discretized with the KEFCM clustering algorithm. The discretization intervals obtained for each attribute after determining its optimal number of classes are shown in Table 3 (values rounded to two decimal places).
TABLE 3 discretization interval of Transformer Attribute
For data mining, intervals belonging to different attributes must be distinguishable so that they are not confused, so the data to be mined are numbered. For example, if the value of H2 in a group of data is 14.88, the value falls in the fifth interval of attribute x0, so the datum is labeled 05, and so on. The resulting form of the database to be mined is shown in Table 4.
TABLE 4 Database to be mined
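The labeling scheme can be sketched as a lookup of the interval index followed by string formatting. The function name `interval_label` is hypothetical, and the cut points 11.0 and 13.5 are invented for illustration because Table 3 is not reproduced in the text; only the boundaries 5.51, 8.35 and 12.61 appear in the description.

```python
from bisect import bisect_right

def interval_label(attr_index, value, cut_points):
    """Label = attribute index followed by the 1-based number of the interval.

    `cut_points` is the sorted list of boundaries between adjacent intervals.
    """
    interval = bisect_right(cut_points, value) + 1
    return f"{attr_index}{interval}"

# Assumed cut points for H2 (attribute x0); the last two values are hypothetical
h2_cuts = [5.51, 8.35, 11.0, 13.5]
h2_label = interval_label(0, 14.88, h2_cuts)
```

With these assumed cut points, an H2 value of 14.88 lies above the fourth boundary and therefore receives the label 05, as in the example above.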
Association rule base establishment:
For the mined association rules to express the relationships among the transformer attributes accurately, the selection of the minimum support minSup and the minimum confidence minConf is the most critical step. The matching rate, together with its mean and variance, is used as the index for evaluating the accuracy of the association rules mined under a given pair of minSup and minConf. An optimal pair of minSup and minConf is found through several groups of experiments, and the rules mined under these parameters form the association rule base. The matching rate is calculated as follows:
In the formula, the three quantities are, in order: the number of rules that the current data satisfy; the number of rules for which the data satisfy only the rule antecedent; and the degree of matching between the group of data and the rule base.
The larger the matching rate, the more accurately the rules reflect the intrinsic relationships among the attributes of that group of data. The mean of the matching rates over all matched data represents the average degree of matching between the rule base and the training data, and the variance of the matching rates represents how stably the rule base fits the training data.
Rule bases are established under different minSup and minConf, and the changes in the mean and variance of the matching rate across the rule bases are compared to determine an optimal pair of minSup and minConf, which is then used as the parameters of the mining algorithm to build the rule base. The results of the experimental analysis are shown in FIGS. 6 and 7.
Generally, the larger the support and the higher the confidence, the smaller the mean and the larger the variance, but excessively large support and confidence cause the mean to drop sharply and the variance to rise sharply. This is because excessively large support and confidence drastically reduce the number of rules, which reduces the coverage of the rule base, so that a large amount of data cannot find a matching rule. The optimal pair of minimum support and minimum confidence for the Apriori algorithm can be read from FIGS. 6 and 7. On this basis, 2781 frequent attribute sets and 5546 association rules are mined; partial frequent attribute sets and association rules are shown in Tables 5 and 6.
TABLE 5 partial frequent Attribute set
TABLE 6 partial association rules
Taking the first association rule (11, 23 → 02) in Table 6 as an example, its meaning is:
given a CH4 value in the first interval ([11.53, 12.61]) and a C2H2 value in the third interval ([2.29, 3.43]), the probability that the H2 value falls in the second interval ([5.51, 8.35]) is 95.14%.
Verification of the early warning effect:
To further verify the practicality of the fault early warning method, the 600 groups of data recorded before and after the fault point in May 2018 are used for validation (the 300th group of data is the initial fault point). The early warning results of the proposed method and of the conventional association rule method are shown in Table 7.
TABLE 7 Transformer Fault diagnosis results
The association rules represent the relationships among the attributes of the equipment in its normal state. In the early stage of a fault, the existing association relationships among the attributes are gradually broken and continue to deteriorate, so the original association rules fit the current operating data less and less well, and an alarm is raised. Table 7 shows that the fuzzy association rule method detects a fault trend at the 276th group of data, whereas the conventional association rule method identifies the fault trend only at the 293rd group, 17 groups later, which indicates that the proposed method can diagnose the fault state of the transformer earlier and more accurately. In summary, the experimental results verify the effectiveness and efficiency of fuzzy association rules in fault early warning.
The above description merely illustrates the preferred embodiments of the present invention and is not to be construed as limiting the invention; any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included within its scope of protection.
Claims (2)
1. A power system equipment fault early warning method based on fuzzy association rules, characterized in that
the method comprises the following steps:
determining the optimal number of partitions of the power equipment data through hybrid iteration of K-means and information entropy, so as to realize dynamic adaptive boundary partitioning;
introducing fuzzy sets to soften the attribute partition boundaries, and dividing the fuzzy intervals using fuzzy C-means;
selecting, by means of the Apriori algorithm, an optimal set of minimum support and confidence as the main parameters of the mining algorithm, and mining association rules according to these parameters to construct a rule base, so as to analyze and predict the fault state of the power equipment.
2. The power system equipment fault early warning method based on fuzzy association rules according to claim 1,
characterized in that:
the number and centers of the initial classes are obtained through hybrid iteration of the improved K-means and information entropy, and the final clustering result is obtained through FCM clustering;
for a given sample set to be clustered, the specific steps of the power system equipment fault early warning method based on fuzzy association rules are as follows:
S2: as the number of clusters is gradually increased, for each cluster number j, the cluster centers are calculated using the improved K-means algorithm, and the transition difference value of the information entropy is then calculated on the basis of the calculated data deviation;
S3: in the sequence thus determined, the number of clusters k at which the minimum value is reached, and the cluster centers at that point, are obtained;
S4: the FCM algorithm is initialized with the optimal number of clusters k and the class centers as initialization parameters;
S6: if the convergence condition is satisfied, the clustering algorithm stops and outputs the membership matrix and the class centers; otherwise, the method returns to S5 and continues iterating;
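Steps S4 to S6 have the general shape of fuzzy C-means (FCM) started from given class centers. The following is a minimal sketch of textbook FCM on one-dimensional data, not the claimed procedure: the update formulas and the stopping test in the claim are not preserved in this text, so the standard membership and center updates and a maximum-change test on the membership matrix are assumed here. The function name `fcm_1d`, the fuzzifier `m`, the tolerance `eps`, and the sample data are hypothetical.

```python
def fcm_1d(data, centers, m=2.0, eps=1e-5, max_iter=100):
    """Textbook fuzzy C-means on 1-D data, initialized with the given centers."""
    centers = list(centers)
    u_prev = None
    for _ in range(max_iter):
        # Membership update: u[i][k] = 1 / sum_j (d_ik / d_ij) ** (2 / (m - 1))
        u = []
        for x in data:
            d = [abs(x - c) for c in centers]
            if 0.0 in d:
                # Point coincides with a center: full membership there.
                row = [1.0 if dk == 0.0 else 0.0 for dk in d]
                total = sum(row)
                row = [r / total for r in row]
            else:
                row = [
                    1.0 / sum((dk / dj) ** (2.0 / (m - 1.0)) for dj in d)
                    for dk in d
                ]
            u.append(row)
        # Center update: mean of the data weighted by u ** m
        centers = [
            sum(u[i][k] ** m * data[i] for i in range(len(data)))
            / sum(u[i][k] ** m for i in range(len(data)))
            for k in range(len(centers))
        ]
        # Stop once no membership changes by eps or more (assumed criterion).
        if u_prev is not None and max(
            abs(a - b) for ra, rb in zip(u, u_prev) for a, b in zip(ra, rb)
        ) < eps:
            break
        u_prev = u
    return u, centers

# Hypothetical readings forming two groups; initial centers as if supplied by S3.
data = [1.0, 1.2, 0.9, 5.0, 5.3, 4.8]
u, centers = fcm_1d(data, [1.0, 5.0])
print(centers)
```

Each row of the returned membership matrix sums to 1, so a reading near a boundary belongs partly to both neighboring intervals, which is the boundary softening the method relies on.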
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010274132.9A CN111401785A (en) | 2020-04-09 | 2020-04-09 | Power system equipment fault early warning method based on fuzzy association rule |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111401785A | 2020-07-10 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20200710 |