CN112881827A

CN112881827A - Oil-immersed transformer fault diagnosis method based on improved grey correlation analysis

Info

Publication number: CN112881827A
Application number: CN202110037452.7A
Authority: CN
Inventors: 吐松江·卡日; 伊力哈木·亚尔买买提; 刘鹏伟; 张宽; 孙国良; 邸强; 张鹏程; 逯浩坦
Original assignee: Xinjiang University
Current assignee: Xinjiang University
Priority date: 2021-01-12
Filing date: 2021-01-12
Publication date: 2021-06-01
Anticipated expiration: 2041-01-12
Also published as: CN112881827B

Abstract

The invention discloses a fault diagnosis method for an oil-immersed transformer, which comprises the following steps: s1, collecting and sorting multi-type fault samples of the oil-immersed transformer, and constructing a feature set FS1 and a fault sample set D1; s2, calculating the feature weight of a feature set FS1 by two feature selection methods, establishing low-dimensional feature subsets FS21 and FS22, and further establishing low-dimensional fault sample sets D21 and D22; s3, solving clustering centers C1 and C2 of fault sample sets D21 and D22 by using a fuzzy clustering algorithm, and taking the obtained clustering centers C1 and C2 as two groups of reference sequences in improved gray correlation analysis; and normalizing the feature weights corresponding to the low-dimensional features FS21 and FS22, calculating corresponding association values based on the Mahalanobis distance and the dynamic identification coefficient, and selecting the fault type corresponding to the maximum association value as a diagnosis result of the sample to be detected after calculating the average value. The oil-immersed transformer fault diagnosis method established by the invention has the advantages of simple structure, high calculation efficiency, good diagnosis performance and the like.

Description

Oil-immersed transformer fault diagnosis method based on improved grey correlation analysis

Technical Field

The invention belongs to the technical field of electrical equipment, and particularly relates to an oil-immersed transformer fault diagnosis method based on improved grey correlation analysis.

Background

The oil-immersed transformer is used as expensive and important component equipment in a high-voltage power system, bears the functions of voltage conversion and electric energy distribution, and plays an especially important role in reliable operation of the power system. Because the operation environment of the oil-immersed transformer is severe, various defects and faults inevitably occur, and potential safety hazards are brought to the reliable operation of a power system; when the oil-immersed transformer suffers from a fault, the power production department faces huge economic loss, so that the fault of the transformer can be found and processed as soon as possible, and the establishment of an efficient and reliable fault diagnosis system has important significance.

At present, among the fault diagnosis methods for oil-immersed transformers, a Dissolved Gas Analysis (DGA) method in oil is one of the simplest and most widely applied methods. However, the DGA-based ratio method has the defects of absolute coding boundary, limited coding types, low diagnosis accuracy, inconsistent diagnosis performance and the like; in recent years, experts and scholars at home and abroad adopt technical theories of machine learning, data mining and the like to establish a plurality of intelligent fault diagnosis models based on dissolved gas in oil and obtain positive effects, but the problems of complex diagnosis models, redundant characteristics, low diagnosis precision, low diagnosis efficiency and the like exist at the same time. The problems of characteristic redundancy, difficulty in determining characteristic weight, incapability of reflecting mutual influence among resolution results and the like exist in the conventional oil-immersed transformer fault diagnosis method based on grey correlation analysis. Therefore, a model for diagnosing potential faults of the oil immersed transformer, which is simple in structure, accurate, efficient and convenient, is established, the working state of the power transformer is mastered in time, and the method has important significance for reducing transformer maintenance, shortening power failure time and improving equipment and power grid reliability.

Disclosure of Invention

The invention aims to provide an oil-immersed transformer fault diagnosis method based on improved grey correlation analysis, and solves the problems of complex diagnosis model, redundant characteristics, low diagnosis precision and low diagnosis efficiency of the existing oil-immersed transformer fault diagnosis method.

In order to solve the technical problems, the invention is realized by the following technical scheme:

an oil-immersed transformer fault diagnosis method based on improved grey correlation analysis comprises the following steps:

s1, collecting the oil-immersed transformer, sorting multiple types of fault samples, and constructing a feature set FS1 according to the component values and the combinations of conventional gases dissolved in oil; a fault sample set D1 is established together based on a gas sample dissolved in oil of the oil-immersed transformer and the characteristic set FS 1;

s2, calculating feature weights of a feature set FS1 by adopting two different feature selection methods, sorting the feature weights according to size values, reserving features with weight values meeting the requirements of preset thresholds, establishing low-dimensional feature subsets FS21 and FS22, and further establishing low-dimensional fault sample sets D21 and D22;

s3, solving clustering centers C1 and C2 of fault sample sets D21 and D22 by using a fuzzy clustering algorithm, and respectively using the obtained clustering centers as reference sequences in improved gray correlation analysis; normalization processing is carried out on the feature weights corresponding to the low-dimensional features FS21 and FS22, and then the correlation coefficient and the correlation degree of the sample to be detected are calculated respectively according to the weight values and the reference sequence; and calculating the average value of the two groups of correlation degrees, and outputting the fault type corresponding to the maximum value of the average correlation degrees as the diagnosis result of the sample to be detected.

Preferably, in the step S1, the feature set FS1 is characterized by 9 fault types of oil chromatogram samples, the oil chromatogram samples are composed of content features of 5 feature gases, and the 5 feature gases are methane (CH4), hydrogen (H2), ethane (C2H6), ethylene (C2H4), and acetylene (C2H 2);

wherein, the content characteristic of each characteristic gas consists of three characteristics of absolute content, relative content and mutual ratio;

among them, 9 kinds of fault types include No Fault (NF), Partial Discharge (PD), low energy discharge (LD), high energy discharge (HD), low temperature overheat (LT), medium temperature overheat (MT), high temperature overheat (HT), low energy discharge and overheat (LTD), and high energy discharge and overheat (HTD);

and extracting the contained characteristic values from the characteristic set FS1 so as to construct a fault sample set D1.

Preferably, the step S2 includes the following steps:

s21, aiming at the established fault sample set D1, 80% of fault samples are used for establishing a fault diagnosis model, and 20% of fault samples are used for testing the diagnosis performance of the model; in order to eliminate dimension inconsistency among different samples, the data of the original fault data set is normalized, and the extreme value normalization formula is as follows:

wherein: x is the number of_ikIs the kth feature data in the ith sample, and x_{ik max}And x_{ik min}Respectively representing the maximum value and the minimum value of the kth characteristic;

s22, calculating a feature weight value in the feature set FS1 by using a feature selection method (the Fisher Score method and the Relief method are respectively used in the invention) based on the feature set FS1 normalized in the step S21, wherein the larger the weight value is, the more important the feature is, otherwise, the feature is insensitive to fault classification, and the feature can be removed.

Wherein, calculating the characteristic weight value w by using a Fisher Scoer method₁(f_i) The calculation method of (2) is as follows:

assume that the dataset contains class c samples, n_kIs the total number of class k samples, f_j,iIs the jth sample value in the ith feature; u. of_iDenotes the ith characteristic_iAverage value of (a), u_ikRepresents the mean value, x, in the k class under the ith feature_i,k_jFor the jth sample in the kth class sample in the ith feature f_iValue of (i) isScore of feature F (F)_i) And weight w₁(f_i) Can be calculated as follows:

wherein, the characteristic weight value w is calculated by using a Relief F method₂(f_i) The calculation method of (2) is as follows:

firstly, setting all feature weights as 0; then, k neighbor sample sets of samples belonging to the same class as the random sample R and k samples not belonging to the same class as R are searched from the data set D, and each characteristic weight w is updated₂(f_i)；

Wherein, diff (f)_i,R_i,H_j) Denotes samples R and H_jIn characteristic f_iDifference of above, H_jIs the jth nearest neighbor sample of the same class as R. C is the total number of classes, class (R), which is different from sample R_i) Represents the class of the sample R, P (C) represents the probability of class C, diff (f)_i,R_i,M_j(C) Are sample R and sample M)_j(C) In characteristic f_iDifference of above, M_j(C) Representing the jth nearest neighbor sample to sample R in class C. Wherein diff (f)_i,R_i,H_j) The calculation can be as follows:

in the above formula, R < f >_i]、H_j[f_i]Respectively represent samples R and H_iIn characteristic f_iSample values of the following. Wherein the score is F (F)_i) The higher or weight ω (f)_i) The larger the size, the stronger the ability of the feature to distinguish different categories; and conversely, the characteristic is insensitive to fault classification and can be eliminated.

S23, step S22The calculated characteristic weight values are sorted according to the weight values, and a characteristic weight threshold value w is preset_ρ(ii) a When feature f_iWeight value of omega₁(f_i)、ω₂(f_i) Greater than a threshold value w_ρIf so, the feature is reserved, otherwise, the feature is removed as an unimportant feature; and respectively establishing low-dimensional feature subsets FS21 and FS2 by using the reserved features, and further establishing corresponding low-dimensional fault sample sets D21 and D22.

Preferably, the feature selection method in step S2 is a Fisher Score feature selection method and a Relief F feature selection method;

the dimensions of the low-dimensional feature subsets FS21, F22 are lower than the feature subset FS 1.

Preferably, the step S3 includes the following steps:

s31, solving the clustering centers of the fault sample sets D21 and D22 by adopting a fuzzy C-means clustering method, and taking the obtained clustering centers as a reference sequence X in improved grey correlation analysis₀₁And X₀₂；

S32, respectively carrying out normalization processing on the feature weight values corresponding to the low-dimensional feature subsets FS21 and FS22 to obtain a weight vector omega₀₁(k) And ω₀₂(k)；

S33, based on the fault sample set D21 obtained in the step S31, the low-dimensional feature subset FS21 and the reference sequence X₀₁Calculating the correlation coefficient of each characteristic point in the sample to be detected and the reference sequence; wherein, for a certain characteristic point k, the correlation coefficient xi_{k-fisher score}The calculation formula of (a) is as follows:

in the formula, x₀(k) Is a reference sequence X₀The value at the kth feature; x is the number of_i(k) The value of the ith sequence to be tested at the kth characteristic is taken as the value of the ith sequence to be tested; the maximum and minimum values of the corresponding characteristic differences of the fault standard sequence x0 and the sequence xi to be detected are represented by delta max and delta min; delta_oi(k) Is the difference between the test sequence and the reference sequence; to be more accurateMore clearly describing the differences between sequences, the present invention uses the Mahalanobis Distance (MD); cov is a covariance matrix among different dimensional variables, rho is a resolution coefficient, and the indirect influence of the attention degree and the characteristics of the delta max on the correlation degree is reflected; in order to improve the diagnosis precision, a dynamic resolution coefficient is adopted, and the calculation method is shown as the following formula:

wherein the content of the first and second substances,

ε(i)＝Δv(i)/Δ_max；

s34, based on the weight vector omega obtained in step S22 and step S33₁(f_i) And correlation coefficient ξ_{k-fisher score}Calculating the correlation degree gamma between the sample to be measured and the reference sequence_{i-fisher scoer}(ii) a Wherein, the correlation calculation formula is as follows:

wherein ξ_kThe correlation coefficient of a certain characteristic point k; w is a₀₁Solving the retained and normalized characteristic weight value based on a Fisher Score method; n is the number of key features; m is the total number of fault types in the reference sequence;

s35, based on the fault sample set D22 obtained in the step S31, the low-dimensional feature subset FS22 and the reference sequence X₀₂Calculating the correlation coefficient of each characteristic point in the sample to be detected and the reference sequence; the calculation steps can be repeated S33 and S34, and then the second group gamma is obtained_i-ReliefF。

S36, based on the magnitude γ of the relevance value obtained in step S34 and step S35 respectively_{i-Fisher Scoer}And gamma_i-ReliefFAnd calculating the average relevance value, wherein the calculation formula is as follows:

according to

Calculating a value, and outputting the fault type corresponding to the maximum correlation value as a diagnosis result of the sample to be detected.

The invention has the following beneficial effects:

by adopting the dynamic identification coefficient, the diagnosis method of the oil-immersed transformer has the advantages of simple structure, high calculation efficiency, no redundancy characteristic, high diagnosis precision, high diagnosis efficiency and the like, so that the transformer fault diagnosis, maintenance and overhaul work can be carried out more effectively and reliably.

Of course, it is not necessary for any product in which the invention is practiced to achieve all of the above-described advantages at the same time.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

S1, collecting and sorting multi-type fault samples of the oil-immersed transformer, and constructing a feature set FS1 according to the conventional gas component values dissolved in oil and the combination of the conventional gas component values; a fault sample set D1 is established together based on a gas sample dissolved in oil of the oil-immersed transformer and the characteristic set FS 1;

In step S1, the feature set FS1 is characterized by 9 fault types of oil chromatogram samples, the oil chromatogram samples are composed of content features of 5 feature gases, and the 5 feature gases are methane (CH4), hydrogen (H2), ethane (C2H6), ethylene (C2H4), and acetylene (C2H 2);

The following table is the feature content of the feature set FS 1:

TABLE 1 DFA-based FS1 feature set

Characteristic name	Feature content	Characteristic name	Feature content
				f₁	H₂％	f₁₃	CH₄/C₂H₂
f₂	CH₄％	f₁₄	CH₄/TH
				f₃	C₂H₆％	f₁₅	C₂H₆/C₂H₄
f₄	C₂H₄％	f₁₆	C₂H₆/C₂H₂
				f₅	C₂H₂％	f₁₇	C₂H₆/TH
f₆	H₂/CH₄	f₁₈	C₂H₄/C₂H₂
				f₇	H₂/C₂H₆	f₁₉	C₂H₄/TH
f₈	H₂/C₂H₄	f₂₀	C₂H₂/TH
				f₉	H₂/C₂H₂	f₂₁	C₂H₂/TD
f₁₀	H₂/TH	f₂₂	C₂H₄/TD
				f₁₁	CH₄/C₂H₆	f₂₃	CH₄/TD
f₁₂	CH₄/C₂H₄	f₂₄	TS

Wherein, H2% is H2/(H2+ CH4+ C2H6+ C2H4+ C2H2), and the rest is similar; TH ═ CH4+ C2H6+ C2H4+ C2H 2; TD ═ C2H4+ C2H2+ CH 4; TS ═ H2+ CH4+ C2H6+ C2H4+ C2H 2; all by mass.

In step S2, the method includes the following steps:

assume that the dataset contains class c samples, n_kIs the total number of class k samples, f_j,iIs the jth sample value in the ith feature; u. of_iDenotes the ith characteristic_iAverage value of (a), u_ikRepresents the mean value, x, in the k class under the ith feature_i,k_jFor the jth sample in the kth class sample in the ith feature f_iThe value of the i-th feature is the score F (F)_i) And weight w₁(f_i) Can be calculated as follows:

S23, based on the characteristic weight value calculated in step S22Sorting according to the weight value, and presetting a characteristic weight threshold value w_ρ(ii) a When feature f_iWeight value of omega₁(f_i)、ω₂(f_i) Greater than a threshold value w_ρIf so, the feature is reserved, otherwise, the feature is removed as an unimportant feature; and establishing low-dimensional feature subsets FS21 and FS2 for the reserved features, and further establishing low-dimensional fault sample sets D21 and D22.

In step S3, the method includes the following steps:

S32, respectively carrying out normalization processing on the feature weight values corresponding to the low-dimensional feature subsets FS21 and FS22 to obtain weight vectors omega 01(k) and omega 02(k), wherein the calculation formula is as follows;

wherein p and q respectively represent the weight preset value omega which is calculated and ordered by utilizing a Fisher Socre method and a Relief F method_ρThe number of features of (2).

in the formula, x₀(k) Is a reference sequence X₀The value at the kth feature; x is the number of_i(k) The value of the ith sequence to be tested at the kth characteristic is taken as the value of the ith sequence to be tested; the maximum and minimum values of the corresponding characteristic differences of the fault standard sequence x0 and the sequence xi to be detected are represented by delta max and delta min; delta_oi(k) Is the difference between the test sequence and the reference sequence; in order to describe the difference between sequences more accurately and obviously, the Mahalanobis Distance (MD) is adopted in the invention; cov is a covariance matrix among different dimensional variables, rho is a resolution coefficient, and the indirect influence of the attention degree and the characteristics of the delta max on the correlation degree is reflected; in order to improve the diagnosis precision, a dynamic resolution coefficient is adopted, and the calculation method is shown as the following formula:

wherein the content of the first and second substances,

(i)＝Δv(i)/Δ_max；

s34, based on the weight vector omega obtained in step S22 and step S33₀₁(f_i) And correlation coefficient ξ_{k-fisher score}Calculating the correlation degree gamma between the sample to be measured and the reference sequence_{i-fisher scoer}(ii) a Wherein, the correlation calculation formula is as follows:

wherein ξ_kThe correlation coefficient of a certain characteristic point k; w is a_kThe characteristic weight value is normalized; n is the number of key features; m is the fault type in the reference sequence;

according to

According to the oil-immersed transformer fault diagnosis method based on the improved grey correlation analysis, provided by the invention, the diagnosed fault characteristic information is more comprehensive, the selected characteristic subset has better identification capability and lower input characteristic dimension, and the problem of difficulty in determining the characteristic weight is solved; and by adopting the dynamic identification coefficient, the whole set of diagnosis method has the advantages of simple structure, high calculation efficiency, good diagnosis performance and the like, so that the transformer fault diagnosis, maintenance and overhaul work can be carried out more effectively and reliably.

In the description herein, references to the description of "one embodiment," "an example," "a specific example" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise embodiments disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.

Claims

1. An oil-immersed transformer fault diagnosis method based on improved grey correlation analysis is characterized by comprising the following steps:

2. The oil-filled transformer fault diagnosis method based on improved gray correlation analysis according to claim 1, wherein in step S1, the feature set FS1 is characterized by 9 fault types of oil chromatogram samples, the oil chromatogram samples are composed of content features of 5 feature gases, and the 5 feature gases are methane (CH4), hydrogen (H2), ethane (C2H6), ethylene (C2H4), and acetylene (C2H 2);

3. The oil-filled transformer fault diagnosis method based on improved grey correlation analysis according to claim 2, wherein the step S2 includes the following steps:

s22, calculating a feature weight value in the feature set FS1 by using a feature selection method (the Fisher Score method and the Relief F method are respectively used in the invention) based on the feature set FS1 normalized in the step S21, wherein the larger the weight value is, the more important the feature is, otherwise, the feature is insensitive to fault classification;

Wherein, diff (f)_i,R_i,H_j) Denotes samples R and H_jIn characteristic f_iDifference of above, H_jIs the jth nearest neighbor sample of the same class as R. C is the total number of classes, class (R), which is different from sample R_i) Represents the class of the sample R, P (C) represents the probability of class C, diff (fi, R)_i,M_j(C) Are sample R and sample M)_j(C) In characteristic f_iDifference of above, M_j(C) Representing the jth nearest neighbor sample to sample R in class C. Wherein diff (f)_i,R_i,H_j) The calculation can be as follows:

in the above formula, R < f >_i]、H_j[f_i]Respectively represent samples R and H_iIn characteristic f_iSample values of the following. Wherein the score is F (F)_i) The higher or weight ω (f)_i) The larger the size, the stronger the ability of the feature to distinguish different categories; otherwise, the characteristic is not sensitive to fault classification;

s23, sorting according to the weight value based on the characteristic weight value calculated in the step S22, and presetting a characteristic weight threshold w_ρ(ii) a When feature f_iWeight value of omega₁(f_i)、ω₂(f_i) Greater than a threshold value w_ρIf so, the feature is reserved, otherwise, the feature is removed as an unimportant feature; and respectively establishing low-dimensional feature subsets FS21 and FS2 by using the reserved features, and further establishing corresponding low-dimensional fault sample sets D21 and D22.

4. The oil-filled transformer fault diagnosis method based on the improved grey correlation analysis according to claim 3, wherein the feature selection method in the step S2 is a Fisher Score feature selection method and a Relief feature selection method;

5. The oil-filled transformer fault diagnosis method based on improved grey correlation analysis according to claim 4, wherein the step S3 includes the following steps:

S33, based on the fault sample set D21 obtained in the step S31, the low-dimensional feature subset FS21 and the reference sequence X₀₁Calculating the correlation coefficient of each characteristic point in the sample to be detected and the reference sequence; whereinFor a certain feature point k, the correlation coefficient ξ_{k-fisher score}The calculation formula of (a) is as follows:

wherein the content of the first and second substances,

ε(i)＝Δv(i)/Δ_max；

s34, based on the weight vector omega obtained in step S22 and step S33₀₁(f_i) And correlation coefficient ξ_{k-fisher score}Calculating the correlation degree gamma between the sample to be measured and the reference sequence_{i-fisherscoer}(ii) a Wherein, the correlation calculation formula is as follows:

wherein ξ_kThe correlation coefficient of a certain characteristic point k; w is a₀₁Solving and normalizing the processed feature weights based on the Fisher Score methodA weight value; n is the number of key features; m is the total number of fault types in the reference sequence;

s35, based on the fault sample set D22 obtained in the step S31, the low-dimensional feature subset FS22 and the reference sequence X₀₂Calculating the correlation coefficient of each characteristic point in the sample to be detected and the reference sequence; the calculation steps can be repeated S33 and S34, and then the second group gamma is obtained_i-ReliefF；

according to