CN107491792B

CN107491792B - Power grid fault classification method based on feature mapping transfer learning

Info

Publication number: CN107491792B
Application number: CN201710756382.4A
Authority: CN
Inventors: 张化光; 刘鑫蕊; 孙秋野; 于晓婷; 杨珺; 王智良; 赵鑫; 吴泽群
Original assignee: Northeastern University China
Current assignee: Northeastern University China
Priority date: 2017-08-29
Filing date: 2017-08-29
Publication date: 2020-04-07
Anticipated expiration: 2037-08-29
Also published as: CN107491792A

Abstract

The invention discloses a power grid fault classification method based on feature mapping transfer learning, which comprises the following steps: 1. selecting target domain data and auxiliary source domain data; 2. respectively extracting fault features of target field data and auxiliary source field data based on micro-increment wavelet singular entropy, and taking each micro-increment wavelet singular entropy as a fault feature to respectively form a feature vector space corresponding to the target field and a feature vector space corresponding to the auxiliary source field; 3. finding out base vectors corresponding to the axis features, the specific features of the auxiliary source field and the specific features of the target field based on a feature mapping migration learning method; 4. taking the obtained base vector corresponding to the auxiliary source field as a support vector; and simultaneously setting a similarity penalty item and adding a constraint condition of a support vector training set to jointly train a classifier to obtain a corresponding classification result. The method can accurately and quickly find the three groups of base vectors which can reflect the fault types most.

Description

Power grid fault classification method based on feature mapping transfer learning

Technical Field

The invention belongs to the technical field of power transmission and distribution, and particularly relates to a power grid fault classification method based on feature mapping migration learning.

Background

The increasing scale of the power grid and the increasing transmission capacity and voltage level bring huge economic and social benefits, but meanwhile, the fault of the power grid can cause more serious harm to social economy and people's life. The rapid and accurate classification of the grid faults is a precondition for rapidly recovering the power supply of the grid and is an important part of fault analysis, so that the research on the rapid and reliable fault classification method has important significance for guaranteeing the safety and the economy of a power system.

Classification has been widely studied and applied as an important machine learning method; the method mainly comprises the steps of training a classification model according to source field data and then predicting the type of target field data by using the classification model. In order to ensure that the trained classification model has accuracy and high reliability, the traditional classification learning needs to satisfy two basic assumptions: (1) the training sample for learning and the new test sample meet the condition of independent and same distribution; (2) there must be enough training samples available to learn a good classification model. However, in practical applications, we find that these two conditions are often not satisfied.

In order to solve the problems of insufficient data quantity and characteristic difference, most machine learning algorithms adopt the method of re-marking fault samples, but a large amount of experiments and professional knowledge are needed, and the collected marked data and the fault data in the target field cannot be distributed consistently due to the change of factors such as the operation mode of a power grid, load and the like, so that the reliability of a diagnosis result is reduced.

The applicant researches and discovers that migration learning as a cross-field and cross-task learning method attracts more and more attention of scholars in the field of machine learning. Transfer learning is a new machine learning method that solves problems in different but related fields using existing knowledge. The method relaxes two basic assumptions in the traditional machine learning, aims to transfer knowledge learned from the source field to the target field under the condition that the source field data and the target field data have different data distributions, and solves the learning problem that only a small amount of labeled sample data exists in the target field or even the target field does not exist. When a power grid fails, the network topology structure changes, the data distribution changes, and based on a transfer learning method, knowledge of auxiliary data which is different from target data but related to the target data is fully utilized, so that the fault classification performance of a machine learning algorithm on the power grid can be effectively improved.

Therefore, the power grid fault classification based on the transfer learning has certain theoretical basis and practical significance.

Disclosure of Invention

In view of the defects in the prior art, the invention aims to provide a power grid fault classification method based on feature mapping migration learning, which can be used for mapping data of each field from an original high-dimensional feature space to a low-dimensional feature space by analyzing the correlation between the characteristic features of an auxiliary source field and the characteristic features of a target field and axis features in an abstract manner, so that the source field data and the target field data have similar distribution in the low-dimensional space; and solving the maximum value of the relation coefficient by a Lagrange multiplier method, and further finding out three groups of base vectors which can reflect the fault types most.

In order to achieve the purpose, the technical scheme of the invention is as follows:

a power grid fault classification method based on feature mapping migration learning is characterized by comprising the following steps:

step 1, selecting target domain data to be classified and auxiliary source domain data, wherein the target domain data at least comprises: three-phase current data of each fault line at each fault moment; the auxiliary source domain data includes: three-phase current data of each fault line at the previous fault moment corresponding to each fault moment, three-phase current data of each fault line at the previous normal operation moment corresponding to each fault moment and three-phase current data of adjacent lines of the fault line at each fault moment;

step 2, fault feature extraction based on micro-increment wavelet singular entropy is respectively carried out on target field data and auxiliary source field data to extract respective corresponding micro-increment wavelet singular entropy, each micro-increment wavelet singular entropy is taken as a fault feature, and then a feature vector space corresponding to the target field and a feature vector space corresponding to the auxiliary source field are respectively formed;

step 3, based on a feature mapping migration learning method, using the intersection of the auxiliary source field and the target field as an axis feature, and finding out base vectors corresponding to the axis feature, the characteristic feature of the auxiliary source field and the characteristic feature of the target field based on a Lagrange multiplier method to obtain an extreme value;

step 4, in the fault classification process based on the SVM, taking the base vector corresponding to the auxiliary source field obtained in the step 3 as a support vector; meanwhile, a similarity punishment item of a support vector training set corresponding to the auxiliary source field is added into an original target function of the support vector machine SVM, and a constraint condition of the added support vector training set is added into an original target function constraint condition, so that a classifier is trained together to obtain a corresponding classification result.

Further, the step 2 comprises:

step 21, respectively performing m-layer wavelet multi-resolution signal decomposition on the target field data and the auxiliary source field data to obtain a wavelet transformation coefficient matrix corresponding to a wavelet transformation result, and performing singular value decomposition calculation to obtain a singular value feature matrix corresponding to the wavelet transformation coefficient matrix, and marking the singular value feature matrix as Λ ═ diag (λ ═ diag)₁,λ₂,…λ_n)；

Step 22, respectively constructing n-order micro-increment wavelet singular entropies of the target field data and the auxiliary source field data, wherein the corresponding formula is

In the formula, λ_iIs the i-th order non-zero singular eigenvalue, X_iIs λ_iThe ith micro-increment wavelet singular entropy of (a);

step 23, constructing a feature vector X by using the n-order micro-increment wavelet singular entropy element of the auxiliary source field data_s1Is marked as X_s1＝[X₁,X₂…X_n]Simultaneously order

The corresponding normalized wavelet packet feature vector X_s1 ^*Is represented by X_s1 ^*＝[X₁/X,X₂/X,…,X_n/X]And forming a vector space X of the auxiliary source domain data_s ^*＝[X_s1 ^*,X_s2 ^*,…X_sn ^*](ii) a Vector space X which similarly constitutes target domain data_t ^*＝[X_t1 ^*,X_t2 ^*,…X_tn ^*]。

Further, n ═ m in the singular value feature matrix²-1 and such that λ_nAnd the constraint condition is satisfied.

Further, the step 3 comprises:

step 31, defining auxiliary source field X_s ^*The fault identifier of the known fault type is Y, so that a certain fault type identifier Y belongs to Y; auxiliary source field X_s ^*And the target area X_t ^*The intersection of (A) is the corresponding axis feature or is called the domain axis feature X_∩ ^*∈X_s ^*∩X_t ^*Simultaneous calculation of axial features X_∩ ^*And the correlation coefficient between Y, the corresponding calculation formula is as follows:

wherein, I (X)_∩ ^*Y) represents an axial feature X_∩ ^*Correlation coefficient with Y, P (X)_∩ ^*Y) field axis feature X_∩ ^*Joint distribution probability with fault identity y, P (X)_∩ ^*) Representing axial characteristics X_∩ ^*Data X appearing in the auxiliary Source Domain_s ^*P (y) indicates that the fault mark y appears in the target area data X_t ^*And selecting the axis feature with the maximum correlation coefficient value in m-layer wavelet multi-resolution signal decomposition to form an axis feature set, and recording as X_∩＝{X_∩1 ^*,X_∩2 ^*,…,X_∩m ^*}；

And selecting the axis feature with the maximum correlation coefficient value in m-layer wavelet multi-resolution signal decomposition to form an axis feature set, and recording as X_∩＝{X_∩1 ^*,X_∩2 ^*,…,X_∩m ^*}；

Step 32, firstly, based on the union formed by the extracted fault features in the auxiliary source domain data and the target domain data

Three sets of paired sample sets of random variables α, γ, were constructed and labeled

Wherein | X_∩|，

Respectively representing the dimensions of the axis features, the dimensions of the fault features of the auxiliary source domain data, the dimensions of the fault features of the target domain data, and

representing sample points X in auxiliary source domain data_s ^*In the axial feature space X_∩The value of (a) is selected from,

representing auxiliary source domain data sample points X_s ^*In a feature space

The value of (a) is selected from,

representing sample points in target domain data

In a feature space

The value of (a) is selected from,

then linearly combined according to

Find three groups of base vectors by the principle that the correlation coefficient between the three groups reaches the maximum

Namely, based on the following formula

Corresponding constraint condition

Wherein C is_AA＝(A_S∪A_t)(A_S∪A_t)^T

Wherein: w_AIs a set of basis vectors corresponding to the axial features; w_SThe method comprises the following steps of (1) acquiring a base vector set corresponding to the characteristic features of the auxiliary source field; w_TA base vector set corresponding to the characteristic features of the target field; c_ssIs a fault characteristic D in auxiliary source field data_sA covariance matrix of medial axis features; a. the_SIs | X about α_∩|×n_sA matrix of dimensions; a. the_tIs | X about α_∩|×n_tMatrix of dimensions, S is about β

A matrix of dimensions, T being about β

A matrix of dimensions; c_TTRefer to the failure characteristics D in the target domain data_tA covariance matrix of medial axis features; c_AAIs a fault characteristic D in auxiliary source field data_sAnd fault signature D in target domain data_tUnion D of_s∪D_tA covariance matrix of medial axis features;

step 33, finding out the base vectors corresponding to the axis features, the fault features in the auxiliary source field and the fault features in the target field based on the method for solving the extreme value by the Lagrange multiplier method, namely finding out the base vectors corresponding to the axis features, the fault features in the auxiliary source field and the fault features in the target field based on the following formulas:

the eigenvectors corresponding to the first m generalized eigenvalues of the matrix are the base vectors W_A，W_S，W_T。

Further, the step 4 comprises:

step 41, in the fault classification process based on the SVM, firstly, the base vector W corresponding to the auxiliary source field obtained in the step 3 is used_SAs a support vector; meanwhile, adding a similarity penalty term of a support vector training set corresponding to the field of the auxiliary source into the original target function in the support vector machine SVM, and recording the similarity penalty term as

Adding the constraint condition of the added support vector training set into the constraint condition of the original objective function; then the support vector training set V containing the auxiliary source field data in the support vector machine SVM^sThe optimization process of the training sample T is

Constraint conditions

Wherein N is_tIs the number of i, N_s-N_tIs the number of the j's,

k is the number of training sets of target domain data,

is the support vector of the jth auxiliary source field data, D_tIs the training data corresponding to the target domain data,

representing the distance, γ, of the jth support vector from the training data_t、γ_sRegularization coefficients for the target domain data and the auxiliary source domain data respectively,

is the squared term of the error function;

then, a Lagrange multiplier method is used for optimization, namely, in order to achieve the minimum loss function between the predicted value and the real category label, an SVM function estimation expression added with an auxiliary support vector set, namely an improved SVM function estimation expression is as follows:

and step 42, obtaining corresponding classification results by constructing and combining a plurality of two classifiers.

Further, said step 42 includes obtaining a corresponding classification result by using a decision binary tree method.

Compared with the prior art, the invention has the beneficial effects that:

the invention relaxes the conditions that the training and testing data distribution is the same and the target diagnosis data quantity is sufficient for a data source, and increases the auxiliary source field data, so that the auxiliary source field data can effectively help the target field to realize classification by a migration learning method, in particular to the method which is characterized in that a characteristic value diagonal array can simply reflect the time-frequency distribution characteristics of fault signals, a micro-increment wavelet singular entropy can quantitatively distinguish signals with different time-frequency distributions, quantitatively express the characteristics of the data on the distribution trend, and quantitatively reflect the characteristics of system uncertainty, complexity and the like by the statistical analysis of information. By analyzing the correlation between the specific characteristics of the auxiliary source field and the specific characteristics of the target field and the axis characteristics in an abstract way, the data of each field are effectively mapped to a low-dimensional characteristic space from an original high-dimensional characteristic space, the maximum value of a relation coefficient is solved by a Lagrange multiplier method, three groups of base vectors which can reflect fault categories most are found, and finally, the base vectors in the auxiliary source field are used as support vectors, and the support vectors are given certain weight through penalty terms and are trained with a training set of the target field together with a classifier, so that the base vectors with classification discrimination capability can greatly improve the classification precision.

Drawings

FIG. 1 is a flow chart of the steps corresponding to the method of the present invention;

FIG. 2 is a diagram of core steps corresponding to an embodiment of the method of the present invention;

FIG. 3 is a simplified model of a grid line according to the embodiment of the present invention;

FIG. 4 is a structural diagram of a decision binary tree multi-classification according to the embodiment of the present invention;

fig. 5 shows the projection result of the basis vectors based on the transfer learning according to the embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

1-2, the grid fault classification method based on feature mapping migration learning is characterized by comprising the following steps:

step 1, selecting target domain data to be classified and auxiliary source domain data, wherein the target domain data at least comprises: three-phase current data of each fault line at each fault moment, namely the size and direction of the three-phase current; the auxiliary source domain data includes: three-phase current data of each fault line at the previous fault moment corresponding to each fault moment, three-phase current data of each fault line at the normal operation moment corresponding to each fault moment and three-phase current data of each fault line at the fault moment corresponding to the adjacent line corresponding to the fault line; if the calculation is carried out in 24 hours, the current fault moment is the current fault moment when the fault occurs today, the last fault moment is the previous fault moment, if the fault occurs yesterday, the three-phase current data when the fault occurs today is used as the target field data, and the fault data related yesterday is contained in the source field data;

step 2, fault feature extraction based on micro-increment wavelet singular entropy is respectively carried out on target field data and auxiliary source field data to extract respective corresponding micro-increment wavelet singular entropy, each micro-increment wavelet singular entropy is taken as a fault feature, and then a feature vector space corresponding to the target field and a feature vector space corresponding to the auxiliary source field are respectively formed; further, the step 2 comprises:

step 21, performing m-layer wavelet multi-resolution signal decomposition on the target domain data and the auxiliary source domain data respectively to obtain a wavelet transform coefficient matrix corresponding to a wavelet transform result,obtaining a singular value feature matrix corresponding to the wavelet transform coefficient matrix after singular value decomposition calculation (the singular value feature matrix represents the basic modal feature of the wavelet transform coefficient matrix), and marking as lambda ═ diag (lambda ═ diag)₁,λ₂,…λ_n)；

Step 22, organically combining the wavelet transformation, singular value decomposition and information entropy to form a micro-increment wavelet singular entropy, specifically, n-order micro-increment wavelet singular entropy for respectively constructing target field data and auxiliary source field data, and the corresponding formula is

In the formula, X_iFor the non-zero singular value λ of the ith order_iThe micro-increment wavelet singular entropy;

The corresponding normalized wavelet packet feature vector X_s1 ^*Is represented by X_s1 ^*＝[X₁/X,X₂/X,…,X_n/X]And forming a vector space X of the auxiliary source domain data_s ^*＝[X_s1 ^*,X_s2 ^*,…X_sn ^*](ii) a Vector space X which similarly constitutes target domain data_t ^*＝[X_t1 ^*,X_t2 ^*,…X_tn ^*]. Furthermore, m is often selected according to different fault conditions, and n is generally equal to m²-1, so that the number of layers of the wavelet decomposition can be dynamically adjusted according to the complexity of the fault, and λ_nSatisfies the constraint condition lambda_n/λ₁The singular value feature matrix obtained in the way can reflect fault information most simply.

Step 3, based on the feature mapping migration learning method, the method willThe intersection of the auxiliary source field and the target field is used as an axis feature, and a base vector corresponding to the axis feature, the fault feature of the auxiliary source field and the fault feature of the target field is found out based on a Lagrange multiplier method for solving an extreme value; further, based on the idea of feature mapping migration learning, the intersection of the auxiliary source field and the target field is used as an axis feature, and data of each field is mapped from the original high-dimensional feature space to the low-dimensional feature space, in the low-dimensional space, the source field data and the target field data have similar distribution, so that the source field data and the target field data can be abstracted to analyze the correlation between the characteristic features of the auxiliary source field and the characteristic features of the target field and the axis features, and the maximum value of the relation coefficient is solved by using a Lagrange multiplier method, so that three groups of basis vectors which can most reflect the fault category are found, specifically, the step 3 includes: step 31, defining auxiliary source field X_s ^*The fault identifier of the known fault type is Y, so that a certain fault type identifier Y belongs to Y; auxiliary source field X_s ^*And the target area X_t ^*The intersection of (A) is the corresponding axis feature or is called the domain axis feature X_∩ ^*∈X_s ^*∩X_t ^*Simultaneous calculation of axial features X_∩ ^*And the correlation coefficient between Y, the corresponding calculation formula is as follows:

wherein, P (X)_∩ ^*Y) field axis feature X_∩ ^*Joint distribution probability with fault sign y, correlation coefficient I (X)_∩ ^*And y) the axis features with large values have stronger discriminability for fault types, so the axis features with the maximum correlation coefficient values in m-layer wavelet multi-resolution signal decomposition are selected to form an axis feature set which is recorded as X_∩＝{X_∩1 ^*,X_∩2 ^*,…,X_∩m ^*}; step 32, firstly, based on the union formed by the extracted fault features in the auxiliary source domain data and the target domain data

Wherein | X_∩|，

The value of (a) is selected from,

representing sample points in target domain data

In a feature space

And then linearly combining the values

Namely, based on the following formula

The constraint condition is

Wherein C is_AA＝(A_S∪A_t)(A_S∪A_t)^T

Step 33, finding out the basis vectors corresponding to the axis features, the auxiliary source field fault features, and the target field fault features based on the method for solving the extremum by Lagrange multiplier method, that is, finding out the basis vectors corresponding to the axis features, the auxiliary source field unique features, and the target field unique features based on the following formulas, where the source field unique features refer to the parts (axis features) of the source field features excluding the intersection between the source field and the target field, and the remaining features, and the target field unique features refer to the parts (axis features) of the target field features excluding the intersection between the source field and the target field, and the remaining features:

Step 4, in the fault classification process based on the SVM, taking the base vector corresponding to the auxiliary source field obtained in the step 3 as a support vector; meanwhile, a similarity punishment item of a support vector training set corresponding to the auxiliary source field is added into an original target function of the support vector machine SVM, and a constraint condition of the added support vector training set is added into an original target function constraint condition, so that a classifier is trained together to obtain a corresponding classification result. Further, the step 4 comprises: step 41, in the fault classification process based on the SVM, firstly, the base vector W corresponding to the auxiliary source field obtained in the step 3 is used_SAs a support vector; meanwhile, adding a similarity penalty term of a support vector training set corresponding to the field of the auxiliary source into the original target function in the support vector machine SVM, and recording the similarity penalty term as

And in the original object boxAdding the constraint conditions of the added support vector training set into the number constraint conditions; then the support vector training set V containing the auxiliary source field data in the support vector machine SVM^sThe optimization process of the training sample T is

Wherein

k is the number of training sets of target domain data,

represents the distance of the jth support vector from the training data, if the smaller its value, then

The larger the value is, the greater the classification effect of the support vector on the target domain is, and gamma is_t、γ_sRegularization coefficients for the target domain data and the auxiliary source domain data respectively,

the square term of the error function is used for replacing the original relaxation variable, so that the calculation can be simplified;

wherein sgn represents a sign function, and if the number of the corresponding return value is greater than 0, sgn returns 1, if the number is equal to 0, then 0 is returned, and if the number is less than 0, then-1 is returned.

And step 42, multi-classification of the power grid fault can obtain corresponding classification results by constructing and combining a plurality of two classifiers. Further, the step 42 includes obtaining a corresponding classification result by using a decision binary tree method, for example, dividing all categories into two subclasses by using the decision binary tree method, each subclass further being divided into two subclasses, where a fault can be divided into a ground and a non-ground, and the ground is divided into a single-phase ground (a/b/c) and a two-phase ground (ab/ac/bc); and the method is divided into two-phase short circuit (ab/ac/bc), three-phase short circuit (abc) and the like without ground until the final class is divided.

The scheme of the invention is further illustrated below by taking the specific example as an example:

as shown in fig. 3 to 5, the specific steps in the power grid model are as follows:

setting parameters: as shown in fig. 4, the power grid model is a simplified 500kV double-end power supply transmission system with a total length of 200 km; the circuit model adopts a frequency correlation model to ensure that a calculation result obtained in the transient simulation is more accurate, and the model considers that signals with different frequencies have different attenuation degrees in the transmission process; in the case of power frequency, the positive sequence parameter is r₁＝0.035W/km，x₁＝0.424W/km，b₁＝2.726×10^-6S/km; zero sequence parameter is r₀＝0.3W/km，x₀＝1.143W/km，b₀＝1.936×10^-6S/km; meanwhile, A, B, C three-phase current data of 10 faults under the working conditions of different fault positions, different transition resistances and different fault moments are generated on the power grid model1089 groups in total as samples for fault classification, where A_gFault 105 group, B_gFault 145 group, C_gFailed 90 group, AB_gFault 95 group, BC_gFailed 118 group, AC_gFailure 102 group, AB failure 129 group, BC failure 109 group, AC failure 111 group, and ABC failure 85 group.

Step 2: respectively carrying out m-layer wavelet multi-resolution signal decomposition on target field data and auxiliary source field data to obtain a wavelet transformation coefficient matrix corresponding to a wavelet transformation result, and obtaining a singular value characteristic matrix corresponding to the wavelet transformation coefficient matrix after singular value decomposition calculation, wherein the singular value characteristic matrix is marked as lambda ═ diag (lambda ═ diag)₁,λ₂,…λ_n) (ii) a Taking 3-layer wavelet resolution decomposition of C-phase current in the field of auxiliary sources as an example, the singular value feature matrix is lambda ═ diag (lambda)₁,λ₂,…λ₈) The singular characteristic values obtained after SVD conversion of the C-phase current signals under different types of faults are shown in table 1 (the bold data indicates fault data on the C-phase). As can be seen from Table 1, for the C-phase related faults, the 8 singular values are relatively averaged; and the fault data which is not related to the phase C is relatively uneven.

TABLE 1C-phase current singular diagonal matrix singular eigenvalues of each order

Taking A-phase single-phase grounding as an example, calculating the singular entropy of the micro-increment wavelet as

The same can be obtained

By analogy, X can be obtained_s1＝[X₁,X₂,…,X₆,X₇,X₈]＝[2.198,0.341,-0.345,-0.187,-0.108,-0.196,-0.084,-0.056]，

X_s1 ^*＝[X₁/X,X₂/X,…,X₈/X]＝[0.970,0.151,-0.152,-0.083,-0.047,-0.092,-0.003,-0.001]Similarly, X can be obtained when the B-phase single-phase short circuit occurs_s2 ^*The auxiliary source field vector space X can be obtained by the singular eigenvalues of 10 fault types_s ^*＝[X_s1 ^*,X_s2 ^*,…X_s10 ^*]_8×10And target domain vector space X_t ^*＝[X_t1 ^*,X_t2 ^*,…X_t10 ^*]_8×10。

And step 3: based on the feature mapping migration learning method, three groups of basis vectors which can reflect the fault category most are found, and the method mainly comprises the following steps:

(1) obtaining an auxiliary source domain vector space X_s ^*＝[X_s1 ^*,X_s2 ^*,…X_s10 ^*]_8×10And target domain vector space X_t ^*＝[X_t1 ^*,X_t2 ^*,…X_t10 ^*]_8×10；

(2) From X_∩ ^*∈X_s ^*∩X_t ^*Selecting the axis features with the maximum m phase relation values to form an axis feature set, and recording the axis feature set as X_∩＝{X_∩1 ^*,X_∩2 ^*,…,X_∩m ^*}；

(3) Construction of paired sample sets

(4) Then the eigenvector corresponding to the first m generalized eigenvalues of the above formula matrix is selected as the base vector W_A，W_S，W_T(ii) a According to (1) - (4), the axial feature number is taken as 100, the projection vector dimension is 70, and the obtained basis vector projection result is shown in fig. 5:

and finally, adding a support vector training set to obtain a classification result, specifically, if the C phase is taken as a special phase at first, 1 represents that the phase is a fault phase, and 0 represents that the phase is a non-fault phase, table 2 lists part of training samples and coding conditions with the C phase as the special phase, and other conditions are similar.

TABLE 2C phase Fault codes for Special phases

The classification test results of the grid faults after the support vector training set is added are as follows: as can be seen from Table 3, after the support vector training set is added, various faults can be correctly identified, the average accuracy of fault classification can reach more than 99%, and the method is obviously improved compared with the method without the support vector training set.

TABLE 3 Fault Classification test results statistics

Table 4 shows that the fault classification method of the improved SVM after the support vector training set is added is basically not influenced by the fault time, the fault position and the transition resistance, and the misjudgment of the algorithm is possibly caused only when the high-resistance fault occurs at the tail end of the power transmission line through the discovery of the misjudgment samples.

TABLE 4 Fault Classification results under different conditions

In order to verify the adaptability of the fault classification method based on the transfer learning to the parameter change of the power grid line, the trained improved SVM model is used for testing 3 pieces of line fault sample data with different parameters, the line parameters are shown in a table 5, and the test result of each line is shown in a table 6. As can be seen from Table 6: the fault classification accuracy of different power transmission lines can reach more than 98% by using a fault classification method based on transfer learning, which shows that the fault classification method can be well adapted to the change of line parameters; meanwhile, the method can quickly realize the whole process of extracting the characteristics to classify the faults, the time required for classifying and identifying 1 sample data is less than 0.2 s, and the requirement of fault diagnosis on diagnosis time is met.

TABLE 5 parameters of 3 lines in the grid model

TABLE 6 Fault Classification results for different grid lines

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art should be considered to be within the technical scope of the present invention, and the technical solutions and the inventive concepts thereof according to the present invention should be equivalent or changed within the scope of the present invention.

Claims

1. A power grid fault classification method based on feature mapping transfer learning is characterized by comprising the following steps:

step 1, target field data to be classified and auxiliary source field data are selected, wherein the target field data comprise: three-phase current data of each fault line at each fault moment; the auxiliary source domain data includes: three-phase current data of each fault line at the previous fault moment corresponding to each fault moment, three-phase current data of each fault line at the previous normal operation moment corresponding to each fault moment and three-phase current data of adjacent lines of the fault line at each fault moment;

step 3, based on a feature mapping migration learning method, finding out base vectors corresponding to the axis features, the characteristic features of the auxiliary source field and the characteristic features of the target field by using the intersection of the auxiliary source field and the target field as the axis features and solving an extreme value based on a Lagrange multiplier method;

2. The grid fault classification method according to claim 1, characterized in that:

the step 2 comprises the following steps:

step 21, respectively performing m-layer wavelet multi-resolution signal decomposition on the target field data and the auxiliary source field data to obtain a wavelet transformation coefficient matrix corresponding to a wavelet transformation result, and obtaining a singular value feature matrix corresponding to the wavelet transformation coefficient matrix after singular value decomposition calculation, and recording the singular value feature matrix as Λ ═ diag (λ ═ diag)₁,λ₂,…λ_n)；

The corresponding normalized wavelet packet feature vector X_s1 ^*Is represented by X_s1 ^*＝[X₁/X,X₂/X,…,X_n/X]And forming a feature vector space X of the auxiliary source field data_s ^*＝[X_s1 ^*,X_s2 ^*,…X_sn ^*](ii) a Repeating the above steps to form a feature vector space X of the target field data_t ^*＝[X_t1 ^*,X_t2 ^*,…X_tn ^*]。

3. The grid fault classification method according to claim 2, characterized in that:

n-m in the singular value feature matrix²-1 and such that λ_nAnd the constraint condition is satisfied.

4. The grid fault classification method according to claim 1, characterized in that:

the step 3 comprises the following steps:

step 31, defining auxiliary source field data X_s ^*The fault identifier of the known fault type is Y, so that a certain fault type identifier Y belongs to Y; auxiliary source domain data X_s ^*And target domain data X_t ^*The intersection of (A) is the corresponding axis feature or the domain axis feature, denoted as X_∩ ^*∈X_s ^*∩X_t ^*Simultaneous calculation of axial features X_∩ ^*And the correlation coefficient between Y, the corresponding calculation formula is as follows:

Three groups of random variables α are set, and the number of gamma is n_sThree sets of paired sample sets of random variables α, γ, were constructed and labeled

Wherein | X_∩|，

The value of (a) is selected from,

representing sample points in target domain data

In a feature space

The value of (a) is selected from,

then linearly combining

Namely, based on the following formula

The constraint condition corresponding to the formula is

C_AA＝(A_S∪A_t)(A_S∪A_t)^T

C_ASST＝S^TA_ST^T

A matrix of dimensions, T being about β

step 33, finding out the basis vectors corresponding to the axis features, the characteristic features of the auxiliary source field and the characteristic features of the target field based on the method of solving the extremum by the lagrange multiplier method, that is, finding out the basis vectors corresponding to the axis features, the characteristic features of the auxiliary source field and the characteristic features of the target field based on the following formulas:

5. The grid fault classification method according to claim 1, characterized in that:

the step 4 comprises the following steps:

Wherein N is_tIs the number of i, N_s-N_tIs the number of the j's,

k is the number of training sets of target domain data,

is the squared term of the error function;

then, optimizing by using a Lagrange multiplier method, namely adding an SVM function estimation expression of an auxiliary support vector set when the loss function between the predicted value and the real category label is minimum, wherein the SVM function estimation expression is as follows:

6. The grid fault classification method according to claim 5, characterized in that:

said step 42 comprises obtaining the corresponding classification result using a decision binary tree method.