CN114118251A

CN114118251A - Fault diagnosis and early warning method based on multi-source data fusion and convolution twin neural network

Info

Publication number: CN114118251A
Application number: CN202111390122.2A
Authority: CN
Inventors: 王振浩; 许超; 李小兰; 杨波; 李泽曦; 张琦; 刘东延; 卢毅; 杨旭; 郑舒文; 谭澈; 赵宁; 孙守道; 谢杰; 赵贝加; 张志鹏
Original assignee: Shenyang Aibeike Technology Co ltd; State Grid Corp of China SGCC; Shenyang Power Supply Co of State Grid Liaoning Electric Power Co Ltd
Current assignee: Shenyang Aibeike Technology Co ltd; State Grid Corp of China SGCC; Shenyang Power Supply Co of State Grid Liaoning Electric Power Co Ltd
Priority date: 2021-11-22
Filing date: 2021-11-22
Publication date: 2022-03-01

Abstract

The invention discloses a fault diagnosis and early warning method based on multi-source data fusion and a convolution twin neural network, which is characterized in that collected vibration information of a power transformer, concentration information of dissolved gas in oil, infrared information of the power transformer, voltage information and various types of data in current information are subjected to standardized integration by constructing a data dictionary, data information is extracted through double-space characteristics, fault recognition and classification of the power transformer are realized by utilizing the convolution twin neural network, and the fault diagnosis efficiency can be improved.

Description

Fault diagnosis and early warning method based on multi-source data fusion and convolution twin neural network

Technical Field

The invention relates to the technical field of power transformer fault diagnosis, in particular to a fault diagnosis and early warning method based on multi-source data fusion and a convolution twin neural network.

Background

For a long time, the maintenance strategy of the transformer in China mainly adopts regular maintenance taking time as a standard, has the defects of excessive maintenance and insufficient maintenance, has great blindness and compendity, causes great waste of manpower and material resources, and increases the probability of generating new hidden dangers. In consideration of the big data characteristic of the operation information of the transformer and the diversity and complexity of fault types, a new technological revolution and an industrial revolution require the intelligent and high-end transformation of electric equipment. The artificial intelligence algorithm can establish a simulated nervous system for information processing, and can form a characteristic deep learning sample by using a large amount of transformer fault instance data, so that the method has great advantages in the aspect of nonlinear complex fault identification.

With the increase of the capacity of a power grid and the proposal of a concept of a smart power grid, an intelligent substation is produced, and the continuous development of sensing monitoring equipment and the large-scale deployment and application on the surface and inside of a power transformer greatly improve the requirements on the intelligent level of monitoring, diagnosis and positioning of the power transformer equipment, and the data of the intelligent substation gradually presents typical big data characteristics of multiple sources, isomerism and the like.

In order to improve the safe operation level of a large-scale high-voltage power grid, a new method for multi-source information fusion fault diagnosis and residual life prediction of a power transformer is explored, the defects of 'maintenance surplus' and 'maintenance deficiency' existing at present are overcome, and the requirements of a new technological revolution and industrial revolution of power equipment towards intellectualization and high-end transformation are met. Concepts such as artificial intelligence, deep learning and the like are introduced to inject new vitality into the online fault monitoring, diagnosis and positioning of the power transformer under the big data background, and a brand-new thought and research direction are provided.

Disclosure of Invention

In view of the above, the invention provides a fault diagnosis and early warning method based on multi-source data fusion and convolution twin neural network, so as to realize on-line fault monitoring of a power transformer through artificial intelligence and deep learning.

The invention provides a technical scheme, in particular to a fault diagnosis and early warning method based on multi-source data fusion and a convolution twin neural network, which comprises the following steps:

s1: acquiring vibration information of the power transformer, concentration information of dissolved gas in oil, infrared information of the power transformer, voltage information and current information in real time and acquiring data;

s2: preprocessing the acquired power transformer data, eliminating abnormal and redundant data, and constructing a data dictionary to realize standardized integrated classification of multi-source heterogeneous data;

s3: and after extracting principal component data characteristics of abnormal state data in the multi-source heterogeneous data subjected to standardized integrated classification, inputting the extracted principal component data characteristics into a trained multiple convolution twin network model, preliminarily diagnosing, forming a multi-evidence body, fusing by utilizing an improved DS evidence theory, and finally obtaining a fault type and early warning.

Preferably, in step S2, the constructing a data dictionary realizes standardized integrated classification of multi-source heterogeneous data, and specifically includes:

s201: representing the preprocessed power transformer data by adopting a quintuple (epsilon NAME, pi INFORMATION, omega CONTENT, gamma CONT-MATION, tau IDENTIFY), wherein epsilon NAME is a source data NAME, pi INFORMATION is source data marking INFORMATION, omega CONTENT is source data CONTENT, gamma CONT-MATION is source data CONTENT marking INFORMATION, and tau IDENTIFY is identification data;

s202: extracting identification data information in power transformer data expressed by quintuple, and defining identification base as

If C_r≥C_r,baseIf the data is successfully identified, adding the data into a data type space corresponding to the data dictionary, otherwise, abandoning the data;

wherein, tau_new,kRepresenting newly acquired identification data; tau is_kThe identification data of the data type corresponding to the data dictionary is represented; c_r,baseThe identification reference value is shown.

Further preferably, in step S3, the training process of the multiple convolution twin network model is as follows:

1) acquiring vibration information of the power transformer, concentration information of dissolved gas in oil, infrared information of the power transformer, voltage information and current information and acquiring data;

2) preprocessing the acquired power transformer data, eliminating abnormal and redundant data, and constructing a data dictionary to realize standardized integrated classification of multi-source heterogeneous data;

3) extracting principal component data characteristics of abnormal state data in the multi-source heterogeneous data classified by the standardized integration, and dividing a training set and a test set;

4) inputting training data in a training set into a multiple convolution twin network for model training;

5) inputting the test data in the test set into the multiple convolution twin network diagnostic model trained in the step 4) for preliminary diagnosis, then forming multiple evidence bodies, and finally obtaining the fault type by utilizing improved DS evidence theory fusion.

Further preferably, in step 2), the data dictionary is constructed to realize standardized integrated classification of multi-source heterogeneous data, and specifically:

2.1) representing the preprocessed power transformer data by quintuple (epsilon NAME, pi INFORMATION, omega CONTENT, gamma CONT-MATION and tau IDENTIFY), wherein epsilon NAME is a source data NAME, pi INFORMATION is source data marking INFORMATION, omega CONTENT is source data CONTENT, gamma CONT-MATION is source data CONTENT marking INFORMATION, and tau IDENTIFY is identification data;

2.2) extracting identification data information in the power transformer data expressed by quintuple and defining an identification base

Further preferably, in step 3), extracting principal metadata features of abnormal state data in the multi-source heterogeneous data classified by the standardized integration, and dividing a training set and a test set, specifically:

3.1) transforming the acquired data into a matrix X, wherein the size of the matrix X is n multiplied by m, n is the number of samples, and m is the number of variables;

3.2) inputting the matrix X into a PCA space model, reducing high-dimensional data into low-dimensional data, and reserving main characteristic information of the original data, wherein the PCA space model is as follows:

wherein, t_iAs a principal component vector, p_iIs a load vector;

3.3) dividing the obtained labeled principal component vectors into a training set and a testing set, wherein the training set accounts for 80 percent, and the testing set accounts for 20 percent.

Further preferably, step 4) inputs training data in the training set into the multiple convolution twin network for model training, specifically:

4.1) initializing convolution twin network parameters and structures, converting input data into a 3 x 3 matrix, then inputting convolution layers with the number of convolution kernels of 16 and the size of 2 x 2 into the network, wherein the expression is as follows:

y_i＝f(x_i*ω_i+b_i)

wherein, y_iFor the output of convolutional layer i, f () is the activation function ReLU, x_iFor input data, ω_iAs weights of convolution kernels, b_iIs a bias vector.

4.2) inputting the convolution layer output to a pooling layer with the size of 2 multiplied by 2, adopting a maximum pooling mode, and having the expression:

wherein, y_maxOutputting data for the pooling layer, wherein R is a pooling area;

4.3) inputting output data of the pooling layer into a convolution layer with the size of 2 multiplied by 2 and the number of convolution kernels of 32, inputting the output data of the convolution layer into a maximum pooling layer, and then inputting the output data of the maximum pooling layer into a full-connection layer, wherein the expression is as follows:

y′_i＝g(y′_iω′_i+b′_i)

wherein, y'_iOutputting data for full connectivity layer, ω'_i,b′_iRespectively, the weights and bias vectors of the fully connected layers.

4.4) judging the error between the loss function and the expected target, if the error meets the requirement, finishing the model training, wherein the loss function expression is as follows:

L(W,(Y,X₁,X₂)ⁱ)＝(1-Y)L_G(E_W(X₁,X₂)ⁱ)+T

T＝YL₁(E_W(X₁,X₂)ⁱ)

where P is the total input sample, Y is whether the two input samples are the same fault, L_GFor two samples being loss functions of the same type, L₁The same type of loss function is used for both samples.

Further preferably, the step 4) of inputting the training data in the training set into the multiple convolution twin network for model training further includes the following steps:

4.5) if the fault type of the input sample is difficult to judge, inputting the original input data into a KICA space, mapping the data to a high-dimensional space for principal component calculation, wherein the calculation method is as follows:

where K is the kernel matrix, f_tr() Calculating trace;

4.6) mapping data to a high-dimensional feature space:

wherein the feature matrix v ═ v₁,v₂,...,v_h]，Λ＝diag(λ₁,λ₂,...,λ_h)；

4.7) calculating an independent principal element s according to the following calculation formula:

s＝W·z；

4.8) inputting the characteristics extracted by the KICA space into a convolution twin network with the same structure and parameters for model training.

Further preferably, step 5) inputs the test data in the test set into a multiple convolution twin network diagnosis model for preliminary diagnosis, then forms multiple evidence bodies, and finally obtains the fault type by utilizing improved DS evidence theory fusion, specifically:

5.1) inputting test data in the test set into the multiple convolution twin network diagnostic model trained in the step 4), and constructing a BPA function according to the obtained classification result, wherein the expression is as follows:

wherein m is_i(E_M) Is E_MBPA, E_MAs a fault classType A_iFor fault diagnosis accuracy of the ith convolution twin neural network, P_iMThe classification result belongs to E when the ith convolution twin neural network is used for carrying out diagnosis classification on the sample_MThe probability of (d);

5.2) carrying out evidence correction by using a Langmuir distance function, wherein the correction expression is as follows:

wherein d is_ij(L) is the Langmuir distance of the two evidences, D_iAs evidence body m_iTotal distance from other evidential bodies, R_rel(m_i) Evaluating the reliability of the evidence;

5.3) performing evidence fusion according to a DS synthesis rule to finally obtain the fault type of the power transformer, wherein the synthesis rule is as follows:

wherein K is a conflict factor of the conflict degree of the two evidences.

According to the fault diagnosis and early warning method based on the multi-source data fusion and the convolution twin neural network, the collected vibration information of the power transformer, the concentration information of the dissolved gas in the oil, the infrared information of the power transformer, the voltage information and the current information of each type of data are subjected to standardized integration by constructing a data dictionary, the data information is extracted through double-space characteristics, the fault recognition and classification of the power transformer are realized by utilizing the convolution twin neural network, and the fault diagnosis efficiency can be improved.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention.

In order to more clearly illustrate the embodiments or technical solutions in the prior art of the present invention, the drawings used in the description of the embodiments or prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained based on these drawings without creative efforts.

FIG. 1 is a schematic flow chart of a fault diagnosis and early warning method based on a multi-source data fusion and convolution twin neural network according to an embodiment of the disclosure;

FIG. 2 is a flow chart of the dual spatial feature extraction and convolution twin neural network in an embodiment of the disclosure.

Detailed Description

Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present invention. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the invention, as detailed in the appended claims.

In order to realize the online fault monitoring of the power transformer through artificial intelligence and deep learning, the embodiment provides a fault diagnosis and early warning method based on multi-source data fusion and a convolution twin neural network, and referring to fig. 1, the diagnosis method comprises the following steps:

Step S1 connects vibration sensor, gas sensor, voltage transformer, current transformer and infrared sensor through the FPGA device to acquire vibration data, gas concentration data, voltage current data and infrared data, and through the 5G communication module, the bus communication module is used to transmit the acquired data information to the memory in the edge computing module for storage, and meanwhile, the 5G communication protocol is used to transmit the acquired transformer data to the cloud for storage as a data copy.

The collected data are transmitted to the 5G communication module through an RS485 protocol, the data are transmitted to the cloud server as backup data according to a 5G communication data analysis protocol, meanwhile, the FPGA module transmits the data to a memory of the edge computing module for storage through serial port to USB, and the specific computing process of the edge computing module is as described in steps S2 and S3.

In step S3, a double-space feature extraction method is used to extract principal metadata features of abnormal state data in the multi-source heterogeneous data subjected to standardized, integrated and classified, wherein a specific feature extraction process and a method for improving DS evidence theory fusion will be described in detail in a subsequent model training process, and will not be described in detail here.

In step S2, the constructing of the data dictionary realizes standardized integrated classification of multi-source heterogeneous data, which specifically includes:

The training process of the multiple convolution twin network model in the step S3 is as follows:

In step 2), the data dictionary is constructed to realize the standardized integrated classification of the multi-source heterogeneous data, and the method specifically comprises the following steps:

2.2) judging the relevance of the data every time new data is collected. And if the data object is judged to be related to the main body, dividing the data object into corresponding data groups for storage. Specifically, identification data information in power transformer data expressed by quintuple is extracted, and an identification base is defined

Further preferably, in step 3), principal metadata features of abnormal state data in the multi-source heterogeneous data classified by the standardized integration are extracted, and a training set and a test set are divided, wherein the principal metadata features are extracted by a double-space feature extraction method, specifically:

wherein, t_iAs a principal component vector, p_iIs a load vector;

y_i＝f(x_i*ω_i+b_i)

y′_i＝g(y′_iω′_i+b′_i)

L(W,(Y,X₁,X₂)ⁱ)＝(1-Y)L_G(E_W(X₁,X₂)ⁱ)+T

T＝YL₁(E_W(X₁,X₂)ⁱ)

where K is the kernel matrix, f_tr() Calculating trace;

4.6) mapping data to a high-dimensional feature space:

s＝W·z；

Further preferably, in step 5), the test data in the test set is input into the multiple convolution twin network diagnostic model for preliminary diagnosis, then multiple evidence bodies are formed, and the fault type is finally obtained by utilizing improved DS evidence theory fusion, which can be seen in fig. 2 specifically as follows:

wherein m is_i(E_M) Is E_MBPA, E_MAs a type of failure, A_iFor fault diagnosis accuracy of the ith convolution twin neural network, P_iMThe classification result belongs to E when the ith convolution twin neural network is used for carrying out diagnosis classification on the sample_MThe probability of (d);

wherein K is a conflict factor of the conflict degree of the two evidences.

Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.

It is to be understood that the present invention is not limited to what has been described above, and that various modifications and changes may be made without departing from the scope thereof. The scope of the invention is limited only by the appended claims.

Claims

1. A fault diagnosis and early warning method based on multi-source data fusion and convolution twin neural network is characterized by comprising the following steps:

2. The fault diagnosis and early warning method based on the multi-source data fusion and convolution twin neural network of claim 1, wherein in step S2, the data dictionary is constructed to realize standardized integrated classification of multi-source heterogeneous data, specifically:

s202: extracting identification data information in power transformer data expressed by quintuple and defining identification base

If C_r≥C_r，baseIf the data is successfully identified, adding the data into a data type space corresponding to the data dictionary, otherwise, abandoning the data;

wherein, tau_new，kRepresenting newly acquired identification data; tau is_kThe identification data of the data type corresponding to the data dictionary is represented; c_r，baseThe identification reference value is shown.

3. The fault diagnosis and early warning method based on the multi-source data fusion and the convolution twin neural network as claimed in claim 1, wherein in step S3, the training process of the multi-source data fusion and convolution twin neural network model is as follows:

5) and inputting the test data in the test set into a multiple convolution twin network diagnosis model for preliminary diagnosis, then forming a multiple evidence body, and finally obtaining the fault type by utilizing the improved DS evidence theory fusion.

4. The fault diagnosis and early warning method based on the multi-source data fusion and convolution twin neural network according to claim 3, wherein in step 2), the data dictionary is built to realize standardized integrated classification of multi-source heterogeneous data, and specifically:

5. The fault diagnosis and early warning method based on the multi-source data fusion and convolution twin neural network according to claim 3, characterized in that, step 3) extracts principal component data features of abnormal state data in the multi-source heterogeneous data classified by standardized integration, and divides a training set and a test set, specifically:

wherein, t_iAs a principal component vector, p_iIs a load vector;

6. The fault diagnosis and early warning method based on the multi-source data fusion and convolution twin neural network according to claim 3, wherein step 4) inputs training data in a training set into the multi-source convolution twin neural network for model training, specifically:

y_i＝f(x_i*ω_i+b_i)

y′_i＝g(y′_iω′_i+b′_i)

wherein, y'_iOutputting data for full connectivity layer, ω'_i，b′_iRespectively, the weights and bias vectors of the fully connected layers.

L(W，(Y，X₁，X₂)ⁱ)＝(1-Y)L_G(E_w(X₁，X₂)ⁱ)+T

T＝YL₁(E_w(X₁，X₂)ⁱ)

7. The fault diagnosis and early warning method for the multi-source data fusion and convolution twin neural network according to claim 6, wherein step 4) inputs training data in a training set into the multi-source convolution twin network for model training, and further comprises the following steps:

where K is the kernel matrix, f_tr() Calculating trace;

4.6) mapping data to a high-dimensional feature space:

wherein the feature matrix v ═ v₁，v₂，...，v_h]，Λ＝diag(λ₁，λ₂，...，λ_h)；

s＝W·z；

8. The fault diagnosis and early warning method for the multi-source data fusion and convolution twin neural network according to claim 3, characterized in that step 5) inputs test data in a test set into a multi-convolution twin network diagnosis model for preliminary diagnosis, then forms a multi-evidence body, and finally obtains a fault type by improving DS evidence theory fusion, specifically:

wherein K is a conflict factor of the conflict degree of the two evidences.