CN111638428B - GIS-based ultrahigh frequency partial discharge data processing method and system - Google Patents
GIS-based ultrahigh frequency partial discharge data processing method and system Download PDFInfo
- Publication number
- CN111638428B CN111638428B CN202010511393.8A CN202010511393A CN111638428B CN 111638428 B CN111638428 B CN 111638428B CN 202010511393 A CN202010511393 A CN 202010511393A CN 111638428 B CN111638428 B CN 111638428B
- Authority
- CN
- China
- Prior art keywords
- data
- dimensional time
- partial discharge
- time sequence
- correlation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 5
- 238000012545 processing Methods 0.000 claims abstract description 48
- 238000004140 cleaning Methods 0.000 claims abstract description 45
- 238000000034 method Methods 0.000 claims abstract description 41
- 238000010219 correlation analysis Methods 0.000 claims abstract description 21
- 230000008569 process Effects 0.000 claims description 15
- 238000001514 detection method Methods 0.000 claims description 9
- 230000004927 fusion Effects 0.000 claims description 8
- 238000012937 correction Methods 0.000 claims description 7
- 238000000354 decomposition reaction Methods 0.000 claims description 7
- 238000009826 distribution Methods 0.000 claims description 6
- 239000002245 particle Substances 0.000 claims description 5
- 238000012935 Averaging Methods 0.000 claims description 4
- 239000011159 matrix material Substances 0.000 claims description 4
- 239000000725 suspension Substances 0.000 claims description 4
- 238000009827 uniform distribution Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims 4
- 230000002194 synthesizing effect Effects 0.000 claims 1
- 238000000605 extraction Methods 0.000 abstract description 4
- 230000009466 transformation Effects 0.000 abstract description 3
- 238000004458 analytical method Methods 0.000 abstract description 2
- 238000003745 diagnosis Methods 0.000 abstract description 2
- 230000000694 effects Effects 0.000 description 5
- 238000012423 maintenance Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000009413 insulation Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01R—MEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
- G01R31/00—Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
- G01R31/12—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing
- G01R31/1227—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing of components, parts or materials
- G01R31/1263—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing of components, parts or materials of solid or fluid materials, e.g. insulation films, bulk material; of semiconductors or LV electronic components or parts; of cable, line or wire insulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Pure & Applied Mathematics (AREA)
- Computational Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Evolutionary Biology (AREA)
- Operations Research (AREA)
- Probability & Statistics with Applications (AREA)
- Bioinformatics & Computational Biology (AREA)
- Algebra (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Testing Relating To Insulation (AREA)
Abstract
本发明提供了一种基于GIS特高频局部放电数据的处理方法及系统,处理方法包括数据增强及清洗,采用故障特征提取、故障特征数据相关性分析、数据噪声添加与数据加权实现数据增强;并根据不同标签获取对应的清洗因子实现数据清洗,进而确保增强数据的有效性,实现GIS特高频局部放电的类型诊断。本发明通过相关系数分析的方法实现数据处理、增加噪声与加权处理对数据进行增强及采用清洗特征因子对增强数据进行清洗的方法,能够在处理GIS局部放电数据样本不足时,提高准确率和泛化能力。
The invention provides a method and system for processing UHF partial discharge data based on GIS. The processing method includes data enhancement and cleaning, and adopts fault feature extraction, fault feature data correlation analysis, data noise addition and data weighting to achieve data enhancement; And according to different labels to obtain the corresponding cleaning factors to achieve data cleaning, thereby ensuring the effectiveness of enhanced data, to achieve GIS UHF partial discharge type diagnosis. The invention realizes data processing through the method of correlation coefficient analysis, enhances the data by adding noise and weighting processing, and uses the cleaning characteristic factor to clean the enhanced data, which can improve the accuracy and generalization when the GIS partial discharge data samples are insufficient. transformation ability.
Description
技术领域technical field
本发明涉及局部放电检测技术领域,尤其是一种基于GIS特高频局部放电数据的处理方法及系统。The invention relates to the technical field of partial discharge detection, in particular to a method and system for processing UHF partial discharge data based on GIS.
背景技术Background technique
随着“三型两网”的提出,智能电网得到快速的发展,智能检测终端作为智能运维实现的基础,也是电力设备能否可靠、安全运行的保障。GIS是由断路器、隔离开关、接地开关、互感器、避雷器、母线、连接件和出线终端等组成,这些设备或部件全部封闭在金属接地的外壳中,在其内部充有一定压力的SF6绝缘气体。GIS的优点在于结构紧凑、占地面积小、可靠性高、配置灵活、安装方便、安全性强、环境适应能力强。With the introduction of "three types and two networks", the smart grid has developed rapidly. As the basis for the realization of intelligent operation and maintenance, the intelligent detection terminal is also the guarantee for the reliable and safe operation of power equipment. GIS is composed of circuit breakers, isolating switches, grounding switches, transformers, arresters, busbars, connectors and outgoing terminals, etc. These devices or components are all enclosed in a metal grounded shell, which is filled with a certain pressure of SF6 insulation. gas. The advantages of GIS lie in its compact structure, small footprint, high reliability, flexible configuration, convenient installation, strong safety, and strong environmental adaptability.
GIS的全密封结构也导致GIS局部放电的定位及检修比较困难,检修工作繁杂,事故后平均停电检修时间比常规设备长,其停电范围大,常涉及非故障元件。因此,电力公司经常开展GIS的局部放电检测工作,为了实现GIS的局放检测能够准确的识别故障类型,需对GIS特高频局部放电数据进行研究。但目前GIS特高频局部放电检测的数据样本很少,缺乏足量的数据集对GIS的特征研究,造成GIS故障类型无法精确识别及检修工作的安排。The fully-sealed structure of GIS also makes the localization and maintenance of partial discharge in GIS difficult, and the maintenance work is complicated. Therefore, power companies often carry out GIS partial discharge detection work. In order to realize GIS partial discharge detection can accurately identify fault types, it is necessary to study GIS UHF partial discharge data. However, at present, there are very few data samples for GIS UHF partial discharge detection, and there is a lack of sufficient data sets to study the characteristics of GIS, resulting in the inability to accurately identify GIS fault types and arrange maintenance work.
而目前现有针对GIS特高频数据的数据增强方法较为稀少,常见的如不同方式的变换与剪切等,且往往会造成对原始特征数据的损坏,使得故障类型的识别准确率比较低,导致设备运维策略的失算。At present, the existing data enhancement methods for GIS UHF data are relatively rare, such as transformation and clipping in different ways, which often cause damage to the original feature data, making the identification accuracy of fault types relatively low. This leads to miscalculation of equipment operation and maintenance strategies.
发明内容SUMMARY OF THE INVENTION
本发明提供了一种基于GIS特高频局部放电数据的处理方法及系统,用于解决现有GIS特高频局部放电检测的数据样本少,数据增强方法不合理的问题。The invention provides a processing method and system based on GIS ultra-high frequency partial discharge data, which is used to solve the problems that the existing GIS ultra-high frequency partial discharge detection has few data samples and the data enhancement method is unreasonable.
为实现上述目的,本发明采用下述技术方案:To achieve the above object, the present invention adopts the following technical solutions:
本发明第一方面提供了一种基于GIS特高频局部放电数据的处理方法,所述方法包括以下步骤:A first aspect of the present invention provides a method for processing UHF partial discharge data based on GIS, the method comprising the following steps:
获取不同标签下的GIS特高频局部放电数据;Obtain GIS UHF partial discharge data under different labels;
对所述局部放电数据进行特征值提取,形成二维时序数据A;Extracting feature values from the partial discharge data to form two-dimensional time series data A;
将所述二维时序数据A与对应的标签进行相关性分析,得到第一相关系数;Correlation analysis is performed on the two-dimensional time series data A and the corresponding label to obtain a first correlation coefficient;
根据相关性阈值,将所述二维时序数据A分为A1和A2,其中A2为第一相关系数小于所述相关性阈值的二维时序数据;According to the correlation threshold, the two-dimensional time series data A is divided into A1 and A2, wherein A2 is the two-dimensional time series data whose first correlation coefficient is less than the correlation threshold;
对A2增加噪声处理,生成二维时序数据B;Add noise processing to A2 to generate two-dimensional time series data B;
将所述二维时序数据B与对应的标签进行相关性分析,得到第二相关系数;Carrying out a correlation analysis between the two-dimensional time series data B and the corresponding label to obtain a second correlation coefficient;
根据相关性阈值,将所述二维时序数据B分为B1和B2,其中B2为第二相关系数小于所述相关阈值的二维时序数据;According to the correlation threshold, the two-dimensional time series data B is divided into B1 and B2, wherein B2 is the two-dimensional time series data whose second correlation coefficient is less than the correlation threshold;
将B2进行加权融合,并与B1拼合,然后与A1进行合成,得到二维时序数据C;B2 is weighted and fused, combined with B1, and then combined with A1 to obtain two-dimensional time series data C;
将所述二维时序数据C与对应的标签进行相关性分析,得到第三相关系数,获取对应标签的清洗特征因子,基于所述清洗特征因子对C进行清洗,得到数据样本D。Correlation analysis is performed between the two-dimensional time series data C and the corresponding label, a third correlation coefficient is obtained, the cleaning characteristic factor of the corresponding label is obtained, C is cleaned based on the cleaning characteristic factor, and a data sample D is obtained.
进一步地,所述标签包括气隙、悬浮、电晕、沿面和颗粒。Further, the label includes air gap, suspension, corona, creepage and particle.
进一步地,所述特征值通过区域均值分解法得到。Further, the eigenvalues are obtained by the regional mean decomposition method.
进一步地,所述对A2增加噪声处理的具体过程为:Further, the specific process of adding noise to A2 is as follows:
在二维时序数据A2中增加高斯随机变量并进行数据修正。A Gaussian random variable is added to the two-dimensional time series data A2 and data correction is performed.
进一步地,所述高斯随机变量rv的计算方式如下:Further, the calculation method of the Gaussian random variable rv is as follows:
rv=sqrt(-2.0*log(U1))*cos(2*π*U2)rv=sqrt(-2.0*log(U 1 ))*cos(2*π*U 2 )
其中随机变量U1和U2表示为:where the random variables U1 and U2 are expressed as:
式中,随机变量U1、U2相互独立,且均服从(0,1)之间的均匀分布;随机变量Z0,Z1服从标准高斯分布且满足正态分布,均值为0,方差为1。In the formula, the random variables U 1 and U 2 are independent of each other, and both obey the uniform distribution between (0, 1); the random variables Z 0 , Z 1 obey the standard Gaussian distribution and satisfy the normal distribution, the mean is 0, and the variance is 1.
进一步地,所述进行数据修正的具体过程为:Further, the specific process of performing data correction is:
定义待增强样本数据A2为dstImage[x][y],定义增强后样本数据B为EnhDstImage[x][y],定义增加高斯随机变量rv后的值为val,通过下式计算val:Define the sample data A2 to be enhanced as dstImage[x][y], define the enhanced sample data B as EnhDstImage[x][y], define the value of val after adding the Gaussian random variable rv, and calculate val by the following formula:
val=dstImage[x][y]+rvval=dstImage[x][y]+rv
对val的范围进行修正:Correct the range of val:
如果val<0,则val=0;If val<0, then val=0;
如果val>255,则val=255;If val>255, then val=255;
增强后样本数据B重新定义:The enhanced sample data B is redefined:
EnhDstImage[x][y]=valEnhDstImage[x][y]=val
对dstImage数组中所有数据经过上述处理后,形成增强后的样本数据EnhDstImage。After all the data in the dstImage array is processed above, the enhanced sample data EnhDstImage is formed.
进一步地,所述将B2进行加权融合具体为:Further, the weighted fusion of B2 is specifically:
B2[x][y]=a1*W1+a2*W2+…+an*WnB2[x][y]=a1*W1+a2*W2+…+an*Wn
其中,W1+W2+…+Wn=1;a1、a2、…an为第x行特征数据。Among them, W1+W2+...+Wn=1; a1, a2,...an are the feature data of the xth row.
进一步地,所述获取对应标签的清洗特征因子的具体过程为:Further, the specific process of obtaining the cleaning characteristic factor of the corresponding label is:
将得到的第三相关系数按照数值大小进行排序;Sort the obtained third correlation coefficient according to the numerical value;
根据排序结果将所述第三相关系数进行分段,并分别计算各分段的平均值;According to the sorting result, the third correlation coefficient is segmented, and the average value of each segment is calculated respectively;
将所述各分段的平均值再去均值得到清洗特征因子。The average value of each segment is then averaged to obtain the cleaning characteristic factor.
进一步地,所述基于所述清洗特征因子对C进行清洗的具体过程为:Further, the specific process of cleaning C based on the cleaning characteristic factor is:
比较所述清洗特征因子与第三相关系数矩阵;comparing the cleaning characteristic factor with the third correlation coefficient matrix;
若清洗特征因子大于对应的第三相关系数,则丢弃当前第三相关系数对应的数据,并将改数据对应的前后数据值取平均,取代当前丢弃的数据;If the cleaning characteristic factor is greater than the corresponding third correlation coefficient, the data corresponding to the current third correlation coefficient is discarded, and the data values before and after the modified data are averaged to replace the currently discarded data;
否则,保留当前第三相关系数对应的数据。Otherwise, the data corresponding to the current third correlation coefficient is retained.
本发明第二方面提供了一种基于GIS特高频局部放电数据的处理系统,所述系统包括:A second aspect of the present invention provides a GIS-based UHF partial discharge data processing system, the system comprising:
数据采集单元,用于获取不同标签下的GIS特高频局部放电数据;The data acquisition unit is used to obtain GIS UHF partial discharge data under different labels;
第一数据处理单元,用于对所述局部放电数据进行特征值提取,形成二维时序数据A;a first data processing unit, configured to perform feature value extraction on the partial discharge data to form two-dimensional time series data A;
第二数据处理单元,将所述二维时序数据A与对应的标签进行相关性分析,得到第一相关系数;The second data processing unit performs a correlation analysis on the two-dimensional time series data A and the corresponding label to obtain a first correlation coefficient;
第一比较单元,根据相关性阈值,将所述二维时序数据A分为A1和A2,其中A2为第一相关系数小于所述相关性阈值的二维时序数据;The first comparison unit, according to the correlation threshold, divides the two-dimensional time series data A into A1 and A2, wherein A2 is the two-dimensional time series data whose first correlation coefficient is less than the correlation threshold;
第三数据处理单元,对A2增加噪声处理,生成二维时序数据B;The third data processing unit adds noise processing to A2 to generate two-dimensional time series data B;
第四数据处理单元,将所述二维时序数据B与对应的标签进行相关性分析,得到第二相关系数;the fourth data processing unit, performing correlation analysis on the two-dimensional time series data B and the corresponding label to obtain a second correlation coefficient;
第二比较单元,根据相关性阈值,将所述二维时序数据B分为B1和B2,其中B2为第二相关系数小于所述相关阈值的二维时序数据;The second comparison unit divides the two-dimensional time series data B into B1 and B2 according to the correlation threshold, wherein B2 is the two-dimensional time series data whose second correlation coefficient is less than the correlation threshold;
第五数据处理单元,将B2进行加权融合,并与B1拼合,然后与A1进行合成,得到二维时序数据C;The fifth data processing unit performs weighted fusion of B2, splices with B1, and then synthesizes with A1 to obtain two-dimensional time series data C;
第六数据处理单元,将所述二维时序数据C与对应的标签进行相关性分析,得到第三相关系数,获取对应标签的清洗特征因子,基于所述清洗特征因子对C进行清洗,得到数据样本D。The sixth data processing unit performs correlation analysis on the two-dimensional time series data C and the corresponding label, obtains a third correlation coefficient, obtains the cleaning characteristic factor of the corresponding label, cleans C based on the cleaning characteristic factor, and obtains data sample D.
本发明第二方面的所述处理系统能够实现第一方面及第一方面的各实现方式中的方法,并取得相同的效果。The processing system of the second aspect of the present invention can implement the methods in the first aspect and each implementation manner of the first aspect, and achieve the same effect.
发明内容中提供的效果仅仅是实施例的效果,而不是发明所有的全部效果,上述技术方案中的一个技术方案具有如下优点或有益效果:The effects provided in the summary of the invention are only the effects of the embodiments, rather than all the effects of the invention. One of the above technical solutions has the following advantages or beneficial effects:
本发明通过相关系数分析的方法实现数据处理、增加噪声与加权处理对数据进行增强及采用清洗特征因子对增强数据进行清洗的方法,能够在处理GIS局部放电数据样本不足时,提高准确率和泛化能力。The invention realizes data processing through the method of correlation coefficient analysis, enhances the data by adding noise and weighting processing, and uses the cleaning characteristic factor to clean the enhanced data, which can improve the accuracy and generalization when the GIS partial discharge data samples are insufficient. transformation ability.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,对于本领域普通技术人员而言,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. In other words, other drawings can also be obtained based on these drawings without creative labor.
图1是本发明所述方法的流程示意图;Fig. 1 is the schematic flow chart of the method of the present invention;
图2是本发明所述系统的结构示意图。FIG. 2 is a schematic structural diagram of the system according to the present invention.
具体实施方式Detailed ways
为能清楚说明本方案的技术特点,下面通过具体实施方式,并结合其附图,对本发明进行详细阐述。下文的公开提供了许多不同的实施例或例子用来实现本发明的不同结构。为了简化本发明的公开,下文中对特定例子的部件和设置进行描述。此外,本发明可以在不同例子中重复参考数字和/或字母。这种重复是为了简化和清楚的目的,其本身不指示所讨论各种实施例和/或设置之间的关系。应当注意,在附图中所图示的部件不一定按比例绘制。本发明省略了对公知组件和处理技术及工艺的描述以避免不必要地限制本发明。In order to clearly illustrate the technical features of the solution, the present invention will be described in detail below through specific embodiments and in conjunction with the accompanying drawings. The following disclosure provides many different embodiments or examples for implementing different structures of the invention. In order to simplify the disclosure of the present invention, the components and arrangements of specific examples are described below. Furthermore, the present invention may repeat reference numerals and/or letters in different instances. This repetition is for the purpose of simplicity and clarity and does not in itself indicate a relationship between the various embodiments and/or arrangements discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and processes are omitted from the present invention to avoid unnecessarily limiting the present invention.
本发明提出的一种GIS特高频局部放电数据的处理方法包括数据增强及清洗,采用故障特征提取、故障特征数据相关性分析、数据噪声添加与数据加权实现数据增强;并根据不同标签获取对应的清洗因子实现数据清洗,进而确保增强数据的有效性,实现GIS特高频局部放电的类型诊断。A method for processing GIS UHF partial discharge data proposed by the present invention includes data enhancement and cleaning, and adopts fault feature extraction, fault feature data correlation analysis, data noise addition and data weighting to achieve data enhancement; and obtains corresponding data according to different labels. The cleaning factor is used to clean the data, thereby ensuring the validity of the enhanced data, and realizing the type diagnosis of GIS UHF partial discharge.
如图1所示,本发明一种基于GIS特高频局部放电数据的处理方法,具体包括以下步骤:As shown in Figure 1, a method for processing UHF partial discharge data based on GIS of the present invention specifically includes the following steps:
S1,获取不同标签下的GIS特高频局部放电数据;S1, obtain GIS UHF partial discharge data under different labels;
S2,对所述局部放电数据进行特征值提取,形成二维时序数据A;S2, carrying out feature value extraction on the partial discharge data to form two-dimensional time series data A;
S3,将所述二维时序数据A与对应的标签进行相关性分析,得到第一相关系数;S3, the two-dimensional time series data A and the corresponding label are subjected to correlation analysis to obtain the first correlation coefficient;
S4,根据相关性阈值,将所述二维时序数据A分为A1和A2,其中A2为第一相关系数小于所述相关性阈值的二维时序数据;S4, according to the correlation threshold, the two-dimensional time series data A is divided into A1 and A2, wherein A2 is the two-dimensional time series data whose first correlation coefficient is less than the correlation threshold;
S5,对A2增加噪声处理,生成二维时序数据B;S5, adding noise processing to A2 to generate two-dimensional time series data B;
S6,将所述二维时序数据B与对应的标签进行相关性分析,得到第二相关系数;S6, the two-dimensional time series data B and the corresponding label are subjected to correlation analysis to obtain the second correlation coefficient;
S7,根据相关性阈值,将所述二维时序数据B分为B1和B2,其中B2为第二相关系数小于所述相关阈值的二维时序数据;S7, according to the correlation threshold, the two-dimensional time series data B is divided into B1 and B2, wherein B2 is the two-dimensional time series data whose second correlation coefficient is less than the correlation threshold;
S8,将B2进行加权融合,并与B1拼合,然后与A1进行合成,得到二维时序数据C;S8, weighted fusion of B2 is performed, and combined with B1, and then combined with A1 to obtain two-dimensional time series data C;
S9,将所述二维时序数据C与对应的标签进行相关性分析,得到第三相关系数,获取对应标签的清洗特征因子,基于所述清洗特征因子对C进行清洗,得到数据样本D。S9, perform correlation analysis on the two-dimensional time series data C and the corresponding label, obtain a third correlation coefficient, obtain the cleaning characteristic factor of the corresponding label, and clean C based on the cleaning characteristic factor to obtain a data sample D.
步骤S1中的标签包括气隙、悬浮、电晕、沿面和颗粒。以下实施例中以气隙标签为例进行说明。The labels in step S1 include air gap, suspension, corona, creepage and particle. In the following embodiments, an air gap label is used as an example for description.
步骤S2中二维时序数组A的形成具体为:The formation of the two-dimensional time series array A in step S2 is specifically:
S21,特高频检测单次采集1秒钟的数据,每个周期20ms,将20ms分为60个时间片,得到50×60的二维时序数据;S21, UHF detection collects 1 second of data at a time, each cycle is 20ms, divides 20ms into 60 time slices, and obtains 50×60 two-dimensional time series data;
S22,利用区域均值分解法获取该二维时序数据的特征值,并组成新的二维时序数据A;S22, using the regional mean decomposition method to obtain the eigenvalues of the two-dimensional time series data, and form new two-dimensional time series data A;
步骤S22中的区域均值分解法,获取包络图谱中包络函数的瞬时值,其包络函数如下:The regional mean value decomposition method in step S22 obtains the instantaneous value of the envelope function in the envelope map, and the envelope function is as follows:
式中,ni,ni+1分别为相邻的极值点,ai为该相邻极值点的平均值,即特征值,a(t)即包络函数的瞬时值。根据得到该组数据的特征值,构建二维时序数据A。In the formula, n i and n i+1 are the adjacent extreme points respectively, a i is the average value of the adjacent extreme points, that is, the eigenvalue, and a(t) is the instantaneous value of the envelope function. Two-dimensional time series data A is constructed according to the eigenvalues obtained from the set of data.
步骤S4中的设定相关系数阈值,此处阈值设为0.85,根据设定的相关系数阈值,分离二维时序数据A,分别得到A1和A2,具体为:Set the correlation coefficient threshold in step S4, where the threshold is set to 0.85, according to the set correlation coefficient threshold, separate the two-dimensional time series data A, and obtain A1 and A2 respectively, specifically:
A1为第一相关系数大于等于,0.85的二维时序数据;A1 is two-dimensional time series data whose first correlation coefficient is greater than or equal to 0.85;
A2为第一相关系数小于0.85的二维时序数据。A2 is two-dimensional time series data whose first correlation coefficient is less than 0.85.
步骤S5中对A2增加噪声处理的具体过程为:The specific process of adding noise to A2 in step S5 is as follows:
在二维时序数据A2中增加高斯随机变量并进行数据修正。A Gaussian random variable is added to the two-dimensional time series data A2 and data correction is performed.
高斯随机变量rv的计算方式如下:The Gaussian random variable rv is calculated as follows:
rv=sqrt(-2.0*log(U1))*cos(2*π*U2)rv=sqrt(-2.0*log(U 1 ))*cos(2*π*U 2 )
其中随机变量U1和U2表示为:where the random variables U1 and U2 are expressed as:
式中,随机变量U1、U2相互独立,且均服从(0,1)之间的均匀分布;随机变量Z0,Z1服从标准高斯分布且满足正态分布,均值为0,方差为1。In the formula, the random variables U 1 and U 2 are independent of each other, and both obey the uniform distribution between (0, 1); the random variables Z 0 , Z 1 obey the standard Gaussian distribution and satisfy the normal distribution, the mean is 0, and the variance is 1.
进行数据修正的具体过程为:The specific process of data correction is as follows:
定义待增强样本数据A2为dstImage[x][y],定义增强后样本数据B为EnhDstImage[x][y],定义增加高斯随机变量rv后的值为val,通过下式计算val:Define the sample data A2 to be enhanced as dstImage[x][y], define the enhanced sample data B as EnhDstImage[x][y], define the value of val after adding the Gaussian random variable rv, and calculate val by the following formula:
val=dstImage[x][y]+rvval=dstImage[x][y]+rv
对val的范围进行修正:Correct the range of val:
如果val<0,则val=0;If val<0, then val=0;
如果val>255,则val=255;If val>255, then val=255;
增强后样本数据B重新定义:The enhanced sample data B is redefined:
EnhDstImage[x][y]=valEnhDstImage[x][y]=val
对dstImage数组中所有数据经过上述处理后,形成增强后的样本数据EnhDstImage。After all the data in the dstImage array is processed above, the enhanced sample data EnhDstImage is formed.
步骤S7中,根据气隙标签设定相关系数阈值为0.8,与第二相关系数进行比较,将大于等于0.8的相关系数的特征数据,记为B1;将小于0.8的相关系数的特征数据,记为B2。In step S7, the threshold value of the correlation coefficient is set to 0.8 according to the air gap label, and compared with the second correlation coefficient, the characteristic data of the correlation coefficient greater than or equal to 0.8 is denoted as B1; the characteristic data of the correlation coefficient less than 0.8 is denoted as B1. for B2.
步骤S8中,将B2进行加权融合具体为:In step S8, the weighted fusion of B2 is specifically:
B2[x][y]=a1*W1+a2*W2+…+an*WnB2[x][y]=a1*W1+a2*W2+…+an*Wn
其中,W1+W2+…+Wn=1;a1、a2、…an为第x行特征数据。Among them, W1+W2+...+Wn=1; a1, a2,...an are the feature data of the xth row.
步骤S9中,获取对应标签的清洗特征因子的具体过程为:In step S9, the specific process of obtaining the cleaning characteristic factor of the corresponding label is as follows:
将得到的第三相关系数按照数值大小进行排序;根据排序结果将所述第三相关系数进行分段,并分别计算各分段的平均值;将所述各分段的平均值再去均值得到清洗特征因子。本实施例中以20%为一个分段,即取20%的相关系数的平均值,依次往后取20%的平均值,直到最后20%相关系数的平均值,然后根据这五个数值再进行去平均值,即得到清洗特征因子a。Sort the obtained third correlation coefficient according to the numerical value; divide the third correlation coefficient into segments according to the sorting result, and calculate the average value of each subsection respectively; and then remove the mean value from the average value of each subsection to obtain Clean the characteristic factor. In this embodiment, 20% is used as a subsection, that is, the average value of the 20% correlation coefficient is taken, the average value of the 20% correlation coefficient is taken in turn, and the average value of the last 20% correlation coefficient is taken. Perform de-averaging to obtain the cleaning characteristic factor a.
将该清洗特征因子a与相关系数矩阵进行比对,若该清洗特征因子小于等于对应的第三相关系数,则该系数对应的数据保留;若该清洗特征因子大于对应的第三相关系数,则该数据丢弃,并将该数据对应的前后数值进行求平均,进行替代丢弃的数值。Compare the cleaning characteristic factor a with the correlation coefficient matrix, if the cleaning characteristic factor is less than or equal to the corresponding third correlation coefficient, the data corresponding to the coefficient is retained; if the cleaning characteristic factor is greater than the corresponding third correlation coefficient, then The data is discarded, and the values before and after the data are averaged to replace the discarded values.
例如:假设数据增强样本为Enhance[x,y],相关系数矩阵为Correlation[m,n],清洗特征因子为a。For example: Suppose the data enhancement sample is Enhance[x,y], the correlation coefficient matrix is Correlation[m,n], and the cleaning feature factor is a.
若a≤Correlation[3,2],则Enhance[3,2]保留;If a≤Correlation[3,2], Enhance[3,2] is reserved;
若a>Correlation[5,6],则Enhance[5,6]丢弃,并将(Enhance[5,5]+Enhance[5,7])/2填到该位置;If a>Correlation[5,6], then Enhance[5,6] is discarded, and (Enhance[5,5]+Enhance[5,7])/2 is filled in this position;
若连续几个值均小于a,则取该行的均值进行填补。完成增强数据的清洗,进而获得最终的GIS气隙局部放电数据增强样本D,用于深度学习网络模型的训练。If several consecutive values are less than a, take the mean of the row to fill. After cleaning the enhanced data, the final GIS air-gap partial discharge data enhanced sample D is obtained, which is used for training the deep learning network model.
对气隙标签进行更新,使得数据清洗因子动态调整,从而提高数据清洗的效果与质量,确保数据清洗的准确性和完整性。The air-gap tag is updated to dynamically adjust the data cleaning factor, thereby improving the effect and quality of data cleaning and ensuring the accuracy and integrity of data cleaning.
利用上述的方法,对不同的局部放电类型,与对应的标签进行相关性分析。其中,处理样本为采样上述方法处理后的数据样本,分别得到的相关系数均值对比如下表所示。Using the above method, the correlation analysis between different partial discharge types and corresponding tags is carried out. Among them, the processed samples are the data samples processed by the above-mentioned methods, and the obtained correlation coefficient mean values are compared as shown in the following table.
可见,与直接使用样本作为训练数据相比,本发明可以以较少的计算量生成具有更多特征参数的样本,能够解决GIS局部放电数据集不足的问题,增加样本数量,避免过拟合,提高GIS特高频局部放电故障类型的识别准确率。It can be seen that compared with directly using samples as training data, the present invention can generate samples with more characteristic parameters with less calculation amount, can solve the problem of insufficient GIS partial discharge data sets, increase the number of samples, and avoid overfitting, Improve the recognition accuracy of GIS UHF partial discharge fault types.
如图2所示,本发明基于GIS特高频局部放电数据的处理系统包括依次连接的数据采集单元、第一数据处理单元、第二数据处理单元、第一比较单元、第三数据处理单元、第四数据处理单元、第二比较单元、第五数据处理单元和第六数据处理单元。As shown in FIG. 2, the processing system based on GIS UHF partial discharge data of the present invention includes a data acquisition unit, a first data processing unit, a second data processing unit, a first comparison unit, a third data processing unit, A fourth data processing unit, a second comparison unit, a fifth data processing unit and a sixth data processing unit.
数据采集单元用于获取不同标签下的GIS特高频局部放电数据;第一数据处理单元用于对局部放电数据进行特征值提取,形成二维时序数据A;第二数据处理单元将二维时序数据A与对应的标签进行相关性分析,得到第一相关系数;第一比较单元根据相关性阈值,将二维时序数据A分为A1和A2,其中A2为第一相关系数小于所述相关性阈值的二维时序数据;第三数据处理单元对A2增加噪声处理,生成二维时序数据B;第四数据处理单元将二维时序数据B与对应的标签进行相关性分析,得到第二相关系数;第二比较单元根据相关性阈值,将二维时序数据B分为B1和B2,其中B2为第二相关系数小于所述相关阈值的二维时序数据;第五数据处理单元将B2进行加权融合,并与B1拼合,然后与A1进行合成,得到二维时序数据C;第六数据处理单元将二维时序数据C与对应的标签进行相关性分析,得到第三相关系数,获取对应标签的清洗特征因子,基于所述清洗特征因子对C进行清洗,得到数据样本D。The data acquisition unit is used to obtain the GIS UHF partial discharge data under different labels; the first data processing unit is used to extract the characteristic value of the partial discharge data to form the two-dimensional time series data A; the second data processing unit converts the two-dimensional time series Perform correlation analysis on the data A and the corresponding label to obtain a first correlation coefficient; the first comparison unit divides the two-dimensional time series data A into A1 and A2 according to the correlation threshold, where A2 is that the first correlation coefficient is less than the correlation Threshold two-dimensional time series data; the third data processing unit adds noise processing to A2 to generate two-dimensional time series data B; the fourth data processing unit performs correlation analysis on the two-dimensional time series data B and the corresponding label, and obtains a second correlation coefficient The second comparison unit divides the two-dimensional time series data B into B1 and B2 according to the correlation threshold value, wherein B2 is the two-dimensional time series data whose second correlation coefficient is less than the correlation threshold value; The fifth data processing unit carries out weighted fusion by B2 , and combined with B1, and then combined with A1 to obtain two-dimensional time series data C; the sixth data processing unit performs correlation analysis on the two-dimensional time series data C and the corresponding label, obtains the third correlation coefficient, and obtains the cleaning of the corresponding label. Characteristic factor. Based on the cleaning characteristic factor, C is cleaned to obtain a data sample D.
上述虽然结合附图对本发明的具体实施方式进行了描述,但并非对本发明保护范围的限制,所属领域技术人员应该明白,在本发明的技术方案的基础上,本领域技术人员不需要付出创造性劳动即可做出的各种修改或变形仍在本发明的保护范围以内。Although the specific embodiments of the present invention have been described above in conjunction with the accompanying drawings, they do not limit the scope of protection of the present invention. Those skilled in the art should understand that on the basis of the technical solutions of the present invention, those skilled in the art do not need to pay creative efforts. Various modifications or deformations that can be made are still within the protection scope of the present invention.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010511393.8A CN111638428B (en) | 2020-06-08 | 2020-06-08 | GIS-based ultrahigh frequency partial discharge data processing method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010511393.8A CN111638428B (en) | 2020-06-08 | 2020-06-08 | GIS-based ultrahigh frequency partial discharge data processing method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111638428A CN111638428A (en) | 2020-09-08 |
CN111638428B true CN111638428B (en) | 2022-09-20 |
Family
ID=72330419
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010511393.8A Active CN111638428B (en) | 2020-06-08 | 2020-06-08 | GIS-based ultrahigh frequency partial discharge data processing method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111638428B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112699921B (en) * | 2020-12-16 | 2022-07-15 | 重庆邮电大学 | A clustering and cleaning method for power grid transient fault data based on stack noise reduction and self-encoding |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20090027923A (en) * | 2007-09-13 | 2009-03-18 | 현대중공업 주식회사 | Phase-independent GIS partial discharge diagnostic technique |
CN107831409A (en) * | 2017-09-22 | 2018-03-23 | 国网山东省电力公司电力科学研究院 | The method and method for detecting abnormality of superfrequency partial discharge detection TuPu method parameter extraction |
CN109726195A (en) * | 2018-11-26 | 2019-05-07 | 北京邮电大学 | A data enhancement method and device |
CN110703057A (en) * | 2019-11-04 | 2020-01-17 | 国网山东省电力公司电力科学研究院 | Partial discharge diagnosis method of power equipment based on data augmentation and neural network |
CN110991376A (en) * | 2019-12-10 | 2020-04-10 | 上海欧秒电力监测设备有限公司 | Feature extraction method for partial discharge type recognition |
-
2020
- 2020-06-08 CN CN202010511393.8A patent/CN111638428B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20090027923A (en) * | 2007-09-13 | 2009-03-18 | 현대중공업 주식회사 | Phase-independent GIS partial discharge diagnostic technique |
CN107831409A (en) * | 2017-09-22 | 2018-03-23 | 国网山东省电力公司电力科学研究院 | The method and method for detecting abnormality of superfrequency partial discharge detection TuPu method parameter extraction |
CN109726195A (en) * | 2018-11-26 | 2019-05-07 | 北京邮电大学 | A data enhancement method and device |
CN110703057A (en) * | 2019-11-04 | 2020-01-17 | 国网山东省电力公司电力科学研究院 | Partial discharge diagnosis method of power equipment based on data augmentation and neural network |
CN110991376A (en) * | 2019-12-10 | 2020-04-10 | 上海欧秒电力监测设备有限公司 | Feature extraction method for partial discharge type recognition |
Non-Patent Citations (1)
Title |
---|
一种GIS局放在线监测诊断的数据增强方法;申国标 等;《电力设备管理》;20200525(第05期);145-146,149 * |
Also Published As
Publication number | Publication date |
---|---|
CN111638428A (en) | 2020-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103558529B (en) | A kind of mode identification method of three-phase cartridge type supertension GIS partial discharge altogether | |
CN107831409B (en) | Ultrahigh frequency partial discharge detection map characteristic parameter extraction method and anomaly detection method | |
CN103323755A (en) | Method and system for recognition of GIS ultrahigh frequency partial discharge signal | |
CN108957251A (en) | Cable joint partial discharge mode identification method | |
CN110726898B (en) | Power distribution network fault type identification method | |
CN106526468A (en) | Breaker state detection method based on waveform characteristics identification | |
Gao et al. | Fault line detection using waveform fusion and one-dimensional convolutional neural network in resonant grounding distribution systems | |
CN114091549B (en) | A device fault diagnosis method based on deep residual network | |
CN107064759A (en) | A kind of extra-high voltage equipment insulation defect type judgement method and system | |
CN109462404A (en) | Adaptive Wave data compression method based on similarity segmentation | |
CN111638428B (en) | GIS-based ultrahigh frequency partial discharge data processing method and system | |
CN113554611A (en) | Insulator self-explosion defect detection method and device, terminal and storage medium | |
CN115588021A (en) | A Fusion Segmentation Method for Porcelain Insulators and Fittings UAV Infrared Inspection Image Fusion | |
CN112505484A (en) | Medium-voltage distribution cable latent fault identification method for small-resistance grounding system | |
CN116304798A (en) | Partial discharge type identification method, device, equipment and medium | |
CN107817427A (en) | Decision tree recognition methods based on sulfur hexafluoride gas shelf depreciation | |
CN117825876A (en) | Cable fault feature extraction method and device combining feature fragments and wavelet packets | |
Perera et al. | Design and hardware implementation of a modular transient directional protection scheme using current signals | |
CN117591909A (en) | Non-invasive load detection and decomposition method, device and storage medium | |
CN112287953A (en) | Method and system for GIS insulation defect category identification | |
CN115439319A (en) | Exposed detection method for electric slide wire protection device | |
CN105277852A (en) | Classification and identification method of line conditions of power distribution network | |
Swetapadma et al. | A novel fault identification technique for transmission lines based on spectral entropy and one-dimensional CNN | |
CN109581142B (en) | Novel multi-dimensional fusion high-voltage generator set stator single-phase earth fault detection method | |
CN114236308A (en) | Feeder line information-based power distribution network fault detection and positioning method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |