CN111784633B

CN111784633B - Insulator defect automatic detection algorithm for electric power inspection video

Info

Publication number: CN111784633B
Application number: CN202010452724.5A
Authority: CN
Inventors: 肖照林; 杨志林; 金海燕; 杨秀红
Original assignee: Xian University of Technology
Current assignee: Xian University of Technology
Priority date: 2020-05-26
Filing date: 2020-05-26
Publication date: 2024-02-06
Anticipated expiration: 2040-05-26
Also published as: CN111784633A

Abstract

The invention discloses an insulator defect automatic detection algorithm for an electric power inspection video, which comprises the following steps: step 1, classifying and enhancing original electric power tower insulator sample data; step 2, constructing and generating a convolutional neural network model, and training the network model through the data enhanced in the step 1; step 3, processing the image to be detected, extracting HOG characteristics from the picture passing through the network model through the established network model, and determining the approximate position of the insulator in the picture; step 4, after determining the approximate position of the insulator in the picture through the step 3, detecting whether the insulator is defective or not through a CNN/LSTM deep learning method; the method disclosed by the invention solves the defects of the existing power tower insulator defect detection method, adopts a deep learning method, and avoids the influence of other environmental factors in a sample. And the image is further processed and analyzed by combining with the image characteristics, so that the detection process is more accurate and efficient.

Description

Insulator defect automatic detection algorithm for electric power inspection video

Technical Field

The invention belongs to the technical fields of image processing, machine vision and artificial intelligence, and relates to an insulator defect automatic detection algorithm for an electric power inspection video.

Background

On the power transmission line, the safety of a power tower and the line is guaranteed, the timely discovery and treatment of the problems of an insulator in an overhead power transmission line are vital, the insulator can cause dirt and defects of different degrees due to the influence of environment and power change, the service life of the power transmission line can be seriously damaged, power failure can be caused even by burning equipment, serious accidents of power production are caused, in order to avoid the faults of the insulator, the power production process can only increase the times of maintenance, the manual inspection efficiency is low, the reliability is poor, the cost is high, the automatic inspection means for the faults of the insulator are not available at present, and the related algorithm and system still have the defects in precision and efficiency.

At present, the detection of insulators in the power industry in the related fields at home and abroad is roughly divided into an electric quantity measurement method and a non-electric quantity measurement method; the electric quantity measuring method is used for analyzing the running state of the insulator in real time by collecting current and voltage information; the non-electric quantity measuring method comprises an ultrasonic detection method, a laser Doppler vibration method, non-contact detection of infrared thermal imaging and the like, and an insulator damage condition visual detection method based on local area characteristics of a porcelain insulator is proposed in 2005, and the method adopts a three-layer BP neural network to identify a target, and adopts a scanning analysis insulator sub-image window longitudinal line cutting mode to judge the state of the insulator;

in recent years, the increase of transmission lines causes the line operation and maintenance environment to be more and more complex, the application of unmanned aerial vehicles in electric power inspection makes up the defect of a manual inspection mode, and an automatic and high-precision detection means for an electric power tower insulator of unmanned aerial vehicle inspection data is an important direction of technical development in the field.

Disclosure of Invention

The invention aims to provide an automatic insulator defect detection algorithm for an electric power inspection video, which can realize automatic judging whether insulator defects exist in electric power inspection video data.

The invention adopts an insulator defect automatic detection algorithm for an electric power inspection video, which is implemented by the following steps:

step 1, classifying and enhancing original electric power tower insulator sample data;

step 2, constructing and generating a convolutional neural network model, training the network model through the data enhanced in the step 1, and classifying through the convolutional neural network model;

step 3, processing the data classified in the step 2, extracting HOG characteristics from the pictures passing through the network model through the established network model, and determining the approximate position of the insulator in the pictures;

and 4, determining the approximate position of the insulator in the picture through the step 3, and detecting whether the insulator is defective or not through a CNN/LSTM deep learning method.

The invention is also characterized in that:

the specific content of the step 1 comprises the following steps:

firstly, extracting sample data of an existing electric power tower insulator to obtain an original sample image, dividing the data into two types of data including the insulator and the insulator-free data, and dividing the two types of data into test set data and training set data;

then invoking an image processing library in a python environment, defining enhancement factors, and carrying out data enhancement on the original image;

the step 2 specifically includes screening and classifying data through a convolutional neural network, and removing invalid pictures without insulators, and specifically includes the following steps:

step 2.1, constructing a convolutional neural network model by using Tensorflow, and taking the 3-channel image data obtained in the step 1, namely a training data set, as input of training of the network model;

the first layer of convolution layer, the input data passes through 16 convolution kernels, the convolution kernels are 3 by 3, the step length is 1, and the padding is set as the same;

the second pooling layer max pool, the pooling window is 2 x 2, the step size is 2, the padding is set as the same, and the data is downsampled to one half of the original data.

Extracting image features through multiple convolutions, adding deviation on a result obtained after the convolutions, and activating an output result by using an activation function ReLU;

the defined loss function of the network model is shown in formula (1):

loss＝-[plog(t)+(1-p)log(1-t)] (1)

dividing the output result into two classes, wherein p is the expected classification probability of the image, and t is the actual classification probability of the image; the full-connection layer converts the obtained data features through feature weighting to realize data unidimensionation, and then the softmax operation normalizes the result and combines the cross entropy formula to output the judgment result of the image;

step 2.2, inputting the data enhanced in the step 1 into the network model established in the step 2.1 to start training, when the loss value tends to be stable, describing that the model gradually converges, using the test set data as the input for generating a network, entering the trained network model, and checking the output accuracy;

the specific content in the step 3 is as follows:

step 3.1, processing the test set data and extracting HOG characteristics:

carrying out Gaussian filtering treatment on the data image of the test set, then carrying out downsampling decomposition on the smoothed image, respectively carrying out 1/4 downsampling on the size of the smoothed image to obtain a new image, continuously downsampling to obtain thumbnails with different sizes, finally forming a three-layer pyramid, gradually reducing the image size from the bottommost layer to the topmost layer, and gradually reducing the image resolution to obtain a Gaussian pyramid model; then, extracting HOG features from each layer of image of the obtained Gaussian pyramid model respectively to form a final HOG feature vector of the image;

step 3.2, determining the approximate position of the insulator in the picture according to the HOG characteristics extracted in the step 3.1;

the step 3 of extracting the HOG characteristic specific content comprises the following steps:

step 3.1.1, adopting a Gamma correction method to normalize a color space of the Gaussian pyramid model;

step 3.1.2, calculating the gradient of each pixel of the image:

the gradients of the pixel points (x, y) in the image in the horizontal direction and the vertical direction are respectively shown as the formula (2) and the formula (3):

G _X (x,y)＝H(x+1,y)-H(x-1,y) (2)

G _y (x,y)＝H(x,y+1)-H(x,y-1) (3)；

after obtaining the horizontal gradient and the vertical gradient, obtaining the gradient amplitude and the gradient direction corresponding to the pixel point, wherein the gradient amplitude and the gradient direction are respectively shown as the formula (4) and the formula (5):

step 3.1.3, dividing the gray level image into small cells which cannot slide;

step 3.1.4, dividing intervals by angles in the direction of the pixel gradient, and counting the gradient histogram of each cell to form a descriptor of each cell;

step 3.1.5, forming a block by each cell, and connecting feature descriptors of all cells in the block in series to obtain HOG feature descriptors of the block;

step 3.1.6, connecting HOG feature descriptors of all blocks in the image in series to obtain HOG features of the picture;

the specific operation content of determining the approximate position of the insulator in the picture by using the extracted HOG characteristics in the step 3 is as follows:

when the block calculates the sliding process in the image, judging that the region contains an insulator if the current gradient amplitude value is similar to the gradient histogram and the value of the adjacent block; judging the characteristic gradient histogram of the whole picture to obtain the ROI region distribution of the image;

the specific content of the step 4 comprises the following steps:

constructing an LSTM neural network model by using Tensorflow, wherein the basic structure of the LSTM model realizes three gate operations, namely a forgetting gate, an input gate and an output gate;

LSTM needs 8 groups of parameters to learn, respectively: weight matrix W of forgetting gate _f And bias term b _f Weight matrix W of input gate _i And bias term b _i Weight matrix W of output gate _o And bias term b _o And a weight matrix W for calculating cell states _c And bias term b _c ；

The forget gate and the input gate are shown in formula (6) and formula (7), respectively:

f _t ＝σ(W _f ·[h _t-1 ,x _t ]+b _f ) (6)

i _t ＝σ(W _i ·[h _t-1 ,x _t ]+b _i ) (7)

w in the formula _f Is the weight matrix of forgetting gate, [ h ] _t-1 ,x _t ]Representing the joining of two vectors into one longer vector, b _f Is a bias term for forgetting gates, σ is a sigmoid function;

the output gate is shown in equation (8):

o _t ＝σ(W _o ·[h _t-1 ,x _t ]+b _o ) (8)；

selecting a result of forgetting the last moment through a forgetting gate, combining the input of the moment to obtain the output of a cell at the current moment, judging the results through an output gate, taking the judging result of the output gate as the input of a convolutional neural network, screening and extracting the image of the determined ROI area through an LSTM network once again, judging the defect condition of an insulator, and finally classifying the obtained judging result in the input classification convolutional neural network;

the training data of the classified convolutional neural network are divided into two types, pictures of an intact insulator and a defective insulator are respectively stored, the two types of data are divided into a training set and a verification set, and then the training data are input into the convolutional neural network for training;

the training process of the input convolutional neural network comprises the following steps:

the first layer of convolution layer, the input data passes through 5 convolution kernels, the convolution kernels are 3 x 5, and the step length is 1; a second pooling layer max pool, wherein the pooling window is 2 x 2, and the step length is 2, and the data is downsampled; the third layer of convolution layer is subjected to 5 convolution kernels, the convolution kernels are 3 x 5, and the step length is 1; a fourth pooling layer max pool, wherein the pooling window is 2 x 2, and the step length is 2;

adding deviation on the result obtained after convolution, activating the output result by using an activation function ReLU, and using cross entropy by using a loss function; the fifth layer and the sixth layer are all connected layers, the data obtained by the fourth layer are unidimensionalized, the data obtained by the fourth layer are converted through feature weighting, and then softmax is normalized to obtain a final judging result:

wherein the defined loss function is as shown in formula (9):

loss＝loss _MES +loss _CNN (9)

where loss is the loss of content, including MSE loss for LSTM networks _MES And loss of judgment loss of convolution network _CNN As shown in the formula (10) and the formula (11), respectively:

in the formula (10), H and W represent the width and height of the image, I _x,y Representing the x, y position, I of a real image _x,y ' represents the x, y position of the output image to be passed through the network module; in the formula (11), P is the expected classification probability of the image, and T is the actual classification probability of the image passing through the network.

And obtaining the corresponding position in the video frame sequence according to the output result, thereby outputting the final defect detection result.

The beneficial effects of the invention are as follows:

aiming at the limitation and the deficiency of the existing power tower insulator defect detection method, the invention improves the insulator defect detection precision by using a deep learning method, and effectively solves the problem of the non-robust detection algorithm caused by the deficiency of the number of training samples of the deep learning. Because the deep learning network model with memory capability is adopted, the algorithm provided by the invention can extract insulator defect information recorded by multi-frame image data in video, so that the detection accuracy is higher than that of the existing detection algorithm based on single-frame image data. The pre-screening process of the power inspection video data is introduced, so that the method has lower computational complexity and higher detection efficiency, and is more suitable for application scenes of processing a large amount of inspection video data in the power inspection process.

Drawings

FIG. 1 is a diagram showing a CNN network structure in an automatic insulator defect detection algorithm for an electric power inspection;

FIG. 2 is a diagram of the overall network model in the automatic insulator defect detection algorithm for the power inspection of the invention;

FIG. 3 is a basic block diagram of an LSTM network in an automatic detection algorithm for insulator defects for an electric power inspection;

FIG. 4 is a flowchart of a HOG feature locating insulator in an automatic detection algorithm for insulator defects for an electric power inspection;

fig. 5 is a flowchart of defect detection in an automatic insulator defect detection algorithm for an electric power inspection video.

Detailed Description

The invention will be described in detail below with reference to the drawings and the detailed description.

The invention provides an insulator defect automatic detection algorithm for an electric power inspection video, which is implemented according to the following steps as shown in fig. 2 and 5:

step 1, classifying and enhancing original electric power tower insulator sample data:

extracting existing electric power tower insulator sample data to obtain an original sample image, and dividing the data into two main types: the test set data and the training set data are divided into the test set data and the training set data;

calling an image processing library in a python environment, defining enhancement factors, and carrying out data enhancement on an original image: data overturning, data rotation, data amplification, data clipping, data translation, noise disturbance and the like, and a plurality of samples different from original data can be obtained through data enhancement, so that data diversity is improved. The increase of the data number also enhances the accuracy of the training model, and overcomes the limitation of a small data set on a certain condition;

step 2, constructing and generating a convolutional neural network model, and training the network model through the data enhanced in the step 1:

step 2, screening data through a convolutional neural network, and removing invalid pictures without insulators, wherein the specific process is as follows:

step 2.1, constructing a convolutional neural network model by using Tensorflow, wherein the basic structure of a characteristic extraction network of the convolutional neural network model consists of a series of convolutional units and pooling units, the 3-channel image data obtained in the step 1, namely a training data set, is used as the input of network model training, a convolutional structure table is shown in fig. 1, H, W and C represent the size and the number of channels of an input image, and finally S is the output judgment result;

the defined loss function of the network model is shown in formula (1):

loss＝-[plog(t)+(1-p)log(1-t)] (1)

because the network is used for eliminating irrelevant background pictures without insulators, the output result is only required to be divided into two types, p is the expected classification probability of the image, and t is the actual classification probability of the image; the full-connection layer converts the obtained data features through feature weighting to realize data unidimensionation, and then the softmax operation normalizes the result and combines the cross entropy formula to output the judgment result of the image;

step 3, processing the test set data obtained by classification in the step 1, extracting HOG characteristics from pictures passing through the network model through the established network model, and determining the approximate position of the insulator in the pictures:

step 3.1, as shown in fig. 4, processing the picture to be detected and extracting HOG features:

carrying out Gaussian filtering treatment on an image to be detected, then carrying out downsampling decomposition on the image subjected to the smoothing treatment, respectively carrying out 1/4 downsampling on the size of the image to obtain a new image, continuously downsampling to obtain thumbnails with different sizes, finally forming a three-layer pyramid, gradually reducing the image size from the bottommost layer to the topmost layer, and gradually reducing the image resolution to obtain a Gaussian pyramid model; then, extracting HOG features from each layer of image of the obtained Gaussian pyramid model respectively to form a final HOG feature vector of the image;

the specific process for extracting HOG features is as follows:

step 3.1.1, adopting a Gamma correction method to normalize a color space of the Gaussian pyramid model image, adjusting the contrast of the image, effectively reducing the influence caused by local shadow and illumination change of the image, and simultaneously inhibiting noise interference;

step 3.1.2, calculating the gradient (including the size and the direction) of each pixel of the image, capturing contour information through gradient information, and further weakening the interference of illumination at the same time:

G _X (x,y)＝H(x+1,y)-H(x-1,y) (2)

G _y (x,y)＝H(x,y+1)-H(x,y-1) (3)；

step 3.1.3, dividing the gray level image into small cells which cannot slide;

step 3.1.4, the direction of the pixel gradient can be divided into intervals by angles, and the gradient histogram (the number of different gradients in a certain interval) of each cell is counted, so that the descriptor of each cell can be formed;

step 3.1.6, connecting HOG feature descriptors of all blocks in the image in series to obtain HOG feature, gradient image and gradient amplitude information of the picture;

step 3.2, determining the approximate position of the insulator in the picture according to the HOG characteristics extracted in the step 3.1: after the related information of the picture to be detected is obtained, the gradient histogram can be distributed in a specific gradient direction at the edge part with larger change of the insulator. Considering that the gradient histogram of the insulators and the background part in the picture are large in difference, and the insulators are basically arranged periodically, the difference of the gradient amplitude values among the insulators is small, so that when the block is similar to the gradient histogram and the adjacent block in the sliding process in the image calculation, the region is judged to contain the insulators; judging the characteristic gradient histogram of the whole picture to obtain the ROI region distribution of the image;

calculating gradient positioning, carrying out frequency domain transformation on the positioned area, and rotating according to the detected main direction, so that insulators in all pictures are adjusted to be in one direction, and the accuracy of neural network processing judgment is improved;

step 4, after determining the approximate position of the insulator in the picture through the step 3, detecting whether the insulator is defective or not through a CNN/LSTM deep learning method:

constructing an LSTM neural network model by using Tensorflow, wherein the basic structure of the LSTM model realizes three gate operations, namely a forgetting gate, an input gate and an output gate; as shown in fig. 3, the forget gate is responsible for deciding how much of the last time cell state remains to the current time cell state; the input gate is responsible for deciding how much of the cell state at the current time to keep input to the current time; the output gate is responsible for determining how much output exists in the unit state at the current moment;

f _t ＝σ(W _f ·[h _t-1 ,x _t ]+b _f ) (6)

i _t ＝σ(W _i ·[h _t-1 ,x _t ]+b _i ) (7)

the output gate is shown in equation (8):

o _t ＝σ(W _o ·[h _t-1 ,x _t ]+b _o ) (8)；

the result of the last moment is selected through a forgetting gate, the output of the cell at the current moment is obtained by combining the input of the moment, the results are judged through an output gate, and the final output of the LSTM network is determined by the output gate and the state of the cell together; taking the output gate result as the input of the convolutional neural network, screening and extracting the image of the determined ROI region through the LSTM network for one time, and then transmitting the obtained result into the convolutional neural network to judge the defect condition of the insulator;

the training data of the convolutional neural network are divided into two types, pictures of a perfect insulator and a defective insulator are respectively stored, the two types of data are divided into a training set and a verification set, and then the training data are input into the convolutional neural network for training;

wherein the defined loss function is as shown in formula (9):

loss＝loss _MES +loss _CNN (9)

The invention discloses an insulator defect automatic detection algorithm for an electric power inspection video, which comprises the following specific processes:

firstly, classifying by CNN, and analyzing whether a picture has a classification network of insulators or not;

then extracting the HOG characteristics of the Gaussian pyramid, namely, performing ROI segmentation on the insulator-existing part of the picture;

then, an LSTM detection neural network is adopted to detect and judge whether the insulator is damaged (whether the insulator is damaged or not is judged by a plurality of continuous feature images), and the LSTM input is a plurality of frame insulator feature images (which can be understood as a short feature video) after HOG processing, so that the detection of the insulator damage is a multi-frame feature detection method which utilizes the correlation of a time domain and a space domain, and is particularly suitable for processing inspection video;

the ordered arrangement mode of the three steps enables the method to be suitable for inspection video data instead of image data, thereby improving efficiency and precision; because the LSTM detection neural network is judged to be a probability, the invention classifies by adopting the one-time classification convolutional neural network to obtain an exact defect position, and the CNN and the LSTM are firstly trained independently of each other and then are trained by cascading the two networks due to different inputs, so that the convergence and the precision of each network and the final network are improved.

Claims

1. An insulator defect automatic detection algorithm for an electric power inspection video is characterized by comprising the following steps:

step 2, constructing and generating a convolutional neural network model, training the network model through the data enhanced in the step 1, and classifying through the convolutional neural network model, specifically, screening and classifying the data through the convolutional neural network, and eliminating invalid pictures without insulators;

step 3, processing the data classified in the step 2, extracting HOG characteristics from the pictures passing through the network model through the established network model, and determining the approximate position of the insulator in the pictures, wherein the method comprises the following steps of:

step 3.1, processing the test set data and extracting HOG characteristics:

the method for extracting the HOG characteristic specific content comprises the following steps of:

step 3.1.2, calculating the gradient of each pixel of the image:

G _X (x,y)＝H(x+1,y)-H(x-1,y) (2)

G _y (x,y)＝H(x,y+1)-H(x,y-1) (3)；

step 3.1.3, dividing the gray level image into small cells which cannot slide;

step 3.2, determining the approximate position of the insulator in the picture according to the HOG characteristics extracted in the step 3.1, wherein the specific operation content is as follows:

step 4, after determining the approximate position of the insulator in the picture through the step 3, detecting whether the insulator is defective or not through a CNN/LSTM deep learning method, specifically: taking the output of the LSTM as the input of a convolutional neural network, screening and extracting the image with the determined ROI area through the LSTM to obtain a picture characteristic, judging the defect condition of the insulator, and finally transmitting the obtained judging result into a classified convolutional neural network for classification; the training data of the classified convolutional neural network are divided into two types, pictures of an intact insulator and a defective insulator are respectively stored, the two types of data are divided into a training set and a verification set, and then the training data are input into the convolutional neural network for training.

2. The automatic detection algorithm for insulator defect for electric power inspection according to claim 1, wherein the step 2 specifically comprises the steps of screening and classifying data through a convolutional neural network, and excluding invalid pictures without insulators, and the specific steps are as follows:

a second pooling layer max pool, wherein the pooling window is 2 x 2, the step length is 2, the padding is set as the same, and the data is downsampled to one half of the original data;

the defined loss function of the network model is shown in formula (1):

loss＝-[plog(t)+(1-p)log(1-t)] (1)

and 2.2, inputting the data enhanced in the step 1 into the network model established in the step 2.1 to start training, when the loss value tends to be stable, gradually converging the model, using the test set data as the input for generating a network, entering the trained network model, and checking the output accuracy.

3. The automatic insulator defect detection algorithm for the power inspection according to claim 1, wherein the specific content of the step 4 comprises:

f _t ＝σ(W _f ·[h _t-1 ,x _t ]+b _f ) (6)

i _t ＝σ(W _i ·[h _t-1 ,x _t ]+b _i ) (7)

the output gate is shown in equation (8):

o _t ＝σ(W _o ·[h _t-1 ,x _t ]+b _o ) (8)；

and selecting a result of the last moment to be forgotten through a forgetting gate, combining the input of the moment to obtain the output of the cell at the current moment, judging the results through an output gate, taking the judging result of the output gate as the input of a convolutional neural network, screening and extracting the image of the determined ROI area once again through an LSTM network, judging the defect condition of the insulator, and finally classifying the obtained judging result in the input classification convolutional neural network.

4. The automatic detection algorithm for insulator defect for electric power inspection according to claim 3, wherein the training data of the classified convolutional neural network are divided into two types, pictures of an intact insulator and a defective insulator are respectively stored, the two types of data are divided into a training set and a verification set, and then the training data are input into the convolutional neural network for training;

wherein the defined loss function is as shown in formula (9):

loss＝loss _MES +loss _CNN (9)

in the formula (10), H and W represent the width and height of the image, I _x,y Representing the x, y position, I of a real image _x,y ' indicate will be passed throughPassing through the x and y positions of the output image of the network module; in the formula (11), P is the expected classification probability of the image, and T is the actual classification probability of the image passing through the network;