CN117456198A - Power transmission line fault detection method based on improved Yolov5 neural network - Google Patents

Power transmission line fault detection method based on improved Yolov5 neural network

Info

Publication number
CN117456198A
CN117456198A (application No. CN202311035820.XA)
Authority
CN
China
Prior art keywords
transmission line
loss
power transmission
fault detection
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311035820.XA
Other languages
Chinese (zh)
Inventor
崔晋培
梁程
王钰
孟宝坤
高丽媛
白天予
任肖久
段伟润
王晓愉
李海科
徐坤
尚梦楠
党旭鑫
李飞
赵望友
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
State Grid Tianjin Electric Power Co Ltd
Dongli Power Supply Co of State Grid Tianjin Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
State Grid Tianjin Electric Power Co Ltd
Dongli Power Supply Co of State Grid Tianjin Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, State Grid Tianjin Electric Power Co Ltd, Dongli Power Supply Co of State Grid Tianjin Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN202311035820.XA priority Critical patent/CN117456198A/en
Publication of CN117456198A publication Critical patent/CN117456198A/en
Pending legal-status Critical Current


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0004Industrial image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Economics (AREA)
  • Biophysics (AREA)
  • Water Supply & Treatment (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Public Health (AREA)
  • Computational Linguistics (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a power transmission line fault detection method based on an improved YOLOv5 neural network, which comprises the following steps: step 1, acquiring thousands of transmission line images with a transmission line inspection information acquisition device; step 2, processing the acquired images to construct a training sample set and a test sample set; step 3, training and then testing the convolutional neural network with the training-set and test-set samples to obtain a trained YOLOv5 model for power transmission line fault detection; and step 4, detecting the power transmission line image to be inspected with the trained model and outputting the detection result. The invention achieves accurate fault detection of the power transmission line and identifies the fault category; it reduces line inspection and maintenance costs and improves inspection and maintenance efficiency.

Description

Power transmission line fault detection method based on improved Yolov5 neural network
Technical Field
The application belongs to the technical field of power transmission line target detection, and particularly relates to a power transmission line fault detection method based on an improved Yolov5 neural network.
Background
Transmission lines operate outdoors year-round and often cross complex environments such as urban economic zones, roads and bridges. During urban construction, large mechanical vehicles frequently intrude into the transmission line protection zone, and accidents such as line short circuits, trips and power outages occur when machinery touches a line or discharges across an insufficient safety distance. In addition, foreign-object intrusion caused by natural factors such as bird activity, falling trees and sliding branches also endangers the safe and stable operation of the power transmission line. The transmission line therefore needs to be monitored and evaluated to ensure its safe and stable operation.
At present, with the rapid development of intelligent video technology, techniques for monitoring foreign-object intrusion on power transmission lines by video and image means have emerged. The Chinese patent applications "Power transmission line fault detection method, system and electronic equipment" (application No. 201811547406.6) and "Power transmission line fault detection method, device and mobile terminal" (application No. 201910165020.7) both detect transmission line faults by image processing, but neither distinguishes fault types, so the harm that different types of faults pose to the line cannot be accurately assessed; nor do they eliminate the influence of ice and snow coating, conductor galloping and sag variation on the line. These shortcomings urgently need improvement.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide a power transmission line fault detection method based on an improved YOLOv5 neural network, which can rapidly and accurately identify the fault type, improve the accuracy of power transmission line detection, and facilitate later maintenance and repair.
The above object of the present invention is achieved by the following technical solutions:
a transmission line fault detection method based on an improved Yolov5 neural network comprises the following steps:
step 1, acquiring thousands of transmission line images by using a transmission line inspection information acquisition device;
step 2, processing the acquired images to construct a training sample set and a test sample set;
step 3, training and testing the convolutional neural network by using the training set sample and the testing set sample successively to obtain a trained YOLOv5 model for power transmission line fault detection;
and 4, detecting the power transmission line image to be detected by using a trained power transmission line fault detection YOLOv5 model, and outputting a detection result.
In addition, the transmission line inspection information acquisition device in the step 1 adopts an unmanned aerial vehicle carrying shooting equipment.
Furthermore, step 2 comprises:
2.1, applying gray-scale mask processing to the background of each acquired image;
2.2, labeling the position frame of the power transmission line and its fault type with the image tool LabelImg, generating an xml file with the same name as the image; extracting the labeled position-frame and fault-type information from the XML file with a Python program and saving it to a txt file that the YOLO model can read and train on directly, thereby generating the sample data set; the fault types comprise broken strands, scattered strands, crossed strands and foreign objects on the conductor;
2.3, dividing the produced sample data set into a training set of 80% of the samples and a test set of 20%.
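As an illustrative sketch of step 2.2 (not part of the original disclosure), the LabelImg XML annotations can be converted to YOLO-format txt lines with a short Python program; the element layout follows the standard Pascal VOC format that LabelImg writes, and the class names are assumptions matching the four fault types listed above:

```python
import xml.etree.ElementTree as ET

# Assumed class list matching the fault types named above.
CLASSES = ["broken_strand", "scattered_strand", "crossed_strand", "foreign_object"]

def voc_to_yolo(xml_text):
    """Convert one LabelImg (Pascal VOC) annotation into YOLO txt lines:
    'class_id x_center y_center width height', all normalized to [0, 1]."""
    root = ET.fromstring(xml_text)
    w = float(root.findtext("size/width"))
    h = float(root.findtext("size/height"))
    lines = []
    for obj in root.iter("object"):
        cls = CLASSES.index(obj.findtext("name"))
        xmin = float(obj.findtext("bndbox/xmin"))
        ymin = float(obj.findtext("bndbox/ymin"))
        xmax = float(obj.findtext("bndbox/xmax"))
        ymax = float(obj.findtext("bndbox/ymax"))
        xc = (xmin + xmax) / 2 / w      # normalized box center x
        yc = (ymin + ymax) / 2 / h      # normalized box center y
        bw = (xmax - xmin) / w          # normalized box width
        bh = (ymax - ymin) / h          # normalized box height
        lines.append(f"{cls} {xc:.6f} {yc:.6f} {bw:.6f} {bh:.6f}")
    return lines
```

Each returned line follows the YOLO convention, with coordinates normalized by the image size, and can be written directly to the txt file bearing the image's name.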
In step 3, the YOLOv5 neural network model is configured as follows:
it comprises an input end, a backbone network, a neck network and a detection end; the input end performs image preprocessing; the backbone network extracts features; the neck network performs feature fusion; the detection end predicts on the image features, generating bounding boxes and predicting categories.
Moreover, the backbone network comprises a CBS module, a Bottleneck module, a C3 module and an SPPF feature-fusion structure; deformable convolution is blended into YOLOv5 for feature extraction by adding a new module, C3-DCN, obtained by modifying the C3 module so that the Conv blocks in C3 are replaced with DCN.
The calculation method for adding the C3-DCN module comprises the following steps:
for a 3×3 convolution kernel, the sampling grid is R = {(-1,-1), (-1,0), …, (1,1)}; the standard convolution is given by equation (1) and the deformable convolution by equation (2),

y_1(p_0) = ∑_{p_n∈R} w(p_n)·x(p_0+p_n) (1)

y_2(p_0) = ∑_{p_n∈R} w(p_n)·x(p_0+p_n+Δp_n) (2)

wherein y_1(p_0) represents the output of the conventional convolution feature map, p_n ranges over the positions in R, and w(p_n) represents the weight of each p_n; y_2(p_0) represents the feature-map output of the deformable convolution, and Δp_n represents a learnable offset, with {Δp_n | n = 1, …, N}, N = |R|;
since the Δp_n produced during operation of the deformable convolution is typically fractional, the sampling position must be corrected by bilinear interpolation; as in equation (3),

x(p) = ∑_q G(q,p)·x(q) (3)

G(q,p) = g(q_x,p_x)·g(q_y,p_y) (4)

g(q,p) = max(0, 1-|q-p|) (5)

wherein p = p_0 + p_n + Δp_n represents the position coordinates after the convolution kernel is shifted, and q ranges over the spatial positions in the feature map x; G(q,p) represents the bilinear interpolation kernel, x(q) is the value at an integer position on the feature map, and x(p) represents the bilinearly interpolated value.
In the optimization training of the YOLOv5 model, a total loss function is used, and the model is trained by back-propagating the optimized gradient. The total loss of the model comprises classification loss, confidence loss and target bounding-box loss; i.e. the total loss value of the model is:

Loss_t = α/(α+β+γ)·Loss_c + β/(α+β+γ)·Loss_o + γ/(α+β+γ)·Loss_b (6)

wherein Loss_c represents the classification loss, computed with BCE Loss over positive samples only; Loss_o represents the confidence loss, also computed with BCE Loss but over all samples; Loss_b represents the target bounding-box loss, computed with CIoU_Loss over positive samples only; and α, β and γ are weight coefficients.
Moreover, training parameters are set as follows: the initial momentum was 0.8, the initial learning rate was 0.01, the weight decay was 0.0005, the cosine annealing learning rate was 0.01, and 100 epochs were trained for each model.
The invention has the advantages and positive effects that:
1. The invention provides a detection method capable of detecting multiple types of power transmission line faults with an improved YOLOv5 model. The method achieves accurate fault detection of the power transmission line and obtains the fault category; it reduces line inspection and maintenance costs, improves inspection and maintenance efficiency, and makes the inspection and maintenance workflow more intelligent. The output end of YOLOv5 carries three detection heads of different sizes, enabling prediction at three different scales.
2. The invention completes data-set production with the image labeling tool LabelImg and Python programs, a simple and easily operated process. Training images with a gray background mask are used to build the sample data set; the image mask in effect shields the background area, completing the extraction of candidate target regions and highlighting the likely positions of the target as far as possible, which further improves the detection efficiency of the model.
3. The improved loss function of the invention can lead the predicted frame to be more approximate to the real frame.
4. A traditional convolutional neural network extracts features with fixed-size convolution kernels, a constraint that leads to poor adaptability and weak generalization of the model, so deformable convolution is integrated into YOLOv5 for feature extraction. Because integrating deformable convolution increases the computation of the model and affects the detection speed, only one C3 module in the backbone network is replaced by a C3-DCN module, so that accuracy is improved without excessively reducing the detection speed.
Drawings
Fig. 1 is a flow chart of a transmission line fault detection method of the present invention;
FIG. 2 is a schematic diagram of the CBS module of the present invention;
FIG. 3 is a block diagram of a Bottleneck module of the present invention;
FIG. 4 is a schematic illustration of the addition location of the C3-DCN of the present invention in YOLOv 5;
fig. 5 is a schematic diagram of the SPPF module of the present invention.
Detailed Description
The structure of the present invention will be further described by way of examples with reference to the accompanying drawings. It should be noted that the present embodiments are illustrative and not restrictive.
Referring to fig. 1-5, the method for detecting the power transmission line fault based on the improved Yolov5 neural network comprises the following steps:
step 1, acquiring image information of a power transmission line;
Thousands of transmission line images (e.g., about 3000) are acquired with a transmission line inspection information acquisition device, such as an unmanned aerial vehicle carrying shooting equipment.
Step 2, processing the acquired images to construct a training sample set and a test sample set;
1. Apply gray-scale mask processing to the background of each acquired image. Specifically, each image is first converted into a 3-channel gray image; the position information of the target object is then read, and the region outside the rectangular area containing the target is left as the gray mask while the RGB pixel values inside the rectangle are restored unchanged from the original image, yielding a training image with a gray background mask.
2. Label the position frame of the power transmission line and its fault type with the image tool LabelImg, generating an xml file with the same name as the image; extract the labeled position-frame and fault-type information from the XML file with a Python program and save it to a txt file that the YOLO model can read and train on directly, thereby generating the sample data set. The fault types comprise broken strands, scattered strands, crossed strands and foreign objects on the conductor.
in the sample data set, the training set sample ratio is 80%, and the test set sample ratio is 20%.
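A minimal NumPy sketch of the gray-background-mask processing described in item 1 above; the function name, box format and luminance weights are illustrative assumptions, not part of the original disclosure:

```python
import numpy as np

def gray_background_mask(img, box):
    """Gray-mask everything outside the target rectangle while keeping
    the original RGB pixel values inside it.
    img: HxWx3 uint8 array; box: (xmin, ymin, xmax, ymax) in pixels."""
    # 3-channel gray image: replicate the luminance into R, G and B.
    gray = 0.299 * img[..., 0] + 0.587 * img[..., 1] + 0.114 * img[..., 2]
    out = np.repeat(gray[..., None], 3, axis=2).astype(np.uint8)
    xmin, ymin, xmax, ymax = box
    # Restore the original pixels inside the target rectangle.
    out[ymin:ymax, xmin:xmax] = img[ymin:ymax, xmin:xmax]
    return out
```

The result is the "training image of a gray background mask": gray outside the candidate region, original RGB inside it.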
Step 3: train and then test the convolutional neural network with the training-set and test-set samples to obtain a trained YOLOv5 model. The optimizer is stochastic gradient descent; the initial momentum is 0.8, the initial learning rate 0.01, the weight decay 0.0005 and the cosine-annealing learning-rate factor 0.01, and each model is trained for 100 epochs. The optimal YOLOv5 model is obtained once the loss, precision and recall of the model have stabilized.
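The cosine-annealing schedule stated above (initial learning rate 0.01 over 100 epochs) can be sketched as follows; interpreting the quoted value 0.01 as the final-rate factor lrf follows the common YOLOv5 convention and is an assumption:

```python
import math

def cosine_lr(epoch, lr0=0.01, lrf=0.01, epochs=100):
    """Cosine-annealed learning rate for a given epoch:
    decays from lr0 at epoch 0 to lr0*lrf at the final epoch."""
    return lr0 * (lrf + (1 - lrf) * (1 + math.cos(math.pi * epoch / epochs)) / 2)
```

At epoch 0 this yields 0.01, at epoch 50 roughly 0.00505, and at epoch 100 the floor value 0.0001.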
The YOLOv5 neural network model is composed of:
It mainly comprises four parts: an Input end, a Backbone network, a Neck network and a detection end (Head). Input is the image preprocessing stage, Backbone the feature extraction stage and Neck the feature fusion stage, while Head predicts on the image features to generate bounding boxes and prediction categories.
1. Input terminal
The input transmission line pictures are uniformly resized to 640×640 pixels. Preprocessing comprises scaling the input picture to the network input size, together with normalization, Mosaic data enhancement, adaptive anchor-box computation and adaptive picture scaling.
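The adaptive picture scaling can be sketched as the usual letterbox computation: scale by the limiting side, then pad the remainder symmetrically to reach 640×640 (an illustrative sketch; the function name is an assumption):

```python
def letterbox_geometry(w, h, new=640):
    """Compute the scaled size and per-side padding needed to fit a
    w x h image into a new x new square while preserving aspect ratio."""
    r = min(new / w, new / h)            # uniform scale factor
    sw, sh = round(w * r), round(h * r)  # scaled (unpadded) size
    pad_w, pad_h = new - sw, new - sh    # total padding per axis
    return sw, sh, pad_w / 2, pad_h / 2  # half-padding on each side
```

For example, a 1280×720 aerial frame scales to 640×360 with 140 pixels of padding above and below.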
2. Backbone network
The preprocessed transmission line image is convolved by the convolution module group of the feature extraction network to obtain a first feature map; the first feature map then undergoes Bottleneck convolution and concatenation to obtain a second feature map; finally, the SPPF module of the feature extraction network pools and concatenates the second feature map to obtain a salient feature map of the power transmission line. This feature map contains the positions and categories of the transmission line defects in the image.
And extracting the characteristics of the power transmission line fault picture through a convolutional neural network. The backbone network mainly consists of several modules, namely a CBS module, a Bottleneck module, a C3 module and an SPPF feature fusion structure.
CBS module: this part consists of convolution, batch normalization and an activation function; see FIG. 2.
Bottleneck module: this part consists of convolution, batch normalization and activation functions; see FIG. 3.
C3 module: deformable convolution is integrated into YOLOv5 for feature extraction by adding a new module, C3-DCN, obtained by modifying the C3 module so that the Conv blocks in C3 are replaced with DCN.
For a 3×3 convolution kernel, the sampling grid is R = {(-1,-1), (-1,0), …, (1,1)}; the standard convolution is given by equation (1) and the deformable convolution by equation (2),

y_1(p_0) = ∑_{p_n∈R} w(p_n)·x(p_0+p_n) (1)

y_2(p_0) = ∑_{p_n∈R} w(p_n)·x(p_0+p_n+Δp_n) (2)

wherein y_1(p_0) represents the output of the conventional convolution feature map, p_n ranges over the positions in R, and w(p_n) represents the weight of each p_n; y_2(p_0) represents the feature-map output of the deformable convolution, and Δp_n represents a learnable offset, with {Δp_n | n = 1, …, N}, N = |R|.

Since the Δp_n produced during operation of the deformable convolution is typically fractional, the sampling position must be corrected by bilinear interpolation. As in equation (3),

x(p) = ∑_q G(q,p)·x(q) (3)

G(q,p) = g(q_x,p_x)·g(q_y,p_y) (4)

g(q,p) = max(0, 1-|q-p|) (5)

wherein p = p_0 + p_n + Δp_n represents the position coordinates after the convolution kernel is shifted, and q ranges over the spatial positions in the feature map x; G(q,p) represents the bilinear interpolation kernel, x(q) is the value at an integer position on the feature map, and x(p) represents the bilinearly interpolated value.
And replacing one C3 module in the backbone network with a C3-DCN module. An alternative position is shown in fig. 4.
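Equations (3)-(5) can be transcribed directly into a few lines of Python to verify the bilinear sampling step; here x is a small 2-D feature map stored as nested lists (an illustrative check of the formulas, not the actual DCN kernel):

```python
def g(q, p):
    # Equation (5): 1-D bilinear interpolation kernel.
    return max(0.0, 1.0 - abs(q - p))

def bilinear_sample(x, p):
    """Equation (3): x(p) = sum_q G(q, p) * x(q), with
    G(q, p) = g(q_y, p_y) * g(q_x, p_x) as in equation (4)."""
    py, px = p
    h, w = len(x), len(x[0])
    val = 0.0
    for qy in range(h):
        for qx in range(w):
            # Only the (at most) four integer neighbors of p contribute.
            val += g(qy, py) * g(qx, px) * x[qy][qx]
    return val
```

Sampling at the center of a 2×2 patch returns the mean of its four values, as expected of bilinear interpolation.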
SPPF module: this part passes the input features serially through three maximum pooling layers with a 5×5 kernel and then concatenates the results, performing multi-scale fusion.
3. Neck network
The neck network is the critical part between the backbone network and the output. To better support the multiple detection heads at the output end in completing the multi-scale prediction task, the neck network performs feature fusion on the feature maps feeding each detection head, improving its detection capability. This process adopts an FPN+PAN structure (Feature Pyramid Network + Path Aggregation Network). The FPN structure enhances the fusion of features from different layers, transferring deep semantic features to the shallow layers, strengthening semantic expression at multiple scales and enabling multi-scale prediction. On this basis, PAN adds bottom-up feature fusion, transferring shallow localization information to the deep layers, strengthening localization at multiple scales and further improving the detection capability of the model.
The Neck end thus fuses the FPN and PAN towers to process the feature maps, so that the output feature maps carry both strong semantic features and strong localization features.
4. Output end
The output end of YOLOv5 has three detection layers of different sizes, enabling prediction at three different scales. After the input image passes through the feature extraction layers, feature maps downsampled 8, 16 and 32 times are obtained. In the transmission line fault detection of this application, the sizes of the output feature layers are 80×80×54, 40×40×54 and 20×20×54, respectively.
The 80×80×54 feature map contains mostly low-level features and is used for small-target detection; the 40×40×54 feature map carries comparable proportions of low-level and high-level feature information and is used for medium-target detection; the 20×20×54 feature map contains mostly high-level features and is used for large-target detection.
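The relation between input size, stride and head tensor shape can be sketched as below. Note that each head outputs 3 anchors × (5 + num_classes) channels per grid cell, so the 54 channels quoted above would correspond to 13 classes, while the four fault classes of this application give 3 × (5 + 4) = 27; the class count is therefore left as a parameter here (an illustrative sketch):

```python
def head_shapes(img_size=640, num_classes=4, anchors_per_head=3):
    """Output feature-map shapes of the three YOLOv5 detection heads
    for strides 8, 16 and 32: (grid, grid, anchors * (5 + classes))."""
    ch = anchors_per_head * (5 + num_classes)
    return [(img_size // s, img_size // s, ch) for s in (8, 16, 32)]
```

With a 640×640 input, the grids are 80×80, 40×40 and 20×20 regardless of class count; only the channel dimension changes with num_classes.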
5. Loss function
In the optimization training of the YOLOv5 model, the predicted box is driven closer to the ground-truth box. The loss function of the model is optimized over all of its parts, namely classification loss, confidence loss and target bounding-box loss; i.e. the total loss value of the model is:

Loss_t = α/(α+β+γ)·Loss_c + β/(α+β+γ)·Loss_o + γ/(α+β+γ)·Loss_b (6)

wherein Loss_c represents the classification loss, computed with BCE Loss over positive samples only; Loss_o represents the confidence loss, also computed with BCE Loss but over all samples; Loss_b represents the target bounding-box loss, computed with CIoU_Loss over positive samples only; and α, β and γ are weight coefficients.
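Equation (6) is a weighted sum whose coefficients are normalized by (α+β+γ); a direct transcription, with illustrative weight values (the source does not fix α, β, γ):

```python
def total_loss(loss_c, loss_o, loss_b, alpha=0.5, beta=1.0, gamma=0.05):
    """Equation (6): weighted sum of classification, confidence and
    bounding-box losses, with weights normalized by (alpha + beta + gamma)."""
    s = alpha + beta + gamma
    return (alpha / s) * loss_c + (beta / s) * loss_o + (gamma / s) * loss_b
```

Because the weights are normalized, equal component losses yield the same total loss regardless of the α, β, γ values chosen.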
Step 4: detect the power transmission line image to be inspected with the trained YOLOv5 model and output the detection result.
The experimental environment is configured as follows: the operating system is Ubuntu 20.04; the CPU is an Intel(R) E5-2650 v4; the GPU is a Tesla P100-PCIE; the memory is 12 GB; the deep learning framework is PyTorch 1.8 with Python 3.8.
Although the embodiments of the present invention and the accompanying drawings have been disclosed for illustrative purposes, those skilled in the art will appreciate that: various substitutions, changes and modifications are possible without departing from the spirit of the invention and the appended claims, and therefore the scope of the invention is not limited to the embodiments and the disclosure of the drawings.

Claims (8)

1. A transmission line fault detection method based on an improved Yolov5 neural network comprises the following steps:
step 1, acquiring thousands of transmission line images by using a transmission line inspection information acquisition device;
step 2, processing the acquired images to construct a training sample set and a test sample set;
step 3, training and testing the convolutional neural network sequentially by using the training set sample and the testing set sample to obtain a trained power transmission line fault detection YOLOv5 model;
and 4, detecting the power transmission line image to be detected by using the trained power transmission line fault detection YOLOv5 model, and outputting a detection result.
2. The improved Yolov5 neural network-based transmission line fault detection method of claim 1, wherein the method comprises the steps of: in the step 1, an unmanned aerial vehicle carrying shooting equipment is adopted as the transmission line inspection information acquisition device.
3. The improved Yolov5 neural network-based transmission line fault detection method of claim 1, wherein the method comprises the steps of: the step 2 comprises the following steps:
2.1, applying gray-scale mask processing to the background of each acquired image: first, each image is converted into a 3-channel gray image; then the position information of the target power transmission line is read, and for the rectangular area containing the target power transmission line, the RGB pixel values are restored unchanged from the original image while the rest remains gray-masked, thereby obtaining a training image with a gray background mask.
2.2, labeling the position frame of the power transmission line and its fault type with the image tool LabelImg, generating an xml file with the same name as the image; extracting the labeled position-frame and fault-type information from the XML file with a Python program and saving it to a txt file that the YOLO model can read and train on directly, thereby generating the sample data set; the fault types comprise broken strands, scattered strands, crossed strands and foreign objects on the conductor;
2.3, dividing the produced sample data set into a training set of 80% of the samples and a test set of 20%.
4. The improved Yolov5 neural network-based transmission line fault detection method of claim 1, wherein the method comprises the steps of: in step 3, the YOLOv5 neural network model is configured as follows: it comprises an input end, a backbone network, a neck network and a detection end; the input end performs image preprocessing; the backbone network extracts features; the neck network performs feature fusion; the detection end predicts on the image features, generating bounding boxes and predicting categories.
5. The improved Yolov5 neural network-based transmission line fault detection method of claim 4, wherein the method comprises the steps of: the backbone network comprises a CBS module, a Bottleneck module, a C3 module and an SPPF feature-fusion structure; deformable convolution is blended into YOLOv5 for feature extraction by adding a new module, C3-DCN, obtained by modifying the C3 module so that the Conv blocks in C3 are replaced with DCN.
6. The improved Yolov5 neural network-based transmission line fault detection method of claim 5, wherein the method comprises the steps of: the calculation method for adding the C3-DCN module comprises the following steps:
for a 3×3 convolution kernel, the sampling grid is R = {(-1,-1), (-1,0), …, (1,1)}; the standard convolution is given by equation (1) and the deformable convolution by equation (2),

y_1(p_0) = ∑_{p_n∈R} w(p_n)·x(p_0+p_n) (1)

y_2(p_0) = ∑_{p_n∈R} w(p_n)·x(p_0+p_n+Δp_n) (2)

wherein y_1(p_0) represents the output of the conventional convolution feature map, p_n ranges over the positions in R, and w(p_n) represents the weight of each p_n; y_2(p_0) represents the feature-map output of the deformable convolution, and Δp_n represents a learnable offset, with {Δp_n | n = 1, …, N}, N = |R|;
since the Δp_n produced during operation of the deformable convolution is typically fractional, the sampling position must be corrected by bilinear interpolation; as in equation (3),

x(p) = ∑_q G(q,p)·x(q) (3)

G(q,p) = g(q_x,p_x)·g(q_y,p_y) (4)

g(q,p) = max(0, 1-|q-p|) (5)

wherein p = p_0 + p_n + Δp_n represents the position coordinates after the convolution kernel is shifted, and q ranges over the spatial positions in the feature map x; G(q,p) represents the bilinear interpolation kernel, x(q) is the value at an integer position on the feature map, and x(p) represents the bilinearly interpolated value.
7. The improved Yolov5 neural network-based transmission line fault detection method of claim 1, wherein the method comprises the steps of: in the optimization training of the YOLOv5 model, the total loss function is used as the loss function, and the YOLOv5 model is trained by back-propagating the gradients; the total loss of the model comprises the classification loss, the confidence loss and the target bounding-box loss;
Loss_t = α/(α+β+γ)·Loss_c + β/(α+β+γ)·Loss_o + γ/(α+β+γ)·Loss_b (6)
wherein Loss_c represents the classification loss, computed with BCE Loss over positive samples only; Loss_o represents the confidence loss, also computed with BCE Loss but over all samples; Loss_b represents the target bounding-box loss, computed with CIoU_Loss over positive samples only; α, β and γ are the weight coefficients.
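The normalised weighting of equation (6) can be sketched numerically as follows; the component loss values below are placeholders, not real BCE or CIoU outputs.

```python
# Sketch of Eq. (6): the three weights are normalised so they sum to 1,
# making Loss_t a convex combination of the component losses.

def total_loss(loss_c, loss_o, loss_b, alpha=1.0, beta=1.0, gamma=1.0):
    s = alpha + beta + gamma
    return (alpha / s) * loss_c + (beta / s) * loss_o + (gamma / s) * loss_b

# With equal weights the total is simply the mean of the three components;
# raising gamma shifts emphasis toward the bounding-box loss.
print(total_loss(0.3, 0.6, 0.9))
```

Because the weights are normalised, rescaling α, β, γ by a common factor leaves the total loss unchanged; only their ratios matter.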
8. The improved Yolov5 neural network-based transmission line fault detection method of claim 1, wherein the method comprises the steps of: the training parameters are set as follows: the initial momentum is 0.8, the initial learning rate is 0.01, the weight decay is 0.0005, the cosine annealing learning rate is 0.01, and each model is trained for 100 epochs.
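The settings in this claim can be gathered into a config sketch, with one common form of the cosine-annealing schedule. The key names (`lr0`, `lrf`) and the schedule formula follow the usual YOLOv5 convention and are assumptions; the claim itself lists only the numeric values.

```python
import math

# Training settings from the claim; key names are illustrative (YOLOv5-style).
hyp = {
    "momentum": 0.8,         # initial momentum
    "lr0": 0.01,             # initial learning rate
    "weight_decay": 0.0005,  # weight decay
    "lrf": 0.01,             # cosine-annealing final learning-rate fraction
    "epochs": 100,           # epochs trained per model
}

def cosine_lr(epoch, lr0=0.01, lrf=0.01, epochs=100):
    # One common cosine-annealing schedule: lr decays from lr0 to lr0 * lrf
    return lr0 * ((1.0 - math.cos(math.pi * epoch / epochs)) / 2.0
                  * (lrf - 1.0) + 1.0)

print(cosine_lr(0))    # starts at lr0 = 0.01
print(cosine_lr(100))  # ends near lr0 * lrf = 1e-4
```

Under this reading, "cosine annealing learning rate 0.01" is the final fraction of the initial rate, so training would end at a learning rate of about 1e-4.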
CN202311035820.XA 2023-08-17 2023-08-17 Power transmission line fault detection method based on improved Yolov5 neural network Pending CN117456198A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311035820.XA CN117456198A (en) 2023-08-17 2023-08-17 Power transmission line fault detection method based on improved Yolov5 neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311035820.XA CN117456198A (en) 2023-08-17 2023-08-17 Power transmission line fault detection method based on improved Yolov5 neural network

Publications (1)

Publication Number Publication Date
CN117456198A true CN117456198A (en) 2024-01-26

Family

ID=89578842

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311035820.XA Pending CN117456198A (en) 2023-08-17 2023-08-17 Power transmission line fault detection method based on improved Yolov5 neural network

Country Status (1)

Country Link
CN (1) CN117456198A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117671602A (en) * 2024-01-31 2024-03-08 吉林省中农阳光数据有限公司 Farmland forest smoke fire prevention detection method and device based on image recognition
CN117671602B (en) * 2024-01-31 2024-04-05 吉林省中农阳光数据有限公司 Farmland forest smoke fire prevention detection method and device based on image recognition

Similar Documents

Publication Publication Date Title
CN113705478B (en) Mangrove single wood target detection method based on improved YOLOv5
CN113780296B (en) Remote sensing image semantic segmentation method and system based on multi-scale information fusion
CN111598098B (en) Water gauge water line detection and effectiveness identification method based on full convolution neural network
Tian et al. Multiscale building extraction with refined attention pyramid networks
CN115546630A (en) Construction site extraction method and system based on remote sensing image characteristic target detection
CN110991430B (en) Ground feature identification and coverage rate calculation method and system based on remote sensing image
CN117456198A (en) Power transmission line fault detection method based on improved Yolov5 neural network
CN112507861A (en) Pedestrian detection method based on multilayer convolution feature fusion
CN115713537A (en) Optical remote sensing image cloud and fog segmentation method based on spectral guidance and depth attention
CN113780270A (en) Target detection method and device
CN115984543A (en) Target detection algorithm based on infrared and visible light images
CN111753666A (en) Method and system for detecting faults of small targets in power transmission line and storage medium
CN113077438B (en) Cell nucleus region extraction method and imaging method for multi-cell nucleus color image
CN111931721B (en) Method and device for detecting color and number of annual inspection label and electronic equipment
CN116994161A (en) Insulator defect detection method based on improved YOLOv5
CN116543295A (en) Lightweight underwater target detection method and system based on degradation image enhancement
CN116433661A (en) Method, device, equipment and medium for detecting semiconductor wafer by multitasking
CN115330703A (en) Remote sensing image cloud and cloud shadow detection method based on context information fusion
CN115761223A (en) Remote sensing image instance segmentation method by using data synthesis
CN115641498A (en) Medium-term rainfall forecast post-processing correction method based on space multi-scale convolutional neural network
CN115861160A (en) Method and device for detecting surface defects of power interface of mobile phone and storage medium
CN112861869A (en) Sorghum lodging image segmentation method based on lightweight convolutional neural network
CN116740569B (en) Deep learning-based snowfall area cloud detection system
CN117557775B (en) Substation power equipment detection method and system based on infrared and visible light fusion
CN116030347B (en) High-resolution remote sensing image building extraction method based on attention network

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination