CN117726958B

CN117726958B - Intelligent detection and hidden danger identification method for inspection image target of unmanned aerial vehicle of distribution line

Info

Publication number: CN117726958B
Application number: CN202410171362.0A
Authority: CN
Inventors: 夏家力; 王远鹏; 李新民; 程良立; 贺俊雄; 李云健; 李健; 吴奇乐; 冯燕; 徐启
Original assignee: State Grid Hubei Electric Power Co Ltd; Huanggang Power Supply Co of State Grid Hubei Electric Power Co Ltd
Current assignee: State Grid Hubei Electric Power Co Ltd; Huanggang Power Supply Co of State Grid Hubei Electric Power Co Ltd
Priority date: 2024-02-07
Filing date: 2024-02-07
Publication date: 2024-05-10
Anticipated expiration: 2044-02-07
Also published as: CN117726958A

Abstract

The application relates to a distribution line unmanned aerial vehicle inspection image target detection and hidden danger intelligent identification method, which comprises the following steps: the unmanned aerial vehicle is used for collecting optical images and infrared images of the distribution line, and preprocessing infrared images and visible images of the unmanned aerial vehicle collected on the distribution line; constructing a typical target detection model under a complex background driven by large model semantics, reading the preprocessed image, and extracting the characteristics of the image; and constructing a distribution line hidden danger positioning and fine granularity identification algorithm combining the perception attention, and carrying out fine granularity hidden danger identification on the acquired characteristics. The application can realize comprehensive monitoring of the distribution line, quickly locate hidden trouble, accurately plan maintenance plan, improve maintenance efficiency and reduce maintenance cost through an automatic hidden trouble identification technology.

Description

Intelligent detection and hidden danger identification method for inspection image target of unmanned aerial vehicle of distribution line

Technical Field

The application relates to the field of distribution line hidden danger identification and detection, in particular to a distribution line unmanned aerial vehicle inspection image target detection and hidden danger intelligent identification method.

Background

In the past, the potential safety hazard of distribution lines is eliminated by adopting a mode of manually observing and shooting by a camera along the distribution lines by professional inspection staff, and the mode is low in efficiency and high in manpower cost, and the detection result is very dependent on the experience of the professional staff. Workers are also at great risk from the standpoint of personal safety. If the situation of severe environment happens, the running cost of inspection is greatly increased.

Disclosure of Invention

The embodiment of the application aims to provide a distribution line unmanned aerial vehicle inspection image target detection and hidden danger intelligent identification method, which is used for solving the three core problems of low sample quality, insufficient sample quantity, large typical target scale difference and difficult identification of fine-granularity hidden danger existing in the distribution line unmanned aerial vehicle inspection process.

In order to achieve the above purpose, the present application provides the following technical solutions:

The embodiment of the application provides a method for detecting a patrol image target of a power distribution line unmanned aerial vehicle and intelligently identifying hidden danger, which comprises the following steps:

the unmanned aerial vehicle is used for collecting optical images and infrared images of the distribution line, and preprocessing infrared images and visible images of the unmanned aerial vehicle collected on the distribution line;

constructing a typical target detection model under a complex background driven by large model semantics, reading the preprocessed image, and extracting the characteristics of the image;

And constructing a distribution line hidden danger positioning and fine granularity identification algorithm combining the perception attention, and carrying out fine granularity hidden danger identification on the acquired characteristics.

The preprocessing of the unmanned aerial vehicle infrared and visible light images collected on the distribution line is specifically that details of cross-domain images are added by a sample refining and image enhancement method based on a residual pyramid network, in the process of the sample refining and image enhancement method based on the residual pyramid network, a multi-type sample refining method is adopted, and 5 types of image enhancement processing including noise adding, blurring processing, edge sharpening, pixel inversion and image smoothing are performed on an original image, so that the problem of low sample capacity is solved, and the data size is further amplified.

In the residual pyramid network-based sample refinement and image enhancement method, feature extraction is performed on different layers of information by using a pyramid cascade network, and a calculation formula is as follows:

，

In the above-mentioned description of the invention, Representing feature splice sets generated by the first layer dense residual block, the second layer dense residual block and the Mth layer dense residual block respectively,/>A convolution operation representing 1*1 of layer h, global residual learning in network/>Is defined as:

，

In the above-mentioned formula, the group of the compounds, Representing the/>, in a low resolution subnetworkLayer output, output in high resolution subnetworkAs an input to the low resolution network.

In the process of the sample refinement and image enhancement method based on the residual pyramid network, a visible light and infrared image fusion method of a combined full convolution neural network is adopted, a camera and an infrared sensor are installed through an unmanned aerial vehicle, and the visible light and the infrared image are simultaneously acquired and subjected to later fusion processing so as to reflect imaging information of a distribution line image more truly; in the process of a visible light and infrared image fusion method adopting a combined full convolution neural network, a low-frequency image fusion mode and a high-frequency image fusion mode are introduced, so that the resolution and the definition of a fused image are ensured, and the accuracy of circuit line inspection fault detection is improved.

The method for constructing the typical target detection model under the complex background driven by the large model semanteme, reading the preprocessed image, and extracting the characteristics of the image specifically comprises the following steps of:

Dry sampling function consisting of two convolution layers Scaling the features to a suitable size, each convolution layer having a convolution kernel size of 3, a stride of 2, and a fill of 1, including an LN normalization layer and a GELU activation function layer;

Performing DCNv function processing on the scaled features;

Inserting a downsampling layer to adjust the size of the feature map, filtering redundant information and noise in the data, and retaining key features of the data;

based on the designed superimposition criteria, an image is output.

The multi-scale feature extraction for infrared images is as follows:

Slicing the data through a Focus structure, sampling at intervals to obtain 4 feature graphs, and expanding the number of input channels by 4 times;

after one convolution operation, the data is input to Module/>The module realizes data dimension reduction by adjusting the channel number, reduces the network parameter number, and then adopts point convolution to adjust the channel number to be consistent with input so as to realize channel information fusion;

After one convolution operation, the data is transmitted to the SPP layer, the SPP layer separates the context features by convolution and pooling operation, and the receptive field is increased, so that the subsequent fusion of global feature information is facilitated.

The method for fusing the visible light and the infrared features by using a multi-mode bidirectional feature fusion algorithm comprises the following specific steps:

An original feature pyramid is obtained by carrying out a series of convolution operations on an input image, and feature graphs with different scales are obtained;

After obtaining feature graphs with different scales, combining the feature graphs to form an initial multi-scale feature pyramid, wherein the feature graphs have the characteristics of high-to-low resolution and high-to-low semantic information;

transmitting shallow characteristic information to a deep layer through a bottom-up structure, simultaneously downsampling high-resolution characteristics, and carrying out channel transformation on the characteristic map to obtain a new characteristic map;

through reverse feature transfer, semantic features in the deep feature map are transferred to the shallow feature map, and a new multi-scale feature map is generated;

and combining all the enhanced feature graphs with different scales to form a new feature pyramid, so as to finish the fusion of the visible light and infrared features.

Target detection using the trifoliate-based attention is performed as follows:

The feature tensor is subjected to global average pooling, global maximum pooling and global random pooling of three parallel modes to obtain three channel feature quantities with C multiplied by 1 dimension,

Adding the generated three channel characteristic quantities to obtain a comprehensive channel characteristic quantity with the dimension of Cx1x1,

Nonlinear activation is performed by using a Sigmoid function to obtain a channel weight coefficient of the feature map, the input feature map is multiplied by the channel weight coefficient and is connected with the input feature map in a jumping manner to obtain a brand new feature map with channel importance difference,

And adding the three-way dimensionality halved output feature images, inputting the feature images into a Sigmoid activation function to obtain a final spatial attention weight matrix, and multiplying the weight matrix by the input feature images to obtain the feature images with spatial pixel weight relations.

The method for locating the image local discrimination area under the guidance of the attention introducing mechanism specifically comprises the following steps of:

Giving a global image, taking an absolute value of an activation value at any position, and obtaining a visual attention heat map of the feature by taking the maximum value in the direction of the feature channel;

designing a binary template for locating a local area with a larger activation value in the heat map;

acquiring a maximum communication area capable of covering all larger activation values in the binary template from the characteristic heat map;

And cutting out a local judging region in the input image by utilizing the maximum connected region, and then adjusting the local judging region to be the same size as the global image.

Compared with the prior art, the application has the beneficial effects that:

By means of the accurate hidden danger identification technology, potential problems can be found early, preventive and maintenance measures are taken, accident risks are reduced, and the overall safety level of the power system is improved. The hidden danger identification technology can immediately find problems before the distribution line is in problem, avoid interruption and faults of the line and ensure the continuity and stability of power supply. By maintaining and replacing damaged parts in time, the power failure time can be reduced, the power utilization satisfaction of users can be improved, and the reliable power supply of a power system can be ensured. The traditional line inspection method requires a lot of manpower and time, and is low in efficiency and easy to miss. Through automatic hidden danger identification technology, can realize the comprehensive monitoring to distribution lines, the hidden danger is fixed a position fast, accurate planning maintenance plan improves maintenance efficiency, reduces maintenance cost. With the increasing demand for electricity and the increasing upgrading of the power grid, the requirements for safety and reliability of the distribution lines are increasing. The application of the distribution line hidden trouble identification technology is helpful to promote the power industry to develop towards the direction of intellectualization and digitalization, promote the overall level of the power system and meet the requirements of future energy supply. The invention has the advantages of ensuring safe and stable operation of the power system, improving the reliability of power supply, optimizing maintenance and management of facilities, providing support for future development of the power industry and promoting the power industry to realize sustainable development and modernization transformation.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments of the present application will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present application and should not be considered as limiting the scope, and other related drawings can be obtained according to these drawings without inventive effort for a person skilled in the art.

Fig. 1 is a schematic flow chart of a method according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present application will be described below with reference to the accompanying drawings in the embodiments of the present application. It should be noted that: like reference numerals and letters denote like items in the following figures, and thus once an item is defined in one figure, no further definition or explanation thereof is necessary in the following figures.

The terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.

The terms "first," "second," and the like, are used merely to distinguish one entity or action from another entity or action, and are not to be construed as indicating or implying any actual such relationship or order between such entities or actions.

Referring to fig. 1, an embodiment of the invention provides a method for detecting a target of an inspection image of a power distribution line unmanned aerial vehicle and intelligently identifying hidden trouble, which comprises the following steps:

S1, acquiring optical images and infrared images of a distribution line by using an unmanned aerial vehicle, and preprocessing infrared and visible light images of the unmanned aerial vehicle acquired on the distribution line;

S2, constructing a typical target detection model under a complex background driven by large model semantics, reading the preprocessed image, and extracting the characteristics of the image;

s3, constructing a distribution line hidden danger positioning and fine granularity identification algorithm combining the perception attention, and carrying out fine granularity hidden danger identification on the acquired characteristics.

，

Performing DCNv function processing on the scaled features;

based on the designed superimposition criteria, an image is output.

The multi-scale feature extraction for infrared images is as follows:

Target detection using the trifoliate-based attention is performed as follows:

The image local discrimination area positioning method under the guidance of the attention mechanism constructs a binary mask for positioning the local discrimination area in the global image. The binary template has the same size as the global image, and each position value is 0 or 1,1 is the local discrimination area, and 0 is the non-discrimination area. The binary template is obtained by setting a threshold in the feature map (feature maps). This process can be regarded as a visually noticeable process. Specific method steps are given below.

First, given a global image I, setIndicating that its last convolutional layer in the global branch is in space/>Location, th/>Activation value of individual channel,/>，/>The total number of channels characterizing the last convolutional layer in the global branch, as in ResNet-50,/>。/>Representing a global branch. First for any positionActivation value/>Taking the absolute value. A visual attention heat map (attention heatmap)/>, of the feature is then obtained by taking the maximum value in the feature channel direction。

The magnitude of the value directly reflects/>The importance of the location features to the classification of hidden danger.

The image local discrimination area positioning method under the guidance of the attention mechanism designs a binary template for positioning a local area with larger activation value in the heat map. First, in a feature heat mapIf any position/>The activation value in the heat map is greater than a predetermined threshold/>The corresponding position value in the template is set to 1, otherwise to 0. In particular to

Wherein,For controlling the size of the attention area. /(I)The larger the acquired local area is, the smaller the local area is. /(I)The smaller the acquired local area is, the larger the acquired local area is. In extreme cases, if/>Is 0 due to/>The templates are all 1, and the local image and the global image acquired at this time are the same. If/>1, Due to/>The templates are all 0's, and at this time, the local discrimination area cannot be obtained. Therefore, it is set as/>. In practice, there may be a feature map containing multiple regions of greater activation values, which may be connected or separated. In order to cover all areas with larger activation values, a maximum connected area which can cover all areas with larger activation values in the template is acquired on the characteristic heat map by using a related method. The maximum communication area position information is expressed as:

Wherein, ，/>The upper left and lower right corner coordinates of the maximum communication area in the feature map, respectively.

Finally, the local discrimination area is cut out in the input image I by utilizing RAnd then/>And adjusting the size to be equal to the I. By using the image local discriminant region localization method under the guidance of the attention mechanism, the region or feature of interest in the image can be precisely located without processing the entire image. This helps to improve positioning accuracy and efficiency.

For the original input image, it can be considered a global scale image. Using the foreground attention awareness mechanism, the foreground image may be obtained from the global image without using bounding box annotations. In order to extract the discriminative part area without using part labeling or bounding boxes, the last feature map of the foreground scale feature extractor is put into a part attention sensing mechanism, and then a part image with discriminative features can be obtained. The three scale images are placed in their feature decimators to obtain three logits. When carrying out reasoning prediction, the IAPP-CNN algorithm integrates the logit output by three CNN branches to obtain the final prediction category, which is different from the former work. Furthermore, due to the risk of identified uncertainties, a joint uncertainty estimation module is introduced that evaluates their reliability before each log addition. The introduction of the combined uncertainty estimation module further improves the accuracy and the robustness of hidden danger identification. The main idea of IAPP-CNN is to construct a new deep neural network, mine more information from fine-grained images, and make more robust predictions for fine-grained recognition.

The above description is only an example of the present application and is not intended to limit the scope of the present application, and various modifications and variations will be apparent to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the protection scope of the present application.

Claims

1. The intelligent detection and hidden danger identification method for the inspection image target of the unmanned aerial vehicle of the power distribution line is characterized by comprising the following steps:

Constructing a distribution line hidden danger positioning and fine granularity identifying algorithm combining the perception attention, and identifying the fine granularity hidden danger of the acquired characteristics;

The preprocessing of the unmanned aerial vehicle infrared and visible light images acquired on the distribution line comprises the steps of adding details of cross-domain images by a residual pyramid network-based sample refining and image enhancement method, adopting a multi-type sample refining method in the process of the residual pyramid network-based sample refining and image enhancement method, adding noise, blurring, edge sharpening, pixel inversion and image smoothing 5 types of image enhancement processing to the original image, solving the problem of low sample capacity, and further amplifying data volume

，

In the above-mentioned formula, the group of the compounds, Representing the/>, in a low resolution subnetworkLayer output, output in high resolution subnetworkAs input to a low resolution network;

In the process of the sample refinement and image enhancement method based on the residual pyramid network, a visible light and infrared image fusion method of a combined full convolution neural network is adopted, a camera and an infrared sensor are installed through an unmanned aerial vehicle, and the visible light and the infrared image are simultaneously acquired and subjected to later fusion processing so as to reflect imaging information of a distribution line image more truly; in the process of a visible light and infrared image fusion method combining a full convolution neural network, a low-frequency image fusion mode and a high-frequency image fusion mode are introduced, so that the resolution and the definition of a fused image are ensured, and the accuracy of circuit line inspection fault detection is improved;

First, given a global image I, set Indicating that its last convolutional layer in the global branch is in spaceLocation, th/>Activation value of individual channel,/>，/>Total number of channels representing the last convolutional layer feature in a global branch,/>Representing global branches, first for arbitrary locations/>Activation value/>Taking absolute value and then obtaining the visual attention heat map/>, of the feature by taking the maximum value in the direction of the feature channel，

，

The magnitude of the value directly reflects/>The importance of the location features to the classification of the hidden trouble,

The image local discrimination area positioning method under the guidance of the attention mechanism designs a binary template for positioning a local area with larger activation value in a heat map, firstly, in a characteristic heat mapIf any position/>The activation value in the heat map is greater than a predetermined threshold/>The corresponding position value in the template is set to 1, otherwise, is set to 0, in particular

，

Wherein,For controlling the size of the attention area,/>The larger the local region acquired, the smaller the/>The smaller the acquired local area the larger, in extreme cases if/>Is 0 due to/>The templates are all 1, the local image and the global image acquired at the moment are the same, if/>1, Due to/>The templates are all 0, and the local discrimination area cannot be obtained at this time, and therefore, it is set to/>In order to cover all areas with larger activation values, a maximum communication area which can cover all areas with larger activation values in a template is acquired on a characteristic heat map by using a related method, and the position information of the maximum communication area is expressed as:

，

Wherein, ，/>The upper left and lower right corner coordinates of the maximum communication area in the feature map,

Finally, the local discrimination area is cut out in the input image I by utilizing RAnd then/>The method is adjusted to be of the same size, and by using the image local discrimination area positioning method under the guidance of the attention mechanism, the region or feature of interest in the image can be accurately positioned without processing the whole image, which is helpful to improve positioning accuracy and efficiency.

2. The method for detecting the target of the inspection image of the unmanned aerial vehicle of the power distribution line and intelligently identifying hidden danger according to claim 1, wherein the construction of a typical target detection model under a complex background driven by large model semantics, reading the preprocessed image, and extracting the characteristics of the image, specifically, for the input visible light image characteristics, comprises the following steps:

Performing DCNv function processing on the scaled features;

based on the designed superimposition criteria, an image is output.

3. The method for detecting the target of the inspection image of the unmanned aerial vehicle for the power distribution line and intelligently identifying hidden dangers according to claim 2, wherein the steps for extracting the multi-scale characteristics of the infrared image are as follows:

4. The method for detecting the inspection image target and intelligently identifying hidden danger of the unmanned aerial vehicle of the power distribution line according to claim 3, wherein the method for fusing the visible light and the infrared features by using a multi-mode bidirectional feature fusion algorithm comprises the following specific steps:

5. The method for detecting the target of the inspection image and intelligently identifying hidden dangers of the unmanned aerial vehicle for the power distribution line according to claim 4, wherein the target detection based on the attention of the trigeminal anvils is used, and comprises the following specific steps: