CN114419413A - Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network - Google Patents

Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network Download PDF

Info

Publication number
CN114419413A
CN114419413A CN202210031802.3A CN202210031802A CN114419413A CN 114419413 A CN114419413 A CN 114419413A CN 202210031802 A CN202210031802 A CN 202210031802A CN 114419413 A CN114419413 A CN 114419413A
Authority
CN
China
Prior art keywords
insulator
receptive field
characteristic
convolution operation
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210031802.3A
Other languages
Chinese (zh)
Inventor
朱新山
曾筠婷
郭志民
李斌
王帅
屈璐瑶
李亚霖
刘昊
田杨阳
毛万登
赵健
贺翔
张小斐
袁少光
耿俊成
马斌
魏小昭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Henan Electric Power Co Ltd
Original Assignee
Tianjin University
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Henan Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University, State Grid Corp of China SGCC, Electric Power Research Institute of State Grid Henan Electric Power Co Ltd filed Critical Tianjin University
Priority to CN202210031802.3A priority Critical patent/CN114419413A/en
Publication of CN114419413A publication Critical patent/CN114419413A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0004Industrial image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Abstract

A receptive field self-adaptive transformer substation insulator defect detection neural network construction method is based on transformer substation inspection images in real scenes, adopts a defect simulation technology to generate an insulator defect sample, constructs a defect detection neural network, adopts a ResNest network to extract features, obtains three feature maps with different resolutions, and performs multi-scale feature fusion in a channel splicing mode through FPN to realize information interaction across feature layers. And sending the characteristic graphs with different resolutions generated by the FPN network into a receptive field self-adaptive selection module to learn the receptive fields required by targets with different scales, and finally inputting the receptive fields into a detection head to obtain category prediction and frame regression prediction results. By adopting the method, the image characteristics which take deep semantic information and shallow detail information into consideration can be fully extracted, the receptive field of the detection head is dynamically adjusted, and the expression capability of the characteristics is enhanced, so that the precision and robustness of target detection are improved, and more accurate detection is realized.

Description

Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network
Technical Field
The invention belongs to the technical field of power inspection, and particularly relates to a method for constructing a transformer substation insulator defect detection neural network with adaptive receptive field.
Background
Along with comprehensive propulsion of smart power grid construction and continuous expansion of power grid scale, the detection task amount increases day by day, and traditional manual patrol has the problems of high labor intensity, low working efficiency, strong dependence on personnel ability and working state and the like, and the application of the unmanned helicopter and the patrol robot greatly improves the patrol efficiency and precision, ensures the real-time performance of monitoring, and simultaneously generates a large number of patrol images. The demand of the on-line detection technology is increased, and the accurate identification and positioning of the target equipment become the key of the detection technology. The insulator is one of important elements for guaranteeing the safe operation of the power system, so the construction method of the transformer substation insulator defect detection network has important practical engineering value.
In the prior art, the insulator defect detection algorithms are mainly divided into two categories: a conventional method based on an image processing technology and a method based on deep learning.
Based on the traditional method of the image processing technology, the characteristics of the shape, the color, the texture and the like of the target are extracted through a characteristic extraction algorithm, and then the target detection is realized through a characteristic matching algorithm or by using a classifier. The main problems of the conventional method are as follows: the traditional method needs enough prior knowledge and can only identify single characteristics, so that the routing inspection image processing effect of complex background, light variation and variable shooting angles is poor, the robustness is poor, and the requirement for monitoring the running state of the insulator in real time is difficult to meet. Secondly, the traditional image processing methods all need to manually set parameters, the parameters are usually only suitable for the current sample set, and the parameters may not be suitable for other data sets, so that the traditional methods are weak in generalization capability and cannot meet the requirement of online detection precision.
With the improvement of computer hardware performance, the expansion of data sets and the development of machine learning, the application of the deep learning algorithm to the target identification and detection problem becomes a hot spot of current research. Deep learning can realize end-to-end learning, the obtained characteristics have stronger generalization compared with the characteristics of the traditional manual design, methods of the deep learning in the field of target detection can be divided into a one-stage method and a two-stage method, the representative algorithms of the two-stage method are fast R-CNN (fast Region-based conditional Neural Network), and the one-stage method comprises a YOLO (Young Only Look one) algorithm, an SSD (Single Shot Multi Box Detector) and the like. The difference between the one-stage method and the two-stage method is that the two-stage method firstly generates a plurality of target candidate domains through a regional suggestion network (RPN), and then performs position regression and classification on feature matrixes of the candidate domains; the one-stage method does not need a network to generate a candidate domain, and directly generates a predicted position coordinate and a category probability. Generally, the one-stage method has better speed, and the two-stage method has higher accuracy. Further, a two-stage method for realizing the combination of insulator positioning and defect detection is characterized in that firstly, an image is input into a VGG (visual Geometry group) network for feature extraction to generate a feature map, the feature map is input into an RPN network to generate an insulator candidate region, and the insulator candidate region is cut and then sent into a ResNet-based fault detection network for defect identification. Adopting a convolution neural network fast R-CNN based on region suggestion to carry out insulator string detection, and carrying out secondary classification on the cut insulator regions through a U-net network to realize defect detection; the inspection image multi-target detection and positioning are realized based on the Faster R-CNN model, parameters are optimized by utilizing a regularization method to improve the detection speed, and a foundation is laid for insulator defect detection tasks. However, the above three methods all use a two-stage network, so the detection speed is slow.
The one-stage based method achieves a balance between detection speed and accuracy. Based on the improved insulator detection network of the SSD, the ResNet network replaces the original VGGNet in the SSD structure to extract the features, and the method has improved precision. An improved lightweight network is constructed based on a YOLOV3 detection architecture, and a prior frame is determined by using a K-means + + algorithm, so that the prior frame is more suitable for the length-width ratio and the size of an insulator, but the detection precision is inevitably reduced due to the reduction of network parameters in the method. An insulator string diagnosis method based on improved YOLOv3 adopts a Focal local function and a balanced cross entropy function to improve a Loss function of YOLOv3 and adopts a multi-stage transfer learning training mode, but the method can only realize classification of normal and defective insulator strings and cannot identify specific defective parts.
The target detection technology based on deep learning can continuously optimize network parameters and search for an optimal solution, solves the problem of dependence of a traditional detection method on set parameters, is high in robustness and generalization capability, and has a far better application effect in the field of target detection than the traditional method. However, the existing insulator defect data sets are all aerial photography backgrounds, most of which are easily distinguished grasslands and mountain forest backgrounds, and are all of one type of explosion defect. At present, an insulator defect data set for substation inspection is still lacked, the overhead shooting background for substation inspection often contains more similar targets than the aerial shooting background, and the difficulty of target detection is increased. Moreover, the existing insulator defect detection method has low efficiency of a feature fusion part, and the detection head can only carry out fixed receptive field feature extraction on feature maps of all detection layers of the FPN, so that the problems of missing detection, false detection and the like are easy to occur when the defects of different scales of the insulator are detected.
Disclosure of Invention
In order to solve the defects in the prior art, the invention aims to provide a transformer substation insulator defect detection neural network construction method for adaptive selection of a receptive field, and the method is a one-stage method for improving a FPN structure and a receptive field adaptive selection module by combining a Resnest (Split-Attention Networks) network, so that the precision is improved on the premise of not obviously increasing the number of parameters, and the defect detection of the transformer substation insulator is realized.
The invention adopts the following technical scheme.
The method for constructing the receptive field self-adaptive transformer substation insulator defect detection neural network comprises the following steps:
step 1, collecting a patrol image of a transformer substation in a real scene, and extracting an insulator image from the patrol image;
step 2, based on a defect simulation technology, performing data enhancement on the insulator image to generate a defect insulator sample image; in the defective insulator sample image, the ratio of the number of defective insulator pieces to the number of normal insulator pieces is 1: 1; constructing a data set for detecting the defects of the insulator of the transformer substation by using the defective insulator sample image;
step 3, constructing a defect detection neural network, wherein the defect detection neural network comprises the following steps: based on a backbone network of a ResNest residual module, an FPN network spliced by channels, a receptive field self-adaptive selection module and a detection head are adopted;
and 4, training the defect detection neural network by using the data set constructed in the step 2, and taking the trained defect detection neural network as a transformer substation insulator defect detection neural network based on receptive field self-adaption.
Preferably, in step 1, the inspection image includes a plurality of power device images, an insulator image is obtained after cutting the image, and an original data set is constructed by using the insulator image.
Preferably, step 2 comprises:
step 2.1, based on a statistical method, taking the pixel color with the largest proportion in the insulator image as a defect filling color;
step 2.2, calculating the maximum value of the inter-class variance of the background and the target in the insulating sub-image according to the following relational expression:
Figure BDA0003466703930000031
in the formula (I), the compound is shown in the specification,
Figure BDA0003466703930000032
the maximum value of the between-class variance of the background and the target in the insulator image is obtained, and the gray value t corresponding to the maximum value of the between-class variance is used as a segmentation threshold value of the background and the target in the insulator image;
P0(t) and P1(t) the ratio of the number of background and target pixels in the insulator image to the total image, H0(t) and H1(t) average gray values of the background and the target in the insulator image respectively;
step 2.3, the gray value interval of the insulator image is [0, m ], and the gray value interval of the insulator image is divided into a first interval [0, t ] and a second interval [ t +1, m ] by a division threshold t;
step 2.4, segmenting the background and the target in the insulator image according to the first interval and the second interval of the insulator image gray value by the following relational expression:
Figure BDA0003466703930000041
in the formula (I), the compound is shown in the specification,
f(i,j)to divide the gray value of the ith row and the jth column in the previous insulator image,
P(i,j)dividing the insulator image into a gray value of the ith row and the jth column in the divided insulator image;
satisfy P(i,j)The region 1 is an insulator region image which is a target segmented from the insulator image;
step 2.5, based on a Radon transformation method, performing line integration on the insulator region image along different directions by using straight lines with different intercept, and taking a straight line corresponding to the maximum value of the line integration as a main shaft of the insulator region;
step 2.6, parallel lines are respectively made on the positions, at distances dist from the main shaft, of the two sides of the main shaft in the insulator area, and the outer sides of the parallel lines are defined as defect generation areas; wherein the value of dist is 0.5 times the radius of the insulator sheet;
step 2.7, randomly generating defects in the defect generation area, and filling the defects by using defect filling colors to obtain defect insulator images; wherein the randomly generated defect number in the defect generation area is 3-7;
and 2.8, performing filtering smoothing treatment on the defective insulator image by using a Gaussian filter to generate a defective insulator sample image.
Preferably, in step 2, the data set for detecting the insulator defect of the substation is divided into a training set and a test set according to a ratio of 4: 1.
Preferably, in the defect detection neural network, a main network based on a ResNest residual module is used for carrying out feature extraction on a defective insulator sample image to obtain three feature maps with different resolutions; improving a fusion mode in the FPN network, namely performing channel splicing on three feature graphs with different resolutions to obtain a first feature fusion graph, a second feature fusion graph and a third feature fusion graph, and simultaneously performing continuous dimensionality reduction on the feature graph with the lowest resolution in the three feature graphs with different resolutions to obtain a fourth feature fusion graph and a fifth feature fusion graph;
inputting the five feature fusion graphs into a receptive field self-adaptive selection module to obtain a receptive field self-adaptive feature graph; and inputting the receptive field self-adaptive characteristic diagram into a detection head to obtain a category prediction result and a frame regression prediction result of the insulator defect detection boundary frame.
Preferably, in step 3, the backbone network based on the reseest residual module includes N layers of reseest residual modules, and each residual module includes a direct connection path and an indirect connection path; wherein, the direct connection path directly inputs the characteristics output by the residual error module of the previous stage to the residual error module of the current stage without processing the characteristics; the indirect connection path performs cyclic convolution operation on the characteristics output by the last-stage residual error module, and inputs the characteristic residual error obtained by the convolution operation into the current-stage residual error module; the final output of the backbone network is the sum of the characteristic residuals transmitted by the direct connection path and the indirect connection path.
Preferably, the backbone network comprises a 33-layer reseest residual module;
outputting a first feature map by a layer 7 residual module; outputting a second feature map by a layer 30 residual module; the third feature map is output by the residual module of layer 33.
Preferably, the resolution of the first feature map is 40 × 40 × 512, the resolution of the second feature map is 20 × 20 × 1024, and the resolution of the third feature map is 10 × 10 × 2048.
Preferably, in the FPN network in step 3, the channel splicing is performed on each feature map according to the following relation:
Figure BDA0003466703930000051
in the formula (I), the compound is shown in the specification,
Plfor the l-th feature fusion map obtained by channel splicing, l is 1,2,3, i.e., P1As a first feature fusion map, P2As a second feature fusion map, P3A third feature fusion map;
Conv1×1is a convolution operation of 1 x 1, and the convolution operation,
Conv3×3is a convolution operation of 3 x 3, and the convolution operation,
concat is the number of splicing operations that are performed,
up is a linear interpolation 2 times Up-sampling operation,
Clfor the l dimension reduction characteristic diagram after the channel dimension reduction, l is 1,2,3, and the following relation is satisfied:
Cl=Conv1×1(Fl)
in the formula, FlThe first characteristic diagram output by the backbone network; i.e., F1Is a first characteristic diagram, F2Is a second characteristic diagram, F3Is a third characteristic diagram.
Preferably, in the FPN network of step 3, a third feature map F output to the backbone network is also output3Performing a convolution operation of 3 × 3 to obtain a fourth bitSign the fusion map P4Then, for the fourth feature fusion map P4Performing a convolution operation of 3 × 3 to obtain a fifth feature fusion map P5
Preferably, in step 3, the detecting steps of the receptive field adaptive selection module and the detecting head include:
3.1, respectively carrying out dimensionality reduction on the first to fifth feature fusion graphs based on 1 multiplied by 1 convolution operation to obtain first to fifth dimensionality reduction feature fusion graphs;
step 3.2, extracting first induced field features from the first to fifth dimension reduction feature fusion graphs based on 1 multiplied by 1 convolution operation; extracting second receptive field characteristics from the first dimension-reduction characteristic fusion graph to the fifth dimension-reduction characteristic fusion graph based on the first cavity convolution operation; extracting a third receptive field feature from the first to fifth dimension-reduction feature fusion graphs based on a second cavity convolution operation; wherein the first hole convolution operation is a 3 × 3 convolution operation with a hole rate of 2, and the second hole convolution operation is a 3 × 3 convolution operation with a hole rate of 4;
step 3.3, aggregating the first receptive field characteristic, the second receptive field characteristic and the third receptive field characteristic in a channel splicing mode to obtain a first aggregated receptive field characteristic;
step 3.4, performing 1 × 1 convolution operation on the first aggregation receptive field characteristic, and fusing multi-channel information to obtain a first fused receptive field characteristic;
step 3.5, extracting a fourth receptive field characteristic from the first fusion receptive field characteristic based on 1 multiplied by 1 convolution operation; meanwhile, extracting a fifth receptive field characteristic from the first fusion receptive field characteristic based on a third cavity convolution operation; extracting a sixth receptive field feature from the first fusion receptive field feature based on a fourth cavity convolution operation; wherein the third hole convolution operation is a 3 × 3 convolution operation with a hole rate of 6, and the fourth hole convolution operation is a 3 × 3 convolution operation with a hole rate of 12;
step 3.6, polymerizing the fourth receptive field characteristic, the fifth receptive field characteristic and the sixth receptive field characteristic in a channel splicing mode to obtain a second polymerized receptive field characteristic;
step 3.7, fusing multi-channel information on the second polymerization receptive field characteristic based on 1 multiplied by 1 convolution operation; based on 1 × 1 convolution operation, performing channel number transformation on the fused second polymerization receptive field characteristic, wherein the channel number obtained by transformation is consistent with the first to fifth characteristic fusion graphs;
step 3.8, adding the second polymerized receptive field characteristic subjected to fusion and channel transformation with the first to fifth characteristic fusion graphs to obtain a receptive field self-adaptive characteristic graph;
step 3.9, activating the receptive field self-adaptive characteristic map by a ReLU activation function and then outputting the receptive field self-adaptive characteristic map to a detection head, and outputting a category prediction result and a frame regression prediction result of the insulator defect detection boundary frame by the detection head;
the detection head comprises a classification branch and a regression branch, the classification branch outputs a classification prediction result, and the regression branch outputs a frame regression prediction result.
Preferably, in step 3.9, the regression branch and the classification branch each include four convolutional layers; the convolution kernel size in each convolution layer is 3, and the step length is 1;
and the parameters between the regression branch and the classification branch are not shared.
Preferably, step 3 further comprises:
step 3.10, setting the overlapping degree threshold of the insulator defect detection boundary frame output by the detection head to be 0.5, and screening redundant insulator defect detection boundary frames by using a Soft-nms non-maximum value inhibition method, wherein the following relational expression is satisfied:
Figure BDA0003466703930000071
in the formula (I), the compound is shown in the specification,
Sdd is the confidence of the D-th insulator defect detection boundary frame, D is 1,2, …, D is the total number of the insulator defect detection boundary frames,
bdfor the d-th insulator defect detection bounding box,
m is the insulator defect detection boundary box with the highest confidence coefficient,
Iou(m,bd) The overlapping degree of the d-th insulator defect detection boundary frame and the insulator defect detection boundary frame with the highest confidence coefficient is obtained;
when Iou (m, b)d)>And when the confidence coefficient of the d-th insulator defect detection boundary box is exponentially attenuated at 0.5.
Preferably, in step 4, training a defect detection neural network by using a random gradient descent algorithm and a learning rate preheating method;
in the first 500 iterations, the learning rate increases; at the 16 th and 22 nd rounds, the learning rate was decreased to 10% of the initial value of the learning rate, the batch size was set to 8, and the training was stopped until the loss was below 0.1.
Compared with the prior art, the method has the advantages that the method can obtain good balance between precision and speed, and realizes the fine detection of the insulator sheet in the scene of the transformer substation; important features can be learned and concerned, irrelevant features can be suppressed, network convergence speed is increased, and detection precision is improved; the method can self-adaptively adjust the receptive field, obtain higher precision for targets with different sizes and enhance the generalization capability of the network.
Drawings
FIG. 1 is a block diagram of the steps of a neural network construction method for detecting defects of insulator of transformer substation with adaptive receptive field according to the present invention;
FIG. 2 is a schematic diagram of a neural network for defect detection according to an embodiment of the present invention;
the reference numerals in fig. 2 are explained as follows:
1-backbone network based on the ResNest residual module; 2-adopting a channel spliced FPN network; 3-receptive field adaptive selection module; 4-a detection head;
layer 0-first wrapping layer; layer 1-second convolution layer; layer 2-third convolution layer; layer 3-fourth convolution layer; layer 4-the fifth convolutional layer;
Figure BDA0003466703930000081
-channel splicing;
FIG. 3 is a schematic diagram of an FPN network according to an embodiment of the present invention;
fig. 4 is a schematic diagram of a receptive field adaptive selection module according to an embodiment of the invention.
Detailed Description
The present application is further described below with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and the protection scope of the present application is not limited thereby.
Referring to fig. 1, a neural network construction method for detecting defects of insulator in transformer substation with adaptive receptive field includes steps 1 to 4.
Step 1, collecting a patrol inspection image of a transformer substation in a real scene, and extracting an insulator image from the patrol inspection image.
Specifically, in the step 1, the inspection image comprises a plurality of power equipment images, an insulator image is obtained after the images are cut, and an original data set is constructed by using the insulator image.
Step 2, based on a defect simulation technology, performing data enhancement on the insulator image to generate a defect insulator sample image; in the defective insulator sample image, the ratio of the number of defective insulator pieces to the number of normal insulator pieces is 1: 1; and constructing a data set for detecting the defects of the insulator of the transformer substation by using the defective insulator sample image.
Specifically, step 2 comprises:
step 2.1, based on a statistical method, taking the pixel color with the largest proportion in the insulator image as a defect filling color;
step 2.2, calculating the maximum value of the inter-class variance of the background and the target in the insulating sub-image according to the following relational expression:
Figure BDA0003466703930000082
in the formula (I), the compound is shown in the specification,
Figure BDA0003466703930000083
maximum between class variance for background and target in insulator imageAnd the gray value t corresponding to the maximum value of the inter-class variance is used as a segmentation threshold value of the background and the target in the insulator image;
P0(t) and P1(t) the ratio of the number of background and target pixels in the insulator image to the total image, H0(t) and H1(t) average gray values of the background and the target in the insulator image respectively;
step 2.3, the gray value interval of the insulator image is [0, m ], and the gray value interval of the insulator image is divided into a first interval [0, t ] and a second interval [ t +1, m ] by a division threshold t;
step 2.4, segmenting the background and the target in the insulator image according to the first interval and the second interval of the insulator image gray value by the following relational expression:
Figure BDA0003466703930000091
in the formula (I), the compound is shown in the specification,
f(i,j)to divide the gray value of the ith row and the jth column in the previous insulator image,
P(i,j)dividing the insulator image into a gray value of the ith row and the jth column in the divided insulator image;
satisfy P(i,j)The region 1 is an insulator region image which is a target segmented from the insulator image;
step 2.5, based on a Radon transformation method, performing line integration on the insulator region image along different directions by using straight lines with different intercept, and taking a straight line corresponding to the maximum value of the line integration as a main shaft of the insulator region;
step 2.6, parallel lines are respectively made on the positions, at distances dist from the main shaft, of the two sides of the main shaft in the insulator area, and the outer sides of the parallel lines are defined as defect generation areas; wherein the value of dist is 0.5 times the radius of the insulator sheet;
step 2.7, randomly generating defects in the defect generation area, and filling the defects by using defect filling colors to obtain defect insulator images; wherein the randomly generated defect number in the defect generation area is 3-7;
and 2.8, performing filtering smoothing treatment on the defective insulator image by using a Gaussian filter to generate a defective insulator sample image.
In the preferred embodiment of the present invention, in order to weaken the edge burrs of the defect circle and make the connection between the target and the background more natural, the generated defect image is finally filtered and smoothed by a gaussian filter with a gaussian kernel of (5 × 5) size.
Specifically, in step 2, the data set for detecting the insulator defect of the transformer substation is divided into a training set and a test set according to the ratio of 4: 1.
It should be noted that in the preferred embodiment of the present invention, the division ratio of the training set and the test set is a non-limiting preferred choice, and those skilled in the art can select different division ratios according to the training requirements of the model and the size of the data set.
Step 3, constructing a defect detection neural network, wherein the defect detection neural network comprises the following steps: the method comprises a main network based on a ResNest residual module, an FPN network spliced by channels, a receptive field self-adaptive selection module and a detection head.
In the defect detection neural network shown in fig. 2, a main network based on a reseest residual module is used for carrying out feature extraction on a defect insulator sample image to obtain three feature maps F with different resolutions1、F2、F3(ii) a The method improves the fusion mode in the FPN network by adding, namely three characteristic graphs F with different resolutions1、F2、F3Channel splicing is carried out to obtain a first feature fusion graph P1The second feature fusion map P2And a third feature fusion map P3Simultaneously, for the third feature map F with the lowest resolution in the three feature maps with different resolutions3Continuously reducing dimensions to obtain a fourth feature fusion graph P4And a fifth feature fusion map P5
Inputting the five feature fusion graphs into a receptive field self-adaptive selection module to obtain a receptive field self-adaptive feature graph; and inputting the receptive field self-adaptive characteristic diagram into a detection head to obtain a category prediction result and a frame regression prediction result of the insulator defect detection boundary frame.
Specifically, in step 3, the backbone network of the defect detection neural network includes N layers of detect residual modules, and each residual module includes a direct connection path and an indirect connection path; wherein, the direct connection path directly inputs the characteristics output by the residual error module of the previous stage to the residual error module of the current stage without processing the characteristics output by the residual error module of the previous stage; the indirect connection path performs cyclic convolution operation on the characteristics output by the last-stage residual error module, and the characteristic residual error obtained by the convolution operation is input to the current-stage residual error module; the final output of the backbone network is the sum of the characteristic residuals transmitted by the direct connection path and the indirect connection path.
In the preferred embodiment of the present invention, a reseest network is constructed according to a ResNet structural paradigm, and the network parameter configuration is shown in table 1.
The third column of table 1 represents convolution layer parameters, brackets in each group represent a reseest residual module, a split-attention module is a core module of the reseest network, the split-attention module replaces the second convolution operation in the resenet block, and information interaction across feature graph groups is realized through combination of grouped convolution and attention mechanism. The residual error module comprises two paths, wherein one path is a direct connection path of the input features, the other path is a residual error of the input features obtained by performing three times of convolution operation in brackets on the input features, and finally the features on the two paths are added to obtain the output of the ResNest network. In table 1, the first convolutional layer0 has no reseest residual module, and 64 convolutional kernels with the size of 7 × 7 and the step size (stride) of 2 are used; the second convolutional layer1 includes 3 reseest residual modules, in each of which the first convolutional layer is 64 (1 × 1) convolutional kernels, the second convolutional layer is a split-attribute module, and the third convolutional layer is 256 (1 × 1) convolutional kernels; and, max pool in the second convolutional layer1 represents the maximum pooling operation with a pooling unit size of (3 × 3) and a step size of 2.
Table 1 parameters of the reseest network according to an embodiment of the present invention
Figure BDA0003466703930000111
As can be seen from table 1, in the preferred embodiment of the present invention, the backbone network includes 33 layers of reseest residual modules; outputting a first feature map F by a layer 7 residual error module1(ii) a Outputting a second feature map F by a layer 30 residual module2(ii) a Outputting a third feature map F by a residual module of layer 333
Specifically, the first characteristic diagram F1Has a resolution of 40 × 40 × 512, and a second feature map F2Has a resolution of 20X 1024, and a third feature map F3Has a resolution of 10 × 10 × 2048.
Specifically, in the FPN network shown in fig. 3, the feature maps are channel-spliced according to the following relationship:
Figure BDA0003466703930000112
in the formula (I), the compound is shown in the specification,
Plfor the l-th feature fusion map obtained by channel splicing, l is 1,2,3, i.e., P1As a first feature fusion map, P2As a second feature fusion map, P3A third feature fusion map;
Conv1×1is a convolution operation of 1 x 1, and the convolution operation,
Conv3×3is a convolution operation of 3 x 3, and the convolution operation,
concat is the number of splicing operations that are performed,
up is a linear interpolation 2 times Up-sampling operation,
Clfor the l dimension reduction characteristic diagram after the channel dimension reduction, l is 1,2,3, and the following relation is satisfied:
Cl=Conv1×1(Fl)
in the formula, FlThe first characteristic diagram output by the backbone network; i.e., F1Is a first characteristic diagram, F2Is a second characteristic diagram, F3Is a third characteristic diagram.
In the preferred embodiment of the invention, three characteristic maps are usedThe dimension of the channel is reduced to 256, and a first dimension reduction feature map C is generated1(256 × 40 × 40), second dimension reduction feature map C2(256 × 20 × 20), third dimension reduction feature map C3(256 × 10 × 10). Then the third feature map F3Generating a third feature fusion map P after 3 x 3 convolution3Fusing the third feature to the graph P3Expanding resolution to be equal to that of the second dimension reduction characteristic diagram C through linear interpolation 2 times upsampling operation2The third feature is fused into the graph P3And a second dimension reduction feature map C2Splicing and fusing on channel dimension, and obtaining a second feature fusion graph P through 1 × 1 convolution2. The linear combination of information on the same pixel position with different channel dimensions can be obtained by performing 1 × 1 convolution operation on each spliced and fused feature map, and cross-channel information interaction is realized.
In the method provided by the invention, the fusion mode of adding elements in the FPN network is changed into the multi-channel splicing mode, so that the first to third feature fusion graphs spliced by channels can be obtained, and the technical problems that a main network lacks cross-channel interaction information and the efficiency of a feature fusion part is poor in the prior art are solved.
Specifically, in the FPN network of step 3, a third feature map F output to the backbone network is also output3Performing a convolution operation of 3 × 3 to obtain a fourth feature fusion map P4Then, for the fourth feature fusion map P4Performing a convolution operation of 3 × 3 to obtain a fifth feature fusion map P5
Preferably, in step 3, as shown in fig. 4, the detecting steps of the receptive field adaptive selection module and the detecting head include:
3.1, respectively carrying out dimensionality reduction on the first to fifth feature fusion graphs based on 1 multiplied by 1 convolution operation to obtain first to fifth dimensionality reduction feature fusion graphs; in fig. 4, the input amount X is the first to fifth feature fusion maps;
step 3.2, extracting first induced field features from the first to fifth dimension reduction feature fusion graphs based on 1 multiplied by 1 convolution operation; extracting second receptive field characteristics from the first dimension-reduction characteristic fusion graph to the fifth dimension-reduction characteristic fusion graph based on the first cavity convolution operation; extracting a third receptive field feature from the first to fifth dimension-reduction feature fusion graphs based on a second cavity convolution operation; wherein the first hole convolution operation is a 3 × 3 convolution operation with a hole rate (rate) of 2, and the second hole convolution operation is a 3 × 3 convolution operation with a hole rate of 4;
step 3.3, aggregating the first receptive field characteristic, the second receptive field characteristic and the third receptive field characteristic in a channel splicing mode to obtain a first aggregated receptive field characteristic;
step 3.4, performing 1 × 1 convolution operation on the first aggregation receptive field characteristic, and fusing multi-channel information to obtain a first fused receptive field characteristic;
step 3.5, extracting a fourth receptive field characteristic from the first fusion receptive field characteristic based on 1 multiplied by 1 convolution operation; meanwhile, extracting a fifth receptive field characteristic from the first fusion receptive field characteristic based on a third cavity convolution operation; extracting a sixth receptive field feature from the first fusion receptive field feature based on a fourth cavity convolution operation; wherein the third hole convolution operation is a 3 × 3 convolution operation with a hole rate of 6, and the fourth hole convolution operation is a 3 × 3 convolution operation with a hole rate of 12;
step 3.6, polymerizing the fourth receptive field characteristic, the fifth receptive field characteristic and the sixth receptive field characteristic in a channel splicing mode to obtain a second polymerized receptive field characteristic;
step 3.7, fusing multi-channel information on the second polymerization receptive field characteristic based on 1 multiplied by 1 convolution operation; based on 1 × 1 convolution operation, performing channel number transformation on the fused second polymerization receptive field characteristic, wherein the channel number obtained by transformation is consistent with the first to fifth characteristic fusion graphs;
step 3.8, adding the second polymerized receptive field characteristic subjected to fusion and channel transformation with the first to fifth characteristic fusion graphs to obtain a receptive field self-adaptive characteristic graph;
step 3.9, activating the receptive field self-adaptive characteristic map by a ReLU activation function and then outputting the receptive field self-adaptive characteristic map to a detection head, and outputting a category prediction result and a frame regression prediction result of the insulator defect detection boundary frame by the detection head; in fig. 4, the receptive field adaptive feature map is an output quantity Y after being activated by the ReLU activation function;
the detection head comprises a classification branch and a regression branch, the classification branch outputs a classification prediction result, and the regression branch outputs a frame regression prediction result.
Wherein the ReLU activation function is expressed as: f (x) max (0, x).
Preferably, in step 3.9, the regression branch and the classification branch each include four convolutional layers; the convolution kernel size in each convolution layer is 3, and the step length is 1;
and the parameters between the regression branch and the classification branch are not shared.
Preferably, step 3 further comprises:
step 3.10, setting the overlapping degree threshold of the insulator defect detection boundary frame output by the detection head to be 0.5, and screening redundant insulator defect detection boundary frames by using a Soft-nms non-maximum value inhibition method, wherein the following relational expression is satisfied:
Figure BDA0003466703930000141
in the formula (I), the compound is shown in the specification,
Sdd is the confidence of the D-th insulator defect detection boundary frame, D is 1,2, …, D is the total number of the insulator defect detection boundary frames,
bdfor the d-th insulator defect detection bounding box,
m is the insulator defect detection boundary box with the highest confidence coefficient,
Iou(m,bd) The overlapping degree of the d-th insulator defect detection boundary frame and the insulator defect detection boundary frame with the highest confidence coefficient is obtained;
when Iou (m, b)d)>And when the confidence coefficient of the d-th insulator defect detection boundary box is exponentially attenuated at 0.5.
The detection head uses a Soft-nms non-maximum value inhibition method to remove repeated detection frames in a mode of reducing confidence, so that the detection performance of the network on dense targets can be improved.
Further, in step 4.9, the regression branch and the classification branch both include four convolutional layers; the convolution kernel size in each convolution layer is 3, and the step length is 1;
and the parameters between the regression branch and the classification branch are not shared.
In a preferred embodiment of the present invention, the defect detecting neural network shown in fig. 2 includes 5 detecting heads, the first to fifth feature fusion maps with different receptive field features are respectively input to one detecting head, and parameters of the 5 detecting heads are shared. Each detection head consists of two branches of regression and classification, each branch consisting of four (3 × 3) convolutional layers, the parameters between the regression branch and the classification branch are not shared. And (4) predicting the category of the anchor frame of each pixel point by the classification branch, and predicting the offset of the coordinate, the width and the height of the center point of the anchor frame relative to the anchor frame by frame regression. For feature maps with different sizes, anchor frames with different sizes and shapes are adopted in the detection head, the low feature layer is small in receptive field and suitable for detecting small targets, the high feature layer is large in receptive field and suitable for detecting large targets, and the size of the anchor frames is increased along with the increase of the levels. In order to ensure that the anchor frame can adapt to targets with different sizes and shapes, 3 scales and 3 different length-width ratios are respectively set, and 9 anchor frames are generated by one pixel point. Specific parameter settings are as in table 2:
table 2 anchor frame parameter configuration for one embodiment of the present invention
Figure BDA0003466703930000142
In the preferred embodiment of the invention, a plurality of detection heads are used for detecting a plurality of characteristics of different receptive fields at the same time, so that the technical problems that in the prior art, a detection head can only extract the characteristics of the fixed receptive field from a characteristic diagram with each resolution, and missing detection, false detection and the like are easy to occur when detecting defects of different scales of the insulator are solved.
And 4, training the defect detection neural network by using the data set constructed in the step 2, and taking the trained defect detection neural network as a transformer substation insulator defect detection neural network based on receptive field self-adaption.
Specifically, in step 4, training a defect detection neural network by using a random gradient descent algorithm and a learning rate preheating method;
the training optimizer adopts SGD optimizer, and the initial value of learning rate is set to 1 × 10-3Using a linear growth strategy, the learning rate increases in the previous 500 iterations; at the 16 th and 22 nd rounds, the learning rate was decreased to 10% of the initial value of the learning rate, the batch size was set to 8, and training was stopped until the loss was less than 0.1.
The main network adopts a pre-training model, freezes the parameters of stage1, does not perform gradient updating, detects the overfitting condition of the model which is enhanced and reduced by using data in network training, and adopts random mirror image inversion with the probability of 0.5. The software configuration of the system is python 3.7.10, pytorch1.8.1, cuda11.3, pycharm 2020.3; the hardware configuration is intel core i7-10750H 6 core 12 thread cpu, single video card: NVIDIA GeForce RTX-2060.
1688 images are used for training, 422 images are used for testing, the detection model obtained in the embodiment is compared with the classic algorithms SSD and RetinaNet, and the index of each type of detection Precision (AP) and the index of the mean Average Precision (mapp) are recorded, as shown in table 3.
TABLE 3 comparison of algorithmic Properties
Figure BDA0003466703930000151
As can be seen from the data in table 3, the SSD network based on VGG16 has a low accuracy, the detection accuracy of the classic RetinaNet network is slightly improved, but the improvement effect is small, and the method of the present invention is still improved by 2.16% on the basis of the RetinaNet network, thereby obtaining the highest detection accuracy.
To verify the effectiveness of the improved method, each improved portion of the inventive method was compared to the original algorithm, as shown in table 4.
TABLE 4 modified part comparison table
Figure BDA0003466703930000161
As can be seen from the data in table 4, the mAP index of the backbone network is improved by 1.83%, because resest can learn to focus on important features and suppress irrelevant features, compared with resenet, more effective features are extracted without increasing the number of the important features. After using the improved FPN structure on the basis of ResNest, mAP is increased by 0.13%, which proves that the improved FPN improves the expressive ability of the network through information interaction across the feature layer. On the basis of ResNest, an improved FPN structure and a receptive field self-adaptive selection module are combined, mAP is improved by 0.2%, and the excellent performance of the self-adaptive selection receptive field optimization technology in the multi-scale target detection problem is verified.
The present applicant has described and illustrated embodiments of the present invention in detail with reference to the accompanying drawings, but it should be understood by those skilled in the art that the above embodiments are merely preferred embodiments of the present invention, and the detailed description is only for the purpose of helping the reader to better understand the spirit of the present invention, and not for limiting the scope of the present invention, and on the contrary, any improvement or modification made based on the spirit of the present invention should fall within the scope of the present invention.

Claims (14)

1. A receptive field self-adaptive transformer substation insulator defect detection neural network construction method is characterized in that,
the method comprises the following steps:
step 1, collecting a patrol image of a transformer substation in a real scene, and extracting an insulator image from the patrol image;
step 2, based on a defect simulation technology, performing data enhancement on the insulator image to generate a defect insulator sample image; in the defective insulator sample image, the ratio of the number of defective insulator pieces to the number of normal insulator pieces is 1: 1; constructing a data set for detecting the defects of the insulator of the transformer substation by using the defective insulator sample image;
step 3, constructing a defect detection neural network, wherein the defect detection neural network comprises the following steps: a backbone network based on a ResNest residual module, an FPN network spliced by channels, a receptive field self-adaptive selection module and a detection head are adopted;
and 4, training the defect detection neural network by using the data set constructed in the step 2, and taking the trained defect detection neural network as a transformer substation insulator defect detection neural network based on receptive field self-adaption.
2. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 1,
in the step 1, the inspection image comprises a plurality of power equipment images, insulator images are obtained after the images are cut, and an original data set is constructed by utilizing the insulator images.
3. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 2,
the step 2 comprises the following steps:
step 2.1, based on a statistical method, taking the pixel color with the largest proportion in the insulator image as a defect filling color;
step 2.2, calculating the maximum value of the inter-class variance of the background and the target in the insulating sub-image according to the following relational expression:
Figure FDA0003466703920000011
in the formula (I), the compound is shown in the specification,
Figure FDA0003466703920000012
the maximum value of the between-class variance of the background and the target in the insulator image is obtained, and the gray value t corresponding to the maximum value of the between-class variance is used as a segmentation threshold value of the background and the target in the insulator image;
P0(t) and P1(t) the number of pixels of the background and the target in the insulator image is respectively the proportion of the whole image,
H0(t) andH1(t) average gray values of the background and the target in the insulator image respectively;
step 2.3, the gray value interval of the insulator image is [0, m ], and the gray value interval of the insulator image is divided into a first interval [0, t ] and a second interval [ t +1, m ] by a division threshold t;
step 2.4, segmenting the background and the target in the insulator image according to the first interval and the second interval of the insulator image gray value by the following relational expression:
Figure FDA0003466703920000021
in the formula (I), the compound is shown in the specification,
f(i,j)to divide the gray value of the ith row and the jth column in the previous insulator image,
P(i,j)dividing the insulator image into a gray value of the ith row and the jth column in the divided insulator image;
satisfy P(i,j)The region 1 is an insulator region image which is a target segmented from the insulator image;
step 2.5, based on a Radon transformation method, performing line integration on the insulator region image along different directions by using straight lines with different intercept, and taking a straight line corresponding to the maximum value of the line integration as a main shaft of the insulator region;
step 2.6, parallel lines are respectively made on the positions, at distances dist from the main shaft, of the two sides of the main shaft in the insulator area, and the outer sides of the parallel lines are defined as defect generation areas; wherein the value of dist is 0.5 times the radius of the insulator sheet;
step 2.7, randomly generating defects in the defect generation area, and filling the defects by using defect filling colors to obtain defect insulator images; wherein the randomly generated defect number in the defect generation area is 3-7;
and 2.8, performing filtering smoothing treatment on the defective insulator image by using a Gaussian filter to generate a defective insulator sample image.
4. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 3,
in the step 2, a data set for detecting the insulator defects of the transformer substation is divided into a training set and a testing set according to the ratio of 4: 1.
5. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 1,
in the defect detection neural network, a main network based on a ResNest residual module is used for carrying out feature extraction on a defect insulator sample image to obtain three feature maps with different resolutions; improving a fusion mode in the FPN network, namely performing channel splicing on three feature graphs with different resolutions to obtain a first feature fusion graph, a second feature fusion graph and a third feature fusion graph, and simultaneously performing continuous dimensionality reduction on the feature graph with the lowest resolution in the three feature graphs with different resolutions to obtain a fourth feature fusion graph and a fifth feature fusion graph;
inputting the five feature fusion graphs into a receptive field self-adaptive selection module to obtain a receptive field self-adaptive feature graph; and inputting the receptive field self-adaptive characteristic diagram into a detection head to obtain a category prediction result and a frame regression prediction result of the insulator defect detection boundary frame.
6. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 5,
in step 3, the backbone network based on the ResNest residual error module comprises N layers of ResNest residual error modules, and each residual error module comprises a direct connection path and an indirect connection path; wherein, the direct connection path directly inputs the characteristics output by the residual error module of the previous stage to the residual error module of the current stage without processing the characteristics output by the residual error module of the previous stage; the indirect connection path performs cyclic convolution operation on the characteristics output by the last-stage residual error module, and inputs the characteristic residual error obtained by the convolution operation into the current-stage residual error module; the final output of the backbone network is the sum of the characteristic residuals transmitted by the direct connection path and the indirect connection path.
7. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 6,
the backbone network comprises 33 layers of ResNest residual modules;
outputting a first feature map by a layer 7 residual module; outputting a second feature map by a layer 30 residual module; the third feature map is output by the residual module of layer 33.
8. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 7,
the resolution of the first feature map is 40 × 40 × 512, the resolution of the second feature map is 20 × 20 × 1024, and the resolution of the third feature map is 10 × 10 × 2048.
9. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 8,
in the FPN network of the step 3, channel splicing is carried out on each characteristic diagram according to the following relational expression:
Figure FDA0003466703920000031
in the formula (I), the compound is shown in the specification,
Plfor the l-th feature fusion map obtained by channel splicing, l is 1,2,3, i.e., P1As a first feature fusion map, P2As a second feature fusion map, P3A third feature fusion map;
Conv1×1is a convolution operation of 1 x 1, and the convolution operation,
Conv3×3is a convolution operation of 3 x 3, and the convolution operation,
concat is the number of splicing operations that are performed,
up is a linear interpolation 2 times Up-sampling operation,
Clfor the l dimension reduction characteristic diagram after the channel dimension reduction, l is 1,2,3, and the following relation is satisfied:
Cl=Conv1×1(Fl)
in the formula, FlThe first characteristic diagram output by the backbone network; i.e., F1Is a first characteristic diagram, F2Is a second characteristic diagram, F3Is a third characteristic diagram.
10. The method for constructing the neural network for detecting the defects of the insulator of the transformer substation with the adaptive receptive field according to the claim 9,
in the FPN network of the step 3, a third characteristic diagram F output to the backbone network is also output3Performing a convolution operation of 3 × 3 to obtain a fourth feature fusion map P4Then, for the fourth feature fusion map P4Performing a convolution operation of 3 × 3 to obtain a fifth feature fusion map P5
11. The method for constructing the neural network for detecting insulator defects of the transformer substation with adaptive receptive fields according to claim 9 or 10,
in step 3, the detection steps of the receptive field self-adaptive selection module and the detection head comprise:
3.1, respectively carrying out dimensionality reduction on the first to fifth feature fusion graphs based on 1 multiplied by 1 convolution operation to obtain first to fifth dimensionality reduction feature fusion graphs;
step 3.2, extracting a first receptive field characteristic from the first to fifth dimension-reduction characteristic fusion graphs based on 1 multiplied by 1 convolution operation; extracting second receptive field characteristics from the first dimension-reduction characteristic fusion graph to the fifth dimension-reduction characteristic fusion graph based on the first cavity convolution operation; extracting a third receptive field characteristic from the first dimension-reduction characteristic fusion graph to a fifth dimension-reduction characteristic fusion graph based on a second cavity convolution operation; wherein the first hole convolution operation is a 3 × 3 convolution operation with a hole rate of 2, and the second hole convolution operation is a 3 × 3 convolution operation with a hole rate of 4;
step 3.3, aggregating the first receptive field characteristic, the second receptive field characteristic and the third receptive field characteristic in a channel splicing mode to obtain a first aggregated receptive field characteristic;
step 3.4, performing 1 × 1 convolution operation on the first aggregation receptive field characteristic, and fusing multi-channel information to obtain a first fused receptive field characteristic;
step 3.5, extracting a fourth receptive field characteristic from the first fusion receptive field characteristic based on 1 multiplied by 1 convolution operation; meanwhile, extracting a fifth receptive field characteristic from the first fusion receptive field characteristic based on a third cavity convolution operation; extracting a sixth receptive field feature from the first fusion receptive field feature based on a fourth cavity convolution operation; wherein the third hole convolution operation is a 3 × 3 convolution operation with a hole rate of 6, and the fourth hole convolution operation is a 3 × 3 convolution operation with a hole rate of 12;
step 3.6, polymerizing the fourth receptive field characteristic, the fifth receptive field characteristic and the sixth receptive field characteristic in a channel splicing mode to obtain a second polymerized receptive field characteristic;
step 3.7, based on 1 × 1 convolution operation, performing multi-channel information fusion on the second polymerization receptive field characteristics; based on 1 × 1 convolution operation, performing channel number transformation on the fused second polymerization receptive field characteristic, wherein the channel number obtained by transformation is consistent with the first to fifth characteristic fusion graphs;
step 3.8, adding the second fusion receptive field characteristic after fusion and channel transformation with the first to fifth characteristic fusion graphs to obtain a receptive field self-adaptive characteristic graph;
step 3.9, activating the receptive field self-adaptive characteristic map by a ReLU activation function and then outputting the receptive field self-adaptive characteristic map to a detection head, and outputting a category prediction result and a frame regression prediction result of the insulator defect detection boundary frame by the detection head;
the detection head comprises a classification branch and a regression branch, the classification branch outputs a classification prediction result, and the regression branch outputs a frame regression prediction result.
12. The method for constructing a neural network for detecting insulator defects of a transformer substation with adaptive receptive fields according to claim 11,
in step 3.9, the regression branch and the classification branch both comprise four convolution layers; the convolution kernel size in each convolution layer is 3, and the step length is 1;
and the parameters between the regression branch and the classification branch are not shared.
13. The method for constructing a neural network for detecting insulator defects of a transformer substation with adaptive receptive fields according to claim 11,
step 3 also includes:
step 3.10, setting the overlapping degree threshold value of the insulator defect detection boundary frame output by the detection head to be 0.5, and screening redundant insulator defect detection boundary frames by using a Soft-nms non-maximum value inhibition method, wherein the following relational expression is satisfied:
Figure FDA0003466703920000051
in the formula (I), the compound is shown in the specification,
Sdd is the confidence of the D-th insulator defect detection boundary frame, D is 1,2, …, D is the total number of the insulator defect detection boundary frames,
bdfor the d-th insulator defect detection bounding box,
m is the insulator defect detection boundary box with the highest confidence coefficient,
Iou(m,bd) The overlapping degree of the d-th insulator defect detection boundary frame and the insulator defect detection boundary frame with the highest confidence coefficient is obtained;
when Iou (m, b)d)>And when the confidence coefficient of the d-th insulator defect detection boundary box is exponentially attenuated at 0.5.
14. The method for constructing a neural network for detecting insulator defects of a transformer substation with adaptive receptive fields according to claim 11,
step 4, training a defect detection neural network by using a random gradient descent algorithm and a learning rate preheating method;
in the first 500 iterations, the learning rate increases; at the 16 th and 22 nd rounds, the learning rate was decreased to 10% of the initial value of the learning rate, the batch size was set to 8, and the training was stopped until the loss was below 0.1.
CN202210031802.3A 2022-01-12 2022-01-12 Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network Pending CN114419413A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210031802.3A CN114419413A (en) 2022-01-12 2022-01-12 Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210031802.3A CN114419413A (en) 2022-01-12 2022-01-12 Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network

Publications (1)

Publication Number Publication Date
CN114419413A true CN114419413A (en) 2022-04-29

Family

ID=81273676

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210031802.3A Pending CN114419413A (en) 2022-01-12 2022-01-12 Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network

Country Status (1)

Country Link
CN (1) CN114419413A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115457553A (en) * 2022-11-11 2022-12-09 南京信息工程大学 Cloth defect detection method based on deep learning
CN115541601A (en) * 2022-11-30 2022-12-30 江苏永钢集团有限公司 Steel low-power digital detection method based on machine vision
CN116563237A (en) * 2023-05-06 2023-08-08 大连工业大学 Deep learning-based chicken carcass defect hyperspectral image detection method
CN117237286A (en) * 2023-09-02 2023-12-15 国网山东省电力公司淄博供电公司 Method for detecting internal defects of gas-insulated switchgear
CN117437188A (en) * 2023-10-17 2024-01-23 广东电力交易中心有限责任公司 Insulator defect detection system for smart power grid

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115457553A (en) * 2022-11-11 2022-12-09 南京信息工程大学 Cloth defect detection method based on deep learning
CN115541601A (en) * 2022-11-30 2022-12-30 江苏永钢集团有限公司 Steel low-power digital detection method based on machine vision
CN116563237A (en) * 2023-05-06 2023-08-08 大连工业大学 Deep learning-based chicken carcass defect hyperspectral image detection method
CN116563237B (en) * 2023-05-06 2023-10-20 大连工业大学 Deep learning-based chicken carcass defect hyperspectral image detection method
CN117237286A (en) * 2023-09-02 2023-12-15 国网山东省电力公司淄博供电公司 Method for detecting internal defects of gas-insulated switchgear
CN117437188A (en) * 2023-10-17 2024-01-23 广东电力交易中心有限责任公司 Insulator defect detection system for smart power grid

Similar Documents

Publication Publication Date Title
CN112884064B (en) Target detection and identification method based on neural network
CN111242208B (en) Point cloud classification method, segmentation method and related equipment
CN114419413A (en) Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network
CN112818862B (en) Face tampering detection method and system based on multi-source clues and mixed attention
CN108648188B (en) No-reference image quality evaluation method based on generation countermeasure network
CN112507777A (en) Optical remote sensing image ship detection and segmentation method based on deep learning
CN111583263A (en) Point cloud segmentation method based on joint dynamic graph convolution
CN111898432B (en) Pedestrian detection system and method based on improved YOLOv3 algorithm
CN113706581B (en) Target tracking method based on residual channel attention and multi-level classification regression
CN113160062B (en) Infrared image target detection method, device, equipment and storage medium
CN114841244B (en) Target detection method based on robust sampling and mixed attention pyramid
CN113326735B (en) YOLOv 5-based multi-mode small target detection method
CN113052254B (en) Multi-attention ghost residual fusion classification model and classification method thereof
CN116091823A (en) Single-feature anchor-frame-free target detection method based on fast grouping residual error module
CN113628297A (en) COVID-19 deep learning diagnosis system based on attention mechanism and transfer learning
CN116597224A (en) Potato defect detection method based on improved YOLO V8 network model
CN115439766A (en) Unmanned aerial vehicle target detection method based on improved yolov5
CN111222534A (en) Single-shot multi-frame detector optimization method based on bidirectional feature fusion and more balanced L1 loss
CN112967296B (en) Point cloud dynamic region graph convolution method, classification method and segmentation method
Wang et al. A transformer-based mask R-CNN for tomato detection and segmentation
WO2024032010A1 (en) Transfer learning strategy-based real-time few-shot object detection method
CN115330759B (en) Method and device for calculating distance loss based on Hausdorff distance
CN111259923A (en) Multi-target detection method based on improved three-dimensional R-CNN algorithm
CN113255704B (en) Pixel difference convolution edge detection method based on local binary pattern
Wang et al. FPA-DNN: a forward propagation acceleration based deep neural network for ship detection

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Zhu Xinshan

Inventor after: Mao Wandeng

Inventor after: Zhao Jian

Inventor after: He Xiang

Inventor after: Zhang Xiaofei

Inventor after: Yuan Shaoguang

Inventor after: Geng Juncheng

Inventor after: Ma Bin

Inventor after: Wei Xiaozhao

Inventor after: Zeng Junting

Inventor after: Guo Zhimin

Inventor after: Li Bin

Inventor after: Wang Shuai

Inventor after: Qu Luyao

Inventor after: Li Yalin

Inventor after: Liu Hao

Inventor after: Tian Yangyang

Inventor before: Zhu Xinshan

Inventor before: Mao Wandeng

Inventor before: Zhao Jian

Inventor before: He Xiang

Inventor before: Zhang Xiaofei

Inventor before: Yuan Shaoguang

Inventor before: Geng Juncheng

Inventor before: Ma Bin

Inventor before: Wei Xiaozhao

Inventor before: Zeng Junting

Inventor before: Guo Zhimin

Inventor before: Li Bin

Inventor before: Wang Shuai

Inventor before: Qu Luyao

Inventor before: Li Yalin

Inventor before: Liu Hao

Inventor before: Tian Yangyang