CN114022432A

CN114022432A - Improved yolov 5-based insulator defect detection method

Info

Publication number: CN114022432A
Application number: CN202111261977.5A
Authority: CN
Inventors: 唐靓; 余明慧
Original assignee: Hubei University of Technology
Current assignee: Hubei University of Technology
Priority date: 2021-10-28
Filing date: 2021-10-28
Publication date: 2022-02-08
Anticipated expiration: 2041-10-28
Also published as: CN114022432B

Abstract

The invention relates to an insulator defect detection technology, in particular to an insulator defect detection method based on improved yolov5, which comprises the steps of collecting an insulator image to form a data set; labeling the data set by using a LabelImg labeling tool; the collected images are processedData enhancement processing; introducing a triplet attention module into a backbone network of YOLOv 5; optimizing a loss function; CIoU is taken as a bounding-box Loss function Loss of the improved YOLOv5 algorithm_CIoU(ii) a Training the improved network; and inputting the insulator image data set into a trained YOLOv5 network to obtain whether a defective insulator exists in the input image and the position of the defect. The method reduces missing detection, eliminates indirect correspondence between the channel and the weight, achieves the effect of improving the accuracy rate with smaller calculation cost, and improves the accurate positioning capability of the model prediction framework.

Description

Improved yolov 5-based insulator defect detection method

Technical Field

The invention belongs to the technical field of insulator defect detection, and particularly relates to an improved yolov 5-based insulator defect detection method.

Background

Electricity is one of the most basic elements that make the world rotate, and the transmission of high and low voltage electricity is very important for its practical application. In the transmission of high voltage electricity, insulators are used to support and separate electrical conductors from the passage of current. Low voltage distribution lines are a way of distributing power from a distribution grid to end users. An important aspect of primary power distribution systems is the continuous supply of power and the efficient performance of its equipment. The insulator string serves to insulate and provide mechanical strength in the primary overhead distribution line. In summary, insulators are indispensable devices in power systems. Generally, insulators are exposed to the harsh environment of high electric fields, as well as to various harsh weather conditions, such as burning days, typhoons or hurricanes, thunderstorms, sleet, snowstorms, and the like. The harsh environment can make the insulator easily damaged, and then threaten the safety of electric network system and the use of electric power. Such important components, once damaged, pose serious problems for both the power supply and public safety. For example: in the rainy season of each year, people get electric shock and get lost due to exposure under the telegraph pole. Therefore, it is necessary to research an effective insulator defect detection method to ensure the safety and reliability of power transmission.

The current defect detection methods can be divided into three types, namely physical methods, traditional vision-based methods and deep learning-based methods. The physical methods mainly include an ultrasonic method and an ultraviolet pulse method based on manual operation. Among vision-based insulator defect detection methods, HOG + SVM and Haar + AdaBoost are most commonly used. The traditional detection algorithm mainly uses a sliding window to select an interested area, extracts the characteristics of each window, and then classifies characteristic samples to obtain a detection result. In addition, a method for matching based on contour features and gray level similarity is used for classifying the perfect insulators and the defective insulators. Among vision-based insulator defect detection methods, HOG + SVM and Haar + AdaBoost are most commonly used. These methods generally extract image features based on accumulated experience, and are inefficient, limited in accuracy, and time consuming. With the continuous improvement of computer performance, the detection method based on the deep learning framework is widely applied. The method can effectively compensate the loss of the characteristic information in the process of extracting the characteristics of the artificial image, and improve the efficiency of fault detection. Many effective target detection algorithms have been proposed, such as fast R-CNN (fast region-based connected logical network), yolo (you only look) once, ssd (single shot multi-detector), dcnn (dynamic connected logical network), etc. Due to the fact that the thin and long shape characteristics of the insulator and different defect changes cause various and complex expression forms of defects in the image and the acquired insulator image contains a large amount of irrelevant background information, the accuracy and the speed of the existing insulator detection method are still to be improved.

Disclosure of Invention

Aiming at the problems of accuracy and speed of the insulator detection method, the invention provides the insulator defect detection method based on the improved YOLOv5 algorithm.

In order to solve the technical problems, the invention adopts the following technical scheme: the insulator defect detection method based on the improved yolov5 comprises the following steps:

step 1, collecting an insulator image forming data set;

step 2, labeling the data set by using a LabelImg labeling tool, wherein the labeled types are insulators and defects;

step 3, performing data enhancement processing on the acquired image to expand a data set;

step 4, introducing a triplet attention module into a backbone network of YOLOv 5;

step 5, optimizing a loss function; CIoU is taken as a bounding-box Loss function Loss of the improved YOLOv5 algorithm_CIoU；

CIoU is defined as:

wherein the content of the first and second substances,

for the weighting function, v is a parameter for measuring the uniformity of the aspect ratio, c represents the diagonal distance of the smallest closed region that can contain the prediction box and the real box, and ρ²(b，b^gt) Representing the Euclidean distance, h, between the center points of the prediction and real frames^gt、ω^gtRespectively representing the height and width of the real box, h^p、ω^pRespectively representing the height and width of the prediction box;

step 6, optimizing the non-maximum value inhibition NMS method for processing the final result; Soft-NMS was used as a method to process the final results:

the score function expression of Soft-NMS is:

wherein S is_iAs a prediction block B_iScore of (A), B_MFor the highest scoring prediction box, N_tIn order to be an overlap threshold value, the threshold value is,

a Gaussian penalty function, sigma is a hyperparameter selected according to experience;

step 7, training the improved network; setting a learning rate, a learning rate momentum, a batch size, a small total training round, weight attenuation and a maximum iteration number as training parameters, and training the improved YOLOv5 network;

and 8, inputting the collected insulator image data set into a trained YOLOv5 network to obtain whether a defect insulator exists in the input picture and the position of the defect.

In the insulator defect detection method based on the improved yolov5, the triplet attention module adopts three parallel branch structures, wherein two of the three branch structures are used for extracting the interdependence relationship between two spatial dimensions and the channel dimension C, and the other branch structure is used for extracting the spatial feature dependency relationship; in the first two branches, the triplet entry rotates the original input rotation tensor χ 90 ° counterclockwise along the H and W axes, respectively, and converts the shape of the tensor from cxhxc W to wxhxc C and hxc W; in the third branch, the tensor is input in the original shape C multiplied by H multiplied by W, the tensor of the C dimension is reduced to 2 dimensions through the Z-pool layer, and the average convergence feature and the maximum convergence feature in the dimension are connected;

z-pool is defined as: Z-Pool (x) [ MaxPool ]_0d(χ),AvgPpool_0d(χ)]

Wherein 0d is the 0 th dimension where the maximum pooling operation and the average pooling operation occur;

the simplified tensor passes through a standard convolution layer with the kernel size of K and a batch normalization layer, and attention weight of corresponding dimensionality generated by a Sigmoid activation function is added to the rotation tensor; at the final output, the output of the first branch is rotated 90 ° clockwise along the H axis, and the output of the second branch is rotated 90 ° clockwise along the W axis to ensure the same shape as the input; finally, the outputs of the three branches are evenly aggregated into an output;

the output tensor is defined as:

wherein σ is Sigmoid activation function, ψ₁、ψ₂、ψ₃The standard two-dimensional convolution layers defined by the kernel size K in the three branches representing the triplet attentions,

respectively representing the tensors after rotation in the first two branches of the triplet entry,

respectively representing the tensors of the three branches of the triplet attention after passing through the Z-pool layer.

In the insulator defect detection method based on the improved yolov5, the flow of processing the final detection result by using Soft-NMS instead of NMS is as follows;

1) the prediction frames are sorted first, and then the prediction frame B with the highest score is used_MMove to the Final test ListIn D, all the rest prediction frames are allocated with the same mark B_i；

2) When a certain prediction box B_iAnd B_MIs larger than a certain threshold value N_tWhen the Soft-NMS will recalculate the score S for that prediction box_iAnd compares it with a certain confidence threshold O_tComparing, when the score of the prediction box is S_iGreater than a certain confidence threshold O_tIf so, moving the prediction box into a final detection list D, otherwise, deleting the prediction box;

3) for the rest frames B_iThe above process is repeated until the initial list is empty.

In the insulator defect detection method based on the improved YOLOv5, the training of the improved YOLOv5 network comprises the following steps:

a. during network training, the data set is uniformly scaled to 640 × 640 size, and training is performed on an improved YOLOv5s network model with depth _ multiplex being 0.33 and width _ multiplex being 0.50;

b. the parameter updating mode is a random gradient descent SGD method, the initial learning rate is 0.01, the momentum term is 0.937, the weight attenuation coefficient is 0.0005, the batch size of model training is set to be 16, and the weight of the model is updated by regularization of a BN layer each time;

c. enhancement coefficients of hue H, saturation S, and brightness V are set to 0.015, 0.7, and 0.4, respectively; the total number of training rounds is set to 300;

d. after training is finished, storing the obtained weight file of the recognition model, and evaluating the performance of the model by using a test set;

e. the final output of the network identifies the location frame of the insulator and its defects and the probability of belonging to a particular category.

Compared with the prior art, the method adds the triplet attention module in the backbone network to extract the semantic dependence between different dimensions and eliminate the indirect correspondence between the channel and the weight, thereby achieving the effect of improving the accuracy rate with smaller calculation cost. And modifying the DIoU loss function into the CIoU loss function so as to improve the accurate positioning capability of the model prediction frame and enhance the convergence effect of the model. And (3) adopting Soft-NMS to replace Non-Maximum Suppression (NMS) to process the detection result, solving the problem that the shielded detection object is easy to overlook and reducing the missing detection.

Drawings

Fig. 1 is a flow chart of a modified yolov 5-based insulator defect detection method according to an embodiment of the invention;

FIG. 2 is a diagram of a modified yolov5 network architecture according to one embodiment of the present invention;

FIG. 3 is a network structure diagram of a triple event module according to an embodiment of the present invention;

FIG. 4 is a flow chart of the Soft-NMS algorithm according to one embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be described clearly and completely with reference to the following embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

It should be noted that the embodiments and features of the embodiments may be combined with each other without conflict.

The present invention is further illustrated by the following examples, which are not to be construed as limiting the invention.

In order to solve the problem that the existing insulator detection method is unbalanced in accuracy and speed, the embodiment improves the existing YOLOv5 algorithm, so that a better effect can be obtained in the aspect of insulator defect detection. The specific process of this embodiment is shown in fig. 1, and the original network is trained on the collected data set to obtain the experimental effect of the original network. Then, the data set is expanded, and the network is improved, so that the loss function, the backbone network and the result processing method of the network are mainly modified. Secondly, the improved network is trained by the expanded data set. And finally, adjusting network parameters according to the experimental result.

The embodiment is realized by the following technical scheme, and the insulator defect detection method based on the improved yolov5 comprises the following steps:

1. an insulator image dataset (containing defective insulator images and complete insulator images) is acquired.

2. And labeling the data set by using a LabelImg labeling tool, wherein the labeled types are insulators and defects.

3. Data enhancement processing is performed on the acquired images to expand the data set.

4. A triplet attention module is introduced into a backbone network of YOLOv 5.

Attention mechanisms have been widely used in recent years for various computer vision tasks, as they are able to establish interdependencies between channels or spatial locations. The attention mechanism can improve the standard convolutional layer generated feature representation by directly constructing a weighted spatial mask of the dependency or spatial attention between the channels. The purpose of learning attention weights is to allow the network the ability to learn where to focus and further on the target object. Unlike some attention approaches that require a large number of additional learning parameters, triplet attentions are proposed based on how to build a computationally small but effective attention, but while maintaining similar or providing better performance. the triple attribute establishes the dependency relationship between different dimensions through rotation operation and residual transformation, eliminates the indirect correspondence between the channel and the weight, and achieves the effect of improving the accuracy rate with smaller calculation overhead. The triplet attention module is introduced into the yolov5 backbone network.

5. The loss function is optimized.

CIoU is taken as a bounding-box Loss function Loss of the improved YOLOv5 algorithm_CIoU. The CIOU takes the distance, the overlapping rate, the scale and the punishment items between the real frame and the prediction frame into consideration, so that the regression of the prediction frame becomes more stable, the problems of divergence and the like in the training process can not occur like other loss functions, and the punishment factors take the length-width ratio of the prediction frame to the length-width ratio of the target frame into consideration, so that the model can be converged more quickly and better.

CIoU is defined as:

wherein the content of the first and second substances,

for the weighting function, v is a parameter for measuring the uniformity of the aspect ratio, c represents the diagonal distance of the smallest closed region that can contain the prediction box and the real box, and ρ²(b，b^gt) Representing the Euclidean distance, h, between the center points of the prediction and real frames^gt、ω^gtRespectively representing the height and width of the real box, h^p、ω^pRepresenting the height and width of the prediction box, respectively.

6. And optimizing a non-maximum suppression method for processing the final result.

Non-maximum suppression (NMS) is an integral part of the target detection process. First, it orders all prediction boxes according to score. Selecting the prediction box B with the largest score_MAnd inhibit and B_MAll other prediction blocks that have significant overlap (using a predefined threshold). This process is recursively applied to the remaining blocks. Depending on the design of the algorithm, if an object is within a predefined overlap threshold, it will result in missed detection. To this end, the present invention uses Soft-NMS as the method of processing the end result. The core idea of the Soft-NMS is to attenuate the detection scores of all other targets to be their sum with B_MOverlapping continuous functions. Therefore, no object is eliminated during this process, thereby reducing missed detections.

The score function expression of Soft-NMS is:

gaussian penaltyThe function, σ, is an empirically chosen hyperparameter.

7. The improved network is trained. And setting training parameters such as learning rate, learning rate momentum, batch size, small total training round, weight attenuation, maximum iteration number and the like, and training the improved YOLOv5 network.

8. And inputting the collected insulator image data set into a trained YOLOv5 network to obtain whether a defective insulator exists in the input picture and the position of the defect.

The specific implementation of the insulator defect detection method based on the improved yolov5 comprises the following steps:

firstly, acquiring an insulator image to form a data set, and labeling the data set before the data set enters network model training to obtain an xml file conforming to a Pascal VOC data format, wherein the content of the xml file comprises the image name, the image path, the height/width of the image, the position of the center point of a real frame and the width/height information. The data set is then augmented by adaptive contrast, rotation, random gray scale variation, translation, cropping, color channel normalization, Mosaic, and the like. The Mosaic is to turn over, zoom, change color gamut and the like for four pictures respectively, and place the pictures according to four directions, and then form one picture, so that the capability of the model for detecting small targets is enhanced. The data set constructed by the invention contains 1268 images in total for deep learning, model training and detection requirement meeting.

Second, as shown in fig. 2, the improved YOLOv5 network consists of a backbone network, a neck network and a detection network. The backbone Network consists of Focus, convention with Batch simulation and LeakyRelu (CBL), Mix constraint (MixConv), Cross-Stage Partial Network (CSP), Spatial gradient Pooling (SPP), and triple attribute. The neck consists of CBL, CSP, Upesampling and AFF. In order to detect the position and the category of a target, characteristics need to be extracted from an image, and positioning and classification are carried out by using a backbone network to capture the characteristics; the neck network integrates the characteristics through the initial output characteristics of the backbone network, and adapts to the size, so that the overall performance of the system structure is improved; the detection network receives three outputs of the neck network, outputs a prediction of bounding box position, object confidence, and object class for each feature mapping output layer.

The Focus module is used for carrying out slicing operation on the feature map, and can reserve more complete image down-sampling data for subsequent feature extraction; the CBL module is used for extracting feature information in the feature map after slicing; the main idea of CSP networks is to generate two paths for the input, the main path having CBL or result, and the other path performing a convolution function to merge the results of the two paths. The CSP module reduces the calculation cost while ensuring the precision; the SPP module is used for realizing the fusion of the local features and the global features, thereby improving the representation capability of the feature map. Because the shape of the insulator defect is an object in different background images, the accuracy of the insulator defect detection is improved. The visual attention mechanism is a brain signal processing mechanism specific to human vision. Human vision obtains a target area needing important attention, namely a focus of attention in general, by rapidly scanning a global image, and then puts more attention resources into the area to obtain more detailed information of the target needing attention, and suppresses other useless information. Therefore, a triplet module is added into a backbone network of the YOLOv5 network to extract semantic dependence between different dimensions, so that indirect correspondence between channels and weights is eliminated, and the effect of improving accuracy is achieved with low calculation overhead.

The AFF module of the neck network is used to fuse attention-based features from the same or cross-over layer, including short-hop connections and long-hop connections, even within itself for initial fusion.

And thirdly, taking a YOLOv5 network as a basic framework, adding a triplet attention module into the YOLOv5 network, modifying a loss function into a CIoU loss function, and adopting Soft-NMS to replace NMS as an improvement basis to build an improved insulator defect detection network model.

(1) the basic structure of the triplet entry is shown in FIG. 3. the triplet entry adopts three parallel branch structures, two of which are used for extracting the interdependence between two spatial dimensions and the channel dimension C, and the other extracting the spatial feature dependency. In the first two branches, the triplet attention rotates the original input rotated tensor χ 90 ° counterclockwise along the H and W axes, respectively, and converts the shape of the tensor from cxhxw to wxhxc and hxcxw in the third branch, the tensor is input in its original shape cxhxw, then the C dimension tensor is reduced to 2 dimensions by the Z-pool layer and the average and maximum aggregate features in that dimension are connected, which enables the layer to reduce its depth to make further calculations lighter while preserving rich representations of the actual tensor.

Z-pool is defined as: Z-Pool (x) [ MaxPool ]_0d(χ),AvgPpool_0d(χ)]

Where 0d is the 0 th dimension where the maximum pooling operation and the average pooling operation occur. For example, Z-Pool with tensor shape C H W results in 2H W.

The simplified tensor is then passed through a standard convolution layer and a batch normalization layer with a kernel size of K, and finally the attention weight of the corresponding dimension generated by the Sigmoid activation function is added to the rotation tensor. At the final output, the output of the first branch is rotated 90 ° clockwise along the H axis and the output of the second branch is rotated 90 ° clockwise along the W axis to ensure the same shape as the input. Finally, the outputs of the three branches are aggregated evenly into an output.

The output tensor is defined as:

A triplet attention module is added into a backbone network of the YOLOv5 network to extract semantic dependence among different dimensions, and indirect correspondence between channels and weights is eliminated, so that the effect of improving accuracy is achieved with low calculation cost.

(2) The loss function is optimized. IoU is cross-over ratio, which is a common index in target detection, and its main function is to determine positive and negative samples and calculate the distance between the prediction box and the real box. The definition of IoU presents a problem in itself. If the two prediction and real boxes are disjoint, IoU is 0. Meanwhile, due to 0 loss, there is no slope retreat; therefore, learning and training exercises cannot be performed. To solve these problems, the GIoU idea is proposed.

The principle of GIoU is:

however, the algorithm still has the problems of unstable regression of the target frame, easy divergence and the like in the training process. Some of the blocks of target detection do not overlap resulting in a possible regression strategy for GIoU to regress to the IoU regression strategy. Therefore, in order to directly minimize the normalized distance between the prediction frame and the real frame to achieve faster convergence speed, and make the regression more accurate and fast when overlapping with or even containing the real frame, DIoU is proposed.

The principle of DIoU is:

wherein, b^gtThe center points of the prediction box and the real box are respectively represented. However, the DIoU calculation does not consider the aspect ratio, but only considers the overlapping area of the bounding box and b^gtDistance of the center point of (a). However, the consistency of the w and h ratios between the prediction and real boxes is also highly significant. On this basis, CIoU loss is proposed. The penalty term of the CIoU is based on the penalty term of the DIoU plus influence factors alpha, v which take into account the aspect ratio of the prediction box to fit the real box.

The principle of the CIoU is as follows:

wherein the content of the first and second substances,

Therefore, in the embodiment, the DIoU loss function is modified into the CIoU loss function, so that the accurate positioning capability of the model prediction framework is improved, and the convergence effect of the model is enhanced.

(3) To further improve the detection of occluded targets by the algorithm, Non-Maximum Suppression (NMS) will also be optimized. NMS is applied to most of the most advanced detectors to obtain the final result, since it greatly reduces the number of false positives. The NMS algorithm flow is described generally as follows

Firstly, sorting all prediction frames in a list according to confidence; secondly, the prediction box B with the highest score is used_MMoving to a final detection list D, and allocating unique identifiers B to the rest of prediction boxes_i(ii) a Third, remove and B_MThe overlapping area is larger than a certain threshold value N_tAny prediction block B of_i(ii) a For the rest frames B_iThis process is repeated until the initial list is empty. Since in the NMS a certain threshold is set to decide B_MWhich boxes in the field should be retained and which should be deleted. However, if an object does exist, but with B_MHas an overlap ratio of more than N_tIts detection will be ignored.

The scoring function expression of NMS is:

the core idea of Soft-NMS is to use penalty function to attenuate and B_MInstead of setting these scores to zero, the scores of the overlapping prediction boxes. The present invention therefore uses the Soft-NMS instead of the NMS to process the final test results.

The flow chart of the Soft-NMS algorithm is shown in FIG. 4, and the prediction boxes are sorted first, and then the prediction box B with the highest score is used_MMoving to a final detection list D, and distributing the same identification B to all the rest prediction boxes_i(ii) a But when a certain prediction box B_iAnd B_MIs larger than a certain threshold value N_tInstead of deleting the prediction box, the Soft-NMS will recalculate the score S for the prediction box_iAnd compares it with a certain confidence threshold O_tComparing, when the score of the prediction box is S_iGreater than a certain confidence threshold O_tIf so, moving the prediction box into a final detection list D, otherwise, deleting the prediction box; for the rest frames B_iThe above process is repeated until the initial list is empty. It follows that the sum of B and the sum of B can be calculated using the Soft-NMS algorithm_MThe prediction box score with large overlap will be greatly reduced, while the distance B_MThe farther prediction blocks are not affected. Therefore, the missed detection rate of the network can be greatly reduced.

The score function expression of Soft-NMS is:

the gaussian penalty function, σ, is an empirically selected hyperparameter.

(4) The improved YOLOv5 network was trained. During network training, the data set is uniformly scaled to 640 × 640 size, and training is performed on an improved YOLOv5s network model with depth _ multiplex being 0.33 and width _ multiplex being 0.50. The parameter updating method is a random gradient descent (SGD) method, the initial learning rate is 0.01, the momentum term is 0.937, the weight attenuation coefficient is 0.0005, the batch size of model training is set to be 16, and the weight of the model is updated by regularization of a BN layer each time. Enhancement coefficients of hue (H), saturation (S), and brightness (V) were set to 0.015, 0.7, and 0.4, respectively. The total number of training rounds was set to 300. And after the training is finished, storing the obtained weight file of the recognition model, and evaluating the performance of the model by using the test set. The final output of the network is the location box identifying the insulator and its defects and the probability of belonging to a particular category.

While the invention has been described with reference to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention.

Claims

1. Insulator defect detection method based on modified yolov5, its characterized in that: the method comprises the following steps:

step 1, collecting an insulator image forming data set;

CIoU is defined as:

wherein the content of the first and second substances,

alpha is a weight function, v is a parameter for measuring the uniformity of the aspect ratioNumber, c represents the diagonal distance, ρ, of the smallest enclosed area that can contain the predicted box and the real box²(b，b^gt) Representing the Euclidean distance, h, between the center points of the prediction and real frames^gt、ω^gtRespectively representing the height and width of the real box, h^p、ω^pRespectively representing the height and width of the prediction box;

the score function expression of Soft-NMS is:

2. The improved yolov 5-based insulator defect detection method according to claim 1, wherein the method comprises the following steps: the triplet attention module adopts three parallel branch structures, wherein two of the three parallel branch structures are used for extracting the interdependence relation between two space dimensions and the channel dimension C, and the other branch structure is used for extracting the space characteristic interdependence relation; in the first two branches, the triplet entry rotates the original input rotation tensor χ 90 ° counterclockwise along the H and W axes, respectively, and converts the shape of the tensor from cxhxc W to wxhxc C and hxc W; in the third branch, the tensor is input in the original shape C multiplied by H multiplied by W, the tensor of the C dimension is reduced to 2 dimensions through the Z-pool layer, and the average convergence feature and the maximum convergence feature in the dimension are connected;

z-pool is defined as: Z-Pool (x) [ MaxPool ]_0d(χ),AvgPpool_0d(χ)]

the output tensor is defined as:

3. The improved yolov 5-based insulator defect detection method according to claim 1, wherein the method comprises the following steps: the flow of processing the final detection result by adopting Soft-NMS to replace NMS is as follows;

1) the prediction boxes are sorted first, and then the prediction box with the highest score is sortedB_MMoving to a final detection list D, and distributing the same identification B to all the rest prediction boxes_i；

4. The improved yolov 5-based insulator defect detection method according to claim 1, wherein the method comprises the following steps: training the improved YOLOv5 network includes the steps of: