CN111382709A - Insulator image detection method based on unmanned aerial vehicle inspection - Google Patents

Insulator image detection method based on unmanned aerial vehicle inspection Download PDF

Info

Publication number
CN111382709A
CN111382709A CN202010167880.7A CN202010167880A CN111382709A CN 111382709 A CN111382709 A CN 111382709A CN 202010167880 A CN202010167880 A CN 202010167880A CN 111382709 A CN111382709 A CN 111382709A
Authority
CN
China
Prior art keywords
insulator
image
task
training
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010167880.7A
Other languages
Chinese (zh)
Inventor
刘黎
韩睿
齐冬莲
闫云凤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
State Grid Zhejiang Electric Power Co Ltd
Electric Power Research Institute of State Grid Zhejiang Electric Power Co Ltd
Original Assignee
Zhejiang University ZJU
State Grid Zhejiang Electric Power Co Ltd
Electric Power Research Institute of State Grid Zhejiang Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU, State Grid Zhejiang Electric Power Co Ltd, Electric Power Research Institute of State Grid Zhejiang Electric Power Co Ltd filed Critical Zhejiang University ZJU
Priority to CN202010167880.7A priority Critical patent/CN111382709A/en
Publication of CN111382709A publication Critical patent/CN111382709A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0004Industrial image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30108Industrial image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30181Earth observation

Abstract

The invention relates to the technical field of image recognition and detection applied to insulators, and provides an insulator image detection method based on unmanned aerial vehicle routing inspection, which comprises the following steps: acquiring an aerial image set through shooting equipment carried by an unmanned aerial vehicle during cruising; step two: classifying the aerial image set in a manual labeling mode and forming a plurality of related task packages; step three: amplifying a plurality of related task packages in a data amplification mode, and performing parallel learning training on the insulator detection model by using the amplified data packages; step four: in the multi-task learning training, a training evaluation formula is introduced; if the result value generated by the calculation of the training evaluation formula is larger than the set threshold value, immediately stopping continuously learning the auxiliary task; step five: verifying the effectiveness in insulator detection through quantitative analysis and comparison of experimental results; the detection performance and accuracy are improved.

Description

Insulator image detection method based on unmanned aerial vehicle inspection
Technical Field
The invention relates to the technical field of image recognition and detection applied to insulators, in particular to an insulator image detection method based on unmanned aerial vehicle inspection.
Background
On transmission line patrols and examines, carry out fault detection investigation through discernment unmanned aerial vehicle aerial photograph image to the insulator, play important role in maintaining transmission system safety and stability operation. With the help of the unmanned aerial vehicle, the insulator detection can be realized based on an image recognition algorithm. Traditional image recognition algorithms, such as an insulator detection method based on skeleton feature extraction, an insulator detection method based on insulator string feature extraction, insulator detection based on image threshold segmentation and the like, are easily affected by an environmental background, have poor robustness, are easily subjected to false detection or missing detection in the detection of insulator images with large number, complex background and different angles shot by an unmanned aerial vehicle, and cannot perform accurate detection.
In recent years, the continuous progress of deep learning theory has made possible its application in image segmentation. Compared with the traditional method, the deep learning-based method can automatically learn more abstract and expressive characteristics from the image data, such as the algorithm based on the convolutional neural network and the algorithm based on the full convolutional neural network.
The characteristics of the detection algorithm can be automatically extracted in a layered manner while the whole information and the local information are considered by utilizing a Convolutional Neural Network (CNN), the calculation efficiency is improved by adopting the algorithm of the network structure, and the accuracy of insulator detection is greatly improved. However, in the process of using CNN to classify the pixels of the image shot by the drone, the size of the pixel block generated by classification is much smaller than that of the whole shot image, so that only some local features can be extracted from the generated pixel block, which greatly limits the classification performance (detection performance).
Compared with a convolutional neural network, the image segmentation method based on the full convolutional neural network (FCN) is more advanced, the dense prediction under the condition of a non-full connection layer is realized by the algorithm, and the algorithm efficiency is improved by generating a segmentation mask map with any size. However, FCN reduces the resolution of the feature map during multiple sampling, and further loses detailed information such as the target location, so that FCN is difficult to accurately locate an insulator image with the characteristics of a complex background, resulting in a final detection result with a low accuracy.
In addition, the existing method only considers insulator detection as a segmentation task or a bounding box detection task, ignores the mutual connection between the two tasks, and limits the detection performance of the model.
Therefore, an insulator image detection method with high detection performance and high accuracy is urgently needed.
Disclosure of Invention
The technical problem to be solved by the invention is to overcome the defects in the prior art, and provide an insulator image detection method based on unmanned aerial vehicle routing inspection, so that the detection performance and the accuracy are improved.
In order to achieve the purpose, the invention is realized by the following technical scheme: an insulator image detection method based on unmanned aerial vehicle routing inspection comprises the following steps: acquiring an aerial image set through shooting equipment carried by an unmanned aerial vehicle during cruising; step (ii) ofII, secondly: classifying the aerial image set in a manual labeling mode and forming a plurality of related task packages; step three: amplifying a plurality of related task packages in a data amplification mode, and performing parallel learning training on the insulator detection model by using the amplified data packages; step four: in the multitask learning training, let
Figure RE-GDA0002506082370000021
The loss function value chosen for the kth task on the validation set,
Figure RE-GDA0002506082370000022
selecting a loss function value for the kth task on the training set; if the result value generated by the formula calculation is larger than the set threshold value epsilon, immediately stopping continuously learning the auxiliary task; the training evaluation formula is as follows,
Figure RE-GDA0002506082370000023
h represents the iteration step size of the multi-task learning; med represents the median of multitask learning; wherein
Figure RE-GDA0002506082370000024
δk(. cndot.) represents the relative rate of fall of the loss generated during the k-th task training; t represents the total number of iterations performed in the current training process; t represents an equalization factor for controlling the weighting magnitude of the task weighting processing; step five: and verifying the effectiveness in insulator detection through quantitative analysis and comparison of experimental results.
The further preferable scheme of the invention is as follows: in the second step, the manual marking mode comprises 3 types, namely insulator segmentation marking, insulator surrounding frame marking and picture background attribute marking are sequentially carried out; the insulator segmentation marks are used as main tasks, and insulator surrounding frame marks and picture background attributes are used as auxiliary tasks; the training of the auxiliary task is stopped before the auxiliary task starts to over-fit the training set and suppresses the main task from converging.
The further preferable scheme of the invention is as follows: the insulator detection model is obtained by pre-training on a COCO data set.
The further preferable scheme of the invention is as follows: and in the second step, constructing a characteristic pyramid on the network structure by using the obtained data, and fusing the high-dimensional characteristic and the low-dimensional characteristic.
The further preferable scheme of the invention is as follows: and optimizing the output result by adopting a corrosion expansion algorithm.
The further preferable scheme of the invention is as follows: and secondly, before manual marking, randomly cutting all the insulator images shot by the unmanned aerial vehicle into three different sets, namely a training set, a verification set and a test set, in a data amplification mode.
The further preferable scheme of the invention is as follows: the process of constructing the feature pyramid is as follows: 1) firstly, realizing feedforward calculation of a trunk convolution neural network by adopting a bottom-up characteristic channel, and generating a characteristic diagram which has a scaling step length of 2 and is gradually reduced; 2) in the feedforward calculation process of the characteristic channel from bottom to top, the convolution layers which generate the characteristic mapping layers with the same size are classified into the same-stage layer of a characteristic pyramid; further, the output of the last layer of each stage layer is taken as the output of the corresponding stage layer; defining the output of each phase layer as { C1, C2, …, Cn }, which is scaled by a factor of 2 with respect to the input; 3) by performing 2 times of up-sampling on the features of each stage layer from top to bottom, the feature channel from top to bottom can generate the features with rough spatial information and strong image semantic information; 4) and adding feature maps with lower resolution from top to bottom and the same size in the output of the upper stage layer from bottom according to elements, thereby realizing the fusion of high-dimensional and low-dimensional features.
The further preferable scheme of the invention is as follows: the result optimization by adopting a corrosion expansion algorithm comprises the following steps:
1) firstly, corroding short false detection line segments by using a corrosion algorithm for n times to eliminate the false detection line segments, and meanwhile considering that in the process of corroding and correcting the false segmentation regions by using the corrosion algorithm, the regions which are originally correctly segmented are corroded, so that the region occupied by the insulator in each insulator image block is much larger than the region occupied by the false segmentation region segments;
2) after the n-time corrosion algorithm is adopted, the mistakenly divided region sections in the insulator image block disappear, and the correct divided regions in the insulator image block are reserved;
3) and then, adopting an expansion algorithm for n times for each insulator image block, wherein the algorithm is used for restoring all areas corroded by the corrosion algorithm, and finally obtaining an optimization result of insulator detection for removing the mistakenly-segmented area sections after the corrosion algorithm and the expansion algorithm.
The further preferable scheme of the invention is as follows: the parameters used for the results of the quantitative analysis and comparative experiments were as follows:
1) DSC (dice Similarity coefficient), wherein the Similarity coefficient is an index for evaluating an image segmentation repetition rate in the case of performing manual segmentation and automatic segmentation on an image photographed by an unmanned aerial vehicle, and the larger the DSC value is, the higher the detection accuracy is, and the formula thereof is:
Figure RE-GDA0002506082370000041
in the above formula: TP represents the number of correctly predicted insulator pixel points; FP represents the number of insulator pixel points which are wrongly predicted; FN represents the number of background pixels that were mispredicted.
2) The positive Predictive value refers to a ratio of the number of the insulator points, which is accurately divided, in the image shot by the unmanned aerial vehicle to the number of the insulator points, which is a result generated by dividing the image, wherein the larger the PPV value is, the lower the false detection rate is, and the formula is as follows:
Figure RE-GDA0002506082370000042
3) sensitivity refers to the ratio of the number of correctly divided insulator points in an image shot by an unmanned aerial vehicle to the number of true insulator points in the image, the larger the Sensitivity value is, the higher the recall rate is, and the formula is as follows:
Figure RE-GDA0002506082370000043
in conclusion, the invention has the following beneficial effects: an insulator picture is obtained by aerial photography of an unmanned aerial vehicle, a characteristic pyramid is constructed by fusing multi-dimensional characteristic information, loss of detail information such as a target position is avoided, and efficient detection of an insulator in a complex background is achieved; and a multi-task learning algorithm is introduced into the convolutional neural network, so that the generalization capability of the model is further improved, and the insulator detection precision is improved. The insulator actual image obtained by unmanned aerial vehicle aerial photography is used for carrying out experiments, and the result shows that the insulator detection precision can be improved to 95.3% by the method, and the method has high engineering application value.
Drawings
FIG. 1 is a flow chart of the present invention.
FIG. 2 is a graph showing the effect of comparative experiments.
FIG. 3 is a diagram of model iterations with different strategies.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings.
The present embodiment is only for explaining the present invention, and it is not limited to the present invention, and those skilled in the art can make modifications without inventive contribution to the present embodiment as needed after reading the present specification, but all of them are protected by patent law within the scope of the claims of the present invention.
As shown in fig. 1, the invention provides an insulator image detection method based on unmanned aerial vehicle routing inspection, which comprises the following steps.
The method comprises the following steps: through the shooting equipment that unmanned aerial vehicle carried on acquireing aerial photography image set when cruising.
Step two: and classifying the aerial image set in a manual labeling mode and forming a plurality of related task packages. Specifically, before manual labeling, all the insulator images shot by the unmanned aerial vehicle are randomly cut and divided into three different sets, namely a training set, a verification set and a test set, in a data amplification mode. The manual marking mode comprises 3 types, namely insulator segmentation marking, insulator surrounding frame marking and picture background attribute marking in sequence. And constructing a feature pyramid on the network structure of the obtained data, and fusing the high-dimensional features and the low-dimensional features. The process of constructing the feature pyramid is as follows:
1) firstly, realizing feedforward calculation of a trunk convolution neural network by adopting a bottom-up characteristic channel, and generating a characteristic diagram which has a scaling step length of 2 and is gradually reduced;
2) in the feedforward calculation process of the characteristic channel from bottom to top, the convolution layers which generate the characteristic mapping layers with the same size are classified into the same-stage layer of a characteristic pyramid; further, the output of the last layer of each stage layer is taken as the output of the corresponding stage layer; defining the output of each phase layer as { C1, C2, …, Cn }, which is scaled by a factor of 2 with respect to the input;
3) by performing 2 times of up-sampling on the features of each stage layer from top to bottom, the feature channel from top to bottom can generate the features with rough spatial information and strong image semantic information;
4) and adding feature maps with lower resolution from top to bottom and the same size in the output of the upper stage layer from bottom according to elements, thereby realizing the fusion of high-dimensional and low-dimensional features.
And (4) dividing and marking the insulator as a main task, and marking the insulator surrounding frame and the picture background attribute as auxiliary tasks. The training of the auxiliary task is stopped before the auxiliary task starts to over-fit the training set and suppresses the main task from converging.
Step three: and amplifying a plurality of related task packages in a data amplification mode, and performing parallel learning training on the insulator detection model by using the amplified data packages. The insulator detection model is obtained by pre-training on a COCO data set.
Step four: in the multitask learning training, let
Figure RE-GDA0002506082370000061
The loss function value chosen for the kth task on the validation set,
Figure RE-GDA0002506082370000062
selecting a loss function value for the kth task on the training set; if the result value generated by the formula calculation is larger than the set threshold value epsilon, the continuous learning of the item is immediately stoppedAnd (5) assisting the task.
The training evaluation formula is as follows,
Figure RE-GDA0002506082370000063
h represents the iteration step size of the multi-task learning; med represents the median of multitask learning. The first item of the above formula, which is larger than the product on the left side of the sign, represents the variation trend of the error generated in the training process, and if the error generated in the training process is rapidly reduced in the multi-task learning iteration interval with h as the step length, the numerical value of the first item of the above formula is very small, which represents that the task is still valuable, and the training can be continued; otherwise, it indicates that the task tends to be stopped. The second term in the above equation, which is larger than the left side of the number, is a generalization error compared to the training set error, and is an importance coefficient of the kth auxiliary task error, which can be learned by gradient descent, and the magnitude of the value indicates the time length of the task influence.
Wherein
Figure RE-GDA0002506082370000064
δk(. cndot.) represents the relative rate of fall of the loss generated during the k-th task training; t represents the total number of iterations performed in the current training process; t denotes an equalization factor for controlling the weight magnitude of the weighting process for the tasks. A larger T indicates a more balanced weight distribution for different tasks. When T is very large, it means that the weight assigned to each task in the weighting process is equal.
Step five: and optimizing the output result by adopting a corrosion expansion algorithm. And verifying the effectiveness in insulator detection through quantitative analysis and comparison of experimental results.
The result optimization by adopting a corrosion expansion algorithm comprises the following steps:
1) firstly, corroding short false detection line segments by using a corrosion algorithm for n times to eliminate the false detection line segments, and meanwhile considering that in the process of corroding and correcting the false segmentation regions by using the corrosion algorithm, the regions which are originally correctly segmented are corroded, so that the region occupied by the insulator in each insulator image block is much larger than the region occupied by the false segmentation region segments;
2) after the n-time corrosion algorithm is adopted, the mistakenly divided region sections in the insulator image block disappear, and the correct divided regions in the insulator image block are reserved;
3) and then, adopting an expansion algorithm for n times for each insulator image block, wherein the algorithm is used for restoring all areas corroded by the corrosion algorithm, and finally obtaining an optimization result of insulator detection for removing the mistakenly-segmented area sections after the corrosion algorithm and the expansion algorithm.
The parameters used for the results of the quantitative analysis and comparative experiments were as follows:
1) DSC (dice Similarity coefficient), wherein the Similarity coefficient is an index for evaluating an image segmentation repetition rate in the case of performing manual segmentation and automatic segmentation on an image photographed by an unmanned aerial vehicle, and the larger the DSC value is, the higher the detection accuracy is, and the formula thereof is:
Figure RE-GDA0002506082370000071
in the above formula: TP represents the number of correctly predicted insulator pixel points; FP represents the number of insulator pixel points which are wrongly predicted; FN represents the number of background pixels that were mispredicted.
2) The positive Predictive value refers to a ratio of the number of the insulator points, which is accurately divided, in the image shot by the unmanned aerial vehicle to the number of the insulator points, which is a result generated by dividing the image, wherein the larger the PPV value is, the lower the false detection rate is, and the formula is as follows:
Figure RE-GDA0002506082370000072
3) sensitivity refers to the ratio of the number of correctly divided insulator points in an image shot by an unmanned aerial vehicle to the number of true insulator points in the image, the larger the Sensitivity value is, the higher the recall rate is, and the formula is as follows:
Figure RE-GDA0002506082370000081
the specific working principle is that the used experimental data are derived from insulator images aerial photographed by an unmanned aerial vehicle, 3000 images are obtained in total, the size of each image is 3860 × 2160, and a 512 × 512 insulator image training set, a verification set and a test set are obtained by means of data amplification operation.
Before training on the insulator data set, pre-training is carried out on the COCO data set, and then the pre-trained model is placed on the insulator data set to be trained. And training the insulator detection network by adopting a random gradient descent mode, wherein the initial learning rate is 0.01, the total iteration times are 50000 times, the learning rate is reduced to 0.001 at the 20000 th time, the learning rate is reduced to 0.0001 at the 40000 times, and the whole model is trained on the NVIDIA Titan X GPU.
In order to verify the application effect of the detection network in insulator detection, the detection network is compared with a convolutional neural network-based insulator detection method and an FCN-based insulator detection method, and the results are shown in table 1.
TABLE 1 comparison of the results of the experiments with different detection methods
DSC PPV Sensitivity
Methods of the invention 0.953 0.923 0.901
Convolutional neural network 0.723 0.745 0.753
FCN 0.814 0.801 0.805
As can be seen from the experimental results in table 1, the proposed improved convolutional neural network insulator detection model is much greater than the other two classical methods in the insulator segmentation performance.
Fig. 2 is a comparison of experimental results on the test set. Fig. 2(a) - (e) show the original image, the segmentation label, the text method detection result, the convolutional neural network method detection result, and the FCN method detection result, respectively. As can be seen from fig. 2, the detection results of the convolutional neural network and the FCN method have more false detections, because in the process of down-sampling, the feeling is also increased, the position information of the insulator is lost due to the use of the aggregation context, part of the background close to the insulator in the image shot by the unmanned aerial vehicle is erroneously detected as the insulator in the detection process, and the environmental adaptability of the convolutional neural network and the FCN method is poor. Compared with a convolutional neural network and an FCN detection method, the provided insulator detection method based on the feature pyramid and the multitask learning network can effectively eliminate the interference of the similar background and accurately segment the insulator.
The proposed add-multitask and auxiliary task early stop strategy was experimentally verified. Firstly, on the premise of keeping the network structure unchanged, the addition of the auxiliary task proves that the segmentation performance of the insulator can be improved by the multitask model compared with the single-task model, and the experimental results are shown in table 2.
TABLE 2 multitask comparative test results
DSC PPV Sensitivity
Segmentation 0.884 0.871 0.874
Partition + bounding box 0.932 0.903 0.889
Segmentation + background classification 0.892 0.882 0.878
Segmentation + bounding box + background classification 0.953 0.923 0.901
As can be seen from table 2, by adding a plurality of tasks, various performance indexes can be improved, thereby improving the detection accuracy of the insulator.
The training error of the deep learning algorithm gradually decreases with the passage of time, but the error of the verification set increases again, which means that the model is over-fitted, and at this time, an early stopping strategy can be adopted to obtain the parameters which perform best on the verification set. Fig. 3(a) is a multitask training loss curve when the auxiliary task early stopping strategy is not introduced, and when the bounding box regression task and the background classification task converge, the segmentation task also enters a convergence state. FIG. 3(b) is a multitasking curve when an auxiliary task early stop strategy is introduced, after early stopping an already converged auxiliary task, the loss of the split task is reduced compared to the loss of not stopping the auxiliary task early, which illustrates that stopping the auxiliary task early is beneficial for the convergence of the main task.

Claims (9)

1. An insulator image detection method based on unmanned aerial vehicle inspection is characterized by comprising the following steps,
the method comprises the following steps: acquiring an aerial image set through shooting equipment carried by an unmanned aerial vehicle during cruising;
step two: classifying the aerial image set in a manual labeling mode and forming a plurality of related task packages;
step three: amplifying a plurality of related task packages in a data amplification mode, and performing parallel learning training on the insulator detection model by using the amplified data packages;
step four: in the multitask learning training, let
Figure RE-FDA0002506082360000011
The loss function value chosen for the kth task on the validation set,
Figure RE-FDA0002506082360000012
selecting a loss function value for the kth task on the training set; if the result value generated by the calculation of the training evaluation formula is larger than the set threshold value epsilon, immediately stopping continuously learning the auxiliary task; the training evaluation formula is as follows,
Figure RE-FDA0002506082360000013
h represents the iteration step size of the multi-task learning; med represents the median of multitask learning;
wherein
Figure RE-FDA0002506082360000014
δk(. cndot.) represents the relative rate of fall of the loss generated during the k-th task training; t represents the total number of iterations performed in the current training process; t represents an equalization factor for controlling the weighting magnitude of the task weighting processing;
step five: and verifying the effectiveness in insulator detection through quantitative analysis and comparison of experimental results.
2. The insulator image detection method according to claim 1, characterized in that: in the second step, the manual marking mode comprises 3 types, namely insulator segmentation marking, insulator surrounding frame marking and picture background attribute marking are sequentially carried out; the insulator segmentation marks are used as main tasks, and insulator surrounding frame marks and picture background attributes are used as auxiliary tasks; the training of the auxiliary task is stopped before the auxiliary task starts to over-fit the training set and suppresses the main task from converging.
3. The insulator image detection method according to claim 2, characterized in that: the insulator detection model is obtained by pre-training on a COCO data set.
4. The insulator image detection method according to claim 2, characterized in that: and in the second step, constructing a characteristic pyramid on the network structure by using the obtained data, and fusing the high-dimensional characteristic and the low-dimensional characteristic.
5. The insulator image detection method according to claim 2, characterized in that: and optimizing the output result by adopting a corrosion expansion algorithm.
6. The insulator image detection method according to claim 1, characterized in that: and secondly, before manual marking, randomly cutting all the insulator images shot by the unmanned aerial vehicle into three different sets, namely a training set, a verification set and a test set, in a data amplification mode.
7. The insulator image detection method according to claim 4, characterized in that: the process of constructing the feature pyramid is as follows:
1) firstly, realizing feedforward calculation of a trunk convolution neural network by adopting a bottom-up characteristic channel, and generating a characteristic diagram which has a scaling step length of 2 and is gradually reduced;
2) in the feedforward calculation process of the characteristic channel from bottom to top, the convolution layers which generate the characteristic mapping layers with the same size are classified into the same-stage layer of a characteristic pyramid; further, the output of the last layer of each stage layer is taken as the output of the corresponding stage layer; defining the output of each phase layer as { C1, C2, …, Cn }, which is scaled by a factor of 2 with respect to the input;
3) by performing 2 times of up-sampling on the features of each stage layer from top to bottom, the feature channel from top to bottom can generate the features with rough spatial information and strong image semantic information;
4) and adding feature maps with lower resolution from top to bottom and the same size in the output of the upper stage layer from bottom according to elements, thereby realizing the fusion of high-dimensional and low-dimensional features.
8. The insulator image detection method according to claim 5, characterized in that: the result optimization by adopting a corrosion expansion algorithm comprises the following steps:
1) firstly, corroding short false detection line segments by using a corrosion algorithm for n times to eliminate the false detection line segments, and meanwhile considering that in the process of corroding and correcting the false segmentation regions by using the corrosion algorithm, the regions which are originally correctly segmented are corroded, so that the region occupied by the insulator in each insulator image block is much larger than the region occupied by the false segmentation region segments;
2) after the n-time corrosion algorithm is adopted, the mistakenly divided region sections in the insulator image block disappear, and the correct divided regions in the insulator image block are reserved;
3) and then, adopting an expansion algorithm for n times for each insulator image block, wherein the algorithm is used for restoring all areas corroded by the corrosion algorithm, and finally obtaining an optimization result of insulator detection for removing the mistakenly-segmented area sections after the corrosion algorithm and the expansion algorithm.
9. The insulator image detection method according to claim 1, characterized in that: the parameters used for the results of the quantitative analysis and comparative experiments were as follows:
1) DSC (dice Similarity coefficient), wherein the Similarity coefficient is an index for evaluating an image segmentation repetition rate in the case of performing manual segmentation and automatic segmentation on an image photographed by an unmanned aerial vehicle, and the larger the DSC value is, the higher the detection accuracy is, and the formula thereof is:
Figure FDA0002408112910000031
in the above formula: TP represents the number of correctly predicted insulator pixel points; FP represents the number of insulator pixel points which are wrongly predicted; FN represents the number of background pixels that were mispredicted.
2) The positive Predictive value refers to a ratio of the number of the insulator points, which is accurately divided, in the image shot by the unmanned aerial vehicle to the number of the insulator points, which is a result generated by dividing the image, wherein the larger the PPV value is, the lower the false detection rate is, and the formula is as follows:
Figure FDA0002408112910000032
3) sensitivity refers to the ratio of the number of correctly divided insulator points in an image shot by an unmanned aerial vehicle to the number of true insulator points in the image, the larger the Sensitivity value is, the higher the recall rate is, and the formula is as follows:
Figure FDA0002408112910000033
CN202010167880.7A 2020-03-11 2020-03-11 Insulator image detection method based on unmanned aerial vehicle inspection Pending CN111382709A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010167880.7A CN111382709A (en) 2020-03-11 2020-03-11 Insulator image detection method based on unmanned aerial vehicle inspection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010167880.7A CN111382709A (en) 2020-03-11 2020-03-11 Insulator image detection method based on unmanned aerial vehicle inspection

Publications (1)

Publication Number Publication Date
CN111382709A true CN111382709A (en) 2020-07-07

Family

ID=71218691

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010167880.7A Pending CN111382709A (en) 2020-03-11 2020-03-11 Insulator image detection method based on unmanned aerial vehicle inspection

Country Status (1)

Country Link
CN (1) CN111382709A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113361628A (en) * 2021-06-24 2021-09-07 海南电网有限责任公司电力科学研究院 CNN insulator aging spectrum classification method under multi-task learning
CN114137635A (en) * 2021-11-25 2022-03-04 浙江啄云智能科技有限公司 Method, device and equipment for testing detection efficiency of security check machine and storage medium
WO2024000566A1 (en) * 2022-07-01 2024-01-04 Robert Bosch Gmbh Method and apparatus for auxiliary learning with joint task and data scheduling

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110827251A (en) * 2019-10-30 2020-02-21 江苏方天电力技术有限公司 Power transmission line locking pin defect detection method based on aerial image

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110827251A (en) * 2019-10-30 2020-02-21 江苏方天电力技术有限公司 Power transmission line locking pin defect detection method based on aerial image

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
沈晓海;栗泽昊;李敏;徐晓龙;张学武;: "基于多任务深度学习的铝材表面缺陷检测" *
陈景文;周鑫;张蓉;张东;: "基于U-net网络的航拍绝缘子检测" *
马必焕: "一种多任务特征选择金字塔及其在电力设备检测的应用" *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113361628A (en) * 2021-06-24 2021-09-07 海南电网有限责任公司电力科学研究院 CNN insulator aging spectrum classification method under multi-task learning
CN114137635A (en) * 2021-11-25 2022-03-04 浙江啄云智能科技有限公司 Method, device and equipment for testing detection efficiency of security check machine and storage medium
CN114137635B (en) * 2021-11-25 2023-12-26 浙江啄云智能科技有限公司 Method, device and equipment for testing detection efficiency of security inspection machine and storage medium
WO2024000566A1 (en) * 2022-07-01 2024-01-04 Robert Bosch Gmbh Method and apparatus for auxiliary learning with joint task and data scheduling

Similar Documents

Publication Publication Date Title
US11581130B2 (en) Internal thermal fault diagnosis method of oil-immersed transformer based on deep convolutional neural network and image segmentation
CN110569901B (en) Channel selection-based countermeasure elimination weak supervision target detection method
CN106778835B (en) Remote sensing image airport target identification method fusing scene information and depth features
CN111369572B (en) Weak supervision semantic segmentation method and device based on image restoration technology
EP3690740B1 (en) Method for optimizing hyperparameters of auto-labeling device which auto-labels training images for use in deep learning network to analyze images with high precision, and optimizing device using the same
US8379994B2 (en) Digital image analysis utilizing multiple human labels
CN111860171B (en) Method and system for detecting irregular-shaped target in large-scale remote sensing image
CN111382709A (en) Insulator image detection method based on unmanned aerial vehicle inspection
CN107679465A (en) A kind of pedestrian's weight identification data generation and extending method based on generation network
JP2020123330A (en) Method for acquiring sample image for label acceptance inspection from among auto-labeled images utilized for neural network learning, and sample image acquisition device utilizing the same
CN111583263A (en) Point cloud segmentation method based on joint dynamic graph convolution
CN111680655A (en) Video target detection method for aerial images of unmanned aerial vehicle
CN113920107A (en) Insulator damage detection method based on improved yolov5 algorithm
JP2020126624A (en) Method for recognizing face using multiple patch combination based on deep neural network and improving fault tolerance and fluctuation robustness
CN107784288A (en) A kind of iteration positioning formula method for detecting human face based on deep neural network
CN111753986B (en) Dynamic test method and device for deep learning model
CN113888485A (en) Magnetic core surface defect detection method based on deep learning
CN113643228A (en) Nuclear power station equipment surface defect detection method based on improved CenterNet network
CN114049305A (en) Distribution line pin defect detection method based on improved ALI and fast-RCNN
CN112464877A (en) Weak supervision target detection method and system based on self-adaptive instance classifier refinement
CN111310820A (en) Foundation meteorological cloud chart classification method based on cross validation depth CNN feature integration
JP7150918B2 (en) Automatic selection of algorithm modules for specimen inspection
CN114219763A (en) Infrared picture detection method for abnormal heating point of power distribution equipment based on fast RCNN algorithm
CN115861306B (en) Industrial product abnormality detection method based on self-supervision jigsaw module
CN117576079A (en) Industrial product surface abnormality detection method, device and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination