CN117994630A

CN117994630A - Consistency teacher learning model applied to transmission line anti-external damage monitoring system

Info

Publication number: CN117994630A
Application number: CN202410093965.3A
Authority: CN
Inventors: 王波; 马恒瑞; 马富齐; 王红霞; 罗鹏; 张嘉鑫; 王雷雄; 刘萌
Original assignee: Wuhan University WHU
Current assignee: Wuhan University WHU
Priority date: 2024-01-23
Filing date: 2024-01-23
Publication date: 2024-05-07

Abstract

The invention relates to a consistency teacher learning model applied to an anti-external damage monitoring system of a power transmission line, which selects MSCOCO and PASCAL VOC data sets as samples to carry out comprehensive experiments and sets an evaluation scheme, takes a general SSOD model as a base line, enables the teacher model to be kept as an index moving average value of a student detector, sequentially inputs unlabeled parts of the data sets into the teacher model through weak enhancement to generate pseudo marks, supervises a student network by the pseudo marks, enables the same unlabeled samples to shake strongly, simultaneously inputs a small part of labeled data into the student network to improve the performance of the student network, sets the goal of the student network to minimize loss, optimizes the problems of inconsistent distribution, inconsistent subtask and inconsistent time in the SSOD model, trains the data sets by the model, and can obtain the model with the performance superior to that of the current progressive method.

Description

Consistency teacher learning model applied to transmission line anti-external damage monitoring system

Technical Field

The invention relates to the technical field of machine vision, in particular to a consistent teacher learning model applied to an anti-external damage monitoring system of a power transmission line.

Background

With the development of science and technology and national economy, the electric power consumption is increased, the laying amount of the electric transmission lines in China is gradually increased, and accidents caused by the damage of the electric transmission lines by external force are more and more. In order to solve the accident, the manual early warning consumes more manpower and material resources, has high false alarm rate of missing report and can not accurately and timely transmit accident information; the existing automatic monitoring alarm equipment has high false alarm rate, is easily interfered by external factors, and cannot accurately convey accident information. In recent years, the domestic computer vision technology has been gradually applied to the electrical industry, and electrical equipment can be monitored in real time by using a vision system. Although the electrical equipment can acquire a large number of sample graphs for learning, the equipment has the defects of long learning time, high learning cost, large influence by factors such as environment and the like, easy occurrence of problems of sinking into local optimal solution and low recognition accuracy in monitoring. The semi-supervised algorithm for equipment vision learning has the advantages of less learning cost and higher accuracy, but has the problems of inconsistent allocation, inconsistent subtasks and inconsistent time.

Disclosure of Invention

The invention mainly solves the technical problems existing in the prior art; the device for detecting the external damage is trained by the consistent teacher learning model, so that the learning cost of the device can be greatly reduced, and the monitoring accuracy of the device is improved.

Still another object of the present invention is to solve the technical problems of the prior art; the consistency teacher learning model can optimize the existing semi-supervised learning algorithm, and solves the problems of inconsistent distribution, inconsistent subtasks and inconsistent time in the semi-supervised learning algorithm.

The technical problems of the invention are mainly solved by the following technical proposal:

the consistent teacher learning model applied to the transmission line anti-external damage monitoring system is characterized by comprising the following steps of:

Step1, a data set and evaluation setting step: for verifying performance indexes of consistency teachers, MSCOCO and PASCAL VOC datasets are selected as samples to carry out comprehensive experiments, and three evaluation schemes are set at the same time;

Step 2, SSOD frame construction: the universal SSOD paradigm is used as a base line, a teacher model is kept as an index moving average value of a student detector, the unlabeled part of the dataset in the step 1 is input into the teacher model through weak enhancement in sequence to generate a pseudo-label, the student network is supervised by the pseudo-label, the same unlabeled sample is subjected to strong jitter, at the same time, a small part of labeled data is input into the student network to improve the performance of the student network, and the goal of the student network is set to minimize loss;

Step 3, SSOD inconsistent problem optimization step: for the problem of inconsistent allocation, self-adaptive sample allocation is adopted to replace anchor point allocation based on the IOU, the matching cost between each predicted value and a given true value is calculated, and allocation is carried out according to the matching cost, so that the optimized consistency is enhanced; for the problem of inconsistent subtasks, a 3D feature symmetry module is referenced, and different feature positions are adaptively selected to complete each subtask so as to estimate the offset of the optimal regression feature points and enable the optimal regression feature points to be better aligned with the classification branches; for the problem of inconsistent time, a Gaussian mixture model is adopted to adaptively adjust a threshold value, and the strategy reduces the complicated super-parameter adjustment requirement in SSOD, so that the robust improvement is realized under different data sets and settings;

step 4, model training step: dividing the data set in the step 1 into a small data set with a mark and a large data set without a mark, inputting the small data set with the mark into the optimization model obtained in the step 3 for preliminary training, inputting the large data set without the mark, adding the sample with higher output confidence to the data set with the mark, and performing cyclic training by the method;

step 5, model test step: under the three evaluation modes described in the step 1, the performance of the optimization model obtained in the step 3 is calculated and compared with the performance of the current most advanced method.

In the foregoing consistent teacher learning model applied to the transmission line anti-external damage monitoring system, in the step 1, three evaluation schemes are set, and the evaluation schemes are respectively as follows: COCO-PARTIAL, COCO-ADDITION and VOC-PARTIAL.

In the above-mentioned consistent teacher learning model applied to the transmission line anti-external damage monitoring system, in the step 5, the performance of the model is compared with that of the current most advanced method, which is a Mean-Teacher model with RETINANET detector.

The consistency teacher learning model is characterized by comprising an optimization method for solving SSOD inconsistent problems, wherein the self-adaptive sample distribution is used for solving the inconsistent problems of distribution, the 3D feature symmetry module is used for solving the inconsistent problems of subtasks, and the Gaussian mixture model is used for solving the inconsistent problems of time.

Therefore, the invention has the following advantages: 1. the SSOD algorithm is adopted to train the transmission line anti-external-damage monitoring system, so that the learning cost is greatly reduced, and the anti-interference capability and the environmental adaptability of the system are improved; 2. the problem of inconsistency in the traditional SSOD is solved by adopting the self-adaptive sample distribution, the 3D characteristic symmetry module and the Gaussian mixture model, and the performance index of the algorithm is improved.

Drawings

FIG. 1 is a workflow diagram of a model of the present invention.

FIG. 2 is a schematic view of a model test head according to the present invention, wherein the left side is a head structure and the right side is a 3-DFAM module.

Fig. 3 is a score distribution and fitting result of two gaussian mixtures. The positive and negative distributions are indicated by orange and blue lines, respectively, and the blue dashed line indicates the final threshold.

Detailed Description

The technical scheme of the invention is further specifically described below through examples and with reference to the accompanying drawings.

Examples:

A consistency teacher learning model applied to an anti-outward-breakage monitoring system of a power transmission line comprises the following steps:

step 1, in order to verify performance indexes of consistency teachers, MSCOCO and PASCAL VOC datasets are selected as samples to carry out comprehensive experiments, and three evaluation schemes are set at the same time, wherein the evaluation schemes are respectively as follows: to ensure fair comparison, all detectors were trained on 8 GPUs, 5 images per GPU (1 marked image and 4 unmarked images), detector optimized with SGD, constant learning rate 0.01, momentum 0.9, weight decay 0.001, unmarked data weight λ _u =2, teacher model updated by EMA, momentum 0.999, RETINANET set as default detection framework, and ResNet-50-FPN as its backbone, imageNet pre-training model as initialization, the specific method of operation for the three evaluation schemes is as follows:

Step 1.1, for COCO-PARTIAL, randomly sampling 1%/2%/5%/10% of the image in the train2017 as the marker data, and taking the result of the AP _50:95 in the VAL2017 as the evaluation index;

Step 1.2, for COCO-ADDITION, using the complete train2017 as the labeled set, using the COCO official unlabeled set unlabel2017 as the unlabeled set, training the model;

Step 1.3, for VOC-partal, use the VOC2007 trainval set as labeled data, and use VOC2012 trainval as unlabeled data. The final model was validated on the VOC2007 test set using AP ₅₀ and AP _50:95;

Step 2, using a general SSOD paradigm as a base line, keeping a teacher model as an index moving average value of a student detector, sequentially inputting an unlabeled part of the data set in step 1 into the teacher model through weak enhancement to generate a pseudo-label, supervising a student network by the pseudo-label, strongly dithering the same unlabeled sample, simultaneously inputting a small part of the labeled data into the student network to improve the performance of the student network, setting a target of the student network as a minimum loss, wherein the loss formula of the student network is as follows:

Where L _s and L _u represent supervised and unsupervised (unlabeled) losses, respectively, lambda _u is the loss weight of L _u and lambda _reg is the loss weight of the regression subtask.

Step 3, for the problem of inconsistent allocation, adopting self-adaptive sample allocation to replace anchor point allocation based on the IOU, calculating the matching cost between each predicted value and a given true value, and allocating according to the matching cost, thereby enhancing the optimized consistency; for the problem of inconsistent subtasks, a 3D feature symmetry module is referenced, and different feature positions are adaptively selected to complete each subtask so as to estimate the offset of the optimal regression feature points and enable the optimal regression feature points to be better aligned with the classification branches; for the problem of inconsistent time, a Gaussian mixture model is adopted to adaptively adjust a threshold value, and the strategy reduces the complicated super-parameter adjustment requirement in SSOD, so that the robust improvement is realized under different data sets and settings; the specific operation method of the three optimization schemes is as follows:

Step 3.1, for the problem of inconsistent allocation, an adaptive sample allocation strategy is adopted, a feature point p _i is given, a predicted value pred bbox is obtained by a detector, and the cost between each gt b _i and the prediction of the feature point p _j is calculated, wherein the cost formula is as follows:

C_ij＝λ_clsC_cls+λ_regC_reg+λ_distC_dist

wherein C _cls＝L_cls(Pred(p_i)_cls,b_i)

C_reg＝L_reg(Pred(p_i)_reg,b_i)

Where C _cls and C _reg represent classification and regression costs, respectively, C _dist (i, j) represents center priors, i d (p _j,b_i) i represents the distance of the feature point p _j to the center of b _i, and λ _cls、λ_reg and λ _dist are weighting parameters by step-wise normalization in the FPN level.

After the matching costs between b _i and all the feature points p _j are sorted out, the first K feature points with lost costs are regarded as positive candidates, and since C _dist (i, j) is mainly stable distribution and lambda _dist is smaller, the matching cost is still dominated by classification and regression quality, and since distribution is carried out according to detection quality, the influence of noise in a pseudo frame on the feature point distribution is negligible;

Step 3.2, for the subtask inconsistency problem, referring to a 3D feature symmetry module, predicting the decoded bbox coordinates offset in the 3D dimension by an offset prediction module, adding an additional branch, the input of which is the cascade middle layer of cls and reg branches, the mapping of the cascade feature being f e R ^H×W×8C, wherein c=256 represents the number of channels of the middle feature, reducing the dimension to f ' er ^H×W×C/4 by convolution of 1X1, retaining f ' in each FPN level to predict the 3D offset, then interpolating f ' _l-1 and f ' _l+1 to the same size as f _l ', wherein l refers to the index of the FPN level, connecting the features of different FPN levels to output offset predictions O e R ^H×W×12, each boundary prediction feature point P (i; j; l) being able to find the respective offset, wherein i, j is the position index, the offset is in 3D space, expressed as d= (D ₁,d₂,d₃), wherein D ₁(i,j,l)＝O(i,j,3c+1),d₂(i,j,l)＝O(i,j,3c+2),d₃ (i, j) =3C, j is the two consecutive steps of the offset:

P^h(i,j,l)←P(i+d₁,j+d₂,l)

Step 3.3, for the problem of inconsistent time, a Gaussian mixture model is adopted, a probability rule is used for distinguishing a positive pseudo frame from a negative pseudo frame, and a confidence score is obtained from the mixture of Gaussian distribution, wherein the confidence score formula is as follows:

P(b|s,θ)＝w_nN_n(b,μ_n,p_n)+w_pN_p(b,μ_p,p_p)

where b represents confidence scores of the artifacts bbox, w _n、μ_n、p_n and w _p、μ_p、p_p represent weights, means and accuracy of two gaussian models, respectively, a confidence score library of size N is built for each model, the GMM is optimized in each step using a expectation maximization algorithm, the probability of each artifact bbox being positive or negative is determined by the parameters of the GMM, the threshold τ is determined by the sample with the highest positive probability, and the formula of τ is:

By fitting the GMM using the latest N samples, the threshold can be adaptively adjusted;

Step 4, dividing the data set in the step 1 into a small data set with a mark and a large data set without a mark, inputting the small data set with the mark into the optimization model obtained in the step 3 for preliminary training, inputting the large data set without the mark, adding the sample with higher output confidence into the data set with the mark, and performing cyclic training through the method;

step 5, model test step: under the three evaluation modes described in the step 1, the performance of the optimization model obtained in the step 3 is calculated, and compared with a Mean-Teacher model with RETINANET detectors, and the comparison results of the three optimization schemes are as follows:

Step 5.1, for the COCO-PARTIAL comparison result, after training the COCO 1%/2%/5%/10% label, comparing the AP _50:95 of the two models, it can be seen that the Mean-Teacher model with RETINANET detector has an AP _50:95 value of 35.5 on the COCO 10% experiment, which shows the powerful performance of SSOD algorithm, and the Constant-Teacher obtains an AP _50:95 value of 40.0, which is more than the Mean-Teacher model with RETINANET detector;

Step 5.2, comparing the Constant-Teacher model with other SSOD methods on the VOC0712 dataset for the VOC-partal comparison result, note that significant improvement was made to the Constant-Teacher model, which increased the mAP by 7.9.

And 5.3, regarding a COCO-ADDITION comparison result, taking the complete train2017 as marked data, taking the extra unlabel2017 as unmarked data, taking ResNet-50 as a main body of a Constant-Teacher model, and carrying out 360K iterative training, wherein the AP _50:95 value reaches 49.10, and compared with PseCo, namely a current most advanced semi-supervised detector, the Constant-Teacher model can reach the AP _50:95 value of 3.0 only by half training time.

Claims

1. The consistent teacher learning model applied to the transmission line anti-external damage monitoring system is characterized by comprising the following steps of:

step 5, model test step: under the three evaluation modes described in the step 1, the performance of the optimized model obtained in the step 3 is calculated and compared with the performance of the Mean-Teacher model with RETINANET detectors.

2. The consistency teacher learning model applied to the transmission line anti-external damage monitoring system according to claim 1, wherein in the step 1, three evaluation schemes are set, and the evaluation schemes are respectively: COCO-PARTIAL, COCO-ADDITION and VOC-PARTIAL.