CN110472545B - Aerial photography power component image classification method based on knowledge transfer learning - Google Patents

Aerial photography power component image classification method based on knowledge transfer learning Download PDF

Info

Publication number
CN110472545B
CN110472545B CN201910721060.5A CN201910721060A CN110472545B CN 110472545 B CN110472545 B CN 110472545B CN 201910721060 A CN201910721060 A CN 201910721060A CN 110472545 B CN110472545 B CN 110472545B
Authority
CN
China
Prior art keywords
layer
neural network
learning
convolutional
googlenet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910721060.5A
Other languages
Chinese (zh)
Other versions
CN110472545A (en
Inventor
赵俊梅
张利平
任一峰
李晓
余永俊
白鑫
张灵菲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
North University of China
Original Assignee
North University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by North University of China filed Critical North University of China
Priority to CN201910721060.5A priority Critical patent/CN110472545B/en
Publication of CN110472545A publication Critical patent/CN110472545A/en
Application granted granted Critical
Publication of CN110472545B publication Critical patent/CN110472545B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G06F18/24155Bayesian classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/06Electricity, gas or water supply
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images

Abstract

The invention relates to the field of deep learning and machine vision combination in artificial intelligence, and discloses a knowledge transfer learning-based classification method for aerial photography power component images, which is implemented according to the following steps of establishing a convolutional neural network GoogLeNet; optimizing the convolutional neural network GoogLeNet, and replacing the last three layers of the convolutional neural network GoogLeNet with a full connection layer, a softmax layer and a classification output layer on the basis of the convolutional neural network GoogLeNet to perform optimization setting; when training a network, the network parameters are obtained by combining multiple simulation experiments and a Bayesian optimization algorithm; inputting the acquired electric power component image after normalization preprocessing into the new deep convolution neural network obtained and set in the second step for learning, and classifying according to the categories of insulators, hardware fittings, towers and the like; and carrying out simulation experiments for verification.

Description

Aerial photography power component image classification method based on knowledge transfer learning
Technical Field
The invention relates to the field of combination of deep learning and machine vision in artificial intelligence, which is applied to classification and identification of electric power components in the inspection process of an electric transmission line in an electric power system so as to ensure safe operation of the electric transmission line.
Background
The power transmission line is a vital component in a power grid system, is used as a main line of the power grid system, plays a decisive role in whether the whole power grid is reliable, long-term, safe and stable in operation, and is directly related to the healthy development of national economy in long-term and effective operation of the power grid system. Along with the implementation of transmission network construction engineering, transmission line sharply increases, and the circuit is patrolled and examined the work load and is sharply increased, and geographical environment is complicated, climatic conditions circumstances such as changeable make traditional manual work patrol and examine the operation more dangerous, and it is not high to patrol and examine efficiency and rate of accuracy. Compared with the traditional inspection mode, the helicopter inspection and unmanned aerial vehicle inspection mode has the advantages of high efficiency, flexible inspection mode, short image acquisition period, no influence of natural environment and the like, and gradually becomes the mainstream mode of power transmission line inspection. In the power patrol target, the power components are large in number and number in the power transmission network, and some components are easily damaged. In long-term operation, the power transmission system is influenced by severe weather such as strong wind, thunderstorm, ice mold, ice coating and the like, particularly, the insulator and the like are easily damaged, the normal operation of a power transmission network of a power system is further influenced, and a large-area power failure accident can be caused seriously. The aircraft inspection mode is to shoot and collect images along the power transmission line through the stable speed and the relatively fixed angle of helicopter or unmanned aerial vehicle, and can pass through various landforms such as mountains, rivers, grasslands, houses, arable land and the like in the shooting process, and also can be under various meteorological conditions (rain, snow, fog and the like), and the applicability and robustness of different backgrounds and noises formed by different landforms and different climatic environments to the detection and fault detection of the power transmission line are a great challenge. With the rapid increase of aerial photography power transmission line and power component image data, the data volume is huge, the data redundancy is also large, and higher requirements are put forward for the classification of aerial photography power component images.
In the field of machine vision, an important research direction is the extraction, classification, detection and identification of image features, and the extraction and classification identification of early image features are the features of target images manually extracted by human beings, which consumes a lot of time and workload. The development of artificial neural network technology in artificial intelligence brings new power and vitality to the field of machine vision, particularly in the field of image classification and recognition. At present, the deep learning branch in machine learning develops rapidly, and the convolutional neural network has a very good effect on image classification. By using deep learning, image features do not need to be extracted manually like traditional digital image algorithms, and the convolutional neural network has automatic learning features.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the problems that the background of an aerial electric component image is complex, the shooting angle is indefinite, the data volume is huge, and accurate classification and accurate positioning of an electric component are difficult are solved.
The technical scheme adopted by the invention is as follows: the method for classifying aerial electric power component images based on knowledge transfer learning comprises the following steps
Step one, establishing a convolutional neural network GoogLeNet;
step two, optimizing the convolutional neural network GoogLeNet, replacing the last three layers of the convolutional neural network GoogLeNet with a full connection layer, a softmax layer and a classification output layer on the basis of the convolutional neural network GoogLeNet to further obtain a new deep neural network, and then carrying out optimization setting on the basis of the original convolutional neural network GoogLeNet setting; when training a network, the network parameters are obtained by combining multiple simulation experiments and a Bayesian optimization algorithm;
step three, inputting the acquired electric power component image after normalization preprocessing into the new deep convolution neural network obtained and set in the step two for learning, and classifying according to the categories of insulators, hardware fittings, towers and the like;
and step four, carrying out simulation experiment for verification.
The convolutional neural network GoogLeNet has 144 single layers, wherein 22 functional layers with learning weight comprise 21 convolutional layers and 1 full-connection layer, the first convolutional layer adopts 64 7 × 7 convolutional kernels, the step length stride is a structure of 2, a padding [ 3333 ] structure is filled, 64-dimensional features are obtained after convolution, the feature output of the first convolutional layer is 112 × 112 × 64, the first convolutional layer is directly input into an excitation layer, a universal ReL U function is selected as the excitation function, the first convolutional layer adopts a 3 × 3-kernel pooling layer, the pooling adopts a maximum pooling max pool mode, the feature vectors are changed into 56 × 56 × 64 dimensions after the pooling, and then a norm layer is added; the second layer convolution adopts 192 convolution kernels of 3 × 3, a structure with the step size stride of 1, a structure with padding ═ 1111 ], the feature vector becomes 56 × 56 × 192, and the excitation layer and pooling layer output features have dimensions of 28 × 28 × 192.
split branch operation, using 1 × 1, 3 × 3, 5 × 5 convolution kernels, including 4 branches, using 64 convolution kernels of 1 × 1, padding ═ 0000 ], and also using ReL $ function as excitation layer, thus outputting 28 × 28 × 64 dimensional vector; compressing the feature vector to 28 × 28 × 96 dimensions by using 96 1 × 1 convolution kernels, passing through an excitation layer, and then using 128 3 × 3 convolution kernels, padding ═ 1111, to obtain a 28 × 28 × 128 dimensional vector by the excitation layer; firstly, compressing data to 28 × 28 × 16 by using 16 1 × 1 convolution kernels, performing excitation layer and then 5 × 5 convolution operation, selecting padding to [ 2222 ], and finally outputting a 28 × 28 × 32-dimensional vector; pooling at max pool, and then obtaining a 28 × 28 × 32 dimensional vector by using a 1 × 1 convolution window; and the classification output layer splices the feature vectors output by the four nodes.
The following initiation module proceeds to the next step using the same method and so on.
The parameter optimization setting comprises the steps of adopting a random gradient descent algorithm of momentum, and setting the momentum to be 0.9; the small batch algorithm is set to be 10, and is a mixed form of a random gradient descent algorithm and a batch algorithm; the maximum number of wheels is set to 3;L 2 the parameter regularization (weight decay) is set to 0.00055, and the regularization strategy makes the weight closer to the origin by adding a regularization term to the objective function; learning rate of 10 -4 (ii) a The verification frequency was 15.
The invention has the beneficial effects that: the GoogLeNet not only deepens the hierarchy of the network, but also increases the width of the network, realizes the aggregation of multi-scale features and enables the classification effect of the network to be better. The Incep module is a structure proposed for realizing the increase of the network width, and has two important innovation points, namely, the 1 x 1 convolution kernel is used for reducing the dimension of the upper-layer input, so that the complexity of calculation is reduced; and secondly, connecting a plurality of convolution layers and pooling layers with different scales in parallel, and then splicing the obtained different characteristics, thereby improving the expression capability of the characteristics.
The invention fully exerts the advantages of the convolutional neural network in deep learning, in particular to the function of automatically learning the image characteristics. Meanwhile, the GoogLeNet network structure is reasonably adjusted by using the characteristics of transfer learning, optimal training parameters are obtained through simulation experiments and Bayesian optimization algorithms, the training of a new network is completed, and finally, the images (insulators, towers and hardware) of the electric power components are classified and identified, so that a basis is provided for the state detection of the electric power components, and the safe operation of the power grid is ensured.
Drawings
FIG. 1 is an abstraction model diagram;
FIG. 2 is a schematic flow diagram of the present invention;
fig. 3 is a diagram of a training process.
Detailed Description
Through unmanned aerial vehicle transmission line of taking photo by plane, patrol and examine the circuit, when promoting the efficiency of patrolling and examining, also can gather a large amount of electric power part images. For a traditional digital image processing algorithm, for factors such as complex background, multiple types of electric power components, complex shooting environment, unfixed shooting angle, mutual adhesion among different types of electric power components and the like, the classification detection effect of the traditional digital image processing algorithm on different electric power components is poor, and the applicability is weak. The method makes full use of the advantages of the convolutional neural network in deep learning, utilizes the characteristics of transfer learning, exerts the successful experience of the classical convolutional neural network, and classifies the acquired power component images. Firstly, establishing a sample library of electric power component images, then creating a convolutional neural network based on GoogLeNet by utilizing transfer learning, designing network training parameters, carrying out network training, verifying the effectiveness of the network, and properly adjusting the network parameters according to the training and verifying results. The specific algorithm is introduced as follows:
characteristics of one, convolution neural network GoogLeNet
The champion of ILSVRC2014 is google lenet, which is a novel network structure proposed based on the idea of network-in-network. The GoogLeNet belongs to a deep network in network design, and has 144 single layers, wherein 22 functional layers with learning weight comprise 21 convolutional layers and 1 fully-connected layer. The biggest innovation point of the GoogLeNet in design is to use multi-scale convolution to perform local densification on sparse matrix operation, process features in parallel and then perform feature splicing.
As can be seen from the network configuration, the size of the original input image is first fixed to an RGB image of 224 × 224 × 3.
The first convolutional layer has a structure of 64 convolution kernels of 7 × 7, a stride of 2, and a padding of [ 3333 ], and thus a 64-dimensional feature is obtained by convolution, and therefore the feature output of the first convolutional layer is 112 × 112 × 64. After the convolution layer is directly input into an excitation layer, the excitation function selects a universal ReLU function, then a pooling layer of 3 × 3 cores is adopted for pooling, and a mode of maximum pooling (max pool) is adopted for pooling, so that the feature vector after pooling is changed into dimensions 56 × 56 × 64. Google lenet then adds a norm layer on the network design.
The second layer convolution uses 192 convolution kernels of 3 × 3, a structure with step size stride of 1, and a structure with padding ═ 1111 ] is filled, so that the feature vector becomes 56 × 56 × 192, and the excitation layer and pooling layer output features are similarly performed with 28 × 28 × 192 dimensions.
What follows is the innovation of google lenet, which performs split branch operation, and the authors propose the concept of initiation, which for the sake of computational convenience, uses 1 × 1, 3 × 3, 5 × 5 convolution kernels, which include 4 branches as shown in fig. 1:
(1) using 64 convolution kernels of 1 × 1, padding ═ 0000 ], the ReLU function is also used as the excitation layer, and thus a 28 × 28 × 64 dimensional vector is output.
(2) The feature vector is compressed into 28 × 28 × 96 dimensions by using 96 1 × 1 convolution kernels, and then passes through an excitation layer, and then 128 3 × 3 convolution kernels are used, padding is [ 1111 ], and a 28 × 28 × 128 dimensional vector is obtained by passing through the excitation layer.
(3) The data is compressed to 28 × 28 × 16 by using 16 1 × 1 convolution kernels, excitation layers are performed, then 5 × 5 convolution operation is performed, padding is selected to [ 2222 ], and finally a 28 × 28 × 32-dimensional vector is output. (4)
Pooling was performed with max pool, and then a 28 × 28 × 32 dimensional vector was obtained using a 1 × 1 convolution window. Finally, feature vectors output by the four nodes are spliced, and convolution is performed by selectively using convolution kernels, so that feature addition can be directly performed. The following initiation module proceeds in the same manner and so on.
Secondly, knowledge migration learning is carried out on the GoogLeNet network
Knowledge transfer learning is learning by training samples in other fields, and relevant knowledge is extracted from the training samples and is used for learning in the fields. Through the transfer learning, the machine can improve the ability of utilizing the previously learned knowledge, and the incremental learning is better realized. The most important feature of knowledge transfer learning is to utilize knowledge in related fields to help complete learning tasks in target fields.
The GoogLeNet model has rich weight and bias, different target sample libraries can be adjusted by transfer learning, the GoogLeNet network is low in complexity and high in accuracy, and the GoogLeNet model is very suitable for algorithms of transfer learning. Therefore, the method is based on the GoogleLeNet network for migration, and the method aims to put aside a three-way bypass and a touch bypass in the popular term of migration and can also be understood according to the method of flower transfer and wood grafting. The final purpose is to shorten the network training time and improve the network utilization rate.
And the transfer learning is to re-optimize the last three-layer fully-connected network which can be used for the GoogLeNet of 1000 types according to the type of the aerial electric component image as a new classification number, and continuously reserve the front part of the original network. And finally, the three layers are replaced by a full connection layer, a softmax layer and a classification output layer, so that a new deep neural network is obtained. The invention sets the classification number of the full connection layer as 5, the weight learning rate factor as 30 and the bias learning rate factor as 30. Next, setting new network training parameters, wherein the main parameter settings are as follows:
(1) a random gradient descent algorithm of momentum is adopted, the momentum is set to be 0.9, the algorithm calculates the error of each training data and randomly adjusts the weight. Momentum-based gradient descent algorithms aim to accelerate learning, especially dealing with high region rates, small but uniform gradients, or noisy gradients. The momentum algorithm accumulates the moving average of the previous gradient exponential decay and continues to move in that direction.
(2) The mini-batch algorithm is set to 10, which is a hybrid of the stochastic gradient descent algorithm and the batch algorithm. First, it selects a portion of the data set, and then trains this data set with a batch algorithm. Thus, it calculates a weight update value once with the selected data set, and then adjusts the neural network with the average weight.
(3) The maximum number of rounds is set to 3, and the number of rounds is the number of cycles that all training data participate in training. For the small batch algorithm, the number of times each round of training depends on the choice of the number of data points per small batch.
(4)L 2 The parameter regularization (weight decay) is set to 0.00055 and the regularization strategy makes the weights closer to the origin by adding a regularization term to the objective function. Regularization is a modification of the learning algorithm that aims to reduce generalization errors rather than training errors.
(5) Learning rate of 10 -4 The learning rate is one of the important parameters of the stochastic gradient descent algorithm. If the learning rate is too low, the learning process is slow and the learning may be stuck at a relatively high cost value. If the learning rate is too high, the learning curve will oscillate sharply, and the cost function value will usually increase significantly. The learning rate can be selected by trial and error.
(6) The verification frequency is 15, which is the number of iterations between evaluations of the verification metric, characterizing the frequency of network verification in the number of iterations.
The network parameter value of the invention is set by combining the results of multiple simulation experiments and the Bayesian optimization algorithm, so as to obtain the optimal network parameter. The bayesian optimization algorithm is a very suitable algorithm for optimizing the internal parameters of classification and regression models and can optimize indistinguishable, discontinuous and time-consuming functions for evaluation. The algorithm internally utilizes a gaussian process model of the objective function and trains the model using an objective function estimate. The Bayes optimization process utilizes the famous 'Bayes theorem':
Figure BDA0002157119630000051
wherein f represents an unknown objective function; d 1:t ={(x 1 ,y 1 ),(x 2 ,y 2 ),…,(x t ,y t ) Denotes the observed set; x is the number of t Representing a decision vector; y is t =f(x t )+ε t Representing the observed value; epsilon t Representing an observation error; p (D) 1:t If) represents the likelihood distribution of y, also called "noise" due to errors in the observed values; p (f) represents the prior probability distribution of f, its assumption of unknown objective function state; p (D) 1:t ) Represents a marginalized f-marginal likelihood distribution or "evidence" that is primarily used to optimize the hyper-parameter; p (f | D) 1:t ) A posterior probability distribution of f is expressed describing the confidence of the unknown objective function after correction of the priors by the observed dataset. Bayesian optimization is an iterative process, and an optimization framework mainly comprises a probability agent model (comprising a prior probability model and an observation model) and an acquisition function. The prior probability model is p (f). The observation model describes the mechanism of observation data generation, namely the likelihood distribution P (D) 1:t If), updating the probabilistic proxy model will result in a posterior probability distribution P (f D) that includes more data information 1:t ). The acquisition function is constructed from a posterior probability distribution by maximizingThe function is collected to select the next best evaluation point. At the same time, the effective acquisition function ensures that the selected series of evaluation points minimizes the total loss. The Bayesian optimization mainly comprises three steps: 1. selecting the best evaluation point x according to the maximized sampling function t (ii) a 2. According to the selected evaluation point x t Evaluating the value of the objective function y t =f(x t )+ε t (ii) a 3. New obtained input observed value x t ,y t Add to historical Observation set D 1:t-1 And updating the probability agent model to prepare for iteration.
The invention is based on the Bayesian algorithm to optimize the parameters, and the steps are roughly as follows:
(1) determining training set and verification set image data;
(2) selecting optimized parameters, the invention focuses on the optimal learning rate, the dynamic value of random gradient descent, and L 2 Optimizing the regularized strength parameter;
(3) setting variables of a Bayesian optimization algorithm; in order for the Bayesian optimizer to define an objective function, the objective function takes an image training set and a verification set as input, trains a convolutional neural network and returns classification errors of the verification set. And transferring the target function to a Bayesian acquisition function, wherein the acquisition function minimizes cross validation loss.
(4) And setting network structure parameters and training the network. The convolutional neural network structure is input layer → convolutional layer → batch normalization → modified linear unit → max pooling layer → convolutional layer → batch normalization → modified linear unit (ReLU) → max pooling layer → fully connected layer → softmax layer → classified layer. The main parameters of network training are as follows: the random gradient descent algorithm of momentum is adopted, the momentum is 0.95, the maximum number of rounds is 10, and the learning rate is 10 -3 ,L 2 The parameter is normalized to 0.000001.
(5) In network training, a data enhancement method is used to randomly flip the training image along the vertical axis and randomly shift the image by up to four pixels in both the horizontal and vertical directions. Data enhancement helps prevent the network from overfitting and remembering the specific details of the training images.
(6) Predicting labels of the test set, calculating test errors, and executing Bayesian optimization by minimizing classification errors on the verification set so as to obtain an optimal network and verification accuracy and finally optimize parameters.
Data set
The background of the aerial electric component image is very complex, the shooting angle and the image quality are very easily affected by the external environment, particularly severe weather and adverse geographical conditions, the electric component images are more in the condition of mutual adhesion (such as insulators and hardware fittings in towers and hardware fittings in power transmission lines), and sometimes can be shielded by objects such as large trees participating in the sky, so that great difficulty is brought to the accurate classification of the aerial electric component images. At present, no standard power component image library exists, and the power component image library is self-built. The data set adopted by the invention has 932 images, wherein 720 images of the insulator, 86 images of the tower, 55 images of the hardware (downloaded from the internet), 39 images of the lane line and 32 images of the road sign. Some lane line and road sign images are added to the database to prove the classification noise resistance and the robustness of the network. Meanwhile, a group Truth Labeler APP toolbox is used for labeling the 177 images in three categories of insulators, hardware fittings and towers. All images are preprocessed by normalization and the like, and the resolution is unified to 800 x 600. Meanwhile, the proportion is 7: 3 (training set and validation set).
Fourthly, training classification results
The hardware environment of the simulation experiment of the invention is as follows: CPU i5-7200, main frequency 2.7GHz and memory 8G, training is carried out by a single CPU, different numbers of data sets are selected to carry out simulation on a Matlab platform, and the classification results are as follows:
TABLE 1 Classification of different sample data sets
Number of sample images (vice) Run time (minutes) Accuracy of classification
262 32 96%
632 381 97.8%
932 419 97.83%
Table 1 lists the run time and classification accuracy required when randomly selecting different sample image data volumes. And (4) displaying a classification result, wherein a transfer learning method is utilized, so that huge image data volume can be avoided from being acquired, and the performance of the GoogLeNet network is exerted by utilizing a certain image data volume, so that the image classification of the aerial photography power component is achieved. Fig. 2 is a flowchart. Fig. 3 is a diagram of a training process.

Claims (3)

1. The method for classifying aerial photography power component images based on knowledge transfer learning is characterized by comprising the following steps: the method comprises the following steps
Step one, establishing a convolutional neural network GoogLeNet;
secondly, migrating the convolutional neural network GoogLeNet, replacing the last three layers of the convolutional neural network GoogLeNet with a full connection layer, a softmax layer and a classification output layer on the basis of the convolutional neural network GoogLeNet, further obtaining a new deep neural network, and then performing parameter optimization setting on the basis of the original convolutional neural network GoogLeNet setting; when the network is trained, network parameters are obtained by combining multiple simulation experiments and a Bayesian optimization algorithm, wherein split branch operation adopts 1 × 1, 3 × 3 and 5 × 5 convolution kernels and comprises 4 branches, 64 convolution kernels of 1 × 1 are used, padding = [ 0000 ] is also adopted as an excitation layer, and therefore a 28 × 28 × 64 dimensional vector is output; compressing the feature vector to 28 × 28 × 96 dimensions by using 96 1 × 1 convolution kernels, passing through an excitation layer, and then using 128 3 × 3 convolution kernels, padding = [ 1111 ], so as to obtain a 28 × 28 × 128-dimensional vector through the excitation layer; firstly, compressing data to 28 × 28 × 16 by using 16 1 × 1 convolution kernels, performing excitation layer and then 5 × 5 convolution operation, selecting padding = [ 2222 ], and finally outputting 28 × 28 × 32-dimensional vectors; pooling is carried out by max pool, and then a convolution window of 1 × 1 is used for obtaining a 28 × 28 × 32 dimensional vector; the classification output layer splices the feature vectors output by the four nodes, and then carries out the following initiation module by the same method and so on;
step three, inputting the collected electric power component images after normalization preprocessing into the new deep convolution neural network obtained and set in the step two for learning, and classifying according to the categories of insulators, hardware fittings and towers;
and step four, carrying out simulation experiment for verification.
2. The method for classifying aerial electric power component images based on knowledge transfer learning according to claim 1, wherein: the convolutional neural network GoogLeNet comprises 144 single layers, wherein 22 functional layers with learning weight comprise 21 convolutional layers and 1 full-connection layer, the first convolutional layer adopts 64 7 × 7 convolutional kernels, the step length stride is a 2 structure, a padding = [ 3333 ] structure is filled, a 64-dimensional feature is obtained after convolution, the feature output of the first convolutional layer is 112 × 112 × 64, the convolutional layer is directly input into an excitation layer after the first convolutional layer, a universal ReLU function is selected as the excitation function, the first convolutional layer adopts a 3 × 3 kernel pooling layer, the pooling adopts a maximum pooling max pool mode, the feature vector is changed into 56 × 56 × 64 dimensions after pooling, and then a norm layer is added; the second layer convolution adopts 192 convolution kernels of 3 × 3, a structure with the step size stride of 1, a structure with padding = [ 1111 ], the feature vector becomes 56 × 56 × 192, and the output features of the excitation layer and the pooling layer are 28 × 28 × 192 dimensions.
3. The method for classifying aerial electric power component images based on knowledge transfer learning according to claim 1, wherein: the parameter optimization setting comprises the steps of adopting a random gradient descent algorithm of momentum, and setting the momentum to be 0.9; the small batch algorithm is set to be 10, and is a mixed form of a random gradient descent algorithm and a batch algorithm; the maximum number of wheels is set to 3; l is 2 The parameter regularization weight attenuation is set to be 0.00055, and the regularization strategy enables the weight to be closer to an origin point by adding a regularization item to the target function; learning rate of 10 -4 (ii) a The verification frequency is 15.
CN201910721060.5A 2019-08-06 2019-08-06 Aerial photography power component image classification method based on knowledge transfer learning Active CN110472545B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910721060.5A CN110472545B (en) 2019-08-06 2019-08-06 Aerial photography power component image classification method based on knowledge transfer learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910721060.5A CN110472545B (en) 2019-08-06 2019-08-06 Aerial photography power component image classification method based on knowledge transfer learning

Publications (2)

Publication Number Publication Date
CN110472545A CN110472545A (en) 2019-11-19
CN110472545B true CN110472545B (en) 2022-09-23

Family

ID=68510679

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910721060.5A Active CN110472545B (en) 2019-08-06 2019-08-06 Aerial photography power component image classification method based on knowledge transfer learning

Country Status (1)

Country Link
CN (1) CN110472545B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110930392A (en) * 2019-11-26 2020-03-27 北京华医共享医疗科技有限公司 Method, device, equipment and storage medium for realizing medical image auxiliary diagnosis based on GoogLeNet network model
CN111027709B (en) * 2019-11-29 2021-02-12 腾讯科技(深圳)有限公司 Information recommendation method and device, server and storage medium
CN111104877B (en) * 2019-12-06 2023-04-07 中国人民解放军陆军炮兵防空兵学院 Universal momentum method and target detection and identification method based on universal momentum method
CN112949669A (en) * 2019-12-10 2021-06-11 中国科学院深圳先进技术研究院 Method for estimating Gaussian low-pass filtering parameters in digital image
CN111523401B (en) * 2020-03-31 2022-10-04 河北工业大学 Method for recognizing vehicle type
CN111611924B (en) * 2020-05-21 2022-03-25 东北林业大学 Mushroom identification method based on deep migration learning model
CN112202510B (en) * 2020-09-27 2022-09-16 广州大学 Signal detection method based on regularization stream neural network
CN113762304B (en) * 2020-11-26 2024-02-06 北京京东乾石科技有限公司 Image processing method, image processing device and electronic equipment
CN112668498B (en) * 2020-12-30 2024-02-06 西安电子科技大学 Method, system, terminal and application for identifying individual intelligent increment of aerial radiation source
CN113052388A (en) * 2021-04-01 2021-06-29 北京工商大学 Time series prediction method and device
CN113094181A (en) * 2021-05-06 2021-07-09 苏州联电能源发展有限公司 Multi-task federal learning method and device facing edge equipment
CN113537353B (en) * 2021-07-19 2022-09-20 天津工业大学 Small sample learning method for insulator image data
CN113569940B (en) * 2021-07-23 2023-06-06 中山大学 Knowledge migration and probability correction-based few-sample target detection method
CN113569737A (en) * 2021-07-28 2021-10-29 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) Notebook screen defect detection method and medium based on autonomous learning network model
CN114005002A (en) * 2021-11-26 2022-02-01 中国科学院沈阳自动化研究所 Image identification method of nuclear full-connection neural network based on nuclear operation

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018163790A1 (en) * 2017-03-06 2018-09-13 ソニー株式会社 Information processing device
CN108596213A (en) * 2018-04-03 2018-09-28 中国地质大学(武汉) A kind of Classification of hyperspectral remote sensing image method and system based on convolutional neural networks
CN108985134A (en) * 2017-06-01 2018-12-11 重庆中科云丛科技有限公司 Face In vivo detection and brush face method of commerce and system based on binocular camera
CN109214250A (en) * 2017-07-05 2019-01-15 中南大学 A kind of static gesture identification method based on multiple dimensioned convolutional neural networks
CN109344815A (en) * 2018-12-13 2019-02-15 深源恒际科技有限公司 A kind of file and picture classification method
CN109426759A (en) * 2017-08-22 2019-03-05 阿里巴巴集团控股有限公司 The method, apparatus and electronic equipment of the visualization archive of article
CN109961034A (en) * 2019-03-18 2019-07-02 西安电子科技大学 Video object detection method based on convolution gating cycle neural unit
CN110046575A (en) * 2019-04-16 2019-07-23 浙江农林大学 Based on the remote sensing images scene classification method for improving residual error network

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018163790A1 (en) * 2017-03-06 2018-09-13 ソニー株式会社 Information processing device
CN108985134A (en) * 2017-06-01 2018-12-11 重庆中科云丛科技有限公司 Face In vivo detection and brush face method of commerce and system based on binocular camera
CN109214250A (en) * 2017-07-05 2019-01-15 中南大学 A kind of static gesture identification method based on multiple dimensioned convolutional neural networks
CN109426759A (en) * 2017-08-22 2019-03-05 阿里巴巴集团控股有限公司 The method, apparatus and electronic equipment of the visualization archive of article
CN108596213A (en) * 2018-04-03 2018-09-28 中国地质大学(武汉) A kind of Classification of hyperspectral remote sensing image method and system based on convolutional neural networks
CN109344815A (en) * 2018-12-13 2019-02-15 深源恒际科技有限公司 A kind of file and picture classification method
CN109961034A (en) * 2019-03-18 2019-07-02 西安电子科技大学 Video object detection method based on convolution gating cycle neural unit
CN110046575A (en) * 2019-04-16 2019-07-23 浙江农林大学 Based on the remote sensing images scene classification method for improving residual error network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Design and Implementation of Deep Neural Network for Edge Computing";Junyang Zhang;《IEICE Transactions on Information and Systems》;20180801;第E101-D卷(第8期);第1982-1996页 *
"GoogleNet论文笔记";Eminem1147;《https://zhuanlan.zhihu.com/p/56530853》;20190301;第1-6页 *
"基于卷积神经网络的遥感图像目标检测与识别";赵旭江;《中国优秀硕士学位论文全文数据库 信息科技辑》;20180615(第06期);第I140-708页 *

Also Published As

Publication number Publication date
CN110472545A (en) 2019-11-19

Similar Documents

Publication Publication Date Title
CN110472545B (en) Aerial photography power component image classification method based on knowledge transfer learning
CN110188685B (en) Target counting method and system based on double-attention multi-scale cascade network
CN106355151B (en) A kind of three-dimensional S AR images steganalysis method based on depth confidence network
CN109508655A (en) The SAR target identification method of incomplete training set based on twin network
CN111914728B (en) Hyperspectral remote sensing image semi-supervised classification method and device and storage medium
CN108764308A (en) A kind of recognition methods again of the pedestrian based on convolution loop network
CN109614968B (en) License plate detection scene picture generation method based on multi-scale mixed image stylization
CN111860982A (en) Wind power plant short-term wind power prediction method based on VMD-FCM-GRU
CN108537264B (en) Heterogeneous image matching method based on deep learning
CN110633708A (en) Deep network significance detection method based on global model and local optimization
CN111414953B (en) Point cloud classification method and device
CN111160268A (en) Multi-angle SAR target recognition method based on multi-task learning
CN111861906A (en) Pavement crack image virtual augmentation model establishment and image virtual augmentation method
Yang et al. Ultra-lightweight CNN design based on neural architecture search and knowledge distillation: A novel method to build the automatic recognition model of space target ISAR images
CN116362325A (en) Electric power image recognition model lightweight application method based on model compression
WO2023124342A1 (en) Low-cost automatic neural architecture search method for image classification
CN111915629A (en) Super-pixel segmentation method based on boundary detection
CN113705580A (en) Hyperspectral image classification method based on deep migration learning
CN111192240B (en) Remote sensing image target detection method based on random access memory
CN115880497A (en) Wing-type icing ice shape prediction method based on combination of self-encoder and multi-layer perceptron
CN114998688A (en) Large-view-field target detection method based on YOLOv4 improved algorithm
CN114170446A (en) Temperature and brightness characteristic extraction method based on deep fusion neural network
Yao et al. Application of improved MobileNet-SSD on underwater sea cucumber detection robot
CN114202746B (en) Pavement state identification method, device, terminal equipment and storage medium
CN113657290A (en) Snail collection and fine classification recognition system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant