CN107665351B - Airport detection method based on difficult sample mining - Google Patents

Airport detection method based on difficult sample mining

Info

Publication number
CN107665351B
CN107665351B (application CN201710314261.4A)
Authority
CN
China
Prior art keywords
candidate
network
region
area
airport
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710314261.4A
Other languages
Chinese (zh)
Other versions
CN107665351A (en
Inventor
张浩鹏
姜志国
蔡博文
赵丹培
谢凤英
史振威
尹继豪
罗晓燕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Original Assignee
Beihang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beihang University filed Critical Beihang University
Priority to CN201710314261.4A priority Critical patent/CN107665351B/en
Publication of CN107665351A publication Critical patent/CN107665351A/en
Application granted granted Critical
Publication of CN107665351B publication Critical patent/CN107665351B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G06F 18/214 — Pattern recognition; generating training patterns; bootstrap methods, e.g. bagging or boosting
    • G06F 18/2111 — Selection of the most significant subset of features by using evolutionary computational techniques, e.g. genetic algorithms
    • G06N 3/045 — Neural networks; combinations of networks
    • G06V 10/462 — Salient features, e.g. scale invariant feature transforms [SIFT]

Abstract

The invention provides an airport detection method based on difficult sample mining, comprising the following steps: taking the optical remote sensing image and the corresponding annotation truth value as training data; training a candidate region extraction network; training a region classification network; coupling training of candidate region extraction and the region classification network; fine-tuning the end-to-end deep convolutional network based on hard sample mining; and airport detection with the end-to-end deep convolutional neural network. A deep convolutional neural network is introduced into remote sensing image airport detection: the convolutional network extracts high-level semantic features of targets in the remote sensing image, airport candidate regions are screened with these high-level semantic features, each candidate region receives a secondary confirmation as airport or not, and the recall ratio and accuracy of airport detection in remote sensing images are improved.

Description

Airport detection method based on difficult sample mining
Technical Field
The invention relates to the technical field of digital image processing, in particular to an airport detection method based on difficult sample mining.
Background
In recent years, with the advance of remote sensing imaging technology, the volume of remote sensing data has grown explosively. For such massive remote sensing data, automatically mining the key information by machine frees people from tedious, repetitive discrimination tasks; among these tasks, airport detection has drawn wide attention for its strong applicability in both military and civilian settings.
Airport detection remains a rather challenging problem because of rotation angle, scale, illumination and other factors in remote sensing images. At present, most airport detection methods first extract airport candidate regions from the full-width remote sensing image and then extract low-level features of the candidate regions, such as the specific geometry and texture of an airport, to decide whether each candidate region is an airport. Meanwhile, to improve the speed and efficiency of airport detection in large-format remote sensing images, some methods introduce visual saliency detection into airport detection, assuming that airport areas have higher saliency in the remote sensing image, which accelerates the localization of airport candidate regions.
However, most current airport detection methods are based on low-level features (such as the scale-invariant feature transform, SIFT) or hand-designed geometric features unique to airports; because hand-designed features generalize poorly, they can hardly meet the requirements of airport detection under multi-scale conditions.
Therefore, how to provide an airport detection method based on difficult sample mining is a problem that needs to be solved urgently by those skilled in the art.
Disclosure of Invention
In view of the above, the invention provides an airport detection method based on difficult sample mining, which introduces a deep convolutional neural network into remote sensing image airport detection: the convolutional network extracts high-level semantic features of targets in the remote sensing image, these features screen airport candidate regions, each candidate region receives a secondary confirmation as airport or not, and the recall ratio and accuracy of airport detection in remote sensing images are improved.
In order to achieve the purpose, the invention adopts the following technical scheme:
a method for airport detection based on difficult sample mining, the method comprising the steps of:
taking the optical remote sensing image and the corresponding annotation truth value as training data of the optical remote sensing image;
training a candidate area extraction network;
training a regional classification network;
extracting candidate regions and performing coupling training with a region classification network;
fine tuning of an end-to-end deep convolutional network based on difficult sample mining;
airport detection of end-to-end deep convolutional neural networks.
Preferably, in the airport detection method based on difficult sample mining, the step of using the optical remote sensing image and the corresponding annotation truth value as the training data of the optical remote sensing image specifically comprises:
carrying out image annotation on the full-width optical remote sensing image to obtain an annotation truth value in the full-width optical remote sensing image;
taking 250 pixels as the step length in the annotated full-width optical remote sensing image, extracting image data of a fixed window size in a sliding manner, and recording the numbers of slides in the vertical and horizontal directions as row and col respectively; according to the annotation truth value of the full-width optical remote sensing image, recording the truth-box coordinates of airport area image data intercepted by the sliding window;
the truth-box coordinates in the sliding window relate to the annotated truth value of the full-width optical remote sensing image as:
(x′_k, y′_k) = (x_k − col×250, y_k − row×250), where (x_k, y_k) and (x′_k, y′_k) are the annotated truth coordinates in the full-width optical remote sensing image and the truth-box coordinates in the sliding window respectively, and k ∈ {1, 2} denotes the upper-left and lower-right corner points.
Preferably, in the airport detection method based on difficult sample mining, the training step of the candidate area extraction network specifically includes:
randomly generating the weight parameters of each convolutional layer in the candidate region extraction network from a zero-mean Gaussian distribution with standard deviation 0.01;
extracting high-level semantic feature information from each image in the training data;
calculating loss functions of candidate regions in the image one by one according to the high-level semantic feature information;
randomly picking N_batch from the loss functions of all candidate regions and performing back propagation to update the weight parameters of each convolutional layer in the candidate region extraction network;
when selecting candidate regions, the ratio of positive to negative samples is 1:1, and when an image contains too few positive samples, additional negative samples are added as candidate regions.
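The 1:1 sampling with negative padding described above can be sketched as follows (a minimal illustration; the batch size of 256 and the padding policy are assumptions, not values fixed by the patent):

```python
import random

def sample_candidates(pos_idx, neg_idx, n_batch=256, seed=0):
    """Pick candidate regions at a 1:1 positive/negative ratio;
    when positives are scarce, extra negatives fill the batch."""
    rng = random.Random(seed)
    n_pos = min(len(pos_idx), n_batch // 2)
    picked_pos = rng.sample(pos_idx, n_pos)
    n_neg = min(len(neg_idx), n_batch - n_pos)  # pad with negatives
    picked_neg = rng.sample(neg_idx, n_neg)
    return picked_pos + picked_neg
```

With 10 positives available out of a 256-region batch, the remaining 246 slots are filled with negatives, keeping the batch size fixed.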
Preferably, in the airport detection method based on difficult sample mining, the training step of the area classification network specifically includes:
extracting a candidate region for each image in training data according to the trained region extraction network, taking the extracted image data of the candidate region and the labeling truth value of the candidate region as the training data of the region classification network, and training the region classification network.
Preferably, in the airport detection method based on difficult sample mining, the step of training the coupling between the candidate region extraction and the region classification network specifically includes:
initializing the corresponding first five convolutional layers in the candidate area network according to the parameters of the first five convolutional layers in the area classification network, and realizing the sharing and coupling of the candidate area extraction and the area classification network.
Preferably, in the above airport detecting method based on difficult sample mining, the step of training the candidate region extraction and the region classification network by coupling further includes:
fixing the shared convolutional layer weight parameters;
fine-tuning the specific convolutional layer weight parameters except the shared convolutional layer in the candidate region extraction network;
and generating different candidate regions with the fine-tuned candidate region extraction network, inputting the generated candidate regions and their annotation truth values into the region classification network, and fine-tuning the specific fully-connected-layer weight parameters outside the shared convolutional layers in the region classification network.
Preferably, in the airport detection method based on difficult sample mining, the fine tuning step of the end-to-end deep convolutional network based on difficult sample mining specifically includes:
scoring the difficulty of distinguishing the training data with the region extraction network according to the loss function L(p_i, t_i), selecting the training data that are difficult to distinguish, and fine-tuning the region extraction network again; the loss function L(p_i, t_i) is expressed as:
L(p_i, t_i) = (1/N_cls) Σ_i L_cls(p_i, p_i^*) + λ (1/N_reg) Σ_i p_i^* L_reg(t_i, t_i^*)
L_cls(p, u) = −log p_u
L_reg(t, u) = Σ_{i∈{x,y,w,h}} smooth_L1(t_i − u_i)
smooth_L1(x) = 0.5x² if |x| < 1, and |x| − 0.5 otherwise
where i is the index of the candidate region; p_i is the probability that the i-th candidate region is a target region; p_i^* is the truth label taking values in {0, 1}, with 1 indicating a positive sample (an airport region) and 0 a negative sample (a background region); t_i is the predicted position vector of the candidate region and t_i^* its true position vector; L_cls(p, u) is the region classification loss; L_reg(t, u) is the region regression loss; N_cls and N_reg are the numbers of candidate regions used for classification and regression respectively; λ is a balance factor.
Preferably, in the airport detection method based on difficult sample mining, the fine tuning step of the end-to-end deep convolutional network based on difficult sample mining further includes:
fine-tuning the weight parameters of the fully connected layers in the region classification network according to the candidate regions obtained by the re-fine-tuned region extraction network and their annotation truth values;
scoring the difficulty of correctly classifying the candidate regions by calculating the loss function L(p_i, t_i) of each candidate region, selecting the candidate regions with the largest loss values to establish a hard sample set, and fine-tuning the region classification network again.
Preferably, in the airport detection method based on hard sample mining, after the fine tuning step of the end-to-end deep convolutional network based on hard sample mining, the method further includes:
and cascading the area extraction network and the area classification network to obtain the end-to-end convolution neural network.
Preferably, in the airport detection method based on hard sample mining, the airport detection step of the end-to-end deep convolutional neural network specifically includes:
inputting the optical remote sensing image into the end-to-end deep convolutional neural network, which outputs the upper-left and lower-right corner coordinates of the minimum bounding rectangle of the region where an airport is located, together with the estimated probability that the region is an airport area.
According to the technical scheme, compared with the prior art, the airport detection method based on difficult sample mining first trains a region extraction network with the training data, then trains a region classification network with the airport candidate regions extracted by the region extraction network and their annotation truth values; the convolutional layers of the region extraction network and the region classification network are shared, coupling the networks, and both networks are fine-tuned. The fine-tuned region extraction and region classification networks automatically mine difficult samples in the training data, and both networks are fine-tuned a second time on the automatically constructed difficult sample set, yielding an end-to-end deep convolutional neural network; this improves the network's recall ratio for airport targets and reduces the probability of false alarms and missed detections.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the prior art descriptions will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.
FIG. 1 is a schematic flow chart of an airport detection method based on difficult sample mining according to the present invention;
fig. 2 is a schematic diagram of a detection result of the airport detection method based on difficult sample mining according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention provides an airport detection method based on difficult sample mining, comprising the following steps: taking the optical remote sensing image and the corresponding annotation truth value as training data; training a candidate region extraction network; training a region classification network; coupling training of candidate region extraction and the region classification network; fine-tuning the end-to-end deep convolutional network based on hard sample mining; and airport detection with the end-to-end deep convolutional neural network. Compared with the prior art, the method first trains the region extraction network with the training data, then trains the region classification network with the airport candidate regions extracted by the region extraction network and their annotation truth values, initializes the first five convolutional layers of the region extraction network with the corresponding first five convolutional layers of the region classification network to realize network sharing and coupling, and fine-tunes both networks. Difficult samples are mined according to the loss function of the region extraction network, and the region extraction network is fine-tuned a second time on them; the fine-tuned region extraction network then produces training data for the region classification network, which is trained again, automatically mines difficult samples in the training data, and is fine-tuned a second time on the automatically constructed difficult sample set. The result is an end-to-end deep convolutional neural network with improved recall ratio and detection precision for airport targets and a reduced probability of false alarms and missed detections.
Referring to fig. 1, fig. 1 is a schematic flow chart of an airport detection method based on difficult sample mining according to the present invention. The airport detection method based on difficult sample mining specifically comprises the following steps:
step S101: taking the optical remote sensing image and the corresponding annotation truth value as training data of the optical remote sensing image;
the specific execution method comprises the following steps: the optical remote sensing image has larger capacity required by storage in a computer due to wider breadth, larger field of view, richer texture information and the like, but the optical remote sensing image is difficult to be directly used as training data for optimizing the model due to the capacity limit of internal storage equipment of the computer such as a video memory, an internal memory and the like. Therefore, in order to segment the full-width optical remote sensing image data and extract an effective part from the segmented full-width optical remote sensing image data as training data, the method mainly comprises the following two steps:
a. carrying out image annotation on the full-width optical remote sensing image: labeling the bounding rectangle of the region where the airport is located, and recording the upper-left and lower-right corner coordinates of the rectangle as (x_1, y_1) and (x_2, y_2) respectively, as the annotation truth value of the full-width optical remote sensing image;
b. setting the sliding window size to 500 × 600 pixels and, with a step length of 250 pixels, sliding sequentially across the annotated full-width optical remote sensing image from right to left and from top to bottom to extract image data of fixed window size, recording the numbers of slides in the vertical and horizontal directions as row and col respectively; according to the annotation truth value of the full-width image, if the window contains an airport area, taking the image data intercepted by the window as training data and recording the position of the truth box within the window data. The truth-box coordinates in the sliding window relate to the annotated truth value of the full-width optical remote sensing image as:
(x′_k, y′_k) = (x_k − col×250, y_k − row×250)
where (x_k, y_k) and (x′_k, y′_k) are the annotated truth coordinates in the full-width optical remote sensing image and the truth-box coordinates in the sliding window respectively, and k ∈ {1, 2} denotes the upper-left and lower-right corner points.
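The coordinate mapping above can be written directly as a small helper (a minimal sketch; the 250-pixel step is the value given in the text, the function name is illustrative):

```python
def window_truth_box(box, row, col, step=250):
    """Map a truth box (x1, y1, x2, y2) from full-image coordinates
    into the sliding window at vertical index `row`, horizontal index `col`:
    (x'_k, y'_k) = (x_k - col*step, y_k - row*step)."""
    x1, y1, x2, y2 = box
    return (x1 - col * step, y1 - row * step,
            x2 - col * step, y2 - row * step)
```

For example, a box (600, 900)–(800, 1100) seen through the window at row=2, col=1 maps to (350, 400)–(550, 600).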
Step S102: training the candidate area extraction network by using the training data to obtain an initialized area extraction network, and extracting the airport candidate area in the training data;
the specific execution method comprises the following steps: randomly generating weight parameters of each convolution layer in a candidate region extraction network by using Gaussian distribution with zero mean standard deviation of 0.01, extracting high-level semantic features from each image in training data, and calculating loss functions of candidate regions in the image one by one based on the semantic features; randomly picking N out of the loss of all candidate regions batch The counter-propagation is used to update the weight parameters of each convolutional layer in the network. When the candidate area is selected, the proportion of positive samples to negative samples is 1:1, and when the number of positive samples in the image is insufficient, the negative samples are added to ensure that the candidate area with a fixed number is selected. Loss function L (p) used in candidate area extraction network i ,t i ) Sorting the loss function L by region cls (p, u) and the local regression loss function L reg (t, u) two-part groupBecomes, defined as follows:
Figure BDA0001287987580000071
L cls (p,u)=-logp u
Figure BDA0001287987580000072
Figure BDA0001287987580000073
wherein i represents the index number of the candidate region, p i Indicating the probability that the ith candidate region is the target region, the truth label p i * Whether the candidate area is a target area or not is represented, the value is {0,1}, 1 represents that the candidate area is a positive sample, namely an airport area, and otherwise, the value is 0, and represents that the candidate area is a negative sample, namely a background area; t is t i A predictor vector representing the position of the candidate region, t i * A vector representing a true position of the candidate region; l is a radical of an alcohol cls (p, u) represents a region classification loss function; l is a radical of an alcohol reg (t, u) represents a regional regression loss function; n is a radical of cls And N reg Respectively representing the number of candidate areas to be classified and the number of regression candidate areas; λ is a balance factor;
for the regression loss function L reg In (t, u), x, y, w, h respectively represent the x-axis coordinate value and the y-axis coordinate value of the center of the candidate region true value frame, and the width and the height of the candidate region true value frame.
The weight parameters of each layer in the candidate region extraction network are iteratively optimized with a stochastic gradient descent algorithm. The update is
V_{t+1} = μV_t − α∇L(W_t)
W_{t+1} = W_t + V_{t+1}
where W_t denotes the weight parameters of each convolutional layer of the network at the t-th iterative update, V_t the weight update variable at the t-th iteration, μ the momentum coefficient, and α the learning rate.
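The momentum update can be sketched for a single weight vector (a minimal illustration; the μ and α values below are assumptions, not values specified by the patent):

```python
def momentum_step(w, v, grad, alpha=0.001, mu=0.9):
    """One update: V_{t+1} = mu*V_t - alpha*grad(L(W_t));
    W_{t+1} = W_t + V_{t+1}."""
    v_next = [mu * vi - alpha * gi for vi, gi in zip(v, grad)]
    w_next = [wi + vi for wi, vi in zip(w, v_next)]
    return w_next, v_next
```

Starting from w = 1.0 with zero velocity, a gradient of 2.0 at α = 0.1 gives V = −0.2 and W = 0.8.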
Step S103: training a region classification network by using an airport candidate region and a mark truth value, and sharing a convolution layer of the network with a region extraction network;
the specific execution method comprises the following steps: extracting a candidate region from each image in the training data by using the region extraction network trained in the step S102, marking truth values of the extracted candidate region and the candidate region as the training data of the region classification network, training the region classification network, and based on a loss function consisting of a region classification loss function and a region regression loss function, adopting a random gradient descent algorithm to optimize and solve weight parameters of each layer in the region classification network, wherein the specific solving mode is the same as that in the step S102;
when the training of the regional classification network reaches the maximum iteration times or the iteration cutoff condition, initializing the corresponding first five convolutional layers in the candidate regional network by using the parameters of the first five convolutional layers in the regional classification network, realizing the sharing and coupling of the two networks, and combining the candidate region extraction and the regional classification network into a uniform end-to-end deep convolutional neural network.
Step S104: fixing the convolution layer shared in the network, and finely adjusting the area extraction network and the area classification network;
the specific execution method comprises the following steps: under the condition that convolutional layer parameters shared by the area extraction network and the area classification network are kept unchanged, firstly, the weight parameters of the special convolutional layers except the shared convolutional layers in the candidate area extraction network are finely adjusted, then different candidate areas are generated by using the finely adjusted area extraction network, the generated candidate areas and the truth value labels of the candidate areas are input into the area classification network, the weight parameters of the special full-connection layers except the shared convolutional layers in the area classification network are finely adjusted, and the end-to-end deep convolutional network is obtained.
Step S105: the difficulty degree of distinguishing the training data is scored by utilizing the regional extraction network, and the data which are difficult to distinguish are selected for retraining the regional extraction network;
the specific execution method comprises the following steps: in the invention, the hard sample mining process is introduced into the coupling training of the area extraction network and the area classification network, and the hard sample mining is mainly carried out aiming at two parts of candidate area extraction and area classification.
Firstly, the region extraction network obtained in the previous coupling training scores the difficulty of extracting airport candidate regions from the training data, with the loss function L(p_i, t_i) as the score; the larger the loss value, the harder it is to extract the airport candidate regions in the image:
L(p_i, t_i) = (1/N_cls) Σ_i L_cls(p_i, p_i^*) + λ (1/N_reg) Σ_i p_i^* L_reg(t_i, t_i^*)
L_cls(p, u) = −log p_u
L_reg(t, u) = Σ_{i∈{x,y,w,h}} smooth_L1(t_i − u_i)
smooth_L1(x) = 0.5x² if |x| < 1, and |x| − 0.5 otherwise
where the parameters take the same values as in step S102.
As the formula shows, the loss function L(p_i, t_i) mainly reflects how accurately the real airport region in the input image is extracted and located. The hard sample mining process can therefore screen the training data for images in which the real airport area is relatively difficult to locate and extract, and the candidate region extraction network is fine-tuned a second time on this batch of hard sample data to obtain the final region extraction network.
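This screening can be sketched as ranking training images by their region-extraction loss and keeping the hardest ones (the kept fraction is an assumed hyperparameter; the patent does not fix it):

```python
def mine_hard_images(image_losses, keep_frac=0.3):
    """Rank images by their loss L(p_i, t_i) and keep the hardest
    fraction for a second round of fine-tuning."""
    ranked = sorted(image_losses.items(), key=lambda kv: kv[1], reverse=True)
    k = max(1, int(len(ranked) * keep_frac))
    return [name for name, _ in ranked[:k]]
```

With per-image losses {a: 0.1, b: 0.9, c: 0.5} and keep_frac = 0.7, the two hardest images b and c are kept.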
Step S106: taking the candidate regions obtained by the region extraction network and their annotation truth values, selecting candidate regions that are difficult to classify correctly as hard samples via the region classification network, and fine-tuning the region classification network;
the specific execution method comprises the following steps: the method comprises the following steps that a difficult sample mining process of a regional classification network is divided into two steps, firstly, a candidate region extraction network obtained based on difficult sample mining training is used for extracting airport candidate regions in all training data, then all the candidate regions are used as the training data for fine adjustment of weight parameters of a full connection layer in the regional classification network, and any parameter of a shared convolutional layer is not changed in the fine adjustment process; then inputting the airport candidate regions into the fine-tuned region classification network again, and calculating the loss function L (p) of each candidate region by utilizing the forward propagation process of the network i ,t i ) And scoring the ease of correct classification of all candidate regions based on this loss function. Selecting the front with the largest loss value
Figure BDA0001287987580000092
And establishing a difficult sample set in each candidate area, and performing secondary fine adjustment on the classification network on the difficult sample set, so that the recall ratio of the network in airport detection is improved, and the probability of missed detection and false detection is reduced.
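The classifier's hard sample set is the candidate regions with the largest loss values; a minimal sketch (the count `n_hard` is an assumption, since the patent leaves the selection size as a formula):

```python
def hard_region_set(region_losses, n_hard=64):
    """Return indices of the n_hard candidate regions with the
    largest loss, i.e. the regions hardest to classify correctly."""
    order = sorted(range(len(region_losses)),
                   key=lambda i: region_losses[i], reverse=True)
    return order[:n_hard]
```

For losses [0.2, 0.9, 0.1, 0.7] and n_hard = 2, the hard set is the regions at indices 1 and 3.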
Step S107: and cascading the area extraction network and the area classification network to obtain an end-to-end convolutional neural network finally used for airport detection.
Step S108: detection of airports in optical remote sensing images through the end-to-end convolutional neural network.
The specific execution method is as follows: input the optical remote sensing image into the end-to-end deep convolutional neural network, which outputs the upper-left and lower-right corner coordinates of the minimum bounding rectangle of the region where an airport is located, together with the estimated probability that the region is an airport area.
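Detection over a full image can be sketched as running the network per sliding window and mapping boxes back to full-image coordinates (the `model` callable and the 0.5 score threshold are assumptions for illustration, not part of the patent):

```python
def detect_airports(windows, model, step=250, score_thresh=0.5):
    """`windows` is a list of ((row, col), window_image) pairs; `model`
    returns (box, airport_probability) pairs for one window. Kept boxes
    are mapped back into full-image coordinates by undoing the
    sliding-window offset."""
    detections = []
    for (row, col), win in windows:
        for (x1, y1, x2, y2), score in model(win):
            if score >= score_thresh:
                detections.append(((x1 + col * step, y1 + row * step,
                                    x2 + col * step, y2 + row * step), score))
    return detections
```

A stub model returning a 0.9-confidence box (10, 10)–(50, 50) in the window at row=1, col=2 yields a full-image detection at (510, 260)–(550, 300); the 0.2-confidence box is filtered out.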
Referring to fig. 2, fig. 2 is a schematic diagram illustrating a detection result of the airport detection method based on difficult sample mining according to the present invention.
In the figure, the first row shows the airport detection results of the network without hard sample mining, and the second row shows the results of the end-to-end deep convolutional neural network after hard sample mining and fine-tuning on the hard sample data set; blue boxes mark false alarm regions, red boxes mark correct detections, and green boxes mark ground-truth airport regions. The average detection precision of the end-to-end deep convolutional neural network is 79.29% without hard-sample-mining training and 83.02% with it. The detection experiments demonstrate that mining hard samples and fine-tuning the end-to-end deep convolutional neural network on the hard sample data set effectively improves detection precision and reduces the probability of false alarms and missed detections.
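The average-precision figures quoted above follow the standard ranked-detection evaluation: detections are sorted by score, each is marked true or false positive against the ground-truth boxes, and precision is averaged over recall. The patent does not specify its exact AP protocol, so the sketch below uses a minimal interpolation-free variant with illustrative names:

```python
def average_precision(detections, num_gt):
    """Minimal AP: average of the precision values at each true positive.

    detections: list of (score, is_true_positive) pairs;
    num_gt: number of ground-truth airport regions.
    """
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)
    tp = 0
    precisions = []
    for rank, (_, is_tp) in enumerate(ranked, start=1):
        if is_tp:
            tp += 1
            precisions.append(tp / rank)  # precision at this recall point
    return sum(precisions) / num_gt if num_gt else 0.0

# Toy example: 3 detections, 2 ground-truth airports.
ap = average_precision([(0.9, True), (0.8, False), (0.7, True)], num_gt=2)
# ap == 5/6 ≈ 0.8333
```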
The embodiments in this description are described in a progressive manner; each embodiment focuses on its differences from the others, and the same or similar parts among the embodiments can be referred to one another. Since the device disclosed in an embodiment corresponds to the method disclosed in that embodiment, its description is brief, and the relevant points can be found in the description of the method.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (8)

1. An airport detection method based on difficult sample mining, which is characterized by comprising the following steps:
taking the optical remote sensing image and the corresponding annotation truth value as training data of the optical remote sensing image;
training a candidate area extraction network;
training a regional classification network;
the coupled training of the candidate region extraction network and the region classification network comprises the following steps: initializing the corresponding first five convolutional layers in the candidate region extraction network with the parameters of the first five convolutional layers of the region classification network, thereby realizing the sharing and coupling of the candidate region extraction network and the region classification network;
fixing the shared convolutional layer weight parameters;
fine-tuning the specific convolutional layer weight parameters except the shared convolutional layer in the candidate region extraction network;
generating different candidate regions with the fine-tuned candidate region extraction network, inputting the generated candidate regions and their annotation truth values into the region classification network, and fine-tuning the weight parameters of the specific fully connected layers, other than the shared convolutional layers, in the region classification network;
fine-tuning the end-to-end deep convolutional network based on hard sample mining: fine-tuning the weight parameters of the fully connected layers in the region classification network, using as training data the candidate regions newly obtained from the fine-tuned region extraction network together with their annotation truth values; scoring the difficulty of correctly classifying the candidate regions by calculating a loss function for each candidate region, establishing a hard sample set from the candidate regions with the largest loss values, and fine-tuning the region classification network again;
performing airport detection with the end-to-end deep convolutional neural network.
2. The airport detection method based on difficult sample mining of claim 1, wherein the step of using the optical remote sensing image and the corresponding annotation truth value as the training data of the optical remote sensing image specifically comprises:
carrying out image annotation on the full-width optical remote sensing image to obtain an annotation truth value in the full-width optical remote sensing image;
sliding a window of fixed size over the annotated full-width optical remote sensing image with a step length of 250 pixels to extract image data, recording the numbers of sliding steps in the vertical and horizontal directions as row and col respectively; and, according to the annotation truth values of the full-width optical remote sensing image, recording the truth-frame coordinates of the airport region image data intercepted by the sliding window;
the relational expression between the truth-frame coordinates in the sliding window and the annotated truth values in the full-width optical remote sensing image is:
(x'_k, y'_k) = (x_k − col×250, y_k − row×250),
wherein (x_k, y_k) and (x'_k, y'_k) are respectively the annotated truth coordinates in the full-width optical remote sensing image and the truth-frame coordinates in the sliding window, and k ∈ {1, 2} denotes the upper-left and lower-right corner points of the box.
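The coordinate mapping in the expression above can be sketched directly. The 250-pixel step comes from the claim; the function name and the example box are illustrative only:

```python
STEP = 250  # sliding-window step length in pixels, per claim 2

def window_coords(full_coords, row, col, step=STEP):
    """Map a truth box from full-image coordinates to window-local
    coordinates: (x'_k, y'_k) = (x_k - col*step, y_k - row*step)."""
    return [(x - col * step, y - row * step) for (x, y) in full_coords]

# Hypothetical box (upper-left, lower-right corners) landing in the
# window at sliding position row=3, col=5.
box = [(1300, 780), (1650, 990)]
local = window_coords(box, row=3, col=5)
# local == [(50, 30), (400, 240)]
```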
3. The airport detection method based on difficult sample mining of claim 1, wherein the training step of the candidate area extraction network specifically comprises:
randomly generating the weight parameters of each convolutional layer in the candidate region extraction network from a zero-mean Gaussian distribution with standard deviation 0.01;
extracting high-level semantic feature information from each image in the training data;
calculating loss functions of candidate areas in the image one by one according to the high-level semantic feature information;
randomly picking N_batch of the loss functions of all candidate regions and performing back propagation to update the weight parameters of each convolutional layer in the candidate region extraction network;
when selecting candidate regions, the ratio of positive samples to negative samples is 1:1, and when the number of positive samples in an image is insufficient, additional negative samples are added as candidate regions.
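The 1:1 positive:negative sampling rule with negative-sample padding described above might be sketched as follows. All names are illustrative and the fixed seed is only for reproducibility; the patent does not prescribe this implementation:

```python
import random

def sample_minibatch(pos_idx, neg_idx, n_batch, seed=0):
    """Sample candidate-region indices at a 1:1 positive:negative ratio.

    When fewer than n_batch//2 positives exist, the shortfall is
    filled with extra negatives, as described in claim 3.
    """
    rng = random.Random(seed)
    half = n_batch // 2
    pos = rng.sample(pos_idx, min(half, len(pos_idx)))
    neg = rng.sample(neg_idx, n_batch - len(pos))  # pad with negatives
    return pos, neg

# With only 3 positives available for a batch of 8, the batch becomes
# 3 positives + 5 negatives.
pos, neg = sample_minibatch(list(range(3)), list(range(100, 200)), 8)
```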
4. The airport detection method based on difficult sample mining of claim 1, wherein the training step of the area classification network specifically comprises:
extracting a candidate region for each image in training data according to the trained region extraction network, taking the image data of the extracted candidate region and the labeling truth value of the candidate region as the training data of the region classification network, and training the region classification network.
5. The airport detection method based on difficult sample mining of claim 1, wherein the fine tuning step of the end-to-end deep convolutional network based on difficult sample mining specifically comprises:
scoring, with the region extraction network, how difficult the training data are to distinguish, the scoring being performed according to a loss function L({p_i}, {t_i}); selecting the training data that are difficult to distinguish, and fine-tuning the region extraction network again; the loss function L({p_i}, {t_i}) is expressed as:
L({p_i}, {t_i}) = (1/N_cls) Σ_i L_cls(p_i, p_i*) + λ (1/N_reg) Σ_i p_i* L_reg(t_i, t_i*),
L_cls(p, u) = −log p_u,
L_reg(t_i, t_i*) = Σ_j smooth_L1(t_i,j − t_i,j*),
smooth_L1(x) = 0.5x² if |x| < 1, and |x| − 0.5 otherwise,
wherein i denotes the index of a candidate region; p_i denotes the probability that the i-th candidate region is a target region; p_i* is the truth label taking values in {0, 1}, where 1 indicates that the candidate region is a positive sample, i.e. an airport region, and 0 indicates that it is a negative sample, i.e. a background region; t_i is the predicted position vector of the candidate region and t_i* is its true position vector; L_cls(p, u) denotes the region classification loss function; L_reg(t, u) denotes the region regression loss function; N_cls and N_reg denote respectively the number of candidate regions to be classified and the number of candidate regions for regression; and λ is a balance factor.
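As a rough illustration of the multi-task loss defined above, the sketch below implements the binary classification term and the smooth-L1 regression term. Normalizing the regression term by the number of positive candidates is a simplifying assumption, and all function and variable names are illustrative:

```python
import math

def smooth_l1(x):
    """smooth_L1(x) = 0.5 x^2 for |x| < 1, |x| - 0.5 otherwise."""
    return 0.5 * x * x if abs(x) < 1 else abs(x) - 0.5

def multitask_loss(probs, labels, t_pred, t_true, lam=1.0):
    """L = (1/N_cls) Σ L_cls + λ (1/N_reg) Σ p_i* L_reg.

    probs:  per-candidate probability p_i of being an airport region
    labels: truth labels p_i* in {0, 1}
    t_pred / t_true: per-candidate 4-d position vectors t_i / t_i*
    """
    n_cls = len(probs)
    # L_cls(p, u) = -log p_u for the binary airport/background case.
    cls = sum(-math.log(p if u == 1 else 1.0 - p)
              for p, u in zip(probs, labels)) / n_cls
    # Regression term only over positive candidates (p_i* = 1).
    pos = [i for i, u in enumerate(labels) if u == 1]
    n_reg = max(len(pos), 1)  # simplifying normalization assumption
    reg = sum(smooth_l1(a - b)
              for i in pos
              for a, b in zip(t_pred[i], t_true[i])) / n_reg
    return cls + lam * reg
```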
6. The airport detection method based on difficult sample mining of claim 5, wherein the fine tuning step of the end-to-end deep convolutional network based on difficult sample mining further comprises:
fine-tuning the weight parameters of the full-connection layer in the regional classification network according to the candidate regions obtained by the regional extraction network after fine-tuning and the labeling truth values of the candidate regions as training data;
scoring the difficulty of correctly classifying the candidate regions by calculating the loss function L(p_i, t_i) of each candidate region; selecting the N_hard candidate regions with the largest loss values to establish a hard sample set, and fine-tuning the region classification network again on this set.
7. The method for airport detection based on difficult sample mining of claim 1, wherein the step of fine tuning of the end-to-end deep convolutional network based on difficult sample mining further comprises:
and cascading the area extraction network and the area classification network to obtain the end-to-end convolution neural network.
8. The airport detection method based on difficult sample mining of claim 1, wherein the airport detection step of the end-to-end deep convolutional neural network specifically comprises:
inputting the optical remote sensing image into the end-to-end deep convolutional neural network, the network outputting the coordinates of the upper-left and lower-right corner points of the minimum enclosing rectangle of the region where an airport is located, and estimating the probability that the region is an airport region.
CN201710314261.4A 2017-05-06 2017-05-06 Airport detection method based on difficult sample mining Active CN107665351B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710314261.4A CN107665351B (en) 2017-05-06 2017-05-06 Airport detection method based on difficult sample mining


Publications (2)

Publication Number Publication Date
CN107665351A CN107665351A (en) 2018-02-06
CN107665351B true CN107665351B (en) 2022-07-26

Family

ID=61122311

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710314261.4A Active CN107665351B (en) 2017-05-06 2017-05-06 Airport detection method based on difficult sample mining

Country Status (1)

Country Link
CN (1) CN107665351B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108416287B (en) * 2018-03-04 2022-04-01 南京理工大学 Pedestrian detection method based on missing negative sample mining
CN108549857B (en) * 2018-03-30 2021-04-23 国信优易数据股份有限公司 Event detection model training method and device and event detection method
CN108960314B (en) * 2018-06-26 2022-02-11 北京旷视科技有限公司 Training method and device based on difficult samples and electronic equipment
CN109117786B (en) * 2018-08-09 2021-11-30 百度在线网络技术(北京)有限公司 Data processing method and device based on neural network model and readable storage medium
CN109410220B (en) * 2018-10-16 2019-12-24 腾讯科技(深圳)有限公司 Image segmentation method and device, computer equipment and storage medium
CN109753910B (en) * 2018-12-27 2020-02-21 北京字节跳动网络技术有限公司 Key point extraction method, model training method, device, medium and equipment
CN110414587A (en) * 2019-07-23 2019-11-05 南京邮电大学 Depth convolutional neural networks training method and system based on progressive learning
CN110659680B (en) * 2019-09-16 2022-02-11 西安电子科技大学 Image patch matching method based on multi-scale convolution
CN111091164A (en) * 2020-03-25 2020-05-01 魔视智能科技(上海)有限公司 Semantic segmentation model training method and system based on oem
CN111738081A (en) * 2020-05-20 2020-10-02 杭州电子科技大学 Deep neural network sonar target detection method difficult for sample retraining
CN112215252B (en) * 2020-08-12 2023-05-30 南强智视(厦门)科技有限公司 Weak supervision target detection method based on-line difficult sample mining
CN112733860B (en) * 2021-01-27 2021-09-10 上海微亿智造科技有限公司 Method and system for mining difficult samples of two-classification segmentation network
CN113160175B (en) * 2021-04-23 2022-06-14 杭州迪英加科技有限公司 Tumor lymphatic vessel infiltration detection method based on cascade network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105740894A (en) * 2016-01-28 2016-07-06 北京航空航天大学 Semantic annotation method for hyperspectral remote sensing image
CN106022232A (en) * 2016-05-12 2016-10-12 成都新舟锐视科技有限公司 License plate detection method based on deep learning
CN106250812A (en) * 2016-07-15 2016-12-21 汤平 A kind of model recognizing method based on quick R CNN deep neural network
CN106570477A (en) * 2016-10-28 2017-04-19 中国科学院自动化研究所 Vehicle model recognition model construction method based on depth learning and vehicle model recognition method based on depth learning


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Ship detection in SAR images based on an improved faster R-CNN";Jianwei Li et al;《2017 SAR in Big Data Era: Models, Methods and Applications (BIGSARDATA)》;20171114;正文第1-4页 *


Similar Documents

Publication Publication Date Title
CN107665351B (en) Airport detection method based on difficult sample mining
CN107665498B (en) Full convolution network aircraft detection method based on typical example mining
EP2528017B1 (en) Image identification device, image identification method, image identification program, and recording medium
CN107833213B (en) Weak supervision object detection method based on false-true value self-adaptive method
WO2017190656A1 (en) Pedestrian re-recognition method and device
CN110580699A (en) Pathological image cell nucleus detection method based on improved fast RCNN algorithm
CN104978567B (en) Vehicle checking method based on scene classification
CN106372624B (en) Face recognition method and system
US20150332117A1 (en) Composition modeling for photo retrieval through geometric image segmentation
US20140270479A1 (en) Systems and methods for parameter estimation of images
CN114332650B (en) Remote sensing image road identification method and system
CN108154151B (en) Rapid multi-direction text line detection method
CN113095263B (en) Training method and device for pedestrian re-recognition model under shielding and pedestrian re-recognition method and device under shielding
CN103295013A (en) Pared area based single-image shadow detection method
CN112132004A (en) Fine-grained image identification method based on multi-view feature fusion
CN110909724B (en) Thumbnail generation method of multi-target image
CN110751619A (en) Insulator defect detection method
CN111882586A (en) Multi-actor target tracking method oriented to theater environment
Zhu et al. Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification
CN111160107B (en) Dynamic region detection method based on feature matching
CN108288061A (en) A method of based on the quick positioning tilt texts in natural scene of MSER
Hung et al. Skyline localization for mountain images
CN110390283B (en) Cross-camera pedestrian re-retrieval method in commercial scene
Wu et al. Vehicle detection in high-resolution images using superpixel segmentation and CNN iteration strategy
CN114267052A (en) Pedestrian detection method based on yolov5

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant